BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780837|ref|YP_003065250.1| putative restriction endonuclease S subunit [Candidatus Liberibacter asiaticus str. psy62] (426 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780837|ref|YP_003065250.1| putative restriction endonuclease S subunit [Candidatus Liberibacter asiaticus str. psy62] gi|254040514|gb|ACT57310.1| putative restriction endonuclease S subunit [Candidatus Liberibacter asiaticus str. psy62] Length = 426 Score = 341 bits (874), Expect = 2e-91, Method: Composition-based stats. Identities = 426/426 (100%), Positives = 426/426 (100%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT Sbjct: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL Sbjct: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR Sbjct: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL Sbjct: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN Sbjct: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE Sbjct: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID Sbjct: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 Query: 421 LRGESQ 426 LRGESQ Sbjct: 421 LRGESQ 426 >gi|152973654|ref|YP_001338694.1| putative restriction endonuclease S subunit [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] gi|294496729|ref|YP_003560422.1| putative restriction endonuclease S subunit [Klebsiella pneumoniae] gi|150958436|gb|ABR80464.1| putative restriction endonuclease S subunit [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] gi|293339438|gb|ADE43992.1| putative restriction endonuclease S subunit [Klebsiella pneumoniae] Length = 438 Score = 286 bits (731), Expect = 5e-75, Method: Composition-based stats. Identities = 238/436 (54%), Positives = 283/436 (64%), Gaps = 13/436 (2%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDV 56 M YKAY YKDSGV+WIG +P+HW+V ++ + + + + + DV Sbjct: 1 MSQYKAYTSYKDSGVEWIGQVPEHWEVKRLRHVGRYSNSGVDKKSYEDQQTVELCNYTDV 60 Query: 57 ESGTG--KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII-----ADFDGICST 109 +P + + KG ++ K I D G+ Sbjct: 61 YYNEFISDDMPFMQATASAHEIEQFTLKKGDVIITKDSEDPSDIGIPAFVPHDMPGVVCG 120 Query: 110 QFLVLQPKDVLPELLQGWL--LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 L + S G T + IGN P+ +PP EQ Sbjct: 121 YHLTMIRALNDNYGSYIHRSIQSDHTRAHFFVESPGITRYGLNQNTIGNAPVALPPPEEQ 180 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + ET RID L+ ++IRFIELLKEK+QAL+++ VTKGL+P+VKMKDSG+EW+G Sbjct: 181 ATIAATLDRETARIDALVEKKIRFIELLKEKRQALITHAVTKGLDPNVKMKDSGVEWIGQ 240 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 VP+HWEVKPFFALV+ELNRKN L E+NILSLSYGNIIQK ETRNMGL PESYETYQIV+ Sbjct: 241 VPEHWEVKPFFALVSELNRKNVGLAETNILSLSYGNIIQKPETRNMGLTPESYETYQIVE 300 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 GE+VFRF DLQNDKRSLRSAQV +RGIITSAYMAVKPH I STY AWLMRSYDLCKVFY Sbjct: 301 SGEVVFRFTDLQNDKRSLRSAQVTQRGIITSAYMAVKPHSIGSTYFAWLMRSYDLCKVFY 360 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 AMG GLRQSLKFEDV+RLPVL+PP+ EQ +ITN IN TARID LVEK EQSI LLKERR Sbjct: 361 AMGGGLRQSLKFEDVRRLPVLIPPVGEQSEITNTINAGTARIDALVEKTEQSITLLKERR 420 Query: 408 SSFIAAAVTGQIDLRG 423 ++FI AAVTGQIDLRG Sbjct: 421 AAFITAAVTGQIDLRG 436 >gi|282901858|ref|ZP_06309764.1| Restriction modification system DNA specificity domain protein [Cylindrospermopsis raciborskii CS-505] gi|281193254|gb|EFA68245.1| Restriction modification system DNA specificity domain protein [Cylindrospermopsis raciborskii CS-505] Length = 445 Score = 285 bits (730), Expect = 6e-75, Method: Composition-based stats. Identities = 120/433 (27%), Positives = 202/433 (46%), Gaps = 15/433 (3%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSESG------KDIIYIGLE 54 K +K YP YKDSGV+W+G IP+HW+V + K+ +G T + +I ++ Sbjct: 14 KGWKRYPAYKDSGVEWLGKIPEHWEVRKVSHAFQKIGSGTTPSTNHYDYYEGNIPWVNTS 73 Query: 55 DVESGTGKYLPKDGNSRQS-DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV 113 ++ ++ D S ++++ G +L G + + I + Sbjct: 74 ELREKVITDTSAKLTNKALLDHSVLNLYPPGTLLIAMYGATIGRLGILGITACTNQACCA 133 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 L + + L + + + G + + + I +I +P PPL EQ I + Sbjct: 134 LANPISINAKFAFYWLWMR-RNELILLSSGGGQPNINQEKIRSIRIPAPPLTEQQAIAQF 192 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + ET +IDTL+ ++ R IELLKEK+ AL+S+ VTKGLNPD MKDSG+EW+G VP +W Sbjct: 193 LDRETAKIDTLVAKKERLIELLKEKRTALISHAVTKGLNPDAPMKDSGVEWLGEVPRNWP 252 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + + + K T+ ++ + T + G+++F Sbjct: 253 MIRLKHVAPVSSAKLTQKPDNLPYIGLEHIESKTGRLLLDTPVENVESTVSCFEKGDVLF 312 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSG 352 + K L + G+ T+ +A+KP + +L + + + + G Sbjct: 313 GKLRPYLAKVLLAEFE----GVSTTELLALKPSQDVNGKFLFFQLIAEGFIDQVNSFTYG 368 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + E + L + +PP+ EQ I ++ ETA+ID LV K SI LKE R++ I Sbjct: 369 TKMPRVGPEQITNLFIPLPPLPEQQAIAQFLDRETAKIDTLVAKTRTSIEKLKEYRTALI 428 Query: 412 AAAVTGQIDLRGE 424 +AAVTG+ID+R E Sbjct: 429 SAAVTGKIDVREE 441 Score = 143 bits (359), Expect = 8e-32, Method: Composition-based stats. Identities = 48/220 (21%), Positives = 86/220 (39%), Gaps = 14/220 (6%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + KG KDSG+EW+G +P+HWEV+ ++ T + Sbjct: 12 IVKGWKRYPAYKDSGVEWLGKIPEHWEVRKVSHAFQKIGSGTTPSTNHYDYYEGNIPWVN 71 Query: 267 KLETRNMGLKPE----------SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 E R + + + PG ++ + + + Sbjct: 72 TSELREKVITDTSAKLTNKALLDHSVLNLYPPGTLLIAMYGATIGRLGI----LGITACT 127 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 A A+ + A+ ++ G + ++ E ++ + + PP+ EQ Sbjct: 128 NQACCALANPISINAKFAFYWLWMRRNELILLSSGGGQPNINQEKIRSIRIPAPPLTEQQ 187 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 I ++ ETA+ID LV K E+ I LLKE+R++ I+ AVT Sbjct: 188 AIAQFLDRETAKIDTLVAKKERLIELLKEKRTALISHAVT 227 >gi|145629009|ref|ZP_01784808.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae 22.1-21] gi|145639608|ref|ZP_01795212.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae PittII] gi|144978512|gb|EDJ88235.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae 22.1-21] gi|145271399|gb|EDK11312.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae PittII] gi|162949226|gb|ABY21299.1| probable type I restriction-modification system specificity protein [Haemophilus influenzae] gi|309750476|gb|ADO80460.1| Probable type I restriction modification system, specificity component HsdS2 [Haemophilus influenzae R2866] Length = 433 Score = 275 bits (702), Expect = 1e-71, Method: Composition-based stats. Identities = 141/433 (32%), Positives = 225/433 (51%), Gaps = 17/433 (3%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKY 63 + Y +YKDSGV+W+G IP HW+ VP++ K + + +I+ + + + + Sbjct: 2 RRYERYKDSGVEWLGEIPTHWECVPLRSIFKFRNEKNNPIKTDNILSLSIANGVTEYSD- 60 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL--QPKDVLP 121 + GN R+ D S+ + I+ + + ++ + G S + L + Sbjct: 61 ENRGGNKRKDDLSSYKLAYPNDIVLNSMNVIVGAVGVSKYFGAISPVYYALSLHNQRANL 120 Query: 122 ELLQGWLLSIDVTQRIEAICEG------------ATMSHADWKGIGNIPMPIPPLAEQVL 169 + + + + + +G + + PI PL EQ Sbjct: 121 SYYESIFKNENFQRGLLRFGKGILIKFGENGKMNTIRMKISQDDLKKLYFPISPLDEQQK 180 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + + +T +ID + + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP Sbjct: 181 IAQFLDDKTAKIDRAVELAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVP 240 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 +HWE+ + E R N + E+ +LSLSYG II K E + GL PES+ETYQIV+P Sbjct: 241 EHWELTIGMNVFRENKRDNKGMKENTVLSLSYGKIIIKPEEKLFGLVPESFETYQIVEPN 300 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYA 348 +I+ R DLQND+ SLR+ ++GIITSAY+ + + +L + + + D+ KV Y Sbjct: 301 DIIIRCTDLQNDQTSLRTGLAQDKGIITSAYLNLKVINNYSAKFLHYYLHALDITKVLYK 360 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 GSGLRQ+L F D KRLP++ + EQ I + ++ +T++ID + I LKE +S Sbjct: 361 FGSGLRQNLSFLDFKRLPIIDISLAEQQQIADYLDKQTSKIDQAIALKTAHIEKLKEYKS 420 Query: 409 SFIAAAVTGQIDL 421 I VTG++ + Sbjct: 421 VLINDVVTGKVRV 433 >gi|145631519|ref|ZP_01787287.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae R3021] gi|144982864|gb|EDJ90381.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae R3021] Length = 433 Score = 273 bits (698), Expect = 3e-71, Method: Composition-based stats. Identities = 142/433 (32%), Positives = 224/433 (51%), Gaps = 17/433 (3%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKY 63 + Y +YKDSGV+W+G IP HW+ +PI+ K + +I+ + + + + Sbjct: 2 RRYERYKDSGVEWLGEIPSHWECLPIRSIFKFRNEKNDPIKTDNILSLSIANGVTEYSD- 60 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL--QPKDVLP 121 + GN R+ D S+ + I+ + + ++ + G S + L + Sbjct: 61 ENRGGNKRKDDLSSYKLAYPNDIVLNSMNVIVGAVGVSKYFGAISPVYYALSLHNQRANL 120 Query: 122 ELLQGWLLSIDVTQRIEAICEG------------ATMSHADWKGIGNIPMPIPPLAEQVL 169 + + + + + +G + + PI PL EQ Sbjct: 121 SYYESIFKNENFQRGLLRFGKGILIKFGENGKMNTIRMKISQDDLKKLYFPISPLDEQQK 180 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + + +T +ID + + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP Sbjct: 181 IAQFLDDKTAKIDRAVELAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVP 240 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 +HWE+ + E R N + E+ +LSLSYG II K E + GL PES+ETYQIV+P Sbjct: 241 EHWELTIGMNVFRENKRDNKGMKENTVLSLSYGKIIIKPEEKLFGLVPESFETYQIVEPN 300 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYA 348 +I+ R DLQND+ SLR+ ++GIITSAY+ + + +L + + + D+ KV Y Sbjct: 301 DIIIRCTDLQNDQTSLRTGLAQDKGIITSAYLNLKVINNYSAKFLHYYLHALDITKVLYK 360 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 GSGLRQ+L F D KRLP++ + EQ I + ++ +TA+ID + I LKE +S Sbjct: 361 FGSGLRQNLSFLDFKRLPIIDISLAEQQKIADYLDTQTAKIDRAIALKTAHIEKLKEYKS 420 Query: 409 SFIAAAVTGQIDL 421 I VTG++ + Sbjct: 421 VLINDVVTGKVRV 433 >gi|121997944|ref|YP_001002731.1| restriction modification system DNA specificity subunit [Halorhodospira halophila SL1] gi|121589349|gb|ABM61929.1| restriction modification system DNA specificity domain [Halorhodospira halophila SL1] Length = 429 Score = 262 bits (669), Expect = 8e-68, Method: Composition-based stats. Identities = 137/433 (31%), Positives = 199/433 (45%), Gaps = 28/433 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M + AYP+YKDSGV+W+G +P+HW V +KR +L +G S D S Sbjct: 1 MS-FPAYPEYKDSGVEWLGEVPEHWSVSALKRVARLESGDAISS----------DHISEE 49 Query: 61 GKYLPKDGNSRQSDTSTVSI--FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 G+Y GN + +S + F L G+ G A S +V+ P Sbjct: 50 GEYAVYGGNGIRGFSSGYTHDGFYP---LIGRQGALCGNVNYAKGRFWASEHAVVVWPGR 106 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + G LL + A + I N+ +P+PP EQ I E + ET Sbjct: 107 QIDGFWLGELLRS---MNLNQYATSAAQPGLSVETIENLYVPVPPDEEQQKIAELLDHET 163 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 RID LI E+ R IELLKEK+QA++S+ VTKGL+PDV MKDSG+EW+G VP HW+V F Sbjct: 164 ARIDALIEEQQRLIELLKEKRQAVISHAVTKGLDPDVPMKDSGVEWLGEVPAHWDVVKFV 223 Query: 239 ALVTELNRKNTKLIESNI-LSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVF 293 + E + L N I+ R M + G++++ Sbjct: 224 RCAKIAEGQVDPKQEPYRSMMLVAPNHIESGTGRLMARETAEEQGAESGKYYCYAGDVIY 283 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSG 352 I K + + Y G+ YL W + S + F Sbjct: 284 SKIRPSLRKACVAYEDCL---CSADMYPLRAQSGVYGDYLRWTILSESFSTLAFLESERV 340 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + E ++ + + +PP +EQ I+ + ETARID L+E+ E I LL+ERRS+ I+ Sbjct: 341 AMPKVNRESIEEIRIPMPPPEEQLQISRTLEKETARIDALMEEAESGIQLLQERRSALIS 400 Query: 413 AAVTGQIDLRGES 425 AAVTG+ID+R + Sbjct: 401 AAVTGKIDVRDWA 413 >gi|255320275|ref|ZP_05361460.1| restriction endonuclease S subunit [Acinetobacter radioresistens SK82] gi|255302714|gb|EET81946.1| restriction endonuclease S subunit [Acinetobacter radioresistens SK82] Length = 461 Score = 262 bits (669), Expect = 8e-68, Method: Composition-based stats. Identities = 124/449 (27%), Positives = 208/449 (46%), Gaps = 25/449 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDV 56 M Y+AY +YKDSGV+W+G +P HW + +KR+ + G S I + D+ Sbjct: 1 MAKYQAYAEYKDSGVEWLGVVPSHWIITTLKRYCYVKGGFAFSSDAFIDTGYPVIRIGDI 60 Query: 57 ESGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFL 112 ++ L +S S + K Q+L G + KA + + + + Sbjct: 61 KTDGSINLENCKYIPESLAVNSRDYLVEKNQLLMAMTGATIGKAGLYTSNQPAFLNQRVG 120 Query: 113 VLQPKDVLPELLQGWLL--SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 + W + + + I+ G + + + P IP EQ I Sbjct: 121 KFELLAQNMNYRYLWYILKTDGYQEYIKLTAFGGAQPNISDTAMVDYPATIPSFDEQTQI 180 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + ET +ID LI ++ R IELLKEK+QA++S+ VTKGLNP+V MKDSG+EW+G VP+ Sbjct: 181 ANFLDHETSKIDHLIEKQQRLIELLKEKRQAVISHAVTKGLNPNVPMKDSGVEWLGEVPE 240 Query: 231 HWEVKPFFALVTELNR--------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 HW + + R + + L LS NI + S++ Sbjct: 241 HWRISRLKYNASIFGRIGFRGYTVDDIVDEDEGALVLSPSNISNANKLTLEKKTYLSWKK 300 Query: 283 YQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 Y IVD +++ K ++ ++ E I +K I+ +L +L Sbjct: 301 YFESPEIIVDENDLLLVKTGSTFGKSAIIVNKL-EPMTINPQMALIKKSKIEPRFLGYLF 359 Query: 338 RSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + + +G ++ E++ P+ +P +E I+N ++ +T +ID L+EK Sbjct: 360 GSKLIKSIIENSNTGSGMPTMTQENINNFPIPLPSDEEAIIISNYLDNKTYKIDFLIEKS 419 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDLRGES 425 EQ+I+L++ERR++ I+AAVTG+ID+R Sbjct: 420 EQTILLMQERRTALISAAVTGKIDVRNWQ 448 >gi|113477871|ref|YP_723932.1| restriction modification system DNA specificity subunit [Trichodesmium erythraeum IMS101] gi|110168919|gb|ABG53459.1| restriction modification system DNA specificity domain [Trichodesmium erythraeum IMS101] Length = 415 Score = 262 bits (668), Expect = 1e-67, Method: Composition-based stats. Identities = 164/420 (39%), Positives = 242/420 (57%), Gaps = 15/420 (3%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 +++ YP YK SGV+W+G IP+HW++ +K + L G + +G E+ E G Sbjct: 5 NWQKYPVYKSSGVEWLGEIPEHWEMKRLKFISHLVYGDS---------LGSENREDGNIN 55 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 +G + I+ G+ G + + T +L+ Q K Sbjct: 56 VYGSNGMIGLHSKANTL---SPVIIVGRKGSFGKIQYSLFPCFCIDTAYLIDQRKTKQNL 112 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + L I ++ I + + + +P+ PL+EQ I + + +ID Sbjct: 113 KWLCYALQIL---ELDKISQDTGVPGLSREKAYQKLVPVSPLSEQQAIANFLDEKLAQID 169 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I ++ R IELLKE+K +++ VTKG+NPDV MK SGIEW+G VP+HWEV P FA+ Sbjct: 170 EYIAKKQRIIELLKEQKTVIINQAVTKGINPDVSMKYSGIEWLGEVPEHWEVLPAFAVFK 229 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 E N L+E N+LSLSYG II+K T N GL PES+ETYQIV PG I+ R DLQNDK Sbjct: 230 EQCVINRDLVEKNLLSLSYGKIIRKSFTNNFGLLPESFETYQIVTPGNIILRLTDLQNDK 289 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 RSLR V E+GIITSAY+ + P + Y+ L+ YD+ K+FY+MGSG+RQ++KF+D+ Sbjct: 290 RSLRVGLVKEKGIITSAYLCLNPQNVIPEYVYTLLHIYDILKIFYSMGSGVRQNMKFKDL 349 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 KRLP+ PP+ EQ +I + I + +I+ + IE+ I L++E R++ I+ VTG+ID+R Sbjct: 350 KRLPITFPPVSEQKEIVSFIEKKLEKIERSLTVIEKEIKLIQEYRTTLISETVTGKIDVR 409 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 44/211 (20%), Positives = 80/211 (37%), Gaps = 16/211 (7%) Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +V K SG+EW+G +P+HWE+K + + + Sbjct: 1 MVNFNWQKYPVYKSSGVEWLGEIPEHWEMKRLKFISHLVYGDSLGSENRED--------- 51 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + + P IV R + SL ++ + + Sbjct: 52 GNINVYGSNGMIGLHSKANTLSPVIIVGRKGSFGKIQYSLFPCFCIDTAY----LIDQRK 107 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + +L + ++ +L K+ G L E + V V P+ EQ I N ++ + Sbjct: 108 TKQNLKWLCYALQILELDKISQDTG---VPGLSREKAYQKLVPVSPLSEQQAIANFLDEK 164 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 A+ID + K ++ I LLKE+++ I AVT Sbjct: 165 LAQIDEYIAKKQRIIELLKEQKTVIINQAVT 195 >gi|114778243|ref|ZP_01453115.1| HsdS protein [Mariprofundus ferrooxydans PV-1] gi|114551490|gb|EAU54045.1| HsdS protein [Mariprofundus ferrooxydans PV-1] Length = 462 Score = 258 bits (660), Expect = 8e-67, Method: Composition-based stats. Identities = 123/441 (27%), Positives = 208/441 (47%), Gaps = 22/441 (4%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTG 61 Y YP+YKDSGV+W+G IP HW + K ++L + K+ +I +E +++ + Sbjct: 4 KYPPYPEYKDSGVEWLGEIPAHWVLTRTKYISELTPKKPKISRDKECSFIPMEKLKTDSI 63 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQ 115 + + FA +L K+ P I + G S++ V++ Sbjct: 64 VLDEVR--TIDDVYDGYTYFADSDVLMAKVTPCFENKNIAIAQDLVNGVGFGSSEIYVIR 121 Query: 116 PKDVLPELLQGWLLSID-VTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + L D + A GA + + N +P EQ+ I Sbjct: 122 ANQRVSNRFLFYRLQEDSFMEIAIAAMTGAGGLKRVPSDVLNNYIAAVPQHDEQMEIANF 181 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + ET +IDTLI ++ + I+LLKEK+QA++S+ VTKGLNPD M++SGIEW+G VP HWE Sbjct: 182 LDRETAKIDTLIEKQQQLIKLLKEKRQAVISHAVTKGLNPDAPMRNSGIEWLGEVPAHWE 241 Query: 234 VKPFFALVTELNR------KNTKLIESNILSLSYGNIIQ-KLETRNMGLKPE---SYETY 283 + + R K + ++ + L+ NI K++ N+ + Sbjct: 242 ISSLGFECSVKARLGWKGLKAEEYVDEGYIFLATPNIKGEKIDFENVNYITKARYDESPE 301 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +++ G+++ ++ + +S + IDS+YL + S + Sbjct: 302 IMLNEGDVLVTKDGSTTGTTNIVRELPSPATVNSSIAVLRSVGRIDSSYLYYFFVSTYVQ 361 Query: 344 KVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 V + G L D+++ VL+PP KEQ +I I++ + D L+ K E SI+L Sbjct: 362 NVIKRIQGGMGVPHLFQADLRKFNVLMPPFKEQKEIAAEIDMRLPKFDDLIAKAEYSILL 421 Query: 403 LKERRSSFIAAAVTGQIDLRG 423 +KERR++ I+AAVTG+ID+R Sbjct: 422 MKERRTALISAAVTGKIDVRH 442 >gi|294054710|ref|YP_003548368.1| restriction modification system DNA specificity domain protein [Coraliomargarita akajimensis DSM 45221] gi|293614043|gb|ADE54198.1| restriction modification system DNA specificity domain protein [Coraliomargarita akajimensis DSM 45221] Length = 447 Score = 257 bits (656), Expect = 3e-66, Method: Composition-based stats. Identities = 111/438 (25%), Positives = 192/438 (43%), Gaps = 16/438 (3%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 YKAYP+Y+DSG W+G +P W V K + R+ + ++ + G Sbjct: 4 RYKAYPEYRDSGFSWMGEVPSGWSVQRGKYVFSEFSERSESGNETLLSVSEYYGVKPRGD 63 Query: 63 YL-PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVL 120 + + SR ++ + + R + +DGI S + V + + Sbjct: 64 VIADGEFLSRAESLVGYKFCKANDLVMNIMLAWKRGLGVTKYDGIVSPAYSVFRFGEYAD 123 Query: 121 PELLQGWLLSIDVTQRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 P+ + L + T + G + + G+ + +P L EQ I + ET Sbjct: 124 PDYMHYLLRTDLYTGHFKTRSTGVIDSRLRLYPESFGDTSILLPSLPEQKQIARFLDHET 183 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +ID LI ++ I LLKEK+QA++S+ VTKGLNPD KMKDSG+EW+G VP+HWEV Sbjct: 184 AKIDRLIAKQQELIALLKEKRQAVISHAVTKGLNPDAKMKDSGVEWLGQVPEHWEVTYLT 243 Query: 239 ALVTELNRKNT------KLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDP 288 +V R +++ + + G++ ES + Sbjct: 244 HIVDPSRRIMYGIVLPGPNVDNGVPIVKGGDVKPGRLRLDSLCKTTYVIESNYERSRLKT 303 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G+IV+ D + ++ + A D+ +L + M+S + Sbjct: 304 GDIVYSIRGTIGDVE-IVPEEINGANLTQDAARIAPKVPSDNRWLMYTMKSTSVFSQLEV 362 Query: 349 MG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + + D+K+ + PP E+ DI ++++ ++D L ++ I LL+ERR Sbjct: 363 GSLGAAVRGINIRDLKKAIIPYPPQSERNDIEAFLDIQLGKLDRLSVDCKRQIELLQERR 422 Query: 408 SSFIAAAVTGQIDLRGES 425 ++ I+AAVTG+ID+R Sbjct: 423 TALISAAVTGKIDVRDWE 440 >gi|292490880|ref|YP_003526319.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] gi|291579475|gb|ADE13932.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] Length = 441 Score = 257 bits (656), Expect = 3e-66, Method: Composition-based stats. Identities = 124/432 (28%), Positives = 203/432 (46%), Gaps = 17/432 (3%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 +Y YKDSGV+W+G IP HW+VVP+K L + + ++ YIG+E+VES TG+++ Sbjct: 2 PSYESYKDSGVEWLGEIPSHWQVVPLKYALSLASEKVITRQSNLKYIGMENVESFTGRFI 61 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + F G IL+GKL PYL K + + +G+CST+FLV + + + Sbjct: 62 ETASEVEGM----ANRFLAGDILFGKLRPYLSKVALTEVEGLCSTEFLVYRARQGSSKYF 117 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + S + A G+ M A IG +PIP EQ I + +T +I+ Sbjct: 118 RYLMTSSSFIDLVNASTYGSKMPRASADFIGIQRIPIPTKQEQTAIAAFLDRKTAQIEQA 177 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + + R I LLKE+KQ L+ VT+GLN D M+DSG+EW+G VP HW+ + L Sbjct: 178 VNIKERQITLLKERKQILIQNAVTRGLNSDAPMRDSGVEWIGHVPKHWKFAKLKHHIDML 237 Query: 245 ---------NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 N++ I+ E K + + + G++V Sbjct: 238 PGFAFKSSLYSSNSEDIKLLRGVNVNPGNTDWGEVVYWPKKEAADYSKYNLAKGDLVMAM 297 Query: 296 IDLQ---NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + SL + + ++ G+ + Y++ + S F M +G Sbjct: 298 DRPWISSGIRLSLIDEEDLPCLLLQRVVRIRGKSGVCTKYVSNTLSSNIFLSYFEPMLTG 357 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + + + + VPP EQ +I + I E+A++D + +EQ I LKE +++ I Sbjct: 358 ISVPHISTDQIGNFSCPVPPYDEQLEILDYIETESAKLDKGITLLEQQITKLKEYKATLI 417 Query: 412 AAAVTGQIDLRG 423 +AVTG+I + G Sbjct: 418 NSAVTGKIKVPG 429 >gi|226940441|ref|YP_002795515.1| Type I restriction-modification system, S subunit [Laribacter hongkongensis HLHK9] gi|226715368|gb|ACO74506.1| Type I restriction-modification system, S subunit [Laribacter hongkongensis HLHK9] Length = 453 Score = 257 bits (655), Expect = 3e-66, Method: Composition-based stats. Identities = 111/444 (25%), Positives = 176/444 (39%), Gaps = 25/444 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M YP YKDSGV+W+ +IP HW+VV +K ++ E G ++ I ++ Sbjct: 1 MS-LPKYPAYKDSGVEWLRSIPSHWEVVRLKNIFEIRKRIAGELGHSVLSITQRGIKVKD 59 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL---QPK 117 + D S I G + I+ G+ S + V Sbjct: 60 I---ESNDGQISMDYSKYQIVLPGDFAMNHMDLLTGYVDISSTHGVTSPDYRVFAMLDNA 116 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEG---ATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 +P + + A +G N +P PP EQ I + Sbjct: 117 HCVPRYFLHLFQNGYRQKIFYAFGQGASEFGRWRFPTDQFNNFRLPCPPDDEQAAIATFL 176 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 ET +ID LI E+ + I LL EK+QA +S+ VT+GL+P V MKDSG+EW+G VP HW + Sbjct: 177 DRETAKIDALIAEQEKLIALLAEKRQATISHAVTRGLDPAVPMKDSGVEWLGQVPAHWVI 236 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE---------TYQI 285 + + + + S +++ +PE + + Sbjct: 237 CSVRRKLKRIEQGWSPECFSRPAEAGEWGVLKAGCVNGGIFRPEENKALPDTLAPDENIL 296 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERG---IITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + G+++ + + + G ++A + L Sbjct: 297 IKDGDLLMSRASGSPALVGSVAYLSAPPAHLMLSDKIFRLHLEQGTLPQFVAIAFGARYL 356 Query: 343 CKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 SG +L +K + +PP EQ +I ETA++D L E + Sbjct: 357 RHQIEQAISGAEGLANNLPQTSLKGFTIAIPPEVEQQEIVVFTQQETAKLDALKIAAEHA 416 Query: 400 IVLLKERRSSFIAAAVTGQIDLRG 423 + LLKERR++ IAAAVTGQID+RG Sbjct: 417 VSLLKERRAALIAAAVTGQIDVRG 440 >gi|298674425|ref|YP_003726175.1| restriction modification system DNA specificity domain-containing protein [Methanohalobium evestigatum Z-7303] gi|298287413|gb|ADI73379.1| restriction modification system DNA specificity domain protein [Methanohalobium evestigatum Z-7303] Length = 461 Score = 257 bits (655), Expect = 3e-66, Method: Composition-based stats. Identities = 122/448 (27%), Positives = 207/448 (46%), Gaps = 26/448 (5%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRT----SESGKDIIYIGLEDV 56 +K YP+YKDSG++W+G IP+HW V ++R K L G T + +E + Sbjct: 16 SGFKPYPEYKDSGIEWLGEIPEHWDVKQLRRVIKSLKNGTTAPQLDSGTTNYPVTRIETI 75 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICST-QFL 112 +G Y G +++D I K IL + AI D + + L Sbjct: 76 SNGYINY-NNVGYLKENDVDKRYILNKDDILISHINSLEYIGNCAIYKDNETLVHGMNLL 134 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP---LAEQVL 169 L P D + + L + I ++ A + EQ Sbjct: 135 RLIPDDNIIPDFLIYYLKSKNFKYSARIHAKPAINQASVSSTVLKSLKFSYPSNFNEQKS 194 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + ET +ID LI ++ R +ELL+EK+ AL+++ V KGL+PDV+MKDSGIEW+G +P Sbjct: 195 IANFLDKETHKIDKLIEKKQRLVELLEEKRSALINHTVAKGLDPDVEMKDSGIEWLGEIP 254 Query: 230 DHWEVKPFFA----LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY-- 283 +HW+V VT+ ++ +++ I LS + IQ + + + YE + Sbjct: 255 EHWDVVKLKYLLRSKVTDGPHESPAFVDNGIPFLS-ADSIQNGKLKFENCRYVPYEDHIR 313 Query: 284 ----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + +++ K +L A + ++S L +++RS Sbjct: 314 YIRKCKPEKYDLLLGKAASVG-KVALVDVDFEFSIWSPLALIKPDTRELNSKLLYYVLRS 372 Query: 340 YDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + K + + +L ++++ L +++P + EQ I + ++ T++ID L+ KI Sbjct: 373 RYVQKQIDMLNHTNTQDNLGMKEIENLKIILPSVSEQKQIADYLDQRTSKIDELINKINH 432 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I LKE R++ I+AAVTG+ID+RGE Q Sbjct: 433 QIEYLKEYRTALISAAVTGKIDVRGEEQ 460 >gi|288986940|ref|YP_003456903.1| restriction modification system DNA specificity domain protein [Allochromatium vinosum DSM 180] gi|288898319|gb|ADC64153.1| restriction modification system DNA specificity domain protein [Allochromatium vinosum DSM 180] Length = 453 Score = 256 bits (654), Expect = 4e-66, Method: Composition-based stats. Identities = 120/450 (26%), Positives = 202/450 (44%), Gaps = 27/450 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK--LNT--GRTSESGKDIIYIGLEDV 56 M + Y +YKDSGV+W+G +P+HW + +K + +N G I I + D Sbjct: 1 MS-FPRYERYKDSGVEWLGEVPEHWILDRLKWSVEGCINGLWGDDPNGEDVIPCIRVADF 59 Query: 57 ESGTGKYLPKDGNSRQSDTSTV--SIFAKGQILYGKLG-----PYLRKAII-ADFDGICS 108 + + +D R G +L K G P + + + +CS Sbjct: 60 DRAKNRVRAEDLTYRSISEEKRLNRSLKNGDLLIEKSGGGDNQPVGVVVLFDHNLNAVCS 119 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 + + +L S+ ++I + + + D + IP + E Sbjct: 120 NFVARMPVRSNFSPRFLCYLHSVLYALRLNTKSIKQNTGIQNLDSASYLDERFGIPTVYE 179 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q LI + + ET +ID LI E+ R +ELLKEK+QA++S+ VTKGLNPD MKDSGIEW+G Sbjct: 180 QGLIADFLDRETAKIDALIAEQQRLVELLKEKRQAVISHAVTKGLNPDAPMKDSGIEWLG 239 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE----- 281 VP+HW + P L ++ + I++ + R L+ E Sbjct: 240 EVPEHWVIVPLKHLTAPGRDIMYGIVLPGPNVDNGVPIVKGGDVRPHRLRLELLNRTTEA 299 Query: 282 -----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + P +IV+ D L ++++ I ++S +L ++ Sbjct: 300 IEAPYARARLRPSDIVYSIRGSIGDAE-LVPDELLDANITQDVARISPDQTVNSLWLLFV 358 Query: 337 MRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 M+S + + + D+KR + P I+EQ I ++ ET ++D L + Sbjct: 359 MKSVRVFVQLEQRSLGAAVRGINIFDLKRARIPFPDIQEQKTIATFLDRETTKLDALTAE 418 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + +I LL+ERR++ I+AAVTG+ID+RG + Sbjct: 419 AQTAITLLQERRTALISAAVTGKIDVRGFA 448 >gi|308171852|ref|YP_003915182.1| type I restriction-modification system specificity subunit [Arthrobacter arilaitensis Re117] gi|307743224|emb|CBQ74047.1| type I restriction-modification system specificity subunit [Arthrobacter arilaitensis Re117] Length = 449 Score = 256 bits (653), Expect = 5e-66, Method: Composition-based stats. Identities = 110/438 (25%), Positives = 190/438 (43%), Gaps = 21/438 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M K YP+YKDSGV+W+G IP W P+ K + + + V + Sbjct: 1 MSQ-KPYPKYKDSGVEWLGEIPIDWSTFPLWNLFKRTKRLGNGKEELLSVYRDYGVVPKS 59 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV- 119 + + N D S I G ++ K+ + +++ +GI S + V + Sbjct: 60 SR--NDNFNKASEDLSKYQIVEIGDLVINKMKAWQGSVAVSEHNGIVSPAYFVFRALGKA 117 Query: 120 LPELLQGWLLSIDVTQRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + S Q +I G D +P+ P L+EQ I + E Sbjct: 118 DSRFIHFLMRSTPYFQHYASISAGVRPNQWDLDPVRHRKMPVLFPSLSEQRYIAAYLDRE 177 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 T ID I ++ I LL E++ A ++ VTKGL+P +MKDS + +GL+P W V Sbjct: 178 TAEIDAFIADQEELIALLSERRTATITQAVTKGLDPKSRMKDSNVSNLGLIPAPWAVTGL 237 Query: 238 FAL---VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK--PESYE----TYQIVDP 288 +T+ + + +S ++ LK PE+YE T Sbjct: 238 KHFTLKITDGAHISPETDGGIYDFVSTRDVSDSGINFEGSLKTSPETYEYMVRTGCRPQN 297 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVF 346 G+++F ++ ++ S+ + ++P +D +L +L RS + + Sbjct: 298 GDVLFSKDGTVGRTVVVQGN---HDFVVASSLIIIRPDLSKLDPNFLNYLCRSAFVQEQV 354 Query: 347 YAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + G L ++ R+ + PP+ EQ +I + ++ ET ID + ++I L KE Sbjct: 355 RSFVKGAGLPRLSIANLLRVTGVFPPLNEQQEIVDYLDRETTEIDAAIADAREAIALSKE 414 Query: 406 RRSSFIAAAVTGQIDLRG 423 RR++ I+AAVTG+ID+RG Sbjct: 415 RRAAVISAAVTGKIDVRG 432 >gi|268325013|emb|CBH38601.1| putative type I restriction enzyme, DNA specificity subunit [uncultured archaeon] Length = 445 Score = 255 bits (652), Expect = 7e-66, Method: Composition-based stats. Identities = 108/438 (24%), Positives = 187/438 (42%), Gaps = 23/438 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR--TSESGKDI---IYIGLEDVESG 59 K Y +YKDSG++WIG IP+HW+ PIK + G+ T + + Y+ +++ Sbjct: 3 KPYLKYKDSGIEWIGEIPEHWEAKPIKYVGDIVLGKMLTPDDKEGYFRKPYLRAQNITWE 62 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPK 117 + + +L + G R AI + + + K Sbjct: 63 KVDTEDIKEMWFSEKELSQYRLKENDLLVSEGGEVGRTAIWQNELNECYIQNSVHKITIK 122 Query: 118 DVLPELLQGWLLSIDVTQ-RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + I ++I +++H + + I P EQ I + Sbjct: 123 SKNNPHYYLYHFQIYGKTGYFDSIVNRVSIAHLTREKLKEIMFLSPTFHEQQTIANYLDR 182 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T +IDT I + + I+LLKE++ A+++ VTKGLNP+VK+KDSGIEW+G +P+HWE++ Sbjct: 183 KTHQIDTFIENKQKLIDLLKEQRAAIINQAVTKGLNPNVKLKDSGIEWLGEIPEHWELRK 242 Query: 237 FFALVTELNRKNT-------KLIESNILSLSYGNIIQKLETRNMGLKPE----SYETYQI 285 + T I + G++ + + E Y T +I Sbjct: 243 VGRSFNLIGSGTTPKSENIGYYENGTINWVITGDLNDGILDKTSKKITEKALDEYSTLKI 302 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 G ++ K SL + + G + A A+ S ++ + + Sbjct: 303 YPVGTLLIAMYGATIGKISLMNFE----GCVNQACCALSNSPYLSNEFSFYWFLANKQNI 358 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G + ++ E V+ L + PP EQ I ++ +T RID L+E+ + I LKE Sbjct: 359 INMSFGGGQPNISQEVVRSLKIPTPPSSEQQAIIYHLDEQTTRIDKLMERQGRQIEHLKE 418 Query: 406 RRSSFIAAAVTGQIDLRG 423 R++ I+ VTG+ID+R Sbjct: 419 YRTTLISEVVTGKIDVRD 436 >gi|78356903|ref|YP_388352.1| type I restriction enzyme, S subunit [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219308|gb|ABB38657.1| type I restriction enzyme, S subunit [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 474 Score = 255 bits (652), Expect = 8e-66, Method: Composition-based stats. Identities = 93/441 (21%), Positives = 174/441 (39%), Gaps = 20/441 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M YP+YKD+GV W+G+IP HW K + K R+ +++ + + + T Sbjct: 1 MMKLAPYPEYKDAGVSWVGSIPAHWPEKRAKYYFKEIDDRSQTGDEEM--LSVSHITGVT 58 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + + G ++ + ++ +++ GI S + V +P+ Sbjct: 59 PRSQKNVTMFKAESNVGQKRCQPGDLIINTMWAWMSALGVSNHAGIVSPAYGVYRPRSNQ 118 Query: 121 PELLQGW-----LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + + ++ ++P+ PP EQ I + Sbjct: 119 DYDYYYLDSLLRIEGYRSEYICRSTGIRSSRLRLYPDKFLSMPVVCPPQEEQQTIARFLK 178 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 A+ I + RFIELLKE+KQ +++ VT+GL+P V+ K SG+EW+G +P+HW+ + Sbjct: 179 AQDRLFRKFIRNKRRFIELLKEQKQNVINQAVTRGLDPKVQFKPSGVEWIGDIPEHWDAR 238 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--------SYETYQIVD 287 L K + + + N + + + + + Sbjct: 239 RLRTLAAVRASGVDKNTNEDEVPVMLCNYVDVYKNDRITAAIDFMKATATPEEIRAFELK 298 Query: 288 PGEIVFRFIDLQNDKRSLRSA----QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 G+++ D ++ + A + I+ +L S + Sbjct: 299 AGDVIITKDSESWDDIAIPTFVPETIPGVVCAYHLALIRPFSGEIEGEFLFRAFSSDPVA 358 Query: 344 KVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +G R L +K +PP++EQ I IN + A I + + E+ I L Sbjct: 359 DQFRIAATGVTRFGLAQGAIKGAFFPLPPLEEQRAIIAHINEKCAEISQAISRAEREIEL 418 Query: 403 LKERRSSFIAAAVTGQIDLRG 423 ++E R+ I+ VTGQ+D+RG Sbjct: 419 MREYRTRLISDVVTGQVDVRG 439 >gi|218248669|ref|YP_002374040.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 8801] gi|218169147|gb|ACK67884.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8801] Length = 453 Score = 255 bits (652), Expect = 8e-66, Method: Composition-based stats. Identities = 126/443 (28%), Positives = 206/443 (46%), Gaps = 21/443 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGL 53 +K +K YP YK SGV ++G IP W+V +K K+ +G+T + S II++ Sbjct: 6 LKQWKPYPHYKPSGVDFLGDIPDGWEVKRLKWIVSKIGSGKTPKGGAEIYSDSGIIFLRS 65 Query: 54 EDVESGTGKYLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFDG---ICST 109 +++ + ++ D + + S IL G L + +I D + Sbjct: 66 QNIHFDGLRLDDVVYINKDIDKAMSSSRVKPLDILLNITGASLGRCMIIPKDFPSSNVNQ 125 Query: 110 QFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 +L+P + P L + S + +I + G + + GN+ P L EQ Sbjct: 126 HVCILRPIVTRINPYFLNRVMSSNAIQNQIFSSEVGVSREGLTFAQAGNLISVFPSLPEQ 185 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + + ET +ID LIT + R IELLKEK+ AL+S+ VTKGLNPDV MKDSG+EW+G Sbjct: 186 EKIAQFLDEETAKIDKLITHKQRLIELLKEKRTALISHAVTKGLNPDVPMKDSGVEWLGF 245 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI----IQKLETRNMGLKPESYETY 283 +P+HWEVK L + + I+ I G I + N L + Sbjct: 246 IPEHWEVKKIKRLSLVKRGASPRPIDDPIYFDDNGEYVWVRISDVTASNKYLLEAEQKLS 305 Query: 284 QIVDPGEIVF--RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 +I + + L + + I ++ + YL ++ + Sbjct: 306 EIGKRKSVPLQPNELFLSICASVGKPIITKIKCCIHDGFVYFPELKENREYLYYIFLGGE 365 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 L K M G + +L E + + + +PP+ EQ I ++ +T +ID +++K +SI Sbjct: 366 LYKGLGKM--GTQLNLNTEIIGDVKLPIPPVSEQQKIAEYLDEKTEQIDPIIKKTRESIE 423 Query: 402 LLKERRSSFIAAAVTGQIDLRGE 424 LKE R++ I+AAVTG+ID+R Sbjct: 424 YLKEYRTALISAAVTGKIDVRQW 446 >gi|307720089|ref|YP_003891229.1| Restriction endonuclease S subunit [Sulfurimonas autotrophica DSM 16294] gi|306978182|gb|ADN08217.1| Restriction endonuclease S subunit [Sulfurimonas autotrophica DSM 16294] Length = 442 Score = 254 bits (649), Expect = 2e-65, Method: Composition-based stats. Identities = 128/446 (28%), Positives = 197/446 (44%), Gaps = 32/446 (7%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGT 60 YK YP YKDSG+ W+G +P W V +K + + + KDI+ + + G Sbjct: 3 SKYKPYPSYKDSGIAWLGEVPIGWDVRRLKTILQERREKNSPVKTKDILSL---CMYRGV 59 Query: 61 GKYLPK--DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 Y K GN + D + + I+ + ++ + G S + +L P+ Sbjct: 60 IPYSEKGNSGNKAKDDLTAYKLAYPNDIVLNSMNVVAGSVGLSKYFGAVSPVYYMLYPRK 119 Query: 119 VLPELLQG--WLLSIDVTQRIEAICEG-------------ATMSHADWKGIGNIPMPIPP 163 ++ S + + + G + ++ MPIPP Sbjct: 120 STDDISYFNAIFQSESFQKSLIGLGNGILVKQSEKTGKLNTIRMKISMDSLNDVLMPIPP 179 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE 223 EQ I + T +IDTLI ++ + I LLKEK+QA++S VT+GL+ V MKDSG+E Sbjct: 180 FQEQQTIANYLDNATAKIDTLIEKQTKLIALLKEKRQAVISTAVTRGLDSSVPMKDSGVE 239 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 W+G +P+HWEVK F L R + KL +ILS++ I K G Y Y Sbjct: 240 WLGEIPEHWEVKKFKYLFEIRKRISGKL-GYDILSITQKGIKVKDIESGKGQLSSDYSKY 298 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMRSYD 341 Q V G+ +DL L G+ + Y D+ Y +L++ Sbjct: 299 QHVYKGDYAMNHMDLLTGFVDLSKYD----GVTSPDYRVFSIIEKNADANYYLFLLQMGY 354 Query: 342 LCKVFYAMGSG----LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + K+FY +G G R L + K PP +EQ I I+ + L K Sbjct: 355 INKIFYPLGQGSSQFGRWRLPSDAFKEFQAPFPPQEEQKKIAKYIDDSLTKFTKLTTKAT 414 Query: 398 QSIVLLKERRSSFIAAAVTGQIDLRG 423 ++I LLKERR++ I+A VTG+ID+R Sbjct: 415 KAIELLKERRTALISAIVTGKIDVRE 440 >gi|299531531|ref|ZP_07044937.1| type I restriction-modification system, S subunit [Comamonas testosteroni S44] gi|298720494|gb|EFI61445.1| type I restriction-modification system, S subunit [Comamonas testosteroni S44] Length = 460 Score = 253 bits (647), Expect = 3e-65, Method: Composition-based stats. Identities = 115/444 (25%), Positives = 195/444 (43%), Gaps = 19/444 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR------TSESGKDIIYIGLE 54 M + YP YKDSGV+W+G +P HW V P+KR + G+ + + + Y+ Sbjct: 1 MS-FPRYPAYKDSGVEWLGEVPAHWIVAPLKRGFSVTLGKMLQSDSSGPEDELLPYLRAA 59 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL 114 +++ G +L + G R + AD CS Q V Sbjct: 60 NIQWTGIDASDIKQMWLSPRDRVQLALQLGDLLVSEGGDVGRSCLWADEIANCSFQNSVN 119 Query: 115 QPKDVL---PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 + + L W+ +I ++ +C +T++H + + +P+P P EQ I Sbjct: 120 RVRATHGGSTRFLYYWMSTIKDKGYVDVLCNKSTIAHFTAEKVAAVPVPFPLPPEQTAIV 179 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + ET +ID L+ E+ + I LL+EK+QA++S+ VTKGLNP+ MKDSG+EW+ VP H Sbjct: 180 RFLDHETAKIDALVAEQEKLIALLQEKRQAVISHAVTKGLNPNAPMKDSGVEWLREVPVH 239 Query: 232 WEVKPFFALVT--ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD-- 287 WEV T + + +E I S + + + + +++ Sbjct: 240 WEVTALKRHWTATDCKHVTAEFVEDGIPLASIREVQSRWVELGEAKRTTEHFYQLLIEGG 299 Query: 288 ----PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 PG+++F + + + +L +RS + Sbjct: 300 RDPRPGDLIFSRNATVGEVAQVHQDHQPFAMGQDVVLLRRITEATSPDFLQLAIRSSVVM 359 Query: 344 KVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 M + + E+++ L + PP EQ I N + + D L+ + +I L Sbjct: 360 LQLSLCMVGSTFKRINVEEIRSLVLAFPPPDEQIKIANHLLAQAESFDSLMTEARTAIAL 419 Query: 403 LKERRSSFIAAAVTGQIDLRGESQ 426 L+ERR++ I+AAVTGQID+RG +Q Sbjct: 420 LQERRTALISAAVTGQIDVRGWAQ 443 >gi|73668548|ref|YP_304563.1| hypothetical protein Mbar_A1015 [Methanosarcina barkeri str. Fusaro] gi|72395710|gb|AAZ69983.1| hypothetical protein Mbar_A1015 [Methanosarcina barkeri str. Fusaro] Length = 477 Score = 253 bits (646), Expect = 3e-65, Method: Composition-based stats. Identities = 118/449 (26%), Positives = 201/449 (44%), Gaps = 25/449 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYI-GL 53 ++ +K YP+YKDSGV+WIG IPK W+V IK T + ++ E + ++ Sbjct: 27 IREWKRYPEYKDSGVEWIGEIPKEWEVKKIKHTTYVKGRIGWQGLKSDEFIDEGPFLVTG 86 Query: 54 EDVESGTGKYLPKDG-NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQ 110 D +G+ + N + + + +L K G + A++ ++ Sbjct: 87 TDFINGSVNWGSCYHVNEERYNEDPYIQLKEKDLLITKDGTIGKVALVTRLKTKATLNSG 146 Query: 111 FLVLQP--KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 + +P + + L S I G+T+ H P+P L+EQ Sbjct: 147 IFLTRPLTGEYYTNFMYWLLNSEVFETFFNYISNGSTIQHLYQNVFVIFSFPLPSLSEQQ 206 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + ET +I+TLI ++ R IELL+EK+ AL+S+ VTKGL+P K K+SG+EWVG + Sbjct: 207 SIVSFLDRETSKIETLIEKKQRLIELLEEKRSALISHAVTKGLDPYAKKKNSGVEWVGEI 266 Query: 229 PDHWEVKPFFA------LVTELNRKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYE 281 P+ W + + +S IL L N L ++ ES + Sbjct: 267 PEGWFLSKLKYLTSKIGSGKTPRGGSEIYCDSGILFLRSQNVHFDGLRLDDVVYIDESID 326 Query: 282 TYQ---IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLM 337 + V P +++ + S+ + + + IDS L + + Sbjct: 327 SEMSSTRVLPDDVLLNITGASIGRSSIVPKDFPQANVNQHVCIIRPLKKKIDSRLLHYEL 386 Query: 338 RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEK 395 S + + ++ +G R+ L F + + +P + EQ I N ++ +T +ID + K Sbjct: 387 SSNGVQALIFSNENGTSREGLTFSQISNFVIAIPNNLDEQRHIANFLDHKTEKIDTFINK 446 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 I I LKE R++ I+AAVTG+ID+R E Sbjct: 447 ISAQIEKLKEYRTALISAAVTGKIDVREE 475 >gi|284041088|ref|YP_003391018.1| restriction modification system DNA specificity domain protein [Spirosoma linguale DSM 74] gi|283820381|gb|ADB42219.1| restriction modification system DNA specificity domain protein [Spirosoma linguale DSM 74] Length = 441 Score = 253 bits (645), Expect = 5e-65, Method: Composition-based stats. Identities = 124/433 (28%), Positives = 211/433 (48%), Gaps = 21/433 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGK 62 YPQYKDSG++WIG IP HW+V IK K+N E I Y+ + V G Sbjct: 13 RYPQYKDSGLEWIGEIPAHWEVGRIKYVCKINQRSLPESTAKSFPIHYVDIGSVTLEEGI 72 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDV 119 ++ + + + I G + + YL+ D I ST F VL P + Sbjct: 73 VQTEEFEFKNAPSRARRIANAGDTIISTVRTYLKAIAFVDEQQSQFIYSTGFAVLNPLPL 132 Query: 120 L-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + P+ L + S T+++ A +G + + +G + + PPL+EQ I E + +T Sbjct: 133 IMPKFLAMAVKSDSFTEQVSANSKGMSYPAINSTELGCLAICFPPLSEQTRIAEFLDRKT 192 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE-----WVGLVPDHWE 233 +ID I ++ + IELL E++Q ++ VT+GLNP+ MKDSGI+ W+G +P HWE Sbjct: 193 AQIDQAIAQKEQLIELLNERRQVMIHRAVTRGLNPNAPMKDSGIDRGDARWIGEIPAHWE 252 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGEIV 292 V L TE + + I+S++ G ++ +T E + Y+ G+I Sbjct: 253 VSRINWLFTEKDETGYPDLPLLIVSINSGVTVRDMDDTEIRKQVAEDFNVYKRALAGDIA 312 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGS 351 F + + + + G+++ Y+ +P+ +S Y +L ++ + F Sbjct: 313 FNKMRMWQGAVGVVP----QDGLVSPDYVVARPNNFVNSAYYGFLFKTREYLAEFVKHSH 368 Query: 352 GL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G+ R L +ED K + +VPP++EQ I + +N + + KI++ I L+E +S Sbjct: 369 GIAWDRNRLYWEDFKSIFAMVPPLEEQNQIVDFLNAQNEEMSFASTKIQKQIQKLQELKS 428 Query: 409 SFIAAAVTGQIDL 421 + I +AVTG+I + Sbjct: 429 TLINSAVTGKIKV 441 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 50/211 (23%), Positives = 87/211 (41%), Gaps = 6/211 (2%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR--KNTKLIESNILSLSYGNIIQK-- 267 N + KDSG+EW+G +P HWEV + R + I + G++ + Sbjct: 12 NRYPQYKDSGLEWIGEIPAHWEVGRIKYVCKINQRSLPESTAKSFPIHYVDIGSVTLEEG 71 Query: 268 -LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 ++T K +I + G+ + + + Q + T + Sbjct: 72 IVQTEEFEFKNAPSRARRIANAGDTIISTVRTYLKAIAFVDEQQSQFIYSTGFAVLNPLP 131 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I +LA ++S + A G+ ++ ++ L + PP+ EQ I ++ + Sbjct: 132 LIMPKFLAMAVKSDSFTEQVSANSKGMSYPAINSTELGCLAICFPPLSEQTRIAEFLDRK 191 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 TA+ID + + EQ I LL ERR I AVT Sbjct: 192 TAQIDQAIAQKEQLIELLNERRQVMIHRAVT 222 >gi|290512141|ref|ZP_06551508.1| restriction modification system DNA specificity domain-containing protein [Klebsiella sp. 1_1_55] gi|289775136|gb|EFD83137.1| restriction modification system DNA specificity domain-containing protein [Klebsiella sp. 1_1_55] Length = 431 Score = 250 bits (638), Expect = 3e-64, Method: Composition-based stats. Identities = 110/436 (25%), Positives = 184/436 (42%), Gaps = 33/436 (7%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M YKAYP+YKDSGV+W+G +P+ W V +K + G+ +S V++ Sbjct: 1 MAKYKAYPEYKDSGVEWLGLVPESWTVCRLKNLATIKNGQDYKS-----------VQTDD 49 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 G P G+ Q ++ ++ K +L G+ G + I + T + + Sbjct: 50 G--YPVMGSGGQFTFASKFMYDKPSVLLGRKGTIDKPLYINEPFWTVDTMYYTELNEGFD 107 Query: 121 PELLQGWLLSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + L L+I T H +E+ I E + ET Sbjct: 108 AKYLHYLALTIQFSRYSTNTALPSMTQEHLSNYKF----SVPKAESERKKITEFLDLETA 163 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +ID LI ++ + IELLKEK+QA++S+ VTKGLNPDV MKDSG+EW+G VP+HWEV F Sbjct: 164 KIDNLIEKQQQLIELLKEKRQAVISHAVTKGLNPDVPMKDSGVEWLGEVPEHWEVSKFGY 223 Query: 240 LVTELNRKNTK-----------LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 + + + + ++ + L++ + L + E ++ Sbjct: 224 ISLVVRGGSPRPAGDPTLFNGDYSPWVTVAEITKDNEIYLDSTDTFLTKKGSEQCRVFKA 283 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G ++ + S + + ID Y + + + Sbjct: 284 GTLLLSNSGATLGVPKILSID----ANANDGVVGFELLNIDHEYAYFYLSTLTTNLRESI 339 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +L E VK + + +PP E I + I + + + LL+ERR+ Sbjct: 340 KQGSGQPNLNTEIVKSIAIPIPPENEIQRIVSFIKETIKLYSSIESGAMEQVKLLQERRT 399 Query: 409 SFIAAAVTGQIDLRGE 424 + I+AAVTG+ID+R Sbjct: 400 ALISAAVTGKIDVRDW 415 >gi|332800154|ref|YP_004461653.1| restriction modification system DNA specificity domain-containing protein [Tepidanaerobacter sp. Re1] gi|332697889|gb|AEE92346.1| restriction modification system DNA specificity domain protein [Tepidanaerobacter sp. Re1] Length = 431 Score = 250 bits (637), Expect = 4e-64, Method: Composition-based stats. Identities = 110/431 (25%), Positives = 183/431 (42%), Gaps = 9/431 (2%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M ++K Y +YKDSG++WIG IP+ WK+ +K + G++ +S + + G G Sbjct: 1 MSNFKRYDKYKDSGIEWIGEIPEGWKITKLKYICSITMGQSPKSEEYSLEEGGLPFLQGN 60 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 ++ + + IL P + I GI + K L Sbjct: 61 AEFTELYPQPKIYCDTANKFSKANDILLSVRAPVGKMNISDRVYGIGRGLCAITAQKVHL 120 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 W + + +G+T + N+ +PP EQ+ I + +T Sbjct: 121 ---KYLWYSMNVSLEELSINSQGSTFEAVTVADVDNLSAIVPPADEQISIANFLDQKTAE 177 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID LI ++ + IELL+EK+QA+++ VTKGLNP+VKMKDSGIEW+G +P+ W V Sbjct: 178 IDDLIADKEKLIELLQEKRQAVITEAVTKGLNPNVKMKDSGIEWIGEIPEGWRVSKIKYE 237 Query: 241 VTELNRK--NTKLIESNILSLSYGNIIQKLETRNM---GLKPESYETYQIVDPGEIVFRF 295 + + I + G++ E + K ++V G + Sbjct: 238 ALINKKTLSENTDDDFEIDYIDIGSVTSVGEINGIQSLSFKDAPSRARRVVSEGNTIVST 297 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR- 354 + + T + I YL +LMRS G+ Sbjct: 298 VRTYLKAIAFIENVHSNLVCSTGFAVLTPLSNIVPKYLFYLMRSEKYVNEIVRRSVGVSY 357 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 ++ D+ L ++P ++EQ +I ++ + RI+ L +IE I L+E R S I A Sbjct: 358 PAVNASDIGVLECVLPSVREQINIVEYLDKCSKRINQLTNEIELQIQKLREYRQSLIFEA 417 Query: 415 VTGQIDLRGES 425 VTG+ID+R + Sbjct: 418 VTGKIDVRDYA 428 >gi|327396330|dbj|BAK13752.1| type I site-specific restriction-modificationsystem, S subunit and related helicases [defense mechanisms] hypothetical protein [Pantoea ananatis AJ13355] Length = 451 Score = 249 bits (636), Expect = 5e-64, Method: Composition-based stats. Identities = 120/440 (27%), Positives = 201/440 (45%), Gaps = 21/440 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M YKAYP+YKDSGV+ + IPK W + +K ++ + G D++ + + ++ Sbjct: 1 MAKYKAYPEYKDSGVESLDTIPKMWSIKKLKYIFEIKKRIAGKIGFDVLSVTQKGIK--- 57 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 K + D S G+ + I+++DG+ S + V +D Sbjct: 58 IKDIESGEGQLSMDYSKYQRVYPGEFAMNHMDLLTGYVDISNYDGVTSPDYRVFAVRDKH 117 Query: 121 PELLQGWLLSIDVTQRIEAICE------GATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + +L + + + I P P L EQ+ I + Sbjct: 118 SFYSRYYLYLLQDGYKQRRFFHLGQGSAHLGRWRLPTEAFNEIVYPCPSLTEQIHIASFL 177 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE- 233 ET +ID LI ++ + IELLKEK+QA++S+ VTKGLNPDV MKDSG+EW+G VP+HW Sbjct: 178 DHETAKIDNLIEKQRQLIELLKEKRQAVISHAVTKGLNPDVPMKDSGVEWLGEVPEHWIV 237 Query: 234 --VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK------PESYETYQI 285 K + + + + K + IL ++ NI + + + + + + Sbjct: 238 SGFKKYLSSIVDYRGKTPNKTDEGILLVTARNIKKGVLDYTLSQEFIAPSDYKEVMGRGL 297 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 D G+++F + ++ + I +D+ Y + + S + Sbjct: 298 PDIGDVLFTTEAPLGEVANVDRVDIALAQRI--IKFKGMASRLDNYYFKYFIMSSAFQQS 355 Query: 346 FYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 SG Q +K E L L+PPI EQ +I ++ E +ID+LVE+ + LL+ Sbjct: 356 LNLYSSGSTAQGIKAERFVYLRKLLPPINEQMEIVGFLDKEITKIDILVEQQFVMLSLLQ 415 Query: 405 ERRSSFIAAAVTGQIDLRGE 424 ERR++ I+AAVTG+ID+R Sbjct: 416 ERRTALISAAVTGKIDVRDW 435 >gi|291287374|ref|YP_003504190.1| restriction modification system DNA specificity domain protein [Denitrovibrio acetiphilus DSM 12809] gi|291287883|ref|YP_003504699.1| restriction modification system DNA specificity domain protein [Denitrovibrio acetiphilus DSM 12809] gi|290884534|gb|ADD68234.1| restriction modification system DNA specificity domain protein [Denitrovibrio acetiphilus DSM 12809] gi|290885043|gb|ADD68743.1| restriction modification system DNA specificity domain protein [Denitrovibrio acetiphilus DSM 12809] Length = 441 Score = 248 bits (634), Expect = 8e-64, Method: Composition-based stats. Identities = 135/444 (30%), Positives = 212/444 (47%), Gaps = 27/444 (6%) Query: 4 YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK-----LNTGRTSE-------SGKDIIYI 51 YKAYP YKDSG++W+G IP+HW + K + L G S + I Sbjct: 3 YKAYPSYKDSGIEWLGEIPEHWAIERFKFQLRAGFEGLKIGPFGSQIKAELLSDEGIKVY 62 Query: 52 GLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICST 109 G E++ + + V G IL +G R + + GI + Sbjct: 63 GQENIIKNNFDLGHRFVSEELFCELEVYETLPGDILVTMMGTAGRCQVTPEKINQGIIDS 122 Query: 110 QFLVLQPKDVLPELLQGWLLSI--DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + L+ L +L++ + +I + +G+ M + I N+ +PPL EQ Sbjct: 123 HLIRLRVNKCLLSRFCKYLINDSAYIEHQIRLMGKGSIMHGLNSTIIKNLIFILPPLKEQ 182 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 +I + + +T +ID LI ++ + IE L EK+ AL+++ VTKG+NPDVKMKDSG+EW+G Sbjct: 183 SIILKYLDKKTAQIDELIDKKKKLIEKLDEKRTALITHAVTKGMNPDVKMKDSGVEWLGE 242 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 VP+HW++ + + ++ + ++LS++ I K G Y YQIV Sbjct: 243 VPEHWDIVK-AKYLFTIEKRIAGFLGHDVLSITQTGIKVKDIESGEGQLSMDYTKYQIVK 301 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKV 345 G+ +DL + G+ + Y + + Y + M+ K+ Sbjct: 302 VGDFAMNHMDLLTGYVDISQFD----GVTSPDYRVFRLSAQNCNPQYYLYHMQRGYKEKI 357 Query: 346 FYAMGSG----LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 F+ G G R L ++ K L VPP +EQ I I+ ET ID L+ K E+SI Sbjct: 358 FFNYGHGSAQLGRWRLPTDEFKELSFPVPPYEEQQAIAEYISSETILIDSLISKTEESIS 417 Query: 402 LLKERRSSFIAAAVTGQIDLRGES 425 LLKE+RS+ I AAVTG+ID+R E+ Sbjct: 418 LLKEKRSALITAAVTGKIDVREEA 441 >gi|16124874|ref|NP_419438.1| type I restriction-modification system, S subunit [Caulobacter crescentus CB15] gi|221233594|ref|YP_002516030.1| type I restriction-modification system specificity subunit [Caulobacter crescentus NA1000] gi|13421830|gb|AAK22606.1| type I restriction-modification system, S subunit [Caulobacter crescentus CB15] gi|220962766|gb|ACL94122.1| type I restriction-modification system specificity subunit [Caulobacter crescentus NA1000] Length = 450 Score = 248 bits (633), Expect = 1e-63, Method: Composition-based stats. Identities = 115/445 (25%), Positives = 182/445 (40%), Gaps = 25/445 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLE 54 M + AY YK+SGV+W+G +P HW P+K + +G T G +I + + Sbjct: 1 MS-FPAYESYKESGVEWLGRVPSHWNFRPLKHLVIMRSGGTPSKEREDYWGGEIPWASAK 59 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY---GKLGPYLRKAIIADFDGICSTQF 111 D++ T + D + ++ G + + Sbjct: 60 DLKVDTLTDTQDHLTAEALDEGAAQLLPANAVVVLVRGMMLARTFPVCRLSRPMTINQDL 119 Query: 112 L-VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 ++ + V P L L + +V G + +P P LAEQ I Sbjct: 120 KGLIANRGVDPNYLAWSLRASEVETLCRLDEAGHGTKALRMDAWSTMELPAPSLAEQQAI 179 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + ET +ID L+ + R I LLKEK+QA++S+ VTKGL+P +MKDSG+EW+G +P Sbjct: 180 AAFLDRETAKIDALVEAQERLIALLKEKRQAVISHAVTKGLDPSAQMKDSGVEWLGQMPA 239 Query: 231 HWEVKP-------FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES---Y 280 HWEV P A + +I Sbjct: 240 HWEVVPAKNLADSIKAGPFGSALTKDMYSSAGYRVYGQEQVIPGDFRIGDYYVTSDRYNE 299 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRS 339 + V+ G+++ + E GII + +P+ D TYL L+RS Sbjct: 300 LSQYRVEVGDLLVSCVGTFGKIAIFPQGA--EPGIINPRLIRFRPNNQVDPTYLCVLLRS 357 Query: 340 YDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + F + G + + + V VPP++EQ I + + D L E Sbjct: 358 AVSFEQFSYLSRGGTMDVINIGILGEIVVPVPPMQEQISIAGYLAEVQEQFDSLSAASEA 417 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRG 423 +I LL+ERR++ I+AAVTG+ID+RG Sbjct: 418 AITLLQERRAALISAAVTGKIDVRG 442 >gi|149175698|ref|ZP_01854317.1| type I restriction-modification system, S subunit [Planctomyces maris DSM 8797] gi|148845417|gb|EDL59761.1| type I restriction-modification system, S subunit [Planctomyces maris DSM 8797] Length = 450 Score = 247 bits (631), Expect = 2e-63, Method: Composition-based stats. Identities = 111/449 (24%), Positives = 198/449 (44%), Gaps = 28/449 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M + Y +YK+SG++W+G +P+HW V + D+ + + + Sbjct: 1 MS-FPKYAEYKESGIEWLGKVPEHWDVFRMGILFAEVAE---SGNDDLPVLQVSIHHGVS 56 Query: 61 GKYLPKDGN----SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + L + + +R D S ++Y + + +G+ S ++V +P Sbjct: 57 DRELSESESDRKITRIDDKSKYKRVVPNDLVYNMMRAWQGGFGTVKVEGMVSPAYVVARP 116 Query: 117 KDVLPELLQ-GWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREK 173 K + +++ G T W N+ + +P +EQ I + Sbjct: 117 KIDFQTQFIEHLFRTPQAIEQMRRYSHGVTDFRLRLYWDKFKNVRVALPDKSEQQEICDY 176 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 I ET +ID L+ E+ R IELLKEK+QA++S+ VTKGLNP+ MKDSGIEW+G VP+HWE Sbjct: 177 IDVETSKIDALVAEQRRLIELLKEKRQAVISHAVTKGLNPNAPMKDSGIEWLGDVPEHWE 236 Query: 234 VKPFFAL--VTELNRKNTKLIESNILSLSY---------GNIIQKLETRNMGLKPESYET 282 V + +R + E+++ S G + E++ + + Sbjct: 237 VCSLRRYAFFVDGDRGSEYPNENDLTSDGILFLSSKNIVGGKLDLKESKFISHEKFDALN 296 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMER-----GIITSAYMAVKPHGIDSTYLAWLM 337 G+++ + + V I + + + YL+ + Sbjct: 297 RGKAQDGDLIVKVRGSTGRIGEMALFDVGAYSFETAFINAQMMIIRTGNKLTPKYLSKVS 356 Query: 338 RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +S + G +Q L + L V +PP+ EQ +I + I+++ D L + Sbjct: 357 QSIYWMEQLSVGAYGTAQQQLSNKVFSDLFVTMPPVTEQAEIADFIDLKVGEFDSLETEA 416 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDLRGES 425 EQ+I LL+ERR++ I+AAVTG+I++R + Sbjct: 417 EQAIELLQERRTALISAAVTGKINVRDYA 445 >gi|78357910|ref|YP_389359.1| type I restriction-modification system, S subunit [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78220315|gb|ABB39664.1| type I restriction-modification system, S subunit [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 448 Score = 247 bits (631), Expect = 2e-63, Method: Composition-based stats. Identities = 132/448 (29%), Positives = 213/448 (47%), Gaps = 23/448 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR------TSESGKDIIYIGLE 54 M YKAYP YKDSGV+WIG +P+HWK+ P+K G+ S+ ++ Y + Sbjct: 1 MSQYKAYPAYKDSGVEWIGQVPEHWKIAPVKYHYDARLGKMIQPAAVSDRDIEVPYHRAQ 60 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQF 111 V+ ++G +L + G R AI+ + + I Sbjct: 61 TVQWERIVESDIKEMWASPRDIEQFSVSEGDLLICEGGDVCRAAIVKQPPEKNMIFQKSI 120 Query: 112 LVLQPKDVL-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 ++ K + + + ++ I+ +C T+ H +G++ P+PP EQ I Sbjct: 121 HRIRSKGEYGVGWVMRLMQHLRSSEWIDVLCNKNTIVHFTSDKLGSLECPLPPPDEQASI 180 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + ET RID LI ++ RFIELLKEK+QAL+++ VTKGL+P+VKMKDSG+EW+G VP+ Sbjct: 181 AAALDRETARIDALIQKKTRFIELLKEKRQALITHAVTKGLDPNVKMKDSGVEWLGEVPE 240 Query: 231 HWEVKPFFAL--------VTELNRKNTKLIESNILSLSYGNI----IQKLETRNMGLKPE 278 HW P + + ++ + I ++ GN+ ++ + + + Sbjct: 241 HWSSVPIKYMALERNSLFLDGDWIESKDISTDGIRYITTGNVGEGVYKEQGSGFISEETF 300 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 V G+++ ++ + + + + + ++ +L Sbjct: 301 HALGCTEVYGGDVLVSRLNNPIGRACMVPDLGVRVVTSVDNVIFRPDSKFNKKFIVYLFS 360 Query: 339 SYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S + K + G Q + + + V P I+EQ I ++ ETARID L+ K E Sbjct: 361 SEEYFKHTSNLARGATMQRISRGLLGNIRVATPSIEEQTQIARFLDHETARIDALIGKAE 420 Query: 398 QSIVLLKERRSSFIAAAVTGQIDLRGES 425 QSI LLKERR++FI AAVTGQIDLRGE Sbjct: 421 QSITLLKERRAAFITAAVTGQIDLRGEQ 448 >gi|237709675|ref|ZP_04540156.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|229456311|gb|EEO62032.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] Length = 428 Score = 247 bits (630), Expect = 3e-63, Method: Composition-based stats. Identities = 103/435 (23%), Positives = 176/435 (40%), Gaps = 27/435 (6%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGT 60 K Y YKDSGV+WIG IP HW+VVP+KR G T K I + ++++ Sbjct: 2 KKYDAYKDSGVKWIGEIPNHWEVVPLKRTGSFENGLTYSPNDIRDKGYIVLRSSNIQNSK 61 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQP 116 Y + KG I+ + A F++ Sbjct: 62 MNYED---TVYVESVPNDLLVKKGDIIICSRNGSASLVGKCAKFDGKIAATFGAFMMRYS 118 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + E + + + + + +T++ I + P+PPL+EQ I + A Sbjct: 119 PSINNE--YAFFSFQILMRNYKGLFTTSTINQLTKNVIAQMVCPLPPLSEQQAIASYLDA 176 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T +ID +I + + IE L E KQ+L++ VT+GLNP+ +KDSG++W+G VP+HWE Sbjct: 177 KTEKIDKMIAKAEKKIEYLGELKQSLITRAVTRGLNPNASLKDSGVKWIGKVPEHWETIK 236 Query: 237 FFALVTELNRKN-------TKLIESNILSLSYGNIIQKLET---RNMGLKPESYETYQIV 286 + + + E L G++ L T + + K + Sbjct: 237 LSRVYSYIGSGTTPLSSQEDYYSEEGYNWLQTGDLNNGLITQTSKKITKKAIDECRMKFY 296 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 +V K L + A + P + + ++ Sbjct: 297 PKHSVVIAMYGATIGKVGLLDLEST----TNQACCVISPTQKMNPLFTFYSFMAAKKELL 352 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 A G + ++ + +K+L V VPP++EQ I + E ID ++ ++ I L+E Sbjct: 353 LASFGGGQPNISQDIIKKLRVPVPPLEEQNAIILSLKKECDTIDHIIATQKKKIAYLQEL 412 Query: 407 RSSFIAAAVTGQIDL 421 + S I VTG+I + Sbjct: 413 KQSLITNVVTGKIKV 427 >gi|300113140|ref|YP_003759715.1| restriction modification system DNA specificity domain-containing protein [Nitrosococcus watsonii C-113] gi|299539077|gb|ADJ27394.1| restriction modification system DNA specificity domain protein [Nitrosococcus watsonii C-113] Length = 482 Score = 247 bits (630), Expect = 3e-63, Method: Composition-based stats. Identities = 125/444 (28%), Positives = 193/444 (43%), Gaps = 22/444 (4%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT---SESGKDIIYIGLEDVES 58 + YP YKDSGV+W+G +P+HW +K F +LN ++ + G+ +I +E +++ Sbjct: 26 SKFPRYPAYKDSGVEWLGEVPEHWTTTSLKYFAELNPKKSDYRGDQGQLCSFIPMEKLKT 85 Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFL 112 G + + + + F G +L K+ P I + G S++ Sbjct: 86 GAIQLDEVR--TIADVITGYTYFEDGDVLQAKVTPCFENGNIAIADGLTNGVGFGSSEIN 143 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIR 171 V++P + L L A GA + + I + IP EQ I Sbjct: 144 VIRPFKIDVGFLYYRLQEGVFMSICTASMIGAGGLKRVPGEVIDGFTVAIPDRNEQTQIA 203 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + ET RID LI E+ R IELLKEK+QA++S+ VTKGL+P V MKDSG+EW+G VP H Sbjct: 204 RFLDHETARIDALIAEQQRLIELLKEKRQAVISHAVTKGLDPTVPMKDSGVEWLGEVPAH 263 Query: 232 WEVKPFFALVTELNR---KNTKLIESNILSLSYGNI-IQKLETRNMGLKPESYETYQ--- 284 WEVK + K+T + L + N+ E + PES+ Sbjct: 264 WEVKKIKHYGRVIGGFAFKSTDFSDEGHLVIKISNVGHLGFEWNDASYLPESFTVRHSEF 323 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 I G ++F + + I S YL +S Sbjct: 324 IAPKGSLIFAMTRPVISGGIKIARLEKDLRPLINQRVGFISINDEALSRYLLVSSQSESF 383 Query: 343 CKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 F + ++ E ++ + + +PP +E I I + D L+ I Sbjct: 384 LSQFKNNLTITNQPNIASEGIESISIPIPPAEELRRILEYIETLIDKFDCLMLDACSGIR 443 Query: 402 LLKERRSSFIAAAVTGQIDLRGES 425 LL+ERRS+ I+AAVTG+ID+RG Sbjct: 444 LLQERRSALISAAVTGKIDVRGWQ 467 >gi|289523861|ref|ZP_06440715.1| type I restriction enzyme, S subunit [Anaerobaculum hydrogeniformans ATCC BAA-1850] gi|289502517|gb|EFD23681.1| type I restriction enzyme, S subunit [Anaerobaculum hydrogeniformans ATCC BAA-1850] Length = 489 Score = 247 bits (629), Expect = 4e-63, Method: Composition-based stats. Identities = 102/447 (22%), Positives = 179/447 (40%), Gaps = 27/447 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 + + K YP YKDSGV W+G +P+HW+V K + R+ ++++ + E Sbjct: 2 ITNLKPYPAYKDSGVPWLGHVPEHWEVRRGKTLFRCIDVRSQTGQEELLTVSSE--RGVV 59 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK--- 117 + + + ++ L + R ++ + GI S+ + V + + Sbjct: 60 PRRSANVTMFKAESYVGYKLCWPDDLVINSLWAWARGLGVSPYHGIVSSAYGVYRLRNRQ 119 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKII 175 P + + S + +G P+PP EQ I + Sbjct: 120 QDNPRFIHQLVRSTPFQWELLVRSKGIWVSRLQLTDDAFLGASFPMPPSNEQTAIVRFLD 179 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN-----PDVKMKDSGIEWVGLVPD 230 RI I + + I+LL+E KQAL+ VT ++ P KDSG+EW+G VP+ Sbjct: 180 YIDRRIWRYIRAKQKLIKLLEEYKQALIHQAVTGQIDVRTGKPYPAYKDSGVEWLGEVPE 239 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR--------NMGLKPESYET 282 HWE++ + V + L + +R M S Sbjct: 240 HWEIRRLGSSVRGCVNGVWGSEPNGKDDLPCVRVADFDRSRLRVHLDKPTMRAISSSDRV 299 Query: 283 YQIVDPGEIVFRF-IDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWL--- 336 ++++PG+++ +TS ++A P +G DS YL +L Sbjct: 300 RRLLEPGDLLLEKSGGGDLQPVGRVVLYDHPTVAVTSNFIARMPVENGYDSIYLTYLHAA 359 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + S +L +G++ +L V PP+ EQ I ++ +TA+ID + Sbjct: 360 LYSIELNVRSIKQTTGIQ-NLDSRTYLSELVAFPPLPEQTAIVEYLDTQTAKIDAAISAA 418 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDLRG 423 I LL+E R+ IA VTG++D+R Sbjct: 419 RSEIELLREYRTRLIADVVTGKVDVRE 445 >gi|71064986|ref|YP_263713.1| type I restriction-modification system S subunit [Psychrobacter arcticus 273-4] gi|71037971|gb|AAZ18279.1| possible type I restriction-modification system, S subunit [Psychrobacter arcticus 273-4] Length = 457 Score = 247 bits (629), Expect = 4e-63, Method: Composition-based stats. Identities = 105/448 (23%), Positives = 199/448 (44%), Gaps = 27/448 (6%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVES 58 Y+ Y +YKDSGV+W+G IP HW++ ++ G T + + +V S Sbjct: 7 KYQRYAEYKDSGVEWLGKIPSHWELSKLRYMFSFGRGLTITKADLLDTGVPCVNYGEVHS 66 Query: 59 GTG-KYLPKDGNSRQSDT-----STVSIFAKGQILYGKL-----GPYLRKAIIADFDGIC 107 G + PK + D S ++ +G +++ G +++D Sbjct: 67 KYGFEVDPKRHYLKCVDEGYLQSSPYALLTQGDLVFADTSEDIEGSGNFTQLVSDDLIFA 126 Query: 108 STQFLVLQPKDVLPELLQGWLLS-IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 ++ +P D +L+ ++ ++ + +G + + + + +P L E Sbjct: 127 GYHTVIARPFDRQCSRFYAYLMDSKEIRTQVRHMVKGVKVFSITQSILKGVRIWLPSLDE 186 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 + I + ET +IDTLI ++ I+LLKEK+QA++S+ VTKGLNPD +KDSG+EW+G Sbjct: 187 RETIANFLDFETAQIDTLIDKQKTLIQLLKEKRQAVISHAVTKGLNPDAPLKDSGVEWLG 246 Query: 227 LVPDHWEVKPFFALV-----TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK--PES 279 VP+HW V L+ N + ++ + +++ ++ + P+ Sbjct: 247 EVPEHWGVSKLKYLISEPLQYGANEAAEDVDKTQPRFVRITDVLPNGNLKDDTFRSLPQE 306 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++ G+++ K + + + K + + + + Sbjct: 307 IAEPYMLMDGDVLLARSGGTVGKSFIYR-DSWGKCCFAGYLIKAKIDEEITPAEWFYLNT 365 Query: 340 ---YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + + Q++ + + VPP++E + I + IN D LV K Sbjct: 366 LTDFYWKWIESIQIQATIQNVSADKYNSFVIAVPPLEESYKIISYINYNLEVFDTLVMKA 425 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDLRGE 424 EQ+I L++ERR++ I+AAVTG+ID+RG Sbjct: 426 EQAIQLMQERRTALISAAVTGKIDVRGW 453 >gi|149373160|ref|ZP_01892029.1| type I restriction-modification system, S subunit [unidentified eubacterium SCB49] gi|149354262|gb|EDM42832.1| type I restriction-modification system, S subunit [unidentified eubacterium SCB49] Length = 438 Score = 246 bits (627), Expect = 6e-63, Method: Composition-based stats. Identities = 108/436 (24%), Positives = 185/436 (42%), Gaps = 20/436 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVE 57 K Y YKDSG++WIG IP+HW V +K +K+ +G T K I ++ V Sbjct: 2 KTYETYKDSGIEWIGEIPEHWSSVSLKWISKIYSGGTPSKNKPEYWSDGTIPWLNSGTVN 61 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQ 115 G + S+ + IL G F+ C+ ++ Sbjct: 62 QGDITEPSEYITEEALANSSAKWIPEKAILIALAGQGKTKGMVAQTQFEATCNQSLGIIV 121 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 P + L + + G + + IG+IP P+P EQ I + Sbjct: 122 PSYPELNRYLLFWLRKNYQNI-RNLGGGDKRDGINLEMIGSIPTPLPTKKEQTAITNYLD 180 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 +T ID LI+E+ ++L +E+K AL++ VTKG+ PD K+K+SGIEW+G +P+ W Sbjct: 181 KKTTEIDQLISEKEELVQLYQEEKTALINQAVTKGIKPDAKLKNSGIEWLGEIPEDWNSL 240 Query: 236 PFFA---LVTELNRKNTKLIESNILSLSYGNI----IQKLETRNMGLKPESYETYQIVDP 288 + + K+T S + L NI I + + + ++ V Sbjct: 241 RLKYLGNFINGYSFKSTDFKSSGVRVLKISNIQHMAIDWSDESFIDEEFYDTKSGFRVLQ 300 Query: 289 GEIVFRFIDLQN-DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 ++VF + E+ ++ +P + ++ +++ S + F Sbjct: 301 NDLVFALTRPIISTGIKVALMNFDEKILLNQRNSIFRPKTKMTKWIYFILLSSRFVQEFD 360 Query: 348 AM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 +G + ++ D+ + + VP +EQ I I ETA+ID + K E+ I LL E Sbjct: 361 KRIDKTGQQPNISSNDIGEISIPVPTKEEQTKIVEHIEKETAKIDTKIAKAEKYINLLTE 420 Query: 406 RRSSFIAAAVTGQIDL 421 R+S I+ VTG+I + Sbjct: 421 YRTSLISEVVTGKIKV 436 >gi|148827247|ref|YP_001292000.1| type I restriction modification DNA specificity domain-containing protein [Haemophilus influenzae PittGG] gi|148718489|gb|ABQ99616.1| type I restriction modification DNA specificity domain protein [Haemophilus influenzae PittGG] Length = 424 Score = 244 bits (623), Expect = 2e-62, Method: Composition-based stats. Identities = 96/425 (22%), Positives = 187/425 (44%), Gaps = 12/425 (2%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + Y YKDSGV+W+G +P HW ++P K KL + + + L ++ + + Sbjct: 2 RRYESYKDSGVEWLGEVPSHWNLIPNKYIFKLRKNVVGKRSSEYDLLSLS-LKGVIKRDM 60 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 ++ T +G ++ R ++ + G+ + + + + +V + Sbjct: 61 ENPEGKFPAEFDTYQEVKEGDFIFCLFDVEETPRTVGLSSYHGMITGAYTIFETNNVDKK 120 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + L++D +R++ + +G + + + IPPL+EQ I + + +T +ID Sbjct: 121 FIYYFYLNLDSDKRLKPLYKGL-RNTISKETFFSFNTFIPPLSEQQKIAQFLDDKTAKID 179 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 + + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HWE+ L Sbjct: 180 QAVDLAEKQIALLKEHKQILIQNSVTRGLNPDVPLKDSGVEWIGQVPEHWEILSIKRLSQ 239 Query: 243 ELNRKNTKLIESNILSLSYGNI----IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + + I++ + G I + NM L + + + + L Sbjct: 240 VKRGASPRPIDNPKYFDNDGEYAWVRISDVTASNMYLLETTQKLSNLGKSYSVPLMPGSL 299 Query: 299 QNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + + I ++ + ++ +L ++ S G + + Sbjct: 300 FLSIAGSVGKPIITKIKVCIHDGFVYFPENKQNTKFLYYIFYSE--QPYIGLGKMGTQLN 357 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L + V + + +PP+ EQ I + ++ +TA+ID + I LKE +S I VT Sbjct: 358 LNTDTVGAIKIPIPPLCEQQKIADYLDTQTAKIDQAIALKTAHIEKLKEYKSVLINDVVT 417 Query: 417 GQIDL 421 G++ + Sbjct: 418 GKVRV 422 >gi|169634728|ref|YP_001708464.1| putative type I restriction-modification system specificity determinant for hsdM and hsdR (HsdS) [Acinetobacter baumannii SDF] gi|169153520|emb|CAP02682.1| putative type I restriction-modification system specificity determinant for hsdM and hsdR (HsdS) [Acinetobacter baumannii] Length = 433 Score = 244 bits (622), Expect = 2e-62, Method: Composition-based stats. Identities = 106/433 (24%), Positives = 183/433 (42%), Gaps = 13/433 (3%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG 61 +K YP YK+SGV+W+G +P+HW++V K E D I D + Sbjct: 1 MQFKQYPSYKNSGVEWLGDVPEHWQIVRTKDIFNHRKEEALE--DDEIVTAFRDGQVTLR 58 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 K DG + G ++ ++ + ++D G + + V K+ Sbjct: 59 KNRRTDGFTNSIKEHGYQHINSGDLVIHEMDAFAGAIGVSDSSGKSTPVYTVCYAKNENI 118 Query: 122 ELLQ--GWLLSIDVTQRIEAICEGATMS--HADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + ++ T I ++ +G + W N+ + PP A+Q I + E Sbjct: 119 NHHFYSHFFRTMAKTGFINSLAKGIRVRSTEFRWNESRNVYLVEPPKADQEKIVSFLDTE 178 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 T RID LI+++ + IELL+E++++++S+ VTKGLNP+ MKDSG+EW+G VP+HW++ Sbjct: 179 TARIDNLISKQEKLIELLEEQRKSIISHAVTKGLNPNAPMKDSGVEWLGDVPEHWDITRL 238 Query: 238 FALVTELNRKNTKLIESN------ILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGE 290 + + E L L N+ L + S + + Sbjct: 239 KNIGKSIIGLTYSPNEICDADDDSYLVLRSSNVQNGQLSFLDNVYVKSSVSEKLKIKKND 298 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I+ + D T + Y+ W++ SY + Sbjct: 299 ILICSRNGSRDLIGKNIIIKNPPKNSTFGAFMTVYRSEYADYVYWILNSYIFKAQAGSYL 358 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + L ++ + V PI EQ +I ++ E + + L+ K + I LKE R+S Sbjct: 359 TTTVNQLTINNLNNMTVPFAPISEQDEIVEFLSTENLKFNNLISKQKALIEKLKEYRASI 418 Query: 411 IAAAVTGQIDLRG 423 I+ AVTG+ID+R Sbjct: 419 ISHAVTGKIDVRE 431 >gi|300112915|ref|YP_003759490.1| type I restriction enzyme, S subunit [Nitrosococcus watsonii C-113] gi|299538852|gb|ADJ27169.1| type I restriction enzyme, S subunit [Nitrosococcus watsonii C-113] Length = 471 Score = 244 bits (622), Expect = 3e-62, Method: Composition-based stats. Identities = 107/438 (24%), Positives = 195/438 (44%), Gaps = 18/438 (4%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG 61 +YP+YKDSGV W+ IP+ W+ K R+ ++++ + T Sbjct: 1 MKLVSYPEYKDSGVPWLEKIPRRWRFFRAKNVFYPIDLRSKTGAEELLSVSERHGV--TS 58 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-- 119 + + + + G ++ L +++ + + GI ST + V +P+ Sbjct: 59 RKSVNVTMFQAASYQGYKLCWPGDLVINSLWAWMQGLGFSKYHGIISTAYGVYRPRVRRV 118 Query: 120 LPELLQGWLLSIDVTQRI---EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 +LL + + + +P+ +P EQ I + Sbjct: 119 SDFRYFDYLLRSAAYKWELRVRSKGIWRSRYQLKDDDFLKMPILLPEAEEQTQIARFLDW 178 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T +I+ I + R IELLKE+KQ +++ VT+GL+P+V++K SG+EW+G +P HWE Sbjct: 179 KTAQINQFIRNKRRLIELLKEQKQNVINQAVTRGLDPNVRLKPSGVEWIGDIPAHWETTK 238 Query: 237 FFALVTELNRKNTKL----IESNILSLSYGNIIQKLETRNMGLKP--ESYETYQIVDPGE 290 +V+ K+ E ++ L NI + +P E + + G+ Sbjct: 239 LKRVVSFNPSKSETRANSADEEKVVFLPMENISVNGDIDCSEKRPLSEVWSGFTYFRRGD 298 Query: 291 IVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLC--KV 345 +V I N K + G T+ + ++P ID +L +LM + Sbjct: 299 VVMAKITPCFENGKGAYLQGLETGFGFGTTELIVLRPLKAIDGAFLRFLMWTKQFLLLGE 358 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 Y G+ +Q + + VK P+ +PPI+EQ +I I ++A ID + + ++ I L++E Sbjct: 359 QYMTGAAGQQRIPLDFVKNYPIGLPPIEEQREILAHIQEKSAEIDQALTRAQREIELIRE 418 Query: 406 RRSSFIAAAVTGQIDLRG 423 R+ I+ VTGQ+D+RG Sbjct: 419 YRTRLISDVVTGQVDVRG 436 >gi|158520294|ref|YP_001528164.1| restriction modification system DNA specificity subunit [Desulfococcus oleovorans Hxd3] gi|158509120|gb|ABW66087.1| restriction modification system DNA specificity domain [Desulfococcus oleovorans Hxd3] Length = 413 Score = 244 bits (622), Expect = 3e-62, Method: Composition-based stats. Identities = 134/422 (31%), Positives = 204/422 (48%), Gaps = 15/422 (3%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 K YP+YKDSGV+WIG +P+ W+V +K K +T+ +D IYI LE+VES TG+ Sbjct: 2 KRYPKYKDSGVEWIGEVPEQWEVKRLKFLAKNVNEQTNTKKQDEIYIALENVESWTGRIS 61 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPE 122 P+D + + S F IL+GKL PYL K + G+C +FLVL+ +VLPE Sbjct: 62 PQD--NEITFESQAKCFCSNDILFGKLRPYLAKVARPNKSGVCVGEFLVLRVLDNEVLPE 119 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L+ L S + + + GA M ADW I N+ + P EQ I + +T ID Sbjct: 120 FLEQKLRSQWFIELVNSSTFGAKMPRADWTFISNVKLTYPSPKEQNHIASYLDHKTRLID 179 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 TLI ++ + +ELL+E++ AL+S+ VTKGLNP KMKD+GIEW+G VP+HW + Sbjct: 180 TLIEKKQKLVELLQEQRTALISHAVTKGLNPKTKMKDTGIEWLGKVPEHWATASLRWYLR 239 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + + + + NI MG ++ + G + Sbjct: 240 IGSGEFLSNNDFLTEASDQKNIPVIGGNGVMGYTSKTNIQEPTIAIGRV--------GAL 291 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 I +A YL+ + DL ++ + + + Sbjct: 292 CGNVHLVNPPAWITDNALRLSNIKDFLIDYLSLFLGVLDLNRLANQNA---QPLITGSMI 348 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 K V +PPI EQ DI + + ID + + + I +L+E R++ I+ VTG+ID+R Sbjct: 349 KSQKVPIPPIPEQKDILQYCSKFSQTIDHGINTLHKQIAVLQEYRTTLISDVVTGKIDVR 408 Query: 423 GE 424 E Sbjct: 409 DE 410 >gi|322420420|ref|YP_004199643.1| restriction modification system DNA specificity domain-containing protein [Geobacter sp. M18] gi|320126807|gb|ADW14367.1| restriction modification system DNA specificity domain protein [Geobacter sp. M18] Length = 459 Score = 243 bits (619), Expect = 5e-62, Method: Composition-based stats. Identities = 97/431 (22%), Positives = 180/431 (41%), Gaps = 15/431 (3%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M YP Y+D+GV W+G+IP HW K + K R+ +++ + + + T Sbjct: 1 MMKLAPYPDYRDAGVSWVGSIPAHWPEKRAKYYFKEIDERSQTGDEEM--LSVSHITGVT 58 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD-- 118 + + G ++ + ++ +++ GI S + V +P+ Sbjct: 59 PRSQKNVTMFKAESNVGQKRCQPGDLVINTMWAWMSALGVSNHAGIVSPAYGVYRPRSNQ 118 Query: 119 -VLPELLQGWLLSIDVTQRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 L L G ++ ++P+ PP EQ I + Sbjct: 119 AYDNYYLDHLLRIEGYRSEYICRSTGIRSSRLRLYPDKFLSMPVVCPPQEEQQTIARFLK 178 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 A+ I + R IELLKE+KQ +++ VT+GL+P VK K SG+EW+G +P+HWEV+ Sbjct: 179 AQDRLFRKFIRNKRRLIELLKEQKQNVINQAVTRGLDPKVKFKPSGVEWIGDIPEHWEVR 238 Query: 236 PFFALVTELNRKNTKLI--ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 L LN + ++ E+ I + ++ + + +S PG+++F Sbjct: 239 RLKFLCHNLNEQTSEKQPGETYIALEHVESWTGRISLPDDEISFDSQVKR--FKPGDVLF 296 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + K + + + + + +L +RS + + + G Sbjct: 297 GKLRPYLAKVT---RPQTAGVCVGEFLVLRATGNVSANFLEQKLRSKRVIDLINSSTFGA 353 Query: 354 -RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + + L PP EQ +I I ++A ID + + ++ I L++E R+ I+ Sbjct: 354 KMPRADWTFIGNLKFTYPPADEQQEILEHIQEKSAEIDQAISRAQREIELMREYRTRLIS 413 Query: 413 AAVTGQIDLRG 423 VTGQ+D+RG Sbjct: 414 DVVTGQVDVRG 424 >gi|294789183|ref|ZP_06754422.1| restriction modification system DNA specificity domain protein [Simonsiella muelleri ATCC 29453] gi|294482924|gb|EFG30612.1| restriction modification system DNA specificity domain protein [Simonsiella muelleri ATCC 29453] Length = 436 Score = 243 bits (619), Expect = 5e-62, Method: Composition-based stats. Identities = 115/437 (26%), Positives = 199/437 (45%), Gaps = 21/437 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT---GRTSESGKDIIYIGLEDVESGTGK 62 Y +YKDSG+ W+G +P+HW + +K N G +++ +I+Y+ + V G Sbjct: 3 RYEKYKDSGIAWLGEVPEHWSICRLKDEVTFNDEVLGDKTDTDYEILYVDISSVSLIEGI 62 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDV 119 + S + I G ++ + YL+ + + I ST F VL+PK+ Sbjct: 63 IQKELMTFENSPSRARRIVKNGDVIVSTVRTYLKAITQIQDAEDNLIVSTGFAVLRPKEN 122 Query: 120 L-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L P L W+ S ++ I + G + + + +P+ PL EQ I + + Sbjct: 123 LFPRFLGYWVQSENMIGAIVSNSVGVSYPAINATDLVRLPIVKLPLKEQTAIAHYLDTKL 182 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF- 237 ID LI ++ +E L E++ A++++ VTKGLNP MK+SG+EW+G VP HW+V PF Sbjct: 183 GEIDALIDKQQTLLEKLAERRTAVITHAVTKGLNPAAPMKNSGVEWLGDVPAHWDVSPFK 242 Query: 238 --FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPG 289 + + K + S + ++ NI + + + + Y+ V G Sbjct: 243 LVMNSIIDYRGKTPEKTNSGVFLITARNIKNGIIDYTLSQEFIDEDNYEEVMRRGLPKLG 302 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYA 348 +++ + + + + D+ +L + + S Y Sbjct: 303 QVLMTTEAPLGEVAQI---DRTDVALAQRVLKFDGKKDKLDNRFLKYFILSKAFQASLYK 359 Query: 349 MGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +G +K E + L L+PP+ EQ I N ++ ETA+ID L E + Q+I LKE R Sbjct: 360 FATGSTALGIKSERLSYLKSLLPPVTEQTAIANYLDQETAKIDRLCETVNQTIGRLKEYR 419 Query: 408 SSFIAAAVTGQIDLRGE 424 ++ I AVTG+I + E Sbjct: 420 TALITQAVTGKIKVTDE 436 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 50/213 (23%), Positives = 100/213 (46%), Gaps = 8/213 (3%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQ 266 +N K KDSGI W+G VP+HW + VT + K E + +S ++I+ Sbjct: 1 MNRYEKYKDSGIAWLGEVPEHWSICRLKDEVTFNDEVLGDKTDTDYEILYVDISSVSLIE 60 Query: 267 KLETRN-MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + M + +IV G+++ + K + + I+++ + ++P Sbjct: 61 GIIQKELMTFENSPSRARRIVKNGDVIVSTVRTYL-KAITQIQDAEDNLIVSTGFAVLRP 119 Query: 326 HGID-STYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 +L + ++S ++ + G+ ++ D+ RLP++ P+KEQ I + ++ Sbjct: 120 KENLFPRFLGYWVQSENMIGAIVSNSVGVSYPAINATDLVRLPIVKLPLKEQTAIAHYLD 179 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + ID L++K + + L ERR++ I AVT Sbjct: 180 TKLGEIDALIDKQQTLLEKLAERRTAVITHAVT 212 >gi|91225110|ref|ZP_01260332.1| hypothetical type I restriction-modification system specificity determinant [Vibrio alginolyticus 12G01] gi|91190053|gb|EAS76324.1| hypothetical type I restriction-modification system specificity determinant [Vibrio alginolyticus 12G01] Length = 464 Score = 242 bits (618), Expect = 6e-62, Method: Composition-based stats. Identities = 108/446 (24%), Positives = 185/446 (41%), Gaps = 24/446 (5%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLED 55 Y+AYP+YKDS V+W+ IPK W +K + + + YI + D Sbjct: 8 NRYQAYPEYKDSDVEWLDDIPKDWCTRRLKHMLESPMSYGANEAAERAVSTEPRYIRITD 67 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLV 113 + S G S D ++ + IL + G + K+ I + C +L+ Sbjct: 68 MNSD-GTLKEDTFRSLPKDIASDYLLKDRDILLARSGATVGKSFIYRKEFGDCCFAGYLI 126 Query: 114 LQPKDV---LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVL 169 D + + S Q I AT+ + + G + + +P + EQ Sbjct: 127 KVSCDSARLNSDYAFWFFQSSSYWQYISGSQIQATIQNVSAEKYGEMYISLPEHVEEQTQ 186 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + ET +IDTLI ++ + I+LLKEK+QA++S+ VTKGLNP MK+SG+EW+G VP Sbjct: 187 IANFLDHETAKIDTLIEKQQQLIKLLKEKRQAVISHAVTKGLNPQAPMKNSGVEWLGEVP 246 Query: 230 DHWEVKPFFALVTEL----NRKNTKLIESNILSLSYGNIIQK-----LETRNMGLKPESY 280 +HWE + ++ ++ + L N+ E + Sbjct: 247 EHWEQIKLKHITHQIVDAEHKTAPYFDDGEYLVCRTTNVRDGKLRLDGGKYTNHAIYEEW 306 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + G+I+F + + + +V + + ++ + S Sbjct: 307 TKRGQPEVGDILFTREAPAG-EACVYTGEVPLCLGQRMVLFKLNQTRVLPEFVLHSIYSG 365 Query: 341 DLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + G D++ +P+ PP EQ I + + A+ D L Sbjct: 366 LADDFVKQLSQGSTVAHFNMSDIQNIPLFEPPKDEQAQIVDHLAKVLAKYDALTSSASLK 425 Query: 400 IVLLKERRSSFIAAAVTGQIDLRGES 425 I L++ERR++ I+AAVTG+ID+R Sbjct: 426 IELMQERRTALISAAVTGKIDVRNWQ 451 >gi|152984823|ref|YP_001345472.1| type I restriction-modification system subunit S [Pseudomonas aeruginosa PA7] gi|150959981|gb|ABR82006.1| type I restriction-modification system, S subunit [Pseudomonas aeruginosa PA7] Length = 464 Score = 242 bits (616), Expect = 1e-61, Method: Composition-based stats. Identities = 119/447 (26%), Positives = 205/447 (45%), Gaps = 26/447 (5%) Query: 4 YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESG 59 + YP+Y+ SGV+W+ +P HW VPIK + KDI G+ + +G Sbjct: 3 FPCYPKYRASGVEWLDQVPDHWSSVPIKYMALERNSLFLDGDWIESKDISSDGIRYITTG 62 Query: 60 T---GKYLPKDGNSRQSDT---STVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQ 110 G Y + +T + +G +L +L + +A + G + S Sbjct: 63 NVGEGAYKEQGAGFISEETFHALRCTEVYEGDVLVSRLNNPIGRACVVPNLGGRVVTSVD 122 Query: 111 FLVLQPKDVLPELLQGW-LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 ++ +P + + S + + + GATM +GNI + P L EQ Sbjct: 123 NVIFRPDLKFYKKFIVYLFSSEEYFKHTSNLARGATMQRISRGLLGNIRVVTPSLEEQTQ 182 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + ET RID LI E+ R IELLKEK+QA++S+ VTKGL+P V MKDSG+EW+G VP Sbjct: 183 IARFLDHETARIDALIEEQQRLIELLKEKRQAVISHAVTKGLDPTVPMKDSGVEWLGEVP 242 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE----------S 279 HWEV+ ++ ++ ++ +Q L ++ +K E + Sbjct: 243 AHWEVRSISSISKKITNGYVGPTRDILVDEPGVRYLQSLHIKSNKIKFEVPYFVSEQWSA 302 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 I+ G+++ + ++ + +++W++ S Sbjct: 303 EHAKSILASGDVLIVQTGDIGQVAVVTEEHAGCN-CHALIIVSPVREVVLGEWVSWVLNS 361 Query: 340 YDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 ++ +G L +VK L + +PP++EQ I + I +D L+ + ++ Sbjct: 362 TYGYHSLLSIQTGAMHPHLNCGNVKFLNLPIPPLEEQARIVSFIESGELEMDSLMSETKR 421 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGES 425 S++LL+ERR++ I+AAVTG+ID+RG Sbjct: 422 SLLLLQERRTALISAAVTGKIDVRGWQ 448 >gi|148263547|ref|YP_001230253.1| restriction endonuclease S subunits-like protein [Geobacter uraniireducens Rf4] gi|146397047|gb|ABQ25680.1| Restriction endonuclease S subunits-like protein [Geobacter uraniireducens Rf4] Length = 443 Score = 242 bits (616), Expect = 1e-61, Method: Composition-based stats. Identities = 107/442 (24%), Positives = 193/442 (43%), Gaps = 24/442 (5%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD---------IIYIGL 53 Y+AYP+YKDSG +W+G +P HW+V+ IK + + G + D + + Sbjct: 4 RYQAYPEYKDSGEEWLGDVPSHWEVIQIKHLSTVRRGASPRPIDDAKYFDDEGEYAWTRI 63 Query: 54 EDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV 113 DV + +S G + G + I F V Sbjct: 64 ADVTASEMYLFNAPQRLSDLGSSLSVKLEPGALFLSIAGTVGKPC-ITGMKACIHDGF-V 121 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 P+ +P ++ + + Q + + + T + + +G I + ++ I + Sbjct: 122 YFPELKIPSKFLFYVFAGE--QAYKGLGKFGTQLNLNTDTVGGIKIGCTENSQLEKIVQF 179 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + ET +IDTLI ++ + I+LLKEK+QA++S+ VTKGLNPD MKDSG+EW+G VP+HW+ Sbjct: 180 LDHETAKIDTLIDKQQQLIKLLKEKRQAVISHAVTKGLNPDAPMKDSGVEWLGEVPEHWD 239 Query: 234 VK---PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD--- 287 V +T+ + +S ++ L L +V+ Sbjct: 240 VCLAKFKTHAITDGAHISPDTKNGEHYFVSIKDMCDGLINFEDALLTSKESYKYLVNTGC 299 Query: 288 ---PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 PG+I+F K + V + + + + +L +S + + Sbjct: 300 KPEPGDILFSKDGTIG-KTVVTPENVDFVVASSLIIIKPNLKKLSPQFFDYLCQSCVIQE 358 Query: 345 VFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + G + L +++ ++ + PP+ EQ I I+ + R + + +I L+ Sbjct: 359 QVNSFVKGAALKRLSIQNLLKVWGVFPPLDEQVVIAKHIDKKLIRYQQIEQTANNAIALM 418 Query: 404 KERRSSFIAAAVTGQIDLRGES 425 +ERR++ I+AAVTG+ID+R Sbjct: 419 QERRTALISAAVTGKIDVRDWQ 440 >gi|297581971|ref|ZP_06943891.1| restriction endonuclease S subunit [Vibrio cholerae RC385] gi|297533838|gb|EFH72679.1| restriction endonuclease S subunit [Vibrio cholerae RC385] Length = 437 Score = 242 bits (616), Expect = 1e-61, Method: Composition-based stats. Identities = 128/435 (29%), Positives = 212/435 (48%), Gaps = 19/435 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 Y +YK+S V W+G IP HWK++P + + + + ++ G +Y Sbjct: 3 PYSEYKESRVPWLGKIPSHWKLLPCRAIVDNQVEKNDSGKIEEYLSLMANI--GVVRYEE 60 Query: 66 KD--GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 K GN + D + + +G ++ + + ++ F+G+CS ++VL+PK+ + E Sbjct: 61 KGDVGNKKPEDLTKCKLVKQGNLVINSMNYAIGSYGMSPFNGVCSPVYIVLEPKEQIVER 120 Query: 124 LQ--GWLLSIDVTQRIEAICEGATMSH--ADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + + + G W I +P+PPL EQ I + ET Sbjct: 121 RYALRLFENKPMQKHLAQLGNGILQHRAAIKWDDIKPQAVPVPPLEEQRAILYFLDRETQ 180 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 RID+LI E++ FI+LLKEK+QAL+S+IVTKGLNP+V+M+DSGIEW+G VP HW + Sbjct: 181 RIDSLIAEKLTFIKLLKEKRQALISHIVTKGLNPNVEMQDSGIEWIGQVPKHWGISKVRY 240 Query: 240 LVTELNRKNT--KLIESNILSLSYGNIIQKLETRNMG----LKPESYETYQIVDPGEIVF 293 L N N + +SYG++ L E V G+++F Sbjct: 241 LGQCQNGINIGGEFFGHGTPFVSYGDVYNNTSLPEKVQGLVLSTEKDRDNYSVIAGDVLF 300 Query: 294 RFIDLQNDKRSL--RSAQVMERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAM 349 ++ +E+ + + +P + + R+ L F Sbjct: 301 TRTSETIEEIGFSAVCKSTIEQAVFAGFLIRFRPDEGNLEVGFSEYYFRNEKLRAFFAKE 360 Query: 350 GS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + R SL + +K++PVL+PPI EQ +I N + E + + + E++I+LLKERR+ Sbjct: 361 MNLVTRASLSQDLLKKMPVLLPPIDEQNEIANYLQAECNKFSEIFAETEKTILLLKERRT 420 Query: 409 SFIAAAVTGQIDLRG 423 S I+AAVTG+ID+R Sbjct: 421 SLISAAVTGKIDVRE 435 >gi|315180942|gb|ADT87856.1| type I restriction-modification system specificity determinant [Vibrio furnissii NCTC 11218] Length = 449 Score = 241 bits (615), Expect = 1e-61, Method: Composition-based stats. Identities = 110/440 (25%), Positives = 193/440 (43%), Gaps = 19/440 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M Y++YP+YKDSG++W+G IP W +P+ R + + V + Sbjct: 1 MGKYQSYPKYKDSGIEWMGDIPNEWVTIPVGRLYYRTKRSGHSEKELLSVYRDYGVIPKS 60 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + + N D + + ++ K+ + +++++GI S + V +P++ L Sbjct: 61 SR--DDNNNKESDDLTPYQLVQPNDLVMNKMKAWQGSIAVSEYEGIVSPAYFVYEPREKL 118 Query: 121 -----PELLQGWLLSIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREK 173 P + L + + + +G D I + +P EQ I E Sbjct: 119 FELAHPRYVHYLLRNPIYITQYMSRSKGIRVNQWDLDPDEFKTIELLLPSKDEQSKIFEF 178 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + ET +ID+LI ++ + I+LLKEK+QA++S+ VTKGLNP MKDS +EW+G VP+HW Sbjct: 179 LDHETAKIDSLIKKQQQLIKLLKEKRQAVISHAVTKGLNPQAPMKDSDVEWLGKVPEHWG 238 Query: 234 VKPFFA---LVTELNRKNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIV 286 F + + + GN + + + Y+ + I Sbjct: 239 TPKLFHVSTRIGDGLHSTPLYEDGTGYFFVNGNNLTNGVITIGATAKEVPLKEYQNHYIP 298 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 V I+ +L + + G SA I+ YL W + S + Sbjct: 299 LSNMSVLLSINGTIGNVALYREEKIILGK--SAAYINCKAEINPEYLRWFLTSDQAKLYY 356 Query: 347 YA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + +L ++++ VLVP ++EQ DI + ++ + L+ + LLKE Sbjct: 357 DLEVTGTTIYNLSLNSIRKMKVLVPSVQEQTDIAKFCEMSHSKYEKLILSAITQMDLLKE 416 Query: 406 RRSSFIAAAVTGQIDLRGES 425 RR++ I+AAVTG+ID+R Sbjct: 417 RRTALISAAVTGKIDVRNWQ 436 >gi|331666002|ref|ZP_08366896.1| type I restriction-modification system, S subunit [Escherichia coli TA143] gi|331057053|gb|EGI29047.1| type I restriction-modification system, S subunit [Escherichia coli TA143] Length = 467 Score = 241 bits (614), Expect = 2e-61, Method: Composition-based stats. Identities = 107/451 (23%), Positives = 191/451 (42%), Gaps = 27/451 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGL 53 + Y+AYP+Y+DSG++W +P +WK ++ + + G T + +I Sbjct: 4 LNKYQAYPEYRDSGMEWCNELPLNWKKTKLRWLSNIFAGGTPSKNVIDYWENGTVPWISS 63 Query: 54 EDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQF 111 V G ++ + S+ KG ++ G + C+ Sbjct: 64 GAVNQGYIVEPSTYISNAALENSSAKWIPKGALVVALAGQGKTKGMVAQLGINTTCNQSM 123 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 + + + I Q I + G + + +G+I P P E I Sbjct: 124 AAIVLYKK-NQSRYIFWWLISNYQNIRNMAGGDLRDGLNLELLGDIQCPKPRNDESSKIA 182 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + ET +ID LI ++ + IELLKEK+QA++S+ VTKGLNPDV MKDSG+ +G P H Sbjct: 183 LFLDHETAKIDDLIEKQQQLIELLKEKRQAVISHAVTKGLNPDVPMKDSGLTGLGEAPSH 242 Query: 232 WEVKPFFALVTELNRK-----------NTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 W E ++L E + + +I R + Sbjct: 243 WFKSKLANTGDETKGCFVNGPFGSDLLASELKEEGVPVVYIRDIKATGYNRKSTVYVTHQ 302 Query: 281 ETY----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAW 335 + + +++F + + + + I + V ++ ++A+ Sbjct: 303 KAQQLEICKLSSNDVIFSKVGDPPGEACVYPKNEPDAVITQDVMRVRVNKKTFNAHFIAY 362 Query: 336 LMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L+ S + + G R+ + D K + PP+ E I N +N + A+ID+++ Sbjct: 363 LLNSNFGRQTINNISIEGTRKRVSLGDFKTTKFIFPPLGEAQSIVNALNEKCAQIDLIIF 422 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 K Q+I+L++ERR++ I+AAVTG+IDLR + Sbjct: 423 KTNQAIMLIQERRTALISAAVTGKIDLRNWT 453 >gi|86130625|ref|ZP_01049225.1| type I site-specific deoxyribonuclease [Dokdonia donghaensis MED134] gi|85819300|gb|EAQ40459.1| type I site-specific deoxyribonuclease [Dokdonia donghaensis MED134] Length = 444 Score = 241 bits (614), Expect = 2e-61, Method: Composition-based stats. Identities = 103/438 (23%), Positives = 178/438 (40%), Gaps = 18/438 (4%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG 61 + Y YKDSGV+W+G IP+HW++ + + + + + + V Sbjct: 6 NKVQRYDSYKDSGVEWLGEIPEHWQLGRLGSILNPVSSKNHPNETLLSITREKGVIVRDI 65 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL-QPKDVL 120 + + N D + + KGQ K+ + ++ + GI S + K++ Sbjct: 66 ENEDSNHNFIPDDLTGYKLLKKGQFGMNKMKAWQGSYGVSSYTGIVSPAYYTFEFTKEIE 125 Query: 121 PELLQGWLLSIDVTQRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 P + S +G + IP+ +PPL EQ I E + +T Sbjct: 126 PRFFHIAIRSKMYVSFFGKASDGVRIGQWDLSKDRMKRIPLAVPPLPEQTAIAEFLDDKT 185 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +ID I + + I LLKE+KQ L+ VT+GL+ V +KDSG+EW+G +P+HW+VK F Sbjct: 186 TKIDDAIGIKQQQINLLKERKQILIHKAVTRGLDDSVTLKDSGVEWIGEIPEHWKVKRFR 245 Query: 239 ALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLK---------PESYETYQIV 286 + L E + ++YG I K T ++ Sbjct: 246 YIFQLGKGLTITKENLKEEGVFCVNYGEIHSKYGFEVDTNIQQLKCVDDDYLESNTNALI 305 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCK 344 G+ VF + + + I + I+S + A++ S Sbjct: 306 KEGDFVFADTSEDIEGSGNFTYLKSKDEIFAGYHTVVAKPKFKINSRFFAYVFESQSFRN 365 Query: 345 VFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 G+ S+ +K V P I+EQ +I + +++ T +I+ + EQ I L Sbjct: 366 QIRTKVKGVKVYSVTQSILKEPNVWYPSIQEQREIVDFLDIGTRKIETAIGLKEQEIEKL 425 Query: 404 KERRSSFIAAAVTGQIDL 421 KE + S I VTG++ + Sbjct: 426 KEYKGSLINGVVTGKVRV 443 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 53/217 (24%), Positives = 97/217 (44%), Gaps = 10/217 (4%) Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + + KDSG+EW+G +P+HW++ +++ ++ KN ++ G I+ Sbjct: 3 AIENKVQRYDSYKDSGVEWLGEIPEHWQLGRLGSILNPVSSKNHPNETLLSITREKGVIV 62 Query: 266 QKLETR--NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + +E N P+ Y+++ G+ + + S GI++ AY Sbjct: 63 RDIENEDSNHNFIPDDLTGYKLLKKGQFGMNKMKAWQGSYGVSSY----TGIVSPAYYTF 118 Query: 324 -KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDIT 379 I+ + +RS F G+ + L + +KR+P+ VPP+ EQ I Sbjct: 119 EFTKEIEPRFFHIAIRSKMYVSFFGKASDGVRIGQWDLSKDRMKRIPLAVPPLPEQTAIA 178 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ +T +ID + +Q I LLKER+ I AVT Sbjct: 179 EFLDDKTTKIDDAIGIKQQQINLLKERKQILIHKAVT 215 >gi|258513230|ref|YP_003189486.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256635133|dbj|BAI01107.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256638188|dbj|BAI04155.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-03] gi|256641242|dbj|BAI07202.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-07] gi|256644297|dbj|BAI10250.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-22] gi|256647352|dbj|BAI13298.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-26] gi|256650405|dbj|BAI16344.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-32] gi|256653396|dbj|BAI19328.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01-42C] gi|256656449|dbj|BAI22374.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-12] Length = 420 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 112/428 (26%), Positives = 191/428 (44%), Gaps = 18/428 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 + Y Y Y+DSGVQW+G P +W++ + + + S++ + + + Sbjct: 3 IAAYSKYDAYRDSGVQWVGQFPANWELARLGGLFEERRHKVSDTDFEPLSVT------KN 56 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 G + ++ +D + G + + I+ DG S +VL+PK +L Sbjct: 57 GIFPQLANAAKTNDGENRKLVRAGDFVINSRSDRKGSSGISPLDGSVSLINIVLEPKRIL 116 Query: 121 PELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 PE L S + + G + + + I + +P EQ I + + Sbjct: 117 PEFCHHLLKSYAFVEEYYRVGRGIVADLWTTRYDEMRTILIALPSPDEQRTIAAFLDGKC 176 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 ID + + + I LL E++Q L+ VT+GLNPD MKDSGI+W+G +P HWEVK Sbjct: 177 ALIDEAVRIKEKQIRLLVERRQILIQQAVTRGLNPDAPMKDSGIDWIGQIPAHWEVKRNK 236 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + E+N ++ K E + LS+S + + L ESY+ ++V G++V + Sbjct: 237 HMFVEINERSAKGEEQH-LSMSQKLGLVPADLVEKSLASESYQGAKLVRTGDLVLNRLKA 295 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-- 354 SL + G+++ Y +P G S Y L ++ F G+ Sbjct: 296 HLAVFSLAPME----GLVSPDYSVFRPLVQGASSDYFEILFKTSKYLGEFRLRVRGIVEG 351 Query: 355 -QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 L +D P+L+PP+ EQ I + TA+ ++ E I L+E ++S I A Sbjct: 352 FYRLYTDDFMDCPLLLPPLDEQLQIVEHVRATTAQFHNVIAIKESQITALREYKTSLINA 411 Query: 414 AVTGQIDL 421 AVTG+I + Sbjct: 412 AVTGKIKV 419 >gi|126462620|ref|YP_001043734.1| restriction modification system DNA specificity subunit [Rhodobacter sphaeroides ATCC 17029] gi|126104284|gb|ABN76962.1| restriction modification system DNA specificity domain [Rhodobacter sphaeroides ATCC 17029] Length = 456 Score = 240 bits (613), Expect = 2e-61, Method: Composition-based stats. Identities = 109/447 (24%), Positives = 198/447 (44%), Gaps = 33/447 (7%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDV 56 + YP YKDSGV+W+G +P+ W+V ++ +L TG + + ++ Sbjct: 2 RRYPAYKDSGVEWLGEVPEGWEVKCLRMIADELQTGPFGSQLHTEDYVTAGVPIVNPSNI 61 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSI-FAKGQILYGKLGPYLRKAIIADFDGI----CSTQF 111 G + G + + G I+ G+ G R A++ D + Sbjct: 62 LDGQIVPDDEIGVDEATALRLANHALLPGDIILGRRGELGRCAVVPDGTMPLLCGTGSLR 121 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 + L+ LP+ + + + V + + G+TM + + +G I + +P L EQ I Sbjct: 122 IRLKSSQALPDFIAECIRTPRVREWLSLQSVGSTMDNLNTAIVGKIQIALPSLPEQRAIT 181 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + ET +ID L+ E+ R I LL EK+QA++++ VT+GLNPD +K SGI+W+G +P+ Sbjct: 182 AFLNRETAKIDALVEEQRRLIALLAEKRQAVLNHAVTRGLNPDALLKPSGIDWLGDIPEG 241 Query: 232 WEVK------PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY----- 280 WEV + T + ++ +I S +I Q R + + Sbjct: 242 WEVVPIRKVARLESGHTPSRSRPEWWVDCHIPWFSLADIWQVRPGRVEYVYETAEAVSEL 301 Query: 281 ----ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + +++ G ++ + A + V + YL + Sbjct: 302 GLQNSSARLLPAGTVMLSRTASVGFSAVMGIAMATTQDFAN----WVCGCRLLPDYLLYC 357 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +R MGS ++ D++ L + +PP++EQ I + + +D L++ Sbjct: 358 LRGMPSEFERLKMGS-THNTIYMPDIRTLTIPLPPLEEQKAIVDHVRASVGALDELMDTA 416 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDLRG 423 +I LL+ERR++ I+AAVTG+ID+R Sbjct: 417 TTAITLLQERRAALISAAVTGKIDVRD 443 >gi|260581977|ref|ZP_05849772.1| restriction endonuclease S [Haemophilus influenzae NT127] gi|260094867|gb|EEW78760.1| restriction endonuclease S [Haemophilus influenzae NT127] Length = 416 Score = 240 bits (612), Expect = 3e-61, Method: Composition-based stats. Identities = 101/425 (23%), Positives = 191/425 (44%), Gaps = 18/425 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + Y +YKDSGV+W+G +P HW++ +K+ + + L GK + Sbjct: 2 RRYERYKDSGVEWLGEVPSHWELKRLKQLFVEKKHK--------QSLSLNCGAISFGKVI 53 Query: 65 PK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDV 119 K D ++ + KG+ L L + +++ D + S ++VL+ K + Sbjct: 54 EKSDDKVTEATKRSYQEVLKGEFLINPLNLNYDLISLRIALSEIDVVVSAGYIVLKEKQI 113 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + +LL ++ + G ++ I + + IPPL+EQ I + + +T Sbjct: 114 INKKYFSYLLHRYDVAYMKLLGSGV-RQTINYGHISDSILVIPPLSEQQKIAQFLDDKTA 172 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +ID + + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HW+V+ Sbjct: 173 KIDQAVDLAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWDVQRSKF 232 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + ++ RK + + ++ YQ + G++V +D Sbjct: 233 IFKKIERKVNEEDQIVTCFRDGQVTLRANRRTEGFTNALKEHGYQGIRKGDLVIHAMDAF 292 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS--- 356 + + + + + ID + A+ +R+ L ++ G+R+ Sbjct: 293 AGAIGISDSDGKATPVYS-VCLPHDKQKIDVYFYAYYLRNLALSGFISSLAKGIRERSTD 351 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ D L + +PP EQ I + ++ +T++ID + I LKE ++ I VT Sbjct: 352 FRYSDFAELLLPIPPYLEQQKIADYLDKQTSKIDRAIALKTAHIEKLKEYKNVLINDVVT 411 Query: 417 GQIDL 421 G++ + Sbjct: 412 GKVRV 416 >gi|281355061|ref|ZP_06241555.1| putative type I site-specific restriction-modification system, S subunit [Victivallis vadensis ATCC BAA-548] gi|281317941|gb|EFB01961.1| putative type I site-specific restriction-modification system, S subunit [Victivallis vadensis ATCC BAA-548] Length = 430 Score = 240 bits (611), Expect = 4e-61, Method: Composition-based stats. Identities = 104/424 (24%), Positives = 185/424 (43%), Gaps = 12/424 (2%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 K Y +YKDSG+ WIG +P+ WK+ P + T +S ++++ + L+ + Sbjct: 2 KRYVKYKDSGIPWIGEVPEGWKICPFFAIFTPIS-ITGKSVEELLSVYLDVGVVRFSEKR 60 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 K N+ +D S G + + I+ + GI S +LV++ + Sbjct: 61 EKRANATSADMSKYQYVDIGDFVLNNQQAWRGSVGISQYKGIVSPAYLVMKTSKQINSSF 120 Query: 125 QGW---LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + + G+ + W + M PPL EQ I E + + +I Sbjct: 121 ANYLVRSPACVYAYFLSSRGVGSIQRNIYWDELKRYKMVFPPLDEQREIVEYLDSVVAKI 180 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 D I E+ IE L KQ+++++ VTKG+NP+ KMKDSGI W+G VP+HW + Sbjct: 181 DGYIAEKEAEIEKLGLLKQSVIAHAVTKGINPNAKMKDSGIPWIGEVPEHWLQLRGKNIF 240 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 T + R E ++K + + YQ + PG++V +D Sbjct: 241 TRMARVVEADDEVITCFRDGQVTLRKNRRTDGFTESFKEIGYQGIRPGDLVIHQMDAFAG 300 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLRQS---L 357 + + +G T Y+ ++P G S + +L+R ++ G+R+ Sbjct: 301 AIGVSDS----KGKGTPVYICLQPKGEQSNFYYAYLLREMARTGYIKSLYRGIRERSSDF 356 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 ++E +L + +PP EQ I I+ + ID + + + I LK + I+ VTG Sbjct: 357 RYETFGKLLLPIPPADEQRAIVEFIDRKVKEIDGFISAVREQIEKLKLYKQRLISDVVTG 416 Query: 418 QIDL 421 +I + Sbjct: 417 KIKV 420 >gi|257061739|ref|YP_003139627.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8802] gi|256591905|gb|ACV02792.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8802] Length = 456 Score = 240 bits (611), Expect = 4e-61, Method: Composition-based stats. Identities = 124/445 (27%), Positives = 199/445 (44%), Gaps = 22/445 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK-LNTG-------RTSESGKDIIYIG 52 +K +K YP YK SGV W+G IP W+V ++ +K + G + + G Sbjct: 6 LKQWKLYPNYKPSGVDWLGDIPDSWEVKRLRYLSKKITAGPFGSNLTKNIYTSTGYKIYG 65 Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQ 110 E V + + + D + G IL +G + + A++ GI + + Sbjct: 66 QEQVIASDFSIGDYYISKEKYDQMSQYKINSGDILISCVGTFGKVAVVPKNIEQGIINPR 125 Query: 111 FLVLQPKDVLPELLQGWLLSIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 + L P + L V +++E + G TM + + +I +PIPPL EQ Sbjct: 126 LIKLIPITEYINSVYLEKLLKSVVAFEQMEKLSRGGTMGVINIGLLSDILLPIPPLPEQE 185 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + + ET +ID LIT + R IELLKEK+ AL+S+ VTKGLNPDV MKDSG+EW+G + Sbjct: 186 KIAQFLDKETAKIDKLITLKERLIELLKEKRTALISHAVTKGLNPDVPMKDSGVEWLGFI 245 Query: 229 PDHWEVKPFFALVTELN-----RKNTKLIESNILSLSYGNI----IQKLETRNMGLKPES 279 P+HWEVK +V + +ES I L NI I + + Sbjct: 246 PEHWEVKRLKYIVPNITVGIVVTPAKYYVESGIPCLRSVNISSGKIDNSNLVFISSQSNE 305 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + G++V + ++ + + + + +L S Sbjct: 306 LHQKSKIYKGDLVLVRTGVTGT-AAIVTDNFDGANCVDLLIIRNSRLILTLYLYYYLNSS 364 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +V ++ + L + PP +EQ I ++ +T +ID ++ K +S Sbjct: 365 TTSYQVNNYSVGAIQAHYNTSTLSELIITFPPPQEQQKIAEYLDRKTEQIDQIINKTRES 424 Query: 400 IVLLKERRSSFIAAAVTGQIDLRGE 424 I LKE R+ I+AAVTG+ID+R Sbjct: 425 IEYLKEYRTVLISAAVTGKIDVRQW 449 >gi|145633684|ref|ZP_01789410.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae 3655] gi|144985444|gb|EDJ92265.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae 3655] Length = 418 Score = 240 bits (611), Expect = 4e-61, Method: Composition-based stats. Identities = 102/425 (24%), Positives = 192/425 (45%), Gaps = 18/425 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + Y YKDSGV+W+G +P HW++ +K+ + + + L GK + Sbjct: 2 RRYESYKDSGVEWLGEVPSHWELKRLKQLFVEKKHKQN--------LSLNCGAISFGKVI 53 Query: 65 PK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDV 119 K D ++ + KG+ L L + +++ D + S ++VL+ K + Sbjct: 54 EKADDKVTEATKRSYQEVLKGEFLINPLNLNYDLISLRIALSEIDVVVSAGYIVLKEKQI 113 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + +LL ++ + G ++ I + + IPPL+EQ I + + +T Sbjct: 114 INKKYFSYLLHRYDVAYMKLLGSGV-RQTINYGHISDSILVIPPLSEQQKIAQFLDDKTA 172 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +ID + + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HW+V+ Sbjct: 173 KIDRAVDLAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWDVQRSKF 232 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + ++ RK + + ++ YQ + G++V +D Sbjct: 233 IFKKIERKVNEEDQIVTCFRDGQVTLRANRRTEGFTNALKEHGYQGIRKGDLVIHAMDAF 292 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS--- 356 + + + + + ID + A+ +R+ L ++ G+R+ Sbjct: 293 AGAIGISDSDGKATPVYS-VCLPHNKQKIDVYFYAYYLRNLALSGFISSLAKGIRERSTD 351 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ D L + +PP EQ I + ++ +T++ID ++ I LKE +S I VT Sbjct: 352 FRYADFAELLLPIPPYLEQQKIADYLDKQTSKIDQVIALKTAHIEKLKEYKSVLINDVVT 411 Query: 417 GQIDL 421 G++ + Sbjct: 412 GKVRV 416 >gi|68248718|ref|YP_247830.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae 86-028NP] gi|319896548|ref|YP_004134741.1| type i site-specific restriction-modification system, s subunit [Haemophilus influenzae F3031] gi|68056917|gb|AAX87170.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae 86-028NP] gi|317432050|emb|CBY80399.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae F3031] Length = 416 Score = 239 bits (610), Expect = 6e-61, Method: Composition-based stats. Identities = 102/425 (24%), Positives = 190/425 (44%), Gaps = 18/425 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + Y +YKDSGV W+G +P HW++ +K+ + + L GK + Sbjct: 2 RRYERYKDSGVDWLGEVPSHWELKRLKQLFVEKKHK--------QSLSLNCGAISFGKVI 53 Query: 65 PK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDV 119 K D ++ + KG+ L L + +++ D + S ++VL+ K + Sbjct: 54 EKSDDKVTEATKRSYQEVLKGEFLINPLNLNYDLISLRIALSEIDVVVSAGYIVLKEKQI 113 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + +LL ++ + G ++ I + + IPPL+EQ I + + +T Sbjct: 114 INKKYFSYLLHRYDVAYMKLLGSGV-RQTINYGHISDSILVIPPLSEQQKIAQFLDDKTA 172 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +ID + + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HW+V+ Sbjct: 173 KIDQAVDLAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWDVQRSKF 232 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + ++ RK + + ++ YQ + G++V +D Sbjct: 233 IFKKIERKVNEEDQIVTCFRDGQVTLRANRRTEGFTNALKEHGYQGIRKGDLVIHAMDAF 292 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS--- 356 + + + + + ID + A+ +R+ L ++ G+R+ Sbjct: 293 AGAIGISDSDGKATPVYS-VCLPHDKQKIDVYFYAYYLRNLALSGFISSLAKGIRERSTD 351 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ D L + +PP EQ I + ++ +T++ID + I LKE +S I VT Sbjct: 352 FRYSDFAELLLPIPPYLEQQKIADYLDKQTSKIDRAIALKTAHIEKLKEYKSVLINDVVT 411 Query: 417 GQIDL 421 G++ + Sbjct: 412 GKVRV 416 >gi|144900420|emb|CAM77284.1| type I restriction-modification system, S subunit [Magnetospirillum gryphiswaldense MSR-1] Length = 431 Score = 239 bits (610), Expect = 6e-61, Method: Composition-based stats. Identities = 119/441 (26%), Positives = 188/441 (42%), Gaps = 27/441 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIK-RFTKLNTGRTSE----SGKDIIYIGLED 55 M + Y YKDSGV+W+G +P HW V P+K L +G S ++I + + Sbjct: 1 MS-FPQYADYKDSGVEWLGEVPGHWDVFPLKRDLAFLTSGSRGWAEHYSDDGALFIRIGN 59 Query: 56 VESGTGKYLPKDGNSRQ---SDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICST 109 + D + + G +L+ + YL +A + S Sbjct: 60 LTRDGIHLDLSDIQRVEVPDGAEGERTRVVGGDVLFS-ITAYLGSVAVAPEELEVAYVSQ 118 Query: 110 Q--FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 L + +P + LS + G T + N+ M PPL EQ Sbjct: 119 HVALARLHQRRFIPAWVGYVTLSNIGETYLGTQGYGGTKVQLSLDDVANLIMTAPPLPEQ 178 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + +T +ID L+ E+ R + LL EK+QA++S+ VTKGLNP MKDSGIEW+G Sbjct: 179 SAIAAFLDRQTGKIDALVAEQERLLTLLAEKRQAVISHAVTKGLNPAAPMKDSGIEWLGE 238 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 VP+HW+V P T + + MG T+ ++ Sbjct: 239 VPEHWKVIPLRWFCTCKSGDSISADGVEAECDEDRTAPVIGGNGVMGYTYAPNITHPVLV 298 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 G + N A V + +I + + + YL+ L+RS + Sbjct: 299 IGRV---GALCGNVHSIKLPAWVTDNALI----LDIAEGVFNQEYLSHLLRS---RNLNE 348 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + + V+ + + P+ EQ I +N +TA+ID L + ++I LLKE R Sbjct: 349 IASKTAQPLITGSQVRDQRIPLAPMDEQSAIVEFLNEQTAKIDTLTAEALRAIALLKEHR 408 Query: 408 SSFIAAAVTGQIDLRG--ESQ 426 S+ I+AAVTG+ID+RG E++ Sbjct: 409 SALISAAVTGKIDVRGLVEAE 429 >gi|332974851|gb|EGK11766.1| restriction modification system DNA specificity subunit [Psychrobacter sp. 1501(2011)] Length = 442 Score = 238 bits (607), Expect = 1e-60, Method: Composition-based stats. Identities = 119/438 (27%), Positives = 193/438 (44%), Gaps = 22/438 (5%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 YKAYP+YKDSGV+WIG IP W++ IK +K G +V Sbjct: 11 RYKAYPEYKDSGVEWIGEIPSGWELTRIKYVSKCLDGARIPLNASERGEMSGNV------ 64 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDV 119 P G ++ D +F + +L G+ G K + + G V + + Sbjct: 65 --PYWGANKVVDHINDYLFDEELVLLGEDGAPFFDKNKDVAFNVSGKIWPNNHVHVLRPL 122 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + ++ +L G+T + + I + P L EQ I + ET Sbjct: 123 MEKVEPRFLKHSLNCADFYLYISGSTRDKLNQSDMNEIFIRAPKLIEQKQIANFLDYETA 182 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +ID LI ++ R IELL EK+QA++S+ VTKGLNPD MKDSG+EW+G VP+HW V Sbjct: 183 KIDNLIEKQQRLIELLTEKRQAVISHAVTKGLNPDAPMKDSGVEWLGDVPEHWIVTKLRQ 242 Query: 240 LVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY------QIVDPGE 290 L ++ + I +S NI + K S+E Y V+ G+ Sbjct: 243 LAFLQEGPGLRHWQFKAQGIKVISVTNITEAGIDFTRLEKFISHEEYLQSYQHFTVNKGD 302 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVF-YA 348 I+ K + ++ + H + + S + A Sbjct: 303 ILLSSSGNSWGKVATYEGDDKVILNTSTIRLNELKHRPLVQPFIKFFLLSEACREQLGLA 362 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 M + + + + +VPP+ EQ+ I+ I+ + ++I L++ E +I L++ERR+ Sbjct: 363 MTGSCQPNFGPTHLNEVKTVVPPVDEQYAISKYIDEKVSKISELLQVCESTIQLMQERRT 422 Query: 409 SFIAAAVTGQIDLRGESQ 426 + I+AAVTG+ID+R + Sbjct: 423 ALISAAVTGKIDVRDWVK 440 >gi|300724721|ref|YP_003714046.1| type I restriction-modification [Xenorhabdus nematophila ATCC 19061] gi|297631263|emb|CBJ91958.1| Type I restriction-modification [Xenorhabdus nematophila ATCC 19061] Length = 429 Score = 238 bits (606), Expect = 1e-60, Method: Composition-based stats. Identities = 119/430 (27%), Positives = 194/430 (45%), Gaps = 32/430 (7%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVE 57 M Y+AYP+YKDSGV+W+G IPKHW V +K + G+ + S IG Sbjct: 1 MGKYRAYPEYKDSGVEWLGKIPKHWNVCRLKHLIIIRNGQDYKMVQSDAGYPVIGSG--- 57 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 Q ST ++ K +L G+ G + + + T + Sbjct: 58 -------------GQFAFSTQYMYDKPSVLLGRKGTIDKPLYVNEPFWTVDTMYYTEMRD 104 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPI-PPLAEQVLIREKIIA 176 DV + L ++I + + + +GN + + E++LI + Sbjct: 105 DVDAKYLYYLAVTIQF----DRYSTSTALPSMTQENLGNYFFAVSNEITERLLISTFLDH 160 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 ET +ID LI ++ + I+LLKEK+QA++S+ VTKGLN DV MKDSG+EW+G +P W++ Sbjct: 161 ETAKIDILIEKQQQLIKLLKEKRQAVISHAVTKGLNLDVPMKDSGVEWLGYIPSEWDIVR 220 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 +V K + E + NI K M G+++F + Sbjct: 221 LKYIVALTGDKAPQSTE---KYVGMENISSKSGKYIMTKNALPEGVSNSFKKGDVLFGKL 277 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQ 355 K L GI +S ++ + + +L + M + + G Sbjct: 278 RPYLAKSWLAEFS----GICSSEFLVLHSLKVHPKFLNYYMLTDAFIDQVNSSTYGSKMP 333 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + LPV + K I N + +T++ID+L+EK ++ I LL+ERR+S I+AAV Sbjct: 334 RASWDFIGLLPVPITTYKSTEKIANFLGQKTSKIDMLLEKQQKVIKLLQERRTSLISAAV 393 Query: 416 TGQIDLRGES 425 TG+ID+R Sbjct: 394 TGKIDIRNWQ 403 >gi|254410563|ref|ZP_05024342.1| hypothetical protein MC7420_3078 [Microcoleus chthonoplastes PCC 7420] gi|196182769|gb|EDX77754.1| hypothetical protein MC7420_3078 [Microcoleus chthonoplastes PCC 7420] Length = 430 Score = 238 bits (606), Expect = 2e-60, Method: Composition-based stats. Identities = 110/431 (25%), Positives = 180/431 (41%), Gaps = 16/431 (3%) Query: 4 YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 + Y +YKDSGV+W+G IP+HW+ + K +L T ++ + + D+ + Sbjct: 3 FPRYERYKDSGVEWLGQIPEHWETLRTKNIFRLITEAAPKNNDEELLSVYSDIGVKPRRE 62 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 L + GN T I KG ++ KL ++ I+D+DG+ S + VL+ + Sbjct: 63 LEERGNKAS-TTDGYWIVKKGDVIVNKLLAWMGAIGISDYDGVTSPAYDVLRAYKPIDSK 121 Query: 124 LQGWLLSIDVTQRIEAICE---GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 +L + + G I +P PP Q I E + + Sbjct: 122 YYHYLFRSPICLSKLKQHSRGIMEMRLRLYFDEFGRIRLPYPPFEIQKRIVEFLDRKCGE 181 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 I+ I + R IELL+E+K L++ VTKGL+P+ MKDSGIEW+G +P HWEVK + Sbjct: 182 IEDAIAHKKRLIELLEEQKTILINQAVTKGLDPNAPMKDSGIEWIGEIPTHWEVKKLKRI 241 Query: 241 VTEL-----NRKNTKLIESNILSLSYGNIIQKLETRNMGLK----PESYETYQIVDPGEI 291 + + +E ++ L NI + Y + + G+I Sbjct: 242 SPCITVGIVITPSKYYVEEGVICLRSLNIKPNKILVKDSVYISERSNKYLSKSKIFAGDI 301 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 V + I + KP +++ M S + S Sbjct: 302 VCVRTGQPGVSAVVDRRFDGANCI--DLIIIRKPKNDLPKFVSLAMNSEVCRSQYLTGAS 359 Query: 352 GL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G +Q E + L + +PP+ EQ I N I+ L+ I++ I L+ E + Sbjct: 360 GAIQQHFNIEMAQNLVIAIPPLPEQIKIYNHISKIQKNTMDLMNFIKREIDLMNELKQIL 419 Query: 411 IAAAVTGQIDL 421 IA AVTG+I + Sbjct: 420 IAEAVTGKIKI 430 >gi|265754307|ref|ZP_06089496.1| predicted protein [Bacteroides sp. 3_1_33FAA] gi|263235016|gb|EEZ20571.1| predicted protein [Bacteroides sp. 3_1_33FAA] Length = 423 Score = 237 bits (605), Expect = 2e-60, Method: Composition-based stats. Identities = 91/429 (21%), Positives = 173/429 (40%), Gaps = 21/429 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE--------SGKDIIYIGLEDV 56 K Y YKDSGV+WIG IP HW+ + I R + T+ S K + ++ D+ Sbjct: 3 KKYDAYKDSGVKWIGEIPNHWEAIKISRVHPIIGSGTTPLSSREDYYSEKGLNWLQTGDL 62 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 +G K + D + + ++ G + K + D + + ++ P Sbjct: 63 NNGLITETSKKITPKAVDECKMKFYPIHSVVIAMYGATIGKVGLLDIETATNQACCIIVP 122 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + I + + + G + I + +P+PPL+EQ I + Sbjct: 123 SKRICPKYTFYSFIIAKEELLLSSFGG-GQPNISQDIIRKLKVPVPPLSEQQSIASYLDV 181 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T +ID +I + + E L E KQ+L++ VT+GLNP+ +KDSG+ W+G +P HW++ Sbjct: 182 KTEKIDKMIAKAEKKTEYLDELKQSLITRAVTRGLNPNTPLKDSGVNWIGNIPMHWDIAC 241 Query: 237 FFALVTELNRK----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + +N + N L L GN E + D +++ Sbjct: 242 LRFFLRLINGRAYSQNELLPSGKYKVLRVGNFFTNDSWY---YSNMELEPDKYCDKDDLL 298 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + I V+ ++ + + M Sbjct: 299 YAWSASVGPYI-----WNEAKTIYHYHIWKVQLATSMDKMYSYYLLRAVTNQKMSDMHGS 353 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + D+ + + +PP+ EQ I ++ + ++ID ++ ++ I L+E + S I Sbjct: 354 TMMHITMGDMNKTKIPIPPLSEQQQIATYLDTKCSKIDHIIATQKKKIAYLQELKQSLIT 413 Query: 413 AAVTGQIDL 421 VTG+I + Sbjct: 414 NVVTGKIKV 422 >gi|229163473|ref|ZP_04291424.1| hypothetical protein bcere0009_42390 [Bacillus cereus R309803] gi|228620042|gb|EEK76917.1| hypothetical protein bcere0009_42390 [Bacillus cereus R309803] Length = 441 Score = 237 bits (604), Expect = 2e-60, Method: Composition-based stats. Identities = 97/434 (22%), Positives = 185/434 (42%), Gaps = 21/434 (4%) Query: 4 YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK Y YK S VQWIG +PKHW++ I + + S+ + + + G Sbjct: 3 YKPYEHYKSSDVQWIGKVPKHWELKKISSIFEQRNEKVSDKDFEPLSVT------KMGIL 56 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + ++ + K + ++ FDG S V++PK + + Sbjct: 57 KQLENVAKTDNNDNRKKVLKNDFVINSRSDRKGSCGVSKFDGSVSLICTVIKPKTINTYM 116 Query: 124 LQGWLLSIDVTQRIEAICEGATM----SHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L + E G + W I +PIPP EQ I + Sbjct: 117 DYYHHLFRNKMFSEEFYRWGRGIVDDLWSTKWDEFKRILIPIPPHEEQKSIVSYLNHIYE 176 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 I+ LIT + + IE +++ +++L++ VT GLNP KMKDS +EW+G +P+HW K Sbjct: 177 AIEELITHKQQQIETIQQYQRSLITEAVTSGLNPHAKMKDSSVEWIGEMPEHWITKRLDF 236 Query: 240 LVTELNR------KNTKLIESNILSLSYGNIIQ-KLETRNMGLKPES---YETYQIVDPG 289 + R ++ E+ + L+ NI + +++ N+ E ++ G Sbjct: 237 VSVVKARLGWKGLTASEYQENGYIFLAIPNIKKFQIDFENVNYISEKRYKESPEIMLQVG 296 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +++ + ++ + +S + + S +L + ++S + K+ Sbjct: 297 DVLLAKDGSTLGEVNVVRYLPSPATVNSSIAVIRPKGDLHSVFLYYYLKSNYIQKIIQKK 356 Query: 350 GSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G L +D+ + + VPP+ EQ I ++ + + I+ L+ + ++ I +L++ R Sbjct: 357 KDGMGVPHLFQKDINKFIIQVPPLDEQVKIAKYLDGKISEINNLIIETQEQIDILQQYRQ 416 Query: 409 SFIAAAVTGQIDLR 422 S + VTG+ID+R Sbjct: 417 SLVYEVVTGKIDVR 430 >gi|237725172|ref|ZP_04555653.1| type I restriction-modification system [Bacteroides sp. D4] gi|229436438|gb|EEO46515.1| type I restriction-modification system [Bacteroides dorei 5_1_36/D4] Length = 423 Score = 236 bits (603), Expect = 4e-60, Method: Composition-based stats. Identities = 92/429 (21%), Positives = 174/429 (40%), Gaps = 21/429 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE--------SGKDIIYIGLEDV 56 K Y YKDSGV+WIG IP HW+ + I R + T+ S K + ++ D+ Sbjct: 3 KKYDAYKDSGVKWIGEIPNHWEAIKISRVHPIIGSGTTPLSSREDYYSEKGLNWLQTGDL 62 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 +G K + D + + ++ G + K + D + + ++ P Sbjct: 63 NNGLITETSKKITPKAVDECKMKFYPIHSVVIAMYGATIGKVGLLDIETATNQACCIIVP 122 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + I + + + G + I + +P+PPL+EQ I + Sbjct: 123 SKRICPKYTFYSFIIAKEELLLSSFGG-GQPNISQDIIRKLKVPVPPLSEQQSIASYLDV 181 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T +ID +I + + IE L E KQ+L++ VT+GLNP+ +KDSG+ W+G +P HW++ Sbjct: 182 KTEKIDKMIAKAEKKIEYLGELKQSLITRAVTRGLNPNTPLKDSGVNWIGNIPMHWDIAC 241 Query: 237 FFALVTELNRK----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + +N + N L L GN E + D +++ Sbjct: 242 LRFFLRLINGRAYSQNELLPSGKYKVLRVGNFFTNDSWY---YSNMELEPDKYCDKDDLL 298 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + I V+ ++ + + M Sbjct: 299 YAWSASVGPYI-----WNEAKTIYHYHIWKVQLATSMDKMYSYYLLRAVTNQKMSDMHGS 353 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + D+ + + +PP+ EQ I ++ + ++ID ++ ++ I L+E + S I Sbjct: 354 TMMHITMGDMNKTKIPIPPLSEQQQIATYLDTKCSKIDHIIATQKKKIAYLQELKQSLIT 413 Query: 413 AAVTGQIDL 421 VTG+I + Sbjct: 414 NVVTGKIKV 422 >gi|238918474|ref|YP_002931988.1| restriction modification system DNA specificity domain protein [Edwardsiella ictaluri 93-146] gi|238868042|gb|ACR67753.1| restriction modification system DNA specificity domain protein [Edwardsiella ictaluri 93-146] Length = 441 Score = 236 bits (603), Expect = 4e-60, Method: Composition-based stats. Identities = 110/442 (24%), Positives = 188/442 (42%), Gaps = 35/442 (7%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M +YKAYP+YKDSGV+W+G +P+ W + +K + G+ +S V++ Sbjct: 1 MANYKAYPEYKDSGVEWLGLVPESWTICRLKNLAAIKNGQDYKS-----------VQTDD 49 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 G P G+ Q ++ ++ K +L G+ G + I + T + + Sbjct: 50 G--YPVMGSGGQFTFASKFMYDKPSVLLGRKGTIDKPLYINEPFWTVDTMYYTELNEGFD 107 Query: 121 PELLQGWLLSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L L+I T H +E+ I + + ET Sbjct: 108 ARYLYYLALTIQFSRYSTNTALPSMTQEHLSNYKF----SVPKAESERKKITKFLDHETA 163 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +ID LI ++ + IELLKEK+ A++S+ VTKGLNPDV MKDSG+EW+G VP+HW + Sbjct: 164 KIDNLIEKQQQLIELLKEKRHAVISHAVTKGLNPDVPMKDSGVEWLGEVPEHWTISTLKH 223 Query: 240 LVTELNRKNTKLIESNILSLSYG------NIIQKLETRNMGLKPESYETYQIVDPG---- 289 ++ ++ + G I E S E + ++ G Sbjct: 224 HAKFIDGDRGSEYPNDNDLVDDGVVFLSSKNISNWEINIDDANYISREKFNRLNRGKAIN 283 Query: 290 -EIVFRFIDLQNDKRSLRSAQVM-----ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +++ + L + I + + ++ +L + + + Sbjct: 284 GDVIVKVRGSTGRIGELAIFETERLNKSTAFINAQMMIIRLKNSFNNRFLCNVAQGHYWM 343 Query: 344 KVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + G +Q L + ++VPPI EQ I + +E R D L++ I L Sbjct: 344 EQLNVGAYGTAQQQLNNAIFSGMIMVVPPIDEQLTINKFLELEIKRFDGLIKNTSNMIQL 403 Query: 403 LKERRSSFIAAAVTGQIDLRGE 424 ++ERR++ I+AAVTG+ID+R Sbjct: 404 IQERRTALISAAVTGKIDVRDW 425 >gi|251791801|ref|YP_003006522.1| restriction modification system DNA specificity domain-containing protein [Dickeya zeae Ech1591] gi|247540422|gb|ACT09043.1| restriction modification system DNA specificity domain protein [Dickeya zeae Ech1591] Length = 462 Score = 236 bits (602), Expect = 4e-60, Method: Composition-based stats. Identities = 111/460 (24%), Positives = 185/460 (40%), Gaps = 38/460 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGL 53 M + Y +YK+S V+W+G +P HW V +K ++ +G T + D I ++ Sbjct: 1 MMKQQTYSEYKESDVKWLGQVPVHWNAVSLKWISQRYSGGTPDKSNDAYWENGDIPWLNS 60 Query: 54 EDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQF 111 V G +S+ K ++ G C+ Sbjct: 61 GSVNDGYITEPSTYITREGFASSSAKWVPKNALVMALAGQGKTKGMVAQLGIRATCNQSM 120 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 + PK+ + + Q I + G + +G+IP P+ P EQ I Sbjct: 121 AAIIPKEKF-TPRFLYWWLVSNYQNIRNMAGGEQRDGLNLDMLGSIPCPLLPRPEQTAIA 179 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV----------TKGLNPDVKMKDSG 221 + + ET RID+L+ ++ + I LLKEK+ AL+S+IV GL P + K+S Sbjct: 180 DFLDRETGRIDSLMAKKRQLIALLKEKRCALISHIVTRGLPEAAADEFGLKPHTRFKNSD 239 Query: 222 IEWVGLVPDHWEVKPFF------------ALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 IEW+G VP+ W VK + E + K + I + +I Sbjct: 240 IEWLGQVPEGWGVKKVWIERVSRNIELQDGNHGEQHPKAEDYVGEGIPFVMANHIDNGKI 299 Query: 270 TRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 N E + + + G+++ ++ + + Sbjct: 300 DFNKCNYIEKEQADSLRIGFSNEGDVLLTHKGTIGRVGIVQKSHFPYVMLTPQVTYYRCL 359 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I + +L WLM+S + R + D K L L+P KEQF I ++ Sbjct: 360 REIQNRFLFWLMQSKFWQDQLKLLAGLGSTRAYIGLLDQKTLSFLIPSEKEQFAIATYLD 419 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ET+++D LVEK++ I L+E R++ I AAVTG+ID+R Sbjct: 420 RETSKLDRLVEKVDAVIARLQEYRTALITAAVTGKIDVRE 459 >gi|327479499|gb|AEA82809.1| restriction modification system DNA specificity domain protein [Pseudomonas stutzeri DSM 4166] Length = 491 Score = 236 bits (602), Expect = 5e-60, Method: Composition-based stats. Identities = 117/448 (26%), Positives = 186/448 (41%), Gaps = 26/448 (5%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDVE 57 ++ YP YKDSGV+W+G +P+HW + +KR T G + D+ I + D + Sbjct: 26 SNFPTYPAYKDSGVEWLGEVPEHWAIFSLKRSVDGCTNGLWGDEPDGENDLAVIRVADFD 85 Query: 58 SGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLG----PYLRKAII--ADFDGICST 109 T + R + G +L K G + ++ DF+ I S Sbjct: 86 RATCRVGLDKLTYRSITQKERASRLLQSGDLLIEKSGGGEKTLVGCVVLFEHDFEAITSN 145 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIE--AICEGATMSHADWKGIGNIPMPIPPLAEQ 167 ++P + ++ AI + + + D + P LAEQ Sbjct: 146 FVARMRPLHGFDSGFLCYSFDSLYQGKVNFPAIKQTTGIQNLDSESYLQERFCFPTLAEQ 205 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + ET RID LI E+ R IELLKEK+QA++S+ VTKGL+P V MKDSG+EW+G Sbjct: 206 TQIARFLDHETARIDALIEEQQRLIELLKEKRQAVISHAVTKGLDPTVPMKDSGVEWLGE 265 Query: 228 VPDHWEVKPFFALVTEL--------NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 VP HW V T ++ + + N L ES Sbjct: 266 VPAHWNVGTLRWYATIQGGVAKGKDYEGRETVVMPYLRVANVQNGYVDLAEVKEIAVLES 325 Query: 280 YETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM- 337 + G+++ D R ++ + + A++P+G+ Sbjct: 326 EVERYRLRAGDVLMNEGGDNDKLGRGTVWQAQIDPCLHQNHVFAIRPNGLLRAEWLAAFT 385 Query: 338 --RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + S S+ +V L + +P KEQ +I + + R + L Sbjct: 386 QAEQARTYFYLNSKQSTNLASISASNVMSLALPIPSEKEQLEILTYLEADRIRHEELTAV 445 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ++ LL+ERRS+ I+AAVTG+ID+RG Sbjct: 446 AVSTVELLQERRSALISAAVTGKIDVRG 473 >gi|119491619|ref|ZP_01623491.1| hypothetical protein L8106_03529 [Lyngbya sp. PCC 8106] gi|119453348|gb|EAW34512.1| hypothetical protein L8106_03529 [Lyngbya sp. PCC 8106] Length = 433 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 116/437 (26%), Positives = 186/437 (42%), Gaps = 19/437 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLE-DVESG 59 MK YK+Y K SGV+W+G IP+HW++ +K L G++ +S D Y + G Sbjct: 1 MKKYKSYSTDKPSGVEWLGNIPEHWELRKLKFIADLIMGQSPDS-TDYNYEEIGVPFLQG 59 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 T ++ + N R S S K +L P + GI ++PK Sbjct: 60 TAEFGIINPNPRLSCESAKKYARKDDLLLSVRAPVGEINVADQVYGI-GRGLCAIRPKIN 118 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + G+ + N+ PPL EQ LI + ET Sbjct: 119 VFNKTFTRYFLEIGKVELVSGATGSIYDAVTVNQVANLQCLTPPLKEQKLIATFLDRETT 178 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 RIDTLIT++ I LL++K+ A+++ VTKGL P++ MKDSG+EW+G VP +WEVK Sbjct: 179 RIDTLITKKCELINLLEKKRTAIITNAVTKGLEPELPMKDSGVEWLGKVPRNWEVKKLKY 238 Query: 240 LVTELNRK-------NTKLIESNILSLSYGNII---QKLETRNMGLKPESYETYQIVDPG 289 + + K + + + N + G+I + + + L + G Sbjct: 239 IAQIVRGKFTHRPRNDPRFYDGNYPFIQTGDISAANKYITSYQQTLNELGLSVSKEFPKG 298 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +V D L + + P +L + + + ++ Sbjct: 299 TLVMTIAANIGDLAIL-----DFPACFPDSIVGFLPRNYCLDFLYYNLTAMK-SEMVKTA 352 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + +L E + L + PPI Q I ++ RID L++K SI L + R S Sbjct: 353 TLNTQMNLNIERIGGLFSICPPIAIQKQIATYLDKVNIRIDELIDKTATSISELTKYRQS 412 Query: 410 FIAAAVTGQIDLRGESQ 426 I AAVTG+ID+R E + Sbjct: 413 LITAAVTGKIDVREEVE 429 >gi|212690633|ref|ZP_03298761.1| hypothetical protein BACDOR_00120 [Bacteroides dorei DSM 17855] gi|212666733|gb|EEB27305.1| hypothetical protein BACDOR_00120 [Bacteroides dorei DSM 17855] Length = 423 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 92/429 (21%), Positives = 174/429 (40%), Gaps = 21/429 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE--------SGKDIIYIGLEDV 56 K Y YKDSGV+WIG IP HW+ + I R + T+ S K + ++ D+ Sbjct: 3 KKYDAYKDSGVKWIGEIPNHWEAIKISRVHPIIGSGTTPLSSREDYYSEKGLNWLQTGDL 62 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 +G K + D + + ++ G + K + D + + ++ P Sbjct: 63 NNGLITETSKKITPKAVDECKMKFYPIHSVVIAMYGATIGKVGLLDIETATNQACCIIVP 122 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + I + + + G + I + +P+PPL+EQ I + Sbjct: 123 SKRICPKYTFYSFIIAKEELLLSSFGG-GQPNISQDIIRKLKVPVPPLSEQQSIASYVDV 181 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T +ID +I + + IE L E KQ+L++ VT+GLNP+ +KDSG+ W+G +P HW++ Sbjct: 182 KTEKIDKMIAKAEKKIEYLGELKQSLITRAVTRGLNPNTPLKDSGVNWIGNIPMHWDIAC 241 Query: 237 FFALVTELNRK----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + +N + N L L GN E + D +++ Sbjct: 242 LRFFLRLINGRAYSQNELLPSGKYKVLRVGNFFTNDSWY---YSNMELEPDKYCDKDDLL 298 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + I V+ ++ + + M Sbjct: 299 YAWSASVGPYI-----WNEAKTIYHYHIWKVQLATSMDKMYSYYLLRAVTNQKMSDMHGS 353 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + D+ + + +PP+ EQ I ++ + ++ID ++ ++ I L+E + S I Sbjct: 354 TMMHITMGDMNKTKIPIPPLSEQQQIATYLDTKCSKIDHIIATQKKKIAYLQELKQSLIT 413 Query: 413 AAVTGQIDL 421 VTG+I + Sbjct: 414 NVVTGKIKV 422 >gi|325981608|ref|YP_004294010.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] gi|325531127|gb|ADZ25848.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] Length = 467 Score = 235 bits (598), Expect = 1e-59, Method: Composition-based stats. Identities = 116/453 (25%), Positives = 199/453 (43%), Gaps = 30/453 (6%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIK--RFTKLNTGRTSESGKDI-----IYIGLED 55 Y+AYP+YK+SGV+WIG P +W + +K + K G +D I D Sbjct: 11 KYQAYPEYKNSGVEWIGEYPLNWNLTRVKFESYVKARVGWHGLKSEDFTDEGPFLITGSD 70 Query: 56 VESGTGKYLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFL 112 + + G +L K G + A+++ G ++ Sbjct: 71 FRGPVINWNECYHCDLARYEQDPYIQLKDGDLLITKDGTIGKVALVSGLAGKATLNSGVF 130 Query: 113 VLQP--KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 V++P + L + T ++ G+T+ H N IP EQ+ I Sbjct: 131 VVRPLTNNYTSRFYFWLLQASVFTGFVDFNKTGSTIVHLYQDTFVNFKYAIPSFNEQLTI 190 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + ET +IDTLI ++ + I+LLKEK+QA++S+ VTKGLNP+ KM+DSG+EW+G VP+ Sbjct: 191 ANFLDHETAKIDTLIEKQQQLIKLLKEKRQAVISHAVTKGLNPNAKMRDSGVEWLGEVPE 250 Query: 231 HWEVKPFFALVTELNRKNT------------KLIESNILSLSYGNIIQKLETRNMGLKPE 278 HW +K V E +R + +L + + + ++ Q R + Sbjct: 251 HWSMKIKLVSVAEGSRGSFVNGPFGSDLLSLELQDVGVPVIYIRDLKQTGYMRKSAVCVT 310 Query: 279 SYETY----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYL 333 + V G+++ + + + I + V I+ YL Sbjct: 311 EEKARQLEICKVVSGDVLIAKVGDPPGEACIYPENEPAAIITQDVIRIRVNRGVINPYYL 370 Query: 334 AWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 L+ S V + R+ + D K++ ++P + EQ DI + + + +ID L Sbjct: 371 VMLLNSDLGKVVVDNISIESTRKRISLGDFKQVRFIIPSLSEQSDIVSFVELRCRKIDTL 430 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + K + + L+ ERR++ I+AAVTG+ID+R Sbjct: 431 IAKAQSMVSLIIERRTALISAAVTGKIDVRDWQ 463 >gi|167917951|ref|ZP_02505042.1| probable type I restriction-modification system [Burkholderia pseudomallei BCC215] Length = 442 Score = 233 bits (595), Expect = 3e-59, Method: Composition-based stats. Identities = 111/439 (25%), Positives = 185/439 (42%), Gaps = 17/439 (3%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M YPQYKDSG W+G +P W VV +R + G + + + Sbjct: 1 MS-LPGYPQYKDSGASWLGRVPTSWAVVQARRLFEQRRDAALP-GDEQLSASQKYGVVPQ 58 Query: 61 GKYLPKDGNSRQSDTS---TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 ++ + S + L + + F G S + VL+ Sbjct: 59 RLFMELEDQKVVLALSGLENFKHVEPNDFVIS-LRSFQGGIEHSAFGGCVSPAYTVLRAT 117 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMS--HADWKGIGNIPMPIPPLAEQVLIREKII 175 + +LL D + + G + +P+P + EQ I + Sbjct: 118 SKIAPDFWAYLLKSDTYISALQTVTDGIRDGKNISYMQFGALCVPVPNIDEQSAIAAFLD 177 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 ET +ID LI E+ + I LL EK+QA +SY VT+GLNPD MKDSG+ W+G VP HW ++ Sbjct: 178 CETGKIDALIAEQEKLIALLAEKRQAALSYAVTRGLNPDAPMKDSGVAWLGEVPAHWVIR 237 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLET---------RNMGLKPESYETYQIV 286 ++ + E S L + + ++ ++ + Sbjct: 238 RVKSVSVFMTSGPRGWSERISDEGSIFVQSGDLNDFLGVEFEIAKRVSVEFDAEAERTRL 297 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G++V + K ++ ++ + + + +L ++S F Sbjct: 298 ANGDVVVCITGAKTGKVAVCASVPEPAYVNQHLCLIRPSPDVLPLFLGNSLKSTIGQTQF 357 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 GL+Q L ++V+ +++PP EQ +I I+ ETAR+D L + ++I LLKER Sbjct: 358 ELSQYGLKQGLSLDNVREALIVLPPPGEQVEIVTFIDAETARLDELKAEAARAIELLKER 417 Query: 407 RSSFIAAAVTGQIDLRGES 425 RS+ IAAAVTG+ID+R + Sbjct: 418 RSALIAAAVTGKIDVRNAA 436 >gi|120601537|ref|YP_965937.1| restriction modification system DNA specificity subunit [Desulfovibrio vulgaris DP4] gi|120561766|gb|ABM27510.1| restriction modification system DNA specificity domain [Desulfovibrio vulgaris DP4] Length = 438 Score = 233 bits (594), Expect = 4e-59, Method: Composition-based stats. Identities = 111/438 (25%), Positives = 185/438 (42%), Gaps = 24/438 (5%) Query: 4 YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGK 62 + AYP+YKDSGV+W+G IP HW V + + +++ + + K Sbjct: 3 FPAYPEYKDSGVEWLGKIPSHWSVTSLYSLASECDFPNKDMLESNLLSLSYGRI---IRK 59 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKD 118 + + T I G I+ ++ + GI ++ + ++P Sbjct: 60 DINSNDGLLPESFETYQIVDHGDIVLRLTDLQNDQRSLRSGLVKERGIITSAYTAIRPTA 119 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L L + D + ++ G S + + +P+ P +EQ I + ET Sbjct: 120 SHYSYLAYLLRAYDTLKIFYSMGGGLRQSM-KFSDLRRLPILKPAYSEQSAIAVFLDHET 178 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +ID LITE+ + IELLKEK+QA++S+ VTKGL P+V MKDSG+EW+G VP+HW+V Sbjct: 179 AKIDALITEQEKLIELLKEKRQAVISHAVTKGLAPNVPMKDSGVEWLGEVPEHWKVAKLR 238 Query: 239 ALVTELNRKN-----------TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 V + + G I ++ + + ++ Sbjct: 239 RFVRAVQTGSTPSASPPNTDIEDGTYWFTPGDFSGPIRLGSSSKKVPPEAIKQGEVKVFP 298 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 G + I K I + P+ S ++ Sbjct: 299 AGAVFVVSIGATLGKIGYLLTLASANQQINAII----PNADVEGLFLAYSLSSKTSEMMN 354 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + + E K + + VPP+ EQ IT ++ + D LV + +++I LLKERR Sbjct: 355 LSNASTIGIMNQEKTKEIWLTVPPLCEQERITKFLDEDCVTSDALVNESQRAIDLLKERR 414 Query: 408 SSFIAAAVTGQIDLRGES 425 S+ I+AAVTG+ID+RG + Sbjct: 415 SALISAAVTGKIDVRGFA 432 >gi|261345477|ref|ZP_05973121.1| putative type I restriction-modification system specificity subunit [Providencia rustigianii DSM 4541] gi|282566524|gb|EFB72059.1| putative type I restriction-modification system specificity subunit [Providencia rustigianii DSM 4541] Length = 435 Score = 233 bits (593), Expect = 5e-59, Method: Composition-based stats. Identities = 110/429 (25%), Positives = 181/429 (42%), Gaps = 14/429 (3%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 Y QY DSG +WIG IP HW + + + + + + V Sbjct: 8 PKYDQYIDSGYEWIGEIPLHWDLGKLGSCLFPVSVKNCPELPLLSITREQGVIERDVDDQ 67 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + N D S KGQ K+ + ++ F GI S + V + Sbjct: 68 ESNHNFIPDDLSGYKKLEKGQFGMNKMKAWQGSYGVSKFTGIVSPAYFVFDFTKAINPEF 127 Query: 125 QGWLLSIDVTQRIEAICEG---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 W + + + IP +P EQ LI + +T I Sbjct: 128 FNWAIRSKLYVSFFGSASDGVRIGQWDLSKTRMKVIPFVLPSEEEQSLIANFLDKKTALI 187 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 D I+ + + I LLKE+KQ ++ VT+GL+P+V MKDSG++W+G +P HWEVK V Sbjct: 188 DEAISIKEQQISLLKERKQIIIQQAVTQGLDPNVPMKDSGVDWIGKIPAHWEVKRL-KYV 246 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 T++ ++ ++LS++ I K G Y YQIV G+ +DL Sbjct: 247 TKILKRIIGYEGPDVLSITQKGIKVKDIESGEGQLSMDYSKYQIVRVGDFAMNHMDLLTG 306 Query: 302 KRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQ 355 + + G+++ Y +G+ +L + + K+FY G G+ R Sbjct: 307 YVDISQFE----GVVSPDYRVFINTYNGLRDDFLLSIFQLGYQQKIFYRYGQGVSLLGRW 362 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + VPPI+EQ +I + E ++D +E + I LKE +++ I +AV Sbjct: 363 RFPADNFNNFFIPVPPIEEQAEIVQSVQREWLKLDNAIELLISQIEKLKEYKTTLINSAV 422 Query: 416 TGQIDLRGE 424 TG+I + E Sbjct: 423 TGKIKITPE 431 >gi|86153318|ref|ZP_01071522.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni HB93-13] gi|121613222|ref|YP_001000445.1| type I restriction modification DNA specificity domain-containing protein [Campylobacter jejuni subsp. jejuni 81-176] gi|167005388|ref|ZP_02271146.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni 81-176] gi|57790397|gb|AAW56129.1| Cj81-057 [Campylobacter jejuni subsp. jejuni 81-176] gi|85843044|gb|EAQ60255.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni HB93-13] gi|87249367|gb|EAQ72327.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni 81-176] gi|107770374|gb|ABF83711.1| putative type I restriction-modification system HsdS subunit [Campylobacter jejuni subsp. jejuni 81-176] Length = 422 Score = 232 bits (592), Expect = 8e-59, Method: Composition-based stats. Identities = 95/431 (22%), Positives = 195/431 (45%), Gaps = 24/431 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 MK++ KDSG++W+G IP+HWK++ K F L + + + L + Sbjct: 1 MKNF------KDSGIEWLGEIPEHWKLIKCKNFFVLKSIPIGDLWNKTKLLSLT-LNGVI 53 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYG--KLGPYLRKAIIADFDGICSTQFLVLQPKD 118 + + SD ST I +G +++ + R ++ +G+ ++ + + + K+ Sbjct: 54 ERDINNPEGKFPSDFSTYQIVKEGDLIFCLFDVAETPRTIGLSKLNGMITSAYTIFEIKN 113 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L+ + + +D + ++ + G + + + N+ +P+PPL EQ I + + Sbjct: 114 QEKRFLEYFFIDLDNRKNLKFLYRGL-RNTISKEDLLNLKIPLPPLKEQEQIANFLDEKC 172 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +I I ++ + I LLKE+KQA ++ TKGL+ +V KDSGIE++G +P HW++ Sbjct: 173 EQIKNFIEKKEKLITLLKEQKQAFINKATTKGLDKNVNFKDSGIEYLGEIPQHWKLVRLG 232 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP----------ESYETYQIVDP 288 ++ + I + + LK + Y +I D Sbjct: 233 LILKTSSGTTPDSGNDKYYKGGQIVWINSGDLNDGFLKDSKRKITQDALDDYSVLKIFDK 292 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 ++ K ++ + A ++ +T+ + + + ++ Sbjct: 293 DSLIIAMYGATIGKTAILKV----NACVNQACCVLEKSAWYNTFYLFYLFNRYKKELISM 348 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G + ++ + +K L + +PP+KEQ I N ++ + +ID+L+EK E+ I L+KE ++ Sbjct: 349 GSGGGQPNISQDIIKNLKIPLPPLKEQEQIANFLDEKCKKIDLLIEKTEKQIKLIKEYKT 408 Query: 409 SFIAAAVTGQI 419 + AV G+I Sbjct: 409 TLTNQAVCGRI 419 >gi|326201377|ref|ZP_08191249.1| hypothetical protein Cpap_4212 [Clostridium papyrosolvens DSM 2782] gi|325988945|gb|EGD49769.1| hypothetical protein Cpap_4212 [Clostridium papyrosolvens DSM 2782] Length = 631 Score = 232 bits (591), Expect = 9e-59, Method: Composition-based stats. Identities = 153/412 (37%), Positives = 231/412 (56%), Gaps = 18/412 (4%) Query: 28 VPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS------DTST 77 + +K K G + I + V S G L + DTS+ Sbjct: 6 IKLKYLFKFGKGLSITKENLSETGIPCVSYGQVHSKYGVILDMSKHVLPFVSESYLDTSS 65 Query: 78 VSIFAKGQILYG-----KLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLS 130 ++ KG ++ K G +++D ++ +P KDV + S Sbjct: 66 QALIKKGDFVFADTSEDKGGSGNFTCLVSDSSIFAGYHTVIARPVSKDVFYKYFAYLFDS 125 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +I+ G + + N P + Q++I + +T +ID++I ++ + Sbjct: 126 QNFRAQIQQAVSGIKVFTISQGTLKNTIASFPNIDAQIVIANYLDRKTTQIDSIIADKEK 185 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 IELLKEK+QA++S VT+GL+P V MKDSG++W+G +P+HWEVKP F + E KN+ Sbjct: 186 LIELLKEKRQAIISEAVTRGLDPSVPMKDSGVDWIGQIPEHWEVKPLFTVAFENKAKNSG 245 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 N+LSLSYG I++K N GL PES+ETYQIV+ G + R DLQNDKRSLRS V Sbjct: 246 NQCVNLLSLSYGKIVKKDIDTNFGLLPESFETYQIVEGGYTILRLTDLQNDKRSLRSGFV 305 Query: 311 MERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 E+GIITSAY+ + P +D +L+ L+ +YDL K+FY++G+G+RQS+ ++D+KRLP+L+ Sbjct: 306 REKGIITSAYVGLIPSDEVDGLFLSDLLHAYDLMKIFYSLGNGVRQSMNYKDLKRLPILL 365 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 PP EQ I+N + +TA ID L+ EQ + L KE R S I+ AVTG+I + Sbjct: 366 PPKSEQKQISNYLRNKTAEIDDLISTTEQQVSLFKEYRQSIISEAVTGKIKV 417 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 48/207 (23%), Positives = 85/207 (41%), Gaps = 8/207 (3%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGK----YL 64 KDSGV WIG IP+HW+V P+ N + + +++ + + L Sbjct: 212 MKDSGVDWIGQIPEHWEVKPLFTVAFENKAKNSGNQCVNLLSLSYGKIVKKDIDTNFGLL 271 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 P+ + Q +I + K GI ++ ++ L P D + L Sbjct: 272 PESFETYQIVEGGYTILRLTDLQNDKRSLRSGFV---REKGIITSAYVGLIPSDEVDGLF 328 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 LL +I ++K + +P+ +PP +EQ I + +T ID L Sbjct: 329 LSDLLHAYDLMKIFYSLGNGVRQSMNYKDLKRLPILLPPKSEQKQISNYLRNKTAEIDDL 388 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGL 211 I+ + + L KE +Q+++S VT + Sbjct: 389 ISTTEQQVSLFKEYRQSIISEAVTGKI 415 >gi|145642021|ref|ZP_01797593.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae R3021] gi|145273292|gb|EDK13166.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae 22.4-21] Length = 411 Score = 232 bits (591), Expect = 9e-59, Method: Composition-based stats. Identities = 135/413 (32%), Positives = 208/413 (50%), Gaps = 8/413 (1%) Query: 15 VQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 ++W+ IP HW + K K + + +D I D + +G + Sbjct: 1 MEWLRQIPSHWDMQRSKFIFKKVERKV--NEEDQIVTCFRDGQVTLRANRRTEGFTNALK 58 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSID 132 KG ++ + + I+D DG + + V P K + + L Sbjct: 59 EHGYQGIRKGDLVIHAMDAFTGAIGISDSDGKATPVYSVCLPHNKQKIDVYFYAYYLRNL 118 Query: 133 VTQRIEAICEGATMSH---ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + +PIPP EQ I + + +T +ID + Sbjct: 119 ALSGFISSLAKGIRERSTDFRYADFAELLLPIPPYLEQQQIAQFLDDKTAKIDRAVDLAE 178 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HWE+ + E R N Sbjct: 179 KQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWELTIGMNVFRENKRDNK 238 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + E+ +LSLSYG II K E + GL PES+ETYQIV+P +I+ R DLQND+ SLR+ Sbjct: 239 GMKENTVLSLSYGKIIIKPEEKLFGLVPESFETYQIVEPNDIIIRCTDLQNDQTSLRTGL 298 Query: 310 VMERGIITSAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 ++GIITSAY+ + + +L + + + D+ KV Y GSGLRQ+L F D KRLP++ Sbjct: 299 AQDKGIITSAYLNLKVINNYSAKFLHYYLHALDITKVLYKFGSGLRQNLSFLDFKRLPII 358 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + EQ I + ++ +T++ID ++ I LKE +S I VTG++ + Sbjct: 359 DISLAEQQQIADYLDKQTSKIDQVIALKTAHIEKLKEYKSVLINDVVTGKVRV 411 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 44/201 (21%), Positives = 79/201 (39%), Gaps = 6/201 (2%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 KDSGV+WIG +P+HW++ + N R ++ K+ + L + K K Sbjct: 207 KDSGVEWIGQVPEHWELTIGMNVFRENK-RDNKGMKENTVLSLSYGKI-IIKPEEKLFGL 264 Query: 71 RQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 T I I+ + +A GI ++ +L L+ + Sbjct: 265 VPESFETYQIVEPNDIIIRCTDLQNDQTSLRTGLAQDKGIITSAYLNLKVINNYSAKFLH 324 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + L ++ + + +P+ LAEQ I + + +T +ID +I Sbjct: 325 YYLHALDITKVLYKFGSGLRQNLSFLDFKRLPIIDISLAEQQQIADYLDKQTSKIDQVIA 384 Query: 187 ERIRFIELLKEKKQALVSYIV 207 + IE LKE K L++ +V Sbjct: 385 LKTAHIEKLKEYKSVLINDVV 405 >gi|302345454|ref|YP_003813807.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] gi|302149142|gb|ADK95404.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] Length = 428 Score = 231 bits (590), Expect = 1e-58, Method: Composition-based stats. Identities = 115/437 (26%), Positives = 178/437 (40%), Gaps = 27/437 (6%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKR-FTKLNTGRTSESGK----DIIYIGLEDVESG 59 K Y +YKDS VQW+G +P HW IK +G + K D++ + D + Sbjct: 2 KRYGKYKDSAVQWLGKVPSHWNYSRIKFGLKSSFSGVWGDDEKGDDNDVVCYRVADFDYK 61 Query: 60 TGKYLPKDGNSRQSDTSTVSI--FAKGQILYGKLG-----PYLRKAIIA-DFDGICSTQF 111 G + R D T IL K G P R I D CS Sbjct: 62 NGGLSEEKITIRNIDEKTFKEREILPNDILIEKSGGGDVNPVGRAVIANLDHKATCSNFI 121 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAI---CEGATMSHADWKGIGNIPMPIPPLAEQV 168 ++ + + + + + + + + + M +PPL+EQ Sbjct: 122 HCVRCNENVLNTRLLYYFFYSIYVQKVNLLFFNQTTGIQNLKVPEYLGQVMFLPPLSEQQ 181 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + A+T ID +I +R + I LL+E K A++S VTKGLNP+ KMKDSGIEW+G V Sbjct: 182 SIASFLDAKTKPIDDIIAKREQQIALLEEMKSAIISRAVTKGLNPEAKMKDSGIEWIGEV 241 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P++W + F L + + E P+ + + Sbjct: 242 PENWNLLRFRLLCRISTGDSD-----------TQDAEPDGEYPFYVRSPQVERSSKFTCE 290 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G+ + D R +DS YL MR ++ Sbjct: 291 GDAILMAGDGAGAGRVFHHVDGKYAVHQRVYIFNQFNKVVDSNYLYQFMRIMFPQRMNMG 350 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 S++ ++ V +P I EQ IT+ ++ ETA+IDV ++K + I LL+E + Sbjct: 351 SAQSTVPSVRLHMIQNFVVPIPSIDEQRTITSYLDTETAKIDVRIDKRRKQIALLQEYKQ 410 Query: 409 SFIAAAVTGQIDLRGES 425 + I AVTG+ID+RG S Sbjct: 411 ALITDAVTGKIDVRGFS 427 >gi|283787023|ref|YP_003366888.1| Type I restriction-modification system, specificity (S) subunit [Citrobacter rodentium ICC168] gi|282950477|emb|CBG90140.1| putative Type I restriction-modification system, specificity (S) subunit [Citrobacter rodentium ICC168] Length = 446 Score = 231 bits (590), Expect = 1e-58, Method: Composition-based stats. Identities = 118/439 (26%), Positives = 196/439 (44%), Gaps = 24/439 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLE 54 M YKAYP+YKDSGV+W+G IP HWK++ K G+ + + Y+ +E Sbjct: 1 MAKYKAYPEYKDSGVEWLGEIPIHWKMLRHKYVAFFTKGKNPTNLLEQPLKNTLPYLSME 60 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL 114 + + T ++ V + +GQ L G + + GI S+ Sbjct: 61 CLRNNTTD-------KYALISNDVRVALEGQPLVIWDGSNAGE-FLKGKSGILSSTMAAA 112 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 L +L I + + G + H + + +I IP + EQ + + + Sbjct: 113 TLIYPLHSQYYWYLC-ISIEPEMRKNAVGMGIPHVNGDELRSISFGIPSIYEQKQVADFL 171 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 ET +ID LI ++ + IELLKEK+QA++S+ VTKGLNPDV MKDSG+EW+G VP+HW V Sbjct: 172 DHETAKIDNLIEKQQQLIELLKEKRQAVISHAVTKGLNPDVPMKDSGVEWLGDVPEHWRV 231 Query: 235 KPFFALVTELNR------KNTKLIESNILSLSYGNII--QKLETRNMGLKPESYETYQIV 286 + K I NI +S + ++++ S Sbjct: 232 SRIKNYAKIESGHTPSRTKPEYWISCNIPWVSLNDSKQLKEIDYIEDTFYKISELGMANS 291 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 + R + D SA + ++ +A L+ Y + K F Sbjct: 292 SAHLLPARAVVFTRDASIGLSAITTKSMAVSQHLIAWICDEKFIIPEFLLLVFYAMEKEF 351 Query: 347 YAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 +++ ++V+ L PP++EQ ++ + + +I + K+E + LL+E Sbjct: 352 ERYTFGATIKTIGMDNVRGLKSTFPPVEEQRNLIDWAFSKIEKIKSSINKVEDMLSLLQE 411 Query: 406 RRSSFIAAAVTGQIDLRGE 424 RR++ I+AAVTG+ID+R Sbjct: 412 RRTALISAAVTGKIDVRDW 430 >gi|329115021|ref|ZP_08243776.1| Type-1 restriction enzyme StySJI specificity protein [Acetobacter pomorum DM001] gi|326695464|gb|EGE47150.1| Type-1 restriction enzyme StySJI specificity protein [Acetobacter pomorum DM001] Length = 434 Score = 231 bits (589), Expect = 1e-58, Method: Composition-based stats. Identities = 114/438 (26%), Positives = 185/438 (42%), Gaps = 28/438 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M + YP YK+SGV+WIG IP W + P++ G+ K+ D+ Sbjct: 1 MS-FPKYPAYKNSGVEWIGEIPVGWIISPLRYLAHCLDGKRIPLNKEERSYKKGDI---- 55 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQ 115 P G + D +F + IL G+ G + + + VL+ Sbjct: 56 ----PYWGANCIVDFVDEFLFNQELILLGEDGAPFFDKTKEVSFYINEPIWPNNHVHVLK 111 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + + L+ + EG+T + I +PIPPL EQ I + Sbjct: 112 VFENFSPKFLVYSLNCV---EYSSYIEGSTRDKLTQNNMNRIVVPIPPLPEQQAIASFLD 168 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 E +ID LI E+ R I LL EK+QA++S+ VTKGLNP+ MK+SGI W+G+VP+ W+ Sbjct: 169 RECGKIDALIAEQERLIALLAEKRQAVISHAVTKGLNPNAPMKESGIPWIGMVPEGWDCS 228 Query: 236 PFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + K E + L + + + Y + Sbjct: 229 RLRFVAQFNPSKTEISYIPLNEEVSFLPMEAIRDDGTINLEQKRKISDVQNGYTYFRDMD 288 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHGIDSTYLAWLMRSYDLCK-- 344 IVF I + + + RGI P + YL +S K Sbjct: 289 IVFAKITPCFENGKGAVVKKLLRGIGFGTTELIVARSVPSRVIPEYLFRFFQSDIFRKPA 348 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 G+G ++ + V+ V +PP+ +Q I + +++ ++ID L+ + + + L K Sbjct: 349 EASMYGAGGQKRVSERFVRDFSVYLPPLPDQQAIASFLDLTCSKIDTLIAEQKTMLTLCK 408 Query: 405 ERRSSFIAAAVTGQIDLR 422 ERR++ I+AAVTG+ID+R Sbjct: 409 ERRAALISAAVTGKIDVR 426 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 49/222 (22%), Positives = 94/222 (42%), Gaps = 14/222 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLP 65 K+SG+ WIG +P+ W ++ + N +T +++ ++ +E + G Sbjct: 210 MKESGIPWIGMVPEGWDCSRLRFVAQFNPSKTEISYIPLNEEVSFLPMEAIR-DDGTINL 268 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI------IADFDGICSTQFLVLQ--PK 117 + + + F I++ K+ P + G +T+ +V + P Sbjct: 269 EQKRKISDVQNGYTYFRDMDIVFAKITPCFENGKGAVVKKLLRGIGFGTTELIVARSVPS 328 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 V+PE L + S + EA GA + + + + +PPL +Q I + Sbjct: 329 RVIPEYLFRFFQSDIFRKPAEASMYGAGGQKRVSERFVRDFSVYLPPLPDQQAIASFLDL 388 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 +IDTLI E+ + L KE++ AL+S VT ++ + K Sbjct: 389 TCSKIDTLIAEQKTMLTLCKERRAALISAAVTGKIDVRAQNK 430 >gi|304315216|ref|YP_003850363.1| type I restriction-modification enzyme, subunit S [Methanothermobacter marburgensis str. Marburg] gi|302588675|gb|ADL59050.1| predicted type I restriction-modification enzyme, subunit S [Methanothermobacter marburgensis str. Marburg] Length = 435 Score = 231 bits (589), Expect = 2e-58, Method: Composition-based stats. Identities = 101/439 (23%), Positives = 188/439 (42%), Gaps = 30/439 (6%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLED 55 + K YP+YKDSGV+WIG IP W V K K G+ + SG + Y+ ++ Sbjct: 1 MNLKPYPEYKDSGVEWIGEIPCGWNVHRFKIHFKYIKGKVPKDLRETPSGDSLPYLTMDY 60 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 + K D + G +L G + + + ST ++ Sbjct: 61 LRGRESKVFYCDSDG------GAVRVNDGDLLLLWDGSNAGEFLEGKDGYLSSTMVKLIV 114 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + L L ++ + G + H + I +P P L EQ I + Sbjct: 115 SEMDL---GYSKYLCKAFEPLLKDLTTGMGIPHVKDNVLATIRIPYPSLEEQRKIASFLD 171 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 ++ +ID I + R I+LL+EK+ AL++ VTKGLNP+VKMK SG++W+G +P +WE++ Sbjct: 172 SKISKIDLTIEKYTRLIDLLQEKRNALINQAVTKGLNPNVKMKYSGVKWIGEIPQNWELR 231 Query: 236 PFFALVTELNRKN-------TKLIESNILSLSYGNIIQKLETRNMGLKPE----SYETYQ 284 + + I + G++ + + Y + Sbjct: 232 KISRSFEIIGSGTTPKSQDGSYYNRGTIPWVITGDLNDSILNETSKRITKKALRDYSALK 291 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 I ++ K SL ++ + + + + +D ++ + S Sbjct: 292 IYKKNSLIVAMYGATIGKISL---LNIDACVNQACCVLSNSNILDIKFVFYWFFSNR-DN 347 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + G + ++ +K L + VPP+KEQ I + ++ +++I++ +KI++++ LLK Sbjct: 348 IISLSDGGGQPNISQHVIKNLRIQVPPLKEQKIIVSYLDQNSSKINLTTKKIQKNVDLLK 407 Query: 405 ERRSSFIAAAVTGQIDLRG 423 E + S I VTG++D++ Sbjct: 408 EYKKSLIYHLVTGKVDVKE 426 >gi|261211183|ref|ZP_05925472.1| possible type I restriction-modification system S subunit [Vibrio sp. RC341] gi|260839684|gb|EEX66295.1| possible type I restriction-modification system S subunit [Vibrio sp. RC341] Length = 469 Score = 231 bits (589), Expect = 2e-58, Method: Composition-based stats. Identities = 112/462 (24%), Positives = 197/462 (42%), Gaps = 43/462 (9%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDV 56 M Y+AYP+YKDS + W+ IP HW ++ G T I + +V Sbjct: 1 MSKYQAYPEYKDSEIDWLETIPAHWLTSKLRYTFSFGKGLTITKENLRDTGIPCVSYGEV 60 Query: 57 ESGTGKYLP------KDGNSRQSDTSTVSIFAKGQILYGKL-----GPYLRKAIIADFDG 105 S G + K TS ++ KG I++ G ++++ Sbjct: 61 HSKYGFEIDPARHPLKCVGDDYLKTSPYALLKKGDIVFADTSEDIDGSGNFTQLVSNEQV 120 Query: 106 ICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 ++ +P + +L S ++ +I +G + + + + +PPL Sbjct: 121 FAGYHTIIARPYNHECSRFYAYLLDSKELRTQIRHAVKGVKVFSITQAILRGVNIWLPPL 180 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 E+ I + ET +IDTLI ++ + I+LLKEK+QA+VS+ VTKGLNP MKDSG+EW Sbjct: 181 KERNQIANFLDHETAKIDTLIEKQQQLIKLLKEKRQAVVSHAVTKGLNPQAPMKDSGVEW 240 Query: 225 VGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLKPE--- 278 +G VP+HW + P V +N + + + + GNI K + P+ Sbjct: 241 LGEVPEHWSISPLKHHVNTVNGFGFSSNNFQDEGVPFIRAGNIKNKTIVKPDIHLPQAVV 300 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 I++ GE+V + ++++ V + G++ + P+ + Sbjct: 301 DKYQRVILNDGELVISMVGSD---PKIKASAVGQVGLVPPSLAGSVPNQNVVILRE---Q 354 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKR---------------LPVLVPPIKEQFDITNVIN 383 S L K + + G + P + EQ +I + ++ Sbjct: 355 SSLLKKFLFYVVCGTPYRHHLDVFSHKLANQSIISSSLIICAQFTFPELDEQKEIVDFLD 414 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + + D L+EK +SI + ER+++ I+A VTG+ID+R Sbjct: 415 TQLRKYDWLMEKATRSIEFMNERKTALISATVTGKIDVRNWQ 456 >gi|56421440|ref|YP_148758.1| type I restriction-modification system specificity determinant [Geobacillus kaustophilus HTA426] gi|56381282|dbj|BAD77190.1| type I restriction-modification system specificity determinant [Geobacillus kaustophilus HTA426] Length = 438 Score = 231 bits (589), Expect = 2e-58, Method: Composition-based stats. Identities = 110/438 (25%), Positives = 191/438 (43%), Gaps = 19/438 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTS---------ESGKDIIYI 51 M + K YP+YKDSGV+W+ +P W+V+ IKR T++ G + + + ++ Sbjct: 1 MVNLKKYPKYKDSGVEWLREVPSEWQVLQIKRLTRVRRGASPRPIDDPIYFDDNGEYSWV 60 Query: 52 GLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF 111 + DV + +S G+ L+ + + K I + F Sbjct: 61 RISDVTKSNMYLEETEQKLSNLGSSLSVKLEPGE-LFLSIAATVGKPCITNVKCCIYDGF 119 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 V P + ++ + + + T + + +G+I + +P + EQ +I Sbjct: 120 -VYFPDYRGDKRFLYYIFEAG--EAYRGLGKLGTQLNLNTDTVGSIYIAVPTIQEQKMIS 176 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + + + ID+LI ++ + IELL+EK+Q +++ VTKGLNP+VKMKDSG+EW+G +P+ Sbjct: 177 DFLDEKVHEIDSLIADKEKLIELLEEKRQVIITEAVTKGLNPNVKMKDSGVEWIGEMPES 236 Query: 232 WEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQKL-ETRNMGLKPESYETYQIV 286 WEV + +E + +S N ++ K +I+ Sbjct: 237 WEVSKIKYQADINKYTLSENTDEDLEIKYIDISSVNSRGEVVNIEKYYFKDAPSRARRIL 296 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G+ + + + T + I YL +LMRS Sbjct: 297 RKGDTIISTVRTYLKAITWFEEVEENLICSTGFAVLSPKETIYPKYLFYLMRSTKYIDEI 356 Query: 347 YAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G+ ++ ++ + L+P I EQ I I+ E +ID LV++I+ I LKE Sbjct: 357 VKRSIGVSYPAITSTEIGMMECLLPNINEQKMIVEYIDNELKKIDGLVDEIKLQIQKLKE 416 Query: 406 RRSSFIAAAVTGQIDLRG 423 R S I AVTG+ID+R Sbjct: 417 YRQSLIYEAVTGKIDVRD 434 >gi|77166146|ref|YP_344671.1| restriction endonuclease S subunits-like [Nitrosococcus oceani ATCC 19707] gi|254435813|ref|ZP_05049320.1| hypothetical protein NOC27_2876 [Nitrosococcus oceani AFC27] gi|76884460|gb|ABA59141.1| Restriction endonuclease S subunits-like protein [Nitrosococcus oceani ATCC 19707] gi|207088924|gb|EDZ66196.1| hypothetical protein NOC27_2876 [Nitrosococcus oceani AFC27] Length = 487 Score = 230 bits (587), Expect = 2e-58, Method: Composition-based stats. Identities = 121/444 (27%), Positives = 199/444 (44%), Gaps = 23/444 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLED-VESG 59 Y +YK+S V WIG +P W+V P K N G D I + D G Sbjct: 29 PKYREYKNSDVVWIGEVPSFWEVKPFKWLLTHNEGGVWGDDPAGEGDTIVLRSTDQTVDG 88 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG--------PYLRKAIIADFDGICSTQF 111 + ++ G ++ K L +A Sbjct: 89 NWNVTDPAVRHLTVKENASAVLEAGDLVVTKSSGSALHIGKTTLVNVDMAKLGYCYGNFM 148 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVL 169 L+ L ++++ D+ + + +T +++ + IG I +P+PP+ EQ Sbjct: 149 QRLRLGQKYIPKLAWYVMNNDLVRLQLNLLSNSTTGLANLNATLIGEILLPVPPVEEQTQ 208 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + ET RID LI E+ R IELLKEK+QA++S+ VTKGL+P V MKDSG+EW+G VP Sbjct: 209 IARFLDHETARIDALIEEQQRLIELLKEKRQAIISHAVTKGLDPTVPMKDSGVEWLGEVP 268 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK---LETRNMGLKPESYETYQIV 286 HW KP L +K+ + + L K ++ + Y Sbjct: 269 AHWITKPLKHLAELNPKKSGYHGDRDELCSFVPMEKLKTGVIQLDEERFIADVISGYTYF 328 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGI---ITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + G+++ + + R++ A + G+ + + +++++L + ++ Sbjct: 329 EDGDVLQAKVTPCFENRNIAIADGLTNGVGFGSSEINVLRPFPDVNASFLYYRLQEDGYM 388 Query: 344 KVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + +G+G + + E + V VP EQ I + ++ ETAR+D LVE+ I Sbjct: 389 GICTASMIGAGGLKRVPGEVINGFTVAVPERHEQTQIAHFLDHETARVDKLVEEANVGIE 448 Query: 402 LLKERRSSFIAAAVTGQIDLRGES 425 LLKERRS+ I+AAVTG+ID+RG Sbjct: 449 LLKERRSALISAAVTGKIDVRGWQ 472 >gi|89900160|ref|YP_522631.1| putative type I site-specific restriction-modification system, S subunit [Rhodoferax ferrireducens T118] gi|89344897|gb|ABD69100.1| putative type I site-specific restriction-modification system, S subunit [Rhodoferax ferrireducens T118] Length = 422 Score = 230 bits (585), Expect = 5e-58, Method: Composition-based stats. Identities = 124/419 (29%), Positives = 200/419 (47%), Gaps = 13/419 (3%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 KDSG WIG IP+ W++ +K N+ K ++ + V K + + Sbjct: 1 MKDSGAAWIGEIPQGWEIKRMKDCFISNSRAQP--NKTVLSLSYGKV---IVKDMEEKKG 55 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + G ++ + A GI ++ +L + + + Sbjct: 56 VTPESFDSYQGVHPGDVVLRLTDLQNDQKSLRVGRATTKGIITSAYLCVSSRSLNDRYSA 115 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L + Q++ G + + + +P AEQ I + + +T ID + Sbjct: 116 YLLHDVGDIQKLFYGLGGGVRQSMKFADLAELLFSLPTPAEQRAIADYLDRQTALIDQRL 175 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 T +L E ++A + VTKGLN + MKDSG+ W+G +P WE+K + Sbjct: 176 TTLAEKKAVLAELRKATIHEAVTKGLNKNAPMKDSGVAWIGEIPQGWEIKRMKDCFISNS 235 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R +LSLSYG +I K G+ PES+++YQ V PG++V R DLQND++SL Sbjct: 236 RAQPNKT---VLSLSYGKVIVKDMEEKKGVTPESFDSYQGVHPGDVVLRLTDLQNDQKSL 292 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQSLKFEDVKR 364 R + +GIITSAY+ V ++ Y A+L+ D+ K+FY +G G+RQS+KF D+ Sbjct: 293 RVGRATTKGIITSAYLCVSSRSLNDRYSAYLLHDVGDIQKLFYGLGGGVRQSMKFADLAE 352 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 L +P EQ I + ++ +TA ID + +++ +LK R + I AVTG+IDL G Sbjct: 353 LLFSLPTPAEQRAIADYLDRQTALIDTQLATLDEQAQVLKVLRKAIIHEAVTGKIDLSG 411 >gi|120553353|ref|YP_957704.1| restriction modification system DNA specificity subunit [Marinobacter aquaeolei VT8] gi|120323202|gb|ABM17517.1| restriction modification system DNA specificity domain [Marinobacter aquaeolei VT8] Length = 439 Score = 229 bits (584), Expect = 5e-58, Method: Composition-based stats. Identities = 109/430 (25%), Positives = 182/430 (42%), Gaps = 28/430 (6%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 YP+YK SGVQW+G +P +WK+ +K ++ G+ +S VES P Sbjct: 6 YPEYKGSGVQWLGEVPSNWKIGRLKHLLRIRGGQDYKS-----------VESYVPTDFPV 54 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 G+ Q +T ++ +L G+ G + + T F +VLP Sbjct: 55 IGSGGQFTYATDYLYDGESVLLGRKGTIDKPLYVKGKFWTVDTMFYT----EVLPGTNGR 110 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + T + + + N +P+PP EQ I + ET +ID LI Sbjct: 111 YAYYLATTIPFDLYSTNTALPSMSQFDLANHGLPLPPKCEQTQIARFLDHETAKIDALIR 170 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 E+ R IELL+EK+QA++S+ VTKGL+PDV MKDSG+EW+G VP HW V + Sbjct: 171 EQERLIELLQEKRQAVISHAVTKGLDPDVPMKDSGVEWLGEVPAHWIVARIKNFARVESG 230 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + +++ LK Y ++ + + Sbjct: 231 HTPDKKKEEYWVDCDIPWVSLNDSK--QLKKADYIADTSTKVNDLGIANSSARLLPAAAV 288 Query: 307 SA----------QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQ 355 + ++ +A G L+ Y + F + Sbjct: 289 VFTRDASIGLSAITTKPMAVSQHLIAWLCAGEKLVPEYLLLIFYAMESEFERYTFGATIK 348 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ +DV+ L PP++EQ + + + ++ E++I+LLKERRS+ I++AV Sbjct: 349 TIGMDDVRSLTAAFPPMEEQKQLVTWAFRKKETLQAGLDAAEKTILLLKERRSALISSAV 408 Query: 416 TGQIDLRGES 425 TG+ID+R Sbjct: 409 TGKIDVRNWQ 418 >gi|21229080|ref|NP_635002.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] gi|20907634|gb|AAM32674.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] Length = 460 Score = 229 bits (583), Expect = 8e-58, Method: Composition-based stats. Identities = 111/430 (25%), Positives = 192/430 (44%), Gaps = 17/430 (3%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 + K YP YKDSGV W+G +P+HWK+ K + + + D + + K Sbjct: 4 NLKPYPAYKDSGVPWLGEVPEHWKLKRTKTVLRERSQKGFP---DEPLLAATQTKGVVRK 60 Query: 63 YLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 L ++ D + + + L + A GI S + +L P + Sbjct: 61 ELYENRTVLALKDLHLLKLVRVNDFVIS-LRSFQGGIEFAHEQGIISPAYTILYPVEAQN 119 Query: 122 ELLQGWLLSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 WL ++ + D+ + +P+PP +EQ I + Sbjct: 120 HGFLAWLFKSKPYIENLSLFVTGIREGQNIDYVKLSRSELPLPPFSEQSSIVRYLDHIDR 179 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 RI I + +FI+LL+E+KQA++ VT GL+P+VK+K SG+EW+G VP+HWEVKP Sbjct: 180 RIRRYIHAKQKFIKLLEEQKQAIIHQSVTHGLDPNVKLKPSGLEWLGDVPEHWEVKPAKW 239 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 E++ +++ E + + + E ESY Y++ ++V + Sbjct: 240 YYHEIDERSSTGSEELLSVSHITGVTPRSEKNITMFMAESYVGYKLCRENDLVINTMWAW 299 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVFYAMGSGL--- 353 + + GI++ +Y +P S Y+ L+R+ + +G+ Sbjct: 300 MAALGVA----QQTGIVSYSYGVYRPIHKEAFLSQYIDLLLRTKPYVAEYICRSTGIHSS 355 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 R L E R+P++ PPI EQ I + I+ +T+ ++ + Q I LL+E R+ IA Sbjct: 356 RLRLYPEQFLRIPIIRPPIVEQQAILDEIHNKTSELEHAINTSNQEISLLREYRTRLIAD 415 Query: 414 AVTGQIDLRG 423 VTG++D+R Sbjct: 416 VVTGKLDVRE 425 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 51/213 (23%), Positives = 95/213 (44%), Gaps = 8/213 (3%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + L P KDSG+ W+G VP+HW++K ++ E ++K + + G + + Sbjct: 1 MIHNLKPYPAYKDSGVPWLGEVPEHWKLKRTKTVLRERSQKGFPDEPLLAATQTKGVVRK 60 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP- 325 +L L + ++V + V Q E+GII+ AY + P Sbjct: 61 ELYENRTVLALKDLHLLKLVRVNDFVISLRSFQG-----GIEFAHEQGIISPAYTILYPV 115 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + +LAWL +S + +G+R Q++ + + R + +PP EQ I ++ Sbjct: 116 EAQNHGFLAWLFKSKPYIENLSLFVTGIREGQNIDYVKLSRSELPLPPFSEQSSIVRYLD 175 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 RI + ++ I LL+E++ + I +VT Sbjct: 176 HIDRRIRRYIHAKQKFIKLLEEQKQAIIHQSVT 208 >gi|257064600|ref|YP_003144272.1| hypothetical protein Shel_19070 [Slackia heliotrinireducens DSM 20476] gi|256792253|gb|ACV22923.1| hypothetical protein Shel_19070 [Slackia heliotrinireducens DSM 20476] Length = 425 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. Identities = 105/428 (24%), Positives = 187/428 (43%), Gaps = 16/428 (3%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 Y YKDSGV+WIG IP W + K + + + L + + Sbjct: 3 RYEAYKDSGVEWIGEIPSTWTLARTKAVFSSKKRVVGDKANEYQRLALT-MHGVLLRDKD 61 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKL---GPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + I ++++ + + ++ + GI S ++ L D Sbjct: 62 DNEGLQPEQFEGYQILEANELVFKLIDLENIKTSRVGLSPYTGIVSPAYITLTQTDSDNR 121 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 W ++ + S + + N+PM +P EQ I + A T ID Sbjct: 122 YFYYWFFALYQQNVFNQLGGNGVRSALNKDDLLNLPMLLPKQDEQRAIANYLDARTAEID 181 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 L+ + R ELL+E ++A++S VTKGL+PD MKDSG+EW+G +P+ W V+P L Sbjct: 182 ALVADCEREAELLREYRKAVISEAVTKGLDPDAPMKDSGVEWIGEIPEGWLVRPSKTLFA 241 Query: 243 ELNRKNTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 E E + YG I Q +E + M + ++ + ++ V+PG+ V Sbjct: 242 EAKELRHSDDEQCAATQKYGIIPQARYIAIENQRMVVADKNLDAWKHVEPGDFVISLRSF 301 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGLR--Q 355 Q G +T Y+ +K + +++ Y +L ++ + + +R Q Sbjct: 302 QG-----GLELSEITGCVTWHYIVLKGNDLVEAGYFKYLFKTTKYIESLQRTCTYIRDGQ 356 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 L++ + ++P+ +P +EQ I ++ +TA ID L+E + L+E R S I+ AV Sbjct: 357 DLRYSNFVQVPLPLPSREEQVAIGVYLDAKTAEIDALIEAKQTMADKLREYRKSLISEAV 416 Query: 416 TGQIDLRG 423 TG+ + G Sbjct: 417 TGKFKVPG 424 >gi|330992551|ref|ZP_08316499.1| hypothetical protein SXCC_02458 [Gluconacetobacter sp. SXCC-1] gi|329760750|gb|EGG77246.1| hypothetical protein SXCC_02458 [Gluconacetobacter sp. SXCC-1] Length = 432 Score = 228 bits (581), Expect = 1e-57, Method: Composition-based stats. Identities = 120/435 (27%), Positives = 197/435 (45%), Gaps = 16/435 (3%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M + YP YKDSGV+WIG IP W +K ++ G++ S G Sbjct: 1 MS-FPKYPAYKDSGVEWIGEIPVGWHSACLKHVAIVDAGQSPASTDCNTEGCGLPFLQGC 59 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + + T K IL P R +AD ++P Sbjct: 60 ADFGVCYPVPKNYCTIPPKSCCKEDILLSVRAPVGR-LNVADRQYGIGRGLCSIRPSSSH 118 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + + + + + +I G+T + I N + +PPL EQ I + E + Sbjct: 119 DKKYFLYTI-LFLEEYFHSISTGSTYEAISTEQIKNTILFLPPLPEQQAIASFLDRECGK 177 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID LI E+ R I LL EK+QA++S+ VTKGLNP+ MKDSGI W+G+V + WE+ + Sbjct: 178 IDALIAEQERLIALLAEKRQAVISHAVTKGLNPNAPMKDSGIPWIGMVSEEWEIVRLGTI 237 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK---PESYETYQIVDPGEIVFRFID 297 E+N + + +S+ G ++L + K + Y V PG++ + + Sbjct: 238 FEEVNESGNENLPILSVSIHTGVSDEELSDEKLDRKVTRSDDRSKYIAVRPGDLTYNMMR 297 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW---LMRSYDLCKVFYAMGSGL- 353 G+++ AY+ +P I + L+R+ + G+ Sbjct: 298 AWQGGFGTVQVM----GMVSPAYVVARPKNISRQKTDFIELLLRTPNAISEMKRYSRGVT 353 Query: 354 --RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 R L +E+ K++ + +P +KEQ +I N + +T D L +I LLKERR++ I Sbjct: 354 DFRLRLYWEEFKKICIPLPILKEQDEILNFLKEKTGHFDALATTARNAITLLKERRAALI 413 Query: 412 AAAVTGQIDLRGESQ 426 +AAVTG+ID+R +S+ Sbjct: 414 SAAVTGKIDVRAQSK 428 >gi|110639314|ref|YP_679523.1| type I restriction-modification system [Cytophaga hutchinsonii ATCC 33406] gi|110281995|gb|ABG60181.1| probable type I restriction-modification system [Cytophaga hutchinsonii ATCC 33406] Length = 432 Score = 228 bits (580), Expect = 2e-57, Method: Composition-based stats. Identities = 110/431 (25%), Positives = 190/431 (44%), Gaps = 15/431 (3%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 + YP YKDSGV+W+G IPKHW+ + +K + + + + ++++ + Sbjct: 4 KLQKYPAYKDSGVEWLGEIPKHWECIRMKHLFRDYSEKN-KQNEELLSVTQNQGVVPRS- 61 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 ++ + KG L + DGI S + VL+ K + Sbjct: 62 WVESRMVMPSGALESFKFIQKGDFAIS-LRSFEGGLEYCHHDGIISPAYTVLKTKRKIAN 120 Query: 123 LLQGWLLSI--DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 +L +++ +I + + + +PIP + EQ I + +T + Sbjct: 121 QYYKYLFKSSAFISELQTSIVGIREGKNISYPELSYSLLPIPKIDEQSCIATFLDDKTAK 180 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID I+ + + IELLKE++Q L+ VT+GLNP VKMKDSG+EW+G VP+ WEVK L Sbjct: 181 IDQAISIKQKQIELLKERRQILIHKAVTRGLNPKVKMKDSGVEWIGEVPEGWEVKKLLGL 240 Query: 241 VTEL-----NRKNTKLIESNILSLSYGNIIQKLE---TRNMGLKPESYETYQIVDPGE-I 291 + K+ L + ++L YG + E N + E Y+ QIV+ G+ I Sbjct: 241 CNFIRGNSSFGKDDLLNDGEYVALQYGKTYKVNEVNEEYNYFVNNEFYKASQIVNYGDTI 300 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + + + + G+I + + P+ S K + Sbjct: 301 IIATSETIEELGHTAYYKRNDLGLIGGEQILLNPNNDKINSHYLYFTSRVFSKELRKYAT 360 Query: 352 GL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G+ D+K + + +PP+ EQ I I TA+I + E I LKE +++ Sbjct: 361 GIKVFRFNINDLKTIYIAIPPLSEQQQIVEYIETTTAKIATAISLKENEIEKLKEYKANL 420 Query: 411 IAAAVTGQIDL 421 + +AVTG+I + Sbjct: 421 VNSAVTGKIKV 431 >gi|189499714|ref|YP_001959184.1| putative type I restriction-modification system [Chlorobium phaeobacteroides BS1] gi|189495155|gb|ACE03703.1| putative type I restriction-modification system [Chlorobium phaeobacteroides BS1] Length = 436 Score = 228 bits (580), Expect = 2e-57, Method: Composition-based stats. Identities = 114/434 (26%), Positives = 188/434 (43%), Gaps = 15/434 (3%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M + YP+YK SGV+W+G +P+HW+++ +R + + + + Sbjct: 1 MS-FPRYPKYKASGVEWLGEVPEHWQMINSRRLFHQAKE-SPLTDDIQLSATQKYGVVPQ 58 Query: 61 GKYLPKDGNSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 ++ DG S + L + + + G S + VL+P + Sbjct: 59 SLFMESDGKVALALSGLGNFKHVEVDDFVIS-LRSFQGGIERSKYSGCVSPAYTVLRPAE 117 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMS--HADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + G+LL I ++ G IP+P PPLAEQ I E + Sbjct: 118 PIDGSYWGFLLKSRRYVEILQTMNDGLRDGKSISYQQFGQIPLPSPPLAEQTAIAEFLDR 177 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 ET +ID L+ E+ R +ELLKEK+QA++S+ VTKGLNP MK SGIEW+G VP W V Sbjct: 178 ETGKIDELVAEQRRLMELLKEKRQAVISHAVTKGLNPHAPMKPSGIEWLGDVPVGWSVLK 237 Query: 237 FFALVTELNR-----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 + ++ I G+++ + R M + + Sbjct: 238 LGNISRFKGGAGFPDSYQGQTDNEIPFFKVGDMVNADDARVMRRANHTITEATARELRAF 297 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI---DSTYLAWLMRSYDLCKVFYA 348 VF + K R + + + G+ D + + +L+ L + Sbjct: 298 VFPESTIVFAKVGAALLLKRYRLLGQRSCIDNNMMGMTVGDGSSVDYLLYVLPLLDLELI 357 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + G S+ + + +PPI EQ +I + TA+ D L + +++I LL+ERR+ Sbjct: 358 VNPGAVPSINEGQISGQRIALPPIDEQREIVEFLTSVTAKFDTLTAEAQRTIDLLQERRT 417 Query: 409 SFIAAAVTGQIDLR 422 + I+AAVTGQID+R Sbjct: 418 ALISAAVTGQIDVR 431 >gi|119513480|ref|ZP_01632504.1| hypothetical protein N9414_06519 [Nodularia spumigena CCY9414] gi|119461860|gb|EAW42873.1| hypothetical protein N9414_06519 [Nodularia spumigena CCY9414] Length = 437 Score = 227 bits (578), Expect = 3e-57, Method: Composition-based stats. Identities = 111/434 (25%), Positives = 176/434 (40%), Gaps = 14/434 (3%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDVESGT 60 K YP YKDSG+ W+G IP+HW++V F G + + + +++ G Sbjct: 2 KRYPHYKDSGIDWLGDIPEHWEIVRFSNFINFQEGPGIMAADFKDYGVPLLRIHNLKPGF 61 Query: 61 GKYLPKDGNSRQSDTSTVSIFA--KGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQP 116 + Q T F + IL +I+ I T + L+P Sbjct: 62 VDLERCNYLEPQKVEKTWKHFKLNEDDILISCSASTGLVSIVDKKAEGSIAYTGIIRLKP 121 Query: 117 KDVLPELLQGWLLSID--VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + ++ +IE + G T+ H + I + PPL EQ I + Sbjct: 122 ANSNICREFIKIIVASELFFTQIELLKTGTTIQHYGPTHLRQIKITFPPLYEQKKIACFL 181 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 ++ ID I+ + R IELLKE+K A+++ VTKGLNP MK SGIEW+G +P HWEV Sbjct: 182 DSKLEEIDKFISNKQRLIELLKEQKTAIINRAVTKGLNPHAPMKPSGIEWLGDIPAHWEV 241 Query: 235 KP---FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 + K + ++ +I +++ S + Sbjct: 242 TRAKHISYVFVPQRNKPNLNLNIGFPWITMEDITSPSISKSTFGYLVSEIDAMNAGSKLL 301 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + S+ + II A P I+ YL +L+ A Sbjct: 302 PEGSVIASCVGNFGLSSVNTLQVIINQQLQAYIPIKINPYYLRYLIGISKSYFEQIANA- 360 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + LP+++PP EQ I I+ E ID + IE+ I L+KE R++ I Sbjct: 361 TTLAYVNQAGFAELPIILPPNDEQLAIVRNIDKELTTIDKAITTIEKEIELIKEYRTTLI 420 Query: 412 AAAVTGQIDLRGES 425 + AVTG+ID+R + Sbjct: 421 SEAVTGKIDVRETA 434 >gi|119896299|ref|YP_931512.1| Type I site-specific deoxyribonuclease [Azoarcus sp. BH72] gi|119668712|emb|CAL92625.1| Type I site-specific deoxyribonuclease [Azoarcus sp. BH72] Length = 449 Score = 226 bits (576), Expect = 5e-57, Method: Composition-based stats. Identities = 114/448 (25%), Positives = 179/448 (39%), Gaps = 25/448 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGL 53 M Y +YKDSGV + IP HW+ P+KR L S + D + Sbjct: 1 MS-LPRYAEYKDSGVALLATIPAHWEPSPLKRVVALVESGVSVNAVDEPAGPDAVGVLKT 59 Query: 54 EDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG--PYLRKAIIADFDG---ICS 108 V SG + + G ++ ++ + A + + + Sbjct: 60 SCVYSGNFSHGENKAVVAEELDRVACPVRAGTLIVSRMNTPALVGAAGLVEENADNLFLP 119 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAE 166 + + +P+ W S +++ C G + M + MP+PP E Sbjct: 120 DRLWQVHFSGAVPKFAHYWTASPSYRAQVQMACAGTSASMQNLSQDEFLRFVMPLPPKDE 179 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q I + ET +ID LI ++ + I LL EK+QA +S+ VT+GLNPD MKDSG+ W+G Sbjct: 180 QTAIAAFLDRETAKIDALIAKQEKLIALLAEKRQATISHAVTRGLNPDAPMKDSGVAWLG 239 Query: 227 LVPDHWEVKPFF-----ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--- 278 VP HW V +R I L G I + Sbjct: 240 EVPAHWSVSALSYLASLETGATPDRGEPSYWNGTIPWLKTGEINWAPICEAEEFITDAGL 299 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 +I PG ++ + + ++ A A+ Sbjct: 300 ENSAAKIAKPGTLLMAMYGQGVTRGRVALLEI--EATYNQACAAINFRSRIIPEFGRYFF 357 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 V A + +L + ++ + VPP+ EQ + ++VETA++DVL + E+ Sbjct: 358 MAAYDHVRDAGNETSQMNLSAGLISKIRLPVPPLDEQQAVVRFLDVETAKLDVLGAESER 417 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I LLKERRS+ IAAAVTGQID+R ++ Sbjct: 418 GITLLKERRSALIAAAVTGQIDVRNTAE 445 >gi|299068119|emb|CBJ39334.1| putative type I restriction-modification methylase S subunit [Ralstonia solanacearum CMR15] Length = 445 Score = 226 bits (575), Expect = 6e-57, Method: Composition-based stats. Identities = 129/437 (29%), Positives = 194/437 (44%), Gaps = 20/437 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M HYK YP YKDSGV+W+G +P HW V + + + S KD + + + Sbjct: 1 MSHYKPYPAYKDSGVRWLGKVPAHWSVGRLANSFEERRAKV--SDKDFPALSVTKL---- 54 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 G + ++ D + KG I + +AD DG S VL PK + Sbjct: 55 GVVPQLENVAKTDDGDNRRMVLKGDIAINSRSDRKGASGLADRDGSVSLIITVLTPKPSV 114 Query: 121 P-ELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 E + S + + G + ++ + I + PP+ EQ I + E Sbjct: 115 WGEYCHHLIRSEIFQEEYFRVGNGLVADLWTTNYSSMRTIFLARPPIEEQKAIASHLDRE 174 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 T RID L+ ++ RFIELL EK+QAL+++ VTKGL P MK SG+EW+G VP+HW +K Sbjct: 175 TARIDALVEKKTRFIELLGEKRQALITHAVTKGLGPGKPMKGSGVEWLGEVPEHWVIKRL 234 Query: 238 FALVTELNRKNT-----KLIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIVDPG 289 + + L N+ + E + ++ G Sbjct: 235 KFIARVQTGVAKGKDLADKDTIEVPYLRVANVQDGFLDLDEVATIEIDKRDLERYLLQLG 294 Query: 290 EIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF-- 346 +++ D R + + I + AV+PHG+ S +L S F Sbjct: 295 DVLMNEGGDFDKLGRGHVWSGEISPCIHQNHVFAVRPHGVSSPWLNAFTSSAAAQFYFMG 354 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + S S+ ++ LPV VPP EQF+I + ++D +V K E+SI LL+E Sbjct: 355 KSKQSTNLASISSSNLMELPVPVPPEPEQFEILAEVQKNLEKLDNVVRKTERSIELLREH 414 Query: 407 RSSFIAAAVTGQIDLRG 423 RS+ I AAVTGQIDLR Sbjct: 415 RSALITAAVTGQIDLRD 431 >gi|120553175|ref|YP_957526.1| restriction modification system DNA specificity subunit [Marinobacter aquaeolei VT8] gi|120323024|gb|ABM17339.1| restriction modification system DNA specificity domain [Marinobacter aquaeolei VT8] Length = 461 Score = 225 bits (574), Expect = 9e-57, Method: Composition-based stats. Identities = 114/447 (25%), Positives = 193/447 (43%), Gaps = 29/447 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPI-KRFTKLNTGRTSESGKDIIYIGLEDVESG 59 M + AYP+YK++ + W+ IP W+++P RF + ++++ + + Sbjct: 1 MS-FPAYPEYKNTEIPWMQRIPSSWQLLPFFSRFFERKESNKGMKSENLLSLSFGRIVRK 59 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK----AIIADFDGICSTQFLVLQ 115 L + T + G I++ K + I + GI ++ +L + Sbjct: 60 DITTLE---GLLPASFETYQVVHPGNIVFRLTDLQNDKRSLRSAIVNEKGIITSAYLAVS 116 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 KD P + D+ + ++ G S + + +P+ P + EQ I + Sbjct: 117 AKDFNPTFSNYLFRAYDLMKVFYSMGGGLRQSM-KYDDMKWLPIVCPSINEQTQIARFLD 175 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 ET +ID LI E+ R IELL+EK+QA++S+ VTKGL+PDV MKDSG+EW+G VP HW+ Sbjct: 176 HETAKIDALIREQERLIELLQEKRQAVISHAVTKGLDPDVPMKDSGVEWLGEVPAHWDRT 235 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGN----IIQKLETRNMGLKPES----------YE 281 + + + + + I+ ++M + E Sbjct: 236 LIKHCCYINDGNHGEEYPKGDDFVDDADIGVPFIRGGNLKDMTVTTEGMLYITAEKNRSM 295 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + G+I+F + S+ AY+ V+ ID YL + S Sbjct: 296 RKGRLQVGDILFVNRGEIGKLAVIPSSMNGANLNSQIAYLRVENRIIDPHYLVHYLASDT 355 Query: 342 LCKVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + A G + + D+ + V VPP EQ I+ + + +VL + Sbjct: 356 IKAEIKAAQEGSVLTQYPIS--DLAAIHVPVPPKDEQQKISTYLKEQLFSFNVLTSEASN 413 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGES 425 SI LL ERRS+ I+AAVTG+ID+R Sbjct: 414 SINLLSERRSALISAAVTGKIDVRNWQ 440 >gi|56750493|ref|YP_171194.1| type I restriction-modification [Synechococcus elongatus PCC 6301] gi|56685452|dbj|BAD78674.1| type I restriction-modification [Synechococcus elongatus PCC 6301] Length = 453 Score = 225 bits (574), Expect = 9e-57, Method: Composition-based stats. Identities = 101/451 (22%), Positives = 190/451 (42%), Gaps = 26/451 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSE-----SGKDIIY-IGL 53 M + YP YKD G++W+ +P HW V+ ++R ++ +G + + I + Sbjct: 1 MS-FPRYPAYKDCGIEWLEKLPSHWNVLQLRRLIPEIESGVSVNALDHAPDEGIPSVLKT 59 Query: 54 EDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY-----LRKAIIADFDGICS 108 V +G+ + + ++ G+++ ++ +++ Sbjct: 60 SCVYTGSFRPEERKEIIQEDIDRAACPVKSGRLIVSRMNTPDLVGAAGLSLVDYDCVFLP 119 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAE 166 + ++ +V P W + +++ +C G + M + + +P+P E Sbjct: 120 DRLWQVRISNVYPNFAYYWTQTQIYRDQVKMVCSGTSSSMQNLSQDNFLSFILPVPSDEE 179 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q+ I + ET +ID LI E+ R I LL+EK+QA++S+ VTKGLNPD +KDSGIEW+G Sbjct: 180 QIAIASFLDRETAKIDALIAEQQRLIALLQEKRQAVISHAVTKGLNPDAPLKDSGIEWLG 239 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILS---LSYGNIIQKLETRNMGLKPESYE-- 281 VP HW+ + E + S ++ + N ++ Sbjct: 240 QVPAHWKTGKIKHYFKTSSGGTPNTEEQALYYADSDSGIPWVRTTDIENQEVRSAEVSIT 299 Query: 282 ------TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 T + P + V + I + + + + Sbjct: 300 NQAIQDTACEILPVDTVLVALYGGGGTVGKNGILTFPAAINQALCALLPSYYAVPMFTFR 359 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++ + A+ + ++ E V+ +PP+ EQ I I+ + I L + Sbjct: 360 YIQFLRPFWMERAVSARKAGNISQELVRDTVFALPPLDEQILIVKHIHSQLEEITSLENE 419 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +S+ LL+ERRS+ I+AAVTGQID+RG ++ Sbjct: 420 STKSLSLLQERRSALISAAVTGQIDVRGLAE 450 >gi|283954324|ref|ZP_06371845.1| hypothetical protein C414_000210006 [Campylobacter jejuni subsp. jejuni 414] gi|283794123|gb|EFC32871.1| hypothetical protein C414_000210006 [Campylobacter jejuni subsp. jejuni 414] Length = 411 Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats. Identities = 114/426 (26%), Positives = 199/426 (46%), Gaps = 20/426 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 MK++ KDSG++W+G IP+ W+VVPI+ R +++ ++ + + + Sbjct: 1 MKNF------KDSGIEWLGEIPQDWEVVPIRCCFGEFNIRCNDNDYPLLSVTIANGVVYQ 54 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD-FDGICSTQFLVLQPKDV 119 K + D S I G I Y K+ + I GI S ++V P Sbjct: 55 NDITDKK-DISNDDKSNYKIVPLGAIAYNKMRMWQGAVGINMLEKGIVSPAYVVAIPNKQ 113 Query: 120 LP-ELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + L S ++ E G M++ ++ NI +P+PPL EQ I + Sbjct: 114 INISFSYYLLKSRNIIGEYEKNSYGLCSDMNNLRYEDFQNIKIPLPPLKEQEQIVNFLDE 173 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 + +I I ++ + I LLKE+KQAL++ +TKGLN +V KDSGIEW+G +P+HW++ Sbjct: 174 KCEQIANFIEKKEKLISLLKEQKQALINETITKGLNKNVNFKDSGIEWLGEIPEHWKILK 233 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + + N+K+ + + NI K + E + G+I+F + Sbjct: 234 LKHIASLRNQKSNNIDFR----IGLENIESKTGKFIPSSEIVFEEDGIGFEKGDILFGKL 289 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQ 355 K GI S ++ +K + ++ +LM S + + G Sbjct: 290 RPYLAKV----FLTDRDGICVSEFLVLKIKSESNKFIKFLMLSSLFIDIVDSSTYGTKMP 345 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +E + L + +PP+KEQ I N ++ + +ID+L+EK ++ I L+KE +++ I AV Sbjct: 346 RANWEFIGNLKIPLPPLKEQEQIANFLDKKCEKIDLLIEKTKKQIKLIKEYKTTLINQAV 405 Query: 416 TGQIDL 421 G++DL Sbjct: 406 CGRMDL 411 >gi|288928859|ref|ZP_06422705.1| probable type I restriction-modification system [Prevotella sp. oral taxon 317 str. F0108] gi|288329843|gb|EFC68428.1| probable type I restriction-modification system [Prevotella sp. oral taxon 317 str. F0108] Length = 428 Score = 225 bits (572), Expect = 1e-56, Method: Composition-based stats. Identities = 115/426 (26%), Positives = 204/426 (47%), Gaps = 11/426 (2%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M+ + Y YKDSGV+W+G IP+HW+V IK + + + I+ + Sbjct: 7 MEKF--Y-VYKDSGVKWLGNIPQHWEVRKIKYVFTERSQKGFPK-EPILCSTQKYGVIPQ 62 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 Y + + KG + L + A + GI S + +L D Sbjct: 63 HMY-ENRVVVVNKGLEGLKLVRKGDFVIS-LRSFQGGIEYAYYQGIISAAYTILNLNDNC 120 Query: 121 PELLQGWLLSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 +L+ ++ C + ++ + +P+PPLAEQ I + + Sbjct: 121 YSNYIKYLMKSFDFIQLLQTCVTGIREGQNINYTLLRKSSLPLPPLAEQRAIVSYLDGKV 180 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +IDT + ++ + IELLKE KQA+++ VTKG++ K+K +GI W+G VP HWE Sbjct: 181 GQIDTYVAKQTQQIELLKELKQAVIANAVTKGIDNKAKLKQTGISWIGHVPQHWERCRCK 240 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 ++TE+ + E +LSL+ +I + + G P+ + TY++V P ++VF D+ Sbjct: 241 DVLTEI-KLLVGNGEYALLSLTTNGVIVRDLSEGKGKFPKDFNTYKVVKPNDLVFCLFDV 299 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 R++ V G++T AY + +D+++L + D K + GLR+ + Sbjct: 300 DETPRTVG--LVHNHGMLTGAYNVFETKNVDTSFLYHYFIALDNRKALKPLYKGLRKVIP 357 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +P+ +PP+ EQ I + I +TA I+ L++ EQ + +KE + I+ AVTG+ Sbjct: 358 LPAFMSMPLYIPPLSEQRAIVSYIEAKTASINKLIDAYEQQVERVKEYKQRLISDAVTGK 417 Query: 419 IDLRGE 424 +++ E Sbjct: 418 MNVTDE 423 >gi|331650479|ref|ZP_08351551.1| putative type I restriction-modification system, S subunit [Escherichia coli M605] gi|331040873|gb|EGI13031.1| putative type I restriction-modification system, S subunit [Escherichia coli M605] Length = 435 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. Identities = 107/434 (24%), Positives = 182/434 (41%), Gaps = 25/434 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGL 53 + Y+AYP+Y+DSG++W +P +WK ++ + + G T + +I Sbjct: 4 LNKYQAYPEYRDSGMEWCNELPLNWKKTKLRWLSNIFAGGTPSKNVIDYWENGTVPWISS 63 Query: 54 EDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQF 111 V G ++ + S+ KG ++ G + C+ Sbjct: 64 GAVNQGYIVEPSTYISNAALENSSAKWIPKGALVVALAGQGKTKGMVAQLGINTTCNQSM 123 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 + + + I Q I + G + + +G+I P P E I Sbjct: 124 AAIVLYKK-NQSRYIFWWLISNYQNIRNMAGGDLRDGLNLELLGDIQCPKPRNDESSKIA 182 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + ET +ID LI ++ + IELLKEK+QA++S+ VTKGLNPDV MKDSG+EW+G VP H Sbjct: 183 LFLDHETAKIDNLIEKQQQLIELLKEKRQAVISHAVTKGLNPDVPMKDSGVEWLGEVPKH 242 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 W + + + ++I + R Y ++ Sbjct: 243 WHICKLKWFANLKSGD--FITSNSIEPEGNYPVYGGNGLRGYYSYFTHNGEYVLIGRQGA 300 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + I+ ++ E ++ + +L L+R +L + S Sbjct: 301 LCGNIN-----YAIGKFWASEHAVV-----VTPNERAVTIWLGELLRIMNLNQY---SVS 347 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + L E + L + +PP +EQ +I I+ + L+E +I LLKERR++ I Sbjct: 348 AAQPGLAVERITDLYIPIPPYQEQVNIGTYISKYISLDKKLIEHSTDNIELLKERRTALI 407 Query: 412 AAAVTGQIDLRGES 425 +AAVTG+IDLR + Sbjct: 408 SAAVTGKIDLRNWT 421 >gi|189425259|ref|YP_001952436.1| type I restriction-modification system specificity subunit [Geobacter lovleyi SZ] gi|189421518|gb|ACD95916.1| type I restriction-modification system specificity subunit [Geobacter lovleyi SZ] Length = 461 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. Identities = 104/432 (24%), Positives = 193/432 (44%), Gaps = 18/432 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESG 59 + K YP+Y++S + W+G +P HW P + + T K ++ + + Sbjct: 2 IAELKPYPEYRESELAWLGDVPSHWHSGPGFSAFREKKVKNTGLQEKTVLSLSYGRI--- 58 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQ 115 K K T I G I+ + I GI ++ ++ ++ Sbjct: 59 IVKPEDKLHGLVPESFETYQIVDPGDIIIRSTDLQNDKTSLRVGIVKNRGIITSAYMCMK 118 Query: 116 PKDVL-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + L PE L ++D+T+ + + G + D+ +P+ IPP+ EQ I + Sbjct: 119 VTETLMPEYGYQLLHTLDLTKILYGLGSGL-RQNLDYSDFKRLPLSIPPIDEQTSIVRFL 177 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 +RI+ I + + I LL E+KQ ++ VT+GL+P+V++K SGI W+G +P HWE Sbjct: 178 NHANLRIEKAIRAKRKVIALLNEQKQVIIHRAVTRGLDPNVQLKPSGIPWLGDIPGHWED 237 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 + E++ ++ E+++ +I + L ESY ++ G++V Sbjct: 238 LRSKYVFHEVDERSVTGTETHLSMSQKYGLIPNSQIEERRLVSESYVGAKLCRSGDLVLN 297 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + +L +G+I+ Y +P + + Y + R+ G+ Sbjct: 298 RLKAHLGVFALAP----GQGLISPDYTVFRPARPMVARYFEAMYRTPACRVELRKRAKGI 353 Query: 354 RQ---SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 Q L +D + V VPP+ EQ++I ++ E I+ ++ E+ I LL+E R+ Sbjct: 354 VQGFWRLYTDDFYDIRVPVPPLDEQYEIMQYLDKELLVINTVIASTEREIDLLREYRTRL 413 Query: 411 IAAAVTGQIDLR 422 IA VTG++D+R Sbjct: 414 IADVVTGKLDVR 425 >gi|81299873|ref|YP_400081.1| type I restriction-modification [Synechococcus elongatus PCC 7942] gi|81168754|gb|ABB57094.1| type I restriction-modification [Synechococcus elongatus PCC 7942] Length = 453 Score = 224 bits (571), Expect = 2e-56, Method: Composition-based stats. Identities = 101/451 (22%), Positives = 190/451 (42%), Gaps = 26/451 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSE-----SGKDIIY-IGL 53 M + YP YKD G++W+ +P HW V+ ++R ++ +G + + I + Sbjct: 1 MS-FPRYPAYKDCGIEWLEKLPSHWNVLQLRRLIPEIESGVSVNALDHAPDEGIPSVLKT 59 Query: 54 EDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY-----LRKAIIADFDGICS 108 V +G+ + + ++ G+++ ++ +++ Sbjct: 60 SCVYTGSFRPEERKEIIQEDIDRAACPVKSGRLIVSRMNTPDLVGAAGLSLVDYDYVFLP 119 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAE 166 + ++ +V P W + +++ +C G + M + + +P+P E Sbjct: 120 DRLWQVRISNVYPNFAYYWTQTQIYRDQVKMVCSGTSSSMQNLSQDNFLSFILPVPSDEE 179 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q+ I + ET +ID LI E+ R I LL+EK+QA++S+ VTKGLNPD +KDSGIEW+G Sbjct: 180 QIAIASFLDRETAKIDALIAEQQRLIALLQEKRQAVISHAVTKGLNPDAPLKDSGIEWLG 239 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILS---LSYGNIIQKLETRNMGLKPESYE-- 281 VP HW+ + E + S ++ + N ++ Sbjct: 240 QVPAHWKTGKIKHYFKTSSGGTPNTEEQALYYADSDSGIPWVRTTDIENQEVRSAEVSIT 299 Query: 282 ------TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 T + P + V + I + + + + Sbjct: 300 NQAIQDTACEILPVDTVLVALYGGGGTVGKNGILTFPAAINQALCALLPSYYAVPMFTFR 359 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++ + A+ + ++ E V+ +PP+ EQ I I+ + I L + Sbjct: 360 YIQFLRPFWMERAVSARKAGNISQELVRDTVFALPPLDEQILIVKHIHSQLEEITSLENE 419 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +S+ LL+ERRS+ I+AAVTGQID+RG ++ Sbjct: 420 STKSLSLLQERRSALISAAVTGQIDVRGLAE 450 >gi|88811656|ref|ZP_01126910.1| type I restriction-modification system specificity subunit [Nitrococcus mobilis Nb-231] gi|88791047|gb|EAR22160.1| type I restriction-modification system specificity subunit [Nitrococcus mobilis Nb-231] Length = 710 Score = 224 bits (570), Expect = 2e-56, Method: Composition-based stats. Identities = 105/433 (24%), Positives = 185/433 (42%), Gaps = 16/433 (3%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKY 63 K YP+YK + W+G IP+HW V+P + ++ + + I V Sbjct: 251 KPYPEYKPTAQAWLGEIPQHWSVLPNRALFNEVKDRGHPDEEMLSVTITKGIVRQKALLE 310 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL--P 121 +S D S + I Y K+ + + GI S ++V++ ++ P Sbjct: 311 GSSKKDSSNLDKSAYKLVQPRDIAYNKMRAWQGAIGASALRGIISPAYVVMRLRNGDDLP 370 Query: 122 ELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + E G T M + I P PP AEQ I + Sbjct: 371 SYIHYLYRTPQFAKEAERWSYGITSDMWSLRPEHFKMIYTPEPPTAEQEAIVRFLDWANG 430 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 R++ + + I LL E+KQA++ VT+GL+ V +K SGI W+G +P HWEVK Sbjct: 431 RLERATRAKRKVIALLNEQKQAIIHQAVTRGLDSSVPLKPSGIPWLGHIPRHWEVKRIKY 490 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 L+ E++ ++T E + + ++ E + + + ++IV PG+ V + Sbjct: 491 LLREVDERSTTGSEPLLSMRMHHGLVLFAEHFSRPPQAATLVGFKIVHPGQFVVNRM--- 547 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYA------MGSG 352 + G+++ Y P G + +L L RS + F A G+ Sbjct: 548 -QAGNGVIFASTLTGLVSPDYAVFDPIGDANVDFLGELFRSRKVRAKFRAESKGLGTGTS 606 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 L + + + V +PP EQ DI + E + ++ + ++E I LL+E R+ +A Sbjct: 607 GFLRLYNDRLGAIHVALPPRAEQGDIVAGLTRELSEVNTTISRLESEIELLREYRTRLVA 666 Query: 413 AAVTGQIDLRGES 425 VTG++D+R + Sbjct: 667 DVVTGKLDVREAA 679 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 52/214 (24%), Positives = 88/214 (41%), Gaps = 12/214 (5%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 L P + K + W+G +P HW V P AL E+ + E ++++ G + QK Sbjct: 250 LKPYPEYKPTAQAWLGEIPQHWSVLPNRALFNEVKDRGHPDEEMLSVTITKGIVRQKALL 309 Query: 271 RNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 K Y++V P +I + + + RGII+ AY+ ++ Sbjct: 310 EGSSKKDSSNLDKSAYKLVQPRDIAYNKMRAWQGAIGASAL----RGIISPAYVVMRLRN 365 Query: 328 ID--STYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 D +Y+ +L R+ K G+ SL+ E K + PP EQ I + Sbjct: 366 GDDLPSYIHYLYRTPQFAKEAERWSYGITSDMWSLRPEHFKMIYTPEPPTAEQEAIVRFL 425 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + R++ + I LL E++ + I AVT Sbjct: 426 DWANGRLERATRAKRKVIALLNEQKQAIIHQAVT 459 >gi|225076790|ref|ZP_03719989.1| hypothetical protein NEIFLAOT_01841 [Neisseria flavescens NRL30031/H210] gi|224951888|gb|EEG33097.1| hypothetical protein NEIFLAOT_01841 [Neisseria flavescens NRL30031/H210] Length = 430 Score = 223 bits (569), Expect = 3e-56, Method: Composition-based stats. Identities = 102/432 (23%), Positives = 171/432 (39%), Gaps = 18/432 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + Y YKDSGV+W+G IP W++ + N R ++ K+ + L + K Sbjct: 2 RRYESYKDSGVEWLGKIPSQWELTIGMNVFRENK-RDNKGMKEKTVLSLSYGQI-IIKPE 59 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVL 120 K T I I+ + +A GI ++ +L L+ + Sbjct: 60 EKLVGLVPESFETYQIVEPNDIIIRCTDLQNDQTSLRTGLAKDKGIITSAYLNLKVINNH 119 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + L ++ + + +P+ PL+EQ I + + +T + Sbjct: 120 SAKFLHYYLHTLDITKVLYKFGSGLRQNLSFLDFKRLPIIDIPLSEQQKIAQFLDDKTAK 179 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID + + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HW VK + Sbjct: 180 IDQAVDLAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWSVKKIKHV 239 Query: 241 ------VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE----TYQIVDPGE 290 + I+ I L NI N + + V G+ Sbjct: 240 TSKIGSGITPLGGGSNYIDGGIPLLRSQNIHFDRIDLNDVARISEFTHNSMKNSKVRKGD 299 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ R E + + I++ +L L+ S K + Sbjct: 300 VLLNITGGSL-GRCFYVDSNEEMNVNQHVCIIRPNKKINTIFLNMLLASEVGQKQIWFFQ 358 Query: 351 -SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G R+ L F+ +K + +P +KEQ I ++ + A+ID + I LKE +S Sbjct: 359 QGGGREGLNFQAIKNFYLPLPDLKEQQKIAIYLDKQVAKIDQAIALKTAHIEKLKEYKSV 418 Query: 410 FIAAAVTGQIDL 421 I VTG++ + Sbjct: 419 LINDVVTGKVRV 430 >gi|52549656|gb|AAU83505.1| restriction endonuclease S subunits [uncultured archaeon GZfos29E12] Length = 438 Score = 223 bits (569), Expect = 3e-56, Method: Composition-based stats. Identities = 114/436 (26%), Positives = 190/436 (43%), Gaps = 17/436 (3%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT-----GRTSE--SGKDIIYIGLE 54 K YP+YKDS ++WIG IP+ W+V IK + + G TSE S + + Sbjct: 1 MKLKPYPKYKDSEIEWIGEIPEGWEVNKIKNTSYVKGRIGWHGLTSEEYSDEGAYLVTGT 60 Query: 55 DVESGTGKYLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQF 111 D + G ++ + +L K G + A+I ++ Sbjct: 61 DFKDGVIEWEDCHHVGWDRYKEDPYIHLKEDDLLITKDGTIGKVALIKFLPNKATLNSGI 120 Query: 112 LVLQP--KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 +++P K P+ + L S + + I GAT+SH + PIP EQV Sbjct: 121 FLVRPLNKKYFPKFMYWMLNSTVFERFFDYIKTGATISHLYQETFERFFFPIPLKQEQVA 180 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + +T +ID LI + R IELLKEK+ AL+ + VTKGL+P+VKMKD GI W+G +P Sbjct: 181 IASFLDKKTAKIDALIEKDKRLIELLKEKRTALIDHAVTKGLDPNVKMKDFGIVWIGKIP 240 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + ++ PF + + E LS + + + + + Y P Sbjct: 241 EDAKIMPFRRVCYVNQG--LQFPEDKRLSEPDEKSKIYITIKYIHADEDGVKEYIPNPPR 298 Query: 290 EIVFRFIDLQNDKRSLR--SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 ++ + D+ + E + + ID YL + ++ + KV Sbjct: 299 GVICKKEDVLLARTGATGEVITNQEGVFHNNFFKVNYNSKIDRDYLVYYLKMDSIKKVLL 358 Query: 348 AMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 L + P ++ I++Q I ++ +TA+ID ++ IE+ I LL+E Sbjct: 359 LKAGVTTIPDLNHDAFLSTPFILYSIEKQKQIAEYLDKKTAKIDKNIKLIEKKIKLLEEY 418 Query: 407 RSSFIAAAVTGQIDLR 422 + S I VTG++D+R Sbjct: 419 KKSLINHVVTGKVDVR 434 >gi|15839311|ref|NP_299999.1| type I restriction-modification system specificity determinant [Xylella fastidiosa 9a5c] gi|9187842|gb|AAF85758.1|AE004078_10 type I restriction-modification system specificity determinant [Xylella fastidiosa 9a5c] Length = 468 Score = 223 bits (569), Expect = 3e-56, Method: Composition-based stats. Identities = 90/431 (20%), Positives = 168/431 (38%), Gaps = 14/431 (3%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 YP Y+ ++W+ A+P+HW K F + R+ +++ + + + T + Sbjct: 8 YPNYRQPKMRWLPAVPEHWNEQRAKTFFREVDERSKTGQEEL--LSVSHLTGVTSRSQKN 65 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + + + G I+ L ++ + GI S + V +P Sbjct: 66 VTMFKAASYVGSKLCRPGDIVINTLWAWMAALGASRHVGIVSPAYGVYRPHHADSFNPAY 125 Query: 127 WLLSIDVTQRIEAICEGAT-----MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + +T +I + PP EQ I + + I Sbjct: 126 LDYLLRTRAYVAEYIGRSTGIRSSRLRLYPNQFLDIALIQPPRPEQDQIVAYLRVQDAHI 185 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 I + I+LL E+K+ ++ + VT+GL+ V +K SGIEW+G VP HW+VKP V Sbjct: 186 ARFIKVKRDLIKLLTEQKRRIIDHAVTRGLDASVALKPSGIEWLGDVPVHWDVKPLKRWV 245 Query: 242 TELNR----KNTKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFI 296 K E + + + E + + +++ G+ + + Sbjct: 246 RLNASTLGEKTDPDFEFRYVDIGSVQTGRLAKELERIRFEVAPSRARRVLRRGDTIISTV 305 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-Q 355 S + + T + + + YL ++++S A G+ Sbjct: 306 RTYLKAIWYVSEEADDLIASTGFAVLTPGNSAEPEYLGYVIQSSAFVNRVAANSIGIAYP 365 Query: 356 SLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 ++ + R PV +PP + EQ I I E+A +D + + E+ I L++E R I Sbjct: 366 AIAETVLGRFPVALPPTVDEQQAIVAHIKTESAPLDDAITRTEEEITLIREYRDRLITDV 425 Query: 415 VTGQIDLRGES 425 VTGQ+D+RG Sbjct: 426 VTGQVDVRGWQ 436 >gi|300825349|ref|ZP_07105428.1| conserved hypothetical protein [Escherichia coli MS 119-7] gi|300522184|gb|EFK43253.1| conserved hypothetical protein [Escherichia coli MS 119-7] Length = 441 Score = 223 bits (569), Expect = 3e-56, Method: Composition-based stats. Identities = 88/435 (20%), Positives = 180/435 (41%), Gaps = 12/435 (2%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 + Y YKDSGV+W+G IP W ++ K +L + + + + L + Sbjct: 4 ISEMPKYEVYKDSGVEWLGDIPASWSLLANKHIFRLKKKQVGKRSSEYDLLSLT-LRGVI 62 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKD 118 + + ++ T G ++ R ++ F+G+ + + V + D Sbjct: 63 KRDMENPEGKFPAEFDTYQEVQCGDFIFCLFDVEETPRTVGLSPFNGMITGAYTVFELND 122 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + +++ + + +PP +Q I + + Sbjct: 123 NFDNRFLYYFYMNLDAKKMLKPLYRGLRNTIPKDSFLSFKTFVPPHEQQTRIANFLDKKI 182 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 ID I+ + + I LLKE KQ ++ VT+GL+P+V MKDSG++W+G +P+HWEV P Sbjct: 183 ALIDEAISIKEKQINLLKEHKQIIIQQAVTQGLDPNVPMKDSGVDWIGDIPEHWEVVPLK 242 Query: 239 A--LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP--ESYETYQIVDPGEIVFR 294 +++ + + + + + L+ + L P + + + + + G+++ Sbjct: 243 RLAVLSPSVKVSNRKSKELVTFLAMEKVSTDGFIDQDTLMPICDVSQGFTVFNRGDVIVA 302 Query: 295 FIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCK--VFYAM 349 I N K + + E G ++ + ++ +L+ S Sbjct: 303 KITPCFENGKSAWLNNLQTEFGYGSTEFHVLRCGQRIIGSFLYLIVSSPLFLNAGEAMMT 362 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 GS ++ + ++ P +P + EQ I + + ++IDV+V I LKE +++ Sbjct: 363 GSAGQKRVPSSFIQNFPTAIPGVAEQEKIVSKVKELFSQIDVVVASTVNQIEKLKEYKTT 422 Query: 410 FIAAAVTGQIDLRGE 424 I +AVTG+I + E Sbjct: 423 LINSAVTGKIKITPE 437 >gi|126664066|ref|ZP_01735060.1| type I restriction-modification system, S subunit [Flavobacteria bacterium BAL38] gi|126624015|gb|EAZ94709.1| type I restriction-modification system, S subunit [Flavobacteria bacterium BAL38] Length = 450 Score = 223 bits (567), Expect = 5e-56, Method: Composition-based stats. Identities = 103/444 (23%), Positives = 178/444 (40%), Gaps = 23/444 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVE 57 K+YP+YK S + W IP++W +K G T + DI ++ ++ Sbjct: 2 KSYPKYKPSKIVWYPEIPENWDYCKVKHIANTYAGGTPSTVVDSFWHNGDIPWLPSGKLQ 61 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 + K + S+ +L G F + + + Sbjct: 62 NCEIISAEKFITNEGLIGSSTKWIKPNTVLVALTGATCANIGYLTFQACANQSVIAVDEN 121 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + +++ +I G + + + N+ + P L EQ+ I + + + Sbjct: 122 PEKANSRFLYYMFLNMRSQILTHQTGGAQAGINDSDVKNLYLLNPSLEEQIKIADYLDYK 181 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 T ID I ++ R IELLKEK+QA+++ VTKGLNP+ MKDSG+EW+G +P++WEVK Sbjct: 182 TNLIDATIEKKKRLIELLKEKRQAVINEAVTKGLNPNAPMKDSGLEWLGEIPENWEVKKV 241 Query: 238 FALVTELNR----------KNTKLIESNILSLSYGNIIQKLETRNMGLKP----ESYETY 283 L++ N K L ++ I GN+I+ T E Sbjct: 242 KYLLSSENGIKIGPFGSALKLDTLTDNGIKIYGQGNVIKDDFTLGHRYIDPERFEKDFKQ 301 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD-- 341 + G+I+ + + S+ + D + L+ D Sbjct: 302 YEILDGDILITMMGTTGKSKVFNSSYEKGILDSHLLRLRFNEDLFDGRLFSILLEQSDYV 361 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 ++ + L VK L ++ P ++ Q +I N I+ ID++ KI I Sbjct: 362 FQQLALNSVGSIMAGLNSSIVKELIIITPKLEIQKEILNYIDENCKIIDIISSKILSQIE 421 Query: 402 LLKERRSSFIAAAVTGQIDLRGES 425 L+ R S I+ AVTG+ID+R Sbjct: 422 KLQTYRQSLISEAVTGKIDVREWQ 445 >gi|309776566|ref|ZP_07671546.1| type I restriction-modification system specificity determinant [Erysipelotrichaceae bacterium 3_1_53] gi|308915667|gb|EFP61427.1| type I restriction-modification system specificity determinant [Erysipelotrichaceae bacterium 3_1_53] Length = 457 Score = 223 bits (567), Expect = 6e-56, Method: Composition-based stats. Identities = 101/434 (23%), Positives = 180/434 (41%), Gaps = 19/434 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSESGKD------IIYIGLEDVE 57 K Y YK ++W IP W VVP+KR + +G T +S + + +I D+ Sbjct: 2 KTYSDYKKCKIKWCPTIPSSWDVVPLKRIFSNIGSGATPKSNNNNYYGGNVSWIQSGDLH 61 Query: 58 SGTGKYLPKDGNSRQS-DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + K D S + I+ I G + I+ D + + Sbjct: 62 NHFLSSTKKRITDSALRDVSALKIYKTPFISIAMYGASIGNLSISKIDSCTNQACCNMSG 121 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 E + L + ++ G T + I N+ +P+P + EQ I + Sbjct: 122 SAGNIE--YFYYLLSSCKDYMISLSAGGTQPNISQLIIKNLILPLPSVNEQDQIVRFLDW 179 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 + I+ LI + + I ++E K+ +++ VT GLN +V MK SG+EW+G +P+HW++ Sbjct: 180 KVSEINKLINVKEKEIVQIQELKKTVINDAVTHGLNRNVPMKYSGVEWLGDIPEHWKIIK 239 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETR--NMGLKPESYETYQIVDPGEIVFR 294 ++ + KN + + G I++ ++ + N P+ Y++V G+ Sbjct: 240 LRKILHPFSEKNHPELPLLSVVREKGVIVRDVDDKESNHNFIPDDLSGYKMVKKGQFAMN 299 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL- 353 + + GI++ AY + Y + +RS F G+ Sbjct: 300 KMKAWQGSYGVSDY----TGIVSPAYFIFDVDFENLEYFHYAIRSKVYVNFFAQASDGIR 355 Query: 354 --RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + L +K +P +VPP +EQ +I N I R + +E I L E ++ I Sbjct: 356 VGQWDLSMNKMKEIPFIVPPEEEQKEIVNYIPKALERYTNAINTLESQIEALHELKNKLI 415 Query: 412 AAAVTGQIDLRGES 425 + AVTG+ID+R Sbjct: 416 SDAVTGKIDVRNAE 429 >gi|54308077|ref|YP_129097.1| type I restriction-modification system specificity determinant [Photobacterium profundum SS9] gi|46912503|emb|CAG19295.1| hypothetical type I restriction-modification system specificity determinant [Photobacterium profundum SS9] Length = 437 Score = 222 bits (565), Expect = 9e-56, Method: Composition-based stats. Identities = 92/434 (21%), Positives = 178/434 (41%), Gaps = 22/434 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 Y Y +SGV+WIG IP+HW + K R+ +++ + + + T + Sbjct: 8 PKYEAYNESGVEWIGNIPEHWNITKAKYLFNEVDERSVTGHEEL--LSVSHITGVTPRSE 65 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 D S I++ + ++ +++ GI S + V + K Sbjct: 66 KNVSMFMAEDYSGSKTCQADDIVFNTMWAWMGALGVSERSGIVSPSYGVFRQKFTNTFNA 125 Query: 125 QGWLLSIDVTQRIEAICE-----GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + IE + ++ ++ M P + EQ I + + +T Sbjct: 126 KYLEYLLKTPKYIEHYNKVSTGLHSSRLRFYGHMFFDMKMGYPHIDEQNGIIKFLDNKTN 185 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +ID + + I LLKE+KQ ++ VT+GLNPDV M+DSG++W+G +PDHW +P Sbjct: 186 KIDEAAAIKEKQISLLKERKQIIIQQAVTRGLNPDVPMRDSGVDWIGEIPDHWCSEPIKY 245 Query: 240 L---VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP------ESYETYQIVDPGE 290 + + K ++ + + +++ + K + + + + PG+ Sbjct: 246 SLKGIIDCEHKTAPFVDKKEFFVVRTSNVKQGKLVIEDAKYTNEYGYKEWTSRGVPFPGD 305 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVF-Y 347 I+ + + + + + +K + L+ S + + Sbjct: 306 ILLTREAPAGEACLVP---DDRKLCLGQRMVWLKVDRTRLLPEFALSLIYSSVVRTYIDF 362 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 D+K +PV++PPI EQ + I + +ID +E +Q I LKE + Sbjct: 363 LSAGSTVLHFNMADIKNIPVILPPINEQAILVTHIKKHSDKIDKAIELEQQQISKLKEYK 422 Query: 408 SSFIAAAVTGQIDL 421 S I +AVTG+I + Sbjct: 423 SILINSAVTGKIKV 436 >gi|38505781|ref|NP_942400.1| type I restriction-modification system S subunit [Synechocystis sp. PCC 6803] gi|38423805|dbj|BAD02014.1| type I restriction-modification system S subunit [Synechocystis sp. PCC 6803] Length = 464 Score = 222 bits (565), Expect = 9e-56, Method: Composition-based stats. Identities = 96/438 (21%), Positives = 172/438 (39%), Gaps = 21/438 (4%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVP-IKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 K YP+YKDSGV W+G IP HW + P ++ T ++ + + Sbjct: 1 MKLKPYPEYKDSGVSWLGQIPAHWDIKPGFAFLSERKEKNTGMKESTVLSLSYGQIVV-- 58 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR------KAIIADFDGICSTQFLVL 114 K K T I G I+ G L+ + GI ++ +L L Sbjct: 59 -KPPEKLHGLVPESFETYQIAEPGNIII--RGTDLQNDKVSLRVGKVRNRGIITSAYLCL 115 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + K+ LL +I + + +P+ PP +EQ I + + Sbjct: 116 ETKEKFNPDYAHLLLHGYDLMKIYYGMGSGLRQNLSFSDFKRLPLLAPPESEQSKINKYL 175 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 + V+I+ I + R IELLKE+KQ +++ VT+GL+P+VK+K SG++W+G +P++W Sbjct: 176 QSIQVQINKFIRNKRRLIELLKEQKQNIINQAVTRGLDPNVKLKPSGVKWIGDIPEYWSF 235 Query: 235 KPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMG-----LKPESYETYQIV 286 + K+ I + G++ ++ + + Sbjct: 236 LKLKRIACVKTGYAFKSDHYKSVGIPLIRIGDLKHSGLVDIKQAVKLQESDLTHFSCFKI 295 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G+++ K + Q YL +++ S K Sbjct: 296 QYGDLLMAMTGATIGKVAKYQHQTEALLNQRVCSFRSFESKCFQDYLLFILSSEVYLKQV 355 Query: 347 YAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G + ++ + + VPPI EQ I + + +T D V + E+ I L++E Sbjct: 356 TIFCYGGAQPNISDSTLMSFKIPVPPISEQQAILSYVQEQTKTTDSAVSRAEREIELIQE 415 Query: 406 RRSSFIAAAVTGQIDLRG 423 + ++ VTGQ+D+R Sbjct: 416 YYTRLMSDVVTGQVDVRD 433 >gi|325297666|ref|YP_004257583.1| restriction modification system DNA specificity domain [Bacteroides salanitronis DSM 18170] gi|324317219|gb|ADY35110.1| restriction modification system DNA specificity domain [Bacteroides salanitronis DSM 18170] Length = 429 Score = 222 bits (565), Expect = 9e-56, Method: Composition-based stats. Identities = 100/431 (23%), Positives = 177/431 (41%), Gaps = 21/431 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 Y +YKDSGVQW+G IP HW+V + + S+ D + + G Y Sbjct: 4 RYSEYKDSGVQWLGKIPSHWEVKRLASCFTERKVKVSDKEFDPLSVT------KNGIYPQ 57 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELL 124 + ++ +D + G + + +A DG S +VL+P K++ P+ Sbjct: 58 LENVAKTNDGDNRKLVLSGDFVINSRSDRKGSSGVAKQDGSVSLINIVLKPRKNIYPDFC 117 Query: 125 QGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L + G + + + I + +P L EQ I + T +ID Sbjct: 118 NYLLKCYSFIEEYYRNGRGIVADLWTTRYDEMKTIKISVPLLNEQKAIVRYLNKVTSKID 177 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I ++ + I+LL E+KQ +++ VTKGLNPDV MK+SG+EW+G +P HW Sbjct: 178 EAIAQQQKMIDLLNERKQIIINNAVTKGLNPDVPMKNSGVEWIGKIPKHWTTIRLGYCAW 237 Query: 243 ELNR------KNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIV 292 R K+ + +++ LS NI+ N + + G+I+ Sbjct: 238 IRARLGWKGLKSDEYVDNGYPFLSAFNIVNNKLDWNKLNYINKFRYEESPEIKLRIGDIL 297 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 K + + + + + +L + + S K + + Sbjct: 298 LVKDGAGIGKCARVDSLPLGEATANGSLAFITANERVYYKFLHYYIISNSFNKYKDLLIT 357 Query: 352 G-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G L ++K + + +PP+ EQ+ I ++ ID ++E Q I L+ER+ Sbjct: 358 GMGVPHLTQGEIKNMMLPIPPLNEQYIIVQRLDKNINVIDNILEHYLQQITFLQERKRII 417 Query: 411 IAAAVTGQIDL 421 I VTG++ + Sbjct: 418 INDVVTGKVKV 428 >gi|332666806|ref|YP_004449594.1| restriction modification system DNA specificity domain-containing protein [Haliscomenobacter hydrossis DSM 1100] gi|332335620|gb|AEE52721.1| restriction modification system DNA specificity domain protein [Haliscomenobacter hydrossis DSM 1100] Length = 428 Score = 222 bits (565), Expect = 1e-55, Method: Composition-based stats. Identities = 107/434 (24%), Positives = 190/434 (43%), Gaps = 24/434 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M + + YP YK+SGV+WI +P HW+VV +KR T+ + L Sbjct: 1 MMNVQKYPAYKNSGVEWIETVPSHWEVVKLKRLFCEKKKITN--------VDLPCGSISF 52 Query: 61 GKYLPKDGNSRQS-DTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQ 115 GK + KD + +KG+ L L + ++D D + S+ ++VL Sbjct: 53 GKVVYKDEEKIPEATKKSYQAVSKGEYLLNPLNLNYDLISLRIALSDKDVVVSSGYIVLN 112 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 L + WLL ++ + G ++ IG+ + PPL EQ I + + Sbjct: 113 SIVKLDKTYFKWLLHRYDVAFMKTLGSGV-RQTINFSDIGDSELIFPPLPEQTAIAQFLD 171 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 +T ID I + + IELLKE++Q L+ VT+GLNP+VKMK SG+EW+G VP+ WEV Sbjct: 172 RKTALIDQAIDIKQKQIELLKERRQILIHQAVTRGLNPEVKMKASGVEWIGEVPEGWEVV 231 Query: 236 PFFALVTELNR--KNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPG 289 L + K E + + N+ + + + +E ++ Sbjct: 232 RLKTLGKIKYGLGQPPKTKEDGLPLIRATNVERGRIVEKDLIFVDPEDIPWERDPMLKEN 291 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD--LCKVFY 347 +I+ ++ G I M + P I+ +L++ + + +++ Sbjct: 292 DIIVVRSGAYTGDSAIIPK--HYAGSIAGYDMVLTPTSINPRFLSYTLLAKYVLYDQLYL 349 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + L E++ + ++ PP EQ I + + +I + +Q I L+E + Sbjct: 350 LRMRAAQPHLNAEELGQTIIVCPPKLEQQQIFEYLENISKKIATAITLKQQEIAKLQEYK 409 Query: 408 SSFIAAAVTGQIDL 421 ++ I +AVTG+I + Sbjct: 410 ATLINSAVTGKIKV 423 >gi|229198631|ref|ZP_04325333.1| hypothetical protein bcere0001_41580 [Bacillus cereus m1293] gi|228584913|gb|EEK43029.1| hypothetical protein bcere0001_41580 [Bacillus cereus m1293] Length = 440 Score = 222 bits (565), Expect = 1e-55, Method: Composition-based stats. Identities = 104/439 (23%), Positives = 183/439 (41%), Gaps = 24/439 (5%) Query: 4 YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK Y Q+KDS V+WIG IP+HW++ + + + + S+ + + + G Sbjct: 3 YKQYKQHKDSSVEWIGEIPQHWEIKKVSAIFEQRSEKVSDKDFEPLSVT------KMGIL 56 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + ++ + K + ++ FDG S V++PK + Sbjct: 57 KQLENVAKTDNNDNRKKVLKNDFVINSRSDRKGSCGVSQFDGSVSLICTVIKPKTKNTYM 116 Query: 124 LQGWLLSIDVTQRIEAICEGATM----SHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L + E G + W I +P PP EQ I + Sbjct: 117 DYYHHLFRNKMFSEEFYRWGRGIVDDLWSTRWDEFKRILIPSPPYEEQKSIANYLNYIYE 176 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP--- 236 I+ LI + + + L++ +Q+L++ VT GLNP KMKDSG+EW+G +P HWE+K Sbjct: 177 TIENLINNKKQQMATLQQYRQSLITETVTCGLNPYAKMKDSGLEWIGQIPSHWEIKKNKM 236 Query: 237 ----FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDP 288 + K E I L N+ + + + + + Sbjct: 237 ITNSITVGIVITPSKYYIEGEGGIPCLRSLNVKEGEIINTDLVYISNESNELLSKSKIYE 296 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G++V + I + K I+S +L +L+ S + + Sbjct: 297 GDLVSIRTGDTGVTSVVPKEYDGANCI--DLIIIRKSTKINSAFLCYLLNSNVAKQQYRN 354 Query: 349 MGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + G +Q E K + PP +EQ +I N ++++T ID +++KI++ I+LL++ R Sbjct: 355 LSGGAIQQHFNIEMAKNTYITYPPFEEQLEIVNYLDLKTTEIDSVIKKIKEQIILLEKYR 414 Query: 408 SSFIAAAVTGQIDLRGESQ 426 S I AVTG+ID+R ++ Sbjct: 415 QSLIYEAVTGKIDVRSYTE 433 >gi|254228173|ref|ZP_04921602.1| Restriction endonuclease S subunits [Vibrio sp. Ex25] gi|262394006|ref|YP_003285860.1| type I restriction-modification system specificity subunit S [Vibrio sp. Ex25] gi|151939246|gb|EDN58075.1| Restriction endonuclease S subunits [Vibrio sp. Ex25] gi|262337600|gb|ACY51395.1| type I restriction-modification system specificity subunit S [Vibrio sp. Ex25] Length = 437 Score = 222 bits (565), Expect = 1e-55, Method: Composition-based stats. Identities = 96/434 (22%), Positives = 179/434 (41%), Gaps = 21/434 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + +Y D+G+ W+G+IP HW+ P+ +KL ++ + + + ++ G ++ Sbjct: 8 PKHNEYTDTGISWLGSIPSHWEAAPLCSVSKL---KSITNHVGEPLLSV-YLDKGVIRFD 63 Query: 65 P---KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 K N D S + G + + I+ GI S +LVLQ + Sbjct: 64 EVEAKRTNVTSLDLSKYQLVEPGDFVLNNQQAWRGSVGISAHRGIVSPAYLVLQLSSKIY 123 Query: 122 ELLQGWLLSIDVT---QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 +L + + G + W + + P L EQ+ I + +T Sbjct: 124 PRFGNYLFRDGSMVANYLVNSKGVGTIQRNLYWPQLKRALVFFPGLDEQIAIANYLDEKT 183 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +ID I + + IELLKE+KQ ++ VT+GLNPDV MKDSG++W+G +P+HW V Sbjct: 184 SQIDEAIAIKQKQIELLKERKQIIIQQAVTQGLNPDVPMKDSGVDWIGKIPEHWTVSKIG 243 Query: 239 ALVTELNRKNT------KLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPG 289 N E I +S G + + + L + + +I G Sbjct: 244 HYARVYNGSTPSRDVKRYWDEGTIPWMSSGKVNDYIISTPSELITTAALRECSLRIFPKG 303 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ + + + SA + +I + P + +V Sbjct: 304 TVLIGIVGQGKTRGT--SAMLAIDAVINQNVAGIIPSEKILSEFLHQYLIQAYDEVRNQG 361 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +++L + + + P I EQ +I + I +++ ++D ++ I LKE +++ Sbjct: 362 QGSNQEALNCQILSSFKIAFPSIIEQKEIVHFIAIQSQKLDQSIDIQFNQIEKLKEYKTT 421 Query: 410 FIAAAVTGQIDLRG 423 I +AVTG+I + Sbjct: 422 LINSAVTGKIKVTE 435 >gi|259156142|gb|ACV96090.1| type I restriction-modification protein [Providencia alcalifaciens Ban1] Length = 436 Score = 221 bits (564), Expect = 1e-55, Method: Composition-based stats. Identities = 97/425 (22%), Positives = 174/425 (40%), Gaps = 13/425 (3%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 QY DSG +WIG IP+HW +V + + + + + V + Sbjct: 12 QYIDSGYEWIGEIPQHWDLVKLGSCLSSVSVKNCPELPLLSITREQGVIERDVDDQELNH 71 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 N D S KGQ K+ + ++ F GI S + V + W Sbjct: 72 NFIPDDLSGYKKLEKGQFGMNKMKAWQGSYGVSKFTGIVSPAYFVFDFTKAIDPEFFNWA 131 Query: 129 LSIDVTQRIEAICEG---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + + + IP +P +Q LI + +T +I+ I Sbjct: 132 IRSKLYVSFFGSASDGVRIGQWDLSKTRMKVIPFVLPSEEDQSLIANFLAKKTTQINDAI 191 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP----FFALV 241 + + I LLKE+KQ ++ VT+GL+P+V MKDSG+ W+G +P HWEV+ F Sbjct: 192 AIKEQQINLLKERKQIIIQQAVTQGLDPNVPMKDSGVNWIGKIPAHWEVRRSKFVFTQRK 251 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + + +L + + + ++L + + + + V+ + V Q Sbjct: 252 ERAWKDDVQLSATQAYGVIPQDQYEELTGKRVVKIQLHLDKRKHVEKDDFVISMRSFQ-- 309 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKF 359 + I +S + ID ++ +L++ S +R Q L F Sbjct: 310 --GGLERAWSQGCIRSSYVVLRALEEIDPSFYGYLLKLPSYIAALQQTASFIRDGQDLNF 367 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ++ ++ + +PPI+EQ +I N ++ D +E + I LKE +++ I +AVTG+I Sbjct: 368 DNFSKVDLFIPPIEEQKEIANYVSAFMKSSDEGIELLLAQIEKLKEYKTTLINSAVTGKI 427 Query: 420 DLRGE 424 + E Sbjct: 428 KITPE 432 Score = 107 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 49/210 (23%), Positives = 92/210 (43%), Gaps = 10/210 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR- 271 + DSG EW+G +P HW++ + ++ ++ KN + ++ G I + ++ + Sbjct: 9 KHGQYIDSGYEWIGEIPQHWDLVKLGSCLSSVSVKNCPELPLLSITREQGVIERDVDDQE 68 Query: 272 -NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGID 329 N P+ Y+ ++ G+ + + GI++ AY ID Sbjct: 69 LNHNFIPDDLSGYKKLEKGQFGMNKMKAWQGSYGVSKF----TGIVSPAYFVFDFTKAID 124 Query: 330 STYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + W +RS F + G+ + L +K +P ++P ++Q I N + +T Sbjct: 125 PEFFNWAIRSKLYVSFFGSASDGVRIGQWDLSKTRMKVIPFVLPSEEDQSLIANFLAKKT 184 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +I+ + EQ I LLKER+ I AVT Sbjct: 185 TQINDAIAIKEQQINLLKERKQIIIQQAVT 214 >gi|220933784|ref|YP_002512683.1| type I restriction-modification system, S subunit [Thioalkalivibrio sp. HL-EbGR7] gi|219995094|gb|ACL71696.1| type I restriction-modification system, S subunit [Thioalkalivibrio sp. HL-EbGR7] Length = 458 Score = 221 bits (563), Expect = 2e-55, Method: Composition-based stats. Identities = 104/444 (23%), Positives = 190/444 (42%), Gaps = 18/444 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR------TSESGKDIIYIGLE 54 M Y+AYP+Y+++ + IP HW IK + G+ + + + + Y+ Sbjct: 1 MGKYQAYPEYRETRHDLLPPIPVHWMTGQIKNAHDVVLGKMLQSDAKTPADRLLPYLRAA 60 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFL 112 +V G G ++ + G R A+ + Sbjct: 61 NVNWGGVDLSTVKEMWFSPAERKALRLMVGDVVISEGGDVGRSAVWQGELPECYFQNAIN 120 Query: 113 VLQPKDVLPELLQGWLL-SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 +PK + + I I+ IC +T+ H + + P PP EQ I Sbjct: 121 RARPKGEHSSRYLYYWMSFIKSAGYIDIICNKSTIPHYTAEKVQGTPFLFPPAGEQAGIA 180 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + ET +ID LI ++ R IELLKEK+QA++S+ VTKGLNPD MKDSG+EW+G VP H Sbjct: 181 AFLDHETAKIDRLIAKQQRLIELLKEKRQAVISHAVTKGLNPDAPMKDSGVEWLGEVPAH 240 Query: 232 WEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQK---LETRNMGLKPESYETY 283 W ++ + + +I +S ++ + + ++ + + Sbjct: 241 WRLEKLKYTAIFKGGGTPSKDSPEYWGGDIPWVSPKDMKSRYVADSQDKITVEAIAASST 300 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDL 342 ++ PG+++ + + ++E + S + ++ + D Sbjct: 301 SLIGPGQVLVVVRSGILQRTIPVAVNLVEVTLNQDMKAIDFRDETRSEFFSYFVEGHEDN 360 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + +S++ E + V +PP E +I +N + + +L EK ++I L Sbjct: 361 LLLEWRKQGATVESIEQEYLGNTMVPMPPPSEMMEILQFLNGQLEKYRLLTEKATRAIEL 420 Query: 403 LKERRSSFIAAAVTGQIDLRGESQ 426 L+E R++ I+AAVTG+ID+RG + Sbjct: 421 LREHRTALISAAVTGKIDVRGWQK 444 >gi|121585820|ref|ZP_01675614.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121727684|ref|ZP_01680779.1| conserved hypothetical protein [Vibrio cholerae V52] gi|147674628|ref|YP_001217309.1| hypothetical protein VC0395_A1366 [Vibrio cholerae O395] gi|153817792|ref|ZP_01970459.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|227081913|ref|YP_002810464.1| hypothetical protein VCM66_1706 [Vibrio cholerae M66-2] gi|298498162|ref|ZP_07007969.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|121549958|gb|EAX59976.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121629981|gb|EAX62389.1| conserved hypothetical protein [Vibrio cholerae V52] gi|126511612|gb|EAZ74206.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|146316511|gb|ABQ21050.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227009801|gb|ACP06013.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|227013668|gb|ACP09878.1| conserved hypothetical protein [Vibrio cholerae O395] gi|297542495|gb|EFH78545.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 462 Score = 221 bits (563), Expect = 2e-55, Method: Composition-based stats. Identities = 102/443 (23%), Positives = 185/443 (41%), Gaps = 24/443 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDV 56 +K Y YKDS QWIG IP HW+V +K + G +I+ + + D Sbjct: 22 IKKMPKYESYKDSCEQWIGDIPAHWEVYRLKSAVYECSNGIWGSDPNGRDEIVVLRVADF 81 Query: 57 ESGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLG----PYLRKAIIAD--FDGICS 108 + K + R + G +L K G + + ++ D + + S Sbjct: 82 DDHKLKISDEKLTYRSIPAKERQGRLLKNGDLLIEKSGGGDKTLVGRVVLFDKQYPAVTS 141 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQ--RIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 + PK+ + ++ S +I + + + D N IP E Sbjct: 142 NFVAKMTPKEWVISGFLKYVFSALYNNGVNYLSIKQTTGIQNLDASSYLNEKFCIPQKEE 201 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q I + + +T +I+ I + + IELLKE+KQ ++ VT+GLNPD MK SG++W+G Sbjct: 202 QYEIAKFLDNKTTQINEAIAIKQKQIELLKERKQIIIQQAVTQGLNPDATMKYSGVDWIG 261 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +P HW VK L+ E+N ++ +E + + + E E Y ++ Sbjct: 262 AIPGHWIVKRAKYLLDEINERSETGLEELLSVSHMTGVTPRSEKNVTMFMAEDYTGSKLC 321 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLC 343 G++V + + GI++ +Y + YL L++S Sbjct: 322 HSGDLVINIMWAWMGALGVS----DRTGIVSPSYGVFREQREGTFVPKYLEMLLKSTKYV 377 Query: 344 KVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + + + +G R + + + PP +EQ I I+ E +++D + + + Sbjct: 378 EYYNKVSTGLHSSRLRFYGHMLFDMALGFPPYEEQTQIVEYISRECSKVDEAITVQAEQV 437 Query: 401 VLLKERRSSFIAAAVTGQIDLRG 423 LKE +++ I +AVTG+I + Sbjct: 438 SKLKEYKTTLINSAVTGKIKVTE 460 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 83/236 (35%), Gaps = 13/236 (5%) Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L + L+S + K + KDS +W+G +P HWEV + V E + Sbjct: 8 LWLRFRGKLMSNTMIKKMPKYESYKDSCEQWIGDIPAHWEVYRLKSAVYECSNGIWGSDP 67 Query: 254 SNILSLSYGNIIQKLETR--------NMGLKPESYETYQIVDPGEIVFRFIDLQ--NDKR 303 + + + + + P +++ G+++ Sbjct: 68 NGRDEIVVLRVADFDDHKLKISDEKLTYRSIPAKERQGRLLKNGDLLIEKSGGGDKTLVG 127 Query: 304 SLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFE 360 + + ++ + P S +L ++ + + Q+L Sbjct: 128 RVVLFDKQYPAVTSNFVAKMTPKEWVISGFLKYVFSALYNNGVNYLSIKQTTGIQNLDAS 187 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +P +EQ++I ++ +T +I+ + ++ I LLKER+ I AVT Sbjct: 188 SYLNEKFCIPQKEEQYEIAKFLDNKTTQINEAIAIKQKQIELLKERKQIIIQQAVT 243 >gi|21243626|ref|NP_643208.1| type I restriction-modification system specificity determinant [Xanthomonas axonopodis pv. citri str. 306] gi|21109201|gb|AAM37744.1| type I restriction-modification system specificity determinant [Xanthomonas axonopodis pv. citri str. 306] Length = 426 Score = 221 bits (563), Expect = 2e-55, Method: Composition-based stats. Identities = 120/422 (28%), Positives = 192/422 (45%), Gaps = 9/422 (2%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 +K SG W+G +P HW V P+ + + V + + + Sbjct: 9 AFKSSGAPWLGNVPTHWVVKPLWSMYRQKKITGYPEETLLSVYRDHGVIEKSSR--DDNK 66 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGW 127 N D S + G ++ K+ + ++ GI S + V D L Sbjct: 67 NRASEDLSGYQLVVDGDLVTNKMKTWQGSIAVSSLRGIVSPAYYVYTKLHDGNNAYLHHL 126 Query: 128 LLSIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L S+ ++I +G + P+ IPP EQ I + T RID L+ Sbjct: 127 LRSVPYITGYQSISKGIRVGQWDLEADKFRLFPVLIPPRPEQDAIVAHLDRATTRIDALV 186 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 ++ FIELL+EK+QA++++ VTKGL+ MKDSG+EW+G VP W+ P + + Sbjct: 187 AKKTHFIELLREKRQAMITHAVTKGLDRGAPMKDSGVEWLGEVPVTWDTAPLKSFLQLRR 246 Query: 246 RK-NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 T + +LSL+ +I++ G P S++ YQ + GE+VF D+ R+ Sbjct: 247 DIVGTASANTRLLSLTLQGVIERDLENPTGKMPASFDGYQRISAGEMVFCLFDMDETPRT 306 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDS-TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + A + G++T AY +P YL + D K GLR++++ Sbjct: 307 VGVA--QQDGMLTGAYTVFRPQSDLWARYLYYFFLHVDEYKRLKPFYKGLRKTIRPGPFL 364 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + V P E I ++ T+RID L+ K E+SI LL+E R++ I AAVTG+IDLR Sbjct: 365 SIQVPRPRDGEAEAIVAHLDRATSRIDTLIAKTERSIELLREHRTALITAAVTGKIDLRP 424 Query: 424 ES 425 + Sbjct: 425 AA 426 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 59/207 (28%), Positives = 92/207 (44%), Gaps = 8/207 (3%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 K SG W+G VP HW VKP +++ + + +G I + N Sbjct: 7 HKAFKSSGAPWLGNVPTHWVVKPLWSMYRQKKITGYPEETLLSVYRDHGVIEKSSRDDNK 66 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTY 332 E YQ+V G++V + ++ S RGI++ AY H ++ Y Sbjct: 67 NRASEDLSGYQLVVDGDLVTNKMKTWQGSIAVSSL----RGIVSPAYYVYTKLHDGNNAY 122 Query: 333 LAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 L L+RS + ++ G+ + L+ + + PVL+PP EQ I ++ T RI Sbjct: 123 LHHLLRSVPYITGYQSISKGIRVGQWDLEADKFRLFPVLIPPRPEQDAIVAHLDRATTRI 182 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVT 416 D LV K I LL+E+R + I AVT Sbjct: 183 DALVAKKTHFIELLREKRQAMITHAVT 209 >gi|153817790|ref|ZP_01970457.1| restriction modification system DNA specificity domain [Vibrio cholerae NCTC 8457] gi|262169768|ref|ZP_06037459.1| type I restriction-modification system specificity determinant [Vibrio cholerae RC27] gi|126511610|gb|EAZ74204.1| restriction modification system DNA specificity domain [Vibrio cholerae NCTC 8457] gi|262022002|gb|EEY40712.1| type I restriction-modification system specificity determinant [Vibrio cholerae RC27] Length = 442 Score = 221 bits (562), Expect = 2e-55, Method: Composition-based stats. Identities = 102/443 (23%), Positives = 185/443 (41%), Gaps = 24/443 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDV 56 +K Y YKDS QWIG IP HW+V +K + G +I+ + + D Sbjct: 2 IKKMPKYESYKDSCEQWIGDIPAHWEVYRLKSAVYECSNGIWGSDPNGRDEIVVLRVADF 61 Query: 57 ESGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLG----PYLRKAIIAD--FDGICS 108 + K + R + G +L K G + + ++ D + + S Sbjct: 62 DDHKLKISDEKLTYRSIPAKERQGRLLKNGDLLIEKSGGGDKTLVGRVVLFDKQYPAVTS 121 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQ--RIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 + PK+ + ++ S +I + + + D N IP E Sbjct: 122 NFVAKMTPKEWVISGFLKYVFSALYNNGVNYLSIKQTTGIQNLDASSYLNEKFCIPQKEE 181 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q I + + +T +I+ I + + IELLKE+KQ ++ VT+GLNPD MK SG++W+G Sbjct: 182 QYEIAKFLDNKTTQINEAIAIKQKQIELLKERKQIIIQQAVTQGLNPDATMKYSGVDWIG 241 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +P HW VK L+ E+N ++ +E + + + E E Y ++ Sbjct: 242 AIPGHWIVKRAKYLLDEINERSETGLEELLSVSHMTGVTPRSEKNVTMFMAEDYTGSKLC 301 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLC 343 G++V + + GI++ +Y + YL L++S Sbjct: 302 HSGDLVINIMWAWMGALGVS----DRTGIVSPSYGVFREQREGTFVPKYLEMLLKSTKYV 357 Query: 344 KVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + + + +G R + + + PP +EQ I I+ E +++D + + + Sbjct: 358 EYYNKVSTGLHSSRLRFYGHMLFDMALGFPPYEEQTQIVEYISRECSKVDEAITVQAEQV 417 Query: 401 VLLKERRSSFIAAAVTGQIDLRG 423 LKE +++ I +AVTG+I + Sbjct: 418 SKLKEYKTTLINSAVTGKIKVTE 440 >gi|303229059|ref|ZP_07315865.1| type I restriction modification DNA specificity domain protein [Veillonella atypica ACS-134-V-Col7a] gi|302516270|gb|EFL58206.1| type I restriction modification DNA specificity domain protein [Veillonella atypica ACS-134-V-Col7a] Length = 435 Score = 221 bits (562), Expect = 2e-55, Method: Composition-based stats. Identities = 92/432 (21%), Positives = 169/432 (39%), Gaps = 18/432 (4%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKY 63 + KDSGV WIG IP +WK + +K +KL G +S + + D+ + Sbjct: 4 EMKDSGVPWIGKIPVNWKTIRLKYISKLINGFAFKSQDLKADGHYKVVRIGDLNNNKINL 63 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 G + I+ G +L G + K AD + C V + + L Sbjct: 64 EDCLGVDSVDNYRDYKIYM-GDVLVALSGATVGKVAFADNNIECYINQRVGIIRSLWGRL 122 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + +++ + + + IG I +PI I + + +IDT Sbjct: 123 IFHIFSLDKFIENLKSCLNDSAQPNLSIEDIGRISIPIYDKNTIKRIVRYLDIKCAQIDT 182 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 +I + IE L+E K+A+++ V KGL+ V+M D GIEW+ +P+HW++ Sbjct: 183 IIAKEQSVIEKLQEYKRAIITNAVVKGLDLTVEMADRGIEWIDSIPNHWKINRLIFSAYI 242 Query: 244 LNR------KNTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVF 293 R K + LS NI + + ++ G+++ Sbjct: 243 RARLGWKGLKADEYTSEGHPFLSAVNIQNDKLVWEDLNFINDNRYDESPEIKLELGDLLL 302 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSG 352 K ++ S+ + P+ +S YL + S + +G Sbjct: 303 VKDGAGIGKCAIVDQLPYGTATTNSSLGVITPYSELNSMYLYYFFESAIFQNYISRIKNG 362 Query: 353 -LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 L ++K + V++PP EQ I ++ + + +D ++ K + I L E + S I Sbjct: 363 MGVPHLTQGNLKNIMVVIPPYCEQEAIVAYLDDKCSNLDSIILKKQSLIDKLIEYKKSLI 422 Query: 412 AAAVTGQIDLRG 423 VTG+ ++ Sbjct: 423 YEVVTGKKEVPH 434 >gi|15641771|ref|NP_231403.1| hypothetical protein VC1768 [Vibrio cholerae O1 biovar El Tor str. N16961] gi|153821172|ref|ZP_01973839.1| conserved hypothetical protein [Vibrio cholerae B33] gi|9656290|gb|AAF94917.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|126521368|gb|EAZ78591.1| conserved hypothetical protein [Vibrio cholerae B33] Length = 462 Score = 221 bits (562), Expect = 2e-55, Method: Composition-based stats. Identities = 102/443 (23%), Positives = 185/443 (41%), Gaps = 24/443 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDV 56 +K Y YKDS QWIG IP HW+V +K + G +I+ + + D Sbjct: 22 IKKMPKYESYKDSCEQWIGDIPAHWEVYRLKSAVYECSNGIWGSDPNGRDEIVVLRVADF 81 Query: 57 ESGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLG----PYLRKAIIAD--FDGICS 108 + K + R + G +L K G + + ++ D + + S Sbjct: 82 DDHKLKISDEKLTYRSIPAKEHQGRLLKNGDLLIEKSGGGDKTLVGRVVLFDKQYPAVTS 141 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQ--RIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 + PK+ + ++ S +I + + + D N IP E Sbjct: 142 NFVAKMTPKEWVISGFLKYVFSALYNNGVNYLSIKQTTGIQNLDASSYLNEKFCIPQKEE 201 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q I + + +T +I+ I + + IELLKE+KQ ++ VT+GLNPD MK SG++W+G Sbjct: 202 QYEIAKFLDNKTTQINEAIAIKQKQIELLKERKQIIIQQAVTQGLNPDATMKYSGVDWIG 261 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +P HW VK L+ E+N ++ +E + + + E E Y ++ Sbjct: 262 AIPGHWIVKRAKYLLDEINERSETGLEELLSVSHMTGVTPRSEKNVTMFMAEDYTGSKLC 321 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLC 343 G++V + + GI++ +Y + YL L++S Sbjct: 322 HSGDLVINIMWAWMGALGVS----DRTGIVSPSYGVFREQREGTFVPKYLEMLLKSTKYV 377 Query: 344 KVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + + + +G R + + + PP +EQ I I+ E +++D + + + Sbjct: 378 EYYNKVSTGLHSSRLRFYGHMLFDMALGFPPYEEQTQIVEYISRECSKVDEAITVQAEQV 437 Query: 401 VLLKERRSSFIAAAVTGQIDLRG 423 LKE +++ I +AVTG+I + Sbjct: 438 SKLKEYKTTLINSAVTGKIKVTE 460 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 83/236 (35%), Gaps = 13/236 (5%) Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L + L+S + K + KDS +W+G +P HWEV + V E + Sbjct: 8 LWLRFRGKLMSNTMIKKMPKYESYKDSCEQWIGDIPAHWEVYRLKSAVYECSNGIWGSDP 67 Query: 254 SNILSLSYGNIIQKLETR--------NMGLKPESYETYQIVDPGEIVFRFIDLQ--NDKR 303 + + + + + P +++ G+++ Sbjct: 68 NGRDEIVVLRVADFDDHKLKISDEKLTYRSIPAKEHQGRLLKNGDLLIEKSGGGDKTLVG 127 Query: 304 SLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFE 360 + + ++ + P S +L ++ + + Q+L Sbjct: 128 RVVLFDKQYPAVTSNFVAKMTPKEWVISGFLKYVFSALYNNGVNYLSIKQTTGIQNLDAS 187 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +P +EQ++I ++ +T +I+ + ++ I LLKER+ I AVT Sbjct: 188 SYLNEKFCIPQKEEQYEIAKFLDNKTTQINEAIAIKQKQIELLKERKQIIIQQAVT 243 >gi|255744817|ref|ZP_05418767.1| hypothetical protein VCH_001143 [Vibrio cholera CIRS 101] gi|262161900|ref|ZP_06030918.1| hypothetical protein VIG_003073 [Vibrio cholerae INDRE 91/1] gi|255737288|gb|EET92683.1| hypothetical protein VCH_001143 [Vibrio cholera CIRS 101] gi|262028632|gb|EEY47287.1| hypothetical protein VIG_003073 [Vibrio cholerae INDRE 91/1] Length = 442 Score = 220 bits (561), Expect = 2e-55, Method: Composition-based stats. Identities = 102/443 (23%), Positives = 185/443 (41%), Gaps = 24/443 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDV 56 +K Y YKDS QWIG IP HW+V +K + G +I+ + + D Sbjct: 2 IKKMPKYESYKDSCEQWIGDIPAHWEVYRLKSAVYECSNGIWGSDPNGRDEIVVLRVADF 61 Query: 57 ESGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLG----PYLRKAIIAD--FDGICS 108 + K + R + G +L K G + + ++ D + + S Sbjct: 62 DDHKLKISDEKLTYRSIPAKEHQGRLLKNGDLLIEKSGGGDKTLVGRVVLFDKQYPAVTS 121 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQ--RIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 + PK+ + ++ S +I + + + D N IP E Sbjct: 122 NFVAKMTPKEWVISGFLKYVFSALYNNGVNYLSIKQTTGIQNLDASSYLNEKFCIPQKEE 181 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q I + + +T +I+ I + + IELLKE+KQ ++ VT+GLNPD MK SG++W+G Sbjct: 182 QYEIAKFLDNKTTQINEAIAIKQKQIELLKERKQIIIQQAVTQGLNPDATMKYSGVDWIG 241 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +P HW VK L+ E+N ++ +E + + + E E Y ++ Sbjct: 242 AIPGHWIVKRAKYLLDEINERSETGLEELLSVSHMTGVTPRSEKNVTMFMAEDYTGSKLC 301 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLC 343 G++V + + GI++ +Y + YL L++S Sbjct: 302 HSGDLVINIMWAWMGALGVS----DRTGIVSPSYGVFREQREGTFVPKYLEMLLKSTKYV 357 Query: 344 KVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + + + +G R + + + PP +EQ I I+ E +++D + + + Sbjct: 358 EYYNKVSTGLHSSRLRFYGHMLFDMALGFPPYEEQTQIVEYISRECSKVDEAITVQAEQV 417 Query: 401 VLLKERRSSFIAAAVTGQIDLRG 423 LKE +++ I +AVTG+I + Sbjct: 418 SKLKEYKTTLINSAVTGKIKVTE 440 >gi|53718590|ref|YP_107576.1| putative type I restriction enzyme specificity protein [Burkholderia pseudomallei K96243] gi|52209004|emb|CAH34943.1| putative type I restriction enzyme specificity protein [Burkholderia pseudomallei K96243] Length = 429 Score = 220 bits (561), Expect = 3e-55, Method: Composition-based stats. Identities = 112/440 (25%), Positives = 182/440 (41%), Gaps = 36/440 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKR-FTKLNTG------RTSESGKDIIYIGL 53 M Y +YKDSGV W+G +P HW V +K + +G T + + Sbjct: 1 MS-LPQYAKYKDSGVPWLGQVPTHWLVQRLKEVIAFIESGVSVNAIDTPAGEGEPGVLKT 59 Query: 54 EDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY-----LRKAIIADFDGICS 108 V SG + G ++ ++ + Sbjct: 60 SCVYSGEFTPSENKLVVPEELGRVACPVKAGTVIVSRMNTPDLVGASGVVRQNYANLYLP 119 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAE 166 + + K+ PE + W + ++E+ C G + M + + +P+PP +E Sbjct: 120 DRLWQVHFKNACPEFVHYWSQTHSYRAQVESACAGTSSSMKNLSQDEFRSFILPLPPPSE 179 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q I + ET +I+ LI E+ + + LL EK+QA +S VT+GLNPD KDSG+ W+ Sbjct: 180 QSAIATFLKHETRKINALIAEQEKLLTLLAEKRQATISRAVTRGLNPDAPTKDSGVAWLR 239 Query: 227 LVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 VP HW +KP V + +E NI +S G I + Sbjct: 240 EVPAHWNLKPMKRAVVFQRGHDLPSEDRVEGNIPVVSSGGISG-------------WHNA 286 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 IV + L + + +A V+ H YL ++++S Sbjct: 287 AATKGPTIVTGRYGTIGEFVLL----EEDCWPLNTALYTVQMHDNVPKYLWYMLQSLKHI 342 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + ++ S + D+ V +PP +EQ I ++ E +++D L E++I LL Sbjct: 343 FILNSLKS-AVPGVDRNDIHPAIVCLPPAEEQPAIVAFLDAEISKLDALRADAERAIDLL 401 Query: 404 KERRSSFIAAAVTGQIDLRG 423 KERRS+ IAAAVTG+ID+R Sbjct: 402 KERRSALIAAAVTGKIDVRN 421 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 46/207 (22%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 KDSGV W+ +P HW + P+KR G + ED G + G S Sbjct: 231 KDSGVAWLREVPAHWNLKPMKRAVVFQRGHD---------LPSEDRVEGNIPVVSSGGIS 281 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + I+ G+ G ++ + +T +Q W + Sbjct: 282 GWHNAAATK---GPTIVTGRYGTIGEFVLLEEDCWPLNTALYTVQ--MHDNVPKYLWYML 336 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + + D I + +PP EQ I + AE ++D L + R Sbjct: 337 QSLKHIFILNSLKSAVPGVDRNDIHPAIVCLPPAEEQPAIVAFLDAEISKLDALRADAER 396 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKM 217 I+LLKE++ AL++ VT ++ M Sbjct: 397 AIDLLKERRSALIAAAVTGKIDVRNVM 423 >gi|86152085|ref|ZP_01070297.1| putative type I restriction enzyme specificity protein [Campylobacter jejuni subsp. jejuni 260.94] gi|85840870|gb|EAQ58120.1| putative type I restriction enzyme specificity protein [Campylobacter jejuni subsp. jejuni 260.94] Length = 433 Score = 220 bits (560), Expect = 3e-55, Method: Composition-based stats. Identities = 114/441 (25%), Positives = 198/441 (44%), Gaps = 32/441 (7%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLED 55 MK++ K+SG++W+G IP+HW+VV I + G E+ +I I + D Sbjct: 1 MKNF------KESGIEWLGEIPEHWEVVKINKIVTFVNGYAFENFDFNPIFEIPVIRIGD 54 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLV 113 ++ Y +++ + + + IL G K D + + + Sbjct: 55 MQKEKILY-DNCLKTKEKEKLKQFLISNNDILIALSGATTGKIAFCDTDNKAYINQRVAI 113 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 ++ K L ++ + L+ + IE C G+ + K IG +P+PPL EQ I Sbjct: 114 VRSKLKL---VKYYFLTRGFSLLIELACNGSAQPNISTKEIGEFKIPLPPLKEQEQIANF 170 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + + +I I ++ + I LLKE+KQA ++ +TKGL+ ++ KDSGIEW+G +P HWE Sbjct: 171 LDEKCEQIANFIEKKEKLISLLKEQKQAFINETITKGLDKNINFKDSGIEWLGEIPQHWE 230 Query: 234 VKP---FFALVTELNRKNTKLIESNILSLSYGNIIQKL---------ETRNMGLKPESYE 281 VK F L LN + I +SYG I K + + + Sbjct: 231 VKKFKMLFTLGNGLNITKADFVSYGIPCVSYGEIHSKYPCRLNTTIHTLPFVSKTYLADK 290 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA--YMAVKPHGIDSTYLAWLMRS 339 ++ G+ VF + ++ + I + I+S Y ++L S Sbjct: 291 PQSLLQKGDFVFADTSEDIEGSGNFTSIQSDTPIFAGYHTIILKYKGKINSLYFSFLFDS 350 Query: 340 YDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 G+ S+ +K + L+PP+KEQ I N ++ + +ID+L+EK ++ Sbjct: 351 IFTRNQIRKEVCGVKVFSITKSILKEVQCLIPPLKEQEQIANFLDEKCEKIDLLIEKTKK 410 Query: 399 SIVLLKERRSSFIAAAVTGQI 419 I L+KE +++ I AV G+I Sbjct: 411 QIKLIKEYKTTLINQAVCGRI 431 >gi|218960560|ref|YP_001740335.1| putative Type I restriction-modification system specificity subunit [Candidatus Cloacamonas acidaminovorans] gi|167729217|emb|CAO80128.1| putative Type I restriction-modification system specificity subunit [Candidatus Cloacamonas acidaminovorans] Length = 440 Score = 220 bits (559), Expect = 5e-55, Method: Composition-based stats. Identities = 125/442 (28%), Positives = 196/442 (44%), Gaps = 27/442 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRF-TKLNTGRTSE--SGKDIIYIGLEDV- 56 M YK+Y YK++G+ W+ +PKHW+++ + + + + + + V Sbjct: 1 MIKYKSYEDYKETGITWLTMVPKHWEILRTDSVTVYIRNQINPDEIKSEFVFHYSIPAVQ 60 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLV 113 E+GTG+Y + + S + K +L KL P I D ICS++F+ Sbjct: 61 ETGTGQY-----DLTEEVGSAKQLITKKSVLISKLNPRKATICIAEPKDEITICSSEFIA 115 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH--ADWKGIGNIPMPIPPLAEQVLIR 171 ++ K + L + S QR++A + T SH I +P EQ I Sbjct: 116 MEAKKCDLKYLFYLMNSEMNRQRLDAKVQSVTRSHQRVYPSDIYRFWTALPSTTEQQAIA 175 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + ET RID LI ++ R IELLKEK+ AL++ VTKGL+P+V MKDSGIEW+G VP+H Sbjct: 176 SFLDRETARIDALIQKKERMIELLKEKRIALITQAVTKGLDPNVPMKDSGIEWLGEVPEH 235 Query: 232 WEVKPFFALVTELNRKNTKLIES-----NILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 W V F + + E I ++ M S + Sbjct: 236 WTVLKFKNIGSFQGGAGFPDDEQGLEDEEIPFYKVSDMNLPGNETYMCQHNNSVSRETAL 295 Query: 287 DPGEIVFRFIDLQNDKRSLR-----SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + R + K + + I + M D + + + D Sbjct: 296 KLRASILRKNTIVFAKVGAALLLNRRRIITKDSCIDNNMMGFSTTHCDVMWCYFFLFQLD 355 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 L K+ G S+ + +PV VPP +EQ I + + ET +I+ +++K+ SI+ Sbjct: 356 LGKLVNP---GAVPSVNESQMSNIPVCVPPTQEQKQIGDYLVTETTKINKMIDKVNASII 412 Query: 402 LLKERRSSFIAAAVTGQIDLRG 423 L E R+S I AVTG+IDLRG Sbjct: 413 QLSEYRASLIHHAVTGKIDLRG 434 >gi|330941784|gb|EGH44533.1| hypothetical protein PSYPI_19973 [Pseudomonas syringae pv. pisi str. 1704B] Length = 472 Score = 218 bits (556), Expect = 1e-54, Method: Composition-based stats. Identities = 90/438 (20%), Positives = 167/438 (38%), Gaps = 20/438 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 AYP Y+ ++W+ +PKHW K F + R+ +++ + + + T + Sbjct: 6 AYPSYRQPKMRWLSTVPKHWNEQRAKTFFREVNERSKTGLEEL--LSVSHLTGITPRSQK 63 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + + G I+ L ++ + +GI S + V +P Sbjct: 64 NVTMFKAASYVGSKLCRPGDIVINTLWAWMAALGTSRHEGIVSPAYGVYRPHQADSFSPA 123 Query: 126 GWLLSIDVTQRIEAICEGAT-----MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + +T +I + PP EQ I + A+ Sbjct: 124 YLDYLLRTRFYVAEYIGRSTGIRASRLRLYPNQFLDIQLIQPPRPEQDQIVAYLRAQDAH 183 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 I I + I+L+ E+K ++ + VT GL+ V +K S +EW+G VP HWEV + Sbjct: 184 IARFIKTKRDLIKLITEQKLHIIDHAVTGGLDASVALKPSDVEWLGEVPKHWEVAFIKHI 243 Query: 241 VTELNRKNTKLIESNILSLSYGNI----IQKLETRNMGLKPESYETYQI----VDPGEIV 292 K + + N T +M L + +I + G+++ Sbjct: 244 ANVHFSGVDKHSHDDETPVRLCNYTDVYKNDRITDDMNLMRATATAAEIARLTLKAGDVI 303 Query: 293 FRFIDLQNDKRSLRSAQVME----RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 D + + + + P + +L + S + F+ Sbjct: 304 LTKDSETPDDIGVPAWVPEDLPGVVCAYHLGLLRPVPDRVLGEFLFRAIGSARTAQQFHI 363 Query: 349 MGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + +G R +L DVK + +PP++EQ I I E ++ ++ + E I L++E R Sbjct: 364 LATGVTRFALGKHDVKNAVIALPPVEEQQAICRWITDECRPLNDVIARTEDEIKLIREYR 423 Query: 408 SSFIAAAVTGQIDLRGES 425 IA VTGQ+D+RG Sbjct: 424 DRLIADVVTGQVDVRGWQ 441 >gi|315638759|ref|ZP_07893932.1| restriction endonuclease S subunit [Campylobacter upsaliensis JV21] gi|315481168|gb|EFU71799.1| restriction endonuclease S subunit [Campylobacter upsaliensis JV21] Length = 438 Score = 217 bits (553), Expect = 3e-54, Method: Composition-based stats. Identities = 107/442 (24%), Positives = 204/442 (46%), Gaps = 26/442 (5%) Query: 1 MKHY--KAYPQ----YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-----DII 49 MK + Y YK SG++W+G IPKHW++ + + + G ES I Sbjct: 1 MKKHTQSPYESSEISYKPSGIKWLGEIPKHWEICKLNKVSYFINGYAFESSHFDYSFSIP 60 Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGIC 107 I + D+++ Y Q + I+ G I+ G K + + Sbjct: 61 VIRIGDIQNDKIIYHTCLMTKEQENLKNFMIYR-GDIVIALSGATTGKFAVCNSNKKAYI 119 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + + +++ + + +L + I+ +C G+ + K +GN +P+PPL EQ Sbjct: 120 NQRVAIIRSDIKILKY---YLSTFGFVNYIDMLCNGSAQPNISTKEVGNFKIPLPPLQEQ 176 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I E + + +I I ++ + I LL+EKKQAL++ +VTKGLNP+++ K+SGI ++GL Sbjct: 177 KEIAEFLDKKCEKIQNYIDKKQKLITLLQEKKQALINEVVTKGLNPNIEFKNSGIAYLGL 236 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-------PESY 280 +P HWE+K + + K S Y + L+ N+ + E Sbjct: 237 IPHHWEIKKLKYVGKVVLGKMLCNEHQKGYSHCYYLKSKNLQWLNVEVSQIEKMWFSEYE 296 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 ++ + +++ K + + ++ E I S + ++ + +L +Y Sbjct: 297 KSLYRIKKDDLLVSEGGEVG-KTCIWNNELAECYIQNSVHKITLNKFNNAKFFLYLFFTY 355 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 VF ++ S + L E + + ++VPP++EQ I N ++ + +I+ +EK ++ Sbjct: 356 GKLGVFDSVVSRVSIAHLVLEKLVNVDMVVPPLQEQKQIANFLDEKCEKINSAIEKTKRQ 415 Query: 400 IVLLKERRSSFIAAAVTGQIDL 421 I L+KE +++ I AV G+I + Sbjct: 416 IELIKEYKNTLINEAVCGRIRV 437 >gi|254491851|ref|ZP_05105030.1| Type I restriction modification DNA specificity domain protein [Methylophaga thiooxidans DMS010] gi|224463329|gb|EEF79599.1| Type I restriction modification DNA specificity domain protein [Methylophaga thiooxydans DMS010] Length = 454 Score = 217 bits (552), Expect = 3e-54, Method: Composition-based stats. Identities = 105/441 (23%), Positives = 184/441 (41%), Gaps = 25/441 (5%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 Y Y QY+ + W PK W + +K + +N G++ S G + Sbjct: 7 KYAPYSQYETVALPWFDTKPKEWMLTRLKFTSSINMGQSPNSDDCNDEGHGRPFLQGNAE 66 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + ++ + ++G +L P I GI + + Sbjct: 67 FGMRTPKAKLFCEAAKKTCSEGDVLLSVRAPVGELNIANQEYGIGRGLCAITAQSV---K 123 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 W L ++ A+ G+T + + N+ +P +EQ I + ET +ID Sbjct: 124 ADFMWWLLQASVSQLRAVATGSTFQAVSAEQVSNLTCLLPAQSEQTQIATFLDRETAKID 183 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 LI ++ R I+LL+EK+QA++S+ VTKGLNPDV MKDSG+EW+G +P W + + Sbjct: 184 RLIEKQQRLIKLLEEKRQAVISHAVTKGLNPDVPMKDSGVEWLGEIPSMWSIVQLRRGID 243 Query: 243 --------------------ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 + + K + + L ++ R+ K SY + Sbjct: 244 FLTDFEANGSFAEVKKNVSLDTDNKYAWYVRATDLEHRRFGLVDG--NRSCNEKSYSYLS 301 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +D GE++ + + + + W + SY Sbjct: 302 KTTLDGGELLVAKRGEIGKVYLMPEIDCRATLAPNLYLIRLNDNFFPQFTYYWFISSYGK 361 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++ A S +L +DV+ + +PP++EQ I I+ T +I L+ K+++SI L Sbjct: 362 SELVNADKSTTIGALYKDDVRACIIPMPPVQEQILIVKHISERTDKIQRLITKVQKSIAL 421 Query: 403 LKERRSSFIAAAVTGQIDLRG 423 ERR++ I+AAVTG+ID+R Sbjct: 422 STERRAALISAAVTGKIDVRD 442 >gi|146280647|ref|YP_001170800.1| type I restriction-modification system, S subunit [Pseudomonas stutzeri A1501] gi|145568852|gb|ABP77958.1| type I restriction-modification system, S subunit [Pseudomonas stutzeri A1501] Length = 421 Score = 217 bits (552), Expect = 3e-54, Method: Composition-based stats. Identities = 115/439 (26%), Positives = 189/439 (43%), Gaps = 36/439 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVE 57 M HYK YP YKDSGV+W+G +P+HW + P K ++ G + IG Sbjct: 1 MSHYKPYPAYKDSGVEWLGRVPEHWTIGPYKATIQIENGSDYKEVEADDGYPVIGSG--- 57 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 S+ ++ +L G+ G + + T + Sbjct: 58 -------------GPFAYSSKLMYDGESVLLGRKGTIDKPLYVNGAFWAVDTMYW----S 100 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + P + T + + +G+ + P EQ I + E Sbjct: 101 IIKPGAHGRFAYYTATTIPFDMYSTNTALPSMTKSVLGSHVVAFPGFEEQQAIAGHLDRE 160 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 T RID L+ ++IRFIELL+EK+QAL+++ VTKGL+P VKMKDSG+EW+G VP+HW +K F Sbjct: 161 TARIDALVEKKIRFIELLREKRQALITHAVTKGLDPSVKMKDSGVEWLGAVPEHWVIKRF 220 Query: 238 FALVTELNR-------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE----TYQIV 286 + ++ N I I ++ +II + + + + + ++ + Sbjct: 221 RDICISISTGPFGTALGNEDYITGGIPVINPSHIIDEQCSPDPDITVSTETALRLSFWAM 280 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G++V + Q S + P + YL +++S + Sbjct: 281 RAGDLVTARRGELGRAAIIFGEQDGWICGTGSLRVRPNPSQALTEYLHTVLQSRYAREWL 340 Query: 347 Y-AMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 A +L + LP+ +PP EQ + + + ++ R+ + +K S+ LLK Sbjct: 341 NLASVGATMANLNEGILGSLPLALPPSTAEQEKLLSSLAAQSERLIKIEQKAALSVALLK 400 Query: 405 ERRSSFIAAAVTGQIDLRG 423 E RS+ I AAVTGQIDLR Sbjct: 401 ECRSALITAAVTGQIDLRE 419 >gi|186685410|ref|YP_001868606.1| type I restriction enzyme, S subunit [Nostoc punctiforme PCC 73102] gi|186467862|gb|ACC83663.1| type I restriction enzyme, S subunit [Nostoc punctiforme PCC 73102] Length = 440 Score = 216 bits (549), Expect = 7e-54, Method: Composition-based stats. Identities = 95/434 (21%), Positives = 168/434 (38%), Gaps = 20/434 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 Y YK ++W+ IP+HWK+ K + + S K I D + + Sbjct: 5 RYQAYKKCDIEWLLEIPEHWKIDRAKSLFREMSRPVSPRDKIITVFR--DGQVTLRENRR 62 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 G + + KG +++ + + ++D DG + ++LV D + Sbjct: 63 VTGFTNAIEEYGYQGIRKGDLVFHAMDAFAGAIGVSDSDGKATPEYLVYTTIDKNKIYVP 122 Query: 126 GW-----LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + +++ + +PIPP EQ I + +T + Sbjct: 123 FFGFLLRQMALSGFVLALGKSVRERSPRFKHTKFVTLDLPIPPFTEQETIAHYLDTKTAQ 182 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID I + L KQ+L++ VT GL+ V M+DSGIEW+G VP+HW++K L Sbjct: 183 IDRKIDLLTQKATLYGNLKQSLINETVTCGLDKSVPMRDSGIEWIGEVPEHWDIKRLKDL 242 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK--------PESYETYQIVDPGEIV 292 N K + + + N + + + +S + G++ Sbjct: 243 SDIQNSNVDKKSHDDEIPIKLCNYVDVYKNEFINTSLDFMDATANKSEIKQFTIKEGDVF 302 Query: 293 FRFIDLQNDKRSLRS-AQVMERGIITSAYM---AVKPHGIDSTYLAWLMRSYDLCKVFYA 348 D ++ + A +G+I ++ K +YL L +S F Sbjct: 303 ITKDSETCDDIAIPALAAESIKGVIYGYHLARLRTKEKVFLGSYLFRLFQSKSYGFRFVI 362 Query: 349 MGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 G R L + VP + EQ I + ++ +TA+ID +++ I I LKE R Sbjct: 363 SAKGITRVGLGQSAIADSLTPVPLLSEQKAIADYLDTKTAQIDQIIQTINTQIEKLKELR 422 Query: 408 SSFIAAAVTGQIDL 421 + I VTG+I + Sbjct: 423 KTLINDVVTGKIRV 436 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 80/211 (37%), Gaps = 4/211 (1%) Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 + K IEW+ +P+HW++ +L E++R + + + +++ Sbjct: 1 MKIERYQAYKKCDIEWLLEIPEHWKIDRAKSLFREMSRPVSPRDKIITVFRDGQVTLREN 60 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 YQ + G++VF +D + + Y + + I Sbjct: 61 RRVTGFTNAIEEYGYQGIRKGDLVFHAMDAFAGAIGVSDSDGKATPEY-LVYTTIDKNKI 119 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + +L+R L A+G +R K L + +PP EQ I + ++ + Sbjct: 120 YVPFFGFLLRQMALSGFVLALGKSVRERSPRFKHTKFVTLDLPIPPFTEQETIAHYLDTK 179 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 TA+ID ++ + Q L + S I VT Sbjct: 180 TAQIDRKIDLLTQKATLYGNLKQSLINETVT 210 >gi|227540803|ref|ZP_03970852.1| type I restriction-modification system S subunit [Corynebacterium glucuronolyticum ATCC 51866] gi|227183432|gb|EEI64404.1| type I restriction-modification system S subunit [Corynebacterium glucuronolyticum ATCC 51866] Length = 442 Score = 215 bits (548), Expect = 8e-54, Method: Composition-based stats. Identities = 112/437 (25%), Positives = 190/437 (43%), Gaps = 19/437 (4%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGK 62 Y YKDSGV WI IP+ W V K + G + I + + + Sbjct: 4 YEHYKDSGVPWIDKIPQLWTVDRFSMSFKFSRGLDIKKRDLEAAGIPVLSYGQIHAKHNP 63 Query: 63 YLPKD--------GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF-----DGICST 109 + + + +G +++ + A Sbjct: 64 VVTISPDLVRFIPADKIGGGNLEDARLREGDLVFADTSEDVHGAGNFSRSDGSQMIHAGY 123 Query: 110 QFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 L+ +P++ +L S + +I +G + + + PP+ Q Sbjct: 124 HTLLARPRETYEHKYFAYLFSSEAWRHQIRRAVQGVKVYSITQGVFKHAQLLRPPVETQD 183 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + A+T ID ++ + R LL+ K+ L+++ VTKGLNP+ MKDS E++G Sbjct: 184 AIVAFLDAKTAEIDVVVEKLRRQRALLERYKRELIAHTVTKGLNPESPMKDSEYEFIGTY 243 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P W+ +P F + ++ N++L L G+II K + + + Y +V P Sbjct: 244 PADWQNRPLFDICDQVKLDNSELQTIVALQFKNGSIIAKPDWDDSPQSLDILSGYTLVSP 303 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFY 347 G IV ++L D ++ R V G ITSAY+ + PH S YL +L +S D K + Sbjct: 304 GMIVINGLNLNYDFKTKRIGLVKNNGAITSAYIVISPHRDIESRYLNYLFKSIDAQKALH 363 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 M G+R+ L ++D++RL + +P +Q I N ++ +TA ID L+ I++ I LL R Sbjct: 364 GMTEGVRKILNWKDIRRLTLPMPNSSQQIAIANYLDTKTAEIDSLIANIDRQIALLGAYR 423 Query: 408 SSFIAAAVTGQIDLRGE 424 I VTG++ + E Sbjct: 424 KQVINDVVTGKVRVSEE 440 >gi|257095818|ref|YP_003169459.1| type I restriction-modification system specificity subunit [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257048342|gb|ACV37530.1| type I restriction-modification system specificity subunit [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 475 Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats. Identities = 89/444 (20%), Positives = 179/444 (40%), Gaps = 22/444 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDV 56 + K Y +YK+SG+ W+G +P HW V + L G + + ++ Sbjct: 2 IADLKPYAEYKESGLLWLGQVPGHWDVRKPRHIGSLLKGVGGTKEDALPAGVPCVRYGEL 61 Query: 57 ESGTGKYLPKDGNSRQSDTST-VSIFAKGQILYGKLGPYLR-----KAIIADFDGICSTQ 110 + ++ + +D + + G +L+ G L + D +C Sbjct: 62 YTTHAYFVRRPKTFIHADRAADYTPLHYGDVLFAASGETLEDIGKSAVNLIDGTAVCGGD 121 Query: 111 FLVLQPKDVLPELLQGWLL-SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 ++L+P + G+++ + + + G T+ H + ++ P+PP+ EQ Sbjct: 122 VIILRPSVPVHAPFLGYVMDCRPLANQKATMGRGTTVKHVYPDELKHLVFPLPPVPEQAA 181 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + R++ I + + I LL E+KQA+V VT+GL+P V +K SGI W+G +P Sbjct: 182 IVRFLNWANGRLERAIRAKRKVIALLNEQKQAIVHRAVTRGLDPSVPLKPSGIPWLGDIP 241 Query: 230 DHWEVKPFFA---LVTELNRKNTKLIESNIL------SLSYGNIIQKLETRNMGLKPESY 280 HW V + + + ++ + G ++ + + Sbjct: 242 RHWRVWRLKFVALNIVDCLHATPRYSDAGTHPAIRTADIVAGVVLVDQAKKVSSRDYARW 301 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 T G+I++ + + A + I + S ++ WL+ S Sbjct: 302 TTRLQPQEGDILYSREGERFGIAACVPA-ATQLCISQRMMVFRIATQHCSKFVMWLLNSR 360 Query: 341 DLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + ++ + +P +EQ + I ET I+V ++++++ Sbjct: 361 STYGQALQDVMGATAPHVNISTIRNYYLALPLKREQEAVVERIGAETHPIEVAIDRLKRE 420 Query: 400 IVLLKERRSSFIAAAVTGQIDLRG 423 I LL+E R+ IA VTG++D+R Sbjct: 421 IELLREYRTRLIADVVTGKVDVRE 444 >gi|53802449|ref|YP_112811.1| hypothetical protein MCA0277 [Methylococcus capsulatus str. Bath] gi|53756210|gb|AAU90501.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath] Length = 474 Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats. Identities = 86/437 (19%), Positives = 171/437 (39%), Gaps = 21/437 (4%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 YP Y+ + +W+ +P+HW ++ K F + R+ + ++ + ++ K Sbjct: 7 YPNYQPTRSRWVPRVPEHWSLLRAKNFLREIDDRSKTGEETLLSMRMQRGLVPHNDVSVK 66 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP----- 121 + +++ ++ + G+ S + V + Sbjct: 67 RIA--PENLIGYKKVQPNELVLNRMQAGNAMFFRSRQSGLVSPDYAVFRLLRDDNPEYLG 124 Query: 122 ELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 L + W + + + G + ++ +P+PP EQ I + A+ Sbjct: 125 HLFRSWPMRGLFRSESKGLGTGTSGFLRLYSDRFASLEIPLPPRPEQDQIVAYLRAQDAH 184 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 I I + I+LL E+K ++ + VT+GL+P+V++K SGI+W+G VP+HWEV + Sbjct: 185 IARYILAKRELIKLLTEQKLTIIDHAVTRGLDPNVRLKPSGIQWLGEVPEHWEVASIKHI 244 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--------SYETYQIVDPGEIV 292 K + + N + + + + + G+++ Sbjct: 245 ADVRFSGVDKHSNDDETPVRLCNYTDVYKNERITADMDLMRATATAAEIARLTLKAGDVI 304 Query: 293 FRFIDLQNDKRSLRSAQVME----RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 D + + + + P + +L + S + F+ Sbjct: 305 LTKDSETPDDIGVPAWVPEDLPGVVCAYHLGLLRPVPQRVLGEFLFRSIGSTRTAQQFHV 364 Query: 349 MGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + +G R +L DVK + +PP++EQ I I E +D + + E+ I L++E R Sbjct: 365 LATGVTRFALGKHDVKNAIIALPPVEEQQAICRWIVEECQPLDEAIARAEEEIQLIREYR 424 Query: 408 SSFIAAAVTGQIDLRGE 424 IA VTGQID+RG Sbjct: 425 DRLIADVVTGQIDVRGW 441 >gi|126664813|ref|ZP_01735797.1| type I restriction-modification [Marinobacter sp. ELB17] gi|126631139|gb|EBA01753.1| type I restriction-modification [Marinobacter sp. ELB17] Length = 444 Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats. Identities = 113/436 (25%), Positives = 186/436 (42%), Gaps = 24/436 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M + Y +YKD+ + WI IP W++ + + + G + ++ + Sbjct: 1 MS-FPRYSEYKDTEINWIAQIPTGWQIASLSKLFSIKAGGDVNTD---VFSETRTHDRPF 56 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 Y + + ++ + + I G Y+ A D + LVL PK L Sbjct: 57 PIYTNANNPNIVYGYTSKAKYGPNCITVSGRG-YVGFAAFRDHIFDAIIRLLVLTPKKDL 115 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + + ++ E + + I + P EQ I + ET + Sbjct: 116 NCKFFEYF----INEVVDFREESSAIGQLSTNQIAPYKVAFPDCREQSKITHFLDHETAK 171 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 IDTLI E+ R IELLKEK+QA++S+ VTKGL+PDV +KDSG+EW+G VP HW V Sbjct: 172 IDTLIHEQKRLIELLKEKRQAVISHAVTKGLDPDVPIKDSGVEWLGDVPAHWGVATIRRF 231 Query: 241 VTELNRKNTKLIESNILSLSYG-NIIQKLETRNMGLKPESYETYQI----------VDPG 289 + T +E ++ G N + + ES + +I G Sbjct: 232 AKAVRTGGTPSLEMPNSEIADGINWFTPGDFNGSLMLHESEKQLRISSISSGDAKLFPGG 291 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ I K + I + V I+ +L + + + F + Sbjct: 292 SVLVVGIGATLGKVAKVDDDFSANQQIN---VIVPGKRINGHFLVYSLSAQKSQMRFVSN 348 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 S + E K + +++PP++EQ IT ++ +D LV K I+LLKERRS+ Sbjct: 349 AS-TIGIMNQEKTKDIVLVLPPVEEQTQITESLDRGVQNLDQLVIKAASGILLLKERRSA 407 Query: 410 FIAAAVTGQIDLRGES 425 I+AAVTG+ID+R Sbjct: 408 LISAAVTGKIDVRDWQ 423 >gi|258545847|ref|ZP_05706081.1| type I restriction-modification system specificity determinant [Cardiobacterium hominis ATCC 15826] gi|258518863|gb|EEV87722.1| type I restriction-modification system specificity determinant [Cardiobacterium hominis ATCC 15826] Length = 465 Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats. Identities = 94/434 (21%), Positives = 178/434 (41%), Gaps = 14/434 (3%) Query: 4 YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 + YP Y+ + ++W +P+ W ++ K+ +L + + + + + K Sbjct: 2 FGPYPDYRRTDLKWFEYLPESWGILRAKQMFRLVIEKAPANNQMELLSVYTHIGVRPRKS 61 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 L + GN T + +G I+ KL ++ + + G+ S + +L+P Sbjct: 62 LEQRGNKAS-TTDGYWVVKEGDIICNKLLAWMGAIGASHYQGVTSPAYDILRPVKPCNTD 120 Query: 124 LQGWLLSIDVTQRIEAICE---GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 +L + I + G IP+P+P +EQ I + A+ Sbjct: 121 YYHFLFRTKKYLQQFKIRSRGIMDMRLRLYFDQFGQIPIPVPSRSEQDQIVAYLRAQDAY 180 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 I I + I+LL E+K ++ + VT+GL+ V ++ SGIEW+G VP+HWEV+ + Sbjct: 181 IARFIKAKRDLIKLLTEQKLRIIDHAVTRGLDSSVALRPSGIEWLGEVPEHWEVQRLKNV 240 Query: 241 VTELNRK--------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + K + + S + I + ++ + G+++ Sbjct: 241 ANMVLGKMLTTEAKAGDGDFKPYLRSTNVQWIKPDVRDVKEMWVAKAEMAQLRIRKGDLL 300 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + ++ E I S + + +L +Y F A+ + Sbjct: 301 VSEGGEVG-RACMWNDELPECYIQNSVHRVAAKPMMLPEFLFHQFFTYGKRGRFNAIVNR 359 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + L E + +P VPPI+EQ I I E +D + + E+ I L++E R I Sbjct: 360 VSIAHLTREKLVTVPFTVPPIEEQKAICRWITEECQPLDDAIARAEEEIKLIREYRDRLI 419 Query: 412 AAAVTGQIDLRGES 425 A VTGQ+D+RG Sbjct: 420 ADVVTGQVDVRGWQ 433 >gi|238920394|ref|YP_002933909.1| restriction modification system DNA specificity domain protein [Edwardsiella ictaluri 93-146] gi|238869963|gb|ACR69674.1| restriction modification system DNA specificity domain protein [Edwardsiella ictaluri 93-146] Length = 435 Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats. Identities = 102/433 (23%), Positives = 178/433 (41%), Gaps = 21/433 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT----------SESGKDIIYIGLE 54 Y YK+SGV+WI +P+ W +V IK + + G + SE YI + Sbjct: 7 PKYDTYKNSGVEWIEQVPEGWGLVKIKNYADVFNGDSLNDKQKAKYESEDQSHRSYISSK 66 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG-PYLRKAIIADFDGICSTQFLV 113 D++ K ++G S+ + L G +K + + + Sbjct: 67 DIDVNYSKINYQNGLRIP-KGSSYKVCPSNSTLMCIEGGSAGKKIAYTNQEVCFVNKLAC 125 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + LS + + I N + +P EQV I Sbjct: 126 FLASKRIDSHFLYYYLSSVTFKSQFFNSMTGLIGGVSISAIKNFWLVLPSPTEQVAIASF 185 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + + +ID IT + + I LLKE+KQ L+ T+GL+P V MKDSG++W+G +P+HW+ Sbjct: 186 LSKKLSQIDEAITTKEQQISLLKERKQILIQQAATQGLDPCVPMKDSGVDWIGKIPEHWQ 245 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 V F L ++ K ++ + + YQ + G++V Sbjct: 246 VIRFKNLFSQSRIPVRKEDGVVTSYRDGQVTLRSNRRLDGYTEAIIEGGYQGIRKGQLVL 305 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAMGS 351 +D + + G T Y+ P D + +L+R L K + + Sbjct: 306 NSMDAFEGAIGVSESD----GKCTPEYVICDPVRADVSQYYFAYLLREMALAKYIQVICN 361 Query: 352 GLRQ---SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +RQ ++F ++ +++PP EQ I I E +I+ VE ++ I LKE ++ Sbjct: 362 AVRQRAVRIRFNNLASRFMVLPPSDEQEKIVEFIESEKGKINKGVEHLKGQIEKLKEYKT 421 Query: 409 SFIAAAVTGQIDL 421 + I +AVTG+I + Sbjct: 422 TLINSAVTGKIKV 434 >gi|77361017|ref|YP_340592.1| type I restriction-modification system, S subunit [Pseudoalteromonas haloplanktis TAC125] gi|76875928|emb|CAI87149.1| putative type I restriction-modification system, S subunit [Pseudoalteromonas haloplanktis TAC125] Length = 442 Score = 215 bits (546), Expect = 2e-53, Method: Composition-based stats. Identities = 108/442 (24%), Positives = 188/442 (42%), Gaps = 32/442 (7%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVE 57 + YKAYP+Y++S + W+ IP +W+ +P++ G +S I + D++ Sbjct: 9 RKYKAYPEYQNSDIDWLRKIPNYWQTIPLRLILDTRKGVAFKSNDFTSSGIRVVKASDIK 68 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY----------LRKAIIADFDGIC 107 T + +I KG I+ +G + + Sbjct: 69 KLTINSSEVYLPTNYISIYPKAILRKGDIILSTVGSNPDVKNSAVGQIGVVPEHLDGALL 128 Query: 108 STQFLVLQPKDVLPELLQGW---LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 + +V +PK+ + ++ A S + N +PIPP Sbjct: 129 NQNTVVFEPKEDKIHREFLFKVIQMNGYRDHLDLNAHGTANQSSLSISDMLNFYIPIPPK 188 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 EQ I + ET +IDTLI ++ + IELLKEK+QA++S+ VTKGLNP+ M+DSG+EW Sbjct: 189 NEQQKIASFLDHETAKIDTLIAKQEKLIELLKEKRQAVISHAVTKGLNPNAPMRDSGVEW 248 Query: 225 VGLVPDHWEVKPFFALVTELNRK--NTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 +G VP+HW + V+ + + ++ L+E N L +I +++T Sbjct: 249 LGEVPEHWLIGSLRWKVSISSGEGLSSNLVEKNKTELKKIPVIGGNGVMGFSESSNTHKT 308 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + + + L N + + D +L+ Sbjct: 309 AIAIGRVGALCGNVHLINYISWITDNALK-------------ISSWDGFDENYLISLLKA 355 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + + + E +K L V++PP+KEQ I + D L ++ + I L Sbjct: 356 ANLNNLASTTAQPLITGEQIKSLIVVIPPLKEQIKINLKLTKIVNLFDKLEKRSKDGINL 415 Query: 403 LKERRSSFIAAAVTGQIDLRGE 424 LKER+++ I+AAVTG+ID+R Sbjct: 416 LKERKTALISAAVTGKIDVRNW 437 >gi|194288966|ref|YP_002004873.1| type I restriction-modification methylase s subunit [Cupriavidus taiwanensis LMG 19424] gi|193222801|emb|CAQ68804.1| type I restriction-modification methylase S subunit [Cupriavidus taiwanensis LMG 19424] Length = 458 Score = 215 bits (546), Expect = 2e-53, Method: Composition-based stats. Identities = 102/452 (22%), Positives = 194/452 (42%), Gaps = 31/452 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDV 56 M + Y Y+DSG+ W+G +P HW+V ++ + N ++ + + ++ ++ + Sbjct: 1 MS-LQRYAAYRDSGIDWLGDMPAHWQVRRLRFAAEFNPSKSEVSHLDRDTLVSFLPMDAI 59 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQ 110 G + + + + F +G + + K+ P + G +T+ Sbjct: 60 -GEEGSLVLEQVRQVSQVETGYTYFHEGDVAFAKITPCFENGKGAVMRGLLGGVGFGTTE 118 Query: 111 FLVLQPKDV--LPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQ 167 +V +P+ E L SI + E GA + + PPL+EQ Sbjct: 119 LIVARPRSDVTCSEYLHWLFCSIPFRKLGEGAMYGAGGQKRVPEDFARDFAIAFPPLSEQ 178 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + +ET +IDTLI+E+ + + LL EK+QA +S IVT+GL P V++K G +W+G Sbjct: 179 NAIVTFLYSETSKIDTLISEQDKLLVLLAEKRQATISRIVTRGLEPKVQIKSVGADWLGE 238 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY---- 283 +P HW+ K L + + + + E+ + K+ N G+ + Sbjct: 239 IPIHWQAKRVKWLTSSIEQGWSPQCENYPAEGENEWGVLKVGCVNGGVFDAAENKKLPPE 298 Query: 284 ------QIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSAYMAVKPHG--IDSTYLA 334 + G+++ + + + + R ++ ++ +LA Sbjct: 299 LEPFPEYSLRKGDLLISRANTRELVGSAAVVPKDFHRLLLCDKLYRLRLDQAKCTPEFLA 358 Query: 335 WLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + + +G ++ + L V +PP +EQ I + +N E R++ Sbjct: 359 AYLATGEARGQIELGATGASSSMLNIGQSVIMDLLVPLPPAEEQAAIMDFLNAELDRLER 418 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 L +SI LLK RR++ I AAVTG+ID+R Sbjct: 419 LSLAANKSIDLLKARRTALITAAVTGKIDVRN 450 >gi|289166196|ref|YP_003456334.1| type I restriction-modification system (methylase_S) [Legionella longbeachae NSW150] gi|288859369|emb|CBJ13305.1| putative type I restriction-modification system (methylase_S) [Legionella longbeachae NSW150] Length = 466 Score = 214 bits (545), Expect = 2e-53, Method: Composition-based stats. Identities = 100/434 (23%), Positives = 193/434 (44%), Gaps = 19/434 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 K Y YK+S +W+ IP+HW K K+ R+ + +++ + + + + + Sbjct: 2 KPYSSYKNSSEKWLNKIPEHWNFKRAKSVFKIIDIRSQDGSEEL--LSVSEKQGVALRKN 59 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPE 122 + ++ + + ++ L ++ +++ GI ST + V + D Sbjct: 60 TNVTMFQAANYAGYKLCWPQDLVINSLWAWMTGLGFSEYHGIISTAYSVFRIWDQEKFNY 119 Query: 123 LLQGWLLSIDVTQRI---EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 +LL + + + ++P+ +PPL+EQ I + +T Sbjct: 120 KYGNYLLRSKIYNWEFRVRSKGIWRSRYQLSDDSFLSMPLLLPPLSEQQQIAIYLDWKTT 179 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +I+ I + + I LLKE+KQ +++ VTKG+NPDV MKDSG++W+G +P+HWE++ Sbjct: 180 KINKFIKAKKKLIALLKEQKQNIINEAVTKGINPDVNMKDSGVDWLGEIPEHWEIRKLKY 239 Query: 240 LVTEL------NRKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQ---IVDPG 289 + T+ T +S I L N + ++ N+ V P Sbjct: 240 VATKFGSGVTPKGGATVYQDSGIPFLRSQNIHFEGIKLENVAYISNDVHKRMSSSHVKPN 299 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 +++ + + + + + + + S YLA+ + + + Sbjct: 300 DVLLNITGASIGRTCYVPSNLEQANVNQHVCIIRPIQKKVSSQYLAFYLSIPLIQRKILE 359 Query: 349 MGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +G R+ L +KRL V++P EQ DI N I+ ET+ I+ ++K E I L++E R Sbjct: 360 EQNGASREGLTLSSIKRLNVILPTFNEQMDILNYISTETSVINKTIKKAELEIELIQEFR 419 Query: 408 SSFIAAAVTGQIDL 421 + I+ VTG+ID+ Sbjct: 420 TRLISDVVTGKIDV 433 >gi|299141338|ref|ZP_07034475.1| type I restriction-modification system specificity determinant [Prevotella oris C735] gi|298577298|gb|EFI49167.1| type I restriction-modification system specificity determinant [Prevotella oris C735] Length = 407 Score = 214 bits (545), Expect = 2e-53, Method: Composition-based stats. Identities = 94/421 (22%), Positives = 180/421 (42%), Gaps = 19/421 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + Y YKDSG QW+G IP HW++ K K R+ + + ++ + D Sbjct: 2 QTYDSYKDSGEQWLGRIPSHWEIRRSKFLWKETDRRSQKGTEQLLSVSQYDG------VR 55 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + SR K + + + +L +++F+G+ S + V KD Sbjct: 56 EANAESRSESLVGYKYVHKDEFVINIMLAWLGGLGVSNFEGVVSPAYCVYHLKDKQNPRF 115 Query: 125 QGWLLSIDVTQRIEAICEG---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 +L A + G++ +PP+ EQ + + + +T +I Sbjct: 116 LHYLYRTPQYLAEFARHSTGIVPSRWRMYTDDFGDVLTILPPIEEQNRMVQYLDEQTSQI 175 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 D +I ++ + I+LL E+KQ +++ VTKGL+P+V MKDSGI+W+G +P+HWE+K + L Sbjct: 176 DEVIAQQQKMIDLLNERKQIIINNAVTKGLDPNVSMKDSGIDWIGKMPNHWELKQYKYLF 235 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + + + + Y +D ++ Sbjct: 236 YNFDNLRKPITADQRSRDNPMYDYYGASGVIDKID------YYNIDDKVLLIGEDGANLL 289 Query: 302 KRSLR-SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 R+L + + + + +KP + ++A +M + D + L Sbjct: 290 MRNLPLVYKAKGKFWVNNHAHILKPIKDNYDFMALVMEAADYTLFI---TGSAQPKLSQA 346 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ + + +PPI+EQ I N +N +D ++K ++ + LL+ER+ I VTG+I+ Sbjct: 347 NLNSVKLPIPPIEEQEKIVNFVNENAGILDFPLKKAKKQVELLQERKQIIINEVVTGKIN 406 Query: 421 L 421 + Sbjct: 407 V 407 >gi|282849443|ref|ZP_06258828.1| type I restriction modification DNA specificity domain protein [Veillonella parvula ATCC 17745] gi|282581147|gb|EFB86545.1| type I restriction modification DNA specificity domain protein [Veillonella parvula ATCC 17745] Length = 427 Score = 214 bits (545), Expect = 2e-53, Method: Composition-based stats. Identities = 95/429 (22%), Positives = 175/429 (40%), Gaps = 23/429 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKD 67 + KDSGV+W+G IPK W + + L + R+++ S KD + + G + Sbjct: 3 EMKDSGVRWLGMIPKSWD---LDKIVSLYSERSTKVSDKDYPALSVT----KQGIVPQLE 55 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 ++ + + K + I++++G CS +VL PK+ + + Sbjct: 56 SAAKTDNGDNRKLIKKNDFVINSRSDRRGSCGISEYEGSCSLINIVLAPKNNMVNRYYNY 115 Query: 128 LLSIDVTQRIEAICEGATMSHAD---WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 L ++ + W + NI +P P L EQ I E + + +IDT+ Sbjct: 116 LFKTELFADEFYKWGNGIVDDLWSTKWSNMKNIMVPFPSLEEQQAIAEHLDTKCAQIDTI 175 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I + IE L+E K+A+++Y V KGL+ + DSGIEW+ +P HW++K Sbjct: 176 IAKEQSVIEKLQEYKRAIITYAVVKGLDITAETADSGIEWIDSIPSHWKIKRLIFSAYIR 235 Query: 245 NR------KNTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFR 294 R K + LS NI + + ++ G+++ Sbjct: 236 ARLGWKGLKADEYTSEGHPFLSAVNIQNDKLVWEDLNFINDDRYDESPEIKLEIGDLLLV 295 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSG- 352 K ++ S+ + P+ +S YL + S + +G Sbjct: 296 KDGAGIGKCAVVDQLPYGTATTNSSLGVITPYPELNSMYLYYFFESAIFQNYISRIKNGM 355 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 L ++K + V++PP EQ I ++ + A +D ++ + + I L E + S I Sbjct: 356 GVPHLTQGNLKNIMVIIPPYCEQEAIVTYLDEKCANLDSVILRKQSRIDKLTEYKKSLIY 415 Query: 413 AAVTGQIDL 421 VTG+ ++ Sbjct: 416 EVVTGKKEV 424 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 89/205 (43%), Gaps = 10/205 (4%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 +MKDSG+ W+G++P W++ +L +E + K + + G + Q Sbjct: 1 MREMKDSGVRWLGMIPKSWDLDKIVSLYSERSTKVSDKDYPALSVTKQGIVPQ----LES 56 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 K ++ + +++ + V D+R E + + + + Y Sbjct: 57 AAKTDNGDNRKLIKKNDFVINSRS---DRRGSCGISEYEGSCSLINIVLAPKNNMVNRYY 113 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSL---KFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 +L ++ FY G+G+ L K+ ++K + V P ++EQ I ++ + A+ID Sbjct: 114 NYLFKTELFADEFYKWGNGIVDDLWSTKWSNMKNIMVPFPSLEEQQAIAEHLDTKCAQID 173 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAV 415 ++ K + I L+E + + I AV Sbjct: 174 TIIAKEQSVIEKLQEYKRAIITYAV 198 >gi|134045681|ref|YP_001097167.1| restriction modification system DNA specificity subunit [Methanococcus maripaludis C5] gi|132663306|gb|ABO34952.1| restriction modification system DNA specificity domain [Methanococcus maripaludis C5] Length = 447 Score = 214 bits (544), Expect = 3e-53, Method: Composition-based stats. Identities = 108/444 (24%), Positives = 192/444 (43%), Gaps = 30/444 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYL 64 KDSG++WIG IP W V +K LNTG + + + D+ S + Sbjct: 4 AMKDSGIEWIGDIPADWGVKKLKYILGLNTGLSITKAELVENGVDCVNYGDIHSKYTFDI 63 Query: 65 PKDGNSRQS------DTSTVSIFAKGQILYGKLGPYLRKAIIA-------DFDGICSTQF 111 ++ DT+ +I ++G ++ + + + + Sbjct: 64 VSSRDNLPKVPVEFIDTNPSAIASEGDFIFCDTSEDIEGSGNCLFIRESNNKPIFAGSHT 123 Query: 112 LVLQPKDVLPELLQGWL-LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 ++ +P + G+L S D+ +I+ G + K + +I + +PP+ EQ I Sbjct: 124 ILGRPLINVNSTYLGYLLKSPDIKSQIQKRVVGIKVYSITQKILKSISLILPPVDEQQEI 183 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + + + +ID++I + I+ K KQ++++ VTKGL+P V MKDSGIEW+G +P+ Sbjct: 184 AQYLDDKVGQIDSIIEKTKSSIDEYKSYKQSIITETVTKGLDPTVTMKDSGIEWIGDIPE 243 Query: 231 HWEVKPFFA--LVTELNRKNTKLIESNILSLSYGNIIQKLE----TRNMGLKPESYETYQ 284 HW++ + K++ S +SYG++ + E + E ++ Sbjct: 244 HWDIIKIRYLGTLQNGISKSSSYFGSGYPFVSYGDVYKNYELPKSVEGLVESNEFDKSNY 303 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSA--QVMERGIITSAYMAVKP---HGIDSTYLAWLMRS 339 V+ G++ F D+ + M + + +P ++ Y + RS Sbjct: 304 SVEYGDVFFTRTSETIDEIGFTATCMHTMNDAVFAGFLIRFRPFDSKLLNPLYSKYYFRS 363 Query: 340 YDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + F + R SL E +K+LPVLVPP EQ I I ID L+ K +Q Sbjct: 364 DMHRRFFVKEMNLVTRASLSQELLKKLPVLVPPHNEQIAIGKFIEETCQTIDQLITKKQQ 423 Query: 399 SIVLLKERRSSFIAAAVTGQIDLR 422 I LK + S I VTG+ +++ Sbjct: 424 LITELKAYKKSLIYEVVTGKKEVK 447 >gi|163788850|ref|ZP_02183295.1| hypothetical protein FBALC1_11452 [Flavobacteriales bacterium ALC-1] gi|159876087|gb|EDP70146.1| hypothetical protein FBALC1_11452 [Flavobacteriales bacterium ALC-1] Length = 440 Score = 213 bits (543), Expect = 3e-53, Method: Composition-based stats. Identities = 101/416 (24%), Positives = 174/416 (41%), Gaps = 11/416 (2%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 S + W+ IP HWK V + + S KD + + G + ++ Sbjct: 11 SKIDWLNKIPNHWKEVRLGSVFNERKEKV--SDKDFPPLSVT----KNGIVPQLENAAKS 64 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 +D + G + ++ +DG S +VL+P D++P Q L S Sbjct: 65 NDGDNRKLVLSGDFAINSRSDRKGSSGLSIYDGSVSLINIVLKPIDIIPVFSQYLLKSYF 124 Query: 133 VTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + G + + + N+ +P+PP EQ I + +T +I+ IT++ + Sbjct: 125 FKEEYYRYGRGIVEDLWTTRYSEMKNMIIPLPPKQEQTTIANFLDYKTEKINRFITKKKQ 184 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 IELL E+K A+++ V KG+NP+V MKDSGIEW+G +P+HWEV+ V Sbjct: 185 LIELLNEQKAAIINQAVIKGINPNVPMKDSGIEWLGEIPEHWEVRKLKYSVRLNMHTEFN 244 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 ES ++ NI K + I G+++F + K + S Sbjct: 245 NKESIKNKIALENIEGKTGRILALNENSFEGVGTIFKKGDVLFGKLRPYLAK--VVSPNF 302 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 + + + + YL + M + D + G + + L + Sbjct: 303 EGSCVNELLVLTPNRNDWNPKYLKYRMLASDFISIVDNSTYGAKMPRASWNFIGTLKISK 362 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 P EQ +I I ET + + I + I L++E +++ IA AVTG+ID+R + Sbjct: 363 PNKTEQSEIVRFIEKETELVSKTIITIAKEISLVEEYKTALIADAVTGKIDVRDFT 418 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 70/206 (33%), Positives = 107/206 (51%), Gaps = 6/206 (2%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDG 68 KDSG++W+G IP+HW+V +K +LN + + I I LE++E TG+ L + Sbjct: 211 MKDSGIEWLGEIPEHWEVRKLKYSVRLNMHTEFNNKESIKNKIALENIEGKTGRILALNE 270 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPELLQG 126 NS + +IF KG +L+GKL PYL K + +F+G C + LVL P P+ L+ Sbjct: 271 NSFEGVG---TIFKKGDVLFGKLRPYLAKVVSPNFEGSCVNELLVLTPNRNDWNPKYLKY 327 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L+ D ++ GA M A W IG + + P EQ I I ET + I Sbjct: 328 RMLASDFISIVDNSTYGAKMPRASWNFIGTLKISKPNKTEQSEIVRFIEKETELVSKTII 387 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLN 212 + I L++E K AL++ VT ++ Sbjct: 388 TIAKEISLVEEYKTALIADAVTGKID 413 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 44/207 (21%), Positives = 89/207 (42%), Gaps = 11/207 (5%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 N + S I+W+ +P+HW+ ++ E K + + G + Q Sbjct: 3 NNHSYLNTSKIDWLNKIPNHWKEVRLGSVFNERKEKVSDKDFPPLSVTKNGIVPQ----L 58 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 K + ++V G+ + L G ++ + +KP I Sbjct: 59 ENAAKSNDGDNRKLVLSGDFAINSRSDRKGSSGLSIYD----GSVSLINIVLKPIDIIPV 114 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKF---EDVKRLPVLVPPIKEQFDITNVINVETAR 388 + +L++SY + +Y G G+ + L ++K + + +PP +EQ I N ++ +T + Sbjct: 115 FSQYLLKSYFFKEEYYRYGRGIVEDLWTTRYSEMKNMIIPLPPKQEQTTIANFLDYKTEK 174 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAV 415 I+ + K +Q I LL E++++ I AV Sbjct: 175 INRFITKKKQLIELLNEQKAAIINQAV 201 >gi|114047283|ref|YP_737833.1| restriction modification system DNA specificity subunit [Shewanella sp. MR-7] gi|113888725|gb|ABI42776.1| restriction modification system DNA specificity domain [Shewanella sp. MR-7] Length = 448 Score = 213 bits (543), Expect = 3e-53, Method: Composition-based stats. Identities = 99/428 (23%), Positives = 181/428 (42%), Gaps = 24/428 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 Y YKDSGV+W+G IP++WKV+ K + TG + K +GTGKY Sbjct: 33 PKYEAYKDSGVEWLGDIPQNWKVMRFKFLASITTGGKNTEDK-----------TGTGKYP 81 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI-IADFDGICSTQFLVLQPKDVLPEL 123 + + S + IL G + K + + + + Sbjct: 82 FFVRSQIPEKIDSYS-YDGEAILTAGDGAGVGKVYHYINGKFDFHQRVYKFSDFNEVIGQ 140 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRID 182 L ++ +T+ I + + P +Q LI I + +ID Sbjct: 141 YLFHYLYVNFFNVAVLGTAKSTVDSLRLPLIQDFQVCYPSDNWQQQLIVSYINKKAAQID 200 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I + + I LLKE+KQ ++ VT+GL+P+V MKDSG++W+G +P HWEV+ + Sbjct: 201 DAIAIKEQQISLLKERKQIIIQQAVTQGLDPNVPMKDSGVDWIGKIPAHWEVRRAKYIFD 260 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 E++ ++ E + + + E E Y ++ ++V + Sbjct: 261 EIDERSKNGDEELLSVSHMTGVTPRSEKNVSMFMAEDYTGSKLCIENDLVINIMWAWMGA 320 Query: 303 RSLRSAQVMERGIITSAYMAVK---PHGIDSTYLAWLMRSYDLCKVFYAMGSG---LRQS 356 + GI++ +Y + + + TYL +L++S + + + +G R Sbjct: 321 LGVSDRV----GIVSPSYGVFRQKLKNTFNPTYLEYLLKSVKYVEYYNKVSTGLHSSRLR 376 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + + P +EQ +I ++ +T RID+ ++ I LKE +++ I +AVT Sbjct: 377 FYGHMLFAMKMGYPSYEEQNEIMAYLHEQTKRIDLAIDSQLAQIEKLKEYKTTLINSAVT 436 Query: 417 GQIDLRGE 424 G+I + E Sbjct: 437 GKIKITPE 444 >gi|239828721|ref|YP_002951344.1| restriction modification system DNA specificity domain protein [Geobacillus sp. WCH70] gi|239809014|gb|ACS26078.1| restriction modification system DNA specificity domain protein [Geobacillus sp. WCH70] Length = 445 Score = 213 bits (543), Expect = 3e-53, Method: Composition-based stats. Identities = 110/444 (24%), Positives = 185/444 (41%), Gaps = 34/444 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKD 67 + KDSGV+WIG IP WK++ +K K + S +I+ + +E G Y K Sbjct: 4 KMKDSGVEWIGEIPSDWKILRLKNVLKERNEKNSPIKTNEILSLT---IEKGVIPYKEKK 60 Query: 68 --GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPEL 123 GN + D S + I+ + + I+ + G S + VL D Sbjct: 61 SGGNKAKEDLSNYKLAYPNDIVLNSMNVIVGAVGISKYYGCVSPVYYVLYSDDVEQNIRF 120 Query: 124 LQGWLLSIDVTQRIEAICEGATMSH------------ADWKGIGNIPMPIPPLAEQVLIR 171 S + + + G M + N+ +P+PP++ Q I Sbjct: 121 YNYLFQSSAFQKSLIGLGNGIMMKQSSTGKLNTIRLRIPLDRLKNVYLPVPPVSVQQKIV 180 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + + IDT+I + + IE LK+ KQ+L++ VTKGL+P+V+MKDSGIEWVG +P H Sbjct: 181 NFLDEKVSHIDTIIEKNKQSIEELKKYKQSLIAETVTKGLDPNVEMKDSGIEWVGEIPKH 240 Query: 232 WEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMG------LKPESYET 282 WE++ + K+ + E + + Y N+ K E + E+ Sbjct: 241 WEIRRLRDISIITRGTVDKSKEKNEIPVYLVQYTNVYYKREQKINDDDYLPITVSENEYK 300 Query: 283 YQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMR 338 V G+I+ D + + + S + + +D Y + M Sbjct: 301 KYKVRKGDILLTASSETKDDIGHSTVIVEDLPNHVFGSDIIRIRIPNKIVDLNYKKYFME 360 Query: 339 SYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 +Y F + G R + K L ++PPI+EQ I ++ T I+ L+ E Sbjct: 361 NYYYLAKFDKLSRGITRFRFGMDQFKSLKYVIPPIEEQVKIAKYLDNITNHINQLICNKE 420 Query: 398 QSIVLLKERRSSFIAAAVTGQIDL 421 + I L+ + S I VTG+ ++ Sbjct: 421 KLINELESYKKSLIYEYVTGKKEV 444 >gi|325289015|ref|YP_004265196.1| restriction modification system DNA specificity domain protein [Syntrophobotulus glycolicus DSM 8271] gi|324964416|gb|ADY55195.1| restriction modification system DNA specificity domain protein [Syntrophobotulus glycolicus DSM 8271] Length = 443 Score = 213 bits (543), Expect = 4e-53, Method: Composition-based stats. Identities = 103/441 (23%), Positives = 180/441 (40%), Gaps = 25/441 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR---------TSESGKDIIYIGLED 55 K Y YKDSG++WIG IP HW+V + + + + +D I + Sbjct: 2 KKYNSYKDSGIEWIGEIPGHWEVKKFGYISYMKGRIGWQGLKQAEFTSNPEDPFLITGMN 61 Query: 56 VESGTGKYLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAII----ADFDGICSTQ 110 G ++ + + + + +L+ K G + + ++ Sbjct: 62 FHDGKIRWDEVYHILEERYNEAPEIQLKESDVLFTKDGTIGKLLYVDSIPYPHKASLNSH 121 Query: 111 FLVLQP--KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 LVL+P P + L + +E G T + +G +P L EQ Sbjct: 122 LLVLRPLNNFYNPRFIYYQLKGLPFKHHVELTKTGTTFYGITQEAMGQYKALLPSLPEQT 181 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + +T ID LI ++ R +EL +E+K A+++ VTKG+NPD MKDSGIEW+G + Sbjct: 182 AIANYLDRKTAEIDELIADKKRLLELYEEEKTAIINQAVTKGINPDAPMKDSGIEWLGEI 241 Query: 229 PDHWEVKPFFALVTELNRK-------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 P+HWEVK + + K ++ + + + + ++ E Sbjct: 242 PEHWEVKRLKYVANIVLGKMLTTEDKGEYYLKPYLRAANLNWLSVNVDDVKEMWFSEREL 301 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 ++ +++ + + ++ E I S + DS Y L Y Sbjct: 302 NKYRLNRNDLLVSEGGEVG-RTCIWKEELEECYIQNSVHKVTLNDNSDSNYFLQLFYLYG 360 Query: 342 LCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 F + + + L E +K + + PP +EQ I + I E A ID + E+ I Sbjct: 361 KKGAFDLIVNKISIAHLTVEKLKEIKFITPPFEEQQSIVHHIKTECASIDAKKFRNEKLI 420 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 L E R++ I+ VTG+I + Sbjct: 421 EFLTEYRTALISEVVTGKIKV 441 >gi|307244176|ref|ZP_07526291.1| type I restriction modification DNA specificity domain protein [Peptostreptococcus stomatis DSM 17678] gi|306492326|gb|EFM64364.1| type I restriction modification DNA specificity domain protein [Peptostreptococcus stomatis DSM 17678] Length = 433 Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats. Identities = 121/433 (27%), Positives = 195/433 (45%), Gaps = 23/433 (5%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDI---------IYIGLEDV 56 Y +YKDSG+ WIG IP+HW V+ K F L TG + + YI +DV Sbjct: 2 KYEKYKDSGIDWIGEIPEHWGVIKFKYFADLFTGNSIPDEEKYMYEFKENGHPYIATKDV 61 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI-IADFDGICSTQFLVLQ 115 G+ +G + I L G + D + Sbjct: 62 YMD-GRINYDNGMIIPYEHKKFKIAPVNSTLMCIEGGSAGVKKSFLEEDVCFGNKLCCFN 120 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 K+ + + LS +R A+ + + + + N IP + EQ I + Sbjct: 121 VKEGFNKKYIFYFLSSPDYERYFAMNLNGLIGGVNIQRLKNFEAIIPSIVEQEKIAAYLD 180 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 +T +ID++I E E L+ K+ L++++VTKGLN +V MKDSG++W+G VP+HW+V+ Sbjct: 181 EKTEKIDSIIKELEDQREKLELYKRKLIAHVVTKGLNENVPMKDSGVDWIGAVPEHWKVE 240 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + R++ + E +LS++ I K +N G ESYE YQ+V+PG+ Sbjct: 241 KIKWNFEIVKRQDGR-EERPVLSITQQGIKIKDIEKNDGQMAESYEKYQLVEPGDYAMNS 299 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVFYAMGSG 352 +DL + G+ + Y + D+ Y +L + ++FY +G G Sbjct: 300 MDLLTGWIDCSKYE----GVTSPDYRVFRLKNSELNDNQYFNYLFQMCYTRRIFYRIGQG 355 Query: 353 L----RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + R L+ E + + VPP EQ +I N+I + +I + KI+ I L E R Sbjct: 356 VSNLGRWRLQREPFLNMEIPVPPTDEQKEIANLIKEKDLQIRKVDRKIKLQIEKLNEYRK 415 Query: 409 SFIAAAVTGQIDL 421 S I AVTG+I + Sbjct: 416 SIIHDAVTGKIKI 428 >gi|302037229|ref|YP_003797551.1| putative type I restriction system, specificity protein HsdS [Candidatus Nitrospira defluvii] gi|300605293|emb|CBK41626.1| putative Type I restriction system, specificity protein HsdS [Candidatus Nitrospira defluvii] Length = 452 Score = 213 bits (542), Expect = 5e-53, Method: Composition-based stats. Identities = 109/429 (25%), Positives = 172/429 (40%), Gaps = 32/429 (7%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG---- 61 YP YKDSGV W+G +P W + + + ++ + +V TG Sbjct: 7 PYPAYKDSGVPWLGEVPLTWSISRNGGLF-IQR-----NETGFAHLPILEVSLKTGVRVR 60 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVL 120 SD K + Y + + IA DG+ S ++V +P K V Sbjct: 61 NLDGSGRKQIMSDRDKYKRARKDDLAYNMMRMWQGAIGIAPTDGLVSPAYVVARPLKGVE 120 Query: 121 PELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 P + ++ G + W+G +P P+PP EQ I I Sbjct: 121 PRFFLNLFRTDAYMGEVDKFSHGIVKDRNRLYWEGFKQMPSPVPPPDEQAAIVRFIDHAD 180 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 RI I + + I+LL+E+KQA++ VT+GL+P+V++K SG+EW+G VP+HWE++ Sbjct: 181 RRIKCYIRAKQKLIKLLEEQKQAIIHRAVTRGLDPNVRLKPSGVEWLGDVPEHWEMRRLK 240 Query: 239 ALVTELNRK--NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 L + IE YG + T N G+ V Sbjct: 241 TLCRMRSGDGITAMAIEPVGDYPVYGGNGVRGYTSNFT------------HDGDFV---- 284 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 L + +L + RG ++ AV L W + + + + Sbjct: 285 -LIGRQGALCGNVHLARGRFWASEHAVVASLSSGYILEWFAAILMVMNLNQYSIAAAQPG 343 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L E V L + VPP +Q I I ET+ I+ +V + + I L E R+ IA VT Sbjct: 344 LAVERVLNLWLPVPPADDQKRIATQIEDETSDINQVVGRARREIEFLIEYRTRLIADVVT 403 Query: 417 GQIDLRGES 425 G+ D+R + Sbjct: 404 GKRDVREAA 412 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 48/210 (22%), Positives = 86/210 (40%), Gaps = 8/210 (3%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 L P KDSG+ W+G VP W + L + N + +SL G ++ L+ Sbjct: 5 LTPYPAYKDSGVPWLGEVPLTWSISRNGGLFIQRNETGFAHLPILEVSLKTGVRVRNLDG 64 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGID 329 + Y+ ++ + + + + G+++ AY+ +P G++ Sbjct: 65 SGRKQIMSDRDKYKRARKDDLAYNMMRMWQGAIGIAPTD----GLVSPAYVVARPLKGVE 120 Query: 330 STYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + L R+ G+ R L +E K++P VPP EQ I I+ Sbjct: 121 PRFFLNLFRTDAYMGEVDKFSHGIVKDRNRLYWEGFKQMPSPVPPPDEQAAIVRFIDHAD 180 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 RI + ++ I LL+E++ + I AVT Sbjct: 181 RRIKCYIRAKQKLIKLLEEQKQAIIHRAVT 210 >gi|114563773|ref|YP_751286.1| restriction modification system DNA specificity subunit [Shewanella frigidimarina NCIMB 400] gi|114335066|gb|ABI72448.1| restriction modification system DNA specificity domain [Shewanella frigidimarina NCIMB 400] Length = 462 Score = 213 bits (541), Expect = 5e-53, Method: Composition-based stats. Identities = 109/457 (23%), Positives = 192/457 (42%), Gaps = 34/457 (7%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK-----LNTGRTSES-------GKDIIY 50 YKAY +YKDSGV+W+ +P W+V+ +K K + G + K I Sbjct: 4 RYKAYSEYKDSGVEWLKLLPSTWQVLKVKFLLKNGSEGIKIGPFGSALKLEDMVEKGIRV 63 Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICS 108 G E++ + + + V G IL +G + ++ + GI Sbjct: 64 YGQENIIKRDFTLGKRFISQTKYKDMKVYTAEAGDILITMMGTSGKCQVVPENADLGIID 123 Query: 109 TQFLVLQPKDVLPELLQGWLLSI--DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 + L L+ + L L+ ++ +I +G+ M + + + P+P + E Sbjct: 124 SHLLKLRTNSKILPELFRLLVDEAQEIKDQISKQGKGSIMLGLNSSIVKELEFPLPSIEE 183 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q I + ET +ID LI ++ + IELLKEK+QA++S+ VTKGLNPD MK+SG+ W+G Sbjct: 184 QTQILCFLDHETAKIDDLIAKQEKLIELLKEKRQAVISHAVTKGLNPDSPMKNSGVVWLG 243 Query: 227 LVPDHWEVKPFFALVTELNR-----------KNTKLIESNILSLSYGNIIQKLETRNMGL 275 VP+HW V + + K+ ++ + + N L Sbjct: 244 EVPEHWVVCCLKHIKGKEKGSFVDGPFGSNLKSEHFVDDGDVYVIESNFATTGMLDTSKL 303 Query: 276 KPESYETYQIV-----DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 K S ++ + G I+ I + S+ + + + Sbjct: 304 KTISVAHFETISRSETKEGAIILAKIGARYGMNSILPCLPHKAVVSGNCLSLKINEKTMD 363 Query: 331 TYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + ++ + + + + +L + LP L PP KEQ +I + I Sbjct: 364 VLYCHQLLTHLKQEGAMDDGVNVTAQPALSLGQLNNLPFLSPPQKEQSEIASFIQQRDES 423 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +L+ K + I L KER+++ I+A +TG+ID+ S Sbjct: 424 FSILINKAIKLIELSKERKTALISAVLTGKIDVLDWS 460 >gi|52426224|ref|YP_089361.1| HsdS protein [Mannheimia succiniciproducens MBEL55E] gi|52308276|gb|AAU38776.1| HsdS protein [Mannheimia succiniciproducens MBEL55E] Length = 449 Score = 212 bits (539), Expect = 8e-53, Method: Composition-based stats. Identities = 101/461 (21%), Positives = 178/461 (38%), Gaps = 57/461 (12%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT--SESGKDIIYIGLEDVESGTGK 62 + Y +YK SGV+W+G +P+ W+V IK +L ++ +E K+ ++ +E ++ G Sbjct: 2 QKYDKYKPSGVEWLGDVPEGWEVTKIKYIAELTPKKSELTELDKECSFVPMEKLKLGNLV 61 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQP 116 + + + F +L K+ P + G S++ VL+ Sbjct: 62 LDETR--TISDVYNGYTYFEDNDLLIAKVTPCFENKNFVIAEKLVNGIGFGSSEIYVLRV 119 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKII 175 K+ L L L GA + + + N + +PPL EQ I + Sbjct: 120 KNCLNRYLFYRLQENTFMDLAIGSMTGAGGLKRIPSEFLNNYSIALPPLEEQTAIAHYLD 179 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN----------------------- 212 +T ID LI + +E L EK+ AL++ V L Sbjct: 180 QKTAYIDRLIDRQQTLLEKLSEKRTALITEAVCGRLPIAPYSASLKRGTGFDEENGSPNT 239 Query: 213 ------------PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 ++ +KDSGI+W+G VP+ WEV + + N ++ + Sbjct: 240 AQTAPLFSKEGLGEICLKDSGIQWLGKVPEGWEVIRL-RFLCNIQTGNMDTQDNEPDGIY 298 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + R+ E E ++ + + + G Y Sbjct: 299 PFYVRSPIIERSNNYTFEDDEA--------VLMAGDGVG--AGKVFHYVQGKYGCHQRVY 348 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + I +L + +R + K+ S++ +K P VPP+ EQ IT+ Sbjct: 349 SLNQFQNITGRFLFYYLREFFSRKIEEGGAKSTVDSVRLPMLKDFPTCVPPLSEQTTITH 408 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ ETA+ID L +IE I LKE R + I VTG++ + Sbjct: 409 YLDQETAKIDRLRTQIETVIERLKEYRMALITQVVTGKVKV 449 >gi|332975485|gb|EGK12375.1| type I restriction-modification system specificity determinant [Desmospora sp. 8437] Length = 461 Score = 212 bits (539), Expect = 1e-52, Method: Composition-based stats. Identities = 114/445 (25%), Positives = 198/445 (44%), Gaps = 31/445 (6%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----------------GKDI 48 + Y YK+S + WIG +P HW V+P+KR K N + Sbjct: 14 RKYGSYKESNIAWIGKVPVHWDVLPMKRLDKNNMEMAQTGPFGSHLHASDYMDSDLKNGV 73 Query: 49 IYIGLEDVESGTGK--YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII--ADFD 104 I ++ V +P+ S+ + S + K I++ ++G R A + + Sbjct: 74 PLILIKHVNDFKIIDHNMPRVSKSKAEELSVYKL-KKNDIVFSRVGTMGRVAPVTKKEEG 132 Query: 105 GICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP 162 + S Q L ++ KD+ + L L S T+ ++ + G+T + + N+ + P Sbjct: 133 WLISGQMLRLRIKSKDIDNQFLLYLLSSDISTKYLQLVSVGSTRDSINTDILRNMVIVRP 192 Query: 163 PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI 222 L EQ I + ET ++D L+ ++ R IELL+EK+QAL++ VTKGLNP+V MKDSGI Sbjct: 193 SLPEQQAIANFLDRETGKLDRLVEKKQRLIELLREKRQALITQAVTKGLNPNVPMKDSGI 252 Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI------IQKLETRNMGLK 276 EW+G VP+HW+V L + + IE I + G + Sbjct: 253 EWLGEVPEHWKVLKIKWLSKVKRGASPRPIEDPIYFDNNGEYAWVRIADVTSSNMYLKKT 312 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 ++ ++ + L + + I ++ + + ++ Sbjct: 313 SQTLSELGASLSVKLPPGKLFLSIAGSVGKPCISGIKCCIHDGFVYFPDLQENEKFFYYV 372 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + G + +L + V + VP IKEQ +I ++ +T++ID L+ K+ Sbjct: 373 FASGAPYGGLGKL--GTQLNLNTDIVGDIYTGVPEIKEQLEIVKYLDNQTSKIDTLISKL 430 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDL 421 + I +KE R + I+AAVTG+ID+ Sbjct: 431 QTQITKIKEYRQALISAAVTGKIDV 455 >gi|28199931|ref|NP_780245.1| type I restriction-modification system specificity determinant [Xylella fastidiosa Temecula1] gi|182682685|ref|YP_001830845.1| restriction modification system DNA specificity subunit [Xylella fastidiosa M23] gi|28058062|gb|AAO29894.1| type I restriction-modification system specificity determinant [Xylella fastidiosa Temecula1] gi|182632795|gb|ACB93571.1| restriction modification system DNA specificity domain [Xylella fastidiosa M23] gi|307578969|gb|ADN62938.1| restriction modification system DNA specificity subunit [Xylella fastidiosa subsp. fastidiosa GB514] Length = 444 Score = 212 bits (539), Expect = 1e-52, Method: Composition-based stats. Identities = 86/424 (20%), Positives = 164/424 (38%), Gaps = 23/424 (5%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 YP Y+ ++W+ A+P+HW K F + R+ +++ + + + T + Sbjct: 7 YPNYRQPKMRWLPAVPEHWNEQRAKTFFREVDERSKTGQEEL--LSVSHLTGVTSRSQKN 64 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + + + G I+ L ++ + GI S + V +P Sbjct: 65 VTMFKAASYVGSKLCRPGDIVINTLWAWMAALGASRHVGIVSPAYGVYRPHHADSFNPAY 124 Query: 127 WLLSIDVTQRIEAICEGAT-----MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + +T +I + PP EQ I + A+ I Sbjct: 125 LDYLLRTRAYVAEYIGRSTGIRSSRLRLYPNQFLDIALIQPPRPEQDQIVAYLRAQDAHI 184 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 I + I+LL E+K ++ + VT+GL+ V +K SGIEW+G VP H ++ + Sbjct: 185 ARFIKAKRDLIKLLTEQKLRIIDHAVTRGLDASVALKPSGIEWLGDVPVHCRIERLKWVC 244 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + GN+ +G+ ++ +V P ++ R Sbjct: 245 RFTYGDSLSDANR-----RQGNVPVYGSNGPVGM----HDVANVVGPCIVIGRKGSF--- 292 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + ++ I T+ ++ K + +L +++ L ++ L D Sbjct: 293 -GKVNYSESDLFAIDTTYFVDKKCTKANIRWLYYVLIWCRLDRISKD---SAVPGLDRTD 348 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 V VP EQ I +++ETA ++ + K+E+ I L++E R I VTGQ+D+ Sbjct: 349 ALNTLVPVPDGAEQEQIAKQLDIETAEVNDAITKVEEEITLIREYRDRLITDVVTGQVDV 408 Query: 422 RGES 425 RG Sbjct: 409 RGWQ 412 >gi|189485041|ref|YP_001955982.1| type I restriction-modification system substrate-binding subunit [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287000|dbj|BAG13521.1| type I restriction-modification system substrate-binding subunit [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 434 Score = 211 bits (538), Expect = 1e-52, Method: Composition-based stats. Identities = 111/430 (25%), Positives = 194/430 (45%), Gaps = 20/430 (4%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 Y +YK SG++WIG IPK+W V + R + K+ Y+ L G Y K Sbjct: 4 YSKYKPSGIEWIGDIPKNWNFVSCRLIVSERNERN-KGMKNNNYLSLMA-NIGVIPYEEK 61 Query: 67 D--GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ--PKDVLPE 122 GN + + I +G ++ + ++ I+ +DGICS ++VL K + P Sbjct: 62 GDIGNKKPENLEKCKIVYEGDLIINSMNYFIGSYGISKYDGICSPVYIVLYANTKVIEPR 121 Query: 123 LLQGWLLSIDVTQRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + ++ G +W + NI +P+P L EQ I + +T + Sbjct: 122 FAFRVFENPKFQGVAQSFGNGILEHRRAINWDILKNIKIPVPLLEEQRNILSFLDKKTEK 181 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID LI+++ + I+LL+E +Q+++S VTKGL+ V+MK SGIEW+G +P W+V F + Sbjct: 182 IDALISDKEKLIKLLREYRQSIISETVTKGLDKKVQMKHSGIEWIGDIPYDWKVNKFNRI 241 Query: 241 VTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPG 289 + R N KL + + ++ N + + + E I+ Sbjct: 242 IIRVSTGLNPRNNFKLGDGDCYYVTIKNFKKGKLFLDEKCDRMTKEALNIINERSDLKID 301 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+F I + + + + + V + + +L+ + Sbjct: 302 DILFSSIGEEAEAYLISEHPTNWNINESVFTIRVNKDLVLPNFFYYLIANKSFFNDLLKD 361 Query: 350 GSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +G +S+K + V VP +K Q +I N+++ +T +ID L+E I + I L+E R Sbjct: 362 ATGSTFKSIKINSLIEKKVPVPSLKTQKEIANLLDDKTEKIDNLIENITKQIKKLQEYRK 421 Query: 409 SFIAAAVTGQ 418 S I AVTG+ Sbjct: 422 SIIGEAVTGK 431 >gi|319775047|ref|YP_004137535.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae F3047] gi|317449638|emb|CBY85844.1| Putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae F3047] Length = 419 Score = 211 bits (537), Expect = 2e-52, Method: Composition-based stats. Identities = 109/432 (25%), Positives = 187/432 (43%), Gaps = 29/432 (6%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + Y YKDSGV W+G +P HW++ +K+ + + L GK + Sbjct: 2 RRYESYKDSGVDWLGEVPSHWELKRLKQLFVEKKHK--------QSLSLNCGAISFGKVI 53 Query: 65 PK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDV 119 K D ++ + KG+ L L + +++ D + S ++VL+ K + Sbjct: 54 EKSDDKVTEATKRSYQEVLKGEFLINPLNLNYDLISLRIALSEIDVVVSAGYIVLKEKQI 113 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + +LL ++ + G ++ I + + IPPL+EQ I + + +T Sbjct: 114 INKKYFSYLLHRYDVAYMKLLGSGV-RQTINYGHISDSILVIPPLSEQQKIAQFLDDKTA 172 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +ID + + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HWEV Sbjct: 173 KIDQAVDLAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWEVVSMKR 232 Query: 240 LVTELNRKNTKL----IESNILSLSYGNIIQKLETRNMGL------KPESYETYQIVDPG 289 +V E + + NI L N + + K + + IV Sbjct: 233 VVKEHSGNGFPIDLQGNNGNIPFLKVSNFSENQDKYIFKWNNSVTNKVIKQKKWNIVPKN 292 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 IV I K + + II + + ++ D + +L + D Sbjct: 293 SIVTAKIGEALRKNHRKILSI--DSIIDNNCLGIEIKKADVLFGYYLHCALDFD---LFT 347 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G SL + + +++PP +EQ +I + + +TA+ID + I LKE +S Sbjct: 348 NPGTIPSLAMDKYRNQKIVLPPFQEQQEIADYLEQQTAKIDQAIALKTAHIEKLKEYKSV 407 Query: 410 FIAAAVTGQIDL 421 I VTG++ + Sbjct: 408 LINDVVTGKVQV 419 >gi|229847074|ref|ZP_04467180.1| type I restriction-modification system S subunit [Haemophilus influenzae 7P49H1] gi|229810158|gb|EEP45878.1| type I restriction-modification system S subunit [Haemophilus influenzae 7P49H1] Length = 434 Score = 211 bits (537), Expect = 2e-52, Method: Composition-based stats. Identities = 104/435 (23%), Positives = 182/435 (41%), Gaps = 20/435 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + Y YKDSGV+W+G IP +W + + N R ++ K+ + L + K Sbjct: 2 RRYESYKDSGVEWLGKIPSYWDLTIGMNVFRENK-RDNKGMKEKTVLSLSYGQI-IIKPE 59 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVL 120 K T I I+ + +A GI ++ +L L+ + Sbjct: 60 EKLVGLVPESFETYQIVKPNDIIIRCTDLQNDQTSLRTGLAKDKGIITSAYLNLKVINNH 119 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + L ++ + + +P+ PL+EQ I + + +T + Sbjct: 120 SAKFLHYYLHTLDITKVLYKFGSGLRQNLSFLDFKRLPIIDIPLSEQQKIAQFLNDKTAK 179 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID + + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HW VK + Sbjct: 180 IDQAVDLAEKQIVLLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWNVKKLKYM 239 Query: 241 VTELNRKNTKLIESN-----------ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + K + + + N Q E +K + E IV Sbjct: 240 GYLYSGLTGKSADDFSKEVKEGFREFVPFTTICNFSQIKENVFQYVKVMNLENQNIVKKH 299 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFY 347 +++F + + S ++++ +++ S ++ +L+ S + F Sbjct: 300 DLLFLMSSETLEDIAKSSVYLLDQESFLNSFCKGFRFIEKHSSIFINYLINSNNYRAYFN 359 Query: 348 AMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +G G R ++K E V + VL+PP EQ I + ++ +T +ID + I LKE Sbjct: 360 LVGRGFTRINIKQEFVNSVYVLLPPFSEQQKIADYLDKQTTKIDQAIALKTAHIEKLKEY 419 Query: 407 RSSFIAAAVTGQIDL 421 +S I VTG++ + Sbjct: 420 KSVLINNVVTGKVQV 434 >gi|229520259|ref|ZP_04409685.1| type I restriction-modification system specificity subunit S [Vibrio cholerae TM 11079-80] gi|167832523|gb|ACA01833.1| type I site-specific restriction-modification system S subunit [Vibrio cholerae] gi|229342625|gb|EEO07617.1| type I restriction-modification system specificity subunit S [Vibrio cholerae TM 11079-80] Length = 458 Score = 211 bits (537), Expect = 2e-52, Method: Composition-based stats. Identities = 99/440 (22%), Positives = 187/440 (42%), Gaps = 22/440 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG----------KDIIY 50 +K Y YK+SG++W+ +P+ W+ +K + TG + Y Sbjct: 22 IKQMPKYESYKESGIEWLDEVPQTWQTSKLKYLASIFTGDSISPTLKDTYVSTELSGRAY 81 Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG-PYLRKAIIADFDGICST 109 I +D++ T + ++G D + + L G +K D Sbjct: 82 IASKDIDVQTSRIDYENGVRIPFDRRHFKVAPEQSTLLCIEGGSAGKKIAYTAQDVCFVN 141 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 + + + VL + L L S + + G + I N + +PP EQ+ Sbjct: 142 KLACIASEKVLNKYLYYSLFSEPFQSQFKLSMSGL-IGGVSVSSINNFIVVVPPEKEQIR 200 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + + +++ I + + IE L+E+K ++ VT+GL +V M+DSG++W+G +P Sbjct: 201 IVSYLDKKVSQLNEAIYIKQQQIERLRERKHVIIQQAVTQGLETNVPMQDSGVDWIGEIP 260 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 HW + F L T+ K ++ + YQ + G Sbjct: 261 KHWGIVRFKNLFTQSRLPVRKGDGVVTSYRDGQVTLRSNRRVGGYTEAILEGGYQGIRKG 320 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVF 346 ++V +D + + G T Y+ P + Y A+L+R L K Sbjct: 321 QLVLNSMDAFEGAIGVSDSD----GKCTPEYVICDPINSVNVSQYYFAYLLREMALAKYI 376 Query: 347 YAMGSGLRQS---LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + +RQ +++ ++ L ++VPP+KEQ DI + I E+A++D ++ + + I L Sbjct: 377 QVICNAVRQRAVRIRYNNLAPLFMVVPPVKEQEDIVSFIEKESAKLDAGIKHLNEQISKL 436 Query: 404 KERRSSFIAAAVTGQIDLRG 423 KE +++ I +AVTG+I + Sbjct: 437 KEYKTTLINSAVTGKIKVTE 456 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 85/232 (36%), Gaps = 9/232 (3%) Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--TKL 251 L K + + K + K+SGIEW+ VP W+ L + + L Sbjct: 8 LWLRFKGKRMIDTMIKQMPKYESYKESGIEWLDEVPQTWQTSKLKYLASIFTGDSISPTL 67 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF-------RFIDLQNDKRS 304 ++ + + G + ++ YE + F + ++ Sbjct: 68 KDTYVSTELSGRAYIASKDIDVQTSRIDYENGVRIPFDRRHFKVAPEQSTLLCIEGGSAG 127 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + A + + + + + YL + + S F SGL + + Sbjct: 128 KKIAYTAQDVCFVNKLACIASEKVLNKYLYYSLFSEPFQSQFKLSMSGLIGGVSVSSINN 187 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 V+VPP KEQ I + ++ + ++++ + +Q I L+ER+ I AVT Sbjct: 188 FIVVVPPEKEQIRIVSYLDKKVSQLNEAIYIKQQQIERLRERKHVIIQQAVT 239 >gi|126434812|ref|YP_001070503.1| restriction modification system DNA specificity subunit [Mycobacterium sp. JLS] gi|126234612|gb|ABN98012.1| restriction modification system DNA specificity domain [Mycobacterium sp. JLS] Length = 451 Score = 210 bits (535), Expect = 3e-52, Method: Composition-based stats. Identities = 94/449 (20%), Positives = 175/449 (38%), Gaps = 27/449 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKL----NTGRTSESGKDIIYIGLEDV 56 M + +YP+Y DSGV+W+G +P W V P+K + + ++ + DV Sbjct: 1 MS-WPSYPRYNDSGVEWLGRVPSGWAVSPLKNVATVFPSSVDKHSHDNEIPVQLCNYTDV 59 Query: 57 ESGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLGPYLRKAIIADF------DGICS 108 D + + +G + K I+ + D +C Sbjct: 60 YKNERISGALDFMKATATPEEIKKFTLKQGDTIITKDSETADDIGISAYVEETLPDVLCG 119 Query: 109 TQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 V++P L ++ S + +E G T I N+ +P+PP EQ Sbjct: 120 YHLSVVRPLPGLDGRFVKRLFDSHYLKASMEVSANGLTRVGLGQYAIDNLNIPLPPPDEQ 179 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 + I + + AET +ID LI ++ I L+E + A +++ VTKGL+P V M + Sbjct: 180 LQIADFLEAETAKIDALIAKQEHLIATLREDRTATITHAVTKGLDPTVDMVQPHNSELPA 239 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE------ 281 P HW + + E+ T + ++ + G+ + + Sbjct: 240 CPKHWTLLISLKRLAEVQTGLTLGKSVDPAEAVDVPYLRVANVQTSGVNLDEVKTVAVHR 299 Query: 282 ---TYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSA-YMAVKPHGIDSTYLAWL 336 ++ G+++ D+ R + + I + + + +L +L Sbjct: 300 SELKRYLLRDGDVLMTEGGDIDKLGRGCVWSGEIAPCIHQNHVFAVRCSDALSGDFLVYL 359 Query: 337 MRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + F+ + S + +PP EQ +I + +N A +D L+ Sbjct: 360 LDTAVARNYFFMTAKKTTNLASTNSTTLGAFTFSLPPRAEQDEIVDHLNERCAGLDALIA 419 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 K I +L+E R++ I AVTG+ID+RG Sbjct: 420 KANAVITVLREYRAALITDAVTGKIDVRG 448 >gi|293115630|ref|ZP_05792396.2| putative type I restriction-modification system [Butyrivibrio crossotus DSM 2876] gi|292809171|gb|EFF68376.1| putative type I restriction-modification system [Butyrivibrio crossotus DSM 2876] Length = 441 Score = 210 bits (534), Expect = 4e-52, Method: Composition-based stats. Identities = 98/427 (22%), Positives = 181/427 (42%), Gaps = 14/427 (3%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 + KDSG++W+G IP++WKV+ K +L+ + + L + G Sbjct: 16 EMKDSGIEWVGKIPENWKVLKNKYNFELSKEIIGTKWVETQLLSLTKYGVKAINDGEQTG 75 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKA--IIADFDGICSTQFLVLQPKDVLPELLQG 126 ST K I+ I++FDG+ S + ++ K L Sbjct: 76 KV-PESLSTYQKVNKDDIVMCLFDLDCSAVFSGISNFDGMISPAYKCIRCKPHLCPQYVD 134 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + R N+P+ +PP+ Q I E + + IDTL + Sbjct: 135 YYFRTVFVDRKYKRYSKNVRFSISSDEFMNLPIIVPPIDIQKKIAEFLNFKCFEIDTLHS 194 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA--LVTEL 244 + + I+ L+E K+++++ VTKGL+PDV+MKDSGI ++G +P HW+V Sbjct: 195 DIEKQIKTLEEYKKSIITEAVTKGLDPDVEMKDSGISYIGNIPKHWKVTNLKYLGKCQNG 254 Query: 245 NRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 K + + +SYG++ + + + ++ + V G++ F Sbjct: 255 ISKGGEYFGNGFPFVSYGDVYKNYSIPQNVDGLIMSTKTEQNIYSVKYGDVFFTRTSETI 314 Query: 301 DKRSLRSA--QVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGS-GLRQ 355 ++ S + ++ + + +P D + + RS K F + R Sbjct: 315 EEIGFASTCLKSIDNSVFAGFLIRFRPTSSDLIPEFSKFYFRSNIHRKFFVKEMNLVTRA 374 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 SL + RLPVL+PP+ EQ I + + A ID +E+ ++ + L++ + S I V Sbjct: 375 SLSQNLLGRLPVLLPPLCEQQMIAKNLEKKCAEIDGAIEEKKEQLETLEQYKKSLIYEYV 434 Query: 416 TGQIDLR 422 TG+ +++ Sbjct: 435 TGKKEVK 441 >gi|50086399|ref|YP_047909.1| putative type I restriction-modification system specificity determinant for hsdM and hsdR (HsdS) [Acinetobacter sp. ADP1] gi|49532375|emb|CAG70087.1| putative type I restriction-modification system specificity determinant for hsdM and hsdR (HsdS) [Acinetobacter sp. ADP1] Length = 448 Score = 210 bits (533), Expect = 5e-52, Method: Composition-based stats. Identities = 95/443 (21%), Positives = 187/443 (42%), Gaps = 23/443 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLN-----TGRTSESGKDII-YIGLE 54 M Y YK+SGVQW+G IP HW+V +K KD YI + Sbjct: 1 MSQLPCYESYKNSGVQWLGEIPSHWEVKRMKFLLSEKLKYGANESAESEDKDQPRYIRIT 60 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQF 111 D+ + +G S + + + + IL + G + K+ + D + C + Sbjct: 61 DI-NDSGTLREDTFKSLEIEKAQEYLLNDLDILLARSGATVGKSYLHKKDKVNVACYAGY 119 Query: 112 LV---LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 L+ ++ P+ + +L S IE++ AT+ + + ++ + IP LAEQ Sbjct: 120 LIRARFNKENYDPQFINLFLQSKAYWSWIESVNIQATIQNVSAEKYNDLALSIPSLAEQK 179 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 +I + + ++D LI ++ +E L E++ AL+S+ VTKGLNPDV+MK+S + +G + Sbjct: 180 IIADFLDKRLAQVDALIAKQETLLEKLAEQRVALISHAVTKGLNPDVEMKESDVVLLGNI 239 Query: 229 PDHWEVKPFF-------ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 P+ W +K + ++ + ++ + L+ Sbjct: 240 PNTWNIKRLKFLLSEKLKYGANESAESEDKENPRYIRITDIDDSGNLKDETFKSLESEKA 299 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRS 339 ++D +I+ K L A+ + + + + ++ + ++S Sbjct: 300 QEYLLDDLDILLARSGATVGKSYLYKAESVGIACYAGYLIRARLDQENYNPEFVNYFLQS 359 Query: 340 YDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 ++ Q++ E L + +P ++EQ + + E + + + K ++ Sbjct: 360 KQYWDWISSINIQATIQNVSAEKYNDLTLAIPSLEEQKQLIEYLKNEDEKFNRAISKGKK 419 Query: 399 SIVLLKERRSSFIAAAVTGQIDL 421 + LL E RS+ I VTG+ID+ Sbjct: 420 LVHLLNEYRSTLITQVVTGKIDV 442 >gi|124485664|ref|YP_001030280.1| hypothetical protein Mlab_0842 [Methanocorpusculum labreanum Z] gi|124363205|gb|ABN07013.1| restriction modification system DNA specificity domain [Methanocorpusculum labreanum Z] Length = 446 Score = 210 bits (533), Expect = 5e-52, Method: Composition-based stats. Identities = 89/422 (21%), Positives = 159/422 (37%), Gaps = 15/422 (3%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 K Y +Y D+G WI +PK W+ PI T L+ R + + + Sbjct: 3 KGYEEYMDTGYDWIPQVPKTWEQRPIHSITTLSNERNGKRKDLELLSVYREFGVIKKSSR 62 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + N D S G ++ K+ + I+ ++GI S ++V + + Sbjct: 63 DDNHNVESQDLSNYKYVNSGYLVMNKMKMWQGSLGISQYEGIVSPAYIVCKVDQDIIGKY 122 Query: 125 QGW---LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + + + N+ + +P EQ I + A+ +I Sbjct: 123 LHYLLRSSHFKIFYNRISYGVRVGQWDLRYNDLKNLKIYLPTSDEQNQIVRYLNAKVAKI 182 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 + LI+ + + I LLKE KQA+++ VTKG+ V MK+SG+EW+G +P+ WE + L Sbjct: 183 NRLISAKKKEIALLKEYKQAIITRAVTKGICAGVPMKESGVEWIGEIPEGWEERKLKYLC 242 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + N P+ GE V D Sbjct: 243 SINTGDKD-----------TINRNDDGLYPFYVRSPKIEHIDTYSFDGEAVLMAGDGVG- 290 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + + Y + Y+ + ++ K+ A S++ Sbjct: 291 AGKVFHYVSGKFDYHQRVYNLHYFKDVCGKYIYYYLKENFWRKIEEASAKSTVDSVRLPM 350 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + PV+ I EQ I + ++ + + ID ++K E +I L S I VTG++D+ Sbjct: 351 LLEFPVVFGQIGEQQQIVSYLDAKCSAIDATIQKRELAIEKLTAYNQSLIYECVTGKVDV 410 Query: 422 RG 423 RG Sbjct: 411 RG 412 >gi|264677663|ref|YP_003277569.1| hypothetical protein CtCNB1_1527 [Comamonas testosteroni CNB-2] gi|262208175|gb|ACY32273.1| hypothetical protein CtCNB1_1527 [Comamonas testosteroni CNB-2] Length = 429 Score = 209 bits (532), Expect = 6e-52, Method: Composition-based stats. Identities = 92/430 (21%), Positives = 161/430 (37%), Gaps = 19/430 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG--K 62 + Y YK S W+G +P HW V P++ T L + + D+ + + E G Sbjct: 2 QRYESYKPSEATWLGNVPSHWDVQPLRAVTSLKSDKNRP---DLPVLSV-YREYGVILKD 57 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + N+ DTST + G ++ K+ + ++ GI S ++ K Sbjct: 58 SRDDNHNATSLDTSTYKVVKPGDLVVNKMKAWQGSMGVSSHHGIVSPAYITCTTKADRAR 117 Query: 123 LLQGWL----LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + + ++ IP+P+PP EQ I + +T Sbjct: 118 PAYLHYLLRSSPLIGVYNSLSYGVRVGQWDMHYEDFKQIPIPLPPNDEQDRIVAFLDQKT 177 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 ID I ++ R LLKE++ L++ VTKGL+P+ M W+ P HW++ Sbjct: 178 AEIDAAIEKKERLASLLKEQQFKLINLAVTKGLDPNAAMTCGRSPWIESYPAHWQLMRIK 237 Query: 239 AL---VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + + + K + E + + ++ E K TY+ I Sbjct: 238 HVLRAIVDTEHKTPPMYEEGPALMVRTSNVKNGELVFKNAKYTDELTYRRWTRRAIPVAG 297 Query: 296 IDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 L + A V+ GI + V P +D + + S + Sbjct: 298 DILFTREAPAGEACVLPDGIKAAIGQRMVLFKVDPERLDPHFAVHSIYSGAAKAFIELLS 357 Query: 351 -SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 D+ +P+L+PP++EQ I I + L++ I L+E + + Sbjct: 358 VGSTVAHFNMSDIGNIPLLLPPLQEQQKIAVGIKSIQRQFQPLIDSAANGIEQLQELKRT 417 Query: 410 FIAAAVTGQI 419 IA+AV GQI Sbjct: 418 LIASAVLGQI 427 >gi|188535437|ref|YP_001909234.1| type I restriction-modification system, specifity subunit [Erwinia tasmaniensis Et1/99] gi|188030479|emb|CAO98373.1| type I restriction-modification system, specifity subunit [Erwinia tasmaniensis Et1/99] Length = 435 Score = 208 bits (530), Expect = 1e-51, Method: Composition-based stats. Identities = 97/435 (22%), Positives = 178/435 (40%), Gaps = 17/435 (3%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M Y YK+S + WI IP W++ K R+ + ++ + + Sbjct: 4 MAELPKYEAYKESCLNWIDTIPYDWELKRFKYILDEINLRSKTGKETLLSLSKYNGVLPK 63 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV- 119 + G ++ K+ ++ +GI S + V + K+ Sbjct: 64 DSLEERSGC--AETLVGYKRVGIKDLVINKMQAVNGLLAVSRIEGITSPDYSVYRSKNNL 121 Query: 120 --LPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + L LL + + G + + +I +P + Q +I + + Sbjct: 122 ILNIDFLGYLLLQPEYIGEFKKRVTGVMEGFIRLYTEDLYSIHAILPDVKTQFIIVKYLD 181 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 ++ +ID I + + I LLKE+KQ ++ VT+GL+P+V+MKDSG++W+G +P HWE++ Sbjct: 182 KKSAQIDEAIKIKQQQITLLKERKQIIIQKAVTQGLDPNVQMKDSGVDWIGKIPVHWEIR 241 Query: 236 PFFALVTELNRKNTKLIESNILSLSYG----NIIQKLETRNMGLKPESYETYQIVDPGEI 291 L T+ K + +YG + L + + + + V+ + Sbjct: 242 RSKFLFTQRKEKALNDDVQLSATQAYGVIPQEKYEALTGKRVVKIQFHLDKRKHVEKDDF 301 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 V Q L A I +S + ID + +L++ S Sbjct: 302 VISMRSFQG---GLERAWSCG-CIRSSYVVLKALQNIDPLFYGYLLKLPSYIAALQQTAS 357 Query: 352 GLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +R Q L F++ R+ + +PP++EQ I N + D + IEQ I LKE +++ Sbjct: 358 FIRDGQDLNFDNFSRVDLFIPPLEEQTAIANYVESFLTSSDEAMNLIEQQIEKLKEYKTT 417 Query: 410 FIAAAVTGQIDLRGE 424 I +AVTG+I + E Sbjct: 418 LINSAVTGKIKITPE 432 >gi|309780966|ref|ZP_07675705.1| type I restriction enzyme, S subunit [Ralstonia sp. 5_7_47FAA] gi|330824638|ref|YP_004387941.1| hypothetical protein Alide2_2050 [Alicycliphilus denitrificans K601] gi|308920269|gb|EFP65927.1| type I restriction enzyme, S subunit [Ralstonia sp. 5_7_47FAA] gi|329310010|gb|AEB84425.1| hypothetical protein Alide2_2050 [Alicycliphilus denitrificans K601] Length = 474 Score = 208 bits (530), Expect = 1e-51, Method: Composition-based stats. Identities = 87/439 (19%), Positives = 165/439 (37%), Gaps = 21/439 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 YP Y+ +W+ +P+HW ++ K F + R+ + ++ + ++ Sbjct: 6 PYPNYQPLRSRWVPRVPEHWSLLRAKNFLREIDDRSKAGEETLLSMRMQRGLVPHNDVSV 65 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---- 121 K + +++ ++ G+ S + V + Sbjct: 66 KRIA--PENLIGYKKAQPDELVLNRMQAGNAMFFRNRQPGLVSPDYAVFRLLRDDNPEYL 123 Query: 122 -ELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L + W + + + G + + +P+PP EQ I + A+ Sbjct: 124 GHLFRSWPMRGLFRSESKGLGTGTSGFLRLYSDRFTALEIPLPPRPEQDQIVAYLRAQDA 183 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 I I + I+LL E+K ++ + VT GL+ V +K SGIEW+G VP+HWEV Sbjct: 184 HIARFIQVKRDLIKLLTEQKLRIIDHAVTHGLDASVTLKPSGIEWLGEVPEHWEVAFIKH 243 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG--------LKPESYETYQIVDPGEI 291 + K + + N + + E+ + G++ Sbjct: 244 IADVRFSGVDKHSHDHETPVRLCNYTDVYKNDRITGDMDLMRATATEAEIARLTLKAGDV 303 Query: 292 VFRFIDLQNDKRSLRSAQVME----RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 + D + + + + P+ + +L + S + F+ Sbjct: 304 ILTKDSETPDDIGVPAWVPEDLPGVVCAYHLGLLRPVPNRVLGEFLFRAIGSARTAQQFH 363 Query: 348 AMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + +G R +L DVK V +PP++EQ I I E +D + + E+ I L++E Sbjct: 364 VLATGVTRFALGKHDVKNAVVALPPVEEQQSICRWITNECQPLDDAIARTEEEIKLIREY 423 Query: 407 RSSFIAAAVTGQIDLRGES 425 R IA VTGQ+D+RG Sbjct: 424 RDRLIADVVTGQVDVRGWQ 442 >gi|188585426|ref|YP_001916971.1| restriction modification system DNA specificity domain [Natranaerobius thermophilus JW/NM-WN-LF] gi|179350113|gb|ACB84383.1| restriction modification system DNA specificity domain [Natranaerobius thermophilus JW/NM-WN-LF] Length = 441 Score = 208 bits (529), Expect = 1e-51, Method: Composition-based stats. Identities = 118/438 (26%), Positives = 206/438 (47%), Gaps = 18/438 (4%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT---GRTSESGKDIIYIGLEDVE 57 M+ +K Y +YKDSG++W+G +P HW + + +TK R + GK + Y + +E Sbjct: 1 MEKFKQYKKYKDSGIEWLGKVPSHWDINRMDAYTKYYKKSIEREALRGKTVFYYSIPAIE 60 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVL 114 + + N + KL P + I ICS++F+ L Sbjct: 61 ETGDGVVEEGSNIDSNKLLLKGEELL----VSKLNPRKGRIIPTKEKEMPIICSSEFVPL 116 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIRE 172 P++ E ++ S V Q++ + + AT + + I I P +EQ I + Sbjct: 117 VPRNCSREFIRYIYQSELVKQKLSSAVQSATNSHQRVNPRDISKIYFAFPSKSEQDNIVK 176 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHW 232 + ++T +ID+LI ++ IE L+E KQ+L+++ VTKGL+P+VKMKDSG+EW+G VP+HW Sbjct: 177 YLNSKTSQIDSLINKKQNLIEKLQEYKQSLITHTVTKGLDPNVKMKDSGVEWIGEVPEHW 236 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 E+ L+ N + + + + L T N L + + E + Sbjct: 237 EILKGKYLLDIYNGYPPEELSLSANGQVKYIQVDDLNTENDELVIKDSKLKLKNKKTEAL 296 Query: 293 FRFIDLQNDKRSLR----SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 I L + + ++++G+I S M +KP + + +L+ KV Sbjct: 297 DHPIILIPKRGAAIFTNKVKILVDKGLIDSNIMGLKPKK--NCNIHYLVYMIKARKVDDI 354 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + + + LP+ +PPI+EQ I ++ + I+ + I+ +I LKE R Sbjct: 355 ADTSTIPQINNKHINPLPLTIPPIEEQNKIAEYLDEKVDNINNCILNIKVAIQKLKEYRQ 414 Query: 409 SFIAAAVTGQIDLRGESQ 426 S I AVTG+ID+R + Sbjct: 415 SLITHAVTGKIDVRDWAD 432 >gi|167039866|ref|YP_001662851.1| restriction modification system DNA specificity subunit [Thermoanaerobacter sp. X514] gi|300915378|ref|ZP_07132692.1| restriction modification system DNA specificity domain protein [Thermoanaerobacter sp. X561] gi|307724809|ref|YP_003904560.1| restriction modification system DNA specificity domain-containing protein [Thermoanaerobacter sp. X513] gi|166854106|gb|ABY92515.1| restriction modification system DNA specificity domain [Thermoanaerobacter sp. X514] gi|300888654|gb|EFK83802.1| restriction modification system DNA specificity domain protein [Thermoanaerobacter sp. X561] gi|307581870|gb|ADN55269.1| restriction modification system DNA specificity domain protein [Thermoanaerobacter sp. X513] Length = 463 Score = 208 bits (528), Expect = 2e-51, Method: Composition-based stats. Identities = 83/433 (19%), Positives = 173/433 (39%), Gaps = 20/433 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 K YP+YK++ W+ +IP HW+ I+ + + S+ + + G Sbjct: 3 KPYPKYKETPALWLNSIPNHWESHKIRELFVERSEKVSDKDYSPLSVS------KAGVVP 56 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-EL 123 ++ ++ + KG + + I+++DG S +VL+P+ + Sbjct: 57 QIATVAKTNNGDNRKLVIKGDFVINSRSDRRGSSGISNYDGSVSLINIVLKPRSFVNGRY 116 Query: 124 LQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + L S + G + + + +I +P+P + EQ I + + +I Sbjct: 117 MHYLLKSHYFIEEFYRNGRGIVADLWTTRYTEMKSIYLPVPSIEEQDQIVRFLDWKLAKI 176 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 + LI + + I LL E ++A + ++ G+NP K+SG+ W+G +P HW V + Sbjct: 177 NKLIQAKKKQIALLTEYRKATIDNVIMYGINPHANRKESGVIWLGEIPSHWSVMKLKRIC 236 Query: 242 TELNRKNTKLI----ESNILSLSYGNIIQKLETR--NMGLKPESYETYQIVDPGEIVFRF 295 ++L E ++ L NI + + + +++ Sbjct: 237 RINASITSQLEKYSLEDYVVFLPMENISSDGKIDCCEKRKLKDVRNGFSSFAKNDVIVAK 296 Query: 296 IDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMG 350 I N K + G T+ + ++ + ++ ++ + + G Sbjct: 297 ITPCFENGKGACLDTLETNIGFGTTELIVLRANEKVLPRYLYMITQLQQFRIEGANVMTG 356 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 S ++ + + + +P I EQ +I ++ A+ D L E + + I LL E R Sbjct: 357 SAGQKRVPSSFISNFELGIPSIAEQSEILEYLDNRLAKFDKLYETLNREIELLTEYRIRL 416 Query: 411 IAAAVTGQIDLRG 423 I+ VTG++D+R Sbjct: 417 ISDVVTGKVDVRD 429 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 45/204 (22%), Positives = 87/204 (42%), Gaps = 10/204 (4%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 L P K K++ W+ +P+HWE L E + K + S + G + Q Sbjct: 2 LKPYPKYKETPALWLNSIPNHWESHKIRELFVERSEKVSDKDYSPLSVSKAGVVPQ---- 57 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 K + + ++V G+ V D+R + + + ++ Sbjct: 58 IATVAKTNNGDNRKLVIKGDFVINSRS---DRRGSSGISNYDGSVSLINIVLKPRSFVNG 114 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKF---EDVKRLPVLVPPIKEQFDITNVINVETA 387 Y+ +L++S+ + FY G G+ L ++K + + VP I+EQ I ++ + A Sbjct: 115 RYMHYLLKSHYFIEEFYRNGRGIVADLWTTRYTEMKSIYLPVPSIEEQDQIVRFLDWKLA 174 Query: 388 RIDVLVEKIEQSIVLLKERRSSFI 411 +I+ L++ ++ I LL E R + I Sbjct: 175 KINKLIQAKKKQIALLTEYRKATI 198 >gi|307826306|ref|ZP_07656513.1| restriction modification system DNA specificity domain protein [Methylobacter tundripaludum SV96] gi|307732662|gb|EFO03532.1| restriction modification system DNA specificity domain protein [Methylobacter tundripaludum SV96] Length = 435 Score = 208 bits (528), Expect = 2e-51, Method: Composition-based stats. Identities = 107/435 (24%), Positives = 186/435 (42%), Gaps = 27/435 (6%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVES 58 Y YKDSGV+W+G IP+HW++ +K ++G + ++ + + D+ Sbjct: 9 PKYEIYKDSGVEWLGEIPEHWEIKRLKFIIAEHSGNGFPVEEQGKHTGELPFYKVSDI-G 67 Query: 59 GTGKYLPKDGNSRQSDTSTV---SIFAKGQILYGKLGPYLRKAI--IADFDGICSTQFLV 113 G Y+ N T+ ++ G ++ K+G LRK I+ I + Sbjct: 68 GDSMYISHASNYVNFKTAKKLKWNLIPSGSLITAKIGEALRKNHRKISTSSSIIDNNCIA 127 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + ID + + +P+P EQ I Sbjct: 128 FEAVSIGVVFNYYLHKVIDFDW----FTNPGAVPCISVPKYKSFHIPLPAFTEQTAIAAF 183 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + +T ++D + + + I L KE KQ L+ VT+ LNPD M+DSG+EW+G +P HW Sbjct: 184 LDRKTAQLDQAVAIKEKQITLFKEHKQILIQNAVTRSLNPDAPMRDSGVEWLGKIPAHWA 243 Query: 234 ---VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + F E L+ +I + I + E +K E Y +V G+ Sbjct: 244 ILANRVIFRERVEPGEDGLPLLSVSIHTAVSSEEISEDENIRGRIKIEDKTKYSLVQIGD 303 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAM 349 I F + +G+++ AY+ P+ S+Y + R + + Sbjct: 304 IAFNMMRAWQGAIGAVKI----KGMVSPAYIVAVPNEKIVSSYFEYQYRCPEFIQQMDRY 359 Query: 350 GSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 G+ R+ L + + K+L +VPP++EQ I I E+A+ID + +Q I LKE Sbjct: 360 SKGITDFRKRLYWNEFKQLVTVVPPVEEQTAIVTHIETESAKIDQAISIQQQQIDKLKEY 419 Query: 407 RSSFIAAAVTGQIDL 421 +++ I +AVTG+I + Sbjct: 420 KATLINSAVTGKIKV 434 >gi|56459752|ref|YP_155033.1| restriction endonuclease S subunit [Idiomarina loihiensis L2TR] gi|56178762|gb|AAV81484.1| Restriction endonuclease S subunit [Idiomarina loihiensis L2TR] Length = 448 Score = 206 bits (524), Expect = 5e-51, Method: Composition-based stats. Identities = 112/416 (26%), Positives = 177/416 (42%), Gaps = 19/416 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ WK++ +K + TG S I + D+++ S Sbjct: 20 LPERWKLIKLKLVCNIETGFAFPSEVFGETGTPVIRITDIKNREINLSEIKRVDDLLLKS 79 Query: 77 TVSI--FAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 KG I+ G + K D + + P + L L S Sbjct: 80 KPKRPSVNKGDIIMAMTGATIGKVGYYNSDKPSYLNQRVCRFIPASIDRGYLWHTLNSEI 139 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVRIDTLITERIRF 191 + IE G ++ + N P P+P L EQ I + + ET +ID LI E+ R Sbjct: 140 YKKYIELEAFGGAQANISDSQLLNFPAPLPELEAEQQKIAQFLDYETAKIDALIDEQKRL 199 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 IELLKEK+QA++S+ VTKGLNPD MKDSGIEW+G VP+HWE+K L+ K Sbjct: 200 IELLKEKRQAVISHAVTKGLNPDAPMKDSGIEWLGEVPEHWEIKKLKFCSRMLSDKGKDN 259 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + L I+ + + + + +P +I+F + K L Sbjct: 260 TNAISLE-----NIENGTGAFIKTESNFDQEGVLFEPLDILFGKLRPYLAKVYLAR---E 311 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP 370 + + I +L + + S + + G E +K L + VP Sbjct: 312 HGSALGDILVFRANKDISPEFLFFRLISQEFIRQVDQSSYGSKMPRANPELIKSLQIAVP 371 Query: 371 PIKEQFDITNVI-NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 PI+EQ +++ + N++ +I V + LL+ERRS+ I+AAVTG+ID+R Sbjct: 372 PIEEQVKVSDYLANLQFNKIMPSVINASSLVKLLEERRSALISAAVTGKIDVRDWQ 427 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 56/205 (27%), Positives = 109/205 (53%), Gaps = 9/205 (4%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 KDSG++W+G +P+HW++ +K +++ + + + I LE++E+GTG ++ + N Sbjct: 225 MKDSGIEWLGEVPEHWEIKKLKFCSRMLSDK---GKDNTNAISLENIENGTGAFIKTESN 281 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWL 128 Q +F IL+GKL PYL K +A G LV + + PE L L Sbjct: 282 FDQEGV----LFEPLDILFGKLRPYLAKVYLAREHGSALGDILVFRANKDISPEFLFFRL 337 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI-IAETVRIDTLITE 187 +S + ++++ G+ M A+ + I ++ + +PP+ EQV + + + + +I + Sbjct: 338 ISQEFIRQVDQSSYGSKMPRANPELIKSLQIAVPPIEEQVKVSDYLANLQFNKIMPSVIN 397 Query: 188 RIRFIELLKEKKQALVSYIVTKGLN 212 ++LL+E++ AL+S VT ++ Sbjct: 398 ASSLVKLLEERRSALISAAVTGKID 422 >gi|71276008|ref|ZP_00652290.1| putative type I restriction enzyme, S subunit [Xylella fastidiosa Dixon] gi|71899046|ref|ZP_00681211.1| putative type I restriction enzyme, S subunit [Xylella fastidiosa Ann-1] gi|71163241|gb|EAO12961.1| putative type I restriction enzyme, S subunit [Xylella fastidiosa Dixon] gi|71731159|gb|EAO33225.1| putative type I restriction enzyme, S subunit [Xylella fastidiosa Ann-1] Length = 457 Score = 206 bits (524), Expect = 5e-51, Method: Composition-based stats. Identities = 95/429 (22%), Positives = 166/429 (38%), Gaps = 20/429 (4%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFT--KLNTGRTSESGKDIIYIGLEDVESGTG--- 61 YP Y +SG+ WI +P+ W+V+ ++ G + + +V TG Sbjct: 7 YPTYCNSGLAWIPKLPEGWQVLRNGCLFGHRVEMG--------FPDLPILEVSLRTGVRV 58 Query: 62 -KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 S KG I Y + + +A DG+ S ++V++P Sbjct: 59 RDMENLKRKQVISQKEKYKRATKGDIAYNMMRMWQGAVGLAPVDGLVSPAYVVVKPYAEA 118 Query: 121 PELLQGW-LLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + Q + G + W+ +P +PPL EQ I + A+ Sbjct: 119 NSTYYSYLFRTAAYMQEVNKYSRGIVADRNRLYWESFKQMPSLVPPLPEQKQIVTYLRAQ 178 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 I I + I+LL E+K ++ + VT+GL+ V +K SGIEW+G VP HWEV+ Sbjct: 179 DAHIARFIKAKRDLIKLLTEQKLRIIDHAVTRGLDASVALKPSGIEWLGDVPVHWEVRRL 238 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 L + + T I R + + E T + +++F + Sbjct: 239 KFLASNTTSQTTTKARDEIYLAMEHVQSWTGVARPLEGEVEFASTVKRFVVDDVLFGKLR 298 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQS 356 K + A+ + + + I YL ++R + + + +G Sbjct: 299 PYLAK--VTRAKCNGVCVSEFLVLRSRKEFILPAYLEQMLRCKRVIDLINSSTAGAKMPR 356 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + + + VP Q I + I ET + + + E I L++E R I VT Sbjct: 357 ADWIFIGNVRLPVPCKDVQEAILSHIESETKDLGEAITRTEDEIKLIREYRDRLITDVVT 416 Query: 417 GQIDLRGES 425 GQ+D+RG Sbjct: 417 GQVDVRGWQ 425 >gi|260557402|ref|ZP_05829617.1| restriction endonuclease S subunit [Acinetobacter baumannii ATCC 19606] gi|260409028|gb|EEX02331.1| restriction endonuclease S subunit [Acinetobacter baumannii ATCC 19606] Length = 451 Score = 206 bits (524), Expect = 6e-51, Method: Composition-based stats. Identities = 101/436 (23%), Positives = 181/436 (41%), Gaps = 30/436 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQS- 73 G +P HW + +KR+ + G S I + D+++ L +S Sbjct: 2 GVVPSHWIITTLKRYCYVKGGFAFSSDAFIDTGYPVIRIGDIKTDGSINLENCKYIPESL 61 Query: 74 -DTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLL- 129 S + K Q+L G + KA + + + + + W + Sbjct: 62 AVNSRDYLVEKNQLLMAMTGATIGKAGLYTSNQPAFLNQRVGKFELLAQNMNYRYLWYIL 121 Query: 130 -SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + I+ G + + + P IP EQ I + ET +ID LI ++ Sbjct: 122 KTDGYQEYIKLTAFGGAQPNISDTAMVDYPATIPSFDEQTQIANFLDHETSKIDHLIEKQ 181 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT------ 242 + IELLKEK+QA++S+ VTKGL+P+V MKDSG+ W+G VP+HW++ P L+ Sbjct: 182 QKLIELLKEKRQAVISHAVTKGLDPNVPMKDSGVAWLGEVPEHWDITPIRNLIRSGNLIL 241 Query: 243 ------ELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIV 292 EL+ +E+ I L NI + + + G+++ Sbjct: 242 QDGNHGELHPTANDYVETGIPFLMANNIRNGNLFMEDVKRIPKHLADTLRIGFAKAGDML 301 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSA--YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + + + ++T Y + + Y + +S + +G Sbjct: 302 LTHKGTVGEVALVPQDIKEDYWMLTPQVTYYRWQGKKFLNKYFYYQFQSSSIQTQLEIIG 361 Query: 351 --SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 R + L V +PP EQ +I++ I + +++ K + +I L++ERR+ Sbjct: 362 AKQSTRAYVGLIAQGDLIVAIPPSHEQLEISSYILEKDQSYQLMIAKAQTAIQLMQERRT 421 Query: 409 SFIAAAVTGQIDLRGE 424 + I+AAVTG+ID+R Sbjct: 422 ALISAAVTGKIDVRHW 437 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 49/224 (21%), Positives = 93/224 (41%), Gaps = 21/224 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK-----LNTGRTSES--------GKDIIYIGLEDV 56 KDSGV W+G +P+HW + PI+ + L G E I ++ ++ Sbjct: 210 MKDSGVAWLGEVPEHWDITPIRNLIRSGNLILQDGNHGELHPTANDYVETGIPFLMANNI 269 Query: 57 ESGTGKYLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAI----IADFDGICSTQ- 110 +G + +DT + G +L G A+ I + + + Q Sbjct: 270 RNGNLFMEDVKRIPKHLADTLRIGFAKAGDMLLTHKGTVGEVALVPQDIKEDYWMLTPQV 329 Query: 111 -FLVLQPKDVLPELLQGWLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQV 168 + Q K L + S + ++E I + +T ++ G++ + IPP EQ+ Sbjct: 330 TYYRWQGKKFLNKYFYYQFQSSSIQTQLEIIGAKQSTRAYVGLIAQGDLIVAIPPSHEQL 389 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 I I+ + +I + I+L++E++ AL+S VT ++ Sbjct: 390 EISSYILEKDQSYQLMIAKAQTAIQLMQERRTALISAAVTGKID 433 >gi|206975754|ref|ZP_03236666.1| type I restriction-modification system specificity determinant [Bacillus cereus H3081.97] gi|206746216|gb|EDZ57611.1| type I restriction-modification system specificity determinant [Bacillus cereus H3081.97] Length = 434 Score = 205 bits (520), Expect = 2e-50, Method: Composition-based stats. Identities = 109/435 (25%), Positives = 190/435 (43%), Gaps = 22/435 (5%) Query: 7 YPQYKDSGVQWIGAIPKHWK----VVPIKRFTKLNTGRTSES-GKDIIYIGLEDVESGTG 61 YPQYK + ++W+ IP W+ + G+T E + I + ++++ G Sbjct: 3 YPQYKKTNLEWLENIPSEWEYGGLTKYLDSIVDY-RGKTPEKVEEGIFLVTAKNIKHGQI 61 Query: 62 KYLPKDGNSRQSDTSTVS---IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ--P 116 Y + + V + G +L+ P A + D + + + + Sbjct: 62 DYSLSQEFVKIEEYEEVMRRGLPEIGDVLFTTEAPLGEVANVDRIDIALAQRIIKFRGIE 121 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + L+ W+ S + G+T + + ++ + +P +EQ I + Sbjct: 122 NVLDNYYLKYWIQSHGFQSNLRTFATGSTAAGIKASKLSSLQVLLPSYSEQKQIVLFLDN 181 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 + ID LIT++ + I LL+EK+Q+++ VTKGLNP+VKMKDS +EW+G +P+ W +K Sbjct: 182 KVHEIDGLITQKEQMISLLEEKRQSMIIEAVTKGLNPNVKMKDSSVEWIGEIPESWNIKK 241 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 +LSL+ + K G ESYE YQ V+ + V + Sbjct: 242 IKYKFDIRKVIQPT-EAPTVLSLTQKGLKVKDLNDFSGQHAESYEKYQRVEIDDYVMNGM 300 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW--LMRSYDLCKVFYAMGSGL- 353 DL + G+ + Y + + + + + K+FY G G+ Sbjct: 301 DLLTGYVDCAKFE----GVTSPDYRVFRLRYPEECHDYYLRYFQMCYFAKIFYGHGQGVS 356 Query: 354 ---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 R L+ + K P+ PPI EQF I+ ++V+ I+ ++ I+ I LK+ R S Sbjct: 357 HLGRWRLQTDVFKGFPIPEPPIDEQFAISKYLSVKEIEINEAIDMIKVQIQNLKDYRQSL 416 Query: 411 IAAAVTGQIDLRGES 425 I AVTG+ID+R Sbjct: 417 IYEAVTGKIDVRDFE 431 >gi|149180787|ref|ZP_01859290.1| putative type I restriction enzyme specificity protein [Bacillus sp. SG-1] gi|148851577|gb|EDL65724.1| putative type I restriction enzyme specificity protein [Bacillus sp. SG-1] Length = 454 Score = 204 bits (519), Expect = 2e-50, Method: Composition-based stats. Identities = 98/441 (22%), Positives = 168/441 (38%), Gaps = 28/441 (6%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGN 69 + W +P+ W +K + G +S K + I D+++G + + Sbjct: 11 DINWYERVPEDWSEKKLKYLVETIKGYAFKSQLFGDKGVPIIKTTDIKNGKIQDSDIFID 70 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPY----------LRKAIIADFDGICSTQ--FLVLQPK 117 R K IL +G + K + + L + K Sbjct: 71 ERFEHEYKNVRVKKNDILMSTVGSKVEVTNSAVGQIGKVQKKYEGALLNQNAVILRCKSK 130 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 D+ L +L S + ++ G + K I + MP+P Q I E + Sbjct: 131 DITNNFLFYFLNSHSYRKYLDLFAHGTANQASLSLKDILDFKMPLPSRKIQHQISEFLDH 190 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T ++TLI ++ + IELL+EK+QA+V+ VT+GLNPDVKMKDSG++W+G +P+HW++ Sbjct: 191 KTSDVETLIADKQKLIELLEEKRQAIVTEAVTRGLNPDVKMKDSGVKWIGDIPEHWDISK 250 Query: 237 FFALVT-----ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY-----QIV 286 + L G + S E Y + Sbjct: 251 IKYSTYVKGRIGWQGLRSDEFIDEGPYLVTGTDFKDGIIHWDTCYHISEERYSEAPPIQL 310 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 +++ +++ + + ++ W++ S Sbjct: 311 KENDLLITKDGTIGKVAIVKNKPGKAILNSGIFVTRCQDKEYLTKFMYWILTSEVFKNYI 370 Query: 347 YAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 M +G + L E +P I+EQ I + + ID + ++I I LLKE Sbjct: 371 KYMETGSTIKHLYQETFVNFSYPLPNIEEQKAIEYFLETKVREIDSVKKEISDQIELLKE 430 Query: 406 RRSSFIAAAVTGQIDLRGESQ 426 R S I AVTG+IDLR + Sbjct: 431 YRQSLIYEAVTGKIDLRDYQE 451 >gi|120597918|ref|YP_962492.1| putative type I site-specific restriction-modification system, S subunit [Shewanella sp. W3-18-1] gi|120558011|gb|ABM23938.1| putative type I site-specific restriction-modification system, S subunit [Shewanella sp. W3-18-1] Length = 429 Score = 204 bits (519), Expect = 2e-50, Method: Composition-based stats. Identities = 101/434 (23%), Positives = 187/434 (43%), Gaps = 22/434 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 + Y YKDS WIG +P+HW + +K + + + + + G Sbjct: 4 IAEMPKYQTYKDSTEGWIGDVPEHWDIRKLKHLFYE------KKHRPNMSLNSGAISFGK 57 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQP 116 D S ++ G+ L L + +++ D + S ++V++ Sbjct: 58 V-VTKDDEKILLSTKASYQEVLSGEFLINPLNLNYDLISLRIALSEIDVVVSAGYIVIKE 116 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 K+ L + +LL ++ + G + I N + PPL EQ LI + Sbjct: 117 KEELQKQYFKYLLHRYDVAYMKLLGSGV-RQTISFNHIANSLLVFPPLEEQSLIANYLEK 175 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T ++D I + + I LLKE+KQ ++ VT+GL+P+V MKDSG++W+G VP HWEV+ Sbjct: 176 KTAQVDEAIAIKEQQISLLKERKQIIIQQAVTQGLDPNVPMKDSGVDWIGKVPAHWEVRR 235 Query: 237 ----FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 F + + +L + + + ++L + + + + V+ + V Sbjct: 236 SKFVFTQRKERAWKDDVQLSATQAYGVIPQDQYEELTGKRVVKIQFHLDKRKHVEKDDFV 295 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 Q + I +S + ID ++ +L++ S Sbjct: 296 ISMRSFQ----GGLERAWSQGCIRSSYVVLRALDEIDPSFYGYLLKLPSYIAALQQTASF 351 Query: 353 LR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +R Q L F++ ++ + +PPI+EQ +I N ++ D +E + I LKE ++S Sbjct: 352 IRDGQDLNFDNFSKVDLFIPPIEEQKEIANYVSAFMKSSDEGIELLFAQIEKLKEYKTSL 411 Query: 411 IAAAVTGQIDLRGE 424 I +AVTG+I + E Sbjct: 412 INSAVTGKIKITPE 425 >gi|255308175|ref|ZP_05352346.1| putative type I site-specific restriction-modification system, S subunit [Clostridium difficile ATCC 43255] Length = 455 Score = 203 bits (517), Expect = 3e-50, Method: Composition-based stats. Identities = 95/453 (20%), Positives = 183/453 (40%), Gaps = 37/453 (8%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 Y+ + KDSGV+WIG IPK W+V IK +L ++ E ++ + + ++ + Sbjct: 4 RYRDDEEMKDSGVEWIGKIPKDWEVKRIKHLFELKKDKSDEENPTVLSLTQKGLK---IR 60 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + S + K + + A+ +G+ S + + K+ + Sbjct: 61 DVSNNEGQLASTYVGYTKIEKNDFILNPMDLISGYTDKAEIEGVISPAYTTFRSKNKVNI 120 Query: 123 LLQGWLLSIDVTQRIEAICEG------ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + + + N+P+ L EQ I + Sbjct: 121 NHDYYKRYFQMHYHHNFLFPWGEGVSFEHRWTLKNEVFLNLPVITNRLEEQEKIANFLDE 180 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV-------------KMKDSGIE 223 +T + + +I+++ I+ L+E K++L+S +VT + +MKDSGIE Sbjct: 181 KTSQFEFIISKKEELIKKLEEAKKSLISEVVTGKVKVVKTDDGYKLVKRSSEEMKDSGIE 240 Query: 224 WVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKL---------ETR 271 W+G +P WEVK F + T L ES I ++YG I K + + Sbjct: 241 WLGEIPKDWEVKNFKYMFTLNKGLSITKADLKESGIPCVNYGEIHSKYRFELKPSKHKLK 300 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA--YMAVKPHGID 329 + + ++ G+ VF + + + +A ++ Sbjct: 301 YVDESYLKSNSISLLKYGDFVFCDTSEDIEGCGNFTYLNENNKVFAGYHTIIARTLEQVN 360 Query: 330 STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 Y+++L S D SG+ S+ +K V++P + EQ +I ++ + Sbjct: 361 YRYMSYLFDSNDWRIQIRTKVSGVKVFSITQSILKGTKVILPDLLEQRNIAQYLDSKCKG 420 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ID +++K + I LKE + S I+ AVTG+I++ Sbjct: 421 IDSIIDKTKLQIDKLKEAKQSLISEAVTGKIEI 453 >gi|57506069|ref|ZP_00371992.1| type I restriction-modification system specificity subunit, putative [Campylobacter upsaliensis RM3195] gi|57015677|gb|EAL52468.1| type I restriction-modification system specificity subunit, putative [Campylobacter upsaliensis RM3195] Length = 427 Score = 203 bits (516), Expect = 5e-50, Method: Composition-based stats. Identities = 86/423 (20%), Positives = 182/423 (43%), Gaps = 21/423 (4%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G IP HW+V +K ++ + + +++ + + + + + + Sbjct: 6 GKIPAHWEVRRLKYLFYISKEESRDEFPNVLSLTQNGIIE---RDITTNKGQLAQNYIGY 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT---- 134 +I +G I+ + + F+G+ S ++ ++P + L Sbjct: 63 NIVKRGDIILNPMDLSSGYVAKSTFEGVISQAYIKIRPLETLNLSYYENFFQNLYHYKIL 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + NI +P+PPL EQ I E + + +I I ++ + I L Sbjct: 123 WHLGKGISYDHRWTLGNDVFLNIKIPLPPLQEQKEIAEFLDKKCEKIQNYINKKQKLITL 182 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR-------- 246 L+EKKQAL++ +TKGLNP+++ K+SGIEW+G +P HWE+K + Sbjct: 183 LQEKKQALINEAITKGLNPNIEFKNSGIEWLGEIPKHWEIKKLKYIGEIFGGVIGKTIKD 242 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKP---ESYETYQIVDPGEIVFRFIDLQNDKR 303 + + + +++ N+ ++ + E V +I+F + Sbjct: 243 FSKEYKPNFKPYITFTNVCNNAIINPNSMEYVFIDFDEKQNKVLKNDILFLQSSETFEDV 302 Query: 304 SLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFE 360 + + + + + + YL +L+ S + F ++ SG R +L+ E Sbjct: 303 GKSAIYLNDDEVYLNTFCKGFRIEREAYPMYLNYLLSSLSYKRYFMSVCSGFTRINLRQE 362 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 +P+++PP++EQ +I ++ + +I+ +EK ++ I ++E +++ I AV G+I Sbjct: 363 HFLDIPLILPPLQEQKEIAEFLDEKCKKINSAIEKTKKQIEFVREYKNTLINEAVCGRIK 422 Query: 421 LRG 423 L+ Sbjct: 423 LKE 425 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 46/215 (21%), Positives = 89/215 (41%), Gaps = 15/215 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDI---------IYIGLEDVESG 59 ++K+SG++W+G IPKHW++ +K ++ G ++ KD YI +V + Sbjct: 204 EFKNSGIEWLGEIPKHWEIKKLKYIGEIFGGVIGKTIKDFSKEYKPNFKPYITFTNVCNN 263 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI-----IADFDGICSTQFL-V 113 + K IL+ + + D + +T Sbjct: 264 AIINPNSMEYVFIDFDEKQNKVLKNDILFLQSSETFEDVGKSAIYLNDDEVYLNTFCKGF 323 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 ++ P L L S+ + ++C G T + + +IP+ +PPL EQ I E Sbjct: 324 RIEREAYPMYLNYLLSSLSYKRYFMSVCSGFTRINLRQEHFLDIPLILPPLQEQKEIAEF 383 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + + +I++ I + + IE ++E K L++ V Sbjct: 384 LDEKCKKINSAIEKTKKQIEFVREYKNTLINEAVC 418 >gi|150005916|ref|YP_001300660.1| type I restriction-modification system S subunit [Bacteroides vulgatus ATCC 8482] gi|149934340|gb|ABR41038.1| type I restriction-modification system S subunit [Bacteroides vulgatus ATCC 8482] Length = 430 Score = 203 bits (515), Expect = 6e-50, Method: Composition-based stats. Identities = 92/430 (21%), Positives = 170/430 (39%), Gaps = 17/430 (3%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYL 64 Y YKDSG+QW+G IP HW++ + N S + + + +E + + Sbjct: 3 KYNSYKDSGIQWLGKIPSHWEIKRSRLIFDENVETNSTCNNTNQLQFRFGTIEPKKSQEM 62 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 D S +I G I+ ++ GI ++ ++ L+PK+ + Sbjct: 63 DSDLKKI---ISKYTIVQNGDIMINGLNLNYDFVSQRVAQVKEKGIITSAYIALRPKENI 119 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 +LL +++ +K N +PIPPL EQ + + T Sbjct: 120 CSDYFTYLLKGMDARKVFHGMGCGVRLTLSFKEFRNELLPIPPLEEQQSMATYLDKATAE 179 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID I ++ R I+LL E+KQ ++ VTKGL+ +V+MK+SG+ W+G +P HWE P + Sbjct: 180 IDKAIAQQQRMIDLLNERKQIIIQRAVTKGLDGNVEMKNSGLNWLGQIPSHWESLPLTYV 239 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMG---------LKPESYETYQIVDPGEI 291 N + + + + R G + ++ G Sbjct: 240 FEMRNGYTPSKNDPTYWTNGSIPWYRMEDIRKSGRFLREAMQYVTTKAINGKGTFKAGSY 299 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + ++ A + + + + S Sbjct: 300 IMAICTASIGEHAMLIADSLANQRFANFKIRKSLIESFYPLFLFYYMYVVGDFCRENSNS 359 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 Q + +KR P+ P ++EQ +I + + +I+ +E+I++ I LL+ER+ I Sbjct: 360 TCFQYVDMGALKRFPIPKPSMEEQKNIVSSLTQNLQQINTALERIQKQITLLQERKQIII 419 Query: 412 AAAVTGQIDL 421 + VTG+I + Sbjct: 420 SEVVTGKIKV 429 >gi|311694470|gb|ADP97343.1| type I site-specific restriction-modification system, S subunit [marine bacterium HP15] Length = 427 Score = 202 bits (514), Expect = 7e-50, Method: Composition-based stats. Identities = 106/430 (24%), Positives = 186/430 (43%), Gaps = 28/430 (6%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESG 59 Y YKDSG W+G IP +W K ++ G+ + Y+ +E + Sbjct: 9 KYEAYKDSGADWLGMIPINWTSKKFKYLARVKKGKVPKRIVSENRSGLPPYLSMEYLRGA 68 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 +D ++ + + G IL G + ++ + ST + Sbjct: 69 EANQFVEDRDAI--------VVSDGSILLLWDGSNAGEFVVGRGGVVSSTLAAIDFFSVD 120 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 W + + G + H D + + N + IP L EQ LI + + +T Sbjct: 121 ---RKFAWYACQVTEIELRSTTVGMGIPHVDGEQLKNSFLAIPSLDEQSLIAKFLDKKTT 177 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +ID I + + I LLKE+KQ ++ VT+GL+P V MK SG++W+G +P HWEV F Sbjct: 178 QIDEAIAIKEQQIVLLKERKQIIIQKAVTQGLDPTVPMKLSGVDWIGEIPKHWEVVRFKN 237 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDL 298 + +R ++ + + S G + + R G E YQ + G++V +D Sbjct: 238 -LFSQSRLPVRIGDGVVTSYRDGQVTLRTNRRLEGYTEAIIEGGYQGIRKGQLVLNSMDA 296 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + + G T Y+ P+ I Y A+L+R L K + + +RQ Sbjct: 297 FEGAIGVSDSD----GKCTPEYVICDPNRGGISQYYFAYLLREMALGKYIQVICNAVRQR 352 Query: 357 ---LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +++ ++ ++VPP EQ +I I E +I ++ ++ I LKE +++ I + Sbjct: 353 AVRIRYNNLAPRFMVVPPESEQEEIVKFIESEKVKIGDGIDHLQSQIEKLKEYKTTLINS 412 Query: 414 AVTGQIDLRG 423 AVTG+I + Sbjct: 413 AVTGKIKITP 422 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 49/208 (23%), Positives = 84/208 (40%), Gaps = 6/208 (2%) Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 L+ KDSG +W+G++P +W K F L K K I S S + + Sbjct: 5 HQLHKYEAYKDSGADWLGMIPINWTSKKFKYLARVKKGKVPKRIVSENRSGLPPYLSMEY 64 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + +V G I+ + V G+++S A+ + Sbjct: 65 LRGAEANQFVEDRDAIVVSDGSILLLWDGSN-----AGEFVVGRGGVVSSTLAAIDFFSV 119 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 D + + + ++ +G G+ + E +K + +P + EQ I ++ +T + Sbjct: 120 DRKFAWYACQVTEIELRSTTVGMGI-PHVDGEQLKNSFLAIPSLDEQSLIAKFLDKKTTQ 178 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ID + EQ IVLLKER+ I AVT Sbjct: 179 IDEAIAIKEQQIVLLKERKQIIIQKAVT 206 >gi|148653129|ref|YP_001280222.1| restriction modification system DNA specificity subunit [Psychrobacter sp. PRwf-1] gi|148572213|gb|ABQ94272.1| restriction modification system DNA specificity domain [Psychrobacter sp. PRwf-1] Length = 431 Score = 202 bits (514), Expect = 9e-50, Method: Composition-based stats. Identities = 116/431 (26%), Positives = 194/431 (45%), Gaps = 19/431 (4%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTG 61 YKAYP+YK SGV+W+G IP+HW ++ K K L + D + + + V + Sbjct: 11 RYKAYPEYKGSGVEWLGEIPRHWGLLRGKWRFKSLKEVNRNLQCMDRLALTMRGVIERSI 70 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK---AIIADFDGICSTQFLVLQP-K 117 D + S + IF K +++ + K + GI S ++ L+P K Sbjct: 71 D---SDDGLQPSAFTGYQIFEKDDLVFKLIDLENYKTSRVGLVFKKGIMSPAYIRLKPNK 127 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 +L + + + + I S + + I + +P EQ I + + E Sbjct: 128 GMLSKFFYYFYFDLYLRGIYNQIGGQGVRSALNASDLLEIEICVPSREEQAEIADFLDYE 187 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 T +IDTLI ++ R IELL EK+QA +S+ VTKGLNPDV MKDSG+EW+G VP HWEVK Sbjct: 188 TAKIDTLIKKQQRLIELLTEKRQATISHAVTKGLNPDVPMKDSGVEWLGEVPAHWEVKDI 247 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + LN + + + Y I D ++ Sbjct: 248 KFQLKSLNSRRIPINSQDRGDREGIYRYYGASGVI------DYIDDYIFDEPTVLVGEDG 301 Query: 298 LQNDKRSLR-SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 RS + + + + ++ + + A ++ D+ + + Sbjct: 302 ANLLSRSTPLAFSAHGKYWVNNHAHILEAKDGLADFWAEVIDIIDVTPLV---TGSAQPK 358 Query: 357 LKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 L E + L + PP IKE+ +I + I+ + L+ +++I L++ERR++ I+AAV Sbjct: 359 LTAEALSNLKIAFPPTIKERKEIESFIHSSKYKYGELIGYAKKAIQLMQERRTALISAAV 418 Query: 416 TGQIDLRGESQ 426 TG+ID+R + Sbjct: 419 TGKIDVRDWVK 429 >gi|209523387|ref|ZP_03271942.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] gi|209496129|gb|EDZ96429.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] Length = 415 Score = 200 bits (508), Expect = 4e-49, Method: Composition-based stats. Identities = 108/408 (26%), Positives = 187/408 (45%), Gaps = 18/408 (4%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P WK +K G + + ++++ + + N + + S Sbjct: 17 PLGWKKSYVKYLGNYINGYPFKPDNWSFQGKPILRIQNLSNPNADF-----NRYEGEISE 71 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + KG IL L D + ++ L L+ + + Sbjct: 72 AYLVHKGDILISWS-ASLGVYKWLGEDAWLNQHIFKVEINTKLVFEEYFVWLASWFIKEL 130 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E G+TM H W GN P+ +PP+ EQ I + ET +ID LI + R +ELL E Sbjct: 131 EHKAHGSTMQHLTWNAFGNFPVLLPPMPEQKAIAHYLDKETAKIDQLIEAKKRLLELLDE 190 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K++AL+++ VT+GLNPDV M+DSG+EW+G +P HW+V+ L E++ ++T E + Sbjct: 191 KRRALITHAVTRGLNPDVPMRDSGVEWIGEIPKHWKVEFAKWLFKEIDDRSTTGQEELLT 250 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 I + E K ES E Y++ G+++ + + + GI++ Sbjct: 251 VSHITGITPRSEKDVNMFKAESMEGYKVCQSGDLIINTLWAWMGAMGVS----FQPGIVS 306 Query: 318 SAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIK 373 +Y +P YL +L+R + G+ R L E+ ++ + VPP++ Sbjct: 307 PSYHVYRPQGEYHPVYLDYLVRIPIFAEEAIRYSKGVWISRLRLYPEEFFQILLPVPPLE 366 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ+ I + +T ++D L ++++ LL+ERR+S I AAVTGQ+ + Sbjct: 367 EQYKIGKYLMEKTKKLDNLSIATKKTMDLLQERRTSLITAAVTGQLKI 414 Score = 96.4 bits (238), Expect = 9e-18, Method: Composition-based stats. Identities = 45/205 (21%), Positives = 91/205 (44%), Gaps = 5/205 (2%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 +DSGV+WIG IPKHWKV K K R++ +++ + + + T + Sbjct: 210 MRDSGVEWIGEIPKHWKVEFAKWLFKEIDDRSTTGQEEL--LTVSHITGITPRSEKDVNM 267 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + G ++ L ++ ++ GI S + V +P+ + +L+ Sbjct: 268 FKAESMEGYKVCQSGDLIINTLWAWMGAMGVSFQPGIVSPSYHVYRPQGEYHPVYLDYLV 327 Query: 130 SIDVT---QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 I + + + + I +P+PPL EQ I + ++ +T ++D L Sbjct: 328 RIPIFAEEAIRYSKGVWISRLRLYPEEFFQILLPVPPLEEQYKIGKYLMEKTKKLDNLSI 387 Query: 187 ERIRFIELLKEKKQALVSYIVTKGL 211 + ++LL+E++ +L++ VT L Sbjct: 388 ATKKTMDLLQERRTSLITAAVTGQL 412 >gi|298529187|ref|ZP_07016590.1| putative type I restriction enzyme, S subunit [Desulfonatronospira thiodismutans ASO3-1] gi|298510623|gb|EFI34526.1| putative type I restriction enzyme, S subunit [Desulfonatronospira thiodismutans ASO3-1] Length = 460 Score = 198 bits (503), Expect = 2e-48, Method: Composition-based stats. Identities = 88/432 (20%), Positives = 171/432 (39%), Gaps = 15/432 (3%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 + K Y +YKDSG+ W IP HWKV K + R+ ++ + + E Sbjct: 2 IDELKPYAEYKDSGLPWASKIPTHWKVRRAKNLFRCIDVRSKTGTEERLTVSAE--RGVV 59 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + K ++ L + R +A G+ S+ + V + + Sbjct: 60 PRSSMKVTMFEAKSYIGHKRCWPDDLVINSLWAWGRGLGVARHHGLVSSAYGVYRLRPEF 119 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKG-----IGNIPMPIPPLAEQVLIREKII 175 E + + + P+ +P + E I I Sbjct: 120 DEYAPFIHHLVRSKVYHWELRTRSKGVWISRLQLTDDAFLRAPILVPSVEEGKAITRFIR 179 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 +++ I R R I++L E+KQA+++ VT+GL+P+V +K SG++W+G +P HWE Sbjct: 180 DIDRKVNAFIRNRRRLIKVLNEQKQAIINRAVTRGLDPNVPLKPSGVDWLGNIPKHWEKN 239 Query: 236 PFFALVTELNRKNT--KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 LV +N + + + E + ++ + ++ S V G+++F Sbjct: 240 RLKFLVRNVNEQTSTRQPDEVYVALEHVEGWTGRITLPSEDIEFGSQVKRFHV--GDVLF 297 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + K + V + + K + +L +RS + + G Sbjct: 298 GKLRPYLAK--VTRPSVKGVCVGEFLVLRRKNEALLPEFLEQELRSKLFIDIINSATFGA 355 Query: 354 -RQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 ++ + L ++ PP EQ +I + +TA + ++K + + L++E R+ I Sbjct: 356 KMPRADWDFIGNLLIVYPPTHAEQLEILSDTGKQTASLQAAIDKANREVSLIQEYRTRLI 415 Query: 412 AAAVTGQIDLRG 423 A VTG++D+R Sbjct: 416 ADVVTGKVDVRN 427 >gi|332664152|ref|YP_004446940.1| restriction modification system DNA specificity domain-containing protein [Haliscomenobacter hydrossis DSM 1100] gi|332332966|gb|AEE50067.1| restriction modification system DNA specificity domain protein [Haliscomenobacter hydrossis DSM 1100] Length = 417 Score = 197 bits (501), Expect = 2e-48, Method: Composition-based stats. Identities = 115/412 (27%), Positives = 192/412 (46%), Gaps = 17/412 (4%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL-------PKDGNSRQSDTSTVSI 80 + +K L R YIGLE +ES +G+ L ++ S ++ Sbjct: 7 IKLKHSVSLRKERVEGLENSRPYIGLEHIESSSGRLLISPLENGDLPDEMAEAGESLCNL 66 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F G +L+GKL PYL KA +A+F G C+T+ +VL PK + P L+ L ++ I Sbjct: 67 FEPGDVLFGKLRPYLAKAWVANFSGRCTTELIVLIPKLIDPYYLKYNFLEKELLDAITGS 126 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ M ADW IG+ + PP+ Q I + ET RID LI+ + R I LL EK+Q Sbjct: 127 SFGSKMPRADWGFIGDQYIFFPPIDIQRRIASYLDRETTRIDGLISAKERLITLLAEKRQ 186 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR--KNTKLIESNILS 258 AL++ VT+G + ++KMK +G+EW+G VPD W L + + Sbjct: 187 ALITQAVTRGFDQEIKMKHAGVEWIGEVPDGWMEIRVKYLGDIFYGLSQPPGYHADGLPL 246 Query: 259 LSYGNIIQKLETR----NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + N+ + + + I+ G+I+ +L + + E Sbjct: 247 VRATNVYRGEIRKEGLVFVNEDDLPESKKVILKTGDIIIVRSGAYTADSALVT-EEWEGA 305 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPP- 371 + + ++ +LA+++ S + ++ + L E++ V++PP Sbjct: 306 VAGFDMVFKPNKRVNPNFLAYVLLSPYVLESQLIPMSVRAAQPHLNAEELGSTIVVLPPS 365 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + EQF I + + +++D L +SI LLKERR + I+AAVTGQI++ Sbjct: 366 VDEQFAIIQCLEKKISKLDALRVANTKSIELLKERRKALISAAVTGQIEITD 417 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 89/212 (41%), Gaps = 9/212 (4%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLP 65 + K +GV+WIG +P W + +K + G + G + + +V G + Sbjct: 202 KMKHAGVEWIGEVPDGWMEIRVKYLGDIFYGLSQPPGYHADGLPLVRATNVYRGEIRKEG 261 Query: 66 KDGNSRQS-DTSTVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPE 122 + S I G I+ + G Y + + +++G + +V +P + Sbjct: 262 LVFVNEDDLPESKKVILKTGDIIIVRSGAYTADSALVTEEWEGAVAGFDMVFKPNKRVNP 321 Query: 123 LLQGWLLSIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETV 179 ++L + I H + + +G+ + +PP + EQ I + + + Sbjct: 322 NFLAYVLLSPYVLESQLIPMSVRAAQPHLNAEELGSTIVVLPPSVDEQFAIIQCLEKKIS 381 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 ++D L + IELLKE+++AL+S VT + Sbjct: 382 KLDALRVANTKSIELLKERRKALISAAVTGQI 413 >gi|217979675|ref|YP_002363822.1| hypothetical protein Msil_3571 [Methylocella silvestris BL2] gi|217505051|gb|ACK52460.1| conserved hypothetical protein [Methylocella silvestris BL2] Length = 458 Score = 197 bits (500), Expect = 3e-48, Method: Composition-based stats. Identities = 88/426 (20%), Positives = 170/426 (39%), Gaps = 19/426 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + Y +G+ W+G +P HW V IK + R+ + ++ + + G ++ Sbjct: 6 RPYADTNPTGLPWLGDVPAHWNVRRIKTLLREVDSRSKTGEERLLSLRMR---QGLVDHI 62 Query: 65 PKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 G I GQ++ ++ +A+ G+ S + V +P Sbjct: 63 DAGGKLIPPESLVNFKIVEPGQVVMNRMRAAAGLFGVANVRGLVSPDYAVFEPLPEAFNP 122 Query: 124 LQGWLLSID-----VTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + + G + G IP+P PPL EQ LI + Sbjct: 123 YLLQAFRLPSLSAVFRAESKGLGTGESGFLRLYTDRFGPIPVPYPPLDEQRLIVRFLDWH 182 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 + LI + + I LL E+KQA++ VT+GL+P+V++K SGI W+G +P+ WEV Sbjct: 183 GAQTAKLIRAKKKIIALLNEQKQAIIHRAVTRGLDPNVRLKPSGIPWLGDIPEDWEVSRV 242 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 LN + L + ++ + + E + D ++ Sbjct: 243 KTEFQCLNYRRVPLSGTERGRMTVRQYDYYGASGVIDKVDE-----FLFDDKLLLIAEDG 297 Query: 298 LQNDKRSLRSAQVME-RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 R+L A + E + + + +KP D +LA ++ + + Sbjct: 298 ANLVLRNLPLAIIAEGKFWVNNHAHILKPRRGDIRFLAAILEGLNFLPWI---SGAAQPK 354 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L + + + + VPP +Q +I + E + + + + ++ ++E R+ IA VT Sbjct: 355 LTQDRLMGIAIAVPPGHKQLEIIQSCDEEVSELVRAINVASKELIFIQEFRTRLIADVVT 414 Query: 417 GQIDLR 422 G++D+R Sbjct: 415 GKLDVR 420 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 52/217 (23%), Positives = 93/217 (42%), Gaps = 11/217 (5%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + GL P +G+ W+G VP HW V+ L+ E++ ++ E + ++ Sbjct: 1 MIDGLRPYADTNPTGLPWLGDVPAHWNVRRIKTLLREVDSRSKTGEERLLSLRMRQGLVD 60 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 ++ + PES ++IV+PG++V + + + RG+++ Y +P Sbjct: 61 HIDAGGKLIPPESLVNFKIVEPGQVVMNRMRAAAGLFGVANV----RGLVSPDYAVFEPL 116 Query: 327 GI-DSTYLAWLMRSYDLCKVFYAMGSG------LRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + YL R L VF A G L + +PV PP+ EQ I Sbjct: 117 PEAFNPYLLQAFRLPSLSAVFRAESKGLGTGESGFLRLYTDRFGPIPVPYPPLDEQRLIV 176 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ A+ L+ ++ I LL E++ + I AVT Sbjct: 177 RFLDWHGAQTAKLIRAKKKIIALLNEQKQAIIHRAVT 213 >gi|237809016|ref|YP_002893456.1| restriction modification system DNA specificity domain-containing protein [Tolumonas auensis DSM 9187] gi|237501277|gb|ACQ93870.1| restriction modification system DNA specificity domain protein [Tolumonas auensis DSM 9187] Length = 421 Score = 196 bits (499), Expect = 4e-48, Method: Composition-based stats. Identities = 102/426 (23%), Positives = 176/426 (41%), Gaps = 22/426 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 Y YKDSGV W+G IP+ W +K ++ GR + D Y Sbjct: 8 PKYEAYKDSGVDWLGEIPEEWSTRKVKYLFRIGRGRVISQQEL-------DDNGCYPVYS 60 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + N + F QI + G + C+ LQP + L Sbjct: 61 SQTQNDGILGYISTFDFDCEQITWTTDGANAGTVFLRKGKHNCTNVCGTLQPINKQKISL 120 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + ++ + + + + + I + PPL Q I + +T +ID Sbjct: 121 EFLKNALSIAAQFYKRPDTNG-AKIMNGEMAEIFVTFPPLEAQTAIANFLDEKTAKIDEA 179 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I + + IELLKE+KQ ++ VT+GL+P V MKDSG+EW+G +P HWEV+ + + Sbjct: 180 IAIKEKQIELLKERKQIIIQQAVTQGLDPTVPMKDSGVEWIGKIPAHWEVRRSRFVFCQR 239 Query: 245 NRKNTKLIESNILSLSYGNIIQKLET----RNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + + +YG I Q+ R + + + V+ + V Q Sbjct: 240 KERARSNDVQLSATQAYGVIPQEQYEEMVGRKVVKISFHLDKRKHVEINDFVISMRSFQG 299 Query: 301 DKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAMGSGLR--Q 355 + G I S+Y+ ++P ID+ + ++L++ S +R Q Sbjct: 300 -----GLERAWASGCIRSSYVVLRPVNSEEIDAGFFSYLLKLPSYINALQMTASFIRDGQ 354 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 L F++ ++ + +PPI EQ I + I + I+ E I +E +++ I +AV Sbjct: 355 DLNFDNFSQVDLFIPPIDEQRAIFSAIQSKVDEINKATEIFIGQITKYQEYKTTLINSAV 414 Query: 416 TGQIDL 421 TG+I + Sbjct: 415 TGKIKV 420 >gi|150399018|ref|YP_001322785.1| restriction modification system DNA specificity subunit [Methanococcus vannielii SB] gi|150011721|gb|ABR54173.1| restriction modification system DNA specificity domain [Methanococcus vannielii SB] Length = 407 Score = 196 bits (498), Expect = 6e-48, Method: Composition-based stats. Identities = 92/419 (21%), Positives = 172/419 (41%), Gaps = 22/419 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 KDSG++WIG IP W V+ K ++TG + G P Sbjct: 4 AMKDSGIEWIGDIPADWNVIKTKHLCDISTGNQDT------------INRVDGGDYPFFI 51 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 S+ + F +L G + D + + + Sbjct: 52 RSKNVERINTYSFDGEAVLTAGDGDVGKIFHYIDGKFDYHQRVYKFSDFRSVIGRYFYYY 111 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 +S ++ + + T+ + P+ + + EQ I + + + +ID++I + Sbjct: 112 ISSNLIRELGKYNAKTTVESLRLPWLKEFPVIVSKIEEQQQIAQYLDDKVGQIDSIIEKT 171 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 IE K+ KQ++++ VTKGL+P V MKDSG+EW+G +P+HW++ +L+ E+N +N Sbjct: 172 KSSIEEYKKYKQSIMTETVTKGLDPTVMMKDSGVEWIGDIPEHWDMVKIKSLLYEINERN 231 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + + E G K + Y+IV G+++ + + Sbjct: 232 VDENAVLLSLFTALGVAPRSEMEEKGNKAVTVINYKIVKRGDLIVNKLLAWMGAIAFSDY 291 Query: 309 QVMERGIITSAYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDV 362 + G+ + Y + H + + W R Y G G+ R Sbjct: 292 E----GVTSPDYDVYRFHENAEALTEFYEWYFRFTKFKDDCYKFGRGIMMMRWRTYPAQF 347 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 K + V+ PP++EQ I + + + A ID L++K ++ I L+ + S I VTG+ ++ Sbjct: 348 KNIYVVNPPLEEQKQIIDYLKQKIADIDQLIDKKQRLITELESYKKSLIYEVVTGKKEI 406 >gi|226949372|ref|YP_002804463.1| restriction modification system DNA specificity domain protein [Clostridium botulinum A2 str. Kyoto] gi|226840941|gb|ACO83607.1| restriction modification system DNA specificity domain protein [Clostridium botulinum A2 str. Kyoto] Length = 450 Score = 196 bits (497), Expect = 6e-48, Method: Composition-based stats. Identities = 101/447 (22%), Positives = 186/447 (41%), Gaps = 35/447 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-GKDIIYIGLEDVESGTGKYLPKD 67 + KDSGV+WIG + WK +P+K K + S K+ + + + + + + Sbjct: 4 KMKDSGVEWIGYMNTCWKTMPLKFILKERRQKNSPIITKERLSLSIGVGVTLYSEKT-TN 62 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + + D + + ++ + + I+++ G S + V+ + + + + Sbjct: 63 LDRFKDDVTQYKVAYPNDLVINSMNVIVGAEGISNYLGCVSPAYYVMCSSNPQKFITKYY 122 Query: 128 LLSIDVTQRIEAICE---------------GATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 + +A+ +G + P+P + EQ I E Sbjct: 123 DYCFKTSTIQKALFYLGKGIMAIDRGEGRVNTCRLKVSSYDLGRLEFPVPSVNEQHRIVE 182 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHW 232 + +ID I + + IE LKE KQ++++ VTKGLNPDVKMKDSG+EW+G +P HW Sbjct: 183 FLDNRCNKIDQTIQKEKQVIEKLKEYKQSVITEAVTKGLNPDVKMKDSGVEWIGEIPKHW 242 Query: 233 EVKPFFALVTELNR---KNTKLIESNILSLSYGN---------IIQKLETRNMGLKPESY 280 +V+ + + L+E I +SYG I R +G + Sbjct: 243 KVEKLKHIFSFKKGLSITKDNLVEEGIKVISYGQIHSKSNIGVCINDSLIRYVGEEYLET 302 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-----STYLAW 335 +V + +F + I + + D Y A+ Sbjct: 303 GKQSLVLRNDFIFADTSEDLEGAGNYVYVGKNEEIFAGYHTIILTPIKDDIMSEWKYFAY 362 Query: 336 LMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L ++ + SG++ S+ + +K+ V+VP IKEQ +IT+ ++ + + ID L+ Sbjct: 363 LYKTDCWRSQIRSRVSGIKLFSITQKILKQTEVIVPDIKEQKEITDYLDKKCSSIDKLIS 422 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDL 421 E+ I L E + S I VTG+ ++ Sbjct: 423 DKEKVIKKLTEYKKSLIYECVTGKKEV 449 >gi|120612012|ref|YP_971690.1| restriction modification system DNA specificity subunit [Acidovorax citrulli AAC00-1] gi|120590476|gb|ABM33916.1| restriction modification system DNA specificity domain protein [Acidovorax citrulli AAC00-1] Length = 429 Score = 196 bits (497), Expect = 7e-48, Method: Composition-based stats. Identities = 128/413 (30%), Positives = 205/413 (49%), Gaps = 15/413 (3%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST---- 77 P+ W++ +K L R S Y+GLE++ES TG+ + + Sbjct: 10 PEVWRLARLKFVAPLRNERMSAGSDHPGYLGLENIESWTGRIIEVESKRDDEPADQSAGL 69 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQR 136 +IF +G +L+ KL PYL KA A DG+ ST+ LV++P ++L P L +L+ D Sbjct: 70 ANIFREGDVLFCKLRPYLAKACHAPRDGVGSTELLVMRPSELLEPRFLLYSILTPDFVGA 129 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++A GA M A+W IG++ + +PPL EQ LI + ET ID LI E+ R + LL+ Sbjct: 130 VDASTFGAKMPRANWDFIGSLEVKVPPLEEQRLIANYLDRETAGIDGLIAEKERMLALLE 189 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT-------ELNRKNT 249 EK+ AL+S +VT+GL+P+ +K SG EW+G +P HW ++ L Sbjct: 190 EKRAALISRVVTRGLDPNAPLKPSGQEWLGEIPVHWGLQRLKQLAEVRGGLTLGKQYSGE 249 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSA 308 L + + + KL+ P S ++ G+++ D+ R Sbjct: 250 LLEYPYLRVANVQDGYLKLDDVLTVEVPASEAASNLLVYGDVLMNEGGDIDKLGRGCVWR 309 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLP 366 + + + AV+PH +DS +LA + + F + S S+ ++K LP Sbjct: 310 DEISPCLHQNHVFAVRPHSVDSDWLALWTSTIQAKRYFESRAKRSTNLASISGSNIKELP 369 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 V +PP+ EQ I N + V +R++ L ++ S+ LL ERR++ I A VTGQI Sbjct: 370 VPLPPVSEQLAIQNFLAVRHSRLETLRGELRDSLRLLIERRAALITAGVTGQI 422 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 82/212 (38%), Gaps = 11/212 (5%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLP 65 K SG +W+G IP HW + +K+ ++ G T + Y+ + +V+ G K Sbjct: 211 KPSGQEWLGEIPVHWGLQRLKQLAEVRGGLTLGKQYSGELLEYPYLRVANVQDGYLKLDD 270 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPY---LRKAIIADFDGIC---STQFLVLQPKDV 119 + + ++ G +L + G R + D C + F V Sbjct: 271 VLTVEVPASEAASNLLVYGDVLMNEGGDIDKLGRGCVWRDEISPCLHQNHVFAVRPHSVD 330 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L ++ I +P+P+PP++EQ+ I+ + Sbjct: 331 SDWLALWTSTIQAKRYFESRAKRSTNLASISGSNIKELPVPLPPVSEQLAIQNFLAVRHS 390 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 R++TL E + LL E++ AL++ VT + Sbjct: 391 RLETLRGELRDSLRLLIERRAALITAGVTGQI 422 >gi|91215847|ref|ZP_01252816.1| putative type I restriction enzyme (specificity subunit) [Psychroflexus torquis ATCC 700755] gi|91185824|gb|EAS72198.1| putative type I restriction enzyme (specificity subunit) [Psychroflexus torquis ATCC 700755] Length = 426 Score = 196 bits (497), Expect = 8e-48, Method: Composition-based stats. Identities = 86/431 (19%), Positives = 159/431 (36%), Gaps = 23/431 (5%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE---------SGKDIIYIGLEDV 56 Y YKDSG++W+G IP HW+V +K L G+ + + ++ DV Sbjct: 3 KYDTYKDSGIEWLGEIPVHWEVKRVKEIFNLVRGKFTHRPRNDQRMYNNGTFPFLQTGDV 62 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + L ++ F KG ++ + + + FD + Sbjct: 63 AKSSKYVLQYKQVLNENGIKVSRQFKKGTLVMT-IAANIGDVALLGFDAYFPDSLVAFNT 121 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 K + + L ++ + T + + + + ++ PPL+EQ +I + Sbjct: 122 KHNIN---FYYYLLSVTKSELDTVKITNTQDNLNLERLNSLLKICPPLSEQTIIANYLDK 178 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T ID I + + KE +++L++ VT GL+ + K ++ +G + K Sbjct: 179 KTTAIDQKINLLTKKTDKYKELRKSLINQTVTDGLDKNTIWKTYRLKDIGQIYSGLSGKN 238 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 E + N I + N + E V ++ F Sbjct: 239 GDDFKKEKDPNNRGF----IPFTNIANNTYLDVEHLSKVIISPTENQNKVQKNDLFFLMS 294 Query: 297 DLQNDKRSLRSA--QVMERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAMGSG 352 + + + + + S + + +L+ S D G G Sbjct: 295 SEGYEDIGKSAVLKEDIPETYLNSFCKGFRITNTNVDAFFINYLLLSDDNRNKMVIQGKG 354 Query: 353 -LRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 R +LK E V + +P EQ I N ++ +T+ ID +V IE+ I LKE R + Sbjct: 355 FTRINLKIEKVNNFSITIPSTKAEQTAIANYLDEKTSTIDAIVSNIERQINHLKELRKTV 414 Query: 411 IAAAVTGQIDL 421 I VTG+I + Sbjct: 415 INDVVTGKIKV 425 >gi|297619043|ref|YP_003707148.1| restriction modification system DNA specificity subunit [Methanococcus voltae A3] gi|297378020|gb|ADI36175.1| restriction modification system DNA specificity subunit [Methanococcus voltae A3] Length = 440 Score = 194 bits (493), Expect = 2e-47, Method: Composition-based stats. Identities = 88/437 (20%), Positives = 175/437 (40%), Gaps = 22/437 (5%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 Y+ + KDSGV+WIG IPK W ++ K K+++ + L V + Sbjct: 5 KYRKAEELKDSGVEWIGQIPKDWDIIKGKNIFYNKKVNNRGILKNVLSLTLNGVID---R 61 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKL---GPYLRKAIIADFDGICSTQFLVLQPKDV 119 + + D F K +++ + + I G+ S ++ + K Sbjct: 62 DPMSNEGLQPKDFKGYQEFEKNNLVFKLIDLENINTSRVGITHKSGLMSPAYIRIINKYQ 121 Query: 120 LP-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + + S + + + S + + N+ + EQ I + +T Sbjct: 122 ICVKYYYYTYYSYYLKKIYNNLGNSGVRSAMNSCDLLNLEVLQTFEKEQEKIANFLDIKT 181 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLN---------PDVKMKDSGIEWVGLVP 229 I+ +I+++ + I L+E K++L+S +VT ++KDSG+EW+G +P Sbjct: 182 EEIENIISKKEKLINKLEEAKKSLISEVVTGKFKIIDVKLIKREKEELKDSGVEWIGQIP 241 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + W+VK V+ + K + L + +N Sbjct: 242 NDWDVKKLKYEVSLRSIKGEYTKNLKYIGLEHIESSTGKYIKNSEELNIE-GICNKFKKN 300 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+F + K + + G+ +S + + I++ +L +++ + + Sbjct: 301 DILFGKLRPYLAKCIIANFD----GVCSSELLVLNTQRINNIFLKYVILNSKFINYINSS 356 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G ++ V + + P ++EQ I+N ++ +T ID L+ + I LKE + Sbjct: 357 TYGAKMPRTNWDFVGNIKIPHPNMQEQETISNFLDTKTEEIDNLINNTKLQIEKLKEAKQ 416 Query: 409 SFIAAAVTGQIDLRGES 425 S I+ AVTG+IDLR Sbjct: 417 SLISEAVTGKIDLREWE 433 >gi|229520170|ref|ZP_04409597.1| restriction modification system DNA specificity domain [Vibrio cholerae TM 11079-80] gi|229342764|gb|EEO07755.1| restriction modification system DNA specificity domain [Vibrio cholerae TM 11079-80] Length = 434 Score = 194 bits (492), Expect = 3e-47, Method: Composition-based stats. Identities = 104/421 (24%), Positives = 184/421 (43%), Gaps = 26/421 (6%) Query: 25 WKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W +VP KR + + + + ++ V + + L + SD S IF K Sbjct: 16 WNLVPAKRLFTSSKEINQGMKESNRLALTMKGVINRSLDDLQ---GLQSSDYSVYQIFEK 72 Query: 84 GQILYGKL---GPYLRKAIIADFDGICSTQFLVL--QPKDVLPELLQGWLLSIDVTQRIE 138 +++ + + I GI S ++ + + P + ++ +T Sbjct: 73 DDLVFKLIDLENIKTSRVGIVHERGIMSPAYIRVSACSNSIYPRFYYWYFFALYLTNIYN 132 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G + + IP+P+ ++ Q + + ET RID+LI E+ FI LLKEK Sbjct: 133 KL-GGGVRQNLTAGDLLEIPVPLIDISLQKQVSAFLDRETQRIDSLIEEKQTFITLLKEK 191 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 +QAL+S++VTKGLNP+V+M+DSGIEW+G VP HW VK V + + + ES + Sbjct: 192 RQALISHVVTKGLNPNVEMQDSGIEWIGQVPKHWVVKKIKYDVLGIEQGWSPQCESTPVP 251 Query: 259 LSYGNIIQKLETRNMGLKPESYETY----------QIVDPGEIVFRFIDLQNDKRSLRSA 308 + + K+ N G+ + G+++ + + S Sbjct: 252 DDHTWGVVKVGCVNRGIFNPEQNKKLPEELEPRKEYAIKKGDLLVSRANAKEWVGSAAVP 311 Query: 309 QVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDV 362 ++ + D + A+ + S + +G ++ + Sbjct: 312 DRDYDNLLLCDKIYRIKLDLEKADPEFFAYYLASDQAREQIEIDATGTSSSMLNIGQGTI 371 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +P+ P + EQ I I +T++ID L+ ++ SI LLKE R+S I+AAVTG+ID+R Sbjct: 372 LNMPIPAPELPEQQSIVRGIKNKTSQIDRLMLEVLDSIELLKEHRTSLISAAVTGKIDVR 431 Query: 423 G 423 Sbjct: 432 E 432 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 56/221 (25%), Positives = 91/221 (41%), Gaps = 17/221 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIK-RFTKLNTGRTS-------ESGKDIIYIGLEDVESGT 60 + +DSG++WIG +PKHW V IK + G + + + V G Sbjct: 209 EMQDSGIEWIGQVPKHWVVKKIKYDVLGIEQGWSPQCESTPVPDDHTWGVVKVGCVNRGI 268 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDG----ICSTQF-LV 113 + + KG +L + ++ A + D D +C + + Sbjct: 269 FNPEQNKKLPEELEPRKEYAIKKGDLLVSRANAKEWVGSAAVPDRDYDNLLLCDKIYRIK 328 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPI--PPLAEQVLIR 171 L + PE +L S ++IE G + S + + MPI P L EQ I Sbjct: 329 LDLEKADPEFFAYYLASDQAREQIEIDATGTSSSMLNIGQGTILNMPIPAPELPEQQSIV 388 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 I +T +ID L+ E + IELLKE + +L+S VT ++ Sbjct: 389 RGIKNKTSQIDRLMLEVLDSIELLKEHRTSLISAAVTGKID 429 >gi|260578144|ref|ZP_05846064.1| restriction modification system DNA specificity domain protein [Corynebacterium jeikeium ATCC 43734] gi|258603683|gb|EEW16940.1| restriction modification system DNA specificity domain protein [Corynebacterium jeikeium ATCC 43734] Length = 383 Score = 193 bits (490), Expect = 4e-47, Method: Composition-based stats. Identities = 104/382 (27%), Positives = 180/382 (47%), Gaps = 11/382 (2%) Query: 48 IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC 107 I+ I ++ ++ + + +G + + FDG+ Sbjct: 1 ILSITQSGIKPKNI---LQNEGQMARNYDGYQVVNQGDFAMNSMDLLTGWVDQSPFDGLT 57 Query: 108 STQFLVLQPKD--VLPELLQGWLLSIDVTQRIEAIC----EGATMSHADWKGIGNIPMPI 161 S + V + ++ + ++ + ++ I N P+P+ Sbjct: 58 SPDYRVFRARNLEFINGRYFLYVFQLLYSRHIYYKFGQGVSNMGRWRLPADVFLNFPLPV 117 Query: 162 PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG 221 PP EQ I + +T ID LI + ELL+ ++ L++ VT+GL+PD M+DSG Sbjct: 118 PPRLEQAEISNYLDEKTAEIDGLIGKLGHQAELLERYRRELIARTVTRGLDPDAPMRDSG 177 Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-PESY 280 I+W G +P W +PF AL + + N+ L N L G I+ K K E+ Sbjct: 178 IDWAGDMPKTWRTQPFVALFSVEKKINSDLRIRNALQFRNGEIVVKPGWYPEDRKLDETL 237 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRS 339 TY++V PG IV ++L D R+ R V + G+ITSAY+ + + A +L++S Sbjct: 238 ATYKVVTPGMIVINGLNLNYDFRTKRIGLVTQNGVITSAYITLSANLGIDERFASYLLKS 297 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 D +F+ M G+R+ L + D++R + VPP++EQ +I + + ++A ID +E I++ Sbjct: 298 MDSRLLFHGMAEGVRKILSWADIRREKIPVPPLREQTEIADFLEEKSAEIDTTIEGIKRQ 357 Query: 400 IVLLKERRSSFIAAAVTGQIDL 421 I LL + R I AVTG+I + Sbjct: 358 IELLGKYRKQVINDAVTGKIRV 379 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 82/207 (39%), Gaps = 7/207 (3%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDG 68 +DSG+ W G +PK W+ P + S+ ++ + ++ G Y Sbjct: 173 MRDSGIDWAGDMPKTWRTQPFVALFSVEKKINSDLRIRNALQFRNGEIVVKPGWYPEDR- 231 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPELL 124 +T + G I+ L ++ + +G+ ++ ++ L + E Sbjct: 232 -KLDETLATYKVVTPGMIVINGLNLNYDFRTKRIGLVTQNGVITSAYITLSANLGIDERF 290 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +LL ++ + W I +P+PPL EQ I + + ++ IDT Sbjct: 291 ASYLLKSMDSRLLFHGMAEGVRKILSWADIRREKIPVPPLREQTEIADFLEEKSAEIDTT 350 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGL 211 I R IELL + ++ +++ VT + Sbjct: 351 IEGIKRQIELLGKYRKQVINDAVTGKI 377 >gi|154508213|ref|ZP_02043855.1| hypothetical protein ACTODO_00707 [Actinomyces odontolyticus ATCC 17982] gi|153797847|gb|EDN80267.1| hypothetical protein ACTODO_00707 [Actinomyces odontolyticus ATCC 17982] Length = 385 Score = 193 bits (489), Expect = 6e-47, Method: Composition-based stats. Identities = 123/379 (32%), Positives = 192/379 (50%), Gaps = 8/379 (2%) Query: 51 IGLEDVESGTGKYLPKDGN---SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC 107 + + +Y+ K G+ + D + + G + + + +++ G Sbjct: 6 VSQQYGVIPQSEYVKKTGSHVVVVEKDFTILKAVYPGDFVI-HMRSFQGGLELSEVKGCT 64 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH---ADWKGIGNIPMPIPPL 164 S+ +++L P + + E + W +P+P PP Sbjct: 65 SSAYVMLIPGPQIHSARYYRWVFKCDGYINELRSTSNLVRDGQAMRWANFIQVPIPFPPP 124 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 Q I + + ET RI+ L I+ L K++++ VTKGL+P+ M DS I+W Sbjct: 125 EVQDSIAKYLDRETERIEELKDSIRAQIDALDSYKRSVILDAVTKGLDPNRDMVDSKIDW 184 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 + +P +W V P E KN E+N+LSLSYG II+K GL P ++ Y Sbjct: 185 IDRLPRNWNVAPLRHFFHEHKAKNLFRQETNLLSLSYGRIIRKDIGTVDGLLPSNFNGYN 244 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLC 343 IV PG+IV R DLQND++SLR+ V ERGI+TSAY+A++ H DSTY +L +YD+C Sbjct: 245 IVGPGDIVLRLTDLQNDQKSLRTGLVNERGIVTSAYIALRKHRELDSTYFHYLFHTYDIC 304 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 +VFY MGSG+RQ L F ++ RLP++ PP+ EQ I +N E +ID + K + + LL Sbjct: 305 RVFYNMGSGVRQGLTFSELSRLPLVAPPLDEQRRIGRFLNEEITKIDEVQRKKRKQLDLL 364 Query: 404 KERRSSFIAAAVTGQIDLR 422 + S I VTG+ ++ Sbjct: 365 DAYKKSLIYEVVTGKREVP 383 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 78/201 (38%), Gaps = 8/201 (3%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNS 70 DS + WI +P++W V P++ F + + +++ + + + Sbjct: 179 DSKIDWIDRLPRNWNVAPLRHFFHEHKAKNLFRQETNLLSLSYGRIIRKDIGTVD---GL 235 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDVLPELLQG 126 S+ + +I G I+ + + + GI ++ ++ L+ L Sbjct: 236 LPSNFNGYNIVGPGDIVLRLTDLQNDQKSLRTGLVNERGIVTSAYIALRKHRELDSTYFH 295 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L R+ + + +P+ PPL EQ I + E +ID + Sbjct: 296 YLFHTYDICRVFYNMGSGVRQGLTFSELSRLPLVAPPLDEQRRIGRFLNEEITKIDEVQR 355 Query: 187 ERIRFIELLKEKKQALVSYIV 207 ++ + ++LL K++L+ +V Sbjct: 356 KKRKQLDLLDAYKKSLIYEVV 376 >gi|332885123|gb|EGK05375.1| hypothetical protein HMPREF9456_02874 [Dysgonomonas mossii DSM 22836] Length = 452 Score = 192 bits (487), Expect = 1e-46, Method: Composition-based stats. Identities = 105/451 (23%), Positives = 187/451 (41%), Gaps = 32/451 (7%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDII---------YIGLED 55 K Y YK S + ++ IP HW+ + ++ L G T +S D +I + Sbjct: 2 KKYDSYKLSHIDFLDHIPSHWQEIRMRFLGYLYGGLTGKSADDFNQIGNIENKAFIPFTN 61 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIAD--FDGICST 109 + + + + K +D + KG + + + A++ D D ++ Sbjct: 62 IANNSKIDISKLQEVIITDGEKQNKAQKGDLFFLMSSENYEDVGKSAVLCDDVEDMYLNS 121 Query: 110 QF--LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + K++ E L L S ++ + G T + + ++ + IP EQ Sbjct: 122 FCKGFRVVAKNINSEFLNYQLSSSEIRHNLLTEANGFTRINLKIDKVNDLIVAIPTEHEQ 181 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + +T ID LI ++ R IEL +E+K A+++ VTKG++ +VKM+DSGIEW+G Sbjct: 182 TAIASFLDRKTAEIDQLIADKKRLIELYEEEKAAIINQAVTKGIDSNVKMQDSGIEWLGE 241 Query: 228 VPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLET---------RNMGL 275 +P HWEV+ F +L + L + I +SYG I K + + Sbjct: 242 IPGHWEVRRFNSLFSFSRGLTITKENLQDEGIPCISYGEIHSKYSFEVNPEKDILKCVDK 301 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA- 334 +++ G+ VF + E + + + + Sbjct: 302 NYLISSEKSLLNHGDFVFADTSEDIKGSGNFTYLNSETRAFAGYHTIIANPIENFMHRYV 361 Query: 335 -WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S +G S+ +K +L+PPI EQ I I+ E +RI+ Sbjct: 362 AYFFDSLSFRNQIRCKVTGTKVYSITQSILKCTFILLPPIHEQNSIVQYIDAECSRINSK 421 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +EK ++ I LL E R++ I+ VTG+I + Sbjct: 422 IEKTKKLIDLLTEYRTTLISEIVTGKIKVTD 452 >gi|293401124|ref|ZP_06645268.1| type I restriction-modification system specificity determinant [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291305250|gb|EFE46495.1| type I restriction-modification system specificity determinant [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 464 Score = 191 bits (485), Expect = 2e-46, Method: Composition-based stats. Identities = 88/434 (20%), Positives = 168/434 (38%), Gaps = 20/434 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 Y +YK + W+ IP HW V K+ ++ + I + L + + + Sbjct: 3 RYEEYKKIDLPWLNEIPAHWDVYRNKQIFTEMKDEVGKNSSNYILLSLT-LNGVIPRDVK 61 Query: 66 KDGNSRQSDTSTVSIFAKGQILYG--KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + I K + + + R +A G+ + + +++ ++ P Sbjct: 62 SGKGKFPASFDKYKIVEKDNLAFCLFDMDETPRTVGLAKCSGMLTGAYTIMKVSNINPRY 121 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + LS+D + + + G + I MP+P EQ I + + RI++ Sbjct: 122 AYYYYLSLDNVKGMRPLYTGL-RKTINVGTFLGIKMPVPTEEEQEQIVRFLDWQLSRINS 180 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF---FAL 240 +I + + IELL+EK+Q ++ V + + + W +P+ W++ F F+ Sbjct: 181 IIKIKRKEIELLQEKRQQIIDAKVLTS-SRTKVTRAAEGGWNVNIPEGWDILKFNGVFSF 239 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLE---------TRNMGLKPESYETYQIVDPGEI 291 LN L E I +SYG + K R + +V PG+ Sbjct: 240 GKGLNITKANLEEEGIPVISYGQVHSKNNPGTKIDDSLIRFVNESYLETSPNSLVYPGDF 299 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +F + E + + +A G + YL++L +S Sbjct: 300 IFADTSEDFEGVGNCVFVDREGPLFAGYHTVIARPKDGNGNRYLSYLFKSSTWRYQLRKN 359 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +G+ S+ + +K V +PP+ EQ +I ++ ID L+ I + I LLK Sbjct: 360 VNGVKVFSITQKVLKNAYVFLPPLDEQREIVEFLDEHCEGIDSLITDIAKEIDLLKAYEM 419 Query: 409 SFIAAAVTGQIDLR 422 I+ TG++D+R Sbjct: 420 RLISDVSTGKVDVR 433 >gi|313892700|ref|ZP_07826281.1| conserved hypothetical protein [Veillonella sp. oral taxon 158 str. F0412] gi|313442631|gb|EFR61042.1| conserved hypothetical protein [Veillonella sp. oral taxon 158 str. F0412] Length = 470 Score = 191 bits (485), Expect = 2e-46, Method: Composition-based stats. Identities = 83/441 (18%), Positives = 168/441 (38%), Gaps = 27/441 (6%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-IIYIGLEDVESGTGKY 63 K Y YK +W+G IP HW + IKR + R + D I+ + + + Sbjct: 2 KKYESYKPMKEKWLGDIPSHWDALRIKRIFQERKERNNPVTTDFILSLTAKQGVVPVAEK 61 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 GN + D S +I + +L + A ++ + G S + L P+D Sbjct: 62 EGVGGNKPKDDLSKYNICRENDLLVNCMNVVSGSAGVSKWVGAISPVYYALYPRDEEACN 121 Query: 124 LQGWLLSIDVTQRIEAICE---------------GATMSHADWKGIGNIPMPIPPLAEQV 168 + + + ++ + N+ +P+PP EQ Sbjct: 122 IWYYHQIFRLITFQRSLLGLGKGILMHESSTGKLNTVRMRISMDYLNNVVLPLPPRDEQD 181 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + + +I+ +I+ + + I + E V+ VT G+ + ++K+SGI W+G + Sbjct: 182 QIVRYLDWQISKINKMISNKRKQISRINEHLVFAVNEAVTHGI-RNEQLKESGIFWMGKI 240 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P +W L E N +N + + +I + + Y++V P Sbjct: 241 PVNWNPIKIKWLFDETNERNIECEAELLTFSRKRGLIPFSDASDKEPSASDLSNYRLVSP 300 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH---GIDSTYLAWLMRSYDLCKV 345 G+++ + + + + G ++ Y P ++ + ++ R+ + Sbjct: 301 GQLLENRMQAWSGMFICVTRE----GCVSPDYSVFNPSKDRYVNVKFYEYVFRNPLQVEQ 356 Query: 346 FYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F G+ L + + PP +EQ I ++ + + IE I + Sbjct: 357 FANASRGVGSGFNRLYTPSFGAIYTVYPPKEEQDAIVEYLDGLKDKYKSATDVIESEIEV 416 Query: 403 LKERRSSFIAAAVTGQIDLRG 423 L E + ++ AV+G+ID+R Sbjct: 417 LHEIKDRLVSDAVSGKIDVRN 437 >gi|303242151|ref|ZP_07328641.1| restriction modification system DNA specificity domain protein [Acetivibrio cellulolyticus CD2] gi|302590338|gb|EFL60096.1| restriction modification system DNA specificity domain protein [Acetivibrio cellulolyticus CD2] Length = 638 Score = 191 bits (485), Expect = 2e-46, Method: Composition-based stats. Identities = 98/433 (22%), Positives = 176/433 (40%), Gaps = 24/433 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESG 59 KDSG++WIG IP+ W+ + IK + + G + + ++ + DV Sbjct: 4 AMKDSGIEWIGEIPQEWETIKIKYLSPVLRGASPRPIDNPIYFNENGEYVWTRIADVSKC 63 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + TS ++ + I I Sbjct: 64 NRYFEKYYEYMSDLGTSKSIKIEPNSLIVSICATVGKPIITKVKCCIHDGFVYFPLLDPK 123 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + L + + + T + + + +G+I +PI +E I + + + Sbjct: 124 YNDFLYYIFNNG---SCFAGLGKLGTQLNLNTETVGSISIPIIDDSELKSIIKYLDEKCS 180 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 ID ++ I+ K+ KQ++++ VTKGLNP V+MKDSGIEW +P HW+V Sbjct: 181 EIDNIVENTKASIDEYKKYKQSVITEAVTKGLNPSVEMKDSGIEWNRHIPLHWKVVNGRR 240 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFRF 295 L K S YG + Q LE + + + ++ + V+P + V Sbjct: 241 LFELRKDKAMPEDRQLTASQKYGIMYQDEFMQLENQRVVTVQKDFDILKHVEPNDFVISM 300 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGLR 354 Q RG I+SAY+ + P+ Y WL +S + + +R Sbjct: 301 RSFQG-----GLEYSQLRGCISSAYVMLIPNEKVYCPYFRWLFKSVKYINALQSTSNLVR 355 Query: 355 --QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 Q+++F + ++P+ + PI EQ I + +N + A ID ++ K +Q + L+ + S I Sbjct: 356 DGQAMRFSNFVQIPLFLIPIDEQKRIADYLNAKCAEIDNIISKKQQIVTELENYKKSLIY 415 Query: 413 AAVTGQIDLRGES 425 VTG+ ++ E Sbjct: 416 ECVTGKRSVQTEE 428 >gi|149927743|ref|ZP_01915995.1| hypothetical protein LMED105_16098 [Limnobacter sp. MED105] gi|149823569|gb|EDM82799.1| hypothetical protein LMED105_16098 [Limnobacter sp. MED105] Length = 428 Score = 191 bits (484), Expect = 2e-46, Method: Composition-based stats. Identities = 106/433 (24%), Positives = 169/433 (39%), Gaps = 29/433 (6%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKL-NTGRTSE----SGKDIIYIGLEDVESGT 60 YP+YK SG G IP HW+ ++ + G + DI I + D + + Sbjct: 3 RYPEYKSSGSPVFGDIPSHWEKKRLRDCIECCVNGIWGDEPDGGEDDIPVIRVADFDRPS 62 Query: 61 GKYLPKDGNSRQSDTSTV-SIFAKGQILYGKLG-----PYLRKAIIADFDG-ICSTQFLV 113 K + + T V G +L K G P +G +CS Sbjct: 63 RKVEKFETVRKVEKTQRVGRALYNGDMLIEKSGGGEQQPVGMVVSYQGPEGAVCSNFVAK 122 Query: 114 LQPKDVLPELLQGWLLSIDVTQ--RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 + PK+ + +L S +I + + + D + + +P L EQV I Sbjct: 123 MTPKENIASRFLVYLHSHLYASGVTNISIKQTTGIQNLDSTAYLSESIYVPSLGEQVAIA 182 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV-GLVPD 230 + + ET RIDTLI E+ I LL E KQ++ ++TKGL+ ++ K S +EW+ G Sbjct: 183 QYLDIETARIDTLIYEKEALIGLLDEWKQSVTEQVLTKGLSANIDFKTSDVEWLQGAEIP 242 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 V + + T E+ S Y K E I G Sbjct: 243 SQWVTKSIKHIAHMRSGETITSENIDDSGKYPVYGGNGLRGFTTEKTHDGEYLLIGRQGA 302 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + E A + IDS + +++R DL + A Sbjct: 303 LCGN-----------VHHVKGEFWATEHAVVVTLNTDIDSRWAFYMLRFMDLGQYSLA-- 349 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + L E + L + VP EQ I++ ++ E AR++ L+ + I LL+E R++ Sbjct: 350 -AAQPGLSVEKIVGLKLPVPSCHEQKAISDHLDKEMARLEDLINHTTKEIELLRELRAAT 408 Query: 411 IAAAVTGQIDLRG 423 IA AV G+ID+R Sbjct: 409 IADAVLGRIDVRD 421 >gi|310826741|ref|YP_003959098.1| hypothetical protein ELI_1147 [Eubacterium limosum KIST612] gi|308738475|gb|ADO36135.1| hypothetical protein ELI_1147 [Eubacterium limosum KIST612] Length = 415 Score = 190 bits (483), Expect = 3e-46, Method: Composition-based stats. Identities = 89/426 (20%), Positives = 169/426 (39%), Gaps = 22/426 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESG 59 M Y + KDSG++WIG++P HW+V + + + K+++ + ++ Sbjct: 6 MSEITTYEKTKDSGIEWIGSVPSHWRVHTLYQLVTQVKEKNGNLQEKNLLSLSYGKIKRK 65 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQ 115 + +I G I+ + +A GI ++ + L+ Sbjct: 66 DIDSPD---GLLPASFDGYNIIEDGDIVLRLTDLQNDHTSLRVGLATERGIITSAYTTLR 122 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 P D +LL ++ ++ + + + +P E+ I + + Sbjct: 123 PIDTSNSKYLFYLLHAFDLKKGFYGMGSGVRQGLNYAEVKELRIVLPRQDEKDTIVQFLD 182 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 + +IDTLI E + K+ + ++V VTKGLNP +MKDS I+W+G +P HW + Sbjct: 183 EQCAQIDTLIEEAKLSVAEYKKWRASIVFEAVTKGLNPLAEMKDSHIDWIGQMPTHWGIL 242 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 L + KN L+ I + R K Y +V + Sbjct: 243 SLKYLCSMQAGKN--LVSDQIDEAGEYPVYGGNGIRGYYSKYNYEGEYLLVGRQGALCG- 299 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 N + E ++T G++ ++L +L+ +L + A S + Sbjct: 300 ----NVHKIKGCFWATEHAVVTKNV-----EGVELSFLYYLLNGMNLNRY--ASNSAAQP 348 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 L ++ + + PPI+EQ +I+ +N ID ++E + I L+ + S I V Sbjct: 349 GLSVNTIQNIKTVFPPIEEQIEISTYLNDICHSIDSIIENKQSLIFELESYKKSLIFETV 408 Query: 416 TGQIDL 421 TG+ + Sbjct: 409 TGKRKV 414 >gi|86738913|ref|YP_479313.1| type I restriction-modification system specificity determinant [Frankia sp. CcI3] gi|86565775|gb|ABD09584.1| type I restriction-modification system specificity determinant [Frankia sp. CcI3] Length = 416 Score = 190 bits (483), Expect = 3e-46, Method: Composition-based stats. Identities = 91/419 (21%), Positives = 163/419 (38%), Gaps = 21/419 (5%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DSGV W+G +P HW P+ + + + V + + + N Sbjct: 10 DSGVSWLGKVPPHWTTKPLWSMFERIKDVDHPEEQMLSVFREYGVVAKDSR---DNINKT 66 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + S + G ++ ++ + I+ GI S ++ P+ WLL Sbjct: 67 AENRSIYQLVHPGWLVANRMKAWQGSVGISSLRGIVSGHYICFAPRHSEDARYLNWLLRS 126 Query: 132 DVTQRIEAICEG---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 A+ + D +P+ +PPL EQ I + + ET RIDTLI E+ Sbjct: 127 TTYTNGYALLSRGVRIGQAEIDNDEFRLMPILLPPLGEQRAIADYLDRETARIDTLIEEQ 186 Query: 189 IRFIELLKEKKQALVSYIVTKGLNP---DVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 R IE+L+E+++A+ + + + ++ K+ S G P + Sbjct: 187 QRLIEMLRERRRAVALHAIDQQIHAGATTDKLGRSTRIGNGSTPRRETASYWRDGEFPWL 246 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + + + +V PG ++ + Sbjct: 247 NSSAVNESRVTHA-----------DQFVTDIALYECHLPVVAPGSVLVGLTGQGKTRGMA 295 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQSLKFEDVKR 364 ++ AY+A YL W +R SYD + + L + +K+ Sbjct: 296 TLLEIEATVNQHVAYIAPDRGTWLPEYLLWSLRASYDDLRRLSEENGSTKGGLTCQALKQ 355 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + VPP+ EQ + ++ +TA+ID L+ + E+ I L +ERR + I AAVTGQ+D+RG Sbjct: 356 YRLAVPPLDEQRRVAAYLDEQTAKIDSLIGETERFIELARERRVALITAAVTGQVDVRG 414 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 56/198 (28%), Positives = 99/198 (50%), Gaps = 9/198 (4%) Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 + DSG+ W+G VP HW KP +++ E + E + ++ K N+ Sbjct: 7 DLVDSGVSWLGKVPPHWTTKPLWSMF-ERIKDVDHPEEQMLSVFREYGVVAKDSRDNINK 65 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLA 334 E+ YQ+V PG +V + + S RGI++ Y+ P D+ YL Sbjct: 66 TAENRSIYQLVHPGWLVANRMKAWQGSVGISSL----RGIVSGHYICFAPRHSEDARYLN 121 Query: 335 WLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 WL+RS + + G+ + + ++ + +P+L+PP+ EQ I + ++ ETARID Sbjct: 122 WLLRSTTYTNGYALLSRGVRIGQAEIDNDEFRLMPILLPPLGEQRAIADYLDRETARIDT 181 Query: 392 LVEKIEQSIVLLKERRSS 409 L+E+ ++ I +L+ERR + Sbjct: 182 LIEEQQRLIEMLRERRRA 199 >gi|269123432|ref|YP_003306009.1| restriction modification system DNA specificity domain-containing protein [Streptobacillus moniliformis DSM 12112] gi|268314758|gb|ACZ01132.1| restriction modification system DNA specificity domain protein [Streptobacillus moniliformis DSM 12112] Length = 473 Score = 190 bits (481), Expect = 5e-46, Method: Composition-based stats. Identities = 91/444 (20%), Positives = 167/444 (37%), Gaps = 28/444 (6%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYL 64 Y +Y +S + W AIP+HW V I R + + S K+I+ + + S Sbjct: 3 RYEKYSNSELTWSEAIPEHWGVKRIARVFDIRKEKNSPIKTKEILSLSAKHGVSLYSDKK 62 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 K GN + D ++ ++ G IL + I+++ G S + L + Sbjct: 63 EKGGNKPKEDLTSYNLCYLGDILINCMNVVAGSVGISNYFGAVSPVYYPLVNMNQDENGT 122 Query: 125 QGWLLSIDVTQRIEAICE---------------GATMSHADWKGIGNIPMPIPPLAEQVL 169 + ++ W + +PIPP+ EQ Sbjct: 123 RYMEYVFRNYNFQRSLVGLGKGIQMSEADDGRLYTVRMRISWDILKTQLLPIPPINEQEQ 182 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + + ID LI I+ LK +++ V KG+N + K S I+W+ +P Sbjct: 183 IANYLDWKINEIDRLIQIEKEKIKELKRLTLNIIAEFVLKGIN-TLNYKKSNIKWIDNIP 241 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK--------LETRNMGLKPESYE 281 HW V + ++ + Y + N + + Y+ Sbjct: 242 SHWNEISIRGCVNIIRGNSSFTKDDLKNQGEYVGLQYGKVYKTEIIDSEFNFYVNDKFYK 301 Query: 282 TYQIVDPGEIVFRFID-LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 T Q+V +I+ D + G+I + +KP ++ + + S Sbjct: 302 TSQVVTRNDIIIVSTSETVEDLGHTSFYDRHDIGLIGGEQILLKPLNNINSKFLFYL-SK 360 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +G+ K D+K+L + +PPI+EQ +I + I ++ ++D V+ Sbjct: 361 IFRTQLQLCATGIKVYRFKISDLKQLYIPLPPIEEQENIVSNIELKLKQLDERVKNNYNL 420 Query: 400 IVLLKERRSSFIAAAVTGQIDLRG 423 I L+ + S I+ VTG+ID+R Sbjct: 421 IKELELLKQSLISEVVTGKIDVRN 444 >gi|170731314|ref|YP_001776747.1| putative type I restriction enzyme, S subunit [Xylella fastidiosa M12] gi|167966107|gb|ACA13117.1| putative type I restriction enzyme, S subunit [Xylella fastidiosa M12] Length = 457 Score = 190 bits (481), Expect = 6e-46, Method: Composition-based stats. Identities = 88/425 (20%), Positives = 160/425 (37%), Gaps = 12/425 (2%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 Y Y+ +W+ +P+HW ++ K F + R+ + ++ + K Sbjct: 7 YSTYQPLRSRWVPRVPEHWSLLRAKNFLQEIDDRSKTGEETLLSMRKHCGLVPHNDVSIK 66 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 N + +++ ++ G+ S + V + G Sbjct: 67 RTNPKN--LIGYKKVQPDELVLNRMQAGNAMFFHNYLSGLVSPDYAVFRLLRDDNPEYLG 124 Query: 127 W---LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + I R E+ G W + +P+PPL EQ I + A+ V I Sbjct: 125 YLFRSWPICGLFRSESKGIGTGFLRLYWDRFAALEIPLPPLPEQDQIVAYLRAQDVHIAR 184 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 I + I LL E+K ++ + VT+GL+ V +K SGIEW+G VP +WEV+ L + Sbjct: 185 FIKAKRDLISLLIEQKLRIIDHAVTRGLDASVALKPSGIEWLGDVPVNWEVRRLKFLASN 244 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + T I R + + E T + +++F + K Sbjct: 245 TTSQTTTKARDEIYLAMEHVQSWTGVARPLEGEVEFASTVKRFVVDDVLFGKLRPYLAKV 304 Query: 304 SLRSAQVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + G+ S ++ + I YL ++R + + + +G + Sbjct: 305 TRAKC----NGVCVSEFLVLRSRKEFILPAYLEQMLRCKRVIDLINSSTAGAKMPRADWI 360 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 + + + VP Q I + I ET + + + E I L++E R I VTGQ+D Sbjct: 361 FIGNVRLPVPCKDVQEAILSHIESETKDLGEAITRTEDEIKLIREYRDRLITDVVTGQVD 420 Query: 421 LRGES 425 +RG Sbjct: 421 VRGWQ 425 >gi|283796106|ref|ZP_06345259.1| putative type I restriction enzyme specificity protein [Clostridium sp. M62/1] gi|291076320|gb|EFE13684.1| putative type I restriction enzyme specificity protein [Clostridium sp. M62/1] Length = 435 Score = 189 bits (480), Expect = 6e-46, Method: Composition-based stats. Identities = 95/423 (22%), Positives = 181/423 (42%), Gaps = 30/423 (7%) Query: 26 KVVPIKRFTK--LNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS---T 77 K +K + G + I ++ E V++G + K G SD Sbjct: 16 KKKKLKYIVSTPITDGPHETPELLDEGIPFLSAESVKNGILDFNYKRGYISLSDHKLFCK 75 Query: 78 VSIFAKGQILYGKLGPYLRKAII---ADFDGICST-QFLVLQPKDVLPELLQGWLLSIDV 133 K I K G I + I S + VL + + + L Sbjct: 76 KVRPQKNDIFIVKSGATTGNCGIVTTDEEFSIWSPLALIRCDNISVLQKFIYYYSLCYSF 135 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 T ++E T + +GN+ + +P EQ I + + E +ID++ + + I Sbjct: 136 THQVEQSWSYGTQQNIGMGVLGNLYVTLPSSNEQQSIVDYLDKECAQIDSIAADLEKQIA 195 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK------ 247 LL++ K++L++ VTKGL+ V MKDSG+EW+G +P+HW+V+P VT N Sbjct: 196 LLQQYKKSLITETVTKGLDKSVPMKDSGVEWIGKIPEHWDVEPIKYRVTFHNGDRGENYP 255 Query: 248 -NTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQ---IVDPGEIVFRFIDLQNDK 302 ++L I ++ G+ L NM E + PG+I++ Sbjct: 256 SKSELQSEGIPFINAGHLEGDGLNMDNMDYISEEKYRIMGGVKLRPGDILYCLRGSVGKN 315 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY--DLCKVFYAMGSGLRQSLKFE 360 + M +G + S+ +A++ I + YL + + S+ ++ + + G + +L + Sbjct: 316 AIV----DMNQGTVASSLVAIRSVRILAEYLYYCLNSHIEEVQRYLWDNG-TAQPNLSAD 370 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ + +PP++EQ I +N ++ID LV ++ + +++ + S I VTG+ Sbjct: 371 NLGKYKFCIPPVEEQKAIVKYLNNICSQIDNLVIGKKKQLSTIQQHKKSLIYEYVTGKKR 430 Query: 421 LRG 423 ++ Sbjct: 431 VKE 433 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 47/207 (22%), Positives = 82/207 (39%), Gaps = 9/207 (4%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTG--------RTSESGKDIIYIGLEDVESGTG 61 KDSGV+WIG IP+HW V PIK + G ++ + I +I +E Sbjct: 219 MKDSGVEWIGKIPEHWDVEPIKYRVTFHNGDRGENYPSKSELQSEGIPFINAGHLEGDGL 278 Query: 62 KYLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + + G ILY G + AI+ G ++ + ++ +L Sbjct: 279 NMDNMDYISEEKYRIMGGVKLRPGDILYCLRGSVGKNAIVDMNQGTVASSLVAIRSVRIL 338 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 E L L S + G + +G IPP+ EQ I + + + Sbjct: 339 AEYLYYCLNSHIEEVQRYLWDNGTAQPNLSADNLGKYKFCIPPVEEQKAIVKYLNNICSQ 398 Query: 181 IDTLITERIRFIELLKEKKQALVSYIV 207 ID L+ + + + +++ K++L+ V Sbjct: 399 IDNLVIGKKKQLSTIQQHKKSLIYEYV 425 >gi|323351172|ref|ZP_08086828.1| hypothetical protein HMPREF9398_0876 [Streptococcus sanguinis VMC66] gi|322122396|gb|EFX94107.1| hypothetical protein HMPREF9398_0876 [Streptococcus sanguinis VMC66] Length = 433 Score = 188 bits (478), Expect = 1e-45, Method: Composition-based stats. Identities = 85/433 (19%), Positives = 181/433 (41%), Gaps = 24/433 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTG 61 K+SG+ WIG IP+ W+V+ +K F GR + + D ++G Sbjct: 4 MKESGIDWIGQIPEEWEVIKVK-FFTYMKGRIGWQGLKADEFIDEGPYLVTGTDFKNGRV 62 Query: 62 KYLPKD-GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQP-- 116 + + ++ + + +G +L K G + A+I + ++ LVL+P Sbjct: 63 NWDTAYHISQKRYEQAPEIQLKQGDLLVTKDGTVGKLALIDELPDSASLNSHLLVLRPLF 122 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 L L S++ + + G+TM + +G +P + EQ I + Sbjct: 123 NRYENHFLYYVLSSLEFKNYFQKVSIGSTMDSLSQEKMGEFIFALPNINEQNSISRYLDK 182 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +T ++D + + I+ LK+ + +L+ VTKGL+ V +KDSGI+W+G VP+ W VK Sbjct: 183 KTAQLDKVKSLLEEQIQKLKDYRSSLIYETVTKGLDKTVPLKDSGIDWIGHVPEGWGVKA 242 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETR--------NMGLKPESYETYQIVDP 288 + E+ +T ++ I N IQ + + + + +++ + Sbjct: 243 IKYIFDEIGSGSTPKSDNEIFYDGDINWIQSGDLYQTDTVTSVSKTISYQGFKSTSALKI 302 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 + F + + + ++ + + + S + S + ++ Sbjct: 303 YQQPFVALAMYGASVGNVAVSYIDACVNQAVVAMLGSSEKVSFGKYAIEASKS--NLIFS 360 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G + ++ +K + P +EQ + + ++ +T +ID L++ + I + ++R Sbjct: 361 AQGGTQPNISQNLIKNWSIPQPKNEEQEQVVDFLDKKTVQIDKLIQIKNEQIKNINKQRQ 420 Query: 409 SFIAAAVTGQIDL 421 + I VTG+ + Sbjct: 421 TLIYDYVTGKRRV 433 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 83/214 (38%), Gaps = 11/214 (5%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 +MK+SGI+W+G +P+ WEV R + ++++ ++ + +N Sbjct: 1 MTRMKESGIDWIGQIPEEWEVIKVKFFTYMKGRIGWQGLKADEFIDEGPYLVTGTDFKNG 60 Query: 274 GLKPESYET----------YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + ++ + G+++ + + Sbjct: 61 RVNWDTAYHISQKRYEQAPEIQLKQGDLLVTKDGTVGKLALIDELPDSASLNSHLLVLRP 120 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + ++ +L +++ S + F + SL E + +P I EQ I+ + Sbjct: 121 LFNRYENHFLYYVLSSLEFKNYFQKVSIGSTMDSLSQEKMGEFIFALPNINEQNSISRYL 180 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + +TA++D + +E+ I LK+ RSS I VT Sbjct: 181 DKKTAQLDKVKSLLEEQIQKLKDYRSSLIYETVT 214 >gi|259502615|ref|ZP_05745517.1| conserved hypothetical protein [Lactobacillus antri DSM 16041] gi|259169430|gb|EEW53925.1| conserved hypothetical protein [Lactobacillus antri DSM 16041] Length = 422 Score = 188 bits (477), Expect = 1e-45, Method: Composition-based stats. Identities = 91/419 (21%), Positives = 181/419 (43%), Gaps = 12/419 (2%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 KDSG++W+G IP +W VPI ++ + + + + + ++ +S Sbjct: 7 KDSGIKWVGEIPDNWDSVPIYYVSQEVRKKNNNISQKVALKFTYGTIVRKKNFSIEEDSS 66 Query: 71 RQSDTSTVSIFAKGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + + I+ ++ F G ++ ++V++ K+ + E Sbjct: 67 LRKTIENYKVVKPKDIVINGLNLNFDFVTQRVGFVTFPGAITSAYIVIRAKNNINEKYLL 126 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +LL + + G ++ + I +P PP+ EQ I + + + +ID L++ Sbjct: 127 YLLKSYDSVKAFHNMGGGVRKILNFSILSKIKIPFPPMKEQKRITDFLDKKCGKIDKLLS 186 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + I+ LK+ + +L+ +VTKGL+ +V KDSGIEW+G +P+ W V ++T L+R Sbjct: 187 QINDEIDTLKKYQHSLIIRVVTKGLDSNVPTKDSGIEWIGTMPEKWNVVKGKFILTLLDR 246 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K E ++K + YQ VD G++V +D + Sbjct: 247 PTKKDDEVITCFRDGQVTLRKKRRTDGFTISTKEIGYQGVDVGDLVVHAMDGFAGAIGIS 306 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL---KFEDVK 363 ++ ++ V + YL + +R VF A+ G+R ++ + Sbjct: 307 DSRGKASPVLN-----VMDSSENKNYLKYYLRCCAYLGVFNALAKGIRVRTADTRWSTLA 361 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 L +P EQ DI++ ++ + A I L+ + + + LL + ++S I VTG+ + Sbjct: 362 NLKFPLPTKNEQKDISDYLDQKCAEIRALINEKNRQLDLLTKYKNSLIFEYVTGKKQVP 420 Score = 122 bits (307), Expect = 8e-26, Method: Composition-based stats. Identities = 69/206 (33%), Positives = 115/206 (55%), Gaps = 3/206 (1%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 KDSGI+WVG +PD+W+ P + + E+ +KN + + L +YG I++K Sbjct: 3 MQVNKDSGIKWVGEIPDNWDSVPIYYVSQEVRKKNNNISQKVALKFTYGTIVRKKNFSIE 62 Query: 274 GLKP--ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDS 330 ++ E Y++V P +IV ++L D + R V G ITSAY+ ++ + I+ Sbjct: 63 EDSSLRKTIENYKVVKPKDIVINGLNLNFDFVTQRVGFVTFPGAITSAYIVIRAKNNINE 122 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 YL +L++SYD K F+ MG G+R+ L F + ++ + PP+KEQ IT+ ++ + +ID Sbjct: 123 KYLLYLLKSYDSVKAFHNMGGGVRKILNFSILSKIKIPFPPMKEQKRITDFLDKKCGKID 182 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVT 416 L+ +I I LK+ + S I VT Sbjct: 183 KLLSQINDEIDTLKKYQHSLIIRVVT 208 >gi|302380080|ref|ZP_07268555.1| type I restriction modification DNA specificity domain protein [Finegoldia magna ACS-171-V-Col3] gi|302312100|gb|EFK94106.1| type I restriction modification DNA specificity domain protein [Finegoldia magna ACS-171-V-Col3] Length = 422 Score = 188 bits (477), Expect = 2e-45, Method: Composition-based stats. Identities = 78/421 (18%), Positives = 168/421 (39%), Gaps = 6/421 (1%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 + KDSG++WIG IP+ W++ KR++K G T E + + Sbjct: 4 KMKDSGIEWIGEIPEDWEISKFKRYSKSAMGNTILKTDLEENNNKETIPVYSATQEDVVF 63 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + TV I K ++ G + + + TQ + + + E + Sbjct: 64 GYIDENNVTV-ILKKNNLVIPARGNSIGFTKLVPYAKATCTQTTIFSRLNNINEKFVYYC 122 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 E + + + + N +PI + EQ I + + + + Sbjct: 123 SIAFKDSWFE--FDQTAIPQITVQQVENNNIPICSIEEQCKITNFLSNKLENVKNIKIII 180 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 IE L+ K+++++ VTKGL+ +V+MKDSGIEW+G +P HW++ + +++ Sbjct: 181 TNQIENLENYKKSVITEAVTKGLDKNVEMKDSGIEWIGEIPKHWDLIKLKFIAHSISKGI 240 Query: 249 TKLI--ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + E+ ++ + N+ E ++ +++ ++ Sbjct: 241 SPHYVEETLTPVVNQATFSKGFFDSNLKYCSEKPIGEGLLKMNDVLLATTGGGVLGKTYY 300 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + T S ++ +++ +YDL +A GS + L+ + + + Sbjct: 301 FEEKGKYLASTDVAYIRNKDKYISKFIYYILSVNYDLLNGIFAKGSTNQTHLQMDLLSNM 360 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + +P E I + ++V ID + ++ + L+E + S I VTG+ +++ Sbjct: 361 HIPLPNQNELSKIISKLDVVNTNIDDSIAIKQKQLDTLEEYKKSLIYEYVTGKKEVKDGE 420 Query: 426 Q 426 + Sbjct: 421 E 421 >gi|257791267|ref|YP_003181873.1| restriction modification system DNA specificity subunit [Eggerthella lenta DSM 2243] gi|257475164|gb|ACV55484.1| restriction modification system DNA specificity subunit [Eggerthella lenta DSM 2243] Length = 395 Score = 188 bits (476), Expect = 2e-45, Method: Composition-based stats. Identities = 99/415 (23%), Positives = 167/415 (40%), Gaps = 29/415 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 + KDSGV WIG +P +W++VPIK + G + DI G G+ G Sbjct: 3 ETKDSGVDWIGEVPVNWEIVPIKADVSIGHGSDPTTPGDIPVWGSG------GEPFKTCG 56 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + +L G+ G ++ T F L + Sbjct: 57 EHKNGPA----------VLLGRKGTLDCPQLVTGLYWNVDTAFDAKITSKKLSLKFFYYA 106 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + N +P PPLAEQ I + ID + +R Sbjct: 107 ATCVDIKP---YMTNTAKPSMTQFDWDNSRIPRPPLAEQRRIISYLDERCAAIDEDVAKR 163 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 I LKE K++L+++ VTKGL+P+ +MKDSG++W+G VP +W + + N K Sbjct: 164 RDVIGKLKEYKKSLIAHAVTKGLDPNTEMKDSGVDWIGEVPANWRLTKIGQVYDLRNTKV 223 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + G + Q + K ++++ ++V G+ V + + Sbjct: 224 SDCDYEPLSVTMQGIVPQ----LDSAAKTDAHDDRKLVMEGDFVINSRSDRRGSCGIARQ 279 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL---KFEDVKRL 365 +I + + ++ + WL + FY G G+ L K+ ++K + Sbjct: 280 D-GSVSLINTVLI--PREHMEPRFYDWLFHTTLFADEFYKNGHGIVDDLWTTKWAEMKGI 336 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ PP + Q + N ++ A ID + + EQ I L E R S I AVTG+ID Sbjct: 337 TIVEPPFETQITVANYLDERCAAIDEAIARQEQLIEKLGEYRKSVIHHAVTGKID 391 >gi|303235367|ref|ZP_07321984.1| type I restriction modification DNA specificity domain protein [Finegoldia magna BVS033A4] gi|302493488|gb|EFL53277.1| type I restriction modification DNA specificity domain protein [Finegoldia magna BVS033A4] Length = 426 Score = 187 bits (475), Expect = 3e-45, Method: Composition-based stats. Identities = 87/425 (20%), Positives = 168/425 (39%), Gaps = 10/425 (2%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 + KDSG++WIG IP+ W++ KR++K G T E + + Sbjct: 4 KMKDSGIEWIGEIPEDWEISKFKRYSKSAMGNTILKTDLEENNNKETIPVYSATQEDVVF 63 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + TV I K ++ G + + + TQ + + + E + Sbjct: 64 GYIDENNVTV-ILKKNNLVIPARGNSIGFTKLVPYAKATCTQTTIFSRLNNINEKFVYYC 122 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 E + + + + N +PI + EQ I + + + + Sbjct: 123 SIAFKDSWFE--FDQTAIPQITVQQVENNNIPICSIEEQCKITNFLSNKLENVKNIKIII 180 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 IE L+ K+++++ VTKGL+ +V+MKDSGIEW+G +P HWE+K L + R Sbjct: 181 TNQIENLENYKKSVITEAVTKGLDKNVEMKDSGIEWIGKIPKHWEIKNIKNLTLKSERGT 240 Query: 249 TKLIESNILSLSYGNIIQK-----LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + + N ++ K + + ++ G+++ + Sbjct: 241 SPSYIEDDTKSKVVNQATFSQGFFDKSNIKYSKIPTNNSRGLLKKGDVLIASTGGGVLGK 300 Query: 304 SLRSAQVMERGIITSA-YMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQSLKFED 361 + + E + +S L ++ +Y+L A GS + L+ E Sbjct: 301 THFFIEDGEYVADGHITILRTDSLEQNSKILYYIFSVNYELINGILAKGSTNQTELQSEW 360 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +K V PI+EQ I N ++ + ID + ++ + L+E + S I VTG+ ++ Sbjct: 361 LKSFKVPYAPIEEQIRIVNYLDEKCKLIDDSISLKKKQLETLEEYKKSLIYEYVTGKKEV 420 Query: 422 RGESQ 426 + + Sbjct: 421 KDGEE 425 >gi|329123045|ref|ZP_08251616.1| type I site-specific restriction-modification system, S subunit [Haemophilus aegyptius ATCC 11116] gi|327471976|gb|EGF17416.1| type I site-specific restriction-modification system, S subunit [Haemophilus aegyptius ATCC 11116] Length = 408 Score = 187 bits (474), Expect = 3e-45, Method: Composition-based stats. Identities = 102/422 (24%), Positives = 182/422 (43%), Gaps = 29/422 (6%) Query: 15 VQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK-DGNSRQS 73 + W+G +P HW++ +K+ + + L GK + K D ++ Sbjct: 1 MDWLGEVPSHWELKRLKQLFVEKKHK--------QSLSLNCGAISFGKVIEKSDDKVTEA 52 Query: 74 DTSTVSIFAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + KG+ L L + +++ D + S ++VL+ K ++ + +LL Sbjct: 53 TKRSYQEVLKGEFLINPLNLNYDLISLRIALSEIDVVVSAGYIVLKEKQIINKKYFSYLL 112 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 ++ + G ++ I + + IPPL+EQ I + + +T +ID + Sbjct: 113 HRYDVAYMKLLGSGV-RQTINYGHISDSILVIPPLSEQQKIAQFLDDKTAKIDQAVDLAE 171 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR--- 246 + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HWEV +V E + Sbjct: 172 KQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWEVVSMKRVVKEHSGNGF 231 Query: 247 -----KNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQ 299 N I +S N + + N + + + + IV IV I Sbjct: 232 PIDLQGNNGNIPFLKVSDFSENQDKYIFKWNNSVTNKVIKQKKWNIVPKNSIVTAKIGEA 291 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 K + + II + + ++ D + +L + D G SL Sbjct: 292 LRKNHRKILSI--DSIIDNNCLGIEIKKADVLFGYYLHCALDFD---LFTNPGAIPSLAM 346 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + + +++PP +EQ +I + + +TA+ID + I LKE +S I VTG++ Sbjct: 347 DKYRNQKIVLPPFQEQQEIADYLEQQTAKIDQAIALKTAHIEKLKEYKSVLINDVVTGKL 406 Query: 420 DL 421 + Sbjct: 407 QV 408 Score = 90.2 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 58/211 (27%), Positives = 90/211 (42%), Gaps = 14/211 (6%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLP 65 KDSGV+WIG +P+HW+VV +KR K ++G + +I ++ + D KY+ Sbjct: 200 KDSGVEWIGQVPEHWEVVSMKRVVKEHSGNGFPIDLQGNNGNIPFLKVSDFSENQDKYIF 259 Query: 66 KDGNSRQSDTSTVS---IFAKGQILYGKLGPYLRKAI--IADFDGICSTQFLVLQPKDVL 120 K NS + I K I+ K+G LRK I D I L ++ K Sbjct: 260 KWNNSVTNKVIKQKKWNIVPKNSIVTAKIGEALRKNHRKILSIDSIIDNNCLGIEIKKAD 319 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 ++D + + N + +PP EQ I + + +T + Sbjct: 320 VLFGYYLHCALDF----DLFTNPGAIPSLAMDKYRNQKIVLPPFQEQQEIADYLEQQTAK 375 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGL 211 ID I + IE LKE K L++ +VT L Sbjct: 376 IDQAIALKTAHIEKLKEYKSVLINDVVTGKL 406 >gi|288553769|ref|YP_003425704.1| Type 1 restriction-modification system (S) endonuclease subunit [Bacillus pseudofirmus OF4] gi|288544929|gb|ADC48812.1| Type 1 restriction-modification system (S) endonuclease subunit [Bacillus pseudofirmus OF4] Length = 443 Score = 187 bits (474), Expect = 3e-45, Method: Composition-based stats. Identities = 102/425 (24%), Positives = 179/425 (42%), Gaps = 25/425 (5%) Query: 24 HWKVVPIKRFTKL--NTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+V+ IKR + G + ++ E V++G + K G Q D Sbjct: 15 DWQVMKIKRVLDIPITDGPHETPELLEDGVPFLSAESVKNGNLNFDLKRGYISQEDHEKY 74 Query: 79 -SIFAK--GQILYGKLGPYLRKAIIADFDGICST----QFLVLQPKDVLPELLQGWLLSI 131 I K G + D D S + + + V+P+ L ++ S+ Sbjct: 75 IKKCKPQRDDIFMVKSGATTGNIAMVDTDEEFSIWSPLALIRAKKEIVIPKYLYYFVGSL 134 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +++E T + K I N+ + IP L Q I I + ID LI ++ +F Sbjct: 135 AFREQVEVSWSYGTQQNIGMKVIENLFISIPSLEIQKRIVRYIEYKVKDIDILIKQKGKF 194 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--- 248 I+LL++++Q++++ VTKGLNP++ MKDSG++W+G +P+HWEVK + Sbjct: 195 IKLLEQQRQSILTEAVTKGLNPNMNMKDSGVKWIGEIPEHWEVKKVKHFAIHVGSGKTPS 254 Query: 249 ---TKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQI---VDPGEIVFRFIDLQND 301 ++ I L N + +++ E V P +I+ Sbjct: 255 GGAEIYLDEGIPFLRSLNVHFDGIHLKDLAFISEEINEEMKTSQVQPLDILLNITGASIG 314 Query: 302 KRSLRSAQVMERGIITSAYMAV-KPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKF 359 + ++ + + + + Y LM S + + +A R+ L F Sbjct: 315 RTTIVPKDFGRANVNQHVCIIRLNQNKVYPYYFNMLMASDVINQQIWFAQNGSSREGLNF 374 Query: 360 EDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 V+ L +PP ++EQ +I I + +I L+ +++ I LKE R S I AVTG+ Sbjct: 375 AQVRELIFAIPPTLEEQREINEWIYNKQMKIFNLINLVKEQIEKLKEYRQSLIYEAVTGK 434 Query: 419 IDLRG 423 ID+R Sbjct: 435 IDVRE 439 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 50/217 (23%), Positives = 87/217 (40%), Gaps = 14/217 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSES------GKDIIYIGLEDVESGTGK 62 KDSGV+WIG IP+HW+V +K F + +G+T + I ++ +V Sbjct: 220 MKDSGVKWIGEIPEHWEVKKVKHFAIHVGSGKTPSGGAEIYLDEGIPFLRSLNVHFDGIH 279 Query: 63 YLPK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKD 118 + ++ S IL G + + I + +++ Sbjct: 280 LKDLAFISEEINEEMKTSQVQPLDILLNITGASIGRTTIVPKDFGRANVNQHVCIIRLNQ 339 Query: 119 V--LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKII 175 P + S + Q+I G++ ++ + + IPP L EQ I E I Sbjct: 340 NKVYPYYFNMLMASDVINQQIWFAQNGSSREGLNFAQVRELIFAIPPTLEEQREINEWIY 399 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + ++I LI IE LKE +Q+L+ VT ++ Sbjct: 400 NKQMKIFNLINLVKEQIEKLKEYRQSLIYEAVTGKID 436 >gi|170680371|ref|YP_001746681.1| type I restriction modification DNA specificity domain-containing protein [Escherichia coli SMS-3-5] gi|170518089|gb|ACB16267.1| type I restriction modification DNA specificity domain protein [Escherichia coli SMS-3-5] gi|323160768|gb|EFZ46703.1| type I restriction modification DNA specificity domain protein [Escherichia coli E128010] gi|330908618|gb|EGH37137.1| type 1 restriction-modification system, specificity subunit S [Escherichia coli AA86] Length = 428 Score = 187 bits (474), Expect = 3e-45, Method: Composition-based stats. Identities = 111/414 (26%), Positives = 185/414 (44%), Gaps = 18/414 (4%) Query: 25 WKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W VP KR + + + + ++ V + + L + SD S IF K Sbjct: 16 WNSVPAKRLFTSSKEINQGMKESNRLALTMKGVINRSLDDLQ---GLQSSDYSVYQIFEK 72 Query: 84 GQILYGKL---GPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDVTQRIEA 139 +++ + + I GI S ++ V + + W I Sbjct: 73 DDLVFKLIDLENIKTSRVGIVHERGIMSPAYIRVSASSNSIYPRFYYWYFFALYLTNIYN 132 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + IP+P+ ++ Q + + ET RID+LI E+ FI+LLKEK+ Sbjct: 133 KLGGGVRQNLTAGDLLEIPVPLIDISLQKQVSTFLDRETQRIDSLIEEKQTFIKLLKEKR 192 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI---ESNI 256 QAL+S++VTKGL P+V+M+DSGIEW+G VP HWEVK + + ++ + Sbjct: 193 QALISHVVTKGLYPNVEMQDSGIEWIGQVPKHWEVKKIKHICSNFMYGTSQDCNQSDVGY 252 Query: 257 LSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 L N + + + + TY + +V R N Sbjct: 253 PVLRIPNIKSTNVDFEDLKYANISDVDALTYLLSRGDILVIRTNGNPNLVGQSALFDSNG 312 Query: 313 RGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLV 369 + + S + + P +D+++L M S + + F + S +L + + + Sbjct: 313 QYLFASYLIKLTPKQGVDTSFLVEAMNSLSVRQALTFQSRTSVGNYNLSIPSLANTSIAI 372 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 PPI EQ ITN ++ T ID+L+++ ++SI LLKE R+S I AAVTG+ID+R Sbjct: 373 PPIDEQKTITNYLSAATINIDLLIQETDKSIDLLKEHRTSLINAAVTGKIDVRE 426 Score = 90.6 bits (223), Expect = 5e-16, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 89/219 (40%), Gaps = 13/219 (5%) Query: 7 YP--QYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSE---SGKDIIYIGLEDVESGT 60 YP + +DSG++WIG +PKHW+V IK G + + S + + +++S Sbjct: 205 YPNVEMQDSGIEWIGQVPKHWEVKKIKHICSNFMYGTSQDCNQSDVGYPVLRIPNIKSTN 264 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQ 115 + + + + ++G IL + ++ + ++ + L Sbjct: 265 VDFEDLKYANISDVDALTYLLSRGDILVIRTNGNPNLVGQSALFDSNGQYLFASYLIKLT 324 Query: 116 PKDVLPELLQGWLLSIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 PK + ++ + + N + IPP+ EQ I Sbjct: 325 PKQGVDTSFLVEAMNSLSVRQALTFQSRTSVGNYNLSIPSLANTSIAIPPIDEQKTITNY 384 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + A T+ ID LI E + I+LLKE + +L++ VT ++ Sbjct: 385 LSAATINIDLLIQETDKSIDLLKEHRTSLINAAVTGKID 423 >gi|228930124|ref|ZP_04093134.1| hypothetical protein bthur0010_48060 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|228829623|gb|EEM75250.1| hypothetical protein bthur0010_48060 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] Length = 418 Score = 186 bits (473), Expect = 4e-45, Method: Composition-based stats. Identities = 99/425 (23%), Positives = 182/425 (42%), Gaps = 20/425 (4%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 KDS ++WIGAIP +WKVVP N+ + E + + + +++ + Sbjct: 1 MKDSKIEWIGAIPNYWKVVP-SNLFFYNSSKKVEGNVEQLTASQKYGVISQSRFMKLESQ 59 Query: 70 --SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 ++ D S + KG + L + IA G + + VL+ K Sbjct: 60 MPVQKRDLSDLKQVDKGDFVIS-LRSFQGGLEIAQESGGITPAYTVLKEKTKQTYAGYYK 118 Query: 128 LLSIDVTQRIEA----ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + +P+ +PPL EQ I E + +T I+ Sbjct: 119 YFFKSEMYIQALRGTVLDTIRDGKAIRFSNFSMVPIVLPPLNEQKKIVEVLDEKTKTINN 178 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 +I++ + I+ LK+ KQ+L++ VTKGLN +V +KDS IEW+G +P W + L Sbjct: 179 IISDTQQSIKELKKYKQSLITEAVTKGLNRNVGIKDSEIEWIGEMPKEWNLVKVNRLFAI 238 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + ++LS++ + K TRN G Y YQIV P + V +DL Sbjct: 239 K-KNIANQNGYDVLSVTQSGLKVKDITRNEGQMAADYSKYQIVKPKDFVMNHMDLLTGWI 297 Query: 304 SLRSAQVMERGIITSAYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAMGSGL----RQS 356 + + + G+ + Y + + ++ + ++FY +G G+ R Sbjct: 298 DIAA----QEGVTSPDYRVFYTKDTELVSNEFYLYVFQICYTNRIFYGLGQGVSNLGRWR 353 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L+ + + +PP+ EQ I +N + I+ ++E+ + + L++ + S I VT Sbjct: 354 LQTDKFLNFYLPLPPVNEQQAIVKFLNGKLVEINSMIEQKKDLLGELEQYKKSLIYECVT 413 Query: 417 GQIDL 421 G+ ++ Sbjct: 414 GKKEV 418 >gi|257091992|ref|YP_003165633.1| restriction modification system DNA specificity protein-containing protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257044516|gb|ACV33704.1| restriction modification system DNA specificity domain protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 417 Score = 186 bits (473), Expect = 4e-45, Method: Composition-based stats. Identities = 94/405 (23%), Positives = 167/405 (41%), Gaps = 16/405 (3%) Query: 28 VPIKRFTKLNTGR---TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 +K +N ++++ ++ YI + +V+S + + + + I G Sbjct: 9 RRLKYAATINDETLSESTDADFELAYIDIGNVDSQGRFHDIVNHRFDDAPSRARRIVRDG 68 Query: 85 QILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAI 140 ++ + YL+ + + I ST F V++P + L + + +E+ Sbjct: 69 DVIVSTVRTYLQAIASVENPPDNLIVSTGFAVVRPSNELDHRFCKYALRASSFLWGVESR 128 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G + + +G+I + +P L Q LI + ET RID LI E+ R + LL+EK+ Sbjct: 129 STGVSYPAINASDLGDINVSLPELGAQRLIASYLDRETARIDGLIAEKERMLALLEEKRA 188 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 AL+S +VT+GL+P+ +K SG EW+G +P HW F V + + + + Sbjct: 189 ALISRVVTRGLDPNSPLKPSGQEWLGEIPAHWPTTKFSWDVFISEGQVDPEDDRFLEMIL 248 Query: 261 YGNIIQKLETRNMGLKPES-----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + T + S G++++ I K L + Sbjct: 249 VAPNHIESRTGEVTHTETSADQGAMSGKYFCKQGDVLYSKIRPALRKVVLAEDDCL---C 305 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKE 374 Y + YL + + S D + E + + VPP++E Sbjct: 306 SADMYALRPSKRLMPEYLQYFLLSEDFSVWAELESARVAMPKINRETFSAIRIPVPPLEE 365 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 Q I I RID+ + + S+ LLKERR++ I AAV+GQI Sbjct: 366 QERIVLEIRDGAKRIDLQRKAVRGSVELLKERRAALITAAVSGQI 410 Score = 93.7 bits (231), Expect = 6e-17, Method: Composition-based stats. Identities = 62/205 (30%), Positives = 99/205 (48%), Gaps = 4/205 (1%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKD 67 K SG +W+G IP HW ++ G+ ++I + +ES TG+ + Sbjct: 206 KPSGQEWLGEIPAHWPTTKFSWDVFISEGQVDPEDDRFLEMILVAPNHIESRTGEVTHTE 265 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQG 126 ++ Q S +G +LY K+ P LRK ++A+ D +CS L+P K ++PE LQ Sbjct: 266 TSADQGAMSGKYFCKQGDVLYSKIRPALRKVVLAEDDCLCSADMYALRPSKRLMPEYLQY 325 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +LLS D + E M + + I +P+PPL EQ I +I RID Sbjct: 326 FLLSEDFSVWAELESARVAMPKINRETFSAIRIPVPPLEEQERIVLEIRDGAKRIDLQRK 385 Query: 187 ERIRFIELLKEKKQALVSYIVTKGL 211 +ELLKE++ AL++ V+ + Sbjct: 386 AVRGSVELLKERRAALITAAVSGQI 410 >gi|317132749|ref|YP_004092063.1| restriction modification system DNA specificity subunit [Ethanoligenens harbinense YUAN-3] gi|315470728|gb|ADU27332.1| restriction modification system DNA specificity subunit [Ethanoligenens harbinense YUAN-3] Length = 462 Score = 186 bits (472), Expect = 6e-45, Method: Composition-based stats. Identities = 79/434 (18%), Positives = 159/434 (36%), Gaps = 26/434 (5%) Query: 9 QYKD---SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 +Y D + W+ IP HW++ I +T S KD + + G Sbjct: 3 EYTDVINTDAAWLPQIPAHWQLQKIDALFTER--KTKVSDKDYAPLSVT----KKGILPQ 56 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + ++ +D+ + G + ++ DG S LVL P+ L Sbjct: 57 LEHAAKSNDSDNRKLVKAGDFVINSRSDRKGSCGVSKLDGSVSLINLVLTPRSKLNNDYV 116 Query: 126 GWLLSID-VTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 +LL ++ G + + + I +P+PP AEQ I + + I+ Sbjct: 117 HYLLRNYRFSEEYYRNGRGIVADLWTTRYSEMRTILLPVPPRAEQDQIVRFLDWKVSEIN 176 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP----FF 238 LI R + I+ + K +++ VT GL ++ + + ++P W + Sbjct: 177 KLIGIRRKEIQEFNQLKNTVITKTVTTGL-KREELCGTDNSYYRMIPKGWRITKTLRVLS 235 Query: 239 ALVTELNRKNTKLIESNILSLS-----YGNIIQKLETRNMGLKPESYETYQ---IVDPGE 290 +T+ +L E I +S GN + + YE + + Sbjct: 236 QPLTDGPHTTPQLYEEGIPFVSAEAVSCGNGKIDFNHIRGFISQDFYEECCKKYVPKIDD 295 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AM 349 I + S+ + A + +L + +++ + Sbjct: 296 IYMIKSGATTGRVSIVDTDRIFTIWSPLAVFRCNQEVMLPRFLFYALQALPYQQQVQDGW 355 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G +Q++ +++L + P + EQ I ++ + +D ++ E I L+E +S+ Sbjct: 356 SYGTQQNIGMRVLEQLKLAYPDVTEQEKIACYLDDKCDMLDKAIQLAESKIKALQELKST 415 Query: 410 FIAAAVTGQIDLRG 423 I+ VTG+ID+R Sbjct: 416 IISDVVTGKIDVRN 429 >gi|167771153|ref|ZP_02443206.1| hypothetical protein ANACOL_02508 [Anaerotruncus colihominis DSM 17241] gi|167666823|gb|EDS10953.1| hypothetical protein ANACOL_02508 [Anaerotruncus colihominis DSM 17241] Length = 444 Score = 186 bits (471), Expect = 8e-45, Method: Composition-based stats. Identities = 96/429 (22%), Positives = 179/429 (41%), Gaps = 17/429 (3%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M Y YK W+ IP W+ + IK + + + + + +++ Sbjct: 1 MSK---YESYKPIEELWLTQIPDSWEDIKIKFLFSERSEKGYP--DEPLLVASQNMGVVP 55 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 D + + G + L + A + GI S + ++ PK + Sbjct: 56 KGVYGNRTVQATKDLHLLKLVRVGDFVIS-LRSFQGGIEYAYYQGIISPAYTIMVPKQKI 114 Query: 121 PELLQGWLLSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 +L + + +C + D+ + N +P+PP EQ I + +T Sbjct: 115 VPGYFRYLAKSRLFIELLQLCVTGIREGQNIDYGKLKNHLIPVPPSEEQDQIVRYLDWQT 174 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +++ LI + R I LL+E++QA ++Y+VT+GL+ + ++ DSGI+++G VP HW+V Sbjct: 175 SKVNRLINAKKRIISLLEEQQQATIAYVVTRGLDQNAELMDSGIDYIGKVPAHWKVL-LN 233 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + + + E+ + ++ + L SYE +++V P ++V Sbjct: 234 HRIYKEKSRKFGEEETVLSLSQKDGLLPYENMKERSLHTASYENWKLVFPNDLVLNRFKA 293 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLR--- 354 RGI+T Y +P S+ + + + +VF + +G+ Sbjct: 294 HL----GVFFSSNYRGIVTFHYGVYEPVMKISSKYYEALYHTPEFRRVFASKSNGMTVGL 349 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 Q+L + + + PP +EQ I I + L+ KI Q I L E R+ I+ Sbjct: 350 QNLSNTNFYSVYTVYPPHEEQCQIVCKIKEIEEKYRDLIAKINQEIDCLHEYRTRLISDV 409 Query: 415 VTGQIDLRG 423 VTGQID+R Sbjct: 410 VTGQIDVRN 418 >gi|228964022|ref|ZP_04125152.1| hypothetical protein bthur0004_8820 [Bacillus thuringiensis serovar sotto str. T04001] gi|228795674|gb|EEM43151.1| hypothetical protein bthur0004_8820 [Bacillus thuringiensis serovar sotto str. T04001] Length = 409 Score = 185 bits (469), Expect = 1e-44, Method: Composition-based stats. Identities = 92/403 (22%), Positives = 172/403 (42%), Gaps = 17/403 (4%) Query: 38 TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLGPYL 95 + ++ DI +I +ED + + +F G +L + Sbjct: 6 RDKPTKFDGDIPWIRIEDFNGKYISDSKSRQYVSKELVKGMNLKVFPIGTVLCTCSCS-M 64 Query: 96 RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 I + I + F+ + P + L +L+ +R++ +GA + Sbjct: 65 GATAIVEQPLISNQTFIGIVPGENLDSEYLFYLMQASA-ERLQLFAQGAIQQYLSKHNFE 123 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV 215 ++ +P+P L Q + + + +D LI + + I+LL+EK+Q L++ VT+GLNP+V Sbjct: 124 HLKIPLPSLKIQKRLLVFLNRKLKDLDELIENKKQLIDLLEEKRQTLITEAVTRGLNPNV 183 Query: 216 KMKDSGIEWVGLVPDHWEVKPFFAL------VTELNRKNTKLIESNILSLSYGNIIQKL- 268 KMKDSG+EW+G +P+HW +K + + ES +L L N+ Sbjct: 184 KMKDSGVEWIGEIPEHWTIKKIKHISNLVGSGKTPKGGSEIYPESGVLFLRSMNVHYDGI 243 Query: 269 ---ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV-K 324 + ++ + + V +++ + + + + + + Sbjct: 244 RLKDIVHITPEIDEDMRSTRVKSKDVLLNITGASIGRSCIVPESLGKANVNQHVCIIRSN 303 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP-PIKEQFDITNVI 382 + L+ +M S + + +G R+ L F VK L + ++EQ +I N I Sbjct: 304 TKVVVPELLSKIMASNFIMQQILMSQNGSSREGLNFTQVKNLEFPLTRDLQEQIEIANHI 363 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +VET +I+ L+ IE+ I LKE R S I VTG+ID+R Sbjct: 364 SVETNKINSLIGMIEEQIQKLKEYRQSLIYEVVTGKIDVRDFE 406 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 54/218 (24%), Positives = 99/218 (45%), Gaps = 14/218 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLN-TGRTSESGKDI------IYIGLEDVESGTG 61 + KDSGV+WIG IP+HW + IK + L +G+T + G +I +++ +V Sbjct: 184 KMKDSGVEWIGEIPEHWTIKKIKHISNLVGSGKTPKGGSEIYPESGVLFLRSMNVHYDGI 243 Query: 62 KYLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQ--FLVLQ 115 + + + D + +L G + ++ I + + Sbjct: 244 RLKDIVHITPEIDEDMRSTRVKSKDVLLNITGASIGRSCIVPESLGKANVNQHVCIIRSN 303 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPI-PPLAEQVLIREKI 174 K V+PELL + S + Q+I G++ ++ + N+ P+ L EQ+ I I Sbjct: 304 TKVVVPELLSKIMASNFIMQQILMSQNGSSREGLNFTQVKNLEFPLTRDLQEQIEIANHI 363 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 ET +I++LI I+ LKE +Q+L+ +VT ++ Sbjct: 364 SVETNKINSLIGMIEEQIQKLKEYRQSLIYEVVTGKID 401 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 66/173 (38%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 R + +I + + K + + + S E + ++ + Sbjct: 4 PMRDKPTKFDGDIPWIRIEDFNGKYISDSKSRQYVSKELVKGMNLKVFPIGTVLCTCSCS 63 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 +A V + I ++ + P + + + ++ ++Q L + + Sbjct: 64 MGATAIVEQPLISNQTFIGIVPGENLDSEYLFYLMQASAERLQLFAQGAIQQYLSKHNFE 123 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L + +P +K Q + +N + +D L+E +Q I LL+E+R + I AVT Sbjct: 124 HLKIPLPSLKIQKRLLVFLNRKLKDLDELIENKKQLIDLLEEKRQTLITEAVT 176 >gi|49484938|ref|YP_042159.1| putative type I restriction enzyme specificity protein [Staphylococcus aureus subsp. aureus MSSA476] gi|49243381|emb|CAG41798.1| putative type I restriction enzyme specificity protein [Staphylococcus aureus subsp. aureus MSSA476] Length = 436 Score = 184 bits (468), Expect = 2e-44, Method: Composition-based stats. Identities = 79/434 (18%), Positives = 165/434 (38%), Gaps = 22/434 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGK 62 + K SG++WIG IPK+W + +K +G +S + I ++ + Sbjct: 4 EMKYSGIEWIGYIPKYWTITKLKNIIDFISGYAFKSELFTISDNNKKVITIKSFNTKEII 63 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLP 121 ++ T + IL+ G + +I D + V + Sbjct: 64 LDNLSYSNESLKFPTKYLLKNNDILFAMSGGTTGKNLLIEQVDDLYYINQRVGIIRSSFS 123 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + ++ + ++ I G+ + I N + +P I I + I Sbjct: 124 KFIYYYINTGLFSEYINLFSSGSAQPNISATDIQNFIIALPEKETIKKIEIYINYQLKII 183 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 +I + IE LK+ KQ+L++ VTKG++P+V+MK+SG +W+G +P +W V+ Sbjct: 184 SNIIDTTYQSIEELKKYKQSLITEAVTKGIDPNVEMKESGNDWIGSIPSNWSVRKIKHDF 243 Query: 242 T-----ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY-----QIVDPGEI 291 + ++ L G +K R S E + + ++ Sbjct: 244 NLKGRIGWQGLTSNEYQTVGPYLITGTDFKKGIIRWDSCVRISEERFEEAPDIHIKENDL 303 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAM 349 + K +L + + + + + + + I+ ++ + + S + + Sbjct: 304 LITKDGTIG-KVALATNVPKKVSLNSGVLLIREKLKNTINKKFMYYNLLSNMFWNWYNSN 362 Query: 350 GSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 G + L +P + EQ I ++ + + ID L+E + I L+ + Sbjct: 363 NQGASTIKHLYQGQFYNYSYAIPLLHEQQQIVQYLDDKVSTIDRLIEDKTKVIKELENYK 422 Query: 408 SSFIAAAVTGQIDL 421 S I VTG+ ++ Sbjct: 423 KSLIYEYVTGKKEV 436 >gi|114320942|ref|YP_742625.1| restriction modification system DNA specificity subunit [Alkalilimnicola ehrlichii MLHE-1] gi|114227336|gb|ABI57135.1| restriction modification system DNA specificity domain protein [Alkalilimnicola ehrlichii MLHE-1] Length = 419 Score = 184 bits (467), Expect = 2e-44, Method: Composition-based stats. Identities = 93/413 (22%), Positives = 162/413 (39%), Gaps = 17/413 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P W +K N ES + I Y+ + V G + ++ + Sbjct: 6 LPATWSSKRLKYLATYNDEVLPESTDEEAEIDYVEISGVSLSRGVEQVERITFGKAPSRA 65 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQ--GWLLSID 132 G IL + YLR D I ST F V++P + S Sbjct: 66 RRKVRSGDILISTVRTYLRAIAKVDEASPDLIASTGFCVVRPDREEVDSGYLGWAAKSEP 125 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G + + + I MP+PPL Q I + + +T RID LI ++ + Sbjct: 126 FVSEVVSRSVGVSYPAINASELVTIEMPLPPLETQRRIAQFLDEKTARIDGLIEKKRALL 185 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-NTKL 251 + L EK+QAL++ VTKGLNP+ MK SGI+W+G +P HW++ PF + + + + Sbjct: 186 DRLAEKRQALITRAVTKGLNPEAPMKPSGIDWLGDIPAHWDLVPFKWRCQVQSGQVDPRE 245 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPES----YETYQIVDPGEIVFRFIDLQNDKRSLRS 307 E + L + I+ R + G +++ I K +L Sbjct: 246 PEYTDMPLIAPDYIESGTGRLYDVPSAEEQGAISGKYFCSEGSVLYSKIRPALRKVALFD 305 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLP 366 + + Y + YL + + + + E + Sbjct: 306 SVCL---CSADMYAIDPGKYFERRYLFYFLLTDAFTAYAELESLRVAMPKVNREALGAFV 362 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + +P + EQ +I + + +++++S+ L+E RS+ I AAVTGQI Sbjct: 363 LPIPFLDEQTEIADYCSRVDRENRFAADEVKRSVQKLEEYRSALITAAVTGQI 415 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 57/206 (27%), Positives = 89/206 (43%), Gaps = 4/206 (1%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPK 66 K SG+ W+G IP HW +VP K ++ +G+ D+ I + +ESGTG+ Sbjct: 210 MKPSGIDWLGDIPAHWDLVPFKWRCQVQSGQVDPREPEYTDMPLIAPDYIESGTGRLYDV 269 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQ 125 Q S ++G +LY K+ P LRK + D +CS + P K L Sbjct: 270 PSAEEQGAISGKYFCSEGSVLYSKIRPALRKVALFDSVCLCSADMYAIDPGKYFERRYLF 329 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +LL+ T E M + + +G +PIP L EQ I + Sbjct: 330 YFLLTDAFTAYAELESLRVAMPKVNREALGAFVLPIPFLDEQTEIADYCSRVDRENRFAA 389 Query: 186 TERIRFIELLKEKKQALVSYIVTKGL 211 E R ++ L+E + AL++ VT + Sbjct: 390 DEVKRSVQKLEEYRSALITAAVTGQI 415 >gi|71735008|ref|YP_272417.1| type I restriction-modification system specificity subunit [Pseudomonas syringae pv. phaseolicola 1448A] gi|71555561|gb|AAZ34772.1| type I restriction-modification system specificity subunit [Pseudomonas syringae pv. phaseolicola 1448A] Length = 448 Score = 183 bits (465), Expect = 4e-44, Method: Composition-based stats. Identities = 105/416 (25%), Positives = 180/416 (43%), Gaps = 22/416 (5%) Query: 25 WKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W++ +K +N + G+ ++ +E V S G+ + ++ S + F Sbjct: 25 WRICRLKHVALINPYLSLSRVRWGEPASFLPMEAV-SADGQVDYSEPKDSKNLVSGFTNF 83 Query: 82 AKGQILYGKLGPYLRKAI------IADFDGICSTQFLV-LQPKDVLPELLQGWLLSIDVT 134 G ++ K+ P + G ST+F V K +P + S Sbjct: 84 EAGDVILAKITPCFENGKGAVLSDMPTRVGFGSTEFHVLRANKKAIPNFIYYITKSDLFM 143 Query: 135 QRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ EA+ G+ + N + +P L EQ I + + +T I I+++ IE Sbjct: 144 RQGEALMIGSAGQKRVSTSYVENFQLALPSLHEQRKIVDFLEEKTSLIAQAISKKEHQIE 203 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LL+E+KQ LV VT+GL+P M+++GIEW+G +P HWEV+ + K Sbjct: 204 LLEERKQILVQQAVTRGLDPASPMRNAGIEWIGEIPKHWEVRRSKFTFAQRKELARKNDI 263 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + SYG I Q +G K E + V+ + V Q L A Sbjct: 264 QLSATQSYGVIPQDEYEEKVGRKVVKILFNLEKRKHVEVDDFVISMRSFQG---GLERAW 320 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPV 367 I +S + GID Y ++L++S A + +R Q L FE+ + + Sbjct: 321 ASG-CIRSSYVILKPLPGIDPDYYSYLLKSKRYIAALQATANFIRDGQDLNFENFALVDL 379 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +PP+ EQ +I + ++ D + +EQ I LKE +++ I +AVTG+I + G Sbjct: 380 PIPPLDEQKEIARYLASWLSKADRSLYLLEQQITKLKEYKATLINSAVTGKIKVPG 435 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 72/208 (34%), Gaps = 9/208 (4%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDI-IYIGLEDVESGTGKYLPKDG 68 +++G++WIG IPKHW+V K + DI + +Y K G Sbjct: 227 MRNAGIEWIGEIPKHWEVRRSK--FTFAQRKELARKNDIQLSATQSYGVIPQDEYEEKVG 284 Query: 69 ---NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + + + A G + +++L+P + Sbjct: 285 RKVVKILFNLEKRKHVEVDDFVIS-MRSFQGGLERAWASGCIRSSYVILKPLPGIDPDYY 343 Query: 126 GWLLSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 +LL +++ + +PIPPL EQ I + + + D Sbjct: 344 SYLLKSKRYIAALQATANFIRDGQDLNFENFALVDLPIPPLDEQKEIARYLASWLSKADR 403 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGL 211 + + I LKE K L++ VT + Sbjct: 404 SLYLLEQQITKLKEYKATLINSAVTGKI 431 >gi|323160944|gb|EFZ46868.1| type I restriction modification DNA specificity domain protein [Escherichia coli E128010] Length = 394 Score = 183 bits (465), Expect = 4e-44, Method: Composition-based stats. Identities = 104/366 (28%), Positives = 169/366 (46%), Gaps = 14/366 (3%) Query: 72 QSDTSTVSIFAKGQILYGKL---GPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGW 127 SD S IF K +++ + + I GI S ++ V + + W Sbjct: 27 SSDYSVYQIFEKDDLVFKLIDLENIKTSRVGIVHERGIMSPAYIRVSASSNSIYPRFYYW 86 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I G + + IP+P+ ++ Q + + ET RID+LI E Sbjct: 87 YFFALYLTNIYNKLGGGVRQNLTAGDLLEIPVPLIDISLQKQVSTFLDRETQRIDSLIEE 146 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + FI+LLKEK+QAL+S++VTKGL P+V+M+DSGIEW+G VP HWEVK + + Sbjct: 147 KQTFIKLLKEKRQALISHVVTKGLYPNVEMQDSGIEWIGQVPKHWEVKKIKHICSNFMYG 206 Query: 248 NTKLI---ESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 ++ + L N + + + + TY + +V R N Sbjct: 207 TSQDCNQSDVGYPVLRIPNIKSTNVDFEDLKYANISDVDALTYLLSRGDILVIRTNGNPN 266 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKV--FYAMGSGLRQSL 357 + + S + + P +D+++L M S + + F + S +L Sbjct: 267 LVGQSALFDSNGQYLFASYLIKLTPKQGVDTSFLVEAMNSLSVRQALTFQSRTSVGNYNL 326 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + + +PPI EQ ITN ++ T ID+L+++ ++SI LLKE R+S I AAVTG Sbjct: 327 SIPSLANTSIAIPPIDEQKTITNYLSAATINIDLLIQETDKSIDLLKEHRTSLINAAVTG 386 Query: 418 QIDLRG 423 +ID+R Sbjct: 387 KIDVRE 392 Score = 90.2 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 46/219 (21%), Positives = 89/219 (40%), Gaps = 13/219 (5%) Query: 7 YP--QYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSE---SGKDIIYIGLEDVESGT 60 YP + +DSG++WIG +PKHW+V IK G + + S + + +++S Sbjct: 171 YPNVEMQDSGIEWIGQVPKHWEVKKIKHICSNFMYGTSQDCNQSDVGYPVLRIPNIKSTN 230 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQ 115 + + + + ++G IL + ++ + ++ + L Sbjct: 231 VDFEDLKYANISDVDALTYLLSRGDILVIRTNGNPNLVGQSALFDSNGQYLFASYLIKLT 290 Query: 116 PKDVLPELLQGWLLSIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 PK + ++ + + N + IPP+ EQ I Sbjct: 291 PKQGVDTSFLVEAMNSLSVRQALTFQSRTSVGNYNLSIPSLANTSIAIPPIDEQKTITNY 350 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + A T+ ID LI E + I+LLKE + +L++ VT ++ Sbjct: 351 LSAATINIDLLIQETDKSIDLLKEHRTSLINAAVTGKID 389 >gi|255657323|ref|ZP_05402732.1| type I restriction-modification system [Clostridium difficile QCD-23m63] Length = 453 Score = 183 bits (464), Expect = 5e-44, Method: Composition-based stats. Identities = 87/430 (20%), Positives = 167/430 (38%), Gaps = 20/430 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 Y +Y+D+G+ WI +PK W + I + S+ + + + G + Sbjct: 3 KYERYRDTGLIWINKVPKKWNLQKINAVFDERREKVSDKDYEALSVT------KNGIFKQ 56 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 D ++ D + + ++ FDG S VL+ K P + Sbjct: 57 LDNVAKTIDGDNRKKVKINDFVINSRSDRKGSSGLSRFDGSVSLINTVLKIKKEYPRYMH 116 Query: 126 GWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L S+ + +G + +++ + +I +PIPP+ EQV I + + I+ Sbjct: 117 YLLKSVPFQEEFYRNGKGIVADLWSTNFQSMKSIILPIPPIEEQVQIANYLDWKINEINR 176 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 LI I+ L+ + ++S V +G+ K S I W+ +P HW V Sbjct: 177 LIQIEKEKIKELETLRFNVISEFVLRGIG-TQNYKKSSINWLDEIPSHWNEVSIRWCVNI 235 Query: 244 LNRKNTKLIESNILSLSYGNIIQK--------LETRNMGLKPESYETYQIVDPGEIVFRF 295 + +T + Y + + + + Y+ Q+V+ +I+ Sbjct: 236 IRGNSTFTKDDLQNRGKYVGLQYGKVYKTEIIDSEFDFYVSDKFYKPAQVVNRNDIIIVS 295 Query: 296 ID-LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL- 353 D + G+I + +KP ++ + + S +G+ Sbjct: 296 TSETVEDLGHTSFYDRDDIGLIGGEQILLKPSNNINSKYLFYL-SKIFRMQLQLCATGIK 354 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 K D+K++ V +PP+KEQ I + I ++ +ID V+ I L+ + S I+ Sbjct: 355 VYRFKISDLKQIYVPLPPMKEQEKIVSNIELKLEQIDERVKNNYAFIKELELLKQSLISE 414 Query: 414 AVTGQIDLRG 423 VTG+ID+R Sbjct: 415 VVTGKIDVRN 424 >gi|269140413|ref|YP_003297114.1| type I restriction modification DNA specificity domain protein [Edwardsiella tarda EIB202] gi|267986074|gb|ACY85903.1| type I restriction modification DNA specificity domain protein [Edwardsiella tarda EIB202] Length = 441 Score = 183 bits (464), Expect = 5e-44, Method: Composition-based stats. Identities = 85/431 (19%), Positives = 160/431 (37%), Gaps = 30/431 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIY---IGLEDVESGTGKYLPK 66 +K + V G IP+ W+VVP + +G+ S + I + +E+GTG+ + K Sbjct: 18 FKLTEV---GVIPEDWEVVPFFDVVSIVSGQISPICEPYSSMTLIAPDHIETGTGRLISK 74 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQ 125 Q S +F G +Y K+ PYLRKAI A+FDG+CS L+PK+ + P+ + Sbjct: 75 KSAKEQGAISGKYVFHAGDTIYSKIRPYLRKAIYANFDGLCSADMYPLRPKEGIEPKYIL 134 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRIDTL 184 +L ++ E++ + + + I + IP E Q I + I L Sbjct: 135 PLVLGNRFSKYAESVSVRSGIPKINRTEIADFLFVIPRQREEQTAIANVLFDTEALIAAL 194 Query: 185 ITERIRFIELLKEKKQALVS------YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 + + Q L++ K S +G +P+ W V Sbjct: 195 EQILAKKQAIKTAAMQQLLTGKTRLPQFAMWEDGTTKGYKKS---ELGEIPEDWVVTNIG 251 Query: 239 ALVTELNR-----KNTKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGE 290 K + +S G + K + + + V Sbjct: 252 QFTDCCAGGTPGTKVSAYWGGTHPWMSSGELHLKQVHTVADYITDEGLANSSTKYVPKNS 311 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ Q R + +E S + +L + + + + G Sbjct: 312 VLVGLAG-QGKTRGTVAINRIELCTNQSIAAIFPGEHHSTEFLFYNLDNRYEELRSLSTG 370 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G R L +++L + PP +EQ I +++ ID ++ ++Q + ++ + Sbjct: 371 DGGRGGLNLTIIRKLHLAFPPKEEQTAIAAILSD----IDEDIQTLQQRLNKTRQLKQGM 426 Query: 411 IAAAVTGQIDL 421 + +TG+I L Sbjct: 427 MQELLTGKIRL 437 >gi|255102540|ref|ZP_05331517.1| type I restriction-modification system [Clostridium difficile QCD-63q42] Length = 453 Score = 181 bits (460), Expect = 1e-43, Method: Composition-based stats. Identities = 87/430 (20%), Positives = 166/430 (38%), Gaps = 20/430 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 Y +Y+D+G+ WI +PK W + I + S+ + + + G + Sbjct: 3 KYERYRDTGLIWINKVPKKWNLQKINAVFDERREKVSDKDYEALSVT------KNGIFKQ 56 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 D ++ D + + ++ FDG S VL+ K P + Sbjct: 57 LDNVAKTIDGDNRKKVKINDFVINSRSDRKGSSGLSRFDGSVSLINTVLKIKKEYPRYMH 116 Query: 126 GWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L S+ + +G + +++ + +I +PIPP+ EQV I + + I+ Sbjct: 117 YLLKSVPFQEEFYRNGKGIVADLWSTNFQSMKSIILPIPPIEEQVQIANYLDWKINEINR 176 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 LI I+ L+ + +S V +G+ K S I W+ +P HW V Sbjct: 177 LIQIEKEKIKELETLRFNAISEFVLRGIG-TQNYKKSSINWLDEIPSHWNEVSIRWCVNI 235 Query: 244 LNRKNTKLIESNILSLSYGNIIQK--------LETRNMGLKPESYETYQIVDPGEIVFRF 295 + +T + Y + + + + Y+ Q+V+ +I+ Sbjct: 236 IRGNSTFTKDDLQNRGKYVGLQYGKVYKTEIIDSEFDFYVSDKFYKPAQVVNRNDIIIVS 295 Query: 296 ID-LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL- 353 D + G+I + +KP ++ + + S +G+ Sbjct: 296 TSETVEDLGHTSFYDRDDIGLIGGEQILLKPSNNINSKYLFYL-SKIFRMQLQLCATGIK 354 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 K D+K++ V +PP+KEQ I + I ++ +ID V+ I L+ + S I+ Sbjct: 355 VYRFKISDLKQIYVPLPPMKEQEKIVSNIELKLEQIDERVKNNYAFIKELELLKQSLISE 414 Query: 414 AVTGQIDLRG 423 VTG+ID+R Sbjct: 415 VVTGKIDVRN 424 >gi|291540900|emb|CBL14011.1| Restriction endonuclease S subunits [Roseburia intestinalis XB6B4] Length = 445 Score = 181 bits (459), Expect = 2e-43, Method: Composition-based stats. Identities = 88/441 (19%), Positives = 164/441 (37%), Gaps = 25/441 (5%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESG 59 Q K S + WIG +PK W V PIK G SE+ + I +I +E Sbjct: 3 EQMKSSRIDWIGDVPKSWDVEPIKYRVSFYNGDRSENYPSKNEIQSEGIPFINAGHIEGN 62 Query: 60 TGKYLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 + + +G ILY G + I+ G ++ + ++ Sbjct: 63 CLNMNDMDYISEEKYRVMGGVKLQQGDILYCLRGSVGKNIIVNIDKGTVASSLVAIRSNG 122 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 +L + L L S + G + +G + IPP EQ I + + E Sbjct: 123 ILNKYLYYCLNSNVEEVQRCLWDNGTAQPNLSADSLGKFKICIPPDHEQQAIADFLDKEC 182 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +ID++ + + I+LL++ K++L++ VTKGL+ V MKDSG+EW+G +P HW+ K Sbjct: 183 AQIDSIAADLEKQIDLLQQYKKSLITETVTKGLDKSVPMKDSGVEWIGKIPAHWDFKRLK 242 Query: 239 ALV-----------TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-- 285 ++ + + + ++ + N E Sbjct: 243 FMLENSSDSMKVGPFGSALSGSDFTDEGKWVYNQRVVLDNNFSENTTFVSEEKFQEMRSF 302 Query: 286 -VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLC 343 V PG+I+ + V I L + S + Sbjct: 303 AVYPGDILITTRGTIGKVAIVPEGANEGILHPCIIKFRVDKEMIIPELLQLIFNESDFVK 362 Query: 344 KVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F M + + + +K + + V P EQ I ++ + ID ++ + ++++ Sbjct: 363 DQFTLMSNATTIEVIYSYSLKDILLPVIPADEQTKIYGYLSKKCIVIDGIIAEKQKALAT 422 Query: 403 LKERRSSFIAAAVTGQIDLRG 423 + + + S I V G+ ++ Sbjct: 423 ITQHKKSLIYEYVAGKKRVKE 443 >gi|168362839|ref|ZP_02696013.1| probable type I restriction-modification system [Ureaplasma urealyticum serovar 13 str. ATCC 33698] gi|171903131|gb|EDT49420.1| probable type I restriction-modification system [Ureaplasma urealyticum serovar 13 str. ATCC 33698] Length = 453 Score = 181 bits (459), Expect = 2e-43, Method: Composition-based stats. Identities = 87/430 (20%), Positives = 166/430 (38%), Gaps = 20/430 (4%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 Y +Y+D+G+ WI +PK W + I + S+ + + G + Sbjct: 3 KYERYRDTGLIWINKVPKKWNLQKINAVFDERREKVSDKDYVALSVT------KNGIFKQ 56 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 D ++ D + + ++ FDG S VL+ K P + Sbjct: 57 LDNVAKTIDGDNRKKVKINDFVINSRSDRKGSSGLSRFDGSVSLINTVLKIKKEYPRYMH 116 Query: 126 GWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L S+ + +G + +++ + +I +PIPP+ EQV I + + I+ Sbjct: 117 YLLKSVPFQEEFYRNGKGIVADLWSTNFQSMKSIILPIPPIEEQVQIANYLDWKINEINR 176 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 LI I+ L+ + ++S V +G+ K S I W+ +P HW V Sbjct: 177 LIQIEKEKIKELETLRFNVISEFVLRGIG-TQNYKKSSINWLDEIPSHWNEVSIRWCVNI 235 Query: 244 LNRKNTKLIESNILSLSYGNIIQK--------LETRNMGLKPESYETYQIVDPGEIVFRF 295 + +T + Y + + + + Y+ Q+V+ +I+ Sbjct: 236 IRGNSTFTKDDLQNRGKYVGLQYGKVYKTEIIDSEFDFYVSDKFYKPAQVVNRNDIIIVS 295 Query: 296 ID-LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL- 353 D + G+I + +KP ++ + + S +G+ Sbjct: 296 TSETVEDLGHTSFYDRDDIGLIGGEQILLKPSNNINSKYLFYL-SKIFRMQLQLCATGIK 354 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 K D+K++ V +PP+KEQ I + I ++ +ID V+ I L+ + S I+ Sbjct: 355 VYRFKISDLKQIYVPLPPMKEQEKIVSNIELKLEQIDERVKNNYAFIKELELLKQSLISE 414 Query: 414 AVTGQIDLRG 423 VTG+ID+R Sbjct: 415 VVTGKIDVRN 424 >gi|15597930|ref|NP_251424.1| hypothetical protein PA2734 [Pseudomonas aeruginosa PAO1] gi|9948811|gb|AAG06122.1|AE004701_5 hypothetical protein PA2734 [Pseudomonas aeruginosa PAO1] Length = 431 Score = 180 bits (457), Expect = 3e-43, Method: Composition-based stats. Identities = 83/355 (23%), Positives = 139/355 (39%), Gaps = 6/355 (1%) Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW-LLSIDV 133 KG I Y + + DG+ S ++V++P + + Sbjct: 47 KEKYKRAVKGDIAYNMMRMWQGAVGPVPEDGLVSPAYVVVKPYAEANSTYFSYLFRTAAY 106 Query: 134 TQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 Q + G + W+ +P +PP EQ I + + I I + Sbjct: 107 MQEVNKFSRGIVADRNRLYWESFKQMPSLVPPRPEQDQIVTYLRTQDAHIACFIRAKRDL 166 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I LL E+K ++ + VT+GL+ VK+K S IEW+G VP HWEVK L + + T Sbjct: 167 IALLTEQKLRIIDHAVTRGLDASVKLKPSDIEWLGEVPAHWEVKRLKFLAGNITSQTTTK 226 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + I R +G + E T + +++F + K + Sbjct: 227 ADDEIYLALEHVQSWTGVARPLGGEVEFASTVKRFVADDVLFGKLRPYLAK--VTRVVCA 284 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP 370 + + + I YL L+R + + + +G + + + + +P Sbjct: 285 GVCVSEFLVLRSRQELILPAYLEQLLRCKRVIDLISSSTAGAKMPRADWNFIGNVRLPIP 344 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 EQ I + I ET +D + + E I L++E R IA AVTGQ+DLRG Sbjct: 345 RKDEQEAILSHIGRETKDLDETIARAEDEIKLIREYRDRLIADAVTGQVDLRGWQ 399 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 70/204 (34%), Positives = 103/204 (50%), Gaps = 4/204 (1%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 K S ++W+G +P HW+V +K T +T+ D IY+ LE V+S TG + + Sbjct: 193 KPSDIEWLGEVPAHWEVKRLKFLAGNITSQTTTKADDEIYLALEHVQSWTG--VARPLGG 250 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPELLQGWL 128 STV F +L+GKL PYL K G+C ++FLVL+ + +LP L+ L Sbjct: 251 EVEFASTVKRFVADDVLFGKLRPYLAKVTRVVCAGVCVSEFLVLRSRQELILPAYLEQLL 310 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 V I + GA M ADW IGN+ +PIP EQ I I ET +D I Sbjct: 311 RCKRVIDLISSSTAGAKMPRADWNFIGNVRLPIPRKDEQEAILSHIGRETKDLDETIARA 370 Query: 189 IRFIELLKEKKQALVSYIVTKGLN 212 I+L++E + L++ VT ++ Sbjct: 371 EDEIKLIREYRDRLIADAVTGQVD 394 >gi|145635506|ref|ZP_01791206.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae PittAA] gi|145267271|gb|EDK07275.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae PittAA] Length = 348 Score = 180 bits (457), Expect = 3e-43, Method: Composition-based stats. Identities = 84/346 (24%), Positives = 159/346 (45%), Gaps = 9/346 (2%) Query: 83 KGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 KG+ L L + +++ D + S ++VL+ K ++ + +LL ++ Sbjct: 3 KGEFLINPLNLNYDLISLRIALSEIDVVVSAGYIVLKEKQIINKKYFSYLLHRYDVAYMK 62 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G ++ I + + IPPL+EQ I + + +T +ID + + I LLKE Sbjct: 63 LLGSGV-RQTINYGHISDSILVIPPLSEQQKIAQFLDDKTAKIDRAVDLAEKQIALLKEH 121 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 KQ L+ VT+GLNPDV +KDSG+EW+G VP+HW+V+ + ++ RK + + Sbjct: 122 KQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWDVQRSKFIFKKIERKVNEEDQIVTCF 181 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 ++ YQ + G++V +D + + + + Sbjct: 182 RDGQVTLRANRRTEGFTNALKEHGYQGIRKGDLVIHAMDAFAGAIGISDSDGKATPVYS- 240 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS---LKFEDVKRLPVLVPPIKEQ 375 + ID + A+ +R+ L ++ G+R+ ++ D L + +PP EQ Sbjct: 241 VCLPHNKQKIDVYFYAYYLRNLALSGFISSLAKGIRERSTDFRYADFAELLLPIPPYLEQ 300 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I + ++ +T++ID ++ I LKE +S I VTG++ + Sbjct: 301 QKIADYLDKQTSKIDQVIALKTAHIEKLKEYKSVLINDVVTGKVRV 346 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 78/202 (38%), Gaps = 7/202 (3%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 KDSGV+WIG +P+HW V K K + + +D I D + +G + Sbjct: 141 KDSGVEWIGQVPEHWDVQRSKFIFKKIERKV--NEEDQIVTCFRDGQVTLRANRRTEGFT 198 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWL 128 KG ++ + + I+D DG + + V P K + + Sbjct: 199 NALKEHGYQGIRKGDLVIHAMDAFAGAIGISDSDGKATPVYSVCLPHNKQKIDVYFYAYY 258 Query: 129 LSIDVTQRIEAICEGATMSH---ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L + + + +PIPP EQ I + + +T +ID +I Sbjct: 259 LRNLALSGFISSLAKGIRERSTDFRYADFAELLLPIPPYLEQQKIADYLDKQTSKIDQVI 318 Query: 186 TERIRFIELLKEKKQALVSYIV 207 + IE LKE K L++ +V Sbjct: 319 ALKTAHIEKLKEYKSVLINDVV 340 >gi|296330135|ref|ZP_06872617.1| restriction modification system DNA specificity domain protein [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305673379|ref|YP_003865051.1| Type I restriction modification system DNA specificity domain protein (HsdS) [Bacillus subtilis subsp. spizizenii str. W23] gi|296152724|gb|EFG93591.1| restriction modification system DNA specificity domain protein [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305411623|gb|ADM36742.1| Type I restriction modification system DNA specificity domain protein (HsdS) [Bacillus subtilis subsp. spizizenii str. W23] Length = 433 Score = 179 bits (455), Expect = 5e-43, Method: Composition-based stats. Identities = 96/430 (22%), Positives = 179/430 (41%), Gaps = 26/430 (6%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFT-----KLNTGRTSES-------GKDIIYIGLEDVESG 59 +S +Q +GAIP HW + +K + G + + G E++ Sbjct: 5 ESNIQGVGAIPSHWNIKKLKHCLLPGSEGIKIGPFGSALKSEILITEGYKVYGQENLIKD 64 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPK 117 + + + + + +L +G + ++ GI + + ++ Sbjct: 65 DFTLGHRFISEEKFNELKSYEIIENDVLISMMGTVGKCKVVPSIIEKGIMDSHLIRIRFN 124 Query: 118 DVLP---ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + + SI + +I+ +G+ MS + I N+ + +PP+ EQ +I + I Sbjct: 125 ESIILPEFAAYLIQDSIYIKVQIDLNSKGSIMSGLNSSIIKNLKLILPPIEEQRIILKYI 184 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 + +++ L + I LL E++Q++++ VTKGLNP+VKMK+SGIEW+G +P+HW++ Sbjct: 185 SRKNMQLYQLSNSKNILINLLNEQRQSIITEAVTKGLNPNVKMKNSGIEWIGEIPEHWDM 244 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 K L+ K L G + +K+ Y + D I+ Sbjct: 245 KKVKYTFNNLDYKRIPLSSE-----ERGKMTEKVYDYYGASGVIDKVDYYLFDETLILIG 299 Query: 295 FIDLQNDKRSLR-SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 RS + + + + +KP D Y L+ S D Sbjct: 300 EDGANLFSRSTPLAFLARGKYWVNNHAHILKPKNGDIDYFVNLLESIDYSIYI---SGSA 356 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + L E + + + +PPI+EQ +I ++ ++ ++ I LKE R S I Sbjct: 357 QPKLTQEALGNITLPLPPIEEQSEIGELVKNVLIEHKEIISTLKNQIEKLKEYRQSLIYE 416 Query: 414 AVTGQIDLRG 423 AVTG+ID+R Sbjct: 417 AVTGKIDVRD 426 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 53/209 (25%), Positives = 88/209 (42%), Gaps = 16/209 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 + K+SG++WIG IP+HW + +K + + E+ T K G Sbjct: 226 KMKNSGIEWIGEIPEHWDMKKVKYTFNNLDYKRIP-------LSSEERGKMTEKVYDYYG 278 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRK-----AIIADFDGICSTQFLVLQPKDVLPEL 123 S D +F + IL G+ G L A +A + +L+PK+ + Sbjct: 279 ASGVIDKVDYYLFDETLILIGEDGANLFSRSTPLAFLARGKYWVNNHAHILKPKNGDIDY 338 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L SID + G+ + +GNI +P+PP+ EQ I E + + Sbjct: 339 FVNLLESIDYSI----YISGSAQPKLTQEALGNITLPLPPIEEQSEIGELVKNVLIEHKE 394 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLN 212 +I+ IE LKE +Q+L+ VT ++ Sbjct: 395 IISTLKNQIEKLKEYRQSLIYEAVTGKID 423 >gi|124008338|ref|ZP_01693033.1| type I restriction-modification system specificity subunit [Microscilla marina ATCC 23134] gi|123986127|gb|EAY25963.1| type I restriction-modification system specificity subunit [Microscilla marina ATCC 23134] Length = 424 Score = 179 bits (455), Expect = 5e-43, Method: Composition-based stats. Identities = 65/433 (15%), Positives = 151/433 (34%), Gaps = 36/433 (8%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTG 61 YKDS +G IP+ W+VV + K++ G T K I ++ D+ + Sbjct: 6 YKDSP---LGEIPEDWEVVKLGDIAKVSAGGTPLRSKQEEYFTNGHIPWVKTLDLNNSII 62 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDV 119 + + S ++ ++ K +L G + + + + + + L K Sbjct: 63 EDTEEKITSLALKETSCNLLPKNTVLVAMYGGFNQIGRTGLLKIEATTNQAISALNIKSD 122 Query: 120 LPELLQGWLLSIDVTQRIEAIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + + + K + + P+ IPPLAEQ I + + Sbjct: 123 NIYPEFILAWLNAKVEVWKKFAASSRKDPNITKKDVEHFPIVIPPLAEQQEIADIL---- 178 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +D I + ++ K+ L+ + T+GL K S +G +P+ WEV Sbjct: 179 STVDEKIATIDERLAHTQQLKKGLMQRLFTRGLG-HTSFKASP---LGEIPESWEVVKLG 234 Query: 239 ALVTELNRKN-------TKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDP 288 + +I + ++ + + ++ Sbjct: 235 DIAKVSAGGTPLRSKQEEYFTNGHIPWVKTLDLNNSIIEDTEEKITSLALKETSCNLLPK 294 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 ++ N ++ + + +K I ++ + + +A Sbjct: 295 NTVLVAMYGGFNQIGRTGLLKIEATTNQAISALNIKSDNIYPEFILAWLNAKVEVWKKFA 354 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 S ++ +DV+ P+++PP+ EQ +I +++ ++++L EK + + Sbjct: 355 ASSRKDPNITKKDVEHFPIVIPPLAEQQEIADILGGVDEKLELLAEKK----EAYQGLKK 410 Query: 409 SFIAAAVTGQIDL 421 + +TG++ + Sbjct: 411 GLMQQLLTGKVRV 423 >gi|309390280|gb|ADO78160.1| restriction modification system DNA specificity domain protein [Halanaerobium praevalens DSM 2228] Length = 465 Score = 179 bits (454), Expect = 7e-43, Method: Composition-based stats. Identities = 97/459 (21%), Positives = 180/459 (39%), Gaps = 38/459 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK--LNTGRTSESG----KDIIYIGLE 54 M+ YK Y +Y+DSG++WI IPK+W + IK K ++ G + ++ ++ Sbjct: 2 MREYKRYEEYQDSGIEWIADIPKNWIISKIKYLVKEPVSDGPHETPDYVYDNGVPFLSVD 61 Query: 55 DVESGTGKYLP--KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQ 110 +++G + + S K IL GK + A I I S Sbjct: 62 SIQNGKLVFENCRQISVKDHKIYRNKSNPEKEDILLGKAASVGKVAKINVDFPFSIWSPL 121 Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 L+ + L+ + S + + + T + K I +I + P + EQ I Sbjct: 122 ALIKPNYKIESSYLEYSMKSSYFQIQTDLLSNSNTQKNLGMKDINDILVLKPSIEEQQKI 181 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG------------LNPDVKMK 218 + +T ID + ++ + I+ L++ K+++++ VTKG L +V+MK Sbjct: 182 ASFLDQKTAEIDEITNKKEKLIDQLEKYKKSVITDAVTKGKLGDKYLNEDGDLVDEVEMK 241 Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS-------------LSYGNII 265 DSGIEW+ VP +++ + R + + L ++ N + Sbjct: 242 DSGIEWIRDVPHFYDISKVKYIADIHGRIGYRGYTKDDLVDKGQGALTLGGKHINDRNQL 301 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +++ +V + I + + V Sbjct: 302 DLSDPTYISWDKYYESPEIMIEYNNLVVVQRGSIGKVAIIDKNI--GEATINPSLILVNN 359 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 I + Y + + S + + F + S + E + L + Q I N ++ Sbjct: 360 LEIKAKYFYYYLISNSVSEFFNLIVSSTAVPMISQEQLDNLYLPKIDKHSQNKIINYLDK 419 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +T ID L++K + SI KE + S I AVTG+IDLR Sbjct: 420 KTELIDNLIQKTKTSIQKYKEYKKSLIFEAVTGKIDLRD 458 >gi|291566232|dbj|BAI88504.1| type I restriction-modification system S subunit [Arthrospira platensis NIES-39] Length = 396 Score = 179 bits (454), Expect = 7e-43, Method: Composition-based stats. Identities = 93/406 (22%), Positives = 160/406 (39%), Gaps = 27/406 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W +K G ++ G K +G + Sbjct: 3 WLQAKLKYVAHFAYGDALPKDQE---------REGDFKVFGSNGAYDNYGRANTQ---AP 50 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I+ G+ G Y + T F + +LL ++ + A Sbjct: 51 VIIVGRKGSYGKVNWSDHPCFASDTTFFIDATTTHHHLRWLFYLLQTL---NLDQGTDEA 107 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + IPPL EQ I + ET +ID LI + R ++LL EK++AL++ Sbjct: 108 AVPGLSRDDAYAKKVFIPPLGEQKAIAHYLDKETAKIDQLIEAKKRLLQLLDEKRRALIT 167 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + VT+GLNPDV M+DSG+EW+G +P HW+ K + ++ + + Sbjct: 168 HTVTRGLNPDVPMRDSGVEWIGKIPKHWKCSKIKHHYEITLGKMLQNEPHSLEDVEVPYL 227 Query: 265 IQKLETRNMGLKPESYETYQ---------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + L V G+++ + ++ S++ + I Sbjct: 228 KSQHVQSDRILMDNELPQMWANPWEIANLNVIKGDLLVCEGGEIG-RSAIISSKPPDNCI 286 Query: 316 ITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIK 373 I +A V+P D +L +L+ + + E ++ + +PP+ Sbjct: 287 IQNALHLVRPKPTGDVNFLKYLLNHAISQRWLDVLCNKATIAHFTVEKFSQMSIELPPLS 346 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 EQ I N ++ ETA+I+ L + +I LL+ERR+S I AAVTGQI Sbjct: 347 EQKAIANYLDKETAKINQLRSAVRDTITLLQERRTSLITAAVTGQI 392 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 54/213 (25%), Positives = 99/213 (46%), Gaps = 11/213 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR------TSESGKDIIYIGLEDVESGTGKY 63 +DSGV+WIG IPKHWK IK ++ G+ S ++ Y+ + V+S Sbjct: 180 MRDSGVEWIGKIPKHWKCSKIKHHYEITLGKMLQNEPHSLEDVEVPYLKSQHVQSDRILM 239 Query: 64 LPKDGNSRQSDTSTVSI-FAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDV 119 + + ++ KG +L + G R AII + I +++PK Sbjct: 240 DNELPQMWANPWEIANLNVIKGDLLVCEGGEIGRSAIISSKPPDNCIIQNALHLVRPKPT 299 Query: 120 LPELLQGWLLSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 +LL+ + + ++ +C AT++H + + + +PPL+EQ I + ET Sbjct: 300 GDVNFLKYLLNHAISQRWLDVLCNKATIAHFTVEKFSQMSIELPPLSEQKAIANYLDKET 359 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 +I+ L + I LL+E++ +L++ VT + Sbjct: 360 AKINQLRSAVRDTITLLQERRTSLITAAVTGQI 392 >gi|331084240|ref|ZP_08333345.1| hypothetical protein HMPREF0992_02269 [Lachnospiraceae bacterium 6_1_63FAA] gi|330401775|gb|EGG81352.1| hypothetical protein HMPREF0992_02269 [Lachnospiraceae bacterium 6_1_63FAA] Length = 456 Score = 177 bits (449), Expect = 3e-42, Method: Composition-based stats. Identities = 97/433 (22%), Positives = 180/433 (41%), Gaps = 23/433 (5%) Query: 5 KAYPQYKDSGVQWIGAI--PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 K Y +YK+SG+ W I P W V K + N +++ + + L ++ Sbjct: 3 KGYEKYKESGIPW--EICEPTTWDCVRGKALFE-NPKYINKNNEYKNVLSLT-LKGVIRN 58 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKL---GPYLRKAIIADFDGICSTQFL--VLQPK 117 + T +F K +++ + + I GI S ++ VL+ K Sbjct: 59 NIENPNGLVPRSYDTYQLFEKDDLVFKLIDLENISTSRVGIVGEQGIMSPAYIRLVLRKK 118 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + ++I + + + +PP EQ I + + + Sbjct: 119 EKQNIKYYYYQYFSLYQRQIFNSLGAGVRQTLSARELLEQKIMVPPKPEQDKIVQFLEWK 178 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 T I+ I ++ + I+LL+E K ++ +VTKGL +VK K S +EW+G +P+HW+V Sbjct: 179 TSEINRFIHQKKKQIKLLEELKLTRINNLVTKGLTHNVKYKQSNVEWLGEIPEHWDVDYI 238 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 ++ ++LS++ I +K + N G +SY YQ V G+ +D Sbjct: 239 KQHFKVK-KRIAGKEGYDVLSITQQGIKKKDISSNEGQMAQSYANYQFVYSGDFAMNHMD 297 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL---CKVFYAMGSGL- 353 L + + G+ + Y + + + +R + + K+FY G G Sbjct: 298 LLTGYIDISK----QFGVTSPDYRVFNLSDSEHCFAPFYLRVFQIGYKRKIFYKFGKGAA 353 Query: 354 ---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 R L + VPPI EQ +I + +I+ ++ I + I L++E R+ Sbjct: 354 NQGRWRLPITAFYDYAIQVPPIDEQREIARQCDEVEKQINEMISGINKEITLVEELRTKL 413 Query: 411 IAAAVTGQIDLRG 423 I+ VTGQ+D+ Sbjct: 414 ISDVVTGQVDVSD 426 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 63/209 (30%), Positives = 104/209 (49%), Gaps = 4/209 (1%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 L K K+SGI W P W+ AL N N+LSL+ +I+ Sbjct: 2 LKGYEKYKESGIPWEICEPTTWDCVRGKALFENPKYINKNNEYKNVLSLTLKGVIRNNIE 61 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA---VKPHG 327 GL P SY+TYQ+ + ++VF+ IDL+N S R V E+GI++ AY+ K Sbjct: 62 NPNGLVPRSYDTYQLFEKDDLVFKLIDLENISTS-RVGIVGEQGIMSPAYIRLVLRKKEK 120 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + Y + S ++F ++G+G+RQ+L ++ ++VPP EQ I + +T+ Sbjct: 121 QNIKYYYYQYFSLYQRQIFNSLGAGVRQTLSARELLEQKIMVPPKPEQDKIVQFLEWKTS 180 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 I+ + + ++ I LL+E + + I VT Sbjct: 181 EINRFIHQKKKQIKLLEELKLTRINNLVT 209 >gi|111026979|ref|YP_708957.1| type I restriction-modification system specificity subunit [Rhodococcus jostii RHA1] gi|110825518|gb|ABH00799.1| type I restriction-modification system specificity subunit [Rhodococcus jostii RHA1] Length = 391 Score = 176 bits (446), Expect = 5e-42, Method: Composition-based stats. Identities = 88/415 (21%), Positives = 156/415 (37%), Gaps = 36/415 (8%) Query: 16 QWIGAI-PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ I P W V ++R G + VE G Y P G+ + Sbjct: 5 PWLPEILPSGWVVAQMRRIATFRNGADYKE-----------VEVTEGGY-PVYGSGGEFR 52 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ ++ +L+G+ G + +++ T F ++ P L + ++ Sbjct: 53 RASQYLYDGESVLFGRKGTIDKPLLVSGRFWTVDTMFFTELTSNIEPRYLHYYATTMPF- 111 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + +G +P+PP+ EQ I + + ET RIDTLI E+ R IEL Sbjct: 112 ---DYYSTSTALPSMTQGELGGHRIPLPPITEQGAIADFLDRETARIDTLIREQRRLIEL 168 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+E++ A+ V G+ W P K+ + Sbjct: 169 LRERRIAVAEGPVV------------GLSW--STPLRSVTALIQTGPFGSQLKSDEYETG 214 Query: 255 NILSLSYGNIIQKLETRNMGL----KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 ++ +++ + + S + G+++ +R+ Sbjct: 215 GTPVINPSHLVMGRIEPDERVAVSASKASELGRHALRAGDVIAARRGELGRCAVVRAENT 274 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLV 369 SA + ++ D +LA + S A +L + + L + + Sbjct: 275 GFLCGTGSALIRLRETVADPEFLALVFSSRRNRDSLSLASVGATMDNLNADIIATLRIPM 334 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 PP+ EQ I + T +ID L+ + E I L KERRS+ I AAVTGQID+R E Sbjct: 335 PPLPEQRRIVESVAEATTKIDTLITETESFIDLAKERRSALITAAVTGQIDVRDE 389 >gi|126666657|ref|ZP_01737635.1| type I restriction-modification system, S subunit [Marinobacter sp. ELB17] gi|126629045|gb|EAZ99664.1| type I restriction-modification system, S subunit [Marinobacter sp. ELB17] Length = 429 Score = 176 bits (446), Expect = 5e-42, Method: Composition-based stats. Identities = 101/422 (23%), Positives = 177/422 (41%), Gaps = 21/422 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGR------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P HW + + + G+ S++ + Y+ ++ + Sbjct: 7 VPSHWIKASVGNYCDVQLGKMLQSDPASQNDESKRYLRAINITKHGLDLSHDFSMWIKPQ 66 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAII-ADFDGICSTQFLVLQP---KDVLPELLQGWLLS 130 +G IL + G R A+ D + ++P +LPE + W Sbjct: 67 EMEKFRLQRGDILVSEGGDAGRTAVFDCDEEFYFQNAINRIRPAGNSTILPEFIYYWFTF 126 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V +E +C AT++H + + P+ +PPL Q I + + +T RID LI ++ Sbjct: 127 LKVAGYVEMVCNVATIAHFTAEKVKAAPLALPPLKTQHSIAQFLDEKTARIDGLIEKKCA 186 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN-- 248 ++ L EK+QAL++ +TKGL+P+ MK SG EW+G +P +WEVK + + + Sbjct: 187 LLDRLAEKRQALITRAITKGLDPNAIMKPSGTEWLGHIPANWEVKKLRRVRRYMTSGSRD 246 Query: 249 --TKLIESNILSLSYGNIIQKL------ETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + L N+ + ETR + L + T V G+I+ Sbjct: 247 WAAYYADEGDRFLRMTNVTGEGIELDLSETRYVNLDGATEGTRTSVREGDILITITAELG 306 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKF 359 +R A P +S +L + + F G G +Q L F Sbjct: 307 AVAVIRKEIEGAYINQHLALFRPSPELCESGFLVNFLSTDMARAQFMLSGQGGTKQGLGF 366 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 E V + + PP++EQ I N + + + + + ++ SI L E RS+ I AAVTGQ+ Sbjct: 367 EQVNNVIIGFPPLREQELIGNFCSEIRRQSESVEQPLKLSIDKLIEYRSAVITAAVTGQL 426 Query: 420 DL 421 ++ Sbjct: 427 EI 428 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 82/214 (38%), Gaps = 12/214 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYL 64 K SG +W+G IP +W+V ++R + T + + + + ++ + +V + Sbjct: 213 MKPSGTEWLGHIPANWEVKKLRRVRRYMTSGSRDWAAYYADEGDRFLRMTNVTGEGIELD 272 Query: 65 PKDGNSRQSDTS---TVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDV 119 + D + T + +G IL A+I + + +P Sbjct: 273 LSETRYVNLDGATEGTRTSVREGDILITITAELGAVAVIRKEIEGAYINQHLALFRPSPE 332 Query: 120 LPELLQ--GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L E +L + + +G T ++ + N+ + PPL EQ LI Sbjct: 333 LCESGFLVNFLSTDMARAQFMLSGQGGTKQGLGFEQVNNVIIGFPPLREQELIGNFCSEI 392 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 + +++ I+ L E + A+++ VT L Sbjct: 393 RRQSESVEQPLKLSIDKLIEYRSAVITAAVTGQL 426 >gi|322379476|ref|ZP_08053842.1| Restriction modification system DNA specificity domain [Helicobacter suis HS1] gi|322380457|ref|ZP_08054656.1| type I restriction-modification system specificity subunit [Helicobacter suis HS5] gi|321147102|gb|EFX41803.1| type I restriction-modification system specificity subunit [Helicobacter suis HS5] gi|321148083|gb|EFX42617.1| Restriction modification system DNA specificity domain [Helicobacter suis HS1] Length = 402 Score = 175 bits (444), Expect = 1e-41, Method: Composition-based stats. Identities = 99/402 (24%), Positives = 174/402 (43%), Gaps = 28/402 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G + K I TGKY P G++ + Sbjct: 11 KWVRLGEILSLEYGDSLPEYKRI-----------TGKY-PIMGSNGVVGYHNTFLIRSPA 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I+ G+ G + I T + V + + + L ++ + E + G Sbjct: 59 IIVGRKGSAGKVNYIDQDCYPIDTTYFVQLKTECSLKFIYYVLTNLQL----EHLKTGGG 114 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + + + I +P+PPL EQ I + + +I I ++ R + LLKE KQAL+S Sbjct: 115 VPGLNREHVYQILIPLPPLKEQHAIATFLDHKCAKIVACIAKKTRMLALLKEYKQALISK 174 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS-LSYGNI 264 I TKGLNP K SG+ W+G +P HW + P + + N + ILS + + Sbjct: 175 ITTKGLNPQEHFKPSGVAWLGDIPGHWGLIPLGRIFKIRDEINKDRAITLILSLVKDIGV 234 Query: 265 IQKLETRNMGLKPE-SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + E N+G K + YQ+V G++V ++ + + G+++ Y+ + Sbjct: 235 LPYSEKGNIGNKAKADLSQYQVVRSGDLVLNKMNAVIGSLGVSNYD----GLVSPIYLVL 290 Query: 324 KPHGIDSTYLAWL---MRSYDLCKVFYAMGSGLRQ---SLKFEDVKRLPVLVPPIKEQFD 377 + + + S L + G+ + S+ F K++ + VPP+KEQ Sbjct: 291 FIQNKNLHLMQYYASLFASKALQQSLGQYAYGIMKIRESIDFMSFKQMLLPVPPLKEQHA 350 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 I ++ A++D L+ K++ I LK+ +S+ I+ AV GQI Sbjct: 351 IAAFLDHRLAKLDTLITKLQTQIQDLKDYKSALISEAVLGQI 392 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 53/207 (25%), Positives = 96/207 (46%), Gaps = 9/207 (4%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD 67 +K SGV W+G IP HW ++P+ R K+ + +I ++D+ G Y K Sbjct: 184 EHFKPSGVAWLGDIPGHWGLIPLGRIFKIRDEINKDRAITLILSLVKDI--GVLPYSEKG 241 Query: 68 --GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 GN ++D S + G ++ K+ + ++++DG+ S +LVL ++ L+Q Sbjct: 242 NIGNKAKADLSQYQVVRSGDLVLNKMNAVIGSLGVSNYDGLVSPIYLVLFIQNKNLHLMQ 301 Query: 126 GWLLSIDVTQRIEAICE-----GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + +++ + D+ + +P+PPL EQ I + + Sbjct: 302 YYASLFASKALQQSLGQYAYGIMKIRESIDFMSFKQMLLPVPPLKEQHAIAAFLDHRLAK 361 Query: 181 IDTLITERIRFIELLKEKKQALVSYIV 207 +DTLIT+ I+ LK+ K AL+S V Sbjct: 362 LDTLITKLQTQIQDLKDYKSALISEAV 388 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 64/169 (37%), Gaps = 5/169 (2%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K + ILSL YG+ + + + ++ + K S Sbjct: 9 KVKWVRLGEILSLEYGDSLPEYKRITGKYPIMGSNGVVGYHNTFLIRSPAIIVGRKGSAG 68 Query: 307 SAQVMERGI--ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 +++ I + Y ++ +++ + L + G L E V + Sbjct: 69 KVNYIDQDCYPIDTTYFVQLKTECSLKFIYYVLTNLQLEHL---KTGGGVPGLNREHVYQ 125 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + +PP+KEQ I ++ + A+I + K + + LLKE + + I+ Sbjct: 126 ILIPLPPLKEQHAIATFLDHKCAKIVACIAKKTRMLALLKEYKQALISK 174 >gi|237756452|ref|ZP_04584989.1| type I restriction-modification system specificity subunit [Sulfurihydrogenibium yellowstonense SS-5] gi|237691382|gb|EEP60453.1| type I restriction-modification system specificity subunit [Sulfurihydrogenibium yellowstonense SS-5] Length = 428 Score = 175 bits (444), Expect = 1e-41, Method: Composition-based stats. Identities = 69/434 (15%), Positives = 158/434 (36%), Gaps = 33/434 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKY 63 +K++ IG IP+ W+V + ++ G+ + ++ ++ +V Sbjct: 7 FKETE---IGLIPEDWEVARLGEVFEVKQGKQLSAKENRDGKVLKPFLRTSNVLWNKIDL 63 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ---PKDVL 120 KG IL + G R A+ S Q + + KD + Sbjct: 64 SELSYMPFSESEFKNLKLKKGDILVCEGGDVGRTAVWDGQIDEISYQNHLHRLRSVKDSI 123 Query: 121 PELLQGWLLSIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + + + T+ + + P+P+PPL EQ I + + Sbjct: 124 NNYFFAYWMEYAITIKNLYHQNANKTTIPNLSSSRLKAFPIPLPPLEEQKAIADIL---- 179 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKG---LNPDVKMKDSGIEWVGLVPDHWEVK 235 + I + + I K+ K++++ ++ T G ++ K+K E +GL P+HWEV Sbjct: 180 STVQNAIEKTEKVINATKQLKKSMMKHLFTYGAVAVDEIDKVKLKESE-IGLTPEHWEVV 238 Query: 236 PFFALVTELN------RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 +V ++ R + +I + ++ + + + E + + Sbjct: 239 RLGEVVEKMKAGGTPKRSEKRFWGGSIPFILIEDLTKNNLYIEDAREYITEEGLENSNAW 298 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYA 348 + + L ++A + A + + P + + + ++ Sbjct: 299 IVPENSLLLSMYATIGKTAVNLIPVATNQAILGIIPKRDRLNVEFGAYLLKFHSKRLLSQ 358 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 ++++ V+ + +PP+ EQ I N++ ID ++ E+ V L+ Sbjct: 359 NIQTTQRNVNKGIVENFLIPLPPLDEQQKIANILTT----IDQKIQAEEKKKVALRSLFK 414 Query: 409 SFIAAAVTGQIDLR 422 + + +TG+I +R Sbjct: 415 TLLHQLMTGKIRVR 428 >gi|284052081|ref|ZP_06382291.1| restriction modification system DNA specificity subunit [Arthrospira platensis str. Paraca] gi|78773866|gb|ABB51216.1| type I RM system S subunit [Arthrospira platensis] Length = 392 Score = 175 bits (444), Expect = 1e-41, Method: Composition-based stats. Identities = 95/405 (23%), Positives = 166/405 (40%), Gaps = 22/405 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W +K G ++ G K +G + Sbjct: 3 WLQAKLKYVAHFAYGDALPKDQE---------REGDFKVFGSNGAYDNYGRANTQ---AP 50 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I+ G+ G Y + T F + +LL ++ + A Sbjct: 51 VIIVGRKGSYGKVNWSDHPCFASDTTFFIDATTTHHHLRWLFYLLQTL---NLDQGTDEA 107 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + IPPL EQ I + ET +ID LI + R + LL EK++AL++ Sbjct: 108 AVPGLSRDDAYAKKVFIPPLGEQKAIAHYLDIETAKIDQLIKAKKRLLALLDEKRRALIT 167 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI--ESNILSLSYG 262 + VT+GLNPDV M+DSG+EW+G +P HWE+ P ++ ++ ++ + E NI L G Sbjct: 168 HAVTRGLNPDVPMRDSGVEWIGEIPKHWEILPLRRILQTMDYGISESVGSEGNIAVLRMG 227 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL---QNDKRSLRSAQVMERGIITSA 319 ++ + + + + + I+ +++F + R+ + + Sbjct: 228 DVDEGEISYDNVGFVDDVDHDLILKANDLLFNRTNSLDKIGKVAIFRNNFLFPVSFASYL 287 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + YL +L+ S + + + + +L + + +PPI+EQ + Sbjct: 288 VRMRCNDSVIPEYLNYLLNSLPVLTWAKSNALPAIGQVNLNPNRYSYIKIPIPPIEEQLN 347 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 IT I T +I L E++I LL+ERR+S I AAVTGQI + Sbjct: 348 ITEYIQTNTKKIKKLCLSSEETIKLLQERRTSLITAAVTGQIKIT 392 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 53/213 (24%), Positives = 89/213 (41%), Gaps = 14/213 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFT---KLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 +DSGV+WIG IPKHW+++P++R + S +I + + DV+ G Y Sbjct: 180 MRDSGVEWIGEIPKHWEILPLRRILQTMDYGISESVGSEGNIAVLRMGDVDEGEISYDNV 239 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADFD----GICSTQFLVLQPKDV 119 D I +L+ + + AI + S + V Sbjct: 240 GFV---DDVDHDLILKANDLLFNRTNSLDKIGKVAIFRNNFLFPVSFASYLVRMRCNDSV 296 Query: 120 LPELLQGWLLSIDVTQRIEAIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 +PE L L S+ V ++ + + I +PIPP+ EQ+ I E I T Sbjct: 297 IPEYLNYLLNSLPVLTWAKSNALPAIGQVNLNPNRYSYIKIPIPPIEEQLNITEYIQTNT 356 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 +I L I+LL+E++ +L++ VT + Sbjct: 357 KKIKKLCLSSEETIKLLQERRTSLITAAVTGQI 389 >gi|225868036|ref|YP_002743984.1| type I restriction modification DNA specificity protein [Streptococcus equi subsp. zooepidemicus] gi|225701312|emb|CAW98328.1| type I restriction modification DNA specificity protein [Streptococcus equi subsp. zooepidemicus] Length = 415 Score = 175 bits (444), Expect = 1e-41, Method: Composition-based stats. Identities = 101/418 (24%), Positives = 178/418 (42%), Gaps = 15/418 (3%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKD 67 + KDSG+ WIG +P +W+VVPIK F +G++++ + + V + L Sbjct: 3 KMKDSGIDWIGEVPYNWRVVPIKSFLSKKKEILEKWTGENVLSLTMNGVV---IRNLENP 59 Query: 68 GNSRQSDTSTVSIFAKGQILYG--KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + KG ++ + R IA DG+ S + + + + Sbjct: 60 SGKMPTTFDGYQKIDKGSLILCLFDIDVTPRCVGIAYNDGVTSPAYSQYRIINGNLKFYY 119 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 LL +D + + S + G + + IPPL+EQ I + + + ID ++ Sbjct: 120 YLLLMMDNDKILLPYSR-TLRSTLTDEYFGAVKVVIPPLSEQEKIAQFLDKKIALIDDIV 178 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 T+ IE LK KQ+L++ IVTKGL+P VK+ SGIEWVG VP+ WEV + N Sbjct: 179 TDTKTSIEELKAYKQSLITEIVTKGLDPTVKLVSSGIEWVGNVPEGWEVVKIKNISQLRN 238 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K+ L+L + + Y Q+V ++VF + K ++ Sbjct: 239 EKDIYETGQKFLALEKMLSYRPGYIDLLTEVEGGY--QQVVKIDDVVFSKLRPYLAKVAI 296 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKR 364 + G T + I+ + + S + + + G+ + + + Sbjct: 297 SDFE----GFGTGELLVFHNIKINRKLFMYKLISEQILQPVRSSSYGVKMPRVNPDFIMN 352 Query: 365 LPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L + P + EQ I + ++ +TA+ID L+ + E I + + S I VTG+ + Sbjct: 353 LLISFPKSLYEQHIIADHLDQKTAQIDTLIVEKENLIREYETYKKSMIYEYVTGKKQV 410 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 56/203 (27%), Positives = 95/203 (46%), Gaps = 2/203 (0%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 KMKDSGI+W+G VP +W V P + +++ K N+LSL+ ++ + Sbjct: 1 MRKMKDSGIDWIGEVPYNWRVVPIKSFLSKKKEILEKWTGENVLSLTMNGVVIRNLENPS 60 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 G P +++ YQ +D G ++ D+ R + A G+ + AY + + + Sbjct: 61 GKMPTTFDGYQKIDKGSLILCLFDIDVTPRCVGIA--YNDGVTSPAYSQYRIINGNLKFY 118 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 +L+ D K+ LR +L E + V++PP+ EQ I ++ + A ID +V Sbjct: 119 YYLLLMMDNDKILLPYSRTLRSTLTDEYFGAVKVVIPPLSEQEKIAQFLDKKIALIDDIV 178 Query: 394 EKIEQSIVLLKERRSSFIAAAVT 416 + SI LK + S I VT Sbjct: 179 TDTKTSIEELKAYKQSLITEIVT 201 >gi|150017995|ref|YP_001310249.1| restriction modification system DNA specificity subunit [Clostridium beijerinckii NCIMB 8052] gi|149904460|gb|ABR35293.1| restriction modification system DNA specificity domain [Clostridium beijerinckii NCIMB 8052] Length = 469 Score = 174 bits (442), Expect = 2e-41, Method: Composition-based stats. Identities = 101/468 (21%), Positives = 180/468 (38%), Gaps = 44/468 (9%) Query: 1 MK-HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK-----LNTGRTSESGKDIIYIGLE 54 MK Y++ + KDSGV+WIG IP+ W+V IK G + K +I Sbjct: 1 MKFRYRSEEEMKDSGVKWIGKIPRDWEVSKIKYIKSPDKNSFVDGPFGSNLKSEHFIENG 60 Query: 55 DVESGTGKY---------LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII---AD 102 +V + K ++ +T S + I+ K+G + I D Sbjct: 61 EVYVIESNFATQGILKLDSLKKISTEHFETIKRSEVKENDIVIAKIGAQFGLSNILPRID 120 Query: 103 FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI--EAICEGATMSHADWKGIGNIPMP 160 + S L L + + + I + NI + Sbjct: 121 KKAVVSGNSLKLSVDKQKSNTQYIHYQLLHIKNNGTLDLIVSTTAQPAISLGDMNNINIV 180 Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN-------- 212 +P + Q I + + +T ++D++I+++ I++L+E K++L+S VT + Sbjct: 181 LPNVQRQDKIVKFLNEKTAQVDSIISKKEALIQILEEAKKSLISDAVTGKVKVVKTSDGY 240 Query: 213 -----PDVKMKDSGIEWVGLVPDHWEVKPFFA--LVTELNRKNTKLIESNILSLSYGNII 265 +MKDSG++W+G VP W+VK + K+ +SYG++ Sbjct: 241 ELVERKKEEMKDSGVKWLGDVPKEWDVKRLRFLGNLQNGISKSGDEFGFGYPFVSYGDVY 300 Query: 266 QKLETRN--MGLKPESYETYQI--VDPGEIVFRFIDLQNDKRSLRSA--QVMERGIITSA 319 + + GL S +I V G++ F D+ S + Sbjct: 301 KNISIPKFVNGLVNSSLNDRRIYSVLEGDVFFTRTSETVDEIGFASTCLNTITDATFAGF 360 Query: 320 YMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQF 376 + +P + + + R K F + R SL + L V +P KEQ Sbjct: 361 LIRFRPFKDKLYKGFSKYYFRCDLNRKFFVKEMNLVTRASLSQNLLNNLAVALPLYKEQQ 420 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +I + + + I+ + K+ I LKE + + I+ AVTG+I + E Sbjct: 421 EIYSALEFKVGGIECSINKLRCQIQKLKEAKQALISEAVTGKIKILDE 468 >gi|327184406|gb|AEA32851.1| type i site-specific restriction-modification system, s subunit [Lactobacillus amylovorus GRL 1118] Length = 425 Score = 174 bits (440), Expect = 3e-41, Method: Composition-based stats. Identities = 86/422 (20%), Positives = 174/422 (41%), Gaps = 14/422 (3%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 KDS ++W+G IP +WKV P+ + GK+ + L + K + G Sbjct: 7 KDSNIEWLGKIPSNWKVKPL-YLFFFERKNKNNKGKEKNLLSLSYGKIIQ-KDINSTGGL 64 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQG 126 +T ++ G I+ K + GI ++ ++ L P Sbjct: 65 LPQSYNTYNVIEAGDIIIRPTDLQNDKHSLRTAFSKEHGIITSAYIDLAPLKDTNSEYFH 124 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 ++L +++ ++ + +PIP EQ I + + +ID L Sbjct: 125 YVLHAYDIEKVFYNMGNGVRQGLNYSEFSKLKLPIPSSEEQKEIVNFLNNQVSQIDKLSK 184 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + I L+E ++++++ VTKGLNP+V MKDSGI W+G +P +W++ L L++ Sbjct: 185 KIQQEIIDLEEYRKSIITKAVTKGLNPNVPMKDSGIPWIGKIPQNWKIIKGKYLFRLLSK 244 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K + ++ + YQ +D G++V +D + Sbjct: 245 PVKKDDQVITCFRDGQVTLRVKRRTTGFTMSDQEIGYQGIDKGDLVVHGMDGFAGAIGIS 304 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS---LKFEDVK 363 ++ ++ V D Y+ + +R+ VF A+ G+R ++ + Sbjct: 305 DSRGKGSPVLN-----VLDSNQDKKYMMYCLRATAQLGVFQALAKGIRVRSADTRWPTLA 359 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 L +PP EQ ++ N ++ + +++ +++ + + L + + S I VTG+ + Sbjct: 360 NLKYAIPPQSEQANVVNYLSNNSYKLNAIIQAKKDLVEKLNQYKQSIIYEYVTGKKQVPT 419 Query: 424 ES 425 E Sbjct: 420 EE 421 Score = 144 bits (363), Expect = 3e-32, Method: Composition-based stats. Identities = 93/201 (46%), Positives = 132/201 (65%), Gaps = 1/201 (0%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 +KDS IEW+G +P +W+VKP + E KN K E N+LSLSYG IIQK GL Sbjct: 6 LKDSNIEWLGKIPSNWKVKPLYLFFFERKNKNNKGKEKNLLSLSYGKIIQKDINSTGGLL 65 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAW 335 P+SY TY +++ G+I+ R DLQNDK SLR+A E GIITSAY+ + P +S Y + Sbjct: 66 PQSYNTYNVIEAGDIIIRPTDLQNDKHSLRTAFSKEHGIITSAYIDLAPLKDTNSEYFHY 125 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++ +YD+ KVFY MG+G+RQ L + + +L + +P +EQ +I N +N + ++ID L +K Sbjct: 126 VLHAYDIEKVFYNMGNGVRQGLNYSEFSKLKLPIPSSEEQKEIVNFLNNQVSQIDKLSKK 185 Query: 396 IEQSIVLLKERRSSFIAAAVT 416 I+Q I+ L+E R S I AVT Sbjct: 186 IQQEIIDLEEYRKSIITKAVT 206 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 3/199 (1%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 KDSG+ WIG IP++WK++ K +L + D + D + G Sbjct: 215 MKDSGIPWIGKIPQNWKIIKGKYLFRLLSK--PVKKDDQVITCFRDGQVTLRVKRRTTGF 272 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + KG ++ + + I+D G S VL ++ Sbjct: 273 TMSDQEIGYQGIDKGDLVVHGMDGFAGAIGISDSRGKGSPVLNVLDSNQDKKYMMYCLRA 332 Query: 130 SIDVTQRIEAICEGATMS-HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + S W + N+ IPP +EQ + + + +++ +I + Sbjct: 333 TAQLGVFQALAKGIRVRSADTRWPTLANLKYAIPPQSEQANVVNYLSNNSYKLNAIIQAK 392 Query: 189 IRFIELLKEKKQALVSYIV 207 +E L + KQ+++ V Sbjct: 393 KDLVEKLNQYKQSIIYEYV 411 >gi|260552461|ref|ZP_05825837.1| type I restriction-modification system specificity determinant [Acinetobacter sp. RUH2624] gi|260405268|gb|EEW98764.1| type I restriction-modification system specificity determinant [Acinetobacter sp. RUH2624] Length = 461 Score = 174 bits (440), Expect = 3e-41, Method: Composition-based stats. Identities = 101/448 (22%), Positives = 177/448 (39%), Gaps = 34/448 (7%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFT-----KLNTGRTSES-------GKDIIYIGLE 54 Y ++K S + +P HW+ + + G + I I L Sbjct: 4 YSEFKYSDY-FKTELPSHWQEKRLGFLSMQTKNAFVDGPFGSDLKSDDYLDEGIPLIQLN 62 Query: 55 DVESGTGKYLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAI-----IADFDGICS 108 ++ G S+ + I+ K+ + +A ++ + Sbjct: 63 NIRDGKHILRNMKFISQNKKIDLIRHLALPQDIVIAKMAEPVARAAVVSDEYDEYVIVAD 122 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 L + V L + S V + E + G T + + + +P P L+EQV Sbjct: 123 CVKLSPDLELVDLNFLIWAINSDCVRENAELVSTGTTRIRINLGELKKLKVPYPSLSEQV 182 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 IR+ + ET +IDTLI ++ I LLKEK+QA++S+ VTKGLNP+V MKDSG+EW+G V Sbjct: 183 KIRQYLDHETAKIDTLIAKQEELIALLKEKRQAVISHAVTKGLNPNVPMKDSGVEWLGEV 242 Query: 229 PDHWEVKPFFALVTELNRKNTK-----------LIESNILSLSYGNIIQKLETRNMGLKP 277 P+HW V F + + + + ++ + L + L Sbjct: 243 PEHWTVSKFGYISQVVRGGSPRPAGDPALFNGDYSPWVTVAEITKDDELYLTSTETFLTK 302 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + E ++ G ++ + S + + ID Y + + Sbjct: 303 KGSEQCRVFQSGTLLLSNSGATLGVPKILSI----NANANDGVVGFEDLKIDIEYAYFYL 358 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + +L + VK +P+ +PP E I I + L+ E Sbjct: 359 SILTNDLRERVKQGSGQPNLNTDIVKAIPIAIPPENEIKKIVVDIKKKIDHFSKLMGSAE 418 Query: 398 QSIVLLKERRSSFIAAAVTGQIDLRGES 425 ++I L++ERR++ I+A VTG+ID+R Sbjct: 419 KAIQLMQERRTALISAVVTGKIDVRNWQ 446 >gi|296328649|ref|ZP_06871166.1| type I restriction-modification system specificity subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] gi|296154248|gb|EFG95049.1| type I restriction-modification system specificity subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] Length = 455 Score = 173 bits (438), Expect = 5e-41, Method: Composition-based stats. Identities = 73/440 (16%), Positives = 140/440 (31%), Gaps = 39/440 (8%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLP 65 Y YK++ + W+G IP HW+ I + + + + K+++ + S + Sbjct: 4 YDSYKETDIPWLGEIPSHWETKKIGKIFDIRKEKNSPVKTKEVLSLSSMYGVSLYSERKE 63 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 K GN + + ++ G IL + I+++ G S + LQ Sbjct: 64 KGGNKPKENLEAYNLCYPGDILVNSMNIVAGSVGISNYFGAISPVYYSLQNLSEKKYSKY 123 Query: 126 GWLLSIDVTQRIEAICE---------------GATMSHADWKGIGNIPMPIPPLAEQVLI 170 ++ W + + P PP+ EQ+ I Sbjct: 124 YLEYLFRNYNFQRSLVGLGKGIQMSETEDGRLFTVRMRISWDTLKSQEFPTPPIEEQIQI 183 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + + ID LI I+ L+ KQ + I + Sbjct: 184 ANYLDWKINEIDRLILIEKEQIKELENLKQKYIDE----------------IYQNIKTKN 227 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----V 286 + L +S+ ++ YG+I K + E + Sbjct: 228 FISLSKIGTFFKGGGFSRENLSDSDYGAILYGDIYTKYNYFFEECISKIDENAYFNSKCI 287 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGII--TSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 D ++F + A V + I ++A+ + ++ Sbjct: 288 DGNVVLFTGSGETKEDIGKNVAYVGTKKIALGGDIIALKPNKNFSPKFIAYFSNTSNIKA 347 Query: 345 VFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + +G + + +K + + I+EQ DI I+ + L+ IE I Sbjct: 348 FKHMKSTGDIIVHITLGAIKSIKIPFISIEEQKDIVKKIDEYILNLKNLIALIEDKIKYF 407 Query: 404 KERRSSFIAAAVTGQIDLRG 423 + S IA VTG+ID+R Sbjct: 408 LSLKQSLIAEVVTGKIDVRN 427 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 89/217 (41%), Gaps = 23/217 (10%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 +N K++ I W+G +P HWE K + KN+ + +LSLS + Sbjct: 1 MNNYDSYKETDIPWLGEIPSHWETKKIGKIFDIRKEKNSPVKTKEVLSLSSMYGVSLYSE 60 Query: 271 RNM---GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-- 325 R E+ E Y + PG+I+ +++ + + G I+ Y +++ Sbjct: 61 RKEKGGNKPKENLEAYNLCYPGDILVNSMNIVAGSVGISNY----FGAISPVYYSLQNLS 116 Query: 326 -HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-------------QSLKFEDVKRLPVLVPP 371 YL +L R+Y+ + +G G++ + ++ +K PP Sbjct: 117 EKKYSKYYLEYLFRNYNFQRSLVGLGKGIQMSETEDGRLFTVRMRISWDTLKSQEFPTPP 176 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 I+EQ I N ++ + ID L+ ++ I L+ + Sbjct: 177 IEEQIQIANYLDWKINEIDRLILIEKEQIKELENLKQ 213 >gi|259048036|ref|ZP_05738437.1| conserved hypothetical protein [Granulicatella adiacens ATCC 49175] gi|259035326|gb|EEW36581.1| conserved hypothetical protein [Granulicatella adiacens ATCC 49175] Length = 459 Score = 172 bits (436), Expect = 8e-41, Method: Composition-based stats. Identities = 77/438 (17%), Positives = 167/438 (38%), Gaps = 29/438 (6%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYL 64 Y +Y +S + W +IP+HW V I + ++ + S K+I+ + + S Sbjct: 3 RYEKYSNSEITWSESIPEHWDVKRIAKVFEIRKEKNSPIKTKEILSLSAKYGVSLYTDKK 62 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 K GN + D ++ ++ G IL + I+++ G S + L Sbjct: 63 EKGGNKPKEDLTSYNLCYPGDILVNCMNIVAGSVGISNYLGAVSPVYYPLVNISQENNNT 122 Query: 125 QGWLLSIDVTQRIEAICE---------------GATMSHADWKGIGNIPMPIPPLAEQVL 169 + ++ W + +PIPP+ EQ Sbjct: 123 RYMEYVFRNYNFQRSLVGLGKGIQMSETDAGRLNTVRMRISWDILKTQLLPIPPINEQKQ 182 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + + ID LI I+ +++ + ++ + + K+ ++ Sbjct: 183 IANYLDWKINEIDRLIEINKEKIKCIRKYIISSHEKLILQNSD----FKEWIVKDNIYNF 238 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + K + +N L +S+I+ S + +GL + YQ V+ G Sbjct: 239 KNKNFKIRKLKSILVKIENDALPDSDIIICSNSGKSFVRGDKKIGLYSDDINMYQNVNYG 298 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+ +D + + S + V D Y+ + +R K++ Sbjct: 299 QIMIHGMDTWHGAICISKYSGR-----CSRVVHVCETSEDKMYVYYYLRLLAFLKMYKPF 353 Query: 350 GSGLRQSLK----FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 +G+RQ+ ++ + ++ +++P I++Q I + + + ++++I + I +L Sbjct: 354 SNGVRQNTSDFRSWDRLGQVNIILPAIEQQHKIADKLTKLINNSEKMIDEIMKEIDMLGN 413 Query: 406 RRSSFIAAAVTGQIDLRG 423 + S I+ VTG+ID+R Sbjct: 414 LKQSLISEVVTGKIDVRN 431 >gi|110681177|ref|YP_684184.1| type I restriction-mod [Roseobacter denitrificans OCh 114] gi|109457293|gb|ABG33498.1| type I restriction-mod [Roseobacter denitrificans OCh 114] Length = 414 Score = 172 bits (436), Expect = 1e-40, Method: Composition-based stats. Identities = 94/411 (22%), Positives = 175/411 (42%), Gaps = 20/411 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG--NSRQSDTSTVSIFA 82 WK P K + + + + ++ G Y G + + Sbjct: 10 WKEYPFWAVAKPKSVSNASAESLLSV----YLDRGVIPYSEGGGLVHKPAESLEKYQLVE 65 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV----TQRIE 138 G ++ + ++ + GI S + + + + + L + Sbjct: 66 PGDLVLNNQQAWRGSLGVSTYRGIVSPAYRIFELNGEVVDTRFSHYLFRSRPYVEKIMLA 125 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ G W + + + +P ++ Q I E + ET RID LI ++ RFI LLKEK Sbjct: 126 SLSVGDIQRQVKWPLLRVLLLRVPNISTQSKIAEYLDCETARIDGLIEKKTRFIALLKEK 185 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP----FFALVTELNRKNTKLIES 254 + A++++ VTKG++ V MK SG +W+ +P HW V P F L + Sbjct: 186 RIAVITHAVTKGIDAAVVMKPSGEDWLSDIPAHWTVVPPTALFTESKERAREGTQMLSAT 245 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + ++LE R + + + + V+ G+ V + L A+ + Sbjct: 246 QKYGVIPLAEFERLEQRQVTMALVHLDKRKHVEVGDFVISMRSMDG---GLERARAVGNV 302 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPI 372 + + + PH + +L++S + S +R Q + F +++ + P+ Sbjct: 303 RSSYSVLKCGPHVE-GRFYGYLLKSGLYIQALRLTSSFIRDGQDMNFSHFRKVKLPKLPV 361 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 EQ I + I+ +TARID L+ K ++SI LL+E+R++ I AAVTG+ID+R Sbjct: 362 AEQAAIADHIDTQTARIDSLITKTDRSIALLREKRAALITAAVTGKIDMRH 412 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 48/215 (22%), Positives = 78/215 (36%), Gaps = 13/215 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY 63 K SG W+ IP HW VVP + R E + I L + E + Sbjct: 204 MKPSGEDWLSDIPAHWTVVPPTALFTESKERAREGTQMLSATQKYGVIPLAEFE----RL 259 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + G + A G + + VL+ + Sbjct: 260 EQRQVTMALVHLDKRKHVEVGDFVISMRSMDGG-LERARAVGNVRSSYSVLKCGPHVEGR 318 Query: 124 LQGWLLSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 G+LL + + + ++ + +P P+AEQ I + I +T RI Sbjct: 319 FYGYLLKSGLYIQALRLTSSFIRDGQDMNFSHFRKVKLPKLPVAEQAAIADHIDTQTARI 378 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 D+LIT+ R I LL+EK+ AL++ VT ++ Sbjct: 379 DSLITKTDRSIALLREKRAALITAAVTGKIDMRHM 413 >gi|239995433|ref|ZP_04715957.1| restriction endonuclease S subunits-like protein [Alteromonas macleodii ATCC 27126] Length = 407 Score = 172 bits (435), Expect = 1e-40, Method: Composition-based stats. Identities = 82/399 (20%), Positives = 167/399 (41%), Gaps = 24/399 (6%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 + ++ + DV + + +S Q+ G + I + Sbjct: 1 NGEYAWVRIADVTASNSYLHHTTQKMSKIGSSLSVKLEPNQLFLSIAGTVGKPCI--NKI 58 Query: 105 GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 +C V P +P ++ + + Q + + + T + + +G+I + +P Sbjct: 59 KVCIHDGFVYFPDLSIPHKFLYYVFAGE--QAYKGLGKMGTQLNLNTDTVGSIKVALPKD 116 Query: 165 AEQVL-IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE 223 ++ I + + ET +IDTLI ++ + I+LLKEK+QA++S+ VTKGLNP MKDSG+E Sbjct: 117 EIEIQGIIDFLDHETAKIDTLIEKQQQLIKLLKEKRQAVISHAVTKGLNPYAPMKDSGVE 176 Query: 224 WVGLVPDHWEVK-----------PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 W+G VP+HW + + + + I Sbjct: 177 WLGEVPEHWSPATPIKYLSSLKGRLGWQGLKADEYKDDGPHVVSSAHFNNHEINWGMCPR 236 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP------H 326 + + ++ ++ G+I+ K + + + + S + +P Sbjct: 237 VSEERYELDSNIQLESGDILLMKDGAAMGKLAYVD-DLPGKACLNSHLLLFRPLLRDDIK 295 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + ++ +LM++ G+G + + + +++P +EQ I ++ + Sbjct: 296 TFHTKFMFYLMQTEHFQGFIRNNGTGATFLGISQQAIGNHRLILPDYEEQLSIAKFLDEQ 355 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +++ L K Q + LL ERR++ I+AAVTG+ID+R Sbjct: 356 VSKLSALENKKNQMMALLFERRAALISAAVTGKIDVRNW 394 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 47/225 (20%), Positives = 87/225 (38%), Gaps = 18/225 (8%) Query: 6 AYPQYKDSGVQWIGAIPKHWK-VVPIKRFTKLNTG------RTSESGKDII-YIGLEDVE 57 Y KDSGV+W+G +P+HW PIK + L + E D + Sbjct: 166 PYAPMKDSGVEWLGEVPEHWSPATPIKYLSSLKGRLGWQGLKADEYKDDGPHVVSSAHFN 225 Query: 58 SGTGKYLPK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLV 113 + + + + + + G IL K G + K D ++ L+ Sbjct: 226 NHEINWGMCPRVSEERYELDSNIQLESGDILLMKDGAAMGKLAYVDDLPGKACLNSHLLL 285 Query: 114 LQP------KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 +P K + + + + I GAT + IGN + +P EQ Sbjct: 286 FRPLLRDDIKTFHTKFMFYLMQTEHFQGFIRNNGTGATFLGISQQAIGNHRLILPDYEEQ 345 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + I + + + ++ L ++ + + LL E++ AL+S VT ++ Sbjct: 346 LSIAKFLDEQVSKLSALENKKNQMMALLFERRAALISAAVTGKID 390 >gi|261419107|ref|YP_003252789.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y412MC61] gi|319765924|ref|YP_004131425.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y412MC52] gi|261375564|gb|ACX78307.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y412MC61] gi|317110790|gb|ADU93282.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y412MC52] Length = 477 Score = 172 bits (435), Expect = 1e-40, Method: Composition-based stats. Identities = 73/427 (17%), Positives = 133/427 (31%), Gaps = 29/427 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P +W V K +G T G DI +I ++ G + Sbjct: 26 EVPGNWVWVRSGHVAKWGSGGTPSRKRLEYYGGDIPWIKTGELNDGIITGSEETITEEGL 85 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+ IF KG I+ G + + I D + V QP + L + Sbjct: 86 QKSSAKIFPKGSIVIAMYGATIGRLGILGIDAATNQACAVGQPYEFLDSK-YMFYYFFAR 144 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + A+ +G + I + P +PPL EQ I +KI +ID E Sbjct: 145 RSDLVALGKGGAQPNISQTIIKDFPFALPPLNEQKRIADKIERLFAKIDEAKRLIEEVKE 204 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIE---------------WVGLVPDHWEVKPFF 238 +++++ ++ L + + S +E W VP +W Sbjct: 205 SIEQRRAVMLEKAFKGQLGTNDPSEKSILETSDDLSEKDVIPKEQWPYEVPGNWTWIKLK 264 Query: 239 ALVTELNRKNTKLIESNILSLSY------GNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + + L T + Y N ET + ++ G+IV Sbjct: 265 SCLKRLQYGYTATSSTLTEGPKYLRITDIQNDNVDWETVPYCKIDDKLLEKYKLNKGDIV 324 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 K L + ++ YL ++S K + G Sbjct: 325 IARTGATTGKSFLIDDMPFCSVFASYLIRLTMNENLNPYYLWNYLKSSMYWKQITIVKKG 384 Query: 353 -LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + + L V +PP+ EQ I ++ +++ + + L + S + Sbjct: 385 IAQPGANARIIGELIVPLPPVPEQKRIAEKLDNLLEKLENEKQLVLAVEEKLDLLKQSVL 444 Query: 412 AAAVTGQ 418 A G+ Sbjct: 445 QKAFRGE 451 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 66/204 (32%), Gaps = 12/204 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNT-----KLIESNILSLSYGNIIQKLETRNMGLKP 277 E VP +W + + + +I + G + + T + Sbjct: 22 EQPYEVPGNWVWVRSGHVAKWGSGGTPSRKRLEYYGGDIPWIKTGELNDGIITGSEETIT 81 Query: 278 ES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E + +I G IV + + + A +P+ + Sbjct: 82 EEGLQKSSAKIFPKGSIVIAMYGATIGRLGI----LGIDAATNQACAVGQPYEFLDSKYM 137 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + G + ++ +K P +PP+ EQ I + I A+ID Sbjct: 138 FYYFFARRSDLVALGKGGAQPNISQTIIKDFPFALPPLNEQKRIADKIERLFAKIDEAKR 197 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 IE+ +++RR+ + A GQ Sbjct: 198 LIEEVKESIEQRRAVMLEKAFKGQ 221 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 80/203 (39%), Gaps = 8/203 (3%) Query: 17 WIGAIPKHWKVVPIKRFTK-LNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQ 72 W +P +W + +K K L G T+ S + Y+ + D+++ + Sbjct: 250 WPYEVPGNWTWIKLKSCLKRLQYGYTATSSTLTEGPKYLRITDIQNDNVDWETVPYCKID 309 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC----STQFLVLQPKDVLPELLQGWL 128 KG I+ + G K+ + D C S + +++ P L +L Sbjct: 310 DKLLEKYKLNKGDIVIARTGATTGKSFLIDDMPFCSVFASYLIRLTMNENLNPYYLWNYL 369 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S ++I + +G A+ + IG + +P+PP+ EQ I EK+ +++ Sbjct: 370 KSSMYWKQITIVKKGIAQPGANARIIGELIVPLPPVPEQKRIAEKLDNLLEKLENEKQLV 429 Query: 189 IRFIELLKEKKQALVSYIVTKGL 211 + E L KQ+++ L Sbjct: 430 LAVEEKLDLLKQSVLQKAFRGEL 452 >gi|148825619|ref|YP_001290372.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae PittEE] gi|229845500|ref|ZP_04465629.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae 6P18H1] gi|148715779|gb|ABQ97989.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae PittEE] gi|229811603|gb|EEP47303.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae 6P18H1] Length = 358 Score = 172 bits (435), Expect = 1e-40, Method: Composition-based stats. Identities = 87/357 (24%), Positives = 155/357 (43%), Gaps = 19/357 (5%) Query: 83 KGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 KG+ L L + +++ D + S ++VL+ K ++ + +LL ++ Sbjct: 3 KGEFLINPLNLNYDLISLRIALSEIDVVVSAGYIVLKEKQIINKKYFSYLLHRYDVAYMK 62 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G ++ I + + IPPL+EQ I + + +T +ID + + I LLKE Sbjct: 63 LLGSGV-RQTINYGHISDSILVIPPLSEQQKIAQFLDDKTAKIDRAVDLAEKQIALLKEH 121 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP---FFALVTELNRKNTKLIESN 255 KQ L+ VT+GLNPDV +KDSG+EW+G VP+HW++K F L+ L + Sbjct: 122 KQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWDIKRFRNLFDFGKGLSITKENLQDEG 181 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYE--------TYQIVDPGEIVFRFIDLQNDKRSLRS 307 I ++YG + + + + +++ G+ VF + + Sbjct: 182 IPCVNYGEVHSRYGFEVIPERDALKCVDSKYLVFNNSMLNKGDFVFADTSEDIEGSGNFT 241 Query: 308 AQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKR 364 I + + Y+A+ S G+ S+ +K Sbjct: 242 YLNSSTRIFAGYHTVITRLKITAIHRYIAYYFDSLSFRNQIRNKVKGVKVFSITQSILKG 301 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 VL+P +KEQ I + ++ +TA+ID + I LKE +S I VTG++ + Sbjct: 302 TFVLLPNLKEQQQIADYLDTQTAKIDQAIALKTAHIEKLKEYKSVLINDVVTGKVRV 358 Score = 91.0 bits (224), Expect = 4e-16, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 83/212 (39%), Gaps = 15/212 (7%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTG-KYLP 65 KDSGV+WIG +P+HW + + G + + I + +V S G + +P Sbjct: 141 KDSGVEWIGQVPEHWDIKRFRNLFDFGKGLSITKENLQDEGIPCVNYGEVHSRYGFEVIP 200 Query: 66 KDGNSRQSDTS----TVSIFAKGQILYGKL-----GPYLRKAIIADFDGICSTQFLVLQP 116 + + D+ S+ KG ++ G + + ++ + Sbjct: 201 ERDALKCVDSKYLVFNNSMLNKGDFVFADTSEDIEGSGNFTYLNSSTRIFAGYHTVITRL 260 Query: 117 KDV-LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 K + + + S+ +I +G + + + +P L EQ I + + Sbjct: 261 KITAIHRYIAYYFDSLSFRNQIRNKVKGVKVFSITQSILKGTFVLLPNLKEQQQIADYLD 320 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIV 207 +T +ID I + IE LKE K L++ +V Sbjct: 321 TQTAKIDQAIALKTAHIEKLKEYKSVLINDVV 352 >gi|237756251|ref|ZP_04584811.1| type I restriction enzyme MjaXIP specificity protein [Sulfurihydrogenibium yellowstonense SS-5] gi|237691588|gb|EEP60636.1| type I restriction enzyme MjaXIP specificity protein [Sulfurihydrogenibium yellowstonense SS-5] Length = 421 Score = 170 bits (431), Expect = 3e-40, Method: Composition-based stats. Identities = 73/427 (17%), Positives = 158/427 (37%), Gaps = 33/427 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 ++K++ IG IP+ W+V+ + ++ G++ Y G ++ Sbjct: 11 KFKETE---IGLIPEDWEVMRLGEVAEITMGQSPPGDTYNTYGKGIPFLQGKAEFGNISP 67 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + T + I KG +L P IA+ D L K+ + E + Sbjct: 68 KHIKYTTKPLKIAKKGSVLISVRAPV-GDVNIANMDYCIGRGLASLNLKNGINE--FLFY 124 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + IE G+ + + + + +P+PPL EQ I + + + + Sbjct: 125 SLLFFKHLIEKESYGSVFKAINKENLARLKIPLPPLEEQKAITDIL----STVQNTTEKT 180 Query: 189 IRFIELLKEKKQALVSYIVTKG---LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + I K+ K++++ ++ T G ++ K+K E +GL+P+ WEV +V Sbjct: 181 EKVINATKQLKKSMMKHLFTYGAVAVDEIDKVKLKESE-IGLIPEDWEVVRLGDIVNFKI 239 Query: 246 RKNT------KLIESNILSLSYGNIIQKLETRNMGLKPE----SYETYQIVDPGEIVFRF 295 + +S ++ + E ++ G ++ F Sbjct: 240 GRTPPRKNKDYWTNGKYYWVSISDMKNPYINNTSEMVSEKAHKEIFKEKLTPAGTLLMSF 299 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 L II+ + K + + +L + + + D + G Sbjct: 300 KLTIGRTAILNVDAYHNEAIIS---IYPKENKVLKEFLFYYLPAVDYSNLQDKAIKG--N 354 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +L + ++P+ +PP+ EQ I N++ ID ++ E+ L+ + + + Sbjct: 355 TLNTSKLNKIPIPLPPLDEQQKIANILTT----IDQKIQAEEKKKEALQNLFKTLLQQLM 410 Query: 416 TGQIDLR 422 TG+I ++ Sbjct: 411 TGKIRVK 417 >gi|324991451|gb|EGC23384.1| restriction modification system DNA specificity subunit [Streptococcus sanguinis SK353] Length = 408 Score = 170 bits (430), Expect = 4e-40, Method: Composition-based stats. Identities = 83/418 (19%), Positives = 158/418 (37%), Gaps = 19/418 (4%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDG 68 K+SG+ WIG IP+ W+V + + + + K+++ + + + Sbjct: 4 MKESGIDWIGQIPEEWEVAKVNHIFEEHKQKNRGNKEKNLLSLSYGRIIRKSID---SSF 60 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRK----AIIADFDGICSTQFLVLQPKDVLPELL 124 T +I +G I+ K +A +GI ++ +L L+ K++ Sbjct: 61 GLLPESFDTYNIIQRGDIVLRLTDLQNDKRSLRVGLARENGIITSAYLTLRLKNLESNDS 120 Query: 125 QGWLLSIDVTQRI-EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + L G W I + + IPP EQ I + + + ++D Sbjct: 121 YMYYLLHTYDICKVFYNFGGGVRQGGTWSDIYKMELLIPPCNEQQKIADYLDKKIAQLDR 180 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 + I+ LK+ + +L+ VTKGL+ V MKDSGI+W+G VP+ W V Sbjct: 181 AKRLLEKQIQKLKDYRASLIYETVTKGLDKTVPMKDSGIDWIGQVPEGWGVSRLKYFFDI 240 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + E N + N + + ++ + T G+ V K Sbjct: 241 YAGGDI--DERNTVDEYSENHPYPVISNSLENEGILGYTNNFRFQGDCVTVTGRGDVGKA 298 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 R+ + + + +D + + + S + K L + + Sbjct: 299 VYRNIKFYP---VVRLLVCTPKIQVDCRFATYWINSAIIEK-----NQTAVSQLTIQMLG 350 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L P EQ I + ++ +T +ID L++ Q I + ++R + I VTG+ + Sbjct: 351 ELIFTNVPYVEQKKIADFLDKKTVQIDKLIQIKNQQIKNINKQRQTLIYDYVTGKRRV 408 Score = 136 bits (343), Expect = 6e-30, Method: Composition-based stats. Identities = 86/205 (41%), Positives = 128/205 (62%), Gaps = 2/205 (0%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 +MK+SGI+W+G +P+ WEV + E +KN E N+LSLSYG II+K + Sbjct: 1 MTRMKESGIDWIGQIPEEWEVAKVNHIFEEHKQKNRGNKEKNLLSLSYGRIIRKSIDSSF 60 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDST 331 GL PES++TY I+ G+IV R DLQNDKRSLR E GIITSAY+ ++ + + Sbjct: 61 GLLPESFDTYNIIQRGDIVLRLTDLQNDKRSLRVGLARENGIITSAYLTLRLKNLESNDS 120 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 Y+ +L+ +YD+CKVFY G G+RQ + D+ ++ +L+PP EQ I + ++ + A++D Sbjct: 121 YMYYLLHTYDICKVFYNFGGGVRQGGTWSDIYKMELLIPPCNEQQKIADYLDKKIAQLDR 180 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVT 416 +E+ I LK+ R+S I VT Sbjct: 181 AKRLLEKQIQKLKDYRASLIYETVT 205 >gi|34762432|ref|ZP_00143432.1| TYPE I RESTRICTION-MODIFICATION SYSTEM SPECIFICITY SUBUNIT [Fusobacterium nucleatum subsp. vincentii ATCC 49256] gi|27887900|gb|EAA24968.1| TYPE I RESTRICTION-MODIFICATION SYSTEM SPECIFICITY SUBUNIT [Fusobacterium nucleatum subsp. vincentii ATCC 49256] Length = 447 Score = 169 bits (429), Expect = 5e-40, Method: Composition-based stats. Identities = 70/426 (16%), Positives = 155/426 (36%), Gaps = 19/426 (4%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 Y YK + + W+G IP HW++ +KRF + + + ++ + + V+ + + Sbjct: 4 YEAYKKTDIPWLGKIPSHWEIKRVKRFFYIFKDISYKKNPVVLSLARDKVK---IRDIES 60 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLR-KAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + + + KG +L + Y I+++DG+ S ++ L+ + Sbjct: 61 NKGQLAENYNNYNSVKKGDLLLNPMDLYSGANCNISNYDGVISPAYINLRSNKDISVNFF 120 Query: 126 GWLLSIDVTQRIEAIC----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 ++ + T + + N +PIPP++EQ+ I + + I Sbjct: 121 DYIFKLQYTSLAFQSVGKGVSKYNRWTLSNETLLNYQLPIPPISEQIQIANYLDWKINEI 180 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 D LI I+ L+ KQ + +++ + +K GL + Sbjct: 181 DKLILIEKEQIKELENLKQKYIDKLISSISSEFKPLKSIFEFGKGLS-------ITKENL 233 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 E + E + + + + + + + + +F Sbjct: 234 GENGVRCISYGEIHNKFIFSFSSTNPNLKGLEKTEGITISKFAELKKNDFIFADTSEDLK 293 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSL 357 + + + + Y V + Y+A+ + S K G+ S+ Sbjct: 294 GCGNFTFLEDDVKRVYAGYHTVVAKPILTFNPRYVAYYLESNKWRKQIRMEVKGIKVYSI 353 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +K + +P I Q ++ I+ + L+ +++ I L+ + S IA VTG Sbjct: 354 TQAILKSSRLQLPEIDIQESVSKKIDAFVQYKNALISIMDEKISNLQALKQSLIAEVVTG 413 Query: 418 QIDLRG 423 +ID+R Sbjct: 414 KIDVRN 419 Score = 97.9 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 46/211 (21%), Positives = 90/211 (42%), Gaps = 9/211 (4%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 +N K + I W+G +P HWE+K + + +LSL+ + + Sbjct: 1 MNNYEAYKKTDIPWLGKIPSHWEIKRVKRFFYIF-KDISYKKNPVVLSLARDKVKIRDIE 59 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 N G E+Y Y V G+++ +DL + G+I+ AY+ ++ + S Sbjct: 60 SNKGQLAENYNNYNSVKKGDLLLNPMDLYS---GANCNISNYDGVISPAYINLRSNKDIS 116 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ++ + F ++G G+ R +L E + + +PPI EQ I N ++ + Sbjct: 117 VNFFDYIFKLQYTSLAFQSVGKGVSKYNRWTLSNETLLNYQLPIPPISEQIQIANYLDWK 176 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ID L+ ++ I L+ + +I ++ Sbjct: 177 INEIDKLILIEKEQIKELENLKQKYIDKLIS 207 >gi|237741778|ref|ZP_04572259.1| type I restriction-modification system specificity subunit [Fusobacterium sp. 4_1_13] gi|256845106|ref|ZP_05550564.1| type I restriction-modification system specificity subunit [Fusobacterium sp. 3_1_36A2] gi|294785606|ref|ZP_06750894.1| conserved hypothetical protein [Fusobacterium sp. 3_1_27] gi|229429426|gb|EEO39638.1| type I restriction-modification system specificity subunit [Fusobacterium sp. 4_1_13] gi|256718665|gb|EEU32220.1| type I restriction-modification system specificity subunit [Fusobacterium sp. 3_1_36A2] gi|294487320|gb|EFG34682.1| conserved hypothetical protein [Fusobacterium sp. 3_1_27] Length = 447 Score = 169 bits (428), Expect = 7e-40, Method: Composition-based stats. Identities = 70/426 (16%), Positives = 155/426 (36%), Gaps = 19/426 (4%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 Y YK + + W+G IP HW++ +KRF + + + ++ + + V+ + + Sbjct: 4 YEAYKKTDIPWLGKIPSHWEIKRVKRFFYIFKDISYKKNPVVLSLARDKVK---IRDIES 60 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLR-KAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + + + KG +L + Y I+++DG+ S ++ L+ + Sbjct: 61 NKGQLAENYNNYNSVKKGDLLLNPMDLYSGANCNISNYDGVISPAYINLRSNKDISVNFF 120 Query: 126 GWLLSIDVTQRIEAIC----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 ++ + T + + N +PIPP++EQ+ I + + I Sbjct: 121 DYIFKLQYTSLAFQSVGKGVSKYNRWTLSNETLLNYQLPIPPISEQIQIANYLDWKINEI 180 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 D LI I+ L+ KQ + +++ + +K GL + Sbjct: 181 DKLILIEKEQIKELENLKQKYIDKLISSISSEFKPLKSIFEFGKGLS-------ITKENL 233 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 E + E + + + + + + + + +F Sbjct: 234 GENGVRCISYGEIHNKFIFSFSSTNLNLKGLEKTEGITISKFAELKKNDFIFADTSEDLK 293 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSL 357 + + + + Y V + Y+A+ + S K G+ S+ Sbjct: 294 GCGNFTFLEDDVKRVYAGYHTVVAKPILTFNPRYVAYYLESNKWRKQIRMEVKGIKVYSI 353 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +K + +P I Q ++ I+ + L+ +++ I L+ + S IA VTG Sbjct: 354 TQAILKSSRLQLPEIDIQESVSKKIDAFVQYKNALISIMDEKISNLQALKQSLIAEVVTG 413 Query: 418 QIDLRG 423 +ID+R Sbjct: 414 KIDVRN 419 Score = 97.9 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 46/211 (21%), Positives = 90/211 (42%), Gaps = 9/211 (4%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 +N K + I W+G +P HWE+K + + +LSL+ + + Sbjct: 1 MNNYEAYKKTDIPWLGKIPSHWEIKRVKRFFYIF-KDISYKKNPVVLSLARDKVKIRDIE 59 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 N G E+Y Y V G+++ +DL + G+I+ AY+ ++ + S Sbjct: 60 SNKGQLAENYNNYNSVKKGDLLLNPMDLYS---GANCNISNYDGVISPAYINLRSNKDIS 116 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ++ + F ++G G+ R +L E + + +PPI EQ I N ++ + Sbjct: 117 VNFFDYIFKLQYTSLAFQSVGKGVSKYNRWTLSNETLLNYQLPIPPISEQIQIANYLDWK 176 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ID L+ ++ I L+ + +I ++ Sbjct: 177 INEIDKLILIEKEQIKELENLKQKYIDKLIS 207 >gi|310658568|ref|YP_003936289.1| restriction modification system DNA specificity domain [Clostridium sticklandii DSM 519] gi|308825346|emb|CBH21384.1| Restriction modification system DNA specificity domain [Clostridium sticklandii] Length = 405 Score = 169 bits (428), Expect = 7e-40, Method: Composition-based stats. Identities = 99/422 (23%), Positives = 179/422 (42%), Gaps = 29/422 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESG--TGKYLPK 66 + KDS + W+G + W +VP+K + TG+ +DV G G+Y Sbjct: 3 EMKDSELLWLGEYNETWDLVPLKHLVNITTGK-------------KDVNQGHPDGEYPFF 49 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + +S S F +L G +++ K++ P L+ Sbjct: 50 TCSMTPYRSSNYS-FDSEALLVAGNGMVGFTQYYNGKFEAYQRTYVLSDFKEIHPLYLKH 108 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 ++ + + G+ + + + + P + +Q I + + ID ++ Sbjct: 109 YITELLPKYLTDKSV-GSVIDFIKLGDLKSFGIVRPSITDQKKISSYLEQKVALIDNILE 167 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + IE K+ KQ+L++ VTKGLNPDVKMKD GIEW+G +P+ W+V + E+++ Sbjct: 168 KTKQSIEEYKKYKQSLITETVTKGLNPDVKMKDIGIEWIGEIPEQWKVLKL-KCIFEISK 226 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + + ++LS++ + K T N G Y YQ V + V +DL L Sbjct: 227 RISGELGHSVLSVTQNGLKIKDLTSNEGQLSSDYSKYQYVYKTDFVMNHMDLLTGWVDLS 286 Query: 307 SAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKF 359 G+ + Y K Y ++ + L ++FY +G G+ R L+ Sbjct: 287 PYD----GVTSPDYRVFKMKDGLKYSKEYYLYIFQVCYLNQIFYGLGQGISNLGRWRLQT 342 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + + VPPI EQ I + + I+ V+ E + L+ + S I VTG+ Sbjct: 343 DKFINFSLPVPPIDEQKKIAKFLQNKLGEIEKFVKTKESLLKELEAYKKSLIYEVVTGKK 402 Query: 420 DL 421 ++ Sbjct: 403 EI 404 >gi|237755834|ref|ZP_04584432.1| restriction modification system DNA specificity domain protein [Sulfurihydrogenibium yellowstonense SS-5] gi|237691999|gb|EEP61009.1| restriction modification system DNA specificity domain protein [Sulfurihydrogenibium yellowstonense SS-5] Length = 424 Score = 168 bits (426), Expect = 1e-39, Method: Composition-based stats. Identities = 83/433 (19%), Positives = 174/433 (40%), Gaps = 32/433 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 +K++ IG IP+ W+VV + ++ G++ Y G ++ Sbjct: 7 FKETE---IGLIPEDWEVVRLGEVAEITMGQSPPGDTYNTYGKGIPFLQGKAEFGNISPK 63 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + T + I KG +L P IAD D L K+ + E + Sbjct: 64 HIKYTTKPLKIAKKGSVLISVRAPV-GDVNIADMDYCIGRGLASLNLKNGINE--FLFYS 120 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + IE G+ + + + + + +P+PPL EQ I + + + I + Sbjct: 121 LLFFKHLIEKESYGSVFNAINKENLARLKIPLPPLEEQKAIADIL----STVQNAIEKAE 176 Query: 190 RFIELLKEKKQALVSYIVTKG---LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + I K+ K++++ ++ T G ++ K+K E +GL+P+HWEV +V Sbjct: 177 KVINATKQLKKSMMKHLFTYGAVVVDEIDKVKLKESE-IGLIPEHWEVVRLGEVVDLDRG 235 Query: 247 KNTKLIES--------NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF--RFI 296 + + E I + + ++ + + + +I+F Sbjct: 236 ISWRKFEEGNKDNGHLIISIPNIKDGYIDFNSKYNHYLIKHIPKNKQIQLNDILFVGSSG 295 Query: 297 DLQNDKRSLRSAQVMERGIITSAYM---AVKPHGIDSTYLAWLMRSYDL-CKVFYAMGSG 352 ++N R++ + GI ++++ VK + + +L ++ SY K + S Sbjct: 296 SIENVGRNVFIENLPFEGIGFASFVFRARVKVNTVIPKFLYFMANSYWFNYKDYVRRSSD 355 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + + + + K + + +PP+ EQ I N++ ID ++ E+ V L+ + + Sbjct: 356 GKYNFQLTEFKSIKIPLPPLDEQQKIANILTT----IDQKIQAEEKKKVALRSLFKTLLH 411 Query: 413 AAVTGQIDLRGES 425 +TG+I +R S Sbjct: 412 QLMTGKIRVRHPS 424 >gi|310778850|ref|YP_003967183.1| restriction modification system DNA specificity domain protein [Ilyobacter polytropus DSM 2926] gi|309748173|gb|ADO82835.1| restriction modification system DNA specificity domain protein [Ilyobacter polytropus DSM 2926] Length = 433 Score = 168 bits (426), Expect = 1e-39, Method: Composition-based stats. Identities = 88/435 (20%), Positives = 170/435 (39%), Gaps = 18/435 (4%) Query: 1 MKHYKAYPQYKDSGVQW---IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE 57 +K K+Y YK+ + W + IP+ W ++P + + ++++ + + Sbjct: 2 IKEKKSYLNYKN--IPWYEYVKEIPQDWNILPNIALFDERIKK-KNNNEELLSVTISKGI 58 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 K + D S + G I Y K+ + + + GI S ++VL+ K Sbjct: 59 IKQSDIENKK-DISNEDKSNYKLVKIGDIAYNKMRMWQGSVGYSQYRGIVSPAYIVLKSK 117 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEG---ATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + +L + + + + +P + Q I E + Sbjct: 118 LKINSKYFHYLYRTEYYSNYARRYSYGLCDDQLNLRYVDFKRMYSIVPHIEIQDKIVEYL 177 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 + + + I ++ + IELLKE+K+ +++ +VTKGLN VKM+DSG+EW+G VP HWE+ Sbjct: 178 ETKEKQSNKFIEKQQKMIELLKEQKKTIINEVVTKGLNTLVKMQDSGVEWLGKVPKHWEI 237 Query: 235 KPFFALVTELNRKNTKLIESNILS------LSYGNIIQKLETRNMGLKPESYETYQIVDP 288 K + +NR T + R E ++ Sbjct: 238 KKLKEMSDFVNRGTTPNYTEKSDYKVVNQATFSKGYFDESSIRFHKTYKIEKEKGKLKYK 297 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV--F 346 ++ K ++ + + + + + + + + Sbjct: 298 DILLASTGGGVLGKVAIFTEKEGVYLADSHVTIIRDSKKRFIPEYLYYFYYVNYNLIDGY 357 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + GS + L+ E ++++ + P +KEQ I N I + +ID + K E+ I L K+ Sbjct: 358 FGQGSTNQTELQREWLRQMYLPYPDLKEQKQIVNYIEKQNTKIDTTILKTEKEIELAKDY 417 Query: 407 RSSFIAAAVTGQIDL 421 S I VTGQI + Sbjct: 418 MESLIYNVVTGQICV 432 >gi|260428510|ref|ZP_05782489.1| type I restriction-modification system, S subunit [Citreicella sp. SE45] gi|260423002|gb|EEX16253.1| type I restriction-modification system, S subunit [Citreicella sp. SE45] Length = 426 Score = 168 bits (425), Expect = 2e-39, Method: Composition-based stats. Identities = 103/421 (24%), Positives = 159/421 (37%), Gaps = 23/421 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE---------SGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P H+ +PI K N G + S Y+ +V +G K S Sbjct: 7 VPSHYIKLPIIAVAKKNGGIFIDGDWIESKDLSDSGFRYLTTGNVGAGEFKDQGTGYISD 66 Query: 72 QSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQG 126 + + G IL +L + +A I G ++ + L Sbjct: 67 STFHRLRCTEVMPGDILVSRLNLPIGRACIVPDVGERMVTAVDNVIIRPSDEFDRRFLVF 126 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + ++ + + G TM +G + +PP EQ I + ET R+D LI Sbjct: 127 LFSAQHHSEMMANLARGTTMQRVSRSALGRARVYLPPFEEQTAIANYLDLETARLDGLIE 186 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 ++ RFIELLKEK A VT + M+ SGI+W +P W V+ L E+ R Sbjct: 187 KKGRFIELLKEKALAYSDRCVTGQTDSARDMRTSGIQWSPQLPAEWGVRRGKDLFREMAR 246 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 E ++ E YQ + G++V +D + Sbjct: 247 PVRSDDEIITAFRDGQVCLRSRRRTEGYTFAEKEVGYQRILKGDLVIHTMDAFAGAIGIS 306 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAW--LMRSYDLCKVFYAMGSGLR---QSLKFED 361 + G T Y P D + ++R + + +R +F Sbjct: 307 E----DNGKATGEYAVCTPKSPDIIPEYYALILRCMARRNYIFVLCPSVRERAPRFRFVR 362 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + VPP EQ I I T R L+ K E+SI LLKE+RS+ I AAVTG+ID+ Sbjct: 363 FAPVMLPVPPRAEQEQIVASIEEHTRRAKALIAKTERSIELLKEKRSALITAAVTGKIDV 422 Query: 422 R 422 R Sbjct: 423 R 423 Score = 69.8 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 48/207 (23%), Positives = 78/207 (37%), Gaps = 6/207 (2%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 + SG+QW +P W V K + D I D + +G Sbjct: 217 MRTSGIQWSPQLPAEWGVRRGKDLFREM--ARPVRSDDEIITAFRDGQVCLRSRRRTEGY 274 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGW 127 + KG ++ + + I++ +G + ++ V PK +PE Sbjct: 275 TFAEKEVGYQRILKGDLVIHTMDAFAGAIGISEDNGKATGEYAVCTPKSPDIIPEYYALI 334 Query: 128 LLSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L + I +C + + +P+PP AEQ I I T R LI Sbjct: 335 LRCMARRNYIFVLCPSVRERAPRFRFVRFAPVMLPVPPRAEQEQIVASIEEHTRRAKALI 394 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLN 212 + R IELLKEK+ AL++ VT ++ Sbjct: 395 AKTERSIELLKEKRSALITAAVTGKID 421 >gi|89075001|ref|ZP_01161446.1| Restriction modification system DNA specificity domain protein [Photobacterium sp. SKA34] gi|89049240|gb|EAR54804.1| Restriction modification system DNA specificity domain protein [Photobacterium sp. SKA34] Length = 402 Score = 168 bits (425), Expect = 2e-39, Method: Composition-based stats. Identities = 84/410 (20%), Positives = 167/410 (40%), Gaps = 24/410 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIY---IGLEDVESGTGKYLPKDGNSRQSDTST 77 +P W + TK+ G+ + IG E+V S TG+ S S Sbjct: 2 VPNGWVKTTFGKITKIGNGQVDPKVEPYSSMTHIGPENVVSNTGQITKLKSCSALGLISG 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQR 136 F + I+Y K+ P L K DF G+CS + +D L L ++L + Sbjct: 62 KYEFDENSIVYSKIRPNLNKVCRPDFKGVCSADMYPIWSEDNLDINYLYHYMLGPYFNRI 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 A+ M + + ++ + +PPL EQ I + + D I + I+ K Sbjct: 122 AIAMSMRTGMPKINRSDLNSLSIVLPPLPEQRKIAKIL----STWDRGIASTEKLIDASK 177 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 ++K+AL+ ++T K + E ++WE +L+ E +N + + Sbjct: 178 QQKKALMQQLLTG------KKRLVDPETGKAFEENWERTHLKSLLIEEKSRNKDNKITRV 231 Query: 257 LSL-SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 LS+ ++ + + + + E+ Y+IV G+ + L S G+ Sbjct: 232 LSVTNHSGFVLPEDQFSKRVASENISNYKIVKQGQFGYNPSRLN--VGSFACLNQFSEGV 289 Query: 316 ITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPI 372 ++ Y+ + YL++ M S++ + G +R+S+ F+ + P ++P + Sbjct: 290 LSPMYVVFSTNDSKLQRDYLSYWMDSHEAKQRIKNSTQGSVRESVGFDALCNFPFILPAL 349 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 EQ I +V+ +E +E + LK+ + + + +TG+ ++ Sbjct: 350 NEQQKIASVLTAADKE----IELLEAKLAHLKQEKKALMQQLLTGKRRVK 395 >gi|310639248|ref|YP_003944007.1| restriction endonuclease S subunit [Ketogulonicigenium vulgare Y25] gi|308752824|gb|ADO43968.1| putative restriction endonuclease S subunit [Ketogulonicigenium vulgare Y25] Length = 376 Score = 168 bits (425), Expect = 2e-39, Method: Composition-based stats. Identities = 96/375 (25%), Positives = 155/375 (41%), Gaps = 14/375 (3%) Query: 56 VESGTGKYLPKDGNSRQSDTSTV--SIFAKGQI-LYGKLGPYLRKAIIADFDGICSTQFL 112 + S + + +G L G+ G A S + Sbjct: 7 ITSDEIREADDYPVFGGNGLRGYTDRFNREGDFVLIGRQGALCGNINYAAGKFWASEHAI 66 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 V + WL + + + A + I N+ +P+PP + Q I Sbjct: 67 VADTQG---NAEVRWLGELLSFMNLNQYSQSAAQPGIAVEVIANLSIPVPPSSTQHAIAL 123 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHW 232 + ET IDTLI + ++L+ EK++A+V+ V +GL+P V ++ SGIEW+G +P HW Sbjct: 124 FLNRETADIDTLIAAKQSLLDLMAEKRRAIVAETVMRGLDPSVPLRPSGIEWLGDIPAHW 183 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 E++ L TE ++++ E + + + E + ES Y++ G++ Sbjct: 184 EIERSRWLFTERDQRSQTGKEEMLTVSHLTGVTPRSEKDVNMFEAESTAGYKLCLAGDLA 243 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGS 351 + GI++ AY P Y+ L+R + + Sbjct: 244 INTLWAWMGAMGTARVD----GIVSPAYNVYTPGPRLLPDYVDALVRIHVFAQEVTRYSK 299 Query: 352 GL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G+ R L E VPP+ EQ I I+ ET +ID L E SI LLKERR+ Sbjct: 300 GVWSSRLRLYPEGFFETWWPVPPLDEQQQIVEHISAETTKIDRLRAATENSIALLKERRA 359 Query: 409 SFIAAAVTGQIDLRG 423 + IAAAVTGQI++ Sbjct: 360 ALIAAAVTGQIEIPE 374 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 49/204 (24%), Positives = 83/204 (40%), Gaps = 5/204 (2%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 + SG++W+G IP HW++ + R+ +++ + + + T + Sbjct: 169 RPSGIEWLGDIPAHWEIERSRWLFTERDQRSQTGKEEM--LTVSHLTGVTPRSEKDVNMF 226 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLL 129 T+ + G + L ++ A DGI S + V P LP+ + + Sbjct: 227 EAESTAGYKLCLAGDLAINTLWAWMGAMGTARVDGIVSPAYNVYTPGPRLLPDYVDALVR 286 Query: 130 SIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 Q + +G +G P+PPL EQ I E I AET +ID L Sbjct: 287 IHVFAQEVTRYSKGVWSSRLRLYPEGFFETWWPVPPLDEQQQIVEHISAETTKIDRLRAA 346 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 I LLKE++ AL++ VT + Sbjct: 347 TENSIALLKERRAALIAAAVTGQI 370 >gi|229829856|ref|ZP_04455925.1| hypothetical protein GCWU000342_01962 [Shuttleworthia satelles DSM 14600] gi|229791154|gb|EEP27268.1| hypothetical protein GCWU000342_01962 [Shuttleworthia satelles DSM 14600] Length = 407 Score = 167 bits (423), Expect = 3e-39, Method: Composition-based stats. Identities = 75/426 (17%), Positives = 145/426 (34%), Gaps = 25/426 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESG 59 M Y KDSG++WIG +P W V + + + S+ + +++ + ++ Sbjct: 1 MSETVRYTDMKDSGIKWIGEVPASWNVRTLYQLATRVNNKNSDLAEQNLLSLSYGKIKRK 60 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQ 115 + +I G I+ + A GI ++ + LQ Sbjct: 61 DI---NTKDGLLPASFDGYNIIEAGDIVLRLTDLQNDHTSLRVGQATERGIITSAYTTLQ 117 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 P + +LL ++ ++ + + +P EQ I + Sbjct: 118 PINPSNARYLYYLLHAFDLKKGFYGMGSGVRQGLNYDEVKELRSVLPSQIEQDAIVSYLD 177 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 +ID +I E IE K + + + + T G++ DV+M D+ I W +P +W + Sbjct: 178 DVCQQIDLIIEEAKSSIEGYKGWRLSTIKEVTTHGISKDVEMVDTEISWAKSIPSNWNIG 237 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + + + + YG+ +T N E+ ++F Sbjct: 238 KGHYFASTYSGRAIPGDGTTGSIPVYGSGGSFKKTENPLYAGEA-----------VLFGR 286 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + + T Y + YL +++ +D Sbjct: 287 KGTLGKPIYV---NRPFWAVDTIYYAVCNEKWMLPKYLYYMLTIFDWESFI---THTALP 340 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 S+ +V + PPI EQ I ++ D +E E I L+ + S I AV Sbjct: 341 SIVANEVFSSVFICPPISEQLQIIKKLDCVCDNADTAIEATEHLIEELELYKRSLIYEAV 400 Query: 416 TGQIDL 421 TG+ + Sbjct: 401 TGKRKV 406 >gi|74318700|ref|YP_316440.1| putative restriction endonuclease S subunit [Thiobacillus denitrificans ATCC 25259] gi|74058195|gb|AAZ98635.1| putative restriction endonuclease S subunit [Thiobacillus denitrificans ATCC 25259] Length = 400 Score = 167 bits (422), Expect = 4e-39, Method: Composition-based stats. Identities = 103/405 (25%), Positives = 177/405 (43%), Gaps = 24/405 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P W+ + +K L +G TG Y GN + T++ + Sbjct: 9 PISWRRMKLKYLVALKSGEAIPGES----------IKETGDYPVYGGNGFRGYTNSFT-- 56 Query: 82 AKGQ-ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +G+ IL G+ G A+ + +V PK WL + Sbjct: 57 HEGERILIGRQGALCGNINYAEGKFWATEHAIVATPKT---NFETAWLGETLRVMNLNQY 113 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + A + + N+ + +PP EQ I + + + ID LI E+ + + LL EK++ Sbjct: 114 SQSAAQPGIAVEVVENLVIAVPPEGEQRRIADSLHQLSAPIDKLILEKQKLLTLLTEKRR 173 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 +++ + KGLN D +DS I W+G +P HW+V+ L TE + ++ E + Sbjct: 174 TVIADFLIKGLNKDTPRRDSDIPWLGEIPAHWKVERAKWLFTERDDRSDSGDEELLTVSH 233 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + E ES E Y+ + G++V + + + GI++ AY Sbjct: 234 LTGVTSRAEKDVNMFMAESLEGYKRCEAGDLVINTLWAWMGAMGIAR----QPGIVSPAY 289 Query: 321 MAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQF 376 +P +D Y+ L+R+ + G+ R L E + + VPP+ EQ Sbjct: 290 NVYQPVAQLDPEYIDLLVRTPRFVEEITRYSKGVWSSRLRLYPEGLYEAWLPVPPLDEQR 349 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 DI + ET ++D L E E+++ +L+ERRS+ I+AAVTGQ+DL Sbjct: 350 DIVARVQAETRKLDALAEATERTVTVLQERRSALISAAVTGQLDL 394 Score = 97.9 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 48/205 (23%), Positives = 83/205 (40%), Gaps = 5/205 (2%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 +DS + W+G IP HWKV K R+ +++ + + + T + Sbjct: 191 RDSDIPWLGEIPAHWKVERAKWLFTERDDRSDSGDEEL--LTVSHLTGVTSRAEKDVNMF 248 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG-WLL 129 G ++ L ++ IA GI S + V QP L + Sbjct: 249 MAESLEGYKRCEAGDLVINTLWAWMGAMGIARQPGIVSPAYNVYQPVAQLDPEYIDLLVR 308 Query: 130 SIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + I +G +G+ +P+PPL EQ I ++ AET ++D L Sbjct: 309 TPRFVEEITRYSKGVWSSRLRLYPEGLYEAWLPVPPLDEQRDIVARVQAETRKLDALAEA 368 Query: 188 RIRFIELLKEKKQALVSYIVTKGLN 212 R + +L+E++ AL+S VT L+ Sbjct: 369 TERTVTVLQERRSALISAAVTGQLD 393 >gi|259156571|gb|ACV96514.1| restriction modification system DNA specificity domain [Vibrio fluvialis Ind1] Length = 389 Score = 166 bits (419), Expect = 8e-39, Method: Composition-based stats. Identities = 85/386 (22%), Positives = 170/386 (44%), Gaps = 27/386 (6%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKY 63 Y YK+SG WIG IP W++VPI+ K + + ++I+ + + + + Sbjct: 8 PKYEVYKNSGEDWIGDIPSGWELVPIRSIFKFRNEKNSPVKTEEILSLSIANGVTKYSD- 66 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + GN R+ D S I I+ + + ++ + G S + L + + Sbjct: 67 KGRGGNKRKDDISAYKIAHPKDIVLNSMNVIVGAVGMSKYHGAISPVYYALYTESEDVLV 126 Query: 124 LQGWLLSID--VTQRIEAICEG------------ATMSHADWKGIGNIPMPIPPLAEQVL 169 + ++ + + +G + ++ P PP+AEQ L Sbjct: 127 EYYEKIFLNEGFQRGLLKFGKGILIKLSGTGKLNTIRMKVSTDDLKSLYFPKPPIAEQNL 186 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + +T +ID I+ + + I LLKE+ Q ++ VT+GLN +V MKD+GI+W+G +P Sbjct: 187 IFSFLDKKTAQIDEAISIKEQQINLLKERNQIIIHKAVTQGLNSNVLMKDTGIDWIGKIP 246 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 +HW++ ++ +LNR K ++ I S + L N GL + YQ VD G Sbjct: 247 EHWDISLAKHILKKLNRPRKKNGDTVI--CSNHGCSKLLGEVNQGLVSLTQHDYQGVDEG 304 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +++ +D + ++ ++ + V + Y+A+ ++ + V+ + Sbjct: 305 DLLVHGMDAWHGAIAISEHTGD-----CTSVVHVCDSHFNKVYIAYFLKMLAIMNVYKVI 359 Query: 350 GSGLRQSL----KFEDVKRLPVLVPP 371 +G+R + + + +++PP Sbjct: 360 SNGVRGNTSDFRSWSKFGEIQIILPP 385 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 58/221 (26%), Positives = 93/221 (42%), Gaps = 21/221 (9%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 K+SG +W+G +P WE+ P ++ N KN+ + ILSLS N + K + Sbjct: 9 KYEVYKNSGEDWIGDIPSGWELVPIRSIFKFRNEKNSPVKTEEILSLSIANGVTKYSDKG 68 Query: 273 MG--LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 G + + Y+I P +IV +++ + G I+ Y A+ D Sbjct: 69 RGGNKRKDDISAYKIAHPKDIVLNSMNVIVGAVGMSKY----HGAISPVYYALYTESEDV 124 Query: 331 TYLAW--LMRSYDLCKVFYAMGSGL-------------RQSLKFEDVKRLPVLVPPIKEQ 375 + + + + G G+ R + +D+K L PPI EQ Sbjct: 125 LVEYYEKIFLNEGFQRGLLKFGKGILIKLSGTGKLNTIRMKVSTDDLKSLYFPKPPIAEQ 184 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 I + ++ +TA+ID + EQ I LLKER I AVT Sbjct: 185 NLIFSFLDKKTAQIDEAISIKEQQINLLKERNQIIIHKAVT 225 >gi|315124281|ref|YP_004066285.1| putative type I restriction enzyme specificity protein [Campylobacter jejuni subsp. jejuni ICDCCJ07001] gi|315018003|gb|ADT66096.1| putative type I restriction enzyme specificity protein [Campylobacter jejuni subsp. jejuni ICDCCJ07001] Length = 393 Score = 165 bits (418), Expect = 9e-39, Method: Composition-based stats. Identities = 98/399 (24%), Positives = 169/399 (42%), Gaps = 32/399 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLED 55 MK++ K+SG++W+G IP+HW+VV I + G E+ +I I + D Sbjct: 1 MKNF------KESGIEWLGEIPEHWEVVKINKIVTFVNGYAFENFDFNPIFEIPVIRIGD 54 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLV 113 ++ Y +++ + + + IL G K D + + + Sbjct: 55 MQKEKILY-DNCLKTKEKEKLKQFLISNNDILIALSGATTGKIAFCDTDNKAYINQRVAI 113 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 ++ K L ++ + L+ + IE C G+ + K IG +P+PPL EQ I Sbjct: 114 VRSKLKL---VKYYFLTRGFSLLIELACNGSAQPNISTKEIGEFKIPLPPLKEQEQIANF 170 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + + +I I ++ + I LLKE+KQA ++ +TKGL+ ++ KDSGIEW+G +P HWE Sbjct: 171 LDEKCEQIANFIEKKEKLISLLKEQKQAFINETITKGLDKNINFKDSGIEWLGEIPQHWE 230 Query: 234 VKP---FFALVTELNRKNTKLIESNILSLSYGNIIQKL---------ETRNMGLKPESYE 281 VK F L LN + I +SYG I K + + + Sbjct: 231 VKKFKMLFTLGNGLNITKADFVSYGIPCVSYGEIHSKYPCRLNTTIHTLPFVSKTYLADK 290 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA--YMAVKPHGIDSTYLAWLMRS 339 ++ G+ VF + ++ + I + I+S Y ++L S Sbjct: 291 PQSLLQKGDFVFADTSEDIEGSGNFTSIQSDTPIFAGYHTIILKYKGKINSLYFSFLFDS 350 Query: 340 YDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFD 377 G+ S+ +K + L+PP+KEQ Sbjct: 351 IFTRNQIRKEVCGVKVFSITKSILKEVQCLIPPLKEQNK 389 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 87/209 (41%), Gaps = 10/209 (4%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN----ILSLSYGNIIQKLE 269 K+SGIEW+G +P+HWEV +VT +N + + N I + G++ ++ Sbjct: 1 MKNFKESGIEWLGEIPEHWEVVKINKIVTFVNGYAFENFDFNPIFEIPVIRIGDMQKEKI 60 Query: 270 TRNM--GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K + ++ +I+ K + + I V+ Sbjct: 61 LYDNCLKTKEKEKLKQFLISNNDILIALSGATTGKIAFC--DTDNKAYINQRVAIVRSKL 118 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + + A + ++ +++ + +PP+KEQ I N ++ + Sbjct: 119 KLVK--YYFLTRGFSLLIELACNGSAQPNISTKEIGEFKIPLPPLKEQEQIANFLDEKCE 176 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +I +EK E+ I LLKE++ +FI +T Sbjct: 177 QIANFIEKKEKLISLLKEQKQAFINETIT 205 >gi|302874007|ref|YP_003842640.1| restriction modification system DNA specificity domain [Clostridium cellulovorans 743B] gi|307689744|ref|ZP_07632190.1| restriction modification system DNA specificity domain [Clostridium cellulovorans 743B] gi|302576864|gb|ADL50876.1| restriction modification system DNA specificity domain [Clostridium cellulovorans 743B] Length = 457 Score = 165 bits (418), Expect = 1e-38, Method: Composition-based stats. Identities = 71/416 (17%), Positives = 150/416 (36%), Gaps = 20/416 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQS 73 +P++W +K L TG T + I +I D+ G Sbjct: 23 EVPENWVWSNLKSIADLVTGNTPSKNNEEFYGGKIPFIKPTDLNQGRI-LNSSTETLSNI 81 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + I KG +G + K + +G + Q + PK + + + LS Sbjct: 82 GATKARILPKGSTAVCCIGATIGKVAYLNVEGATNQQINSIIPKKIYNLYVYYYTLSSYF 141 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + T+ + +G + +P+PPL EQ I +I ++D E Sbjct: 142 HDTLIENSSSTTLPIINKSRMGELLIPLPPLKEQQRIVNRIENLFEKLDKAKELIEEARE 201 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGI-EWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 +++K A+ S LN K + I E +P +W+ + ++ Sbjct: 202 GFEKRKAAITSKAFRGILNYRKGEKVNPINEGFYKLPYNWKWTKLEDICEKITDGTHNSP 261 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESY---------ETYQIVDPGEIVFRFIDLQNDKR 303 +S + ++ + L +Y V G+I++ Sbjct: 262 KSYEYGDYKYVTAKNIKEWGIDLSSITYVTKKEHIPIYKRCDVKYGDILYIKDGATT-GI 320 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDV 362 + + E +++S + ID+ YL +++ S+++ K G L + + Sbjct: 321 ATINELTEEFSLLSSVALIRVGKCIDNKYLYYILNSFEIKKRILESVKGVAITRLTLKKI 380 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + +PP++EQ +I +++ + ++++ Q + + S +A A GQ Sbjct: 381 NDIIIPLPPLEEQKEIVKILDKLLEE-ESKIKELTQLEDQINLIKKSILAKAFRGQ 435 >gi|91213998|ref|YP_543984.1| EcoKI restriction-modification system protein HsdS [Escherichia coli UTI89] gi|117626660|ref|YP_859983.1| specificity determinant for hsdM and hsdR [Escherichia coli APEC O1] gi|91075572|gb|ABE10453.1| HsdS, type I site-specific deoxyribonuclease [Escherichia coli UTI89] gi|115515784|gb|ABJ03859.1| HsdS, type I site-specific deoxyribonuclease [Escherichia coli APEC O1] gi|294493695|gb|ADE92451.1| type I restriction modification DNA specificity protein [Escherichia coli IHE3034] gi|307629515|gb|ADN73819.1| EcoKI restriction-modification system protein HsdS [Escherichia coli UM146] gi|323950568|gb|EGB46446.1| type I restriction modification DNA specificity domain-containing protein [Escherichia coli H252] Length = 455 Score = 165 bits (418), Expect = 1e-38, Method: Composition-based stats. Identities = 73/417 (17%), Positives = 155/417 (37%), Gaps = 25/417 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESG---------KDIIYIGLEDVESGTGKYLP---K 66 G +P+ W+ + I + +G T +SG + + ++ D+ KY+ + Sbjct: 4 GKLPEGWEQIEIGDIADVISGGTPKSGVAENFAPSGEGVAWLTPADLSGYKEKYISHGAR 63 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 D + + + + KG IL+ P AI A+ I + Q Sbjct: 64 DLTTLGYSSCSAKLMPKGTILFSSRAPIGYVAIAANE--IATNQGFKSFAFPSDIFPDYA 121 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + ++ E + G T +P + P AEQ +I EK+ ++D+ Sbjct: 122 YYFLRNIRHIAEEMGTGTTFKEISGSSAKTLPFVLVPFAEQKIIAEKLDTLLAQVDSTKA 181 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + ++LK +QA++ V L D + S W + + + Sbjct: 182 RLEQIPQILKRFRQAVLGAAVRGKLTEDWRDNSSLSGWR----EGKLGEFIKKPSYGTSS 237 Query: 247 KNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRS 304 K+ E I L GN KL+ ++ ++ E ++ +++F + Sbjct: 238 KS--NKEGLIPVLRMGNLQGGKLDWTDLVYTSDTIEIEKYKLEYNDVLFNRTNSPELVGK 295 Query: 305 LRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFED 361 + + I + V+ + YL + + S + Y++ S + ++ + Sbjct: 296 TAIYKSEQPAIYAGYLIRVQCLPDLNPDYLNYHLNSILGRQYCYSVKSDGVSQSNINAQK 355 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + P+ VPP+ EQ +I + A D + +++ ++ + S +A A G+ Sbjct: 356 LIAYPITVPPLPEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 412 >gi|227500130|ref|ZP_03930201.1| restriction endonuclease S subunits [Anaerococcus tetradius ATCC 35098] gi|227217772|gb|EEI83072.1| restriction endonuclease S subunits [Anaerococcus tetradius ATCC 35098] Length = 424 Score = 165 bits (417), Expect = 1e-38, Method: Composition-based stats. Identities = 92/423 (21%), Positives = 159/423 (37%), Gaps = 22/423 (5%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNT-----GRTSES--GKDIIYIGLEDVESGTGKY 63 KDSG+ WIG +P WKV IK +LN G TS + I D + G + Sbjct: 5 KDSGINWIGTMPNDWKVKKIKYIGELNGRIGWQGLTSNEYIDEGPFLITGTDFKDGRIDW 64 Query: 64 LPKDGNSRQSDTSTVSI-FAKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDV 119 I G +L K G + AI+ + +G+ S + + Sbjct: 65 DTCVHIDHSRWEEAKKIQIKNGDLLITKDGTVGKVAIVENLEGLASLNSGVLKIDLKEGY 124 Query: 120 LPELLQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L + L L S G T+ H K N IP EQ +I + + Sbjct: 125 LAKFLFYVLQSDVFWTWFNYTSSGNSTILHLYEKDFNNFTFSIPDKDEQEVIINFLDSNV 184 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 I+ I++ + I++LKE ++LVS +TKGL +V+ KD+ I+W+G +P +W +K Sbjct: 185 GSINLKISKIEKQIKILKEYIKSLVSETITKGLEKNVEYKDTSIDWIGKIPANWSIKRLK 244 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 L + + + Y + + I L Sbjct: 245 HLFYIYAGGDIDYSDYAEAENEIQKYPILSNSLEHDGV-IGYTSKFRFEGDTITVTGRGL 303 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + + + + + D Y ++ + S ++ L Sbjct: 304 ----VGVAVPRNFKFYPVVRLLVGEPKDRDDVRYFSYCINSANVIGD-----QTAMAQLT 354 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 E + + V P K Q +I N ++ E ++I+ ++E EQ I L + ++S + VTG+ Sbjct: 355 REKLGDIKVPYPLKKIQIEIANFLDTEVSKINHVIETKEQQITKLIDYKNSLVYEYVTGK 414 Query: 419 IDL 421 + Sbjct: 415 KRV 417 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 80/215 (37%), Gaps = 13/215 (6%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 +KDSGI W+G +P+ W+VK + R + + SN +I + ++ Sbjct: 1 MSNLKDSGINWIGTMPNDWKVKKIKYIGELNGRIGWQGLTSNEYIDEGPFLITGTDFKDG 60 Query: 274 GLKPE----------SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + + G+++ K ++ + + Sbjct: 61 RIDWDTCVHIDHSRWEEAKKIQIKNGDLLITKDGTVG-KVAIVENLEGLASLNSGVLKID 119 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 G + +L ++++S F SG L +D +P EQ I N Sbjct: 120 LKEGYLAKFLFYVLQSDVFWTWFNYTSSGNSTILHLYEKDFNNFTFSIPDKDEQEVIINF 179 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ I++ + KIE+ I +LKE S ++ +T Sbjct: 180 LDSNVGSINLKISKIEKQIKILKEYIKSLVSETIT 214 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 42/201 (20%), Positives = 75/201 (37%), Gaps = 13/201 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 +YKD+ + WIG IP +W + +K + G DI Y + E+ KY Sbjct: 222 EYKDTSIDWIGKIPANWSIKRLKHLFYIYAGG------DIDYSDYAEAENEIQKYPILSN 275 Query: 69 NSRQSDTSTV--SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + +G + + A+ +F + LV +PKD Sbjct: 276 SLEHDGVIGYTSKFRFEGDTITVTGRGLVGVAVPRNFKFYPVVRLLVGEPKDRDDVRYFS 335 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + I + M+ + +G+I +P P Q+ I + E +I+ +I Sbjct: 336 YC-----INSANVIGDQTAMAQLTREKLGDIKVPYPLKKIQIEIANFLDTEVSKINHVIE 390 Query: 187 ERIRFIELLKEKKQALVSYIV 207 + + I L + K +LV V Sbjct: 391 TKEQQITKLIDYKNSLVYEYV 411 >gi|330879482|gb|EGH13631.1| restriction modification system DNA specificity domain protein [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 293 Score = 165 bits (417), Expect = 1e-38, Method: Composition-based stats. Identities = 95/277 (34%), Positives = 150/277 (54%), Gaps = 11/277 (3%) Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV 215 N+ +P+PP++EQ I + ET RID LI E+ R IELLKEK+QA++S+ VTKGL+P V Sbjct: 6 NLRIPLPPISEQNQIARFLDHETARIDALIEEQQRLIELLKEKRQAVISHAVTKGLDPTV 65 Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 MKDSG EW+G VP HWE A+ +E K + +S+ +G ++L Sbjct: 66 PMKDSGAEWLGEVPAHWETLRIGAVYSEAADKGLAELPVLRVSIHHGVSDKELSEEESDR 125 Query: 276 KP---ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-ST 331 K + E Y+ V PG++V+ + G+++ AY+ +P D S Sbjct: 126 KITRIDDREKYKRVRPGDLVYNMMRAWQGGFGAVLV----NGLVSPAYVVARPKNEDISR 181 Query: 332 YLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 Y+ L+R+ + G+ R L ++ K + +++PP E+ I I+ Sbjct: 182 YVEQLLRTGCAVEEMRKNSYGITDFRLRLYWDQFKNIVIVIPPEVERLQIMERIDSLINE 241 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + L + ++ IV+L+ERRS+ I+AAVTG+ID+RG Sbjct: 242 SEALKSEADRLIVILQERRSALISAAVTGKIDVRGWQ 278 Score = 92.9 bits (229), Expect = 1e-16, Method: Composition-based stats. Identities = 45/210 (21%), Positives = 91/210 (43%), Gaps = 10/210 (4%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 KDSG +W+G +P HW+ + I + + ++ + + + K L ++ + Sbjct: 67 MKDSGAEWLGEVPAHWETLRIGAV---YSEAADKGLAELPVLRVSIHHGVSDKELSEEES 123 Query: 70 ----SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD-VLPELL 124 +R D G ++Y + + +G+ S ++V +PK+ + + Sbjct: 124 DRKITRIDDREKYKRVRPGDLVYNMMRAWQGGFGAVLVNGLVSPAYVVARPKNEDISRYV 183 Query: 125 QGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + L + + + G T W NI + IPP E++ I E+I + + Sbjct: 184 EQLLRTGCAVEEMRKNSYGITDFRLRLYWDQFKNIVIVIPPEVERLQIMERIDSLINESE 243 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLN 212 L +E R I +L+E++ AL+S VT ++ Sbjct: 244 ALKSEADRLIVILQERRSALISAAVTGKID 273 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 25/58 (43%), Positives = 38/58 (65%) Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ L + +PPI EQ I ++ ETARID L+E+ ++ I LLKE+R + I+ AVT Sbjct: 1 MSVIENLRIPLPPISEQNQIARFLDHETARIDALIEEQQRLIELLKEKRQAVISHAVT 58 >gi|332184238|gb|AEE26492.1| Type I restriction-modification system specificity subunit [Francisella cf. novicida 3523] Length = 390 Score = 164 bits (414), Expect = 3e-38, Method: Composition-based stats. Identities = 54/406 (13%), Positives = 126/406 (31%), Gaps = 25/406 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 + +P W+ + + G + S I ++++ D N Sbjct: 4 LYKLPAGWEWKKLGDLAEYVNGMAFKPKDWSNDGFPIIRIQNLNGSD------DFNYFSG 57 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G IL L + I + + + + Sbjct: 58 EAKEKYYVKNGDILISWS-ASLDVYKWQGGNAILNQHIFNTIINYDVVDYDFFYHTIKYS 116 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G M H NI +P+PPL EQ I K+ + +ID I + I Sbjct: 117 LSEVMNNLHGVGMKHITKGKFENIQIPLPPLPEQKRIVAKLDSLFEKIDKAIELHQQNIT 176 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + + K K ++++ + + T + + Sbjct: 177 NANTLMASTLDKTFKKLEGEYNSKK---LDYLSENIRYGYTDKAKEKGNARFIRITDIND 233 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 K E+ + +K + Y+++ G+I+ K +L + + Sbjct: 234 QGKF---------KDESVYVDIKNTDLDRYKLL-VGDILVARSGATAGKVALFTLDELSV 283 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPI 372 + ++ ++ + S + G + ++ ++K + + +PP+ Sbjct: 284 FASYLIRIRLQIDKALPLFIFYFCYSSKYWNQLDQIKIGGAQPNVNATNLKNIKIPLPPL 343 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q ++ ++D + + EQ + LK ++S + A G+ Sbjct: 344 PIQQQTVEYLDSIATKVDKIKQLNEQKLEKLKALKASILDKAFRGE 389 >gi|162448114|ref|YP_001621246.1| type I restriction enzyme, S subunit [Acholeplasma laidlawii PG-8A] gi|161986221|gb|ABX81870.1| type I restriction enzyme, S subunit [Acholeplasma laidlawii PG-8A] Length = 437 Score = 163 bits (413), Expect = 4e-38, Method: Composition-based stats. Identities = 98/429 (22%), Positives = 176/429 (41%), Gaps = 27/429 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIY-------IGLEDV-ESGTGKYLPKDGNS 70 G +P +WK + IK F +L +G T S D Y + + D+ + K Sbjct: 9 GVVPDNWKKMKIKHFYELYSGGTPLSSVDSNYAEEGVCFVNISDMTNTEYITDTTKKLTD 68 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + I G ILY + L S L L PK + Sbjct: 69 KGIKNKNLKILKSGTILYS-IYASLGSVSELKTKATISQAILALIPKMGISIDKNYLKFL 127 Query: 131 IDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + I G T S+ + + N+P+ IP L Q+ I + +T+ I+ LI + Sbjct: 128 LMIAKENIFYFSNGTTQSNLNADIVNNLPLIIPELNNQIRISLYVGNKTLIINKLIDNQK 187 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL--------- 240 + IE LKE KQ+L+S +VTKGLNP+V+ KDS ++W+GL+P +++V Sbjct: 188 QQIEKLKEYKQSLISEVVTKGLNPNVEFKDSNVKWIGLIPKNYDVSLISRHTFVTKLAGF 247 Query: 241 -VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET---YQIVDPGEIVFRFI 296 TE+ KN + + + K G +D I+ FI Sbjct: 248 EFTEILSKNINEFDDIPIVRAQNIKNDKFIKDFTGYINNDTARKLVRSNLDKPCILMTFI 307 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS----TYLAWLMRSYDLCKVFYAMGSG 352 + ++ + + + + A + + D L +LM + + + + Sbjct: 308 GAGVGEVAIFNEEKLHQLAPNVAKIEILKSHEDRISLRYLLYYLMSNAANFEKDQYLKAT 367 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + ++ ++ L ++PPI +Q I ++ + R+D L+E + I L + S I Sbjct: 368 AQPNISMTIIRGLRFVLPPIDDQNKIIKYLDNKVLRLDELIELKNKKIDELYNYKKSLIY 427 Query: 413 AAVTGQIDL 421 VTG+ ++ Sbjct: 428 EYVTGKKEV 436 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 81/217 (37%), Gaps = 18/217 (8%) Query: 9 QYKDSGVQWIGAIPKHWK---------VVPI-KRFTKLNTGRTSESGKDIIYIGLEDVES 58 ++KDS V+WIG IPK++ V + + DI + +++++ Sbjct: 214 EFKDSNVKWIGLIPKNYDVSLISRHTFVTKLAGFEFTEILSKNINEFDDIPIVRAQNIKN 273 Query: 59 GT-GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF-------DGICSTQ 110 K N+ + S K IL +G + + I + + + Sbjct: 274 DKFIKDFTGYINNDTARKLVRSNLDKPCILMTFIGAGVGEVAIFNEEKLHQLAPNVAKIE 333 Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 L + L +L+S + + + I + +PP+ +Q I Sbjct: 334 ILKSHEDRISLRYLLYYLMSNAANFEKDQYLKATAQPNISMTIIRGLRFVLPPIDDQNKI 393 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + +R+D LI + + I+ L K++L+ V Sbjct: 394 IKYLDNKVLRLDELIELKNKKIDELYNYKKSLIYEYV 430 >gi|201067913|ref|ZP_03217796.1| putative type I restriction-modification [Campylobacter jejuni subsp. jejuni BH-01-0142] gi|200004510|gb|EDZ04991.1| putative type I restriction-modification [Campylobacter jejuni subsp. jejuni BH-01-0142] Length = 269 Score = 163 bits (413), Expect = 4e-38, Method: Composition-based stats. Identities = 62/269 (23%), Positives = 120/269 (44%), Gaps = 14/269 (5%) Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 PPL EQ I + + +I I ++ + + LLKE+KQA ++ TKGL+ +V KDS Sbjct: 3 FPPLKEQEQIANFLDEKCKKIANFIEKKEKLMTLLKEQKQAFINKATTKGLDKNVNFKDS 62 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP--- 277 GIE++G +P HW++ ++ + I + + LK Sbjct: 63 GIEYLGEIPQHWKLVRLGLILKTSSGTTPDSGNDKYYKGGQIVWINSGDLNDGFLKDSKR 122 Query: 278 -------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + Y +I D ++ K ++ + A ++ + Sbjct: 123 KITQDALDDYSVLKIFDKDSLIVAMYGATIGKTAILKV----NACVNQACCVLEKSAWYN 178 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 T+ + + + ++ G + ++ + +K + + +PP+KEQ I N ++ + +ID Sbjct: 179 TFYLFYLFNRYKKELISMGSGGGQPNISQDIIKNIKIPLPPLKEQEQIANFLDEKCKKID 238 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +L+EK E+ I L+KE + + AV G+I Sbjct: 239 LLIEKTEKQIKLIKEYKITLTNQAVCGRI 267 Score = 109 bits (273), Expect = 7e-22, Method: Composition-based stats. Identities = 56/211 (26%), Positives = 95/211 (45%), Gaps = 9/211 (4%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGK 62 +KDSG++++G IP+HWK+V + K ++G T +SG D I++I D+ G K Sbjct: 59 FKDSGIEYLGEIPQHWKLVRLGLILKTSSGTTPDSGNDKYYKGGQIVWINSGDLNDGFLK 118 Query: 63 YLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + D S + IF K ++ G + K I + + VL+ Sbjct: 119 DSKRKITQDALDDYSVLKIFDKDSLIVAMYGATIGKTAILKVNACVNQACCVLEKSAWYN 178 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 +L + + ++ G + I NI +P+PPL EQ I + + +I Sbjct: 179 TFYLFYL-FNRYKKELISMGSGGGQPNISQDIIKNIKIPLPPLKEQEQIANFLDEKCKKI 237 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLN 212 D LI + + I+L+KE K L + V +N Sbjct: 238 DLLIEKTEKQIKLIKEYKITLTNQAVCGRIN 268 >gi|188996333|ref|YP_001930584.1| restriction modification system DNA specificity domain [Sulfurihydrogenibium sp. YO3AOP1] gi|188931400|gb|ACD66030.1| restriction modification system DNA specificity domain [Sulfurihydrogenibium sp. YO3AOP1] Length = 435 Score = 163 bits (413), Expect = 4e-38, Method: Composition-based stats. Identities = 75/441 (17%), Positives = 169/441 (38%), Gaps = 40/441 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKY 63 +K++ IG IP+ W+V + ++ G+ + ++ ++ +V Sbjct: 7 FKETE---IGLIPEDWEVARLGEVFEVKQGKQLSAKENRDGKVLKPFLRTSNVLWNKIDL 63 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ---PKDVL 120 KG IL + G R A+ S Q + + KD + Sbjct: 64 SELSYMPFSESEFKNLKLKKGDILVCEGGDVGRTAVWDGQIDEISYQNHLHRLRSVKDNI 123 Query: 121 PELLQGWLLSIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + + + T+ + + P+P+PPL EQ I + + Sbjct: 124 NNYFFAYWMEYAITIKNLYHQNANKTTIPNLSSSRLKAFPIPLPPLEEQRAIADIL---- 179 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKG---LNPDVKMKDSGIEWVGLVPDHWEVK 235 + I + + I K+ K++++ ++ T G ++ ++K E +GL+P+HWEV Sbjct: 180 STVQNAIEKTEKVINATKQLKKSMMKHLFTYGAVAVDEIDRIKLKESE-IGLIPEHWEVV 238 Query: 236 PFFALVTELNRKNTKLIES--------NILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 +V + + E I + + ++ + + + Sbjct: 239 RLGEVVDLDRGISWRKFEEGSKDNGHLIISIPNIKDGYIDFNSKYNHYLIKHIPKNKQIQ 298 Query: 288 PGEIVF--RFIDLQNDKRSLRSAQVMERGIITSAYM---AVKPHGIDSTYLAWLMRSYDL 342 +I+F ++N R++ + GI ++++ VK + + +L ++ S+ Sbjct: 299 LNDILFVGSSGSIENVGRNVFIENLSFEGIGFASFVFRARVKVNTVIPKFLYFMANSHWF 358 Query: 343 C-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 K + S + + + + K + + +PP+ EQ I N++ ID ++ E+ V Sbjct: 359 NYKDYVRRSSDGKYNFQLTEFKTIKIPLPPLDEQQKIANILTT----IDQKIQAEEKKKV 414 Query: 402 LLKERRSSFIAAAVTGQIDLR 422 L+ + + +TG+I +R Sbjct: 415 ALRSLFKTLLHQLMTGKIRVR 435 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 68/209 (32%), Gaps = 14/209 (6%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN-----ILSLSYGNI-IQKLETR 271 K +GL+P+ WEV + K E+ L N+ K++ Sbjct: 5 KGFKETEIGLIPEDWEVARLGEVFEVKQGKQLSAKENRDGKVLKPFLRTSNVLWNKIDLS 64 Query: 272 NMGLKP--ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + P ES + G+I+ + I+ Sbjct: 65 ELSYMPFSESEFKNLKLKKGDILVCEGGDVGRTAVWDGQIDEISYQNHLHRLRSVKDNIN 124 Query: 330 STYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + A+ M K +L +K P+ +PP++EQ I ++++ Sbjct: 125 NYFFAYWMEYAITIKNLYHQNANKTTIPNLSSSRLKAFPIPLPPLEEQRAIADILSTV-- 182 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +EK E+ I K+ + S + T Sbjct: 183 --QNAIEKTEKVINATKQLKKSMMKHLFT 209 >gi|254172634|ref|ZP_04879309.1| type I restriction modification system, subunit S [Thermococcus sp. AM4] gi|214033563|gb|EEB74390.1| type I restriction modification system, subunit S [Thermococcus sp. AM4] Length = 428 Score = 163 bits (413), Expect = 4e-38, Method: Composition-based stats. Identities = 70/425 (16%), Positives = 159/425 (37%), Gaps = 31/425 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYL----PK 66 IG IP+ WKVV ++ + TG T + ++ +I D+ G + Sbjct: 12 IGEIPRDWKVVRVREIFDVKTGTTPSTKQTDYWENGEMNWITPTDLSKLNGNIYMGDSER 71 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + + +S+ KG ++ P A++ + + L PKD + + Sbjct: 72 KITKKALEDYNLSLLPKGSLILSTRAPVGYIAVLTEE-ATFNQGCKGLVPKDQNKIIPEF 130 Query: 127 WLLSIDVTQRI-EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + ++ E++ G+T + +P+PP EQ I E + +D I Sbjct: 131 YAYYFKFKRQHLESLSGGSTFKELAKAMLERFLVPLPPRLEQKKIAEIL----RTVDEAI 186 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + IE + K+ L+ ++TKG+ + K + I + ++ + Sbjct: 187 EKTDLAIEKTERLKKGLMLRLLTKGI-KHERFKKTEIGEIPEEWRVVRLEEITRRIKRGP 245 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-----YQIVDPGEIVFRFIDLQN 300 K T E+ ++ ++ I K S E +++ G+++ ++ Sbjct: 246 SKKTDDNETGVVYVTSDYIDDHGNLNFDNPKYLSLEKIDRLDKYLLEEGDLIINCVNSLE 305 Query: 301 DKRSLRSAQVMERGII--TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ--S 356 + + + I + + ++ Y+ + SY + ++ Q S Sbjct: 306 KIGKVAVFEGYSKKAIVGFNNFALTLVSTVNPYYVKYFFLSYKGKALIKSISKAAVQQVS 365 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +D+ RL + +PP+ EQ I +++ ++ E + + L+ + + +T Sbjct: 366 FSSKDLLRLKIPLPPLPEQKQIAEILSTVDKKL----ELLRKRREKLELVKRGLMKGLLT 421 Query: 417 GQIDL 421 G+ + Sbjct: 422 GRRRV 426 >gi|91776956|ref|YP_546712.1| restriction modification system DNA specificity subunit [Methylobacillus flagellatus KT] gi|91710943|gb|ABE50871.1| restriction modification system DNA specificity domain [Methylobacillus flagellatus KT] Length = 429 Score = 163 bits (412), Expect = 5e-38, Method: Composition-based stats. Identities = 94/420 (22%), Positives = 166/420 (39%), Gaps = 25/420 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P W ++ + N ++ ++ ++ ++ V G L + + Sbjct: 9 LPAGWSRRRLRFDVRTNPVKSELELPGDAEVSFVPMDAVGELGGLRLDQ-TRELADVYAG 67 Query: 78 VSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLS- 130 + FA G + K+ P + + +T+ VL+P L +L Sbjct: 68 YTYFADGDVCIAKITPCFENGKGAIAEGLKNGVAFGTTELHVLRPLPTLDARFLFYLTIA 127 Query: 131 IDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 D E+ GA + + P+P + Q I + +T +ID LI ++ Sbjct: 128 HDFRSHGESEMLGAGGQKRVPEGFLKDWTPPLPCIQVQQRIARFLDEKTAQIDGLIEKKR 187 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT--ELNRK 247 ++ L EK+QAL++ VTKG+NPD MK SGI+W+G +P HWEV+ T + Sbjct: 188 ALLDRLAEKRQALITRAVTKGMNPDAPMKPSGIDWLGDIPAHWEVRGLTKCTTRVDYRGA 247 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ------IVDPGEIVFRFIDLQND 301 + S + ++ NI + + + Y+ + GE++F + Sbjct: 248 TPEKSSSGVFLVTAKNIKNGRIDYQISQEYIPEDIYEQAMRRGLPKLGEVLFTTEAPLGE 307 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSG-LRQSLKF 359 + + + + YLA+ M S + +G LK Sbjct: 308 ---IAQVDREDIALAQRIIKFTTSTPELENDYLAYWMMSMPFQAQIQSRATGSTAVGLKA 364 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + LP L+PP EQ DI + + +ID + I S+ E R++ I AAVTGQI Sbjct: 365 SKIVDLPCLLPPKDEQKDIISQVRQSLMKIDEIETAISDSLEFKIEYRAALITAAVTGQI 424 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 44/210 (20%), Positives = 87/210 (41%), Gaps = 8/210 (3%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLN--TGRTS-ESGKDIIYIGLEDVESGTGKY--L 64 K SG+ W+G IP HW+V + + T G T +S + + +++++G Y Sbjct: 215 MKPSGIDWLGDIPAHWEVRGLTKCTTRVDYRGATPEKSSSGVFLVTAKNIKNGRIDYQIS 274 Query: 65 PKDGNSRQSDTSTVSIFAK-GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + + + K G++L+ P A + D + + + E Sbjct: 275 QEYIPEDIYEQAMRRGLPKLGEVLFTTEAPLGEIAQVDREDIALAQRIIKFTTSTPELEN 334 Query: 124 LQ--GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 W++S+ +I++ G+T I ++P +PP EQ I ++ ++I Sbjct: 335 DYLAYWMMSMPFQAQIQSRATGSTAVGLKASKIVDLPCLLPPKDEQKDIISQVRQSLMKI 394 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGL 211 D + T +E E + AL++ VT + Sbjct: 395 DEIETAISDSLEFKIEYRAALITAAVTGQI 424 >gi|225026441|ref|ZP_03715633.1| hypothetical protein EUBHAL_00690 [Eubacterium hallii DSM 3353] gi|224956233|gb|EEG37442.1| hypothetical protein EUBHAL_00690 [Eubacterium hallii DSM 3353] Length = 408 Score = 163 bits (412), Expect = 6e-38, Method: Composition-based stats. Identities = 85/419 (20%), Positives = 163/419 (38%), Gaps = 27/419 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 + KDSG++WIG IP W + P + ++ + + E K + Sbjct: 10 EMKDSGIEWIGDIPSSWTIFPANGVFSEVKEKNTDLKFTNAF-SFKYGEIVDKKQVGDVD 68 Query: 69 NSRQSDTSTVSIFAKGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPE 122 N+ + S+ +I K I+ ++ I + GI ++ +L +QP + P Sbjct: 69 NNLKETLSSYTIVRKNTIMINGLNLNYDFVSQRVAIVNESGIITSAYLAIQPDENKINPR 128 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + L S D Q + G ++ I + P L+EQ +I + + +ID Sbjct: 129 FVLYLLKSYDYQQVFHGLGSG-IRKTLKYQDFKKIMIVAPTLSEQQVIADYLDKTCSQID 187 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 +I E I KE KQ+++ VTKGL+ +V+MKDSG+ W+G +P WE+ ++ Sbjct: 188 EIIAEAKASIYEYKELKQSVIFEAVTKGLDKNVEMKDSGVYWIGKIPLDWEIIKTKYVIK 247 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 N N E I G +S++T G V + + Sbjct: 248 IENGSNPS-TEGKIPVYGSG--------------AKSFKTCGEYKEGPTVL--LGRKGAT 290 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + + +A+ + + + L + + + + Sbjct: 291 LHIPHYIEGKYWNVDTAFNTIPIN--NKIELKYFYYVASCFDYNKYISQTTLPGMTQTNY 348 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + P I Q ++ ++ + +D L+ + E I L+ + S I VTG+ + Sbjct: 349 RNIYMPYPSITIQEELVKWLDNKIFELDSLISEKESLINDLEAYKKSLIYEVVTGKRKV 407 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 75/207 (36%), Positives = 126/207 (60%), Gaps = 3/207 (1%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+ +MKDSGIEW+G +P W + P + +E+ KNT L +N S YG I+ K + + Sbjct: 7 PETEMKDSGIEWIGDIPSSWTIFPANGVFSEVKEKNTDLKFTNAFSFKYGEIVDKKQVGD 66 Query: 273 MGLK-PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGID 329 + E+ +Y IV I+ ++L D S R A V E GIITSAY+A++P + I+ Sbjct: 67 VDNNLKETLSSYTIVRKNTIMINGLNLNYDFVSQRVAIVNESGIITSAYLAIQPDENKIN 126 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 ++ +L++SYD +VF+ +GSG+R++LK++D K++ ++ P + EQ I + ++ ++I Sbjct: 127 PRFVLYLLKSYDYQQVFHGLGSGIRKTLKYQDFKKIMIVAPTLSEQQVIADYLDKTCSQI 186 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVT 416 D ++ + + SI KE + S I AVT Sbjct: 187 DEIIAEAKASIYEYKELKQSVIFEAVT 213 >gi|260891564|ref|ZP_05902827.1| hypothetical protein GCWU000323_02779 [Leptotrichia hofstadii F0254] gi|260858672|gb|EEX73172.1| hypothetical protein GCWU000323_02779 [Leptotrichia hofstadii F0254] Length = 461 Score = 161 bits (407), Expect = 2e-37, Method: Composition-based stats. Identities = 75/438 (17%), Positives = 168/438 (38%), Gaps = 28/438 (6%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYL 64 Y +YK + + W +P +W + I + + + K+I+ + + S Sbjct: 3 KYERYKSTELSWSKHLPYYWNIKRIASIFDIRKEKNSPVRTKEILSLSAKYGVSLYSDKK 62 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 K GN + D ++ + G IL + I+++ G S + L + Sbjct: 63 EKGGNKPKEDLTSYYLCYSGDILVNCMNIVAGSVGISNYFGAVSPVYYPLLNMNADENCT 122 Query: 125 QGWLLSIDVTQRIEAICE---------------GATMSHADWKGIGNIPMPIPPLAEQVL 169 + ++ W + +P+PP+ EQV Sbjct: 123 RYMEYVFRNYNFQRSLVGLGKGIQMSESEDGKLFTVRMRISWDILKTQLLPVPPIEEQVQ 182 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + + I+ LI +KE ++ ++S LN D ++K IE Sbjct: 183 IANYLDWKINEINKLIEINKEK---IKEIRKYIISEHERLILNNDSEVKKLIIENNIYDY 239 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 ++K + ++S+I+ S + +GL ++ + YQ ++ G Sbjct: 240 SDKKIKIKRLKSVLKKIEKEASLDSDIIICSNNGKSFVRGDKKIGLYSDNIKMYQNINKG 299 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +++ +D + + + + V D Y+ + +R +++ Sbjct: 300 QLMIHGMDTWHGAICISDYNGR-----CTKVVHVCETNEDKMYIYYYLRLLAFLEMYKPF 354 Query: 350 GSGLRQSLK----FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 +G+RQ+ ++ + ++ +++P I++Q++I+N + + L+ +I +L + Sbjct: 355 SNGVRQNTSDFRSWDKLGQINIIIPLIEKQYEISNTLTEIINNSEKLILEIINESEMLNK 414 Query: 406 RRSSFIAAAVTGQIDLRG 423 + S I+ VTGQID+R Sbjct: 415 LKQSLISEVVTGQIDVRD 432 >gi|320352395|ref|YP_004193734.1| restriction modification system DNA specificity domain-containing protein [Desulfobulbus propionicus DSM 2032] gi|320120897|gb|ADW16443.1| restriction modification system DNA specificity domain protein [Desulfobulbus propionicus DSM 2032] Length = 357 Score = 160 bits (405), Expect = 3e-37, Method: Composition-based stats. Identities = 89/333 (26%), Positives = 134/333 (40%), Gaps = 19/333 (5%) Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 + P + +L S + +E T + I N+ +P+ + EQ Sbjct: 15 AAIRCNPSRADKRFIYFYLQSKEFQTGVELSWSFGTQQNIGMGVIQNLAVPLGTIPEQTA 74 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV----------TKGLNPDVKMKD 219 I + + ET RIDTL+T++ R I LL EK+ AL+S V GL P + KD Sbjct: 75 IADFLDRETGRIDTLVTKKRRLIALLGEKRTALISRTVTRGLPAEAAREFGLKPHTRFKD 134 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII-----QKLETRNMG 274 SGIEW+G VP+ WEV F V + E + G +L + Sbjct: 135 SGIEWLGEVPEGWEVVKFSREVKIAEGQVDPEREPYSTMVLIGPEHVEAGTGRLVSEATA 194 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + GE+++ I K + Y + + Y+ Sbjct: 195 EDQAAISGKYYCHKGEVIYSKIRPALRKVVKAKNDCL---CSADMYPLGGRDKLLNDYIY 251 Query: 335 WLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 WL S + + L + VP EQ I +N ETA+ID L Sbjct: 252 WLFLSDQFAAWSVLEADRVAMPKINRNTLNELRLPVPVGSEQAAIATYLNRETAKIDQLF 311 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 K+E +IV L E R++ I AAVTG+ID+RG++ Sbjct: 312 TKVEAAIVRLLEYRTALITAAVTGKIDVRGKAD 344 Score = 114 bits (284), Expect = 4e-23, Method: Composition-based stats. Identities = 60/212 (28%), Positives = 101/212 (47%), Gaps = 4/212 (1%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDII---YIGLEDVESGTG 61 K + ++KDSG++W+G +P+ W+VV R K+ G+ + IG E VE+GTG Sbjct: 127 KPHTRFKDSGIEWLGEVPEGWEVVKFSREVKIAEGQVDPEREPYSTMVLIGPEHVEAGTG 186 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-L 120 + + + Q+ S KG+++Y K+ P LRK + A D +CS L +D L Sbjct: 187 RLVSEATAEDQAAISGKYYCHKGEVIYSKIRPALRKVVKAKNDCLCSADMYPLGGRDKLL 246 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + LS + M + + + +P+P +EQ I + ET + Sbjct: 247 NDYIYWLFLSDQFAAWSVLEADRVAMPKINRNTLNELRLPVPVGSEQAAIATYLNRETAK 306 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 ID L T+ I L E + AL++ VT ++ Sbjct: 307 IDQLFTKVEAAIVRLLEYRTALITAAVTGKID 338 >gi|171915570|ref|ZP_02931040.1| putative restriction endonuclease S subunit [Verrucomicrobium spinosum DSM 4136] Length = 299 Score = 160 bits (404), Expect = 4e-37, Method: Composition-based stats. Identities = 87/281 (30%), Positives = 144/281 (51%), Gaps = 8/281 (2%) Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G TM + + + +P+ P Q I + ET RID L++ + R +EL+ EK++AL Sbjct: 16 GTTMDNLGAETVAELPIQAPSPPRQHSIATYLDRETKRIDELVSVKERLLELVAEKRRAL 75 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 ++ VT+GLNP ++DSGI W+G +P+HW+V+ L TE + +++ E + Sbjct: 76 ITRAVTRGLNPKAALRDSGIPWLGAIPEHWQVERSKWLFTERDERSSTGEEEMLTVSHLT 135 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + E + E+ E Y++ P ++V + A G+++ AY Sbjct: 136 GVTPRAEKDVNMFEAETTEGYKLCQPNDLVINTLWAWMGAMGTARA----PGMVSPAYHV 191 Query: 323 VKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDI 378 P +DS Y+ L+R + G+ R L E + + VPP++EQ I Sbjct: 192 YTPGDRLDSDYVDALVRIPIFAQEAIRFSKGVWSSRLRLYPEGLYEIWFPVPPLEEQRAI 251 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 I ETA++D L E++I LLKERR++ I+AAVTG+I Sbjct: 252 VTHIARETAKLDALRASAERTIALLKERRAALISAAVTGKI 292 Score = 97.9 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 51/204 (25%), Positives = 82/204 (40%), Gaps = 5/204 (2%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 +DSG+ W+GAIP+HW+V K R+S +++ + + + T + Sbjct: 91 RDSGIPWLGAIPEHWQVERSKWLFTERDERSSTGEEEM--LTVSHLTGVTPRAEKDVNMF 148 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG-WLL 129 T + ++ L ++ A G+ S + V P D L + Sbjct: 149 EAETTEGYKLCQPNDLVINTLWAWMGAMGTARAPGMVSPAYHVYTPGDRLDSDYVDALVR 208 Query: 130 SIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 Q +G +G+ I P+PPL EQ I I ET ++D L Sbjct: 209 IPIFAQEAIRFSKGVWSSRLRLYPEGLYEIWFPVPPLEEQRAIVTHIARETAKLDALRAS 268 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 R I LLKE++ AL+S VT + Sbjct: 269 AERTIALLKERRAALISAAVTGKI 292 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/68 (33%), Positives = 32/68 (47%) Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +L E V LP+ P Q I ++ ET RID LV E+ + L+ E+R Sbjct: 14 SVGTTMDNLGAETVAELPIQAPSPPRQHSIATYLDRETKRIDELVSVKERLLELVAEKRR 73 Query: 409 SFIAAAVT 416 + I AVT Sbjct: 74 ALITRAVT 81 >gi|229819988|ref|YP_002881514.1| restriction modification system DNA specificity domain protein [Beutenbergia cavernae DSM 12333] gi|229565901|gb|ACQ79752.1| restriction modification system DNA specificity domain protein [Beutenbergia cavernae DSM 12333] Length = 427 Score = 160 bits (404), Expect = 4e-37, Method: Composition-based stats. Identities = 104/412 (25%), Positives = 166/412 (40%), Gaps = 31/412 (7%) Query: 34 TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV---SIFAKGQILYGK 90 + + DI + L DV G GK+ K +T S G IL + Sbjct: 19 GDWVESKDQDPDGDIRLLQLADV--GDGKFKDKSDRWINEETFRRLRCSWVHPGDILIAR 76 Query: 91 L-GPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + P R ++ + G + L P L + S +E +GAT Sbjct: 77 MPDPLGRACVVPEGLGKTITVVDVAVLRPDPDQADAGYLTYAINSAKTRSEVERQQDGAT 136 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 K +G + +P+PPL EQ I + + AET +ID LI E+ R I LLKE++ + + Sbjct: 137 RQRIPRKRLGRVSIPLPPLEEQRRIADFLDAETTQIDALIAEQERLIGLLKERRASGILQ 196 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL------VTELNRKNTKLIESNILSL 259 VT+GL DV +K S + WV VP HW V T ++++I Sbjct: 197 AVTRGL-RDVDLKPSTLTWVDAVPLHWTVANIRRFAAMKTGHTPSRSNPEYWVDTHIPWF 255 Query: 260 SYGNIIQKLETRNMGLKPESY---------ETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + ++ Q + R L +++ G +V + Sbjct: 256 TLADVWQVRDGRRTHLGETENTISDLGLANSAAELLPAGTVVLSRTASVGFSGVMPRPM- 314 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + V + YL +L R+ +GS +++ + V VP Sbjct: 315 ---ATSQDFWNWVCGPELVPEYLMYLFRAMRGEFNALMIGS-THKTIYQPVAAAIRVPVP 370 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 P++EQ +I I+ T + D L+ + E +I L KERR++ I AAVTGQID+ Sbjct: 371 PLEEQHEIVARIDERTRKTDALINEAEHNIALSKERRAALITAAVTGQIDVT 422 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 52/215 (24%), Positives = 82/215 (38%), Gaps = 15/215 (6%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLED---VESGT 60 K S + W+ A+P HW V I+RF + TG T I + L D V G Sbjct: 208 KPSTLTWVDAVPLHWTVANIRRFAAMKTGHTPSRSNPEYWVDTHIPWFTLADVWQVRDGR 267 Query: 61 GKYLPKDGNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 +L + N+ S + G ++ + + + + S F Sbjct: 268 RTHLGETENTISDLGLANSAAELLPAGTVVLSRT-ASVGFSGVMPRPMATSQDFWNWVCG 326 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L +L + A+ G+T I +P+PPL EQ I +I Sbjct: 327 PELVPEYLMYLFR-AMRGEFNALMIGSTHKTIYQPVAAAIRVPVPPLEEQHEIVARIDER 385 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 T + D LI E I L KE++ AL++ VT ++ Sbjct: 386 TRKTDALINEAEHNIALSKERRAALITAAVTGQID 420 >gi|330469019|ref|YP_004406762.1| restriction modification system DNA specificity subunit [Verrucosispora maris AB-18-032] gi|328811990|gb|AEB46162.1| restriction modification system DNA specificity subunit [Verrucosispora maris AB-18-032] Length = 428 Score = 160 bits (404), Expect = 4e-37, Method: Composition-based stats. Identities = 86/428 (20%), Positives = 158/428 (36%), Gaps = 34/428 (7%) Query: 24 HWKVVPIKRFTK-LNTGRTSES----GKDIIYIGLEDVESGT-GKYLPKDGNSRQSDTST 77 W + +K + + TG DI+ + + D + G + S Sbjct: 5 SWPRMRLKSLVEPVQTGVWGAEPAGDNDDILCVRVADFDRQRLGLKSVETVRSVSEADRA 64 Query: 78 VSIFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFL--VLQPKDVLPELL-QGWLL 129 + G IL K G P + S+ F+ V P Sbjct: 65 TRLLRAGDILLEKSGGTEAKPVGFTVMFDGGYPAVSSNFIGRVRMRDGQHPRFWLYALAA 124 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + + + + + + D N +P L EQ I + + ET RIDTLI E+ Sbjct: 125 SYLTRRTQKCVRQTTGIQNLDQGAFFNEVFAVPTLGEQRAIADYLDRETTRIDTLIEEQQ 184 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 IE+L+E++ AL ++ G V +S + W +P W V P ++ + Sbjct: 185 HLIEMLRERRNALRVHVALHG-TRQVAEVESPLPWASKIPASWRVVPLTSVAQLESGHTP 243 Query: 250 ------KLIESNILSLSY-------GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + I +S G + + + + +++ +V Sbjct: 244 SRSREDWWTDCYIPWVSLHDVGAMRGTKYLHDTEQRISDAGIANSSARLLPARTVVLSRD 303 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQ 355 + + + A + W++ + + F + +G + Sbjct: 304 ATVGRTAIMAV-----PMATSQHFAAWVCGPLLDPEYLWVLFADAMQPFFDSFQNGSTIR 358 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ D+K + +PP+ EQ I ++ ET +ID L+ + E+ I L +ERR++ I AAV Sbjct: 359 TIGMGDLKAFRIPLPPLDEQRRIVEYLDEETPKIDTLIVETERFIELARERRAALITAAV 418 Query: 416 TGQIDLRG 423 TGQID+R Sbjct: 419 TGQIDVRE 426 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 49/212 (23%), Positives = 88/212 (41%), Gaps = 12/212 (5%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDV----ESGT 60 +S + W IP W+VVP+ +L +G T ++ I ++ L DV + Sbjct: 213 ESPLPWASKIPASWRVVPLTSVAQLESGHTPSRSREDWWTDCYIPWVSLHDVGAMRGTKY 272 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + + S+ + ++ + + + I S F +L Sbjct: 273 LHDTEQRISDAGIANSSARLLPARTVVLSR-DATVGRTAIMAVPMATSQHFAAWVCGPLL 331 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 L + + ++ G+T+ + +P+PPL EQ I E + ET + Sbjct: 332 DPEYLWVLFADAMQPFFDSFQNGSTIRTIGMGDLKAFRIPLPPLDEQRRIVEYLDEETPK 391 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 IDTLI E RFIEL +E++ AL++ VT ++ Sbjct: 392 IDTLIVETERFIELARERRAALITAAVTGQID 423 >gi|218550389|ref|YP_002384180.1| Specificity determinant for hsdM and hsdR [Escherichia fergusonii ATCC 35469] gi|218357930|emb|CAQ90574.1| Specificity determinant for hsdM and hsdR (modular protein) [Escherichia fergusonii ATCC 35469] Length = 502 Score = 159 bits (403), Expect = 5e-37, Method: Composition-based stats. Identities = 69/456 (15%), Positives = 145/456 (31%), Gaps = 56/456 (12%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ 72 G +P+ W ++ +G T D I +I D+ + Sbjct: 4 GKLPEGWVETNLQNVASWGSGGTPSRNHDEYYNGNIPWIKTGDLGPKIITNASEYITDAG 63 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S+ F KG + G + K I D + V P + + L + ++ Sbjct: 64 VQNSSAKFFPKGSVAIAMYGATIGKTSILGIDATTNQACAVGTPLEGITSTLFLYYFLLN 123 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +G + I + +PPLAEQ +I EK+ ++D+ + Sbjct: 124 EKNAFIKKGKGGAQPNISQTVIKEHIIYLPPLAEQKIITEKLDTLLAQVDSTKARLEQIP 183 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMK----DSGIEWVGLV-------------------- 228 ++LK +QA++ V L + S E + + Sbjct: 184 QILKRFRQAVLERAVNGKLTECWRDCVGELTSAEEIITEIKKYRKASLSTEGSSASTESK 243 Query: 229 ------------------PDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQK 267 P W F V + + K ++ I + +I Sbjct: 244 RQIAKIEKHCFKVPKINLPKGWVWTTFLQSMEKVVDCHNKTAPYVDQGIHLIRTPDIRNG 303 Query: 268 LETRNMGLKPES-----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + + ++ + G+I+F + + ++ G Sbjct: 304 VISLDNTKYIDNDTYLYWSKRCPPRSGDIIFTREAPMGEAGIVPENTIICMGQRMMLLRP 363 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + + L ++ S ++ + L+ DV+ L +PPI+EQ +I + Sbjct: 364 IPEYIHNKYVLLNILSSSFQTRMISQAIGTGVKHLRVADVESLTYPLPPIEEQHEIVRRV 423 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 A D + +++ ++ + S +A A G+ Sbjct: 424 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 459 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 67/205 (32%), Gaps = 9/205 (4%) Query: 21 IPKHWKVVPI----KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W ++ + + I I D+ +G + Sbjct: 261 LPKGWVWTTFLQSMEKVVDCHNKTAPYVDQGIHLIRTPDIRNGVISLDNTKYIDNDTYLY 320 Query: 77 TVSIFAK--GQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSI 131 G I++ + P I+ + IC L P+ + + + +LS Sbjct: 321 WSKRCPPRSGDIIFTREAPMGEAGIVPENTIICMGQRMMLLRPIPEYIHNKYVLLNILSS 380 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 R+ + G + H + ++ P+PP+ EQ I ++ DT+ + Sbjct: 381 SFQTRMISQAIGTGVKHLRVADVESLTYPLPPIEEQHEIVRRVEQLFAYADTIEKQVNNA 440 Query: 192 IELLKEKKQALVSYIVTKGLNPDVK 216 + + Q++++ L + Sbjct: 441 LARVNNLTQSILAKAFRGELTAQWR 465 >gi|313634897|gb|EFS01303.1| putative type-1 restriction enzyme MjaXIP specificity protein [Listeria seeligeri FSL N1-067] Length = 439 Score = 159 bits (403), Expect = 5e-37, Method: Composition-based stats. Identities = 79/414 (19%), Positives = 154/414 (37%), Gaps = 26/414 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +V +K + + TG T + I ++ ++ + L Sbjct: 15 EVSKLKYVSDIITGNTPSKLNESFYENGIIDWVKPNNITDDY-RLLKSKDKLSIKGVRKA 73 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + + L +G + A+ + V+ K + + Sbjct: 74 RVVPRNSTLVCAIGTIGKLALSEEEVTTNQQINSVIFTKINKKYGFYILVCME---NEFK 130 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +S + + N+ + P Q I + ++ ID LI+ + + I+LL+E+ Sbjct: 131 KYSNKVVVSILNKTSMENLKIISPSPIRQERICLFLDSKLSEIDFLISSKEKQIKLLEEQ 190 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT-----ELNRKNTKLIE 253 +QA+++ VTKGLN V+MKDSG+EW+G +P HWE+ + Sbjct: 191 RQAMITEAVTKGLNSSVRMKDSGVEWIGEIPKHWEIAKIKYTTYVKGRIGWQGLRSDEFI 250 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQI-----VDPGEIVFRFIDLQNDKRSLRSA 308 + L G + S + Y + +++ ++ Sbjct: 251 DDGPYLVTGTNFKNGIVDWQDCYHISEDRYNEAVPIQLKEDDLLITKDGTIGKLALVK-- 308 Query: 309 QVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRL 365 ++ + I+ S +P + + YL W + S + M +G + L E Sbjct: 309 EMPGKTILNSGIFVTRPLANKYINNYLYWNLNSASFSQYIRTMETGSTIKHLYQETFVNY 368 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +P ++EQ I+ +N + ++ +++ I I LKE R S I AVTG+I Sbjct: 369 SYALPSLEEQESISCYLNNKNQKLGNVIQNITIQISKLKEYRHSLIHEAVTGKI 422 Score = 89.9 bits (221), Expect = 8e-16, Method: Composition-based stats. Identities = 54/214 (25%), Positives = 89/214 (41%), Gaps = 12/214 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYI-GLEDVESGTGK 62 KDSGV+WIG IPKHW++ IK T + R+ E D Y+ + ++G Sbjct: 209 MKDSGVEWIGEIPKHWEIAKIKYTTYVKGRIGWQGLRSDEFIDDGPYLVTGTNFKNGIVD 268 Query: 63 YLPKDGNSRQSDTSTVSI-FAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQP--K 117 + S V I + +L K G + A++ + G I ++ V +P Sbjct: 269 WQDCYHISEDRYNEAVPIQLKEDDLLITKDGTIGKLALVKEMPGKTILNSGIFVTRPLAN 328 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + L L S +Q I + G+T+ H + N +P L EQ I + + Sbjct: 329 KYINNYLYWNLNSASFSQYIRTMETGSTIKHLYQETFVNYSYALPSLEEQESISCYLNNK 388 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 ++ +I I LKE + +L+ VT + Sbjct: 389 NQKLGNVIQNITIQISKLKEYRHSLIHEAVTGKI 422 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 66/195 (33%), Gaps = 2/195 (1%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG--LKPESYE 281 W +EV + + + + + ++ + LK + Sbjct: 6 WYDKCMPDFEVSKLKYVSDIITGNTPSKLNESFYENGIIDWVKPNNITDDYRLLKSKDKL 65 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + + V +V R L ++ + E + T+ + + + + Sbjct: 66 SIKGVRKARVVPRNSTLVCAIGTIGKLALSEEEVTTNQQINSVIFTKINKKYGFYILVCM 125 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + + L ++ L ++ P Q I ++ + + ID L+ E+ I Sbjct: 126 ENEFKKYSNKVVVSILNKTSMENLKIISPSPIRQERICLFLDSKLSEIDFLISSKEKQIK 185 Query: 402 LLKERRSSFIAAAVT 416 LL+E+R + I AVT Sbjct: 186 LLEEQRQAMITEAVT 200 >gi|331007189|ref|ZP_08330402.1| Type I restriction-modification system, specificity subunit S [gamma proteobacterium IMCC1989] gi|330419021|gb|EGG93474.1| Type I restriction-modification system, specificity subunit S [gamma proteobacterium IMCC1989] Length = 288 Score = 159 bits (403), Expect = 6e-37, Method: Composition-based stats. Identities = 78/289 (26%), Positives = 134/289 (46%), Gaps = 7/289 (2%) Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + I N + +PP EQV I + +T +ID I + + I LLKE Sbjct: 1 MKLLGSGVRQTISFNHIANSLLILPPETEQVAIANFLDQKTAQIDEAIAIKEKQIALLKE 60 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +KQ ++ VT+GLNPDV MKDSG++W+G +PDHW VK ++ E N ++ E + Sbjct: 61 RKQIIIQKAVTQGLNPDVPMKDSGVDWIGQIPDHWGVKRLKYVLDERNERSKTGEEPLFM 120 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++ + + + S ++V ++VF + + + G+++ Sbjct: 121 VSQVHGLVVRADYHDKAEVAASNIDNKVVYKNDLVFNKLKA--HLGVFFKSNIEFEGLVS 178 Query: 318 SAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLRQ---SLKFEDVKRLPVLVPPI 372 Y K D YL L R + F +G+ + L D+ +PV + P Sbjct: 179 PDYAVYKCKAHIADVKYLELLFRHSSYIEQFIIRATGIVEGLIRLYTGDLFDIPVPIAPE 238 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ +I I ++ D V+ ++ I LKE +++ I +AVTG+I + Sbjct: 239 NEQLEILAYIEKQSKTFDRAVDLQQRQIQKLKEYKTTLINSAVTGKIKV 287 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 46/208 (22%), Positives = 80/208 (38%), Gaps = 8/208 (3%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 KDSGV WIG IP HW V +K R+ + + + Y K Sbjct: 80 MKDSGVDWIGQIPDHWGVKRLKYVLDERNERSKTGEEPLFMVSQVHGLVVRADYHDK--A 137 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAI--IADFDGICSTQFLVLQPKDVLPELLQ-- 125 + + K +++ KL +L +F+G+ S + V + K + ++ Sbjct: 138 EVAASNIDNKVVYKNDLVFNKLKAHLGVFFKSNIEFEGLVSPDYAVYKCKAHIADVKYLE 197 Query: 126 GWLLSIDVTQRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ G + + +IP+PI P EQ+ I I ++ D Sbjct: 198 LLFRHSSYIEQFIIRATGIVEGLIRLYTGDLFDIPVPIAPENEQLEILAYIEKQSKTFDR 257 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGL 211 + + R I+ LKE K L++ VT + Sbjct: 258 AVDLQQRQIQKLKEYKTTLINSAVTGKI 285 >gi|147920296|ref|YP_685933.1| type I restriction modification system, specificity subunit [uncultured methanogenic archaeon RC-I] gi|110621329|emb|CAJ36607.1| type I restriction modification system, specificity subunit [uncultured methanogenic archaeon RC-I] Length = 484 Score = 159 bits (402), Expect = 7e-37, Method: Composition-based stats. Identities = 83/436 (19%), Positives = 154/436 (35%), Gaps = 39/436 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P W + + + + I YIGLE +E TGK L ++ TST Sbjct: 6 ELPTGWCSTDLGDIISPSKEKIEPVKTESIPYIGLEHIEKDTGKLLSFGNST--EVTSTK 63 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-SIDVTQRI 137 ++F KG +LYGKL PYL K + + DGICST LV + L L + + D + Sbjct: 64 TVFHKGDLLYGKLRPYLNKVCVTEIDGICSTDILVFNEQRFLSNKLLKYRMLCPDFVRYA 123 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G D+K I + + +PPLAEQ I KI ++D + + E +K+ Sbjct: 124 NQNATGVNHPRVDFKKIASFEIALPPLAEQHRIVAKIEELFTQLDAGVEALKKAKEQIKQ 183 Query: 198 KKQALVSYIVTKGLNPDVKM--------------------------KDSGIEWVGLVPDH 231 +QA++ L ++ +E +P+ Sbjct: 184 YRQAVLESAFNGKLTEKWRLSSKEYIAPISEFISNVQKTRSTDGKTVCDQLESTLEMPNG 243 Query: 232 WEVKPFFALVTELNR------KNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYET 282 W + + I ++ + + T+ E Sbjct: 244 WLGVLLYQIADIGTGATPLRSNKNYYENGTIPWITSSAVNSQYITKADEFITELAIKETN 303 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +I ++ + + + A + + L + Sbjct: 304 AKIFPKNSLIIALYGEGKTRGKVSELLIEAATNQACAAIIFNDQTVVLKPFIKLYFQKNY 363 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + G++ +L +K + +PP+ EQ I I + ++ + + I+QS+ Sbjct: 364 EDLRKLASGGVQPNLNLGIIKSTLIPLPPLAEQEIIVGEIEKKFPIMEDIEKTIDQSLSY 423 Query: 403 LKERRSSFIAAAVTGQ 418 + R S ++ A +G+ Sbjct: 424 SETLRQSILSQAFSGK 439 >gi|124515150|gb|EAY56661.1| probable restriction endonuclease, S subunit [Leptospirillum rubarum] Length = 232 Score = 159 bits (402), Expect = 8e-37, Method: Composition-based stats. Identities = 93/239 (38%), Positives = 139/239 (58%), Gaps = 10/239 (4%) Query: 1 MKH--YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVES 58 M + YP+YKDSGV+W+G +P+HW+V +K L+T + + LE++ES Sbjct: 1 MNQSPWPPYPKYKDSGVEWLGELPEHWEVKKLKYCLLLSTRKIEPQKSQ---VALENIES 57 Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 TG+++ + + F +G IL+GKL PYL K +A F G F V++P Sbjct: 58 WTGRFIETETKFEGDGIA----FEEGDILFGKLRPYLAKVFLAQFSGEAVGDFFVMRPFP 113 Query: 119 VLPELLQGWLLSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + I++ GA M DW+ +GN+ + +P L+EQ+ I + E Sbjct: 114 TTDGRFIQYQILNKTFISIIDSSTFGAKMPRVDWEFMGNMELTLPSLSEQLAIASFLDRE 173 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 T RIDTLI+E+ R I LL+E +QAL+S+ VTKGL+P VKMKDSG+EW+G VP+HWE+ Sbjct: 174 TSRIDTLISEKERLISLLQEYRQALISHAVTKGLDPKVKMKDSGVEWLGEVPEHWEIYK 232 Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats. Identities = 48/205 (23%), Positives = 88/205 (42%), Gaps = 9/205 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P K KDSG+EW+G +P+HWEVK + RK L I+ R Sbjct: 8 PYPKYKDSGVEWLGELPEHWEVKKLKYCLLLSTRKIEPQKSQVALE-----NIESWTGRF 62 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + + + + G+I+F + K L + ++ D + Sbjct: 63 IETETKFEGDGIAFEEGDILFGKLRPYLAKVFLAQFSGE---AVGDFFVMRPFPTTDGRF 119 Query: 333 LAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + + + + + G + +E + + + +P + EQ I + ++ ET+RID Sbjct: 120 IQYQILNKTFISIIDSSTFGAKMPRVDWEFMGNMELTLPSLSEQLAIASFLDRETSRIDT 179 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVT 416 L+ + E+ I LL+E R + I+ AVT Sbjct: 180 LISEKERLISLLQEYRQALISHAVT 204 >gi|256375104|ref|YP_003098764.1| restriction modification system DNA specificity domain protein [Actinosynnema mirum DSM 43827] gi|255919407|gb|ACU34918.1| restriction modification system DNA specificity domain protein [Actinosynnema mirum DSM 43827] Length = 442 Score = 159 bits (401), Expect = 9e-37, Method: Composition-based stats. Identities = 85/435 (19%), Positives = 155/435 (35%), Gaps = 38/435 (8%) Query: 18 IGAIP--KHWKVVPIKRFTKL-NTGRTSESGKDIIY--IGLEDVESGTGKYLPKDGNSRQ 72 +G IP W P+KR T + N G E + I + G + ++ Sbjct: 5 LG-IPISDTWTTSPLKRITSVLNRGSAPEYVDESPVRVISQAANQYGGLDWSRTRFHNFN 63 Query: 73 SDTSTVS-IFAKGQILYGKLGP-YLRKAIIADFD-----GICSTQFLVLQPKDV--LPEL 123 D + + + I+ G L + + V++ K P Sbjct: 64 GDPTKLKGHLQENDIIINSTGTGTLGRVGYFTEPLNGIPCMADGHVTVVRVKKHKVNPRF 123 Query: 124 LQGWLLSIDVTQRIEAI--CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + WL S + I + + + + +P PP++EQ I + + AET I Sbjct: 124 VYYWLTSKPFQEYIHSSLAIGATNQIELNRDRLSDTHIPNPPISEQQRIVDFLEAETAHI 183 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF--- 238 D LI + R +E L E++ A ++ V+ + S + W+ +P W+ Sbjct: 184 DRLIETQNRVLEKLAERRMAGITQAVSG--TDQTGTRPSSLTWLEKIPSTWKEVRLSLIA 241 Query: 239 ---ALVTELNRKNTKLIESNILSLSYG--NIIQKLETRNMGLKPESYETYQIV------- 286 + T ++ I ++ G ++ ++ E + Sbjct: 242 RMGSGHTPSRSHPEWWVDCTIPWITTGEVRQVRNDRLEDLHETREKISELGLANSAAELR 301 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G +V + + ++ YL W +R+ + Sbjct: 302 PAGTVVLCRTASAGYSAVMG----TDMATSQDFVTWTCGPRLNPYYLLWCLRAMRPDLLG 357 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +++ D++ L + +PPI EQ I I + ARID L + + + LL ER Sbjct: 358 RLAMGSTHKTIYVPDLQMLRIPLPPIGEQQKIVQQIREQNARIDRLADAVRLQVALLAER 417 Query: 407 RSSFIAAAVTGQIDL 421 R + I AAVTGQID+ Sbjct: 418 RQALITAAVTGQIDV 432 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 42/215 (19%), Positives = 78/215 (36%), Gaps = 14/215 (6%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKY 63 + S + W+ IP WK V + ++ +G T I +I +V Sbjct: 218 RPSSLTWLEKIPSTWKEVRLSLIARMGSGHTPSRSHPEWWVDCTIPWITTGEVRQVRNDR 277 Query: 64 LP------KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 L + + S + G ++ + + + D S F+ Sbjct: 278 LEDLHETREKISELGLANSAAELRPAGTVVLCRT-ASAGYSAVMGTDMATSQDFVTWTCG 336 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L W L + + G+T + + +P+PP+ EQ I ++I + Sbjct: 337 PRLNPYYLLWCLRAMRPDLLGRLAMGSTHKTIYVPDLQMLRIPLPPIGEQQKIVQQIREQ 396 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 RID L + LL E++QAL++ VT ++ Sbjct: 397 NARIDRLADAVRLQVALLAERRQALITAAVTGQID 431 >gi|256023434|ref|ZP_05437299.1| predicted type I restriction-modification enzyme, S subunit [Escherichia sp. 4_1_40B] Length = 446 Score = 159 bits (401), Expect = 9e-37, Method: Composition-based stats. Identities = 76/433 (17%), Positives = 139/433 (32%), Gaps = 30/433 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDG 68 YK + V G IP+ W VP K NT + S + + +IG++DV + Sbjct: 22 YKLTEV---GVIPEDWDCVPFGNLFKTNTKKKKVSDYELVSFIGMQDVSED-AQLKNNTQ 77 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPY------LRKAIIADFDGICSTQFLVLQPKDVLP- 121 + S + F KG +L K+ P A + G ST+F VL+ + Sbjct: 78 LPFKEVKSGFTYFEKGDVLLAKITPCFENGKGCHTADLPTNVGFGSTEFHVLRENEDSDS 137 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG--NIPMPIPPLAEQVLIREKIIAETV 179 + W +E+ G+ + P L EQ I + + Sbjct: 138 RFIYFWTTDKKFRASLESEMVGSAGHRRVPLVAIEKYLIPCPPNLQEQSAIADSLSDINN 197 Query: 180 RIDTLITERIRFIELLKEKKQALVS---YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 I L ++ + Q L++ + L D K +G +P+ W V Sbjct: 198 FILALEKLIVKKQAIKTATMQRLLTGKTRLPQFALRKDGSAKGYKKSELGEIPEDWVVTS 257 Query: 237 FFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDP 288 +S G + K + + + V Sbjct: 258 IGQFTDCCAGGTPSTKISAYWGGTHPWMSSGELHLKQVYAVADYITDEGLVNSSTKYVPK 317 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 ++ Q R + +E S + +L + + S + Sbjct: 318 NSVLVGLAG-QGKTRGTVAINRIELCTNQSIAAIFPSKHHSTEFLFYNLDSRYEELRSLS 376 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G G R L +++L + PP +EQ I +++ I L +Q + ++ + Sbjct: 377 TGDGGRGGLNLTIIRKLHLAFPPKEEQTAIATILSDMDKEIQTL----QQRLDKTRQLKQ 432 Query: 409 SFIAAAVTGQIDL 421 + +TG+ L Sbjct: 433 GMMQELLTGKTRL 445 >gi|208779809|ref|ZP_03247153.1| conserved hypothetical protein [Francisella novicida FTG] gi|208744264|gb|EDZ90564.1| conserved hypothetical protein [Francisella novicida FTG] Length = 414 Score = 159 bits (401), Expect = 1e-36, Method: Composition-based stats. Identities = 63/409 (15%), Positives = 142/409 (34%), Gaps = 22/409 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQS- 73 + +P W+ + + G + +++++ + S Sbjct: 19 LYKLPAGWEWKKLGEVFDVKDGTHDSPKYKEIGYPLVTSKNLKNNSLDLTSCKFISNDDF 78 Query: 74 -DTSTVSIFAKGQILYGKLGPYLRKAII-ADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + S KG +L+ +G I+ + D L L ELL+ WL S Sbjct: 79 IKINQRSKVDKGDLLFAMIGTIGSPTIVDFEPDFAIKNVALFKPSNTYLIELLKYWLSSH 138 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 TQ++ +GAT + N P P+PPLAEQ I K+ + +ID I + Sbjct: 139 LTTQKMLEEAKGATQKFVGLTYLRNFPAPLPPLAEQKRIVAKLDSLFEKIDKAIELHQQN 198 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I + + K L + K + K + Sbjct: 199 ITNANTLMASALDKTFKK-LEREYSFKILD-------------CLSENIRYGYTDKAKEK 244 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + ++ N K + ++ + ++ + + G+I+ K +L + Sbjct: 245 GNARFIRITDINDQGKFKDESVYVDIKNTDLDRYKLLVGDILVARSGATAGKVALFTLDE 304 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLV 369 + ++ + +++ + S + + G + ++ ++K + + + Sbjct: 305 FSVFASYLIRIRLQIDKVLPSFIFYFCYSSNYWNQLDQIKIGGAQPNVNATNLKNIKIPL 364 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PP+ Q ++ ++D + + EQ + LK ++S + A G+ Sbjct: 365 PPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAFRGE 413 >gi|23452777|gb|AAN33159.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452791|gb|AAN33167.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 403 Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats. Identities = 69/409 (16%), Positives = 145/409 (35%), Gaps = 21/409 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ TG T GKD + D E G N + Sbjct: 4 LPQGWEVKKLGEIGEIVTGSTPSKSNLDFYGKDYPFFKPSDFEQGYF-LENAGDNLSKLG 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDV 133 IL +G L K + G C+ Q + P K+++ E + + +S Sbjct: 63 FDKARQLPPKTILVVCIGS-LGKVALTKVIGSCNQQINAIIPHKNIISEYIYYYCISSKF 121 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + T++ + + + P + EQ I + +ID I + + Sbjct: 122 QSILFSKAPQTTLAILNKTEFSKLEIIYPKDIKEQERIVGILDFAFSKIDENIKKAKENL 181 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV-PDHWEVKPFFALVTELNRKNTKL 251 + E Q+ + + + W D + + N + Sbjct: 182 ANIDELMQSALQKAFNPLNDNTKENYQLPQSWEWKSLGDTSNYGKTSQVKPSQLKGNDWI 241 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +E + G ++QK+ ++ K + + G+I+F + K + Sbjct: 242 LELEDIEKESGVLLQKVLFQDRQSKSNKIK----FNKGDILFGTLRPYLKKVIIA----D 293 Query: 312 ERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLV 369 + G +S M + I + ++ + + + L ++ G R L +D K L + + Sbjct: 294 DNGACSSEIMPFSTGNSITNHFIYYYLFANFLHDRISSLTYGARMPRLGTKDGKSLQIPL 353 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PP++EQ I ++ + L E + + +E + S + A G+ Sbjct: 354 PPLQEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLDKAFKGE 402 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 54/198 (27%), Positives = 89/198 (44%), Gaps = 8/198 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYI-GLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+ + N G+T K +I LED+E +G L K + Sbjct: 208 QLPQSWEWKSLGD--TSNYGKTSQVKPSQLKGNDWILELEDIEKESGVLLQKVLFQDRQS 265 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDV 133 S F KG IL+G L PYL+K IIAD +G CS++ + + + + +L + + Sbjct: 266 KSNKIKFNKGDILFGTLRPYLKKVIIADDNGACSSEIMPFSTGNSITNHFIYYYLFANFL 325 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 RI ++ GA M K ++ +P+PPL EQ I E + + L + ++ Sbjct: 326 HDRISSLTYGARMPRLGTKDGKSLQIPLPPLQEQEQIAEHLDFVFEKAKALKELYTKELK 385 Query: 194 LLKEKKQALVSYIVTKGL 211 +E KQ+L+ L Sbjct: 386 DYEELKQSLLDKAFKGEL 403 >gi|315231355|ref|YP_004071791.1| Type I restriction-modification system specificity subunit S [Thermococcus barophilus MP] gi|315184383|gb|ADT84568.1| Type I restriction-modification system specificity subunit S [Thermococcus barophilus MP] Length = 408 Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats. Identities = 72/418 (17%), Positives = 155/418 (37%), Gaps = 33/418 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSR 71 IG IP+ W+VV + + G+ + + Y+ E + + K + Sbjct: 10 IGEIPEDWQVVKLGKIIGYTKGKKPKMVAKEPKDGWLPYLSTEYLRNNNPTQFVKITGNE 69 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 I G IL G + +A + ST + K V L +LL Sbjct: 70 I-------IVEDGDILLLWDGSNAGEFFLAKKGVLSSTMVKIFLKKHVYDSLFLFYLLKH 122 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ +G + H D + + +P+PPL EQ I E + +D I + Sbjct: 123 R-EPFLKGQTKGTGIPHVDKNVLNALLLPLPPLEEQKQIAEIL----RTVDEAIEKTDLA 177 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-NTK 250 IE + K+ L+ ++TKG+ K +G +P+ W V + + K Sbjct: 178 IEKTERLKKGLMQRLLTKGIKHKRFKKT----EIGEIPEEWRVVRIGEVTGLFQYGLSIK 233 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKP----ESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + + + I E + + +K E ++ G+I+ + Sbjct: 234 MHDKGKYPIIKMDSIINGEVKPVNIKYVDLDEDTFKKYRLEKGDILINRTNSYELVGRTG 293 Query: 307 SAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + + S + ++P ID +L + + + A + + ++ ++K+ Sbjct: 294 VFMLDGDYVFASYLIRIRPDKKQIDPRFLTFYLIFANDKLRQLATRAVSQANINASNLKK 353 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + +PP++EQ I ++ ++ E + + L+ + + +TG+ ++ Sbjct: 354 FKIPLPPLEEQKQIAEILMTVDKKL----ELLRKRKEKLERIKRGLMKDLLTGRRRVK 407 >gi|83590507|ref|YP_430516.1| restriction modification system DNA specificity subunit [Moorella thermoacetica ATCC 39073] gi|83573421|gb|ABC19973.1| Restriction modification system DNA specificity domain [Moorella thermoacetica ATCC 39073] Length = 442 Score = 158 bits (400), Expect = 1e-36, Method: Composition-based stats. Identities = 73/435 (16%), Positives = 160/435 (36%), Gaps = 31/435 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD 67 YK++ IG +P+ W+VV + + + R + + K+ + + + G + Sbjct: 10 EGYKETE---IGVLPEDWEVVRLGKVFEEVDRRVN-NVKNAASLPVLSLTKNNGIIPQTE 65 Query: 68 GNSR---QSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLV--LQPKDVL 120 + D S + K +++Y + I + G+ S + V + K Sbjct: 66 RFKKRIATDDLSNYKVVYKKELVYNPYVIWEGAIHILNRLEAGLVSPVYPVLSVNKKVAD 125 Query: 121 PELLQGWLLSIDVTQRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 WL + + G NI P+PPL EQ I + Sbjct: 126 AYFFDFWLRTPSAIKAYSRYASGAVNRRRAIRKTDFKNIDAPLPPLHEQRKIAYVL---- 181 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKP 236 I I + + I +E K++L+ ++ T G P ++ ++ +G+VP+HWEV Sbjct: 182 STIQRAIQLQDKVIAATRELKKSLMRHLFTYGPVPVDQIDRVPLKETEIGMVPEHWEVVR 241 Query: 237 FFALVTELNRKNTKLIESNILSLSY--GNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 + + NI + I + + + + + G+++ Sbjct: 242 LREVADFTKKPRGLNYSGNIPFIPMELIPIGRVNIQKYIIKPSSEISSGVYCEQGDLLLA 301 Query: 295 FI--DLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVF--YA 348 I +N K+ + S T+ + + ++ YL + + + + Sbjct: 302 KITPSFENYKQGIISQIPKPFAFATTEVYPIKARKDFLEILYLFYYLLIPQVRQDIAGKM 361 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G+ RQ + ++ + +PP+ EQ I + +I+ E+ +S L+ Sbjct: 362 EGTTGRQRISKSVIQNYLIPIPPLSEQRQIARFLITVDKKIEA--EEYRKS--TLQSLFQ 417 Query: 409 SFIAAAVTGQIDLRG 423 + + +TG++ ++ Sbjct: 418 TMLHLLMTGKVRVKD 432 >gi|289706815|ref|ZP_06503158.1| type I restriction modification DNA specificity domain protein [Micrococcus luteus SK58] gi|289556500|gb|EFD49848.1| type I restriction modification DNA specificity domain protein [Micrococcus luteus SK58] Length = 410 Score = 158 bits (399), Expect = 2e-36, Method: Composition-based stats. Identities = 90/407 (22%), Positives = 165/407 (40%), Gaps = 15/407 (3%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 L + E + + E + TG+ + + ++ F G +L Sbjct: 8 RKFGWCVGLVSDTAPEESE--FRVAAESMVGHTGRLVTDHEIDSEGRGTS---FRAGDLL 62 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT-QRIEAICEGATM 146 + KL PYL K+ +A+ DG V +P D + G+L+ +++ A G M Sbjct: 63 FSKLRPYLAKSWVANRDGEALGDIHVYRPVDEMCSRYLGYLVLSSFFLEQVNASTYGTRM 122 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 A+W I I + P Q I + + ET ID LI ++ + LL +++ ++ ++ Sbjct: 123 PRANWDFIKTIEVWAPDFDTQRRIADYLDRETATIDALIEKQRALLTLLIDRRASVRKHL 182 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 +G M + EW G +P HW P ++ + + + + I Sbjct: 183 ALRGPESRTSMVQAPEEWAGQIPSHWRFVPLLSVARLGSGHTPSKSRPELWTDTTIPWIS 242 Query: 267 KLETRNMGLKPESYETYQIVDP--------GEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + +M YET+ + + + L D R+A + + Sbjct: 243 LRDVGSMRATTYLYETHTSISELGLASSSARILPAGTVVLSRDATIGRTAIMGRDMATSQ 302 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + A L+ + + ++ +++ D++ L V +PP+ EQ Sbjct: 303 HFAAWTCGPQLLPQYLHLVLADAMQDHLESLTDGSTLRTVGMGDIRALRVPLPPVHEQRR 362 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 I + ETA+ID L+ K E+ I L +ERR++ I AAVTGQI++ E Sbjct: 363 IIDESETETAKIDALIAKAERFIELAQERRAALITAAVTGQIEIPSE 409 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 55/207 (26%), Positives = 92/207 (44%), Gaps = 12/207 (5%) Query: 16 QWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVES-GTGKYLPKD 67 +W G IP HW+ VP+ +L +G T I +I L DV S YL + Sbjct: 199 EWAGQIPSHWRFVPLLSVARLGSGHTPSKSRPELWTDTTIPWISLRDVGSMRATTYLYET 258 Query: 68 GNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 S +S+ I G ++ + + + I D S F L Sbjct: 259 HTSISELGLASSSARILPAGTVVLSR-DATIGRTAIMGRDMATSQHFAAWTCGPQLLPQY 317 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +L+ + +E++ +G+T+ I + +P+PP+ EQ I ++ ET +ID L Sbjct: 318 LHLVLADAMQDHLESLTDGSTLRTVGMGDIRALRVPLPPVHEQRRIIDESETETAKIDAL 377 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGL 211 I + RFIEL +E++ AL++ VT + Sbjct: 378 IAKAERFIELAQERRAALITAAVTGQI 404 >gi|326387108|ref|ZP_08208718.1| type I restriction-modification methylase S subunit [Novosphingobium nitrogenifigens DSM 19370] gi|326208289|gb|EGD59096.1| type I restriction-modification methylase S subunit [Novosphingobium nitrogenifigens DSM 19370] Length = 318 Score = 158 bits (398), Expect = 2e-36, Method: Composition-based stats. Identities = 79/303 (26%), Positives = 138/303 (45%), Gaps = 16/303 (5%) Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + E+ GAT + + + N +PP AEQ I + E +ID L E+ R I LL Sbjct: 3 QWESSIGGATFRALNLEPLANTLGCLPPFAEQEAIAGFLDREVGKIDRLAAEQERLIALL 62 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 KEK+QA++S+ VTKGLNP+ +KDSGIEW+ +P HWEV +V + + + ++ Sbjct: 63 KEKRQAVISHAVTKGLNPNAPLKDSGIEWLCQIPAHWEVVRIKHVVVTIEQGWSPQCDAT 122 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYET----------YQIVDPGEIVFRFIDLQNDKRSL 305 + K+ N S + G+++ + + S Sbjct: 123 PADGPEQWGVLKVGCVNGDRFNASENKALPDDLEPLPELSLRAGDLLISRANTRELVGSA 182 Query: 306 RSAQVMERGIITSA---YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKF 359 + ++ + ++ +L +RS + SG ++ Sbjct: 183 ALVEQDHDHLLLCDKLYRLRLQTSVASPEFLTLFLRSSMVRGQIEIAASGASSSMLNIGQ 242 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + + + +PP+ EQ +I I +I+ L+ + +I LL+ERR++ I+AAVTG+I Sbjct: 243 SVILEMALPLPPLGEQGEIATWILKCREQIEALINDAQSAITLLQERRAALISAAVTGKI 302 Query: 420 DLR 422 D+R Sbjct: 303 DVR 305 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 56/231 (24%), Positives = 92/231 (39%), Gaps = 17/231 (7%) Query: 11 KDSGVQWIGAIPKHWKVVPIKR-FTKLNTGRTS-------ESGKDIIYIGLEDVESGTGK 62 KDSG++W+ IP HW+VV IK + G + + + + + V Sbjct: 85 KDSGIEWLCQIPAHWEVVRIKHVVVTIEQGWSPQCDATPADGPEQWGVLKVGCVNGDRFN 144 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGK--LGPYLRKAII----ADFDGICSTQF-LVLQ 115 + G +L + + A + D +C + L LQ Sbjct: 145 ASENKALPDDLEPLPELSLRAGDLLISRANTRELVGSAALVEQDHDHLLLCDKLYRLRLQ 204 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPM--PIPPLAEQVLIREK 173 PE L +L S V +IE GA+ S + + M P+PPL EQ I Sbjct: 205 TSVASPEFLTLFLRSSMVRGQIEIAASGASSSMLNIGQSVILEMALPLPPLGEQGEIATW 264 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 I+ +I+ LI + I LL+E++ AL+S VT ++ ++ +E Sbjct: 265 ILKCREQIEALINDAQSAITLLQERRAALISAAVTGKIDVRAAAANTTVEM 315 >gi|302037816|ref|YP_003798138.1| putative type I restriction-modification system, specificity protein [Candidatus Nitrospira defluvii] gi|300605880|emb|CBK42213.1| putative Type I restriction-modification system, specificity protein [Candidatus Nitrospira defluvii] Length = 444 Score = 158 bits (398), Expect = 2e-36, Method: Composition-based stats. Identities = 72/420 (17%), Positives = 135/420 (32%), Gaps = 21/420 (5%) Query: 25 WKVVPIKR-FTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTS 76 W + IK +K+ +G T S I + ++V + + Sbjct: 16 WPLDRIKDNVSKIGSGVTPTGGATSYSDSGIPLLRSQNVHFEGIRLDDVAFIDEEIHAEM 75 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWL-LSID 132 + + +L G + + +G + +++P L + + Sbjct: 76 RGTQLKEKDVLLNITGASIGRCTFVPDGFGEGNVNQHVCIIRPSSRLDHRFLTYCLAAPW 135 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +I A GA+ + +G I +P+P Q + + A ID + + R I Sbjct: 136 GQDQIFAGFTGASRQGLGQRDLGEIQIPLPDRTTQEKVIAYLDASCAAIDAAVAAKRRQI 195 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN---- 248 E L+ +++ ++ + +GLNP V++K SG W+G +P HW L+ E Sbjct: 196 EALERTRKSTITRAMVRGLNPAVQLKTSGQHWLGNIPTHWTAPSLKRLLIEPLTYGLNEA 255 Query: 249 ---TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 L ++ + L P + +++F K L Sbjct: 256 AELEDRELPRYLRITDFDESGALRDDTFRSLPREVAREAPLVTNDVLFARSGATVGKTFL 315 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVK 363 + A + +L + Q++ Sbjct: 316 FRDYQGDACFAGYLIRARTAPWKINPLFLYLFTKTTAYETWKNLTFTQATIQNISAAKYN 375 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 L + +PP+ EQ I + A L I + I L R S I VTGQ + Sbjct: 376 YLVIPLPPLSEQHSICGFVEQCNADFARLTASINRQITTLTAYRKSLIHECVTGQRRITE 435 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 71/208 (34%), Gaps = 12/208 (5%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKL-----NTGRTSESGKDII-YIGLEDVESGTGKYL 64 K SG W+G IP HW +KR +++ Y+ + D + +G Sbjct: 221 KTSGQHWLGNIPTHWTAPSLKRLLIEPLTYGLNEAAELEDRELPRYLRITDFDE-SGALR 279 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQ--PKDV 119 S + + + +L+ + G + K + D + + + P + Sbjct: 280 DDTFRSLPREVAREAPLVTNDVLFARSGATVGKTFLFRDYQGDACFAGYLIRARTAPWKI 339 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 P L + + AT+ + + +P+PPL+EQ I + Sbjct: 340 NPLFLYLFTKTTAYETWKNLTFTQATIQNISAAKYNYLVIPLPPLSEQHSICGFVEQCNA 399 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIV 207 L R I L +++L+ V Sbjct: 400 DFARLTASINRQITTLTAYRKSLIHECV 427 >gi|294637839|ref|ZP_06716110.1| restriction endonuclease S subunit [Edwardsiella tarda ATCC 23685] gi|291089013|gb|EFE21574.1| restriction endonuclease S subunit [Edwardsiella tarda ATCC 23685] Length = 284 Score = 157 bits (397), Expect = 3e-36, Method: Composition-based stats. Identities = 75/265 (28%), Positives = 125/265 (47%), Gaps = 15/265 (5%) Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + ET +ID LI ++ + IELLKEK+QA++S+ VTKGLNPDV MKDSG+EW+G VP Sbjct: 9 IVSFLEHETAKIDNLIEKQQQLIELLKEKRQAVISHAVTKGLNPDVPMKDSGVEWLGEVP 68 Query: 230 DHWEVKPFFALVTELNRK-------NTKLIESNILSLSYGNI---IQKLETRNMGLKPES 279 +HW +K + K + L + + + G++ + +ET + L + Sbjct: 69 EHWSIKSYRYACLIYRGKFGHRPRNDPSLYDGDYPFIQTGDVARASKFIETYSQTLNEKG 128 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Q+ G ++ D L ++ + M + Sbjct: 129 KAVSQLFPSGTLMMAIAANIGDTAILGFEAYAPDSVVG---FKPYQNLHLEFLRYSFMAA 185 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + +L + + + + PP++EQ DI N ++ + E Q+ Sbjct: 186 LPALEQ--TSTQSTQANLNIDRIGAVKAVFPPLEEQLDIINYLDDMLYLYYSIEENTNQA 243 Query: 400 IVLLKERRSSFIAAAVTGQIDLRGE 424 I LL+ERR++ I+AAVTG+ID+R Sbjct: 244 IQLLQERRAALISAAVTGKIDVRDW 268 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 80/211 (37%), Gaps = 10/211 (4%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTG 61 KDSGV+W+G +P+HW + + + G+ D +I DV + Sbjct: 56 MKDSGVEWLGEVPEHWSIKSYRYACLIYRGKFGHRPRNDPSLYDGDYPFIQTGDVARASK 115 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + + +F G ++ + + I F+ + +P L Sbjct: 116 FIETYSQTLNEKGKAVSQLFPSGTLMMA-IAANIGDTAILGFEAYAPDSVVGFKPYQNLH 174 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + +E +T ++ + IG + PPL EQ+ I + Sbjct: 175 LEFLRYSFMAAL-PALEQTSTQSTQANLNIDRIGAVKAVFPPLEEQLDIINYLDDMLYLY 233 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLN 212 ++ + I+LL+E++ AL+S VT ++ Sbjct: 234 YSIEENTNQAIQLLQERRAALISAAVTGKID 264 >gi|188997268|ref|YP_001931519.1| restriction modification system DNA specificity domain [Sulfurihydrogenibium sp. YO3AOP1] gi|188932335|gb|ACD66965.1| restriction modification system DNA specificity domain [Sulfurihydrogenibium sp. YO3AOP1] Length = 425 Score = 157 bits (397), Expect = 3e-36, Method: Composition-based stats. Identities = 72/433 (16%), Positives = 156/433 (36%), Gaps = 37/433 (8%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDG 68 +K++ IG IP+ W+VV + + + K+I+ + + + + Sbjct: 7 FKETE---IGLIPEDWEVVRLGEILEEKNEKVKNYDFKNIVVLSITSKDGLIEQNRKFKH 63 Query: 69 NSRQSDTSTVSIFAKGQILYG-KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + S + KG+++YG + + + G S + + K Sbjct: 64 RVASQNISDYKLVRKGELVYGFPINEGVIAFLWRYEMGAVSPAYYTWKLKYPEKTYYIFL 123 Query: 128 -----LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 I + IP+P+PPL EQ I + + + Sbjct: 124 DYLLRSPIILNLFKPFISNTVHRRKIIKPHDFKQIPIPLPPLEEQKAIADIL----STVQ 179 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKG---LNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 I + + I K+ K++++ ++ T G ++ K+K E +GL+P+HWEV F Sbjct: 180 NAIEKTEKVINATKQLKKSMMKHLFTYGAVVVDEIDKVKLKESE-IGLIPEHWEVVRFGD 238 Query: 240 LVTELNRKNT------KLIESNILSLSYGNIIQKLETRNMGLKPE----SYETYQIVDPG 289 +V + + +S ++ + + E ++ G Sbjct: 239 IVNFKIGRTSPRKNKDYWTNGKYYWVSISDMKNRYINNTSEMVSEKAHKEIFKEKLTPAG 298 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ F L II+ + K + + +L + + + D + Sbjct: 299 TLLMSFKLTIGRTAILNVDAYHNEAIIS---IYPKENKVLKEFLFYYLPAVDYSNLQDKA 355 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G +L + ++P+ +P + EQ I N++ ID ++ E+ L+ + Sbjct: 356 IKG--NTLNTSKLNKIPIPLPLLDEQQKIANILTT----IDQKIQAEEKKKEALQNLFKT 409 Query: 410 FIAAAVTGQIDLR 422 + +TG+I +R Sbjct: 410 LLQQLMTGKIRVR 422 >gi|237807949|ref|YP_002892389.1| restriction modification system DNA specificity domain-containing protein [Tolumonas auensis DSM 9187] gi|237500210|gb|ACQ92803.1| restriction modification system DNA specificity domain protein [Tolumonas auensis DSM 9187] Length = 445 Score = 157 bits (396), Expect = 3e-36, Method: Composition-based stats. Identities = 71/438 (16%), Positives = 140/438 (31%), Gaps = 28/438 (6%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTG-------RTSESGKDIIYIGLEDVESGT 60 YK + V G IP+ W + + TG ++ I + + G Sbjct: 10 EGYKQTEV---GVIPEDWDIQRLGVHATFKTGPFGSALHKSDYVDGGIPVVNPMQIIDGK 66 Query: 61 GKYLPK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQ-FLVLQP 116 K + + + G I+ G+ G R A+I + +C T +V Sbjct: 67 VKPTSSMAISDEAAKKLSEYRLIAGDIVIGRRGDMGRCAVISEIENGWLCGTGSMIVRVK 126 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKII 175 ++ LQ L + IE+ G TM + + + + + IP EQ I + Sbjct: 127 ENADAAFLQRVLSNPQTITAIESASVGTTMINLNQGTLRALLILIPRDKQEQTAIANALS 186 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVS---YIVTKGLNPDVKMKDSGIEWVGLVPDHW 232 I+ L + + Q L++ + L D K +G +P+ W Sbjct: 187 DVDALINELEKLIAKKQAIKTATMQQLLTGKTRLPQFALREDGTPKGYKASELGEIPEDW 246 Query: 233 EVKPFFALVTELNRKNTKLI---ESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDP 288 EV + + E L L N+ L N IV Sbjct: 247 EVVSLAEIGQTIIGLTYSPNDVAEHGTLVLRSSNVQNNVLAYDNNVYVNMDLPERVIVKK 306 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G+I+ + + ++ + +S + Sbjct: 307 GDILICVRNGSRQLIGKCALIDKNADGAAFGAFMSIFRTKSFGFVFYQFQSDIIQNQINE 366 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + + +D+ + +P + KEQ IT++++ I L +Q + ++ + Sbjct: 367 IMGATINQITNKDMAGFRIPLPTLQKEQVAITSILSDMDTEIQSL----QQRLTKTRQIK 422 Query: 408 SSFIAAAVTGQID-LRGE 424 + +TG+ ++ E Sbjct: 423 QGMMQELLTGKTRLVKPE 440 >gi|78777142|ref|YP_393457.1| restriction modification system DNA specificity subunit [Sulfurimonas denitrificans DSM 1251] gi|78497682|gb|ABB44222.1| Restriction modification system DNA specificity domain [Sulfurimonas denitrificans DSM 1251] Length = 420 Score = 157 bits (396), Expect = 3e-36, Method: Composition-based stats. Identities = 71/428 (16%), Positives = 156/428 (36%), Gaps = 31/428 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLN--TGRTS-ESGKDIIYIGLEDVESGTGKYLPK 66 YK + V G IP+ W+VV IK T G+T ++GK I + ++++ G Y Sbjct: 8 YKQTKV---GIIPEDWEVVKIKEATSYVDYRGKTPIKTGKGIFLVTAKNIKQGFIDYEAS 64 Query: 67 DGNSRQSDTS---TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PE 122 + + + G IL P A I + + + + + K + + Sbjct: 65 SEFVSEVEYHEIMKRGMPKIGDILITTEAPLGNVAQIDKENIALAQRVIKFRSKKNVKND 124 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L+ + LS + + G T+ K + N+ + +PPL EQ I + + I Sbjct: 125 FLKHYFLSNRFQSYLYRMAIGTTVLGIQGKELHNMSIVLPPLKEQEKIAQILTTWDEAIT 184 Query: 183 TLITERIRFIELLKEKKQALVSYIVTK--GLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 L K Q L+S V + + + + + P + Sbjct: 185 KQTELLEAKELLKKALMQKLLSGEVRFSGFSDEWEEARLDKLVFFQEGP---------GV 235 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 RK+ + + + + ET + + ++D G+++ + + Sbjct: 236 RNTQYRKSGVKLLNVGNLNNNTLNLSSTETYISEEEAYGAYKHFLIDEGDLLISCSGINS 295 Query: 301 DKRSLRSAQVMERGI-----ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLR 354 + + A + + ++ + + YL + ++ K + + + Sbjct: 296 ESFKKKIAFAKKEDLPLCMNTSTMRFKNLKNKLLLEYLYFFFQTLFFEKQVFGVLTGSAQ 355 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + +K + +P + EQ I V++V I+ L + + LK ++ + + Sbjct: 356 FNFGPTHIKWFKIKLPTLPEQQKIAEVLSVADDEINQL----KSELEELKLQKKALMQQL 411 Query: 415 VTGQIDLR 422 +TGQ+ ++ Sbjct: 412 LTGQVRVK 419 >gi|89098144|ref|ZP_01171029.1| type I restriction modification system, subunit S [Bacillus sp. NRRL B-14911] gi|89087001|gb|EAR66117.1| type I restriction modification system, subunit S [Bacillus sp. NRRL B-14911] Length = 435 Score = 157 bits (396), Expect = 4e-36, Method: Composition-based stats. Identities = 73/434 (16%), Positives = 137/434 (31%), Gaps = 29/434 (6%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGT 60 +YK + +G IP W+V IK + +G T I++ D+ Sbjct: 11 ERYKMTE---LGEIPVEWEVRLIKEVADVISGGTPSKAVTEYWNEGTILWATPTDITRNN 67 Query: 61 GKYLPK---DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 KY+ + S+ ++ G IL ++ IA + F Sbjct: 68 SKYIYETELSITELGLKKSSANLLPAGSILMTSRATIGERS-IATAPISTNQGFKSFVCH 126 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 D L + + Q G+T + I N M IPP EQ I E + Sbjct: 127 DGLSNE-YMYYYLEILKQYFLLNASGSTFLEVSKQVIENQVMAIPPHKEQQKIVEVLSTV 185 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWE 233 +I+ + EL K Q L++ + ++ + +EW + + Sbjct: 186 DEQIENTEQLIEKTKELKKGLMQQLLTKGIGHTEFKVTEIGEIPVEWEAKKLEDLISDKV 245 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG---- 289 V N I K S E G Sbjct: 246 VISHIDGNHGSLYPRASEFVDRGTPYISANSIVSGSIDFSKAKYLSEERGNKFKKGVAKN 305 Query: 290 -EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 +++F L+++ + + + +YL++ + S + Sbjct: 306 EDVLFAHNATVGPVAILKTSAPKVILSTSLTLYRCDNNFLLPSYLSYYLDSPMFKIQYQK 365 Query: 349 -MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 M R + ++ L+P I+EQ I N + + RI+ ++ E+ E + Sbjct: 366 VMSQTTRNQVPITAQRKFLFLIPTIQEQEIIANTLGLVDERINYFTQEKER----YTELK 421 Query: 408 SSFIAAAVTGQIDL 421 + +TG+I + Sbjct: 422 KGLMQQLLTGKIRV 435 >gi|224369051|ref|YP_002603215.1| HsdS2 [Desulfobacterium autotrophicum HRM2] gi|223691768|gb|ACN15051.1| HsdS2 [Desulfobacterium autotrophicum HRM2] Length = 426 Score = 157 bits (396), Expect = 4e-36, Method: Composition-based stats. Identities = 74/437 (16%), Positives = 157/437 (35%), Gaps = 38/437 (8%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSES------GKDIIYIGLEDVESGT 60 YK + + W IP+ W V + K+ +G T K + + +++ GT Sbjct: 5 EGYKKTKIGW---IPEDWDCVKLGGIVNKVGSGITPRGGSKVYCDKGVPFFRSQNILHGT 61 Query: 61 GKYLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQP 116 S + +L G + + + + G + +++P Sbjct: 62 VSVKDIVYISENLHQKMKNTHLQPADVLLNITGASIGRCCVFPNNFKKGNVNQHVCIIRP 121 Query: 117 KDVLPELLQG-WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + L S ++I G +++ I + +P+PPL EQ I + + Sbjct: 122 DGTIKSQYLCSLLNSPIGQKQIWNFQAGGNREGLNFQQIRSFILPLPPLPEQQKIADVL- 180 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 +D I+ + I+ ++ K+ L+ ++T+G+ + KD+ I G +P W+V Sbjct: 181 ---STVDDKISSIDQQIQQTEQLKKGLMEKLLTEGIG-HTEFKDTEI---GQIPASWDVV 233 Query: 236 PFFALVTELN-----RKNTKLIESNILSLSYGNI----IQKLETRNMGLKPESYETYQIV 286 + + + I + NI I + + + + Sbjct: 234 KLKTICHRIFVGIATSTSEHYTNDGIPIIRNQNIKENSISGDDLLKITNDFNEKNHSKKL 293 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV- 345 G+I+ + + T+ I YL+ + S K+ Sbjct: 294 MVGDIITARTGYPGM-SCVIPKKFEGAQTFTTLVSRPNKERIFPHYLSRYINSDIGKKIV 352 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G +Q+L +K +P+++PP++EQ I +++ +IDVL K Sbjct: 353 LSNQAGGAQQNLNAGRLKEIPIILPPLEEQKQIATILSSVDDKIDVLRSKKTS----YTT 408 Query: 406 RRSSFIAAAVTGQIDLR 422 + + +TGQ+ ++ Sbjct: 409 LKKGLMGQLLTGQMRVK 425 >gi|305432343|ref|ZP_07401506.1| iron-sulfur cluster assembly accessory protein [Campylobacter coli JV20] gi|304444691|gb|EFM37341.1| iron-sulfur cluster assembly accessory protein [Campylobacter coli JV20] Length = 404 Score = 156 bits (394), Expect = 6e-36, Method: Composition-based stats. Identities = 62/412 (15%), Positives = 136/412 (33%), Gaps = 28/412 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ G+T + I++ + D++S + ++ Sbjct: 4 LPQGWEVKKLGDIAEIQIGKTPSRNNIDFFQGENIWLSIRDLKSKFVSSSSEKISNEAIS 63 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + + KG +L L K A+ D + + K+ + T Sbjct: 64 KTNMKVVPKGTLLMS-FKLTLGKTAFAECDLYTNEAIAAIFIKNK-NINKYFLDYVLKFT 121 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + + + I + +P + EQ I + +ID I + + Sbjct: 122 DLEKYVDNAVKGKTLNKQKLKQIEILLPKNIKEQERIVGILDESFAKIDESIKILEQDLL 181 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L E Q+ + + ++ +P WE K + + Sbjct: 182 NLDELMQSALQKAFNPLKD--------NVKENYKLPQSWEWKSLGEIGEIITGTTPSKNN 233 Query: 254 SNILSLSYGNIIQKLETRNMGLKPES-------YETYQIVDPGEIVFRFIDLQNDKRSLR 306 N Y ++ +K S ++ + + I+ I K L Sbjct: 234 PNFYGNEYPLFKPSDLNGDIIIKYASDNLSKLGFDNARNLPKDTILVVCIGASIGKVGLS 293 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRL 365 I + S YL ++ S + S + + +L Sbjct: 294 GVNGSCNQQINAII---PNSAFTSKYLFFVCLSNYFQTILKKNASQTTLPIINKTEFSKL 350 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + +PPIKEQ IT+ ++ ++ + L + + I L+E ++S + A G Sbjct: 351 QIPLPPIKEQEQITSHLDELSSHVKNLKQNYQAQIKDLQELKNSLLDKAFKG 402 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 66/199 (33%), Gaps = 8/199 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+ + ++ TG T + D+ +G N + Sbjct: 207 KLPQSWEWKSLGEIGEIITGTTPSKNNPNFYGNEYPLFKPSDL-NGDIIIKYASDNLSKL 265 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-SID 132 K IL +G + K ++ +G C+ Q + P ++ S Sbjct: 266 GFDNARNLPKDTILVVCIGASIGKVGLSGVNGSCNQQINAIIPNSAFTSKYLFFVCLSNY 325 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ T+ + + +P+PP+ EQ I + + + L I Sbjct: 326 FQTILKKNASQTTLPIINKTEFSKLQIPLPPIKEQEQITSHLDELSSHVKNLKQNYQAQI 385 Query: 193 ELLKEKKQALVSYIVTKGL 211 + L+E K +L+ L Sbjct: 386 KDLQELKNSLLDKAFKGNL 404 >gi|149280202|ref|ZP_01886325.1| putative type I restriction-modification system, S subunit [Pedobacter sp. BAL39] gi|149229039|gb|EDM34435.1| putative type I restriction-modification system, S subunit [Pedobacter sp. BAL39] Length = 394 Score = 156 bits (394), Expect = 7e-36, Method: Composition-based stats. Identities = 97/405 (23%), Positives = 174/405 (42%), Gaps = 30/405 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG- 84 + +K + +G T ESG+ + G D T L K GN D S I +G Sbjct: 2 NQISVKYIFNIFSGSTPESGQAFFWDG--DHNWFTPDDLGKIGNKIYVDESNRKITDEGV 59 Query: 85 -----------QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 I+ K P I C+ +L+ K+ + + + Sbjct: 60 ENANLKFGVANSIIITKRAPI-GNLAITTLPSSCNQGCFILEQKNSDINVKYYYYYFLIQ 118 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ + G+T + + + P+P L++Q I + + E +ID LI ++ + + Sbjct: 119 KDKLNNLGRGSTFLELNADEMKSYKAPLPSLSQQNKIVDYLDNEVAKIDALIEKKTQLVT 178 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 +L+EKK+A+++ VTKGL+P+V MKDSGI+W+G +P HWE+ + + Sbjct: 179 ILEEKKKAVINQTVTKGLDPNVSMKDSGIQWLGYIPKHWELVKLKYVSNLKSGD------ 232 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 L NI ++ + + G G+++ L + +L Sbjct: 233 ----FLPAENIKEEGDFKVFGGNGARGYFDNYNHEGDLI-----LIGRQGALCGNINFAN 283 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + A+ + I WL + +L + + + L + +K L ++ PPI Sbjct: 284 EKFWATEHAIVCNPIALFDYYWLGKQLELMNLNQYSLAAAQPGLSVDVIKNLFIVFPPIN 343 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ I+N + + + V+KI SI LLKE+R++ I+AAV G+ Sbjct: 344 EQRSISNYLLELDKKNGLAVKKIRDSIDLLKEKRTAVISAAVNGE 388 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 49/206 (23%), Positives = 83/206 (40%), Gaps = 16/206 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 KDSG+QW+G IPKHW++V +K + L +G ++ E+++ G + G Sbjct: 201 SMKDSGIQWLGYIPKHWELVKLKYVSNLKSG---------DFLPAENIKE-EGDFKVFGG 250 Query: 69 NSRQSDTSTVSIFAKGQ-ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 N + + +G IL G+ G A+ + +V P + W Sbjct: 251 NGARGYFDNYN--HEGDLILIGRQGALCGNINFANEKFWATEHAIVCNPIALFD---YYW 305 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L + A I N+ + PP+ EQ I ++ + + + Sbjct: 306 LGKQLELMNLNQYSLAAAQPGLSVDVIKNLFIVFPPINEQRSISNYLLELDKKNGLAVKK 365 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNP 213 I+LLKEK+ A++S V LN Sbjct: 366 IRDSIDLLKEKRTAVISAAVNGELNA 391 >gi|23452718|gb|AAN33132.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 403 Score = 156 bits (394), Expect = 7e-36, Method: Composition-based stats. Identities = 69/409 (16%), Positives = 145/409 (35%), Gaps = 21/409 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ TG T GKD + D E G N + Sbjct: 4 LPQGWEVKTLSEIGEIVTGSTPSKSNLDFYGKDYPFFKPSDFEQGYF-LENAGDNLSKLG 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDV 133 IL +G L K + G C+ Q + P K+++ E + + +S Sbjct: 63 FDKARQLPPKTILVVCIGS-LGKVALTKVIGSCNQQINAIIPHKNIISEYIYYYCISSKF 121 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + T++ + + + P + EQ I + +ID I + + Sbjct: 122 QSILFSKAPQTTLAILNKTEFSKLEIIYPKDIKEQERIVGILDFAFSKIDENIKKAKENL 181 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV-PDHWEVKPFFALVTELNRKNTKL 251 + E Q+ + + + W D + + N + Sbjct: 182 ANIDELMQSALQKAFNPLNDNTKENYQLPQSWEWKSLGDTSNYGKTSQVKPSQLKGNDWI 241 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +E + G ++QK+ ++ K + + G+I+F + K + Sbjct: 242 LELEDIEKESGVLLQKVLFQDRQSKSNKIK----FNKGDILFGTLRPYLKKVIIA----D 293 Query: 312 ERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLV 369 + G +S M + I + ++ + + + L ++ G R L +D K L + + Sbjct: 294 DNGACSSEIMPFSTGNSITNHFIYYYLFANFLHDRISSLTYGARMPRLGTKDGKSLQIPL 353 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PP++EQ I ++ + L E + + +E + S + A G+ Sbjct: 354 PPLQEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLDKAFKGE 402 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 54/198 (27%), Positives = 89/198 (44%), Gaps = 8/198 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYI-GLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+ + N G+T K +I LED+E +G L K + Sbjct: 208 QLPQSWEWKSLGD--TSNYGKTSQVKPSQLKGNDWILELEDIEKESGVLLQKVLFQDRQS 265 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDV 133 S F KG IL+G L PYL+K IIAD +G CS++ + + + + +L + + Sbjct: 266 KSNKIKFNKGDILFGTLRPYLKKVIIADDNGACSSEIMPFSTGNSITNHFIYYYLFANFL 325 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 RI ++ GA M K ++ +P+PPL EQ I E + + L + ++ Sbjct: 326 HDRISSLTYGARMPRLGTKDGKSLQIPLPPLQEQEQIAEHLDFVFEKAKALKELYTKELK 385 Query: 194 LLKEKKQALVSYIVTKGL 211 +E KQ+L+ L Sbjct: 386 DYEELKQSLLDKAFKGEL 403 >gi|19881267|gb|AAM00872.1|AF486555_3 HsdS [Campylobacter jejuni] Length = 411 Score = 156 bits (393), Expect = 8e-36, Method: Composition-based stats. Identities = 60/412 (14%), Positives = 136/412 (33%), Gaps = 19/412 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ TG T GKD + D E G N + Sbjct: 4 LPQGWEVKKLGEIGEIVTGSTPSKSNLDFYGKDYPFFKPSDFEQGYF-LENAGDNLSKLG 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 IL +G + A+ ++ K+++ E + + +S Sbjct: 63 FDKARQLPPKTILVVCIGSLGKVALTRVIGSCNQQINAIIPHKNIIAEYIYYYCISSKFQ 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + T++ + + + P + EQ I + +ID I + + Sbjct: 123 SILFSKAPQTTLAIFNKTEFSKLEIIYPKDIKEQERIVGILDESFAKIDESIKILEQDLL 182 Query: 194 LLKEKKQALVSYIVTKGLN--PDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNT 249 L E Q+ + + + G EW +G + + + + E+ Sbjct: 183 NLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEISNLIQNGFAASKNNEIPSGYV 242 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 L NI + N ++ + +K E ++ +I+F + Sbjct: 243 HLRTHNISTDGNLNFDTLIKIKREFIK----EKQSFIEKNDILFNNTNSTELVGKTALVT 298 Query: 310 VMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLP 366 ++ + +S + + K F + + + + +K++ Sbjct: 299 QNYNYAFSNHLTKIKLKNQYNSKLVVFYFVLLLKNKYFEKICHQWIGQSGINIDKLKKIQ 358 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 359 IPLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLNKAFKGE 410 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 70/204 (34%), Gaps = 12/204 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIY----IGLEDVES-GTGKYLPKDGNSRQS 73 +P+ W+ + + + G + +I + ++ + G + R+ Sbjct: 208 KLPQGWEWKSLGEISNLIQNGFAASKNNEIPSGYVHLRTHNISTDGNLNFDTLIKIKREF 267 Query: 74 DTSTVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPEL--LQGW 127 S K IL+ + +++ S ++ K+ + + Sbjct: 268 IKEKQSFIEKNDILFNNTNSTELVGKTALVTQNYNYAFSNHLTKIKLKNQYNSKLVVFYF 327 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 +L + + + S + + I +P+PPL EQ I E + + L Sbjct: 328 VLLLKNKYFEKICHQWIGQSGINIDKLKKIQIPLPPLKEQEQIAEHLDFVFEKAKALKEL 387 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 + ++ +E KQ+L++ L Sbjct: 388 YTKELKDYEELKQSLLNKAFKGEL 411 >gi|15678961|ref|NP_276078.1| type I restriction modification system, subunit S [Methanothermobacter thermautotrophicus str. Delta H] gi|2622039|gb|AAB85439.1| type I restriction modification system, subunit S [Methanothermobacter thermautotrophicus str. Delta H] Length = 407 Score = 156 bits (393), Expect = 9e-36, Method: Composition-based stats. Identities = 71/424 (16%), Positives = 161/424 (37%), Gaps = 34/424 (8%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTG 61 ++KDS V G IP W V + + TG T + D++++ D+ + G Sbjct: 5 EFKDSPV---GRIPVDWGVSRVSEVFDVFTGTTPSTKIDEFWDDGDVVWVTPADMSNLNG 61 Query: 62 KYL---PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 + + + + +++ K IL P + + + + L PK Sbjct: 62 IMIADSERKVTVKALKRTNLNLIPKLSILISTRAPV-GYVALNTVECVFNQGCKALVPKS 120 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + + L I+ + + + G+T + K + I +P+PPL EQ I E + Sbjct: 121 HVDTRYFAYYLLINKKRLQD-LSGGSTFKELNKKTLEKIYLPVPPLEEQKRISEILQDVD 179 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 I + + I + ++ K+ L+ ++ +G+N + KDS + G +P W+V Sbjct: 180 GA----IEKVNKEIGVTEKLKRGLMQRLLMEGIN-HTEFKDSHV---GRIPVDWDVVNLE 231 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 +V + K L E + + Y I + ++ Sbjct: 232 DVVEIHDNKRIPLSEKERIKMKGDYPYCGANGII------DYINDYIFNGEFVLLAEDGG 285 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + + + + + ++ +L+ + + + R+ L Sbjct: 286 DYSSFGSSAYIMNGKFWVNNHAHVIEA-LPSKITNRFLLHILIYLDLTHYVVGSTRKKLN 344 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++++ + +PP++EQ I+ ++ R+++L E+ V L+ + + +TG+ Sbjct: 345 QGIMRKIKIPLPPLEEQKRISEILQDVDRRLELLTERK----VKLENIKRGLMNDLLTGK 400 Query: 419 IDLR 422 +R Sbjct: 401 RRVR 404 >gi|148925704|ref|ZP_01809392.1| putative type I specificity subunit HsdS [Campylobacter jejuni subsp. jejuni CG8486] gi|157415770|ref|YP_001483026.1| hypothetical protein C8J_1451 [Campylobacter jejuni subsp. jejuni 81116] gi|19881216|gb|AAM00830.1|AF486546_4 HsdS [Campylobacter jejuni] gi|19881256|gb|AAM00863.1|AF486553_4 HsdS [Campylobacter jejuni] gi|19881280|gb|AAM00883.1|AF486557_4 HsdS [Campylobacter jejuni] gi|19881299|gb|AAM00895.1|AF486564_1 HsdS [Campylobacter jejuni] gi|23452712|gb|AAN33130.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452721|gb|AAN33133.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452731|gb|AAN33137.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452734|gb|AAN33138.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452736|gb|AAN33139.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452751|gb|AAN33145.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452754|gb|AAN33146.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452759|gb|AAN33148.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452761|gb|AAN33149.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|145845714|gb|EDK22805.1| putative type I specificity subunit HsdS [Campylobacter jejuni subsp. jejuni CG8486] gi|157386734|gb|ABV53049.1| hypothetical protein C8J_1451 [Campylobacter jejuni subsp. jejuni 81116] gi|307748412|gb|ADN91682.1| Putative type I specificity subunit HsdS [Campylobacter jejuni subsp. jejuni M1] gi|315931058|gb|EFV10033.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni 327] Length = 403 Score = 155 bits (392), Expect = 1e-35, Method: Composition-based stats. Identities = 69/409 (16%), Positives = 145/409 (35%), Gaps = 21/409 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ TG T GKD + D E G N + Sbjct: 4 LPQGWEVKTLSEIGEIVTGSTPSKSNLDFYGKDYPFFKPSDFEQGYF-LENAGDNLSKLG 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDV 133 IL +G L K + G C+ Q + P K+++ E + + +S Sbjct: 63 FGKARQLPPKTILVVCIGS-LGKVALTKVIGSCNQQINAIIPHKNIISEYIYYYCISSKF 121 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + T++ + + + P + EQ I + +ID I + + Sbjct: 122 QSILFSKAPQTTLAILNKTEFSKLEIIYPKDIKEQERIVGILDFAFSKIDENIKKAKENL 181 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV-PDHWEVKPFFALVTELNRKNTKL 251 + E Q+ + + + W D + + N + Sbjct: 182 ANIDELMQSALQKAFNPLNDNTKENYQLPQSWEWKSLGDTSNYGKTSQVKPSQLKGNDWI 241 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +E + G ++QK+ ++ K + + G+I+F + K + Sbjct: 242 LELEDIEKESGVLLQKVLFQDRQSKSNKIK----FNKGDILFGTLRPYLKKVIIA----D 293 Query: 312 ERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLV 369 + G +S M + I + ++ + + + L ++ G R L +D K L + + Sbjct: 294 DNGACSSEIMPFSTGNSITNHFIYYYLFANFLHDRISSLTYGARMPRLGTKDGKSLQIPL 353 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PP++EQ I ++ + L E + + +E + S + A G+ Sbjct: 354 PPLQEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLDKAFKGE 402 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 54/198 (27%), Positives = 89/198 (44%), Gaps = 8/198 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYI-GLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+ + N G+T K +I LED+E +G L K + Sbjct: 208 QLPQSWEWKSLGD--TSNYGKTSQVKPSQLKGNDWILELEDIEKESGVLLQKVLFQDRQS 265 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDV 133 S F KG IL+G L PYL+K IIAD +G CS++ + + + + +L + + Sbjct: 266 KSNKIKFNKGDILFGTLRPYLKKVIIADDNGACSSEIMPFSTGNSITNHFIYYYLFANFL 325 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 RI ++ GA M K ++ +P+PPL EQ I E + + L + ++ Sbjct: 326 HDRISSLTYGARMPRLGTKDGKSLQIPLPPLQEQEQIAEHLDFVFEKAKALKELYTKELK 385 Query: 194 LLKEKKQALVSYIVTKGL 211 +E KQ+L+ L Sbjct: 386 DYEELKQSLLDKAFKGEL 403 >gi|23452738|gb|AAN33140.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 411 Score = 155 bits (392), Expect = 1e-35, Method: Composition-based stats. Identities = 60/412 (14%), Positives = 136/412 (33%), Gaps = 19/412 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ TG T GKD + D E G N + Sbjct: 4 LPQGWEVKKLGEIGEIVTGSTPSKSNLDFYGKDYPFFKPSDFEQGYF-LENAGDNLSKLG 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 IL +G + A+ ++ K+++ E + + +S Sbjct: 63 FDKARQLPPKTILVVCIGSLGKVALTRVIGSCNQQINAIIPHKNIISEYIYYYCISSKFQ 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + T++ + + + P + EQ I + +ID I + + Sbjct: 123 SILFSKAPQTTLAIFNKTEFSKLEIIYPKDIKEQERIVGILDESFAKIDESIKILEQDLL 182 Query: 194 LLKEKKQALVSYIVTKGLN--PDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNT 249 L E Q+ + + + G EW +G + + + + E+ Sbjct: 183 NLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEISNLIQNGFAASKNNEIPSGYV 242 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 L NI + N ++ + +K E ++ +I+F + Sbjct: 243 HLRTHNISTDGNLNFDTLIKIKREFIK----EKQSFIEKNDILFNNTNSTELVGKTALVT 298 Query: 310 VMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLP 366 ++ + +S + + K F + + + + +K++ Sbjct: 299 QNYNYAFSNHLTKIKLKNQYNSKLVVFYFVLLLKNKYFEKICHQWIGQSGINIDKLKKIQ 358 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 359 IPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLNKAFKGE 410 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 70/204 (34%), Gaps = 12/204 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIY----IGLEDVES-GTGKYLPKDGNSRQS 73 +P+ W+ + + + G + +I + ++ + G + R+ Sbjct: 208 KLPQGWEWKSLGEISNLIQNGFAASKNNEIPSGYVHLRTHNISTDGNLNFDTLIKIKREF 267 Query: 74 DTSTVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPEL--LQGW 127 S K IL+ + +++ S ++ K+ + + Sbjct: 268 IKEKQSFIEKNDILFNNTNSTELVGKTALVTQNYNYAFSNHLTKIKLKNQYNSKLVVFYF 327 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 +L + + + S + + I +P+PPL EQ I + + + L Sbjct: 328 VLLLKNKYFEKICHQWIGQSGINIDKLKKIQIPLPPLKEQEQIAKHLDFVFEKTKALKEL 387 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 + ++ +E KQ+L++ L Sbjct: 388 YTKELKDYEELKQSLLNKAFKGEL 411 >gi|330874481|gb|EGH08630.1| type I restriction-modification system specificity subunit [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 421 Score = 155 bits (391), Expect = 2e-35, Method: Composition-based stats. Identities = 99/417 (23%), Positives = 175/417 (41%), Gaps = 32/417 (7%) Query: 20 AIPKH----------WKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPK 66 +P+ W++ +K +N + G+ ++ +E V + G+ Sbjct: 10 QVPEGTCSSSDTAKKWRICRLKHVALINPYLSLSRVRWGEPATFLPMEAVSTD-GQVDYS 68 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI------IADFDGICSTQFLV-LQPKDV 119 + ++ S + F G ++ K+ P + G ST+F V K Sbjct: 69 EPEDSKNLVSGFTNFEAGDVILAKITPCFENGKGAVLSDMPTRVGFGSTEFHVLRVNKKA 128 Query: 120 LPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 +P + S ++ EA+ G+ + N + +P L EQ I + + +T Sbjct: 129 IPNFIYYITKSDLFMRQGEALMIGSAGQKRVSTSYVENFQLALPSLHEQRKIVDFLEEKT 188 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 I I+++ IELL+E+KQ LV VT+GL+P M+++GIEW+G +P HWEV+ Sbjct: 189 SLIAEAISKKEYQIELLEERKQILVQQAVTRGLDPAAPMRNAGIEWIGEIPKHWEVRRSK 248 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFR 294 + K + SYG I Q +G K E + V+ G+ V Sbjct: 249 FTFNQRKELARKNDIQLSATQSYGVIPQDEYEEKVGRKVVKILFNLEKRKHVEVGDFVIS 308 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 Q L A I +S + GID Y ++L++S A + +R Sbjct: 309 MRSFQG---GLERAWASG-CIRSSYVILKPLPGIDPGYYSYLLKSKRYIAALQATANFIR 364 Query: 355 --QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 Q L FE+ + + +PP+ EQ +I + ++ D + +EQ I+ LKE +++ Sbjct: 365 DGQDLNFENFALVDLPIPPLDEQKEIARYLASWLSKADRGLYLLEQQIIKLKEYKAT 421 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 58/141 (41%), Gaps = 5/141 (3%) Query: 281 ETYQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMA-VKPHGIDSTYLAWLM 337 + + G+++ I N K ++ S G ++ + ++ ++ Sbjct: 78 SGFTNFEAGDVILAKITPCFENGKGAVLSDMPTRVGFGSTEFHVLRVNKKAIPNFIYYIT 137 Query: 338 RSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 +S + +GS ++ + V+ + +P + EQ I + + +T+ I + K Sbjct: 138 KSDLFMRQGEALMIGSAGQKRVSTSYVENFQLALPSLHEQRKIVDFLEEKTSLIAEAISK 197 Query: 396 IEQSIVLLKERRSSFIAAAVT 416 E I LL+ER+ + AVT Sbjct: 198 KEYQIELLEERKQILVQQAVT 218 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 60/177 (33%), Gaps = 9/177 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDI-IYIGLEDVESGTGKYLPKDG 68 +++G++WIG IPKHW+V K N + DI + +Y K G Sbjct: 227 MRNAGIEWIGEIPKHWEVRRSK--FTFNQRKELARKNDIQLSATQSYGVIPQDEYEEKVG 284 Query: 69 ---NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + G + + + A G + +++L+P + Sbjct: 285 RKVVKILFNLEKRKHVEVGDFVIS-MRSFQGGLERAWASGCIRSSYVILKPLPGIDPGYY 343 Query: 126 GWLLSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 +LL +++ + +PIPPL EQ I + + + Sbjct: 344 SYLLKSKRYIAALQATANFIRDGQDLNFENFALVDLPIPPLDEQKEIARYLASWLSK 400 >gi|325662102|ref|ZP_08150720.1| hypothetical protein HMPREF0490_01458 [Lachnospiraceae bacterium 4_1_37FAA] gi|325471551|gb|EGC74771.1| hypothetical protein HMPREF0490_01458 [Lachnospiraceae bacterium 4_1_37FAA] Length = 435 Score = 154 bits (390), Expect = 2e-35, Method: Composition-based stats. Identities = 87/433 (20%), Positives = 186/433 (42%), Gaps = 28/433 (6%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG-----KDIIYIGLED--VESGTGKY 63 KDSG++W+G IP WK + ++ T T+ +++IY ED + T Sbjct: 6 KDSGIKWVGEIPSDWKALKLRYICDKITDYTASGSFASLAENVIYRDYEDYAMLVRTADL 65 Query: 64 LPKDGNSRQS-DTSTVSIFAK-----GQILYGKLGPYLRKAIIADFD--GICSTQFLVLQ 115 K S+ D + + G+++ +G + + +++Q Sbjct: 66 SNKRETSKVYVDEHAYNYLSNSNLFGGEVILPNIGSVGEVYLYQPIYERATLAPNAIMIQ 125 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + + + L + + ++ + T + + N+ + IPP + + I + Sbjct: 126 APEEVEKFLYYYFSTYGAFDDLKNLGNATTQIKFNKTQLRNLKVVIPPKEKMLKINCFLD 185 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 +I++ +T + I+ L+E K+++V V+KG+ ++KD+ + +P W++ Sbjct: 186 RRCEKIESFVTVVQQQIDTLEELKRSVVYEAVSKGIKKA-ELKDTDSDVWAKIPKDWQLV 244 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + E+ ++ +ILS++ + K + N G ++Y YQIV P + V Sbjct: 245 DV-KYLFEIVKRIAGKEGIDILSVTQQGLKVKDISSNEGQIADNYSGYQIVYPTDYVMNH 303 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGSG 352 +DL + G+ + Y + + L + M+ +C++FY++G G Sbjct: 304 MDLLTGWVDCSTM----FGVTSPDYRVFRLMDKANNSLRYYKYVMQCCYMCRIFYSLGQG 359 Query: 353 L----RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + R L+ V PP+KEQ I + I + + I+ L+ + +L++ + Sbjct: 360 VSTLGRWRLQTSSFLNFKVPAPPLKEQEIIADYIEEKVSGIERLINLKIEQQRVLEDYKK 419 Query: 409 SFIAAAVTGQIDL 421 + IA VTG+ ++ Sbjct: 420 TLIADYVTGKKEV 432 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 84/219 (38%), Gaps = 18/219 (8%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN-------------TKLIESNILSLS 260 KDSGI+WVG +P W+ + ++ + E + + Sbjct: 2 MQIKKDSGIKWVGEIPSDWKALKLRYICDKITDYTASGSFASLAENVIYRDYEDYAMLVR 61 Query: 261 YGNIIQKLETRNMGLKPESYE--TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 ++ K ET + + +Y + + GE++ I + + + ER + Sbjct: 62 TADLSNKRETSKVYVDEHAYNYLSNSNLFGGEVILPNIGSVGEVYLYQP--IYERATLAP 119 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFD 377 + ++ +L + +Y +G+ + ++ L V++PP ++ Sbjct: 120 NAIMIQAPEEVEKFLYYYFSTYGAFDDLKNLGNATTQIKFNKTQLRNLKVVIPPKEKMLK 179 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 I ++ +I+ V ++Q I L+E + S + AV+ Sbjct: 180 INCFLDRRCEKIESFVTVVQQQIDTLEELKRSVVYEAVS 218 >gi|19881249|gb|AAM00857.1|AF486552_3 HsdS [Campylobacter jejuni] gi|19881273|gb|AAM00877.1|AF486556_3 HsdS [Campylobacter jejuni] Length = 417 Score = 154 bits (390), Expect = 2e-35, Method: Composition-based stats. Identities = 60/412 (14%), Positives = 136/412 (33%), Gaps = 19/412 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ TG T GKD + D E G N + Sbjct: 10 LPQGWEVKKLGEIGEIVTGSTPSKSNLDFYGKDYPFFKPSDFEQGYF-LENAGDNLSKLG 68 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 IL +G + A+ ++ K+++ E + + +S Sbjct: 69 FDKARQLPPKTILVVCIGSLGKVALTRVIGSCNQQINAIIPHKNIISEYIYYYCISSKFQ 128 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + T++ + + + P + EQ I + +ID I + + Sbjct: 129 SILFSKAPQTTLAIFNKTEFSKLEIIYPKDIKEQERIVGILDESFAKIDESIKILEQDLL 188 Query: 194 LLKEKKQALVSYIVTKGLN--PDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNT 249 L E Q+ + + + G EW +G + + + + E+ Sbjct: 189 NLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEISNLIQNGFAASKNNEIPSGYV 248 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 L NI + N ++ + +K E ++ +I+F + Sbjct: 249 HLRTHNISTDGNLNFDTLIKIKREFIK----EKQSFIEKNDILFNNTNSTELVGKTALVT 304 Query: 310 VMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLP 366 ++ + +S + + K F + + + + +K++ Sbjct: 305 QNYNYAFSNHLTKIKLKNQYNSKLVVFYFVLLLKNKYFEKICHQWIGQSGINIDKLKKIQ 364 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 365 IPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLNKAFKGE 416 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 70/204 (34%), Gaps = 12/204 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIY----IGLEDVES-GTGKYLPKDGNSRQS 73 +P+ W+ + + + G + +I + ++ + G + R+ Sbjct: 214 KLPQGWEWKSLGEISNLIQNGFAASKNNEIPSGYVHLRTHNISTDGNLNFDTLIKIKREF 273 Query: 74 DTSTVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPEL--LQGW 127 S K IL+ + +++ S ++ K+ + + Sbjct: 274 IKEKQSFIEKNDILFNNTNSTELVGKTALVTQNYNYAFSNHLTKIKLKNQYNSKLVVFYF 333 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 +L + + + S + + I +P+PPL EQ I + + + L Sbjct: 334 VLLLKNKYFEKICHQWIGQSGINIDKLKKIQIPLPPLKEQEQIAKHLDFVFEKTKALKEL 393 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 + ++ +E KQ+L++ L Sbjct: 394 YTKELKDYEELKQSLLNKAFKGEL 417 >gi|118474615|ref|YP_892156.1| type I restriction-modification system, S subunit [Campylobacter fetus subsp. fetus 82-40] gi|118413841|gb|ABK82261.1| type I restriction-modification system, S subunit [Campylobacter fetus subsp. fetus 82-40] Length = 401 Score = 154 bits (390), Expect = 2e-35, Method: Composition-based stats. Identities = 78/426 (18%), Positives = 161/426 (37%), Gaps = 41/426 (9%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 YK + V G IPK W+VV + + T + + + +++ I ++ + K Sbjct: 4 SYKQTAV---GRIPKEWEVVRLGDVFQRVTRKNTVNSDNVLTISAQNGLIKQENFFTK-- 58 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPEL 123 + D S + KG+ Y K G+ S ++ + K+ + Sbjct: 59 SVASKDLSNYILLEKGEFAYNKSYSSGYPMGATKRLNFYNYGVLSNLYIYFKIKNGNSDF 118 Query: 124 LQGWLLSIDVTQRIEAICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + + + I I + +H NI + +PPL EQ I E + Sbjct: 119 YEQYFEAGLLNKEIHQIAQEGARNHGLLNISVVDFFNILIVLPPLKEQEKIAEIL----S 174 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 D +I+ I+ K AL+ + L+ ++ K+ W F Sbjct: 175 TCDKVISNLDELIKAKTNLKTALMQNL----LSAKIRFKEFTDPW-----QEKFGDKLFK 225 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 +TE+N+ + +S +G I + L + + +S Y++V G + Q Sbjct: 226 TITEINQ--NYDLPILAISQEFGAIPRNLIDYKVIVSYKSISNYKVVRKGNFIISLRSFQ 283 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQS-- 356 GI + AY+ +KP + + +S+D + + G+R Sbjct: 284 G-----GIEYSKYDGICSPAYIILKPIQQIFDNFFKYYFKSHDYIQKLNSKLEGIRDGKM 338 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + F+ + + +P + EQ I V++ D + ++ + LK ++ + +T Sbjct: 339 VSFKQFSEIKIPLPNLAEQQKIAEVLSA----CDDEINLLKDKLSNLKLQKQGLMQNLLT 394 Query: 417 GQIDLR 422 G++ +R Sbjct: 395 GKVRVR 400 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 81/207 (39%), Gaps = 10/207 (4%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 G +P WEV + + RKNT ++ + + +I++ + + Y + Sbjct: 11 GRIPKEWEVVRLGDVFQRVTRKNTVNSDNVLTISAQNGLIKQENFFTKSVASKDLSNYIL 70 Query: 286 VDPGEIVFRFIDLQND-KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 ++ GE + + + G++++ Y+ K +S + + L K Sbjct: 71 LEKGEFAYNKSYSSGYPMGATKRLNFYNYGVLSNLYIYFKIKNGNSDFYEQYFEAGLLNK 130 Query: 345 VFYAMGS-GLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + G R ++ D + +++PP+KEQ I +++ I L E I+ Sbjct: 131 EIHQIAQEGARNHGLLNISVVDFFNILIVLPPLKEQEKIAEILSTCDKVISNLDELIKAK 190 Query: 400 IVLLKERRSSFIAAAVTGQIDLRGESQ 426 +++ + ++ +I + + Sbjct: 191 ----TNLKTALMQNLLSAKIRFKEFTD 213 >gi|120597149|ref|YP_961723.1| restriction modification system DNA specificity subunit [Shewanella sp. W3-18-1] gi|120557242|gb|ABM23169.1| restriction modification system DNA specificity domain [Shewanella sp. W3-18-1] Length = 417 Score = 154 bits (390), Expect = 2e-35, Method: Composition-based stats. Identities = 61/414 (14%), Positives = 144/414 (34%), Gaps = 17/414 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W + ++ +KL+ G T + I ++ +V + + Sbjct: 5 VPDGWMLKIVRDTSKLSAGGTPSTQVTEYWENGTIPWMSSGEVHKKRVHSVDNCITTLGL 64 Query: 74 DTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + S+ +F IL G I++ + + + KD + Sbjct: 65 ENSSAKMFPSKSILVALAGQGKTRGTVAISEIELTTNQSIAAIIVKDKSVYPDFLYHNLD 124 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + G+ + + +G++ + +PPL EQ I + + + I+ + + Sbjct: 125 SRYEELRGVSGGSGRAGLNLAILGDLDVLLPPLPEQQKIAKILTSVDQVIEKTQAQIDKL 184 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +L Q L++ V P + KDS + W+ D + F ++ Sbjct: 185 KDLKTGMMQELLTQGVGVDGKPHTEFKDSPVGWIPKTWDLEPLANFTTFISYGFTNPMPE 244 Query: 252 IESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 E ++ ++ K++ + + I L D R A V Sbjct: 245 AEVGPYMITAKDVNDLKVQYSTSRKTTQEAFDNLLTRKSRPQVNDILLTKDGTLGRVALV 304 Query: 311 ME-RGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPV 367 + I + + P+ +L +L+ S + G + + V ++ V Sbjct: 305 TDSNCCINQSVAVLTPNERVIPKFLLYLLASPRYQQEMLENAGGSTIKHIYITVVDKMLV 364 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 VP + EQ + ++ + ++ E E + L + + + + +TG++ + Sbjct: 365 GVPSVTEQQKLVDIFDSVFRKL----ELTENKLSKLNDTKKALMQDLLTGKVRV 414 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 84/216 (38%), Gaps = 11/216 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT-GRT---SESGKDIIYIGLEDVESGT 60 K + ++KDS V W IPK W + P+ FT + G T E+ I +DV Sbjct: 205 KPHTEFKDSPVGW---IPKTWDLEPLANFTTFISYGFTNPMPEAEVGPYMITAKDVNDLK 261 Query: 61 GKY-LPKDGNSRQSDTSTVSIFAK--GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 +Y + D IL K G R A++ D + + VL P Sbjct: 262 VQYSTSRKTTQEAFDNLLTRKSRPQVNDILLTKDGTLGRVALVTDSNCCINQSVAVLTPN 321 Query: 118 DV-LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + +P+ L L S Q + G+T+ H + + + +P + EQ + + + Sbjct: 322 ERVIPKFLLYLLASPRYQQEMLENAGGSTIKHIYITVVDKMLVGVPSVTEQQKLVDIFDS 381 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 +++ + + + K Q L++ V ++ Sbjct: 382 VFRKLELTENKLSKLNDTKKALMQDLLTGKVRVNID 417 >gi|152998552|ref|YP_001355473.1| restriction modification system DNA specificity subunit [Shewanella baltica OS185] gi|151367566|gb|ABS10565.1| restriction modification system DNA specificity domain [Shewanella baltica OS185] Length = 388 Score = 154 bits (389), Expect = 2e-35, Method: Composition-based stats. Identities = 98/428 (22%), Positives = 167/428 (39%), Gaps = 46/428 (10%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M HYK Y +YK + + W+ ++P HWK+ KR +N G + +ES Sbjct: 1 MSHYKPYLEYKGTDLAWLKSVPSHWKIAQFKRLISINNGSDHKQ-----------IESDD 49 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 G P G+ + I +L G+ G + + T + +L Sbjct: 50 G--YPVYGSGGVFAYAKDYIHDGESVLLGRKGTIDKPLYVKGKFWTVDTMYW----SKIL 103 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 P + I T + +G+ + P EQ I + ET R Sbjct: 104 PTANGKFCYYIATTIPFGLYSTNTALPSMTQTDLGSHVVAFPDYNEQTEITRVLDCETTR 163 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID LI ++ RF+EL+KEK ALV G ++K + Sbjct: 164 IDALIRKKSRFLELIKEKILALVMNEQINGNGKFDRLK--------------------RM 203 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVFRFIDLQ 299 ++R T + ++L N + L + + L K ++ V+ G+++ Sbjct: 204 TNVVSRPATIVDSDEYVALGLYNRGRGLFHKPVTLGKDMGDSSFFYVEEGDLILSGQFAW 263 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS--- 356 ++ + + +++ Y ++ I + YL L + + G Sbjct: 264 EGAVTMATEKETG-CVVSHRYPVIRGKSIATEYLFALFMTNFGDFLLNESSRGAAGRNRP 322 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L + + +P + Q + + A+ DV K+++SI LLKERRS+FI AAVT Sbjct: 323 LNINLLLNEKIRIPSPEVQRE-VKRLMYLKAQADV---KVKKSIALLKERRSAFITAAVT 378 Query: 417 GQIDLRGE 424 G+IDLRGE Sbjct: 379 GKIDLRGE 386 >gi|147920567|ref|YP_685636.1| type I restriction modification system, specificity subunit [uncultured methanogenic archaeon RC-I] gi|110621032|emb|CAJ36310.1| type I restriction modification system, specificity subunit [uncultured methanogenic archaeon RC-I] Length = 449 Score = 154 bits (389), Expect = 2e-35, Method: Composition-based stats. Identities = 70/440 (15%), Positives = 158/440 (35%), Gaps = 34/440 (7%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGK 62 K YK++ + G IP+ W +V IK + + K YI + V + + K Sbjct: 18 KTDDGYKETPM---GRIPEEWSIVSIKNIVEKTEQIDPQKQPDKYFKYIDVSSVSNESLK 74 Query: 63 YLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLV--LQP 116 + + + + I I++ + P L++ I D +CST F V Sbjct: 75 VVSVNEFKGINAPSRARRIVRTDDIIFATIRPNLKRVAIICDDLEGQLCSTAFCVLRCMK 134 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 P + + + ++ + G+ + + + +PP++EQ I + Sbjct: 135 NIAEPYFVFQTVTTDRFIGKLCDLQCGSGYPAVTDNDLLDQQILLPPISEQRKIAAILGT 194 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 I+ R + + K+ L+ +T+G + + +G++P HW+ P Sbjct: 195 LDSLIEE----TDRVVARTGQLKKGLIQEFLTEG----MGNVELEDTALGMIPKHWKCVP 246 Query: 237 FFA---LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL---KPESYETYQIVDPGE 290 F K+ K S + NI ++ + G+ Sbjct: 247 FATLSLTYKNGIYKHDKYYGSGYPCIRMYNIADGTVNTINSPLLNVTDAELKEYELAEGD 306 Query: 291 IVFRFI---DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 ++ + DL + + + + + I ++ ++S Sbjct: 307 LLINRVNSRDLVGKAGIVPAGLGHVTFESKNIRVRLNRSMILPEFMGLFIQSSMYRNQVN 366 Query: 348 AMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + ++ +D+ + V +PP EQ I +VI ++I + + + I L+ Sbjct: 367 KFVKSAIAQSTINQDDLDNILVPLPPKDEQEKIASVIREINSKITWEI-RYRERIELV-- 423 Query: 406 RRSSFIAAAVTGQIDLRGES 425 + + + +TG+I ++ ++ Sbjct: 424 -KKALMQDLLTGRIRVKPDT 442 >gi|324115278|gb|EGC09242.1| type I restriction modification DNA specificity domain-containing protein [Escherichia fergusonii B253] Length = 449 Score = 154 bits (389), Expect = 2e-35, Method: Composition-based stats. Identities = 76/411 (18%), Positives = 161/411 (39%), Gaps = 19/411 (4%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 G +P+ W + + TG+T + + +I +I D++ G G + + Sbjct: 4 GKLPEGWVECELSELGNIVTGKTPSTKEPSNFGGNIPFIKPGDLDLG-GYIMNTADTLTE 62 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S V ++ +G L K I + Q L P + L + + + Sbjct: 63 KGLSLVPTLPANSVVVTCIG-NLGKVGITVKKSASNQQINALIPSEKLN-VKFVYYQILT 120 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +E+ T++ + P PPLAEQ +I EK+ ++++ + Sbjct: 121 LKPWLESQSAATTIAIVNKSKFSQAPFKFPPLAEQKIIAEKLDTLLAQVESTKARLEQIP 180 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL- 251 ++LK +QA+++ V L + + + S + + E+ + K+ Sbjct: 181 QILKRFRQAVLAIAVNGQLTKEWR-ELSELSAIWPSLTLGELVTIERGSSPRPIKDYITA 239 Query: 252 IESNILSLSYGNII---QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 ES + + G+ + + + + PE + + V PG+ + + Sbjct: 240 SESGVNWIKIGDAREGEKYIHSTKEKITPEGAKKSRKVTPGDFILSNSMSLGRAYIV--- 296 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPV 367 +E + ++ P ID Y +L+ S L + F + G+ Q+++ E VK+ V Sbjct: 297 -DIEGYVHDGWFILRLPQHIDKNYFYYLLSSSQLQEQFSNLAVGGVVQNIRSELVKQAIV 355 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +P KEQ +I + A D + +++ ++ + S +A A G+ Sbjct: 356 NIPSEKEQHEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 406 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 58/200 (29%), Gaps = 8/200 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDII--------YIGLEDVESGTGKYLPKDGNSRQSDTS 76 W + + + G + KD I +I + D G Sbjct: 213 WPSLTLGELVTIERGSSPRPIKDYITASESGVNWIKIGDAREGEKYIHSTKEKITPEGAK 272 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G + R I+ + F++ P+ + L S + ++ Sbjct: 273 KSRKVTPGDFILSNSMSLGRAYIVDIEGYVHDGWFILRLPQHIDKNYFYYLLSSSQLQEQ 332 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G + + + + + IP EQ I ++ DT+ + + + Sbjct: 333 FSNLAVGGVVQNIRSELVKQAIVNIPSEKEQHEIVRRVEQLFAYADTIEKQVNNALARVN 392 Query: 197 EKKQALVSYIVTKGLNPDVK 216 Q++++ L + Sbjct: 393 NLTQSILAKAFRGELTAQWR 412 >gi|14520513|ref|NP_125988.1| type I restriction-modification enzyme, S subunit [Pyrococcus abyssi GE5] gi|5457728|emb|CAB49219.1| hsdS type I restriction-modification enzyme, S subunit [Pyrococcus abyssi GE5] Length = 427 Score = 154 bits (389), Expect = 2e-35, Method: Composition-based stats. Identities = 69/415 (16%), Positives = 153/415 (36%), Gaps = 26/415 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP+ W+VV + ++ ++ ++ I +E V + + + + S+ Sbjct: 26 IPEEWEVVELGEVARIRKKKSVRDIAEVAVIPMEKVPQDNELFAEFEIKAIEDVKSSTY- 84 Query: 81 FAKGQILYGKLGPYLR-------KAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSID 132 G +L K+ P + + + +T+ + P + L ++ Sbjct: 85 CEAGDLLLAKITPSFENGKQGIVPFNVPNGFALATTEVYPIVPSENLDVFFLFYILKDKR 144 Query: 133 VTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +E G T + + +P+PPL EQ I E + + I + R Sbjct: 145 FRKILEVRMTGTTGRQRVQKTDLLKLQIPLPPLEEQKKIAEILRSIDEAIQAVDESIARL 204 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 L K + L++ + V++ +E +P+ W+V + N Sbjct: 205 ERLKKGTMERLLTRGINHTRFKTVELNGRKVE----IPEEWDVVELGEVAERRNESVNPA 260 Query: 252 IESNILSLSYGNI-IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 NI + +I + G E + PG+I++ + DK + + Sbjct: 261 NMGNIPFVGLEHIEPGNIRLSQWGNSSEVKSSKSKFYPGDILYGKLRPYLDKAVIADFE- 319 Query: 311 MERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 GI ++ + +K YL W++ S + + G+ ++ +K+ + Sbjct: 320 ---GICSTDIIVIKAKEDKTIPEYLIWVIHSKEFIEYAKKTMKGVNHPRTSWKSIKQFQI 376 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +PP++EQ I ++ ID +E L+ + + + +TG++ +R Sbjct: 377 PLPPLEEQKKIAEILRT----IDEAIEAKRAKKEKLERMKKAVMEKLLTGEVRVR 427 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 57/192 (29%), Positives = 92/192 (47%), Gaps = 5/192 (2%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 IP+ W VV + + + + +I ++GLE +E G + + S+ Sbjct: 236 EIPEEWDVVELGEVAERRNESVNPANMGNIPFVGLEHIEPGNIRLSQ--WGNSSEVKSSK 293 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQR 136 S F G ILYGKL PYL KA+IADF+GICST +V++ K+ +PE L + S + + Sbjct: 294 SKFYPGDILYGKLRPYLDKAVIADFEGICSTDIIVIKAKEDKTIPEYLIWVIHSKEFIEY 353 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + +G WK I +P+PPL EQ I E + I+ ++ + + K Sbjct: 354 AKKTMKGVNHPRTSWKSIKQFQIPLPPLEEQKKIAEILRTIDEAIEAKRAKKEKLERMKK 413 Query: 197 EKKQALVSYIVT 208 + L++ V Sbjct: 414 AVMEKLLTGEVR 425 >gi|23452704|gb|AAN33127.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 411 Score = 154 bits (389), Expect = 2e-35, Method: Composition-based stats. Identities = 60/412 (14%), Positives = 136/412 (33%), Gaps = 19/412 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ TG T GKD + D E G N + Sbjct: 4 LPQGWEVKKLGEIGEIVTGSTPSKSNLDFYGKDYPFFKPSDFEQGYF-LENAGDNLSKLG 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 IL +G + A+ ++ K+++ E + + +S Sbjct: 63 FDKARQLPPKTILVVCIGSLGKVALTRVIGSCNQQINAIIPHKNIISEYIYYYCISSKFQ 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + T++ + + + P + EQ I + +ID I + + Sbjct: 123 SILFSKAPQTTLAIFNKTEFSKLEIIYPKDIKEQERIVGILNESFAKIDESIKILEQDLL 182 Query: 194 LLKEKKQALVSYIVTKGLN--PDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNT 249 L E Q+ + + + G EW +G + + + + E+ Sbjct: 183 NLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEISNLIQNGFAASKNNEIPSGYV 242 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 L NI + N ++ + +K E ++ +I+F + Sbjct: 243 HLRTHNISTDGNLNFDTLIKIKREFIK----EKQSFIEKNDILFNNTNSTELVGKTALVT 298 Query: 310 VMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLP 366 ++ + +S + + K F + + + + +K++ Sbjct: 299 QNYNYAFSNHLTKIKLKNQYNSKLVVFYFVLLLKNKYFEKICHQWIGQSGINIDKLKKIQ 358 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 359 IPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLNKAFKGE 410 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 70/204 (34%), Gaps = 12/204 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIY----IGLEDVES-GTGKYLPKDGNSRQS 73 +P+ W+ + + + G + +I + ++ + G + R+ Sbjct: 208 KLPQGWEWKSLGEISNLIQNGFAASKNNEIPSGYVHLRTHNISTDGNLNFDTLIKIKREF 267 Query: 74 DTSTVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPEL--LQGW 127 S K IL+ + +++ S ++ K+ + + Sbjct: 268 IKEKQSFIEKNDILFNNTNSTELVGKTALVTQNYNYAFSNHLTKIKLKNQYNSKLVVFYF 327 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 +L + + + S + + I +P+PPL EQ I + + + L Sbjct: 328 VLLLKNKYFEKICHQWIGQSGINIDKLKKIQIPLPPLKEQEQIAKHLDFVFEKTKALKEL 387 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 + ++ +E KQ+L++ L Sbjct: 388 YTKELKDYEELKQSLLNKAFKGEL 411 >gi|293374802|ref|ZP_06621106.1| type I restriction modification DNA specificity domain protein [Turicibacter sanguinis PC909] gi|292646560|gb|EFF64566.1| type I restriction modification DNA specificity domain protein [Turicibacter sanguinis PC909] Length = 397 Score = 153 bits (387), Expect = 4e-35, Method: Composition-based stats. Identities = 67/414 (16%), Positives = 163/414 (39%), Gaps = 36/414 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +W+++ ++ +L+ GR + + + I ++++ +D N + + Sbjct: 2 SNWEIIKVQDIGQLHNGRAFKPNEWSNQGLPIIRIQNLNG------SQDFNYFDGNFESK 55 Query: 79 SIFAKGQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 +L+ G G I + + + K+ + ++ ++L +Q Sbjct: 56 HEVNYEDLLFAWSGSRGTSFGPYIWKGDRSLLNQHIFKVDLKEGIDKVFIYYMLKRLTSQ 115 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 A + H K + + +PPL EQ I E + + I + + I Sbjct: 116 IEYNAHGSAGLVHITKKELEKFELHLPPLKEQQKIAEILSSVDAA----IEKTEQVIAKT 171 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL-----NRKNTK 250 +E K+ L+ ++TKG+ + K + I G +P WEVK + + + NRK ++ Sbjct: 172 EEVKKGLMQQLLTKGIG-HTEFKQTEI---GEIPVSWEVKKISQVASTMSGGTPNRKKSE 227 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 +I + G +I K + E + + +++ G ++ K ++ Sbjct: 228 YYNGDIPWVKTGELIHKYLNNSEEKITELGLNNSSAKLMPVGTVLIAMYGATVGKSTILG 287 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + +++ YL + ++ Y K+ + ++ + +K L + Sbjct: 288 ISASTNQACCG--IIPNKDYLNNEYLYYRLQ-YWKDKLISMATGAAQPNISQQLIKELLI 344 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P + EQ I +++N++ + + + ++ LKE + + +TGQ+ + Sbjct: 345 PLPNLSEQEKIVDILNIQDEK----IANEKANLDSLKEIKQGLMQRLLTGQVRV 394 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 74/210 (35%), Gaps = 9/210 (4%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGK 62 ++K + IG IP W+V I + +G T K DI ++ ++ Sbjct: 191 EFKQTE---IGEIPVSWEVKKISQVASTMSGGTPNRKKSEYYNGDIPWVKTGELIHKYLN 247 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + S+ + G +L G + K+ I + + P Sbjct: 248 NSEEKITELGLNNSSAKLMPVGTVLIAMYGATVGKSTILGISASTNQACCGIIPNKDYLN 307 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + ++ ++ GA + + I + +P+P L+EQ I + + + +I Sbjct: 308 NEYLYYRLQYWKDKLISMATGAAQPNISQQLIKELLIPLPNLSEQEKIVDILNIQDEKIA 367 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLN 212 E+ + Q L++ V ++ Sbjct: 368 NEKANLDSLKEIKQGLMQRLLTGQVRVQID 397 >gi|57168615|ref|ZP_00367747.1| HsdS [Campylobacter coli RM2228] gi|57019896|gb|EAL56576.1| HsdS [Campylobacter coli RM2228] Length = 408 Score = 153 bits (387), Expect = 4e-35, Method: Composition-based stats. Identities = 62/412 (15%), Positives = 128/412 (31%), Gaps = 28/412 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + WK + + G I ++ ++++ G S + + Sbjct: 6 QGWKWKSLGEICFITDGTHKTPNYIETGIPFLSVKNISKGFFDLSDVKYISLEEHNKLIK 65 Query: 80 IFAKG--QILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDV 133 IL ++G + I +F S L + K + L+ I+ Sbjct: 66 RAKPEFEDILICRIGTLGKAIKISLEFEFSIFVSLGLLKPKVKIISDYLVYFLNSCFIEE 125 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G + + + P+ +PPL EQ I + +ID I + + Sbjct: 126 WINDNKVGGGTHTAKLNLNILEKCPIALPPLKEQERIVGILDENFAKIDENIKILEQDLL 185 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L E Q+ + + + +P WE K + + Sbjct: 186 NLDELMQSALQKAFNPLKD--------NAKENYKLPQGWEWKSLGEIGEIITGTTPSKNN 237 Query: 254 SNILSLSYGNIIQKLETRNMGLKPES-------YETYQIVDPGEIVFRFIDLQNDKRSLR 306 N Y ++ +K S ++ + + I+ I K L Sbjct: 238 PNFYGNEYPLFKPSDLNGDIIIKYASDNLSKLGFDNARNLPKDTILVVCIGASIGKVGLS 297 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRL 365 I + S YL ++ S + S + + +L Sbjct: 298 GVNGSCNQQINAII---PNSAFTSKYLFFVCLSNYFQTILKKNASQTTLPIINKTEFSKL 354 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + +PP+KEQ I + ++ ++ + L + + I L+E ++S + A G Sbjct: 355 QIPLPPLKEQEQIASHLDELSSHVKNLKQNYQAQIKNLQELKNSLLDKAFKG 406 Score = 84.8 bits (208), Expect = 3e-14, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 66/199 (33%), Gaps = 8/199 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+ + ++ TG T + D+ +G N + Sbjct: 211 KLPQGWEWKSLGEIGEIITGTTPSKNNPNFYGNEYPLFKPSDL-NGDIIIKYASDNLSKL 269 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-SID 132 K IL +G + K ++ +G C+ Q + P ++ S Sbjct: 270 GFDNARNLPKDTILVVCIGASIGKVGLSGVNGSCNQQINAIIPNSAFTSKYLFFVCLSNY 329 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ T+ + + +P+PPL EQ I + + + L I Sbjct: 330 FQTILKKNASQTTLPIINKTEFSKLQIPLPPLKEQEQIASHLDELSSHVKNLKQNYQAQI 389 Query: 193 ELLKEKKQALVSYIVTKGL 211 + L+E K +L+ L Sbjct: 390 KNLQELKNSLLDKAFKGNL 408 >gi|295135948|ref|YP_003586624.1| type I restriction-modification system specificity determinant [Zunongwangia profunda SM-A87] gi|294983963|gb|ADF54428.1| type I restriction-modification system specificity determinant [Zunongwangia profunda SM-A87] Length = 350 Score = 153 bits (386), Expect = 5e-35, Method: Composition-based stats. Identities = 83/348 (23%), Positives = 148/348 (42%), Gaps = 8/348 (2%) Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G + + +++ DG S +V++P D+ +L S + I Sbjct: 2 KAGDFVINSRSDRKGSSGVSESDGSVSLINIVMEPNDIFGSFCNYFLKSKAFVEENYRIG 61 Query: 142 EGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + + NI M PP EQ I + ++DT++ ++ + I LLKE+K Sbjct: 62 HGIVADLWTTRYDEMKNIIMAFPPKPEQQAIANFLDETCEKLDTVVAQKEKMIALLKERK 121 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT----KLIESN 255 QAL+ VT+GLN +V MKDSG++W+G +P +WEVK + K E N Sbjct: 122 QALIQNAVTRGLNKNVPMKDSGVDWIGEIPKNWEVKRLKFICVLNKESLPENLNKKQEIN 181 Query: 256 ILSLSYGNIIQKLETRNMGLK-PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + + + L ++ G+ + + + Sbjct: 182 YVDIGSVTFEDGILSTEYYLFQNAPSRARKVAKNGDTIVSTVRTYLKAIDFIDENKSKYV 241 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIK 373 T + I + YL +R+ + G+ ++ D+ R+ V VP + Sbjct: 242 YSTGFAILSPNKNILNKYLYNQVRADAFTEQVSYNSKGMSYPAINSTDLGRIWVCVPSKQ 301 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ I N I+ ++ ++D V + EQ+IV LKE ++S I + V G+I + Sbjct: 302 EQEKIVNYIDAQSRKLDQAVTQQEQAIVKLKEYKASLIDSCVLGKIKV 349 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 50/205 (24%), Positives = 91/205 (44%), Gaps = 7/205 (3%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPK 66 KDSGV WIG IPK+W+V +K LN E ++I Y+ + V G + Sbjct: 139 MKDSGVDWIGEIPKNWEVKRLKFICVLNKESLPENLNKKQEINYVDIGSVTFEDGILSTE 198 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVL-PE 122 + + + + G + + YL+ D + + ST F +L P + + Sbjct: 199 YYLFQNAPSRARKVAKNGDTIVSTVRTYLKAIDFIDENKSKYVYSTGFAILSPNKNILNK 258 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L + + T+++ +G + + +G I + +P EQ I I A++ ++D Sbjct: 259 YLYNQVRADAFTEQVSYNSKGMSYPAINSTDLGRIWVCVPSKQEQEKIVNYIDAQSRKLD 318 Query: 183 TLITERIRFIELLKEKKQALVSYIV 207 +T++ + I LKE K +L+ V Sbjct: 319 QAVTQQEQAIVKLKEYKASLIDSCV 343 >gi|23452748|gb|AAN33144.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 378 Score = 153 bits (386), Expect = 6e-35, Method: Composition-based stats. Identities = 62/395 (15%), Positives = 126/395 (31%), Gaps = 25/395 (6%) Query: 31 KRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 ++++G T K I ++ ++D++ + S+ +F K Sbjct: 1 GDIAEISSGGTPSRNKKEYWENGIIPWVKIKDIKENFISTTEEFITEDGLKNSSAKLFKK 60 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G +LY + L + I D D + + K+ L + +I + G Sbjct: 61 GTLLYS-IFATLGEVAILDIDATTNQAIAGINIKENNINSLYLMYFLKSIKDKICSKGRG 119 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 ++ + + I +P+PPL EQ I + +ID I + + L E Q+ + Sbjct: 120 VAQNNLNLTILKQIQIPLPPLKEQERIVGILDESFAKIDESIKILEQNLLNLDELMQSAL 179 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + + +P WE K + ++ K + Sbjct: 180 QKAFNPLKD--------NAKENYKLPQSWEWKSLEEISENISAGGDKPKNCTESKTAKNQ 231 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I N + I+ P I + + + I+ Y+ Sbjct: 232 IPVYANGVNNNGLVGYTDKATIIKPSL----TISARGTIGFVCIRKEPYFPIVRLIYLIP 287 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + + YL + + L K L + +PP+KEQ I ++ Sbjct: 288 CENILCLHYLYFCLN-----FFIAKGEGSSIPQLTIPKFKSLQIPLPPLKEQEQIAEHLD 342 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + L E + + +E + S + A G+ Sbjct: 343 FVFEKAKALKELYTKELKDYEELKQSLLDKAFKGE 377 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 65/193 (33%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ ++ ++ G + ++ Y N+ + Sbjct: 195 KLPQSWEWKSLEEISENISAGGDKPKN----CTESKTAKNQIPVYANGVNNNGLVGYTDK 250 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K + G I + + + L P + + L + + E Sbjct: 251 ATIIKPSLTISARGTIGFVCIRKEPYFPI-VRLIYLIPCENILCLHYLYFCLNFFIAKGE 309 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+++ ++ +P+PPL EQ I E + + L + ++ +E Sbjct: 310 ----GSSIPQLTIPKFKSLQIPLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEEL 365 Query: 199 KQALVSYIVTKGL 211 KQ+L+ L Sbjct: 366 KQSLLDKAFKGEL 378 >gi|33240157|ref|NP_875099.1| restriction endonuclease S subunit [Prochlorococcus marinus subsp. marinus str. CCMP1375] gi|33237684|gb|AAP99751.1| Restriction endonuclease S subunit [Prochlorococcus marinus subsp. marinus str. CCMP1375] Length = 425 Score = 153 bits (385), Expect = 6e-35, Method: Composition-based stats. Identities = 87/403 (21%), Positives = 160/403 (39%), Gaps = 11/403 (2%) Query: 28 VPIKRFTKLN-TGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TV 78 I K+ +G T + +I +I D+ G + D + ++ Sbjct: 24 KKISHLCKIIGSGTTPDKNDARNFTKGNIPWILSGDLNDGIIEKPNSYVTQYALDNNPSL 83 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I+ + I+ G + + I F + VL P + EL + I + + Sbjct: 84 KIYPRNSIIIAMYGATIGRVSIPKFSFTVNQACCVLSPFNK-CELKYLFYCLIGLRHVLF 142 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ G + + + I ++ + +P EQ I + + E ++I+ I + I LL EK Sbjct: 143 SMAIGGAQPNINQELIKSLKILLPSNYEQKKIYKFLDQEIIKINLAIQNQYNLITLLDEK 202 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 KQALV +TKGL+ +V MK+S + +G +P+HW+ K L KN++ + S Sbjct: 203 KQALVLDAITKGLDKEVSMKNSKLFLLGKIPNHWQSKKLSQLFKTSKGKNSQKLTKEYCS 262 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + G+ +Y D GE K + + Sbjct: 263 KNEGDYPVYSGQTQSDGI-MAYINTFEFDAGEKGVILTTTVGAKAMSVKLIKGRFNLSQN 321 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + T S + ++ S + ED +++ + +PPIKEQ I Sbjct: 322 CMVISAKDNSCHTAYFEYCFSSIFKIEKNKIPIHMQPSFRKEDFQKIRIPIPPIKEQIQI 381 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +N ++ E +I + E + I L ++R + I+ A + QIDL Sbjct: 382 SNFLHKEVEKIKQMNESSKLLISKLIDKRFALISFATSNQIDL 424 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 72/210 (34%), Gaps = 12/210 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 K+S + +G IP HW+ + + K + G+ S+ + E G Y G Sbjct: 220 SMKNSKLFLLGKIPNHWQSKKLSQLFKTSKGKNSQK------LTKEYCSKNEGDYPVYSG 273 Query: 69 NSRQSDTSTVSIF------AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 ++ KG IL +G + S +V+ KD Sbjct: 274 QTQSDGIMAYINTFEFDAGEKGVILTTTVGAKAMSVKLIKGRFNLSQNCMVISAKDNSCH 333 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + I +PIPP+ EQ+ I + E +I Sbjct: 334 TAYFEYCFSSIFKIEKNKIPIHMQPSFRKEDFQKIRIPIPPIKEQIQISNFLHKEVEKIK 393 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLN 212 + I L +K+ AL+S+ + ++ Sbjct: 394 QMNESSKLLISKLIDKRFALISFATSNQID 423 >gi|295402727|ref|ZP_06812668.1| restriction modification system DNA specificity domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294975226|gb|EFG50863.1| restriction modification system DNA specificity domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 472 Score = 152 bits (384), Expect = 1e-34, Method: Composition-based stats. Identities = 96/433 (22%), Positives = 176/433 (40%), Gaps = 30/433 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP +W + + ++ ++Y+G+EDVE+ TG + S S Sbjct: 22 EIPPNWIWTRLDNVCYEDRQTVKPDSEEAKRLLYLGMEDVEANTGII---NKISEDVGKS 78 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG-WLLSIDVTQ 135 F ILYGKL PYL K + DF+G C+T+F+ L+P+ + +L + V Sbjct: 79 NTYKFDSTHILYGKLRPYLNKVALPDFEGRCTTEFIPLKPEGGISREYLALFLRTQKVID 138 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + A +G+ M AD K + +I P+PP++EQ I +K+ + ID + E + ELL Sbjct: 139 TVMAKSKGSRMPRADMKVLMSIEFPLPPVSEQRRIIKKVKSYFKIIDKIEKELAKAKELL 198 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVG-------------LVPDHWEVKPFFALVT 242 K++ ++L+ L + S + + +P++W Sbjct: 199 KKRHESLLQKAFRGELVKREENDKSTFDLLNIKVSSSTDENDPYDIPENWVWLELGDCGV 258 Query: 243 ELNR-----KNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFR 294 K NIL +S ++ + E + +++ ++F Sbjct: 259 ITGGGTPSKKVPSFWNGNILWVSPKDMKRDKINDTEDKITELAIEKSSAKLIPKNSVLFV 318 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGL 353 + +E + I+ +YL + + ++ + A Sbjct: 319 VRSGILRHSLPVAINDVELTVNQDIKAITPHEFINVSYLFYAFKCFEKSWLQEASKIGAT 378 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +SL E VK+L V VPP+ EQ I + E + +V+ IE++ L++ R S + Sbjct: 379 VESLDMEKVKKLKVPVPPLSEQLKIIERLEKEFEKEQAIVQSIERAEEKLQKMRQSLLQK 438 Query: 414 AVTGQ-IDLRGES 425 A G+ ++ R E Sbjct: 439 AFRGELVEQRPEE 451 >gi|168232879|ref|ZP_02657937.1| type I restriction enzyme EcoKI specificity protein [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|194472515|ref|ZP_03078499.1| type I restriction enzyme EcoKI specificity protein [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194458879|gb|EDX47718.1| type I restriction enzyme EcoKI specificity protein [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|205332898|gb|EDZ19662.1| type I restriction enzyme EcoKI specificity protein [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] Length = 486 Score = 152 bits (383), Expect = 1e-34, Method: Composition-based stats. Identities = 90/443 (20%), Positives = 170/443 (38%), Gaps = 46/443 (10%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 G +P+ W + G+ ++ D + LED+E + K L S + Sbjct: 4 GKLPEGWVDTQLGNIVDY--GKATKRVLSDVNDDTWVLELEDIEKESSKLLSTIRASERP 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD-VLPELLQGWLLSID 132 ST + F +G +LYGKL PYL K IIA DG+C+T+ + L + + + WL S Sbjct: 62 FKSTKNSFKRGDVLYGKLRPYLNKIIIAKEDGVCTTEIIPLCAEPSCCNKYIFYWLKSST 121 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G M P+ + PLAEQ +I EK+ +ID+ + Sbjct: 122 FQGYVNDVSYGVNMPRLGTADGLKAPLRLAPLAEQKIIAEKLDTLLAQIDSTKARLEQIP 181 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKM----------------------------KDSGIEW 224 ++LK +QA+++ V+ L + +M S ++ Sbjct: 182 QILKRFRQAVLAAAVSGNLTAEWRMNNNSNIVEEEIEKVKNKLIAKKIIKKDLIYSKLDR 241 Query: 225 VGLVPDHWEVKPFFAL---VTELNRKNTKLIESNILSLSYGNIIQKLE-----TRNMGLK 276 +P W ++ +T+ K K + L +S NI + Sbjct: 242 KYPIPSDWLYVKLQSIATKITDGEHKTPKREPAGQLLISARNIQDGYLKLSDVDYVGDAE 301 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + D G+++ + L + + A + + + + Y+ +L Sbjct: 302 FQKLRNRCDPDSGDVLISCSGSIG-RVCLVDENSKYVMVRSVALIKLMQDFVINKYMMYL 360 Query: 337 MRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++S L K S + +L +K L + +PP+ EQ +I + A D + ++ Sbjct: 361 LQSPLLQKEIEENSKSTAQANLFLGPIKNLGIPLPPVPEQAEIVRRVEQLFAYADTIEKQ 420 Query: 396 IEQSIVLLKERRSSFIAAAVTGQ 418 + ++ + S +A A G+ Sbjct: 421 VNSALTRVNSLTQSILAKAFRGE 443 >gi|307720726|ref|YP_003891866.1| restriction modification system DNA specificity domain-containing protein [Sulfurimonas autotrophica DSM 16294] gi|306978819|gb|ADN08854.1| restriction modification system DNA specificity domain protein [Sulfurimonas autotrophica DSM 16294] Length = 409 Score = 152 bits (383), Expect = 1e-34, Method: Composition-based stats. Identities = 52/418 (12%), Positives = 140/418 (33%), Gaps = 30/418 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNS 70 + +P W+ ++ T+L T T+ + + I ++ +E++ SG + Sbjct: 4 LYELPDGWEWKKLEEITELITKGTTPTTNGYKFLNEGINFLKIENIVSGEIDLSTIEMFI 63 Query: 71 RQSDTSTVSI--FAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQG 126 + + +L+ G AI+ + +++PK+ L Sbjct: 64 SKEAHQAQRRSQLKENDVLFSIAGTIGDTAIVKKEHLPMNINQAIALIRPKESLNSKFLK 123 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + L V+Q + G + + + N P+PPL EQ I K+ + +ID ++ Sbjct: 124 YSLLSIVSQNTKDKQRGGAIKNISLGDMKNTNYPLPPLQEQKRIVGKLDSLFEKIDRVVA 183 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + ++ ++++ + K N + +G Sbjct: 184 LHQKNMDEADAFMGSVLNDVFGKFSNKKIVALKGITSKIG-------------SGATPRG 230 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDK 302 I + NI + + ++ ++ +++ + Sbjct: 231 GQKSYKTEGISFIRSMNIYDTGFREKGLAFIDDEQAQKLNNVTIEENDVLINITGASVAR 290 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFE 360 + + + + + I ++L + + S + F + G R+++ Sbjct: 291 CCIVDKKYLPARVNQHVSILRLKDRIIPSFLHYYLISPFIKSELLFNSSGGATREAITKT 350 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ V + + Q ++ + + + E ++ + LK ++S + A G+ Sbjct: 351 MLEEFQVPLISLSLQQKTVTYLDKISLYLKRIKEVQKEKMENLKALKASILDEAFRGK 408 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 73/206 (35%), Gaps = 13/206 (6%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTK------LIESNILSLSYGNIIQKLETRNMGLKP 277 + +PD WE K + + + T + I L NI+ + Sbjct: 3 ELYELPDGWEWKKLEEITELITKGTTPTTNGYKFLNEGINFLKIENIVSGEIDLSTIEMF 62 Query: 278 ESYETYQI-----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 S E +Q + +++F D ++ + + I + + ++S + Sbjct: 63 ISKEAHQAQRRSQLKENDVLFSIAGTIGD-TAIVKKEHLPMNINQAIALIRPKESLNSKF 121 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 L + + S G +++ D+K +PP++EQ I ++ +ID + Sbjct: 122 LKYSLLSIVSQNTKDKQRGGAIKNISLGDMKNTNYPLPPLQEQKRIVGKLDSLFEKIDRV 181 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQ 418 V ++++ S + G+ Sbjct: 182 VALHQKNMDEADAFMGSVLNDVF-GK 206 >gi|148926926|ref|ZP_01810603.1| putative type I restriction enzyme specificity S protein [Campylobacter jejuni subsp. jejuni CG8486] gi|145845010|gb|EDK22107.1| putative type I restriction enzyme specificity S protein [Campylobacter jejuni subsp. jejuni CG8486] Length = 375 Score = 151 bits (382), Expect = 1e-34, Method: Composition-based stats. Identities = 84/385 (21%), Positives = 158/385 (41%), Gaps = 28/385 (7%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLED 55 MK++ K+SG++W+G IP+HW+VV I + G E+ +I I + D Sbjct: 1 MKNF------KESGIEWLGEIPEHWEVVKINKIVTFVNGYAFENFDFNPIFEIPVIRIGD 54 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLV 113 ++ Y +++ + + + IL G K D + + + Sbjct: 55 MQKEKILY-DNCLKTKEKEKLKQFLISNNDILIALSGATTGKIAFCDTDNKAYINQRVAI 113 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 ++ K L ++ + L+ + IE C G+ + K IG +P+PPL EQ I Sbjct: 114 VRSKLKL---VKYYFLTRGFSLLIELACNGSAQPNISTKEIGEFKIPLPPLKEQEQIANF 170 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + + +I I ++ + I LLKE+KQAL++ +TKGL+ ++ KDSGIEW+G +P HW Sbjct: 171 LDEKCKKIANFIEKKEKLITLLKEQKQALINETITKGLDKNINFKDSGIEWLGEIPQHWR 230 Query: 234 VKPFFALVTELNR------KNTKLIESNILSLSYGNIIQKLETR--NMGLKPESYETYQI 285 + + + + L NI + +K + QI Sbjct: 231 IVKLKYVAFTNIGLVYTPDDIIENPDEGYPVLRANNIQNGKIDYQDLIYIKSKQIGKKQI 290 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + G+++ + + L ++ G + + Y W+ ++ L K Sbjct: 291 ISSGDLLMCVRNGSENL--LGKTAKIQDGYFSFGAFTAIIKSQFNDYFYWIFQTNMLRKS 348 Query: 346 FYA-MGSGLRQSLKFEDVKRLPVLV 369 + S + +D+K + Sbjct: 349 IASFSASNGIGQISQDDIKNFIISF 373 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 86/209 (41%), Gaps = 10/209 (4%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN----ILSLSYGNIIQKLE 269 K+SGIEW+G +P+HWEV +VT +N + + N I + G++ ++ Sbjct: 1 MKNFKESGIEWLGEIPEHWEVVKINKIVTFVNGYAFENFDFNPIFEIPVIRIGDMQKEKI 60 Query: 270 TRNM--GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K + ++ +I+ K + + I V+ Sbjct: 61 LYDNCLKTKEKEKLKQFLISNNDILIALSGATTGKIAFC--DTDNKAYINQRVAIVRSKL 118 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + + A + ++ +++ + +PP+KEQ I N ++ + Sbjct: 119 KLVK--YYFLTRGFSLLIELACNGSAQPNISTKEIGEFKIPLPPLKEQEQIANFLDEKCK 176 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +I +EK E+ I LLKE++ + I +T Sbjct: 177 KIANFIEKKEKLITLLKEQKQALINETIT 205 >gi|262196003|ref|YP_003267212.1| restriction modification system DNA specificity domain protein [Haliangium ochraceum DSM 14365] gi|262079350|gb|ACY15319.1| restriction modification system DNA specificity domain protein [Haliangium ochraceum DSM 14365] Length = 423 Score = 151 bits (382), Expect = 1e-34, Method: Composition-based stats. Identities = 90/425 (21%), Positives = 181/425 (42%), Gaps = 28/425 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI---IYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W+ V + L +G+ + +ES TG+ L + Q+ S Sbjct: 6 QVPTRWRRVRLLDHVDLPSGQVDPRDPQYRSQPLVAPNHIESQTGRLLALESAESQNAIS 65 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQ 135 F+ G ++Y K+ PYLRKAI+A FDG+CS L+ K + P L LL + + Sbjct: 66 GKYTFSAGDVVYSKIRPYLRKAILASFDGLCSADMYPLRAKTSVEPGFLLALLLGEEFSS 125 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 E++ + + K +G+ +PPL EQ I + +D I IE + Sbjct: 126 FAESVSMRTGIPKLNRKELGSYHARLPPLGEQRKIAAIL----GAVDEAIARTQAVIEQV 181 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + K+ L+ ++T+GL P + E +G +P+ W ++ ++ + ++ Sbjct: 182 QVVKKGLMQDLLTRGL-PGRHTRFKQTE-IGQIPESWSAVRLGDVLDGIDAGWSPKCANH 239 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQ---------IVDPGEIVFRFIDLQNDKRSLR 306 +++ + KPE + V PG+++ D + Sbjct: 240 PAGNGEWGVLKVSSVSSGIYKPEENKMLPDDLIPKPELEVRPGDVIIARASGVLDLVGVC 299 Query: 307 --SAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFED 361 + R +++ + V+P+ DS YLA ++S + + +G +++ + Sbjct: 300 SFVYKTRPRLMLSDKTLRVRPNRTLLDSFYLALTLQSPVVRSLVLEKATGSHMRNISQKA 359 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + V +P + EQ +++ I ARID +S+ L E +S+ ++ +TG++ + Sbjct: 360 IGSVTVALPSLDEQVKVSSGIMAMDARIDN----DTRSVESLTELKSALMSVLLTGEVRV 415 Query: 422 RGESQ 426 + + Sbjct: 416 TPDEE 420 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 76/216 (35%), Gaps = 22/216 (10%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSE------SGKDIIYIGLEDVESGTGK 62 +K + IG IP+ W V + ++ G + + + + + V SG Sbjct: 204 FKQTE---IGQIPESWSAVRLGDVLDGIDAGWSPKCANHPAGNGEWGVLKVSSVSSGI-- 258 Query: 63 YLPKDGNSRQSDTSTV--SIFAKGQILYGKLGPYLRKAIIADFDG------ICSTQFLVL 114 Y P++ D G ++ + L + F + S + L + Sbjct: 259 YKPEENKMLPDDLIPKPELEVRPGDVIIARASGVLDLVGVCSFVYKTRPRLMLSDKTLRV 318 Query: 115 QPKDVLPELLQ--GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 +P L + L S V + G+ M + K IG++ + +P L EQV + Sbjct: 319 RPNRTLLDSFYLALTLQSPVVRSLVLEKATGSHMRNISQKAIGSVTVALPSLDEQVKVSS 378 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I+A RID EL L++ V Sbjct: 379 GIMAMDARIDNDTRSVESLTELKSALMSVLLTGEVR 414 >gi|2129238|pir||B64316 restriction modification system S chain homolog - Methanococcus jannaschii Length = 425 Score = 151 bits (382), Expect = 1e-34, Method: Composition-based stats. Identities = 74/440 (16%), Positives = 146/440 (33%), Gaps = 46/440 (10%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESG 59 +K + IG IP+ W++V +K K + G T + I ++ +ED+ + Sbjct: 6 ENFKKTE---IGEIPEDWEIVELKDVCKKIKAGGTPKTSVEEYYKNGTIPFVKIEDITNS 62 Query: 60 TGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 + + S I K +L+ G A I + + L + PK Sbjct: 63 NKYLTNTKIKITEEGLNNSNAWIVPKNSVLFAMYGSIGETA-INKIEVATNQAILGIIPK 121 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 D + E + + + T + + + + + +P+PPL EQ I + + Sbjct: 122 DNILESEFLYYILAKNKNYYSKLGMQTTQKNLNAQIVKSFKIPLPPLEEQKQIAKIL--- 178 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 +ID I + I L+ K+ L+ ++TKG+ K +G +P+ WEV Sbjct: 179 -TKIDEGIEIIEKSINKLERIKKGLMHKLLTKGIGHSRFKKS----EIGEIPEDWEVFEI 233 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQ-------------KLETRNMGLKPESYETYQ 284 + +S N I R + Sbjct: 234 KDIFEVKTGTTPSTKKSEYWENGEINWITPLDLSRLNEKIYIGSSERKVTKIALEKCNLN 293 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 ++ G I+ L + P DS + K Sbjct: 294 LIPKGSIIISTRAPVGYVAVLTV-----ESTFNQGCKGLVPKNNDSVNTEFYAYYLKFKK 348 Query: 345 VF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G + L ++ + +PP++EQ I +++ D +E +Q Sbjct: 349 NLLENLSGGSTFKELSKSMLENFKIPLPPLEEQKQIAKILSSV----DKSIELKKQKKEK 404 Query: 403 LKERRSSFIAAAVTGQIDLR 422 L+ + + +TG++ ++ Sbjct: 405 LQRMKKKIMELLLTGKVRVK 424 >gi|303244598|ref|ZP_07330931.1| restriction modification system DNA specificity domain protein [Methanothermococcus okinawensis IH1] gi|302485024|gb|EFL47955.1| restriction modification system DNA specificity domain protein [Methanothermococcus okinawensis IH1] Length = 421 Score = 151 bits (382), Expect = 2e-34, Method: Composition-based stats. Identities = 74/423 (17%), Positives = 149/423 (35%), Gaps = 27/423 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYL---PKDGNSR 71 +P W+V + + G T + K DI +I +D+ + KY+ ++ + Sbjct: 6 LPDGWEVRKLGEVANVIGGGTPSTKKSEYWNGDIPWITPKDLSNYIFKYICKGERNISRE 65 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ + G IL P IA + + F + + Sbjct: 66 GLKNSSAKLLPPGTILLSSRAPI-GYVAIAKNELTTNQGFRSFITNEDKLNYEFLYYWLK 124 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +E++ G+T I N+ + +PPL EQ I E + + +I+ I Sbjct: 125 TKKKVLESLAGGSTFKEISGTTIKNLEILLPPLKEQQKIAEILSSLDDKIELNIKMNKTL 184 Query: 192 IELLKEKKQALVSYIVTKGLNPD---VKMKDSGIE----WVGLVPDHWEVKPFFALVTEL 244 E + + P+ K SG E +G +P W +KP ++ + Sbjct: 185 E----EMAKTIFKRWFIDFEFPNEEGKPYKSSGGEFINSELGEIPKGWSIKPIKNILNFI 240 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + N I+ + R++ E + + +I+ + + S Sbjct: 241 RGIEPG--SKYYTLIKKENHIRFIRIRDLNSNSEKVYIPKEMAKNKILNSEDIIISLDGS 298 Query: 305 LRSAQVMERGIITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 L + G +S V P + Y+ L++S ++ SG + Sbjct: 299 LGVVKFGYNGAYSSGIRKVCPISEYDIPNMYIYCLLKSDNIQNTIKNYASGTTILHAGKS 358 Query: 362 VKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 V+ + + +P I E I + T I + ++ I L + R + + +TG+I Sbjct: 359 VEHMKITLPKKIDEMKRILKLFGDLTKPIFNQILNNQKEIQTLTKIRDTLLPKLITGKIR 418 Query: 421 LRG 423 ++ Sbjct: 419 VKP 421 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 71/217 (32%), Gaps = 23/217 (10%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVE 57 YK SG ++ +G IPK W + PIK G + I +I + D+ Sbjct: 209 YKSSGGEFINSELGEIPKGWSIKPIKNILNFIRGIEPGSKYYTLIKKENHIRFIRIRDLN 268 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 S + + + I I+ G + ++G S+ + P Sbjct: 269 SN------SEKVYIPKEMAKNKILNSEDIIISLDGSLG--VVKFGYNGAYSSGIRKVCPI 320 Query: 118 DVL---PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + L S ++ I+ G T+ HA + E I + Sbjct: 321 SEYDIPNMYIYCLLKSDNIQNTIKNYASGTTILHAGKSVEHMKITLPKKIDEMKRILKLF 380 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 T I I + I+ L + + L+ ++T + Sbjct: 381 GDLTKPIFNQILNNQKEIQTLTKIRDTLLPKLITGKI 417 >gi|115502461|sp|Q57594|T1S1_METJA RecName: Full=Type-1 restriction enzyme MjaXIP specificity protein; Short=S.MjaXIP; AltName: Full=Type I restriction enzyme MjaXIP specificity protein; Short=S protein gi|61680619|pdb|1YF2|A Chain A, Three-Dimensional Structure Of Dna Sequence Specificity (S) Subunit Of A Type I Restriction-Modification Enzyme And Its Functional Implications gi|61680620|pdb|1YF2|B Chain B, Three-Dimensional Structure Of Dna Sequence Specificity (S) Subunit Of A Type I Restriction-Modification Enzyme And Its Functional Implications Length = 425 Score = 151 bits (381), Expect = 2e-34, Method: Composition-based stats. Identities = 72/438 (16%), Positives = 151/438 (34%), Gaps = 42/438 (9%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESG 59 +K + IG IP+ W++V +K K + G T + I ++ +ED+ + Sbjct: 6 ENFKKTE---IGEIPEDWEIVELKDVCKKIKAGGTPKTSVEEYYKNGTIPFVKIEDITNS 62 Query: 60 TGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 + + S I K +L+ G A I + + L + PK Sbjct: 63 NKYLTNTKIKITEEGLNNSNAWIVPKNSVLFAMYGSIGETA-INKIEVATNQAILGIIPK 121 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 D + E + + + T + + + + + +P+PPL EQ I + + Sbjct: 122 DNILESEFLYYILAKNKNYYSKLGMQTTQKNLNAQIVKSFKIPLPPLEEQKQIAKIL--- 178 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 +ID I + I L+ K+ L+ ++TKG+ K +G +P+ WEV Sbjct: 179 -TKIDEGIEIIEKSINKLERIKKGLMHKLLTKGIGHSRFKKS----EIGEIPEDWEVFEI 233 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQ-------------KLETRNMGLKPESYETYQ 284 + +S N I R + Sbjct: 234 KDIFEVKTGTTPSTKKSEYWENGEINWITPLDLSRLNEKIYIGSSERKVTKIALEKCNLN 293 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 ++ G I+ L +G +++ + A+ ++ Sbjct: 294 LIPKGSIIISTRAPVGYVAVLTVESTFNQGC--KGLFQKNNDSVNTEFYAYYLKFKKNLL 351 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + G + L ++ + +PP++EQ I +++ D +E +Q L+ Sbjct: 352 ENLS-GGSTFKELSKSMLENFKIPLPPLEEQKQIAKILSSV----DKSIELKKQKKEKLQ 406 Query: 405 ERRSSFIAAAVTGQIDLR 422 + + +TG++ ++ Sbjct: 407 RMKKKIMELLLTGKVRVK 424 >gi|310830282|ref|YP_003965382.1| type I restriction enzyme, S subunit [Ketogulonicigenium vulgare Y25] gi|308753188|gb|ADO44331.1| type I restriction enzyme, S subunit [Ketogulonicigenium vulgare Y25] Length = 300 Score = 151 bits (380), Expect = 3e-34, Method: Composition-based stats. Identities = 96/301 (31%), Positives = 154/301 (51%), Gaps = 10/301 (3%) Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 R+ GA M A+W +G+I +P PPL EQ I + ET RID LI ++ R Sbjct: 4 PGFIDRVNGSTTGAKMPRAEWGFVGSIKVPTPPLEEQTAIAIFLDRETARIDGLIKKKGR 63 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP----FFALVTELNR 246 FIELLKEK+ AL+++ VTKG++ V MKDSG +W+G +P+HW+ P F + Sbjct: 64 FIELLKEKRAALITHAVTKGIDAGVPMKDSGQDWLGQIPEHWDTVPPTALFTESKERAHE 123 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + L + + + LE R + + + + + + G+ V + L Sbjct: 124 GDQMLSATQKYGVIPLEEFEALEQRQVTMAVTNLDKRKHTEIGDFVISMRSMDG---GLE 180 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKR 364 A+ + + + + P + +L++S + S +R Q + F ++ Sbjct: 181 RARAVGSVRSSYSVLRCGPEVE-GRFFGYLLKSSLYIQALRLTTSFIRDGQDMNFSHFRK 239 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 + + P+ EQ I + I+ ETARID LV K ++SI LLKE+RS+ I AAVTG+ID+R Sbjct: 240 VKLPRVPVDEQIRIADHIDRETARIDGLVAKTDRSIELLKEKRSTLITAAVTGKIDVRNA 299 Query: 425 S 425 + Sbjct: 300 A 300 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 49/211 (23%), Positives = 81/211 (38%), Gaps = 13/211 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKY 63 KDSG W+G IP+HW VP + R E + I LE+ E+ Sbjct: 90 MKDSGQDWLGQIPEHWDTVPPTALFTESKERAHEGDQMLSATQKYGVIPLEEFEA----L 145 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + ++ G + A G + + VL+ + Sbjct: 146 EQRQVTMAVTNLDKRKHTEIGDFVISMRSMDGG-LERARAVGSVRSSYSVLRCGPEVEGR 204 Query: 124 LQGWLLSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 G+LL + + + ++ + +P P+ EQ+ I + I ET RI Sbjct: 205 FFGYLLKSSLYIQALRLTTSFIRDGQDMNFSHFRKVKLPRVPVDEQIRIADHIDRETARI 264 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLN 212 D L+ + R IELLKEK+ L++ VT ++ Sbjct: 265 DGLVAKTDRSIELLKEKRSTLITAAVTGKID 295 >gi|77166355|ref|YP_344880.1| restriction modification system DNA specificity subunit [Nitrosococcus oceani ATCC 19707] gi|76884669|gb|ABA59350.1| Restriction modification system DNA specificity domain [Nitrosococcus oceani ATCC 19707] Length = 425 Score = 151 bits (380), Expect = 3e-34, Method: Composition-based stats. Identities = 77/429 (17%), Positives = 150/429 (34%), Gaps = 36/429 (8%) Query: 21 IPKHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD-- 74 +P+ W+V P+ + + + +T S + DV D + + Sbjct: 5 VPEGWEVKPLGKLVDVRSSNIDKKTETSEIPVRLCNYTDVYYNNRITSAIDFMAASAKQR 64 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDG------ICSTQFLVLQPKDVLPELLQ--G 126 KG ++ K + + +C +L+P + Sbjct: 65 EIDRFSLEKGDVIITKDSETPDDIAVPSYVSDDLSGVVCGYHLTLLKPDQDESDGEFLSH 124 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 V + G T I P+ PPL EQ I + +D +I Sbjct: 125 LFQLPSVQHYFYILANGITRFGLTADAINEAPLLTPPLPEQQKIAAIL----SSVDDVIE 180 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA-----LV 241 + I LK+ K A++ ++TKG+ + KDS + G +P W + +V Sbjct: 181 KTRAQIHKLKDLKTAMMQELLTKGIG-HTEFKDSPV---GRIPVGWSICSAGEVAVAIMV 236 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR---FIDL 298 + + +ES + +L N+ + T + LK S ++ +I+ ++ + Sbjct: 237 GVVVKPAQYYVESGVPALRSANVRENGLTMD-NLKYFSEDSNEILKKSRLIKGDLLTVRT 295 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSL 357 + E + IDS + + S G +Q Sbjct: 296 GYPGTTAVVTDEFEGCNCIDVVITRPSSRIDSDFFCLWVNSDHGKGQVLKAQGGLAQQHF 355 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 D+K L V+VP + EQ I N +N T + + E+ + LL + + + + +TG Sbjct: 356 NVSDMKNLTVVVPSLTEQKAIFNAVNSVTKK----IALTEKRLTLLLDTKKALMQDLLTG 411 Query: 418 QIDLRGESQ 426 ++ + E + Sbjct: 412 KVRVNVEQE 420 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 68/209 (32%), Gaps = 12/209 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGK 62 ++KDS V G IP W + + + +V Sbjct: 209 EFKDSPV---GRIPVGWSICSAGEVAVAIMVGVVVKPAQYYVESGVPALRSANVRENGLT 265 Query: 63 YLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVL 120 K + ++ S KG +L + G A++ D C+ ++ +P + Sbjct: 266 MDNLKYFSEDSNEILKKSRLIKGDLLTVRTGYPGTTAVVTDEFEGCNCIDVVITRPSSRI 325 Query: 121 PELLQGWLLSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 ++ D ++ G H + + N+ + +P L EQ I + + T Sbjct: 326 DSDFFCLWVNSDHGKGQVLKAQGGLAQQHFNVSDMKNLTVVVPSLTEQKAIFNAVNSVTK 385 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVT 208 +I ++ K Q L++ V Sbjct: 386 KIALTEKRLTLLLDTKKALMQDLLTGKVR 414 >gi|23452768|gb|AAN33154.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452773|gb|AAN33157.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452787|gb|AAN33165.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 403 Score = 150 bits (379), Expect = 4e-34, Method: Composition-based stats. Identities = 72/415 (17%), Positives = 143/415 (34%), Gaps = 33/415 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V ++ ++ G++ + K+ I + G + K + Sbjct: 4 LPQGWEVKRLEEVCEVVMGQSPNGNCIFDKDKNKDLI---EFHQGKIAFSDKYIDESNFV 60 Query: 75 TST-VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 TS I K ++ P I I + K + + + Sbjct: 61 TSDVKKIAKKNSVVLCVRAPVGEVNITTKDIAIGRGLCSLNGVKINNN---FLFFYLLTL 117 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G+T + + I +P+PPL EQ I + +ID I + + Sbjct: 118 KKYFNDNSTGSTFKAINVRVIKETKIPLPPLKEQERIVGILDFAFSKIDENIKKAKENLA 177 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN--RKNTKL 251 + E Q+ + + + +P W+ K + + K Sbjct: 178 NIDELMQSALQKAFNPLND--------NTKENYQLPQSWKWKGLGEICFITDGTHKTPNY 229 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP-----GEIVFRFIDLQNDKRSLR 306 IE+ I LS NI + + E +++ G+I+ I +++ Sbjct: 230 IETGIPFLSVKNISKGFFDLSDVKYISLEEHNKLIKRAKPEFGDILICRIGTLGK--AIK 287 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLR-QSLKFEDVK 363 + E I S + I S YL + + SY + +G G L ++ Sbjct: 288 ISLEFEFSIFVSLGLLKPKVKIISDYLVYFLNSYFIEGWINNNKVGGGTHTAKLNLNILE 347 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + PV +PP+KEQ I + ++ + L E + + +E + S + A G+ Sbjct: 348 KCPVALPPLKEQEQIASHLDSVFEKTKALKELYTKELKDYEELKQSLLDKAFKGE 402 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 70/201 (34%), Gaps = 9/201 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ WK + + G I ++ ++++ G S + Sbjct: 203 QLPQSWKWKGLGEICFITDGTHKTPNYIETGIPFLSVKNISKGFFDLSDVKYISLEEHNK 262 Query: 77 TVSIFAK--GQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G IL ++G + I+ +F+ +L+PK + + L+ Sbjct: 263 LIKRAKPEFGDILICRIGTLGKAIKISLEFEFSIFVSLGLLKPKVKIISDYLVYFLNSYF 322 Query: 134 TQRIEAICE---GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + G + + + P+ +PPL EQ I + + + L + Sbjct: 323 IEGWINNNKVGGGTHTAKLNLNILEKCPVALPPLKEQEQIASHLDSVFEKTKALKELYTK 382 Query: 191 FIELLKEKKQALVSYIVTKGL 211 ++ +E KQ+L+ L Sbjct: 383 ELKDYEELKQSLLDKAFKGEL 403 >gi|220931290|ref|YP_002508198.1| restriction modification system DNA specificity domain protein [Halothermothrix orenii H 168] gi|219992600|gb|ACL69203.1| restriction modification system DNA specificity domain protein [Halothermothrix orenii H 168] Length = 422 Score = 150 bits (379), Expect = 4e-34, Method: Composition-based stats. Identities = 75/418 (17%), Positives = 166/418 (39%), Gaps = 27/418 (6%) Query: 21 IPKHWKVVPIKRFTK-LNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IPK W+ +K + G T ++ K +I+++ +ED+ + GKY+ ++ Sbjct: 18 IPKEWEFRNFGLISKYIKAGGTPKADKKEYYGGEILFVKIEDM-TKNGKYIYNTKSTITE 76 Query: 74 D---TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 D S+ I K +L G Y K I + + L + P + + + Sbjct: 77 DGLKNSSAWIVPKKSLLLSMYGSY-GKVSINKVELATNQAILGIIPSEEVNLDYLYYFSL 135 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +++ + T ++ + + N P+ PPL EQ I + +D I + Sbjct: 136 GCLKPYFKSLVKATTQANLTKQIVNNTPVLSPPLPEQKKIAAIL----STVDKAIEKTDE 191 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 IE KE K+ L+ ++TKG+ + +P W + F + + N K Sbjct: 192 IIEKSKELKKGLMQQLLTKGIGHSEFKEVRIGTKKIKIPVVWTLIKFGEVFKKRNEKANV 251 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 E + L + ++ + + ++ G+I++ + K ++ Sbjct: 252 EKEYKYVGLEHLG-TGEINLLGYDRNGNNKSSKRLFKSGDILYGKLRPYLKKAAITDFD- 309 Query: 311 MERGIITSAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVL 368 GI ++ + + YL +L+ S + G + +K L + Sbjct: 310 ---GICSTDIIPIYATKKSVNNYLIYLVHSKMFVDFAVSTMEGTNLPRTSWRVIKNLIIP 366 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +PP++EQ I ++++ + ++K ++ L+E + + +TG++ ++ E + Sbjct: 367 LPPLQEQKKIASILSSVDEK----IQKEQEYREKLEELKKGLMQKLLTGEVRVKVEDE 420 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 59/190 (31%), Positives = 86/190 (45%), Gaps = 4/190 (2%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W ++ K + + K+ Y+GLE + +G L D N S+ Sbjct: 228 KIPVVWTLIKFGEVFKKRNEKANV-EKEYKYVGLEHLGTGEINLLGYDRNGNN--KSSKR 284 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDVTQRIE 138 +F G ILYGKL PYL+KA I DFDGICST + + K + L + S Sbjct: 285 LFKSGDILYGKLRPYLKKAAITDFDGICSTDIIPIYATKKSVNNYLIYLVHSKMFVDFAV 344 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + EG + W+ I N+ +P+PPL EQ I + + +I R + EL K Sbjct: 345 STMEGTNLPRTSWRVIKNLIIPLPPLQEQKKIASILSSVDEKIQKEQEYREKLEELKKGL 404 Query: 199 KQALVSYIVT 208 Q L++ V Sbjct: 405 MQKLLTGEVR 414 >gi|332678457|gb|AEE87586.1| Type I restriction-modification system, specificity subunit S [Francisella cf. novicida Fx1] Length = 409 Score = 150 bits (378), Expect = 4e-34, Method: Composition-based stats. Identities = 57/404 (14%), Positives = 116/404 (28%), Gaps = 16/404 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + +P W+ + + G + KD IGL + D N + Sbjct: 18 LYKLPAGWEWKKLGELAEYVNGMAFKP-KDWSNIGLPIIRIQNLN-GSDDFNYFSGEAKE 75 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G IL L + I + + + + + Sbjct: 76 KYYVKSGDILISWS-ASLDVYKWQGGNAILNQHIFNTIINYDVVDYDFFYHTIKYSLSEV 134 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G M H NI +P+PPLAEQ I K+ + +ID I + I Sbjct: 135 MNNLHGVGMKHITKGKFENIQIPLPPLAEQKRIVAKLDSLFEKIDKAIELHQQNITNANT 194 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + K E+ ++ + Sbjct: 195 LMASTLDKTF----------KKLEREYSLEKVENIASTIQSGFPVNKKNEEPNGYVHLRT 244 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 N +T E ++ G+I+F + + + Sbjct: 245 HNISINGELNFDTVIKVKPSMIKEKLSYIEKGDILFNNTNSTELVGKTAIVREDYNYAFS 304 Query: 318 SAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKE 374 + I + + + K+F + + + + +K + ++VPP+ Sbjct: 305 NHLTKIKVADSILPNFFVYAFLNLFNKKLFEKICNKWIGQSGVNTTMLKNIEIIVPPLPI 364 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q ++ ++D + + EQ + LK ++S + A G+ Sbjct: 365 QQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAFRGE 408 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 63/196 (32%), Gaps = 6/196 (3%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + + K + + + +P WE K L +N K + + + L I + Sbjct: 5 YKNEQNKKNKMSELYKLPAGWEWKKLGELAEYVNGMAFKPKDWSNIGLPIIRIQNLNGSD 64 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + V G+I+ + + I+ + Sbjct: 65 DFNYFSGEAKEKYYVKSGDILISWSASLD-----VYKWQGGNAILNQHIFNTIINYDVVD 119 Query: 332 Y-LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 Y + Y L +V + + + + + + +PP+ EQ I ++ +ID Sbjct: 120 YDFFYHTIKYSLSEVMNNLHGVGMKHITKGKFENIQIPLPPLAEQKRIVAKLDSLFEKID 179 Query: 391 VLVEKIEQSIVLLKER 406 +E +Q+I Sbjct: 180 KAIELHQQNITNANTL 195 >gi|296116346|ref|ZP_06834962.1| type I restriction-modification system specificity subunit [Gluconacetobacter hansenii ATCC 23769] gi|295977165|gb|EFG83927.1| type I restriction-modification system specificity subunit [Gluconacetobacter hansenii ATCC 23769] Length = 322 Score = 149 bits (377), Expect = 5e-34, Method: Composition-based stats. Identities = 88/318 (27%), Positives = 147/318 (46%), Gaps = 15/318 (4%) Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ L S I+ G+T++H N P+P EQ I + E +ID Sbjct: 1 MKWVLESNIFKIFIDLHSHGSTINHLYQNVFENFSFPLPAFPEQQAIASFLDRECGKIDA 60 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 LI E+ R I LL EK+QA++S+ VTKGLNP+ MKDSGI W+G+VP+ WEV LV Sbjct: 61 LIAEQERLIALLAEKRQAVISHAVTKGLNPNAPMKDSGIPWIGMVPEGWEVSRLKYLVQC 120 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE-------------SYETYQIVDPGE 290 + +L ++ K+ + + + + ++ G+ Sbjct: 121 YDGIQMGPFGGMLLDINSEPTGYKVYGQENTISGDFGLGHRWISTDRYNDLRRYSLNGGD 180 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-VFYAM 349 +V + R + + + V + +LA L+ + + A Sbjct: 181 LVLTRKGSLGNARLVSKLPYPGIADSDTIRIRVDKSKVYPEFLATLLHEANYIESQINAS 240 Query: 350 GSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G L +++ L V+ PPI EQ +I N + + + D +++ +I LLKERR+ Sbjct: 241 KRGAILSGLNTKNISDLIVIYPPIYEQNNILNYLKISSEEFDCSIQQSAIAITLLKERRA 300 Query: 409 SFIAAAVTGQIDLRGESQ 426 + I+AAVTG+ID+R +S+ Sbjct: 301 ALISAAVTGKIDVRAQSK 318 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 47/219 (21%), Positives = 87/219 (39%), Gaps = 16/219 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTG-----------RTSESGKDIIYIGLEDVES 58 KDSG+ WIG +P+ W+V +K + G + G E+ S Sbjct: 94 MKDSGIPWIGMVPEGWEVSRLKYLVQCYDGIQMGPFGGMLLDINSEPTGYKVYGQENTIS 153 Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFL---V 113 G + ++ + + G ++ + G +++ + GI + + V Sbjct: 154 GDFGLGHRWISTDRYNDLRRYSLNGGDLVLTRKGSLGNARLVSKLPYPGIADSDTIRIRV 213 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + K L + + +I A GA +S + K I ++ + PP+ EQ I Sbjct: 214 DKSKVYPEFLATLLHEANYIESQINASKRGAILSGLNTKNISDLIVIYPPIYEQNNILNY 273 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + + D I + I LLKE++ AL+S VT ++ Sbjct: 274 LKISSEEFDCSIQQSAIAITLLKERRAALISAAVTGKID 312 >gi|163784191|ref|ZP_02179124.1| type I restriction-modification enzyme, S subunit [Hydrogenivirga sp. 128-5-R1-1] gi|159880541|gb|EDP74112.1| type I restriction-modification enzyme, S subunit [Hydrogenivirga sp. 128-5-R1-1] Length = 475 Score = 149 bits (377), Expect = 5e-34, Method: Composition-based stats. Identities = 81/434 (18%), Positives = 154/434 (35%), Gaps = 41/434 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGK--YLPKDGNSRQS 73 IP+ W+VV + ++ G+T + K I ++D E+ Y + + + Sbjct: 2 IPEDWEVVRLGDIAEIQQGKTPKRDLYDDRKGYRIIKVKDFENEKFVKHYPNGERSFVKV 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ------PKDVLPELLQGW 127 D +G IL G + ++ V + Sbjct: 62 DLGNRYTLEQGDILILSAGHSSKVVGQKIGFYNVNSNNKVFFVSELLRIRANNKTNPLFL 121 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 SI + + I E H + + N+ +P+PPL EQ I + +I I + Sbjct: 122 FFSIISQKSRKQIKEEIKGGHLYPRDLVNLKIPLPPLPEQKAIATVLD----KIRQAIEQ 177 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNP--DVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 I+ KE K++L+ + T G+ P + +GL+P+HWE+K V + Sbjct: 178 TEEVIQANKELKKSLMKHFFTYGVVPPEETDKVKLKETEIGLIPEHWEIKTLKDSVDSIE 237 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET---------YQIVDPGEIVFRFI 296 + I +N I T+ L I+ G+++F + Sbjct: 238 YGYSVSIPANEDQKGIPIISTADITKEGKLLYNKIRKIKPPKRLTEKLILKDGDVLFNWR 297 Query: 297 DLQNDKRSLRSAQV-----MERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAM 349 + + + I S + ++ +S +L+ Y F + Sbjct: 298 NSPELIGKTTVFEAEKVSKDDFYIYASFILRIRSKESESNNFYLKYLLNYYREIGTFIKL 357 Query: 350 GSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + + ++ L + +PPI EQ I ++N +ID +E E L++ Sbjct: 358 ARRAVNQANYNRNEIYNLKIPLPPIDEQKQIAKILN----KIDNKIEAEENKKEALEKLF 413 Query: 408 SSFIAAAVTGQIDL 421 S + +TG+I L Sbjct: 414 KSLLNNLMTGKIRL 427 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 69/206 (33%), Gaps = 19/206 (9%) Query: 18 IGAIPKHWKVVPIKR-FTKLNTGRT-----SESGKDIIYIGLEDV-ESGTGKYLPKDGNS 70 IG IP+HW++ +K + G + +E K I I D+ + G Y Sbjct: 217 IGLIPEHWEIKTLKDSVDSIEYGYSVSIPANEDQKGIPIISTADITKEGKLLYNKIRKIK 276 Query: 71 RQSDTSTVSIFAKGQILYGKLGPY----------LRKAIIADFDGICSTQFLVLQPKDVL 120 + I G +L+ K DF S + + Sbjct: 277 PPKRLTEKLILKDGDVLFNWRNSPELIGKTTVFEAEKVSKDDFYIYASFILRIRSKESES 336 Query: 121 PELLQGW--LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + ++ I+ ++ + I N+ +P+PP+ EQ I + + Sbjct: 337 NNFYLKYLLNYYREIGTFIKLARRAVNQANYNRNEIYNLKIPLPPIDEQKQIAKILNKID 396 Query: 179 VRIDTLITERIRFIELLKEKKQALVS 204 +I+ ++ +L K L++ Sbjct: 397 NKIEAEENKKEALEKLFKSLLNNLMT 422 >gi|117922225|ref|YP_871417.1| restriction modification system DNA specificity subunit [Shewanella sp. ANA-3] gi|117614557|gb|ABK50011.1| restriction modification system DNA specificity domain [Shewanella sp. ANA-3] Length = 425 Score = 149 bits (377), Expect = 6e-34, Method: Composition-based stats. Identities = 70/423 (16%), Positives = 146/423 (34%), Gaps = 27/423 (6%) Query: 21 IPKHWKVVPIKRFTKLN---TGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P +W V+P+ K GRT + G +I + +V+ G + + + Sbjct: 5 VPDNWNVLPLGSVIKQVIDFRGRTPKKLGMEWGGGNIRALSANNVQMGRVDFNKECYLAS 64 Query: 72 QSDTST---VSIFAKGQILYGKLGPYLRKAII-ADFDGICSTQFL--VLQPKDVLPELLQ 125 G IL+ P A++ D I S + + + L Sbjct: 65 DELYDKWMTKGTTEVGDILFTMEAPLGNIALVPNDDRYILSQRVILLKNDKSKASSDFLF 124 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L S + G T K + + + +PPL EQ I + + + I+ Sbjct: 125 QQLRSDSFQDTLRENATGTTAQGIQQKRLVTLDVVLPPLPEQQKIAKILTSVDEVIEKTQ 184 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + +L Q L++ V P + KDS + + + +K +T+ Sbjct: 185 AQIDKLKDLKTGMMQELLTQGVGIDGKPHTEFKDSPVGRIPKAWNCVTLKNLSKRITDGT 244 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG------EIVFRFIDLQ 299 + K + Y + ++ + E Y++ G +I++ + Sbjct: 245 HQTVKTSPDGTIPFLYVSCVRDGNIDWEKASFLTEEMYELASKGRKPENGDILYTAVGSY 304 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLK 358 ++ S A++ IDS +L + S K G + ++ Sbjct: 305 GH-AAIVSGDNRFSFQRHIAFIQPNHEKIDSEFLVSFLNSPLGKKQADLYAIGNAQLTVT 363 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 D+ + V +P I EQ I + ID + +++ + L + + + +TG+ Sbjct: 364 LGDLGKFKVALPDIAEQQRIAKI----FNGIDNRIIVVQRKLTSLGNTKKALMQDLLTGK 419 Query: 419 IDL 421 + + Sbjct: 420 VRV 422 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 47/218 (21%), Positives = 78/218 (35%), Gaps = 13/218 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESG 59 K + ++KDS V G IPK W V +K +K T T I ++ + V G Sbjct: 211 KPHTEFKDSPV---GRIPKAWNCVTLKNLSKRITDGTHQTVKTSPDGTIPFLYVSCVRDG 267 Query: 60 TGKYLPKDGNSRQSDT--STVSIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQP 116 + + + S G ILY +G Y AI++ D +QP Sbjct: 268 NIDWEKASFLTEEMYELASKGRKPENGDILYTAVGSYGHAAIVSGDNRFSFQRHIAFIQP 327 Query: 117 KD--VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + E L +L S ++ + G +G + +P +AEQ I + Sbjct: 328 NHEKIDSEFLVSFLNSPLGKKQADLYAIGNAQLTVTLGDLGKFKVALPDIAEQQRIAKIF 387 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 RI + + K Q L++ V ++ Sbjct: 388 NGIDNRIIVVQRKLTSLGNTKKALMQDLLTGKVRVAID 425 >gi|300118614|ref|ZP_07056352.1| Type I restriction-modification enzyme, S subunit [Bacillus cereus SJ1] gi|298724003|gb|EFI64707.1| Type I restriction-modification enzyme, S subunit [Bacillus cereus SJ1] Length = 415 Score = 149 bits (375), Expect = 1e-33, Method: Composition-based stats. Identities = 62/419 (14%), Positives = 164/419 (39%), Gaps = 28/419 (6%) Query: 26 KVVPIKRFT-KLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + +K K+ G T KD I ++ ++D+ + + + + S Sbjct: 3 EWQKLKDVVVKIVGGGTPSRKKDEYYHGDIPWVTVKDLIATSISDAQEKITPQAIQESAA 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 ++ K ++ L KA I + D + L P + ++IE Sbjct: 63 NLIPKSNVIIATRMA-LGKAFINEVDVAINQDLKALIPNKEKVIPKYLLYTYLSNKEKIE 121 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G T+ + I + + +P L EQ I + +D +I + IE ++ Sbjct: 122 ILGSGTTVKGIRLEQINGLEIFVPSLEEQKKITFIL----SSVDQIIEKTKAIIEQTEKV 177 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK---LIESN 255 K+ L+ ++T+G+ K K++ I + + + L+T+ T ++ Sbjct: 178 KKGLMQQLLTEGIG-HTKFKETDIGNIPEEWEVLTFEEISDLITKGTTPTTYGFSYEDTG 236 Query: 256 ILSLSYGNI-----IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + NI + K + + L I+ +I+F + + ++ + Sbjct: 237 VNFIRTENIDEQGKVVKDYMKKISLAAHQKLKRSILKEKDILFSIAGVGLGQCTIVKEDL 296 Query: 311 MERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVL 368 + + + + + + + L+ S+ + K A+ + G + ++ + + + Sbjct: 297 LPANTNQALAIIRISNPLFDHHFVYTLLLSHYITKQIKAVSTIGAQPNISLKQIGDFKIP 356 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR-GESQ 426 P ++EQ I ++++ + +++ + + L+ ++ + + +TG+I ++ E++ Sbjct: 357 KPTLREQKRIVDILSSVGEK----IQREKVKLDTLQTIKTGLMQSLLTGEIRVKADEAE 411 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 83/210 (39%), Gaps = 17/210 (8%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVES--G 59 ++K++ IG IP+ W+V+ + + L T T+ + + +I E+++ Sbjct: 194 KFKETD---IGNIPEEWEVLTFEEISDLITKGTTPTTYGFSYEDTGVNFIRTENIDEQGK 250 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQP 116 K K + SI + IL+ G L + I + +++ Sbjct: 251 VVKDYMKKISLAAHQKLKRSILKEKDILFSIAGVGLGQCTIVKEDLLPANTNQALAIIRI 310 Query: 117 KDVLPELLQGWL--LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + L + + LS +T++I+A+ + K IG+ +P P L EQ I + + Sbjct: 311 SNPLFDHHFVYTLLLSHYITKQIKAVSTIGAQPNISLKQIGDFKIPKPTLREQKRIVDIL 370 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVS 204 + +I + + Q+L++ Sbjct: 371 SSVGEKIQREKVKLDTLQTIKTGLMQSLLT 400 >gi|23452743|gb|AAN33142.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 414 Score = 148 bits (374), Expect = 1e-33, Method: Composition-based stats. Identities = 57/419 (13%), Positives = 122/419 (29%), Gaps = 30/419 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ TG T GKD + D E G N + Sbjct: 4 LPQGWEVKKLGEIGEIVTGSTPSKSNLDFYGKDYPFFKPSDFEQGYF-LENAGDNLSKLG 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 IL +G + A+ ++ K+++ E + + +S Sbjct: 63 FDKARQLPPKTILVVCIGSLGKVALTRVIGSCNQQINAIIPHKNIIAEYIYYYCISSKFQ 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + T++ + + + P + EQ I + +ID I + + Sbjct: 123 SILFSKAPQTTLAILNKTEFSKLEIIYPKDIKEQERIVGILDESFAKIDESIKILEQNLL 182 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L E Q+ + + + +P W+ K + L Sbjct: 183 NLDELMQSALQKAFNPLKD--------NAKENYKLPQGWKWKSLGEICEILGGGTPDTKN 234 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV-------------FRFIDLQN 300 S + +Q ++ ++ + Y +I + + Sbjct: 235 PIFWYSSQADEVQFEKSYYWATLVDTKQKYLYGTKRKITQKGLDCSNAILLPINSVIFSS 294 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKF 359 + Y Y K + G + + Sbjct: 295 RASIGEISIAKVETATNQGYKNFICDESILYYEFLYFALKHFTKEIELLAQGTTYKEVSK 354 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +K + +PP+KEQ IT ++ + L E + + +E + S + A G+ Sbjct: 355 AKIKDFKIPLPPLKEQEQITKHLDFIFEKAKALKELYTKELKDYEELKQSLLDKAFKGE 413 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 69/208 (33%), Gaps = 17/208 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLE----------------DVESGTGKY 63 +P+ WK + ++ G T ++ I + + D + Sbjct: 208 KLPQGWKWKSLGEICEILGGGTPDTKNPIFWYSSQADEVQFEKSYYWATLVDTKQKYLYG 267 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + + D S + +++ + + IA + + + + + Sbjct: 268 TKRKITQKGLDCSNAILLPINSVIFSS-RASIGEISIAKVETATNQGYKNFICDESILYY 326 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + T+ IE + +G T I + +P+PPL EQ I + + + Sbjct: 327 EFLYFALKHFTKEIELLAQGTTYKEVSKAKIKDFKIPLPPLKEQEQITKHLDFIFEKAKA 386 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGL 211 L + ++ +E KQ+L+ L Sbjct: 387 LKELYTKELKDYEELKQSLLDKAFKGEL 414 >gi|323699617|ref|ZP_08111529.1| restriction modification system DNA specificity domain [Desulfovibrio sp. ND132] gi|323459549|gb|EGB15414.1| restriction modification system DNA specificity domain [Desulfovibrio desulfuricans ND132] Length = 405 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 68/422 (16%), Positives = 131/422 (31%), Gaps = 39/422 (9%) Query: 21 IPKHWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYL-PKDGNSRQ 72 IP+ W+ K+ G + + I+++ E+V G PK Sbjct: 2 IPEGWQKAKGVEIADKITKGASPKWQGFEYQENGILFVTSENVRDGFLDISRPKFLPDEF 61 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQG-WL 128 + S A G IL +G + ++ I + +G+ + +L+ K +L Sbjct: 62 GEKMKNSRLADGDILINIVGASIGRSCIYENNGVPANINQAVCLLRLKKGYNVRFFSLYL 121 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + I + + I N P EQ I + I+ Sbjct: 122 QLPSTVRMLLGIQSDSARPNLSLADIRNCLFVFPKEQEQKAIATILSTWDRAIEKAEALI 181 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 Q L++ V G E+ G + F V + Sbjct: 182 KAKERRKTGLMQRLLTGKVRFG------------EFAGEAWKEVPLGTLFEPVADTVGDK 229 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + E + Y Y + GE + + + + Sbjct: 230 DI---PPYSISAGIGFVSQREKWGKDIAGRQYANYTHLRKGEFAYNKGNSKKYQCGCAYL 286 Query: 309 QVMERGIITSAYMAVKPHGIDS----TYLAWLMRSYDLCKVFYAMGSGLRQ----SLKFE 360 + I D Y + + Y ++ + SG R +L + Sbjct: 287 LRDQDEISVPNVFISFRPKSDQVSADFYEHFFIADYHARELKRYITSGARSDGLLNLNKK 346 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 D ++ V PP +EQ I V+N A ID + + LKE++ + +TG++ Sbjct: 347 DFFKINVPCPPPREQEAIAKVLNAAVAEID----EHRNQLAALKEQKKGLMQQLLTGKVR 402 Query: 421 LR 422 ++ Sbjct: 403 VK 404 >gi|313673806|ref|YP_004051917.1| restriction modification system DNA specificity domain [Calditerrivibrio nitroreducens DSM 19672] gi|312940562|gb|ADR19754.1| restriction modification system DNA specificity domain [Calditerrivibrio nitroreducens DSM 19672] Length = 451 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 66/411 (16%), Positives = 150/411 (36%), Gaps = 15/411 (3%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P+HW + + L G + +S + I + + D++ G L + Sbjct: 6 PEHWVLTELGNILYLKNGYSFKSTDYCEEGIPLVRISDIQDGRIN-LDTTVKVPNRLLKS 64 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVT 134 I G +L G K I + V K L L + Sbjct: 65 DFIIENGDLLIAMSGATTGKFGIYIGNETILQNQRVGNLKLYSKSLVSTKYRDYLIASLR 124 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I+ G + + I + +P+PPL EQ I K+ A ++ + + + Sbjct: 125 DIIQKSAYGGAQPNISPEKIHKLIIPLPPLNEQKRIVAKLDAILPKVKSARDRLEKIPAI 184 Query: 195 LKEKKQALVSYIVTKGLNPDVK--MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 LK+ +Q++++ + L D + + + + + + + Sbjct: 185 LKKFRQSVLAAACSGRLTEDWREEYAQHTGKELPEWEEKKIFELTEKVENLNVKNINLHD 244 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + +S N I+ + + QI+ G+++F + + ++ V Sbjct: 245 KFLYIDISSINNIKNTIETHKEYSYYEAPSRAKQIIKHGDVLFSNVRVYLKNIAIVDNPV 304 Query: 311 MERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPV 367 I ++ + + + + + + YL + + D K + G ++K +D+ + Sbjct: 305 YNDQICSTGFTVLRAQKNKLLNKYLFYSLIRDDFIKEVSELQVGSSYPAIKKDDLISRFI 364 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP++EQ +I + A D + EK +++ L++ + +A A G+ Sbjct: 365 PLPPLEEQHEIVRRVEKLFALADSIEEKYKKAHERLEKLEQAILAKAFRGE 415 >gi|261212598|ref|ZP_05926882.1| type I restriction-modification system specificity subunit S [Vibrio sp. RC341] gi|260837663|gb|EEX64340.1| type I restriction-modification system specificity subunit S [Vibrio sp. RC341] Length = 248 Score = 148 bits (373), Expect = 2e-33, Method: Composition-based stats. Identities = 64/235 (27%), Positives = 106/235 (45%), Gaps = 6/235 (2%) Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL--- 251 +KEK+QA++S+ VTKGLN + MKDSG+EW+G VP+HW++K + N Sbjct: 1 MKEKRQAVISHAVTKGLNSNAPMKDSGVEWLGEVPEHWDMKRLKYIGEARNGLTYSPDDV 60 Query: 252 --IESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 E IL L NI +L + T +++ + + Sbjct: 61 VTQEEGILVLRSSNIQDARLSFSDNVYVNMDIPTRIRTKENDLLICSRNGSRQLIGKNAL 120 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 E + V + YL W++ S + + L +++ + + Sbjct: 121 ITKEAADMAFGAFMVVFRSKINPYLYWVLNSPLFDYQSGSFLTSTINQLTIGNLENMEIP 180 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +PP EQ +I N + ++ D L K + LLKER+++ I+AAVTG+ID+R Sbjct: 181 LPPECEQEEIKNYLIKKSDYFDDLTSKALHKVNLLKERKTALISAAVTGKIDVRH 235 Score = 94.9 bits (234), Expect = 3e-17, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 87/213 (40%), Gaps = 13/213 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKY 63 KDSGV+W+G +P+HW + +K + G T + I+ + +++ + Sbjct: 23 MKDSGVEWLGEVPEHWDMKRLKYIGEARNGLTYSPDDVVTQEEGILVLRSSNIQ--DARL 80 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDV 119 D D T + +L + A+I + ++ + Sbjct: 81 SFSDNVYVNMDIPTRIRTKENDLLICSRNGSRQLIGKNALITKEAADMAFGAFMVVFRSK 140 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + L L S + + T++ + N+ +P+PP EQ I+ +I ++ Sbjct: 141 INPYLYWVLNSPLFDYQSGSFLTS-TINQLTIGNLENMEIPLPPECEQEEIKNYLIKKSD 199 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 D L ++ + + LLKE+K AL+S VT ++ Sbjct: 200 YFDDLTSKALHKVNLLKERKTALISAAVTGKID 232 >gi|237721641|ref|ZP_04552122.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|229449437|gb|EEO55228.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] Length = 407 Score = 148 bits (372), Expect = 2e-33, Method: Composition-based stats. Identities = 56/406 (13%), Positives = 119/406 (29%), Gaps = 31/406 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP +W + +G T +I ++ D+ G +P+ Sbjct: 4 EIPDNWVWTTLGEVGTWQSGGTPSRSNKSYYGGNIPWLKTGDLNDGLISDIPESITEEAV 63 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 +S+ I G +L G + K I F + + + L + + Sbjct: 64 ASSSAKINPTGSVLIAMYGATIGKLGILTFPATTNQACCACIEFNAI-TQLYLFYFLLSQ 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G + + I N +P+PPL+EQ I +I ID + R Sbjct: 123 RSTFISKGGGGAQPNISKEIIVNTFIPLPPLSEQQRIIMEIEKWFALIDQIEQGRADLQT 182 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV-------------------GLVPDHWEV 234 +K+ K ++ + L P + IE + +P W Sbjct: 183 TIKQTKNKILDLAIHGKLVPQDMNDEPAIEQLKRINPDFIPCDNRHSGKLPYKIPKTWVW 242 Query: 235 KPFFALVTELNRK---NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +++ + + I+ + + + G+I Sbjct: 243 CSHNSILDISGGSQPAKSYFETIPKPNCIRLYQIRDYGESPVPVYIPINLASKQTKKGDI 302 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + K + A+ + + + + I + + S + Sbjct: 303 LLARYGGSLGK--VFYAEQGAYNVAMAKVIFKFENLIYKEFAYYYYLSDLYQGKLKEISR 360 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + D + +PPI EQ I + + +D + + +E Sbjct: 361 TAQTGFNITDFNDMYFPLPPINEQQRIVQKMEKLFSSLDDIQKNLE 406 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 62/200 (31%), Gaps = 12/200 (6%) Query: 227 LVPDHWEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPES-- 279 +PD+W + T + N NI L G++ L + E Sbjct: 4 EIPDNWVWTTLGEVGTWQSGGTPSRSNKSYYGGNIPWLKTGDLNDGLISDIPESITEEAV 63 Query: 280 -YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + +I G ++ K + + A A + + Sbjct: 64 ASSSAKINPTGSVLIAMYGATIGKLGILTF----PATTNQACCACIEFNAITQLYLFYFL 119 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 G G + ++ E + + +PP+ EQ I I A ID + + Sbjct: 120 LSQRSTFISKGGGGAQPNISKEIIVNTFIPLPPLSEQQRIIMEIEKWFALIDQIEQGRAD 179 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 +K+ ++ + A+ G+ Sbjct: 180 LQTTIKQTKNKILDLAIHGK 199 >gi|304315081|ref|YP_003850228.1| type I restriction-modification enzyme, subunit S [Methanothermobacter marburgensis str. Marburg] gi|302588540|gb|ADL58915.1| predicted type I restriction-modification enzyme, subunit S [Methanothermobacter marburgensis str. Marburg] Length = 368 Score = 148 bits (372), Expect = 2e-33, Method: Composition-based stats. Identities = 90/394 (22%), Positives = 162/394 (41%), Gaps = 32/394 (8%) Query: 31 KRFTKLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 +G+ ++GLE + SG K + ST F G ILYG Sbjct: 2 GEVVDQRRESIQPAGEGKNNFVGLEHIRSGETKLCEYVSDEGI--RSTKYRFYTGDILYG 59 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQRIEAICEGATMSH 148 KL PYL KA++AD +GICST +VL P D + PE L ++ + QR + G Sbjct: 60 KLRPYLDKAVLADINGICSTDLIVLTPSDRIIPEFLIYFIHTNQFIQRAVSTTSGTNHPR 119 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 WK I M +PPL EQ I E + I + + I + ++ K+ L+ ++ Sbjct: 120 TSWKAISKFRMALPPLEEQKRISEILQDVDGA----IEKVNKEIGVTEKLKRGLMQRLLM 175 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 +G+N + KDS + G +P W+V L T N K ++ + + N L Sbjct: 176 EGIN-HTEFKDSPV---GRIPVDWDVVKLGDLFTFKNGKRPPVLNEGEIPIYGANGKMGL 231 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + K ++ ++ GE+ V + I T +Y + Sbjct: 232 TSNYLKTKDKALIFGRVGSSGEVHLSKG----------CVWVSDNAIYTESY---DSKRV 278 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + ++ +L++ + + + V +PP++EQ I+ ++ R Sbjct: 279 NVHFMFYLIK---FKDLKRFATKTTHPIITQTFINNFKVPLPPLEEQKRISEILQDVDRR 335 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +++L E+ + L+ + + +TG+ +R Sbjct: 336 LELL---TERKV-KLENIKRGLMNDLLTGKRRVR 365 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 51/166 (30%), Gaps = 18/166 (10%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 ++KDS V G IP W VV + G+ G+ Sbjct: 182 EFKDSPV---GRIPVDWDVVKLGDLFTFKNGKRPP-------------VLNEGEIPIYGA 225 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 N + TS +++G++G + + + ++ Sbjct: 226 NGKMGLTSNYLKTKDKALIFGRVGSSGEVHLSKGCVWVSDNAIYTE--SYDSKRVNVHFM 283 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + + ++ T I N +P+PPL EQ I E + Sbjct: 284 FYLIKFKDLKRFATKTTHPIITQTFINNFKVPLPPLEEQKRISEIL 329 >gi|194436488|ref|ZP_03068589.1| type I restriction-modification enzyme S subunit [Escherichia coli 101-1] gi|194424520|gb|EDX40506.1| type I restriction-modification enzyme S subunit [Escherichia coli 101-1] Length = 414 Score = 147 bits (371), Expect = 3e-33, Method: Composition-based stats. Identities = 77/425 (18%), Positives = 159/425 (37%), Gaps = 35/425 (8%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSESGK-------DIIYIGLEDVESGTGKYLP-KDGNS 70 +P+ W + K+ G + + ++ G ++ + Sbjct: 2 KLPEGWHNKLLGDLFTKIVVGYVGNVNDHYCDAAIGVPFYRTLNIRDGYFRHDDIRYVTP 61 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ-----FLVLQPKDVLPELLQ 125 +D + S IL ++G L A S K+ P+ Sbjct: 62 EFNDKNKKSQIENDDILIARVGANLGMVCKATGLNRTSNMANAIIIKSKSAKNADPDFYT 121 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +LLS +I A G + K I +P+PPLAEQ I + + D I Sbjct: 122 YFLLSTYGKSQIYAGAAGGAQGVFNTKLTQEIAVPVPPLAEQKKIAQIL----SAWDKAI 177 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + + +++K+AL+ +++ + ++G+ + G WEV L+ E Sbjct: 178 SVTEKLLTNSQQQKKALMQQLLS---GKKRLLDENGVMFSGE----WEVVRLKQLIHEEK 230 Query: 246 RKNTKLIESNILSL-SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 ++N +LS+ ++ + E + + E TY+IV + + L S Sbjct: 231 KRNRDNHIQRVLSVTNHSGFVLPEEQFSKRVASEDVSTYKIVKKNQYGYNPSRLN--VGS 288 Query: 305 LRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 + G+++ Y+ + +S Y M S + + G +R S+ F+ Sbjct: 289 FARLDNYDEGVLSPMYVVFSINHERLNSDYFLNWMSSNEAKQRIAGSTQGSVRDSVGFDA 348 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + +P + EQ I V++ A + +E+ + LKE + + + +TG+ + Sbjct: 349 LCSFSFSLPTLMEQQKIAAVLSAADAE----ITTLEKKLACLKEEKKALMQQLLTGKRRV 404 Query: 422 RGESQ 426 + E + Sbjct: 405 KVEVE 409 >gi|91217916|ref|ZP_01254869.1| type I restriction-modification enzyme 1, S subunit [Psychroflexus torquis ATCC 700755] gi|91183893|gb|EAS70283.1| type I restriction-modification enzyme 1, S subunit [Psychroflexus torquis ATCC 700755] Length = 441 Score = 147 bits (371), Expect = 3e-33, Method: Composition-based stats. Identities = 69/456 (15%), Positives = 150/456 (32%), Gaps = 48/456 (10%) Query: 1 MKHYKAYP----------------QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES 44 M +K Y YK + +G IP+ W V + + + + G+ Sbjct: 1 MSKHKQYDVATSTLLSTGLEGKRVGYKKTK---LGWIPEDWNVKSLDQLGEFSKGKGITK 57 Query: 45 GK-------DIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLR 96 + + ++ + Q + + G IL+ G L Sbjct: 58 KDILEDEVGGLPCVRYAEIYTIYHYNTTVLKSKINQESAANSNPINCGDILFAGSGETLE 117 Query: 97 -----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADW 151 A + +L+ + P+ L + V ++ I +G ++ H Sbjct: 118 DIGKSIAYLNKETAYAGGDICILKHHNQDPQFLGYLFNNDVVRSQLYKIGQGHSVVHIYS 177 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 G+ + +PIPPL EQ I + I + QAL + ++ + L Sbjct: 178 SGLKKVSVPIPPLPEQQKIASILNTWDKAIAAQEKLIAQK--------QALKNGLMQQLL 229 Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + EW + N+ + I ++ G + T Sbjct: 230 TGKKRFAGFVEEWEEKSLNDIVKYLGGEAFKSTNQVENGVRWLKIANVGIGVVKWGDSTT 289 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQND---KRSLRSAQVMERGIITSAYMAVKPHGI 328 + ++ G+ V + K ++ + + + + + Sbjct: 290 FLPTSFIDENPKYVLKAGDAVMALTRPILNDKLKIAVFNKEDGIALLNQRVAKLISKNKN 349 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D ++ ++ ++ AM +G ++ +D+ + V +P +EQ I +VI Sbjct: 350 DLKFIYYIHQTPYFIYTMNAMMAGTDPPNISIKDLAKKKVFIPGYEEQKKIVSVIESFDN 409 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ID L+ K + LK+++ + +TG+ ++G Sbjct: 410 EIDNLINKGK----HLKKQKQGLMQQLLTGEKRVKG 441 >gi|326201156|ref|ZP_08191028.1| restriction modification system DNA specificity domain [Clostridium papyrosolvens DSM 2782] gi|325988724|gb|EGD49548.1| restriction modification system DNA specificity domain [Clostridium papyrosolvens DSM 2782] Length = 397 Score = 147 bits (371), Expect = 3e-33, Method: Composition-based stats. Identities = 73/423 (17%), Positives = 144/423 (34%), Gaps = 41/423 (9%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGT 60 YK + +G IP+ W+V I+ + TG T GK ++ ++ D+ Sbjct: 4 EGYKMTE---LGEIPQEWEVRKIEDLYSVLTGATPLRGKQEYYLNGNVAWVKTLDLNDRY 60 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKD 118 + ++ + +G +L G + + + I + L + Sbjct: 61 IYDTQEKITDLALKETSCKVQDEGTVLIAMYGGFNQIGRTGILKTKAATNQAICSLPLIE 120 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + + L + + + + +PPL+EQ I + + Sbjct: 121 EIYPEYLNYFLIKNRNVWRNVAASTRKDPNITKGDVEKFNIIVPPLSEQYKIADIL---- 176 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 ID I + IE +E K+ L+ ++ KG+ +G +P WEVK Sbjct: 177 STIDEQIDKTDALIEKTRELKKGLMQKLLIKGIGHTEFRDT----EIGRIPKGWEVKKLE 232 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 +V KN K +E + N + D ++ Sbjct: 233 EIVQICYGKNQKEVEIEGGIYKILGTGGVIGNTND----------YLWDKPSVLIGRKGT 282 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + + + + + G + +L + + DL K A G SL Sbjct: 283 IDKPMYI----EEPFWTVDTLFYTKVDEGYVAKWLYYYLNKIDLKKYNEATG---VPSLS 335 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + +LVPP KEQ I+ +++ + ID E L+ + + + +TG+ Sbjct: 336 VAVLNTILILVPPFKEQQKISKILSAVDSDID----VYESKKNKLENAKKALMNHLLTGK 391 Query: 419 IDL 421 I + Sbjct: 392 IRV 394 >gi|295394613|ref|ZP_06804832.1| type I restriction-mod [Brevibacterium mcbrellneri ATCC 49030] gi|294972506|gb|EFG48362.1| type I restriction-mod [Brevibacterium mcbrellneri ATCC 49030] Length = 388 Score = 147 bits (371), Expect = 3e-33, Method: Composition-based stats. Identities = 70/409 (17%), Positives = 142/409 (34%), Gaps = 36/409 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN-SRQSDTSTVSIFAK 83 W+ VP + RT ++++ + E G + +D N +R + + + Sbjct: 4 WQSVPFHTLFRRVPKRTGFPAEELLSV---YREYGVIRKSDRDDNFNRPGNLNDYQLVKT 60 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G ++ K+ + I+ GI S + V P E + L A Sbjct: 61 GDLVLNKMKAWQGSLGISPHTGIVSPAYFVYTPVSDNDESFLHYALRCRDAVDYYAAHST 120 Query: 144 ---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + +P+P+P LA Q I + + E ++ LI E R +L+ ++ Sbjct: 121 GIRVNQWDVSPEWLDAMPVPVPDLATQRRIVDYLDKEISEMNALIEEVQRLTKLVIARRD 180 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A + + +P F V + +E L Sbjct: 181 A------------------TAGSLLADLP--VAPVSMFWRVIDCLHITAPFVEVGTNFLV 220 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVD-------PGEIVFRFIDLQNDKRSLRSAQVMER 313 + ET+ I+ PG+++ + K S+ Sbjct: 221 SIEQLGHRNLDLTRANRTDDETFSILRVGDRKPAPGDVIMSR-NASVGKCSIVRETDPPI 279 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPI 372 + + K DS L + S + + + + +K+LP V + Sbjct: 280 ALGQDVVIFKKNDKHDSRLLLHFLGSDVIKRTIEMSTVGSTLKRINVGTIKKLPYPVATL 339 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++Q +I + ++ E R+D L+E+ + I LK +++ I VTG+ ++ Sbjct: 340 EKQREIADELDREFMRMDSLIEESTRLIENLKAHKTALITEVVTGRKEV 388 >gi|312973901|ref|ZP_07788072.1| type I restriction modification DNA specificity domain protein [Escherichia coli 1827-70] gi|310331435|gb|EFP98691.1| type I restriction modification DNA specificity domain protein [Escherichia coli 1827-70] Length = 426 Score = 147 bits (370), Expect = 3e-33, Method: Composition-based stats. Identities = 76/431 (17%), Positives = 163/431 (37%), Gaps = 35/431 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTG--KYLPKDGNSRQSD 74 +PK W + G S + I + + D + Sbjct: 2 VPKGWSESYLGEVVTYKKGYAFNSSLYAEEGIRIVRISDTTRDSIHSDNPVFIAGGNVEG 61 Query: 75 TSTVSIFAKGQILYGKLGPYLR----------KAIIADFDGICSTQFLVLQPKD--VLPE 122 S+F I+ +G K + + + + + L PK + E Sbjct: 62 LEQYSLFE-NDIILSTVGSRPHLLDSMVGKAVKVPRSAHNSLLNQNLVKLIPKKTKITNE 120 Query: 123 LLQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L + + Q I + G + +P L EQ I + + I Sbjct: 121 YLFSMLKTKEFIQFISNLVRGNANQVSITLADLFKYKFILPSLPEQKKIAQILSTWDKAI 180 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 + K Q L++ GL + I G +P W+ + Sbjct: 181 SVTEKLLTNSQQQKKALMQQLLTGKKRLGLPAGSY--EFKITRYGSIPKDWDYPAIKEIC 238 Query: 242 TELNRKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 T+++ KN+ ++ +LS S + + L+ N + + Y+++ G F F Sbjct: 239 TQVSEKNSAAVDHPVLSCSKHDGFVDSLKYFNKKVYSDDLSGYRLIHRG--CFGFPSNHI 296 Query: 301 DKRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQ 355 ++ S+ + + GI++ Y+ + P +D++YL ++++ ++F A + R Sbjct: 297 EEGSIGLQNLYDTGIVSPIYVVFRASPTKVDNSYLYAVLKTDHYKQIFGAATNASVDRRG 356 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 SL++++ ++ V +PP+KEQ I+ V++ A + +E+ + LK+ + + + + Sbjct: 357 SLRWKEFNQIHVPLPPLKEQQKISAVLSAADAE----ITTLEKKLACLKDEKKALMQQLL 412 Query: 416 TGQIDLR-GES 425 TG+ ++ E+ Sbjct: 413 TGKRRVKVDEA 423 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 61/192 (31%), Gaps = 7/192 (3%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G+IPK W IK + + + D + + + D S Sbjct: 223 GSIPKDWDYPAIKEICTQVSEKN-SAAVDHPVLSCSKHDGFVDSLKYFNKKVYSDDLSGY 281 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQ--PKDVLPELLQGWLLSIDVT 134 + +G + + + GI S ++V + P V L L + Sbjct: 282 RLIHRGCFGFPSNHIEEGSIGLQNLYDTGIVSPIYVVFRASPTKVDNSYLYAVLKTDHYK 341 Query: 135 QRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 Q A WK I +P+PPL EQ I + A I TL + Sbjct: 342 QIFGAATNASVDRRGSLRWKEFNQIHVPLPPLKEQQKISAVLSAADAEITTLEKKLACLK 401 Query: 193 ELLKEKKQALVS 204 + K Q L++ Sbjct: 402 DEKKALMQQLLT 413 >gi|213580643|ref|ZP_03362469.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. E98-0664] Length = 468 Score = 147 bits (370), Expect = 4e-33, Method: Composition-based stats. Identities = 66/417 (15%), Positives = 148/417 (35%), Gaps = 18/417 (4%) Query: 19 GAIPKHWKVVPIKRFTKL-NTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 G +P+ W + G T++S D+ ++ D+ G + + Sbjct: 10 GKLPEGWVTTHLSEICSKPQYGYTTKSSSMGDVKFLRTTDITKGAVDWSSVPYCMDAPED 69 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSID 132 + I+ + G ++ + S L+ +L S D Sbjct: 70 VSKYQLQDRDIVISRAGSVGFSFLVQNPPSQVVFASYLIRFKPVNYFSEYYLKRFLESSD 129 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ + G + + + + + + +PIPP+AEQ +I EK+ ++D+ + Sbjct: 130 YWNQLSLMSAGNAVQNVNAQKLSTLTVPIPPIAEQKIIAEKLDTLLAQVDSTKARLEQIP 189 Query: 193 ELLKEKKQALVSYIVTKGL----NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 ++LK +QA+++ V+ L + S +W +P W V + LV K Sbjct: 190 QILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQW-PDLPSTWSVHKYSELVDSRLGKM 248 Query: 249 TKLIESNILSLSYGNIIQ------KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 ++ + Y I LE L + + G+++ Sbjct: 249 LDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLGDVLICEGGEPGRC 308 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQSLKFED 361 Q + + + A I +L + +++ + + + L + Sbjct: 309 AIWSEPQDIPVIFQKALHRARVKDKIIPEWLVYNLKNDSNNISLSQLFTGTTIKHLTGKA 368 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + P+ VPP++EQ +I + A D + +++ ++ + S +A A G+ Sbjct: 369 LANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVNNALNRVNSLTQSILAKAFRGE 425 >gi|268316651|ref|YP_003290370.1| restriction modification system DNA specificity domain-containing protein [Rhodothermus marinus DSM 4252] gi|262334185|gb|ACY47982.1| restriction modification system DNA specificity domain protein [Rhodothermus marinus DSM 4252] Length = 444 Score = 147 bits (370), Expect = 4e-33, Method: Composition-based stats. Identities = 78/436 (17%), Positives = 161/436 (36%), Gaps = 30/436 (6%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLP 65 P Y+ + +G +P+ W+VV + + E+ D +Y L G L Sbjct: 10 PGYRMTE---LGPLPEEWRVVRLGEVLTPVYKKLRETLVEDDKVYRLLTVRLYAKGITLR 66 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDV--L 120 + + T + G ++ K+ G+ S F +L + Sbjct: 67 SEEKGNRIKTKKLYCTKSGDFVFSKIDARNGAWGFVTDELEGGLVSGDFPILTLERHKAD 126 Query: 121 PELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 ++ L V + + I G T + + + +PPLAEQ I + Sbjct: 127 QSFIELQLAQPTVWEPLRNIAVGTTNRRRLHTFQLLQVAVALPPLAEQRAIAHVL----R 182 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPF 237 + R I LKE K++L+ ++ T G P + + ++ +G +P HW V Sbjct: 183 TVQEAKEATERVIAALKELKRSLMRHLFTYGPVPLDQTEAVELQETEIGPLPTHWRVVRL 242 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK--PESYETYQIVDPGEIVFRF 295 + +R +L + + + I + + K P+ + +V G+++ Sbjct: 243 EEVANIGHRGQKRLFQVQVPFIPMALIPEDGLYLDKWEKRAPQDVRSGVLVKNGDLLLAK 302 Query: 296 IDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAM- 349 I N K+ + G T+ + P +LA+ ++ ++ + + Sbjct: 303 ITPCFENGKQGIVRNLPDGWGYATTEVFPIYPKDHQRLLLEFLAYYLKVENVRQALASKM 362 Query: 350 -GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G+ RQ L + + +PP+ EQ +I ++ AR +E E+ L+ Sbjct: 363 EGTTGRQRLPKAVLIECKIPLPPLPEQQEIARMLQAVDAR----IEAEEKKKAALEALFK 418 Query: 409 SFIAAAVTGQIDLRGE 424 + + +T ++ + E Sbjct: 419 TLLHHLMTAKVRVPEE 434 >gi|167627752|ref|YP_001678252.1| type I restriction-modification system subunit S [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167597753|gb|ABZ87751.1| type I restriction-modification system, subunit S [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 407 Score = 146 bits (369), Expect = 5e-33, Method: Composition-based stats. Identities = 58/414 (14%), Positives = 135/414 (32%), Gaps = 24/414 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKD-G 68 + +P W+ + G ++ K I + ++ + + Sbjct: 4 LYKLPAGWEWKKLGEECLFENGDRGKNYPSKSAFVSKGIPVVSATNLTGWSIDRSKLNFI 63 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQG 126 + + K IL+ G + A++ D + I S+ ++ +++ L Sbjct: 64 TEERYNLIGGGKIKKNDILFCLRGSLGKCALVTDIERGVIASSLVIIRTCENLSNIFLMY 123 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L S + I GA + K + +P+PPLAEQ I K+ + +ID I Sbjct: 124 YLNSHLIQDFINKYNNGAAQPNLSAKNLSLFNIPLPPLAEQKRIVAKLDSLFEKIDKAIE 183 Query: 187 ERIRFIELLKEKKQALVSYIVTK--GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + I + + K G + + G P + + Sbjct: 184 LHQQNITNANTLMASTLDKTFKKLEGEYSLIPLHKITTAVGGGTPKRNIKEYWGNGEIVW 243 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 IL++ + + S + +++ G +++ Sbjct: 244 LSPTDLGAIGEILNI-------RESRDKITELGLSKSSARLLPVGTVLYSSRATIGKIAI 296 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 +G I + +LA+ + + ++ S + + +K+ Sbjct: 297 NEIEVCTNQGFTN---FICDKDKIYNYFLAYSL-AKYTEEITSLSNSTTFKEVSKTSIKK 352 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +PP+ Q ++ ++D + + EQ + LK ++S + A G+ Sbjct: 353 FEIPLPPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAFRGE 406 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 65/195 (33%), Gaps = 14/195 (7%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS-----------YGNIIQKLETRN 272 + +P WE K N K S +S G I + + Sbjct: 3 ELYKLPAGWEWKKLGEECLFENGDRGKNYPSKSAFVSKGIPVVSATNLTGWSIDRSKLNF 62 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + + + + +I+F + + I +S + + + + Sbjct: 63 ITEERYNLIGGGKIKKNDILFCLRGSLGKCALVTDIERG--VIASSLVIIRTCENLSNIF 120 Query: 333 LAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L + + S+ + +G + +L +++ + +PP+ EQ I ++ +ID Sbjct: 121 LMYYLNSHLIQDFINKYNNGAAQPNLSAKNLSLFNIPLPPLAEQKRIVAKLDSLFEKIDK 180 Query: 392 LVEKIEQSIVLLKER 406 +E +Q+I Sbjct: 181 AIELHQQNITNANTL 195 >gi|19881261|gb|AAM00867.1|AF486554_3 HsdS [Campylobacter jejuni] Length = 397 Score = 146 bits (369), Expect = 5e-33, Method: Composition-based stats. Identities = 65/406 (16%), Positives = 126/406 (31%), Gaps = 27/406 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ WKV + +++ TG T GKD + D E G N + Sbjct: 10 LPQGWKVKTLSEISEIVTGSTPSKSNLDFYGKDYPFFKPSDFEQGYF-LENAGDNLSKLG 68 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDV 133 IL +G L K + G C+ Q + P K+++ E + + +S Sbjct: 69 FDKARQLPPKTILVVCIGS-LGKVALTRVIGSCNQQINAIIPHKNIIAEYIYYYCISSKF 127 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + T++ + + + P + EQ I + ID I + + Sbjct: 128 QSILFSKAPQTTLAILNKTEFSKLEIIYPKDIKEQERIVRILDESFANIDESIKILEQDL 187 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L E Q+ + + + +P WE K + ++ K Sbjct: 188 LNLDELMQSALQKAFNPLKD--------NAKENYKLPQGWEWKSLEEISENISAGGDKPK 239 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + I N + I+ P I + + + Sbjct: 240 NCTESKTAKNQIPVYANGVNNNGLVGYTDKATIIKPS----LTISARGTIGFVCIRKEPY 295 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 I+ + + + YL + + L K L + +PP+ Sbjct: 296 FPIVRLISLIPCENILCLHYLYFCLN-----FFIAKGEGSSIPQLTIPKFKSLQIPLPPL 350 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 KEQ I ++ + L E + + +E + S + A G+ Sbjct: 351 KEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLNKAFKGE 396 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 66/193 (34%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ ++ ++ G + ++ Y N+ + Sbjct: 214 KLPQGWEWKSLEEISENISAGGDKPKN----CTESKTAKNQIPVYANGVNNNGLVGYTDK 269 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K + G I + + + L P + + L + + E Sbjct: 270 ATIIKPSLTISARGTIGFVCIRKEPYFPI-VRLISLIPCENILCLHYLYFCLNFFIAKGE 328 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+++ ++ +P+PPL EQ I + + + L + ++ +E Sbjct: 329 ----GSSIPQLTIPKFKSLQIPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEEL 384 Query: 199 KQALVSYIVTKGL 211 KQ+L++ L Sbjct: 385 KQSLLNKAFKGEL 397 >gi|19881224|gb|AAM00836.1|AF486548_3 HsdS [Campylobacter jejuni] Length = 395 Score = 146 bits (368), Expect = 6e-33, Method: Composition-based stats. Identities = 62/406 (15%), Positives = 125/406 (30%), Gaps = 27/406 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+V + ++ TG T KD + D E G N + Sbjct: 8 LPQGWEVKTLSEIGEIITGSTPSKSNVEFYRKDYPFFKPSDFEQGYF-LENAGDNLSKLG 66 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDV 133 IL +G L K + G C+ Q + P K+++ E + + +S Sbjct: 67 FGKARQLPPKTILVVCIGS-LGKVALTRVIGSCNQQINAIIPHKNIISEYIYYYCISSKF 125 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + T++ + + + P + EQ I + +ID I + + Sbjct: 126 QSILFSKAPQTTLAIFNKTEFSKLEIIYPKDIKEQERIVGILDFAFSKIDENIKKAKENL 185 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + E Q+ + + + +P WE K + ++ K Sbjct: 186 ANIDELMQSALQKAFNPLKD--------NAKENYKLPQSWEWKSLEEISENISAGGDKPK 237 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + I N + I+ P I + + + Sbjct: 238 NCTESKTAKNQIPVYANGVNNNGLVGYTDKATIIKPS----LTISARGTIGFVCIRKEPY 293 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 I+ + + + YL + + L K L + +PP+ Sbjct: 294 FPIVRLISLIPCENILCLHYLYFCLN-----FFIAKGEGSSIPQLTIPKFKSLQIPLPPL 348 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 KEQ I ++ + L E + + +E + S + A G+ Sbjct: 349 KEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLNKAFKGE 394 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 66/193 (34%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ ++ ++ G + ++ Y N+ + Sbjct: 212 KLPQSWEWKSLEEISENISAGGDKPKN----CTESKTAKNQIPVYANGVNNNGLVGYTDK 267 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K + G I + + + L P + + L + + E Sbjct: 268 ATIIKPSLTISARGTIGFVCIRKEPYFPI-VRLISLIPCENILCLHYLYFCLNFFIAKGE 326 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+++ ++ +P+PPL EQ I E + + L + ++ +E Sbjct: 327 ----GSSIPQLTIPKFKSLQIPLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEEL 382 Query: 199 KQALVSYIVTKGL 211 KQ+L++ L Sbjct: 383 KQSLLNKAFKGEL 395 >gi|168821023|ref|ZP_02833023.1| subunit S of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205342187|gb|EDZ28951.1| subunit S of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320088959|emb|CBY98715.1| subunit S of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 462 Score = 146 bits (368), Expect = 6e-33, Method: Composition-based stats. Identities = 66/417 (15%), Positives = 148/417 (35%), Gaps = 18/417 (4%) Query: 19 GAIPKHWKVVPIKRFTKL-NTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 G +P+ W + G T++S D+ ++ D+ G + + Sbjct: 4 GKLPEGWVTTHLSEICSKPQYGYTTKSSSMGDVKFLRTTDITKGAVDWSSVPYCMDAPED 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSID 132 + I+ + G ++ + S L+ +L S D Sbjct: 64 VSKYQLQDRDIVISRAGSVGFSFLVQNPPSQVVFASYLIRFKPVNYFSEYYLKRFLESSD 123 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ + G + + + + + + +PIPP+AEQ +I EK+ ++D+ + Sbjct: 124 YWNQLSLMSAGNAVQNVNAQKLSTLTVPIPPIAEQKIIAEKLDTLLAQVDSTKARLEQIP 183 Query: 193 ELLKEKKQALVSYIVTKGL----NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 ++LK +QA+++ V+ L + S +W +P W V + LV K Sbjct: 184 QILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQW-PDLPSTWSVHKYSELVDSRLGKM 242 Query: 249 TKLIESNILSLSYGNIIQ------KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 ++ + Y I LE L + + G+++ Sbjct: 243 LDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLGDVLICEGGEPGRC 302 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQSLKFED 361 Q + + + A I +L + +++ + + + L + Sbjct: 303 AIWSEPQDIPVIFQKALHRARVKDKIIPEWLVYNLKNDSNNISLSQLFTGTTIKHLTGKA 362 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + P+ VPP++EQ +I + A D + +++ ++ + S +A A G+ Sbjct: 363 LANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVNNALTRVNSLTQSILAKAFRGE 419 >gi|257440746|ref|ZP_05616501.1| type I restriction-modification [Faecalibacterium prausnitzii A2-165] gi|257196807|gb|EEU95091.1| type I restriction-modification [Faecalibacterium prausnitzii A2-165] Length = 275 Score = 146 bits (368), Expect = 6e-33, Method: Composition-based stats. Identities = 63/273 (23%), Positives = 122/273 (44%), Gaps = 14/273 (5%) Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 +PP Q+ + + A+ IDT++++ IE K+ KQA+++ VTKG+ + +MKD Sbjct: 5 LPPKEIQIRSAQYLNAKCTEIDTMLSKTRSSIEEYKKLKQAVITQAVTKGVRGEREMKDC 64 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNR-------KNTKLIESNILSLSYGNIIQKLETRNM 273 G+EW GLVP HW V ++ + +++ ++ I + ++ + Sbjct: 65 GVEWAGLVPHHWGVAKIGSIGQTSSGATPLRSKESSFFDDATIRWVRTLDLNDGFVYDSS 124 Query: 274 GLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 E + I+ G + +M A ++ + Sbjct: 125 EKITELALASSACSIMPKGTVCVAMYGGAGTIGKCG--LLMSDCATNQAVCSIVCNRKIV 182 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + + LM+ L + G R ++ + V R+ +L+PP+ EQ +IT+ ++ + A Sbjct: 183 SPIFLLMQLLALKPYWMKYAVGTRKDPNISQDIVARMKILIPPLDEQKEITDYLDAKCAE 242 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ID L+ K EQ + L+ + S I VTG+ ++ Sbjct: 243 IDKLIAKKEQLVKELESYKKSLIYEVVTGKREV 275 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 81/210 (38%), Gaps = 11/210 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGT 60 + KD GV+W G +P HW V I + ++G T K+ I ++ D+ G Sbjct: 60 EMKDCGVEWAGLVPHHWGVAKIGSIGQTSSGATPLRSKESSFFDDATIRWVRTLDLNDGF 119 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKD 118 + +S SI KG + G + K + D + + Sbjct: 120 VYDSSEKITELALASSACSIMPKGTVCVAMYGGAGTIGKCGLLMSDCATNQAVCSIVCNR 179 Query: 119 VLPELLQGWLLSIDVTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + + + + G + + + + IPPL EQ I + + A+ Sbjct: 180 KIVSPIFLLMQLLALKPYWMKYAVGTRKDPNISQDIVARMKILIPPLDEQKEITDYLDAK 239 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIV 207 ID LI ++ + ++ L+ K++L+ +V Sbjct: 240 CAEIDKLIAKKEQLVKELESYKKSLIYEVV 269 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 4/63 (6%) Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT-GQIDLRG 423 + + +PP + Q +N + ID ++ K SI K+ + + I AVT G +RG Sbjct: 1 MLLALPPKEIQIRSAQYLNAKCTEIDTMLSKTRSSIEEYKKLKQAVITQAVTKG---VRG 57 Query: 424 ESQ 426 E + Sbjct: 58 ERE 60 >gi|21229244|ref|NP_635166.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] gi|20907818|gb|AAM32838.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] Length = 398 Score = 146 bits (368), Expect = 6e-33, Method: Composition-based stats. Identities = 59/415 (14%), Positives = 141/415 (33%), Gaps = 39/415 (9%) Query: 20 AIPKHWKVVPIKRFTKLN---TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ W+ + ++N ++ ++ ++ ++ +E TG S + + Sbjct: 4 KLPEGWEWKKLGEIAEINPKFDKKSVSESTEVTFLPMKCIEELTGNVDTSITKSLEEVSK 63 Query: 77 TVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDV-LPELLQGWLL 129 + + ++Y K+ P + + + G ST+F V++ K + +L+ Sbjct: 64 GYTPLIENDLIYAKITPCMENGKAAIATGLKNNLGFASTEFHVIRFKKNAYNKFFFFYLI 123 Query: 130 SIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + G+ + N+ +P+PPL Q I + Sbjct: 124 QKRIREHAAMNMTGSAGQKRVPATFLKNLLVPLPPLETQQKIVSILEKAEET-------- 175 Query: 189 IRFIELLKEKKQALVSY-IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + E Q L+ + +P ++ + +G + + + +R Sbjct: 176 RKLRAQADELTQKLLQSVFLEMFGDPVKNSREWKLHKLGEIGN-------WTSGGTPSRS 228 Query: 248 NTKLIESNILSLSYG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + I + G + + + + + ++ G ++ D K Sbjct: 229 MPEYFHGEIPWFTAGELNDSYVYGSKEKITKEALNSSSAKLFPAGTMLIGMYDTAAFKMG 288 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVK 363 + + A A P L L ++ F + G+ +++L +K Sbjct: 289 I----LKNPASSNQACAAFSPKVEVINTLFALYLFKEMKDSFLSQRRGIRQKNLSQSIIK 344 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + V VPPI+ Q + +ID + E +QS + + + A TG+ Sbjct: 345 KFEVPVPPIELQKQFAD----MVQKIDQIKESQKQSSLETNNLFDALMQKAFTGK 395 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 56/194 (28%), Gaps = 10/194 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 WK+ + +G T +I + ++ + ++S+ Sbjct: 207 EWKLHKLGEIGNWTSGGTPSRSMPEYFHGEIPWFTAGELNDSYVYGSKEKITKEALNSSS 266 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 +F G +L G K I + PK + L L ++ Sbjct: 267 AKLFPAGTMLIGMYDTAAFKMGILKNPASSNQACAAFSPKVEVINTLFALYLFKEMKDSF 326 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G + I +P+PP+ Q + +ID + + + Sbjct: 327 LSQRRGIRQKNLSQSIIKKFEVPVPPIELQKQFAD----MVQKIDQIKESQKQSSLETNN 382 Query: 198 KKQALVSYIVTKGL 211 AL+ T L Sbjct: 383 LFDALMQKAFTGKL 396 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 70/200 (35%), Gaps = 18/200 (9%) Query: 226 GLVPDHWEVKPFFALVTELNR--KNTKLIESNILSLSYGNI---IQKLETRNMGLKPESY 280 +P+ WE K + + K + + + L I ++T E Sbjct: 3 NKLPEGWEWKKLGEIAEINPKFDKKSVSESTEVTFLPMKCIEELTGNVDTSITKSLEEVS 62 Query: 281 ETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDSTYLAWL 336 + Y + ++++ I ++N K ++ + G ++ + K + + + +L Sbjct: 63 KGYTPLIENDLIYAKITPCMENGKAAIATGLKNNLGFASTEFHVIRFKKNAYNKFFFFYL 122 Query: 337 M-RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + GS ++ + +K L V +PP++ Q I +++ E+ Sbjct: 123 IQKRIREHAAMNMTGSAGQKRVPATFLKNLLVPLPPLETQQKIVSILEKA--------EE 174 Query: 396 IEQSIVLLKERRSSFIAAAV 415 + E + + Sbjct: 175 TRKLRAQADELTQKLLQSVF 194 >gi|16763331|ref|NP_458948.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|29144809|ref|NP_808151.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|56416308|ref|YP_153383.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197365231|ref|YP_002144868.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|213052555|ref|ZP_03345433.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. E00-7866] gi|213864869|ref|ZP_03386988.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. M223] gi|289825441|ref|ZP_06544672.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. E98-3139] gi|25289196|pir||AB1069 chain S of type I restriction-modification system [imported] - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16505640|emb|CAD03369.1| subunit S of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Typhi] gi|29140448|gb|AAO72011.1| subunit S of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|56130565|gb|AAV80071.1| subunit S of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197096708|emb|CAR62331.1| subunit S of type I restriction-modification system [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] Length = 462 Score = 146 bits (368), Expect = 6e-33, Method: Composition-based stats. Identities = 66/417 (15%), Positives = 148/417 (35%), Gaps = 18/417 (4%) Query: 19 GAIPKHWKVVPIKRFTKL-NTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 G +P+ W + G T++S D+ ++ D+ G + + Sbjct: 4 GKLPEGWVTTHLSEICSKPQYGYTTKSSSMGDVKFLRTTDITKGAVDWSSVPYCMDAPED 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSID 132 + I+ + G ++ + S L+ +L S D Sbjct: 64 VSKYQLQDRDIVISRAGSVGFSFLVQNPPSQVVFASYLIRFKPVNYFSEYYLKRFLESSD 123 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ + G + + + + + + +PIPP+AEQ +I EK+ ++D+ + Sbjct: 124 YWNQLSLMSAGNAVQNVNAQKLSTLTVPIPPIAEQKIIAEKLDTLLAQVDSTKARLEQIP 183 Query: 193 ELLKEKKQALVSYIVTKGL----NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 ++LK +QA+++ V+ L + S +W +P W V + LV K Sbjct: 184 QILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQW-PDLPSTWSVHKYSELVDSRLGKM 242 Query: 249 TKLIESNILSLSYGNIIQ------KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 ++ + Y I LE L + + G+++ Sbjct: 243 LDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLGDVLICEGGEPGRC 302 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQSLKFED 361 Q + + + A I +L + +++ + + + L + Sbjct: 303 AIWSEPQDIPVIFQKALHRARVKDKIIPEWLVYNLKNDSNNISLSQLFTGTTIKHLTGKA 362 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + P+ VPP++EQ +I + A D + +++ ++ + S +A A G+ Sbjct: 363 LANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVNNALNRVNSLTQSILAKAFRGE 419 >gi|313896063|ref|ZP_07829617.1| conserved hypothetical protein [Selenomonas sp. oral taxon 137 str. F0430] gi|320529368|ref|ZP_08030456.1| conserved domain protein [Selenomonas artemidis F0399] gi|312975488|gb|EFR40949.1| conserved hypothetical protein [Selenomonas sp. oral taxon 137 str. F0430] gi|320138334|gb|EFW30228.1| conserved domain protein [Selenomonas artemidis F0399] Length = 223 Score = 146 bits (368), Expect = 7e-33, Method: Composition-based stats. Identities = 70/213 (32%), Positives = 112/213 (52%), Gaps = 2/213 (0%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 K + W+ VP HW L KN E NILSL+ +++ + + Sbjct: 4 YKTYKTTDQSWLTNVPKHWGYVKCKTLFATQTEKNKNNEEGNILSLTLQGVVRNNREKPI 63 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTY 332 GL P Y TYQI + ++VF+ IDL+N S R V ERGI++SAY+ + ++ Y Sbjct: 64 GLSPSDYRTYQIFEKDDLVFKLIDLENISTS-RVGLVPERGIMSSAYIRLSAKCDINTRY 122 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + L ++F +G+G+RQ+L D+ + ++VPP EQ I ++ + + ID Sbjct: 123 FYFQYYDLWLRQIFNGLGAGVRQTLSANDLLNIKIVVPPRDEQDQIVRYLDSKISAIDAG 182 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + K+E+ I LKE +S+ I+ VTG+ID+R Sbjct: 183 ISKLEEQIKCLKELKSTLISDVVTGKIDVRDAE 215 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 47/212 (22%), Positives = 83/212 (39%), Gaps = 7/212 (3%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTGKY 63 K Y YK + W+ +PKHW V K T + + + +I+ + L+ V Sbjct: 2 KRYKTYKTTDQSWLTNVPKHWGYVKCKTLFATQTEKNKNNEEGNILSLTLQGVVRNNR-- 59 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKL---GPYLRKAIIADFDGICSTQFLVLQPKDVL 120 K SD T IF K +++ + + + GI S+ ++ L K + Sbjct: 60 -EKPIGLSPSDYRTYQIFEKDDLVFKLIDLENISTSRVGLVPERGIMSSAYIRLSAKCDI 118 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + ++I + NI + +PP EQ I + ++ Sbjct: 119 NTRYFYFQYYDLWLRQIFNGLGAGVRQTLSANDLLNIKIVVPPRDEQDQIVRYLDSKISA 178 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 ID I++ I+ LKE K L+S +VT ++ Sbjct: 179 IDAGISKLEEQIKCLKELKSTLISDVVTGKID 210 >gi|213418182|ref|ZP_03351248.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. E01-6750] Length = 471 Score = 146 bits (367), Expect = 7e-33, Method: Composition-based stats. Identities = 66/417 (15%), Positives = 148/417 (35%), Gaps = 18/417 (4%) Query: 19 GAIPKHWKVVPIKRFTKL-NTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 G +P+ W + G T++S D+ ++ D+ G + + Sbjct: 13 GKLPEGWVTTHLSEICSKPQYGYTTKSSSMGDVKFLRTTDITKGAVDWSSVPYCMDAPED 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSID 132 + I+ + G ++ + S L+ +L S D Sbjct: 73 VSKYQLQDRDIVISRAGSVGFSFLVQNPPSQVVFASYLIRFKPVNYFSEYYLKRFLESSD 132 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ + G + + + + + + +PIPP+AEQ +I EK+ ++D+ + Sbjct: 133 YWNQLSLMSAGNAVQNVNAQKLSTLTVPIPPIAEQKIIAEKLDTLLAQVDSTKARLEQIP 192 Query: 193 ELLKEKKQALVSYIVTKGL----NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 ++LK +QA+++ V+ L + S +W +P W V + LV K Sbjct: 193 QILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQW-PDLPSTWSVHKYSELVDSRLGKM 251 Query: 249 TKLIESNILSLSYGNIIQ------KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 ++ + Y I LE L + + G+++ Sbjct: 252 LDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLGDVLICEGGEPGRC 311 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQSLKFED 361 Q + + + A I +L + +++ + + + L + Sbjct: 312 AIWSEPQDIPVIFQKALHRARVKDKIIPEWLVYNLKNDSNNISLSQLFTGTTIKHLTGKA 371 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + P+ VPP++EQ +I + A D + +++ ++ + S +A A G+ Sbjct: 372 LANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVNNALNRVNSLTQSILAKAFRGE 428 >gi|315446766|ref|YP_004079645.1| restriction endonuclease S subunit [Mycobacterium sp. Spyr1] gi|315265069|gb|ADU01811.1| restriction endonuclease S subunit [Mycobacterium sp. Spyr1] Length = 411 Score = 146 bits (367), Expect = 8e-33, Method: Composition-based stats. Identities = 89/413 (21%), Positives = 176/413 (42%), Gaps = 25/413 (6%) Query: 25 WKVVPIKRFTKLNTGR---TSESGKDII--YIGLEDVE-SGTGKYLPKDGNSRQSDTSTV 78 W+ +K + G+ + ++G D+ Y+ +V+ G + PK + S+ + Sbjct: 9 WRRGQVKNVADVKLGKMLQSDDTGDDVQADYMRAANVQPDGALRLQPKQMWFKPSELEGL 68 Query: 79 SIFAKGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ ++ +A D G ++ + L +L+++ + Sbjct: 69 SLKRGDVVVVEGGVGGFGRAAYLPNDLDGWGFQNSINRIRPTAATDGRFLAYYLIALRAS 128 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 IE C +M H + + +P+P+P ++Q I + + ET RIDTLI E+ F+ L Sbjct: 129 GFIERYCNIVSMPHLTAEKLAALPVPVPDRSDQCAIADFLDRETARIDTLIAEQQLFVGL 188 Query: 195 LKEKKQALVSYIVTK-GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L+E++QA++ V+ P + + G + + + Sbjct: 189 LRERRQAVIDSTVSVVKAEPVQLRRVIELVTSG------------SRGWGDYYSDAGVRF 236 Query: 254 SNILSLSYGNIIQKLETRNMGLKPE-SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I +L ++ + E + + L P+ + + G+++F + A Sbjct: 237 LRIGNLPRTDLAIRGEVQLVDLPPDVTEGERTRLVVGDVLFSITAYLGSVAVVDDAWEGG 296 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPP 371 A + P +S ++ W+M + D G +Q L +D++ L V +P Sbjct: 297 YVSQHVALCRLDPLRANSRFVGWVMLTTDGQDQLRQGAAGGTKQQLGLDDIRELRVPLPL 356 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 + EQ I ++ +T++ID L+ + + I L +ERR++ I AAVTGQID+R E Sbjct: 357 LDEQHRIVAFLDEQTSKIDTLIAETKVFIELSRERRTALITAAVTGQIDVRNE 409 >gi|300721108|ref|YP_003710376.1| type I restriction-modification enzyme subunit S [Xenorhabdus nematophila ATCC 19061] gi|297627593|emb|CBJ88112.1| Type I restriction-modification enzyme subunit S [Xenorhabdus nematophila ATCC 19061] Length = 452 Score = 146 bits (367), Expect = 8e-33, Method: Composition-based stats. Identities = 73/431 (16%), Positives = 142/431 (32%), Gaps = 27/431 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLN-TGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 YK + V G IP+ W V + + +S D+I+IG++D+ + L + Sbjct: 29 YKKTEV---GVIPEDWDAVFFGDLFEDKLPRKALKSNDDVIFIGMQDLSE-NAQLLSQHK 84 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD------FDGICSTQFLV-LQPKDVLP 121 S ++ F KG +L K+ P GI ST+F V K Sbjct: 85 VKYGSLKGGLTYFEKGDVLVAKITPCFENGKGCHTKNLLTEIGIGSTEFHVLRATKHTNA 144 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKG--IGNIPMPIPPLAEQVLIREKIIAETV 179 + + W + +E+ G+ + EQ+ I + V Sbjct: 145 DFIYFWTTKKYFRKTLESEMVGSAGHKRVPLQAIQNFLLPCPRNNIEQIAIANTLSDIDV 204 Query: 180 RIDTLITERIRFIELLKEKKQALVS---YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 I L T + + Q L++ + L + K +G +P+ WE+ Sbjct: 205 LISELETLLAKKQAIKTATMQQLLTGRTRLPQFALCENGSKKGYKQSELGEIPEDWEIIC 264 Query: 237 FFALVTELN----RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + + + + +SL + L T +++ G+++ Sbjct: 265 IKDVGFVDPENLGSTTSLDYKFDYISLEQIDAGVLLGTVKCTFNTAPLRARRVLQQGDVL 324 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSA-YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + V + T + + YL S + + S Sbjct: 325 ISTVRPNLMSHYFVREDVRDLVCSTGFSVVRCLKDKLRPGYLYQHFFSAVINNQIDMLIS 384 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G ++ DVK L + + + EQ I +++ I L EQ + ++ + Sbjct: 385 GSNYPAINSSDVKNLKIQLGSVNEQTAIATILSDMDTEIQAL----EQKLDKTRQIKQGM 440 Query: 411 IAAAVTGQIDL 421 + +TG+ L Sbjct: 441 MQELLTGKTRL 451 >gi|66769483|ref|YP_244245.1| putative restriction endonuclease S subunits [Xanthomonas campestris pv. campestris str. 8004] gi|188992673|ref|YP_001904683.1| Type I site-specific deoxyribonuclease (specificity subunit) [Xanthomonas campestris pv. campestris str. B100] gi|66574815|gb|AAY50225.1| putative restriction endonuclease S subunits [Xanthomonas campestris pv. campestris str. 8004] gi|167734433|emb|CAP52643.1| Type I site-specific deoxyribonuclease (specificity subunit) [Xanthomonas campestris pv. campestris] Length = 438 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 82/425 (19%), Positives = 156/425 (36%), Gaps = 28/425 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P+ W ++ N ++ ++ ++ ++ V G L + + Sbjct: 9 LPQGWTRRRLRFDCLSNPVKSKLDIPDDTEVSFVPMDAVGELGGLRLDQ-TRELADVYNG 67 Query: 78 VSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLS- 130 + FA G + K+ P + + +T+ VL+P L +L Sbjct: 68 YTYFADGDVCIAKITPCFENGKGAIAEGLVNGVAFGTTELHVLRPSATLDTRFLFYLTIA 127 Query: 131 IDVTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 D EA G + + + + +P + Q I + +T RID LI ++ Sbjct: 128 HDFRSHGEAEMLGASGQKRVPEEFLKDWTPSLPRMDVQQRIARFLDDKTARIDALIEKKQ 187 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 +E L+EK+QAL++ VTKGLNPD+ MK SG++W+G VP HWEVK V + + + Sbjct: 188 ELLERLEEKRQALITRAVTKGLNPDLPMKPSGVDWLGYVPRHWEVKTLRRHVQRIEQGWS 247 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ---------IVDPGEIVFRFIDLQN 300 E + +++ + V +++ Sbjct: 248 PQTERRMAEPDEWGVLKSGCVNLGIYDENEQKALPGTLDPKPELEVRANDVLMCRASGSM 307 Query: 301 DKRSLR--SAQVMERGIITSAYMAVKPHGIDSTYLAW--LMRSYDLCKVFYAMGSGL--- 353 + + + + + ++ + +M + L + SG Sbjct: 308 QYIGSVALVERTRTKLMFSDKTYRISLSSANTDREYFVRMMSAKHLREQIRLSVSGAEGL 367 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ +V PP+ EQ I + + +D KI S + R + + A Sbjct: 368 ANNIPQSNVLEYLHAFPPLLEQVQIADFLRESIGDLDEAEGKIRASSESWRAYRLALVTA 427 Query: 414 AVTGQ 418 AVTGQ Sbjct: 428 AVTGQ 432 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 66/219 (30%), Gaps = 17/219 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSESG------KDIIYIGLEDVESGTGK 62 K SGV W+G +P+HW+V ++R ++ G + ++ + + V G Sbjct: 215 MKPSGVDWLGYVPRHWEVKTLRRHVQRIEQGWSPQTERRMAEPDEWGVLKSGCVNLGIYD 274 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGK-------LGPYLRKAIIADFDGICSTQFLVLQ 115 + D +L + +G + + Sbjct: 275 ENEQKALPGTLDPKPELEVRANDVLMCRASGSMQYIGSVALVERTRTKLMFSDKTYRISL 334 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATM---SHADWKGIGNIPMPIPPLAEQVLIRE 172 ++S + + ++ + PPL EQV I + Sbjct: 335 SSANTDREYFVRMMSAKHLREQIRLSVSGAEGLANNIPQSNVLEYLHAFPPLLEQVQIAD 394 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 + +D + E + + ALV+ VT L Sbjct: 395 FLRESIGDLDEAEGKIRASSESWRAYRLALVTAAVTGQL 433 >gi|23466327|ref|NP_696930.1| HsdS specificity protein of type I restriction-modification system [Bifidobacterium longum NCC2705] gi|23327082|gb|AAN25566.1| HsdS specificity protein of type I restriction-modification system [Bifidobacterium longum NCC2705] Length = 406 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 62/397 (15%), Positives = 134/397 (33%), Gaps = 20/397 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + +G T +G +I +I +++ + S+ Sbjct: 19 WEQRKLGELALTYSGGTPSAGNSAYYGGEIPFIRSAEID---CDSTELSLTVAGLNNSSA 75 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + KG +LY G + I+ G + L + D+ + L E Sbjct: 76 KLVDKGMVLYAMYGATSGEVAISKIKGAINQAILAMDASDMAANRFIAYWLRRQKKSITE 135 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +G + I + +P P L EQ I +D LIT R + L Sbjct: 136 TFLQG-GQGNLSGAIIKELGIPQPSLDEQRQIGSF----FSNLDDLITLHQRKYDKLVIF 190 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K++++ + K +++ +G + F T + ++ IL Sbjct: 191 KKSMLEKMFPKDGESVPEIRFAGFTDPWEQRKLENLASFGGGHTPSMADASNYVDGKILW 250 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 ++ ++ Q + E + P + + + ++ A++ + + Sbjct: 251 VTSQDVKQHYIENTTTMISEKGAATLTLYPSDSIVIVARSGILRHTIPVAKLRKPATVNQ 310 Query: 319 AY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + S L + + S Y +S+ F +K ++VP I+EQ Sbjct: 311 DIKVIQTVDSCDSSWLLQYFIASNKTLLREYGKTGTTVESIDFAKMKSTALMVPYIEEQQ 370 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + +R+D L+ ++ + LL+ + S + Sbjct: 371 AIGSF----FSRLDNLITLHQRKLELLQNIKKSLLDK 403 >gi|303242499|ref|ZP_07328979.1| restriction modification system DNA specificity domain protein [Acetivibrio cellulolyticus CD2] gi|302589967|gb|EFL59735.1| restriction modification system DNA specificity domain protein [Acetivibrio cellulolyticus CD2] Length = 415 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 76/415 (18%), Positives = 155/415 (37%), Gaps = 31/415 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDGNSRQS 73 IG IPK W V IK ++ TG T + Y + + G+ KY+ K Sbjct: 19 IGRIPKEWNVAQIKNVGEIITGNTPSTKHPEYYGDTYMFVAPGDIGSSKYVRKTEKYLSG 78 Query: 74 D-TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + I+ +G + K IA + Q + P ++ + L+ Sbjct: 79 KGFEISRKVPQNSIMMICIGSTIGKIAIASEMLTTNQQINSIIPNEIYNNEYVYYALNYY 138 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + E+ E + I +P EQ I + + D I + + I Sbjct: 139 FNKIKESKIEKQAVPIISKSKFSEICIPHIEKQEQRKIADIL----SAWDKAIELKEKLI 194 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E KE+K+ L++ ++T L K SG + W +K + + RKN Sbjct: 195 EQKKEQKRGLMNKLLTGKL------KLSG------FNNEWTLKRLKEICIRIIRKNNGQD 242 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL-QNDKRSLRSAQVM 311 + S + + E + + ++ E Y ++ GE + + + + Sbjct: 243 VPVLTISSLSGFLDQSERFSKVIAGKNVEKYTLLKHGEFSYNKGNSKTYPYGCIFRLEDY 302 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-----LKFEDVKRLP 366 E ++ + Y++ +G+DS + + + + A+ + ++ L ++ + Sbjct: 303 EEALVPNVYISFSMNGVDSNFYKYYFEAGLMNDQLAAIINTGVRNDGLLNLNADEFFDIT 362 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + VP EQ I +++V T I+ +Q + LK ++ + +TG + + Sbjct: 363 LPVPSEYEQKQIGEILDVATKEIN----LHQQELEALKLQKKGLMQLLLTGIVRV 413 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 65/207 (31%), Gaps = 14/207 (6%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG-------NIIQKLETRNMGLK 276 +G +P W V + + +Y + + L Sbjct: 18 EIGRIPKEWNVAQIKNVGEIITGNTPSTKHPEYYGDTYMFVAPGDIGSSKYVRKTEKYLS 77 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + +E + V I+ I K ++ S + I S ++ Y+ + Sbjct: 78 GKGFEISRKVPQNSIMMICIGSTIGKIAIASEMLTTNQQINSII---PNEIYNNEYVYYA 134 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + Y + + + + +EQ I ++++ D +E Sbjct: 135 LNYYFNKIKESKIEKQAVPIISKSKFSEICIPHIEKQEQRKIADILSAW----DKAIELK 190 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDLRG 423 E+ I KE++ + +TG++ L G Sbjct: 191 EKLIEQKKEQKRGLMNKLLTGKLKLSG 217 >gi|73670718|ref|YP_306733.1| type I restriction-modification system specificity subunit [Methanosarcina barkeri str. Fusaro] gi|72397880|gb|AAZ72153.1| type I restriction-modification system specificity subunit [Methanosarcina barkeri str. Fusaro] Length = 492 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 67/460 (14%), Positives = 156/460 (33%), Gaps = 57/460 (12%) Query: 21 IPKHWKVVPIKRFTK-LNTGRTSESGKDII---YIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W+ + + G T S + I ++ + D+++ + + Sbjct: 18 LPNDWQWTRLGEIADNIQYGYTESSSDEPIGPKFLRITDIQNNEVNWKSVPYCEIDNTKK 77 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLSID 132 + G +++ + G + K+ + D S V +++ + + S+ Sbjct: 78 QNYLLKDGDLVFARTGATVGKSYLLKGDFPESVFASYLIRVRLLEEISESFVYNFFQSLT 137 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++I G + + + + +P+ PL EQ I KI +D I+ Sbjct: 138 YWKQITEGQVGIGQPNVNGTKLSLLIVPVAPLLEQRAIVSKIEQLFSELDNGISNLKLAQ 197 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW---------------------------- 224 E LK +QA++ L + ++ +E Sbjct: 198 EQLKVYRQAVLKKAFEGKLTKKWREENPDVEDSKYVLNKIKNQISTQKKTKEIQDIQYGE 257 Query: 225 -VGLVPDHWEVKPFFAL---VTELNRKNTKLIESNILSLSYGNIIQKLETRNMG-----L 275 +P W + +T+ + + +S + + NI + Sbjct: 258 VPYELPFKWNWVSLSDVSISITDGDHQAPPKADSGVPFIVISNISSGKLDMSETMYVPEK 317 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLA 334 E+ + P +I++ + R ++PH I S YL Sbjct: 318 YYENLAAKRKPQPRDILYSVTGSYGIPILISEN---YRFCFQRHIALIRPHMEISSKYLY 374 Query: 335 WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++++S + K + +G + ++ ++ + V +PPI EQ I I + + + Sbjct: 375 YILKSPFVYKQATKVATGTAQLTVPLSGLRTIKVPIPPIAEQQAIVQEIETRLSVCEKIE 434 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQI-------DLRGESQ 426 + I+ ++ + R S + A G++ ++RG Sbjct: 435 QDIKDNLERAEALRQSILKKAFEGKLLNEKELAEVRGAED 474 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 70/215 (32%), Gaps = 10/215 (4%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 K+K E + P+ + L + ES+ ++ + +N Sbjct: 1 MKKIKPIIEEEIAEYPNLPNDWQWTRLGEIADNIQYGYTESSSDEPIGPKFLRITDIQNN 60 Query: 274 GL---------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + + ++ G++VF K L E + Sbjct: 61 EVNWKSVPYCEIDNTKKQNYLLKDGDLVFARTGATVGKSYLLKGDFPESVFASYLIRVRL 120 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I +++ +S K G+ + ++ + L V V P+ EQ I + I Sbjct: 121 LEEISESFVYNFFQSLTYWKQITEGQVGIGQPNVNGTKLSLLIVPVAPLLEQRAIVSKIE 180 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +D + ++ + LK R + + A G+ Sbjct: 181 QLFSELDNGISNLKLAQEQLKVYRQAVLKKAFEGK 215 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 70/205 (34%), Gaps = 12/205 (5%) Query: 19 GAIPKH----WKVVPIKRF-TKLNTGRT---SESGKDIIYIGLEDVESGTGKYLP--KDG 68 G +P W V + + G ++ + +I + ++ SG Sbjct: 256 GEVPYELPFKWNWVSLSDVSISITDGDHQAPPKADSGVPFIVISNISSGKLDMSETMYVP 315 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST-QFLVLQPKDVLPELLQGW 127 + + ILY G Y +I++ C +++P + + Sbjct: 316 EKYYENLAAKRKPQPRDILYSVTGSYGIPILISENYRFCFQRHIALIRPHMEISSKYLYY 375 Query: 128 LLSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L ++ + G G+ I +PIPP+AEQ I ++I + + Sbjct: 376 ILKSPFVYKQATKVATGTAQLTVPLSGLRTIKVPIPPIAEQQAIVQEIETRLSVCEKIEQ 435 Query: 187 ERIRFIELLKEKKQALVSYIVTKGL 211 + +E + +Q+++ L Sbjct: 436 DIKDNLERAEALRQSILKKAFEGKL 460 >gi|312128027|ref|YP_003992901.1| restriction modification system DNA specificity domain-containing protein [Caldicellulosiruptor hydrothermalis 108] gi|311778046|gb|ADQ07532.1| restriction modification system DNA specificity domain protein [Caldicellulosiruptor hydrothermalis 108] Length = 433 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 77/441 (17%), Positives = 167/441 (37%), Gaps = 47/441 (10%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG 61 K+YK +KDS +G IP+ W+VV + K+ TG ++ + TG Sbjct: 7 KNYK----FKDSP---LGRIPEEWEVVRLGDIAKIKTGNSNVQD-----------AAETG 48 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 YL D + S +F K ++ G + + + VL Sbjct: 49 DYLFFDRSGE-IKRSNRYLFDKEAVIVPGEGTEFLPKYYCGKFDLHQRAYAIFDFSSVLS 107 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + + G T+ N+ + +PPL EQ I E + I Sbjct: 108 GEYLFYAMH-KFNRILANWAVGTTVKSLRLPMFENLLLLLPPLPEQRKIAEILETIDNAI 166 Query: 182 DTLITERIRFIELLKEKKQALVSYIVT---KGLNPDVKMKDSGIEW-----VGLVPDHWE 233 + ++ + + Q L++ V +G + +++D I+ +G +P+ W+ Sbjct: 167 EKTDAIIEKYKRIKQGLMQDLLTKGVVSEGEGESERWRLRDENIDKFKDSPLGRIPEEWK 226 Query: 234 VKPFFA-----LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES-------YE 281 + ++T+ + + + +E++ + I + K S Sbjct: 227 ICKLDHREITIMITDGSHYSPQPVENSEYYIVNIENIINGKIEFETCKKISPKDYKKLVS 286 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 G+++F +L + +++S + + +DS YL + + + Sbjct: 287 NKCNPKYGDVLFTKDGTVG--ITLVFSGERNVVLLSSIAIIRPSNCLDSYYLKYSLETEQ 344 Query: 342 LCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + K + G + + +D+K L + +PPI EQ I +++ ++ID +EK Sbjct: 345 IKKQIDILIGGSVLKRIVLKDIKSLVIFIPPIPEQQRIASIL----SQIDEAIEKERAYK 400 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 L+ + + +TG++ + Sbjct: 401 EKLERIKKGLMEDLLTGKVRV 421 >gi|23452795|gb|AAN33170.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 420 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 58/424 (13%), Positives = 131/424 (30%), Gaps = 34/424 (8%) Query: 21 IPKHWKVVPIKRFTK-----LNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDG 68 +P+ WK+ + + G + K I + + + Sbjct: 4 LPQGWKMETLGEILSSDKYSIKRGPFGSTLKKSFFVEKGIRIFEQYNPINNDPHWKRYFI 63 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQF--LVLQPKDVLPELL 124 + + +G +L G + + GI + + L +L Sbjct: 64 SHEKFQELEAFKATEGDLLISCSGTLGKIVELPKDTEMGIINQSLLKIRLNNIKILNSYF 123 Query: 125 QGWLLSIDVTQRIEAICEGATMSHA-DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + S + ++I G+ + + K + I +P+PPL +Q I + V+ID Sbjct: 124 IYYFNSPIMQEKILESTLGSAIKNIASVKILKQIEIPLPPLKKQERIVGILDESFVKIDE 183 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 I + + L E Q+ + + + +P WE K + Sbjct: 184 SIKILEQNLLNLDELMQSALQKAFNPLKD--------NAKENYKLPQGWEWKSLGEIGNT 235 Query: 244 LNR------KNTKLIESNILSLSYG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 + K +I L G + N+ + + +I G ++ Sbjct: 236 SSGGTPLRNKKEYWENGSIKWLKSGELNDGYIDFIEENITEEAIENSSAKIFQKGTLLIA 295 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + + + + + + + K+ G + Sbjct: 296 MYGATAGRLGILNLDSATNQAVCAFLHKDNKNIKFLEKFLFYFLFFIRDKIIKDSFGGAQ 355 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 ++ +K L + +PP+KEQ I ++ + L E + + +E + S + A Sbjct: 356 PNISQTYIKNLQIPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLNKA 415 Query: 415 VTGQ 418 G+ Sbjct: 416 FKGE 419 Score = 90.2 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 72/202 (35%), Gaps = 10/202 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQ 72 +P+ W+ + ++G T K I ++ ++ G ++ ++ Sbjct: 219 KLPQGWEWKSLGEIGNTSSGGTPLRNKKEYWENGSIKWLKSGELNDGYIDFIEENITEEA 278 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---QGWLL 129 + S+ IF KG +L G + I + D + KD + Sbjct: 279 IENSSAKIFQKGTLLIAMYGATAGRLGILNLDSATNQAVCAFLHKDNKNIKFLEKFLFYF 338 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + +I G + I N+ +P+PPL EQ I + + + L Sbjct: 339 LFFIRDKIIKDSFGGAQPNISQTYIKNLQIPLPPLKEQEQIAKHLDFVFEKTKALKELYT 398 Query: 190 RFIELLKEKKQALVSYIVTKGL 211 + ++ +E KQ+L++ L Sbjct: 399 KELKDYEELKQSLLNKAFKGEL 420 >gi|253576958|ref|ZP_04854282.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] gi|251843689|gb|EES71713.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] Length = 403 Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats. Identities = 53/411 (12%), Positives = 121/411 (29%), Gaps = 30/411 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W V P+ L G T ++ + +++ G + D Sbjct: 3 VPNGWAVKPLLECCDLLQGLTYSPSNIQSYGLLVLRSSNIQDGKLVLDDCVYVNCSIDEI 62 Query: 77 TVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 IL + +I F + + + D Sbjct: 63 KY--VKPNDILICVRNGSSALIGKSCVIDRPYNATFGAF--MSVLRGDTTGYLAHMFASD 118 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRF 191 V Q+ AT++ + +I +PIP EQ I + I L + Sbjct: 119 VVQQQIRNRSSATINQITKRDFEDIKIPIPFDEEEQRAIAAALSDADAYITALEKLITKK 178 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +A+ + + L ++ EW+ + Sbjct: 179 --------RAVKQGAMQELLTGKRRLPGFKGEWIEKKIHEIGDTSSGGTPSRSVPTYFNG 230 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + + + + + + ++ G ++ K + Sbjct: 231 NIPWVTTSELNDNYIRSTAEKITSEALNNSSAKLFPKGTVLMAMYGATIGKLGILDVD-- 288 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 A A+ + + + + Y ++ + ++ ++ L +PP Sbjct: 289 --ATTNQACCALFFNKDIDSVFMYFLLLYHRTEIIELGSGAGQPNISQMIIRNLTFTIPP 346 Query: 372 -IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + EQ I V++ A ID L K+E++ + + ++ +TG+I L Sbjct: 347 TLAEQTAIAAVLSDMDAEIDALTAKLEKA----RRIKQGMMSELLTGRIRL 393 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 63/200 (31%), Gaps = 10/200 (5%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSL----SYGNIIQKLETRNMGLKPESYETYQ 284 P+ W VKP L L S KL + S + + Sbjct: 4 PNGWAVKPLLECCDLLQGLTYSPSNIQSYGLLVLRSSNIQDGKLVLDDCVYVNCSIDEIK 63 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 V P +I+ + + A+M+V YLA + S + + Sbjct: 64 YVKPNDILICVRNGSSALIGKSCVIDRPYNATFGAFMSVLRGDTTG-YLAHMFASDVVQQ 122 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 S + D + + + +P +EQ I ++ A I L + I + Sbjct: 123 QIRNRSSATINQITKRDFEDIKIPIPFDEEEQRAIAAALSDADAYITALEKLITKK---- 178 Query: 404 KERRSSFIAAAVTGQIDLRG 423 + + + +TG+ L G Sbjct: 179 RAVKQGAMQELLTGKRRLPG 198 >gi|300173282|ref|YP_003772448.1| type I R/M system specificity subunit [Leuconostoc gasicomitatum LMG 18811] gi|299887661|emb|CBL91629.1| type I R/M system specificity subunit [Leuconostoc gasicomitatum LMG 18811] Length = 417 Score = 145 bits (365), Expect = 2e-32, Method: Composition-based stats. Identities = 62/407 (15%), Positives = 140/407 (34%), Gaps = 26/407 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV----ESGTGKYLPKDGNSRQS---DTS 76 W+ + + + G T + + G D E G Y+ K + S Sbjct: 16 DWEERKLGELSNIVGGGTPSTSNPEYWDGDIDWYAPAEIGEQSYVSKSKKTITELGLKKS 75 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + I G +L+ AI+A + F + P + + + ++ + Sbjct: 76 SARILPVGTVLFTSRAGIGNTAILAKE-ATTNQGFQSIVPDQNKLDSYFIFSRTNELKRY 134 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 E G+T K + + + +P L+EQ I ++D I R ++LLK Sbjct: 135 GEVTGAGSTFVEVSGKQMSKMSIMVPELSEQQKIGSF----FKQLDDTIALHQRKLDLLK 190 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGI--EWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 E+K+ + + K +++ SG +W + + Sbjct: 191 EQKKGFLQKMFPKNGAKVPELRFSGFADDWEERKLEDAAEIIDGDRGKNYPSGDDFKNSG 250 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSL---RS 307 + L LS N+ ++ ++ + V+ +I+ + Sbjct: 251 HTLFLSATNVTKQGFVFKENQYITKLKSELLGNGKVNLNDIILTSRGSIGHIGLYDERIN 310 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLP 366 + I + + +++A +++ K + + L +D+K+ Sbjct: 311 ENIPHARINSGMLILRTDKFNSPSFIAQFLKAPLGIKQIKLISFGSAQPQLTKKDIKKFK 370 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +P I+EQ I + ++D + ++ + LLKE++ F+ Sbjct: 371 ITLPKIEEQIKIGAFL----KQLDHTIALHQRKLNLLKEQKKGFLQK 413 >gi|150388684|ref|YP_001318733.1| restriction modification system DNA specificity subunit [Alkaliphilus metalliredigens QYMF] gi|149948546|gb|ABR47074.1| restriction modification system DNA specificity domain [Alkaliphilus metalliredigens QYMF] Length = 467 Score = 145 bits (365), Expect = 2e-32, Method: Composition-based stats. Identities = 66/422 (15%), Positives = 149/422 (35%), Gaps = 33/422 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLP---KDGNS 70 +P++W + T + G T S I +I D+ T Y+ K+ Sbjct: 28 VPENWVWTRLGNVTTIIGGGTPPSRVIEYYENGSIPWISPVDLSGYTDIYISHGKKNITE 87 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 S+ + + +L P IAD + + F P + Sbjct: 88 LGLKKSSARLLPENTVLLSSRAPI-GYVAIADNELCTNQGFKSFLPSPCYL-PKYLYFYL 145 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +EA G T + + P+PPLAEQ I ++I + +++ Sbjct: 146 KSSKKLLEAYASGTTFLELSGRKAAIVEFPLPPLAEQQRIVDRIESLFEKLNQAKALIQD 205 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR---K 247 ++ + +K A++ + L + E G+ W+ K +V Sbjct: 206 ALDSFENRKAAILHKAFSGELTEKWR------EENGVGMGSWKKKSIKEVVKFRAGYAFD 259 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMG-------LKPESYETYQIVDPGEIVFRFIDLQN 300 + + + GN+ + L S ++ G+I+ + Sbjct: 260 SKNFSSTGHQVIRMGNLYNGVLDLTRNPVYISPDLIDNSIIKRFSINEGDILLTLTGTKY 319 Query: 301 DK--RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQS 356 + + E ++ +++ P I++ YL + ++S VF++ +G + + Sbjct: 320 KRDYGYAVLIKESENLLLNQRILSLTPESIETNYLLYYLQSDFFRDVFFSNETGGVNQGN 379 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + V+++ + + EQ +I +++ + D ++ I + + S +A A Sbjct: 380 VSSKFVEKIEIPIFSSLEQKEIVRILDYIFEK-DKNANQLCDLIDNIDLMKKSILARAFR 438 Query: 417 GQ 418 G+ Sbjct: 439 GE 440 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 65/205 (31%), Gaps = 11/205 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E +VP++W + T + I ++ S+ Sbjct: 23 EKSNVVPENWVWTRLGNVTTIIGGGTPPSRVIEYYENGSIPWISPVDLSGYTDIYISHGK 82 Query: 283 YQIVDPGE-------IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLA 334 I + G + + L + A + + P YL Sbjct: 83 KNITELGLKKSSARLLPENTVLLSSRAPIGYVAIADNELCTNQGFKSFLPSPCYLPKYLY 142 Query: 335 WLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + ++S K+ A SG L + +PP+ EQ I + I +++ Sbjct: 143 FYLKSS--KKLLEAYASGTTFLELSGRKAAIVEFPLPPLAEQQRIVDRIESLFEKLNQAK 200 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ 418 I+ ++ + R+++ + A +G+ Sbjct: 201 ALIQDALDSFENRKAAILHKAFSGE 225 >gi|35381319|gb|AAQ84547.1| type I restriction-modification enzyme subunit S [Klebsiella pneumoniae] Length = 448 Score = 144 bits (364), Expect = 2e-32, Method: Composition-based stats. Identities = 73/438 (16%), Positives = 150/438 (34%), Gaps = 28/438 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGK 62 YK + G IP+ W V I ++ G + D I + +EDV Sbjct: 17 YKLTEA---GVIPEDWDVRKIGDIAEVIRGASPRPKGDKRFYGGNIPRLMVEDVTRDGKY 73 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 P + ++ KG + G +I+A I + + K + Sbjct: 74 VTPSVDSLTEAGAKLSRPCDKGTLTLVCSGTVGIPSILAVNACIHDGFLGLTKVKKSVSI 133 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRI 181 + + + G ++ G+ + +P EQ+ I + I Sbjct: 134 DYLYHFFTTQQEKFNNSATHGGVFTNLTTDGVKEFLLALPRNKNEQIAIANFLSDTDTFI 193 Query: 182 DTLITERIRFIELLKEKKQALV---SYIVTKGLNPDVKMKDSGIEWVGLVPDHWE---VK 235 L I+ + Q L+ + + PD +K +G +P+ W+ V Sbjct: 194 TELEQLIIKKQSIKTATMQQLLTGRTRLPQFAKYPDGTIKSYKASELGSIPEDWKVLSVG 253 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE-----TYQIVDPGE 290 L+T ++K S I L N+ + + + G+ E + G+ Sbjct: 254 QVCDLLTGFPFSSSKYSNSGIRLLRGSNVKRGITDWSDGITQYWPEISADIKQYELCAGD 313 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 IV + + + ++ V+ + + +L + S + A+ Sbjct: 314 IVISMDGSLVGRSFAQLSDSDLPAVLLQRVARVRTNFVVQGFLKEWICSQFFTEHCDAVK 373 Query: 351 S-GLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + +D++ L+PP EQ I N+++ A + +EQ + +++ + Sbjct: 374 TVTAIPHISPQDIRSFKFLMPPTNDEQKTIANILSDMNAEL----TALEQKLAKVRDIKQ 429 Query: 409 SFIAAAVTGQIDLRGESQ 426 + +TG+I L E Q Sbjct: 430 GMMQQLLTGRIRLPLEQQ 447 >gi|16767768|ref|NP_463383.1| type I restriction enzyme specificity protein [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|167991322|ref|ZP_02572421.1| type I restriction enzyme StySJI specificity protein [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|168243978|ref|ZP_02668910.1| type I restriction enzyme StySJI specificity protein [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194449649|ref|YP_002048547.1| type I restriction enzyme StySJI specificity protein [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|197262037|ref|ZP_03162111.1| type I restriction enzyme StySJI specificity protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|135211|sp|P06187|T1S_SALTY RecName: Full=Type-1 restriction enzyme StySJI specificity protein; Short=S.StySJI; AltName: Full=Type I restriction enzyme StySJI specificity protein; Short=S protein gi|47739|emb|CAA68580.1| S polypeptide [Salmonella enterica subsp. enterica serovar Typhimurium] gi|16423091|gb|AAL23342.1| specificity determinant for hsdM and hsdR [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|194407953|gb|ACF68172.1| type I restriction enzyme StySJI specificity protein [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|197240292|gb|EDY22912.1| type I restriction enzyme StySJI specificity protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205330268|gb|EDZ17032.1| type I restriction enzyme StySJI specificity protein [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|205337035|gb|EDZ23799.1| type I restriction enzyme StySJI specificity protein [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|261249609|emb|CBG27479.1| type I restriction enzyme [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267996882|gb|ACY91767.1| type I restriction enzyme specificity protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301161007|emb|CBW20544.1| type I restriction enzyme [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|312915621|dbj|BAJ39595.1| type I restriction enzyme StySJI specificity protein [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] gi|323132866|gb|ADX20296.1| type I restriction enzyme specificity protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] gi|332991333|gb|AEF10316.1| type I restriction enzyme specificity protein [Salmonella enterica subsp. enterica serovar Typhimurium str. UK-1] Length = 469 Score = 144 bits (364), Expect = 2e-32, Method: Composition-based stats. Identities = 72/423 (17%), Positives = 154/423 (36%), Gaps = 23/423 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G +P+ W I LN + D+ ++ + V + + Sbjct: 4 GKLPEGWATSTINEMCNLNPKLKLDDDLDVGFMPMAGVPTTYLGKCNFETKKWSEVKKGF 63 Query: 79 SIFAKGQILYGKLGPYLRKAI------IADFDGICSTQFLVLQPKDVLPELLQGW---LL 129 + F +++ K+ P + G ST++ VL+ + L + Sbjct: 64 TQFQNDDVIFAKITPCFENGKAVVIKEFPNGYGAGSTEYYVLRSINGLINPHWLFALVKT 123 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 +T + + + N +P+PPLAEQ +I EK+ ++D+ Sbjct: 124 KDFLTNGALNMSGSVGHKRVTKEFLENYGVPVPPLAEQKVIAEKLDTLLAQVDSTKARLE 183 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSG--IEWVGLVPDHWEVKPFFALVTELNRK 247 + ++LK +Q+++ V L ++ K+ E +P W++ K Sbjct: 184 QIPQILKRFRQSVIVAAVNGQLTKELHKKNKFKLTELNISIPSLWKISEIGQFADVKGGK 243 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPE----------SYETYQIVDPGEIVFRFID 297 ES I + I+ + +N + PE + V G++ + Sbjct: 244 RLPKGESLIAENTGFPYIRAGQLKNGTVLPEGQLYLEEYIQKSISRYTVSSGDLYITIVG 303 Query: 298 LQNDKRSLRSAQVMERGII-TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQ 355 + + +A + I + +L+ +RS L + + + SG + Sbjct: 304 ACIGDAGIIPDVYNNANLTENAAKICNLNENIFNRFLSLWLRSSYLQDIINSEIKSGAQG 363 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 L +K LP+++PP++EQ +I + A D + +++ ++ + S +A A Sbjct: 364 KLALARIKSLPLILPPLQEQHEIVRRVEQLFAYADTIEKQVNNALTRVNSLTQSILAKAF 423 Query: 416 TGQ 418 G+ Sbjct: 424 RGE 426 >gi|86145621|ref|ZP_01063951.1| type I restriction-modification system, S subunit [Vibrio sp. MED222] gi|85836592|gb|EAQ54718.1| type I restriction-modification system, S subunit [Vibrio sp. MED222] Length = 424 Score = 144 bits (364), Expect = 2e-32, Method: Composition-based stats. Identities = 70/419 (16%), Positives = 146/419 (34%), Gaps = 29/419 (6%) Query: 23 KHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTST 77 + W V + + G + I + ++V SG K+ D ++D S Sbjct: 13 EDWNVSNLSECSLFIKDGTHGTHKRTPTGIPLLSAKNVTASGKIKWDVNDSLVSEADYSK 72 Query: 78 ---VSIFAKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSI 131 K +L +G R+A++ S + V P + + S Sbjct: 73 IHSKYELEKDDLLLTVVGTLGRRALVDGSAKFTIQRSVGVIRPDKNKVTPNFIFHFCGSD 132 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++E + + +P+P PPL EQ I + + I+ + + Sbjct: 133 FFQNQLELRANATAQAGVYLGELAKVPVPSPPLPEQKKIAAILTSVDEVIEKTQAKIDKL 192 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA--LVTELNRKNT 249 +L Q L++ V P + KDS + G VP WEV V + Sbjct: 193 KDLKTGMMQELLTCGVGVDGKPHTEFKDSPV---GRVPKGWEVVELDRAAKVIDCKHATP 249 Query: 250 KLIESNILSLSYGNIIQKLE-----TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 K + + GNI + + ++ G+I++ Sbjct: 250 KYFSNGFPVVKPGNIREGFLELRGCSLTDKAGFDNLNENHTPTIGDIIYSRNQTYGVGAY 309 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVK 363 + + I + P +S +L +++ S + + + +G + + ++ Sbjct: 310 VNRSM---EFCIGQDVCVISPKKCNSIFLFYMINSPLVKEQVELLAAGSTFKRINLGSIR 366 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +L + +P I+EQ I ID V +E+ ++ K+ + + + +TG+ ++ Sbjct: 367 KLKIALPCIEEQQAIG----AVFESIDNKVSLLEKKLIKKKDTKKALMQDLLTGKKRVK 421 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 77/206 (37%), Gaps = 9/206 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTG 61 K + ++KDS V G +PK W+VV + R K+ + + + ++ G Sbjct: 213 KPHTEFKDSPV---GRVPKGWEVVELDRAAKVIDCKHATPKYFSNGFPVVKPGNIREGFL 269 Query: 62 KYLPKDGNSRQ--SDTSTVSIFAKGQILYGKLGPYL-RKAIIADFDGICSTQFLVLQPKD 118 + + + + G I+Y + Y + + V+ PK Sbjct: 270 ELRGCSLTDKAGFDNLNENHTPTIGDIIYSRNQTYGVGAYVNRSMEFCIGQDVCVISPKK 329 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L + S V +++E + G+T + I + + +P + EQ I + Sbjct: 330 CNSIFLFYMINSPLVKEQVELLAAGSTFKRINLGSIRKLKIALPCIEEQQAIGAVFESID 389 Query: 179 VRIDTLITERIRFIELLKEKKQALVS 204 ++ L + I+ + K Q L++ Sbjct: 390 NKVSLLEKKLIKKKDTKKALMQDLLT 415 >gi|253569703|ref|ZP_04847112.1| type I restriction-modification system [Bacteroides sp. 1_1_6] gi|251840084|gb|EES68166.1| type I restriction-modification system [Bacteroides sp. 1_1_6] Length = 478 Score = 144 bits (364), Expect = 2e-32, Method: Composition-based stats. Identities = 62/409 (15%), Positives = 133/409 (32%), Gaps = 32/409 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P +W + + +G T +I ++ D+ G +P+ Sbjct: 70 EVPDNWVWMTLGEVGTWQSGGTPSRSNKTYYGGNIPWLKTGDLNDGLISDIPESITEEAV 129 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+ I G +L G + K I F + + + L + + Sbjct: 130 ANSSAKINPAGSVLIAMYGATIGKLGILTFPATTNQACCACIEFNAI-TQLYLFYFLLSQ 188 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 A G + + I N +P+PPL+EQ I +I ID + + Sbjct: 189 RNGFIAKGGGGAQPNISKEIIVNTFIPLPPLSEQQRIVMEIEKWFALIDQVEQGKADLQN 248 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV---------------GLVPDHWEV---K 235 +K+ K ++ + L P + I+ + +P W Sbjct: 249 TIKQTKSKILDLAIHGKLVPQDPNDEPAIKLLKRINPDFTPCDNGHSRKLPQGWYSVTAN 308 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQK--LETRNMGLKPESYETYQI-VDPGEIV 292 +++ ++ + ++ I L GNI ++ + SY+ V G+I+ Sbjct: 309 DVCSIIGGVSYNKADIQDTGIRVLRGGNIQNGKVIDCFDDVFISLSYQNNDNQVQRGDII 368 Query: 293 FRFIDLQ---NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 K + + I + S Y+ + ++ + Sbjct: 369 VVASTGSQTLIGKTGFADRDIPKTQIGAFLRIVRPKQKTLSPYIRLIFQTDAYKDYIRNV 428 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 G ++K ++ + +PP++EQ I I + +D ++ +E Sbjct: 429 AKGSNINNVKNAHLQNFQICLPPLEEQQRIVQKIEELFSSLDDILTALE 477 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 62/200 (31%), Gaps = 12/200 (6%) Query: 227 LVPDHWEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPES-- 279 VPD+W + T + N NI L G++ L + E Sbjct: 70 EVPDNWVWMTLGEVGTWQSGGTPSRSNKTYYGGNIPWLKTGDLNDGLISDIPESITEEAV 129 Query: 280 -YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + +I G ++ K + + A A + + Sbjct: 130 ANSSAKINPAGSVLIAMYGATIGKLGILTF----PATTNQACCACIEFNAITQLYLFYFL 185 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 G G + ++ E + + +PP+ EQ I I A ID + + Sbjct: 186 LSQRNGFIAKGGGGAQPNISKEIIVNTFIPLPPLSEQQRIVMEIEKWFALIDQVEQGKAD 245 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 246 LQNTIKQTKSKILDLAIHGK 265 >gi|300087441|ref|YP_003757963.1| restriction modification system DNA specificity domain-containing protein [Dehalogenimonas lykanthroporepellens BL-DC-9] gi|299527174|gb|ADJ25642.1| restriction modification system DNA specificity domain protein [Dehalogenimonas lykanthroporepellens BL-DC-9] Length = 385 Score = 144 bits (363), Expect = 2e-32, Method: Composition-based stats. Identities = 57/404 (14%), Positives = 134/404 (33%), Gaps = 30/404 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+V + ++ G T +I + + D+ + S Sbjct: 3 NGWQVKALGDICQVIGGGTPSKSIAEYYVGNIPWATVRDMRTDLITETEHKITHVAVKNS 62 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I + G ++ + ++ I ++ + + + V Sbjct: 63 ATKIISNGNVVIATRVGLGKVCLLGQDTAINQDLRGIVPKDSNILFVRYLFWWLKSVVDT 122 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I A GAT+ I ++ +P+PPL EQ I + I T + + ++ + Sbjct: 123 IVAEGTGATVQGVKLPFIKSLQIPLPPLPEQQRIVTILDEAFEGIATAKAKAEKNLQNAR 182 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 ++ ++ + ++ V+ + S I + +KN L Sbjct: 183 ALFESHLNSVFSRRGEGWVERRLSDI--------------CVFINGRAYKKNEMLSAGKY 228 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 L GN + E + D G++++ + + + + Sbjct: 229 PLLRVGNFFTNNDWY---YTDLDLEPAKYCDTGDLLYAWSASFGPRI-----WEGGKVVY 280 Query: 317 TSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKE 374 V P+ + + S+D+ ++ G+G + +++ V VPP+++ Sbjct: 281 HYHIWKVIPNINLTNKRFLLYLLSWDVEQIKQLHGTGTTMMHVSKGSIEKRIVPVPPLEQ 340 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q I N ++ L ++ + L+E + S + A +G+ Sbjct: 341 QKYIVNNLDKLKTETQHLQSIYQKKLAALEELKKSLLHQAFSGE 384 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 62/189 (32%), Gaps = 1/189 (0%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W + GR + + + + G + D D Sbjct: 198 EGWVERRLSDICVFINGRAYKKNEMLSAGKYPLLRVGNF-FTNNDWYYTDLDLEPAKYCD 256 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G +LY + + + V+ ++ + +LLS DV Q + Sbjct: 257 TGDLLYAWSASFGPRIWEGGKVVYHYHIWKVIPNINLTNKRFLLYLLSWDVEQIKQLHGT 316 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G TM H I +P+PPL +Q I + L + + + L+E K++L Sbjct: 317 GTTMMHVSKGSIEKRIVPVPPLEQQKYIVNNLDKLKTETQHLQSIYQKKLAALEELKKSL 376 Query: 203 VSYIVTKGL 211 + + L Sbjct: 377 LHQAFSGEL 385 >gi|149177179|ref|ZP_01855785.1| type I restriction enzyme specificity protein [Planctomyces maris DSM 8797] gi|148843893|gb|EDL58250.1| type I restriction enzyme specificity protein [Planctomyces maris DSM 8797] Length = 398 Score = 144 bits (362), Expect = 3e-32, Method: Composition-based stats. Identities = 50/393 (12%), Positives = 114/393 (29%), Gaps = 12/393 (3%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + L GR + + + + G + + + +G Sbjct: 6 QKCRLGEICTLLNGRAYKKKELLDSGKYPVLRVGNF-FTNRSWYYSDLELDDNKYCEEGD 64 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +LY + + + V + + + + D + G T Sbjct: 65 LLYAWSASFGPRIWSGPKVIYHYHIWKVQLDESKVNKNFLCYWFGWDSEKIRSEQGTGTT 124 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H + + + +PPL+EQ I + I R + +E + ++ Sbjct: 125 MIHVTKGSMEDRELCLPPLSEQKRIVAILDEAFGAIARAKENAARNLANARELFDSYLNR 184 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + T+ + K S I + I + N Sbjct: 185 VFTEKGEGWEEKKLSEIAKTFGRGKSRHRPRNDKSLYGGEY-------PFIQTGEIRNAN 237 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + ++ G + + L + +I + P Sbjct: 238 HYITKFTQTYNEKGLAQSKLWPVGTLCITIAANIAETAILTFDACIPDSVIG---LVCDP 294 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + ++ +L++++ GS + ++ +R+ P + EQ I +N Sbjct: 295 EKANVDFVEYLLQNFKSGLQAEGKGS-AQDNINMGTFERMLFPFPSVSEQEKIVCELNAI 353 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + L +Q + L E + S + A TGQ Sbjct: 354 AESCNNLSPIYQQKLTALDELKQSLLQKAFTGQ 386 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 65/199 (32%), Gaps = 10/199 (5%) Query: 23 KHWKVVPIKRFTK-LNTGRT---SESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQS 73 + W+ + K G++ + K + +I ++ + + Sbjct: 191 EGWEEKKLSEIAKTFGRGKSRHRPRNDKSLYGGEYPFIQTGEIRNANHYITKFTQTYNEK 250 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + ++ G + + + + I FD + L + L + Sbjct: 251 GLAQSKLWPVGTLCIT-IAANIAETAILTFDACIPDSVIGLVCDPEKANVDFVEYLLQNF 309 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++A +G+ + + + P P ++EQ I ++ A + L + + Sbjct: 310 KSGLQAEGKGSAQDNINMGTFERMLFPFPSVSEQEKIVCELNAIAESCNNLSPIYQQKLT 369 Query: 194 LLKEKKQALVSYIVTKGLN 212 L E KQ+L+ T L Sbjct: 370 ALDELKQSLLQKAFTGQLT 388 >gi|146297668|ref|YP_001181439.1| restriction modification system DNA specificity subunit [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145411244|gb|ABP68248.1| restriction modification system DNA specificity domain [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 455 Score = 144 bits (362), Expect = 3e-32, Method: Composition-based stats. Identities = 68/432 (15%), Positives = 146/432 (33%), Gaps = 37/432 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDT 75 PK W +V ++R L +G + S + I +G E + G + + Sbjct: 22 EFPKEWTIVSLERDCVLISGLRPKGGASDEGIPSLGGEHITLDGRINFSDVNAKYIPEKF 81 Query: 76 ST---VSIFAKGQILYGKLGPYLRKAIIADFDGI----CSTQFLVLQPKDVLPELLQGWL 128 + IL K G K + + +++ K + + + Sbjct: 82 FKIMTKGKTEENDILINKDGANTGKVAMLKKKFYKDIAINEHLFIIRSKKLFVQQYLFYW 141 Query: 129 LSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L ++I G+ I N +P PPL EQ I E + I+ Sbjct: 142 LFSRFGQKQITDRITGSAQPGLSSTFIKNFLVPRPPLPEQRKIAEILETIDSAIEKTDAI 201 Query: 188 RIRFIELLKEKKQALVSYIVT---KGLNPDVKMKDSGIEW-----VGLVPDHWEVKPFFA 239 ++ + + Q L++ V +G + +++D I+ +G +P+ WEV + Sbjct: 202 IEKYKRIKQGLMQDLLTKGVVSEGEGESERWRLRDENIDKFKDSPLGRIPEEWEVVDVYG 261 Query: 240 LVTELNRKNT-------KLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPG 289 V +N LS+ NI ++ + E +++ G Sbjct: 262 RVNLINGGTPSTARPEFWNGSIPWLSVEDFNIGKRWVFSSSKYITELGLKQSATKLLKKG 321 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ L + + + K S + + Sbjct: 322 MLIISARGTVGVLAQLGADMAFNQSCYG---LDAKDKMKLSNDFLYYALKNFITSFLSLA 378 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + ++ E K + + +PP+ EQ I +++ ++ID ++EK + L+ + Sbjct: 379 YGNVFNTITRETFKEILIPLPPLPEQQRIASIL----SQIDEVIEKEQAYKEKLERIKKG 434 Query: 410 FIAAAVTGQIDL 421 + +TG++ + Sbjct: 435 LMEDLLTGKVRV 446 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 68/212 (32%), Gaps = 17/212 (8%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRF---TKLNTGRTSES------GKDIIYIGLEDVES 58 ++KDS +G IP+ W+VV L G T + I ++ +ED Sbjct: 240 DKFKDSP---LGRIPEEWEVV---DVYGRVNLINGGTPSTARPEFWNGSIPWLSVEDFNI 293 Query: 59 GTGKYL--PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 G K S + KG ++ G A + + + + Sbjct: 294 GKRWVFSSSKYITELGLKQSATKLLKKGMLIISARGTVGVLAQLGADMAFNQSCYGLDAK 353 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + + ++ G + + I +P+PPL EQ I + Sbjct: 354 DKMKLSNDFLYYALKNFITSFLSLAYGNVFNTITRETFKEILIPLPPLPEQQRIASILSQ 413 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I+ + + + K + L++ V Sbjct: 414 IDEVIEKEQAYKEKLERIKKGLMEDLLTGKVR 445 >gi|300114985|ref|YP_003761560.1| restriction modification system DNA specificity domain-containing protein [Nitrosococcus watsonii C-113] gi|299540922|gb|ADJ29239.1| restriction modification system DNA specificity domain protein [Nitrosococcus watsonii C-113] Length = 393 Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats. Identities = 57/407 (14%), Positives = 131/407 (32%), Gaps = 42/407 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDI------IYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + WK++ + ++ G ++ K++ +++ DV + + Sbjct: 3 EGWKIISLGEIATVSAGSSAPQNKELFEGGTHLFVRTSDVGKIRVGLINNSADKLNEKGI 62 Query: 77 TV-SIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 +F G IL+ K G +L +I + S+ ++ K V Sbjct: 63 KKLKLFPSGTILFPKSGASTFLNHRVILTCNAYVSSHLAAIKAKTQSALDRYLLHYLTTV 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + I I +P+P + EQ I + I + + + Sbjct: 123 KAQD--LIQDHKYPSLKVSDIQGIEIPLPSIPEQKRIVAILDEAFEGIGRAVANAEKNLA 180 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 E ++ ++ + T+ V+ K +G V + + K ++ N Sbjct: 181 NACELFESYLNSVFTQKGEGWVERK------LGDVCKNLDSKRIPITKSKRKSGNIPYYG 234 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVM 311 ++ + Y I D ++ R+ + + Sbjct: 235 ASGIV--------------------DYVADFIFDEDLLLVSEDGANLLARTYPIAFSISG 274 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + A++ ++ + + S L M + L + + +PV +PP Sbjct: 275 KTWVNNHAHVLRFDEISSQRFIEYYLNSISLVPYVSGMA---QPKLNQKALNSIPVSLPP 331 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ I ++ +A +Q + L E + S + A TG+ Sbjct: 332 ADEQRKIVTQLDKLSAETHRFEAIYQQKLTALAELKQSLLHKAFTGE 378 >gi|325919356|ref|ZP_08181389.1| restriction endonuclease S subunit [Xanthomonas gardneri ATCC 19865] gi|325550169|gb|EGD20990.1| restriction endonuclease S subunit [Xanthomonas gardneri ATCC 19865] Length = 408 Score = 143 bits (361), Expect = 4e-32, Method: Composition-based stats. Identities = 55/416 (13%), Positives = 142/416 (34%), Gaps = 38/416 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W P+ K+ +G T + G +I +I +V+ T + S+ Sbjct: 5 GWSQHPLGDIAKVTSGGTPDRSTPSYWGGNIPWITTGEVQFNTITDSAEKITELGLKNSS 64 Query: 78 VSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 +F G +L G + + + + + + Sbjct: 65 AKLFPIGTLLVAMYGQGKTRGQIAKLGIEAATNQACAAILFDARND-PDFHFQYLASQYE 123 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + T + + + I +P+PP+ EQ I + D I R + Sbjct: 124 ELRELGNAGTQKNLNGGILKRILVPVPPIQEQRRIAHIL----STWDQAIATTERLLANA 179 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 +++ L + + G + + W+ + + R+NT + Sbjct: 180 CTQRKTLTNALFVHGRHSSM------------TTHGWKFADLDEVFERVTRRNTTANSNV 227 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF-IDLQNDKRSLRSAQVMERG 314 + ++ + + N + E+ Y +++ GE + +++ ++G Sbjct: 228 LTISGTRGLVSQRDYFNKSVASENLSGYTLIERGEFAYNKSYSAGYPMGAIKPLTRYDQG 287 Query: 315 IITSAYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLP 366 +++S Y+ + D+ + L + + G R ++ D +L Sbjct: 288 VVSSLYICFRLRDGVEADADFFRHYFEVGMLNEGLSGIAQEGARNHGLLNVGVGDFFKLR 347 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + +P + EQ + ++N+ + + I + L++ + + ++ +TG+ +R Sbjct: 348 LHIPDVTEQRRVAAILNMAEQK----EQLITAQLDKLRDEKKALMSQLLTGKRRVR 399 >gi|17230179|ref|NP_486727.1| type I restriction-modification enzyme S subunit [Nostoc sp. PCC 7120] gi|17131780|dbj|BAB74386.1| type I restriction-modification enzyme S subunit [Nostoc sp. PCC 7120] Length = 427 Score = 143 bits (361), Expect = 4e-32, Method: Composition-based stats. Identities = 63/430 (14%), Positives = 134/430 (31%), Gaps = 31/430 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKY 63 Y+ + G +P WK+ + T T ++ K I ++ V+ G + Sbjct: 9 ESYQKTE---FGIVPNDWKIRKLVECCNKITDGTHDTPKPLAQGIPFLTAIHVKEGFIDF 65 Query: 64 LPKDGNSRQSDTSTVSIF--AKGQILYGKLGPYLRKAIIADFDGICS--TQFLVLQPKDV 119 + S K +L +G + + D + S L+ K+ Sbjct: 66 NNCYYLPQSIHESIYKRCNPEKNDVLMVNIGAGVATTALIDVEYEFSLKNVALLKPDKNN 125 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAET 178 L + LS++ + + G K IG I +PIPP + EQ I + + Sbjct: 126 LIGSYLNYCLSLNKFRITNQLLSGGAQPFLSLKQIGEISIPIPPTIEEQEAIAQSLSDVD 185 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 I + + Q L L + ++ EW + Sbjct: 186 ALITECDRIIAKKHNTKQGTMQQL--------LTGEKRLPGFSGEWEVEEFEQVLKVVDG 237 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFR 294 + L LS N+ + + + + ++ + ++V Sbjct: 238 DRGDNYPSNDELFDNGYCLFLSAKNVTKGGFKFSDCTFITKEKDNLLGNGKLCKKDVVLT 297 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMG-S 351 + + E I S + ++ +D++YL ++S+ Sbjct: 298 TRGTVGNIAFFDYSVPFENIRINSGMVILRSEDKNLDNSYLYSFLKSHLFQTQIDRAVFG 357 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + L + + + + V + EQ I +++ + +EQ K + + Sbjct: 358 SAQPQLTVKGISKFKIPVSSLPEQKAIAQILSDMDTE----IAALEQKRDKYKAIKQGMM 413 Query: 412 AAAVTGQIDL 421 +TG+ L Sbjct: 414 QELLTGKTRL 423 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 74/211 (35%), Gaps = 14/211 (6%) Query: 224 WVGLVPDHWEVKPFFALVT---ELNRKNTKLIESNILSLSYGNIIQKLETRNMGL----- 275 G+VP+ W+++ + K + I L+ ++ + N Sbjct: 15 EFGIVPNDWKIRKLVECCNKITDGTHDTPKPLAQGIPFLTAIHVKEGFIDFNNCYYLPQS 74 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 ES + +++ I +L + E + A + + + +YL + Sbjct: 75 IHESIYKRCNPEKNDVLMVNIGAGVATTALIDVE-YEFSLKNVALLKPDKNNLIGSYLNY 133 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVE 394 + + G + L + + + + +PP I+EQ I ++ D L+ Sbjct: 134 CLSLNKFRITNQLLSGGAQPFLSLKQIGEISIPIPPTIEEQEAIAQSLSDV----DALIT 189 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + ++ I + + +TG+ L G S Sbjct: 190 ECDRIIAKKHNTKQGTMQQLLTGEKRLPGFS 220 >gi|118497744|ref|YP_898794.1| type I restriction-modification system, subunit S [Francisella tularensis subsp. novicida U112] gi|194323716|ref|ZP_03057492.1| type I restriction modification DNA specificity domain protein [Francisella tularensis subsp. novicida FTE] gi|118423650|gb|ABK90040.1| type I restriction-modification system, subunit S [Francisella novicida U112] gi|194322080|gb|EDX19562.1| type I restriction modification DNA specificity domain protein [Francisella tularensis subsp. novicida FTE] Length = 407 Score = 143 bits (361), Expect = 4e-32, Method: Composition-based stats. Identities = 66/414 (15%), Positives = 138/414 (33%), Gaps = 24/414 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 + +P W+ + K+ + + K I + ++ + Sbjct: 4 LYKLPAGWEWKKLGDLFKITSSKRVHKKDWLDKGIPFYRAREIVKLAQNGYVDNELFISE 63 Query: 74 DT-----STVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQG 126 D S + + IL +G ++ D + L+ ++ Sbjct: 64 DMYNSFASKYGLPKENDILVTGVGTLGIPFVVKKNDKFYFKDGNIIWLKNENGTNPKYIE 123 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + S + G+T++ N +P+PPLAEQ I K+ + +ID I Sbjct: 124 YCFSSQDVRNQINSNNGSTVATYTITNANNTIIPLPPLAEQKRIVAKLDSLFEKIDKAIE 183 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + I + + K ++G ++ Sbjct: 184 LHQQNITNANTLMASTLDKTFKKLEGEYGMNDILDGIYIG--------CRKGYKPEIIDG 235 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K + +I + N LE K ++ V G+I QN+K S+ Sbjct: 236 KVPFIGMQDIDQYNGINTNYVLEDYEKVSKGKTKFEKNAVLVGKITPC---TQNNKTSIV 292 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKR 364 + + T Y + ++ YL + +RS D+ + G+ RQ + + + Sbjct: 293 PSNINGGFATTEVYALHSKNNMNPFYLNYFVRSKDINDYLVSTMIGATGRQRVPSDAITS 352 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L + +PP+ Q ++ ++D + + EQ + LK ++S + A G+ Sbjct: 353 LKIPLPPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAFRGE 406 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 56/194 (28%), Gaps = 12/194 (6%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIES-----------NILSLSYGNIIQKLETRN 272 + +P WE K L + K + I+ L+ + + Sbjct: 3 ELYKLPAGWEWKKLGDLFKITSSKRVHKKDWLDKGIPFYRAREIVKLAQNGYVDNELFIS 62 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + Y + +I+ + ++ +G + Y Sbjct: 63 EDMYNSFASKYGLPKENDILVTGVGTLGIPFVVKKNDKFYFKDGN-IIWLKNENGTNPKY 121 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + S D+ + + + + +PP+ EQ I ++ +ID Sbjct: 122 IEYCFSSQDVRNQINSNNGSTVATYTITNANNTIIPLPPLAEQKRIVAKLDSLFEKIDKA 181 Query: 393 VEKIEQSIVLLKER 406 +E +Q+I Sbjct: 182 IELHQQNITNANTL 195 >gi|194335173|ref|YP_002019739.1| restriction modification system DNA specificity domain [Prosthecochloris aestuarii DSM 271] gi|194312991|gb|ACF47385.1| restriction modification system DNA specificity domain [Prosthecochloris aestuarii DSM 271] Length = 417 Score = 143 bits (360), Expect = 5e-32, Method: Composition-based stats. Identities = 65/405 (16%), Positives = 132/405 (32%), Gaps = 34/405 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY--LPKDGNSRQSDT 75 W V + + G T +I +I DV + + + + Sbjct: 26 EWGVACLGDLGEFAGGGTPSKTISEYWDGNIPWISSSDVSDESITDVSISRFITNEAIKC 85 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S + G IL + AII D S F P L +L S Sbjct: 86 SATKLIPSGSILLVSRVGVGKLAII-DSPVCTSQDFTNFTPSKDNALFLGYYLKSNG--H 142 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +E +C+G + + I + +P L EQ I + + + I + Sbjct: 143 ALENLCQGMAIKGFTKNDVSKIVLALPDLTEQQKIADCLFSLNALIAAHAEKIEALKT-- 200 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K+ L+ + + +++ G F V + + Sbjct: 201 --HKKGLMQQLFPREGETVPRLRFPEFRDAGE-----WESAFGDNVFDQVSNKEHNSDLP 253 Query: 256 ILSLS--YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +L+++ +G I + + ++ + +S E Y++VD G+ + Q + Sbjct: 254 VLAITQEHGAIPRDMIDYHVSVTDKSIEGYKVVDVGDFIISLRSFQG-----GIEYSRFK 308 Query: 314 GIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS--LKFEDVKRLPVLVP 370 GI + AY + G + Y +++ GLR + ++ L + +P Sbjct: 309 GICSPAYVILRLRKGYSAGYFRQYLKTDRFISQLTKNLEGLRDGKMISYKQFSELSLPIP 368 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I + + + +D L+ + + LK + + Sbjct: 369 SQNEQQKIADCL----SSLDALIAAHAEKLDALKTHKKGLMQQLF 409 >gi|160903326|ref|YP_001568907.1| restriction modification system DNA specificity subunit [Petrotoga mobilis SJ95] gi|160360970|gb|ABX32584.1| restriction modification system DNA specificity domain [Petrotoga mobilis SJ95] Length = 433 Score = 143 bits (360), Expect = 6e-32, Method: Composition-based stats. Identities = 86/438 (19%), Positives = 173/438 (39%), Gaps = 39/438 (8%) Query: 4 YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 Y+ +YK++ +G +PK W+VV + ++L G+T + Y G ++ + Sbjct: 6 YQK-EEYKETE---LGLLPKDWEVVRLGEVSELQQGKTPKRDDYEDYKGYRIIKVKDYEN 61 Query: 64 LPKDGNSRQSDTSTVS-------IFAKGQILY-------GKLGPYLRKA--IIADFDGIC 107 K N + D S V +G L +G + I + Sbjct: 62 ENKISNIIKGDRSFVKTDFGERCRIKEGDSLILSAAHSSNIVGQKIGYVKEIPSQKTFFV 121 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + V K+++P L+ + +I +G H K +G I +P+PPL+EQ Sbjct: 122 AELIRVRPKKNIIPYFCFLSLILMSSRNQIREEVKGG---HLYPKNLGKIRIPLPPLSEQ 178 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN--PDVKMKDSGIEWV 225 I + + + I+ KE K+++++++ T G +V+ + Sbjct: 179 KKIAYVL----SSVQEAKEKTEDVIKATKELKKSMMNHLFTYGPVSLEEVEKVPLKETEI 234 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 GLVP+ WEVK F +V + I ++ R GL Sbjct: 235 GLVPEEWEVKNFGEIVEIRKEIIDPSNGNYIYVGLEHIESGNIKLRKTGLSKGVKSAKYK 294 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCK 344 + P +I++ + DK L + GI ++ + K + ++++A+L + + Sbjct: 295 IYPNDILYAKLRPYLDKGILVE----QEGICSTDLLVFKAKENVYASFIAYLEHTNYFRE 350 Query: 345 VFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 +G+ + + +L + +PP+ EQ I ++++ ID +E E L Sbjct: 351 YAIKTMTGVNHPRTSWRALSQLTIPLPPLSEQKKIASILSA----IDQKIEAEESKKKAL 406 Query: 404 KERRSSFIAAAVTGQIDL 421 ++ S + +T +I + Sbjct: 407 EDLFKSLLHNLMTAKIRV 424 >gi|297618847|ref|YP_003706952.1| restriction modification system DNA specificity domain-containing protein [Methanococcus voltae A3] gi|297377824|gb|ADI35979.1| restriction modification system DNA specificity domain protein [Methanococcus voltae A3] Length = 412 Score = 143 bits (360), Expect = 6e-32, Method: Composition-based stats. Identities = 62/426 (14%), Positives = 150/426 (35%), Gaps = 29/426 (6%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-----IIYIGLEDVESGTGK 62 YK++ IG IP W+V + + K +I ++++GT Sbjct: 4 EGYKETK---IGLIPNDWEVKKLGDVCSFIGDGIHSTPKYCTNGKYYFINGNNLKNGTIV 60 Query: 63 YLPKDGNSRQSDTSTVS-IFAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVL 120 + + + + A+ +L G + + + + + + Sbjct: 61 HTNDTKLISFEEFNKLKQKIAEDALLLSINGTIGNCSYYNNEKILLGKSVAYINLKNKNI 120 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + S + + G+T+ + K + N+ +P+PPL EQ I E + + Sbjct: 121 KNFIYYVIQSPRTVSQFYSELTGSTIKNLSLKSLRNLCIPLPPLKEQQKIAEIL----TK 176 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 D I I +E K+ L+ ++T + ++ +G + + Sbjct: 177 WDNHIETLENLISKKEEYKKGLMQNLLTGKVRFPGFNEEWKEVKLGEICKFLKGNGLSKE 236 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF-RFIDLQ 299 N K ++ + E L ++ + G+I+ Sbjct: 237 KLNKNGKFKCILYGELY-------TTYSEVIKEVLSKTDFKEKIHSEKGDILIPASTTTT 289 Query: 300 NDKRSLRSAQVMERGIITSAYMA---VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + +A E I+ + ++ +LA+ + ++ Sbjct: 290 GIDLANATAINEENVILGGDINILRKKYENKYNNEFLAYYLTYGKKYELAKYAQGTTIVH 349 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L +D+K + + +P ++EQ I V++++ +E +++ + LLK ++ + +T Sbjct: 350 LYGKDIKNMKIQLPTLEEQEQIAEVLSLQDKE----IEILKEKLELLKMQKKGLMQKLLT 405 Query: 417 GQIDLR 422 G+I ++ Sbjct: 406 GEIRVK 411 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 78/208 (37%), Gaps = 10/208 (4%) Query: 225 VGLVPDHWEVKPFFALV----TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 +GL+P+ WEVK + ++ ++ N+ K S+ Sbjct: 11 IGLIPNDWEVKKLGDVCSFIGDGIHSTPKYCTNGKYYFINGNNLKNGTIVHTNDTKLISF 70 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT-SAYMAVKPHGIDSTYLAWLMRS 339 E + + + N S E+ ++ S + ++ ++++S Sbjct: 71 EEFNKLKQKIAEDALLLSINGTIGNCSYYNNEKILLGKSVAYINLKNKNIKNFIYYVIQS 130 Query: 340 YDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 FY+ + ++L + ++ L + +PP+KEQ I ++ I+ L I + Sbjct: 131 PRTVSQFYSELTGSTIKNLSLKSLRNLCIPLPPLKEQQKIAEILTKWDNHIETLENLISK 190 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +E + + +TG++ G ++ Sbjct: 191 K----EEYKKGLMQNLLTGKVRFPGFNE 214 >gi|325283709|ref|YP_004256250.1| restriction modification system DNA specificity domain-containing protein [Deinococcus proteolyticus MRP] gi|324315518|gb|ADY26633.1| restriction modification system DNA specificity domain protein [Deinococcus proteolyticus MRP] Length = 396 Score = 143 bits (360), Expect = 6e-32, Method: Composition-based stats. Identities = 56/408 (13%), Positives = 127/408 (31%), Gaps = 42/408 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+ V + + +G T K DI +I ++ SG + Sbjct: 18 VPEGWRGVKLGEMVECFSGGTPSRTKPEYYGGDIPWIKSGELNSGNIYATEETITEAGLQ 77 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ + G +L+ G D + L ++P + L + LS V Sbjct: 78 NSSAKVAKAGTLLFALYGATAGVIGRTRIDAAINQAILAIEPSEELLSEFLEYFLSSSVG 137 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G + + + P+ +PPL EQ I + + L + Sbjct: 138 NLLHLTQGG--QPNFNAGIVKGFPLLLPPLPEQRKIAAILSTWDDSLANLTDLLAAKRQQ 195 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + +AL L ++ EW + ++ + + + L S Sbjct: 196 KRGLAEAL--------LTGQKRLPGFEGEW-----EEKKLGDIAKVYQPVTITSADLKAS 242 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I Y+ Y +V ++ + Sbjct: 243 GYPVYGANGKIGY------------YDKYNHEQWQTLVTCRGSSSG-----AVSRSEDYA 285 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 IT M + + ++ + + + + + ++ + +PP+ E Sbjct: 286 WITGNAMVINVDNVLKVDKQFIYQMMLSKDFSSLVSGSGQPQITKKPLEDFAISLPPLPE 345 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 Q I +V++ + I L ++E++ + +TG++ ++ Sbjct: 346 QQAIASVLSTLDSEIASLEALK----AKVQEQKRGLMDELLTGRVRVK 389 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 68/210 (32%), Gaps = 21/210 (10%) Query: 229 PDHWEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPES---Y 280 P+ W +V + + +I + G + E+ Sbjct: 19 PEGWRGVKLGEMVECFSGGTPSRTKPEYYGGDIPWIKSGELNSGNIYATEETITEAGLQN 78 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + ++ G ++F + ++ I + + S +L + + S Sbjct: 79 SSAKVAKAGTLLFALYGAT---AGVIGRTRIDAAINQAILAIEPSEELLSEFLEYFLSSS 135 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + G + + VK P+L+PP+ EQ I +++ + L + + Sbjct: 136 VGN--LLHLTQGGQPNFNAGIVKGFPLLLPPLPEQRKIAAILSTWDDSLANLTDLLAAK- 192 Query: 401 VLLKERRSSFIAAAVTGQIDLRG----ESQ 426 ++++ A +TGQ L G + Sbjct: 193 ---RQQKRGLAEALLTGQKRLPGFEGEWEE 219 >gi|199581429|gb|ACH89416.1| FclIS [Flavobacterium columnare] Length = 393 Score = 143 bits (359), Expect = 7e-32, Method: Composition-based stats. Identities = 65/418 (15%), Positives = 148/418 (35%), Gaps = 38/418 (9%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 Q+K++ IG IP+ W+V + L GR + + + G + Sbjct: 5 QFKNTD---IGLIPEDWEVKQLGEVITLINGRAYSQNELLFNGKYRVLRVGNF-FSSDKW 60 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + ++ KG ++Y + I ++ + L + ++ Sbjct: 61 YWSNLELASKFYVNKGDLMYAWS-ASFGPKFWKNEKTIYHYHIWKIELSEYLDKFYLFYV 119 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 L D + I +G TM H + + +PIP L EQ I E + I++L Sbjct: 120 LEKD-KENILNQSQGGTMFHITKESMEKRKIPIPSLKEQQAIAEVLSDTDAWIESLEKLI 178 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + + Q L++ +D ++ +G + + V + + Sbjct: 179 TKKRLVKQGAMQQLLT-----------PKEDWEVKKLGEIAE----------VRDGTHQT 217 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIVDPGEIVFRFIDLQNDKRSL 305 +ES I S ++ + + + ++ ++ G+I+ I D + + Sbjct: 218 PTYVESGIPFYSVESVTKNDFKNTKYISEQEHKILTKSFRIEKGDILMTRIGSIGDCKLI 277 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVK 363 + S + + YL ++ + K ++ S + + + + Sbjct: 278 --DWDVNASFYVSLALLKVKPIFSANYLCHYSKTENFKKEIDINSLQSAIPKKINLGPIS 335 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + P + EQ I +++ A I+ L E+ + K+ + + +TG+I L Sbjct: 336 NVKIEFPSLDEQQRIATILSDMDAEIEHL----EKKLNKAKQLKQGIMQQLLTGKIRL 389 >gi|327401773|ref|YP_004342612.1| restriction modification system DNA specificity domain-containing protein [Archaeoglobus veneficus SNP6] gi|327317281|gb|AEA47897.1| restriction modification system DNA specificity domain protein [Archaeoglobus veneficus SNP6] Length = 420 Score = 143 bits (359), Expect = 7e-32, Method: Composition-based stats. Identities = 77/423 (18%), Positives = 162/423 (38%), Gaps = 31/423 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IG IP+ W+VV + TK+N + ++ YI ++ +++ K K + + Sbjct: 10 IGKIPEDWEVVRLGDVTKVNPESINPAKEAPDEEFYYIEIDSIQNSKIK-SVKKIIGKNA 68 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWL-- 128 + + + ++ + PYL+ +I ICST F VL+ K+ L E Sbjct: 69 PSRARRVVRENDVIMSTVRPYLKAFVIVPKKYDGQICSTGFAVLRCKNELIEPKYLLYNL 128 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 ++ + G + + + +P+PPL EQ I E + +D I + Sbjct: 129 FMDRTIEQCNRLMVGGQYPALNQSHVEQLKIPLPPLPEQRKIAEIL----STVDEAIQKV 184 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR-- 246 I + K+ L+ ++TKG+ + KD+ I G +P WEV + E Sbjct: 185 DEAIVKTERLKKGLMQELLTKGIG-HTEFKDTEI---GRIPKEWEVVRLGDVAYEFISGG 240 Query: 247 ----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 K K +I + +I + + + E + I + + Sbjct: 241 TPSTKVAKYWNGDIPWIRSVHITKFYIDERSIGQYITKEGLENSAAKIIPKNNLIIATRV 300 Query: 303 RSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKF 359 +SA + I + + + +L W + S + + + + + Sbjct: 301 GIGKSAVNLIDVAINQDLTGIMLNKSKAEPFFLVWYLNSPKIVSLLESFSRGTTIKGIPQ 360 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + +K+L + +PP+ EQ I +++ ++ E + L+ + + +TG+ Sbjct: 361 DYIKKLLIPLPPLPEQQKIAEILSTVDKKL----ELERKRKEKLERIKKGLMNDLLTGRR 416 Query: 420 DLR 422 ++ Sbjct: 417 RVK 419 >gi|323141886|ref|ZP_08076747.1| type I restriction modification DNA specificity domain protein [Phascolarctobacterium sp. YIT 12067] gi|322413633|gb|EFY04491.1| type I restriction modification DNA specificity domain protein [Phascolarctobacterium sp. YIT 12067] Length = 464 Score = 143 bits (359), Expect = 8e-32, Method: Composition-based stats. Identities = 67/410 (16%), Positives = 158/410 (38%), Gaps = 14/410 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P++W V + ++ TG T G + + D++ G Y + S + Sbjct: 36 EVPENWVWVRLGAIAEIVTGGTPSKKHPEYYGGNFPFYKPSDLDQGRLTYDASEYLSEEG 95 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + I K +G + + G + Q PK + L +L + + Sbjct: 96 KNVS-RIIPKNSTAVCCIGSIGKCGYLMCE-GTTNQQINSAIPK-INSLCLYYYLCTENF 152 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 Q + ++ T++ + + + P+PPL+EQ I E+I ++D Sbjct: 153 VQDLLSMASATTIAIVNKSKMESCAFPLPPLSEQQRIVERIEELFAKLDEAKERLQEGAY 212 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 +K A++ T L + ++ + +V + + L Sbjct: 213 SFAVRKAAILHKAFTGELTKQWRRENGVSDESWEDKLLGDVCTVNPKKIDAKNLDDNLEV 272 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVM 311 S + + +++ ++ + + + G+++F I ++N K ++ V Sbjct: 273 SFVPMAAVSDVLGEIVNHEVKNLQDVRTGFTNFSKGDVIFAKITPCMENGKSAIVGPLVN 332 Query: 312 ERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVL 368 + G ++ Y+ +++ YL ++R+ A+ +G+ +Q + ++ +L Sbjct: 333 DIGYGSTEFYVLRCKEELNNKYLYHMVRNTTFRAEAKAVMTGVVGQQRVPKTFLQEYQLL 392 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +P + EQ +I +I+ AR + EQ++ + + S +A A G+ Sbjct: 393 LPTLSEQHEIVRLIDDLLARERAAQQAAEQALASIDLMKKSILARAFRGE 442 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 62/203 (30%), Gaps = 12/203 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTELNR-----KNTKLIESNILSLSYGNIIQKLETRN--MGL 275 E VP++W A+ + K+ + N ++ Q T + L Sbjct: 32 EQPYEVPENWVWVRLGAIAEIVTGGTPSKKHPEYYGGNFPFYKPSDLDQGRLTYDASEYL 91 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E +I+ I L G + P + Sbjct: 92 SEEGKNVSRIIPKNSTAVCCIGSIGKCGYLMC-----EGTTNQQINSAIPKINSLCLYYY 146 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 L + + + + ++ +PP+ EQ I I A++D E+ Sbjct: 147 LCTENFVQDLLSMASATTIAIVNKSKMESCAFPLPPLSEQQRIVERIEELFAKLDEAKER 206 Query: 396 IEQSIVLLKERRSSFIAAAVTGQ 418 +++ R+++ + A TG+ Sbjct: 207 LQEGAYSFAVRKAAILHKAFTGE 229 >gi|257076850|ref|ZP_05571211.1| type I restriction-modification enzyme, S subunit, putative [Ferroplasma acidarmanus fer1] Length = 420 Score = 143 bits (359), Expect = 8e-32, Method: Composition-based stats. Identities = 58/423 (13%), Positives = 131/423 (30%), Gaps = 25/423 (5%) Query: 18 IGAIPKHWKVVPIKRFTK-LNTGRTSESGK---DIIYIGLEDVESGTGKYLPK-DGNSRQ 72 IG IP+ W V + + G T + K +E + ++ + Sbjct: 4 IGEIPQEWGFVKLGDVLSLIKNGVTYKQNKKDSGYPVTRIETISEEKIDTAKVGYIDNIK 63 Query: 73 SDTSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 ++ +G IL+ + + AI + +L + ++ +L+ Sbjct: 64 TENINDYRLIEGDILFSHINSLEHIGKTAIYEGEPELLLHGMNLLLLRSDKSKIEPSYLV 123 Query: 130 SIDVTQRIEAICEGA-----TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 R + + + + + + I +P+PPL EQ I E + I + Sbjct: 124 YSLKFYRAKELFKSMAKRAVNQASINQTELKRIKIPLPPLPEQQKIAEILSTADDEIQKM 183 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFAL 240 + +L K Q L++ + ++ + EW +G + Sbjct: 184 DEQIALAEQLKKGLMQKLLTRGIGHTRFKTTEIGEIPEEWDTFGLGEIFKTITGTTPSTK 243 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 V + T + N I R + K I+ I+ Sbjct: 244 VKDYWHGGTIEWLTPKDLNKLNNTITLPPSERKVTEKALKENNLNILPENSILISTRAPV 303 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + +G + + + A+ ++S + + L Sbjct: 304 GYVGINNTKITFNQGC--KGLVPLNRDVSFPFFYAYYLKS-KTTFLNSLSTGSTFKELSK 360 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 E + + V +PP+ EQ I +++ ++ E + L + + TG++ Sbjct: 361 EGLDDVVVPLPPLPEQQKIGEILSTVDNKL----ELLGNKREKLNVLKKGLMNDLFTGKV 416 Query: 420 DLR 422 ++ Sbjct: 417 RVK 419 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 86/206 (41%), Gaps = 18/206 (8%) Query: 224 WVGLVPDHWEVKPFFALV--------TELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 +G +P W ++ + N+K++ + I ++S I + Sbjct: 3 EIGEIPQEWGFVKLGDVLSLIKNGVTYKQNKKDSGYPVTRIETISEEKIDTAKVGYIDNI 62 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM---AVKPHGIDSTY 332 K E+ Y++++ G+I+F I+ + ++ + I+ +Y Sbjct: 63 KTENINDYRLIE-GDILFSHINSLEHIGKTAIYEGEPELLLHGMNLLLLRSDKSKIEPSY 121 Query: 333 LAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L + ++ Y ++F +M + S+ ++KR+ + +PP+ EQ I +++ Sbjct: 122 LVYSLKFYRAKELFKSMAKRAVNQASINQTELKRIKIPLPPLPEQQKIAEILSTADDE-- 179 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVT 416 ++K+++ I L ++ + + +T Sbjct: 180 --IQKMDEQIALAEQLKKGLMQKLLT 203 >gi|266619618|ref|ZP_06112553.1| restriction modification system DNA specificity domain protein [Clostridium hathewayi DSM 13479] gi|288868820|gb|EFD01119.1| restriction modification system DNA specificity domain protein [Clostridium hathewayi DSM 13479] Length = 456 Score = 142 bits (358), Expect = 8e-32, Method: Composition-based stats. Identities = 71/418 (16%), Positives = 146/418 (34%), Gaps = 17/418 (4%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKD 67 +W +P +W V +K + TG T K + D++ G + Sbjct: 24 EEEWPYEVPGNWCWVRLKDVAFVITGGTPSKNKPEYYGGTFPFFKPADLDYGRNMVAASE 83 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 S + + I AK +G K DG + Q PK L + Sbjct: 84 FLSEEGKAVSRCIPAK-STAVCCIGSI-GKCGYLCVDGTTNQQINSAIPKV-NSLFLYYY 140 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 +I T+++ T+S + + P+PPL EQ I I ++D + + Sbjct: 141 CNTILFTKQLRLKASATTISIVNKSKMEQCLFPLPPLREQQRIANHIEEMFYKLDEIKEK 200 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 +E +++K A++ + L + K G+ + G + ++ Sbjct: 201 TQLVLESSEDRKAAILYKAFSGALTAKWR-KHKGVSFEGWITKPLSEVATLQTGLMKGKR 259 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFR-FIDLQNDKRS 304 N + L+ + + + G+++F D RS Sbjct: 260 NNQKTVLLPYLRVANVQDGYLDLKEIKNIEVDVLKIERYRLKKGDVLFTEGGDFDKLGRS 319 Query: 305 LRSAQVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFE 360 + + I + + +D +L+ S F + + S+ Sbjct: 320 SVWNEEIPDCIHQNHIFVVRTQTDTLDPYFLSLQAGSRYGKTYFIGCSKQTTNLASINST 379 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +K PVL+P I+EQ +I N++N + + + + + + ++E + S ++ A G+ Sbjct: 380 QLKNFPVLIPTIEEQREIVNILNFFLGKEEQIKQNCLKLLEKIEEIKKSILSRAFRGE 437 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 61/206 (29%), Gaps = 14/206 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG----- 274 EW VP +W + + + ++ Sbjct: 23 PEEEWPYEVPGNWCWVRLKDVAFVITGGTPSKNKPEYYGGTFPFFKPADLDYGRNMVAAS 82 Query: 275 --LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 L E + + I L G + P +S + Sbjct: 83 EFLSEEGKAVSRCIPAKSTAVCCIGSIGKCGYLCV-----DGTTNQQINSAIPKV-NSLF 136 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L + + K S S+ +++ +PP++EQ I N I ++D Sbjct: 137 LYYYCNTILFTKQLRLKASATTISIVNKSKMEQCLFPLPPLREQQRIANHIEEMFYKLDE 196 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTG 417 + EK + + ++R+++ + A +G Sbjct: 197 IKEKTQLVLESSEDRKAAILYKAFSG 222 >gi|170079468|ref|YP_001736104.1| type 1 restriction-modification system specificity subunit [Synechococcus sp. PCC 7002] gi|169887137|gb|ACB00849.1| type 1 restriction-modification system specificity subunit [Synechococcus sp. PCC 7002] Length = 398 Score = 142 bits (358), Expect = 9e-32, Method: Composition-based stats. Identities = 58/410 (14%), Positives = 135/410 (32%), Gaps = 33/410 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+V + + G + K I +I + D + + + Sbjct: 3 WEVKTLDDLCDIARGGSPRPIKSYLTNEPDGINWIKIGDASASSKYIYETQEKIKPEGIK 62 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQ 135 G L + R I+ I ++ + + +L S + Sbjct: 63 KSRFVEPGDFLLSNSMSFGRPYIMRTSGCIHDGWLVLKDKSGLFDQDYLYYFLGSQAAYK 122 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + G+T+ + + + + +P+PP+AEQ I E + I+ + + Sbjct: 123 QFDKLAAGSTVRNLNTTLVKKVLVPVPPIAEQKRIVEILDESFSGIERAEAIARQNLTNA 182 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 +E + ++ I + + + L+ + K E+ Sbjct: 183 RELFDSYLNKIFLDFV---------------ERKNTQTLNCITDLIVDCEHKTAPTQETG 227 Query: 256 ILSLSYGNIIQK-LETRNMGLKPESYETYQ----IVDPGEIVFRFIDLQNDKRSLRSAQV 310 S+ NI + L N+ E G+++ + + + Sbjct: 228 FPSIRTPNIGKGHLILDNVYRVSEETYKQWTRRAKPQSGDLILAREAPAGNVGVIPEGER 287 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPV-L 368 + + + I+ YLA+ + + + + SG Q + +D++ L + Sbjct: 288 V--CLGQRTVLIRPKENINPQYLAFFLLHPKMQERLLSKSSGATVQHVNMKDIRALKMGD 345 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PPI+ Q + + + L E ++ I L + + S + A +GQ Sbjct: 346 LPPIEIQDRLIESLLDVQEKSKKLEEVYQRKIEALGKLKQSILQKAFSGQ 395 >gi|297530924|ref|YP_003672199.1| restriction modification system DNA specificity domain protein [Geobacillus sp. C56-T3] gi|297254176|gb|ADI27622.1| restriction modification system DNA specificity domain protein [Geobacillus sp. C56-T3] Length = 485 Score = 142 bits (358), Expect = 1e-31, Method: Composition-based stats. Identities = 75/439 (17%), Positives = 149/439 (33%), Gaps = 45/439 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P +W V + R K+N + + D ++ + V+ GK + +S Sbjct: 26 EVPGNWVWVRLGRIVKINPPKPKLAYGDDHICSFLPMSAVDPVEGKIAYLEERPFRSVKK 85 Query: 77 TVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQ-PKDVLPELLQGWLL 129 + F + IL+ K+ P + + + G ST+F V++ PK V + + Sbjct: 86 GYTYFEENDILFAKITPCMENGNSVITEGLLNGFGFGSTEFYVIRTPKTVDNRYIYYLVR 145 Query: 130 SIDVTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S ++ + + G + P+ +PPL EQ I +KI +ID Sbjct: 146 SERFRKQAKNVMAGAVGQQRVPKFFLEAYPIALPPLNEQKRIADKIERLFAKIDEAKRLI 205 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE---------------WVGLVPDHWE 233 E +++++ ++ L + + S +E W VP +W Sbjct: 206 GEVKESIEQRRAVMLEKAFKGQLGTNDPSEKSILETSDDLSEKDVIPKEQWPYEVPGNWV 265 Query: 234 VKPFFALVTELNR-----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD- 287 LV + + + I GN+ + + + V Sbjct: 266 WVRLKHLVDFFSGSAFPNQYQGYNDLEIPFYKVGNLKDTDSNYYIYSEENTISEEIRVKL 325 Query: 288 ------PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRS 339 I+F I R R V + I + M K + D+ L + Sbjct: 326 KAKKVPKDTILFAKIG--EAIRLNRRGLVPKPACIDNNLMGFKSNENILDNKLLLYWSLK 383 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 D K A S++ ++ + +PP+ EQ I ++ +++ + + Sbjct: 384 EDFYKYSQA---TAVPSIRKSTLEAIAFPLPPLNEQKRIAEKLDNLLEKLENEKQLVLAV 440 Query: 400 IVLLKERRSSFIAAAVTGQ 418 L + S + A G+ Sbjct: 441 EEKLDLLKQSVLQKAFRGE 459 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 71/207 (34%), Gaps = 14/207 (6%) Query: 17 WIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVE-SGTGKYLPKDGN 69 W +P +W V +K +G + +I + + +++ + + Y+ + N Sbjct: 256 WPYEVPGNWVWVRLKHLVDFFSGSAFPNQYQGYNDLEIPFYKVGNLKDTDSNYYIYSEEN 315 Query: 70 SRQSDTS---TVSIFAKGQILYGKLGPYLR--KAIIADFDGICSTQFLVLQPKDVLPELL 124 + + K IL+ K+G +R + + + K L Sbjct: 316 TISEEIRVKLKAKKVPKDTILFAKIGEAIRLNRRGLVPKPACIDNNL--MGFKSNENILD 373 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 LL + + + + + I P+PPL EQ I EK+ +++ Sbjct: 374 NKLLLYWSLKEDFYKYSQATAVPSIRKSTLEAIAFPLPPLNEQKRIAEKLDNLLEKLENE 433 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGL 211 + E L KQ+++ L Sbjct: 434 KQLVLAVEEKLDLLKQSVLQKAFRGEL 460 >gi|330506918|ref|YP_004383346.1| type I restriction-modification enzyme, S subunit [Methanosaeta concilii GP-6] gi|328927726|gb|AEB67528.1| type I restriction-modification enzyme, S subunit [Methanosaeta concilii GP-6] Length = 418 Score = 142 bits (358), Expect = 1e-31, Method: Composition-based stats. Identities = 90/405 (22%), Positives = 160/405 (39%), Gaps = 20/405 (4%) Query: 22 PKHWKVVPIKRFTKLNTG-RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 P WK++ + +L D+ Y+GLE ++S + K S S+ S Sbjct: 7 PSSWKMISLDEVCELRKEAIHPNKYPDLPYVGLEHIDSSNS--ILKRSGSSFEVNSSKSK 64 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQRIEA 139 F G ILYGKL PYL K+++ DFDG+CST LVL+ K+ + P+ L + + Sbjct: 65 FHSGDILYGKLRPYLDKSVLVDFDGMCSTDILVLKTKESIVPQFLVNIIHTSQFINYAVN 124 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 +G W I +PPL EQ I + R+R I L +E+K Sbjct: 125 SSKGLNHPRTSWSSISAFKFLLPPLPEQRAIAR----AMRAVQAAREARLREIALERERK 180 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 AL+ ++ T G K + I + +++ + + + +L + Sbjct: 181 AALMEHLFTHG-TRGEPTKMTEIGEMPESWSMIQLEEACIKIVDCPHSTPHFSPAGVLVV 239 Query: 260 SYGNIIQK-LETRNMGLKPESYE----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 NI L+ + E G+++F + L V Sbjct: 240 RNFNIRNGRLDLKFPSYTTEEEYSERVKRCEPTEGDVLFSREAPVG-EACLVPPDVRLCL 298 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIK 373 + V ++ +L + S + + A+ SG + L DVKRL + ++ Sbjct: 299 GQRMMLLRVDTSKLNRFFLVQVFYSNAIRSIMMAISSGVTAKHLNVADVKRLRIPFSSME 358 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ I+++++ ++I L + E S+ E + + + G+ Sbjct: 359 EQKQISDILSACDSKITAL--EHEASLH--DELFRAMLEELMNGR 399 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 72/203 (35%), Gaps = 13/203 (6%) Query: 18 IGAIPKHWKVVPIKRFT-KLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IG +P+ W ++ ++ K+ S S ++ + ++ +G + + Sbjct: 202 IGEMPESWSMIQLEEACIKIVDCPHSTPHFSPAGVLVVRNFNIRNGRLDLKFPSYTTEEE 261 Query: 74 DTSTVSIFAK--GQILYGKLGPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWL 128 + V G +L+ + P ++ +C Q L + + L Sbjct: 262 YSERVKRCEPTEGDVLFSREAPVGEACLVPPDVRLCLGQRMMLLRVDTSKLNRFFLVQVF 321 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + + AI G T H + + + +P + EQ I + + D+ IT Sbjct: 322 YSNAIRSIMMAISSGVTAKHLNVADVKRLRIPFSSMEEQKQISDIL----SACDSKITAL 377 Query: 189 IRFIELLKEKKQALVSYIVTKGL 211 L E +A++ ++ L Sbjct: 378 EHEASLHDELFRAMLEELMNGRL 400 >gi|254448290|ref|ZP_05061752.1| restriction modification system DNA specificity domain [gamma proteobacterium HTCC5015] gi|198262157|gb|EDY86440.1| restriction modification system DNA specificity domain [gamma proteobacterium HTCC5015] Length = 416 Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats. Identities = 68/421 (16%), Positives = 133/421 (31%), Gaps = 34/421 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IPK WK VP+ ++ R S +++ I + K + S Sbjct: 5 IPKDWKRVPLSSVSERMKRRNSAGNTNVLTISAVHGLVNQKDFFNK--IVASDNLSNYFH 62 Query: 81 FAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDV 133 KG Y K + + +G+ S ++ K + + S Sbjct: 63 LKKGDFAYNKSYSHGYPVGVVRRLEMYDEGVLSPLYICFSMKGEGVDDKFAAYFFDSHWF 122 Query: 134 TQRIEAICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + I I + +H ++ +PPL EQ I + + I+ + Sbjct: 123 IEEINEIAKEGARNHGLLNVGVGDFFDLDFVLPPLPEQQKIAAILSSVDEVIEKTRAQID 182 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-- 247 + +L Q L++ + KDS + G +P+ W+V L Sbjct: 183 KLKDLKTGMMQELLTKGIGH-----AAFKDSPV---GRIPEGWDVVALGDLGKWKGGGTP 234 Query: 248 ---NTKLIESNILSLSYGNIIQKLETRNMGLKPES--YETYQIVDPGEIVFRFIDLQNDK 302 N NI +S ++ + T+ E E+ + + V + K Sbjct: 235 SKSNKDYWNGNIPWVSPKDMKSEFITQTSDQITEEAISESSTNLVSRDSVLVVVRSGILK 294 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM--GSGLRQSLKFE 360 +L A + A+ + S + + KV A +S+ F+ Sbjct: 295 HTLPVALASCDLALNQDMRALSVNSDHSERFVFQYLQANNHKVLRATLKAGNTVESIDFK 354 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 + PP++EQ I + RI + + + + +TG++ Sbjct: 355 VFSDYLIPCPPLEEQEKIALAVEAVGNRIRA----KAAQLDAYVIMKQALMQDLLTGKVR 410 Query: 421 L 421 + Sbjct: 411 V 411 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 75/209 (35%), Gaps = 17/209 (8%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGK 62 +KDS V G IP+ W VV + K G T +I ++ +D++S Sbjct: 204 AFKDSPV---GRIPEGWDVVALGDLGKWKGGGTPSKSNKDYWNGNIPWVSPKDMKSEFIT 260 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDV 119 S+ ++ ++ +L L+ +A D + L Sbjct: 261 QTSDQITEEAISESSTNLVSRDSVLVVVRSGILKHTLPVALASCDLALNQDMRALSVNSD 320 Query: 120 LP-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + +L + + + G T+ D+K + +P PPL EQ EKI Sbjct: 321 HSERFVFQYLQANNHKVLRATLKAGNTVESIDFKVFSDYLIPCPPLEEQ----EKIALAV 376 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIV 207 + I + ++ KQAL+ ++ Sbjct: 377 EAVGNRIRAKAAQLDAYVIMKQALMQDLL 405 >gi|332662759|ref|YP_004445547.1| restriction modification system DNA specificity domain-containing protein [Haliscomenobacter hydrossis DSM 1100] gi|332331573|gb|AEE48674.1| restriction modification system DNA specificity domain protein [Haliscomenobacter hydrossis DSM 1100] Length = 404 Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats. Identities = 61/409 (14%), Positives = 130/409 (31%), Gaps = 25/409 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + W++ + + G + S I +I + D + + Sbjct: 3 EGWEMKKLGEVVSIERGGSPRPIEKYITNSPDGINWIKISDATASEKYIYETKEKITRDG 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWLLSI 131 + +G + + R I G +LV K E L L S Sbjct: 63 LHKTRVVNEGDFILSNSMSFGRP-YIMKTRGCIHDGWLVLKQKDNKIFETEFLYYLLSSP 121 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 V Q+ + G+T+ + + + ++ +P PPL EQ I + I + Sbjct: 122 FVFQQFNSKAAGSTVRNLNIALVSSVDVPTPPLPEQHRIVAILDEAFAAIAKAKANAEQN 181 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ KE ++ + + + + + +G + H K KN Sbjct: 182 LKNAKELFESYLQGVFEQRGDGW------EEKTLGEIAKHSLGKMLDKN------KNKGT 229 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 ++ + + S L E+ + G+++ + ++ Sbjct: 230 LQKYLRNQSVRWFSFNLNDLTEMPFLENEKEKYTAIKGDVMVCEGGYPG-RAAIWEEDYP 288 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + +L +L S K+ Q E + + V VPP Sbjct: 289 IYFQKAIHRVRFHKIEYNKLFLYYLFISDKSGKLKTHFSGTGIQHFTGEALHKFVVPVPP 348 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 + E I ++ +A+ L +Q I L+E + S + A +G++ Sbjct: 349 VNEAKSIVQKLDALSAQTKKLEAIYQQKINDLEELKKSILQKAFSGELK 397 >gi|229526955|ref|ZP_04416352.1| type I restriction-modification system specificity subunit S [Vibrio cholerae 12129(1)] gi|229335567|gb|EEO01047.1| type I restriction-modification system specificity subunit S [Vibrio cholerae 12129(1)] Length = 413 Score = 141 bits (356), Expect = 1e-31, Method: Composition-based stats. Identities = 71/426 (16%), Positives = 150/426 (35%), Gaps = 41/426 (9%) Query: 21 IPKHWKVVPIKRFT-KLNTGR---TSESGKD-----IIYIGLEDVESGTGKYLPKDGNSR 71 +PK W + +K K+ G + I ++ + + G G + Sbjct: 2 VPKGWDALNLKNVAQKIQDGNYGADYPKADELVASGIPFLTSKVI-GGNGTVNQDKFDYI 60 Query: 72 QSDTS---TVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELL- 124 + + G IL+ G + I G Q +++ + + + Sbjct: 61 SEEKHQKLKKAQITSGDILFTNRGANVGTIAITPDYLSDGNIGPQLTLIRCNEKIEKDFL 120 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +L +++ G+ M+ K + +PPL EQ I + + D Sbjct: 121 FQFLRGSFFQKQVCQQDSGSAMNFFGIKDTERFKILVPPLPEQKKIAKIL----STWDKA 176 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 IT + + +++K+AL+ ++T + +G+ + G W V L + Sbjct: 177 ITTTEQLLANSQQQKKALMQQLLT---GRKRLLDKNGVRFSGE----WRVSKLSKLFERV 229 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND-KR 303 KN + + +I++ E + E+ + Y ++ G+ + Sbjct: 230 TTKNNGQSTNVVTISGQHGLIRQEEFFKKAVASETLDGYFLLRQGQFAYNKSYSNGYPMG 289 Query: 304 SLRSAQVMERGIITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ---- 355 +++ G++T+ Y+ DS + S L K + G R Sbjct: 290 AIKRLNRYPDGVVTTLYICFELSDSGRADSDFWEHYFESGLLNKGLSQIAHEGGRAHGLL 349 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++K D L V P +EQ I V++ I L +Q + LK+ + + + + Sbjct: 350 NVKPSDFFSLKVSTPSFEEQQKIAAVLSTADQEISAL----QQKLDALKQEKKALMQQLL 405 Query: 416 TGQIDL 421 TG+ + Sbjct: 406 TGKRRV 411 >gi|16132169|ref|NP_418768.1| specificity determinant for hsdM and hsdR [Escherichia coli str. K-12 substr. MG1655] gi|89111057|ref|AP_004837.1| specificity determinant for hsdM and hsdR [Escherichia coli str. K-12 substr. W3110] gi|238903436|ref|YP_002929232.1| specificity determinant for hsdM and hsdR [Escherichia coli BW2952] gi|331650829|ref|ZP_08351857.1| type I restriction enzyme EcoKI specificity protein (S protein)(S.EcoKI) [Escherichia coli M718] gi|135209|sp|P05719|T1SK_ECOLI RecName: Full=Type-1 restriction enzyme EcoKI specificity protein; Short=S.EcoKI; AltName: Full=Type I restriction enzyme EcoKI specificity protein; Short=S protein gi|322812244|pdb|2Y7C|A Chain A, Atomic Model Of The Ocr-Bound Methylase Complex From The Type I Restriction-Modification Enzyme Ecoki (M2s1). Based On Fitting Into Em Map 1534. gi|322812249|pdb|2Y7H|A Chain A, Atomic Model Of The Dna-Bound Methylase Complex From The Type I Restriction-Modification Enzyme Ecoki (M2s1). Based On Fitting Into Em Map 1534. gi|41746|emb|CAA23554.1| hsdS [Escherichia coli] gi|537190|gb|AAA97245.1| CG Site No. 619; alternate gene name hss [Escherichia coli str. K-12 substr. MG1655] gi|1790807|gb|AAC77304.1| specificity determinant for hsdM and hsdR [Escherichia coli str. K-12 substr. MG1655] gi|85677088|dbj|BAE78338.1| specificity determinant for hsdM and hsdR [Escherichia coli str. K12 substr. W3110] gi|238863570|gb|ACR65568.1| specificity determinant for hsdM and hsdR [Escherichia coli BW2952] gi|260450839|gb|ACX41261.1| restriction modification system DNA specificity domain protein [Escherichia coli DH1] gi|315138903|dbj|BAJ46062.1| EcoKI restriction-modification system protein HsdS [Escherichia coli DH1] gi|331051283|gb|EGI23332.1| type I restriction enzyme EcoKI specificity protein (S protein)(S.EcoKI) [Escherichia coli M718] Length = 464 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 63/420 (15%), Positives = 144/420 (34%), Gaps = 22/420 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNS 70 G +P+ W + P+ T L G T + + I ++++G Sbjct: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 Query: 71 RQSDTSTVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLV---LQPKDVLPELL 124 + + I + I+ + K+ CS K + + Sbjct: 64 KNLVKESQKISPE-DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + S +I ++ GA +++ I +PIPPLAEQ +I EK+ ++D+ Sbjct: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + ++LK +QA++ V L + + + + ++ Sbjct: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 ++ +S + + + R + ES + G+++F + + Sbjct: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLEC-SESELNRHKLQDGDLLFTRYNGSLEFVG 301 Query: 305 LR---SAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLK 358 + + + + + Y+ S + ++ + Sbjct: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +D+K VL+PP+KEQ +I + A D + +++ ++ + S +A A G+ Sbjct: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 >gi|21232335|ref|NP_638252.1| type I restriction enzyme specificity chain-like protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767532|ref|YP_242294.1| type I restriction enzyme specificity chain-like protein [Xanthomonas campestris pv. campestris str. 8004] gi|188990645|ref|YP_001902655.1| type I site-specific DNA methyltransferase specificity subunit [Xanthomonas campestris pv. campestris str. B100] gi|21114106|gb|AAM42176.1| type I restriction enzyme (specificity chain) homolog [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572864|gb|AAY48274.1| type I restriction enzyme (specificity chain) homolog [Xanthomonas campestris pv. campestris str. 8004] gi|167732405|emb|CAP50599.1| type I site-specific DNA methyltransferase specificity subunit [Xanthomonas campestris pv. campestris] Length = 415 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 51/422 (12%), Positives = 124/422 (29%), Gaps = 37/422 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P W+ + + +G T ++ D+ + + Sbjct: 2 LPDGWRRTTLGNIGSVKSGSTPARSQHDRYFVDGKWPWVKTMDLTNSEILTTDEVITDAA 61 Query: 73 SDTSTVSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 S+ +F G +L G + + + + + + + + Sbjct: 62 LAESSCRLFPAGTVLVAMYGGFKQIGRTGLLREKSAINQAISAIDIERNQADPEFVLHWL 121 Query: 131 IDVTQRIEAIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + + + + P+ +P L EQ I + I T Sbjct: 122 NGSVETWKNYAASSRKDPNITRENVCDFPVILPTLGEQRRIAHILSTWDQAIATTERLLK 181 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + + + L P K + E + L+ K Sbjct: 182 NSQKQMDILLRDLTLGTQRTTSTPSPWAKFTLGE-------------LGRTYSGLSGKKG 228 Query: 250 KLIESNILSLSYGNIIQKLETRNMGL---KPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + Y N+ + K E V G+I+F ++ + Sbjct: 229 EDFGFGAKFIPYTNVFKNNRIDIEDFSLVKISENENQTRVKSGDIIFTISSETPNEVGMA 288 Query: 307 SAQVMERG-----IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFE 360 S + + Y + Y +++R+ + + + G R ++ Sbjct: 289 SVLLDDVNELYLNSFCFGYRLNDFKTLLPEYAGFVLRAPHIRALMTQIAQGSTRFNISKA 348 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 +V R+ + +P I EQ I +++ + + L + LK + ++ +TG+ Sbjct: 349 NVMRMELALPSIAEQKRIASILGGAHSTVKNL----RDQLARLKAEKVILMSQLLTGKRR 404 Query: 421 LR 422 +R Sbjct: 405 VR 406 >gi|242399587|ref|YP_002995012.1| putative type I specificity subunit HsdS [Thermococcus sibiricus MM 739] gi|242265981|gb|ACS90663.1| putative type I specificity subunit HsdS [Thermococcus sibiricus MM 739] Length = 434 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 53/415 (12%), Positives = 135/415 (32%), Gaps = 25/415 (6%) Query: 16 QWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDV-ESGTGKYLPKD 67 W +P+ W+ V + +L G T I ++ + D+ +SG + + Sbjct: 32 PW--ELPEGWRWVRLGDIAELKAGGTPSRRVKEYWENGTIPWVKISDIPDSGLVEKTEEK 89 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 S+ + + G IL+ + + K I + + + PK + + Sbjct: 90 ITELGLKNSSAKLLSPGTILFS-IFATISKVGILKIPAATNQAIVGIIPKISIDRGYLFY 148 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L + + G + + + + + +P+PP+ EQ I K+ R++ Sbjct: 149 SLKYFGQELVYQ-GRGGVQDNINMRILSKLKIPLPPIEEQKRIVAKLDEVHRRLEEAKRL 207 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 E + + + + +K + G + P K Sbjct: 208 AREAREEAERLMASALHEVFSKAEEKGWEWTTIGKVSREMKPGFARNKK---------HI 258 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + + + + + L + + G+++F + Sbjct: 259 SRDGVPHLRPNNVDVGRLNLKKIVKVTLDDKINIEEYYLKKGDVLFNNTNSFELVGRAAI 318 Query: 308 AQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS--GLRQSLKFEDVK 363 + ++ + VK I +L + + F + + + + + Sbjct: 319 VPEDLKYGYSNHITRIRVKKEVILPEWLTLAINYLWMQGYFREVCTRWVGQAGVNMNTLA 378 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + +P ++EQ I + ++ R LV+ E+ L++ + + A G+ Sbjct: 379 KTRIPLPSLEEQKRIVSYLDSIQERAQKLVKLYEEREKELEKLFPAILDKAFRGE 433 >gi|218708015|ref|YP_002415534.1| EcoKI restriction-modification system protein HsdS [Escherichia coli UMN026] gi|293403006|ref|ZP_06647103.1| type-1 restriction enzyme EcoKI specificity protein [Escherichia coli FVEC1412] gi|298378533|ref|ZP_06988417.1| type-1 restriction enzyme EcoKI specificity protein [Escherichia coli FVEC1302] gi|300899292|ref|ZP_07117558.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 198-1] gi|301646864|ref|ZP_07246710.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 146-1] gi|218435112|emb|CAR16068.1| specificity determinant for hsdM and hsdR [Escherichia coli UMN026] gi|291429921|gb|EFF02935.1| type-1 restriction enzyme EcoKI specificity protein [Escherichia coli FVEC1412] gi|298280867|gb|EFI22368.1| type-1 restriction enzyme EcoKI specificity protein [Escherichia coli FVEC1302] gi|300357071|gb|EFJ72941.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 198-1] gi|301074917|gb|EFK89723.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 146-1] Length = 464 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 63/420 (15%), Positives = 143/420 (34%), Gaps = 22/420 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNS 70 G +P+ W + P+ T L G T + + I ++++G Sbjct: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 Query: 71 RQSDTSTVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLV---LQPKDVLPELL 124 + I + I+ + K+ CS K + + Sbjct: 64 KNLVKENQKISPE-DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + S +I ++ GA +++ I +PIPPLAEQ +I EK+ ++D+ Sbjct: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + ++LK +QA++ V L + + + + ++ Sbjct: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 ++ +S + + + R + ES + G+++F + + Sbjct: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLEC-SESELNRHKLQDGDLLFTRYNGSLEFVG 301 Query: 305 LR---SAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLK 358 + + + + + Y+ S + ++ + Sbjct: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +D+K VL+PP+KEQ +I + A D + +++ ++ + S +A A G+ Sbjct: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 >gi|119357296|ref|YP_911940.1| restriction modification system DNA specificity subunit [Chlorobium phaeobacteroides DSM 266] gi|119354645|gb|ABL65516.1| restriction modification system DNA specificity domain [Chlorobium phaeobacteroides DSM 266] Length = 479 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 65/437 (14%), Positives = 135/437 (30%), Gaps = 55/437 (12%) Query: 30 IKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + GR + + + I ++++ K+ N + KG Sbjct: 11 LGDVAEYINGRAFKPSEWGKEGLPIIRIKNLNDENSKF-----NYSNEVFEKRYLVKKGD 65 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +L+ L I + + +++P + +L + +TQ + + G+ Sbjct: 66 LLFAWS-ASLGAYIWKKDEAWLNQHIFLVKPSPFIAKL-YLYYFLDKITQELYSAAHGSG 123 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K + +PPL+EQ I KI +D I + E LK +QA++ Sbjct: 124 MVHVTKKKFEETKIGLPPLSEQRSIVSKIEQLFSELDNGIACLKKAQEQLKVYRQAVLKQ 183 Query: 206 IVTKGLNPDV------------------------------------KMKDSGIEWVGLVP 229 L + ++ + +P Sbjct: 184 AFEGELTKSWREQQANLPSAQDLLDTIKTEREQAAKNQGKKLKPVTPLAKVELDELTELP 243 Query: 230 DHWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESY--ETYQI 285 D W L + + L + + + GNI Q N + + Sbjct: 244 DGWCWIKLGELTIGVEYGTSTKSLEKGEVPVIRMGNIQQGRIDWNDLAFTDDKADISKYR 303 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLC 343 + G+++F + I + + YL + + S+ Sbjct: 304 LLKGDVLFNRTNSPELVGKAAIYNGEMPAIFAGYLIRVNQIKELLHCKYLNFFLNSHPAK 363 Query: 344 KVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 ++ + + ++ E +K P+ KEQ I I + D + I +S+ Sbjct: 364 VYGNSVKTDGVNQSNINGEKLKSYPLPYCSPKEQEQIVQEIEARLSVCDNMEATIRESLE 423 Query: 402 LLKERRSSFIAAAVTGQ 418 + R S + A G+ Sbjct: 424 KAEALRQSILKKAFEGK 440 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 63/190 (33%), Gaps = 6/190 (3%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDP 288 +H + + +N + K E L I E E +E +V Sbjct: 4 NHHVIAILGDVAEYINGRAFKPSEWGKEGLPIIRIKNLNDENSKFNYSNEVFEKRYLVKK 63 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G+++F + + + VKP + + +++ A Sbjct: 64 GDLLFAWSASLG-----AYIWKKDEAWLNQHIFLVKPSPFIAKLYLYYFLDKITQELYSA 118 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + + + +PP+ EQ I + I + +D + ++++ LK R Sbjct: 119 AHGSGMVHVTKKKFEETKIGLPPLSEQRSIVSKIEQLFSELDNGIACLKKAQEQLKVYRQ 178 Query: 409 SFIAAAVTGQ 418 + + A G+ Sbjct: 179 AVLKQAFEGE 188 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 66/204 (32%), Gaps = 11/204 (5%) Query: 18 IGAIPKHWKVVPIKRF---TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + +P W + + + T S ++ I + +++ G + ++D Sbjct: 239 LTELPDGWCWIKLGELTIGVEYGTSTKSLEKGEVPVIRMGNIQQGRIDWNDLAFTDDKAD 298 Query: 75 TSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKD----VLPELLQGW 127 S + KG +L+ + + AI +L+ + L Sbjct: 299 ISKYRLL-KGDVLFNRTNSPELVGKAAIYNGEMPAIFAGYLIRVNQIKELLHCKYLNFFL 357 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 +G S+ + + + + P+P EQ I ++I A D + Sbjct: 358 NSHPAKVYGNSVKTDGVNQSNINGEKLKSYPLPYCSPKEQEQIVQEIEARLSVCDNMEAT 417 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 +E + +Q+++ L Sbjct: 418 IRESLEKAEALRQSILKKAFEGKL 441 >gi|215486217|ref|YP_002328648.1| predicted type I restriction-modification enzyme, S subunit [Escherichia coli O127:H6 str. E2348/69] gi|215264289|emb|CAS08642.1| predicted type I restriction-modification enzyme, S subunit [Escherichia coli O127:H6 str. E2348/69] Length = 449 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 77/439 (17%), Positives = 138/439 (31%), Gaps = 35/439 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLN---TGRTSES------GKDIIYIGLEDVESGT 60 YK + ++ IP+ W V I T GRT + DI+ + +V+ G Sbjct: 18 YKLTEME---MIPEDWVVSTILNLTTNIIDYRGRTPKKLGMDWGDGDIVALSAANVKKGY 74 Query: 61 GKYLPKDGNSRQSDTS---TVSIFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQP 116 + + T KG I + P A I D I S + ++LQ Sbjct: 75 IDLSTECYFGSEELYKRWMTSGHPQKGDIAFTMEAPLGNAASIPDNKKYILSQRTILLQI 134 Query: 117 KDVL--PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREK 173 P L+ LLS I G+T + + + IP + EQ I Sbjct: 135 DRENFSPSLILQILLSERFQSYISESATGSTAQGIKRSVLEKLCISIPKNIVEQKAIANV 194 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVS---YIVTKGLNPDVKMKDSGIEWVGLVPD 230 + I +L + + Q L++ + L D K +G +P+ Sbjct: 195 LTNVDSLILSLEKLLSKKQSIKTATMQQLLTGKTRLPQFALRKDGSAKGYKKSELGAIPE 254 Query: 231 HWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGLKPES---YET 282 W V +S G + K + + Sbjct: 255 DWVVTSIGQFTDCCAGGTPSTKISAYWGGTHPWMSSGELHLKQVYAVADYITDEGLVNSS 314 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + V ++ Q R + +E S + +L + + S Sbjct: 315 TKYVPKNSVLVGLAG-QGKTRGTVAINRIELCTNQSIAAIFPSKHHSTEFLFYNLDSRYE 373 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + G G R L +++L + PP +EQ I +++ I L +Q + Sbjct: 374 ELRSLSTGDGGRGGLNLTIIRKLHLAFPPKEEQTAIATILSDMDKEIQTL----QQRLDK 429 Query: 403 LKERRSSFIAAAVTGQIDL 421 ++ + + +TG+ L Sbjct: 430 TRQLKQGMMQELLTGKTRL 448 >gi|187779696|ref|ZP_02996169.1| hypothetical protein CLOSPO_03292 [Clostridium sporogenes ATCC 15579] gi|187773321|gb|EDU37123.1| hypothetical protein CLOSPO_03292 [Clostridium sporogenes ATCC 15579] Length = 459 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 80/433 (18%), Positives = 165/433 (38%), Gaps = 28/433 (6%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG 61 + Y YK + + W+ IPKHW ++ K K+ + + L Sbjct: 10 NELRPYEDYKKTELLWLDYIPKHWNMIRNKNVMKVEKEIVGRNHSKYTLLSLTK-RGIIP 68 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYG--KLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + L D + I++ + R ++ G+ + + V + +++ Sbjct: 69 RDLENAKGKFPKDFEAYQVVNPNNIVFCLFDMDETPRTVGLSSMKGMITGSYNVFKIENI 128 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + L + LS+D ++++ ++ G + MP PP+ EQ I + + + Sbjct: 129 NEKYLYYYYLSLDNSKKLRSLYTGL-RKVIHIETFLRTKMPNPPMEEQKQIVKYLDCKLS 187 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV---------KMKDSGIEWVGLVPD 230 +I I E+ + I+LLK++K+ ++ + + + +MK SGI+WV +P+ Sbjct: 188 KIRKFIKEKKKIIDLLKQQKKVFINEAIIGKIKIENGECKVRYKSEMKPSGIQWVEEIPN 247 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 HW L + + S I + R K Y ++ Sbjct: 248 HWIKCKLKHLGKFKSGDSI--TSSQIDMKGKYPVYGGNGLRGYFDKYTHDGNYLLIGRQG 305 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + + L + A + ++ + +L+ + +L + Sbjct: 306 ALCGNVHLVKGRFWASE----------HAVVVTTNSNVNVDWAKYLIETMNLNQY---SQ 352 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 S + L E + + ++PPI+EQ I + I T +ID + I + I L+ E Sbjct: 353 SAAQPGLAIERIINIYTMLPPIEEQKKIVDYIIRITDKIDKSILHINKEISLITEYGIRL 412 Query: 411 IAAAVTGQIDLRG 423 I+ V G++D+R Sbjct: 413 ISDIVIGKVDVRN 425 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 55/216 (25%), Positives = 104/216 (48%), Gaps = 3/216 (1%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL-SLSYGNII 265 + L P K + + W+ +P HW + ++ + L SL+ II Sbjct: 8 IHNELRPYEDYKKTELLWLDYIPKHWNMIRNKNVMKVEKEIVGRNHSKYTLLSLTKRGII 67 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + G P+ +E YQ+V+P IVF D+ R++ + + +G+IT +Y K Sbjct: 68 PRDLENAKGKFPKDFEAYQVVNPNNIVFCLFDMDETPRTVGLSSM--KGMITGSYNVFKI 125 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I+ YL + S D K ++ +GLR+ + E R + PP++EQ I ++ + Sbjct: 126 ENINEKYLYYYYLSLDNSKKLRSLYTGLRKVIHIETFLRTKMPNPPMEEQKQIVKYLDCK 185 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++I +++ ++ I LLK+++ FI A+ G+I + Sbjct: 186 LSKIRKFIKEKKKIIDLLKQQKKVFINEAIIGKIKI 221 >gi|188496427|ref|ZP_03003697.1| type I restriction-modification system specificity subunit [Escherichia coli 53638] gi|188491626|gb|EDU66729.1| type I restriction-modification system specificity subunit [Escherichia coli 53638] gi|322616181|gb|EFY13097.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 315996572] gi|322620878|gb|EFY17737.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-1] gi|322623031|gb|EFY19873.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-3] gi|322634726|gb|EFY31457.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-1] gi|322638707|gb|EFY35402.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-2] gi|322646507|gb|EFY43016.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. NC_MB110209-0054] gi|322654496|gb|EFY50818.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. CASC_09SCPH15965] gi|322660785|gb|EFY57018.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 19N] gi|322665113|gb|EFY61301.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 81038-01] gi|322667857|gb|EFY64017.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. MD_MDA09249507] gi|322671731|gb|EFY67852.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 414877] gi|322677223|gb|EFY73287.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 366867] gi|322680114|gb|EFY76153.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 413180] gi|322685457|gb|EFY81453.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 446600] gi|323193666|gb|EFZ78870.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 609458-1] gi|323199973|gb|EFZ85061.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 556150-1] gi|323204704|gb|EFZ89701.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 609460] gi|323205730|gb|EFZ90693.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 507440-20] gi|323213700|gb|EFZ98483.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 556152] gi|323216765|gb|EGA01489.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. MB101509-0077] gi|323231919|gb|EGA16026.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. MB111609-0052] gi|323234446|gb|EGA18533.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 2009083312] gi|323237897|gb|EGA21956.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 2009085258] gi|323243502|gb|EGA27521.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 315731156] gi|323249499|gb|EGA33413.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2009159199] gi|323254257|gb|EGA38075.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008282] gi|323255082|gb|EGA38868.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008283] gi|323261256|gb|EGA44844.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008284] gi|323266621|gb|EGA50108.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008285] gi|323271347|gb|EGA54773.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008287] Length = 394 Score = 141 bits (354), Expect = 3e-31, Method: Composition-based stats. Identities = 72/416 (17%), Positives = 146/416 (35%), Gaps = 37/416 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W ++ + KL G + + K + I ++++ +G+G Y G + Sbjct: 2 VPKGWMLLQVSDICKLQNGNSFKPHEWDTKGLPIIRIQNL-NGSGNYNYFSGVPQD---- 56 Query: 77 TVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + GQ+L+ G I G+ + + + + E L Sbjct: 57 -KWLVEPGQLLFSWAGTKGVSFGPFIWNGPKGVLNQHIYKVFANENVHEHWLYLALLHIT 115 Query: 134 TQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + T+ H K I N + PP+AEQ I + + + I+ + + Sbjct: 116 QKIEAQAHGFKSTLLHVQKKDIDNQFVLTPPVAEQKKISQIL----STWNKAISVTEKLL 171 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 +++K+AL+ ++T + ++G+ + G W + + + K Sbjct: 172 ANSQQQKKALIQQLLT---GKKRLLDENGVRFSGE----WCTCTLSEVAHIIMGSSPKSE 224 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQV 310 N L I + + P Y + PG+I+ Sbjct: 225 AYNDNGLGLPLIQGNADIKCRVSCPRVYTSDITKECTPGDILLSVRAPVGTVA-----LS 279 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + I A+K S + + K Y +S+ +D+K L + VP Sbjct: 280 QHKACIGRGISAIKSKRKMSQSFLYQWFLWFEPKWCYLSQGSTFESINSDDIKTLKLSVP 339 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR-GES 425 +EQ I V++ I L E+ + LK + + + +TG+ ++ E+ Sbjct: 340 NFEEQQKIAAVLSAADTEISTL----EKKLACLKNEKKALMQQLLTGKRRVKVDEA 391 >gi|320450634|ref|YP_004202730.1| restriction modification system DNA specificity domain-containing protein [Thermus scotoductus SA-01] gi|320150803|gb|ADW22181.1| restriction modification system DNA specificity domain protein [Thermus scotoductus SA-01] Length = 450 Score = 140 bits (353), Expect = 3e-31, Method: Composition-based stats. Identities = 70/445 (15%), Positives = 162/445 (36%), Gaps = 43/445 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFT---------KLNTGRTSESGKDIIYIGLEDV-ESG 59 +KD+ +G +P+ W+VV + G +++G + ++ ++ ++G Sbjct: 11 FKDTE---LGPLPEEWQVVRLGDLLLKGALWMKNGFPQGEHNQAGLGVPHLRPFNITDTG 67 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDG--ICSTQF--LV 113 ++ S + G +++ + K + G + S + Sbjct: 68 DITLSQIKYVPPPAEDSPYWVL-PGDVIFNNTNSEELVGKTAYFNLKGKFVISNHMTLIR 126 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIRE 172 + + + +L + + + +C + + + + +P+P L+EQ I Sbjct: 127 VSSDQLDAYWISKYLHWLWSQRVFQGLCRRHVNQASVSIERLKQVAIPLPSLSEQRAIAH 186 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI--EWVGLVPD 230 + + R I L+E K++L+ ++ T G P + + + +G +P+ Sbjct: 187 VL----RTVQEAKQATERVIAALRELKKSLMRHLFTYGPVPLDQAESVPLRDTEIGPIPE 242 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG------LKPESYETYQ 284 HW+V +V T + + I + + + +S Sbjct: 243 HWQVVRLGEVVERPQYGYTASASDAPVGPKFLRITDIQDGKVVWPSVPFCEIAQSQVENY 302 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA---VKPHGIDSTYLAWLMRSYD 341 ++ PG+I+ I K L + I S + G+ S YL + + Sbjct: 303 LLKPGDILVARIGATTGKTFLVAECPP--AIFASYLIRLRVAPDKGLLSDYLWYFTDTEA 360 Query: 342 LCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + G L+Q + ++ L + +PP+ EQ +I V+ RI E +++ Sbjct: 361 YWAQINSNKGGRLKQGINIPILENLVIPLPPLPEQREIARVLQAVDRRI-QAEEAYARAL 419 Query: 401 VLLKERRSSFIAAAVTGQIDLRGES 425 L S + +TG++ + + Sbjct: 420 DDL---FKSLLHELMTGRLRVAPWT 441 >gi|135210|sp|P07990|T1S_SALPO RecName: Full=Type-1 restriction enzyme StySPI specificity protein; Short=S.StySPI; AltName: Full=Type I restriction enzyme StySPI specificity protein; Short=S protein gi|79033|pir||A26652 type I site-specific deoxyribonuclease (EC 3.1.21.3) - Salmonella sp gi|154135|gb|AAA27145.1| hsdS specificity protein [Salmonella enterica subsp. enterica serovar Potsdam] Length = 463 Score = 140 bits (353), Expect = 4e-31, Method: Composition-based stats. Identities = 71/423 (16%), Positives = 147/423 (34%), Gaps = 29/423 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNS 70 G +P+ W P+ T L G T + + I ++++G Sbjct: 4 GKLPEGWATAPVSTVTTLIRGVTYKKEQALNYLQDDYLPIIRANNIQNGKFDTTDLVFVP 63 Query: 71 RQSDTSTVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLV---LQPKDVLPELL 124 + + I + I+ + K+ CS K + P + Sbjct: 64 KNLVKESQKISPE-DIVIAMSSGSKSVVGKSAHQRLPFECSFGAFCGALRPEKFISPNYI 122 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + S +I ++ GA +++ I +PIP LAEQ +I EK+ ++D+ Sbjct: 123 AHFTKSSFYRNKISSLSAGANINNIKPASFDLINIPIPSLAEQKIIAEKLDTLLAQVDST 182 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + ++LK +QA+++ V+ L ++ S I W + Sbjct: 183 KARLEQIPQILKRFRQAVLAAAVSGTLTTALRNSHSLIGW-----HSTNLGALIVDSCNG 237 Query: 245 NRKNTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFR-FIDLQ 299 K L + I L + R + L + Y + + +V R Sbjct: 238 LAKRQGLNGNEITILRLADFKDAQRIIGNERKIKLDSKEENKYSLENDDILVIRVNGSAD 297 Query: 300 NDKRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAM--GSGLRQ 355 R + + ++ ++ + I S +L ++ + S + Sbjct: 298 LAGRFIEYKSNGDIEGFCDHFIRLRLDSNKIMSRFLTYIANEGEGRFYLRNSLSTSAGQN 357 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ +K L L+PP+KEQ +I + A D + +++ ++ + S +A A Sbjct: 358 TINQTSIKGLSFLLPPLKEQAEIVRRVEQLFAYADTIEKQVNNALTRVNSLTQSILAKAF 417 Query: 416 TGQ 418 G+ Sbjct: 418 RGE 420 >gi|15669726|ref|NP_248539.1| type I restriction-modification enzyme subunit S [Methanocaldococcus jannaschii DSM 2661] gi|2496187|sp|Q58926|Y1531_METJA RecName: Full=Uncharacterized protein MJ1531 gi|1592162|gb|AAB99552.1| type I restriction-modification enzyme, S subunit, putative [Methanocaldococcus jannaschii DSM 2661] Length = 425 Score = 140 bits (352), Expect = 4e-31, Method: Composition-based stats. Identities = 79/438 (18%), Positives = 167/438 (38%), Gaps = 37/438 (8%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSESGKD---IIYIGLEDVES 58 + +K + IG IP+ W+V +K + + G T++ KD +E + Sbjct: 6 QFYKEENFKKTE---IGEIPEDWEVRELKDILEVIRNGLTAKQNKDKIGYPITRIETISD 62 Query: 59 GTGKYLP--KDGNSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLV 113 + +Q D + + G IL+ + + AI + Sbjct: 63 SKIDITKLGYVEDIKQEDIAKYRLII-GDILFSHINSEEHIGKVAIYEGKPEFLLHGMNL 121 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGA-----TMSHADWKGIGNIPMPIPPLAEQV 168 L + ++ +LL + + + I + S + + ++ +P+PPL EQ Sbjct: 122 LLLRPNKNKIEPYYLLYLLRHFKQKNIFKYIAKRAVNQSSINQTQLKHLKIPLPPLEEQK 181 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + + I + IE+L + K+ ++ + TKG+ K S I G + Sbjct: 182 QIAKILSDFDNL----IGTINKQIEVLNKAKKGMMKKLFTKGVFEHKSFKKSEI---GEI 234 Query: 229 PDHWEVKPFFA--LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI- 285 P+ WEV + ++ N + K E N+ P Y + Sbjct: 235 PEDWEVVELGNEKYFKIIMGQSPPSSSYNKEGEGVPFLQGKAEFGNIYPNPVLYTNKPLK 294 Query: 286 -VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 VD +I+ D + RG+ + +D+ ++ + + SY K Sbjct: 295 VVDDEDILISVRAPVGDVNIAPFKLCIGRGLAG---IKSNKEKVDNFFVFYYL-SYIKPK 350 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + Y G + +++ +D++ + + +PP++EQ I + ID L+E + ++ Sbjct: 351 IEYLGGGAVFKAITKKDLESIKIPLPPLEEQKAIAKRLKA----IDDLIEIKRKEKEQIE 406 Query: 405 ERRSSFIAAAVTGQIDLR 422 + + + +TG+I ++ Sbjct: 407 KAKKKIMNLLLTGKIRVK 424 >gi|289208800|ref|YP_003460866.1| restriction modification system DNA specificity domain protein [Thioalkalivibrio sp. K90mix] gi|288944431|gb|ADC72130.1| restriction modification system DNA specificity domain protein [Thioalkalivibrio sp. K90mix] Length = 419 Score = 140 bits (352), Expect = 5e-31, Method: Composition-based stats. Identities = 54/411 (13%), Positives = 129/411 (31%), Gaps = 27/411 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTS----------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 + WK + + G+T E +++ + D+ + Sbjct: 3 EGWKTAKLSELCDIQLGKTPARANSSYWDQERSTGNVWLSIADLLKSEANNVSDSKEYLS 62 Query: 73 SDTSTV-SIFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLL 129 + + I KG +L L + A D + L + + ++ + L Sbjct: 63 DKGAKLCKIVKKGTLLVS-FKLTLGRVAFAGKDLYTNEAIAALTIHDEQIINRDYLFYFL 121 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + I + +PPL EQ I + IDT + Sbjct: 122 HFFDWVKAAQDDVKLKGMTLNKAKLKEILVVVPPLPEQKRIVAILDEAFASIDTAVANTE 181 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + + +E ++ ++ +V S + DH + V + KN Sbjct: 182 KNLANARELFESYLNAVVDTAFRKSTVTVLSDLAEEITDGDHMPPPKAPSGVPFITIKNI 241 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + + + E + + G++++ + Sbjct: 242 DKRTRKVDFENTFRVPRSY--------FEGLKPNKRPRKGDVLYTVTGSFGIPVVVG--- 290 Query: 310 VMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 ++P DS++L +L+ S + +G ++++ + ++ V Sbjct: 291 QKTEFCFQRHIGLIRPKSGTDSSWLYYLLMSPQIFAQATDGATGTAQKTVSLKVLRSFRV 350 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P+ +Q D ++ A ++ L Q + L E + S + A +G+ Sbjct: 351 PTIPLDQQVDNVQQLDNLLADVEGLESIYRQQLRNLGELKQSLLQKAFSGE 401 >gi|257900170|ref|ZP_05679823.1| predicted protein [Enterococcus faecium Com15] gi|257838082|gb|EEV63156.1| predicted protein [Enterococcus faecium Com15] Length = 424 Score = 139 bits (351), Expect = 6e-31, Method: Composition-based stats. Identities = 63/404 (15%), Positives = 137/404 (33%), Gaps = 24/404 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + W+ + T+ +G T +GK DI +I ++ S + + S Sbjct: 29 EDWEQRKLGDITESFSGGTPTAGKSEYYGGDIPFIRSGEISS---ELTELFITENGLNNS 85 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + G ILY G + I+ +G + L ++P L L Sbjct: 86 SAKMVKAGDILYALYGATSGEVSISRINGAINQAILAIRPTKNDNSYLIVQWLRKQKDTI 145 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I +G + + ++ + +P E+ KI A ++D I R ++LLK Sbjct: 146 ISTYLQG-GQGNLSGSIVKDLVITLPQDKEEQ---NKIGAFFKQLDDTIALHQRKLDLLK 201 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 E K+ + + K +++ G + + K E Sbjct: 202 ETKKGFLQKMFPKNGAKVPEVRFPGFTEDWEQRKLNDFISGDISDGDWIEKEHIKDEGKY 261 Query: 257 LSLSYGNIIQK-LETRNMGLKPESYETYQIVD-----PGEIVFRFIDLQNDKRSLRSAQV 310 + GNI + K ++ I+ PG+++ + + + Sbjct: 262 RIIQTGNIGNGVYIDKEKSAKYMDQNSFDILKANEIFPGDLLVSRLAEPAGRTVILPNIE 321 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 + + D+ +L M + + SG + + ++++++ + Sbjct: 322 DRMVTAVDVAILRQNENFDAYFLLSQMNTSKILNKVSKNVSGTSHKRISRKNLEKVTIDS 381 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I+EQ I ++D + ++ + LLKE + F+ Sbjct: 382 TSIEEQNKIGAF----FKQLDDTIALHQRKLDLLKETKKGFLQK 421 >gi|160902532|ref|YP_001568113.1| restriction modification system DNA specificity subunit [Petrotoga mobilis SJ95] gi|160360176|gb|ABX31790.1| restriction modification system DNA specificity domain [Petrotoga mobilis SJ95] Length = 429 Score = 139 bits (351), Expect = 6e-31, Method: Composition-based stats. Identities = 63/430 (14%), Positives = 148/430 (34%), Gaps = 38/430 (8%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 V+ + +P+ WK+ +K GR + +++ E + + N + Sbjct: 3 EVKEMEKLPEGWKISSVKDLFIDGRGRVISEEE------IKNKEGIYPVFSSQTKNKGEL 56 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 F I + G + C+ L+ ++ L +V Sbjct: 57 GKINTYDFEGEYITWTTDGANAGTVFYRNGRFNCTNVCGTLEARNKEVCSKYFAYLLSNV 116 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ + + + + + KI +D I + + IE Sbjct: 117 LKKYVSYIGNPKLMN------NVVRGIKLVHPANYFAQCKIAEIIKTVDNAIEKTDKIIE 170 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEW-----VGLVPDHWEVKPFFALVTELNR-- 246 K KQ L+ ++TKG++ + +++ G +G +P+ WEV + Sbjct: 171 KYKRIKQGLMQDLLTKGIDENGQIRSEGTHRFKDSPLGRIPEEWEVVELIKGLGNNPSLI 230 Query: 247 ---------KNTKLIESNILSLSYGNII-QKLETRNMGLKPESYET---YQIVDPGEIVF 293 K E I L N+ K +++ E Y + G+IV Sbjct: 231 VAGPFGSSLKVEDYKEIGIPILRLQNVDENKFIDKDIKFITEKKAKALSYHSFEEGDIVL 290 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-S 351 + + K + + ++ + V P + ++++++ K A Sbjct: 291 AKLGMPVGKACIVPEKYKYGIVVADVVRIRVSPKFANKEFISYILNYSICRKQLNAYIIG 350 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 R + ++ + + +P + EQ I +++ ++ID +EK ++ L+ + + Sbjct: 351 TTRPRVNLTQIRNILIPLPSLPEQHRIASIL----SQIDETIEKEQRYKEKLERIKQGLM 406 Query: 412 AAAVTGQIDL 421 +TG++ + Sbjct: 407 EDLLTGKVRV 416 >gi|253732461|ref|ZP_04866626.1| possible type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus USA300_TCH959] gi|253723851|gb|EES92580.1| possible type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus USA300_TCH959] Length = 409 Score = 139 bits (349), Expect = 9e-31, Method: Composition-based stats. Identities = 46/400 (11%), Positives = 123/400 (30%), Gaps = 25/400 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + +G T K DI +I D+ + + + + + S+ Sbjct: 20 EWEEKQLGEVGTFTSGGTPLKSKSEYWNGDIPWITTGDIHNIKRENITNFITEKGLNESS 79 Query: 78 VSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + IL G + I +F+ + + Q + + + + Sbjct: 80 AKLITNEAILIAMYGQGKTRGMSAILNFEATTNQACAIYQTNQNIN---FVFQYFQKLYK 136 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + ++ + + + I + P EQ I +I+ + + Sbjct: 137 FLRSLSNEGSQKNLSLSLLKEITLNYPNEQEQKKIGVFFSKLDRQIELEEQKLELLQQQK 196 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K Q + S + D + +G + + ++ ++ Sbjct: 197 KGYMQKIFSQELRFKDENGEDYPDWKEKKLGDITE-------QSMYGIGASATRFDSKNI 249 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL-RSAQVMERG 314 + ++ + + P+ + +I+F K + + + + Sbjct: 250 YIRITDIDEKSRKLNYQNLTTPDEVNNKYKLKRNDILFARTGASTGKSYIHKEEKDIYNY 309 Query: 315 IITSAYMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + + +S + S V + + E+ +LP+++P Sbjct: 310 YFAGFLIKFEINEQNSPLFIYQFTLTSKFNKWVKVMSVRSGQPGINSEEYAKLPLVLPNK 369 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ I ++ R D +E +Q I +L++++ + Sbjct: 370 LEQQKIAEFLD----RFDQQIELEKQKIEILQQQKKGLLQ 405 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 66/190 (34%), Gaps = 11/190 (5%) Query: 24 HWKVVPIKRFTKLNT---GRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 WK + T+ + G ++ IYI + D++ + K ++ + + Sbjct: 220 DWKEKKLGDITEQSMYGIGASATRFDSKNIYIRITDIDEKSRKLNYQNLTTPDEVNNKYK 279 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL------VLQPKDVLPELLQGWLLSIDV 133 + + IL+ + G K+ I + + + P + + L+ Sbjct: 280 L-KRNDILFARTGASTGKSYIHKEEKDIYNYYFAGFLIKFEINEQNSPLFIYQFTLTSKF 338 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ + + + + +P+ +P EQ I E + +I+ + + Sbjct: 339 NKWVKVMSVRSGQPGINSEEYAKLPLVLPNKLEQQKIAEFLDRFDQQIELEKQKIEILQQ 398 Query: 194 LLKEKKQALV 203 K Q++ Sbjct: 399 QKKGLLQSMF 408 >gi|161617829|ref|YP_001591794.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161367193|gb|ABX70961.1| hypothetical protein SPAB_05693 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 467 Score = 139 bits (349), Expect = 1e-30, Method: Composition-based stats. Identities = 67/422 (15%), Positives = 143/422 (33%), Gaps = 23/422 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLP-KDGNS 70 G +P+ W I ++ G+ GK + YI + D E+G+ K +S Sbjct: 4 GKLPEEWVKTTIGVICEVKGGKRLPKGKALLNTATEHPYIRVTDFENGSVNLSTIKYLDS 63 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVLPELLQGW 127 + +K + G I + + + + L+ Sbjct: 64 DTYSAISNYTISKNDLYISIAGTIGLIGEIPEQLDNANLTENAAKLCFILGTDKKYLKHV 123 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S ++ + + I + P P+ EQ +I EK+ ++D+ Sbjct: 124 LSSNKTIEQFDDKTTSSGQPKLALFRIRDCEFPYAPINEQKIIAEKLDTLLAQVDSTKAR 183 Query: 188 RIRFIELLKEKKQALVSYIVTKGL----NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 + ++LK +QA+++ V+ L + S +W +P W V + LV Sbjct: 184 LEQIPQILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQW-PDLPSTWSVHKYSELVDS 242 Query: 244 LNRKNTKLIESNILSLSYGNIIQ------KLETRNMGLKPESYETYQIVDPGEIVFRFID 297 K ++ + Y I LE L + + G+++ Sbjct: 243 RLGKMLDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLGDVLICEGG 302 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQS 356 Q + + + A I +L + +++ + + + Sbjct: 303 EPGRCAIWSEPQDIPVIFQKALHRARVKDKIIPEWLVYNLKNDSNNISLSQLFTGTTIKH 362 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L + + P+ VPP++EQ +I + A D + +++ ++ + S +A A Sbjct: 363 LTGKALANYPIRVPPLEEQHEIVRRVEQLFAYADTIEKQVNNALTRVNSLTQSILAKAFR 422 Query: 417 GQ 418 G+ Sbjct: 423 GE 424 >gi|319954803|ref|YP_004166070.1| restriction modification system DNA specificity domain protein [Cellulophaga algicola DSM 14237] gi|319423463|gb|ADV50572.1| restriction modification system DNA specificity domain protein [Cellulophaga algicola DSM 14237] Length = 391 Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats. Identities = 66/417 (15%), Positives = 135/417 (32%), Gaps = 38/417 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLP 65 YK++ IG IP W+V K + GR K I + L+++ G + Sbjct: 7 YKNTE---IGIIPDEWEVKKQKEIVRYINGRAYSLHEWEKKGIPVVRLQNLTKKGGNFYY 63 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + N KG +++ I I L+ K Sbjct: 64 SNLNLPD-----YQYMNKGDLIF-MWSASFGPYIWWGNKAIFHYHIWKLECKKGKAVKDF 117 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + +++T+ ++ G+TM H + N + +PPL EQ I + + TL Sbjct: 118 YYFKLLEITEELKKGTSGSTMLHLTKGFMENYLISVPPLPEQTAIANVLSDTDNLLQTLE 177 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + + + Q L+ +G K + I P Sbjct: 178 KKIAKKRLIKQGAMQELLKP--KEGWVVKSLGKVADIATGTTPPTRDLE----------- 224 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + +S + + + L + + +I I+ I K + Sbjct: 225 ---NYGNQFCFVSPADLGKEKYITKTVKNLSKKGFSVSRIFPKNSIMVTCIGSTIGKIGI 281 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 S + I + + P+ + + S + K+ + + + Sbjct: 282 ASKVLTSNQQINAIF----PNENFDSEFVYYHLSLNAKKIRLMASEQAVPMINKSEFSEV 337 Query: 366 PVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + +P +K EQ I +++ I+ L E+ + K+ + + +TG+I L Sbjct: 338 KINIPLLKSEQTKIATILSDMDTEIESL----EKQLSKYKQVKQGLMQNLLTGKIRL 390 >gi|300837085|ref|YP_003754139.1| putative type I restriction-modification system specificity determinant [Klebsiella pneumoniae] gi|299474889|gb|ADJ18713.1| putative type I restriction-modification system specificity determinant [Klebsiella pneumoniae] Length = 424 Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats. Identities = 60/420 (14%), Positives = 138/420 (32%), Gaps = 28/420 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G +P W+ + +++ + + + + S G + + + Sbjct: 18 LGMLPTGWQKLSLEKCLNIEARKAYIQDNQEYDL-VTVKRSRGGVIRREHLKGKDISVKS 76 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKD-VLPELLQGWLLSIDV 133 +G L K + + I S ++ VL K ++ S+ Sbjct: 77 QFYIKEGDFLISKRQIVHGACGLVPKELSGSIVSNEYCVLTGKSGFYLPYMEFLSESLYF 136 Query: 134 TQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 Q G + P IPPL+EQ I + + D I+ + Sbjct: 137 QQTCFHSSIGVHIEKMIFKLDSWFKWPFNIPPLSEQKRIVKIL----STWDKAISVTEKL 192 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--- 248 + +++K+AL+ +VT + ++G+ + G W+ A+ + Sbjct: 193 LANSQQQKKALMQQLVT---GKKRLLDENGVRFSGE----WKRVKLGAIADINSGGTPKS 245 Query: 249 --TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + NI +S ++ + K + + Sbjct: 246 TVEEYYGGNIPWVSISDMTSNGKWIATTEKYLTELGLNSSSARIYPKNSVLYAMYASIGE 305 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + + A + ++P + + + K+ G + +L VK Sbjct: 306 CSIAAVNLTSSQAILGIRPKDCLNYEFLYFYLTSLKEKIKLQGQQGTQSNLNAGMVKEFE 365 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR-GES 425 + +P I+EQ I V++ A I L E+ + L++ + + + +TG+ ++ E+ Sbjct: 366 LDLPSIREQQKIAAVLSAADAEISTL----EKKLACLRDEKKALMQQLLTGKRRVKVDEA 421 >gi|288817339|ref|YP_003431686.1| restriction endonuclease S subunit [Hydrogenobacter thermophilus TK-6] gi|288786738|dbj|BAI68485.1| restriction endonuclease S subunit [Hydrogenobacter thermophilus TK-6] gi|308750946|gb|ADO44429.1| restriction modification system DNA specificity domain protein [Hydrogenobacter thermophilus TK-6] Length = 426 Score = 138 bits (348), Expect = 1e-30, Method: Composition-based stats. Identities = 68/426 (15%), Positives = 145/426 (34%), Gaps = 39/426 (9%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ WK V + + L G + + +E + G + + Sbjct: 6 KLPEGWKKVKLGEVIEKLRNGYVYSFSEIRKEGLPITRIETISEGKIDKSKLGYITEELR 65 Query: 75 TS-TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ------PKDVLPELLQGW 127 +G IL+ + A +DG T + K +L Sbjct: 66 YKVNKYQMQRGDILFSHINSIEHIGKCAIYDGSIPTLIHGMNLLLLRTKKHILDPFFLIN 125 Query: 128 LLSIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L + + G + +P+PPL EQ I E + +D I Sbjct: 126 FLKKEDIRSRLRNLSGQAVNQVSIKPSELAKFAIPLPPLPEQQKIAEIL----ETVDRAI 181 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLN--------PDVKMKDSGIEWVGLVPDHWEVKPF 237 + + IE K KQ L+ ++TKG++ K KDS I G +P+ WEV Sbjct: 182 EKTDKIIEKYKRIKQGLMQDLLTKGIDENGKIRSEKTHKFKDSPI---GRIPEEWEVVRL 238 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRF 295 + + ++ + N + E + P ++ I I+ Sbjct: 239 GEVCFIIMGQSPSSVLINKKEKGIPFLQGNAEFTSKYPNPINWIEKPLKIAKKESILLSV 298 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + RG+ + + + ++ + ++ + + + + Sbjct: 299 RAPVGALNIANREYCIGRGLCS---IVTNKSITHNLFIWYYLQ-FSINNLINLSQGSTFE 354 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ ++K + +PP+ EQ I ++ ++ID ++EK + L+ + + + Sbjct: 355 AISSRELKNYSIPLPPLTEQQRIAEIL----SQIDNVIEKEQAYRQKLERIKKGLMEDLL 410 Query: 416 TGQIDL 421 TG++ + Sbjct: 411 TGKVRV 416 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 74/206 (35%), Gaps = 16/206 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGK 62 ++KDS IG IP+ W+VV + + G++ + K I ++ G + Sbjct: 220 KFKDSP---IGRIPEEWEVVRLGEVCFIIMGQSPSSVLINKKEKGIPFL------QGNAE 270 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + K N + I K IL P IA+ + + + Sbjct: 271 FTSKYPNPINWIEKPLKIAKKESILLSVRAPV-GALNIANREYCIGRGLCSIVTNKSITH 329 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L W + + +G+T + + N +P+PPL EQ I E + I+ Sbjct: 330 NLFIWYYLQFSINNLINLSQGSTFEAISSRELKNYSIPLPPLTEQQRIAEILSQIDNVIE 389 Query: 183 TLITERIRFIELLKEKKQALVSYIVT 208 R + + K + L++ V Sbjct: 390 KEQAYRQKLERIKKGLMEDLLTGKVR 415 >gi|255523605|ref|ZP_05390572.1| restriction modification system DNA specificity domain protein [Clostridium carboxidivorans P7] gi|255512660|gb|EET88933.1| restriction modification system DNA specificity domain protein [Clostridium carboxidivorans P7] Length = 447 Score = 138 bits (347), Expect = 2e-30, Method: Composition-based stats. Identities = 55/409 (13%), Positives = 129/409 (31%), Gaps = 17/409 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPK---DGN 69 +P++W + I R ++ +G T +S D+ +I D+ + Y+ + + Sbjct: 23 EVPENWVWIEIGRVIEVVSGGTPKSNVSDYYENGDVAWITPADLSGYSNIYISRGKRNIT 82 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + S+ + K +L P IA + + F P V + Sbjct: 83 KLGLEKSSAKLMPKNSVLMSSRAPI-GYVAIAKNEISTNQGFKNFLPSPVYL-PKYLYFY 140 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 IE G T +P PI PL EQ I ++I + ++D Sbjct: 141 LKYSKDLIETYASGTTFLEISGAKAKLLPFPIAPLKEQQRIVDRIESLFEKLDKAKELIE 200 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 E +++K A++ L + + + ++ N++ Sbjct: 201 EAREEFEKRKSAILEKAFRGELTEKWRDDTKINSFKDTKFEELFAFIGGGTPSKANKEYW 260 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + N + + + + GEI+ Sbjct: 261 NGEINWASVKDIKNNYLYDTIDKITEEGVKNSSTNVAKNGEIILVTRISPGKVTI----A 316 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + I + + + Y + + + ++ ++ + + Sbjct: 317 QKDIAINQDLKIVRPKIEEIDYKYMYYLFLYKEKDLISKSQGTTVKGITINELNKIQISL 376 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P ++EQ +I +++ + +E++ Q ++ + S +A A G+ Sbjct: 377 PVLEEQKEIVRILDKLLEE-ESKIEELTQLEDQIELVKKSILAKAFRGE 424 >gi|311747175|ref|ZP_07720960.1| ribosomal protein L10 [Algoriphagus sp. PR1] gi|126578884|gb|EAZ83048.1| ribosomal protein L10 [Algoriphagus sp. PR1] Length = 384 Score = 138 bits (347), Expect = 2e-30, Method: Composition-based stats. Identities = 61/400 (15%), Positives = 134/400 (33%), Gaps = 34/400 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W P+K+ L G S +GKY N + + Sbjct: 17 KDWVEKPLKQIAPLQRGFDLPST-----------HLASGKYPVVYSNGI-GNYHNKYMVK 64 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 I+ G+ G + + +T V + + L I + E Sbjct: 65 APGIVTGRSGTIGKVMYLGKDFWPHNTSLWVTNFHGNDTKFIYYLYLFIGL----ERFST 120 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+ + + + + + +PP EQ I + + I L + + + K QAL Sbjct: 121 GSGVPTLNRNDVHDFRVSLPPFHEQQGIAQVLSDTDKLIKFLEKKIEKKKLIKKGVMQAL 180 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 ++ + ++ +G + + ++ F + K+ + + Sbjct: 181 LT-----------PKEGWEVKKLGEIANITKLAGFEYSNYFNSYKDRGEVIVLRGTNITA 229 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N + + + + K + + G++VF ++ ++ G TS Sbjct: 230 NKLDLSDIKTIPRKTSDFLKRSKLYCGDLVFAYVGTIGPVYLVKENNRFHLGPNTS--KI 287 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + ++S +L S+ + S G + SL ++ + +P ++EQ +I V Sbjct: 288 SASNLLNSEFLFHYFTSWYIQDEIVEHTSIGAQPSLSMSKIRSFNINLPNLEEQVEIARV 347 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + I L + +++ R + +TG+I L Sbjct: 348 LTAFDNEIKDLTKLLQK----YGHLRQGMMQQLLTGKIRL 383 >gi|309702142|emb|CBJ01457.1| putative restriction-modification DNA specificity domain protein [Escherichia coli ETEC H10407] Length = 413 Score = 138 bits (347), Expect = 2e-30, Method: Composition-based stats. Identities = 70/424 (16%), Positives = 149/424 (35%), Gaps = 35/424 (8%) Query: 22 PKHWKVVPIKRFTK--LNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK W + + ++ G I + + D+ + S +++ + Sbjct: 2 PKGWNSWILSDICRKQISYGIVQTGDNLPNGIPCLRVVDLTRDVMRLEDMIKTSEETNKA 61 Query: 77 -TVSIFAKGQILYGKLGPYLRKAIIADFDGICST----QFLVLQPKDVLPELLQGWLLSI 131 +I K +I+ G +I D + + + K VLPE L L S Sbjct: 62 YRKTILEKDEIVMALRGEIGLARLIDDNLVGANITRGLARISPETKVVLPEFLLWELRSP 121 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + G+ + + + +PPL EQ I + + D I+ + Sbjct: 122 QFRADLIRRVGGSALQEISLTELRKVRTLLPPLLEQKKIAQIL----STWDKAISVTEKL 177 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + + +K+AL+ ++T K E + W++ L + KN Sbjct: 178 LTNSQRQKKALMQQLLT-------GKKRLLDENGTRFSETWKLYALSKLFQRVTTKNNGK 230 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND-KRSLRSAQV 310 + + +I++ + + ++ + Y ++ G+ + +++ Sbjct: 231 SNNVVTISGQHGLIKQEDFFKKTVASDTLDGYFLLKKGQFAYNKSYSNGYPMGAIKRLNR 290 Query: 311 MERGIITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDV 362 G++T+ Y+ P Y S L + G R ++K D Sbjct: 291 YPEGVVTTLYICFELTTPKKSCGDYWEHYFESGLLNNSLSQIAHEGGRAHGLLNVKPSDF 350 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 L V VP +EQ I +V++ I L E+ + LK+ + + + +TG+ ++ Sbjct: 351 FSLKVAVPGFEEQQKIASVLSAADTEISTL----EKKLACLKDEKKALMQQLLTGKRRVK 406 Query: 423 -GES 425 E+ Sbjct: 407 VDEA 410 >gi|153808175|ref|ZP_01960843.1| hypothetical protein BACCAC_02461 [Bacteroides caccae ATCC 43185] gi|149129078|gb|EDM20294.1| hypothetical protein BACCAC_02461 [Bacteroides caccae ATCC 43185] Length = 473 Score = 138 bits (347), Expect = 2e-30, Method: Composition-based stats. Identities = 70/408 (17%), Positives = 127/408 (31%), Gaps = 35/408 (8%) Query: 20 AIPKHWKVVPIKRFTKL---NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W + + N + D + LED+E T + ++ Sbjct: 70 EVPDSWTWTTLGEISNYGDCNNVSIIDIATDEWILELEDLEKDTASIIQMLSKKERNIKG 129 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQ 135 F KG +LY KL YL K ++A G C+T+ + + + S Sbjct: 130 VRHKFDKGDVLYSKLRTYLNKVLVAPKTGYCTTEIIPFNSYCGISNFYLCHVLRSAYFLD 189 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + G M +P+PP+AEQ I +I ID + +++ + Sbjct: 190 YTQQCGYGVKMPRLSTNDACKGMIPLPPIAEQQRIVVEIEKWFALIDQVEQDKVDLQTTI 249 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWV-------------------GLVPDHWEVKP 236 K+ K ++ + L P IE + +P W Sbjct: 250 KQTKSKILDLAIHGKLVPQDPNDKPAIELLKRINPDFTPCDNGHYPNFPFDIPKKWNWVT 309 Query: 237 FFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD---P 288 + + N +I L G++ T E V Sbjct: 310 LGEIGKWQSGSTPSRLNKDYYNGDIPWLKTGDLNDGYITHIPEYITEKALNETSVKLNPT 369 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G I+ K + + A A + + + Sbjct: 370 GSILMAMYGATIGKLGI----LTYPATTNQACCACEIYTGIEKEFLFYFLLSHRADFIKL 425 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 G G + ++ E + + +PP +EQ I N +N A++DV++E + Sbjct: 426 GGGGAQPNISKEKIINTYIPLPPSEEQKRIVNAVNDVFAQLDVIMESL 473 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 54/138 (39%), Gaps = 6/138 (4%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYD 341 D G++++ + +K + + G T+ + + I + YL ++RS Sbjct: 131 RHKFDKGDVLYSKLRTYLNKVLVAP----KTGYCTTEIIPFNSYCGISNFYLCHVLRSAY 186 Query: 342 LCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 G G+ L D + + +PPI EQ I I A ID + + Sbjct: 187 FLDYTQQCGYGVKMPRLSTNDACKGMIPLPPIAEQQRIVVEIEKWFALIDQVEQDKVDLQ 246 Query: 401 VLLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 247 TTIKQTKSKILDLAIHGK 264 >gi|15669403|ref|NP_248213.1| type I restriction-modification enzyme 1 subunit S [Methanocaldococcus jannaschii DSM 2661] gi|2496161|sp|Q58615|Y1218_METJA RecName: Full=Uncharacterized protein MJ1218 gi|1591847|gb|AAB99219.1| type I restriction-modification enzyme 1, S subunit [Methanocaldococcus jannaschii DSM 2661] Length = 425 Score = 138 bits (347), Expect = 2e-30, Method: Composition-based stats. Identities = 84/421 (19%), Positives = 157/421 (37%), Gaps = 37/421 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+VV I F K G+ E Y+ E + G K N Sbjct: 21 VPEDWEVVRIGDFIKYIKGKKPAVMVDEELEGYYPYLSTEYLRDGIASKFVKITNKEI-- 78 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 I + IL G + + GI S+ + L+ K+ + + L + Sbjct: 79 -----IVNENDILLLWDGSNAGEIFLGKK-GILSSTMVKLEQKNKIMDDLYLFYSLKLKE 132 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +++ +G + H D K NI +P+PPL EQ I + + I + IE+ Sbjct: 133 SFLKSQTKGTGIPHVDKKIFENIKIPLPPLEEQKQIAKILSDFDNL----IGTINKQIEV 188 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L + K+ ++ + TKG+ K S I G +P+ WEV +V + K K E Sbjct: 189 LNKAKKGMMKKLFTKGVFEHKSFKKSEI---GEIPEDWEVVKLKEVVDIQSGKYFKYSEF 245 Query: 255 NILSLS----YGNIIQKLETRNMGLKPESYETYQ---IVDPGEIVF--RFIDLQNDKRSL 305 + K+ + PE Y ++ G+IV + + Sbjct: 246 CENGVKCLKIDNVGFGKIFWETVSFLPEDYLNKYPQLVLKSGDIVLALNRPIIGGKIKIG 305 Query: 306 RSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDV 362 + E I+ K ID +L +L+ S K + G + ++ + Sbjct: 306 ILKDIDEPAILYQRVGRFIFKSEKIDKQFLFYLLMSEYFKKELSKLLIGTDQPYIRTPVL 365 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + + +P ++EQ + + ID L+E + +++ + + +TG+I ++ Sbjct: 366 LNIKIPLPHLEEQKAMAERL----KSIDNLIEIKRKEKEQIEKAKKKIMNLLLTGKIRVK 421 Query: 423 G 423 Sbjct: 422 N 422 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 82/211 (38%), Gaps = 20/211 (9%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYL 64 +K S IG IP+ W+VV +K + +G+ + + + + +++V G GK Sbjct: 210 SFKKSE---IGEIPEDWEVVKLKEVVDIQSGKYFKYSEFCENGVKCLKIDNV--GFGKIF 264 Query: 65 PKDGNSRQSDTSTVS---IFAKGQILYGKLGPYLRKAIIA------DFDGICSTQF--LV 113 + + D + G I+ P + I D I + + Sbjct: 265 WETVSFLPEDYLNKYPQLVLKSGDIVLALNRPIIGGKIKIGILKDIDEPAILYQRVGRFI 324 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + + + L L+S + + + G + + NI +P+P L EQ + E+ Sbjct: 325 FKSEKIDKQFLFYLLMSEYFKKELSKLLIGTDQPYIRTPVLLNIKIPLPHLEEQKAMAER 384 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + I+ E+ + + K+ L++ Sbjct: 385 LKSIDNLIEIKRKEKEQIEKAKKKIMNLLLT 415 >gi|259910155|ref|YP_002650511.1| putative type I restriction-modification system specificity subunit [Erwinia pyrifoliae Ep1/96] gi|224965777|emb|CAX57309.1| putative type I restriction-modification system specificity subunit [Erwinia pyrifoliae Ep1/96] gi|283480261|emb|CAY76177.1| type I restriction-modification system specificity subunit [Erwinia pyrifoliae DSM 12163] Length = 300 Score = 138 bits (346), Expect = 2e-30, Method: Composition-based stats. Identities = 72/277 (25%), Positives = 121/277 (43%), Gaps = 10/277 (3%) Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 + EQ I + +T ID I + + I LLKE+KQ ++ VT+GL+ Sbjct: 25 QDFQICYPADIKEQERIIYFLEKKTSEIDEAIAIKEKQISLLKERKQIIIQKAVTQGLDA 84 Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG----NIIQKLE 269 +V KDSG+ W+G +P+HWE++ L T+ K + +YG + L Sbjct: 85 NVPRKDSGVSWIGKIPEHWEIRRSKFLFTQRKEKALNDDVQLSATQAYGVIPQEKYEALT 144 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + + + + V+ + V Q L A I +S + ID Sbjct: 145 GKRVVKIQFHLDKRKHVEKDDFVISMRSFQG---GLERAWSCG-CIRSSYVVLKALQTID 200 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + +L++ S +R Q L F++ R+ + +PP++EQ I N + Sbjct: 201 PLFYGYLLKLPSYIAALQQTASFIRDGQDLNFDNFSRVDLFIPPLEEQTAIANYVESFLT 260 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 D + IEQ I LKE +++ I +AVTG+I + E Sbjct: 261 SSDEAMNLIEQQIEKLKEYKTTLINSAVTGKIKITPE 297 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 71/211 (33%), Gaps = 5/211 (2%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV--ESGTGKYLPKDG 68 KDSGV WIG IP+HW++ K + + V + K Sbjct: 89 KDSGVSWIGKIPEHWEIRRSKFLFTQRKEKALNDDVQLSATQAYGVIPQEKYEALTGKRV 148 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 Q K + + + A G + ++VL+ + L G+L Sbjct: 149 VKIQFHLDKRKHVEKDDFVIS-MRSFQGGLERAWSCGCIRSSYVVLKALQTIDPLFYGYL 207 Query: 129 LSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L + ++ + + IPPL EQ I + + D + Sbjct: 208 LKLPSYIAALQQTASFIRDGQDLNFDNFSRVDLFIPPLEEQTAIANYVESFLTSSDEAMN 267 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 + IE LKE K L++ VT + +M Sbjct: 268 LIEQQIEKLKEYKTTLINSAVTGKIKITPEM 298 >gi|323937173|gb|EGB33453.1| type I restriction modification DNA specificity domain-containing protein [Escherichia coli E1520] Length = 417 Score = 138 bits (346), Expect = 2e-30, Method: Composition-based stats. Identities = 68/430 (15%), Positives = 149/430 (34%), Gaps = 42/430 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS- 79 +PK W + G + + + V G K L + ++ + +V+ Sbjct: 2 VPKGWSSSQLGEIMSFKNGLNFTKTDNGDSVKI--VGVGDFKDLSELSSTEHLELISVAG 59 Query: 80 ------IFAKGQILYGKLGPYLRKAIIADFD------GICSTQFLV--LQPKDVLPELLQ 125 + G +L+ + F S + + + LP + Sbjct: 60 RIRDEELLNNGDLLFVRSNGNKDLIGRCMFFPEVRERLSFSGFTIRGRVINESTLPAYMA 119 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 S +I G +S+ + + +I + +PPL EQ I E + D I Sbjct: 120 IVARSSQFQMQISKASGGTNISNLSQQILNDINLLLPPLIEQKKIAEIL----STWDKAI 175 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + + + +K+AL+ ++T K E + W++ L + Sbjct: 176 SVTEKLLTNSQLQKKALMQQLLT-------GKKRLLDENGTRFSETWKLYALSKLFQRVT 228 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND-KRS 304 KN + + +I++ + + ++ + Y ++ G+ + + Sbjct: 229 TKNNGKSNNVVTISGQHGLIKQEDFFKKTVASDTLDGYFLLKKGQFAYNKSYSNGYPMGA 288 Query: 305 LRSAQVMERGIITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----S 356 ++ G++T+ Y+ P Y S L + G R + Sbjct: 289 IKRLNRYPEGVVTTLYICFELTTPKKSCGDYWEHYFESGLLNNSLSQIAHEGGRAHGLLN 348 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +K D L V VP +EQ I +V++ I L E+ + L++ + + + +T Sbjct: 349 VKPSDFFSLKVAVPGFEEQQKIASVLSAADTEISTL----EKKLACLRDEKKALMQQLLT 404 Query: 417 GQIDLR-GES 425 G+ ++ E+ Sbjct: 405 GKRRVKVDEA 414 >gi|20807979|ref|NP_623150.1| restriction endonuclease S subunits [Thermoanaerobacter tengcongensis MB4] gi|20516553|gb|AAM24754.1| Restriction endonuclease S subunits [Thermoanaerobacter tengcongensis MB4] Length = 398 Score = 138 bits (346), Expect = 3e-30, Method: Composition-based stats. Identities = 59/409 (14%), Positives = 134/409 (32%), Gaps = 30/409 (7%) Query: 20 AIPKHWKVVPIKRFT--KLNTGRTSESGKDIIYIGLEDVESGTGKYL-PKDGNSRQSDTS 76 +P W+ V + T +Y+ + ++S GK + PK+ + + + Sbjct: 7 KLPPGWRWVRLGEVCLPTERRDPTKNPSTYFVYVDISAIDSTVGKIVSPKEILGQHAPSR 66 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G +++ PYL+ + D ICST F V++ E + L Sbjct: 67 ARKVIRSGDVIFATTRPYLKNIALVPPDLDGQICSTGFCVIRANREFAEPEFLFHLCRSD 126 Query: 134 TQRIE---AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + G + + N +P+PPL EQ I K+ A R+ + R Sbjct: 127 FITNQLTASKMRGTSYPAVTDNDVYNTLIPLPPLEEQRRIVAKVEALMERVREVRRLRAE 186 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + + Q ++ + +P W + + ++ Sbjct: 187 AQKDTELLMQTALAEVFPHP--------------GADLPPGWRWVRLGEVCDIIMGQSPP 232 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSA 308 N K + ++ P + + ++ PG+++ + Sbjct: 233 SSTYNFEGNGLPFFQGKADFGDLHPTPRIWCSAPQKVARPGDVLISVRAPVG-----STN 287 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 I A++P + Y ++ ++ +D++ + + Sbjct: 288 VANLACCIGRGLAALRPRDSLERFWLLYYLHYLEPELSKMGAGSTFNAITKKDLQNVFIP 347 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +PP++EQ I ++ ++ L ++ LK + + A G Sbjct: 348 LPPLEEQRRIVAYLDQIQQQVAALKRAQAETEAELKRLEQAILDKAFRG 396 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 65/192 (33%), Gaps = 2/192 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W+ V + + G++ S G + R ++ Sbjct: 209 DLPPGWRWVRLGEVCDIIMGQSPPSSTYNFEGNGLPFFQGKADFGDLHPTPRIWCSAPQK 268 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L P +A+ L+P+D L + L + + Sbjct: 269 VARPGDVLISVRAPV-GSTNVANLACCIGRGLAALRPRDSLERFWLLYYLH-YLEPELSK 326 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + G+T + K + N+ +P+PPL EQ I + ++ L + LK + Sbjct: 327 MGAGSTFNAITKKDLQNVFIPLPPLEEQRRIVAYLDQIQQQVAALKRAQAETEAELKRLE 386 Query: 200 QALVSYIVTKGL 211 QA++ L Sbjct: 387 QAILDKAFRGDL 398 >gi|313157426|gb|EFR56848.1| type I restriction modification DNA specificity domain protein [Alistipes sp. HGB5] Length = 426 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 62/415 (14%), Positives = 147/415 (35%), Gaps = 22/415 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +G IP+ W+V+ + + +G + ++ Y+ +ED+ + +SR+ Sbjct: 23 LGIIPQEWEVMRLGDIVSITSGESPSLYHLKAEGKYPYVKVEDLNNCE----KYQESSRE 78 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G I++ K G + + ++ + +L Sbjct: 79 YSDDNNTTIKAGSIIFPKRGASILNNKVRIAAKDIQMDSNMMAITPHTTIVDTEFLYIRI 138 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +R+ I + +++ + K I + +PPLAEQ I E + D I ++ R I Sbjct: 139 LHERLYRIADTSSIPQINNKHIIPYKIAVPPLAEQRKIAEVL----GVWDEAIEKQARLI 194 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E L +K+AL+ +++ L + +G + N K I Sbjct: 195 EKLALRKRALMQRLLSAKLRLPGFSEPWEKVKLGDIGHFLSSNTLSRDCLNEQIGNIKNI 254 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + I+ + + + G+I+F ++ Sbjct: 255 HYGDILIKLPTIVDASFIHIPYVNDDVIVKSDYLKNGDIIFADTAEDYTVGKAIEIINIQ 314 Query: 313 RGIITSAYMAVKPHGID----STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 +TS + + +L + + S D + + G+ S+ + + + Sbjct: 315 AIPVTSGLHTIPFRPKSGIFVNRFLGYYVNSTDYRRQLQPLIQGIKVYSISKTALCKTTL 374 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +P + EQ I V+ +E ++ + L+ ++ + +TG+ ++ Sbjct: 375 KIPTLSEQTAIAEVLTAADRE----IELAKEKLERLRRQKRGLMQQLLTGKRRIK 425 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 73/205 (35%), Gaps = 10/205 (4%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIE---SNILSLSYGNIIQKLETRNMGLKPESYE 281 +G++P WEV +V+ + ++ L + E + S + Sbjct: 23 LGIIPQEWEVMRLGDIVSITSGESPSLYHLKAEGKYPYVKVEDLNNCEKYQESSREYSDD 82 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + G I+F +R A + + +D+ +L + Sbjct: 83 NNTTIKAGSIIFPKRGASILNNKVRIAAKDIQMDSNMMAITPHTTIVDTEFLYIRILHER 142 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 L ++ + + + + + VPP+ EQ I V+ V D +EK + I Sbjct: 143 LYRIAD---TSSIPQINNKHIIPYKIAVPPLAEQRKIAEVLGVW----DEAIEKQARLIE 195 Query: 402 LLKERRSSFIAAAVTGQIDLRGESQ 426 L R+ + + ++ ++ L G S+ Sbjct: 196 KLALRKRALMQRLLSAKLRLPGFSE 220 >gi|313205415|ref|YP_004044072.1| restriction modification system DNA specificity domain [Paludibacter propionicigenes WB4] gi|312444731|gb|ADQ81087.1| restriction modification system DNA specificity domain [Paludibacter propionicigenes WB4] Length = 624 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 67/433 (15%), Positives = 140/433 (32%), Gaps = 37/433 (8%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKD-G 68 + IPKHW+V + K+ G ++ + ++ ++E + Sbjct: 4 LNNIPKHWQVKRLFEIGKVINGDRGKNYPSRAHYVEYGVPFVSAGNIEEYYINSNNLNFI 63 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGW 127 + + + I+Y G + AI +G S+ +L+ + + + Sbjct: 64 SKDKFEALNNGKLQNRDIIYCLRGSLGKCAISNLNEGAISSSLCILRLDQTIEERYVYYY 123 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S I G + K N +PIPPL EQ+ I KI ++ + Sbjct: 124 LCSPFGRAEILKHDNGTAQPNLSAKNFSNYIIPIPPLHEQLSIVSKIEELLSDLENGKQQ 183 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR- 246 + + LK +Q+L+ L G +P+ W+ L Sbjct: 184 LLTAQQQLKVYRQSLLKAAFEGRLTNKEVKD-------GELPEGWKWVTITDLAENNKHA 236 Query: 247 ----------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIVDPGEIVF 293 K +E+ +I E + P +I+ Sbjct: 237 LKAGPFGSALKKEFYVETGYKIYGQEQVIIDNPNFGDYYVNEEKYQELKSCRIKPFDILI 296 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGS 351 + L + GII + + + + + S + + + Sbjct: 297 SLVGTVGKVLILPENCM--DGIINPRLIKISLNRQKYLPKFFKYYFESSSVKAHYKSQAQ 354 Query: 352 GL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G L +K++P + ++EQ + + + + D + E I QS+ + + S Sbjct: 355 GTTMDVLNLGIIKKVPFPLTTLEEQQRVIDELESKLTVCDKIEETINQSLQQAETLKQSI 414 Query: 411 IAAAVTGQIDLRG 423 + A G++ ++ Sbjct: 415 LKKAFEGRL-VKP 426 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 81/205 (39%), Gaps = 11/205 (5%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKN-------TKLIESNILSLSYGNIIQKLETRNMGLK 276 + +P HW+VK F + +N +E + +S GNI + N L Sbjct: 3 ELNNIPKHWQVKRLFEIGKVINGDRGKNYPSRAHYVEYGVPFVSAGNIEEYYINSN-NLN 61 Query: 277 PESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 S + ++ ++ G++ R I L+ + + E I +S + I+ Y+ Sbjct: 62 FISKDKFEALNNGKLQNRDIIYCLRGSLGKCAISNLNEGAISSSLCILRLDQTIEERYVY 121 Query: 335 WLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + S + +L ++ + +PP+ EQ I + I + ++ Sbjct: 122 YYLCSPFGRAEILKHDNGTAQPNLSAKNFSNYIIPIPPLHEQLSIVSKIEELLSDLENGK 181 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ 418 +++ + LK R S + AA G+ Sbjct: 182 QQLLTAQQQLKVYRQSLLKAAFEGR 206 >gi|269976583|ref|ZP_06183568.1| restriction modification system DNA specificity subunit [Mobiluncus mulieris 28-1] gi|269935384|gb|EEZ91933.1| restriction modification system DNA specificity subunit [Mobiluncus mulieris 28-1] Length = 445 Score = 137 bits (345), Expect = 3e-30, Method: Composition-based stats. Identities = 62/419 (14%), Positives = 154/419 (36%), Gaps = 16/419 (3%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +W IPKHWK V ++ ++ G+T ++ + + + + + Sbjct: 24 EEEWPYPIPKHWKWVRLESVVEMRIGKTPARAEEKYWDSYDYPWVKISDFTDEGVIAGSQ 83 Query: 74 DTSTV---------SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + + + G +L + K I D + + + + P+ + Sbjct: 84 EQISSVAFREVFKGRLVPAGTLLMS-FKLTIGKCAILDIAAVHNEAIISIFPQCSIVNRD 142 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + +TQ S + + +P+P+PPL EQ I + + +ID++ Sbjct: 143 YLFHCLPTITQFGIQR-SAVKGSTLNSNSLNALPLPLPPLTEQKQIVAYLDEKLGKIDSV 201 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + F++ ++K L+ +T L + + S ++ + T Sbjct: 202 REKLQDFLDHADKRKDNLIQAAITGHLTHQWRDQHSVSMASWKQVQLGKLGKWGGGGTPS 261 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES--YETYQIVDPGEIVFRFIDLQNDK 302 K++ I ++ ++ + E+ + + + + Sbjct: 262 KSKSSFWDGGTIRWITSKDMKTSEILDTLDHITAKAVEESTANLYQEPAICVVMRSGILR 321 Query: 303 RSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKF 359 R+L A+V + + G++ ++ L+ D + +S++F Sbjct: 322 RTLPIAKVNGEFTVNQDLKVLHAFADGVEPDFIYLALLGHSDRILDVCSKSGTTVESIEF 381 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +K + +P + EQ +I +++ + ARID K+++++ L + ++AA+ G+ Sbjct: 382 SKLKDYEIELPVLPEQEEIARILDEQLARIDAADSKVQEALDQLNLLKEQLVSAALAGR 440 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 78/204 (38%), Gaps = 6/204 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 EW +P HW+ ++V K E ++ + + G+ S Sbjct: 23 PEEEWPYPIPKHWKWVRLESVVEMRIGKTPARAEEKYWDSYDYPWVKISDFTDEGVIAGS 82 Query: 280 YET-----YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E ++ V G +V L + K ++ +++ + + + Sbjct: 83 QEQISSVAFREVFKGRLVPAGTLLMSFKLTIGKCAILDIAAVHNEAIISIFPQCSIVNRD 142 Query: 335 WLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 +L F S ++ S L + LP+ +PP+ EQ I ++ + +ID + Sbjct: 143 YLFHCLPTITQFGIQRSAVKGSTLNSNSLNALPLPLPPLTEQKQIVAYLDEKLGKIDSVR 202 Query: 394 EKIEQSIVLLKERRSSFIAAAVTG 417 EK++ + +R+ + I AA+TG Sbjct: 203 EKLQDFLDHADKRKDNLIQAAITG 226 >gi|161528115|ref|YP_001581941.1| restriction modification system DNA specificity subunit [Nitrosopumilus maritimus SCM1] gi|160339416|gb|ABX12503.1| restriction modification system DNA specificity domain [Nitrosopumilus maritimus SCM1] Length = 438 Score = 137 bits (344), Expect = 4e-30, Method: Composition-based stats. Identities = 66/424 (15%), Positives = 141/424 (33%), Gaps = 25/424 (5%) Query: 20 AIPKHWKVVPIKRF-TKLNTGRTSES--------GKDIIYIGLEDVESGTGK-YLPKDGN 69 IP+ WK+ + TK+ G ES I +I ++ K + Sbjct: 18 EIPETWKICNLGDLLTKIQDGNYGESYPKESEFLDSGIPFIRGTEITKNFIDGKKVKYIS 77 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQF--LVLQPKDVLPELL 124 + D + G +L+ G R I D Q L K + + L Sbjct: 78 KTKHDELQKAHIETGDVLFLNRGGITRTVAIVPPKYDDANIGPQLTLLRCNTKIIHNKYL 137 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 ++ + +++ + G + + + +P + EQ I + + + + Sbjct: 138 YYFIQGENFKKQVISSDAGTALQFFGIEKTKKFKITLPEIREQQKIVSVLNSIDNLLSSY 197 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLN---PDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 +L K Q L++ + P + K+ I + ++ + Sbjct: 198 DKTIQTTQKLKKGLMQKLLTKGIDHKKFKKVPWLFGKEIEIPEEWEIKKIEDLFKLKSGS 257 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQ 299 T + + S K+ + + PE +++ G + L+ Sbjct: 258 TPSRKIPEYFAGNIPWITSTDLNRSKITSTLEKITPEAVKQTNLKLLPKGTFLIATYGLE 317 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLK 358 + A MA P ++ + Y K+ +++ G +Q+L Sbjct: 318 AAGTRGKCGITKMESTCNQACMAFLPSSEITSEFLFYFYLYFGEKIIFSIAQGTKQQNLY 377 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +K++ + VPP KEQ I N ++ + + L K L + + I +T + Sbjct: 378 SDTLKKVSMFVPPQKEQKRIVNFLDQIDSHLFELESKK----TGLDKIKKGLIQKLLTSK 433 Query: 419 IDLR 422 I ++ Sbjct: 434 IRVK 437 >gi|11499300|ref|NP_070538.1| type I restriction-modification enzyme, S subunit [Archaeoglobus fulgidus DSM 4304] gi|2648839|gb|AAB89535.1| type I restriction-modification enzyme, S subunit [Archaeoglobus fulgidus DSM 4304] Length = 341 Score = 137 bits (344), Expect = 4e-30, Method: Composition-based stats. Identities = 54/348 (15%), Positives = 122/348 (35%), Gaps = 20/348 (5%) Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 KG I+ P I D D + L PK + + +++E + Sbjct: 2 PKGSIIVSTRAPV-GYVAIVDEDTTFNQGCKGLIPKSSEINTEFYCYYLLLIKRKLEQLS 60 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+T K + + +P+ PL EQ I E + +D I + I + K+ Sbjct: 61 GGSTFKELPKKSLEELLIPLLPLPEQQKIAEIL----STVDKAIEKVDEAIAKTERLKKG 116 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK--NTKLIESNILSL 259 L+ ++TKG+ K+ +G +P WEV ++ + + Sbjct: 117 LMQELLTKGIGH----KEFKDTEIGRIPKEWEVVRLGDVLELCQYGLSVPLKDKGKYPVI 172 Query: 260 SYGNIIQKLETRNMGLK---PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 I+ ++ E ++ G+++F + + + Sbjct: 173 RMDEIVNGYVVTDIAKYADLDEETFKNFKLEKGDVLFNRTNSLELVGRTGIFLLDGYYVF 232 Query: 317 TSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 S + ++P +L + + A + + ++ ++K+ + +PP+ E Sbjct: 233 ASYLIRLRPKHEILHPHFLTFYLIFSQSRLKQLATVAVHQANINATNLKKFKIPLPPLPE 292 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 Q I +++ ++ E + L+ + + +TG+ +R Sbjct: 293 QQKIAEILSTVDKKL----ELERKRKEKLERIKKGLMNDLLTGRRRVR 336 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 76/204 (37%), Gaps = 11/204 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKL-NTGRT-SESGKD-IIYIGLEDVESGTGKYLP 65 ++KD+ IG IPK W+VV + +L G + K I ++++ +G Sbjct: 130 EFKDTE---IGRIPKEWEVVRLGDVLELCQYGLSVPLKDKGKYPVIRMDEIVNGYVVTDI 186 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLP 121 +T KG +L+ + + D + ++ + L+PK + Sbjct: 187 AKYADLDEETFKNFKLEKGDVLFNRTNSLELVGRTGIFLLDGYYVFASYLIRLRPKHEIL 246 Query: 122 ELLQGWLLSIDVTQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 I R++ + ++ + + +P+PPL EQ I E + + Sbjct: 247 HPHFLTFYLIFSQSRLKQLATVAVHQANINATNLKKFKIPLPPLPEQQKIAEILSTVDKK 306 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 ++ + + + K L++ Sbjct: 307 LELERKRKEKLERIKKGLMNDLLT 330 >gi|315059000|gb|ADT73329.1| Type I restriction-modification system, specificity subunit S [Campylobacter jejuni subsp. jejuni S3] Length = 398 Score = 136 bits (343), Expect = 5e-30, Method: Composition-based stats. Identities = 61/411 (14%), Positives = 127/411 (30%), Gaps = 30/411 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+V ++ + G+ GK+++ YI + D + L ++ Sbjct: 4 LPQGWEVKKLEEIANIKGGKRLPKGKNLLDNNTKFAYIRVADFQDNGTINLQNIKFINEN 63 Query: 74 DTS--TVSIFAKGQILYGKLGPYLRKAIIADF-DGICSTQFLV---LQPKDVLPELLQGW 127 + + G + II +G T+ V ++ + + + Sbjct: 64 TYNVLKNYKIYDDNLYISIAGTIGKSGIIPKELNGAILTENAVKLEYIQNNISNKFMYFF 123 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 LS +I+ + + I +P+PPL EQ I + +ID I Sbjct: 124 TLSNIFKTQIQTSTKIVAQPKLAITRLKQIQIPLPPLKEQERIVGILDESFAKIDESIKI 183 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + + L E Q+ + + + +P WE K + ++ Sbjct: 184 LEQDLLNLDELMQSALQKAFNPLKD--------NAKENYKLPQSWEWKSLEEISENISAG 235 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 K + I N + I+ P I + + Sbjct: 236 GDKPKNCTESKTAKNQIPVYANGVNNNGLVGYTDKATIIKPS----LTISARGTIGFVCI 291 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + I+ + + + YL + + L K L + Sbjct: 292 RKEPYFPIVRLISLIPCENILCLHYLYFCLN-----FFIAKGEGSSIPQLTIPKFKSLQI 346 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 347 PLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLNKAFKGE 397 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 66/193 (34%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ ++ ++ G + ++ Y N+ + Sbjct: 215 KLPQSWEWKSLEEISENISAGGDKPKN----CTESKTAKNQIPVYANGVNNNGLVGYTDK 270 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K + G I + + + L P + + L + + E Sbjct: 271 ATIIKPSLTISARGTIGFVCIRKEPYFPI-VRLISLIPCENILCLHYLYFCLNFFIAKGE 329 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+++ ++ +P+PPL EQ I E + + L + ++ +E Sbjct: 330 ----GSSIPQLTIPKFKSLQIPLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEEL 385 Query: 199 KQALVSYIVTKGL 211 KQ+L++ L Sbjct: 386 KQSLLNKAFKGEL 398 >gi|24371978|ref|NP_716020.1| type I restriction-modification system, S subunit [Shewanella oneidensis MR-1] gi|24345830|gb|AAN53465.1|AE015486_6 type I restriction-modification system, S subunit [Shewanella oneidensis MR-1] Length = 439 Score = 136 bits (343), Expect = 5e-30, Method: Composition-based stats. Identities = 65/428 (15%), Positives = 146/428 (34%), Gaps = 29/428 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP W+ I + TG +S + + ++ G+ ++ Sbjct: 10 GKIPNDWEYQIIIDNVEFLTGPAFDSSLFNTESRGARLVRGINLTQGSTRWGEDKTKYWD 69 Query: 73 SDTSTVSIFA--KGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQG 126 + + + + IL G G + K D + + L+ K L Sbjct: 70 VELNNLKKYQLAINDILIGMDGSLVGKNYAYLKQSDLPALLVQRVARLRAKSNLHSKYLY 129 Query: 127 WLLSIDVT-QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 ++ + D +E + + + H I N P PPL EQ I + + I+ Sbjct: 130 YMYATDFWLDYVEVVKTNSGIPHISNGDIKNFRFPFPPLPEQQKIAAILTSVDEVIEKTQ 189 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLN----PDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 + + +L Q L++ V P ++ KDS + + + + + Sbjct: 190 AQIDKLKDLKSGMMQELLTKGVGIKQGDKYVPHIEFKDSPVGKIPKSWEVKPLNSVVLKI 249 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP------ESYETYQIVDPGEIVFRF 295 + K ++ + + + ++ E +K + I G+++F Sbjct: 250 IDCEHKTAPYVDKSEYLVVRTSNVRHGELVLDDMKYTHADGYAEWTNRAIPSLGDVLFTR 309 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL-CKVFYAMGSGLR 354 + L + + I S + + + S C ++ Sbjct: 310 EAPAG-ESCLVPENTKVCMGQRMVLLRPDANVIFSNFFSLFLTSEAASCAIYERSIGTTV 368 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + ED+KR+P +VPP+ EQ +I + + + ++ + LK + + + Sbjct: 369 SRINIEDIKRIPCIVPPLSEQQEI----SKAIQSVQNSILNKQEKLQSLKNLKKALMQDL 424 Query: 415 VTGQIDLR 422 +TG++ ++ Sbjct: 425 LTGKVRVK 432 >gi|205356617|ref|ZP_03223379.1| putative Type I RM HdsS [Campylobacter jejuni subsp. jejuni CG8421] gi|205345474|gb|EDZ32115.1| putative Type I RM HdsS [Campylobacter jejuni subsp. jejuni CG8421] Length = 404 Score = 136 bits (343), Expect = 5e-30, Method: Composition-based stats. Identities = 61/411 (14%), Positives = 127/411 (30%), Gaps = 30/411 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+V ++ + G+ GK+++ YI + D + L ++ Sbjct: 10 LPQGWEVKKLEEIANIKGGKRLPKGKNLLDNNTKFAYIRVADFQDNGTINLQNIKFINEN 69 Query: 74 DTS--TVSIFAKGQILYGKLGPYLRKAIIADF-DGICSTQFLV---LQPKDVLPELLQGW 127 + + G + II +G T+ V ++ + + + Sbjct: 70 TYNVLKNYKIYDDNLYISIAGTIGKSGIIPKELNGAILTENAVKLEYIQNNISNKFMYFF 129 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 LS +I+ + + I +P+PPL EQ I + +ID I Sbjct: 130 TLSNIFKTQIQTSTKIVAQPKLAITRLKQIQIPLPPLKEQERIVGILDESFAKIDESIKI 189 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + + L E Q+ + + + +P WE K + ++ Sbjct: 190 LEQDLLNLDELMQSALQKAFNPLKD--------NAKENYKLPQSWEWKSLEEISENISAG 241 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 K + I N + I+ P I + + Sbjct: 242 GDKPKNCTESKTAKNQIPVYANGVNNNGLVGYTDKATIIKPSL----TISARGTIGFVCI 297 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + I+ + + + YL + + L K L + Sbjct: 298 RKEPYFPIVRLISLIPCENILCLHYLYFCLN-----FFIAKGEGSSIPQLTIPKFKSLQI 352 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 353 PLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLNKAFKGE 403 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 66/193 (34%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ ++ ++ G + ++ Y N+ + Sbjct: 221 KLPQSWEWKSLEEISENISAGGDKPKN----CTESKTAKNQIPVYANGVNNNGLVGYTDK 276 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K + G I + + + L P + + L + + E Sbjct: 277 ATIIKPSLTISARGTIGFVCIRKEPYFPI-VRLISLIPCENILCLHYLYFCLNFFIAKGE 335 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+++ ++ +P+PPL EQ I E + + L + ++ +E Sbjct: 336 ----GSSIPQLTIPKFKSLQIPLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEEL 391 Query: 199 KQALVSYIVTKGL 211 KQ+L++ L Sbjct: 392 KQSLLNKAFKGEL 404 >gi|325104608|ref|YP_004274262.1| restriction modification system DNA specificity domain protein [Pedobacter saltans DSM 12145] gi|324973456|gb|ADY52440.1| restriction modification system DNA specificity domain protein [Pedobacter saltans DSM 12145] Length = 424 Score = 136 bits (343), Expect = 6e-30, Method: Composition-based stats. Identities = 71/414 (17%), Positives = 149/414 (35%), Gaps = 22/414 (5%) Query: 20 AIPKHWKVVPIKRFTKLNT--GRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P++W++ +K G + K + ++ D++ G + Sbjct: 14 DLPENWRMQRLKNLCTEKNTYGVNIPNSKYEESGVRFLRTTDIDE-NGNIGEGGIFIAKK 72 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP--ELLQGWLLSI 131 + KG +L+ + G R + + FLV + + L + S Sbjct: 73 NVPEGYFLNKGDVLFSRSGTIGRCYFHKNEEEYTYAGFLVKFKPKNIDISKWLYYFSFSK 132 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIR 190 ++ +T+ + + + + IP + I + + +I+ I ++ + Sbjct: 133 YFKYQLSTEAIESTIFNFNGNKYSVLKVAIPNEIETVRKINNFLDKKCEQINQFIADKKQ 192 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 I LLKE++Q++++ V K + D I WV H F + ++ K Sbjct: 193 LINLLKEQRQSVINSHVNKSEDEDS------INWVTHKLKHISDIKFSNVDKLTHKGEVK 246 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA-- 308 + N + + + I + V G+I+ + + + Sbjct: 247 VKLCNYVDVYKNDYITNNIEFMLATATLEEIEKFKVFKGDIIITKDSESANDIGIPAFVS 306 Query: 309 QVMERGIITSA--YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRL 365 + ++ + + I +L + S ++ F +G R L D+ + Sbjct: 307 ENIDNLVCAYHLAMIRANQEIILDEFLFRKIESKEVNSQFEVNATGVTRVGLSIADISNV 366 Query: 366 PVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + P I Q DI I ET ID + K E+ + L+ E + + I+ AVTGQ Sbjct: 367 LISYPKDINIQKDIIAKIKSETKTIDETIFKTEEELRLVAEYKEALISNAVTGQ 420 >gi|124008028|ref|ZP_01692727.1| type I restriction enzyme StySJI specificity protein [Microscilla marina ATCC 23134] gi|123986442|gb|EAY26248.1| type I restriction enzyme StySJI specificity protein [Microscilla marina ATCC 23134] Length = 436 Score = 136 bits (343), Expect = 6e-30, Method: Composition-based stats. Identities = 62/428 (14%), Positives = 140/428 (32%), Gaps = 34/428 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGKYLP-KDGNSRQSD 74 +W+ I+ F ++ G+ +GK+ Y+ + D+ +G+ + + Sbjct: 2 SNWEEKKIQDFAEVKGGKRLPAGKEFSLTPTKHPYLRVTDMVNGSIDTSNLQYVDEEIEK 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC----STQFLVLQPKDVLPELLQGWLLS 130 + + G I + + + K ++ + + LS Sbjct: 62 VIRNYRISADDLYITIAGTIGSVGNIPELLHNALLTENAAKITNIDKSIIDKNYLQYYLS 121 Query: 131 IDVTQRIEA--ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + T+ I G + I N+ + PPL Q I + + ID Sbjct: 122 SEETKSQINKEIGIGGGVPKLALYRILNLVVQYPPLTYQRKIAQILSTVDRVIDGTQRAI 181 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE-----WVGLVPDHWEVKPFFA---L 240 ++ L + Q L S + + E +G +P + Sbjct: 182 EKYQTLKEGLMQDLFSRGIDVSTGKLRPPRQVAPELYQKTELGWIPKDYSFVRLEDLTLK 241 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE-----TYQIVDPGEIVFRF 295 + + K ES I L ++ K + E + G+++ Sbjct: 242 IIDGTHHTPKYTESGIPFLRVTDVQTKDINFDKLKFVSLEEHQILTKRCNPEKGDLLLSK 301 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLR 354 + + ++ A + I+ YL + ++S + G Sbjct: 302 NGTIGIPKVVDWDWEFS-IFVSLALIKPNHRLINVEYLLYFLKSELIKNQIIRQAKQGTV 360 Query: 355 QSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +L E+++ + PP I+EQ +I +N ++ +E ++S LK + + + Sbjct: 361 TNLHLEEIREFKIAQPPSIQEQNNIVEKLN----NLEKQIESEQKSFQKLKTLKQALMQD 416 Query: 414 AVTGQIDL 421 +TG++ + Sbjct: 417 LLTGKVSV 424 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 74/200 (37%), Gaps = 10/200 (5%) Query: 18 IGAIPKHWKVVPIKRFT-KLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +G IPK + V ++ T K+ G I ++ + DV++ + S + Sbjct: 223 LGWIPKDYSFVRLEDLTLKIIDGTHHTPKYTESGIPFLRVTDVQTKDINFDKLKFVSLEE 282 Query: 74 DTSTVSIF--AKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWL 128 KG +L K G ++ +F S + + + E L +L Sbjct: 283 HQILTKRCNPEKGDLLLSKNGTIGIPKVVDWDWEFSIFVSLALIKPNHRLINVEYLLYFL 342 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPM-PIPPLAEQVLIREKIIAETVRIDTLITE 187 S + +I + T+++ + I + P + EQ I EK+ +I++ Sbjct: 343 KSELIKNQIIRQAKQGTVTNLHLEEIREFKIAQPPSIQEQNNIVEKLNNLEKQIESEQKS 402 Query: 188 RIRFIELLKEKKQALVSYIV 207 + L + Q L++ V Sbjct: 403 FQKLKTLKQALMQDLLTGKV 422 >gi|256810222|ref|YP_003127591.1| restriction modification system DNA specificity domain protein [Methanocaldococcus fervens AG86] gi|256793422|gb|ACV24091.1| restriction modification system DNA specificity domain protein [Methanocaldococcus fervens AG86] Length = 402 Score = 136 bits (342), Expect = 7e-30, Method: Composition-based stats. Identities = 82/421 (19%), Positives = 144/421 (34%), Gaps = 45/421 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P+ WK V +K K G+ ++ + Y+ + +G K Sbjct: 3 ELPEGWKWVKLKEIIKTEKGKKPKNLIKEKNNNALPYLTADYFRTGILK-------QYSE 55 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + I G ++ G I+D +GI ++ + L K+ + + Sbjct: 56 ENEKLRIVKPGDLVLIWDGSKAGDIFISDIEGILASTMVKLIIKNKEVHPKFIYFVIKHY 115 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP------LAEQVLIREKIIAETVRIDTLITE 187 + GA + H + N+ +PIP L +Q I EKI ID I Sbjct: 116 FPILNKNTTGAGIPHVSKEVFNNLLIPIPFKDGKPDLEKQKQIVEKIEKIFNEIDKAIKL 175 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 R + I KE A+++ I K++ + F T + Sbjct: 176 REKAINETKELFNAVLNKIF----------KEAEEGERWKWVKFENIVDFKMGKTPKRSE 225 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPES----YETYQIVDPGEIVFRFIDLQNDKR 303 +S G++ K E +IV G ++ F Sbjct: 226 KRYWENGVYHWVSIGDMQDKYINTTKEKISEEAFREVFKGKIVPKGTLLMSFKLTIGRTA 285 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 L V II+ I YL W+++S D K G +L E +K Sbjct: 286 ILNIDAVHNEAIIS----IYPKEEILRDYLYWVLQSIDYKKYINPAIKG--HTLNKEILK 339 Query: 364 RLPVLVP------PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 L + +P I++Q I N ++ + +I L + E+ + L KE + S + A G Sbjct: 340 NLLIPIPYKDNKPDIEKQKQIANYLDNLSEKIKQLEQLQEKQLNLFKELKESILNKAFEG 399 Query: 418 Q 418 + Sbjct: 400 E 400 >gi|302871463|ref|YP_003840099.1| restriction modification system DNA specificity domain [Caldicellulosiruptor obsidiansis OB47] gi|302574322|gb|ADL42113.1| restriction modification system DNA specificity domain [Caldicellulosiruptor obsidiansis OB47] Length = 457 Score = 136 bits (342), Expect = 8e-30, Method: Composition-based stats. Identities = 78/431 (18%), Positives = 157/431 (36%), Gaps = 37/431 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDT 75 PK W +V ++R L +G + S + I +G E + G + + Sbjct: 22 EFPKEWTIVSLERDCVLISGLRPKGGASDEGIPSLGGEHITLDGRINFSDVNAKYIPEKF 81 Query: 76 ST---VSIFAKGQILYGKLGPYLRKAIIADFDGI----CSTQFLVLQPKDVLPELLQGWL 128 + IL K G K I + +L+ K + + + Sbjct: 82 FKIMTKGKAEENDILINKDGANTGKVAILKKKFYKDIAINEHLFILRSKKLFVQQYLFYW 141 Query: 129 LSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L ++I G+ I N +P PPL+EQ I E + ID I + Sbjct: 142 LFSRFGQKQITDRITGSAQPGLSSTFIKNFLVPRPPLSEQRKIAEIL----ETIDNAIEK 197 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW-----VGLVPDHWEVKPFFALVT 242 IE K KQ L+ ++TKG++ + ++++ +G +P+ W+V + Sbjct: 198 TDAIIEKYKRIKQGLMQDLLTKGIDENWQIRNEKTHKFKDSLLGRIPEEWKVVKLKDVAD 257 Query: 243 ELNRKNTKLIE--------SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 K + N L + + I K + + G+++ Sbjct: 258 IRLSNVDKKTDLKGKIIQLCNYLEVYQNDYIIKGMNFMHASATNNEIKKFKISKGDVIIT 317 Query: 295 FIDLQNDKRSLRSA--QVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + + + +E I +KP + I+ +L+ ++ ++ F + Sbjct: 318 KDSEEYNDIAKPAYVRDEIENLICGYHLALIKPLNNINGLFLSKVLSFRNVNIYFQQRAN 377 Query: 352 G-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G R L E + + +P I EQ I ++ ++ID ++EK + L+ + Sbjct: 378 GITRFGLTKETITGAIIPLPLIPEQERIATIL----SQIDEVIEKEQAYKEKLERIKKGL 433 Query: 411 IAAAVTGQIDL 421 + +TG++ + Sbjct: 434 MEDLLTGKVRV 444 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 68/213 (31%), Gaps = 16/213 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKL----NTGRTSESGKDIIYIGLEDVESGTGKYL 64 ++KDS +G IP+ WKVV +K + +T GK I +V Sbjct: 234 KFKDS---LLGRIPEEWKVVKLKDVADIRLSNVDKKTDLKGKIIQLCNYLEVYQNDYIIK 290 Query: 65 PKDGNSRQSDTSTVSIFA--KGQILYGK----LGPYLRKAIIADFDGICSTQFLVLQPKD 118 + + + + F KG ++ K + A + D + + K Sbjct: 291 GMNFMHASATNNEIKKFKISKGDVIITKDSEEYNDIAKPAYVRDEIENLICGYHLALIKP 350 Query: 119 VLPELLQGWLLSIDVTQ---RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + + + G T + I +P+P + EQ I + Sbjct: 351 LNNINGLFLSKVLSFRNVNIYFQQRANGITRFGLTKETITGAIIPLPLIPEQERIATILS 410 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I+ + + + K + L++ V Sbjct: 411 QIDEVIEKEQAYKEKLERIKKGLMEDLLTGKVR 443 >gi|193212616|ref|YP_001998569.1| restriction modification system DNA specificity domain [Chlorobaculum parvum NCIB 8327] gi|193086093|gb|ACF11369.1| restriction modification system DNA specificity domain [Chlorobaculum parvum NCIB 8327] Length = 578 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 95/489 (19%), Positives = 163/489 (33%), Gaps = 97/489 (19%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W+ V + T N + + D + LED+E T + L + S + S Sbjct: 93 ELPDGWEWVRLGEITAYNGRKNISGDQIDPDTWVLDLEDIEKDTSRILYRAKFSERQSKS 152 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQ 135 T S F KG +LYGKL PYL K ++AD DG+C+T+ + + L + L Sbjct: 153 TKSTFLKGDVLYGKLRPYLDKIVVADRDGVCTTEIVPIVSFVGLHSDFLKWLLKRPAFLS 212 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR--------------- 180 + ++ G M P+PPL EQ I +I Sbjct: 213 YVNSLMYGVKMPRLGTDNAVASIHPLPPLPEQHRIVARIDELMAHCDELEKLRAEREQKR 272 Query: 181 --------------------------IDTLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 I E E + E ++A++ V L P Sbjct: 273 VKVHAAAVRQLLDTTEPESSANAWQFISRNFRELYSDKENVAELRKAILQLAVMGKLVPQ 332 Query: 215 VKMKDSGIEWVGLV-------------------------------PDHWEVKPFFALVTE 243 E + + PD WE +++ Sbjct: 333 DPNDPPACELLKEIEAEKQRLVKEGKIKKPKAVSPIKPDEVPYPLPDSWEWVRLGDVISY 392 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI---------VDPGEIVFR 294 ++ + E+ S S +++ + + P +T I V+ +I+ Sbjct: 393 MDAGWSPKCETGPASDSEWGVLKTTAVQKLEFLPHENKTLPIKLTPRPEYQVEEKDILIT 452 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGS 351 +N A + ++ S + D Y A + + + S Sbjct: 453 RAGPKNRVGICCVATSIRPKLMLSDKIIRFKIYGDLISPDYCALSLNTGYCSEQIEMFKS 512 Query: 352 GL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G+ + ++ + VKRL +L+PP+ EQ I I+ A D L EQ I R+ Sbjct: 513 GMAESQMNISQDKVKRLLMLIPPLPEQHRIVARIDQLMALCDTL----EQQIDDAT-RKQ 567 Query: 409 S-FIAAAVT 416 + + A +T Sbjct: 568 TELLNAVMT 576 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 53/201 (26%), Gaps = 16/201 (7%) Query: 21 IPKHWKVVPIKRFTKLNT-GRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W+ V + G + + S + + V+ + Sbjct: 377 LPDSWEWVRLGDVISYMDAGWSPKCETGPASDSEWGVLKTTAVQKLEFLPHENKTLPIKL 436 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI-----CSTQFLVLQPKDVLPELLQG-- 126 + IL + GP R I I S + + + L Sbjct: 437 TPRPEYQVEEKDILITRAGPKNRVGICCVATSIRPKLMLSDKIIRFKIYGDLISPDYCAL 496 Query: 127 --WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + + + + M IPPL EQ I +I DTL Sbjct: 497 SLNTGYCSEQIEMFKSGMAESQMNISQDKVKRLLMLIPPLPEQHRIVARIDQLMALCDTL 556 Query: 185 ITERIRFIELLKEKKQALVSY 205 + E A+++ Sbjct: 557 EQQIDDATRKQTELLNAVMTQ 577 >gi|164687375|ref|ZP_02211403.1| hypothetical protein CLOBAR_01016 [Clostridium bartlettii DSM 16795] gi|164603799|gb|EDQ97264.1| hypothetical protein CLOBAR_01016 [Clostridium bartlettii DSM 16795] Length = 380 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 64/403 (15%), Positives = 130/403 (32%), Gaps = 36/403 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 ++ + K+ +G T K DI +I D++S + S+ Sbjct: 2 ELKKLGDIFKITSGGTPSKKKEEYYLDGDIPWIKTGDLKSKNIYKSSQYITELGVKNSSA 61 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +F K +L G + I + + P + + ++I Sbjct: 62 KLFPKDTVLIAMYGATIGATSILKIEAATNQACAAFLPTKDV-MPEYLYYFFKYNKEKII 120 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G + + + +P+ L EQ I + + + EL Sbjct: 121 SKGIGGAQPNISATILKDFKIPLLCLDEQEKIVNILNKAQNTTNKRKEQINLLDEL---- 176 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + S + +P +K + + V A V + + Sbjct: 177 ---VKSRFIEMFGDPIRNIKCWQTKRMDEV----------APVINYKGNFKQNEIWLLNL 223 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + K+ N E + D +++ + +K + E G TS Sbjct: 224 DMVESNTGKIIAYNYVTASEVGSSTCTFDTTNVLYSKLRPYLNKVVIPK----EIGYATS 279 Query: 319 AYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQ 375 M ++P D YLA+++R+ SG + D + V +PPI+ Q Sbjct: 280 EMMPLQPVKGILDRYYLAYMLRNKVFVDYISEKVSGAKMPRVTMNDFRDFKVPIPPIELQ 339 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 N + +D L ++E+S+ L++ +S + A G+ Sbjct: 340 NQFANFV----IEVDKLKFEMEKSLKELEDNFNSLMQRAFKGE 378 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 68/190 (35%), Gaps = 6/190 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + + + +I + L+ VES TGK + + + S+ F Sbjct: 195 WQTKRMDEVAPVINYKGNFKQNEIWLLNLDMVESNTGKIIAYNYVTASEVGSSTCTFDTT 254 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSID-VTQRIEAICE 142 +LY KL PYL K +I G +++ + LQP K +L ++L I Sbjct: 255 NVLYSKLRPYLNKVVIPKEIGYATSEMMPLQPVKGILDRYYLAYMLRNKVFVDYISEKVS 314 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA M + +PIPP+ Q +I + + +L Sbjct: 315 GAKMPRVTMNDFRDFKVPIPPIELQNQFANFVIEVDKLKFEMEKSLKELEDNFN----SL 370 Query: 203 VSYIVTKGLN 212 + L Sbjct: 371 MQRAFKGELF 380 >gi|332289275|ref|YP_004420127.1| EcoKI restriction-modification system protein HsdS [Gallibacterium anatis UMN179] gi|330432171|gb|AEC17230.1| EcoKI restriction-modification system protein HsdS [Gallibacterium anatis UMN179] Length = 414 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 62/415 (14%), Positives = 139/415 (33%), Gaps = 25/415 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P W+ + + L G + S + I + D+ T + + Sbjct: 5 KLPVGWEEKKLGEYLYLKNGYAFKRSAYIEKSNNSVPIIRISDINGNTASDELAIHTTEK 64 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL-QGWLLSI 131 + KG +L G K I + V K L Sbjct: 65 VEG---FELQKGDLLIAMSGATTGKLGIYIGNTPAYQNQRVGNLKLKNEGCEEFRNHLMF 121 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + G + K + ++ + +PPL EQ + +K +D + + R Sbjct: 122 YLQDEVRKLGYGNAQPNISGKMLEDLDIVLPPLPEQQKLAQKFTELLSMVDHMKQKLERI 181 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 LLK +QA++ V+ L+ + E + W+ + T +K+ Sbjct: 182 PLLLKTYRQAVLVKAVSGELSSKWR------EENKISRTSWKNTKVEDISTVTPKKDKIS 235 Query: 252 IESNILSLSYGNIIQKLE---TRNMGLKPESYETYQIVDPGEIVFRFIDLQ--NDKRSLR 306 + + S + + + L E + + G+++ I N K ++ Sbjct: 236 DDLTVSFSSMHLMSENINQHLNFEKKLWNEVKKGFSFFKNGDVLLAKITPCFENGKSAVA 295 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWL-MRSYDLCKV--FYAMGSGLRQSLKFEDVK 363 + G ++ +M +P+ + +L + + GS + + E V Sbjct: 296 RNLINGIGTGSTEFMVFRPNSELLSDFLYLHFNTDKFRQEGSMNMTGSVGHRRVPKEFVL 355 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +PP +EQ I + + L ++ +Q++ + + + +A G+ Sbjct: 356 NWEIELPPREEQKFIVQQVEELLNFAEKLEQQAQQALAKVNLLKQAILAKGFRGE 410 >gi|50914976|ref|YP_060948.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10394] gi|50904050|gb|AAT87765.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10394] Length = 402 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 58/401 (14%), Positives = 126/401 (31%), Gaps = 25/401 (6%) Query: 24 HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ + + + G + +S DI ++ + DV G+ + Sbjct: 17 EWEEKKLGEISNIVRGASPRPIQDPKWFDSKSDIGWLRISDVTEQEGRITYLQQRISELG 76 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +L + I G+ + L PK + Sbjct: 77 QEKTRVLKDPHLLLSIAATVGKPVINYVKTGVHDGFLVFLDPKF---NREFMFQWLDMFR 133 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + + + + + N + +P L EQ I E +D L+ + + + Sbjct: 134 PYWNKYGQPGSQVNLNSEIVRNQVINLPSLPEQEAIGE----LFQTVDQLLQLQDQKLAT 189 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LKE+KQ + + +++ G + E+ F+ T + Sbjct: 190 LKEQKQTFLRKMFPAQGQKVPEIRLQGFDGEWEEKKLGEISRMFSGGTPNVGIPEYYNGN 249 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I + I ++ K S + ++V+ +++ + + L G Sbjct: 250 -IPFIRSAEINSDQTELSITDKGLSNSSAKLVEKNTLLYALYGATSGEVGLSRIS----G 304 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 I A +A+ P S+ + G + +L VK L + P + E Sbjct: 305 AINQAILAIIPEKKYSSLFIKNWLYKQKSSIIEKYLQGGQGNLSGSIVKELTIHFPSLSE 364 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q I N +D + + E+ + LK + + + Sbjct: 365 QEAIGNF----FQTLDQQIAQSEEKLTELKALKQTLLNRLF 401 >gi|23452707|gb|AAN33128.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452715|gb|AAN33131.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452757|gb|AAN33147.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 398 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 60/411 (14%), Positives = 127/411 (30%), Gaps = 30/411 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+V ++ + G+ G++++ YI + D + L ++ Sbjct: 4 LPQGWEVKKLEEIANIKGGKRLPKGENLLDNNTKFAYIRVADFQDNGTINLQNIKFINEN 63 Query: 74 DTS--TVSIFAKGQILYGKLGPYLRKAIIADF-DGICSTQFLV---LQPKDVLPELLQGW 127 + + G + II +G T+ V ++ + + + Sbjct: 64 TYNVLKNYKIYDDNLYISIAGTIGKSGIIPKELNGAILTENAVKLEYIQNNISNKFMYFF 123 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 LS +I+ + + I +P+PPL EQ I + +ID I Sbjct: 124 TLSNIFKTQIQTSTKIVAQPKLAITRLKQIQIPLPPLKEQERIVGILDESFAKIDESIKI 183 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + + L E Q+ + + + +P WE K + ++ Sbjct: 184 LEQNLLNLDELMQSALQKAFNPLKD--------NAKENYKLPQSWEWKSLEEISENISAG 235 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 K + I N + I+ P I + + Sbjct: 236 GDKPKNCTESKTAKNQIPVYANGVNNNGLVGYTDKATIIKPS----LTISARGTIGFVCI 291 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + I+ + + + YL + + L K L + Sbjct: 292 RKEPYFPIVRLISLIPCENILCLHYLYFCLN-----FFIAKGEGSSIPQLTIPKFKSLQI 346 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 347 PLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLDKAFKGE 397 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 65/193 (33%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ ++ ++ G + ++ Y N+ + Sbjct: 215 KLPQSWEWKSLEEISENISAGGDKPKN----CTESKTAKNQIPVYANGVNNNGLVGYTDK 270 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K + G I + + + L P + + L + + E Sbjct: 271 ATIIKPSLTISARGTIGFVCIRKEPYFPI-VRLISLIPCENILCLHYLYFCLNFFIAKGE 329 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+++ ++ +P+PPL EQ I E + + L + ++ +E Sbjct: 330 ----GSSIPQLTIPKFKSLQIPLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEEL 385 Query: 199 KQALVSYIVTKGL 211 KQ+L+ L Sbjct: 386 KQSLLDKAFKGEL 398 >gi|86152966|ref|ZP_01071171.1| HsdS [Campylobacter jejuni subsp. jejuni HB93-13] gi|23452710|gb|AAN33129.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452723|gb|AAN33134.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452725|gb|AAN33135.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452728|gb|AAN33136.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452741|gb|AAN33141.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452803|gb|AAN33175.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|23452806|gb|AAN33177.1| putative type I specificity subunit HsdS [Campylobacter jejuni] gi|85843851|gb|EAQ61061.1| HsdS [Campylobacter jejuni subsp. jejuni HB93-13] Length = 398 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 60/411 (14%), Positives = 127/411 (30%), Gaps = 30/411 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+V ++ + G+ G++++ YI + D + L ++ Sbjct: 4 LPQGWEVKKLEEIANIKGGKRLPKGENLLDNNTKFAYIRVADFQDNGTINLQNIKFINEN 63 Query: 74 DTS--TVSIFAKGQILYGKLGPYLRKAIIADF-DGICSTQFLV---LQPKDVLPELLQGW 127 + + G + II +G T+ V ++ + + + Sbjct: 64 TYNVLKNYKIYDDNLYISIAGTIGKSGIIPKELNGAILTENAVKLEYIQNNISNKFMYFF 123 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 LS +I+ + + I +P+PPL EQ I + +ID I Sbjct: 124 TLSNIFKTQIQTSTKIVAQPKLAITRLKQIQIPLPPLKEQERIVGILDESFAKIDESIKI 183 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + + L E Q+ + + + +P WE K + ++ Sbjct: 184 LEQDLLNLDELMQSALQKAFNPLKD--------NAKENYKLPQSWEWKSLEEISENISAG 235 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 K + I N + I+ P I + + Sbjct: 236 GDKPKNCTESKTAKNQIPVYANGVNNNGLVGYTDKATIIKPS----LTISARGTIGFVCI 291 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + I+ + + + YL + + L K L + Sbjct: 292 RKEPYFPIVRLISLIPCENILCLHYLYFCLN-----FFIAKGEGSSIPQLTIPKFKSLQI 346 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 347 PLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLDKAFKGE 397 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 65/193 (33%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ ++ ++ G + ++ Y N+ + Sbjct: 215 KLPQSWEWKSLEEISENISAGGDKPKN----CTESKTAKNQIPVYANGVNNNGLVGYTDK 270 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K + G I + + + L P + + L + + E Sbjct: 271 ATIIKPSLTISARGTIGFVCIRKEPYFPI-VRLISLIPCENILCLHYLYFCLNFFIAKGE 329 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+++ ++ +P+PPL EQ I E + + L + ++ +E Sbjct: 330 ----GSSIPQLTIPKFKSLQIPLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEEL 385 Query: 199 KQALVSYIVTKGL 211 KQ+L+ L Sbjct: 386 KQSLLDKAFKGEL 398 >gi|332535595|ref|ZP_08411363.1| type I restriction-modification system, specificity subunit S [Pseudoalteromonas haloplanktis ANT/505] gi|332034979|gb|EGI71500.1| type I restriction-modification system, specificity subunit S [Pseudoalteromonas haloplanktis ANT/505] Length = 877 Score = 135 bits (339), Expect = 1e-29, Method: Composition-based stats. Identities = 57/418 (13%), Positives = 133/418 (31%), Gaps = 32/418 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLED---VESGTGKYLPKDGNS 70 IP W + R T+ +G T I +I L D +++G K+ + Sbjct: 9 QIPDSWTYDLLDRLTERVSGHTPSKSYPEYWNGGIKWISLADTFRLDNGYVYETDKEISQ 68 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + S+ + ++ + + ++A+ + + + Sbjct: 69 EGLNNSSAQLHPAETVVLSRDAGIGKSGVMAEPMAVSQHFIAWKCDNEKKMNSWFLYNWL 128 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 E G+T+ + + PP EQ I + + D I+ R Sbjct: 129 QFHKSEFERQAVGSTIKTIGLPFFKKLKIAAPPYKEQRKIAQIL----STWDKAISTTER 184 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL-VPDHWEVKPFFALVTELNRKNT 249 I+ K +K+AL+ ++T + DSG + G + + Sbjct: 185 LIDNSKYQKKALMQQLLT---GKKRLLDDSGKRFDGEWDEKRISELGEISSGGTPSTSKP 241 Query: 250 KLIESNILSLSYGNIIQKLETRNMG------LKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + + NI ++ +I ++ L + +++ G ++ + Sbjct: 242 EYWDGNITWVTPTDITKQDNIYIESSVRQVSLDGVKNSSAKLLPKGTLLVCTRATIGEMA 301 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + P+ + + + ++ K+ L + Sbjct: 302 V-----SSHEMSTNQGFKNIVPNENTNIEFVYYLLNFYKHKLISKASGSTFLELSKSAFE 356 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ +P +EQ I V+ ID+L Q + LK + + + +TG+ + Sbjct: 357 QMEFHIPEYQEQHKIATVLLKADHEIDIL----RQQLADLKHEKKALMQQLLTGKRRV 410 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 74/209 (35%), Gaps = 11/209 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESY 280 E + + + V+ + I +S + + E Sbjct: 8 EQIPDSWTYDLLDRLTERVSGHTPSKSYPEYWNGGIKWISLADTFRLDNGYVYETDKEIS 67 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM-ERGIITSAYMAVKP---HGIDSTYLAWL 336 + ++ + + + + VM E ++ ++A K ++S +L Sbjct: 68 QEGLNNSSAQLHPAETVVLSRDAGIGKSGVMAEPMAVSQHFIAWKCDNEKKMNSWFLYNW 127 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 ++ + A+GS +++ K+L + PP KEQ I +++ D + Sbjct: 128 LQFHKSEFERQAVGS-TIKTIGLPFFKKLKIAAPPYKEQRKIAQILSTW----DKAISTT 182 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDLRGES 425 E+ I K ++ + + +TG+ L +S Sbjct: 183 ERLIDNSKYQKKALMQQLLTGKKRLLDDS 211 >gi|254225986|ref|ZP_04919587.1| type I restriction modification DNA specificity domain protein [Vibrio cholerae V51] gi|125621520|gb|EAZ49853.1| type I restriction modification DNA specificity domain protein [Vibrio cholerae V51] Length = 466 Score = 135 bits (339), Expect = 1e-29, Method: Composition-based stats. Identities = 75/439 (17%), Positives = 156/439 (35%), Gaps = 42/439 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDT 75 +PK W I + +L G +S I + +V+ G + + Sbjct: 3 QLPKGWVCTSISQCFELKNGYAFKSSDYTEDGDFVIRIGNVQDGHIILSNPAYVAAEKLG 62 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFL-VLQPKDVLPELLQGWLLSID 132 + +G IL G R +++ + + + + V L L + Sbjct: 63 ADSFKLNEGDILISLTGNVGRIGMVSKEHLPAVLNQRVAKICVVNSVEIRWLFYLLRTRL 122 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 Q + ++ +GA + K I + +PPLAEQ I EK+ ++DT+ Sbjct: 123 FQQHVLSLAKGAAQLNISTKDIQSFDFALPPLAEQTRIVEKLDEVLAQVDTIKARLDGIP 182 Query: 193 ELLKEKKQALVSYIVTKGLNPDVK--------------------MKDSGIEWVGLVPDHW 232 +LK +Q++++ V+ L + + + DS + + +P W Sbjct: 183 AILKRFRQSVLAAAVSGKLTEEWRQLNPNQPSHPKVGKVKYKTDLFDSASKSLPELPPEW 242 Query: 233 EVKP----FFALVTELNRKNTKLIESNILSLSYGNIIQK------LETRNMGLKPESYET 282 V P + + S L L N+ + + + L Sbjct: 243 LVIPAAHLLEYVTSGSRGWANYYASSGALFLRMSNVRYDTTKLDLSDLQYVNLPENVEGK 302 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYD 341 +V ++V R +E + +P ID+ +LA + S + Sbjct: 303 RSLVKENDLVISITADVGRVA--RVDSEIEEAYVNQHLALARPASHIDAEFLAKCIASVN 360 Query: 342 L-CKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + K A+ G + L +D++ + + P + EQ +I +++ A D + ++++ Sbjct: 361 IGIKQVQALKRGATKAGLGLDDIRSMAIPFPHLAEQKEIVRLVDQYFAFADTIEALVKKA 420 Query: 400 IVLLKERRSSFIAAAVTGQ 418 + + S +A A G+ Sbjct: 421 QARVDKLTQSILAKAFRGE 439 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 76/224 (33%), Gaps = 16/224 (7%) Query: 9 QYK----DSGVQWIGAIPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESG 59 +YK DS + + +P W V+P + T + + +++ + +V Sbjct: 222 KYKTDLFDSASKSLPELPPEWLVIPAAHLLEYVTSGSRGWANYYASSGALFLRMSNVRYD 281 Query: 60 TGKYLPKDGNSRQSDTS---TVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVL 114 T K D + S+ + ++ R A + + + Sbjct: 282 TTKLDLSDLQYVNLPENVEGKRSLVKENDLVISITADVGRVARVDSEIEEAYVNQHLALA 341 Query: 115 QPKDVLPELL--QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 +P + + ++++A+ GAT + I ++ +P P LAEQ I Sbjct: 342 RPASHIDAEFLAKCIASVNIGIKQVQALKRGATKAGLGLDDIRSMAIPFPHLAEQKEIVR 401 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 + DT+ + + + Q++++ L P Sbjct: 402 LVDQYFAFADTIEALVKKAQARVDKLTQSILAKAFRGELVPQDP 445 >gi|288457861|ref|YP_003422729.1| restriction modification system DNA specificity domain protein [Zymomonas mobilis subsp. mobilis ZM4] gi|285026836|gb|ADC33926.1| restriction modification system DNA specificity domain protein [Zymomonas mobilis subsp. mobilis ZM4] Length = 424 Score = 135 bits (339), Expect = 1e-29, Method: Composition-based stats. Identities = 55/408 (13%), Positives = 127/408 (31%), Gaps = 32/408 (7%) Query: 27 VVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ + G + + +I + D + Sbjct: 18 WMPLGEIASVQRGSSPRPISKFITSDKNGVPWIKIGDTTPKSKYVTKTAEKITPDGAKKS 77 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQRI 137 + +KG + + R I+ I V + K L +L S V Sbjct: 78 RLLSKGDFIISNSMSFGRPYILGIDGAIHDGWASVSEFKSKLNSDFLYHYLSSHSVQNYW 137 Query: 138 EAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIR 190 ++S+ + K I ++ +PIP LA Q I + T L E Sbjct: 138 LTKINSGSVSNLNSKLIQSLLIPIPCPDDPAKSLAIQEEIVRILDTFTELTAELTAELTA 197 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + K++ ++T + G+E++ + + F + Sbjct: 198 ELTQRKKQYNHYREQLLTFDED--------GVEYLPMGDERVG--KFIRGGGLQKKDFIS 247 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 I R E + ++ PG +V +D A Sbjct: 248 SGVGCIHYGQIYTHYGTHTGRTKSYVSEDFARKARMAKPGNLVIATTSENDDDVCKAVAW 307 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVL 368 + +R I S+ H ++ ++++ ++ +G + + +++ ++ + Sbjct: 308 LGDRDIAVSSDACFYAHKLNPKFVSYFFQTEQFQVQKRPYITGTKVRRVNADNLAKILIP 367 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +P ++EQ I +++ L E + + I L ++ R ++ Sbjct: 368 IPSLEEQARIAAILDKFDTLTSSLTEGLPREIALREKQYAYYRDQLLS 415 >gi|312115544|ref|YP_004013140.1| restriction modification system DNA specificity domain protein [Rhodomicrobium vannielii ATCC 17100] gi|311220673|gb|ADP72041.1| restriction modification system DNA specificity domain protein [Rhodomicrobium vannielii ATCC 17100] Length = 418 Score = 134 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 77/426 (18%), Positives = 139/426 (32%), Gaps = 25/426 (5%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLN---TGRTSE--SGKDIIYIGLEDVESGTGK 62 P YK + V G IP W+V +L RT + + + +V G Sbjct: 5 PGYKQTEV---GIIPNEWQVTTAANICELVVDCKNRTPPLCNDESFAVVRTPNVRDGQFV 61 Query: 63 YLPKDGNSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPKDV 119 + + G I + P + D + ++ +P V Sbjct: 62 REDLRYTDLSSFIKWTERATPRTGDIFITREAPLGEVCMAPSDLKVCLGQRMMMYRPDTV 121 Query: 120 L--PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L LLS V + + G+T+ HA I + +P+P + EQ I + Sbjct: 122 NVTSSFLLYALLSEQVRKNLLEKVGGSTVGHAKVDDIRFLTVPLPSMEEQRAIAAALSDA 181 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 I R I ++ KQA + ++T I + V Sbjct: 182 D----EWIARLDRLIAKKRDIKQAAMQQLLTGKTRLPGFKGAWTIATLRDVCGFENGDRG 237 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 ++ + N + G I ++ K +S + PG+I+F Sbjct: 238 GNYPSKADFTEGGYAFINAGHVRDGKIDKRSLDFITKEKYDSLGGGKFF-PGDILFCLRG 296 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQS 356 K + I +S + + +L +S K+ G + + Sbjct: 297 SLG-KFGVVDGDSGAGAIASSLIIVRPRANVSPRFLVSYFKSDLCKKMIEKWAGGAAQPN 355 Query: 357 LKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 L +D+ R + +PP +EQ I + I+ A ID L E + + + + Sbjct: 356 LGGQDLARFQIYLPPTFEEQDAIGSAISDTDAEIDQL----EAKRDKARSIKQGMMQELL 411 Query: 416 TGQIDL 421 TG++ L Sbjct: 412 TGRVRL 417 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 77/219 (35%), Gaps = 22/219 (10%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 K + + + LV + + L ++ ++ + L+ Sbjct: 7 YKQTEVGIIPNEWQVTTAANICELVVDCKNRTPPLCNDESFAVVRTPNVRDGQFVREDLR 66 Query: 277 PESYETYQI------VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID- 329 ++ G+I + + A + + M +P ++ Sbjct: 67 YTDLSSFIKWTERATPRTGDIFITREAPLGE---VCMAPSDLKVCLGQRMMMYRPDTVNV 123 Query: 330 -STYLAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI---NV 384 S++L + + S + K + +G K +D++ L V +P ++EQ I + + Sbjct: 124 TSSFLLYALLSEQVRKNLLEKVGGSTVGHAKVDDIRFLTVPLPSMEEQRAIAAALSDADE 183 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 AR+D L+ K ++ + + + +TG+ L G Sbjct: 184 WIARLDRLIAKK-------RDIKQAAMQQLLTGKTRLPG 215 >gi|289664155|ref|ZP_06485736.1| specificity determinant for hsdM and hsdR [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 450 Score = 134 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 81/420 (19%), Positives = 154/420 (36%), Gaps = 30/420 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL-PKDGNSRQSDTSTV 78 +P W I R E+ + + YI + V+ G + P+ ++ + Sbjct: 3 ELPAGWVSASIGEICSQGEQRIPEADEQLTYIDIASVDRGRKTVMGPQLLRGYEAPSRAR 62 Query: 79 SIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + A G ++ P L + I ST F VL+P +V P + + S + Sbjct: 63 KVVATGDVIVSMTRPNLNAVALIGQRHDSAIASTGFDVLRPIEVDPRWIFAAVKSAHFVK 122 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + A +GA I +P+PPLAEQ I +K+ A ++DTL LL Sbjct: 123 AMSAKVQGALYPAIKADDIRKHEIPLPPLAEQKRIAQKLDALLAQVDTLKARIDAIPALL 182 Query: 196 KEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK-- 250 K ++++V V L+ D K E +G + + W +L K+ Sbjct: 183 KRFRKSVVHSAVIGRLSADLRVPIEKPEEQEQLGPL-ELWREVALASLGELSRGKSKHRP 241 Query: 251 -----LIESNILSLSYGNIIQK---LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 L S + G++ L + + + ++ G + D Sbjct: 242 RNDSRLYGSAYPFIQTGDVANSRGTLTSSKVFYSEFGLKQSRLFPSGTLCITIAANIADT 301 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFED 361 L ++ + ++ +++ D + A+ + ++++ + Sbjct: 302 AMLAIDACFPDSVVG---FIPNKDDCVAQFIKYVI--DDNKESLEALAPATAQKNINLKV 356 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEK---IEQSIVLLKERRSSFIAAAVTGQ 418 + ++ + +PPIKEQ +I + A D L K +Q I L S +A A G+ Sbjct: 357 LSQVKLRIPPIKEQTEIVRRVEQLFAYADQLEAKVAAAQQRIDALT---QSLLAKAFRGE 413 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 71/152 (46%), Gaps = 9/152 (5%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 ++V G+++ + +L Q + I ++ + ++P +D ++ ++S Sbjct: 62 RKVVATGDVIVSMTRPNLNAVAL-IGQRHDSAIASTGFDVLRPIEVDPRWIFAAVKSAHF 120 Query: 343 CKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 K A G ++K +D+++ + +PP+ EQ I ++ A++D L +I+ Sbjct: 121 VKAMSAKVQGALYPAIKADDIRKHEIPLPPLAEQKRIAQKLDALLAQVDTLKARIDAIPA 180 Query: 402 LLKERRSSFIAAAVTGQ----IDL---RGESQ 426 LLK R S + +AV G+ + + + E Q Sbjct: 181 LLKRFRKSVVHSAVIGRLSADLRVPIEKPEEQ 212 >gi|254470121|ref|ZP_05083525.1| restriction modification system DNA specificity domain protein [Pseudovibrio sp. JE062] gi|211960432|gb|EEA95628.1| restriction modification system DNA specificity domain protein [Pseudovibrio sp. JE062] Length = 492 Score = 134 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 68/451 (15%), Positives = 146/451 (32%), Gaps = 54/451 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W I+ ++ G + + +I + D SG + + Sbjct: 3 ELPEGWVETEIENIYEVARGGSPRPIKSYLTADDDGLNWIKISDATSGGYRIESTEQKIT 62 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWL 128 + G +L + + I+ +G +LV K V + L Sbjct: 63 SEGLHKTRLIYPGDLLLSNSMSFGKP-YISAIEGCIHDGWLVLGGFGKKCVDTRYMHLAL 121 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S V ++ + G+T+ + + + ++ +P+ PLAEQ I KI + T + Sbjct: 122 SSEGVQKQFDEKASGSTVRNLNTGIVNSVRVPLAPLAEQKRIVAKIESLTAKSRIARENL 181 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDS--------------GIEWVGL------- 227 R L K KQA++ + L D + K S + W Sbjct: 182 ARIDTLTKRYKQAILKKAFSGELTADWREKSSKDCLIDLNDVLKEHEVIWQNNIAKKGKY 241 Query: 228 -VPDHWEVKPFFALV------------TELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 P+ + + + I + G++ + G Sbjct: 242 ARPNVKPADDLRSWHELSLEGLAYVVDPHPSHRTPPKEIGGIPYVGVGDVKLDGKLDFAG 301 Query: 275 LKP------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + + + G+ + I L AQ + + Sbjct: 302 ARKVSPKVLKDHLKRYSLKRGDFAYGKIGTIGQPFLLPEAQEY-ALSANVILIQPRSKFA 360 Query: 329 DSTYLAWLMRSYDL-CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + +L + S + K+ A + + + + ++ + +P + EQ +I I A Sbjct: 361 TAEFLYYFFLSPVVTQKILGASVATSQAAFGIKKMREVLTPLPSLSEQNEIVTRIEKAFA 420 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +ID L E+ ++++ + +A A G+ Sbjct: 421 KIDKLAEEAKRALHSVDRLDEKILAKAFRGE 451 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 74/204 (36%), Gaps = 11/204 (5%) Query: 24 HWKVVPIKRFTKLN----TGRTSESG-KDIIYIGLEDVE-SGTGKYL--PKDGNSRQSDT 75 W + ++ + + RT I Y+G+ DV+ G + K D Sbjct: 254 SWHELSLEGLAYVVDPHPSHRTPPKEIGGIPYVGVGDVKLDGKLDFAGARKVSPKVLKDH 313 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQGWLLSID 132 +G YGK+G + ++ + + + + K E L + LS Sbjct: 314 LKRYSLKRGDFAYGKIGTIGQPFLLPEAQEYALSANVILIQPRSKFATAEFLYYFFLSPV 373 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 VTQ+I + + K + + P+P L+EQ I +I +ID L E R + Sbjct: 374 VTQKILGASVATSQAAFGIKKMREVLTPLPSLSEQNEIVTRIEKAFAKIDKLAEEAKRAL 433 Query: 193 ELLKEKKQALVSYIVTKGLNPDVK 216 + + +++ L P Sbjct: 434 HSVDRLDEKILAKAFRGELVPQDP 457 >gi|212639882|ref|YP_002316402.1| Restriction endonuclease S subunit [Anoxybacillus flavithermus WK1] gi|212561362|gb|ACJ34417.1| Restriction endonuclease S subunit [Anoxybacillus flavithermus WK1] Length = 416 Score = 134 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 68/418 (16%), Positives = 134/418 (32%), Gaps = 33/418 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQS-- 73 W + ++ G T K I +I +D+ +Y+ + N+ Sbjct: 2 SEWINCTLGDIAEVIGGGTPSKSKPEYYEGGTIPWITPKDLSGYPYRYIERGENNITELG 61 Query: 74 -DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S+ + KG +L+ P IA + F + L + Sbjct: 62 LAKSSARMLPKGAVLFSSRAPI-GYVAIAKNPLCTNQGFKSFICDEKKVNNLFLYYFLKS 120 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 IE + G+T IP+ +PPL Q I I + +I+ + Sbjct: 121 NLPMIENMANGSTFKEISGSVAKTIPISLPPLNIQEKIVSIIGSLDDKIELNLKMNETLG 180 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL------NR 246 E+ + + V G D + +S +G++P W+ K L R Sbjct: 181 EMAMTLYK---HWFVDFGPFQDGEFVES---ELGMIPKGWKAKKLGDLYDTSSGGTPSRR 234 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKR 303 K + I L + E + ++ ++ K Sbjct: 235 KTEYYQDGTINWLKTKELNDNFIFETEEKITELGLENSSAKVFPKNTVIIAMYGATVGKL 294 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + S ++ + S LA+L ++ K+ G +Q++ + ++ Sbjct: 295 GILSEPSSTNQAC---CAVIEKNQSFSYVLAYLYLLFNRTKIVGLANGGAQQNINQQIIR 351 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L ++VP N+I + + L+ EQ L R + ++G+ID+ Sbjct: 352 DLLIVVPT----EKALNIIQPKLLVLFELIRTNEQENRYLINLRDYLLPRLLSGEIDV 405 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 66/194 (34%), Gaps = 7/194 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNS 70 +G IPK WK + ++G T I ++ +++ + Sbjct: 207 LGMIPKGWKAKKLGDLYDTSSGGTPSRRKTEYYQDGTINWLKTKELNDNFIFETEEKITE 266 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + S+ +F K ++ G + K I + + K+ + +L Sbjct: 267 LGLENSSAKVFPKNTVIIAMYGATVGKLGILSEPSSTNQACCAVIEKNQSFSYVLAYLYL 326 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +I + G + + + I ++ + +P +I+ K++ I T E Sbjct: 327 LFNRTKIVGLANGGAQQNINQQIIRDLLIVVPTEKALNIIQPKLLVLFELIRTNEQENRY 386 Query: 191 FIELLKEKKQALVS 204 I L L+S Sbjct: 387 LINLRDYLLPRLLS 400 >gi|161503349|ref|YP_001570461.1| hypothetical protein SARI_01422 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. RSK2980] gi|160864696|gb|ABX21319.1| hypothetical protein SARI_01422 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 412 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. Identities = 71/422 (16%), Positives = 159/422 (37%), Gaps = 36/422 (8%) Query: 23 KHWKVVPIKRFTK--LNTGRTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 K WK V +K + G + +S +GL + G + + Sbjct: 4 KDWKSVTLKELLDGPIKNGYSPNATDSETGYWVLGLGAL-GDEGINSSEIKPVLPEERVL 62 Query: 78 VSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 +I L + R + CS L+++ + + + ++ + Sbjct: 63 QNILRTDDFLVSRSNTPDKVGRSIRFRNEIENCSYPDLMMRFRIDENKADKAFIEHQLKS 122 Query: 135 QRIEAICE------GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + +TM + + P+ +PP+ EQ I + + D I+ Sbjct: 123 AAVRTYFKNCAAGSSSTMVKINKGILEKTPLVVPPVKEQKKIAQIL----STWDKAISVT 178 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + +++K+AL+ +++ + ++G+ + G WEV L+ E ++N Sbjct: 179 EKLLTNSQQQKKALMQQLLS---GKKRLLDENGVMFSGE----WEVVRLKQLIHEEKKRN 231 Query: 249 TKLIESNILSL-SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 +LS+ ++ + E + + E TY+IV + + L S Sbjct: 232 RDNHIQRVLSVTNHSGFVLPEEQFSKRVASEDVSTYKIVKKNQYGYNPSRLN--VGSFAR 289 Query: 308 AQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKR 364 + G+++ Y+ + +S Y M S + + G +R S+ F+ + Sbjct: 290 LDNYDEGVLSPMYVVFSINHERLNSDYFLNWMSSNEAKQRIAGSTQGSVRDSVGFDALCS 349 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +P + EQ I V++ A + +E+ + LKE + + + +TG+ ++ E Sbjct: 350 FSFSLPTLMEQQKIAAVLSAADAE----MSMLEKKLACLKEEKKALMQQLLTGKRRVKVE 405 Query: 425 SQ 426 S+ Sbjct: 406 SE 407 >gi|292491161|ref|YP_003526600.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] gi|291579756|gb|ADE14213.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] Length = 406 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. Identities = 66/423 (15%), Positives = 130/423 (30%), Gaps = 47/423 (11%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESG 59 P YK + IG IP+ W V + L G+ S G DI +I DV + Sbjct: 21 PGYKRTE---IGVIPEDWAVRYLGDIALLERGKFSARPRNDPKFFGGDIPFIQTGDVTNS 77 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 G + +F + + + + + +A F+ C + + PK Sbjct: 78 NGSIISYSQTLNDEGLRVSKLFPRNTLFFT-IAANIGDVGVASFETACPDSLIAIFPKPN 136 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + ++ E + + + + + + +PPL EQ I + Sbjct: 137 VEKRWL-FNALRSQKKKFEGLATQNAQLNINLEKLNPYLLALPPLPEQRAIAAALSDLDA 195 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 I L + + Q L+ +G + + WEVK Sbjct: 196 LIAALDKLIAKKRAIKTAAMQQLL----------------TGKQRLPGFEGEWEVKRLGD 239 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + K ++ + Q +E N S++ I+ PGE Sbjct: 240 VSVVKTGKKNNEDKAEDGKYPFFVRSQTVERINTY----SFDGEAILVPGE--------- 286 Query: 300 NDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 S+ + Y ++ + ++ + + + SL+ Sbjct: 287 GGIGSIFHYVNGKFDYHQRVYKISNFAADTNGKFIYYCLLQTFNKQAMRNSVKATVDSLR 346 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L P EQ I +++ A I L + + + + +TG+ Sbjct: 347 LPTFIEFEFLAPCFDEQQAIATILSDMDAEITTLEARR----DKTQAIKQGMMQELLTGR 402 Query: 419 IDL 421 I L Sbjct: 403 IRL 405 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 74/212 (34%), Gaps = 19/212 (8%) Query: 224 WVGLVPDHWEVKPFFALVTELNRK-------NTKLIESNILSLSYGNIIQK---LETRNM 273 +G++P+ W V+ + K + K +I + G++ + + + Sbjct: 27 EIGVIPEDWAVRYLGDIALLERGKFSARPRNDPKFFGGDIPFIQTGDVTNSNGSIISYSQ 86 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 L E ++ + F D E S ++ +L Sbjct: 87 TLNDEGLRVSKLFPRNTLFFTIAANIGDVGVAS----FETACPDSLIAIFPKPNVEKRWL 142 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 +RS A + + ++ E + + +PP+ EQ I ++ +D L+ Sbjct: 143 FNALRSQKKKFEGLATQN-AQLNINLEKLNPYLLALPPLPEQRAIAAALSD----LDALI 197 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +++ I + +++ + +TG+ L G Sbjct: 198 AALDKLIAKKRAIKTAAMQQLLTGKQRLPGFE 229 >gi|189499314|ref|YP_001958784.1| restriction modification system DNA specificity domain [Chlorobium phaeobacteroides BS1] gi|189494755|gb|ACE03303.1| restriction modification system DNA specificity domain [Chlorobium phaeobacteroides BS1] Length = 430 Score = 134 bits (337), Expect = 3e-29, Method: Composition-based stats. Identities = 84/416 (20%), Positives = 160/416 (38%), Gaps = 32/416 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 +DS + I ++P WK + G + YIGLE + G ++ + Sbjct: 35 MRDSNLL-IESLPDRWKNHKFGDLCDRVKNSYQPVDGGEKPYIGLEHLAQGFPAFIGRG- 92 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 S+ ++F G IL+GKL PYLRK ADFDGICST LV + K + ++ Sbjct: 93 -KECEVKSSKTVFKSGDILFGKLRPYLRKGAQADFDGICSTDILVFRAKPICESNFLRFV 151 Query: 129 LS-IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + + G W + + +PPL EQ I + + I Sbjct: 152 IHSEEFVAHAKTTTSGVRHPRTSWPLLREFYISLPPLPEQKKIAHIL----STVQRAIEA 207 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + R I+ E K+AL+ + T+GL + + + +GLVP+ WEV + + K Sbjct: 208 QDRIIQTTTELKKALMHKLFTEGLRNEPQKEA----EIGLVPESWEVVEIGDVFKFTSGK 263 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL-- 305 + S+ + Y +++ ++ + L Sbjct: 264 TKPKDTAPEPSVERTVPVYGGNGV------LGYSAQSLLNEDVLILGRVGEYCGCAHLTK 317 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + V + + Y + ++ +Y +L + MG + + + R+ Sbjct: 318 PVSWVTDNAL----YAKEEKRSVNRSYARTHFAHLNLNQYSNKMG---QPLITQGIINRV 370 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P +EQ ++ N +D +E+I L++ + + +T +I++ Sbjct: 371 KFGLPSREEQDELAN----AFETLDTRIEQINAKKKSLQDLFHTLLHELMTAKINV 422 >gi|253687261|ref|YP_003016451.1| restriction modification system DNA specificity domain protein [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753839|gb|ACT11915.1| restriction modification system DNA specificity domain protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 413 Score = 134 bits (337), Expect = 3e-29, Method: Composition-based stats. Identities = 60/415 (14%), Positives = 123/415 (29%), Gaps = 34/415 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P W + L G L G + + + Sbjct: 18 VPAGWLQCKLGDVLTLQRG-----------FDLPQRLRKEGNIPIISSSGESGWHNNAIV 66 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G I+ G+ G I +T V + K + P L ++D Sbjct: 67 SPPG-IVTGRYGTIGEVFFIDKPFWPLNTTLYVREFKGITPSYAYFLLKTVDFQSHSGKS 125 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + + + +PP+ EQ+ I + I L + + Q Sbjct: 126 ----GVPGVNRNDVHQENILLPPIKEQIAITTTLSNIDELISALERLLSKKQAIKTATMQ 181 Query: 201 ALVS---YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL------ 251 L++ + L D K +G +P+ W V ++ + T Sbjct: 182 QLLTGKTRLPQFALREDGAAKGYQKSELGEIPEDWTVTLLNDVIDSCSSGATPYRGISEY 241 Query: 252 ---IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 I S + + Y +I G + L+ Sbjct: 242 YKGNNRWITSGELNYCVINDTIEKISDSAIKYTNLKIHPAGTFLMAITGLEAAGTRGACG 301 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPV 367 V + + MA+ P+ + + Y+ + + G +Q S ++++P+ Sbjct: 302 IVGKPSATNQSCMAIYPNNKLDSNYLYHWYVYNGDTLAFKYCQGTKQLSYTAGLIRKIPL 361 Query: 368 LVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P KEQ I +++ I L +Q + ++ + + +TG+ L Sbjct: 362 FLPTDKKEQTAIAAILSDMDKDIQTL----QQRLEKTRQLKQGMMQELLTGKTRL 412 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 61/207 (29%), Gaps = 15/207 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSESG------KDIIYIGLEDVESGTGK 62 Y+ S +G IP+ W V + ++G T G + +I ++ Sbjct: 204 YQKSE---LGEIPEDWTVTLLNDVIDSCSSGATPYRGISEYYKGNNRWITSGELNYCVIN 260 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKD 118 + + + + I G L G I + + + P + Sbjct: 261 DTIEKISDSAIKYTNLKIHPAGTFLMAITGLEAAGTRGACGIVGKPSATNQSCMAIYPNN 320 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAE 177 L + C+G I IP+ +P EQ I + Sbjct: 321 KLDSNYLYHWYVYNGDTLAFKYCQGTKQLSYTAGLIRKIPLFLPTDKKEQTAIAAILSDM 380 Query: 178 TVRIDTLITERIRFIELLKEKKQALVS 204 I TL + +L + Q L++ Sbjct: 381 DKDIQTLQQRLEKTRQLKQGMMQELLT 407 >gi|223940843|ref|ZP_03632673.1| restriction modification system DNA specificity domain protein [bacterium Ellin514] gi|223890493|gb|EEF57024.1| restriction modification system DNA specificity domain protein [bacterium Ellin514] Length = 405 Score = 134 bits (336), Expect = 3e-29, Method: Composition-based stats. Identities = 59/418 (14%), Positives = 136/418 (32%), Gaps = 34/418 (8%) Query: 20 AIPKHWKVVPIKRFTKL-NTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W+V+P + G ++ + D I +G++++ G D S Sbjct: 2 KLPTEWRVLPFGEVVEHSQYGISTPTSPDGTIPILGMKNINDGQVVVGNPDRVSITEAVL 61 Query: 77 TVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVL----PELLQGWLL 129 G +L+ + + + + +LV + + Sbjct: 62 AKQRLKDGDLLFNRTNSLDLVGKTGLFRESGDFVCASYLVRFRLRRNLVDPRYVCYLFNT 121 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGN-IPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + + ++ + + +P+PP EQV I + + D I Sbjct: 122 SHSQRIMRQLATKAVAQANINPTSLQRKFLLPLPPRQEQVAIADLLEF----WDDDICRT 177 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + E K+ L+ ++T G + K W +V R Sbjct: 178 ESRLGKKLEFKRGLMQQLLT-GQTQFKEFKG----------KPWRKLHLGDIVNFEPRVV 226 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 K + + + + R+ + + + ++ ++V ++ + Sbjct: 227 PKPKGAFLAAGIRSHGKGVFLKRDFEAEDIALDELFVLRADDLVVNITFGWEGAAAIVPS 286 Query: 309 QVMERGIITSA-YMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQS-LKFEDVKR 364 + + KP Y +++ + G R L + R Sbjct: 287 EADGALVSHRFPTFTFKPAVSFPGYFRHVIKQKRFVHAMGLASPGGAGRNRVLSKTEFMR 346 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +P+ +P + EQ I V+N D +E +++ + LKE++ + +TG+I ++ Sbjct: 347 IPIDLPSMAEQERIATVLND----CDREIELLQKQLDALKEQKRGLMQKLLTGEIRVK 400 >gi|294502094|ref|YP_003566159.1| Type I restriction-modification system, S subunit [Salinibacter ruber M8] gi|294342078|emb|CBH22743.1| Type I restriction-modification system, S subunit [Salinibacter ruber M8] Length = 494 Score = 134 bits (336), Expect = 3e-29, Method: Composition-based stats. Identities = 58/416 (13%), Positives = 126/416 (30%), Gaps = 27/416 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G +P WK+ + + + G + S G + + Sbjct: 77 LGRVPDDWKIRSLPKVAVIEMGSSPPSATYNEEGEGLPFYQGNADFGHMKPKVSTWCSDP 136 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 V + +L P IAD L+P V L + ++ + Sbjct: 137 VKTADRDDVLISIRAPV-GDLNIADEHCCIGRGLAALRPNGVN--GLYLYYGLAQRSRWL 193 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G+T + + +P+PPL EQ I + +D I + IE + Sbjct: 194 ARLASGSTFKSVSSADLEKVDLPVPPLPEQRKIASVL----YAVDQAIQKTEAIIEQAQR 249 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + L+ + G+ + K + L + + Sbjct: 250 VSRGLIEKLTMWGIG-HSEFKKIDVTPKFLDVEVPRKWEKVSYAEVTENITYGFTNPMPE 308 Query: 258 SLSYGNIIQKLETRNMGLKPESYET------------YQIVDPGEIVFRFIDLQNDKRSL 305 S I + R + + + G ++ + Sbjct: 309 SDYGRWRITAKDIREGKIHYDEAGKTTEEAYRERLTGKSRPEVGNVLVTKDGTLGRVGVV 368 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKR 364 + + S + K I S YLA ++S + K+ + +K ++ Sbjct: 369 DRQGICINQSVAS--IRPKKEKITSEYLALTIKSPLVKKLIKSHNPQTTIGHIKISELAE 426 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 +PP++EQ +I +++ +I K ++ L+ + + +TG++ Sbjct: 427 WEFPLPPVEEQNEIVRIVDSVREKIQNERNKKQR----LQRLKKGLMQDLLTGEVR 478 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 75/213 (35%), Gaps = 20/213 (9%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 + +G VPD W+++ + + N + +M K ++ Sbjct: 73 EVFGLGRVPDDWKIRSLPKVAVIEMGSSPPSATYNEEGEGLPFYQGNADFGHMKPKVSTW 132 Query: 281 ETYQIV--DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + D +++ D E I A++P+G++ YL + + Sbjct: 133 CSDPVKTADRDDVLISIRAPVGDL-----NIADEHCCIGRGLAALRPNGVNGLYLYYGL- 186 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + +S+ D++++ + VPP+ EQ I +V+ D ++K E Sbjct: 187 AQRSRWLARLASGSTFKSVSSADLEKVDLPVPPLPEQRKIASVLYAV----DQAIQKTEA 242 Query: 399 SIVLLKERRSSFIAAA-VTG-------QIDLRG 423 I + I + G +ID+ Sbjct: 243 IIEQAQRVSRGLIEKLTMWGIGHSEFKKIDVTP 275 >gi|166711005|ref|ZP_02242212.1| type I restriction enzyme specificity protein [Xanthomonas oryzae pv. oryzicola BLS256] Length = 451 Score = 134 bits (336), Expect = 3e-29, Method: Composition-based stats. Identities = 69/419 (16%), Positives = 139/419 (33%), Gaps = 28/419 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W I + + +I ++ + + L + + Sbjct: 4 ELPGGWVETTIGEICAMGPKSAWDDDMEIGFVPMSHAPTNFRGPLNYEARRWHEVKKAYT 63 Query: 80 IFAKGQILYGKLGPYLR------KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI-- 131 F +++ K+ P A + + G S++F VL+ +D + Sbjct: 64 HFENDDVIFAKVTPCFENGKAALVAGLPNGAGAGSSEFHVLRRRDAGISPSYLLAVIKSA 123 Query: 132 -DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + E + + + N P+ +PP AEQ I +K+ A ++DT Sbjct: 124 QFLREGEENMTGAIGLRRVPRAFVENFPVRLPPEAEQKRIAQKLDALLAQVDTFKARIDA 183 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 LLK +Q+++++ V+ L D W + + V Sbjct: 184 IPALLKRFRQSVINHGVSGSLALDQHASFDTTTW-----RNMRAEDVCTKVQSGGTPKEG 238 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQ------IVDPGEIVFRFIDLQNDKRS 304 I L NI+ + + + + +Q I PG+++ + K + Sbjct: 239 FTTEGIPFLKVYNIVDGIIEFEYRPQYIAADIHQGSCRKSITIPGDVLMNIVGPPLGKIA 298 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDV 362 + V E I + + I S ++ ++ + GS + ++ Sbjct: 299 VVPQGVDEWNINQAITLFRPSESISSAWIHLVLLEGTNIRRVSQETKGSAGQVNISLSQC 358 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEK---IEQSIVLLKERRSSFIAAAVTGQ 418 + VPP + Q +I + A D L K +Q I L S +A A G+ Sbjct: 359 RDFVFPVPPTQIQDEIVRRVEQLFAYADQLEAKVAAAQQRIDALT---QSLLAKAFRGE 414 >gi|312963117|ref|ZP_07777602.1| restriction modification system DNA specificity domain [Pseudomonas fluorescens WH6] gi|311282628|gb|EFQ61224.1| restriction modification system DNA specificity domain [Pseudomonas fluorescens WH6] Length = 406 Score = 134 bits (336), Expect = 3e-29, Method: Composition-based stats. Identities = 67/420 (15%), Positives = 146/420 (34%), Gaps = 41/420 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDI------IYIGLEDVESGTGKYLPKDGNSRQSDT 75 P W+V+ + K +G T + +I +D++ + Sbjct: 2 PDGWRVLELGELAKFTSGGTPSKSNESYWGGNHPWISGKDLKQHY--LSTSIDSLTDEGF 59 Query: 76 STVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSI 131 S+ + G L G L A + L P+ + L +L Sbjct: 60 SSANKAPAGSTLVLVRGMTLLKDFPVGFATKPLAFNQDLKALIPEKNVDGLFLSFLLAGN 119 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 R G D + + P+ P EQ I + + D IT + Sbjct: 120 KEKIRQLVSTAGHGTGRLDTESLKAFPVLTPKPLEQKKIAKIL----STWDQAITTTEQI 175 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ +++K+AL+ ++ G + W + L + + KNT++ Sbjct: 176 LKSSQQQKKALMQQLL------------IGKRRLSGYQRPWTMFKLEQLFSRVTTKNTEI 223 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND-KRSLRSAQV 310 + + + +I++ + N + E + Y ++ G+ + +++ Sbjct: 224 NTNVVTISAQHGLIRQEDFFNKTIASEILDNYFLLKKGQFAYNKSYSNGYPMGAIKRLNK 283 Query: 311 MERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDV 362 E+G++T+ Y+ + + + S L + + G R ++K + Sbjct: 284 YEKGVVTTLYICFEASNEAKCNPEFFEHYFESGRLNNGLSKIANEGGRAHGLLNVKPSEF 343 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 L V VP + EQ I V+N I L + + L++++ S + +TG+ ++ Sbjct: 344 FGLTVFVPEVAEQKAIATVLNTADQEIQTL----QIKLSSLRDQKKSLMQQLLTGKRRVK 399 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 62/186 (33%), Gaps = 10/186 (5%) Query: 246 RKNTKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + N N +S ++ Q L T L E + + G + + K Sbjct: 24 KSNESYWGGNHPWISGKDLKQHYLSTSIDSLTDEGFSSANKAPAGSTLVLVRGMTLLKDF 83 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQSLKFEDVK 363 + +D +L++L+ + + + + L E +K Sbjct: 84 PVGFATKPLAFNQDLKALIPEKNVDGLFLSFLLAGNKEKIRQLVSTAGHGTGRLDTESLK 143 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL-- 421 PVL P EQ I +++ D + EQ + ++++ + + + G+ L Sbjct: 144 AFPVLTPKPLEQKKIAKILSTW----DQAITTTEQILKSSQQQKKALMQQLLIGKRRLSG 199 Query: 422 --RGES 425 R + Sbjct: 200 YQRPWT 205 >gi|10954528|ref|NP_044167.1| type I restriction enzyme subunit S [Methanocaldococcus jannaschii DSM 2661] gi|12229988|sp|Q60296|T1SH_METJA RecName: Full=Putative type-1 restriction enzyme MjaXP specificity protein; Short=S.MjaXP; AltName: Full=Type I restriction enzyme MjaXP specificity protein; Short=S protein gi|1522674|gb|AAC37110.1| hypothetical protein MJ_ECL41 [Methanocaldococcus jannaschii DSM 2661] Length = 432 Score = 134 bits (336), Expect = 4e-29, Method: Composition-based stats. Identities = 69/439 (15%), Positives = 155/439 (35%), Gaps = 31/439 (7%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLE 54 M ++ ++K++ IG IPK W V IK ++ G T + G DI +I + Sbjct: 4 MVKFRWETEFKETD---IGKIPKDWDVKKIKDIGEVAGGSTPSTKIKEYWGGDIPWITPK 60 Query: 55 DVESGTGKYL---PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF 111 D+ + Y+ ++ + ++ IF KG IL P IA + F Sbjct: 61 DLANYEYIYISRGERNITEKAVKECSLRIFPKGTILLTSRAPI-GYVAIAKNPLTTNQGF 119 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 + PKD + +L I G+T + + +P P EQ I Sbjct: 120 RNIIPKDGVVSEYLYYLFKTKTMSEYLKDISGGSTFPELKGSTLKEVEIPYPSPEEQQKI 179 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + I+ + ++ E + ++ + + + + E +P Sbjct: 180 ATVLSYFDDLIENKKKQNEILEKIALELFK---NWFIDFEPFKNEEFVYND-ELDKEIPK 235 Query: 231 HWEVKPFFALVTELNRKN-----TKLIESNILSLSYGNIIQKLETRNMGLKPE---SYET 282 WEVK ++ + N + I + ++++ + + E Sbjct: 236 GWEVKRLGDILKVESGSNAPQREIYFENAKIPFVRVKHLVKGVCIESSDFINELALKDYK 295 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 ++ + I+F+ + + + + + Y + + + L Sbjct: 296 MKLYNEKSIIFQKSGESLKEARVNIVPFKFTA-VNHLAVIDSSMLNEKHYFIYCLLRFLL 354 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++ Y++ LK D++ +++PP I + + + ++ I++ Sbjct: 355 KEIVYSVKGTTLPYLKISDIENKYIIIPP----QPILQKFHSLVQPLFEKIINNQKQIMV 410 Query: 403 LKERRSSFIAAAVTGQIDL 421 LK+ R + + V G++ + Sbjct: 411 LKKIRDALLPKLVFGELRV 429 >gi|260549264|ref|ZP_05823484.1| predicted protein [Acinetobacter sp. RUH2624] gi|260407670|gb|EEX01143.1| predicted protein [Acinetobacter sp. RUH2624] Length = 396 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 59/416 (14%), Positives = 137/416 (32%), Gaps = 40/416 (9%) Query: 18 IGAIPKHWKVVPIKRFT-KLNTGRTSESGK---DIIYIGLEDV-----ESGTGKYLPKDG 68 + +P W + K+ G + + + + +V + +P+D Sbjct: 5 LYKLPDGWDWKTLGDVCFKVTDGSHNPPKEVEVGLPMLSSRNVMDNGLVWDNFRLIPEDA 64 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQG 126 + ++G +L +G R ++ + D VL ++++PE L Sbjct: 65 F---ESEHKRTRVSEGDVLLTIVGTIGRSCVVRNLDRLFTLQRSVAVLSSEELIPEFLSY 121 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + + + +G+ K + + PP+ EQ I EK+ A RID I Sbjct: 122 QFRAPFIQEHFISNAKGSAQKGIYLKQLKATYLVCPPIEEQNRIVEKLDALFTRIDIAIE 181 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 ++L K+ +++ V + G P Sbjct: 182 HLQSKLDLSKQLFDSVLDEFFKLPDCDSVPLTQVVEFIGGSQP----------------- 224 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 ++ + I+ ++ N + +S T + +++ + Sbjct: 225 PKSQFSDVQKEGYVRLIQIRDYKSDNHIVYVDSASTKKFCTKDDVMIGRYGPP-----VF 279 Query: 307 SAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGS--GLRQSLKFEDV 362 G A M P+ YL W ++S + + + + + + Sbjct: 280 QILRGLDGAYNVALMKAVPNEDLLMKDYLFWFLQSPSIQNYVIGISQRAAGQSGVNKKAL 339 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ + VP Q DI + + ++ L ++ I L + ++S + +A G+ Sbjct: 340 EKYLIPVPSKAIQNDIVDKVGQLVSKSRHLEAEVTAEIAFLSQLKASILDSAFKGE 395 >gi|158337894|ref|YP_001519070.1| restriction modification system DNA specificity subunit [Acaryochloris marina MBIC11017] gi|158308135|gb|ABW29752.1| restriction modification system DNA specificity domain [Acaryochloris marina MBIC11017] Length = 295 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 68/292 (23%), Positives = 123/292 (42%), Gaps = 15/292 (5%) Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + D+ +P+ IPP+ EQ I E + +TV +D I + R IELL+E+K Sbjct: 8 MGSGLRQNLDYTDFKYLPLTIPPIDEQRRIVEFLDRKTVELDDAIATKQRLIELLQEQKA 67 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK-------LIE 253 L++ VTKGL+P+V M D GI + VP+HW++ L L K + Sbjct: 68 ILINQAVTKGLDPNVPMCDRGIHGLEKVPNHWKLCSVKRLTQILRGKFSHRPRNDARFYG 127 Query: 254 SNILSLSYGNIIQ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + G+I Q ++ + L Y + G +V + + S+ + Sbjct: 128 GQYPFIQTGDISQAGRRITKYSQTLNARGYAVSKEFPAGTVVMVITGAKTGEVSI----L 183 Query: 311 MERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + P+ + S + M ++ +++L + + L Sbjct: 184 GFNACFPDSAVGFFPNPGEVSADFLYYMFGVLKTRLDEVSIVSTQENLNVDRIGALYTAC 243 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 PP++EQ I + ++ + + E+ I L+E R I+ AVTG+I + Sbjct: 244 PPVEEQNQIVDFLDNRLLGFETAQVRAEEQINKLQEFREILISHAVTGKIKV 295 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 31/75 (41%), Positives = 48/75 (64%) Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + KV+Y MGSGLRQ+L + D K LP+ +PPI EQ I ++ +T +D + ++ I Sbjct: 1 MLKVYYGMGSGLRQNLDYTDFKYLPLTIPPIDEQRRIVEFLDRKTVELDDAIATKQRLIE 60 Query: 402 LLKERRSSFIAAAVT 416 LL+E+++ I AVT Sbjct: 61 LLQEQKAILINQAVT 75 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 76/208 (36%), Gaps = 8/208 (3%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKY 63 D G+ + +P HWK+ +KR T++ G+ S ++ +I D+ + Sbjct: 86 DRGIHGLEKVPNHWKLCSVKRLTQILRGKFSHRPRNDARFYGGQYPFIQTGDISQAGRRI 145 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + F G ++ G + I F+ + P Sbjct: 146 TKYSQTLNARGYAVSKEFPAGTVVMVITGAKTGEVSILGFNACFPDSAVGFFPNPGEVSA 205 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + + R++ + +T + + IG + PP+ EQ I + + + +T Sbjct: 206 DFLYYMFGVLKTRLDEVSIVSTQENLNVDRIGALYTACPPVEEQNQIVDFLDNRLLGFET 265 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGL 211 I L+E ++ L+S+ VT + Sbjct: 266 AQVRAEEQINKLQEFREILISHAVTGKI 293 >gi|84387346|ref|ZP_00990366.1| type I site-specific deoxyribonuclease [Vibrio splendidus 12B01] gi|84377795|gb|EAP94658.1| type I site-specific deoxyribonuclease [Vibrio splendidus 12B01] Length = 413 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 57/417 (13%), Positives = 135/417 (32%), Gaps = 27/417 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P W + ++ + G+ S G +I ++ D+ S + + Sbjct: 2 VPNGWSIKTLESLATVERGKFSARPRNDPKYYGGEIPFVQTGDIASAKTYLSSFNQTLNE 61 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 +F + IL + + I F+ C + +QPK + Sbjct: 62 DGLKVSRLFPENSILIT-IAANIGDTAITTFEVACPDSLVGIQPKQDIDCFWLN-SFLET 119 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ + + + + + + PP EQ I + + D IT + I Sbjct: 120 CKDELDGKATQNAQKNINLQVLKPLEILTPPYKEQQKIAKIL----STWDKAITTTEKLI 175 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE-LNRKNTKL 251 K++K+AL+ ++T + D+G + G + + + + Sbjct: 176 ATSKQQKKALMQQLLTGK--KRLVNPDTGKTFEGEWEEVKLGDVCSKVTDGAHHSPKSVE 233 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYE----TYQIVDPGEIVFRFIDLQNDKRSLRS 307 +LS+ + E + E YE + +I+ + Sbjct: 234 CGYPMLSVKDMRATKFSENTARHISKEDYEALVKQNCKPELNDILIAKDGSILKYCFVVR 293 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRL 365 ++ + + A + K I ++A + + +D K + Sbjct: 294 EEIEGVILSSIALLRPKLSIISPNFIAQYFSQESVRFFVGKALTSGSGVPRIILKDFKGI 353 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + +P + EQ I +V+ +E E + K+ + + + +TG+ ++ Sbjct: 354 HLRIPSLLEQQKIASVLTAADKE----IEVFEAKLAHFKQEKKALMQQLLTGKRRVK 406 >gi|296106107|ref|YP_003617807.1| hypothetical protein lpa_00830 [Legionella pneumophila 2300/99 Alcoy] gi|295648008|gb|ADG23855.1| hypothetical protein lpa_00830 [Legionella pneumophila 2300/99 Alcoy] Length = 448 Score = 133 bits (335), Expect = 5e-29, Method: Composition-based stats. Identities = 77/416 (18%), Positives = 144/416 (34%), Gaps = 16/416 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W + K K D + D DG + Sbjct: 16 EDWHIKRFKYLFKKLN--RPVMDDDGVITAFRDGLVTLRSNRRMDGFTFADKEIGYQGVE 73 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQF---LVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++ + + ++D G CS + + + P P+ +L ++ IE+ Sbjct: 74 PNDLVIHAMDSFAGAIGVSDSRGKCSPVYSIAIPINPNAAYPKFWGYYLRNLATAGFIES 133 Query: 140 ICEGATMS--HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + +G WK I N+ + P Q I + + ET RID LI +++ I +LKE Sbjct: 134 LAKGIRERSTDFRWKDISNLLVNFPNYEIQKGIADFLDHETDRIDQLIEKKVGLISVLKE 193 Query: 198 KKQALVSYIVTKGL----NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR---KNTK 250 K ALV+ V +G K + + + ++P E K Sbjct: 194 KSTALVTENVLQGHRVYPEKTSAEKYTHPDKFWPDGLNGLLQPLKFFCEETASLSDKTDP 253 Query: 251 LIESNILSLSYGNIIQKLETRNMG-LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 +E + + + + L+ K Q++ +++ + + Sbjct: 254 NMEIHYIDIGNVSFADGLKGSAKYLFKDAPSRARQVLRMHDVIISTVRTYLKACAYIDKD 313 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVL 368 + T + I YL ++S G+ ++ + +K L + Sbjct: 314 LPNLIASTGFCVLRPNDKIHPKYLYRAIQSDPFISGVVVRSEGVSYPAVNDKMIKALKIP 373 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 VP + Q I++ I E + IE+SI LL +SS I AVTG++D+ Sbjct: 374 VPDLGLQKSISDKIEQEIHSVTQTTRLIEKSIDLLSSFKSSLITEAVTGKLDINSW 429 >gi|227820721|ref|YP_002824691.1| putative restriction endonuclease type I, S subunit [Sinorhizobium fredii NGR234] gi|227339720|gb|ACP23938.1| putative restriction endonuclease type I, S subunit [Sinorhizobium fredii NGR234] Length = 496 Score = 133 bits (334), Expect = 5e-29, Method: Composition-based stats. Identities = 75/457 (16%), Positives = 143/457 (31%), Gaps = 58/457 (12%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P+ W V I+ + TG T + G I +I V Y + Sbjct: 3 ELPRGWCVTTIQEIADVGTGATPKRGTRAFYESGTIPWITSGAVSQRQITYADEFITEAA 62 Query: 73 SDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLL 129 ++ +F G IL G D + ++ P D + Sbjct: 63 IRSTNCKVFPTGTILVAMYGEGKTRGSVARLAIDAATNQALAAIVLPNDDIVSSEFLMNF 122 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 ++ + G + + + I + P+PPLAEQ I K+ A + + TE Sbjct: 123 LTSQYSQLRGLAAGGVQPNLNLQLIRSTSFPLPPLAEQKRIVAKLDALSAKSARARTELA 182 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDS---------------GIE----------- 223 R L+ KQA++ + L D ++ G+E Sbjct: 183 RIETLVYRYKQAVLGKAFSGELTVDFRLSRRHLQSEAKAGSIHGEEGVERKLKVRGTTDV 242 Query: 224 ----WVGLVPDHWEVKP-----------FFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 + +P+ W A K + I + ++ Sbjct: 243 MKGIQLSPLPESWNWVKNHRLAQNRANAICAGPFGTIFKAKDFRDKGIPIIFLRHVAAGE 302 Query: 269 ETRNM-----GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MA 322 + + V GE++ + + A V + M+ Sbjct: 303 YRTHKPGFMDKKVWQELHQPYSVFGGELLVTKLGDPPGVACIFPAGVGTAMVTPDVMKMS 362 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 V + +L + S + + + G R + K PV P ++EQ +I Sbjct: 363 VDENASVPKFLMFYFNSPIAKNIIHQLAFGLTRLRVDLAMFKTFPVPHPSLEEQLEIVRR 422 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I A+ID L + ++++ L+ + + +A A G+ Sbjct: 423 IESAFAKIDRLAAEAKRALDLVGKLDEAILAKAFRGE 459 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 67/205 (32%), Gaps = 11/205 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKNT------KLIESNILSLSYGNIIQKLETRNMGLKPES- 279 +P W V + I ++ G + Q+ T E+ Sbjct: 3 ELPRGWCVTTIQEIADVGTGATPKRGTRAFYESGTIPWITSGAVSQRQITYADEFITEAA 62 Query: 280 --YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++ G I+ + S+ + A + + I S+ Sbjct: 63 IRSTNCKVFPTGTILVAMYGEGKTRGSVARLAIDAATNQALAAIVLPNDDIVSSEFLMNF 122 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + ++ G++ +L + ++ +PP+ EQ I ++ +A+ ++ Sbjct: 123 LTSQYSQLRGLAAGGVQPNLNLQLIRSTSFPLPPLAEQKRIVAKLDALSAKSARARTELA 182 Query: 398 QSIVLL-KERRSSFIAAAVTGQIDL 421 + I L + + + A +G++ + Sbjct: 183 R-IETLVYRYKQAVLGKAFSGELTV 206 Score = 45.2 bits (105), Expect = 0.023, Method: Composition-based stats. Identities = 48/214 (22%), Positives = 76/214 (35%), Gaps = 21/214 (9%) Query: 21 IPKHWKVVPIKRFTK-----LNTGRTSE-------SGKDIIYIGLEDV---ESGTGKYLP 65 +P+ W V R + + G K I I L V E T K Sbjct: 251 LPESWNWVKNHRLAQNRANAICAGPFGTIFKAKDFRDKGIPIIFLRHVAAGEYRTHKPGF 310 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVL--QPKDVL 120 D Q S+F G++L KLG A I + + + + + Sbjct: 311 MDKKVWQELHQPYSVF-GGELLVTKLGDPPGVACIFPAGVGTAMVTPDVMKMSVDENASV 369 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 P+ L + S I + G T D P+P P L EQ+ I +I + + Sbjct: 370 PKFLMFYFNSPIAKNIIHQLAFGLTRLRVDLAMFKTFPVPHPSLEEQLEIVRRIESAFAK 429 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 ID L E R ++L+ + +A+++ L P Sbjct: 430 IDRLAAEAKRALDLVGKLDEAILAKAFRGELVPQ 463 >gi|116250869|ref|YP_766707.1| type I restriction enzyme specificity subunit [Rhizobium leguminosarum bv. viciae 3841] gi|115255517|emb|CAK06594.1| putative type I restriction enzyme specificity subunit [Rhizobium leguminosarum bv. viciae 3841] Length = 456 Score = 133 bits (334), Expect = 6e-29, Method: Composition-based stats. Identities = 77/417 (18%), Positives = 153/417 (36%), Gaps = 19/417 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDT-ST 77 +PK W ++ + N + + + ++ + V+ TG + K S+ Sbjct: 4 LPKGWVEATLEELCQFNPKHDPDVDQSLGVNFVPMPAVDDETGAIIDKSVVRPLSEIWKG 63 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVL-PELLQGWLLS 130 + FA +++ K+ P + IA + ST+F VL+ K + P+ L +L Sbjct: 64 YTHFADRDVIFAKITPCMENGKIAVARDLANGMACGSTEFHVLRSKGAVEPDFLWRFLRR 123 Query: 131 IDVTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + Q E G + + +P+PPL EQ I K+ + TE Sbjct: 124 KNYRQVAEHSMTGAVGQRRVPRQFLETTSLPLPPLNEQKRIVAKLDTLNAKSARARTELA 183 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL--NRK 247 R L+ KQA++S + L D + + + +P V + + K Sbjct: 184 RIEILVSRFKQAVLSKAFSGELTKDWRSGQTTLAPWENLPLSQLVSHGPSNGWSPKADGK 243 Query: 248 NTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + L + + S G + + + + + ++ ++ R L+ ++ Sbjct: 244 VSGLKSLKLSATSSGRLRLDESTIKYLDQTLPEDSKFWLLSDDIVIQRANSLELLGTTVL 303 Query: 307 SAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFED 361 I + V + YLA + S F A +G + Sbjct: 304 FDGPPGEFIFPDLMMRIRVNDKKTNPRYLATYLNSDSARSYFRANATGSAGNMPKINGST 363 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 V+ V PP++EQ +I + I A D L + +++ L+ + + +A A G+ Sbjct: 364 VRETRVPTPPLEEQQEIVHRIESAFAMTDRLAAEAMRALDLVGKLGEAILAKAFRGE 420 >gi|51598166|ref|YP_072357.1| restriction modification enzyme [Yersinia pseudotuberculosis IP 32953] gi|186897390|ref|YP_001874502.1| restriction modification system DNA specificity subunit [Yersinia pseudotuberculosis PB1/+] gi|51591448|emb|CAH23119.1| possible restriction modification enzyme [Yersinia pseudotuberculosis IP 32953] gi|186700416|gb|ACC91045.1| restriction modification system DNA specificity domain protein [Yersinia pseudotuberculosis PB1/+] Length = 409 Score = 133 bits (334), Expect = 6e-29, Method: Composition-based stats. Identities = 60/427 (14%), Positives = 145/427 (33%), Gaps = 42/427 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ WK+ TG +S +DI + +++E ++ Q Sbjct: 2 VPEGWKLSTFGNHVDCLTGFAFKSKSYSNNPEDIRLLRGDNIEPSRLRWRDAKFWPAQEY 61 Query: 75 TS-TVSIFAKGQILYGKLGPYLRK------AIIADFDGICSTQFLVLQPKDVL-PELLQG 126 KG + ++ D + + ++ + L LL+ Sbjct: 62 EKLEKFQLRKGDFVIAMDRTWVSSGLKVAEVQHTDIPCLLVQRVARIRARSTLEQSLLRQ 121 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + Q ++++ + H I + +PP+ EQ I + D I Sbjct: 122 YFSDNKFEQYVKSVQTATAVPHISPNDIKDFTFLLPPINEQKKIARIL----STWDKAIA 177 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + + +K+AL+ ++ +G + + W L Sbjct: 178 TTEQLLANSQLQKKALMQQLL------------TGKKRFPGFSEEWTEVHLSDLCFINPS 225 Query: 247 KNTKLIESNILSLSYGNIIQ--KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 ++ K + +S + + KL + + + +++ I + Sbjct: 226 RSEKPENGVVSFISMDGVSEDAKLIKTEDRYYSDVSKGFTSFKDDDVLVAKITPCFENGK 285 Query: 305 LRSAQVMERGI---ITSAYMAVKPHGIDSTYLAWL--MRSYDLCKVFYAMGSGLRQSLKF 359 + GI T ++ G+++ Y+ +L M + + GS ++ + Sbjct: 286 GAYVINLTNGIGFGSTEFHVLRAKEGVNAKYIYYLTVMTEFRVRGEMNMQGSAGQKRVTT 345 Query: 360 EDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +K L + VP EQ I V+ V + ++Q + LK+ + + + +TG+ Sbjct: 346 DYLKSLKLTVPISFTEQNKIATVLTVSDQE----IATLKQKLNHLKQEKKALMQQLLTGK 401 Query: 419 IDLRGES 425 ++ ++ Sbjct: 402 RRVKVDA 408 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 49/152 (32%), Gaps = 8/152 (5%) Query: 279 SYETYQIVDPGEIVFRFIDLQ---NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + G+ V K + + ++ ++ + L Sbjct: 62 EKLEKFQLRKGDFVIAMDRTWVSSGLKVAEVQHTDIPCLLVQRVARIRARSTLEQSLLRQ 121 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + ++ + + D+K L+PPI EQ I +++ D + Sbjct: 122 YFSDNKFEQYVKSVQTATAVPHISPNDIKDFTFLLPPINEQKKIARILSTW----DKAIA 177 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 EQ + + ++ + + +TG+ G S+ Sbjct: 178 TTEQLLANSQLQKKALMQQLLTGKKRFPGFSE 209 >gi|290473110|ref|YP_003465971.1| Type I restriction-modification enzyme subunit S [Xenorhabdus bovienii SS-2004] gi|289172404|emb|CBJ79171.1| Type I restriction-modification enzyme subunit S [Xenorhabdus bovienii SS-2004] Length = 452 Score = 133 bits (334), Expect = 6e-29, Method: Composition-based stats. Identities = 56/435 (12%), Positives = 121/435 (27%), Gaps = 30/435 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGK 62 YK + G IP+ W V I + G + D I + +EDV Sbjct: 24 YKQTEA---GVIPEAWVVKSIGELANVIRGASPRPKGDKRYYDGKIPRLMVEDVTRDGKF 80 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 P + ++ +G + G +I+A I + + + Sbjct: 81 VTPIVDSLTEAGAKLSRPCLRGTLTLVCSGNVGIPSILAIDACIHDGFLALTKVSKNISI 140 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRI 181 S + + G ++ +G+ + +P E Q I + I Sbjct: 141 DYLYHFFSTQREKFNNSATHGGVFTNLTTEGVREFLVALPFCYEEQTTIANILSDVDGLI 200 Query: 182 DTLITERIRFIELLKEKKQALVS------YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 L + + Q L++ + K S + + + + Sbjct: 201 SELEKLLAKKQAIKIATMQQLLTGRTRLPQFAFREDGSKKGYKRSELREIPEDWNPISIG 260 Query: 236 P---FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY-----QIVD 287 A + + +E+ L G + S Y + Sbjct: 261 KDAVLKARIGWQALTTKEYLETGEYYLVTGTNFDAGTVKWEDCWYVSEWRYKQDSNIQLK 320 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 +++ + ++ + K + YL +++ S + Sbjct: 321 EDDVLITKDGTIGKVGYVEFLRLPSTLNSGVFVIRPKNNAFHPRYLFYILTSKIFNEFMK 380 Query: 348 A-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 L +D + P I+EQ I ++ I L +Q + ++ Sbjct: 381 GITAGSTITHLYQKDFVNFNFIAPNIEEQTTIATILLDMDTEIQAL----KQRLGKTRQI 436 Query: 407 RSSFIAAAVTGQIDL 421 + + +TG+ L Sbjct: 437 KQGMMQELLTGKTRL 451 >gi|153000716|ref|YP_001366397.1| restriction modification system DNA specificity subunit [Shewanella baltica OS185] gi|151365334|gb|ABS08334.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS185] Length = 427 Score = 133 bits (334), Expect = 6e-29, Method: Composition-based stats. Identities = 69/437 (15%), Positives = 140/437 (32%), Gaps = 46/437 (10%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFT-------KLNTGRTSESGKDIIYIGLEDVESGT 60 YK + V G IP+ W+V +K + +N + I + V Sbjct: 11 EGYKQTEV---GVIPEDWEVKKLKEISPSQSVGLVINPSSYYSNSGTIPMLVGSHVFENK 67 Query: 61 GKYLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPK 117 K+ + + S G ++ ++G A++ C++ ++ Q Sbjct: 68 IKWSKANKITAESNLRLPASRLKTGDLVTVRVGEPGITAVVPPELNQSNCASMMIIRQGP 127 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L + S RIE + G + + P P EQ I + Sbjct: 128 KFDSHWLCFLMNSKIGKSRIEGVQYGTAQKQFNIIDAVDFLFPFPTKEEQTAIANALSDM 187 Query: 178 TVRIDTLITERIRFIELLKEKKQALV-------SYIVTKGLNPDV-----KMKDSGIEWV 225 + L + + Q L+ + +N K K + + Sbjct: 188 DALLSELEKLIAKKQAIKTATMQQLLTGKNRLPQFAFYSDINSIEGAVEGKRKGTKPSEL 247 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 G +PD WEVK F ++ + K+ K I+ + ++ + N + Sbjct: 248 GEIPDDWEVKKFGQVMHIRHGKDQKSIQVSGGLYPIFGTGGQMGSTNT----------PL 297 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 D ++ N R + + + + + ++ + D + Sbjct: 298 YDKPSVLIGRKGTINKPR----FTDYPFWTVDTLFYSEVANTESVKFIYYKFCMIDWMQY 353 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 A G SL ++ + P IKEQ I +++ ID ++ EQ + ++ Sbjct: 354 NEASG---VPSLNASTIENVLASFPDIKEQTAIVTILSD----IDNEIQAFEQRLSKTRQ 406 Query: 406 RRSSFIAAAVTGQIDLR 422 + + +TG+ L Sbjct: 407 IKQGMMQELLTGKTRLP 423 >gi|325832692|ref|ZP_08165455.1| type I restriction modification DNA specificity domain protein [Eggerthella sp. HGA1] gi|325485831|gb|EGC88292.1| type I restriction modification DNA specificity domain protein [Eggerthella sp. HGA1] Length = 393 Score = 133 bits (334), Expect = 6e-29, Method: Composition-based stats. Identities = 55/399 (13%), Positives = 125/399 (31%), Gaps = 35/399 (8%) Query: 24 HWKVVPIKRFTK-LNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +W+ + + L+ G ++ + YI + D++ T +L D S + Sbjct: 18 NWEEKTLGELCEPLSYGMNAAATKFDGENRYIRITDIDDETHAFLSNDVVSPSGELDDKY 77 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + KG IL + G K+ + + L+ + Sbjct: 78 LVKKGDILLARTGASTGKSYLYHPKDGKLFYAGFLIKAHVLPSSDDYFIYSQTLTDRYGK 137 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ + + + +P L EQ I + + A I TE + + Sbjct: 138 WVKTTSMRSGQPGINANEYASYSFSVPSLPEQRKIADLLSAVDDVIAAQKTEVAAWEKRK 197 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K Q L S V + D + +G + + + + L + Sbjct: 198 KGVMQKLFSQEVRFKADDGSDFPDWEEKTLGDI---CMYERQRSEGANFIGTESMLKDFG 254 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 ++ + + + PG+ + I K L +G Sbjct: 255 GVAFD---------------NSKDDGSGTLYHPGDTLMSNIRPYLKKAWLA----DRKGT 295 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKE 374 ++ + P ++ YL WL+ S + + G + + +P+L+P E Sbjct: 296 CSTDVLVFHPTSVEPGYLYWLIASDAFVRYVMSAAKGSKMPRGDKKHIMEMPLLLPNKDE 355 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I + + ++ ++ K + + +E + + Sbjct: 356 QRKI----DDCLSSLNDVIIKAKNELAKWQELKKGLLQQ 390 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 18/189 (9%), Positives = 54/189 (28%), Gaps = 14/189 (7%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPE-SYETYQIVDPGEIVFRFIDLQNDKRSLR 306 E+ + ++ + N + P + +V G+I+ K L Sbjct: 40 TKFDGENRYIRITDIDDETHAFLSNDVVSPSGELDDKYLVKKGDILLARTGASTGKSYLY 99 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRL 365 + + A D ++ + K + + + Sbjct: 100 HPKDGKLFYAGFLIKAHVLPSSDDYFIYSQTLTDRYGKWVKTTSMRSGQPGINANEYASY 159 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR--- 422 VP + EQ I ++++ D ++ + + ++R+ + + ++ + Sbjct: 160 SFSVPSLPEQRKIADLLSAV----DDVIAAQKTEVAAWEKRKKGVMQKLFSQEVRFKADD 215 Query: 423 -----GESQ 426 + Sbjct: 216 GSDFPDWEE 224 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 41/184 (22%), Positives = 74/184 (40%), Gaps = 12/184 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + R+ + +IG E + G D + +++ Sbjct: 221 DWEEKTLGDICMYERQRS----EGANFIGTESMLKDFGGV----AFDNSKDDGSGTLYHP 272 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G L + PYL+KA +AD G CST LV P V P L + S + + + +G Sbjct: 273 GDTLMSNIRPYLKKAWLADRKGTCSTDVLVFHPTSVEPGYLYWLIASDAFVRYVMSAAKG 332 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + M D K I +P+ +P EQ I + + ++ +I + + +E K+ L+ Sbjct: 333 SKMPRGDKKHIMEMPLLLPNKDEQRKIDDCL----SSLNDVIIKAKNELAKWQELKKGLL 388 Query: 204 SYIV 207 + Sbjct: 389 QQMF 392 >gi|23217024|ref|NP_690631.2| type I R/M system specificity subunit [Lactococcus lactis subsp. lactis bv. diacetylactis] gi|23200589|dbj|BAC11874.2| type I R/M system specificity subunit [Lactococcus lactis subsp. lactis bv. diacetylactis] Length = 414 Score = 132 bits (333), Expect = 7e-29, Method: Composition-based stats. Identities = 68/414 (16%), Positives = 152/414 (36%), Gaps = 37/414 (8%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV----ESGTGKYLPKD 67 +P+ W+ + + + G T + + G D E G +Y+ K Sbjct: 15 KVPELRFKGFTDDWEERKLGELSNIVGGGTPSTSNSEYWDGDIDWYAPAEIGEQRYVSKS 74 Query: 68 GNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + S+ I G +L+ AI+ + F + P + Sbjct: 75 KKTITELGLKKSSARILPVGTVLFTSRAGIGNTAILGKE-ATTNQGFQSIVPNPNKLDSY 133 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + ++ + E G+T K + + + +P L+EQ I ++D Sbjct: 134 FIYSRTNELKRYGEVTGAGSTFVEISGKQMSKMSIMVPELSEQKKIGSF----FEQLDNT 189 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I R ++LLKE+K+ + + K +++ +G D WE + ++ Sbjct: 190 IALHQRKLDLLKEQKKGYLQKMFPKNGAKVPELRFAG------FADDWEERKLSSMTNYK 243 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQIVDPGEIVF--RFIDLQND 301 N K+ + +S L N+ + + + E + ++V + + Sbjct: 244 NGKSHEDKQSTSGKLELINLNSISISGGLKHSGKFIDEADDTLQKDDLVMILSDVGHGDL 303 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKF 359 + +R ++ ++P+ D +L + ++ F A G+G+ + ++ Sbjct: 304 LGRVALIPEDDRFVLNQRVALLRPNTTADPQFLFSYINAHQY--YFKAQGAGMSQLNISK 361 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 V+ VP I+EQ I + ++D + ++ + LLKE++ F+ Sbjct: 362 GSVENFISFVPIIEEQKKIGSF----FKQLDETIALHQRKLDLLKEQKKGFLQK 411 >gi|323340760|ref|ZP_08081012.1| type I restriction-modification system specificity subunit [Lactobacillus ruminis ATCC 25644] gi|323091883|gb|EFZ34503.1| type I restriction-modification system specificity subunit [Lactobacillus ruminis ATCC 25644] Length = 419 Score = 132 bits (333), Expect = 7e-29, Method: Composition-based stats. Identities = 59/425 (13%), Positives = 147/425 (34%), Gaps = 35/425 (8%) Query: 11 KDSGVQWIGAIPK--------HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV 56 KDS V +P W+ + +++ G T + DI + ++ Sbjct: 5 KDSKV-----VPNVRFKGFTDDWEQRKLGDVSEIIGGGTPSTNHPEYWDGDIDWYSPAEI 59 Query: 57 ESG-TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 K + D S+ + G +L+ + AI++ + F + Sbjct: 60 SDQIYVKRSRRRITQLGYDNSSAKLLPPGTVLFTSRAGIGKTAILSQKS-CTNQGFQSIV 118 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 P + + + + + + E + G+T + K + + + +P ++ + I Sbjct: 119 PHENELDTYFIFSRTNVLKRYGELVGAGSTFAEVSGKQMSAMNLMLPTTIQEQ---QLIG 175 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD-HWEV 234 ++D LIT + I L + K+A++ + K + +++ +G Sbjct: 176 QFFKKLDCLITLHQQKITRLIKLKKAMLEKMFPKKGSVIPEIRFNGFANAWEQCKLGDIA 235 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGE 290 + + R + L + + ++ + I + + + + + + G Sbjct: 236 TMHARIGWQNLRTSEFLNSGDYMLITGTDFIDGTINFDTCHYVKRERYEQDKHIQISNGS 295 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 I+ ++ + + + +D+ YL +++ L Y Sbjct: 296 ILITKDGTLGKVAYIQGLTMPATLNAGVFNVEIKDENKVDNRYLFQYLKAPFLMNYVYKK 355 Query: 350 G-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G + L + PV++P EQ I +D L+ ++ I +LK+ +S Sbjct: 356 ATGGTIKHLNQNILVNFPVVLPQKTEQKVIGE----LFTNLDHLITLHQRKIDMLKKLKS 411 Query: 409 SFIAA 413 + ++ Sbjct: 412 ACLSE 416 >gi|260776597|ref|ZP_05885492.1| hsdS type I site-specific deoxyribonuclease [Vibrio coralliilyticus ATCC BAA-450] gi|260607820|gb|EEX34085.1| hsdS type I site-specific deoxyribonuclease [Vibrio coralliilyticus ATCC BAA-450] Length = 563 Score = 132 bits (333), Expect = 7e-29, Method: Composition-based stats. Identities = 65/418 (15%), Positives = 149/418 (35%), Gaps = 30/418 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD---------IIYIGLEDV---ESGTGKYLPKD 67 +P +W I + +G T ++G + I ++ D+ + +D Sbjct: 3 KLPFNWVETEIGNLALVVSGGTPKAGDELNFAEPGAGIAWVTPADLSGYKQKEIANGRRD 62 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + + D+S+ + KG +L+ P IA+ + + F D + + Sbjct: 63 LSPKGLDSSSAKLMPKGTLLFSSRAPI-GYVAIAENEISTNQGFKSFIFTDHVNST-YAY 120 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + E+ G T +P + PL EQ+ I +K+ + ++D Sbjct: 121 YYLKSIKDLAESWGSGTTFKELSGAVAKKLPFRLAPLNEQIRIADKLDSILAKVDHAQER 180 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + ++LK +Q++++ + L EW W ++ N Sbjct: 181 LDKIPDILKRFRQSVLAAATSGELTR---------EWREGKEHQWPRVQLKSVGRGFNYG 231 Query: 248 NT--KLIESNILSLSYGNIIQK-LETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKR 303 ++ E + L GN+ L N+ + E +++ G+++F + Sbjct: 232 SSAKSKPEGEVPVLRMGNLQGGQLHWDNLVYTSDKEEIDKYLLEKGDVLFNRTNSPELVG 291 Query: 304 SLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFE 360 + ++ I + +K +D+ +L + S + + + + ++ + Sbjct: 292 KTSIYRGEQKAIYAGYLIRIKGSEHLDTEFLNIQLNSPHARDYCWQVKTDGVSQSNINAK 351 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ +P I EQ +I ++ +R D+ + S L S + A GQ Sbjct: 352 KLQAYEFDLPEIDEQLEIVRRVSELFSRADLFEYQYLASKKYLNRLTQSILVKAFNGQ 409 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 32/157 (20%), Positives = 71/157 (45%), Gaps = 8/157 (5%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 R++ K + +++ G ++F + +G + + ++S Sbjct: 61 RDLSPKGLDSSSAKLMPKGTLLFSSRAPIGYVAIAENEISTNQGFKSFIFT----DHVNS 116 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 TY + ++S + + + GSG + L K+LP + P+ EQ I + ++ A++ Sbjct: 117 TYAYYYLKS--IKDLAESWGSGTTFKELSGAVAKKLPFRLAPLNEQIRIADKLDSILAKV 174 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 D E++++ +LK R S +AAA +G++ R + Sbjct: 175 DHAQERLDKIPDILKRFRQSVLAAATSGELT-REWRE 210 >gi|291167081|gb|EFE29127.1| type I restriction enzyme StySJI specificity protein [Filifactor alocis ATCC 35896] Length = 465 Score = 132 bits (333), Expect = 7e-29, Method: Composition-based stats. Identities = 63/424 (14%), Positives = 139/424 (32%), Gaps = 32/424 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG--------KDIIYIGLEDVESGTGKYLPKDGNSR 71 IP++W + ++ G T S I I +V+ + Sbjct: 27 QIPENWVWTRLGYVSEFERGITFPSSAKKRTLDENMIPCIRTANVQEELMINDLIYVDKS 86 Query: 72 QSDTSTVSIFAKGQILYGK------LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + K I+ +G + + K L Sbjct: 87 YTKNNKSKCLKKNDIIMSSANSKELVGKTCFVYQVPFPMTFGGFVLTIRAKKVSSEFLFY 146 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L I + +++ + K + P+PPL EQ I E+I + ++D Sbjct: 147 MLRLEFLSGNFIRESTQTTNIANINTKMLSKYSFPLPPLLEQQRIVERIESLFSKLDEAK 206 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + ++ + +K A++ + L + E G+ D WE + Sbjct: 207 EKIQMALDSFETRKSAILYQAFSGELTKKWR------EENGIRLDDWEKEELRERCHINP 260 Query: 246 RK------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 +K + + + I S +++ ++ + E + Y + G+++F I Sbjct: 261 KKIATKELSDSIDITFIPMASVSDVLGQVSMPMIKKLGEYKKGYTNFNQGDVLFAKITPC 320 Query: 300 NDKRSLRSAQVMERGI---ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--R 354 + + +E I T Y+ + ++ L+R + + SG + Sbjct: 321 MENGKIAIVGELENNIGFGSTEFYVFRCKENTYNRFIYHLLRWKKFREEARNVMSGAVGQ 380 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 Q + ++ + P ++EQ +I ++ + E I I + + S +A A Sbjct: 381 QRVPKSFLEEYKLCFPSLEEQKEIVRILYTIFEKEQDTQELI-DLIEKIDLMKKSILARA 439 Query: 415 VTGQ 418 G+ Sbjct: 440 FRGE 443 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 83/227 (36%), Gaps = 23/227 (10%) Query: 223 EWVGLVPDHWEVKPFFALVTELNR-------KNTKLIESNILSLSYGNIIQKLETRNMGL 275 E +P++W + K L E+ I + N+ ++L ++ Sbjct: 23 EQPYQIPENWVWTRLGYVSEFERGITFPSSAKKRTLDENMIPCIRTANVQEELMINDLIY 82 Query: 276 KPESYETYQI---VDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSAYMAVKPHGIDST 331 +SY + +I+ + + + QV + ++ + S Sbjct: 83 VDKSYTKNNKSKCLKKNDIIMSSANSKELVGKTCFVYQVPFPMTFGGFVLTIRAKKVSSE 142 Query: 332 YLAWLMRSYDL--CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L +++R L + + + ++ + + + +PP+ EQ I I +++ Sbjct: 143 FLFYMLRLEFLSGNFIRESTQTTNIANINTKMLSKYSFPLPPLLEQQRIVERIESLFSKL 202 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQ----------IDLRGESQ 426 D EKI+ ++ + R+S+ + A +G+ I L + Sbjct: 203 DEAKEKIQMALDSFETRKSAILYQAFSGELTKKWREENGIRLDDWEK 249 >gi|197119930|ref|YP_002140357.1| type I restriction-modification system DNA specificity subunit [Geobacter bemidjiensis Bem] gi|197089290|gb|ACH40561.1| type I restriction-modification system DNA specificity subunit [Geobacter bemidjiensis Bem] Length = 395 Score = 132 bits (333), Expect = 8e-29, Method: Composition-based stats. Identities = 53/406 (13%), Positives = 127/406 (31%), Gaps = 26/406 (6%) Query: 24 HWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W + + G + ++ I +I + D ++ + + R Sbjct: 4 GWVTKKLGEICDIERGGSPRPIDSFLTDAPDGINWIKIGDTKTISKYIFTTEQKIRPEGA 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL--SIDV 133 + +G + + R I G +LVL+ K+ + + S V Sbjct: 64 KRSRMVFEGDFILSNSMSFGRP-YIMKTTGCIHDGWLVLREKEPNVNQDYLYHVLSSDLV 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ + + G+T+ + + + + +PIP ++EQ I + RI T + ++ Sbjct: 123 YRQFDRLAAGSTVRNLNIGLVKGVEVPIPSISEQQRIVGILDEAFDRIATAKANAEKNLQ 182 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + ++ + T+ ++ +G + +H K KN ++ Sbjct: 183 NARALFESHLQSTFTQRCAGWT------VKTIGDLAEHSLGKMLDKA------KNKGELQ 230 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + +++ L + G+++ + Sbjct: 231 PYLRNINVRWFTFNLSDLLEMPFRTTEVGKYTAVKGDVLICEGGYPGRAAIWTEDYPVYF 290 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPI 372 +P + + + + + D SG Q E + R + + P+ Sbjct: 291 QKALHRVRFHEPEH--NKWFLYYLYAQDKSGELKKHFSGTGIQHFTGEALSRFKLPLAPL 348 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 E V L ++ + L+E + S + A TGQ Sbjct: 349 PELRRNVARFEVLLEETQRLESICQRKLTALEELKKSLLDRAFTGQ 394 >gi|288947723|ref|YP_003445106.1| restriction modification system DNA specificity domain protein [Allochromatium vinosum DSM 180] gi|288898239|gb|ADC64074.1| restriction modification system DNA specificity domain protein [Allochromatium vinosum DSM 180] Length = 448 Score = 132 bits (333), Expect = 8e-29, Method: Composition-based stats. Identities = 70/404 (17%), Positives = 139/404 (34%), Gaps = 23/404 (5%) Query: 25 WKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ VP+ + G + + + I + DV SG K D Sbjct: 25 WERVPLGDVCDILNGFPFKSQHFNNSEGAPVIRIRDVTSGFCK------TFYSGDIPVGY 78 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++ G G + + + + + + + L P + + + + I Sbjct: 79 WVEPFDMVVGMDGDFNCR-LWSSERSLLNQRVCKLTPHEDFLDKKFLSYVLPAYLRLIND 137 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H K I IP P+PPLAEQ I K+ R E L++ K Sbjct: 138 HTHSITVKHLSSKTIAKIPFPLPPLAEQRRIVAKLDRLFERTRRAREELSHIPRLIENYK 197 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 +A++ L D + K G+ V K + + K+ ++ L Sbjct: 198 KAILVAAFRGDLTKDWREKR-GLPMPKEVKLGEVAKKLSYGTSAKSSKS-----GDVPVL 251 Query: 260 SYGNIIQ-KLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 GNI +++ +++ + E ++ G+++F + + I Sbjct: 252 RMGNIQNMRIDWKDLVYTSDVEEIEKYSLNAGDVLFNRTNSPELVGKTAIYKGERPAIYA 311 Query: 318 SAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKE 374 + + + YL + + S + + S + ++ + + L+P E Sbjct: 312 GYLIKIKCGNRLVPEYLNYCLNSPLGRSYCWRVKSDGVSQSNINAKKLADFSFLLPTHDE 371 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q +I I +D LV + Q+ LL + +A A G+ Sbjct: 372 QKEIVFRIEKTLDWLDSLVIEERQASHLLDHLDQANLAKAFRGE 415 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 67/202 (33%), Gaps = 15/202 (7%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 + + ++ K+ S + + + V+P Sbjct: 25 WERVPLGDVCDILNGFPFKSQHFNNSEGAPVIRIRDVTSGFCK--TFYSGDIPVGYWVEP 82 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVF 346 ++V N + ER ++ + PH D +L++++ ++ Sbjct: 83 FDMVVGMDGDFNCR-----LWSSERSLLNQRVCKLTPHEDFLDKKFLSYVL--PAYLRLI 135 Query: 347 YAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + L + + ++P +PP+ EQ I ++ R E++ L++ Sbjct: 136 NDHTHSITVKHLSSKTIAKIPFPLPPLAEQRRIVAKLDRLFERTRRAREELSHIPRLIEN 195 Query: 406 RRSSFIAAAVTGQIDL-RGESQ 426 + + + AA G DL + + Sbjct: 196 YKKAILVAAFRG--DLTKDWRE 215 >gi|307274412|ref|ZP_07555596.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX2134] gi|306508922|gb|EFM78008.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX2134] Length = 413 Score = 132 bits (332), Expect = 9e-29, Method: Composition-based stats. Identities = 52/407 (12%), Positives = 137/407 (33%), Gaps = 32/407 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 + W++ + ++ G + ++ D+ ++ + DV G+ + ++ Sbjct: 18 EDWELCKLGTLAEIVRGASPRPIQDSKWFDNTSDVGWLRISDVTEQNGRIYKLEQKLSKA 77 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + K +L + + G+ + L P L + + Sbjct: 78 GQEKTRVLRKPHLLLSIAATVGKPVVNYVNTGVHDGFLIFLNP---LFDREFMFQWLEMF 134 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 T + + + + + + + + N + +P EQ EKI +D IT R ++ Sbjct: 135 TPKWQKYGQPGSQLNLNSELVRNQELRMPSTNEQ----EKIGMLFKYLDDTITLHQRKLD 190 Query: 194 LLKEKKQALVSYIV---TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 LK+ K+A + + N K++ + E + +V + N + Sbjct: 191 QLKKLKKAYLHAMFVSMNTKKNKVPKLRFTDFEGDWELCKLGQVANYRRGSFPQPYGNKE 250 Query: 251 LIE-----SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + + G+ ++ +E + + V G++V Sbjct: 251 WYDGENSMPFVQVVDVGDNLRLVEDTKQKISELAQPKSVFVKEGKVVVTLQGSIGRVAIT 310 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + ++R ++ +D Y A++++ G +++ E + Sbjct: 311 QYPAYVDRTLL---IFESYKAEMDEYYFAYVIQQL-FEYEKTRAPGGTIKTVTKEALSDF 366 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + P I+EQ + ++D + + + L E + S++ Sbjct: 367 TISFPSIEEQKK----LGKFFEQLDDTITLHQNKLEQLNELKKSYLQ 409 >gi|329119169|ref|ZP_08247859.1| type I site-specific deoxyribonuclease [Neisseria bacilliformis ATCC BAA-1200] gi|327464728|gb|EGF11023.1| type I site-specific deoxyribonuclease [Neisseria bacilliformis ATCC BAA-1200] Length = 487 Score = 132 bits (332), Expect = 1e-28, Method: Composition-based stats. Identities = 68/433 (15%), Positives = 138/433 (31%), Gaps = 70/433 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 IP++W V + ++ G ++ + +I + +G L K G + Sbjct: 68 DIPENWVWVRLGDLAQVLNGDRGKNYPGKEFWVSEGKPFINAGSLNNG---ILDKSGFNY 124 Query: 72 QSDTSTVSIFA-----KGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELL 124 SD S+ K LY G + ++ DFD I S+ ++ + L Sbjct: 125 ISD-DRYSLLRSGFIQKNDFLYCLRGSLGKFSLNKDFDEGVIGSSLCIIRTHQSSLIPFF 183 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +L + + I+ + G + + + N +P+PPLAEQ I EK+ ID L Sbjct: 184 FYYLQTDLAQEDIKKVSNGTAQPNLSAENVRNFLIPLPPLAEQQAIAEKLTRLLAEIDRL 243 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI---------------------- 222 E L K Q L + ++ + ++ + S Sbjct: 244 KAEEQSLASLQKAYPQTLRASVLAAAIKGELTERSSENARDLLLRIQNEKQALQAKGSLK 303 Query: 223 -----------EWVGLVPDHWEVKPFFALVTELNRKN------TKLIESNILSLSYGNII 265 E +P++W ++ + +I S ++ Sbjct: 304 KTKAPAPVTADEVSFDIPENWVWVRLGDVILQNIGGGTPSKQEPSYWNGDIPWASVKDLN 363 Query: 266 QKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + T+ + + ++ G ++ + ++ I Sbjct: 364 CDVLTKTIDSITAEGLENSSSNLIPKGTLIICTR----MGLGKIALAEIDVAINQDLRAI 419 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P ++ Y R+ + + + E++ P +PP+ EQ I + Sbjct: 420 FLPECLNKHYFYHFYRTLKMEGK-----GATVKGITVEELHNTPFPLPPLAEQQAIVEKL 474 Query: 383 NVETARIDVLVEK 395 + A ID L Sbjct: 475 SALLAEIDALENA 487 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 78/211 (36%), Gaps = 16/211 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII------QKLETRNMGLK 276 E +P++W L LN K +S G G Sbjct: 64 EAPFDIPENWVWVRLGDLAQVLNGDRGKNYPGKEFWVSEGKPFINAGSLNNGILDKSGFN 123 Query: 277 PESYETYQIVDPGEIVFRF--IDLQNDKRSLRSAQVMERGII-TSAYMAVKPHGIDSTYL 333 S + Y ++ G I L+ + + G+I +S + + Sbjct: 124 YISDDRYSLLRSGFIQKNDFLYCLRGSLGKFSLNKDFDEGVIGSSLCIIRTHQSSLIPFF 183 Query: 334 AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + +++ + + +G + +L E+V+ + +PP+ EQ I + A ID L Sbjct: 184 FYYLQTDLAQEDIKKVSNGTAQPNLSAENVRNFLIPLPPLAEQQAIAEKLTRLLAEIDRL 243 Query: 393 VEKIEQSIVLLKE-----RRSSFIAAAVTGQ 418 + EQS+ L++ R+S +AAA+ G+ Sbjct: 244 KAE-EQSLASLQKAYPQTLRASVLAAAIKGE 273 >gi|169634835|ref|YP_001708571.1| specificity determinant for hsdM and hsdR [Acinetobacter baumannii SDF] gi|169153627|emb|CAP02819.1| specificity determinant for hsdM and hsdR [Acinetobacter baumannii] Length = 386 Score = 132 bits (332), Expect = 1e-28, Method: Composition-based stats. Identities = 62/392 (15%), Positives = 147/392 (37%), Gaps = 31/392 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P W + I L GR +S + + I ++++ + + N D Sbjct: 8 PPSWCIASIGEVCNLINGRAFKSTEWTDRGLPIIRIQNLNN-----PDANFNFFNGDLDN 62 Query: 78 VSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDV 133 KG +L+ G I G + ++ ++ + + ++ + Sbjct: 63 KHRVEKGDLLFAWSGTPGTSFGAHIWDGDIGALNQHIFKIVFNDSLIDKRFIRYAINQTL 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + G + H + PPL EQ +I +K+ ++ T R + Sbjct: 123 DELVSGARGGVGLKHVTKGMFETTKIIFPPLYEQKIIADKLDTLLAQVATTKVRLERILN 182 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 +LK +Q+++S V+ L + + K+ + W+ + V++ + + + Sbjct: 183 ILKTFRQSILSSAVSGKLTEEWR-KNKKLNWIKSTLAN-----ICRSVSDGDHQAPPRAD 236 Query: 254 SNILSLSYGNIIQKLETRNMG------LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 I L NI + + ES + + + +I++ +++S Sbjct: 237 FGIPFLVISNISKGEIDFSSVNRWVPESYYESLKDIRKPEINDILYTVTGSFGIPVTVKS 296 Query: 308 AQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKR 364 +KP+ +D YL + + S ++ K ++ +G ++++ ++ Sbjct: 297 ---TTPFCFQRHIAIIKPNHSSVDYKYLFYYLASPEVFKHATSIATGTAQKTVSLSHLRN 353 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +L+PPI+EQ +I + + A D + +K+ Sbjct: 354 FNILLPPIEEQTEIVHRVEELLAFADGIEKKL 385 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 70/183 (38%), Gaps = 5/183 (2%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 L+ K+T+ + + + N+ N + V+ G+++F + Sbjct: 20 CNLINGRAFKSTEWTDRGLPIIRIQNLNNPDANFN--FFNGDLDNKHRVEKGDLLFAWSG 77 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + G + + + ID ++ + + V A G + Sbjct: 78 TPGTSFG-AHIWDGDIGALNQHIFKIVFNDSLIDKRFIRYAINQTLDELVSGARGGVGLK 136 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + ++ PP+ EQ I + ++ A++ ++E+ + +LK R S +++AV Sbjct: 137 HVTKGMFETTKIIFPPLYEQKIIADKLDTLLAQVATTKVRLERILNILKTFRQSILSSAV 196 Query: 416 TGQ 418 +G+ Sbjct: 197 SGK 199 >gi|257060103|ref|YP_003137991.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8802] gi|256590269|gb|ACV01156.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8802] Length = 433 Score = 132 bits (331), Expect = 1e-28, Method: Composition-based stats. Identities = 57/417 (13%), Positives = 140/417 (33%), Gaps = 29/417 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W V + ++ G+ + ++ +++ D + Sbjct: 12 WSFVRVDEIFEIQQGKQVSQKNRVGDNQKPFLRTKNILWNRLDLTDLDTMHFKPTDERRL 71 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-----GWLLSIDVT 134 G +L + G R AI + C Q + + + + + + + + + Sbjct: 72 KLKSGDLLLCEGGSVGRTAIWQEDIEECYYQNHLHRLRVINNKCSYQFALYWFWYAFEYS 131 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 T+ + + +P+P+PP+ EQ I + I I E+ I L Sbjct: 132 SFYSGRKNITTIPNLSRSRLAELPIPLPPIEEQRKIASVL----TLIQETIQEQENAIAL 187 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 E K+AL+ + T+G+N + K + I + + ++ F + K Sbjct: 188 TTELKKALMQKLFTEGIN-NEPQKMTEIGLIPESWEVLPLRKMFKIKHGYAFKGEYFTSE 246 Query: 255 NILSLSYGNIIQ-----KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 L + + ++ +++ + ++ + Sbjct: 247 GKFILMTPGHFNEDGGFRDQQDKTKYYIGEVPNDYLLKKDDLLVAMTEQKSGLLGSSAFV 306 Query: 310 VMERGIITSAYMAVKPH----GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKR 364 + + + + +D +L L + K +G + + + Sbjct: 307 PESNKYLHNQRLGLIEELDESYLDKKFLFHLFNYEYVRKEISQTATGSKVKHTSPDKILN 366 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + V +P + EQ DI +++ +I+++V K +Q L++ S+ + +T QI + Sbjct: 367 VMVGLPNLNEQKDIIFLLDEFDIKINIIVLKKQQ----LQDLFSTLLHQLMTAQIRV 419 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 72/201 (35%), Gaps = 14/201 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDV---ESGTGKYLPKDGNSR 71 IG IP+ W+V+P+++ K+ G + + +I + E G + Sbjct: 214 IGLIPESWEVLPLRKMFKIKHGYAFKGEYFTSEGKFILMTPGHFNEDGGFRDQQDKTKYY 273 Query: 72 QSDTSTVSIFAKGQILYG----KLGPYLRKAIIADFDGICSTQ----FLVLQPKDVLPEL 123 + + K +L K G A + + + Q L + + Sbjct: 274 IGEVPNDYLLKKDDLLVAMTEQKSGLLGSSAFVPESNKYLHNQRLGLIEELDESYLDKKF 333 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L V + I G+ + H I N+ + +P L EQ I + ++I+ Sbjct: 334 LFHLFNYEYVRKEISQTATGSKVKHTSPDKILNVMVGLPNLNEQKDIIFLLDEFDIKINI 393 Query: 184 LITERIRFIELLKEKKQALVS 204 ++ ++ + +L L++ Sbjct: 394 IVLKKQQLQDLFSTLLHQLMT 414 >gi|52082594|ref|YP_081385.1| hypothetical protein BL02387 [Bacillus licheniformis ATCC 14580] gi|52787992|ref|YP_093821.1| hypothetical protein BLi04316 [Bacillus licheniformis ATCC 14580] gi|52005805|gb|AAU25747.1| HsdS [Bacillus licheniformis ATCC 14580] gi|52350494|gb|AAU43128.1| putative protein [Bacillus licheniformis ATCC 14580] Length = 387 Score = 132 bits (331), Expect = 1e-28, Method: Composition-based stats. Identities = 53/399 (13%), Positives = 133/399 (33%), Gaps = 22/399 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + G++ + +G ++ K +Q + + G Sbjct: 9 WENGNLSDIADITMGQSPPGNSYNDIKDGIGLINGPTEFTNKYPVVKQWTSKPTKLCKAG 68 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G + IAD + ++ K E + ++ G+ Sbjct: 69 DILLCVRGSSTGRMNIADDEYCIGRGVASIRAKKDKAETSFIYYTLNYKVNQLLQKTAGS 128 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T + I ++ + IP AEQ I + I+ + E K Q L++ Sbjct: 129 TFPNLSSNEIKDMIVGIPLFAEQQKIASILSTWDKAIELKEKLIEQKKEQKKGLMQKLLT 188 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 V D K E + ++ KN +L + + L+ + Sbjct: 189 GKVRLPGFSDKWEKKKIGELLEES--------------KVIAKNPQLDKRITVRLNLKGV 234 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 ++ ++ E T I G+ ++ +L L ++ + Sbjct: 235 CKR---EISTVEKEGATTQYIRKEGQFIYGKQNLHKGAFGLIPKELDGFQSSSDIPCFDF 291 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 G+D + + + SG + ++ +++ +L + +P ++EQ + ++ Sbjct: 292 KEGVDGLWFYYYFSRESFYTNLENISSGTGSKRIQPKELYKLTIKLPSLREQQRQSKILE 351 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + +E+ + ++++ + +TG++ ++ Sbjct: 352 CSDKE----IYLLEKELETYRKQKQGLMQLLLTGKVRVK 386 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 61/168 (36%), Gaps = 7/168 (4%) Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + N + + +K + + ++ G+I+ + ++ E I Sbjct: 38 IGLINGPTEFTNKYPVVKQWTSKPTKLCKAGDILLCVRGSSTGRMNIA---DDEYCIGRG 94 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 T + +Y + ++ +L ++K + V +P EQ I Sbjct: 95 VASIRAKKDKAETSFIYYTLNYKVNQLLQKTAGSTFPNLSSNEIKDMIVGIPLFAEQQKI 154 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 ++++ D +E E+ I KE++ + +TG++ L G S Sbjct: 155 ASILSTW----DKAIELKEKLIEQKKEQKKGLMQKLLTGKVRLPGFSD 198 >gi|317486937|ref|ZP_07945747.1| type I restriction modification DNA specificity domain-containing protein [Bilophila wadsworthia 3_1_6] gi|316921812|gb|EFV43088.1| type I restriction modification DNA specificity domain-containing protein [Bilophila wadsworthia 3_1_6] Length = 450 Score = 132 bits (331), Expect = 1e-28, Method: Composition-based stats. Identities = 62/428 (14%), Positives = 148/428 (34%), Gaps = 27/428 (6%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 + YP +P+ WK V + + ++N ++ ++ +E +E G Sbjct: 29 QPYP------------LPEGWKWVRLGKLYQINPRIIADDNTMSSFVPMEKIEPGMKGTF 76 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKD 118 + + FA G + + K+ P + + G +T+ ++L+ Sbjct: 77 TFEILPWGKAKKGHTQFADGDVAFAKISPCFENGKSMLVRGLKNGIGAGTTELIILRQPS 136 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 VL + + S D Q+ G + N P+P+PP+ Q I + I + Sbjct: 137 VLQKYTFYIICSSDFIQKGTHTYSGTVGQQRISMDFVRNYPVPLPPVDVQQRIVDCIESL 196 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM--KDSGIEWVGLVPDHWEVK 235 ++D + + + +K A++ T L + S W + Sbjct: 197 FAKLDEAREKAEAVFDGFESRKAAILHKAFTGELTEKWRKEKNISLESWDSCRLISVLKE 256 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + + ++S LS + + + + + ++ + G+I+ + Sbjct: 257 KPRNGYSPKPVECKTNVKSMTLSATTSGFFRPEFFKYID-EEIPENSHLWLSQGDILIQR 315 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + + + I + Y+A+++ + F + +G Sbjct: 316 ANSLEKVGTSAIYTGGDHEFIYPDLIMKLQVRAPHSYKYIAYILSTQPTLSYFRSKATGT 375 Query: 354 ---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + + V P++VP +EQ +I +++ A+ + E + + + S Sbjct: 376 AGNMPKINQQIVSNTPIVVPSCEEQNEIVRILDGLLAKDQQARDAAESVLERIDLMKKSI 435 Query: 411 IAAAVTGQ 418 +A A G+ Sbjct: 436 LAKAFRGE 443 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 81/220 (36%), Gaps = 6/220 (2%) Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES--NILSLSYG 262 L PD ++ E +P+ W+ L R + Sbjct: 10 KAQGTLLTPDEVVEIPVEEQPYPLPEGWKWVRLGKLYQINPRIIADDNTMSSFVPMEKIE 69 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII--TSAY 320 ++ T + ++ + + G++ F I + + ++ GI T+ Sbjct: 70 PGMKGTFTFEILPWGKAKKGHTQFADGDVAFAKISPCFENGKSMLVRGLKNGIGAGTTEL 129 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDI 378 + ++ + Y +++ S D + SG +Q + + V+ PV +PP+ Q I Sbjct: 130 IILRQPSVLQKYTFYIICSSDFIQKGTHTYSGTVGQQRISMDFVRNYPVPLPPVDVQQRI 189 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + I A++D EK E + R+++ + A TG+ Sbjct: 190 VDCIESLFAKLDEAREKAEAVFDGFESRKAAILHKAFTGE 229 >gi|330999089|ref|ZP_08322812.1| type I restriction modification DNA specificity domain protein [Parasutterella excrementihominis YIT 11859] gi|329575610|gb|EGG57144.1| type I restriction modification DNA specificity domain protein [Parasutterella excrementihominis YIT 11859] Length = 429 Score = 131 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 89/407 (21%), Positives = 151/407 (37%), Gaps = 33/407 (8%) Query: 14 GVQWIG--------AIPKHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGK 62 V+ IG IPK WK V + E+ +D + LED+E TG+ Sbjct: 29 EVEQIGKAPKENPFEIPKKWKWVRLDDIAPYGKCERIEACSFDRDTWLLNLEDIEKDTGR 88 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 L K+ + +F KG +LY +L PYL K ++AD DG+C+T+ + L+PK+ Sbjct: 89 LLQKNKIIKNQG--AKYLFNKGDVLYSRLRPYLNKVLVADEDGVCTTEIIPLKPKENTLS 146 Query: 123 LLQ--GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 +L S Q + G M N + +PPL EQ I EK+ + + Sbjct: 147 GSYLSFFLKSQYFVQYAVSQSYGVKMPRVGTATAKNALVALPPLDEQKRIVEKLESLFAK 206 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG--------LVPDHW 232 IDT+ +L ++ L+ ++ L P + ++ +E +G +P+ W Sbjct: 207 IDTIQKSIDEVSQLGASLEKQLLQSSISGKLVPQLD-EELEVEQIGDAPEEVPFEIPEKW 265 Query: 233 EVKPFFALVTELNRKNTKLIE------SNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 + +L T + K K E +S N + + +I Sbjct: 266 KWVRLESLGTLFSGKTPKADELTSSGNIPYFKISDMNSSENQKYMRHTEHYLKTTPKKIF 325 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G I+F R + + + +D +Y L+ S D ++ Sbjct: 326 KAGSIIFPKNGGAVFTNKRRFLVRDSIVDLNTGGFYPNKNYLDESYAFLLLSSIDFREI- 384 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++ +K V +PP+ EQ I I L Sbjct: 385 --SKGTALPTIDSSKLKSYLVPLPPLGEQRRIVEKFEKLMLEIQKLK 429 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 80/231 (34%), Gaps = 19/231 (8%) Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+ + L P + ++ +E +G P + + + Sbjct: 11 LLDLAIRGKLVPQID-GENEVEQIGKAPKENPFEIPKKWKWVRLDDIAPYGKCERIEACS 69 Query: 262 GNIIQKLETRN-----------MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + L ++ + + G++++ + +K + Sbjct: 70 FDRDTWLLNLEDIEKDTGRLLQKNKIIKNQGAKYLFNKGDVLYSRLRPYLNKVLVA---- 125 Query: 311 MERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 E G+ T+ + +KP +YL++ ++S + + G+ + K V Sbjct: 126 DEDGVCTTEIIPLKPKENTLSGSYLSFFLKSQYFVQYAVSQSYGVKMPRVGTATAKNALV 185 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP+ EQ I + A+ID + + I++ L + ++++G+ Sbjct: 186 ALPPLDEQKRIVEKLESLFAKIDTIQKSIDEVSQLGASLEKQLLQSSISGK 236 >gi|209523412|ref|ZP_03271967.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] gi|209496154|gb|EDZ96454.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] Length = 407 Score = 131 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 53/411 (12%), Positives = 126/411 (30%), Gaps = 31/411 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVE---SGTGKYLPKDGNSRQSDT 75 K W +V ++ K+ + + I + ++++ +G + + Sbjct: 2 KGWDIVALEDLGKITSSKRIFKKDYVDSGIPFYRTKEIKELANGKEVSTELFISRDSFNE 61 Query: 76 STVSIFAK--GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G +L +G ++ D ++ E I Sbjct: 62 IKAKFGTPSVGDLLITAIGTVGEIYVVDRTDFYFKDGNVLWLRDFKAIEPNFLKYALIAF 121 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I ++ G+T + + + P ++EQ I + ID I + + Sbjct: 122 VDEINSLSHGSTYKALPIEKLKKHKIYKPSISEQKRIVAILDEAFEGIDAAIANTQKNLA 181 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 +E ++ ++ I T+ + V+ K I E + E Sbjct: 182 NARELFESYLNGIFTRKGDGWVEKKLGEI----------------CHKVEYGSSSKSQPE 225 Query: 254 SNILSLSYGNIIQKLETRN--MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +I + GNI + + ++ +++F + + + Sbjct: 226 GDIPVIRMGNIQNNMIDWTDLVYTSNPDEINRYLLQYNDVLFNRTNSADHVGKSAIYKGE 285 Query: 312 ERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPV 367 + I + V D +L + + Y + ++ S + ++ +K P+ Sbjct: 286 KPAIFAGYLIRVHYKKDVIDPDFLNFYLNCYKTREYGKSVMSRSVNQVNINGTKLKNYPI 345 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P + Q I + L + + L+E + S + A TG+ Sbjct: 346 YHPDLYTQKQIIKKLYFLFRETQRLETIYRRKLEALQELKQSILQKAFTGE 396 >gi|48477149|ref|YP_022855.1| type I restriction-modification system specificity subunit [Picrophilus torridus DSM 9790] gi|48429797|gb|AAT42662.1| type I restriction-modification system specificity subunit [Picrophilus torridus DSM 9790] Length = 441 Score = 131 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 54/438 (12%), Positives = 130/438 (29%), Gaps = 36/438 (8%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTG 61 +KD+ IG IP+ W++ + ++L G + + I++ L ++ G Sbjct: 11 FKDTA---IGRIPREWEIKRLNEISELQRGLSYSGKEKSINKIQDGYIFLTLNSIKEDGG 67 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKA-------------IIADFDGICS 108 + +G I+ +++ + S Sbjct: 68 LKSDGWSWIKSDRLKERHFVREGDIVIANTDIGMQRGHILGVPAIVRFPEWYKKEKAVYS 127 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGI-GNIPMPIPPLAEQ 167 L K ++ + Q G + H + ++ +P+PPL EQ Sbjct: 128 MDLSKLNLKISSCDITFLFYYLSFTQQLARKYHTGTGVWHLNLDSWAKDLFLPLPPLEEQ 187 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + + +I+ + E +L L + + D ++ EW Sbjct: 188 KKIADILSTADEKINLIDKEIQLTEKLKNGIMHKLFTEGIGHTEFKDTEIGRIPKEWEIK 247 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQ 284 +K N + + + K L + Sbjct: 248 KLKDVVIKAKSGGTPRRNVADYWNGSISFAKIEDITKSNKYLHVTKELISKKGLENSNAW 307 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 I+ ++ + + I+ + I + Y Sbjct: 308 IIPSNSLLLAIYGSLGLVAINKIDVATNQAIVG----IIVDDKIIYKEFLYYWYLYYKPY 363 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + G + +L + + +PP EQ I ++++ ++++L K + L+ Sbjct: 364 WSRFIKKGTQPNLTLGIILDSIIPLPPFDEQKRIADILSTADEKLELLNLKKQN----LE 419 Query: 405 ERRSSFIAAAVTGQIDLR 422 + + +TG++ ++ Sbjct: 420 NLKKGLMDDLLTGRVRVK 437 >gi|257465994|ref|ZP_05630305.1| restriction modification system DNA specificity domain protein [Fusobacterium gonidiaformans ATCC 25563] gi|315917150|ref|ZP_07913390.1| type I restriction-modification system specificity subunit [Fusobacterium gonidiaformans ATCC 25563] gi|313691025|gb|EFS27860.1| type I restriction-modification system specificity subunit [Fusobacterium gonidiaformans ATCC 25563] Length = 495 Score = 131 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 67/455 (14%), Positives = 145/455 (31%), Gaps = 64/455 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W V + ++N G++ GK++ + + G + + ++ Sbjct: 26 EIPDSWVWVRLGSICEINMGQSPL-GKNVNFEKGIGLIGGPSDMGEQYPDIKRYTIQATK 84 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + I+ + L KAI +D ++ K + P LL+ + + +T + Sbjct: 85 LSTLDDIIVS-IRATLGKAIFSDGKYCLGRGVCAIKSKSINPVLLKYYFM--YITDYLYQ 141 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I G T + + + N+ L+ Q I +K+ + E ++ +K Sbjct: 142 IATGTTFAQISKEDVYNLKFAFSSLSAQQRIVKKLDFLFEKTKKAKKLLQEVKEEIEMRK 201 Query: 200 QALVSYIVTKGLNPDVK------------------------------------------- 216 ++++ L + + Sbjct: 202 ISILNKAFRGELTKNWREENKTGSVLDLLQEIQNEKMKKWEEECREAEKNGSKKPKKIKL 261 Query: 217 -----MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL-----IESNILSLSYGNIIQ 266 M E +PD W+ + T + N Sbjct: 262 SKIEEMIVPKEEEPYKIPDTWKWVRLREVTENNQYGYTSKSTLEGKIKYLRITDIQNENV 321 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 +T ++ + + + +IV K R ++ + + S + ++ Sbjct: 322 DWDTVPYIVEENNNISQFFLRKNDIVIARTGSTTGKSY-RIDKIEDVAVFASYLIRIRVI 380 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I+S YL S + SG + + + ++ L +PP++EQ +I V+ Sbjct: 381 KINSEYLLRFTHSNVYWNQIIELSSGIAQPGVNAQKLENLYFPLPPLEEQQEIVRVLEEV 440 Query: 386 TARIDVLVEKI--EQSIVLLKERRSSFIAAAVTGQ 418 + + E I E+ I LL+ S + A G+ Sbjct: 441 LEKEKKVKELIDLEEQIELLE---KSILDKAFRGK 472 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 61/201 (30%), Gaps = 9/201 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI-LSLSYGNIIQKLETRNMGLKPE 278 S E +PD W ++ ++ N + + + +K Sbjct: 19 SKEEQPYEIPDSWVWVRLGSICEINMGQSPLGKNVNFEKGIGLIGGPSDMGEQYPDIKRY 78 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + ++ +I+ + + A+K I+ L + Sbjct: 79 TIQATKLSTLDDIIVSIRATLGKAIF-----SDGKYCLGRGVCAIKSKSINPVLLKYYF- 132 Query: 339 SYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + Y + +G + EDV L + Q I ++ + + ++ Sbjct: 133 -MYITDYLYQIATGTTFAQISKEDVYNLKFAFSSLSAQQRIVKKLDFLFEKTKKAKKLLQ 191 Query: 398 QSIVLLKERRSSFIAAAVTGQ 418 + ++ R+ S + A G+ Sbjct: 192 EVKEEIEMRKISILNKAFRGE 212 >gi|237731956|ref|ZP_04562437.1| predicted protein [Citrobacter sp. 30_2] gi|226907495|gb|EEH93413.1| predicted protein [Citrobacter sp. 30_2] Length = 394 Score = 131 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 60/420 (14%), Positives = 135/420 (32%), Gaps = 45/420 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W + + G S I + + ++ + S Sbjct: 2 VPKGWTLGTLNDLADTIMGYAFRSEDFVPTGIPLLRMGNLYQNSLDLNRNPVYLPDSFKV 61 Query: 77 TVS--IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL--------VLQPKDVLPELLQG 126 + G ++ G ++ + +TQ+ ++ + + Sbjct: 62 DYKRFLVKPGDLVMSMTGTMGKRDYGFTVEIPSNTQYSLLNQRVLKIVPKNNSSSGYILN 121 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L S + + + G ++ K + +P+ IPPLAEQ I E + D I+ Sbjct: 122 LLRSELILSVLYSFPGGTKQANLSAKQVQELPVFIPPLAEQKKIGEIL----SIWDKAIS 177 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + +++K+AL+ ++T ++ G+ + K + V ++ + Sbjct: 178 VTENLLTNSQQQKKALMQQLLTGN--------KRLLDENGVRFNGKWEKKHLSDVADVYQ 229 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 T S + E+ +I Sbjct: 230 PKTISQSMMSDSGYPVYGANGVIGFYQEFNHETE---------QIAVTCRGST----CGI 276 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + IT M + +L + + + Y + + + ++K Sbjct: 277 VNWTQAKSWITGNAMVINTDNYSYVSKKFLFYTLNGSDLKYLISGSGQPQIT-GNIKTHI 335 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR-GES 425 + +P I+EQ I V++ A I L E+ I LK+ + + + +TG+ ++ E+ Sbjct: 336 INLPCIEEQQKIATVLSAADAEISTL----EKKIACLKDEKKALMQQLLTGKRRVKVDEA 391 >gi|323439266|gb|EGA96992.1| hypothetical protein SAO11_1944 [Staphylococcus aureus O11] gi|323442204|gb|EGA99836.1| hypothetical protein SAO46_1906 [Staphylococcus aureus O46] Length = 417 Score = 131 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 53/404 (13%), Positives = 126/404 (31%), Gaps = 23/404 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ + ++ +G T +I ++ D+ + + + ++ Sbjct: 20 EWEEKKLGEIFQIISGSTPLKSNKKFYENGNINWVKTTDLNNSKVTHSKEKITEYAMNSL 79 Query: 77 TVSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + K +L G + + + + D + L L+ V Sbjct: 80 KLKLVPKNSVLIAMYGGFNQIGRTGLLKIDATINQAISALLMNHETNPEFIQAYLNYQVK 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + K I +P + EQ I E +I+ + + Sbjct: 140 GWKRYAASSRKDPNITKKDIEQFKVPYVSINEQQKIGEFFSKLDRQIELEEQKLELLQQQ 199 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K Q + S + + + +G + + A K + Sbjct: 200 KKGYMQKIFSQELRFKDENGEDYPEWEEKQLGELGVTYAGLSGKAKEDFGFGK-----DV 254 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM--E 312 + ++ + + E V G+I+F + + S + + Sbjct: 255 YVSYVNVFKNNIATLEMVENVSIKPGEKQNNVKFGDILFTTSSEVPHEVGMSSVWLYEKD 314 Query: 313 RGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 + S I+ +LA +RS+++ K+ + G R ++ +++ +L V + Sbjct: 315 NVYLNSFCFGFRTTVSFINPIFLARYLRSFEMRKLITILAQGSTRFNISKKELMKLIVKI 374 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P + EQ I N + +D +E + LK+R+ + Sbjct: 375 PRLDEQNRIIN----LFSILDGGIELQSMKVRKLKKRKQGLLQK 414 >gi|323526111|ref|YP_004228264.1| restriction modification system DNA specificity domain-containing protein [Burkholderia sp. CCGE1001] gi|323383113|gb|ADX55204.1| restriction modification system DNA specificity domain protein [Burkholderia sp. CCGE1001] Length = 443 Score = 131 bits (330), Expect = 2e-28, Method: Composition-based stats. Identities = 82/411 (19%), Positives = 155/411 (37%), Gaps = 25/411 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +PK W + T +E + D + LED+E + + + + + ST Sbjct: 4 LPKGWLETTLGEVVDYGTTLKAEPDEISDDEWVLELEDIEKDKSRIVSRLTFADRKSKST 63 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDVTQR 136 + F+KG +LYGKL PYL K ++AD +G+C+T+ + + Q V + WL Sbjct: 64 KNRFSKGDVLYGKLRPYLNKVVLADSNGLCTTEIIPIKQTAAVDNRYVFHWLRGPRFLSY 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G M + P +PPLAEQ I +K+ + R++ R +L Sbjct: 124 AIGVSHGLNMPRLGTDAGRSAPFILPPLAEQKRIADKLDSVLSRVEAACARMGRVPTILT 183 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 ++A +V L D K + G + + ++ Sbjct: 184 RLRRA---ALVATLLGQDGDAKPTPRIAFG---------SLINSIRGGTTAVPQSDKTAY 231 Query: 257 LSLSYGNIIQKLETRNMGLKPESY---ETYQIVDPGEIVFRF----IDLQNDKRSLRSAQ 309 L ++ Q S E + +++F ++ + + S Sbjct: 232 PILRSSSVRQGRIDFEDVRYLTSEQSGEEKNFIRENDVLFTRLNGNVNYVGNCAVVPSVS 291 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPV 367 + + Y A I Y A+ D+ K A S + + +D+K + + Sbjct: 292 LNKYQYPDRLYCARLKETIVPKYCAYAFALPDIRKEIERRAKSSAGHKRISIQDIKEMEI 351 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP+ EQ + N I A D L + ++++ ++ + +A A G+ Sbjct: 352 PLPPVAEQLRMVNQIERIFATCDRLEKTLDEAKIVADHLTPALLAKAFRGE 402 >gi|228288745|ref|YP_002841997.1| restriction modification system DNA specificity domain protein [Sulfolobus islandicus Y.N.15.51] gi|228014315|gb|ACP50075.1| restriction modification system DNA specificity domain protein [Sulfolobus islandicus Y.N.15.51] Length = 576 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 67/423 (15%), Positives = 139/423 (32%), Gaps = 27/423 (6%) Query: 18 IGAIPKHWKVVPIKR-FTKLNTGRTSESG------KDIIYIGLEDV--ESGTGKYLPKDG 68 IG PK W V +K K +G T +I + ++D+ + Sbjct: 11 IGEFPKDWDVRKLKDVIIKAKSGGTPRRNVPEYWNGNIPFAKIQDITKSGKYLYNTEEFI 70 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + + S I K +L G L I + + + P + + + Sbjct: 71 TEKGLENSNAWIVPKDSLLLTIYGS-LGFVAINKIPVATNQAIIGIIPNKNIIDTEFLYY 129 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + T + + + N +PI PL EQ I E + T TL Sbjct: 130 WYLYFKPYWSKFIKKGTQPNLTLEIVLNSSVPILPLEEQKKIVELLQKATDIYYTLKDYI 189 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 I+ + + + ++TKG+ + +G P WEV+ + + + Sbjct: 190 IQIRNSTETITKVIRKELLTKGIGHRDYV----ETDIGEFPKDWEVRRLNEIAIIRSGFS 245 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQND 301 + + N + ET + + E Y + ++ + Sbjct: 246 ERKRDENSKVIHLRPDNIDNETDRIVFHRIVYIPESPKIERYLLRHLDIVLVNTNGSIDH 305 Query: 302 KRSLRSAQVMERGIITSA----YMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQ 355 L + IT + + + ++ Y+ +L+ Y L F + Sbjct: 306 IGKLGIIDMPLNQKITFSNHLTAIRIVSKDVEPYYIYYLLSWYHLNGSFKKVVKNQAGKW 365 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +L + ++ L + +PP++EQ I ++ I + ++ S + A+ Sbjct: 366 NLNLDTIRNLLIPLPPLEEQKKIVELLQKVDELIIRFNDFLQNLEDEANTLYKSILRLAL 425 Query: 416 TGQ 418 TG+ Sbjct: 426 TGK 428 >gi|93005779|ref|YP_580216.1| restriction modification system DNA specificity subunit [Psychrobacter cryohalolentis K5] gi|92393457|gb|ABE74732.1| restriction modification system DNA specificity domain [Psychrobacter cryohalolentis K5] Length = 419 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 79/441 (17%), Positives = 153/441 (34%), Gaps = 43/441 (9%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 M K YK + IG IP+ W+V I + G+ + VES Sbjct: 1 MNEVKMPEGYKQTE---IGVIPEDWEVKDIGEALTIRHGKDQKQ-----------VESTR 46 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 G+Y P G Q ++ ++ K +L G+ G + I T F Sbjct: 47 GQY-PIFGTGGQMGWASDFLYDKPSVLIGRKGSINKPRYINVPFWTVDTLFYSQVHNGYD 105 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + ID EA + + I N+ + +P EQ I + Sbjct: 106 EKFMFYKFCLIDWMNYNEAS----GVPSLNASTISNVKISVPKKPEQTAIATALSDIDNL 161 Query: 181 IDTLITERIRFIELLKEKKQALVS---YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 I +L + + Q +++ + D K+ +G +P+ WEV F Sbjct: 162 IQSLEKLIAKKEAIKTGTMQQILTGKTRLPEFATRDDGSAKELKQTELGQIPEDWEVIEF 221 Query: 238 FALVTELNRKNTKLIESNILS-----------LSYGNIIQKLETRNMGLKPESYETYQIV 286 L+ E + + I + L+ E + + V Sbjct: 222 GKLLKEFRNGYSFSAKDYIKNGTPIITMSQIGLNGSFQYNPNEVKKWDASQFEHLKDFWV 281 Query: 287 DPGEIVFRFIDLQNDKRSLR---SAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYD 341 G+++ D+ DK + A++ ++ + + S YL +L Sbjct: 282 KDGDLLIAMTDVTPDKNLIGQMTIAELTHTALLNQRVGLLRLNKDLAQSNYLRYLSSLPL 341 Query: 342 LCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + S G++ ++ +++K+ V +P ++EQ I +++ A I L E + Sbjct: 342 WRTYCKGVASLGVQANIGTKEIKQASVTLPLVEEQTAIATILSDMDAEIQAL----EGRL 397 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 K+ + + +TG++ L Sbjct: 398 EKTKDIKQGMMQQLLTGKVRL 418 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 73/202 (36%), Gaps = 21/202 (10%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +G++P+ WEVK +T + K+ K +ES ++ + L Sbjct: 14 EIGVIPEDWEVKDIGEALTIRHGKDQKQVESTRGQYPIFGTGGQMGWASDFLYD------ 67 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 P ++ R + + ++ + + +G D ++ + D Sbjct: 68 ---KPSVLIGRKGSINKPRYINVPFWTVDTLFYSQVH-----NGYDEKFMFYKFCLIDWM 119 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 A G SL + + + VP EQ I ++ ID L++ +E+ I Sbjct: 120 NYNEASG---VPSLNASTISNVKISVPKKPEQTAIATALSD----IDNLIQSLEKLIAKK 172 Query: 404 KERRSSFIAAAVTGQIDLRGES 425 + ++ + +TG+ L + Sbjct: 173 EAIKTGTMQQILTGKTRLPEFA 194 >gi|301166116|emb|CBW25691.1| putative type I restriction enzyme specificity protein [Bacteriovorax marinus SJ] Length = 412 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 69/413 (16%), Positives = 134/413 (32%), Gaps = 23/413 (5%) Query: 28 VPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 V + K G T G I ++ +V + + S+ Sbjct: 4 VKLGNHIKSYAGGTPSRGNMDYYRNGTIPWVKSGEVCRKYITSVEEKITEEAVQGSSAKW 63 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F + +L G + I G + L + D +LL+ + + Sbjct: 64 FPENSVLVALYGATAGQVSITKIKGTSNQAVLSVNGLDDFDNEYLYYLLTHSTPELLVK- 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 +G+ + K I + + + LAEQ I E + + I+ E + L K Q Sbjct: 123 VQGSGQPNLSKKIIDELQVELKELAEQKKIAEILTSVDKVIELTEIEIEKLKNLKKGMMQ 182 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN-----TKLIESN 255 L++ + D + W F V + N + E Sbjct: 183 DLLTKGIRHTKFKDTPIGKIPESWECSQIKDLIKNGFIEKVQDGNHGGAYPRVSDFTEKG 242 Query: 256 ILSLSYGNIIQKLETRNMGL--KPESYETYQIV---DPGEIVFRFIDLQNDKRSLRSAQV 310 I +S N+ + + PESY + PG+++F + ++ Sbjct: 243 IPFVSAKNLHEHGYVKFNECPKLPESYLPKLRIGFGKPGDVIFAHNATVGPTAYVPNSGQ 302 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLV 369 ++ +D+ YL + S MG R + K + + V Sbjct: 303 DFIVSTSTTLYRSNSEKLDNYYLYASLLSPLFQTQISKVMGQTTRNQVPITAQKEMYLTV 362 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 PP+ EQ +I N + + L+ K E+ + L + + +TG++ ++ Sbjct: 363 PPLNEQNEINNAVKAI---LGTLISK-EEKLQKLVSLKKGLMQDLLTGKVRVK 411 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 74/220 (33%), Gaps = 23/220 (10%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKL---------NTGRTSE-----SGKDIIYIGLE 54 ++KD+ IG IP+ W+ IK K N G + K I ++ + Sbjct: 193 KFKDTP---IGKIPESWECSQIKDLIKNGFIEKVQDGNHGGAYPRVSDFTEKGIPFVSAK 249 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSI--FAKGQILYGKLGPYLRKAII----ADFDGICS 108 ++ + +S + I G +++ A + DF S Sbjct: 250 NLHEHGYVKFNECPKLPESYLPKLRIGFGKPGDVIFAHNATVGPTAYVPNSGQDFIVSTS 309 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 T + + L LLS +I + T + + + +PPL EQ Sbjct: 310 TTLYRSNSEKLDNYYLYASLLSPLFQTQISKVMGQTTRNQVPITAQKEMYLTVPPLNEQN 369 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I + A + + + + + L K Q L++ V Sbjct: 370 EINNAVKAILGTLISKEEKLQKLVSLKKGLMQDLLTGKVR 409 >gi|85711391|ref|ZP_01042450.1| putative type IC restriction-modification system specificity subunit [Idiomarina baltica OS145] gi|85694892|gb|EAQ32831.1| putative type IC restriction-modification system specificity subunit [Idiomarina baltica OS145] Length = 419 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 59/419 (14%), Positives = 149/419 (35%), Gaps = 24/419 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLED---VESGTGKYLPKDGNSR 71 +PK W+ + + +G T I ++ L D ++ G K+ ++ Sbjct: 10 VPKRWRYELLDKMATRCSGHTPSKSYPEYWNGGIKWVSLTDSYRLDQGYIYETDKEISAE 69 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ + ++ + + A++A+ + + + Sbjct: 70 GIKNSSAQLHPAETVILSRDAGIGKSAVLAEPMAVSQHFIAWICDNKETLHSWFLYNWLQ 129 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 E G+T+ + + PP +EQ I + + D IT + Sbjct: 130 LNKPEFERQAVGSTIKTIGLPYFKKLKVLAPPFSEQQKIAQIL----STWDKAITTTEQL 185 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + +++K+AL+ ++T + ++G+ + G F + K+ Sbjct: 186 LANSQQQKKALMQQLLT---GKKRLLDENGVRFGGEWECFTLNDLFTFKRGKGLSKSDIS 242 Query: 252 IESNILSLSYGNIIQKLETRNMGLKP-ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + YG + + + ++ + G+I+ + + + Sbjct: 243 TTGKNRCVLYGELYTRYAEVIDNVNSRTDKNEAELSESGDILIPSSTTTSGIDLANATAI 302 Query: 311 MERGII-TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 +E G++ ++P S+ + ++ ++ G L D+K+L V Sbjct: 303 LENGVLLGGDINILRPRSKLSSQFMAHVLTHIKRYEIASLAQGITIIHLYGSDLKKLKVW 362 Query: 369 VP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +P + EQ I+ V++ ID K++ + +LK+ + S + +TG+ ++ + + Sbjct: 363 IPRKLDEQIKISQVLSA----IDKASLKLQIKLDILKQEKKSLMQQLLTGKRRVKVDEE 417 >gi|58038319|ref|YP_190288.1| Type I restriction-modification enzyme S subunit [Gluconobacter oxydans 621H] gi|58000733|gb|AAW59632.1| Type I restriction-modification enzyme S subunit [Gluconobacter oxydans 621H] Length = 402 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 64/422 (15%), Positives = 139/422 (32%), Gaps = 45/422 (10%) Query: 21 IPKHWKVVPIKRFTKLN----TGRTSESGKDI------IYIGLEDVESGTGKY-LPKDGN 69 +P+ W I G+ + +++ ++V + L + Sbjct: 4 LPEGWDCKNINEIGIQVIDGDRGKNYPKDNEFQATGSCLFLSAKNVTKAGFDFSLGQFIT 63 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC-----STQFLVLQPKDVLPELL 124 S + G I+ G A L + P+ L Sbjct: 64 SEKHKILNKGAVELGDIVITTRGSIGHFAYYNQKKYQTIRINSGMAILRSNVNYINPDFL 123 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 S + +IE G+ I +P+PPL+EQ I + D Sbjct: 124 YEVCRSQIIKTQIEKASFGSAQPQLTIAIIKKFRIPLPPLSEQKKIAAIL----STWDRA 179 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I + + +++K+AL+ ++ +G + + W K + + Sbjct: 180 IEGTEKLLANSQQQKKALMQQLL------------TGKKRLPGFSGKWLWKRSKEIFKSI 227 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE-SYETYQIVDPGEIVFRFIDLQNDKR 303 + KN + + ++ + + P+ S + Y++V+PG + Q Sbjct: 228 SIKNNPMDCELLSVTQDQGVVLRSLLERRVVMPDGSVQGYKLVNPGNFIISLRSFQG--- 284 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLR--QSLKFE 360 RG+++ AY + + +SY+ G+R + + ++ Sbjct: 285 --GLEYSYYRGLVSPAYTVLDNKIEIENDFYKFYFKSYNFIGHLAVATIGIRDGKQISYQ 342 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 D + + PP+ EQ I V+ + IE +V L++ + + + +TG+ Sbjct: 343 DFSFIKLPYPPLPEQQAIAAVLTTADEE----ITAIESDLVRLRQEKKALMQQLLTGKRR 398 Query: 421 LR 422 + Sbjct: 399 VT 400 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 70/193 (36%), Gaps = 10/193 (5%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFR 294 + N + L LS N+ + ++G S + + V+ G+IV Sbjct: 24 DRGKNYPKDNEFQATGSCLFLSAKNVTKAGFDFSLGQFITSEKHKILNKGAVELGDIVIT 83 Query: 295 FIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SG 352 + I + A + + I+ +L + RS + Sbjct: 84 TRGSIGHFAYYNQKKYQTIRINSGMAILRSNVNYINPDFLYEVCRSQIIKTQIEKASFGS 143 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + L +K+ + +PP+ EQ I +++ D +E E+ + ++++ + + Sbjct: 144 AQPQLTIAIIKKFRIPLPPLSEQKKIAAILSTW----DRAIEGTEKLLANSQQQKKALMQ 199 Query: 413 AAVTGQIDLRGES 425 +TG+ L G S Sbjct: 200 QLLTGKKRLPGFS 212 >gi|282850459|ref|ZP_06259838.1| type I restriction modification DNA specificity domain protein [Veillonella parvula ATCC 17745] gi|282579952|gb|EFB85356.1| type I restriction modification DNA specificity domain protein [Veillonella parvula ATCC 17745] Length = 422 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 64/413 (15%), Positives = 132/413 (31%), Gaps = 29/413 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS- 76 + W+ + ++TG +S + + I +++ T L GN D S Sbjct: 18 EDWEQRKLGECIDISTGYPFDSQDFNENGEYLVITNGNIQENTPFVLNNVGNRIDLDDSL 77 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I +L G R AI+ + + + + V + K P + L + Q Sbjct: 78 KKYILDIDDLLITMDGTVGRVAIVVNNKLVLAQR--VCRIKSNEPYYIYQLLSKNNFIQS 135 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + I G T+ H I P ++ + KI D LIT R + LK Sbjct: 136 MNKIGHGGTIKHISLSEISEYQDFYPKSQKERI---KISTVLTNCDKLITLHQRKLNNLK 192 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIE------WVGLVPDHWEVKPFFA---LVTELNRK 247 K++AL+ + K +++ G +G + ++ + N K Sbjct: 193 LKRKALLQKLFPKNGEGYPELRFPGFTDAWEQRKLGEIFEYLQNNTLSRDSLNYKIPNIK 252 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N + + + K S + ++ G+++F + Sbjct: 253 NIHYGDILVKFNEILDGSNKDIPYINPDLDLSKFSKSLLRDGDVIFSDTAEDDTVGKAIE 312 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAW---LMRSYDLCKVFYAMGSGL-RQSLKFEDVK 363 Q + I S + + + S + G+ S+ +K Sbjct: 313 LQNVNAPFILSGLHTIPCRPLIPFGKGYLGNFFNSNSYRLQIRPLVQGIKVSSISKSALK 372 Query: 364 RLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + P + EQ I + I ++ ++ + L+ ++ S + Sbjct: 373 DTMIKYPKNLDEQEKIGS----LFQSITKMITLHQRKLKHLQIQKKSLLQKLF 421 >gi|206579118|ref|YP_002240752.1| type I restriction modification DNA specificity domain protein [Klebsiella pneumoniae 342] gi|206568176|gb|ACI09952.1| type I restriction modification DNA specificity domain protein [Klebsiella pneumoniae 342] Length = 455 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 69/438 (15%), Positives = 149/438 (34%), Gaps = 33/438 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK----LNTGRTSESG---KDIIYIGLEDVESGTGK 62 +K + V G IP+ W + + T+ ++ G I I + D+ +G + Sbjct: 24 FKLTEV---GVIPEDWTIEALSAITEPSRPISYGIVQTGPAVINGIPCIRVVDISNGKIQ 80 Query: 63 YLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKD 118 S + S +I +G I+ G AI+ + L+ + Sbjct: 81 TGNLITTSGKISESYRRTILQEGDIVIPLRGKVGEIAIVDRNIRGANLTRGVALIALKDE 140 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAE 177 P+ ++ +L S + R+ A G+ + + P+ +P + EQ+ I + Sbjct: 141 YYPQYVKQYLSSRESADRLLASMNGSALQEITIATLRRFPLAVPRSIKEQIAIACVLSDT 200 Query: 178 TVRIDTLITERIRFIELLKEKKQALVS---YIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 I+TL + + Q L++ + L D +K +G +P+ W + Sbjct: 201 DKLINTLEQFITKKQAIKTATMQKLLTGKTRLPQFTLRADGMVKGYKKSELGEIPEDWTI 260 Query: 235 KPFFALVTELNRKNTKL---------IESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 ++ + T I S + + +I Sbjct: 261 TLLNDVIDSCSSGATPYRGISEYYKGNNRWITSGELNYCVINDTIEKISDSAIKDTNLKI 320 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 G + L+ V + + MA+ P+ + + Y+ + Sbjct: 321 HPAGTFLMAITGLEAAGTRGACGIVGKPSATNQSCMAIYPNNKLDSNYLYHWYVYNGDTL 380 Query: 346 FYAMGSGLRQ-SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + G +Q S ++++P+ +P KEQ I +++ I L +Q + Sbjct: 381 AFKYCQGTKQLSYTAGLIRKIPLFLPTDKKEQTAIAAILSDMDKDIQTL----QQRLDKT 436 Query: 404 KERRSSFIAAAVTGQIDL 421 ++ + + +TG+ L Sbjct: 437 RQLKQGMMQELLTGKTRL 454 >gi|220934948|ref|YP_002513847.1| type I restriction-modification system specificity subunit [Thioalkalivibrio sp. HL-EbGR7] gi|219996258|gb|ACL72860.1| type I restriction-modification system specificity subunit [Thioalkalivibrio sp. HL-EbGR7] Length = 419 Score = 131 bits (328), Expect = 3e-28, Method: Composition-based stats. Identities = 61/423 (14%), Positives = 126/423 (29%), Gaps = 37/423 (8%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGK 62 YK + V G +P W+V+ + +F + +G+ G+ + YI + D+ G Sbjct: 22 YKQTEV---GLVPLDWEVISLDKFADVTSGKRLPLGRSLTEHETPHPYIRVSDMRPGYVC 78 Query: 63 YLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDV 119 I G I + + Sbjct: 79 VDEIRYVPVDVFPKIKRYRIYTDDIFISVAGTLGIVGKIPKRLNGANLTENADRITNIKC 138 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAET 178 L L+S + +IE+I I +P+PP EQ I + Sbjct: 139 SQNYLLHVLMSPLIQSKIESIQTVGAQPKLALTRIRKFEIPLPPTDREQQAIASALSDAD 198 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 I + + ++ KQ + ++T + ++ +G V PF Sbjct: 199 AL----IESLSQLLAKKRQIKQGAMQELLTGKRRLPGFSGEWDVKRLGSVLKFQVGFPF- 253 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 ++ + + + Y +V G+++ Sbjct: 254 ---------SSIYFNDEFQGIRLIKNRDLKASDQIISYTGDYRHEFLVKDGDLLIGMDGD 304 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + ++ V P A+ L K+ + S + L Sbjct: 305 -----FIPCLWGEGVALLNQRVGRVIPLSGLDAKFAYYYLIAPLKKIEDSTSSTTVKHLS 359 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 DV+ + +P ++EQ I ++ A + +E + ++ + + A +TG+ Sbjct: 360 HGDVEGIEEPLPEVEEQIAIATTLSDMDAE----IATLEAKLAKARQLKQGMMQALLTGR 415 Query: 419 IDL 421 I L Sbjct: 416 IRL 418 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 70/214 (32%), Gaps = 18/214 (8%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKL------IESNILSLSYGNIIQK----LETRNM 273 VGLVP WEV + K L E+ + ++ E R + Sbjct: 26 EVGLVPLDWEVISLDKFADVTSGKRLPLGRSLTEHETPHPYIRVSDMRPGYVCVDEIRYV 85 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + +I + + + + YL Sbjct: 86 PVDVFPKIKRYRIYTDDIFISVAGTLGIVGKIPKRLNGANLTENADRITNIKCSQN--YL 143 Query: 334 AWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDV 391 ++ S + ++ + G + L +++ + +PP EQ I + ++ A I+ Sbjct: 144 LHVLMSPLIQSKIESIQTVGAQPKLALTRIRKFEIPLPPTDREQQAIASALSDADALIES 203 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 L + + + ++ + + +TG+ L G S Sbjct: 204 LSQLLAKK----RQIKQGAMQELLTGKRRLPGFS 233 >gi|271498974|ref|YP_003331999.1| restriction modification system DNA specificity domain-containing protein [Dickeya dadantii Ech586] gi|270342529|gb|ACZ75294.1| restriction modification system DNA specificity domain protein [Dickeya dadantii Ech586] Length = 396 Score = 131 bits (328), Expect = 3e-28, Method: Composition-based stats. Identities = 64/417 (15%), Positives = 130/417 (31%), Gaps = 35/417 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +PK W +V + + + I +V G S++ Sbjct: 2 VPKGWSLVEANEVCESISVGVVIKPAQYYVDESVGIKAFRSANVREGFINDSGWVYFSQK 61 Query: 73 SDTSTVS-IFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWL 128 + + G +L + G ++ + Q VLPE L + Sbjct: 62 GHLANKNSQLKSGDVLIVRTGYPGTACVVTPEFEGANAIDIVIARPQKDKVLPEYLCAYT 121 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S ++ + G H + + + +PPL EQ I + + I T Sbjct: 122 NSSVGKSQVLNLQGGMAQKHLNVSAYQTLKIKLPPLLEQKKIAKILSTWDKAIATTEQLL 181 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + K Q L+ +G + + WE + + + Sbjct: 182 TNSQQQKKVLMQELL----------------TGKKRLPGFSGKWEYYTLSDIAVIVMGSS 225 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 K N + I + +N P + + E + I L A Sbjct: 226 PKSDAYNENGVGLPLIQGNADIKNRRSVPRIFTSEI---TKECLPDDILLSVRAPVGTIA 282 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 I ++ + + + K + +S+ +D+K+L + Sbjct: 283 ISNHNACIGRGIATIRAKMDFNQAFIYQWLLWFEPKWYSLSQGSTFESINSDDIKQLKIR 342 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 VP I+EQ I ++ V I+ L +Q + LK+ + + + +TG+ ++ E+ Sbjct: 343 VPSIEEQKSIAKILAVADGEIETL----KQKLHHLKQEKKALMQQLLTGKRRVKTEA 395 >gi|218680840|ref|ZP_03528737.1| type I restriction-modification system specificity determinant [Rhizobium etli CIAT 894] Length = 482 Score = 131 bits (328), Expect = 3e-28, Method: Composition-based stats. Identities = 71/268 (26%), Positives = 131/268 (48%), Gaps = 8/268 (2%) Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 P+P L Q I + ET RID LI + R +E+L+E+K A+ + GL+ + Sbjct: 6 PVPDLDAQRAIAAFLDRETTRIDKLIETKERQVEVLREQKSAITKEYIHSGLHAGRERVA 65 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + W L+P W+ + L R+ ++ + +G I++ N+ PE Sbjct: 66 TQNSWFPLIPQGWQPRRMRFLFRAAKRQGMPDLDVLSVYRDFGVILKSSRDDNINKTPED 125 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMR 338 +YQ+V+PG++V + + RGI + Y+ +P ++ Y+ +L+R Sbjct: 126 LSSYQLVEPGDLVVNKMKAWQGSLGISEL----RGITSPDYLVYRPVAPMNGRYMHYLLR 181 Query: 339 SYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + +F + +G+ + L+ + +P + EQ +I I+ T+RI+ +V+ Sbjct: 182 TRPMPSLFLTISNGIRIDQWRLEHAKFMDVVAWLPSLDEQAEIAAAIDARTSRIERIVKS 241 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + SI LL+E R++ I AAV G ID+R Sbjct: 242 VSDSIELLREHRAALITAAVAGHIDIRE 269 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 75/203 (36%), Gaps = 13/203 (6%) Query: 17 WIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL----PKDGNSRQ 72 W IP+ W+ ++ + + + + + + V G L + N Sbjct: 70 WFPLIPQGWQPRRMRFLFR------AAKRQGMPDLDVLSVYRDFGVILKSSRDDNINKTP 123 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW---LL 129 D S+ + G ++ K+ + I++ GI S +LV +P + + Sbjct: 124 EDLSSYQLVEPGDLVVNKMKAWQGSLGISELRGITSPDYLVYRPVAPMNGRYMHYLLRTR 183 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + ++ +P L EQ I I A T RI+ ++ Sbjct: 184 PMPSLFLTISNGIRIDQWRLEHAKFMDVVAWLPSLDEQAEIAAAIDARTSRIERIVKSVS 243 Query: 190 RFIELLKEKKQALVSYIVTKGLN 212 IELL+E + AL++ V ++ Sbjct: 244 DSIELLREHRAALITAAVAGHID 266 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 15/47 (31%), Positives = 28/47 (59%) Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + + VP + Q I ++ ET RID L+E E+ + +L+E++S+ Sbjct: 2 SVNIPVPDLDAQRAIAAFLDRETTRIDKLIETKERQVEVLREQKSAI 48 >gi|186684994|ref|YP_001868190.1| restriction modification system DNA specificity subunit [Nostoc punctiforme PCC 73102] gi|186467446|gb|ACC83247.1| restriction modification system DNA specificity domain protein [Nostoc punctiforme PCC 73102] Length = 530 Score = 131 bits (328), Expect = 3e-28, Method: Composition-based stats. Identities = 65/473 (13%), Positives = 136/473 (28%), Gaps = 72/473 (15%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 + +P+ W+ + ++ G T I ++ +V + Sbjct: 5 LTELPEGWQWKNLGEVFEIFVGATPSRKIPEYWDGSIPWVSSGEVAFCEIYETRETITEL 64 Query: 72 QSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 ++ + G +L G +G +A I + ++ ++ + Sbjct: 65 GLKNTSTELHPPGTVLLGMIGEGKTRGQAAILKIYATHNQNSAAIRVSEIGLPPEYVYYF 124 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 +R I G + + + P+PPL EQ I I R Sbjct: 125 LKLEYERTRQIGSGNNQQALNKSRVQLMSFPVPPLNEQKRIVANIEELNDRTQRAKEALD 184 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE-------------------------- 223 +L +Q++++ L D + ++ +E Sbjct: 185 SIPQLCDRFRQSVLAAAFRGDLTADWRDQNPDVEPASVLLERIRRDRRCRWEELEVAKMQ 244 Query: 224 ------------------------WVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL-- 257 + +P+ W + + N + E Sbjct: 245 SKGKVVEECKWKEKYQEPDPLSNFDLPELPNGWVWTKWEQVGFCQNGRAFPSKEYQTNGV 304 Query: 258 ------SLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFI---DLQNDKRSLR 306 +L I+ ++ L + E Y ++ E+V + Sbjct: 305 KLLRPGNLHVSGEIEWNDSNTRYLSEDWAEQYPDYLISTNELVINLTAQSLADEFLGRIC 364 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL 365 ER ++ + P I +L WL +S + +G Q + + + Sbjct: 365 LTGEDERCLLNQRIARLVPIIISPRFLFWLFKSKLFRSYVDDLNTGSLIQHIFTPQINKF 424 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP+KEQ I N+I + I+ + K Q S +A A G+ Sbjct: 425 HFPLPPLKEQQMIVNLIETQINSIENIGLKAGQMQNAFPHLNQSILAKAFRGE 477 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 71/203 (34%), Gaps = 9/203 (4%) Query: 223 EWVGLVPDHWEVKPFFALVTELN-----RKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 + + +P+ W+ K + RK + + +I +S G + Sbjct: 3 DELTELPEGWQWKNLGEVFEIFVGATPSRKIPEYWDGSIPWVSSGEVAFCEIYETRETIT 62 Query: 278 E---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E + ++ PG ++ I + ++ SA + V G+ Y+ Sbjct: 63 ELGLKNTSTELHPPGTVLLGMIGEGKTRGQAAILKIYATHNQNSAAIRVSEIGLPPEYVY 122 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + ++ G+ +Q+L V+ + VPP+ EQ I I R E Sbjct: 123 YFLKLEYERTRQIGSGN-NQQALNKSRVQLMSFPVPPLNEQKRIVANIEELNDRTQRAKE 181 Query: 395 KIEQSIVLLKERRSSFIAAAVTG 417 ++ L R S +AAA G Sbjct: 182 ALDSIPQLCDRFRQSVLAAAFRG 204 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 71/212 (33%), Gaps = 15/212 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVE-SGTGKYLPKDGNSRQ 72 + +P W ++ GR S + + + ++ SG ++ + Sbjct: 270 LPELPNGWVWTKWEQVGFCQNGRAFPSKEYQTNGVKLLRPGNLHVSGEIEWNDSNTRYLS 329 Query: 73 SDTST---VSIFAKGQILYGKL-----GPYLRKAIIA--DFDGICSTQFLVLQPKDVLPE 122 D + + + +++ +L + + D + + + L P + P Sbjct: 330 EDWAEQYPDYLISTNELVINLTAQSLADEFLGRICLTGEDERCLLNQRIARLVPIIISPR 389 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L S ++ + G+ + H I P+PPL EQ +I I + I+ Sbjct: 390 FLFWLFKSKLFRSYVDDLNTGSLIQHIFTPQINKFHFPLPPLKEQQMIVNLIETQINSIE 449 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 + + + Q++++ L P Sbjct: 450 NIGLKAGQMQNAFPHLNQSILAKAFRGELVPQ 481 >gi|291277578|ref|YP_003517350.1| putative type I restriction-modification system S protein [Helicobacter mustelae 12198] gi|290964772|emb|CBG40628.1| putative type I restriction-modification system S protein [Helicobacter mustelae 12198] Length = 441 Score = 130 bits (327), Expect = 4e-28, Method: Composition-based stats. Identities = 56/421 (13%), Positives = 127/421 (30%), Gaps = 31/421 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P + ++ + G T +I + +ED+ + Sbjct: 13 PHGVEFKTLEEVFTIGNGYTPSKKNPEFWENGNIPWFRMEDIRQNGRILEDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F KG I+ A++ D + + +F L K + + Sbjct: 73 LKGGKLFPKGSIIISTTATIGEHALLI-VDSLANQRFTFLSKKVNCDIALDEKYFFYHCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + D P+PPL Q I + + T L TE Sbjct: 132 VLGEWCRKNINVSGFASVDMAAFRKYKFPLPPLEVQREIVKILDTFTELNTELNTELKLR 191 Query: 192 IELLKEKKQALVS--------YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 + + + L+S + L K + L P E + + Sbjct: 192 KKQYEYYRNWLLSFGDVDASKEGAEQRLRDKSYPKALKALLLSLCPHGVEFRKLGEVGRF 251 Query: 244 LNRK---NTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFI 296 + L + YG I + E + + + P +I+ Sbjct: 252 TRGNGLLKSNLQTHGKPVVHYGQIYTRYGLATEKTISYVSETLFAKLKKAKPKDILIAVT 311 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ- 355 + + + + S M + ++A+ ++ K + +G + Sbjct: 312 SENVKDVGKSTVWLGDEEVAFSGEMYSYSTDQNPKFIAYYFQTSKFQKEKEKIVTGTKVI 371 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 + +D+K++ + +PP++ Q +I +++ + + L I I K+ R + Sbjct: 372 RIHEDDLKQIKIPLPPLEVQREIVKILDDFSTLTEDLSSGIPAEIAARKKQYEYYRDKLL 431 Query: 412 A 412 Sbjct: 432 T 432 >gi|2921239|gb|AAC64909.1| putative type I S-subunit protein [Streptococcus thermophilus] Length = 412 Score = 130 bits (327), Expect = 4e-28, Method: Composition-based stats. Identities = 63/410 (15%), Positives = 139/410 (33%), Gaps = 27/410 (6%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV----ESGTGKYLPKD 67 +P+ W+ + + G T + + G D E G Y+ K Sbjct: 11 EVPELRFKGFTDDWEERKLGELANIVGGGTPSTSNPEYWDGDIDWYAPAEIGEQSYVSKS 70 Query: 68 GNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + S+ I G +L+ AI+A + F + P + Sbjct: 71 KKTITELGLKNSSARILPVGTVLFTSRAGIGNTAILAKE-ATTNQGFQSIVPDQNKLDSY 129 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + ++ + E G+T K + + + +P L+EQ I ++D Sbjct: 130 FIFSRTNELKRYGEVTGAGSTFVEVSGKQMSKMSIMVPELSEQQKIGSF----FKQLDET 185 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 IT R ++LLKE+K+ + + K +++ G E+ T Sbjct: 186 ITLHQRKLDLLKEQKKGFLQKMFPKNGAKVPELRLKGFTDDWEERKLGELANLVGGGTPR 245 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 N + + +I + I + + K + + + + + Sbjct: 246 TS-NPEYWDGDIDWYAPA-EIGEQSYVSKSKKTITELGLKNSSARILPVGTVLFTSRAGI 303 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVK 363 +A + + Y+++ P R+ +L + G+G + + + Sbjct: 304 GNTAILAKEATTNQGYLSIVPDQNKLDSYFIFSRTNELKRYGEVTGAGSTFVEVSGKQMS 363 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ ++VP + EQ I +D + ++ + LLKE++ F+ Sbjct: 364 KMSIMVPELSEQQKIGLF----FKHLDDTITFHQRKLDLLKEQKKGFLQK 409 >gi|156976838|ref|YP_001447744.1| type I restriction-modification system S subunit [Vibrio harveyi ATCC BAA-1116] gi|156528432|gb|ABU73517.1| hypothetical protein VIBHAR_05614 [Vibrio harveyi ATCC BAA-1116] Length = 432 Score = 130 bits (327), Expect = 4e-28, Method: Composition-based stats. Identities = 64/432 (14%), Positives = 148/432 (34%), Gaps = 49/432 (11%) Query: 29 PIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + N G + + K ++++ +++SGT + + + I Sbjct: 11 RLGELASGNRGVSYKPENLKAAIDDKSVVFLRSNNIQSGTLNFENVQIVPDSLVSDS-QI 69 Query: 81 FAKGQI----------LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 KG I L GK G + G + F + E ++ S Sbjct: 70 LKKGDIAVCMSNGSRQLVGKSGMLQHEVEYPLTVGAFCSVF--RCQNEDDSEYVRYLFQS 127 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 I+ G+ +++ + I +P P A + I E + ID I Sbjct: 128 QAYQHGIDVTLAGSAINNLKNSDVEAIEVPTAPKALRKKIAEIL----STIDNQIDATQA 183 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW---------VGLVPDHWEVKPFFALV 241 I+ KQ +++ + ++G++P+ K +E +G++P W+VK + Sbjct: 184 LIDKYTAIKQGMMADLFSRGIDPETKALRPTLEEAPELYHKTPLGMLPKGWDVKTLGDIS 243 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----------VDPGEI 291 ++ + + I L ++ + +S + I + PG+I Sbjct: 244 EKITSGSRDWAKFYSPEGDLFVRISNLTREHVNFRWDSVKHVNIGGGSEGERTQLQPGDI 303 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG- 350 + + +A + + +G ++ ++ + S + F Sbjct: 304 LVSITADLGIVGVVPENMGRAYINQHTALIRLSTYGENARFIGNYLSSRCGQEQFEKNND 363 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 SG + + + L +P KEQ I + I+ +D ++ +++ + Sbjct: 364 SGAKAGINLPTIASLRCPIPEEKEQLLIASKIDA----LDEVIADLKREKSKSLSLKQGL 419 Query: 411 IAAAVTGQIDLR 422 + +TG++ + Sbjct: 420 MQDLLTGKVSVP 431 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 72/202 (35%), Gaps = 12/202 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKY---LPKDGN 69 +G +PK W V + ++ T + + S + +++ + ++ + K N Sbjct: 227 LGMLPKGWDVKTLGDISEKITSGSRDWAKFYSPEGDLFVRISNLTREHVNFRWDSVKHVN 286 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQ--FLVLQPKDVLPELLQ 125 + G IL ++ + G + + L + Sbjct: 287 IGGGSEGERTQLQPGDILVSITADLGIVGVVPENMGRAYINQHTALIRLSTYGENARFIG 346 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L S ++ E + + + I ++ PIP EQ+LI KI A I L Sbjct: 347 NYLSSRCGQEQFEKNNDSGAKAGINLPTIASLRCPIPEEKEQLLIASKIDALDEVIADLK 406 Query: 186 TERIRFIELLKEKKQALVSYIV 207 E+ + + L + Q L++ V Sbjct: 407 REKSKSLSLKQGLMQDLLTGKV 428 >gi|154245043|ref|YP_001416001.1| restriction modification system DNA specificity subunit [Xanthobacter autotrophicus Py2] gi|154159128|gb|ABS66344.1| restriction modification system DNA specificity domain [Xanthobacter autotrophicus Py2] Length = 450 Score = 129 bits (325), Expect = 6e-28, Method: Composition-based stats. Identities = 67/420 (15%), Positives = 135/420 (32%), Gaps = 23/420 (5%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLP 65 S +W +P W + G T +G + + ++ D+ Y+ Sbjct: 2 SEARW--QVPHSWLWASFGEVADIVGGGTPPTGDEANFTKQGVPWLTPADLTGYRETYIS 59 Query: 66 KDGNSRQSD---TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + S + KG +L+ P IA + + F K + Sbjct: 60 RGRRDLSEKGYRESAARLLPKGTVLFSSRAPV-GYCAIASENVSTNQGFKSFILKGDI-S 117 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + T+ E+ G T + +P+PPL EQ I KI + T + Sbjct: 118 PEYVRHYLLGSTEYAESKASGTTFKELSGSRATELALPLPPLPEQRRIVAKIDSLTAKSR 177 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 L+++ KQA+++ L D +G + + + Sbjct: 178 RARDHLEHIPRLVEKYKQAILAAAFDGRLTELSP-HDIVHPELGELIEFGPQNGLYLPKD 236 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 L N N I + + ++ + G+++ ++ + Sbjct: 237 RYGEGTPILRIQNYGF----NFIDEPTNWHRVTVSDAIAAQFAMSDGDLIINRVNSPSHL 292 Query: 303 R-SLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLK 358 S+ + M I S M ++ + + ++ + S + S+ Sbjct: 293 GKSMVVTKAMAGAIFESNMMRIRLNALAEPKFVQLYLSSSQGRGSLTKDAKWAVNQASIN 352 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 DV R PV +P + +Q + + I A ID L + + L+ + +A A G+ Sbjct: 353 QGDVSRTPVPLPGLSDQIAVLDRIETAFAWIDRLAAEATSARTLIDRLDQAVLAKAFRGE 412 >gi|10956224|ref|NP_053442.1| specificity subunit Lla33I [Lactococcus lactis] gi|22855174|ref|NP_690625.1| type I R/M system specificity subunit [Lactococcus lactis subsp. lactis bv. diacetylactis] gi|6573270|gb|AAF17614.1|AF207855_3 specificity subunit Lla33I [Lactococcus lactis] gi|22775344|dbj|BAC11868.1| type I R/M system specificity subunit [Lactococcus lactis subsp. lactis bv. diacetylactis] Length = 414 Score = 129 bits (325), Expect = 6e-28, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 148/416 (35%), Gaps = 41/416 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGK 62 +P+ W+ + T + G + + DI ++ + DV G+ Sbjct: 15 KVPELRFPGFTDDWEERKLGSLTTVVRGASPRPIQDPKWFDKESDIGWLRIADVTEQNGR 74 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + + + + +L + + G+ + L P E Sbjct: 75 IYHLEQHISKLGQEKTRVLTEPHLLLSIAATVGKPVVNYVKTGVHDGFLIFLNPTF---E 131 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + + + + + + N + +P EQ I ++D Sbjct: 132 REFMFQWLEMFRPKWQKYGQPGSQVNLNSELVRNQEIVLPNYKEQQKIGSF----FKQLD 187 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 IT R ++LLKE+K+ + + K +++ +G D WE + ++ Sbjct: 188 NTITLHQRKLDLLKEQKKGYLQKMFPKNGAKVPELRFAG------FADDWEERKLSSMTN 241 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQIVDPGEIVF--RFIDLQ 299 N K+ + +S L N+ + + + E + ++V + Sbjct: 242 YKNGKSHEDKQSTSGKLELINLNSISISGGLKHSGKFIDEADDTLQKDDLVMILSDVGHG 301 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL-RQSL 357 + + +R ++ ++P+ D +L + ++ F A G+G+ + ++ Sbjct: 302 DLLGRVALIPEDDRFVLNQRVALLRPNTTADPQFLFSYINAHQY--YFKAQGAGMSQLNI 359 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 V+ VP I+EQ I + ++D + ++ + LLKE++ F+ Sbjct: 360 SKGSVENFISFVPIIEEQKKIGSF----FKQLDETIALHQRKLDLLKEQKKGFLQK 411 >gi|10956235|ref|NP_062461.1| hypothetical protein pCI305_p3 [Lactococcus lactis subsp. lactis] gi|9294803|gb|AAF86681.1| HsdS [Lactococcus lactis subsp. lactis] Length = 402 Score = 129 bits (325), Expect = 6e-28, Method: Composition-based stats. Identities = 54/412 (13%), Positives = 134/412 (32%), Gaps = 41/412 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGK 62 +P+ W+ + T + G + + DI ++ + DV G+ Sbjct: 11 KVPELRFPGFTDDWEERKLGSLTTVVRGASPRPIQDPKWFDKESDIGWLRIADVTEQNGR 70 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + + + + +L + + G+ + L P E Sbjct: 71 IYHLEQHISKLGQEKTRVLTEPHLLLSIAATVGKPVVNYVKTGVHDGFLIFLNPTF---E 127 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + + + + + + N + +P EQ I ++D Sbjct: 128 REFMFQWLEMFRPKWQKYGQPGSQVNLNSELVRNQEIVLPNYKEQQKIGSF----FKQLD 183 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 IT R ++LLKE+K+ + + K +++ +G + + + Sbjct: 184 NTITLHQRKLDLLKEQKKGYLQKMFPKNGAKVPELRFAG-------FADDWEERKLSDIV 236 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 K++ E + + +++ K + + I++ + Sbjct: 237 SRLSKSSNNSELPRVEFEDIVSGEGRLNKDVSHKFDD-RKGILFSSQNILYGKLRPYLKN 295 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 L +GI + K D ++ L++S KV ++ V Sbjct: 296 WLLADF----KGIALGDFWVFKSINSDPKFVYSLIQSNHYQKVANDTSGTKMPRSDWKKV 351 Query: 363 KRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P ++EQ I + ++D + ++ + LLKE++ F+ Sbjct: 352 SSTEFQIPSSLEEQKKIGSF----FKKLDDTIALHQRKLDLLKEQKKGFLQK 399 >gi|322369761|ref|ZP_08044324.1| restriction modification system DNA specificity domain protein [Haladaptatus paucihalophilus DX253] gi|320550679|gb|EFW92330.1| restriction modification system DNA specificity domain protein [Haladaptatus paucihalophilus DX253] Length = 437 Score = 129 bits (325), Expect = 6e-28, Method: Composition-based stats. Identities = 72/422 (17%), Positives = 143/422 (33%), Gaps = 35/422 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP+ W VP + +LN Y+ ++ V+ + R+ D T + Sbjct: 26 IPEEWDAVPFEEAIELNPRYDKPDNGPFNYLPMDAVDEDKQTI--EYWTEREKDDCTTTW 83 Query: 81 FAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQGWL--LSID 132 F G +Y K+ P I G ST+FLV P+ + + + + Sbjct: 84 FKNGDTVYAKITPCTENGKIAFINGLETEVGSGSTEFLVFHPRKGVTDEQFVYYLSNLPE 143 Query: 133 VTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 ++ EG+T G + +P+P L EQ I + + +D I + Sbjct: 144 FRSVTISLMEGSTGRQRVPSDVFKGGLQIPLPSLPEQRRIADIL----STVDERIQQTDV 199 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 IE E + + T G ++ G + P W++ P T+ Sbjct: 200 IIEKTNELLSGVQKDLFTTG---YSDDREVGTRRLIEAPLDWDIAPLSEFTTDSAYGPRF 256 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPE---------SYETYQIVDPGEIVFRFIDLQND 301 + + + + + G+ E S ++ G+ + Sbjct: 257 SSDEYDENGALATLRTTDLNDDGGINHETMPLADLDPSDFEDHLLKKGDFIISRTGAY-C 315 Query: 302 KRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKF 359 + + + + G++ +L + S K + G +++L Sbjct: 316 GICTIWDDYEIPTVPGAYMIRFRLDDGLNPLFLREYVNSSVGSKKVDVLARGSSQKNLAG 375 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 D+ +P+ VP EQ I VI RI E ++ L++ + + +TG++ Sbjct: 376 SDLLSMPIPVPSRTEQDRIVEVIQAVKKRIQNEREYKQK----LQDLKRGLMQDLLTGKV 431 Query: 420 DL 421 + Sbjct: 432 RV 433 >gi|304320735|ref|YP_003854378.1| type I restriction-modification system specificity subunit [Parvularcula bermudensis HTCC2503] gi|303299637|gb|ADM09236.1| type I restriction-modification system specificity subunit [Parvularcula bermudensis HTCC2503] Length = 399 Score = 129 bits (325), Expect = 7e-28, Method: Composition-based stats. Identities = 58/416 (13%), Positives = 145/416 (34%), Gaps = 31/416 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ WK+ + + + ++ E + + + +Y + D + Sbjct: 2 VPEGWKMESLGNWIEAYREKSVEKDQYPVLTSSREGLIPQSEYYGE-SRITSRDNVGFHV 60 Query: 81 FAKGQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 Y G + GI S + V +R Sbjct: 61 IPPQFFTYRSRSDDGLFFFNRNDTGQTGIISHFYPVFDFPKGNS--DFFLAALNFWRKRF 118 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G++ + ++ +PIPP Q I + + D I + I + Sbjct: 119 AGYAVGSSQVVLSLNALKSVKLPIPPKHVQDEIADIL----TSWDRAIKTTEKLIANSQA 174 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +K++L+ ++T + S + ++ + +++E+ + Sbjct: 175 QKKSLMQQLLT---GKKRLPRFSDV------WREVQLGELGDFIKGKGIPRDEVVETGLP 225 Query: 258 SLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM-E 312 ++ YG I +ET + E+ ++ G+I+F D+ A + + Sbjct: 226 AIRYGEIYTTHHFIVETFASFISEEAAAQSVPLNNGDILFTCSGETADEIGKCVAYLGND 285 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 R + + HG + +L + + S ++ + G G + ++ ++ +++P Sbjct: 286 RSFAGGDIILFREHGQCAHFLGYALNSSEVVRQKTRFGQGNSVVHINARNLSQITLMLPS 345 Query: 372 IKEQFDITNVINVETARIDVL-VEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 ++EQ I ++++ I L +E I R++ + +TG+ ++ E + Sbjct: 346 LEEQEAIADILDTARRDIRQLEIELQNLQIE-----RAALMQQLLTGKRRVKVEKE 396 >gi|307264084|ref|ZP_07545681.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|306870562|gb|EFN02309.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 510 Score = 129 bits (324), Expect = 7e-28, Method: Composition-based stats. Identities = 68/441 (15%), Positives = 132/441 (29%), Gaps = 71/441 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPK---DGN 69 IPK W V + ++ G T ++ +D I +I D++ +GKY+ K + Sbjct: 70 EIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYISKGNRNIT 129 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 +S+ + +K I+Y P I + + + F + + + + Sbjct: 130 ENGLRSSSTRLLSKNSIVYSSRAPI-GYIAITETELCTNQGFKSIDLYNKE-IVDYLYYS 187 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I T I++ G T GN +P+PPL EQ I KI I+ + Sbjct: 188 LIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQYAEKEE 247 Query: 190 RFIELLKEKK----QALVSYIVTKGLNPDVKM---------------------------- 217 + L ++ ++++ + L Sbjct: 248 KLTALHQQFPEQLKKSILQAAIQGKLTKQDPNDEPALVLIERIKAEKLRLIAEKKLKKPK 307 Query: 218 --------------------KDSGIEWVGLVPDHWEVKPFFALVTE------LNRKNTKL 251 + E +P++W + + Sbjct: 308 VVSEIILRDNLPYEIINGEERCIADEVPFEIPENWCWVRLGEIGNWGAGATPNRHEPKYY 367 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I L G++ + T E V + I + + Sbjct: 368 ENGTIPWLKTGDLNDGIITEIPEYITELAIEKTSVKLNPVGSVLIAMYGATIGKLGILNI 427 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 E + + GI + YL + + S + GSG + ++ E + +PP Sbjct: 428 EATTNQACCACIPYTGIYNKYLFYYLMSQKTELQKRSEGSG-QPNISKEKIVNYLFPLPP 486 Query: 372 IKEQFDITNVINVETARIDVL 392 + EQ I I + + L Sbjct: 487 LNEQKCIVEKIETLFSTLQNL 507 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 62/211 (29%), Gaps = 13/211 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S ++ +P W L + K E + + I + + + K S Sbjct: 63 SQQDFSFEIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYIS 122 Query: 280 YETYQIVDPGE-------IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 I + G + I + A + ++ + + Sbjct: 123 KGNRNITENGLRSSSTRLLSKNSIVYSSRAPIGYIAITETELCTNQGFKSIDLYNKEIVD 182 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + Y ++ + + + +PP+ EQ I I I+ Sbjct: 183 YLYYSLIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQY 242 Query: 393 VEKIEQSIVLL-----KERRSSFIAAAVTGQ 418 + E+ + L ++ + S + AA+ G+ Sbjct: 243 -AEKEEKLTALHQQFPEQLKKSILQAAIQGK 272 >gi|256958288|ref|ZP_05562459.1| type I R/M system specificity subunit [Enterococcus faecalis DS5] gi|256948784|gb|EEU65416.1| type I R/M system specificity subunit [Enterococcus faecalis DS5] Length = 406 Score = 129 bits (324), Expect = 8e-28, Method: Composition-based stats. Identities = 52/405 (12%), Positives = 136/405 (33%), Gaps = 32/405 (7%) Query: 25 WKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W++ + ++ G + ++ D+ ++ + DV G+ + ++ Sbjct: 13 WELCKLGTLAEIVRGASPRPIQDSKWFDNTSDVGWLRISDVTEQNGRIYKLEQKLSKAGQ 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + K +L + + G+ + L P L + + T Sbjct: 73 EKTRVLRKPHLLLSIAATVGKPVVNYVNTGVHDGFLIFLNP---LFDREFMFQWLEMFTP 129 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + + + + + + N + +P EQ EKI +D IT R ++ L Sbjct: 130 KWQKYGQPGSQLNLNSELVRNQELRMPSTNEQ----EKIGMLFKYLDDTITLHQRKLDQL 185 Query: 196 KEKKQALVSYIV---TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 K+ K+A + + N K++ + E + +V + N + Sbjct: 186 KKLKKAYLHAMFVSMNTKKNKVPKLRFTDFEGDWELCKLGQVANYRRGSFPQPYGNKEWY 245 Query: 253 E-----SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + + G+ ++ +E + + V G++V + Sbjct: 246 DGENSMPFVQVVDVGDNLRLVEDTKQKISELAQPKSVFVKEGKVVVTLQGSIGRVAITQY 305 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 ++R ++ +D Y A++++ G +++ E + + Sbjct: 306 PAYVDRTLL---IFESYKAEMDEYYFAYVIQQL-FEYEKTRAPGGTIKTVTKEALSDFTI 361 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 P I+EQ + ++D + + + L E + S++ Sbjct: 362 SFPSIEEQKK----LGKFFEQLDDTITLHQNKLEQLNELKKSYLQ 402 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 22/193 (11%), Positives = 59/193 (30%), Gaps = 14/193 (7%) Query: 24 HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W++ + + G + + ++ + DV + Sbjct: 218 DWELCKLGQVANYRRGSFPQPYGNKEWYDGENSMPFVQVVDVGDNLRLVEDTKQKISELA 277 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 +G+++ G R I + L+ + + + + Sbjct: 278 QPKSVFVKEGKVVVTLQGSIGR-VAITQYPAYVDRTLLIFESYKAEMDEYYFAYVIQQLF 336 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G T+ + + + + P + EQ +K+ ++D IT +E Sbjct: 337 EYEKTRAPGGTIKTVTKEALSDFTISFPSIEEQ----KKLGKFFEQLDDTITLHQNKLEQ 392 Query: 195 LKEKKQALVSYIV 207 L E K++ + + Sbjct: 393 LNELKKSYLQNMF 405 >gi|10956231|ref|NP_053053.1| specificity determinant HsdS [Lactococcus lactis] gi|5453329|gb|AAD43536.1| specificity determinant HsdS [Lactococcus lactis] Length = 410 Score = 129 bits (324), Expect = 8e-28, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 148/416 (35%), Gaps = 41/416 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGK 62 +P+ W+ + T + G + + DI ++ + DV G+ Sbjct: 11 KVPELRFPGFTDDWEERKLGSLTTVVRGASPRPIQDPKWFDKESDIGWLRIADVTEQNGR 70 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + + + + +L + + G+ + L P E Sbjct: 71 IYHLEQHISKLGQEKTRVLTEPHLLLSIAATVGKPVVNYVKTGVHDGFLIFLNPTF---E 127 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + + + + + + N + +P EQ I ++D Sbjct: 128 REFMFQWLEMFRPKWQKYGQPGSQVNLNSELVRNQEIVLPNYKEQQKIGSF----FKQLD 183 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 IT R ++LLKE+K+ + + K +++ +G D WE + ++ Sbjct: 184 NTITLHQRKLDLLKEQKKGYLQKMFPKNGAKVPELRFAG------FADDWEERKLSSMTN 237 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQIVDPGEIVF--RFIDLQ 299 N K+ + +S L N+ + + + E + ++V + Sbjct: 238 YKNGKSHEDKQSTSGKLELINLNSISISGGLKHSGKFIDEADDTLQKDDLVMILSDVGHG 297 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL-RQSL 357 + + +R ++ ++P+ D +L + ++ F A G+G+ + ++ Sbjct: 298 DLLGRVALIPEDDRFVLNQRVALLRPNTTADPQFLFSYINAHQY--YFKAQGAGMSQLNI 355 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 V+ VP I+EQ I + ++D + ++ + LLKE++ F+ Sbjct: 356 SKGSVENFISFVPIIEEQKKIGSF----FKQLDETIALHQRKLDLLKEQKKGFLQK 407 >gi|88809186|ref|ZP_01124695.1| type I restriction-modification system, S subunit [Synechococcus sp. WH 7805] gi|88787128|gb|EAR18286.1| type I restriction-modification system, S subunit [Synechococcus sp. WH 7805] Length = 405 Score = 129 bits (324), Expect = 8e-28, Method: Composition-based stats. Identities = 66/417 (15%), Positives = 144/417 (34%), Gaps = 34/417 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 + W + + F L+ G T ++ DI ++ +V + R Sbjct: 3 ESWSKLRVGDFCNLSAGGTPDTNNPDYWEGGDIPWMSSGEVHDQRIRRTRSHITERGLQD 62 Query: 76 STVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+ F G +L G K I++ + + + + E + Sbjct: 63 SSAKFFPIGSVLVALAGQGKTRGKVAISEIELTTNQSIAAIIADKGVCEPDFLFYNLDSR 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + G+ + + + ++ + +PPL EQ I E + +I L + + I Sbjct: 123 YEELRTLSGGSGRAGLNLSILSDVEISLPPLPEQKKIAEILSGVDKQIYALENKISKLIS 182 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 E + L S G N V K+S + ++ V + + E Sbjct: 183 TKTEIFRDLFSCFDELGGN-GVCKKESDTKI-------MPLESVCEAVIDCKNRTPPYTE 234 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQI------VDPGEIVFRFIDLQNDKRSLRS 307 S + N+ RN LK +Y+I P +++F + + Sbjct: 235 SGHPVVRTPNVRNGKLVRN-DLKYTDISSYEIWTARSVPRPMDVLFTREAPLGEVCLVPE 293 Query: 308 AQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKR 364 + + M + ID YL + + S + + G + DV+ Sbjct: 294 NF---KCCLGQRMMLFRADKSLIDPRYLLFSLMSPFVQDQLLKSKGGTTVGHARVADVRD 350 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L + + P ++Q I + + I+ +E + + L+ ++S+ + ++G+ + Sbjct: 351 LLIPIVPKEKQLRIAS----VFSSIETFLEGVTRKKEKLEIQKSALASDLLSGRKRV 403 >gi|323344379|ref|ZP_08084604.1| type I site-specific deoxyribonuclease [Prevotella oralis ATCC 33269] gi|323094506|gb|EFZ37082.1| type I site-specific deoxyribonuclease [Prevotella oralis ATCC 33269] Length = 418 Score = 129 bits (324), Expect = 8e-28, Method: Composition-based stats. Identities = 64/433 (14%), Positives = 131/433 (30%), Gaps = 36/433 (8%) Query: 1 MKHYK-----AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLED 55 MK K +P++K SG WK +K + +E+ +++ I + Sbjct: 1 MKELKHIPNIRFPEFKKSG---------EWKPKCLKSLFDRVKTKNNENNSNVLTISAQY 51 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD-----FDGICSTQ 110 ++ K +++ D S + +KG Y K + G+ ST Sbjct: 52 GLVNQIEFFSKSVSAK--DISGYYLLSKGDFAYNKSRSIGYPFGVVRRLKKYEKGVVSTL 109 Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAIC-----EGATMSHADWKGIGNIPMPIPPLA 165 ++ + KD + + D+ Q+ + + + + P Sbjct: 110 YMCFRAKDHRNTEFYEYYFNTDIFQKRVGKIAQEGARSHGLLNISTESFLQLEFLFPSSI 169 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 EQ I E + + I+ + K Q L+ + + P ++ G Sbjct: 170 EQKKIAECLSSLDDYINATQEKIDLLQAHKKGLMQQLLPAL--GKIMPQKRLPKFGKSKK 227 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL---ETRNMGLKPESYET 282 E+ T I +I + + E+ + Sbjct: 228 WSPYSMEEMFKIRNGYTPSKSNPKFWENGTIPWFRMEDIREHGHILSDSIQHITKEAVKG 287 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + I+ + + + + + ID Y + M D Sbjct: 288 KGLFPANSIIVATTATIGEHALIIVDSLANQRFTFLTKRKSFDNQIDMKYFYYYMYIID- 346 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 +G S+ KRL V +P +EQ +I ID L++ +Q +V+ Sbjct: 347 EWCKQHTNAGGFASVDMNGFKRLSVSLPSPEEQKEIAE----CFTSIDDLIDSTKQKLVM 402 Query: 403 LKERRSSFIAAAV 415 L+ + + Sbjct: 403 LQNHKRGLMQQLF 415 >gi|225850846|ref|YP_002731080.1| type I restriction-modification system specificty subunit [Persephonella marina EX-H1] gi|225645153|gb|ACO03339.1| type I restriction-modification system specificty subunit [Persephonella marina EX-H1] Length = 448 Score = 129 bits (324), Expect = 9e-28, Method: Composition-based stats. Identities = 75/432 (17%), Positives = 158/432 (36%), Gaps = 38/432 (8%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNT--GRTSESGKDI--IY---IGLEDVESGTGK 62 YK + IG IP+ W+V + +K+ G + KDI + I L + GK Sbjct: 9 YKKTE---IGIIPEDWEVKRLGEVSKIVGRIGFRGYTKKDIVKPWRGAISLSPINIVDGK 65 Query: 63 YLPKD----GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVL 114 K + + + S G I++ K G L K I + I ++ Sbjct: 66 LNLKSNLTFVSWNKYEESPEIKIKTGDIIFVKTGSTLGKVAIIEKVVFPTTINPQLVIIK 125 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 K + + +L S + + +G + + N+ +P+PPL EQ I + + Sbjct: 126 VFKRTNNKFINFYLNSFTFKNLLNKVLDGQAIPTLSQYQLSNLLLPLPPLPEQKDIAKVL 185 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVS--YIVTKGLNPDVKMKDSGIEWVGLVPDHW 232 I++L + + K Q L++ + VKMK + + Sbjct: 186 SDIDNLIESLDKLIEKKKLIKKGAMQELLTGKKRLQGFKGKWVKMKLGEVFDIKRGASPR 245 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 ++ + I + L + + E + +V+ +++ Sbjct: 246 PIEK--------YITKKSNGINWIKISDVKPEDKYLVKTEIKITQEGAKQSVVVNYNDLI 297 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + ++ I + + + + +L+ SY + K F + +G Sbjct: 298 LS----NSMSYGRPYISKIKGCIHDGWLLLKRKGKQNIEFFYYLLSSYKVQKSFDLLAAG 353 Query: 353 -LRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +LK + VK L + +PP ++EQ I +++ A I+ L ++ ++ + Sbjct: 354 SGVNNLKIDSVKELSIYIPPTLEEQQAIAKILSDMDAEIEAL----KKKKEKYEQIKKGA 409 Query: 411 IAAAVTGQIDLR 422 + +TG++ L+ Sbjct: 410 MELLLTGKVRLK 421 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 65/168 (38%), Gaps = 5/168 (2%) Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 +++ G + K + + G+I+F K ++ V I Sbjct: 59 INIVDGKLNLKSNLTFVSWNKYEESPEIKIKTGDIIFVKTGSTLGKVAIIEKVVFPTTIN 118 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQ 375 + ++ ++ + + S+ + + G +L + L + +PP+ EQ Sbjct: 119 PQLVIIKVFKRTNNKFINFYLNSFTFKNLLNKVLDGQAIPTLSQYQLSNLLLPLPPLPEQ 178 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 DI V++ ID L+E +++ I K + + +TG+ L+G Sbjct: 179 KDIAKVLSD----IDNLIESLDKLIEKKKLIKKGAMQELLTGKKRLQG 222 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 73/208 (35%), Gaps = 9/208 (4%) Query: 25 WKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W + + + G + + I +I + DV+ + + Q Sbjct: 227 WVKMKLGEVFDIKRGASPRPIEKYITKKSNGINWIKISDVKPEDKYLVKTEIKITQEGAK 286 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + ++ Y R I I L+ + E L S V + Sbjct: 287 QSVVVNYNDLILSNSMSYGRPYISKIKGCIHDGWLLLKRKGKQNIEFFYYLLSSYKVQKS 346 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + G+ +++ + + + IPP L EQ I + + I+ L ++ ++ ++ Sbjct: 347 FDLLAAGSGVNNLKIDSVKELSIYIPPTLEEQQAIAKILSDMDAEIEALKKKKEKYEQIK 406 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIE 223 K + L++ V + + K++ IE Sbjct: 407 KGAMELLLTGKVRLKTINNGEGKNNDIE 434 >gi|13249034|gb|AAK16650.1|AF142640_3 type I R/M system specificity subunit [Lactococcus lactis subsp. cremoris] Length = 410 Score = 129 bits (324), Expect = 9e-28, Method: Composition-based stats. Identities = 54/413 (13%), Positives = 144/413 (34%), Gaps = 35/413 (8%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGK 62 +P+ W+ + T + G + + DI ++ + DV G+ Sbjct: 11 KVPELRFPGFTDDWEERKLGSLTTVVRGASPRPIQDPKWFDKESDIGWLRIADVTEQNGR 70 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + + + + +L + + G+ + L P E Sbjct: 71 IYHLEQHISKLGQEKTRVLTEPHLLLSIAATVGKPVVNYVKTGVHDGFLIFLNPTF---E 127 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + + + + + + N + +P EQ I ++D Sbjct: 128 REFMFQWLEMFRPKWQKYGQPGSQVNLNSELVRNQEIVLPNYKEQQKIGSF----FKQLD 183 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 IT R ++LLKE+K+ + + K +++ +G + ++ Sbjct: 184 NTITLHQRKLDLLKEQKKGYLQKMFPKNGAKVPELRFAG---FADDWEERKLSSMTNYKN 240 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + ++ + + ++ I ++ G + + D ++ + + Sbjct: 241 GKSHEDKQSTSGKLELINLNAISISGGLKHSGKFIDEADDTLQKDDLVMILSDVGHGDLL 300 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + +R ++ ++P+ D +L + ++ F A G+G+ + ++ Sbjct: 301 GRVALIPEDDRFVLNQRVALLRPNTTADPQFLFSYINAHQY--YFKAQGAGMSQLNISKG 358 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 V+ VP I+EQ I + ++D + ++ + LLKE++ F+ Sbjct: 359 SVENFISFVPIIEEQKKIGSF----FKQLDETIALHQRKLDLLKEQKKGFLQK 407 >gi|134046197|ref|YP_001097682.1| restriction modification system DNA specificity subunit [Methanococcus maripaludis C5] gi|132663822|gb|ABO35468.1| restriction modification system DNA specificity domain [Methanococcus maripaludis C5] Length = 402 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 54/414 (13%), Positives = 130/414 (31%), Gaps = 29/414 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNS 70 I +P W+V + ++ G T K I ++ + D++ K + Sbjct: 2 IDNLPDGWEVKKLGDIGNISAGGTPSRSKPEYWNNGSIPWVKIADMKEKHVKNTSEFITE 61 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + S+ IF KG IL + L I D D + + + Sbjct: 62 EGLNKSSAKIFKKGTILIS-IFASLGTVGILDIDASTNQAIAGINVNSKKVIPEYLYYYL 120 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + G ++ + + + + +PPL Q I E + +I+ I R + Sbjct: 121 KSLKNYFMGAGRGVAQNNINLSILKDTEIFVPPLETQQKIVEIL----EKIEYGINLREK 176 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 I + +A+ + +P ++ +G + + + K Sbjct: 177 AILETENLVKAV---FLDMFGDPVSNPMGWDVKKIG----TFVNDIISGWSVGGDERPKK 229 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 E +L +S + + + + E + G+++F + + ++ Sbjct: 230 ADELAVLKISSVTSGKFKSSEHKVVNSEITKKLVHPLKGDLLFSRANTRELVAAVCIVDN 289 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGSGL---RQSLKFEDVKR 364 + + + + ++ +G ++ + Sbjct: 290 DYMDLFLPDKLWKIILNKNIVSSYYFRQVLQDPTYRANLTKKATGTSGSMLNISKSKLIE 349 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PPI Q +I +++ + EK E S +++ + + A G+ Sbjct: 350 NEFPIPPIGLQNKFAKIIE----KLEEIKEKQENSKKEMEDLFNLSLQKAFKGE 399 >gi|262369031|ref|ZP_06062360.1| type I restriction-modification system protein [Acinetobacter johnsonii SH046] gi|262316709|gb|EEY97747.1| type I restriction-modification system protein [Acinetobacter johnsonii SH046] Length = 412 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 58/402 (14%), Positives = 120/402 (29%), Gaps = 18/402 (4%) Query: 24 HWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W I T+ T + + YI +++ + S+ Sbjct: 14 DWSRYKIAEVTEYLVDGTHFSPKTTEGEFKYITSKNIRNDGLDLTNISYISKDEHEKIYK 73 Query: 80 --IFAKGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G IL K G + +F + S L + + L S Sbjct: 74 RCKVQLGDILLTKDGANTGNCCLNTLDEEFSLLSSVAVLRGKKDSFNNNFILQILQSDLG 133 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I + G ++ + + P L EQ I + A +I L + + Sbjct: 134 QDTIISSMSGQAITRITLAKLKDYSFFFPELTEQTQITSFLSAVDEKISQLTQKHELLSQ 193 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + Q L S + + + + G VG + + PF + E+ + Sbjct: 194 YKQGMMQKLFSQQIRFKADDGSEFGEWGKAKVGNITETIFGYPFDSK--EMVEDTNGIPL 251 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL-QNDKRSLRSAQVME 312 +++ +I E LK S V +IV + + + Sbjct: 252 MRGINIGECHIRHSFELDRFFLKDTSKLEKYFVRVNDIVLSMDGSKVGRNSAFVTEKDAG 311 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPP 371 ++ + + + Y+ + S + + S + + ++ + P Sbjct: 312 SLLVQRVCILREKANTNIQYVYQWIISKEFHRYVDQVKTSSGIPHISGKQIQDYEISYPC 371 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++EQ I N ++ ID +E + Q I K + + Sbjct: 372 LEEQTKIANFLSA----IDQKIEVVAQQIEQAKTWKKGLLQQ 409 Score = 83.3 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 25/209 (11%), Positives = 62/209 (29%), Gaps = 5/209 (2%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ K+ +W + ++ + + Sbjct: 4 PKLRFKEFDGDWSRYKIAEVTEYLVDGTHFSPKTTEGEFKYITSKNIRNDGLDLTNISYI 63 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + E V G+I+ L + + + A + K ++ + Sbjct: 64 SKDEHEKIYKRCKVQLGDILLTKDGANTGNCCLNTLDEEFSLLSSVAVLRGKKDSFNNNF 123 Query: 333 LAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + +++S + +M + +K P + EQ IT+ ++ +I Sbjct: 124 ILQILQSDLGQDTIISSMSGQAITRITLAKLKDYSFFFPELTEQTQITSFLSAVDEKISQ 183 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQID 420 L +K LL + + + + QI Sbjct: 184 LTQKH----ELLSQYKQGMMQKLFSQQIR 208 >gi|209526228|ref|ZP_03274758.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] gi|209493325|gb|EDZ93650.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] Length = 493 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 69/457 (15%), Positives = 148/457 (32%), Gaps = 63/457 (13%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W + ++L G++ + G ++ + + ++ Sbjct: 3 ELPKGWAETKLGEISQLEMGQSPPGTATNSDAKGIPLIGGASDFVGEQIKPNRFTSAPTK 62 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I ++ + + K +A+ ++P++V + L+ L+ ++A Sbjct: 63 ICQPNDLILC-VRATIGKLAVAESAYCLGRGVAGIRPRNVNQDWLRYRLIGDA--SALDA 119 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+T D + + + + +PPL EQ I K+ R E R L++ K Sbjct: 120 AGTGSTFRQIDKQTLVSWNINLPPLNEQRRIVAKLDRLFARSRCAREELGRVSRLVQRYK 179 Query: 200 QALVSYIVTKGLNPDVKMKDSGI-------------------EWVGLV------------ 228 QA+++ L D + ++ + E Sbjct: 180 QAVLAAAFRGDLTADWRAENPDVEPASELLRQILIRRKQRYNEKYNESKLKNKKKPRKDF 239 Query: 229 ---------------PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII-------- 265 P W V L + + + G I Sbjct: 240 VDQIPSIQSEVEISLPKTWAVTNIDYLAHVTKLAGFEYTKHFKTNDVAGIPIIRAQNVQM 299 Query: 266 QKLETRNMGLKPESYETY---QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 K N+ E Y + E++ FI L + R + Sbjct: 300 GKFIETNIKYISEDVSNYLERSQLHGREVLMVFIGAGTGNVCLAPQER--RWHLAPNVAK 357 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + I S YL ++S + + S + SL E ++++ V + P++EQ +I Sbjct: 358 IDVDEISSNYLCLYLQSSIGQNYVDSWIKSTAQPSLSMETIRKIIVFLSPLEEQKEIVRR 417 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + ID++ ++ +++ LL + ++ A G+ Sbjct: 418 VEKLFKAIDLIEQEHQKASKLLDRLEKATLSKAFRGE 454 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 75/218 (34%), Gaps = 17/218 (7%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNT--G----RTSESGK--DIIYIGLEDVESGTGKY 63 S V+ ++PK W V I + G + ++ I I ++V+ G K+ Sbjct: 247 QSEVEI--SLPKTWAVTNIDYLAHVTKLAGFEYTKHFKTNDVAGIPIIRAQNVQMG--KF 302 Query: 64 LPKDGNSRQSDTSTV---SIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKD 118 + + D S S ++L +G + + + + + Sbjct: 303 IETNIKYISEDVSNYLERSQLHGREVLMVFIGAGTGNVCLAPQERRWHLAPNVAKIDVDE 362 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + L +L S +++ + + I I + + PL EQ I ++ Sbjct: 363 ISSNYLCLYLQSSIGQNYVDSWIKSTAQPSLSMETIRKIIVFLSPLEEQKEIVRRVEKLF 422 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 ID + E + +LL ++A +S L P Sbjct: 423 KAIDLIEQEHQKASKLLDRLEKATLSKAFRGELVPQDP 460 >gi|283853811|ref|ZP_06371033.1| restriction modification system DNA specificity domain protein [Desulfovibrio sp. FW1012B] gi|283570798|gb|EFC18836.1| restriction modification system DNA specificity domain protein [Desulfovibrio sp. FW1012B] Length = 543 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 62/409 (15%), Positives = 134/409 (32%), Gaps = 25/409 (6%) Query: 25 WKVVPIKRFTKLNTGR--TSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W I + G T ++ I++G+ + G + + Sbjct: 133 WNTKGIGEVADIFDGPHATPKTVDTGPIFLGIGALNDGMINLRETRHVTENDFKTWTRRV 192 Query: 82 AK--GQILYGKLGPYLRKAIIADFDGICST---QFLVLQPKDVLPELLQGWLLSIDVTQR 136 G +++ + AII D C + + +V+P+ +S Sbjct: 193 RPQAGDVVFSYETRLGQAAIIPDNIDCCLGRRMGLVRFKTNEVIPKFFLYQYISPSYRNF 252 Query: 137 IEAI-CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +++ GAT+ K P+ IP + EQ I + IDT I + I Sbjct: 253 LDSKTIRGATVDRISIKEFPFFPIAIPSIEEQKRIVSILDDAFECIDTAIANTEKNIANA 312 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 +E ++ + + + + + I + P + P + + Sbjct: 313 RELFESYLDRVFAEKGDGWEEKNLEDI--LSFQPRNGWSPPASHH-------SDRGTPVL 363 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 LS G +K + + Y V+ G+++ + + + Sbjct: 364 TLSSVTGFQFKKEALKYTSAQVNPKAHYW-VENGDLLMTRSNTPELVGHVAVCDGVSANT 422 Query: 316 ITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAMGSG---LRQSLKFEDVKRLPVLV 369 I + H + ++ + +RS L + +G + +K V+ LP+ + Sbjct: 423 IYPDLIMKMKVDKHIALTEFVYFQLRSSKLRNIIKDGATGANPTMKKVKKSTVQNLPLAM 482 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P + Q I + + +LV+K + L + S + A +G+ Sbjct: 483 PALPVQQAIVDNLRNLNETSRLLVKKCVSKVKALTRLKQSLLQKAFSGE 531 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 51/155 (32%), Gaps = 9/155 (5%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDSTYL 333 +++ G++VF + + + + K + + + Sbjct: 184 DFKTWTRRVRPQAGDVVFSYETRLGQAAIIPDNI---DCCLGRRMGLVRFKTNEVIPKFF 240 Query: 334 AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + S + + ++ P+ +P I+EQ I ++++ ID Sbjct: 241 LYQYISPSYRNFLDSKTIRGATVDRISIKEFPFFPIAIPSIEEQKRIVSILDDAFECIDT 300 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + E++I +E S++ + D G + Sbjct: 301 AIANTEKNIANARELFESYLDRVFAEKGD--GWEE 333 >gi|154150575|ref|YP_001404193.1| restriction modification system DNA specificity subunit [Candidatus Methanoregula boonei 6A8] gi|153999127|gb|ABS55550.1| restriction modification system DNA specificity domain [Methanoregula boonei 6A8] Length = 457 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 74/442 (16%), Positives = 145/442 (32%), Gaps = 54/442 (12%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 I W+ VP+ + K+ G +S K I + D+ + K Sbjct: 21 IDPSWERVPLGKIAKVLNGFAFKSELFNDKKGTPLIRIRDIGNN------KTECYYDGVF 74 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + G +L G G + + + + ++ + + Sbjct: 75 DEAYVIHPGDLLVGMDGDFNCST-WRGPKALLNQRVCKIEVNIEQYNRKFLEYVLPGYLK 133 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I T+ H + I I +P PPL EQ I ++ A ++ R ++ Sbjct: 134 AINENTSSQTVKHLSSRSISEILLPNPPLTEQQRIVARVEALLSHVNAARERLSRVPLIM 193 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEW---------------------------VGLV 228 K+ +QA+++ + GL + ++ IE + + Sbjct: 194 KKFRQAVLAAACSGGLTEGWRKENPDIEEANKLVKRLESIRKQFKIREISSIDNLELSDL 253 Query: 229 PDHWEVKPFFAL--VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 PD W + V + + K K + I+ +S + + + K S E + + Sbjct: 254 PDSWTWIRLANIAIVMDPDHKMPKSSDGGIIFISPKDFKENYQIDMTKTKRISDEEFLRL 313 Query: 287 DPG------EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 +I++ I K + + A + +S YL WL+ S Sbjct: 314 SKKFVPRPLDILYSRIGADLGKARKAPQDIKFHISYSLAVIRQLGEMENSDYLFWLLNSM 373 Query: 341 DLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR---IDVLVEKI 396 + F + S L D+ + +PP+ EQ++I + + R ID VE Sbjct: 374 FIRNQAFENVRSIGVPDLGLRDIDNFIIPLPPLAEQYEIVRRVGLLFERADAIDREVEAA 433 Query: 397 EQSIVLLKERRSSFIAAAVTGQ 418 + L + + A G+ Sbjct: 434 TRRCERLT---QAVLGKAFRGE 452 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 66/206 (32%), Gaps = 12/206 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 + +P W + + + S II+I +D + + K Sbjct: 250 LSDLPDSWTWIRLANIA-IVMDPDHKMPKSSDGGIIFISPKDFKENYQIDMTKTKRISDE 308 Query: 74 DT---STVSIFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQG 126 + S + ILY ++G L KA F S + + + L Sbjct: 309 EFLRLSKKFVPRPLDILYSRIGADLGKARKAPQDIKFHISYSLAVIRQLGEMENSDYLFW 368 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L S+ + + + + I N +P+PPLAEQ I ++ R D + Sbjct: 369 LLNSMFIRNQAFENVRSIGVPDLGLRDIDNFIIPLPPLAEQYEIVRRVGLLFERADAIDR 428 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLN 212 E + QA++ L Sbjct: 429 EVEAATRRCERLTQAVLGKAFRGELT 454 >gi|135207|sp|P06991|T1SD_ECOLX RecName: Full=Type-1 restriction enzyme EcoDI specificity protein; Short=S.EcoDI; AltName: Full=Type I restriction enzyme EcoDI specificity protein; Short=S protein gi|41744|emb|CAA23553.1| hsdS [Escherichia coli] Length = 444 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 67/422 (15%), Positives = 152/422 (36%), Gaps = 46/422 (10%) Query: 19 GAIPKHWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 G +P WK V + KL+TG+ +++ + + NS D Sbjct: 4 GKLPVDWKTVELGELIKLSTGKLDANAADNDGQYPFFTCAE--------SVSQINSWAFD 55 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 TS V + G I + G + + +L + + L Sbjct: 56 TSAVLLAGNGSF------------SIKKYTGKFNAYQRTYVIEPILIKTEFLYWLLRGNI 103 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++I G+T+ + I +I + +P +EQ LI EK+ ++++ + ++ Sbjct: 104 KKITENGRGSTIPYIRKGDITDISVALPSPSEQTLIAEKLDTLLAQVESTKARLEQIPQI 163 Query: 195 LKEKKQALVSYIVTKGLNPDVK---------------MKDSGIEWVGLVPDHWEVKPFFA 239 LK +QA++++ + L + + +K + + +P++W F Sbjct: 164 LKRFRQAVLTFAMNGELTKEWRSQNNNPAFFPAEKNSLKQFRNKELPSIPNNWSWMRFDQ 223 Query: 240 LVTELNRKNTKLIESNILSLSYGN---IIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + ++ + L N + L+ + K L+ G+I++ I Sbjct: 224 VADIASKLKSPLDYPNTIHLAPNHIESWTGKASGYQTILEDGVTSAKHEFYTGQIIYSKI 283 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 K ++ + G+ ++ + W++ + A + Sbjct: 284 RPYLCKVTIATFD----GMCSADMYPINSKIDTHFLFRWMLTNTFTDWASNAESRTVLPK 339 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + +D+ +PV PP+ EQ +I + A D + +++ ++ + S +A A Sbjct: 340 INQKDLSEIPVPTPPLPEQHEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 399 Query: 417 GQ 418 G+ Sbjct: 400 GE 401 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 47/200 (23%), Positives = 82/200 (41%), Gaps = 2/200 (1%) Query: 18 IGAIPKHWKVVPIKRFTKLNTG-RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + +IP +W + + + + ++ + I++ +ES TGK TS Sbjct: 209 LPSIPNNWSWMRFDQVADIASKLKSPLDYPNTIHLAPNHIESWTGKASGYQTILEDGVTS 268 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 F GQI+Y K+ PYL K IA FDG+CS + K + L W+L+ T Sbjct: 269 AKHEFYTGQIIYSKIRPYLCKVTIATFDGMCSADMYPINSK-IDTHFLFRWMLTNTFTDW 327 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + K + IP+P PPL EQ I ++ DT+ + + + Sbjct: 328 ASNAESRTVLPKINQKDLSEIPVPTPPLPEQHEIVRRVEQLFAYADTIEKQVNNALARVN 387 Query: 197 EKKQALVSYIVTKGLNPDVK 216 Q++++ L + Sbjct: 388 NLTQSILAKAFRGELTAQWR 407 >gi|308183634|ref|YP_003927761.1| putative type I restriction enzyme (specificity subunit) [Helicobacter pylori PeCan4] gi|308065819|gb|ADO07711.1| putative type I restriction enzyme (specificity subunit) [Helicobacter pylori PeCan4] Length = 399 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 73/409 (17%), Positives = 144/409 (35%), Gaps = 26/409 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P +W+ V + ++N T + YI LE+VE G + ++ + + Sbjct: 6 LPLNWQRVRLGDIAEINPPTTIPNV--FYYIDLENVEKGQL-LNKQLMTKNKAPSRARRL 62 Query: 81 FAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +K ILY + PY R +G + ST + ++ P L L S + Sbjct: 63 LSKNDILYQLVRPYQRNNYFFTLNGNYVASTGYAQIRT-LQNPSFLYFALHSNYFVNAVL 121 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 CEG + + + IPPL EQ+ I + + L ++ + K Sbjct: 122 DRCEGTSYPAISSNELKKCEVIIPPLNEQIAIANILSGLDRYLCALDALILKKEGVKKAL 181 Query: 199 KQALVSYIVT-KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 L+S KG N + G +G+ K + + I Sbjct: 182 SFELLSQRKRLKGFNQAWQRVRLGD--IGITISGLVGKTKQDFINGNAK--------YIT 231 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGI 315 L+ N + + +K E ++ F + + + +++ Sbjct: 232 FLNVLNNVIIDTSILENVKIYPNEKQNSFKKYDLFFNTSSETPKEVGMCAVLLDDIDQVF 291 Query: 316 ITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPI 372 + S + +D +L++L+ S K F + G R +L + + +PP+ Sbjct: 292 LNSFCFGFRIFDKAVDGLFLSYLINSEIGRKAFENLAQGSTRYNLSRSGFNNVCLFLPPL 351 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ I N+++ I L K Q + + + ++ +I + Sbjct: 352 NEQIAIANILSALDNEITSLKNKKRQ----FENIKKALNHDLMSAKIRV 396 >gi|163790647|ref|ZP_02185075.1| HsdS specificity protein of type I restriction-modification system [Carnobacterium sp. AT7] gi|159874095|gb|EDP68171.1| HsdS specificity protein of type I restriction-modification system [Carnobacterium sp. AT7] Length = 412 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 51/401 (12%), Positives = 130/401 (32%), Gaps = 18/401 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +W + + +++G T G+ DI + +V+ + + S Sbjct: 17 NNWVKCELGEVSDISSGGTPSRGESSYWNGDIPWATTAEVKYSEITDTKEKITIDGLNNS 76 Query: 77 TVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + G IL G + I + + +Q + + L Sbjct: 77 SAKLMPVGTILLAMYGQGKTRGQLGILSIEAATNQANANIQVHRYIYNYFVYYQLVKKY- 135 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + ++ + ++ + + ++ L KI ++D +IT + + + Sbjct: 136 NLLRNLANEGGQANLSLGIVKSVNIVVTNNLDEQL---KIGEFFKQLDNIITLQQQLLND 192 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K+ K+A++ + + K++ +G + A N Sbjct: 193 HKQLKKAMLQKMFPQKGESIPKIRFAGFTQKWENLKLSSLYVKGASGGTPKSTNKSYYIG 252 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 NI L +I K S E + I L + A + Sbjct: 253 NIPFLGISDISASNGYIYDTKKRISQEGLDSSSAWLVPKEAISLAMYASVGKVAILKTDV 312 Query: 315 IITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + A+ + I + +L++ + +G + +L + ++ L ++VP + Sbjct: 313 ATSQAFYNMIFKDIATRNFIFQYLLKKESTNGWNKLISTGTQANLNAKKIQDLQIMVPSL 372 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I + ++D + E+ + + + + + Sbjct: 373 EEQEKIGD----LFGKLDKTITLHEKKLETYQNLKKAMLQK 409 >gi|302343958|ref|YP_003808487.1| restriction modification system DNA specificity domain protein [Desulfarculus baarsii DSM 2075] gi|301640571|gb|ADK85893.1| restriction modification system DNA specificity domain protein [Desulfarculus baarsii DSM 2075] Length = 411 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 82/407 (20%), Positives = 149/407 (36%), Gaps = 30/407 (7%) Query: 24 HWKVVPIKRFTKLNT--GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 WK+V K R E+ +GLE ++ + NS TS F Sbjct: 9 GWKMVKFGEVVKNANLVEREPEANGVEKIVGLEHIDPENLHI--RRWNSVVDGTSFTRKF 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIE 138 GQ L+GK Y RK A+F+GICS L +PK+ LPELL S Sbjct: 67 VPGQTLFGKRRAYQRKVAYAEFEGICSGDILTFEPKNRKVLLPELLPFICQSNAFFDHAL 126 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ W + + +PPL EQ I E + A + Sbjct: 127 GTSAGSLSPRTSWTALQDFEFQLPPLDEQKRIAEILWAADEAFNQHQQSNDNL----MSV 182 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-NTKLIESNIL 257 K+ L+S + +G+ + + +G +P HW + + + + L ES Sbjct: 183 KRTLLSRLTVRGIG----QQATQHTRLGEIPVHWRLATVEDVTSICQYGLSIPLNESGQY 238 Query: 258 SLSYGNIIQKLETRNMGLKPESYE----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + LK + V G+I+F + + + + Sbjct: 239 PILRMMNYDDGRIIANDLKYVDLDDSDFNSFKVHKGDILFNRTNSADLVGKVGIFDLEGD 298 Query: 314 GIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GL-RQSLKFEDVKRLPVLV 369 + S + I +L + + S + A + G+ + ++ ++K++ V + Sbjct: 299 YVFASYLVRLRADEDQILPDFLNYYLNSGLGQRRLLAYATPGVSQTNISAGNLKKVLVPL 358 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQS-IVLLKERRSSFIAAAV 415 PP++EQ I V+N L + ++++ + ++ ++ I V Sbjct: 359 PPMEEQKQIVEVLNNL-----ELRKHLQRNHVAEAQKCLAALINNLV 400 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 68/170 (40%), Gaps = 10/170 (5%) Query: 18 IGAIPKHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +G IP HW++ ++ T + G + + Y L + G+ + D D S Sbjct: 205 LGEIPVHWRLATVEDVTSICQYGLSIPLNESGQYPILRMMNYDDGRIIANDLKYVDLDDS 264 Query: 77 --TVSIFAKGQILYGKLGP--YLRKAIIADFDG-ICSTQFLVL---QPKDVLPELLQGWL 128 KG IL+ + + K I D +G +LV +LP+ L +L Sbjct: 265 DFNSFKVHKGDILFNRTNSADLVGKVGIFDLEGDYVFASYLVRLRADEDQILPDFLNYYL 324 Query: 129 LSIDVTQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 S +R+ A G + ++ + + +P+PP+ EQ I E + Sbjct: 325 NSGLGQRRLLAYATPGVSQTNISAGNLKKVLVPLPPMEEQKQIVEVLNNL 374 >gi|183597751|ref|ZP_02959244.1| hypothetical protein PROSTU_01052 [Providencia stuartii ATCC 25827] gi|188023031|gb|EDU61071.1| hypothetical protein PROSTU_01052 [Providencia stuartii ATCC 25827] Length = 376 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 68/404 (16%), Positives = 146/404 (36%), Gaps = 38/404 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P WK + + + G+ + +E G+ G + + Sbjct: 2 VPNGWKQTTLDKVLTIGGGKDYK-----------HLEEGSIPVYGSGGYMLSV---SDYL 47 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + + G+ G + + T F KD P + + ++ + Sbjct: 48 YDGESVCIGRKGTIDKPIFLKGKFWTVDTLFYTHSFKDSEPYYIYQFFQTV----PWRRL 103 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 E + + I + + +PPL EQ I + + D I + I+ +++K+ Sbjct: 104 NEASGVPSLAKSIINKVKINLPPLPEQRKIAKIL----STWDKAIATTEKLIDASQQQKK 159 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 AL+ ++T + ++G + G WE P + E K++ + + + S Sbjct: 160 ALMQQLLTGK--KRLVNPETGKAFEGE----WEEVPLSNWLVEFKEKSSAQDQHRVYTSS 213 Query: 261 YGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 ++ + E N + + I+ PG + +R + + E GII+ Sbjct: 214 RSGLVPQDEYFGNSRISDRKNIGFHILPPGHMTYRSRSDDGY-FTFNLFKGNENGIISHY 272 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDI 378 Y G + ++A L VF G ++ L F +K + VP EQ I Sbjct: 273 YPVFTSKGSNDFFIALL---EQYRNVFGKHSVGTSQKVLSFNALKAIRFFVPSTYEQQKI 329 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +V+ +E ++ + LK+ + + + +TG+ ++ Sbjct: 330 ASVLIAADKE----IELLQAKLAHLKDEKKALMQQLLTGKRRVK 369 >gi|197334792|ref|YP_002156836.1| restriction modification system DNA specificity domain protein [Vibrio fischeri MJ11] gi|197316282|gb|ACH65729.1| restriction modification system DNA specificity domain protein [Vibrio fischeri MJ11] Length = 376 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 53/404 (13%), Positives = 123/404 (30%), Gaps = 42/404 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + KL G+ + + GKY N ++ + + Sbjct: 2 SWVEKSLDEVLKLEYGKPLDKS----------LRKEGGKYPAYGANGIKAWSDEYFH-DE 50 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I+ G+ G + + V K+ +LL + + Sbjct: 51 ETIVVGRKGSAGELTLTDGKFWPLDVTYFVKTNKNDYDIKFLYYLLLSLDLPSLATGVK- 109 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I P + Q + ++ I+ T + ++ +E + + Sbjct: 110 ---PGINRNNVYKIQAKFPSYSTQKQVAGQLDKAFDGIEQARTNTEKNLQNARELFDSYL 166 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL--IESNILSLSY 261 + + + W+ L T+ + E + + Sbjct: 167 QQVFS------------------ECGEGWKKTTLNELCTKFEYGTSSKSSQEGEVPVIRM 208 Query: 262 GNIIQKLETRNMGLKP--ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 GNI + + E ++ +++F + + ER I Sbjct: 209 GNIQDGRIVMDKLVYSLNEEDNQKYRLNFNDVLFNRTNSAELVGKTAIYKSEERAIFAGY 268 Query: 320 YMAVKPHGI--DSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVP-PIKE 374 + + + ++ YL + + S K S + ++ +K P+ +P ++E Sbjct: 269 LIRIHRNEKLLNADYLNFYLNSPIARKYGEQVMSQSTNQANISGTKLKTYPISIPVSLEE 328 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q I + I+ +++ L + + L E + S + A TGQ Sbjct: 329 QQSIVDKISTLKEKVEELEATHKSKLTALDELKQSLLQQAFTGQ 372 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 70/201 (34%), Gaps = 12/201 (5%) Query: 23 KHWKVVPIKRFT-KLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + WK + K G +S+S + ++ I + +++ G + + D Sbjct: 175 EGWKKTTLNELCTKFEYGTSSKSSQEGEVPVIRMGNIQDGRIVMDKLVYSLNEEDNQKYR 234 Query: 80 IFAKGQILYGKLGP---YLRKAIIA-DFDGICSTQFLVLQPKDVL---PELLQGWLLSID 132 + +L+ + + AI + I + + + + L L I Sbjct: 235 L-NFNDVLFNRTNSAELVGKTAIYKSEERAIFAGYLIRIHRNEKLLNADYLNFYLNSPIA 293 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + ++ + P+ IP L EQ I +KI +++ L Sbjct: 294 RKYGEQVMSQSTNQANISGTKLKTYPISIPVSLEEQQSIVDKISTLKEKVEELEATHKSK 353 Query: 192 IELLKEKKQALVSYIVTKGLN 212 + L E KQ+L+ T L Sbjct: 354 LTALDELKQSLLQQAFTGQLT 374 >gi|217977716|ref|YP_002361863.1| type I restriction enzyme [Methylocella silvestris BL2] gi|217503092|gb|ACK50501.1| type I restriction enzyme [Methylocella silvestris BL2] Length = 439 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 77/412 (18%), Positives = 156/412 (37%), Gaps = 17/412 (4%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIG-LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 K + + + ++ + V+ + ++ SR +G + Sbjct: 5 RFKNVMRERVDLSETGEETLLSVSEYYGVKPRAEAFQGEEYESRAESLEGYRQVQRGDFV 64 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICEG-- 143 + + I+++DGI S + V Q + + L S + + +G Sbjct: 65 MNYMLAWKGAYGISEYDGIVSPAYAVFQIDKSKIDLKYLHHRTRSNPMRALFRSRSKGII 124 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + +P LA Q +I + + ET RID LI ++ RF L E+ +A + Sbjct: 125 DSRLRLYPDALLATEIDLPGLAAQKVIADFLDRETARIDQLIEKKERFSALAAERWRATL 184 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + + SG ++ VP W + P LV ++ + Sbjct: 185 DAEILGRTTAGKRSLTSGQPYISDVPADWVLTPLKHLVDPRRPVMYGIVLPGPNVENGIM 244 Query: 264 IIQKLETRNMGLKPESYET----------YQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 I++ + + L P+ + G++V D + A + Sbjct: 245 IVKGGDVKPNRLSPDRLCKTSREIEAGYVRSRLRGGDLVMAIRGGIGDVE-IVPADIEGA 303 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPI 372 + A HG+ + +L + +++ + A +G + + DV R+ V VPP Sbjct: 304 NLTQDAARIAPRHGVLNRWLRYALQAPSVFAPLGAGANGAAVRGVNIFDVDRVLVPVPPT 363 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 EQ I + ++++ +I + EKI L++E R++ I AAV GQI++ Sbjct: 364 AEQIVIADRLDIKEQQILRMREKIFDHAKLIQEFRAALITAAVAGQINVDTW 415 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 78/213 (36%), Gaps = 15/213 (7%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTKLNT----GRT---SESGKDIIYIGLEDVESGTGKYLP 65 SG +I +P W + P+K G I+ + DV+ + P Sbjct: 201 SGQPYISDVPADWVLTPLKHLVDPRRPVMYGIVLPGPNVENGIMIVKGGDVKPN--RLSP 258 Query: 66 KDGNSRQSDTSTVSI---FAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDV 119 + + G ++ G I+ + + + V Sbjct: 259 DRLCKTSREIEAGYVRSRLRGGDLVMAIRGGIGDVEIVPADIEGANLTQDAARIAPRHGV 318 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L L+ L + V + A GA + + + + +P+PP AEQ++I +++ + Sbjct: 319 LNRWLRYALQAPSVFAPLGAGANGAAVRGVNIFDVDRVLVPVPPTAEQIVIADRLDIKEQ 378 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 +I + + +L++E + AL++ V +N Sbjct: 379 QILRMREKIFDHAKLIQEFRAALITAAVAGQIN 411 >gi|134097471|ref|YP_001103132.1| restriction modification system DNA specificity domain-containing protein [Saccharopolyspora erythraea NRRL 2338] gi|133910094|emb|CAM00207.1| putative restriction modification system DNA specificity domain [Saccharopolyspora erythraea NRRL 2338] Length = 411 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 83/403 (20%), Positives = 141/403 (34%), Gaps = 12/403 (2%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 VVP+K + G+ S + I G ++ R + I G + Sbjct: 3 VVPLKYVAYIRPGQAPPSTEVSDLIDGLPFLQGNAEFQAAHPVPRLQCDTASKIAKCGDV 62 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 L P I GI V W +R++A+ G T Sbjct: 63 LLSVRAPVGALNIADREYGIGRGLCSVSATGCD---ARFLWWWLHSAGERLDAVSTGTTY 119 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 + +G +P P L EQ I + + AET RID L R R +++L+EK V Sbjct: 120 RAVTGEDVGMLPFPRVSLEEQRRIADFLDAETTRIDKLSALRERQLDILEEKAMRRVYDT 179 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 V G + SG+ W+G VP HW V K + L + Sbjct: 180 VR-GTGVVGARRPSGLSWLGSVPVHWRVAAVSHYFEVELGKMLNQERARGDHLRPYLRVA 238 Query: 267 KLETRNMGLK-------PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 ++ + P + + PG+++ + ++ S ++ E + Sbjct: 239 NVQWGVVDTTELAMMDFPPEEQKRYRLQPGDLLVNEGGSWPGRAAVWSGEIEEIYYQKAL 298 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + +L + + + + KVF G S L E ++ P + EQ Sbjct: 299 HRIRPRGMESTWWLYFCLVAAERMKVFQVQGNSSTMTHLTREQLRPQRFPFPDLAEQEQA 358 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + A+ + + + L ERR + I AAVTG+ D+ Sbjct: 359 VERLKDAEAKDRQIRRVLSRQQATLAERRQALITAAVTGEFDV 401 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 80/211 (37%), Gaps = 9/211 (4%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLP 65 + SG+ W+G++P HW+V + + ++ G+ + Y+ + +V+ G Sbjct: 190 RPSGLSWLGSVPVHWRVAAVSHYFEVELGKMLNQERARGDHLRPYLRVANVQWGVVDTTE 249 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPE 122 + G +L + G + +A + + ++P+ + Sbjct: 250 LAMMDFPPEEQKRYRLQPGDLLVNEGGSWPGRAAVWSGEIEEIYYQKALHRIRPRGMEST 309 Query: 123 LLQGWLLSIDVTQRIEAIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + L ++ + +TM+H + + P P LAEQ E++ + Sbjct: 310 WWLYFCLVAAERMKVFQVQGNSSTMTHLTREQLRPQRFPFPDLAEQEQAVERLKDAEAKD 369 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + R L E++QAL++ VT + Sbjct: 370 RQIRRVLSRQQATLAERRQALITAAVTGEFD 400 >gi|301022250|ref|ZP_07186148.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 196-1] gi|135206|sp|P06990|T1SB_ECOLX RecName: Full=Type-1 restriction enzyme EcoBI specificity protein; Short=S.EcoBI; AltName: Full=Type I restriction enzyme EcoBI specificity protein; Short=S protein gi|41742|emb|CAA23552.1| hsdS [Escherichia coli] gi|299881300|gb|EFI89511.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 196-1] Length = 474 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 62/416 (14%), Positives = 140/416 (33%), Gaps = 29/416 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + + + G +S + + I + DV G Sbjct: 24 SWLRISMDSVANITNGFAFKSSEFNNRKDGVPLIRIRDVLKGN------TSTYYSGQIPE 77 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G G + I + + + ++ ++ + I Sbjct: 78 GYWVYPEDLIVGMDGDF-NATIWCSEPALLNQRVCKIEVQEDKYNKRFFYHALPGYLSAI 136 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 A T+ H + + + +P+PPLAEQ +I EK+ ++D+ + ++LK Sbjct: 137 NANTSSVTVKHLSSRTLQDTLLPLPPLAEQKIIAEKLDTLLAQVDSTKARLEQIPQILKR 196 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLV----PDHW----------EVKPFFALVTE 243 +QA+++ VT L + K + + P+ W +P V + Sbjct: 197 FRQAVLAAAVTGRLTKEDKDFITKKVELDNYKILIPEDWSETILNNIINTQRPLCYGVVQ 256 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 IE + + R + + + V +I+ + + Sbjct: 257 PGDDIKDGIELIRVCDINDGEVDLNHLRKISKEIDLQYKRSKVRKNDILVTIVGAIG-RI 315 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDV 362 + + A ++ + I +L + S + + + R++L +D+ Sbjct: 316 GIVREDINVNIARAVARISPEYKIIVPMFLHIWLSSPVMQTWLVQSSKEVARKTLNLKDL 375 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 K V +P I+EQ +I + A D + +++ ++ + S +A A G+ Sbjct: 376 KNAFVPLPSIEEQHEIVRRVEQLFAYADSIEKQVNNALARVNNLTQSILAKAFRGE 431 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 67/207 (32%), Gaps = 11/207 (5%) Query: 21 IPKHWKVVPIKRFTKLNT----GRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP+ W + G I I + D+ G S++ Sbjct: 231 IPEDWSETILNNIINTQRPLCYGVVQPGDDIKDGIELIRVCDINDGEVDLNHLRKISKEI 290 Query: 74 DTS-TVSIFAKGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVL--PELLQGWLL 129 D S K IL +G R I+ + + + + P+ + P L WL Sbjct: 291 DLQYKRSKVRKNDILVTIVGAIGRIGIVREDINVNIARAVARISPEYKIIVPMFLHIWLS 350 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + + + + K + N +P+P + EQ I ++ D++ + Sbjct: 351 SPVMQTWLVQSSKEVARKTLNLKDLKNAFVPLPSIEEQHEIVRRVEQLFAYADSIEKQVN 410 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVK 216 + + Q++++ L + Sbjct: 411 NALARVNNLTQSILAKAFRGELTAQWR 437 >gi|21228805|ref|NP_634727.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] gi|20907324|gb|AAM32399.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] Length = 440 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 59/421 (14%), Positives = 135/421 (32%), Gaps = 40/421 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W IK +N G+ + D G +G S Sbjct: 6 ELPEGWAECQIKDIVVINYGKGLKK---------SDRVEGQFDVFGSNGIV---GKHNQS 53 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + ++ G+ G + ++ T + + + L L ++++ Sbjct: 54 LTNGPTVIIGRKGSVGEINLSSEPCWPIDTTYYIDNFYGINRIFLYYLLKTLNL----AN 109 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + I + +P+PPL+EQ I I A R+D + R E+LK+ + Sbjct: 110 YDTSTAIPGINRNDIYSQLVPLPPLSEQHRIVSAIEALFARLDATNEKLDRVQEILKKFR 169 Query: 200 QALVSYIVTKGLNPDV---KMKDSGIEWVGLV----------PDHWEVKPF---FALVTE 243 +++++ L + + + + P W + V + Sbjct: 170 ESVLAAACDGRLTEEWRKENLHCNEYFAIDEDQFNLVKQWRIPTVWSWSTLEDSCSHVVD 229 Query: 244 LNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPE----SYETYQIVDPGEIVFRFIDL 298 K + + + + ++ N E G+I++ Sbjct: 230 CPHSTPKWTDIGVYCVRTSELKCGHIDFSNAKYVSEATYLERIKRLKPQEGDILYSREGT 289 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSL 357 + S + + + + + ++ ++ S + Sbjct: 290 VGIASLVPSNVKI--CLGQRLMLFRTKNNLIPSFFVKVLNSPYIYDSVKKSTMGSTAPRF 347 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 D+K+ P +PP+ EQ +I ++ A D + K+ + ++ R S +A A +G Sbjct: 348 NVADIKKFPTPLPPLPEQQEIVRRVDALFAFADSIETKVAAAREKTEKLRQSILAKAFSG 407 Query: 418 Q 418 Q Sbjct: 408 Q 408 >gi|158522104|ref|YP_001529974.1| restriction modification system DNA specificity subunit [Desulfococcus oleovorans Hxd3] gi|158510930|gb|ABW67897.1| restriction modification system DNA specificity domain [Desulfococcus oleovorans Hxd3] Length = 477 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 65/442 (14%), Positives = 136/442 (30%), Gaps = 57/442 (12%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W P+++ +++ G+ K + G Y NS + + Sbjct: 5 LPEGWVAAPLQKISQIVYGKGLPKNK----------FNKQGLYPVFGANSIIGYYDS-FL 53 Query: 81 FAKGQILYGKLGPYLRKAII-ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + Q+L G I + S +V P + + E Sbjct: 54 YEDPQVLISCRGANSGTINISPPKCFVTSNSLVVQLPNTLHQSFKYLYYALES--SDKEK 111 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I G + + +P+PP EQ I ++ RID L T + ++K + Sbjct: 112 IVTGTAQPQVTIDNLKSFCVPLPPFNEQKRIVARLDQIIPRIDKLKTRLDKIPTIIKRFR 171 Query: 200 QALVSYIVTKGLNPDVKMKDSGIE------------------------------------ 223 Q++++ VT L + +E Sbjct: 172 QSVLTAAVTGRLTEKWREDHPDVEGAEATVQSIYYRRLDESQTNQQKNKIEKLFAEVETE 231 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIES--NILSLSYGNIIQK-LETRNMGLKPESY 280 GL+P+ W+ + + +I L GN+ ++ N+ Sbjct: 232 DNGLLPETWKYTFLNKICESFQYGTSSKSSKKGDIPVLRMGNLQNGAIDWSNLVYSSNKK 291 Query: 281 E-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTYLAWLMR 338 E ++ ++F + I + +DS YL + + Sbjct: 292 EIEKYKLEKNTVLFNRTNSPELVGKTAIYLGERAAIFAGYLIRINNMDILDSHYLNYSLN 351 Query: 339 SYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + + + ++ + + R + PP++EQ +I + A D L Sbjct: 352 TDYAKAFCNREKTDGVNQSNINAQKLGRFEIPFPPLEEQKEIVRQVERSFALADKLEAHY 411 Query: 397 EQSIVLLKERRSSFIAAAVTGQ 418 + + + + S +A A G+ Sbjct: 412 QNARARVDKLARSVLAKAFRGE 433 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 79/216 (36%), Gaps = 10/216 (4%) Query: 19 GAIPKHWKVVPIKRFTK-LNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 G +P+ WK + + + G +S+S K DI + + ++++G + +S + + Sbjct: 234 GLLPETWKYTFLNKICESFQYGTSSKSSKKGDIPVLRMGNLQNGAIDWSNLVYSSNKKEI 293 Query: 76 STVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSI 131 + K +L+ + + AI +L+ + + L+ Sbjct: 294 EKYKL-EKNTVLFNRTNSPELVGKTAIYLGERAAIFAGYLIRINNMDILDSHYLNYSLNT 352 Query: 132 DVTQ--RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 D + +G S+ + + +G +P PPL EQ I ++ D L Sbjct: 353 DYAKAFCNREKTDGVNQSNINAQKLGRFEIPFPPLEEQKEIVRQVERSFALADKLEAHYQ 412 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 + + +++++ L P + + + Sbjct: 413 NARARVDKLARSVLAKAFRGELTPQDPNDEPAEKLL 448 >gi|254415490|ref|ZP_05029250.1| Type I restriction modification DNA specificity domain protein [Microcoleus chthonoplastes PCC 7420] gi|196177671|gb|EDX72675.1| Type I restriction modification DNA specificity domain protein [Microcoleus chthonoplastes PCC 7420] Length = 424 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 62/411 (15%), Positives = 131/411 (31%), Gaps = 20/411 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + +G T +I + +++ + S Sbjct: 16 WQWKKLSELANTTSGGTPRRNHLEYFQGNINWFKSGELKDAEIFDSEEKITVEAIKESNA 75 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV----LPELLQGWLLSIDVT 134 IF KG +L G + K + + + + PK L + + Sbjct: 76 KIFPKGTLLIAMYGATVGKLGLLGVEAATNQAICAIFPKKQFGLPLLNNWFLFYYFKYIR 135 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ GA + I +PIP + +L + RI++L+ E +L Sbjct: 136 HQLINRSFGAAQPNISQTLIKETYIPIPFPKDIILSLDVQNRIVSRIESLLGELKGDHQL 195 Query: 195 --LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + + V L ++ K +G + +K + +N + Sbjct: 196 LDKMRRDTSRVMEATLTELINEIDKKYPDSPTIGELLSSKYIKILG--GGTPSTENEEYW 253 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 +I S ++ + ++ + IV G ++ + Sbjct: 254 GGSIPWTSPRDMKRWYIDTTQKYISQTALQDKKLNIVPEGSVLIVVRGMILAHTLPVGVT 313 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRS--YDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 E I V + S YL +++R+ + + G R LK + +K++ + Sbjct: 314 KNEVTINQDMKALVPEKNLLSEYLGYILRARAPFILQQVETAAHGTR-RLKTDTLKKVVI 372 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + I EQ I +N ++ + ++Q LL+ S + A GQ Sbjct: 373 PIVSISEQRSIIEYLNFFQTKVHEMKNIMQQDAQLLERLEQSILEKAFQGQ 423 >gi|71900230|ref|ZP_00682368.1| similar to Restriction endonuclease S subunits [Xylella fastidiosa Ann-1] gi|71730003|gb|EAO32096.1| similar to Restriction endonuclease S subunits [Xylella fastidiosa Ann-1] Length = 307 Score = 128 bits (322), Expect = 2e-27, Method: Composition-based stats. Identities = 89/263 (33%), Positives = 136/263 (51%), Gaps = 4/263 (1%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 G++W+ +P HW+V +K + + +++ +D IY+ LE V+S TG P G Sbjct: 27 GIEWLQDVPGHWEVQRLKFIARNMSEQSTVKARDEIYLALEHVQSWTGVARPLKGTV--E 84 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPELLQGWLLSI 131 STV F IL+GKL PYL K A+ G+C ++FLVL+P+ +LP L+ L Sbjct: 85 FASTVKRFFADDILFGKLRPYLAKVTRANCVGVCVSEFLVLRPRKELILPSYLEHLLRCK 144 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 V I + GA M DW IGN+ +P+PPL EQ I + A+ V I I + Sbjct: 145 RVIDLINSATAGAKMPRVDWAFIGNVRLPLPPLPEQKQIAAYLRAQDVHIARFIKVKRDL 204 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I+LL E+K ++ + VT+GL+ V +K SGIEW+G VP HW+ V+ ++ T Sbjct: 205 IKLLTEQKLRIIDHAVTRGLDASVALKPSGIEWLGDVPVHWDTSRLKYCVSRIHAGGTPD 264 Query: 252 IESNILSLSYGNIIQKLETRNMG 274 + + I L ++ Sbjct: 265 TGVDGYWSDSSDGIPWLLIADVT 287 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 86/223 (38%), Gaps = 3/223 (1%) Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + +KK + + VT + V + GIEW+ VP HWEV+ + ++ ++T Sbjct: 1 MTQKKHTISTAAVTSRHDASVPLPTFGIEWLQDVPGHWEVQRLKFIARNMSEQSTVKARD 60 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I R + E T + +I+F + K + A + Sbjct: 61 EIYLALEHVQSWTGVARPLKGTVEFASTVKRFFADDILFGKLRPYLAK--VTRANCVGVC 118 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 + + + I +YL L+R + + + +G + + + + + +PP+ Sbjct: 119 VSEFLVLRPRKELILPSYLEHLLRCKRVIDLINSATAGAKMPRVDWAFIGNVRLPLPPLP 178 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 EQ I + + I ++ I LL E++ I AVT Sbjct: 179 EQKQIAAYLRAQDVHIARFIKVKRDLIKLLTEQKLRIIDHAVT 221 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 33/77 (42%), Gaps = 10/77 (12%) Query: 11 KDSGVQWIGAIPKHWKVVPIKR-FTKLNTGRTSESG---------KDIIYIGLEDVESGT 60 K SG++W+G +P HW +K ++++ G T ++G I ++ + DV Sbjct: 231 KPSGIEWLGDVPVHWDTSRLKYCVSRIHAGGTPDTGVDGYWSDSSDGIPWLLIADVTRAD 290 Query: 61 GKYLPKDGNSRQSDTST 77 K ++ S Sbjct: 291 RVVGSKKRVTQAGLESK 307 >gi|294676511|ref|YP_003577126.1| type I restriction-modification system RcaSBIP subunit S [Rhodobacter capsulatus SB 1003] gi|294475331|gb|ADE84719.1| type I restriction-modification system RcaSBIP, S subunit [Rhodobacter capsulatus SB 1003] Length = 541 Score = 128 bits (322), Expect = 2e-27, Method: Composition-based stats. Identities = 66/415 (15%), Positives = 151/415 (36%), Gaps = 24/415 (5%) Query: 20 AIPKHWKVVPIKRFT--KLNTGRTSESGKDIIYIGLEDVESGTGKYL-PKDGNSRQSDTS 76 +P W + + T T T + YI + +++ T PK R + + Sbjct: 3 ELPNGWAETTLGKVTLPFETTDPTRRPDETFQYIDIGSIDNQTQTITQPKSILGRDAPSR 62 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLP-ELLQGWLLSID 132 + K +L+ + YL+ + + ST VL+P + L L W+ S + Sbjct: 63 ARRVVKKDDVLFSTVRTYLKNIAVVPESLDSQLTSTGIAVLRPSEALDGRYLFNWVKSDE 122 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +G K + +P+PPL EQ I K+ T R + R Sbjct: 123 FISTMSKAQDGTLYPAVTDKDVSGGRIPLPPLHEQKRIVAKVDGLTARTARARADLDRIP 182 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L+ KQ+L++ + L + + ++ + ++ VT+ + + Sbjct: 183 TLIARYKQSLLALAFSGELTAGWRKTKALNDF-----ETVKLHSLCLSVTDGDHQAPPRS 237 Query: 253 ESNILSLSYGNIIQKLETRNMGL------KPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 +S I ++ + + + + + G+++F + Sbjct: 238 DSGIPFITISAMNTGRIDLSKATRAVPRSYFDEIKESRRPAIGDVLFSVTGSIGIPALV- 296 Query: 307 SAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVK 363 + + +KP+ +L++L+ S + + A+ +G + ++ ++ Sbjct: 297 --ETDLPFVFQRHIAIMKPNTERVSGRFLSYLLASPQIREQVDAIATGTAQLTVPLGGLR 354 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + P ++EQ +I +I +D + + LL + ++ +A A G+ Sbjct: 355 QFDFPCPTLEEQAEIVRLITSAFNWVDRMAADHAAAADLLPKLDAAILAKAFRGE 409 >gi|117921400|ref|YP_870592.1| restriction modification system DNA specificity subunit [Shewanella sp. ANA-3] gi|117613732|gb|ABK49186.1| restriction modification system DNA specificity domain [Shewanella sp. ANA-3] Length = 587 Score = 128 bits (322), Expect = 2e-27, Method: Composition-based stats. Identities = 85/460 (18%), Positives = 157/460 (34%), Gaps = 69/460 (15%) Query: 21 IPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVES------GTGKYLPKDG 68 +PK W V I ++++G +S + DV G Sbjct: 6 LPKGWAVTTIGAVARVSSGVGFPIKYQGKSEGLYPVYKVGDVSKAVTSKHGNLAVAGHYV 65 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + ++ IF G L+ K+G R+A + + V+ K L Sbjct: 66 DKEEAAELKGEIFPVGATLFAKIGEAVKLNRRAFVRKPGLADNNVMAVIPDKSDCNRFLY 125 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L +ID+T+ + T+ I +I + +PPLAEQ++I +K+ +++T Sbjct: 126 QFLRAIDLTETSRS----TTVPSIRKGDIEDIELYLPPLAEQIVIADKLDTLLAQVETTK 181 Query: 186 TERIRFIELLKEKKQALVSYIVTKGL------------------------------NPDV 215 R E+LK +Q+++S V+ L NP + Sbjct: 182 ARLERIPEILKSFRQSVLSAAVSGKLTQEWRESHGNGTGEEVVKADAINKSVLLNENPAL 241 Query: 216 KMKDSGIE------WVGLVPDHW---EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 K K S IE ++ +P+ W +T K +S + L+ ++ Sbjct: 242 KKKKSTIESQIDTEYIFDLPESWGFTTWGKISEWITYGFTKPMPKSDSGVKLLTAKDVQY 301 Query: 267 KLETRNM-----GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 N +S G+++ +R E I + Sbjct: 302 FDVNINDAGLTTSSAFQSLSDKDRPIKGDLLITKDGSIGRAALVR---TDEPFCINQSVA 358 Query: 322 AVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDI 378 ++ YL +L S + G Q L D + P+ VP ++EQ +I Sbjct: 359 VCWLRSTSMNKDYLEFLANSEFTQRFVKDKAQGMAIQHLSIIDYAKCPLPVPSLEEQTEI 418 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + A D + +K ++ + S +A A G+ Sbjct: 419 VRRVEELFAFADSIEQKATAALARVNNLTQSILAKAFRGE 458 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 71/210 (33%), Gaps = 9/210 (4%) Query: 16 QWIGAIPKHWKVVPIKRFTK-LNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 ++I +P+ W + ++ + G T +S + + +DV+ + Sbjct: 255 EYIFDLPESWGFTTWGKISEWITYGFTKPMPKSDSGVKLLTAKDVQYFDVNINDAGLTTS 314 Query: 72 QS--DTSTVSIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQ--G 126 + S KG +L K G R A++ D + V + Sbjct: 315 SAFQSLSDKDRPIKGDLLITKDGSIGRAALVRTDEPFCINQSVAVCWLRSTSMNKDYLEF 374 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S + ++ +G + H P+P+P L EQ I ++ D++ Sbjct: 375 LANSEFTQRFVKDKAQGMAIQHLSIIDYAKCPLPVPSLEEQTEIVRRVEELFAFADSIEQ 434 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 + + + Q++++ L D + Sbjct: 435 KATAALARVNNLTQSILAKAFRGELTADWR 464 >gi|42525031|ref|NP_970411.1| type I restriction-modification system, S subunit [Bdellovibrio bacteriovorus HD100] gi|39577242|emb|CAE81065.1| type I restriction-modification system, S subunit [Bdellovibrio bacteriovorus HD100] Length = 417 Score = 128 bits (322), Expect = 2e-27, Method: Composition-based stats. Identities = 56/416 (13%), Positives = 132/416 (31%), Gaps = 19/416 (4%) Query: 22 PKHWKVVPIKRF----TKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P W + + G + + +I D G + S+ Sbjct: 5 PADWDKHILDELLEDNFNITYGVVQPGDEAPNGVKFIRGGDFPKGKIEENKLRTISKDIS 64 Query: 75 TS-TVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLS 130 S ++ G++L +G A++ I L+ L ++ +L S Sbjct: 65 ESYKRTVLNGGELLVALVGYPGTVAVVPRSLRGANIARQTALIRLAPKYLNTYVKYFLES 124 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 I G+ + K + + + P + EQ I E + + I+ E + Sbjct: 125 DFGQGEILRGSLGSAQQVINLKDLKLVQVYTPKIDEQKKIAEFLTSVDKVIELTEIEIEK 184 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHW--EVKPFFALVTELNRKN 248 L K Q L+S + + + W V + + + + + Sbjct: 185 LQNLKKGMMQDLLSKGIGHSTTIESAVGPVPKSWSIEVLSDLVLKGRKITYGIVQPGSYD 244 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + E + ++ E ++ G++V ++ Sbjct: 245 ERGVLLVRGQDYISGWAEAGEVFKVSVEIEKKFERARLNVGDVVICIAGAGVGAVNVVPM 304 Query: 309 QVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLP 366 + I + A ++ I YL + ++ K G + L DV++ Sbjct: 305 RFNGANITQTTARVSCDEKKILGKYLYYYLQEGTGLKQIQKYIKGSAQPGLNLNDVEKFL 364 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + VPP+ EQ I ++ +++ + + + + + +TG++ ++ Sbjct: 365 IKVPPLAEQSSIVKALDSVELKVENTKVL----LAKYQSLKKALMQDLLTGRVRVK 416 >gi|227533322|ref|ZP_03963371.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus paracasei subsp. paracasei ATCC 25302] gi|227189041|gb|EEI69108.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus paracasei subsp. paracasei ATCC 25302] Length = 419 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 61/405 (15%), Positives = 132/405 (32%), Gaps = 24/405 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + ++ T + + + I +D + K D + + Sbjct: 20 WEERKLSSISERVTRKNKNNESTLPLTISAQDGLVDQNDFFNK--QVASRDVTGYFLVKN 77 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLS-IDVTQRI 137 G+ Y K G+ ST ++V +P + + L + + + Sbjct: 78 GEFAYNKSYSNGYPWGAIKRLDKYDMGVLSTLYIVFRPTKINSQFLVSYYDTTRWYREVS 137 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + EGA + + + + V ++KI + ++D IT R + LKE Sbjct: 138 KNAAEGARNHGLLNIAPTDFFNTLLVVPKIVDEQQKIGSFFKQLDDTITLHQRKLAKLKE 197 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 KQ + + + + +++ +G ++ A + N + ++ +ES Sbjct: 198 LKQGYLQKLFPRNGSKFPQLRFAGFADAWEQRKLSDIATLNARIGWQNLRTSEFLESGDY 257 Query: 258 SLSYGNIIQKLETRNMGLKPESYETY-----QIVDPGEIVFRFIDLQNDKR---SLRSAQ 309 L G + Y V+ G I+ L Sbjct: 258 MLITGTNFHDGTVDYSTVHYVEKNRYEQDTKIQVENGSILITKDGTLGKVALVQGLNMPA 317 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVL 368 + G+ P ID Y+ + + L K G + L + PVL Sbjct: 318 TLNAGVFN--VKIKDPETIDVDYVYQYLAAPFLMKYANAKSTGGTIKHLNQNILIDFPVL 375 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P +EQ + ++N +D + ++ + L+E + ++ Sbjct: 376 LPRKREQVKLAELLNG----LDNTITLHQRKLEKLQELKKGYLQK 416 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 74/192 (38%), Gaps = 12/192 (6%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNI-IQKLETRNMGLKPESYETYQIVDPGE 290 WE + ++ + RKN + L++S + + + + N + Y +V GE Sbjct: 20 WEERKLSSISERVTRKNKNNESTLPLTISAQDGLVDQNDFFNKQVASRDVTGYFLVKNGE 79 Query: 291 IVFRFIDLQNDK-RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 + +++ + G++++ Y+ +P I+S +L + + Sbjct: 80 FAYNKSYSNGYPWGAIKRLDKYDMGVLSTLYIVFRPTKINSQFLVSYYDTTRWYREVSKN 139 Query: 350 GS-GLRQ----SLKFEDVKRLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + G R ++ D ++VP I EQ I + ++D + ++ + L Sbjct: 140 AAEGARNHGLLNIAPTDFFNTLLVVPKIVDEQQKIGSF----FKQLDDTITLHQRKLAKL 195 Query: 404 KERRSSFIAAAV 415 KE + ++ Sbjct: 196 KELKQGYLQKLF 207 >gi|222823384|ref|YP_002574958.1| type I restriction-modification system, S subunit [Campylobacter lari RM2100] gi|222538606|gb|ACM63707.1| type I restriction-modification system, S subunit [Campylobacter lari RM2100] Length = 390 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 81/408 (19%), Positives = 144/408 (35%), Gaps = 36/408 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK+ I ++ + I +D SG P G S D IF + Sbjct: 4 WKISIIDNTCEILNNKRVP-------ISQKDRISG---IYPYYGASGIVDYIDKYIFDEE 53 Query: 85 QILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L G K G + A IA + +L+P + + +E Sbjct: 54 LVLIGEDGAKWGAFENSAFIASGKYWVNNHAHILKPNNEILINKFLVYFLNY--SNLEKY 111 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 GAT+ + + + I + +PPL EQ I + ID I + + L E Q Sbjct: 112 ITGATVKKLNQQKLKQIEILLPPLKEQERIVGILDESFANIDESIKILEQDLLNLDELMQ 171 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN--RKNTKLIESNILS 258 + + + + +P WE K + + K IE+ I Sbjct: 172 SALQKTFNPLKD--------NAKENYQLPQDWEWKSLGEICFITDGTHKTPNYIETGIPF 223 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDP-----GEIVFRFIDLQNDKRSLRSAQVMER 313 LS NI + + E +++ G+I+ I +++ + E Sbjct: 224 LSVKNISKGFFDLSDIKYISLEEHNKLIKRAKPEFGDILICRIGTLGK--AIKISLEFEF 281 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLR-QSLKFEDVKRLPVLVP 370 I S + I S YL + + SY + +G G L +++ P+ +P Sbjct: 282 SIFVSLGLLKPKVKIISDYLVYFLNSYFIEGWINNNKVGGGTHTAKLNLNILEKCPIALP 341 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +KEQ I + ++ + I L + + I L+E + S + A G+ Sbjct: 342 SLKEQEQIASYLDEFSLNIKDLKQNYQAQIKNLQELKKSLLDKAFKGK 389 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 70/201 (34%), Gaps = 9/201 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ W+ + + G I ++ ++++ G S + Sbjct: 190 QLPQDWEWKSLGEICFITDGTHKTPNYIETGIPFLSVKNISKGFFDLSDIKYISLEEHNK 249 Query: 77 TVSIFAK--GQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G IL ++G + I+ +F+ +L+PK + + L+ Sbjct: 250 LIKRAKPEFGDILICRIGTLGKAIKISLEFEFSIFVSLGLLKPKVKIISDYLVYFLNSYF 309 Query: 134 TQRIEAICE---GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + G + + + P+ +P L EQ I + ++ I L Sbjct: 310 IEGWINNNKVGGGTHTAKLNLNILEKCPIALPSLKEQEQIASYLDEFSLNIKDLKQNYQA 369 Query: 191 FIELLKEKKQALVSYIVTKGL 211 I+ L+E K++L+ L Sbjct: 370 QIKNLQELKKSLLDKAFKGKL 390 >gi|294782550|ref|ZP_06747876.1| type I restriction-modification enzyme, S subunit [Fusobacterium sp. 1_1_41FAA] gi|294481191|gb|EFG28966.1| type I restriction-modification enzyme, S subunit [Fusobacterium sp. 1_1_41FAA] Length = 386 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 71/397 (17%), Positives = 141/397 (35%), Gaps = 29/397 (7%) Query: 28 VPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + +G T K+ I +I + D + K+ + ++S+ I Sbjct: 8 KKLGEICDFISGGTPSKSKNEYWKNGNIPWIKISDFKEKYIKFSDEKITKIGLESSSAKI 67 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 KG ILY + + K I D + + + + K+ + + + I+ Sbjct: 68 LKKGTILYT-IFASVGKVAILDIEATTNQAVVGINLKEDNSIDKDFLYYFLCSIENNIKK 126 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G ++ + + NI +PI P++ Q I + + +D L +++ L K Sbjct: 127 QARGVAQNNINISILKNINIPILPMSFQKNIVKTLNKLENILDNLKQKKLLINFLNKSLF 186 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + I K + + P++ F L+T N K K+ S + + Sbjct: 187 TTMFGDIEKKSEYHKLSNICDVRDGTHDSPEYITTDKRFPLITSKNLKGDKIDFSEVNFI 246 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 S + VD G+I+ I + ++ + I A Sbjct: 247 SEADF-------------NKINVRSKVDIGDILMPMIGTIGNPIIVK--IDKKFSIKNLA 291 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + K I +T+L +L+ S + G ++ L D++ + +PPI+ Q Sbjct: 292 LIKFKNSQIINTFLKFLLLSDYFNLIISQKNKGGTQKFLSLSDIRNFLIPIPPIELQNKF 351 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I +I+ L +IE+SI + S I+ Sbjct: 352 AERIE----KIEKLKFEIEKSIETAQNLYDSLISKYF 384 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 53/189 (28%), Gaps = 9/189 (4%) Query: 26 KVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSRQ--SDTSTV 78 + + + G + K I ++++ + + S + + Sbjct: 198 EYHKLSNICDVRDGTHDSPEYITTDKRFPLITSKNLKGDKIDFSEVNFISEADFNKINVR 257 Query: 79 SIFAKGQILYGKLGPYLRKAI--IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 S G IL +G I I I + + + ++ L+ LLS Sbjct: 258 SKVDIGDILMPMIGTIGNPIIVKIDKKFSIKNLALIKFKNSQIINTFLKFLLLSDYFNLI 317 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I +G T I N +PIPP+ Q E+I + L Sbjct: 318 ISQKNKGGTQKFLSLSDIRNFLIPIPPIELQNKFAERIEKIEKLKFEIEKSIETAQNLYD 377 Query: 197 EKKQALVSY 205 Sbjct: 378 SLISKYFDN 386 >gi|257051191|ref|YP_003129024.1| restriction modification system DNA specificity domain protein [Halorhabdus utahensis DSM 12940] gi|256689954|gb|ACV10291.1| restriction modification system DNA specificity domain protein [Halorhabdus utahensis DSM 12940] Length = 442 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 61/413 (14%), Positives = 137/413 (33%), Gaps = 39/413 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W +V + L G S S +P G++ Q DT + + Sbjct: 38 ESWNLVRLGEILTLEYGDNLPSD------------SRESGTVPVFGSNGQVDTHSEAAVE 85 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I+ G+ G T + + + +LL ++E + Sbjct: 86 KPGIILGRKGSIGEIDFSDRPFWPIDTTYYITSEETSQNLRFLYYLLQN---IQLERLNA 142 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + + +PP EQ I + I + + + +Q + Sbjct: 143 ASAIPGLNRNDAYGLKALMPPAEEQRKIASVLYTVDQAIQKSEEIIEQTERVRRGTEQDV 202 Query: 203 VSYIVTK----GLNPDVKMKDSGIEWVGLVPDHWEVKPF-----FALVTELNRKNTKLIE 253 +S V + + DV + S WVG +P W+VK + + V + + + + Sbjct: 203 LSRGVREDGTLRPDDDVAYRSS---WVGDIPCDWDVKQYSKLISDSSVGIVVKPSQYYDD 259 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETY-----QIVDPGEIVFRFIDLQNDKRSLRSA 308 + + I + + + S E+ + G+++ + S Sbjct: 260 DGTVPILRSKDISRDGIVDGDFEYMSEESNAENENSRLQEGDVITVRSG--DPGLSCVVD 317 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPV 367 + ++ +D Y A + S+ K +G ++ +++L V Sbjct: 318 GEFDGANCADLLISTPGPKLDPHYAAMWINSFAGRKQIDRFQAGLAQKHFNLGALRKLRV 377 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 VP + EQ I ++ + ++ E Q L+ + + ++G++ Sbjct: 378 GVPSLDEQKRIVEKVSSISESLESQRESKRQ----LQRLKQGLMQDLLSGKVR 426 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 78/216 (36%), Gaps = 21/216 (9%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTK-------LNTGRTSESGKDIIYIGLEDVE---- 57 Y+ S W+G IP W V + + + + + + +D+ Sbjct: 220 AYRSS---WVGDIPCDWDVKQYSKLISDSSVGIVVKPSQYYDDDGTVPILRSKDISRDGI 276 Query: 58 -SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVL 114 G +Y+ ++ N+ ++ +G ++ + G ++ C+ + Sbjct: 277 VDGDFEYMSEESNAENENSR----LQEGDVITVRSGDPGLSCVVDGEFDGANCADLLIST 332 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + P W+ S ++I+ G H + + + + +P L EQ I EK+ Sbjct: 333 PGPKLDPHYAAMWINSFAGRKQIDRFQAGLAQKHFNLGALRKLRVGVPSLDEQKRIVEKV 392 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 + + +++ + + L + Q L+S V Sbjct: 393 SSISESLESQRESKRQLQRLKQGLMQDLLSGKVRTH 428 >gi|332520738|ref|ZP_08397200.1| restriction modification system DNA specificity domain [Lacinutrix algicola 5H-3-7-4] gi|332044091|gb|EGI80286.1| restriction modification system DNA specificity domain [Lacinutrix algicola 5H-3-7-4] Length = 457 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 60/416 (14%), Positives = 133/416 (31%), Gaps = 19/416 (4%) Query: 20 AIPKHWKVVPIKR-FTKLNTGRTSESGKD-----IIYIGLEDVESGTGKYLPKDGNSRQS 73 +PK+W + ++ G + + ++ + +E + + T Sbjct: 4 KLPKNWVETDLDTVILRMTNGSSLKQEEEPFQGSLPISRIETIWNETIDLDRVKYVDASE 63 Query: 74 DTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGIC----STQFLVLQPKDVLPELLQGW 127 D KG +L+ + +L K + + D + L P+ L Sbjct: 64 DDIEKYGLQKGDVLFSHINSDKHLGKTAVFNLDQTIIHGINLLLLRAMPQFDGDLLNYIL 123 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + IE S + K + + +P+PPLAEQ I K+ +D+L T Sbjct: 124 RHYRFSGKFIEVAQRSVNQSSINQKKLKSFLVPLPPLAEQQRIVAKLDELFGHLDSLKTR 183 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 E+LK +QA+++ VT L + + + + Sbjct: 184 LNHIPEILKNFRQAVLNQAVTGKLTEEW--RVGKALEEWEEVELETIAKVVDPQPSHRTP 241 Query: 248 NTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 +S+ N ++ N + + + G+ F I Sbjct: 242 PIHEDGIPYVSIKDVNKKGEVILENARPVSKVVLAEHIKRYDLQEGDFGFGKIGTLGKPF 301 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL-CKVFYAMGSGLRQSLKFEDV 362 L + + + + + +L + + S + K+ S + + + Sbjct: 302 LLPMFPERKYTLSANIILIQPRSKGNPKFLYYYLNSSIIEQKLREGTNSTSQPAFGIKKA 361 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ P P +EQ +I + + D + + + + + +A A G+ Sbjct: 362 RKFPTPNPSPEEQTEIVKRVEHLFDKADAIEAQYQSLKTKIDSLPQAILAKAFKGE 417 >gi|89891079|ref|ZP_01202587.1| putative type I site-speicific deoxyribonuclease specificity subunit [Flavobacteria bacterium BBFL7] gi|89516723|gb|EAS19382.1| putative type I site-speicific deoxyribonuclease specificity subunit [Flavobacteria bacterium BBFL7] Length = 468 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 57/420 (13%), Positives = 137/420 (32%), Gaps = 35/420 (8%) Query: 20 AIPKHWKVVPIKRFTK----LNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDG-N 69 +PK W I G + + ++ I L D+ G + + N Sbjct: 4 ELPKGWVETNISSLVDDTGLFKDGDWVESKDQDPNGNVRLIQLADIGLGNFRDKSQRFLN 63 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQ 125 ++ + + IL ++ + ++ + G + + K + + L Sbjct: 64 QETAERLNCNFLEQNDILVARMPDPIGRSCLFPLKGENVTVVDVAIIRPSKKHINYKWLS 123 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 W+ S + I + G+T + + IP P+PP AEQ I K+ A + + Sbjct: 124 HWINSPVFHKNISELASGSTRKRISRRNLDKIPFPLPPRAEQDRIVAKVDALMAQHAAIQ 183 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 R +LLK+ +Q +++ + + Sbjct: 184 QAMERIPQLLKDFRQQVLNQSFERNIERVAL--------------EDCCHKIQDGAHHSP 229 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP------GEIVFRFIDLQ 299 + + + E N+ I+ + L + + + + P G+++ Sbjct: 230 KYVSPIREKNMFPYVTSKNIRNDYMKLDTLTYVNEDFHNTIYPRCSPEFGDVLLTKDGAS 289 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLK 358 +L + + + + YL + ++S F M + + Sbjct: 290 TGNVTLNEFDEPISLLSSVCLIKTDKKKLIPAYLKYFIQSSIGFSEFTGKMTGTAIKRVV 349 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +K+ + +P + EQ +I + + + ++ EQ + + + + A G+ Sbjct: 350 LKKIKKATIPLPSVPEQQEIVRRVESLFEKATAIEQRYEQLKLQIDSLPQAILHKAFKGE 409 >gi|158421619|ref|YP_001527846.1| restriction modification system DNA specificity subunit [Deinococcus geothermalis DSM 11300] gi|158342862|gb|ABW35148.1| restriction modification system DNA specificity domain [Deinococcus geothermalis DSM 11300] Length = 426 Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 65/421 (15%), Positives = 131/421 (31%), Gaps = 38/421 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +W P+ ++ G+T + ++ +V D S Sbjct: 5 NWNWRPLGELFEIGAGKTMSAAARAGADKVPFLRTSNVLWDEIDLTQVDEMSISPTELVD 64 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD---VLPELLQGWLLSIDVTQ 135 G +L + G R A+ + S Q + + + + + L TQ Sbjct: 65 KSLKAGDLLVCEGGEIGRAAVWDGRVPVMSFQNHLHRLRRKQDDVDAHFYVYFLQSAFTQ 124 Query: 136 --RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 E T+ + + + +P PP EQ + + + ++ I + Sbjct: 125 LGIFEGAGNKTTIPNLSRNRLAALDVPHPPKPEQQSVAQVL----AKVREAIAVHDQATS 180 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 E K A+++ + T+GL + + +GLVP+ W L + E Sbjct: 181 TALELKHAVMNDLFTRGLRGEPQK----ETEIGLVPESWAEVSIADLGEIVTGTTPPTRE 236 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYET--------YQIVDPGEIVFRFIDLQNDKRSL 305 I + + + + + + G I K Sbjct: 237 RAYYDDGNIPFISPGDIEHGTPIASTQKCITDSGLAVSRALPAGTTCVVCIGSTIGKVGR 296 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 +A V G D YL+ L+ +Y V A L ++L Sbjct: 297 TTAAAS--ATNQQINAIVPGVGYDPNYLSHLL-TYQSNIVRNAASPSPVPILSKGAFEKL 353 Query: 366 PVLV---PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + P EQ +I +++ +ID ++ +++E S + +TG+I + Sbjct: 354 VLFTSTNP--DEQVEIATILDAVDRKID----LHQKKRKVVEELFESLLHKLMTGEIAVS 407 Query: 423 G 423 Sbjct: 408 D 408 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 62/202 (30%), Gaps = 12/202 (5%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKY 63 K++ IG +P+ W V I ++ TG T + +I +I D+E GT Sbjct: 204 KETE---IGLVPESWAEVSIADLGEIVTGTTPPTRERAYYDDGNIPFISPGDIEHGT-PI 259 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 S + G +G + K + Q + V + Sbjct: 260 ASTQKCITDSGLAVSRALPAGTTCVVCIGSTIGKVGRTTAAASATNQQINAIVPGVGYDP 319 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRID 182 L + + + + + + + QV I + A +ID Sbjct: 320 NYLSHLLTYQSNIVRNAASPSPVPILSKGAFEKLVLFTSTNPDEQVEIATILDAVDRKID 379 Query: 183 TLITERIRFIELLKEKKQALVS 204 +R EL + L++ Sbjct: 380 LHQKKRKVVEELFESLLHKLMT 401 >gi|254373735|ref|ZP_04989218.1| type I restriction-modification system [Francisella novicida GA99-3548] gi|151571456|gb|EDN37110.1| type I restriction-modification system [Francisella novicida GA99-3548] Length = 394 Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 55/416 (13%), Positives = 135/416 (32%), Gaps = 40/416 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK WK + + T + D I I + + G ++ + Sbjct: 5 ELPKGWKAIELGEITSYVNRGVAPKYTDEHGITVINQKCIREGNINLELARVHNPDKKYT 64 Query: 77 TVSIFAKGQILYGKLG-PYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G IL G + I + I T +++ + Sbjct: 65 AEKQLHLGDILINSTGVGTAGRVGIFTDSINAIVDTHVSIVRLNKEYAYPKFVYYNLRFR 124 Query: 134 TQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +E EG+T I ++ + +P L EQ I + + + + I + Sbjct: 125 EKELEETAEGSTGQIELKRDAIKSLNILLPQLTEQKAIADVLSSLDDK----IDLLHKQN 180 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + L++ Q L IE + + ++ K+ + Sbjct: 181 QTLEDMAQTLFREWF--------------IEKADEGWEEMPLSEVCSVTAGYAFKSKDFV 226 Query: 253 ESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + N I + + + + E+ + +IV K L S Sbjct: 227 DIGVPVVKIKNISNGHIDYNDLQFIDISESDVESKYRLYDNDIVMAMTGATIGKIGLVST 286 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 + ++ ++ + L +++ S DL + +G + ++ + + V Sbjct: 287 FEHDYLLLNQRVAVLRSNHQ--ALLWFMLNSLDLENEILNLSNGAVQANISSTSIGQ--V 342 Query: 368 LVPPIKEQ--FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P + Q N ++ + +++ ++ I L++ R + + ++GQ+ + Sbjct: 343 PIPGMSNQMMQKFNNAVHPMFEK----IQQNKKQIKSLEQTRDTLLPKLMSGQVRV 394 >gi|297538977|ref|YP_003674746.1| restriction modification system DNA specificity domain-containing protein [Methylotenera sp. 301] gi|297258324|gb|ADI30169.1| restriction modification system DNA specificity domain protein [Methylotenera sp. 301] Length = 401 Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 45/411 (10%), Positives = 119/411 (28%), Gaps = 30/411 (7%) Query: 24 HWKVVPIKRFTKL-NTGRTSE--SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVS 79 W+ + L N G + + I + + + Y + + + S Sbjct: 4 GWQTEKLGEVCALLNRGISPKYIESSGICVLNQKCIRDHRVSYDQARRHDLAEKSVSENR 63 Query: 80 IFAKGQILYGKLGP-YLRKAII----ADFDGICSTQFLVLQPK---DVLPELLQGWLLSI 131 G +L G L + + + +++PK +L Sbjct: 64 FIQLGDVLVNSTGTGTLGRVAQVRETPEEPTTVDSHVTIVRPKEGKFYQDFFGYMLILIE 123 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + C G T + +Q I + I + Sbjct: 124 EAIKESGEGCGGQTELARSVLAEKFSVSYPISIEQQQRIVAILDQAFEGIAKARANAEQN 183 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ + ++ + + T+ + + + + K+ + Sbjct: 184 LQNARALFESHLQSVFTQHGEGWMVTTVGAV--------------CDKVEYGTSSKSKEQ 229 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + +L + + + + ++ ++ +++F + + Sbjct: 230 GKIPVLRMGNIQNRRFDWDKLVYTDDDNEIEKYLLKHNDVLFNRTNSPELVGKTAIYKSE 289 Query: 312 ERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPV 367 I + + ++ YL + + S A+ S + ++ + +K P+ Sbjct: 290 SPAIFAGYLIRIHRKEDLINADYLNYFLNSQIAMDYGKTVAISSVNQANINGKKLKGYPI 349 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 VP + EQ I ++ L ++ I LL E + S + A G+ Sbjct: 350 PVPSLSEQESIVMKMDALKIETQRLEALYQRKIKLLDELKKSLLQQAFAGE 400 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 65/199 (32%), Gaps = 11/199 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + W V + TS K+ I + + ++++ + ++ Sbjct: 204 EGWMVTTVGAVCDKVEYGTSSKSKEQGKIPVLRMGNIQNRRFDWDKLVYTDDDNEIEKY- 262 Query: 80 IFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVL----PELLQGWLLSID 132 + +L+ + + AI +L+ + L I Sbjct: 263 LLKHNDVLFNRTNSPELVGKTAIYKSESPAIFAGYLIRIHRKEDLINADYLNYFLNSQIA 322 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + ++ + K + P+P+P L+EQ I K+ A + L R I Sbjct: 323 MDYGKTVAISSVNQANINGKKLKGYPIPVPSLSEQESIVMKMDALKIETQRLEALYQRKI 382 Query: 193 ELLKEKKQALVSYIVTKGL 211 +LL E K++L+ L Sbjct: 383 KLLDELKKSLLQQAFAGEL 401 >gi|281421790|ref|ZP_06252789.1| type I restriction-modification enzyme S subunit [Prevotella copri DSM 18205] gi|281404148|gb|EFB34828.1| type I restriction-modification enzyme S subunit [Prevotella copri DSM 18205] Length = 450 Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 73/388 (18%), Positives = 142/388 (36%), Gaps = 15/388 (3%) Query: 20 AIPKHWKVVPIKRFTKL---NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W + T + + + LED+E T K + + + Sbjct: 67 ELPKGWVWTTVGEITNYGDSVNVQVEDIDNSDWVLELEDIEKDTAKIIQHLNKNERKING 126 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQ 135 T F KGQILY KL YL K ++A DG C+T+ + +L ++ S+ Sbjct: 127 TRHKFQKGQILYSKLRTYLNKVLVAPNDGFCTTEIMAFGSYGILSNNYICYVLRSLYFLD 186 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 G M N +P+PPLAEQ I +I ID + + + Sbjct: 187 YTLQCGYGVKMPRLSTTDACNGLIPLPPLAEQERIVNEIQRLFSIIDIVENGKDGLQTAI 246 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 ++ K ++ + + L P + E + + E+ +L + + N Sbjct: 247 QQAKNKILDHAIHGKLVPQDPNDEPASELLKRINPKAEITCDNPQYGKLPKGWCETTLGN 306 Query: 256 ILSLSYGNIIQKLETRNMGLKP------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + G+ I+ + R Y VD I+ + + + Sbjct: 307 TIVIKSGDAIKVRDNRIGKYPIYGGNGITGYNESYNVDGINIIIGRVGFYCGSVHYVNNK 366 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + + + +L +L++ YDL + S + + + V + V++ Sbjct: 367 IWVTD--NAFVTKIMGNVYTPKFLYYLLQQYDLQQY---SNSTAQPVISGKTVYPINVML 421 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIE 397 PP+ EQ+ I I +++D + ++ Sbjct: 422 PPLSEQYRIVAKIEELFSQLDKIESSLQ 449 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 48/145 (33%), Gaps = 14/145 (9%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K+ + + T G+I++ + +K + + T Sbjct: 112 KIIQHLNKNERKINGTRHKFQKGQILYSKLRTYLNKVLVAPN---DGFCTTEIMAFGSYG 168 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + Y+ +++RS G G+ L D + +PP+ EQ I N I Sbjct: 169 ILSNNYICYVLRSLYFLDYTLQCGYGVKMPRLSTTDACNGLIPLPPLAEQERIVNEIQRL 228 Query: 386 TARID----------VLVEKIEQSI 400 + ID +++ + I Sbjct: 229 FSIIDIVENGKDGLQTAIQQAKNKI 253 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 49/165 (29%), Gaps = 14/165 (8%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G +PK W + + +G + V P G + + + Sbjct: 293 GKLPKGWCETTLGNTIVIKSGDAIK------------VRDNRIGKYPIYGGNGITGYNES 340 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I+ G++G Y + + V + + + + ++ Sbjct: 341 YNVDGINIIIGRVGFYCGSVHYVNNKIWVTDNAFVTKIMGNVYTPKFLYY--LLQQYDLQ 398 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 K + I + +PPL+EQ I KI ++D Sbjct: 399 QYSNSTAQPVISGKTVYPINVMLPPLSEQYRIVAKIEELFSQLDK 443 >gi|190890488|ref|YP_001977030.1| type I restriction-modification system protein, specificity subunit [Rhizobium etli CIAT 652] gi|190695767|gb|ACE89852.1| probable type I restriction-modification system protein, specificity subunit [Rhizobium etli CIAT 652] Length = 424 Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 60/419 (14%), Positives = 133/419 (31%), Gaps = 38/419 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSD- 74 P+ W + + +L +G T + I ++ L D ++ GK L + + Sbjct: 24 PEGWALERLCDIARLESGHTPSRNRPDYWDGGIPWLSLHDSKTIEGKVLQNTKMTISARG 83 Query: 75 --TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S+ + +G + + + A++ S F L L Sbjct: 84 LANSSARLLPEGTVALSRTATIGKVALLGREMA-TSQDFACYICGPRLLNK-YLAHLFRG 141 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + E + G+T + N+ + +PP+ EQ I + + I R I Sbjct: 142 MELEWERLMAGSTHNTIYMPTFENMQILVPPMEEQEAIADALSDADAL----IEGLERLI 197 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKN 248 KQ + + L ++ EW +G Sbjct: 198 AKKWLIKQGTMQDL----LTAKRRLPGYSAEWTMAKLGDFLSFKNGLNKAKAFFGHGTPI 253 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 ++ + G I + + E+ ++ + G+++F ++ L + Sbjct: 254 INYMD-----VFRGGAINEGSIDGLVEVTEAEQSAYGIRNGDVLFTRTSETPEEIGLAAV 308 Query: 309 QVM--ERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVK 363 + + + + +P + + RS + + + + R + Sbjct: 309 ADGVLDGTVFSGFVLRGRPKSQALTIAFSKYCFRSGAVRRQIISRATYTTRALTNGRQLS 368 Query: 364 RLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + VP EQ I V+N A I L + + ++ + + +TG+I L Sbjct: 369 AVDISVPRDADEQNAIAEVLNDMDAEIQALETR----LDKARQVKEGMMQNLLTGRIRL 423 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 66/210 (31%), Gaps = 15/210 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ------KLETRNM 273 +E G + +R + I LS + + + Sbjct: 20 PDVEPEGWALERLCDIARLESGHTPSRNRPDYWDGGIPWLSLHDSKTIEGKVLQNTKMTI 79 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + + +++ G + L E + + + YL Sbjct: 80 SARGLANSSARLLPEGTVALSRTATIGKVALLGR----EMATSQDFACYICGPRLLNKYL 135 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 A L R +L M ++ + + +LVPP++EQ I + ++ D L+ Sbjct: 136 AHLFRGMELEWE-RLMAGSTHNTIYMPTFENMQILVPPMEEQEAIADALSDA----DALI 190 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 E +E+ I + + +T + L G Sbjct: 191 EGLERLIAKKWLIKQGTMQDLLTAKRRLPG 220 >gi|254164265|ref|YP_003047375.1| specificity determinant for hsdM and hsdR [Escherichia coli B str. REL606] gi|253976168|gb|ACT41839.1| specificity determinant for hsdM and hsdR [Escherichia coli B str. REL606] Length = 474 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 63/416 (15%), Positives = 141/416 (33%), Gaps = 29/416 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + + + G +S + + I + DV G Sbjct: 24 SWLRISMDSVANITNGFAFKSSEFNNRKDGVPLIRIRDVLKGN------TSTYYSGQIPE 77 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G G + I + + + ++ ++ + I Sbjct: 78 GYWVYPEDLIVGMDGDF-NATIWCSEPALLNQRVCKIEVQEDKYNKRFFYHALPGYLSAI 136 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 A T+ H + + + +P+PPLAEQ +I EK+ ++D+ + ++LK Sbjct: 137 NANTSSVTVKHLSSRTLQDTLLPLPPLAEQKIIAEKLDTLLAQVDSTKARLEQIPQILKR 196 Query: 198 KKQALVSYIVTKGLNP----DVKMKDSGIEWVGLVPDHW----------EVKPFFALVTE 243 +QA+++ VT L + K + L+P+ W +P V + Sbjct: 197 FRQAVLAAAVTGRLTKEDKDFIIKKVELDNYKILIPEDWSETILNNIINTQRPLCYGVVQ 256 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 IE + + R + + + V +I+ + + Sbjct: 257 PGDDIKDGIELIRVCDINDGEVDLNHLRKISKEIDLQYKRSKVRKNDILVTIVGAIG-RI 315 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDV 362 + + A ++ + I +L + S + + + R++L +D+ Sbjct: 316 GIVREDINVNIARAVARISPEYKIIVPMFLHIWLSSPVMQTWLVQSSKEVARKTLNLKDL 375 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 K V +P I+EQ +I + A D + +++ ++ + S +A A G+ Sbjct: 376 KNAFVPLPSIEEQHEIVRRVEQLFAYADSIEKQVNNALARVNNLTQSILAKAFRGE 431 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 67/207 (32%), Gaps = 11/207 (5%) Query: 21 IPKHWKVVPIKRFTKLNT----GRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP+ W + G I I + D+ G S++ Sbjct: 231 IPEDWSETILNNIINTQRPLCYGVVQPGDDIKDGIELIRVCDINDGEVDLNHLRKISKEI 290 Query: 74 DTS-TVSIFAKGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVL--PELLQGWLL 129 D S K IL +G R I+ + + + + P+ + P L WL Sbjct: 291 DLQYKRSKVRKNDILVTIVGAIGRIGIVREDINVNIARAVARISPEYKIIVPMFLHIWLS 350 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + + + + K + N +P+P + EQ I ++ D++ + Sbjct: 351 SPVMQTWLVQSSKEVARKTLNLKDLKNAFVPLPSIEEQHEIVRRVEQLFAYADSIEKQVN 410 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVK 216 + + Q++++ L + Sbjct: 411 NALARVNNLTQSILAKAFRGELTAQWR 437 >gi|307248454|ref|ZP_07530474.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306855022|gb|EFM87205.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 2 str. S1536] Length = 508 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 58/441 (13%), Positives = 129/441 (29%), Gaps = 71/441 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPK---DGN 69 IPK W V + ++ G T ++ +D I +I D++ +GKY+ K + Sbjct: 70 EIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYISKGNRNIT 129 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 +S+ + +K I+Y P I + + + F + + + + Sbjct: 130 ENGLRSSSTRLLSKNSIVYSSRAPI-GYIAITETELCTNQGFKSIDLYNKE-IVDYLYYS 187 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I T I++ G T GN +P+PPL EQ I KI I+ + Sbjct: 188 LIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQYAEKEE 247 Query: 190 RFIELLKEKK----QALVSYIVTKGLNPDVKM---------------------------- 217 + L ++ ++++ + L Sbjct: 248 KLTALHQQFPEQLKKSILQAAIQGKLTKQDPNDEPALVLIERIKAEKLRLIAEKKLKKPK 307 Query: 218 --------------------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN-- 255 + E +P++W + + Sbjct: 308 VVSEIILRDNLPYEIINGEERCIADEVPFEIPENWCWVRLGEIGETNIGLTYAPNDVVLE 367 Query: 256 -ILSLSYGNIIQKLETRNMGL--KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + L GNI + + + + +++ + + + + Sbjct: 368 GTIVLRSGNIQNGKIDVSSDVVRVNLNIPENKKCYKNDLLICARNGSKNLVGKAAIVDKD 427 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + Y+ + + S F + + + ++ + +PP+ Sbjct: 428 GYSFGAFMAIFRSPFY--QYIYYYLSSPLFRNDFDGINTTTINQITQNNLNNRLIPLPPL 485 Query: 373 KEQFDITNVINVETARIDVLV 393 EQ I I + + L Sbjct: 486 NEQKRIVEKIEKLFSTLQNLE 506 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 62/211 (29%), Gaps = 13/211 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S ++ +P W L + K E + + I + + + K S Sbjct: 63 SQQDFPFEIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYIS 122 Query: 280 YETYQIVDPGE-------IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 I + G + I + A + ++ + + Sbjct: 123 KGNRNITENGLRSSSTRLLSKNSIVYSSRAPIGYIAITETELCTNQGFKSIDLYNKEIVD 182 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + Y ++ + + + +PP+ EQ I I I+ Sbjct: 183 YLYYSLIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQY 242 Query: 393 VEKIEQSIVLL-----KERRSSFIAAAVTGQ 418 + E+ + L ++ + S + AA+ G+ Sbjct: 243 -AEKEEKLTALHQQFPEQLKKSILQAAIQGK 272 >gi|110800744|ref|YP_696988.1| type I restriction-modification enzyme, S subunit [Clostridium perfringens ATCC 13124] gi|110675391|gb|ABG84378.1| type I restriction-modification enzyme, S subunit [Clostridium perfringens ATCC 13124] Length = 417 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 76/426 (17%), Positives = 158/426 (37%), Gaps = 31/426 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYL 64 YK + +G IP W+V I K+N+ + + + YI +E V +G + Sbjct: 7 EGYKMTE---LGEIPNEWEVCRIDDLCKVNSKSLNSKTEPNLVVNYIDIESVSTGKINNI 63 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLP 121 + S Q+ + + K ++ + PYL+ + + +CST F VL+ + + Sbjct: 64 KQMIFS-QAPSRARRVVKKNDVIMSTVRPYLKAFVKVKSSLNNLVCSTGFAVLEVNEGVN 122 Query: 122 ELL-QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 +LS ++I+ G+ + + + +P + EQ I E + Sbjct: 123 SEFVYQSILSNYFIEQIKNKMVGSNYPAVNSDDVKESKLILPSIQEQEKIAEIL----ST 178 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF--- 237 +D I + I+ +E K+ L+ ++TKG+ K +G +P W++ Sbjct: 179 VDEQIENTEKLIQKNQELKKGLMQQLLTKGIGHTEFKKT----ELGYIPKEWKIMKLGEV 234 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 ++ I I + N L + + Y + +I Sbjct: 235 CDFKQGFQIPRSEQINEEKDGYIRYLYITDFFSNNNKLFIKGSDKYYYIKSDDITIANTG 294 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQS 356 K + ++ + + + + +L + S K + + Sbjct: 295 NTCGKAFKGAEGILSNNMF---KIFNNKEVLLNDFLWQYLNSNYYWKELNKYFNTAGQPH 351 Query: 357 LKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +++ L + +P + EQ +I ++ + ID +EK E LKE + + + Sbjct: 352 VGHKNMANLMIAIPESLNEQSEIALIL----SSIDKRIEKYENKKEKLKELKKGLMQQLL 407 Query: 416 TGQIDL 421 TG I L Sbjct: 408 TGYIRL 413 >gi|117923448|ref|YP_864065.1| restriction modification system DNA specificity subunit [Magnetococcus sp. MC-1] gi|117607204|gb|ABK42659.1| restriction modification system DNA specificity domain [Magnetococcus sp. MC-1] Length = 427 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 51/425 (12%), Positives = 128/425 (30%), Gaps = 37/425 (8%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGT 60 +P+++D+G W+ I K G+++ + ++ + Sbjct: 17 RFPEFRDAG---------EWEKTTIGEIGKFYYGKSAPKWSLEEDAPTPCVRYGELYTKF 67 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPK 117 G + + + D + G+IL ++G K G + ++ + Sbjct: 68 GPIITETYSRTNIDPGKLRFSKGGEILVPRVGEKTEDFGKCCCYLPLGDIAIGEMISVFE 127 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L + Q + EG + + + + + + PPL EQ I + + + Sbjct: 128 TAQNPLFYTYYFRRLYRQF-SKVVEGQNVKNLYYVELEPLEIYRPPLTEQQKIADCLSSL 186 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 I + I+ LK K+ L+ + + +++ + G +H Sbjct: 187 DAL----IAAQADKIDALKTHKKGLIQQLFPREGKTVPRLRFPEFQEAGEWTEHRLENMA 242 Query: 238 FA-LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE------SYETYQIVDPGE 290 N+K I +S + + + K E + + + G Sbjct: 243 KRGSGHTPNKKFPSYYNGGIKWVSLADSNKLDDGYIYDTKVEISDDGINNSSAVLYPAGT 302 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ L S + + S + + + Sbjct: 303 VILSRDAGVGKSAVLYSPM----AVSQHFMAWQCYENMLSNWFFYYLLQKLKATFESIAV 358 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +++ K + + P + EQ I + + +D ++ + + LK ++ Sbjct: 359 GNAIKTIGAAYFKEMTITAPSLPEQQKIADCLVS----LDGMIAAHTEKLDSLKTHKNGL 414 Query: 411 IAAAV 415 + Sbjct: 415 MQQLF 419 >gi|198282371|ref|YP_002218692.1| restriction modification system DNA specificity protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198246892|gb|ACH82485.1| restriction modification system DNA specificity domain [Acidithiobacillus ferrooxidans ATCC 53993] Length = 395 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 63/409 (15%), Positives = 136/409 (33%), Gaps = 32/409 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNSRQSDT 75 + W+V + + + G + + I +I DV G+ + Sbjct: 3 EGWEVKLLGEVSAIGAGNPAPQDRHYFEQGTIPFIRTSDVGRIHIGEIFGAADLVNELAA 62 Query: 76 STVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSI 131 +++ G IL+ K G ++ +I + + S+ + +P +L + L +LL+I Sbjct: 63 RKLAMLPVGTILFPKSGASTFINHRVIMGIEAVASSHLATIKAKPHTLLDKFLFYYLLTI 122 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 D + + + I I P+PPL EQ I + I T + Sbjct: 123 D----AKTLVADSNYPSLRISDIATISTPLPPLPEQRRIVAILDEAFEGIATAKANAEKN 178 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ E ++ ++ + ++ G WV ++ R + KL Sbjct: 179 LQNAHEIFESYLNAVFSQR----------GEGWVDRRLGDVAMEFGRGKSKHRPRNDPKL 228 Query: 252 IESNILSLSYGNIIQK---LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 N + G++ + + + ++ G + + L Sbjct: 229 YGGNFPFIQTGDVRNSSHLITSYDQTYNDAGLAQSKLWPKGTLCITIAANIAETGILDFD 288 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 II + + Y+ +L+ S+ F GS + ++ + Sbjct: 289 ACFPDSIIG---LVANEKISTNKYIEYLLTSFKSRLQFLGKGS-AQDNINLATFESQYFP 344 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 PP+ Q +I ++ + L +Q + L E + S + A G Sbjct: 345 FPPLSNQKEIVSIFDDLHEETQHLKFIYQQKLAALDELKQSLLHQAFNG 393 >gi|315618354|gb|EFU98942.1| type I restriction modification DNA specificity domain protein [Escherichia coli 3431] Length = 384 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 59/370 (15%), Positives = 113/370 (30%), Gaps = 25/370 (6%) Query: 72 QSDTSTVSIFAKGQILYGKLGPY------LRKAIIADFDGICSTQFLVLQPKDVLP-ELL 124 + S + F KG +L K+ P A + G ST+F VL+ + + Sbjct: 19 KEVKSGFTYFEKGDVLLAKITPCFENGKGCHTADLPTNVGFGSTEFHVLRENEDSDSRFI 78 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIG--NIPMPIPPLAEQVLIREKIIAETVRID 182 W +E+ G+ + P L EQ I + + I Sbjct: 79 YFWTTDKKFRASLESEMVGSAGHRRVPLVAIEKYLIPCPPNLQEQSAIADSLSDINNFIL 138 Query: 183 TLITERIRFIELLKEKKQALVS---YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 L ++ + Q L++ + L D K +G +P+ W V Sbjct: 139 ALEKLIVKKQAIKTATMQRLLTGKTRLPQFALRKDGSAKGYKKSELGEIPEDWVVTSIGQ 198 Query: 240 LVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEI 291 +S G + K + + + V + Sbjct: 199 FTDCCAGGTPSTKISAYWGGTHPWMSSGELHLKQVYAVADYITDEGLVNSSTKYVPKNSV 258 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + Q R + +E S + +L + + S + G Sbjct: 259 LVGLAG-QGKTRGTVAINRIELCTNQSIAAIFPSKHHSTEFLFYNLDSRYEELRSLSTGD 317 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 G R L +++L + PP +EQ I +++ I L +Q + ++ + + Sbjct: 318 GGRGGLNLTIIRKLHLAFPPKEEQTAIATILSDMDKEIQTL----QQRLDKTRQLKQGMM 373 Query: 412 AAAVTGQIDL 421 +TG+ L Sbjct: 374 QELLTGKTRL 383 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 59/203 (29%), Gaps = 11/203 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY 63 YK S +G IP+ W V I +FT G T + G ++ ++ Sbjct: 179 YKKSE---LGEIPEDWVVTSIGQFTDCCAGGTPSTKISAYWGGTHPWMSSGELHLKQVYA 235 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLP 121 + S+ K +L G G I + + + P Sbjct: 236 VADYITDEGLVNSSTKYVPKNSVLVGLAGQGKTRGTVAINRIELCTNQSIAAIFPSKHHS 295 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + L + + I + + PP EQ I + I Sbjct: 296 TEFLFYNLDSRYEELRSLSTGDGGRGGLNLTIIRKLHLAFPPKEEQTAIATILSDMDKEI 355 Query: 182 DTLITERIRFIELLKEKKQALVS 204 TL + +L + Q L++ Sbjct: 356 QTLQQRLDKTRQLKQGMMQELLT 378 >gi|258593064|emb|CBE69375.1| putative Restriction modification system DNA specificity domain [NC10 bacterium 'Dutch sediment'] Length = 450 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 79/419 (18%), Positives = 152/419 (36%), Gaps = 35/419 (8%) Query: 21 IPKHWKVVPIKRFT--KLNTGRTSESGKDIIYIGLEDVESGTGKYL-PKDGNSRQSDTST 77 IP W+ VP++ T T YI + V + K + + + Sbjct: 30 IPDGWRTVPLRSLCLATELTDPTKSPATSFQYIDVSAVSNDLWKITGSTEHLGTTAPSRA 89 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLV--LQPKDVLPELLQGWLLSID 132 + +++ + P LR+ + I ST F V P + LL+ + Sbjct: 90 RKLVGANDVIFATVRPMLRRIAMIPEYLDGQIVSTAFCVLRANPTQADSRFIYYTLLTDE 149 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +RI + GA+ I + +PPLAEQ I + +I + + + + Sbjct: 150 FIERIGNLQRGASYPAVTDGDILGQEILVPPLAEQHAIAAVL----SKIQAAVEVQDKLV 205 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK-- 250 LKE K A + + +GL +K + I G +P+ WEV L + K T Sbjct: 206 AALKELKAATTAKLFCEGL-RGEPLKQTEI---GEIPESWEVMRLCELASIERGKFTHRP 261 Query: 251 -----LIESNILSLSYGNIIQ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 I + G++ + ++ T + L E ++ G IV D Sbjct: 262 RNEPRFYGGAIPFIQTGDVAKSNGRIRTYSQTLNEEGLAISRLFPKGTIVLTIAANIADT 321 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 L ++ +D+ +L +R+ + + G ++++ + Sbjct: 322 AILEFDSAFPDSLVG----ITPDGTMDAAFLECYLRTQKA-DMNHLAPKGTQKNININFL 376 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 K P I+EQ +I + + ++++ + LK SS + +TGQ+ + Sbjct: 377 KPWSTPRPSIEEQQEIAHSLRCLDNKLELAWARR----DTLKSLFSSMLHLLMTGQVRV 431 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 66/206 (32%), Gaps = 13/206 (6%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGK 62 K + IG IP+ W+V+ + + G+ + ++ I +I DV G+ Sbjct: 230 KQTE---IGEIPESWEVMRLCELASIERGKFTHRPRNEPRFYGGAIPFIQTGDVAKSNGR 286 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + +F KG I+ AI+ + + + Sbjct: 287 IRTYSQTLNEEGLAISRLFPKGTIVLTIAANIADTAILEFDSAFPDSLVGITPDGTMDAA 346 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L+ +L + + + T + + + P P + EQ I + +++ Sbjct: 347 FLECYLRTQ--KADMNHLAPKGTQKNININFLKPWSTPRPSIEEQQEIAHSLRCLDNKLE 404 Query: 183 TLITERIRFIELLKEKKQALVSYIVT 208 R L L++ V Sbjct: 405 LAWARRDTLKSLFSSMLHLLMTGQVR 430 >gi|312952955|ref|ZP_07771811.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0102] gi|310629096|gb|EFQ12379.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0102] Length = 404 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 64/402 (15%), Positives = 141/402 (35%), Gaps = 27/402 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + W+ ++ + T + + + + E ++ KD ++ ++ + I Sbjct: 16 EDWEQRKLEDISDKVTEKNKNNEFTETLTNSAEFGIINQREFFDKDISNEKN-LNGYYIV 74 Query: 82 AKGQILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + +Y + G+ S + V +P D+ + L+ + + + Sbjct: 75 REDDFVYNPRISNYAPVGPIKRNKLGRTGVMSPLYYVFRPHDIDKKFLEYFFGTTIWHKF 134 Query: 137 IEAICEGATM---SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ + +P+P P + EQ + + ++ I R ++ Sbjct: 135 MKLNGDSGARADRFAIKDSVFKTMPIPYPSIEEQKKVGKFFDD----LNDTIALHQRKLD 190 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LLKE K+ + + P K I + G WE + V + K + + Sbjct: 191 LLKETKKGFLQKMF-----PKNGAKVPEIRFPGFT-KDWEQRKLGDFVVDYVEKTSVQNQ 244 Query: 254 SNILSLSYGNII--QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +L+ S I Q+ N + E+ Y ++ G FR ND ++ Sbjct: 245 FPMLTSSQQKGIVLQEDYFANRQVTTENNIGYFVLPRGYFTFRSRS-DNDVFVFNRNDII 303 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 +RGII+ Y DS + + + ++ + L + K + + P Sbjct: 304 DRGIISYFYPVFTLKSADSDFFLRRINNGIQRQLSIQAEGTGQHVLSLKKFKNIVAMFPS 363 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I ++D + ++ + LLKE + F+ Sbjct: 364 EEEQQKIGTF----FKQLDDTITLHQRKLDLLKETKKGFLQK 401 >gi|42779918|ref|NP_977165.1| type I restriction-modification enzyme, S subunit, putative [Bacillus cereus ATCC 10987] gi|42735836|gb|AAS39773.1| type I restriction-modification enzyme, S subunit, putative [Bacillus cereus ATCC 10987] Length = 476 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 70/437 (16%), Positives = 157/437 (35%), Gaps = 44/437 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPK---DGNS 70 +P++W ++ +G T + I +I D+ Y+ K + Sbjct: 21 VPENWIWTWTGAIAEVISGGTPKSKVEEYYKDGTISWITPADLSGYQDMYISKGKRNITE 80 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + S+ + +L P IA D + F P + + Sbjct: 81 LGLNKSSAKMLPINTVLLSSRAPI-GYVAIAAKDLCTNQGFKSFAPSNAY-YPKYLYWYL 138 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +E++ G+T IP+P+PP+ EQ + EK+ +++ T Sbjct: 139 KFSKYYMESMASGSTFKELSSNKSKEIPIPLPPINEQKRVSEKVERLLNKVEEAKTLIEE 198 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDS--------------GIEWVGLVPDHWEVKP 236 E + ++ A++ + L + ++S E +P W+ Sbjct: 199 AKETFELRRAAILDKAFSGDLTGKWRKENSFQQNEECISDNELRDSEVFYPIPKTWKWTK 258 Query: 237 FFALVTELNR---KNTKLIESNILSLSYGNIIQK--LETRNMGLKPESYETYQI----VD 287 + T N K+ +E I + GN+ + RN P ++ I V+ Sbjct: 259 LKDVATFKNGYAFKSKDFVEQGIQLIRMGNLYKNELRLDRNPVYIPLDFDEKIIEKYTVE 318 Query: 288 PGEIVFRFIDLQN---DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 G+I+ + ++R + ++ +++KPH +D Y+ + ++S Sbjct: 319 KGDILLSLTGTKYKRDYGYAVRVDGRDKNLLLNQRILSLKPHMMD-EYIYYYLQSSVFRN 377 Query: 345 VFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI-V 401 F++ +G + ++ + V+ + + +PP E +I + + + +I Sbjct: 378 AFFSFETGGVNQGNVGSKAVESILIPIPPADEAKEIEKKLARLLN--NEKEALVVLAIEE 435 Query: 402 LLKERRSSFIAAAVTGQ 418 L+ + S ++ A G+ Sbjct: 436 KLEVLKQSALSKAFRGE 452 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 41/231 (17%), Positives = 73/231 (31%), Gaps = 17/231 (7%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPK 66 +DS V IPK WK +K G +S I I + ++ + Sbjct: 242 RDSEV--FYPIPKTWKWTKLKDVATFKNGYAFKSKDFVEQGIQLIRMGNLYKNELRLDRN 299 Query: 67 DGNS---RQSDTSTVSIFAKGQILYGKLGP-------YLRKAIIADFDGICSTQFLVLQP 116 KG IL G Y + D + + + + L L+P Sbjct: 300 PVYIPLDFDEKIIEKYTVEKGDILLSLTGTKYKRDYGYAVRVDGRDKNLLLNQRILSLKP 359 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + S+ G + K + +I +PIPP E I +K+ Sbjct: 360 HMMDEYIYYYLQSSVFRNAFFSFETGGVNQGNVGSKAVESILIPIPPADEAKEIEKKLAR 419 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 + +L + KQ+ +S L + +++ IE + Sbjct: 420 LLNNEKEALVVLAIEEKL-EVLKQSALSKAFRGELGTNDPTEENTIELLKE 469 >gi|146281027|ref|YP_001171180.1| type I restriction-modification system, S subunit [Pseudomonas stutzeri A1501] gi|145569232|gb|ABP78338.1| type I restriction-modification system, S subunit [Pseudomonas stutzeri A1501] Length = 472 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 70/436 (16%), Positives = 147/436 (33%), Gaps = 39/436 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P W +K L+ G+T D+ ++ +D++ + + Sbjct: 3 ELPSGWTRFALKDLGGLSGGKTPSKANPEFWSTRDVPWVSPKDMKKNLLEDAEDRISQNA 62 Query: 73 SDTSTVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 D + ++++ G +L L +A + + VL+P + + ++L Sbjct: 63 VDEAGMTLYPSGSVLMVTRSGILQHTFPVALAGVELTVNQDIKVLRPIEGIVPKFSFYML 122 Query: 130 SIDVTQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + A + G T+ D + + +PPLAEQ I +K+ ++DTL Sbjct: 123 KSFGAEILSACSKDGTTVQSIDSEKLETFLFSLPPLAEQTRIAQKLDELLAQVDTLKARI 182 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 LLK +Q++++ V+ L + + E + + Sbjct: 183 DAIPALLKRFRQSVLAAAVSGRLTEEWRGSIPASESAEEYLSRVIQVRRQKPIVKFKEPV 242 Query: 249 TKLIESNILSLSYGNII------------------------QKLETRNMGLKPESYETYQ 284 +E+ L + G I+ + + G E + Sbjct: 243 PPDLETRELEVPEGWIVASVSSFAECLDSMRVPVKKELRESGEGKYPYFGANGEVDRVDE 302 Query: 285 IVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + ++V D R + + + + + YL +++ YD+ Sbjct: 303 YIFDDDLVLVTEDETFYGRVKPIAYKYSGKCWVNNHVHALRAHDAVARDYLCYVLMHYDV 362 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G+ R L + LP+ VPP EQ +I + A D L ++ + Sbjct: 363 VPWL--TGTTGRAKLTQGALLSLPIQVPPATEQTEIVRRVEQLFAFADQLEARVNAAKAC 420 Query: 403 LKERRSSFIAAAVTGQ 418 + S +A A G+ Sbjct: 421 IDRLTQSILAKAFRGE 436 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 63/209 (30%), Gaps = 14/209 (6%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN---------MGLKP 277 +P W L K S + + + + Sbjct: 3 ELPSGWTRFALKDLGGLSGGKTPSKANPEFWSTRDVPWVSPKDMKKNLLEDAEDRISQNA 62 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + G ++ + A + ++P ++ M Sbjct: 63 VDEAGMTLYPSGSVLMVTRSGILQH-TFPVALAGVELTVNQDIKVLRPIEGIVPKFSFYM 121 Query: 338 RSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++ A QS+ E ++ +PP+ EQ I ++ A++D L + Sbjct: 122 LKSFGAEILSACSKDGTTVQSIDSEKLETFLFSLPPLAEQTRIAQKLDELLAQVDTLKAR 181 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 I+ LLK R S +AAAV+G L E Sbjct: 182 IDAIPALLKRFRQSVLAAAVSG--RLTEE 208 >gi|332299057|ref|YP_004440979.1| protein of unknown function DUF45 [Treponema brennaborense DSM 12168] gi|332182160|gb|AEE17848.1| protein of unknown function DUF45 [Treponema brennaborense DSM 12168] Length = 646 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 63/436 (14%), Positives = 123/436 (28%), Gaps = 68/436 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P WK++ + + + G + + + + + ++ G+ K + Sbjct: 68 ELPNSWKLMKLSDVSIIQEGAGIRKFQYTKEGTQLLSVTNILQGSIDLNKKQLFVSTEEY 127 Query: 76 STVSI---FAKGQILYGKLGPYLRKAIIADFDGIC----STQFLVLQPKDVLPELLQGWL 128 + KG IL G K I D + ST L L Sbjct: 128 KKKYLHLTPKKGDILTACSGGSWGKVAIYDKEDTVMLNTSTLRLRFFGDLADNNFLYYLC 187 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S ++++ G + + I +P+PP+ EQ I EK+ ID E Sbjct: 188 QSPLFKEQLKEQLAGM-QPNFGYAHYSRIILPLPPIEEQQRIVEKLNHILPLIDEYSKEE 246 Query: 189 IRFIELLK----EKKQALVSYIVTKGLNPDVKMKDS------------------------ 220 I L + E K++++ + L ++ S Sbjct: 247 DELIALCQKFPEEMKKSVLQAAMQGKLTRQLETDSSVDELLKKIAEEKAKLIKEGKIRKD 306 Query: 221 ---------------GIEWVGLVPDHWEVKPFFALVTE------LNRKNTKLIESNILSL 259 E +P++W + K+ I + Sbjct: 307 TTKAGASSRALAEITEDEIPFDIPENWRWTKLGLIGDWGAGATPDRGKSEYYKNGTIPWI 366 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIV---DPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 G + + T E + +++ K ++ + Sbjct: 367 KTGELNDSIITSAEEYITEMAFEKCSLRMNKMNDVLIAMYGATIGKVAIAGFDLT----T 422 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 A A P G Y + + G + ++ + P +PPI+EQ Sbjct: 423 NQACCACTPFGGIYNYYLFYFLKANKPDFVKQSAGGAQPNISRTKIVDTPFPLPPIEEQQ 482 Query: 377 DITNVINVETARIDVL 392 I +N ID + Sbjct: 483 RIVEKLNTILPIIDSM 498 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 89/231 (38%), Gaps = 16/231 (6%) Query: 204 SYIVTKGLN---PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNIL 257 V++GL + + S ++ +P+ W++ + + + + Sbjct: 42 QEYVSEGLFYNKKETEPYYSDEDYPYELPNSWKLMKLSDVSIIQEGAGIRKFQYTKEGTQ 101 Query: 258 SLSYGNIIQKLETRNMG---LKPESYETYQI---VDPGEIVFRFIDLQNDKRSLRSAQVM 311 LS NI+Q N + E Y+ + G+I+ K ++ + Sbjct: 102 LLSVTNILQGSIDLNKKQLFVSTEEYKKKYLHLTPKKGDILTACSGGSWGKVAIYDKEDT 161 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 ++ + D+ +L +L +S + +G++ + + R+ + +PP Sbjct: 162 VMLNTSTLRLRFFGDLADNNFLYYLCQSPLFKEQLKEQLAGMQPNFGYAHYSRIILPLPP 221 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLK----ERRSSFIAAAVTGQ 418 I+EQ I +N ID ++ ++ I L + E + S + AA+ G+ Sbjct: 222 IEEQQRIVEKLNHILPLIDEYSKEEDELIALCQKFPEEMKKSVLQAAMQGK 272 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 64/190 (33%), Gaps = 8/190 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 IP++W+ + G T + GK I +I ++ + Sbjct: 328 DIPENWRWTKLGLIGDWGAGATPDRGKSEYYKNGTIPWIKTGELNDSIITSAEEYITEMA 387 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + ++ + +L G + K IA FD + P + + L + Sbjct: 388 FEKCSLRMNKMNDVLIAMYGATIGKVAIAGFDLTTNQACCACTPFGGIYNYYLFYFLKAN 447 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 G + I + P P+PP+ EQ I EK+ ID++ + Sbjct: 448 -KPDFVKQSAGGAQPNISRTKIVDTPFPLPPIEEQQRIVEKLNTILPIIDSMAVYGTKKK 506 Query: 193 ELLKEKKQAL 202 ++++AL Sbjct: 507 AGRPKQEEAL 516 >gi|317011668|gb|ADU85415.1| restriction modification system DNA specificity domain protein [Helicobacter pylori SouthAfrica7] Length = 397 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 57/406 (14%), Positives = 132/406 (32%), Gaps = 41/406 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDT 75 +W+ + + ++ G T + +I + ++ + + Sbjct: 8 SNWEKIRLGDICEIVGGGTPSTQITSFWNGNINWFTPTEIGITKYVYKSQRTITPLGLKK 67 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S+V + G IL + I + F L P + + + L++ + Sbjct: 68 SSVKLLPIGTILLTSR-ASIGDCAILKVVATTNQGFQSLIPLEKINNE-FLYYLTLTLKN 125 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ + G+T I N+ +P+PPL EQ+ I + + L + ++ + Sbjct: 126 KLLKLASGSTFLEVSPNKIKNLLIPLPPLNEQIAIANILSGVDRYLYALDSLILKKESVK 185 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K L+S ++K W + + +V L Sbjct: 186 KALSFELLSQ--------RKRLKGFNQAW-----EKIRLGDICEIVKGQQINKINL---- 228 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 N K N G+ Y V I R + S Sbjct: 229 -------NNTDKYPVINGGIDFLGYTNKFNVSKNTIAISEGGTCGYVRFMTSNFWSGGHN 281 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + + +++ L +++SY+ + ++++ + +K + +PP+ EQ Sbjct: 282 YS---LQKISNRVNNLCLYHILKSYE-KDIMKLGVGSGLKNIQLKALKDFEIPLPPLNEQ 337 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 338 TAIANILSALDHEIISLKNKKRQ----FENIKKALNHDLMSAKIRV 379 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 28/219 (12%), Positives = 70/219 (31%), Gaps = 18/219 (8%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 ++ + + +G + + +T N + ++ + Sbjct: 1 MDALTTLSNWEKIRLGDICEIVGGGTPSTQITSFWNGNINWFTPTEIGITKYVYKSQRTI 60 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 +GLK S + ++ G I+ D L+ + ++ P + Sbjct: 61 TPLGLKKSSVK---LLPIGTILLTSRASIGDCAILKVV-----ATTNQGFQSLIPLEKIN 112 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR-- 388 + + K+ + +K L + +PP+ EQ I N+++ Sbjct: 113 NEFLYYLTLTLKNKLLKLASGSTFLEVSPNKIKNLLIPLPPLNEQIAIANILSGVDRYLY 172 Query: 389 -IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +D L+ K E + + ++ + L+G +Q Sbjct: 173 ALDSLILKKESV-------KKALSFELLSQRKRLKGFNQ 204 >gi|53803791|ref|YP_114325.1| type I restriction-modification system, S subunit [Methylococcus capsulatus str. Bath] gi|53757552|gb|AAU91843.1| type I restriction-modification system, S subunit [Methylococcus capsulatus str. Bath] Length = 416 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 71/418 (16%), Positives = 133/418 (31%), Gaps = 38/418 (9%) Query: 23 KHWKVVPIKRFTK-----LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + W I L G S Y G + + KD N+ S Sbjct: 3 EEWTEARIDELGNGRRPVLKAGPFGSSVTKATYKTSGYKVYGQQEVVAKDPNAEAYFVSE 62 Query: 78 VSI-------FAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLV--LQPKDVLPELLQG 126 + G IL +G R + + +GI + + + +L E + Sbjct: 63 ATFTRHKSCAVKPGDILMTMMGTIGRVYRVPEGAPEGIINPRLVRLAFDTSRILSEYAEV 122 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L + + ++ G TM + + + +I + +PPL EQ I E + DT I Sbjct: 123 ALEQPSLQRLLDRRSHGGTMQGLNLEALASIRLLLPPLPEQRKIVEIL----RTWDTAIE 178 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 R I + +S ++++G +P DS E + + + + Sbjct: 179 TTERLIAAKERFYAHELSRLISRGQHPRRPNGDSASEASEPDRGSQWRTVSLSDIATVWK 238 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 E S +Y N G+ P + I Sbjct: 239 GQQLNKEHMEESGAYY-------VLNGGINPSGRTNDWNCEAKTITISSGGNS----CGF 287 Query: 307 SAQVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 +ER +D YL + ++S + GSG+ + D++ Sbjct: 288 INLNLERFWCGGDCFALKQISPLVDVDYLFFYLKSRQHQMMALRTGSGI-PHIYRSDIES 346 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 PV++P + Q I + + + +S+ LK ++ + +TGQ + Sbjct: 347 FPVILPDLATQTAIARYLTALREE----ITLLSRSLGALKRQKRGLMQKLLTGQWRVP 400 >gi|270292634|ref|ZP_06198845.1| putative type I restriction-modification system, S subunit [Streptococcus sp. M143] gi|270278613|gb|EFA24459.1| putative type I restriction-modification system, S subunit [Streptococcus sp. M143] Length = 384 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 51/402 (12%), Positives = 132/402 (32%), Gaps = 32/402 (7%) Query: 26 KVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSI 80 K V + ++ G +S + I I + +V+ G + + S I Sbjct: 2 KKVKLGEVCEILNGFAFKSLLYVNEGIRIIRITNVQKGYIEDSDPKYYPIEYTNSIEKYI 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSID--VTQR 136 + +L G R +I+ + + L+ D L + Q Sbjct: 62 LKENDLLMSLTGNVGRVGLISKTMLPAALNQRVACLRTIDSLISKEYVFQFLNSDLFEQS 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 G + + + + P + +Q LI + I+ LI R + L Sbjct: 122 AIRSSNGVAQKNLSTDWLKKVEITYPSVEQQELITSTLN----LIERLICCRKEQNKKLN 177 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 E ++ + + + +++ + ++K + + S + Sbjct: 178 ELVKSRFNEMFGDPVFNEMRWR------------RCKLKDISIEKLAYGSGASAIDFSGL 225 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + +I + + P Y+ +++ G+I+F K L S + + Sbjct: 226 RYIRITDIDECGNLKLDKKSPSHYDEKYLLNTGDILFARSGATVGKTFLYSKEKYGPALY 285 Query: 317 TSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIK 373 + + P+ ++ ++ + + + + ++ + L ++PP+ Sbjct: 286 AGYLIRLIPNLSLVNPVFVYHFTNTKFYNDFIAKVQNTVAQPNINAKQYSELDFILPPLS 345 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q + + + A++D I++S+ L+ + S + Sbjct: 346 LQNEFADFV----AQVDKSQLAIQKSLEELETLKKSLMQEYF 383 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 65/194 (33%), Gaps = 19/194 (9%) Query: 25 WKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVS 79 W+ +K + +G ++ + YI + D++ G K K + Sbjct: 198 WRRCKLKDISIEKLAYGSGASAIDFSGLRYIRITDIDECGNLKLDKKSPSHYDE----KY 253 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQF------LVLQPKDVLPELLQGWLLSIDV 133 + G IL+ + G + K + + + L+ V P + + + Sbjct: 254 LLNTGDILFARSGATVGKTFLYSKEKYGPALYAGYLIRLIPNLSLVNPVFVYHFTNTKFY 313 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I + + + K + +PPL+ Q + ++D + +E Sbjct: 314 NDFIAKVQNTVAQPNINAKQYSELDFILPPLSLQNEFADF----VAQVDKSQLAIQKSLE 369 Query: 194 LLKEKKQALVSYIV 207 L+ K++L+ Sbjct: 370 ELETLKKSLMQEYF 383 >gi|27380124|ref|NP_771653.1| hypothetical protein bll5013 [Bradyrhizobium japonicum USDA 110] gi|27353278|dbj|BAC50278.1| bll5013 [Bradyrhizobium japonicum USDA 110] Length = 433 Score = 127 bits (318), Expect = 4e-27, Method: Composition-based stats. Identities = 50/433 (11%), Positives = 137/433 (31%), Gaps = 42/433 (9%) Query: 23 KHWKVVPIKRFTK-LNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W + + L TG+ G + + IG E++++ GK + N + Sbjct: 2 SDWIERSLAQLISPLETGKRPAGGVSADTEGVPSIGGENIDAA-GKMSYSNVNRISPAYA 60 Query: 77 ---TVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLS 130 G L K G K D + +++P + + Sbjct: 61 HLMKKGKLKSGDTLINKDGAQTGKVAQYDGQFADAWINEHVFIVRPDPGKIDAGYLFYSM 120 Query: 131 IDV--TQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITE 187 +D +I G+ + + + +P + Q I + + +D I Sbjct: 121 LDGRAQNQIARRITGSAQPGLNSDFAKAVTLRLPRDIKLQAKIADIL----RLLDVQIEA 176 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE-----------WVGLVPDHWEVKP 236 I + + L+ + T+G++ +++ S E W+ L + + Sbjct: 177 TEALITKQERVRAGLMQDLFTRGIDEHGQLRPSRDEAPQLYNRTDLGWLPLGWEAARLVN 236 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPGE 290 + + + + + ++ ++ + + + G+ Sbjct: 237 LTSRIVDGVHHTPTYVPHGVPFVTVKSLTAGRGIDTRQGNFITLSDHHVFQMRADPRAGD 296 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF-YAM 349 ++ R + ++ A + ++ +L R+ Y Sbjct: 297 VLVSKDGTLGVARYVDETVEEFSIFVSVAMLRPITSLLNPAFLCEFFRTRFYEAQMGYLS 356 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + + E ++ + P + EQ I ++++ D + ++E + L+ +++ Sbjct: 357 AGSGLKHIHLEHFRKFVLPRPDLSEQAKILSILDAA----DQSIVRLEDMLRKLRLQKAG 412 Query: 410 FIAAAVTGQIDLR 422 + +TG++ + Sbjct: 413 LLQDLLTGEVSVP 425 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 64/201 (31%), Gaps = 11/201 (5%) Query: 18 IGAIPKHWKVVPIKRFTK-LNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +G +P W+ + T + G + ++ ++ + +G G + S Sbjct: 222 LGWLPLGWEAARLVNLTSRIVDGVHHTPTYVPHGVPFVTVKSLTAGRGIDTRQGNFITLS 281 Query: 74 DTSTVSI---FAKGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQG 126 D + G +L K G + +F S L + P L Sbjct: 282 DHHVFQMRADPRAGDVLVSKDGTLGVARYVDETVEEFSIFVSVAMLRPITSLLNPAFLCE 341 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + ++ + G+ + H + +P P L+EQ I + A I L Sbjct: 342 FFRTRFYEAQMGYLSAGSGLKHIHLEHFRKFVLPRPDLSEQAKILSILDAADQSIVRLED 401 Query: 187 ERIRFIELLKEKKQALVSYIV 207 + Q L++ V Sbjct: 402 MLRKLRLQKAGLLQDLLTGEV 422 >gi|311278007|ref|YP_003940238.1| restriction modification system DNA specificity domain-containing protein [Enterobacter cloacae SCF1] gi|308747202|gb|ADO46954.1| restriction modification system DNA specificity domain protein [Enterobacter cloacae SCF1] Length = 394 Score = 127 bits (318), Expect = 5e-27, Method: Composition-based stats. Identities = 56/400 (14%), Positives = 126/400 (31%), Gaps = 29/400 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + W + + + T T ++ + + Y+ V+ G Y + + + Sbjct: 12 EEWTMTLLSKLATKITDGTHDTPDTTSEGVPYLTAIHVKDGYIDYKNCYYLDKLTHSFIY 71 Query: 79 SIF--AKGQILYGKLGPYLRKAIIADFDGICS--TQFLVLQPKDVLPELLQGWLLSIDVT 134 K +L +G + D S LV K+ + + + Sbjct: 72 KRCNPEKNDLLIVNIGAGTGTCALNTVDYEFSLKNVALVKPNKNKIYPFYLLQVQRKNAK 131 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G K IG I +P+ EQ I + + + +I L + + Sbjct: 132 KLFHELTSGGAQPFLSLKEIGKIKIPLCQYDEQTKIADFLSSVDDKITLLNQQYDLLCQY 191 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K Q + + + + G W +L + ++K L Sbjct: 192 KKGMMQKIFNQELRFK------------DENGEEFPEWNYDEISSLFSNKSKKYNPLSGI 239 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 N + I Q+ L ++ + G+++F + K L + Sbjct: 240 NYPCIEMDCISQRDGQLLTTLDSTQQQSIKNVFKKGDVLFGKLRPYLRKYILATFD---- 295 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 G+ +S K I+++YL +++ + ++ + V P ++ Sbjct: 296 GVCSSEIWVFKGILINNSYLYQFIQTDFFINLANKSTGSKMPRADWDTISSTFVFYPCLE 355 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I N ++ +D + + + LK + + Sbjct: 356 EQSKIANFLSA----LDDKIAVKKAELDKLKTWKQGLLQQ 391 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 20/203 (9%), Positives = 57/203 (28%), Gaps = 10/203 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES-----YE 281 + +T+ + L+ ++ + Sbjct: 12 EEWTMTLLSKLATKITDGTHDTPDTTSEGVPYLTAIHVKDGYIDYKNCYYLDKLTHSFIY 71 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + +++ I +L + E + A + + I YL + R Sbjct: 72 KRCNPEKNDLLIVNIGAGTGTCALNTVD-YEFSLKNVALVKPNKNKIYPFYLLQVQRKNA 130 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 G + L +++ ++ + + EQ I + ++ D + + Q Sbjct: 131 KKLFHELTSGGAQPFLSLKEIGKIKIPLCQYDEQTKIADFLSSV----DDKITLLNQQYD 186 Query: 402 LLKERRSSFIAAAVTGQIDLRGE 424 LL + + + ++ + E Sbjct: 187 LLCQYKKGMMQKIFNQELRFKDE 209 >gi|78043287|ref|YP_359684.1| putative type I restriction-modification system, S subunit [Carboxydothermus hydrogenoformans Z-2901] gi|77995402|gb|ABB14301.1| putative type I restriction-modification system, S subunit [Carboxydothermus hydrogenoformans Z-2901] Length = 403 Score = 126 bits (317), Expect = 5e-27, Method: Composition-based stats. Identities = 65/414 (15%), Positives = 132/414 (31%), Gaps = 32/414 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W+ V + K E I E +S + L + T Sbjct: 6 KIPEGWQWVKLGDILK------YEQPYKYIVKSTEYKDSNSIPVLTAGKSFILGYTDEKD 59 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + I + S+ K+ + ++ + Sbjct: 60 GIYTNLPVIIFDDFTTESKFIKFPFKVKSSAL--KFLKEKSDNFVLKFIFESMQLIKFNN 117 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + +P+PPL EQ I E + +D I + IE K K Sbjct: 118 VGGEHKRRWIS--EFQLFKIPLPPLPEQRKIAEIL----ETVDNAIEKTDAIIEKYKRLK 171 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEW-----VGLVPDHWEVKPFFALVTELNRKNT----- 249 Q L+ ++TKG++ + +++D +G +P+ WEV + Sbjct: 172 QGLMQDLLTKGIDENWQIRDEKTHKFKDSPLGRIPEEWEVIMLEKCGKIVTGSTPSTEIP 231 Query: 250 --KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 E +S + + L + + + + P + I K +L S Sbjct: 232 QYYGDEFQFISPEDIQDNKYILETKKMLSKQGFNLQRKLPPKSVCVVCIGSTIGKVALTS 291 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 I + + +P + L + + Y + G + ++ + Sbjct: 292 TFSSTNQQINT--IVPRPELWEPEALYYFVSFYIQNPLRMEAGMQAVPIVNKGKFSKILI 349 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +PP+ EQ I V+ ++ID +EK + L+ + + +TG++ + Sbjct: 350 PLPPLPEQQRIAAVL----SQIDEAIEKEQAYKEKLERIKKGLMEDLLTGRVRV 399 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 75/207 (36%), Gaps = 11/207 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGK 62 ++KDS +G IP+ W+V+ +++ K+ TG T + G + +I ED++ Sbjct: 196 KFKDSP---LGRIPEEWEVIMLEKCGKIVTGSTPSTEIPQYYGDEFQFISPEDIQDNK-Y 251 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 L + + + +G + K + + Q + P+ L E Sbjct: 252 ILETKKMLSKQGFNLQRKLPPKSVCVVCIGSTIGKVALTSTFSSTNQQINTIVPRPELWE 311 Query: 123 LLQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + Q + G + + I +P+PPL EQ I + I Sbjct: 312 PEALYYFVSFYIQNPLRMEAGMQAVPIVNKGKFSKILIPLPPLPEQQRIAAVLSQIDEAI 371 Query: 182 DTLITERIRFIELLKEKKQALVSYIVT 208 + + + + K + L++ V Sbjct: 372 EKEQAYKEKLERIKKGLMEDLLTGRVR 398 >gi|303239471|ref|ZP_07325998.1| restriction modification system DNA specificity domain protein [Acetivibrio cellulolyticus CD2] gi|302593034|gb|EFL62755.1| restriction modification system DNA specificity domain protein [Acetivibrio cellulolyticus CD2] Length = 447 Score = 126 bits (317), Expect = 6e-27, Method: Composition-based stats. Identities = 68/414 (16%), Positives = 139/414 (33%), Gaps = 22/414 (5%) Query: 18 IGAIPKHWKVVPIKRFTK-LNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNS 70 I +P+ W + + G T S + I +I +E+V G Sbjct: 4 IKDLPRGWVSEKMMEVATRITKGATPTSYGYNFLKEGINFIKVENVSFGRVDLYSISDYI 63 Query: 71 RQSDT--STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQ 125 + SI + IL+ G + I+ +T ++ K + Sbjct: 64 SEEAHLCQKKSILEENDILFSIAGTIGKTCIVRKEYLPANTNQALAIIKGVKRITLPSFL 123 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L V + +++ G M++ + + N+ + IPPL+EQ I KI +D + Sbjct: 124 VLQLESFVASKTKSMARGGAMNNISLEDLKNLEIFIPPLSEQHRIVSKIEELFSELDKGV 183 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + LK +QA++ + + + I + + + + Sbjct: 184 ESLKTAQQQLKVYRQAVLKWAFDGEMCIQNEKSVRSISSLITSGSRGWAQYYSDKGAKFI 243 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R G I E + + L ++ + G+++ I L Sbjct: 244 RIGN--------LTRVGIDIDLSEVQYVRLPEKAEGLRSRLQEGDLLIS-ITADLGSIGL 294 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKR 364 + E I + S ++AW ++S + G ++ L +D++ Sbjct: 295 VPSNFGEAYINQHIAVVRLNDSRYSKFVAWYLKSETGRRRLLEYQRGATKKGLGLDDIRD 354 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + P + I I + D + E IE+S+ + R S + A G+ Sbjct: 355 VLIPYPEVHVAQKIVQEIESRLSVCDKMEEAIEKSLAQAEALRQSILKKAFEGK 408 >gi|255527616|ref|ZP_05394478.1| restriction modification system DNA specificity domain protein [Clostridium carboxidivorans P7] gi|296187659|ref|ZP_06856053.1| type I restriction modification DNA specificity domain protein [Clostridium carboxidivorans P7] gi|255508688|gb|EET85066.1| restriction modification system DNA specificity domain protein [Clostridium carboxidivorans P7] gi|296047616|gb|EFG87056.1| type I restriction modification DNA specificity domain protein [Clostridium carboxidivorans P7] Length = 407 Score = 126 bits (317), Expect = 6e-27, Method: Composition-based stats. Identities = 61/418 (14%), Positives = 142/418 (33%), Gaps = 34/418 (8%) Query: 20 AIPKHWKVVPIKR-FTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQ--- 72 +PK WK V +K L +G+ + + +G E + + G + D Sbjct: 2 KLPKEWKEVNLKEYILTLESGKRPKGGAIDNGVPSLGGEHINNTGGFNIQIDKLKYVPRE 61 Query: 73 -SDTSTVSIFAKGQILYGKLGPYLRKAIIADFD-----GICSTQ-FLVLQPKDVLPELLQ 125 + K IL K G K D + + FL+ + + + L Sbjct: 62 FFKKMKSGVVKKNDILIVKDGATTGKIAFVDNNFNLKEACINEHLFLIRTNERLNNKFLS 121 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L S ++I GAT+ + + +PPL Q I + + ++ Sbjct: 122 YYLRSNTGRKKILEDFRGATVGGISK-NFIDFNILLPPLETQKKIVKVLEKAEETLEKRK 180 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 +L + S + +P K + +G + Sbjct: 181 ESINLLDKL-------VKSRFIGMFGDPSSNPKGWNKDTIG---SVVKSITAGWSANGEA 230 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R+ + ++ + + K + + + Y + G+++F + + + Sbjct: 231 REKREDEKAVLKVSAVTQGYFKADEYKVIGDDVEIKKYVFPEKGDLLFSRANTREMVGAT 290 Query: 306 RSAQVMERGIITSA--YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFE 360 ++ + ++ Y+ +++ + F A +G ++ + Sbjct: 291 CIIHKDYPDLLLPDKLWKVSFVERVNVFYMKYILSEPSIRAEFSAKSTGTSGSMYNVSMD 350 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 K + + +PPI+ Q + +N ++D L ++E+S+ L++ S + A G+ Sbjct: 351 KFKSIEITIPPIELQNQFADFVN----QVDKLKFEMEKSLKELEDNFKSLMQKAFKGE 404 >gi|313114694|ref|ZP_07800196.1| type I restriction modification DNA specificity domain protein [Faecalibacterium cf. prausnitzii KLE1255] gi|310622919|gb|EFQ06372.1| type I restriction modification DNA specificity domain protein [Faecalibacterium cf. prausnitzii KLE1255] Length = 381 Score = 126 bits (317), Expect = 6e-27, Method: Composition-based stats. Identities = 57/396 (14%), Positives = 127/396 (32%), Gaps = 27/396 (6%) Query: 29 PIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + G + S + I I ++D+ N + ++ G Sbjct: 3 KLGDIATYINGYAFKPQDWSDEGIPIIRIQDLTGN-----SYQANRYNGEYASKYEVNDG 57 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +L L I + + + + + + GA Sbjct: 58 DVLISWS-ASLGVYIWHGEKAVLNQHIFKVVFDKERISKDFFVHQVGLILENAASDAHGA 116 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 TM H +P +PP +Q I E + T I + + EL + + Sbjct: 117 TMKHLTKPVFDALPFYLPPYEKQCEIAEVLDKVTSLISLRKQQLAKLDEL-------VKA 169 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 V + K+ + V + + T+ + +T + I + G + Sbjct: 170 RFVEMFGDSVANTKNFPSTTLETVMTVFPQNGLYKPQTDYVQDDTGIPILRIDAFYNGKV 229 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ-VMERGIITSAYMAV 323 + + + E+ ++ +IV ++ + E+ + S M Sbjct: 230 TNWNTLKRL-ICSETEIDRYLLKENDIVINRVNSIEYLGKCAHIVGLKEKTVFESNMMRF 288 Query: 324 KP--HGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 +++ Y+ ++ + D+ + S + S+ EDVK L +LVPP+ Q Sbjct: 289 HMDEKKVNAVYVTEVLCTEDIYRQILRRAKKSVNQASINQEDVKSLEILVPPLSLQNQFA 348 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + R+D + ++QS+ L+ + + + Sbjct: 349 AFVE----RVDQQKQTVQQSLEKLELMKKALMQEYF 380 >gi|304396445|ref|ZP_07378326.1| restriction modification system DNA specificity domain protein [Pantoea sp. aB] gi|304355954|gb|EFM20320.1| restriction modification system DNA specificity domain protein [Pantoea sp. aB] Length = 450 Score = 126 bits (316), Expect = 6e-27, Method: Composition-based stats. Identities = 58/406 (14%), Positives = 131/406 (32%), Gaps = 8/406 (1%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN-SRQSDTST 77 G +P+ W + + G + I + + K + +T Sbjct: 4 GKLPEGWVLSKFTDLMDVQGGTQPPKSEFIAEEKEGYIRLLQIRDFGKKPVPTYIPETKK 63 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + K +L G+ G L + +G + + L + L ++ Q Sbjct: 64 LKTCRKEDLLIGRYGASLGRIC-TGHEGAYNVALAKVIYPQELERSYIRYYLESEIFQFP 122 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + + + + + + P EQ +I EK+ V++D+ + ++LK Sbjct: 123 LKLLSRSAQNGFNKEDLSRFDFLLAPRDEQKIIAEKLDTLLVQVDSTKARLEQIPQILKR 182 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +QA+++ V+ L D + W + L + + Sbjct: 183 FRQAVLAAAVSGKLTEDYRENQVITSWDNTTLGTLIIDSCNGLAKRSGTDGEDITILRLA 242 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGII 316 I E R + L + Y + +V R R + Q E Sbjct: 243 DFKNAQRIHGNE-RKITLDSKEINKYSLKKSDILVIRVNGSADLAGRFIEYKQTYEIEGF 301 Query: 317 TSAYMAV--KPHGIDSTYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPI 372 ++ + I S +L ++ + S + ++ +K L + +P + Sbjct: 302 CDHFIRLRLNSEKISSRFLTFIANEGEGRFYLRNSLSTSAGQNTINQTSIKGLALSLPTL 361 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ +I + A D + +++ ++ + S + A G+ Sbjct: 362 PEQHEIVRRVEQLFAYADTIEKQVNNALTRVNNLTQSILVKAFRGE 407 >gi|229165872|ref|ZP_04293638.1| hypothetical protein bcere0007_8480 [Bacillus cereus AH621] gi|228617577|gb|EEK74636.1| hypothetical protein bcere0007_8480 [Bacillus cereus AH621] Length = 413 Score = 126 bits (316), Expect = 7e-27, Method: Composition-based stats. Identities = 64/404 (15%), Positives = 127/404 (31%), Gaps = 28/404 (6%) Query: 25 WKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ ++ G T + + + ++ T D + + Sbjct: 20 WEQRKFSEIAEIRRGLTYKPADVRDVGVRVLRSSNINEDTFVLKSDDVFVKAEAANIDF- 78 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 IL R + + + ++ ++ + + Sbjct: 79 VENEDILITSANGSSRLVGKHALISGINDNTVHGGFMLLARANRPQFVNALMSSNWYDKF 138 Query: 141 CE------GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + + + + +P EQ I E ID LI R +EL Sbjct: 139 INVFVSGGNGAIGNLSKSDLESQTVFVPNDEEQKKIGEF----FASIDNLIPLHQRKLEL 194 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LKE K++L+ + K +++ G E + KN E+ Sbjct: 195 LKETKKSLLQKMFPKNGANIPEIRFEGFTDAWEQRKLGEF----SEKVTEKNKNNIYSET 250 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMER 313 S YG I Q ++ + Y +V P + V+ I + ++ Sbjct: 251 LTNSAKYGIINQLDFFDKDISNEKNLDGYYVVRPDDFVYNPRISNLAPVGPINRNKLGRS 310 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLV 369 G+++ Y + H +D TYL S G R ++K + +P+ + Sbjct: 311 GVMSPLYYVFRTHNVDKTYLEKYFSSNSWHIFMKLNGDSGARSDRFAIKDSVFREMPIPI 370 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I EQ I N ++D L+ ++ + LK + S + Sbjct: 371 PSINEQTQIGNF----FKQLDNLITLHQRELNSLKNLKKSLLQQ 410 >gi|78773894|gb|ABB51239.1| type I RM system S subunit [Arthrospira platensis] Length = 417 Score = 126 bits (316), Expect = 7e-27, Method: Composition-based stats. Identities = 59/429 (13%), Positives = 128/429 (29%), Gaps = 44/429 (10%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGK 62 P YK + V G IP+ W+VV + T + S +I + ++ + Sbjct: 15 PGYKQTEV---GVIPEDWEVVRVGDLEPYVTSGSRGWAKYYSKYGASFIRITNLNKNSIY 71 Query: 63 YLPKDGNS----RQSDTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQF--LV 113 + + + G IL I + + Sbjct: 72 LNLNELKFVALPNHVNEGKRTRLKNGDILISITADIGIIGYINSSVPQPAYINQHISLVR 131 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 K+++ + + +L+ V + + + + I ++ + IPPL EQ I Sbjct: 132 FDLKNIVSKYIAYFLVCEKVQRFFRGSTDQGAKAGINLDKIRSLQLAIPPLPEQKAIASV 191 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + I +L + + Q L L ++ G EW ++ Sbjct: 192 LSDVDELISSLDKLIAKKRHIKTATMQQL--------LTGKTRLPGFGGEWETKSLEYLT 243 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + + ++ + + G Y I+D I+ Sbjct: 244 ECLDNLRIPLNEVQRARMKGNYPYCGANG--------------ILDYVNEYIIDDDIILL 289 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 D+ + R +G + +L V + SG Sbjct: 290 AEDGGYFDEHTTRPIAYRMKGKCWVNNHVHILKAKPGYHQDFLFYCLVHKNVLPFLASGT 349 Query: 354 RQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 R L ++ ++ + +P +EQ I +V++ + +E+ + + + Sbjct: 350 RAKLNKSEMNKIEINLPKNSEEQKAIASVLSDMDKE----IAALEKRRAKTQAIKQGMMQ 405 Query: 413 AAVTGQIDL 421 +TG+ L Sbjct: 406 ELLTGRTRL 414 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 77/218 (35%), Gaps = 16/218 (7%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKN-TKLIESNILSLSYGNIIQKLETRNMGL 275 K + + + + V VT +R + + N+ + N+ Sbjct: 17 YKQTEVGVIPEDWEVVRVGDLEPYVTSGSRGWAKYYSKYGASFIRITNLNKNSIYLNLNE 76 Query: 276 -------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPH 326 + + G+I+ I ++ V + I + Sbjct: 77 LKFVALPNHVNEGKRTRLKNGDILIS-ITADIGIIGYINSSVPQPAYINQHISLVRFDLK 135 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I S Y+A+ + + + F G + + + ++ L + +PP+ EQ I +V++ Sbjct: 136 NIVSKYIAYFLVCEKVQRFFRGSTDQGAKAGINLDKIRSLQLAIPPLPEQKAIASVLSDV 195 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 D L+ +++ I + +++ + +TG+ L G Sbjct: 196 ----DELISSLDKLIAKKRHIKTATMQQLLTGKTRLPG 229 >gi|237712397|ref|ZP_04542878.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 9_1_42FAA] gi|229453718|gb|EEO59439.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 9_1_42FAA] Length = 475 Score = 126 bits (316), Expect = 7e-27, Method: Composition-based stats. Identities = 65/408 (15%), Positives = 143/408 (35%), Gaps = 33/408 (8%) Query: 20 AIPKHWKVVPIKRF-TKLNTGRTSESGK--DIIYIGLEDVES-GTGKYLPKDGNSRQSDT 75 +P+ W + +L G + +S I + + ++ + GT Y +S D Sbjct: 70 EVPESWVWCRLDDIVCELKYGTSEKSSSVGKIAVLRMGNITNVGTIDYSNLVYSSNDEDI 129 Query: 76 STVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S+ K +L+ + + AI + +L+ ++ +++ Sbjct: 130 EQYSL-EKNDLLFNRTNSSEWVGKTAIYKEEQPAIYAGYLIRIKPLLISPDYLNTVMNSG 188 Query: 133 VT--QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + S+ + + + + +PIPPL EQ I ++ ID + + Sbjct: 189 YYRDWCYDVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVAEMDKWISLIDIVKNGKGD 248 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG---------------LVPDHWEVK 235 + ++K+ K ++ + L P + IE + +PD W Sbjct: 249 LLTVIKQAKSKILDLAIHGQLVPQDPNDEPPIELLKRINPDFTPCDNGHYTQLPDGWCYA 308 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP---ESYETYQIVDPGEIV 292 V +N KN + + + NI + + G+I Sbjct: 309 TIKE-VFIINPKNKADDDVEVGFVPMANITDGYNNTFKYETKQWGKIKTGFTHFANGDIA 367 Query: 293 FRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I N K + G+ T+ +P +D Y + +S Sbjct: 368 VAKISPCLENRKSVVLKGLPNGIGVGTTELHVFRPLFLDVQYGLYFFKSDYFISQCVGSF 427 Query: 351 SGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +G+ +Q + ++ + + +PPI EQ I ++ A++D+++E + Sbjct: 428 NGVVGQQRVSKNIIENMIIAIPPINEQKRIACAVHKIFAKLDMIMESL 475 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 77/199 (38%), Gaps = 7/199 (3%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIES--NILSLSYGNIIQKLETRNMGLKPESYET-- 282 VP+ W +V EL ++ S I L GNI L S + Sbjct: 70 EVPESWVWCRLDDIVCELKYGTSEKSSSVGKIAVLRMGNITNVGTIDYSNLVYSSNDEDI 129 Query: 283 -YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 ++ +++F + + + I + +KP I YL +M S Sbjct: 130 EQYSLEKNDLLFNRTNSSEWVGKTAIYKEEQPAIYAGYLIRIKPLLISPDYLNTVMNSGY 189 Query: 342 LCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 Y + + + ++ + + +L + +PP+KEQ I ++ + ID++ Sbjct: 190 YRDWCYDVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVAEMDKWISLIDIVKNGKGDL 249 Query: 400 IVLLKERRSSFIAAAVTGQ 418 + ++K+ +S + A+ GQ Sbjct: 250 LTVIKQAKSKILDLAIHGQ 268 >gi|302668599|ref|YP_003833047.1| type I restriction modification system S subunit HsdS2 [Butyrivibrio proteoclasticus B316] gi|302397563|gb|ADL36465.1| type I restriction modification system S subunit HsdS2 [Butyrivibrio proteoclasticus B316] Length = 408 Score = 126 bits (316), Expect = 8e-27, Method: Composition-based stats. Identities = 66/413 (15%), Positives = 135/413 (32%), Gaps = 34/413 (8%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 G W+ + + + TG T + D +++ D++ G + Sbjct: 10 GKFNDDWEQRKLIEISDIVTGTTPPTKDKDNYGGDRLFVSPADIQ-GNRYVDETITTLTE 68 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G L+ +G + K + Q + P + + + + Sbjct: 69 KGYALGRELRAGTTLFVSIGSTIGKVAQIKESATTNQQINAVIPNVEMD-DNFVFTMLEN 127 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++I+ + + + G + P EQ I E I + Sbjct: 128 EAEKIKKLAATQAVPIINKTTFGETEIQFPKKEEQTRIGEYFSNLDSLITLHQRKCDETK 187 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA--LVTELNRKNTK 250 EL K Q + P + I + G D WE + + +++ + Sbjct: 188 ELKKYMLQKMF---------PKNGERVPEIRFAGFT-DDWEQRKLGELAEIGDIDHRMPP 237 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPGEIVFRFIDLQNDKRS 304 +E I L G+ E G+K S E Y+ + + G+I+F R Sbjct: 238 TVEDGIPYLMTGDFCGINELNFEGVKHVSQEDYEQLSRKIKPEKGDIIFARYASVGAVRY 297 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVK 363 + + I S + + I+S YL + K + S + ++ + +K Sbjct: 298 V--DFTRDFLISYSCAIIKQSKKINSKYLYHYLTGDPAQKQIKLEINSSSQANIGIDSMK 355 Query: 364 R-LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + VL+P EQ I+ ++ +D L+ ++ LKE + + Sbjct: 356 NSITVLLPSADEQTKISEFLSG----LDNLITLHQRKSDELKELKKYMLKNLF 404 >gi|126179043|ref|YP_001047008.1| restriction modification system DNA specificity subunit [Methanoculleus marisnigri JR1] gi|125861837|gb|ABN57026.1| restriction modification system DNA specificity domain [Methanoculleus marisnigri JR1] Length = 394 Score = 126 bits (316), Expect = 8e-27, Method: Composition-based stats. Identities = 81/410 (19%), Positives = 153/410 (37%), Gaps = 34/410 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ W++V ++ K++ S G+ YIGLE++ES TG+ + S Sbjct: 5 ELPEGWRLVKLEEVAKIDNKAVSPDEMRGELQNYIGLENIESNTGQLVSFSETLGDDIKS 64 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDVT 134 F + ILYGKL PYL K + DF G+CST + ++P + E L +L + + Sbjct: 65 NKFGFTEEHILYGKLRPYLNKVYLPDFAGVCSTDIIPIKPDSDLLIREFLGYFLRTPEFV 124 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I A GA + + K + ++ +P+PP+ Q I + R Sbjct: 125 SMINAKSSGANLPRVNPKTLLDVYIPLPPIETQYKIVAILEKTEAT--------QRLRAE 176 Query: 195 LKEKKQALVSY-IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 Q L+ + +P K I + + R + Sbjct: 177 ADALTQKLMQNVFLEMFGDPATNPKGWDIVKL-----DAIAVLQRGKFSHRPRNEPRFYG 231 Query: 254 SNILSLSYGNIIQ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + G+I + +L T + L E + ++ G IV D L Sbjct: 232 GSYPFIQTGDISRSGGRLTTFSQTLNDEGLKISKLFKKGIIVIAIAANIGDTAILDFDSC 291 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 ++ ++ P + + ++R Y ++ + ++++ + + L V++P Sbjct: 292 FPDSVVG---VSPMPDKANPIFTEMMLRHYKNI-LWDSAPETAQRNINLKILSDLNVILP 347 Query: 371 PIKEQFDITNVIN--VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P+ Q + T I + S LL ++ A TG+ Sbjct: 348 PLDLQNRFAKIAQSIQVTRDIQNKSAVEKSS--LLNN----LMSKAFTGE 391 >gi|126726006|ref|ZP_01741848.1| putative type I restriction-modification system, S subunit [Rhodobacterales bacterium HTCC2150] gi|126705210|gb|EBA04301.1| putative type I restriction-modification system, S subunit [Rhodobacterales bacterium HTCC2150] Length = 371 Score = 126 bits (316), Expect = 8e-27, Method: Composition-based stats. Identities = 62/403 (15%), Positives = 124/403 (30%), Gaps = 37/403 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W + T+ G + G+ ST + Sbjct: 2 VPEGWGECRLGEVTEFQRGFDLPKS-----------QRQVGEIPIISSAGYSGWHSTAKV 50 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G I+ G+ G ++ + +T V + L +ID + + Sbjct: 51 ERAG-IVTGRYGSIGDVFLVYEDHWPLNTTLWVKDFHGNHIQWAYHLLQTIDYAKFSDK- 108 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + I I + +PPL EQ I E + D I + K +K+ Sbjct: 109 ---TGVPGINRNDIHRIKVRVPPLPEQRKIAEIL----GTWDRAIEVAEAQLAAAKTQKR 161 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 +L+ ++T K + S E + + KN I +S Sbjct: 162 SLMQQLLTG------KRRFSEFEGQPWKEVRLGDVGQVITGSTPSTKNEHYYGGPIPFVS 215 Query: 261 YGNIIQKLETRN--MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 ++ + + L E + V G +F I + Sbjct: 216 PADLDGRTLIYSAQKTLTHEGMSVSRTVPKGATLFSCIGYIGKVGLAGV-----DLVTNQ 270 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 AV P+ + + S KV G + + + + +PP++EQ I Sbjct: 271 QINAVVPNSSVDSEYLFYALSAIGPKVKLLAGHNVVPIVNKSEFSLQRITLPPLREQKKI 330 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + A ++V + I L+ +++ + +TG+ + Sbjct: 331 AASLG--IADVEVAIFSTN--IENLRTEKNALMQQLLTGKRRV 369 >gi|19746826|ref|NP_607962.1| specificity determinant HsdS [Streptococcus pyogenes MGAS8232] gi|19749064|gb|AAL98461.1| putative specificity determinant HsdS [Streptococcus pyogenes MGAS8232] Length = 395 Score = 126 bits (316), Expect = 8e-27, Method: Composition-based stats. Identities = 60/404 (14%), Positives = 128/404 (31%), Gaps = 38/404 (9%) Query: 24 HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ + + + G + ++ DI ++ + DV G+ + Sbjct: 17 EWEEKKLGEISNIVRGASPRPIQDPKWFDAKSDIGWLRISDVTEQEGRITYLQQRISELG 76 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +L + I G+ + L PK + Sbjct: 77 QEKTRVLKDPHLLLSIAATVGKPVINYVKTGVHDGFLVFLDPKF---NREFMFQWLDMFR 133 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + + + + I N + +P L EQ I E +D LI + + + Sbjct: 134 PYWNKYGQPGSQVNLNSEIIRNQVINLPSLPEQEAIGE----LFQTVDQLIQLQRQKLAT 189 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LKE+KQ + + P K I G + WE K +V ++ Sbjct: 190 LKEQKQTFLRKMF-----PAQGQKVPEIRLQGFDGE-WEEKELGDIVQITMGQSPSSQNY 243 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVME 312 Y + + +N + P + T D G+I+ D ++ Sbjct: 244 TTNPSDYILVQGNADIKNGYVFPRVWTTQITKQADKGDIILSVRAPVGDVGKTNYHVIIG 303 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 RG+ + ++ +++ + + +G S+ D+K + +P Sbjct: 304 RGVAA---------IKGNEFIFQILKYLKEIGYWKRISTGSTFDSISSSDIKYAKIQIPS 354 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + EQ I N +D + + E+ + LK + + + Sbjct: 355 LSEQEAIGNF----FQTLDQQIAQSEEKLTELKALKQTLLNRLF 394 >gi|296122896|ref|YP_003630674.1| Restriction endonuclease S subunits-like protein [Planctomyces limnophilus DSM 3776] gi|296015236|gb|ADG68475.1| Restriction endonuclease S subunits-like protein [Planctomyces limnophilus DSM 3776] Length = 413 Score = 126 bits (315), Expect = 8e-27, Method: Composition-based stats. Identities = 60/416 (14%), Positives = 127/416 (30%), Gaps = 28/416 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVES------GTGKYLPKDGNSRQSDTST 77 W + + N G G+ ++ + + + GT ++ Sbjct: 4 GWIYKTLDDVCEFNNGL--WKGEKPPFVTVGVIRNTNFTKEGTLDDSDIAYIEVEAKKFE 61 Query: 78 VSIFAKGQILYGKLG-----PYLRKAIIADFDGICST-----QFLVLQPKDVLPELLQGW 127 G ++ K G P R A+ G S V PK + L + Sbjct: 62 KRRLVFGDLILEKSGGGPKQPVGRVALFDKRAGDFSFSNFTAAIRVKDPKTLDFRFLHKF 121 Query: 128 LLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L ++ E + +T + + + I +P+PPL EQ I + + T Sbjct: 122 LFWTHLSGVTETMQSHSTGIRNLNGDVYKCIEVPLPPLTEQRRIVGILDEAFEGLATAKA 181 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKM--KDSGIEWVGLVPDHWEVKPFFALVTEL 244 + ++ + ++ + + T+ + V+ KD G + Sbjct: 182 NAEKNLQNARALFESHLQAVFTQRGDGWVEKTVKDVASPIKGSIRTGPFGSQLLHSEFVD 241 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 I++ + N + ++R + V PG+++ + Sbjct: 242 EGIAVLGIDNAV-----ANEFRWGKSRFITKDKFGQLERYRVYPGDVLITIMGTCGRCAV 296 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGL-RQSLKFEDV 362 + + + +YL + + + G L + Sbjct: 297 VPDDIPTAINTKHICCITLDWKKCLPSYLHLYFLHAQQSQAFLAKHAKGAIMAGLNMGLI 356 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + LPVL+PP + Q I N L ++ + L E + S + A +G+ Sbjct: 357 QELPVLLPPTQVQSAIVEAANDLREETQRLESLYQRKLAALDELKKSLLHRAFSGE 412 >gi|298253165|ref|ZP_06976957.1| restriction endonuclease S subunit [Gardnerella vaginalis 5-1] gi|297532560|gb|EFH71446.1| restriction endonuclease S subunit [Gardnerella vaginalis 5-1] Length = 420 Score = 126 bits (315), Expect = 9e-27, Method: Composition-based stats. Identities = 56/410 (13%), Positives = 126/410 (30%), Gaps = 26/410 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQS-DT 75 P + I ++ G + ++ + D++ G + + Sbjct: 14 PNGVEYKKIGDIADVSIGLATSVTKYKRDSGVLLLHNSDIQQGRIELKNIEHIDDSFAKK 73 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 ++ + K I+ G A+I D S F + + + + L Sbjct: 74 NSSKLLRKNDIITIHTGDVGTSAVITDEYAG-SIGFTTITSRIKDFNQVYPYYLCTYFNS 132 Query: 136 RIEAI----CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 I + S+ + K I +P+PPL Q I + T L E Sbjct: 133 HKCKIDIRKMTISDRSNLNQKDFIKIQVPVPPLEVQREIVRILDNFTELTAELTAELTAR 192 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + + + L++ + + + + ++ ++ ++ + + Sbjct: 193 KKQYEYYRDTLLA------FDDNNPLHSLISRYCTNGVEYKKIGDIASVDRGGSLQKKDF 246 Query: 252 IESNILSLSYGNIIQKLETR----NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 E I + YG I K + E + +IV + Sbjct: 247 CEHGIPCIHYGQIYTKYGLFASKSYTFIDSECASKQRFAHKNDIVMAVTSENIEDVCKCV 306 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLP 366 A E S + A+ H ++ YL + S + G + + D+ + Sbjct: 307 AWFGEEDAAVSGHSAIIRHNQNAKYLVYYFHSSMFFLQKKKLAHGTKVIEVTPSDLLDVK 366 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + VPP++ Q I +++ A + L + + I ++ R + Sbjct: 367 IPVPPLEVQRQIVQILDRFDALCNDLTQGLPAEIEARRKQYEYYRDQLLT 416 >gi|160934946|ref|ZP_02082332.1| hypothetical protein CLOLEP_03821 [Clostridium leptum DSM 753] gi|156866399|gb|EDO59771.1| hypothetical protein CLOLEP_03821 [Clostridium leptum DSM 753] Length = 444 Score = 126 bits (315), Expect = 9e-27, Method: Composition-based stats. Identities = 59/402 (14%), Positives = 135/402 (33%), Gaps = 9/402 (2%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +PK+W V + L +GR ++ + + + IG+ + G + + Sbjct: 29 KVPKNWCWVRFSKIINLISGRDAKLTDCNSLGIGIPYIL-GASNLENNVFTIERWIENPQ 87 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I K +L G + + + S Q + ++ L + L +++ Sbjct: 88 VISLKNDVLLSVKGTIGKVYLQKEEKVNISRQIMAIRTSSTL-FPRFTYWLVNNISDSFR 146 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G + + I +P PPL EQ I ++I + ++D + + + + Sbjct: 147 QAGNGL-IPGISREDILQKEVPFPPLPEQQRIVDRIESLFAKLDEAKQKTQEALNSYETR 205 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE-LNRKNTKLIESNIL 257 K A++ T L + + ++ + + Sbjct: 206 KAAILHKAFTGELTARWRKEHGLGMESWEKYKFNDILDVRDGTHDSPTYFDQGFPLITSK 265 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 +L G I K + VD G+I+F I + + + + I Sbjct: 266 NLKDGKITDKDLKFISKEDYDKINERSKVDIGDILFAMIGTIGNPVVV---ETQPKFAIK 322 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF 376 + + ++ + + S + G ++ + ++ +L+P KEQ Sbjct: 323 NVALFKNIGKASPYFVKYFLESKKVIDRMEKDAKGSTQKFVSLGYLRAFNILLPKSKEQT 382 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +I +++ A+ E E + + + S +A A G+ Sbjct: 383 EIVRILDDLLAKEQQAKEAAEAVLDQIDLMKKSILARAFRGE 424 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 79/201 (39%), Gaps = 7/201 (3%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 E VP +W F ++ ++ ++ KL + N L + I+ N E Sbjct: 22 PDWEQPYKVPKNWCWVRFSKIINLISGRDAKLTDCNSLGIGIPYILGASNLENNVFTIER 81 Query: 280 YETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + I +++ Q E+ I+ MA++ + + Sbjct: 82 WIENPQVISLKNDVLLSVKGTIGKVY----LQKEEKVNISRQIMAIRTSSTLFPRFTYWL 137 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 ++ F G+GL + ED+ + V PP+ EQ I + I A++D +K + Sbjct: 138 -VNNISDSFRQAGNGLIPGISREDILQKEVPFPPLPEQQRIVDRIESLFAKLDEAKQKTQ 196 Query: 398 QSIVLLKERRSSFIAAAVTGQ 418 +++ + R+++ + A TG+ Sbjct: 197 EALNSYETRKAAILHKAFTGE 217 >gi|119943936|ref|YP_941616.1| restriction modification system DNA specificity subunit [Psychromonas ingrahamii 37] gi|119862540|gb|ABM02017.1| restriction modification system DNA specificity domain [Psychromonas ingrahamii 37] Length = 400 Score = 126 bits (315), Expect = 9e-27, Method: Composition-based stats. Identities = 62/409 (15%), Positives = 127/409 (31%), Gaps = 30/409 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKY-LPKDGNSRQSDTSTVS 79 WK + ++ + D I I + + T Y L + N + Sbjct: 2 EWKTEKLGNVCEIIKRGIAPKYVDEGGICVINQKCIRDHTVNYSLARRHNLIIKSVNEER 61 Query: 80 IFAKGQILYGKLGP-YLRKAIIADFDGI----CSTQFLVLQPK---DVLPELLQGWLLSI 131 G +L G L + I T +++PK + Sbjct: 62 YVQVGDVLINSTGTGTLGRVAQVRNMPIEPTTVDTHVTIVRPKNGLFHNDFFGYMLIKIE 121 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + G T + EQ I + ++ T+ + Sbjct: 122 EEITSAGEGASGQTELARTKLQNDFFVSYPDSIQEQKRIVVLLDTVFADLEQTRTKTEQN 181 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ +E + + + +K V+ S I VG P + + Sbjct: 182 LKNARELFDSYLQQLFSKKSEGWVEKTLSEIAHVG-----TGGTPLKSTIGFWGGDIPWY 236 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + S ++ G ++ D +L+ + + Sbjct: 237 SSG-----ELNDTYTLASKNKITEVGLSGSNAKLFPKGSLLIGMYDT----AALKMSILD 287 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-VFYAMGSGL-RQSLKFEDVKRLPVLV 369 G A VKP+ + L +++ S ++ K + G+ +++L +K +P+ + Sbjct: 288 RDGTFNQAVAGVKPNPKIN--LEFILHSINVIKPELLKLRRGVRQKNLNQSKIKNIPIRL 345 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P I EQ I IN + ++LV Q I + E + S + A +G+ Sbjct: 346 PTIAEQIKIVAEINDLEEKTNLLVNIYSQKITSIDELKKSILQKAFSGE 394 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 68/195 (34%), Gaps = 7/195 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + W + + TG T G DI + ++ S Sbjct: 202 EGWVEKTLSEIAHVGTGGTPLKSTIGFWGGDIPWYSSGELNDTYTLASKNKITEVGLSGS 261 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 +F KG +L G K I D DG + ++P + L + Sbjct: 262 NAKLFPKGSLLIGMYDTAALKMSILDRDGTFNQAVAGVKPNPKIN-LEFILHSINVIKPE 320 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + G + + I NIP+ +P +AEQ+ I +I + + L+ + I + Sbjct: 321 LLKLRRGVRQKNLNQSKIKNIPIRLPTIAEQIKIVAEINDLEEKTNLLVNIYSQKITSID 380 Query: 197 EKKQALVSYIVTKGL 211 E K++++ + L Sbjct: 381 ELKKSILQKAFSGEL 395 >gi|150401945|ref|YP_001329239.1| restriction modification system DNA specificity subunit [Methanococcus maripaludis C7] gi|150032975|gb|ABR65088.1| restriction modification system DNA specificity domain [Methanococcus maripaludis C7] Length = 432 Score = 126 bits (315), Expect = 9e-27, Method: Composition-based stats. Identities = 64/436 (14%), Positives = 142/436 (32%), Gaps = 29/436 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSESGK------DIIYIGLEDVESGTG 61 ++KD+ IG IP W+V+ +K+ T+ + +G T ++ K DI ++ + + + Sbjct: 4 EFKDTE---IGKIPVDWEVLELKQVTENIFSGGTPDTRKPEYWNGDIPWLSSGETRNNSI 60 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDV 119 K + + S+ + K I+ G +A D + + L+ K Sbjct: 61 TETEKKITYKGVENSSTRLAKKEDIVIASAGQGYTRGQASFCKIDTYINQSIVALRTKKE 120 Query: 120 L-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L L + +++ + ++ K + + +PIPPL EQ I + + A Sbjct: 121 LVNPLFLYYNITLRYNELRAISDSHSSRGSLTTKLLAPLKIPIPPLEEQQKIAQILSALD 180 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV-----PDHWE 233 +I+ + E + + +++ G P+ W Sbjct: 181 DKIENNNQQNKILEETANSIFKEWFVDFNFLNEDGLSYLENDGEMEFNEELEIEIPEGWN 240 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE--- 290 VK + T + K + + +I + G Sbjct: 241 VKYLDEICTVMGGGTPKTNVPEYWQDGTILWATPTDMTSKKSPVIDTTEKKITELGLKES 300 Query: 291 ----IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 + I + + S+ M+ ++ + S Y + + K+ Sbjct: 301 SAKLVPKGSILMTSRATIGYSSIAMKEISTNQGFINIICDKKVSNYFILYLLEHIKDKII 360 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + + K + V+VP + +I + + K + L Sbjct: 361 ALANGSTFLEISKTNFKNIRVIVPDYQTMEKYNEIIEELINK----IYKNSKENQNLSNL 416 Query: 407 RSSFIAAAVTGQIDLR 422 R + ++G+I L+ Sbjct: 417 RDLLLPKLMSGEIRLK 432 >gi|153838491|ref|ZP_01991158.1| hypothetical Type I restriction enzyme EcoEIspecificity protein [Vibrio parahaemolyticus AQ3810] gi|149748114|gb|EDM58973.1| hypothetical Type I restriction enzyme EcoEIspecificity protein [Vibrio parahaemolyticus AQ3810] Length = 391 Score = 126 bits (315), Expect = 1e-26, Method: Composition-based stats. Identities = 72/406 (17%), Positives = 152/406 (37%), Gaps = 25/406 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + +P W+ I+ + +G+ G++ GN + Sbjct: 5 LYKLPDGWEWKRIEDIFTITSGKNLTKKDMH----------DEGEFPVYGGNGIAGRYND 54 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 ++ I+ G++G + + D + F + + K + + +L + T Sbjct: 55 FNLSGSN-IIIGRVGALCGNVRLVNSDIWVTDNAFFIKEYKVDILKE---YLAKVLSTLN 110 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + A A +KGI ++ +P PPL EQ I EKI A RIDT I I L Sbjct: 111 LGATANKAAQPVISYKGIKDLVIPYPPLDEQKRIVEKIDALLTRIDTAIEHLQESITLAD 170 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + ++ + P +S + G V + + + I Sbjct: 171 ALYASELNEVF-----PSDADIESLSDKAGWVSLSDICTFENGDRGKNYPSKSAFVAEGI 225 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERG 314 +S GN+ ++ + GL + E Y ++ G I I + ++ ++ G Sbjct: 226 PVVSAGNLGERYI-DHKGLNYITPERYDLLRSGRIKIGDILFCLRGSLGKVAISKDIDEG 284 Query: 315 IITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPI 372 +I S+ + ++P S + ++S + +G + +L + + + + +P Sbjct: 285 VIASSLVIIRPKACVSAEYIYKYLKSSLCQQFISFYNNGAAQPNLSAKSLGKFMLPLPNA 344 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ I + ++ + L++ + I LK ++S + +A G+ Sbjct: 345 DEQKIIIDGLDEKYQHNQKLLDALRDKIDSLKILKASILDSAFKGE 390 >gi|172040757|ref|YP_001800471.1| type I restriction-modification system, specificity subunit [Corynebacterium urealyticum DSM 7109] gi|171852061|emb|CAQ05037.1| type I restriction-modification system, specificity subunit [Corynebacterium urealyticum DSM 7109] Length = 397 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 77/409 (18%), Positives = 143/409 (34%), Gaps = 20/409 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 I H+ + P + L ++ G+ + L+ +E TG+ L G + Sbjct: 2 IDSHFPLAPFWALSSLVNEVSTPQGE---LVSLDRIEGKTGRLLQGGG----ESNANGRH 54 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-SIDVTQRIEA 139 F K +L+GKL PYL K +AD G V +P +++ S T Sbjct: 55 FRKDDVLFGKLRPYLAKYWLADRPGTAQGDIHVYRPTLRTDPRFLAYIVGSDYFTGLANT 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+ M +W + +P PP Q I + + ET ID + + + LL E++ Sbjct: 115 SSTGSKMPRVEWPKVAQFRVPFPPRRTQRAIADYLDRETAEIDAMTADLDKMEALLTERR 174 Query: 200 QALVSYIVTKGLN-PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 ++ + LN P + ++G + + + N Sbjct: 175 AEILRSWFGEQLNNPRAPLATIAELYIGKMEQPRQKSADEIYAPFFHSAN---------- 224 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + G +I + + ++ G++V + + + Sbjct: 225 IRPGGMIDLECSVKHMWFRPDELDHMLLRKGDVVVVEGGAAGRPGYIAKSVDGWGIQKSV 284 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFD 377 + YL + + F S E R V V + +Q Sbjct: 285 IRARPFEDKVIGKYLFYALTFAFEDGQFDLQASLATLAHFPAEKAARFRVPVRSLADQEL 344 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + ++ + + + ++ I LL ERR++ IAAAVTGQID+ + Sbjct: 345 VVARLDRDLSSLSDMLADITALRDLLAERRAALIAAAVTGQIDIPTAEE 393 >gi|237798535|ref|ZP_04586996.1| putative type I restriction-modification system restriction subunit [Pseudomonas syringae pv. oryzae str. 1_6] gi|331021388|gb|EGI01445.1| putative type I restriction-modification system restriction subunit [Pseudomonas syringae pv. oryzae str. 1_6] Length = 437 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 57/429 (13%), Positives = 140/429 (32%), Gaps = 41/429 (9%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 +P+++D+ WK V +++ + T R E + I + Sbjct: 23 RFPEFRDA---------SGWKPVTLRKASVPVTERVGERKLTPVSISAGVGFVPQAEKFG 73 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLPE 122 +D + +Q ++ G ++ K + G + + + + Sbjct: 74 RDISGKQYKL--YTLVRDGDFVFNKGNSLKFPQGCVYLLQGWGQVAAPNVFICFRLKDDY 131 Query: 123 LLQGWLLSIDVTQRIEAICEGAT-------MSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + + Q + + T + + + + +P+P L EQ I + Sbjct: 132 SNGFFQNCFEQNQHGNQLKKHITSGARSNGLLNISKETFFGVEIPVPLLPEQQKIANCLS 191 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 + + R + LK K+ L+ I + +++ + G H Sbjct: 192 SLDELTAA----QTRKVYALKSHKKGLMQQIFPQEGETQPRLRFPEFKNAGEWNAHPFED 247 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEI 291 + + L GN+ + L P+S+E++ ++ G+I Sbjct: 248 FVAKSFYGTSSSTSP--TGQYPVLRMGNMSDGRLDFTNLVYIDLDPDSFESF-RLEEGDI 304 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAM 349 + + + Q+ I S + + + ID ++ +++ + + Sbjct: 305 LLNRTNSPALVGKISLFQLKSECITASYIVTYRLKKNRIDPSFCNYMLNTPLYQARIKKL 364 Query: 350 G--SGLRQSLKFEDVKR-LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 S + ++ K+ L V VP + EQ I + + +D L+ Q + L+ Sbjct: 365 AKPSISQANINPTTFKKELIVSVPALLEQQRIADCLTA----LDDLIAAQTQRLDSLRTH 420 Query: 407 RSSFIAAAV 415 + + + Sbjct: 421 KKALMQQLF 429 >gi|148976279|ref|ZP_01813003.1| type I restriction-modification enzyme, S subunit [Vibrionales bacterium SWAT-3] gi|145964373|gb|EDK29628.1| type I restriction-modification enzyme, S subunit [Vibrionales bacterium SWAT-3] Length = 411 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 69/417 (16%), Positives = 144/417 (34%), Gaps = 21/417 (5%) Query: 21 IPKHWKVVPIKRFTK--LNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W+ +K K ++ G + + + D+ T + S + Sbjct: 2 VPNGWEEKSLKDICKKTISYGIVQTGENIENGVPCVRVVDLSKNTLNPVEMIKTSDKIHQ 61 Query: 76 S-TVSIFAKGQILYGKLGPYL--RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S +I +G+++ G +K + L P + W L + Sbjct: 62 SYKKTILCEGELMMALRGEIGLVKKVTPELVGANITRGLARLSPIKSVDSDYLLWTLRSN 121 Query: 133 VTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + G + + + + +PIPPL EQ I + + D I + Sbjct: 122 KIKNELSRKSGGSALQEIALGSLRKVVLPIPPLPEQRKIAQIL----STWDRGIATTEKL 177 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 IE K++K+AL+ ++T + ++G + G H F L +K Sbjct: 178 IETSKQQKKALMQQLLTGK--KRLVNPETGKAFEGEWERHSMSDLVFIDRKSLGKKTPDD 235 Query: 252 IESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 E +SLS + E +++ G+I+ + + S + Sbjct: 236 FEFQYISLSDVAVGSISKELEVHKFASAPSRARRVIQEGDILLSTVRPNLKGFAKVSEKH 295 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLV 369 + T + + Y+ + S + ++ G ++ DV L V Sbjct: 296 ADCIASTGFSVLTPKKRVSGDYIHQYIFSSHVTGQIDSLVVGSNYPAINSSDVAGLKVYC 355 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 P +EQ I +V+ I+VL E + K+ + + + +TG ++ + + Sbjct: 356 PTYEEQQKIASVLTAADKEIEVL----EAKLAHFKQEKKALMQQLLTGNRRVKVDEE 408 >gi|71904273|ref|YP_281076.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS6180] gi|71803368|gb|AAX72721.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS6180] Length = 395 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 60/404 (14%), Positives = 129/404 (31%), Gaps = 38/404 (9%) Query: 24 HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ + + + G + ++ DI ++ + DV G+ + Sbjct: 17 EWEEKKLGEISNIVRGASPRPIQDPKWFDAKSDIGWLRISDVTEQEGRITYLQQRISELG 76 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +L + I G+ + L PK + Sbjct: 77 QEKTRVLKDPHLLLSIAATVGKPVINYVKTGVHDGFLVFLDPKF---NREFMFQWLDMFR 133 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + + + + I N + +P L EQ I E +D LI + + + + Sbjct: 134 PYWNKYGQPGSQVNLNSEIIRNQVINLPSLPEQEAIGE----LFQTVDQLIQLQRQKLAI 189 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LKE+KQ + + P K I G + WE K +V ++ Sbjct: 190 LKEQKQTFLRKMF-----PAQGQKVPEIRLQGFDGE-WEEKELGDIVQITMGQSPSSQNY 243 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVME 312 Y + + +N + P + T D G+I+ D ++ Sbjct: 244 TTNPSDYILVQGNADIKNGYVFPRVWTTQITKQADKGDIILSVRAPVGDVGKTNYHVIIG 303 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 RG+ + ++ +++ + + +G S+ D+K + +P Sbjct: 304 RGVAA---------IKGNEFIFQILKYLKEIGYWKRISTGSTFDSISSSDIKYAKIQIPS 354 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + EQ I N +D + + E+ + LK + + + Sbjct: 355 LSEQEAIGNF----FQTLDQQIAQSEEKLTELKALKQTLLNRLF 394 >gi|254436045|ref|ZP_05049552.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] gi|207089156|gb|EDZ66428.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] Length = 369 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 68/363 (18%), Positives = 130/363 (35%), Gaps = 30/363 (8%) Query: 81 FAKGQILYGKLGPYLRKAIIADFDG------ICSTQFLVLQPKDVLPELLQ--GWLLSID 132 KG ++ K + + +C +L+P + Sbjct: 15 LEKGDVIITKDSETPDDIAVPSYVSDDLSGVVCGYHLTLLKPDQDESDGEFLSHLFQLPS 74 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 V + G T I P+ PPL EQ I + +D +I + I Sbjct: 75 VQHYFYILANGITRFGLTADAINEAPLLTPPLPEQQKIAAIL----SSVDDVIEKTRAQI 130 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA-----LVTELNRK 247 LK+ K A++ ++TKG+ + KDS + G +P W + +V + + Sbjct: 131 HKLKDLKTAMMQELLTKGIG-HTEFKDSPV---GRIPVGWSICSAGEVAVAIMVGVVVKP 186 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR---FIDLQNDKRS 304 +ES + +L N+ + T + LK S ++ +I+ ++ + + Sbjct: 187 AQYYVESGVPALRSANVRENGLTMD-NLKYFSEDSNEILKKSRLIKGDLLTVRTGYPGTT 245 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVK 363 E + IDS + + S G +Q D+K Sbjct: 246 AVVTDEFEGCNCIDVVITRPSSRIDSDFFCLWVNSDHGKGQVLKAQGGLAQQHFNVSDMK 305 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 L V+VP + EQ I N +N T + + E+ + LL + + + + +TG++ + Sbjct: 306 NLTVVVPSLTEQKAIFNAVNSVTKK----IALTEKRLTLLLDTKKALMQDLLTGKVRVNV 361 Query: 424 ESQ 426 E + Sbjct: 362 EQE 364 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 68/209 (32%), Gaps = 12/209 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGK 62 ++KDS V G IP W + + + +V Sbjct: 153 EFKDSPV---GRIPVGWSICSAGEVAVAIMVGVVVKPAQYYVESGVPALRSANVRENGLT 209 Query: 63 YLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVL 120 K + ++ S KG +L + G A++ D C+ ++ +P + Sbjct: 210 MDNLKYFSEDSNEILKKSRLIKGDLLTVRTGYPGTTAVVTDEFEGCNCIDVVITRPSSRI 269 Query: 121 PELLQGWLLSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 ++ D ++ G H + + N+ + +P L EQ I + + T Sbjct: 270 DSDFFCLWVNSDHGKGQVLKAQGGLAQQHFNVSDMKNLTVVVPSLTEQKAIFNAVNSVTK 329 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVT 208 +I ++ K Q L++ V Sbjct: 330 KIALTEKRLTLLLDTKKALMQDLLTGKVR 358 >gi|261404908|ref|YP_003241149.1| restriction modification system DNA specificity domain-containing protein [Paenibacillus sp. Y412MC10] gi|261281371|gb|ACX63342.1| restriction modification system DNA specificity domain protein [Paenibacillus sp. Y412MC10] Length = 384 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 56/402 (13%), Positives = 129/402 (32%), Gaps = 34/402 (8%) Query: 25 WKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY--LPKDGNSRQSDTS 76 W+ V + ++ G T ++ +I++I ++ T + + Sbjct: 4 WEKVRLGDVCEVIGGSTPKTSVKEYWDGEILWITPAELNDTTIIIRDTQRKITDKAISEL 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 ++ G +L P K I + C+ F L + + + + Sbjct: 64 SLKKLPVGTVLLSSRAPI-GKVAITGKEMYCNQGFKNLVCSESV-FNKYLFWFLKGKGEF 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + ++ GAT + NI P+PPL Q I + A + + + EL Sbjct: 122 LNSLGRGATFKEISKSIVENIVFPLPPLEVQKQIAATLDAASELLTMRKQQLSELDEL-- 179 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + S +P K G + + +R N + +I Sbjct: 180 -----IKSVFYEMFGDPVTNEK-------GWILSTFGNIGVLNSGGTPSRSNNSYFKGSI 227 Query: 257 LSLSYGNIIQKL---ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 S G + Q+ + + +I G ++ D K + + Sbjct: 228 NWFSAGELNQRYLLNSNEKITQLAIEQSSAKIFKAGSLLIGMYDTAAFKLGILAYDAASN 287 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + + ++ +L R + G +++L +K L + +PP+ Sbjct: 288 QACAN--IQINEQLVNIEWLYDCARIMRPHFLSNRRGVR-QKNLNLGMIKNLEIPLPPLD 344 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q +++ +I+ ++Q+I ++ S ++ Sbjct: 345 LQIQFADIV----TKIEEQKTLVKQAIDETQQLFDSLMSQYF 382 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 65/192 (33%), Gaps = 10/192 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 K W + LN+G T + I + ++ + + S Sbjct: 196 KGWILSTFGNIGVLNSGGTPSRSNNSYFKGSINWFSAGELNQRYLLNSNEKITQLAIEQS 255 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + IF G +L G K I +D + +Q + L + + + + Sbjct: 256 SAKIFKAGSLLIGMYDTAAFKLGILAYDAASNQACANIQINEQLVNIEWLYDCARIMRPH 315 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G + + I N+ +P+PPL Q+ + +I+ T + I+ + Sbjct: 316 FLSNRRGVRQKNLNLGMIKNLEIPLPPLDLQIQFADI----VTKIEEQKTLVKQAIDETQ 371 Query: 197 EKKQALVSYIVT 208 + +L+S Sbjct: 372 QLFDSLMSQYFD 383 >gi|332992564|gb|AEF02619.1| restriction modification system DNA specificity domain protein [Alteromonas sp. SN2] Length = 512 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 74/467 (15%), Positives = 158/467 (33%), Gaps = 71/467 (15%) Query: 21 IPKHWKVVPIKRFT-KLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP WK + + G T +I ++ ++D+ + + Sbjct: 3 IPASWKQTELSEILLSIIGGGTPSKSIPSYYEGNIPWMSVKDMNKSILQDTVDHISEEAV 62 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+ ++ G + L K ++A+FD + L P + + Sbjct: 63 KNSSTNVIPSGTPIVAT-RMSLGKIVVANFDSAINQDLKALFPASGVN-HEYLIGWYRSI 120 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++++E + G T+ + + ++ P+PPL EQ +I +K+ +++ R E Sbjct: 121 SRKVEELGMGTTVKGIRLEVLKSLEFPLPPLGEQKVIADKLDTLLAQVEATKARLERIPE 180 Query: 194 LLKEKKQALVSYIVTKGL-------NPDVKMKDSGIEWVG-------------------- 226 +LK +Q++++ V+ L N K+ +E + Sbjct: 181 ILKTFRQSVLADAVSGKLTEEWRAVNKSDFTKEERLEEIRKYKYETWIEEQEAKYEAKGK 240 Query: 227 -----------------------LVPDHWEVKPFFALVTELNRKNTKLIESN-------- 255 +P+ W +P LV R K ++++ Sbjct: 241 WPKTDSWKKKYKEAEIDPEFKSRELPESWVNQPLDGLVYISARIGWKGLKASEYTQSGPL 300 Query: 256 ---ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + SL+YG ++ E N+ + ++ +I+ K + Sbjct: 301 FLSVHSLNYGREVKLSEAFNISPERYDESPEIMLQNDDILLCKDGAGIGKIGIVKNLAEP 360 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPP 371 I +S + +L + + + ++ M L DVK + +PP Sbjct: 361 ASINSSLLLIRSGKYFVPEFLYFFLAGPTMQRLVQERMTGSAVPHLFQRDVKEFVLEIPP 420 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I EQ +I + A D + +K ++ + S +A A G+ Sbjct: 421 ISEQREIVRRVEELLAFADGIEQKANAALQRVNNLTQSILAKAFRGE 467 >gi|291566632|dbj|BAI88904.1| type I restriction-modification enzyme S subunit [Arthrospira platensis NIES-39] Length = 417 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 64/435 (14%), Positives = 128/435 (29%), Gaps = 56/435 (12%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD 67 P YK + V G IP+ W+ + + + ++ + P Sbjct: 15 PGYKQTEV---GVIPEDWETNLLGDVVEFLDSKRKPVKEEQ--------RAKMRGIYPYY 63 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYL-----RKAIIADFDGICSTQFLVLQPKDVLPE 122 G S D +F + IL G+ G + R + VL+PK Sbjct: 64 GASGIVDYVNDYLFDEDLILMGEDGENILSRNIRLVWQVSGKIWVNNHAHVLRPKSNFNI 123 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L G + + NI + +PPL EQ I + I Sbjct: 124 GFLTEYLESL---DYSLYNSGTAQPKLNQQTCCNIVIALPPLPEQKAIASVLSDVDELIS 180 Query: 183 TLITERIRFIELLKEKKQALVS----------------YIVTKGLNPDVKMKDSGIEWVG 226 +L + + Q L++ G K +G Sbjct: 181 SLDKLIAKKRHIKTATMQQLLTGKTRLPGFGEGMGYQKSAKGMGYQKSAKGMGYQKSAIG 240 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 L+P+ WEVK ++ + K+ N G+ P +I Sbjct: 241 LIPEDWEVKQLGDVLKICHGKSQHH-----------------IISNNGIYPILGTGGEIG 283 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 ++ + ++ A + + + + ++ ++L + Sbjct: 284 KTNTFLYNRPSVLIGRKGTIDAPIYIDTPFWTIDTLFYSQILSNANAKFIFYKFNLIDWY 343 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + SL ++ + PP+ EQ I +V++ + +E+ + Sbjct: 344 SYNEASGVPSLNAATIEDINQSFPPLPEQKAIASVLSDMDKE----IAALEKRRAKTQAI 399 Query: 407 RSSFIAAAVTGQIDL 421 + + +TG+ L Sbjct: 400 KQGMMQELLTGRTRL 414 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 72/204 (35%), Gaps = 13/204 (6%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 VG++P+ WE +V L+ K + E + Y Sbjct: 21 EVGVIPEDWETNLLGDVVEFLDSKRKPVKEEQRAKMRGIYPYYGASG------IVDYVND 74 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQ-VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + D I+ R++R V + + + ++P + + +L + Sbjct: 75 YLFDEDLILMGEDGENILSRNIRLVWQVSGKIWVNNHAHVLRPK--SNFNIGFLTEYLES 132 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + L + + + +PP+ EQ I +V++ D L+ +++ I Sbjct: 133 LDYSLYNSGTAQPKLNQQTCCNIVIALPPLPEQKAIASVLSDV----DELISSLDKLIAK 188 Query: 403 LKERRSSFIAAAVTGQIDLRGESQ 426 + +++ + +TG+ L G + Sbjct: 189 KRHIKTATMQQLLTGKTRLPGFGE 212 >gi|306826651|ref|ZP_07459955.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes ATCC 10782] gi|304431178|gb|EFM34183.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes ATCC 10782] Length = 395 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 60/404 (14%), Positives = 129/404 (31%), Gaps = 38/404 (9%) Query: 24 HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ + + + G + ++ DI ++ + DV G+ + Sbjct: 17 EWEEKKLGEISNIVRGASPRPIQDPKWFDAKSDIGWLRISDVTEQEGRITYLQQRISELG 76 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +L + I G+ + L PK + Sbjct: 77 QEKTRVLKDPHLLLSIAATVGKPVINYVKTGVHDGFLVFLDPKF---NREFMFQWLDMFR 133 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + + + + I N + +P L EQ I E +D LI + + + + Sbjct: 134 PYWNKYGQPGSQVNLNSEIIRNQVINLPSLPEQEAIGE----LFQTVDQLIQLQRQKLAI 189 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LKE+KQ + + P K I G + WE K +V ++ Sbjct: 190 LKEQKQTFLRKMF-----PAQGQKVPEIRLQGFDGE-WEEKELGDIVQITMGQSPSSQNY 243 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVME 312 Y + + +N + P + T D G+I+ D ++ Sbjct: 244 TTNPSDYILVQGNADIKNGYVFPRVWTTQITKQADKGDIILSVRAPVGDVGKTNYHVIIG 303 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 RG+ + ++ +++ + + +G S+ D+K + +P Sbjct: 304 RGVAA---------IKGNEFIFQILKYLKEIGYWKRISTGSTFDSISSSDIKYAKIQIPS 354 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + EQ I N +D + + E+ + LK + + + Sbjct: 355 LSEQEAIGNF----FQTLDQQIAQSEEKLTELKALKQTLLNRLF 394 >gi|312110993|ref|YP_003989309.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y4.1MC1] gi|311216094|gb|ADP74698.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y4.1MC1] Length = 409 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 85/413 (20%), Positives = 154/413 (37%), Gaps = 30/413 (7%) Query: 23 KHWKVVPIKRFTKLNTG-RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 +W V + +L + + + YIGLE + GT + + + TST S F Sbjct: 2 SNWIKVKLGDIVELKRESYHPKPDEVLPYIGLEHIGQGTLRLISVGKS--NEVTSTKSYF 59 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWLLSIDVTQRIE 138 +KG IL+GKL PY RK + F+G+CST LV P+ L + S ++ Sbjct: 60 SKGDILFGKLRPYFRKVVRPKFNGVCSTDILVLTSKNPRKFNQTFLFYLMASQEMIDLAT 119 Query: 139 AICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 A G M ADWK + N+ + IP + EQ I + + +ID I E+ Sbjct: 120 ASSSGTKMPRADWKVLQNLEISIPEDVNEQERIGKILETIDDKIDINIRMNKTLEEMAMT 179 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT-----KLI 252 + + V G D + +S +G++P W+V L + K + Sbjct: 180 LYK---HWFVDFGPFQDEEFVES---ELGMIPKGWKVIQVKDLGEVITGKTPSTKVKEYY 233 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 I + ++ + + +++ P + I Sbjct: 234 GDKIPFIKIPDMHGNVYIVKTETMLSELGAQSQKNKMLPPNTVCVSCIATPGLVVLTSEM 293 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + I + V G+ ++ +S V G +L D R+ +L Sbjct: 294 SQTNQQINS----VVCKEGVSPYFVYLFFKSISDNIVTLGSGGTATLNLNKGDFSRIKLL 349 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P ++ N + I L++ + + L+ R + ++G+ID+ Sbjct: 350 MPT----NEVMTGFNNKVESIFNLIKINSLNNIELENLRDYLLPRLLSGEIDV 398 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 56/194 (28%), Gaps = 8/194 (4%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESG-TGKYLPKDGNS 70 +G IPK WKV+ +K ++ TG+T + G I +I + D+ + Sbjct: 201 LGMIPKGWKVIQVKDLGEVITGKTPSTKVKEYYGDKIPFIKIPDMHGNVYIVKTETMLSE 260 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + + + + ++ + Q + K+ + Sbjct: 261 LGAQSQKNKMLPPNTVCVSCI-ATPGLVVLTSEMSQTNQQINSVVCKEGVSPYFVYLFFK 319 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + G + + I + +P K+ + I I Sbjct: 320 SISDNIVTLGSGGTATLNLNKGDFSRIKLLMPTNEVMTGFNNKVESIFNLIKINSLNNIE 379 Query: 191 FIELLKEKKQALVS 204 L L+S Sbjct: 380 LENLRDYLLPRLLS 393 >gi|224826954|ref|ZP_03700052.1| restriction modification system DNA specificity domain protein [Lutiella nitroferrum 2002] gi|224600787|gb|EEG06972.1| restriction modification system DNA specificity domain protein [Lutiella nitroferrum 2002] Length = 421 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 62/419 (14%), Positives = 125/419 (29%), Gaps = 28/419 (6%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYL 64 +P+++D P W P+ + + +T + ++ ++ E + Sbjct: 14 RFPEFQD-------EKP--WSFQPLGKLARRSTRKNTDCEVTRVLTNSAEFGVIDQRDFF 64 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGK----LGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 KD + Q + I +G +Y + P + G+ S + V + + Sbjct: 65 DKDI-ANQGNLEGYYIVEEGSYVYNPRISAMAPVGPISKNRVGLGVMSPLYTVFKFNNDQ 123 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSH---ADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + S Q + +P+P+ EQ I + + + Sbjct: 124 DDFYAHYFKSTHWHQHMRQASSTGARHDRISITNDDFMGLPLPVSGRDEQEKITDCLSSL 183 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 IT R + LK K+ L+ + + +++ G + E+ Sbjct: 184 DEL----ITAETRKLNALKTHKKGLMQQLFPREGEAVPRLRFPEFRDAG-GWEEKELGQL 238 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGEIVFRFI 296 LV+ L L +L L NI + + + P +I+ Sbjct: 239 GELVSGLTYSPEDLRVDGLLVLRSSNIQNGRITLDDNVYVRSDIKGANPSRPDDILICVR 298 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + + E T + +L L R+ + A S Sbjct: 299 NGSKSLIGKSALIPKEMPPCTHGAFMTIFRSESARFLIHLFRTDAYERQVSADLGATINS 358 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +K+ VP EQ I + ++ D L+ Q I L R Sbjct: 359 INGNQLKKYKFFVPNPDEQQKIADFLSFA----DSLISDQAQKIEALNIHRKGLRQQLF 413 >gi|328947987|ref|YP_004365324.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328448311|gb|AEB14027.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 490 Score = 125 bits (314), Expect = 1e-26, Method: Composition-based stats. Identities = 60/397 (15%), Positives = 114/397 (28%), Gaps = 33/397 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W + TG+ + + YI ++ + D Sbjct: 85 EVPDGWAWCRLGELFYHTTGKALKKSNNKGSLRKYITTSNLYWNKFDFTEVREMYFTDDE 144 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICST-QFLVLQPKDVLPELLQGWLLSIDVT 134 KG ++ G R AI + IC L+PK L + Sbjct: 145 LDKCTIKKGDLVLCNGGDVGRAAIWNYNEDICYQNHVSRLRPKIEGINNSLYLYLLMFYK 204 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ +G ++ + + P+PPL EQ I I +I+ L E+ + Sbjct: 205 EQGMLNGKGVGITSLSANDLLSAIFPLPPLNEQNSIVTSIENIFEQIEHLDQEKSDLQTI 264 Query: 195 LKEKKQALVSYIVTKGLNPD--------------------VKMKDSGIEWVGLVPDHWEV 234 +K+ K ++ + L P K E + +P+ W Sbjct: 265 IKQTKSKILDLAIHGKLVPQDPNDEPAEELLKRIATSDNRPYKKIDEDEALFDIPESWSW 324 Query: 235 KPFFALVTELNRK------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 + T K N + I + + + + Sbjct: 325 CTLGEIYTHTTGKALKKTNNKGTLRKYITTSNLYWNSFDFTEVREMYFTDDELEKCTIKK 384 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G+++ + S K I++++ +++ Y + Sbjct: 385 GDLILCNGGDVGRAAIWNYDYDICYQNHVSRL-RPKNKNINNSFFLYVIMIYKQQGILNG 443 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 G G+ SL D+ V +PP EQ I I Sbjct: 444 KGVGII-SLSASDLLSAVVPLPPYSEQNRIVEKIECL 479 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 38/269 (14%), Positives = 74/269 (27%), Gaps = 10/269 (3%) Query: 157 IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 I + P ++ + ++ K+ + S D Sbjct: 15 IHGKLVPQNPNDESATVLLEKIRAEKAEKIKKGELKADKKDSFIFVGSDKRHYEQFADGT 74 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRK------NTKLIESNILSLSYGNIIQKLET 270 +KD E VPD W L K N + I + + Sbjct: 75 VKDIEDEIPFEVPDGWAWCRLGELFYHTTGKALKKSNNKGSLRKYITTSNLYWNKFDFTE 134 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G++V E + ++P Sbjct: 135 VREMYFTDDELDKCTIKKGDLVLCNGGDVGRAAIW---NYNEDICYQNHVSRLRPKIEGI 191 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L + G G+ SL D+ +PP+ EQ I I +I Sbjct: 192 NNSLYLYLLMFYKEQGMLNGKGVGITSLSANDLLSAIFPLPPLNEQNSIVTSIENIFEQI 251 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + L ++ ++K+ +S + A+ G+ Sbjct: 252 EHLDQEKSDLQTIIKQTKSKILDLAIHGK 280 >gi|25026605|ref|NP_736659.1| hypothetical protein CE0049 [Corynebacterium efficiens YS-314] gi|23491884|dbj|BAC16859.1| conserved hypothetical protein [Corynebacterium efficiens YS-314] Length = 417 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 70/406 (17%), Positives = 143/406 (35%), Gaps = 25/406 (6%) Query: 27 VVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVES--GTGKYLPKDGNSRQSDTSTVS 79 +VP++R ++ G T + D+ + D+ G + + S + Sbjct: 7 IVPLRRIARVKNGGTPGPDESNWEGDVPWATPVDLGRVHGGCLQTTERSITAMGLQSGST 66 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L P A IA D + L P + ++A Sbjct: 67 LAPAGSVLISSRAPI-GYAAIAGMDTAFNQGCKALIPLPGVSRPRFLKYAVESQMSTLQA 125 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+T + + ++P+P+ L +Q I + + ET ID + E + ++L+ E+ Sbjct: 126 AGRGSTFTEVSASDVASLPIPVTSLDKQDWIADYLDRETAEIDAMAVELDQAMDLIDERF 185 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 A V P + ++ + E +L+ Sbjct: 186 HAEVEQSFQSLDAPRMPLRS------------QIQSMTTGTSVTAAKFAPAAGEPGVLAT 233 Query: 260 SYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 S + ET + P Y + ++ ++ N + + Sbjct: 234 SAVFGDELNETAVKSVDPHEYVRLTCPLRINTLLVSRMNTMNLVGKAVTVGRHLPDVYLP 293 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQ 375 + Y+ W RS + + G ++L + + + + VPP+ +Q Sbjct: 294 DRL-WAVEVDVPRYIYWWTRSQSYREQIRGLAVGASDSMKTLSQQAFRSITLPVPPVTQQ 352 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + ++ R L +++++ LL+ERR+ I+AAVTGQID+ Sbjct: 353 IAVAAQLDEAAERFSALKAELQEAKGLLEERRAVLISAAVTGQIDV 398 >gi|257893690|ref|ZP_05673343.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,408] gi|257830069|gb|EEV56676.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,408] Length = 409 Score = 125 bits (313), Expect = 2e-26, Method: Composition-based stats. Identities = 76/405 (18%), Positives = 151/405 (37%), Gaps = 30/405 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLED-------VESGTGKYLPKDGNSRQSDTS 76 W+ + + G T + + G D + K K + S Sbjct: 17 DWEQRKLGEVADIIGGGTPNTNNPEYWNGDIDWYAPAEIGKQIYVKNSQKKISQLGLQKS 76 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + I G +L+ AI+A G + F + P + + + + ++ + Sbjct: 77 SAKILPIGTVLFTSRAGIGNTAILAKE-GTTNQGFQSIVPHENKLDSYFIFSRTHELKRY 135 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 E G+T + K + +P+ IP + EQ +KI ++D IT R ++LLK Sbjct: 136 GEVTGAGSTFAEVSGKQMAKMPILIPYIDEQ----QKIGIFFKKLDDTITLHQRKLDLLK 191 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT---KLIE 253 E K+ + + P K I + G + WE + + +KN Sbjct: 192 ETKKGFLQKMF-----PKNGAKVPEIRFPGFT-EDWEARKLIDYLDVSTQKNKDEIYDKG 245 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + I+ ++E + S Y +V+ G+IV+ L+ + + + Sbjct: 246 DVLSVSGDCGIVNQIEFQGRSFAGVSVANYGVVETGDIVYTKSPLKANPYGIIKTNKGKT 305 Query: 314 GIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLP--VL 368 GI+++ Y KP I D ++ + + G + +K D L V+ Sbjct: 306 GIVSTLYAVYKPKQITDPEFVQIYFEQDVRMNNYMRPLVNKGAKNDMKVSDENALKGEVM 365 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P ++EQ I++ +++ L+ ++ + LLKE + F+ Sbjct: 366 FPKLEEQRRISSY----FEQLNNLITLHQRELDLLKETKKGFLQK 406 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 62/189 (32%), Gaps = 9/189 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI----IQKLETRNMGLKPESYETYQI 285 D WE + + + + I K K S Q Sbjct: 16 DDWEQRKLGEVADIIGGGTPNTNNPEYWNGDIDWYAPAEIGKQIYVKNSQKKISQLGLQK 75 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + + + +A + + G + ++ PH R+++L + Sbjct: 76 SSAKILPIGTVLFTSRAGIGNTAILAKEGTTNQGFQSIVPHENKLDSYFIFSRTHELKRY 135 Query: 346 FYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 G+G + + + ++P+L+P I EQ I ++D + ++ + LLK Sbjct: 136 GEVTGAGSTFAEVSGKQMAKMPILIPYIDEQQKIGIF----FKKLDDTITLHQRKLDLLK 191 Query: 405 ERRSSFIAA 413 E + F+ Sbjct: 192 ETKKGFLQK 200 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 63/196 (32%), Gaps = 17/196 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + W+ + + ++T + + D++ + + ++ + + Sbjct: 219 EDWEARKLIDYLDVSTQKNKDEIYDKGDVLSVSGDCGIVNQIEFQGRSFAGVS--VANYG 276 Query: 80 IFAKGQILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + G I+Y K PY GI ST + V +PK + DV Sbjct: 277 VVETGDIVYTKSPLKANPYGIIKTNKGKTGIVSTLYAVYKPKQITDPEFVQIYFEQDVRM 336 Query: 136 RIEA----ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + P L EQ I +++ LIT R Sbjct: 337 NNYMRPLVNKGAKNDMKVSDENALKGEVMFPKLEEQRRISSY----FEQLNNLITLHQRE 392 Query: 192 IELLKEKKQALVSYIV 207 ++LLKE K+ + + Sbjct: 393 LDLLKETKKGFLQKMF 408 >gi|315636820|ref|ZP_07892045.1| restriction modification system DNA specificity subunit [Arcobacter butzleri JV22] gi|315478874|gb|EFU69582.1| restriction modification system DNA specificity subunit [Arcobacter butzleri JV22] Length = 432 Score = 125 bits (313), Expect = 2e-26, Method: Composition-based stats. Identities = 67/440 (15%), Positives = 148/440 (33%), Gaps = 38/440 (8%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK---------LNTGRTSESGKDIIYIGLEDVESGT 60 YK + IG IP+ W ++ + + L + + + I + + G Sbjct: 6 YKQTD---IGLIPEDWSIIDFEDISTMNGRIGWQGLKQEEFTFTYDEPFLITGMNFKDGK 62 Query: 61 GKYLPKDGNSRQSDTSTVSI-FAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQ 115 ++ S + I IL K G + + + ++ LV + Sbjct: 63 IRWDEVYHVSEERYKQAKQIQLKTNDILMTKDGTIGKLLYVDNIPFPKKASLNSHLLVFR 122 Query: 116 PKD--VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 PK+ P+ + L Q IE G+T + +G +PP+ EQ I Sbjct: 123 PKNNTYNPKFMFYQLHGKHFLQHIELTKSGSTFFGISQESMGKYKAILPPIEEQKAIANA 182 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHW 232 + I++L + + + Q L++ G + D + K G V Sbjct: 183 LSDTDELINSLEKFISKKEAIKQGTMQQLLTGKKRLNGFSGDWEEKRLGDVIV------K 236 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 F + +I + L + L+ ++ G+++ Sbjct: 237 FQNGFAFNAKGYIKNGMPIITMAQIGLDGTFKFDTNKVNYWNLEESKNLKDFYLNNGDVI 296 Query: 293 FRFIDLQNDKRSLR---SAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 D+ +K + ++ ++ + I+ +L + Sbjct: 297 IAMTDVTPEKNLIGRMTIVNTSSTCLLNQRVGHLILDEKQINPLFLTTISNMKKWRAYSI 356 Query: 348 AMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + S G++ ++ +D+ + +P IKEQ I +++ I+ L + + K Sbjct: 357 GIASLGVQANIGTKDILNGLIKLPSIKEQNAIAEILSDMDNEIETL----KSKLSKTKAI 412 Query: 407 RSSFIAAAVTGQID--LRGE 424 + ++ +TG+I ++ E Sbjct: 413 KDGIMSELLTGKIRLKVKDE 432 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 78/216 (36%), Gaps = 19/216 (8%) Query: 225 VGLVPDHWEVKPFFAL-------VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 +GL+P+ W + F + + ++ + L G + + R + Sbjct: 11 IGLIPEDWSIIDFEDISTMNGRIGWQGLKQEEFTFTYDEPFLITGMNFKDGKIRWDEVYH 70 Query: 278 ESYETY-----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDS 330 S E Y + +I+ + + ++ + S + +P + + Sbjct: 71 VSEERYKQAKQIQLKTNDILMTKDGTIGKLLYVDNIPFPKKASLNSHLLVFRPKNNTYNP 130 Query: 331 TYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 ++ + + + SG + E + + ++PPI+EQ I N ++ Sbjct: 131 KFMFYQLHGKHFLQHIELTKSGSTFFGISQESMGKYKAILPPIEEQKAIANALSD----T 186 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 D L+ +E+ I + + + +TG+ L G S Sbjct: 187 DELINSLEKFISKKEAIKQGTMQQLLTGKKRLNGFS 222 >gi|257060919|ref|YP_003138807.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8802] gi|256591085|gb|ACV01972.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8802] Length = 424 Score = 125 bits (313), Expect = 2e-26, Method: Composition-based stats. Identities = 58/426 (13%), Positives = 132/426 (30%), Gaps = 36/426 (8%) Query: 23 KHWKVVPIKRFTKL-NTGRTSES------GKDIIYIGLEDVE--SGTGKYLPKDGNSRQS 73 + WK + + +G T + DI ++ +ED+ K Sbjct: 4 EGWKDSSLISLLTILKSGGTPNTSRSDFYNGDIPFVAIEDMSASRKYLYSTVKSLTKEGL 63 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + + +LY + L I + L + D + + + + Sbjct: 64 KNSNAWLVPENSLLYS-IYATLGLVRINKIPVATNQAILAMIVNDEVVDQDYLYYWLEYI 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFI 192 I + T S+ + + P EQ I + ID I + I Sbjct: 123 RDSIVNLSAQTTQSNLSATTVKPFLVQHPKDKEEQTQIATIL----STIDRAIEQTETLI 178 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW-----VGLVPDHWEVKPFFALVTELNRK 247 + K L+ ++TKG++ + ++ +G +P WEVKP + Sbjct: 179 AKQQRIKTGLMQDLLTKGIDENGNIRSEETHQFKDSVLGRIPVEWEVKPLGEKARVRSGS 238 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYE---------TYQIVDPGEIVFRFIDL 298 + ++ E + + + + G ++ Sbjct: 239 TPLRSNEKFWIGGTVSWVKTSEVCFSKITETEEKITEQALKLTSLNLEPIGSVLVAMYGQ 298 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSL 357 + + A + + I+ YL + + S +G G + +L Sbjct: 299 GGTRGRCAILGIEATTNQACAAILGQQGEINQDYLFYYLSSKY--NDLRTIGHGSNQTNL 356 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 ++ + VP KEQ I + ++ + +++ + L ++ + +TG Sbjct: 357 NGNLLRLFLIKVPSYKEQVKIAD----SFNKLKQMQDQLFSELSKLNSIKTGLMQDLLTG 412 Query: 418 QIDLRG 423 ++ + Sbjct: 413 KVRVTE 418 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 70/209 (33%), Gaps = 12/209 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG-------LEDVESGTG 61 Q+KDS +G IP W+V P+ ++ +G T + +IG +V Sbjct: 210 QFKDS---VLGRIPVEWEVKPLGEKARVRSGSTPLRSNEKFWIGGTVSWVKTSEVCFSKI 266 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYL--RKAIIADFDGICSTQFLVLQPKDV 119 + + +++++ G +L G + I + + + + Sbjct: 267 TETEEKITEQALKLTSLNLEPIGSVLVAMYGQGGTRGRCAILGIEATTNQACAAILGQQG 326 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + I G+ ++ + + + +P EQV I + Sbjct: 327 EINQDYLFYYLSSKYNDLRTIGHGSNQTNLNGNLLRLFLIKVPSYKEQVKIADSFNKLKQ 386 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVT 208 D L +E + + Q L++ V Sbjct: 387 MQDQLFSELSKLNSIKTGLMQDLLTGKVR 415 >gi|139474405|ref|YP_001129121.1| type I restriction-modification system S protein [Streptococcus pyogenes str. Manfredo] gi|134272652|emb|CAM30919.1| type I restriction-modification system S protein [Streptococcus pyogenes str. Manfredo] Length = 391 Score = 125 bits (313), Expect = 2e-26, Method: Composition-based stats. Identities = 64/401 (15%), Positives = 134/401 (33%), Gaps = 36/401 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + +++ +G T +I +I ++ S + S+ Sbjct: 17 EWEEKKLGEISRMFSGGTPNVGIPEYYNGNIPFIRSAEINSDQ---TELSITDKGLSNSS 73 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + K +LY G + ++ G + L + P+ L L + I Sbjct: 74 AKLVEKNTLLYALYGATSGEVGLSRISGAINQAILAIIPEKKYSSLFIKNWLYKQKSSII 133 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E +G + + + + P L+EQ I E +D LI + + + LKE Sbjct: 134 EKYLQG-GQGNLSGSIVKELTIQFPSLSEQEAIGE----LFQTVDQLIQLQRQKLATLKE 188 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +KQ + + P K I G + WE K +V ++ Sbjct: 189 QKQTFLRKMF-----PAQGQKVPEIRLQGFDGE-WEEKELGDIVQITMGQSPSSQNYTTN 242 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 Y + + +N + P + T D G+I+ D ++ RG+ Sbjct: 243 PSDYILVQGNADIKNGYVFPRVWTTQITKQADKGDIILSVRAPVGDVGKTNYHVIIGRGV 302 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKE 374 + ++ +++ + + +G S+ D+K + +P + E Sbjct: 303 AA---------IKGNEFIFQILKYLKEIGYWKRISTGSTFDSISSSDIKYAKIQIPSLSE 353 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q I N +D + + E+ + LK + + + Sbjct: 354 QEAIGNF----FQTLDQQIAQSEEKLTELKALKQTLLNRLF 390 >gi|241668318|ref|ZP_04755896.1| type I restriction-modification system, subunit S [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 385 Score = 124 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 63/406 (15%), Positives = 140/406 (34%), Gaps = 30/406 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + +P W+ +++ + S + + L+ +E+ G+Y P G + Sbjct: 4 LYKLPAGWEWEKLEKVCD----KASSN------LSLKKIENEDGEY-PIYGAKGFIKNIS 52 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + I K G + + + D L PK+ + +L + + Sbjct: 53 FFHREEPYISIIKDGAGVGRVTMLDSKSSVIGTLQYLLPKNCID---IKYLYFLLLVIDF 109 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G T+ H ++ +P+PPLAEQ I K+ + +ID I + I Sbjct: 110 GKYVSGTTIPHIYYRDYKEHLVPLPPLAEQKRIVAKLDSLFEKIDKAIELHQQNITNANT 169 Query: 198 KKQALVSYIVTK--GLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + K G +KD I+ G P + + + N + Sbjct: 170 LMASTLDKTFKKLEGEYSYKNLKDITIKIGSGATPKGGQKAYKQKGTSLIRSMNVHDMGF 229 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + L++ + Q + +N+ IV+ +++ + + + Sbjct: 230 SKKGLAFIDDSQADKLKNV-----------IVEKDDVLLNITGASVARCCVVCESALPAR 278 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + S +L + + S F + G R+++ ++ L V + Sbjct: 279 VNQHVSIIRLNDSFISKFLHYYLISPMKKTELLFSSSGGATREAITKSMIENLQVPDISL 338 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q ++ ++D + + EQ + LK ++S + A G+ Sbjct: 339 PIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAFRGE 384 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 58/183 (31%), Gaps = 15/183 (8%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 + +P WE + V + N L + Y K +N+ Sbjct: 3 ELYKLPAGWEWEKL-EKVCDKASSNLSLKKIENEDGEYPIYGAKGFIKNISFFHREEPYI 61 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 I+ G + + +I + + + ID YL +L+ D Sbjct: 62 SIIKDG-----------AGVGRVTMLDSKSSVIGTLQYLLPKNCIDIKYLYFLLLVIDFG 110 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 K + + D K V +PP+ EQ I ++ +ID +E +Q+I Sbjct: 111 KYV---SGTTIPHIYYRDYKEHLVPLPPLAEQKRIVAKLDSLFEKIDKAIELHQQNITNA 167 Query: 404 KER 406 Sbjct: 168 NTL 170 >gi|32266920|ref|NP_860952.1| hypothetical protein HH1421 [Helicobacter hepaticus ATCC 51449] gi|32262972|gb|AAP78018.1| hypothetical protein HH_1421 [Helicobacter hepaticus ATCC 51449] Length = 422 Score = 124 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 63/433 (14%), Positives = 132/433 (30%), Gaps = 47/433 (10%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 I IPK+W++ + I L +E+ TGKY P G T Sbjct: 2 INNIPKNWEIKTLAEVCTSKNSN----------IVLSSIENNTGKY-PIYGAKGFLKTID 50 Query: 78 VSIFAKGQILYGKLGP-YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + K G R ++ + + T + +++ + L +L +I+ Sbjct: 51 FYTIENESLGIVKDGAGVGRIFLLPEKSSLIGTMAYIQANENLNLKYLYHFLHTINF--- 107 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 G+ + H ++ +P+PPL Q I EK+ ID + + Sbjct: 108 -NQYISGSAIPHIYFRDYKKEKIPLPPLEVQKAIVEKLENAFAHIDEAVRHLKSVQTNIP 166 Query: 197 EKKQALVSYIVTKGLNPDV---------------------------KMKDSGIEWVGLVP 229 K +L+ + L K +P Sbjct: 167 RLKSSLLHCAFSGKLTESQNSSHKVQTLKSVVGVEGEFEREEGATSPFKPLHPLIKEEIP 226 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDP 288 WE+K + + S + I +E N + P+ + + Sbjct: 227 QGWEIKTLGEVFKVIGGGTPSTANPKFWSGNIAWITSANIENENFTIIPKKFINQSAIQA 286 Query: 289 ---GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + I + + + A+ P + Sbjct: 287 SATNLVPKNTIIVVTRVGLGKVGITDVETCFSQDSQALLPLIDLNVKFMAFQIRNKAQNF 346 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + + + +K++ + +PP+ Q I ++ + A ++ L + + S+ L++ Sbjct: 347 IVSSRGTTINGITKDTLKKVALKIPPLATQNQIVQILESKFAHLEKLEQFVNASLENLQK 406 Query: 406 RRSSFIAAAVTGQ 418 +SS + A G+ Sbjct: 407 LKSSLLNQAFKGE 419 >gi|190150797|ref|YP_001969322.1| type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|189915928|gb|ACE62180.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 7 str. AP76] Length = 508 Score = 124 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 60/443 (13%), Positives = 133/443 (30%), Gaps = 71/443 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPK---DGN 69 IPK W V + ++ G T ++ +D I +I D++ +GKY+ K + Sbjct: 70 EIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYISKGNRNIT 129 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 +S+ + +K I+Y P I + + + F + + + + Sbjct: 130 ENGLRSSSTRLLSKNSIVYSSRAPI-GYIAITETELCTNQGFKSIDLYNKE-IVDYLYYS 187 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I T I++ G T GN +P+PPL EQ I KI I+ + Sbjct: 188 LIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQYAEKEE 247 Query: 190 RFIELLKEKK----QALVSYIVTKGLNPDVKM---------------------------- 217 + L ++ ++++ + L Sbjct: 248 KLTALHQQFPEQLKKSILQAAIQGKLTKQDPNDEPALVLIERIKAEKLRLIAEKKLKKPK 307 Query: 218 --------------------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE---S 254 + E +P+ W + + Sbjct: 308 VVSEIILRDNLPYEIVNGKERCIADEVPFEIPESWVWVRLGEIGETNIGLTYNPSDVASD 367 Query: 255 NILSLSYGNIIQKLET--RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + L GNI ++ + +++ + K+ + A +++ Sbjct: 368 GTIVLRSGNIQDGKIDVSSDIVKVNLDIPENKRCYKNDLLICARN--GSKKLVGKAAIID 425 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + + Y+ + + S F + + + ++ + +P + Sbjct: 426 KDGYSFGAFMTIFRSPFNKYIYYYLSSPLFRNDFDGINTTTINQITQSNLNNRLIPLPSL 485 Query: 373 KEQFDITNVINVETARIDVLVEK 395 EQ I I + + L +K Sbjct: 486 NEQLRIVEKIETLFSTLQNLSQK 508 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 62/211 (29%), Gaps = 13/211 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S ++ +P W L + K E + + I + + + K S Sbjct: 63 SQQDFSFEIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYIS 122 Query: 280 YETYQIVDPGE-------IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 I + G + I + A + ++ + + Sbjct: 123 KGNRNITENGLRSSSTRLLSKNSIVYSSRAPIGYIAITETELCTNQGFKSIDLYNKEIVD 182 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + Y ++ + + + +PP+ EQ I I I+ Sbjct: 183 YLYYSLIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQY 242 Query: 393 VEKIEQSIVLL-----KERRSSFIAAAVTGQ 418 + E+ + L ++ + S + AA+ G+ Sbjct: 243 -AEKEEKLTALHQQFPEQLKKSILQAAIQGK 272 >gi|300070274|gb|ADJ59674.1| specificity determinant HsdS [Lactococcus lactis subsp. cremoris NZ9000] Length = 415 Score = 124 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 64/405 (15%), Positives = 147/405 (36%), Gaps = 23/405 (5%) Query: 24 HWKVVPIKR--------FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ + ++ S + + +I +D++ + S++ D Sbjct: 16 DWEQRKLGELSQKISVGIATSSSKYFSSQDQGVPFIKNQDIKENRINTKNLEYISKEFDN 75 Query: 76 STV-SIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSI 131 +G I+ + G A++ +T + +L E + ++ S Sbjct: 76 KNKNKRVKQGDIITARTGYPGLSAVVPKELEGAQTFTTLITRPISEMILSEYISIFINSP 135 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I + G + + + N+ +P+P L EQ I I+ ++D I R Sbjct: 136 YGMKQISGMEAGGAQKNVNAGIVQNLLIPLPSLDEQKKISNFIL----KLDDTIALNQRK 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I+LLKE+K+ + + K +++ +G ++ F + K Sbjct: 192 IDLLKEQKKGYLQKMFPKNGAKVPELRFAGFVDDWEQRKLSDLMTFSNGINAPKENYGKG 251 Query: 252 IESN-ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + ++ + I+ N + E V+ G+++F ++ A Sbjct: 252 TKMISVMDILNPLPIKYDNILNSVSVDKKIEDKNKVENGDLIFVRSSEIVEEVGWAKAYK 311 Query: 311 MERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 R + S + ++ ++ + + ++ G R ++ E + L VL Sbjct: 312 EARYALYSGFAIRGKRISSYNAYFIELTLNYANRKEIKRRAGGSTRFNVSQEILNSLTVL 371 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I EQ I ++ +ID + ++ + LLKE++ F+ Sbjct: 372 TPSISEQNQI----DLFFTKIDDTITLHQRKLDLLKEQKKGFLQK 412 >gi|125623519|ref|YP_001032002.1| specificity determinant HsdS [Lactococcus lactis subsp. cremoris MG1363] gi|124492327|emb|CAL97261.1| probable specificity determinant HsdS [Lactococcus lactis subsp. cremoris MG1363] Length = 416 Score = 124 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 64/405 (15%), Positives = 147/405 (36%), Gaps = 23/405 (5%) Query: 24 HWKVVPIKR--------FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ + ++ S + + +I +D++ + S++ D Sbjct: 17 DWEQRKLGELSQKISVGIATSSSKYFSSQDQGVPFIKNQDIKENRINTKNLEYISKEFDN 76 Query: 76 STV-SIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSI 131 +G I+ + G A++ +T + +L E + ++ S Sbjct: 77 KNKNKRVKQGDIITARTGYPGLSAVVPKELEGAQTFTTLITRPISEMILSEYISIFINSP 136 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I + G + + + N+ +P+P L EQ I I+ ++D I R Sbjct: 137 YGMKQISGMEAGGAQKNVNAGIVQNLLIPLPSLDEQKKISNFIL----KLDDTIALNQRK 192 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I+LLKE+K+ + + K +++ +G ++ F + K Sbjct: 193 IDLLKEQKKGYLQKMFPKNGAKVPELRFAGFVDDWEQRKLSDLMTFSNGINAPKENYGKG 252 Query: 252 IESN-ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + ++ + I+ N + E V+ G+++F ++ A Sbjct: 253 TKMISVMDILNPLPIKYDNILNSVSVDKKIEDKNKVENGDLIFVRSSEIVEEVGWAKAYK 312 Query: 311 MERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 R + S + ++ ++ + + ++ G R ++ E + L VL Sbjct: 313 EARYALYSGFAIRGKRISSYNAYFIELTLNYANRKEIKRRAGGSTRFNVSQEILNSLTVL 372 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I EQ I ++ +ID + ++ + LLKE++ F+ Sbjct: 373 TPSISEQNQI----DLFFTKIDDTITLHQRKLDLLKEQKKGFLQK 413 >gi|126465662|ref|YP_001040771.1| restriction modification system DNA specificity subunit [Staphylothermus marinus F1] gi|126014485|gb|ABN69863.1| restriction modification system DNA specificity domain [Staphylothermus marinus F1] Length = 463 Score = 124 bits (311), Expect = 3e-26, Method: Composition-based stats. Identities = 76/436 (17%), Positives = 144/436 (33%), Gaps = 30/436 (6%) Query: 10 YKDSGVQW--IGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVE-SGTGK 62 YK++ + IG IP+ W ++ + K+ TG+ ++ G +I IG E ++ G + Sbjct: 34 YKETDFKETPIGKIPRDWNIMRLDGLVKVETGKRAKGGGLYKGNIASIGGEHIDDEGNIR 93 Query: 63 YLPKDGNSRQSDTSTVS-IFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQP 116 + + S G IL K G K I + F++ Sbjct: 94 WNNMKFITEDFYNSLRQGKINIGDILLVKDGATTGKVAIVRELKYKKVAVNEHVFVIRSI 153 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 L + L Q + + +I +P+PP+ EQ I E + Sbjct: 154 TKKLINEFLFYFLYSKFGQMQIKTRFHGMIGGITRNDLKSILIPLPPVLEQRRIVEVLSI 213 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHW 232 I + L K Q L++ V + + +G +P W Sbjct: 214 VDEAIQKTDDVIAKVERLKKALMQELLTGKVRIKVEDGKARFYKETNFKDTKIGKIPKDW 273 Query: 233 EVKPFFALVTELNRK--NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 EV V L ++K + I + + SY+ IV+ G+ Sbjct: 274 EVIRLVDHVYVLKGYAFSSKFFNEKERGIPIIRIRDLGKNKTEAYYSGSYDPKYIVEKGD 333 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYA 348 ++ N +G++ + + L + Sbjct: 334 LLISMDGEFN-----IFLWKGPKGLLNQRVCKIWTKDATKLDNMYLYYALKKPLKLIEAQ 388 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + L D++R+ + +PP+ EQ I +++ ID + + LK + Sbjct: 389 TSQTTVKHLLDRDLERIKIPLPPLSEQQKIAEILST----IDKWISLEHRRKEKLKGLKK 444 Query: 409 SFIAAAVTGQIDLRGE 424 + +TG+I +R E Sbjct: 445 GLMNLLLTGRIRVRVE 460 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 88/225 (39%), Gaps = 17/225 (7%) Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK---LIESNILSLSYGNIIQ 266 G + K++ I G +P W + LV K K L + NI S+ +I Sbjct: 32 GFYKETDFKETPI---GKIPRDWNIMRLDGLVKVETGKRAKGGGLYKGNIASIGGEHIDD 88 Query: 267 KLETRNMGLKPESYETYQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY- 320 + R +K + + Y ++ G+I+ K ++ ++ + Sbjct: 89 EGNIRWNNMKFITEDFYNSLRQGKINIGDILLVKDGATTGKVAIVRELKYKKVAVNEHVF 148 Query: 321 -MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + + +L + + S G+ + D+K + + +PP+ EQ I Sbjct: 149 VIRSITKKLINEFLFYFLYSKFGQMQIKTRFHGMIGGITRNDLKSILIPLPPVLEQRRIV 208 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 V+++ D ++K + I ++ + + + +TG++ ++ E Sbjct: 209 EVLSIV----DEAIQKTDDVIAKVERLKKALMQELLTGKVRIKVE 249 >gi|308063426|gb|ADO05313.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori Sat464] Length = 448 Score = 124 bits (311), Expect = 3e-26, Method: Composition-based stats. Identities = 55/425 (12%), Positives = 133/425 (31%), Gaps = 35/425 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCDIALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKKNTNVSGFASMDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELKAR 191 Query: 192 IELLKEKKQALV------SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + + L+ S ++ K L P E + + + Sbjct: 192 KKQYQYYQNMLLDFKDIHSNHKDAKISAKTYPKRLKTLLQTLAPKGVEFRKLGDIGEFYS 251 Query: 246 R-------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID- 297 ++ + + ++ N Q ++ E + G+++F Sbjct: 252 GLVGKSKKSFSQGNKFYVPYVNVFNNPQLDLNALESVQIGDKEKQNTIQLGDVLFTGSSE 311 Query: 298 -----LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + + + + ++L +R Y+ K + +G Sbjct: 312 NLEDCAMSCVVTQKIEKDIYLNSFCFGFRFFDENLFNPSFLKHFLRDYNFRKNISKVANG 371 Query: 353 -LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RR 407 R ++ + + ++ + +PP++ Q +I +++ A L+ I I K+ R Sbjct: 372 VTRFNVSKQLLSKITIPIPPLEIQQEIVKILDQFLALTTDLLAGIPAEIEARKKQYEYYR 431 Query: 408 SSFIA 412 + Sbjct: 432 EKLLT 436 >gi|120552974|ref|YP_957325.1| restriction modification system DNA specificity subunit [Marinobacter aquaeolei VT8] gi|120322823|gb|ABM17138.1| restriction modification system DNA specificity domain [Marinobacter aquaeolei VT8] Length = 435 Score = 124 bits (311), Expect = 3e-26, Method: Composition-based stats. Identities = 54/406 (13%), Positives = 145/406 (35%), Gaps = 30/406 (7%) Query: 21 IPKHWKVVPIKRFT-KLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P +W++ + + ++ G T+ + + + + + D+++ T + + + Sbjct: 6 LPANWQLANLGEISSDISYGYTASATSEPTGVKLLRITDIQNNTVSWPNVPNCKIEPEKV 65 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD----FDGICSTQFLVLQPKDVLPELLQGWLLSID 132 +++ + G + K+ + S V + V E L + S Sbjct: 66 GKYRLKPSDLVFARTGATVGKSYLLKGEIPESVYASYLIRVRCLEGVSIEFLANYFQSPY 125 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++I G + + + N+ +P+PPLAEQ +I +K+ +++ R Sbjct: 126 YWRQITDFSAGIGQPNVNGTKLKNLSVPVPPLAEQKVIADKLDTLLAQVENTKARLERIP 185 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 ++LK +Q++++ V+ L + +E + + + V+ RK + Sbjct: 186 QILKRFRQSVLAAAVSGRLIDAQPESIAKLEELVDIENGAR-----KPVSATIRKTIQGT 240 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + + L + ++ F + Sbjct: 241 IPYYGATGIVDYLNDYTHEGRYLLVGEDGANLLSKSKDLAF--------------IVEGK 286 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + A++ + G++ ++ + S DL + L + + LP+ + Sbjct: 287 MWVNNHAHVLKERPGVNLDFVKIAINSLDLTPWI---TGSAQPKLTKKSLCGLPITNFTL 343 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ +I ++ + D + ++ ++ + S +A A G+ Sbjct: 344 DEQTEIVRRVDQLFSHADRIEQQASSALARVNNLTQSILAKAFRGE 389 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 40/167 (23%), Positives = 74/167 (44%), Gaps = 3/167 (1%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N + N ++PE Y + P ++VF K L ++ E + Sbjct: 46 QNNTVSWPNVPNCKIEPEKVGKY-RLKPSDLVFARTGATVGKSYLLKGEIPESVYASYLI 104 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT 379 G+ +LA +S + +G+ + ++ +K L V VPP+ EQ I Sbjct: 105 RVRCLEGVSIEFLANYFQSPYYWRQITDFSAGIGQPNVNGTKLKNLSVPVPPLAEQKVIA 164 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ-IDLRGES 425 + ++ A+++ ++E+ +LK R S +AAAV+G+ ID + ES Sbjct: 165 DKLDTLLAQVENTKARLERIPQILKRFRQSVLAAAVSGRLIDAQPES 211 >gi|294495711|ref|YP_003542204.1| restriction modification system DNA specificity domain protein [Methanohalophilus mahii DSM 5219] gi|292666710|gb|ADE36559.1| restriction modification system DNA specificity domain protein [Methanohalophilus mahii DSM 5219] Length = 414 Score = 124 bits (311), Expect = 3e-26, Method: Composition-based stats. Identities = 69/416 (16%), Positives = 148/416 (35%), Gaps = 24/416 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 IP W++V + + + YIG E + G + + D Sbjct: 4 KIPNGWEIVKFGDVVGKVSDKFQDRSAWHFERYIGGEHFDEGAIRVTKSNPIKGNEDVIG 63 Query: 78 VS---IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLLSID 132 + F G +LY P LRK + DF+GICS VLQ + L LL + + Sbjct: 64 SAFHMRFKPGHVLYVSRNPRLRKGGMVDFEGICSNTTYVLQADESKLLQSLLPFIIQTEA 123 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + G+T +WK I N + +PP+ EQ + E + + + I + + I Sbjct: 124 FVKHTTNSAHGSTNPFLNWKDIANYNLLLPPIEEQKKMAEILWSM----EDNIEKNEKLI 179 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK-- 250 + K+ K+ +++ ++TKG+ K +G +P++W++ ++ + K Sbjct: 180 KKNKQYKKIMINQLLTKGIGH----KKFKETELGRIPENWKLSKLSDIMNIIGGGTPKTS 235 Query: 251 ---LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 NI +S + + K + E + + + + Sbjct: 236 VTSYWNGNIPWISVEDFDSNSRYISSTKKTITKEGLDNSSTKILPKKSLIISARGTVGLV 295 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 Q+ + + + G + +++ ++ + S+ + + Sbjct: 296 CQLNKEMAFNQSCYGLIGKGDVIDDFLYYSLLFNIEQLKHNAYGSTFNSITKNNFDIVDA 355 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +PPI EQ I + + ++ + + + ++G+I + Sbjct: 356 AIPPIDEQKLIVEKLG----LFEKVLSDYNKQLEKTNTLKKKLTNEFLSGKIRIPE 407 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 77/204 (37%), Gaps = 13/204 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGK 62 ++K++ +G IP++WK+ + + G T ++ +I +I +ED +S + Sbjct: 202 KFKETE---LGRIPENWKLSKLSDIMNIIGGGTPKTSVTSYWNGNIPWISVEDFDSNSRY 258 Query: 63 Y--LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 K D S+ I K ++ G + + + ++ DV+ Sbjct: 259 ISSTKKTITKEGLDNSSTKILPKKSLIISARGTVGLVCQLNKEMAFNQSCYGLIGKGDVI 318 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + ++++ G+T + + IPP+ EQ LI EK+ Sbjct: 319 D--DFLYYSLLFNIEQLKHNAYGSTFNSITKNNFDIVDAAIPPIDEQKLIVEKLGLFEKV 376 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 + + + L K+ +S Sbjct: 377 LSDYNKQLEKTNTLKKKLTNEFLS 400 >gi|23452799|gb|AAN33173.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 402 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 57/415 (13%), Positives = 123/415 (29%), Gaps = 34/415 (8%) Query: 21 IPKHWKVVPIKRFTK-----LNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDG 68 +P+ WK+ + + G + K I + + + Sbjct: 4 LPQGWKMETLGEILSSDKYSIKRGPFGSTLKKSFFVEKGIRIFEQYNPINNDPHWKRYFI 63 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQF--LVLQPKDVLPELL 124 + + +G +L G + + GI + + L +L Sbjct: 64 SHEKFQELEAFKATEGDLLISCSGTLGKIVELPKDTEMGIINQALLKIRLNNIKILNSYF 123 Query: 125 QGWLLSIDVTQRIEAICEGATMSHA-DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + S + ++I G+ + + K + I +P+PPL +Q I + +ID Sbjct: 124 IYYFNSPIMQEKILESTLGSAIKNIASVKILKQIEIPLPPLKKQERIVGILDESFAKIDE 183 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 I + + L E Q+ + + + +P WE K + Sbjct: 184 SIKILEQNLLNLDELMQSALQKAFNPLKD--------NAKENYKLPQGWEWKSLEEISEN 235 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 ++ K + I + + I+ P I + Sbjct: 236 ISAGGDKPKNCTESKTAKNQIPVYANGVSNNGLVGYTDKATIIKPS----LTISARGTIG 291 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + I+ + + + YL + + L K Sbjct: 292 FVCIRKEPYFPIVRLISLIPCENILCLHYLYFCLN-----FFIAKGEGSSIPQLTIPKFK 346 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L + +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 347 SLQIPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLNKAFKGE 401 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 66/193 (34%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ ++ ++ G + ++ Y N+ + Sbjct: 219 KLPQGWEWKSLEEISENISAGGDKPKN----CTESKTAKNQIPVYANGVSNNGLVGYTDK 274 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K + G I + + + L P + + L + + E Sbjct: 275 ATIIKPSLTISARGTIGFVCIRKEPYFPI-VRLISLIPCENILCLHYLYFCLNFFIAKGE 333 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+++ ++ +P+PPL EQ I + + + L + ++ +E Sbjct: 334 ----GSSIPQLTIPKFKSLQIPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEEL 389 Query: 199 KQALVSYIVTKGL 211 KQ+L++ L Sbjct: 390 KQSLLNKAFKGEL 402 >gi|21228301|ref|NP_634223.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] gi|20906763|gb|AAM31895.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] Length = 384 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 44/409 (10%), Positives = 110/409 (26%), Gaps = 37/409 (9%) Query: 24 HWKVVPIKRFT-KLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 WK ++ ++ +G T + G I ++ +++ + + S Sbjct: 4 EWKECKLREIASEIKSGGTPSTKHQEYYGGIIPWLNTKEIHFNRIRDTDIKITEGGLNNS 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + I+ G K I + + P + + Sbjct: 64 SAKWVKENSIIVAMYGATAGKIAINKIPLTTNQACCNITPDSEKADYNFVYYNLCHRYDE 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + GA + + I N+ + +PP+ EQ I + + +ID L + + Sbjct: 124 LVNLSCGAAQQNLNVGLITNLDIILPPITEQCAIASVLSSLDDKIDLLHRQNKTLEAM-- 181 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + + +E + + F + T + + Sbjct: 182 ----------------AETLFRQWFVEEADEDWEEGFLPDEFDFTMGQSPPGTSYNQEGV 225 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + + T ++ I Sbjct: 226 GKPMFQGNADFGFRFPEERVYTTEPTRLAYPHDTLI------SVRAPVGAQNMAKVECCI 279 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKE 374 A + + Y + L + S+ D ++ + +PP Sbjct: 280 GRGVSAFRYKANNDFYTYTYFKLRSLMDEIKKFNDEGTVFGSISKTDFLQMGIAIPPED- 338 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 I + ++ V + I LL+ R + + ++G++ + G Sbjct: 339 ---IIEKFEIHAKPLNDKVIENCIQIKLLEVMRDTLLPKLMSGEVRVEG 384 Score = 36.7 bits (83), Expect = 7.6, Method: Composition-based stats. Identities = 18/187 (9%), Positives = 41/187 (21%), Gaps = 2/187 (1%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + G++ + + G + + R T + Sbjct: 196 EDWEEGFLPDEFDFTMGQSPPGTSYNQEGVGKPMFQGNADFGFRFPEERVYTTEPTRLAY 255 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC- 141 L P +A + + K + + I+ Sbjct: 256 PHDTLISVRAPV-GAQNMAKVECCIGRGVSAFRYKANNDFYTYTYFKLRSLMDEIKKFND 314 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 EG + + IPP ++ + + Sbjct: 315 EGTVFGSISKTDFLQMGIAIPPEDIIEKFEIHAKPLNDKVIENCIQIKLLEVMRDTLLPK 374 Query: 202 LVSYIVT 208 L+S V Sbjct: 375 LMSGEVR 381 >gi|296273010|ref|YP_003655641.1| restriction modification system DNA specificity domain-containing protein [Arcobacter nitrofigilis DSM 7299] gi|296097184|gb|ADG93134.1| restriction modification system DNA specificity domain protein [Arcobacter nitrofigilis DSM 7299] Length = 383 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 55/394 (13%), Positives = 123/394 (31%), Gaps = 24/394 (6%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV-SIFAKGQILY 88 I + G++ E G ++ K + T+ V K IL Sbjct: 6 IWEVCDVIAGQSPEGKFYNKEEEGIPFYQGKKEFTDKYIGKPTTWTTKVTKEAFKDDILM 65 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 P + ++ K+ + + + + + +GA + Sbjct: 66 SVRAPV-GPVNFSTEHICIGRGLAAIRVKEEINKEYLFYY--LIYHENSIVGNKGAVFNS 122 Query: 149 ADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + K I N+ +P+P L EQ I E + I+ + I+ KE Q+ ++ I Sbjct: 123 INKKQIENLKVPLPNKLEEQKQIVEILDKAFESIEQAKANIEKNIQNSKELFQSRLNEIF 182 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 ++ + + +V A T L + +I L G + Q Sbjct: 183 SQKGDGW------------EENELGKVCKTGAGGTPLKSRKEYYENGDIPWLCSGEVKQG 230 Query: 268 LETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + + ++ +V + + A + Sbjct: 231 NIYSSNKYITKKGLDNSSAKLFPKNTVVIAMYGATAGDVGILRFETS----TNQAVCGIL 286 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 P+ + + SY ++ + ++ +K + + KEQ I ++ Sbjct: 287 PNELFIPEFIYYSFSYRKNELIAQATGNAQPNISQIKIKNTLIPIITKKEQIKIVQELDS 346 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + L + +Q + L+E + S + A +G+ Sbjct: 347 LKEQTKQLEKHYQQKLDNLEELKKSILQKAFSGE 380 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 68/197 (34%), Gaps = 8/197 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ + + K G T DI ++ +V+ G K + D S Sbjct: 188 GWEENELGKVCKTGAGGTPLKSRKEYYENGDIPWLCSGEVKQGNIYSSNKYITKKGLDNS 247 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + +F K ++ G I F+ + + P + L + Sbjct: 248 SAKLFPKNTVVIAMYGATAGDVGILRFETSTNQAVCGILPNE-LFIPEFIYYSFSYRKNE 306 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + A G + I N +PI EQ+ I +++ + + L + ++ L+ Sbjct: 307 LIAQATGNAQPNISQIKIKNTLIPIITKKEQIKIVQELDSLKEQTKQLEKHYQQKLDNLE 366 Query: 197 EKKQALVSYIVTKGLNP 213 E K++++ + L P Sbjct: 367 ELKKSILQKAFSGELIP 383 >gi|225352838|ref|ZP_03743861.1| hypothetical protein BIFPSEUDO_04471 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156327|gb|EEG69896.1| hypothetical protein BIFPSEUDO_04471 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 448 Score = 124 bits (310), Expect = 4e-26, Method: Composition-based stats. Identities = 52/394 (13%), Positives = 120/394 (30%), Gaps = 27/394 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + + R S ++I+ + + + + + + + I G Sbjct: 22 WEQRKLGELFEESDERA--SDREILSVSVANGIYPASE--SDRETNPGASLANYKIVHFG 77 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG---WLLSIDVTQRIEAIC 141 ++Y + + + +DGI S ++V +P + + + + Sbjct: 78 DVVYNSMRMWQGAVDASRYDGIVSPAYVVARPNSEVYARFFARLLRQPMLLKQYQQVSQG 137 Query: 142 EGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + +I + +P EQ I IT R + L K+ Sbjct: 138 NSKDTQVLKFDDFASIGISMPASENEQRQIGGFFDRLDSL----ITLHQRKYDKLCVLKK 193 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 +++ + KG + +++ +G EV F +N L L Sbjct: 194 SMLDKMFPKGGSLYPEIRFAGFTDPWEQRKLGEVAHFIN--GRAYSQNELLSSGKYPVLR 251 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 GN L+ E G++++ + + I Sbjct: 252 VGNFYTNDSWYYSNLELEDKN---YAYEGDLLYTWSATFGPHI-----WHGNKVIYHYHI 303 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDIT 379 V+ A+ + D ++ + ++ VL+P ++EQ I Sbjct: 304 WKVQLEAALEKLFAFQLLERDKERILSDKNGSTMVHITKTGIENTSVLMPCSVEEQRRIG 363 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + R+D L+ ++ L + S + Sbjct: 364 AFFD----RLDSLITLHQRKYDKLCVLKKSMLDK 393 >gi|23452782|gb|AAN33162.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 402 Score = 124 bits (310), Expect = 4e-26, Method: Composition-based stats. Identities = 58/415 (13%), Positives = 124/415 (29%), Gaps = 34/415 (8%) Query: 21 IPKHWKVVPIKRFTK-----LNTG-------RTSESGKDIIYIGLEDVESGTGKYLPKDG 68 +P+ WK+ + + G ++ K I + + + Sbjct: 4 LPQGWKMETLGEILSSDKYSIKRGPFGSALKKSFFVEKGIRIFEQYNPINNDPHWKRYFI 63 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQF--LVLQPKDVLPELL 124 + + +G +L G + + GI + + L +L Sbjct: 64 SHEKFQELEAFKATEGDLLISCSGTLGKIVELPKDTEMGIINQALLKIRLNNIKILNSYF 123 Query: 125 QGWLLSIDVTQRIEAICEGATMSHA-DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + S + ++I G+ + + K + I +P+PPL +Q I + +ID Sbjct: 124 IYYFNSPIMQEKILESTLGSAIKNIASVKILKQIEIPLPPLKKQERIVGILDESFAKIDE 183 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 I + + L E Q+ + + + +P WE K + Sbjct: 184 SIKILEQNLLNLDELMQSALQKAFNPLKD--------NAKENYKLPQSWEWKSLEEISEN 235 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 ++ K + I N + I+ P I + Sbjct: 236 ISAGGDKPKNCTESKTAKNQIPVYANGVNNNGLVGYTDKATIIKPS----LTISARGTIG 291 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + I+ + + + YL + + L K Sbjct: 292 FVCIRKEPYFPIVRLISLIPCENILCLHYLYFCLN-----FFIAKGEGSSIPQLTIPKFK 346 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L + +PP+KEQ I ++ + L E + + +E + S + A G+ Sbjct: 347 SLQIPLPPLKEQEQIAKHLDFIFEKTKALKELYTKELKDYEELKQSLLNKAFKGE 401 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 66/193 (34%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ ++ ++ G + ++ Y N+ + Sbjct: 219 KLPQSWEWKSLEEISENISAGGDKPKN----CTESKTAKNQIPVYANGVNNNGLVGYTDK 274 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K + G I + + + L P + + L + + E Sbjct: 275 ATIIKPSLTISARGTIGFVCIRKEPYFPI-VRLISLIPCENILCLHYLYFCLNFFIAKGE 333 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+++ ++ +P+PPL EQ I + + + L + ++ +E Sbjct: 334 ----GSSIPQLTIPKFKSLQIPLPPLKEQEQIAKHLDFIFEKTKALKELYTKELKDYEEL 389 Query: 199 KQALVSYIVTKGL 211 KQ+L++ L Sbjct: 390 KQSLLNKAFKGEL 402 >gi|78773881|gb|ABB51229.1| type I RM system S subunit [Arthrospira platensis] Length = 395 Score = 124 bits (310), Expect = 4e-26, Method: Composition-based stats. Identities = 70/411 (17%), Positives = 140/411 (34%), Gaps = 50/411 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-------SGTGKYLPKDGNSRQSD- 74 K WK+V + ++L T T+ + + V + GK+LP + D Sbjct: 2 KDWKIVSLNEISELITKGTTPTSVGFKFFDTGKVNFVKVETITDNGKFLPSKLAHIEMDC 61 Query: 75 --TSTVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVL---PELLQGW 127 + S G IL+ G R AI+ + +++ K PE + Sbjct: 62 HHSLKRSQLKSGDILFSIAGALGRTAIVTSDILPANTNQALAIIRLKSSNAIHPEFVFRS 121 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S + ++I+ G + I N +P+PPL EQ I + ID I Sbjct: 122 LSSGMLIKQIKKSKGGVAQQNLSLTQIKNFKIPLPPLEEQKRIVAILDEAFEGIDAAIAN 181 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + + +E + + + + + + PFF E+ R Sbjct: 182 TQKNLANARELFDSYLQSLDAEKRYLGEIVDIKTGKLNANAATEDGQYPFFTCSKEIYRI 241 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + + L+ N + ++ K +Y+ ++ Sbjct: 242 SEYAFDCEAILLAGNNAVGDFNVKHYKGKFNAYQRTYVI--------------------- 280 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + + YL + + ++G+ + LK + +K L + Sbjct: 281 -------------AVSEASQVLYRYLYFQLLKSLKMLKIQSVGANTKF-LKLDMIKNLQI 326 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +P I++Q + V+N + L ++ + LKE + S + A TG+ Sbjct: 327 ALPDIEKQQKLVLVLNELESETQRLESIYQRKLEALKELKQSILQKAFTGE 377 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 69/200 (34%), Gaps = 2/200 (1%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 MKD I + + + V K+ + +++ + ++ + Sbjct: 1 MKDWKIVSLNEISELITKGTTPTSVGFKFFDTGKVNFVKVETITDNGKFLPSKLAHIEMD 60 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA-VKPHGIDSTYLAW 335 + G+I+F + S + A + + I ++ Sbjct: 61 CHHSLKRSQLKSGDILFSIAGALGRTAIVTSDILPANTNQALAIIRLKSSNAIHPEFVFR 120 Query: 336 LMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + S L K G +Q+L +K + +PP++EQ I +++ ID + Sbjct: 121 SLSSGMLIKQIKKSKGGVAQQNLSLTQIKNFKIPLPPLEEQKRIVAILDEAFEGIDAAIA 180 Query: 395 KIEQSIVLLKERRSSFIAAA 414 ++++ +E S++ + Sbjct: 181 NTQKNLANARELFDSYLQSL 200 >gi|237747136|ref|ZP_04577616.1| type I restriction-modification system specificity subunit [Oxalobacter formigenes HOxBLS] gi|229378487|gb|EEO28578.1| type I restriction-modification system specificity subunit [Oxalobacter formigenes HOxBLS] Length = 380 Score = 123 bits (309), Expect = 4e-26, Method: Composition-based stats. Identities = 52/395 (13%), Positives = 121/395 (30%), Gaps = 37/395 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIG-LEDVESGTGK--YLPKDGNSRQSDTSTVSI 80 W+ + +G T Y G + ++SG Y + S+ + Sbjct: 19 GWEEKKLGDVCVTFSGGTPSVTNSTYYNGCIPFIKSGEINKSYTEAFLTEKGLKNSSAKL 78 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG +LY G ++ I+ +G + L ++ + + L+ L + Sbjct: 79 VKKGDLLYALYGATSGESGISKINGAINQAILCIKSDILDLKYLKNLLCFNKNRITGMYL 138 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G + + I ++ P EQ I + A +I + + + K Q Sbjct: 139 QGG--QGNLSAEIIKSLKFYFPSSPEQTKIANFLSAIDEKISHINKKLDLLKQYKKGMMQ 196 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + + + + G WE K F + + + K ++ + I + Sbjct: 197 KIFNQDIRFK------------DENGEEFPEWEEKEFNNVFSTIPSKKYQIFSTEINEVG 244 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 ++ + + G + I + D ++ + Sbjct: 245 QFPVLDQSQALIAGYSDQ--------QDKVCHISPIIVFGDHTTVVKYFEKPFIVGADGT 296 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + H + + +++ + Y F ++ P I+EQ I N Sbjct: 297 KLLFCHNGITKFFLYVIEFDPVIPEGYKR--------HFSLLREKNFPFPCIEEQTKIAN 348 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ ID + +E+ + +E + + Sbjct: 349 FLSA----IDEKIALVEKQLASTREYKKGLMQQLF 379 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 73/204 (35%), Gaps = 9/204 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 G E++G + N+ I + G I + + K Sbjct: 14 GEEFLGWEEKKLGDVCVTFSGGTPSVTNSTYYNGCIPFIKSGEINKSYTEAFLTEKGLKN 73 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + ++V G++++ + + + G I A + +K +D YL L+ + Sbjct: 74 SSAKLVKKGDLLYALYGATSGESGISKI----NGAINQAILCIKSDILDLKYLKNLL-CF 128 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + ++ G + +L E +K L P EQ I N ++ ID + I + + Sbjct: 129 NKNRITGMYLQGGQGNLSAEIIKSLKFYFPSSPEQTKIANFLSA----IDEKISHINKKL 184 Query: 401 VLLKERRSSFIAAAVTGQIDLRGE 424 LLK+ + + I + E Sbjct: 185 DLLKQYKKGMMQKIFNQDIRFKDE 208 >gi|154487134|ref|ZP_02028541.1| hypothetical protein BIFADO_00974 [Bifidobacterium adolescentis L2-32] gi|154084997|gb|EDN84042.1| hypothetical protein BIFADO_00974 [Bifidobacterium adolescentis L2-32] Length = 405 Score = 123 bits (309), Expect = 4e-26, Method: Composition-based stats. Identities = 65/403 (16%), Positives = 131/403 (32%), Gaps = 36/403 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + + K+ E Y D + + S+ Sbjct: 22 WEQRKLGELASKRIEKNTNGIKETFTNSAEHGVVSQLDYFDHDITNDAN-IGNYSVVHPD 80 Query: 85 QILYGKL-------GPYLRKAIIADFDGICSTQFLVLQPKDVLPE----LLQGWLLSIDV 133 +Y GP R + D +G+ S + V D + + Sbjct: 81 DFIYNPRISAVAPCGPINRNKL--DRNGVMSPLYTVFSVDDTIDKLYLEHYFKTSRWHQF 138 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + +P+ P L EQ LI IT R + Sbjct: 139 MFLEGNSGARSDRFSISDSIFFEMPIQCPVLEEQELIASFFGRLDSL----ITLHQRKYD 194 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L K++++ + KG + +++ +G D WE + L E + K+ + Sbjct: 195 KLCVLKKSMLDKMFPKGGSLYPEIRFAG------FTDPWEQRKLGELFEEHSEKDRDDLP 248 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + G + + RN+ S Y++VD G+ + + + Sbjct: 249 ALTIIQGGGTVHRDESNRNLQFDRNSLSNYKVVDTGDFIVHLRSFEG-----GLEKATCC 303 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-VFYAMGSGLR--QSLKFEDVKRLPVLVP 370 G+++ AY + +DS + RS G+R +S+ E +K + + Sbjct: 304 GLVSPAYHIFRGKNVDSDFYYLYFRSKRFIDADLKPHVYGIRDGRSIDIEGMKTIFIPWT 363 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + EQ I + R+D L+ ++ + LL+ + S + Sbjct: 364 NLAEQRRIGAFFD----RLDSLITLHQRKLELLRNIKKSMLDK 402 >gi|148982143|ref|ZP_01816610.1| type I restriction-modification system, S subunit [Vibrionales bacterium SWAT-3] gi|145960648|gb|EDK25995.1| type I restriction-modification system, S subunit [Vibrionales bacterium SWAT-3] Length = 390 Score = 123 bits (309), Expect = 4e-26, Method: Composition-based stats. Identities = 64/415 (15%), Positives = 135/415 (32%), Gaps = 38/415 (9%) Query: 21 IPKHWKVVPIKRFT--KLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W+ +K ++ G + + + D+ T + S + Sbjct: 2 VPNGWEEKSLKDICQKTISYGIVQTGENIENGVPCVRVVDLSKNTLNPVEMIKTSDKIHQ 61 Query: 76 S-TVSIFAKGQILYGKLGPYL--RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S +I +G+++ G +K + L P + W L + Sbjct: 62 SYKKTILCEGELMMALRGEIGLVKKVTPELVGANITRGLARLSPIKSVDSDYLLWTLRSN 121 Query: 133 VTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + G + + + + +PIPPL EQ I + + D I + Sbjct: 122 KIKNELSRKSGGSALQEIALGSLRKVVLPIPPLPEQRKIAQIL----STWDRGIATTEKL 177 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I+ K++K+AL+ ++T + + E + WE + K Sbjct: 178 IDASKQQKKALMQQLLT------CQKRLVDPETGKAFQEEWEDTHLSNITVIKKGKA--- 228 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 LS N++ G K Y I S + Sbjct: 229 -------LSAKNLVAGSYPVIAGGKSSPYSHVDFTHENVITVSASGAYAGYVSYHPYK-- 279 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 I S V + + + +++ G + + +D++ + + VP Sbjct: 280 ---IWASDCSVVTAKPANYLGFIFQWLQLNQIRIYSMQSGGAQPHIYPKDLEVMKLRVPK 336 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I+EQ I +V+ I+VL E + K+ + + + + G+ +R + + Sbjct: 337 IEEQQKIASVLTAADKEIEVL----EAKLAHFKQEKKALMQQLLMGKRRVRVDEE 387 >gi|15789430|ref|NP_279254.1| RmeS [Halobacterium sp. NRC-1] gi|169235142|ref|YP_001688342.1| type I site-specific deoxyribonuclease subunit rmeS [Halobacterium salinarum R1] gi|10579756|gb|AAG18734.1| type I restriction modification enzyme, S subunit [Halobacterium sp. NRC-1] gi|167726208|emb|CAP12988.1| type I site-specific deoxyribonuclease subunit rmeS [Halobacterium salinarum R1] Length = 475 Score = 123 bits (309), Expect = 5e-26, Method: Composition-based stats. Identities = 72/428 (16%), Positives = 137/428 (32%), Gaps = 40/428 (9%) Query: 22 PKHWKVVPIKRFTK-LNTGRTSESGKD-IIYIGLEDVESG-----TGKYLPKDGNSRQSD 74 P W + + + G+ D + I E + +YL +D Sbjct: 43 PGEWTAKRLGDIKQLITRGKQPTYDDDGVPVINQECIYWDGWHFENLRYLEEDV---AEG 99 Query: 75 TSTVSIFAKGQILYGKLG--PYLRK-AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 G ++ G R D + +L+ + L + Sbjct: 100 WKEKYFPESGDVIVNSTGQGTLGRAQVYPGDQRRAIDSHVTLLRTDEQLCPHFHRYFFES 159 Query: 132 DVTQ---RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + Q + + +P P+PPL EQ I + +D I + Sbjct: 160 HLGQALLYSMCVNGSTGQIELSKTRLDLLPTPLPPLEEQRKIASVLYN----VDQAIQKT 215 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV--------PDHWEVKPFFAL 240 IE ++ KQ L+ + TKGL+ ++ S + GL P W V+ L Sbjct: 216 EAVIEKIERLKQGLLDDLFTKGLSESNSLRPSPEDHPGLYKKERRQTIPSEWNVESLQNL 275 Query: 241 VTELNR----KNTKLIESNILSLSYGNIIQKLETRN--MGLKPESYETYQIVDPG-EIVF 293 E + +E + ++ ++ PE E Y E + Sbjct: 276 CVENITYGIVQPGPHVEDGVPYINTEDMTDGDIPTEGLSRTSPEIAEKYSRSQIHAEELV 335 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I ++ + V +D+T+L W +RS + A G Sbjct: 336 VTIRATIGAVDQVPPELEGANLTRGTARVVPGDKVDNTFLLWAIRSNNFQSELDARVKGT 395 Query: 354 -RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + + + ++PV P I EQ I + ++ I+ +E E + LK + + Sbjct: 396 TFDEINLDQLGKIPVPHPDIDEQDRIVDELST----IEERMENEESYLEQLKRLKQGLMQ 451 Query: 413 AAVTGQID 420 ++G++ Sbjct: 452 DLLSGEVR 459 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 65/197 (32%), Gaps = 9/197 (4%) Query: 21 IPKHWKVVPIKRFT--KLNTGRTSES---GKDIIYIGLEDVESGTGKYLP-KDGNSRQSD 74 IP W V ++ + G + YI ED+ G + ++ Sbjct: 263 IPSEWNVESLQNLCVENITYGIVQPGPHVEDGVPYINTEDMTDGDIPTEGLSRTSPEIAE 322 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVLPELLQGWLLSI 131 + S +++ + + V+ V L + S Sbjct: 323 KYSRSQIHAEELVVTIRATIGAVDQVPPELEGANLTRGTARVVPGDKVDNTFLLWAIRSN 382 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + ++A +G T + +G IP+P P + EQ I +++ R++ + + Sbjct: 383 NFQSELDARVKGTTFDEINLDQLGKIPVPHPDIDEQDRIVDELSTIEERMENEESYLEQL 442 Query: 192 IELLKEKKQALVSYIVT 208 L + Q L+S V Sbjct: 443 KRLKQGLMQDLLSGEVR 459 >gi|262374616|ref|ZP_06067889.1| predicted protein [Acinetobacter junii SH205] gi|262310406|gb|EEY91497.1| predicted protein [Acinetobacter junii SH205] Length = 398 Score = 123 bits (308), Expect = 5e-26, Method: Composition-based stats. Identities = 60/401 (14%), Positives = 122/401 (30%), Gaps = 30/401 (7%) Query: 24 HWKVVPIKRFTK-LNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W I T+ L G + YI +++ + S+ Sbjct: 14 DWSRYKIAEVTEYLVDGTHFSPKTTEGEFKYITSKNIRNDGLDLTNISYISKDEHEKIYK 73 Query: 80 --IFAKGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G IL K G + +F + S L + + L S Sbjct: 74 RCKVQLGDILLTKDGANTGNCCLNTLDEEFSLLSSVAVLRGKKDSFNNNFILQILQSDLG 133 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I + G ++ + + P L EQ I + A +I L + + Sbjct: 134 QDTIISSMSGQAITRITLAKLKDYSFFFPELTEQTQITSFLSAVDEKISQLTQKHALLSQ 193 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + Q L S ++ K G WE K + + Sbjct: 194 YKQGMMQKLFSQ--------QIRFKADDGSEFGE----WEEKELKEVAEINPKAKKLPAN 241 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + L Q L +N+ L+ +++ G+++F+ + + Sbjct: 242 FIYIDLESVEKGQLLLQKNIELQDAPSRAQRLLAKGDVLFQMVRPYQQNNYY--FNLSGE 299 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPI 372 + ++ Y ++ DS ++ + + +G ++ D+ + + VP + Sbjct: 300 YVASTGYAQIRTKL-DSKFIYYALHEKTFLDEVMNRCTGTSYPAINSSDLSSIEIFVPCL 358 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I N ++ ID +E + Q I K + + Sbjct: 359 EEQTKIANFLSA----IDQKIEVVAQQIEQAKTWKKGLLQQ 395 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 25/209 (11%), Positives = 62/209 (29%), Gaps = 5/209 (2%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ K+ +W + ++ + + Sbjct: 4 PKLRFKEFDGDWSRYKIAEVTEYLVDGTHFSPKTTEGEFKYITSKNIRNDGLDLTNISYI 63 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + E V G+I+ L + + + A + K ++ + Sbjct: 64 SKDEHEKIYKRCKVQLGDILLTKDGANTGNCCLNTLDEEFSLLSSVAVLRGKKDSFNNNF 123 Query: 333 LAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + +++S + +M + +K P + EQ IT+ ++ +I Sbjct: 124 ILQILQSDLGQDTIISSMSGQAITRITLAKLKDYSFFFPELTEQTQITSFLSAVDEKISQ 183 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQID 420 L +K LL + + + + QI Sbjct: 184 LTQKH----ALLSQYKQGMMQKLFSQQIR 208 >gi|288932536|ref|YP_003436596.1| restriction modification system DNA specificity domain protein [Ferroglobus placidus DSM 10642] gi|288894784|gb|ADC66321.1| restriction modification system DNA specificity domain protein [Ferroglobus placidus DSM 10642] Length = 421 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 65/417 (15%), Positives = 143/417 (34%), Gaps = 35/417 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W VV + + ++ D I +++ + G Y N + Sbjct: 24 EIPEDWGVVKLGKVVEVW---------DKYRIPVKEQDRKPGPYPYCGANGIIDYVDGYT 74 Query: 80 IFAKGQILY-----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 G+ + G GP+ + A I + V K + ++ +L+ Sbjct: 75 --HDGEFVLLAEDGGYFGPFEKSAYIMRGKFWANNH--VHILKAIANKMTSEFLMFYLNF 130 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G+T + IP+P P L EQ I E + I+ + L Sbjct: 131 MDLRPFLTGSTRPKLTQTDMLRIPLPKPSLPEQKAIAEILSTVDRAIEKTDEIIAKVERL 190 Query: 195 LKEKKQALVSY----IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 K Q L++ V G + +G VP+ WEV + + Sbjct: 191 KKGLMQELLAGRVRVKVENGKIRFYRETRFKDSEIGKVPEDWEVVKLGKVAEQRKEIVDP 250 Query: 251 LIESNILSLSYGNIIQKLETR--NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + ET+ N G E + +I++ + DK + + Sbjct: 251 TEVDPVTPYVGLEHVNSGETKLSNFGKAEEVVSSKYRFYIRDILYGKLRPYLDKAVISNI 310 Query: 309 QVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL 365 G+ ++ ++ ++ +L +++ + A +G + + + Sbjct: 311 ----NGVCSTDFIVMRTKRDYTIPDFLIYVLHTKRFIDYSTAGMTGTNHPRTSWNWIAKF 366 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +PP++EQ I +++ +D +E ++ L+ + + +TG++ ++ Sbjct: 367 EFPLPPLQEQKAIAEILST----LDKKLELEKKEKERLERIKKGLMNVLLTGRVRVK 419 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 54/173 (31%), Positives = 79/173 (45%), Gaps = 9/173 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKD 67 +KDS IG +P+ W+VV + + + + D Y+GLE V SG K + Sbjct: 220 FKDSE---IGKVPEDWEVVKLGKVAEQRKEIVDPTEVDPVTPYVGLEHVNSGETKL--SN 274 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK--DVLPELLQ 125 + S+ F ILYGKL PYL KA+I++ +G+CST F+V++ K +P+ L Sbjct: 275 FGKAEEVVSSKYRFYIRDILYGKLRPYLDKAVISNINGVCSTDFIVMRTKRDYTIPDFLI 334 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L + A G W I P+PPL EQ I E + Sbjct: 335 YVLHTKRFIDYSTAGMTGTNHPRTSWNWIAKFEFPLPPLQEQKAIAEILSTLD 387 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 67/202 (33%), Gaps = 13/202 (6%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E +P+ W V +V ++ + E + Y + E Sbjct: 20 ELGCEIPEDWGVVKLGKVVEVWDKYRIPVKEQDRKPGPYPYCGANGIIDYVDGYTHDGEF 79 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + + G + + + + + + + S +L + + DL Sbjct: 80 VLLAEDG----GYFGPFEKSAYIMRGKFWANNHV--HILKAIANKMTSEFLMFYLNFMDL 133 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 R L D+ R+P+ P + EQ I +++ D +EK ++ I Sbjct: 134 RPFL---TGSTRPKLTQTDMLRIPLPKPSLPEQKAIAEILSTV----DRAIEKTDEIIAK 186 Query: 403 LKERRSSFIAAAVTGQIDLRGE 424 ++ + + + G++ ++ E Sbjct: 187 VERLKKGLMQELLAGRVRVKVE 208 >gi|261855231|ref|YP_003262514.1| restriction modification system DNA specificity domain protein [Halothiobacillus neapolitanus c2] gi|261835700|gb|ACX95467.1| restriction modification system DNA specificity domain protein [Halothiobacillus neapolitanus c2] Length = 401 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 54/404 (13%), Positives = 120/404 (29%), Gaps = 20/404 (4%) Query: 26 KVVPIKRFTKLNTGR----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-----S 76 KVVP+K ++ + + + + + + +V + + Sbjct: 4 KVVPLKDLFQIGSSKRVLKSQWKAEGVPFYRGREVTRLAMDGFVDNELFISEAHYAELAN 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I+ +G I+ D D +L K + + + T Sbjct: 64 QYGAPRTDDIVITAIGTIGNSYIVQDGDRFYFKDASILWMKRISDVSSKFVNFWLKSTMF 123 Query: 137 IEAIC--EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ + GAT+ + + ++ + +PP+AEQ I + I + + Sbjct: 124 LDQLDHGNGATVDTLTIQKLQSVQIWVPPIAEQHRIVSILDEAFEGIAKARAHAEQNRQN 183 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + L + G L + + + K + Sbjct: 184 ARA--------LFESHLQSVFTQRGEGWAEKSLEEVVDAQCTLSYGIVQPGHEYAKGMPI 235 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + +I + + K + GE++ S Sbjct: 236 VRPTDLTAKLITLNGLKRIDPKLADGYRRTTLRGGELLLCVRGSTGVLAVTSSELAGANV 295 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIK 373 + P + + +LM S + G + D++++ V PP+K Sbjct: 296 TRGIVPIMFDPSLLSQDFGYFLMTSEAVQSQIRIKTYGTALMQINIGDLRKIAVSFPPLK 355 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 EQ +T + +A L +Q + L E + S + A +G Sbjct: 356 EQERMTAQLEELSAETQRLESIYQQKLAALDELKKSLLHQAFSG 399 >gi|298245052|ref|ZP_06968858.1| restriction modification system DNA specificity domain protein [Ktedonobacter racemifer DSM 44963] gi|297552533|gb|EFH86398.1| restriction modification system DNA specificity domain protein [Ktedonobacter racemifer DSM 44963] Length = 433 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 59/436 (13%), Positives = 142/436 (32%), Gaps = 35/436 (8%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTG--------RTSESGKDIIYIGLEDVESG 59 P Y+ + IG IP+ WK+ + +++N G + +YI ++ + S Sbjct: 12 PGYRLTE---IGIIPEDWKLKTFRDVSRVNQGLQIAIEKRSKKPTNNSKVYITIQYLNS- 67 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + ++ K IL + G + + + + Sbjct: 68 ------SKEAEYIDNYTSAVCCGKDDILMTRTGNTGYIVSGVEGVFHNNFFKINYDKAIL 121 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L +L I +T+ + +IP+P+P +EQ+ I + + V Sbjct: 122 DKGFLFYYLHLNSTQNIILTRAGASTIPDLNHNDFYSIPIPVPTKSEQIAIAKALSDVDV 181 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +L + ++ + Q L++ + ++ G++P+ W VK Sbjct: 182 LTASLDKLIAKKRDIKQATTQQLLTGKIRLPGFVGIRNPVYKQTEAGMIPEDWTVKKLGE 241 Query: 240 LVTELNRKN--TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP--------G 289 + N + + L++ ++ Y+ + Sbjct: 242 VCLYQNGTSLERYFNRNQGLNVISIGNYSIDGNYIDTNSYIDWKHYKEIKKFILNQDELC 301 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYA 348 ++ + + + + + M +KP + YL +++ S + + Sbjct: 302 MVLNDKTSVGAIIGRVLLIKEDNKYVFNQRSMRIKPLDEVLPGYLYYIINSNLIHDKIVS 361 Query: 349 MGS-GLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + G + + D+ L + P ++EQ I +++ A +EQ Sbjct: 362 LAKPGTQIYVNTGDITGLDIPFPQSLEEQQAIATILSDMDAEF----AALEQRREKTHAL 417 Query: 407 RSSFIAAAVTGQIDLR 422 + + +TG+ L Sbjct: 418 KQGMLQELLTGKTRLT 433 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 67/207 (32%), Gaps = 18/207 (8%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +G++P+ W++K F + + + + + + ++ N + E + Y Sbjct: 18 EIGIIPEDWKLKTFRDVSRVNQGLQIAIEKRSKKPTNNSKVYITIQYLNSSKEAEYIDNY 77 Query: 284 Q---IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 +I+ S + +D +L + + Sbjct: 78 TSAVCCGKDDILMTRTGNTGYIVSGVEGVFHNNFF----KINYDKAILDKGFLFYYLHLN 133 Query: 341 DLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVI---NVETARIDVLVEKI 396 + L D +P+ VP EQ I + +V TA +D L+ K Sbjct: 134 STQNIILTRAGASTIPDLNHNDFYSIPIPVPTKSEQIAIAKALSDVDVLTASLDKLIAKK 193 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDLRG 423 ++ + + +TG+I L G Sbjct: 194 -------RDIKQATTQQLLTGKIRLPG 213 >gi|182679588|ref|YP_001833734.1| restriction modification system DNA specificity subunit [Beijerinckia indica subsp. indica ATCC 9039] gi|182635471|gb|ACB96245.1| restriction modification system DNA specificity domain [Beijerinckia indica subsp. indica ATCC 9039] Length = 415 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 66/425 (15%), Positives = 130/425 (30%), Gaps = 28/425 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESG-TGK 62 YK + V G IP+ WK+ + F K+ TG T + ++ D+ S Sbjct: 5 YKQTEV---GMIPEDWKIASVSSFGKVVTGGTPPTTNRSFWNGFYPWVTPTDISSDRDIY 61 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + S +L + + A++ G C+ Q + P Sbjct: 62 LTERCITDAGMKVSGS--LPANSVLVTCIASIGKNAVL-KTFGSCNQQINAIIPNGRHDS 118 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + L R+ + S +I +PP E+ A D Sbjct: 119 I-FIYYLIEFNKNRLLSKAGITATSIISKSLFESIVFAVPPTLEEQRAIA---AALGDAD 174 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 LI I ++ KQA + ++T G +E ++ + Sbjct: 175 ALIASLEALIAKKRDIKQAAMQQLLT-GKTRLPGFSGKWVEHDFNEIFNFLRNGSNSRSD 233 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + I + S + + + + V G+++ + Sbjct: 234 LSENGDVGYIHYGDIHSSPSAFMDFSKGTFIRISNHKVSNLPRVHDGDLIIADASEDYNG 293 Query: 303 RS----LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS-YDLCKVFYAMGSGL-RQS 356 +R E + + S ++ L + +G+ Sbjct: 294 IGKSVEVRGISDTEVVAGLHTLLLRGKRELLSDGFKGYLQFVPALKSALIRIANGISVYG 353 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + +VK + VL+PPI EQ I +V++ A + +E + + +T Sbjct: 354 ISKTNVKAISVLLPPIDEQSAIASVLSDMDAE----ITALETKRDKAHAVKQGMMQELLT 409 Query: 417 GQIDL 421 G+I L Sbjct: 410 GRIRL 414 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 68/205 (33%), Gaps = 7/205 (3%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESYE 281 VG++P+ W++ + + + R++ L Sbjct: 9 EVGMIPEDWKIASVSSFGKVVTGGTPPTTNRSFWNGFYPWVTPTDISSDRDIYLTERCIT 68 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + G + + + ++A + G A+ P+G + + + ++ Sbjct: 69 DAGMKVSGSLPANSVLVTCIASIGKNAVLKTFGSCNQQINAIIPNGRHDSIFIYYLIEFN 128 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSI 400 ++ G + + + VPP ++EQ I + A I L I + Sbjct: 129 KNRLLSKAGITATSIISKSLFESIVFAVPPTLEEQRAIAAALGDADALIASLEALIAKK- 187 Query: 401 VLLKERRSSFIAAAVTGQIDLRGES 425 ++ + + + +TG+ L G S Sbjct: 188 ---RDIKQAAMQQLLTGKTRLPGFS 209 >gi|7658152|gb|AAF66082.1|AF097472_2 type IC HsdS subunit [Lactococcus lactis subsp. cremoris] Length = 422 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 66/418 (15%), Positives = 149/418 (35%), Gaps = 37/418 (8%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ +K T+ R+++ D+ + + + G+ Sbjct: 15 KVPELRFPGFTDDWEERKLKDVTE--RVRSNDGRMDLPTLTMSASSGWLDQKDRFSGDIS 72 Query: 72 QSDTSTVSIFAKGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVL---PELL 124 + ++ KG++ Y KL Y + +++ + + Sbjct: 73 GKEKKNYTLLKKGELSYNHGNSKLAKYGVVFSLTNYEEALVPRVYHSFKALENTSADFIE 132 Query: 125 QGWLLSIDVTQRIEAICEGATMS---HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + + I GA M + ++ NI + IP EQ+L+ ++ Sbjct: 133 YMFSTKLPDRELGKLISSGARMDGLLNINYDDFMNIHISIPNYEEQILMSAF----FRKL 188 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 D I R ++LLKE+K+ + + K +++ +G + ++ Sbjct: 189 DENIALHQRKLDLLKEQKKGYLQKMFPKNGEKVPELRFAG---FADDWEERKLGDIGDTF 245 Query: 242 TELNRK-NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQ 299 T L K + ++Y N+ Q L + Q V G++ F Sbjct: 246 TGLTGKTKEDFGHGSAKFVTYVNVFQNPIATLDQLDAVEIDEKQNQVQKGDVFFTTSSET 305 Query: 300 NDKRSLRSAQVME--RGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL-RQ 355 ++ + S + + S +P D Y+A ++RS + K + G+ R Sbjct: 306 PEEVGMSSVWTYDTKNVYLNSFTFGYRPRVSFDLNYMASMLRSPSIRKKITFLAQGISRY 365 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ + + + P + EQ I + +D + ++ + LLKE++ F+ Sbjct: 366 NISKTKMLEIEIPAPNLSEQKKIGSF----FKLLDDTIALHQRKLDLLKEQKKGFLQK 419 >gi|294790581|ref|ZP_06755739.1| type I site-specific deoxyribonuclease (specificity subunit) [Scardovia inopinata F0304] gi|294458478|gb|EFG26831.1| type I site-specific deoxyribonuclease (specificity subunit) [Scardovia inopinata F0304] Length = 403 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 68/397 (17%), Positives = 126/397 (31%), Gaps = 32/397 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + + + G L+ + + +++ S + KG Sbjct: 26 WEQRKLGDVFEEYSEKNH--GDLPPLTVLQGSGTIQRDESSRVLLYKKASLSNYKLVNKG 83 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + L + I+ GI S + + + + S + + Sbjct: 84 DFIL-HLRSFEGGLEISKQRGIISPAYHTFHGEGANSKFYYLFFRSYNFINILLKPYIYG 142 Query: 145 TMS--HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + D G+ I +P P + EQ I E I ++ + + Q + Sbjct: 143 IRDGKNIDIDGMKEIMIPYPVIEEQRKIGEFFKTLDDLIAATERKKELLQKKKQAYLQLI 202 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI--ESNILSLS 260 S + WE + + ++ KN E+ S Sbjct: 203 FSQHLRFKG----------------FTKPWEQRKLNDIAYKVTEKNNNFSIRETFTNSAE 246 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSA 319 G + Q E+ Y IV + V+ I + + GII+ Sbjct: 247 LGIVSQLDFFDRNLSNAENIVNYYIVRAKDFVYNPRISAAAPVGPINCNNLEREGIISPL 306 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLVPPIKEQ 375 Y + H ID+ YL W +S D K G R S+K +P+ P I+EQ Sbjct: 307 YTVFRTHCIDTNYLEWFFKSSDWHKFMRYYGDSGARSDRFSIKDSLFFEMPIPYPVIEEQ 366 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I +D L+ ++ I LLK R+ +++ Sbjct: 367 RKIGEF----FKTLDDLIAATDKKINLLKRRKKAYLQ 399 >gi|304310052|ref|YP_003809650.1| Type I restriction-modification system specificity subunit [gamma proteobacterium HdN1] gi|301795785|emb|CBL43984.1| Type I restriction-modification system specificity subunit [gamma proteobacterium HdN1] Length = 399 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 45/409 (11%), Positives = 119/409 (29%), Gaps = 22/409 (5%) Query: 24 HWKVVPIKRFT-KLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W P+ F + +G T S G +I ++ ++V + S Sbjct: 2 SWISAPLSDFCIDVKSGGTPSSHVESYYGGEIPWLRTQEVVFKKILDTELKITDEGLNNS 61 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + ++ G ++ + + L D + + + Sbjct: 62 SAKWIPENSVIVAMYGNSAGRSAVNKIPLTTNQACCNLIIDDETSDYRYVFYALCKSYEE 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++++ +GA ++ + + + +P PP Q I + + + I+ E + Sbjct: 122 LKSLSKGAAQNNLNAAQVKSFNIPKPPKKVQEKIGDILSSYDDLIENNRRRIQLLEESAR 181 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 Q ++ G ++K G + +RK + +I Sbjct: 182 LLYQEWFVHLRFPG---HEQVKIIDGVPEGWSSGVLSDFFETSSGGTPSRKIPEFYAGDI 238 Query: 257 LSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + E + ++ G ++ + + + Sbjct: 239 PWVKTQELNDSYIFNTSEKISEEAIIKSSAKLFPAGTVLIAMYGATIGETGVLAISAASN 298 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + +T L + + ++ + ++ P ++P Sbjct: 299 QAC---CALFPKNKELTTEFTHLFAMNSKQGLINLSQGAAQNNISQQIIRNFPFVLPS-- 353 Query: 374 EQFDITNVINVETARIDVLVEKIEQS-IVLLKERRSSFIAAAVTGQIDL 421 I N + I +E+ I LLK R + ++G++ + Sbjct: 354 --ELILKEFNDVVSNIYNQKFNLERQNISLLKA-RDLLLPKLMSGELTV 399 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 65/190 (34%), Gaps = 6/190 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W + F + ++G T DI ++ +++ + + Sbjct: 205 VPEGWSSGVLSDFFETSSGGTPSRKIPEFYAGDIPWVKTQELNDSYIFNTSEKISEEAII 264 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ +F G +L G + + + + L PK+ L +++ Sbjct: 265 KSSAKLFPAGTVLIAMYGATIGETGVLAISAASNQACCALFPKNKELTTEFTHLFAMNSK 324 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 Q + + +GA ++ + I N P +P + + + L + I ++ Sbjct: 325 QGLINLSQGAAQNNISQQIIRNFPFVLPSELILKEFNDVVSNIYNQKFNLERQNISLLKA 384 Query: 195 LKEKKQALVS 204 L+S Sbjct: 385 RDLLLPKLMS 394 >gi|182683807|ref|YP_001835554.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae CGSP14] gi|182629141|gb|ACB90089.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae CGSP14] Length = 372 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 54/398 (13%), Positives = 111/398 (27%), Gaps = 36/398 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI +P L EQ I ++ + I + L+ Sbjct: 120 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNLLV---------- 169 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 K E PD + + + + N + + Sbjct: 170 ------------KSRFNEMFEEYPDSVFLDTYIKELRAGKSLAGEENNKNKVLKTGAVSY 217 Query: 266 QKLETRNMGLKPESYE--TYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYM 321 + + P Y V+ G+++ ++ A + + Sbjct: 218 DYFNSSEVKNLPIDYIPLDEHKVEIGDVIISRMNTSELVGAAGYVWAINSDNIYLPDRLW 277 Query: 322 AVKPHGIDSTYLAWLM----RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 V + + W + ++ K + SG +++ + ++ V PP+ Q + Sbjct: 278 KVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRVPFPPLALQNE 337 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + A +D I++S+ L+ + S + Sbjct: 338 FADFV----ALVDKSQLAIQKSLEELETLKKSLMQEYF 371 >gi|312876128|ref|ZP_07736116.1| restriction modification system DNA specificity domain [Caldicellulosiruptor lactoaceticus 6A] gi|311797114|gb|EFR13455.1| restriction modification system DNA specificity domain [Caldicellulosiruptor lactoaceticus 6A] Length = 445 Score = 122 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 75/441 (17%), Positives = 153/441 (34%), Gaps = 47/441 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDT 75 PK W +V ++R L +G + S + I +G E + G + + Sbjct: 6 EFPKEWTIVSLERDCVLISGLRPKGGASDEGIPSLGGEHITLDGRINFSDVNAKYVPEKF 65 Query: 76 ST---VSIFAKGQILYGKLGPYLRKAIIADFDGI----CSTQFLVLQPKDVLPELLQGWL 128 + IL K G K I + +L+ K + + + Sbjct: 66 FKIMTKGKAEENDILVNKDGANTGKVAILKKKFYKDIAINEHLFILRSKKLFVQQYLFYW 125 Query: 129 LSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 ++I G+ I N +P PPL EQ I E + ID I + Sbjct: 126 FFSRFGQKQITDRITGSAQPGLSSTFIKNFLVPRPPLPEQRKIAEIL----ETIDNAIEK 181 Query: 188 RIRFIELLKEKKQALVSYIVTKG-----------------LNPDVKMKDSGIEWVGLVPD 230 IE K KQ L+ ++TKG + +++D I+ P Sbjct: 182 TDAIIEKYKRIKQGLMQDLLTKGVVSEGEGESESKSESEGESEKWRLRDEKIDKFKDSPL 241 Query: 231 HW----EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 + ++ +N K T + + + ++ E Sbjct: 242 GRIPEEWEVRHISDISLINPKTTVNPRESYPYIEMDATPIMGKRYKYITYRKASEAGVKF 301 Query: 287 DPGEIVFRFIDLQ-NDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCK 344 G+++ I + ++L + GI ++ ++ + +D+ YL +L+ S + Sbjct: 302 KKGDVLIARITPCAENGKALLVPNDIHIGIGSTEFIVFRAKENVDNVYLFYLLISDLVRN 361 Query: 345 --VFYAMGSGLRQSLKFEDVKRL-PVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSI 400 + G+ RQ + + V +P EQ + +++ ++ID ++EK + Sbjct: 362 VSIGLMEGTSGRQRIPKYVYSDIIKVAIPKSKTEQQRVASIL----SQIDEVIEKEQAYK 417 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 L+ + + +TG++ + Sbjct: 418 EKLERIKKGLMEDLLTGKVRV 438 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 78/205 (38%), Gaps = 11/205 (5%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD 67 ++KDS +G IP+ W+V I + +N T + YI ++ +Y Sbjct: 234 DKFKDSP---LGRIPEEWEVRHISDISLINPKTTVNPRESYPYIEMDATPIMGKRYKYIT 290 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRK-----AIIADFDGICSTQFLVLQPKDVLPE 122 F KG +L ++ P GI ST+F+V + K+ + Sbjct: 291 YRKASEAGVK---FKKGDVLIARITPCAENGKALLVPNDIHIGIGSTEFIVFRAKENVDN 347 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + +LL D+ + + T + + + ++++ + +ID Sbjct: 348 VYLFYLLISDLVRNVSIGLMEGTSGRQRIPKYVYSDIIKVAIPKSKTEQQRVASILSQID 407 Query: 183 TLITERIRFIELLKEKKQALVSYIV 207 +I + + E L+ K+ L+ ++ Sbjct: 408 EVIEKEQAYKEKLERIKKGLMEDLL 432 >gi|289450014|ref|YP_003475630.1| type I restriction modification DNA specificity domain-containing protein [Clostridiales genomosp. BVAB3 str. UPII9-5] gi|289184561|gb|ADC90986.1| type I restriction modification DNA specificity domain protein [Clostridiales genomosp. BVAB3 str. UPII9-5] Length = 433 Score = 122 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 58/426 (13%), Positives = 113/426 (26%), Gaps = 54/426 (12%) Query: 18 IGAI-----PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLP 65 +G + P + I + G T DI + +ED+ + Sbjct: 4 LGELIQELCPDGVEYKRIDEICVVQNGYTPSKKNNEFWEDGDIPWFRMEDIRQNGRELDD 63 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + S+F +++ A+I + + + + Sbjct: 64 AVQHITSLGVRG-SVFPANSLIFATTATVGEHALIKVPFVCNQQLTHIHINDEYIDAIEI 122 Query: 126 GWLLSIDVT---QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 +L G+T+ D + +P+PPL Q I + T Sbjct: 123 RYLFHCAFIIDEMCKNNTKGGSTLPAVDLNKFKSFKIPVPPLEVQREIVRILDNFTELTA 182 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 L E + K++ + ++T + G + + Sbjct: 183 ELTAELTAELTARKKQYEYYRDMLLTF-------------DARGEAISDVVWRTLGEVCN 229 Query: 243 ELNRK--NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + K + LI + R IV Q Sbjct: 230 LQSGKAISAYLISDTQTVENSIPCYGANGLRGYVSTSNESGDKPIV----------GRQG 279 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 A + +S +L L+ + DL + +G + L Sbjct: 280 ALCGNVCFATGSYYATEHAVVVTDKGFFNSRFLYHLLVNADLNQY---KTAGAQPGLSVA 336 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKERRSSFIAA 413 + + V VP Q I NV++ A L + ++ R + + Sbjct: 337 RLNEVKVPVPTRTVQDRIANVLDNFDAICSDLNIGLPAEIAARQKQYEY---YRDALLTY 393 Query: 414 AVTGQI 419 A TG+I Sbjct: 394 AATGKI 399 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 64/213 (30%), Gaps = 24/213 (11%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ--- 284 PD E K + N + + + R G + + + Sbjct: 12 CPDGVEYKRIDEICVVQNGYTPSKKNNEFWEDGDIPWFRMEDIRQNGRELDDAVQHITSL 71 Query: 285 -----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + ++F + ++ V + + ++ + ID+ + +L Sbjct: 72 GVRGSVFPANSLIFATTATVGEHALIKVPFVCNQQLT---HIHINDEYIDAIEIRYLFHC 128 Query: 340 YDLCKVF---YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + G ++ K + VPP++ Q +I +++ T L ++ Sbjct: 129 AFIIDEMCKNNTKGGSTLPAVDLNKFKSFKIPVPPLEVQREIVRILDNFTELTAELTAEL 188 Query: 397 EQSIVLLKE----RRSSFIAAAVTGQIDLRGES 425 + K+ R + D RGE+ Sbjct: 189 TAELTARKKQYEYYRDMLLT------FDARGEA 215 >gi|325983687|ref|YP_004296089.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] gi|325533206|gb|ADZ27927.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] Length = 420 Score = 122 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 62/419 (14%), Positives = 120/419 (28%), Gaps = 45/419 (10%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESG 59 +P+++++G WK + G T GK +I D+ Sbjct: 15 RFPEFREAG---------EWKEAKLGLIGIFTGGGTPSKGKASYWAGTNPWISSSDISDD 65 Query: 60 TGKYL--PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 + + + + + I IL + K I S F+ P Sbjct: 66 NIQDICISRFISDEAIQETATKIVPANSILLVSR-VGVGKLAITRKPLCTSQDFMNFTPA 124 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 +L+ + A +G + N+ +P+P EQ I + +I+ Sbjct: 125 Q--DDLVFLAYCLKSLKDTFLAFNQGMAIKGFTKDDAFNLRIPLPTQDEQQKIADCLISL 182 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 IT ++ ++ LK K+ L+ + K++ G Sbjct: 183 D----ERITLEVQKLDTLKTHKKGLMQQLFPAEGETLPKLRFPEFRDAGE-----WNLKL 233 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 V ++ +S N +TY Sbjct: 234 LGAVCDMQAGK----FVAATEISEQNRNDLYSCFGGNGLRGYTKTYTHS-------GRYS 282 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 L + +L + G + AV + WL + L + + L Sbjct: 283 LIGRQGALCGNVNLVDGFFHATEHAVVTTPKAGIHTDWLFYTLTLLNLNRFATGQAQPGL 342 Query: 358 KFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + ++ VP + EQ I N + ID L+ Q + LK + + Sbjct: 343 SVDVLNKIECAVPKDEQEQRKIANCL----TSIDDLITAQTQKLAALKTHKQGLMQQLF 397 >gi|217980299|ref|YP_002364275.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS223] gi|217500936|gb|ACK48908.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS223] Length = 406 Score = 122 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 64/414 (15%), Positives = 133/414 (32%), Gaps = 32/414 (7%) Query: 21 IPKHWKVVPIKRFT-KLNTGRTSESGK-------DIIYIGLEDV-ESGTGKYLPKDGNSR 71 +P+ W + I KL TG+T + K ++ + D + + +S Sbjct: 8 LPEGWHLETIGEVASKLVTGKTPSTKKAEYYSSSEVDWFTPSDFGSTAVLNNSRRKLSSL 67 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + T+ K IL +G + K +A+ + + Q + K+ + + Sbjct: 68 AIEDGTIKKMPKDSILLVAIGATIGKVGLAEDESCFNQQVTGIHFKEKIH-PKYAYYWLS 126 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + I AT+ + GI + P EQ I EK+ A RIDT I Sbjct: 127 YIKPEIITKSSQATLPIINQTGIKGLSFLYPEKEEQKCIVEKLDALLTRIDTAIEHLQES 186 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I L Q+ + + + ++ +P + Sbjct: 187 ITLKNSLLQSALDGQFSAITERMTIESLAEVKGGKRLPK---------------GEKLSD 231 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYE-----TYQIVDPGEIVFRFIDLQNDKRSLR 306 E+ + + K G+K S E ++ ++ + Sbjct: 232 EETEHPYIRVADFTDKGTIDLSGIKYISKEIHEQIKRYVISKDDLYISIAGTIGKTGFVP 291 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKR 364 S +A + +K +L S + A + + L + + Sbjct: 292 SELDGANLTENAAKLVIKDKQQLDLSYLYLFTLTSDFSAQAGLATKTVAQPKLALTRLSK 351 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + + ++EQ + + I ++I + I LK ++S + +A G+ Sbjct: 352 IEIPICSLEEQKSLVSTIEALKSKIHDAEAVLLGKIEDLKSLKASILDSAFKGE 405 >gi|254876851|ref|ZP_05249561.1| predicted protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254842872|gb|EET21286.1| predicted protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 379 Score = 122 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 63/402 (15%), Positives = 138/402 (34%), Gaps = 30/402 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P W+ +++ + S + + L+ +E+ G+Y P G + Sbjct: 2 PAGWEWEKLEKVCD----KASSN------LSLKKIENEDGEY-PIYGAKGFIKNISFFHR 50 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I K G + + + D L PK+ + +L + + Sbjct: 51 EEPYISIIKDGAGVGRVTMLDSKSSVIGTLQYLLPKNCID---IKYLYFLLLVIDFGKYV 107 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G T+ H ++ +P+PPLAEQ I K+ + +ID I + I + Sbjct: 108 SGTTIPHIYYRDYKEHLVPLPPLAEQKRIVAKLDSLFEKIDKAIELHQQNITNANTLMAS 167 Query: 202 LVSYIVTK--GLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + K G +KD I+ G P + + + N + + Sbjct: 168 TLDKTFKKLEGEYSYKNLKDITIKIGSGATPKGGQKAYKQKGTSLIRSMNVHDMGFSKKG 227 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 L++ + Q + +N+ IV+ +++ + + + + Sbjct: 228 LAFIDDSQADKLKNV-----------IVEKDDVLLNITGASVARCCVVCESALPARVNQH 276 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + S +L + + S F + G R+++ ++ L V + Q Sbjct: 277 VSIIRLNDSFISKFLHYYLISPMKKTELLFSSSGGATREAITKSMIENLQVPDISLPIQQ 336 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ ++D + + EQ + LK ++S + A G+ Sbjct: 337 QTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAFRGE 378 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 57/179 (31%), Gaps = 15/179 (8%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 +P WE + V + N L + Y K +N+ I+ Sbjct: 1 MPAGWEWEKL-EKVCDKASSNLSLKKIENEDGEYPIYGAKGFIKNISFFHREEPYISIIK 59 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 G + + +I + + + ID YL +L+ D K Sbjct: 60 DG-----------AGVGRVTMLDSKSSVIGTLQYLLPKNCIDIKYLYFLLLVIDFGKYV- 107 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + + D K V +PP+ EQ I ++ +ID +E +Q+I Sbjct: 108 --SGTTIPHIYYRDYKEHLVPLPPLAEQKRIVAKLDSLFEKIDKAIELHQQNITNANTL 164 >gi|22299772|ref|NP_683019.1| type I site-specific deoxyribonuclease specificity subunit [Thermosynechococcus elongatus BP-1] gi|22295956|dbj|BAC09781.1| type I site-specific deoxyribonuclease specificity subunit [Thermosynechococcus elongatus BP-1] Length = 410 Score = 122 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 61/427 (14%), Positives = 141/427 (33%), Gaps = 42/427 (9%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKY 63 +P+++++G P W+V + + G ++ + + GL G G Sbjct: 15 RFPEFRNAG-------P--WEVKRLGDMCDMQAGSFIKASEIRLVPEAGLNPCYGGNG-- 63 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 G ++ L G+ G Y +A + +V+ PK Sbjct: 64 --LRGYTKSFTHIGRFP------LIGRQGAYSGNVQLAQGRFHATEHAVVVTPKQSTNID 115 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + + + + G + ++ +P P L EQ I + + + Sbjct: 116 FLFF---LLIRGELSRLATGQAQPGLSVASLNSVSIPFPALPEQQKIADCLSSLDEL--- 169 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 I + + +E LK K+ L+ + + +++ G + Sbjct: 170 -IELQAKKLEALKAHKKGLMQQLFPREGETTPRLRFPEFRDAGPWEVKRLGEVACEFSDG 228 Query: 244 LNRKNTKLIESNILSLSYGNI-IQKLETRNMGLKPESYETYQIVD-----PGEIVFRFID 297 ++ I + GNI + K + S ET++ + PG+++ + Sbjct: 229 DWIESKDQSPDGIRLIQTGNIGLGKFIDNTEKARFISEETFERLSCSEVFPGDLLISRL- 287 Query: 298 LQNDKRSLRSAQVMERGI--ITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLR 354 R + +R I + + + ++ +V R Sbjct: 288 PDPAGRCCLIPNIGKRMITAVDCTIVRFDLKQAHPYFCLSYCQTDQYFKEVAARSAGSTR 347 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + +++ + V +P + EQ I + + + +D L+E + + LK + + Sbjct: 348 TRISRQNLADVRVPLPTLPEQQKIADCL----SSLDELIELQAKKLEALKAHKKGLMQQL 403 Query: 415 VTGQIDL 421 +IDL Sbjct: 404 FPQEIDL 410 >gi|242279138|ref|YP_002991267.1| restriction modification system DNA specificity domain protein [Desulfovibrio salexigens DSM 2638] gi|242122032|gb|ACS79728.1| restriction modification system DNA specificity domain protein [Desulfovibrio salexigens DSM 2638] Length = 400 Score = 122 bits (307), Expect = 7e-26, Method: Composition-based stats. Identities = 68/415 (16%), Positives = 146/415 (35%), Gaps = 44/415 (10%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + + ++ G++ S K E T + G + S +F + G Sbjct: 5 LTQLASIHYGKSPSSTKA---------EESTIPIIGTGGQTGWGKDS---LFEGPATVVG 52 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + G + T + VL K++ + L L D+ E + E + Sbjct: 53 RKGTLGNPLYVETPFWPIDTTYAVLPYKNIHAKWLYYSLADCDL----EKLNEATGVPSI 108 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + +G I + +Q I + + +D I + I + KQ ++ + T+ Sbjct: 109 NRDYLGRIKISFVEFPQQRKIAKIL----TTVDNQIEKTEELIAKYESVKQGMMQDLFTR 164 Query: 210 GLNPDVKMKDSGIE--------WVGLVPDHWEVKPFFALVTEL-----NRKNTKLIESNI 256 G++ + K++ + +G +P WEV P + TE+ R Sbjct: 165 GVDENGKLRPKREDAPELYKKTELGWIPREWEVLPCIDVCTEIVVGIVIRPTQYYTSYGT 224 Query: 257 LSLSYGNII----QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 L N+ M + + +V G++V + + Sbjct: 225 PVLRSANVKEEGLDSSALIFMTEENNQKLSKSMVRAGDLVTVRTGYPG--TTCVIPSDFD 282 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 + ++ + I S +LA + S G +Q ++K L V++P Sbjct: 283 KANCVDIIISRPDNSISSIFLATWINSSFGKGQVLKRQGGLAQQHFNVGEMKDLLVVLPS 342 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 EQ IT +N + L+ + ++ + LK +++ + +TG+I++ + + Sbjct: 343 QTEQDKITKRLNSLKKK---LITE-KKQLTKLKHLKTALMQDLLTGKIEVTPDPE 393 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 78/216 (36%), Gaps = 12/216 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 YK + +G IP+ W+V+P ++ G + G + S K D Sbjct: 183 YKKTE---LGWIPREWEVLPCIDVCTEIVVGIVIRPTQYYTSYGTPVLRSANVKEEGLDS 239 Query: 69 ------NSRQSDTSTVSIFAKGQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + + S+ G ++ + G P I +DFD ++ +P + + Sbjct: 240 SALIFMTEENNQKLSKSMVRAGDLVTVRTGYPGTTCVIPSDFDKANCVDIIISRPDNSIS 299 Query: 122 ELLQGWLLSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + ++ ++ G H + + ++ + +P EQ I +++ + + Sbjct: 300 SIFLATWINSSFGKGQVLKRQGGLAQQHFNVGEMKDLLVVLPSQTEQDKITKRLNSLKKK 359 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 + T + + L Q L++ + +P+ Sbjct: 360 LITEKKQLTKLKHLKTALMQDLLTGKIEVTPDPEDM 395 >gi|189440822|ref|YP_001955903.1| restriction endonuclease S subunit [Bifidobacterium longum DJO10A] gi|189429257|gb|ACD99405.1| Restriction endonuclease S subunit [Bifidobacterium longum DJO10A] Length = 413 Score = 122 bits (307), Expect = 8e-26, Method: Composition-based stats. Identities = 69/409 (16%), Positives = 143/409 (34%), Gaps = 37/409 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + T + + +++ E ++ +++S+ + + A Sbjct: 19 WEQRKLGEIADKVTEKNLDGNITEVLTNSAEYGVINQTEFFDH-AVAKESNIAGYYVIAP 77 Query: 84 GQILYGKL-------GPYLRKAIIADFDGICSTQFLVLQPKDVLP----ELLQGWLLSID 132 G +Y GP R + G+ S + V + D + Sbjct: 78 GDFVYNPRISATAPVGPIRRNTL--GIHGVMSPLYTVFRLTDAVDGTYLSHFFKTNGWHG 135 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + +P+P+P +EQ I R+D LIT R Sbjct: 136 FMKLEGNSGARSDRFSIGDATFFEMPIPVPSSSEQYAIGSF----FSRLDDLITLHQRKY 191 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--TK 250 + L K++++ + K +++ +G D WE + + ++ KN Sbjct: 192 DKLVIFKKSMLEKMFPKDGESVPEIRFAG------FTDPWEQRKLGEIADKVTAKNLDGN 245 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQ 309 + E S YG I Q + K + Y ++ PG+ V+ I +R Sbjct: 246 ITEVLTNSAEYGVINQTEFFDHAVAKESNIAGYYVIAPGDFVYNPRISATAPVGPIRRNT 305 Query: 310 VMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKR 364 + G+++ Y + +D TYL+ ++ G+ R S+ Sbjct: 306 LGIHGVMSPLYTVFRLTDAVDGTYLSHFFKTNGWHGFMKLEGNSGARSDRFSIGDATFFE 365 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P+ VP EQ I + +R+D L+ ++ + LL++ + S + Sbjct: 366 MPIPVPSSSEQHAIGSF----FSRLDNLITLHQRKLELLQDIKKSLLDK 410 >gi|312128927|ref|YP_003996267.1| restriction modification system DNA specificity domain [Leadbetterella byssophila DSM 17132] gi|311905473|gb|ADQ15914.1| restriction modification system DNA specificity domain [Leadbetterella byssophila DSM 17132] Length = 495 Score = 122 bits (307), Expect = 9e-26, Method: Composition-based stats. Identities = 63/474 (13%), Positives = 130/474 (27%), Gaps = 75/474 (15%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +H+ V+ I +G T DI ++ +++ G + S Sbjct: 26 EHYPVLKIADIADTTSGGTPNRGMPEYYNGDIPWVKSGELKDGVITTCDEYITEAGLKNS 85 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + +F KG +L G + K I DFD + + PK + W Sbjct: 86 SAKLFPKGTLLVAMYGANIGKTGILDFDATTNQAVCAIFPKVDISREFLSWYFKQQ-RID 144 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE------------------T 178 A+ +G + I N + +P Q I + + Sbjct: 145 FIAVGKGGAQPNISQTIINNASIVVPDEKVQKAIVKFLERIEKGDGIDYDFFIPEVLKDV 204 Query: 179 VRIDTLITERIRFIELLKEK-------KQALVSYIVTKGLNPDVKMKDSGIEWV------ 225 I + + + + QA++ V L P + E + Sbjct: 205 ETIYKYKNSYVTLSDSFESQLTQLENLNQAILQEAVQGKLVPQDPNDEPASELLKRIKAE 264 Query: 226 --------------------------GLVPDHWEVKPFFALVTELNRKNTKLIES--NIL 257 +P++W + R I Sbjct: 265 KATLRQAQGKGKKEKPLPPIKPEEIPFEIPENWVWCRLGEICEVNPRNKVDDEIDAGFIP 324 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI-- 315 + T + + + ++V I + + GI Sbjct: 325 MPMVSQLFGVKPTYEVRKWGAIKKGFTHFANNDVVIAKITPCFENSKAGIISDLPNGIGA 384 Query: 316 -ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPI 372 T + I Y+ ++ D K + G+ +Q + + + +PP+ Sbjct: 385 GTTELNVLRGNQYILPEYVYAFVKRIDFLKNGERIMKGVAGQQRVPTDYFYNTLIPLPPL 444 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 EQ I I + A+ L E I + ++ + + A +++ + Sbjct: 445 AEQKRIVAEIEKQFAKTKQLKEHIIANQQATEQLLKALLHQAF----EVKEMEE 494 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 79/214 (36%), Gaps = 10/214 (4%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG 61 K K P K + + IP++W + ++N + D +I + V G Sbjct: 276 KKEKPLPPIKPEEIPF--EIPENWVWCRLGEICEVNPRNKVDDEIDAGFIPMPMVSQLFG 333 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI------IADFDGICSTQFLVLQ 115 + + + FA ++ K+ P + + + G +T+ VL+ Sbjct: 334 VKPTYEVRKWGAIKKGFTHFANNDVVIAKITPCFENSKAGIISDLPNGIGAGTTELNVLR 393 Query: 116 PKDVL-PELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREK 173 + PE + ++ ID + E I +G N +P+PPLAEQ I + Sbjct: 394 GNQYILPEYVYAFVKRIDFLKNGERIMKGVAGQQRVPTDYFYNTLIPLPPLAEQKRIVAE 453 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 I + + L I + ++ +AL+ Sbjct: 454 IEKQFAKTKQLKEHIIANQQATEQLLKALLHQAF 487 >gi|78046749|ref|YP_362924.1| type I site-specific deoxyribonuclease (specificity subunit) [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78035179|emb|CAJ22824.1| type I site-specific deoxyribonuclease (specificity subunit) [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 439 Score = 122 bits (306), Expect = 9e-26, Method: Composition-based stats. Identities = 80/426 (18%), Positives = 155/426 (36%), Gaps = 29/426 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P+ W ++ ++ N ++ ++ ++ ++ V G L + + Sbjct: 9 LPQGWTRRRLRFDSRCNPVKSKLDLPDDTEVSFVPMDAVGELGGLRLDQ-TRELADVYNG 67 Query: 78 VSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLS- 130 + FA G + K+ P + + +T+ VL+P L +L Sbjct: 68 YTYFADGDVCIAKITPCFENGKGAIAEGLVNGVAFGTTELHVLRPSATLDTKFLFYLTIA 127 Query: 131 IDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 D EA GA + + + +P + Q I + +T RID LI ++ Sbjct: 128 RDFRSHGEAEMRGAGGQKRVPEEFLKDWTPSLPRMDVQQRIARFLDDKTARIDALIEKKQ 187 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDS----GIEWVGLVPDHWEVKPFFALVTELN 245 +E L+EK++AL++ VT + K+ G W+ P W+VK Sbjct: 188 ELLERLEEKRRALITSAVTGESRLRTQAKNKTQKFGSAWLEAAPSDWKVKRLRFAFESCK 247 Query: 246 RKNT------KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP--GEIVFRFID 297 + I + + +L N L+ +++ V G+++ Sbjct: 248 NGVWGAEPDDEDSIVCIRAADFDGQSGRLNNGNRTLRTIDNWSFEKVRLNFGDLILEKSG 307 Query: 298 LQND--KRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCK--VFYAMGSG 352 + + ++ +P +L +LM + L + + S Sbjct: 308 GGDKQLVGRAVLFDGHTPSVCSNFLARCRPRIGFHHRFLNYLMLAIYLGRGTYPHIKQST 367 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 Q++ + V +P Q DI+N ++ A IDV+ + +SI L RS I Sbjct: 368 GIQNIDTGSYFDMRVAIPEENIQIDISNFLDESVAAIDVIRSCVIRSIEKLNNFRSIVIT 427 Query: 413 AAVTGQ 418 AVTGQ Sbjct: 428 DAVTGQ 433 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 80/207 (38%), Gaps = 9/207 (4%) Query: 229 PDHWEVKPFFALVTELNRKN----TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 P W + K+ E + + + + L + Y Y Sbjct: 10 PQGWTRRRLRFDSRCNPVKSKLDLPDDTEVSFVPMDAVGELGGLRLDQTRELADVYNGYT 69 Query: 285 IVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM---RS 339 G++ I N K ++ V T+ ++P T + + R Sbjct: 70 YFADGDVCIAKITPCFENGKGAIAEGLVNGVAFGTTELHVLRPSATLDTKFLFYLTIARD 129 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + G+G ++ + E +K +P + Q I ++ +TARID L+EK ++ Sbjct: 130 FRSHGEAEMRGAGGQKRVPEEFLKDWTPSLPRMDVQQRIARFLDDKTARIDALIEKKQEL 189 Query: 400 IVLLKERRSSFIAAAVTGQIDLRGESQ 426 + L+E+R + I +AVTG+ LR +++ Sbjct: 190 LERLEEKRRALITSAVTGESRLRTQAK 216 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 71/212 (33%), Gaps = 14/212 (6%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 G W+ A P WKV ++ + G + I+ I D + +G+ + Sbjct: 223 GSAWLEAAPSDWKVKRLRFAFESCKNGVWGAEPDDEDSIVCIRAADFDGQSGRLNNGNRT 282 Query: 70 SRQSDTST--VSIFAKGQILYGKLGP-----YLRKAIIADF-DGICSTQFLVLQPKDVLP 121 R D + G ++ K G R + +CS +P+ Sbjct: 283 LRTIDNWSFEKVRLNFGDLILEKSGGGDKQLVGRAVLFDGHTPSVCSNFLARCRPRIGFH 342 Query: 122 ELLQGWLLSIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 +L+ I + + + D ++ + IP Q+ I + Sbjct: 343 HRFLNYLMLAIYLGRGTYPHIKQSTGIQNIDTGSYFDMRVAIPEENIQIDISNFLDESVA 402 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 ID + + IR IE L + +++ VT L Sbjct: 403 AIDVIRSCVIRSIEKLNNFRSIVITDAVTGQL 434 >gi|254478523|ref|ZP_05091898.1| Type I restriction modification DNA specificity domain protein [Carboxydibrachium pacificum DSM 12653] gi|214035531|gb|EEB76230.1| Type I restriction modification DNA specificity domain protein [Carboxydibrachium pacificum DSM 12653] Length = 386 Score = 122 bits (306), Expect = 9e-26, Method: Composition-based stats. Identities = 56/402 (13%), Positives = 129/402 (32%), Gaps = 32/402 (7%) Query: 29 PIKRFTKLNTGR-TSESGKDII--YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + K+N R G ++ + V+ G + + F +G Sbjct: 2 RLGEVCKINPRRPRLIRGDGAPTSFVPMRAVDEFLGMIVEIQIRPFAEVRKGYTYFEEGD 61 Query: 86 ILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI-- 137 +L+ K+ P + + D G ST+F VL+P + + + +V + Sbjct: 62 VLFAKITPCMENGKAAIAKGLIDGIGFGSTEFHVLRPSLEVIAEWVWYFVRQEVFRNKAK 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E+ G + + +P+PPL EQ I K+ A R+ + R + + Sbjct: 122 ESFRGGVGQQRVPQDFLESYLLPLPPLEEQRRIVAKVEALMERVREVRRLRAEAQKDTEL 181 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q ++ + +P W + + ++ N Sbjct: 182 LMQTALAEVFPHP--------------GADLPPGWRWVRLGEVCDIIMGQSPPSSTYNFE 227 Query: 258 SLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 K + ++ P + + ++ PG+++ + Sbjct: 228 GNGLPFFQGKADFGDLHPTPRIWCSAPQKVARPGDVLISVRAPVG-----STNVANLACC 282 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I A++P + Y ++ ++ +D++ + + +PP++EQ Sbjct: 283 IGRGLAALRPRDSLERFWLLYYLHYLEPELSKMGAGSTFNAITKKDLQNVFIPLPPLEEQ 342 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 I ++ ++ L ++ LK + + A G Sbjct: 343 RRIVAYLDQIQQQVAALKRAQAETEAELKRLEQAILDKAFRG 384 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 65/192 (33%), Gaps = 2/192 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W+ V + + G++ S G + R ++ Sbjct: 197 DLPPGWRWVRLGEVCDIIMGQSPPSSTYNFEGNGLPFFQGKADFGDLHPTPRIWCSAPQK 256 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L P +A+ L+P+D L + L + + Sbjct: 257 VARPGDVLISVRAPV-GSTNVANLACCIGRGLAALRPRDSLERFWLLYYLH-YLEPELSK 314 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + G+T + K + N+ +P+PPL EQ I + ++ L + LK + Sbjct: 315 MGAGSTFNAITKKDLQNVFIPLPPLEEQRRIVAYLDQIQQQVAALKRAQAETEAELKRLE 374 Query: 200 QALVSYIVTKGL 211 QA++ L Sbjct: 375 QAILDKAFRGDL 386 >gi|291514831|emb|CBK64041.1| Restriction endonuclease S subunits [Alistipes shahii WAL 8301] Length = 404 Score = 122 bits (306), Expect = 1e-25, Method: Composition-based stats. Identities = 58/416 (13%), Positives = 126/416 (30%), Gaps = 46/416 (11%) Query: 18 IGAIPKHWKVVPIKRFTK-LNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +G IP+ W+V + G T + I +E + +G Y +S Sbjct: 23 LGVIPQKWEVKFLGDLLSRCTNGLTYDVSITCGIPVTRIETISTGEINYAKVGYIPNESG 82 Query: 75 TSTVSIFAKGQILYGKLGPY--LRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLS 130 T + KG ILY + + K D + L+L+ + L + + L Sbjct: 83 YETFRM-QKGDILYSHINSLSQIGKVAYYKGDKEIYHGMNLLLLRANESLDKQYLYYTLL 141 Query: 131 IDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 D + + + + + + + +PPLAEQ I E + I+ Sbjct: 142 TDHMRHMAQVIAKPAVNQASISTSDLKRVKIAVPPLAEQRKIAEVLGVWDEAIEKQARLI 201 Query: 189 IRFIELLKEKKQALVSYIV--TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + Q L+S + +P ++K + I + + F Sbjct: 202 EKLALRKRGLMQRLLSAKLRLPGFSDPWKELKINKITIIRKGEQVNKDVLFSNA------ 255 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K N G+ P Y I Sbjct: 256 --------------------KYPVINGGITPSGYLDIYNTKANTITISEGGNS----CDY 291 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + ++ + + + + + +++ +D+ L Sbjct: 292 VNFMTTPFWSGGHCYTIEAKDGINNLCIYQLLKNNEKYIMSLRVGSGLPNIQIKDLGNLK 351 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 ++P +EQ I V+ +E ++ + L+ ++ + +TG+ ++ Sbjct: 352 FMIPTYQEQTAIAEVLTASDRE----IELAKEKLERLRRQKRGLMQQLLTGKKRVK 403 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 78/209 (37%), Gaps = 11/209 (5%) Query: 225 VGLVPDHWEVKPFFALVTELNRK--NTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 +G++P WEVK L++ I I I + Sbjct: 23 LGVIPQKWEVKFLGDLLSRCTNGLTYDVSITCGIPVTRIETISTGEINYAKVGYIPNESG 82 Query: 283 Y--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRS 339 Y + G+I++ I+ + + + + + ++ + D YL + + + Sbjct: 83 YETFRMQKGDILYSHINSLSQIGKVAYYKGDKEIYHGMNLLLLRANESLDKQYLYYTLLT 142 Query: 340 YDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + + + + S+ D+KR+ + VPP+ EQ I V+ V D +EK Sbjct: 143 DHMRHMAQVIAKPAVNQASISTSDLKRVKIAVPPLAEQRKIAEVLGVW----DEAIEKQA 198 Query: 398 QSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + I L R+ + ++ ++ L G S Sbjct: 199 RLIEKLALRKRGLMQRLLSAKLRLPGFSD 227 >gi|308185304|ref|YP_003929437.1| restriction modification system DNA specificity domain protein [Helicobacter pylori SJM180] gi|308061224|gb|ADO03120.1| restriction modification system DNA specificity domain protein [Helicobacter pylori SJM180] Length = 400 Score = 122 bits (306), Expect = 1e-25, Method: Composition-based stats. Identities = 59/408 (14%), Positives = 130/408 (31%), Gaps = 25/408 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSD 74 P +W+ V + ++ G T + I + ++ + + Sbjct: 7 PLNWQRVRLGDIAEIIGGGTPSTQVTSFWNGSINWFTPTEIGITKYVHKSQRTITPLGLK 66 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ + G IL + I + F L P + + + L + + Sbjct: 67 KSSAKLLPIGTILLTSR-ASIGDCAILKVVATTNQGFQSLIPLEKINNE-FLYYLMLTLK 124 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ + G+T I N+ +P+PPL EQ+ I + + +L ++ + Sbjct: 125 NKLLKLASGSTFLEVSPNKIKNLLIPLPPLNEQIAIANILSDLDHYLYSLDALILKKESV 184 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K L+S ++K W + A + S Sbjct: 185 KKALSFELLSQ--------RKRLKGFNQAWQRVRLGDIAEIKRGASPRPIENPKWFCANS 236 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 N+ + +I + +R + + I I + + + Sbjct: 237 NVGWVRISDISKN--SRFLYKTAQELSKKGIEKSRFIKQNSLIMSMCATIGKPIITKIDT 294 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIK 373 I ++ + ID YL + + Y + + G + +L + +K V P + Sbjct: 295 CIHDGFVVFENPKIDLNYLYYFL-CYIEKEWLESGQQGSQVNLNVDLIKNKEVFCPKDLN 353 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ I N+++ I L K Q + + + ++ +I + Sbjct: 354 EQIAIANILSDLDNEIISLKNKKRQ----FENIKKALNHDLMSAKIRV 397 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 24/204 (11%), Positives = 59/204 (28%), Gaps = 10/204 (4%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY---- 283 P +W+ + + + N E +S T Sbjct: 6 TPLNWQRVRLGDIAEIIGGGTPS-TQVTSFWNGSINWFTPTEIGITKYVHKSQRTITPLG 64 Query: 284 -QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + + I L + A + + ++ P + + + Sbjct: 65 LKKSSAKLLPIGTILLTSRASIGDCAILKVVATTNQGFQSLIPLEKINNEFLYYLMLTLK 124 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K+ + +K L + +PP+ EQ I N+++ + L I + Sbjct: 125 NKLLKLASGSTFLEVSPNKIKNLLIPLPPLNEQIAIANILSDLDHYLYSLDALILKK--- 181 Query: 403 LKERRSSFIAAAVTGQIDLRGESQ 426 + + + ++ + L+G +Q Sbjct: 182 -ESVKKALSFELLSQRKRLKGFNQ 204 >gi|325912594|ref|ZP_08174977.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 60-B] gi|325478015|gb|EGC81144.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 60-B] Length = 386 Score = 122 bits (306), Expect = 1e-25, Method: Composition-based stats. Identities = 64/408 (15%), Positives = 131/408 (32%), Gaps = 35/408 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK W+ + + TG + K V G + + + +T + Sbjct: 5 PKDWEEKKLGNLAFIKTGNKNNEDK---------VSGGKYPFYVRSEKVERINTFSY--- 52 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQ-FLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 IL G + + + + D L W + + + Sbjct: 53 DTEAILVPGEGNIGSVFHYVNGKFDVHQRVYAITNFSDTLNAKYLYWFMIKNFGSYALSQ 112 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 AT+ N + IPP EQ I + + A I+ + L EKK+ Sbjct: 113 TSKATVDSLRLPAFKNFDVVIPPFPEQQAIADALTAFDTHINN--------LAKLIEKKK 164 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + V ++ ++ +W + A + K + ++S L Sbjct: 165 MIRDGAVEDLVSGKRRLAGFSGKW--EEISFNDSVIPKARIGWQGLKKDEYLQSGYSYLI 222 Query: 261 YGNIIQKLETRNMGLKPESYETYQI-----VDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 G + + S + Y + V G+++ K ++ + Sbjct: 223 SGTDFYRGTISFEEISYVSKDRYDMDSNIQVKSGDVLVTKDGTIG-KVAIVPNIDKRATL 281 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVP-PIK 373 + ++ I +L W++RS + L +D+K+L ++P + Sbjct: 282 NSGVFVFRVIEKIKRKFLYWILRSSLFSNFIDELSAGSTIKHLYQKDLKKLKFVIPTSLS 341 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ I +++ I L E+ E+ I ++ + +TG+I L Sbjct: 342 EQQAIADILTSMDKEISDLEEEKEKYIA----LKAGAMDDLLTGKIRL 385 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 52/167 (31%), Gaps = 5/167 (2%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + N + + + + + + S+ + + Sbjct: 23 NKNNEDKVSGGKYPFYVRSEKVERINTFSYDTEAILVPGEGNIGSVFHYVNGKFDVHQRV 82 Query: 320 Y-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 Y + +++ YL W M SL+ K V++PP EQ I Sbjct: 83 YAITNFSDTLNAKYLYWFMIKNFGSYALSQTSKATVDSLRLPAFKNFDVVIPPFPEQQAI 142 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + + I+ L + IE+ K R + V+G+ L G S Sbjct: 143 ADALTAFDTHINNLAKLIEKK----KMIRDGAVEDLVSGKRRLAGFS 185 >gi|253577073|ref|ZP_04854395.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] gi|251843567|gb|EES71593.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] Length = 443 Score = 122 bits (306), Expect = 1e-25, Method: Composition-based stats. Identities = 69/431 (16%), Positives = 148/431 (34%), Gaps = 36/431 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDV---ESGTGKYLPKDGNSRQ 72 + W V + + +G T ++ ++I +I +DV E+ T + + + + Sbjct: 4 ETWNNVMLGDVVDIISGGTPKTTITEYWEPEEIDWITAKDVSECENRTIRKTSRRISKKG 63 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + S+ I + G K ++ + LQ K+ +L +L+ Sbjct: 64 LENSSARILEPLTTVLIARGATTGKVALSSEGMAMNQTCYGLQAKEGNDKLFIYYLMLSR 123 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 AI G+ GI +I + IP Q I + + +D I Sbjct: 124 Y-NSFRAIANGSIFETVIGSGIKSIQLNIPTFPIQQSIGKIL----GALDDKIELNNAIN 178 Query: 193 ELLKEKKQALVSYIVTKGLNPDV---KMKDSGIE----WVGLVPDHWEVKPFFALVTELN 245 + L+E QAL P+ K SG E +GL+P W+V + Sbjct: 179 KNLEEMAQALFKRWFVDFEFPNENGEPYKSSGGEFEESELGLIPKGWKVVTIGDYCKVRS 238 Query: 246 R---KNTKLIE----SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 K++ + + GN I +T + + + + +PG+I+ Sbjct: 239 GFAFKSSWWQDEGIKVIKIKNIIGNTINLQDTDCVDEEKMLKASEFLANPGDILIAMTGA 298 Query: 299 QNDKRSLRS---AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 K +L ++ + ++ P + L + ++ + Sbjct: 299 TVGKIALVPRTNEALLINQRVGKFFLGENPFKKNGFLYCLLTQKVVFDQIVSVASGSAQP 358 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ ++ + +L+P K N T + + +I +L + R + + + Sbjct: 359 NISPTGIESIKILLPDPKT----LEYFNEITGSMLKNIVEINYGNKILTQIRDTLLPKLM 414 Query: 416 TGQIDLRGESQ 426 +G+I + E Sbjct: 415 SGEIRVPAEQD 425 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 77/210 (36%), Gaps = 15/210 (7%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTG 61 YK SG ++ +G IPK WKVV I + K+ +G +S + I I ++++ T Sbjct: 206 YKSSGGEFEESELGLIPKGWKVVTIGDYCKVRSGFAFKSSWWQDEGIKVIKIKNIIGNTI 265 Query: 62 KYLPKD-GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQ---FLVL 114 D + + ++ + G IL G + K + + + + + F + Sbjct: 266 NLQDTDCVDEEKMLKASEFLANPGDILIAMTGATVGKIALVPRTNEALLINQRVGKFFLG 325 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + L L V +I ++ G+ + GI +I + +P E Sbjct: 326 ENPFKKNGFLYCLLTQKVVFDQIVSVASGSAQPNISPTGIESIKILLPDPKTLEYFNEIT 385 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + ++ L+S Sbjct: 386 GSMLKNIVEINYGNKILTQIRDTLLPKLMS 415 >gi|29349928|ref|NP_813431.1| putative type I restriction enzyme EcoAI protein [Bacteroides thetaiotaomicron VPI-5482] gi|29341839|gb|AAO79625.1| putative type I restriction enzyme S.BthVORF4518AP [Bacteroides thetaiotaomicron VPI-5482] Length = 474 Score = 122 bits (306), Expect = 1e-25, Method: Composition-based stats. Identities = 58/406 (14%), Positives = 119/406 (29%), Gaps = 30/406 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGN--SRQS 73 +P W ++ + K T +S I ++ + DV G +L Sbjct: 70 EVPSSWVWCKLEDYVKSVTDGDHQAPPKSDIGIPFLVISDVAKGKLNFLNTRFVPQEYYE 129 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S KG +L+ G Y + C + + L E L L S Sbjct: 130 KISFDRKPEKGDLLFTVTGSYGIVVPVNIDCKFCFQRHIGLIKTLNTSEYLLHLLKSSYF 189 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + G + + + +PIPP AEQ I +I I+ + + Sbjct: 190 KGQCDEFATGTAQKTVGLETLRSFLLPIPPFAEQQRIVIEIEKWFSLIELIEGGKDDLQT 249 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV-------------------GLVPDHWEV 234 +K+ K ++ + L P ++ I+ + +P W Sbjct: 250 TIKQAKSKILDLAIHGKLVPQDPNEEPAIKLLKRINPDFTPCDNGHSGKLPYKIPKTWAW 309 Query: 235 KPFFALVTELNRK---NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +++ + + I+ + + + + G+I Sbjct: 310 CSHNSILDISGGSQPAKSYFETIPKPNYIRLYQIRDYGESPVPVYIPINLASKQTEKGDI 369 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + K + A+ + + + I Y + S + Sbjct: 370 LLARYGGSLGK--VFHAKQGAYNVAMVKVIFKFENLIYKEYAYYYYLSDLYQGKLKEISR 427 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + D + +PPI EQ I I + +D + + +E Sbjct: 428 TAQTGFNITDFNDMYFPLPPINEQQRIVQKIEELFSSLDNIQKSLE 473 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 67/201 (33%), Gaps = 13/201 (6%) Query: 227 LVPDHWEVKPFFALVT---ELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-----KPE 278 VP W V + + + + I L ++ + E Sbjct: 70 EVPSSWVWCKLEDYVKSVTDGDHQAPPKSDIGIPFLVISDVAKGKLNFLNTRFVPQEYYE 129 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + G+++F + ++ ++ + S YL L++ Sbjct: 130 KISFDRKPEKGDLLFTVTGS----YGIVVPVNIDCKFCFQRHIGLIKTLNTSEYLLHLLK 185 Query: 339 SYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S +G ++++ E ++ + +PP EQ I I + I+++ + Sbjct: 186 SSYFKGQCDEFATGTAQKTVGLETLRSFLLPIPPFAEQQRIVIEIEKWFSLIELIEGGKD 245 Query: 398 QSIVLLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 246 DLQTTIKQAKSKILDLAIHGK 266 >gi|22416340|emb|CAC87151.1| restriction-modification enzyme type I S subunit [Streptococcus thermophilus] Length = 407 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 61/404 (15%), Positives = 146/404 (36%), Gaps = 29/404 (7%) Query: 24 HWKVVPIKR--------FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ + ++ S + +I +D++ + S++ D Sbjct: 16 DWEQRKLGELSQKISVGIATSSSKYFSSQDHGVPFIKNQDIKENRINTKNLEYISKEFDN 75 Query: 76 STV-SIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSI 131 +G I+ + G A++ +T + +L E + ++ S Sbjct: 76 KNKNKRVKQGDIITARTGYPGLSAVVPKELEGAQTFTTLITRPISEMILSEYISIFINSP 135 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I + G + + + N+ +P+P L EQ I I+ ++D I R Sbjct: 136 YGMKQISGMEAGGAQKNVNAGIVQNLLIPLPSLDEQKKISNFIL----KLDDTIALHQRK 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++LLKE+K+ + + K +++ +G V EV + + K Sbjct: 192 LDLLKEQKKGYLQKMFPKNGAKVPELRFAGFADDWEVRKLNEVSDIYDGT----HQTPKY 247 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 ++ ++ LS NI + + + E G+++ I D + + Sbjct: 248 QDNGVMFLSVENIKTLTSNKFISREAFEDEFKIRPQRGDVLMTRIG---DIGTANVVETD 304 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLV 369 E + K ++ +L + + + + + + ++ ++P+ V Sbjct: 305 EDLAYYVSLALFKSEELNPYFLQASIYAPFVQDQIWKRTLHIAFPKKINKNEIGQVPINV 364 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P + EQ I + ++D + ++ + LLKE++ F+ Sbjct: 365 PTLAEQTKIGSF----FKQLDKTIALHQRKLDLLKEQKKGFLQK 404 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 75/201 (37%), Gaps = 20/201 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTG--RTSE-SGKDIIYIGLEDVESGTGKYLPKDG 68 +P+ W+V + + + G +T + ++++ +E++++ T K Sbjct: 213 KVPELRFAGFADDWEVRKLNEVSDIYDGTHQTPKYQDNGVMFLSVENIKTLTS---NKFI 269 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + + +G +L ++G ++ + + L L + L Sbjct: 270 SREAFEDEFKIRPQRGDVLMTRIGDIGTANVVETDEDLAYYVSLALFKSEELNPYFLQAS 329 Query: 129 LSIDVTQR--IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + Q + A + IG +P+ +P LAEQ I ++D I Sbjct: 330 IYAPFVQDQIWKRTLHIAFPKKINKNEIGQVPINVPTLAEQTKIGSF----FKQLDKTIA 385 Query: 187 ERIRFIELLKEKKQALVSYIV 207 R ++LLKE+K+ + + Sbjct: 386 LHQRKLDLLKEQKKGFLQKMF 406 >gi|31983512|ref|NP_858124.1| putative type i restriction enzyme hsds subunit [Lactobacillus delbrueckii subsp. lactis] gi|18077746|emb|CAD13349.1| putative Type I restriction enzyme hsdS subunit [Lactobacillus delbrueckii subsp. lactis] Length = 396 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 51/391 (13%), Positives = 127/391 (32%), Gaps = 19/391 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + K+ G++ S + G + R T I KG Sbjct: 20 WEQCKLGDVAKITMGQSPNSKNYTDNPKDHILVQGNADMKDGQVHPRIWTTEITKIADKG 79 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ P +D + ++ + + L + G+ Sbjct: 80 DLILSVRAPV-GDIGKTSYDVVIGRGVAAIKGNEFI----FQLLKRMKTVGYWTKYSTGS 134 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T + I N + +P EQ + + + I ++ + L Q + + Sbjct: 135 TFESINSLEINNAVINLPKEHEQNKVGKILSYMDHAITLHEEKKRQLECLKSALLQKMFA 194 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 K P V+ + EW + ++ ++ + + T L+ Sbjct: 195 D---KSGYPVVRFEGFSDEW-----EERKLGDAVSISSGVTGDATLQDGEYRLTRIESIS 246 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 L +G + + +++ G+I++ I+ + + + Sbjct: 247 QGTLNVARLGFTNKKPDQKYLLNLGDILYSNINSLSHIGKVALVDTTGIYHGINLLRFQM 306 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + +DS +L + + + + + + S+ ++ + P+ +P I EQ I + Sbjct: 307 RNDVDSEFLFQRLNTTPMKNWAVSHANPAVSQASINQTELSKQPISLPTITEQQKIGSF- 365 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++D + ++ + LLKE++ F+ Sbjct: 366 ---FKQLDKTIALHQRKLDLLKEQKKGFLQK 393 >gi|310659273|ref|YP_003936994.1| restriction modification system DNA specificity domain [Clostridium sticklandii DSM 519] gi|308826051|emb|CBH22089.1| Restriction modification system DNA specificity domain [Clostridium sticklandii] Length = 405 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 50/399 (12%), Positives = 126/399 (31%), Gaps = 28/399 (7%) Query: 26 KVVPIKRFT-KLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + + K+++G T + +I ++ ++V+ G S+ Sbjct: 17 EWKTLDEIALKISSGGTPRTGVSEYYNGNIPWLRTQEVDFGEIWDTEIKITDVGLKNSSA 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + ++ G + K I + +Q + + + + + I+ Sbjct: 77 KLIPANCVIIAMYGATVGKVGINKIPLSTNQACANIQLDEKIADYRYVFHYISSKYEHIK 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ G + ++ + + + N +PIPPL Q I + T L E + Sbjct: 137 SLGTG-SQTNINAQIVKNYIIPIPPLKVQEEIVRILDTFTELTAELTAELTARKKQYTYY 195 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + L+S +G ++ + G P + + R I Sbjct: 196 RDKLLS--FEEGEIEWKELGEIFNLKNGYTPSKANNEYWTNGTIPWFRMEDIRENGRI-- 251 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 L + + + +I I+ + + + + Sbjct: 252 ---------LSKSIQYVNKSAVKGGKIFPANSIIISTSATIGEHALITVPYLSNQRFTNL 302 Query: 319 AYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + +L ++ D CK + G + K+ + +PP+ EQ Sbjct: 303 SLKDDYINKFVIKFLYHYVFLLDDWCK--NNITVGNFAGVDMNSFKKFKIPIPPLAEQER 360 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 I ++++ A + E + + I L ++ R+ ++ Sbjct: 361 IVSILDKFDALTSSITEGLPREIELRQKQYEYYRNMLLS 399 >gi|90425137|ref|YP_533507.1| type I restriction enzyme StySPI specificity protein [Rhodopseudomonas palustris BisB18] gi|90107151|gb|ABD89188.1| type I restriction enzyme StySPI specificity protein [Rhodopseudomonas palustris BisB18] Length = 460 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 63/423 (14%), Positives = 130/423 (30%), Gaps = 26/423 (6%) Query: 19 GAIPKHWKVVPIKRF--TK---LNTGRTSESGKDIIYIGLED-----VESGTGKYLPKDG 68 G +P W PI + + G S K Y G ++L D Sbjct: 3 GDLPSGWVAAPIDDLRALEPNAITDGPYGSSLKTSHYRSSGARVVRLGNIGFRRFLSADA 62 Query: 69 NSRQSDTST---VSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPE 122 D G +L LG + ++ IA + L+ L Sbjct: 63 VYISEDHFKALVKHHVRAGDVLIAALGDPVGRSCIAPSDISPALVKADCFRLRCSPHLSA 122 Query: 123 LLQGWLLSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L+ + + + G + +P+PP EQ I KI + + Sbjct: 123 PFIMLWLNSECAREAFSSAAHGLGRVRINLSDFRTTVVPVPPATEQGRIVAKIDNLSAKS 182 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 +L+++ KQA+++ L + ++ + +W ++ Sbjct: 183 KRSRDHLDHIPQLVEKYKQAILAAAFRGELTHEWRVNNLDQKWPWPECSLSDIANIGTGA 242 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDL 298 T + NI ++ G + + E+ ++ G I+ Sbjct: 243 TPKRGEQRYYSNGNIPWITSGAVKHAVVQAADEYITEAAVRETNCKVFPAGTILMAMYGE 302 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDS---TYLAWLMRSYDLCKVFYAMGSGLRQ 355 + + + A A++ ++ W +RS ++ G++ Sbjct: 303 GKTRGRVTV--LGINAATNQAVAAIQVRADSPAVRDFVVWHLRS-GYLELRERAAGGVQP 359 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +L V + +P EQ ++ + A ID L + + L+ + +A A Sbjct: 360 NLNLGIVNAWRIPLPSRDEQMEVVRRVQKAFAWIDRLTIETTSARKLIDRLDQAILAKAF 419 Query: 416 TGQ 418 G+ Sbjct: 420 RGE 422 >gi|187736905|ref|YP_001816643.1| HsdS [Escherichia coli 1520] gi|172051487|emb|CAP07829.1| HsdS [Escherichia coli] Length = 427 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 51/405 (12%), Positives = 134/405 (33%), Gaps = 37/405 (9%) Query: 26 KVVPIKRF-TKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + ++ K+++G T ++ DI ++ ++V S+ Sbjct: 17 EWKTLEDISIKISSGGTPKTGVSEFYDGDIPWLRTQEVNFCDIWDTEVKITESGVKNSSA 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 K ++ G + K I + +Q + + + I+ Sbjct: 77 KWIPKNCVIVAMYGATVGKIGINKIPMTTNQACANIQLNEEVAHYRYVFHFLCSQYTYIK 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRF 191 ++ G + ++ + + + NI +PIP LA Q I + T L E Sbjct: 137 SLGTG-SQTNINAQIVKNIKIPIPCPDNPEKSLAIQSEIVRILDKFTALTAELTAELTAE 195 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNT 249 + + K++ ++ K+ +EW +G + + K F + + Sbjct: 196 LNMRKKQHNYYRDQLL--------TFKEGEVEWKALGEIGEFIRGKRFTKADYVEDGGIS 247 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + I Y ++ + + + G++V + + A Sbjct: 248 VIHYGEI----YTRYGVYTTHSLSQVRADMAASLRYAKHGDVVITDVGETVEDVGKAVAW 303 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVL 368 + + I + H ++ ++++ M++ + + + L ++ + Sbjct: 304 LGDDDIAIHDHCYAFRHSLNPKFISYYMQTDSFISEKAKYVARTKVNTLLINGFSKIMIP 363 Query: 369 VP-------PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 VP +KEQ I +++ + + E + + I L +++ Sbjct: 364 VPYPKDHEKSLKEQARIVEILDKFDTLTNSITEGLPREIELRQKQ 408 >gi|126175911|ref|YP_001052060.1| restriction modification system DNA specificity subunit [Shewanella baltica OS155] gi|125999116|gb|ABN63191.1| restriction modification system DNA specificity domain [Shewanella baltica OS155] Length = 427 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 58/412 (14%), Positives = 122/412 (29%), Gaps = 24/412 (5%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 + ++KD+ P+ W G+ + + + + G + Sbjct: 14 RFSEFKDA--------PE-WSPTTFGATATFINGKAYKQEELLENGKYRVLRVGNF-FTN 63 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 K+ + G +LY I I + K + + Sbjct: 64 KEWYFSDLELDENKYCDNGDLLYAWS-ASFGPRIWLGEKVIYHYHIWKVLEKKHIDKNFL 122 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTL 184 LL + + A G + H I N IP + EQ I + + Sbjct: 123 FILLDYETERMKAATANGLGLMHITKSSIENWKCCIPSSIEEQKKIANSLSSLDEL---- 178 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I+ + + LK K+ L+ + K++ E+ G ++K LV+ L Sbjct: 179 ISAHTQKFDTLKAYKKGLMQQLFPAEGETVPKLRFP--EFQGE-WRKTQLKKLGELVSGL 235 Query: 245 NRKNTKLIESNILSLSYGNIIQKLET-RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + +S +L L N+ + + ++ + + + +I+ + Sbjct: 236 TYSPADVRDSGLLVLRSSNVKNGIISLKDNVYVTPNVKGANLSKANDILICVRNGSKALI 295 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + T + ++ L ++ K A S+ Sbjct: 296 GKNALIPEGMPVCTHGAFMTVFRSPSAKFVFQLFQTNAYQKQVDADLGATINSINGRHFI 355 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + VP EQ I + + + ID L+ I LK + + Sbjct: 356 KYEFYVPESFEQQKIADCL----SSIDELITAQSHKIDALKVHKQGLMQQLF 403 >gi|298384313|ref|ZP_06993873.1| type I restriction-modification enzyme [Bacteroides sp. 1_1_14] gi|298262592|gb|EFI05456.1| type I restriction-modification enzyme [Bacteroides sp. 1_1_14] Length = 411 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 59/390 (15%), Positives = 130/390 (33%), Gaps = 20/390 (5%) Query: 20 AIPKHWKVVPIKRFTKL---NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP +W ++ + ++ + + LED+E T + K ++ Sbjct: 29 EIPDNWVWTTLEEISNYGDCYNVSVTDIADNEWILELEDLEKDTASIIQKLSKKERNIKG 88 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQ 135 F KG +LY KL YL K ++A G C+T+ + + + S Sbjct: 89 VRHKFKKGDVLYSKLRTYLNKVLVAPKAGYCTTEIIPFNSYCDISTHYLCHVLRSAYFLD 148 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + G M +P+PPL+EQ I +I ID + + + Sbjct: 149 YTQQCGYGVKMPRLSTNDACKGMVPLPPLSEQQRIVMEIDKWLALIDQIEQGKADLQNTI 208 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K+ K ++ + L P + I+ + + + + + + Sbjct: 209 KQTKSKILDLAIHGKLVPQDPNDEPAIKLLKRINPDFTPCDNGHYAQLPDS-WSAVPMQM 267 Query: 256 ILSLSYGNIIQKLETRNMGLKP-------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + L+ G +E N +K ++ + + V ++ + + Sbjct: 268 LCYLTDGEKQNGIERINHDVKYLRGERDAKTLTSGKYVAANSLLILVDGENSGEVFRTPI 327 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLP 366 + + + ++++ +L + + L + K + Sbjct: 328 DGYQGSTFKQLLINENMNE------EYVLQVINLHRKVLRESKVGSAIPHLNKKLFKAIE 381 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKI 396 V +PP EQ I IN +++++E + Sbjct: 382 VPIPPYNEQQRIVEAINKAFMSLNLIMESL 411 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 68/199 (34%), Gaps = 11/199 (5%) Query: 227 LVPDHWEVKPFFALV----TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 +PD+W + IL L + + K + + Sbjct: 29 EIPDNWVWTTLEEISNYGDCYNVSVTDIADNEWILELEDLEKDTASIIQKLSKKERNIKG 88 Query: 283 -YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY-LAWLMRSY 340 G++++ + +K + + G T+ + + ST+ L ++RS Sbjct: 89 VRHKFKKGDVLYSKLRTYLNKVLVAP----KAGYCTTEIIPFNSYCDISTHYLCHVLRSA 144 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 G G+ L D + V +PP+ EQ I I+ A ID + + Sbjct: 145 YFLDYTQQCGYGVKMPRLSTNDACKGMVPLPPLSEQQRIVMEIDKWLALIDQIEQGKADL 204 Query: 400 IVLLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 205 QNTIKQTKSKILDLAIHGK 223 >gi|292656398|ref|YP_003536295.1| type I restriction-modification system specificity subunit [Haloferax volcanii DS2] gi|291371824|gb|ADE04051.1| type I restriction-modification system specificity subunit [Haloferax volcanii DS2] Length = 410 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 60/404 (14%), Positives = 132/404 (32%), Gaps = 22/404 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W+ + ++ G + G ++ +S + T Sbjct: 5 DLPKGWRQYELGEICEIIMGNSPPGESYNDEGEGVRFLQGQNEFGENTPDSDRFTTEPSR 64 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRI 137 + G IL L AD + L+PK L ++ ++ Sbjct: 65 MSKNGDILVAIRATPLGIVNQADDEYCVGRGVAALRPKKKKLDGRYLYHYMKYCKESEYW 124 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G+T + N+ +P+PPL+EQ I +K+ + +D + + Sbjct: 125 RKVSTGSTYPSITKTNLQNLSVPLPPLSEQQKIADKLNSVIRGVDETREVSSDAKVIEEN 184 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 ++ +S ++ E G + ++ ++ + E Sbjct: 185 LLRSCISGLMP--------------EKEGSTCETVKLDTVCEVILGNSPPGESYNEEGEG 230 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + E + + + + + G+I+ + + + Sbjct: 231 MRFLQGQKEFGEKTPVSDRYTTDPSK-VGKEGDILIAIRATPL---GIINRSDDTYCLGR 286 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 +D YL + M+ + GS S+ +++ LP+ +P I +Q + Sbjct: 287 GVAGLRPEKNLDGGYLYYYMKIQHGYWEKISKGS-TYPSITKTNLQNLPIPLPKISKQQE 345 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ-ID 420 I + ARID + E E+ ++ S ++ A G+ ID Sbjct: 346 IAERLEYIEARIDDIHEASERMSDIIDVLPESVLSKAFQGELID 389 >gi|323182015|gb|EFZ67426.1| type I restriction enzyme specificity protein [Escherichia coli 1357] Length = 417 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 47/399 (11%), Positives = 121/399 (30%), Gaps = 32/399 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + + + + L G T K DI + ++D+ Sbjct: 14 EWLSLSKVFNLRNGYTPSKTKKEFWENGDIPWFRMDDIRENGRILGNSLQRISSCAVKGG 73 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---QGWLLSIDVTQ 135 +F + IL A+I + + +F L K+ + + + + Sbjct: 74 KLFPENSILISTSATIGEHALITVPH-LANQRFTCLALKESYVDCFDIKFLFYYCFSLAE 132 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITER 188 ++ + D G +P+P LA Q I + T L E Sbjct: 133 WCRKNTTMSSFASVDMDGFKKFLIPLPCPDNPEKSLAIQSEIVRILDKFTALTAELTAEL 192 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + L+S P + M G + +G + + Sbjct: 193 NMRKKQYNYYRDQLLS--FDNEDVPHLPM---GQKDIGEFIRGGTFQKKDFM-------- 239 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + Y + + + + G ++ ++ A Sbjct: 240 DAGVGCIHYGQIYTYYGTYAKKTKTHISATLAKKCKKAQKGNLIIATTSENDEDVCKAVA 299 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 + I S+ + H ++ Y+++ ++ +G + + +++ ++ + Sbjct: 300 WLGSEDIAVSSDACIYKHNLNPKYVSYFFQTEQFQNQKRQYITGAKVRRVNADNLSKILI 359 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 VP ++ Q I ++++ + + E + + I L +++ Sbjct: 360 PVPSMEIQERIVSILDKFDTLTNSITEGLPREIALRQKQ 398 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 54/202 (26%), Gaps = 15/202 (7%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES- 279 +EW+ +V T K +I +I + L+ S Sbjct: 12 EVEWL----SLSKVFNLRNGYTPSKTKKEFWENGDIPWFRMDDIRENGRILGNSLQRISS 67 Query: 280 --YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + ++ I+ + + + + A D +L + Sbjct: 68 CAVKGGKLFPENSILISTSATIGEHALITVPHLANQRFTCLALKESYVDCFDIKFLFYYC 127 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVETARID 390 S S+ + K+ + +P + Q +I +++ TA Sbjct: 128 FSLA-EWCRKNTTMSSFASVDMDGFKKFLIPLPCPDNPEKSLAIQSEIVRILDKFTALTA 186 Query: 391 VLVEKIEQSIVLLKERRSSFIA 412 L ++ R ++ Sbjct: 187 ELTAELNMRKKQYNYYRDQLLS 208 >gi|238917452|ref|YP_002930969.1| type I restriction enzyme [Eubacterium eligens ATCC 27750] gi|238872812|gb|ACR72522.1| type I restriction enzyme [Eubacterium eligens ATCC 27750] Length = 416 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 58/413 (14%), Positives = 146/413 (35%), Gaps = 19/413 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 IP W V+ + + ++ G + S + + +I + DV + Sbjct: 10 IPDDWSVITLGNYAQIFRGGSPRPIQAFLTTSDQGVNWIKIGDVGEEDKFIKSTEEKIVP 69 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S + +G ++ Y R I+ I ++ + V + LS Sbjct: 70 EGVSCSRMVFRGDLILSNSMSYGRPYIMNIEGCIHDGWLVIQKYDRVFDRDYLYYALSSG 129 Query: 133 VTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +T + A+ G+++ + + + + + +P P ++EQ I E + I L + Sbjct: 130 LTMKQYVAMAAGSSVQNLNKEKVSKVVLPCPRISEQKSIAEVLSDIDTLIIDLKKIIRKK 189 Query: 192 IELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++ + Q LV+ G + + + + ++ + + + A + + Sbjct: 190 KDIRQGTMQMLVTGKKRLSGFDGNW--RVTTLDRLCYIVTKQTGFDYSAEIKPSLVTTPQ 247 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + + + + E Y + E+ I + ++ Sbjct: 248 IGTIPFIQNKDFEAFDINYNTDFFIPYDVAEKYPRILLNEVCL-LISISGRIGNVAIFDN 306 Query: 311 MERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVL 368 + A +A +++ + S D + ++ G + +L DV++L + Sbjct: 307 EQTSFAGGAVGIAKLYEPELASWCMLYLSSKDGQEQIFSNEKVGAQHNLTVADVRKLEIK 366 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P E+ I V+ I+VL E+ + ++ + + +TG++ L Sbjct: 367 MPAKSEREAIIKVLTDMNDEIEVL----EEKLDKYQKIKQGMMDELLTGKVRL 415 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 64/171 (37%), Gaps = 8/171 (4%) Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + I G + +++ + PE ++V G+++ + + Sbjct: 46 NWIKIGDVGEEDKFIKSTEEKIVPEGVSCSRMVFRGDLILSNSMSYGRPYIMNIEGCIHD 105 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRS-YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 G + + D YL + + S + + Q+L E V ++ + P I Sbjct: 106 GWL---VIQKYDRVFDRDYLYYALSSGLTMKQYVAMAAGSSVQNLNKEKVSKVVLPCPRI 162 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 EQ I V++ ID L+ +++ I K+ R + VTG+ L G Sbjct: 163 SEQKSIAEVLSD----IDTLIIDLKKIIRKKKDIRQGTMQMLVTGKKRLSG 209 >gi|320526801|ref|ZP_08027991.1| type I restriction modification DNA specificity domain protein [Solobacterium moorei F0204] gi|320132769|gb|EFW25309.1| type I restriction modification DNA specificity domain protein [Solobacterium moorei F0204] Length = 395 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 50/403 (12%), Positives = 122/403 (30%), Gaps = 24/403 (5%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKY--LPKDGNSRQSDTSTVS 79 +L G T ++ K DI ++ ++D +G K + S+ Sbjct: 3 CKFSDVMELIGGGTPKTSKPEYWNGDIPWLSVKDFNNGFRYVYETEKSITQSGLENSSTK 62 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + +G ++ G + A + F + L+ ++ + + L ++ Sbjct: 63 LLQRGDVIVSARGTVGKIATVP-FPMAFNQSCYGLRARNGIVTSDYLYYLIKHNVSVLKK 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+ NI + IP + Q I + +++ L+++ Sbjct: 122 NTHGSVFDTITRNTFENIEVEIPSIEIQEKIASILGDYDKKME----LNNAINNNLEQQV 177 Query: 200 QALV-SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 QA+ S V D I + + + + I + Sbjct: 178 QAIFKSRFVDFEPFDKTMPSDWTIGTIDDLAKEVVCGKTPSTKKTKYYGSD--IPFITIP 235 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + R + + +I+ + I + + I + Sbjct: 236 DMHKTFYTVTTERYLSKLGADSQAKKILPKNSVCVSCIGTAGLVTLVAEESQTNQQINS- 294 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + G S Y+ LM++ +L ++PV++P + Sbjct: 295 ---IIPKDGFSSYYIYLLMQTLSDTINKLGQSGSTIVNLNKSQFGKIPVIIPTLSAMTKF 351 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + I + + ++ + L R + + ++G+ID+ Sbjct: 352 ----DETASPIFEKILQNQKENLNLASLRDTLLPKLMSGEIDV 390 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 64/191 (33%), Gaps = 9/191 (4%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNSRQS 73 P W + I ++ G+T + K DI +I + D+ ++ + + + Sbjct: 196 PSDWTIGTIDDLAKEVVCGKTPSTKKTKYYGSDIPFITIPDMHKTFYTVTTERYLSKLGA 255 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 D+ I K + +G + + + Q + PKD L+ Sbjct: 256 DSQAKKILPKNSVCVSCIG-TAGLVTLVAEESQTNQQINSIIPKDGFSSYYIYLLMQTLS 314 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G+T+ + + G IP+ IP L+ E +I E + Sbjct: 315 DTINKLGQSGSTIVNLNKSQFGKIPVIIPTLSAMTKFDETASPIFEKILQNQKENLNLAS 374 Query: 194 LLKEKKQALVS 204 L L+S Sbjct: 375 LRDTLLPKLMS 385 >gi|166368439|ref|YP_001660712.1| restriction modification system DNA specificity subunit [Microcystis aeruginosa NIES-843] gi|166090812|dbj|BAG05520.1| restriction modification system DNA specificity domain [Microcystis aeruginosa NIES-843] Length = 395 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 58/408 (14%), Positives = 131/408 (32%), Gaps = 27/408 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 K W V + ++ G + E + ++ + D + ++ Sbjct: 2 KDWPSVALGDIFEIARGGSPRPIQNFLTEEPDGVNWVMIGDASDSSKYITHTKKRILKTG 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE-LLQGWLLSIDV 133 + G L + I+ I ++ K V+ + L S + Sbjct: 62 VKNSRMVYPGDFLLTNSMSFGHPYIMKTSGCIHDGWLVLSNKKGVIDQDYFYHLLGSDLI 121 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G+T+ + + + + I + +PPL EQ I + E Sbjct: 122 YAEFSRLASGSTVKNLNIEIVKGIKVSLPPLEEQRRIAAILDKADGVRRKRKEAIRLTEE 181 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L L S + +P K ++ +G + +++ + I Sbjct: 182 L-------LKSTFLEMFGDPVTNPKGWEVKRLGEICTNFQNGIGKNSEHYGHGSKVANIS 234 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + L + + P+ E Y ++ + R + E Sbjct: 235 DLYEWHRFIPEKYSL----LDVTPKEIEKYSLMRGDLLFVRSSVKREGVAVCSVYDSDEI 290 Query: 314 GIITSAYMAVKP--HGIDSTYLAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVP 370 + +S + V+P I+ +L+ ++R+ + + + ++ + ++ V+VP Sbjct: 291 CLFSSFMIRVRPRTDLINPEFLSLMLRTPPMRNRLILGSNTSTITNISQPGLSKIEVVVP 350 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PIK Q I T I+ V Q++ + +S + A G+ Sbjct: 351 PIKTQNLI----TKVTKNIEESVRCHLQALEQSENLFNSLLQRAFRGE 394 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 69/203 (33%), Gaps = 18/203 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDG--NSRQSDT 75 PK W+V + ++ + + + D+ K + + Sbjct: 198 PKGWEVKRLGEICTNFQNGIGKNSEHYGHGSKVANISDLYEWHRFIPEKYSLLDVTPKEI 257 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQ--GWL 128 S+ G +L+ + + D + S+ + ++P+ L L Sbjct: 258 EKYSLMR-GDLLFVRSSVKREGVAVCSVYDSDEICLFSSFMIRVRPRTDLINPEFLSLML 316 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + R+ +T+++ G+ I + +PP+ Q LI + T I+ + Sbjct: 317 RTPPMRNRLILGSNTSTITNISQPGLSKIEVVVPPIKTQNLITKV----TKNIEESVRCH 372 Query: 189 IRFIELLKEKKQALVSYIVTKGL 211 ++ +E + +L+ L Sbjct: 373 LQALEQSENLFNSLLQRAFRGEL 395 >gi|91775569|ref|YP_545325.1| restriction modification system DNA specificity subunit [Methylobacillus flagellatus KT] gi|91709556|gb|ABE49484.1| restriction modification system DNA specificity domain [Methylobacillus flagellatus KT] Length = 427 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 54/409 (13%), Positives = 120/409 (29%), Gaps = 31/409 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W P+ + + +T + ++ ++ E + KD + Q + I Sbjct: 24 WSFQPLGKLARRSTKKNADGDITRVLTNSAEYGVIDQRDFFDKDI-ANQGNLEGYYIVEM 82 Query: 84 GQILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL---LSIDVTQR 136 G +Y P + G+ S + V + D + + ++ Sbjct: 83 GDYVYNPRVSVRAPVGPISKNRIGLGVMSPLYTVFRFGDKQNDFYAHYFKSTHWHHYMRQ 142 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + +P+P+ EQ I + + +D +I + ++ LK Sbjct: 143 ASSTGARHDRMSITNDDFMALPLPVSKPEEQQKIADCL----TSLDEVIAAENQKLDTLK 198 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 K+ L+ + + +++ G W KP + L + Sbjct: 199 TYKKGLMQQLFPREGETVPRLRFPEFRETGE----WCEKPLSKVCNVLQGYGFPEVLQGK 254 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 Y +R + K + + + + K + A++ E + Sbjct: 255 SEGKYPFCKVSDISRAVAEKGGVLDEATNYVGDDELLKLKAKPVPKGATVFAKIGEALRL 314 Query: 317 TSAYMAVKPHGIDSTYLA----------WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 K ID+ + + G S+ ++ + Sbjct: 315 NRRAYVQKACLIDNNATGLKAIDGIADDYFVYLLSQLIDLNRHCGGAVPSVNKTTLEEIE 374 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 V+VP + EQ I + + +D L+ Q I LK + + Sbjct: 375 VVVPGLDEQKRIAENL----SSLDDLITTQSQKIDALKNHKKGLMQQLF 419 >gi|213961929|ref|ZP_03390194.1| restriction modification system DNA specificity domain protein [Capnocytophaga sputigena Capno] gi|213955282|gb|EEB66599.1| restriction modification system DNA specificity domain protein [Capnocytophaga sputigena Capno] Length = 384 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 62/409 (15%), Positives = 139/409 (33%), Gaps = 43/409 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IPKHWK+ + +++G T DI ++ D+ + + + Sbjct: 5 IPKHWKIKKLGEIANISSGTTPFRKNPLFYDNADIPWVKTTDLNNSYITTTEEKVSMYAL 64 Query: 74 DTSTVSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLS 130 + +++ ++ +L G + + + + + L K+ + L+ Sbjct: 65 NNTSLRLYPTNTVLVAMYGGFNQIGRTGKLAMEATINQALSALVLKNDDVNSDYLLFWLN 124 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +V + + + K + + IPPL EQ I E ++ D I + Sbjct: 125 TNVEKWKRFAGSSRKDPNINGKDVAEFSILIPPLKEQEKIAEMLL----TCDKAIRLTTQ 180 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 I LK++ Q L ++T G W+ L+ N Sbjct: 181 IITQLKQRNQGLAQQLLTG-----------EKRVKGFENSVWKEVRLGELLDYEQPTNYL 229 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + ++ + ++ +T +G E + + F D D R + Sbjct: 230 VKNTDYSNEYKIPVLTAGKTFILGYTNEKEGICTNIP----LILFDDFTTDSRYI---DF 282 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + ++ + ++ L ++ + L K G + L + VP Sbjct: 283 PFKVKSSAVKLLKTKKNVN---LRFIFEAMKLIKY----AIGGHERHWISKYAFLTIFVP 335 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 KEQ I +++ + + EQ + LL+ ++ + + +TG++ Sbjct: 336 SFKEQNAIAQILDTAHQEL----KLYEQKLQLLQAQKKTLMQKLLTGEV 380 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 67/206 (32%), Gaps = 13/206 (6%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ---- 284 P HW++K + + + ++ + N + + Sbjct: 6 PKHWKIKKLGEIANISSGTTPFRKNPLFYDNADIPWVKTTDLNNSYITTTEEKVSMYALN 65 Query: 285 -----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + ++ N + + + +K ++S YL + + + Sbjct: 66 NTSLRLYPTNTVLVAMYGGFNQIGRTGKLAMEATINQALSALVLKNDDVNSDYLLFWLNT 125 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +A S ++ +DV +L+PP+KEQ I ++ D + Q Sbjct: 126 NVEKWKRFAGSSRKDPNINGKDVAEFSILIPPLKEQEKIAEMLLT----CDKAIRLTTQI 181 Query: 400 IVLLKERRSSFIAAAVTGQIDLRGES 425 I LK+R +TG+ ++G Sbjct: 182 ITQLKQRNQGLAQQLLTGEKRVKGFE 207 >gi|255658633|ref|ZP_05404042.1| putative phosphoribosylformylglycinamidine synthase [Mitsuokella multacida DSM 20544] gi|260849007|gb|EEX69014.1| putative phosphoribosylformylglycinamidine synthase [Mitsuokella multacida DSM 20544] Length = 489 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 64/424 (15%), Positives = 122/424 (28%), Gaps = 55/424 (12%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQS-- 73 IP++W ++ T T + + I ++ ++++ + + S Sbjct: 66 DIPENWVWTRLEEILLSLTDGTHKTPVYKNEGIPFLSVKNISNHKIDFSNIKYISIDEHK 125 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 KG IL K+G II I + L+ + + L L S Sbjct: 126 KLCERCYPKKGDILLSKVGTTGIPVIIDTEKEFSIFVSVALLKFSSSIDAKYLLFLLESP 185 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 V ++ G + I N +P+PPLAEQ I KI ID + + Sbjct: 186 LVQEQCRTHTRGIGNKNWVLTDIANTIVPLPPLAEQHRIVAKIEELQPDIDAYDKAQTKL 245 Query: 192 IELLKE----KKQALVSYIVTKGLNPDVK------------------------------- 216 + + K++L+ Y + L P K Sbjct: 246 QSIEQSFPDAMKKSLLQYAIEGKLVPQRKEEGTAKDLLAKIRAEKARLVKEKKIKKSKPL 305 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELN-----RKNTKLIESNILSLSYGNIIQKLETR 271 + E +PD WE L + R++ + I L G++ Sbjct: 306 PAITDDEKPFDIPDSWEWVRLGELGEWCSGATPSRQHPEYFGGKIPWLKTGDLNDGYIKE 365 Query: 272 NMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + +I G ++ K + A A + Sbjct: 366 VPEYITDDGFKNSSTKINPIGSVLIAMYGATIGKLGILKI----PATTNQACCACELVHE 421 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + + G + ++ + + +PP+ EQ+ I + Sbjct: 422 MYNKYLFYFLFANRKYFIKKGAGGAQPNISKAKITNTVMPLPPLAEQYRIVAKLEELLPL 481 Query: 389 IDVL 392 L Sbjct: 482 CQQL 485 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 74/213 (34%), Gaps = 17/213 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALV---TELNRKNTKLIESNILSLSYGNIIQ-KLETRNMGL 275 S + +P++W ++ T+ K I LS NI K++ N+ Sbjct: 59 SMDDLPFDIPENWVWTRLEEILLSLTDGTHKTPVYKNEGIPFLSVKNISNHKIDFSNIKY 118 Query: 276 KPESYET----YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 G+I+ + + E I S + ID+ Sbjct: 119 ISIDEHKKLCERCYPKKGDILLSKVGTTGIPVII--DTEKEFSIFVSVALLKFSSSIDAK 176 Query: 332 YLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 YL +L+ S + + G+ ++ D+ V +PP+ EQ I I ID Sbjct: 177 YLLFLLESPLVQEQCRTHTRGIGNKNWVLTDIANTIVPLPPLAEQHRIVAKIEELQPDID 236 Query: 391 VLVEKIEQSIVLLKE-----RRSSFIAAAVTGQ 418 +K + + +++ + S + A+ G+ Sbjct: 237 AY-DKAQTKLQSIEQSFPDAMKKSLLQYAIEGK 268 >gi|84616898|emb|CAJ13792.1| type I restriction-modification system, S subunit [Desulfococcus multivorans] Length = 575 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 74/510 (14%), Positives = 149/510 (29%), Gaps = 101/510 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDII---YIGLEDVE 57 +K K P K V + +P+ W+ V + D + + V Sbjct: 69 IKKPKPLPSIKPEEVPY--ELPQGWEWVRLGDICSYIQRGKGPKYVDFSTHRVVSQKCVR 126 Query: 58 SGTGKYLPKDGNSRQ--SDTSTVSIFAKGQILYGKLGP--YLRKAIIADF----DGICST 109 P + G +L+ G R ++ + + + Sbjct: 127 WYGLDLEPARYIDPASLEKYEPIRFLRVGDLLWNSTGTGTIGRACLVPQELEGVEVVADS 186 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQV 168 V++P +V P L W+ S V IE G T + + N MP+PP +EQ Sbjct: 187 HVTVVRPIEVRPLFLWRWIQSPIVQNAIEGSASGTTNQIELNTSTVINHLMPLPPPSEQH 246 Query: 169 LIREKIIAETVR-------------------------------------IDTLITERIRF 191 I +I R I E Sbjct: 247 RIVARIDQLMARCDELEKLRKEREEKRLAVHAAAIKQLLDAPNGSAWDFIQQNFGELYTV 306 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV----------------------- 228 E + E ++A++ V L P S E + + Sbjct: 307 KENVAELRKAILQLAVMGRLVPQNPNDPSASELLKEIEAEKQRLVKSKQLKIGQKTEDTK 366 Query: 229 --------PDHWEVKPFFALVTEL------NRKNTKLIESNILSLSYGNIIQKLETRNMG 274 P+ WE ++ K L + + + +++ Sbjct: 367 FICHDTAIPETWEWVKGLDILFITKLAGFEYTKYVNLQDEGEIPVIRAQNVRQFSIDTTN 426 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG--------IITSAYMAVKPH 326 LK +T ++++ + + + + + E+ + Sbjct: 427 LKYIDLKTSELLERCALTKPALLVTFIGAGIGDVALFEKNERWHLAPNVAKMEPFVGCES 486 Query: 327 GIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ++ YL + + S + + S + S+ ++ + +PP+ EQ I + I+ Sbjct: 487 KLNLRYLNYFLLSPLGRREIFKHLKSTAQPSISMGTIRDIDYPLPPLPEQHRIVDRIDHL 546 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 A D L +Q I E++++ + A + Sbjct: 547 MALCDTL----DQQIDSATEKQTALLNAVM 572 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 64/201 (31%), Gaps = 16/201 (7%) Query: 21 IPKHWKVVPIKRF--------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSR 71 IP+ W+ V + + +I I ++V + K + + Sbjct: 374 IPETWEWVKGLDILFITKLAGFEYTKYVNLQDEGEIPVIRAQNVRQFSIDTTNLKYIDLK 433 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD-------GICSTQFLVLQPKDVLPELL 124 S+ K +L +G + + + + + + V + L Sbjct: 434 TSELLERCALTKPALLVTFIGAGIGDVALFEKNERWHLAPNVAKMEPFVGCESKLNLRYL 493 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +LLS + I + I +I P+PPL EQ I ++I DTL Sbjct: 494 NYFLLSPLGRREIFKHLKSTAQPSISMGTIRDIDYPLPPLPEQHRIVDRIDHLMALCDTL 553 Query: 185 ITERIRFIELLKEKKQALVSY 205 + E A+++ Sbjct: 554 DQQIDSATEKQTALLNAVMAQ 574 >gi|154249203|ref|YP_001410028.1| restriction modification system DNA specificity subunit [Fervidobacterium nodosum Rt17-B1] gi|154153139|gb|ABS60371.1| restriction modification system DNA specificity domain [Fervidobacterium nodosum Rt17-B1] Length = 429 Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats. Identities = 57/423 (13%), Positives = 142/423 (33%), Gaps = 35/423 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY---LPKDGNSRQSDTSTVS 79 + WK V + ++ + + G + + + +R+ +++ Sbjct: 9 EGWKRVKLGEVLSISR---IPDNEKDPNKRITVRLWNKGVFAREVREVELNREKESTIYY 65 Query: 80 IFAKGQILYGKLGPYLRK--AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 GQ +YGK I + DG CST + ++ + + Sbjct: 66 KRKAGQFIYGKQNLVRGAFGVIPPELDGYCSTSDVPSFDVSKNLDVYYLDYTLRTLYKAF 125 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +G K + + +PP+ EQ I E + I+ ++ + + Sbjct: 126 SLYEKGTGSKRVHEKDFLSFEIFLPPIFEQQKIAEILKTVDRAIEKTGKIIEKYKRIKQG 185 Query: 198 KKQALVSYIVT---KGLNPDVKMKDSGIEW-----VGLVPDHWEVKPFFALVTELNRKNT 249 Q L++ V +G + ++++ I+ +G +P+ WEV ++ Sbjct: 186 LMQDLLTKGVVSEGEGESEKWRLRNEKIDKFKDSPLGRIPEEWEVVRLGETGRIVSGATP 245 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYET----------YQIVDPGEIVFRFIDLQ 299 + + + ++ S +I+ IV Sbjct: 246 DTSKPQFWNGDIVWVTPDDLSKQKKYIYTSQRKISKDGLNSCAAKIIPRDSIVLSSRAPI 305 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 +++ +G + + + + D + + + Y + K+ + Sbjct: 306 GYLSIVKTNYATNQGCKS---IILNKNYYDEDFFYYCLHRY-INKMISLGSGTTFNEISK 361 Query: 360 EDVKRLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +L V VP + EQ I +++ ++ID ++EK + L+ + + +TG+ Sbjct: 362 SQLAKLEVKVPCLLSEQHRIASIL----SQIDEVIEKEQAYKEKLERVKKGLMEDLLTGK 417 Query: 419 IDL 421 + + Sbjct: 418 VRV 420 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 77/211 (36%), Gaps = 15/211 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTG 61 ++KDS +G IP+ W+VV + ++ +G T ++ K DI+++ +D+ S Sbjct: 214 DKFKDSP---LGRIPEEWEVVRLGETGRIVSGATPDTSKPQFWNGDIVWVTPDDL-SKQK 269 Query: 62 KYLPKDGNSRQSDTST---VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 KY+ D I + I+ P +I+ + + Sbjct: 270 KYIYTSQRKISKDGLNSCAAKIIPRDSIVLSSRAPIGYLSIVKTNYA-TNQGCKSIILNK 328 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAE 177 + + ++ ++ G T + + + + +P L+EQ I + Sbjct: 329 NYYDEDFFYYCLHRYINKMISLGSGTTFNEISKSQLAKLEVKVPCLLSEQHRIASILSQI 388 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I+ + + + K + L++ V Sbjct: 389 DEVIEKEQAYKEKLERVKKGLMEDLLTGKVR 419 >gi|187779295|ref|ZP_02995768.1| hypothetical protein CLOSPO_02891 [Clostridium sporogenes ATCC 15579] gi|187772920|gb|EDU36722.1| hypothetical protein CLOSPO_02891 [Clostridium sporogenes ATCC 15579] Length = 408 Score = 121 bits (303), Expect = 2e-25, Method: Composition-based stats. Identities = 66/390 (16%), Positives = 140/390 (35%), Gaps = 27/390 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + TK +G T GK DI +I ++ S + + + ++S+ Sbjct: 20 WEQRKLGEVTKSYSGGTPSVGKSQYYDGDIPFIRSAEINSDS---TELYISEKGLNSSSA 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G ILY G + I+ +G + L +QP+ L + + Sbjct: 77 KKVKVGDILYALYGATSGEVGISRINGAINQAILAIQPEKGYNSQFIMQWLRGQKQKITD 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +G + + ++P+ P EQ I + IT R + L++K Sbjct: 137 KYLQG-GQGNLSGSIVKDLPIEFPSYDEQYKIGTYFNSLDQL----ITLHQRKLNHLQDK 191 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K++L+ + K +++ G + ++ TE N + + + Sbjct: 192 KKSLLQKMFPKNGEKFPELRFPG---FTDPWEQRKLGELLIPSTEKNNTGKYTQDDVLAA 248 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + +K + ES + Y+IV+ G++++ ++ + GI+ S Sbjct: 249 SLGTELTKKHIFFGLRSTEESIKNYRIVNKGDVIYTKSPIKGYPNGIIKTNKGIEGIVPS 308 Query: 319 AYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLP--VLVP-PI 372 Y ++ + + L Y + + G R ++ D+ L + VP I Sbjct: 309 LYCVYNSVSDVNSRIIQSYFEDKSRLDSYLYPLVNVGARNNVNITDLGFLEGNICVPQDI 368 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVL 402 EQ I + I ++ L+ ++ + Sbjct: 369 NEQNRIVDFIE----KLSNLITLHQRKLNH 394 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 61/167 (36%), Gaps = 8/167 (4%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 ++ + +I + I + K + + + V G+I++ + + + Sbjct: 40 GKSQYYDGDIPFIRSAEINSDSTELYISEKGLNSSSAKKVKVGDILYALYGATSGEVGIS 99 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 G I A +A++P ++ K+ G + +L VK LP Sbjct: 100 RI----NGAINQAILAIQPEKGYNSQFIMQWLRGQKQKITDKYLQGGQGNLSGSIVKDLP 155 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + P EQ+ I +D L+ ++ + L++++ S + Sbjct: 156 IEFPSYDEQYKIGTY----FNSLDQLITLHQRKLNHLQDKKKSLLQK 198 >gi|330971615|gb|EGH71681.1| restriction modification system DNA specificity domain-containing protein [Pseudomonas syringae pv. aceris str. M302273PT] Length = 233 Score = 121 bits (303), Expect = 2e-25, Method: Composition-based stats. Identities = 54/221 (24%), Positives = 89/221 (40%), Gaps = 15/221 (6%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRK----NTKLIESNILSLSYGNIIQKLETRN 272 M++SG+EW+G VP HW+V + K S + L ++ N Sbjct: 1 MEESGVEWLGEVPAHWQVCKLSFRYSVELGKMLDEKKNTGTSPLPYLRNQDVQWGSININ 60 Query: 273 ----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + ++ YE Y V G+++ R R A ++P Sbjct: 61 GLPLIDIESSEYERYT-VRLGDLLVCEGGDVGRAAIWRIKNS--RIGYQKALHRLRPESP 117 Query: 329 DSTYLAWLMRSYDLCKVF----YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + S K + L E ++ PPI++Q +I +V+ Sbjct: 118 SRDTAEFFFYSLMAAKALGVLEESDTKATISHLPAEKFRQYRFAFPPIEDQQEIASVLGE 177 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + R D ++ E I+LL+ERRS+ I+AAVTG+ID+RG Sbjct: 178 KLKRSDEIISYAENMIMLLRERRSALISAAVTGKIDVRGWQ 218 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 45/213 (21%), Positives = 84/213 (39%), Gaps = 10/213 (4%) Query: 10 YKDSGVQWIGAIPKHWKVVPIK-----RFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 ++SGV+W+G +P HW+V + K+ + + + Y+ +DV+ G+ Sbjct: 1 MEESGVEWLGEVPAHWQVCKLSFRYSVELGKMLDEKKNTGTSPLPYLRNQDVQWGSININ 60 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE-- 122 +S G +L + G R AI + Q + + + P Sbjct: 61 GLPLIDIESSEYERYTVRLGDLLVCEGGDVGRAAIWRIKNSRIGYQKALHRLRPESPSRD 120 Query: 123 ---LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L++ +E AT+SH + PP+ +Q I + + Sbjct: 121 TAEFFFYSLMAAKALGVLEESDTKATISHLPAEKFRQYRFAFPPIEDQQEIASVLGEKLK 180 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 R D +I+ I LL+E++ AL+S VT ++ Sbjct: 181 RSDEIISYAENMIMLLRERRSALISAAVTGKID 213 >gi|319788898|ref|YP_004090213.1| restriction modification system DNA specificity domain [Ruminococcus albus 7] gi|315450765|gb|ADU24327.1| restriction modification system DNA specificity domain [Ruminococcus albus 7] Length = 498 Score = 121 bits (303), Expect = 3e-25, Method: Composition-based stats. Identities = 53/443 (11%), Positives = 133/443 (30%), Gaps = 65/443 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ + + + G + + I +I + D E + Sbjct: 56 ELPEGWRWDRLGNVSIIARGGSPRPIESYITDDENGINWIKIGDTEKDGKYIFKTKEKIK 115 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S G L + R I+ I ++ V + + LS Sbjct: 116 PEGLSKSRYVESGDFLLTNSMSFGRPYILRTDGCIHDGWLVIGNIDTVFNQDFLYYALSS 175 Query: 132 DVTQRIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 D + + + G+T+ + + ++ PIPP+ EQ I EK+ + + + +++ Sbjct: 176 DFMYQTLSLLAAGSTVKNLKSDTVKSVLFPIPPMREQKRIAEKLDSLISFVIKIESDKTD 235 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVK---------------------------------- 216 ++ K ++ + L P Sbjct: 236 LQTTIQLTKSKILDLAIRGKLVPQNPDDEPASVLLERIRAEKEELIKQGKIKRDKKESVI 295 Query: 217 ---MKDSGIEWVG------------LVPDHWEVKPFFALVTELNRKNT-----KLIESNI 256 +S E +G +PD W + + + K + ++ Sbjct: 296 FRGDDNSYYETIGSETTNIDDKIPFDLPDGWSFERLCNIASFSGGKTPSTSKDEYWGNDY 355 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 ++ ++ K + E + + + + +L A + + I Sbjct: 356 FWITSKDMKSKYIDSSQISLSEKGAEIMQIIAPDTLLLVARSGILRHTLPVAILKRQATI 415 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKE 374 A+ + + + +++ F++ K + + +PP+ E Sbjct: 416 NQDIKAISIYNTSLVEFIYTFLKGMENSILLRYTKSGTTVENINFDEFKSIVIPIPPLNE 475 Query: 375 QFDITNVINVETARIDVLVEKIE 397 Q I + ++ + +D + E + Sbjct: 476 QKRIADKVSQLFSLLDSIAENVN 498 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 82/226 (36%), Gaps = 20/226 (8%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII- 265 + PD +K E +P+ W + + + IES I G Sbjct: 36 LHYEKFPDGSVKCIEDEIPFELPEGWRWDRLGNVSIIARGGSPRPIESYITDDENGINWI 95 Query: 266 ---------QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + + +KPE + V+ G+ + LR+ + G + Sbjct: 96 KIGDTEKDGKYIFKTKEKIKPEGLSKSRYVESGDFLLTNSMSFGRPYILRTDGCIHDGWL 155 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQ 375 + + +L + + S + + + +G ++LK + VK + +PP++EQ Sbjct: 156 ---VIGNIDTVFNQDFLYYALSSDFMYQTLSLLAAGSTVKNLKSDTVKSVLFPIPPMREQ 212 Query: 376 FDITNVINVETA---RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I ++ + +I+ ++ +I L +S + A+ G+ Sbjct: 213 KRIAEKLDSLISFVIKIESDKTDLQTTIQLT---KSKILDLAIRGK 255 >gi|242278888|ref|YP_002991017.1| restriction modification system DNA specificity domain protein [Desulfovibrio salexigens DSM 2638] gi|242121782|gb|ACS79478.1| restriction modification system DNA specificity domain protein [Desulfovibrio salexigens DSM 2638] Length = 387 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 58/407 (14%), Positives = 126/407 (30%), Gaps = 32/407 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +PK W + L G + ++ G++ + + + Sbjct: 3 LPKGWDKKTLGESCTLQRGFDLPKR-----LRVK------GEHPLISSSGCIDSHNEPKV 51 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G ++ G+ G I D +T V P + L ID+ Sbjct: 52 AGPG-VVTGRSGSIGSLFYIEDDFWPLNTTLYVKNYFGNDPRFIFYLLKHIDLK----RF 106 Query: 141 CEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 GA + + + + + IP + EQ I + ID I + + +E Sbjct: 107 ASGAGVPTLNRNNVHSESILIPSDSSEQKRIVGILDKAFASIDKAIANTEKNLANARELF 166 Query: 200 QALVSYIVTKGLNPDVKMKDS------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + LV + ++PD + +S ++ G + I+ Sbjct: 167 ERLV--ADSIFVDPDAQQWESKLVADLAVKEKGSMRTGPFGSQLLHKEFVDEGIAVLGID 224 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + + + + V PG+++ + + + Sbjct: 225 NAVKNEFSWGKHRFITDEKY-----EQLSRYTVHPGDVIITIMGTCGRCAVIPDDIPLAI 279 Query: 314 GIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPP 371 + + YL + + + G S L +K+LPV +P Sbjct: 280 NTKHLCCITLDHDICLPEYLHAYFLYHPTAISFLTSKAKGAIMSGLNMGIIKKLPVRLPS 339 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +KEQ DI ++ + + +Q + L++ + S + A G+ Sbjct: 340 LKEQKDIVGKVSEAKQNYLKMTQLYQQKLTNLQDLKQSILQKAFAGE 386 >gi|312886109|ref|ZP_07745730.1| restriction modification system DNA specificity domain protein [Mucilaginibacter paludis DSM 18603] gi|311301408|gb|EFQ78456.1| restriction modification system DNA specificity domain protein [Mucilaginibacter paludis DSM 18603] Length = 417 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 58/423 (13%), Positives = 138/423 (32%), Gaps = 41/423 (9%) Query: 20 AIPKHWKVVPIKRFTKLNT-GRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P++W+V +K K G E+ I I + +++ GT K + Sbjct: 14 EVPENWQVKKLKTIMKEGRLGGNYENAEANTGIPVIKMGNLDRGTIKIDKVQYLPKGESY 73 Query: 76 STVSIFAKGQILYGKLGPY--LRKAIIADF---DGICSTQFLVLQPKDV----LPELLQG 126 + + G +L+ + K + + + ++ L ++ + Sbjct: 74 NNKDVLTDGDLLFNTRNTLELVGKVAVWNNELPFAVYNSNLLRIKFDSTFVESNWFMNYA 133 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + Q +++ K + +I +PPL EQ +I + Sbjct: 134 FNSEYGLRQLKAIATGTTSVAAIYGKDLESIKFLLPPLPEQKVIASMFRIWDKAVRKTEQ 193 Query: 187 ERIRFIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 ++ + K Q L S KG K + E + P + P + + Sbjct: 194 LIVQKKQRKKWMMQQLFSGKKRLKGFGKANYKKVALDEIL--TPIRNPLIPEEKTLYQQI 251 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + L G K + ++P +V + ++ Sbjct: 252 GIRSHGKGIFHKELVSG-------------KDLGNKRVFWIEPNCLVINIVFAWEQ--AI 296 Query: 306 RSAQVMERGIITSAYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKF 359 +E G+I S + + Y+ + +S ++ G ++L Sbjct: 297 AKTTELEIGMIASHRFPMFKPTEGKLNLDYILYYFKSPRGKQLLVNASPGGAGRNKTLGQ 356 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + + +P ++EQ I+ V+ + ++ + L+E++ + +TG++ Sbjct: 357 NEFINQFISLPTLEEQTAISQVLQAADKE----ISLLKAKVEKLREQKKGLMQQLLTGRV 412 Query: 420 DLR 422 L+ Sbjct: 413 RLK 415 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 82/209 (39%), Gaps = 16/209 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLI---ESNILSLSYGNIIQKLETRNMGL---KPESY 280 VP++W+VK ++ E + I + GN+ + + K ESY Sbjct: 14 EVPENWQVKKLKTIMKEGRLGGNYENAEANTGIPVIKMGNLDRGTIKIDKVQYLPKGESY 73 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSL-RSAQVMERGIITSAYMAVKPHGI---DSTYLAWL 336 ++ G+++F + + + + S + +K + ++ + Sbjct: 74 NNKDVLTDGDLLFNTRNTLELVGKVAVWNNELPFAVYNSNLLRIKFDSTFVESNWFMNYA 133 Query: 337 MRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 S + A+ +G ++ +D++ + L+PP+ EQ I + D V Sbjct: 134 FNSEYGLRQLKAIATGTTSVAAIYGKDLESIKFLLPPLPEQKVIAS----MFRIWDKAVR 189 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 K EQ IV K+R+ + +G+ L+G Sbjct: 190 KTEQLIVQKKQRKKWMMQQLFSGKKRLKG 218 >gi|158520266|ref|YP_001528136.1| restriction modification system DNA specificity subunit [Desulfococcus oleovorans Hxd3] gi|158509092|gb|ABW66059.1| restriction modification system DNA specificity domain [Desulfococcus oleovorans Hxd3] Length = 393 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 70/424 (16%), Positives = 136/424 (32%), Gaps = 60/424 (14%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD 67 P YK + V G IP+ W+V P+ K G+ E + K++ + Sbjct: 19 PGYKQTEV---GVIPEDWEVKPLAFVVKYTNGKAHEQS----ITDSGNFVVVNSKFISTE 71 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD------GICSTQFLVLQPKDVLP 121 G R+ KG +L +AI F + + VL P + Sbjct: 72 GIIRKFAQMRFCPAEKGDVLMVMSDVPNGRAIAKCFWVDCEDTYTVNQRICVLNPCGIDG 131 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVR 180 +LL L +GA ++ + + + P+ IP AEQ I + Sbjct: 132 KLLYYKLDRNPF---YLTFDDGAKQTNLRKEDVLSCPLSIPNTEAEQRAIAAALSDVDAL 188 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 +D L + +L + Q L++ G K WE+K + Sbjct: 189 LDGLDRLIAKKRDLKQAAMQQLLT-----GQTRLPGFKG-----------EWEIKRLGDV 232 Query: 241 VTELNRKNTKLI---ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + + K+ + I + L+ G I + T I D ++ Sbjct: 233 LMVRHGKSQRGISVSDGKYPILASGGEIGRTNT-------------CIYDKPSVLIGRKG 279 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 + + + + ++ ++ A G SL Sbjct: 280 TIDS----PQYVDSPFWTVDTLFFTEISTEANAKFIFSKFSIIPWRTYNEASG---VPSL 332 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + ++ + + +P EQ I V++ A + +EQ ++ + + + +TG Sbjct: 333 NAKTIENIEIFLPSPTEQTAIAQVLSDMDAE----IAALEQRRNKTRDIKQAMMQELLTG 388 Query: 418 QIDL 421 + L Sbjct: 389 KTRL 392 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 83/203 (40%), Gaps = 11/203 (5%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 VG++P+ WEVKP +V N K + ++ + N K + ++ + + Sbjct: 25 EVGVIPEDWEVKPLAFVVKYTNGKAHEQSITDSGNFVVVN--SKFISTEGIIRKFAQMRF 82 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVM--ERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + G+++ D+ N + + V + + + P GID L + + Sbjct: 83 CPAEKGDVLMVMSDVPNGRAIAKCFWVDCEDTYTVNQRICVLNPCGIDGKLLYYKLDRNP 142 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSI 400 F + +L+ EDV P+ +P + EQ I ++ A +D L ++ I Sbjct: 143 FYLTFDDGAK--QTNLRKEDVLSCPLSIPNTEAEQRAIAAALSDVDALLDGL----DRLI 196 Query: 401 VLLKERRSSFIAAAVTGQIDLRG 423 ++ + + + +TGQ L G Sbjct: 197 AKKRDLKQAAMQQLLTGQTRLPG 219 >gi|308063357|gb|ADO05244.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori Sat464] Length = 422 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 59/409 (14%), Positives = 131/409 (32%), Gaps = 25/409 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + QF L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCDIALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKKNTNVSGFASMDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELKAR 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL----VPDHWEVKPFFALVTELNRK 247 + + + L+ + + D K+K L P E + + N+K Sbjct: 192 KKQYQYYQNMLLDFKDIHSNHKDAKIKTYPKRLKTLLQTLAPKGVEFRKLGEVCESTNKK 251 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 K+ E + + + G + + I + + Sbjct: 252 TLKISEVSEVKNKGMYPVINSGRDLYGYYHDFNNDGE-----NITIASRGEYAGFINYFN 306 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 ++ G+ Y + + + +L + +++ ++ + + G +L D++ L + Sbjct: 307 EKIFAGGLCYP-YKVKDTNELLTKFLYFYLKTNEIQIMENLVSRGSIPALNKADIETLTI 365 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +PP++ Q +I +++ A L+ I I K+ R + Sbjct: 366 PIPPLEIQQEIVKILDQFLALTTDLLAGIPAEIEARKKQYEYYREKLLT 414 >gi|260773572|ref|ZP_05882488.1| hsdS type I site-specific deoxyribonuclease [Vibrio metschnikovii CIP 69.14] gi|260612711|gb|EEX37914.1| hsdS type I site-specific deoxyribonuclease [Vibrio metschnikovii CIP 69.14] Length = 585 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 70/451 (15%), Positives = 147/451 (32%), Gaps = 62/451 (13%) Query: 24 HWKVVPIKRFTKLNTGRTSE---------SGKDIIYIGLEDVESGTGKYLP---KDGNSR 71 +W + I ++ G T + G I ++ D+ T KY+ +D + + Sbjct: 8 NWIELKIGEVAEVVAGGTPKAGNPDNFKTPGTGIAWLTPADLSGYTRKYISLGARDLSHQ 67 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++S+ I KG +L+ P IA D + F + + Sbjct: 68 GYNSSSAKILPKGSLLFSSRAPI-GYVAIAQNDISTNQGFKNFVFPCGVDSD-YAYYYLR 125 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + E++ G T +P +PPLAEQ I +K+ ++ T R Sbjct: 126 SIRDLAESLGTGTTFKEISGAVAKTLPFLLPPLAEQKAIADKLDLMLAQVATTKVRLERI 185 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW-VGLVPDHWEVKPF------------- 237 +LK +Q++++ V+ L + + W V +P++ + + Sbjct: 186 PNILKTFRQSILTAAVSGKLTGNWRASSLKSAWTVRELPENNKTRRGLPDSVALPDALKE 245 Query: 238 -------------------------FALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 + K+ + E + ++ + + Sbjct: 246 SRFPESWSILSVASLLRKGVIIDLKDGNHGSNHPKSLEFTEKGLPFITAAQMSDNGKIDY 305 Query: 273 MGLKPESYETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 G S + + + G ++++ A V+ + Y+ + Sbjct: 306 DGAPKVSGKPLEKLKVGFSEAEDVIYSHKGTIGKVGIADRASVLNP---QTTYIRLNQKY 362 Query: 328 IDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + + Y A +++S A R + L ++PPI EQ +I + Sbjct: 363 VLNQYYALMLKSNAFTSQVDAIKSQTTRDFVPITAHYSLFAIIPPIDEQVEIVRRVEELF 422 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 A D + +K+ + L+ S A G Sbjct: 423 ACADNIEQKVNMATELVNNLPQSIFTKAFRG 453 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 32/139 (23%), Positives = 66/139 (47%), Gaps = 7/139 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + +I+ G ++F ++ +G + P G+DS Y + +RS Sbjct: 72 SSAKILPKGSLLFSSRAPIGYVAIAQNDISTNQGFKNFVF----PCGVDSDYAYYYLRS- 126 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + ++G+G + + K LP L+PP+ EQ I + +++ A++ ++E+ Sbjct: 127 -IRDLAESLGTGTTFKEISGAVAKTLPFLLPPLAEQKAIADKLDLMLAQVATTKVRLERI 185 Query: 400 IVLLKERRSSFIAAAVTGQ 418 +LK R S + AAV+G+ Sbjct: 186 PNILKTFRQSILTAAVSGK 204 >gi|85859882|ref|YP_462084.1| type I restriction-modification system specificity subunit [Syntrophus aciditrophicus SB] gi|85722973|gb|ABC77916.1| type I restriction-modification system specificity subunit [Syntrophus aciditrophicus SB] Length = 404 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 63/413 (15%), Positives = 146/413 (35%), Gaps = 33/413 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIY------IGLEDV--ESGTGKYLPKDGNSRQ 72 IP WK++ + + TG+T + I D+ +S K + + + Sbjct: 8 IPSDWKMMTLGQVGVTVTGKTPSKDNPEDWGDLLSFITPTDIISDSKHLKTVARKLSGSG 67 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 +T I G ++ +G + K +I +D + + Q ++ D + +LL Sbjct: 68 INTLKKMIIPAGSVVVTCIGSDMGKVVINSYDSVTNQQINSIKVNDNNNKDFVYYLLKNS 127 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + G+TM + ++ P L EQ I + + +I+ L T+ Sbjct: 128 YSILRNHAIGGSTMPILNKSTFESLEFIFPSLTEQQAIAAALSSLDDKIELLRTQNKTLE 187 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKN 248 + + + G + K SG + +G +P++W + + +V + K Sbjct: 188 NITQTIFKHWFVDFEFPGKD-GNPYKSSGGKMIESALGKIPNNWRIGKYEDVVDVVTGKG 246 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 K N+ + +G E +T + + +++ + Sbjct: 247 MKKD----------NLRSNGLYKVLGANGEIGKTDEYLFDEDLILTGRVGTLGTIFISRG 296 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 +V I+ + KP ++ Y A+ + + + D+K + ++ Sbjct: 297 KV----WISDNVLISKPKSDENCYFAYF--QLRKLNLESLNRGSTQPLITQTDLKNVEII 350 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +PP +I + + + + + I L + R + + + G+I + Sbjct: 351 LPP----KEILFDWHCMASSLFTKIFNNDFQINTLSKIRDTLLPKLMKGEIRV 399 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 66/198 (33%), Gaps = 19/198 (9%) Query: 10 YKDSG---VQW-IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 YK SG ++ +G IP +W++ + + TG+ + G Y Sbjct: 211 YKSSGGKMIESALGKIPNNWRIGKYEDVVDVVTGKGMKKDN----------LRSNGLYKV 260 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 N T +F + IL G++G L I+ S L+ +PK Sbjct: 261 LGANGEIGKTDEY-LFDEDLILTGRVGT-LGTIFISRGKVWISDNVLISKPKSDENCYFA 318 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + L +E++ G+T + N+ + +PP + +I Sbjct: 319 YFQLR---KLNLESLNRGSTQPLITQTDLKNVEIILPPKEILFDWHCMASSLFTKIFNND 375 Query: 186 TERIRFIELLKEKKQALV 203 + ++ L+ Sbjct: 376 FQINTLSKIRDTLLPKLM 393 >gi|254470760|ref|ZP_05084163.1| restriction modification system DNA specificity domain protein [Pseudovibrio sp. JE062] gi|211959902|gb|EEA95099.1| restriction modification system DNA specificity domain protein [Pseudovibrio sp. JE062] Length = 400 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 53/425 (12%), Positives = 128/425 (30%), Gaps = 48/425 (11%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDV-ESGTGKYLPKDGNSRQSDT 75 +P+ W + G +S I + D G G + Sbjct: 2 VPEGWTETRLGEIVIHRKGYAFDSKDYDQAGRRIIRISDTTRDGIGNERVVCVPHEVAKD 61 Query: 76 STVSIFAKGQILYGKLGP--------YLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQ 125 G +L +G + + + + + + L P Sbjct: 62 LETYALDTGDVLLSTVGSRPHLLDSMVGKVVRVPEGVKGALLNQNLVRLDPISRDINREH 121 Query: 126 GWLLSID--VTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + D I + G + +PPL EQ I E + D Sbjct: 122 LFAVLKDKRFIYYISTLVRGNANQVSITLAELFQYKFSLPPLPEQKKIAEIL----GTWD 177 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I + ++ + +K+AL+ +++ ++K G W+ + Sbjct: 178 RAIEVAEKQLKNAEAQKRALMQHLLAG----THRLK-------GFEDSEWKTVKLGDVCE 226 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV--DPGEIVFRFIDLQN 300 L+ + ++ ++ N + ++ ++ + GE + Sbjct: 227 FLDGMRKPIKAADRATMQGQNPYYGATGIIDWVDAFIFDEPLLLLGEDGENILSRNLPHV 286 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + + + A++ + ++ + S D K + L + Sbjct: 287 FRI------EGKSWVNNHAHVLRPKSEVSHAFVCEFLESLDYRKY---NSGSAQPKLNKK 337 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 + +PVL+P +EQ I ++ + D V + + L+ + + + +TG+ Sbjct: 338 VCESIPVLLPCFEEQKAIGAILEIS----DQQVHNCKAKLNHLRTEKRALMQQLLTGKKR 393 Query: 421 LRGES 425 ++ E Sbjct: 394 VKVEE 398 >gi|194335862|ref|YP_002017656.1| restriction modification system DNA specificity domain [Pelodictyon phaeoclathratiforme BU-1] gi|194308339|gb|ACF43039.1| restriction modification system DNA specificity domain [Pelodictyon phaeoclathratiforme BU-1] Length = 411 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 63/419 (15%), Positives = 145/419 (34%), Gaps = 31/419 (7%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTS-ESGKDIIYIGLEDVESGTGKYL 64 +P+++D+G +W + + + + R E Y+ ++ Y Sbjct: 7 RFPEFRDAG-EWDRDV--------LGKVSVFVNERMPLEQLSLSNYVSTVNIL---PDYE 54 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PEL 123 + + + + F IL + PYL+K A +G S +V++ K+ + Sbjct: 55 GMVTAPKLPPSGSATRFKINDILISNIRPYLKKVWFASKEGGASNDVIVIRAKEKVGDRY 114 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L L + + + +G M D + P+ P EQ I + + ID Sbjct: 115 LSFMLKNDVFIEYVMKGAKGVKMPRGDIFLMQEYPLAYPSKPEQQKIADCL----SSIDD 170 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHW--EVKPFFALV 241 LIT + + ++ LK K+ L+ ++ K++ + G + ++ Sbjct: 171 LITAQTQKLDTLKTHKKGLMQHLFPAEGETLPKLRFPEFQDAGEWEEKHLGKICEIKGGK 230 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFID 297 + +++ + ++ + L S QI + ++ Sbjct: 231 RIPKGFSLTNEKTDYPYVRVSDMYMGGIDTSSVLYIPSEIEKQIRSYKISKNDLFITVAG 290 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQS 356 + + ++ +T + I YL + ++ + + + Sbjct: 291 TIGIVGEVP--EELDNANLTENANKIIVKSIAKKYLLHYLTGESAQQLISSSVTNNAQPK 348 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 L E ++ P+ VP +EQ I + + + ID L+ Q + LK + + + Sbjct: 349 LALERIRLFPIPVPSPEEQQKIADCL----SSIDDLIIAQTQKLATLKTHKKALMQQLF 403 >gi|317132744|ref|YP_004092058.1| N-6 DNA methylase [Ethanoligenens harbinense YUAN-3] gi|315470723|gb|ADU27327.1| N-6 DNA methylase [Ethanoligenens harbinense YUAN-3] Length = 689 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 106/214 (49%), Positives = 138/214 (64%), Gaps = 2/214 (0%) Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 + ID LI++ ++ E+L+ K+ L+ I T GL+ + K SGI+WVG +P WEV P Sbjct: 474 KDSNIDALISDFLQQAEMLETYKRQLIINITTHGLDTALSCKSSGIDWVGEIPCDWEVFP 533 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 A+ E N KNT+++ N+LSLSYG IIQK N GL P S+E YQIV+PG +V R Sbjct: 534 LRAIAHENNTKNTEMLSENLLSLSYGRIIQKDIETNTGLLPASFEGYQIVEPGYVVLRLT 593 Query: 297 DLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 DLQNDKRSLR+ V E GIITSAY + V I Y A+L+ +YDL KVFY +G G+R Sbjct: 594 DLQNDKRSLRTGYVKETGIITSAYLSLVVHDGRILPRYFAYLLHAYDLKKVFYTLGGGVR 653 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 QSLK+ D K LP+LVPPI Q I I + +R Sbjct: 654 QSLKYSDFKMLPILVPPIPTQEKIIAYIEDKISR 687 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 62/176 (35%), Gaps = 11/176 (6%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGN 69 K SG+ W+G IP W+V P++ N + +E ++++ + + + Sbjct: 515 KSSGIDWVGEIPCDWEVFPLRAIAHENNTKNTEMLSENLLSLSYGRIIQKDI---ETNTG 571 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRK----AIIADFDGICSTQF--LVLQPKDVLPEL 123 + I G ++ K GI ++ + LV+ +LP Sbjct: 572 LLPASFEGYQIVEPGYVVLRLTDLQNDKRSLRTGYVKETGIITSAYLSLVVHDGRILPRY 631 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L + D+ + + G + +P+ +PP+ Q I I + Sbjct: 632 FAYLLHAYDLKKVFYTL-GGGVRQSLKYSDFKMLPILVPPIPTQEKIIAYIEDKIS 686 >gi|187927548|ref|YP_001898035.1| restriction modification system DNA specificity domain [Ralstonia pickettii 12J] gi|187724438|gb|ACD25603.1| restriction modification system DNA specificity domain [Ralstonia pickettii 12J] Length = 435 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 61/423 (14%), Positives = 127/423 (30%), Gaps = 32/423 (7%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYL 64 +P+++D+G +WK +++ K + SE K ++ E Y Sbjct: 24 RFPEFQDAG---------NWKTEALRKLAKRCAKKNSEGEHKRVLTNSAEYGVIDQRDYF 74 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKDVL 120 KD + Q + I KG +Y P + G+ S + V + Sbjct: 75 DKDI-ANQGNLEGYYIVEKGDYVYNPRISASAPVGPISKNNVGTGVMSPLYTVFRFISSE 133 Query: 121 PELLQGWLLSIDVTQRIEAICEG---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + S + N+P+P+ EQ I + + + Sbjct: 134 NDFFAHYFKSPHWHHYMRQASSTGARHDRMSITNDDFMNMPLPVSVPKEQQKIADSLSSL 193 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG-LVPDHWEVKP 236 I R + LK K L+ + + +++ G + Sbjct: 194 DEL----IMAENRKLGTLKVYKNGLMQQLFPREGETVPRLRLPGFRRDPQWISATLGDIA 249 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVF 293 R N +I ++ I + + + +I G ++ Sbjct: 250 NVQSGGTPARTNPAYWNGDIPWVTTSLIDSSTILKADEYITKAGLEESSAKIFPKGTLLM 309 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 Q R S ++ + + ST + + ++ SG Sbjct: 310 AMYG-QGRTRGRVSVLGIDAATNQACAAIILKRRGISTDFVFQNLASRYEEIRKISNSGG 368 Query: 354 RQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +++L ++ + P + EQ I N + + ID L+ Q I L+ + + Sbjct: 369 QENLSAGLIEGISFSFPDNESEQEYIANTL----SSIDGLITTQRQKIDALEIHKKGLMQ 424 Query: 413 AAV 415 Sbjct: 425 QLF 427 >gi|317178793|dbj|BAJ56581.1| anti-codon nuclease masking agent [Helicobacter pylori F30] Length = 431 Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats. Identities = 51/425 (12%), Positives = 134/425 (31%), Gaps = 39/425 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + ++ ++ G T I + ++D+ + Sbjct: 2 EFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMDDIRENGRILKDSIQHITPKALKGK 61 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSIDVTQ 135 +F K I+ A++ D + + +F L K ++ + + + Sbjct: 62 KLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCNIALDMKFFFYQCFLLGE 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + D PIPPL Q I + + A T L TE ++ Sbjct: 121 WCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTELKAR 180 Query: 196 KEKKQALVSYIVTKG----------LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 K++ Q + ++ + K L P E + + + Sbjct: 181 KKQYQYYQNMLLDFKGIHSNHKDAKMGAKPYPKRLQTLLQTLAPKGVEFRKLGDIGEFYS 240 Query: 246 R-------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID- 297 ++ + + ++ N Q ++ E + G+++F Sbjct: 241 GLVGKSKKSFSQGNKFYVPYVNVFNNPQLDLNALESVQIGDKEKQNTIQLGDVLFTGSSE 300 Query: 298 -----LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + + + + ++L +R Y+ K + +G Sbjct: 301 NLEDCAMSCVVTQKIEKDIYLNSFCFGFRFFDENLFNPSFLKHFLRDYNFRKNISKVANG 360 Query: 353 -LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RR 407 R ++ + + ++ + +PP++ Q +I +++ + L+ I I K+ R Sbjct: 361 VTRFNVSKQLLSKITIPIPPLEIQQEIVKILDQFSTLTTDLLAGIPAEIEARKKQYEYYR 420 Query: 408 SSFIA 412 ++ Sbjct: 421 EKLLS 425 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 56/174 (32%), Gaps = 15/174 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK + + + +G +S K + Y+ +V + L + + D Sbjct: 224 PKGVEFRKLGDIGEFYSGLVGKSKKSFSQGNKFYVPYVNVFNNPQLDLNALESVQIGDKE 283 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD----------FDGICSTQFLVLQPKDVLPELLQG 126 + G +L+ L ++ + F P L+ Sbjct: 284 KQNTIQLGDVLFTGSSENLEDCAMSCVVTQKIEKDIYLNSFCFGFRFFDENLFNPSFLKH 343 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 +L + + I + G T + + + I +PIPPL Q I + + + Sbjct: 344 FLRDYNFRKNISKVANGVTRFNVSKQLLSKITIPIPPLEIQQEIVKILDQFSTL 397 >gi|222444442|ref|ZP_03606957.1| hypothetical protein METSMIALI_00053 [Methanobrevibacter smithii DSM 2375] gi|222434007|gb|EEE41172.1| hypothetical protein METSMIALI_00053 [Methanobrevibacter smithii DSM 2375] Length = 245 Score = 120 bits (301), Expect = 3e-25, Method: Composition-based stats. Identities = 65/234 (27%), Positives = 104/234 (44%), Gaps = 3/234 (1%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 ++KDS V++IG IPK WK++ K + + L K + ++ Sbjct: 7 EFKDSKVEYIGKIPKSWKIIRNKHIFNKTKVIAGPNWDKYNILSLTK-NGVIIKDIERNE 65 Query: 69 NSRQSDTSTVSIFAKGQILYG--KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 SD S I G +L + R + +GI S + L P + Sbjct: 66 GKMPSDFSIYQIVNPGNLLMCLLDIDVTPRCVGYIENNGIVSAAYTELSPIADINMKYYY 125 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 W + + + + + + PPL EQ+ I + +T +ID I Sbjct: 126 WWYLMLDIDKQLLHLSKNLRNSLSTEDFMALSVVKPPLDEQIQIANYLNKKTAKIDETIA 185 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 + I+LL+EK+ AL++++VTKGL+PDV MKDSGIEW+G +P+HWE Sbjct: 186 KNKELIDLLEEKRIALINHVVTKGLDPDVPMKDSGIEWIGNIPEHWETIKLKNC 239 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 64/205 (31%), Positives = 103/205 (50%), Gaps = 4/205 (1%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELN-RKNTKLIESNILSLSYGNIIQKLETRN 272 + + KDS +E++G +P W++ + + + NILSL+ +I K RN Sbjct: 5 NNEFKDSKVEYIGKIPKSWKIIRNKHIFNKTKVIAGPNWDKYNILSLTKNGVIIKDIERN 64 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDST 331 G P + YQIV+PG ++ +D+ R + + GI+++AY + P I+ Sbjct: 65 EGKMPSDFSIYQIVNPGNLLMCLLDIDVTPRCVG--YIENNGIVSAAYTELSPIADINMK 122 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 Y W D+ K + LR SL ED L V+ PP+ EQ I N +N +TA+ID Sbjct: 123 YYYWWYLMLDIDKQLLHLSKNLRNSLSTEDFMALSVVKPPLDEQIQIANYLNKKTAKIDE 182 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVT 416 + K ++ I LL+E+R + I VT Sbjct: 183 TIAKNKELIDLLEEKRIALINHVVT 207 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 13/26 (50%), Positives = 19/26 (73%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK 35 KDSG++WIG IP+HW+ + +K T Sbjct: 216 MKDSGIEWIGNIPEHWETIKLKNCTT 241 >gi|320352779|ref|YP_004194118.1| restriction modification system DNA specificity domain-containing protein [Desulfobulbus propionicus DSM 2032] gi|320121281|gb|ADW16827.1| restriction modification system DNA specificity domain protein [Desulfobulbus propionicus DSM 2032] Length = 521 Score = 120 bits (301), Expect = 4e-25, Method: Composition-based stats. Identities = 62/438 (14%), Positives = 124/438 (28%), Gaps = 43/438 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKY------LPKDGNSRQS 73 IP+HW + +G T G +Y G+ ++SG + + Sbjct: 83 IPEHWAWTRLGEIGDWGSGSTPSRGNPELYDGGITWLKSGELNDNQSLAGSEETVSELAL 142 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 +T + G IL G + K I + + D + ++ + Sbjct: 143 NTCSFRRNEPGDILLAMYGATIGKVAILAESAVTNQAVCGCTVFDGVLN-RYLFIFLLSQ 201 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 R + EG + I P P+PPLAEQ I K+ D L ++ Sbjct: 202 RSRFHSASEGGAQPNISKVKIVGFPFPLPPLAEQKRIVAKVDELMALCDQLEAQQQERQA 261 Query: 194 LLKEKKQALVSYI--------VTKGLNPDVKMKDSG--------------IEWVGLVPDH 231 +A ++ + +P + + + Sbjct: 262 QHAVLVKASLARFTQAPTPDNLQFLFHPSYTVSPADLRKTILTLAVQGKLVPQESEPLLG 321 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYG------NIIQKLETRNMGLKPESYETYQI 285 K S + L + E P + Sbjct: 322 SLESILAEASVNGVSKGPTADPSAVEVLRISAGTSREDFYVNEEDFKHVDLPANEVKKFQ 381 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-----AYMAVKPHGIDSTYLAWLMRSY 340 + PG+++ + S E I + Y+ + M + Sbjct: 382 LAPGDLLACRFNGNLHFVGRFSLYRGESRRIQVNPDKLIRFRINTDLHSPRYVCYAMNAA 441 Query: 341 DLCKVFYAMGSGLRQSL--KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + AM + ++ +K + + +PP+ EQ I ++ A +D L ++ Sbjct: 442 PTREAIEAMCATTAGNIGLSAGRLKTVEIPLPPLAEQRRIVAKVDELMALVDDLETQLAA 501 Query: 399 SIVLLKERRSSFIAAAVT 416 S + ++ + T Sbjct: 502 SRTVAHNLLAALVRELTT 519 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 77/247 (31%), Gaps = 46/247 (18%) Query: 221 GIEWVGLVPDHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYGNIIQK----LETR 271 E L+P+HW + +R N +L + I L G + Sbjct: 76 EEELPFLIPEHWAWTRLGEIGDWGSGSTPSRGNPELYDGGITWLKSGELNDNQSLAGSEE 135 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + +++ +PG+I+ K ++ + E + A Sbjct: 136 TVSELALNTCSFRRNEPGDILLAMYGATIGKVAI----LAESAVTNQAVCGCTVFDGVLN 191 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 ++ + A G + ++ + P +PP+ EQ I ++ A D Sbjct: 192 RYLFIFLLSQRSRFHSASEGGAQPNISKVKIVGFPFPLPPLAEQKRIVAKVDELMALCDQ 251 Query: 392 LVE-----------KIEQSI---------VLLK------------ERRSSFIAAAVTGQI 419 L ++ S+ L+ + R + + AV G++ Sbjct: 252 LEAQQQERQAQHAVLVKASLARFTQAPTPDNLQFLFHPSYTVSPADLRKTILTLAVQGKL 311 Query: 420 DLRGESQ 426 + ES+ Sbjct: 312 -VPQESE 317 >gi|295136493|ref|YP_003587169.1| restriction modification system DNA specificity subunit [Zunongwangia profunda SM-A87] gi|294984508|gb|ADF54973.1| restriction modification system DNA specificity subunit [Zunongwangia profunda SM-A87] Length = 405 Score = 120 bits (301), Expect = 4e-25, Method: Composition-based stats. Identities = 73/415 (17%), Positives = 141/415 (33%), Gaps = 38/415 (9%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPK--DGNSR 71 + IP++W+VV + +L G + K I + ++ + + Sbjct: 5 LNRIPENWEVVDFRNVAELKHGYQFRNYDFTDKGIKIFKITQIKGDGIADISSCSYIDIN 64 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-----KDVLPELLQG 126 + D I KG IL G + K +FD I + V + E Sbjct: 65 RIDEFKRVILNKGDILIALTGATIGKVARFNFDEIVLQNYRVGNFIPLNENILNKEYFFQ 124 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L S +I A + + + I N+ + +PPL EQ I + ID I Sbjct: 125 FLKSDFFFNQILANQTQSAQQNIGKEDINNMSVVLPPLPEQKAIANIL----SAIDAKIE 180 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + L+E AL P K E +GL+P+ W V L Sbjct: 181 NNLAINKTLEEMAMALYKEWFVD-FGPFQDGKFIESE-LGLIPEGWVVANLEDLFVLQRG 238 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + ++ K +Y V+ + + + + Sbjct: 239 FDLPKKKRIEGNVPIYAASGKS----------TYHNEYKVEAPGVTTGRSGVLGNVYFVS 288 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + + ++ + + +++++ DL + +L DV R+ Sbjct: 289 E----DFWPLNTSLWIKEYRSSTPYHAFFVLKNIDLKEF---NSGSAVPTLNRNDVHRIK 341 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 V+ P I N+ +V+ A+I +E Q L + R + + V+G++ L Sbjct: 342 VVKPEKS----IINLFSVQIAKIFRKIEMNTQQKQTLTQLRDTLLPKLVSGEVRL 392 >gi|293379345|ref|ZP_06625490.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium PC4.1] gi|292642037|gb|EFF60202.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium PC4.1] Length = 424 Score = 120 bits (301), Expect = 4e-25, Method: Composition-based stats. Identities = 59/414 (14%), Positives = 133/414 (32%), Gaps = 33/414 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDVESGTGKYLP-KDGNSRQS 73 W+ + K+ G + D +++ +V K+ K + Sbjct: 17 DWEQRKLGDSIKVMDGDRGSNYPHESDFIENGDTLFLDTGNVTKTGFKFDSVKYITKEKD 76 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL--------PELLQ 125 + K ++ G + + + +L L Sbjct: 77 EQLRAGKLEKNDLVLTSRGTLGNIGFYDELIYKLHPKVRINSAMLILRNTDEQLSYSYLH 136 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L ++ + G+ H + + +P E+ +KI ++D I Sbjct: 137 TLLKGRLISDFMRKNQVGSAQPHITKSEFLKLNLNVPYDIEEQ---KKIGTFFKQLDDTI 193 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 R ++LLKE K+ + + K +++ G + G + Sbjct: 194 ALHQRKLDLLKETKKGFLQKMFPKNGAKVPEIRFPG--FTGDWEERKLGGIGKTYTGLTG 251 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRS 304 + + ++Y N+ Q + L+ + Q V G++ F ++ Sbjct: 252 KSKEDFGHGDAKFVTYMNVFQNPKATLEQLENVEIDPRQNEVKKGDVFFTTSSETPEEVG 311 Query: 305 LRSAQVMERGII---TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + S + I + + D YLA+++RS + K + G+ R ++ Sbjct: 312 MSSVWTHDINNIYLNSFTFAYRPTIKFDLDYLAFMLRSQSVRKKIIYLAQGISRYNISKT 371 Query: 361 DVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + V +P +EQ I ++D + ++ + LLKE + F+ Sbjct: 372 KMMDISVPIPVNFEEQQKIGAF----FKQLDDTIALHQRKLDLLKETKKGFLQK 421 >gi|317494153|ref|ZP_07952569.1| type I restriction modification DNA specificity domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] gi|316917926|gb|EFV39269.1| type I restriction modification DNA specificity domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] Length = 396 Score = 120 bits (301), Expect = 4e-25, Method: Composition-based stats. Identities = 71/399 (17%), Positives = 147/399 (36%), Gaps = 28/399 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ R + + + S KD+ + ++ + G+ L +Q F Sbjct: 14 EWENDLFGRIVTNKSSKYNPSTESKDLPCLEMDSISQEDGRILHIYSAKQQVSIKNK--F 71 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G +L+GKL PYL+K I+A FDG CS++ VL + L ++ + + Sbjct: 72 SAGDVLFGKLRPYLKKYILAPFDGACSSEIWVLNGLTINNSFLFCYIQTKKFIEAANKS- 130 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ M ADW I + M P EQ I E + + +I L + + K Q Sbjct: 131 SGSKMPRADWSVISSEMMFFPLKEEQNKIAEFLSSVDEKIMLLNKQYDLLCQYKKGMMQK 190 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + S + + + W + + + RKN + + + Sbjct: 191 IFSQELRFKDDNENSFPQ------------WSILQLKDIAIRVTRKNKENNNTILTISGK 238 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND-KRSLRSAQVMERGIITSAY 320 ++ ++ N + ++ Y ++ GE + Q +++ E+G++++ Y Sbjct: 239 DGLVDQMTYFNKQIASKNVTGYFLIKKGEFAYNKSYSQGYPMGAIKMLSNYEKGVVSTLY 298 Query: 321 MAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVPPIKE 374 + K + S S + + G R ++ D + + VP + E Sbjct: 299 ICFKLNDEQSCGFYQHYFESGLQNRAIEKVAQEGARNHGLLNIGVNDFFDIELQVPSLAE 358 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I + ++ I+ + + +LK + + Sbjct: 359 QDKIAHFLSA----IEDKIAIKRAELDMLKNWKQGLLQQ 393 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 67/197 (34%), Gaps = 10/197 (5%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES--YETYQIVDPG 289 WE F +VT + K ES L + I + + R + + G Sbjct: 15 WENDLFGRIVTNKSSKYNPSTESKDLPCLEMDSISQEDGRILHIYSAKQQVSIKNKFSAG 74 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +++F + K L G +S + I++++L +++ + Sbjct: 75 DVLFGKLRPYLKKYILAPFD----GACSSEIWVLNGLTINNSFLFCYIQTKKFIEAANKS 130 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + + + P +EQ I ++ +I +L + LL + + Sbjct: 131 SGSKMPRADWSVISSEMMFFPLKEEQNKIAEFLSSVDEKIMLL----NKQYDLLCQYKKG 186 Query: 410 FIAAAVTGQIDLRGESQ 426 + + ++ + +++ Sbjct: 187 MMQKIFSQELRFKDDNE 203 >gi|256958290|ref|ZP_05562461.1| type I restriction-modification system specificity subunit [Enterococcus faecalis DS5] gi|256948786|gb|EEU65418.1| type I restriction-modification system specificity subunit [Enterococcus faecalis DS5] Length = 404 Score = 120 bits (301), Expect = 4e-25, Method: Composition-based stats. Identities = 64/403 (15%), Positives = 144/403 (35%), Gaps = 29/403 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W++ + R + T + E + + + E + + + + D S + Sbjct: 14 EDWELCKLGRVVERVTRKNKELKSTLP-LTISAQEGLIDQNVFFNKSVASRDVSGYYLIY 72 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLS-IDVTQR 136 G+ Y K G+ ST +++ +PK++ L+ + + + Sbjct: 73 NGEFAYNKSYSNGYPWGAIKRLNRYDMGVLSTLYIIFKPKNIDSNFLEKYYDTSCWYHEV 132 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + EGA + + + V + KI ++D IT R +E LK Sbjct: 133 SKHAAEGARNHGLLNIAASDFLRTELTVPKSVEEQRKIGNFLKQLDDTITLHQRKLEQLK 192 Query: 197 EKKQALVSYIVT--KGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTK 250 E K+A + + P V+ EW +G + + + ++ Sbjct: 193 ELKKAYLQVMFPAKDERVPKVRFAAFEGEWAHRKLGEITESFS-------GGTPTAGKSE 245 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 +I + G I + + + ++V G+I++ + + + Sbjct: 246 YYGGDIPFIRSGEISSDSTELFITENGLNSSSAKMVKVGDILYALYGATSGEVGISKI-- 303 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 G I A +A++P D++YL + G + +L VK L +++P Sbjct: 304 --TGAINQAILAIRPSKNDNSYLIIQWLRKQKNTIISTYLQGGQGNLSSSIVKNLIIMLP 361 Query: 371 -PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +EQ + R+D ++ + + LK+ ++S++ Sbjct: 362 QNKEEQEKVGIF----FKRLDDIITLHQNKLEQLKDLKTSYLQ 400 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 57/186 (30%), Gaps = 9/186 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + T+ +G T +GK DI +I ++ S + + ++S+ Sbjct: 221 EWAHRKLGEITESFSGGTPTAGKSEYYGGDIPFIRSGEISSDSTELF---ITENGLNSSS 277 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G ILY G + I+ G + L ++P L L I Sbjct: 278 AKMVKVGDILYALYGATSGEVGISKITGAINQAILAIRPSKNDNSYLIIQWLRKQKNTII 337 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + I M EQ + I + + +L Sbjct: 338 STYLQGGQGNLSSSIVKNLIIMLPQNKEEQEKVGIFFKRLDDIITLHQNKLEQLKDLKTS 397 Query: 198 KKQALV 203 Q + Sbjct: 398 YLQNMF 403 >gi|154174911|ref|YP_001408734.1| putative type I restriction-modification system, S subunit [Campylobacter curvus 525.92] gi|153793147|gb|EAT99394.2| putative type I restriction-modification system, S subunit [Campylobacter curvus 525.92] Length = 528 Score = 120 bits (301), Expect = 4e-25, Method: Composition-based stats. Identities = 59/448 (13%), Positives = 128/448 (28%), Gaps = 73/448 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP+ W V + + L T T ++ I ++ ++++ G + S++ Sbjct: 84 EIPQSWSWVRLGEISSLITDGTHKTPTYVSNGIPFLTIQNISKGFFDFSTIKYISKEEHK 143 Query: 76 STVSIFAK--GQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 IL+ ++G DF+ S + L + +++ S Sbjct: 144 CLCKRVRPQQNDILFCRIGTLGEAIKCTLNFDFNIFVSLGLIRLHDARFVDYVVKFINSS 203 Query: 131 IDVTQRIEAICEGATM-SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + G T + + +IP+P+PPL+EQ I +K+ I+ ++ Sbjct: 204 VMQKWIEQNKVGGGTHTFKINLGSMYSIPLPLPPLSEQKRIVDKLEEILQLIEKYKEDKE 263 Query: 190 RFIELLKEKK----QALVSYIVTKGLNPDV------------------------------ 215 + EL ++++ Y V L Sbjct: 264 KLDELNLSFPSKLKKSILDYAVKGKLVEQNLEDESVEILLQKIGQEKQRLVKDKKLKADK 323 Query: 216 --------------------KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT------ 249 + + E +P W ++ Sbjct: 324 FPQSTIFIGEDNSPYEKIGKETRCIEDEIPFEIPSSWAWVRLGSMGVAQTGSTPSTQVRD 383 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + ++ N L + E ++ + G I+ I K Sbjct: 384 FYGDYMPFIKPADITNSGIDYNNEKLSKKGTEVGRVAEKGSILMVCIGGSLGKCYFNDRI 443 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 V I S +L+ S+ ++ + + + + + Sbjct: 444 VSFNQQINS---LTPFFSSYKFIFYYLLSSHFFEQLQDRATGTATPIVNKTSWESILIPL 500 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIE 397 PP+ EQ I I +D+L ++ Sbjct: 501 PPLPEQKRIVTKIEELLKFVDILQSSLK 528 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 40/262 (15%), Positives = 86/262 (32%), Gaps = 25/262 (9%) Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK------DSGIEWVGLVPDHW 232 ++ I + + K+ K + ++ KG + K E +P W Sbjct: 30 SKLVEQIRKEKDRLIKDKKIKPSKFDSVIFKGEDNLHYEKIGEETRCIEDEIPFEIPQSW 89 Query: 233 EVKPFFAL---VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP- 288 + +T+ K + + I L+ NI + + E + Sbjct: 90 SWVRLGEISSLITDGTHKTPTYVSNGIPFLTIQNISKGFFDFSTIKYISKEEHKCLCKRV 149 Query: 289 ----GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 +I+F I + +++ + I S + Y+ + S + K Sbjct: 150 RPQQNDILFCRIGTLGE--AIKCTLNFDFNIFVSLGLIRLHDARFVDYVVKFINSSVMQK 207 Query: 345 VF--YAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 +G G + + +P+ +PP+ EQ I + + I+ E E+ + Sbjct: 208 WIEQNKVGGGTHTFKINLGSMYSIPLPLPPLSEQKRIVDKLEEILQLIEKYKEDKEK-LD 266 Query: 402 LLK-----ERRSSFIAAAVTGQ 418 L + + S + AV G+ Sbjct: 267 ELNLSFPSKLKKSILDYAVKGK 288 >gi|168490597|ref|ZP_02714740.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC0288-04] gi|183574925|gb|EDT95453.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC0288-04] Length = 522 Score = 120 bits (301), Expect = 4e-25, Method: Composition-based stats. Identities = 73/440 (16%), Positives = 147/440 (33%), Gaps = 66/440 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIIRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 DV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L KE ++++ Y + L +S Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 221 -------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 E +P+ WE + + + R + + + + Sbjct: 323 DISIVSQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQ 382 Query: 268 LETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA- 319 ++ L SY+ +++ G++++ L R + A Sbjct: 383 WSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVAD 442 Query: 320 ----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 + V I+ ++ + S + V SG ++ L + +K + +PP+ Sbjct: 443 SHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLP 502 Query: 374 EQFDITNVINVETARIDVLV 393 EQ I + I A ID L+ Sbjct: 503 EQSRIVDKIEQFFAHIDALI 522 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIIRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 >gi|260776278|ref|ZP_05885173.1| type I restriction-modification system specificity subunit S [Vibrio coralliilyticus ATCC BAA-450] gi|260607501|gb|EEX33766.1| type I restriction-modification system specificity subunit S [Vibrio coralliilyticus ATCC BAA-450] Length = 424 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 64/416 (15%), Positives = 130/416 (31%), Gaps = 38/416 (9%) Query: 24 HWKVVPIKRF-TKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 WK + +K+ +G T G+ I +I ++V + S Sbjct: 18 SWKTTKLGALTSKVGSGATPRGGEKAYSTSGIPFIRSQNVNYNRLLLNDIRYIPENTHAS 77 Query: 77 -TVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSID 132 S IL G + ++ + G + +++ K+ P Q L S Sbjct: 78 MKRSQIQPKDILLNITGASIGRSCVVPDCFQDGNLNQHVCIIRLKNDDPYFTQSLLASYR 137 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G +++ I M P L EQ I + ++D I Sbjct: 138 GEKLVFQGMAGGGREGLNFESIKGFKMAFPTLPEQQKIASFL----SKVDEKIALLTEKK 193 Query: 193 ELLKEKKQALVSYIVT----------KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 + L E K+ ++ + + P ++ K + FA + Sbjct: 194 DKLAEYKKGVMQQLFNGKWQEQDGQLTFIPPTLRFKADDGSEFPDWEEKALGD--FARIY 251 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI---VDPGEIVFRFIDLQ 299 + + K ++ + S ++ + + E Y + G+I+ I Sbjct: 252 DGTHQTPKYVDEGVPFYSVEHVTANQFEKTKYISEEVYAKECKRVTLKKGDILLTRIGSV 311 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSL 357 D R + + S + I YLA M+S + + + + + Sbjct: 312 GDVRLI--DWDVRASFYVSLALVKYNDEIVGQYLASFMQSPNFQSELWKRMIHVAFPKKI 369 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ V VP EQ I N ++ ID ++ + KE + + Sbjct: 370 NLGEIGHCLVSVPSRDEQTKIANFLSA----IDQKIDLANSELEKAKEWKRGLLQQ 421 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 62/178 (34%), Gaps = 10/178 (5%) Query: 246 RKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQI---VDPGEIVFRFIDLQND 301 S I + N +L ++ PE+ + P +I+ Sbjct: 39 GGEKAYSTSGIPFIRSQNVNYNRLLLNDIRYIPENTHASMKRSQIQPKDILLNITGASI- 97 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFE 360 RS + G + ++ D + L+ SY K+ F M G R+ L FE Sbjct: 98 GRSCVVPDCFQDGNLNQHVCIIRLKNDDPYFTQSLLASYRGEKLVFQGMAGGGREGLNFE 157 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +K + P + EQ I + + +++D + + + L E + + G+ Sbjct: 158 SIKGFKMAFPTLPEQQKIASFL----SKVDEKIALLTEKKDKLAEYKKGVMQQLFNGK 211 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 61/190 (32%), Gaps = 10/190 (5%) Query: 24 HWKVVPIKRFTKLNTG--RTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ + F ++ G +T + + + + +E V + + + Sbjct: 238 DWEEKALGDFARIYDGTHQTPKYVDEGVPFYSVEHVTANQFEKTKYISEEVYAKECKRVT 297 Query: 81 FAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 KG IL ++G +I S + + V L ++ Sbjct: 298 LKKGDILLTRIGSVGDVRLIDWDVRASFYVSLALVKYNDEIVGQYLASFMQSPNFQSELW 357 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + A + IG+ + +P EQ I + ID I +E KE Sbjct: 358 KRMIHVAFPKKINLGEIGHCLVSVPSRDEQTKIANFL----SAIDQKIDLANSELEKAKE 413 Query: 198 KKQALVSYIV 207 K+ L+ + Sbjct: 414 WKRGLLQQMF 423 >gi|218901963|ref|YP_002449797.1| restriction modification system DNA specificity domain protein [Bacillus cereus AH820] gi|218537816|gb|ACK90214.1| restriction modification system DNA specificity domain protein [Bacillus cereus AH820] Length = 495 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 57/449 (12%), Positives = 131/449 (29%), Gaps = 52/449 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W + +KL + + + + ++ G + S Sbjct: 25 EVPGNWIWGNLNSLSKLIVDGSHNPPPKKNEGFPMLSGRNILDGEINFETDRYVSEDDYQ 84 Query: 76 STVSI--FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS-ID 132 +L +G R ++ Q V K ++ + S Sbjct: 85 KEYKRTPIESNDVLLTIVGTIGRTTVVPKEFSPFVLQRSVALIKPMVNSNYLSYYFSSPY 144 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ +G K + + +P+PPL EQ I EK+ R++ Sbjct: 145 FQYYLQKNAKGTAQKGVYLKTLKSSRIPLPPLMEQKRITEKVEGLLGRVEEAKALIEEAK 204 Query: 193 ELLKEKKQALVSYIVTKGLNPDVK-----------------------------MKDSGI- 222 + + ++ ++ L+ + +K + + Sbjct: 205 KTFEVRRATILDKAFRGELSAKWREDNRIAEDASSLLERIQIQKRNSSIKSNTLKITSVI 264 Query: 223 --EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 E +P+ W + + + + + Q + ++ L +Y Sbjct: 265 KEEEPFELPNGWTWVRLGEISYYVTSGSRDWSKYYSDEGAMFIRTQDINKNSLNLSDVAY 324 Query: 281 --------ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 +V+ +I+ K +L + E + S + S Y Sbjct: 325 VSLPEKVEGKRSLVEKADILTTITGANVGKCALVETNIKEAYVSQSVALTKLIEKSISKY 384 Query: 333 LAWLMRSY--DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + + S ++ R L ED+K + + + P+ EQ I ++ + Sbjct: 385 VHLSLLSPCGGGNELEERAYGIGRPVLSLEDIKNIKIPLAPMAEQQVIVKLVETLLE--N 442 Query: 391 VLVEKIEQSIVL-LKERRSSFIAAAVTGQ 418 SI L+ + S + A G+ Sbjct: 443 EKESLNLASIEKHLETLKQSILNKAFRGE 471 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 73/205 (35%), Gaps = 12/205 (5%) Query: 223 EWVGLVPDHWEVKPFFA--------LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 E VP +W + +KN + ++ G I + + Sbjct: 21 EHPYEVPGNWIWGNLNSLSKLIVDGSHNPPPKKNEGFPMLSGRNILDGEINFETDRYVSE 80 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + ++ +++ + R+ + ++ + +KP ++S YL+ Sbjct: 81 DDYQKEYKRTPIESNDVLLTIVGTIG--RTTVVPKEFSPFVLQRSVALIKP-MVNSNYLS 137 Query: 335 WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + S G ++ + + +K + +PP+ EQ IT + R++ Sbjct: 138 YYFSSPYFQYYLQKNAKGTAQKGVYLKTLKSSRIPLPPLMEQKRITEKVEGLLGRVEEAK 197 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ 418 IE++ + RR++ + A G+ Sbjct: 198 ALIEEAKKTFEVRRATILDKAFRGE 222 >gi|134045656|ref|YP_001097142.1| restriction modification system DNA specificity subunit [Methanococcus maripaludis C5] gi|132663281|gb|ABO34927.1| restriction modification system DNA specificity domain [Methanococcus maripaludis C5] Length = 417 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 70/435 (16%), Positives = 148/435 (34%), Gaps = 46/435 (10%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKY 63 YK++ IG IP+ W+ V + L G + + ++ + +V + Sbjct: 6 EGYKETK---IGVIPEDWQAVKLSESVNLFGGFAFSSEDSKSEGVKWLKIANVGIDKITW 62 Query: 64 LPKDG--NSRQSDTSTVSIFAKGQILYGKLGPYLR------KAIIADFDGICSTQFLVLQ 115 + S ++ +K I+ P L K D + + + L Sbjct: 63 ENESYLPFEYLEKYSNYAL-SKNDIVMALTRPILNSKLKISKITDLDIPCLLNQRVGKLD 121 Query: 116 PKDV-LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 PK + + + G + ++ + I +P+PPL EQ I E + Sbjct: 122 PKQNTFGDYIYHSCKMPMFIHSMNVAMAGTDPPNIGFRDLSKIQIPLPPLPEQQKIAEIL 181 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 D I I E K+ L+ ++ +G D W+ Sbjct: 182 ----STWDNSIENLENLISKKIEIKKGLMQNLL------------TGNVRFPGFEDEWKE 225 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVF 293 +L+ E+ RK +S + R +T Q + + + Sbjct: 226 VKIGSLLNEVKRKIEWDDSKLYDLVSLKRRSGGIFYRESLYGHQILTKTLQPIKEDDFLI 285 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVK---PHGIDSTYLAWLMRSYDLCKVFYAMG 350 + ++ E ++S+Y P + + WL ++ + Y Sbjct: 286 SKM-QVLHGALGAVSKEFEDMYVSSSYAIFNSKTPEKFNIKFFDWLSKTPIMYHYAYISS 344 Query: 351 SGL---RQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 G+ + + + + V+VP I+EQ I V++ + +E ++Q + L+K + Sbjct: 345 YGVHIEKMTFNLKLYLKEKVMVPNSIEEQESIVRVLSTQDKE----IELLKQKLELVKTQ 400 Query: 407 RSSFIAAAVTGQIDL 421 + + +TG++ + Sbjct: 401 KKGLMQNLLTGKVRV 415 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 70/213 (32%), Gaps = 15/213 (7%) Query: 225 VGLVPDHWEVKPFFALVTELNR-------KNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 +G++P+ W+ V ++ ++ ++ + I + + Sbjct: 13 IGVIPEDWQAVKLSESVNLFGGFAFSSEDSKSEGVKWLKIANVGIDKITWENESYLPFEY 72 Query: 278 ESYETYQIVDPGEIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLA 334 + + +IV L + + + + ++ + P Y+ Sbjct: 73 LEKYSNYALSKNDIVMALTRPILNSKLKISKITDLDIPCLLNQRVGKLDPKQNTFGDYIY 132 Query: 335 WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + +G ++ F D+ ++ + +PP+ EQ I +++ I+ L Sbjct: 133 HSCKMPMFIHSMNVAMAGTDPPNIGFRDLSKIQIPLPPLPEQQKIAEILSTWDNSIENLE 192 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I + I E + + +TG + G Sbjct: 193 NLISKKI----EIKKGLMQNLLTGNVRFPGFED 221 >gi|149026393|ref|ZP_01836531.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP23-BS72] gi|147929276|gb|EDK80276.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP23-BS72] Length = 522 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 73/440 (16%), Positives = 149/440 (33%), Gaps = 66/440 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L KE ++++ Y + L +S Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEDKIKKKDL 322 Query: 221 -------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 E +P+ WE + + + R + + + + Sbjct: 323 DISIVSQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQ 382 Query: 268 LETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA 319 ++ L SY+ +++ G++++ L R ++ G + Sbjct: 383 WSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVAD 442 Query: 320 ----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 + V I+ ++ + S + V SG ++ L + +K + +PP+ Sbjct: 443 SHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLP 502 Query: 374 EQFDITNVINVETARIDVLV 393 EQ I + I A ID L+ Sbjct: 503 EQSRIVDKIEQFFAHIDALI 522 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 >gi|313672542|ref|YP_004050653.1| restriction modification system DNA specificity domain [Calditerrivibrio nitroreducens DSM 19672] gi|312939298|gb|ADR18490.1| restriction modification system DNA specificity domain [Calditerrivibrio nitroreducens DSM 19672] Length = 395 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 58/412 (14%), Positives = 132/412 (32%), Gaps = 32/412 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESG--TGKYLPKDGNSR 71 IP+ WK V + ++ TG T ++ DI ++ + D +G K + Sbjct: 4 KIPEGWKRVKLGEVIEIITGGTPKTSVPEYWNGDIPWLSITDFNNGRKYCYNAEKKITEK 63 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ST +I KGQI+ G + D + + K L + L Sbjct: 64 GLKESTTNILKKGQIIISARGTV-GVISMLGRDMAFNQSCYGINAKAGLTFNDFIYYLLK 122 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + GA + I + +PPL EQ I + + +ID L + Sbjct: 123 FNIPHFISNSYGAVFDTITKQTFEQIIIKLPPLPEQKAIASVLSSLDDKIDLLHRQNQTL 182 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ + + IE + + + + Sbjct: 183 EKM------------------AETLFRKWFIEDAKDDWEEVSLGNSELSTIINSGIDKFE 224 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 E L+ + + + F + Sbjct: 225 GEKIYLATGDVQDTNITGGIKITYENRPSRANMQPVKFSVWFAKKGGVRKLLMFDDYSDI 284 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP 370 + I+++ + +K + + Y+ + + + ++ + SG ++ + E +K++ +L P Sbjct: 285 NKYILSTGFSGLKTNELSHYYIWCFILTKEFQEIKDSFVSGSVQPDITNEGIKQITILRP 344 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +Q I N + ++ + I L+ R + + ++G++ ++ Sbjct: 345 --DDQTLI--NFNKIMKPLFYKCQQNKLQIRTLENLRDTLLPKLMSGEVRVK 392 >gi|163754483|ref|ZP_02161605.1| type I restriction-modification system, S subunit [Kordia algicida OT-1] gi|161325424|gb|EDP96751.1| type I restriction-modification system, S subunit [Kordia algicida OT-1] Length = 430 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 66/424 (15%), Positives = 150/424 (35%), Gaps = 24/424 (5%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 AYP+YK +++ IP+ W ++P + + ++++ + + + Sbjct: 3 AYPKYKTIAFEYVTQIPEDWDLLPNIAIFEERNEK-GHIHEELLSVTIGKGVIKQSELNK 61 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 KD + D S + G I+Y + + +++ G+ S VL+PK + Sbjct: 62 KDSS--NPDKSNYKLVEIGDIVYS-MRFRQGASGYSNYKGLVSNACTVLKPKMKINPKFF 118 Query: 126 GWLLSIDVTQRI---EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + Q + W+ + +PPL Q I + + +I Sbjct: 119 HYQYRLPFYQNYAERYSYGIADGQKPLRWQDFKRMYAFVPPLETQNQIVTYLEEKEKQIK 178 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 + ++ R ++L + + +L G N K +W L W+++ + + Sbjct: 179 QFVKKKNRIVDLTENQLNSL-----VFGKNKYTDFK----DWKDLFNTSWKIEKAKWVFS 229 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 E N KN E + S ++ K E + + ++V + V + Sbjct: 230 ERNIKN-HPSERLLASTQDRGLVFKDEIEENYVTATQTDGLKLVCKNDFVISLRSFEGGI 288 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLR--QSLKF 359 + + Y +L +S + + SG+R +++ F Sbjct: 289 ELSEVQGITSPAYNIFYLKKEFNDIKNLKYYYKYLFKSNQFIGLLNTVVSGIREGKNISF 348 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +D + L + +P + I ++ I++ L K+ +S I + G++ Sbjct: 349 KDFRELYIPIPD----KKTIDKIYKLHLKLIDSKALIKKENELSKKLLTSLIENIIIGKM 404 Query: 420 DLRG 423 + Sbjct: 405 KVPN 408 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 86/211 (40%), Gaps = 12/211 (5%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 +N K K E+V +P+ W++ P A+ E N K E +++ G +I++ E Sbjct: 1 MNAYPKYKTIAFEYVTQIPEDWDLLPNIAIFEERNEKGHIHEELLSVTIGKG-VIKQSEL 59 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 Y++V+ G+IV+ ++ + + + + I+ Sbjct: 60 NKKDSSNPDKSNYKLVEIGDIVYSMR----FRQGASGYSNYKGLVSNACTVLKPKMKINP 115 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + R G+ ++ L+++D KR+ VPP++ Q I + + Sbjct: 116 KFFHYQYRLPFYQNYAERYSYGIADGQKPLRWQDFKRMYAFVPPLETQNQIVTYLEEKEK 175 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +I V+K + + L + + + + V G+ Sbjct: 176 QIKQFVKKKNRIVDLTENQ----LNSLVFGK 202 >gi|326406201|gb|ADZ63272.1| type I restriction enzyme, S subunit [Lactococcus lactis subsp. lactis CV56] Length = 420 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 50/409 (12%), Positives = 123/409 (30%), Gaps = 33/409 (8%) Query: 25 WKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ + ++ G + +I ++ + DV G+ + ++ Sbjct: 22 WEQRELGDLAEIVRGASPRPIQNPKWFNQNSEIGWLRISDVTEQNGRIHFLEQRISEAGQ 81 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + +L + I G+ + L PK L + Sbjct: 82 GKTRVLHSSHLLLSIAATVGKPVINYVPTGVHDGFLIFLNPKFDL---EFMFQWLEMFRP 138 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + + + + + N + IP L EQ I +D I + R E + Sbjct: 139 QWQKYGQPGSQVNLNSDLVKNQKIFIPSLGEQKEISSF----FTNLDQTIAFQQRKFEKM 194 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--TKLIE 253 K K A +S + K + G + K + Sbjct: 195 KSMKLAYLSEMFPAEGERKPKRRFPGFTDDWEQRELLSTIKSIVDFRGRTPKKLGMDWSD 254 Query: 254 SNILSLSYGNIIQKLETRNMGLKP------ESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 S L+LS N+ N + + + + + + G+++F + ++ Sbjct: 255 SGYLALSALNVKNGYIDFNEDVHYGNQELYDKWMSGKELYKGQVLFTTEAPMGNV--VQV 312 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + + +++ + G + + + +++L Sbjct: 313 PDDKGYILSQRTIAFNINKDLLTDSFLYVLLGSLKVFKDLSALSSGGTAKGVSQKSLEQL 372 Query: 366 PVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 V +P I EQ I+ +D + +Q + L+ + +++ Sbjct: 373 KVCIPKDIDEQSKISEF----FINLDQTIAFQQQKLEKLQNIKKAYLNE 417 >gi|15612487|ref|NP_224140.1| putative type I restriction enzyme (specificity subunit) [Helicobacter pylori J99] gi|4156033|gb|AAD06991.1| putative TYPE I RESTRICTION ENZYME (SPECIFICITY SUBUNIT) [Helicobacter pylori J99] Length = 624 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 67/418 (16%), Positives = 150/418 (35%), Gaps = 33/418 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDII-----YIGLEDVESGTGKYLPKDGNSRQSDTST 77 ++W+ V + +G ++ +D I YI +V + N + Sbjct: 218 QNWQKVRLGDIGITISGLAGKTKQDFINGNAKYITFLNVLNNVIIDTSILENVKIYPNEK 277 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA-------DFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + F K + + ++ + D + S F + L +L++ Sbjct: 278 QNSFKKYDLFFNTSSETPKEVGMCAVLLDDIDQVFLNSFCFGFRIFDKAVDSLFLSYLIN 337 Query: 131 IDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 ++ + E + +G+T + G N+ + +PPL EQ+ I + I +L ++ Sbjct: 338 SEIGRKAFENLAQGSTRYNLSKSGFNNVCLILPPLNEQIAIANILSDVDSEIISLKNKKR 397 Query: 190 RFIELLKEKKQALVSYIVT-KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 +F + K L+S KG N + + G +G+ K + + Sbjct: 398 QFENVKKALSFELLSQRKRLKGFNQNWQKVRLGD--IGITISGLAGKTKQDFINGNAK-- 453 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 I L+ N + + +K E ++ F + + + Sbjct: 454 ------YITFLNVLNNVIIDTSILENVKIYPNEKQNSFKKYDLFFNTSSETPKEVGMCAV 507 Query: 309 --QVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVK 363 +++ + S + +DS +L++L+ S K F + G R +L Sbjct: 508 LLDDIDQVFLNSFCFGFRIFDKAVDSLFLSYLINSEIGRKAFENLAQGSTRYNLSKSGFN 567 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + +++PP+ EQ I N+++ + I L K Q + + + ++ +I + Sbjct: 568 NVCLILPPLNEQIAIANILSDVDSEIISLKNKKRQ----FENVKKALNHDLMSAKIRV 621 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 67/426 (15%), Positives = 145/426 (34%), Gaps = 35/426 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDI--------------IYIGLEDVESGTGKYLPKD 67 P +W+ V + +L T + +I I I + E+ K Sbjct: 11 PSNWQKVRLGDILELLTDYHANGSYEILKNNVTLLKNVDFAIMIRTTNFENNDFKNDLIY 70 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK-DVLPELLQG 126 + + + + S G IL K+ + + S + + L Sbjct: 71 IDKKAYEFLSKSKVFAGDILVNKIANAGTAYFMPKLNQPVSLGMNLFLLRIKPSYNNLFI 130 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + + ++ G+ I N+ +P+PPL EQ+ I + + +L Sbjct: 131 FKQIANYERVLKTFANGSATKTITKNVIKNLLIPLPPLNEQIAIANILSDVDRYLCSLDA 190 Query: 187 ERIRFIELLKEKKQALVSYIVT-KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 ++ + K L+S KG N + + G +G+ K + Sbjct: 191 LILKKESVKKALSFELLSQRKRLKGFNQNWQKVRLGD--IGITISGLAGKTKQDFINGNA 248 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + I L+ N + + +K E ++ F + + Sbjct: 249 K--------YITFLNVLNNVIIDTSILENVKIYPNEKQNSFKKYDLFFNTSSETPKEVGM 300 Query: 306 RSA--QVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFE 360 + +++ + S + +DS +L++L+ S K F + G R +L Sbjct: 301 CAVLLDDIDQVFLNSFCFGFRIFDKAVDSLFLSYLINSEIGRKAFENLAQGSTRYNLSKS 360 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 + +++PP+ EQ I N+++ + I L K Q + + + ++ + Sbjct: 361 GFNNVCLILPPLNEQIAIANILSDVDSEIISLKNKKRQ----FENVKKALSFELLSQRKR 416 Query: 421 LRGESQ 426 L+G +Q Sbjct: 417 LKGFNQ 422 >gi|289435129|ref|YP_003465001.1| type I restriction-modification system, S subunit [Listeria seeligeri serovar 1/2b str. SLCC3954] gi|289171373|emb|CBH27915.1| type I restriction-modification system, S subunit [Listeria seeligeri serovar 1/2b str. SLCC3954] Length = 412 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 64/389 (16%), Positives = 129/389 (33%), Gaps = 25/389 (6%) Query: 44 SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF 103 + G E+V + + + + + S I+ +G AI+ Sbjct: 37 KRGNYRVYGQENVYKNDFSFGDRYLSKEKFEGLKSSEICSNDIVISTMGTIGHCAIVPSN 96 Query: 104 --DGICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPM 159 GI + + L+ K V L+ L S + +I+ + G M I I + Sbjct: 97 ILPGIMDSHLIRLRLDNKKVNHLFLKYILQSESIQNQIKKMSVGGIMDGLSTSIIKQIEI 156 Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 P + EQ I E + ID LI I+ + K A + +VT + K Sbjct: 157 SYPSINEQKNIAESL----SDIDQLINSLSELIKKKESIKNAFLENLVTG----ARRFKG 208 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 EW + A + ++ +ES L G +K + Sbjct: 209 FDGEW--ENINLGGTSLLKARIGWQGLTTSEYLESGFSYLITGTDFKKGTINWKDIHFVE 266 Query: 280 YETYQI-----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 Y V +++ +++ + + + Y+ Sbjct: 267 KHRYDQDKNIQVKDDDLLLTKDGTIGKVALVKNLNKPATLNSGVFVIRPIKNKYLTEYVY 326 Query: 335 WLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVL 392 +++ S + +G L +D+ +P +KEQ + +++ ID Sbjct: 327 YVLTSSVFRTFLNKLAAGSTISHLYQKDLTNFEFFLPSSLKEQKAVATILSD----IDKE 382 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + K+E+ + K+ + + +TG+I L Sbjct: 383 IFKLEEKLEKYKKIKQGMMEQLLTGKIRL 411 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 63/164 (38%), Gaps = 6/164 (3%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 Y N + K E ++ +I +IV + + S + Sbjct: 50 YKNDFSFGDRYLSKEKFEGLKSSEICS-NDIVISTMGTIGHCAIVPSNILPGIMDSHLIR 108 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + ++ +L ++++S + M G+ L +K++ + P I EQ +I Sbjct: 109 LRLDNKKVNHLFLKYILQSESIQNQIKKMSVGGIMDGLSTSIIKQIEISYPSINEQKNIA 168 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ++ ID L+ + + I + +++F+ VTG +G Sbjct: 169 ESLSD----IDQLINSLSELIKKKESIKNAFLENLVTGARRFKG 208 >gi|58583087|ref|YP_202103.1| Type I restriction enzyme StySPI specificity protein [Xanthomonas oryzae pv. oryzae KACC10331] gi|58427681|gb|AAW76718.1| Type I restriction enzyme StySPI specificity protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 464 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 70/436 (16%), Positives = 142/436 (32%), Gaps = 42/436 (9%) Query: 15 VQW-IGAIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLP 65 V+W + +P W + G T + + ++ SG + Sbjct: 2 VRWTVSELPGGWCTSALSGLADTVRGVTYNKLQAQSTAEEGLLPILRANNINSGKLVFDD 61 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVL 120 S + G I+ + + G VL+ + Sbjct: 62 LVFVPE-DCVSRTQVLLAGDIVVAMSSGSRSVVGKSAQVEAPWPGSFGAFCGVLRASQEI 120 Query: 121 P-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L + S R+ + G +++ I +P+ PLAEQ I +K+ A Sbjct: 121 DARYLYYFTQSRAYRDRVSELAAGVNINNLKPGHFEKISVPLAPLAEQKRIAQKLDALLA 180 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKP 236 ++DTL LLK ++++V V L+ D K E +G + + W Sbjct: 181 QVDTLKARIDAIPALLKRFRKSVVHSAVIGRLSADLRVPIEKSEEQEQLGPL-ESWREVT 239 Query: 237 FFALVTELNRKNTK-------LIESNILSLSYGNIIQKL---ETRNMGLKPESYETYQIV 286 +L K+ L S + G++ + + + ++ Sbjct: 240 LASLGELSRGKSKHRPRNDSRLYGSEYPFIQTGDVANSGGALTSSKVFYSEFGLKQSRLF 299 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G + D L ++ + ++ +++ D + Sbjct: 300 PSGTLCITIAANIADTAMLAIDACFPDSVVG---FIPNKDDCVTQFIKYVI--DDNKESL 354 Query: 347 YAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK---IEQSIVL 402 A+ + ++++ + + ++ + +PPIKEQ +I + A D L K +Q I Sbjct: 355 EALAPATAQKNINLKVLNQVKLRIPPIKEQTEIVRHVEQLFAYADQLEAKVAAAQQRIDA 414 Query: 403 LKERRSSFIAAAVTGQ 418 L S +A A G+ Sbjct: 415 LT---QSLLAKAFRGE 427 >gi|84390143|ref|ZP_00991405.1| type I restriction enzyme specificity protein [Vibrio splendidus 12B01] gi|84376797|gb|EAP93672.1| type I restriction enzyme specificity protein [Vibrio splendidus 12B01] Length = 496 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 77/449 (17%), Positives = 146/449 (32%), Gaps = 50/449 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY-LPKDGNSRQSDTSTV 78 +PK W + I E+ YI + V+ P + + + Sbjct: 3 ELPKGWITIKIDSLCAKPKQLKPEASWKFNYIDISSVDREKKLICEPSEILGSDAPSRAR 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 I G +L P L + ST F VL+P + + L + S Sbjct: 63 KIVNTGDVLVSMTRPNLNAVAKVPEKYNGQVASTGFDVLKPFLIESDWLFSVVRSQPFID 122 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I GA I + MP+PPLAEQ I EK+ ++DT+ +LL Sbjct: 123 SISGTTIGALYPACKTSDIRDYEMPLPPLAEQKRIVEKLDEVLAQVDTIKARLDGIPDLL 182 Query: 196 KEKKQALVSYIVTKGLNPDVKM-----------KDSGIEWVGLV---------------- 228 K +Q++++ V+ L + ++ K + + G + Sbjct: 183 KRFRQSVLASAVSGTLTKEWRLTNELTKAEEELKSNFLAKSGKLKLRGKQTNFSELSLIT 242 Query: 229 -PDHWEVKP-----------FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM--- 273 PD W A K + + + ++ + +N Sbjct: 243 LPDSWTWAQNYKLAKDESNAICAGPFGTIFKAKDFRDEGVPIIFLRHVKEIGFNQNKPNY 302 Query: 274 --GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDS 330 G E V GE++ + + + + + M V + Sbjct: 303 MDGDVWEELHQEYSVHGGELLVTKLGDPPGECCIYPENMGTAMVTPDVLKMNVDEDIVLR 362 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 YL S ++ A+ R + K P+ +P ++EQ +I +++ A Sbjct: 363 KYLRSYFNSPISTEIIEALAFGATRLRIDIAMFKGFPIPLPSMEEQKEIVRLVDQYFAFA 422 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 D + +++++ + S +A A G+ Sbjct: 423 DTIEAQVKKAQAKVDNLTQSILAKAFRGE 451 >gi|52079177|ref|YP_077968.1| Type I RM system specificity subunit HsdIB [Bacillus licheniformis ATCC 14580] gi|52784544|ref|YP_090373.1| hypothetical protein BLi00745 [Bacillus licheniformis ATCC 14580] gi|52002388|gb|AAU22330.1| Type I RM system specificity subunit HsdIB [Bacillus licheniformis ATCC 14580] gi|52347046|gb|AAU39680.1| putative protein [Bacillus licheniformis ATCC 14580] Length = 397 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 71/400 (17%), Positives = 138/400 (34%), Gaps = 32/400 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + ++ +G T IYI ++D+ T + L + Sbjct: 17 DWEERKLGELVEIKSGWTPSDFVETQKCNGEIYIKVDDLNYSTRELLDSKMKVAI--HAK 74 Query: 78 VSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 KG ++ K G K I DG T + L+P+++ E L + Sbjct: 75 YHTIKKGSTIFPKRGAAIMTNKVRILGTDGYMDTNMMALEPRNINGEFLYTLID----RT 130 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + I + +T+ + K + + +P L EQ I ++D I + + L Sbjct: 131 GLFKIADTSTIPQINNKHVEPYKILLPNLYEQKNIGNF----FKQLDDTIALHQQELTTL 186 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K+ KQ + + K +++ G WE + + ++ KN + Sbjct: 187 KQTKQGFLQKMFPKEGESVPEVRFPG------FTGEWEQRKADEIFYSVSDKNHSNLPVL 240 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + G + + ++ +S + Y+ V PG+ V Q + Sbjct: 241 SATQEKGMVYRDETGLDINYDVKSTKNYKRVLPGQFVIHLRSFQGGFAFSNIEGITSPAY 300 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIK 373 + S + ++ S K A+ G+R +S+ F D L VP Sbjct: 301 TVLDF--KNKEMYYSLFWRCVLASDTFIKRLEAVTYGIRDGKSISFSDFSTLKFRVPSHN 358 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I N ++D + + + LKE + +F+ Sbjct: 359 EQLKIGNF----FKQLDDTIALHQCELDTLKETKKAFLQK 394 >gi|326314831|ref|YP_004232503.1| restriction modification system DNA specificity domain-containing protein [Acidovorax avenae subsp. avenae ATCC 19860] gi|323371667|gb|ADX43936.1| restriction modification system DNA specificity domain protein [Acidovorax avenae subsp. avenae ATCC 19860] Length = 438 Score = 119 bits (299), Expect = 6e-25, Method: Composition-based stats. Identities = 64/424 (15%), Positives = 149/424 (35%), Gaps = 23/424 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P W+ + ++N RT + + ++ + DV G+ + S Sbjct: 17 LPVGWRWSNMGELAQVNPPRTYPESDEAVVSFLAMGDVSED-GRIRTRQTRSYSDVAKGF 75 Query: 79 SIFAKGQILYGKLGPYL------RKAIIADFDGICSTQFLVLQPKDVL--PELLQGWLLS 130 + F +L K+ P A + + G ST+F V++ + + P L + Sbjct: 76 TSFIDDDVLVAKITPCFENGKGAHVAGLLNGVGFGSTEFHVIRARQEIAFPAFLHLHTRT 135 Query: 131 IDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + E G+ + + P+ +PP+ EQ I + A ++D + + Sbjct: 136 EAFRTKGERNMVGSAGQKRVPAEFLRAYPIALPPVLEQKGIAAILTAADDKLDVIARQIE 195 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG--LVPDHWEVKPFFALVTELNRK 247 + + Q L S + + + VG P W + + R+ Sbjct: 196 VTQTIKQGLIQTLFSKGIGSKSADGRWTRHTAFVKVGSAEYPKSWRMGRMGDFAPLVRRE 255 Query: 248 NTKLIESNILSLSYGNIIQKLETRN-MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + L + + + + + + ++ G+++ + ++ Sbjct: 256 VDVQPSKSYPELGLRSFGKGTFHKPALTGEQVGSKRLFLIKAGDLLLSNVFAWEGAVAVA 315 Query: 307 SAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDL---CKVFYAMGSGLRQSLKFEDV 362 S + R V P + ++A + + + G+G ++L + Sbjct: 316 SPEDDGRYGSHRYITCKVDPEIANVHFVARYLVTPAGLASIGLASPGGAGRNKTLGLAAL 375 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + + +PP+ EQ I V+ A+I VL K L ++ +S + +TG+ ++ Sbjct: 376 ADMNIPLPPLAEQNAINEVLECVEAKIAVLQAKH----ELYRDLKSGLMQKLLTGEWRVK 431 Query: 423 GESQ 426 ++ Sbjct: 432 VDAD 435 Score = 41.3 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 66/193 (34%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 PK W++ + F L + K +GL G G + Q + + Sbjct: 235 EYPKSWRMGRMGDFAPLVRREVDVQPSKSYPELGLRSF--GKGTFHKPALTGEQVGSKRL 292 Query: 79 SIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQ---GWLLSID 132 + G +L + + + D S +++ + + + + Sbjct: 293 FLIKAGDLLLSNVFAWEGAVAVASPEDDGRYGSHRYITCKVDPEIANVHFVARYLVTPAG 352 Query: 133 VTQRIEAICEGATMSH-ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + A GA + + ++ +P+PPLAEQ I E + +I L + + Sbjct: 353 LASIGLASPGGAGRNKTLGLAALADMNIPLPPLAEQNAINEVLECVEAKIAVLQAKHELY 412 Query: 192 IELLKEKKQALVS 204 +L Q L++ Sbjct: 413 RDLKSGLMQKLLT 425 >gi|149199875|ref|ZP_01876904.1| restriction modification system DNA specificity domain [Lentisphaera araneosa HTCC2155] gi|149137046|gb|EDM25470.1| restriction modification system DNA specificity domain [Lentisphaera araneosa HTCC2155] Length = 404 Score = 119 bits (299), Expect = 6e-25, Method: Composition-based stats. Identities = 66/432 (15%), Positives = 139/432 (32%), Gaps = 35/432 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKR-FTKLNTGRTSESGKDIIYIGLEDVESG 59 M + AYP G+ + +P+ W +K ++ E K+ + ++ + Sbjct: 1 MANQSAYPPTVQPGIPKLKIVPEGWTQSSLKNYLIEVKDKVKLEDDKEYDLVTVK--RAR 58 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQP 116 G + + + + +G L K I + I S ++ +L Sbjct: 59 GGLVRREHLLGKNISVKSQFLLKEGYFLISKRQIVHGACGIVPKELDGSIVSNEYSILDS 118 Query: 117 KDVLPELLQGWLLSIDVTQ---RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + Q +I +PPL EQ I + Sbjct: 119 NGKICLEFLKYHSHSVFFQQTCFHSSIGVHIEKMIFKLDQWFKFKFNLPPLPEQKKIAKI 178 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + D I + + I+ K K+AL+ ++ +G + + D W Sbjct: 179 L----GTWDKAIDKLDKLIDNSKTTKKALMQQLL------------TGKKRLPGFTDEWR 222 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + + L S G+I G + IV E Sbjct: 223 KIRLAECANSHDNRRIPLNSSE-REKRKGDIPYWGANGIQGYVDDFIFDETIVLLAEDGG 281 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 F + R + + + + A++ + + ++ + S + + G Sbjct: 282 NFSEFST--RPIANISYGKSWVNNHAHILMAKENTTNEWIYY---SLVHKNILGYVNGGT 336 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 R L D+ ++P+ +P I EQ +T + V+ I+ L E L + + + Sbjct: 337 RAKLNKGDMLKIPMFLPSITEQKKLTEIFVVQDKEINSL----ESQRNKLIIEKKALMQQ 392 Query: 414 AVTGQIDLRGES 425 +TG+ ++ E+ Sbjct: 393 LLTGKKRVQEEA 404 >gi|307248456|ref|ZP_07530476.1| hypothetical protein appser2_14290 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306855024|gb|EFM87207.1| hypothetical protein appser2_14290 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] Length = 457 Score = 119 bits (299), Expect = 6e-25, Method: Composition-based stats. Identities = 71/436 (16%), Positives = 130/436 (29%), Gaps = 64/436 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W++ + +T I +GL + + L Q+ + Sbjct: 20 EIPESWEIEKLGNIIFNLGQKTPNERFFYIDVGLINNKIHKLNSLENILEPDQAPSRARK 79 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQ-PKDVLPELLQGWLLSIDVT 134 I K ILY + PYL+ I + D I ST F+V+ + + L +LLS T Sbjct: 80 IVQKNSILYSTVRPYLQNICILEQDFQYEPIASTAFVVMNVFTNFYHKYLFYYLLSPVFT 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G + + N+P+ IPPL EQ I KI I+ + + L Sbjct: 140 DFVNQEMVGVAYPAINDDKLYNLPIAIPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTAL 199 Query: 195 LKEKK----QALVSYIVTKGLNPDVKM--------------------------------- 217 ++ ++++ + L Sbjct: 200 HQQFPEQLKKSILQAAIQGKLTKQDPNDEPALVLIERIKAEKLRLIAEKKLKKPKVVSEI 259 Query: 218 ---------------KDSGIEWVGLVPDHWEVKPFFALVTE------LNRKNTKLIESNI 256 + E +P++W + + I Sbjct: 260 ILRDNLPYEIINGEERCIADEVPFEIPENWCWVRLGEIGNWGAGATPNRHEPKYYENGTI 319 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 L G++ + T E V + I + +E Sbjct: 320 PWLKTGDLNDGIITEIPEYITELAIEKTSVKLNPVGSVLIAMYGATIGKLGILNIEATTN 379 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + GI + YL + + S + GSG + ++ E + +PP+ EQ Sbjct: 380 QACCACIPYTGIYNKYLFYYLMSQKTELQKRSEGSG-QPNISKEKIVNYLFPLPPLNEQK 438 Query: 377 DITNVINVETARIDVL 392 I I + + L Sbjct: 439 CIVEKIETLFSTLQNL 454 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 76/201 (37%), Gaps = 10/201 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQ 284 +P+ WE++ ++ L +K I N I KL + L+P+ + Sbjct: 20 EIPESWEIEKLGNIIFNLGQKTPNERFFYIDVGLINNKIHKLNSLENILEPDQAPSRARK 79 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLC 343 IV I++ + + I ++A++ + YL + + S Sbjct: 80 IVQKNSILYSTVRPYLQNICILEQDFQYEPIASTAFVVMNVFTNFYHKYLFYYLLSPVFT 139 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G+ ++ + + LP+ +PP+ EQ I I I+ + E+ + Sbjct: 140 DFVNQEMVGVAYPAINDDKLYNLPIAIPPLNEQKRIVAKIEELLPYIEQY-AEKEEKLTA 198 Query: 403 L-----KERRSSFIAAAVTGQ 418 L ++ + S + AA+ G+ Sbjct: 199 LHQQFPEQLKKSILQAAIQGK 219 >gi|331655787|ref|ZP_08356776.1| type I restriction enzyme EcoR124II specificity protein (S protein)(S.EcoR124II) [Escherichia coli M718] gi|331046561|gb|EGI18650.1| type I restriction enzyme EcoR124II specificity protein (S protein)(S.EcoR124II) [Escherichia coli M718] Length = 422 Score = 119 bits (299), Expect = 6e-25, Method: Composition-based stats. Identities = 53/398 (13%), Positives = 122/398 (30%), Gaps = 28/398 (7%) Query: 26 KVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS--TVS 79 + P+K G + + + + +++ S + Sbjct: 17 EWKPLKDVCDFKNGFAFKSSLFKETGLPIVRITNIDGFNVDLDEVKYFSLNDYKEDLSSF 76 Query: 80 IFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G IL G K I + + PK+ + + + T+ I Sbjct: 77 EVSMGNILIAMSGATTGKVGIYKKGTKCYLNQRVGKFIPKENILNNNYLYHFLLLNTETI 136 Query: 138 EAICEGATMSHADWKGIG--------NIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + G + + P LA Q I + T L E Sbjct: 137 YILAGGGAQPNLSSNALMSKLLIPIPCPDNPEKSLAIQSEIVRILDKFTALTAELTAELT 196 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + + K++ +++ K+ +EW L + Sbjct: 197 AELNMRKKQYNYYRDQLLS--------FKEGEVEWKALGEVAKIQRGASPRPIVNYLTEQ 248 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 I + ++ + E + +I++PG+ V LR Sbjct: 249 GNGIPWIKIGDTIPGSKYIDKTLQKITAEGAQKSRILNPGDFVISNSMSFGRPYILRITG 308 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVL 368 + G + ++ +++ YL + S + + + SG +L + +K LPV Sbjct: 309 AIHDGWAS---ISNFGEKLNADYLYHYLSSKKVKNYWESKINSGSVSNLNADIIKTLPVP 365 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +P ++Q I+ +++ + + E + + I L +++ Sbjct: 366 LPDKQKQERISALLDKFDTLTNSITEGLPREIELRQKQ 403 >gi|225854060|ref|YP_002735572.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae JJA] gi|307126722|ref|YP_003878753.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae 670-6B] gi|225722675|gb|ACO18528.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae JJA] gi|306483784|gb|ADM90653.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae 670-6B] Length = 522 Score = 119 bits (299), Expect = 6e-25, Method: Composition-based stats. Identities = 73/440 (16%), Positives = 149/440 (33%), Gaps = 66/440 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 DV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L KE ++++ Y + L +S Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 221 -------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 E +P+ WE + + + R + + + + Sbjct: 323 DISIVSQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQ 382 Query: 268 LETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA 319 ++ L SY+ +++ G++++ L R ++ G + Sbjct: 383 WSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVAD 442 Query: 320 ----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 + V I+ ++ + S + V SG ++ L + +K + +PP+ Sbjct: 443 SHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLP 502 Query: 374 EQFDITNVINVETARIDVLV 393 EQ I + I A ID L+ Sbjct: 503 EQSRIVDKIEQFFAHIDALI 522 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 >gi|32477087|ref|NP_870081.1| restriction modification system S chain-like protein [Rhodopirellula baltica SH 1] gi|32447635|emb|CAD79236.1| restriction modification system S chain homolog [Rhodopirellula baltica SH 1] Length = 389 Score = 119 bits (299), Expect = 6e-25, Method: Composition-based stats. Identities = 50/400 (12%), Positives = 115/400 (28%), Gaps = 25/400 (6%) Query: 27 VVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 V + +G T K I ++ ++ + S+ Sbjct: 5 EVALSEICDTGSGGTPSRAKQEIYYDGSIPWVKSGELRESVITETGESITELGLKESSAK 64 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + +L G + + + + + L P D E + Sbjct: 65 LLPADTLLVALYGATVGRVGMLGIEAATNQAVCYLIPDDTRVERRYLYHALRSKVPYWLT 124 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + I N +P+PPL+EQ I E + R + LL E Sbjct: 125 QRVGGGQPNISQGVIKNTKIPLPPLSEQKRIAEILDRAEALRAK----RRAALALLDELT 180 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 Q++++ ++ + G +G + + + ++ +N Sbjct: 181 QSILARLLDGSAD-------LGTTTLGNI-SRDMHQGINTVTEKIEYQNDGFPIIQSKHT 232 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + G + Y+ +++ I K L + Sbjct: 233 TQGYLDLSDARFVSKATYLKYKEKYRPARNDLLLCNIGTIG-KSLLMEQENDFLIAWNLF 291 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + + ++ F + G + + + + P+ +P + Q + Sbjct: 292 LIKLDLDQVSPSFCKHYFDRLASQHYFDRFLTGGTVKFISKKTLNATPIPLPSMDRQREF 351 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + A ++VL EK ++ L + +S A G+ Sbjct: 352 ----EEQIASVEVLKEKHRSAVAELDQLFASLQHRAFRGE 387 >gi|315030630|gb|EFT42562.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX4000] Length = 417 Score = 119 bits (299), Expect = 6e-25, Method: Composition-based stats. Identities = 66/405 (16%), Positives = 137/405 (33%), Gaps = 24/405 (5%) Query: 23 KHWKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS--RQSDTS 76 + W++ + E Y+ + D++ + K++ S + + Sbjct: 18 EDWELCKLGDVADHFEYGLNASAIEYDGKNKYLRITDIDDSSRKFIQNKLTSPNINVEEA 77 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSID 132 + I G IL+ + G + K D E + L+ Sbjct: 78 SNYILTVGDILFARTGASVGKTYRYDIKDGKVYFAGFLIRARIKDSFDSEFVYWTTLTDR 137 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 I+ + + + + K + + IP + EQ I + +ID I R + Sbjct: 138 YNTFIKIMSQRSGQPGINAKEYSSFNILIPNIKEQQKIGAFL----KKIDDTIALHQRKL 193 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E LKE K+A + + + K+ + ++ V E N K+ K Sbjct: 194 EQLKELKKAYLQLMFASTNTKNDKLPKLRFTGFKGYWELCKLSDISDKVKEKN-KHGKFT 252 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVM 311 E+ S YG I Q++ + +Y +V + V+ I ++ ++ Sbjct: 253 ETLTNSAEYGIINQRVFFDKDISNVNNLNSYYVVQNDDFVYNPRISNFAPVGPIKRNRLG 312 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPV 367 G+++ Y + H ID+ YL + G R ++K +P+ Sbjct: 313 RTGVMSPLYYVFRTHSIDNNYLEKYFDTVYWHHFMELNGDTGARADRFAIKDSIFVEMPI 372 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 P +EQ I ++D + + + LK + +++ Sbjct: 373 PYPSTEEQQKIGIF----FKKLDQSITLYKNKLNQLKALKKAYLQ 413 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 59/189 (31%), Gaps = 8/189 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W++ + + + + + E + KD ++ + ++ + Sbjct: 230 WELCKLSDISDKVKEKNKHGKFTETLTNSAEYGIINQRVFFDKDISNVNN-LNSYYVVQN 288 Query: 84 GQILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +Y + G+ S + V + + L+ + ++ +E Sbjct: 289 DDFVYNPRISNFAPVGPIKRNRLGRTGVMSPLYYVFRTHSIDNNYLEKYFDTVYWHHFME 348 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + +I + +P ++KI ++D IT + LK Sbjct: 349 LNGDTGARADRFAIK-DSIFVEMPIPYPSTEEQQKIGIFFKKLDQSITLYKNKLNQLKAL 407 Query: 199 KQALVSYIV 207 K+A + + Sbjct: 408 KKAYLQNMF 416 >gi|315586487|gb|ADU40868.1| type I site-specific deoxyribonuclease [Helicobacter pylori 35A] gi|315586546|gb|ADU40927.1| type I site-specific deoxyribonuclease [Helicobacter pylori 35A] Length = 429 Score = 119 bits (299), Expect = 7e-25, Method: Composition-based stats. Identities = 59/401 (14%), Positives = 131/401 (32%), Gaps = 23/401 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELKAR 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKM------KDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + + L+ + T + D KM K L P E + + N Sbjct: 192 KKQYQYYQNMLLDFKDTNQNHKDAKMSAKPYPKRLKTLLQTLAPKGVEFRKLGEVCESTN 251 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 +K K+ E + + + G + + GE + + Sbjct: 252 KKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFN------NDGENITIASRGEYAGFIN 305 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + G + Y + + + +L + +++ ++ + + G +L D++ L Sbjct: 306 YFNEKFFAGGLCYPYKVKDTNELLTKFLYFYLKTNEIQIMENLVSRGSIPALNKADIETL 365 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + +PP++ Q +I +++ + L+ I I K++ Sbjct: 366 TIPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIEARKKQ 406 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 55/191 (28%), Gaps = 21/191 (10%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG---------LKPES 279 P E K + N + + R G + P++ Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + ++ I+ + L + + +++ K + + + + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLIVDSLANQRFT---FLSKKANCDLALDMKFFFYQ 129 Query: 340 YDLCKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 L + + S+ K+ +PP++ Q +I +++ T E Sbjct: 130 CFLLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFT-------ELNT 182 Query: 398 QSIVLLKERRS 408 + LK R+ Sbjct: 183 ELNTELKARKK 193 >gi|303263138|ref|ZP_07349064.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP14-BS292] gi|302635725|gb|EFL66234.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP14-BS292] Length = 458 Score = 119 bits (299), Expect = 7e-25, Method: Composition-based stats. Identities = 74/440 (16%), Positives = 148/440 (33%), Gaps = 66/440 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 19 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 78 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 79 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 138 Query: 132 DV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +V + ++ GA + + + + +I +P+PPLAEQ I E I + ++D R Sbjct: 139 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNR 198 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L KE ++++ Y + L +S Sbjct: 199 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 258 Query: 221 -------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 E +P+ WE + + + R + + + + Sbjct: 259 DISIVSQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQ 318 Query: 268 LETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA- 319 ++ L SY+ +++ G++++ L R + + A Sbjct: 319 WSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYVWAVAD 378 Query: 320 ----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 + V I+ ++ + S + V SG ++ L + +K + +PP+ Sbjct: 379 SHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLP 438 Query: 374 EQFDITNVINVETARIDVLV 393 EQ I + I A ID L+ Sbjct: 439 EQSRIVDKIEQFFAHIDALI 458 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 13 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 72 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 73 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 132 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 133 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEY 192 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 193 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 222 >gi|254181649|ref|ZP_04888246.1| type I site-specific deoxyribonuclease [Burkholderia pseudomallei 1655] gi|184212187|gb|EDU09230.1| type I site-specific deoxyribonuclease [Burkholderia pseudomallei 1655] Length = 424 Score = 119 bits (299), Expect = 7e-25, Method: Composition-based stats. Identities = 59/418 (14%), Positives = 129/418 (30%), Gaps = 33/418 (7%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYL 64 +P+++++G W++ + + K T + SE K ++ E Y Sbjct: 24 RFPEFRETG---------EWRIEALGKLAKRCTKKNSEGEHKRVLTNSAEYGVIDQRDYF 74 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKDVL 120 KD + Q + I KG +Y P + G+ S + V + Sbjct: 75 DKDI-ANQGNLEGYYIVEKGDYVYNPRISASAPVGPISKNNLGTGVMSPLYTVFRFNGSA 133 Query: 121 PELLQGWLLSIDVTQRIE---AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 E + S Q + + N+P+P+ EQ I + + Sbjct: 134 NEFFAHYFKSPHWHQYMREASSTGARHDRMSITNDDFMNMPLPVSTPKEQQKIADCL--- 190 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 ID + R + LK K L+ + +++ + Sbjct: 191 -SSIDERMAAENRKLGTLKVYKNGLLQQLFPCEGETVPRLRFPEFRDAEAWKEVELSTRI 249 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + + ++ + + S + G+I+ Sbjct: 250 DLISGLHLAPDEYADAGDVPYFTGPSDYANDLALVGKWTSHSANSG---RAGDILIT--- 303 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 ++ ++ + MAV+P G+ ++ + + ++ L L Sbjct: 304 VKGSGVGELLYLELDEVAMGRQLMAVRPRGVHGEFIFHFLATQR-QRLIALASGNLIPGL 362 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 D+ L V VP +EQ I + + + +D ++ Q I +L+ + + Sbjct: 363 SRGDILSLTVSVPEREEQQAIADCL----SSLDDVIAVQSQKIDVLQAHKKGLMQQLF 416 >gi|313682026|ref|YP_004059764.1| restriction modification system DNA specificity domain [Sulfuricurvum kujiense DSM 16994] gi|313154886|gb|ADR33564.1| restriction modification system DNA specificity domain [Sulfuricurvum kujiense DSM 16994] Length = 406 Score = 119 bits (299), Expect = 7e-25, Method: Composition-based stats. Identities = 65/417 (15%), Positives = 146/417 (35%), Gaps = 31/417 (7%) Query: 18 IGAIPKHWKVVPIKRF-TKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNS 70 + +P W + +++ G + + K I ++ SG + + Sbjct: 4 LYELPNGWVYKQLDEIVFRMHQGVNTAADKVEFYSDGYPIIQSRNITSGELHFENIKYVN 63 Query: 71 RQSD--TSTVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQ 125 + IL +G + I+ D + + Q + V + L+ Sbjct: 64 EEDWNLYEKKYKPKINDILLSNIGTIGKSIIVNQNDNFLIHWNIFLIEPQTELVSAQFLK 123 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L +D + +GAT+ K + + +P+PPL EQ I K+ + +ID I Sbjct: 124 VFLDKLDNDSYYDQFLKGATVKFVSKKNLASTLIPLPPLQEQQRIVSKLDSLFEKIDKAI 183 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + I+ ++++ + + K K + A T L Sbjct: 184 SLHQKNIDEADVFMGSVLNEVFEEMDGQYKKEKL-----------EKFDRKMSAGGTPLR 232 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDK 302 KN + I S G + Q+ + + ++ G ++ D K Sbjct: 233 AKNEYWDDGTIEWFSSGELNQQFTLPAKERITDEGLKNSSAKLFSKGTLLIGMYDTAAMK 292 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFED 361 S+ G A + +KP+ + + L G+ +Q+L Sbjct: 293 MSILHTD----GSCNQAIVGIKPNEDELNIFFLKYQLEYLKPKILEERQGVRQQNLNLSK 348 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +K + + +PP+ Q + ++ + +++ + ++ + LK ++S + A G+ Sbjct: 349 IKNVEIELPPLPIQQKVVVYLDSVSEKMEKVKTIQKEKMESLKALKASILDKAFRGE 405 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 62/167 (37%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 I + ++ + S+ +F+KG +L G K I D Sbjct: 240 DGTIEWFSSGELNQQFTLPAKERITDEGLKNSSAKLFSKGTLLIGMYDTAAMKMSILHTD 299 Query: 105 GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 G C+ + ++P + + + +I +G + + I N+ + +PPL Sbjct: 300 GSCNQAIVGIKPNEDELNIFFLKYQLEYLKPKILEERQGVRQQNLNLSKIKNVEIELPPL 359 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 Q + + + + +++ + T + +E LK K +++ L Sbjct: 360 PIQQKVVVYLDSVSEKMEKVKTIQKEKMESLKALKASILDKAFRGEL 406 >gi|15902492|ref|NP_358042.1| type I restriction-modification system S subunit [Streptococcus pneumoniae R6] gi|116515523|ref|YP_815961.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae D39] gi|15458016|gb|AAK99252.1| Type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae R6] gi|116076099|gb|ABJ53819.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae D39] Length = 522 Score = 119 bits (299), Expect = 7e-25, Method: Composition-based stats. Identities = 73/440 (16%), Positives = 147/440 (33%), Gaps = 66/440 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 DV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L KE ++++ Y + L +S Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 221 -------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 E +P+ WE + + + R + + + + Sbjct: 323 DISIVSQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQ 382 Query: 268 LETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA- 319 ++ L SY+ +++ G++++ L R + A Sbjct: 383 WSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVAD 442 Query: 320 ----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 + V I+ ++ + S + V SG ++ L + +K + +PP+ Sbjct: 443 SHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLP 502 Query: 374 EQFDITNVINVETARIDVLV 393 EQ I + I A ID L+ Sbjct: 503 EQSRIVDKIEQFFAHIDALI 522 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 >gi|303260804|ref|ZP_07346757.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP-BS293] gi|302638053|gb|EFL68535.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP-BS293] Length = 458 Score = 119 bits (299), Expect = 7e-25, Method: Composition-based stats. Identities = 74/440 (16%), Positives = 148/440 (33%), Gaps = 66/440 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 19 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 78 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 79 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 138 Query: 132 DV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +V + ++ GA + + + + +I +P+PPLAEQ I E I + ++D R Sbjct: 139 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNR 198 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L KE ++++ Y + L +S Sbjct: 199 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 258 Query: 221 -------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 E +P+ WE + + + R + + + + Sbjct: 259 DISIVSQGDDNFYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQ 318 Query: 268 LETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA- 319 ++ L SY+ +++ G++++ L R + + A Sbjct: 319 WSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYVWAVAD 378 Query: 320 ----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 + V I+ ++ + S + V SG ++ L + +K + +PP+ Sbjct: 379 SHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLP 438 Query: 374 EQFDITNVINVETARIDVLV 393 EQ I + I A ID L+ Sbjct: 439 EQSRIVDKIEQFFAHIDALI 458 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 13 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 72 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 73 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 132 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 133 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEY 192 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 193 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 222 >gi|219848706|ref|YP_002463139.1| restriction modification system DNA specificity domain-containing protein [Chloroflexus aggregans DSM 9485] gi|219542965|gb|ACL24703.1| restriction modification system DNA specificity domain protein [Chloroflexus aggregans DSM 9485] Length = 423 Score = 119 bits (298), Expect = 8e-25, Method: Composition-based stats. Identities = 56/417 (13%), Positives = 134/417 (32%), Gaps = 29/417 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + + G + ++Y G G L + + + Sbjct: 2 VRLGEVARQRKG-FITVDETLVYKRPTIKLYGQGMVLRDNVIGASLKIKKQQVCKAYDFV 60 Query: 88 YGKLGPYLRKAIIADFD---GICSTQFLVLQP-KDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ + I S+ + + + K+ + +++ + + QR Sbjct: 61 VAEIDAKCGGFAVVPPFLEGAILSSHYFIFELDKEKVDPNFMSYIVKLPLLQRQVEARGS 120 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + +P+PPL EQ I + + R I L+E K++L+ Sbjct: 121 TNYASVRPSQVITYLIPLPPLPEQRAIAHVL----RAVQRAQEASERVIAALRELKKSLM 176 Query: 204 SYIVTKGLNPDVKMKDSGI----------EWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 ++ T G + +G +P HW+V + + + Sbjct: 177 RHLFTYGPVAVSVGAQRAVGAQRAVPLQDTELGPLPAHWQVVRLGEVCQKSPQVVPTKAP 236 Query: 254 SNILSLSYGNIIQKLETRNMGL-----KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + K +++ G+++F + + ++ Sbjct: 237 DWQFKYVDVSCVDNSSLNIVDYQVLTGKEAPSRARKLIKAGDVIFATVRPYLKRIAIVPP 296 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPV 367 + + T+ + +D +YL + + + + G ++ DVKR + Sbjct: 297 SLDGQVCSTAFCVLSPKPEVDGSYLFYAVSTDEFVSSVVEYQRGSSYPAITDNDVKRGFI 356 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +PP+ EQ +I ++ D +E E S L+ + + +T + L E Sbjct: 357 PLPPLAEQQEIARILQAV----DRRIEVEEVSARALETLFKTLLHELMTAKRRLPQE 409 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 76/194 (39%), Gaps = 7/194 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPK-DGNSRQSD 74 +G +P HW+VV + + + D Y+ + V++ + + +++ Sbjct: 208 LGPLPAHWQVVRLGEVCQKSPQVVPTKAPDWQFKYVDVSCVDNSSLNIVDYQVLTGKEAP 267 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGW-LLS 130 + + G +++ + PYL++ I +CST F VL PK + + + + Sbjct: 268 SRARKLIKAGDVIFATVRPYLKRIAIVPPSLDGQVCSTAFCVLSPKPEVDGSYLFYAVST 327 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + G++ + +P+PPLAEQ I + A RI+ Sbjct: 328 DEFVSSVVEYQRGSSYPAITDNDVKRGFIPLPPLAEQQEIARILQAVDRRIEVEEVSARA 387 Query: 191 FIELLKEKKQALVS 204 L K L++ Sbjct: 388 LETLFKTLLHELMT 401 >gi|38505922|ref|NP_942540.1| type I site-specific deoxyribonuclease [Synechocystis sp. PCC 6803] gi|38423946|dbj|BAD02154.1| type I site-specific deoxyribonuclease [Synechocystis sp. PCC 6803] Length = 394 Score = 119 bits (298), Expect = 8e-25, Method: Composition-based stats. Identities = 61/410 (14%), Positives = 125/410 (30%), Gaps = 32/410 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +W ++ L G T E+ +G P G++ + Sbjct: 3 NNWNILNFGNLIILEYGNTLTE------------ENRSGGDYPVYGSNGIIGFHKAYLLD 50 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 I+ G+ G T + V +D + L S D+ + Sbjct: 51 SPNIIVGRKGSVGEVVWANKNCWAIDTTYYVTLKQDNSLRFIYWLLKSFDLR----KLDS 106 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + I +P L EQ I E + I + ++ L Sbjct: 107 STGVPGLNRNDVYRIKCNLPSLPEQEKIAEILDTMDEAIAKTEECIAKLKKIKAGLVHDL 166 Query: 203 VSYIVTKG---LNPDVKMKDSGIEWVGLVPDHWEVKPFFA-------LVTELNRKNTKLI 252 ++ + + +P + +GL+P W++K T R + Sbjct: 167 LTRGIDENGELRDPVRHPEQFKQSAIGLIPKEWDIKELSQLATVDRGKFTHRPRNDPNFY 226 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + G+I Q L E V E I + +A + Sbjct: 227 GGQYPFIQTGDIAQNLGQVIRSYTQTLNENGAKVSR-EFPVGTIAVTIAANIADTAILGI 285 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-P 371 + + V + L L + K+ ++++ ED++ L + P Sbjct: 286 PMFFPDSIVGVTVFPQFNHRLVELCIRFAKHKLDAKATQSAQKNINLEDLRPLLIPFPRN 345 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 KEQ + ++ + D ++K E + LK + + +TG++ + Sbjct: 346 PKEQ----DRMSSVYEKFDERLKKEEAYLEKLKLHKKGLMHDLLTGKVRV 391 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 64/211 (30%), Gaps = 15/211 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESG 59 Q+K S IG IPK W + + + ++ G+ + ++ +I D+ Sbjct: 185 EQFKQSA---IGLIPKEWDIKELSQLATVDRGKFTHRPRNDPNFYGGQYPFIQTGDIAQN 241 Query: 60 TGKYLPKDGNSRQSDTSTV-SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 G+ + + + + V F G I AI+ +V Sbjct: 242 LGQVIRSYTQTLNENGAKVSREFPVGTIAVTIAANIADTAILGIPMFF--PDSIVGVTVF 299 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAE 177 L +++A + + + + + + +P P EQ + Sbjct: 300 PQFNHRLVELCIRFAKHKLDAKATQSAQKNINLEDLRPLLIPFPRNPKEQDRMSSVYEKF 359 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVT 208 R+ + K L++ V Sbjct: 360 DERLKKEEAYLEKLKLHKKGLMHDLLTGKVR 390 >gi|311064526|ref|YP_003971251.1| restriction endonuclease S subunit [Bifidobacterium bifidum PRL2010] gi|310866845|gb|ADP36214.1| Restriction endonuclease S subunit [Bifidobacterium bifidum PRL2010] Length = 413 Score = 119 bits (298), Expect = 9e-25, Method: Composition-based stats. Identities = 64/408 (15%), Positives = 135/408 (33%), Gaps = 35/408 (8%) Query: 25 WKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + G T ++++ +DV+S + + Sbjct: 19 WEQRKLGEVATFGGGHTPPMADPDNYEDGYVLWVTSQDVKSNYLDRTTTQITEKGAKE-- 76 Query: 78 VSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++++ G ++ LR + V+ P+ Sbjct: 77 LTLYPAGSLVMVTRSGILRHTLPVAELRKPSTVNQDIRVILPQGECCGEWLLQFFISHNK 136 Query: 135 QRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + G T+ D+ I ++ + +P EQ I + ++D+LIT R + Sbjct: 137 ELLLEFGKTGTTVESVDFGKIKDMLLYMPSTVEQQQIGDF----FAKLDSLITLHQRKYD 192 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK--L 251 L K++++ + K +++ +G D WE + + KN L Sbjct: 193 KLVIFKKSMLEKMFPKDGESVPEIRFAG------FTDPWEQRKLGEFSKKNTIKNANGAL 246 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQV 310 E+ S G I Q + + Y +V P + V+ I + ++ Sbjct: 247 SETFTNSAEQGVISQLDYFDHDITNDANISGYYVVQPDDFVYNPRISATAPCGPINRNRL 306 Query: 311 MERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRL 365 G+++ Y +D TYL ++ + G+ R S+ + Sbjct: 307 NRAGVMSPLYTVFSVDASMDKTYLEHYFKTSRWHDFMFLEGNTGARSDRFSISDATFFEM 366 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P+ P I EQ I + D L+ ++ + LL+ + S + Sbjct: 367 PIWCPEISEQMAIAKQLET----TDTLITLHQRKLELLRNIKKSLLDK 410 >gi|21911179|ref|NP_665447.1| putative type I site-specific deoxyribonuclease hsdS subunit [Streptococcus pyogenes MGAS315] gi|28896555|ref|NP_802905.1| type I site-specific deoxyribonuclease [Streptococcus pyogenes SSI-1] gi|94995073|ref|YP_603171.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10750] gi|21905391|gb|AAM80250.1| putative type I site-specific deoxyribonuclease hsdS subunit [Streptococcus pyogenes MGAS315] gi|28811809|dbj|BAC64738.1| putative type I site-specific deoxyribonuclease [Streptococcus pyogenes SSI-1] gi|94548581|gb|ABF38627.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10750] Length = 391 Score = 119 bits (298), Expect = 9e-25, Method: Composition-based stats. Identities = 62/392 (15%), Positives = 122/392 (31%), Gaps = 18/392 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + ++ G++ S + G R T K Sbjct: 17 EWEEKELGDIVQITMGQSPSSQNYTTNPSDYILVQGNADIKNGYVFPRVWTTQITKQADK 76 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G I+ P ++ I ++ + + L + + I G Sbjct: 77 GDIILSVRAPV-GDVGKTNYHVIIGRGVAAIKGNEFI----FQILKYLKEIGYWKRISTG 131 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T I + IP L EQ I E +D LI + + + LKE+KQ + Sbjct: 132 STFDSISSSNIKYAKIQIPSLPEQEAIGE----LFQTVDQLIQLQDQKLATLKEQKQTFL 187 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + +++ G + E+ F+ T + + I + Sbjct: 188 RKMFPAQGQKVPEIRLQGFDGEWEEKKLGEISRMFSGGTPSVGISEYYNGN-IPFIRSSE 246 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I + K S + +IV+ +++ + + L G I A +A+ Sbjct: 247 INSDQTELFITNKGLSNSSAKIVEKNTLLYALYGATSGEVGLSRIS----GAINQAILAI 302 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 P S+ + G + +L VK L + +P + EQ I + Sbjct: 303 IPEKKYSSLFIKNWLYKQKSSIIEKYLQGGQGNLSGSIVKGLELYLPSLPEQEAIGDF-- 360 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +D + +IE + LK + + + Sbjct: 361 --FQTLDQQMSQIEDKLTELKALKQTLLNRLF 390 >gi|2689700|gb|AAB91417.1| specificity subunit [Lactococcus lactis subsp. lactis bv. diacetylactis] Length = 409 Score = 119 bits (298), Expect = 9e-25, Method: Composition-based stats. Identities = 61/400 (15%), Positives = 138/400 (34%), Gaps = 19/400 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS-DTSTVSIFA 82 W+ +K + TG+ + G+D + + + DG+ D + Sbjct: 16 DWEKRKLKD-FTIKTGKKNSEGEDHPAYSVSNKLGLVSQTKQFDGSRLDFLDKTAYKFVN 74 Query: 83 KGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDV-LPELLQGWLLSIDVTQRIEA 139 +G+ Y + D I S+ ++VL+ D E + ++ SI + ++ Sbjct: 75 QGEFAYNPARINVGSIAFNDLGKTVIVSSLYVVLKISDKLDNEYILQFIKSIKFIEEVKR 134 Query: 140 ICEGATMSHADWKGIGNI-PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG+ + + NI I L EQ I ++D IT R ++LLKE+ Sbjct: 135 NTEGSVREYLFFDNFKNIKFPYIKNLEEQQKIGSF----FKQLDNTITLHQRKLDLLKEQ 190 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+ + + K +++ +G E+ +N + + + Sbjct: 191 KKGYLQKMFPKNGAKVPELRFAGFVDDWEQRKLGEMLVNLEAGVSVNSSDYDTGYFILKT 250 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + L + E + I+ ++ + + I Sbjct: 251 SAIKMGNIDLLEVKSIVSEEVARAKTPLIKNSIIISRMNTPELVGASGLVRESIDNIFLP 310 Query: 319 AYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIK 373 + + +L + K + +G +++ + + L + VP ++ Sbjct: 311 DRLWQGQVAGNFSPEWLIQSINIAANIKKIRDLATGTSGSMKNISKKSMLDLIINVPTLE 370 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I + ++D ++ ++ + LLKE++ F+ Sbjct: 371 EQQKIGSF----FKQLDDVIALHQRKLDLLKEQKKGFLQK 406 >gi|153952538|ref|YP_001398844.1| hypothetical protein JJD26997_1911 [Campylobacter jejuni subsp. doylei 269.97] gi|152939984|gb|ABS44725.1| HsdS [Campylobacter jejuni subsp. doylei 269.97] Length = 453 Score = 119 bits (298), Expect = 1e-24, Method: Composition-based stats. Identities = 51/448 (11%), Positives = 120/448 (26%), Gaps = 51/448 (11%) Query: 21 IPKHWKVVPIKRFTK-----LNTG-------RTSESGKDIIYIGLEDVESGTGKYLPKDG 68 +P+ W+ + + G ++ + + + + Sbjct: 4 LPQGWEWKSLYEILSNDKYSIKRGPFGSALKKSFFVENGVRVFEQYNAINNDPHWKRYCI 63 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQF--LVLQPKDVLPELL 124 + + +G +L G + + G+ + + L +L Sbjct: 64 SYDKFKELEAFKAMEGDLLISCSGTLGKIVELPKNTEIGVINQALLKIRLDNTKILNSYF 123 Query: 125 QGWLLSIDVTQRIEAICEGATMSHA-DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + S + +I G+ + + K + I +P+PP+ EQ I + +ID Sbjct: 124 IYYFNSPTMQDKILESTLGSAIKNIASVKILKQIEIPLPPIKEQERIVGILDFAFSKIDE 183 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFA 239 I + + + E Q+ + + + W +G + + Sbjct: 184 NIKKAKENLANIDELIQSALQKAFNPLNDNTKENYQLPQSWEWKSLGEICEILSGGTPDT 243 Query: 240 LVTELNRKNTKLIESNILSLSYG----------------------NIIQKLETRNMGLKP 277 N S+ G + K Sbjct: 244 KNPIFWYSNQTDETQFEKSVVGGLGDFKGDKGSDFAIKVPLSPLEKNYYWATLVDTKEKY 303 Query: 278 ESYETYQIVDPGE-------IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 +I G + + + + Y Sbjct: 304 LYKTKRKITQKGLDCSNATLLPINSVIFSSRASIGEISIAKVETATNQGYKNFICDASIL 363 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 Y K + G + + +K + +PP+KEQ I + ++ ++ + Sbjct: 364 YYEFLYFALKHFTKEIELLAQGTTYKEVSKAKIKEFKIPLPPLKEQKQIASHLDELSSHV 423 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTG 417 L + + I L+E ++S + A G Sbjct: 424 KNLKQNYQAQIKDLQELKNSLLDKAFKG 451 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 69/236 (29%), Gaps = 45/236 (19%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG---------------------------------- 45 +P+ W+ + ++ +G T ++ Sbjct: 219 QLPQSWEWKSLGEICEILSGGTPDTKNPIFWYSNQTDETQFEKSVVGGLGDFKGDKGSDF 278 Query: 46 ----------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYL 95 K+ + L D + + + D S ++ +++ + Sbjct: 279 AIKVPLSPLEKNYYWATLVDTKEKYLYKTKRKITQKGLDCSNATLLPINSVIFSS-RASI 337 Query: 96 RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + IA + + + + + T+ IE + +G T I Sbjct: 338 GEISIAKVETATNQGYKNFICDASILYYEFLYFALKHFTKEIELLAQGTTYKEVSKAKIK 397 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 +P+PPL EQ I + + + L I+ L+E K +L+ L Sbjct: 398 EFKIPLPPLKEQKQIASHLDELSSHVKNLKQNYQAQIKDLQELKNSLLDKAFKGNL 453 >gi|91203222|emb|CAJ72861.1| similar to type I restriction modification enzyme S chain [Candidatus Kuenenia stuttgartiensis] Length = 386 Score = 119 bits (298), Expect = 1e-24, Method: Composition-based stats. Identities = 56/410 (13%), Positives = 132/410 (32%), Gaps = 47/410 (11%) Query: 24 HWKVVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +W++ + ++ + + ++ +ED+ T ++ + + + Sbjct: 4 NWQIKKLGEVCEIKPPKKEARDRLNDDDIVSFVPMEDLGILTKNFIATKERPLKEVSGSY 63 Query: 79 SIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPK-DVLPELLQGWLLSI 131 + F+ +L K+ P I + G S+++++ + + +V+P+ L +L Sbjct: 64 TYFSDNDVLLAKITPCFENGKIGIARNLKNGIGFGSSEYIIFRSRGEVIPDYLYYYLARD 123 Query: 132 DVTQRIEAICEGATMSHADWKGI--GNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 Q + GA K L EQ I + I T + Sbjct: 124 QFRQDGKKAMTGAVGHKRVPKDFIENQKIPYPNSLPEQQRIVAILEEAFAAIATAKEKTE 183 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + ++ +E + + + + + +G + + N K Sbjct: 184 KNLQNARELFASYLQSVFANPGDGW------EEKTLGECFKLKSGDNITSKMMIENGKYP 237 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + I + Y + + IV R L + R + A Sbjct: 238 VYGGNGIAGM--------------------YNKFNLSGSNVIVGRVGALCGNVRHIEEAI 277 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + + D+ +LA+L+ +L + + ++ + + Sbjct: 278 WLTD---NGFKITDCKYDFDNAFLAYLLNLKNLRNYAR---QAAQPVISNSSLEEVLLQF 331 Query: 370 P-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P +K+Q I ++ +A L Q + L E + S + A TGQ Sbjct: 332 PKSLKDQKSIVTKLDALSAETKKLEAIYRQKLADLDELKKSVLQKAFTGQ 381 >gi|188577916|ref|YP_001914845.1| type I restriction-modification system, S subunit [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522368|gb|ACD60313.1| type I restriction-modification system, S subunit [Xanthomonas oryzae pv. oryzae PXO99A] Length = 501 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 74/466 (15%), Positives = 153/466 (32%), Gaps = 65/466 (13%) Query: 15 VQW-IGAIPKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDV------ESGTG 61 V+W + +P W + + +G + + DV + G Sbjct: 2 VRWMVSELPAGWAETTLGAIGSVQSGMGFPLEMQGQTEGVYPVYKVGDVSRGVLLDRGIL 61 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR--KAIIADFDGICSTQFLVLQPKDV 119 + ++ + IF +G IL+ K+G LR + I +G+ + + Sbjct: 62 RRSTNYVDAEAAAILKGHIFPEGSILFAKIGEALRLNRRAIVFREGLADNNVMGFKADQG 121 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + +L TQ + ++ T+ + +I + +PPLAEQ I +K+ A Sbjct: 122 IDDG---FLYHFLRTQDLASLSRSTTIPSIRKSDVEDITISLPPLAEQKRIVQKLDALLA 178 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS------------------- 220 ++DTL LLK ++A ++ ++ L D +++ S Sbjct: 179 QVDTLKARIDAMPALLKRFREATLTSAMSGTLTKDWRIESSQSTAPEAPRMCRQLLANER 238 Query: 221 ----------------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + V + V + + K + Sbjct: 239 ERIWRGRGKYKPAVRSGEVDASEFSNLPEVWHRGTLDEITWSVKDGPHFSPKYATDGVRF 298 Query: 259 LSYGNIIQKLETRNMGLKP-----ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +S GNI + G E + ++++ R+ Sbjct: 299 ISGGNIRPGRIDLSTGKYISQELHEELSARCKPEYLDVLYTKGGTTGFAAVNRTESEFNV 358 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPI 372 + + + P +D ++ + + S + G+ Q L + ++ + VPPI Sbjct: 359 WVHVAVLKMLPPSVVDPFFVEFALNSPECYAQSQRYTHGVGNQDLGLRRMIKIVLPVPPI 418 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ +I + A D L K+ + + S +A A G+ Sbjct: 419 GEQREIVRRVEQLFAYADQLEAKVATAKQRIDALTQSLLAKAFRGE 464 >gi|302345833|ref|YP_003814186.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] gi|302149861|gb|ADK96123.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] Length = 416 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 62/421 (14%), Positives = 141/421 (33%), Gaps = 33/421 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVES--GTGKYLPKDGNSRQSD 74 + WK + G T ++ +I ++ ++D S K + Sbjct: 2 EEWKEYKYTDLATIIGGGTPKTSVPEYWNGEIPWLSVKDFVSVAKYVYSSEKHISELGLL 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ + K I+ G A+I + L+ ++ + +L + Sbjct: 62 NSSTKLLEKNDIIISARGTVGAVAMIPCPM-CFNQSCFGLRGNGIVDKNFLYYLTRTKID 120 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + ++ G+ + N+ +PPL Q I + + + + I R + Sbjct: 121 ELKQSAH-GSVFDTITKETFDNLLCLVPPLQLQQKIGKFLSSLDSK----IEINQRINDN 175 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN--------- 245 L+++ QAL K G +P W + L + Sbjct: 176 LEQQAQALFKSWFVDFEPFLSKEFSKSDSLFGDIPVGWSIVSIKDLPIYITDYVANGSFA 235 Query: 246 --RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE--TYQIVDPGEIVFRFIDLQND 301 ++N +L + + N K E+ + + SYE + +++ GEI+ + Sbjct: 236 SLKENVRLYDKPNYAHFIRNTDLKAESYKIYVDKHSYEFLSKSVLEGGEIIISNVGDVGS 295 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFE 360 L + + + + YL L + + + G ++ Sbjct: 296 -VFLCPKLQKPMTLGNNIILLRPKNNYSMFYLYMLFKGNVGQHLIDGITGGSAQRKFNKT 354 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 D K + +++PP+ I + I +EK + I L+ R + + ++G+++ Sbjct: 355 DFKSIKIMMPPVD----ILIKFDRIIKPIFSKIEKNREEISRLELVRDTLLPKLMSGEVE 410 Query: 421 L 421 + Sbjct: 411 I 411 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 61/206 (29%), Gaps = 20/206 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRT--------------SESGKDIIYIGLEDVESGTGKYL 64 G IP W +V IK T + +I D+++ + K Sbjct: 207 GDIPVGWSIVSIKDLPIYITDYVANGSFASLKENVRLYDKPNYAHFIRNTDLKAESYKIY 266 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPE 122 + + S+ G+I+ +G + ++L+PK+ Sbjct: 267 VDKH---SYEFLSKSVLEGGEIIISNVGDVGSVFLCPKLQKPMTLGNNIILLRPKNNYSM 323 Query: 123 LL-QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 I+ I G+ + +I + +PP+ + I +I Sbjct: 324 FYLYMLFKGNVGQHLIDGITGGSAQRKFNKTDFKSIKIMMPPVDILIKFDRIIKPIFSKI 383 Query: 182 DTLITERIRFIELLKEKKQALVSYIV 207 + E R + L+S V Sbjct: 384 EKNREEISRLELVRDTLLPKLMSGEV 409 >gi|32476611|ref|NP_869605.1| type I restriction enzyme EcoEI specificity protein [Rhodopirellula baltica SH 1] gi|32447157|emb|CAD76983.1| type I restriction enzyme EcoEI specificity protein [Rhodopirellula baltica SH 1] Length = 550 Score = 119 bits (297), Expect = 1e-24, Method: Composition-based stats. Identities = 72/485 (14%), Positives = 153/485 (31%), Gaps = 87/485 (17%) Query: 19 GA-IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS 73 G +P+ W VPI L GR + + + I ++++ KY N Sbjct: 38 GEALPEGWADVPIGDLCDLVNGRAFKPKEWSETGLPIIRIQNLNKAEAKY-----NHFDG 92 Query: 74 DTSTVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLL 129 + + + G++L+ G I + + VL +D L + + Sbjct: 93 EYADKHLVRPGELLFAWSGTPGTSFGAHIWNGPKALLNQHIFRVLIDEDDLNMTFFRFAI 152 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + I G + H +P+PPLAEQ I I + R Sbjct: 153 NHKLEELIGKAHGGVGLRHVTKGKFEATQVPLPPLAEQSRIVSAIESLQERSSRARFLLS 212 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE-------------------------- 223 L+ + +Q+++ + L D + + +E Sbjct: 213 EVGPLIGQLRQSVLRDAFSGKLTADWREANPNVEPAFKLLSRIRTERRERWEAEQLAKYE 272 Query: 224 -----------------------WVGLVPDHWEVKPFFALVTELNRKNTK--------LI 252 + +PD W L+ + + Sbjct: 273 AKGKQPPKNWQDKYKEPEPVDESELPELPDGWCWCQVGDLIESFDAGRSPTALSHPARDG 332 Query: 253 ESNILSLSYGNIIQKLETRNMGLKP-ESYETYQIVDPGEIVFRFIDLQNDKRSLR-SAQV 310 E +L +S + N LK + G+++ + ++ Sbjct: 333 EYGVLKVSAVTWREFDPNANKALKDGDEIGDTPTPRKGDLLISRANTVELIGAVVLVKAD 392 Query: 311 MERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRL 365 +++ + + P + YL + +RS + K F +G ++L + Sbjct: 393 YPNLMLSDKTLRMNPASKELVPEYLLYGLRSESVRKFFEDNATGTSNSMRNLSQGKILDA 452 Query: 366 PVLVPPIKEQFDITNVI---NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI--- 419 P+ + P+ EQ + +++ + + + +E S+ L S ++ A G++ Sbjct: 453 PIALAPLAEQQAVADLLVTNDEACTSVASGLASMESSLTQLD---QSILSKAFRGELVPQ 509 Query: 420 DLRGE 424 D R E Sbjct: 510 DPRDE 514 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 39/246 (15%), Positives = 83/246 (33%), Gaps = 27/246 (10%) Query: 2 KHYKAYPQYKD------SGVQWIGAIPKHWKVVPIKRFTKLNT-GRTSE------SGKDI 48 K+++ +YK+ S + +P W + + GR+ + Sbjct: 280 KNWQ--DKYKEPEPVDESE---LPELPDGWCWCQVGDLIESFDAGRSPTALSHPARDGEY 334 Query: 49 IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY--LRKAIIADFDG- 105 + + V + KG +L + + ++ D Sbjct: 335 GVLKVSAVTWREFDPNANKALKDGDEIGDTPTPRKGDLLISRANTVELIGAVVLVKADYP 394 Query: 106 --ICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICEG--ATMSHADWKGIGNIPM 159 + S + L + P K+++PE L L S V + E G +M + I + P+ Sbjct: 395 NLMLSDKTLRMNPASKELVPEYLLYGLRSESVRKFFEDNATGTSNSMRNLSQGKILDAPI 454 Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 + PLAEQ + + ++ ++ + L + Q+++S L P + Sbjct: 455 ALAPLAEQQAVADLLVTNDEACTSVASGLASMESSLTQLDQSILSKAFRGELVPQDPRDE 514 Query: 220 SGIEWV 225 E + Sbjct: 515 PASELL 520 >gi|291526087|emb|CBK91674.1| Restriction endonuclease S subunits [Eubacterium rectale DSM 17629] Length = 377 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 49/405 (12%), Positives = 132/405 (32%), Gaps = 45/405 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + I T + TG T + K DI ++ ++ K + S+ Sbjct: 2 EYKKINELTTVVTGGTPSTRKNEYWDNGDIPWLQSGCCQNCDVDSTEKYITKEGYNNSST 61 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + + ++ G K F+ + + P + L + + ++I Sbjct: 62 HMMSADTVMIALTGATAGKVGYLKFEACGNQSITGILPCESLN-QRYLFFYLLSQREKIL 120 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A C G +H + N+ +PI + EQ I ++ + + Sbjct: 121 ADCVGGAQAHISQSYVKNMTIPILAIKEQEQIVGEL---------------TKVSNIVSL 165 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 +Q + + D +K +E G + + ++T+ ++ K I Sbjct: 166 RQEEIQQL-------DNLVKARFVEMFGDCTNMISLSELCLIITDGTHQSPKFQHDGIPF 218 Query: 259 LSYGNIIQKLETRNMGL-----KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + N+ + T + + ++ G+I+ + + + Sbjct: 219 ILVSNLSKNTVTYDTDKFISAETYKELYKRTPIEIGDILLSTVGSYGHPAVVV---EDRK 275 Query: 314 GIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP 370 + +KP +S Y+ + S + G +++L +++++ + VP Sbjct: 276 FLFQRHIAYLKPKSDILNSYYMHGALLSPGCQRQIEEKVKGIAQKTLNLSEIRKIRIPVP 335 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + Q + ++ +++ +++++ + S + Sbjct: 336 SLDLQKQYADFVH----QVNKSKVAVQKALDETQILFDSLMQKYF 376 >gi|163761334|ref|ZP_02168409.1| HsdS, type I restriction-modification system, S subunit [Hoeflea phototrophica DFL-43] gi|162281491|gb|EDQ31787.1| HsdS, type I restriction-modification system, S subunit [Hoeflea phototrophica DFL-43] Length = 424 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 65/424 (15%), Positives = 143/424 (33%), Gaps = 34/424 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + + W+ + R +I + + + +DT Sbjct: 4 EVAEGWRPTTFSDIASDVSARNRSRD-EIPVLSVTKYDGFVPSEEYFKKKVFSADTENYK 62 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQP--KDVLPELLQGWLLSIDVTQ 135 I +GQ Y + G+ S + V + + P+ + + Sbjct: 63 IVRRGQFAYATIHLDEGSIDRLTRFDVGLISPMYTVFEIDERQADPDFILRLFKFYAMNG 122 Query: 136 RIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + +A+ G + +G + +P+PPL EQ I E + +D I IE Sbjct: 123 QFDALGNGGVNRRKSISFSTLGKLSIPLPPLHEQRRIAEIL----SSVDEAIAATRAVIE 178 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR------- 246 ++ KQ ++ ++TKG+ + K + + ++ A +T R Sbjct: 179 QTRKVKQGVMERLLTKGIG-HTRFKQTESGEIPEGWKVATLEELLADITNPMRSGPFGSA 237 Query: 247 -KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQND 301 K+ +L++S + L NI + N + Q+ V+P ++V + Sbjct: 238 LKSEELVDSGVPFLGIDNIQVEQFVCNYKRFLSEDKFRQLRRFAVNPNDVVITIMGTVG- 296 Query: 302 KRSLRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMRSYDLC--KVFYAMGSGLRQSL 357 R + + + A+ W M K + G+ ++ Sbjct: 297 -RCCVIPPDIGEAVSSKHIWAMSLHHEKYIPELACWQMNFAPWIVSKFTTSAQGGIMSAI 355 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +++L VP ++EQ I V + + + + + L+ +S ++ +TG Sbjct: 356 NSGILRKLVFPVPGLEEQRRILQVWQSFQSEL----QVEQAKLHNLESLKSDLMSDLLTG 411 Query: 418 QIDL 421 + + Sbjct: 412 RKRV 415 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 67/204 (32%), Gaps = 18/204 (8%) Query: 19 GAIPKHWKVVPIKR-FTKLNT-------GRTSESGK----DIIYIGLEDVESGTGK-YLP 65 G IP+ WKV ++ + G +S + + ++G+++++ Sbjct: 207 GEIPEGWKVATLEELLADITNPMRSGPFGSALKSEELVDSGVPFLGIDNIQVEQFVCNYK 266 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG-ICSTQFLVLQPKDVLPELL 124 + + + ++ +G R +I G S++ + + Sbjct: 267 RFLSEDKFRQLRRFAVNPNDVVITIMGTVGRCCVIPPDIGEAVSSKHIWAMSLHHEKYIP 326 Query: 125 QGWLLSIDVTQRIEAIC----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + ++ I + +G MS + + + P+P L EQ I + + Sbjct: 327 ELACWQMNFAPWIVSKFTTSAQGGIMSAINSGILRKLVFPVPGLEEQRRILQVWQSFQSE 386 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 + + L + L++ Sbjct: 387 LQVEQAKLHNLESLKSDLMSDLLT 410 >gi|313673365|ref|YP_004051476.1| restriction modification system DNA specificity domain [Calditerrivibrio nitroreducens DSM 19672] gi|312940121|gb|ADR19313.1| restriction modification system DNA specificity domain [Calditerrivibrio nitroreducens DSM 19672] Length = 865 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 67/405 (16%), Positives = 134/405 (33%), Gaps = 39/405 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKD--GNSRQSDT 75 W++V + ++ G T I + +ED+ + + Sbjct: 468 WQMVRLGEVCEIYNGSTPNRNIKEYWENGTIPWFTIEDLRRQGRIIYNTRQFITQKGYNE 527 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF---LVLQPKDVLPELLQGWLLSID 132 S+V + K +L + + + + + QF +V + + S Sbjct: 528 SSVKLLPKHSVLLCCT-ASIGEYAFTEIELTTNQQFNGLVVKESFRDKLFPKYLFYCSPK 586 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +E + AT + N+ +P+PPL Q I +I +I + I Sbjct: 587 FKTELERLSGKATFGFVSIATLKNLQIPLPPLEVQQEIVAEIEGY----QKIIDGCRQVI 642 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + K + + + L E + W + + + + I Sbjct: 643 DAWKPDVETYLDEELKTYLAEHP-------EKQEELSSGWPMVKLGEVCEIERGSSPRPI 695 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG-----EIVFRFIDLQNDKRSLRS 307 + + G K+ +Y +I G ++ + L N + Sbjct: 696 NKFVTNDKNGINWIKIGDAFSSSIYINYTKEKITPEGAKMSRKVSVGDLILSNSMSFGKP 755 Query: 308 AQVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKR 364 + G I ++ P ID YL +++ S + K F + G+ +L + VK Sbjct: 756 YILNIDGCIHDGWLALRNIPKDIDKLYLYYILSSEIISKEFQNLATGGVVSNLNTKLVKS 815 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVL 402 + + +PP++ Q I + I E ID L EKI++ I Sbjct: 816 VEIPLPPLEVQSRIVDKIESERKVIDSLREMVKIYEEKIKRVIDR 860 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 71/195 (36%), Gaps = 13/195 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 + W +V + ++ G + I +I + D S + Sbjct: 670 ELSSGWPMVKLGEVCEIERGSSPRPINKFVTNDKNGINWIKIGDAFSSSIYINYTKEKIT 729 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + G ++ + + I+ I + + +L ++LS Sbjct: 730 PEGAKMSRKVSVGDLILSNSMSFGKPYILNIDGCIHDGWLALRNIPKDIDKLYLYYILSS 789 Query: 132 DVT-QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 ++ + + + G +S+ + K + ++ +P+PPL Q I +KI +E +I Sbjct: 790 EIISKEFQNLATGGVVSNLNTKLVKSVEIPLPPLEVQSRIVDKIESE----RKVIDSLRE 845 Query: 191 FIELLKEKKQALVSY 205 +++ +EK + ++ Sbjct: 846 MVKIYEEKIKRVIDR 860 >gi|242372372|ref|ZP_04817946.1| specificity determinant HsdS [Staphylococcus epidermidis M23864:W1] gi|242349891|gb|EES41492.1| specificity determinant HsdS [Staphylococcus epidermidis M23864:W1] Length = 417 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 59/410 (14%), Positives = 128/410 (31%), Gaps = 31/410 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 + WK+ + + G + + DI ++ + DV GK + Sbjct: 17 EEWKLENLGNLADIVRGASPRPIKDSKWFDDNSDIGWLRISDVTQQNGKIKFLQQHLSNE 76 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + +L + I G+ + ++PK L + + Sbjct: 77 GQKKTRVLYEPHLLLSIAASVGKPVINYVKTGVHDGFLIFMRPKFNL---YFMFNWLENF 133 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + + + + + + IP E+ EK+ ++D I + + Sbjct: 134 QLKWNKYGQPGSQVNLNSDLVKSQNIYIPKSYEEQ---EKMGIFFNKLDQHIELEEQKLA 190 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L +++K+ + I ++ L S L + + Sbjct: 191 LFEQQKKGYMQKIFSQEL--CFTKLSSSNTQKCLKIKDLFNIIDGDRGKNYPNEKDFYNQ 248 Query: 254 SNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 L L GN+ +K + R + + + ++ + V + Sbjct: 249 GYTLFLDTGNVTKKGFSFTKNRFINKEKDDLLRNGKLELNDFVITSRGTLGNIGFYSQDI 308 Query: 310 V--MERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKR 364 I SA + ++P D +YL +L+R + + + +D Sbjct: 309 HLQYSNMRINSAMLILRPIDKMFDYSYLYFLLRDDAINTFMKHYRVGSAQPHITKKDFGN 368 Query: 365 LPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + V I EQ I N + RID LV + LK R+ + Sbjct: 369 MKINVTTDINEQKKIANFLE----RIDRLVINQGNKVETLKRRKQGLLQK 414 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 43/116 (37%), Gaps = 9/116 (7%) Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFED 361 ++ G+ + ++P WL + + G G + +L + Sbjct: 97 VGKPVINYVKTGVHDGFLIFMRPKFNLYFMFNWL---ENFQLKWNKYGQPGSQVNLNSDL 153 Query: 362 VKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 VK + +P +EQ + ++D +E EQ + L ++++ ++ + Sbjct: 154 VKSQNIYIPKSYEEQEKMGIF----FNKLDQHIELEEQKLALFEQQKKGYMQKIFS 205 >gi|86130655|ref|ZP_01049255.1| type I site-specific deoxyribonuclease [Dokdonia donghaensis MED134] gi|85819330|gb|EAQ40489.1| type I site-specific deoxyribonuclease [Dokdonia donghaensis MED134] Length = 395 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 55/405 (13%), Positives = 121/405 (29%), Gaps = 27/405 (6%) Query: 26 KVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV--S 79 K + + G +S K I + + + G + Sbjct: 4 KTQTLTTVCAIKNGFAFKSKDYLTKGIPLLRISNFNDGEVYINDNQIYVDAKYLKSKNDF 63 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQR 136 I KG +L G K I +FD + + + +++ + + + Sbjct: 64 IVEKGDVLIALSGATTGKYGIYNFDFPSLLNQRIGLIKSGESDTLNSRYFYYYLNILKSE 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I GA + K IG +P+PPL Q I + + + ++ Sbjct: 124 ILRNAGGAAQPNISTKKIGTFEIPLPPLETQKRIAQILDDAAAL----RDTTAQLLKEYD 179 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH--WEVKPFFALVTELNRKNTKLIES 254 Q++ + +P + K EW+ + P + + + I Sbjct: 180 LLAQSIFLEMFG---DPVMNPK----EWIKTRFANLVSSNCPLTYGIVQPGDEYENGIPC 232 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I + + + + I++ GEI+ + Sbjct: 233 VRPVDLTSQYISVDNLKKIDPAISNKFSRTILEGGEILLSVRGSVGVISIADDSLKGANV 292 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIK 373 + + Y +L ++ + + G + +D++ L ++ PPI+ Sbjct: 293 TRGIVPIWFDKKISNRLYFYYLYKTKRIQNQIKRLSKGATLVQINLKDLRELKIIQPPIE 352 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q N I A I+ +Q + ++ + + A G+ Sbjct: 353 LQNQFANKI----ALIEQQKALAKQELQESEDLFNCLLQKAFKGE 393 >gi|295401704|ref|ZP_06811671.1| restriction modification system DNA specificity domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294976324|gb|EFG51935.1| restriction modification system DNA specificity domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 487 Score = 118 bits (296), Expect = 2e-24, Method: Composition-based stats. Identities = 73/442 (16%), Positives = 155/442 (35%), Gaps = 47/442 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P++W V +K K L V S + + S + Sbjct: 27 VPENWVWVRLKSINKDKKRNIDPRDYSEEVFELYSVPSY--DLGEPEYIKGKEIGSNKQV 84 Query: 81 FAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVL-QPKDVLPELLQGWLLSIDVT 134 + +IL K+ P + + I + + ST+++V+ + K + P+ L L + Sbjct: 85 VKENEILLCKINPRINRVWIVSNNRGKYRQLASTEWIVISENKKIYPKYLLFLLKAPYFR 144 Query: 135 QRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I + G +++ A K + P+ +PP EQ I EK+ +ID Sbjct: 145 KLITSNVSGVGGSLTRAKPKEVETYPIALPPFNEQKRIAEKVERLFAKIDEAKRLIEEVK 204 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW-------------------------VGL 227 + + ++++ L + K+S IE Sbjct: 205 GSFEFRWESILDKAFRGELTKKWRSKNSMIENADDIFKEIQKVYKKSNKKDEHEINPPYQ 264 Query: 228 VPDHWEVKPFFALVTELNRKNT------KLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 +P +W +V K + I S + ++E + E + Sbjct: 265 IPQNWRWVRLGDIVDINPPKKKLADIEDDQSCTFIPMPSVSDKTGEIENPEIRKYAEVKK 324 Query: 282 TYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMR 338 Y +I+F I ++N K ++ + G ++ + ++ + ++ + +L+R Sbjct: 325 GYTFFLENDILFAKITPCMENGKTAIMQNLINGFGFGSTEFHVIRTNPYINTKLIYYLLR 384 Query: 339 SYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S +G +Q + ++ +PP EQ I +++ + +V + KI Sbjct: 385 SKKFRMEAKKEMTGAVGQQRVPKSFLENYLFPLPPKAEQDKIVELLDKLYVK-EVEISKI 443 Query: 397 EQSIVLLKERRSSFIAAAVTGQ 418 E + R S + A G+ Sbjct: 444 ETLEGEIDSLRQSILNKAFRGE 465 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 79/204 (38%), Gaps = 5/204 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 E VP++W ++ + R S + Y L E Sbjct: 19 PEEEQPYPVPENWVWVRLKSINKDKKRNIDPRDYSEEVFELYSVPSYDLGEPEYIKGKEI 78 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM-ERGIITSAYMAVKPHG-IDSTYLAWLM 337 Q+V EI+ I+ + ++ + S R + ++ ++ + + I YL +L+ Sbjct: 79 GSNKQVVKENEILLCKINPRINRVWIVSNNRGKYRQLASTEWIVISENKKIYPKYLLFLL 138 Query: 338 RSYDLCKVFYAMGSGLRQSLKF---EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 ++ K+ + SG+ SL ++V+ P+ +PP EQ I + A+ID Sbjct: 139 KAPYFRKLITSNVSGVGGSLTRAKPKEVETYPIALPPFNEQKRIAEKVERLFAKIDEAKR 198 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 IE+ + R S + A G+ Sbjct: 199 LIEEVKGSFEFRWESILDKAFRGE 222 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 79/220 (35%), Gaps = 13/220 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP++W+ V + +N + E + +I + V TG+ + Sbjct: 264 QIPQNWRWVRLGDIVDINPPKKKLADIEDDQSCTFIPMPSVSDKTGEIENPEIRKYAEVK 323 Query: 76 STVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLP-ELLQGWL 128 + F + IL+ K+ P + + + G ST+F V++ + +L+ L Sbjct: 324 KGYTFFLENDILFAKITPCMENGKTAIMQNLINGFGFGSTEFHVIRTNPYINTKLIYYLL 383 Query: 129 LSIDVTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S + G + N P+PP AEQ I E + V+ I++ Sbjct: 384 RSKKFRMEAKKEMTGAVGQQRVPKSFLENYLFPLPPKAEQDKIVELLDKLYVKEVE-ISK 442 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 + +Q++++ L + + IE + Sbjct: 443 IETLEGEIDSLRQSILNKAFRGELGTNDPTDEHAIELLKE 482 >gi|326407944|gb|ADZ65013.1| Type I restriction-modification system specificity subunit [Lactococcus lactis subsp. lactis CV56] Length = 397 Score = 118 bits (296), Expect = 2e-24, Method: Composition-based stats. Identities = 66/399 (16%), Positives = 136/399 (34%), Gaps = 29/399 (7%) Query: 24 HWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + + R +++ + + S ++ ++ + Sbjct: 16 DWEERKLGELGSVAMNRRIFKDQTSENEEVPFFKIGTFGSKPDAFISRELF--EEYKLKY 73 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G IL G R + D +V D V + Sbjct: 74 PYPEIGDILISASGSIGRTVVYQGKDEYFQDSNIVWLKHDDRLNNKFLKQFYSIVKWQGL 133 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG+T+ K I + + IP EQ KI ++D I R ++LLKE+ Sbjct: 134 ---EGSTIKRLYNKNILDTDISIPSTIEQ----NKIGMFFEQLDDTIALHQRKLDLLKEQ 186 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL- 257 K+ + + K +++ +G D WE + + + K + + + Sbjct: 187 KKGYLQKMFPKNGEKVPELRFAG------FADDWEEHKLGDYIIQYSEKTKQNNQYPVFT 240 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S G QK + + E Y IV G +R + + + + GI++ Sbjct: 241 SSRNGLFFQKDYYKGNQIASEDNIGYNIVPRGYFTYRHMS-DDLVFKFNINDLADYGIVS 299 Query: 318 SAYMAVKPHGI-DSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + Y + +S YL + + S G R + ++ + + +P ++E Sbjct: 300 TLYPVFTTNEQLNSKYLQYQLNEGSEFRRFSLLQKQGGSRTYMYLNKLQNMILNIPKLEE 359 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I + ++D + ++ + LLKE++ F+ Sbjct: 360 QQKIGSF----FQQLDETIALHQRKLDLLKEQKKGFLQK 394 >gi|315648621|ref|ZP_07901718.1| restriction modification system DNA specificity domain protein [Paenibacillus vortex V453] gi|315276000|gb|EFU39348.1| restriction modification system DNA specificity domain protein [Paenibacillus vortex V453] Length = 389 Score = 118 bits (296), Expect = 2e-24, Method: Composition-based stats. Identities = 55/394 (13%), Positives = 122/394 (30%), Gaps = 30/394 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + L G D+ +E G Y N + S A G Sbjct: 18 WEQRKVIDIAPLQRGF------DLPVSEMEA-----GSYPVIMSNGIGAYHSKYKAKAPG 66 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ G+ G + +T V K + + +D+ G+ Sbjct: 67 -VVTGRSGTIGNLTFVEVDYWPHNTALWVTDFKRNDAKFIYYLYQKLDLK----RYGTGS 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + IP +AEQ I + +I+ R + K+ K L+ Sbjct: 122 GVPTLNRNDVHLTKASIPSVAEQKQISRIFDSLD----HIISLHQRKLNNAKKLKTGLLQ 177 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + K + +++ G ++ ++ + I L N+ Sbjct: 178 KMFPKNGDNFPEIRFPGFTDAWEKRTLADITLKIGSGKTPKGGDSSYVLEGIPFLRSQNV 237 Query: 265 IQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + + + + V +++ + ++ V + Sbjct: 238 YEDFVDLKDVAYITPQTDEEMKNSRVVKNDVLLNITGASIGRSAVYRYSVCAN-VNQHVC 296 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + G +S ++ + S G R+ L F+ + ++ L P I+EQ I Sbjct: 297 IVRPAEGYNSDFVQLNLTSPKGQGQINNNQAGGGREGLNFQQIGKMSFLFPSIEEQDQIG 356 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L ++ + LKE + +F+ Sbjct: 357 SF----FRSLDQLTTLHQRELDALKETKKAFLQK 386 >gi|229496116|ref|ZP_04389838.1| conserved hypothetical protein [Porphyromonas endodontalis ATCC 35406] gi|229317012|gb|EEN82923.1| conserved hypothetical protein [Porphyromonas endodontalis ATCC 35406] Length = 420 Score = 118 bits (296), Expect = 2e-24, Method: Composition-based stats. Identities = 67/433 (15%), Positives = 134/433 (30%), Gaps = 40/433 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSESG------KDIIYIGLEDVESGTGK 62 +KD+ IG IP+ W ++G T G +I +I ++ Sbjct: 6 FKDTE---IGQIPEEWIFSKFGDVLRTFSSGATPYRGIPGNFIGNIKWITSGELNYKPIN 62 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKD 118 + + + ++I G L G K + L + + Sbjct: 63 DTLEHISEEAVKNTNLTIHQAGTFLMAITGLEAVGTRGKCAFVGNPSTTNQSCLAINGTN 122 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAE 177 + W C+G + N+P+ P + EQ I + Sbjct: 123 KMITSYLFWFYRKYSDLLAFKYCQGTKQQSYTASIVRNLPIFHPKDIKEQSRIASAL--- 179 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 +D LI+ + IE K KQ + + L ++K WV + + Sbjct: 180 -TSVDNLISSLDKLIEKKKNIKQGTMQQL----LTGKKRLKGFSDPWV----ERKMGRMG 230 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETR---NMGLKPESYETYQIVDPGEIVFR 294 + N +++ N++ + + E G++ F Sbjct: 231 STFSGLTGKTKEDFGIGNAKYITFLNVLSNPILKRELFEEVLVREGEKQNSCHKGDLFFN 290 Query: 295 FIDLQNDKRSLRSAQVME--RGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMG 350 ++ + + E + S + + Y A+ RS + K+ + Sbjct: 291 TTSETPEEVGICAMLDTEMESLYLNSFCFGYRLNDDRVVPEYFAYYFRSNEGRKLMTLLA 350 Query: 351 SG-LRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G R ++ +L+P + EQ I NV+ I+ L K ++ + Sbjct: 351 QGVTRYNMSKSAFINAKLLMPSTVLEQKAIVNVLKGFEKEIEALEVKK----AKFEQIKQ 406 Query: 409 SFIAAAVTGQIDL 421 + +TG+I L Sbjct: 407 GMMQQLLTGKIRL 419 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 67/214 (31%), Gaps = 15/214 (7%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ---------KLETRNMG 274 +G +P+ W F ++ + T + I ++ Sbjct: 10 EIGQIPEEWIFSKFGDVLRTFSSGATPYRGIPGNFIGNIKWITSGELNYKPINDTLEHIS 69 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + I G + L+ + A V + +A+ T Sbjct: 70 EEAVKNTNLTIHQAGTFLMAITGLEAVGTRGKCAFVGNPSTTNQSCLAINGTNKMITSYL 129 Query: 335 WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVL 392 + + + G +QS V+ LP+ P IKEQ I + + D L Sbjct: 130 FWFYRKYSDLLAFKYCQGTKQQSYTASIVRNLPIFHPKDIKEQSRIASALTSV----DNL 185 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + +++ I K + + +TG+ L+G S Sbjct: 186 ISSLDKLIEKKKNIKQGTMQQLLTGKKRLKGFSD 219 >gi|312865273|ref|ZP_07725501.1| conserved hypothetical protein [Streptococcus downei F0415] gi|311099384|gb|EFQ57600.1| conserved hypothetical protein [Streptococcus downei F0415] Length = 408 Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats. Identities = 55/400 (13%), Positives = 135/400 (33%), Gaps = 27/400 (6%) Query: 25 WKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT--STVS 79 W+ + + G + ++V +G Y S + + S Sbjct: 18 WEQHKLGEVADVRDGTHDSPKYINDGYPLLTSKNVGNGYINYDDTKCISEKDYIQINKRS 77 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 IL G +G A+I L+ + + L L + +++ + Sbjct: 78 KVDVNDILMGMIGTIGNLALIRKEPDFAIKNVALIKHTINFDYQFLFQELQTNSISKELL 137 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + +G T K I ++ + +P EQ I + I R +E LK Sbjct: 138 SGMDGGTQKFIPLKKIRDLSILLPTKNEQGHIGSFFQSLDSL----IALHQRKLEELKSF 193 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K ++S + P I G + WE + + + ++ + Sbjct: 194 KATMLSKVF-----PKHGQTVPEIRLAGFDGE-WEKTKLRDVSERVQGNDGRMDLPTLTI 247 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR-SAQVMERGIIT 317 + + + + + + + + Y ++ GE+ + + + K + E ++ Sbjct: 248 SAAQGWLSQKDRFSQNIAGKEQKNYTLLKRGELSYNHGNSKLAKYGVVFELNNYEEALVP 307 Query: 318 SAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAM-GSGLRQ----SLKFEDVKRLPVLVPP 371 Y + K + + ++ + + + + SG R ++ ++D + +++P Sbjct: 308 RVYHSFKVNELANPRFIETMFATKQPDRELRKLVSSGARMDGLLNINYDDFMGISIIIPT 367 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + EQ I + +D L+ + + I L+ + + Sbjct: 368 VHEQETIGEF----FSNLDNLISETQSKIEELETLKKKLL 403 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 59/180 (32%), Gaps = 11/180 (6%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-----KPESYETYQIVDPGEIVFRF 295 V + + K I L+ N+ + VD +I+ Sbjct: 29 VRDGTHDSPKYINDGYPLLTSKNVGNGYINYDDTKCISEKDYIQINKRSKVDVNDILMGM 88 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I + +R + I A + + L + ++ M G ++ Sbjct: 89 IGTIGNLALIRK--EPDFAIKNVALIKHTINFDYQFLFQELQTNSISKELLSGMDGGTQK 146 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + ++ L +L+P EQ I + +D L+ ++ + LK +++ ++ Sbjct: 147 FIPLKKIRDLSILLPTKNEQGHIGSF----FQSLDSLIALHQRKLEELKSFKATMLSKVF 202 >gi|99078523|ref|YP_611781.1| restriction modification system DNA specificity subunit [Ruegeria sp. TM1040] gi|99035661|gb|ABF62519.1| type I restriction-modification system specificity determinant [Ruegeria sp. TM1040] Length = 417 Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats. Identities = 64/424 (15%), Positives = 128/424 (30%), Gaps = 37/424 (8%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG-NSRQ 72 G+ +G P+ W P+ RF L R D L V+ G + + + R+ Sbjct: 17 GIPKLGKTPEGWLRAPLSRF--LVEVRRPIKMADNEAYRLVTVKRARGGVVERGTLDGRE 74 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLL 129 + I G L K + + + S ++ VL + +L Sbjct: 75 ISVKSQFIVEGGDFLISKRQIVHGACGLVPQELAGSVVSNEYSVLNSNGNIDLQFLNYLA 134 Query: 130 SIDVTQ---RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 Q +I +PPL+EQ I E + I+ Sbjct: 135 HTVFFQQTCFHSSIGVHVEKMIFKLDRWLKWEFDLPPLSEQRKIVEILSTWDRAIEVAEA 194 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE--- 243 + + + Q+L++ K E+ G + + + Sbjct: 195 QLANARKQKRALMQSLLTG------------KRRFPEFEGQEWREVWLADLVSAIRGGGT 242 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQN 300 ++ NT I +S ++ + + +S G IV Sbjct: 243 PDKSNTAYWGGEIPWVSVKDLKSDVLQQTKDTITQSGLNSSAANYFPKGTIVVATRMAVG 302 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + Q+ + I A+ P + K+ + + Sbjct: 303 -----AAVQLGKGMAINQDLKAIIPGPDVRNDYLFHFMQMVQPKLEALGTGSTVKGITLG 357 Query: 361 DVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 D+ RL + +P ++EQ I +++V I + I L+ + + + +TG+ Sbjct: 358 DLHRLVIGLPATLEEQDKIVQMLDVARKDISSMCVN----IGKLRAEKKALMQQLLTGKR 413 Query: 420 DLRG 423 + G Sbjct: 414 RVTG 417 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 75/206 (36%), Gaps = 8/206 (3%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR-NMGLKP 277 GI +G P+ W P + E+ R ++ + R + + Sbjct: 15 QPGIPKLGKTPEGWLRAPLSRFLVEVRRPIKMADNEAYRLVTVKRARGGVVERGTLDGRE 74 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 S ++ IV+ G+ + + + L ++ + + ID +L +L Sbjct: 75 ISVKSQFIVEGGDFLISKRQIVHGACGLVPQELAGSVVSNEYSVLNSNGNIDLQFLNYLA 134 Query: 338 RSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + + G+ + K + + +PP+ EQ I +++ D +E Sbjct: 135 HTVFFQQTCFHSSIGVHVEKMIFKLDRWLKWEFDLPPLSEQRKIVEILSTW----DRAIE 190 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQID 420 E + ++++ + + + +TG+ Sbjct: 191 VAEAQLANARKQKRALMQSLLTGKRR 216 >gi|218439052|ref|YP_002377381.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7424] gi|218171780|gb|ACK70513.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7424] Length = 417 Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats. Identities = 65/425 (15%), Positives = 134/425 (31%), Gaps = 32/425 (7%) Query: 13 SG-VQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLP 65 SG V + +P W+ I +G T I + ++ Sbjct: 5 SGMVDNLWPLPDGWEWKKISDIATTTSGGTPSRKNSEYFTGHINWFKSGELGDSEIFNSE 64 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV----LP 121 + S+ IF K +L G + K I D + + PK + Sbjct: 65 EKITEEAIKKSSAKIFPKDTLLIAMYGATVGKLGILGIDAATNQAVCAIFPKKNLGIKIV 124 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 E + + ++ G + I N+ +PIP L + RI Sbjct: 125 EEKFLFYFFKFIRSQLIERSFGGAQPNISQTIINNVTIPIPYPNNPKLSLDIQQRIVARI 184 Query: 182 DT---LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE-WVGLVPDHWEVKPF 237 ++ I +E +++ + L+ + + S +E W K Sbjct: 185 ESLLGEIKHNRSLLEQMRQDTEQLLDSAIKE------CFALSRMETWKNHSCLGEIAKII 238 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 V + L + + +LE + + G I++ I Sbjct: 239 AKQVDPTLPQYQTLPHIGVDVIQANTC--QLEDYRTIEEDGVTSGKYLFTSGSILYSKIR 296 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QS 356 K L + + I ++V I+ +L W + S + R Sbjct: 297 PYLRKSVLVDFEGLCSADIYP--LSVISDEIEPKFLMWFLISPLFTDYAKSHSGRARIPK 354 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINV---ETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + + ++ P +EQ I + +++ E +ID L+++ E++ L+ + + Sbjct: 355 INRDALFSFKLVYPNYEEQISIISYLDLIRFEVQKIDKLLKEDEKNFNYLE---QAILEK 411 Query: 414 AVTGQ 418 A G+ Sbjct: 412 AFRGE 416 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 76/187 (40%), Gaps = 5/187 (2%) Query: 30 IKRFTKLNTGR---TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + K+ + T + + +IG++ +++ T + TS +F G I Sbjct: 231 LGEIAKIIAKQVDPTLPQYQTLPHIGVDVIQANTCQLEDYRTIEEDGVTSGKYLFTSGSI 290 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 LY K+ PYLRK+++ DF+G+CS + ++ P+ L +L+S T ++ A Sbjct: 291 LYSKIRPYLRKSVLVDFEGLCSADIYPLSVISDEIEPKFLMWFLISPLFTDYAKSHSGRA 350 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + + + P EQ+ I + + + + +QA++ Sbjct: 351 RIPKINRDALFSFKLVYPNYEEQISIISYLDLIRFEVQKIDKLLKEDEKNFNYLEQAILE 410 Query: 205 YIVTKGL 211 L Sbjct: 411 KAFRGEL 417 >gi|303253787|ref|ZP_07339922.1| hypothetical protein APP2_0973 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302647371|gb|EFL77592.1| hypothetical protein APP2_0973 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] Length = 455 Score = 117 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 61/436 (13%), Positives = 127/436 (29%), Gaps = 64/436 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W++ + +T I +GL + + L Q+ + Sbjct: 20 EIPESWEIEKLGNIIFNLGQKTPNERFFYIDVGLINNKIHKLNSLENILEPDQAPSRARK 79 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQ-PKDVLPELLQGWLLSIDVT 134 I K ILY + PYL+ I + D I ST F+V+ + + L +LLS T Sbjct: 80 IVQKNSILYSTVRPYLQNICILEQDFQYEPIASTAFVVMNVFTNFYHKYLFYYLLSPVFT 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G + + N+P+ IPPL EQ I KI I+ + + L Sbjct: 140 DFVNQEMVGVAYPAINDDKLYNLPIAIPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTAL 199 Query: 195 LKEKK----QALVSYIVTKGLNPDVKM--------------------------------- 217 ++ ++++ + L Sbjct: 200 HQQFPEQLKKSILQAAIQGKLTKQDPNDEPALVLIERIKAEKLRLIAEKKLKKPKVVSEI 259 Query: 218 ---------------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN---ILSL 259 + E +P++W + + + L Sbjct: 260 ILRDNLPYEIINGEERCIADEVPFEIPENWCWVRLGEIGETNIGLTYAPNDVVLEGTIVL 319 Query: 260 SYGNIIQKLETRNMGL--KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 GNI + + + + +++ + + + + Sbjct: 320 RSGNIQNGKIDVSSDVVRVNLNIPENKKCYKNDLLICARNGSKNLVGKAAIVDKDGYSFG 379 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + Y+ + + S F + + + ++ + +PP+ EQ Sbjct: 380 AFMAIFRSPFY--QYIYYYLSSPLFRNDFDGINTTTINQITQNNLNNRLIPLPPLNEQKR 437 Query: 378 ITNVINVETARIDVLV 393 I I + + L Sbjct: 438 IVEKIEKLFSTLQNLE 453 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 76/201 (37%), Gaps = 10/201 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQ 284 +P+ WE++ ++ L +K I N I KL + L+P+ + Sbjct: 20 EIPESWEIEKLGNIIFNLGQKTPNERFFYIDVGLINNKIHKLNSLENILEPDQAPSRARK 79 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLC 343 IV I++ + + I ++A++ + YL + + S Sbjct: 80 IVQKNSILYSTVRPYLQNICILEQDFQYEPIASTAFVVMNVFTNFYHKYLFYYLLSPVFT 139 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G+ ++ + + LP+ +PP+ EQ I I I+ + E+ + Sbjct: 140 DFVNQEMVGVAYPAINDDKLYNLPIAIPPLNEQKRIVAKIEELLPYIEQY-AEKEEKLTA 198 Query: 403 L-----KERRSSFIAAAVTGQ 418 L ++ + S + AA+ G+ Sbjct: 199 LHQQFPEQLKKSILQAAIQGK 219 >gi|315172561|gb|EFU16578.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX1346] Length = 402 Score = 117 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 58/400 (14%), Positives = 138/400 (34%), Gaps = 27/400 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + K + ++ + + + S ++G + I Sbjct: 17 DWEERKLGDLLKEFSIKSKIEDEH------KVLSSTNSGMEFREGRVSGTSNLGYKIIKN 70 Query: 84 GQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G ++ +L I + G+ S + + ++ + L + + + + Sbjct: 71 GDLVLSPQNLWLGNININNIGKGLVSPSYKTFEFINIDSSFINPQLRTQKMLEEYKNSST 130 Query: 143 GAT---MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + I + +P ++EQ I +ID I R ++LLKE+K Sbjct: 131 QGASVVRRNLEIDSFYQIKIFVPTISEQEKIGSF----FKQIDDTIDLHQRKLDLLKEQK 186 Query: 200 QALVSYIVTKGLNPDVKMKDSGI--EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + + K +++ +G +W + + K+ ++ Sbjct: 187 KGFLQKMFPKNGAKVPELRFAGFADDWEERKLSDVANHRGGTAIEKYFDKDGVYK---VI 243 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF----RFIDLQNDKRSLRSAQVMER 313 S+ + + +N+ ++V+ GE+ + + RSL Q E Sbjct: 244 SIGSYGLNSQYVDQNIRAISNEITDGRVVNSGELTMVLNDKTANGTIIGRSLLVEQDNEY 303 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 I + DS + ++ KV + G + + + V L + +P I+ Sbjct: 304 VINQRTEIISPKETFDSNFAYTILNGSFREKVKRIVQGGTQIYVNYSAVSNLSLELPKIE 363 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I + ++D + ++ + LLKE++ F+ Sbjct: 364 EQQKIGSF----FKQLDNTIALHQRKLDLLKEQKKGFLQK 399 >gi|229550756|ref|ZP_04439481.1| possible type IC specificity subunit protein [Lactobacillus rhamnosus LMS2-1] gi|229315867|gb|EEN81840.1| possible type IC specificity subunit protein [Lactobacillus rhamnosus LMS2-1] Length = 412 Score = 117 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 45/402 (11%), Positives = 129/402 (32%), Gaps = 23/402 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIG---LEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ + + TG + ++ +D + + + + + S Sbjct: 19 SWEQRKLGKMGYTFTGLSGKTKEDFGHGNAKFVTYMNVFSSPVSNSEMVENVEVDSKQHQ 78 Query: 81 FAKGQILYGKLGPYLRKAIIADFD------GICSTQFLVLQPKDVLPELLQGWL-LSIDV 133 G + + ++ ++ ++ P ++ S + Sbjct: 79 VEYGDVFFTTSSETPQEVGMSSVWLETAENIYLNSFCFGYHPMVEFDPYYLAFMLRSPVI 138 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ + +G + + + + +P+P + EQ I ++D IT R + Sbjct: 139 RKKFMLLAQGISRYNISKNKVMEMLVPVPEIVEQQKIGSF----FKQLDDTITLHQRKLA 194 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LKE KQ + + + + +++ +G + ++ + + K E Sbjct: 195 KLKELKQGYLQKLFPENGSKFPQLRFAG---FADAWEQRKLSDGTNKIGDGLHGTPKYSE 251 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV-ME 312 + GN + + M Q D + I + + A E Sbjct: 252 DGEVYFVNGNNLVNGQIVIMPETKTVTSNEQSKDDKALNESTILMSINGTIGNLAWYRGE 311 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPP 371 ++ + ++ D ++ +++ + ++ ++L + ++ + P Sbjct: 312 NLMLGKSAAYIEVSDFDKKFIYAYLQTRPVKDYYLNSLTGTTIKNLGLKAIRNTNICTPT 371 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I EQ I V +D + ++ + L+E + ++ Sbjct: 372 IDEQAKIG----VLFQNLDKTITLHQRKLEKLQELKKGYLQK 409 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 67/175 (38%), Gaps = 9/175 (5%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKP-ESYETYQIVDPGEIVFRFIDLQNDKRS 304 + N ++Y N+ + + ++ E V+ G++ F + Sbjct: 38 KTKEDFGHGNAKFVTYMNVFSSPVSNSEMVENVEVDSKQHQVEYGDVFFTTSSETPQEVG 97 Query: 305 LRSAQVM--ERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + S + E + S P D YLA+++RS + K F + G+ R ++ Sbjct: 98 MSSVWLETAENIYLNSFCFGYHPMVEFDPYYLAFMLRSPVIRKKFMLLAQGISRYNISKN 157 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 V + V VP I EQ I + ++D + ++ + LKE + ++ Sbjct: 158 KVMEMLVPVPEIVEQQKIGSF----FKQLDDTITLHQRKLAKLKELKQGYLQKLF 208 >gi|304383195|ref|ZP_07365668.1| type I restriction enzyme EcoAI specificity protein [Prevotella marshii DSM 16973] gi|304335666|gb|EFM01923.1| type I restriction enzyme EcoAI specificity protein [Prevotella marshii DSM 16973] Length = 420 Score = 117 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 50/412 (12%), Positives = 120/412 (29%), Gaps = 37/412 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----------GKDIIYIGLEDVESGTGKYLPKDGN 69 +P+ W + G S ++ K + Sbjct: 11 EVPQGWVWCKLDDLAFYKKGPFGSSLTKSMFVLKGDNTYKVYEQKNAIQKNEKLGTYYIS 70 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGW 127 + I+ G ++ GI + ++++ + E Sbjct: 71 KEKYQELIAFAIQPFDIIVSCAGTIGETFVLPQEPMEGIINQALMLVRLYNRDIEKFYLL 130 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGI-GNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + +G + + + N +P+PP +EQ I +I IDT+ Sbjct: 131 YFDYILKEEAYKESKGTAIKNIPPFDVLKNFYIPLPPFSEQQRIVAEIERWFALIDTIEQ 190 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG----------------LVPD 230 ++ +K+ K ++ + L P + IE V +P Sbjct: 191 GKVELQTAIKQTKSKILDLAIHGKLVPQDPNDEPAIELVRRINPKAQITCDNGHSRKLPQ 250 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVDP 288 W + + + + + + +++ + +K + + + Sbjct: 251 SWTWVKGKNIFAPMKSTKPTNEKFQYIDIDSIDNKRQIISEVKTIKTVNAPSRANRYTQK 310 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMRSYDLCKVF 346 ++VF + + + + I ++ + P + Y +LM S ++ Sbjct: 311 NDVVFSMVRPYLRNIAKVTN---DNCIASTGFYVCSSIPQILHPDYCYYLMISDNVVNGL 367 Query: 347 YAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 G S+ + +PP+ EQ I I +D + +E Sbjct: 368 NQFMKGDNSPSINKGHIDEWLFPLPPLAEQQRIVQKIEKMFFILDDIQNALE 419 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 72/216 (33%), Gaps = 16/216 (7%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M + VP W L + ++ L N + E +N K Sbjct: 1 MHHYEQDVPFEVPQGWVWCKLDDLAFYKKGPFGSSLTKSMFVLKGDNTYKVYEQKNAIQK 60 Query: 277 PESYETYQIVDPG------------EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 E TY I +I+ + + Q GII A M V+ Sbjct: 61 NEKLGTYYISKEKYQELIAFAIQPFDIIVSCAGTIGE--TFVLPQEPMEGIINQALMLVR 118 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLK-FEDVKRLPVLVPPIKEQFDITNVI 382 + D L Y L + Y G +++ F+ +K + +PP EQ I I Sbjct: 119 LYNRDIEKFYLLYFDYILKEEAYKESKGTAIKNIPPFDVLKNFYIPLPPFSEQQRIVAEI 178 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 A ID + + + +K+ +S + A+ G+ Sbjct: 179 ERWFALIDTIEQGKVELQTAIKQTKSKILDLAIHGK 214 >gi|167461699|ref|ZP_02326788.1| putative type I restriction enzyme specificity subunit [Paenibacillus larvae subsp. larvae BRL-230010] gi|322384020|ref|ZP_08057748.1| hypothetical protein PL1_2076 [Paenibacillus larvae subsp. larvae B-3650] gi|321151387|gb|EFX44576.1| hypothetical protein PL1_2076 [Paenibacillus larvae subsp. larvae B-3650] Length = 410 Score = 117 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 67/387 (17%), Positives = 151/387 (39%), Gaps = 26/387 (6%) Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKA------IIADFDGI 106 + ++ +G+ + + F + IL+ K+ P + + + G Sbjct: 1 MSSIDPVSGQITFIKEREFSKVSKGYTYFQENDILFAKITPCMENGNTVIAKGMLNKFGF 60 Query: 107 CSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAICEG-ATMSHADWKGIGNIPMPIPPL 164 ST+F VL+P +++ + L S + +A+ G K + + P+ +PPL Sbjct: 61 GSTEFYVLRPSNIVEGRFIYYLLRSEKFRKEAKAVMSGAVGQQRVPKKFLIDYPLCLPPL 120 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 EQ I +KI + ++D E + ++ A++ L + ++ I Sbjct: 121 NEQKRIADKIESLFAKMDIAKRLIDEAKESFELRRAAILDKAFRGELTKEWRLSQVEILP 180 Query: 225 VGLV--PDHWEVKPFFALVTELNRK------NTKLIESNILSLSYGNIIQKLETRNMGLK 276 P W+ +V R+ + + + + + I +E + Sbjct: 181 NLETKIPYGWKHVILSDVVQVNPRRTKLQHISDEQECTFVPMGAVSEISGTIEEPEVKSF 240 Query: 277 PESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYL 333 + Y + +I+F I ++N K +L S + G ++ + ++ ++ Y+ Sbjct: 241 VIVKKGYTYFEENDIIFAKITPCMENGKTALASKLINGFGFGSTEFHVIRAKQHINNKYI 300 Query: 334 AWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 +L+RS +G +Q + ++ +PP++EQ I +++ + D Sbjct: 301 YFLLRSSKFRYEAKMHMTGAVGQQRVPKSFLENYKFQLPPVEEQAKIVDLLEKIYDKEDK 360 Query: 392 L--VEKIEQSIVLLKERRSSFIAAAVT 416 +E++E+SI LL + S + A Sbjct: 361 ALVIEQLEESIKLL---KQSIVQKAFR 384 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 71/156 (45%), Gaps = 5/156 (3%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + + Y +I+F I ++N + + + G ++ + ++P Sbjct: 11 ITFIKEREFSKVSKGYTYFQENDILFAKITPCMENGNTVIAKGMLNKFGFGSTEFYVLRP 70 Query: 326 -HGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + ++ ++ +L+RS K A+ SG +Q + + + P+ +PP+ EQ I + I Sbjct: 71 SNIVEGRFIYYLLRSEKFRKEAKAVMSGAVGQQRVPKKFLIDYPLCLPPLNEQKRIADKI 130 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 A++D+ I+++ + RR++ + A G+ Sbjct: 131 ESLFAKMDIAKRLIDEAKESFELRRAAILDKAFRGE 166 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 80/221 (36%), Gaps = 13/221 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTS----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP WK V + ++N RT ++ ++ + V +G + S Sbjct: 185 KIPYGWKHVILSDVVQVNPRRTKLQHISDEQECTFVPMGAVSEISGTIEEPEVKSFVIVK 244 Query: 76 STVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVL-PELLQGWL 128 + F + I++ K+ P + + + G ST+F V++ K + + + L Sbjct: 245 KGYTYFEENDIIFAKITPCMENGKTALASKLINGFGFGSTEFHVIRAKQHINNKYIYFLL 304 Query: 129 LSIDVTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S + G + N +PP+ EQ I + + + D Sbjct: 305 RSSKFRYEAKMHMTGAVGQQRVPKSFLENYKFQLPPVEEQAKIVDLLEKIYDKEDKA-LV 363 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 + E +K KQ++V + L + ++S I+ + Sbjct: 364 IEQLEESIKLLKQSIVQKAFRRELGTNDSTEESAIQLLKET 404 >gi|307155045|ref|YP_003890429.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 7822] gi|306985273|gb|ADN17154.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7822] Length = 397 Score = 117 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 61/409 (14%), Positives = 128/409 (31%), Gaps = 28/409 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 ++W +VP+ + + + + Y + G G + + S Sbjct: 3 QNWDLVPLGEIL-IKSNTWIQIEANKKYKQITVKYWGKGVVERNEVIGTEIAASQRLQVR 61 Query: 83 KGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRI 137 GQ + ++ + I + F V +LP L + + Sbjct: 62 SGQFIVSRIDARHGSFGLIPDCLNGAIVTNDFPVFNLNINRILPHFLNWMSKTPTFIELC 121 Query: 138 EAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + EG T ++ +P+P L EQ I KI +I+ + I + Sbjct: 122 KVASEGTTNRIRLKEDKFLSMKIPLPKLEEQQRIIAKIEELVAKIEEARGLKEAGIRECE 181 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 A + + T N K G + + T + I Sbjct: 182 MLINAEIYNLFTICKNTHWANKKLGDIVIDD--------------CYGTSEKTHDYKVGI 227 Query: 257 LSLSYGNIIQKLETRNMGL---KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 L GNI + + E + I+ G+I+ + + Sbjct: 228 PILRMGNIQNGILDVSELKYLDIHEKNKDKLILQKGDILVNRTNSAELVGKCAVFNLKGE 287 Query: 314 GIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLV 369 S + ++ + T +A + S + ++ + +K LP+++ Sbjct: 288 YGFASYIIRLRLDKAQANPTLIAMYINSSLGRTYMFNERKQMTGQANINAKKLKALPIIL 347 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PP+ EQ +I ++ +ID + ++S+ L + + A G+ Sbjct: 348 PPLSEQQEIVTYLDNLQTQIDEMKRLRQESLKELNALLPAILDKAFKGE 396 >gi|254456858|ref|ZP_05070286.1| type I restriction-modification system, S subunit [Campylobacterales bacterium GD 1] gi|207085650|gb|EDZ62934.1| type I restriction-modification system, S subunit [Campylobacterales bacterium GD 1] Length = 365 Score = 117 bits (294), Expect = 3e-24, Method: Composition-based stats. Identities = 51/391 (13%), Positives = 110/391 (28%), Gaps = 31/391 (7%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 +L +T S G+Y N + + Q+L Sbjct: 2 GELCELYQPKTISSKDMC----------EDGQYPVFGANGIIGKYDKYNH-EEPQLLITC 50 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHAD 150 G I++ + +V++P D + L + I A Sbjct: 51 RGATCGSVNISEPQSWINGNAMVVRPIDDSLHIKFVEYLFRGGIDISKTITGAAQPQITR 110 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 + EQ I + I T + ++ KE ++ + + Sbjct: 111 QSLSPILISFPQSFPEQQRIVAILDEAFEAIAKAKTNAEQNLKNAKELFESYLQSVFENK 170 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 + G E +K+ ++ + + + L Sbjct: 171 GD-------------GWEEKTLEDVCKITSKLIDPKKSEFQNLVHVGAGNIESQKGTLID 217 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGI 328 + + D I++ I K +G+ ++ + P + + Sbjct: 218 LKTAKEENLISGKFLFDESMILYSKIRPYLMKV----VNCNFKGLCSADIYPLWPFDNKM 273 Query: 329 DSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 +L L+ S + + + E + +PP+ EQ I +N +A Sbjct: 274 QKDFLYHLLLSKNFTEYAILGSQRAGMPKVNREHLFSYRFYLPPLSEQEQIVQKLNALSA 333 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L +++I L+E + S + A G+ Sbjct: 334 ETKRLETIYQKNIEDLEELKKSILQKAFNGE 364 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 45/193 (23%), Positives = 85/193 (44%), Gaps = 5/193 (2%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDI---IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ ++ K+ + + +++G ++ES G + ++ S + Sbjct: 173 GWEEKTLEDVCKITSKLIDPKKSEFQNLVHVGAGNIESQKGTLIDLKTAKEENLISGKFL 232 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIE 138 F + ILY K+ PYL K + +F G+CS L P + + L LLS + T+ Sbjct: 233 FDESMILYSKIRPYLMKVVNCNFKGLCSADIYPLWPFDNKMQKDFLYHLLLSKNFTEYAI 292 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + A M + + + + +PPL+EQ I +K+ A + L T + IE L+E Sbjct: 293 LGSQRAGMPKVNREHLFSYRFYLPPLSEQEQIVQKLNALSAETKRLETIYQKNIEDLEEL 352 Query: 199 KQALVSYIVTKGL 211 K++++ L Sbjct: 353 KKSILQKAFNGEL 365 >gi|158333868|ref|YP_001515040.1| type I restriction-modification enzyme S subunit [Acaryochloris marina MBIC11017] gi|158304109|gb|ABW25726.1| type I restriction-modification enzyme S subunit [Acaryochloris marina MBIC11017] Length = 382 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 59/402 (14%), Positives = 122/402 (30%), Gaps = 34/402 (8%) Query: 29 PIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +K + G T K +I +I D+ S Sbjct: 2 KLKEVCRFLNGGTPSKKKPEYFEGEIPWITGADINGPIVNSARSYITEEAILNSATKRVP 61 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 +L + K ++ + S L P ++ ++ Sbjct: 62 PNTVLLVT-RTSVGKVAVSGMELCYSQDITSLWPDLEKLDIYYLTHFLRSRETYLKGQSR 120 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GAT+ + N+ + +PP+AEQ I + A LL Q+ Sbjct: 121 GATIKGVTKGVLENLSLHLPPIAEQKRIAGILDAADALRVKRRDAISTLDALL----QST 176 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + + S +E +T+ K K ES I LS Sbjct: 177 FLTLFGDPITNPMGWDASDLE------------AVSEKITDGTHKTPKYTESGIEFLSAK 224 Query: 263 NIIQKLETRNMGLKPESYETYQIV-----DPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 +I N G E ++ + G+++ ++ + Sbjct: 225 DIKNGSIKWNTGKFISEDEHKSLITRCHPEIGDVLLAKSGSLGS-VAIIDRDHEFSLFES 283 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQF 376 + I++ +L ++ S + + G+ + L D+++L +L+PP+ +Q Sbjct: 284 LCLIKHNRQKIEAQFLTAMLESPRMQMHLLSRNKGISIKHLHLTDIRKLKILLPPLDKQR 343 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ A I+ + + L +S + A G+ Sbjct: 344 KFATIV----ASIEKQKAQQCAHLAELDTLFASLQSRAFNGE 381 >gi|83776722|gb|ABC46684.1| Sau1hsdS1 [Staphylococcus aureus] Length = 389 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 52/398 (13%), Positives = 108/398 (27%), Gaps = 39/398 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + +G T K DI +I D+ + + + + + S+ Sbjct: 20 EWEEKKLGEVGTFTSGGTPLKSKSEYWNGDIPWITTGDIHNIKRENITNFITEKGLNESS 79 Query: 78 VSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + IL G + I +F+ + + Q + + + + Sbjct: 80 AKLITNEAILIAMYGQGKTRGMSAILNFEATTNQACAIYQTNQNIN---FVFQYFQKLYE 136 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + ++ + + + I + P EQ I + +I+ + + Sbjct: 137 FLRSLSNEGSQKNLSLSLLKEITLNYPNEQEQKKIGDFFSKLDRQIELEEQKLELLQQQK 196 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K Q + S + + G WE + K Sbjct: 197 KGYMQKIFSQELRFK------------DENGNDYPEWEETTIKEIAQINXGKKDTK---- 240 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + I P Y+ GE + D + + Sbjct: 241 -------DAITNGSYDFYVRSPIVYKINTFSYEGEAILTVGDGVGVGKVF-HYVNGKFDY 292 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 Y L + L + S++ + + + V P EQ Sbjct: 293 HQRVYKISDFKNYYGLLLFYYFSQNFLKETKKYSAKTSVDSVRKDMIANMKVPRPIYIEQ 352 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I I R+D + +Q I LLK+R+ + + Sbjct: 353 KKIGQFI----KRVDNKTKIQKQVIELLKQRKKALLQK 386 >gi|261837923|gb|ACX97689.1| specificity subunit S of type I restriction-modification system [Helicobacter pylori 51] Length = 422 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 52/403 (12%), Positives = 132/403 (32%), Gaps = 29/403 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKKNTNVSGFASVDMTAFKRYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 192 IELLKEKKQAL------VSYIVTKGLNPDVKMKDSGIEWVGLV--PDHWEVKPFFALVTE 243 ++ K++ Q + + K ++ + P E + + Sbjct: 192 LKARKKQYQYYQNMLLDFKDANQNHKDATMSAKTYRLKSLLQTLAPKGVEFRKLGEVCEI 251 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 L+ + + ++ Y + + + + G ++ D Sbjct: 252 LDNRRIPIAKNKRNPGIYPYYGANGIQDYIDSYIFDGDFVLVGEDGSVI------NKDNT 305 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + + + A++ + + +L + +++ D+ +G + E++K Sbjct: 306 PVVNWASGKIWVNNHAHVLQTKNELKLKFLYFYLQTIDV----SYCVAGTPPKINQENLK 361 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 ++ + +PP++ Q +I +++ +A L+ I I K++ Sbjct: 362 KITIPIPPLEIQQEIVKILDQFSALTTDLLAGIPAEIKARKKQ 404 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 56/191 (29%), Gaps = 17/191 (8%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG---------LKPES 279 P E K + N + + R G + P++ Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + ++ I+ + L + + +++ K + + + + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLIVDSLANQRFT---FLSKKANCDLALDMKFFFYQ 129 Query: 340 YDLCKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 L + + S+ KR +PP++ Q +I +++ T L ++ Sbjct: 130 CFLLGEWCKKNTNVSGFASVDMTAFKRYKFPIPPLEIQQEIVKILDAFTELNTELNTELN 189 Query: 398 QSIVLLKERRS 408 LK R+ Sbjct: 190 TE---LKARKK 197 >gi|242372573|ref|ZP_04818147.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus epidermidis M23864:W1] gi|242349790|gb|EES41391.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus epidermidis M23864:W1] Length = 406 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 62/403 (15%), Positives = 145/403 (35%), Gaps = 32/403 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ IK K+ +G T K +I ++ D+ + ++ +++ Sbjct: 19 WESTKIKNIFKVVSGSTPLRSKTEYYNNGNIPWVKTTDLNNRLLSKTSENITELALNSNN 78 Query: 78 VSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + + K +L G + + + I + + + L + + L+ +V + Sbjct: 79 LKLLPKQTLLIAMYGGFNQIGRTAILNMEATTNQAISALISNNNVNTKFLQSYLNFNVNK 138 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + K I N +P + EQ I + +I+ + + Sbjct: 139 WKRYAASSRKDPNITKKDIENFIVPFTNIIEQNKIGDFFSKLDRQIELEEEKLGLLQQYK 198 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K+ L+S + N G +W + L+ E+N K + Sbjct: 199 KKYTNKLLSQEIRFKNN------------NGYNYPNWNEEKLGNLIDEVNEKTILNNQYP 246 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 +LS + ++ + E + + + Y+I+ ++V +L ++ Q + GI Sbjct: 247 LLSSTKNGLLTQEEYFKKQIGSKENKGYKILRLNQLVLSPQNLWL--GNINLNQRFDIGI 304 Query: 316 ITSAYMAVKPHGIDSTYL-AWLMRSYDLC----KVFYAMGSGLRQSLKFEDVKRLPVLVP 370 ++ +Y + + +++S + S +R++L + + V +P Sbjct: 305 VSPSYRIYNLNQRFNINFAKTVLKSPRYIYAYAQASEQGASVVRRNLNLDLFYSIKVSLP 364 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I+EQ I+ ++ + L+EK + LL R+ F+ Sbjct: 365 CIEEQNKISAFLDG----FENLIEKQYSKVDLLNHRKQGFLQK 403 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 71/209 (33%), Gaps = 8/209 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 +++ E + + T L K NI + ++ +L ++ Sbjct: 8 PELRFPEFEDKWESTKIKNIFKVVSGSTPLRSKTEYYNNGNIPWVKTTDLNNRLLSKTSE 67 Query: 275 LKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 E + +++ ++ N + E + + + +++ Sbjct: 68 NITELALNSNNLKLLPKQTLLIAMYGGFNQIGRTAILNM-EATTNQAISALISNNNVNTK 126 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 +L + YA S ++ +D++ V I EQ I + +++D Sbjct: 127 FLQSYLNFNVNKWKRYAASSRKDPNITKKDIENFIVPFTNIIEQNKIGDF----FSKLDR 182 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQID 420 +E E+ + LL++ + + ++ +I Sbjct: 183 QIELEEEKLGLLQQYKKKYTNKLLSQEIR 211 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 49/186 (26%), Gaps = 8/186 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +W + +T + + + ++ +Y K + I Sbjct: 222 NWNEEKLGNLIDEVNEKTILNNQYPLLSSTKNGLLTQEEYFKKQI--GSKENKGYKILRL 279 Query: 84 GQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLP----ELLQGWLLSIDVTQRI 137 Q++ +L + GI S + + + + I + Sbjct: 280 NQLVLSPQNLWLGNINLNQRFDIGIVSPSYRIYNLNQRFNINFAKTVLKSPRYIYAYAQA 339 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + +I + +P + EQ I + I+ ++ + Sbjct: 340 SEQGASVVRRNLNLDLFYSIKVSLPCIEEQNKISAFLDGFENLIEKQYSKVDLLNHRKQG 399 Query: 198 KKQALV 203 Q + Sbjct: 400 FLQKMF 405 >gi|300087358|ref|YP_003757880.1| restriction modification system DNA specificity domain-containing protein [Dehalogenimonas lykanthroporepellens BL-DC-9] gi|299527091|gb|ADJ25559.1| restriction modification system DNA specificity domain protein [Dehalogenimonas lykanthroporepellens BL-DC-9] Length = 411 Score = 117 bits (293), Expect = 4e-24, Method: Composition-based stats. Identities = 49/407 (12%), Positives = 114/407 (28%), Gaps = 35/407 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P + + ++ G T + + +ED+ G+ L Sbjct: 18 PDGVEYFELHDLFEIKNGYTPSKNSLEYWKNGTLPWFRMEDIR-KNGRILSDSIQHITEK 76 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP--ELLQGWLLSID 132 ++ I+ A+I + + + L + Sbjct: 77 AVKGKLYPAYSIIMATTATIGEHALIIADSLANQQFTFLTRKVNRLDCLNPKFVYYYCFL 136 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + + D P+PPL Q I + + T L E Sbjct: 137 LGEWCRNNTNISGFASVDMGKFKKYKFPVPPLPIQEEIVKILDTFTTLEAELEAELEARK 196 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTK 250 + + ++ L++ +EW +G V Sbjct: 197 KQYEYYREELLT-------------FGDDVEWKTLGEVGTLIRGNGLQKKDFVEEGVGCI 243 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 K + PE + V+ G++V D A + Sbjct: 244 HYGQVYTYYGTSTNATK-----SFVSPELANILKKVNKGDLVITSTSENIDDVCKAVAWL 298 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLV 369 E I+T + + H + YL++ ++ + G + + D+ ++ + + Sbjct: 299 GEDEIVTGGHATILKHHENPKYLSYYFQTTSFSEQKRKYAKGTKVIDVSGSDLAKIKIPI 358 Query: 370 PPIKEQFDITNVINVETARIDVLV----EKIEQSIVLLKERRSSFIA 412 PP EQ I ++++ A ++ + ++ + R + Sbjct: 359 PPSAEQERIVSILDKFDALVNDISVGLPAELNARRKQYEYYREKLLT 405 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 43/169 (25%), Gaps = 11/169 (6%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK--------PES 279 PD E L N + + R G E Sbjct: 17 CPDGVEYFELHDLFEIKNGYTPSKNSLEYWKNGTLPWFRMEDIRKNGRILSDSIQHITEK 76 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL-MR 338 ++ I+ + + + + + + ++ ++ + Sbjct: 77 AVKGKLYPAYSIIMATTATIGEHALIIADSLANQQFTFLTRKVNRLDCLNPKFVYYYCFL 136 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + C+ S+ K+ VPP+ Q +I +++ T Sbjct: 137 LGEWCR--NNTNISGFASVDMGKFKKYKFPVPPLPIQEEIVKILDTFTT 183 >gi|15927381|ref|NP_374914.1| specificity determinant HsdS [Staphylococcus aureus subsp. aureus N315] gi|148268279|ref|YP_001247222.1| restriction modification system DNA specificity subunit [Staphylococcus aureus subsp. aureus JH9] gi|150394344|ref|YP_001317019.1| restriction modification system DNA specificity subunit [Staphylococcus aureus subsp. aureus JH1] gi|257794184|ref|ZP_05643163.1| specificity determinant HsdS [Staphylococcus aureus A9781] gi|258415888|ref|ZP_05682159.1| specificity determinant HsdS [Staphylococcus aureus A9763] gi|258420717|ref|ZP_05683656.1| specificity determinant HsdS [Staphylococcus aureus A9719] gi|258438382|ref|ZP_05689666.1| specificity determinant HsdS [Staphylococcus aureus A9299] gi|258443826|ref|ZP_05692165.1| specificity determinant HsdS [Staphylococcus aureus A8115] gi|258446037|ref|ZP_05694213.1| specificity determinant HsdS [Staphylococcus aureus A6300] gi|258448235|ref|ZP_05696362.1| specificity determinant HsdS [Staphylococcus aureus A6224] gi|258454236|ref|ZP_05702207.1| specificity determinant HsdS [Staphylococcus aureus A5937] gi|269203440|ref|YP_003282709.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus ED98] gi|282893295|ref|ZP_06301529.1| type I restriction enzyme, S subunit [Staphylococcus aureus A8117] gi|282928536|ref|ZP_06336135.1| type I restriction enzyme, S subunit [Staphylococcus aureus A10102] gi|295406112|ref|ZP_06815920.1| type I restriction enzyme [Staphylococcus aureus A8819] gi|297244964|ref|ZP_06928841.1| type I restriction enzyme [Staphylococcus aureus A8796] gi|13701600|dbj|BAB42893.1| probable specificity determinant HsdS [Staphylococcus aureus subsp. aureus N315] gi|147741348|gb|ABQ49646.1| restriction modification system DNA specificity domain [Staphylococcus aureus subsp. aureus JH9] gi|149946796|gb|ABR52732.1| restriction modification system DNA specificity domain [Staphylococcus aureus subsp. aureus JH1] gi|257788156|gb|EEV26496.1| specificity determinant HsdS [Staphylococcus aureus A9781] gi|257839481|gb|EEV63954.1| specificity determinant HsdS [Staphylococcus aureus A9763] gi|257843321|gb|EEV67731.1| specificity determinant HsdS [Staphylococcus aureus A9719] gi|257848426|gb|EEV72417.1| specificity determinant HsdS [Staphylococcus aureus A9299] gi|257851232|gb|EEV75175.1| specificity determinant HsdS [Staphylococcus aureus A8115] gi|257855279|gb|EEV78218.1| specificity determinant HsdS [Staphylococcus aureus A6300] gi|257858474|gb|EEV81350.1| specificity determinant HsdS [Staphylococcus aureus A6224] gi|257863688|gb|EEV86445.1| specificity determinant HsdS [Staphylococcus aureus A5937] gi|262075730|gb|ACY11703.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus ED98] gi|282589745|gb|EFB94830.1| type I restriction enzyme, S subunit [Staphylococcus aureus A10102] gi|282764613|gb|EFC04739.1| type I restriction enzyme, S subunit [Staphylococcus aureus A8117] gi|285817486|gb|ADC37973.1| Type I restriction-modification system, specificity subunit S [Staphylococcus aureus 04-02981] gi|294969109|gb|EFG45130.1| type I restriction enzyme [Staphylococcus aureus A8819] gi|297178044|gb|EFH37292.1| type I restriction enzyme [Staphylococcus aureus A8796] gi|312830178|emb|CBX35020.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus ECT-R 2] gi|315130554|gb|EFT86540.1| restriction modification system DNA specificity domain [Staphylococcus aureus subsp. aureus CGS03] gi|329727355|gb|EGG63811.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus 21172] Length = 409 Score = 117 bits (293), Expect = 4e-24, Method: Composition-based stats. Identities = 66/402 (16%), Positives = 142/402 (35%), Gaps = 27/402 (6%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 20 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGQFFSKLDQQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I ++ L + G W K ++ N++ Sbjct: 197 LELLQQQKKCYIQKIFSQEL--------RFKDEEGNYYKGWNKKQLKDVLEFSNKRTINE 248 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 E +L+ S +I + + + P + + ++ Sbjct: 249 NEYPVLTSSRQGLILQSDYYKDRKTFAESNIGYFILPKNHITYRSRSDDGIFKFNLNLMI 308 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + GII+ Y K + YL + + + L +D++ + +P Sbjct: 309 DVGIISKYYPVFKGIDANQYYLTLHLNYQLKKEYIKYATGTSQLVLSQKDLQNIKTKLPS 368 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I + + ID LVEK + LK R+ + Sbjct: 369 YEEQQKIGDF----FSEIDRLVEKQSSKVGRLKVRKKELLQK 406 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 50/185 (27%), Gaps = 14/185 (7%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E + + I L NI + Sbjct: 10 PELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKK---- 125 Query: 331 TYLAWLMRSYDL-----CKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINV 384 Y Y L K+F A G R+ L F+++ L + P I +EQ I + Sbjct: 126 EYYYNFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGQFFSK 185 Query: 385 ETARI 389 +I Sbjct: 186 LDQQI 190 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 53/184 (28%), Gaps = 5/184 (2%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W +K + + RT + + Y KD + I Sbjct: 227 KGWNKKQLKDVLEFSNKRTINENEYPVLTSSRQGLILQSDY-YKDRKTFAESNIGYFILP 285 Query: 83 KGQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K I Y G + + GI S ++ + + L+ + + Sbjct: 286 KNHITYRSRSDDGIFKFNLNLMIDVGIIS-KYYPVFKGIDANQYYLTLHLNYQLKKEYIK 344 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + K + NI +P EQ I + ++ ++ R KE Sbjct: 345 YATGTSQLVLSQKDLQNIKTKLPSYEEQQKIGDFFSEIDRLVEKQSSKVGRLKVRKKELL 404 Query: 200 QALV 203 Q + Sbjct: 405 QKMF 408 >gi|281358279|ref|ZP_06244762.1| restriction modification system DNA specificity domain protein [Victivallis vadensis ATCC BAA-548] gi|281315369|gb|EFA99399.1| restriction modification system DNA specificity domain protein [Victivallis vadensis ATCC BAA-548] Length = 414 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 55/417 (13%), Positives = 120/417 (28%), Gaps = 30/417 (7%) Query: 26 KVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY--LPKDGNSRQSDTST 77 K + + G T ++ G DI ++ + D + K + S+ Sbjct: 2 KEYKLSELADIIGGGTPKTSRSDYWGGDIPWLSVVDFNNDFRHVFTTEKTITEAGLNNSS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 I G+I+ G A +A + L+ K + + L + I Sbjct: 62 TRILYPGEIIISARGTVGALAQVAKEMA-FNQSCYGLRAKFGITCNDYLFYLLRHSIETI 120 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G+ +I + +P L Q I + D I + L+E Sbjct: 121 KKNTHGSVFDTITRDTFESISVILPDLKTQQKIASIL----ASFDDKIELNTQINHNLEE 176 Query: 198 KKQALV-SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + +A+ S+ V D + ++ ++ R ++S I Sbjct: 177 QAKAIFKSWFVDFEPFADDVFTNEDPVEHPASLSMVQIANIEHILETGKRPKGGAVKSGI 236 Query: 257 LSLSYGNI-----IQKLETRNMGLKPESYETYQIVDPGEIVFRFID-----LQNDKRSLR 306 S+ N+ + + + ++ E++ Sbjct: 237 PSIGAENVKKLGVFDYSSGKFIPREFADSMKRGKINGYELLIYKDGGKPGTFIPHFSMFG 296 Query: 307 SAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 E I G ++ + Y + G + ED++ Sbjct: 297 EGYPYEECYINEHVFKLDFGNKGFNAFAYFYFQTDYPYSWLANNGGKAAIPGINQEDIRS 356 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + P Q + + I + K L + R + + ++G+ID+ Sbjct: 357 IFIFDP----QHPKVKEFSAYVSPIFTTIMKNCLENKKLAQLRDALLPKLMSGEIDV 409 >gi|261838806|gb|ACX98572.1| type I R-M system specificity subunit [Helicobacter pylori 51] Length = 388 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 48/403 (11%), Positives = 124/403 (30%), Gaps = 29/403 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P +W+ V + ++ G+ + T KY +G + Sbjct: 10 LPLNWQRVRLGDICEIVKGQQINKIS----------LNNTDKYPVINGGIDFLGYTNKFN 59 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +K I + G + + + + + + L + + + I + Sbjct: 60 VSKNTITISEGGTCGYVRFMTSNFWSGGHNYSLQKISNKVNNLCL-YHILKSYEKDIMKL 118 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ + + K + + +P+PPL EQ+ I + +D + I + K+ Sbjct: 119 GVGSGLKNIQLKPLKDFEIPLPPLNEQIAIANIL----SALDRYLCALGALILKKEGVKK 174 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 AL ++++ + +G + P + N + Sbjct: 175 ALSFELLSQRKRLRGFNQAWQRVKLGTYKYRRDSFPQPYGNPQWYSDNGM---PFVQVYD 231 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G + + + + V ++ R A Sbjct: 232 VGENFKLTQKTKQKISKIAQPMSVFVPKNSVIITLQGTIG-----RVALTQYDCYCDRTI 286 Query: 321 MAVKPHGIDSTYLAWLMRSY--DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + ++ + + S + + +++ + +K +L+PP+ EQ I Sbjct: 287 LIFDNNTLNDVNKYFFVLSLFTKFEEEKRKADGSIIKTITKQTLKDFEILLPPLNEQIAI 346 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 N+++ I L K Q + + + ++ +I + Sbjct: 347 ANILSDLDNEIISLKNKKRQ----FENIKKALNHDLMSAKIRV 385 >gi|285959362|gb|ADC39984.1| type I restriction-modification system large specifity subunit [Staphylococcus aureus] Length = 415 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 61/402 (15%), Positives = 132/402 (32%), Gaps = 19/402 (4%) Query: 24 HWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W + I G++ I I ++ + + + + + Sbjct: 18 EWSLSTIGALGDFYYGKSAPKWSITKDVGIPCIRYGELYTKFNNVVNEIYSYTSMPKEKL 77 Query: 79 SIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 G+IL ++G + + + D V K+ L + + + Sbjct: 78 RFSKGGEILIPRVGEDPLDFAKCVYLPQKDIAIGEMISVYNTKE--NPLFLTYYFNTKMK 135 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 EGA++S+ + + +I + IP + EQ + +I+ + + Sbjct: 136 YEFAKRVEGASVSNLYYSYLEDIKLKIPDIREQQKLGVFFSKLDRQIELEEEKLELLEQQ 195 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDS--GIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + Q + + + + +K S +E + + +++ KN + Sbjct: 196 KRGYMQKIFTQELKFKNSQLENIKWSYKTLEELNSFFTDGNYGESYPKSEDMSDKNDGVA 255 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 +L G I + K T + +IV + V Sbjct: 256 FLRGSNLKKGRITLEDANYISKKKHSELTTGHLFL-DDIVIAVRGSLGAVGYVNENMVGN 314 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPP 371 A + + YL + + S K + +G + L + +K++ V VP Sbjct: 315 NINSQLAIIRTSSSLLYGKYLLYYLMSNQGKKELLSRVTGTALKQLPIKQIKQIKVPVPK 374 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + EQ I N + + +D L++ + I LLKER+ F+ Sbjct: 375 LYEQHKIANFL----SELDNLIDNQTEKIELLKERKKGFLQK 412 >gi|94989255|ref|YP_597356.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS9429] gi|94542763|gb|ABF32812.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS9429] Length = 399 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 58/396 (14%), Positives = 124/396 (31%), Gaps = 18/396 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + ++ G++ S + G R T K Sbjct: 17 EWEEKELGDIVQITMGQSPSSQNYTTNPSDYILVQGNADIKNGYIFPRVWTTQITKQADK 76 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G I+ P ++ I ++ + + L + + I G Sbjct: 77 GDIILSVRAPV-GDVGKTNYHVIIGRGVAAIKGNEFI----FQILKYLKEIGYWKRISTG 131 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T I + IP L EQ I E +D LI + + + LKE+KQ + Sbjct: 132 STFDSISSSDIKYAKIQIPSLPEQEAIGE----LFQTVDQLIQLQDQKLATLKEQKQTFL 187 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + + +++ G + EV + +++ E ++S+ Sbjct: 188 RKMFPPQIQKVPEIRLQGFKGEWEEKKLGEVSTHRSGTAIEKYFDSEG-EFKVISIGSYG 246 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR----SAQVMERGIITSA 319 +N+ ++V GE+ D + + + + + Sbjct: 247 TNNLYVDQNIRAVSNELTNSKLVASGELTMVLNDKTANGAIIGRCLLITENNKYVVNQRT 306 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + I S YL + + G + + + V++L + +P +KEQ I Sbjct: 307 EIIRPDINISSYYLFHYLNGEFRNGIIKIAQGGTQIYVNYSSVEQLKINIPTLKEQEAIG 366 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 N +D + + E+ + LK + + + Sbjct: 367 NF----FQTLDQQIAQSEEKLTELKALKQTLLNRLF 398 >gi|284037969|ref|YP_003387899.1| restriction modification system DNA specificity domain protein [Spirosoma linguale DSM 74] gi|283817262|gb|ADB39100.1| restriction modification system DNA specificity domain protein [Spirosoma linguale DSM 74] Length = 520 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 56/490 (11%), Positives = 129/490 (26%), Gaps = 103/490 (21%) Query: 27 VVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 V I + ++G T G I ++ ++ G + + S+ + Sbjct: 30 VKRIGSIAETSSGGTPTRGNPEFYNGTIPWLKSGELNDGLITECEEYITEKGLKNSSAKL 89 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F +G +L G K I FD + + PK + E + I Sbjct: 90 FPEGTLLVAMYGATAGKVGILSFDASTNQAVCAVFPKADI-ERDFLFWYFRQQRFDFIEI 148 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA------------ETVRIDTLITER 188 +G + I N +PIP +A Q + + + + I Sbjct: 149 SKGGAQPNISQTVINNAVIPIPEVAVQKQVVKFLNILETEQRIDNNLVLNEEVAQQIARY 208 Query: 189 IRFI--------------ELLKEKKQALVSYIVTKGLNPDVK-----MKDSGIEWVGLVP 229 + +LL + +Q+++ V L + + + +G P Sbjct: 209 FKIRTEAAEVEDIYIEQKKLLTQLRQSILQEAVQGKLTKKFRETEKLAQQDHVRVLGSNP 268 Query: 230 DHWEVKP-----------------------------------------FFALVTELNRKN 248 + Sbjct: 269 SRTATPQLETGADLLARIRAEKAELIRQGKLRKEKPLPPITDAEKPFELPEGWVWCRLGD 328 Query: 249 TKLIESNILSLSYGNIIQKLE-----------------TRNMGLKPESYETYQIVDPGEI 291 S G+ I+ M S V G++ Sbjct: 329 VCESSFYGPRFSNGDYIKNGIPTIRTTDMTDDGRIVLKNTPMVKVSSSKLELYQVLDGDL 388 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVF-YAM 349 + + + I ++ + + I Y+ ++++ ++ + Sbjct: 389 LITRSGSIG---IMAVFRGSYTAIPSAYLIRFRFVSSIFPEYVFSVLKAPFWQRLMGLST 445 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-VEKIEQSIVLLKERRS 408 S + ++ + + +P EQ I + ++ L +E +Q + + + Sbjct: 446 TSTAQVNINASSINSFLIPLPSFTEQQAIVAQVKQLLNQVSALEIENKQQQVEVSQ-LMQ 504 Query: 409 SFIAAAVTGQ 418 ++ A G+ Sbjct: 505 VVLSEAFAGK 514 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 60/196 (30%), Gaps = 8/196 (4%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGK----DIIYIGLEDV-ESGTGKYLPKDGNSRQS 73 +P+ W + + G +G I I D+ + G S Sbjct: 316 ELPEGWVWCRLGDVCESSFYGPRFSNGDYIKNGIPTIRTTDMTDDGRIVLKNTPMVKVSS 375 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSI 131 + G +L + G A+ + +L+ + PE + L + Sbjct: 376 SKLELYQVLDGDLLITRSGSIGIMAVFRGSYTAIPSAYLIRFRFVSSIFPEYVFSVLKAP 435 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + I + +P+P EQ I ++ ++ L E + Sbjct: 436 FWQRLMGLSTTSTAQVNINASSINSFLIPLPSFTEQQAIVAQVKQLLNQVSALEIENKQQ 495 Query: 192 IELLKEKKQALVSYIV 207 + + Q ++S Sbjct: 496 QVEVSQLMQVVLSEAF 511 >gi|227878603|ref|ZP_03996526.1| restriction endonuclease S subunit [Lactobacillus crispatus JV-V01] gi|256850433|ref|ZP_05555861.1| restriction modification system DNA specificity subunit [Lactobacillus crispatus MV-1A-US] gi|262046416|ref|ZP_06019378.1| type I site-specific deoxyribonuclease chain S [Lactobacillus crispatus MV-3A-US] gi|227861809|gb|EEJ69405.1| restriction endonuclease S subunit [Lactobacillus crispatus JV-V01] gi|256712830|gb|EEU27823.1| restriction modification system DNA specificity subunit [Lactobacillus crispatus MV-1A-US] gi|260573287|gb|EEX29845.1| type I site-specific deoxyribonuclease chain S [Lactobacillus crispatus MV-3A-US] Length = 480 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 63/416 (15%), Positives = 135/416 (32%), Gaps = 59/416 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-----IIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP W+ V + L G+T + Y ++D+ + Y+ N Sbjct: 73 DIPDSWEWVRLGDVGLLKNGKTPKKEDTSSDNIYPYFKVKDMNNNNL-YMENVKNWVGEK 131 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK---DVLPELLQGWLLSI 131 S + K I++ P AI+ I S LV +L ++ + Sbjct: 132 YS-RQVMPKNTIIF----PKNGGAILTAKKRILSQDSLVDLNTGGLIPYNDLNHKFIFYL 186 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ I+ +G+ + + K + +P+PPL EQ I KI + + + ++ Sbjct: 187 FLSLDIKDFVKGSAVPTINSKKLKETLVPLPPLEEQSRIAAKIAQLFALLRKVESSTQQY 246 Query: 192 IELLKEKKQALVSYIVTKGL---NPDVK----------------------------MKDS 220 +L K ++ + L +P + + Sbjct: 247 AKLQTLLKSKVLDLAMRGKLVEQDPHDEPASVLLEKIKAEKRKMIKEKEIKKSKPLPPIT 306 Query: 221 GIEWVGLVPDHWEVKPFFA------LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 E +PD WE T N+ + I +++ N Sbjct: 307 DEEKPFDIPDSWEWVRLGNIAKRITDGTHNPPPNSHEGKQVISAINIKKGKIDFSLSNRF 366 Query: 275 LKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + + + G+++ + + + ++ +AV I S Sbjct: 367 VSEDQFLKEDKRTNIRKGDVLLTIVGSLGNAAVV----DTDKLFTAQRSVAVISSNILSK 422 Query: 332 YLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +L +++ S +A G ++ + + L + +PP+ EQ I + I+ Sbjct: 423 FLYYVLISAMFKTQIFANAKGTTQKGIYLSKLINLKLPLPPLAEQNRIVDKIDNLF 478 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 72/204 (35%), Gaps = 9/204 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + E +PD WE + N K K +++ ++ ++ + N+ ++ Sbjct: 66 TDDEKPFDIPDSWEWVRLGDVGLLKNGKTPKKEDTSSDNIYPYFKVKDMNNNNLYMENVK 125 Query: 280 YE-----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + Q++ I+F R + + + + ++ ++ Sbjct: 126 NWVGEKYSRQVMPKNTIIFPKNGGAILTAKKRILSQDSLVDLNTGGLIPY-NDLNHKFIF 184 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +L S D+ ++ + +K V +PP++EQ I I A + + Sbjct: 185 YLFLSLDIKDFVK---GSAVPTINSKKLKETLVPLPPLEEQSRIAAKIAQLFALLRKVES 241 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 +Q L +S + A+ G+ Sbjct: 242 STQQYAKLQTLLKSKVLDLAMRGK 265 >gi|146302129|ref|YP_001196720.1| restriction modification system DNA specificity subunit [Flavobacterium johnsoniae UW101] gi|146156547|gb|ABQ07401.1| restriction modification system DNA specificity domain protein [Flavobacterium johnsoniae UW101] Length = 414 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 44/404 (10%), Positives = 114/404 (28%), Gaps = 27/404 (6%) Query: 26 KVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + + + G T + +ED+ G+ L Sbjct: 14 EWKTVDDIFYIKNGYTPSKSSQEYWTNGTNPWFRMEDIR-KNGRVLSDSIQHVSDSAVKG 72 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---TQ 135 + + +L A++ + ++ +L Sbjct: 73 QLIPENSLLMSTTATIGEHALVLVPYLTNQQITNFSLKTSFIDKVSIKYLFYCFFDFGKW 132 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 IE + +S + + IP L Q I + + T L E + Sbjct: 133 CIENANKNGGLSIIGTNKLKEYTIAIPSLEIQQKIVAILDSFTELTAELTAELTAELTAR 192 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVG--LVPDHWEVKPFFALVTELNRKNTKLIE 253 K + + T N + E +G + + + Sbjct: 193 KMQYSYYREKLYTFDKNKVQHLPM-DDESIGVFQRGKRFVKTDLISEGVPVIHYGEMYTH 251 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + + +N L+ + + G++V + + +A + + Sbjct: 252 YGTWADKTKSFLSEELVKNKNLR--------VANKGDVVIVAAGETIEDIGMGTAWLGDE 303 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPI 372 G++ ++ ++A+ R+ SG ++ + + + VP Sbjct: 304 GVVVHDACFSYKTTLNPKFVAYFTRTKQFHDQIKKHISSGKISAINANGLGKAIIPVPSK 363 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +EQ + ++++ + + E + + I L K+ R + Sbjct: 364 EEQERVVSILDKFDVLTNSISEGLPKEIELRKKQYEYYRDLLLT 407 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 17/184 (9%), Positives = 57/184 (30%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 G+E D ++ +++ + + ++ + ++ +S Sbjct: 10 GVEVEWKTVDDIFYIKNGYTPSKSSQEYWTNGTNPWFRMEDIRKNGRVLSDSIQHVSDSA 69 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 Q++ ++ + + + + I + + YL + + Sbjct: 70 VKGQLIPENSLLMSTTATIGEHALVLVPYLTNQQITNFSLKTSFIDKVSIKYLFYCFFDF 129 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + A +G + +K + +P ++ Q I +++ T L ++ + Sbjct: 130 GKWCIENANKNGGLSIIGTNKLKEYTIAIPSLEIQQKIVAILDSFTELTAELTAELTAEL 189 Query: 401 VLLK 404 K Sbjct: 190 TARK 193 >gi|303241303|ref|ZP_07327808.1| restriction modification system DNA specificity domain protein [Acetivibrio cellulolyticus CD2] gi|302591142|gb|EFL60885.1| restriction modification system DNA specificity domain protein [Acetivibrio cellulolyticus CD2] Length = 421 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 62/433 (14%), Positives = 135/433 (31%), Gaps = 40/433 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYL- 64 YK + V G IP+ W+VV + G +SG + I + D + K Sbjct: 7 YKMTEV---GVIPEDWEVVDFGDIVEYTKGFAFKSGDYCQDGVRIIRVSDTTYDSIKDDN 63 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGP--------YLRKAIIADFDG--ICSTQFLVL 114 P +++ I + +++ +G + +I + + +++ Sbjct: 64 PIYIDTKNCTKYRKWILIEHDLIFSTVGSKPPMYDSLVGKVIMITKRYAGSLLNQNAVLI 123 Query: 115 QPKDVLPELLQ----GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA-EQVL 169 + K+ + + + + + A + K + P+P+P EQ Sbjct: 124 RSKEKNVFIQKLLLNHFRTNRYIRYIETIFRGNANQASITLKELFKFPIPLPINYSEQKA 183 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + I +L + + + Q L++ K E VGL+P Sbjct: 184 IATALSDTDELIQSLEKLIAKKRAIKQGVMQKLLTGKKRLQKFNQETEKYKNTE-VGLIP 242 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + W + + S + + I E G Sbjct: 243 EDWNIVKIKNIALISTG-----------SRNTQDKIDSGEYPFFVRSQTVERINSYSYDG 291 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 E V D + ++ ID + ++ ++ Sbjct: 292 EAVLTAGDGVGTGKVFHYISGKFDFHQRVYKISDFKDNIDGYFFFLYFKNSFYNRIMQMT 351 Query: 350 GSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 S++ E + + + +PP EQ I ++++ A + +E + K+ + Sbjct: 352 AKSSVDSVRMEMIAEMQIPIPPTQNEQKAIASILSDMDAE----ITALETKLEKYKKIKQ 407 Query: 409 SFIAAAVTGQIDL 421 + +TG+I L Sbjct: 408 GMMQNLLTGKIRL 420 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 69/204 (33%), Gaps = 16/204 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD 67 +YK++ V G IP+ W +V IK ++TG + K ++SG + + Sbjct: 231 EKYKNTEV---GLIPEDWNIVKIKNIALISTGSRNTQDK---------IDSGEYPFFVRS 278 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + ++ + G+ + + G V + D + + Sbjct: 279 QTVERINSYSY----DGEAVLTAGDGVGTGKVFHYISGKFDFHQRVYKISDFKDNIDGYF 334 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I + S D + I P+ ++ I + +D IT Sbjct: 335 FFLYFKNSFYNRIMQMTAKSSVDSVRMEMIAEMQIPIPPTQNEQKAIASILSDMDAEITA 394 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 +E K+ KQ ++ ++T + Sbjct: 395 LETKLEKYKKIKQGMMQNLLTGKI 418 >gi|269978366|gb|ACZ55917.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 459 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 54/435 (12%), Positives = 134/435 (30%), Gaps = 45/435 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKNNTNVSGFASVDMPAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 192 IELLKEKKQALVSYIVTKGLNPD--VKMKDSGIEWVGLVPDHWEVKPFFAL--------- 240 + ++ Y L+ + + E + P +K Sbjct: 192 LNTELNARKKQYQYYQNMLLDFNGINQNHKDAKERLAQKPYPKRLKTLLQTLAPKGVEFR 251 Query: 241 ------------VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 V + + ++ + + ++ N Q ++ E + Sbjct: 252 KLGDIGEFYGGLVGKSKKSFSQGNKFYVPYINVFNNPQLDLNALESVQIGDKEKQNTIQL 311 Query: 289 GEIVFRFID------LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 G+++F + + + + + + + + ++L +R Y+ Sbjct: 312 GDVLFTGSSENLEDCAMSCVVTQKIEKDIYLNSFCFGFRFFDKNLFNPSFLKHFLRDYNF 371 Query: 343 CKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 K + +G R ++ + + ++ + +PP++ Q +I +++ + L+ I I Sbjct: 372 RKNISKVANGVTRFNVSKQLLSQITIPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIK 431 Query: 402 LLKE----RRSSFIA 412 K+ R + Sbjct: 432 ARKKQYEYYREKLLT 446 >gi|315036576|gb|EFT48508.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0027] Length = 398 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 64/401 (15%), Positives = 143/401 (35%), Gaps = 29/401 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W++ + R + T + E + + + E + + + + D S + G Sbjct: 10 WELCKLGRVVERVTRKNKELKSTLP-LTISAQEGLIDQNVFFNKSVASRDVSGYYLIYNG 68 Query: 85 QILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLS-IDVTQRIE 138 + Y K G+ ST +++ +PK++ L+ + + + + Sbjct: 69 EFAYNKSYSNGYPWGAIKRLNRYDMGVLSTLYIIFKPKNIDSNFLEKYYDTSCWYHEVSK 128 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EGA + + + V + KI ++D IT R +E LKE Sbjct: 129 HAAEGARNHGLLNIAASDFLRTELTVPKSVEEQRKIGNFLKQLDDTITLHQRKLEQLKEL 188 Query: 199 KQALVSYIVT--KGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLI 252 K+A + + P V+ EW +G + + + ++ Sbjct: 189 KKAYLQVMFPAKDERVPKVRFAAFEGEWAHRKLGEITESFS-------GGTPTAGKSEYY 241 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 +I + G I + + + ++V G+I++ + + + Sbjct: 242 GGDIPFIRSGEISSDSTELFITENGLNSSSAKMVKVGDILYALYGATSGEVGISKI---- 297 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-P 371 G I A +A++P D++YL + G + +L VK L +++P Sbjct: 298 TGAINQAILAIRPSKNDNSYLIIQWLRKQKNTIISTYLQGGQGNLSSSIVKNLIIMLPQN 357 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +EQ + R+D ++ + + LK+ ++S++ Sbjct: 358 KEEQEKVGIF----FKRLDDIITLHQNKLEQLKDLKTSYLQ 394 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 57/186 (30%), Gaps = 9/186 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + T+ +G T +GK DI +I ++ S + + ++S+ Sbjct: 215 EWAHRKLGEITESFSGGTPTAGKSEYYGGDIPFIRSGEISSDSTELF---ITENGLNSSS 271 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G ILY G + I+ G + L ++P L L I Sbjct: 272 AKMVKVGDILYALYGATSGEVGISKITGAINQAILAIRPSKNDNSYLIIQWLRKQKNTII 331 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + I M EQ + I + + +L Sbjct: 332 STYLQGGQGNLSSSIVKNLIIMLPQNKEEQEKVGIFFKRLDDIITLHQNKLEQLKDLKTS 391 Query: 198 KKQALV 203 Q + Sbjct: 392 YLQNMF 397 >gi|146329709|ref|YP_001209147.1| type I restriction modification DNA specificity domain-containing protein [Dichelobacter nodosus VCS1703A] gi|146233179|gb|ABQ14157.1| type I restriction modification DNA specificity domain protein [Dichelobacter nodosus VCS1703A] Length = 412 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 61/425 (14%), Positives = 139/425 (32%), Gaps = 29/425 (6%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR---TSESGKDIIYIGLEDVESGTGKYL 64 P YK + V G IP+ W ++ + ++ +++ YI LE V+ G Sbjct: 5 PGYKMTEV---GVIPEDWDLLLVSELANVDPENLSASTDPNFSFNYISLEQVDFGKL-IG 60 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLV--LQPKDV 119 R + + + IL + P L+ + ICST F V +P Sbjct: 61 TFREVFRTAPSRARRVVRHDDILMSTVRPNLKAHLHFRSQVSDTICSTGFAVLRAKPDAT 120 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAET 178 P + L + + ++IE G+ + + + + +P+PP + EQ I + + Sbjct: 121 DPAYIFAHLFASPLNKQIEKTLAGSNYPAINSRDVRELKIPVPPTIEEQRAIAQALSDVD 180 Query: 179 VRIDTLITERIRFIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 + L + +L + Q L++ G + + ++K +E + + Sbjct: 181 ALLAALDKIIAKKRDLKQATMQQLLTGETRLPGFSGEWEVKR--LEELAEIRSGGTPSTG 238 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + + ++ G+ + +R + + + +++ +V Sbjct: 239 EPSFWD---GDIPWCTPTDITALNGHKYLRETSRLITPLGLNASSAEMIPAQSVVMTSRA 295 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 + + P + + + G + Sbjct: 296 TIGECAINAVPLS-----TNQGFKNFIPFVKTDVDFLYYLLGTQKQGLIALCGGSTFLEI 350 Query: 358 KFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + V +P EQ I V++ + + VL E + + + + +T Sbjct: 351 GKTQLAAYEVRLPSTKAEQTAIATVLSEMDSELSVL----ESRRDKTRNIKQAMMQELLT 406 Query: 417 GQIDL 421 G+ L Sbjct: 407 GKTRL 411 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 69/181 (38%), Gaps = 7/181 (3%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N +SL + + + T + ++V +I+ + Sbjct: 39 TDPNFSFNYISLEQVDFGKLIGTFREVFRTAPSRARRVVRHDDILMSTVRPNLKAHLHFR 98 Query: 308 AQVMERGIITS-AYMAVKPHGIDSTY-LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 +QV + T A + KP D Y A L S ++ + ++ DV+ L Sbjct: 99 SQVSDTICSTGFAVLRAKPDATDPAYIFAHLFASPLNKQIEKTLAGSNYPAINSRDVREL 158 Query: 366 PVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 + VPP I+EQ I ++ D L+ +++ I ++ + + + +TG+ L G Sbjct: 159 KIPVPPTIEEQRAIAQALSDV----DALLAALDKIIAKKRDLKQATMQQLLTGETRLPGF 214 Query: 425 S 425 S Sbjct: 215 S 215 >gi|224419063|ref|ZP_03657069.1| hypothetical protein HcanM9_07270 [Helicobacter canadensis MIT 98-5491] gi|313142571|ref|ZP_07804764.1| HsdS [Helicobacter canadensis MIT 98-5491] gi|313131602|gb|EFR49219.1| HsdS [Helicobacter canadensis MIT 98-5491] Length = 303 Score = 117 bits (292), Expect = 5e-24, Method: Composition-based stats. Identities = 45/296 (15%), Positives = 91/296 (30%), Gaps = 17/296 (5%) Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 G+ + H ++ +P+PPL EQ+ I + + + +ID + + L Sbjct: 10 FSKYILGSAIPHIYFRDYKKEQIPLPPLEEQMRIVKILDSAFEKIDKSVELLKANLANLD 69 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV------PDHWEVKPFFALVTELNRKNTK 250 E Q+++ + + + P HWE K + + Sbjct: 70 ELAQSVLDRAFNPLGDSIDSTESTQNPSTHDTQSPYPLPQHWEWKTLGEIGEIITGSTPS 129 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPES------YETYQIVDPGEIVFRFIDLQNDKRS 304 Y +K +E + + ++ I K Sbjct: 130 KNNPKFYGNDYPLFKPSDLGSGNTIKASDNLSKLGFENARKLPKNTLLVVCIGASIGKIG 189 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVK 363 L I + + S YL ++ S + S + + Sbjct: 190 LSGIIGSCNQQINAII---PSPNVLSKYLFFVCHSKYFQSILKKNASQTTLPIINKTEFS 246 Query: 364 RLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +L + +P IKEQ I ++ +I L E + +E + S + A +G+ Sbjct: 247 KLEIPLPKDIKEQEQIAMHLDSVFDKIQKLKELYNAQLQDYEELKQSLLNQAFSGK 302 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 71/199 (35%), Gaps = 10/199 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+HW+ + ++ TG T D D+ G+G + N + Sbjct: 107 LPQHWEWKTLGEIGEIITGSTPSKNNPKFYGNDYPLFKPSDL--GSGNTIKASDNLSKLG 164 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSIDV 133 K +L +G + K ++ G C+ Q + P L + L S Sbjct: 165 FENARKLPKNTLLVVCIGASIGKIGLSGIIGSCNQQINAIIPSPNVLSKYLFFVCHSKYF 224 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ T+ + + +P+P + EQ I + + +I L + Sbjct: 225 QSILKKNASQTTLPIINKTEFSKLEIPLPKDIKEQEQIAMHLDSVFDKIQKLKELYNAQL 284 Query: 193 ELLKEKKQALVSYIVTKGL 211 + +E KQ+L++ + L Sbjct: 285 QDYEELKQSLLNQAFSGKL 303 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 18/83 (21%), Positives = 38/83 (45%), Gaps = 3/83 (3%) Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + +++ D K + F D K+ + +PP++EQ I +++ +ID Sbjct: 1 MFYFLKNLDFSKYIL---GSAIPHIYFRDYKKEQIPLPPLEEQMRIVKILDSAFEKIDKS 57 Query: 393 VEKIEQSIVLLKERRSSFIAAAV 415 VE ++ ++ L E S + A Sbjct: 58 VELLKANLANLDELAQSVLDRAF 80 >gi|188527366|ref|YP_001910053.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori Shi470] gi|188143606|gb|ACD48023.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori Shi470] Length = 424 Score = 117 bits (292), Expect = 5e-24, Method: Composition-based stats. Identities = 54/404 (13%), Positives = 130/404 (32%), Gaps = 27/404 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQS 73 +PK + ++ ++ G T I + +ED+ + Sbjct: 12 VPKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPK 71 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLS 130 +F K I+ A++ D + + QF L K ++ + Sbjct: 72 ALKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCDIALDMKFFFYQC 130 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + + + + D PIPPL Q I + + A T L TE Sbjct: 131 FLLGEWCKKNTNVSGFASMDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELKA 190 Query: 191 FIELLKEKKQALV------SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + + + L+ S ++ K L P E + + L Sbjct: 191 RKKQYQYYQNMLLDFKDIHSNHKDAKISAKTYPKRLKTLLQTLAPKGVEFRKLGEVCEIL 250 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + + + ++ Y + + + + G ++ D Sbjct: 251 DNRRIPIAKNKRNPGIYPYYGANGIQDYIDSYIFDGDFVLVGEDGSVI------NKDNTP 304 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + + + + A++ + + +L + +++ D+ +G + E++K+ Sbjct: 305 VVNWASGKIWVNNHAHVLQTKNELKLKFLYFYLQTIDV----SYCVAGTPPKINQENLKQ 360 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + +PP++ Q +I +++ A L+ I I K++ Sbjct: 361 ITIPIPPLEIQQEIVKILDQFLALTTDLLAGIPAEIEARKKQYQ 404 >gi|152997207|ref|YP_001342042.1| restriction modification system DNA specificity subunit [Marinomonas sp. MWYL1] gi|150838131|gb|ABR72107.1| restriction modification system DNA specificity domain [Marinomonas sp. MWYL1] Length = 400 Score = 117 bits (292), Expect = 5e-24, Method: Composition-based stats. Identities = 70/427 (16%), Positives = 131/427 (30%), Gaps = 51/427 (11%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPK 66 Y DS +PK W + + G I + + + +G Y Sbjct: 6 YNDS-------LPKGWVLAKANDVMDVRDGTHDSPKAQATGIPLVTSKSLVNGKIDYSTC 58 Query: 67 DGNSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPE 122 S Q S S G ILY +G I+ I + D+ Sbjct: 59 TYISEQDHESISKRSAVDDGDILYAMIGTIGNPVIVKKDFDFSIKNVALFKFTKTDLSNR 118 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + +L S ++ E G T I + +P+PPL EQ I + Sbjct: 119 YIFHYLNSGLAKRQFENNSRGGTQKFVSLGNIRELMIPLPPLEEQKRIAAILDKADAIRR 178 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 E L S + +P K I + VT Sbjct: 179 KRQQAIDLADEF-------LRSVFLDMFGDPVTNPKGKRIVPLIE---------LCNKVT 222 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-----KPESYETYQIVDPGEIVFRFID 297 + ++ K ES I L NI+ + + + ++ G++++ + Sbjct: 223 DGTHQSPKWEESGIPFLFISNIVNGKISFDTNKFISKETLDELTRSTPIEKGDVLYTTVG 282 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSG-LR 354 + + +KP+ ++ +L ++ S + + ++ G + Sbjct: 283 SYGN---VARVTDDTEFCFQRHIAHIKPNHEIVNAEFLTSMLASSVVRRQADSLVRGIAQ 339 Query: 355 QSLKFEDVKRLPVLVPPIKEQF---DITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 ++L ++K + V ++ Q I I+ D V LL +S I Sbjct: 340 KTLNLRELKEILVFDVSLENQKSYLKIVEPIHKIKDNYDNSVN------ELLNN-FNSLI 392 Query: 412 AAAVTGQ 418 A +G+ Sbjct: 393 QKAFSGE 399 >gi|167461217|ref|ZP_02326306.1| type I restriction-modification enzyme S subunit [Paenibacillus larvae subsp. larvae BRL-230010] Length = 386 Score = 117 bits (292), Expect = 5e-24, Method: Composition-based stats. Identities = 51/399 (12%), Positives = 115/399 (28%), Gaps = 27/399 (6%) Query: 26 KVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 V + +++G +S + + I + DV SG+ + + Sbjct: 7 DEVKLGGLVHIDSGYAFKSSYFNEKEGLPIIRIRDVTSGSI------STYYSGEYDEKYL 60 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 IL G + + + + + + + + ++IE Sbjct: 61 VENNDILISMDGTFSVRKWSTGKALLNQRVCRIKSLNEKILLDDYLYYILPKYLKKIEDK 120 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 T+ H K I I + +P + Q + I + E Sbjct: 121 TSFVTVKHLSVKDINEIFLLLPNIEAQRKTVLILDKAQELI--------NKRKKQIEACD 172 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L+ + V +E +G + N KN + S Sbjct: 173 KLIKGLFYDMFGDPVLNNKFTLESLG---SVSLKITDGTHHSPENTKNGVPYITAKHLGS 229 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 T + + G++++ + + + Sbjct: 230 GSLDFYNAPTFISLEDHKKIFARCNPEKGDVLYIKDGATTGIACINHYDFEFSMLSSLEL 289 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + S YL + + + K M G + L + + +P+L+PPI Q Sbjct: 290 IKTDITKLSSIYLVSYLNNDQVKKKVLQDMAGGAIKRLTLKKINAIPILLPPIHLQNRFA 349 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +I+ +++QS+ L+ + + A G+ Sbjct: 350 E----QVEKIEQQKLRLQQSLTELENNFKALMQRAFKGE 384 >gi|307250674|ref|ZP_07532611.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306857282|gb|EFM89401.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 452 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 61/436 (13%), Positives = 128/436 (29%), Gaps = 67/436 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W++ + +T I +GL + + L Q+ + Sbjct: 20 EIPESWEIEKLGNIIFNLGQKTPNERFFYIDVGLINNKIHKLNSLENILEPDQAPSRARK 79 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQ-PKDVLPELLQGWLLSIDVT 134 I K ILY + PYL+ I + D I ST F+V+ + + L +LLS T Sbjct: 80 IVQKNSILYSTVRPYLQNICILEQDFQYEPIASTAFVVMNVFTNFYHKYLFYYLLSPVFT 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G + + N+P+ IPPL EQ I KI I+ + + L Sbjct: 140 DFVNQEMVGVAYPAINDDKLYNLPIAIPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTAL 199 Query: 195 LKEKK----QALVSYIVTKGLNPDVKM--------------------------------- 217 ++ ++++ + L Sbjct: 200 HQQFPEQLKKSILQAAIQGKLTKQDPNDEPALVLIERIKAEKLRLIAEKKLKKPKVVSEI 259 Query: 218 ---------------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN---ILSL 259 + E +P++W + + + L Sbjct: 260 ILRDNLPYEIINGEERCIADEVPFEIPENWCWVRLGEIGETNIGLTYAPNDVVLEGTIVL 319 Query: 260 SYGNIIQKLETRNMGL--KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 GNI + + + + +++ + + + + Sbjct: 320 RSGNIQNGKIDVSSDVVRVNLNIPENKKCYKNDLLICARNGSKNLVGKAAIVDKDGYSFG 379 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + + Y+ + + S F + + + ++ + +PP+ EQ Sbjct: 380 AFMAIFR-----NQYIYYYLSSPLFRNDFDGINTTTINQITQNNLNNRLIPLPPLNEQKR 434 Query: 378 ITNVINVETARIDVLV 393 I I + + L Sbjct: 435 IVEKIEKLFSTLQNLE 450 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 76/201 (37%), Gaps = 10/201 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQ 284 +P+ WE++ ++ L +K I N I KL + L+P+ + Sbjct: 20 EIPESWEIEKLGNIIFNLGQKTPNERFFYIDVGLINNKIHKLNSLENILEPDQAPSRARK 79 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLC 343 IV I++ + + I ++A++ + YL + + S Sbjct: 80 IVQKNSILYSTVRPYLQNICILEQDFQYEPIASTAFVVMNVFTNFYHKYLFYYLLSPVFT 139 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G+ ++ + + LP+ +PP+ EQ I I I+ + E+ + Sbjct: 140 DFVNQEMVGVAYPAINDDKLYNLPIAIPPLNEQKRIVAKIEELLPYIEQY-AEKEEKLTA 198 Query: 403 L-----KERRSSFIAAAVTGQ 418 L ++ + S + AA+ G+ Sbjct: 199 LHQQFPEQLKKSILQAAIQGK 219 >gi|322381543|ref|ZP_08055521.1| hypothetical protein PL1_2426 [Paenibacillus larvae subsp. larvae B-3650] gi|321154501|gb|EFX46799.1| hypothetical protein PL1_2426 [Paenibacillus larvae subsp. larvae B-3650] Length = 381 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 51/399 (12%), Positives = 115/399 (28%), Gaps = 27/399 (6%) Query: 26 KVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 V + +++G +S + + I + DV SG+ + + Sbjct: 2 DEVKLGGLVHIDSGYAFKSSYFNEKEGLPIIRIRDVTSGSI------STYYSGEYDEKYL 55 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 IL G + + + + + + + + ++IE Sbjct: 56 VENNDILISMDGTFSVRKWSTGKALLNQRVCRIKSLNEKILLDDYLYYILPKYLKKIEDK 115 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 T+ H K I I + +P + Q + I + E Sbjct: 116 TSFVTVKHLSVKDINEIFLLLPNIEAQRKTVLILDKAQELI--------NKRKKQIEACD 167 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L+ + V +E +G + N KN + S Sbjct: 168 KLIKGLFYDMFGDPVLNNKFTLESLG---SVSLKITDGTHHSPENTKNGVPYITAKHLGS 224 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 T + + G++++ + + + Sbjct: 225 GSLDFYNAPTFISLEDHKKIFARCNPEKGDVLYIKDGATTGIACINHYDFEFSMLSSLEL 284 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + S YL + + + K M G + L + + +P+L+PPI Q Sbjct: 285 IKTDITKLSSIYLVSYLNNDQVKKKVLQDMAGGAIKRLTLKKINAIPILLPPIHLQNRFA 344 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +I+ +++QS+ L+ + + A G+ Sbjct: 345 E----QVEKIEQQKLRLQQSLTELENNFKALMQRAFKGE 379 >gi|315637036|ref|ZP_07892259.1| restriction endonuclease S [Arcobacter butzleri JV22] gi|315478572|gb|EFU69282.1| restriction endonuclease S [Arcobacter butzleri JV22] Length = 405 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 65/416 (15%), Positives = 143/416 (34%), Gaps = 36/416 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W+ P+ T++ + +S I I ++ G K T Sbjct: 7 LPDGWEWKPLISLTEVFSDGDWIESKDQSDDGIRLIQTGNIGIGIFKDREDKSRFISEST 66 Query: 76 ---STVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQF-LVLQPKDVLPELLQGWL 128 + + L +L + ++ + D I S +V + VLP+L + Sbjct: 67 FERLNCTEIYENDCLISRLPEPVGRSCLIPKMDLKLITSVDCTIVRFKESVLPKLFVYYS 126 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S I GAT K + NIP+P+PPL+EQ I K+ +ID I Sbjct: 127 QSNYYFNMIMNNSTGATRLRISKKNLSNIPIPLPPLSEQQRIVAKLDNLFAKIDKAIALH 186 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + I+ ++++ + + + + + +V + KN Sbjct: 187 QKNIDEANVFMASVLNDVFV------------------ELEEKYGLIKINDVVVKTKNKN 228 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLK-----PESYETYQIVDPGEIVFRFIDLQNDKR 303 + + + I + + K + V G+IV+ Sbjct: 229 PLNEKDTPFTYIDISSIDNKSFKIVEPKQLIGSEAPSRAKKEVFQGDIVYSTTRPNLKNI 288 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDV 362 ++ S T + ++YL + + + L + G + + D+ Sbjct: 289 AIVSENYNNPIASTGFCVLRTNEKTINSYLFYFLITEKLFEQIEPNIRGAQYPATSDNDL 348 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 K + P + Q + + ++ + +++ + + ++ + LKE ++S + G+ Sbjct: 349 KNCNIPNAPYETQQKVVSYLDEISNKMEKIKQIQKEKMQSLKELKASILDQGFKGE 404 >gi|253315009|ref|ZP_04838222.1| restriction modification system DNA specificity subunit [Staphylococcus aureus subsp. aureus str. CF-Marseille] Length = 403 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 66/402 (16%), Positives = 142/402 (35%), Gaps = 27/402 (6%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 14 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 73 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 74 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 133 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 134 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGQFFSKLDQQIELEEQK 190 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I ++ L + G W K ++ N++ Sbjct: 191 LELLQQQKKCYIQKIFSQEL--------RFKDEEGNYYKGWNKKQLKDVLEFSNKRTINE 242 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 E +L+ S +I + + + P + + ++ Sbjct: 243 NEYPVLTSSRQGLILQSDYYKDRKTFAESNIGYFILPKNHITYRSRSDDGIFKFNLNLMI 302 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + GII+ Y K + YL + + + L +D++ + +P Sbjct: 303 DVGIISKYYPVFKGIDANQYYLTLHLNYQLKKEYIKYATGTSQLVLSQKDLQNIKTKLPS 362 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I + + ID LVEK + LK R+ + Sbjct: 363 YEEQQKIGDF----FSEIDRLVEKQSSKVGRLKVRKKELLQK 400 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 50/185 (27%), Gaps = 14/185 (7%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E + + I L NI + Sbjct: 4 PELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 63 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + Sbjct: 64 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKK---- 119 Query: 331 TYLAWLMRSYDL-----CKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINV 384 Y Y L K+F A G R+ L F+++ L + P I +EQ I + Sbjct: 120 EYYYNFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGQFFSK 179 Query: 385 ETARI 389 +I Sbjct: 180 LDQQI 184 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 53/184 (28%), Gaps = 5/184 (2%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W +K + + RT + + Y KD + I Sbjct: 221 KGWNKKQLKDVLEFSNKRTINENEYPVLTSSRQGLILQSDY-YKDRKTFAESNIGYFILP 279 Query: 83 KGQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K I Y G + + GI S ++ + + L+ + + Sbjct: 280 KNHITYRSRSDDGIFKFNLNLMIDVGIIS-KYYPVFKGIDANQYYLTLHLNYQLKKEYIK 338 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + K + NI +P EQ I + ++ ++ R KE Sbjct: 339 YATGTSQLVLSQKDLQNIKTKLPSYEEQQKIGDFFSEIDRLVEKQSSKVGRLKVRKKELL 398 Query: 200 QALV 203 Q + Sbjct: 399 QKMF 402 >gi|154252791|ref|YP_001413615.1| restriction modification system DNA specificity subunit [Parvibaculum lavamentivorans DS-1] gi|154156741|gb|ABS63958.1| restriction modification system DNA specificity domain [Parvibaculum lavamentivorans DS-1] Length = 392 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 68/407 (16%), Positives = 129/407 (31%), Gaps = 39/407 (9%) Query: 29 PIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 PI + G+ S G DI ++ D+ + D + + Sbjct: 9 PISEIASVERGKFSARPRNDPRYFGGDIPFLQTGDIARAGRFIVGWDQTLNAQGLAVSRL 68 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F +G I + + I+ FD C + + P++ + + + ++ Sbjct: 69 FPRGTIFMS-IAANVGDVAISTFDAACPDSVVAVIPRNGADAEWL-FQILRHCKDGLSSL 126 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 ++ + I +P+PPL EQ I E + I+ L R + L Q Sbjct: 127 ATQNAQANLSLEKITPFRVPVPPLPEQCKIAEILRTWDEAIEKLEALRAAKRDRLTGLTQ 186 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L+ G PD W+ +P A+ T + R+N + + Sbjct: 187 KLL-------------------GIGGAFPDRWKQRPLSAISTRVRRQNGGGDHPVMTISA 227 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + E + + S + Y ++ GE + + ++ Y Sbjct: 228 KSGFRLQSEKFSRDMAGSSVDRYIVLHEGEFAYNKGNSLTAPYGCIFPLDRPTALVPFVY 287 Query: 321 MAVKPHGIDSTYLA-WLMRSYDLCKVFYA-MGSGLRQ----SLKFEDVKRLPVLVPPIKE 374 S L + L + SG+R +L ED V VPP E Sbjct: 288 FCFALKADLSREFFAHLFAAGALNHQLSRLINSGVRNDGLLNLNPEDFFGCKVPVPPADE 347 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 Q I + + + +E I L ++ + +TG+ + Sbjct: 348 QSAIASTLTTAKQE----IGLLETEIETLTRQKRGLMQKLLTGEWRV 390 Score = 38.2 bits (87), Expect = 2.7, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 57/192 (29%), Gaps = 11/192 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P WK P+ + + ++ I + + +D D + Sbjct: 196 PDRWKQRPLSAISTRVRRQNGGGDHPVMTISAKSGFRLQSEKFSRDMAGSSVD--RYIVL 253 Query: 82 AKGQILYGK----LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-----SID 132 +G+ Y K PY + + + K L L + Sbjct: 254 HEGEFAYNKGNSLTAPYGCIFPLDRPTALVPFVYFCFALKADLSREFFAHLFAAGALNHQ 313 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +++ I + + + + + +P+PP EQ I + I L TE Sbjct: 314 LSRLINSGVRNDGLLNLNPEDFFGCKVPVPPADEQSAIASTLTTAKQEIGLLETEIETLT 373 Query: 193 ELLKEKKQALVS 204 + Q L++ Sbjct: 374 RQKRGLMQKLLT 385 >gi|94993143|ref|YP_601242.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS2096] gi|94546651|gb|ABF36698.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS2096] Length = 399 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 58/396 (14%), Positives = 124/396 (31%), Gaps = 18/396 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + ++ G++ S + G R T K Sbjct: 17 EWEEKELGDIVQITMGQSPSSQNYTTNPSDYILVQGNADIKNGYVFPRVWTTQITKQADK 76 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G I+ P ++ I ++ + + L + + I G Sbjct: 77 GDIILSVRAPV-GDVGKTNYHVIIGRGVAAIKGNEFI----FQILKYLKEIGYWKRISTG 131 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T I + IP L EQ I E +D LI + + + LKE+KQ + Sbjct: 132 STFDSISSSDIKYAKIQIPSLPEQEAIGE----LFQTVDQLIQLQDQKLATLKEQKQTFL 187 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + + +++ G + EV + +++ E ++S+ Sbjct: 188 RKMFPPQIQKVPEIRLQGFKGEWEEKKLGEVSTHRSGTAIEKYFDSEG-EFKVISIGSYG 246 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR----SAQVMERGIITSA 319 +N+ ++V GE+ D + + + + + Sbjct: 247 TNNLYVDQNIRAVSNELTNSKLVASGELTMVLNDKTANGAIIGRCLLITENNKYVVNQRT 306 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + I S YL + + G + + + V++L + +P +KEQ I Sbjct: 307 EIIRPDINISSYYLFHYLNGEFRNGIIKIAQGGTQIYVNYSSVEQLKINIPTLKEQEAIG 366 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 N +D + + E+ + LK + + + Sbjct: 367 NF----FQTLDQQIAQSEEKLTELKALKQTLLNRLF 398 >gi|15924797|ref|NP_372331.1| specificity determinant HsdS [Staphylococcus aureus subsp. aureus Mu50] gi|156980123|ref|YP_001442382.1| specificity determinant HsdS [Staphylococcus aureus subsp. aureus Mu3] gi|255006593|ref|ZP_05145194.2| specificity determinant HsdS [Staphylococcus aureus subsp. aureus Mu50-omega] gi|14247579|dbj|BAB57969.1| probable specificity determinant HsdS [Staphylococcus aureus subsp. aureus Mu50] gi|156722258|dbj|BAF78675.1| probable specificity determinant HsdS [Staphylococcus aureus subsp. aureus Mu3] Length = 409 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 66/402 (16%), Positives = 141/402 (35%), Gaps = 27/402 (6%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 20 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGQFFSKLDQQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I ++ L + G W K ++ N++ Sbjct: 197 LELLQQQKKCYIQKIFSQEL--------RFKDEEGNYYKGWNKKQLKDVLEFSNKRTINE 248 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 E +L S +I + + + P + + ++ Sbjct: 249 NEYPVLISSRQGLILQSDYYKDRKTFAESNIGYFILPKNHITYRSRSDDGIFKFNLNLMI 308 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + GII+ Y K + YL + + + L +D++ + +P Sbjct: 309 DVGIISKYYPVFKGIDANQYYLTLHLNYQLKKEYIKYATGTSQLVLSQKDLQNIKTKLPS 368 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I + + ID LVEK + LK R+ + Sbjct: 369 YEEQQKIGDF----FSEIDRLVEKQSSKVGRLKVRKKELLQK 406 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 50/185 (27%), Gaps = 14/185 (7%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E + + I L NI + Sbjct: 10 PELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKK---- 125 Query: 331 TYLAWLMRSYDL-----CKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINV 384 Y Y L K+F A G R+ L F+++ L + P I +EQ I + Sbjct: 126 EYYYNFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGQFFSK 185 Query: 385 ETARI 389 +I Sbjct: 186 LDQQI 190 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 54/184 (29%), Gaps = 5/184 (2%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W +K + + RT + + I Y KD + I Sbjct: 227 KGWNKKQLKDVLEFSNKRTINENEYPVLISSRQGLILQSDY-YKDRKTFAESNIGYFILP 285 Query: 83 KGQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K I Y G + + GI S ++ + + L+ + + Sbjct: 286 KNHITYRSRSDDGIFKFNLNLMIDVGIIS-KYYPVFKGIDANQYYLTLHLNYQLKKEYIK 344 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + K + NI +P EQ I + ++ ++ R KE Sbjct: 345 YATGTSQLVLSQKDLQNIKTKLPSYEEQQKIGDFFSEIDRLVEKQSSKVGRLKVRKKELL 404 Query: 200 QALV 203 Q + Sbjct: 405 QKMF 408 >gi|260768975|ref|ZP_05877909.1| type I restriction-modification enzyme S subunit [Vibrio furnissii CIP 102972] gi|260617005|gb|EEX42190.1| type I restriction-modification enzyme S subunit [Vibrio furnissii CIP 102972] Length = 415 Score = 116 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 55/408 (13%), Positives = 110/408 (26%), Gaps = 33/408 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ I T T ++ K + YI V+ G+ + + + + Sbjct: 19 WEQKSITEVATKVTDGTHDTPKPVESGMPYITAIHVKDGSIDFDNCYYVTPEVHQAIYKR 78 Query: 81 F--AKGQILYGKLGPYLRKAIIADFDGICS--TQFLVLQPKDVLPELLQGWLLSIDVTQR 136 KG +L +G I +D S L+ ++++ + + Sbjct: 79 CNPEKGDLLLVNIGAGTATCAINTYDAEFSMKNVALIKPDREIIDPYFLEQIQRKSTARL 138 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G K I + P L EQ I + ++D IT L Sbjct: 139 FHRLTSGGAQPFFSLKEIKKLIHNYPNLPEQQKIASFL----SKVDEKITLLTEKKAKLT 194 Query: 197 EKKQALVSYIVTKGLN----------PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 E K+ ++ + + P ++ K + + Sbjct: 195 EYKKGVMQQLFNGKWDEQDGQLIFIPPTLRFKADDGSEFPDWTKSTLGDIGKVKMCKRIM 254 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP-GEIVFRFIDLQNDKRSL 305 N +I G ++ + + Y G+I+ Sbjct: 255 ANQTSENGDIPFFKIGTFGREPDAFISQELYDEYRHKFSFPNVGDILMSASGTLGRTV-- 312 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + + T +L Y + K G Q L + Sbjct: 313 --VYDGSPAYFQDSNIVWIENDGSFTTNEFLFYVYQIVKY--QSEGGTIQRLYNNIIMSA 368 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P +KEQ I ++ ID ++ + KE + + Sbjct: 369 VFDNPSLKEQKKIVKFLSA----IDQKIDLANSELEKAKEWKRGLLQQ 412 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 63/185 (34%), Gaps = 10/185 (5%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-----PESYETYQIVDPGEIVFRF 295 VT+ K +ES + ++ ++ + ++ + G+++ Sbjct: 31 VTDGTHDTPKPVESGMPYITAIHVKDGSIDFDNCYYVTPEVHQAIYKRCNPEKGDLLLVN 90 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I ++ + E + A + ID +L + R G + Sbjct: 91 IGAGTATCAINTYDA-EFSMKNVALIKPDREIIDPYFLEQIQRKSTARLFHRLTSGGAQP 149 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +++K+L P + EQ I + + +++D + + + L E + + Sbjct: 150 FFSLKEIKKLIHNYPNLPEQQKIASFL----SKVDEKITLLTEKKAKLTEYKKGVMQQLF 205 Query: 416 TGQID 420 G+ D Sbjct: 206 NGKWD 210 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 51/189 (26%), Gaps = 14/189 (7%) Query: 24 HWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W + K+ + DI + + ++ ++ Sbjct: 235 DWTKSTLGDIGKVKMCKRIMANQTSENGDIPFFKIGTFGREPDAFISQEL--YDEYRHKF 292 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 S G IL G R + +V V Q ++ Sbjct: 293 SFPNVGDILMSASGTLGRTVVYDGSPAYFQDSNIVWI---ENDGSFTTNEFLFYVYQIVK 349 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG T+ I + P L EQ I + + ID I +E KE Sbjct: 350 YQSEGGTIQRLYNNIIMSAVFDNPSLKEQKKIVKFL----SAIDQKIDLANSELEKAKEW 405 Query: 199 KQALVSYIV 207 K+ L+ + Sbjct: 406 KRGLLQQMF 414 >gi|315444137|ref|YP_004077016.1| restriction endonuclease S subunit [Mycobacterium sp. Spyr1] gi|315262440|gb|ADT99181.1| restriction endonuclease S subunit [Mycobacterium sp. Spyr1] Length = 422 Score = 116 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 59/420 (14%), Positives = 133/420 (31%), Gaps = 23/420 (5%) Query: 18 IGA--IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGN 69 IG +P WK I + G T G + + +DV++ Sbjct: 4 IGKSPLPSGWKECRIGELFESWGGHTPSKSMPSYWGDGVPWASSKDVKAPRLASTTHTVT 63 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQF-LVLQPKDVLPELLQ 125 + + + + + G +L L + D + + + E L Sbjct: 64 PQAVEETGLKVCPVGSVLVVMRSGILAHTLPVTVTDVPVAINQDLKAFHSSEPFMNEWLA 123 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L + EG T+ + + +P+PP E+ I + + + + Sbjct: 124 LFLRMSASALLASSRREGTTVQSIQYPLLKGTLIPVPPEDERAQIIGAVRMAVEKQASAL 183 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 ++ +QA+++ + L D + G+ VG + + Sbjct: 184 PHVKTAARAIERFRQAVLTAACSGRLTEDWR----GVAGVGDWDFERAADVCDKVQSGGT 239 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPGEIVFRFIDLQ 299 ++ E + L NI+ + + + V PG+++ + Sbjct: 240 PRSGFTDEPGVPFLKVYNIVSQQVDFGHRPQYVPETVHHRVLKKSVAYPGDVIMNIVGPP 299 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-VFYAMGSGLRQSLK 358 K ++ E + + + I +L + +RS GS + ++ Sbjct: 300 LGKVAIIPDDFPEWNLNQAITIFRPGDRILREWLYYYLRSGLFMDADLITRGSAGQSNIS 359 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + L + VP I EQ + I D L+ +++ + + + + A G+ Sbjct: 360 LTQCRDLQIPVPTIAEQQVLVQRIGELMDHADSLLARVDTAGRRTERISQAVLVKAFRGE 419 >gi|318042340|ref|ZP_07974296.1| restriction modification system DNA specificity domain protein [Synechococcus sp. CB0101] Length = 386 Score = 116 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 52/398 (13%), Positives = 120/398 (30%), Gaps = 28/398 (7%) Query: 38 TGRTSESGKD--IIYIGLEDVESGTGKY-LPKDGNSRQSDTSTVSIFAKGQILYGKLGP- 93 G + ++ I + + + + Y + N G +L G Sbjct: 1 RGISPSYAEEGGICVLNQKCIRDHSINYAHSRRHNLDSKKVPAERYIQIGDVLVNSTGTG 60 Query: 94 YLRKAII----ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI---CEGATM 146 L + + +++P + + + + ++ C G T Sbjct: 61 TLGRVAQVREQPQEATTVDSHVTIVRPDRSIFYREFFGYMLVIIEDALKEAGEGCGGQTE 120 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 L +Q I + + + T + + ++ + Sbjct: 121 LSRSALAEQFSVSYPASLTKQQRIVDILDEAFEALATAKANAEQNLRNALAVFESHLEAA 180 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + + + + + + T E +I+ L Sbjct: 181 FNQKEEGWTEKRLG---ELADFKNGLNFSRNSSGQTLRMVGVGDFQERSIVPLDKLQCTT 237 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND--KRSLRSAQVMERGIITSAYMAVK 324 ++ G+I+ + D R + V E + + ++ Sbjct: 238 IDGNVTEDY---------LIREGDILTVRSNGSKDLVGRCMLVPAVNEMISYSGFIIRIR 288 Query: 325 PHGI--DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 P G +L + M+S + G G ++ + LPVL+PP+K+Q +I N Sbjct: 289 PDGQTTSPRFLLYFMKSRTARSRLTSDGGGTSISNINQAKLATLPVLLPPLKKQEEIANH 348 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ++ + L E+ I L+E ++S + A +G+I Sbjct: 349 LDAFSKESKRLTSIYERKIAALEELKTSLLHQAFSGKI 386 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 67/201 (33%), Gaps = 12/201 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGL---EDVESGTGKYLPKDG-NSRQSDTSTV 78 + W + G + + D + + L K + + + Sbjct: 186 EGWTEKRLGELADFKNGLNFSRNSSGQTLRMVGVGDFQERSIVPLDKLQCTTIDGNVTED 245 Query: 79 SIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDV--LPELLQGWLLS 130 + +G IL + + S + ++P P L ++ S Sbjct: 246 YLIREGDILTVRSNGSKDLVGRCMLVPAVNEMISYSGFIIRIRPDGQTTSPRFLLYFMKS 305 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 R+ + G ++S+ + + +P+ +PPL +Q I + A + L + R Sbjct: 306 RTARSRLTSDGGGTSISNINQAKLATLPVLLPPLKKQEEIANHLDAFSKESKRLTSIYER 365 Query: 191 FIELLKEKKQALVSYIVTKGL 211 I L+E K +L+ + + Sbjct: 366 KIAALEELKTSLLHQAFSGKI 386 >gi|15675718|ref|NP_269892.1| putative type I site-specific deoxyribonuclease [Streptococcus pyogenes M1 GAS] gi|71911435|ref|YP_282985.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS5005] gi|13622936|gb|AAK34613.1| putative type I site-specific deoxyribonuclease [Streptococcus pyogenes M1 GAS] gi|71854217|gb|AAZ52240.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS5005] Length = 399 Score = 116 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 58/396 (14%), Positives = 123/396 (31%), Gaps = 18/396 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + ++ G++ S + G R T K Sbjct: 17 EWEEKELGDIVQITMGQSPSSQNYTTNPSDYILVQGNADIKNGYVFPRVWTTQITKQADK 76 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G I+ P ++ I ++ + + L + + I G Sbjct: 77 GDIILSVRAPV-GDVGKTNYHVIIGRGVAAIKGNEFI----FQILKYLKEIGYWKRISTG 131 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T I + IP L EQ I E +D LI + + + LKE+KQ + Sbjct: 132 STFDSISSSDIKYAKIQIPSLPEQEAIGE----LFQMVDQLIQLQDQKLATLKEQKQTFL 187 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + +++ G + EV + +++ E ++S+ Sbjct: 188 RKMFPAQGQKVPEIRLQGFKGEWEEKKLREVSTHRSGTAIEKYFDSEG-EFKVISIGSYG 246 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR----SAQVMERGIITSA 319 +N+ ++V GE+ D + + + + + Sbjct: 247 TNNLYVDQNIRAVSNELTNSKLVASGELTMVLNDKTANGAIIGRCLLITENNKYVVNQRT 306 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + I S YL + + G + + + V++L + +P +KEQ I Sbjct: 307 EIIRPDINISSYYLFHYLNGEFRNGIIKIAQGGTQIYVNYSSVEQLKINIPTLKEQEAIG 366 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 N +D + + E+ + LK + + + Sbjct: 367 NF----FQTLDQQIAQSEEKLTELKALKQTLLNRLF 398 >gi|293556631|ref|ZP_06675197.1| type IC specificity subunit [Enterococcus faecium E1039] gi|291601217|gb|EFF31503.1| type IC specificity subunit [Enterococcus faecium E1039] Length = 418 Score = 116 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 54/408 (13%), Positives = 137/408 (33%), Gaps = 25/408 (6%) Query: 23 KHWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDV-----ESGTGKYLPKDGNSRQS 73 + W+ + + + + + + I ++ D+ YL Sbjct: 16 EDWEERKLGDMMDVTSVKRIHQSDWTNSGIRFLRARDIVSAAKNEEPSDYLYISEEKYNE 75 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSI 131 + ++G +L +G +I D + I + + + + + Sbjct: 76 YSKISGKVSQGDLLVTGVGSIGVPLLITDDNPIYFKDGNIIWFKNEHKIDGNFFYYSFIN 135 Query: 132 DVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + Q+ G T+ P+ +P EQ+ I ++D I R Sbjct: 136 NKIQKYIRDVAGIGTVGTYTIDSGKKTPISLPTYDEQIKIGSF----FKQLDNTIALHQR 191 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++LLKE K+ + + K +++ G E+ F + Sbjct: 192 KLDLLKETKKGFLQKMFPKNGAKVPEIRFPGFTEDWEQRKLGEIGNFKNGMNFDKSAMGH 251 Query: 251 LIESNIL-SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRS 307 L ++ N+++ +E + E + + G+++F ++ Sbjct: 252 GSPFINLQNIFGRNVLESIEGLGLAESSEKQKAEYNLLNGDVLFVRSSVKPSGVGETALV 311 Query: 308 AQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRL 365 ++ + + +P+ D+ + ++ + D+ S ++ E + ++ Sbjct: 312 SRDYPGTTYSGFIIRFRPNIEFDNNFKRYIFGTKDVRNQIMAKSTSSANTNINQESLAKI 371 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +P I+EQ I A++D + ++ + LLKE + F+ Sbjct: 372 NIRLPKIEEQEKIGKF----FAQLDQTITLHQRKLDLLKETKKGFLQK 415 >gi|307274410|ref|ZP_07555594.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX2134] gi|306508920|gb|EFM78006.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX2134] Length = 398 Score = 116 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 63/401 (15%), Positives = 143/401 (35%), Gaps = 29/401 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W++ + R + T + E + + + E + + + + D S + G Sbjct: 10 WELCKLGRVVERVTRKNKELKSTLP-LTISAQEGLIDQNVFFNKSVASRDVSGYYLIYNG 68 Query: 85 QILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLS-IDVTQRIE 138 + Y K G+ ST +++ +PK++ L+ + + + + Sbjct: 69 EFAYNKSYSNGYPWGAIKRLNRYDMGVLSTLYIIFKPKNIDSNFLEKYYDTSCWYHEVSK 128 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EGA + + + V + K+ ++D IT R +E LKE Sbjct: 129 HAAEGARNHGLLNIAASDFLRTELTVPKSVEEQRKVGNFLKQLDDTITLHQRKLEQLKEL 188 Query: 199 KQALVSYIVT--KGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLI 252 K+A + + P V+ EW +G + + + ++ Sbjct: 189 KKAYLQVMFPAKDERVPKVRFAAFEGEWAHRKLGEITESFS-------GGTPTAGKSEYY 241 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 +I + G I + + + ++V G+I++ + + + Sbjct: 242 GGDIPFIRSGEISSDSTELFITENGLNSSSAKMVKVGDILYALYGATSGEVGISKI---- 297 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-P 371 G I A +A++P D++YL + G + +L VK L +++P Sbjct: 298 TGAINQAILAIRPSKNDNSYLIIQWLRKQKNTIISTYLQGGQGNLSSSIVKNLIIMLPQN 357 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +EQ + R+D ++ + + LK+ ++S++ Sbjct: 358 KEEQEKVGIF----FKRLDDIITLHQNKLEQLKDLKTSYLQ 394 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 57/186 (30%), Gaps = 9/186 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + T+ +G T +GK DI +I ++ S + + ++S+ Sbjct: 215 EWAHRKLGEITESFSGGTPTAGKSEYYGGDIPFIRSGEISSDSTELF---ITENGLNSSS 271 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G ILY G + I+ G + L ++P L L I Sbjct: 272 AKMVKVGDILYALYGATSGEVGISKITGAINQAILAIRPSKNDNSYLIIQWLRKQKNTII 331 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + I M EQ + I + + +L Sbjct: 332 STYLQGGQGNLSSSIVKNLIIMLPQNKEEQEKVGIFFKRLDDIITLHQNKLEQLKDLKTS 391 Query: 198 KKQALV 203 Q + Sbjct: 392 YLQNMF 397 >gi|30022539|ref|NP_834170.1| Type I restriction-modification system specificity subunit [Bacillus cereus ATCC 14579] gi|29898097|gb|AAP11371.1| Type I restriction-modification system specificity subunit [Bacillus cereus ATCC 14579] Length = 414 Score = 116 bits (291), Expect = 6e-24, Method: Composition-based stats. Identities = 49/417 (11%), Positives = 121/417 (29%), Gaps = 35/417 (8%) Query: 21 IPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 IP+ +W P+ + T + + + + + + + Sbjct: 6 IPEIRFAGFTGNWGKKPLTELVERVTRKNKKGESRLP-LTISAQYGLVDQETYFNKTVAS 64 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGW 127 ++ + KG+ Y K G+ S+ ++ +P + Sbjct: 65 TNLEGYYLLYKGEFAYNKSYSNGYPYGAIKRLEKHDKGVLSSLYICFRPLNYSVSSDFLT 124 Query: 128 LLSIDVTQRIEAIC------EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 E + + IP L EQ I + ++ Sbjct: 125 HYFESAVWHKEVSMISVEGARNHGLLNISVSDFFETLHLIPNLVEQTQIGNFL----KQL 180 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 D +I + + LK+ K+ + + K +++ G + Sbjct: 181 DDMIALHQQELTTLKQTKKGFLQKMFPKEGESVPEVRFPGFTGDWEQRKLESIYEKIRNA 240 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFID 297 +E L N+ RN + + + G++V Sbjct: 241 FVGT-ATPYYVEDGHFYLESNNVKDGQINRNTEVFINDEFYEKQKNNWLHTGDLVMVQSG 299 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQS 356 ++ ++ + D +L + +++ K + +G + Sbjct: 300 HVGH-TAVIPEELDNTAAHALIMFSNYREKADPYFLNYQFQTHKSKKKLNNITTGNTIKH 358 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + ++K+ V +P +EQ I N ++D + ++ + LKE + +F+ Sbjct: 359 ILASEMKKFLVDIPKYEEQKMIGNF----FKQLDDAIALHQRELDALKETKKAFLQK 411 >gi|289422992|ref|ZP_06424812.1| restriction modification system DNA specificity domain protein [Peptostreptococcus anaerobius 653-L] gi|289156566|gb|EFD05211.1| restriction modification system DNA specificity domain protein [Peptostreptococcus anaerobius 653-L] Length = 439 Score = 116 bits (290), Expect = 6e-24, Method: Composition-based stats. Identities = 56/419 (13%), Positives = 133/419 (31%), Gaps = 36/419 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGT--GKYLPKDGNSRQSDT 75 P + +K + G + +S + + + +++ + G+++ + + + Sbjct: 13 PDGVEYKKLKEVCRFQNGFSFKSSKFTNEGKPILRITNIQDNSISGEFVCFSKDDYKENL 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + G + G K D + + + P + + Sbjct: 73 ESY-LVSPGDTVVAMSGATTGKIGYNYSDKYYYLNQRVGLFVPNESWLMKRYLFHWLSSQ 131 Query: 134 TQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 TQ I + G+ + + +P+PPL Q I + + T+ L E + Sbjct: 132 TQNIYNVSSGSGAQPNLSSVKMMEFVIPVPPLEVQREIVRILDSFTLLTAELTAELTAEL 191 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 ++ Y + L P + P + ++ K ++ Sbjct: 192 TAELTARKKQYDYYRDELLKPKANI-----------PMVKLKEIATSIYRGAGIKRDQVT 240 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETY----QIVDPGEIVFRFIDLQNDKRSLRSA 308 E I + YG I T + E Y + + G+I+F + + A Sbjct: 241 EEGIPCVRYGEIYTTYNTWFGECVSHTKEEYVPSPKYFEHGDILFAITGESVEDIAKSIA 300 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPV 367 V + + V H + YLA ++ + + + ++++ + Sbjct: 301 YVGHDKCLAGGDIVVMKHEQNPRYLAHVLNTSMAREQKSKGKVKSKVVHSNVPSIEQIEI 360 Query: 368 LVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKERRSSFIAAAVTGQI 419 +PP+ Q V++ + L +E ++ + + A TG I Sbjct: 361 PLPPLDVQKRYAEVLDNFEKICNDLNIGLPAEIEARQKQYEFYRNL---LLTFAETGNI 416 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 56/189 (29%), Gaps = 8/189 (4%) Query: 228 VPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-- 282 PD E K + N K++K L NI + + Sbjct: 12 CPDGVEYKKLKEVCRFQNGFSFKSSKFTNEGKPILRITNIQDNSISGEFVCFSKDDYKEN 71 Query: 283 --YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 +V PG+ V K + + YL + S Sbjct: 72 LESYLVSPGDTVVAMSGATTGKIGYNYSDKYYYLNQRVGLFVPNESWLMKRYLFHWLSSQ 131 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + GSG + +L + + VPP++ Q +I +++ T L ++ + Sbjct: 132 TQNIYNVSSGSGAQPNLSSVKMMEFVIPVPPLEVQREIVRILDSFTLLTAELTAELTAEL 191 Query: 401 V-LLKERRS 408 L R+ Sbjct: 192 TAELTARKK 200 >gi|330721464|gb|EGG99514.1| Type I restriction-modification system2C specificity subunit S [gamma proteobacterium IMCC2047] Length = 413 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. Identities = 55/414 (13%), Positives = 127/414 (30%), Gaps = 31/414 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP W + + + + E T Y N + Sbjct: 18 IPNDWLFLKLTDICN-----------PKQWRTIASNEMSTSGYPVFGANGFVGFYHEYNH 66 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + G + + L ++ +++I Sbjct: 67 -EDETVAITCRGNTCGTINRIPPKTYITGNSMALDDIKSDLVSQNYLFYALKYRGVVDSI 125 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ G+ I P PPL EQ I + + I+ + + +L +Q Sbjct: 126 -SGSAQPQITGAGLKFIEFPAPPLPEQQKIAAILSSVDEVIEKTRAQIDKLKDLKTGMRQ 184 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK---NTKLIESNIL 257 L++ V + KDS + + + D ++ + E + Sbjct: 185 ELLTKGVGH-----TEFKDSPVGRIPVGWDVVPLEKLVKAGKNITYGIVQAGPHYEGGVP 239 Query: 258 SLSYGNIIQKLETRNMGL----KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + ++ + +RN L + V G+IV+ + + + + Sbjct: 240 YIRVSDMTGRSLSRNGMLLTSPEIAEKYERSAVSSGDIVYALRGVIGHVQ-IVPKDLDGA 298 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPI 372 + +D+ YL W M+S + G + + ++++ + P + Sbjct: 299 NLTQGTARVSPNELVDTRYLLWAMKSPYVEYQNDLEAKGSTFREVTLASLRKIQIATPEL 358 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 EQ I +++ +I +E +V L+ + + + +TG++ + E + Sbjct: 359 NEQKRIASILGSVELKIFA----VEDKLVHLESIKKALMQDLLTGKVRVNVEQK 408 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 78/214 (36%), Gaps = 20/214 (9%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTK----LNTGRT---SESGKDIIYIGLEDVESGTG 61 ++KDS V G IP W VVP+++ K + G + YI + D+ TG Sbjct: 195 EFKDSPV---GRIPVGWDVVPLEKLVKAGKNITYGIVQAGPHYEGGVPYIRVSDM---TG 248 Query: 62 KYLPKDGNSRQSDTSTVSI----FAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQ 115 + L ++G S + G I+Y G I+ + + Sbjct: 249 RSLSRNGMLLTSPEIAEKYERSAVSSGDIVYALRGVIGHVQIVPKDLDGANLTQGTARVS 308 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 P +++ W + + + +G+T + I + P L EQ I + Sbjct: 309 PNELVDTRYLLWAMKSPYVEYQNDLEAKGSTFREVTLASLRKIQIATPELNEQKRIASIL 368 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + ++I + + + + K Q L++ V Sbjct: 369 GSVELKIFAVEDKLVHLESIKKALMQDLLTGKVR 402 >gi|189463334|ref|ZP_03012119.1| hypothetical protein BACCOP_04051 [Bacteroides coprocola DSM 17136] gi|189429953|gb|EDU98937.1| hypothetical protein BACCOP_04051 [Bacteroides coprocola DSM 17136] Length = 468 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. Identities = 54/399 (13%), Positives = 117/399 (29%), Gaps = 22/399 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ W + + G + I +D ++G + S + Sbjct: 70 EVPESWVWCKFQDCMDVRDGTHDSPKYTQEGYPLITSKDFKNGQFDFSKTRYISEVDYKN 129 Query: 77 --TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF----LVLQPKDVLPELLQGWLLS 130 S G ILY +G + I D L ++ Sbjct: 130 IIKRSKVDIGDILYSMIGGNIGSMIYIQHDNYFDMAIKNVALFKPYQNSDISTKYIAYFL 189 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +AI G N +P+PPLAEQ I +I ID + ++ Sbjct: 190 ESKIKEYQAIAIGGAQPFVGLDIFRNTLVPLPPLAEQHRIITEIEKWLALIDQIEQGKVD 249 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++K+ K ++ + L P + I+ + + + Sbjct: 250 LQTIIKQTKSKILDLAIHGKLVPQDPNDEPAIKLLKRINPDFTPCDNGHSRKLPQGWAYC 309 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKP----ESYETYQIVDPGEIVF--------RFIDL 298 + + + + G++ + +++ G I L Sbjct: 310 QLSNVLKITMGQSPKGDSLNNKRGIEFHQGKICFSDKFLLESGIFTNEPTKIAEPNSILL 369 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + I A+ P + + +L+++ + G +++ Sbjct: 370 CVRAPVGVVNITKNQICIGRGLCALTPFEGNVDFYFYLLQTLQDSFDNQSTG-TTFKAIS 428 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 E ++ +++PP+ EQ I I D + +E Sbjct: 429 GEIIRNENIILPPLAEQQRIVQKIEELFHVFDNIQNALE 467 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 65/200 (32%), Gaps = 8/200 (4%) Query: 227 LVPDHWEVKPFFAL--VTELNRKNTKLIESNILSLSYGNIIQKLETRNMG-----LKPES 279 VP+ W F V + + K + ++ + + + ++ Sbjct: 70 EVPESWVWCKFQDCMDVRDGTHDSPKYTQEGYPLITSKDFKNGQFDFSKTRYISEVDYKN 129 Query: 280 YETYQIVDPGEIVFRFIDL-QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 VD G+I++ I ++ + I A + ST Sbjct: 130 IIKRSKVDIGDILYSMIGGNIGSMIYIQHDNYFDMAIKNVALFKPYQNSDISTKYIAYFL 189 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + G + + + + V +PP+ EQ I I A ID + + Sbjct: 190 ESKIKEYQAIAIGGAQPFVGLDIFRNTLVPLPPLAEQHRIITEIEKWLALIDQIEQGKVD 249 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 ++K+ +S + A+ G+ Sbjct: 250 LQTIIKQTKSKILDLAIHGK 269 >gi|218247027|ref|YP_002372398.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 8801] gi|218167505|gb|ACK66242.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8801] Length = 457 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. Identities = 67/421 (15%), Positives = 137/421 (32%), Gaps = 30/421 (7%) Query: 22 PKHWKVVPIKR-FTKLNTGRT-----SESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSD 74 P W+++P+K T ++ G + I + D+ ++G Y Sbjct: 32 PPEWQLIPLKNAVTYIDYGYSHSIPKIPPENGIKIVSTADISKTGELLYSQIRKVEAPLK 91 Query: 75 TSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 T G +L+ + I + VL+ K + + + Sbjct: 92 TIQRLTLHDGDVLFNWRNSSYLIGKTTIFEEQSEPHIFASFVLRLKCDEIKSHNYFFKYL 151 Query: 132 DVTQRIEAICEGATMSHADWKGIG-----NIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 R I E + ++ +P+PP+ EQ I + I I Sbjct: 152 LNYYRYSGIFESLARRAVNQANFNKNEVSDLIIPLPPIEEQRKIASVL----TLIQEAIQ 207 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK---PFFALVTE 243 E+ I L E K+AL+ + T+G+N + K + I + + + + T Sbjct: 208 EQENAIALTTELKKALMQKLFTEGIN-NEPQKMTEIGLIPESWEVVNLGNLAKLKSGGTP 266 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQN 300 +K +I + I L T + + ++ G ++ Sbjct: 267 SRKKIEYWENGSIPWVKTTEINYDLITTTEEYITKEGLVNSSAKMFSKGTLLMAMYGQGV 326 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + + + + ST + SY K+ + +L Sbjct: 327 TRGRVGILDIDATTNQACVAIMPNSEDKLSTKFLYHYFSYHYEKLRNQGHGANQSNLSST 386 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 +K P+ P I+EQ I N + +++ + I +L++ S+ + +T QI Sbjct: 387 ILKMFPITFPKIQEQLIIINHFDTLNLKLEQ----SHKRITILQDLFSTLLHQLMTAQIR 442 Query: 421 L 421 + Sbjct: 443 V 443 Score = 84.8 bits (208), Expect = 3e-14, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 67/197 (34%), Gaps = 10/197 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNS 70 IG IP+ W+VV + KL +G T K I ++ ++ + Sbjct: 242 IGLIPESWEVVNLGNLAKLKSGGTPSRKKIEYWENGSIPWVKTTEINYDLITTTEEYITK 301 Query: 71 RQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGW 127 S+ +F+KG +L G + I D D + + + P + Sbjct: 302 EGLVNSSAKMFSKGTLLMAMYGQGVTRGRVGILDIDATTNQACVAIMPNSEDKLSTKFLY 361 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 +++ GA S+ + P+ P + EQ++I ++++ Sbjct: 362 HYFSYHYEKLRNQGHGANQSNLSSTILKMFPITFPKIQEQLIIINHFDTLNLKLEQSHKR 421 Query: 188 RIRFIELLKEKKQALVS 204 +L L++ Sbjct: 422 ITILQDLFSTLLHQLMT 438 >gi|253583390|ref|ZP_04860588.1| predicted protein [Fusobacterium varium ATCC 27725] gi|251833962|gb|EES62525.1| predicted protein [Fusobacterium varium ATCC 27725] Length = 507 Score = 116 bits (290), Expect = 7e-24, Method: Composition-based stats. Identities = 70/466 (15%), Positives = 157/466 (33%), Gaps = 70/466 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 IP++W+ V + + + TG T K+I ++ D+ + + Sbjct: 26 EIPENWEWVKLGKVNNVITGSTPSKANEKYWENKNIFFVKPSDLYQK-RNLKSSEEYIDE 84 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL--LS 130 V I K L +G K ++ + + Q L PK + L + S Sbjct: 85 RARDNVRILPKYSTLICCIGSI-GKVAYSEVEVSTNQQINSLVPKKEIIFSLYNYYVANS 143 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 ++ T++ + N+ P+PPL EQ I EK+ + +I+ Sbjct: 144 NFFQSQMLNSAVATTIAILNKTNTENLRFPLPPLEEQKRIVEKLDSMFEKINRAKELIQE 203 Query: 191 FIELLKEKKQALVSYIVTKGLNPDV----------------------------------- 215 E ++ +K+++++ L + Sbjct: 204 AKENIENRKESILNKAFRGELTVEWRKNNQTEDAIELLKSINDEKIKNWEQECVEAEKNG 263 Query: 216 -------------KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT-KLIESNILSLSY 261 M S E +P W+ ++ +K + E+ +S Sbjct: 264 KKKPSKPKIEDIQNMIISKEEEPYEIPSKWKWVKLEYIIEINPKKKMLNIDENEKISFLP 323 Query: 262 GNIIQKLETRNMGLKPESYET----YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI-- 315 I + ++ ESY Y +I+F I + A+ ++ I Sbjct: 324 MRSISDITGEISNIEYESYSKLKKGYTQFLENDILFAKITPCMENGKCVIAKNLKNEIGY 383 Query: 316 -ITSAYMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPI 372 T ++ + +++ +L +R + + GS + + E +K +PP+ Sbjct: 384 GTTEFHVLRTNYILNNKFLHNFLRQESFRQEAKYNMTGSVGFRRVPTEFLKEYMFPLPPL 443 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +EQ +I +++ + + ++++ + ++ S + A G+ Sbjct: 444 EEQKEIVRILDEILEK-ESKIKELVELEEAIELLEKSILDKAFRGK 488 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 71/208 (34%), Gaps = 12/208 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNT------KLIESNILSLSYGNIIQKLETRNMGLK 276 E +P++WE + + NI + ++ QK ++ Sbjct: 22 EQPYEIPENWEWVKLGKVNNVITGSTPSKANEKYWENKNIFFVKPSDLYQKRNLKSSEEY 81 Query: 277 PESY--ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + +I+ + I + I + + K I S Y Sbjct: 82 IDERARDNVRILPKYSTLICCIGSIGKVAYSEVEVSTNQQINS---LVPKKEIIFSLYNY 138 Query: 335 WLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++ S + + L + + L +PP++EQ I ++ +I+ Sbjct: 139 YVANSNFFQSQMLNSAVATTIAILNKTNTENLRFPLPPLEEQKRIVEKLDSMFEKINRAK 198 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDL 421 E I+++ ++ R+ S + A G++ + Sbjct: 199 ELIQEAKENIENRKESILNKAFRGELTV 226 >gi|312977435|ref|ZP_07789183.1| phosphoribosylformylglycinamidine synthase [Lactobacillus crispatus CTV-05] gi|310895866|gb|EFQ44932.1| phosphoribosylformylglycinamidine synthase [Lactobacillus crispatus CTV-05] Length = 480 Score = 116 bits (290), Expect = 8e-24, Method: Composition-based stats. Identities = 63/416 (15%), Positives = 135/416 (32%), Gaps = 59/416 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP W+ V + L G+T + Y ++D+ + Y+ N Sbjct: 73 DIPDSWEWVRLGDVGLLKNGKTPKKEDISSDNIYPYFKVKDMNNNNL-YMENVKNWVGEK 131 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK---DVLPELLQGWLLSI 131 S + K I++ P AI+ I S LV +L ++ + Sbjct: 132 YS-RQVMPKNTIIF----PKNGGAILTAKKRILSQDSLVDLNTGGLIPYNDLNHKFIFYL 186 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ I+ +G+ + + K + +P+PPL EQ I KI + + + ++ Sbjct: 187 FLSLDIKDFVKGSAVPTINSKKLKETLVPLPPLEEQSRIAAKIAQLFALLRKVESSTQQY 246 Query: 192 IELLKEKKQALVSYIVTKGL---NPDVK----------------------------MKDS 220 +L K ++ + L +P + + Sbjct: 247 AKLQTLLKSKVLDLAMRGKLVEQDPHDEPASVLLEKIKAEKRKMIKEKEIKKSKPLPPIT 306 Query: 221 GIEWVGLVPDHWEVKPFFA------LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 E +PD WE T N+ + I +++ N Sbjct: 307 DEEKPFDIPDSWEWVRLGNIAKRITDGTHNPPPNSHEGKQVISAINIKKGKIDFSLSNRF 366 Query: 275 LKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + + + G+++ + + + ++ +AV I S Sbjct: 367 VSEDQFLKEDKRTNIRKGDVLLTIVGSLGNAAVV----DTDKLFTAQRSVAVISSNILSK 422 Query: 332 YLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +L +++ S +A G ++ + + L + +PP+ EQ I + I+ Sbjct: 423 FLYYVLISAMFKTQIFANAKGTTQKGIYLSKLINLKLPLPPLAEQNRIVDKIDNLF 478 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 71/204 (34%), Gaps = 9/204 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + E +PD WE + N K K + + ++ ++ + N+ ++ Sbjct: 66 TDDEKPFDIPDSWEWVRLGDVGLLKNGKTPKKEDISSDNIYPYFKVKDMNNNNLYMENVK 125 Query: 280 YE-----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + Q++ I+F R + + + + ++ ++ Sbjct: 126 NWVGEKYSRQVMPKNTIIFPKNGGAILTAKKRILSQDSLVDLNTGGLIPY-NDLNHKFIF 184 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +L S D+ ++ + +K V +PP++EQ I I A + + Sbjct: 185 YLFLSLDIKDFVK---GSAVPTINSKKLKETLVPLPPLEEQSRIAAKIAQLFALLRKVES 241 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 +Q L +S + A+ G+ Sbjct: 242 STQQYAKLQTLLKSKVLDLAMRGK 265 >gi|297205945|ref|ZP_06923340.1| type Ic restriction-modification system [Lactobacillus jensenii JV-V16] gi|297149071|gb|EFH29369.1| type Ic restriction-modification system [Lactobacillus jensenii JV-V16] Length = 428 Score = 116 bits (290), Expect = 8e-24, Method: Composition-based stats. Identities = 65/405 (16%), Positives = 138/405 (34%), Gaps = 29/405 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDGNSRQS---DTST 77 WK V + ++ G T + + G E G YL + S+ Sbjct: 38 WKKVKLGDVAEIIGGGTPSTSNLEYWDGNINWFTPTEVGKTIYLHESQRKLSELGLKKSS 97 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G IL+ II + + F +QP + + + LS + + Sbjct: 98 ARLLNPGAILFTSRAGIGNTGIIINPSA-TNQGFQSIQPNKNIIDSYFIFCLSSRLKRYA 156 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G+T + + + I EQ I I + + + +L + Sbjct: 157 LKHSAGSTFTEISGSEMKKAKIRICAKNEQNKISTCIKSLDSLLSLQQRKLELENQLKQF 216 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q L S + L P V+ + W + V + RKN L + L Sbjct: 217 NLQNLFSD--EQRLYPKVRFRGFDEPW--------KKVKLGRNVKRIRRKNKNLETNIPL 266 Query: 258 SLS-YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGI 315 ++S ++ + + + E+ Y ++ GE + + ++ + G Sbjct: 267 TISAQFGLVDQRDFFGRVVASENLANYILLKRGEFAYNKSYSKEAPYGSIKRLEKYNEGA 326 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVP 370 +++ Y+A P I+S +L + + + G R ++ +D + + +P Sbjct: 327 LSTLYIAFTPENINSDFLKAFFDTTKWYSHIVQVSTEGARNHGLLNISPQDFFEMSITIP 386 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ +I+ + N+ + L+ +Q I ++ + + Sbjct: 387 KSDEQNNISRIYNLM----NSLLSLQQQDINTTQQLKQFLLQNLF 427 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/233 (12%), Positives = 74/233 (31%), Gaps = 21/233 (9%) Query: 204 SYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + L P V+ + W +G V + E N + Sbjct: 18 THADEQRLYPKVRFRGFDEPWKKVKLGDVAEIIGGGTPSTSNLEYWDGNINWFTPTEVGK 77 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + +GLK + ++++PG I+F + + + +G + Sbjct: 78 TIYLHESQRKLSELGLK---KSSARLLNPGAILFTSRAGIGNTGIIINPSATNQGFQS-- 132 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 I +Y + + S + ++K+ + + EQ I+ Sbjct: 133 --IQPNKNIIDSYFIFCLSSRLKRYALKHSAGSTFTEISGSEMKKAKIRICAKNEQNKIS 190 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG------QIDLRGESQ 426 I +D L+ ++ + L + + + + ++ RG + Sbjct: 191 TCI----KSLDSLLSLQQRKLELENQLKQFNLQNLFSDEQRLYPKVRFRGFDE 239 >gi|298736552|ref|YP_003729078.1| type I restriction enzyme subunit S [Helicobacter pylori B8] gi|298355742|emb|CBI66614.1| type I restriction enzyme, S subunit [Helicobacter pylori B8] Length = 442 Score = 116 bits (289), Expect = 9e-24, Method: Composition-based stats. Identities = 50/428 (11%), Positives = 129/428 (30%), Gaps = 43/428 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNLEFWKNGTIPWFRMEDLRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCNLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T ++ Sbjct: 132 LLGEWCKNNINVSGFASVDMTAFKKYKFPIPPLEVQQEIVKILDAFTELNTE-----LKA 186 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV---------PDHWEVKPFFALVT 242 + E Q ++ N + + P E + + Sbjct: 187 RKKQYEYYQNMLLDFKDIKQNHKDAKEKLAQKTYPKRLKTLLQTLAPKGVEFRKLGDIGE 246 Query: 243 ELNR-------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 ++ + + ++ N Q ++ E + G+++F Sbjct: 247 FYGGLVGKNKKSFSQGNKFYVPYINVFNNPQLDLNALESVQIGDKEKQNTIQLGDVLFTG 306 Query: 296 ID------LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 + + + + + + + + ++L +R Y+ K + Sbjct: 307 SSENLEDCAMSCVVTQKIEKDIYLNSFCFGFRFFDENLFNPSFLKHFLRDYNFRKNISKV 366 Query: 350 GSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE--- 405 +G R ++ + + ++ + +PP++ Q +I +++ + L+ I I K+ Sbjct: 367 ANGVTRFNVSKQLLSKITIPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQYE 426 Query: 406 -RRSSFIA 412 R + Sbjct: 427 YYREKLLT 434 >gi|199599710|ref|ZP_03213071.1| restriction modification system DNA specificity domain [Lactobacillus rhamnosus HN001] gi|199589396|gb|EDY97541.1| restriction modification system DNA specificity domain [Lactobacillus rhamnosus HN001] Length = 420 Score = 116 bits (289), Expect = 9e-24, Method: Composition-based stats. Identities = 66/405 (16%), Positives = 148/405 (36%), Gaps = 27/405 (6%) Query: 25 WKVVPIKRFTK-LNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNS--RQSDTSTV 78 W + + + G E Y+ + D++ + +LP+ S + T Sbjct: 24 WVQRNLADLSDGFSYGLNAAAKEYDGVHGYLRITDIDEVSHSFLPEGLTSPDVPENQLTD 83 Query: 79 SIFAKGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + I+Y + G K I K+ + + L+ Sbjct: 84 YRMDEQSIVYARTGASTGKTYIYRDSDGELYYAGFLIRQKVNKETSAQFVYQNTLTKAWE 143 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + ++ + + + + + +G + IP AEQ +KI +D LI R ++L Sbjct: 144 RYVQVMSQRSGQPGINAQEVGRFELTIPEKAEQ----DKIAHLFNSLDNLIAANQRKLDL 199 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-NTKLIE 253 LKE+K+ + + K + +++ +G + ++ + T L K Sbjct: 200 LKEQKKGYLQKMFPKNGSKFPQLRFAG---FADAWEQRKLGELGSTFTGLTGKTKEDFGH 256 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLRSAQ--V 310 + ++Y N+ Q L + Q V G++ F ++ + S Sbjct: 257 GDAKFVTYMNVFQNAVASLEQLDSVEIDPKQNEVKKGDVFFTTSSETPEEVGMSSVWKYN 316 Query: 311 MERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVL 368 + + S +P+ D YLA ++RS + K + G+ R ++ + + V Sbjct: 317 YDNVYLNSFTFGYRPNIEFDLDYLAAMLRSTTVRKKITFLAQGISRYNISKTKMMDIEVP 376 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 VP ++EQ I + + ++ + ++ + L+E + ++ Sbjct: 377 VPSLEEQAKIGAFL----SNVEQTITLHQRKLEKLQELKKGYLQK 417 >gi|21674692|ref|NP_662757.1| type I restriction system specificity protein [Chlorobium tepidum TLS] gi|21647899|gb|AAM73099.1| type I restriction system specificity protein [Chlorobium tepidum TLS] Length = 474 Score = 116 bits (289), Expect = 9e-24, Method: Composition-based stats. Identities = 58/427 (13%), Positives = 132/427 (30%), Gaps = 56/427 (13%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + +L GR D+ S +G+Y +G S + +++ + Sbjct: 17 EWKALGEIIQLEKGRQLNK----------DLLSSSGRYPAYNGGMSYSGFTDSYNYSENK 66 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + + G + V+ P + + + +R+ + GA Sbjct: 67 TIISQGGASAGFVNFVTTKFYANAHCYVVLPDTEVVDNRYIYHFLKLNEERLTSCQHGAG 126 Query: 146 MSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + I ++ +PIP LA Q I + A T L E + Sbjct: 127 IPALRASEITSLKIPIPCPDNPKKSLAIQAEIVRILDAFTELTAELTAELTARKKQYAYY 186 Query: 199 KQALVSYIVTKGLNP-------------------DVKMKDSGIEWVGLVPDHWEVKPFFA 239 + L+++ +P S E P + Sbjct: 187 RDRLLTFTTPPYGHPSKGGELFSLFGHPSEGGELFTPYGHSVEERELNSPSLKGWQAQPD 246 Query: 240 LVTELNRKNTKLIESNIL---------------SLSYG----NIIQKLETRNMGLKPESY 280 V + K + I + YG + + PE Sbjct: 247 GVVPVEWKTLGEVGHFIRGSGIQKSDFKASGVGCIHYGQIHTHYGTWTTETKSFIDPEFA 306 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + PG++V +D + A + + S + H + Y+++ ++ Sbjct: 307 NRLKKAKPGDLVIATTSEDDDAVAKAVAWIGTEDVAVSTDAYIFRHTANPKYMSYFFQTD 366 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + +G + + +++ ++ + +PP+ EQ I +++ A + L E + + Sbjct: 367 MFQEQKKPYITGTKVRRISGDNLAKILIPIPPLAEQERIVAILDQFDALTNSLTEGLPRE 426 Query: 400 IVLLKER 406 I L +++ Sbjct: 427 IELRQKQ 433 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 62/201 (30%), Gaps = 13/201 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 G +P W + G ++ + I + + G + + + + Sbjct: 247 GVVPVEW--KTLGEVGHFIRGSGIQKSDFKASGVGCIHYGQIHTHYGTWTTETKSFIDPE 304 Query: 75 TSTV-SIFAKGQILYGKLGPYLRKA-----IIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + G ++ I D ST + + P+ + + Sbjct: 305 FANRLKKAKPGDLVIATTSEDDDAVAKAVAWIGTEDVAVSTDAYIFR-HTANPKYMSYFF 363 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + ++ + G + + I +PIPPLAEQ I + ++L Sbjct: 364 QTDMFQEQKKPYITGTKVRRISGDNLAKILIPIPPLAEQERIVAILDQFDALTNSLTEGL 423 Query: 189 IRFIELLKEKKQALVSYIVTK 209 R IEL +++ + + Sbjct: 424 PREIELRQKQYAYYRDLLFSF 444 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 14/149 (9%), Positives = 44/149 (29%), Gaps = 10/149 (6%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 N G+ + + + + + + + +D+ Sbjct: 47 YNGGMSYSGFTDSYNYSENKTIISQGGASAGFVNFVTTKFYANAHC--YVVLPDTEVVDN 104 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVIN 383 Y+ ++ + G+G+ +L+ ++ L + +P + Q +I +++ Sbjct: 105 RYIYHFLKLNEERLTSCQHGAGI-PALRASEITSLKIPIPCPDNPKKSLAIQAEIVRILD 163 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIA 412 T L ++ R + Sbjct: 164 AFTELTAELTAELTARKKQYAYYRDRLLT 192 >gi|146281033|ref|YP_001171186.1| hypothetical protein PST_0638 [Pseudomonas stutzeri A1501] gi|145569238|gb|ABP78344.1| conserved hypothetical protein [Pseudomonas stutzeri A1501] Length = 215 Score = 116 bits (289), Expect = 9e-24, Method: Composition-based stats. Identities = 46/178 (25%), Positives = 83/178 (46%), Gaps = 9/178 (5%) Query: 257 LSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 +SYG++ + E + + ++ G++ F D+ + + E Sbjct: 23 PFVSYGDVYKNDVLPAEVTGLVQSSPEDQQRYSIEYGDVFFTRTSETVDEIGFSATCLQE 82 Query: 313 --RGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPV 367 + + +P + + + R+ L F + R SL + +K LPV Sbjct: 83 LPNAVFAGFLIRFRPTGKSLTPGFSKYYFRNQGLRIFFNKEMNLVTRASLSQDLLKLLPV 142 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +PP+ EQ I++ ++ TA L+E+ ++I LLKERRS+ I+AAVTG+ID+RG Sbjct: 143 TLPPVVEQIKISDFLDRVTAEFASLLEQGIKAIDLLKERRSALISAAVTGKIDVRGWQ 200 Score = 44.0 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 69/194 (35%), Gaps = 12/194 (6%) Query: 31 KRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-FAKGQI 86 + + G G ++ DV G + S G + Sbjct: 2 RYLGECQNGINIGGEAFGSGSPFVSYGDVYKNDVLPAEVTGLVQSSPEDQQRYSIEYGDV 61 Query: 87 LYGKLGPYLRKAIIADFD------GICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIE 138 + + + + + + + + +P K + P + + + + Sbjct: 62 FFTRTSETVDEIGFSATCLQELPNAVFAGFLIRFRPTGKSLTPGFSKYYFRNQGLRIFFN 121 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 T + + +P+ +PP+ EQ+ I + + T +L+ + I+ I+LLKE+ Sbjct: 122 KEMNLVTRASLSQDLLKLLPVTLPPVVEQIKISDFLDRVTAEFASLLEQGIKAIDLLKER 181 Query: 199 KQALVSYIVTKGLN 212 + AL+S VT ++ Sbjct: 182 RSALISAAVTGKID 195 >gi|227499338|ref|ZP_03929450.1| restriction modification system DNA specificity domain protein [Anaerococcus tetradius ATCC 35098] gi|227218591|gb|EEI83829.1| restriction modification system DNA specificity domain protein [Anaerococcus tetradius ATCC 35098] Length = 495 Score = 116 bits (289), Expect = 9e-24, Method: Composition-based stats. Identities = 68/451 (15%), Positives = 142/451 (31%), Gaps = 62/451 (13%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIG 52 K K P+ + + + IP+ WK V + + G ++ DI +I Sbjct: 50 KKQKPLPEITEEEIPF--DIPESWKWVRLGDVFQFINGDRGKNYPAKSKLKENGDIPFIS 107 Query: 53 LEDVESGTGK---YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII-ADFDGICS 108 +++ GT L D N + S + K I+ G + I + I S Sbjct: 108 AINLKDGTVDENNLLYLDINQYERLGSGKLL--KNDIVLCIRGSLGKNCIYPFEKGAIAS 165 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 + ++ K + E + +L S + G + + I +P+PPL EQ Sbjct: 166 SLVILRNYKKIKLEFVLNYLNSYLFYSETKKYDNGTAQPNLSAQNAKKILLPLPPLKEQE 225 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGLNPDVKMKDSGIEW 224 I EKI + +D +L K+ ++L+ + L K + +G E Sbjct: 226 RIVEKIEDLMLLVDKYGKNWQMLEDLNKKFPEDLKKSLLQEAIKGRLVEQRKEEGTGEEL 285 Query: 225 V-------------------------------GLVPDHWEVKPFFAL---VTELNRKNTK 250 +P+ W+ + +T+ K Sbjct: 286 FELIKEEKNKLIKEGKIKKQKPLPEITEEEIPFDIPESWKWVRLGEITLKLTDGAHKTPT 345 Query: 251 LIESNILSLSYGNIIQKLETR-----NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 I LS +I + + + G+++ + + Sbjct: 346 YTNEGIPFLSVKDISSGKIDYSSCRFISKKEHDKLFERCNPERGDLLLTKVGTTGIPVVI 405 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKR 364 ++ A + I+ +L L+ S + G+ ++ D+ Sbjct: 406 -DTDEEFSLFVSVALLKFPKKLINIYFLKHLINSPLVQVQVKENTRGVGNKNWVMRDIAN 464 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + +PP+ EQ + + + +++ Sbjct: 465 TIIPLPPLAEQKRLVEKLEELLPLCEQVIKN 495 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 41/246 (16%), Positives = 87/246 (35%), Gaps = 20/246 (8%) Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR-- 246 +L++E+K L+ K P ++ + E +P+ W+ + +N Sbjct: 30 EELYKLIQEEKNKLIKEGKVKKQKPLPEI--TEEEIPFDIPESWKWVRLGDVFQFINGDR 87 Query: 247 ------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 K+ +I +S N+ N L Y+ + G+++ I L Sbjct: 88 GKNYPAKSKLKENGDIPFISAINLKDGTVDEN-NLLYLDINQYERLGSGKLLKNDIVLCI 146 Query: 301 DKRSLRSAQVMER--GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSL 357 ++ I +S + I ++ + SY + +L Sbjct: 147 RGSLGKNCIYPFEKGAIASSLVILRNYKKIKLEFVLNYLNSYLFYSETKKYDNGTAQPNL 206 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-----KERRSSFIA 412 ++ K++ + +PP+KEQ I I +D K Q + L ++ + S + Sbjct: 207 SAQNAKKILLPLPPLKEQERIVEKIEDLMLLVDKY-GKNWQMLEDLNKKFPEDLKKSLLQ 265 Query: 413 AAVTGQ 418 A+ G+ Sbjct: 266 EAIKGR 271 >gi|331000344|ref|ZP_08324025.1| type I restriction modification DNA specificity domain protein [Parasutterella excrementihominis YIT 11859] gi|329572140|gb|EGG53805.1| type I restriction modification DNA specificity domain protein [Parasutterella excrementihominis YIT 11859] Length = 417 Score = 116 bits (289), Expect = 1e-23, Method: Composition-based stats. Identities = 68/407 (16%), Positives = 137/407 (33%), Gaps = 37/407 (9%) Query: 25 WKVVPIKRFT-KLNTGRTSESGK------DIIYIGLEDVESGT--GKYLPKDGNSRQSDT 75 W+ ++ K G T + +I +I D+E + K + Sbjct: 18 WEQRKLEELASKFTGGGTPNTSNPNYWNGEIPWIQSSDLEEDDVLSLTVKKHISQEGLKN 77 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S I K ++ + + S FL L L L S+ Sbjct: 78 SAAKIIPKNSLVIVTRVGVGKLVVNTQEIA-TSQDFLSLSGIKGNSRFLAYSLYSLLKKI 136 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +G ++ + +P L EQ I ++ IT R + Sbjct: 137 TQR--VQGTSIKGITKTDFLKEAIFVPSLEEQEKISSCMVEVDKL----ITLHQRKLNRF 190 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 ++ + + + K ++ +G WE + + RKN K + Sbjct: 191 QKIRTTFLQKMFPKNGETKPAIRLTG------FNADWEQEKLQNFAVRITRKNIKKQNNR 244 Query: 256 ILSLS-YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF-IDLQNDKRSLRSAQVMER 313 L++S ++ + N + + Y ++ GE + ++R E Sbjct: 245 PLTISAQHGLVDQTVYFNNRVAAQDVSNYYLIKKGEFAYNRSTSKDAPVGAVRRLVDYEE 304 Query: 314 GIITSAYMAV---KPHGIDSTYLAWLMRSYDLC-KVFYAMGSGLRQ----SLKFEDVKRL 365 G++++ Y+ P +D YL++ + + G R ++ +D L Sbjct: 305 GVLSTLYLVFSITDPQHVDPNYLSYFFETTGWHSWILERAAEGARNHGLLNVSSQDFLSL 364 Query: 366 PVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 PV++P ++EQ I +ID + +E+ LLK+ +SS + Sbjct: 365 PVMLPSSLEEQQKIGEF----FQKIDDCIILLERQADLLKQIKSSLL 407 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 69/202 (34%), Gaps = 5/202 (2%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 N VK++ G E+ F N N I + ++ + Sbjct: 4 NKSVKIRFKGFTEAWEQRKLEELASKFTGGGTPNTSNPNYWNGEIPWIQSSDLEEDDVLS 63 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 K S E + I + + + + + ++++ +S Sbjct: 64 LTVKKHISQEGLKNSAAKIIPKNSLVIVTRVGVGKLVVNTQEIATSQDFLSLSGIKGNSR 123 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 +LA+ + S L K+ + + + D + + VP ++EQ I + +D Sbjct: 124 FLAYSLYSL-LKKITQRVQGTSIKGITKTDFLKEAIFVPSLEEQEKI----SSCMVEVDK 178 Query: 392 LVEKIEQSIVLLKERRSSFIAA 413 L+ ++ + ++ R++F+ Sbjct: 179 LITLHQRKLNRFQKIRTTFLQK 200 >gi|256843209|ref|ZP_05548697.1| restriction modification system DNA specificity subunit [Lactobacillus crispatus 125-2-CHN] gi|293382104|ref|ZP_06628050.1| type I restriction modification DNA specificity domain protein [Lactobacillus crispatus 214-1] gi|256614629|gb|EEU19830.1| restriction modification system DNA specificity subunit [Lactobacillus crispatus 125-2-CHN] gi|290921339|gb|EFD98395.1| type I restriction modification DNA specificity domain protein [Lactobacillus crispatus 214-1] Length = 480 Score = 116 bits (289), Expect = 1e-23, Method: Composition-based stats. Identities = 63/416 (15%), Positives = 134/416 (32%), Gaps = 59/416 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP W+ V + L G+T + Y ++D+ + Y+ N Sbjct: 73 DIPDSWEWVRLGDVGLLKNGKTPKKEDISSDNIYPYFKVKDMNNNNL-YMENVKNWVGEK 131 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK---DVLPELLQGWLLSI 131 S + K I++ P AI+ I S LV +L ++ + Sbjct: 132 YS-RQVMPKNTIIF----PKNGGAILTAKKRILSQDSLVDLNTGGLIPYNDLNHKFIFYL 186 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ I+ +G+ + + K + +P+PPL EQ I KI + + + ++ Sbjct: 187 FLSLDIKDFVKGSAVPTINSKKLKETLVPLPPLEEQSRIAAKIAQLFALLRKVESSTQQY 246 Query: 192 IELLKEKKQALVSYIVTKGL---NPDVK----------------------------MKDS 220 +L K ++ + L +P + + Sbjct: 247 AKLQTLLKSKVLDLAMRGKLVEQDPHDEPASVLLEKIKAEKRKMIKEKEIKKSKPLPPIT 306 Query: 221 GIEWVGLVPDHWEVKPFFA------LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 E +PD WE T N+ + I +++ N Sbjct: 307 DEEKPFDIPDSWEWVRLGNIAKRITDGTHNPPPNSHEGKQVISAINIKKGKIDFSLSNRF 366 Query: 275 LKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + + + G+++ + + + ++ +AV I S Sbjct: 367 VSEDQFLKEDKRTNIRKGDVLLTIVGSLGNAAVV----DTDKLFTAQRSVAVISSNILSK 422 Query: 332 YLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +L +++ S +A G ++ + + L + +PP+ EQ I I+ Sbjct: 423 FLYYVLISAMFKTQIFANAKGTTQKGIYLSKLINLKLPLPPLAEQNRIVAKIDNLF 478 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 71/204 (34%), Gaps = 9/204 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + E +PD WE + N K K + + ++ ++ + N+ ++ Sbjct: 66 TDDEKPFDIPDSWEWVRLGDVGLLKNGKTPKKEDISSDNIYPYFKVKDMNNNNLYMENVK 125 Query: 280 YE-----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + Q++ I+F R + + + + ++ ++ Sbjct: 126 NWVGEKYSRQVMPKNTIIFPKNGGAILTAKKRILSQDSLVDLNTGGLIPY-NDLNHKFIF 184 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +L S D+ ++ + +K V +PP++EQ I I A + + Sbjct: 185 YLFLSLDIKDFVK---GSAVPTINSKKLKETLVPLPPLEEQSRIAAKIAQLFALLRKVES 241 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 +Q L +S + A+ G+ Sbjct: 242 STQQYAKLQTLLKSKVLDLAMRGK 265 >gi|260660505|ref|ZP_05861420.1| hypothetical protein HMPREF0974_00007 [Lactobacillus jensenii 115-3-CHN] gi|260548227|gb|EEX24202.1| hypothetical protein HMPREF0974_00007 [Lactobacillus jensenii 115-3-CHN] Length = 423 Score = 116 bits (289), Expect = 1e-23, Method: Composition-based stats. Identities = 65/405 (16%), Positives = 138/405 (34%), Gaps = 29/405 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDGNSRQS---DTST 77 WK V + ++ G T + + G E G YL + S+ Sbjct: 33 WKKVKLGDVAEIIGGGTPSTSNLEYWDGNINWFTPTEVGKTIYLHESQRKLSELGLKKSS 92 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G IL+ II + + F +QP + + + LS + + Sbjct: 93 ARLLNPGAILFTSRAGIGNTGIIINPSA-TNQGFQSIQPNKNIIDSYFIFCLSSRLKRYA 151 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G+T + + + I EQ I I + + + +L + Sbjct: 152 LKHSAGSTFTEISGSEMKKAKIRICAKNEQNKISTCIKSLDSLLSLQQRKLELENQLKQF 211 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q L S + L P V+ + W + V + RKN L + L Sbjct: 212 NLQNLFSD--EQRLYPKVRFRGFDEPW--------KKVKLGRNVKRIRRKNKNLETNIPL 261 Query: 258 SLS-YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGI 315 ++S ++ + + + E+ Y ++ GE + + ++ + G Sbjct: 262 TISAQFGLVDQRDFFGRVVASENLANYILLKRGEFAYNKSYSKEAPYGSIKRLEKYNEGA 321 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVP 370 +++ Y+A P I+S +L + + + G R ++ +D + + +P Sbjct: 322 LSTLYIAFTPENINSDFLKAFFDTTKWYSHIVQVSTEGARNHGLLNISPQDFFEMSITIP 381 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ +I+ + N+ + L+ +Q I ++ + + Sbjct: 382 KSDEQNNISRIYNLM----NSLLSLQQQDINTTQQLKQFLLQNLF 422 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/233 (12%), Positives = 74/233 (31%), Gaps = 21/233 (9%) Query: 204 SYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + L P V+ + W +G V + E N + Sbjct: 13 THADEQRLYPKVRFRGFDEPWKKVKLGDVAEIIGGGTPSTSNLEYWDGNINWFTPTEVGK 72 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + +GLK + ++++PG I+F + + + +G + Sbjct: 73 TIYLHESQRKLSELGLK---KSSARLLNPGAILFTSRAGIGNTGIIINPSATNQGFQS-- 127 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 I +Y + + S + ++K+ + + EQ I+ Sbjct: 128 --IQPNKNIIDSYFIFCLSSRLKRYALKHSAGSTFTEISGSEMKKAKIRICAKNEQNKIS 185 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG------QIDLRGESQ 426 I +D L+ ++ + L + + + + ++ RG + Sbjct: 186 TCI----KSLDSLLSLQQRKLELENQLKQFNLQNLFSDEQRLYPKVRFRGFDE 234 >gi|291278717|ref|YP_003495552.1| type I restriction-modification system, S subunit [Deferribacter desulfuricans SSM1] gi|290753419|dbj|BAI79796.1| type I restriction-modification system, S subunit [Deferribacter desulfuricans SSM1] Length = 388 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 64/415 (15%), Positives = 135/415 (32%), Gaps = 43/415 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK WK V + +N + G + +E + + K + Sbjct: 4 KVPKGWKRVKLGDVAVINPSEILKKGTLAKKVPMEALH----PFTKKISIYEIKPFNGGV 59 Query: 80 IFAKGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKDVLPELLQGWLL--S 130 F G L ++ P L + + G ST+F+VL+ K L + + S Sbjct: 60 KFRNGDTLVARITPSLENGKTAYVDILEENEIGFGSTEFIVLREKKGLSDSHFLYYFAIS 119 Query: 131 IDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + G+T + N PPL+EQ I + + +ID Sbjct: 120 PEFRDVAIKSMTGSTGRQRVQTDVVFNYKFLFPPLSEQKAIASVLSSLDDKID------- 172 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE-LNRKN 248 LL+ + Q L + + IE + + + +K Sbjct: 173 ----LLQRQNQTLEQMA-------ETLFRKWFIEDAKEDWEEKPLDTIANFLNGLACQKY 221 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + L + ++ + N + IV G+++F + + Sbjct: 222 PPKNNFDKLPVLKIKELKNGFSENSDWATSDVPSEYIVVNGDVIFSWSGSL-----IVKI 276 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPV 367 E+ ++ V + + ++ Y + + +K +D+ + V Sbjct: 277 WDGEKCVLNQHLFKVTSEKYPKWFYYFWIKYYLQQFITIAESKATTMGHIKRDDLSKALV 336 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 LVPP +E ++ E + + I I L + R + + ++G++ ++ Sbjct: 337 LVPPDEELLK----MDKEISPFIEKIIAINNQIRTLAKLRDTLLPKLMSGEVRVK 387 >gi|292493382|ref|YP_003528821.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] gi|291581977|gb|ADE16434.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] Length = 431 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 51/423 (12%), Positives = 130/423 (30%), Gaps = 34/423 (8%) Query: 24 HWKVVPIKRFTKLNTGRTS----------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 W + R + G T E + ++ + D+ + + Sbjct: 3 DWAEEQLGRLASIEIGGTPAREVAEYWAREEDEGHPWVSIADLGPRIVFDTKERITNAGI 62 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S KG ++ + + IA D + + P D + + D Sbjct: 63 LNSNAKRVPKGTLMMS-FKLTIGRVGIAGRDLYINEAIATIIPTDGRLDGRFLYYALPDT 121 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ K G + L EQ I E + +D I I Sbjct: 122 ARSAITDTAVKGVTLNKQKLGGLLIRFPERLDEQQRIAEIL----STVDEAIEHTEALIA 177 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIE--------WVGLVPDHWEVKPFFAL----- 240 +++ K L+ + T+G+ PD +++ E +G +P W+ + + Sbjct: 178 KMQQIKAGLMHDLFTRGVTPDGQLRPPREEAPRLYKKSPLGWIPREWDTELLDNIALRGS 237 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 ++ + + I +S + + + + + + I + + Sbjct: 238 GHTPSKNHPEYWNGEIKWISLADSWRLDRVHIVDTDHKITQAGIENSSAVVHPAGIVVLS 297 Query: 301 DKRSLRSAQVME-RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + + V ++ +M + + Y + Y + ++ Sbjct: 298 RDAGVGKSAVTTCEMAVSQHFMCWRCGPRLNNYYLYYWLQYRKWEFENIATGSTIPTIGL 357 Query: 360 EDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + +P + EQ I + ++ L + + L++ R+ + +TG+ Sbjct: 358 RFFRHYRINIPLEVSEQEHIAATLLAADEKVFSLEDDV----GKLRQLRAGLMHDLLTGR 413 Query: 419 IDL 421 + + Sbjct: 414 VPV 416 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 61/211 (28%), Gaps = 16/211 (7%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M D E +G + V E + + G I + Sbjct: 1 MCDWAEEQLGRLASIEIGGTPAREVAEYWAREEDEGHPWVSIADLGPRIVFDTKERITNA 60 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + V G ++ F + I T + +D +L + Sbjct: 61 GILNSNAKRVPKGTLMMSFKLTIGRVGIAGRDLYINEAIAT---IIPTDGRLDGRFLYYA 117 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEK 395 + + G+ +L + + L + P + EQ I +++ D +E Sbjct: 118 LPDTARSAITDTAVKGV--TLNKQKLGGLLIRFPERLDEQQRIAEILSTV----DEAIEH 171 Query: 396 IEQSIVLLKERRSSFIAAAVT------GQID 420 E I +++ ++ + T GQ+ Sbjct: 172 TEALIAKMQQIKAGLMHDLFTRGVTPDGQLR 202 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 70/209 (33%), Gaps = 15/209 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLED---VESGT 60 YK S +G IP+ W + +G T +I +I L D ++ Sbjct: 212 YKKSP---LGWIPREWDTELLDNIALRGSGHTPSKNHPEYWNGEIKWISLADSWRLDRVH 268 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + S+ + G I+ + K+ + + S F+ + L Sbjct: 269 IVDTDHKITQAGIENSSAVVHPAG-IVVLSRDAGVGKSAVTTCEMAVSQHFMCWRCGPRL 327 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETV 179 + E I G+T+ + + + IP ++EQ I ++A Sbjct: 328 NN-YYLYYWLQYRKWEFENIATGSTIPTIGLRFFRHYRINIPLEVSEQEHIAATLLAADE 386 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVT 208 ++ +L + + +L L++ V Sbjct: 387 KVFSLEDDVGKLRQLRAGLMHDLLTGRVP 415 >gi|208780346|ref|ZP_03247687.1| conserved hypothetical protein [Francisella novicida FTG] gi|208743714|gb|EDZ90017.1| conserved hypothetical protein [Francisella novicida FTG] Length = 396 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 60/411 (14%), Positives = 132/411 (32%), Gaps = 28/411 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK WK + + T + D I I + + G ++ + Sbjct: 5 ELPKGWKAIELGEITSYVNRGVAPKYTDEHGITVINQKCIREGNINLELARVHNPDKKYT 64 Query: 77 TVSIFAKGQILYGKLG-PYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G IL G + I + I T +++ + Sbjct: 65 AEKQLHLGDILINSTGVGTAGRVGIFTDSINAIVDTHVSIVRLNKEYAYPKFVYYNLRFR 124 Query: 134 TQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +E EG+T I ++ + +PPLAEQ I E + +D I + Sbjct: 125 EKELEETAEGSTGQIELKRDAIKSLNILLPPLAEQKAIAEVL----SSLDDKIDLLHKQN 180 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + L++ Q L + + W + ++ Sbjct: 181 QTLEDMAQTLFREWFIERADEG---------WEEVPLSEVADIKIGRTPPRKEKQWFSND 231 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 ++ +S ++ Q+ N + + E + IV + L R E Sbjct: 232 PKDVKWISIKDMGQEGVFINGTSEYLTQEAVEKFKIPIIVKNTVILSFKMTLGRVKITGE 291 Query: 313 RGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + A + + YL +++Y + S + S+ +K + +++P Sbjct: 292 NMLSNEAIAHFNITNDKLYNEYLYLFLKTYPYQTL--GSTSSIVTSINSAMIKNILIILP 349 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 K + VI+ + ++ ++ I L++ R + + ++GQ+ + Sbjct: 350 DFKVKKSFKEVISPMFEK----IQNNQKQIKTLEQTRDTLLPKLMSGQVRV 396 >gi|237750145|ref|ZP_04580625.1| HsdS [Helicobacter bilis ATCC 43879] gi|229374332|gb|EEO24723.1| HsdS [Helicobacter bilis ATCC 43879] Length = 413 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 55/409 (13%), Positives = 124/409 (30%), Gaps = 32/409 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYL--PKDGNSRQSD 74 + W+ V + ++ +G T ++ +I ++ + D +G + K Sbjct: 20 EQWQEVRLGEVAEIVSGGTPKTSIPEYWNGEIPWLSVADFNNGKKYVVASEKFITQLGLQ 79 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ + + I+ G A+I + + + + T Sbjct: 80 ESSTKLLQRDDIIISARGTVGVIAMIPYPMAFNQSCYGLRICS--NAHSHFIYYCLKVFT 137 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + GA K + + +PPL Q I E + + +ID L + L Sbjct: 138 KYFIHQSYGAVFDTITTKILSDFTFLLPPLTIQQKIAEILSSFDDKIDLLHRQNKTLESL 197 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + I + + S I + + L Sbjct: 198 ALTLFRHYF--IDNPKRDEWEEKPLSEIAEI----QNGYAFKNSDYAERGMETYEVLKMG 251 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER- 313 +I S K +K T +++ +IV D+++ L ++++ Sbjct: 252 HIESGGGLRYFPKAHY----VKINDKMTKWVLNEDDIVLAMTDMKDSLGILGYPAMVDKS 307 Query: 314 --GIITSAYMAVKPHGIDSTYLAWLM-----RSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 ++ + D + + ++ K+ G++ +L E +K Sbjct: 308 NYYVLNQRVARIYLKSKDDFLHNYFLFLYLSLQENIQKLQSLANGGVQVNLSTEAIKNFT 367 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +PP++ Q N I + + I L+ R + A Sbjct: 368 ITIPPLEFQSQ----NNQAFINIIKKYKNNRKQIQNLQAMRDMLLKAIF 412 >gi|162280800|gb|ABX83062.1| hypothetical protein [Staphylococcus aureus] gi|163568097|gb|ABY27021.1| hypothetical protein [Staphylococcus aureus] gi|163568120|gb|ABY27028.1| hypothetical protein [Staphylococcus aureus] gi|163568152|gb|ABY27035.1| hypothetical protein [Staphylococcus aureus] Length = 399 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 63/403 (15%), Positives = 142/403 (35%), Gaps = 37/403 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W++ IK + +G T +I ++ D+ + + + Sbjct: 18 EWEMKIIKELFNVVSGSTPLRSNTSYYENGNIPWVKTTDLNNSLINDTSEKVTDIA--LN 75 Query: 77 TVSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + K +L G + + + I + + L K L+ +V Sbjct: 76 NLKVLPKDTVLIAMYGGFNQIGRTGILNIKATTNQAISALIKKGNYNSKFLQSYLNFNVK 135 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 Q + + I +P L EQ EKI +ID I + ++L Sbjct: 136 QWRRFAASSRKDPNITKRDIEKFKIPYTCLEEQ----EKIGGFFSKIDRQIELEEKKLDL 191 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+++K+ + I ++ L + G +W+ +++ E RK Sbjct: 192 LEQQKRGYMQKIFSQEL--------RFKDENGNKYPNWQTVKIGSILKE--RKERSGDGE 241 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + I++ E + Y+ V +I + + + G Sbjct: 242 MLSVTINHGIVKFDEIDRKDNSSKDKSNYKKVYKNDIAYNSMRMWQGASGKAEFD----G 297 Query: 315 IITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVP 370 I++ AY V P I+S ++A+ +++++ F GL +LK++ +K + + + Sbjct: 298 IVSPAYTVVTPIENINSNFIAYYFKTHNMIHKFRINSQGLTSDTWNLKYKQLKDIKISIC 357 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I +++ +D ++K + +L + + Sbjct: 358 SKEEQDKIADLL----TILDTRIKKQNHKLEILNINKKGLLQK 396 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 22/179 (12%), Positives = 62/179 (34%), Gaps = 4/179 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + NI + ++ L + V P + V + ++ Sbjct: 39 SNTSYYENGNIPWVKTTDLNNSLINDTSEKVTDIALNNLKVLPKDTVLIAMYGGFNQIGR 98 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 ++ + +K +S +L + +A S ++ D+++ Sbjct: 99 TGILNIKATTNQAISALIKKGNYNSKFLQSYLNFNVKQWRRFAASSRKDPNITKRDIEKF 158 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 + ++EQ I ++ID +E E+ + LL++++ ++ + ++ + E Sbjct: 159 KIPYTCLEEQEKIGGF----FSKIDRQIELEEKKLDLLEQQKRGYMQKIFSQELRFKDE 213 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 60/183 (32%), Gaps = 7/183 (3%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +W+ V I K R+ +++ + + + KD + D S K Sbjct: 220 NWQTVKIGSILKERKERS--GDGEMLSVTINHGIVKFDEIDRKDNS--SKDKSNYKKVYK 275 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I Y + + + A+FDGI S + V+ P + + + I Sbjct: 276 NDIAYNSMRMWQGASGKAEFDGIVSPAYTVVTPIENINSNFIAYYFKTHNMIHKFRINSQ 335 Query: 144 ---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + +K + +I + I EQ I + + RI + K Q Sbjct: 336 GLTSDTWNLKYKQLKDIKISICSKEEQDKIADLLTILDTRIKKQNHKLEILNINKKGLLQ 395 Query: 201 ALV 203 + Sbjct: 396 KMF 398 >gi|307250672|ref|ZP_07532609.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306857280|gb|EFM89399.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 435 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 64/434 (14%), Positives = 128/434 (29%), Gaps = 71/434 (16%) Query: 27 VVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPK---DGNSRQSDTS 76 V + ++ G T ++ +D I +I D++ +GKY+ K + +S Sbjct: 2 WVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYISKGNRNITENGLRSS 61 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + +K I+Y P I + + + F + + + + I T Sbjct: 62 STRLLSKNSIVYSSRAPI-GYIAITETELCTNQGFKSIDLYNKE-IVDYLYYSLIYFTPE 119 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I++ G T GN +P+PPL EQ I KI I+ + + L + Sbjct: 120 IQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVVKIEELLPYIEQYAEKEEKLTALHQ 179 Query: 197 EKK----QALVSYIVTKGLNPDVKM----------------------------------- 217 + ++++ + L Sbjct: 180 QFPEQLKKSILQAAIQGKLTKQDPNDEPALVLIERIKAEKLRLIAEKKLKKPKVVSEIIL 239 Query: 218 -------------KDSGIEWVGLVPDHWEVKPFFALVTE------LNRKNTKLIESNILS 258 + E +P++W + + I Sbjct: 240 RDNLPYEIINGEERCIADEVPFEIPENWCWVRLGEIGNWGAGATPNRHEPKYYENGTIPW 299 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 L G++ + T E V + I + +E + Sbjct: 300 LKTGDLNDGIITEIPEYITELAIEKTSVKLNPVGSVLIAMYGATIGKLGILNIEATTNQA 359 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + GI + YL + + S + GSG + ++ E + +PP+ EQ I Sbjct: 360 CCACIPYTGIYNKYLFYYLMSQKTELQKRSEGSG-QPNISKEKIVNYLFPLPPLNEQKCI 418 Query: 379 TNVINVETARIDVL 392 I + + L Sbjct: 419 VEKIETLFSTLQNL 432 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 60/171 (35%), Gaps = 8/171 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQ 72 IP++W V + G T + I ++ D+ G +P+ Sbjct: 262 EIPENWCWVRLGEIGNWGAGATPNRHEPKYYENGTIPWLKTGDLNDGIITEIPEYITELA 321 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + ++V + G +L G + K I + + + P + + L Sbjct: 322 IEKTSVKLNPVGSVLIAMYGATIGKLGILNIEATTNQACCACIPYTGIYNKYLFYYLMSQ 381 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 T+ + EG+ + + I N P+PPL EQ I EKI + Sbjct: 382 KTELQKRS-EGSGQPNISKEKIVNYLFPLPPLNEQKCIVEKIETLFSTLQN 431 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 56/198 (28%), Gaps = 13/198 (6%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE-- 290 L + K E + + I + + + K S I + G Sbjct: 1 MWVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYISKGNRNITENGLRS 60 Query: 291 -----IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + I + A + ++ + + + Y ++ Sbjct: 61 SSTRLLSKNSIVYSSRAPIGYIAITETELCTNQGFKSIDLYNKEIVDYLYYSLIYFTPEI 120 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-- 403 + + + +PP+ EQ I I I+ + E+ + L Sbjct: 121 QSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVVKIEELLPYIEQY-AEKEEKLTALHQ 179 Query: 404 ---KERRSSFIAAAVTGQ 418 ++ + S + AA+ G+ Sbjct: 180 QFPEQLKKSILQAAIQGK 197 >gi|160914345|ref|ZP_02076564.1| hypothetical protein EUBDOL_00353 [Eubacterium dolichum DSM 3991] gi|160915331|ref|ZP_02077543.1| hypothetical protein EUBDOL_01339 [Eubacterium dolichum DSM 3991] gi|158432722|gb|EDP11011.1| hypothetical protein EUBDOL_01339 [Eubacterium dolichum DSM 3991] gi|158433818|gb|EDP12107.1| hypothetical protein EUBDOL_00353 [Eubacterium dolichum DSM 3991] Length = 517 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 70/433 (16%), Positives = 132/433 (30%), Gaps = 71/433 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP +W+ V I + G+T KDI Y+ +++S Sbjct: 88 EIPDNWEWVHINDIAESYLGKTLNKTKDIGESVPYLCSINIQSDYIDMNTIKIAKFNEAE 147 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL--VLQPKDVLPELLQGWLLSIDV 133 + G +L + G R A+ + L V + + P Q L V Sbjct: 148 KQKYLLQDGDLLICEGGDAGRSAVWNKNKTMYYQNALHRVRFYEKLNPVFYQRVLSFYKV 207 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ ++ +G T+ H K + +I P+PPL EQ I K+ I+ + E Sbjct: 208 SKILDNYFKGVTIKHFVQKSLFSIYFPLPPLQEQHRIVAKLQELEPLIEKYRIAEEQLHE 267 Query: 194 LL----KEKKQALVSYIVTKGLNPDVKM-------------------------------- 217 L + K++++ Y + L P Sbjct: 268 LNSNIKDQLKKSILQYAIEGKLVPQDPNDEPASVLLERIREEKQQLIKEGKIKKDKNESI 327 Query: 218 ----------KDSGIEWV------GLVPDHWEVKPFFALVTE------LNRKNTKLIESN 255 K +GIE+ +P+ W+ + N Sbjct: 328 IFRRDNSYYEKINGIEYCIDNEIPFEIPNSWQWARLNNIGNWGAGATPSKSNNEYYSNGT 387 Query: 256 ILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I L G++ T E + ++ G ++ K + + Sbjct: 388 IPWLLTGDLNDGYITNIPNHITELALEKTSVKLNPSGSVLIAMYGATIGKLGILTFPATT 447 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + S YL + + + G + ++ E + + + VPP+ Sbjct: 448 NQACCACLV---YKPFYSKYLFFYLLANK-RNFVKKGEGGAQPNISKEKIIKTLIAVPPL 503 Query: 373 KEQFDITNVINVE 385 KEQ I N++ Sbjct: 504 KEQIRIVNLLGKV 516 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 69/213 (32%), Gaps = 15/213 (7%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRK---NTKLIESNILSLSYGNIIQKLETRNMG 274 + E +PD+WE + K TK I ++ L NI N Sbjct: 79 RCIEDELPFEIPDNWEWVHINDIAESYLGKTLNKTKDIGESVPYLCSINIQSDYIDMNTI 138 Query: 275 LK---PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 E+ + ++ G+++ + M + + ++ Sbjct: 139 KIAKFNEAEKQKYLLQDGDLLICEGGDAGRSAVWNKNKTMYYQ--NALHRVRFYEKLNPV 196 Query: 332 YLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + ++ Y + K+ G + + + + +PP++EQ I + I+ Sbjct: 197 FYQRVLSFYKVSKILDNYFKGVTIKHFVQKSLFSIYFPLPPLQEQHRIVAKLQELEPLIE 256 Query: 391 VLVEKIEQSIVLLK-----ERRSSFIAAAVTGQ 418 E+ + L + + S + A+ G+ Sbjct: 257 KYR-IAEEQLHELNSNIKDQLKKSILQYAIEGK 288 >gi|254440097|ref|ZP_05053591.1| Type I restriction modification DNA specificity domain protein [Octadecabacter antarcticus 307] gi|198255543|gb|EDY79857.1| Type I restriction modification DNA specificity domain protein [Octadecabacter antarcticus 307] Length = 410 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 62/427 (14%), Positives = 125/427 (29%), Gaps = 41/427 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKL--NTGRTSE---SGKDIIYIGLEDVESGT-GKY 63 Y+ S V G IP W+V + + G E GK I + D+ G+ Sbjct: 9 YRLSEV---GVIPDDWEVSTLANLAEYPMQNGVFFEANRKGKGCPMINVGDLYGGSPIPV 65 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI-------CSTQFLVLQP 116 + D G + + + + I + + +P Sbjct: 66 GFLERFDASPDEQKRFQVNDGDLFFTRSSIVPSGIAQCNHVSIAEGDTVVFDSHVIRYRP 125 Query: 117 KDVLPELLQGWLLS--IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + + L + + + + + + TM+ D + + P+ PPL EQ I E + Sbjct: 126 NPKIIDALFLFRACTASNTRRYLISHAKTGTMTTIDQRVLSACPITFPPLPEQRAIAEAL 185 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 I L + +L + Q L++ SG EW + Sbjct: 186 SDADALIAALEAMIAKKRDLKQAAMQQLLT-------GKTRLPGFSG-EW-----KVSQQ 232 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 + + + S N+ E Y + G++++ Sbjct: 233 QDVITFINGRAYGRHEWETSGTPVCRLQNLTGSGEKFYYSKLVLPERQYML--EGDLIYM 290 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + I + + + MG Sbjct: 291 WSASFGPHIWTGPRAIFHYHI----WKLECDTEEVDRQFYYYKLVEITEALQATMGGSTM 346 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 L +++ V +PPI+EQ I V++ D + +E + + + Sbjct: 347 LHLTKTGMEKFLVNLPPIEEQTAIAEVLSDM----DADLAALEARAAKARTVKQGMMQEL 402 Query: 415 VTGQIDL 421 +TG++ L Sbjct: 403 LTGKVRL 409 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 64/198 (32%), Gaps = 12/198 (6%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 P V + K + YG + + V+ G++ F Sbjct: 32 YPMQNGVFFEANRKGKGCPMINVGDLYGGSPIPVGFLERFDASPDEQKRFQVNDGDLFFT 91 Query: 295 FIDLQNDKRS---LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR----SYDLCKVFY 347 + + S + + S + +P+ L +L R S + Sbjct: 92 RSSIVPSGIAQCNHVSIAEGDTVVFDSHVIRYRPNPKIIDAL-FLFRACTASNTRRYLIS 150 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +G ++ + P+ PP+ EQ I ++ D L+ +E I ++ + Sbjct: 151 HAKTGTMTTIDQRVLSACPITFPPLPEQRAIAEALSDA----DALIAALEAMIAKKRDLK 206 Query: 408 SSFIAAAVTGQIDLRGES 425 + + +TG+ L G S Sbjct: 207 QAAMQQLLTGKTRLPGFS 224 >gi|198283099|ref|YP_002219420.1| restriction modification system DNA specificity protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198247620|gb|ACH83213.1| restriction modification system DNA specificity domain [Acidithiobacillus ferrooxidans ATCC 53993] Length = 426 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 60/415 (14%), Positives = 130/415 (31%), Gaps = 29/415 (6%) Query: 23 KHWKVVPIKRFTK-LNTGRTSESGK------DIIYIGLEDVESG-TGKYLPKDGNSRQSD 74 W+ + +G T + + + +I + + K + Sbjct: 18 SDWQKTTVGEIASGFLSGGTPSTSRADFWEGENPWITSKWLGDKLELTTGEKFVSEGAVK 77 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSID 132 + I K I++ + K I D + +++ + + L L Sbjct: 78 KTATKIVPKDSIIFAT-RVGVGKVGINRIDLAINQDLAGVLIDNERYDIKFLAYQLGIDS 136 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + Q + GAT+ + I + +PPL EQ I + + I + R I Sbjct: 137 IQQYVAMNKRGATIKGITRDCLEQIRLNLPPLPEQKKIAHIL----STVQRAIEAQERII 192 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + E K+AL+ + T+GL + K + I + + E+ +T K Sbjct: 193 QTTTELKKALMHKLFTEGL-RNEPQKQTEIGPIPESWEVVEIGDLGKCITGSTPKTKVDS 251 Query: 253 ESNILSLSYG-----NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + + + + + PE T + + ++ I K + Sbjct: 252 FYDPPTEDFIAPADLGARRYVYDSEKKISPEGMATIRPIPRNAVMCVCIGSSIGKVGMSY 311 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 E ++ + + SY G L + V Sbjct: 312 R---EESATNQQINSIICGEGRDPEFVYCLLSYRSDYWKSFATFGPVPILSKGRFSTIGV 368 Query: 368 LVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P + EQ I + ++ VE E+ + +LK+ + + +T + + Sbjct: 369 PIPSSLDEQQAIAKPLVA----LETKVEVAEKKVTVLKDLFRTLLHELMTAKTRV 419 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 72/201 (35%), Gaps = 10/201 (4%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG-----LEDVESGTGKYLP 65 K + IG IP+ W+VV I K TG T ++ D Y + + G +Y+ Sbjct: 217 KQTE---IGPIPESWEVVEIGDLGKCITGSTPKTKVDSFYDPPTEDFIAPADLGARRYVY 273 Query: 66 KDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 +T+ + ++ +G + K ++ + + Q + + Sbjct: 274 DSEKKISPEGMATIRPIPRNAVMCVCIGSSIGKVGMSYREESATNQQINSIICGEGRDPE 333 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDT 183 + L + ++ + I +PIP L EQ I + ++A +++ Sbjct: 334 FVYCLLSYRSDYWKSFATFGPVPILSKGRFSTIGVPIPSSLDEQQAIAKPLVALETKVEV 393 Query: 184 LITERIRFIELLKEKKQALVS 204 + +L + L++ Sbjct: 394 AEKKVTVLKDLFRTLLHELMT 414 >gi|296453351|ref|YP_003660494.1| type I restriction-modification system specificity determinant [Bifidobacterium longum subsp. longum JDM301] gi|296182782|gb|ADG99663.1| type I restriction-modification system specificity determinant [Bifidobacterium longum subsp. longum JDM301] Length = 409 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 49/407 (12%), Positives = 123/407 (30%), Gaps = 32/407 (7%) Query: 22 PKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P + P+ KL G + + K I I + + + + + + Sbjct: 13 PNGVERKPLGAIAKLYRGNGLQKKDFTDKGIGCIHYGQIYTRYDTFTSQTISFVDKKLAD 72 Query: 78 VSI-FAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + ++ + I + ++ + L + ++D Sbjct: 73 KLLKVHPNDLIVTATSENLEDVCKAVAWLGGSDIVTGGHSIVVRHHQNAKYLSYYFQTLD 132 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 QR A G + + I +P+PPL Q I + + + L E + Sbjct: 133 FFQRKRAYVHGTKVMEIKKDDLAKIVVPVPPLPVQEEIVRILDSFSSLEAELEAELEAEL 192 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E +++ + ++T + I + K Sbjct: 193 EARRKQYAYYRNELLTFDRERVITACIQDI------------CTRICSGGTPSSKRHDYY 240 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + N+ L +I + + + Q + ++ K ++ S Sbjct: 241 DGNVPWLRTQDIDFNVINQTSATISDEGLKNSAAQWIPANCVIVAMYGATAAKVAVNSIP 300 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + + + D Y+ + + + A+G G + ++ + V+ P+ + Sbjct: 301 LTTNQACCN--LQIDESKADVRYVFHWLSNEY--EHLKALGEGSQSNINAKKVRLYPISL 356 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIV----LLKERRSSFIA 412 PP +EQ I ++++ + L + I + R ++ Sbjct: 357 PPFEEQQRIVSILDRFDKLTNDLSSGLPAEIEARHKQYEYYRDRLLS 403 >gi|20090963|ref|NP_617038.1| type I site-specific deoxyribonuclease [Methanosarcina acetivorans C2A] gi|19916047|gb|AAM05518.1| type I site-specific deoxyribonuclease [Methanosarcina acetivorans C2A] Length = 487 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 54/466 (11%), Positives = 126/466 (27%), Gaps = 60/466 (12%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 + + +P+ W + + +L G++ + +G ++ + Sbjct: 11 EISELPKLPEGWVWIRLDSAGELFCGQSPSIAEVNQEKRGVPYVTGPEQWDGSKIKETKW 70 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + +G I G + K P + + L +I Sbjct: 71 TEFPKRLVPEGCIFITVKGAGVGKI-FPGVSCAIGRDIYAFLPSSKVD--FKYTLHAIKH 127 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + I + + + L EQ I KI +D I E Sbjct: 128 QIDVLIMKAQGDIPGLSKNHILDHVIGLCSLEEQRAIVFKIEQLFSELDNGIANLKLAQE 187 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL-------------------------- 227 LK +QA++ L + + + G Sbjct: 188 QLKVYRQAVLKKAFEGELTKKWREQQVDLPDAGELLERIRKEREEVAKDTGKKVKIIKPP 247 Query: 228 ----------VPDHWEVKPFFALVTELNRKNTK-------LIESNILSLSYGNII---QK 267 +P W L K+ L + G + Sbjct: 248 TNAELVELPMIPKEWMWVKLDYLGDLGRGKSKHRPRNDKTLFGGKYPFIQTGEVKAANHT 307 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 +++ ++ G + + L I+ + Sbjct: 308 IKSFEKTYSDVGLAQSKLWPKGTLCITIAANIAETAFLGFEGCFPDSIVGFTAI---ESL 364 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + Y+ + ++ +A ++++ ++ L + + + EQ DI I + Sbjct: 365 VGKEYVYYFFKANQSKIESFAPA-TAQKNINLNILENLLIPLCSLPEQQDIVQEIETRLS 423 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI-------DLRGESQ 426 D + + IE ++ + R S + A G++ ++RG Sbjct: 424 VCDKIEQDIETNLEKAEALRQSILKKAFEGKLLNERELEEVRGAED 469 >gi|189463339|ref|ZP_03012124.1| hypothetical protein BACCOP_04056 [Bacteroides coprocola DSM 17136] gi|189429958|gb|EDU98942.1| hypothetical protein BACCOP_04056 [Bacteroides coprocola DSM 17136] Length = 389 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 69/388 (17%), Positives = 126/388 (32%), Gaps = 18/388 (4%) Query: 27 VVPIKRF---TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + + N D + LED+E T K + S++S F K Sbjct: 2 WTTLGEISNYGECNNVSVDSITDDDWVLELEDLEKDTAKIIQTLSRSKRSIKGVRHRFNK 61 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICE 142 G ILY KL YL K ++A G C+T+ + + + S + Sbjct: 62 GDILYSKLRTYLNKVLVAPQSGYCTTEIMPFNSYCNVSSYYLNHVLRSAYFLDYTQQCGY 121 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G M N +P+PPLAEQ I ++I ID + + + +K+ K + Sbjct: 122 GVKMPRLSTTDACNGMIPLPPLAEQKRIVKEIEHWFSLIDVIESGKEDLQATIKQAKSKI 181 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + L P + E + + E+ +L + SN+L ++ G Sbjct: 182 LDLAIHGKLVPQDPNDEPASELLKRINSKAEITCDNGHSRKLPQGWAYCQLSNVLKITMG 241 Query: 263 NIIQKLETRNMGLKPESYETYQIVDP-------------GEIVFRFIDLQNDKRSLRSAQ 309 + N D I L Sbjct: 242 QSPKGDSLNNKRGIEFHQGKICFSDKFLLESGIFTNEPTKIAEPNSILLCVRAPVGVVNI 301 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + I A+ P + + +L+++ + G +++ E ++ +++ Sbjct: 302 TKNQICIGRGLCALTPFEGNVDFYFYLLQTLQDSFDNQSTG-TTFKAISGEIIRNENIIL 360 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIE 397 PP+ EQ I I D + +E Sbjct: 361 PPLAEQQRIVQKIEELFHVFDNIQNALE 388 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 52/153 (33%), Gaps = 4/153 (2%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K+ K + G+I++ + +K + T Sbjct: 40 KIIQTLSRSKRSIKGVRHRFNKGDILYSKLRTYLNKVLVAPQSGY---CTTEIMPFNSYC 96 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + S YL ++RS G G+ L D + +PP+ EQ I I Sbjct: 97 NVSSYYLNHVLRSAYFLDYTQQCGYGVKMPRLSTTDACNGMIPLPPLAEQKRIVKEIEHW 156 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + IDV+ E +K+ +S + A+ G+ Sbjct: 157 FSLIDVIESGKEDLQATIKQAKSKILDLAIHGK 189 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 47/164 (28%), Gaps = 3/164 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W + K+ G++ + G+E + S Sbjct: 222 KLPQGWAYCQLSNVLKITMGQSPKGDSLNNKRGIEFHQGKICFSDKFLLESGIFTNEPTK 281 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I IL P I L P + + + L + + Sbjct: 282 IAEPNSILLCVRAPV-GVVNITKNQICIGRGLCALTPFEGN--VDFYFYLLQTLQDSFDN 338 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 G T + I N + +PPLAEQ I +KI D Sbjct: 339 QSTGTTFKAISGEIIRNENIILPPLAEQQRIVQKIEELFHVFDN 382 >gi|268611919|ref|ZP_06145646.1| type I restriction-modification system specificity subunit [Ruminococcus flavefaciens FD-1] Length = 406 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 57/404 (14%), Positives = 135/404 (33%), Gaps = 31/404 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-----GNSRQSDTSTVS 79 W+ G E KD + + + + G + G + D + Sbjct: 16 WEQRKFSD-FTFAAG---ERNKDDLDLEPFAITNNQGFIAQSEAHDDFGYMKDVDRNMYI 71 Query: 80 IFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQP-KDVLPELLQGWLLSIDVTQR 136 + Y + + + I S+ + V Q + V + L+ W + Sbjct: 72 VVKPNSFAYNPARINVGSLGYYEGAENVIVSSLYEVFQTAEYVDDKFLKHWFKTKAFQDW 131 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 IE + EG+ + + + M +P + EQ I + +ID LIT R + L+ Sbjct: 132 IERLQEGSVRLYFYYDKLCECIMNMPSVEEQRRIGAYLD----KIDNLITLHQRKCDALQ 187 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIE------WVGLVPDHWEVKPFFALVTELNRKNTK 250 + K++++ + + +++ +G +G + K Sbjct: 188 KFKKSMLQKMFPQNGESVPEIRFAGFTDAWEQRKLGEIYRDIGNAFVGTATPYYVDKGHF 247 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 +ESN + N ++ + + + + G++V ++ ++ Sbjct: 248 YLESNNIKDGQINHNTEIFINDEFY---EKQKDKWLHTGDMVMVQSGHVGH-AAVIPEKL 303 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 I+ +L + ++ K + +G + + +++ V V Sbjct: 304 NNSAAHALIMFRNPKMIINPYFLNYQYQTIKAKKKIENITTGNTIKHILASNMQSFVVDV 363 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I EQ I + +D L+ ++ + L++ + S + Sbjct: 364 PNIDEQELIGAF----FSNLDSLITIHQRKLETLQKMKKSLLQK 403 >gi|300718522|ref|YP_003743325.1| type I restriction-modification system, S subunit [Erwinia billingiae Eb661] gi|299064358|emb|CAX61478.1| Type I restriction-modification system, S subunit [Erwinia billingiae Eb661] Length = 576 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 76/477 (15%), Positives = 141/477 (29%), Gaps = 82/477 (17%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ W+ + + T +E D + LED+E T K L + S + S Sbjct: 100 ELPEGWEWMRLGFITNYGECDKAEPTDANADTWIVELEDIEKSTSKLLNRVKFSERPFKS 159 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDVTQ 135 + + F K +LYGKL PYL K ++AD G+C+T+ + + ++LP+ ++ L S Sbjct: 160 SKNKFYKNDVLYGKLRPYLDKVLVADDSGVCTTEIIPIKGYGNILPDYIRLLLKSPRFIA 219 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR--------------- 180 G + + + + +AEQ I K+ Sbjct: 220 YANKSTHGMNLPRLGTDKAIHAVVELTSIAEQARIVNKVDELMSLCDQLEQQSLTSLEAH 279 Query: 181 --------------------------IDTLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 I + KQ ++ V L P Sbjct: 280 QHLVGTLLATLTESQNAEELAENWARISQHFDTLFTTEASIDALKQTILQLAVMGKLVPQ 339 Query: 215 VK-------------------------MKDSGIEWVGLV---PDHWEVKPFFALVTELNR 246 K +E +G P W Sbjct: 340 DPNDEPASELLKRIEQEKAQLVKEGKIKKHPPVEPLGEPALLPRSWLNIVVQDFADIRLG 399 Query: 247 KNTK-----LIESNILSLSYG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 +I +S G N + + + + ++ G ++ I Sbjct: 400 STPDRTEKKYWNGDIPWVSSGEVANEVILDTKEKVTSEGFKNSSTNMIPAGSLLMAIIGQ 459 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + + A +D Y+ + +S L G G + +L Sbjct: 460 GKTRGQTAILGIDACTNQNVAAFVFNRELVDPEYVWFWAKSKYLSHRGDGHG-GAQPALN 518 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + V+ + PIKEQ I + + D L ++ + + + AA+ Sbjct: 519 GKKVRSFIFPLAPIKEQQRIVSEVKRFNDICDTLKSHLQSAQQTQQHLADALTDAAL 575 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 69/219 (31%), Gaps = 19/219 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGA---IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYI 51 +K + V+ +G +P+ W + ++ F + G T + + DI ++ Sbjct: 366 IKKHPP--------VEPLGEPALLPRSWLNIVVQDFADIRLGSTPDRTEKKYWNGDIPWV 417 Query: 52 GLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICST 109 +V + + S S+ ++ G +L +G + I D + Sbjct: 418 SSGEVANEVILDTKEKVTSEGFKNSSTNMIPAGSLLMAIIGQGKTRGQTAILGIDACTNQ 477 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 L + W + G + K + + P+ P+ EQ Sbjct: 478 NVAAFVFNRELVDPEYVWFWAKSKYLSHRGDGHGGAQPALNGKKVRSFIFPLAPIKEQQR 537 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I ++ DTL + + + AL + Sbjct: 538 IVSEVKRFNDICDTLKSHLQSAQQTQQHLADALTDAALN 576 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 62/197 (31%), Gaps = 14/197 (7%) Query: 217 MKDSGIEWVGLVPDHWEVKPF----FALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 ++ S E +P+ WE + ++ I+ L Sbjct: 90 LEISEEEKPFELPEGWEWMRLGFITNYGECDKAEPTDANADTWIVELEDIEKSTSKLLNR 149 Query: 273 MGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDS 330 + +++ ++++ + DK + + G+ T+ + I Sbjct: 150 VKFSERPFKSSKNKFYKNDVLYGKLRPYLDKVLVA----DDSGVCTTEIIPIKGYGNILP 205 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 Y+ L++S G+ L + V + I EQ I N ++ + Sbjct: 206 DYIRLLLKSPRFIAYANKSTHGMNLPRLGTDKAIHAVVELTSIAEQARIVNKVDELMSLC 265 Query: 390 DVLVEKIEQSIVLLKER 406 D L +QS+ L+ Sbjct: 266 DQLE---QQSLTSLEAH 279 >gi|254976625|ref|ZP_05273097.1| restriction modification system DNA specificity domain protein [Clostridium difficile QCD-66c26] gi|255094010|ref|ZP_05323488.1| restriction modification system DNA specificity domain protein [Clostridium difficile CIP 107932] gi|255315761|ref|ZP_05357344.1| restriction modification system DNA specificity domain protein [Clostridium difficile QCD-76w55] gi|255518422|ref|ZP_05386098.1| restriction modification system DNA specificity domain protein [Clostridium difficile QCD-97b34] gi|255651540|ref|ZP_05398442.1| restriction modification system DNA specificity domain protein [Clostridium difficile QCD-37x79] gi|260684595|ref|YP_003215880.1| restriction modification system dna specificity domain [Clostridium difficile CD196] gi|260688253|ref|YP_003219387.1| restriction modification system dna specificity domain [Clostridium difficile R20291] gi|306521355|ref|ZP_07407702.1| restriction modification system dna specificity domain [Clostridium difficile QCD-32g58] gi|260210758|emb|CBA65671.1| restriction modification system dna specificity domain [Clostridium difficile CD196] gi|260214270|emb|CBE06581.1| restriction modification system dna specificity domain [Clostridium difficile R20291] Length = 394 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 66/399 (16%), Positives = 145/399 (36%), Gaps = 22/399 (5%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 I ++ + + Y+ +++ L + G++++ Sbjct: 6 KILDVVSISRENVKKFDGERSYLSTGNLDFNKISNLEI-VTYENKPSRANQTVNIGEVIF 64 Query: 89 GKLGPYLRKAIIA--DFDGICSTQFLVLQP-KDVLPELLQGWLLSIDVTQRIEAICEGAT 145 K+ + +I + + I ST F VL+P K++LP+ L +L S + + +GAT Sbjct: 65 AKMKDTKKTLVINKTNKNIIVSTGFYVLKPSKEILPQYLYHYLNSSYFLNQKNRLSKGAT 124 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 S + +G+ NI + + L Q + + ID + EL + S Sbjct: 125 QSALNNEGLANIKIRMYNLKVQEKVVRVLDKAQELIDKRKEQIEVLDEL-------VKSR 177 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + P K+ I +G D E R N L+++ +L Sbjct: 178 FIEMFGTPSKNEKNWEISEIGKYLDVLTDYH-SNGSYETLRDNVTLLDTKGYALMVRTTD 236 Query: 266 QKLETRNMGLKPESYETYQIVDP-----GEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + G+K Y ++ GE++ I + + Sbjct: 237 LENNNFEKGVKYIDEHAYNYLEKSKVFGGEVIINKIGSAGKVYLMPFLNKPVSLAMNQFM 296 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++ +L L+ + + G +++ + V+++ ++VPPI+ Q Sbjct: 297 LRFNEDKVNHIFLYNLLLTSYMESKIKEKVRGAVTKTITKDAVRKINIIVPPIRLQNQFA 356 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 N + +++ L ++E S+ L++ +S + A G+ Sbjct: 357 NFV----KQVNSLKFEMETSLKELEDNFNSLMQKAFKGE 391 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 25/208 (12%), Positives = 66/208 (31%), Gaps = 22/208 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIY--------------IGLEDVESGTGKYLPKDG 68 K+W++ I ++ + T S + + + D+E+ + K Sbjct: 190 KNWEISEIGKYLDVLTDYHSNGSYETLRDNVTLLDTKGYALMVRTTDLENNNFEKGVKYI 249 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + + S G+++ K+G + ++ + S + ++ +L Sbjct: 250 DEHAYNYLEKSKVFGGEVIINKIGSAGKVYLMPFLNKPVSLAMNQFMLRFNEDKVNHIFL 309 Query: 129 LSIDVTQRIEAICE----GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 ++ +T +E+ + GA + I + +PP+ Q + + Sbjct: 310 YNLLLTSYMESKIKEKVRGAVTKTITKDAVRKINIIVPPIRLQNQFANFVKQVNSLKFEM 369 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLN 212 T + +L+ L Sbjct: 370 ETSLKELEDNFN----SLMQKAFKGELF 393 >gi|317010197|gb|ADU80777.1| restriction modification system DNA specificity domain protein [Helicobacter pylori India7] Length = 398 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 48/412 (11%), Positives = 128/412 (31%), Gaps = 35/412 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDII--------YIGLEDVESGTGKYLPKDGNSRQ 72 +PK+W++ + + +N G + + YI ++ + + + Sbjct: 8 LPKNWEIKTFRDISTINQGLQIPISQRLKAPTEHAKFYITIQALNN-------RKEFEYI 60 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + + K IL + G I + F + + ++ + + LS++ Sbjct: 61 KTYNESVVCHKDDILMTRTGNT-GMVITNIEGVFHNNFFKINFDRTLINKDFLVYFLSLE 119 Query: 133 -VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + I +T+ + ++ +P+PPL EQ+ I + + + Sbjct: 120 QTQKTILRKAGTSTIPDLNHNDFYSLSIPLPPLNEQIAIANILSDVDHYLYS----LDAL 175 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I + K+AL ++++ + +G P + N Sbjct: 176 ILKKESVKKALSFELLSQRKRLRGFNQAWQRVRLGTYKYRRGSFPQPYGNPQWYSDNGM- 234 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + G + + + + V ++ R A Sbjct: 235 --PFVQVYDVGENFKLTQKTKQKISKIAQPMSVFVPKNSVIITLQGTIG-----RVALTQ 287 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSY--DLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + ++ + + S + + +++ + +K + + Sbjct: 288 YDCYCDRTILIFDNNTLNDVNKYFFVLSLFTKFEEEKRKADGSIIKTITKQTLKDFEIPL 347 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 PP+ EQ I N+++ I L K Q + + + ++ +I + Sbjct: 348 PPLNEQIAIANILSALDNEIISLKNKKRQ----FENIKKALNHDLMSAKIRV 395 >gi|188527306|ref|YP_001909993.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori Shi470] gi|188143546|gb|ACD47963.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori Shi470] Length = 422 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 52/407 (12%), Positives = 122/407 (29%), Gaps = 19/407 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQS 73 +PK + ++ ++ G T I + +ED+ + Sbjct: 12 VPKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPK 71 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLS 130 +F K I+ A++ D + + QF L K ++ + Sbjct: 72 ALKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCDIALDMKFFFYQC 130 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + + + + D PIPPL Q I + + A T L TE Sbjct: 131 FLLGEWCKKNTNVSGFASMDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNT 190 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++ K++ Q + ++ D+ + L + Sbjct: 191 ELKARKKQYQYYQNMLLDF---KDIHSNHKDAKISAKTYPKRLKTLLQTLAPKGVEFRKL 247 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I+ + L+ + ++ I + + Sbjct: 248 GEVCEIIRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVNWQ 307 Query: 311 MERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 ++ +V P + YL +++ + + S + S+ ++ ++ + + Sbjct: 308 NQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITIPI 367 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 PP++ Q +I +++ A L+ I I K+ R + Sbjct: 368 PPLEIQQEIVKILDQFLALTTDLLAGIPAEIEARKKQYEYYREKLLT 414 >gi|308182946|ref|YP_003927073.1| HP0790-like protein [Helicobacter pylori PeCan4] gi|308065131|gb|ADO07023.1| HP0790-like protein [Helicobacter pylori PeCan4] Length = 424 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 53/407 (13%), Positives = 118/407 (28%), Gaps = 24/407 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIRNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITSKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCDIALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKNNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTR 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + + + L+ N + E + P +K + KL Sbjct: 192 KKQYQYYQNMLLD------FNDINQSHKDAKEKLAQKPYPKRLKTLLQTLAPKGVGFRKL 245 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI--DLQNDKRSLRSAQ 309 E + + + + + + I S Sbjct: 246 GEVCDFQKGKSITKKAVTFGKVPVISGGRQPAYYHNEANRSGETIAISSSGVYAGYVSYW 305 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + S ++ K + YL + + + +G + +D++ + + Sbjct: 306 DIPVFLADSFSVSPKQKTLMPKYLFHYLTTQQ-DAIHATKSTGGIPHVYSKDLQNFLIPI 364 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 PP++ Q +I +++ + L+ I I K+ R + Sbjct: 365 PPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 411 >gi|209523719|ref|ZP_03272272.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] gi|209495751|gb|EDZ96053.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] Length = 406 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 52/400 (13%), Positives = 120/400 (30%), Gaps = 26/400 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + P+ + + G+ + + + + G + + + +G Sbjct: 14 EWKPLGKVCRFINGKAYKQAELLEQGKYPVLRVGNF-FTNSNWYYSNLELEEDKYCDRGD 72 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +LY + + D + V+ + + +LL D E G+T Sbjct: 73 LLYAWSASFGPRIWDGDKVIYHYHIWKVVPDAKSIDKKYLYYLLDWDTKALKEEHGTGST 132 Query: 146 MSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 M H I +PIP LA Q I + T L E + +++ Sbjct: 133 MMHVSKGSIEKRLVPIPCPDNPDRSLAIQAEIVRILDTFTALTAELTAELSAELSDRQKQ 192 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 ++T + EW +G + + Sbjct: 193 YNYYRDRLLT--------FEKGEAEWKTLGEIVTFRRGSFPQPYGNSGWYDGEGSMPFVQ 244 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 ++ ++ + + V G ++ + ++R + Sbjct: 245 VADVSDFGFTLIKETKQRISKLAQPKSVFVKAGTVIVTLQGTIGRVAITQYDCYVDRTL- 303 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 A I+ Y A+ +++ + YA GS +++ E+ + +PP+ EQ Sbjct: 304 --AIFTGYKENINKKYFAYQLKNKFDIEKEYARGS-TLKTITKEEFSNFEIPIPPLAEQA 360 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 I +++ + E + + I L ++ R + Sbjct: 361 RIVAILDKFDTLTTSIREGLPREIELRQQQYEYYRDLLLT 400 >gi|312880985|ref|ZP_07740785.1| restriction modification system DNA specificity domain [Aminomonas paucivorans DSM 12260] gi|310784276|gb|EFQ24674.1| restriction modification system DNA specificity domain [Aminomonas paucivorans DSM 12260] Length = 407 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 50/412 (12%), Positives = 119/412 (28%), Gaps = 43/412 (10%) Query: 22 PKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P + I +L G ++ + + I + + G + + + + + Sbjct: 13 PDGVEYKAIGDLGELVRGNGMPKSDFADSGVGCIHYGQIYTYYGVWAKETRSFIPHEKAE 72 Query: 78 VSI-FAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 I G ++ + I + + D P+ L +L + Sbjct: 73 RLIKVYPGDLVITNTSENVEDVCKAVAWLGDVQIVTGGHATVLKHDQDPKYLSYYLQTPR 132 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +A+ G + K + I +P+PPL Q I + + A T L E Sbjct: 133 FFAEKKALATGTKVIEVTAKSLAKIKIPVPPLEVQREIVKVLDAFTQLEAELEAELEARR 192 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + + AL + M + G Sbjct: 193 RQYRHYRDALF--ALGNQDVSWTTMAEVG-----------------EFFRGRRFTKDDYA 233 Query: 253 ESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + YG+I + ++ + + G++V + + A Sbjct: 234 PDGVECIHYGDIYTQYGVAATATVSHVRSDMMPILRFAKRGDVVIAGVGETVEDVGKAVA 293 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPV 367 + + + H ++ ++++ ++ + L E + +L + Sbjct: 294 WLGDGEVAIHDDCFAFRHSLNPKFVSYYFQTTAFHAEKNKFVARAKVKRLSGESLGKLAI 353 Query: 368 LVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKERRSSFIA 412 VPP+ EQ I +++ A + L + + R ++ Sbjct: 354 PVPPLAEQERIVAILDAFDALVSDLSCGLPAEIAARRRQYEH---YRDRLLS 402 >gi|254491609|ref|ZP_05104788.1| Type I restriction modification DNA specificity domain protein [Methylophaga thiooxidans DMS010] gi|224463087|gb|EEF79357.1| Type I restriction modification DNA specificity domain protein [Methylophaga thiooxydans DMS010] Length = 402 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 53/408 (12%), Positives = 126/408 (30%), Gaps = 32/408 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST-VSIFAK 83 W + F ++ G++ E G ++ + + Q T+ Sbjct: 3 WSSHKLGDFCEVIAGQSPEGKYYNDSGDGLPFYQGKKEFGERYIGAPQKWTTKITKKANS 62 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G IL P I+ ++ D L + L Q EG Sbjct: 63 GDILMSVRAPV-GPINISIEQICIGRGLAAIRASDKLDRDFLFYYLLS--KQDEIQGNEG 119 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 A + + I + + L EQ I + ID + ++ +E ++ + Sbjct: 120 AVFASINKSQIEELSISYVDLKEQKRIVAILDQAFADIDKARALTEQNLKNARELFESYL 179 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL-SYG 262 + + +G + + K+ ++ + L L + G Sbjct: 180 QQVFNQ---------------LGEEVVQTSLGNICSFKHGFAFKSEYFVDDSALVLLTPG 224 Query: 263 NIIQKLETRNMGLKPESYE----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 N ++ R+ G K + Y+ ++ G+++ + + + + + Sbjct: 225 NFYEEGGYRDRGHKQKYYDGPFPQEFLLSKGDLLVAMTEQAEGLLGSPALIPEDEVFLHN 284 Query: 319 ------AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP- 370 + +D +L L + SGL + + ++ + V +P Sbjct: 285 QRLGLVDIKSEYSESVDLEFLYHLFNTKYFRAKVQETASGLKVRHTSPKKMEAIKVSIPT 344 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +Q I + + + L L++ ++S + A +G+ Sbjct: 345 SLNQQKTIAKSLFNLKEKCNQLESIYLLKQAELEDLKNSLLQKAFSGE 392 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 60/189 (31%), Gaps = 6/189 (3%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 + ++ F ++ + + +S Y + E + + + + Sbjct: 1 MPWSSHKLGDFCEVIAGQSPEGKYYNDSGDGLPFYQGKKEFGERYIGAPQKWTTKITKKA 60 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 + G+I+ + RG+ +D +L + + S Sbjct: 61 NSGDILMSVRAPVGPINISIEQICIGRGLA----AIRASDKLDRDFLFYYLLSK--QDEI 114 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + S+ ++ L + +KEQ I +++ A ID EQ++ +E Sbjct: 115 QGNEGAVFASINKSQIEELSISYVDLKEQKRIVAILDQAFADIDKARALTEQNLKNAREL 174 Query: 407 RSSFIAAAV 415 S++ Sbjct: 175 FESYLQQVF 183 >gi|290969063|ref|ZP_06560598.1| type I restriction modification DNA specificity domain protein [Megasphaera genomosp. type_1 str. 28L] gi|290781019|gb|EFD93612.1| type I restriction modification DNA specificity domain protein [Megasphaera genomosp. type_1 str. 28L] Length = 625 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 55/402 (13%), Positives = 123/402 (30%), Gaps = 34/402 (8%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIA 101 I ++ E V G + K GN + + + + K G K Sbjct: 218 DDGIPFLSAEAVSDGKIHFDKKRGNITKEFDEECCKKYKPQRNDVFMVKSGSTTGKVGYV 277 Query: 102 D---FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIP 158 D I S + + L L S V I++ + + + + Sbjct: 278 DTDERFNIWSPIAALRVNDNNSSRYLFHLLQSTSVQNMIKSKASHGSQPNLGMRVLEQFE 337 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLIT----ERIRFIELLKEKKQALVSYIVTKGLNPD 214 +P+PPL Q+ I E + L E + + + AL++Y T + P Sbjct: 338 VPMPPLDVQIKIAEVLDNFDAICSDLNIGLPAEIEARQKQYEYYRDALLTYAATGKIIPR 397 Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTEL---------------NRKNTKLIESNILSL 259 + + + H + V + N + ++ + + Sbjct: 398 QTDRQTDRQTDRQTDRHNALIKLCQYVFGVVLVKLSDIAMITRGGNFQKKDFTDTGVPCI 457 Query: 260 SYGNIIQKLETR----NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 YG + + + + ++ +IV + +A + I Sbjct: 458 HYGQMYTHFGIYATEPLKYISEDVAKKSKMAVKNDIVMAVTSENVEDVCKCTAWLGNENI 517 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKE 374 S + A+ H ++ YL++ S + G + + + + + +P + E Sbjct: 518 AVSGHTAIIHHNQNAKYLSYYFHSAMFFAQKKRLAHGTKVIEVTPNTLNDIVIPLPSLAE 577 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q I +++ A + + I ++ R + + Sbjct: 578 QERIVGILDRFDALCHDISTGLPAEIEARQKQYEYYRDTLLN 619 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 61/404 (15%), Positives = 119/404 (29%), Gaps = 44/404 (10%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 +K +++ G TS + K+ + V G D N+R + I Sbjct: 21 LKNISEMQRG-TSLTKKNATSGNIPVVSGGREPAFYCDTNNRDGE----------TITVA 69 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 G S F V + + I + +G + H Sbjct: 70 GSGAGAGYVQYWIEPIFVSDAFSVKSNEKT--TTKYLYYCLEGKQDFIYSTQKGGGVPHV 127 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 I N+ +P+PPL Q I + + LI + + K++ + ++ Sbjct: 128 HISSIENMKLPVPPLEVQREIVRILDSFMELTAELIAKLTAELTARKKQYEFYRDELLNN 187 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 N +V+ + +T+ L++ I LS + Sbjct: 188 NQNVNVR-------------VGKLIDMLSQPITDGPHTTPVLVDDGIPFLSAEAVSDGKI 234 Query: 270 TRNMGL------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + E ++ K I + Sbjct: 235 HFDKKRGNITKEFDEECCKKYKPQRNDVFMVKSGSTTGKVGYVDTDERFN-IWSPIAALR 293 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 S YL L++S + + + S G + +L +++ V +PP+ Q I V+ Sbjct: 294 VNDNNSSRYLFHLLQSTSVQNMIKSKASHGSQPNLGMRVLEQFEVPMPPLDVQIKIAEVL 353 Query: 383 NVETARIDVL-------VEKIEQSIVLLKERRSSFIAAAVTGQI 419 + A L +E ++ R + + A TG+I Sbjct: 354 DNFDAICSDLNIGLPAEIEARQKQYEY---YRDALLTYAATGKI 394 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 49/167 (29%), Gaps = 11/167 (6%) Query: 27 VVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV-SIF 81 +V + + G + + I + + G Y + D + + Sbjct: 429 LVKLSDIAMITRGGNFQKKDFTDTGVPCIHYGQMYTHFGIYATEPLKYISEDVAKKSKMA 488 Query: 82 AKGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 K I+ A + + + S ++ + + L + S + Sbjct: 489 VKNDIVMAVTSENVEDVCKCTAWLGNENIAVSGHTAIIH-HNQNAKYLSYYFHSAMFFAQ 547 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + G + + +I +P+P LAEQ I + Sbjct: 548 KKRLAHGTKVIEVTPNTLNDIVIPLPSLAEQERIVGILDRFDALCHD 594 >gi|258513149|ref|YP_003189405.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256635052|dbj|BAI01026.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256638107|dbj|BAI04074.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-03] gi|256641161|dbj|BAI07121.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-07] gi|256644216|dbj|BAI10169.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-22] gi|256647271|dbj|BAI13217.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-26] gi|256650324|dbj|BAI16263.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-32] gi|256653315|dbj|BAI19247.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01-42C] gi|256656368|dbj|BAI22293.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-12] Length = 384 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 61/421 (14%), Positives = 135/421 (32%), Gaps = 59/421 (14%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS- 79 +P+ WK ++++ + G + + + + G + G Q D Sbjct: 2 LPEGWKETTLEKYIHVKHGYAFKG--EYFSNSGKYIVLTPGNFFETGGFKEQKDKIKYYS 59 Query: 80 -------IFAKGQILYGKL----GPYLRKAIIADFDGICSTQFL----VLQPKDVLPELL 124 I KG + G A I + D Q + + P V + L Sbjct: 60 GEIPKEYILKKGDCILAMTEQGAGLLGSAAFIPNDDKFLHNQRIGLIEITDPNSVSSDFL 119 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 I G + H K + +I + +PPL+EQ I + D Sbjct: 120 YWLYNDPKNRLIISNEAGGTKVKHTSPKKLVDISILLPPLSEQKKIAAIL----STWDRA 175 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I E + + +++K+AL+ ++ +G + + W+ K + Sbjct: 176 IEETEKLLANSQQQKKALMQQLL------------TGKKRLPGFTGEWKTKYLGDIADIQ 223 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + + ++ + + + T N L I + Sbjct: 224 TGSSNRQDSLTNGEYTFFDRSEDIRTSNRYLFDCE--------------AVIVPGEGQDF 269 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAW---LMRSYDLCKVFYAMGSGLRQSLKFED 361 + V + + Y D ++ + RS+ +SL+ Sbjct: 270 VPKYFVGKFDLHQRTYAISCFQACDGKFIFYTVGYHRSHF----LSQAVGSTVKSLRLPM 325 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +++P+ +PP+ EQ I V+ ++ +E + L++ + + + +TG+ + Sbjct: 326 FQKMPLKLPPLSEQRAIAAVLTTADEKL----AALESDLSRLRQEKKALMQQLLTGKRRV 381 Query: 422 R 422 Sbjct: 382 T 382 Score = 97.9 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 66/207 (31%), Gaps = 9/207 (4%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 W + + N+ + K + + Sbjct: 6 WKETTLEKYIHVKHGYAFKGEYFSNSGKYIVLTPGNFFETGGFKEQKDKIKYYSGEIPKE 65 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM----AVKPHGIDSTYLAWLMRS 339 I+ G+ + + + + + + + P+ + S +L WL Sbjct: 66 YILKKGDCILAMTEQGAGLLGSAAFIPNDDKFLHNQRIGLIEITDPNSVSSDFLYWLYND 125 Query: 340 YDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + G + + + + +L+PP+ EQ I +++ D +E+ E+ Sbjct: 126 PKNRLIISNEAGGTKVKHTSPKKLVDISILLPPLSEQKKIAAILSTW----DRAIEETEK 181 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGES 425 + ++++ + + +TG+ L G + Sbjct: 182 LLANSQQQKKALMQQLLTGKKRLPGFT 208 >gi|296100301|ref|YP_003620471.1| type I restriction enzyme specificity protein [Leuconostoc kimchii IMSNU 11154] gi|295831618|gb|ADG39502.1| type I restriction enzyme specificity protein [Leuconostoc kimchii IMSNU 11154] Length = 389 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 64/397 (16%), Positives = 156/397 (39%), Gaps = 35/397 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + K + ++ + + + S ++G + I G Sbjct: 17 WEERKLGDLLKEFSIKSKIEDEH------KVLSSTNSGMEFREGRVSGTSNLGYKIIKNG 70 Query: 85 QILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE- 142 ++ +L I + G+ S + + ++ + L + + + + Sbjct: 71 DLVLSPQNLWLGNININNIGKGLVSPSYKTFEFINIDSSFINPQLRTQKMLEEYKNSSTQ 130 Query: 143 --GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + I + +P ++EQ I ++D I R ++LLKE+K+ Sbjct: 131 GASVVRRNLEIDSFYQIKIFVPTISEQEKIGSF----FKQLDNTIDLHQRKLDLLKEQKK 186 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + + K +++ +G D WE + L+ E + K+ E +LS + Sbjct: 187 GFLQKMFPKNGEKVPELRFAG------FADDWEERKLGDLLKEFSIKSKIEDEHKVLSST 240 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + E R + S Y+I+ G++V +L ++ + +G+++ +Y Sbjct: 241 NSGM----EFREGRVSGTSNLGYKIIKNGDLVLSPQNLWLGNINI---NNIGKGLVSPSY 293 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAM----GSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + IDS+++ +R+ + + + S +R++L+ + ++ + VP I EQ Sbjct: 294 KTFEFINIDSSFINPQLRTQKMLEEYKNSSTQGASVVRRNLEIDSFYQIKIFVPTISEQE 353 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + ++D ++ ++ + LLKE++ F+ Sbjct: 354 KIGSF----FKQLDNTIDLHQRKLDLLKEQKKGFLQK 386 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 92/203 (45%), Gaps = 16/203 (7%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + K I + G D WE + L+ E + K+ E +LS + + E R Sbjct: 1 MSNKVPQIRFNGYS-DTWEERKLGDLLKEFSIKSKIEDEHKVLSSTNSGM----EFREGR 55 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + S Y+I+ G++V +L ++ + +G+++ +Y + IDS+++ Sbjct: 56 VSGTSNLGYKIIKNGDLVLSPQNLWLGNINI---NNIGKGLVSPSYKTFEFINIDSSFIN 112 Query: 335 WLMRSYDLCKVFYAM----GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 +R+ + + + S +R++L+ + ++ + VP I EQ I + ++D Sbjct: 113 PQLRTQKMLEEYKNSSTQGASVVRRNLEIDSFYQIKIFVPTISEQEKIGSF----FKQLD 168 Query: 391 VLVEKIEQSIVLLKERRSSFIAA 413 ++ ++ + LLKE++ F+ Sbjct: 169 NTIDLHQRKLDLLKEQKKGFLQK 191 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 66/200 (33%), Gaps = 22/200 (11%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ + K + ++ + + + S ++G Sbjct: 199 KVPELRFAGFADDWEERKLGDLLKEFSIKSKIEDEH------KVLSSTNSGMEFREGRVS 252 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWLLS 130 + I G ++ +L I + G+ S + + ++ + L + Sbjct: 253 GTSNLGYKIIKNGDLVLSPQNLWLGNININNIGKGLVSPSYKTFEFINIDSSFINPQLRT 312 Query: 131 IDVTQRIEAICE---GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + + + + I + +P ++EQ I ++D I Sbjct: 313 QKMLEEYKNSSTQGASVVRRNLEIDSFYQIKIFVPTISEQEKIGSF----FKQLDNTIDL 368 Query: 188 RIRFIELLKEKKQALVSYIV 207 R ++LLKE+K+ + + Sbjct: 369 HQRKLDLLKEQKKGFLQKMF 388 >gi|229512707|ref|ZP_04402175.1| type I restriction-modification system specificity subunit S [Vibrio cholerae TMA 21] gi|229350217|gb|EEO15169.1| type I restriction-modification system specificity subunit S [Vibrio cholerae TMA 21] Length = 379 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 57/398 (14%), Positives = 125/398 (31%), Gaps = 43/398 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W +K ++ G+ + D IY P G+ + ++ Sbjct: 15 NGWPRCTLKDTFTIHYGKDHKLLSDGIY--------------PLLGSGGVMRYVSSYLYD 60 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K +L G+ G + I+ T F + +P + + R + E Sbjct: 61 KPSVLIGRKGTIDKPQFISTPFWTVDTLFYTEIKNNFVPYFVYLL----SLRIRWKKYSE 116 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + I I + +P + EQ I + +DT I + LLKE K+ + Sbjct: 117 ATGVPSLNVTSIYGIQINVPSVEEQQKIANFL----TTVDTKINQLTEKHRLLKEYKKGV 172 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + ++ K + W + + ++ + ++ + Sbjct: 173 MQQLFSQ--------KIRFKDEGHKAFPDWTQERLDYFIERISDPVSVDSQTEYREIGIR 224 Query: 263 NIIQKLETRNMGL-KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-- 319 + + + + + V P +V + R++ E G I S Sbjct: 225 SHGKGIFHKESTTGDDIGNKRVFWVKPNALVLNIVFAWE--RAVAVTSNNENGFIASHRF 282 Query: 320 -YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQ 375 K + D YL + S + G ++L + +L V +P +EQ Sbjct: 283 PMYIPKANRADVNYLLYFFLSPKGEALLNLASPGGAGRNKTLGQSEFMKLKVRLPSQQEQ 342 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + ++ + + + I K+ + + Sbjct: 343 QKIAQFLQALDSK----ITAVSEQIEQTKQFKKGLLQQ 376 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 48/127 (37%), Gaps = 5/127 (3%) Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 L K ++ Q + T + + + S + Y+ +G+ SL Sbjct: 65 LIGRKGTIDKPQFISTPFWTVDTLFYTEIKNNFVPYFVYLLSLRIRWKKYSEATGV-PSL 123 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + + + VP ++EQ I N + +I+ L EK LLKE + + + Sbjct: 124 NVTSIYGIQINVPSVEEQQKIANFLTTVDTKINQLTEKHR----LLKEYKKGVMQQLFSQ 179 Query: 418 QIDLRGE 424 +I + E Sbjct: 180 KIRFKDE 186 >gi|220908458|ref|YP_002483769.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 7425] gi|219865069|gb|ACL45408.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7425] Length = 388 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 62/412 (15%), Positives = 129/412 (31%), Gaps = 36/412 (8%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 K + V G IP++W + + L G L + G + Sbjct: 11 KQTEV---GLIPENWADLLLGEVITLQRG-----------FDLPNRSRRKGDIPIISSSG 56 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 ++ S G ++ G+ G + +T V K P + +L + Sbjct: 57 VTDTHNSASALGPG-VITGRYGTIGEVFFVEGDYWPLNTTLFVSNFKGNDPLFIYFFLKT 115 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 ID + + + + I + PPL EQ I + + I L + Sbjct: 116 IDYK----TYSGKSGVPGVNRNDLHEIRIKCPPLPEQRSIAQALSDVDALIAALDKTIAK 171 Query: 191 FIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + Q L++ G N ++K + + + +P + N + Sbjct: 172 KRAIKTATMQQLLTGKKRLPGFNGVWEVKQ--LRELAHIQRGASPRPIDNPIWFNNNSSV 229 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + ++ S L L P + + VD ++ R Sbjct: 230 GWVRISDVTRSGM----YLSETEQKLSPLGVQHSRPVDKNSLIMS-----ICATVGRPII 280 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 I ++ + ++ ++++ + +G + +L E + V V Sbjct: 281 TEIDVCIHDGFVVFDSLQAEQRFMYYVLKWIEP-DWSKHGQTGSQMNLNTELINSTTVRV 339 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 PP EQ I V++ A + +E V + + + +TG+ L Sbjct: 340 PPPPEQTAIATVLSDMDAE----IAALEARRVKTQAIKQGMMQELLTGRTRL 387 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 19/159 (11%), Positives = 58/159 (36%), Gaps = 12/159 (7%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + +++ + + PG ++ + + + + + Sbjct: 46 KGDIPIISSSGVTDTHNSASALGPG-VITGRYGTIGEVFFV----EGDYWPLNTTLFVSN 100 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 G D ++ + +++ D G + D+ + + PP+ EQ I ++ Sbjct: 101 FKGNDPLFIYFFLKTIDYKTY---SGKSGVPGVNRNDLHEIRIKCPPLPEQRSIAQALSD 157 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 D L+ ++++I + +++ + +TG+ L G Sbjct: 158 V----DALIAALDKTIAKKRAIKTATMQQLLTGKKRLPG 192 >gi|309704072|emb|CBJ03418.1| specificity determinant for hsdM and hsdR [Escherichia coli ETEC H10407] Length = 396 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 47/400 (11%), Positives = 110/400 (27%), Gaps = 25/400 (6%) Query: 29 PIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 I+ F +G T K DI +I D++ + + + S+ I Sbjct: 7 KIEDFCSTGSGGTPSRAKPEYYEGGDIPWIKSGDLKDSKIYEANEYITAAGLENSSAKIV 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K IL G + + I + + ++P + ++ + Sbjct: 67 EKDSILIAMYGATVGRLAILGINAATNQAICNIRPDTTIADMKYLYYFLKSKFSYFVENA 126 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + I ++ +P+P L EQ I + + + L+ Sbjct: 127 VGGAQPNISQGLIKSLEVPLPSLDEQKRIADILDKAAGVCQKREQAIKLADDFLRATFLE 186 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + V G+ + K+ + I S++ Sbjct: 187 IFGDPVKNPKGWKKNKIKKGVLDITSGWSATGENIPC--------KSDEFGVLKISSVTS 238 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G + + + G+++F + + ++ + + Sbjct: 239 GIFKPEENKMVDSETILASKKLIFPKKGDLLFSRANTRELVAAICMVHQDYDNLFLPDKL 298 Query: 322 AVKPHGID---STYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQ 375 D + L+++ + + +G ++ + ++ P I Q Sbjct: 299 WSIKLDHDLLLPEFFLVLIQNEKIRDLLTKQATGTSGSMLNISKNKFEETEIIFPEINVQ 358 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + L EK+ +S L E +S Sbjct: 359 K----YFCNTFRKTINLKEKLIKSNELANESFNSLSQKVF 394 >gi|251791239|ref|YP_003005960.1| restriction modification system DNA specificity domain-containing protein [Dickeya zeae Ech1591] gi|247539860|gb|ACT08481.1| restriction modification system DNA specificity domain protein [Dickeya zeae Ech1591] Length = 459 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 64/416 (15%), Positives = 146/416 (35%), Gaps = 25/416 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W + I +LN E ++ ++ + V + + + + + Sbjct: 5 KLPQGWVLSAIGNVCELNPKDKLEDELEVGFMPMAGVPTNYLGHCKFEKKTWIQVKKGFT 64 Query: 80 IFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGW---LLS 130 F G ++ K+ P + + G ST++ VL+P + + + + Sbjct: 65 QFKNGDAIFAKITPCFENSKAAVINGFPNNYGAGSTEYYVLRPNNSVVDAHWLFALVKTK 124 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +T + + P+P+PPL EQ +I EK+ ++D++ + Sbjct: 125 EFLTIGAMNMSGSVGHKRVPKDFVLRYPLPLPPLIEQSIIIEKLDTLLAQVDSIKAHLEK 184 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++K+ ++A++S ++ L+ +K I + + K + Sbjct: 185 IPLIIKKFRRAMLSSVINSKLSNTSIIKKVKISDITNIISGIAFKKNQYS----ESGSKL 240 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR----FIDLQNDKRSLR 306 L +NI N+ + ++ +IV + + Sbjct: 241 LQIANISYGETCWNNTSYIPFNL----ADDYSRCDLETNDIVLALNRPITNNSLKVALIN 296 Query: 307 SAQVMERGIITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDV 362 A + A + V ID +L +M S + K G + L + Sbjct: 297 DADLPATLYQRVARIRVPSKFIDIIYPKFLFIIMLSDEFRKEVERNLQGSDQPYLNTSQL 356 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + PP++EQ +I + A D + ++++ ++ + S +A A G+ Sbjct: 357 YNFEIQYPPLEEQAEIVRRVGQLFAYADGVEKQVQSALERVNNLTQSILAKAFRGE 412 >gi|261820963|ref|YP_003259069.1| restriction modification system DNA specificity domain protein [Pectobacterium wasabiae WPP163] gi|261604976|gb|ACX87462.1| restriction modification system DNA specificity domain protein [Pectobacterium wasabiae WPP163] Length = 493 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 70/463 (15%), Positives = 140/463 (30%), Gaps = 78/463 (16%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G +P+ WK + + +L G++ L Y N S Sbjct: 4 VGKLPEGWKNIHLGDVIELKYGKS-----------LAAQVRDGIGYPVFGSNGIVGKHSI 52 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 I G ++ G+ G Y + T + + + + +L + +T Sbjct: 53 PLIKQSG-LIVGRKGSYGVVQKSVEPFFPIDTTYYIDELFNQPINFWFYYLSFLPLT--- 108 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + T+ + N+ + +PPL EQ +I EK+ ++D+ + ++LK Sbjct: 109 -KLNRSTTIPGLNRDDAYNLSINLPPLVEQKIIAEKLDTLLAQVDSTKARLEQIPKILKR 167 Query: 198 KKQALVSYIVTKGLNPDVK----------------MKDSGIEWV----------GLVPDH 231 +QA+++ + L + K WV G P + Sbjct: 168 FRQAVLASALRGELTKKWRIDNKTGQDISSFKASVKKYRFESWVKEQEQKFINKGKQPRN 227 Query: 232 WEVKPFFALVTELNRKNTKLIESNILS--------------------------------- 258 K + + K I L Sbjct: 228 DNWKKKYQEAIISQDISDKDIPDGWLFEPLDGLVYISARIGWKGLKASEYTVKGPLFLSV 287 Query: 259 --LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 L+YG + ++ + +I+ K S+ I Sbjct: 288 HSLNYGKEANLEQAYHISEHRYDESPEIKLQNNDILLCKDGAGIGKLSIVKNLNEPATIN 347 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 +S + YL + + ++ + M L DVK + VPP+ EQ Sbjct: 348 SSLLLIRGGDFFVPEYLFYFLSGPEMQNLVKERMTGSAVPHLFQRDVKEFVLEVPPLNEQ 407 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +I + A D + +++ ++ + S +A A G+ Sbjct: 408 HEIVRRVEQLFAYADTIEKQVNTALSRVNNLTQSILAKAFRGE 450 >gi|194426529|ref|ZP_03059083.1| type I restriction modification DNA specificity domain protein [Escherichia coli B171] gi|194415268|gb|EDX31536.1| type I restriction modification DNA specificity domain protein [Escherichia coli B171] gi|195183369|dbj|BAG66906.1| predicted type I restriction system specificity protein [Escherichia coli O111:H-] Length = 421 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 59/406 (14%), Positives = 125/406 (30%), Gaps = 45/406 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + +P+ + L G T K DI + ++D+ Sbjct: 17 EWLPLSKVFNLRNGYTPSKTKKEFWANGDIPWFRMDDIRENGRILGNSLQKISSCAVKGG 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---QGWLLSIDVTQ 135 +F + IL A+I + + +F L K+ + + + + Sbjct: 77 KLFPENSILISTSATIGEHALITVPH-LANQRFTCLALKESYADCFDIKFLFYYCFSLAE 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITER 188 ++ + D G +P P LA Q I + T L E Sbjct: 136 WCRKNTTMSSFASVDMDGFKKFLIPRPCPDNPEKSLAIQSEIVRILDKFTALTAELTAEL 195 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + K++ +++ +S +EW L+ V + K Sbjct: 196 TAELNMRKKQYNYYRDQLLS--------FDESSVEWKTLLEACDYV--------DYRGKT 239 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPGEIVFRFIDLQNDK 302 K +S I ++ NI + + S E Y IV G+++ Sbjct: 240 PKKTQSGIFLVTAKNIRMGYIDYHASQEFISEEDYAIVMRRGLPKKGDVLITTEAPCGFV 299 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYL--AWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + + + + S +L+ S K+ A + +K Sbjct: 300 AQVNRENI---ALAQRVIKYRSKNTQLSNSFLKHYLLGSQFQDKLMQAATGSTVKGIKGS 356 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + +L + +P EQ I +++ + + E + + I L +++ Sbjct: 357 RLHQLKIPIPSKVEQDRIVAILDKFDTLTNSITEGLPREIELRQKQ 402 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 58/210 (27%), Gaps = 19/210 (9%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL---ETRNM 273 M +EW+ L +V T K +I +I + Sbjct: 11 MDGVEVEWLPLS----KVFNLRNGYTPSKTKKEFWANGDIPWFRMDDIRENGRILGNSLQ 66 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + + ++ I+ + + + + A D +L Sbjct: 67 KISSCAVKGGKLFPENSILISTSATIGEHALITVPHLANQRFTCLALKESYADCFDIKFL 126 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVET 386 + S S+ + K+ + P + Q +I +++ T Sbjct: 127 FYYCFSLA-EWCRKNTTMSSFASVDMDGFKKFLIPRPCPDNPEKSLAIQSEIVRILDKFT 185 Query: 387 ARIDVLVEKIEQSIVLLKE----RRSSFIA 412 A L ++ + + K+ R ++ Sbjct: 186 ALTAELTAELTAELNMRKKQYNYYRDQLLS 215 >gi|110679503|ref|YP_682510.1| type I restriction enzyme specificity subunit, putative [Roseobacter denitrificans OCh 114] gi|109455619|gb|ABG31824.1| type I restriction enzyme specificity subunit, putative [Roseobacter denitrificans OCh 114] Length = 379 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 50/400 (12%), Positives = 120/400 (30%), Gaps = 31/400 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+V P+ KL+ G+ + +GT +G SD ++ Sbjct: 4 GWEVKPLGEVAKLHYGKALAESERSP--------NGTVPVYGANGVLGWSDH---TLTEG 52 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ G+ G + + + L + L + + Sbjct: 53 PSLIVGRKGSAGEVNRVDGPFWPSDVTYYTEHDPNRLDFDYFHYGLMTLNLPSLAKGVK- 111 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + +PIPPL EQ I + A R+D ++ +E + Sbjct: 112 ---PGINRNDVYELGLPIPPLEEQKRIVAILDAAFERLDRAKENAEANLQNARELFDRTL 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + + + +K + + + + + +L ++ N Sbjct: 169 ERVFAELVAVHATIKLEEV-----------TSKITKGSSPKWQGFSYVDSPGVLFVTSEN 217 Query: 264 IIQKLETRNMGLKPES----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + E + I+ PG+++ + + ++ + Sbjct: 218 VGKNELLLEKTKYVEEGFNQKDRKSILAPGDVLSNIVGASIGRTAVFDLDAVANINQAVC 277 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 M P + +L++L+ S ++ + R +L + V +P ++ Q I Sbjct: 278 LMRCLPERLSPKFLSFLLNSPYFKARLHEGESNMARANLSLAFFREFLVPLPELEAQERI 337 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I + + + R S + A G+ Sbjct: 338 VQEIEELATHSAECETNYRTKLTDIADLRQSLLQKAFAGE 377 >gi|104774037|ref|YP_619017.1| Type I restriction-modification system, specificity subunit [Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842] gi|103423118|emb|CAI97857.1| Type I restriction-modification system, specificity subunit [Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842] Length = 387 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 55/391 (14%), Positives = 124/391 (31%), Gaps = 27/391 (6%) Query: 36 LNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 L +G T K D +I D+ +G K + + I + I+ Sbjct: 8 LYSGNTPSRKKSANFGGDTPWIRTADLNNGLIKSATEFLTY--EGIKQLKILPENTIVLA 65 Query: 90 KLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G + + + I F + + L P + + L+ + Sbjct: 66 MYGGFNQIGRTGILGFPATINQALVALTPHKNINQFFAQSYLNRHIIDWRRVAASSRKDP 125 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + L EQ I + I I ++ + L Q + + Sbjct: 126 NITKEDVEKSEFSFGSLEEQNRISKLISRLDHTITLHEEKKRQLERLKSALLQKMFAD-- 183 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + P V+ + EW + ++ L K++K + + + NI+ Sbjct: 184 -ESGYPVVRFEGFSDEW-----EERKLGDIAPLRGGFAFKSSKFRNTGVPIVRISNILSS 237 Query: 268 LET--RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 E + + I+ V K ++ S ++ +P Sbjct: 238 GEVGGDFAYYDEQDKDDKYILPDKSAVLAMSGATTGKVAILSQTDYDKVYQNQRVGYFQP 297 Query: 326 -HGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVI 382 ID +++ ++RS + + SG + ++ E++ ++P + +EQ I Sbjct: 298 VDYIDYGFISTIVRSELFMMQLESVLVSGAQPNVSSEEIDSFNFMIPILVQEQQKIGQF- 356 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++D + +Q I + + S + Sbjct: 357 ---FKQLDDTIALHQQKINNINSVKKSLLQK 384 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 63/193 (32%), Gaps = 13/193 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + L G +S K + + + ++ S +G+ + D Sbjct: 198 EWEERKLGDIAPLRGGFAFKSSKFRNTGVPIVRISNILS-SGEVGGDFAYYDEQDKDDKY 256 Query: 80 IFAKGQILYGKLGPYLRKAII----ADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VT 134 I + G K I + + QP D + ++ + Sbjct: 257 ILPDKSAVLAMSGATTGKVAILSQTDYDKVYQNQRVGYFQPVDYIDYGFISTIVRSELFM 316 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++E++ + + I + IP L ++ +KI ++D I + I Sbjct: 317 MQLESVLVSGAQPNVSSEEIDSFNFMIPILVQEQ---QKIGQFFKQLDDTIALHQQKINN 373 Query: 195 LKEKKQALVSYIV 207 + K++L+ + Sbjct: 374 INSVKKSLLQKMF 386 >gi|320120588|gb|EFE29118.2| type I restriction/modification specificity protein [Filifactor alocis ATCC 35896] Length = 387 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 58/404 (14%), Positives = 116/404 (28%), Gaps = 30/404 (7%) Query: 28 VPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 ++ TG+T + YI E++ + T KG + Sbjct: 3 CKLEEICSFRTGKTDVANLTTERYISTENMLPNKSGIVNATSLPIVDLTQAY---EKGDV 59 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGAT 145 L + PY +K A +G CS LV K+ ++L+ D A +G Sbjct: 60 LVSNIRPYFKKIWKAKINGGCSNDVLVFTAKENTDSDFLYYVLANDAFFAYAMATSKGTK 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M D K I +P + Q I + ID I L+++ + L Sbjct: 120 MPRGDKKSIMQYEVPCYDIETQQKIASIL----KSIDEKIELNNAINNNLEQQAKTLFKS 175 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA------LVTELNRKNTKLIESNILSL 259 + G PD W + + K + I + Sbjct: 176 WFVDC-----------EPFNGKQPDDWILGTIDDLAKDVVCGKTPSTKKEEYYGGYIPFI 224 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + ++ + + N + V L + + Sbjct: 225 TIPDMHNCVYSLNTARSLSTLGAESQSKKTLPVNSVCVSCIGTAGLVTLVPVPSQTNQQI 284 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + + Y+ LM++ +L ++ V++P K + Sbjct: 285 NSIIPKNTVSPYYVYLLMKTMSEIINKLGQSGSTIVNLNKAQFGKIEVIIPSTKVMLEF- 343 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 I L+ ++ L R + + ++G++D+ Sbjct: 344 ---TELVEPIFELILLNQKENNRLSNLRDTLLPKLMSGELDVSD 384 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 62/194 (31%), Gaps = 9/194 (4%) Query: 19 GAIPKHWKVVPIKRFTK-LNTGRTSESGKD------IIYIGLEDVESG-TGKYLPKDGNS 70 G P W + I K + G+T + K+ I +I + D+ + + ++ Sbjct: 185 GKQPDDWILGTIDDLAKDVVCGKTPSTKKEEYYGGYIPFITIPDMHNCVYSLNTARSLST 244 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 +++ + + +G + + Q + PK+ + L+ Sbjct: 245 LGAESQSKKTLPVNSVCVSCIG-TAGLVTLVPVPSQTNQQINSIIPKNTVSPYYVYLLMK 303 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + G+T+ + + G I + IP + E + I E R Sbjct: 304 TMSEIINKLGQSGSTIVNLNKAQFGKIEVIIPSTKVMLEFTELVEPIFELILLNQKENNR 363 Query: 191 FIELLKEKKQALVS 204 L L+S Sbjct: 364 LSNLRDTLLPKLMS 377 >gi|189485198|ref|YP_001956139.1| type I restriction-modification system substrate-binding subunit [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287157|dbj|BAG13678.1| type I restriction-modification system substrate-binding subunit [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 415 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 58/408 (14%), Positives = 136/408 (33%), Gaps = 25/408 (6%) Query: 26 KVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQS---DTS 76 +V + ++ G T ++ ++I ++ ++ K S Sbjct: 8 EVKKLGEICEIVNGGTPKTNVREYWNGTNLWITPAEMGKREIPFVEKTVRQLSDSGLKNS 67 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + ++ P I + L P L L + L V Sbjct: 68 SAKLLPPYSVILSSRAPIGHLVINTKPMA-TNQGCKGLIPSGKLFYLFLYYYLYFSV-DY 125 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++ + G T + + +P P L+EQ I K+ + I L + I+ +K Sbjct: 126 LDKLGTGTTFKELPTWKLKEVEIPFPLLSEQKRIVTKLDKFSENIKRLEDAARKNIQNVK 185 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + ++++ + + V E+ F + K I + Sbjct: 186 DLFNSVLNETFKNKSAVVNDNRQVYKKAHWEVKKLGEICTFINGLW--AGKKCPFINVYV 243 Query: 257 LSLSYGNIIQKLETRNM---GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + KL+ N+ L+ + YE ++ I+ + +++ Sbjct: 244 IRNTNFTKDGKLDLSNVVNLSLEKKQYEKKRLEYDDIILEKSGGGPKQPVGRVVLFDIKK 303 Query: 314 GIITSAYMAVKPHGIDSTYLA--------WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 G + + I+ Y+ + + +V + +G+R +L F++ K++ Sbjct: 304 GNFSFSNFTSVIRIINKRYVYPKYLYNYLFYCYISGMTEVMQSHSTGIR-NLNFDEYKKI 362 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ P I EQ I ++ + + L ++ I L E + S + Sbjct: 363 NIVFPSISEQKKIVARLDKLSTKTKKLEIVYQEKIDGLAELKKSVLKQ 410 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 71/204 (34%), Gaps = 19/204 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVES----GTGKYLPKDGNSRQSDTSTV- 78 HW+V + G +GK +I + + + GK + + + Sbjct: 214 HWEVKKLGEICTFINGL--WAGKKCPFINVYVIRNTNFTKDGKLDLSNVVNLSLEKKQYE 271 Query: 79 -SIFAKGQILYGKLG-----PYLRKAIIADFDGICS-----TQFLVLQPKDVLPELLQGW 127 I+ K G P R + G S + ++ + V P+ L + Sbjct: 272 KKRLEYDDIILEKSGGGPKQPVGRVVLFDIKKGNFSFSNFTSVIRIINKRYVYPKYLYNY 331 Query: 128 LLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L ++ E + +T + + ++ I + P ++EQ I ++ + + L Sbjct: 332 LFYCYISGMTEVMQSHSTGIRNLNFDEYKKINIVFPSISEQKKIVARLDKLSTKTKKLEI 391 Query: 187 ERIRFIELLKEKKQALVSYIVTKG 210 I+ L E K++++ G Sbjct: 392 VYQEKIDGLAELKKSVLKQTFDCG 415 >gi|299148889|ref|ZP_07041951.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_23] gi|298513650|gb|EFI37537.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_23] Length = 464 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 60/399 (15%), Positives = 131/399 (32%), Gaps = 31/399 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +PK W K T + + I +++++G + KD + + Sbjct: 69 EVPKGWVWTTFGNVCKKLTDGSHNPPPKCSNGYTVISAQNIKNGKIVFTDKDRYTDELGF 128 Query: 76 ST----VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 I IL G AI + + + + + V L S Sbjct: 129 QKENPRTQITNGDIILGIIGGSIGNVAIYDLSVPVIAQRSISIIDTYVSNIYCFYLLQST 188 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G + + + +P+PPL+EQ I +I I+ + ++ Sbjct: 189 IFQSLFLEKSIGNAQAGVYLGELDKLYIPLPPLSEQQRIVTEIKRWFALIEQIEFDKADL 248 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG---------------LVPDHWEVKP 236 +K+ K ++ + L P + IE + +P W Sbjct: 249 QTTIKQTKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTQLPKGWTTIK 308 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRF 295 + N + K + L I + + P++YE+ ++ G+++F + Sbjct: 309 VGDVAIYTNGRAFKPEDWMHEGLPIIRIQNLNDNSASYNRTPKTYESKYLIHNGDLLFAW 368 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLR 354 + + V P+ YL + ++ + GSG+ Sbjct: 369 AASLGTYI-----WNGGKAWLNQHIFKVDPYPFIEKQYLYHVFKAMITEFYTQSHGSGMV 423 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + + + +L+PP++EQ I + + ++DV++ Sbjct: 424 -HITKKQFENIKLLLPPLEEQKRIVQTLEQISTKLDVIM 461 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 66/199 (33%), Gaps = 7/199 (3%) Query: 227 LVPDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 VP W F +T+ + + +S NI + + Sbjct: 69 EVPKGWVWTTFGNVCKKLTDGSHNPPPKCSNGYTVISAQNIKNGKIVFTDKDRYTDELGF 128 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERG---IITSAYMAVKPHGIDSTYLAWLMRSY 340 Q +P + + + +I +++ + + Y +L++S Sbjct: 129 QKENPRTQITNGDIILGIIGGSIGNVAIYDLSVPVIAQRSISIIDTYVSNIYCFYLLQST 188 Query: 341 DLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +F G + + ++ +L + +PP+ EQ I I A I+ + Sbjct: 189 IFQSLFLEKSIGNAQAGVYLGELDKLYIPLPPLSEQQRIVTEIKRWFALIEQIEFDKADL 248 Query: 400 IVLLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 249 QTTIKQTKSKILDLAIHGK 267 >gi|255102189|ref|ZP_05331166.1| restriction modification system DNA specificity domain protein [Clostridium difficile QCD-63q42] Length = 394 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 66/399 (16%), Positives = 145/399 (36%), Gaps = 22/399 (5%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 I ++ + + Y+ +++ L + G++++ Sbjct: 6 KILDVVSISGENVKKFDGERSYLSTGNLDFNKISNLEI-VTYENKPSRANQTVNIGEVIF 64 Query: 89 GKLGPYLRKAIIA--DFDGICSTQFLVLQP-KDVLPELLQGWLLSIDVTQRIEAICEGAT 145 K+ + +I + + I ST F VL+P K++LP+ L +L S + + +GAT Sbjct: 65 AKMKDTKKTLVINKTNKNIIVSTGFYVLKPSKEILPQYLYHYLNSSYFLNQKNRLSKGAT 124 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 S + +G+ NI + + L Q + + ID + EL + S Sbjct: 125 QSALNNEGLANIKIRMYNLKVQEKVVRVLDKAQELIDKRKEQIEVLDEL-------VKSR 177 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + P K+ I +G D E R N L+++ +L Sbjct: 178 FIEMFGTPSKNEKNWEISEIGKYLDVLTDYH-SNGSYETLRDNVTLLDTKGYALMVRTTD 236 Query: 266 QKLETRNMGLKPESYETYQIVDP-----GEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + G+K Y ++ GE++ I + + Sbjct: 237 LENNNFEKGVKYIDEHAYNYLEKSKVFGGEVIINKIGSAGKVYLMPFLNKPVSLAMNQFM 296 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++ +L L+ + + G +++ + V+++ ++VPPI+ Q Sbjct: 297 LRFNEDKVNHIFLYNLLLTSYMESKIKEKVRGAVTKTITKDAVRKINIIVPPIRLQNQFA 356 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 N + +++ L ++E S+ L++ +S + A G+ Sbjct: 357 NFV----KQVNSLKFEMETSLKELEDNFNSLMQKAFKGE 391 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 25/208 (12%), Positives = 66/208 (31%), Gaps = 22/208 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIY--------------IGLEDVESGTGKYLPKDG 68 K+W++ I ++ + T S + + + D+E+ + K Sbjct: 190 KNWEISEIGKYLDVLTDYHSNGSYETLRDNVTLLDTKGYALMVRTTDLENNNFEKGVKYI 249 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + + S G+++ K+G + ++ + S + ++ +L Sbjct: 250 DEHAYNYLEKSKVFGGEVIINKIGSAGKVYLMPFLNKPVSLAMNQFMLRFNEDKVNHIFL 309 Query: 129 LSIDVTQRIEAICE----GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 ++ +T +E+ + GA + I + +PP+ Q + + Sbjct: 310 YNLLLTSYMESKIKEKVRGAVTKTITKDAVRKINIIVPPIRLQNQFANFVKQVNSLKFEM 369 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLN 212 T + +L+ L Sbjct: 370 ETSLKELEDNFN----SLMQKAFKGELF 393 >gi|225352844|ref|ZP_03743867.1| hypothetical protein BIFPSEUDO_04478 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156322|gb|EEG69891.1| hypothetical protein BIFPSEUDO_04478 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 399 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 80/396 (20%), Positives = 146/396 (36%), Gaps = 31/396 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + GR + + + G Y + + +G Sbjct: 25 WEQRKLGEVAHFINGRAYSQNELLSSGKYPVLRVGNF-YTNDSWYYSNLELEDKNYAYEG 83 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +LY I I +Q + L E L + L +RI + G+ Sbjct: 84 DLLYTWS-ATFGPHIWHGNKVIYHYHIWKVQLEAAL-EKLFAFQLLERDKERILSDKNGS 141 Query: 145 TMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 TM H GI N + +P + EQ I IT R + L K++++ Sbjct: 142 TMVHITKTGIENTSVLMPCSVEEQRRIGAFFDRLDSL----ITLHQRKYDKLCVLKKSML 197 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + KG + +++ +G D WE + L E + + + + ILS+S N Sbjct: 198 DKMFPKGGSLYPEIRFAG------FTDPWEQRKLGELFEESDERAS---DREILSVSVAN 248 Query: 264 IIQKLETRNMGLKP-ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 I + P S Y+IV G++V+ + + GI++ AY+ Sbjct: 249 GIYPASESDRETNPGASLANYKIVHFGDVVYNSMRMWQGAVDASRYD----GIVSPAYVV 304 Query: 323 VKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVP-PIKEQFD 377 +P+ + + + A L+R L K + + G Q LKF+D + + +P EQ Sbjct: 305 ARPNSEVYARFFARLLRQPMLLKQYQQVSQGNSKDTQVLKFDDFASIGISMPASENEQRQ 364 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + R+D L+ ++ + LL+ + S + Sbjct: 365 IGGFFD----RLDSLITLHQRKLELLRNIKKSMLDK 396 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 63/187 (33%), Gaps = 12/187 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + + R S ++I+ + + + + + + + I G Sbjct: 220 WEQRKLGELFEESDERA--SDREILSVSVANGIYPASE--SDRETNPGASLANYKIVHFG 275 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG---WLLSIDVTQRIEAIC 141 ++Y + + + +DGI S ++V +P + + + + Sbjct: 276 DVVYNSMRMWQGAVDASRYDGIVSPAYVVARPNSEVYARFFARLLRQPMLLKQYQQVSQG 335 Query: 142 EGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + +I + +P EQ I IT R +ELL+ K+ Sbjct: 336 NSKDTQVLKFDDFASIGISMPASENEQRQIGGFFDRLDSL----ITLHQRKLELLRNIKK 391 Query: 201 ALVSYIV 207 +++ + Sbjct: 392 SMLDKMF 398 >gi|15611793|ref|NP_223444.1| putative type I restriction enzyme (specificity subunit) [Helicobacter pylori J99] gi|4155286|gb|AAD06303.1| putative TYPE I RESTRICTION ENZYME (SPECIFICITY SUBUNIT) [Helicobacter pylori J99] Length = 454 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 63/430 (14%), Positives = 132/430 (30%), Gaps = 39/430 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK + + + G +S K + Y+ +V + L + + D Sbjct: 13 PKGVEFRKLGDIGEFYGGLVGKSKKSFSQGNKFYVPYINVFNNPQLDLNALESVQIGDKE 72 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD----------FDGICSTQFLVLQPKDVLPELLQG 126 + G +L+ L ++ + F P L+ Sbjct: 73 KQNTIQLGDVLFTGSSENLEDCAMSCVVTQKIEEDIYLNSFCFGFRFFDKNLFNPSFLKH 132 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L + + I + G T + + + I +PIPPL Q I + A T L T Sbjct: 133 FLRDYNFRKNISKVANGVTRFNVSKQLLLKITIPIPPLEIQQEIVTILDAFTELNTELNT 192 Query: 187 ERIRFIELLKEKKQALVSYIVTKG------------LNPDVKMKDSGIEWVGLVPDHWEV 234 E + K++ Q + ++ L K L P E Sbjct: 193 ELNTELNARKKQYQYYQNMLLDFNDINQSRKDAKERLAQKPYPKRLKQLLHTLAPKGVEF 252 Query: 235 KPFFALVTELNRK---NTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVD 287 + + + L + + YG I + ++ + + + Sbjct: 253 RKLGDIGEFTRGNGLLKSDLQDKGRPVVHYGQIHTQYNLSIDKTISYVNDALFHKLKKAK 312 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 P +I+ A + + S M + ++ + ++Y K Sbjct: 313 PNDILIATTSENVKDVGKSIAWLGNEEVAFSGEMYSYSTNENPKFIIYYFQTYFFQKEKE 372 Query: 348 AMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE- 405 +G + D+K++ + +PP++ Q +I +++ +A L I I K+ Sbjct: 373 KKITGTKVMRIHENDLKQITIPIPPLEIQQEIVTILDQFSALTTDLQAGIPAEIKARKKQ 432 Query: 406 ---RRSSFIA 412 R + Sbjct: 433 YEYYREKLLT 442 >gi|75907381|ref|YP_321677.1| restriction modification system DNA specificity subunit [Anabaena variabilis ATCC 29413] gi|75701106|gb|ABA20782.1| Restriction modification system DNA specificity domain protein [Anabaena variabilis ATCC 29413] Length = 454 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 59/439 (13%), Positives = 136/439 (30%), Gaps = 39/439 (8%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESG 59 K YK+ G +P+ WK+V + + G +S + I + ++ Sbjct: 21 KTDESYKNRD---FGFVPESWKIVKFENILSIFNGYAFKSTDAVDSSNTQLIRMGNLYQN 77 Query: 60 TGKYLPKDGNSRQSDTSTV--SIFAKGQILYGKLGPYLR-------KAIIADFDGICSTQ 110 + +G ++ G + K D + + + + Sbjct: 78 KLDLERSPVFYPDYYAQKYSKYLLKEGDLIISLTGTSEKEDYGFTVKINRTDKNLLLNQR 137 Query: 111 FLVLQPKDVLPELLQGWLLSID--VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 + + + +G ++ I + + IPPL EQ Sbjct: 138 VARIDVISADINHDYIFYFLRSRIFLTPLYLTAKGMKQANLSTNTIKTLNVLIPPLEEQ- 196 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 KI + IT++ + I L E K+ L+ + T+G + + +G + Sbjct: 197 ---RKIAWILSLVQDAITQQEQIISLTTELKKVLMQKLFTEGTRGEPQKMT----EIGFI 249 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN-----MGLKPESYETY 283 P WEV F ++ + + + L G+ + + + Sbjct: 250 PKSWEVIRFADAISVTSGQVNPKEKPYSEMLHVGSENIESNSGRLLCLQTNQELNISSGN 309 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + +I++ I +K +L + + K + +L + S Sbjct: 310 YYFNNDDILYSKIRPYLNKVALPDFE--GTCSADMYPIRSKNGCFNRNFLFHFLLSDIFR 367 Query: 344 KVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + + ++ P + EQ +I+ ++ D + Q++ Sbjct: 368 NQAISFQDRTGIPKINRAQLGSTLLIRPSLLEQNEISYALD----LCDKRINTAYQNLST 423 Query: 403 LKERRSSFIAAAVTGQIDL 421 K+ + +T QI + Sbjct: 424 SKDLFRILLHQLMTAQIRV 442 >gi|255525762|ref|ZP_05392693.1| restriction modification system DNA specificity domain protein [Clostridium carboxidivorans P7] gi|296188046|ref|ZP_06856438.1| type I restriction modification DNA specificity domain protein [Clostridium carboxidivorans P7] gi|255510585|gb|EET86894.1| restriction modification system DNA specificity domain protein [Clostridium carboxidivorans P7] gi|296047172|gb|EFG86614.1| type I restriction modification DNA specificity domain protein [Clostridium carboxidivorans P7] Length = 402 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 53/396 (13%), Positives = 132/396 (33%), Gaps = 22/396 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + G++ + + G R + G Sbjct: 19 WEQRKLGDVVPITMGQSPDGSTYSDTPSDYILVQGNADLKNGWVTPRVWTSQVTKKAEAG 78 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ P + ++ + ++ + + L+ ++ + + G+ Sbjct: 79 DLIMSVRAP-AGEIGKTAYNAVIGRGVAAIKGNEFI----FQSLVKMNGEGYWKKLSCGS 133 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T + I N + IP L EQ I +D LIT R + L++KK++L+ Sbjct: 134 TFESLNSDNIKNAKIMIPNLDEQAQIGVF----FKNLDNLITLHQRKLIDLQDKKKSLLQ 189 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + K +++ G ++ + + ++ I + G++ Sbjct: 190 KMFPKNGEDFPELRFPGFTDPWEQRKLEDIADVIDP--HPSHRAPEVKTVGIPFIGIGDV 247 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS-----AQVMERGIITSA 319 + + + Y + SL + + + + Sbjct: 248 DEVGNINYGTARIVDEKIYDEHHKRYDLANTSIGIGRVASLGKVIRLRNDIGKYAVSPTM 307 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP-PIKEQFD 377 + I+ Y+ M + + F + +G RQS+ +D+++L + +P I EQ Sbjct: 308 SIIQFHSDIEINYVYSCMNTPLFQQQFTSQSNGSTRQSVGIQDLRKLILNIPLDIGEQKL 367 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + +ID L+ ++ + L+E++ + + Sbjct: 368 IGD----LFWQIDHLITLHQRKLNHLQEQKKALLQQ 399 >gi|332285464|ref|YP_004417375.1| hypothetical protein PT7_2211 [Pusillimonas sp. T7-7] gi|330429417|gb|AEC20751.1| hypothetical protein PT7_2211 [Pusillimonas sp. T7-7] Length = 439 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 65/438 (14%), Positives = 132/438 (30%), Gaps = 39/438 (8%) Query: 23 KHWKVVPIKRFT-KLNTGRTSESGKD-------IIYIGLEDVES-GTGKYLPKDGNSRQS 73 W + K+ +G T + GKD I ++V + G Q+ Sbjct: 3 SEWVSKRLGDCCLKIGSGATPKGGKDAYLENGPFKLIRSQNVYNDGFSPNGLTYIGEEQA 62 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQ--FLVLQPKDVLPELLQGWL 128 A G +L G + + + + P L+ +L Sbjct: 63 RKLDGVAVAAGDVLLNITGDSVARVCQAPEQHMPARVNQHVAIIRPNPSLFDARYLRYFL 122 Query: 129 LSIDVTQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S + + GAT + I + +P PPL+ Q I + + RI L Sbjct: 123 ASPSQQSLMLGLAAAGATRNALTKGMIEDFIVPCPPLSVQQEIANVLGSLDDRITLLRET 182 Query: 188 RIRFIELLKEKKQALVS-----YIVTKGLNPDVK-------MKDSGIE-WVGLVPDHWEV 234 + + ++ +G P+ DS E + L+P W + Sbjct: 183 NKTLESIAQAIFKSWFVNFDPVRAKMEGRQPEGMDEATAALFSDSFEESELSLIPRGWSL 242 Query: 235 KPFFALVTELNRKNTKLIES-----NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 L + K + ++ ++ ++ + S + Sbjct: 243 GHISDLGGVICGKTPPTSDMSNYGNDVPFITIPDMH-GCLVITETARRLSTQGADNQKKK 301 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-VFYA 348 + + + AQV E +V PH T + L+ Sbjct: 302 YLPVGSVSVSCIATPGLVAQVTEPSQTNQQINSVIPHEHWGTAFSLLLLRGVGNDVRIAG 361 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G + +L + +++ +L+P Q I + A + + ++ + L E R Sbjct: 362 SGGSVFHNLNKSNFEKIKILLP----QETIAQEFDRLIAPFIKQITENQRQVQTLIELRD 417 Query: 409 SFIAAAVTGQIDLRGESQ 426 + V+G++ L + Sbjct: 418 VLLPKLVSGRLRLPNGEE 435 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 62/198 (31%), Gaps = 12/198 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESG-TGKYLPKDGNSRQS 73 IP+ W + I + G+T + G D+ +I + D+ + +++ + Sbjct: 236 IPRGWSLGHISDLGGVICGKTPPTSDMSNYGNDVPFITIPDMHGCLVITETARRLSTQGA 295 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 D G + + + Q + P + LL Sbjct: 296 DNQKKKYLPVGSVSVSCI-ATPGLVAQVTEPSQTNQQINSVIPHEHWGTAFSLLLLRGVG 354 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 A G+ + + I + + Q I ++ ITE R ++ Sbjct: 355 NDVRIAGSGGSVFHNLNKSNFEKIKILL----PQETIAQEFDRLIAPFIKQITENQRQVQ 410 Query: 194 LLKEKKQALVSYIVTKGL 211 L E + L+ +V+ L Sbjct: 411 TLIELRDVLLPKLVSGRL 428 >gi|225619381|ref|YP_002720607.1| restriction modification system DNA specificity domain-containing protein [Brachyspira hyodysenteriae WA1] gi|225214200|gb|ACN82934.1| restriction modification system DNA specificity domain protein [Brachyspira hyodysenteriae WA1] Length = 523 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 62/423 (14%), Positives = 142/423 (33%), Gaps = 32/423 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 ++W+ V + ++N G + K + ++ + D S +Y+ Sbjct: 110 ENWQEVRLGDICQINRGASPRPIQKYIADKGMPWVKISDATSSNSRYIKTTKEFIDFSGV 169 Query: 77 TVSI-FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S+ G ++ G I + ++++ + + + + Sbjct: 170 SKSVKIDVGTLILSNSGTT-GIPKIMGIEACVHDGWIIISNINKNVLKEFLYYEFLYIRN 228 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I + G + + + + +PPL EQ I + +D I ++L Sbjct: 229 SISNLATGTVLQNLKTDIVKQFKINLPPLEEQKKIASIL----SSLDDKIELNNCMNKIL 284 Query: 196 KEKKQALVSYIVTKGLNPD---VKMKDSG----IEWVGLVPDHWEVKPFFALVTELNRKN 248 +E Q + P+ K SG +G +PD WEV + T + + Sbjct: 285 EETAQTIFKEWFINFNFPNEEGKPYKKSGGKMIESELGEIPDGWEVTTLENISTIITKGT 344 Query: 249 TKLIE--SNILSLSYGNIIQKLETRNMGLKPESYETYQ------IVDPGEIVFRFIDLQN 300 T I + NI+ L ET+ I+ +I+F Sbjct: 345 TPKKFTLQGINYIKVENILDNHSIDKSKLSFIDSETHNNLLKRSIIKEKDILFSIAGTLA 404 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKF 359 + + + A + V + I+ ++ + + F + ++ +L Sbjct: 405 KFAFVTNNILPANTNQAIAIIRVDSNIINPLFVFNFFLADLHKEHCFKNLQQSVQPNLSL 464 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ++ L ++ P K + I +I +E+ ++ L R S + ++G+I Sbjct: 465 TTIRNLKLIFPESKILKKYEDSILHIFYKIYRNIEENQK----LAGIRDSILPKLMSGEI 520 Query: 420 DLR 422 ++ Sbjct: 521 RIK 523 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 72/209 (34%), Gaps = 14/209 (6%) Query: 10 YKDSG---VQW-IGAIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGK 62 YK SG ++ +G IP W+V ++ + + T T+ + I YI +E++ Sbjct: 309 YKKSGGKMIESELGEIPDGWEVTTLENISTIITKGTTPKKFTLQGINYIKVENILDNHSI 368 Query: 63 YLPKDGNSRQSDTS---TVSIFAKGQILYGKLGPYLRKAIIADFDGICST----QFLVLQ 115 K + SI + IL+ G + A + + +T + + Sbjct: 369 DKSKLSFIDSETHNNLLKRSIIKEKDILFSIAGTLAKFAFVTNNILPANTNQAIAIIRVD 428 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + P + + L+ + + + + I N+ + P + I+ Sbjct: 429 SNIINPLFVFNFFLADLHKEHCFKNLQQSVQPNLSLTTIRNLKLIFPESKILKKYEDSIL 488 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVS 204 +I I E + + L+S Sbjct: 489 HIFYKIYRNIEENQKLAGIRDSILPKLMS 517 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 62/183 (33%), Gaps = 5/183 (2%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII----QKL 268 ++ G + + ++W+ + + + I+ I + Sbjct: 93 KEIIYNTEGAQEIIGNKENWQEVRLGDICQINRGASPRPIQKYIADKGMPWVKISDATSS 152 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 +R + E + + +I + L N + + + ++ + Sbjct: 153 NSRYIKTTKEFIDFSGVSKSVKIDVGTLILSNSGTTGIPKIMGIEACVHDGWIIISNINK 212 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + + +G Q+LK + VK+ + +PP++EQ I ++++ Sbjct: 213 NVLKEFLYYEFLYIRNSISNLATGTVLQNLKTDIVKQFKINLPPLEEQKKIASILSSLDD 272 Query: 388 RID 390 +I+ Sbjct: 273 KIE 275 >gi|238760354|ref|ZP_04621495.1| type I restriction enzyme, S subunit [Yersinia aldovae ATCC 35236] gi|238701414|gb|EEP93990.1| type I restriction enzyme, S subunit [Yersinia aldovae ATCC 35236] Length = 416 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 58/418 (13%), Positives = 134/418 (32%), Gaps = 34/418 (8%) Query: 20 AIPK--------HWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDVESGTGKYLPKD 67 +P+ W + + + + + + + DV S Sbjct: 6 KVPEIRFKGFGGEWVENNLGELIDIRSAARVHKEQWTEAGVPFFRTSDVVSIYKGQENTK 65 Query: 68 GNSRQSDTSTVS----IFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVL 120 + +S K +L G ++ + + + K Sbjct: 66 AYISHEVYNDLSEKIGKVTKDDLLITGGGSIGIPYLVPNDDPLYFKDADLLWLKNNKKFN 125 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 L + S + I+ I T++H + P+ EQ I Sbjct: 126 GYFLYTFFFSAPFKKHIKGISHTGTIAHYTIEQAKATPINTCYDEEQTQIGNYFQKLDAL 185 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 I+ + + L+ K++++ + K +++ G + G + + Sbjct: 186 INQH----QQKHDKLRNIKKSMLEKMFPKQGETIPEIRFKG--FNGEWEEAKLGEIGDTF 239 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQ 299 + ++Y N+ + ++P +T Q V G++ F Sbjct: 240 TGLSGKTKDDFGHGQGRFVTYLNVFSNAISNENSVEPIEIDTNQNEVKKGDVFFTTSSET 299 Query: 300 NDKRSLRSAQVME--RGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL-RQ 355 ++ + S + E + S +P DS YLA+++RS + + G+ R Sbjct: 300 PEEVGMSSIWMSEIKNVYLNSFCFGYRPKQQFDSYYLAYMLRSNSFREKIVFLAQGISRY 359 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ V + V +P + EQ + N ++D L+ + +Q I L + + ++ Sbjct: 360 NISKTKVMDIKVSIPCLSEQEKVGNY----FQKLDALINQHQQQITKLNNIKQACLSK 413 >gi|148266053|ref|YP_001232759.1| restriction modification system DNA specificity subunit [Geobacter uraniireducens Rf4] gi|146399553|gb|ABQ28186.1| restriction modification system DNA specificity domain [Geobacter uraniireducens Rf4] Length = 428 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 59/430 (13%), Positives = 135/430 (31%), Gaps = 40/430 (9%) Query: 23 KHWKVVPIKRFTK-LNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W VP + K + G T + +I +I D + + Sbjct: 2 SEWSTVPFGQIAKKIVNGGTPSTDIDRYWNGNIPWITGADFTPSGIGEFRRFVSEEAVRQ 61 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S ++ +GQ+L + K IA D S + D + + Sbjct: 62 SATNVIQQGQLLLVTR-TGVGKIAIAPCDIAISQDITGVYVDDNQVATSFLFHRMRQGVE 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ + +G +++ + + +P L +Q I E + +D I + I + Sbjct: 121 DLKKLNQGTSINGIIRSDLVAYLVELPALPQQRRIAEIL----STLDETIEQTEVLIAKM 176 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIE--------WVGLVPDHWEVKPFFALVTELNR- 246 ++ K L+ + T+G+ PD ++ + +G +P WEV+ ++ + Sbjct: 177 QQVKAGLMHDLFTRGVTPDGHLRPTREHAPGLYKESPLGWIPKEWEVERLGNILRKCGGY 236 Query: 247 ----------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET----YQIVDPGEIV 292 + + + +I L + + G++V Sbjct: 237 LQTGPFGSQLHAHEYQAEGVPVVMPQDINNGLIGTENIARIHEARANDLARHRMSLGDMV 296 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-S 351 + ++R ++ + + + + + A + R + + Sbjct: 297 IARRGDLSRAAAIRESEQGWVCGTGCFLLRLGQSALTADFAAQVYRQDFVQRQIVGRAVG 356 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 SL ++ L + EQ I + I L QS+ L + + Sbjct: 357 TTMPSLNNSVMEGLFFPFCDLDEQVRIVERLEWMEMNICAL--NESQSVNRL--IKRGLM 412 Query: 412 AAAVTGQIDL 421 +TG + + Sbjct: 413 HDLMTGNVQV 422 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 78/217 (35%), Gaps = 22/217 (10%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK-----LNTGRTSE-------SGKDIIYIGLEDVE 57 YK+S +G IPK W+V + + L TG + + + +D+ Sbjct: 209 YKESP---LGWIPKEWEVERLGNILRKCGGYLQTGPFGSQLHAHEYQAEGVPVVMPQDIN 265 Query: 58 SGTG--KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFL- 112 +G + + + +R +D + G ++ + G R A I + + +C T Sbjct: 266 NGLIGTENIARIHEARANDL-ARHRMSLGDMVIARRGDLSRAAAIRESEQGWVCGTGCFL 324 Query: 113 -VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 L + + V ++I G TM + + + P L EQV I Sbjct: 325 LRLGQSALTADFAAQVYRQDFVQRQIVGRAVGTTMPSLNNSVMEGLFFPFCDLDEQVRIV 384 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 E++ + I L + + + L++ V Sbjct: 385 ERLEWMEMNICALNESQSVNRLIKRGLMHDLMTGNVQ 421 >gi|299148892|ref|ZP_07041954.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_23] gi|298513653|gb|EFI37540.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_23] Length = 418 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 55/403 (13%), Positives = 115/403 (28%), Gaps = 28/403 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W+ VP+ LN + +I + V G + + Sbjct: 18 EVPEGWQSVPVSELFCLNPKSEITDATSVGFIPMACVNDGFSGNHQFEERIWKEVKKGYC 77 Query: 80 IFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 F G I K+ P + + G +T+ ++L+P ++ + S Sbjct: 78 HFQNGDIGIAKISPCFENLKSTIFQNLPNNYGAGTTELVILRPLNIHAKFYLYLFKSQWY 137 Query: 134 TQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +G ++ +P+PPLAEQ I +I ID + + Sbjct: 138 ISEGTKYFKGVVGQQRVHKGIFTDLQIPLPPLAEQYRIVAEIEKWFALIDQIEQGKTGLQ 197 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV------------GLVPDHWEVKPFFAL 240 ++ + K ++ + L P + E + G P W L Sbjct: 198 TIVMQTKSKILDLAIHGKLVPQDPNDEPAFELLKRINPDFTPCDNGHYPIGWLETILGEL 257 Query: 241 VTELNRK------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 K +++ + + + E V G+++ Sbjct: 258 FNHNTGKALNSSNKEGVMKDYLTTSNVYWNKFDFTVIKQMPFKEIELDKCTVTKGDLLVC 317 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 I + ++P + +Y Sbjct: 318 EGGDIGRSAIW---NYDYDICIQNHIHRLRPKIDLCVPFYYYTLAYLKENNLIGGKGIGL 374 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 L + ++ + +PP+ EQ I I + +D + +E Sbjct: 375 LGLSSNALHKIEMPLPPLTEQQRIVQKIEELFSVLDNIQNALE 417 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 4/169 (2%) Query: 19 GAIPKHWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 G P W + NTG+ +++ G Y+ +V + + Sbjct: 243 GHYPIGWLETILGELFNHNTGKALNSSNKEGVMKDYLTTSNVYWNKFDFTVIKQMPFKEI 302 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 KG +L + G R AI IC + + + + + Sbjct: 303 ELDKCTVTKGDLLVCEGGDIGRSAIWNYDYDICIQNHIHRLRPKIDLCVPFYYYTLAYLK 362 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + +G + + I MP+PPL EQ I +KI +D Sbjct: 363 ENNLIGGKGIGLLGLSSNALHKIEMPLPPLTEQQRIVQKIEELFSVLDN 411 >gi|114778591|ref|ZP_01453418.1| type I restriction-modification system, S subunit [Mariprofundus ferrooxydans PV-1] gi|114551180|gb|EAU53740.1| type I restriction-modification system, S subunit [Mariprofundus ferrooxydans PV-1] Length = 312 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 48/305 (15%), Positives = 113/305 (37%), Gaps = 16/305 (5%) Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + L + V +E + G+T + I + +PPL EQ I + + Sbjct: 17 YNEYLYQLILFVRPELEKMSAGSTFQEISSTNVKAIKLLLPPLPEQKKIASIL----TSV 72 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 D +I ++ I L++ K+A++ ++TKG+ + KDS + + + + + L Sbjct: 73 DEVIEKQEAQISKLQDLKKAMMQELLTKGIG-HTEFKDSPVGMIPKGWEVVRLGKYVKLQ 131 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGL---KPESYETYQIVDPGEIVFRFIDL 298 K+ + + + NI + + + V + + Sbjct: 132 GGYAFKSENFTDKGVPVVRISNISKSGDVDLSNAAFHDEINISEAFEVSHSDSLIAMSGA 191 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 K E+ + G+ S + S K+ G + + Sbjct: 192 TTGKVG--RYNFREKAYLNQRVGKFVSKGMVEMSYIHHVVSSSSFTEKLLIDAIGGAQPN 249 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + ++ + + PP+ EQ +I+++++ ID V + ++ +K + S + +T Sbjct: 250 ISGGQIEGVEIAFPPLDEQKNISSILDS----IDNAVGAKQLKLMHIKSLKKSLMQDLLT 305 Query: 417 GQIDL 421 G++ + Sbjct: 306 GKVRV 310 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 42/103 (40%), Gaps = 4/103 (3%) Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + ++K + + + ++ Q + +VK + +L+PP+ Sbjct: 1 MTTNQGFQSLKCREKVYNEYLYQLILFVRPELEKMSAGSTFQEISSTNVKAIKLLLPPLP 60 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 EQ I +++ D ++EK E I L++ + + + +T Sbjct: 61 EQKKIASILTSV----DEVIEKQEAQISKLQDLKKAMMQELLT 99 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 73/207 (35%), Gaps = 10/207 (4%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYL 64 ++KDS V G IPK W+VV + ++ KL G +S K + + + ++ L Sbjct: 106 EFKDSPV---GMIPKGWEVVRLGKYVKLQGGYAFKSENFTDKGVPVVRISNISKSGDVDL 162 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVL-P 121 + + S + L G K +F + + K ++ Sbjct: 163 SNAAFHDEINISEAFEVSHSDSLIAMSGATTGKVGRYNFREKAYLNQRVGKFVSKGMVEM 222 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + S T+++ G + I + + PPL EQ I + + + Sbjct: 223 SYIHHVVSSSSFTEKLLIDAIGGAQPNISGGQIEGVEIAFPPLDEQKNISSILDSIDNAV 282 Query: 182 DTLITERIRFIELLKEKKQALVSYIVT 208 + + L K Q L++ V Sbjct: 283 GAKQLKLMHIKSLKKSLMQDLLTGKVR 309 >gi|325958865|ref|YP_004290331.1| restriction modification system DNA specificity domain-containing protein [Methanobacterium sp. AL-21] gi|325330297|gb|ADZ09359.1| restriction modification system DNA specificity domain protein [Methanobacterium sp. AL-21] Length = 403 Score = 114 bits (284), Expect = 4e-23, Method: Composition-based stats. Identities = 63/413 (15%), Positives = 137/413 (33%), Gaps = 30/413 (7%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQS-- 73 G +PK WK+ + + G + + ++V +G + + S+ Sbjct: 5 GDLPKVWKIKKLTEICDVRDGTHDSPKYKNEGYPLVTSKNVATGFIDFSDVNLISKDDYD 64 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + + S G IL +G I+ I + + DV + ++ +L SI Sbjct: 65 NINKRSYVDDGDILMPMIGTIGNPIIVKKDRKFAIKNVALIKFTKTDVSNKYVKLFLESI 124 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 I+ I G T K I N P+ +PP+ +Q + + + + + Sbjct: 125 HFKHYIKKINRGGTQKFISLKDIRNFPVILPPIEKQNKLIKILEKAEKIKEWRVEADKLT 184 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 E LK + + + +S ++ ++ + + Sbjct: 185 DEYLKSVFLEIYNSASHHPDLKADYLSES-------------LRDVKNGLSRRRKISENK 231 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND--KRSLRSAQ 309 + + L N E + V+ +++F ++ D R + Sbjct: 232 GDIVLRLKDIRENKIDLTELNRIPLNELEKEKYGVERNDLLFIRVNGNKDYVGRCAVFRE 291 Query: 310 VMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRL 365 E M + + + +LA+L+ S K S + ++ + ++RL Sbjct: 292 FNENIYFNDHIMRVKIDSNQFNPIFLAFLINSEYGKKQLKNHLRTSAGQYTINQKGLERL 351 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P I Q + +I++L + S + +KE + + A G+ Sbjct: 352 KFYQPDISLQNSFVD----LFNKIEILKKDQFNSEIKIKELFDTLMQKAFKGE 400 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 52/185 (28%), Gaps = 13/185 (7%) Query: 36 LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY----GKL 91 L+ R K I + L+D+ + + +L+ G Sbjct: 221 LSRRRKISENKGDIVLRLKDIRENKIDLTELNRIPLNELEKEKYGVERNDLLFIRVNGNK 280 Query: 92 GPYLRKAIIADFDGICSTQFLVLQPKDVLPELL-----QGWLLSIDVTQRIEAICEGATM 146 R A+ +F+ +++ K + Q + A Sbjct: 281 DYVGRCAVFREFNENIYFNDHIMRVKIDSNQFNPIFLAFLINSEYGKKQLKNHLRTSAGQ 340 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 + KG+ + P ++ Q + +I+ L ++ +KE L+ Sbjct: 341 YTINQKGLERLKFYQPDISLQNSFVD----LFNKIEILKKDQFNSEIKIKELFDTLMQKA 396 Query: 207 VTKGL 211 L Sbjct: 397 FKGEL 401 >gi|319901496|ref|YP_004161224.1| restriction modification system DNA specificity domain protein [Bacteroides helcogenes P 36-108] gi|319416527|gb|ADV43638.1| restriction modification system DNA specificity domain protein [Bacteroides helcogenes P 36-108] Length = 409 Score = 114 bits (284), Expect = 4e-23, Method: Composition-based stats. Identities = 61/430 (14%), Positives = 132/430 (30%), Gaps = 40/430 (9%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLN-----TGRTSES---GKDIIYIGLEDVESGT 60 ++K + IG IP+ W+V + + + G T+ + I D+ G+ Sbjct: 5 KFKQTE---IGLIPEDWEVFSVGKDCIVKARIGWQGLTTSEYLETGEYALITSTDIIDGS 61 Query: 61 GKYLPKDGNSRQSDTSTVSI-FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + S+ V I + IL K G + I+ + + V + Sbjct: 62 IDWKTCYFVSKFRYEQDVKIQVQENDILISKDGTIGKVGIVRNQPFPATLNSGVFVIRAK 121 Query: 120 LPELLQGW---LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKII 175 + +G+ +S + I + G+T+ H K I + P+P EQ I + Sbjct: 122 NDKKQKGFSLAFVSDYFREFINRLTAGSTIVHLYQKDIVHFKFPLPIDTYEQQRIATALS 181 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 I L + + + + Q L++ G + K +G + + Sbjct: 182 DIDALISALNKKIEKKKLIKQGAMQQLLTGQKRLTGFSEPWVEKR-----LGEIGNLSMC 236 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 K F ++ G Q+ + K E ++ + Sbjct: 237 KRIFQE--------ETSESGDVPFFKIGTFGQQADAYISSTKYEKFKQMYRFPVKGSILI 288 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + + ++A I + L + + + Sbjct: 289 SAAGTIGRTVVYDGEPAYFQDSNIVWLAHDEETILNAVLYHVYHIVEW-----NTENTTI 343 Query: 355 QSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 L ++ + +P + EQ I +++ + K E + + + Sbjct: 344 ARLYNDNFNNTVINIPVSLSEQAAIAEILSDMDKE----IAKWEVKRTKCECIKQGMMQQ 399 Query: 414 AVTGQIDLRG 423 +TG+I L Sbjct: 400 LLTGKIRLTD 409 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 47/166 (28%), Gaps = 7/166 (4%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 I + + V +I+ K + Q + + ++ Sbjct: 60 GSIDWKTCYFVSKFRYEQDVKIQVQENDILISKDGTIG-KVGIVRNQPFPATLNSGVFVI 118 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITN 380 + + S + L +D+ +P EQ I Sbjct: 119 RAKNDKKQKGFSLAFVSDYFREFINRLTAGSTIVHLYQKDIVHFKFPLPIDTYEQQRIAT 178 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 ++ ID L+ + + I K + + +TGQ L G S+ Sbjct: 179 ALSD----IDALISALNKKIEKKKLIKQGAMQQLLTGQKRLTGFSE 220 >gi|15672634|ref|NP_266808.1| type I restriction enzyme specificity protein [Lactococcus lactis subsp. lactis Il1403] gi|12723557|gb|AAK04750.1|AE006298_3 type I restriction enzyme specificity protein [Lactococcus lactis subsp. lactis Il1403] Length = 407 Score = 114 bits (284), Expect = 4e-23, Method: Composition-based stats. Identities = 59/404 (14%), Positives = 124/404 (30%), Gaps = 31/404 (7%) Query: 24 HWKVVPIKRFTKLN---TGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ + + G++ + + + +V++G + Sbjct: 18 DWEERKLLDNVEKVLDYRGKSPAKFGMEWGTEGYLVLSALNVKNGYIDKSVEAKYGDHEL 77 Query: 75 TS---TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLL 129 + KG +++ P A + D +G Q V L Sbjct: 78 FDRWMGNNRLEKGDVVFTTEAPLGNVAQVPDNNGYILNQRAVAFKSLQETDDNFFAQLLR 137 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S V ++A G T K + +P E+ KI ++D I Sbjct: 138 SPIVQNTLKASSSGGTAKGIGMKEFAKLNARVPETHEEQ---RKIGLFFKQLDDTIVLHQ 194 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 R ++LLKE+K+ + + K + +++ E+ + + L +++ Sbjct: 195 RKLDLLKEQKKGYLQKMFPKNGSKIPELR--FAEFADDWEERKLGEVATFLNGRAYKQDE 252 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 L L GN VD G++V+ + Sbjct: 253 LLDSGKYKVLRVGNFYTNDSWY---YSNMELGDKYYVDKGDLVYTWSATFGPHI-----W 304 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 E+ I V+ + D ++ + + D++ V + Sbjct: 305 SGEKVIYHYHIWKVELSKFLDRNFTLQLLEADKARLLSSTNGSTMIHVTKGDMESKIVSI 364 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I EQ I + ++D + ++ + LLKE++ F+ Sbjct: 365 PNIDEQKQIGSF----FKQLDNTITLHQRKLDLLKEQKKGFLQK 404 >gi|282933737|ref|ZP_06339092.1| conserved hypothetical protein [Lactobacillus jensenii 208-1] gi|281302116|gb|EFA94363.1| conserved hypothetical protein [Lactobacillus jensenii 208-1] Length = 404 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 61/404 (15%), Positives = 138/404 (34%), Gaps = 27/404 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKY--LPKDGNSRQSDTSTV 78 WK V + + + G I +++E+GT + + + + Sbjct: 14 WKKVKLGQIADVRDGTHESPKYVSQNGYPLITSKNLENGTINFDDISYISKKDYEEINKR 73 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 S+ K IL+G +G AI+ L+ ++ L + S + Sbjct: 74 SLVEKNDILFGMIGTIGNVAIVKKSGFAIKNVALIKSNSEIPSINLIQIIQSDIFKKYTN 133 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G + I + +E +LI + + + +L + Sbjct: 134 RLNSGNSQKFISLGDIRKFDFKMASKSENMLISKLFKKVDTLLSLQQRKLELENQLKQFN 193 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q L S + L P V+ + W + V + RKN L + L+ Sbjct: 194 LQNLFSD--EQRLYPKVRFRGFDEPW--------KKVKLGRNVKRIRRKNKNLETNIPLT 243 Query: 259 LS-YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGII 316 +S ++ + + + E+ Y ++ GE + + ++ + G + Sbjct: 244 ISAQFGLVDQRDFFGRVVASENLANYILLKRGEFAYNKSYSKEAPYGSIKRLEKYNEGAL 303 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVPP 371 ++ Y+A P I+S +L + + + G R ++ +D + + +P Sbjct: 304 STLYIAFTPENINSDFLKAFFDTTKWYSHIVQVSTEGARNHGLLNISPQDFFEMSITIPK 363 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ +I+ + N+ + L+ +Q I ++ + + Sbjct: 364 SDEQNNISRIYNLM----NSLLSLQQQDINTTQQLKQFLLQNLF 403 >gi|224543617|ref|ZP_03684156.1| hypothetical protein CATMIT_02827 [Catenibacterium mitsuokai DSM 15897] gi|224523443|gb|EEF92548.1| hypothetical protein CATMIT_02827 [Catenibacterium mitsuokai DSM 15897] Length = 390 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 59/393 (15%), Positives = 129/393 (32%), Gaps = 28/393 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + V ++ + +G T K +I +I ++D++S + + S+ Sbjct: 2 EYVKLEEICTIVSGGTPSRSKPNYWNNGNIPWIKIKDMKSKYIDSAEEFITEEGLNNSST 61 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE-LLQGWLLSIDVTQRI 137 + + ILY + L + I D + L K+ + + + Sbjct: 62 KMLKRDTILYS-IFATLGEVGILKIDACTNQAIAGLSLKEDSNILKEYLYYYLKSKKKDV 120 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G ++ + + +P+ PL +Q I E + + + I R + LL E Sbjct: 121 NNLGRGVAQNNINLSLLRKFKIPVIPLRQQKKIIEVLDN----VSSTINNYERELALLDE 176 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +A + + + + I + A + K ++ Sbjct: 177 LVKARFVEMFGRPTDKITRYPKVKIGNLIKEGK----ASIKAGPFGSSLKKEFYVKKGFK 232 Query: 258 SLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 +I+ T E V G+I+ + +++ + E G Sbjct: 233 IYGQEQVIKNDPTFGDYYINEDRFNSLKSCEVHAGDILISLVGT--CGKTMIMPENFEPG 290 Query: 315 IITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPP 371 II +A I Y + S DL K+ + + +K + +++ P Sbjct: 291 IINPRLLKIAFDAEFIIPIYFKYFFGSDDLQKILNSASGHSTMNVVNAGMLKNVELIMAP 350 Query: 372 IKEQFDITNVINVET-ARIDVLVEKIEQSIVLL 403 I+ Q + + +R+ L+ + I LL Sbjct: 351 IELQNQFASFVEEVDKSRLRELLAI--KQIKLL 381 >gi|121609380|ref|YP_997187.1| restriction modification system DNA specificity subunit [Verminephrobacter eiseniae EF01-2] gi|121554020|gb|ABM58169.1| restriction modification system DNA specificity domain [Verminephrobacter eiseniae EF01-2] Length = 426 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 52/416 (12%), Positives = 123/416 (29%), Gaps = 32/416 (7%) Query: 22 PKHWKVVPIKR-FTKLNTGRTSES------GKDIIYIGLEDVES--GTGKYLPKDGNSRQ 72 P + P+ +K G T DI + + D+ + Sbjct: 13 PNGVEFKPLGECISKNLGGGTPSRSVASYWDGDIPWASVGDLSIPGNFIRTTRSLITKDG 72 Query: 73 SDTSTVSIFAKGQILYG-KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S ++ G ++ K+ P K D + L D + + Sbjct: 73 LKNSPSNVIRAGDVIVAVKISPGKMKIAATDI--AINQDLRGLTLHDFIDSSFLVY---- 126 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 Q I G + + + +P+PPL Q I + + T L E Sbjct: 127 -YFQTFSIIGNGTIVKGITTDTLERVKVPVPPLEVQREIVKVLDTFTELEAELEAELEAE 185 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF------ALVTELN 245 +E + + + + + K + + + + Sbjct: 186 LEARRRQYKYYRDALFSFDERMSGASKQASKQASKQASKQAISIRWMTLSEVGKFMRGRR 245 Query: 246 RKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 +E + + YG I ++PE + PG++V + + Sbjct: 246 FTKADYVEDGVGCIHYGEIYTHYGTSANEVISHVRPEMKSGLRFAKPGDVVVADVGETVE 305 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFE 360 A + + + H ++ ++++ M++ + + + L + Sbjct: 306 DVGKAVAWMGTDDVAIHDHCYAFRHSMNPKFVSYCMQTTSFISEKAKYVARTKVNTLLID 365 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLV----EKIEQSIVLLKERRSSFIA 412 ++ + VPP++EQ I +++ A + + +I+ + R + Sbjct: 366 GFSKIRIPVPPLEEQERIVAILDKFDALVSDISFGLPAEIKARRQQYEHYRDRLLT 421 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 56/167 (33%), Gaps = 12/167 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTK------LIESNILSLSYGNIIQKLETRNMGLKPESY 280 P+ E KP +++ T + +I S G++ + Sbjct: 11 HCPNGVEFKPLGECISKNLGGGTPSRSVASYWDGDIPWASVGDLSIPGNFIRTTRSLITK 70 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTYLAWLMRS 339 + + I + + + I IDS++L + ++ Sbjct: 71 DGLKNSPSNVIRAGDVIVAVKISPGKMKIAATDIAINQDLRGLTLHDFIDSSFLVYYFQT 130 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + + + + + + ++R+ V VPP++ Q +I V++ T Sbjct: 131 FSIIG-----NGTIVKGITTDTLERVKVPVPPLEVQREIVKVLDTFT 172 >gi|238787648|ref|ZP_04631446.1| Restriction modification system DNA specificity domain [Yersinia frederiksenii ATCC 33641] gi|238724435|gb|EEQ16077.1| Restriction modification system DNA specificity domain [Yersinia frederiksenii ATCC 33641] Length = 418 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 66/405 (16%), Positives = 140/405 (34%), Gaps = 30/405 (7%) Query: 27 VVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAKG 84 PI T + N + S+S + YI L V T K + + S + K Sbjct: 15 WKPIGEVTLRTNNIKWSDSTRSYRYIDLASVSIETKKITETSVVAANNAPSRAQKLVEKD 74 Query: 85 QILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIEA 139 +++ P + + D + ST + VL+ K LP+ + WL S + + +E Sbjct: 75 DVIFATTRPTQMRYCLIDEKYSGEVASTGYCVLRVKKDEVLPKWILHWLSSREFKKYLEE 134 Query: 140 ICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFI 192 G+ + +PIP L Q+ I + T L E + Sbjct: 135 NQSGSAYPAISDAKVKEFRIPIPYPDNPKKSLEIQMKIVRILDTFTELTAELTAELTAEL 194 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 K++ +++ ++ G+EW L + + + Sbjct: 195 TARKKQYNYYREQLLS--------FEEGGVEWKALGEVAIVQRGASPRPIAKYITDDENG 246 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I + + + + E + +I++ G+ + L + Sbjct: 247 VPWIKIGDTSHGSKYVNQTAQKITQEGAQKSRILNSGDFIISNSMSFGRPYILGIRGAIH 306 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPP 371 G + ++ ++S +L + S + + + SG +L + +K LPV +P Sbjct: 307 DGWAS---ISGFNGTLNSDFLYHYLSSNGVQNYWAGKINSGSVSNLNADIIKALPVPIPA 363 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + Q +I ++++ + L E + + I K+ R ++ Sbjct: 364 LSVQKEIASILDNFDILTNSLSEGLPREINQRKKQYEYYRDLLLS 408 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 48/167 (28%), Gaps = 9/167 (5%) Query: 26 KVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + + G + + + +I + D G+ Q Sbjct: 217 EWKALGEVAIVQRGASPRPIAKYITDDENGVPWIKIGDTSHGSKYVNQTAQKITQEGAQK 276 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQR 136 I G + + R I+ I + L +L S V Sbjct: 277 SRILNSGDFIISNSMSFGRPYILGIRGAIHDGWASISGFNGTLNSDFLYHYLSSNGVQNY 336 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++S+ + I +P+PIP L+ Q I + + ++ Sbjct: 337 WAGKINSGSVSNLNADIIKALPVPIPALSVQKEIASILDNFDILTNS 383 >gi|192361516|ref|YP_001981157.1| type I restriction-modification system, S subunit [Cellvibrio japonicus Ueda107] gi|190687681|gb|ACE85359.1| type I restriction-modification system, S subunit [Cellvibrio japonicus Ueda107] Length = 795 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 57/490 (11%), Positives = 116/490 (23%), Gaps = 104/490 (21%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG--------KDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P+ W+ V + ++ G T + + +++ Sbjct: 87 LPQGWEWVRLGELAEIIRGVTYSKSQSNEIRFHDSVELLRANNIQ--EIINFQGTVFVPS 144 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQG- 126 S S G IL ++ + V++P+ L Sbjct: 145 SLVSESQKIKNGDILIAMSSGSPHLVGKAAQFESNRECTFGAFCAVIRPRCTLLFEYFRV 204 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT--- 183 + + + +G + + + + + N+ + PPL EQ I KI R D Sbjct: 205 FSQTPLYRSQTRQEGKGIGIQNLNKEALENLLVAAPPLNEQHRIIAKIDELMTRCDELEK 264 Query: 184 --------------------------------------LITERIRFIELLKEKKQALVSY 205 E E + E ++A++ Sbjct: 265 LRAAQQEKRRTVHAAAIKQLLNIADPEQHQHAQSFLAEHFGELYTVKENVAELRKAILQL 324 Query: 206 IVTKGLNPDVKMKDSGIEWVGLV-------------------------------PDHWEV 234 V L P E + + P WE Sbjct: 325 AVMGKLVPQNPNDQPASELLKEIEAEKQRLVEEGKIKKPKPFPPVSDEEKPYALPQGWEW 384 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY--------ETYQIV 286 +V + + KPE+Y +I Sbjct: 385 VRVIDIVDVGTGSTPATTNKDYYGGEIPWYTSSATNKLFTEKPETYITEKALKETNCKIF 444 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G ++ + + + A M + ++ Sbjct: 445 PSGSLIIALYGQGKTRGQISELSIAGATNQAIAAMVFYGSSSGTKKYLKYFFIKIYEEIR 504 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE---TARIDVLVEKIE-QSIVL 402 + +L +K V +PP+ EQ I I+ +D + + L Sbjct: 505 KIAEGAAQPNLNVGKIKETLVPLPPLSEQNRIVTKIDELMVFCDTLDQQINIATSKQSEL 564 Query: 403 LKERRSSFIA 412 L + + Sbjct: 565 LN----ALMH 570 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 60/194 (30%), Gaps = 11/194 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFA------LVTELNRKNTKLIESNILSLSYGNIIQKLETR-- 271 + E +P WE VT ++ ++ + + L N IQ++ Sbjct: 79 TDEEKPYALPQGWEWVRLGELAEIIRGVTYSKSQSNEIRFHDSVELLRANNIQEIINFQG 138 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS-AQVMERGIITSAYMAVKPHGIDS 330 + + Q + G+I+ + + + ++P Sbjct: 139 TVFVPSSLVSESQKIKNGDILIAMSSGSPHLVGKAAQFESNRECTFGAFCAVIRPRCTLL 198 Query: 331 TYLAWLM-RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + ++ G G+ Q+L E ++ L V PP+ EQ I I+ R Sbjct: 199 FEYFRVFSQTPLYRSQTRQEGKGIGIQNLNKEALENLLVAAPPLNEQHRIIAKIDELMTR 258 Query: 389 IDVLVEKIEQSIVL 402 D L + Sbjct: 259 CDELEKLRAAQQEK 272 >gi|77164669|ref|YP_343194.1| restriction modification system DNA specificity subunit [Nitrosococcus oceani ATCC 19707] gi|254434340|ref|ZP_05047848.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] gi|76882983|gb|ABA57664.1| Restriction modification system DNA specificity domain [Nitrosococcus oceani ATCC 19707] gi|207090673|gb|EDZ67944.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] Length = 407 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 144/416 (34%), Gaps = 46/416 (11%) Query: 21 IPKHWKVVPIKR-FTKLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P+ WK++P+ + G + S + +G++D+++G + Sbjct: 16 LPRGWKLLPVGKALIDSQYGTNAASVDAGNTPVVGMKDIQAGKVLTSNLVRANLSDKERA 75 Query: 78 VSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + KG IL + + + K I D D + +++ K L +++ +L Sbjct: 76 KYLLEKGDILINRTNSFDLVGKVGIYDSDIEAAFASYLVRLKADLSQVMPEYLNYWLNGH 135 Query: 136 RIEAICEGATMSHADWKGIG------NIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + + +P+P + EQ + D +I + Sbjct: 136 VAQTTIKRIATKAISQANVNPTEFKKHCYIPLPSIGEQREAVSVL----KTNDRVIEKIE 191 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 R I +++ ++L+ + + + W L ++ K Sbjct: 192 RLIAAKQKRFKSLIQQL------------------INKNCELWPHFKARDLFRNISIKGY 233 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + ++ G + + + + S + Y++V PG V Q Sbjct: 234 GNEKLLSVTQDCGVLPRTMLEGRVMSPEGSTDNYKLVVPGNFVISLRSFQG-----GLEY 288 Query: 310 VMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLC-KVFYAMGSGLR--QSLKFEDVKRL 365 +GI++ AY + P +SY K G+R + + D + + Sbjct: 289 SKYKGIVSPAYTILFPKKEIHDDFYRHFFKSYIFIEKYLVIAVVGIRDGKQISSSDFESV 348 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + PP++EQ I ++N T I + ++ + ++ + +TG+ + Sbjct: 349 KIPYPPVQEQRYIAEILNTATEEIKKFKQLAKK----YRTQKRGLMQKLLTGKWQV 400 >gi|297587004|ref|ZP_06945649.1| phosphoribosylformylglycinamidine synthase [Finegoldia magna ATCC 53516] gi|297574985|gb|EFH93704.1| phosphoribosylformylglycinamidine synthase [Finegoldia magna ATCC 53516] Length = 485 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 73/424 (17%), Positives = 133/424 (31%), Gaps = 51/424 (12%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP+ WK V + + NTG+ ++ YI ++ + + Sbjct: 66 DIPETWKWVRVGTIFQHNTGKALNRANREGIELEYITTSNLYWDRFELDNLKKMYFKEIE 125 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 KG +L + G R AI + + + L Sbjct: 126 LKKYGVMKGDLLVCEGGDVGRAAIWEYESSVMIQNHIHRLRAYYSICTRFFYYLFYLYKN 185 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +G + + +G+I P+PPLAEQ I EKI +D + EL Sbjct: 186 AGLIGGKGIGIKGLSTRALGSIVFPLPPLAEQKRIVEKIEELMPLVDKYEKNWQKLEELN 245 Query: 196 KE----KKQALVSYIVTKGLNPDVKMKDSGIEWVG------------------------- 226 K+ K++L+ + L K + +G E Sbjct: 246 KKFPEDMKKSLLQEAIKGKLVEQRKEEGTGAELFEKIQKEKKKLVEEGRIKKQKALPQIT 305 Query: 227 ------LVPDHWEVKPFFALV--TELNRKNTKLIESNILSLSYGN-IIQKLETRNMGLKP 277 +P++W+ + + K I ++ N K++ N Sbjct: 306 EEEIPFDIPENWKWTRLNECIDVRDGTHDTPKYIAKGYPLVTSKNLKHGKIDFSNCKFIS 365 Query: 278 ES----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + VD +I+F I + +R E I A + + YL Sbjct: 366 KEDHIKISKRSKVDVNDILFAMIGSIGNPVKVR--DDNEFSIKNMALFKPIKNNFNMDYL 423 Query: 334 AWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 W + + G QS + + ++ + +PP+ EQ I ++ A D L Sbjct: 424 FWFLYIS--QDNMKKIAYGAVQSFVSLKFLREYLIPLPPLAEQKRIVEKLDEMLAYCDEL 481 Query: 393 VEKI 396 ++ I Sbjct: 482 LKII 485 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 56/209 (26%), Gaps = 15/209 (7%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL------ETRNMG 274 E +P+ W+ + K + L Y + Sbjct: 60 EDEIPFDIPETWKWVRVGTIFQHNTGKALNRANREGIELEYITTSNLYWDRFELDNLKKM 119 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E V G+++ + +I + ++ + T Sbjct: 120 YFKEIELKKYGVMKGDLLVCEGGDVGRAAIW---EYESSVMIQNHIHRLRAYYSICTRFF 176 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + + L + + +PP+ EQ I I +D E Sbjct: 177 YYLFYLYKNAGLIGGKGIGIKGLSTRALGSIVFPLPPLAEQKRIVEKIEELMPLVDKY-E 235 Query: 395 KIEQSIVLL-----KERRSSFIAAAVTGQ 418 K Q + L ++ + S + A+ G+ Sbjct: 236 KNWQKLEELNKKFPEDMKKSLLQEAIKGK 264 >gi|21228395|ref|NP_634317.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] gi|20906868|gb|AAM31989.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] Length = 406 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 70/426 (16%), Positives = 134/426 (31%), Gaps = 44/426 (10%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTK----LNTGRTSESGKDIIYIGLEDVESGTGKY 63 P YK + V G IP+ W ++ K + G I + +++ Y Sbjct: 12 PGYKQTEV---GVIPEDWNDPKLEDIVKEESPICYGIVQVGSYTANGIPVLAIKNLNSDY 68 Query: 64 LPKDGNSRQSDTSTV--SIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPKDVL 120 + S +L G R I+ F G S L ++ + Sbjct: 69 TTNIHRASVEVERPYLRSRVYPEDVLISVKGTTGRIGIVPLGFYGNISRDLARLHLREGI 128 Query: 121 -PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAET 178 P+ + L S + Q + G T + + +P PP AEQ I E + Sbjct: 129 VPKFIFQMLQSNLMQQHLGVAVVGTTRMELSISILKKVRIPFPPTKAEQESIAEALSYTD 188 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 I++L + ++ + Q L+ +G + WE KP Sbjct: 189 AFIESLEQLIAKKRQIKQGAMQELL----------------TGKRRLPGFSKEWETKPLG 232 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFI 296 + ++ N I + N + E + G+I+ Sbjct: 233 DVAEITMGQSPSSANYNSKGEGLPLIQGNADIFNRKTIKRVFTTEITRRGKCGDIIMSVR 292 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + + RG+ Y + +L + S + + GS S Sbjct: 293 APVGEVSRAEFDICLGRGVCAIRY--------SNNFLYHTLISKESTWAKLSKGS-TFDS 343 Query: 357 LKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + DVK + +P EQ I +++ A + +E+ + ++ + + + Sbjct: 344 VNSADVKAFDIELPTDSAEQEAIATILSDMDAE----ITALEEKLAKARQIKQGMMQELL 399 Query: 416 TGQIDL 421 TG+ L Sbjct: 400 TGRTRL 405 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 68/214 (31%), Gaps = 18/214 (8%) Query: 224 WVGLVPDHWEVKPFFA------LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 VG++P+ W + + + I L+ N+ T Sbjct: 18 EVGVIPEDWNDPKLEDIVKEESPICYGIVQVGSYTANGIPVLAIKNLNSDYTTNIHRASV 77 Query: 278 ESYETY--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E Y V P +++ + + G I+ + + Sbjct: 78 EVERPYLRSRVYPEDVLISVKGTTGRIGIVP---LGFYGNISRDLARLHLREGIVPKFIF 134 Query: 336 LMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVL 392 M +L + + R L +K++ + PP EQ I + + D Sbjct: 135 QMLQSNLMQQHLGVAVVGTTRMELSISILKKVRIPFPPTKAEQESIAEAL----SYTDAF 190 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +E +EQ I ++ + + +TG+ L G S+ Sbjct: 191 IESLEQLIAKKRQIKQGAMQELLTGKRRLPGFSK 224 >gi|119357207|ref|YP_911851.1| restriction modification system DNA specificity subunit [Chlorobium phaeobacteroides DSM 266] gi|119354556|gb|ABL65427.1| restriction modification system DNA specificity domain [Chlorobium phaeobacteroides DSM 266] Length = 557 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 68/463 (14%), Positives = 126/463 (27%), Gaps = 87/463 (18%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII----YIGLEDVESGTGKYLPKDGNSRQSDT 75 IP W V + N+G+T + G++ YI ++ G + + D Sbjct: 86 DIPSSWIWVRFGDIARHNSGKTLDKGRNTGESRDYITTSNLYWGKFELENVRQMLIREDE 145 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDV 133 K +L + G R A+ +C + KD+ P + + + Sbjct: 146 LEKCTAKKDDLLICEGGEAGRAAMWPFDSEVCFQNHIHRARFYKDIDPYFVYRFFEKLSA 205 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR------------- 180 T I +G +S+ K + +I P+PP +EQ I +I R Sbjct: 206 TGEINQHRKGVGISNMSSKSLASIVFPLPPFSEQHRIVARIDQLMARCNELEKLRKEREE 265 Query: 181 ------------------------IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 I E E + E ++A++ V L P + Sbjct: 266 KRLIVHAAAIKQLFDAPDGSAWGFIQQHFNELYSVKENVAELRKAILQLAVMGRLVPQDQ 325 Query: 217 MKDSGIEWVGL-------------------------------VPDHWEVKPFFALVTELN 245 E + +P +W F + + Sbjct: 326 NDPPASELLKEIEKEKASHECTKSRRKGEKLPEIFNEEMPHKIPSNWAWVRFGDIAQHNS 385 Query: 246 RK---NTKLIESNILSLSYGNIIQK---LETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 K + ++ N+ + LE L E +++ Sbjct: 386 GKTLDKGRNTGQPREYITTSNLYRGRFELENVRQMLIREDELEKCTAKKDDLLICEGGEA 445 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLK 358 R+ E + A ID + G+ ++ Sbjct: 446 G--RAAVWPFDSEVCFQNHIHRARFYKDIDPYFAYRFFEKLSATGEINQHRKGVGISNMS 503 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + + + +PP EQ I + D L +Q I Sbjct: 504 SKALASIVFPLPPQPEQHRIVARTDQLMTLCDQL----DQQID 542 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 67/190 (35%), Gaps = 6/190 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII----YIGLEDVESGTGKYLPKDGNSRQSDT 75 IP +W V + N+G+T + G++ YI ++ G + + D Sbjct: 367 KIPSNWAWVRFGDIAQHNSGKTLDKGRNTGQPREYITTSNLYRGRFELENVRQMLIREDE 426 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDV 133 K +L + G R A+ +C + KD+ P + + Sbjct: 427 LEKCTAKKDDLLICEGGEAGRAAVWPFDSEVCFQNHIHRARFYKDIDPYFAYRFFEKLSA 486 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 T I +G +S+ K + +I P+PP EQ I + D L + + Sbjct: 487 TGEINQHRKGVGISNMSSKALASIVFPLPPQPEQHRIVARTDQLMTLCDQLDQQIDDAVG 546 Query: 194 LLKEKKQALV 203 E A++ Sbjct: 547 KQTEILNAVL 556 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 52/187 (27%), Gaps = 9/187 (4%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRK------NTKLIESNILSLSYGNIIQKLETRNMGLK 276 E +P W F + + K NT I + + +LE L Sbjct: 82 EIPYDIPSSWIWVRFGDIARHNSGKTLDKGRNTGESRDYITTSNLYWGKFELENVRQMLI 141 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 E +++ R+ E + A ID ++ Sbjct: 142 REDELEKCTAKKDDLLICEGGEAG--RAAMWPFDSEVCFQNHIHRARFYKDIDPYFVYRF 199 Query: 337 MRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 G+ ++ + + + +PP EQ I I+ AR + L + Sbjct: 200 FEKLSATGEINQHRKGVGISNMSSKSLASIVFPLPPFSEQHRIVARIDQLMARCNELEKL 259 Query: 396 IEQSIVL 402 ++ Sbjct: 260 RKEREEK 266 >gi|315453995|ref|YP_004074265.1| putative type I restriction-modification system [Helicobacter felis ATCC 49179] gi|315133047|emb|CBY83675.1| putative type I restriction-modification system [Helicobacter felis ATCC 49179] Length = 437 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 54/418 (12%), Positives = 130/418 (31%), Gaps = 32/418 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P+ + V + L G + + + + I + ++ G Sbjct: 15 PQGVEFVELGEVCSLLNGYSFKKSDYVEKSNTLLIRMGNIRPNGGFNPEHKPIYLPDSFL 74 Query: 77 TVSI---FAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWL- 128 + G IL G + + + + + + + Sbjct: 75 EKYKNYALSDGDILIAMSGNNVGMTSLIKNIKGRKLLLNQRVAKPHNLSPNIHVPFLYYV 134 Query: 129 -LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 ++ V + I+++ + A + I + +P+PPL Q I + T Sbjct: 135 LITQRVKKYIQSLSDAAAQPNLSTASILALKIPLPPLIIQEKIVTILDCFTEL------- 187 Query: 188 RIRFIELLKEKKQALVSYIVTKG------LNPDVKMKDSG-IEW--VGLVPDHWEVKPFF 238 + K++ ++ ++ G L +K+S +EW +G + + Sbjct: 188 -SAELSARKKQYSYYLNALLDFGTPTSPRLGRHALLKESFKVEWVELGTIGEFVRGSGLT 246 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + N +L+ + + + E + + V G +V + Sbjct: 247 KADLHPDNPNGELVGAIHYGEIHTFYNVHTSKTKSFITQELAKKLKPVYCGNLVIVGVSE 306 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSL 357 A + + I H + YLA+L ++ G L Sbjct: 307 NPADVCKAVAYLGQETIYIGGDTFALRHQQNPKYLAYLFQTQAFKDFKLKYTCGAKVSRL 366 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +D+K + +PP+ Q I +++ A L + + I +++ + ++ A + Sbjct: 367 NLQDLKTFLIPLPPLALQEKIVEILDQFNALTTDLQQGLPAEIEAREKQYTHYLNALL 424 >gi|293393084|ref|ZP_06637399.1| type I restriction enzyme EcoKI specificity protein [Serratia odorifera DSM 4582] gi|291424230|gb|EFE97444.1| type I restriction enzyme EcoKI specificity protein [Serratia odorifera DSM 4582] Length = 433 Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats. Identities = 55/408 (13%), Positives = 140/408 (34%), Gaps = 39/408 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W + +L G++ L + N S Sbjct: 5 KLPKGWGCTLLGHVIELKYGKS-----------LSAQTRDGVGFHVYGSNGVVGKHSIPL 53 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I G ++ G+ G + + T + + + E +L + +T Sbjct: 54 INHSG-LIVGRKGSFGVVQKSTEPFFPIDTTYYIDDFYNQPLEYWFYYLSFLPLT----K 108 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + N+ + +PP+ EQ +I EK+ + D + ++LK + Sbjct: 109 LNRSTAIPGLNRDDAYNLDIVLPPITEQKIIAEKLDTLLAQADRTKARLEQIPQILKRFR 168 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 Q +++ IV L+ + + + +K +++ + + E+ I L Sbjct: 169 QVMLAAIVNGKLSTNTEQWKI-----------YSLKNLCVSISDGDHQAPPKSETGIPFL 217 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDP------GEIVFRFIDLQNDKRSLRSAQVMER 313 ++ + + Y + +I++ + + Sbjct: 218 VISDVNKGKIDLVNVSRWVPESYYLALKEIRKPSLNDILYTVTGSFGIPVVV---NTTKP 274 Query: 314 GIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP 370 +KP+ + +L++ + S + K + +G ++++ ++ + VP Sbjct: 275 FCFQRHIAIIKPNSNLINYRFLSFYLESPQIFKHASDVATGTAQKTVSLSSLRNFELSVP 334 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +KEQ I + + A D + ++++ ++ + S +A A G+ Sbjct: 335 SLKEQAVIVHRVEQLFAYADTIEKQVKSALTRVNNLTQSILAKAFRGE 382 >gi|312622840|ref|YP_004024453.1| restriction modification system DNA specificity domain [Caldicellulosiruptor kronotskyensis 2002] gi|312203307|gb|ADQ46634.1| restriction modification system DNA specificity domain [Caldicellulosiruptor kronotskyensis 2002] Length = 417 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 62/420 (14%), Positives = 132/420 (31%), Gaps = 36/420 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ WK V + K I+ D G P Sbjct: 7 KLPEGWKWVKLGEVLAYEQ-----PNKYIVKDEQYDKRHGIPVLTPGKTFILGFTQEHQG 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I+ ++ + I F S+ +L+ K L + Sbjct: 62 IYNNIPVIIFDDFTTESRYIAFPFKLK-SSAVKILKSKCNFVNLYYVYNSMQL-----LN 115 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+ +P+PPL EQ I E + I+ ++ + + Sbjct: 116 FKPGSEHKRFWISEYSKFLIPLPPLPEQRKIAEILETIDNAIEKTDAIIEKYKRIKQGLM 175 Query: 200 QALVSYIVTK---GLNPDVKMKDSGIEW-----VGLVPDHWEVKPFFALVTELNRKNT-- 249 Q L++ V G + +++D I+ +G +P+ WEV + V +N Sbjct: 176 QDLLTKGVVNEGEGESERWRLRDENIDKFKDSPLGRIPEEWEVVDVYGHVNLINGGTPST 235 Query: 250 -----KLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQND 301 LS+ NI ++ + E +++ G ++ Sbjct: 236 ERPEFWNGSIPWLSVEDFNIGKRWVFSSSKYITELGLKQSATKLLKKGMLIISARGTVGV 295 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 L + + + K S + + + + ++ E Sbjct: 296 LAQLGADMAFNQSCYG---LDAKDKMKLSNDFLYYALKHFITSFLSLAYGNVFNTITRET 352 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 K + + +PP+ EQ I +++ ++ID ++EK + L+ + + +TG++ + Sbjct: 353 FKEILIPLPPLPEQQRIASIL----SQIDEVIEKEQAYKEKLERIKKGLMEDLLTGKVRV 408 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 68/209 (32%), Gaps = 11/209 (5%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTG 61 ++KDS +G IP+ W+VV + L G T + I ++ +ED G Sbjct: 202 DKFKDSP---LGRIPEEWEVVDVYGHVNLINGGTPSTERPEFWNGSIPWLSVEDFNIGKR 258 Query: 62 KYL--PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 K S + KG ++ G A + + + + + Sbjct: 259 WVFSSSKYITELGLKQSATKLLKKGMLIISARGTVGVLAQLGADMAFNQSCYGLDAKDKM 318 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + ++ G + + I +P+PPL EQ I + Sbjct: 319 KLSNDFLYYALKHFITSFLSLAYGNVFNTITRETFKEILIPLPPLPEQQRIASILSQIDE 378 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVT 208 I+ + + + K + L++ V Sbjct: 379 VIEKEQAYKEKLERIKKGLMEDLLTGKVR 407 >gi|229015569|ref|ZP_04172564.1| type I restriction-modification enzyme, S subunit, EcoA [Bacillus cereus AH1273] gi|229021767|ref|ZP_04178345.1| type I restriction-modification enzyme, S subunit, EcoA [Bacillus cereus AH1272] gi|228739514|gb|EEL89932.1| type I restriction-modification enzyme, S subunit, EcoA [Bacillus cereus AH1272] gi|228745716|gb|EEL95723.1| type I restriction-modification enzyme, S subunit, EcoA [Bacillus cereus AH1273] Length = 404 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 61/403 (15%), Positives = 147/403 (36%), Gaps = 26/403 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + W+ + + G+ + + I ++ + G+ + + ++ + Sbjct: 13 EEWETYSLADIADFHKGKGISKNELSSEGELCILYGELYTKYGEVTTEIYSKTNIESKEL 72 Query: 79 SIFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 K +L G + + + + +L+ K+ + + ++ Sbjct: 73 IKSKKYDVLIPSSGETAKDIACSTCVLQENILIGGDLNILRFKNNIDGRFISYQINGIKK 132 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 Q + +GAT+ H +GI + + IP L EQ I +++ + I + + I+L Sbjct: 133 QELSKYAQGATVVHLYSQGIKKLYLKIPNLEEQQKISNLLLSLDEK----IQLQQQKIDL 188 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+E+K+ + + K +M+ +G WE + + + Sbjct: 189 LQEQKKGFLQKMFPKADEAQPEMRFAG------FTGDWEERALKEVGDFVRTSIDPQAAP 242 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + Y + +S ++ ++ GE++ KR + Sbjct: 243 DSEFIEYSMPSYDNGRLPEHVLGKSMQSMRLKISGEVLLINKLNVRQKRIWLIEDAPDNA 302 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPP 371 + ++ +M ID T+L LM S + ++ SG ++ + DV + + +P Sbjct: 303 VASNEFMPFTSEKIDMTFLEQLMLSDKTTRDLESISSGTSNSQKRITPPDVLKYQIKLPK 362 Query: 372 -IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I ++D ++ +Q I + KE++ F+ Sbjct: 363 ERDEQEKIGIF----FKQLDNIIVLHQQKIDIYKEQKKGFMQQ 401 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 72/202 (35%), Gaps = 7/202 (3%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ K+ EW ++ F + + E IL ++ T Sbjct: 4 PKLRFKEFDEEW--ETYSLADIADFHKGKGISKNELSSEGELCILYGELYTKYGEVTTEI 61 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDST 331 + +++ + + + E +I + + ID Sbjct: 62 YSKTNIESKELIKSKKYDVLIPSSGETAKDIACSTCVLQENILIGGDLNILRFKNNIDGR 121 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 ++++ + ++ L + +K+L + +P ++EQ I+N++ +D Sbjct: 122 FISYQINGIKKQELSKYAQGATVVHLYSQGIKKLYLKIPNLEEQQKISNLLLS----LDE 177 Query: 392 LVEKIEQSIVLLKERRSSFIAA 413 ++ +Q I LL+E++ F+ Sbjct: 178 KIQLQQQKIDLLQEQKKGFLQK 199 >gi|269978346|gb|ACZ55907.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 427 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 53/407 (13%), Positives = 129/407 (31%), Gaps = 21/407 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKNNINVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + K++ Q + L+ + ++ + P +K + + KL Sbjct: 192 LNARKKQYQYYQN----MFLDFNDINQNHKDAKMSAKPYPKRLKTLLQTLAPKGVEFRKL 247 Query: 252 IESNILSLSYGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 E + +++ + + ++ I + + Sbjct: 248 GEVCEIIRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVNWQ 307 Query: 311 MERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 ++ +V P+ + YL +++ + + S + S+ ++ ++ + + Sbjct: 308 NQKFWANDVCFSVIPNETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITIPI 367 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 PP++ Q +I +++ +A L+ I I K+ R + Sbjct: 368 PPLEIQQEIVKILDQFSALTTDLLAGIPAEIKARKKQYEYYREKLLT 414 >gi|224437016|ref|ZP_03657997.1| type I restriction-modification system specificity subunit [Helicobacter cinaedi CCUG 18818] gi|313143488|ref|ZP_07805681.1| predicted protein [Helicobacter cinaedi CCUG 18818] gi|313128519|gb|EFR46136.1| predicted protein [Helicobacter cinaedi CCUG 18818] Length = 404 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 58/400 (14%), Positives = 114/400 (28%), Gaps = 26/400 (6%) Query: 23 KHWKVVPIKRFTK-LNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 + W+ V + K + +G T +S +I ++ ++V++ + Sbjct: 20 EQWQEVRLGEVAKQIVSGGTPKSTQAEYYNGNIPWLNTKEVKNCRIYATERQITELGLCN 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S+ K ++ G K I + + + +D Sbjct: 80 SSAKWIDKNSVIVAMYGATAGKVAINKIPLTTNQACCNISVDSEKANYNFIYYTLLDSFD 139 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R++ + GA + + I N +PPL Q I E + + +ID L + L Sbjct: 140 RLDQMTSGAAQQNLNVGLISNFTFLLPPLTTQQKIAEILSSFDDKIDLLHRQNKTLESLA 199 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + + + +K G G P E I+ Sbjct: 200 LTLFRHYFIDNPNRSEWEEKPLKYFGNIICGKTP--------PKNQKEYFNGTYPFIKIP 251 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + GL + +T I + + ++ I Sbjct: 252 DMHNNVFVFQTADSLTQQGLDSQKAKTLPPFSVCVSCIATIGVVSMNANIAQTNQQINSI 311 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + +L M+S A G +L D + +L+P KE Sbjct: 312 V-------PHKEHYRYFLYCSMKSSFDELEAMASGGTATANLNTTDFSNMKLLLPREKE- 363 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I + ET + + I L+ R + A Sbjct: 364 --ILRF-HTETLPFFDKIYNNTKQIQNLQAMRDVLLKAIF 400 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 18/190 (9%), Positives = 52/190 (27%), Gaps = 8/190 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDG-NSRQSDT 75 W+ P+K F + G+T + +I + D+ + + D + D+ Sbjct: 214 SEWEEKPLKYFGNIICGKTPPKNQKEYFNGTYPFIKIPDMHNNVFVFQTADSLTQQGLDS 273 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + + + + + Q + P + + Sbjct: 274 QKAKTLPPFSVCVSCI-ATIGVVSMNANIAQTNQQINSIVPHKEHYRYFLYCSMKSSFDE 332 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 G ++ + N+ + +P E + + + +I + + Sbjct: 333 LEAMASGGTATANLNTTDFSNMKLLLPREKEILRFHTETLPFFDKIYNNTKQIQNLQAMR 392 Query: 196 KEKKQALVSY 205 +A+ Sbjct: 393 DVLLKAIFKE 402 >gi|261417780|ref|YP_003251462.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y412MC61] gi|319767407|ref|YP_004132908.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y412MC52] gi|261374237|gb|ACX76980.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y412MC61] gi|317112273|gb|ADU94765.1| restriction modification system DNA specificity domain protein [Geobacillus sp. Y412MC52] Length = 429 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 64/422 (15%), Positives = 141/422 (33%), Gaps = 36/422 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + +N G + ++DV ++ K + ++ ++ S F Sbjct: 5 GWKETRLIDVIDINPRTPLRKGTLAKKVSMQDV----AEFTRKIQSYEIAEFTSGSKFKN 60 Query: 84 GQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVT 134 G L ++ P L + + ST+F+VL+ K+ + + + +S + Sbjct: 61 GDTLLARITPCLENGKTAYVDILEDNEIAFGSTEFIVLRAKEGITDSKFVYYLAISPEFR 120 Query: 135 QRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 G++ + N + +PPL EQ I + ID I + Sbjct: 121 NVAIKSMTGSSGRQRVQSDVLANTVICLPPLQEQKRIANLL----SAIDDKIELNNEMNK 176 Query: 194 LLKEKKQALVSYIVTKGLNPDV---KMKDSG----IEWVGLVPDHWEVKPFFALVTELNR 246 L+E Q + P+ K SG +G++P+ W V L + Sbjct: 177 TLEELAQTIFKRWFVDFEFPNENGEPYKSSGGKFVESELGMIPEGWRVATIGDLGDVVGG 236 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE-------IVFRFIDLQ 299 + + + I + N + + I + G + + Sbjct: 237 GTPSKKREDYFTQNGIPWITPKDLSNSKNRYVERGSVDITEEGLKNSSAKLLPKGTVLFS 296 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + A + +V PH + + + Y+ + + + Sbjct: 297 SRAPIGYIAIAKNEVTTNQGFKSVIPHKDIGSEFVFQVLKYNKDLIESRASGTTFKEISG 356 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ++K++P+++P ++ + N + L+ E+ I L R S + ++G+I Sbjct: 357 GELKKVPIVLPKME----VIQRYNEAVRSLGKLICNNEEEINALISMRDSLLPKLMSGEI 412 Query: 420 DL 421 + Sbjct: 413 RV 414 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 71/209 (33%), Gaps = 16/209 (7%) Query: 10 YKDSG---VQW-IGAIPKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVES 58 YK SG V+ +G IP+ W+V I + G T I +I +D+ + Sbjct: 203 YKSSGGKFVESELGMIPEGWRVATIGDLGDVVGGGTPSKKREDYFTQNGIPWITPKDLSN 262 Query: 59 GTGKYLPK---DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 +Y+ + D S+ + KG +L+ P IA + + F + Sbjct: 263 SKNRYVERGSVDITEEGLKNSSAKLLPKGTVLFSSRAPI-GYIAIAKNEVTTNQGFKSVI 321 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 P + + + IE+ G T + +P+ +P + E + Sbjct: 322 PHKDI-GSEFVFQVLKYNKDLIESRASGTTFKEISGGELKKVPIVLPKMEVIQRYNEAVR 380 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVS 204 + I E I + L+S Sbjct: 381 SLGKLICNNEEEINALISMRDSLLPKLMS 409 >gi|218666566|ref|YP_002426003.1| type I restriction-modification system, S subunit [Acidithiobacillus ferrooxidans ATCC 23270] gi|218518779|gb|ACK79365.1| type I restriction-modification system, S subunit [Acidithiobacillus ferrooxidans ATCC 23270] Length = 399 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 57/415 (13%), Positives = 118/415 (28%), Gaps = 42/415 (10%) Query: 24 HWKVVPIKRFTK-LNTGRTS--ESGKDIIYIGLEDVESGTGKY-LPKDGNSRQSDTSTVS 79 W V P+ + + +N G + + + + + Y L + S Sbjct: 4 GWHVEPLSKVCQLINRGISPVYLDDGGTAVLNQKCIRDHSINYDLGRRHCVTTKRVSADK 63 Query: 80 IFAKGQILYGKLGP-YLRKAII----ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + G +L G L + + +++PKD L I + Sbjct: 64 LVRVGDVLVNSTGTGTLGRVAQVRDEPHEPTTVDSHVTIVRPKDGLFFPEFFGYALIAIE 123 Query: 135 QRIEAICEGATMSH---ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +I+ EG + EQ I + I T + Sbjct: 124 NQIQEGGEGCGGQTELARSKLANDYHVSFPTSIPEQRRIVAILDEAFEGIATAKANAEKN 183 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ +E ++ ++ + + + W K + K Sbjct: 184 LQNAREVFESHLNAVFS------------------QRGEGWVEKRLDEVGKTQTGSTPKA 225 Query: 252 IES-----NILSLSYGNIIQK--LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 E +I + G+ + N GL ++V + I K + Sbjct: 226 SEPENLGKHIPFVKPGDFKPDGSITYDNEGLSQNGAAKARLVMAPSAIMVCIGATIGKSA 285 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVK 363 + + I S GI + + + M + D + G + Sbjct: 286 YANRIIATNQQINS---LTPATGISAKMVYYQMITVDFQRRVHENAGQATLPIINKSKWS 342 Query: 364 RLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 L + +PP + EQ I ++ L ++ ++ L E + S + A G Sbjct: 343 SLSIFIPPTVDEQNHIVARLDNLHEETQRLESLYQKKLIALDELKQSLLHQAFNG 397 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 74/197 (37%), Gaps = 9/197 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + W + K TG T ++ GK I ++ D + G + Q+ + Sbjct: 204 EGWVEKRLDEVGKTQTGSTPKASEPENLGKHIPFVKPGDFK-PDGSITYDNEGLSQNGAA 262 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQ 135 + + +G + K+ A+ + Q L P + +++ ++++D + Sbjct: 263 KARLVMAPSAIMVCIGATIGKSAYANRIIATNQQINSLTPATGISAKMVYYQMITVDFQR 322 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIEL 194 R+ AT+ + ++ + IPP + EQ I ++ L + + + Sbjct: 323 RVHENAGQATLPIINKSKWSSLSIFIPPTVDEQNHIVARLDNLHEETQRLESLYQKKLIA 382 Query: 195 LKEKKQALVSYIVTKGL 211 L E KQ+L+ L Sbjct: 383 LDELKQSLLHQAFNGDL 399 >gi|312902064|ref|ZP_07761325.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0470] gi|311290846|gb|EFQ69402.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0470] Length = 380 Score = 113 bits (282), Expect = 6e-23, Method: Composition-based stats. Identities = 66/395 (16%), Positives = 137/395 (34%), Gaps = 33/395 (8%) Query: 31 KRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 T+ +G T ++G +I +I ++ + + + S+ I KG Sbjct: 2 GDITESFSGGTPQAGNSDYYDGEIPFIRSGEINDSQTELF---ITEKGLNNSSAKIVEKG 58 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ILY G + I+ +G + L ++P E + + I Sbjct: 59 DILYALYGATSGEVGISQINGAINQAILAIRP-IKEDEPYLIAQWLLKQKESIIRTYLQG 117 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + + +P + KI ++D IT R +E LKE K A + Sbjct: 118 GQGNLSSSIVKELVLKLPKDKAEQ---AKIGTFFKQLDDTITLHQRKLEQLKELKTAYLQ 174 Query: 205 -YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 V+ + K ++ G W+ + L + KN + ++ G Sbjct: 175 VMFVSMKTKNNKVPKLRFADFGGE----WDQRKSKELFIPKSEKNQPNLPVLSVTQDSGV 230 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + ++ S + Y++V+ + V Q ++GI + AY Sbjct: 231 VYRDQVGIDINYDLTSLKNYKVVNKNDFVISLRSFQG-----GFELSDKKGITSPAYTIF 285 Query: 324 KPHG---IDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVP-PIKEQFD 377 P D+ + +++ + + G+R +S+ F + L + P KEQ Sbjct: 286 VPKDIKLHDNLFWKTQFKTFQFIEALKTVTFGIRDGKSISFTEFGDLKLCFPKNKKEQQK 345 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I +D + + + LK + S++ Sbjct: 346 IGKF----FEELDYAISLHQNKLTQLKSLKKSYLQ 376 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 52/189 (27%), Gaps = 12/189 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP----KDGNSRQSDTSTVS 79 W K +S K+ + + V +G D N + Sbjct: 198 EWDQRKSKELF------IPKSEKNQPNLPVLSVTQDSGVVYRDQVGIDINYDLTSLKNYK 251 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSIDVTQRIE 138 + K + L + ++D GI S + + PKD+ L + L Sbjct: 252 VVNKNDFVIS-LRSFQGGFELSDKKGITSPAYTIFVPKDIKLHDNLFWKTQFKTFQFIEA 310 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + ++KI +D I+ + LK Sbjct: 311 LKTVTFGIRDGKSISFTEFGDLKLCFPKNKKEQQKIGKFFEELDYAISLHQNKLTQLKSL 370 Query: 199 KQALVSYIV 207 K++ + + Sbjct: 371 KKSYLQNMF 379 >gi|253995601|ref|YP_003047665.1| restriction modification system DNA specificity domain-containing protein [Methylotenera mobilis JLW8] gi|253982280|gb|ACT47138.1| restriction modification system DNA specificity domain protein [Methylotenera mobilis JLW8] Length = 397 Score = 113 bits (282), Expect = 7e-23, Method: Composition-based stats. Identities = 68/419 (16%), Positives = 144/419 (34%), Gaps = 46/419 (10%) Query: 23 KHWKVVPIKRFTKLNTG--RTSESGK-DIIYIGLEDVESGTGKYLP--KDGNSRQSDTST 77 WK V ++ + +T + + + D++ G K S Sbjct: 2 SEWKTVKLEEIASVIDSLHQTPSYSDLGLPMVRVTDIKKGKLNLTNTLKVSKEVFDKFSK 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 KG IL+ ++G Y I+ D C Q + L +L S + I Sbjct: 62 NHTPKKGDILFSRVGSYGNTCIVDDETEFCLGQNTAFIVPNENSLFLYYFLNSPNGINEI 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E+ G+T K I N +P PP EQ I + + +ID L + + + Sbjct: 122 ESSVAGSTQPTVSLKSIKNFEIPQPPHREQKAIASVLSSLDDKIDLLHRQNNTLEYMTE- 180 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + +E E+ + ++ K+ +L S Sbjct: 181 -----------------TLFRQWFVEEALEDWAFVELGEYVNCFNGVSYKSAELNPSKTA 223 Query: 258 SLSYGNIIQKLETRNMGLKPES--YETYQIVDPGEIVFRFIDLQ------NDKRSLRSAQ 309 ++ + + R G K + Y+ +V G++V D+ + + ++ Sbjct: 224 MVTLKSFDRNGGFRLDGFKEFTGRYKEQHVVVQGDLVVAHTDITQNAEVIGNPVLVVASP 283 Query: 310 VMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLP 366 E +I+ + V + + +L +MR+ + + +G L + + Sbjct: 284 DYETIVISMDLVKVTSKFDWLSNEFLYRMMRTREFKEHCLGYSNGSTVLHLSKQAIPTYE 343 Query: 367 VLVPPIK-EQ--FDIT-NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +PP + Q I +++ + I+ I +L++ R + + ++G+I + Sbjct: 344 FFLPPKEKIQSFTTIAKDMLGKKFKNIE--------QIQILEKLRDTLLPKLMSGEIRI 394 >gi|59713721|ref|YP_206496.1| type I restriction-modification system specificity subunit [Vibrio fischeri ES114] gi|59481969|gb|AAW87608.1| type I restriction-modification system specificity subunit [Vibrio fischeri ES114] Length = 406 Score = 113 bits (282), Expect = 7e-23, Method: Composition-based stats. Identities = 75/403 (18%), Positives = 150/403 (37%), Gaps = 21/403 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDII--YIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 WK V + + I +GLE +ESG YL + + S T T Sbjct: 15 SDWKKVKFGDVVFEPKESVKDPVSEGIEHVVGLEHIESGD-MYLRRSASIEGSTTFTKKF 73 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIE 138 KG +L+G+ YL+KA A F GICS V++ +D L P+L+ + + Sbjct: 74 V-KGDVLFGRRRAYLKKAAKAKFSGICSGDITVMRARDELLLPDLVPFIVNNEKFFDYAI 132 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G +K + N IP +Q + + + + + + + L+ + Sbjct: 133 THSAGGLSPRVKFKDLANFEFFIPSKTDQKKLLSLLEGLDESLQNELILKQKLVSNLEAQ 192 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + G + +K + ++K ++ ++++ES I Sbjct: 193 IEHQIHGEHLDGKTINQVIKSLSSKK-----KIIKLKGLGEIIKGKGIAKSEVVESGIPC 247 Query: 259 LSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME-R 313 + YG + K + + + +S E + +++F + +A Sbjct: 248 VRYGELYTKHHRMIRKFHSYISLKSSEKSVKLRVNDVLFAGSGETISEIGKSAAFTESVD 307 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPI 372 S + +P +D +YL +LM S + +G+G + D++++ V Sbjct: 308 AYAGSDILIFRPKDMDGSYLGYLMNSLLVRHQLNKLGTGATVMHVYGSDIQKIVVPYRDK 367 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I N + + I +L ++ I KE + ++ Sbjct: 368 DEQVQIANCLEEIASNIRLL----DRKIHKTKELLAVLLSKVF 406 >gi|302037933|ref|YP_003798255.1| putative type I restriction-modification system, specificity subunit [Candidatus Nitrospira defluvii] gi|300605997|emb|CBK42330.1| putative Type I restriction-modification system, specificity subunit [Candidatus Nitrospira defluvii] Length = 404 Score = 113 bits (282), Expect = 7e-23, Method: Composition-based stats. Identities = 54/417 (12%), Positives = 124/417 (29%), Gaps = 39/417 (9%) Query: 24 HWKVVPIKRFT-KLNTGR--------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ ++ + G S +YI +++ + Sbjct: 4 GWQTKKLRDVCVTIQDGAHESPQRQFDSPGKGRFLYITSKNIRNNCLDLGNVSYVEEDFH 63 Query: 75 TSTVSIFAK--GQILYGKLGPYLRKAIIADFDGICS----TQFLVLQPKDVLPELLQGWL 128 G +L K G + D S + +P + P L ++ Sbjct: 64 NRIYPRCKPSVGDVLLTKDGANTGNVTLNTLDEPFSLLSSVCLIKTKPDALKPGFLSYYI 123 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S D + I GA + + I P+PIPPL+EQ I + + Sbjct: 124 QSPDGLESITGQMTGAAIKRIILRDIKLAPIPIPPLSEQRRIVGILDEAFDGLARATANA 183 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + + ++ + +VT+ G ++ +T Sbjct: 184 EQNLRNARALFESHLQSVVTQR---------------GEGWVDRKLDSLCREITVGYVGP 228 Query: 249 --TKLIESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 ++ ++ + L N + ++ + + + PG++ Sbjct: 229 MASEYTDTGVTFLRSQNIRPFHVSLENVLSISREFDVKIAKSRLRPGDVAVVRTGYPGTA 288 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFED 361 + ++ + + + ++ +LA S + ++ Sbjct: 289 AVIPAS--LPKANCADLVIVRPGSEVEPQFLAAFFNSSYGKLHVSGKVVGAAQKHFNVGA 346 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 K + +PP+++Q I N A L +Q + L E + S + A +G+ Sbjct: 347 AKETVLHLPPLQDQRRIIVKFNALAAETQRLESIYQQKLAALGELKKSLLHEACSGK 403 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 60/198 (30%), Gaps = 9/198 (4%) Query: 23 KHWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYL------PKDGNSRQSDT 75 + W + ++ G + G+ + S + + Sbjct: 207 EGWVDRKLDSLCREITVGYVGPMASEYTDTGVTFLRSQNIRPFHVSLENVLSISREFDVK 266 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S G + + G A+I C+ +V +V P+ L + S Sbjct: 267 IAKSRLRPGDVAVVRTGYPGTAAVIPASLPKANCADLVIVRPGSEVEPQFLAAFFNSSYG 326 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + GA H + + +PPL +Q I K A L + + + Sbjct: 327 KLHVSGKVVGAAQKHFNVGAAKETVLHLPPLQDQRRIIVKFNALAAETQRLESIYQQKLA 386 Query: 194 LLKEKKQALVSYIVTKGL 211 L E K++L+ + L Sbjct: 387 ALGELKKSLLHEACSGKL 404 >gi|257891262|ref|ZP_05670915.1| predicted protein [Enterococcus faecium 1,231,410] gi|257827622|gb|EEV54248.1| predicted protein [Enterococcus faecium 1,231,410] Length = 421 Score = 113 bits (282), Expect = 7e-23, Method: Composition-based stats. Identities = 57/406 (14%), Positives = 141/406 (34%), Gaps = 27/406 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + + T G ++ + + + + GN + ++ K Sbjct: 22 DWKKLKLSSVTSRVRG--NDGRMSLPTLTISARNGWLDQRERFSGNIAGKEQKNYTLLRK 79 Query: 84 GQILYGKLGPYLRKAIIADF----------DGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G++ Y K L K + S + + L + + ++ Sbjct: 80 GELSYNKGNSKLAKYGVVFMLDNFEEALVPRVYHSFKTTNEASSKYIEYLFETKKPNKEL 139 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + I + + + ++ I + IP + EQ +K+ + +ID IT + + + Sbjct: 140 RKLITSGARMDGLLNINYDDFMGIKITIPKIKEQ----KKLGSLFKQIDGTITLQQQLLT 195 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD-----HWEVKPFFALVTELNRKN 248 K+ K+AL+ + + K++ +G + + +V + E ++ Sbjct: 196 DYKQFKKALLQQLFPQKGESVPKIRFTGFSDDWELKELKEFIGEDVSDGDWIQKEHIHES 255 Query: 249 TKLIESNILSLSYGNIIQKLET-RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + ++ G I K E+ + + + ++PG+I+ + + + Sbjct: 256 GEYRIVQTGNIGIGRYIDKPESAKYLNQESFDELKANEINPGDILISRLADPAGRALILP 315 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLP 366 + + S +L M S + SG + L ++++++ Sbjct: 316 FTSSKMVTAVDVAIIRPNKNFISHFLVTRMNSSETLNDISKQVSGTSHKRLSRKNLEKIE 375 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + VP I+EQ I ++D + EQ + +E + + + Sbjct: 376 LNVPNIEEQEKIG----QLFKKLDEAIAGHEQKLATYQELKKALLQ 417 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 62/161 (38%), Gaps = 11/161 (6%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS-AQVMERGIITSAY 320 + + E + + + + Y ++ GE+ + + + K + E ++ Y Sbjct: 53 NGWLDQRERFSGNIAGKEQKNYTLLRKGELSYNKGNSKLAKYGVVFMLDNFEEALVPRVY 112 Query: 321 MAVKP-HGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQ----SLKFEDVKRLPVLVPPIKE 374 + K + S Y+ +L + K + SG R ++ ++D + + +P IKE Sbjct: 113 HSFKTTNEASSKYIEYLFETKKPNKELRKLITSGARMDGLLNINYDDFMGIKITIPKIKE 172 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q + +ID + +Q + K+ + + + Sbjct: 173 QKK----LGSLFKQIDGTITLQQQLLTDYKQFKKALLQQLF 209 >gi|150006640|ref|YP_001301384.1| type I restriction enzyme EcoAI specificity protein [Bacteroides vulgatus ATCC 8482] gi|212691152|ref|ZP_03299280.1| hypothetical protein BACDOR_00642 [Bacteroides dorei DSM 17855] gi|149935064|gb|ABR41762.1| type I restriction enzyme EcoAI specificity protein [Bacteroides vulgatus ATCC 8482] gi|212666384|gb|EEB26956.1| hypothetical protein BACDOR_00642 [Bacteroides dorei DSM 17855] Length = 449 Score = 113 bits (282), Expect = 7e-23, Method: Composition-based stats. Identities = 60/383 (15%), Positives = 131/383 (34%), Gaps = 16/383 (4%) Query: 21 IPKHWKVVPIKRF-TKLNTGRTSESGK--DIIYIGLEDVES-GTGKYLPKDGNSRQSDTS 76 +P W+ ++ +L G + +S I + + ++ + GT Y +S D Sbjct: 68 LPNGWEWCNLEDIVCELKYGTSEKSLSVGKIAVLRMGNITNVGTIDYSNLVYSSNNEDIK 127 Query: 77 TVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+ K +L+ + + AI +L+ ++ +++ Sbjct: 128 LYSL-EKDDLLFNRTNSSEWVGKTAIYKKEQPAIYAGYLIRIRPILIFSDYLNTVMNSSY 186 Query: 134 TQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + S+ + + + + +PIPPL EQ I ++ IDT+ + Sbjct: 187 YRNWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLIDTIKNSKEDL 246 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +K+ K +++ + L P + IE + + + Sbjct: 247 QTTIKQAKSKILNLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTQLPEGWAIC- 305 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I S++ G + +ET N + + K ++ + + Sbjct: 306 KMKQITSITNGKSQKNVETLNGIYPIYGSGGVIGRANQYLCIAGSTIIGRKGTINNPIFV 365 Query: 312 ERGIIT--SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 E +A+ I YL + S+D K+ S SL + + + + Sbjct: 366 EEHFWNVDTAFGLKANDAILDKYLYYFCLSFDFSKL---DKSTAMPSLTKTSIGNVLIPI 422 Query: 370 PPIKEQFDITNVINVETARIDVL 392 PP KEQ I I++ ++ + Sbjct: 423 PPYKEQERIVAKIDMVLDTMNEI 445 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 74/197 (37%), Gaps = 7/197 (3%) Query: 229 PDHWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESYE---TY 283 P+ WE +V EL ++ L I L GNI L S Sbjct: 69 PNGWEWCNLEDIVCELKYGTSEKSLSVGKIAVLRMGNITNVGTIDYSNLVYSSNNEDIKL 128 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++ +++F + + + I + ++P I S YL +M S Sbjct: 129 YSLEKDDLLFNRTNSSEWVGKTAIYKKEQPAIYAGYLIRIRPILIFSDYLNTVMNSSYYR 188 Query: 344 KVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 Y + + + ++ + + +L + +PP+KEQ I + + ID + E Sbjct: 189 NWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLIDTIKNSKEDLQT 248 Query: 402 LLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 249 TIKQAKSKILNLAIHGK 265 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 62/164 (37%), Gaps = 16/164 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W + +K+ T + G++ + +VE+ G Y P G+ + Sbjct: 297 QLPEGWAICKMKQITSITNGKSQK-----------NVETLNGIY-PIYGSGGVIGRANQY 344 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G + G+ G + + T F + +L + L + LS D Sbjct: 345 LCIAGSTIIGRKGTINNPIFVEEHFWNVDTAFGLKANDAILDKYLYYFCLSFDF----SK 400 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + M IGN+ +PIPP EQ I KI ++ Sbjct: 401 LDKSTAMPSLTKTSIGNVLIPIPPYKEQERIVAKIDMVLDTMNE 444 >gi|83776726|gb|ABC46686.1| Sau1hsdS1 [Staphylococcus aureus] Length = 407 Score = 113 bits (282), Expect = 7e-23, Method: Composition-based stats. Identities = 69/406 (16%), Positives = 156/406 (38%), Gaps = 37/406 (9%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 20 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGQFFSKLDQQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I ++ L + HWE + E N ++ Sbjct: 197 LELLQQQKKGYMQKIFSQEL--------RFKDENSEDYPHWESSKIEKYLKERNERSD-- 246 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + II+ E + Y++V +I + + + + Sbjct: 247 KGQMLSVTINSGIIKFSELDRKDNSSKDKSNYKVVRKNDIAYNSMRMWQGASGKSNY--- 303 Query: 312 ERGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPV 367 GI++ AY + P S+ + +++ + F GL +LK++ +K + + Sbjct: 304 -NGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTSDTWNLKYKQLKNINI 362 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P ++EQ I + ++D+L+ K + I +L++ + SF+ Sbjct: 363 DIPVLEEQEKIGDF----FKKMDILISKQKMKIEILEKEKQSFLQK 404 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 51/181 (28%), Gaps = 6/181 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E + + I L NI + Sbjct: 10 PELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYY 129 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETAR 388 + +L+ K+F A G R+ L F+++ L + P I +EQ I + + Sbjct: 130 NFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGQFFSKLDQQ 189 Query: 389 I 389 I Sbjct: 190 I 190 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 67/184 (36%), Gaps = 9/184 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFA 82 HW+ I+++ K R+ + + + + SG K+ D ++ D S + Sbjct: 228 HWESSKIEKYLKERNERSDKGQM----LSVT-INSGIIKFSELDRKDNSSKDKSNYKVVR 282 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I Y + + + ++++GI S + VL P L G+ I Sbjct: 283 KNDIAYNSMRMWQGASGKSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINS 342 Query: 143 G---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + +K + NI + IP L EQ I + + I + + + Sbjct: 343 QGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKMKIEILEKEKQSFL 402 Query: 200 QALV 203 Q + Sbjct: 403 QKMF 406 >gi|121583287|ref|YP_973723.1| restriction modification system DNA specificity subunit [Polaromonas naphthalenivorans CJ2] gi|120596545|gb|ABM39981.1| restriction modification system DNA specificity domain [Polaromonas naphthalenivorans CJ2] Length = 412 Score = 113 bits (282), Expect = 7e-23, Method: Composition-based stats. Identities = 71/415 (17%), Positives = 150/415 (36%), Gaps = 29/415 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W ++ T LN ++ ++ ++ ++ + G + Sbjct: 2 SGWAQTRLRYVTDLNPPVRADLLAALDTELSFLPMDSI-GENGSLNLARTRPIAEVRNGY 60 Query: 79 SIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S F G + + K+ P + G +T+ VL+PK +++ + Sbjct: 61 SYFEDGDVAFAKVTPCFENGKGALMQGLEKGAGFGTTEITVLRPKTGTNARYLRYIVQSE 120 Query: 133 VTQR--IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + ++ + A+ + + P Q I + +T RID LI E+ R Sbjct: 121 MFRQLGVGAMTGAGGLKRVPDDFTRDFKTVWPEAVAQERIANFLDDKTARIDALIAEKER 180 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE-WVGLVPDHWEVKPFFALVTELNRKNT 249 + LL E + ++ + ++ + SG+ +G D F + + N Sbjct: 181 LLALLNEHRLSVSAQVLAEA--------SSGLRAKLGFCVDLLPGYAFPSDEFSRDAGNI 232 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ---NDKRSLR 306 L+ ++ + I+ ET + +S + G++V + ++ Sbjct: 233 PLLRGINVAPAS---IRWDETVYWSREYDSSLERFRLQQGDVVLGMDRPWISSGARVAMI 289 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL 365 ++ + +L + + S + + +G+ L E + R Sbjct: 290 DEASAGSFLLQRVCRLRGGVRLTQRFLFFALLSDEFRQSVEVDLTGVSVPHLSPEQILRF 349 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 V V + EQ +V + + +I+ L Q + L+E RSS I+AAVTGQ+D Sbjct: 350 KVPVLTVDEQRVRCDVADRQLLKIEQLEAHTLQMLDRLREYRSSLISAAVTGQLD 404 >gi|319939009|ref|ZP_08013373.1| type I restriction-modification system specificity subunit [Streptococcus anginosus 1_2_62CV] gi|319812059|gb|EFW08325.1| type I restriction-modification system specificity subunit [Streptococcus anginosus 1_2_62CV] Length = 392 Score = 113 bits (282), Expect = 7e-23, Method: Composition-based stats. Identities = 59/414 (14%), Positives = 128/414 (30%), Gaps = 44/414 (10%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESG 59 +P++K++ P WK + +G T G +I +I ++ S Sbjct: 14 RFPEFKNT--------PA-WKQRKLGEVAVSFSGGTPSIGNSKYYNGEIPFIRSAEINSA 64 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 S+ + G ILY G + I+ +G + L ++P D Sbjct: 65 ---ITELYLTEEGLKNSSAKMVNVGDILYALYGATSGEVGISKINGAINQAILAIKPYDA 121 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L I+ +G + + + +P P L EQ I + Sbjct: 122 YNSKFIEQWLKNQKKNIIDKYLQG-GQGNLSAAIVKKLLIPFPSLPEQTAIGDF----FS 176 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +D I R +E LK +K++L+ + K K++ + WE + Sbjct: 177 TLDRSIALHQRELENLKNRKKSLLQKMFPKNGESVPKIRFPEFKNAPA----WEQRKLGE 232 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 +V+ + K S+ + ++ + Sbjct: 233 VVSAEKKGKAKADMIGDESVYLDTEYLNGGQIVKVNAVKDTYLDDVIILWDGSQAGTLYY 292 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + +L S +S ++ ++S ++ + + Sbjct: 293 GFEGALGSTLKAYTISESSLFI------------YQQLKS-RQQIIYEKYRTPNIPHVIK 339 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + V +P + EQ I + + +D + ++ + LK R+ + + Sbjct: 340 TFLDEFGVYIPSLPEQTAIGDF----FSTLDRSIALHQRKLEHLKLRKKALLQK 389 >gi|254881457|ref|ZP_05254167.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 4_3_47FAA] gi|254834250|gb|EET14559.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 4_3_47FAA] Length = 450 Score = 112 bits (281), Expect = 7e-23, Method: Composition-based stats. Identities = 59/383 (15%), Positives = 131/383 (34%), Gaps = 16/383 (4%) Query: 21 IPKHWKVVPIKRF-TKLNTGRTSESGK--DIIYIGLEDVES-GTGKYLPKDGNSRQSDTS 76 +P W+ ++ +L G + +S I + + ++ + GT Y +S D Sbjct: 69 LPNGWEWCNLEDIVCELKYGTSEKSLSVGKIAVLRMGNITNVGTIDYSNLVYSSNNEDIK 128 Query: 77 TVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+ K +L+ + + AI +L+ ++ +++ Sbjct: 129 LYSL-EKDDLLFNRTNSSEWVGKTAIYKKEQPAIYAGYLIRIRPILIFSDYLNTVMNSSY 187 Query: 134 TQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + S+ + + + + +PIPPL EQ I ++ I+T+ + Sbjct: 188 YRNWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLINTIKNSKEDL 247 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +K+ K +++ + L P + IE + + + Sbjct: 248 QTTIKQAKSKILNLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTQLPEGWAIC- 306 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I S++ G + +ET N + + K ++ + + Sbjct: 307 KMKQITSITNGKSQKNVETLNGIYPIYGSGGVIGRANQYLCIAGSTIIGRKGTINNPIFV 366 Query: 312 ERGIIT--SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 E +A+ I YL + S+D K+ S SL + + + + Sbjct: 367 EEHFWNVDTAFGLKANDAILDKYLYYFCLSFDFSKL---DKSTAMPSLTKTSIGNVLIPI 423 Query: 370 PPIKEQFDITNVINVETARIDVL 392 PP KEQ I I++ ++ + Sbjct: 424 PPYKEQERIVAKIDMVLDTMNEI 446 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 74/197 (37%), Gaps = 7/197 (3%) Query: 229 PDHWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESYE---TY 283 P+ WE +V EL ++ L I L GNI L S Sbjct: 70 PNGWEWCNLEDIVCELKYGTSEKSLSVGKIAVLRMGNITNVGTIDYSNLVYSSNNEDIKL 129 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++ +++F + + + I + ++P I S YL +M S Sbjct: 130 YSLEKDDLLFNRTNSSEWVGKTAIYKKEQPAIYAGYLIRIRPILIFSDYLNTVMNSSYYR 189 Query: 344 KVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 Y + + + ++ + + +L + +PP+KEQ I + + I+ + E Sbjct: 190 NWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLINTIKNSKEDLQT 249 Query: 402 LLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 250 TIKQAKSKILNLAIHGK 266 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 62/164 (37%), Gaps = 16/164 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W + +K+ T + G++ + +VE+ G Y P G+ + Sbjct: 298 QLPEGWAICKMKQITSITNGKSQK-----------NVETLNGIY-PIYGSGGVIGRANQY 345 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G + G+ G + + T F + +L + L + LS D Sbjct: 346 LCIAGSTIIGRKGTINNPIFVEEHFWNVDTAFGLKANDAILDKYLYYFCLSFDF----SK 401 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + M IGN+ +PIPP EQ I KI ++ Sbjct: 402 LDKSTAMPSLTKTSIGNVLIPIPPYKEQERIVAKIDMVLDTMNE 445 >gi|254303928|ref|ZP_04971286.1| type I site-specific deoxyribonuclease specificity subunit [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148324120|gb|EDK89370.1| type I site-specific deoxyribonuclease specificity subunit [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 429 Score = 112 bits (281), Expect = 7e-23, Method: Composition-based stats. Identities = 61/397 (15%), Positives = 134/397 (33%), Gaps = 31/397 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P + + L G T DI + +ED+ G L + Sbjct: 13 PNGVEYKKLGEIFNLKNGYTPSKANKEYWENTDINWFRIEDINI-NGGILEDSIQKVNTK 71 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSI 131 S+F+ ++ + A+I D IC+ QF L K+ + + Sbjct: 72 GIKGSLFSAKSLIVSTTATIGKHALILK-DFICNQQFTCLTIKEDYEKIYNGKFMYYYFF 130 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + ++ D +P+PPL Q I + T ++ L + Sbjct: 131 KINELTKKNLKVSSFPSVDMDKFKKFLIPLPPLEIQDEIVRVLDNYTKSVEELKEKLNEE 190 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + K++ Y++ I +G + + V + + Sbjct: 191 LIARKKQYSWYRDYLLKFE-------NKIEIVKLGDIVE----------VYDGTHQTPDY 233 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 ++ I +S NI T + + + Y+I + +F K ++ + Sbjct: 234 KKTGIPFISVENIDNIYNTEKYISEEDYEKNYRIKPKIDDIFMTRIGTIGKCAIVTKNNP 293 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLV 369 ++ A + + IDS YL +++ S K + + + D+ +L + + Sbjct: 294 LAYYVSLALLRPNKNKIDSAYLKYIIESGIGKKELNKRILFTAVPIKINKGDIDKLEIPL 353 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 PP++ Q I V++ + L + I +++ Sbjct: 354 PPLEVQKRIVGVLDNFEKICNDLNIGLPAEIEARQKQ 390 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 16/168 (9%), Positives = 41/168 (24%), Gaps = 9/168 (5%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 P+ E K + N + N + + G E Sbjct: 12 CPNGVEYKKLGEIFNLKNGYTPSKANKEYWENTDINWFRIEDININGGILEDSIQKVNTK 71 Query: 288 --------PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++ + + + + ++ + Sbjct: 72 GIKGSLFSAKSLIVSTTATIGKHALILKDFICNQQFTCLTIKEDYEKIYNGKFMYYYFFK 131 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + S S+ + K+ + +PP++ Q +I V++ T Sbjct: 132 INELTKKNLKVSS-FPSVDMDKFKKFLIPLPPLEIQDEIVRVLDNYTK 178 >gi|309750367|gb|ADO80351.1| Probable type I restriction-modification system specificity determinant [Haemophilus influenzae R2866] Length = 408 Score = 112 bits (281), Expect = 7e-23, Method: Composition-based stats. Identities = 60/394 (15%), Positives = 133/394 (33%), Gaps = 29/394 (7%) Query: 26 KVVPIKRFTKLN---TGRTS--ESGKDIIYIGLEDVESGTGKYLPKDGNSRQS--DTSTV 78 + +K N T + +I YI ++++ G + + + S Sbjct: 18 EWKTVKSLCNDNFWLMPATPEFDDNGNIPYITSKNIKGGKIDFQNTKYINEYVYQELSRT 77 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQ 135 + IL +G I+ D Q + L + V + + + + Sbjct: 78 RCIIENDILISMIGTIGEAVIVKKEDLYFYGQNMYVLRLNNELVNHKFFLYYFTAPFILN 137 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + + I ++ +PIPPL+ Q I + + A T L +E I + Sbjct: 138 SLLSKKNSSNQGYLKAGQIESLKIPIPPLSVQTEIVKILDALTTLTSELTSELILRQKQY 197 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + ++ L+S ++ G EW K + T N Sbjct: 198 EYYREKLLSE---------EELGKVGFEW---KTIDEISKKISSGGTPTTSNNGYYDNGT 245 Query: 256 ILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I L + K E + + + ++ K ++ + Sbjct: 246 IPWLRTQEVDFKEIWDTNIKITEDALNNSSAKWIPANCVIVAMYGATVGKTAINKIPLTT 305 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + + Y+ + S + ++GSG + ++ + +K+L V VPPI Sbjct: 306 NQACAN--IEINDKLACYRYIFHYLTSKY--EYIKSLGSGSQTNINAQIIKKLKVPVPPI 361 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +EQ I ++++ + + E + +I ++R Sbjct: 362 EEQHRIVSILDKFETLTNSITEGLPLAIEQSQKR 395 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 54/188 (28%), Gaps = 5/188 (2%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLET-----RNMGLKPESYETYQIVDPGEI 291 + NI ++ NI + + + +I Sbjct: 26 CNDNFWLMPATPEFDDNGNIPYITSKNIKGGKIDFQNTKYINEYVYQELSRTRCIIENDI 85 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + I + ++ + G +L + + L + S Sbjct: 86 LISMIGTIGEAVIVKKEDLYFYGQNMYVLRLNNELVNHKFFLYYFTAPFILNSLLSKKNS 145 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + LK ++ L + +PP+ Q +I +++ T L ++ + R + Sbjct: 146 SNQGYLKAGQIESLKIPIPPLSVQTEIVKILDALTTLTSELTSELILRQKQYEYYREKLL 205 Query: 412 AAAVTGQI 419 + G++ Sbjct: 206 SEEELGKV 213 >gi|251798708|ref|YP_003013439.1| restriction modification system DNA specificity domain protein [Paenibacillus sp. JDR-2] gi|247546334|gb|ACT03353.1| restriction modification system DNA specificity domain protein [Paenibacillus sp. JDR-2] Length = 456 Score = 112 bits (281), Expect = 8e-23, Method: Composition-based stats. Identities = 71/442 (16%), Positives = 139/442 (31%), Gaps = 52/442 (11%) Query: 20 AIPKHWKVVPIKRFTKLN-------------TGRTSESGKDIIYIGLEDVESGTGKYLPK 66 +P +W V + L S+ +Y+ L D+ G G K Sbjct: 9 EVPGNWVWVKLGSLAYLTDFVANGSFQSLRENVEVSDDTDYALYVRLTDLRLGLGHEGQK 68 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPEL 123 + + S G+IL +G + + + D + +VL+ + + Sbjct: 69 YVDETSYKFLSKSSLTGGEILIANIGANVGEVFVMPNVDLLATIAPNMIVLRCNHYVENI 128 Query: 124 LQGWL-LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + S + + I G + G+ I + +PPL EQ I +K+ +I+ Sbjct: 129 FLNYFLSSPQGKKLLGTIITGTGQPKINKTGLKTISVALPPLNEQKRIADKVERLLDKIN 188 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG--------------IEWVGLV 228 + ++ A++ L + + S E L+ Sbjct: 189 QAKQLIEEAKATFELRQAAILDKAFRGELTKKWRGEHSNQISTVRSISEDINPNEIPFLL 248 Query: 229 PDHWEVKPFFALVTELNRKNTK-------LIESNILSLSYG---NIIQKLETRNMGLKPE 278 P W L T K+ L + G N +E+ N L Sbjct: 249 PAGWNWVRLKDLGTLERGKSKHRPRNDPKLFGGEYPFIQTGDVANAGDYIESYNQTLSEF 308 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 ++ G + D L+ ++ K I S YL + MR Sbjct: 309 GLLQSKLFPEGTVCITIAANIADTALLKFPCCFPDSVVG---FIPKDAYISSLYLHYYMR 365 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI---TNVINVETARIDVLVEK 395 + YA ++++ + ++ + V VPP E +I N++ + ++ Sbjct: 366 TIKSNLEHYAPA-TAQKNINLKVLQEILVPVPPKTEHDEILHMINLLMQKDEEAQTIMNV 424 Query: 396 IEQSIVLLKERRSSFIAAAVTG 417 L+ + S ++ A G Sbjct: 425 ASD----LEILKQSVLSKAFQG 442 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 62/149 (41%), Gaps = 1/149 (0%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + + + + GEI+ I + + + I + + H +++ Sbjct: 68 KYVDETSYKFLSKSSLTGGEILIANIGANVGEVFVMPNVDLLATIAPNMIVLRCNHYVEN 127 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L + + S K+ + +G + + +K + V +PP+ EQ I + + +I Sbjct: 128 IFLNYFLSSPQGKKLLGTIITGTGQPKINKTGLKTISVALPPLNEQKRIADKVERLLDKI 187 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + IE++ + R+++ + A G+ Sbjct: 188 NQAKQLIEEAKATFELRQAAILDKAFRGE 216 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 61/199 (30%), Gaps = 10/199 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P W V +K L G++ G + +I DV + + + Sbjct: 248 LPAGWNWVRLKDLGTLERGKSKHRPRNDPKLFGGEYPFIQTGDVANAGDYIESYNQTLSE 307 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 +F +G + + + + F + PKD L Sbjct: 308 FGLLQSKLFPEGTVCIT-IAANIADTALLKFPCCFPDSVVGFIPKDAYISSLYLHYYMRT 366 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +E + + K + I +P+PP E I I + + T Sbjct: 367 IKSNLEHYAPATAQKNINLKVLQEILVPVPPKTEHDEILHMINLLMQKDEEAQTIMNVAS 426 Query: 193 ELLKEKKQALVSYIVTKGL 211 +L KQ+++S L Sbjct: 427 DLEI-LKQSVLSKAFQGNL 444 >gi|78188779|ref|YP_379117.1| specificity determinant HsdS-like [Chlorobium chlorochromatii CaD3] gi|78170978|gb|ABB28074.1| specificity determinant HsdS-like protein [Chlorobium chlorochromatii CaD3] Length = 363 Score = 112 bits (281), Expect = 8e-23, Method: Composition-based stats. Identities = 57/401 (14%), Positives = 117/401 (29%), Gaps = 51/401 (12%) Query: 26 KVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 ++ + + + G+ E ++ YI + K++ +G S + + K Sbjct: 5 ELTTLGKSCEFFNGKAHEKSIDENGQYIVV------NSKFISSEGKSFKRTNEQMFPLYK 58 Query: 84 GQILYGKLGPYLRK------AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G I+ K I D + + ++ + L L + Sbjct: 59 GDIVMVMSDVPNGKALAKCFIIDKDDTYSLNQRICCIRSNKFDTKYLYYQLNR---HEHF 115 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 A ++ I P+ P + EQ I + ID + ++ KE Sbjct: 116 LAFNNSENQTNLRKDDILACPLIKPSMEEQQRIVSILDEAFAAIDQAKANAEQNLKNAKE 175 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + + + K +G V KP + N K + I Sbjct: 176 LFDGYLQSVFENQGDDWEEKK------LGEVIKLEYGKPLDETKRKSNGKYPMYGANGIK 229 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + + Y IV R + ++ Sbjct: 230 GRT--------------------DEYYHDKKSIIVGRKGSAGEINLTENKFWPLDV---- 265 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 Y I + + S + G++ + +V + L P ++EQ Sbjct: 266 -TYFVTFDEKIYDLMFLYFLLS---RFDLPKLAKGVKPGINRNEVYEIQALFPSLEEQQT 321 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I ++ A+ L E ++ I L+E + S + A G+ Sbjct: 322 IVRQLDTLRAKTQKLEEIYQRKIADLEELKKSMLQKAFAGE 362 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 56/188 (29%), Gaps = 15/188 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + KL G+ + K GKY N + T K Sbjct: 191 DWEEKKLGEVIKLEYGKPLDETK----------RKSNGKYPMYGANGIKGRTDEYYH-DK 239 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I+ G+ G + + + V + + + +L +++ Sbjct: 240 KSIIVGRKGSAGEINLTENKFWPLDVTYFVTFDEKIYDLMFLYFL----LSRFDLPKLAK 295 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I P L EQ I ++ + L R I L+E K++++ Sbjct: 296 GVKPGINRNEVYEIQALFPSLEEQQTIVRQLDTLRAKTQKLEEIYQRKIADLEELKKSML 355 Query: 204 SYIVTKGL 211 L Sbjct: 356 QKAFAGEL 363 >gi|304438090|ref|ZP_07398033.1| type I restriction-modification system specificity determinant [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304368863|gb|EFM22545.1| type I restriction-modification system specificity determinant [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 411 Score = 112 bits (281), Expect = 8e-23, Method: Composition-based stats. Identities = 51/400 (12%), Positives = 128/400 (32%), Gaps = 32/400 (8%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDT- 75 P + + + G + + I + ++ + G + + Sbjct: 13 PDGVEYKKLGEIATNVFRGAGIKRDELTAMGIPCVRYGEIYTTYGIWFDSCVSHTDETFL 72 Query: 76 STVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + F G IL+ G + D + +V+ + P+ L L + Sbjct: 73 TNPKYFGHGDILFAITGESVEEIAKSTAYIGHDKCVAGGDIVVLQHEQNPKYLSYVLSTE 132 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ + + H+ I I +P+PPL Q I + + T L E Sbjct: 133 MAQRQKSKGRVKSKVVHSSVPAIKEIVIPVPPLPIQNEIVKMLDNFTELTAELTAELTLR 192 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + + +L++ D +EW + +++ +++ Sbjct: 193 KKQYSFYRDSLLN----------FSRDDVDVEW-------KTLGDVCDILSGYPFDSSQF 235 Query: 252 IESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + I + N+ + + E N K + +I+ + Sbjct: 236 VNNGIRLMRGMNVKRGVLDFQEGNNRYWKNTDGLDKYKLKADDIIIAMDGSLVGQSYGLV 295 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLP 366 + ++ V+ +S Y+ + S L + +G + +D+++ Sbjct: 296 KKEHLPLLLVQRVARVRSKESNSHYVYHYISSGKLTEYVNAKRTAGAVPHISLKDIEKFE 355 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + P I++Q I +++ A + L + + I K++ Sbjct: 356 IPFPDIEKQNKIAEILDRFDALCNDLTQGLPAEIAARKKQ 395 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 66/194 (34%), Gaps = 9/194 (4%) Query: 228 VPDHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 PD E K + T + R K +L I + YG I + ET+ Sbjct: 12 CPDGVEYKKLGEIATNVFRGAGIKRDELTAMGIPCVRYGEIYTTYGIWFDSCVSHTDETF 71 Query: 284 ----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + G+I+F ++ + +A + + + V H + YL++++ + Sbjct: 72 LTNPKYFGHGDILFAITGESVEEIAKSTAYIGHDKCVAGGDIVVLQHEQNPKYLSYVLST 131 Query: 340 YDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + +K + + VPP+ Q +I +++ T L ++ Sbjct: 132 EMAQRQKSKGRVKSKVVHSSVPAIKEIVIPVPPLPIQNEIVKMLDNFTELTAELTAELTL 191 Query: 399 SIVLLKERRSSFIA 412 R S + Sbjct: 192 RKKQYSFYRDSLLN 205 >gi|257883802|ref|ZP_05663455.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,501] gi|257819640|gb|EEV46788.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,501] Length = 413 Score = 112 bits (281), Expect = 8e-23, Method: Composition-based stats. Identities = 56/402 (13%), Positives = 125/402 (31%), Gaps = 22/402 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ ++ T+ G ++ D+ + + + GN + ++ Sbjct: 20 EEWEERKLRDITERVRG--NDGRMDLPTLTISASSGWLDQRDRFSGNIAGKEQKNYTLLK 77 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-------PELLQGWLLSIDVTQ 135 KGQ+ Y L K + + + + Sbjct: 78 KGQLSYNHGNSKLAKYGAVFELTTYDEALVPRVYHSFDTNELASSNFIEYMFATKRPDRE 137 Query: 136 RIEAICEGATMS---HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + GA M + ++ I + +P + EQ I ++D I R + Sbjct: 138 LAKLVSSGARMDGLLNINFDEFMGINVSVPSVGEQQKIGTF----FKQLDDTIALHQRKL 193 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 +LLKE K+ + + K +++ G +E+ + Sbjct: 194 DLLKETKKGFLQKMFPKNGAKVPEIRFPGFTEDWEERKVFEISKVTYGGGTPKTNTKEFW 253 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 NI + ++ K + E + I I + + A + Sbjct: 254 NGNIPWIQSSDLEINRLFNISPKKKITSEAVKKSAAKIIPPNSIAIVTRVGVGKLALMPF 313 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP- 371 + ++++ IDS + + + S L K + + + D+ V +P Sbjct: 314 EYATSQDFLSLSELQIDSYFGIFSLYSM-LQKELKNIQGTSIKGMTKSDLLEKKVTIPKK 372 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I N ++D + + + LKE + S + Sbjct: 373 YEEQQKIGNF----FKQLDDTIALHQHELDSLKEMKKSLLQQ 410 >gi|291615455|ref|YP_003522563.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] gi|291582517|gb|ADE16973.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] Length = 416 Score = 112 bits (281), Expect = 8e-23, Method: Composition-based stats. Identities = 65/409 (15%), Positives = 129/409 (31%), Gaps = 24/409 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ WK+ + ++ TG T + K ++ ++ D+++ ++ SR Sbjct: 5 LPQGWKLAKLGEVGEVITGSTPSTSKPEYYGSEVPFVTPVDLDNDDPVTKAQNYLSRSG- 63 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S + ++ +G L K IA + Q L G+ + Sbjct: 64 ASQARLLPPDAVMVCCIGS-LGKVGIAGIQLATNQQINSLIFDKSKILPRYGYHFCKTLK 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +E + T++ + I +P PPL EQ + + I + + + Sbjct: 123 PILEHMAPSTTVAIVNKSRFSEITIPFPPLPEQRRLAAILDKADA-----IRRKRQRAIV 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL---VPDHWEVKPFFALVTELNRKNTKL 251 L E L S + +P K G + P F ++ + Sbjct: 178 LTEDF--LRSAFLEMFGDPVTNPKGWGAGTIDEVVSNPKEDIRCGPFGTQLKVRELVPEG 235 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I + + + + + K + PG+++ + + Sbjct: 236 IPLLGIENVHNDHFVSNTEKFLTEKKAEELSRFDACPGDVLITRMGSIGRACVVPKGIGK 295 Query: 312 ERGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 R + P +L A + RS + G L +K + L+ Sbjct: 296 ARISYHLFRIRTNPDKCLPEFLAATICRSGTFQHQLRRLAHGAIMDGLSTSILKEIVFLL 355 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PP++ Q + +NV L+ KI S SS A G+ Sbjct: 356 PPVEMQ---LHYLNVVRKVERNLI-KINHSAENANILFSSLTHRAFRGE 400 >gi|167628750|ref|YP_001679249.1| type i restriction modification DNA specificity domain [Heliobacterium modesticaldum Ice1] gi|167591490|gb|ABZ83238.1| type i restriction modification DNA specificity domain [Heliobacterium modesticaldum Ice1] Length = 538 Score = 112 bits (281), Expect = 8e-23, Method: Composition-based stats. Identities = 62/434 (14%), Positives = 129/434 (29%), Gaps = 58/434 (13%) Query: 20 AIPKHWKVVPIKRFTKL----NTGRTSESGKDIIYIGLEDVES-GTGKYLPKDGNSRQSD 74 +P+ W + ++ N + + Y+ ++ +++ K + Sbjct: 66 DLPEGWVWCRLGELIQIAENNNIHKNLPENTLVNYVDIDAIDNKKYCIKDVKQIPVKSLS 125 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + + KG I+Y + PYL + + I ST F+V +P + +LLS Sbjct: 126 SRARRVLQKGFIVYSLVRPYLNNIAVVEDEKENYIGSTGFVVFKPIKIEINYFISFLLSP 185 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 V ++ G + + P+PPLAEQ I K+ D L Sbjct: 186 FVKTYYLSLLSGFNSPSVSQEDFLSTLFPLPPLAEQQRIVTKVNELMALCDELEAAEQEL 245 Query: 192 IELLKEK----KQALVSYIVTKGLNPDVK------------------------------- 216 L ++++ V L P Sbjct: 246 DALESRFEEYLPKSILQAAVQGKLVPQDIHDEPASVLLERIRAEKARLVKEGKIKKEKPL 305 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRK------NTKLIESNILSLSYGNIIQKLET 270 S E +P+ W ++ + N NI S ++ + Sbjct: 306 PPISEDEIPYDLPEGWVWCRLGDIIIQNIGGGTPSKQNLAYWNGNIPWASVKDLTGPILD 365 Query: 271 RNMGLKPE--SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + E E+ + P + + K ++ + V + A + + + Sbjct: 366 KTRDCITELGLEESSSNLIPANSIIVCTRMGLGKIAINTIPVAINQDL-RALIISRMNID 424 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +A+ + + + E++ + +PP+ EQ I N A Sbjct: 425 LRYIIAYY------KTLSIRGEGTTVKGISIEELHNMLFPLPPLAEQQRIVAKANELMAL 478 Query: 389 IDVLVEKIEQSIVL 402 + + + I Sbjct: 479 CEEIKAVKTKPIEQ 492 Score = 92.2 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 79/212 (37%), Gaps = 15/212 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI-------IQKLETRN 272 S E +P+ W L+ N ++Y +I + + Sbjct: 59 SEDETPYDLPEGWVWCRLGELIQIAENNNIHKNLPENTLVNYVDIDAIDNKKYCIKDVKQ 118 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + +K S +++ G IV+ + + ++ E I ++ ++ KP I+ Y Sbjct: 119 IPVKSLSSRARRVLQKGFIVYSLVRPYLNNIAVVE-DEKENYIGSTGFVVFKPIKIEINY 177 Query: 333 LAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + S + + ++ SG S+ ED +PP+ EQ I +N A D Sbjct: 178 FISFLLSPFVKTYYLSLLSGFNSPSVSQEDFLSTLFPLPPLAEQQRIVTKVNELMALCDE 237 Query: 392 LVEKIEQSIVLLKERR-----SSFIAAAVTGQ 418 L EQ + L+ R S + AAV G+ Sbjct: 238 LEA-AEQELDALESRFEEYLPKSILQAAVQGK 268 >gi|3057063|gb|AAC38347.1| HsdS [Lactococcus lactis] Length = 456 Score = 112 bits (281), Expect = 8e-23, Method: Composition-based stats. Identities = 59/404 (14%), Positives = 124/404 (30%), Gaps = 31/404 (7%) Query: 24 HWKVVPIKRFTKLN---TGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ + + G++ + + + +V++G + Sbjct: 18 DWEERKLLDNVEKVLDYRGKSPAKFGMEWGTEGYLVLSALNVKNGYIDKSVEAKYGDHEL 77 Query: 75 TS---TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLL 129 + KG +++ P A + D +G Q V L Sbjct: 78 FDRWMGNNRLEKGDVVFTTEAPLGNVAQVPDNNGYILNQRAVAFKSLQETDDNFFAQLLR 137 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S V ++A G T K + +P E+ KI ++D I Sbjct: 138 SPIVQNTLKASSSGGTAKGIGMKEFAKLNARVPETHEEQ---RKIGLFFKQLDDTIVLHQ 194 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 R ++LLKE+K+ + + K + +++ E+ + + L +++ Sbjct: 195 RKLDLLKEQKKGYLQKMFPKNGSKIPELR--FAEFADDWEERKLGEVATFLNGRAYKQDE 252 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 L L GN VD G++V+ + Sbjct: 253 LLDSGKYKVLRVGNFYTNDSWY---YSNMELGDKYYVDKGDLVYTWSATFGPHI-----W 304 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 E+ I V+ + D ++ + + D++ V + Sbjct: 305 SGEKVIYHYHIWKVELSKFLDRNFTLQLLEADKARLLSSTNGSTMIHVTKGDMESKIVSI 364 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I EQ I + ++D + ++ + LLKE++ F+ Sbjct: 365 PNIDEQKQIGSF----FKQLDNTITLHQRKLDLLKEQKKGFLQK 404 >gi|150020303|ref|YP_001305657.1| restriction modification system DNA specificity subunit [Thermosipho melanesiensis BI429] gi|149792824|gb|ABR30272.1| restriction modification system DNA specificity domain [Thermosipho melanesiensis BI429] Length = 402 Score = 112 bits (281), Expect = 9e-23, Method: Composition-based stats. Identities = 60/427 (14%), Positives = 125/427 (29%), Gaps = 49/427 (11%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT-----GRTSESGKDIIYIGLEDV-ESGTG 61 P YK + IG IP+ WK+ ++ ++ E + I ++G+ D+ E+G Sbjct: 7 PGYKKTE---IGIIPEDWKIGELEEIAEVIDPHPSHRAPPEVSRGIPFVGIGDLDENGNI 63 Query: 62 KYLPKDGN--SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + I G++ + + + S +++ + Sbjct: 64 INDNVRIVHPKILEEHKKRYNLYDNLIGLGRVASIGKVVKLKEGKYAVSPTMGIIKSNYI 123 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAET 178 L L S V ++ I G+T S + +P PP + EQ I + Sbjct: 124 EWRYLYYILQSKYVIEQFNKIMTGSTRSSVGMIVLRKSKIPYPPTIEEQRAIARVLSDVD 183 Query: 179 VRIDTLITERIRFIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 I++L + + K Q L++ G + K G P+ Sbjct: 184 KLIESLDKLIEKKKLIKKGAMQELLTGKKRLPGFKGEWVRKKLGEVAEIYQPETISQSQL 243 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + Y Y I+ Sbjct: 244 SNV--------------------------GYNVYGANGIIGKYHKYNHEFWQNIITCRGS 277 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 + ++ IT M + S ++ + + + Sbjct: 278 TCGMV-----NRTTDKCWITGNAMVINVDKNKSIDKLFMFYLLKFQDFTKLITGSGQPQI 332 Query: 358 KFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + + P I+EQ I +++ A I+ L E+ + + + +T Sbjct: 333 IRKPLVEFIIHYPSDIEEQRAIAQILSDMDAEIEAL----EKKKAKYEMIKKGMMQLLLT 388 Query: 417 GQIDLRG 423 G++ L+ Sbjct: 389 GKVRLKD 395 >gi|78357539|ref|YP_388988.1| subunit S of type I restriction-modification system [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219944|gb|ABB39293.1| subunit S of type I restriction-modification system [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 565 Score = 112 bits (281), Expect = 9e-23, Method: Composition-based stats. Identities = 72/483 (14%), Positives = 138/483 (28%), Gaps = 93/483 (19%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE---------SGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W + + G + + YI +DV G K+G Sbjct: 87 LPQSWTWTRLGTIGNIFNGNSINAREKETKYAGANGLTYIATKDVGYGLDALDYKNGIYI 146 Query: 72 QSDTSTVSIFAKGQILYG-KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 I +G +L + G +K I + D + + +P +L Sbjct: 147 PESEDKFKIAHQGAVLICAEGGSAGKKCGITEQDICFGNKLFANELFGGIPSKFILYLYL 206 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR---------- 180 V + + +P+P+PPL EQ I +KI R Sbjct: 207 SPVFRESFNAAMTGIIGGVSIAKFLELPVPLPPLKEQHRIVDKIDQLMARCDELENLRTE 266 Query: 181 ---------------------------IDTLITERIRFIELLKEKKQALVSYIVTKGLN- 212 I+ E E + E ++ ++ V L+ Sbjct: 267 REEKRLAVHAAAIKQLLDAPDGSAWDFIEQHFGELYTVKENVTELRKGILQLAVMGRLSE 326 Query: 213 -------------------------PDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL--- 244 + +S +P+ W+ ++ Sbjct: 327 QKTNDESVSTLLTNVHAERQRLKIRKTTDLINSPRPLGYEIPEQWKWVCLDDVLIYGPTN 386 Query: 245 -NRKNTKLIESNILSLSYGNIIQ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 E+NI SL+ K E S ++ + G+I+ + + Sbjct: 387 GFSPRAVDYETNIRSLTLSATTSGTFKGEYSKFIDADISNDSDLWLRDGDILVQRGNTIE 446 Query: 301 DKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQ 355 + + M +D+ Y+ + M S + A SG Sbjct: 447 YVGVSAVYRGNPGVYVYPDLMMKLRVSSHMDTDYVYYAMSSVPAREYLRAHASGTSGTMP 506 Query: 356 SLKFEDVKRLPVLVPPIKEQFDIT---NVINVETARIDVLVEKIE-QSIVLLKERRSSFI 411 + + +K LP+ VPP++EQ I + +D ++ + LL + + Sbjct: 507 KINQKTLKSLPIPVPPLEEQHRIVVKIKRLMDLCEILDQQIDDATGKQTELLN----AVM 562 Query: 412 AAA 414 A A Sbjct: 563 AQA 565 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 63/200 (31%), Gaps = 13/200 (6%) Query: 20 AIPKHWKVVPIKRFTKL--NTGRTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP+ WK V + G + + +I + L SGT K Sbjct: 366 EIPEQWKWVCLDDVLIYGPTNGFSPRAVDYETNIRSLTLSATTSGTFKGEYSKFIDADIS 425 Query: 75 TSTVSIFAKGQILYGKLGPYLRK----AIIADFDGICSTQFLV--LQPKDVLPELLQGWL 128 + G IL + + ++ + + + + Sbjct: 426 NDSDLWLRDGDILVQRGNTIEYVGVSAVYRGNPGVYVYPDLMMKLRVSSHMDTDYVYYAM 485 Query: 129 LSIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S+ + + A G TM + K + ++P+P+PPL EQ I KI + L Sbjct: 486 SSVPAREYLRAHASGTSGTMPKINQKTLKSLPIPVPPLEEQHRIVVKIKRLMDLCEILDQ 545 Query: 187 ERIRFIELLKEKKQALVSYI 206 + E A+++ Sbjct: 546 QIDDATGKQTELLNAVMAQA 565 >gi|208434760|ref|YP_002266426.1| HP0790-like protein [Helicobacter pylori G27] gi|208432689|gb|ACI27560.1| HP0790-like protein [Helicobacter pylori G27] Length = 429 Score = 112 bits (280), Expect = 9e-23, Method: Composition-based stats. Identities = 52/407 (12%), Positives = 126/407 (30%), Gaps = 19/407 (4%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + QF L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKNNINVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + K++ Q + ++ N + E + +K + + KL Sbjct: 192 LNTRKKQYQYYQNMLLD--FNDINQNHKDAKEKLACKTYPKRLKTLLQTLAPKGVEFRKL 249 Query: 252 IESNILSLSYGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 E + +++ + + ++ I + + Sbjct: 250 GEVCEIIRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVNWQ 309 Query: 311 MERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 ++ ++ P + YL +++ + + S + S+ ++ ++ + + Sbjct: 310 NQKFWANDVCFSLIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITIPI 369 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 PP++ Q +I +++ + L+ I I K+ R + Sbjct: 370 PPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 416 >gi|219850149|ref|YP_002464582.1| restriction modification system DNA specificity subunit [Chloroflexus aggregans DSM 9485] gi|219544408|gb|ACL26146.1| restriction modification system DNA specificity subunit [Chloroflexus aggregans DSM 9485] Length = 430 Score = 112 bits (280), Expect = 9e-23, Method: Composition-based stats. Identities = 67/431 (15%), Positives = 142/431 (32%), Gaps = 39/431 (9%) Query: 24 HWKVVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ + ++N + I Y+ + V G+ P+ ++ + + Sbjct: 4 EWRKARLGEVVRINPDALGSDWPFSYIKYVDISSVGEGSIVEPPRILRLDEAPSRAKRLV 63 Query: 82 AKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSID--VTQR 136 +G + + P R + ST F VL+P E + D +T+ Sbjct: 64 REGDTVLSTVRPGRRSMFFVKEPEPEWVVSTGFAVLRPCREYIEPRYLYACVFDRGLTEF 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + +GA + I + + +PPL EQ I + +D I R E L+ Sbjct: 124 LIKREKGAAYPAVLPEDIADAIIKLPPLPEQRAIAHIL----GTLDDKIELNRRMSETLE 179 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIE---------------WVGLVPDHWEVKPFFALV 241 + QAL +P G E +G +P+ W+V LV Sbjct: 180 QMAQALFKAWFVD-FDPVRAKCRGGFETRPYTDLFPDRLMDSELGKIPEGWDVVTLPKLV 238 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + + E L N+ + + + ++ G+ + I + Sbjct: 239 EINPGRPLRKGEIA-PYLDMANMPTRGHAPDQVAHRPFTSGTRFIN-GDTLVARITPCLE 296 Query: 302 KRSLRSAQVMERGII---TSAYMAVKPHGIDSTYLAWLM-RSYDLCK--VFYAMGSGLRQ 355 +E G + ++ Y+ + P + + RS + + G+ RQ Sbjct: 297 NGKTAFVDFLEEGQVGWGSTEYIVLHPKPPLPEEFGYCLARSDAFREFAIQSMTGTSGRQ 356 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + + + PP ++ AR V + L R + + + Sbjct: 357 RVQADSIGHFKLPRPPDSVAVAFGRLVKPLFARSSDAV----RESRTLAALRDALLTKLI 412 Query: 416 TGQIDLRGESQ 426 +G++ ++ + Sbjct: 413 SGELRVKDAEK 423 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 37/140 (26%), Positives = 56/140 (40%), Gaps = 15/140 (10%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS +G IP+ W VV + + ++N GR G+ Y+ + ++ T + P R Sbjct: 219 DSE---LGKIPEGWDVVTLPKLVEINPGRPLRKGEIAPYLDMANM--PTRGHAPDQVAHR 273 Query: 72 QSDTSTVSIFAKGQILYGKLGPYL--RKAIIADF-----DGICSTQFLVLQPKDVLPELL 124 + T I G L ++ P L K DF G ST+++VL PK LPE Sbjct: 274 PFTSGTRFI--NGDTLVARITPCLENGKTAFVDFLEEGQVGWGSTEYIVLHPKPPLPEEF 331 Query: 125 -QGWLLSIDVTQRIEAICEG 143 S + G Sbjct: 332 GYCLARSDAFREFAIQSMTG 351 >gi|219851546|ref|YP_002465978.1| restriction modification system DNA specificity domain protein [Methanosphaerula palustris E1-9c] gi|219545805|gb|ACL16255.1| restriction modification system DNA specificity domain protein [Methanosphaerula palustris E1-9c] Length = 471 Score = 112 bits (280), Expect = 9e-23, Method: Composition-based stats. Identities = 70/453 (15%), Positives = 155/453 (34%), Gaps = 54/453 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTS----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P+ WK+V I ++N + + + ++ + V++ G + Sbjct: 18 EVPEGWKLVTILNACEVNPPKPPRDFLPADAPVTFVPMPAVDADMGAITNPEIKPYLEVR 77 Query: 76 STVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVL-PELLQGWL 128 + + F G ++ K+ P + + + G ST+F V++ + + PE L ++ Sbjct: 78 NGFTSFRDGDVIMAKITPCMENGKAAIVRGMKNGIGFGSTEFHVMRSRGEILPEYLFYYI 137 Query: 129 LSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 E+ G+ I +P+PPLAEQ I +I A +D Sbjct: 138 RQKSFRNEAESHFTGSVGQKRVPTDFIKQSVIPLPPLAEQRRIVARIEALLSHVDAAGDR 197 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 R ++K +QA+++ + L + + E L+ + + ++ Sbjct: 198 LSRVPLIMKRFRQAVLAAACSGRLTEEWREDKDNFEDPKLLLQDIQNYRLQHGINKIKID 257 Query: 248 NTKLIESNILSLSYGNIIQKLE-------------------------------------- 269 + I N + + I +E Sbjct: 258 SKVNITENPIEIPNTWIWSTIEKIADISGGIQKQPMRAPQRNFYPYLRVANVLRGSLDLH 317 Query: 270 -TRNMGLKPESYETYQIVDPGEIVF-RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 +NM L E Y + ++ RS +E + + + V+ Sbjct: 318 EIKNMELFAGELERYHLELNDILIVEGNGSFSEIGRSAIWNGEIENCVHQNHIIRVRVRK 377 Query: 328 IDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 Y+ S ++ A+ + +L + + +LP+ +PPI EQ +I + + Sbjct: 378 FLPQYVNLYWNSPLGSELSSGAAVTTSGLYTLSTKKIAQLPIPLPPISEQHEIVRRVGLL 437 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 R D + ++ + + + + A +G+ Sbjct: 438 FERADAIEREVVAAGRRCERLTQAVMIKAFSGR 470 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 68/204 (33%), Gaps = 12/204 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP W I++ ++ G Y+ + +V G+ + Sbjct: 268 EIPNTWIWSTIEKIADISGGIQKQPMRAPQRNFYPYLRVANVLRGSLDLHEIKNMELFAG 327 Query: 75 TSTVSIFAKGQILY----GKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWL 128 IL G R AI + + + ++ + LP+ + + Sbjct: 328 ELERYHLELNDILIVEGNGSFSEIGRSAIWNGEIENCVHQNHIIRVRVRKFLPQYVNLYW 387 Query: 129 LSIDVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S ++ A + + K I +P+P+PP++EQ I ++ R D + E Sbjct: 388 NSPLGSELSSGAAVTTSGLYTLSTKKIAQLPIPLPPISEQHEIVRRVGLLFERADAIERE 447 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 + + QA++ + L Sbjct: 448 VVAAGRRCERLTQAVMIKAFSGRL 471 >gi|257452211|ref|ZP_05617510.1| restriction modification system DNA specificity domain protein [Fusobacterium sp. 3_1_5R] gi|317058754|ref|ZP_07923239.1| conserved hypothetical protein [Fusobacterium sp. 3_1_5R] gi|313684430|gb|EFS21265.1| conserved hypothetical protein [Fusobacterium sp. 3_1_5R] Length = 503 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 57/459 (12%), Positives = 128/459 (27%), Gaps = 64/459 (13%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDVESGTGKYLPKDGNS 70 IP W V + ++ G + + + + ++ + Sbjct: 26 EIPDSWVWVRLGSIVSVHRGLSYSKVDEIIRENNDEGYLVLRGGNLTEDGLNFEDNVYVR 85 Query: 71 RQSDTSTVSIFAKGQILYGKLGP---YLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQ 125 + + + IL G R I+ ++ +P + + + Sbjct: 86 EEIGRRAIELEENDVILVASTGSSKVIGRACIVEHKLEKTTIGAFLMLCRPVTSISKWVH 145 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 I I +G+ + + + I N + PP+ EQ I +K+ + Sbjct: 146 YIFKGNSYRNYISNISKGSNIKNIKGEYITNYAISFPPIEEQQRIVKKLDFLFEKTKKAK 205 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMK------------------------DSG 221 E ++ +K +++ L + K Sbjct: 206 KLLQEVKEEIEMRKISILDKAFRGELTKKWREKNKTGSVLELLQEIQNEKMKKWEEECCE 265 Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ--------------K 267 E G + + I + G I Q Sbjct: 266 AEKNGRKKPKKIKLSKIEEMIVPKEEEPYKIPDTWKWVKLGEISQISMGQSPLGEKVNSL 325 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER------GIITSAYM 321 + +G + E Y I+ + D A + + + Sbjct: 326 IGVGLIGGPSDMGENYPIITRYTSQITKLSSIGDIIVSIRATLGKNIFSDGEYCLGRGVC 385 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 ++ +++ L + + + ++ + ED+ L +PP++EQ +I V Sbjct: 386 GIRSKIVNNILLRFYF-TNSIEYLYKISSGTTFAQVSKEDISNLYFSLPPLEEQQEIVRV 444 Query: 382 INVETARIDVLVEKI--EQSIVLLKERRSSFIAAAVTGQ 418 + + + E I E+ I LL+ S + A G+ Sbjct: 445 LEEVLEKEKKVKELIDLEEKIDLLE---KSILDKAFRGK 480 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 29/212 (13%), Positives = 74/212 (34%), Gaps = 13/212 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII---------QKLET 270 S E +PD W ++V+ + ++ I + + L Sbjct: 19 SKEEQPYEIPDSWVWVRLGSIVSVHRGLSYSKVDEIIRENNDEGYLVLRGGNLTEDGLNF 78 Query: 271 RNMGLKPESYETYQI-VDPGEIVF--RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + E I ++ +++ + R+ +E+ I + M +P Sbjct: 79 EDNVYVREEIGRRAIELEENDVILVASTGSSKVIGRACIVEHKLEKTTIGAFLMLCRPVT 138 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 S ++ ++ + + G +++K E + + PPI+EQ I ++ Sbjct: 139 SISKWVHYIFKGNSYRNYISNISKGSNIKNIKGEYITNYAISFPPIEEQQRIVKKLDFLF 198 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + +++ ++ R+ S + A G+ Sbjct: 199 EKTKKAKKLLQEVKEEIEMRKISILDKAFRGE 230 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 77/206 (37%), Gaps = 5/206 (2%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP WK V + ++++ G++ K IG+ + G + + Sbjct: 295 KIPDTWKWVKLGEISQISMGQSPLGEKVNSLIGVG-LIGGPSDMGENYPIITRYTSQITK 353 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + + G I+ + L K I +D + ++ K V LL+ + + + + Sbjct: 354 LSSIGDIIVS-IRATLGKNIFSDGEYCLGRGVCGIRSKIVNNILLRFYFTNS--IEYLYK 410 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I G T + + I N+ +PPL EQ I + + + + E I E + + Sbjct: 411 ISSGTTFAQVSKEDISNLYFSLPPLEEQQEIVRVLEEVLEK-EKKVKELIDLEEKIDLLE 469 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWV 225 ++++ L + +E + Sbjct: 470 KSILDKAFRGKLGTQDINDEPALELL 495 >gi|295105614|emb|CBL03158.1| Restriction endonuclease S subunits [Faecalibacterium prausnitzii SL3/3] Length = 402 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 71/410 (17%), Positives = 130/410 (31%), Gaps = 48/410 (11%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVS 79 P + PI K + + + YI L V+ T K N+ + + Sbjct: 13 PDGVEFKPIGDCVHKTQNIKWATADGSYSYIDLTSVDRDTHKITETQTINAGNAPSRAQQ 72 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVL--PELLQGWLLSIDVT 134 I +G +L+ P L++ + D + ICST F VL+ K+ + P L + S + Sbjct: 73 IVLEGDVLFATTRPTLKRYCLIDEEYDGQICSTGFCVLRAKESIVSPRWLFHVVSSSEFY 132 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +EA +GA+ K + MP+PPL Q I + T +L Sbjct: 133 YYVEANQKGASYPAISDKEVKQFKMPVPPLEVQSEIVRILDNFTELTARKKQYEFYRDKL 192 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + G + + + Sbjct: 193 ------------------------LTFGDVRGGATSDVVWRTLAEIADISTGSSNTDDAV 228 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + Q+ ++ Y+ I+ G+ V + + Sbjct: 229 EGGCYPFFVRSQQPLAKDEY----EYDEEAIITAGDGV--------GVGKVFHYINGKYA 276 Query: 315 IITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + AY + G+ YL + + M G S++ + + V +P + Sbjct: 277 LHQRAYRIHPATDGLLGKYLYHYFVATFPKYIGQQMYQGSVPSIRRPMLNKFQVAIPSLD 336 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQI 419 Q I NV++ A L + I ++ R + + A TGQI Sbjct: 337 VQKRIVNVLDNFDAICSDLKIGLPAEIEARQKQYEFYRDALLTYAATGQI 386 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 69/203 (33%), Gaps = 21/203 (10%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG----NIIQKLETRNMGLKPESYETY 283 PD E KP V + + + + + + ET+ + Sbjct: 12 CPDGVEFKPIGDCVHKTQNIKWATADGSYSYIDLTSVDRDTHKITETQTINAGNAPSRAQ 71 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDL 342 QIV G+++F + L + + T + + +L ++ S + Sbjct: 72 QIVLEGDVLFATTRPTLKRYCLIDEEYDGQICSTGFCVLRAKESIVSPRWLFHVVSSSEF 131 Query: 343 CKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 A G ++ ++VK+ + VPP++ Q +I +++ T ++ Sbjct: 132 YYYVEANQKGASYPAISDKEVKQFKMPVPPLEVQSEIVRILDNFTEL-----TARKKQYE 186 Query: 402 LLKERRSSFIAAAVT-GQIDLRG 423 R +T G D+RG Sbjct: 187 F---YRD----KLLTFG--DVRG 200 >gi|32455490|ref|NP_862616.1| hypothetical protein pAH82_p17 [Lactococcus lactis subsp. lactis] gi|7767523|gb|AAF69139.1|AF228680_3 HsdS [Lactococcus lactis] gi|9789464|gb|AAF98316.1|AF243383_17 HsdS [Lactococcus lactis subsp. lactis] Length = 421 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 53/411 (12%), Positives = 136/411 (33%), Gaps = 30/411 (7%) Query: 24 HWKVVPIKRFTKL-NTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ---- 72 W+ + + + DI + D+ L + Sbjct: 17 DWEERKVDECFNFPVSTNSLSRALLNYDEGDIKSVHYGDILIKYPTILNIKNDKIPYITG 76 Query: 73 SDTSTVS--IFAKGQILYG------KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + G +++ +G + + + + + +V + KD E Sbjct: 77 GKLEKYKSSLLENGDLIFADAAEDETVGKAVEVNGLTEENLVAGLHTIVARSKDKKAEFF 136 Query: 125 QGWLLSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 G+ ++ + +++ + +G+ +S + + P E+ +KI + ++D Sbjct: 137 LGYYINSNTYHRQLLRLIQGSKVSSISKGNLQKTLVSFPKDFEEQ---QKIGSFFKQLDD 193 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 I R ++LLKE+K+ + + K +++ +G + T Sbjct: 194 TIALHQRKLDLLKEQKKGYLQKMFPKNGEKVPELRFAGFADDWEERKLGQYTKLITKGTT 253 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 K + + + N + + ++Y ++ +I+F Sbjct: 254 PKDKTGIGDVNFVKVENITNGKIYPINKIKQNEHDNYLKRSRLEEKDILFSIAGTLGRTA 313 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDV 362 + + + A ++ + D+ +L + + + G + +L E V Sbjct: 314 IVNKSILPAN--TNQALAIIRGYDFDTNFLITSLAGNVVKEYIRRNPTVGAQPNLSLEQV 371 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 L V P +EQ I + ++D + ++ + LLKE++ F+ Sbjct: 372 GNLLVNTPNAEEQQKIGSF----FKQLDETIALHQRKLDLLKEQKKGFLQK 418 >gi|31983518|ref|NP_858129.1| putative type i restriction hsds subunit [Lactobacillus delbrueckii subsp. lactis] gi|18077752|emb|CAD15744.1| putative Type I restriction hsdS subunit [Lactobacillus delbrueckii subsp. lactis] Length = 392 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 47/398 (11%), Positives = 114/398 (28%), Gaps = 37/398 (9%) Query: 25 WKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ +K ++ G + + ++ ++ + DV G+ + ++ Sbjct: 20 WEQCKLKNKAEIVRGASPRPISNPKWFDDNSNVGWLRISDVTEQKGRIHHLSQHISKAGQ 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S + + +L I G+ + L P + + Sbjct: 80 SKTRVITEPHLLLSIAATVGSPVINYVNTGVHDGFLIFLNPTF---NKEFMFQWLLMFKP 136 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + + +GN + P +EQ I I ++ + L Sbjct: 137 YWNKYGQPGSQVNLNSDIVGNQSVAFPTTSEQERIANFFSELDTAITLHEEKKQQLKCLK 196 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 Q + + K P ++ + EW E + + + Sbjct: 197 SALLQKMFA---YKSGYPAIRFEGFSDEW--------EQCKLGEVFNYEQPTKYIVKSTE 245 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 ++ ++ +G E +V D + S V Sbjct: 246 YDDNFNTPVLTAGKSFLLGYTDEISGIKNATVENPVVI------FDDFTTDSHYVDFPFK 299 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I S+ M + +S ++ + K + + P +EQ Sbjct: 300 IKSSAMKLLSLNDNSDNFYFMFNTLKNIKYVPQS----HERHWISKFSSFKIYKPSQEEQ 355 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + + ++D + ++ + LLKE++ F+ Sbjct: 356 KKIGSFL----KQLDDTIALHQRKLDLLKEQKKGFLQK 389 >gi|297569971|ref|YP_003691315.1| restriction modification system DNA specificity domain protein [Desulfurivibrio alkaliphilus AHT2] gi|296925886|gb|ADH86696.1| restriction modification system DNA specificity domain protein [Desulfurivibrio alkaliphilus AHT2] Length = 439 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 66/435 (15%), Positives = 141/435 (32%), Gaps = 43/435 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPI-KRFTKLNTGRTSES-GKDIIY-------IGLEDVESGT 60 YK + V G IP+ W++ P+ K +L G + S +DI + V G Sbjct: 22 YKLTEV---GVIPEDWELAPLGKEVEQLEAGVSVNSVDEDIRSYAHYQAILKTSAVIGG- 77 Query: 61 GKYLPKDGNSRQSDTSTVSIFAK--GQILYGKLGPYLRKAIIADFDGICSTQF------- 111 ++LP + + I+ ++ F Sbjct: 78 -RFLPHENKKIAPRDIGRARLNPRFDTIIISRMNTPDLVGECGYVFADFPNLFLPDRLWM 136 Query: 112 -LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQV 168 + V L L S +I+ + G + M + + +P+ PP EQ Sbjct: 137 THIRSGSKVNVRWLNYLLSSRPYKSQIKELATGTSGSMKNIAKDSLLAMPVAYPPPLEQR 196 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGL 227 I + + L + +L + Q L++ + + + K G + Sbjct: 197 AIAAALTDVDALLAKLDQLIAKKRDLKQATMQQLLTGQTRLPSFSGEWETKLLGEIGDFI 256 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 + R N L + + I K E ET + Sbjct: 257 KGKGVSRDQAQSGRLPCVRYGEIYTIHNDLIREFHSWISK----------EVAETATSLK 306 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVME-RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G+++F ++ A + + + ++P ++S +L + + S + + Sbjct: 307 SGDLLFAGSGETKEEIGKCVAFIDDTEAYAGGDIVVLRPRSVNSIFLGYALNSPAVNRQK 366 Query: 347 YAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 ++G G + + + + + +P EQ I V++ A + +E+ + Sbjct: 367 ASLGQGDAVVHISAKALADITIFLPGDAEQTAIAAVLSDMDAE----IAALERRREKTRF 422 Query: 406 RRSSFIAAAVTGQID 420 + + +TG+I Sbjct: 423 IKQGMMQELLTGRIR 437 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 51/144 (35%), Gaps = 17/144 (11%) Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP----HGIDSTYLAWLMRSYDLCK 344 I+ ++ + + + + ++ +L +L+ S Sbjct: 102 DTIIISRMNTPDLVGECGYVFADFPNLFLPDRLWMTHIRSGSKVNVRWLNYLLSSRPYKS 161 Query: 345 VFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVI---NVETARIDVLVEKIEQ 398 + +G +++ + + +PV PP EQ I + + A++D L+ K Sbjct: 162 QIKELATGTSGSMKNIAKDSLLAMPVAYPPPLEQRAIAAALTDVDALLAKLDQLIAKK-- 219 Query: 399 SIVLLKERRSSFIAAAVTGQIDLR 422 ++ + + + +TGQ L Sbjct: 220 -----RDLKQATMQQLLTGQTRLP 238 >gi|153808172|ref|ZP_01960840.1| hypothetical protein BACCAC_02458 [Bacteroides caccae ATCC 43185] gi|149129075|gb|EDM20291.1| hypothetical protein BACCAC_02458 [Bacteroides caccae ATCC 43185] Length = 341 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 50/344 (14%), Positives = 115/344 (33%), Gaps = 28/344 (8%) Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + +L G L + ++ G S +++ V PE +LS Sbjct: 2 KGTEVLANDLLLNITGGSLGRCVVVPADFNCGNVSQHVCIMRSVLVEPEYFHALVLSSYF 61 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ G+ + + P+PPL EQ I +I ID + + Sbjct: 62 AKSMK--ITGSGREGLPKYNLEQMGFPLPPLTEQQRIVAEIKHWFALIDQIEQGKSDLQT 119 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV---------------GLVPDHWEVKPFF 238 ++K+ K ++ + + P + IE + +P +W Sbjct: 120 IIKQTKSKILDLAIHGKVVPQDPNDEPAIELLKRINPDFTPCDNGHSEKLPQNWTWVKGK 179 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFI 296 + + K E + + + +++ + +K E+ + + +++F + Sbjct: 180 NIFAPMKSTKPKNEEFQYIDIDSIDNRRQIISEIKTIKTENAPSRASRYTQKNDVIFSMV 239 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSG-L 353 + + + I ++ + P +S Y +LM S ++ G Sbjct: 240 RPYLRNIAKVAN---DNCIASTGFYVCSPIPQLLNSDYCYYLMISDNVVNGLNQFMKGDN 296 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S+ + +PP+ EQ I I + +D + +E Sbjct: 297 SPSINKGHIDEWLFPLPPLAEQQRIVQKIEELFSALDNIQTALE 340 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 37/168 (22%), Positives = 65/168 (38%), Gaps = 5/168 (2%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 +P++W V K T ++ YI ++ +++ K + + + Sbjct: 168 KLPQNWTWVKGKNIFA-PMKSTKPKNEEFQYIDIDSIDNRRQIISEIKTIKTENAPSRAS 226 Query: 79 SIFAKGQILYGKLGPYLRKAI-IADFDGICSTQFLVLQPKDVLPELLQGWLLSI--DVTQ 135 K +++ + PYLR +A+ + I ST F V P L + L I +V Sbjct: 227 RYTQKNDVIFSMVRPYLRNIAKVANDNCIASTGFYVCSPIPQLLNSDYCYYLMISDNVVN 286 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + +G + I P+PPLAEQ I +KI +D Sbjct: 287 GLNQFMKGDNSPSINKGHIDEWLFPLPPLAEQQRIVQKIEELFSALDN 334 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 54/134 (40%), Gaps = 2/134 (1%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 V +++ R + G ++ ++ ++ Y L+ S K Sbjct: 6 VLANDLLLNITGGSL-GRCVVVPADFNCGNVSQHVCIMRSVLVEPEYFHALVLSSYFAKS 64 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 GSG R+ L +++++ +PP+ EQ I I A ID + + ++K+ Sbjct: 65 MKITGSG-REGLPKYNLEQMGFPLPPLTEQQRIVAEIKHWFALIDQIEQGKSDLQTIIKQ 123 Query: 406 RRSSFIAAAVTGQI 419 +S + A+ G++ Sbjct: 124 TKSKILDLAIHGKV 137 >gi|297580645|ref|ZP_06942571.1| type I restriction-modification system specificity subunit [Vibrio cholerae RC385] gi|297535061|gb|EFH73896.1| type I restriction-modification system specificity subunit [Vibrio cholerae RC385] Length = 405 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 60/397 (15%), Positives = 109/397 (27%), Gaps = 29/397 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ P + ++ +G+ + + G G R ++ Sbjct: 24 WENKPFSKLFEIGSGKDHK-----------HLADGDIPVYGSGGYMRSV---NDYLYEGK 69 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 G+ G + ++ T F K+ +PE + +ID + E Sbjct: 70 SACIGRKGTINKPMFLSGKFWTVDTLFYTHSFKNCIPEFIYLLFQNID----WLKLNEAG 125 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I I + IP EQ I + I + I + K Q L Sbjct: 126 GVPSLSKVIINKIEVVIPKEEEQQKIVDCIYSVDDLITVNTKKLESLKLHKKGLMQKLFP 185 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + W H V + +T + N Sbjct: 186 AEGENKPDFRFPEFSMEANWKKEKL-HNLVDLLSGHAFKSEYFSTTGKKMVTPKNFTKNG 244 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ-----NDKRSLRSAQVMERGIITSA 319 N E + I G+++ DL + L + E + Sbjct: 245 FASFSEDNTKYTSEDFNERYICREGDLLLLLTDLTPSCELLGRPMLLTPSDGEVLLNQRI 304 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDI 378 + I+S +L + S K +G + + V +L+P + EQ I Sbjct: 305 AKVILKGNINSNFLKYFFLSNSFRKRIINTATGSTVRHTSNKIVLSTELLLPNLSEQNKI 364 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + ID L+ I LKE + + Sbjct: 365 AACLLS----IDELIRSQADKIETLKEYKKGLMQQLF 397 >gi|319642846|ref|ZP_07997483.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_40A] gi|317385521|gb|EFV66463.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_40A] Length = 409 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 59/383 (15%), Positives = 131/383 (34%), Gaps = 16/383 (4%) Query: 21 IPKHWKVVPIKRF-TKLNTGRTSESGK--DIIYIGLEDVES-GTGKYLPKDGNSRQSDTS 76 +P W+ ++ +L G + +S I + + ++ + GT Y +S D Sbjct: 28 LPNGWEWCNLEDIVCELKYGTSEKSLSVGKIAVLRMGNITNVGTIDYSNLVYSSNNEDIK 87 Query: 77 TVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+ K +L+ + + AI +L+ ++ +++ Sbjct: 88 LYSL-EKDDLLFNRTNSSEWVGKTAIYKKEQPAIYAGYLIRIRPILIFSDYLNTVMNSSY 146 Query: 134 TQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + S+ + + + + +PIPPL EQ I ++ I+T+ + Sbjct: 147 YRNWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLINTIKNSKEDL 206 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +K+ K +++ + L P + IE + + + Sbjct: 207 QTTIKQAKSKILNLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTQLPEGWAIC- 265 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I S++ G + +ET N + + K ++ + + Sbjct: 266 KMKQITSITNGKSQKNVETLNGIYPIYGSGGVIGRANQYLCIAGSTIIGRKGTINNPIFV 325 Query: 312 ERGIIT--SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 E +A+ I YL + S+D K+ S SL + + + + Sbjct: 326 EEHFWNVDTAFGLKANDAILDKYLYYFCLSFDFSKL---DKSTAMPSLTKTSIGNVLIPI 382 Query: 370 PPIKEQFDITNVINVETARIDVL 392 PP KEQ I I++ ++ + Sbjct: 383 PPYKEQERIVAKIDMVLDTMNEI 405 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 74/197 (37%), Gaps = 7/197 (3%) Query: 229 PDHWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESYE---TY 283 P+ WE +V EL ++ L I L GNI L S Sbjct: 29 PNGWEWCNLEDIVCELKYGTSEKSLSVGKIAVLRMGNITNVGTIDYSNLVYSSNNEDIKL 88 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++ +++F + + + I + ++P I S YL +M S Sbjct: 89 YSLEKDDLLFNRTNSSEWVGKTAIYKKEQPAIYAGYLIRIRPILIFSDYLNTVMNSSYYR 148 Query: 344 KVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 Y + + + ++ + + +L + +PP+KEQ I + + I+ + E Sbjct: 149 NWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLINTIKNSKEDLQT 208 Query: 402 LLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 209 TIKQAKSKILNLAIHGK 225 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 62/164 (37%), Gaps = 16/164 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W + +K+ T + G++ + +VE+ G Y P G+ + Sbjct: 257 QLPEGWAICKMKQITSITNGKSQK-----------NVETLNGIY-PIYGSGGVIGRANQY 304 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G + G+ G + + T F + +L + L + LS D Sbjct: 305 LCIAGSTIIGRKGTINNPIFVEEHFWNVDTAFGLKANDAILDKYLYYFCLSFDF----SK 360 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + M IGN+ +PIPP EQ I KI ++ Sbjct: 361 LDKSTAMPSLTKTSIGNVLIPIPPYKEQERIVAKIDMVLDTMNE 404 >gi|294793953|ref|ZP_06759090.1| putative type I restriction enzyme specificity protein [Veillonella sp. 3_1_44] gi|294455523|gb|EFG23895.1| putative type I restriction enzyme specificity protein [Veillonella sp. 3_1_44] Length = 412 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 56/406 (13%), Positives = 125/406 (30%), Gaps = 25/406 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ ++ + + +GL GT + + +T+ + Sbjct: 14 EDWEQRKLESLFTKYEDKVNTPDSGYWRLGLRSHCKGT--FHTYVDAGHELETTEMYRVK 71 Query: 83 KGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRI 137 G + + R + D D + S +F +P + + Sbjct: 72 AGNFILNITFAWERALAVTDDEDQDKLVSHRFPQFKPNSDLVIDFFKHTLMDKRLKHHLE 131 Query: 138 EAICEGATMSH-ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + GA + + + +P + EQ +I + I + + K Sbjct: 132 LSSPGGAGRNKVLKVSDMLKYELLVPSIQEQNIISSFLNNIDHIITLHQCKLKKLNLAKK 191 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH---WEVKPFFALVTELNRKNTKLIE 253 Q L P V+ K W + +K++ + + Sbjct: 192 SLLQKLFPR--NGSQIPGVRFKGFTDAWEQRKFLDLLDTQNGIRRGPFGSSLKKDSFVKK 249 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 S+ + N I + E Y + PG+ + + + Sbjct: 250 SDYVVYEQQNAIYDNYVTRYFISKEKYNELIRFNIQPGDFIMSGAGTIGRISMVP--DGI 307 Query: 312 ERGIITSAYMAVKPHGIDSTYLAW--LMRSYDLCKVFYAMG-SGLRQSL-KFEDVKRLPV 367 ++G+ A + K L + M+S + K +L +++K+ V Sbjct: 308 KKGVFNQALIRFKVDKNSVNPLYFLKFMQSDMMQKQLTQANPGSAMTNLVPMDELKKWDV 367 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P ++EQ I+N IN +ID + ++ + L+E + + Sbjct: 368 TIPSLEEQNKISNFIN----QIDESITLHQRKLERLQEVKKGLLQK 409 >gi|146305601|ref|YP_001186066.1| restriction modification system DNA specificity subunit [Pseudomonas mendocina ymp] gi|145573802|gb|ABP83334.1| restriction modification system DNA specificity domain protein [Pseudomonas mendocina ymp] Length = 415 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 67/423 (15%), Positives = 133/423 (31%), Gaps = 43/423 (10%) Query: 23 KHWKVVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W +++ GR + Y+G +E G Y Sbjct: 2 SDWAESSLEQLVTFQKGRKVDTSSFAQDGFAPYLGASGIEGGDDGYAATQFAVMS----- 56 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 IL G G+ ++ L P D + + LS I Sbjct: 57 ----KPTDILMLWDGERSGLVGYGK-TGVVASTVSKLSPNDAINPKYLFFALSDRF-AWI 110 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G + H + + P + KI + DTLI + I ++ Sbjct: 111 QHRRTGTGVPHVPKDLGRILRLRYPSDPRLQI---KIASIFEATDTLIQKSEALIAKYQQ 167 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIE--------WVGLVPDHWEVKPFFALVTELN---- 245 K ++ + T+G+ P+ +++ E +G +P W++K + T + Sbjct: 168 IKAGMMHDLFTRGVLPNGQLRPPRSEAPELYQDTSIGWIPSMWKLKRCADICTRICVGIV 227 Query: 246 -RKNTKLIESNILSLSYGNI----IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + +ES + + NI I + V G+I+ Sbjct: 228 IQPTQYYVESGVPAFRSANIREDGIDPSNLVFISHASNEVVAKSQVKAGDILSVRTGYPG 287 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKF 359 + I ++ + S YL + S + G +Q Sbjct: 288 TSAVVPVHFDRANCI--DILISTPSAQVISEYLCDWINSPFGKEQVLRQQGGMAQQHFNV 345 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +++ L V +P +EQ DI N I V ++ + + L+ ++ + +TG++ Sbjct: 346 GEMRELLVALPSREEQGDIRNRIGVVAKKL----AAEKALLEKLQYQKLGLMHDLLTGKV 401 Query: 420 DLR 422 +R Sbjct: 402 SVR 404 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 60/207 (28%), Gaps = 12/207 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSES-----GKDIIYIGLEDVESGTGKY 63 Y+D+ + W IP WK+ + G + + ++ Sbjct: 198 YQDTSIGW---IPSMWKLKRCADICTRICVGIVIQPTQYYVESGVPAFRSANIREDGIDP 254 Query: 64 LP-KDGNSRQSDTSTVSIFAKGQILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKDVL 120 + ++ S G IL + G + C + V+ Sbjct: 255 SNLVFISHASNEVVAKSQVKAGDILSVRTGYPGTSAVVPVHFDRANCIDILISTPSAQVI 314 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 E L W+ S +++ G H + + + + +P EQ IR +I + Sbjct: 315 SEYLCDWINSPFGKEQVLRQQGGMAQQHFNVGEMRELLVALPSREEQGDIRNRIGVVAKK 374 Query: 181 IDTLITERIRFIELLKEKKQALVSYIV 207 + + L++ V Sbjct: 375 LAAEKALLEKLQYQKLGLMHDLLTGKV 401 >gi|323438647|gb|EGA96390.1| hypothetical protein SAO11_2475 [Staphylococcus aureus O11] Length = 406 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 50/394 (12%), Positives = 116/394 (29%), Gaps = 16/394 (4%) Query: 24 HWKVVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ FTK+N G K L + + I Sbjct: 20 EWEEKRFADFTKINQGLQIAINERKTEYSPELYFYITNEFLRPNSQTKYFIENPPQSVIA 79 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K IL + G + + + L L S + +I ++ Sbjct: 80 NKEDILMTRTGNTGKVVTNVFGAFHNNFFKIKFDKNLYDRLFLVEVLNSSKIQNKILSLA 139 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +T+ + +I P L EQ I + +I+ + + K Q Sbjct: 140 GSSTIPDLNHSDFYSISSSYPLLREQQKIGQFFSKLDRQIELQEQKLELLQQQKKGYMQK 199 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + S + D D + +G + + ++ ++ + ++ Sbjct: 200 IFSQELRFKDENDEDYPDWKEKKLGDITE-------QSMYGIGASATRFDSKNIYIRITD 252 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL-RSAQVMERGIITSAY 320 + + P+ + +I+F K + + + + Sbjct: 253 IDEKSRKLNYQNLTTPDELNNKYKLKRNDILFARTGASTGKSYIHKEEKDIYNYYFAGFL 312 Query: 321 MAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + ++ + S V + + E+ +LP+++P EQ I Sbjct: 313 IKFEIDEQNNPLFIYQFTLTSKYNKWVKVMSVRSGQPGINSEEYAKLPLVLPNKLEQQKI 372 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++ R D +E +Q I +L++++ + Sbjct: 373 AEFLD----RFDQQIELEKQKIEILQQQKKGLLQ 402 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 29/213 (13%), Positives = 67/213 (31%), Gaps = 13/213 (6%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+++ EW + + RK E + Sbjct: 10 PELRFPGFEDEWEEKRFADFTKINQGLQIAINERKTEYSPELYFYITNEFLRPNSQTKYF 69 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + P+S I + +I+ + V + + D + Sbjct: 70 IENPPQSV----IANKEDILMTRTGNTGKVVT----NVFGAFHNNFFKIKFDKNLYDRLF 121 Query: 333 LAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L ++ S + ++ GS L D + P ++EQ I +++D Sbjct: 122 LVEVLNSSKIQNKILSLAGSSTIPDLNHSDFYSISSSYPLLREQQKIGQF----FSKLDR 177 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +E EQ + LL++++ ++ + ++ + E Sbjct: 178 QIELQEQKLELLQQQKKGYMQKIFSQELRFKDE 210 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 66/190 (34%), Gaps = 11/190 (5%) Query: 24 HWKVVPIKRFTKLNT---GRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 WK + T+ + G ++ IYI + D++ + K ++ + + Sbjct: 217 DWKEKKLGDITEQSMYGIGASATRFDSKNIYIRITDIDEKSRKLNYQNLTTPDELNNKYK 276 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL------VLQPKDVLPELLQGWLLSIDV 133 + + IL+ + G K+ I + + + P + + L+ Sbjct: 277 L-KRNDILFARTGASTGKSYIHKEEKDIYNYYFAGFLIKFEIDEQNNPLFIYQFTLTSKY 335 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ + + + + +P+ +P EQ I E + +I+ + + Sbjct: 336 NKWVKVMSVRSGQPGINSEEYAKLPLVLPNKLEQQKIAEFLDRFDQQIELEKQKIEILQQ 395 Query: 194 LLKEKKQALV 203 K Q++ Sbjct: 396 QKKGLLQSMF 405 >gi|212703157|ref|ZP_03311285.1| hypothetical protein DESPIG_01198 [Desulfovibrio piger ATCC 29098] gi|212673423|gb|EEB33906.1| hypothetical protein DESPIG_01198 [Desulfovibrio piger ATCC 29098] Length = 393 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 46/402 (11%), Positives = 122/402 (30%), Gaps = 29/402 (7%) Query: 30 IKRFTK-LNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 I+ +K + +G T + DI ++ +++ N S+ Sbjct: 9 IREISKKILSGGTPSTKNKGYYYNGDIPWLNTKEINFKRIYKTENYINQDGLRNSSAKWI 68 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + ++ G K I + L + + + + ++ ++I + Sbjct: 69 PRDSVIVAMYGATAGKVAINKIPLTTNQACCNLIINEKVADFNFIYYYLVNEYEKIIKLA 128 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE-KKQ 200 GA + + I N + +PPL EQ I + + +ID L + + + +Q Sbjct: 129 SGAAQQNLNVSIISNYIIFLPPLYEQKAIVGVLSSLDDKIDLLQRQNATLEAMAETLFRQ 188 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + + + S IE +G + ++ Sbjct: 189 WFIEEAQE---DWEEYPLSSFIEIIGGGTPKTSEESYWHGDILWMSGGDIASSHKSFIF- 244 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + + + ++ V + + E+ + Sbjct: 245 -------DTDKKISSEGLENSSANLLPKFSTVITARGTVG-----KICLLGEQAAFSQTN 292 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + P + + +L+ S LC + + + ++ + + P I N Sbjct: 293 YGILPRIAGTPFFTFLLMSDLLCYLKQSAYGSVFDTITRSTFEEIKFNCPTD---NYIVN 349 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + + + I L++ R + + ++G++ +R Sbjct: 350 F-ENMISPFFQKMFSNCRQIRTLEKLRDTLLPKLMSGEVRVR 390 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 60/195 (30%), Gaps = 11/195 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV---ESGTGKYLPKDGNSRQS 73 + W+ P+ F ++ G T ++ + DI+++ D+ K +S Sbjct: 196 EDWEEYPLSSFIEIIGGGTPKTSEESYWHGDILWMSGGDIASSHKSFIFDTDKKISSEGL 255 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + S+ ++ K + G + ++ + T + +L P + + Sbjct: 256 ENSSANLLPKFSTVITARGTVGKICLLGEQAAFSQTNYGILPRIAGTPFFTFLLMSDLLC 315 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + A G+ I P V I ++ + + + Sbjct: 316 YLKQSAY--GSVFDTITRSTFEEIKFNCPTDNYIVNFENMISPFFQKMFSNCRQIRTLEK 373 Query: 194 LLKEKKQALVSYIVT 208 L L+S V Sbjct: 374 LRDTLLPKLMSGEVR 388 >gi|256826063|ref|YP_003150023.1| hypothetical protein Ksed_22810 [Kytococcus sedentarius DSM 20547] gi|256689456|gb|ACV07258.1| hypothetical protein Ksed_22810 [Kytococcus sedentarius DSM 20547] Length = 418 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 66/421 (15%), Positives = 135/421 (32%), Gaps = 29/421 (6%) Query: 17 WIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-GKYLPKDGNSRQSDT 75 W+ +P W + + ++ R D + + T +Y+ + G+ + Sbjct: 4 WLEHLPSGWDTIQPR--SRFRERREPSRPDDEHLTPSQHLGVLTQREYMERTGSRVVLNL 61 Query: 76 STV---SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S G L + + G ST + V++ D W+ Sbjct: 62 SGADKMKHVEPGDF-IAHLRSFQGGLETSALRGKVSTAYTVMRAIDGAHHPYFRWVFKSH 120 Query: 133 VTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 A ++ IG++ +P+PP +Q I + + RID +I R Sbjct: 121 AFIGELASTTQQLRDGQTVRFQDIGSLRLPLPPEPDQRRIADFLDDRVSRIDRIIAARNT 180 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + + L+ + +T + G V + + + Sbjct: 181 QRGQVAAQAGQLIDHQLTDHGDRW-----------GAVRLGRLLTKLEQGWSPAADQQPA 229 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFI----DLQNDKRS 304 + + + + + P++ E + G+++ DL Sbjct: 230 ELGQWGVMRAGCVNSGEFRAEDNKRLPDAVEPRLEYEIKGGDLIMSRASGSLDLIGSVAL 289 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFED 361 + + + + Y G+ Y A +R + + SG +L Sbjct: 290 VPDSVRDQLLLCDKLYRLRTVAGLVPQYTAHALRHHANRQRIRQGVSGAEGMANNLPSGV 349 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ L + +P Q + + E A + +SI LL E + S I AAVTGQ+D+ Sbjct: 350 IRSLMIPLPDRSTQIEAIDRWEDEMAGNRRTQAALTRSIELLTEYKQSLITAAVTGQLDV 409 Query: 422 R 422 Sbjct: 410 T 410 >gi|149199121|ref|ZP_01876160.1| putative restriction-modification system specificity determinant [Lentisphaera araneosa HTCC2155] gi|149137718|gb|EDM26132.1| putative restriction-modification system specificity determinant [Lentisphaera araneosa HTCC2155] Length = 402 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 59/413 (14%), Positives = 126/413 (30%), Gaps = 41/413 (9%) Query: 30 IKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 I + G + + + ++ G Y K + + Sbjct: 6 IGDISSQIRGVSYKKNDVVDEPTERYTPVMRANNINEGFLNY-DKLVYVKSEVIKEHQLL 64 Query: 82 AKGQILYG----KLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVT 134 KG +L L + D F K V P + S Sbjct: 65 QKGDVLICASSGSLNLVGKAGSFLDSTSSSFGAFCKVLRPDTKKVFPRFFHFYFQSQGYK 124 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I+A+ EGA +++ + + ++ +P+P L EQ I + + E Sbjct: 125 RSIKALAEGANINNIKNEHLDDLKIPLPSLEEQKRIAAILDKADELRQKRREAISQCNEF 184 Query: 195 LKEKKQALVSYIV--TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 LK ++ V KG + + + G P + Sbjct: 185 LKSTFLSMFGDPVTNPKGWDKIIFDELLDNIDGGWSPKCETWPATLDEWGVMKLGALTTC 244 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E K E L ++ + P +++F + + Sbjct: 245 EY------------KEEENKAMLPGLETKSNIEIQPRDLLFSRKNTHELVAACAYVWDTR 292 Query: 313 RGIITSAYMAVKPHG----IDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRL 365 ++ S M ++S Y+ L+ + K A+ SG ++ +++K + Sbjct: 293 PQLMMSDLMFRFKFKASAEVNSIYMWKLLVNERQRKEVQALASGAAGSMPNISKKNLKTI 352 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +PPI+ Q + + + +++QS+ L + + + A G+ Sbjct: 353 KLPIPPIELQNQFAEI----AKKTESSKSQMQQSLKELDDNFDALMQKAFKGE 401 >gi|325474566|gb|EGC77752.1| type I restriction-modification system [Treponema denticola F0402] Length = 532 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 63/449 (14%), Positives = 126/449 (28%), Gaps = 73/449 (16%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P+ W + + G + + I +I + D + + + Sbjct: 86 VPEGWAWCRLGVVADIARGGSPRPIEDFITDKKNGINWIKIGDTVPESKYIISAKEKIKP 145 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 G L + R I+ I + L + + LS Sbjct: 146 EGKKHSRFVHAGDFLLTNSMSFGRPYILKIDGCIHDGWLVFADIIKYLLKDFLYYALSSK 205 Query: 133 VTQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ G+T+ + + + P+PPL EQ I I A +ID L + Sbjct: 206 YIYNSFSLVAAGSTVKNLKADTVKQVLFPLPPLLEQKRIITNIEAIFAQIDLLEQNKADL 265 Query: 192 IELLKEKKQALVSYIVTKGLNPD----------------------------------VKM 217 +K+ K ++ + L P Sbjct: 266 QTAVKQAKSKILDLAIRGKLVPQDPTDEPASVMLEKLHAEKEAKIVAGEIKRGKYDSYIY 325 Query: 218 KDS-----------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN----- 255 K+S E VP+ W + + T N Sbjct: 326 KNSTDNCYYQKYTDGREENISDEIPFTVPEGWACCRLPEVCRKPTTDGTHNSPPNSASGA 385 Query: 256 ILSLSYGNIIQ-----KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 L ++ NI T ES + + +++ + + Sbjct: 386 FLYITAKNIKNLEICLDDATYVSKEIHESIYSRCSPELNDVLLTKDGTIGEVA--VNNLN 443 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 +++S + GI S +LA+++ S L G + + + + + Sbjct: 444 YPFSMLSSVALIKPSKGILSWFLAYILISDLLQNKMKKNAKGSALKRIILTQINDFLIPL 503 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQ 398 PP+ EQ I I A++D + + + Sbjct: 504 PPLAEQKRIVAKIEELFAQLDFITTTLTK 532 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 78/212 (36%), Gaps = 14/212 (6%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG----------NIIQK 267 KD E VP+ W + + + IE I G + Sbjct: 76 KDIEDEIPFAVPEGWAWCRLGVVADIARGGSPRPIEDFITDKKNGINWIKIGDTVPESKY 135 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + +KPE + + V G+ + L+ + G + A Sbjct: 136 IISAKEKIKPEGKKHSRFVHAGDFLLTNSMSFGRPYILKIDGCIHDGWL---VFADIIKY 192 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + +L + + S + F + +G ++LK + VK++ +PP+ EQ I I Sbjct: 193 LLKDFLYYALSSKYIYNSFSLVAAGSTVKNLKADTVKQVLFPLPPLLEQKRIITNIEAIF 252 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 A+ID+L + +K+ +S + A+ G+ Sbjct: 253 AQIDLLEQNKADLQTAVKQAKSKILDLAIRGK 284 >gi|118496896|ref|YP_897946.1| type I restriction-modification system, subunit S [Francisella tularensis subsp. novicida U112] gi|194324121|ref|ZP_03057895.1| type I restriction modification DNA specificity domain protein [Francisella tularensis subsp. novicida FTE] gi|118422802|gb|ABK89192.1| type I restriction-modification system, subunit S [Francisella novicida U112] gi|194321568|gb|EDX19052.1| type I restriction modification DNA specificity domain protein [Francisella tularensis subsp. novicida FTE] Length = 406 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 57/422 (13%), Positives = 132/422 (31%), Gaps = 40/422 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +PK W+ + F + G + + I + + G G + Sbjct: 5 ELPKGWRECKLGDFISVKHGYAFKGKNITTEANENILVTPGNFNIGGG-FKKDKFKYFND 63 Query: 74 DTSTVSIFAKGQILYGKLGPYLR----------KAIIADFDGICSTQFLVLQPKDVLPEL 123 D + I + I+ I + + + + ++Q + L Sbjct: 64 DYPSEYILNESDIIVTMTDLSKESDTLGYSAKVPKSIKNEKYLHNQRIGLVQFINQLCNK 123 Query: 124 LQGWL--LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + I G ++ H I + PPLAEQ I E + + Sbjct: 124 EYIYWLLRTREYQNYIVGSASGTSIMHTSPSRICDYVFLCPPLAEQKAIAEVL----SSL 179 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 D I + + L++ Q L + + W + Sbjct: 180 DDKIDLLHKQNQTLEDMAQTLFREWFIEKADEG---------WEEMPLSEVADIKIGRTP 230 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 ++ ++ +S ++ Q+ N + + E + IV + L Sbjct: 231 PRKEKQWFSNDPKDVKWISIKDMGQEGVFINGTSEYLTQEAVEKFKIPIIVKNTVILSFK 290 Query: 302 KRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 R E + A + + YL +++Y + S + S+ Sbjct: 291 MTLGRVKITGENMLSNEAIAHFNITNDKLYNEYLYLFLKTYPYQTL--GSTSSIVTSINS 348 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +K + +++P K + VI+ + ++ ++ I L++ R + + ++GQ+ Sbjct: 349 AMIKNILIILPDFKVKKSFKEVISPMFEK----IQNNQKQIKTLEQTRDTLLPKLMSGQV 404 Query: 420 DL 421 + Sbjct: 405 RV 406 >gi|89890209|ref|ZP_01201719.1| putative type I restriction-modification specificity subunit [Flavobacteria bacterium BBFL7] gi|89517124|gb|EAS19781.1| putative type I restriction-modification specificity subunit [Flavobacteria bacterium BBFL7] Length = 408 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 48/406 (11%), Positives = 122/406 (30%), Gaps = 33/406 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD-----IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + G+ + I ++ + + + + + S++ Sbjct: 21 EWEKFILGEIATFGKGKNISKSDISEDGVLECIRYGELYTEYNEVISEVKSKTNLPISSL 80 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL--VLQPKDVLPELLQGWLLSIDVTQR 136 + + +L G A + K + + L+ + Sbjct: 81 ILSEENDVLIPASGETRIDIATASCVKKAGVALGGDLNIIKTKKNGVYLSYYLNSEKKFD 140 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I + +G ++ H + + + P EQ I + A +I L +++ + + K Sbjct: 141 IARLAQGNSVVHVYNSQLKTLKLNFPSQLEQQKIATFLTAVDDKISQLTSKKEQLTQYKK 200 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 Q L S + + D + +G V + E+ + + I Sbjct: 201 GVMQQLFSQELRFQDENGKQFPDWEEKRLGEV--------LKQQIREIPKPKQNYLAIGI 252 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 S G + N + + +V ++V ++ + + G++ Sbjct: 253 RSHVKGTFQKPDSDPNK----IAMKKLFVVKENDLVVNITFAWEGAIAIVKKE-DDGGLV 307 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM------GSGLRQS-LKFEDVKRLPVLV 369 + + + + + K F G R L ++ ++ Sbjct: 308 SHRFPTYTFKE--NQTCYEYFKHIIVDKKFRFTLDLISPGGAGRNRVLSKKEFLKIKWSF 365 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P +KEQ I ++ +D +E ++ I +E + + Sbjct: 366 PSLKEQQKIATYLSA----LDDKIEAVQVQIEKTQEFKKGLLQQLF 407 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 60/184 (32%), Gaps = 8/184 (4%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQN 300 K+ + + + YG + + +K ++ + + + +++ Sbjct: 38 NISKSDISEDGVLECIRYGELYTEYNEVISEVKSKTNLPISSLILSEENDVLIPASGETR 97 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + S V + G+ + + + YL++ + S + + Sbjct: 98 IDIATASC-VKKAGVALGGDLNIIKTKKNGVYLSYYLNSEKKFDIARLAQGNSVVHVYNS 156 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 +K L + P EQ I + +I L K EQ L + + + + ++ Sbjct: 157 QLKTLKLNFPSQLEQQKIATFLTAVDDKISQLTSKKEQ----LTQYKKGVMQQLFSQELR 212 Query: 421 LRGE 424 + E Sbjct: 213 FQDE 216 >gi|182626285|ref|ZP_02954041.1| type I restriction enzyme S subunit [Clostridium perfringens D str. JGS1721] gi|177908383|gb|EDT70925.1| type I restriction enzyme S subunit [Clostridium perfringens D str. JGS1721] Length = 387 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 64/405 (15%), Positives = 138/405 (34%), Gaps = 35/405 (8%) Query: 26 KVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 K IK ++TG T KDI++I +D++ + Sbjct: 3 KEYKIKELGDISTGNTPSKKNKEFYDSKDIMFIKPDDIDEDIKELSSSKEYISFIAKEKS 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I K +L +G K I +G + Q + P + + + ++ ++++ Sbjct: 63 RIIPKNTLLVTCIGSI-GKIAINKEEGAFNQQINAIVPNNKIFSSKYLAYVFMNNKEKLK 121 Query: 139 AICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 AI + + + I L Q I + + I+ + L Sbjct: 122 AIANAPVVPIINKTQFSEFKVYIHDDLGVQKKIVDILDKAQKLINKRKLQIEELDLL--- 178 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + S + +P K + + + + R ++ NI Sbjct: 179 ----VKSKFIEMFGDPVKNQKKLAKVKLSELGE-------WKTGGTPLRSKSEYYNGNIP 227 Query: 258 SLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 LS G + K ++ + ES +I++ G ++ D +L+S M Sbjct: 228 WLSSGELNNKYCFKSNEMITESAIIESAAKIIEVGSLLLGMYDT----AALKSTINMIEC 283 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 A K + + + + + G+ +++L +K L +L+P ++ Sbjct: 284 SCNQAIAYSKLNENLVNTVYVYYCIQIGKEFYKSQQRGVRQKNLNLSMIKGLEILMPELE 343 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q + R+D L ++E+S+ L++ +S + A G+ Sbjct: 344 LQNQFAEFV----KRVDKLKFEMEKSLKELEDNFNSLMQKAFKGE 384 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 53/192 (27%), Gaps = 10/192 (5%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 V + + TG T K +I ++ ++ + + S I Sbjct: 200 VKLSELGEWKTGGTPLRSKSEYYNGNIPWLSSGELNNKYCFKSNEMITESAIIESAAKII 259 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G +L G K+ I + C+ + + L + + + ++ Sbjct: 260 EVGSLLLGMYDTAALKSTINMIECSCNQAIAYSKLNENLVNTVYVYYCIQIGKEFYKSQQ 319 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + + I + + +P L Q E + + + + Sbjct: 320 RGVRQKNLNLSMIKGLEILMPELELQNQFAEFVKRVDKLKFEMEKSLKELEDNFN----S 375 Query: 202 LVSYIVTKGLNP 213 L+ L Sbjct: 376 LMQKAFKGELFK 387 >gi|229541312|ref|ZP_04430372.1| restriction modification system DNA specificity domain protein [Bacillus coagulans 36D1] gi|229325732|gb|EEN91407.1| restriction modification system DNA specificity domain protein [Bacillus coagulans 36D1] Length = 427 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 51/417 (12%), Positives = 130/417 (31%), Gaps = 30/417 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 IG IPK WKV +K + + +++ I + + K + Sbjct: 28 IGTIPKDWKVKKLKDISNRVQRKNDGKSHNVLTISSKGGFLNQTERFSKVIAGEN--LAK 85 Query: 78 VSIFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + K + Y K + + + + ++ + E + + ++ Sbjct: 86 YILLRKNEFAYNKGNSKTYPYGCIYRLEDYEEALVPNVYYCFEIREGVTEFYKHYFITGK 145 Query: 133 VTQRIEAICEGATMS----HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + + + + + + ++P+ PP+ EQ I + I+ Sbjct: 146 LNKFLARVINTGVRNDGLLNLNVTDFFDVPVAAPPIKEQQKIASILSTWDKAIELNEKLI 205 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + K Q L++ V + EW + ++KN Sbjct: 206 EQKKKQKKGLMQKLLTGEVR--------LPGFEGEWGKFKIKEVCNVVSGGTPSTNDKKN 257 Query: 249 TKLIESNIL--SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 ++ + + + K + ++ I+ + R Sbjct: 258 WDGNIPWCTPTDITSSGKFIRNTKQTITEKGLKNSSANLLPKNSILMCSRATIGPRSINR 317 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 +G + V + + S + + +DV+ Sbjct: 318 VEMATNQGFKS----FVCNEEYLDYEFFYYLLSIYIPIFKKLASGSTFLEVSKKDVENTK 373 Query: 367 VLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + +P +KEQ I ++I +D +E +E+ LK+++ + +TG++ ++ Sbjct: 374 IFIPKDVKEQKAIGSIIG----NLDKAIELLEEETKELKQQKKGLMQLLLTGKVRVK 426 Score = 96.0 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 81/207 (39%), Gaps = 10/207 (4%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G +P W+VK + + RKN + + S G + + E + + E+ Y Sbjct: 28 IGTIPKDWKVKKLKDISNRVQRKNDGKSHNVLTISSKGGFLNQTERFSKVIAGENLAKYI 87 Query: 285 IVDPGEIVFRFIDL-QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++ E + + + + E ++ + Y + + + + L Sbjct: 88 LLRKNEFAYNKGNSKTYPYGCIYRLEDYEEALVPNVYYCFEIREGVTEFYKHYFITGKLN 147 Query: 344 KVFYAMGSGLRQS-----LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 K + + ++ L D +PV PPIKEQ I ++++ D +E E+ Sbjct: 148 KFLARVINTGVRNDGLLNLNVTDFFDVPVAAPPIKEQQKIASILSTW----DKAIELNEK 203 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGES 425 I K+++ + +TG++ L G Sbjct: 204 LIEQKKKQKKGLMQKLLTGEVRLPGFE 230 >gi|121534615|ref|ZP_01666437.1| restriction modification system DNA specificity domain [Thermosinus carboxydivorans Nor1] gi|121306867|gb|EAX47787.1| restriction modification system DNA specificity domain [Thermosinus carboxydivorans Nor1] Length = 438 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 70/432 (16%), Positives = 142/432 (32%), Gaps = 42/432 (9%) Query: 26 KVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVES---GTGKYLPKDGNSRQSDTS 76 KVV IK K+ TG+T + G +I D+ S ++ + + + D Sbjct: 7 KVVKIKDVGKVITGKTPPTSQPELFGDKYPFITPSDISSFDVRYIDFVERGLSDKGFDKQ 66 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSIDVT 134 K + Y +G + K + + + Q +V+ P+ + L + Sbjct: 67 KRYALPKDTVCYVCIGSTIGKVCLTNKVSFTNQQINSIVVNRDKFNPKYVYYLLRAETPK 126 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + GA + + +I + + PL Q I + A I+ Sbjct: 127 IQAISGGTGAGKAILNKSSFEDIDLNVFPLPIQNKIAAILSAYDDLIENNTRRIKILE-- 184 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 E Q L K P + +G +P+ WEVK +L+ + Sbjct: 185 --EMAQLLYREWFVKFRFPGHEKVRMVDSELGPIPEGWEVKNIGSLLAHTIGGGWGEVSR 242 Query: 255 NILSLSYGNIIQKLETRNM----------GLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + +I+ + N+ ES + + G+IVF + Sbjct: 243 SDKYTVPAYVIRGTDIPNVRQGSIESCPLRYHTESNFRSRKLKAGDIVFEVSGGSKGQPV 302 Query: 305 LRS-------AQVMERGIITSAY---MAVKPHGIDSTYLAWLMRSYDLCKVFYAM--GSG 352 R+ + +I +++ + + + + V S Sbjct: 303 GRALLINQSLLNSYDNDVICASFCKLIRPDKETMLPELIYLHLLEIYANGVIEKYQVQST 362 Query: 353 LRQSLKFEDV-KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + K+E K +LVP K Q + + I I +++K+ L+ R + Sbjct: 363 GITNFKYEFFLKNDQILVPDRKIQQNFADHI----IPIFDMIQKLGAMNRNLRRTRDLLL 418 Query: 412 AAAVTGQIDLRG 423 ++G++D+ Sbjct: 419 PKLISGELDVED 430 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 63/221 (28%), Gaps = 23/221 (10%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTG-------RTSESGKDIIYIGLEDV-ESGTGKY 63 DS +G IP+ W+V I G R+ + I D+ G Sbjct: 210 DSE---LGPIPEGWEVKNIGSLLAHTIGGGWGEVSRSDKYTVPAYVIRGTDIPNVRQGSI 266 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILY-----GKLGPYLRKAII-------ADFDGICSTQF 111 G I++ K P R +I D D IC++ Sbjct: 267 ESCPLRYHTESNFRSRKLKAGDIVFEVSGGSKGQPVGRALLINQSLLNSYDNDVICASFC 326 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 +++P +L +++ + L I+ Sbjct: 327 KLIRPDKETMLPELIYLHLLEIYANGVIEKYQVQSTGITNFKYEFFLKNDQILVPDRKIQ 386 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + + I +I + L+ + L+ +++ L+ Sbjct: 387 QNFADHIIPIFDMIQKLGAMNRNLRRTRDLLLPKLISGELD 427 >gi|324990378|gb|EGC22316.1| type I restriction-modification system specificity subunit [Streptococcus sanguinis SK353] Length = 415 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 62/400 (15%), Positives = 133/400 (33%), Gaps = 24/400 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ + G+ G I + + G + +V + Sbjct: 29 WEQRKLGEVADFTKGKGYSKGDIEMSGTPIILYGRLYTNYGTIIDNVDTYVTMKEHSV-L 87 Query: 81 FAKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI-DVTQ 135 +I+ R +++A I +++ + LS + Sbjct: 88 SEGNEIIVPSSGESSEEISRASVVAKKGVILGGDLNIIRLNSKFSSVFVAITLSNGSQQK 147 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + +G ++ H + + + P L EQ+ I +D LIT + R ++L+ Sbjct: 148 ELSKRAQGKSVVHLHNSDLKEVNLFYPTLPEQIAIGSF----FQELDQLITLQQRELKLI 203 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 KE K+ L+S + K ++ G +V F+ T N + + Sbjct: 204 KEGKKTLLSKMFPKDGENFPGIRFPGFTDAWEQRKLGDVFTSFSGGTPAAG-NKRYYGGD 262 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I + I + + + ++V G+I++ + + + G Sbjct: 263 IPFIRSAEIHSDSTELFLTNEGLENSSAKLVKKGDILYALYGATSGEVDISKI----NGA 318 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I A + + P S+ Y + G + +L VK L + +P EQ Sbjct: 319 INQAILCIVPKINYSSGFIMQWLKYQKKNITDKYLQGGQGNLSGTLVKELDISLPTPPEQ 378 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I + +D L+ ++ + +LK +S+ + A+ Sbjct: 379 RAIGSF----FQELDHLITLQQRELEILKTMKSTLL-KAM 413 >gi|282866390|ref|ZP_06275435.1| restriction modification system DNA specificity domain protein [Streptomyces sp. ACTE] gi|282558786|gb|EFB64343.1| restriction modification system DNA specificity domain protein [Streptomyces sp. ACTE] Length = 392 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 68/399 (17%), Positives = 136/399 (34%), Gaps = 27/399 (6%) Query: 27 VVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYL---PKDGNSRQSDTST 77 VP+ ++ +G T ++G +I + +D+ S +GKY+ P+ D+ Sbjct: 6 EVPLSECCEVVSGGTPKTGVASYWHGEIPWATPKDLGSLSGKYISETPRKITQEGLDSCG 65 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G +L+ P + + F PK + + + Sbjct: 66 ATLLPAGSVLFSSRAPIGH-VAVNAISMATNQGFKSFIPKPDYLDASYLYHWLRASRPYL 124 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E++ GAT +G + +P+P + +Q + + R I+LL E Sbjct: 125 ESLGNGATFKEISKSTVGKVKIPLPSIDDQRKVARVLDRVDELCAK----RCEAIDLLDE 180 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q++ + + ++ + +G + N +N I Sbjct: 181 LAQSIFLDMFGDPVVNSRELPTLPMSEIGKITTGSTPPR-------SNPRNYGNSIEWIK 233 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S + N L + L + + +IVDPG I+ I SA R Sbjct: 234 SDNIDNSSVYLTSAAERLSEDGAKIARIVDPGSILVTCIAGSTAAIG-SSAIANRRVSFN 292 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 A+ P DS +L + +R + + G++ + + +L PP EQ + Sbjct: 293 QQINAITPFNADSLFLYYQLRLAKPL-ILEKVTGGVKFLVSKSRFGSVVLLNPPHAEQRE 351 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + L + + L+ SS +A Sbjct: 352 FSKRAGLVLG----LQDMNRAHLAELRSLFSSLQHSAFR 386 >gi|261416113|ref|YP_003249796.1| restriction modification system DNA specificity domain protein [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372569|gb|ACX75314.1| restriction modification system DNA specificity domain protein [Fibrobacter succinogenes subsp. succinogenes S85] gi|302327173|gb|ADL26374.1| putative type I restriction-modification system, S subunit [Fibrobacter succinogenes subsp. succinogenes S85] Length = 383 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 71/406 (17%), Positives = 138/406 (33%), Gaps = 39/406 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ +K + K I D++ TG Y + + + Sbjct: 4 WEWKKLKDICE----------KGSSNIKQSDLKDLTGDYPIFGASGYIQNVDFYQR-NRD 52 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I K G + + ++ + PK+ + L +GA Sbjct: 53 YIGIIKDGSGVGRTMLLPAFSSVIGTLQYILPKEGNDIKFINYALQN---IDFSKSIQGA 109 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + H +K G + +PPL+EQ I + + E +I+TL T ++ KE ++ + Sbjct: 110 AIPHIYFKDYGETEILVPPLSEQKSIVKFLDEEFSKIETLKTNAETNLKNAKELFESTLE 169 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + G G +P WE K L KN L Sbjct: 170 KELNPG-------------KNGTLPSGWEWKTLRELCILRPSKNEALSHLKGTDEVSFLP 216 Query: 265 IQKLETRNMGLKP-------ESYETYQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGI 315 ++ L R P E + +Y G+++ + N K + S + G Sbjct: 217 MEDLNIRERNTIPHKSRALSEVHGSYTFFAEGDVLLAKVTPCFENGKMGIASNLLNGVGF 276 Query: 316 ITSAYMAVKP-HGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 +S Y+ + + ++YL +++ S +G+ + L E V+ + +PP+ Sbjct: 277 GSSEYIVFRTTKSMINSYLFYVLMSSRFISGGKKQMLGACGLKRLSKEYVESFQIPLPPL 336 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q +I ++ + + L +Q I E + S + G+ Sbjct: 337 SVQKEIVARLDKLSENVKRLEVNYKQIIANCDELKKSILKKTFEGE 382 Score = 69.8 bits (169), Expect = 9e-10, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 78/202 (38%), Gaps = 13/202 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 G +P W+ ++ L + + ++ ++ +ED+ +P + Sbjct: 178 GTLPSGWEWKTLRELCILRPSKNEALSHLKGTDEVSFLPMEDLNIRERNTIPHKSRALSE 237 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQ-PKDVLPELLQG 126 + + FA+G +L K+ P + + G S++++V + K ++ L Sbjct: 238 VHGSYTFFAEGDVLLAKVTPCFENGKMGIASNLLNGVGFGSSEYIVFRTTKSMINSYLFY 297 Query: 127 WLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L+S + GA + + + + +P+PPL+ Q I ++ + + L Sbjct: 298 VLMSSRFISGGKKQMLGACGLKRLSKEYVESFQIPLPPLSVQKEIVARLDKLSENVKRLE 357 Query: 186 TERIRFIELLKEKKQALVSYIV 207 + I E K++++ Sbjct: 358 VNYKQIIANCDELKKSILKKTF 379 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 55/170 (32%), Gaps = 4/170 (2%) Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + +K + E ++ ++ + + I Sbjct: 1 MSKWEWKKLKDICEKGSSNIKQSDLKDLTGDYPIFGASGYIQNVDFYQRNRDYIGIIK-D 59 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 +I + + G D ++ + +++ D K ++ + F Sbjct: 60 GSGVGRTMLLPAFSSVIGTLQYILPKEGNDIKFINYALQNIDFSK---SIQGAAIPHIYF 116 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +D +LVPP+ EQ I ++ E ++I+ L E ++ KE S Sbjct: 117 KDYGETEILVPPLSEQKSIVKFLDEEFSKIETLKTNAETNLKNAKELFES 166 >gi|217974626|ref|YP_002359377.1| restriction modification system DNA specificity domain-containing protein [Shewanella baltica OS223] gi|217499761|gb|ACK47954.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS223] Length = 419 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 49/411 (11%), Positives = 121/411 (29%), Gaps = 42/411 (10%) Query: 26 KVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 K P++ + G + + ++ + D+ + + Sbjct: 17 KWKPLEDVAEFRRGSFPQPYGNSEWYDGEGSMPFVQVVDLLDDSFELKEITKQRISKKAQ 76 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 S+F + + L + + + +D + + Sbjct: 77 PKSVFVRNGTVIVTLQGTIGRVALTQYDCYVDRTLAIFTNYIECINTKYFAYQLKSKFEV 136 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERI 189 + G+T+ +PIP LA Q I + A T L E Sbjct: 137 EKKNARGSTLKTITKAEFSKFQIPIPCPNNPEKSLAIQAEIVRILDAFTAMTAELTAELN 196 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRK 247 + + L+S ++ +EW +G V T Sbjct: 197 MRKKQYNYYRDQLLS------------FEEGEVEWKTLGDVTQ------LITKGTTPKEF 238 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLR 306 + + L N I+ + + + + E I++ G+I+F K ++ Sbjct: 239 VSDGVNFIKLESFDDNQIKPDKFMFITPEVHNKELKRSILEEGDILFAIAGATIGKCAIV 298 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRL 365 V+ + + ++ + + M++ + + + ++ + + Sbjct: 299 DKSVLPANTNQALAIVRLTQQVNVKFAFYYMQTTAMTDYIAKFNKTSAQPNINLKQMSEF 358 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + VP I EQ I +++ + E + + I L ++ R ++ Sbjct: 359 KIPVPTINEQIRIVKILDNFNTLTSSIKEGLPREIELRQKQYEYYRDLLLS 409 >gi|308064290|gb|ADO06177.1| type I R-M system specificity subunit [Helicobacter pylori Sat464] Length = 369 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 61/406 (15%), Positives = 121/406 (29%), Gaps = 46/406 (11%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 2 LPLNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYK 59 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G R I +V E L Sbjct: 60 TKYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYS 116 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ I + + L Sbjct: 117 NVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQNAIANILSGLDRYLYAL----------- 165 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 AL+ L + K E + + V + N + Sbjct: 166 ----DALI-------LKKEGVKKALSFELLSQRKRLKGFNQAWQRVRLGDIANYLTSNLS 214 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++ I+ + N ++ I D I R L + I Sbjct: 215 VEQITQQGKIKVYDVNNFIGYTDTT---FISDKPYISIVKDGSVGRVRILPP----KTNI 267 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 +++ + H + +L +L+ ++D + + F+D K + +PP+ EQ Sbjct: 268 LSTMGALIANHRTTTEFLFYLLSNFDFKNF---TSGSIIPHIYFKDYKEKTIFLPPLNEQ 324 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 325 NAIANILSALDNEIASLKNKKRQ----FENIKKALNHDLMSAKIRV 366 >gi|312872212|ref|ZP_07732285.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2062A-h1] gi|311092296|gb|EFQ50667.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2062A-h1] Length = 401 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 53/409 (12%), Positives = 132/409 (32%), Gaps = 26/409 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYL---PKDGNSRQS 73 +WK+ I + G T + G I +I +D+ +G+++ ++ + Sbjct: 3 NWKICTIGDLGMVIGGATPSTKAAENYDGGTIAWITPKDLAGFSGRFISYGERNITKQGL 62 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K +L+ P IA+ + + F + P D + L Sbjct: 63 KSCSAKLMPKHTVLFSSRAPI-GYIAIANQELCTNQGFKSVVPNDDTD-YKFLYYLLKYN 120 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFI 192 +IE + G T + +I + +P + EQ I + +D I + Sbjct: 121 KNKIENLGSGTTFKEVSGSTMRDIEVSVPTSIEEQRKIASVL----SLLDDKIEKNASIN 176 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + L+++ QA+ + L+ I+W+ D + K + Sbjct: 177 KNLEQQAQAIFK---SWFLDYKPFNGVRPIDWINGTIDDLA--KEVVCGKTPSTKVKEYY 231 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 S++ + ++ +Y L + + Sbjct: 232 GSDVPFIKIPDMHGNTYVVTTEQYLSNYGAASQAKKTLPPNSICVSCIGTAGLVTLVASK 291 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 V Y+ LM++ +L +++ ++P I Sbjct: 292 SQTNQQINAIVPKDKYSPFYIYLLMQTLSEVINKLGQSGSTIVNLNKTQFEKIKAIIPSI 351 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + + L+ + ++ + L R++ + ++G++D+ Sbjct: 352 TDMKTF----DALVSPLFALILENQKENIRLSSLRNTLLPKLMSGELDV 396 >gi|308062110|gb|ADO03998.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori Cuz20] Length = 429 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 59/409 (14%), Positives = 128/409 (31%), Gaps = 22/409 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYI--GLEDVESGTG------KYLPKDGNSRQS 73 PK + + + G T + ++I + G++ + + + ++ Sbjct: 13 PKGVEFRKLGDIGEYIRGVTYKKNQEINNLECGIKVLRANNITLSNHLNFEDIKVINKNV 72 Query: 74 DTSTVSIFAKGQILY---GKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWL 128 K IL ++ K DFD + V++ ++V + Sbjct: 73 KIRKEQYLKKNDILICAGSGSSEHIGKVAFINTDFDYVFGGFMGVIRIREVNSRFVYHIF 132 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S Q +E T+++ + + N +PIPPL Q I + + A T L TE Sbjct: 133 TSNIFKQYLEKSLNTTTINNLNANILQNFLIPIPPLEIQQEIVKILDAFTELNTELNTEL 192 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN-RK 247 ++ K++ Q ++ + + KD+ I+ V + Sbjct: 193 NTELKARKKQYQ-YYQNMLLDFKDTNQSHKDAKIKTYPKRLKTLLQTLAPKGVEFRKLGE 251 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 +++ L+ K N G+ Y D +I+ + Sbjct: 252 VINILKGKQLNKELLLDYGKYPVMNGGIYASGYWNEYNTDCPKIIISQGGAS---AGYVN 308 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + Y + + + + +L D++ L + Sbjct: 309 YMTSKFWAGAHCYAIELNSEKLNYKFLYYFLKNSQTILMKSQFGAGIPALNKADIETLTI 368 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +PP++ Q +I +++ A L+ I I K+ R + Sbjct: 369 PIPPLEIQQEIVKILDQFLALTTDLLAGIPAEIEARKKQYEYYREKLLT 417 >gi|226310299|ref|YP_002770193.1| type I restriction modification system specificity protein [Brevibacillus brevis NBRC 100599] gi|226093247|dbj|BAH41689.1| putative type I restriction modification system specificity protein [Brevibacillus brevis NBRC 100599] Length = 411 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 54/400 (13%), Positives = 137/400 (34%), Gaps = 26/400 (6%) Query: 26 KVVPIKRFT-KLNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + + T + R ++ YI L V + ++ + + + K Sbjct: 17 EWKALGDVTLPTSNIRWRDTKDTYRYIDLTSVSREKNIIIETTEISAENAPSRAQKLVIK 76 Query: 84 GQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIE 138 +++ P ++ + + + ST + +L+ + LP+ + + S +E Sbjct: 77 NDVIFATTRPTQQRLCLITEEFSGEVASTGYCILRARKDEVLPKWIYHSITSSRFKNYVE 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ + + +PIPPL Q I + A T L +E + K++ Sbjct: 137 ENQSGSAYPAISDAKVKDFKIPIPPLKVQEEIVRILDAFTEFTSELTSELTSELTARKKQ 196 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + + ++ +EW +G V + + Sbjct: 197 YTYYR--------DKLLTFEEGEVEWKTLGEVAKFRRGSFPQPYGKDEWYGGEG-AMPFV 247 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + G ++ ++ + + V G ++ + ++R + Sbjct: 248 QVVDVGEDMRLVQNTKNKISKLAQPKSVFVQEGTVIVTLQGSIGRVAITQYDCYVDRTL- 306 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 A I+ Y A+ +++ + A GS +++ E+ + + +PP+ EQ Sbjct: 307 --AIFESFQVKINKKYFAYQLQAKFAFEKEKARGS-TIKTITKEEFTKFQIPIPPLTEQE 363 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 I ++++ A + E + + I L ++ R+ ++ Sbjct: 364 RIVSILDKFDALTSSITEALPREIELRQKQYEYYRNLLLS 403 >gi|254414907|ref|ZP_05028671.1| Type I restriction modification DNA specificity domain protein [Microcoleus chthonoplastes PCC 7420] gi|196178396|gb|EDX73396.1| Type I restriction modification DNA specificity domain protein [Microcoleus chthonoplastes PCC 7420] Length = 506 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 61/409 (14%), Positives = 136/409 (33%), Gaps = 32/409 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P W V + + N G++ SG G +G Y + + Sbjct: 5 PLSWIGVTLGDLLRFNYGKSLPERARSGAGFPVYG----SNGIVGYHDEPLTDGE----- 55 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G+ G T + V Q + L ++ +++ Sbjct: 56 -------TLIIGRKGSVGEVHFSPGACFPIDTTYYVDQFHGMPTRYWFYQLKNLGLSE-- 106 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + + + K + + + PL EQ I +K+ A R+D IR ++++ Sbjct: 107 --LDKATAIPSLNRKDAYRVQIHLSPLNEQKRIADKLDALLARVDACRDRLIRVSFIIQQ 164 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN-RKNTKLIESNI 256 +QA+++ ++ + ++ ++ F ++ + I Sbjct: 165 LRQAILTDGISGKITQYWSKNNAENLAYNHQNIVGKLSDFADVIDPNPSHRYPSYKGGTI 224 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI- 315 L+ + + K Y+ Y+ + K L A+ + I Sbjct: 225 PILATEQMSGLNDWDTSSAKLIKYDFYEARKAAHDFLNDDIIFARKGRLGLARNPPQNIR 284 Query: 316 ----ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLV 369 T + VK I +YL W +R + +L ++RLP+ + Sbjct: 285 YVFSHTVFIIRVKADNILPSYLLWFLRQEFCIDWLLSEMNSNAGVPTLGKSVMERLPITI 344 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P EQ +I I A D + + + ++ +++ + ++ A G+ Sbjct: 345 PDYAEQQEIVQCIEKLYAYADRIEARYQNALTRVEQLTPTLLSKAFRGE 393 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 43/122 (35%) Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 L ++ G + + + SL Sbjct: 57 LIIGRKGSVGEVHFSPGACFPIDTTYYVDQFHGMPTRYWFYQLKNLGLSELDKATAIPSL 116 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +D R+ + + P+ EQ I + ++ AR+D +++ + ++++ R + + ++G Sbjct: 117 NRKDAYRVQIHLSPLNEQKRIADKLDALLARVDACRDRLIRVSFIIQQLRQAILTDGISG 176 Query: 418 QI 419 +I Sbjct: 177 KI 178 >gi|307290561|ref|ZP_07570472.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0411] gi|306498382|gb|EFM67888.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0411] gi|315158690|gb|EFU02707.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0312] Length = 387 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 52/402 (12%), Positives = 125/402 (31%), Gaps = 44/402 (10%) Query: 23 KHWKVVPIKRFTK--------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + W+ + + G S +G V S + Sbjct: 16 EDWEQRKLGEVVESVGTGRSTFTNGIVQTSETPYAVLGSTSVISYDSMFD---------- 65 Query: 75 TSTVSIFAKGQILYG-KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G + ++G + S + ++ + Sbjct: 66 -------HSGDFILTARVGANAGNLYKYFGEVKISDN------TVYIQADNLDFIYYLLT 112 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ + G + N+ + P E+ +KI +D IT R ++ Sbjct: 113 KYDLKRLSFGTGQPLVKASEVKNLKLNFPQKNEEQ---QKIGTFFKNLDDTITLHQRKLD 169 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LLKE K+ + + K +++ G E+ + T + N E Sbjct: 170 LLKETKKGFLQKMFPKNGAKVPEIRFPGFTEDWEERKLGEIVRISSGFTGDSSLNIGQYE 229 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + +P+ ++D G+I+F I+ + + + + Sbjct: 230 LTRIETIATGQVNPNKVGYSNTEPD---KKYLLDKGDILFSNINSLSHIGKIALFDLDMK 286 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPP 371 + ++P ++S +L + + + + + + S+ ++ + LVP Sbjct: 287 LYHGINLLRLQPMNVNSQFLYQSFQLNNHLEWAKSHANQAVSQASINQTELSKQVFLVPS 346 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I ++D + ++ + LLKE + F+ Sbjct: 347 QQEQQKIGTF----FKQLDDTIALHQRKLDLLKETKKGFLQK 384 >gi|198284112|ref|YP_002220433.1| restriction modification system DNA specificity protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198248633|gb|ACH84226.1| restriction modification system DNA specificity domain [Acidithiobacillus ferrooxidans ATCC 53993] Length = 423 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 46/400 (11%), Positives = 113/400 (28%), Gaps = 24/400 (6%) Query: 24 HWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 WK+ P+ + + E ++ E + K+ + Q + + + Sbjct: 32 GWKLAPLSQLATRTKQKNRDEKITRVLTNSAEFGVMDQRDFFDKEIAT-QGNLESYFVVE 90 Query: 83 KGQILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL---SIDVTQ 135 G +Y P + G+ S + V + KD + + + + Sbjct: 91 LGSYVYNPRISATAPVGPISKNKVGTGVMSPLYTVFKFKDGGNDFYEHYFKTTGWHTYMR 150 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + +P+P+P EQ I E + + + + Sbjct: 151 QASSTGARHDRMAISSDDFMAMPLPVPTPKEQQKIAECLSSVDALMAAQARKVDALKT-- 208 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K+ L+ + +++ + G + ++ L Sbjct: 209 --HKKGLMQQLFPTEGETQPRLRFPEFQNAGEWNKTTLGEAATFFNGRAYKQEELLESGK 266 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 L GN L+ + + D G++++ + + + I Sbjct: 267 YPVLRVGNFFTNNNWYYSDLELDETK---YCDKGDLLYAWSASFGPRMWHGVKVIYHYHI 323 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + GID +L + + + + ++ P EQ Sbjct: 324 ----WKVEQHSGIDRQFLFITLENETERMKSNSANGLGLLHITKGTIEGWDTAFPSPPEQ 379 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I + + + +D L+ Q + LK + + Sbjct: 380 HRIASCL----SSLDALITLETQKLEALKTHKKGLMQQLF 415 >gi|303252529|ref|ZP_07338692.1| hypothetical protein APP2_1506 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|307247278|ref|ZP_07529327.1| Restriction modification system DNA specificity domain [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|302648497|gb|EFL78690.1| hypothetical protein APP2_1506 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|306856251|gb|EFM88405.1| Restriction modification system DNA specificity domain [Actinobacillus pleuropneumoniae serovar 2 str. S1536] Length = 414 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 56/410 (13%), Positives = 126/410 (30%), Gaps = 38/410 (9%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDV--ESGTG 61 KD V+W + K G T + + ++ + Sbjct: 8 KDCEVEW----------KSLGEVAKYVRGLTYNKTNESDEKAGGYYVLRANNITLSNNQL 57 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFL--VLQ 115 + + T K IL + A I++ F+ V Sbjct: 58 NFDDVKLVKFDTKTKPEQKLYKDDILISAASGSKEHVGKVAFISENMDFYFGGFMGVVRC 117 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 +++LP L L S + + +T+++ + K + +PIPPL Q I + + Sbjct: 118 SQEILPRFLFHILTSSLFKTYLNEVLNSSTINNLNAKVMNEFQIPIPPLEIQEKIVKILD 177 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 T TL + L ++ ++ + + K++ +G + Sbjct: 178 KFTELEATLEATLEAELSLRVKQYNYYRD-LLLNENDKNPFFKNTEYRCLGDI------- 229 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + + ++ S+ N + L S ++V +++F Sbjct: 230 TLVSSNIKWKNNTNTYKYIDLTSVDRENHSIGETIKISALTAPS-RAQKLVAKDDVIFAT 288 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + + + I ++ Y P+ + ++ + S D SG Sbjct: 289 TRPTQLRFAF-INEEFANSIASTGYCVLRANPNLVLPKWIYHNLGSIDFKNFLEENQSGS 347 Query: 354 R-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++ VK + VP + Q I +++ + + + + I L Sbjct: 348 AYPAVSDSKVKDYKIPVPSLDVQEKIIAILDNFENLANSIKNGLPREIEL 397 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 60/205 (29%), Gaps = 10/205 (4%) Query: 218 KDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 KD +EW +G V + + E + K +++ N + + Sbjct: 8 KDCEVEWKSLGEVAKYVRGLT-YNKTNESDEKAGGYYVLRANNITLSNNQLNFDDVKLVK 66 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM--AVKPHGIDSTYL 333 + Q + +I+ + + +M I +L Sbjct: 67 FDTKTKPEQKLYKDDILISAASGSKEHVGKVAFISENMDFYFGGFMGVVRCSQEILPRFL 126 Query: 334 AWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 ++ S + S +L + + + +PP++ Q I +++ T L Sbjct: 127 FHILTSSLFKTYLNEVLNSSTINNLNAKVMNEFQIPIPPLEIQEKIVKILDKFTELEATL 186 Query: 393 VEKIEQSIVL----LKERRSSFIAA 413 +E + L R + Sbjct: 187 EATLEAELSLRVKQYNYYRDLLLNE 211 >gi|311063621|ref|YP_003970346.1| type I restriction-modification system specificity determinant [Bifidobacterium bifidum PRL2010] gi|310865940|gb|ADP35309.1| Type I restriction-modification system specificity determinant [Bifidobacterium bifidum PRL2010] Length = 403 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 49/410 (11%), Positives = 119/410 (29%), Gaps = 44/410 (10%) Query: 22 PKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-S 76 P K + + G + I + + G + + + Sbjct: 13 PDGVKHQTLGEIGEFIRGNGIQKRDFRDSGFGCIHYGQIYTYYGLFADHTKSFIDPNLAE 72 Query: 77 TVSIFAKGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 KG ++ A + + S + + P+ + S Sbjct: 73 KRKKAYKGDLVIATTSENEEDVCKACAWLGEEPIAISGDAYIFR-HHQNPKYISYCFQSE 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + G + + + I +P+PPL Q I + + + L E Sbjct: 132 LFQSQKKKYITGTKVLRVNGDAMAKIHVPVPPLPVQEEIVRILDSFSSLEAELEAELEAR 191 Query: 192 IELLKEKKQALVS--YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + + L++ +VT + SG + K Sbjct: 192 RKQYAYYRNELLTFERVVTVCIQDICIRICSG--------------------GTPSSKRH 231 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLR 306 + N+ L +I + + + Q + ++ K ++ Sbjct: 232 DYYDGNVPWLRTQDIDFNVINQTSATISDEGLRNSAAQWIPANCVIVAMYGATAAKVAVN 291 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 S + + + D Y+ + + + A+G G + ++ + VK P Sbjct: 292 SIPLTTNQACCNLQIDETK--ADVRYVFHWLSNEY--EHLKALGEGSQSNINAKKVKSYP 347 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + +PP++EQ I ++++ + L + I ++ R ++ Sbjct: 348 ISLPPLEEQRRIVSILDRFDKLTNDLSSGLPAEIEARRKQYEYYRDRLLS 397 >gi|187934035|ref|YP_001886271.1| Sau1hsdS1 [Clostridium botulinum B str. Eklund 17B] gi|187722188|gb|ACD23409.1| Sau1hsdS1 [Clostridium botulinum B str. Eklund 17B] Length = 422 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 63/403 (15%), Positives = 138/403 (34%), Gaps = 39/403 (9%) Query: 25 WKVVPIKRFTKLN---TGRTSESGKDI------IYIGLEDVESGTGKY-LPKDGNSRQSD 74 W+ + L G+ S D I++ ++ + + +S+ Sbjct: 20 WEQRRLSDIANLIDGDRGKNYPSSTDFYEDGHTIFLSATNITRNGFSFESNQYITEEKSN 79 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDG-------ICSTQFLVLQPKDVLPELLQGW 127 I+ G + I S ++ + V P + + Sbjct: 80 VLGNGKVEINDIVLTSRGSLGHIGWYNNDIKSLIPFARINSGMLIIRSMEAVEPSYIAQY 139 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + S ++IE I G+ K + N + I +EQ I IT Sbjct: 140 MKSSLGKRQIELISFGSAQPQLTKKDMSNYKISITKKSEQDKIGFFFNNLDNL----ITL 195 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 R + L++KK+ L+ + K +++ G D WE + + + RK Sbjct: 196 HQRKLNHLQDKKKGLLQKMFPKEGEKFPELRFPG------FTDPWEQRKLGDIAKRITRK 249 Query: 248 NTKLIESNILSLS-YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK-RSL 305 NT+L + L++S ++ ++ N + Y ++ GE + + ++ Sbjct: 250 NTELKSTLPLTISAQYGLVDQITFFNKRVASRDVSGYYLLRKGEFAYNKSYSEGYPWGAI 309 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQ----SLKFE 360 + + E G++++ Y+ K ++S +L + + K G R ++ E Sbjct: 310 KRLERYENGVLSTLYICFKLSDVNSNFLVSYYNTNNWHKEIAQRAAEGARNHGLLNISAE 369 Query: 361 DVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 D + +P +EQ I +++ L+ + + Sbjct: 370 DFFDTKLTIPKSKEEQARIGEY----FKQLNNLITLHHRKLNH 408 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 23/215 (10%), Positives = 63/215 (29%), Gaps = 22/215 (10%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 +P + + S I + + + + LS NI + Sbjct: 13 FPGFTDPWEQRRLSDIANLIDG----------DRGKNYPSSTDFYEDGHTIFLSATNITR 62 Query: 267 KLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSL---RSAQVMERGIITSA 319 + ++ + V+ +IV + + I + Sbjct: 63 NGFSFESNQYITEEKSNVLGNGKVEINDIVLTSRGSLGHIGWYNNDIKSLIPFARINSGM 122 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + ++ +Y+A M+S + + + L +D+ + + EQ I Sbjct: 123 LIIRSMEAVEPSYIAQYMKSSLGKRQIELISFGSAQPQLTKKDMSNYKISITKKSEQDKI 182 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ ++ + L++++ + Sbjct: 183 GFF----FNNLDNLITLHQRKLNHLQDKKKGLLQK 213 >gi|217968470|ref|YP_002353704.1| restriction modification system DNA specificity domain protein [Thauera sp. MZ1T] gi|217505797|gb|ACK52808.1| restriction modification system DNA specificity domain protein [Thauera sp. MZ1T] Length = 378 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 52/395 (13%), Positives = 114/395 (28%), Gaps = 36/395 (9%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + G+ + G+D G+Y N + I+ Sbjct: 7 VTLGEVVDFFNGKAIKPGQD-------------GEYPAYGSNGLIGGAPDWKY--ENSII 51 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G++G Y S +V +PK L ++++ GA Sbjct: 52 IGRVGAYCGSVAYCKSRFWASDNTIVARPKSGDVGYFYYLLKALEL----NRYAGGAAQP 107 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + +P +P + Q I + A I+ E + + Sbjct: 108 LVTQTVLKGVPARVPDIPTQRRIASILSAYDDLIENNTRRIAILE----EMARRIYEEWF 163 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + P + +GL+P+ W+ + +RK L + Sbjct: 164 VRFRFPGHEQVKMVESELGLIPEGWKATNIGEVAENHDRKRKPLSKMQREKFKGPYPYYG 223 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 ++ ++ ++ + + R + ++ Sbjct: 224 AAKIFDYVEDYIFDGRFVLMAED-----GSVITPDGFPVLQLANGRFWANNHTHILRGTP 278 Query: 328 IDSTYLAWL-MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ST +L + S + + + ++ R+PV +PP T ++ + Sbjct: 279 DASTEFIYLRLSSQKVSGYI---TGAAQPKITQANMNRIPVCLPPRDLMARFTELVGPKF 335 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ID L K L+ R + ++G++D+ Sbjct: 336 DLIDCLERKHTN----LRATRDLLLPKLISGELDV 366 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 55/192 (28%), Gaps = 16/192 (8%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G IP+ WK I + + + K P G ++ D Sbjct: 181 LGLIPEGWKATNIGEVAENHDRKRKPLSKMQ--------REKFKGPYPYYGAAKIFDYVE 232 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 IF +L + G + + + +L P+ ++ Sbjct: 233 DYIFDGRFVLMAEDGSVITPDGFPVLQLANGRFWANNHTHIL---RGTPDASTEFIYLRL 289 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +Q++ GA + IP+ +PP E + + ID L + Sbjct: 290 SSQKVSGYITGAAQPKITQANMNRIPVCLPPRDLMARFTELVGPKFDLIDCLERKHTNLR 349 Query: 193 ELLKEKKQALVS 204 L+S Sbjct: 350 ATRDLLLPKLIS 361 >gi|157372317|ref|YP_001480306.1| restriction modification system DNA specificity subunit [Serratia proteamaculans 568] gi|157324081|gb|ABV43178.1| restriction modification system DNA specificity domain [Serratia proteamaculans 568] Length = 409 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 51/371 (13%), Positives = 120/371 (32%), Gaps = 19/371 (5%) Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTV--SIFAKGQILYGKLGPYLRKAIIADFD--- 104 Y+ + D++ + + S +D S + KG IL+ + G + K I + Sbjct: 48 YLRITDIDEKSRNFDYCQLTSPDADLSKSDNYLLKKGDILFARTGASVGKTYIYNEQDGK 107 Query: 105 -GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 + + + + + + + + + + K G + P Sbjct: 108 VYFAGFLIRASINHEASAQFIFQNTQTHEYARFVATTSQRSGQPGINAKEYGEYRLFSPT 167 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE 223 EQ I ++DTLI + + + L K+A++ + K +++ G Sbjct: 168 EPEQTQIGNY----FQKLDTLINQHQQKHDKLSSIKKAMLEKMFPKQGETIPEIRFKGFS 223 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 + T + I ++ +I + + ++ Sbjct: 224 GEWEEKSVGQFGEIITGSTPSTQNLINYSNDGIPWVTPTDISRNVTFNTAKRLSQTGCKV 283 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL- 342 + P + + K ++ + +G + P+ D+ S Sbjct: 284 ARIVPKDTILVTCIASIGKNTI----LGTQGGFNQQINGIIPNQKDNHPYFIFSASILWS 339 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K+ + SG Q + + L P +EQ I N ++D L+++ +Q I Sbjct: 340 EKLKRSAASGTMQIVNKTEFSELKTRAPKKEEQTAIGNY----FQKLDSLIDQHQQQITK 395 Query: 403 LKERRSSFIAA 413 L + + ++ Sbjct: 396 LNNIKQACLSK 406 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 61/202 (30%), Gaps = 22/202 (10%) Query: 21 IPK--------HWKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLP 65 IP+ W+ + +F ++ TG T S I ++ D+ Sbjct: 214 IPEIRFKGFSGEWEEKSVGQFGEIITGSTPSTQNLINYSNDGIPWVTPTDISRNVTFNTA 273 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 K + I K IL + + I+ G + Q + P Sbjct: 274 KRLSQTGCKV--ARIVPKDTILVTCIASIGKNTIL-GTQGGFNQQINGIIPNQKDNHPYF 330 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + SI +++++ TM + + P EQ I ID Sbjct: 331 IFSASILWSEKLKRSAASGTMQIVNKTEFSELKTRAPKKEEQTAIGNYFQKLDSLIDQH- 389 Query: 186 TERIRFIELLKEKKQALVSYIV 207 + I L KQA +S + Sbjct: 390 ---QQQITKLNNIKQACLSKMF 408 >gi|332975815|gb|EGK12694.1| restriction endonuclease S subunit [Psychrobacter sp. 1501(2011)] Length = 574 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 68/481 (14%), Positives = 139/481 (28%), Gaps = 89/481 (18%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W + ++ ++N G +S I I + D K D Sbjct: 96 LPLGWSWIKLEDIAEINGGFAFKSSDYTSDGIRVIRISDFNEMGFKSDKVVRYPYSLDLE 155 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + IL G + K+++ I + + ++ + L+ ++ Sbjct: 156 RYRL-EENNILMAMTGGTVGKSLLVQALPEPMIVNQRVATIKLIQGINSTYINSLIRSEL 214 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE---------------- 177 Q + + +T + K I N +P+PP AEQ I K+ Sbjct: 215 IQSVINEAKNSTNDNISMKSIKNFLIPLPPFAEQKRIVAKVDELMLLCDQLEQQTETSID 274 Query: 178 -------------------------TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 RI + +K KQ ++ V L Sbjct: 275 AHATLVEVLLSTLTDSADADELAQNWARIAEHFDSLFTTEQSIKSLKQTVLQLAVMGKLV 334 Query: 213 PDVK------------------------MKDSGIEWVGL------VPDHWEVKPFFA--L 240 P ++ S + + P WE Sbjct: 335 PQNPDDEPASVLLERINEVKSKLVKEEGLRTSASKELNADDKYLTQPHGWEWMRLGNLAK 394 Query: 241 VTELNRKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQ----IVDPGEIVFRF 295 + K IE+ + ++ N ++ + E T G+ +F Sbjct: 395 FIDYRGKTPTKIENGVRLITAKNIRYGYVDLKPEEFISEDEYTSWMTRGFPKQGDTLFTP 454 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LR 354 + ++ + + A S +L + + +G Sbjct: 455 EAPLGNAANI--DIKGKFALAQRAICFQWHISEISDFLLLQILAQPFQLQLIDNATGMTA 512 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 +K +K +P+++PP+ EQ I ++ A D L +++QS + + I A Sbjct: 513 TGIKASKLKEIPMIIPPLAEQHRIVTKVDELMAICDQLKARLQQSQETQVQLTDALIDKA 572 Query: 415 V 415 + Sbjct: 573 L 573 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 60/187 (32%), Gaps = 5/187 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLK 276 + E + +P W + K++ I + + + + ++ Sbjct: 88 TENEKIFTLPLGWSWIKLEDIAEINGGFAFKSSDYTSDGIRVIRISDFNEMGFKSDKVVR 147 Query: 277 PES--YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 ++ I+ K L A + GI+STY+ Sbjct: 148 YPYSLDLERYRLEENNILMAMTGGTVGKSLLVQALPEPMIVNQRVATIKLIQGINSTYIN 207 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L+RS + V + ++ + +K + +PP EQ I ++ D L + Sbjct: 208 SLIRSELIQSVINEAKNSTNDNISMKSIKNFLIPLPPFAEQKRIVAKVDELMLLCDQLEQ 267 Query: 395 KIEQSIV 401 + E SI Sbjct: 268 QTETSID 274 >gi|261415742|ref|YP_003249425.1| restriction modification system DNA specificity domain protein [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372198|gb|ACX74943.1| restriction modification system DNA specificity domain protein [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326810|gb|ADL26011.1| type I restriction-modification system, S subunit [Fibrobacter succinogenes subsp. succinogenes S85] Length = 408 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 71/418 (16%), Positives = 136/418 (32%), Gaps = 33/418 (7%) Query: 25 WKVVPIKRFT-KLNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST--- 77 W+ V + + G ++ I ++ + ++ S Q + Sbjct: 3 WEKVKLGDVCVSIADGDHLPPPKADCGIPFVTISNITSANQFDFTNTMFVPQEYYDSLDE 62 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICST---QFLVLQPKDVLPELLQGWLLSIDVT 134 ILY +G + + I D L + L +LS D Sbjct: 63 KRKPKVNDILYSVVGSFGKPVFIKDDSPFVFQRHIAILRPDESKIYSRYLYYKMLSNDFY 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +A+ GA + N+ + IP Q I + + A I+ + I+L Sbjct: 123 MMADAVAVGAAQRTVSLTALRNMEINIPNKETQKRIADILSAYDDLIEN----NQKQIKL 178 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+E Q L P + V +P W + +V + + E Sbjct: 179 LEEAAQRLYKQWFIDLKFPGYETTPI----VDGLPQGWWKEKLGDVVDYVRGTSYTSNEL 234 Query: 255 NILS------LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + L N + Y+ I++ G++V D+ ++R + Sbjct: 235 SDNEGVLLVNLKNINAFGGYKRNAEKRFTGKYKENGILESGDLVMGCTDMTKERRLVGHV 294 Query: 309 QVMER----GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVK 363 ++ I T + + P I T+ R L + +G L+ E++ Sbjct: 295 ALIPNLKECAIFTMDLLKILPKTISKTFFYAQCRFGGLSYKISPLANGANVLHLRPENMA 354 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + VL P I + + A + +EK+E I L E R+ + + G+I + Sbjct: 355 DIEVLCPEKS----IVEMYDNVFASMISKIEKLEDQIQLAAESRNRLLPKIMNGEISV 408 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 62/197 (31%), Gaps = 14/197 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P+ W + G + S + ++ + L+++ + G Y Sbjct: 208 LPQGWWKEKLGDVVDYVRGTSYTSNELSDNEGVLLVNLKNI-NAFGGYKRNAEKRFTGKY 266 Query: 76 STVSIFAKGQILYGK------LGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGW 127 I G ++ G A+I + I + L + PK + Sbjct: 267 KENGILESGDLVMGCTDMTKERRLVGHVALIPNLKECAIFTMDLLKILPKTISKTFFYAQ 326 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 ++ +I + GA + H + + +I + P + + + +I+ L + Sbjct: 327 CRFGGLSYKISPLANGANVLHLRPENMADIEVLCPEKSIVEMYDNVFASMISKIEKLEDQ 386 Query: 188 RIRFIELLKEKKQALVS 204 E +++ Sbjct: 387 IQLAAESRNRLLPKIMN 403 >gi|229105722|ref|ZP_04236351.1| Type I restriction modification DNA specificity domain protein [Bacillus cereus Rock3-28] gi|228677611|gb|EEL31859.1| Type I restriction modification DNA specificity domain protein [Bacillus cereus Rock3-28] Length = 401 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 53/399 (13%), Positives = 124/399 (31%), Gaps = 23/399 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ ++ T + + I E Y K + D S + Sbjct: 14 EWENQKLENVVDRVTRKNKNLESKLPLTISAERGLVDQITYFNKSIAGK--DLSGYYLLK 71 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G+ Y K G+ ST ++ + ++ + L+ + + + + Sbjct: 72 SGEFAYNKSYSNGYPWGAIKRLDNYEMGVLSTLYICFKATNIHGDFLKHYFETDKWYKGV 131 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + +H N I + + KI A +++ I + + I+LL++ Sbjct: 132 SMMAAEGARNHGLLNIAVNDFFKIHLSFPEENEQRKIAAFFEKLNQKIQFQQQKIDLLQK 191 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +K+ + I + + + G W ++ R+ K ES I Sbjct: 192 QKKGYMHRIFEQEI--------PFKDENGGNHFEWRELAVSDILILHLREIPKPNESYIR 243 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + + + + +V G+ + ++ + + + Sbjct: 244 LGLRSHAKGTFHEIIDNPETITMDKLFVVHEGDFIINITFAWEQALAILDKEDHGKLVSH 303 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS---LKFEDVKRLPVLVPPIKE 374 G S + + + G L +D + V VP +E Sbjct: 304 RFPTYRFNEGHYSGFYKYYFTTKYFKYCLGNASPGGAGRNRVLNKKDFMNIIVKVPKYEE 363 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I N + +++D ++ E+ + LK+++ F+ Sbjct: 364 QIKIANFL----SKLDEKIQLEEKKLEDLKKQKKGFMQQ 398 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 84/218 (38%), Gaps = 15/218 (6%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG-NIIQKLETRN 272 ++K E+ G WE + +V + RKN L L++S ++ ++ N Sbjct: 1 MNELKLRFKEFSGE----WENQKLENVVDRVTRKNKNLESKLPLTISAERGLVDQITYFN 56 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDK-RSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + Y ++ GE + +++ E G++++ Y+ K I Sbjct: 57 KSIAGKDLSGYYLLKSGEFAYNKSYSNGYPWGAIKRLDNYEMGVLSTLYICFKATNIHGD 116 Query: 332 YLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +L + K M + G R ++ D ++ + P EQ I Sbjct: 117 FLKHYFETDKWYKGVSMMAAEGARNHGLLNIAVNDFFKIHLSFPEENEQRKIAAF----F 172 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +++ ++ +Q I LL++++ ++ +I + E Sbjct: 173 EKLNQKIQFQQQKIDLLQKQKKGYMHRIFEQEIPFKDE 210 >gi|150006638|ref|YP_001301382.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides vulgatus ATCC 8482] gi|149935062|gb|ABR41760.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides vulgatus ATCC 8482] Length = 447 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 62/418 (14%), Positives = 132/418 (31%), Gaps = 41/418 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W ++ + +G T + + YI + ++ + + + Sbjct: 30 ELPNSWVWCRLEDIAYVASGSTPDKTCFVENGVPYIKMYNLRNQKIDFAYHPQYITEEVH 89 Query: 76 STV---SIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLL 129 + S G ++ +GP L K I + ++++P L+ + Sbjct: 90 NGKLQRSRTEVGDLIMNIVGPPLGKLAIIPTTLPQANFNQAAVLIRPYKFKEVLVSYLKV 149 Query: 130 SIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 ++ I +I A + N+ +PIPPL E I E++ + ID+L Sbjct: 150 YLEEMSEINSIATRGSAGQVNISLTQSQNMRIPIPPLNEVRRIIEEVSKYDILIDSLKQN 209 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV----------------GLVPDH 231 L+ K ++ + L P + IE + VP Sbjct: 210 ITDIQNLIAYTKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTFDVPSG 269 Query: 232 WEVKPFFALVT-----------ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 W ++ + I LS ++ + Sbjct: 270 WITTNLGSIFNVVSAKRILKSDWKHSGVPFYRAREIAKLSIYGLVDNELYISEEHYNSLK 329 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 E + + +I+ + ++ + + + I++ Y+ +MRS Sbjct: 330 EKFPVPKASDIMISAVGTIGKCYIVKESDKFYYKDAS-VLCLCNDYQINAKYIYHIMRSE 388 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + K Y G ++ E K+ + +PP+ EQ I I + D + +E Sbjct: 389 YMLKQMYDNSKGTTVDTITIEKAKQYILPLPPLAEQQRIVAKIEETFSIFDGIQNSLE 446 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 60/194 (30%), Gaps = 12/194 (6%) Query: 227 LVPDHWEVKPFFALVTELNRKNT---KLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +P+ W + + +E+ + + N+ + + + E + Sbjct: 30 ELPNSWVWCRLEDIAYVASGSTPDKTCFVENGVPYIKMYNLRNQKIDFAYHPQYITEEVH 89 Query: 284 Q------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAW 335 + G+++ + K ++ + + +A + +YL Sbjct: 90 NGKLQRSRTEVGDLIMNIVGPPLGKLAIIPTTLPQANFNQAAVLIRPYKFKEVLVSYLKV 149 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + GS + ++ + + + +PP+ E I ++ ID L + Sbjct: 150 YLEEMSEINSIATRGSAGQVNISLTQSQNMRIPIPPLNEVRRIIEEVSKYDILIDSLKQN 209 Query: 396 IEQSIVLLKERRSS 409 I I L S Sbjct: 210 ITD-IQNLIAYTKS 222 >gi|240949221|ref|ZP_04753565.1| hypothetical protein AM305_09766 [Actinobacillus minor NM305] gi|240296337|gb|EER46981.1| hypothetical protein AM305_09766 [Actinobacillus minor NM305] Length = 394 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 51/388 (13%), Positives = 124/388 (31%), Gaps = 21/388 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + ++ G++ SG + G R T I K Sbjct: 19 DWEQRKLGEIAEIVMGQSPNSGNYTNNPKDHILVQGNADIKNGKVFPRIWTTQITKIGKK 78 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ P A D+D + ++ + + L + + A+ G Sbjct: 79 NDLIMSVRAPVGDMAK-TDYDVVLGRGVCAIKGNEFI----YQILSKMKIDGYWNALSTG 133 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T + I + + ++ + I ++D I R +E ++ K + + Sbjct: 134 STFDAINSND---IKKTLISIPKEQKEQTAIGNFFKQLDDTIALHQRALEKYQKLKISYL 190 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + K +++ EV F + K + ++ LS + Sbjct: 191 EKMFPKENEQFPELRFPNFTDAWEQRKLGEVVDIFDGT----HQTPKYTDKGVMFLSVED 246 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I + + E G+++ I D + + + + Sbjct: 247 IKTLSSNKFISEVDFKKEFKNFPRKGDVLMTRIG---DVGTANVVLSDHKVAYYVSLALL 303 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 KP GI S +LA + S + + + + +++++ + +P +EQ I N Sbjct: 304 KPKGIHSFFLATAISSSSVQSDIWKRTLHIAFPKKINKSEIEKIDIFLPSSEEQQKIGNF 363 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSS 409 ++D + ++ + K+ + + Sbjct: 364 ----FKQLDDTIALHQREVEKYKKIKQA 387 >gi|257094280|ref|YP_003167921.1| restriction modification system DNA specificity protein-containing protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257046804|gb|ACV35992.1| restriction modification system DNA specificity domain protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 440 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 63/428 (14%), Positives = 135/428 (31%), Gaps = 36/428 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ + +G T G D I ++ +D++S + +D S + Sbjct: 3 SEWEETTLGDCADWLSGGTPFKGNDAYWSGPIPWVSAKDMKS-FRLHDAEDHMSPLAVGK 61 Query: 77 TVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G IL G L + + + L P + + L Sbjct: 62 GGKVVPAGTILLLVRGMTLHNDVPICMVTREMAFNQDIKALHPAKNVDGAFLAYWLLAHK 121 Query: 134 TQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + ++ G + + +PPLAEQ I E + A +I+ Sbjct: 122 PDLLASVDHAGHGTGRLVTGTLKGKAVQLPPLAEQKAIAEVLGALDDKIELNRRMNATLE 181 Query: 193 ELLKEKKQALVS--YIVTKGLN-----------PDVKMKDSGIEWVGLVPDHWEVKPFFA 239 + + Q+ + V L+ + +G +P W K Sbjct: 182 AMARALFQSWFVDIHPVRAKLDGRQPAGLDSATAALFPDHLEGSPLGHIPKGWSAKSLSE 241 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 +V R+ + + L N+ + + + + E + + G+ + I Sbjct: 242 VVEVNPRRTLR-TGTIAPYLDMKNLPTQGHSADEVVDRE-FSSGTKFQNGDTLLARITPC 299 Query: 300 NDKRSLRSAQVMERGII---TSAYMAVKPHGIDSTYLAWLMRSYDLCK---VFYAMGSGL 353 + +E G + ++ Y+ + P +L+ D + + G+ Sbjct: 300 LENGKTGYVDFLEEGQVGWGSTEYIVLAPKPPLPPQFGYLLARSDPLRTHAIHNMTGTSG 359 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 RQ + E K + VPP I + TA + ++ L R + + Sbjct: 360 RQRVPSECFKSFLIAVPPP----AIACRFDELTAPLMTEIKANANQSRTLATLRDTLLPK 415 Query: 414 AVTGQIDL 421 ++G++ + Sbjct: 416 LLSGELSV 423 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 69/205 (33%), Gaps = 13/205 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G IPK W + ++N RT +G Y+ ++++ + + S+ Sbjct: 227 LGHIPKGWSAKSLSEVVEVNPRRTLRTGTIAPYLDMKNLPTQG----HSADEVVDREFSS 282 Query: 78 VSIFAKGQILYGKLGPYL--RKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW--L 128 + F G L ++ P L K DF G ST+++VL PK LP Sbjct: 283 GTKFQNGDTLLARITPCLENGKTGYVDFLEEGQVGWGSTEYIVLAPKPPLPPQFGYLLAR 342 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 T I + + + + + +PP A E I + Sbjct: 343 SDPLRTHAIHNMTGTSGRQRVPSECFKSFLIAVPPPAIACRFDELTAPLMTEIKANANQS 402 Query: 189 IRFIELLKEKKQALVSYIVTKGLNP 213 L L+S ++ G P Sbjct: 403 RTLATLRDTLLPKLLSGELSVGSCP 427 >gi|227508545|ref|ZP_03938594.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus brevis subsp. gravesensis ATCC 27305] gi|227191877|gb|EEI71944.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus brevis subsp. gravesensis ATCC 27305] Length = 405 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 46/403 (11%), Positives = 117/403 (29%), Gaps = 34/403 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDII-----YIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + K + ++ YI D+ + + KY+ K + Sbjct: 18 WEQRKLGEGLKQLKSYSLPRKYEVPESDTEYIHYGDIHTSSRKYVDKSFRLPNIKSGDFQ 77 Query: 80 IFAKGQILYGKLGPYLRKAI-------IADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G I+ ++ I + + ++ K P LS Sbjct: 78 LLQTGDIVLADASEDYKEIAEPMLMKNIKGRKVVSGLHTIAIRLKCGDPVYYLYLFLSPG 137 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G + ++ + + +P EQ I + + I ++ + Sbjct: 138 FRHYVYKVGTGLKVFGINYDKVQKYFLAVPDEKEQKYIGKILFLTDQLIAANQSKLEQLK 197 Query: 193 ELLKEKKQALVSYI--VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 L K Q + + +P + K ++ + Sbjct: 198 RLKKLLMQKIFNQEWRFKGFTDPWEQRKLGEVKTIKDGTHDSPRYV-----------PKG 246 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 +L+ + + +S V+ G+I+F I + L Sbjct: 247 YPLVTSKNLNDSGLNLSDVSYISESDFDSINKRSKVNVGDIIFGMIGTIGNPVLL----D 302 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLV 369 I + + I + +L ++S ++ ++ + ++ L + Sbjct: 303 ESNFAIKNVALLKNDGPIQNHWLIQYLKSDVFNRLTSEKTAGNTQKFIGLNVIRNLIIDT 362 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 P I EQ I + + D ++ ++ + L+ + + Sbjct: 363 PSIHEQVIIGSFL----KLTDSIIAANQRRLDHLQSLKKYLMQ 401 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 66/211 (31%), Gaps = 17/211 (8%) Query: 213 PDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 P ++ K W +G + I +K Sbjct: 7 PKIRFKGFDDPWEQRKLGEGLKQLKSYSLPRKYEVPESDTEY-----IHYGDIHTSSRKY 61 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS---LRSAQVMERGIITSAYMAVKP 325 ++ L +Q++ G+IV + + L + + +A++ Sbjct: 62 VDKSFRLPNIKSGDFQLLQTGDIVLADASEDYKEIAEPMLMKNIKGRKVVSGLHTIAIRL 121 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 D Y +L S Y +G+GL + ++ V++ + VP KEQ I ++ Sbjct: 122 KCGDPVYYLYLFLSPGFRHYVYKVGTGLKVFGINYDKVQKYFLAVPDEKEQKYIGKILF- 180 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 D L+ + + LK + + Sbjct: 181 ---LTDQLIAANQSKLEQLKRLKKLLMQKIF 208 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 51/184 (27%), Gaps = 5/184 (2%) Query: 25 WKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQS--DTSTVS 79 W+ + + G K + +++ S + S Sbjct: 221 WEQRKLGEVKTIKDGTHDSPRYVPKGYPLVTSKNLNDSGLNLSDVSYISESDFDSINKRS 280 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G I++G +G ++ + + L+ + L +L S + Sbjct: 281 KVNVGDIIFGMIGTIGNPVLLDESNFAIKNVALLKNDGPIQNHWLIQYLKSDVFNRLTSE 340 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G T I N+ + P + EQV+I + I L K Sbjct: 341 KTAGNTQKFIGLNVIRNLIIDTPSIHEQVIIGSFLKLTDSIIAANQRRLDHLQSLKKYLM 400 Query: 200 QALV 203 Q + Sbjct: 401 QNMF 404 >gi|260221108|emb|CBA29344.1| hypothetical protein Csp_A11670 [Curvibacter putative symbiont of Hydra magnipapillata] Length = 449 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 63/417 (15%), Positives = 147/417 (35%), Gaps = 27/417 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS--D 74 +P+ W P+ + + + + K + + ++ + S + + Sbjct: 3 LPQSWTTAPLGKLCEKLSDGSHNPPKAQETGMPMLSARNINDRKITFDEFRLISPEEFAE 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + + G +L +G R A++ VL+P + L + Sbjct: 63 EDRRTRVSSGDVLLTIVGAIGRTAVVPQGAPQFTLQRSVAVLKPIKSDSRYISYALEAPA 122 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + ++ +G K + + +P+ P EQ I +K+ R+D + T R Sbjct: 123 LQKYLQDNAKGTAQKGIYLKALAGVEIPVAPEPEQKRIADKLDTVLTRVDAVNTRLARVA 182 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP-----FFALVTELNRK 247 LLK +Q++++ + L D + G +P+ E +T+ Sbjct: 183 PLLKRFRQSVLAAATSGRLTEDWR--------NGSIPEVKEWSEKALSEVCRTITDGEHI 234 Query: 248 NTKLIESNILSLSYGNIIQKL-ETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQNDK 302 + L + +S ++ + + + E + G+++ + Sbjct: 235 SPPLAPHGVPLVSAKDVREWGVDFSDTKFVSEEFADASRKRCGPICGDVLVVSRGATVGR 294 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFED 361 L ++ + + I S +LA ++ S + A G+ + ++ D Sbjct: 295 TCLVKSKEKFCLMGSVLLFQPTATLIKSEFLAHVLASPLGLEQLTKASGATAQAAIYIRD 354 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 K L + +P I+EQ +I + A D L ++ Q+ + +A A +G+ Sbjct: 355 AKGLKIRLPSIEEQTEIVRRVETLFAFADRLEARLAQAQAAATRLTPALLAKAFSGE 411 >gi|317181215|dbj|BAJ59001.1| Type I R-M system specificity subunit [Helicobacter pylori F32] Length = 373 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 62/406 (15%), Positives = 123/406 (30%), Gaps = 46/406 (11%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 6 LPLNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYK 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G R I +V E L Sbjct: 64 TKYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYS 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ+ I + A + L Sbjct: 121 NVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQIAIANILSALDRYLYAL----------- 169 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 AL+ L + K E + + V + N + Sbjct: 170 ----DALI-------LKKEGVKKALSFELLSQRKRLKGFNQAWQRVRLGDIANYLTSNLS 218 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++ I+ + N ++ I D I R L + I Sbjct: 219 VEQITQQGKIKVYDVNNFIGYTDTT---FISDKPYISIVKDGSVGRVRILPP----KTNI 271 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 +++ + H + +L +L+ ++D + + F+D K + +PP+ EQ Sbjct: 272 LSTMGALIANHRTTTEFLFYLLSNFDFKNF---TSGSIIPHIYFKDYKEKTIFLPPLNEQ 328 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 329 SAIANILSALDNEIASLKNKKRQ----FENIKKALNHDLMSAKIRV 370 >gi|332535331|ref|ZP_08411130.1| type I restriction-modification system, specificity subunit S [Pseudoalteromonas haloplanktis ANT/505] gi|332035244|gb|EGI71751.1| type I restriction-modification system, specificity subunit S [Pseudoalteromonas haloplanktis ANT/505] Length = 394 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 63/414 (15%), Positives = 134/414 (32%), Gaps = 47/414 (11%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPI--KRFTKLNTGRTS-ESGKDIIYIGLEDVESGTGK 62 +P++K+ W+ + K R E K Y+ E++ + Sbjct: 14 RFPEFKN---------DAEWEKKVLNNKDIATFVKDREPLEQLKLNSYVSTENLLADYAG 64 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 ++ +L + PYL+K AD G S +V++P + Sbjct: 65 VAKASKLPPSGSFTSYK---PNDVLISNIRPYLKKVWCADKIGAASNDVIVIRPNAKVSA 121 Query: 123 -LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + L + + + +G M D I P+ +P L EQ I + + + + Sbjct: 122 AYMLHILKNDEFINFVMKGAKGVKMPRGDIASIKAYPVALPRLPEQQKIADCLSSLDKLV 181 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 + ++ LK K+ L+ + +++ E F + Sbjct: 182 SA----NNQKLDALKAHKKGLMQQLFPAEGETVPELRFPEFE-NQTSWKKRSFSKLFEIG 236 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + K L NI G ++ + + ++ + Sbjct: 237 GGKDHK--HLPSGNIPVYGSGGYMRSVNEFLYDGESACIGRKGTINKPMFL--------- 285 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + + + + +G + ++ L ++ D + A G SL Sbjct: 286 --------NGKFWTVDTLFYTHSFNGCTARFIYLLFQNIDWLSLNEA---GGVPSLSKVI 334 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + ++ V++P IKEQ IT+ I+ ++ L+ Q I LK + + Sbjct: 335 INKIEVMIPEIKEQHRITDCIDS----LEELITAQSQKIGALKTHKRGLMQQLF 384 >gi|226951289|ref|ZP_03821753.1| type I restriction modification enzyme protein S [Acinetobacter sp. ATCC 27244] gi|226837962|gb|EEH70345.1| type I restriction modification enzyme protein S [Acinetobacter sp. ATCC 27244] Length = 399 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 58/408 (14%), Positives = 131/408 (32%), Gaps = 29/408 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 ++V I + G + + + +++ G L +S S Sbjct: 3 QIVKIGNISTQIRGVSYSKSDAVSNMQEGYLPVLRANNIQE-QGLILEDFVYVPESKISK 61 Query: 78 VSIFAKGQILY----GKLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLS 130 G ++ G + + A + F + V P + + Sbjct: 62 KQRILAGDVIIAASSGSISLVGKAASAKEDINAGFGAFCKILRPNTELVDPRYFANYFQT 121 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 Q I + GA +++ + + ++ +P+PPL+EQ I + V + Sbjct: 122 QQYRQIISNLAAGANINNLKNEHLDDLEIPLPPLSEQRRIASILDQADVLRQKRQQAIEK 181 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 +LL+ + +P K ++ + D ++ PF + + + Sbjct: 182 LDQLLQAT-------FIDMFGDPVSNPKGFEVKKLSEQVDLIQIGPFGTQLHQEDYIENG 234 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + N + G I+ L+ LK Y + +++ + +V Sbjct: 235 IPLINPSHIKNGKIVPNLKLSVSQLKYGELSQYH-LKLHDVLLGRRGEMGRCAVVTQNEV 293 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 S ++ I+ +L L+ S + + + G +L V +P++ Sbjct: 294 GWLCGTGSLFLRPNVEKINPFFLEMLLSSDSIKRYLENVSQGQTMANLNKTIVGSIPLIA 353 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 P I+ Q + + I+ + ++E S + S A G Sbjct: 354 PSIEIQNKF--FL--ISEEINKMKTELENSKNQVNNLFQSLQNHAFNG 397 >gi|224282782|ref|ZP_03646104.1| Type I restriction-modification system specificity subunit [Bifidobacterium bifidum NCIMB 41171] gi|313139941|ref|ZP_07802134.1| type I restriction-modification system specificity subunit [Bifidobacterium bifidum NCIMB 41171] gi|313132451|gb|EFR50068.1| type I restriction-modification system specificity subunit [Bifidobacterium bifidum NCIMB 41171] Length = 397 Score = 111 bits (277), Expect = 3e-22, Method: Composition-based stats. Identities = 55/398 (13%), Positives = 124/398 (31%), Gaps = 29/398 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W + + + + + D+ + + G + S Sbjct: 18 DWDEMTLGDVGSVAMCKRVFKEQTCEVGDVPFFKIGTF--GGAPDSYISQSLFDELKSKY 75 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + G IL G R+ D +V + + L T Sbjct: 76 AYPKVGTILLSASGTIGRQVEYKGEDAYYQDSNIVW---LEHDDTVLDSYLKQFYTVVKW 132 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG+T+ K I + P P L EQ I + + A I E + + K Sbjct: 133 QGLEGSTIKRLYNKTILDTPFYRPSLPEQRKIADFLSAVDAVIAAQQAEVDAWEQRKKGV 192 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q L S V + + +G A + NT E ++ + Sbjct: 193 MQKLFSQEVRFKADDGSDFPKWEEKTLGE-----YCTQLKASIDPRKSPNTIFAEYSMPA 247 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + R E +I+ ++ ++++ + L + + +S Sbjct: 248 FDESRKARFVSGR------EMNSARKILSEPCVLVNKLNVRKRRIWLVK-NPEQNAVCSS 300 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQ 375 ++ + + I+ T+L++ + SG ++ + + + + +P + EQ Sbjct: 301 EFVPLSSNAINLTFLSYFALTDRFTSYLMDCSSGSSNSQKRVVPDVILNYVMQIPSLPEQ 360 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + + A +D ++K + + +E + + Sbjct: 361 RKIADCL----ASMDEAIQKSKDELAKWQELKKGLLQQ 394 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 45/153 (29%), Gaps = 10/153 (6%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + L E Y G I+ + E + + Sbjct: 60 DSYISQSLFDELKSKYAYPKVGTILLSASGTIGRQV----EYKGEDAYYQDSNIVWLE-- 113 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D T L ++ + + + + L + + P P + EQ I + ++ Sbjct: 114 HDDTVLDSYLKQFYTVVKWQGLEGSTIKRLYNKTILDTPFYRPSLPEQRKIADFLSAV-- 171 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 D ++ + + ++R+ + + ++ Sbjct: 172 --DAVIAAQQAEVDAWEQRKKGVMQKLFSQEVR 202 >gi|325953723|ref|YP_004237383.1| restriction modification system DNA specificity domain protein [Weeksella virosa DSM 16922] gi|323436341|gb|ADX66805.1| restriction modification system DNA specificity domain protein [Weeksella virosa DSM 16922] Length = 407 Score = 111 bits (277), Expect = 3e-22, Method: Composition-based stats. Identities = 58/407 (14%), Positives = 129/407 (31%), Gaps = 38/407 (9%) Query: 26 KVVPIKRFTKL-NTGRTSESGKDIIYIGLED-VESGTGKYLPKDGNSRQSDTSTVSI--- 80 + + + NTG + D + + L + V+ +Y+ D + + I Sbjct: 14 EWKTLGEVVDIANTGVDKKINADELTVRLLNFVDVFKNQYISNDTPTMIVTATERKIADC 73 Query: 81 -FAKGQILYGKLGPYLRKAI-----IADFDGICSTQFL----VLQPKDVLPELLQGWLLS 130 KG + + + I DFD + + + + + P L S Sbjct: 74 NVKKGDVFITPTSELIDEIGFSAMAIEDFDNVVYSYHIMRLRINNQNYLFPAYLNYLFKS 133 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 D+ ++I +G T +I +PIPPL Q I + T Sbjct: 134 KDIRKQIRKKAQGITRYGLTQPNWKSIQIPIPPLDVQQEIVRILDRFTELTAE------- 186 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 L +KQ +N + M + +EW + T N Sbjct: 187 ---LTARQKQYEYYREQLLMVNDEGLMNNEKVEW---KKLGEIAVKISSGGTPSTSINDY 240 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + + + + +I+ ++ K + Sbjct: 241 YDGDIPWLRTQEVDFKDIWDTEIKITEAGLKNSSAKIIPENCVIVAMYGATVGKIGINKI 300 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + + + V + + Y+ + S + ++G+G + ++ + +K + Sbjct: 301 PLSTNQACAN--IHVDENIANYRYVFHYLSSKY--EHIKSLGTGTQTNINAQIIKNYLIP 356 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 VPP+ EQ I +++ + E + + I L ++ R + Sbjct: 357 VPPLAEQERIVAILDKFDTLTSSITEGLPREIELRQKQYEYYRDQLL 403 >gi|298735606|ref|YP_003728129.1| type I R-M system specificity subunit [Helicobacter pylori B8] gi|298354793|emb|CBI65665.1| type I R-M system specificity subunit [Helicobacter pylori B8] Length = 377 Score = 111 bits (277), Expect = 3e-22, Method: Composition-based stats. Identities = 55/406 (13%), Positives = 116/406 (28%), Gaps = 41/406 (10%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 6 LPSNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYK 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G R I +V E L Sbjct: 64 TKYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYTYS 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ+ I + + L ++ + Sbjct: 121 HVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQIAIANILSDVDRYLYNLDALILKKEGVK 180 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K L+S ++K W + + + +L Sbjct: 181 KALSFELLSQ--------RKRLKGFNQAW-----QKVRLGDIAEIKRGVRITKNELDVFG 227 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + G + T N + Q G + F+ + Sbjct: 228 KYPVVSGGVGFLGYTNNFNRYENTITIAQYGTAGYVNFQKNKFWANDVCF---------- 277 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + I + +L + ++ + + S+ + + +L+PP+ EQ Sbjct: 278 ----CIYPNKDIIKNIFLYYFLKVNQNYLYEISNRNATPYSISKDKILDFEILLPPLNEQ 333 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 334 IAIANILSALDNEIISLKNKKRQ----FENIKKALNHDLMSAKIRV 375 >gi|254190416|ref|ZP_04896924.1| type I restriction-modification system, endonuclease S subunit [Burkholderia pseudomallei Pasteur 52237] gi|157938092|gb|EDO93762.1| type I restriction-modification system, endonuclease S subunit [Burkholderia pseudomallei Pasteur 52237] Length = 387 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 70/399 (17%), Positives = 150/399 (37%), Gaps = 34/399 (8%) Query: 23 KHWKVVPIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W++ + R Y+GLE +++ + K + + +T + Sbjct: 9 NGWRIWRFDQMATNVNVRIDNPSESGVEHYVGLEHLDADSLKI--RRWGTPDDVEATKLM 66 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRIE 138 F KG I++G+ Y RK +A+FDGICS +V +P VLP+ L ++ S +R Sbjct: 67 FKKGDIIFGRRRAYQRKLGVAEFDGICSAHAMVLRAKPDVVLPDFLPFFMQSDLFMKRAV 126 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I G+ +WK + +PP+ EQV E + I + Sbjct: 127 EISVGSLSPTINWKTMAIQEFVLPPIDEQVRHVELL--------QAIERASESHRKIGCS 178 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 LV +++ LN + + D G V ++ + +++ Sbjct: 179 ADKLVRSLLSDVLNREWPVVDLG----------SVVYETQYGLSINAGSEGRYPMLRMMN 228 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + G ++ + + + L + +E Y++V G+++F + ++ + S Sbjct: 229 IEDGLCVEN-DIKYVDLSDKDFEAYRLVH-GDVLFNRTNSYELVGRTGVYELDGDHVFAS 286 Query: 319 AY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKE 374 + P ++ +LA + S + A + + ++ ++ R+ + +PP+ Sbjct: 287 YLVRIKTNPERLEPKFLAQYLNSDFGRRQVLAFATKAVSQANVNASNLLRIRLPLPPLDV 346 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q + E A+ ++E + +A Sbjct: 347 QQQ----LLDEIAKAKSAETAATVRRSYVEEMKKQLLAE 381 >gi|172039948|ref|YP_001799662.1| type I restriction-modification system, specificity subunit [Corynebacterium urealyticum DSM 7109] gi|171851252|emb|CAQ04228.1| type I restriction-modification system, specificity subunit [Corynebacterium urealyticum DSM 7109] Length = 384 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 56/405 (13%), Positives = 122/405 (30%), Gaps = 37/405 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDI------IYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W +V + + +G T ++ +I D+ + S Sbjct: 6 DWPMVKLGDLGRFASGGTPNRKREEFYQGETPWISSADISEDGKITARRFITDEAIAKSA 65 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G +L + AI +L L + + Sbjct: 66 TTEVPAGTLLVAVRIGVGKTAITTSPTCFSQDVVALLDTDPNEVSTGFLQHLITWLRPHL 125 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E I G T+ + ++ +P+PPLAEQ I + + ++I + L E Sbjct: 126 EQIARGVTIKGITIGDLKDLNIPLPPLAEQRRIAKILDTVNIQIHR---TKEASNYLKDE 182 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +A + P + + +RK+ + +I Sbjct: 183 LARAFFQQLGR-----------------NSQPAQIKTLATVTTGSTPSRKHPEYYGGSIP 225 Query: 258 SLSYG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + + +I PG ++ Q R + Sbjct: 226 WVKTNEVSGTAITSTEETITETGLENSSCKINPPGTVLVAMYG-QGRTRGSAGILRIPAT 284 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIK 373 + + DS Y+ + +++ + ++G G + +L ++ + PP Sbjct: 285 TNQACAAISCTNPADSDYVYFALKASY--EELRSLGRGGTQPNLNLGLIRGFSIPYPP-A 341 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ + + + +++ L E + L+E +S A A G+ Sbjct: 342 EQRE---ELTITIKKMENLNHAYETQLQKLEELNASLSARAFAGK 383 >gi|71024881|ref|YP_263290.1| hypothetical protein pAG6_01 [Lactococcus lactis subsp. cremoris] gi|70067198|dbj|BAE06236.1| HsdS [Lactococcus lactis subsp. cremoris] Length = 388 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 51/402 (12%), Positives = 119/402 (29%), Gaps = 39/402 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ + ++ G++ S + G R Sbjct: 15 KVPELRFKGFTDEWEQRKLGDEVRIVMGQSPNSENYTDDPNDYILVQGNADMKNGRVLPR 74 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 T K ++ P +D + ++ + + L + Sbjct: 75 VWTTQVTKQAEKDDLILSVRAPV-GDIGKTAYDVVIGRGVAAIKGNEFI----FQNLGKM 129 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G+T + I + +P + EQ I ++D I R Sbjct: 130 KSDGYWTRYSTGSTFESINSTDIKEAIISVPAIEEQDKIGSF----FKQLDNTIALHQRK 185 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I+LLKE+K+ + + K +++ G + RK+ +L Sbjct: 186 IDLLKEQKKGYLQKMFPKNGAKVPELRFEGFADDWEL-----------------RKSKEL 228 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + + ++ + + E V D + Sbjct: 229 CTISTGKGNTQDKVDDGAYPFYVRSATIEKSDEYLYDQEAVLTVGDGVGT-GKVYHYVNG 287 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + Y + + Y + +V S++ E + + ++ P Sbjct: 288 KYNLHQRVYRMYDFKDVSAKYFYYYFSKNFYKRVMSMTAKTSVDSVRMEMIADMNIVFPS 347 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +KEQ +I + +D + ++ + LLKE++ F+ Sbjct: 348 VKEQENIVE----LFSNLDNTIALHQRKLDLLKEQKKGFLQK 385 >gi|85711478|ref|ZP_01042536.1| putative specificity protein s [Idiomarina baltica OS145] gi|85694630|gb|EAQ32570.1| putative specificity protein s [Idiomarina baltica OS145] Length = 426 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 64/424 (15%), Positives = 130/424 (30%), Gaps = 37/424 (8%) Query: 25 WKVVPIKRFTK-----LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 W + + TG S + +D+ G + Sbjct: 3 WNESTLGDICDAGQGIIKTGPFGSQLHQSDYSDAGTPVVMPKDIVGG--RVSESSIARVA 60 Query: 73 SDTS---TVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQ--PKDVLPELLQ 125 + + G I+YG+ G R A++ + +C T L + +V P+ L Sbjct: 61 EEHVERLSHHQLYPGDIVYGRRGDIGRCALVTPRESGWLCGTGCLRIHLGNGEVSPKFLF 120 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L + I GATM + + + +IP+ P Q I + A I+ Sbjct: 121 YFLNNPSTVDWIYNQAVGATMPNLNTSILRSIPVRYPTRETQERIAAFLSAYDDLIENNT 180 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 E + L P + +G +P+ WEV+ V Sbjct: 181 RRIEILE----EMARRLYEEWFVHFRFPGHEGVSFKESELGDIPEGWEVRRLEDAVALNP 236 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R + + + L+ ++ + G+ +F I + Sbjct: 237 RTKVPKEGEKLFVP--MGALSESSMIVGSLERKTGNSGAKFQNGDTLFARITPCLENGKT 294 Query: 306 RSAQVMER----GIITSAYMAVKPHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKF 359 + ++ ++ ++ + + L RS + G+ RQ ++ Sbjct: 295 GFVDFLPEDQPTACGSTEFIVLRSVSLCPEMVYLLARSDRFRDVAIKSMSGATGRQRVRV 354 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 E + PV+ P ++ + L K L+ +R + V+G+I Sbjct: 355 ESLVEFPVVQPDNATLEAFQRFVSPCFKQARTLALKN----ANLRAQRDLLLPKLVSGEI 410 Query: 420 DLRG 423 D+ Sbjct: 411 DVSD 414 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 57/145 (39%), Gaps = 15/145 (10%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 +K+S +G IP+ W+V ++ LN + +++ + + + G Sbjct: 210 SFKESE---LGDIPEGWEVRRLEDAVALNPRTKVPKEGEKLFVPMGALSESSMIV----G 262 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYL--RKAIIADF------DGICSTQFLVLQPKDVL 120 + + ++ + F G L+ ++ P L K DF ST+F+VL+ + Sbjct: 263 SLERKTGNSGAKFQNGDTLFARITPCLENGKTGFVDFLPEDQPTACGSTEFIVLRSVSLC 322 Query: 121 PELLQGWLLSIDVTQRIEAICEGAT 145 PE++ S GAT Sbjct: 323 PEMVYLLARSDRFRDVAIKSMSGAT 347 >gi|145628526|ref|ZP_01784326.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae 22.1-21] gi|145639724|ref|ZP_01795326.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae PittII] gi|144978996|gb|EDJ88682.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae 22.1-21] gi|145271092|gb|EDK11007.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae PittII] Length = 394 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 51/385 (13%), Positives = 127/385 (32%), Gaps = 25/385 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + + T + TG++ + +G Y + Sbjct: 18 EWKSLGKVTDIKTGQSVSKN---------IIAQNSGIYPVINSGREPLGFINEWNTENDP 68 Query: 86 ILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I G + + + + V + + + + + + I +C Sbjct: 69 IGITTRGAGVGSITWQEGKYFRGNLNYSVTIKSEYELNVRFLYHVLLHFQKEIHNLCSFT 128 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + + +PIPPL+ Q I + + A T L +E + L +++ + Sbjct: 129 GIPALNASELKKLEIPIPPLSVQTEIVKILDALTALTSELTSELTSELILRQKQYEYYRE 188 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 ++++ ++ G EW K + T N I L + Sbjct: 189 KLLSEE-----ELGKVGFEW---KTIDEISKKISSGGTPTTSNNGYYDNGTIPWLRTQEV 240 Query: 265 IQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 K E + + + ++ K ++ + + + Sbjct: 241 DFKEIWDTNIKITEDALNNSSAKWIPANCVIVAMYGATVGKTAINKIPLTTNQACAN--I 298 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + Y+ + S + ++GSG + ++ + +K+L V VPPI+EQ+ I ++ Sbjct: 299 EINDKLACYRYIFHYLTSKY--EYIKSLGSGSQTNINAQIIKKLKVPVPPIEEQYRIVSI 356 Query: 382 INVETARIDVLVEKIEQSIVLLKER 406 ++ + + E + +I ++R Sbjct: 357 LDKFETLTNSITEGLPLAIEQSQKR 381 >gi|327390070|gb|EGE88414.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA04375] Length = 353 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 49/390 (12%), Positives = 102/390 (26%), Gaps = 39/390 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSG-TLGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI + L EQ I ++ + I + L + S Sbjct: 120 MKHLTKKYFDNIMVSYTNLREQQRIASELDLLSKLILRRQEQLEELNLL-------VKSR 172 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +P K ++ G + F + + I Sbjct: 173 FNEMFGDPLNNNKKFAVKT-GQQCFKFSSGKFLDKHDRVFEGYPAYGGNGIAW------- 224 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 ++D I+ + + I+ + +K Sbjct: 225 --------------KSRKYLIDNPTIIIGRVGA----YCGNVRTTHGKVWISDNAIYIKE 266 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 L +L+ + + + + ++ ++PP+ Q + + + Sbjct: 267 FKNSDFNLVFLLELMKVIDFSKFADFSGQPKITQKPLENQKYILPPLALQNEFADFV--- 323 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 A +D I++S+ L+ + S + Sbjct: 324 -ALVDKSQLAIQKSLEELETLKKSLMQEYF 352 >gi|300361584|ref|ZP_07057761.1| type I restriction-modification system [Lactobacillus gasseri JV-V03] gi|300354203|gb|EFJ70074.1| type I restriction-modification system [Lactobacillus gasseri JV-V03] Length = 468 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 49/408 (12%), Positives = 108/408 (26%), Gaps = 37/408 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W + + GR + + + L V + Sbjct: 54 ELPSSWDWITLGSGVTFYNGRAYKKKELLSDDKLTPVLRVGNLFTNSSWYYSDLSLDENK 113 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G ++Y + K + + +V+ + L E Sbjct: 114 YIDNGDLIYAWSASFGPKIWNGGHVIYHYHIWKLEYDNNVIDTNFLYYFLLDKRNVVGET 173 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+TM H + ++P P+PPL EQ I KI + + + ++ +L K Sbjct: 174 DLHGSTMKHITKTNMEHLPFPLPPLEEQSRIAAKIAQLFALLRKVESSTQQYAKLQTLLK 233 Query: 200 QALVSYIVTKGLNPDVK-------------------------------MKDSGIEWVGLV 228 ++ + L + E + Sbjct: 234 SKVLDLAMRGKLVKQDPHDEPASVLLEKIKAEKEQLIKEKKIKKSKPLPPITDKEKPFDI 293 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESN-----ILSLSYGNIIQKLETRNMGLKPESYETY 283 PD WE + + T ++ + N + + Sbjct: 294 PDSWEWVRLGEVAESIRYGYTASAQATGNAKLLRITDIQNNNVNWNMVPLCNISDMKLKD 353 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + +I+ K V + + S ++ +++ + Sbjct: 354 LSLHKKDILIARTGGTIGKNYFVKQIVEPTVFASYLIRVRNINKKVSNFIQYVLDAPIYW 413 Query: 344 KVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 A SG + ++ ++ +PP++EQ I + I + Sbjct: 414 NFISAKKSGTGQPNVNAAKLENFIFPIPPLEEQNRIVDKIINLIDLFN 461 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 74/213 (34%), Gaps = 12/213 (5%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE-----SNILSLSYGNII 265 L +K S IE +P W+ + VT N + K E L GN+ Sbjct: 39 LLKKNNLKRS-IEEPHELPSSWDWITLGSGVTFYNGRAYKKKELLSDDKLTPVLRVGNLF 97 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 L + + +D G++++ + K + I Y Sbjct: 98 TNSSWYYSDLSLDENK---YIDNGDLIYAWSASFGPKIWNGGHVIYHYHIWKLEY---DN 151 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + ID+ +L + + + + + +++ LP +PP++EQ I I Sbjct: 152 NVIDTNFLYYFLLDKRNVVGETDLHGSTMKHITKTNMEHLPFPLPPLEEQSRIAAKIAQL 211 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 A + + +Q L +S + A+ G+ Sbjct: 212 FALLRKVESSTQQYAKLQTLLKSKVLDLAMRGK 244 >gi|212691155|ref|ZP_03299283.1| hypothetical protein BACDOR_00645 [Bacteroides dorei DSM 17855] gi|212666387|gb|EEB26959.1| hypothetical protein BACDOR_00645 [Bacteroides dorei DSM 17855] Length = 429 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 57/406 (14%), Positives = 119/406 (29%), Gaps = 34/406 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W+ ++ G+T + +++ +D++ + Sbjct: 28 LPNGWEWCNLEDIVSFGGGKTPSMDNKEYWDNGNHLWVTSKDMKYSYITNSLMKITDKAL 87 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAI---IADFDGICSTQFLVLQPKDV-LPELLQGWLL 129 + +I+ KG +L LR + I + + + P L E L + Sbjct: 88 EVM--TIYEKGTLLVVTRSGILRHTLPLSILEKPATVNQDLKTISPHIQELSEYLYVVIK 145 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + E +G T+ D+ +P+P+ P+AEQ I + ID + ++ Sbjct: 146 ANEHFILKEYHKDGTTVDSIDFDKFRCLPIPLAPIAEQKRIIVETKRWFALIDQVEQGKV 205 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV------------GLVPDHWEVKPF 237 +K+ K ++ + L P + IE + G P W Sbjct: 206 DLQTTIKQAKSKILGLAIHGKLVPQDLNDEPAIELLKRINPDFTPCDNGHYPVGWIETIL 265 Query: 238 FALVTELNRKNTKLIESNILSLSY------GNIIQKLETRNMGLKPESYETYQIVDPGEI 291 L + K + Y ES V G++ Sbjct: 266 GELFSHNTGKALNSSNKEGIFKDYLTTSNVYWNKFDFTAIKQMPFKESELNKCTVTKGDL 325 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + I + ++P + +Y Sbjct: 326 LVCEGGDIGRSAIW---NYDYDICIQNHIHRLRPKIDLCVPFYYYTFAYLKENNLIGGKG 382 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 L + ++ + +PP+ EQ I I + +D + +E Sbjct: 383 IGLLGLSSNALHKIEMPLPPLAEQQRIVQKIEELFSVLDNIQNALE 428 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 55/169 (32%), Gaps = 4/169 (2%) Query: 19 GAIPKHWKVVPIKRFTKLNTGR--TSESGKDI--IYIGLEDVESGTGKYLPKDGNSRQSD 74 G P W + NTG+ S + + I Y+ +V + + Sbjct: 254 GHYPVGWIETILGELFSHNTGKALNSSNKEGIFKDYLTTSNVYWNKFDFTAIKQMPFKES 313 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 KG +L + G R AI IC + + + + + Sbjct: 314 ELNKCTVTKGDLLVCEGGDIGRSAIWNYDYDICIQNHIHRLRPKIDLCVPFYYYTFAYLK 373 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + +G + + I MP+PPLAEQ I +KI +D Sbjct: 374 ENNLIGGKGIGLLGLSSNALHKIEMPLPPLAEQQRIVQKIEELFSVLDN 422 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 66/198 (33%), Gaps = 8/198 (4%) Query: 229 PDHWEVKPFFALVTELNRK------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 P+ WE +V+ K N L ++ ++ T ++ + Sbjct: 29 PNGWEWCNLEDIVSFGGGKTPSMDNKEYWDNGNHLWVTSKDMKYSYITNSLMKITDKALE 88 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + + + +L + + + + + PH + + +++ + Sbjct: 89 VMTIYEKGTLLVVTRSGILRHTLPLSILEKPATVNQDLKTISPHIQELSEYLYVVIKANE 148 Query: 343 CKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + S+ F+ + LP+ + PI EQ I A ID + + Sbjct: 149 HFILKEYHKDGTTVDSIDFDKFRCLPIPLAPIAEQKRIIVETKRWFALIDQVEQGKVDLQ 208 Query: 401 VLLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 209 TTIKQAKSKILGLAIHGK 226 >gi|168484775|ref|ZP_02709720.1| restriction modification system DNA specificity domain [Streptococcus pneumoniae CDC1873-00] gi|172042074|gb|EDT50120.1| restriction modification system DNA specificity domain [Streptococcus pneumoniae CDC1873-00] Length = 353 Score = 111 bits (276), Expect = 3e-22, Method: Composition-based stats. Identities = 49/390 (12%), Positives = 102/390 (26%), Gaps = 39/390 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + G + +D G E + + N I G Sbjct: 2 KKVKLGEVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSTEINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWRGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI +P L EQ I ++ + I + L + S Sbjct: 120 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNLL-------VKSR 172 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +P K ++ G + F + + I Sbjct: 173 FNEMFGDPLNNNKKFAVKT-GQQCFKFSSGKFLDKHDRVFEGYPAYGGNGIAW------- 224 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 ++D I+ + + I+ + +K Sbjct: 225 --------------KSRKYLIDNPTIIIGRVGA----YCGNVRTTHGKVWISDNAIYIKE 266 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 L +L+ + + + + ++ ++PP+ Q + + + Sbjct: 267 FKNSDFNLVFLLELMKVIDFSKFADFSGQPKITQKPLENQKYILPPLALQNEFADFV--- 323 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 A +D I++S+ L+ + S + Sbjct: 324 -ALVDKSQLAIQKSLEELETLKKSLMQEYF 352 >gi|317130965|ref|YP_004097247.1| restriction modification system DNA specificity domain [Bacillus cellulosilyticus DSM 2522] gi|315475913|gb|ADU32516.1| restriction modification system DNA specificity domain [Bacillus cellulosilyticus DSM 2522] Length = 414 Score = 110 bits (275), Expect = 3e-22, Method: Composition-based stats. Identities = 81/400 (20%), Positives = 156/400 (39%), Gaps = 27/400 (6%) Query: 24 HWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF- 81 W++VP K + + R + K YIGLE ++S T K + D + Sbjct: 14 GWRLVPFKLMAEHISKRVEPKETKLKYYIGLEHLDSKTLKI---KRHGTPEDVQGTKLVA 70 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPELLQGWLLSIDVTQRIEA 139 G I++GK Y K I ++D I S +VL+ ++ ++ ELL ++ S + R Sbjct: 71 KPGDIIFGKRRAYQGKVAICEWDAIVSAHSMVLRAQEEVIIKELLPFFMQSQEFYNRSLK 130 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I EG+ WK + IPP Q I EK+ + I + +E + K Sbjct: 131 ISEGSLSPTIKWKVLAEEKFIIPPKNIQRDIIEKLN----ATEDNINCKEILLEKTLKYK 186 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + LV+ ++T+G+N S +G +P WE+K + +K +S Sbjct: 187 EKLVNKLLTRGVNHSNYKPSS----IGEIPKDWELKRIDDVCNINPQKEKIADTDTEISF 242 Query: 260 SYGNIIQ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI- 315 I K+ E + +++ I + AQ ++ I Sbjct: 243 LTMEDISNDAKIINLRERKYSEVSSGFTSFRENDVIVAKITPCFENGKGALAQNLKNSIG 302 Query: 316 --ITSAYMAVKPHGIDSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 T ++ + Y+ + + + GS ++ + E ++ + +PP Sbjct: 303 FGSTEFHILRAKDEVLPKYIYYHTTNKLFRTLGEWNMTGSAGQKRVPKEFLEGFKIGIPP 362 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + EQ I +++ ++ ++ IE +I K+ + + Sbjct: 363 LTEQRKIVEILDG----LENVISNIESNIKNTKKVKEELL 398 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 80/210 (38%), Gaps = 14/210 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLN--TGRTSESGKDIIYIGLEDVESGTGKYLPKD 67 YK S IG IPK W++ I +N + +++ +I ++ +ED+ S K + Sbjct: 203 YKPSS---IGEIPKDWELKRIDDVCNINPQKEKIADTDTEISFLTMEDI-SNDAKIINLR 258 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDV-L 120 +S + F + ++ K+ P + + G ST+F +L+ KD L Sbjct: 259 ERKYSEVSSGFTSFRENDVIVAKITPCFENGKGALAQNLKNSIGFGSTEFHILRAKDEVL 318 Query: 121 PELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 P+ + + E G+ + + + IPPL EQ I E + Sbjct: 319 PKYIYYHTTNKLFRTLGEWNMTGSAGQKRVPKEFLEGFKIGIPPLTEQRKIVEILDGLEN 378 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTK 209 I + + ++ +E L + Sbjct: 379 VISNIESNIKNTKKVKEELLIFLFDPKFYQ 408 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 68/189 (35%), Gaps = 8/189 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV-DP 288 D W + PF + ++++ + ++ K PE + ++V P Sbjct: 13 DGWRLVPFKLMAEHISKRVEPKETKLKYYIGLEHLDSKTLKIKRHGTPEDVQGTKLVAKP 72 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G+I+F K ++ + S + + I L + M+S + Sbjct: 73 GDIIFGKRRAYQGKVAICEWDAIVSA--HSMVLRAQEEVIIKELLPFFMQSQEFYNRSLK 130 Query: 349 MGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + G ++K++ + ++PP Q DI +N I+ +E+++ + Sbjct: 131 ISEGSLSPTIKWKVLAEEKFIIPPKNIQRDIIEKLNATEDNINCKEILLEKTLK----YK 186 Query: 408 SSFIAAAVT 416 + +T Sbjct: 187 EKLVNKLLT 195 >gi|170025888|ref|YP_001722393.1| restriction modification system DNA specificity subunit [Yersinia pseudotuberculosis YPIII] gi|169752422|gb|ACA69940.1| restriction modification system DNA specificity domain [Yersinia pseudotuberculosis YPIII] Length = 410 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 60/418 (14%), Positives = 135/418 (32%), Gaps = 40/418 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKD---------IIYIGLEDVESGTGK 62 +P+ W+ + ++ G + +D + ++ + DV G+ Sbjct: 6 KVPEIRFKGFGGEWEDKVLGELAEIVRGASPRPIEDPKWFDSQSSVGWLRIRDVTEQDGR 65 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + + + +L + + G+ + +P L Sbjct: 66 IHYLEQRISKLGQEKTRVLHEKHLLLSIAASVGKPVVNYVETGVHDGFLIFKKPLFEL-- 123 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + + + + + + + IP EQ I I+ Sbjct: 124 -EFMYQWLKSFEAKWQQFGQPGSQVNLNSDIVKSQVVAIPTNEEQTTIGNYFQKLDSLIN 182 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLN--PDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 + + L K+A++ + K P+++ K EW + + Sbjct: 183 QH----QQKHDKLSSIKKAMLEKMFPKQGETMPEIRFKGFSGEWN-----YLALGENAKF 233 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP--GEIVFRFIDL 298 L+ S + YG + K +T G+ E + V GE++ Sbjct: 234 TKGQGYSKGDLVTSGSPIILYGRLYTKYQTVITGVDTFVTEKNKSVKSIGGEVIVPASGE 293 Query: 299 QNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQS 356 + S S II + + I ST+LA + + L K + G Sbjct: 294 SPEDISRASVVSEPNVIIGGDLNIVLPSKKIHSTFLALAISNGHLKKKLSSKAQGKSVVH 353 Query: 357 LKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ D+ L +++P EQ I N ++D L+ + +Q I L + + ++ Sbjct: 354 IRNSDLADLDLILPTEYMEQTAIGNY----FQKLDELINQHQQQISKLNNIKQACLSK 407 >gi|167620604|ref|ZP_02389235.1| Restriction modification system DNA specificity domain [Burkholderia thailandensis Bt4] Length = 392 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 54/413 (13%), Positives = 137/413 (33%), Gaps = 46/413 (11%) Query: 29 PIKRFTKLNTGRTSESGK------DIIYIGLEDV----ESGTGKYLPKDGNSRQSDTSTV 78 + + G T + G + IY+ + D+ S + +S Sbjct: 8 RLGDIADVQQGYTFKPGYQGQSSGEWIYVKVADIGSPASSKYLRKSQNYVSSEVLREMRA 67 Query: 79 SIFAKGQILYGKLGPYLRKAI--IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + F G I++ ++G LR I + + +V+ +D + D Sbjct: 68 TPFPAGSIVFPRVGAALRNNNKRILAENSLTDDNVIVVTVRDTQICDPEYLYYWFDFHDL 127 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + C T+ + + + + +P +A Q + + ++ + R I+ + Sbjct: 128 QD-FCNAGTVPVINGRNLKIQEVMLPSIAIQRVTASALSTWDAALEKI----QRLIDAKE 182 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + + L+ ++ K L + AL ++ +N + Sbjct: 183 RRHRGLLIRLLGKRL-----------------WSDCRHERADALFASVSERNQPELPVLA 225 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 ++ G + + + R + ++ +++V + V Q G++ Sbjct: 226 VTQDQGVVPRTMLDRRITMELSDPANFKVVRKDDFVISLRSFQG-----GLEHSEYDGLV 280 Query: 317 TSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIK 373 + AY ++ ++S D K G+R + + F D + + +P Sbjct: 281 SPAYTVLRGQPALYPPFYRHYLKSPDFLKRLAVAVVGIRDGKQIAFTDFASIKLPLPAFD 340 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 Q I V++ + ++Q L+ ++ + +TG+ + + Sbjct: 341 LQTKIAAVLDESEDE----IALMKQQAGKLRTQKRGLMQKLLTGKWRVPVPEE 389 >gi|289667520|ref|ZP_06488595.1| Type I restriction enzyme StySPI specificity protein [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 495 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 66/457 (14%), Positives = 135/457 (29%), Gaps = 63/457 (13%) Query: 20 AIPKHWKVVPIKRF---------TKLNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKDGN 69 +P+ W + + + + L D+ + + N Sbjct: 7 ELPQGWAFASLNELQAQGGIFADGDWIESKDQDPNGRNRLLQLADIGDRRFIDKSSRYVN 66 Query: 70 SRQSDTSTVSIFAKGQILYGKL-----GPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 D + +G IL ++ L + + +D+ L Sbjct: 67 DETFDRLNCTALEEGDILLARMPDPLGRACLMPRLPQRCLTVVDVAVFRSGSRDISHRWL 126 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 L + + + I G T + + +P+PP AEQ I +K+ A ++DTL Sbjct: 127 MHTLNASPIREEISRNASGTTRKRIARGKLAELKVPVPPAAEQKRIAQKLDALLAQVDTL 186 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDV---------------------KMKDSGIE 223 LLK +Q+++ + L + + SG + Sbjct: 187 KARIDAIPALLKRFRQSVLESAFSGELTAEWRQLHPDTKAASITDVRQAWRDHYQRSGRK 246 Query: 224 ---------WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + ++ ++ T + + + E Sbjct: 247 FAPPNLDPTNLRDDLPPTWQATQIGIIFDVFVGATPARDRTDFWKGSISWVSSAEVAFCR 306 Query: 275 LKPESYE---------TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 ++ + + + PG ++ I + + +A + V Sbjct: 307 IRSTKEKITEAGYSATSTNLHPPGTVMLAMIGQGKTRGQPAILAIDACHNQNTAALRVHD 366 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 YL + + + G G +Q+L + V+ LP + P+ EQ +I + Sbjct: 367 EYCVPEYLYYYLWGKY--EETRRFGGGNNQQALNKKSVQSLPFPLAPLAEQTEIVRRVEQ 424 Query: 385 ETARIDVLVEK---IEQSIVLLKERRSSFIAAAVTGQ 418 A D L K +Q I L S +A A G+ Sbjct: 425 LFACADQLEAKVAAAQQRIDALT---QSLLAKAFRGE 458 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 62/200 (31%), Gaps = 8/200 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W+ I + G T + I ++ +V + + Sbjct: 260 DLPPTWQATQIGIIFDVFVGATPARDRTDFWKGSISWVSSAEVAFCRIRSTKEKITEAGY 319 Query: 74 DTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++ ++ G ++ +G + I D + L+ D + Sbjct: 320 SATSTNLHPPGTVMLAMIGQGKTRGQPAILAIDACHNQNTAALRVHDEYCVPEYLYYYLW 379 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + G + K + ++P P+ PLAEQ I ++ D L + Sbjct: 380 GKYEETRRFGGGNNQQALNKKSVQSLPFPLAPLAEQTEIVRRVEQLFACADQLEAKVAAA 439 Query: 192 IELLKEKKQALVSYIVTKGL 211 + + Q+L++ L Sbjct: 440 QQRIDALTQSLLAKAFRGEL 459 >gi|86143515|ref|ZP_01061900.1| type I restriction-modification system specificity subunit [Leeuwenhoekiella blandensis MED217] gi|85829962|gb|EAQ48423.1| type I restriction-modification system specificity subunit [Leeuwenhoekiella blandensis MED217] Length = 502 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 55/468 (11%), Positives = 124/468 (26%), Gaps = 74/468 (15%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + W + KL G +S K I I + D++ + + Sbjct: 3 EDWVECTLGSLLKLKNGYAFKSSKYQKDGIPVIRIGDIQDWNVDIENAKRIDDNIEYDS- 61 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQ 135 I KG IL G K I + D V L + L + + Sbjct: 62 HIVNKGDILIAMSGATTGKFGIYNSDKKAYQNQRVGNLIPHSEELTSNNYIYYLLYSLKR 121 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPP-------------------------------- 163 IE G + I + + P Sbjct: 122 DIEQQAYGGAQPNISATKIEALKTKLFPLPIQQAIVKKIEELFSSLDSGIADLKKAQDQL 181 Query: 164 -LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS-- 220 + Q ++++ + + + E L ++ + + L + S Sbjct: 182 KIYRQAVLKKAFEGKLTKEWREKQTELPTAEELLKEIKKERQKHYEQQLAKWKEAVISWE 241 Query: 221 ---------------------GIEWVGLVPDHWEVKPFFALVTELNRKNTK--------- 250 IE + ++P+ W + + ++ Sbjct: 242 NNDKEGKKPGKPGKIKEFELNEIEELPIIPNTWAWEKLGNVCLKIMDGTHFSPKNIEKGD 301 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 ++ G I + + E+ V G++++ + ++ + + Sbjct: 302 FKYITAKNIKEGRIDLRNISYVTQEDHEAIFGRCDVKKGDVLYIKDGATTGRAAVNTLEE 361 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 + + I+ +L + + + +G L + + Sbjct: 362 EFSLLSSVGVFRTIKSFINPKFLESFLNAQVTRNRMLSNIAGVAITRLTLVKLNNSMFSL 421 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 ++EQ I I + D + + I+ S+ + R S + A G Sbjct: 422 CSVEEQHQIVQEIESRLSVCDAVEQNIQDSLEKAQALRQSILKKAFEG 469 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 61/194 (31%), Gaps = 2/194 (1%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--Q 284 + L K++K + I + G+I + + Y Sbjct: 3 EDWVECTLGSLLKLKNGYAFKSSKYQKDGIPVIRIGDIQDWNVDIENAKRIDDNIEYDSH 62 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 IV+ G+I+ K + ++ + + S + + Sbjct: 63 IVNKGDILIAMSGATTGKFGIYNSDKKAYQNQRVGNLIPHSEELTSNNYIYYLLYSLKRD 122 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + G + ++ ++ L + P+ Q I I + +D + ++++ LK Sbjct: 123 IEQQAYGGAQPNISATKIEALKTKLFPLPIQQAIVKKIEELFSSLDSGIADLKKAQDQLK 182 Query: 405 ERRSSFIAAAVTGQ 418 R + + A G+ Sbjct: 183 IYRQAVLKKAFEGK 196 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 69/209 (33%), Gaps = 11/209 (5%) Query: 14 GVQWIGAIPKHWKVVPIKRFT-KLNTGRTSESGK----DIIYIGLEDVESGTGKY--LPK 66 ++ + IP W + K+ G D YI ++++ G + Sbjct: 263 EIEELPIIPNTWAWEKLGNVCLKIMDGTHFSPKNIEKGDFKYITAKNIKEGRIDLRNISY 322 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPE 122 KG +LY K G +A + +F + S + P+ Sbjct: 323 VTQEDHEAIFGRCDVKKGDVLYIKDGATTGRAAVNTLEEEFSLLSSVGVFRTIKSFINPK 382 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L+ +L + R+ + G ++ + N + + EQ I ++I + D Sbjct: 383 FLESFLNAQVTRNRMLSNIAGVAITRLTLVKLNNSMFSLCSVEEQHQIVQEIESRLSVCD 442 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGL 211 + +E + +Q+++ L Sbjct: 443 AVEQNIQDSLEKAQALRQSILKKAFEGTL 471 >gi|291277030|ref|YP_003516802.1| putative type I restriction-modification system S protein [Helicobacter mustelae 12198] gi|290964224|emb|CBG40073.1| putative type I restriction-modification system S protein [Helicobacter mustelae 12198] Length = 428 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 47/418 (11%), Positives = 121/418 (28%), Gaps = 38/418 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + + +T ++ + + SG Y + + Sbjct: 13 PHGVEFRKLGEVCEFQNKKTLKTSEVKNNGKYPVINSGRDLYGYYHDFNNDGEN------ 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQRIE 138 I G Y + V +L + L +L + + Sbjct: 67 ----ITIASRGEYAGFVNYFNEKFFAGGLCYPYKVKNSNKLLTKFLYFYLKANESQIMEN 122 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G ++ + I +P+P+PPL Q I + + T L TE + + Sbjct: 123 LVIRG-SIPALNKADIETLPIPLPPLEVQREIVKILDTFTELNTELNTELKLRKKQYEYY 181 Query: 199 KQALVS--------YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF-----------FA 239 + L+S + L K + L P E + + Sbjct: 182 RNWLLSFGDVDASKEGAEQRLRNKSYPKALKALLLSLCPHGVEFRKLGEVGEYIRGVTYR 241 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 E+N + + +++ N + + + + + + + ++ Sbjct: 242 KSQEINGQGCGIKVLRANNITLSNHLNFEDIKTIDKSVKIRKEQYLKKNDILICAGSGSS 301 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLK 358 + + ++ ++S ++ + S + + +L Sbjct: 302 EHIGKVAFIDANSDYVFGGFMGVIRIRELNSRFVYHVFTSNIFKQYLEKSLNTTTINNLN 361 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ + +PP++ Q +I +++ + + L I I K+ R + Sbjct: 362 ANVLQNFKIPLPPLEVQREIVKILDDFSTLTEDLSSGIPAEIAARKKQYEYYRDKLLT 419 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 63/185 (34%), Gaps = 9/185 (4%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 P E + + N+K K E K N G Y D Sbjct: 12 CPHGVEFRKLGEVCEFQNKKTLKTSEVK--------NNGKYPVINSGRDLYGYYHDFNND 63 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 I + + + G+ Y + + + +L + +++ + + Sbjct: 64 GENITIASRGEYAGFVNYFNEKFFAGGLCYP-YKVKNSNKLLTKFLYFYLKANESQIMEN 122 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + G +L D++ LP+ +PP++ Q +I +++ T L +++ + R Sbjct: 123 LVIRGSIPALNKADIETLPIPLPPLEVQREIVKILDTFTELNTELNTELKLRKKQYEYYR 182 Query: 408 SSFIA 412 + ++ Sbjct: 183 NWLLS 187 >gi|317177321|dbj|BAJ55110.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F16] Length = 422 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 49/396 (12%), Positives = 119/396 (30%), Gaps = 15/396 (3%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL---PELLQGWLLSI 131 +F K I+ A++ D + + QF L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCGIALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ K++ Q + ++ + + L + Sbjct: 192 LKARKKQYQYYQNMLLDF---KGINQNHKDAKMSAKPYPKRLKTLLQTLAPKGVEFRKLG 248 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I+ + L+ + ++ I + + Sbjct: 249 EVCEIIRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVNWQN 308 Query: 312 ERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 ++ +V P + YL +++ + + S + S+ ++ ++ + +P Sbjct: 309 QKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITIPIP 368 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 P++ Q +I +++ + L+ I I K++ Sbjct: 369 PLEIQQEIVKILDQFSILTTDLLAGIPAEIEARKKQ 404 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 56/191 (29%), Gaps = 17/191 (8%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG---------LKPES 279 P E K + N + + R G + P++ Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + ++ I+ + L + + +++ K + + + + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLIVDSLANQQFT---FLSKKANCGIALDMKFFFYQ 129 Query: 340 YDLCKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 L + + S+ K+ +PP++ Q +I +++ T L ++ Sbjct: 130 CFLLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELN 189 Query: 398 QSIVLLKERRS 408 LK R+ Sbjct: 190 TE---LKARKK 197 >gi|319642843|ref|ZP_07997481.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides sp. 3_1_40A] gi|317385587|gb|EFV66528.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides sp. 3_1_40A] Length = 484 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 61/411 (14%), Positives = 129/411 (31%), Gaps = 41/411 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W ++ + +G T + + YI + ++ + + + Sbjct: 71 ELPNSWVWCRLEDIAYVASGSTPDKTCFVENGVPYIKMYNLRNQKIDFAYHPQYITEEVH 130 Query: 76 STV---SIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLL 129 + S G ++ +GP L K I + ++++P L+ + Sbjct: 131 NGKLQRSRTEVGDLIMNIVGPPLGKLAIIPTTLPQANFNQAAVLIRPYKFKEVLVSYLKV 190 Query: 130 SIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 ++ I +I A + N+ +PIPPL E I E++ + ID+L Sbjct: 191 YLEEMSEINSIATRGSAGQVNISLTQSQNMRIPIPPLNEVRRIIEEVSKYDILIDSLKQN 250 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV----------------GLVPDH 231 L+ K ++ + L P + IE + VP Sbjct: 251 ITDIQNLIAYTKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTFDVPSG 310 Query: 232 WEVKPFFALVT-----------ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 W ++ + I LS ++ + Sbjct: 311 WITTNLGSIFNVVSAKRILKSDWKHSGVPFYRAREIAKLSIYGLVDNELYISEEHYNSLK 370 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 E + + +I+ + ++ + + + I++ Y+ +MRS Sbjct: 371 EKFPVPKASDIMISAVGTIGKCYIVKESDKFYYKDAS-VLCLCNDYQINAKYIYHIMRSE 429 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + K Y G ++ E K+ + +PP+ EQ I I + D Sbjct: 430 YMLKQMYDNSKGTTVDTITIEKAKQYILPLPPLAEQQRIVAKIEETFSIFD 480 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 60/194 (30%), Gaps = 12/194 (6%) Query: 227 LVPDHWEVKPFFALVTELNRKNT---KLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +P+ W + + +E+ + + N+ + + + E + Sbjct: 71 ELPNSWVWCRLEDIAYVASGSTPDKTCFVENGVPYIKMYNLRNQKIDFAYHPQYITEEVH 130 Query: 284 Q------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAW 335 + G+++ + K ++ + + +A + +YL Sbjct: 131 NGKLQRSRTEVGDLIMNIVGPPLGKLAIIPTTLPQANFNQAAVLIRPYKFKEVLVSYLKV 190 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + GS + ++ + + + +PP+ E I ++ ID L + Sbjct: 191 YLEEMSEINSIATRGSAGQVNISLTQSQNMRIPIPPLNEVRRIIEEVSKYDILIDSLKQN 250 Query: 396 IEQSIVLLKERRSS 409 I I L S Sbjct: 251 ITD-IQNLIAYTKS 263 >gi|206603725|gb|EDZ40205.1| Putative Type I restriction modification system, specificity protein [Leptospirillum sp. Group II '5-way CG'] Length = 533 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 57/436 (13%), Positives = 131/436 (30%), Gaps = 39/436 (8%) Query: 17 WI--GAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNS 70 W+ P +W P+ K G I + ++++G + + Sbjct: 107 WLYHPDFPNNWIRTPLYSLAKWINGLAFRELQFCSSGKPVIKIAEIKNG----ISEQTKF 162 Query: 71 RQSDTSTVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 KG +L+ G + +G + + P + + + + Sbjct: 163 TNQSFDQSLHIKKGDLLFSWSGQPETSIDAFWWHGPNGWLNQHIYRVLPIENIDRIFFFY 222 Query: 128 --LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 I + + H + + I PPL+EQ I + +I+ Sbjct: 223 LLRYLKPNFIAIARNKQTTGLGHVTKRDLEKIEAAYPPLSEQCAIAHILGTLDDKIELNR 282 Query: 186 TERIRFIELLKEKKQALVSYIVT---------KGLNPDV---KMKDSGIEWVGLVPDHWE 233 + + + GL ++ +G +P W Sbjct: 283 RMNETLEAMAQAIFNSWFVNFDPVRAKMEGRLTGLPKEIADLFPDSFEDSDLGEIPRGWR 342 Query: 234 VKPFFALV-TELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGEI 291 + I+ ++ ++ ++ + + E + G+I Sbjct: 343 IGTLGEFATRSRQSIRPNEIKEGTPYIALEHMPRRCISLFEWKMADEVESNKFEFNKGDI 402 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG 350 +F + K + G+ ++ + + P + + S + Sbjct: 403 LFGKLRSYFHKVGVAPV----NGVCSTDILVIAPQKQELFGFVLGHVSSDSFVQYTDLGA 458 Query: 351 SGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 SG R +E++KR ++VPPI ++ I +I L E L R + Sbjct: 459 SGTRMPRTNWENMKRYSLVVPPISVSEVFSSKIGPLVEKI--LSNVHESK--TLSCLRDA 514 Query: 410 FIAAAVTGQIDLRGES 425 + ++G+I ++ +S Sbjct: 515 LLHKLLSGEIKVQPDS 530 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/200 (23%), Positives = 78/200 (39%), Gaps = 8/200 (4%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLP 65 ++DS +G IP+ W++ + F + + + YI LE + Sbjct: 327 DSFEDSD---LGEIPRGWRIGTLGEFATRSRQSIRPNEIKEGTPYIALEHMPRRCISLFE 383 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELL 124 S F KG IL+GKL Y K +A +G+CST LV+ P K L + Sbjct: 384 --WKMADEVESNKFEFNKGDILFGKLRSYFHKVGVAPVNGVCSTDILVIAPQKQELFGFV 441 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 G + S Q + G M +W+ + + +PP++ + KI +I + Sbjct: 442 LGHVSSDSFVQYTDLGASGTRMPRTNWENMKRYSLVVPPISVSEVFSSKIGPLVEKILSN 501 Query: 185 ITERIRFIELLKEKKQALVS 204 + E L L+S Sbjct: 502 VHESKTLSCLRDALLHKLLS 521 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 63/195 (32%), Gaps = 10/195 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI--LSLSYGNIIQKLETRNMGLKPE 278 G + P++W P ++L +N + ++ + I+ + + Sbjct: 106 GWLYHPDFPNNWIRTPLYSLAKWINGLAFRELQFCSSGKPVIKIAEIKNGISEQTKFTNQ 165 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 S++ + G+++F + G + V P + + Sbjct: 166 SFDQSLHIKKGDLLFSWSGQPETSIDAFW-WHGPNGWLNQHIYRVLPIENIDRIFFFYLL 224 Query: 339 SY---DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 Y + + + + D++++ PP+ EQ I +++ +D +E Sbjct: 225 RYLKPNFIAIARNKQTTGLGHVTKRDLEKIEAAYPPLSEQCAIAHILGT----LDDKIEL 280 Query: 396 IEQSIVLLKERRSSF 410 + L+ + Sbjct: 281 NRRMNETLEAMAQAI 295 >gi|210623095|ref|ZP_03293582.1| hypothetical protein CLOHIR_01532 [Clostridium hiranonis DSM 13275] gi|210153898|gb|EEA84904.1| hypothetical protein CLOHIR_01532 [Clostridium hiranonis DSM 13275] Length = 632 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 58/407 (14%), Positives = 133/407 (32%), Gaps = 36/407 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + ++ TG+ + GKY G +S Sbjct: 13 PNGVEYKYLGDICEIKTGKGITKKD----------ITENGKYPIISGGKEPMGLYHLSNR 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---QGWLLSIDVTQRIE 138 + ++G + D + + + P + + + + ++I Sbjct: 63 KANTVTISRVGANSGFVNYIEVDFYLNDKCFSIIPISKYEKKIDSKYIYEYLKNNEEKIS 122 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A+ + + K + +I + +PPL Q I + + T+ LI E + K++ Sbjct: 123 AMQSEGGVPTINTKKVSSIAIAVPPLEVQREIVRILDSFTLLTKELIKELAAELTARKKQ 182 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + +++ + K + I + + + + I Sbjct: 183 YEYYRNELISINIVKSNVSKLNEIAEI----------------YDGTHQTPEYKSKGIPF 226 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 +S N I + + N + E+Y Y+I + +F K ++ + ++ Sbjct: 227 ISVEN-IDDIYSSNKFISEEAYSKYKIKPQVDDLFMTRIGSIGKCAIMTQPKDLAYYVSL 285 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQF 376 A + +D YL + S K + + +D+ ++ + P I Q Sbjct: 286 ALIRPNKKLLDVRYLKHYIESSLGTKELAKRTLHHAVPIKINKDDIGKIVIKYPTIDIQR 345 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQI 419 I +V++ A L + I ++ R + A TG+I Sbjct: 346 RIADVLDNFDAICSDLKIGLPAEIEARQKQYEYYRDLLLTFAETGKI 392 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 56/436 (12%), Positives = 133/436 (30%), Gaps = 53/436 (12%) Query: 26 KVVPIKRFTKLNTG--RTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 V + ++ G +T E K I +I +E+++ + + S I Sbjct: 199 NVSKLNEIAEIYDGTHQTPEYKSKGIPFISVENID----DIYSSNKFISEEAYSKYKIKP 254 Query: 83 K-GQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + + ++G + AI+ D S + K + L+ ++ S T+ + Sbjct: 255 QVDDLFMTRIGSIGKCAIMTQPKDLAYYVSLALIRPNKKLLDVRYLKHYIESSLGTKELA 314 Query: 139 AICEGATMSH-ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI--------------DT 183 + + IG I + P + Q I + + Sbjct: 315 KRTLHHAVPIKINKDDIGKIVIKYPTIDIQRRIADVLDNFDAICSDLKIGLPAEIEARQK 374 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG---------------IEWVGLV 228 + E + + + T + + I + Sbjct: 375 QYEYYRDLLLTFAETGKIIATDRQTDRQTDRQTDRQTDRQTDRQTDRQTDRQAIIKLIQY 434 Query: 229 PDHWEVKPFFALVTELNRKN---TKLIESNILSLSYGNIIQKLETRNMG----LKPESYE 281 + + T N +E+ + YG I + + E +E Sbjct: 435 VFGYCPVKLDDIATISRGGNLQKKDFVENGKPCIHYGQIYTHFGVSSDKTLTFVNDEVFE 494 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + G+IV + +A + + I S + A+ H ++ Y+++ RS Sbjct: 495 KSKTAKTGDIVMAVTSENIEDVCSCTAWLGDEEIAVSGHTAIIKHNQNAKYMSYFFRSSS 554 Query: 342 LCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + G + + + + +++P I+EQ I ++++ + + + I I Sbjct: 555 FFGQKKKLAHGTKVIEVTPSKLGGIEIMLPSIEEQERIVSILDRFDSLCNDITSGIPAEI 614 Query: 401 VLLKE----RRSSFIA 412 ++ R + Sbjct: 615 EARQKQYEYYRDKLLT 630 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 61/190 (32%), Gaps = 14/190 (7%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 P+ E K + K + K + G +P Sbjct: 12 CPNGVEYKYLGDICEIKTGKGITKKD--------ITENGKYPIISGGKEPMGLYHLSNRK 63 Query: 288 PGEIVFRFIDL-QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 + + + + + ++ IDS Y+ +++ + K+ Sbjct: 64 ANTVTISRVGANSGFVNYIEVDFYLNDKCFSIIPISKYEKKIDSKYIYEYLKNNE-EKIS 122 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE- 405 G ++ + V + + VPP++ Q +I +++ T L++++ + K+ Sbjct: 123 AMQSEGGVPTINTKKVSSIAIAVPPLEVQREIVRILDSFTLLTKELIKELAAELTARKKQ 182 Query: 406 ---RRSSFIA 412 R+ I+ Sbjct: 183 YEYYRNELIS 192 >gi|15839315|ref|NP_300003.1| type I restriction-modification system specificity determinant [Xylella fastidiosa 9a5c] gi|9107962|gb|AAF85511.1|AE004079_2 type I restriction-modification system specificity determinant [Xylella fastidiosa 9a5c] Length = 409 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 49/412 (11%), Positives = 114/412 (27%), Gaps = 33/412 (8%) Query: 22 PKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-S 76 P I +L G ++ I I + + G + + + Sbjct: 13 PNGVDYKAIGDLGELVRGNGMPKSDFVDSGIGCIHYGQIYTYYGIWTTRTKSFVSLSKAE 72 Query: 77 TVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 ++ G ++ + I + + D P+ L +L + Sbjct: 73 KLAKVDPGDLVITNTSENVEDVCKAVAWIGEVQIVTGGHATVLKHDQDPKYLSYYLQTPQ 132 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G + K + I +P+PPL Q I + + T L E Sbjct: 133 FSVEKKKHATGTKVIDVSAKSLAKIKIPVPPLEVQRQIVKVLDTFTTLEAELEAELEARR 192 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + + AL+ +G + +++ +G + + Sbjct: 193 RQYQYYRDALLR--FEEGTDAATRVR---WMTLGEI------CKSVSSGGTPLSTRADYY 241 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 +I L + E + + I+ + ++ Sbjct: 242 GGDIPWLRTQEVRYTDILDTEIKITEKGLKESAAKWIPANCIIVAISGATAARSAINKIP 301 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + ++ + Y + A+G G R L +K + + Sbjct: 302 LT----TNQHCCNLEVDSTQANYRYVFHWVSKEYERLKALGQGARADLNSGIIKNYKIPI 357 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA--AAV 415 PP++ Q I V++ ++ + + I ++ R + AV Sbjct: 358 PPLEVQARIVAVLDQFDTLVNDITAGLPAEIAARRQQYAYYRDRLLTFKEAV 409 >gi|87124163|ref|ZP_01080013.1| type I site-specific deoxyribonuclease (specificity subunit) [Synechococcus sp. RS9917] gi|86168732|gb|EAQ69989.1| type I site-specific deoxyribonuclease (specificity subunit) [Synechococcus sp. RS9917] Length = 128 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 36/110 (32%), Positives = 60/110 (54%), Gaps = 2/110 (1%) Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 I S +L +L RS + G+G ++ + V+ +P I+EQ Sbjct: 13 IVTRPVKEKITSEFLDYLFRSQTFRRLGESEMYGAGGQKRVPDSFVRDFTSALPSIEEQS 72 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +T ++ ET +ID L+ + ++ I LL+ERRS+ I+A VTGQID+RG ++ Sbjct: 73 QVTRFLDRETGKIDALIAEQQRLIELLQERRSALISAVVTGQIDVRGLAE 122 >gi|254506510|ref|ZP_05118652.1| restriction modification system DNA specificity domain protein [Vibrio parahaemolyticus 16] gi|219550684|gb|EED27667.1| restriction modification system DNA specificity domain protein [Vibrio parahaemolyticus 16] Length = 428 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 51/417 (12%), Positives = 133/417 (31%), Gaps = 34/417 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTS-T 77 ++W+ + + G + G+ + +I + D+ E+ Y G+ + Sbjct: 17 ENWQAIELGELMTFKNGINASREQYGRGVKFINVMDIIENDYITYDRIVGSVDVENKEFE 76 Query: 78 VSIFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPK--DVLPELLQGWLLS 130 +I G IL+ + + + F++ K D + L + Sbjct: 77 KNIVEYGDILFQRSSETREEVGQANVYLDKKNVATFGGFVIRGKKVGDFDSVCMNYLLKT 136 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERI 189 + + G+T + + + + +P + EQ I + ++D I Sbjct: 137 DKARKEVTTKSGGSTRYNVGQATLSAVNIDLPPCIPEQQKIASFL----SKVDEKIALLA 192 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + L E K+ ++ + + + + Sbjct: 193 EKKDKLAEYKKGVMQQLFNGKWEEQDGQLTFVPPTLRFKAADGSEFSDWEEIELGKLSKK 252 Query: 250 KLIESNILSL--------SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 +++ S+ + G I Q + + Y +V P + V+ Sbjct: 253 STVKNKDTSVSAVLTNSATQGIIHQADYFDRDIANQSNLDGYYVVKPNDFVYNPRISIPA 312 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL----RQS 356 + ++ G+++ Y + ++ +YL + ++ + ++ + R + Sbjct: 313 PVGPINRNKLDVGVMSPLYTVFTVNKSVNLSYLEYFFKTTKWHRYMNSIANFGARHDRMN 372 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + D ++P+ VP I+EQ I ++ ++D KE + + Sbjct: 373 ITTSDFFKMPIPVPCIEEQNKIVQFVSSIDQKLD----LANSEFEKAKEWKRGLLQQ 425 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 64/177 (36%), Gaps = 14/177 (7%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 N++ + + I IV+ G+I+F+ ++ + + ++ Sbjct: 49 NVMDIIENDYITYDRIVGSVDVENKEFEKNIVEYGDILFQRSSETREEVGQANVYLDKKN 108 Query: 315 IITSAYMAVKPH---GIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVP 370 + T ++ DS + +L+++ K G R ++ + + + +P Sbjct: 109 VATFGGFVIRGKKVGDFDSVCMNYLLKTDKARKEVTTKSGGSTRYNVGQATLSAVNIDLP 168 Query: 371 P-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 P I EQ I + + +++D + + + L E + + G+ + Sbjct: 169 PCIPEQQKIASFL----SKVDEKIALLAEKKDKLAEYKKGVMQQLFNGK-----WEE 216 >gi|15645409|ref|NP_207583.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori 26695] gi|2313919|gb|AAD07838.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori 26695] Length = 431 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 53/411 (12%), Positives = 123/411 (29%), Gaps = 25/411 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + QF L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTKATIGEHALLI-VDSLANQQFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR---IDTLITER 188 + + + + + D PIPPL Q I + + A T ++T + Sbjct: 132 LLGEWCKNNINVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 ++ + E Q ++ + + + V Sbjct: 192 LKARKKQYEYYQNMLLDFNDINQSHKDAKERLAQKTYPKRLKTLLQTLAPKGVEFRKLGE 251 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ---NDKRSL 305 I N N G + G+ V D D + Sbjct: 252 VCEILDNRRIPIAKNKRNPGIYPYYGANGIQDYIDSYIFDGDFVLVGEDGSVINKDNTPV 311 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + + A++ + + +L + +++ D+ +G + E++K++ Sbjct: 312 VNWASGKIWVNNHAHVLQTKNELKLKFLYFYLQTIDV----SYCVAGTPPKINQENLKKI 367 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + +PP++ Q +I +++ + L+ I I K+ R + Sbjct: 368 AIPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQYEYYREKLLT 418 >gi|307260751|ref|ZP_07542440.1| Restriction modification system DNA specificity domain [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306869590|gb|EFN01378.1| Restriction modification system DNA specificity domain [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 410 Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats. Identities = 57/410 (13%), Positives = 128/410 (31%), Gaps = 42/410 (10%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDV--ESGTG 61 KD V+W + K G T + + + ++ + Sbjct: 8 KDCEVEW----------KSLGEVAKYVRGLTYNKTNESDEKAGGYYVLRVNNITLSNNQL 57 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFL--VLQ 115 + + T K IL + A I++ F+ V Sbjct: 58 NFDDVKLVKFDTKTKPEQKLYKDDILISAASGSKEHVGKVAFISENMDFYFGGFMGVVRC 117 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 +++LP L L S + + +T+++ + K + +PIPPL Q I + + Sbjct: 118 SQEILPRFLFHILTSSLFKTYLNEVLNSSTINNLNAKVMNEFQIPIPPLEIQEKIVKILD 177 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 T TL E ++ + L++ + + K++ +G + Sbjct: 178 KFTELEATLEAELSLRVKQYNYYRDLLLNE-----NDKNPFFKNTEYRCLGDI------- 225 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + + ++ S+ N + L S ++V +++F Sbjct: 226 TLVSSNIKWKNNTNTYKYIDLTSVDRENHSIGETIKISALTAPS-RAQKLVAKDDVIFAT 284 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + + + I ++ Y P+ + ++ + S D SG Sbjct: 285 TRPTQLRFAF-INEEFANSIASTGYCVLRANPNLVLPKWIYHNLGSIDFKNFLEENQSGS 343 Query: 354 R-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++ VK + VP + Q I +++ + + + + I L Sbjct: 344 AYPAVSDSKVKDYKIPVPSLDVQEKIIAILDNFENLANSIKNGLPREIEL 393 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 60/201 (29%), Gaps = 6/201 (2%) Query: 218 KDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 KD +EW +G V + + E + K + +++ N + + Sbjct: 8 KDCEVEWKSLGEVAKYVRGLT-YNKTNESDEKAGGYYVLRVNNITLSNNQLNFDDVKLVK 66 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM--AVKPHGIDSTYL 333 + Q + +I+ + + +M I +L Sbjct: 67 FDTKTKPEQKLYKDDILISAASGSKEHVGKVAFISENMDFYFGGFMGVVRCSQEILPRFL 126 Query: 334 AWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 ++ S + S +L + + + +PP++ Q I +++ T L Sbjct: 127 FHILTSSLFKTYLNEVLNSSTINNLNAKVMNEFQIPIPPLEIQEKIVKILDKFTELEATL 186 Query: 393 VEKIEQSIVLLKERRSSFIAA 413 ++ + R + Sbjct: 187 EAELSLRVKQYNYYRDLLLNE 207 >gi|218692731|ref|YP_002400943.1| Type I restriction enzyme EcoAI specificity protein (S protein) (S.EcoAI) [Escherichia coli ED1a] gi|218430295|emb|CAV18170.1| Type I restriction enzyme EcoAI specificity protein (S protein) (S.EcoAI) [Escherichia coli ED1a] Length = 584 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 75/480 (15%), Positives = 141/480 (29%), Gaps = 96/480 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLE 54 +K K P+ S + +P+ W+ V G+T KD I ++ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVTFSHLGYFFGGKTPSKMKDEYWGGTIPWVTPK 140 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQF 111 D+++ + ++ + G IL+ LR I + + Sbjct: 141 DMKTNLIVDSEDKVTPLALE-DGLTKVSPGSILFVARSGILRRIFPVAITSIECTVNQDI 199 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQV-- 168 VL P +++ IE + G T+ ++ + P IPP AEQ Sbjct: 200 KVLSPFFSDISYYILLMMNGFERYIIENLTKTGTTVESLLFEDFISHPFMIPPFAEQNRI 259 Query: 169 ---------------------------------------LIREKIIAETVRIDTLITERI 189 E++ RI Sbjct: 260 LSTVKKLMSLCDQLEQQSLTTLDAHQQLVETLLGTLTDSQNAEELAENWARISEHFDTLF 319 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------------ 219 + KQ ++ V L P + Sbjct: 320 TTEASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLPP 379 Query: 220 -SGIEWVGLVPDHWEVKPFFAL---VTELNRKNTKLIESNILSLSYGNIIQKLETRNM-- 273 S E +P+ WE + +T+ + K I LS NI N Sbjct: 380 ISDEEKPFELPEGWEWCRLNDISSKITDGDHKTPPRIAEGYKLLSAKNIRDGYLDYNNCD 439 Query: 274 ---GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + E + + G+++ + + SL + ++ S + KP I+ Sbjct: 440 YISAIDYEKSRERCLPEKGDLLIVSVGGTIGRSSLIK-DCSDFALVRSVAII-KPLLIEP 497 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 YL M S L + ++ G + L ++ + PP+ EQ +I N +++ + Sbjct: 498 EYLKLAMDSKLLQSMIHSHKRGGAQPCLYLGEISKFLFPTPPLAEQRNIVNKVSILMEKC 557 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 66/201 (32%), Gaps = 10/201 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE F L K ++ + + K N+ + E Sbjct: 93 SEEEKPFELPEGWEWVTFSHLGYFFGGKTPSKMKDEYWGGTIPWVTPKDMKTNLIVDSED 152 Query: 280 -------YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + V PG I+F +R A + + P D +Y Sbjct: 153 KVTPLALEDGLTKVSPGSILFVARSGIL-RRIFPVAITSIECTVNQDIKVLSPFFSDISY 211 Query: 333 LAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 LM + + +SL FED P ++PP EQ I + + + D Sbjct: 212 YILLMMNGFERYIIENLTKTGTTVESLLFEDFISHPFMIPPFAEQNRILSTVKKLMSLCD 271 Query: 391 VLVEKIEQSIVLLKERRSSFI 411 L ++ ++ ++ + + Sbjct: 272 QLEQQSLTTLDAHQQLVETLL 292 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 65/197 (32%), Gaps = 8/197 (4%) Query: 20 AIPKHWKVVPIKRF-TKLNTG--RTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQ--S 73 +P+ W+ + +K+ G +T + + +++ G Y D S Sbjct: 388 ELPEGWEWCRLNDISSKITDGDHKTPPRIAEGYKLLSAKNIRDGYLDYNNCDYISAIDYE 447 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSI 131 + + KG +L +G + ++ + +++P + PE L+ + S Sbjct: 448 KSRERCLPEKGDLLIVSVGGTIGRSSLIKDCSDFALVRSVAIIKPLLIEPEYLKLAMDSK 507 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + I + G I P PPLAEQ I K+ + L Sbjct: 508 LLQSMIHSHKRGGAQPCLYLGEISKFLFPTPPLAEQRNIVNKVSILMEKCRFLFLGLQSA 567 Query: 192 IELLKEKKQALVSYIVT 208 + AL + Sbjct: 568 QQTQLHVADALTDAAIN 584 >gi|88810395|ref|ZP_01125652.1| Type I restriction enzyme StySPI specificity protein [Nitrococcus mobilis Nb-231] gi|88792025|gb|EAR23135.1| Type I restriction enzyme StySPI specificity protein [Nitrococcus mobilis Nb-231] Length = 496 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 64/459 (13%), Positives = 142/459 (30%), Gaps = 65/459 (14%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDI--------IYIGLEDVESGTGKYLPKDGNS-R 71 +P++W + +L G T + + + ++ G+ +D R Sbjct: 6 LPENWARCRVTELAQLIRGVTYKKSEASKESQPGFAPLLRANNI---NGRINHEDLVYVR 62 Query: 72 QSDTSTVSIFAKGQILYGK----LGPYLRKAIIADFDGICSTQFL--VLQPKDVLPELLQ 125 ++ S + +L +G + A + G F + ++ Sbjct: 63 EARISNEQWLKESDVLIAMSSGSIGLVGKAAQLRKVKGETFGSFCGALRPTSEIDCHFFG 122 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + + + + +G+ +++ I ++ P+PP EQ I EKI R+D Sbjct: 123 WFFQTRTYRECVSGDAKGSNINNLKRDHILHVDFPLPPANEQRRIVEKIETLFSRLDKGE 182 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVK-----------------MKDSGIEWVG-- 226 +LL +Q+++ VT L D + ++ W G Sbjct: 183 EALRDVQKLLSRYRQSVLKAAVTGQLTADWRAENAHRLEHGRDLLARILQTRRESWEGRG 242 Query: 227 --------------LVPDHWEVKPFFALVTE---------LNRKNTKLIESNILSLSYGN 263 +PD W L KN + ++ Sbjct: 243 KYKEPIAPSTSGLPDLPDGWVWASLAQLTHIKGGVTVDKKRESKNPVTVPYLRVANVQNG 302 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMA 322 I E + + + + E ++ G+I+ D R + I + Sbjct: 303 HIDLTEIKEITVNRDKAEQ-TLLKAGDILLNEGGDRDKLGRGWVWDGQIAPCIHQNHVFR 361 Query: 323 VKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDIT 379 +P S ++++ ++ + S+ + P+ +P EQ +I Sbjct: 362 ARPVIPEISSRFVSYYANAFGQGFFMQKGKQSVNLASISLTAISGFPIALPSADEQREIV 421 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + + + E + + R S + A TG+ Sbjct: 422 GRLEEKLIEVATVAEWCKTELTRSAALRQSILKDAFTGR 460 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 65/206 (31%), Gaps = 11/206 (5%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES---- 279 +P++W L + K E++ S + + N + E Sbjct: 2 ENRALPENWARCRVTELAQLIRGVTYKKSEASKESQPGFAPLLRANNINGRINHEDLVYV 61 Query: 280 ----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM--AVKPHGIDSTYL 333 Q + +++ + +G ++ ID + Sbjct: 62 REARISNEQWLKESDVLIAMSSGSIGLVGKAAQLRKVKGETFGSFCGALRPTSEIDCHFF 121 Query: 334 AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 W ++ + G +LK + + + +PP EQ I I +R+D Sbjct: 122 GWFFQTRTYRECVSGDAKGSNINNLKRDHILHVDFPLPPANEQRRIVEKIETLFSRLDKG 181 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQ 418 E + LL R S + AAVTGQ Sbjct: 182 EEALRDVQKLLSRYRQSVLKAAVTGQ 207 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 83/236 (35%), Gaps = 22/236 (9%) Query: 9 QYKD------SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDV 56 +YK+ SG + +P W + + T + G T + ++ + Y+ + +V Sbjct: 243 KYKEPIAPSTSG---LPDLPDGWVWASLAQLTHIKGGVTVDKKRESKNPVTVPYLRVANV 299 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY---LRKAIIADFDGICSTQFLV 113 ++G + D + ++ G IL + G R + C Q V Sbjct: 300 QNGHIDLTEIKEITVNRDKAEQTLLKAGDILLNEGGDRDKLGRGWVWDGQIAPCIHQNHV 359 Query: 114 LQPKDVLP----ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 + + V+P + + + ++ + ++ I P+ +P EQ Sbjct: 360 FRARPVIPEISSRFVSYYANAFGQGFFMQKGKQSVNLASISLTAISGFPIALPSADEQRE 419 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 I ++ + + + T+ + +Q+++ T L P + E + Sbjct: 420 IVGRLEEKLIEVATVAEWCKTELTRSAALRQSILKDAFTGRLVPQNPSDEPAAELL 475 >gi|261840205|gb|ACX99970.1| restriction modification system S subunit [Helicobacter pylori 52] Length = 373 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 61/406 (15%), Positives = 122/406 (30%), Gaps = 46/406 (11%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 6 LPLNWQKVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYR 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G R I +V E L + Sbjct: 64 TKYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYIYS 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ I + A + Sbjct: 121 NVKWNTEYTTILRLYNDNFRNTLIPLPPLNEQSAIANILSALDRYL-------------- 166 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 AL+ L + K E + + V + N + Sbjct: 167 -CALDALI-------LKKEGVKKALSFELLSQRKRLKGFNQAWQRVRLGDIANYLTSNLS 218 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++ I+ + N ++ I D I R L + I Sbjct: 219 VEQITQQGEIKVYDANNFIGYTDTT---FISDKPYISIVKDGSVGRVRILPP----KTNI 271 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 +++ + H + +L +L+ ++D + + F+D K + +PP+ EQ Sbjct: 272 LSTMGALIANHRTTTEFLFYLLSNFDFKNF---TSGSIIPHIYFKDYKEKTIFLPPLNEQ 328 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 329 IAIANILSDLDNEITSLKNKKRQ----FENIKKALNHDLMSAKIRV 370 >gi|188528196|ref|YP_001910883.1| type I R-M system specificity subunit [Helicobacter pylori Shi470] gi|188144436|gb|ACD48853.1| type I R-M system specificity subunit [Helicobacter pylori Shi470] Length = 369 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 62/406 (15%), Positives = 123/406 (30%), Gaps = 46/406 (11%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 2 LPLNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYK 59 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G R I +V E L Sbjct: 60 TKYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYS 116 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ+ I + A + L Sbjct: 117 NVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQIAIANILSALDHYLYAL----------- 165 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 AL+ L + K E + + V + N + Sbjct: 166 ----DALI-------LKKESVKKALSFELLSQRKRLKGFNQAWQRVRLGDIANYLTSNLS 214 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++ I+ + N ++ I D I R L + I Sbjct: 215 VEQITQQGKIKVYDVNNFIGYTDTT---FISDKPYISIVKDGSVGRVRILPP----KTNI 267 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 +++ + H + +L +L+ ++D + + F+D K + +PP+ EQ Sbjct: 268 LSTMGALIANHRTTTEFLFYLLSNFDFKNF---TSGSIIPHIYFKDYKEKTIFLPPLNEQ 324 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 325 IAIANILSDLDNEIASLKNKKRQ----FENIKKALNHDLMSAKIRV 366 >gi|84624926|ref|YP_452298.1| specificity determinant for hsdM and hsdR [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84368866|dbj|BAE70024.1| specificity determinant for hsdM and hsdR [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 450 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 70/427 (16%), Positives = 148/427 (34%), Gaps = 44/427 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W I ++ + + ++ + + G Sbjct: 3 ELPGGWSETEIGPVNTYSSETLNPAKAPKQTFELYSVPVFAKRKPEIVDGKDIG------ 56 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAII----ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 ST +L K+ P + + + D + I S++++V++ P ++ L Sbjct: 57 -STKQKVEPDDVLLCKINPRINRVWLVGKKNDHEQIASSEWIVIRQPLFDPAFIRFQLQE 115 Query: 131 IDVTQ--RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 E G +++ A K + + + I PLAEQ I +K+ A ++DTL Sbjct: 116 SSFRDRLCAEVSGVGGSLTRAQPKKVESYKLRIAPLAEQKRIAQKLDALLAQVDTLKARI 175 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 LLK ++++V V L+ D K E +G + + W +L Sbjct: 176 DAIPALLKRFRKSVVHSAVIGRLSADLRVPIEKSEEQEQLGPL-ESWREVTLASLGELSR 234 Query: 246 RKNTK-------LIESNILSLSYGNIIQKL---ETRNMGLKPESYETYQIVDPGEIVFRF 295 K+ L S + G++ + + + ++ G + Sbjct: 235 GKSKHRPRNDSRLYGSEYPFIQTGDVANSGGALTSSKVFYSEFGLKQSRLFPSGTLCITI 294 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLR 354 D L ++ + ++ +++ D + A+ + + Sbjct: 295 AANIADTAMLAIDACFPDSVVG---FIPNKDDCVAQFIKYVI--DDNKESLEALAPATAQ 349 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK---IEQSIVLLKERRSSFI 411 +++ + + ++ + +PPIKEQ +I + A D L K +Q I L S + Sbjct: 350 KNINLKVLNQVKLRIPPIKEQTEIVRHVEQLFAYADQLEAKVAAAQQRIDALT---QSLL 406 Query: 412 AAAVTGQ 418 A A G+ Sbjct: 407 AKAFRGE 413 >gi|315506714|ref|YP_004085601.1| restriction modification system DNA specificity domain protein [Micromonospora sp. L5] gi|315413333|gb|ADU11450.1| restriction modification system DNA specificity domain protein [Micromonospora sp. L5] Length = 413 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 48/406 (11%), Positives = 124/406 (30%), Gaps = 26/406 (6%) Query: 22 PKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKY-LPKDGNSRQSDTS 76 P + P+ L G +T + + I + + G + + Sbjct: 13 PNGVEYKPLAEVGHLVRGNGLPKTDFTESGVGAIHYGQIYTYYGTWATDTISFVAPGTAT 72 Query: 77 TVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 ++ G ++ + + I + + P+ + WL + + Sbjct: 73 KLAKVDPGDVIITNTSENLEDVGKAVAWLGKEQIVTGGHATVFKHSQNPKFIAYWLQTPE 132 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + G ++ + + + +P+PP+A Q I + + + L + Sbjct: 133 FFTQKKKLATGTKVTDVSARALERVKLPVPPIAIQDEIVRVLDLFSGAVADLKVQLDAEF 192 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + + + T + DV G VG + V + + Sbjct: 193 AARRLQYAYYRDNLFTFQ-DADVCFVPMG--EVGEFLRGRRFTK--SDVVDEGIPSIHYG 247 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E + +G + PG++V + + A + Sbjct: 248 EIYTTYGIAADQAVSHIREKLG------PQLRYAKPGDVVIAAVGETVEDVGRGVAWLGT 301 Query: 313 RGI-ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVP 370 + I + + +D ++ + +RS + + + E + +LP+ VP Sbjct: 302 TDVAIHDDCFLYRSNVLDPKFVCYYLRSEAHNRAKAKYVARAKVKRMSREGLAKLPIPVP 361 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQS-IVLLKE---RRSSFIA 412 +KEQ I +++ + + + + I ++ R + Sbjct: 362 SLKEQKRIVAILDELDSLLTDMAAALPSEVIARRQQYDFYRDRLLT 407 >gi|254372256|ref|ZP_04987747.1| predicted protein [Francisella tularensis subsp. novicida GA99-3549] gi|151569985|gb|EDN35639.1| predicted protein [Francisella novicida GA99-3549] Length = 404 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 68/422 (16%), Positives = 146/422 (34%), Gaps = 42/422 (9%) Query: 20 AIPKHWKVVPIKRFTKL---NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W+ ++ L N G+ D +++ +Y+ + R D Sbjct: 5 ELPKGWRECRLEEILDLIVDNRGKNPSKYSDRGIPVIDNFMIQNQRYINLNEAKRYIDIK 64 Query: 77 TV-----SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 T +L +G A D Q + D + + Sbjct: 65 TFESFIRKHIKYKDVLITLVGNGYGNVSQAPIDKSVIIQNTIGLRVDEYADQEFLFYNLK 124 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I GA + ++ + +PPLAEQ I E + + +ID Sbjct: 125 FNNEQILNFDRGAVQPSIKVSDLKSLEINLPPLAEQKAIAEVLSSLDDKID--------- 175 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 LL ++ Q L ++ IE + ++ + + K+++L Sbjct: 176 --LLHQQNQTLEDMA-------KTLFREWFIEKADEGWEEVKLGDYVKCINGYTYKSSEL 226 Query: 252 IESNILSLSYGNIIQKLETRNMGLKP---ESYETYQIVDPGEIVFRFIDLQ------NDK 302 +ES ++ N + R G K ++ Q+V G++V D+ + Sbjct: 227 MESRNALVTLKNFARDGSLRLDGFKEFTGMKFKEAQVVIDGDLVVAHTDITQNADIIGNP 286 Query: 303 RSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKF 359 +++ ++ +IT + V+P + I +YL L +S D + Sbjct: 287 ILVKNIHNYDKLVITMDLVKVEPLVNWIKKSYLYCLFKSDDFKFHCLCNSNGSTVLHMSK 346 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + + +PP + T ++ + D ++ I L++ R + + ++GQ+ Sbjct: 347 KAIPSYIFKLPPKELLVSFTKIVEDIFEKQD----LNQKQIKTLEQTRDTLLPKLMSGQV 402 Query: 420 DL 421 + Sbjct: 403 RV 404 >gi|322804999|emb|CBZ02559.1| type I restriction-modification system,specificity subunit S [Clostridium botulinum H04402 065] Length = 385 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 59/399 (14%), Positives = 121/399 (30%), Gaps = 31/399 (7%) Query: 29 PIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + ++ TG T KDI++I +D+ + + I Sbjct: 6 KLWELGEILTGNTPSKKNGEFYDAKDIMFIKPDDINNNITEIECSKEYISNKAEKKARII 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K +L +G K I + Q + + + + + QR+E+I Sbjct: 66 PKDSLLITCIGSI-GKIAINKEKSAFNQQINSIVHNEKIISSKYLAYVIMINKQRLESIS 124 Query: 142 EGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + + I E Q I + ID + EL Sbjct: 125 NAPVVPIINKTQFSEFEVYIHEKKEIQEKIANVLDKAQSLIDKRKAQIEALDEL------ 178 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + S + + K+ + +T+ RK I + Sbjct: 179 -VKSRFIEMFGDLKSNSKNWDVSEFNE------FATIDTNMTKDFRKYKDYPHIGIECIE 231 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 ++ + + I D I++ I +K +L S S Sbjct: 232 K--NTGRILEYKLVKNSDLKSGKYIFDNRHIIYSKIRPNLNKVALPSFA--GVCSADSYP 287 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +YL +++RS A + E ++ + PPI Q Sbjct: 288 LLCNEKITTRSYLGYVLRSEFFLSYILAFSGRTNIPKVNKEQLRGFKMPTPPINLQNQFA 347 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + ++D L ++E+S+ L++ +S + A G+ Sbjct: 348 DFV----KQVDKLKFEMEKSLKELEDNFNSLMQRAFKGE 382 >gi|197302013|ref|ZP_03167076.1| hypothetical protein RUMLAC_00743 [Ruminococcus lactaris ATCC 29176] gi|197298961|gb|EDY33498.1| hypothetical protein RUMLAC_00743 [Ruminococcus lactaris ATCC 29176] Length = 406 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 54/402 (13%), Positives = 137/402 (34%), Gaps = 32/402 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTV 78 + W+ + L G G ++ L+++ Sbjct: 13 EDWEQRKLGELGSLKNGMNFSKEAMGIGFPFVNLQNIFGNNVIDVTNLGKAMASDSQLKD 72 Query: 79 SIFAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDV--LPELLQGWLLSI 131 G +L+ + L + + + F++ + + Sbjct: 73 YNLLNGDVLFVRSSVKLEGVGEAALVPQNLENTTYSGFIIRFRDEYGLDNNFKRFLFGIE 132 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 V +I A + + + N+ + IP +EQ EKI +D LIT R Sbjct: 133 SVRNQIMAQATNSANKNISQTVLENLCLKIPNKSEQ----EKIGLYFSNLDHLITLHQRK 188 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 E K K+ ++ + + + +++ SG + WE + F + ++N + Sbjct: 189 CEETKTLKKYMLQKMFPQNGHSVPEIRFSG------FTEDWEQRKFADFTWDAGKRNKED 242 Query: 252 IESNILSLSYGNII---QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 ++ +++ + + +K + Y IV P + + + S+ Sbjct: 243 LDLEPYAITNEHGFIRQRDAHDDFGYMKDTDRKAYNIVQPNSFAYNP--ARINVGSIGYY 300 Query: 309 QVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLP 366 + +E I++S Y + + ++ +L ++S + + + +R ++ + Sbjct: 301 KGVENVIVSSLYEVFQTDNYVNDRFLWHWLKSDEFPRWIEKLQEGSVRLYFYYDKLCECQ 360 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +P ++EQ I ++ +D L+ + L+ + Sbjct: 361 LYMPSLEEQEKIATFLDD----LDHLITLHQHKCEELQNIKK 398 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 17/157 (10%), Positives = 50/157 (31%), Gaps = 8/157 (5%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITS 318 +GN + + + +S + G+++F ++ + Q +E + Sbjct: 50 FGNNVIDVTNLGKAMASDSQLKDYNLLNGDVLFVRSSVKLEGVGEAALVPQNLENTTYSG 109 Query: 319 AYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQF 376 + + +L + A + +++ ++ L + +P EQ Sbjct: 110 FIIRFRDEYGLDNNFKRFLFGIESVRNQIMAQATNSANKNISQTVLENLCLKIPNKSEQE 169 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + +D L+ ++ K + + Sbjct: 170 KIGLY----FSNLDHLITLHQRKCEETKTLKKYMLQK 202 >gi|208434701|ref|YP_002266367.1| HP0790-like protein [Helicobacter pylori G27] gi|208432630|gb|ACI27501.1| HP0790-like protein [Helicobacter pylori G27] Length = 434 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 54/409 (13%), Positives = 125/409 (30%), Gaps = 19/409 (4%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + QF L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKNNINVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + ++ Y L+ + ++ L + + L T + Sbjct: 192 LNTELNTRKKQYQYYQNMLLDFNDINQNHKDAKEKLACKTYPKRLKTLLQTLAPKGVEFR 251 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + G + K E + G P ++ I + + Sbjct: 252 KLGEVCEIIRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVN 311 Query: 309 QVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 ++ ++ P + YL +++ + + S + S+ ++ ++ + Sbjct: 312 WQNQKFWANDVCFSLIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITI 371 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +PP++ Q +I +++ + L+ I I K+ R + Sbjct: 372 PIPPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 420 >gi|315585915|gb|ADU40296.1| type I R-M system specificity subunit [Helicobacter pylori 35A] Length = 373 Score = 110 bits (274), Expect = 5e-22, Method: Composition-based stats. Identities = 59/406 (14%), Positives = 120/406 (29%), Gaps = 46/406 (11%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 6 LPLNWQKVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYK 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G + I +V E L Sbjct: 64 TKYSFPKKGDILISASGTIGKAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYS 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ I + + Sbjct: 121 NVKWNTEYTTILRLYNDNFRNTLIPLPPLNEQSAIANILSDLDRYL-------------- 166 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 AL+ L + K E + + V + N + Sbjct: 167 -CALDALI-------LKKEGVKKSLSFELLSQRKRLKGFNQAWQRVRLGDIANYLTSNLS 218 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++ I+ + N ++ I D I R L + I Sbjct: 219 VEQITQQGEIKVYDVNNFIGYTDTT---FISDKPYISIVKDGSVGRVRILPP----KTNI 271 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 +++ + H + +L +L+ ++D + + F+D K + +PP+ EQ Sbjct: 272 LSTMGALIANHRTTTEFLFYLLSNFDFKNF---TSGSIIPHIYFKDYKEKTIFLPPLNEQ 328 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 329 SAIANILSALDNEIASLKNKKRQ----FENIKKALNHDLMSAKIRV 370 >gi|295132750|ref|YP_003583426.1| restriction modification system DNA specificity subunit [Zunongwangia profunda SM-A87] gi|294980765|gb|ADF51230.1| restriction modification system DNA specificity subunit [Zunongwangia profunda SM-A87] Length = 440 Score = 110 bits (274), Expect = 6e-22, Method: Composition-based stats. Identities = 54/432 (12%), Positives = 130/432 (30%), Gaps = 39/432 (9%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESG 59 +P++KD W +K G +SG +D + + +V Sbjct: 21 RFPEFKD-----------EWDKQKLKNLAHFQAGYAFKSGDMSSELEDYQIVKMSNVYKN 69 Query: 60 TGKYLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLR------KAIIADFDGICSTQFL 112 ++ S + + ++ G + I + + + + Sbjct: 70 ELLLDRNPSFVNSINEKSKKFLLKQNDVVLTLTGTVGKRDYGYSVNIPESNKFLLNQRLV 129 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLI 170 +L+ K + L + +G T ++ + NI + P +AEQ I Sbjct: 130 LLRGKKENSLFISYLLKTDKFYYSFFNESKGGTGNQANVSSDDVKNIKLYSPAVAEQQKI 189 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + A +I+ L ++ K Q L S + + + +G V + Sbjct: 190 ASFLSAVDEKINQLKRKKELLQAYKKGMMQQLFSQQLRFKDQNGNDFPEWEEKKLGDVFE 249 Query: 231 HWEVKPFFALVTELNR---KNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQIV 286 + F + KN + + + + ++ N + + Sbjct: 250 FFSTNSFSRDKMNEEKGEVKNIHYGDIHTKYKALVDVECDEVPYVNQDVDLSKIKIENYC 309 Query: 287 DPGEIVFRFIDLQNDKRSL---RSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDL 342 G+++ + + + + +P S ++S+ Sbjct: 310 KDGDLILADASEDYNDIGKSIEVKNIGDLKVLAGLHTILARPKIQFSEGFLGQYVQSWFH 369 Query: 343 CKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 K G + + +KR+ + +P +EQ I + A+ID + +I Sbjct: 370 RKQVMFEAQGTKVLGISVGRLKRIKIQIPSKEEQTKIAMFLLAFDAKIDT----VSTAIT 425 Query: 402 LLKERRSSFIAA 413 ++ + + Sbjct: 426 KTQDFKKGLLQQ 437 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 78/217 (35%), Gaps = 10/217 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT-KLIESNILSLSYGNIIQKLETR 271 P ++ + EW + + + + + +S Y N + Sbjct: 18 PKLRFPEFKDEWDKQKLKNLAHFQAGYAFKSGDMSSELEDYQIVKMSNVYKNELLLDRNP 77 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITSAYMAVKPHGID 329 + ++ ++V + S + E + ++ + ++ + Sbjct: 78 SFVNSINEKSKKFLLKQNDVVLTLTGTVGKRDYGYSVNIPESNKFLLNQRLVLLRGKKEN 137 Query: 330 STYLAWLMRSYDLCKVF---YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 S ++++L+++ F G+G + ++ +DVK + + P + EQ I + ++ Sbjct: 138 SLFISYLLKTDKFYYSFFNESKGGTGNQANVSSDDVKNIKLYSPAVAEQQKIASFLSAVD 197 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +I+ L ++ LL+ + + + Q+ + Sbjct: 198 EKINQL----KRKKELLQAYKKGMMQQLFSQQLRFKD 230 >gi|254881459|ref|ZP_05254169.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides sp. 4_3_47FAA] gi|254834252|gb|EET14561.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides sp. 4_3_47FAA] Length = 443 Score = 109 bits (273), Expect = 6e-22, Method: Composition-based stats. Identities = 60/411 (14%), Positives = 129/411 (31%), Gaps = 41/411 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W ++ + +G T + + YI + ++ + + + Sbjct: 30 ELPNSWVWCRLEDIAYVASGSTPDKTCFVENGVPYIKMYNLRNQKIDFAYHPQYITEEVH 89 Query: 76 STV---SIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLL 129 + S G ++ +GP L K I + ++++P L+ + Sbjct: 90 NGKLQRSRTEVGDLIMNIVGPPLGKLAIIPTTLPQANFNQAAVLIRPYKFKEVLVSYLKV 149 Query: 130 SIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 ++ I +I A + N+ +PIPPL E I E++ + I++L Sbjct: 150 YLEEMSEINSIATRGSAGQVNISLTQSQNMRIPIPPLNEVRRIIEEVSKYDILINSLKQN 209 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV----------------GLVPDH 231 L+ K ++ + L P + IE + VP Sbjct: 210 ITDIQNLIAYTKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTFDVPSG 269 Query: 232 WEVKPFFALVT-----------ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 W ++ + I LS ++ + Sbjct: 270 WITTNLGSIFNVVSAKRILKSDWKHSGVPFYRAREIAKLSIYGLVDNELYISEEHYNSLK 329 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 E + + +I+ + ++ + + + I++ Y+ +MRS Sbjct: 330 EKFPVPKASDIMISAVGTIGKCYIVKESDKFYYKDAS-VLCLCNDYQINAKYIYHIMRSE 388 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + K Y G ++ E K+ + +PP+ EQ I I + D Sbjct: 389 YMLKQMYDNSKGTTVDTITIEKAKQYILPLPPLAEQQRIVAKIEETFSIFD 439 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 60/194 (30%), Gaps = 12/194 (6%) Query: 227 LVPDHWEVKPFFALVTELNRKNT---KLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +P+ W + + +E+ + + N+ + + + E + Sbjct: 30 ELPNSWVWCRLEDIAYVASGSTPDKTCFVENGVPYIKMYNLRNQKIDFAYHPQYITEEVH 89 Query: 284 Q------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAW 335 + G+++ + K ++ + + +A + +YL Sbjct: 90 NGKLQRSRTEVGDLIMNIVGPPLGKLAIIPTTLPQANFNQAAVLIRPYKFKEVLVSYLKV 149 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + GS + ++ + + + +PP+ E I ++ I+ L + Sbjct: 150 YLEEMSEINSIATRGSAGQVNISLTQSQNMRIPIPPLNEVRRIIEEVSKYDILINSLKQN 209 Query: 396 IEQSIVLLKERRSS 409 I I L S Sbjct: 210 ITD-IQNLIAYTKS 222 >gi|18765822|gb|AAL78774.1|AF326623_1 JHP726-like protein [Helicobacter pylori] Length = 424 Score = 109 bits (273), Expect = 6e-22, Method: Composition-based stats. Identities = 50/398 (12%), Positives = 114/398 (28%), Gaps = 21/398 (5%) Query: 22 PKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD-TS 76 PK + + + G ++ K + + + + K + Sbjct: 13 PKGVEFRKLGDIGEFTRGNGLLKSDLQDKGRPVVHYGQIHTQYNLSIDKTISYVNEALFH 72 Query: 77 TVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + IL + + + + + + P+ + + + Sbjct: 73 KLKKAKPNDILIVTTSENVKDVGKSIAWLGNEEVAFSGEMYSYSTNENPKFIIYYFQTYF 132 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + E G + + I +PIPPL Q I + + A T L TE Sbjct: 133 FQKEKEKKITGTKVMRIHENDLKQITIPIPPLEIQQEIVKILDAFTELNTELNTELNARK 192 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + + + L+ N + E + P +K + + KL Sbjct: 193 KQYQYYQNMLLD------FNDINQNHKDAKEKLAQKPYPKRLKTLLQTLAPKGVEFRKLG 246 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 E + + + + + Y I S Sbjct: 247 EVCDFQKGKSITKKAVTFGKVPVISGGRQPAYYHNEANRSGETIAISSSGVYAGYVSYWD 306 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + + S ++ K + YL + + + +G + +D+ + +P Sbjct: 307 IPVFLADSFSVSPKQKTLMPKYLFHYLTTQQ-DAIHATKSTGGIPHVYSKDLLNFLIPIP 365 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 P++ Q +I +++ +A L+ I I K R+ Sbjct: 366 PLEIQQEIVKILDQFSALTTDLLAGIPAEI---KARKK 400 >gi|313158289|gb|EFR57691.1| type I restriction modification DNA specificity domain protein [Alistipes sp. HGB5] Length = 462 Score = 109 bits (273), Expect = 6e-22, Method: Composition-based stats. Identities = 58/439 (13%), Positives = 114/439 (25%), Gaps = 72/439 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP+ W+ + G T G +I+++ ++ + + Sbjct: 24 EIPQGWEWSRMGSIGDWGAGATPAKGNTSYYGGNILWLRTGELNNSIVNDTEIKITDKAL 83 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 ++ + G +L G + K IA + + P + L +L+ Sbjct: 84 KECSLRLNKAGDVLIAMYGATIGKVAIAGCELTTNQACCACTPIGIFNYYLFYFLMGN-- 141 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE----KIIAETVRIDTLITERI 189 EG + + + MPIPP+ EQ I E + + I Sbjct: 142 QVDFIKKGEGGAQPNISREKLVAHLMPIPPIQEQHRIVERIKDVLPLTDKYAHSQIALDE 201 Query: 190 RFIELLKEKKQALVSYIVTKGLNPD----------------------------------- 214 + + K++++ + L P Sbjct: 202 LNRSINGKLKKSILQEAIQGRLVPQVAEEGTAQELLEQIKLEKQKLVKEGNLKKSALSDS 261 Query: 215 ---------------VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 KD E +P+ W L N E Sbjct: 262 VIYKGDDNKYFEKIGTIEKDITDEIPFEIPNSWCWIRLNNLCNITNGFTPLRTEPKFWEN 321 Query: 260 SYGNIIQKLETRNMG---------LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 N + R G + + +IV G ++ Sbjct: 322 GNINWFTVEDIRKQGEYIYQTTQKITELAVSKDRIVRAGSVLLCCTASVGQCAMTMIPTT 381 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + ++ +L +++ + G + + + V + V +P Sbjct: 382 TNQQFNALTIKEEYRCLVNDEFLYLFVKTLAPI-LHDLAGKTTFEFISVKKVGNILVPIP 440 Query: 371 PIKEQFDITNVINVETARI 389 P+ EQ I V N A I Sbjct: 441 PVLEQCRICKVTNKAIASI 459 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 72/214 (33%), Gaps = 19/214 (8%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYG---NIIQKLE 269 K E +P WE ++ + NT NIL L G N I Sbjct: 15 KCIDEEIPFEIPQGWEWSRMGSIGDWGAGATPAKGNTSYYGGNILWLRTGELNNSIVNDT 74 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + K + ++ G+++ K ++ ++ A A P GI Sbjct: 75 EIKITDKALKECSLRLNKAGDVLIAMYGATIGKVAIAGCELT----TNQACCACTPIGIF 130 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + YL + + + G + ++ E + + +PPI+EQ I I Sbjct: 131 NYYLFYFLMGNQV-DFIKKGEGGAQPNISREKLVAHLMPIPPIQEQHRIVERIKDVLPLT 189 Query: 390 DVLVEKIEQSIVLLK-----ERRSSFIAAAVTGQ 418 D + ++ L + + S + A+ G+ Sbjct: 190 DKY-AHSQIALDELNRSINGKLKKSILQEAIQGR 222 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 51/173 (29%), Gaps = 9/173 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 IP W + + + G T + +I + +ED+ + Sbjct: 289 EIPNSWCWIRLNNLCNITNGFTPLRTEPKFWENGNINWFTVEDIRKQGEYIYQTTQKITE 348 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST--QFLVLQPKDVLPELLQGWLLS 130 S I G +L + A+ + + L +L Sbjct: 349 LAVSKDRIVRAGSVLLCCTASVGQCAMTMIPTTTNQQFNALTIKEEYRCLVNDEFLYLFV 408 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + + T K +GNI +PIPP+ EQ I + I + Sbjct: 409 KTLAPILHDLAGKTTFEFISVKKVGNILVPIPPVLEQCRICKVTNKAIASIMS 461 >gi|71900231|ref|ZP_00682369.1| hypothetical protein XfasoDRAFT_2382 [Xylella fastidiosa Ann-1] gi|71730004|gb|EAO32097.1| hypothetical protein XfasoDRAFT_2382 [Xylella fastidiosa Ann-1] Length = 159 Score = 109 bits (273), Expect = 7e-22, Method: Composition-based stats. Identities = 32/127 (25%), Positives = 57/127 (44%), Gaps = 3/127 (2%) Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 ++A + +I A + ++P +L + +RS + + + + +L Sbjct: 2 YASIGKAAILGIDAVINQAILGLEPKSNVLVPEFLFFWLRSLE-RHIKNLASTSTQANLN 60 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 VK LP+ P ++EQ I I E D + + E+ I L++E R I VTGQ Sbjct: 61 AAKVKALPIFFPSVEEQKQICGWIKNECRIFDDAITRTEEEITLIREYRDRLITDVVTGQ 120 Query: 419 IDLRGES 425 +D+RG Sbjct: 121 VDVRGWQ 127 >gi|261839336|gb|ACX99101.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori 52] Length = 421 Score = 109 bits (273), Expect = 7e-22, Method: Composition-based stats. Identities = 52/407 (12%), Positives = 128/407 (31%), Gaps = 27/407 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCDIALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKVLDAFTELNTELNTELKAR 191 Query: 192 IELLKEKKQALVS--YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + + + L+ I + K L P E + + L+ + Sbjct: 192 KKQYEYYQNMLLDFKGINQSHKDAKTYPKRLKTLLQTLAPKGVEFRKLGEVCEILDNRRI 251 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + ++ Y + + + + G ++ D + + Sbjct: 252 PIAKNKRNPGIYPYYGANGIQDYIDSYIFDGDFVLVGEDGSVI------NKDNTPVVNWA 305 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + A++ + + +L + +++ D+ +G + E++K++ + + Sbjct: 306 SGKIWVNNHAHVLQTKNELKLKFLYFYLQTIDV----SYCVAGTPPKINQENLKKITIPI 361 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 P++ Q +I +++ + L+ I I K+ R + Sbjct: 362 LPLEIQQEIVKILDQFSVLTTDLLAGIPAEIEARKKQYEYYREKLLT 408 >gi|302878638|ref|YP_003847202.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] gi|302581427|gb|ADL55438.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] Length = 410 Score = 109 bits (273), Expect = 7e-22, Method: Composition-based stats. Identities = 51/407 (12%), Positives = 123/407 (30%), Gaps = 32/407 (7%) Query: 25 WKVVPIKRFTK--LNTGRTSESGK---DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTV 78 W + ++ G ++ K I + D+ T Sbjct: 15 WSEESLINLSESGFTNGVFNDPKKTGRGYKLINVLDMYIETTIDENRLSLVELSDAEFKK 74 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD------GICSTQFLVLQPKDVLPELLQ--GWLLS 130 + G+I + + ++ + ++P+ + + L + Sbjct: 75 NKVEHGEIFFTRSSLVKEGIAFSNIYLGHSQDITFDGHLIRMRPRKDVLNSVFANYLLRT 134 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +++ A + ATM+ I + + P LAEQ I + A ++ L + Sbjct: 135 SKARKQLVARGKTATMTTIGQADIAAVMVMFPSLAEQTKIANFLTAVDQKLTQLTRKHDL 194 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + K Q + S + + + + + + A K++ Sbjct: 195 LTQYKKGVMQQIFSQELRFKDDDGCDFPEWDVVELEKI----------AAKVNKKNKDSA 244 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + S + G + Q + Y IV+ + V+ N Sbjct: 245 INNVLTNSATQGIVSQSDYFERDIANQNNLGGYYIVEIDDFVYNPRISANALVGPIKRNN 304 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLP 366 + G+++ Y + + ++ + ++ + R ++ E LP Sbjct: 305 LAVGVMSPLYNVFRFKAGNLNFIEQYFHTTHWHDYMKSVSNSGARHDRMNITNESFLGLP 364 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + P +KEQ I N + ID + + + +K+ + + Sbjct: 365 IPYPCLKEQTKIANFLTA----IDEKITTAKTQLEAVKQYKQGLLQQ 407 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 71/216 (32%), Gaps = 9/216 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ W + F V +K + + + Y + Sbjct: 4 PALRFDKGQAAWSEESLINLSESGFTNGVFNDPKKTGRGYKLINVLDMYIETTIDENRLS 63 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITSAYMAVKPHGI-- 328 + ++ V+ GEI F L + + + + + ++P Sbjct: 64 LVELSDAEFKKNKVEHGEIFFTRSSLVKEGIAFSNIYLGHSQDITFDGHLIRMRPRKDVL 123 Query: 329 DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 +S + +L+R+ K A G + ++ D+ + V+ P + EQ I N + Sbjct: 124 NSVFANYLLRTSKARKQLVARGKTATMTTIGQADIAAVMVMFPSLAEQTKIANFLTAVDQ 183 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ++ L K LL + + + + ++ + Sbjct: 184 KLTQLTRKH----DLLTQYKKGVMQQIFSQELRFKD 215 >gi|296454639|ref|YP_003661782.1| restriction endonuclease S subunit [Bifidobacterium longum subsp. longum JDM301] gi|296184070|gb|ADH00952.1| Restriction endonuclease S subunit [Bifidobacterium longum subsp. longum JDM301] Length = 398 Score = 109 bits (273), Expect = 7e-22, Method: Composition-based stats. Identities = 45/404 (11%), Positives = 117/404 (28%), Gaps = 42/404 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + +G+ + +E G G D + + Sbjct: 19 WEQRKFSDIVNVCSGKDYK-----------HLEEGPIPVYGTGGFMTSVDEALSY--DRD 65 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + G+ G + ++ T F + D+ ++ + ++ E Sbjct: 66 AVGIGRKGTIDKPYLLKAPFWTVDTLFYAIPKSDMD----LEFVHCSFLNVDWKSKDEST 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL--------ITERIRFIELLK 196 + + I +P EQ + + I I ++ ++ Sbjct: 122 GLPSLSKEAINETIALVPSFNEQSRLGDFFYNLDNLITLHQRKYDKLVIFKKSMLEKMFP 181 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + +++ +P + K G + + + + Sbjct: 182 KDGESVPEIRFAGFTDPWEQRK------FGDCFEFLKSNTLSRAGLNDENGTARNVHYGD 235 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQ---IVDPGEIVFRFIDLQNDKRSLRS--AQVM 311 + + +G+ + + + ++ I+ G+++F Sbjct: 236 ILIKFGDCLDGERSDLPFITDDTVLPKFAGSILREGDVIFADTAEDEAAGKCVELRKLPK 295 Query: 312 ERGIITSAYMAVKPHGIDST-YLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLV 369 E I + +P T YL + S + + G++ S+ ++ V Sbjct: 296 EPTISGLHTIPARPRFFFGTGYLGHYLNSDAYHRQLLPLMQGIKVISVSKAALQDTQVRF 355 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P + EQ I + + ID L+ ++ + LL+ + S + Sbjct: 356 PGLSEQAAIGAAL----SEIDNLITLHQRKLELLQNIKKSLLDK 395 >gi|18765824|gb|AAL78775.1|AF326624_1 HP848-like protein [Helicobacter pylori] Length = 436 Score = 109 bits (273), Expect = 7e-22, Method: Composition-based stats. Identities = 64/415 (15%), Positives = 128/415 (30%), Gaps = 27/415 (6%) Query: 22 PKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTG---KYLPKDGNSRQSD 74 PK + + G R + + I + ++++ + + N + Sbjct: 13 PKGVEFRKLGEVCDFQNGFAFQRKNFRNTGLPIIRISNIQNDRLLLDEVIYFSLNDYKGT 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWLLSI 131 KG IL G K I FD V + + + Sbjct: 73 NFEPFKITKGDILIAMSGATTGKIGILTFDTTLYLNQRVGKFKPKLLLKLNNKFLYYFLL 132 Query: 132 DVTQRIEAICEGATMSHADWKGI-GNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + ++ G + I I +PIPPL Q I + A T L TE Sbjct: 133 TKINFLYSLAGGGAQPNLSSNQILQQITIPIPPLEIQQEIVTILDAFTELNTELNTELNT 192 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++ K++ Q + ++ N + E + P +K + K Sbjct: 193 ELKARKKQYQYYQNMLLD--FNDINQSHKDAKERLAQKPYPKRLKTLLQTLAPKGVGFRK 250 Query: 251 LIESN---------ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 L E I +S N G Y D I Sbjct: 251 LGEVCESTNKKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFNNDGENITIASRGEYAG 310 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + + + G+ Y + + + +L + +++ ++ + + G +L D Sbjct: 311 FINYFNEKFFAGGLCYP-YKVKDTNELLTKFLYFYLKTNEIQIMENLVSRGSIPALNKAD 369 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ L + +PP++ Q +I +++ +A L+ I I K+ R + Sbjct: 370 IETLTIPIPPLEIQQEIVKILDQFSALTTDLLAGIPAEIKARKKQYEYYREKLLT 424 >gi|291461103|ref|ZP_06026993.2| putative type I restriction-modification system [Fusobacterium periodonticum ATCC 33693] gi|291378944|gb|EFE86462.1| putative type I restriction-modification system [Fusobacterium periodonticum ATCC 33693] Length = 433 Score = 109 bits (273), Expect = 8e-22, Method: Composition-based stats. Identities = 56/398 (14%), Positives = 130/398 (32%), Gaps = 38/398 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + K G+T I D+ +G P ++ + Sbjct: 30 PNGVEYKELGEIVKSQRGKTITKE----LIKDGDIPVISGGQKPAYYHNESN-------- 77 Query: 82 AKGQIL-YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG+++ G Y + D S F + K L + + + +I ++ Sbjct: 78 RKGEVITIAGSGAYAGFVMYWDKPIFVSDAFTIECDKSYLN-IKYIYYFLQNNQMKIHSL 136 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 +G + H +K + +P+PPL Q I + T ++ L ++ Sbjct: 137 KKGGGVPHVYFKDMQKFLVPVPPLEVQNEIARILDDYTKSVEE----------LKEKLNT 186 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L++ D +K + + +E K K T +I +++ Sbjct: 187 ELITRKKQYSWYRDYLLKFENKVKIVKLGGLFEFKNGINKEKSSFGKGTPIIN--YVNVY 244 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGIITS 318 N I + + + + V G++ F ++ S + +E + + Sbjct: 245 KKNKIYFEDLQGLVEATDDELIRYKVKRGDVFFTRTSETIEEIGFTSVLLEDIENCVFSG 304 Query: 319 AYMAVKP--HGIDSTYLAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + +P + Y A+ + + + R + + ++ + +PP++ Q Sbjct: 305 FLLRARPLTDLLLPEYCAYCFSTSSMRNAIIRKSTYTTRALINGTSLSQIEIPLPPLEVQ 364 Query: 376 FDITNVINVETARIDVL-------VEKIEQSIVLLKER 406 I V++ L +EK ++ ++ Sbjct: 365 KRIVEVLDNFEKTCKELNIELSSEIEKKQKEYEFVRNY 402 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 20/129 (15%), Positives = 52/129 (40%), Gaps = 7/129 (5%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 GE++ I + + + ++ Y+ + +++ + Sbjct: 78 RKGEVI--TIAGSGAYAGFVMYWDKPIFVSDAFTIECDKSYLNIKYIYYFLQNNQMKIHS 135 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE- 405 G G+ + F+D+++ V VPP++ Q +I +++ T ++ L EK+ ++ K+ Sbjct: 136 LKKGGGV-PHVYFKDMQKFLVPVPPLEVQNEIARILDDYTKSVEELKEKLNTELITRKKQ 194 Query: 406 ---RRSSFI 411 R + Sbjct: 195 YSWYRDYLL 203 >gi|329730359|gb|EGG66749.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus 21193] Length = 406 Score = 109 bits (273), Expect = 8e-22, Method: Composition-based stats. Identities = 51/394 (12%), Positives = 116/394 (29%), Gaps = 16/394 (4%) Query: 24 HWKVVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ FTK+N G K L + + I Sbjct: 20 EWEEKQFADFTKINQGLQIAINERKTEYSPELYFYITNEFLRPNSQTKYFIENPPQSVIA 79 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K IL + G + + + L L S + +I ++ Sbjct: 80 NKEDILMTRTGNTGKVVTNVFGAFHNNFFKIKFDKNLYDRLFLVEVLNSSKIQNKILSLA 139 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +T+ + +I P L EQ I + +I+ + + K Q Sbjct: 140 GSSTIPDLNHSDFYSISSSYPLLREQQKIGKFFSKLDRQIELEEQKLELLQQQKKGYMQK 199 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + S + D + +G + + ++ ++ + ++ Sbjct: 200 IFSQELRFKDENGEDYPDWKEKKLGDITE-------QSMYGIGASATRFDSKNIYIRITD 252 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL-RSAQVMERGIITSAY 320 + + P+ + +I+F K + + + + Sbjct: 253 IDEKSRKLNYQNLTTPDELNNKYKLKRNDILFARTGASTGKSYIHKEEKDIYNYYFAGFL 312 Query: 321 MAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + K + +S + S V + + E+ +LP+++P EQ I Sbjct: 313 IKFKINEQNSPLFIYQFTLTSKFNKWVKVMSVRSGQPGINSEEYAKLPLVLPNKLEQQKI 372 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++ R D +E +Q I +L++++ + Sbjct: 373 AKFLD----RFDRQIELEKQKIEILQQQKKGLLQ 402 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 66/190 (34%), Gaps = 11/190 (5%) Query: 24 HWKVVPIKRFTKLNT---GRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 WK + T+ + G ++ IYI + D++ + K ++ + + Sbjct: 217 DWKEKKLGDITEQSMYGIGASATRFDSKNIYIRITDIDEKSRKLNYQNLTTPDELNNKYK 276 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL------VLQPKDVLPELLQGWLLSIDV 133 + + IL+ + G K+ I + + + P + + L+ Sbjct: 277 L-KRNDILFARTGASTGKSYIHKEEKDIYNYYFAGFLIKFKINEQNSPLFIYQFTLTSKF 335 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ + + + + +P+ +P EQ I + + +I+ + + Sbjct: 336 NKWVKVMSVRSGQPGINSEEYAKLPLVLPNKLEQQKIAKFLDRFDRQIELEKQKIEILQQ 395 Query: 194 LLKEKKQALV 203 K Q++ Sbjct: 396 QKKGLLQSMF 405 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 49/178 (27%), Gaps = 9/178 (5%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+++ + EW + + RK E + Sbjct: 10 PELRFPEFEGEWEEKQFADFTKINQGLQIAINERKTEYSPELYFYITNEFLRPNSQTKYF 69 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + P+S I + +I+ + V + + D + Sbjct: 70 IENPPQSV----IANKEDILMTRTGNTGKVVT----NVFGAFHNNFFKIKFDKNLYDRLF 121 Query: 333 LAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 L ++ S + ++ GS L D + P ++EQ I + +I Sbjct: 122 LVEVLNSSKIQNKILSLAGSSTIPDLNHSDFYSISSSYPLLREQQKIGKFFSKLDRQI 179 >gi|163737286|ref|ZP_02144704.1| Type I restriction-modification system specificity subunit [Phaeobacter gallaeciensis BS107] gi|161389890|gb|EDQ14241.1| Type I restriction-modification system specificity subunit [Phaeobacter gallaeciensis BS107] Length = 425 Score = 109 bits (273), Expect = 8e-22, Method: Composition-based stats. Identities = 62/426 (14%), Positives = 145/426 (34%), Gaps = 49/426 (11%) Query: 30 IKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + ++ G S + + + D+ SG + + + A G Sbjct: 7 LGQKVEVLNGFAFPSSGFTTEDGLPLVRIRDIASGQTEVNFR------GKFDPAYLLANG 60 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +L G G +L + + D + + + + + + + I Sbjct: 61 DVLIGMDGDFL-VSRWSGGDALLNQRVCKVTSISSEVDQRFLYWFLQPHIEDIHRKTPQT 119 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T+ H K + +P P +Q I E + I + LK KQ L+ Sbjct: 120 TVRHLSTKDVRAVPSPAFVATQQSKIAEVLDTLDAA----IRGTEAVVAKLKAMKQGLLH 175 Query: 205 YIVTKGLNPD-----------VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK--- 250 ++T+G++ + K++ + W+ + E++ A V R Sbjct: 176 DLLTRGIDANGDLRPPHTKAPHLYKETPLGWLPKEWEVSEIQNMLASVDPAMRSGPFGSA 235 Query: 251 -----LIESNILSLSYGNIIQKLETRNMG--LKPESYET--YQIVDPGEIVFRFIDLQND 301 L+E + L N+ + RN + P + V P +++ + Sbjct: 236 LLKDELVEEGVPFLGIDNVFVERFDRNFKRFVTPGKFLQLQRYAVRPDDLMITIMGTVG- 294 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR---SYDLCKVFYA-MGSGLRQSL 357 R + R + + + + +++ S + + F G ++ Sbjct: 295 -RCCLVPLDVGRALSSKHTWTISLDEAKYSPYLAMLQVNYSDWVLRHFSKDQQGGTMSAI 353 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + E ++ + VPP EQ I +++ + R+ + + S L+ ++S + +TG Sbjct: 354 RSETIRSTLLPVPPRDEQEAIAAILSELSRRLR----EEQTSFEKLRLQKSGLMDDLLTG 409 Query: 418 QIDLRG 423 ++ + Sbjct: 410 RVPVTP 415 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 70/217 (32%), Gaps = 21/217 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKR-FTKL----NTGRTSES-------GKDIIYIGLEDV- 56 YK++ + W+ PK W+V I+ + +G + + + ++G+++V Sbjct: 199 YKETPLGWL---PKEWEVSEIQNMLASVDPAMRSGPFGSALLKDELVEEGVPFLGIDNVF 255 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + + + ++ +G R ++ G + Sbjct: 256 VERFDRNFKRFVTPGKFLQLQRYAVRPDDLMITIMGTVGRCCLVPLDVGRALSSKHTWTI 315 Query: 117 KDVLPEL-----LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 + + S V + +G TMS + I + +P+PP EQ I Sbjct: 316 SLDEAKYSPYLAMLQVNYSDWVLRHFSKDQQGGTMSAIRSETIRSTLLPVPPRDEQEAIA 375 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + + R+ T + L++ V Sbjct: 376 AILSELSRRLREEQTSFEKLRLQKSGLMDDLLTGRVP 412 >gi|315639287|ref|ZP_07894449.1| type I site-specific deoxyribonuclease [Campylobacter upsaliensis JV21] gi|315480613|gb|EFU71255.1| type I site-specific deoxyribonuclease [Campylobacter upsaliensis JV21] Length = 406 Score = 109 bits (272), Expect = 8e-22, Method: Composition-based stats. Identities = 52/398 (13%), Positives = 127/398 (31%), Gaps = 24/398 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + P+ + + ++ ++ + + + +D D S ++ Sbjct: 21 PNGVEFKPLGEVIERVRRKV-KNLNNVNVYSVSNSQGLILSTDFRDRKLYSEDISNYTLI 79 Query: 82 AKGQILYGKLGPYLRKAII-ADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIE 138 KG+ Y + D G S ++V + K + + L +L S ++I Sbjct: 80 QKGEFAYNPARLNIGSIAFLTDEVGAVSPMYVVFKIDEKSLNQKFLFYFLKSPTTLRKIV 139 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ E D+K P+P+PPL Q I E + A T L E ++ Sbjct: 140 SLTETGARFRFDFKRWEKFPIPLPPLEIQYKIVEILDAFTELEAELEAELEARLKQYHYY 199 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + L+S+ + + V V + L + + Sbjct: 200 RNKLLSHDELENRTAKSRNDSDPATLVPYVRLGEACEILDNLRKPITKSKRTQGIYPYYG 259 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + + I V ++ + + Sbjct: 260 ANGIQDYVNEYIFDGDFLLMGEDGSVINKDNSPVLNWVSGKFWV------------NNHA 307 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + K + + ++ + +++ D+ + G+ + +++K + + +PP+ Q +I Sbjct: 308 HILKEKSNTTNLRFVFFYLQTCDVSSIVR----GVPPKINQQNLKTIQIPLPPLAVQNEI 363 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +++ + L I I K+ R ++ Sbjct: 364 VELLDKFDTLTNDLTSGIPAEIEARKKQYEYYRERLLS 401 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 78/190 (41%), Gaps = 7/190 (3%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG-NIIQKLETRNMGLKPESYETYQI 285 P+ E KP ++ + RK L N+ S+S +I + R+ L E Y + Sbjct: 19 HCPNGVEFKPLGEVIERVRRKVKNLNNVNVYSVSNSQGLILSTDFRDRKLYSEDISNYTL 78 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMRSYD-L 342 + GE + L + E G ++ Y+ ++ +L + ++S L Sbjct: 79 IQKGEFAYNPARLN---IGSIAFLTDEVGAVSPMYVVFKIDEKSLNQKFLFYFLKSPTTL 135 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K+ +G R F+ ++ P+ +PP++ Q+ I +++ T L ++E + Sbjct: 136 RKIVSLTETGARFRFDFKRWEKFPIPLPPLEIQYKIVEILDAFTELEAELEAELEARLKQ 195 Query: 403 LKERRSSFIA 412 R+ ++ Sbjct: 196 YHYYRNKLLS 205 >gi|148360829|ref|YP_001252036.1| hypothetical protein LPC_2789 [Legionella pneumophila str. Corby] gi|148282602|gb|ABQ56690.1| hypothetical protein LPC_2789 [Legionella pneumophila str. Corby] Length = 424 Score = 109 bits (272), Expect = 8e-22, Method: Composition-based stats. Identities = 62/362 (17%), Positives = 126/362 (34%), Gaps = 22/362 (6%) Query: 72 QSDTSTVSIFAKGQILYGKL---GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 SD ST IF K +++ + + GI S ++ L P L + Sbjct: 61 SSDYSTYQIFEKDDLVFKLIDLENIKTSRVGYVPRRGIMSPAYIRLTPTSELVIPRYYYW 120 Query: 129 -LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I G + + P+P+ P Q+ I + E RID LI + Sbjct: 121 LFYAAYINNIFNGMGGGVRQNLTPTDLLEFPIPLTPKETQIEITNFLDREIDRIDQLIEK 180 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + + I L++E++ V + +N V++ + T R Sbjct: 181 KKKLICLMRERESNAVREAIFSLINEGVQIWKL----------SHVCRVQRGKFTHRPRN 230 Query: 248 NTKLIESNILSLSYGNI---IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 +L + + + G++ + + L ++ + Sbjct: 231 APELYDGEVPFIQTGDVARANKFITKHKQTLSELGISVSAKFPSNTLLMAIAANVGNLAI 290 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 E S + ++S YL +++R+ + + + + + Sbjct: 291 T----TYEVYCPDSIVGFIPTEKVESEYLYYVLRAISDDISSSSTSN-AQDNTNVARLGS 345 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 L + +P I++Q ++ + +E + KI SI L E R + I+ AVTGQ+D++ Sbjct: 346 LKIPLPSIQKQKNLIDKFKIEENLLFKTTSKISNSITKLNEFRCALISEAVTGQLDIKSW 405 Query: 425 SQ 426 + Sbjct: 406 KK 407 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 61/179 (34%), Positives = 96/179 (53%), Gaps = 3/179 (1%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 W+ P + + N LIE + L+L+ G +I + GL+ Y TYQI + Sbjct: 14 YRWKSVPTKRNFRNIKQINKGLIEEHRLALTLGGVIDRSLDDVEGLQSSDYSTYQIFEKD 73 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFY 347 ++VF+ IDL+N K S R V RGI++ AY+ + P Y WL + + +F Sbjct: 74 DLVFKLIDLENIKTS-RVGYVPRRGIMSPAYIRLTPTSELVIPRYYYWLFYAAYINNIFN 132 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 MG G+RQ+L D+ P+ + P + Q +ITN ++ E RID L+EK ++ I L++ER Sbjct: 133 GMGGGVRQNLTPTDLLEFPIPLTPKETQIEITNFLDREIDRIDQLIEKKKKLICLMRER 191 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 78/219 (35%), Gaps = 11/219 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 I + ++ + ++ G+ + ++ +I DV + Sbjct: 204 INEGVQIWKLSHVCRVQRGKFTHRPRNAPELYDGEVPFIQTGDVARANKFITKHKQTLSE 263 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S + F +L + + I ++ C + P + + E + + Sbjct: 264 LGISVSAKFPSNTLLMA-IAANVGNLAITTYEVYCPDSIVGFIPTEKV-ESEYLYYVLRA 321 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ I + + + +G++ +P+P + +Q + +K E + ++ I Sbjct: 322 ISDDISSSSTSNAQDNTNVARLGSLKIPLPSIQKQKNLIDKFKIEENLLFKTTSKISNSI 381 Query: 193 ELLKEKKQALVSYIVTKGLN-PDVKMKDSGIEWVGLVPD 230 L E + AL+S VT L+ K + S E + + + Sbjct: 382 TKLNEFRCALISEAVTGQLDIKSWKKRGSTDERLDNIEE 420 >gi|148656808|ref|YP_001277013.1| restriction modification system DNA specificity subunit [Roseiflexus sp. RS-1] gi|148568918|gb|ABQ91063.1| restriction modification system DNA specificity domain [Roseiflexus sp. RS-1] Length = 290 Score = 109 bits (272), Expect = 8e-22, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 87/229 (37%), Gaps = 11/229 (4%) Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIES 254 E K++L+ ++ T G P + + ++ +G +P HW V + T R Sbjct: 45 ELKKSLMQHLFTYGPVPVTERERVPLQETEIGPLPAHWRVVRLGEVATLFTRGIDPANAG 104 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + +I K + + G++++ + DK L + Sbjct: 105 AKRYIGLEHIEPGNIRIQHWGKADDVRSLKTAFQQGDVLYGKLRPYLDKAVLA---EWDG 161 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPI 372 T + + +LA+L+ + + +G+ ++ +++ P+ +PP+ Sbjct: 162 ICSTDILVIKAQSSLLPEFLAYLVHTSQFIDYAISTTTGVNHPRTSWKALQKFPISLPPL 221 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ +I ++ A+ + + L+E + + +TGQI + Sbjct: 222 DEQREIARMLQAVDAK----IAAEQARRAALEELFKTLLHQLMTGQIRV 266 Score = 83.3 bits (204), Expect = 8e-14, Method: Composition-based stats. Identities = 58/189 (30%), Positives = 85/189 (44%), Gaps = 4/189 (2%) Query: 18 IGAIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IG +P HW+VV + G + YIGLE +E G + + S Sbjct: 75 IGPLPAHWRVVRLGEVATLFTRGIDPANAGAKRYIGLEHIEPGNIRI--QHWGKADDVRS 132 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSIDVTQ 135 + F +G +LYGKL PYL KA++A++DGICST LV++ + LPE L + + Sbjct: 133 LKTAFQQGDVLYGKLRPYLDKAVLAEWDGICSTDILVIKAQSSLLPEFLAYLVHTSQFID 192 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + G WK + P+ +PPL EQ I + A +I R EL Sbjct: 193 YAISTTTGVNHPRTSWKALQKFPISLPPLDEQREIARMLQAVDAKIAAEQARRAALEELF 252 Query: 196 KEKKQALVS 204 K L++ Sbjct: 253 KTLLHQLMT 261 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 12/43 (27%), Positives = 16/43 (37%), Gaps = 4/43 (9%) Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 EQ I +V+ E E I LKE + S + T Sbjct: 18 EQRAIAHVLRTV----QWAKEATEGVIAALKELKKSLMQHLFT 56 >gi|254362756|ref|ZP_04978839.1| type I site-specific deoxyribonuclease specificity subunit [Mannheimia haemolytica PHL213] gi|153094384|gb|EDN75235.1| type I site-specific deoxyribonuclease specificity subunit [Mannheimia haemolytica PHL213] Length = 495 Score = 109 bits (272), Expect = 8e-22, Method: Composition-based stats. Identities = 60/434 (13%), Positives = 123/434 (28%), Gaps = 69/434 (15%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP+ W V + K+ G D +YI ++++ +++ Sbjct: 68 EIPESWVWVRLGDICLKITDGTHHSPPNIDKSDFLYITAKNIKKDGLDLSKISYVTKEIH 127 Query: 75 TSTVSIF--AKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLL 129 S KG ILY K G +II + + S+ L+ +++ E L + Sbjct: 128 NEIFSRCNPEKGDILYIKDGATTGVSIINTLNEPFSMLSSVALIKTSQEIDNEYLNYVMN 187 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S G + + + +P+PPL EQ I +KI ++ Sbjct: 188 SHYFYNISIGSMSGTGIPRITLTKLESYLVPVPPLLEQQRIVQKIEELLPLVERYEQTEQ 247 Query: 190 RFIELLK----EKKQALVSYIVTKGLNPDVKM---------------------------- 217 + +L + K++++ + L Sbjct: 248 QLTKLNNTFPEQLKKSVLHAAIQGKLTEQDPNDELASCLIERIKAEKNRLIAEKKLKKPK 307 Query: 218 --------------------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + E +P +W +R+ + Sbjct: 308 SVSEIVMRDNLPYEIKAGQERCIADEVPFEIPQNWIWVRLENYSLNHDRRRKP------V 361 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S++ + KL I D I+ + + V + Sbjct: 362 SVAQRSQQNKLYDYYGATGAIDKVASYIFDGKFILIGEDGGNFFTKKDVAFIVEGKFWAN 421 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + Y + + + +L + G L ++ + + +PPI EQ Sbjct: 422 NHVHVLSVDFNLEKYFCYYLNALNLPSMGLINGI-AVPKLNQRNLNSILIAIPPISEQHR 480 Query: 378 ITNVINVETARIDV 391 I I + I+ Sbjct: 481 IVEKIEKLFSEIEK 494 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 86/244 (35%), Gaps = 17/244 (6%) Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV----TELN 245 + ++ +K L++ K IE +P+ W + + Sbjct: 31 ELLCKIQAEKDRLIAEGKIKKNKKTADKAPYTIEPPFEIPESWVWVRLGDICLKITDGTH 90 Query: 246 RKNTKLIESNILSLSYGNIIQKL-----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + +S+ L ++ NI + + + + G+I++ Sbjct: 91 HSPPNIDKSDFLYITAKNIKKDGLDLSKISYVTKEIHNEIFSRCNPEKGDILYIKDGATT 150 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKF 359 S+ + +++S + ID+ YL ++M S+ + +M + Sbjct: 151 -GVSIINTLNEPFSMLSSVALIKTSQEIDNEYLNYVMNSHYFYNISIGSMSGTGIPRITL 209 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-----KERRSSFIAAA 414 ++ V VPP+ EQ I I ++ E+ EQ + L ++ + S + AA Sbjct: 210 TKLESYLVPVPPLLEQQRIVQKIEELLPLVERY-EQTEQQLTKLNNTFPEQLKKSVLHAA 268 Query: 415 VTGQ 418 + G+ Sbjct: 269 IQGK 272 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 62/169 (36%), Gaps = 14/169 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP++W V ++ ++ + R + + S K G + D Sbjct: 337 EIPQNWIWVRLENYSLNHDRRRKP-------VSVAQ-RSQQNKLYDYYGATGAIDKVASY 388 Query: 80 IFAKGQILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 IF IL G+ G A I + + VL L + +L ++++ Sbjct: 389 IFDGKFILIGEDGGNFFTKKDVAFIVEGKFWANNHVHVLSVDFNLEKYFCYYLNALNLPS 448 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + G + + + + +I + IPP++EQ I EKI I+ Sbjct: 449 M--GLINGIAVPKLNQRNLNSILIAIPPISEQHRIVEKIEKLFSEIEKF 495 >gi|157151665|ref|YP_001449873.1| HsdS specificity protein of type I restriction-modification system [Streptococcus gordonii str. Challis substr. CH1] gi|157076459|gb|ABV11142.1| HsdS specificity protein of type I restriction-modification system [Streptococcus gordonii str. Challis substr. CH1] Length = 402 Score = 109 bits (272), Expect = 9e-22, Method: Composition-based stats. Identities = 48/400 (12%), Positives = 118/400 (29%), Gaps = 26/400 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + ++ +G T D I +I D+ + + + + + Sbjct: 15 WTKSKLGEIYEVYSGNTPSRSDDRNYQNGEIPWIKTTDLNNTVICSNEEKISVYGA--TK 72 Query: 78 VSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + + + +L G + + + + + + L P + + L+ Sbjct: 73 LKVLPEKSVLIAMYGGFNQIGRTGLLAYPATINQALAALMPVNEINPNFLLNFLNFKKES 132 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + P L EQ I + + + L Sbjct: 133 WRNVAASSRKDPNITKNDVEKFKISFPSLDEQSAIGSLFRTLDDLLASYKDNLANYQSLK 192 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + P++++ EWV + E+ + Sbjct: 193 ATMLSKMFPKAGQT--VPEIRLDGFEGEWV-----EVNLGTLIDNRDEIISGASGF--PI 243 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 S G +Q + + V G + +R + + + ++ + + Sbjct: 244 ATSSRKGLYLQNDYFEGGRTGIDLTLDFHRVPMGYVTYRHMSDDSIFKFNKNNLETDVLV 303 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIK 373 + + D +L + + + L F M G R L ++++ + VP IK Sbjct: 304 SKEYPVFISNDSSDIDFLLYHLNNSRLFLRFSTMQKLGGTRVRLYYKNLITYKLAVPTIK 363 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I + + +D L+ ++ I L+ + + Sbjct: 364 EQQAIGSY----FSNLDNLITAHQEKISQLETLKKKLLQD 399 >gi|295101713|emb|CBK99258.1| Restriction endonuclease S subunits [Faecalibacterium prausnitzii L2-6] Length = 372 Score = 109 bits (272), Expect = 9e-22, Method: Composition-based stats. Identities = 52/389 (13%), Positives = 111/389 (28%), Gaps = 22/389 (5%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 ++ +N G++ +S G + R + I + IL Sbjct: 3 RLEEICAINMGQSPDSSTYNEDGNGLPFFQGNADFGEIYPAVRMWCSGPTKIAREKDILI 62 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 P IA+ + L + + W + + + G+T Sbjct: 63 SVRAPI-GALNIANCECCIGRGLAALTVNEDICAQEYLWHVLSGKVDELNSKGTGSTFKA 121 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + K + +P+PP+ EQ I + + I + + E+ + + V Sbjct: 122 INKKTLSETEIPLPPIDEQRKIAAVLDKVSGLIAKRRQQLDKLDEI-------VKAKFVE 174 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 +P +G + + +N I S + L Sbjct: 175 MFGDPVGNPMGWEKIALGKR----CDIVTGNTPSRADPENYGNFIEWIKSDNINTPAVLL 230 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 L + + V+ G ++ I + A R A+ P Sbjct: 231 TEAQEYLSETGFHKCRFVEAGSLLMTCIAGSINCIG-NVAVTDRRVAFNQQINAIVPKQD 289 Query: 329 DSTYLAW--LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 D YL W L+ + G+ L + + PP++ Q + + Sbjct: 290 DVLYLYWLMLLSKPAIHSTINMALKGI---LSKGQLSEMAFPFPPLELQNQFSVFV---- 342 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + I +S+ L+ + + + Sbjct: 343 KKTEKTKANINRSLEKLETLKKALMQEYF 371 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 53/191 (27%), Gaps = 11/191 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 P W+ + + + + TG T I +I +++ + ++ Sbjct: 183 PMGWEKIALGKRCDIVTGNTPSRADPENYGNFIEWIKSDNINTPAVLLTEAQEYLSETGF 242 Query: 76 STVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 G +L + + + D + Q + PK ++L + L + Sbjct: 243 HKCRFVEAGSLLMTCIAGSINCIGNVAVTDRRVAFNQQINAIVPKQ--DDVLYLYWLMLL 300 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 I + A + + P PPL Q + + + Sbjct: 301 SKPAIHSTINMALKGILSKGQLSEMAFPFPPLELQNQFSVFVKKTEKTKANINRSLEKLE 360 Query: 193 ELLKEKKQALV 203 L K Q Sbjct: 361 TLKKALMQEYF 371 >gi|258515814|ref|YP_003192036.1| restriction modification system DNA specificity domain-containing protein [Desulfotomaculum acetoxidans DSM 771] gi|257779519|gb|ACV63413.1| restriction modification system DNA specificity domain protein [Desulfotomaculum acetoxidans DSM 771] Length = 400 Score = 109 bits (272), Expect = 9e-22, Method: Composition-based stats. Identities = 47/385 (12%), Positives = 119/385 (30%), Gaps = 25/385 (6%) Query: 33 FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG 92 + + + S ++ I E + + + + + G + L Sbjct: 31 IFEPISNKNHNSDLPVLAITQEHGAIP-RDQIDYNVSVTDKSLESYKVVEIGDFIIS-LR 88 Query: 93 PYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQRIEAICEGATM-SHAD 150 + + + GICS +++L+ + + + + + + + + EG Sbjct: 89 SFQGGIEYSLYHGICSPAYIILRKRVPIVDQYYKHYFKTGRFIKDLNKDLEGIRDGKMVS 148 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 ++ I +P P EQ I + + ID LI + +E L K+ L+ + Sbjct: 149 YRQFSAIMLPKPDRKEQQKIADCL----SSIDDLIAAEDKKLEALGAHKRGLMQKLFPAE 204 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 + + G W + P + L+ + + E + Sbjct: 205 GKTLPEWRFPEFRGSGE----WVISPLSEVCENLDSRRIPITEKDRKKGFTPYYGASGIV 260 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + + G + + + + + A++ + Sbjct: 261 DYVDGFIFDEVLLCVSEDGANLVART------YPIAFSISGKTWVNNHAHVLKFQNSNTQ 314 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + + S +L M + L + +P+ +P KEQ I + + + ID Sbjct: 315 VMVKNYINSINLEDFLTGMA---QPKLNRAKLDIIPIPLPSEKEQQKIADCL----SSID 367 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAV 415 L+ + + L+ + + Sbjct: 368 DLIAGQVKKLEALRTHKKGLMQGLF 392 >gi|297157213|gb|ADI06925.1| restriction modification system DNA specificity subunit [Streptomyces bingchenggensis BCW-1] Length = 407 Score = 109 bits (272), Expect = 9e-22, Method: Composition-based stats. Identities = 84/419 (20%), Positives = 148/419 (35%), Gaps = 44/419 (10%) Query: 24 HWKVVPIKRFTKL-NTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W +P+K TG I I +++ GK +P ++ +T Sbjct: 2 SWATIPLKFLATSAQTGPFGSQLHSDQYITDGIPVINPSNIK--DGKLVPDRNSTVSVET 59 Query: 76 STVSIFA---KGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDV--LPELLQGWL 128 + G I++ + G R AI+ +C T + ++ L Sbjct: 60 AARLAVHRLLSGDIIFARRGELGRSAIVTKSAEGWLCGTGSIRVRINQNRLDYRFAGYAL 119 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 ++ + G+TM + + + + +P+ +P LA Q I + + +ET +ID + Sbjct: 120 QNLQTYSYFQKQSVGSTMENLNTEIVLGLPVALPTLANQRRIADFLDSETEKIDAFTHKT 179 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 R + LL EK + + + G + + V+ L+T++ R Sbjct: 180 RRLLHLLDEKIASRI-------------LGHVGASQLNDIHSGSPVREINKLLTKVVRPP 226 Query: 249 TKLIESNILSL-SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 E + + Q VD G+IV +D Sbjct: 227 IADGEVITAYRDGQVTARSLRRAEGYTVSATTEAQGQRVDRGDIVIHGLDGFAGAIGTSE 286 Query: 308 AQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL----KFEDV 362 A G + Y P +G DS + L+R L + + R+ + Sbjct: 287 A----AGNCSPVYHVCIPRNGGDSLFYGRLLRILALSEYLGPFATSTRERAVDFRNWNLF 342 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 R+P+ KEQ +I I L +++S L ERR + I AAVTGQID+ Sbjct: 343 GRIPIPDVSFKEQQEIGEWI----KSARPLRIAVDRSNALAIERRQALITAAVTGQIDV 397 >gi|315026883|gb|EFT38815.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX2137] Length = 395 Score = 109 bits (272), Expect = 9e-22, Method: Composition-based stats. Identities = 65/395 (16%), Positives = 128/395 (32%), Gaps = 20/395 (5%) Query: 25 WKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + + + DI + + + ++ ++ Sbjct: 10 WEQCKLGDLGSVAMNKRIFKEQTSESGDIPFYKIGTFGATADAFISRELFET--YKKKYP 67 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G +L G R D +V D L V Sbjct: 68 YPKIGDLLISASGSIGRVVEYKGNDEYFQDSNIVWLKHDDRINNLFLKQFYSIVKWHGL- 126 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG+T+ K I + +P EQ EKI ++D +IT R +E LKE K Sbjct: 127 --EGSTIKRLYNKNILETTIHLPVFDEQ----EKIGTLFKQLDDIITLHQRKLEQLKELK 180 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 +A + + K +++ + E + N+ +I L Sbjct: 181 KAYLQLMFPKKDETLPRVRFADFEGEWEQCKLKNLFLKGGSGGTPTSSNSDYYNGDIPFL 240 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 S +I + K S E + + I L + A + + A Sbjct: 241 SISDITKSNGYIYTTEKCISLEGLKNSSAWIVPKESISLAMYASVGKVAILKLDIATSQA 300 Query: 320 YMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + I+ + +L++ + + +G + +L + VK VL+P EQ Sbjct: 301 FYNMIFEDINTRNYIYHYLIKKEVFNEWITLISTGTQANLNADKVKNTFVLIPSNNEQKK 360 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I ++ I+V ++ ++ I +LK + S++ Sbjct: 361 IAELLRC----IEVSIDIQQKKIHILKSLKKSYLQ 391 >gi|254786395|ref|YP_003073824.1| type I restriction modification DNA specificity domain-containing protein [Teredinibacter turnerae T7901] gi|237685013|gb|ACR12277.1| Type I restriction modification DNA specificity domain protein [Teredinibacter turnerae T7901] Length = 424 Score = 109 bits (272), Expect = 9e-22, Method: Composition-based stats. Identities = 53/402 (13%), Positives = 109/402 (27%), Gaps = 18/402 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ PI K T + +E+ ++ + I + Y K D + + K Sbjct: 24 GWERKPIGDGFKRVTNKNTENNQNALTISAQQGLVSQLDYFNKK--VAAKDLAGYYLMHK 81 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLL---SIDVTQ 135 G Y K G+ ST ++ + ++ Sbjct: 82 GDFAYNKSYSQGYPMGAIKPLKLYEKGVVSTLYICFRANRGFCNEFYEQYFEAGMLNQQI 141 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 A G + + ++ T ID LIT + ++ L Sbjct: 142 ESIAQEGGRAHGLLNVSVKEFFKDVDILVPTIEEQQKIADCLTS-IDELITLHTQKLDAL 200 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH--WEVKPFFALVTELNRKNTKLIE 253 K K+ L+ + KM+ G +V + T Sbjct: 201 KAHKKGLMQQLFPIEGKKVPKMRFPEFRKAGEWEKCALSDVATIRSGSTPSRSNPEFYEG 260 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +I + ++ T + G ++ +V Sbjct: 261 GDIPWVKTTDLNNSFITVTEECVTSKAKVKINA-IGSVLVAMYGGFKQIGRTGMLKVPAA 319 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + V + Y+ + + A S ++ DV + P+ P I Sbjct: 320 TNQALSVLNVDRKQVAPEYVLVWLNAKVGLWRKIASSSRKDPNITGSDVSKFPISFPEIG 379 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I + I + ++ + + I L ++ + Sbjct: 380 EQRKIVDCIFSV----EEMISEQSEKISSLIAHKNGLVQKLF 417 >gi|307711301|ref|ZP_07647722.1| type I restriction modification DNA specificity domain protein [Streptococcus mitis SK321] gi|307616952|gb|EFN96131.1| type I restriction modification DNA specificity domain protein [Streptococcus mitis SK321] Length = 377 Score = 109 bits (272), Expect = 1e-21, Method: Composition-based stats. Identities = 58/394 (14%), Positives = 119/394 (30%), Gaps = 44/394 (11%) Query: 25 WKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W + +L GR + + + + + + + Y D S + Sbjct: 16 WVEKKLGEVAELLNGRAYKQEELLEDGEYRVLRVGNFNTNDRWYYS-DLQLEDSKYANY- 73 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G +LY + + I + + + + RI+ Sbjct: 74 ----GDLLY-LWATNFGPELWKEEKVIYHYHIWKISGYSNILDKYYFYTFLEKDKDRIKQ 128 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+TM H + + P L EQ I ID LI+ + R +E+LKE+K Sbjct: 129 NTNGSTMVHITKGMMEERVLTFPSLPEQTAIGSF----FQDIDQLISLQQRKLEVLKEQK 184 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + + + ++ +G E + L + L+ Sbjct: 185 KTYLKLLFPAKGQTKPALRFAGFE---DEWTSVLLGDISELYQPKTISSEDLLTEGFPVF 241 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 I + N I + + S I ++ Sbjct: 242 GANGYIGYYKDYNHKENQ----------------VTISARGEGTGTPSFVEGPVWITGNS 285 Query: 320 YMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + ++L S+D G + L E +K++ +++P + EQ Sbjct: 286 MVVNVEKQDNITKSFLYAFCLSFDFKPFV---TGGAQPQLTREVLKKVNIMLPSLSEQEA 342 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 I + +D + K E+ + LKE + + + Sbjct: 343 IGSF----FQDLDKAIAKQEEKVNQLKESKQTLL 372 >gi|253998802|ref|YP_003050865.1| restriction modification system DNA specificity domain-containing protein [Methylovorus sp. SIP3-4] gi|253985481|gb|ACT50338.1| restriction modification system DNA specificity domain protein [Methylovorus sp. SIP3-4] Length = 795 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 60/472 (12%), Positives = 128/472 (27%), Gaps = 98/472 (20%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W+ + ++ GR S + + I ++++ + N D Sbjct: 100 ELPAGWQWAKLGMLMEMFNGRAFSQTEWSYEGLPIIRIQNLNDKNAPF-----NYFNGDV 154 Query: 76 STVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPK-DVLPELLQGWLLSI 131 S + G L G I + G + + + + ++ Sbjct: 155 SETNYVEPGTFLISWSGTPGTSFGAFIWSGAPGALNQHINKCMIFGEEINKQYLRLAVNS 214 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR----------- 180 + IE G + H + N IPPLAEQ I K+ Sbjct: 215 CMDHLIENAQGGVGLKHVTKGTLNNCVFAIPPLAEQYRIVAKVDELMALCDQLEQQTDAS 274 Query: 181 ------------------------------IDTLITERIRFIELLKEKKQALVSYIVTKG 210 I + + + KQ ++ V Sbjct: 275 LSAHQTLVETLLNALTSTADHAQFASSWQRIAEHFDTLFTTEDSIDQLKQTILQLAVMGK 334 Query: 211 LNPDVKMKDSGIEWV-------------GLVPDHWEVKPFFALVTELNRKNTK------- 250 L P + E + G + + P A ++ Sbjct: 335 LVPQDPNDEPASELIKKIAADKARLVKKGRINKDNPLPPISADEKPFLEQSAWQFVRLLS 394 Query: 251 ------------------LIESNILSLSYGNIIQKLETRNMGLKPESYETY----QIVDP 288 + I ++ ++I + + ++ + + + + Sbjct: 395 LSYEIGTGPFGSMIHQSDYVSGGIPLVNPSHMIDDVISEDIAVAVDHEKAKELTSYRLCA 454 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G+IV + + S Y+ + P I ++A + R+ Sbjct: 455 GDIVLARRGEVGRCAIVTEREDGWLCGTGSFYLRLPP-AISRRFMALVFRATTTRSYLVG 513 Query: 349 MG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +L + LP+ +PP+ EQ+ I ++ A D L ++ + Sbjct: 514 KAVGTTMVNLNHGILNSLPIALPPLGEQYRIVAKVDELIALCDQLTSRLRAA 565 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 57/198 (28%), Gaps = 4/198 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ-KLETRNMGLKPE 278 + E +P W+ L+ N + E + L I + Sbjct: 93 TDDEQSFELPAGWQWAKLGMLMEMFNGRAFSQTEWSYEGLPIIRIQNLNDKNAPFNYFNG 152 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWL 336 V+PG + + G + I+ YL Sbjct: 153 DVSETNYVEPGTFLISWSGTPGTSFG-AFIWSGAPGALNQHINKCMIFGEEINKQYLRLA 211 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + S + A G + + + +PP+ EQ+ I ++ A D L ++ Sbjct: 212 VNSCMDHLIENAQGGVGLKHVTKGTLNNCVFAIPPLAEQYRIVAKVDELMALCDQLEQQT 271 Query: 397 EQSIVLLKERRSSFIAAA 414 + S+ + + + A Sbjct: 272 DASLSAHQTLVETLLNAL 289 >gi|1841496|emb|CAA71896.1| StySKI methylase [Salmonella enterica] Length = 587 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 76/509 (14%), Positives = 141/509 (27%), Gaps = 101/509 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLE 54 +K K P+ S + +P W+ V G+T KD I ++ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPVGWEWVTFSHLGHFFGGKTPSKMKDEYWGGTIPWVTPK 140 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQF 111 D+++ S + ++ + G IL+ LR I + + Sbjct: 141 DMKTNLIVDSEDKVTSLAIE-DGLTKVSPGSILFVARSGILRRIFPVAITSIECTVNQDL 199 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQV-- 168 VL P +++ +E + + G T+ + + P IPP AEQ Sbjct: 200 KVLSPFLSEISYYIRLMMNGFERYIVENLTKTGTTVESLLFDDFISHPFMIPPFAEQNRI 259 Query: 169 ---------------------------------------LIREKIIAETVRIDTLITERI 189 +++ RI Sbjct: 260 LSTVKKLMSLCDQLEQHSLTSLDAHQQLVETLLTTLTDSQNADELAENWARISEHFDTLF 319 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------------ 219 ++ KQ ++ V L P + Sbjct: 320 TTEPSIEALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKDGKIKKQKPLPP 379 Query: 220 -SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN----ILSLSYGNIIQKLETRNMG 274 S E +P+ WE + + K + N + N+ + Sbjct: 380 ISEEEKPFELPEGWEWCRLEEIAYIFSGNAFKSEDFNESAGTKCIKITNVGVHEFIESQD 439 Query: 275 LKPESYETYQ---IVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYMAVKPHGID 329 P + V G+++ ++ A++ Sbjct: 440 YLPSDFNKSYHNFRVYSGDMIIAMTRPYISSGLKICICPDNYHNALLNQRVCAIRLSHF- 498 Query: 330 STYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 S Y ++S + + SGL+ +LK D+ L + VPP EQ I N IN Sbjct: 499 SEYYYLFLKSLFVLMHYQDRFNNSGLQPNLKMADISHLLIPVPPENEQNKIQNKINALYT 558 Query: 388 RIDVLVEK----IEQSIVLLKERRSSFIA 412 I+ L+E + + L + I Sbjct: 559 MIETLLELTKSAQQTQLHLADALTDAAIN 587 >gi|268610643|ref|ZP_06144370.1| hypothetical protein RflaF_14247 [Ruminococcus flavefaciens FD-1] Length = 399 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 57/372 (15%), Positives = 117/372 (31%), Gaps = 21/372 (5%) Query: 55 DVESGTGKYLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQF 111 D +G + S + + +L K G + A + D ++ Sbjct: 41 DFVNGRINWDDCYHVSVDRFEQDKGIQLRENDLLVTKDGTVGKTAFVVDCPTQATLNSHI 100 Query: 112 LVLQPKD--VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 +++ KD V PE L L S + + I G T+ P + Q Sbjct: 101 FLVRSKDGSVEPEYLYYLLNSAVFSDFMRNILTGTTIKGLTQGNFYKFEFEAPDVPTQKK 160 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I + + ID +I + IE + +V I+T +N + +K +G Sbjct: 161 IVSVLES----IDDVIDKTRDLIEKYTSLMKGVVQDILTNDINDENTVK------IGSFA 210 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 D K K I + + + IV+ G Sbjct: 211 DALGGKRIPKGSELTIAKTAHPYIRVRDMTKPKVIELTDDYMYVEESDFHKISRYIVNAG 270 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +++ + + + + G +L + ++S K Sbjct: 271 DLIISIVGTVGAVALVGETLDKANLTENCSKIVNI-KGYSPEFLYYFLKSEYGQKEIAGG 329 Query: 350 GSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G ++ L +++ + V + + EQ I + + +ID V Q ++ ++ Sbjct: 330 TVGEVQAKLPLKNILEINVPILSMPEQEAIVEKLRILDEKIDKEV----QYYNKMESIKA 385 Query: 409 SFIAAAVTGQID 420 + ++G ID Sbjct: 386 GLMHDLLSGSID 397 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 16/160 (10%), Positives = 43/160 (26%), Gaps = 5/160 (3%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 I + ++ + + + +++ + Sbjct: 40 WDFVNGRINWDDCYHVSVDRFEQDKGIQLRENDLLVTKDGTVGKTAFVVDCPTQATLNSH 99 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + K ++ YL +L+ S + + L + + P + Q Sbjct: 100 IFLVRSKDGSVEPEYLYYLLNSAVFSDFMRNILTGTTIKGLTQGNFYKFEFEAPDVPTQK 159 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 I +V+ ID +++K I + +T Sbjct: 160 KIVSVLES----IDDVIDKTRDLIEKYTSLMKGVVQDILT 195 >gi|24636601|dbj|BAC22942.1| probable restriction modification system specificity subunit HsdS [Staphylococcus aureus] Length = 406 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 53/394 (13%), Positives = 124/394 (31%), Gaps = 16/394 (4%) Query: 24 HWKVVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ FTK+N G K L + + I Sbjct: 20 EWEEKQFADFTKINQGLQIAINERKTEYSPELYFYITNEFLRPNSQTKYFIENPPQSVIA 79 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K IL + G + + + L L S + +I ++ Sbjct: 80 NKEDILMTRTGNTGKVVTNVFGAFHNNFFKIKFDKNLYDRLFLVEVLNSSKIQNKILSLA 139 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +T+ + +I P L EQ I + +ID I + + +ELL+++K+ Sbjct: 140 GSSTIPDLNHSDFYSISSSYPLLREQQKIGDF----FSKIDRQIELQEQKLELLQQQKKG 195 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + I ++ L ++G ++ ++ ++ + ++ Sbjct: 196 YMQKIFSQEL---RFKDENGEDYPDWKEKKLGDITEQSMYGIGASATRFDSKNIYIRITD 252 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL-RSAQVMERGIITSAY 320 + + P+ + +I+F K + + + + Sbjct: 253 IDEKSRKLNYQNLTTPDELNNKYKLKRNDILFARTGASTGKSYIHKEEKDIYNYYFAGFL 312 Query: 321 MAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + ++ + S V + + E+ +LP+++P EQ I Sbjct: 313 IKFEIDEQNNPLFIYQFTLTSKFNKWVKVMSVRSGQPGINSEEYAKLPLVLPNKLEQQKI 372 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++ R D +E +Q I +L++++ + Sbjct: 373 AKFLD----RFDRQIELEKQKIEILQQQKKGLLQ 402 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 69/213 (32%), Gaps = 13/213 (6%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+++ + EW + + RK E + Sbjct: 10 PELRFPEFEGEWEEKQFADFTKINQGLQIAINERKTEYSPELYFYITNEFLRPNSQTKYF 69 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + P+S I + +I+ + V + + D + Sbjct: 70 IENPPQSV----IANKEDILMTRTGNTGKVVT----NVFGAFHNNFFKIKFDKNLYDRLF 121 Query: 333 LAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L ++ S + ++ GS L D + P ++EQ I + ++ID Sbjct: 122 LVEVLNSSKIQNKILSLAGSSTIPDLNHSDFYSISSSYPLLREQQKIGDF----FSKIDR 177 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +E EQ + LL++++ ++ + ++ + E Sbjct: 178 QIELQEQKLELLQQQKKGYMQKIFSQELRFKDE 210 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 66/190 (34%), Gaps = 11/190 (5%) Query: 24 HWKVVPIKRFTKLNT---GRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 WK + T+ + G ++ IYI + D++ + K ++ + + Sbjct: 217 DWKEKKLGDITEQSMYGIGASATRFDSKNIYIRITDIDEKSRKLNYQNLTTPDELNNKYK 276 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL------VLQPKDVLPELLQGWLLSIDV 133 + + IL+ + G K+ I + + + P + + L+ Sbjct: 277 L-KRNDILFARTGASTGKSYIHKEEKDIYNYYFAGFLIKFEIDEQNNPLFIYQFTLTSKF 335 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ + + + + +P+ +P EQ I + + +I+ + + Sbjct: 336 NKWVKVMSVRSGQPGINSEEYAKLPLVLPNKLEQQKIAKFLDRFDRQIELEKQKIEILQQ 395 Query: 194 LLKEKKQALV 203 K Q++ Sbjct: 396 QKKGLLQSMF 405 >gi|253682396|ref|ZP_04863193.1| type I restriction enzyme specificity protein [Clostridium botulinum D str. 1873] gi|253562108|gb|EES91560.1| type I restriction enzyme specificity protein [Clostridium botulinum D str. 1873] Length = 422 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 52/409 (12%), Positives = 132/409 (32%), Gaps = 29/409 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIG---LEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ + + TG + ++ +D + + V + + Sbjct: 20 WEQRKLGKMGDTFTGLSGKTKEDFGHGDAKFVTYVNVFGNVISDSNDVQSVEIDDKQNQV 79 Query: 82 AKGQILYGKLGPYLRKAIIADFD------GICSTQFLVLQPKDVLPELLQGWL-LSIDVT 134 G + + + ++ ++ +P ++ S ++ Sbjct: 80 KYGDVFFTTSSETPEEVGMSSVWLENTENVYLNSFCFGYRPTVEFDLYYLAFMLRSPEIR 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ + +G + + + ++ +P+P L EQ + IT R ++L Sbjct: 140 KKFMFLAQGISRYNISKNKVMDMNVPVPELNEQRKVGTFFRNLDNL----ITLHQRKLDL 195 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP--FFALVTELNRKNTKLI 252 LK K++++ + K +++ +G E + Sbjct: 196 LKVTKKSMLQKMFPKDGESVPEIRFAGFNDPWEQRKVIEQVEKVLDYRGKSPAKFGMSWG 255 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYE------TYQIVDPGEIVFRFIDLQNDKRSLR 306 S L LS N+ +++ K E + ++ G+++F + + Sbjct: 256 NSGYLVLSSLNVKNGYIDKSVEAKYGDQELFDRWMGNERLEKGDVIFTTEAPLGN-IAQV 314 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRL 365 + +D+ +LA L+ S A SG + + ++ ++ Sbjct: 315 PDNNGYILNQRAVAFKTSSDKLDNNFLATLLSSPLFQDKLQANSSGGTAKGIGMKEFAKI 374 Query: 366 PVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++P I EQ I +D L+ ++ + LLK+ + S + Sbjct: 375 ATMLPIDIAEQKKIGLF----FKDLDNLITLHQRELDLLKDLKKSMLQQ 419 >gi|303230842|ref|ZP_07317589.1| type I restriction modification DNA specificity domain protein [Veillonella atypica ACS-049-V-Sch6] gi|302514602|gb|EFL56597.1| type I restriction modification DNA specificity domain protein [Veillonella atypica ACS-049-V-Sch6] Length = 413 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 56/407 (13%), Positives = 124/407 (30%), Gaps = 26/407 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + W+ + TG + ++ +D YI +V T + T Sbjct: 14 EDWEQRKLGSIGSTYTGLSGKTKEDFGHGEAQYITYLNVFQNTISDITMTDKVEID--IT 71 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFD------GICSTQFLVLQPKDVLPELLQGWLLSI 131 + G +L+ + ++ ++ +P + G+ L Sbjct: 72 QNEVKYGDVLFTTSSETPEEVGMSSVWLGDTPNIYLNSFCFGFRPNQKIDPYFLGYSLRA 131 Query: 132 DVTQRIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + +G + + + + + +P EQ L+ + RID +IT Sbjct: 132 PYMRDKIKILAQGISRYNISKNKVMELEISLPNNEEQKLLGTFL----QRIDLIITLHQC 187 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 +E LK K+AL+ + K +++ G E FA K Sbjct: 188 KLEKLKLMKKALLQKLFPKNGKHIPEIRFKGFTDAWEQRKLGECMNSFAYGLNAAAKEYD 247 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIVDPGEIVFRFIDLQNDKRSLRS 307 + I + N+ + + ++ G+IVF K L + Sbjct: 248 GMHKYIRITDIDDETHNFIQSNLTSPDIDFNTDVSDYKLNVGDIVFARTGASVGKTYLYN 307 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLP 366 + D+ ++ + D + + ++ Sbjct: 308 PNDGDLYYAGFLIRGKVKDDYDAGFIYQNTLTKDYDAFIKITSQRSGQPGVNSKEYATFR 367 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +P EQ I+ V+N +D L ++ + L+E + + Sbjct: 368 LNIPCKDEQRKISKVLNS----LDELFTLHQRKLERLQEVKKDLLQK 410 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 66/194 (34%), Gaps = 13/194 (6%) Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 G P ++ K +W + ++Y N+ Q Sbjct: 1 MGNKPRIRFKGFTEDW----EQRKLGSIGSTYTGLSGKTKEDFGHGEAQYITYLNVFQNT 56 Query: 269 ETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITSAYMAVKP 325 + + T V G+++F ++ + S + + + S +P Sbjct: 57 ISDITMTDKVEIDITQNEVKYGDVLFTTSSETPEEVGMSSVWLGDTPNIYLNSFCFGFRP 116 Query: 326 HG-IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + ID +L + +R+ + + G+ R ++ V L + +P +EQ + + Sbjct: 117 NQKIDPYFLGYSLRAPYMRDKIKILAQGISRYNISKNKVMELEISLPNNEEQKLLGTFL- 175 Query: 384 VETARIDVLVEKIE 397 RID+++ + Sbjct: 176 ---QRIDLIITLHQ 186 >gi|295696353|ref|YP_003589591.1| restriction modification system DNA specificity domain protein [Bacillus tusciae DSM 2912] gi|295411955|gb|ADG06447.1| restriction modification system DNA specificity domain protein [Bacillus tusciae DSM 2912] Length = 411 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 70/422 (16%), Positives = 137/422 (32%), Gaps = 46/422 (10%) Query: 28 VPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 V + LNT ++ + + L + N + S Sbjct: 8 VKLGDIFNLNTETVCPRELPSQVFVHYSIPAFDESHRPVLERGWNIK----SNKYALKGD 63 Query: 85 QILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQGWLLS--IDVTQRI 137 +L KL P + + +CST+F+V Q +L + + Sbjct: 64 SLLVSKLNPRINRVWKFLSMSNPNPSVCSTEFMVYQTIRPDVDLDFYYHFFTSHLFQAAL 123 Query: 138 EAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + G T K NI +P PP EQ I + +D I R I+ Sbjct: 124 MTLQSGTTGSRMRVTPKETLNIRIPYPPFREQRKIAAIL----TSVDDAIAATQRIIDQT 179 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + K+ L+ ++T+G+ K K + I G +P W+V F +N + Sbjct: 180 ERVKRGLMQQLLTRGIG-HTKFKQTEI---GEIPAEWDVMSFRDACEIVNGQVDPKEAPY 235 Query: 256 ILSLSY-GNIIQKLETRNMGLKPESYET----YQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + N I G + + ++F I + K + Sbjct: 236 CDMIHIAPNHIVGFIGHLEGYTTAKEDCVTSGKYLFTEEHVLFSKIRPELGKVAYPGFS- 294 Query: 311 MERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPV 367 GI ++ + + + +L +++ S + G + D+ + Sbjct: 295 ---GICSADIYPIRARNGIMLPEFLKYVLMSDRFYRYSISVSGRTGIPKVNRHDLDCYQI 351 Query: 368 LVPPIKEQF---DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 VPPI EQ I + + + + L+ +S+ + +TG+I ++ + Sbjct: 352 AVPPIAEQEGMCKILRSVYSYWS------ANLAKKSSLMT-LKSALMQVLLTGKIRVKVD 404 Query: 425 SQ 426 + Sbjct: 405 EE 406 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 81/201 (40%), Gaps = 8/201 (3%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLP 65 ++K + IG IP W V+ + ++ G+ D+I+I + G Sbjct: 199 KFKQTE---IGEIPAEWDVMSFRDACEIVNGQVDPKEAPYCDMIHIAPNHIVGFIGHLEG 255 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPEL 123 TS +F + +L+ K+ P L K F GICS ++ ++ +LPE Sbjct: 256 YTTAKEDCVTSGKYLFTEEHVLFSKIRPELGKVAYPGFSGICSADIYPIRARNGIMLPEF 315 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L+ L+S + ++ + + + + +PP+AEQ + + + + Sbjct: 316 LKYVLMSDRFYRYSISVSGRTGIPKVNRHDLDCYQIAVPPIAEQEGMCKILRSVYSYWSA 375 Query: 184 LITERIRFIELLKEKKQALVS 204 + ++ + L Q L++ Sbjct: 376 NLAKKSSLMTLKSALMQVLLT 396 >gi|165975746|ref|YP_001651339.1| putative type I restriction enzyme specificity protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165875847|gb|ABY68895.1| putative type I restriction enzyme specificity protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 389 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 54/417 (12%), Positives = 132/417 (31%), Gaps = 53/417 (12%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 KD V+W + L GR +E G + + Sbjct: 8 KDCEVEW----------KSLGEVATLQRGRVISK---------TYLEENKGDFPVYSSQT 48 Query: 71 RQSDTSTV--SIFAKGQIL-YGKLGPYLRKAIIADFDGICST--QFLVLQPKDVLPELLQ 125 + + + G+ + + G + +V++ +D+L Sbjct: 49 QNNGEIGKINTYDFDGEFVNWTTDGANAGTVFYRKGKFSITNVSGLIVIKNQDLLNYKFL 108 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + L I+ + + + G + I +PIP L Q I + + T L Sbjct: 109 YYWLLIEAKKHVYS---GMGNPKLMSHQMEKIRIPIPSLEIQEKIVKILDIFTELEAALE 165 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + L ++ + ++T G + + K +G V + Sbjct: 166 ATLEAELSLRVKQYDYYRNDLLTFGDDVEWK-------TLGEVGELIRGNGLQ------- 211 Query: 246 RKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 E+ + ++ YG I + + + + + G+++ Sbjct: 212 --KKDFTETGVPAIHYGQIYTYFGTFADKTKTFVSADLAKKLKKAQFGDVLIAGTSENLQ 269 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKF 359 + + + A +P+ I++ +L +L+++ D K G + +K Sbjct: 270 DVMKPLGWLGGEIVFSGDMFAFRPNQEINTKFLTYLLQTEDFQKYKERYAQGTKVIRMKS 329 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ + + +PP+ Q I +++ + + E + + I L ++ R + Sbjct: 330 DNFLKYQIPIPPLATQQKIVEILDKFDRLTNSISEGLPKEIELRRKQYEYYREQLLN 386 >gi|126665438|ref|ZP_01736420.1| restriction modification system DNA specificity domain [Marinobacter sp. ELB17] gi|126630066|gb|EBA00682.1| restriction modification system DNA specificity domain [Marinobacter sp. ELB17] Length = 456 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 59/456 (12%), Positives = 143/456 (31%), Gaps = 59/456 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W V + G+ + K+ Y+G ++V G+ + + Sbjct: 3 SEWPKVRLGDHVDSCLGKMLDKAKNRGELYPYLGNKNVRWGSFDLDDLAEMRFEKNEHDR 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G ++ + G R AI D ++P L + + Sbjct: 63 YGLRSGDLIVCEGGEPGRCAIWKDHIPGMKIQKALHRIRPLQGLNNYYLHYWFTEAYRTG 122 Query: 137 IEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I A G T+ H + I + +P+PPLA Q I + + +ID + Sbjct: 123 ILALYFTGTTIQHLTGRAISQLEIPLPPLAIQKHIASVLSSLDAKIDLNHQMNTTLETMA 182 Query: 196 KEKKQA-------LVSYIVTKG---------------------------LNPDVKMKDSG 221 + ++ ++ + G + + Sbjct: 183 QALFKSWFVDFDPVIDNALAAGNPIPEPFHARAEARKALGDQRRPLPAAIQQQFPDRFVL 242 Query: 222 IEWVGLVPDHWEVKPFFALVTELNR-----KNTKLIESNILSLSYGNIIQKLETRNMGLK 276 E +G VP+ WE+ V + KN + + + + +L++ + Sbjct: 243 TEEMGWVPEGWEISTVGEQVEIMGGGTPSTKNPIFWDDGVHAFCTPKDMSRLDSIVVTRT 302 Query: 277 --PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + Q + G++ + + + A + +A+ P+ Sbjct: 303 ERYLTDAGVQKITSGQLPAGVVLMSSRAPIGYLAISNIPVSVNQGIIAMLPNDSYGAMYL 362 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +++ ++ + ++ + +P LVP + V+N + + Sbjct: 363 LSWAYFNMWQITDRANGSTFMEISKKNFRPIPFLVPNL-------GVLNAFNQQAKAVYS 415 Query: 395 K---IEQSIVLLKERRSSFIAAAVTGQIDLRG-ESQ 426 K + ++I + + R + + ++G++ + E+Q Sbjct: 416 KVLSVSENIEEVTKLRDTLLPKLLSGELRVPDAEAQ 451 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 21/190 (11%), Positives = 54/190 (28%), Gaps = 6/190 (3%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 G EW + ++ + + ++ +G+ L+ ++ Sbjct: 2 GSEWPKVRLGDHVDSCLGKMLDKAKNRGELYPYLGNKNVRWGSF--DLDDLAEMRFEKNE 59 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + G+++ + + + + W +Y Sbjct: 60 HDRYGLRSGDLIVCEGGEPGRCAIWKDHIPGMKIQKALHRIRPLQGLNNYYLHYWFTEAY 119 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + Q L + +L + +PP+ Q I +V++ A+ID Q Sbjct: 120 RTGILALYFTGTTIQHLTGRAISQLEIPLPPLAIQKHIASVLSSLDAKID----LNHQMN 175 Query: 401 VLLKERRSSF 410 L+ + Sbjct: 176 TTLETMAQAL 185 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 45/156 (28%), Gaps = 12/156 (7%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIY-------IGLEDVESGTGKY---LPKDG 68 G +P+ W++ + ++ G T + I + +D+ + Sbjct: 247 GWVPEGWEISTVGEQVEIMGGGTPSTKNPIFWDDGVHAFCTPKDMSRLDSIVVTRTERYL 306 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 T G +L P I++ + + + P D + Sbjct: 307 TDAGVQKITSGQLPAGVVLMSSRAPI-GYLAISNIPVSVNQGIIAMLPNDSY-GAMYLLS 364 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 + +I G+T K IP +P L Sbjct: 365 WAYFNMWQITDRANGSTFMEISKKNFRPIPFLVPNL 400 >gi|237653838|ref|YP_002890152.1| restriction modification system DNA specificity domain protein [Thauera sp. MZ1T] gi|237625085|gb|ACR01775.1| restriction modification system DNA specificity domain protein [Thauera sp. MZ1T] Length = 390 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 61/403 (15%), Positives = 133/403 (33%), Gaps = 28/403 (6%) Query: 28 VPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 V + +N + + + ++ + + + + + + + + + F G Sbjct: 3 VNLGDVASINPRLSDPLQQTELVSFVPMASLSAEEARVVSTETRAYSEVSKGYTPFRNGD 62 Query: 86 ILYGKLGPYLRK-----AIIADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIE 138 +L K+ P A + +G ST+F V++PK+ L L D+ E Sbjct: 63 VLVAKITPCFENGKIAQAHLPHPNGFGSTEFHVIRPKESLLDGRYLHHLLRQADIRVEGE 122 Query: 139 AICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G+ + ++ +P+P L EQ + + EL + Sbjct: 123 RRMTGSGGQRRVPATFLSSLRIPLPRLEEQRRVAAILDQADALRAKRRKALALLDELQRG 182 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + +P K +G + + P + I Sbjct: 183 I-------FIEMFGDPVTSPKGCTAGTLGDGIEEMQYGP---RFHNEAYSPEGIRIVRIT 232 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 L + M + E+ + + + G++VF K +L + I Sbjct: 233 DLDAAGSLDFDSMPRMEVDEETRDKFA-LRAGDVVFARTGATVGKVALIK-ERDPVCIAG 290 Query: 318 SAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQ 375 + ++ ++ I Y +++S + + +A +Q+ ++RLP+ VP I+ Q Sbjct: 291 AYFIRMRFQSRILPEYAFSVLQSESVQSLIFAQSRQAAQQNFSGPGLRRLPMPVPSIERQ 350 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + K ++ LL E SS A G+ Sbjct: 351 RRFAERVEAVGSE----KSKQLSALALLDELFSSLQHRAFRGE 389 >gi|269103360|ref|ZP_06156057.1| type I restriction-modification system specificity subunit S [Photobacterium damselae subsp. damselae CIP 102761] gi|268163258|gb|EEZ41754.1| type I restriction-modification system specificity subunit S [Photobacterium damselae subsp. damselae CIP 102761] Length = 418 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 59/426 (13%), Positives = 140/426 (32%), Gaps = 36/426 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 ++ V + + + G++ +S G G + + TS + Sbjct: 2 SDFEWVQLGKIAAITMGQSPDSETYTDDDRYIPFLQGCGDFTGSYPETGVFCTSPGKVAK 61 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 +G +L P +AD D + K + L + + Sbjct: 62 EGSLLVSVRAPV-GTTNVADKDYCIGRGLAAV--KSNIVSALYLREAFTVSASFLHRRAQ 118 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+T ++ P+ + + EK+ +++ + I+ KQ + Sbjct: 119 GSTFDAI---CAKDLSEMKIPMPKNRRVGEKVTDIIQCLNSELDATQALIDKYTAIKQGM 175 Query: 203 VSYIVTKGLNPDVKMKDSGIEW---------VGLVPDHWEVKPFFALVTELNRK------ 247 ++ + ++G++P+ K E +G++P W+V L+ E+ Sbjct: 176 MADLFSRGIDPETKTLRPTFEEAPELYYKTPLGMLPKGWKVIELENLLDEVTSPMRSGPF 235 Query: 248 -----NTKLIESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 +L+ I L N + R + + + V ++V + Sbjct: 236 GSALLKEELVSEGIPLLGIDNIFVERFKASYKRFVTERKFRELSRYAVRERDVVITIMGT 295 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYA-MGSGLRQS 356 + + + M I + W + S F G+ + Sbjct: 296 VGRSCVIPESIGLALSSKHLWTMTFDKEQILPELVCWQLNHSPWAESWFRRESQGGVMDA 355 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ + +K+L ++VP EQ I ++ +E + S+ LK +++ + +T Sbjct: 356 IQSQTLKKLKLVVPSPVEQNAIYER----YENLNNHIEVNQTSLDKLKLQKTGLMQDLLT 411 Query: 417 GQIDLR 422 G++ + Sbjct: 412 GKVPVP 417 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 62/209 (29%), Gaps = 18/209 (8%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES------------GKDIIYIGLEDVESGTGKYLP 65 +G +PK WKV+ ++ T + I +G++++ K Sbjct: 207 LGMLPKGWKVIELENLLDEVTSPMRSGPFGSALLKEELVSEGIPLLGIDNIFVERFKASY 266 Query: 66 KDG-NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPE 122 K R+ + + ++ +G R +I + G+ S + Sbjct: 267 KRFVTERKFRELSRYAVRERDVVITIMGTVGRSCVIPESIGLALSSKHLWTMTFDKEQIL 326 Query: 123 L---LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 S +G M + + + + +P EQ I E+ Sbjct: 327 PELVCWQLNHSPWAESWFRRESQGGVMDAIQSQTLKKLKLVVPSPVEQNAIYERYENLNN 386 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVT 208 I+ T + Q L++ V Sbjct: 387 HIEVNQTSLDKLKLQKTGLMQDLLTGKVP 415 >gi|261403055|ref|YP_003247279.1| restriction modification system DNA specificity domain protein [Methanocaldococcus vulcanius M7] gi|261370048|gb|ACX72797.1| restriction modification system DNA specificity domain protein [Methanocaldococcus vulcanius M7] Length = 436 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 62/441 (14%), Positives = 147/441 (33%), Gaps = 28/441 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDV 56 M ++ ++K++ IG IPK W V + + + + + + I + +++ Sbjct: 1 MVKFRWETEFKETE---IGKIPKDWNVKRLGDLCVITSSKRIYLREYTSEGIPFYRAKEI 57 Query: 57 ES-GTGKYLPKDGNSRQSDTS----TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF 111 S G+ + + +G +L +G ++ D Sbjct: 58 ISLSQGEQVKNCLYISNEKYEEIKAKYGVPKEGDLLLTAIGTIGYVYMVKQNDKFYFKDG 117 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 VL KD + + V + + I G++ K + + +P PP EQ I Sbjct: 118 NVLWLKDFKNLYQKYLYFLLPVILKHQEIYIGSSQKALTIKDLKEVEIPYPPPEEQQKIA 177 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + I+ + ++ E + + + + + + + Sbjct: 178 TVLSYFDDLIENKKKQNETLEKIALELFKNWFID-FEPFKDEEFVYNEELDKEIPKGWEV 236 Query: 232 WEVKPFFALVTELNRKNTKL----IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 + L+ ++ K++++ E+ ++L+ +T + K + Q + Sbjct: 237 KRLGEIAELIKGVSYKSSEISKEPEENIFITLNNFLRGGGFKTEYIYYKGTKAKETQKIK 296 Query: 288 PGEIVFRFIDLQ------NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 G+++ D+ + E GII+ + Y +L Y Sbjct: 297 EGDLIIALTDMTAEAKVVGAPAIVILPNNCEFGIISLDCAKIDLKDEFLKYYLYLYLKYS 356 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLP-VLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + LK E K +L+PP I + + + ++ I Sbjct: 357 QEENSTFANGVNVLHLKVELFKNSKFILIPP----QPILQKFHSLVQPLFEKIINNQKQI 412 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 ++LK+ R + + V G++ + Sbjct: 413 MVLKKIRDALLPKLVFGELRV 433 >gi|294616003|ref|ZP_06695829.1| type I restriction-modification system specificity subunit [Enterococcus faecium E1636] gi|291591137|gb|EFF22820.1| type I restriction-modification system specificity subunit [Enterococcus faecium E1636] Length = 380 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 61/398 (15%), Positives = 131/398 (32%), Gaps = 43/398 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + T+ G+ D+ +GKY P G + D IF Sbjct: 16 EDWEERKLGELTESFDGKRVPIDSDLRI---------SGKY-PYYGATGIIDYVDDYIFN 65 Query: 83 KGQILYGKLGPYL-----RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 +L + G + A + + +++ ++ +L+ + Sbjct: 66 GEYVLLAEDGANIIMRNYPVAYLTQGKFWLNNHAHIMRMRNGSN----YFLVQVLEKIDY 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G + K + NI + IP + EQ I ++D +I R ++LLKE Sbjct: 122 KKYNTGTAQPKLNSKIVKNIELKIPHIEEQQQIGNF----FKQLDDIIALHQRKLDLLKE 177 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K+ + + K +++ SG + WE + +V ++ Sbjct: 178 TKKGFLQKMFPKNGAKVPEIRFSG------FTEDWEQRKLGEIVQITMGQSPNSENYTEN 231 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 Y + + +N + P + T + G+++ D V+ RG+ Sbjct: 232 PEDYILVQGNADMKNNRVVPRVWTTQITKQAEKGDLILSVRAPVGDIGKTDYDVVLGRGV 291 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + L + +S+ D++ + +P +EQ Sbjct: 292 AA--------IKGNDFIFQQLGEMKESGYWNRFSTGSTFESINSNDIREALITIPTGEEQ 343 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I ++D + ++ + LLKE + F+ Sbjct: 344 QKIGAF----FKQLDDTIALHQRKLDLLKETKKGFLQK 377 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 56/185 (30%), Gaps = 9/185 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + ++ G++ S + G R T Sbjct: 204 EDWEQRKLGEIVQITMGQSPNSENYTENPEDYILVQGNADMKNNRVVPRVWTTQITKQAE 263 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 KG ++ P D+D + ++ D + L + + Sbjct: 264 KGDLILSVRAPV-GDIGKTDYDVVLGRGVAAIKGNDFI----FQQLGEMKESGYWNRFST 318 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+T + I + IP EQ I ++D I R ++LLKE K+ Sbjct: 319 GSTFESINSNDIREALITIPTGEEQQKIGAF----FKQLDDTIALHQRKLDLLKETKKGF 374 Query: 203 VSYIV 207 + + Sbjct: 375 LQKMF 379 >gi|89899861|ref|YP_522332.1| restriction modification system DNA specificity subunit [Rhodoferax ferrireducens T118] gi|89344598|gb|ABD68801.1| restriction modification system DNA specificity domain [Rhodoferax ferrireducens T118] Length = 397 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 59/408 (14%), Positives = 120/408 (29%), Gaps = 33/408 (8%) Query: 27 VVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 V + F +G T G I +I ++ + S++ Sbjct: 6 TVTLSEFCATGSGGTPSRAQMERYYEGGTIPWIKSGELRETVINGAEEHVTDVALKESSI 65 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + G IL G + + I + + + P + + + Sbjct: 66 KLVPAGAILLAMYGATVGRLGILGIEATTNQAVCHIIPDPRIAVTRYVYHALSSQVPSLI 125 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ G + + I N+ +P+P EQ I + + L Sbjct: 126 SMGVGGAQPNINQGIIKNLAIPLPAKPEQRRIAAILDQADALRAKRREALAQLDSL---- 181 Query: 199 KQALVSYIVTKGLNPDVKMKD-SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q++ + +P K +G V + L K T+ I + Sbjct: 182 TQSIFIQMFG---DPVSNPKGWPDATTLGQV---ANIASGVTKGRNLTGKVTRTIPYLAV 235 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + + + + + E Y + ++ D R + I Sbjct: 236 ANVQDKSLNLSAVKEIDATEDEIERYLLKWNDLLLTEGGDPDKLGRGTLWKNELPECIHQ 295 Query: 318 SAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIK 373 + V+ + +L WL+ S K F + S+ ++ P+L+PP++ Sbjct: 296 NHIFRVRVTSQAVTPLFLNWLVGSQRGKKYFLRSAKQTTGIASINMTQLRSFPLLLPPVE 355 Query: 374 EQFD---ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q D I V+ + A S+ L+ S A G+ Sbjct: 356 LQRDFETIAEVVAEQHA-------IHSVSLAELEALFVSLQHRAFRGE 396 >gi|303249550|ref|ZP_07335757.1| putative type I restriction enzyme specificity protein [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307251822|ref|ZP_07533724.1| type I restriction enzyme specificity protein [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302651624|gb|EFL81773.1| putative type I restriction enzyme specificity protein [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306860729|gb|EFM92740.1| type I restriction enzyme specificity protein [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 389 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 55/417 (13%), Positives = 133/417 (31%), Gaps = 53/417 (12%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 KD V+W + L GR +E G + + Sbjct: 8 KDCEVEW----------KSLGEVATLQRGRVISK---------TYLEENKGDFPVYSSQT 48 Query: 71 RQSDTSTV--SIFAKGQIL-YGKLGPYLRKAIIADFDGICST--QFLVLQPKDVLPELLQ 125 + + + G+ + + G + +V++ +D+L Sbjct: 49 QNNGEIGKINTYDFDGEFVNWTTDGANAGTVFYRKGKFSITNVSGLIVIKNQDLLNYKFL 108 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + L I+ + + + G + I +PIP L Q I + + T TL Sbjct: 109 YYWLLIEAKKHVYS---GMGNPKLMSHQMEKIRIPIPSLEIQEKIVKILDIFTELEATLE 165 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + L ++ + ++T G + + K +G V + Sbjct: 166 ATLEAELSLRVKQYDYYRNDLLTFGDDVEWK-------TLGEVGELIRGNGLQ------- 211 Query: 246 RKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 E+ + ++ YG I + + + + + G+++ Sbjct: 212 --KKDFTETGVPAIHYGQIYTYFGTFADKTKTFVSADLAKKLKKAQFGDVLIAGTSENLQ 269 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKF 359 + + + A +P+ I++ +L +L+++ D K G + +K Sbjct: 270 DVMKPLGWLGGEIVFSGDMFAFRPNQEINTKFLTYLLQTEDFQKYKERYAQGTKVIRMKS 329 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ + + +PP+ Q I +++ + + E + + I L ++ R + Sbjct: 330 DNFLKYQIPIPPLATQQKIVEILDKFDRLTNSISEGLPKEIELRRKQYEYYREQLLN 386 >gi|110597491|ref|ZP_01385778.1| Restriction modification system DNA specificity domain [Chlorobium ferrooxidans DSM 13031] gi|110341035|gb|EAT59506.1| Restriction modification system DNA specificity domain [Chlorobium ferrooxidans DSM 13031] Length = 403 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 64/434 (14%), Positives = 132/434 (30%), Gaps = 56/434 (12%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRF-------TKLNTGRTSESGK--DIIYIGLEDVES 58 P YK + V G IP+ W+V+ + +N+ T + G + + V Sbjct: 5 PGYKQTEV---GVIPEDWEVIRLDSLISALDAGVSVNSVETEKVGYAHEGSILKTSCVYG 61 Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLV 113 G + K IL ++ + + + Sbjct: 62 GKFDSEEHKKIHPRDIRRAKLNPRKNSILISRMNTPALVGECGFIDRDYPNLFLPDRLWM 121 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGAT-----MSHADWKGIGNIPMPIPPLAEQV 168 + + P + + + AI E AT M + + + +P+P EQ Sbjct: 122 TRHEGKRPTCILWFSYLLSFGSFNRAIKESATGTSGSMKNISKGSLFVLQVPLPNKIEQE 181 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I E + I + I ++ KQ ++ ++T + + +G + Sbjct: 182 AIAEALSDADA----FIESLEQLIFKKRQIKQGVMQELLTGKKRLPGFSGEWMVTSLGEI 237 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 + + + K L N G+ P Y Sbjct: 238 TIATKGSQLHGSESTKDGKYPHL--------------------NGGIAPSGYAEKSNTPA 277 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 I ++ ++ ID+ +L ++ + Sbjct: 278 NTIAISEGGNS----CGYVQLMIVPYWCGGHCYSLISKCIDNGFLYQALKVQQTAIMGLR 333 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +GSGL +++ + + P EQ I V++ ID L K+E++ + + Sbjct: 334 VGSGL-PNVQKSALLSFKLEYPSDDSEQTAIAEVLSEMDDEIDALTIKLEKA----RLLK 388 Query: 408 SSFIAAAVTGQIDL 421 + + +TG+I L Sbjct: 389 QAMMHNLLTGKIRL 402 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 54/175 (30%), Gaps = 12/175 (6%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S YG E + + + I+ ++ + Sbjct: 57 SCVYGGKFDSEEHKKIHPRDI-RRAKLNPRKNSILISRMNTPALVGECGFIDRDYPNLFL 115 Query: 318 SAYMAVKPH----GIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVP 370 + + H + ++L+ + +G +++ + L V +P Sbjct: 116 PDRLWMTRHEGKRPTCILWFSYLLSFGSFNRAIKESATGTSGSMKNISKGSLFVLQVPLP 175 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 EQ I ++ A I+ L + I + ++ + + +TG+ L G S Sbjct: 176 NKIEQEAIAEALSDADAFIESLEQLIFKK----RQIKQGVMQELLTGKKRLPGFS 226 >gi|284048512|ref|YP_003398851.1| restriction modification system DNA specificity domain protein [Acidaminococcus fermentans DSM 20731] gi|283952733|gb|ADB47536.1| restriction modification system DNA specificity domain protein [Acidaminococcus fermentans DSM 20731] Length = 384 Score = 108 bits (270), Expect = 2e-21, Method: Composition-based stats. Identities = 69/396 (17%), Positives = 133/396 (33%), Gaps = 37/396 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + G G DG + + Sbjct: 17 DWEQRKVSDIVGRYDNLRVPVSSNKRVHGTTPYYGANGVQDYVDGYTHDGEY-------- 68 Query: 84 GQILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 IL + G + + VLQ K + + I ++ Sbjct: 69 --ILIAEDGANDLQNYPVHYVNGRIWVNNHAHVLQGKTGIADTKFLSYAFS--QIDISSL 124 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G + + + + + +P EQ EK+ ID+LIT R E+LK+ K+ Sbjct: 125 LVGGGRAKLNAGVLMKLDLLLPEHKEQ----EKLGNYFSHIDSLITLHQRKYEMLKKIKK 180 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + + + K +++ SG D WE + A+ E + K + + + Sbjct: 181 SFLEKMFPKNGKRVPELRFSG------FTDDWEQRKLGAIFEEYSDKGHPNLSALTIIQG 234 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G I + RN+ +S Y+ V+ G+ + + + GII+ AY Sbjct: 235 GGTIRRDDSDRNLQYDKKSLANYKKVETGDFIVHLRSFEG-----GLEKATTSGIISPAY 289 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFD 377 G DS + RS G+R +S+ E +K + + ++EQ Sbjct: 290 HTFHGEGTDSRFYYCYFRSERFINHDLKPHVYGIRDGRSIDIEGMKTINIPWTKVEEQKA 349 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I N I+ +D L+ ++ + L+ + + + Sbjct: 350 IGNYIDC----LDNLITLHQRKLEKLQNLKKALLKK 381 >gi|218709370|ref|YP_002416991.1| type I restriction enzyme EcoKI S subunit [Vibrio splendidus LGP32] gi|218322389|emb|CAV18542.1| Type I restriction enzyme EcoKI, S subunit [Vibrio splendidus LGP32] Length = 522 Score = 108 bits (270), Expect = 2e-21, Method: Composition-based stats. Identities = 67/480 (13%), Positives = 154/480 (32%), Gaps = 87/480 (18%) Query: 20 AIPKHWKVVPIKRFT-----KLNTGRTSES-------GKDIIYIGLEDVE-----SGTGK 62 +PK W ++ G + + + +++V+ + K Sbjct: 3 ELPKGWIACTPSDLANDPKNEIVDGPFGSNLKASEYTDEGTPIVRIQNVKRMAFLNKNIK 62 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDV 119 Y+ +++ F G +L KLG L IA GI + L+P Sbjct: 63 YV----TDEKAEFLKRHSFKSGDLLLTKLGEPLGLTCIAPEYLNEGIIVADIVRLRPNPE 118 Query: 120 LPELLQGWLLSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + +LL+ + V ++I A +G+T + + + N+ + +PPLAEQ I EKI Sbjct: 119 VNRKCLAYLLNSEGVIKQINAHTKGSTRARINLSVVRNLNINLPPLAEQKRIVEKIDEVL 178 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 ++DT+ +LLK +Q++++ V+ L + + + + + E + F Sbjct: 179 AQVDTIKARLDGIPDLLKRFRQSVLTSAVSGKLTEEWREEQDAYPTLNELKATIEQERFE 238 Query: 239 ALV-----TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 ++++ + GN + + E + ++ + V Sbjct: 239 IWCSAELNKKISKGKPPANDKWKEKYQPGNPKHNDSNKRTAV--EEIKAPWLLTSLDAVS 296 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL-------------------- 333 + + + ++ A + + + + + Sbjct: 297 ILTTGKTPSTAKDEYWNGDTMFVSPAQIHPEGYLHNPSRYVSKAGCQIVPLISKGSTLIV 356 Query: 334 ---------------------------------AWLMRSYDLCKVFYAMGSGLRQS--LK 358 L + L Sbjct: 357 CIGTVGKVGLLTEDVVINQQINAITPLPSVTHKYMYYWCKTLYPWIIDTARATVNAAILN 416 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + P +PP++EQ +I +++ A D + +++++ + S +A A G+ Sbjct: 417 KSTMSTAPFALPPLEEQKEIVRLVDQYFAFADTIEAQVKKAQARVDNLTQSILAKAFRGE 476 Score = 53.3 bits (126), Expect = 9e-05, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 67/206 (32%), Gaps = 9/206 (4%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 K + V+ I W + + + L TG+T + KD + G S + ++ Sbjct: 276 KRTAVE---EIKAPWLLTSLDAVSILTTGKTPSTAKDEYWNGDTMFVSPAQIHPEGYLHN 332 Query: 71 RQSDTSTVS-----IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 S + +KG L +G K + D + + Q + P + Sbjct: 333 PSRYVSKAGCQIVPLISKGSTLIVCIGTV-GKVGLLTEDVVINQQINAITPLPSVTHKYM 391 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + I+ + + + P +PPL EQ I + DT+ Sbjct: 392 YYWCKTLYPWIIDTARATVNAAILNKSTMSTAPFALPPLEEQKEIVRLVDQYFAFADTIE 451 Query: 186 TERIRFIELLKEKKQALVSYIVTKGL 211 + + + Q++++ L Sbjct: 452 AQVKKAQARVDNLTQSILAKAFRGEL 477 >gi|238924765|ref|YP_002938281.1| type I restriction enzyme EcoEI specificity protein [Eubacterium rectale ATCC 33656] gi|238876440|gb|ACR76147.1| type I restriction enzyme EcoEI specificity protein [Eubacterium rectale ATCC 33656] Length = 371 Score = 108 bits (270), Expect = 2e-21, Method: Composition-based stats. Identities = 45/400 (11%), Positives = 118/400 (29%), Gaps = 43/400 (10%) Query: 26 KVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 ++ + G + + + I ++D+ D Sbjct: 4 DIIKLGDVATYINGYAFKPEDRGEEGLQIIRIQDLTGN-----SYDLGFYNGKYPKKIEI 58 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G +L L + + + ++ V + + Sbjct: 59 NDGDVLISWS-ASLGVYVWNGGKALLNQHIFKVKFDKVDIDKSYFVYAVRYKLNDMGKKT 117 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GATM H + +P PPL +Q+ I + + I E EL Sbjct: 118 HGATMKHIVKRDFDATEIPYPPLKKQIEIAINLDKVLMVIKERKRELKLLDEL------- 170 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 +K +E G + + ++T+ ++ K I + Sbjct: 171 ---------------IKARFVEMFGDCTNMISLSDLCLIITDGTHQSPKFQHEGIPFILV 215 Query: 262 GNIIQKLETRNMGL-----KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 N+ + T + + ++ G+I+ + + + Sbjct: 216 SNLSKNTVTYDTDKFISAETYKELYKRTPIEIGDILLSTVGSYGHPAVVVEDRKFLFQRH 275 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQ 375 AY+ K ++S Y+ + S + G +++L +++++ + VP + Q Sbjct: 276 -IAYLKPKSDILNSYYMHGALLSPGCQRQIEEKVKGIAQKTLNLSEIRKIRIPVPSLDLQ 334 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + ++ +++ +++++ + S + Sbjct: 335 KQYADFVH----QVNKSKVAVQKALDETQILFDSLMQKYF 370 >gi|116627581|ref|YP_820200.1| restriction endonuclease S subunit [Streptococcus thermophilus LMD-9] gi|116100858|gb|ABJ66004.1| Restriction endonuclease S subunit [Streptococcus thermophilus LMD-9] Length = 387 Score = 108 bits (270), Expect = 2e-21, Method: Composition-based stats. Identities = 53/395 (13%), Positives = 123/395 (31%), Gaps = 32/395 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + WK + + ++N+G+ + +E G G + I Sbjct: 18 ESWKRLKYEDVIEVNSGKDYK-----------HLEKGDIPVYGTGGYMLSVSEA---ITN 63 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K + G+ G + I+ T F + + ++ +I + E E Sbjct: 64 KDGVGIGRKGTINKPYILKAPYWTVDTLFFCIPK----NKYNLYFINAIFERTQWERFDE 119 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + I NI P EQ I + + + + K + Sbjct: 120 STGVPSLSKLTINNIQNYFPSFDEQSAIGSLFRTLDDLLASY----KDNLANYQSLKATM 175 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 +S + K +++ G E + + T K+ + + I + S Sbjct: 176 LSKMFPKAGQTVPEIRLDGFEGEWKLYELKSRAETITKGTTPKDKSWQGEVNYIKTESIN 235 Query: 263 NIIQKLETRNMGLKPES--YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 L E Y I+ +++F + + + Sbjct: 236 RDTGSLVRTASTSLDEHLGYLKRSILKEDDVLFSIVGTLGVVGIVDKKDLPAN--TNQQI 293 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPP-IKEQFDI 378 ++ D+ ++ ++S + + + G + SL ++++ V +PP ++EQ I Sbjct: 294 AIIRLKRDDAIFMLNFLKSPRIKSFIKSDSTIGAQPSLSLWQIEKIKVSLPPSLEEQQAI 353 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ I L+ + + Sbjct: 354 GTY----FSNLDNLINSHQEKISQLETLKKKLLQD 384 >gi|238019006|ref|ZP_04599432.1| hypothetical protein VEIDISOL_00868 [Veillonella dispar ATCC 17748] gi|237864490|gb|EEP65780.1| hypothetical protein VEIDISOL_00868 [Veillonella dispar ATCC 17748] Length = 407 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 63/404 (15%), Positives = 135/404 (33%), Gaps = 26/404 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTV 78 + W+ + L G GK +I L+++ S + Sbjct: 14 EDWEQCKLGNLGTLKNGMNFSKEAMGKGYPFINLQNIFGSNVIDLTKLEKAEATDSQLKD 73 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG------ICSTQFLVLQPKDVLPELLQGWLLSI- 131 KG +L+ + L A S + + L + ++ Sbjct: 74 YNLQKGDVLFVRSSVKLEGVGEAALISEDLKDTTFSGFIIRFRDNYGLDYNFKRFIFITV 133 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +I + + + + N+ + IP EQ KI ++D IT R Sbjct: 134 LIRNQIMSQATNSANKNISQSVLNNLYLFIPTKDEQ----SKIGLIFSKLDKCITLHQRK 189 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +E LK K+AL+ + K + +++ G + + L N++N Sbjct: 190 LEKLKLAKKALLQKLFPKNGSQFPEIRFKG---FTDAWEQCKFSDITYLSGIKNKENKPY 246 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +I + + +K Y IV + + + S+ Sbjct: 247 ESYSISNEFGFIPQDEQFENGGTMKTADKSMYYIVSQNSFAYNP--ARINVGSIGYYDKP 304 Query: 312 ERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLV 369 + I++S Y K I S W +S ++ G+R ++ + + + + Sbjct: 305 DNVIVSSLYEVFKTTDIVSDKFLWHWFKSNQFNRLIEKYQEGGVRLYFYYDKLCKGTIEL 364 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I EQ I+N+++ +D+ + ++ + L+E + + Sbjct: 365 PTINEQNKISNLLDD----LDMYITLHQRKLDKLQEVKKGLLQK 404 >gi|257088128|ref|ZP_05582489.1| predicted protein [Enterococcus faecalis D6] gi|256996158|gb|EEU83460.1| predicted protein [Enterococcus faecalis D6] Length = 395 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 61/394 (15%), Positives = 137/394 (34%), Gaps = 24/394 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + K++TG+ + K VE+G P S + S ++ Sbjct: 18 EEWEQCKAEELCKISTGKGNTQDK---------VENGK---YPFYVRSENIERSNYFLYD 65 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + +L G + + + + + + S++ +R+ ++ Sbjct: 66 QEAVLTVGDGVGTGRVFHYVSGKYNLHQRVYRMYDFNKQISAKYFYYYFSLNFHRRVRSL 125 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 ++ I ++ + P EQ+ I + + IT R +E LKE K+ Sbjct: 126 TAKTSVDSVRLNMIADMEIKYPSELEQLKIFSFLDY----LIKSITLHQRKLEQLKELKK 181 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A + + K +++ + E + N+ +I LS Sbjct: 182 AYLQLMFPKKDETLPRVRFADFEGEWEQCKLKNLFLKGGSGGTPTSSNSDYYNGDIPFLS 241 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 +I + K S E + + I L + A + + A+ Sbjct: 242 ISDITKSNGYIYTTEKCISLEGLKNSSAWIVPKESISLAMYASVGKVAILKLDIATSQAF 301 Query: 321 MAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + I+ + +L++ + + +G + +L + VK VL+P EQ I Sbjct: 302 YNMIFEDINTRNYIYHYLIKKEVFNEWITLISTGTQANLNADKVKNTFVLIPSNNEQKKI 361 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++ I+V ++ ++ I +LK + S++ Sbjct: 362 AELLRC----IEVSIDIQQKKIHILKSLKKSYLQ 391 >gi|325917799|ref|ZP_08179981.1| restriction endonuclease S subunit [Xanthomonas vesicatoria ATCC 35937] gi|325535973|gb|EGD07787.1| restriction endonuclease S subunit [Xanthomonas vesicatoria ATCC 35937] Length = 756 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 87/479 (18%), Positives = 146/479 (30%), Gaps = 87/479 (18%) Query: 20 AIPKHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P W+ + T T + E +D + LED+E T K L K ++ S Sbjct: 82 ELPVTWEWARLGEITNFGITVKKEEIPEDAWVLDLEDIEKDTSKLLQKARFKERNSLSDK 141 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK-DVLPELLQGWLLSIDVTQRI 137 + F KG +LYGKL PYL K ++AD DG C+T+ L + L G L S + + Sbjct: 142 NFFNKGDVLYGKLRPYLNKVLVADEDGFCTTEILPFRCYGPFLANYFMGALKSPYFLRYV 201 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 A G M + P+PPLAEQ I K+ D L + + Sbjct: 202 NARSYGMKMPRLGTEDGRQALFPLPPLAEQYRIVAKVDELMALCDRLDARQADADSAHVQ 261 Query: 198 KKQA-----------------------------------------LVSYIVTKGLNPDVK 216 QA L+ V L P Sbjct: 262 LVQALLDSLTQARNAEDFAQSWQRLAEHFHTLFTTEPSIDALKQILLQLAVMGKLVPQDP 321 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET----RN 272 + E + + + + + + + + L I + + Sbjct: 322 SDEPASELLRRIAKEKALLVAEGKIKKQKVLSEIEKDEALFELPSSWIWTRFGNVCAIKG 381 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI---- 328 ++PE + + + V P I L +++ S + + Sbjct: 382 ELVRPEDFPSLRQVAPDCIEKGTGRLTDNRTVKDSGVKGPNSRFFAGQIVYSKIRPSLSK 441 Query: 329 ------------DSTYLAWLMRSYDLCKVFYAM-----GSGLRQSLKFEDVK-----RLP 366 D + + S L K + +K + Sbjct: 442 AVLVDFDGLCSADMYPIDAFINSEFLLKEILSAVFLEQVRVAENRIKMPKLNQESMANFV 501 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS-------SFIAAAVTGQ 418 + +PP+ EQ I ++ A D L + L E R + I A+ G+ Sbjct: 502 LPIPPLAEQRRIVAKVDQLMALCDQLKAR-------LGEVRQVHGSLANALIGQALNGE 553 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 67/205 (32%), Gaps = 18/205 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFA--LVTELNRKNTKLIESNILSLSYG-NIIQKLETRNMGLK 276 SG +P WE +K ++ +L L KL + + Sbjct: 75 SGDGVPFELPVTWEWARLGEITNFGITVKKEEIPEDAWVLDLEDIEKDTSKLLQKARFKE 134 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 S + G++++ + +K + + T + Y Sbjct: 135 RNSLSDKNFFNKGDVLYGKLRPYLNKVLVA---DEDGFCTTEILPFRCYGPFLANYFMGA 191 Query: 337 MRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL--- 392 ++S + A G+ L ED ++ +PP+ EQ+ I ++ A D L Sbjct: 192 LKSPYFLRYVNARSYGMKMPRLGTEDGRQALFPLPPLAEQYRIVAKVDELMALCDRLDAR 251 Query: 393 --------VEKIEQSIVLLKERRSS 409 V+ ++ + L + R++ Sbjct: 252 QADADSAHVQLVQALLDSLTQARNA 276 >gi|260582498|ref|ZP_05850289.1| type I restriction/modification specificity protein [Haemophilus influenzae NT127] gi|260094478|gb|EEW78375.1| type I restriction/modification specificity protein [Haemophilus influenzae NT127] Length = 455 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 61/468 (13%), Positives = 138/468 (29%), Gaps = 85/468 (18%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPK---DGNSRQ 72 +WKV+ + + G T S K +I +I +D+ +Y+ K + Sbjct: 2 SNWKVMKLSEVATIVGGGTPSSSKSEYFENGNIPWITPKDLSGYNKRYISKGERNITELG 61 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S+ + K +L P AI ++ ++ +PE + L + Sbjct: 62 LKNSSAKLLPKNTVLLTSRAPIGYVAIASNEISTNQGFKSLVLNNGHIPE--FFYYLLKN 119 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +E+ G+T + + + + IP Q I + + +I+ Sbjct: 120 NVHILESRATGSTFKEISGQILKDTELSIPTPDIQQKIVDILSPLDDKIELNTQINQTLE 179 Query: 193 ELLKEKKQALV---------SYIVTKGL-------------------------------- 211 ++ + ++ ++ GL Sbjct: 180 QIAQALFKSWFVDFDPVRAKVQALSDGLSLEQAELAAMQAISGKTPEELTALSQTQPDRY 239 Query: 212 ----NPDVKMKDSGIEWVG-LVPDHWEVKPFFALVTELNRK-----NTKLIESNILSLSY 261 +E G P WE+K L + K N + + + Sbjct: 240 AELAETAKVFPCEMVEIDGVEAPRGWEMKALSDLGQIICGKTPSKSNKEFYGDAVPFIKI 299 Query: 262 GNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++ ++ T N+ + +Y++ + + I I + I + Sbjct: 300 PDMHNQVFITQTTDNLSVVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQINS 359 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ--SLKFEDVKRLPVLVPPIKE- 374 + +L ++ + K + SG +L ++ ++ P + Sbjct: 360 ----IIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATFNLNTSTFSKIEIITPSKEII 415 Query: 375 ---QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 Q + ++ L IE L E R + ++G+I Sbjct: 416 YIFQKKVVSIFEK------TLSNSIENK--RLTEIRDLLLPRLLSGEI 455 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 13/131 (9%), Positives = 42/131 (32%), Gaps = 7/131 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESG-TGKYLPKDGNSRQSD 74 P+ W++ + ++ G+T + +I + D+ + + + ++ Sbjct: 262 PRGWEMKALSDLGQIICGKTPSKSNKEFYGDAVPFIKIPDMHNQVFITQTTDNLSVVGAN 321 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + I + ++ + ++ + E L L +T Sbjct: 322 YQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQINSIIPDDEQSCEFLYLSLKQPSMT 381 Query: 135 QRIEAICEGAT 145 + ++ + G T Sbjct: 382 KYLKDLASGGT 392 >gi|160939416|ref|ZP_02086766.1| hypothetical protein CLOBOL_04309 [Clostridium bolteae ATCC BAA-613] gi|158437626|gb|EDP15388.1| hypothetical protein CLOBOL_04309 [Clostridium bolteae ATCC BAA-613] Length = 378 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 60/392 (15%), Positives = 131/392 (33%), Gaps = 23/392 (5%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 I + R + +++ + + D + KD + D S+ + KG ++Y Sbjct: 4 RIGDIYAERSER-GAADMELLSVTMNDGVMQRSEIEGKDNS--SEDKSSYKVVRKGDMVY 60 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGAT-- 145 + + ++ +DGI S + VL K + + + +G T Sbjct: 61 NSMRMWQGANGVSPYDGIVSPAYTVLTAKLPICNDYFAALFKNYKLINEFRKNSQGMTSD 120 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + I I + +P + EQ I + V +D I + IE LK+ K+ ++S Sbjct: 121 TWNLKYPQIETIKVYLPVIEEQEKIASIL----VTLDKRIAAQAALIEQLKKYKRGVISA 176 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +++ NP + + V +E + + K I + + Sbjct: 177 LLSSKTNPYYSSETWKEVALCDVASGFEYG--MNAAATVYDGSHKYIRITDIDDNSHLYS 234 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 Q + G E Y V +I+F K R + + + Sbjct: 235 QDVPVSPEGQVDEKY----RVRENDILFARTGASVGKSY-RYQRSDGDLYYAGFLIRIHV 289 Query: 326 HGIDSTYLAWL--MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + + + + V + + E K+ L+PP++ Q I + Sbjct: 290 NSDVNCGYVFQNTLTEAYRRWVLLESARSGQPGINAEQYKQYRFLLPPLELQNKI----S 345 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +D L+ K + +++ + + + Sbjct: 346 TLATNLDNLICKEGNLLSQIEQVKIALLQRLF 377 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 33/157 (21%), Positives = 68/157 (43%), Gaps = 12/157 (7%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 Q+ E E +Y++V G++V+ + + + GI++ AY + Sbjct: 33 QRSEIEGKDNSSEDKSSYKVVRKGDMVYNSMRMWQGANGVSPYD----GIVSPAYTVLTA 88 Query: 326 H-GIDSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVPPIKEQFDITNV 381 I + Y A L ++Y L F G+ +LK+ ++ + V +P I+EQ I ++ Sbjct: 89 KLPICNDYFAALFKNYKLINEFRKNSQGMTSDTWNLKYPQIETIKVYLPVIEEQEKIASI 148 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +D + I LK+ + I+A ++ + Sbjct: 149 LVT----LDKRIAAQAALIEQLKKYKRGVISALLSSK 181 >gi|325685549|gb|EGD27638.1| type I site-specific deoxyribonuclease [Lactobacillus delbrueckii subsp. lactis DSM 20072] Length = 501 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 64/436 (14%), Positives = 135/436 (30%), Gaps = 58/436 (13%) Query: 22 PKHWKVVPIKRFTKLNTGR---------------TSESGKDIIYIGLEDVESGTGKYLPK 66 P +W+V + G +S + T Y Sbjct: 64 PTNWEVTRLIEICAKVKGAIKRGPFGSSITKAMFVPKSKNTFKVYEQGNAIRKTTDYGEY 123 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELL 124 + + G I+ G +I GI + + L + + + Sbjct: 124 YMPDSEFERLKSFEVHAGDIIISCAGTIGEAFVIPKTFERGIINQALMKLTIDENIIDKQ 183 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGI--GNIPMPIPPLAEQVLIREKIIAETVRID 182 L+ +T ++ +G+ + + + P+PPLAEQ I E++ +D Sbjct: 184 FFLLVFKSITGQLREHSKGSAIKNLASLKYLKNEVTFPLPPLAEQKRIVERLDQIMPLVD 243 Query: 183 TLITERIRFIELL----KEKKQALVSYIVTKGLNPDVKMKDSGIEWV------------- 225 R E+ K++L+ Y + L + E + Sbjct: 244 KYAETYNRLQEIDKGIGDRLKKSLLQYAMGGKLVDQDPNDEPASELLKRIRAEKSELIKK 303 Query: 226 ------------------GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 +PD WE L+ ++ + + S N +QK Sbjct: 304 SKIKKSKKLPEITEDEKPFDIPDSWEWVRLGELLKPESKVKPTKNFTYVDIASLDNKVQK 363 Query: 268 LETRNMGLKPESY---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + + Q+++ +I++ + ++ + + + Sbjct: 364 IISPKYVDVSKDKIPVRATQLINRNDILYSLVRPYLKNVAIVPKKFDGAIATSGFCVLKP 423 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + YL W + S +A GL S+K D+ P+ +PP+ EQ I ++ Sbjct: 424 LKESLTQYLFWALLSPYTTDEMHARMKGLNSPSIKKGDLIGWPIPLPPLAEQKRIVTKLS 483 Query: 384 VETARIDVLVEKIEQS 399 ++D+L + +E Sbjct: 484 KLFKQVDILQKDLEAK 499 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 65/183 (35%), Gaps = 20/183 (10%) Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIVDPGEIVFRFIDLQNDKRSLR 306 ++ GN I+K P+S V G+I+ + + Sbjct: 99 PKSKNTFKVYEQGNAIRKTTDYGEYYMPDSEFERLKSFEVHAGDIIISCAGTIGEAFVIP 158 Query: 307 SAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 ERGII A M + + ID + + +S ++ GS ++ + +K Sbjct: 159 K--TFERGIINQALMKLTIDENIIDKQFFLLVFKSITGQLREHSKGSAIKNLASLKYLKN 216 Query: 365 -LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE--------RRSSFIAAAV 415 + +PP+ EQ I ++ +D E + L+E + S + A+ Sbjct: 217 EVTFPLPPLAEQKRIVERLDQIMPLVDKYAETYNR----LQEIDKGIGDRLKKSLLQYAM 272 Query: 416 TGQ 418 G+ Sbjct: 273 GGK 275 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 64/170 (37%), Gaps = 9/170 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---S 76 IP W+ V + L + K+ Y+ + +++ K + D Sbjct: 323 DIPDSWEWVRLGEL--LKPESKVKPTKNFTYVDIASLDNKVQKIISPKYVDVSKDKIPVR 380 Query: 77 TVSIFAKGQILYGKLGPYLRKAII----ADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + + ILY + PYL+ I D S ++ K+ L + L LLS Sbjct: 381 ATQLINRNDILYSLVRPYLKNVAIVPKKFDGAIATSGFCVLKPLKESLTQYLFWALLSPY 440 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 T + A +G + P+P+PPLAEQ I K+ ++D Sbjct: 441 TTDEMHARMKGLNSPSIKKGDLIGWPIPLPPLAEQKRIVTKLSKLFKQVD 490 >gi|124002922|ref|ZP_01687773.1| HsdS [Microscilla marina ATCC 23134] gi|123991572|gb|EAY30980.1| HsdS [Microscilla marina ATCC 23134] Length = 402 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 52/397 (13%), Positives = 122/397 (30%), Gaps = 36/397 (9%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD------GNSRQSDTSTVSIF 81 + T L + R ++ K + + + + G ++ + R D S I Sbjct: 28 KRLGDITILVSKRNKDNKK----LPVYSINNKEGFLPQEEQFEGVISSKRGYDISLYKII 83 Query: 82 AKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQG-WLLSIDVTQRIE 138 + Y + + ++ I S+ ++ Q +D + + + + Sbjct: 84 ERNTFAYNPARIDVGSIGFSGDLYNIIISSLYVCFQTEDNIDNHFLWQFFNTYYFNTTVR 143 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG ++ ++ IP+ IP EQ I + + + I +E L Sbjct: 144 NNVEGGIRNYLFYENFSRIPVAIPKKLEQQKIADCLRSLDQL----IVVHETRLESLNNH 199 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+ L+ + + +++ + G + + KN Sbjct: 200 KKGLMQQLFPQEGEKVPRLRFPEFKGNGEWEEKELGSIAKVTTGNKDTKNKVDNGQYPFF 259 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + N+ + S++ I+ G+ +N + +R Sbjct: 260 VRSQNV--------ERIDSYSFDGEAILTSGD---GVGVGKNFHYIIGKFDFHQRVYA-- 306 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + Y+ Y +V S++ + +P+ P KEQ I Sbjct: 307 --IYDFTEVVLGKYIFMYFSQYFYDRVMKMSAKNSVDSVRKAMITEMPIKFPSPKEQQKI 364 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + + +D L+ Q I L + + + Sbjct: 365 ADCL----SSLDTLIAAEAQKIGALGKHKKGLMQQLF 397 >gi|283477074|emb|CAY72969.1| type I restriction-modification system specificity subunit [Erwinia pyrifoliae DSM 12163] Length = 474 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 55/464 (11%), Positives = 135/464 (29%), Gaps = 72/464 (15%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGKDIIY-------IGLEDVESGTGKYLPK-DGNSRQSD 74 W V + K+ +G T + GK + I +++ + K + Sbjct: 4 DWSFVRLGDHCLKIGSGATPKGGKSVYLDNGKTSLIRSQNIYNDGFKNSGLAYITEDAAK 63 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQ--FLVLQPKDVLPELLQGWLL 129 G IL G + + +A + + K+ ++ +L Sbjct: 64 KLNNVEVQDGDILLNITGDSVARVCLAPEGHLPARVNQHVAIIRPNSKEFDARFIRYFLA 123 Query: 130 SIDVTQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + I GAT + I ++ + P L Q I +++ + +I + Sbjct: 124 SPAQQNVLLTIASAGATRNALTKSNIESLLICKPCLKNQKWIADQLESLDKKIHSNQQIN 183 Query: 189 IRFIELLKEKKQAL---------------------------VSYIVTKGLNPDVKMKDSG 221 ++ + ++ ++ I K + K Sbjct: 184 QTLEQMAQALFKSWFVDFEPVKAKIALLEAGGSQQEATLAAMTAISGKDADSLEVFKHKQ 243 Query: 222 IE-------------------WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 E +G +P W + E Sbjct: 244 PEKYAELKATAELFPSAMQESELGEIPQGWTNSEIGEEIDIAGGATPSTKEPKFWENGDI 303 Query: 263 NIIQKLETRNMGLKPESYETYQI-------VDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 N + N+ K +I + G + + + + A Sbjct: 304 NWTTPKDLSNLQDKILIKTDRKITDRGLAKISSGLLAIDTVLMSSRAPVGYLALTKIPVA 363 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I Y+A+K + + ++++ ++ + ++ +P++ P Sbjct: 364 INQGYIAMKCNYDLNPEFVLQWCNHNMPEIISRASGTTFAEISKKNFNPIPLIKPT---- 419 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + ++ E + +L+EK + +L++ R + + ++G+I Sbjct: 420 KKMVDIYTREVRSLYLLIEKNVRKTEILQQLRDTLLPKLLSGEI 463 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 59/197 (29%), Gaps = 12/197 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKY---LPKD 67 +G IP+ W I + G T + + DI + +D+ + K + Sbjct: 266 LGEIPQGWTNSEIGEEIDIAGGATPSTKEPKFWENGDINWTTPKDLSNLQDKILIKTDRK 325 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 R + + A +L P + + ++ ++ L Sbjct: 326 ITDRGLAKISSGLLAIDTVLMSSRAPV-GYLALTKIPVAINQGYIAMKCNYDLN-PEFVL 383 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I + G T + K IP+ P + ++ + + I+ + + Sbjct: 384 QWCNHNMPEIISRASGTTFAEISKKNFNPIPLIKPTKKMVDIYTREVRSLYLLIEKNVRK 443 Query: 188 RIRFIELLKEKKQALVS 204 +L L+S Sbjct: 444 TEILQQLRDTLLPKLLS 460 >gi|126665708|ref|ZP_01736689.1| type I restriction-modification enzyme S subunit [Marinobacter sp. ELB17] gi|126629642|gb|EBA00259.1| type I restriction-modification enzyme S subunit [Marinobacter sp. ELB17] Length = 576 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 71/465 (15%), Positives = 125/465 (26%), Gaps = 88/465 (18%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IPK W + TG T K DI ++ D+ +G + Sbjct: 101 IPKAWSWQALGALGYTQTGSTPSKSKSEFFGSDIPFLKPGDISENGDVRYENEGLTEAGK 160 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ K IL +G + +I + L L S Sbjct: 161 SALGKWAQKESILMVCIGTIGKCGLIERQSTFNQQINSITPYILETSRFLLLCLKSPYFQ 220 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE----------------- 177 + T+S + +IP+P+PP EQ I +K+ Sbjct: 221 KAAWEKSSSTTISILNKGKWESIPVPLPPTEEQHRIVQKVDELMALCDRLEQQSSDQLKA 280 Query: 178 ------------------------TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 R+ + + + KQ ++ V L Sbjct: 281 HETLVDTLLGTLTQSENATELADNWARLAAHFDTLFTTEQSIDKLKQTVLQLAVMGRLVE 340 Query: 214 DVKMKDSGIEWV-----------------------------GLVPDHWEVKPFFALVTEL 244 + +S E + +P W F Sbjct: 341 QNPVDESAAELIVRVSMEKAQRQKRKRTQKAPCEISAEVKPFDIPKSWLWTSLFNTGFTS 400 Query: 245 NRKNT-----KLIESNILSLSYGNIIQKLET--RNMGLKPESYETYQIVDPGEIVFRFID 297 K NI L G + E L + + PG+I+ I Sbjct: 401 TGKTPSTKVPNFFSGNIPFLGPGQVTGSGEILAPEKFLSEDGLSLSEEAIPGDIMTVCIG 460 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQS 356 K + ++ ER V+P I+ YL +++ A +G Sbjct: 461 GSIGKTA----KITERCGFNQQLNKVRPVLIEPDYLLATLKADFFQNAVLAKATGSATPI 516 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + + V + P+ EQ I ++ A D L +++ Q+ Sbjct: 517 INRSKWDSIEVPIAPLAEQKRIVQKVDELMALCDQLKQRLNQASE 561 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 59/208 (28%), Gaps = 13/208 (6%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P + ++ +P W + AL +S + + N Sbjct: 86 PKAIINVPEADYPFSIPKAWSWQALGALGYTQTGSTPSKSKSEFFGSDIPFLKPGDISEN 145 Query: 273 MGLKPESYETY--------QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 ++ E+ + I+ I + + I + Sbjct: 146 GDVRYENEGLTEAGKSALGKWAQKESILMVCIGTIGKCGLIERQSTFNQQINS----ITP 201 Query: 325 PHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 S +L ++S K + S L + +PV +PP +EQ I ++ Sbjct: 202 YILETSRFLLLCLKSPYFQKAAWEKSSSTTISILNKGKWESIPVPLPPTEEQHRIVQKVD 261 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFI 411 A D L ++ + + + + Sbjct: 262 ELMALCDRLEQQSSDQLKAHETLVDTLL 289 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 72/195 (36%), Gaps = 7/195 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IPK W + +TG+T + +I ++G V +G+G+ L + + Sbjct: 383 DIPKSWLWTSLFNTGFTSTGKTPSTKVPNFFSGNIPFLGPGQV-TGSGEILAPEKFLSED 441 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S G I+ +G + K + Q ++P + P+ L L + Sbjct: 442 GLSLSEEAIPGDIMTVCIGGSIGKTAKITERCGFNQQLNKVRPVLIEPDYLLATLKADFF 501 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + A G+ + +I +PI PLAEQ I +K+ D L + E Sbjct: 502 QNAVLAKATGSATPIINRSKWDSIEVPIAPLAEQKRIVQKVDELMALCDQLKQRLNQASE 561 Query: 194 LLKEKKQALVSYIVT 208 + +V+ + Sbjct: 562 TRCQLANTVVAAALD 576 >gi|239827072|ref|YP_002949696.1| restriction modification system DNA specificity domain protein [Geobacillus sp. WCH70] gi|239807365|gb|ACS24430.1| restriction modification system DNA specificity domain protein [Geobacillus sp. WCH70] Length = 428 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 66/424 (15%), Positives = 140/424 (33%), Gaps = 29/424 (6%) Query: 23 KHWKVVPIKRFT-KLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 WK +K ++ G T+ + ++ ++ + D+ + + S Sbjct: 4 SEWKTYSLKDICTDISYGYTASAKEEKVGPKFLRITDLRNEFIDWESVPYCSINEKDYKK 63 Query: 79 SIFAKGQILYGKLGPYLRK--AIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVT 134 G + + G I D D + ++ + + + P ++ S Sbjct: 64 YKLEIGDLCIARTGATTGINTVIEEDVDAVFASYLVRFKLNKEIVDPTFIKYIFKSNMWY 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +I G+ A+ + + N M IP L EQ I + + I + + Sbjct: 124 GYVNSIISGSAQPGANAQQMSNFKMSIPDLDEQKKIASVLSVLDKK----IVLNNKINKT 179 Query: 195 LKEKKQALVSYIVTKGLNPD---VKMKDSG----IEWVGLVPDHWEVKPFFALVTELNRK 247 L+E QA+ P+ K SG G++P+ W+ LV Sbjct: 180 LEEMAQAIFKRWFVDFEFPNENGKPYKSSGGKFVESESGMIPEGWKEGTLDNLVVINTAS 239 Query: 248 NTKLIESNILSLSY-GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 IL Y + + E +V P + ++ KR Sbjct: 240 VDPKENPEILYEHYSIPAFDEQKYPKFEYGREIKSNKYLVRPNSFLVSKLNPTT-KRVWD 298 Query: 307 SAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDV 362 + E I ++ ++ P I +YL ++ S + +G RQ +K + Sbjct: 299 PLCITENAISSTEFINYLPKDISYQSYLYCMLNSERFSEHLIKHATGSTGSRQRVKPAET 358 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 V++P + + I ++ + + +LK+ R + ++G+I + Sbjct: 359 LTFNVILPDTETLKKF----DNLIRPIREKLKINQINSAVLKDVRDILLPKLMSGEIRVP 414 Query: 423 GESQ 426 + Sbjct: 415 DAER 418 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 32/144 (22%), Positives = 53/144 (36%), Gaps = 9/144 (6%) Query: 10 YKDSGVQWI----GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 YK SG +++ G IP+ WK + +NT I + + + P Sbjct: 205 YKSSGGKFVESESGMIPEGWKEGTLDNLVVINTASVDPKENPEILYEHYSIPAFDEQKYP 264 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI---IADFDGICSTQFLVLQPKDV-LP 121 K R+ S + L KL P ++ + I ST+F+ PKD+ Sbjct: 265 KFEYGREIK-SNKYLVRPNSFLVSKLNPTTKRVWDPLCITENAISSTEFINYLPKDISYQ 323 Query: 122 ELLQGWLLSIDVTQRIEAICEGAT 145 L L S ++ + G+T Sbjct: 324 SYLYCMLNSERFSEHLIKHATGST 347 >gi|229542895|ref|ZP_04431955.1| restriction modification system DNA specificity domain protein [Bacillus coagulans 36D1] gi|229327315|gb|EEN92990.1| restriction modification system DNA specificity domain protein [Bacillus coagulans 36D1] Length = 483 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 61/448 (13%), Positives = 138/448 (30%), Gaps = 62/448 (13%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P +W V +K K + + G+ +++ D S Sbjct: 25 EVPGNWVWVKLKTINKDKKRNIDPKSFKDETFELYSVPSFPEGSPEFIKGDEIG-----S 79 Query: 77 TVSIFAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + + K +IL K+ P + + F + ST+++V+ + +LL Sbjct: 80 SKQLVNKDEILLCKINPRINRVWKVLNNHGKFRQLASTEWIVISENKAIYSEYLLYLLKS 139 Query: 132 DVTQRIEAICEGATMSHAD---WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 +++ K + P+ +PP+ EQ I +K+ +ID Sbjct: 140 PYFRKLITSNVSGVGGSLTRARPKEVETYPIAVPPIKEQKRIADKVERLLSKIDEAKRLI 199 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE------------------------- 223 E + ++ A++ L + ++ IE Sbjct: 200 EEAKETFELRRAAILDKAFRGELTRKWREENKNIEDAESLYVKIKESQSIRRKVSKEINI 259 Query: 224 --WVGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGLK 276 +P W+ + T + K E NI + G I + Sbjct: 260 KDLRYSIPSTWKWVRLGDVFTITSGGTPKRTIPEYYEGNIPWIKTGEIKWNAINESEEQI 319 Query: 277 PES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + +++ P ++ + R+A + A A+ P+ + Sbjct: 320 TPEAVANSSAKLLPPNTVLVAMYGQGLTRG--RAAILSVEATCNQAVCALLPNDYIAPEF 377 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET---ARID 390 + + G +++L + +PP++EQ I + ++I Sbjct: 378 IFYYFMEGYQRFRQVAKGGNQENLSVSLISDFIFPLPPLEEQRVIITTLQNIFKKESKIK 437 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +++ I + S ++ A G+ Sbjct: 438 DVIKINTDEI------KQSILSKAFRGE 459 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 47/228 (20%), Positives = 93/228 (40%), Gaps = 12/228 (5%) Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 KKQ + ++ + L P E VP +W + + R + Sbjct: 1 MRKKQKTMEELLEEALVP-------EGEQPYEVPGNWVWVKLKTINKDKKRNIDPKSFKD 53 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME-RG 314 Y + E + Q+V+ EI+ I+ + ++ + R Sbjct: 54 ETFELYSVPSFPEGSPEFIKGDEIGSSKQLVNKDEILLCKINPRINRVWKVLNNHGKFRQ 113 Query: 315 IITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF---EDVKRLPVLVP 370 + ++ ++ + + I S YL +L++S K+ + SG+ SL ++V+ P+ VP Sbjct: 114 LASTEWIVISENKAIYSEYLLYLLKSPYFRKLITSNVSGVGGSLTRARPKEVETYPIAVP 173 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PIKEQ I + + ++ID IE++ + RR++ + A G+ Sbjct: 174 PIKEQKRIADKVERLLSKIDEAKRLIEEAKETFELRRAAILDKAFRGE 221 >gi|224368580|ref|YP_002602743.1| HsdS1 [Desulfobacterium autotrophicum HRM2] gi|223691296|gb|ACN14579.1| HsdS1 [Desulfobacterium autotrophicum HRM2] Length = 393 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 57/402 (14%), Positives = 132/402 (32%), Gaps = 24/402 (5%) Query: 26 KVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + + I F K +G T I +I +++ + ++ + S+ Sbjct: 4 EKISISDFCKTGSGGTPSRRNLEFYKGSIPWIKSGELKEDIIYDSEEKISAEAIENSSAK 63 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I + IL G + + + D + + P + + + + + Sbjct: 64 IISNKAILVAMYGATIGRVAMLGVDAATNQAICNIIPDSKRADNRYLFYALQNAVPVLLS 123 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + I + +P+PP+ EQ I + + +R + IEL E Sbjct: 124 RKVGGGQPNISQTIIKDTKIPLPPIKEQKRIAAILDKADA----IRRKREKAIELADEFL 179 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 +++ + +P K + + + +K +L+ K T + ++ Sbjct: 180 KSV---FLYMFGDPVTNPKGWPEYKLSEISE---LKSGVTKGRKLDGKKTIAVPYMRVAN 233 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 I + + + + E +Q+ ++ D R + I + Sbjct: 234 VQDGHIIIDDLKEIEVLETDVEKFQLNVGDLLLTEGGDPDKLGRGAVWKGEINPCIHQNH 293 Query: 320 YMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQ 375 V+P I YL+ + S + F + + S+ +K LVPP+ Q Sbjct: 294 IFRVRPDEKRILPEYLSKQIGSARGKRYFLSSAKQTTGVASINMTQLKNFSALVPPMSLQ 353 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + + + + +++K+ + +S A G Sbjct: 354 KEFCEIAAKLESIKNKMIDKLTNQ----EHLFNSLTQRAFRG 391 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 55/193 (28%), Gaps = 14/193 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 PK W + ++L +G T + + Y+ + +V+ G Sbjct: 194 PKGWPEYKLSEISELKSGVTKGRKLDGKKTIAVPYMRVANVQDGHIIIDDLKEIEVLETD 253 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT- 134 G +L + G + A + G + V P+ + + Sbjct: 254 VEKFQLNVGDLLLTEGGDPDKLGRGAVWKGEINPCIHQNHIFRVRPDEKRILPEYLSKQI 313 Query: 135 -------QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + + ++ + + N +PP++ Q E + +I + Sbjct: 314 GSARGKRYFLSSAKQTTGVASINMTQLKNFSALVPPMSLQKEFCEIAAKLESIKNKMIDK 373 Query: 188 RIRFIELLKEKKQ 200 L Q Sbjct: 374 LTNQEHLFNSLTQ 386 >gi|294619903|ref|ZP_06699279.1| HsdS subunit [Enterococcus faecium E1679] gi|291593840|gb|EFF25338.1| HsdS subunit [Enterococcus faecium E1679] Length = 388 Score = 107 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 63/398 (15%), Positives = 134/398 (33%), Gaps = 37/398 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLED-------VESGTGKYLPKDGNSRQSDTS 76 W+ + + G T + + G D + K K + S Sbjct: 17 DWEQRKLGEVADIIGGGTPNTNNPEYWNGDIDWYAPAEIGKQIYVKNSQKKISQLGLQKS 76 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + I G +L+ AI+A G + F + P + + + + ++ + Sbjct: 77 SAKILPIGTVLFTSRAGIGNTAILAKE-GTTNQGFQSIVPHENKLDSYFIFSRTHELKRY 135 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 E G+T + K + +P+ IP + EQ +KI ++D IT R ++LLK Sbjct: 136 GEVTGAGSTFAEVSGKQMAKMPILIPYIDEQ----QKIGIFFKKLDDTITLHQRTLDLLK 191 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 E K+ + + P K I + G + WE + + K + + Sbjct: 192 ETKKGFLQKMF-----PKNGAKVPEIRFPGFT-EDWEERKLGDITKISTGK--LDANAMV 243 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + Y ++ + + + I G V N + + V++ ++ Sbjct: 244 ENGKYDFYTSGIKKYRIDVAAFEGPSITIAGNGATVGYMHLADNKFNAYQRTYVLQEFLV 303 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQ 375 +++ + K+ +G + + + L + +P EQ Sbjct: 304 DRSFIFSEIGNKLP------------KKIKQEARTGNIPYIVMDMLTELKLSIPQNNSEQ 351 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I ++D + ++ + LLKE + F+ Sbjct: 352 QKIGTF----FKQLDDTITLHQRKLDLLKETKKGFLQK 385 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 63/189 (33%), Gaps = 9/189 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI----IQKLETRNMGLKPESYETYQI 285 D WE + + + + I K K S Q Sbjct: 16 DDWEQRKLGEVADIIGGGTPNTNNPEYWNGDIDWYAPAEIGKQIYVKNSQKKISQLGLQK 75 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + + + +A + + G + ++ PH R+++L + Sbjct: 76 SSAKILPIGTVLFTSRAGIGNTAILAKEGTTNQGFQSIVPHENKLDSYFIFSRTHELKRY 135 Query: 346 FYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 G+G + + + ++P+L+P I EQ I ++D + ++++ LLK Sbjct: 136 GEVTGAGSTFAEVSGKQMAKMPILIPYIDEQQKIGIF----FKKLDDTITLHQRTLDLLK 191 Query: 405 ERRSSFIAA 413 E + F+ Sbjct: 192 ETKKGFLQK 200 Score = 43.2 bits (100), Expect = 0.088, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 63/185 (34%), Gaps = 16/185 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + TK++TG+ + VE+G + + D + F Sbjct: 219 EDWEERKLGDITKISTGKLDANAM---------VENGKYDFYTSGIKKYRIDVAA---FE 266 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 I G + +AD + VLQ V + + + + + Sbjct: 267 GPSITIAGNGATVGYMHLADNKFNAYQRTYVLQEFLVDRSFIFSEIGNKLPKKIKQEART 326 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + + + + + ++KI ++D IT R ++LLKE K+ Sbjct: 327 GN----IPYIVMDMLTELKLSIPQNNSEQQKIGTFFKQLDDTITLHQRKLDLLKETKKGF 382 Query: 203 VSYIV 207 + + Sbjct: 383 LQKMF 387 >gi|325690778|gb|EGD32779.1| type I restriction-modification system specificity protein [Streptococcus sanguinis SK115] Length = 409 Score = 107 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 43/411 (10%), Positives = 125/411 (30%), Gaps = 33/411 (8%) Query: 23 KHWKVVPIKRFTK-LNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK I + + +G T + + ++ + + K + + Sbjct: 15 SDWKEYRIGELIETIFSGGTPNTKNSDYWNGSLPWLSSGETRNRYINVTEKTITNSGAQN 74 Query: 76 STVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSID 132 S+ KG ++ G + + D + + + + VL + + LS Sbjct: 75 SSTRQALKGDVVMASAGQGYTRGQVSFLNIDTFINQSVIAIRANEKVLDKKFLFYNLSSR 134 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + K + ++ + IP L Q I + + +I+T Sbjct: 135 YEELRAISDSNSIRGSITTKMVKSMNIRIPDLNTQRAIANVLSSIDDKIETSKQINHHLE 194 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 ++ + ++ G G +P+ W + ++ + Sbjct: 195 QMAQAIFKSWFVDFEPFG---------------GKMPNDWTIGKLSDVLKLIKNGINDKD 239 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + + + N + ++ I +I+ + + + + + Sbjct: 240 KQKLPYVPIDILPMHSLSLNSYKSNDEAKSSLITFKKNDILLGAMRVYFHRVCISPFTGI 299 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWL-MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 R + L ++S + GS + ++ + L + +P Sbjct: 300 TRSTC--FVLRPFNKIYLEYCLLTCDLKSSIEYAQSTSKGSTMPYAVWENGLAELKIPIP 357 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 K + + +++ + + I L+ R + + ++G+I + Sbjct: 358 TEKVIKNFSKIVSPLIKTLQDSIY----EIENLQNLRDTLLPKLLSGEISV 404 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 67/189 (35%), Gaps = 5/189 (2%) Query: 19 GAIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 G +P W + + K + G + + + Y+ ++ + + N S+ Sbjct: 213 GKMPNDWTIGKLSDVLKLIKNGINDKDKQKLPYVPIDILPMHSLSLNSYKSNDEA--KSS 270 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGIC-STQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + F K IL G + Y + I+ F GI ST F++ + E + Sbjct: 271 LITFKKNDILLGAMRVYFHRVCISPFTGITRSTCFVLRPFNKIYLEYCLLTCDLKSSIEY 330 Query: 137 IEAICEGATMSH-ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ +G+TM + G+ + +PIP + + + I E L Sbjct: 331 AQSTSKGSTMPYAVWENGLAELKIPIPTEKVIKNFSKIVSPLIKTLQDSIYEIENLQNLR 390 Query: 196 KEKKQALVS 204 L+S Sbjct: 391 DTLLPKLLS 399 >gi|187729922|ref|YP_001853816.1| type IC specificity subunit [Vibrio tapetis] gi|182894481|gb|ACB99646.1| type I restriction modification system DNA specificity subunit Hsds [Vibrio tapetis] Length = 419 Score = 107 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 66/407 (16%), Positives = 118/407 (28%), Gaps = 26/407 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS- 76 W+ P+ L G +S + V G G K N + + Sbjct: 19 EWEEKPLGDVLSLANGYAFKSEYFCKDKTGYEVLTPGSVHIGGGFQYGKGQNYKLEGKTP 78 Query: 77 TVSIFAKGQILYGKLGPY-----LRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWL 128 IFA G + L I DG + + L L L Sbjct: 79 QKFIFAAGDVFITMTDLTPTAQMLGLPAIVPDDGTTYLHNQRLGKLIQYKGDYGFLFYLL 138 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + +I A G T+ H+ + + P EQ + +D + Sbjct: 139 STDTYRNQIVATSSGTTVKHSSPDKVKSSKFFFPNKVEQTS----LGYYFQNVDKQLKLH 194 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 L++ K+A++ + K +++ +G V F Sbjct: 195 QDKFAKLQQLKKAMLGKMFPKAGQTVPELRFAGFSEKWEVEPLGTNASFNKGKGFTKGDL 254 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 +L Q + T S + E++ + + SA Sbjct: 255 NTFGVPIVLYGRLYTNYQTIITEVDTFVS-SESKGIMSKGREVIVPASGESAEDIARASA 313 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL-CKVFYAMGSG-LRQSLKFEDVKRLP 366 + I+ + P+ L+ +Y G ++ D+K L Sbjct: 314 VLQPNVILGGDLNIIYPNNKILPSFLALIITYSCCQAELAKKAQGKSVVHVRNSDIKDLL 373 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 V +P IKEQ I +D L+ +Q I LK + +++A Sbjct: 374 VPMPTIKEQTKIAEY----FQNLDRLINLQQQQIDKLKNLKQTYLAK 416 >gi|169346826|ref|ZP_02865774.1| type I restriction modification DNA specificity domain protein [Clostridium perfringens C str. JGS1495] gi|169296885|gb|EDS79009.1| type I restriction modification DNA specificity domain protein [Clostridium perfringens C str. JGS1495] Length = 404 Score = 107 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 57/406 (14%), Positives = 140/406 (34%), Gaps = 36/406 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + + + + + + I + ++ K S+ + + Sbjct: 16 EWEKIHLSDRVERVVRKNKGNVTNRPLTISAQYGLVNQEEFFNKVVASKNLE--GYYLLN 73 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G+ Y K +G ST ++ +PK + R Sbjct: 74 NGEFAYNKSYSNGYPFGAIKRLDKYKNGAVSTLYICFKPKLNVDSDFLTQYFESSKWYRE 133 Query: 138 EAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ + + + P L EQ I + I+ + + Sbjct: 134 VSMVAVEGARNHGLLNIGVSDFFDTIHRFPSLQEQEKIANFLSKVDSIIEKQEKKVEYWN 193 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 K Q + S K + G+ WE K +++E++ K + Sbjct: 194 SYKKGMMQKIFSQ------------KIRFKDGNGMDYPEWEKKNLKYVLSEISEKTKENN 241 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + +LS + + ++ E N + Y+I+ +IV +L ++ + Sbjct: 242 QYEVLSSTANGVFKQSEYFNREIASADNTGYKILRLNQIVLSPQNLWL--GNINYNNKYD 299 Query: 313 RGIITSAY-MAVKPHGIDSTYLAWLMRS----YDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 GI++ +Y + ++ Y+++++++ Y + S +R++L + + + Sbjct: 300 MGIVSPSYKIFNINKNLNEKYISYIIKTDRMLYGYKQASEQGASVVRRNLNMDLFYDILI 359 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P ++EQ I N + + ID ++EK + + LK+ + + Sbjct: 360 NIPCVEEQEKIANFL----SNIDNIIEKESKKLEELKQWKKGLLQQ 401 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 79/200 (39%), Gaps = 12/200 (6%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLS-YGNIIQKLETRNMGLKPESYETYQIVDPGE 290 WE V + RKN + + L++S ++ + E N + ++ E Y +++ GE Sbjct: 17 WEKIHLSDRVERVVRKNKGNVTNRPLTISAQYGLVNQEEFFNKVVASKNLEGYYLLNNGE 76 Query: 291 IVFRFIDLQND-KRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYA 348 + +++ + G +++ Y+ KP +DS +L S + Sbjct: 77 FAYNKSYSNGYPFGAIKRLDKYKNGAVSTLYICFKPKLNVDSDFLTQYFESSKWYREVSM 136 Query: 349 MG-SGLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + G R ++ D P ++EQ I N + +++D ++EK E+ + Sbjct: 137 VAVEGARNHGLLNIGVSDFFDTIHRFPSLQEQEKIANFL----SKVDSIIEKQEKKVEYW 192 Query: 404 KERRSSFIAAAVTGQIDLRG 423 + + + +I + Sbjct: 193 NSYKKGMMQKIFSQKIRFKD 212 >gi|302877622|ref|YP_003846186.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] gi|302580411|gb|ADL54422.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] Length = 582 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. Identities = 70/483 (14%), Positives = 143/483 (29%), Gaps = 90/483 (18%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS-RQSDTSTV 78 +PK W+ V + + + +S + YI + ++S G + + + Sbjct: 102 ELPKGWEWVRVGQVGHDWGQKEPDS--NFTYIEVSAIDSTRGVVSSPGLVAPEDAPSRAR 159 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQ-GWLLSIDV 133 I KG ++Y + PYL + + + I ST F ++ P ++P + S Sbjct: 160 KIVKKGTVIYSTVRPYLLNIAVIEEEFSPEPIASTAFAIVHPFCLMPPRYFLSFFRSPVF 219 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE---------------- 177 + +E++ G + + +P+PP+ EQ I K+ Sbjct: 220 VRYVESVQMGIAYPAINDGQFFSGLIPLPPIEEQHRIVAKVDELMALCDQLENQHSNAAE 279 Query: 178 -------------------------TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 RI + KQ L+ V L Sbjct: 280 AHEKLVSHLLGTLTQSQNAEDFSANWQRIAAHFDTLFATDASIDALKQTLLQLAVMGKLV 339 Query: 213 PDVKMKD-------------------------------SGIEWVGLVPDHWEVKPFFA-- 239 P + + +E +P WE Sbjct: 340 PQNANDEPASELLKRIQAEKAKLISEGKIKKDKPLTPITDVEKPFELPLRWEWVRLSDIA 399 Query: 240 -LVTELNRKNTKLIESNILSLSYGNIIQK-----LETRNMGLKPESYETYQIVDPGEIVF 293 +T+ + I + LS N+ + + +I+ Sbjct: 400 TQITDGAHHTPEYISDGVPFLSVKNLSSGCLDFTDTRFISPVAHADLTKRCNPEFDDILL 459 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I + + ++ A + + ID YL+ ++ S + + G+ Sbjct: 460 TKIGTTGIAVVIDDPRPFS-IFVSVALIKLPKILIDRDYLSLVINSPFVRQQSEDGTEGV 518 Query: 354 -RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++L + + P+ EQ I ++ A D L +I + L ++ + Sbjct: 519 GNKNLVLRKINTFDIPFAPLAEQHRIVAKVDELMALCDQLKSRITDASRLQQKLADVLVE 578 Query: 413 AAV 415 AV Sbjct: 579 QAV 581 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 68/191 (35%), Gaps = 5/191 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + E +P WE + + +K + +S + + + + + PE Sbjct: 95 TEDEKPFELPKGWEWVRVGQVGHDWGQK-EPDSNFTYIEVSAIDSTRGVVSSPGLVAPED 153 Query: 280 YETY--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWL 336 + +IV G +++ + ++ + I ++A+ V P + Y Sbjct: 154 APSRARKIVKKGTVIYSTVRPYLLNIAVIEEEFSPEPIASTAFAIVHPFCLMPPRYFLSF 213 Query: 337 MRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 RS + ++ G+ ++ + +PPI+EQ I ++ A D L + Sbjct: 214 FRSPVFVRYVESVQMGIAYPAINDGQFFSGLIPLPPIEEQHRIVAKVDELMALCDQLENQ 273 Query: 396 IEQSIVLLKER 406 + ++ Sbjct: 274 HSNAAEAHEKL 284 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 68/204 (33%), Gaps = 9/204 (4%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDG 68 + V+ +P W+ V + + G + ++ ++++ SG + Sbjct: 378 TDVEKPFELPLRWEWVRLSDIATQITDGAHHTPEYISDGVPFLSVKNLSSGCLDFTDTRF 437 Query: 69 NSRQSDTSTVSIFAK--GQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPEL 123 S + IL K+G +I D F S + L + + Sbjct: 438 ISPVAHADLTKRCNPEFDDILLTKIGTTGIAVVIDDPRPFSIFVSVALIKLPKILIDRDY 497 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L + S V Q+ E EG + + I +P PLAEQ I K+ D Sbjct: 498 LSLVINSPFVRQQSEDGTEGVGNKNLVLRKINTFDIPFAPLAEQHRIVAKVDELMALCDQ 557 Query: 184 LITERIRFIELLKEKKQALVSYIV 207 L + L ++ LV V Sbjct: 558 LKSRITDASRLQQKLADVLVEQAV 581 >gi|251811428|ref|ZP_04825901.1| type I restriction-modification system specificity determinant protein [Staphylococcus epidermidis BCM-HMP0060] gi|251805057|gb|EES57714.1| type I restriction-modification system specificity determinant protein [Staphylococcus epidermidis BCM-HMP0060] Length = 400 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. Identities = 62/413 (15%), Positives = 134/413 (32%), Gaps = 39/413 (9%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +K + G+ + +V++ GKY P G D + ++ K +L Sbjct: 4 KLKDLVNIKYGKNQK-----------NVKNPRGKY-PILGTGGIMDYADDFLYDKPSVLI 51 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G+ G + I + T F + ++++ + LS EG T+ Sbjct: 52 GRKGSIGKVKYIEEPFWTIDTLFYTIVNENLVIPKYLYYKLS---QIDFNYYNEGTTIPS 108 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + + I + +P Q + + ID I + I L+E Q L Sbjct: 109 LRTETLYKIDIDLPKKNIQKKVVNLLN----TIDEKIENNQKIIANLEELSQTLFKRWFV 164 Query: 209 KGLNPD---VKMKDSGIE----WVGLVPDHWEVKPFFALVTE--------LNRKNTKLIE 253 PD K SG E +G +P W + + + E Sbjct: 165 DFEFPDENGNPYKSSGGEMIDSELGEIPKKWNILTINDFADDLIITGKTPSTKNKDNYSE 224 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 I L+ ++ + + N +K S + V I + + + Sbjct: 225 KGIPFLTIPDMHTDVFSLN-TIKYISEVGIEKVKNKIIPENSLCVSCIATPGLVSITSSE 283 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + P D YL + ++S G +L ++ ++ P Sbjct: 284 TLTNQQINSFTPKKNDLYYLYFYIKSMKKYIEDLGSGGSATLNLNKTQFSKIKIIRPIND 343 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 ++ ++ ++ + L+E R++ + ++G+I++ + + Sbjct: 344 LLKKFHKCVDSNF----KIILTKQKENLKLQELRNTLLPKLMSGEIEIPDDIE 392 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 69/209 (33%), Gaps = 16/209 (7%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTK--LNTGRTSE-------SGKDIIYIGLEDV 56 YK SG + +G IPK W ++ I F + TG+T S K I ++ + D+ Sbjct: 176 YKSSGGEMIDSELGEIPKKWNILTINDFADDLIITGKTPSTKNKDNYSEKGIPFLTIPDM 235 Query: 57 ESGTGKYLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 + K + + I + + + I + + + Q Sbjct: 236 HTDVFSLNTIKYISEVGIEKVKNKIIPENSLCVSCI-ATPGLVSITSSETLTNQQINSFT 294 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 PK L ++ S+ + G+ + + I + P + + Sbjct: 295 PKKNDLYYLYFYIKSMK-KYIEDLGSGGSATLNLNKTQFSKIKIIRPINDLLKKFHKCVD 353 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVS 204 + I T E ++ EL L+S Sbjct: 354 SNFKIILTKQKENLKLQELRNTLLPKLMS 382 >gi|95928602|ref|ZP_01311349.1| restriction modification system DNA specificity domain [Desulfuromonas acetoxidans DSM 684] gi|95135392|gb|EAT17044.1| restriction modification system DNA specificity domain [Desulfuromonas acetoxidans DSM 684] Length = 417 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. Identities = 60/415 (14%), Positives = 123/415 (29%), Gaps = 25/415 (6%) Query: 11 KDS---GVQWIGAIPKHWKVVPIKRFTKLNT----GRTSE--SGKDIIYIGLEDVESGTG 61 KDS ++++G + W P+ G + + Y+ +V++G Sbjct: 5 KDSNVPEIRFLGYV-NGWTENPLGEIYTKIRNAFVGTATPYYTKNGYFYLQSNNVKNGKI 63 Query: 62 KYLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPK 117 + + + I+ + G A+I + + + K Sbjct: 64 NRKTEIFIDEEFYFKQEKNWLRTNDIVMVQSGHVGHTAVIPNELNNSAAHALIIISKPLK 123 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 P L + + Q I I G T+ H I + PP EQ I Sbjct: 124 KSCPYYLNFYFQTYRAKQDIGNITTGNTIKHILATDIKRFNVFFPPYEEQTKIGTY---- 179 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 ++D +I R + L KQA++ + + +++ +G E +V Sbjct: 180 FKKLDRIIELHQRKHDKLVTLKQAMLQKMFPQDGASTPEIRFNGFEGDWEKKKLRDVCNS 239 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY--ETYQIVDPGEIVFRF 295 F K I + ++ ++ G+I+F Sbjct: 240 FDYGLNAAAKKYDGRNKYIRITDIDEFSRCFSQTDLTSPEADLPSSQNYLLCEGDILFAR 299 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLR 354 K L A + + ++ + S + + Sbjct: 300 TGASVGKTYLYREIDGRVFFAGFLIRARVSNTESTDFIFYTTLSSNYENFVTITSQRSGQ 359 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + ++ LVP + EQ I + D L+ + + LK+ +S+ Sbjct: 360 PGINAKEYSEYTFLVPSVTEQKKIGTY----FRKFDALISQHATQLKKLKQIKSA 410 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 53/173 (30%), Gaps = 9/173 (5%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK--- 302 T N N ++ + E Y + + I + Sbjct: 39 GTATPYYTKNGYFYLQSNNVKNGKINRKTEIFIDEEFYFKQEKNWLRTNDIVMVQSGHVG 98 Query: 303 -RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFE 360 ++ ++ ++ YL + ++Y + + +G + + Sbjct: 99 HTAVIPNELNNSAAHALIIISKPLKKSCPYYLNFYFQTYRAKQDIGNITTGNTIKHILAT 158 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 D+KR V PP +EQ I ++D ++E ++ L + + + Sbjct: 159 DIKRFNVFFPPYEEQTKIGTY----FKKLDRIIELHQRKHDKLVTLKQAMLQK 207 >gi|29349931|ref|NP_813434.1| putative type I restriction enzyme EcoR124II protein [Bacteroides thetaiotaomicron VPI-5482] gi|253569700|ref|ZP_04847109.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides sp. 1_1_6] gi|29341842|gb|AAO79628.1| putative type I restriction enzyme EcoR124II protein [Bacteroides thetaiotaomicron VPI-5482] gi|251840081|gb|EES68163.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides sp. 1_1_6] Length = 394 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. Identities = 54/392 (13%), Positives = 118/392 (30%), Gaps = 22/392 (5%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSESGK----DIIYIGLEDVESGTGKYLP-KDGNSRQS 73 IP W I+ K+ +G T I + ++V + Y K + Sbjct: 10 EIPNSWVWTTIEEICSKIGSGSTPRGSNYSANGIPFFRSQNVYNDRLVYDDIKYISEEVH 69 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLS 130 + +L G L + + G S +++ V PE +LS Sbjct: 70 QKMKGTEVLANDLLLNITGGSLGRCAVVPADFNCGNVSQHVCIMRSVLVEPEYFHALVLS 129 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + ++ G+ + + P+PPL+EQ I +I ID + ++ Sbjct: 130 SYFAKSMK--ITGSGREGLPKYSLEQMAFPLPPLSEQQRIVMEIEKLFALIDQIEHSKVN 187 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++K+ K ++ + L P + IE + + + + Sbjct: 188 LQTIIKQTKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTQLPDGWTFC 247 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 ++ I G +SY T + + + + S + Sbjct: 248 RLDQII-----GYEQSTAYIVESTAYDDSYSTPVLTAGKSFIIGYTNEATGIYSNLPCII 302 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV------FYAMGSGLRQSLKFEDVKR 364 + S + S + + + + + Sbjct: 303 FDDFTTDSKLVDFPFKVKSSAMKILKVHKDIEVDYVAMFMSITKLVGDTHKRYWISEYSK 362 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 L + +P EQ I + I+ ++D+++E + Sbjct: 363 LEIPIPSKAEQKRIIHAIHGIFTQLDLIMESL 394 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 71/200 (35%), Gaps = 10/200 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKN----TKLIESNILSLSYGNIIQK-LETRNMGLKPESYE 281 +P+ W + +++ + + + I N+ L ++ E Sbjct: 10 EIPNSWVWTTIEEICSKIGSGSTPRGSNYSANGIPFFRSQNVYNDRLVYDDIKYISEEVH 69 Query: 282 TYQI---VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 V +++ + ++ A G ++ ++ ++ Y L+ Sbjct: 70 QKMKGTEVLANDLLLNITGGSLGRCAVVPAD-FNCGNVSQHVCIMRSVLVEPEYFHALVL 128 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 S K GSG R+ L ++++ +PP+ EQ I I A ID + Sbjct: 129 SSYFAKSMKITGSG-REGLPKYSLEQMAFPLPPLSEQQRIVMEIEKLFALIDQIEHSKVN 187 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 ++K+ +S + A+ G+ Sbjct: 188 LQTIIKQTKSKILDLAIHGK 207 >gi|219870942|ref|YP_002475317.1| type I restriction enzyme specificity protein HsdS [Haemophilus parasuis SH0165] gi|219691146|gb|ACL32369.1| type I restriction enzyme specificity protein HsdS [Haemophilus parasuis SH0165] Length = 408 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. Identities = 65/405 (16%), Positives = 130/405 (32%), Gaps = 34/405 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + T++ G+T I +D G + G + + Sbjct: 17 EFKSLGDVTEMKRGKT---------ITAKDASGGDIPVIS--GGQKPAYYHNEYNRNGKT 65 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I G Y + + S F + + +L L + + Q+I + +G+ Sbjct: 66 ITVAGSGAYAGFIMYWEEPIFVSDAFSIKSDETLLD-LKYVYHFLLQHQQKIYGMKKGSG 124 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + H K + + +PIPPL Q I + A T L E + +++ Q Sbjct: 125 VPHVYPKDLSTLVIPIPPLDVQQEIVRILDAFTSLTAELTAELTAELTSRQKQYQYFRDK 184 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF-------ALVTELNRKNTKLIESNILS 258 + LN D G E + W K ++ +E I Sbjct: 185 L----LNFDDISDRGGYETNPITKALWHNKKVVFKTLGEVTTISIGLTYTPAYVEKGIKF 240 Query: 259 LSYGNI-IQKLETRNMGLKPESYETYQ----IVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +S N L+ N+ E +I+F + + Sbjct: 241 ISAQNTSKDYLDLSNVKYISEEEFENSTDNAKPQRDDILFTRVGSNIGHPVIVETDEKLC 300 Query: 314 GIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPP 371 ++ ++ VK + + YL M + K G + +L +K + +PP Sbjct: 301 IFVSLGFLRVKDNNFLFNRYLKHWMSTDLFWKQVEKNVHGSAKINLNTGWLKDFKIPIPP 360 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +EQ I +++ + + E + + I L ++ R ++ Sbjct: 361 FEEQQRIVAILDKFETLTNSIAEGLPKEIELRRKQYEYYREKLLS 405 >gi|77166476|ref|YP_345001.1| restriction modification system DNA specificity subunit [Nitrosococcus oceani ATCC 19707] gi|254436234|ref|ZP_05049741.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] gi|76884790|gb|ABA59471.1| Restriction modification system DNA specificity domain [Nitrosococcus oceani ATCC 19707] gi|207089345|gb|EDZ66617.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] Length = 564 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. Identities = 59/481 (12%), Positives = 128/481 (26%), Gaps = 90/481 (18%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W+ V + +N +E ++ + + G + + + + Sbjct: 86 ELPEGWEWVRLGEIGVINPRNNAEDSIKAGFVPMPMIPEGYSEEHQFEERTWSDVKKGYT 145 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICST--------QFLVLQPKDVLPELLQGWLLSI 131 A + K+ P A F G+ + VLP L +L + Sbjct: 146 HLADSDVGMAKITPCFENAKSCVFSGLPNGLGAGTTELHIFRNTFNAVLPRFLLYYLKNP 205 Query: 132 DVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR---------- 180 + G+ P+P L+EQ I +I R Sbjct: 206 HYISKTVPYMTGSAGQKRVPTPYFTEQLFPLPSLSEQQRIVARIDQLMARCDELEKLRKE 265 Query: 181 --------------------------IDTLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 I +E E + E ++A++ V L P Sbjct: 266 REEVRLKVHAAAIKQLLDAPDAGWPFIQQHFSELYTVKENVAELRKAILQLAVIGRLVPQ 325 Query: 215 VKMKDSGIEWVGLV-------------------------------PDHWEVKPFFALVTE 243 E + + P WE ++ Sbjct: 326 DSNDPPACELLKEIEAEKQRLVDEKKIKKLKPLPPIKPEEVPYQLPRGWEWVRLQDVLDV 385 Query: 244 LNRKNTKLIE----SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG-----EIVFR 294 + + + ++ N + S + ++I +I+F Sbjct: 386 RDGTHDSPKDAVGSDTYPLITSKNFSNGRIDFSEARMISSEDHFEITKRSKVDRLDILFS 445 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 I + + + + + ++ M + G + Sbjct: 446 MIGGNIGNQVIVQEDREFSIKNVALFKYYDRNLTYPYFIKRFMEHIAA-DLQQKAVGGAQ 504 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + ++ + +PPI EQ+ I I+ A D L +Q I ++S+ + + Sbjct: 505 PFVSLGFLRNIVFGLPPINEQYHIVARIDELMALCDKL----DQQIEAASCKQSALLNSV 560 Query: 415 V 415 + Sbjct: 561 M 561 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 55/190 (28%), Gaps = 9/190 (4%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIES--NILSLSYGNIIQKLETRNMGLKPESY 280 E +P+ WE + R N + + + + Sbjct: 82 EVPYELPEGWEWVRLGEIGVINPRNNAEDSIKAGFVPMPMIPEGYSEEHQFEERTWSDVK 141 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHGIDSTYLAWL 336 + Y + ++ I + + G+ + + +L + Sbjct: 142 KGYTHLADSDVGMAKITPCFENAKSCVFSGLPNGLGAGTTELHIFRNTFNAVLPRFLLYY 201 Query: 337 MRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +++ Y GS ++ + +P + EQ I I+ AR D L E Sbjct: 202 LKNPHYISKTVPYMTGSAGQKRVPTPYFTEQLFPLPSLSEQQRIVARIDQLMARCDEL-E 260 Query: 395 KIEQSIVLLK 404 K+ + ++ Sbjct: 261 KLRKEREEVR 270 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 55/195 (28%), Gaps = 9/195 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKD--GNSRQ 72 +P+ W+ V ++ + G I ++ +G + + Sbjct: 369 QLPRGWEWVRLQDVLDVRDGTHDSPKDAVGSDTYPLITSKNFSNGRIDFSEARMISSEDH 428 Query: 73 SDTSTVSIFAKGQILYGKLGPYLR--KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + S + IL+ +G + + D + L L Sbjct: 429 FEITKRSKVDRLDILFSMIGGNIGNQVIVQEDREFSIKNVALFKYYDRNLTYPYFIKRFM 488 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + ++ G + NI +PP+ EQ I +I D L + Sbjct: 489 EHIAADLQQKAVGGAQPFVSLGFLRNIVFGLPPINEQYHIVARIDELMALCDKLDQQIEA 548 Query: 191 FIELLKEKKQALVSY 205 ++++ Sbjct: 549 ASCKQSALLNSVMAQ 563 >gi|297545263|ref|YP_003677565.1| restriction modification system DNA specificity domain-containing protein [Thermoanaerobacter mathranii subsp. mathranii str. A3] gi|296843038|gb|ADH61554.1| restriction modification system DNA specificity domain protein [Thermoanaerobacter mathranii subsp. mathranii str. A3] Length = 426 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. Identities = 69/435 (15%), Positives = 136/435 (31%), Gaps = 48/435 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDT 75 W+ V + K+ TG+T + G +I D+ +Y + + + Sbjct: 3 SEWRKVKLSEIGKIVTGKTPSTKNKENFGDKYPFITPRDMRGQKYIRYTERYLSDIGFNL 62 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 I +G + K ++ I + Q + P D + Sbjct: 63 LKSIAIPPNSICVTCIGS-MGKIAMSSKQSITNQQINSIIPNDEYD-PSFIYYCLKPKED 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++I G TM + NI + +PPL EQ I + A + I + L Sbjct: 121 YFKSISSGTTMPILNKTDFSNIEIEVPPLPEQQKIASILSAFDDK----IELNNEMNKTL 176 Query: 196 KEKKQALVSYIVTKGLNPD---VKMKDSGIE----WVGLVPDHWEVKPFFALVTELNRKN 248 +E Q + + P+ K SG E +GL+P W+VK LV + Sbjct: 177 EEIAQVIFKHWFIDFEFPNENGEPYKSSGGEFVDSELGLIPKGWKVKSIGELVDFTISGD 236 Query: 249 TKLIESNILSLSYGNIIQKLETRNMG----------LKPESYETYQIVDPGEIVFRFIDL 298 E + I+ + + S + + G+I+ Sbjct: 237 WGNDERSQDYDKKCFCIRGADFPPIVRGDKTNIPVRFLKRSSFEKRRLKHGDILIEVSGG 296 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS---YDLCKVFYAMGSGLRQ 355 + + R+ V I V + + ++ S + + Y G + Sbjct: 297 TKGRPTGRTVFVHRNLIKQFDESLVFSNFCRLIRVNDILNSIILFLYLQFIYNKGKMTQY 356 Query: 356 SLKFEDVKRL---------PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 ++ + + V PI+ Q N++ + K L + Sbjct: 357 EIQSTGISNFQLKYFFENEKLAVAPIEIQEKFINLVEPIFDK------KYTFENYYLSQL 410 Query: 407 RSSFIAAAVTGQIDL 421 R + + ++G+I + Sbjct: 411 RDTLLPKLISGEIRV 425 Score = 40.5 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 59/192 (30%), Gaps = 28/192 (14%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLED--- 55 YK SG ++ +G IPK WKV I N R+ + K I D Sbjct: 201 YKSSGGEFVDSELGLIPKGWKVKSIGELVDFTISGDWGNDERSQDYDKKCFCIRGADFPP 260 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY-----GKLGPYLRKAIIA-------DF 103 + G +P R S G IL K P R + D Sbjct: 261 IVRGDKTNIPVRFLKRSSFE--KRRLKHGDILIEVSGGTKGRPTGRTVFVHRNLIKQFDE 318 Query: 104 DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 + S +++ D+L ++ L + E + ++++ Sbjct: 319 SLVFSNFCRLIRVNDILNSIILFLYLQFIYNKGKMTQYEIQSTGISNFQLKYFFENEKLA 378 Query: 164 LAEQVLIREKII 175 +A + + I Sbjct: 379 VAPIEIQEKFIN 390 >gi|16799600|ref|NP_469868.1| hypothetical protein lin0525 [Listeria innocua Clip11262] gi|16412965|emb|CAC95757.1| lin0525 [Listeria innocua Clip11262] Length = 401 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 52/400 (13%), Positives = 124/400 (31%), Gaps = 32/400 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ ++ G+ E +D K++ +G ++ V G Sbjct: 18 WEQRKLRDIANYRNGKAHEQVEDED----GKYTIINSKFISTNGKVQRYTNEQVEPIFDG 73 Query: 85 QILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +I KA + D + + + P + + + + ++ + Sbjct: 74 EIAMVLSDLPNGKALAKLFLVKEDGKYTLNQRIAGITPNENIDPIFLNFRMNRN--NYFL 131 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G T ++ + N P EQ I I + Sbjct: 132 KFDSGVTQTNLSKSQVENFIALYPTFDEQYKIGLFFTQLDDTIALHQRKLDALK------ 185 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI----ES 254 L+ ++ + P+ K I + + WE + K+ ++ Sbjct: 186 ---LMKKAFSQQIFPENNRKKPKIRFTSFY-EEWEQRKIGEYGYFYYGKSAPKWSVAQDA 241 Query: 255 NILSLSYGNIIQKL--ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + YG + K E + ++ G V +N + + Sbjct: 242 TTPCVRYGELYTKFGPEIDIVHSYTNIDKSNLKFSSGNEVLVPRVGENPLDFANCSWLSI 301 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + ++V ++A+ RS + + G +L + ++ + + VP I Sbjct: 302 SNVAIGEMISVYNTEQYPLFIAYYFRSKMKYEFAKRVEGGNVSNLYYSYLEDILISVPSI 361 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +EQ I +N +ID+ + ++ + +KE + +++ Sbjct: 362 EEQKKIAEFLN----KIDITINLLQNKLGRIKELKKAYLQ 397 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 64/197 (32%), Gaps = 9/197 (4%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + + + L WE + + N K + +E + N K + N Sbjct: 1 MPLFYVFYHYFNLPFRAWEQRKLRDIANYRNGKAHEQVEDEDGKYTIIN--SKFISTNGK 58 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG--IITSAYMAVKPHGIDSTY 332 ++ + E + + GEI DL N K + V E G + + P+ + Sbjct: 59 VQRYTNEQVEPIFDGEIAMVLSDLPNGKALAKLFLVKEDGKYTLNQRIAGITPNE-NIDP 117 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + R + +L V+ L P EQ+ I ++D Sbjct: 118 IFLNFRMNRNNYFLKFDSGVTQTNLSKSQVENFIALYPTFDEQYKIGLF----FTQLDDT 173 Query: 393 VEKIEQSIVLLKERRSS 409 + ++ + LK + + Sbjct: 174 IALHQRKLDALKLMKKA 190 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 65/192 (33%), Gaps = 11/192 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + W+ I + G+++ + ++ + G + + D S Sbjct: 213 EEWEQRKIGEYGYFYYGKSAPKWSVAQDATTPCVRYGELYTKFGPEIDIVHSYTNIDKSN 272 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSIDVTQ 135 + + ++L ++G + I + ++ L + + Sbjct: 273 LKFSSGNEVLVPRVGENPLDFANCSWLSISNVAIGEMISVYNTEQYPLFIAYYFRSKMKY 332 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 EG +S+ + + +I + +P + EQ I E + +ID I + + Sbjct: 333 EFAKRVEGGNVSNLYYSYLEDILISVPSIEEQKKIAEFLN----KIDITINLLQNKLGRI 388 Query: 196 KEKKQALVSYIV 207 KE K+A + + Sbjct: 389 KELKKAYLQNMF 400 >gi|114563125|ref|YP_750638.1| restriction modification system DNA specificity subunit [Shewanella frigidimarina NCIMB 400] gi|114334418|gb|ABI71800.1| restriction modification system DNA specificity domain [Shewanella frigidimarina NCIMB 400] Length = 406 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 50/407 (12%), Positives = 128/407 (31%), Gaps = 54/407 (13%) Query: 26 KVVPIKRF-TKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + P+ K+++G T ++ DI ++ ++V D S+ Sbjct: 17 EWKPLDDISVKISSGGTPKTGVAEFYDGDIPWLRTQEVNFDEIWDTGVKITEAGVDNSSA 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 ++ G + K I + +Q + + + + I+ Sbjct: 77 KWIPANCVIVAMYGATVGKIGINKIPMTTNQACANIQLDGNIANYRYVFHFLLSQYEYIK 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRF 191 ++ G + ++ + + + +PIP LA Q I + A T L E I Sbjct: 137 SLGSG-SQTNINAGIVKKLVVPIPCPNNPEKSLAIQAEIVRILDAFTAMTAELTAELIMR 195 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNT 249 + + L+S ++ +EW +G + ++ + + Sbjct: 196 KKQYNYYRDQLLS------------FEEGEVEWKTLGDLAEN-----LDSKRKPITSGLR 238 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + E S K + S + ++ + + Sbjct: 239 EAGEIPYYGASGIVDYVKDYIFDGDYLLVSEDGANLLARN-------------TPIAFSI 285 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + A++ + Y+ + + S DL + L ++++ + + Sbjct: 286 SGKTWVNNHAHVLKFETYAERKYVEYYLNSIDLTPYI---SGAAQPKLNKKNLESINIPN 342 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 P KE+ I +++ + + E + + I L ++ R ++ Sbjct: 343 PAPKEKERIVAILDKFDSLTCSIKEGLPREIELRQKQYEYYRDLLLS 389 >gi|262375871|ref|ZP_06069102.1| predicted protein [Acinetobacter lwoffii SH145] gi|262308965|gb|EEY90097.1| predicted protein [Acinetobacter lwoffii SH145] Length = 391 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 60/397 (15%), Positives = 123/397 (30%), Gaps = 24/397 (6%) Query: 30 IKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAKG 84 + F + +G +S + I + D++ G+ + ++ Q S Sbjct: 8 LGDFASVISGYAFKSEWFGSGNDKVIRIGDLQDGSVQIESALTVDANQYKISNNFKIQNK 67 Query: 85 QILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 IL G + K I D + + +++ KD + + S + Sbjct: 68 DILMALSGATVGKIAIASETDIGAYINQRVAIIRAKDEITADYLKFFFSGVFLDDLLKNA 127 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GA + K + + +P PPL+EQ I + R + +++ Q Sbjct: 128 GGAAQPNLSPKQLLFMEIPFPPLSEQRRIASILDQADEL-------RQKRQHAIEKLDQL 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L + + +P K ++ VG + + ++ + + + + + + Sbjct: 181 LQTTFIDMFGDPVSNPKGWDLKTVGEISES----KLGKMLDKKKQSSENDQYKYLRNANV 236 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 L E + G+I+ ++ + Sbjct: 237 QWFRFDLSDVFEMEFNEKDRKNCELKFGDILVCEGGEPGRAAIWKNDLENCFFQKALHRV 296 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + I Y WL Y F + L +K + V +PP+ Q + Sbjct: 297 RLDTTQILPEYFVWLFWFYSKNGGFDDHITVATIAHLTGVKMKAMQVPIPPLSMQEEF-- 354 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + I+VL +E S L + SS A G Sbjct: 355 --QKKVNEIEVLKTTLENSSKLFESLFSSLQNQAFNG 389 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 63/200 (31%), Gaps = 14/200 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 PK W + + ++ G+ + K Y+ +V+ Sbjct: 196 PKGWDLKTVGEISESKLGKMLDKKKQSSENDQYKYLRNANVQWFRFDLSDVFEMEFNEKD 255 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICST----QFLVLQPKDVLPELLQGWLLSI 131 G IL + G R AI + C + L +LPE Sbjct: 256 RKNCELKFGDILVCEGGEPGRAAIWKNDLENCFFQKALHRVRLDTTQILPEYFVWLFWFY 315 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + AT++H + + +PIPPL+ Q ++K+ I+ L T Sbjct: 316 SKNGGFDDHITVATIAHLTGVKMKAMQVPIPPLSMQEEFQKKVNE----IEVLKTTLENS 371 Query: 192 IELLKEKKQALVSYIVTKGL 211 +L + +L + L Sbjct: 372 SKLFESLFSSLQNQAFNGTL 391 >gi|18765826|gb|AAL78776.1|AF326625_1 HP790-like protein [Helicobacter pylori] Length = 435 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 61/415 (14%), Positives = 123/415 (29%), Gaps = 29/415 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEIFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + QF L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCDIALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKNNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELKAR 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL--------VPDHWEVKPFFALVTE 243 + + + L+ + K + D K K + + P E K L Sbjct: 192 KKQYQYYQNMLLDFKDIKQSHKDAKEKLARKTYPKRLKALLQTLAPKGVEFKKIGELFKR 251 Query: 244 LNRKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 N + L G I + + I++ ++ + + Sbjct: 252 NKGINITAAQMKELHSDIGKVRIFAGGATKADINYKDISKKDIINCESVIIKSRGNIGFE 311 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFED 361 + S+ K + + +L + + + A S ++ L D Sbjct: 312 YYNQPFSHKNEIWSYSS----KTNQMLVKFLYYYLSNNQYYFQKLAQSSSVKLPQLSVSD 367 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 V VPP++ Q +I +++ + L+ I I K+ R + Sbjct: 368 TDEYEVPVPPLEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQYEYYREKLLT 422 >gi|308064186|gb|ADO06073.1| type I R-M system specificity subunit [Helicobacter pylori Sat464] Length = 377 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 59/408 (14%), Positives = 117/408 (28%), Gaps = 42/408 (10%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 2 LPLNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYK 59 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G R I +V E L Sbjct: 60 TKYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYS 116 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ I + + L ++ + Sbjct: 117 NVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQNAIANILSGLDRYLYALDALILKKEGVK 176 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K L+S ++K W + +++ + NTK + N Sbjct: 177 KALSFELLSQ--------RKRLKGFNQAWQRVRLGDIFFITAGGDLSKPHYSNTKQSDFN 228 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 S + L Y ++ I+ I + + Sbjct: 229 YPIYSNAIEKKGLC---------GYSSFFIIKNKSITITARGTIG-----VAFFRDYPYV 274 Query: 316 ITSAYMAVKPHGIDSTYLAW--LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + ++P + + + S KV + L V + +PP+ Sbjct: 275 PIGRLLVLQPKISNIDCRFYAEYINS----KVKFNTEQTTIPQLTIPKVALCEIPLPPLN 330 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ I N+++ I L K Q + + + ++ +I + Sbjct: 331 EQIAIANILSALDNEIASLKNKKRQ----FENIKKALNHDLMSAKIRV 374 >gi|289423479|ref|ZP_06425281.1| type I restriction-modification system specificity subunit [Peptostreptococcus anaerobius 653-L] gi|289156113|gb|EFD04776.1| type I restriction-modification system specificity subunit [Peptostreptococcus anaerobius 653-L] Length = 401 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 64/401 (15%), Positives = 128/401 (31%), Gaps = 33/401 (8%) Query: 23 KHWKVVPIKRFTK------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + W+ + + +T + G D+ + + ++ ++ + Sbjct: 21 EDWEQRKLGELGNVGMCKRIFKEQTFDEG-DVPFFKIGTFGGEADAFISRELF--EEYKK 77 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 KG IL G R D +V D + + Sbjct: 78 KYPYPEKGAILISASGTIGRTVEFTGRDEYFQDSNIVWLKHDSRLLDSFLKYVYECIKW- 136 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 EG+T+ I + +P + EQ I ID LIT R +E LK Sbjct: 137 --NGIEGSTIKRLYNNNILKTEIRLPEINEQKQISTF----FKFIDNLITLHQRKLEDLK 190 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 E K+ L+ + K +++ G + WE + L +N LI + Sbjct: 191 EMKKGLLQKMFPKNNEKVPELRFPG------FTEDWEQRKLGKLYQRNTERNENLIGYDK 244 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + G S TY+++ G+I F + + GI+ Sbjct: 245 TISVATMSYKDDGN---GASESSLSTYKVLRVGDIAFEGHTNKQFHFGRFVVNDIGTGIM 301 Query: 317 TSAYMAVKP-HGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQS-LKFEDVKRLPVLVPPI 372 + + ++P + + + + S + + + +G + L + ++VP Sbjct: 302 SPRFSTLRPLNEMPVNFWKQYIHSESVMRRILVNSTKAGTMMNELVIPEFLNQTIMVPSE 361 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I +D L+ ++ + LKE + + Sbjct: 362 NEQAVIGQY----FTNLDHLITLHQRKLNHLKELKKGLLQQ 398 >gi|313618465|gb|EFR90470.1| type I site-specific restriction-modification system, S [Listeria innocua FSL S4-378] Length = 422 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 57/423 (13%), Positives = 128/423 (30%), Gaps = 27/423 (6%) Query: 23 KHWKVVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 WK V ++ + + + +I +++ S + Sbjct: 4 SEWKEVALEEIVDVLGDGLHGTPKYDENGEYYFINGNNLDGNIIIDEKTKKVSYEEFLKY 63 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + IL G A + + ++ E ++ +LS Sbjct: 64 KKDLNERTILISINGTLGNVAFYNGEKVVLGKSACYFNVKENCSKEFIKYIMLSHAFKHY 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I G T+ + K + + +P + EQ I + +D I + + L+ Sbjct: 124 INTYSTGTTIKNMGLKQMRAFRLNLPEINEQKAIAHVL----STLDEKIEVNNQINKTLE 179 Query: 197 EKKQALVSYIVTKGLNPDV---KMKDSGIE----WVGLVPDHWEVKPFFALVTELNRKNT 249 QA+ P+ K SG E +G++P WEV K Sbjct: 180 NMAQAIFKQWFVDFEFPNEDGEPYKSSGGEMIASELGMIPKGWEVGNLAESKLTNLVKTG 239 Query: 250 KLIESNILSLSYGNIIQKLE----TRNMGLKPESYETYQIVDPGEIVFRFI--DLQNDKR 303 S+ + K T + + F + + + Sbjct: 240 IAEFSSEKIYLATADVDKSNILSNTTKVTYNERPSRANMQPKENTVWFAKMKDSRKLIRV 299 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 S S ++E I ++ + + + +++ + + Q++ ++ Sbjct: 300 SRGSKDLIENYIFSTGFAGINVKEGLNYIWSFICSNDFDIRKNNLCHGTTMQAINNSNIS 359 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +P+L+P +E I T + ++ L E R S + ++G+I + Sbjct: 360 NIPLLLP-KEEMIQI---FEGVTNYLYESEYLRKKENEKLAEIRXSLLPKLMSGEIRVPL 415 Query: 424 ESQ 426 + + Sbjct: 416 DEE 418 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 75/215 (34%), Gaps = 13/215 (6%) Query: 10 YKDSGVQW----IGAIPKHWKVVPI--KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + +G IPK W+V + + T L +E + IY+ DV+ Sbjct: 203 YKSSGGEMIASELGMIPKGWEVGNLAESKLTNLVKTGIAEFSSEKIYLATADVDKSNILS 262 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD------FDGICSTQFLVLQPK 117 + + + + + K+ + ++ + I ST F + K Sbjct: 263 NTTKVTYNERPSRANMQPKENTVWFAKMKDSRKLIRVSRGSKDLIENYIFSTGFAGINVK 322 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + L + ++ S D R +C G TM + I NIP+ +P + Sbjct: 323 EGLNYIW-SFICSNDFDIRKNNLCHGTTMQAINNSNISNIPLLLPKEEMIQIFEGVTNYL 381 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 E + E+ L+S + L+ Sbjct: 382 YESEYLRKKENEKLAEIRXSLLPKLMSGEIRVPLD 416 >gi|3057068|gb|AAC38351.1| HsdS subunit [Lactococcus lactis] Length = 425 Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats. Identities = 65/411 (15%), Positives = 147/411 (35%), Gaps = 30/411 (7%) Query: 23 KHWKVVPIKR-FTKLN--TGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 W+ GRT + + + +V++G L + Sbjct: 22 NDWEERKFFESIASTIDFRGRTPKKLGMDWSDSGYLALSALNVKNGYIDPLADAHYGDEK 81 Query: 74 DTST---VSIFAKGQILYGKLGPYLRKAIIADFDGIC-STQFLVLQPK--DVLPELLQGW 127 KGQ+L+ P A + D +G S + + + K + + L Sbjct: 82 LYRKWMSGRELKKGQVLFTTEAPMGNVAQVPDDNGYILSQRTVAFETKEDMMTNDFLAVL 141 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S V + A+ G T K + + + +P ++ +KI + +D I Sbjct: 142 LKSPLVFNNLSALSSGGTAKGVSQKSLKGLSITVPLDIDEQ---QKIGSFFKHLDDTIAL 198 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 R ++LLKE+K+ + + K +++ +G + ++ L K Sbjct: 199 HQRKLDLLKEQKKGYLQKMFPKNGAKVPELRFAG---FADDWEERKLGDIAPLRGGYAFK 255 Query: 248 NTKLIESNILSLSYGNIIQKLET--RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 ++K ++ + + NI+ E + + I+ V K S+ Sbjct: 256 SSKFRKTGVPIVRISNILSSGEVGGDFAYYDEQDKDDKYILPDKSAVLAMSGATTGKVSI 315 Query: 306 RSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVK 363 S ++ + ID +++ ++RS + + SG + ++ +++ Sbjct: 316 LSQTDYDKVYQNQRVGYFQSVDYIDYGFISTIVRSELFMMQLESVLVSGAQPNVSSKEID 375 Query: 364 RLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++P + +EQ I + ++D + ++ + LLKE++ F+ Sbjct: 376 SFNFMIPILVQEQQKIGSF----FKQLDDTIALHQRKLDLLKEQKKGFLQK 422 >gi|282883047|ref|ZP_06291648.1| N-6 DNA methylase [Peptoniphilus lacrimalis 315-B] gi|281297104|gb|EFA89599.1| N-6 DNA methylase [Peptoniphilus lacrimalis 315-B] Length = 412 Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats. Identities = 54/395 (13%), Positives = 125/395 (31%), Gaps = 24/395 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + + G T KDII + + G ++R+ Sbjct: 14 EWKKLGEVCEFQRGNTITK-KDIIEGVIPVIAGGQKPAYYHGISNREGV----------T 62 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I G Y + S F + K++ + + +I + +G+ Sbjct: 63 IAVAGSGAYAGFVSYWEEPIFLSDAFSIEPNKNLN--KRYLYHWLLSNQHKIFELKQGSG 120 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + H K +G +PIP L Q I E + T + L E + + + L+S Sbjct: 121 IPHVYGKDLGRFEIPIPSLETQEKIVETLDKFTNYVTELQAELQARNKQYEYYRDMLLSE 180 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN-- 263 ++ + + + + + +K I + G+ Sbjct: 181 EYLNKISMKMDALTNKDYELKMTTLGEIAQINRGASPRPIKKYITEDIKGIPWIKIGDVG 240 Query: 264 -IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + + E + +I+ G+ + L + G + ++ Sbjct: 241 VNSKYVTKTAQKITLEGAKKSRILKKGDFIMSNSMSYGRPYILGIDGAIHDGWAS---IS 297 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + +DS +L + + S + + + S +L E + LP+ V + Q + V Sbjct: 298 GFYNTLDSDFLYYYLTSSKVQNYWKGKINSSSVDNLNSEIICSLPIPVIDKELQQVVAKV 357 Query: 382 INVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ + +D + + I ++ R + Sbjct: 358 LDKFQSLLDDTEGLLPEEIEKRQKQYEYYREKLLT 392 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 44/131 (33%), Gaps = 2/131 (1%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 Y + E V + S E ++ A+ ++ YL + S Sbjct: 52 YHGISNREGVTIAVAGSGAYAGFVSYWE-EPIFLSDAFSIEPNKNLNKRYLYHWLLSNQ- 109 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K+F + +D+ R + +P ++ Q I ++ T + L +++ Sbjct: 110 HKIFELKQGSGIPHVYGKDLGRFEIPIPSLETQEKIVETLDKFTNYVTELQAELQARNKQ 169 Query: 403 LKERRSSFIAA 413 + R ++ Sbjct: 170 YEYYRDMLLSE 180 >gi|218703039|ref|YP_002410668.1| Type I restriction enzyme EcoAI specificity protein (S protein) (S.EcoAI) [Escherichia coli IAI39] gi|218373025|emb|CAR20914.1| Type I restriction enzyme EcoAI specificity protein (S protein) (S.EcoAI) [Escherichia coli IAI39] Length = 586 Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats. Identities = 65/489 (13%), Positives = 133/489 (27%), Gaps = 101/489 (20%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W++ + G T G DI + ++ + + Sbjct: 101 VPQGWELCYLNDIGDWGAGATPNRTNSGYYGGDIPWFKSGELSEDYITDSEEHITALALK 160 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ G +L G + K I + + P D L + Sbjct: 161 ECSLRDNQPGDVLIAMYGATIGKTSILNSRSTTNQAVCACTPFDGLSNQ-YLLIFLKASK 219 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK--------------------- 173 + A+ G + + I +PPL EQ+ I +K Sbjct: 220 KVFTAMGAGGAQPNISKEKIVATLFALPPLNEQLRIVKKVEQLMSLCDQLEQQSLTSLDA 279 Query: 174 --------------------IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 + RI + KQ ++ V L P Sbjct: 280 HQQLVETLLGTLTDSQNTEELAENWARISEHFDTLFTTEASVDALKQTILQLAVMGKLVP 339 Query: 214 DVK------------------------------------MKDSGIEWVGLVPDHWEVKPF 237 K + +P++W Sbjct: 340 QDPNDEPAENLFNRLCITRNLSLQNQLKNKEADIMLRKIKKTKPVTPPFKLPENWICTNL 399 Query: 238 ---FALVTELNRKNTKLIESNILSLSYGNIIQKLE-----TRNMGLKPESYETYQIVDPG 289 + + + K +++ I + NI + E + PG Sbjct: 400 IEICEYLVDCHNKTAPYVDAGIPIIRTTNIRNRNFQEQDLKFVNKETYEFWSRRCTPQPG 459 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+F + + G + + V I + ++ + L + Sbjct: 460 DIIFTREAPMGEALIIPPNVQWCLG-QRTMLIRVMHEFISNEFILLALTEPLLLERASKH 518 Query: 350 GSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G + L+ DV+ L + +PP+ EQ+ I + + + D K ++ I K+ + Sbjct: 519 AVGLTVKHLRVGDVETLNIPLPPLNEQYRIVAKVKILLSLCD----KAQKKIKSAKQ--T 572 Query: 409 SF-IAAAVT 416 +A A+T Sbjct: 573 QLHLADALT 581 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 61/198 (30%), Gaps = 9/198 (4%) Query: 20 AIPKHWKVVPIKRFTKL----NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P++W + + + I I ++ + + ++++ Sbjct: 389 KLPENWICTNLIEICEYLVDCHNKTAPYVDAGIPIIRTTNIRNRNFQEQDLKFVNKETYE 448 Query: 76 STVSIF--AKGQILYGKLGPYLRKAII-ADFDGICST--QFLVLQPKDVLPELLQGWLLS 130 G I++ + P II + + + + + E + L Sbjct: 449 FWSRRCTPQPGDIIFTREAPMGEALIIPPNVQWCLGQRTMLIRVMHEFISNEFILLALTE 508 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +R G T+ H + + +P+PPL EQ I K+ D + Sbjct: 509 PLLLERASKHAVGLTVKHLRVGDVETLNIPLPPLNEQYRIVAKVKILLSLCDKAQKKIKS 568 Query: 191 FIELLKEKKQALVSYIVT 208 + AL + + Sbjct: 569 AKQTQLHLADALTNAAIN 586 >gi|158522936|ref|YP_001530806.1| restriction modification system DNA specificity subunit [Desulfococcus oleovorans Hxd3] gi|158511762|gb|ABW68729.1| restriction modification system DNA specificity domain [Desulfococcus oleovorans Hxd3] Length = 434 Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats. Identities = 74/422 (17%), Positives = 137/422 (32%), Gaps = 35/422 (8%) Query: 29 PIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 K + +G T ++ + + + +++ + Sbjct: 7 KFKNLIEYKSGYTWSKEQENSKFVDGSVRVLTVTNIQEKLDLGSELYLTQVTKNDRERKA 66 Query: 81 FAKG-QILYGKLGP---YLRKAIIADF----DGICSTQFLVLQPKDVLPELLQGWLLSID 132 +KG I G I D T F+ P VLP+ WL S Sbjct: 67 ASKGWSIAVSSNGNRKRIGNAVFINDDTDYLFASFLTGFIPKDPDTVLPKYFFYWLSSHP 126 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +RI ++ EG T I + + ++ I ++D +I I Sbjct: 127 IQERITSVSEGTT--GLGNLDIRFLRNMDFEYPKNTSEQKAIAGILSKVDAVIEAVENSI 184 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGI----EWVGLVPDHWEVKPFFALVTELNRKN 248 + + K++L+ ++T L PD + E G VP WEVKP N Sbjct: 185 KAAERLKKSLMQNLLTGKLKPDGTWRSEDDFYMDEKFGKVPKGWEVKPVGGKSLCNINPN 244 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQ--NDKRS 304 + + I L + + Y G+I+F I N K + Sbjct: 245 YNFTKGEQYDFIPMDAINDDFRGLGYLVTKKVDGGGYTRFRIGDILFAKITPCTENGKVA 304 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM---GSGLRQSLKFED 361 L G ++ ++ +P + + S D G+ RQ + ++ Sbjct: 305 LIEKMNTTVGFASTEFIIFQPKETIDNQFYFYLLSSDRVHNLSVSLMEGTTGRQRVPWKI 364 Query: 362 VKR-LPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 K + +P + EQ +I + I+ L I LK + S + +TG++ Sbjct: 365 FKNRILAPIPIDLDEQRNIAKRL----KVIEKLNVCKYSKIQSLKNLKKSLMQNLLTGKV 420 Query: 420 DL 421 + Sbjct: 421 RV 422 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 72/205 (35%), Gaps = 14/205 (6%) Query: 16 QWIGAIPKHWKVVPI--KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 + G +PK W+V P+ K +N G+ +I ++ + +++ Sbjct: 219 EKFGKVPKGWEVKPVGGKSLCNINPNYNFTKGEQYDFIPMDAINDDFRGLG--YLVTKKV 276 Query: 74 DTSTVSIFAKGQILYGKLGPY--LRKAIIADFD----GICSTQFLVLQPKDVLPELLQGW 127 D + F G IL+ K+ P K + + G ST+F++ QPK+ + + Sbjct: 277 DGGGYTRFRIGDILFAKITPCTENGKVALIEKMNTTVGFASTEFIIFQPKETIDNQFYFY 336 Query: 128 LLSIDVTQRIEAI----CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 LLS D + G + L EQ I +++ Sbjct: 337 LLSSDRVHNLSVSLMEGTTGRQRVPWKIFKNRILAPIPIDLDEQRNIAKRLKVIEKLNVC 396 Query: 184 LITERIRFIELLKEKKQALVSYIVT 208 ++ L K Q L++ V Sbjct: 397 KYSKIQSLKNLKKSLMQNLLTGKVR 421 >gi|255038983|ref|YP_003089604.1| restriction modification system DNA specificity domain [Dyadobacter fermentans DSM 18053] gi|254951739|gb|ACT96439.1| restriction modification system DNA specificity domain [Dyadobacter fermentans DSM 18053] Length = 422 Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats. Identities = 53/410 (12%), Positives = 118/410 (28%), Gaps = 24/410 (5%) Query: 24 HWKVVPIKRFTKLNTG--RTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W + ++ G +T + + + ++ +ED++ K K + Sbjct: 18 DWNRFDLVDIFEIYDGTHQTPTYTSEGVNFVSVEDIK--DLKASRKYISEAAFRKDFKIK 75 Query: 81 FAKGQILYGKL--GPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 IL ++ G AI+ D + S L ++ + Q + Sbjct: 76 PKTNDILMTRITAGTIGDTAIVRDDEPLGIYVSLALLRIKIDGSVEFFNQNINSVYFRKE 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + I A + IG + I EQ I + A ++ L ++ + Sbjct: 136 LHKRIIHTAFPKKINLGDIGGCKISICSKKEQQKIASFLTAVDEKLQALKKKKSLLEQYK 195 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK---NTKLI 252 K Q + S + + D G V N Sbjct: 196 KGVMQKIFSQELRFKGDNGEAFPDWQKVKFGEVYTFKVTNSLSRDKLNYTEGEVRNIHYG 255 Query: 253 ESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + +I ++ + N + G++V + Sbjct: 256 DIHTKFNILFDIKKEPVPFVNDDVLLNRLSEDSYCKEGDLVIADASEDYNDIGKSIEIFN 315 Query: 312 ERG-----IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRL 365 G + + + + + +LMRS + + G + S+ + + Sbjct: 316 LDGEKVLAGLHTFLARPNRNTMSPGFGGYLMRSEKVKLQLMFIAQGTKVLSISTSRLSNI 375 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +P I EQ I + + + +D + I LL+ + + Sbjct: 376 EIDLPIIFEQKKIVDFL----SNLDSTIACCTNEIQLLEIWKKGLLQRLF 421 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 58/186 (31%), Gaps = 7/186 (3%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP-GEIVFRFIDLQ 299 + + + + +S +I +R + + ++I +I+ I Sbjct: 30 IYDGTHQTPTYTSEGVNFVSVEDIKDLKASRKYISEAAFRKDFKIKPKTNDILMTRITAG 89 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSL 357 + GI S + + + S K + + + + Sbjct: 90 TIGDTAIVRDDEPLGIYVSLALLRIKIDGSVEFFNQNINSVYFRKELHKRIIHTAFPKKI 149 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 D+ + + KEQ I + + ++ L ++ LL++ + + + Sbjct: 150 NLGDIGGCKISICSKKEQQKIASFLTAVDEKLQAL----KKKKSLLEQYKKGVMQKIFSQ 205 Query: 418 QIDLRG 423 ++ +G Sbjct: 206 ELRFKG 211 >gi|159901786|ref|YP_001548031.1| restriction modification system DNA specificity subunit [Herpetosiphon aurantiacus ATCC 23779] gi|159894825|gb|ABX07903.1| restriction modification system DNA specificity domain [Herpetosiphon aurantiacus ATCC 23779] Length = 418 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 72/414 (17%), Positives = 145/414 (35%), Gaps = 20/414 (4%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSD 74 I +P HW V +K K + + + Y GL+ + G + ++ + Sbjct: 11 IWDLPSHWGVKKLKLIAKEISQQIKPADNPSTVYNYWGLDAITKGQFQEPKQNLVKGSNI 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSID 132 ST F + QI+Y KL PYL K I+ GI +T+++V++P + + L S Sbjct: 71 ESTCVTFTENQIIYSKLRPYLNKVIVPSIPGIGTTEWIVVEPDANVVDRKYLAYVLRSPA 130 Query: 133 VTQRIEA--ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + GA M N P+P+P L+ + + VRI++L++E Sbjct: 131 FLRYVSRGENINGARMPRLRKDSFWNFPIPLPSLSNPARSLQIQQSIVVRIESLLSELGE 190 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 EL + + V+ ++ + +E + +RK+++ Sbjct: 191 IRELHRR-----IDLDVSNVMDSIFRDVYIDLENKYPSRQRIDSFTQVKTGGTPSRKHSE 245 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRS 307 +I + G + L + + + G ++ + Sbjct: 246 YYNGDIPWVKTGELKDGLIKKTEEYITLEAMQNSNAKKIPIGTLLVAMYGQGQTRGRTGL 305 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLP 366 + + P+ YL + + G + +L + +K L Sbjct: 306 LAIEATTNQACCAILPNPYIFIPRYLQFWFIFMYHDLRKKSDARGGNQANLNSQIIKELK 365 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL--KERRSSFIAAAVTGQ 418 +PPI Q + + ++ + + QSI L + S + A G+ Sbjct: 366 PPLPPIFVQQQVVSYLDAAYNELIDMQSI--QSINKLLFDQIEQSILEQAFRGE 417 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 61/193 (31%), Gaps = 10/193 (5%) Query: 29 PIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 I FT++ TG T DI ++ +++ G K + S Sbjct: 226 RIDSFTQVKTGGTPSRKHSEYYNGDIPWVKTGELKDGLIKKTEEYITLEAMQNSNAKKIP 285 Query: 83 KGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G +L G + + + + + P + I + + Sbjct: 286 IGTLLVAMYGQGQTRGRTGLLAIEATTNQACCAILPNPYIFIPRYLQFWFIFMYHDLRKK 345 Query: 141 C--EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G ++ + + I + P+PP+ Q + + A + + + + L + Sbjct: 346 SDARGGNQANLNSQIIKELKPPLPPIFVQQQVVSYLDAAYNELIDMQSIQSINKLLFDQI 405 Query: 199 KQALVSYIVTKGL 211 +Q+++ L Sbjct: 406 EQSILEQAFRGEL 418 >gi|300313842|ref|YP_003777934.1| Type I site-specific deoxyribonuclease specificity subunit [Herbaspirillum seropedicae SmR1] gi|300076627|gb|ADJ66026.1| Type I site-specific deoxyribonuclease specificity subunit protein [Herbaspirillum seropedicae SmR1] Length = 421 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 58/401 (14%), Positives = 116/401 (28%), Gaps = 21/401 (5%) Query: 24 HWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDVES---GTGKYLPKDGNSRQSDTS 76 W + + + + + + + DV S G + S Sbjct: 20 DWDERELGELFPITSAARVHKNEWTKSGVPFFRSSDVVSHFKGEANVKAFVSVELYEELS 79 Query: 77 TVS-IFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSID 132 KG IL G ++ + + + V L +L S Sbjct: 80 AKVGRIKKGDILITGGGSIGIPFLVKNDDPLYFKDADLLWFKIREAVDSHYLFTFLSSAP 139 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRIDTLITERIRF 191 Q +++I T++H + P+ +P E Q I E I + + Sbjct: 140 FRQYLKSISHIGTIAHYTVEQAKGTPVMLPRYPEEQTKIGEYFRELDSLIGLHQRKHDKL 199 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 L K Q + P+++ K+ +WV E ++ Sbjct: 200 AALKKAMLQKMFPQ--PGATTPEIRFKNFSGDWVEKTLAELCDLFTDGDWIESKDQSPSG 257 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYE--TYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 I N + + +++E + V G+I+ + + + Sbjct: 258 IRLLQTGNVGINEFIDKADKARWISIDTFERLKCEEVFAGDILISRLPEPAGRACIVPKL 317 Query: 310 VMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + + D +L V +G G RQ + + + V Sbjct: 318 LHRVITAVDCTIVRTAKNCDPAFLVQHCSLDSYFETVNDFLGGGTRQRISRSALGKFVVK 377 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 VP +EQ I +D L+ K + L++ +S+ Sbjct: 378 VPDFEEQKKIGTY----FRTLDELISKHASQLQKLQQIKSA 414 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 42/146 (28%), Gaps = 7/146 (4%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 ++ L E + G+I+ L +D Sbjct: 69 FVSVELYEELSAKVGRIKKGDILITGGGSIG-IPFLVKNDDPLYFKDADLLWFKIREAVD 127 Query: 330 STYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETA 387 S YL + S + ++ G E K PV++P +EQ I Sbjct: 128 SHYLFTFLSSAPFRQYLKSISHIGTIAHYTVEQAKGTPVMLPRYPEEQTKIGEY----FR 183 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ ++ L + + + Sbjct: 184 ELDSLIGLHQRKHDKLAALKKAMLQK 209 >gi|258540279|ref|YP_003174778.1| type I restriction-modification system specificity subunit [Lactobacillus rhamnosus Lc 705] gi|257151955|emb|CAR90927.1| Type I restriction-modification system specificity subunit [Lactobacillus rhamnosus Lc 705] Length = 391 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 58/402 (14%), Positives = 129/402 (32%), Gaps = 41/402 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + S I + + T + + +T ++ KG Sbjct: 11 WEKRKFGDLYSKTSEKNDGSFGPDKIISVATMSWKTNVRISSE-----DYLATYNVLRKG 65 Query: 85 QILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI--- 137 I + K + R DGI S F+V +PK + + + R Sbjct: 66 DIAFEGNKSKKFSFGRFVENDIGDGIVSHVFVVFRPKVSPIISYWKYFIHNEFVMRNILR 125 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 ++ + M++ + P EQ I + I + + L + Sbjct: 126 KSTIKATMMTNLSSHDFLRQTLCTPSFKEQENIGNFLERLDSLIAATQDKLEKLSILQRG 185 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q + W H V R N + IL Sbjct: 186 FLQHFFAQT-----------------WRFSGYSHVWENHRLGDVATRVRGNDGRMNLPIL 228 Query: 258 SLSYG-NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGI 315 ++S G + + + + + + Y ++ GE+ + + + + + + + + Sbjct: 229 TISAGKGWLTQEQRFSQNIAGNELKKYTLLSKGELSYNHGNSKLAEYGAVFVLKQFKEAL 288 Query: 316 ITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYA-MGSGLRQ----SLKFEDVKRLPVLV 369 + Y + G D ++ +L S + SG R ++ ++ + VL+ Sbjct: 289 VPRVYHSFNVSGKADPDFIEYLFESGVPNHELRKLISSGARMDGLLNINYDSFMNISVLL 348 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 P I+EQ I V+ ++ L ++ + L++ + S + Sbjct: 349 PSIEEQNKIARVLE----KLKKLTDETRLRLFNLQQAKKSLL 386 >gi|319777422|ref|YP_004137073.1| hypothetical protein MfeM64YM_0698 [Mycoplasma fermentans M64] gi|318038497|gb|ADV34696.1| Hypothetical Protein MfeM64YM_0698 [Mycoplasma fermentans M64] Length = 407 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 47/391 (12%), Positives = 117/391 (29%), Gaps = 22/391 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 P ++ V + + + G + S + +I + D+E G + + Sbjct: 13 PDGYEWVTLGEISSIRRGASPRPISSFLSKEGYPWIKIGDIEEGKIYLKKTKQFINEKGS 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-T 134 + KG ++ + + I I L+ + + + + Sbjct: 73 KKSVVVDKGDLILSNSMSFGKPVIADIKGCIHDGWLLIANFEKNVTSKFLYYWFLSNYSQ 132 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 T+S+ + + + + +P+ PL Q I E + RI + EL Sbjct: 133 SFFLQQSSPGTISNLNSEILKKLKIPLIPLKIQEKIVEILERF--RILEAELKAELKAEL 190 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 KQ L K ++ + + + K F + K + Sbjct: 191 EARGKQ------FDFTLTKIFNFKQYKLKKLWEI--TFWDKNFQEVEKFKQSKTSNFKYL 242 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + N + K E+ + +I + L + Sbjct: 243 FYKEIENYNDPKGDVKIITTGKEENLKINSKNYKKDIYSGEVLLIPGGGEANIKYHKGKF 302 Query: 315 IITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + + + + + + DL + + G + +++ L + +PP Sbjct: 303 VTGDNRIGQVLNKNEVATKFLYYYFLLNLDLIRKNFR--GGSIKHPFMKNILELNIPIPP 360 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++ Q I ++++ + + + I L Sbjct: 361 LETQNKIVSILDKLSEYSQEINLGLPAEIEL 391 >gi|300861378|ref|ZP_07107464.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TUSoD Ef11] gi|300849170|gb|EFK76921.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TUSoD Ef11] Length = 412 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 57/408 (13%), Positives = 130/408 (31%), Gaps = 31/408 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIG---LEDVESGTGKYLPKDGNSRQSDTSTVS 79 + W+ ++ TG T ++ +D + + + + + + Sbjct: 14 EDWEHRKVEELGDTFTGLTGKTKEDFGHGDATFVTYINVFSNPITDLKMTESVEIDAKQN 73 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD------GICSTQFLVLQPKDVL-PELLQGWLLSID 132 G I + + ++ ++ +P L P + L S + Sbjct: 74 QVEYGDIFFTTSSETPEEVGMSSVWLGNEANVYLNSFCFGYRPVTELAPYYMAFMLRSPN 133 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 V ++ + +G + + + +I +P+P + EQ + + I + + Sbjct: 134 VRKKFIFLAQGISRYNISKNRVMDIEIPVPNIDEQRKVGQFFKDIDDLITLHQRKLEQLK 193 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKN 248 EL K Q + P ++ D EW +G + H TE + Sbjct: 194 ELKKTYLQVMFPR--KDERVPKLRFADFEGEWAQRKLGEISTHRSGTAIERYFTEDGK-- 249 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS- 307 ++S+ K + + ++V GE+ D +D + Sbjct: 250 -----YKVISIGSYGTDSKYVDQGIRAISNEITNARVVHKGELTMVLNDKTSDGAIIGRS 304 Query: 308 --AQVMERGIITSAYMAVKPHGIDSTYLAWL-MRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + E +I + P + A+ + + KV + G + + + VK Sbjct: 305 LLIESEEEYVINQRTEIISPKDDFNVNFAYTTLNNTFRQKVKKIVQGGTQIYVNYPAVKN 364 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 L + P KEQ I + D + + + LK + +++ Sbjct: 365 LMLDFPSYKEQTKIGTF----FKQFDDTITLHQNKLDQLKTLKKTYLQ 408 >gi|170717882|ref|YP_001784937.1| restriction modification system DNA specificity subunit [Haemophilus somnus 2336] gi|168826011|gb|ACA31382.1| restriction modification system DNA specificity domain [Haemophilus somnus 2336] Length = 410 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 60/407 (14%), Positives = 124/407 (30%), Gaps = 37/407 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDI------IYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + + YI D+ G L Sbjct: 20 WEQRKLGDLANSIKSYPLSRNVETEEKTKTKYIHYGDIHRGIANILNDISVLPNITGEYS 79 Query: 79 SIFAKGQILYGKLGPYLRKA-------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + + G ++ I + + + + ++P L L S Sbjct: 80 ELLSFGDLVVADASEDYYGVAAPCVINCIYEQNIVAGLHTIAIRPYKSHHLFLYYLLHSS 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + G + K + P EQ I +D IT R Sbjct: 140 GFKEYCKKVGTGTKVFAITSKNLLGFESFFPHYEEQQKIGAF----FTALDRYITIHQRK 195 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +E +++ K++L+ + K +++ WE + + ++ K Sbjct: 196 LENIQKLKKSLLQKMFPKNDQEFPEIRFP------EFTYAWEQRKAKEIFISVSEKGFPH 249 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + S +G I + ++ +S +TY+ V PG+ V Q A Sbjct: 250 LPVLSASQEFGMIRRDDIGIDIKYDQKSTQTYKRVSPGQFVIHLRSFQG-----GFAWSD 304 Query: 312 ERGIITSAYMAVKPH---GIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLP 366 GI + AY + S + + S K + G+R +S+ F D L Sbjct: 305 IEGITSPAYTIIDFKKKENHSSNFWKLIFTSSSFIKKLETVTYGIRDGRSISFSDFSDLR 364 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + I+EQ I +D + ++ + +++ + S + Sbjct: 365 LFYSQIQEQQKIGAF----FTALDRYITIHQRKLENMQKLKKSLLQQ 407 >gi|332204532|gb|EGJ18597.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA47901] Length = 516 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 73/435 (16%), Positives = 144/435 (33%), Gaps = 62/435 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 IP W+ V IK E I D + Y + + Q+ + Sbjct: 83 DIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRAR 142 Query: 79 SIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ +L+ + PYL+ + I ST F+VL L +LLS + Sbjct: 143 KLVSQNSVLFSTVRPYLKNIAVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFIN 201 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ G + + + + +PPL+EQ I E I + ++D R +L Sbjct: 202 RVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLD 261 Query: 196 KEKK----QALVSYIVTKGLNPDVKMKDS------------------------------- 220 KE ++++ Y + L +S Sbjct: 262 KEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIV 321 Query: 221 --------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 E +P+ WE + + + R + + + + + Sbjct: 322 SQGDDSSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFS 381 Query: 273 MGL-------KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----Y 320 + L SY+ +++ G++++ L R ++ G + Sbjct: 382 IDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTV 441 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDI 378 + V I+ ++ + S + V SG ++ L + +K + +PP+ EQ I Sbjct: 442 IRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRI 501 Query: 379 TNVINVETARIDVLV 393 + I A ID L+ Sbjct: 502 VDKIEQFFAHIDALI 516 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 72/206 (34%), Gaps = 10/206 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE ++ + + I + S + +N+ Sbjct: 77 EIDVPYDIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQ 136 Query: 281 ---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++V ++F + ++ ++ +I S V ++ TYL + + Sbjct: 137 APSRARKLVSQNSVLFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYL 194 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + +G ++ + L + +PP+ EQ I I ++D E Sbjct: 195 LSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESY 254 Query: 397 EQSIVLLKE----RRSSFIAAAVTGQ 418 + L KE + S + A+ G+ Sbjct: 255 NRLEQLDKEFPDKLKKSILQYAMQGK 280 >gi|240949255|ref|ZP_04753599.1| Type I restriction-modification system, S subunit/Type I restriction modification DNA specificity [Actinobacillus minor NM305] gi|240296371|gb|EER47015.1| Type I restriction-modification system, S subunit/Type I restriction modification DNA specificity [Actinobacillus minor NM305] Length = 446 Score = 107 bits (266), Expect = 5e-21, Method: Composition-based stats. Identities = 65/453 (14%), Positives = 142/453 (31%), Gaps = 67/453 (14%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPK---DGNSRQS 73 W++ + + G T +S DI +I +D+ +Y+ K + Sbjct: 2 SSWELKKLSEVADIIGGATPKSDVDEYFNGDIPWITPKDLSGYKNRYISKGERNITKLGL 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + S+ + KG +L+ P IAD + + F L KD +LL ++ Sbjct: 62 ENSSAKLLPKGAVLFTSRAPI-GYVAIADNEVSTNQGFKSLVLKDGNIPEFFYYLLKHNI 120 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 EA G+T + + N + IP + Q I + + +I+ + Sbjct: 121 -PLFEARATGSTFKEVSGQVVKNTELLIPSIDIQKKIVDLVSPLDEKIELNTQINQTLEQ 179 Query: 194 LLKEKKQALVSYI--------------------------------------------VTK 209 + + ++ + Sbjct: 180 IAQTIFKSWFIDFDPVHAKANALASGQTTEQATQAAMAVISGKNTQELHRLQTANPEQYQ 239 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 L + SG + G VP W + + ++ K N S + E Sbjct: 240 QLWEIAEAFPSGFDEEG-VPRGWGLSTIDENYNVVMGQSPKGETYNEESNGALFYQGRAE 298 Query: 270 TRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 +P Y T ++ G I+ D +E I A+ Sbjct: 299 FGWRYPEPRLYTTDPKRMAKKGNILMSVRAPVGDL-----NVALEDCCIGRGLAALSHKS 353 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 ++ + +++ + + S+ +D+K + V+ P I + + + Sbjct: 354 NSLSFGLYQIKNLQNEFDIFNGEGTVFGSINQKDLKAIKVINPSF----KIIKLFDDVCS 409 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ +E + + I+ L++ R + ++G+ Sbjct: 410 SNELQIENLSREIIFLRKIRDELLPKLLSGEKK 442 >gi|221231344|ref|YP_002510496.1| type I restriction-modification system S protein [Streptococcus pneumoniae ATCC 700669] gi|220673804|emb|CAR68306.1| type I restriction-modification system S protein [Streptococcus pneumoniae ATCC 700669] Length = 516 Score = 107 bits (266), Expect = 5e-21, Method: Composition-based stats. Identities = 73/435 (16%), Positives = 144/435 (33%), Gaps = 62/435 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 IP W+ V IK E I D + Y + + Q+ + Sbjct: 83 DIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRAR 142 Query: 79 SIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ +L+ + PYL+ + I ST F+VL L +LLS + Sbjct: 143 KLVSQNSVLFSTVRPYLKNIAVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFIN 201 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ G + + + + +PPL+EQ I E I + ++D R +L Sbjct: 202 RVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLD 261 Query: 196 KEKK----QALVSYIVTKGLNPDVKMKDS------------------------------- 220 KE ++++ Y + L +S Sbjct: 262 KEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIV 321 Query: 221 --------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 E +P+ WE + + + R + + + + + Sbjct: 322 SQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFS 381 Query: 273 MGL-------KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----Y 320 + L SY+ +++ G++++ L R ++ G + Sbjct: 382 IDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTV 441 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDI 378 + V I+ ++ + S + V SG ++ L + +K + +PP+ EQ I Sbjct: 442 IRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRI 501 Query: 379 TNVINVETARIDVLV 393 + I A ID L+ Sbjct: 502 VDKIEQFFAHIDALI 516 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 72/206 (34%), Gaps = 10/206 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE ++ + + I + S + +N+ Sbjct: 77 EIDVPYDIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQ 136 Query: 281 ---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++V ++F + ++ ++ +I S V ++ TYL + + Sbjct: 137 APSRARKLVSQNSVLFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYL 194 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + +G ++ + L + +PP+ EQ I I ++D E Sbjct: 195 LSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESY 254 Query: 397 EQSIVLLKE----RRSSFIAAAVTGQ 418 + L KE + S + A+ G+ Sbjct: 255 NRLEQLDKEFPDKLKKSILQYAMQGK 280 >gi|310780627|ref|YP_003968958.1| restriction modification system DNA specificity domain protein [Ilyobacter polytropus DSM 2926] gi|309749950|gb|ADO84610.1| restriction modification system DNA specificity domain protein [Ilyobacter polytropus DSM 2926] Length = 392 Score = 107 bits (266), Expect = 5e-21, Method: Composition-based stats. Identities = 58/392 (14%), Positives = 128/392 (32%), Gaps = 23/392 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + +K L +G+ + +G + K + + K Sbjct: 19 EWQNIKLKDSYTLISGQHLGPDEYSQEENKTPYFTGPSDFTNKTDEISKWSLVNGKLAQK 78 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 +L+ G + + + + + + L + + + + E + G Sbjct: 79 HDVLFTVKGSGVGSLMYLNLESVMIGRQL-MAIRSRISSTKLLSHFLPKKREYFEKLASG 137 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I ++ + +P EQ I + + +I+ L +R E + Q + Sbjct: 138 NMIPGLSREDILSLNLSLPTSPEQQKIASFLTSVDSKIEKLEKKRELMAEYKRGVMQKIF 197 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + E P+ W P LV K L + I Sbjct: 198 SQEIRFKG-----------EDGKEYPE-WVELPLGDLVIISKEKYNPLRDKEIYKCIELE 245 Query: 264 IIQKLETRNMGLKPESY--ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + + +G S + G+I++ + K G+ +S + Sbjct: 246 NLSQETGKLLGYFNSSQQQSIKNKFNKGDILYGKLRPYLKKYYKADFD----GVCSSEIL 301 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +K +D+ +L L++++ + +E +K + P I EQ I N Sbjct: 302 VLKGKKLDNNFLYQLIKTFKFNSIANVSSGSKMPRADWEYMKEILFKYPSILEQQKIANF 361 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ ID +E +EQ +KE + + Sbjct: 362 LSG----IDKKIELVEQETEQVKEFKRGLLQQ 389 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 75/210 (35%), Gaps = 16/210 (7%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K E+ G W+ T ++ ++ E + + N + Sbjct: 10 KLRFPEFNGE----WQNIKLKDSYTLISGQHLGPDEYSQEENKTPYFTGPSDFTNKTDEI 65 Query: 278 ESYE--TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + ++ +++F ++ +E +I MA++ + L+ Sbjct: 66 SKWSLVNGKLAQKHDVLFT---VKGSGVGSLMYLNLESVMIGRQLMAIRSRISSTKLLSH 122 Query: 336 LMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + F + SG + L ED+ L + +P EQ I + + ++ +E Sbjct: 123 FL--PKKREYFEKLASGNMIPGLSREDILSLNLSLPTSPEQQKIASFLTSVDSK----IE 176 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 K+E+ L+ E + + + +I +GE Sbjct: 177 KLEKKRELMAEYKRGVMQKIFSQEIRFKGE 206 >gi|300815957|ref|ZP_07096180.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 107-1] gi|300531164|gb|EFK52226.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 107-1] Length = 583 Score = 107 bits (266), Expect = 5e-21, Method: Composition-based stats. Identities = 70/510 (13%), Positives = 144/510 (28%), Gaps = 107/510 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGL 53 +K K P+ S + +P+ W+ V + ++ G T +S I +I Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVTLATVGEIVGGGTPKSDNPQFWAKNGIKWITP 140 Query: 54 EDVESGTGKYLP---KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ 110 D+ GKY+ +D + S+ + KG +L+ P IAD + + Sbjct: 141 ADLYGLKGKYITSGARDISPAGLSNSSARLMPKGSVLFSSRAPI-GYVAIADAELSTNQG 199 Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ--- 167 F P + + ++I+A G T + I +P+PPL+EQ Sbjct: 200 FKSCVPYIKE-SAEYIYYFLLASAKKIDAEASGTTFKEVSGAIVSKILLPLPPLSEQLKI 258 Query: 168 --------------------------------------VLIREKIIAETVRIDTLITERI 189 E++ RI Sbjct: 259 VSRANELMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNAEELAENWTRISEHFDTLF 318 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV------------------------ 225 + KQ ++ V L P + E + Sbjct: 319 TTEASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKSLPP 378 Query: 226 -------GLVPDHWEVKPFFALVTELNRKNTK--------LIESNILSLSYGNIIQKLET 270 +P+ WE + ++ + I + G++ + Sbjct: 379 ISDEEKPFELPEGWEWSYLSDIGILARGRSKHRPRNDPTLYADGTIPLVQTGDVARSNGC 438 Query: 271 RNMG---LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 N ++ + G + D L + + P+ Sbjct: 439 INTYSALYNQLGLSQSKLWNKGTLCITIAANIADSGIL-----NFDACFPDSVVGFTPYE 493 Query: 328 IDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + L + + S ++++ + + +L PP++E I + + Sbjct: 494 NEIPVLYFHYFMMTIKSTLEKFAPSTAQKNINIDILSQLFFPCPPLEEFHRIVDKVQNLL 553 Query: 387 ARIDVL---VEKIEQ-SIVLLKERRSSFIA 412 + DVL ++ +Q + L + I Sbjct: 554 SVCDVLRAYIQSAQQTQLHLADALTDAAIN 583 >gi|317009143|gb|ADU79723.1| putative type I restriction enzyme (specificity subunit) [Helicobacter pylori India7] Length = 460 Score = 107 bits (266), Expect = 5e-21, Method: Composition-based stats. Identities = 59/436 (13%), Positives = 138/436 (31%), Gaps = 45/436 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK + + + G +S K + Y+ +V + L + + D Sbjct: 13 PKGVEFRKLGDIGEFYGGLVGKSKKSFSQGNKFYVPYINVFNNPQLDLNALESVQIGDKE 72 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD----------FDGICSTQFLVLQPKDVLPELLQG 126 + G +L+ L ++ + F P L+ Sbjct: 73 KQNTIQLGDVLFTGSSENLDDCAMSCVVTQKIEKDIYLNSFCFGFRFFDKNLFNPSFLKH 132 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L + + I + G T + + + I +PIPPL Q I + + A T L T Sbjct: 133 FLRDYNFRKNISKVANGVTRFNVSKQLLSKITIPIPPLEVQQEIVKILDAFTELNTELNT 192 Query: 187 ERIRFIELLKEKKQALVSYIVTKG------------LNPDVKMKDSGIEWVGLVPDHWEV 234 E + K++ Q + ++ L K L P E Sbjct: 193 ELNTELNARKKQYQYYQNMLLDFKGIHQNHKDAKEKLAQKTYPKRLKALLQTLAPKGVEF 252 Query: 235 KPFFALVTELNR-------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 + + ++ + + ++ N Q ++ E + Sbjct: 253 RKLGDIGEFYGGLVGKSKKSFSQGNKFYVPYINVFNNPQLDLNALESVQIGDKEKQNTIQ 312 Query: 288 PGEIVFRFID------LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 G+++F + + + + + + + + ++L +R Y+ Sbjct: 313 LGDVLFTGSSENLDDCAMSCVVTQKIEKDIYLNSFCFGFRFFDKNLFNPSFLKHFLRDYN 372 Query: 342 LCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 K + +G R ++ + + ++ + +PP++ Q +I +++ + L+ I I Sbjct: 373 FRKNISKVANGVTRFNVSKQLLSKITIPIPPLEVQQEIVKILDQFSLLTTDLLAGIPAEI 432 Query: 401 VLLKE----RRSSFIA 412 K+ R + Sbjct: 433 KARKKQYEYYREKLLT 448 >gi|332076345|gb|EGI86808.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA41301] Length = 516 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 73/435 (16%), Positives = 142/435 (32%), Gaps = 62/435 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 IP W+ V IK E I D + Y + + Q+ + Sbjct: 83 DIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRAR 142 Query: 79 SIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ +L+ + PYL+ + I ST F+VL L +LLS + Sbjct: 143 KLVSQNSVLFSTVRPYLKNIAVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFIN 201 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ G + + + + +PPL+EQ I E I + ++D R +L Sbjct: 202 RVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLD 261 Query: 196 KEKK----QALVSYIVTKGLNPDVKMKDS------------------------------- 220 KE ++++ Y + L +S Sbjct: 262 KEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIV 321 Query: 221 --------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 E +P+ WE + + + R + + + + + Sbjct: 322 SQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFS 381 Query: 273 MGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----Y 320 + L SY+ +++ G++++ L R + A Sbjct: 382 IDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTV 441 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDI 378 + V I+ ++ + S + V SG ++ L + +K + +PP+ EQ I Sbjct: 442 IRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRI 501 Query: 379 TNVINVETARIDVLV 393 + I A ID L+ Sbjct: 502 VDKIEQFFAHIDALI 516 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 72/206 (34%), Gaps = 10/206 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE ++ + + I + S + +N+ Sbjct: 77 EIDVPYDIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQ 136 Query: 281 ---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++V ++F + ++ ++ +I S V ++ TYL + + Sbjct: 137 APSRARKLVSQNSVLFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYL 194 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + +G ++ + L + +PP+ EQ I I ++D E Sbjct: 195 LSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESY 254 Query: 397 EQSIVLLKE----RRSSFIAAAVTGQ 418 + L KE + S + A+ G+ Sbjct: 255 NRLEQLDKEFPDKLKKSILQYAMQGK 280 >gi|312902061|ref|ZP_07761322.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0470] gi|311290843|gb|EFQ69399.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0470] Length = 407 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 64/401 (15%), Positives = 143/401 (35%), Gaps = 26/401 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W++ +K T+ G ++ D+ + + + + GN + ++ Sbjct: 18 EDWELCKLKEITERVKG--NDGRMDLPTLTISAGQGWLNQKDRFSGNIAGKEQKNYTLLL 75 Query: 83 KGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 K ++ Y KL Y ++ ++ + + + E Sbjct: 76 KNELSYNHGNSKLAKYGAVFLLKTYEEALVPRVYHSFKSTKNSDPDFLEYIFATKKPDKE 135 Query: 139 ------AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + ++ NI + IP + EQ I + +ID IT R + Sbjct: 136 LGKLVSSGARMDGLLNINYDDFSNIKINIPHVHEQKKISNLL----RKIDNTITLHQRKL 191 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E LKE K+A + + K++ + E E+ F+ T K+ Sbjct: 192 EQLKELKKAYLQVMFPAKDERVPKVRFAAFEGEWAHRKLGEITESFSGGTPTAGKSEYY- 250 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 +I + G I + + + ++V G+I++ + + + Sbjct: 251 GGDIPFIRSGEISSDSTELFITENGLNSSSAKMVKVGDILYALYGATSGEVGISKI---- 306 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-P 371 G I A +A++P D++YL + G + +L VK L +++P Sbjct: 307 TGAINQAILAIRPSKNDNSYLIIQWLRKQKNTIISTYLQGGQGNLSSSIVKNLIIMLPQN 366 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +EQ + R+D ++ + + LK+ ++S++ Sbjct: 367 KEEQEKVGIF----FKRLDDIITLHQNKLEQLKDLKTSYLQ 403 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 57/186 (30%), Gaps = 9/186 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + T+ +G T +GK DI +I ++ S + + ++S+ Sbjct: 224 EWAHRKLGEITESFSGGTPTAGKSEYYGGDIPFIRSGEISSDSTELF---ITENGLNSSS 280 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G ILY G + I+ G + L ++P L L I Sbjct: 281 AKMVKVGDILYALYGATSGEVGISKITGAINQAILAIRPSKNDNSYLIIQWLRKQKNTII 340 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + I M EQ + I + + +L Sbjct: 341 STYLQGGQGNLSSSIVKNLIIMLPQNKEEQEKVGIFFKRLDDIITLHQNKLEQLKDLKTS 400 Query: 198 KKQALV 203 Q + Sbjct: 401 YLQNMF 406 >gi|329913607|ref|ZP_08275981.1| Type I restriction-modification system, specificity subunit S [Oxalobacteraceae bacterium IMCC9480] gi|327545304|gb|EGF30548.1| Type I restriction-modification system, specificity subunit S [Oxalobacteraceae bacterium IMCC9480] Length = 397 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 58/405 (14%), Positives = 130/405 (32%), Gaps = 26/405 (6%) Query: 27 VVPIKRFTK-LNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +V +K + G T S + ++ +++V G+ + ++ + Sbjct: 5 LVKLKDLCSLITKGTTPTSIGLDFADDGVGFLRVQNVSGGSVNFQNGTLFIAENVHQELR 64 Query: 80 I--FAKGQILYGKLGPYLRKAIIADFDGI--CSTQFLVLQPKDVLPELLQ-GWLLSIDVT 134 G IL G R ++ + C+ +++P+ + WL S D Sbjct: 65 RSQILAGDILLSIAGTIGRIGVVPENAPALNCNQALAIIRPEARVFRPFLRHWLESADAQ 124 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ T+ + +G + + +P L EQ I + + L Sbjct: 125 FQMRGATVTGTIQNLSLAQVGRLELSLPLLPEQRRIAAILDQADALRAKRREALAQLDSL 184 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 Q++ + +P K + + + + + K ++ Sbjct: 185 T----QSIFIEMFG---DPVTNSKALPTKKLSEITTFENGDRSGNYPSGDDIKIAGILFL 237 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + +++ + + + + + + V +++ + A Sbjct: 238 STKNITN-DRLDLTKRVYISKEKFDSLSRGKVLRNDLIITLRGTLGS-CCIFDAIEETAF 295 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 I + G S YL L+ S + F +G G L + LP+ VP + Sbjct: 296 INAQMMIIRPQSGCSSEYLHALLTSQQAQERFDHIGRGAAVPQLTSAQLASLPIPVPSEE 355 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +Q + V +D L K EQ + L +S A +G+ Sbjct: 356 KQREFA----VRKRTLDELKAKEEQGMAELDTLFASLQHRAFSGE 396 >gi|315641379|ref|ZP_07896454.1| type-I specificity determinant subunit [Enterococcus italicus DSM 15952] gi|315482872|gb|EFU73393.1| type-I specificity determinant subunit [Enterococcus italicus DSM 15952] Length = 410 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 50/405 (12%), Positives = 127/405 (31%), Gaps = 29/405 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + + G + I + + + K ++ Sbjct: 17 EWEERKLGELASFSKGNGYTKNDLVEFGDPIILYGRLYTKYETVIEKVDTFVNKKDNS-- 74 Query: 80 IFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI-DV 133 I ++G + R +++ I +++P + + + +S Sbjct: 75 IISEGSEVIVPASGESSEDISRASVVGKSGLILGGDLNIIKPVNYIDSIFLALTISNGSQ 134 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 Q + +G ++ H + + + P L EQ I ++D I+ R + Sbjct: 135 QQEMSKRAQGKSVVHLHNSDLKQVNLLYPKLEEQQKIGSF----FKKLDNTISLHQRKLN 190 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LL E+K+ + + K +++ +G + + N + + Sbjct: 191 LLNEQKKGFLQKMFPKNGEIIPEIRFAGFNDDWEERKLGDHAKYRRGSFPQPYGNKEWYD 250 Query: 254 -----SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + N + +E + + V G++V + Sbjct: 251 GEGAMPFVQVVDVTNKLTLVENTKQKISKLAQSKSVFVPKGKVVVTLQGSIGRVAITQYD 310 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 ++R ++ D + A+ ++ + A G G +++ E + V Sbjct: 311 SFVDRTLL---IFEDYEKETDERFWAYTIQKKFEIEKLKAPG-GTIKTITKEALSSFNVH 366 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P +EQ I + ++D + ++ I LK + S + Sbjct: 367 LPKFEEQQKIGSF----FKQLDDTIALHQRKIDELKLMKKSMLQK 407 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 24/200 (12%), Positives = 54/200 (27%), Gaps = 18/200 (9%) Query: 21 IPK--------HWKVVPIKRFTKLNTGRTSESGKD---------IIYIGLEDVESGTGKY 63 IP+ W+ + K G + + + ++ + DV + Sbjct: 211 IPEIRFAGFNDDWEERKLGDHAKYRRGSFPQPYGNKEWYDGEGAMPFVQVVDVTNKLTLV 270 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + S KG+++ G R I +D L+ + + + Sbjct: 271 ENTKQKISKLAQSKSVFVPKGKVVVTLQGSIGR-VAITQYDSFVDRTLLIFEDYEKETDE 329 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + G T+ + + + + +P EQ I I Sbjct: 330 RFWAYTIQKKFEIEKLKAPGGTIKTITKEALSSFNVHLPKFEEQQKIGSFFKQLDDTIAL 389 Query: 184 LITERIRFIELLKEKKQALV 203 + + K Q + Sbjct: 390 HQRKIDELKLMKKSMLQKMF 409 >gi|190149564|ref|YP_001968089.1| type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|307262884|ref|ZP_07544508.1| Possible type I site-specific deoxyribonuclease [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|189914695|gb|ACE60947.1| putative Type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|306871789|gb|EFN03509.1| Possible type I site-specific deoxyribonuclease [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 388 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 57/421 (13%), Positives = 117/421 (27%), Gaps = 64/421 (15%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDV--ESGTG 61 KD V+W + K G T + + ++ + Sbjct: 8 KDCEVEW----------KSLGEVAKYVRGLTYNKTNESDEKAGGYYVLRANNITLSNNQL 57 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFL--VLQ 115 + + T K IL + A I++ F+ V Sbjct: 58 NFDDVKLVKFDTKTKPEQKLYKDDILISAASGSKEHVGKVAFISENMDFYFGGFMGVVRC 117 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 +++LP L L S + + +T+++ + K + +PIPPL Q I + + Sbjct: 118 SQEILPRFLFHILTSSLFKTYLNEVLNSSTINNLNAKVMNEFQIPIPPLEIQEKIVKILD 177 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 T TL + L ++ ++ G + +EW Sbjct: 178 KFTELEATLEATLEAELSLRVKQYDYYRDDLLNFGDD---------VEW----------- 217 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 L E + S N I+ E + E Sbjct: 218 -------------KMLGEVCVRIFSGKNKIKNNEGKYNVYGSTGIIAKTDKKIYEEDLLL 264 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I E + + + D L +L + + + Sbjct: 265 IARVGANAGFVHIATGEYDVSDNTLIIKHKE--DLVILKYLYYVLENMNLNRFANGAGQP 322 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 + +K L + +PP+ Q I +++ + + + + + I L ++ R + Sbjct: 323 LITAGQLKELKIPLPPLSTQQKIVEILDKFDRLTNSISDGLPKEIELRRKQYEYYRERLL 382 Query: 412 A 412 Sbjct: 383 N 383 >gi|167718506|ref|ZP_02401742.1| putative type I restriction enzyme specificity protein [Burkholderia pseudomallei DM98] gi|167814674|ref|ZP_02446354.1| putative type I restriction enzyme specificity protein [Burkholderia pseudomallei 91] Length = 111 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 35/104 (33%), Positives = 57/104 (54%), Gaps = 1/104 (0%) Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 V+ H YL ++++S + ++ S + D+ V +PP +EQ I Sbjct: 1 MYTVQMHDNVPKYLWYMLQSLKHIFILNSLKS-AVPGVDRNDIHPAIVCLPPAEEQPAIV 59 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ++ E +++D L E++I LLKERRS+ IAAAVTG+ID+R Sbjct: 60 AFLDAEISKLDALRADAERAIDLLKERRSALIAAAVTGKIDVRN 103 >gi|308190118|ref|YP_003923049.1| type I site-specific deoxyribonuclease [Mycoplasma fermentans JER] gi|307624860|gb|ADN69165.1| type I site-specific deoxyribonuclease [Mycoplasma fermentans JER] Length = 403 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 47/391 (12%), Positives = 116/391 (29%), Gaps = 26/391 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 P ++ V + + + G + S + +I + D+E G + + Sbjct: 13 PDGYEWVTLGEISSIRRGASPRPISSFLSKEGYPWIKIGDIEEGKIYLKKTKQFINEKGS 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-T 134 + KG ++ + + I I L+ + + + + Sbjct: 73 KKSVVVDKGDLILSNSMSFGKPVIADIKGCIHDGWLLIANFEKNVTSKFLYYWFLSNYSQ 132 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 T+S+ + + + + +P+ PL Q I E + I E EL Sbjct: 133 SFFLQQSSPGTISNLNSEILKKLKIPLIPLKIQEKIVEILERF------RILEAELKAEL 186 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 KQ L K ++ + + + K F + K + Sbjct: 187 EARGKQ------FDFTLTKIFNFKQYKLKKLWEI--TFWDKNFQEVEKFKQSKTSNFKYL 238 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + N + K E+ + +I + L + Sbjct: 239 FYKEIENYNDPKGDVKIITTGKEENLKINSKNYKKDIYSGEVLLIPGGGEANIKYHKGKF 298 Query: 315 IITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + + + + + + DL + + G + +++ L + +PP Sbjct: 299 VTGDNRIGQVLNKNEVATKFLYYYFLLNLDLIRKNFR--GGSIKHPFMKNILELNIPIPP 356 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++ Q I ++++ + + + I L Sbjct: 357 LETQNKIVSILDKLSEYSQEINSGLPAEIEL 387 >gi|309809641|ref|ZP_07703497.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners SPIN 2503V10-D] gi|308170001|gb|EFO72038.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners SPIN 2503V10-D] Length = 408 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 63/407 (15%), Positives = 127/407 (31%), Gaps = 41/407 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + ++ TG+ + + K G Y + + + F Sbjct: 13 PNGVEYKELGEICEITTGKLNANEK-----------IDDGLYPFFTCDKLPFRINKYA-F 60 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 IL G + + + VL + + + + I C Sbjct: 61 NTSAILISGNGSQVGHLNSYEGKFNAYQRTYVLYEFKFVEKQYLLHYMRSYLKPYIILNC 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + ++ + + N +PIPPL Q I + + T L E + + + Sbjct: 121 KKGSVPYITLPMLENFKIPIPPLPIQREIVRILDSFTELTAELTAELTARKKQYEFYRDE 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+ S E + AL+ + K S I +S Sbjct: 181 LL----------------SFGEIIKGGSTQSSKLCEIALIYDGTHKTPNYKNSGIPFISV 224 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 N I + + E Y+ Y+I +F K ++ + V ++ A + Sbjct: 225 EN-INDIYGSKKFISKEDYDLYKITPQINDLFMTRIGSVGKCAIVTKNVDLAYYVSLALI 283 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ID+ YL + + S K + + + ED+ ++ + P + Q I Sbjct: 284 RPNNKIIDTGYLKYYIESVSGTKELSKRTLHNAVPIKINKEDIGKIKITYPSLDIQKKIA 343 Query: 380 NVINVETARIDVL-------VEKIEQSIVLLKERRSSFIAAAVTGQI 419 + ++ A L +E ++ R + + A TG+I Sbjct: 344 STLDNFDAICSDLNIGLPAEIEARQKQYEY---YRDALLTYAATGKI 387 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 56/183 (30%), Gaps = 4/183 (2%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + L+ EL + E + + E + GL P + F Sbjct: 1 MSRLDELIQELCPNGVEYKELGEICEITTGKLNANEKIDDGLYPFFTCDKLPFRINKYAF 60 Query: 294 RF----IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 I + ++ + Y+ + ++ YL MRSY + Sbjct: 61 NTSAILISGNGSQVGHLNSYEGKFNAYQRTYVLYEFKFVEKQYLLHYMRSYLKPYIILNC 120 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G + ++ + +PP+ Q +I +++ T L ++ + R Sbjct: 121 KKGSVPYITLPMLENFKIPIPPLPIQREIVRILDSFTELTAELTAELTARKKQYEFYRDE 180 Query: 410 FIA 412 ++ Sbjct: 181 LLS 183 >gi|225871246|ref|YP_002747193.1| type I restriction-modification system S protein [Streptococcus equi subsp. equi 4047] gi|225700650|emb|CAW95217.1| type I restriction-modification system S protein [Streptococcus equi subsp. equi 4047] Length = 623 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 52/412 (12%), Positives = 128/412 (31%), Gaps = 25/412 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDI---------IYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + + G + D+ ++ + DV + K + Sbjct: 210 QWKTLGEVVNFRRGSFPQPYTDMSFYGGEDAQPFVQVVDVADEGFRLNTKTKKTISQKAI 269 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 S+F + L L + + +D + + + R Sbjct: 270 PKSVFVPKGTVIVTLQGTLGRVAVTQYDAYVDRTLAIFDGYKQEVDKRYFAHQLKFIFDR 329 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G+T+ + N +P+PPL Q I + + + L + IEL + Sbjct: 330 EKEFARGSTLKTITKQEFSNFKIPVPPLDIQRRIVQVLDNFDTVCNDLNIGLPKEIELHQ 389 Query: 197 EKKQALVSYIVTK-----GLNPDVKMKDSGIEWVGLV--PDHWEVKPFFALVTELNRKNT 249 ++ ++T + V+ + I + V P E+ +V + Sbjct: 390 KQYAYFRDKLLTFTAEGVYTDSTVQYRQDLIRLLTWVFGPIKVELGAVCDVVRGNGLQKK 449 Query: 250 KLIESNILSLSYGNIIQKLE----TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + YG I + PE + + G+++ + Sbjct: 450 DFVNEGYPVIHYGQIYTFYGLSARVTKSFVSPEVGQKLKKAKTGDVIVATTSENIEDVGK 509 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKR 364 + + V +S YL + ++ K + G + L +++++ Sbjct: 510 ALVWEGAEDVCIGGHSCVLHTEQNSKYLLYYFQTTVFQKQKEKLVIGTKVIELYPKNLEK 569 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER----RSSFIA 412 +++PP+ EQ I ++++ L + + + I +++ R + Sbjct: 570 AIIILPPVYEQGRIVSILDKFDTLTSDLTQGLPKEIEQRQKQYEYWRDLLLN 621 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 43/406 (10%), Positives = 106/406 (26%), Gaps = 32/406 (7%) Query: 22 PKHWKVVPIKRFTK-------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P + + + + I + T + + Sbjct: 13 PDGVEWKELGEVVDYEQPTKYIVKSKEYSDDYSIPVLTAG----QTFILGYTNEVTGIYP 68 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S I++ + DF+ + + L + Sbjct: 69 ASKEHPV----IIF---DDFTTARKWVDFEFKVKSSAMKLLSIKSDRQDDVSIRYVWHYL 121 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I+ E +P+PPL Q I + + T + L E + Sbjct: 122 GTIKYTPEQHARQWI--GTFSKFKIPLPPLEIQGEIVKILDKFTEHVTELTAELTAELTF 179 Query: 195 LKEKKQALVSYIVTKGLNPDV--KMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTK 250 +++ +++ K ++W +G V + Sbjct: 180 RQKQYSYFRDKLLSFDDESMGGANDKVYTVQWKTLGEVVNFRRGSFPQPYTDMSFYGGED 239 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + + ++ V G ++ + Sbjct: 240 AQPFVQVVDVADEGFRLNTKTKKTISQKAIPKSVFVPKGTVIVTLQGTLGRVAVTQYDAY 299 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 ++R + A +D Y A ++ + +A GS +++ ++ + VP Sbjct: 300 VDRTL---AIFDGYKQEVDKRYFAHQLKFIFDREKEFARGS-TLKTITKQEFSNFKIPVP 355 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER----RSSFIA 412 P+ Q I V++ + L + + I L +++ R + Sbjct: 356 PLDIQRRIVQVLDNFDTVCNDLNIGLPKEIELHQKQYAYFRDKLLT 401 >gi|90410148|ref|ZP_01218165.1| type I restriction-modification system, S subunit [Photobacterium profundum 3TCK] gi|90329501|gb|EAS45758.1| type I restriction-modification system, S subunit [Photobacterium profundum 3TCK] Length = 523 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 51/474 (10%), Positives = 132/474 (27%), Gaps = 76/474 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +PK W + + G + +S + +I + D + G Sbjct: 3 QLPKGWAENSLGNLVVVERGSSPRPIKNFLTDSDDGVNWIKIGDAKKGQKLLTSTAEKIT 62 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + G + + I+ I F+ PK + + L S Sbjct: 63 KEGAMKSRFVDVGDFILSNSMSFGLPYIMGIPGYIHDGWFVFRLPKQISSDYFYYLLSSS 122 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPL--------------------------- 164 V + + G + + + +P+PPL Sbjct: 123 YVGAQFNNLAVGGVVKNISGDLVKKAILPLPPLAEQTRIVEKLDEVLAQVDTIKARLDGI 182 Query: 165 ------AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ------------------ 200 Q ++ + + I + E Sbjct: 183 PAIIKRFRQSVLAAAVSGKLTEEWRDINTAQDIEKFCSEITDVRKEQYLVTCQKAKLAKS 242 Query: 201 ------ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN----TK 250 + + + L+ + +W V ++V + T Sbjct: 243 KKPRKPSNIDDKIEPHLDVLDLLPSIPEQWTQKVLSFVTDNYADSIVDGPFGASINVKTD 302 Query: 251 LIESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 I+ + + N + + + + + ++ G+++F + + Sbjct: 303 YIDDGVPVIRMVNIRPFQFLRENRKFVSFEKFEGLSRHKINEGDVLFAKVGATTGDCCMY 362 Query: 307 SAQVMERGI--ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + S + V +S +L ++ +Y + L + +K Sbjct: 363 PMNEPIAMLSTTGSCRITVDKQVYNSEFLVIVLNAYR-RIFNSITSQVAQPFLNMKTIKS 421 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +P+ +P ++EQ +I +++ + D + +++++ + S +A A G+ Sbjct: 422 VPIPIPALEEQKEIVRLVDQYFSFADTIEAQVKKAQARVDSLTQSILAKAFRGE 475 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 69/212 (32%), Gaps = 18/212 (8%) Query: 18 IGAIPKHWKVVPIKRFTK-----LNTG--------RTSESGKDIIYIGLEDVESGTG-KY 63 + +IP+ W + T + G +T + I + ++ + Sbjct: 265 LPSIPEQWTQKVLSFVTDNYADSIVDGPFGASINVKTDYIDDGVPVIRMVNIRPFQFLRE 324 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC----STQFLVLQPKDV 119 K + + + + +G +L+ K+G + + +T + Sbjct: 325 NRKFVSFEKFEGLSRHKINEGDVLFAKVGATTGDCCMYPMNEPIAMLSTTGSCRITVDKQ 384 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + ++ + +I + K I ++P+PIP L EQ I + Sbjct: 385 VYNSEFLVIVLNAYRRIFNSITSQVAQPFLNMKTIKSVPIPIPALEEQKEIVRLVDQYFS 444 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 DT+ + + + Q++++ L Sbjct: 445 FADTIEAQVKKAQARVDSLTQSILAKAFRGEL 476 >gi|45656819|ref|YP_000905.1| type I restriction enzyme [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45600055|gb|AAS69542.1| type I restriction enzyme [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 393 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 64/402 (15%), Positives = 121/402 (30%), Gaps = 49/402 (12%) Query: 26 KVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + ++ L G T I + +ED+ + +S Sbjct: 17 EWKAVEEIFDLRNGYTPSKSISEYWKDGTIPWFRMEDIRANGQILNNALQKVAKSALKGG 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---QGWLLSIDVTQ 135 +F I+ A+I + + +F L K + + + Sbjct: 77 KLFPANSIIVATSATIGEHALIT-VPYLSNQRFTNLILKTEYSDRFEIRFLFYYCFLLDD 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + ++ + D G I +PIPPL Q I + A T L +E + Sbjct: 136 WCKNNTTMSSFASVDMNGFKKIQIPIPPLPAQEEIVRILDAFTELTTELASELSARKKQY 195 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDS-GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + L+S +G + ++ + G P+ + Sbjct: 196 NYYRDQLLS--FEEGEVEWKTLGETCDVYTGGEAPESSSMSK------------------ 235 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I K G + Y +D + I R A Sbjct: 236 ------TPTDIYKYPIFGNGAEVYGYTDRYRIDKDAVTISSIGANTGTIYFRKAHFTP-- 287 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 II + K GI YL + S + S ++ DVKR+ + +PP+ E Sbjct: 288 IIRLKVVIPKQEGILPRYLFHALSS-----IAIGSKSSSVPNMNAADVKRISIPIPPLAE 342 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q I ++++ A + E + + I L ++ R ++ Sbjct: 343 QERIVDILDKFDALTSSISEGLPREIELRQKQYEYYRELLLS 384 >gi|238923270|ref|YP_002936785.1| type I restriction-modification system specificity subunit [Eubacterium rectale ATCC 33656] gi|238874944|gb|ACR74651.1| type I restriction-modification system specificity subunit [Eubacterium rectale ATCC 33656] Length = 412 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 47/405 (11%), Positives = 120/405 (29%), Gaps = 24/405 (5%) Query: 23 KHWKVVPIKRFTKLNT-------GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 K W+ + + + I ++ + + + Sbjct: 13 KDWEQRKLNEVAEKICVGFVGTCEKFYTDESGIPMYRTGNLNGLSLNRDDLKYVTNEFHQ 72 Query: 76 STVS-IFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSID 132 G IL + G + + + + K + L + S + Sbjct: 73 HNQKSQLKAGDILIARHGDSGKAVNYENSEEANCLNIVIIRPDFKKCNYKFLTNCINSPE 132 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I+++ G+T + + I + + IP ++ I +D LIT R Sbjct: 133 CQKHIKSLSAGSTQAVINTSEIEKLGVVIPANIDEQNR---IARYFSTLDNLITLHQRKC 189 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E K+ K+ ++ + + +++ G + E+ Sbjct: 190 EQTKKLKKYMLQKMFPRNGAKVPEIRFDGFTYDWEQRKLGEIYGSIGNAFVGT-ATPYYA 248 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 E L N+ N + + + + G++V ++ Sbjct: 249 EHGHFYLESNNVKDGQINHNAEIFINDEFYEKQKDKWLHTGDMVMVQSGHVGH-AAVIPE 307 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPV 367 ++ I+ +L + ++ K + +G + + D++ V Sbjct: 308 ELDNTAAHALIMFRNPKEEIEPYFLNYEYQTDKAKKQIENITTGNTIKHILASDMQEFVV 367 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P +EQ I + ++D L+ ++ LK+ + + Sbjct: 368 DIPKYEEQKVIASY----FCKLDHLITLHQRKCDELKKMKKYMLQ 408 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 68/206 (33%), Gaps = 10/206 (4%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALV--TELNRKNTKLIESNILSLSYGNIIQKLE 269 NP ++ K +W + K V E + I G + + + Sbjct: 3 NPKIRFKGFTKDWEQRKLNEVAEKICVGFVGTCEKFYTDESGIPMYRTGNLNGLSLNRDD 62 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + + + + + G+I+ ++ E + + + Sbjct: 63 LKYVTNEFHQHNQKSQLKAGDILIARHGDSGK--AVNYENSEEANCLNIVIIRPDFKKCN 120 Query: 330 STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETA 387 +L + S + K ++ +G + + ++++L V++P I EQ I + Sbjct: 121 YKFLTNCINSPECQKHIKSLSAGSTQAVINTSEIEKLGVVIPANIDEQNRIARY----FS 176 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ ++ K+ + + Sbjct: 177 TLDNLITLHQRKCEQTKKLKKYMLQK 202 >gi|229129742|ref|ZP_04258709.1| Type I restriction-modification system specificity subunit [Bacillus cereus BDRD-Cer4] gi|228653658|gb|EEL09529.1| Type I restriction-modification system specificity subunit [Bacillus cereus BDRD-Cer4] Length = 388 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 45/395 (11%), Positives = 114/395 (28%), Gaps = 27/395 (6%) Query: 35 KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY 94 + T + + + + + + + ++ + KG+ Y K Sbjct: 2 ERVTRKNKKGESRLP-LTISAQYGLVDQETYFNKTVASTNLEGYYLLYKGEFAYNKSYSN 60 Query: 95 LRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC------EG 143 G+ S+ ++ +P + E Sbjct: 61 GYPYGAIKRLEKHDKGVLSSLYICFRPLNYSVSSDFLTHYFESAVWHKEVSMISVEGARN 120 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + IP L EQ I + ++D +I + + LK+ K+ + Sbjct: 121 HGLLNISVSDFFETLHLIPNLVEQTQIGNFL----KQLDDMIALHQQELTTLKQTKKGFL 176 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + K +++ G + +E L N Sbjct: 177 QKMFPKEGESVPEVRFPGFTGDWEQRKLESIYEKIRNAFVGT-ATPYYVEDGHFYLESNN 235 Query: 264 IIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + RN + + + G++V ++ ++ Sbjct: 236 VKDGQINRNTEVFINDEFYEKQKNNWLHTGDLVMVQSGHVGH-TAVIPEELDNTAAHALI 294 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDI 378 + D +L + +++ K + +G + + ++K+ V +P +EQ I Sbjct: 295 MFSNYREKADPYFLNYQFQTHKSKKKLNNITTGNTIKHILASEMKKFLVDIPKYEEQKMI 354 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 N ++D + ++ + LKE + +F+ Sbjct: 355 GNF----FKQLDDAIALHQRELDALKETKKAFLQK 385 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 64/194 (32%), Gaps = 14/194 (7%) Query: 24 HWKVVPIKRFTKLNT----GRTSES--GKDIIYIGLEDVESGTG-KYLPKDGNSRQSDTS 76 W+ ++ + G + Y+ +V+ G + N + Sbjct: 198 DWEQRKLESIYEKIRNAFVGTATPYYVEDGHFYLESNNVKDGQINRNTEVFINDEFYEKQ 257 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDV 133 + G ++ + G A+I + + L++ P L + Sbjct: 258 KNNWLHTGDLVMVQSGHVGHTAVIPEELDNTAAHALIMFSNYREKADPYFLNYQFQTHKS 317 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +++ I G T+ H + + IP EQ +I ++D I R ++ Sbjct: 318 KKKLNNITTGNTIKHILASEMKKFLVDIPKYEEQKMIGNF----FKQLDDAIALHQRELD 373 Query: 194 LLKEKKQALVSYIV 207 LKE K+A + + Sbjct: 374 ALKETKKAFLQKMF 387 >gi|313113034|ref|ZP_07798672.1| type I restriction modification DNA specificity domain protein [Faecalibacterium cf. prausnitzii KLE1255] gi|310624648|gb|EFQ07965.1| type I restriction modification DNA specificity domain protein [Faecalibacterium cf. prausnitzii KLE1255] Length = 424 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 69/404 (17%), Positives = 124/404 (30%), Gaps = 26/404 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA-- 82 W+ + + T KD G+ V++G + F Sbjct: 21 WEQRKLTNLCEKFTDGDWIEAKDQSDSGVRLVQTGNVGVTEYLDKPNNKKWISFETFEQL 80 Query: 83 ------KGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKD-VLPELLQGWLLSID 132 G IL +L +A I G I + +++P L +L S Sbjct: 81 HCEEVYPGDILISRLPEPAGRACIMPNLGTKMITAVDCTIVRPNAVTSTRFLLQYLSSQA 140 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + G T + +PIP + EKI ++DTLIT R Sbjct: 141 YFDAVNTCLAGGTRQRISRGNLAQFNVPIPSSKIEQ---EKIGEILEKLDTLITLHQRKY 197 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E L K++++ + K +++ G E+ A + N + ++ + Sbjct: 198 EKLVNIKKSMLDKMFPKNGASVPEIRFKGFTDPWEQRKLSELTSMHARIGWQNLRTSEFL 257 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETY-----QIVDPGEIVFRFIDLQNDKRSLRS 307 +S L G E Y + G I+ ++S Sbjct: 258 DSGDYMLITGTDFDDGTVNYSTCHFVERERYEQDKNIQIRNGSILITKDGTLGKVAYVQS 317 Query: 308 AQVMERGIITSAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRL 365 + + +D YL +++ L G + L + Sbjct: 318 LSMPATLNAGVFNVEIRNTSIVDERYLFQYLKAPFLMDYVDKKATGGTIKHLNQNILVDF 377 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 PV++P EQ I N RID L+ ++ + L+ + S Sbjct: 378 PVVMPKKTEQVSIGNF----FQRIDTLITLHQRKLEKLQNIKKS 417 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 49/150 (32%), Gaps = 6/150 (4%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + + + + V PG+I+ + + + + + Sbjct: 65 KPNNKKWISFETFEQLHCEEVYPGDILISRLPEPAGRACIMPNLGTKMITAVDCTIVRPN 124 Query: 326 HGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIK-EQFDITNVIN 383 + +L + S + G RQ + ++ + V +P K EQ I ++ Sbjct: 125 AVTSTRFLLQYLSSQAYFDAVNTCLAGGTRQRISRGNLAQFNVPIPSSKIEQEKIGEILE 184 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++D L+ ++ L + S + Sbjct: 185 ----KLDTLITLHQRKYEKLVNIKKSMLDK 210 >gi|222525221|ref|YP_002569692.1| restriction modification system DNA specificity domain-containing protein [Chloroflexus sp. Y-400-fl] gi|222449100|gb|ACM53366.1| restriction modification system DNA specificity domain protein [Chloroflexus sp. Y-400-fl] Length = 438 Score = 106 bits (265), Expect = 6e-21, Method: Composition-based stats. Identities = 65/433 (15%), Positives = 135/433 (31%), Gaps = 37/433 (8%) Query: 25 WKVVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W V + +N + I YI + V GT P + ++ + + Sbjct: 5 WGTVRLGDVATINPDAIGANWPFLHIRYIDISSVGEGTIIEKPSQISLSEAPSRAKRLIR 64 Query: 83 KGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSID--VTQRI 137 +G + + P R + D + ST F VL+PK + + D T + Sbjct: 65 EGDTVLSMVRPNRRSMFFVTTFEPDLVVSTGFAVLRPKPKVIHPRYLYACVFDRAFTDYL 124 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + +GA + I + +P PPL EQ I + +I+ ++ + Sbjct: 125 VSREKGAAYPAVLSEDIADAKIPFPPLPEQRAIAHILGTLDDKIELNRRMSETLEQMARA 184 Query: 198 KKQALVSYIVT---------------KGLNPDVKMKDSG---IEWVGLVPDHWEVKPFFA 239 +A GL +G +P+ W V Sbjct: 185 LFKAWFVDFEPVRAKIECRWQRGQSLPGLPAHFYDLFPERLVDSELGEIPEGWGVGRLSE 244 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 L+ + + E L N+ + + + + G+ + I Sbjct: 245 LIELNPPRVLRKGEVA-PYLDMANMPTRGHV-PGDVVDRPFGSGTRFINGDTLLARITPC 302 Query: 300 NDKRSLRSAQVMERGII---TSAYMAVKPHGIDSTYLAWLM-RSYDLCK--VFYAMGSGL 353 + + G + ++ Y+ ++P A+ + RS + + G+ Sbjct: 303 LENGKTAFVDFLRNGQVGWGSTEYIVLRPREPLPAEFAYCLARSENFRDFAIQNMTGTSG 362 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 RQ ++ E + ++ PP + AR + L R + + Sbjct: 363 RQRVQTEAIAHYLLVAPPAPVAEAFGRTVKQLFARA----TRASCESRTLAALRDALLPK 418 Query: 414 AVTGQIDLRGESQ 426 + G+I ++ + Sbjct: 419 LIRGEIRVKDAEK 431 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/140 (25%), Positives = 57/140 (40%), Gaps = 15/140 (10%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS +G IP+ W V + +LN R G+ Y+ + ++ T ++P D R Sbjct: 227 DSE---LGEIPEGWGVGRLSELIELNPPRVLRKGEVAPYLDMANM--PTRGHVPGDVVDR 281 Query: 72 QSDTSTVSIFAKGQILYGKLGPYL--RKAIIADF-----DGICSTQFLVLQPKDVLP-EL 123 + T I G L ++ P L K DF G ST+++VL+P++ LP E Sbjct: 282 PFGSGTRFI--NGDTLLARITPCLENGKTAFVDFLRNGQVGWGSTEYIVLRPREPLPAEF 339 Query: 124 LQGWLLSIDVTQRIEAICEG 143 S + G Sbjct: 340 AYCLARSENFRDFAIQNMTG 359 >gi|257080966|ref|ZP_05575327.1| type I restriction enzyme MjaXIP specificity protein [Enterococcus faecalis E1Sol] gi|256988996|gb|EEU76298.1| type I restriction enzyme MjaXIP specificity protein [Enterococcus faecalis E1Sol] Length = 422 Score = 106 bits (265), Expect = 7e-21, Method: Composition-based stats. Identities = 57/422 (13%), Positives = 124/422 (29%), Gaps = 29/422 (6%) Query: 29 PIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSI 80 + + G+ GK + Y+ + D SG K ++ D+ + Sbjct: 5 KLGNLCLVKGGKRLPKGKALLDYKTEHPYLRITDYASGNIDLKNLKYISNDVFDSISKYT 64 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICS-----TQFLVLQPKDVLPELLQGWLLSIDVTQ 135 K I +G II + S + +V + L +L S Sbjct: 65 INKKDIFLSIVGTIGIVDIIDEKLDGASLTENAVKIIVKDRTKIDVNYLAYYLKSTMGQY 124 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I+ G+T I +IP+P+ + +Q I + + +I EL Sbjct: 125 EIDIRTVGSTQKKLAITRIKDIPVPVIEINKQRKIASVLSSLDSKIKLNNQIISNLEELS 184 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSG----IEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + N K SG G +P+ ++VK + + Sbjct: 185 STLFKRWFVDFEFPDEN-GNPYKSSGGKMDDSEFGEIPECFQVKKLSDIADVIGGGTPSK 243 Query: 252 IESNILSLSYGNII-------QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + I K + G + + I + Sbjct: 244 KVKEYFEDGNISWITPKDLSINKNIFIDRGKTSITRLGLNKSSAKLLPKNSILFSSRAPI 303 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 +A + ++ + ++ K + + VK Sbjct: 304 GYTAISKNELATNQGFKSLIALDGIPYQFIFHFIRNNVSKFESIATGSTFKEVSGTAVKN 363 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +++P + + +V + + ++ +E+ +L E R S + ++G+I+L + Sbjct: 364 FKIVLPTEEVLQNYADVTSPLFKK----IKIVEEENNILTELRDSLLPKLLSGEIELPED 419 Query: 425 SQ 426 + Sbjct: 420 EE 421 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 69/209 (33%), Gaps = 16/209 (7%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVES 58 YK SG + G IP+ ++V + + G T +I +I +D+ Sbjct: 205 YKSSGGKMDDSEFGEIPECFQVKKLSDIADVIGGGTPSKKVKEYFEDGNISWITPKDLSI 264 Query: 59 GTGKYLPKDGNSR---QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 ++ + S + S+ + K IL+ P AI + + F L Sbjct: 265 NKNIFIDRGKTSITRLGLNKSSAKLLPKNSILFSSRAPIGYTAISKNELA-TNQGFKSLI 323 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 D +P + + + E+I G+T + N + +P + Sbjct: 324 ALDGIP-YQFIFHFIRNNVSKFESIATGSTFKEVSGTAVKNFKIVLPTEEVLQNYADVTS 382 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVS 204 +I + E EL L+S Sbjct: 383 PLFKKIKIVEEENNILTELRDSLLPKLLS 411 >gi|149012613|ref|ZP_01833610.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] gi|147763418|gb|EDK70355.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] Length = 516 Score = 106 bits (265), Expect = 7e-21, Method: Composition-based stats. Identities = 72/435 (16%), Positives = 143/435 (32%), Gaps = 62/435 (14%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 IP W+ V IK E I D + Y + + Q+ + Sbjct: 83 DIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRAR 142 Query: 79 SIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ +L+ + PYL+ + I ST F+VL L +LLS + Sbjct: 143 KLVSQNSVLFSTVRPYLKNIAVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFIN 201 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ G + + + + +P L+EQ I E I + ++D R +L Sbjct: 202 RVNNKSTGTSYPAINDYNFNLLLIALPSLSEQQRIVEAIESALEKVDEYAESYNRLEQLD 261 Query: 196 KEKK----QALVSYIVTKGLNPDVKMKDS------------------------------- 220 KE ++++ Y + L +S Sbjct: 262 KEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIV 321 Query: 221 --------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 E +P+ WE + + + R + + + + + Sbjct: 322 SQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQWSGFS 381 Query: 273 MGL-------KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----Y 320 + L SY+ +++ G++++ L R ++ G + Sbjct: 382 IDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTV 441 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDI 378 + V I+ ++ + S + V SG ++ L + +K + +PP+ EQ I Sbjct: 442 IRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRI 501 Query: 379 TNVINVETARIDVLV 393 + I A ID L+ Sbjct: 502 VDKIEQFFAHIDALI 516 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 71/206 (34%), Gaps = 10/206 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE ++ + + I + S + +N+ Sbjct: 77 EIDVPYDIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQ 136 Query: 281 ---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++V ++F + ++ ++ +I S V ++ TYL + + Sbjct: 137 APSRARKLVSQNSVLFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYL 194 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + +G ++ + L + +P + EQ I I ++D E Sbjct: 195 LSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPSLSEQQRIVEAIESALEKVDEYAESY 254 Query: 397 EQSIVLLKE----RRSSFIAAAVTGQ 418 + L KE + S + A+ G+ Sbjct: 255 NRLEQLDKEFPDKLKKSILQYAMQGK 280 >gi|332292954|ref|YP_004431563.1| restriction modification system DNA specificity domain protein [Krokinobacter diaphorus 4H-3-7-5] gi|332171040|gb|AEE20295.1| restriction modification system DNA specificity domain protein [Krokinobacter diaphorus 4H-3-7-5] Length = 413 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 69/397 (17%), Positives = 143/397 (36%), Gaps = 25/397 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++W + + + ++ I +E + + L + Q Sbjct: 33 ENWTKKDFGTIVEKAKAKHNPKKSKEEYPCIEMESIAKESSILLEVFNSKDQLSIKNK-- 90 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F+KG+IL+GKL P L+K IIA FDG+CS++ VL K++ E L + + + Sbjct: 91 FSKGEILFGKLRPNLKKYIIAPFDGVCSSEIWVLNGKELSNEFLFRLIQTNKFHSSTL-V 149 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ M ADW I + P P L EQ I + +D I + + LL++ K+ Sbjct: 150 TSGSKMPRADWAYISSSIFPFPSLPEQQKIASFL----SAVDKKIQQLTKKKALLEQYKK 205 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 ++ + + L + + +W + ++ A+ E K+ + + Sbjct: 206 GVMQQLFSGQLRFKDENGNPYPDW-----EEKKMGDILAVRNEQAPKSEQYPLMAFIKHK 260 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 R + + Y+ + G+ ++ +L L I+ Y Sbjct: 261 GVAPKGDRYNREFLVNDGDGKKYKKTEYGDFIYSSNNLDTGSIGL---NSYGSACISPVY 317 Query: 321 -MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQF 376 + D +++ + G+ + + V + +P ++EQ Sbjct: 318 SIFQIKELYDYQFISRFLVRKSFINKMLRFRQGVVYGQWKIHESAVLTIKEKIPCLEEQQ 377 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + + ID +E + I + + + Sbjct: 378 KIATYL----SSIDTKIESVHTQITQTQTFKKGLLQQ 410 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 56/178 (31%), Gaps = 9/178 (5%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPES-YETYQIVDPGEIVFRFIDLQNDKRSLR 306 N K + + +I ++ + GEI+F + K + Sbjct: 52 NPKKSKEEYPCIEMESIAKESSILLEVFNSKDQLSIKNKFSKGEILFGKLRPNLKKYIIA 111 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 G+ +S + + + +L L+++ + + Sbjct: 112 PFD----GVCSSEIWVLNGKELSNEFLFRLIQTNKFHSSTLVTSGSKMPRADWAYISSSI 167 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 P + EQ I + ++ +I L + LL++ + + +GQ+ + E Sbjct: 168 FPFPSLPEQQKIASFLSAVDKKIQQL----TKKKALLEQYKKGVMQQLFSGQLRFKDE 221 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 55/189 (29%), Gaps = 9/189 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLED-VESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + + + +S + + ++ + G ++ D Sbjct: 228 DWEEKKMGDILAVRNEQAPKSEQYPLMAFIKHKGVAPKGDRYNREFLVNDGDGKKYKKTE 287 Query: 83 KGQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAI 140 G +Y + + S + + Q K++ +L+ ++ Sbjct: 288 YGDFIYSSNNLDTGSIGLNSYGSACISPVYSIFQIKELYDYQFISRFLVRKSFINKMLRF 347 Query: 141 CEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +G + I IP L EQ I + IDT I I + Sbjct: 348 RQGVVYGQWKIHESAVLTIKEKIPCLEEQQKIATYL----SSIDTKIESVHTQITQTQTF 403 Query: 199 KQALVSYIV 207 K+ L+ + Sbjct: 404 KKGLLQQMF 412 >gi|293369054|ref|ZP_06615652.1| type I restriction modification DNA specificity domain protein [Bacteroides ovatus SD CMC 3f] gi|292635860|gb|EFF54354.1| type I restriction modification DNA specificity domain protein [Bacteroides ovatus SD CMC 3f] Length = 402 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 51/400 (12%), Positives = 111/400 (27%), Gaps = 24/400 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W + +L +G+ K I G + + S Sbjct: 4 KVPEVWVWTTLGEILELVSGQDFPPEKYNANIAGIPYIIGASNIENEQLIINRWTESPSV 63 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +L G + K I + + + ++ + Sbjct: 64 YSYLNDLLVVCKGAGVGKMAINNIGVAHIARQIQAVRGYTNYTDIKYIKAVVKNNIENII 123 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + ++ +P+PP++EQ I +I ID + + ++K+ K Sbjct: 124 SKANGLIPGLKRELLLSLQLPLPPISEQRRIVCEIERWFFLIDQIEQGKADLQTVIKQAK 183 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHW--EVKPFFALVTELNRKNTKLIESNIL 257 ++ + L P + IE + + + + K N + Sbjct: 184 SKILDLAIHGKLVPQNPNDEPAIELLKRINPDFTPCDNRHSGKLPYEIPKTWVWCSHNSI 243 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQI--------------------VDPGEIVFRFID 297 G LKP YQI G+I+ Sbjct: 244 LDISGGSQPAKSYFETILKPNYIRLYQIRDYGESPVPVYIPINLASKQTKKGDILLARYG 303 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 K + A+ + + + + I + + S + + Sbjct: 304 GSLGK--VFYAEQGAYNVAMAKVIFKFENLIYKEFAYYYYLSDLYQGKLKEISRTAQTGF 361 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 D + +PPI EQ I + + +D + + +E Sbjct: 362 NITDFNDMYFPLPPINEQQRIVQKMEELFSSLDDIQKNLE 401 >gi|84386437|ref|ZP_00989465.1| type I restriction-modification system specificity determinant [Vibrio splendidus 12B01] gi|84378861|gb|EAP95716.1| type I restriction-modification system specificity determinant [Vibrio splendidus 12B01] Length = 404 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 49/403 (12%), Positives = 121/403 (30%), Gaps = 40/403 (9%) Query: 26 KVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSI 80 + + +L G +T + + I + + G + T + Sbjct: 17 EWKVLSEVGELVRGNGLPKTDFTESGVPAIHYGQIYTHYGLCTSSTISFVSEKTADKLKK 76 Query: 81 FAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVT 134 KG ++ L + + + + +P +++ + + + Sbjct: 77 VNKGDVIITNTSENLEDVGKSVVYLGNEQAVTGGHATIFKPSEIILGKYFAYYTQTNAFS 136 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +GA + + I +P+PP+ QV + + L +E + Sbjct: 137 SEKRKYAKGAKVIDVSASDMAKIQVPLPPIHIQVEVVRILDTFRDLTSALSSELAMRKKQ 196 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + L++ DS IEW K + ++ + + Sbjct: 197 YSYYRAKLLN------------FNDSEIEW----------KSLSEVSEYSKKRISFDLLD 234 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 +S N++Q + + + + +I+ I K + G Sbjct: 235 TENYVSVENLLQNCAGKAKANRVPTSGNLTQYNSCDILIGNIRPYLKKIWYADSLGGTNG 294 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 + ++ I++ YL L+ + G + + VP ++ Sbjct: 295 DV--LVISSTDARINNRYLYQLLADDGFFEYNMQHAKGAKMPRGNKAKIMDYRIPVPSVE 352 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 EQ I ++++ I E + + I L ++ R ++ Sbjct: 353 EQKRIVSILDKFNTLIHSTSEGLPKEIELRQKQYEYYRDLLLS 395 >gi|313107800|ref|ZP_07793974.1| hypothetical protein PA39016_001140001 [Pseudomonas aeruginosa 39016] gi|310880476|gb|EFQ39070.1| hypothetical protein PA39016_001140001 [Pseudomonas aeruginosa 39016] Length = 378 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 72/393 (18%), Positives = 145/393 (36%), Gaps = 34/393 (8%) Query: 24 HWKVVPIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 WKV + R Y+GLE +++ + K + + +T +F Sbjct: 4 GWKVWRFDQLATNVNVRIDNPSESGMEHYVGLEHLDADSLKI--RRWGTPDDVEATKLMF 61 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRIEA 139 KG I++G+ Y RK +A+FDGICS +V +P VLP L ++ S R Sbjct: 62 KKGDIIFGRRRAYQRKLGVAEFDGICSAHAMVLRAKPDVVLPAFLPFFMQSDLFMSRAVE 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I G+ +WK + +PPL EQ + ++ + + + + Sbjct: 122 ISVGSLSPTINWKTMAVQEFVLPPLEEQQRAVHFL----SAVEDQSEAVLHALTAATKLR 177 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 +++ ++ P V+M G P R + I L Sbjct: 178 KSMALEAFSRSDYPIVRMGSVAEIKNGSTP---------------RRATDAYWKGTIPWL 222 Query: 260 SYGNIIQKLE---TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 G + +++ + K S + ++ G + I + R+A + I Sbjct: 223 PTGKVNERVIQAADEFITEKALSECSLAMIPAGATLVAMIGEGQTRG--RAAMLAIDSCI 280 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + AV P G + + + + + + +++L +K P+ VPP++ Q Sbjct: 281 NQNFGAVIPGGSLDPWYLFYLLESNYEALRHWSQGTNQRALSCGLLKNYPIPVPPLEVQQ 340 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 ++ + I+ ++ + ++ E R + Sbjct: 341 ELVGQL----KEIEATESQLALRLDMVHEMRRA 369 >gi|253578027|ref|ZP_04855299.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA] gi|251850345|gb|EES78303.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA] Length = 393 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 56/403 (13%), Positives = 125/403 (31%), Gaps = 25/403 (6%) Query: 26 KVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 K + + + G +S I I + +V+ G + +++ + Sbjct: 2 KKIRLGDACDILNGFAFKSENYVDSGIRVIRIANVQKGYIEDNTPVFYPLETNELDKYML 61 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS----IDVTQRI 137 +G +L G R AI+ + V + + + +L Q+ Sbjct: 62 EEGDLLMALTGNVGRVAILKKEFMPAALNQRVACLRLKTDRVAKDYLFHVLNSAFFEQQC 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + + + +P+ P +Q LI + + I I+ +L Sbjct: 122 IQSSKGVAQKNMSTEWLKDYEIPMYPKEQQELIADILDKTRNII---ISRNYELKKLDDL 178 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K V LN K V + P + KP VT+ + I+ Sbjct: 179 IKARFVEMFGDAYLNEFGWKKIKIKNAVTVEPQNGMYKPQSDYVTDGSGIPILRIDGF-- 236 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ-VMERGII 316 Y ++ + E+ ++ ++V ++ ++E + Sbjct: 237 ---YDGVVTDFSSLKRLRCSENERQKYLLYEDDVVINRVNSIEYLGKCAHINGLLEDTVY 293 Query: 317 TSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPI 372 S M + Y+ L+ S + + S+ +DV + PP+ Sbjct: 294 ESNMMRMHFDSTRFHPVYVCRLLCSRFVYDQIVNHAKQAVNQASINQKDVLDFDIYEPPL 353 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 K Q + + D +I++++ + S + Sbjct: 354 KLQIQFADFVRAV----DKSKVEIQKALDKTQMLFDSLMQEYF 392 >gi|313204425|ref|YP_004043082.1| restriction modification system DNA specificity domain [Paludibacter propionicigenes WB4] gi|312443741|gb|ADQ80097.1| restriction modification system DNA specificity domain [Paludibacter propionicigenes WB4] Length = 433 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 50/417 (11%), Positives = 114/417 (27%), Gaps = 32/417 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTG-------------KYLPKDG 68 K W + P + + ++++ G + Sbjct: 20 KEWILEPFSEIYSFLGTNSFTRDNLNYRDGNIKNIHYGDIHTKFNSHFDITKEIVPFVNL 79 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---- 124 + +G I++ L + + + ++ +L Sbjct: 80 DITVEKIKEEFFCKEGDIIFADASEDLADVGKSIEIIYLNNEKILSGLHTLLARQKDSKL 139 Query: 125 -----QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAET 178 S + +I+ +GA + + NI + P EQ I + + Sbjct: 140 RTGFGGHLFKSSSIRTQIQKESQGAKVLGISATRLSNISVYYPENKDEQQKIASCLSSLD 199 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 I +E LK+ K+ L+ + K++ E G + K Sbjct: 200 EL----IAAHTYKLEALKDHKKGLMQQLFPAEGETVPKLRFKEFEGDGEWVETTLNKLGN 255 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + N E ++ S ++ + + ++ P +I+ + Sbjct: 256 LIGGLTYSPNDIRNEGLLVLRSSNIQNGLIDLNDCVYVTTEVKGANLIQPNDILICVRNG 315 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + + T +++ L ++ A S+ Sbjct: 316 SKSLIGKNAIIPKDIPFATHGAFMTVFRAYQPSFIFQLFQTDLYSNQVKADLGATINSIN 375 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + +VP EQ I N + + ID + Q I LKE + + Sbjct: 376 GSNLLKYKFIVPQPNEQQKIANFL----SSIDDEIAAQVQKIEGLKEHKKGLMQGLF 428 >gi|237712394|ref|ZP_04542875.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides sp. 9_1_42FAA] gi|229453715|gb|EEO59436.1| type I restriction enzyme EcoR124II specificity protein [Bacteroides sp. 9_1_42FAA] Length = 356 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 55/352 (15%), Positives = 113/352 (32%), Gaps = 37/352 (10%) Query: 81 FAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 +L G L + + G S +++ V PE +LS + + Sbjct: 6 VLANDLLLNITGGSLGRCAVVPADFNCGNVSQHVCIMRSVLVEPEYFHVLVLSSYFAKSM 65 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G+ + + P+PPL EQ I +I ID + + ++K+ Sbjct: 66 K--ITGSGREGLPKYNLEQMGFPLPPLTEQQRIVAEIEHWFALIDQIEQGKADLQTIIKQ 123 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWV----------------GLVPDHWEVKPFFALV 241 K ++ + L P + IE + VP+ W L Sbjct: 124 TKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTFDVPNGWNWCKLNDLC 183 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGL---------KPESYETYQIVDPGEIV 292 + L+R + + + L+ + L +++ + G+++ Sbjct: 184 SFLSRGKSPKYSEDDKTYPVFAQKCNLKEGGISLEQARFLDPSTINKWDSKYKLQTGDVL 243 Query: 293 FRFIDLQNDKRSLRSAQVM---ERGII--TSAYMAVKPHGIDSTYLAWLMRSYDLCKVF- 346 R+ + ++ + + I+S Y+ M S + + Sbjct: 244 VNSTGTGTVGRTRLFDESYLGKYPFVVPDSHVAVVRTYEEINSEYVFAYMSSQLIQQYIE 303 Query: 347 -YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 GS ++ L ++ L PPI EQ I I + +D + +E Sbjct: 304 DNLAGSTNQKELYIGVLENLYFPFPPINEQQRIVQKIEELFSVLDNIQNALE 355 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 55/133 (41%), Gaps = 2/133 (1%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 V +++ + ++ A G ++ ++ ++ Y L+ S K Sbjct: 6 VLANDLLLNITGGSLGRCAVVPAD-FNCGNVSQHVCIMRSVLVEPEYFHVLVLSSYFAKS 64 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 GSG R+ L +++++ +PP+ EQ I I A ID + + ++K+ Sbjct: 65 MKITGSG-REGLPKYNLEQMGFPLPPLTEQQRIVAEIEHWFALIDQIEQGKADLQTIIKQ 123 Query: 406 RRSSFIAAAVTGQ 418 +S + A+ G+ Sbjct: 124 TKSKILDLAIHGK 136 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 61/181 (33%), Gaps = 17/181 (9%) Query: 20 AIPKHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLE----DVESGTGKYLPKDGN--SRQ 72 +P W + + G++ + +D + +++ G S Sbjct: 169 DVPNGWNWCKLNDLCSFLSRGKSPKYSEDDKTYPVFAQKCNLKEGGISLEQARFLDPSTI 228 Query: 73 SDTSTVSIFAKGQILYGKLGP-------YLRKAIIADFDGIC--STQFLVLQPKDVLPEL 123 + + G +L G ++ + + + S +V +++ E Sbjct: 229 NKWDSKYKLQTGDVLVNSTGTGTVGRTRLFDESYLGKYPFVVPDSHVAVVRTYEEINSEY 288 Query: 124 LQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + ++ S + Q IE G+T + N+ P PP+ EQ I +KI +D Sbjct: 289 VFAYMSSQLIQQYIEDNLAGSTNQKELYIGVLENLYFPFPPINEQQRIVQKIEELFSVLD 348 Query: 183 T 183 Sbjct: 349 N 349 >gi|168179781|ref|ZP_02614445.1| Sau1hsdS1 [Clostridium botulinum NCTC 2916] gi|182669200|gb|EDT81176.1| Sau1hsdS1 [Clostridium botulinum NCTC 2916] Length = 404 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 54/409 (13%), Positives = 131/409 (32%), Gaps = 41/409 (10%) Query: 24 HWKVVPIKRFTK-LNTGRTSESGK------DIIYIGLEDVESGTGKYLPK-DGNSRQSDT 75 W+ I TK + +G+T + G +I++ +++ +G ++ Sbjct: 15 EWEFEKIGNITKKVGSGKTPKGGNTVYTDSGVIFLRSQNILNGILALNDVAYITEDENSK 74 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSID 132 + IL G + ++ I + +++ K+ + Sbjct: 75 MKSTQVYGNDILLNITGASIGRSCIVPKIFPKANVNQHVCIIRLKENYNSYFIMNQILSY 134 Query: 133 -VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 V ++I++ G +++ I + + + EQ I +I+ + Sbjct: 135 KVQKQIDSYQAGGNREGLNFQQIKQMNVAVTVYEEQQKIANFFSLIDKKIENQQEKVEAL 194 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + K Q + S + + P+ + + E Sbjct: 195 KDYKKGMMQKIFSQAIRFKGD-----------NGEEYPE--WEEKKAEKLFESISDKKHN 241 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPES--YETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 E +LS + + N+ +K E +Y+ V + Q Sbjct: 242 GELEVLSATQDRGVIPRSELNIDIKYEESSLSSYKRVRKNNFIISLRSFQG-----GIET 296 Query: 310 VMERGIITSAYMAVKPH---GIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKR 364 G+++ AY + + + + +S + + G+R +++ F+D Sbjct: 297 SKYDGLVSPAYTVFNFKENEKQNHDFFSLIFKSRNFINRLNTLIYGIRDGKAISFKDFAG 356 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + P I+EQ I V ++ EK ++ + L E + + Sbjct: 357 VKLQYPCIEEQEKIALFFLVIYKKL----EKEQEKLDSLNEWKKGLLQQ 401 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 75/216 (34%), Gaps = 11/216 (5%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL---- 268 P ++ K+ EW + NT +S ++ L NI+ + Sbjct: 5 PKLRFKEFSGEWEFEKIGNIT--KKVGSGKTPKGGNTVYTDSGVIFLRSQNILNGILALN 62 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + S V +I+ + + + + + Sbjct: 63 DVAYITEDENSKMKSTQVYGNDILLNITGASIGRSCIVPKIFPKANVNQHVCIIRLKENY 122 Query: 329 DSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 +S ++ + SY + K + G R+ L F+ +K++ V V +EQ I N + Sbjct: 123 NSYFIMNQILSYKVQKQIDSYQAGGNREGLNFQQIKQMNVAVTVYEEQQKIANF----FS 178 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ID +E ++ + LK+ + + + I +G Sbjct: 179 LIDKKIENQQEKVEALKDYKKGMMQKIFSQAIRFKG 214 >gi|167761134|ref|ZP_02433261.1| hypothetical protein CLOSCI_03532 [Clostridium scindens ATCC 35704] gi|167661253|gb|EDS05383.1| hypothetical protein CLOSCI_03532 [Clostridium scindens ATCC 35704] Length = 413 Score = 106 bits (264), Expect = 8e-21, Method: Composition-based stats. Identities = 56/422 (13%), Positives = 125/422 (29%), Gaps = 38/422 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESG--------------KDIIYIGLEDVESGTGKYLPKDGN 69 +W + + L T + Y+ D+E + + Sbjct: 5 NWSYCRLDEYLNLLTDYDANGSFADMAANVHTEWGHGYAWYVRATDLEQKLPLSEVRYAD 64 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVLPELLQG 126 D S G++L K G + + +L+ L Sbjct: 65 KSSYDFLKKSSLFGGELLMAKRGEIGKVYFFEMKTKYATLAPNLYLLKLNDKADGRFLYY 124 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + LS + +RI+AI ++ + + +P EQ I + I L Sbjct: 125 YFLSKEGQKRIKAINASTSLGAIYKDDVKGLLVPSIRKKEQENIAASLSDVDTLITDLQK 184 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + ++ + Q LV+ K + G + + + A + Sbjct: 185 LIRKKKDIRQGTMQMLVT----------GKKRLDGYSGDWVKINLAKNSKLKARIGWQGL 234 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY-----QIVDPGEIVFRFIDLQND 301 + ++ L G G +Y+ Y V G+++ Sbjct: 235 TTAEYLDEGYSFLITGTDFDGGRINWNGCHFVNYDRYAQDPNIQVSNGDLLLTKDGTIGK 294 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFE 360 + + + + ++ +++ S + +G L + Sbjct: 295 VAYVTDLKRPATLNSGVFLVKPITDAYVAHFMFYVLESSVFKDFLQQLSAGSTINHLYQK 354 Query: 361 DVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 D+ + + VPP +EQ I ++ + I L E+ + +E + + +TG++ Sbjct: 355 DLVKFDLYVPPTKEEQEAIATILFDMDSDIHKL----EEKLYKYQEIKQGMMEELLTGKV 410 Query: 420 DL 421 L Sbjct: 411 RL 412 >gi|308183527|ref|YP_003927654.1| putative type I restriction enzyme specificity protein [Helicobacter pylori PeCan4] gi|308065712|gb|ADO07604.1| putative type I restriction enzyme specificity protein [Helicobacter pylori PeCan4] Length = 382 Score = 106 bits (264), Expect = 8e-21, Method: Composition-based stats. Identities = 38/411 (9%), Positives = 107/411 (26%), Gaps = 47/411 (11%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P +W+ V + ++ G + ++ ++ + D+ + Sbjct: 6 LPLNWQRVRLGDIAEIKRGASPRPIENPKWFCANSNVGWVRISDISKNSRFLYKTAQKLS 65 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + + + ++ + I I + PK L + Sbjct: 66 KKGIEKSRLVKQNSLIMSMCATIGKPIITKIDTCIHDGFVVFENPKIDLN---YLYYFLC 122 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIR 190 + + + + + + I N + P L EQ+ I + + +L ++ Sbjct: 123 YIEKEWLESGQQGSQVNLNVDLIKNKEVFYPKDLNEQIAIANILSDVDRYLYSLDALILK 182 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + K L+S + + W+ + Sbjct: 183 KEGVKKALSFELLSQ----------------RKRLKGFNQAWQRVRLGDICEITTGSLDA 226 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + ++ + I Sbjct: 227 NEMVHYGKYRFYTCAKEYYFIDKYAFDTEAI-------------LISGNGAYVGYVHYYK 273 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + Y+ ++ + + + + G + +K +L+P Sbjct: 274 GKFNAYQRTYVLDNFSEHI-IFVKYFLTMFLQSHIQTNRNEGNTPYIVMGTLKDFEILLP 332 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P+ EQ I N+++ I L K Q + + + ++ +I + Sbjct: 333 PLNEQIAIANILSDLDHEIISLKNKKRQ----FENIKKALNHDLMSAKIRV 379 >gi|239994804|ref|ZP_04715328.1| restriction modification system DNA specificity domain protein [Alteromonas macleodii ATCC 27126] Length = 403 Score = 106 bits (264), Expect = 9e-21, Method: Composition-based stats. Identities = 60/401 (14%), Positives = 121/401 (30%), Gaps = 31/401 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + R S +Y Y + + + I G Sbjct: 19 WERKVFGSGVEPYIERVDSSTDLPVYSSSRAGLLAQESYFSNRRVTNEGE---YGIVPYG 75 Query: 85 QILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPEL-LQGWLLSIDVTQRIEAI 140 +Y + ++ S ++ V +D S D + Sbjct: 76 YFVYRHMSDDLTFMFNINDVSPKIAVSKEYPVFCVRDWDARFIRYKLNYSNDFKKFAATQ 135 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G T + +K + IP + EQ I + + A +I L + + K Q Sbjct: 136 KLGGTRTRLYFKNLCLWETLIPNIREQQKIADFLSAVDEKITLLKEKYALLQQYKKGVVQ 195 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L S + G W PF + RKN + + + Sbjct: 196 KLFSQENRFKDDD------------GQAFPDWIELPFAECFERVTRKNKIDNRNVLTISA 243 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND-KRSLRSAQVMERGIITSA 319 +I + + N + + Y ++D GE + + +++ + G++++ Sbjct: 244 QHGLINQEKYFNKSVAAANLTGYYLLDKGEFAYNKSYSKGYPMGAIKRLNNYDLGVVSTL 303 Query: 320 YMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGS-GLRQS--LKFED---VKRLPVLVPPI 372 Y+ K + L + + G R L + + V+VP I Sbjct: 304 YICFKSKHEQIDEFWEQFFEGGMLNRQISKIAQEGARNHGLLNISVTEFFEDIKVMVPSI 363 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I N + ++D ++Q I L + + + Sbjct: 364 EEQRKIANFLQALDKKLDA----VQQQIDLTQTFKKGLLQQ 400 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 58/150 (38%), Gaps = 7/150 (4%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 E+ + + Y IV G V+R + + V + ++ Y Sbjct: 55 ESYFSNRRVTNEGEYGIVPYGYFVYRHMS-DDLTFMFNINDVSPKIAVSKEYPVFCVRDW 113 Query: 329 DSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 D+ ++ + + + K F A G R L F+++ L+P I+EQ I + ++ Sbjct: 114 DARFIRYKLNYSNDFKKFAATQKLGGTRTRLYFKNLCLWETLIPNIREQQKIADFLSAVD 173 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + +++ LL++ + + + Sbjct: 174 EK----ITLLKEKYALLQQYKKGVVQKLFS 199 >gi|284108343|ref|ZP_06386407.1| Restriction modification system DNA specificity domain [Candidatus Poribacteria sp. WGA-A3] gi|283829904|gb|EFC34190.1| Restriction modification system DNA specificity domain [Candidatus Poribacteria sp. WGA-A3] Length = 393 Score = 106 bits (263), Expect = 9e-21, Method: Composition-based stats. Identities = 66/411 (16%), Positives = 133/411 (32%), Gaps = 39/411 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + +LN G++ V+S +P G++ + T +I Sbjct: 2 SGWQTKRLGDVLQLNYGKSLP------------VKSRVEGPIPVYGSNGVVGSHTEAIVD 49 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ++ G+ G + + T F V P+ +L + + I Sbjct: 50 APGLIVGRKGSAGQVHLSRGPFCPIDTTFYVTANDA--PDTDLEFLFYLLQHINLTRIIG 107 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + + P + +KI + I + R I+ E K+ L Sbjct: 108 DVGVPGLNREMAYMEQVRFPVTLSEQ---KKIAHILSTVQRAIEAQERIIQTTTELKKTL 164 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + T+G K + I GL+P+ WEV P A+ N K Sbjct: 165 MHKLFTEG-TRGEPQKQTEI---GLIPESWEVMPLGAIAKIGNGSTPKRSNVGYWEYGNI 220 Query: 263 NIIQKLETRNMGLKPES---------YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + + V P ++ K SA V Sbjct: 221 PWLNSTKIHELFVAEADQFVTPLAVKECHLPRVAPNSLLIAITG--QGKTLGNSAIVRFE 278 Query: 314 GIITSA--YMAVKPHGIDSTYLAWLMRS-YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 I Y I ++ W M++ YD + G + +L +K + +P Sbjct: 279 TCINQHLAYAQFHSEKIIPDFVLWFMQTRYDFLRSIAQAGGSTKGALTCGYLKTHLIPIP 338 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ +I N+ +++ + I + L++ + + +T +I + Sbjct: 339 EKNEQNEIVNI----FGQLENKQKVITRKRAFLQDIFRTLLHNLMTAKIRV 385 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 73/219 (33%), Gaps = 26/219 (11%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKY 63 K + IG IP+ W+V+P+ K+ G T + +I ++ + Sbjct: 179 KQTE---IGLIPESWEVMPLGAIAKIGNGSTPKRSNVGYWEYGNIPWLNSTKIHELFVAE 235 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQ--FLVLQPKDV 119 + + A +L G L + I F+ + + + + Sbjct: 236 ADQFVTPLAVKECHLPRVAPNSLLIAITGQGKTLGNSAIVRFETCINQHLAYAQFHSEKI 295 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 +P+ + ++ + R A G+T + +PIP EQ I Sbjct: 296 IPDFVLWFMQTRYDFLRSIAQAGGSTKGALTCGYLKTHLIPIPEKNEQNEIVN------- 348 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 I ++ + + +K+A + I L+ + K Sbjct: 349 -----IFGQLENKQKVITRKRAFLQDIFRTLLHNLMTAK 382 >gi|194333152|ref|YP_002015012.1| restriction modification system DNA specificity domain [Prosthecochloris aestuarii DSM 271] gi|194310970|gb|ACF45365.1| restriction modification system DNA specificity domain [Prosthecochloris aestuarii DSM 271] Length = 456 Score = 106 bits (263), Expect = 9e-21, Method: Composition-based stats. Identities = 65/465 (13%), Positives = 144/465 (30%), Gaps = 61/465 (13%) Query: 1 MKHYKA-------------YPQY-----------KDSGVQWIGAIPKHWKVVPIKRFTK- 35 M +++ YP+Y +DS G + WK + + + Sbjct: 1 MSNFQRGADIPVRHSEGNGYPEYNGGLENPPSVERDSES---GRDMRDWKKTTVGKVSTG 57 Query: 36 LNTGRTSESGK------DIIYIGLEDVESG-TGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +G T + + +I +I + + K + + I K I++ Sbjct: 58 FLSGGTPSTSRADYWKGEIPWITSKWLGDKLELTTGEKFVSEEAIKNTATKIVPKDSIIF 117 Query: 89 GKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 + K I D + +++ ++ + L L + Q + GAT+ Sbjct: 118 AT-RVGVGKVGINRIDLAINQDLAGVLIDNENYDIKFLAYQLGIDSIQQYVAMNKRGATI 176 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 + I + IPPL EQ I + + I + R I+ E K+AL+ + Sbjct: 177 KGITRDCLEQIQLNIPPLPEQKKIAHIL----STVQRAIEAQERIIQTTTELKKALMHKL 232 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN------TKLIESNILSLS 260 T+GL + + +GLVP+ WEV + + + I + Sbjct: 233 FTEGLRNEPQK----ETEIGLVPESWEVCKVGDVAKIQSGGTPSRDVPENWRDGTIPWVK 288 Query: 261 YG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 G + K + + Q+ G ++ + + + Sbjct: 289 TGEINYCVIKDTEEKITPTGLANSAAQLFPTGTLLMAMYGQGITRGKVGLLGIEAATNQA 348 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 A + S+ + + + ++++ ++ P+ P +EQ Sbjct: 349 CASIIPIDQDQISSVFLYYFFEFQYENLRQLGHGANQRNMSAGLIRGFPLSFPKFEEQAA 408 Query: 378 -ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I +D E+ + + + + + + Sbjct: 409 MIAAF-----ESLDKKRYFHERKRTQFQGLFRTLLHELMNAKTRV 448 >gi|21282122|ref|NP_645210.1| specificity determinant HsdS [Staphylococcus aureus subsp. aureus MW2] gi|49485300|ref|YP_042521.1| putative restriction and modification system specificity protein [Staphylococcus aureus subsp. aureus MSSA476] gi|297209066|ref|ZP_06925466.1| specificity determinant HsdS [Staphylococcus aureus subsp. aureus ATCC 51811] gi|300911068|ref|ZP_07128517.1| specificity determinant HsdS [Staphylococcus aureus subsp. aureus TCH70] gi|21203558|dbj|BAB94258.1| probable specificity determinant HsdS [Staphylococcus aureus subsp. aureus MW2] gi|49243743|emb|CAG42168.1| putative restriction and modification system specificity protein [Staphylococcus aureus subsp. aureus MSSA476] gi|296886337|gb|EFH25270.1| specificity determinant HsdS [Staphylococcus aureus subsp. aureus ATCC 51811] gi|300887247|gb|EFK82443.1| specificity determinant HsdS [Staphylococcus aureus subsp. aureus TCH70] Length = 419 Score = 106 bits (263), Expect = 9e-21, Method: Composition-based stats. Identities = 61/408 (14%), Positives = 139/408 (34%), Gaps = 29/408 (7%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 20 EWEEKKLGDLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVEIHANLNQHVCIIRLKKEYYYNFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRIDTLITERIR 190 ++I G + ++K I N+ + P + E Q I E I +I+ + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGEFISKLDRQIELEEQKLEL 199 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + K Q + S + D + + + ++ Sbjct: 200 LQQQKKGYMQKIFSQELRFKDEEGKDYPDWKSKSIQEIFENKGGTALETEFNFDG----- 254 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 ++S+ +I +N+ + I+ G++ D D + + + Sbjct: 255 --NYKVISIGSYSINSTYNDQNIRVNKNKKTEKYILSKGDLAMVLNDKTKDGKIIGRSIF 312 Query: 311 MERG---IITSAYMAVKPHGIDSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRL 365 +++ I + P + W + + DL K+ M + + + +K + Sbjct: 313 IDKDNQYIYNQRTERLIPFAENDNKFLWFLMNTDLIRNKIKGMMQGATQVYINYSSIKLI 372 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +P ++EQ I + V + + K I LKER+ +F+ Sbjct: 373 SIQLPLLEEQQKIRGFLEV----LSGITTKQLHKIDQLKERKKAFLQK 416 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 54/181 (29%), Gaps = 6/181 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E ++ + I L NI + Sbjct: 10 PELRFPGFEGEWEEKKLGDLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVEIHANLNQHVCIIRLKKEYYY 129 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETAR 388 + +L+ K+F A G R+ L F+++ L + P I +EQ I I+ + Sbjct: 130 NFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGEFISKLDRQ 189 Query: 389 I 389 I Sbjct: 190 I 190 >gi|237738767|ref|ZP_04569248.1| restriction endonuclease S [Fusobacterium sp. 2_1_31] gi|229423870|gb|EEO38917.1| restriction endonuclease S [Fusobacterium sp. 2_1_31] Length = 408 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 45/380 (11%), Positives = 117/380 (30%), Gaps = 25/380 (6%) Query: 26 KVVPIKRFTK----LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD--TSTVS 79 + K + + + + I YI +++++G + S S Sbjct: 2 EYKKTKDIVQEKFWIMPETPNFIEEGIPYITSKNIKNGFIDFKDVKYVSVDDYNRISNNR 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL--VLQPKDVLPELLQGWLLSIDVTQRI 137 K +L +G AI+ D + + +L + ++ + + + Sbjct: 62 KIKKDDMLITMIGTIGEVAIVEDEIDFYGQNLYLLRMNNEIILNKYYYYYITLNKIKRTL 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 ++ + I N+ +P+PPL Q I + T ++ L + + K+ Sbjct: 122 VEKRNTSSQGYIKAGNIENLLIPVPPLEVQEEIVRILDDYTKSVEELKEKLNAELITRKK 181 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + Y++ I +G + + + Sbjct: 182 QYSWYRDYLLKFE-------NKIKIVKLGELFEFKNGINKEKSSFGKGTPIINYVN---- 230 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGI 315 + N I + + + + V G++ F ++ S + +E + Sbjct: 231 -VYKKNKIYFEDLQGLVEATDDELIRYKVKRGDVFFTRTSETIEEIGFTSVLLEDIENCV 289 Query: 316 ITSAYMAVKP--HGIDSTYLAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + +P + Y A+ + + + R + + ++ + +PP+ Sbjct: 290 FSGFLLRARPLTDLLLPEYCAYCFSTSSMRNAIIRKSTYTTRALINGTSLSQIEIPLPPL 349 Query: 373 KEQFDITNVINVETARIDVL 392 + Q I V++ L Sbjct: 350 EVQKRIVEVLDNFEKTCKEL 369 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 22/192 (11%), Positives = 66/192 (34%), Gaps = 12/192 (6%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-----PESYETYQ 284 ++ + K + + IE I ++ NI + Sbjct: 2 EYKKTKDIVQEKFWIMPETPNFIEEGIPYITSKNIKNGFIDFKDVKYVSVDDYNRISNNR 61 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 + +++ I + + ++ + + I + Y + + + + Sbjct: 62 KIKKDDMLITMIGTIGEVAIV--EDEIDFYGQNLYLLRMNNEIILNKYYYYYITLNKIKR 119 Query: 345 -VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + + +K +++ L + VPP++ Q +I +++ T ++ L EK+ ++ Sbjct: 120 TLVEKRNTSSQGYIKAGNIENLLIPVPPLEVQEEIVRILDDYTKSVEELKEKLNAELITR 179 Query: 404 KE----RRSSFI 411 K+ R + Sbjct: 180 KKQYSWYRDYLL 191 >gi|189501453|ref|YP_001960923.1| restriction modification system DNA specificity domain [Chlorobium phaeobacteroides BS1] gi|189496894|gb|ACE05442.1| restriction modification system DNA specificity domain [Chlorobium phaeobacteroides BS1] Length = 405 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 59/411 (14%), Positives = 134/411 (32%), Gaps = 39/411 (9%) Query: 30 IKRFTK---LNTGRTSE-------SGKDIIYIGLEDVESG--TGKYLPKDGNSRQSDTST 77 + + TG + + + +D+ SG T ++ + S+ + Sbjct: 10 LGDIVIPKGIQTGPFGSQLKAEEYTEDGVPVVMPKDICSGYLTSSFISRVSQSKANKLKK 69 Query: 78 VSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVT 134 I +G I++ + G + A + IC T L + V+ ++L V Sbjct: 70 HQI-KEGDIIFPRRGDLRRIGVARKDNTGWICGTGCLRARLNSVVHSDFLHQYVLLDSVG 128 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +E G TM + I N+P+ +P L+EQ I + + A I+ Sbjct: 129 KWLERNALGQTMLNLSTDIISNLPLTLPLLSEQKAIADLLSAWDEAIEKAERLIQEKERR 188 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + L+S + + K G + ++ + + Sbjct: 189 FRWLLRELISEPRNTRKDAEWKKVRMG--------SFLTESRIPDRENDPKKRISVRLHL 240 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + G + Y I G+ ++ ++ + ++ Sbjct: 241 RGVEVR----------EYRGTESNGATAYFIRKAGQFIYGKQNVFRGAVGIVPLELDGYS 290 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIK 373 +D ++L +L + K SG + L +++ R+ + +P Sbjct: 291 STQDIPAFDIADHVDKSWLLFLFSYTNFYKKLELYASGSGSKRLHPKELFRMKITLPTFG 350 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 EQ I ++ ID+L + K ++ + + G ++ E Sbjct: 351 EQQQIAETLSSAQYEIDLLKQLA----EKYKTQKRGLMQKMLAGTWRVKPE 397 >gi|114798271|ref|YP_761234.1| type I restriction-modification system, S subunit [Hyphomonas neptunium ATCC 15444] gi|114738445|gb|ABI76570.1| type I restriction-modification system, S subunit [Hyphomonas neptunium ATCC 15444] Length = 381 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 63/407 (15%), Positives = 126/407 (30%), Gaps = 40/407 (9%) Query: 24 HWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +W + + ++ G + ++ + +I + D + + + S Sbjct: 2 NWPLRTLDEIFEIARGGSPRPIDQFITDADDGVNWIMIGDASNSSKHIRETKKKIKPSGV 61 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL--LSIDV 133 S + G L + R I+ D G +LVL P+ + + S + Sbjct: 62 SRSRLVKPGDFLLTNSMSFGRPYIL-DTHGCIHDGWLVLSPRRANVDHDYFYHLLGSPAI 120 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + E + GAT+ + + + + + +PPL EQ I + R + Sbjct: 121 FGQFEKLAAGATVKNLNIDLVKRVIVALPPLEEQKRIAAILDQADELRRKRQRALDRLNQ 180 Query: 194 LLKEKKQALVSYIVTKGL-NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L QA+ + G ++ G G P + F V Sbjct: 181 L----GQAIFIDMFGDGASFESASLRTLGRVSTGSTPPTSDADSFGGPVP---------- 226 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + E L + ++V PG + I K + Sbjct: 227 ------FVTPGDLGSGEAVKRSLTEAGAQKSRLVGPGATLVCCIGATIGKMGQARERSAF 280 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 I + + + +RS + K S LK + ++L + VPP+ Sbjct: 281 NQQINAVDWGDRIGAAFGFFAVQQIRSLIIHK--GKGASTTLPILKKSEFEKLEIFVPPM 338 Query: 373 KEQFDITNVINV-ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ + + + + + + D L+E +S A G+ Sbjct: 339 VEQQEFAHRVGIVQCSLTDA--SLHNS---RLEELFASLQHRAFRGE 380 >gi|242243194|ref|ZP_04797639.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus epidermidis W23144] gi|242233348|gb|EES35660.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus epidermidis W23144] Length = 405 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 68/404 (16%), Positives = 136/404 (33%), Gaps = 31/404 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 K W+ + + + G+ I I ++ + G L K + S +++ Sbjct: 17 KEWEFQELGNLAQFSKGKLLSKKDLNISGIPCILYGELYTRYGAILNKVYSKTDSKKNSL 76 Query: 79 SIFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 K QIL G AI D ++ P + + ++ Sbjct: 77 VFSKKNQILIPSSGETDIDIATATAINTDLKIAIGGDLNIITPINSDGRFISLYING-KG 135 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + +G ++ H I + +P + +KI ++D I + +E Sbjct: 136 KHNLAKYAQGKSVVHLYNSDIKKLKFYLPSNNSEQ---QKIGDFFSKLDQQIELEEKKLE 192 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LL+++K+ + I ++ L + G WEV +++E +K+ Sbjct: 193 LLEQQKRGYMQKIFSQEL--------RFKDENGNAYPEWEVMKLKDILSERKEYASKIGN 244 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +LS I K + N + V + + N K + + + Sbjct: 245 YPHATLSTSGISLKSDRYNRDFLVKDKNKKYKVTIMNDI--CYNPANLKFGVITRNHIGS 302 Query: 314 GIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLV 369 I + Y+ + + S L+ D G R ++K ED + Sbjct: 303 AIFSPIYITFEVNNAHSPLFIELLVTRNDFINRVRKYEQGTVYERMAVKPEDFLNYETKI 362 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P ++EQ I N +++D ++ K Q I LK R+ + Sbjct: 363 PCLEEQEKIGNF----FSKLDKVINKQRQKIDELKLRKQGLLQK 402 >gi|225573221|ref|ZP_03781976.1| hypothetical protein RUMHYD_01412 [Blautia hydrogenotrophica DSM 10507] gi|225039353|gb|EEG49599.1| hypothetical protein RUMHYD_01412 [Blautia hydrogenotrophica DSM 10507] Length = 405 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 65/416 (15%), Positives = 143/416 (34%), Gaps = 30/416 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ V + ++ G+ + K+ Y+ +V G + D Sbjct: 2 SWEKVKLGDVSESCLGKMLDKRKNKGFYKPYLANVNVRWGAFDLENLQEMRFEDDEDERY 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKD-VLPELLQGWLLSIDVTQR 136 G ++ + G R AI + + ++ K+ + + W L Sbjct: 62 GIKYGDLIICEGGEPGRCAIWKEELPNMKIQKALHRVRVKEEMDCRYVYYWFLLAGKQGA 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++ GAT+ H + + + + PPL Q I + + I+ + I+LL+ Sbjct: 122 LKQYYTGATIMHMPEQKLKEVIIDKPPLDVQRKIGNYLESFDNLIEN----NQKQIKLLE 177 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE--- 253 E Q L P + V VP+ W + P ++ + K+ E Sbjct: 178 EAAQRLYKEWFVDLRFPGYE----DTPIVDGVPEGWAMMPLSSVFEYVRGKSYTSKELVE 233 Query: 254 ---SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL----R 306 +++L ++ Q + G+IV D+ ++R + Sbjct: 234 EGGVVMINLKNIRAFGGYNRNAEKRYEGKFKENQELFAGDIVMGVTDMTKERRLVGHVAI 293 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL 365 + E + + + P + ++L M K + +G+ LK E + + Sbjct: 294 VPDLDETMTFSMDLVKLVPLCVKKSFLYSTMFYGGYSKRISPLANGVNVLHLKPETMMNM 353 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +LVP +I ++ +E +++ + E R + ++G+I++ Sbjct: 354 EMLVPT----EEIMEQYDILFDIYQKKIETLQKQCDIATEARERLLPKLMSGEIEV 405 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 71/211 (33%), Gaps = 14/211 (6%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK--- 62 +P Y+D+ + + +P+ W ++P+ + G++ S + + G+ + + Sbjct: 192 RFPGYEDTPI--VDGVPEGWAMMPLSSVFEYVRGKSYTSKELVEEGGVVMINLKNIRAFG 249 Query: 63 -YLPKDGNSRQSDTSTVSIFAKGQILYG------KLGPYLRKAIIA--DFDGICSTQFLV 113 Y + G I+ G + AI+ D S + Sbjct: 250 GYNRNAEKRYEGKFKENQELFAGDIVMGVTDMTKERRLVGHVAIVPDLDETMTFSMDLVK 309 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 L P V L + ++RI + G + H + + N+ M +P Sbjct: 310 LVPLCVKKSFLYSTMFYGGYSKRISPLANGVNVLHLKPETMMNMEMLVPTEEIMEQYDIL 369 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVS 204 +I+TL + E + L+S Sbjct: 370 FDIYQKKIETLQKQCDIATEARERLLPKLMS 400 >gi|119357508|ref|YP_912152.1| restriction modification system DNA specificity subunit [Chlorobium phaeobacteroides DSM 266] gi|119354857|gb|ABL65728.1| restriction modification system DNA specificity domain [Chlorobium phaeobacteroides DSM 266] Length = 414 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 52/397 (13%), Positives = 115/397 (28%), Gaps = 32/397 (8%) Query: 30 IKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 I ++ TG+T S G D ++ D++ + P+ S + + Sbjct: 38 IGDLGRVLTGKTPPSVRPELFGDDHPFLTPTDIDGASRYIEPERFLSPEGRNYQQRLMLP 97 Query: 84 G-QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G + +G + K + + Q + + + + L + ++A Sbjct: 98 GRSVCVVCIGATIGKVCMTGRPSFTNQQINSVVVNEQEHDPFFVYHLMTTLRDELKANAG 157 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+ + I + +PPL Q I + I+ E +++ Sbjct: 158 GSATPIINKTAFSEIKVRVPPLPVQRRIAGILSTYDELIENSQRRIKILE----EMARSV 213 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 P + +G +P WE ++ + + ++ Sbjct: 214 YREWFVHFRFPGHENVSLVSSSLGAIPQGWEAGRLDDVLVLQRGFDLPKAKRMEGTVPIY 273 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + V +V D + + + ++ A Sbjct: 274 AATG----------VTGFHCEAKVKAPCVVTGRSGTIGDVIYV----QEDFWPLNTSLWA 319 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + Y +++ S L + +L D+ L VL+PP Q + Sbjct: 320 KGFPKSEPLYAYYVLSSVGLKQF---NSGAAVPTLNRNDLHGLDVLIPPCVLQKRFQKIA 376 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + L E I L+ R + ++GQ+ Sbjct: 377 GAMLLQTRNL----ELQIQNLRRTRDLLLPRLLSGQV 409 Score = 42.9 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 56/195 (28%), Gaps = 16/195 (8%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +GAIP+ W+ + L G K + GT G + + Sbjct: 236 LGAIPQGWEAGRLDDVLVLQRGFDLPKAKRM---------EGTVPIYAATGVTGFHCEAK 286 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G+ G + + +T P L S+ + Q Sbjct: 287 ---VKAPCVVTGRSGTIGDVIYVQEDFWPLNTSLWAKGFPKSEPLYAYYVLSSVGLKQ-- 341 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 GA + + + + + IPP Q ++ A ++ L + Sbjct: 342 --FNSGAAVPTLNRNDLHGLDVLIPPCVLQKRFQKIAGAMLLQTRNLELQIQNLRRTRDL 399 Query: 198 KKQALVSYIVTKGLN 212 L+S V N Sbjct: 400 LLPRLLSGQVNPKEN 414 >gi|2865244|gb|AAC15898.1| type IC specificity subunit [Lactococcus lactis] Length = 405 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 75/401 (18%), Positives = 149/401 (37%), Gaps = 26/401 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ K ++SE ++ Y + + K P + N + T ++ K Sbjct: 17 DWEERKFGEVWK----KSSERNLNLEYSPKQVLSVAQMKLNPSNRNEQDDYMKTYNVLHK 72 Query: 84 GQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI-- 137 G I + K + R + DGI S F V +P + ++ + + Sbjct: 73 GDIAFEGNKSKSFAFGRFVLDDLQDGIVSHVFYVYRPICKMDTDFMIVYINNESVMKYLL 132 Query: 138 -EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 +A + M+ + K I + +P L EQ I ++D I R ++LLK Sbjct: 133 VKATTKTLMMTTLNTKDIVKPKLNLPSLEEQQKIGSF----FKQLDATIALHQRKLDLLK 188 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 E+K+ + K +++ +G + ++ + L+E Sbjct: 189 EQKKGYFQKMFPKNGAKVPELRFAG---FADDWEDRKLGELASFSKGNGYTKNDLVEFGD 245 Query: 257 LSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + YG + K ET + + + I+ G V ++ + R++ V + GI Sbjct: 246 PIILYGRLYTKYETVIEKVDTFVNKKDKSIISGGSEVIVPASGESSEDISRASVVGKSGI 305 Query: 316 I--TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPI 372 I + + IDS +LA + + K G L D+K++ +L P + Sbjct: 306 ILGGDLNIIKPVNYIDSIFLALTISNGSQQKEMSKRAQGKSVVHLHNSDLKQVNILYPKL 365 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I + ++D + ++ + LKE++ F+ Sbjct: 366 GEQQKIGSF----FKQLDNTIVLHQRKLDFLKEQKKGFLQK 402 >gi|163847375|ref|YP_001635419.1| restriction modification system DNA specificity subunit [Chloroflexus aurantiacus J-10-fl] gi|163668664|gb|ABY35030.1| restriction modification system DNA specificity domain [Chloroflexus aurantiacus J-10-fl] Length = 438 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 65/433 (15%), Positives = 135/433 (31%), Gaps = 37/433 (8%) Query: 25 WKVVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W V + +N + I YI + V GT P + ++ + + Sbjct: 5 WGTVRLGDVATINPDAIGANWPFLHIRYIDISSVGEGTIIEKPSQISLSEAPSRAKRLIR 64 Query: 83 KGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSID--VTQRI 137 +G + + P R + D + ST F VL+PK + + D T + Sbjct: 65 EGDTVLSMVRPNRRSRFFVTTFEPDLVVSTGFAVLRPKPKVIHPRYLYACVFDRAFTDYL 124 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + +GA + I + +P PPL EQ I + +I+ ++ + Sbjct: 125 VSREKGAAYPAVLSEDIADAKIPFPPLPEQRAIAHILGTLDDKIELNRRMSETLEQMARA 184 Query: 198 KKQALVSYIVT---------------KGLNPDVKMKDSG---IEWVGLVPDHWEVKPFFA 239 +A GL +G +P+ W V Sbjct: 185 LFKAWFVDFEPVRAKIECRWQRGQSLPGLPAHFYDLFPERLVDSELGEIPEGWGVGRLSE 244 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 L+ + + E L N+ + + + + G+ + I Sbjct: 245 LIELNPPRVLRKGEVA-PYLDMANMPTRGHV-PGDVVDRPFGSGTRFINGDTLLARITPC 302 Query: 300 NDKRSLRSAQVMERGII---TSAYMAVKPHGIDSTYLAWLM-RSYDLCK--VFYAMGSGL 353 + + G + ++ Y+ ++P A+ + RS + + G+ Sbjct: 303 LENGKTAFVDFLRNGQVGWGSTEYIVLRPREPLPAEFAYCLARSENFRDFAIQNMTGTSG 362 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 RQ ++ E + ++ PP + AR + L R + + Sbjct: 363 RQRVQTEAIAHYLLVAPPAPVAEAFGRTVKQLFARA----TRASCESRTLAALRDALLPK 418 Query: 414 AVTGQIDLRGESQ 426 + G+I ++ + Sbjct: 419 LIRGEIRVKDAEK 431 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/140 (25%), Positives = 57/140 (40%), Gaps = 15/140 (10%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS +G IP+ W V + +LN R G+ Y+ + ++ T ++P D R Sbjct: 227 DSE---LGEIPEGWGVGRLSELIELNPPRVLRKGEVAPYLDMANM--PTRGHVPGDVVDR 281 Query: 72 QSDTSTVSIFAKGQILYGKLGPYL--RKAIIADF-----DGICSTQFLVLQPKDVLP-EL 123 + T I G L ++ P L K DF G ST+++VL+P++ LP E Sbjct: 282 PFGSGTRFI--NGDTLLARITPCLENGKTAFVDFLRNGQVGWGSTEYIVLRPREPLPAEF 339 Query: 124 LQGWLLSIDVTQRIEAICEG 143 S + G Sbjct: 340 AYCLARSENFRDFAIQNMTG 359 >gi|210630771|ref|ZP_03296595.1| hypothetical protein COLSTE_00480 [Collinsella stercoris DSM 13279] gi|210160367|gb|EEA91338.1| hypothetical protein COLSTE_00480 [Collinsella stercoris DSM 13279] Length = 414 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 52/405 (12%), Positives = 119/405 (29%), Gaps = 28/405 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + + D+ I + Y D S + Sbjct: 19 WEQRKLGEVAHRVIRKNEGNQSDLPLTISAQHGLVDQRDYFNN--QVASRDMSGYYLLEN 76 Query: 84 GQILYGKL----GPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G+ Y K P+ + + G ST ++ P+ L + + ++ Sbjct: 77 GEFAYNKSTSGDSPWGAIKRLTKYEKGCLSTLYICFGLDQGDPDFLVTYYETNRWHGAVQ 136 Query: 139 AIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 I EGA + L +++I ++ +LIT R + L Sbjct: 137 MIAAEGARNHGLLNIAPDDFFETALTLPCLTEEQKQIGCFFTQLVSLITLHQRKYDKLCA 196 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K++++ + K +++ G D WE + ++ + Sbjct: 197 VKKSMLDKMFPKPGETKPEIRFDG------FTDPWEQRKLGSVAASFDYGLNAAATEYDG 250 Query: 258 SLSYGNIIQKLETRNMGLKPE--------SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 Y I +T + LK + + +++ G+++F K L Sbjct: 251 QNKYLRITDIDDTTHEFLKSDLTTPLADLAMSADYLLEEGDLLFARTGASVGKTYLYRQY 310 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVL 368 A D ++ + K + + ++ ++ Sbjct: 311 DGTVYFAGFLIRARIGESADPEFVYQATLTDAYKKYVAITSQRSGQPGVNAQEYADYQLM 370 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P EQ I + +D + ++ + LL+ + S + Sbjct: 371 LPSRTEQQQIGMTL----RSLDNFITLHQRKLNLLRNTKKSLLDK 411 >gi|153815629|ref|ZP_01968297.1| hypothetical protein RUMTOR_01865 [Ruminococcus torques ATCC 27756] gi|145847060|gb|EDK23978.1| hypothetical protein RUMTOR_01865 [Ruminococcus torques ATCC 27756] Length = 380 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 58/393 (14%), Positives = 114/393 (29%), Gaps = 24/393 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 VP+ +F K + R + +DI + + + +Y K+ D +T I +G Sbjct: 6 VPLGKFIKEYSERN-KGNEDIPVYSVTNSQGFCTEYFGKE--VASQDKTTYKIVPQGYFA 62 Query: 88 YGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAICEGA 144 Y + + I S + V + + + L D+ Q I+A G+ Sbjct: 63 YNPSRINVGSVDWQRYEKRVIVSPLYNVFSVSEGIDRQYLYYFLRSDLGRQMIKAKASGS 122 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + +P + +Q + I E + E + + Sbjct: 123 VRDNLKLDMLKEMTIPDISVEQQKFCSSVLDKLHKLIQMRQQELQKLDEF-------IKA 175 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 V + K + + K A L N+ + Sbjct: 176 RFVEMFGDVIHNSKKWQVCLFAEITSSRLGKMLDAKQQTGRNSYPYLANFNVQWFRF--- 232 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 LE N E + G+++ + + Sbjct: 233 --NLENLNKMDFDEKDRAEFELREGDLLVCEGGEIGRCAVWHNELQPCFFQKALHRVRCN 290 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 I YLAW R F A+ L +K+L V VPP++ Q + Sbjct: 291 HQIILPDYLAWWFRYNCDYGGFSALAGAKATIAHLPGAKLKQLQVAVPPMELQEQFAVFV 350 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 A+ D +++++ + S + Sbjct: 351 ----AQTDKSKVAVQKALDEAQLLFDSLMQEYF 379 >gi|60680614|ref|YP_210758.1| putative modification protein of type I restriction-modification system [Bacteroides fragilis NCTC 9343] gi|60492048|emb|CAH06810.1| putative modification protein of type I restriction-modification system [Bacteroides fragilis NCTC 9343] Length = 394 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 52/425 (12%), Positives = 121/425 (28%), Gaps = 48/425 (11%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKR-FTKLNTGRTSESG------KDIIYIGLEDVESGTG 61 ++K + + IP+ W + + G T G I +I ++ Sbjct: 5 KFKQTE---LCRIPEDWDIGTFADFLITFSAGATPYRGIPDNFVGTIPWISSGELNYCEI 61 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLG----PYLRKAIIADFDGICSTQFLVLQPK 117 + + +S + +++ G L G + + L + Sbjct: 62 ENTREHISSDAQKNTHLTLHKPGTFLIAITGLEAAGTRGRCAFVKTPATTNQSCLAINST 121 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 D + W +G+ + + +P+ P EQ I E + Sbjct: 122 DKMTVKYLFWFYRQWSDFLAFNFSQGSKQQSFTAEIVKRLPLYAPKYKEQEKIAEALSDV 181 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 I ++ L EKK+A++ + + L ++ H Sbjct: 182 DKLIRE--------LDTLIEKKRAVMQGTMQELLTAHRRL---------PGFVHPWRNTL 224 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 ++ + + + I R+ + Sbjct: 225 VEKCCKITTGESNTRDQIESGIYPFYIRSATVMRSNSYIF------------DCEGVITI 272 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 + + + Y+ ID + +L + +V S+ Sbjct: 273 GDGQIGKVFHYVNGKFDLHQRCYLMYDFDDIDVKFFYFLFSFFFYNRVIALSAKATVDSV 332 Query: 358 KFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + ++ + +P ++EQ I N+++ + +E IE R + +T Sbjct: 333 RRNMIAKMKINIPSTMQEQKAIANILSDM----NDGIEAIEAKRDKYIAVRQGMMQQLLT 388 Query: 417 GQIDL 421 G+I L Sbjct: 389 GKIRL 393 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 64/210 (30%), Gaps = 14/210 (6%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP------ 277 + +P+ W++ F + + T + I E ++ Sbjct: 10 ELCRIPEDWDIGTFADFLITFSAGATPYRGIPDNFVGTIPWISSGELNYCEIENTREHIS 69 Query: 278 ---ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + PG + L+ R A V + +A+ + Sbjct: 70 SDAQKNTHLTLHKPGTFLIAITGLEAAGTRGRCAFVKTPATTNQSCLAINSTDKMTVKYL 129 Query: 335 WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + + G +QS E VKRLP+ P KEQ I ++ I L Sbjct: 130 FWFYRQWSDFLAFNFSQGSKQQSFTAEIVKRLPLYAPKYKEQEKIAEALSDVDKLIRELD 189 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 IE+ ++ + +T L G Sbjct: 190 TLIEKKRAVM----QGTMQELLTAHRRLPG 215 >gi|238810328|dbj|BAH70118.1| hypothetical protein [Mycoplasma fermentans PG18] Length = 403 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 47/391 (12%), Positives = 116/391 (29%), Gaps = 26/391 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 P ++ V + + + G + S + +I + D+E G + + Sbjct: 13 PDGYEWVTLGEISSIRRGASPRPISSFLSKEGYPWIKIGDIEEGKIYLKKTKQFINEKGS 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-T 134 + KG ++ + + I I L+ + + + + Sbjct: 73 KKSVVVDKGDLILSNSMSFGKPVIADIKGCIHDGWLLIANFEKNVTSKFLYYWFLSNYSQ 132 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 T+S+ + + + + +P+ PL Q I E + I E EL Sbjct: 133 SFFLQQSSPGTISNLNSEILKKLKIPLIPLKIQEKIVEILERF------RILEAELKAEL 186 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 KQ L K ++ + + + K F + K + Sbjct: 187 EARGKQ------FDFTLTKIFNFKQYKLKKLWEI--TFWDKNFQEVEKFKQSKTSNFKYL 238 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + N + K E+ + +I + L + Sbjct: 239 FYKEIENYNDPKGDVKIITTGKEENLKINSKNYKKDIYSGEVLLIPGGGEANIKYHKGKF 298 Query: 315 IITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + + + + + + DL + + G + +++ L + +PP Sbjct: 299 VTGDNRIGQVLNKNEVATKFLYYYFLLNLDLIRKNFR--GGSIKHPFMKNILELNIPIPP 356 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++ Q I ++++ + + + I L Sbjct: 357 LETQNKIVSILDKLSEYSQEINLGLPAEIEL 387 >gi|323136163|ref|ZP_08071245.1| restriction modification system DNA specificity domain [Methylocystis sp. ATCC 49242] gi|322398237|gb|EFY00757.1| restriction modification system DNA specificity domain [Methylocystis sp. ATCC 49242] Length = 482 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 66/441 (14%), Positives = 138/441 (31%), Gaps = 42/441 (9%) Query: 20 AIPKHWKVVPIKRFT---------KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG-N 69 +P+ W +P+++ + + ++ I L DV + + Sbjct: 3 DLPQGWIEIPLEKLAGPEGLVTDGDWVESKDQDPNGEVRLIQLADVGVNEFRDRSERFLT 62 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG-----ICSTQFLVLQPKDVLPELL 124 S ++ S KG +L ++ + +A + G + +P L Sbjct: 63 SDKALELRCSFLEKGDVLIARMPDPIGRACVFPGLGQSAVTVVDVMLWRSDSALSIPAWL 122 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + S DV I G T + + +P PPLAEQ I K+ A Sbjct: 123 AFIMNSPDVRASILTETSGTTRQRISGGRLKALNIPTPPLAEQRRIVVKLNALDASSKRA 182 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG------IEWVG--------LVPD 230 + R L+ KQA+++ + L D ++ +S ++ +G +P Sbjct: 183 RADLDRIPALVARAKQAILAKAFSGELTADWRLHNSEKSVSALLDEIGVDAISSSVPLPR 242 Query: 231 HWEVKPFFALVTELNR------KNTKLIESNILSLSYGNIIQK---LETRNMGLKPESYE 281 W + ++ + L N+ + L+ L Sbjct: 243 GWAWVLAGEICEVKGGLALGKKRSQDVELVEKPYLRVANVQRGWLTLDQIKTVLVTPDEA 302 Query: 282 TYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + G+I+ D R + I + ++ + + Sbjct: 303 RSLELKAGDILMNEGGDRDKLGRGWVWEGQIAGCIHQNHVFRLRLRSGKIEPKFISIYAN 362 Query: 341 DL-CKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 F + S+ ++ LP+ + E +I + I A+ID + + Sbjct: 363 AFGQDYFLDQGKQTTNLASISMSKIRALPLPLASPDEMCEIFHRIESAFAKIDRIAAEAA 422 Query: 398 QSIVLLKERRSSFIAAAVTGQ 418 + LL + ++ A G+ Sbjct: 423 SASKLLDRLDQALLSKAFRGE 443 >gi|73661361|ref|YP_300142.1| restriction endonuclease S subunit [Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305] gi|72493876|dbj|BAE17197.1| putative restriction endonuclease S subunit [Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305] Length = 411 Score = 106 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 59/401 (14%), Positives = 124/401 (30%), Gaps = 22/401 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK +K + G + E K++ + + + + + S + K Sbjct: 19 EWKKKRLKDIVEPLKGNSGE-NKNLPVLTISAKKGWLNQKERFSQVIAGNSLSKYNELKK 77 Query: 84 GQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-----T 134 G + Y K Y + + + + +PK + Sbjct: 78 GDLSYNKGNSKVALYGIVYKLGFDNALVPNVYKSFRPKPNNVSDFLEKYFHTKILDRQLR 137 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I + + + N+ + IP EQ I + ++D I + + Sbjct: 138 RVITSTARMDGLLNISDYDFYNMSLNIPVNNEQKKIGDF----FSKLDQQIELEEKKLAK 193 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+E+K+ + I ++ L + + EW + + +K Sbjct: 194 LEEQKKGYMQKIFSQELRFKDENGNDYPEW--EEINLGSLYKKGKAGGTPKSTESKYYNG 251 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + LS +I ++ + N K + E + I+ + Sbjct: 252 KVPFLSISDITKQGKFLNTTEKKITQEGLDNSTAWLVPVNSINYAMYASVGYLSINKIEV 311 Query: 315 IITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPI 372 + A + + YL + + V MG+G + +L +K + V VP Sbjct: 312 ATSQAIFNMVFEDYNLVEYLYYYLNYIRDKGVLEKLMGTGTQSNLSASIMKNITVKVPSK 371 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 E + + D L+E + LLKER+ + Sbjct: 372 NEIIKTSKFLGNV----DELIETQSSKVELLKERKEGLLQK 408 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 65/168 (38%), Gaps = 10/168 (5%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + E + + S Y + G++ + + + + + ++ + Y + Sbjct: 52 GWLNQKERFSQVIAGNSLSKYNELKKGDLSYNKGNSKVALYGIVYKLGFDNALVPNVYKS 111 Query: 323 VKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQ-----SLKFEDVKRLPVLVPPIKEQF 376 +P + S +L + L + + + + ++ D + + +P EQ Sbjct: 112 FRPKPNNVSDFLEKYFHTKILDRQLRRVITSTARMDGLLNISDYDFYNMSLNIPVNNEQK 171 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 I + +++D +E E+ + L+E++ ++ + ++ + E Sbjct: 172 KIGDF----FSKLDQQIELEEKKLAKLEEQKKGYMQKIFSQELRFKDE 215 >gi|315169212|gb|EFU13229.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX1341] Length = 411 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 64/399 (16%), Positives = 129/399 (32%), Gaps = 18/399 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + T+ S + + ED+ +G G+ S++ D F Sbjct: 18 EDWEQRKLIDLVVRLNKSTNSSR--LPKLEFEDIVAGEGRL--NKDVSQKFDNRKGIEFL 73 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ILYGKL PYL+ + A F G+ F V + K+ + + + + + Sbjct: 74 PNDILYGKLRPYLKNWLKATFTGVALGDFWVFRVKNSDSDFIYSLIQADRYQKAANDTSG 133 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ-A 201 K G + L EQ I I + + L Q Sbjct: 134 TKMPRSDWKKVSGTVFYVPNDLKEQQKIGTLFKQIDDAITLHQRKLDQLKNLKNAFLQLM 193 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 VS P ++ + EW + ++ L+LS Sbjct: 194 FVSNSPENSTVPKLRFANFTEEWELCGFFDTIENTIDFRGRTPKKLGLDWSDNGYLALSA 253 Query: 262 GNIIQKLETRNMGLK------PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 N+ N+ + + + + + G+++F + + Sbjct: 254 LNVKHGYIDSNIDAHYGNQELYDKWMSGKELRKGQVLFTTEAPMGNVAQI-PDNTGYILS 312 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVP-PIK 373 + K + I +LA L+ S + + G + + + + +L V +P I Sbjct: 313 QRTIAFETKKNRITDDFLAVLLGSPKIFNELSSLSSGGTAKGISQKSLSQLRVQIPCSIS 372 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ +I +++ + + + LKE + S++ Sbjct: 373 EQKEIGIF----FKQLNETITLHQNKLDQLKELKKSYLQ 407 >gi|71065438|ref|YP_264165.1| type I restriction modification system methylase [Psychrobacter arcticus 273-4] gi|71038423|gb|AAZ18731.1| probable type I restriction modification system methylase [Psychrobacter arcticus 273-4] Length = 424 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 51/423 (12%), Positives = 115/423 (27%), Gaps = 27/423 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESG--TGKYLPKDGNSRQSDTST 77 W + ++ R S + + DV G + K + D S Sbjct: 3 SDWNEDILSNIAEIIDSRHKTPVYSDSGYPMVRVVDVNGGALNLESTKKVSDDIYEDFSR 62 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G ++ ++G Y + + + C Q + L L+S V +I Sbjct: 63 GRDPQIGDLVISRVGSYGVVSYVNSNEKFCLGQNTAFIIPKINSRFLYYQLISPFVKWQI 122 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E GA K I + +PP+ EQ I + + +I+ + + Sbjct: 123 EQFVVGAVQKTISLKSIRQFQIKLPPVTEQKAIAHILGSLDDKIELNRQMNETLEAMAQA 182 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 ++ N E++ +++ + + + + Sbjct: 183 LFKSWFVDFDPVIDNALAAGNAIPDEFIERAEQRKKIERKESSDIQGLFPDEFEFTEEMG 242 Query: 258 SLSYGNII----QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + G N P+ V D N ++ +V Sbjct: 243 WIPKGWNSGTLGDFAILGNGKTSPDRAVGDIPVFGSNGKIGDCDESNRDNTIIIGRVGSY 302 Query: 314 GIITSAYMAVKP--------HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 Y + + +L + + + L ++ + Sbjct: 303 CGSLQYYPFKCWITDNAMSAEMKNKDHNIYLFQLLSRDNLNDRRTGSGQPLLNQSILRSI 362 Query: 366 PVLVPP---IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + P I E I N + K ++ L + R + + ++G++ + Sbjct: 363 KTITPSVPLIDEYSRIAN-------SFYKKINKANRNNAALAKLRDTLLPKLMSGELRIA 415 Query: 423 GES 425 + Sbjct: 416 DAA 418 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 52/186 (27%), Gaps = 18/186 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G IPK W + F L G+TS D G +G D S Sbjct: 242 GWIPKGWNSGTLGDFAILGNGKTSP-----------DRAVGDIPVFGSNGKIGDCDESN- 289 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I+ G++G Y F + + + K + +L + + Sbjct: 290 ---RDNTIIIGRVGSYCGSLQYYPFKCWITDNAMSAEMK---NKDHNIYLFQLLSRDNLN 343 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ + + +I P + + +I+ +L Sbjct: 344 DRRTGSGQPLLNQSILRSIKTITPSVPLIDEYSRIANSFYKKINKANRNNAALAKLRDTL 403 Query: 199 KQALVS 204 L+S Sbjct: 404 LPKLMS 409 >gi|28199935|ref|NP_780249.1| type I restriction-modification system specificity determinant [Xylella fastidiosa Temecula1] gi|182682689|ref|YP_001830849.1| restriction modification system DNA specificity subunit [Xylella fastidiosa M23] gi|28058066|gb|AAO29898.1| type I restriction-modification system specificity determinant [Xylella fastidiosa Temecula1] gi|182632799|gb|ACB93575.1| restriction modification system DNA specificity domain [Xylella fastidiosa M23] gi|307578972|gb|ADN62941.1| restriction modification system DNA specificity subunit [Xylella fastidiosa subsp. fastidiosa GB514] Length = 405 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 66/413 (15%), Positives = 132/413 (31%), Gaps = 39/413 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRT--SESGKDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTV 78 P+ + + + + G++ YI L V+ T K NS + + Sbjct: 13 PEGVGFMRVGELLERTSNIRWQDTQGEEFQYIDLSSVDRNTHIIRGTKTINSGTAPSRAQ 72 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVL--PELLQGWLLSIDV 133 I + +++G P L++ + + I ST + V +PK+ L P L L + Sbjct: 73 QIVRENDVIFGTTRPMLKRYCLIPSEYDGQISSTGYCVFRPKNELLLPNFLFHLLGTKAF 132 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +EA GA+ + + +P P+ Q I + + T L E Sbjct: 133 YSYVEANQNGASYPVITDEAVKAFRIPRLPVEVQAEIAKVLDTFTTLEAELEAELETRRR 192 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + + AL++ G D + L NT++ Sbjct: 193 QYQYYRDALLT--------------------FGEGTDAATRVRWVTLGEIATYANTRIQS 232 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIV---DPGEIVFRFIDLQNDKRSLRSAQV 310 + + SY + L ++ T V +I+ I K L + Sbjct: 233 VGLDASSYVGVDNLLPDTRGKVRSNFVPTSGTVIGYQANDILIGNIRPYLKKIWLAHSTG 292 Query: 311 MERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVL 368 + + + + YL +L+ S D G + + + Sbjct: 293 GTNQDVLVIRIKDEAKAMLKPRYLYYLLASDDFFTYDSQHAKGAKMPRGDKTMIMKYKIP 352 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA--AAV 415 +PP++ Q I V++ ++ + + I ++ R + AV Sbjct: 353 IPPLEVQARIVAVLDQFDTLVNDITAGLPAEIAARRQQYAYYRDRLLTFKEAV 405 >gi|327490535|gb|EGF22316.1| hypothetical protein HMPREF9395_0052 [Streptococcus sanguinis SK1058] gi|332362947|gb|EGJ40736.1| hypothetical protein HMPREF9380_0601 [Streptococcus sanguinis SK49] Length = 411 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 52/417 (12%), Positives = 118/417 (28%), Gaps = 37/417 (8%) Query: 30 IKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIF 81 + TG + + +E + + S + I Sbjct: 6 LGDIAISQTGPFGSQLHEEDYVSEGTPIVTVEHLGDTNFTHQNLPFVSEADTKRLSKYIL 65 Query: 82 AKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRI 137 +G I++ ++G R + + S + + ++ V P L + + + Sbjct: 66 IEGDIVFSRVGSIDRNVYVDKNHEGWMFSGRCIRVRADKNKVNPRYLSYYFKQNSFKKMM 125 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + GATM + K + +I + + P Q I + A +I + K Sbjct: 126 MNLAVGATMPSLNTKIMNSIELDLLPRENQDKIANILSAIDDKIQINNQINQELEAMAKT 185 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGI-----EWVGLVPDHWEVKPFFALVT----ELNRKN 248 N SG E +P+ W V +V N Sbjct: 186 LYDYWFVQFDFPDQNGKPYKSSSGKMVYNPELKREIPEGWGVTKLNEVVDLISGYPFSSN 245 Query: 249 TKLIESNILSLSYGNIIQKLETRNMG----LKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + + N+ + P + + G+++ Sbjct: 246 DYVTSGKYKLYTIKNVQDGYTVDKVDNYLDFLPSNMSDECQLRRGDLIMSLTGNVGRVGM 305 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVK 363 + V+ + + +++ RS M +G +++L D+ Sbjct: 306 VCEDDVL---LNQRVLKLNPINKTHKSFIYSFFRSDVTKAHLENMSTGTSQKNLSPIDIG 362 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 + + P ++ ++ + LVE + L + R + + GQ+ Sbjct: 363 NMMIPFPSESL---LSKFLDNLNMLENNLVENQQ-----LTQLRDWLLPMLMNGQVK 411 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 58/172 (33%), Gaps = 7/172 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-----IIYIGLEDVESG-TGKYLPKDGNSRQS 73 IP+ W V + L +G S +++V+ G T + + S Sbjct: 220 EIPEGWGVTKLNEVVDLISGYPFSSNDYVTSGKYKLYTIKNVQDGYTVDKVDNYLDFLPS 279 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSID 132 + S +G ++ G R ++ + D + + + L L P + + S Sbjct: 280 NMSDECQLRRGDLIMSLTGNVGRVGMVCEDDVLLNQRVLKLNPINKTHKSFIYSFFRSDV 339 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +E + G + + IGN+ +P P + + + + Sbjct: 340 TKAHLENMSTGTSQKNLSPIDIGNMMIPFPSESLLSKFLDNLNMLENNLVEN 391 >gi|168211073|ref|ZP_02636698.1| type-I specificity determinant subunit [Clostridium perfringens B str. ATCC 3626] gi|170710885|gb|EDT23067.1| type-I specificity determinant subunit [Clostridium perfringens B str. ATCC 3626] Length = 396 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 62/395 (15%), Positives = 138/395 (34%), Gaps = 22/395 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + F +T T+E + +Y ++ DT+ +I + Sbjct: 16 EWKDEKLGDFLMKSTDVTTEHTDIPVLTSSRRGLFLQSEYFNRE--VAAKDTTGYNILKR 73 Query: 84 GQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIE 138 G Y + G+ S ++ V K L S+ ++ + Sbjct: 74 GYFTYRHMSDDSTFHFNINRFIDIGLVSPEYPVFTTKQDLNSYFLEQHLNSSLMFSKFCK 133 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +G T + +K + N + +P L EQ I + ++D++I ++ + +E Sbjct: 134 MQKKGGTRTRLYFKVLENYKLKLPTLQEQEKIANFL----SKVDSIIEKQEKKVEYWSSY 189 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+ ++ I + + + EW ++ + + KN + NI Sbjct: 190 KKGMMQKIFKQEIRFKDENGMDYPEWKINKIENIATI---EMGFTPSTKNDEAWNGNIDW 246 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 LS + K + + P + + L + ++ ++ I Sbjct: 247 LSIAGMNSKYIYSGNKKISSEILGKRKLVPIDTLIMSFKLTIGRLAIVKKDIVTNEAICQ 306 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 Y K I + Y+ + ++ G+ +L E + + V +P ++EQ I Sbjct: 307 FY--WKSKDISNEYMYAYLSVINIQSFGCRAAKGI--TLNTESLNSIVVKLPCLEEQTKI 362 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 N + + ID +++K + + LK+ + + Sbjct: 363 ANFL----SNIDNIIDKESKKLEELKQWKKGLLQQ 393 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 80/215 (37%), Gaps = 16/215 (7%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ K EW + + T+ + +L+ S + + E N Sbjct: 6 PKLRFKGFEDEWKDE--------KLGDFLMKSTDVTTEHTDIPVLTSSRRGLFLQSEYFN 57 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DST 331 + + Y I+ G +R + + + ++ G+++ Y +S Sbjct: 58 REVAAKDTTGYNILKRGYFTYRHMSDDSTFH-FNINRFIDIGLVSPEYPVFTTKQDLNSY 116 Query: 332 YLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L + S + F M G R L F+ ++ + +P ++EQ I N + +++ Sbjct: 117 FLEQHLNSSLMFSKFCKMQKKGGTRTRLYFKVLENYKLKLPTLQEQEKIANFL----SKV 172 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 D ++EK E+ + + + +I + E Sbjct: 173 DSIIEKQEKKVEYWSSYKKGMMQKIFKQEIRFKDE 207 >gi|300958236|ref|ZP_07170386.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 175-1] gi|300315089|gb|EFJ64873.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 175-1] Length = 404 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 54/393 (13%), Positives = 120/393 (30%), Gaps = 36/393 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + +P+ + L GR G G V S K G+ D I Sbjct: 17 EWLPLGEVSALRRGRVMSKGYLTENFGPYPVYSSQTANNGKIGSINTFDFDGEYISWTTD 76 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 G + ++ K L+ +L + + + G Sbjct: 77 ------GANAGTVFYRTGKFSITNVCGLITLKSKY-SLIYKFLFYWLTIEAKKHVYSGMG 129 Query: 146 MSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + NIP+PIP LA Q I + + L E + Sbjct: 130 NPKLMSHQVENIPVPIPCPDNPEKSLAIQSEIVRILDTFSALTAELTAELNMRKKQYNYY 189 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + L+S P + M I + + I++ + Sbjct: 190 RDQLLS--FNTEDVPHLPMGQKDI---------------GEFIRGGTFQKKDFIDAGVGC 232 Query: 259 LSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + YG I T K + + G+++ ++ A + Sbjct: 233 IHYGQIYTYYGTYTEKTKTYISTALAKKCKKAQKGDLIIATTSENDEDVCKAVAWLGSED 292 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 I S+ + H ++ Y+++ ++ +G + + +++ ++ + VP ++ Sbjct: 293 IAVSSDACIYKHNLNPKYVSYFFQTEQFQNQKRQYITGAKVRRVNADNLSKILIPVPSME 352 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 Q I ++++ + + E + + I L +++ Sbjct: 353 IQERIVSILDKFDTLTNSITEGLPREIELRQKQ 385 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 14/116 (12%), Positives = 34/116 (29%), Gaps = 7/116 (6%) Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + IT+ + S +L + + L V+ Sbjct: 80 AGTVFYRTGKFSITNVCGLITLKSKYSLIYKFLFYWLTIEAKKHVYSGMGNPKLMSHQVE 139 Query: 364 RLPVLVP-------PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +PV +P + Q +I +++ +A L ++ R ++ Sbjct: 140 NIPVPIPCPDNPEKSLAIQSEIVRILDTFSALTAELTAELNMRKKQYNYYRDQLLS 195 >gi|291289375|ref|YP_003517707.1| restriction modification system DNA specificity domain [Klebsiella pneumoniae] gi|290792336|gb|ADD63661.1| restriction modification system DNA specificity domain [Klebsiella pneumoniae] Length = 382 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 46/376 (12%), Positives = 119/376 (31%), Gaps = 34/376 (9%) Query: 48 IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC 107 ++ ++V S+ K ++ G + K I Sbjct: 5 FPWLRTQEVNFCDIWDTEVKITESGVKNSSAKWIPKNCVIVAMYGATVGKIGINKIPMTT 64 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP---- 163 + +Q + + + I+++ G + ++ + + + NI +PIP Sbjct: 65 NQACANIQLNEEVAHYRYVFHFLCSQYTYIKSLGTG-SQTNINAQIVKNIKIPIPCPDNP 123 Query: 164 ---LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 LA Q I + T L E + + L++ K+ Sbjct: 124 EKSLAIQSEIVRILDKFTALTAELTAELNMRKKQYNYYRDQLLT------------FKEG 171 Query: 221 GIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 +EW +G + + K F + + + I Y ++ + Sbjct: 172 EVEWKALGEIGEFIRGKRFTKADYVEDGGISVIHYGEI----YTRYGVYTTHSLSQVRAD 227 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + G++V + + A + + I + H ++ ++++ M+ Sbjct: 228 MAASLRYAKHGDVVITDVGETVEDVGKAVAWLGDDDIAIHDHCYAFRHSLNPKFISYYMQ 287 Query: 339 SYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVP-------PIKEQFDITNVINVETARID 390 + + + + L ++ + VP +KEQ I +++ + Sbjct: 288 TDSFISEKAKYVARTKVNTLLINGFSKIMIPVPYPKDHEKSLKEQARIVEILDKFDTLTN 347 Query: 391 VLVEKIEQSIVLLKER 406 + E + + I L +++ Sbjct: 348 SITEGLPREIELRQKQ 363 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 14/156 (8%), Positives = 45/156 (28%), Gaps = 11/156 (7%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + + + ++ K + + + + + Sbjct: 16 CDIWDTEVKITESGVKNSSAKWIPKNCVIVAMYGATVGKIGINKIPMTTNQACAN--IQL 73 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQF 376 Y+ + S ++G+G + ++ + VK + + +P + Q Sbjct: 74 NEEVAHYRYVFHFLCSQYT--YIKSLGTGSQTNINAQIVKNIKIPIPCPDNPEKSLAIQS 131 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +I +++ TA L ++ R + Sbjct: 132 EIVRILDKFTALTAELTAELNMRKKQYNYYRDQLLT 167 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/230 (13%), Positives = 72/230 (31%), Gaps = 29/230 (12%) Query: 1 MKHYKAYPQYKDSGVQWI----GAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYI 51 M+ K Y Y+D Q + G + + + + G+ I I Sbjct: 153 MRK-KQYNYYRD---QLLTFKEGEV----EWKALGEIGEFIRGKRFTKADYVEDGGISVI 204 Query: 52 GLEDVESGTGKYLPKDGNSRQSDT-STVSIFAKGQILYGKLGPY----LRKAIIADFDGI 106 ++ + G Y + ++D +++ G ++ +G + D I Sbjct: 205 HYGEIYTRYGVYTTHSLSQVRADMAASLRYAKHGDVVITDVGETVEDVGKAVAWLGDDDI 264 Query: 107 CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP--- 163 + P+ + ++ + ++ G I +P+P Sbjct: 265 AIHDHCYAFRHSLNPKFISYYMQTDSFISEKAKYVARTKVNTLLINGFSKIMIPVPYPKD 324 Query: 164 ----LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 L EQ I E + +++ R IEL +++ + + + Sbjct: 325 HEKSLKEQARIVEILDKFDTLTNSITEGLPREIELRQKQYEYYRDLLFSF 374 >gi|56808772|ref|ZP_00366488.1| COG0732: Restriction endonuclease S subunits [Streptococcus pyogenes M49 591] gi|209560055|ref|YP_002286527.1| Putative specificity determinant HsdS [Streptococcus pyogenes NZ131] gi|209541256|gb|ACI61832.1| Putative specificity determinant HsdS [Streptococcus pyogenes NZ131] Length = 380 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 62/393 (15%), Positives = 122/393 (31%), Gaps = 31/393 (7%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + ++ TG++S I TG+ G++ S + Sbjct: 17 EWEEKKLGELASEIGTGKSSTLSDAI-----------TGEKYSILGSTSIIGYSKTYDYC 65 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 IL ++G S + +L I+ + Sbjct: 66 GDFILTARVGANAGNLYKYSGKVKISDN------TVFIKSDYINFLYHFLHRFDIKKLSF 119 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + NI + P L EQ I E +D L+ + + + LKE+KQ Sbjct: 120 GTGQPLIKSSELRNILISTPSLPEQEAIGE----LFQTVDQLLQLQRQKLATLKEQKQTF 175 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + + +++ G + E+ F+ T + I + Sbjct: 176 LRKMFPPQVQKVPEIRLQGFDGEWEEKKLGEISRMFSGGTPNVGIPEYYNGN-IPFIRSA 234 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 I ++ K S + ++V+ +++ + + L G I A +A Sbjct: 235 EINSDQTELSITDKGLSNSSAKLVEKNTLLYALYGATSGEVGLSRIS----GAINQAILA 290 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + P S+ + G + +L VK L + P + EQ I N Sbjct: 291 IIPEKKYSSLFIKNWLYKQKSSIIEKYLQGGQGNLSGSIVKELTIHFPSLSEQEAIGNFF 350 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +ID E+ + LK + + + Sbjct: 351 QTLDQQIDQ----SEEKLTELKALKQTLLNRLF 379 >gi|282865861|ref|ZP_06274910.1| restriction modification system DNA specificity domain protein [Streptomyces sp. ACTE] gi|282559185|gb|EFB64738.1| restriction modification system DNA specificity domain protein [Streptomyces sp. ACTE] Length = 412 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 51/412 (12%), Positives = 128/412 (31%), Gaps = 30/412 (7%) Query: 26 KVVPIKRFTKLNTGR--TSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + VP++ ++ G+ + S + Y+ + +V G +Y+ + Sbjct: 8 QWVPVRELGEVRMGKQLSPSSREAAGQFPYLRVANVHLGRIEYVDVNEMGFTPAERVTYG 67 Query: 81 FAKGQILYGKLGP---YLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G IL + R AI + + + +P + + + Sbjct: 68 LKPGDILLNEGQSLELVGRSAIYDRAEGEFCFQNTLIRFRPNGCILSAYAQVVFEHWLRS 127 Query: 136 RIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + A T ++H + P+ P Q I + + + ++ Sbjct: 128 GVFAAIAKQTTSIAHLGGDRFAALKFPLLPTGMQQRIVAVLDSLAELERRIEASIVKLRS 187 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + K S + +P +++ ++ + V + + + Sbjct: 188 VRKGIISEQFSRADVEDGSPASRLRA--LDSLADVGSGLTLGGISS------GGTLLEVP 239 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 ++ I LE +++ + P E +++ +V D R ++ Sbjct: 240 YLRVANVQDGFISTLEMKSVRVTPSDMERFRVRRDDVLVTEGGDFDKVGRGAVWDGRIDP 299 Query: 314 GIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLV 369 + + +D +L+ M S + F + S+ +K +PV Sbjct: 300 CLNQNHVFRVRCDKEVLDPHFLSLYMSSAAGRRYFLRVVKQTTNLASINSSQLKAMPVPC 359 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 PP++EQ D + + E + L+E + + ++ + Sbjct: 360 PPLEEQRRTVE----LVGSCDEQIAQEEGELTKLRELKVGLVDDLLS--RRV 405 >gi|256841218|ref|ZP_05546725.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|256737061|gb|EEU50388.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 369 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 68/389 (17%), Positives = 143/389 (36%), Gaps = 26/389 (6%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 VV + + + + ++S +D+ +GLE + ++ D N+ D + F KGQ+ Sbjct: 3 VVKLGDVARESRLKWTKSKQDVPIVGLEHLIPDEIRFDAYDINT---DNTFSKRFVKGQV 59 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPELLQGWLLSIDVTQRIEAICEGA 144 L+G+ Y RKA IA+FDGICS V++ + ++PELL + + G+ Sbjct: 60 LFGRRRAYQRKAAIAEFDGICSGDITVIEAIEGKMVPELLPFIIQTPVFFDYANRGSAGS 119 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 W+ + + +PPL EQ ++ +K+ + + +LL + + S Sbjct: 120 LSPRVKWEHLADYEFELPPLEEQKILADKL-------WAAYRLKEAYKKLLDATDEMVKS 172 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + +P K + + L K ++ + + L GNI Sbjct: 173 QFIEMVGDPRNNPKGWPTKRLSE---------LAEYSIGLTYKPEQICDDGTIVLRSGNI 223 Query: 265 IQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 K+ ++ + V +I+ + + +T Sbjct: 224 QDGKISFSDIVRVNAPIKESLFVKEDDILMCSRNGSASLVGKVAMIPDINEPMTFGAFMT 283 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 ++ YL +S D + S + + + ++ V P + ++ Sbjct: 284 IIRSAEAKYLYLYFQSQDFRERVSEGKSSTMNQITQKMLDKVEVPFPDKDVR----ETLS 339 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++ D ++ +SI + + S I Sbjct: 340 AIASQADKSKFELRKSIDAIDKVIKSLIN 368 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 67/192 (34%), Gaps = 15/192 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 PK W + + + G T + I + +++ G + + S Sbjct: 185 PKGWPTKRLSELAEYSIGLTYKPEQICDDGTIVLRSGNIQDGKISFSDIVRVNAPIKES- 243 Query: 78 VSIFAKGQILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + IL + A+I D + + + + + L + S D Sbjct: 244 -LFVKEDDILMCSRNGSASLVGKVAMIPDINEPMTFGAFMTIIRSAEAKYLYLYFQSQDF 302 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +R+ + + +TM+ K + + +P P + E + A + D E + I+ Sbjct: 303 RERV-SEGKSSTMNQITQKMLDKVEVPFPDKDVR----ETLSAIASQADKSKFELRKSID 357 Query: 194 LLKEKKQALVSY 205 + + ++L++ Sbjct: 358 AIDKVIKSLINN 369 >gi|188586601|ref|YP_001918146.1| restriction modification system DNA specificity domain [Natranaerobius thermophilus JW/NM-WN-LF] gi|179351288|gb|ACB85558.1| restriction modification system DNA specificity domain [Natranaerobius thermophilus JW/NM-WN-LF] Length = 490 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 61/444 (13%), Positives = 143/444 (32%), Gaps = 46/444 (10%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W V + + + G T + K I +E +++ + Sbjct: 26 ELPNNWAWVALDILAEEIKNGTTIKQSKTKPGIPVTRIESIQNNEIQLDRVRYIRDLDKI 85 Query: 76 STVSIFAKGQILYGKLGPYLRKAI-------IADFDGICSTQFLVLQPKDVLPELLQGWL 128 + G I+ + + + + +LP+ LQ + Sbjct: 86 KNNDYYKIGDIVLSHINSIEHVGKTALIKEDYLPLIHGMNLLRIRVNNNMILPQFLQLYT 145 Query: 129 LSIDVTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S + + + + + K + I +PI P EQ I K+ +I+ Sbjct: 146 RSYNFRKAVLKRIKMAVNQVSLNQKNLKQISIPIAPKNEQRRIVYKVDRLLSKINKAKEL 205 Query: 188 RIRFIELLKEKKQALVSYIVTKGLN----------------PDVKMKDSGIEW----VGL 227 E + ++ A++ L K + I+ + Sbjct: 206 IGEAKETFELRRAAILDKAFKGELTWREENPRVESVDTLLAKINSEKKTDIKKSPNGLYE 265 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNI---LSLSYGNIIQKLETRNMGLKPESYE--- 281 +PD+W L+ + + +I L GNI LK ++ Sbjct: 266 LPDNWCWIDLGELICHSSYGTSAKAYKDINGLPVLRMGNIKLTGSIDLNDLKYLPFDHKD 325 Query: 282 -TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA----YMAVKPHGIDSTYLAWL 336 ++ +++F + + G T A +++ I + Y+ + Sbjct: 326 VEKYKLEEYDLLFNRTNSYELVGKSAIVEPEHAGKFTYASYLIKISLFYKKILAPYICYY 385 Query: 337 MRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + S+ K + + ++ + + LPV +PP +E +I ++ +A+ + ++ Sbjct: 386 INSHIGRKYLLSTVKQQVGQANINSKKLSSLPVPLPPEEEIKEINRIMKKVSAK-ENRIQ 444 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 + + E S ++ A G+ Sbjct: 445 NLLNLGTYVAELEQSILSKAFRGE 468 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 75/215 (34%), Gaps = 11/215 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL------SYGNIIQKLETRNMG 274 E +P++W L E+ T + S N +L+ Sbjct: 20 EDEEPYELPNNWAWVALDILAEEIKNGTTIKQSKTKPGIPVTRIESIQNNEIQLDRVRYI 79 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY---MAVKPHGIDST 331 + + G+IV I+ + +I + V + I Sbjct: 80 RDLDKIKNNDYYKIGDIVLSHINSIEHVGKTALIKEDYLPLIHGMNLLRIRVNNNMILPQ 139 Query: 332 YLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L RSY+ K + SL +++K++ + + P EQ I ++ ++I Sbjct: 140 FLQLYTRSYNFRKAVLKRIKMAVNQVSLNQKNLKQISIPIAPKNEQRRIVYKVDRLLSKI 199 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 + E I ++ + RR++ + A G++ R E Sbjct: 200 NKAKELIGEAKETFELRRAAILDKAFKGELTWREE 234 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 82/228 (35%), Gaps = 15/228 (6%) Query: 18 IGAIPKHWKVVPIKR-FTKLNTGRTSESGKDI---IYIGLEDVE-SGTGKYLPKDGNSRQ 72 + +P +W + + + G ++++ KDI + + +++ +G+ Sbjct: 263 LYELPDNWCWIDLGELICHSSYGTSAKAYKDINGLPVLRMGNIKLTGSIDLNDLKYLPFD 322 Query: 73 SDTSTVSIFAKGQILYGKLGPY---LRKAIIADFDG----ICSTQFLV--LQPKDVLPEL 123 + +L+ + Y + AI+ S + K + P + Sbjct: 323 HKDVEKYKLEEYDLLFNRTNSYELVGKSAIVEPEHAGKFTYASYLIKISLFYKKILAPYI 382 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 I + + + ++ + K + ++P+P+PP E I + + + + Sbjct: 383 CYYINSHIGRKYLLSTVKQQVGQANINSKKLSSLPVPLPPEEEIKEINRIMKKVSAK-EN 441 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 I + + E +Q+++S LN + +S IE + V Sbjct: 442 RIQNLLNLGTYVAELEQSILSKAFRGELNTNDPKDESAIELLKEVLKD 489 >gi|138894435|ref|YP_001124888.1| putative type I specificity subunit HsdS [Geobacillus thermodenitrificans NG80-2] gi|134265948|gb|ABO66143.1| Putative type I specificity subunit HsdS [Geobacillus thermodenitrificans NG80-2] Length = 509 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 68/467 (14%), Positives = 143/467 (30%), Gaps = 75/467 (16%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 +PK+W ++ TG T + ++ D++ + + + + Sbjct: 27 VPKNWVWTRTGITHEIVTGSTPSKKNNEYYGGNFPFVKPGDLDQKDSVTVASEYLTDKGK 86 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSID 132 + + K L +G K + + Q L+ K + P+ + LS Sbjct: 87 EVS-RVIPKHSTLVCCIGSI-GKVGFNLVECTTNQQINSLIPNKKVIYPKYTYYFSLSSV 144 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + T+S + + +P +PPL EQ I EK+ +ID Sbjct: 145 YQNLLSKSSSSTTVSIINKSKMSKLPFALPPLNEQKHIAEKVDRLFAKIDEAKRLIEEVK 204 Query: 193 ELLKEKKQALVSYIVTKGLNPDVK------------------------------------ 216 E + ++ A++ L + Sbjct: 205 ESFELRRAAILDKAFRGELTRSWRKKNEHLVSASLMLQEIASERKRKYSDLCRLAKINGE 264 Query: 217 -----MKDSGIEWVGLVPDHWEVKPFFALVTEL-----------NRKNTKLIESNILSLS 260 + + + P H + K L E+ + + Sbjct: 265 KKPRKLYLDEVPVIEEKPRHSLPDTWTITNIGFLAHVTKLAGFEYTKYFNLTETGDVPVI 324 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDP-----GEIVFRFIDLQNDKRSLRSAQVMERGI 315 +Q E +K + E +++ GE++ FI + R Sbjct: 325 RAQNVQMGEFIESNIKYITKEVSDLLERSQVHGGEVLMVFIGAGTGNVCMAPRDNR-RWH 383 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + + I + YL ++S M + +QSL E ++ + V VPP++E Sbjct: 384 LAPNVAKITVDEILAEYLNLYLQSPIGQSYIKSKMKATAQQSLSMETIRDVLVYVPPLEE 443 Query: 375 QFDITNVINVETARIDV---LVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q++I ++ + ++ I I + S ++ A G+ Sbjct: 444 QYEIVRIVERLLDNLKNEYLILNDIHMKID---NIKQSILSKAFRGE 487 >gi|88707236|ref|ZP_01104922.1| type I restriction-modification system, endonuclease S subunit [Congregibacter litoralis KT71] gi|88698519|gb|EAQ95652.1| type I restriction-modification system, endonuclease S subunit [Congregibacter litoralis KT71] Length = 398 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 72/387 (18%), Positives = 146/387 (37%), Gaps = 24/387 (6%) Query: 29 PIKRFTKLNTGRTSESGKDI-IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + D+ Y+GLE ++ + K + S+ +F G I+ Sbjct: 12 RFDQMAVQVKEKVDPAEADVDRYVGLEHIDPESLKI--RRWGETSEVESSKILFKSGDII 69 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIEAICEGAT 145 +GK Y RK +ADFDGICS +VL+PK + ++ S R I G Sbjct: 70 FGKRRAYQRKLCVADFDGICSAHAMVLRPKTDVVLEDFLPFFMQSEIFMNRAVKISVGGL 129 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 +W+ + +PPL EQ I + + A + L E+ + Sbjct: 130 SPTINWRDLAKEEFALPPLQEQRRIVQLLSAA--------ERYQNALYDLSERGTSSRDS 181 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK--NTKLIESNILSLSYGN 263 +V + + E VG + W + P L+T + + L N Sbjct: 182 LVDHRMRGATLGATTYHERVGRYFNGWNLVPLGELLTAAQYGLSESLHGKGQYPILRMMN 241 Query: 264 IIQK----LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + L +ETY++V G+++F + + + S Sbjct: 242 LEDGKATADDLKYLDLSDSDFETYRLVS-GDVLFNRTNSYELVGRTGVYDLPGDFVFASY 300 Query: 320 YMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQ 375 + +K YL+ +R+ + + + + ++ ++KR+ V +PPI Q Sbjct: 301 LIRLKTDIDRLSPEYLSAFLRAPIGRRQVMSFATRGVSQANINASNLKRVLVPLPPIGYQ 360 Query: 376 FDITNVINVETARIDVLVEKIEQSIVL 402 ++ ++ V + + +++ + L Sbjct: 361 KEVVELLTVADSSRRWAIARLQVAREL 387 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 59/155 (38%), Gaps = 19/155 (12%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI- 328 R G E + + G+I+F K + GI ++ M ++P Sbjct: 47 IRRWGETSEVESSKILFKSGDIIFGKRRAYQRKLCVADFD----GICSAHAMVLRPKTDV 102 Query: 329 -DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +L + M+S + GL ++ + D+ + +PP++EQ I +++ Sbjct: 103 VLEDFLPFFMQSEIFMNRAVKISVGGLSPTINWRDLAKEEFALPPLQEQRRIVQLLSAA- 161 Query: 387 ARIDVLVEKIEQSIVLLKER----RSSFIAAAVTG 417 E+ + ++ L ER R S + + G Sbjct: 162 -------ERYQNALYDLSERGTSSRDSLVDHRMRG 189 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 60/183 (32%), Gaps = 12/183 (6%) Query: 23 KHWKVVPIKRF---TKLNTGRTSESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTV 78 W +VP+ + + + + ++E G K + SD T Sbjct: 206 NGWNLVPLGELLTAAQYGLSESLHGKGQYPILRMMNLEDGKATADDLKYLDLSDSDFETY 265 Query: 79 SIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGW-----LLSI 131 + G +L+ + Y + + + D G +++ K + L + I Sbjct: 266 RLV-SGDVLFNRTNSYELVGRTGVYDLPGDFVFASYLIRLKTDIDRLSPEYLSAFLRAPI 324 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 Q + G + ++ + + + +P+PP+ Q + E + I Sbjct: 325 GRRQVMSFATRGVSQANINASNLKRVLVPLPPIGYQKEVVELLTVADSSRRWAIARLQVA 384 Query: 192 IEL 194 EL Sbjct: 385 REL 387 >gi|327191124|gb|EGE58170.1| type I restriction-modification system, S subunit [Rhizobium etli CNPAF512] Length = 559 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 77/474 (16%), Positives = 147/474 (31%), Gaps = 85/474 (17%) Query: 20 AIPKHWKVVPIKRF-TKLNTGRTSESGK----DIIYIGLEDVESGTG--KYLPKDGNSRQ 72 +P W + KL G D YI ++++ + + Sbjct: 83 DLPDSWVWSRLGDILIKLTDGTHHSPDNGPVGDFRYITAKNIKEHGVALNDVTYVSSDVH 142 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLL 129 ++ + KG ILY K G I D + S+ L+ P+ +L LL +L Sbjct: 143 AEIFSRCNPEKGDILYIKDGATTGVVTINDLDEPFSMLSSVALLKLPRGLLNRLLVIFLR 202 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S ++ +GA ++ K + +P+PPLAEQ I K+ D L R Sbjct: 203 SPFFYDQMRGFMKGAAITRVTLKRMAPALLPLPPLAEQHRIVAKVDELMALCDQLEVARE 262 Query: 190 RF----------------------------------------IELLKEKKQALVSYIVTK 209 + +K+ +Q +++ V Sbjct: 263 EREAARARLAVASLARLNSPDPETFSEDARFALEALPALTARPDQIKQLRQTILNLAVRG 322 Query: 210 GLNPDVKMKDSGIEW----------VGLVPDHWEVKPFFALVTELNRK-----NTKLIES 254 L P + E+ +P W+ + N++ Sbjct: 323 KLVPQDPKDEPAEEFDEALPNALAKPFSIPSSWKWSRLSYVGKLRGGGTPSKSNSEFWRG 382 Query: 255 NILSLSYGNIIQKLETR---NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I +S ++ + ++ K + +++D G ++F + S A Sbjct: 383 EIPWVSPKDMKVDYISNAQMSISQKAVRESSVKLIDRGSLLFVVRGMIL-AHSFPVAIAQ 441 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-----VFYAMGSGLRQSLKFEDVKRLP 366 E + A+ + +R+ K G L+ D Sbjct: 442 EFVTVNQDMKALTLKK--PEMAEYFLRALKGLKPQMLARVQRSSHGT-CRLEGSDYSDFL 498 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKI----EQSIVLLKERRSSFIAAAVT 416 + +PP+ EQ I ++ + D L + E LL+ + +A A+T Sbjct: 499 MPIPPLAEQHRIVAKVDELLSLCDQLEASLMTAGEARGKLLE----ALLAEAIT 548 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/246 (12%), Positives = 71/246 (28%), Gaps = 51/246 (20%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-- 280 E +PD W ++ +L ++ + + ++ + L +Y Sbjct: 79 ELPFDLPDSWVWSRLGDILIKLTDGTHHSPDNGPVGDFRYITAKNIKEHGVALNDVTYVS 138 Query: 281 -------ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + G+I++ ++ +++S + P G+ + L Sbjct: 139 SDVHAEIFSRCNPEKGDILYIKDGATTGVVTINDLDEP-FSMLSSVALLKLPRGLLNRLL 197 Query: 334 AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +RS G + + + + +PP+ EQ I ++ A D L Sbjct: 198 VIFLRSPFFYDQMRGFMKGAAITRVTLKRMAPALLPLPPLAEQHRIVAKVDELMALCDQL 257 Query: 393 V------------------------------EKIEQSIVLL----------KERRSSFIA 412 E ++ L K+ R + + Sbjct: 258 EVAREEREAARARLAVASLARLNSPDPETFSEDARFALEALPALTARPDQIKQLRQTILN 317 Query: 413 AAVTGQ 418 AV G+ Sbjct: 318 LAVRGK 323 >gi|220908522|ref|YP_002483833.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 7425] gi|219865133|gb|ACL45472.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7425] Length = 412 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 55/426 (12%), Positives = 122/426 (28%), Gaps = 40/426 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDII--------------YIGLEDVESGTGKYLPKDG 68 W+ + +L T S +I+ I + E K Sbjct: 2 SEWEETRLGEVLELITDYHSNGSYEILKANVSLLDEEDFAVMIRTTNFEQNNFSKNLKYV 61 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQG 126 N S+ G I+ K+ + D S +++ Sbjct: 62 NKEAYFFLDKSMVFPGDIIMNKIANAGSVYFMPDLQRPVSLAMNLFLIRVNKEKANQRFV 121 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + ++ EG+ + N+ + +P L Q I + I + +I+ L Sbjct: 122 FYYLKANEAYVKQFAEGSVTKTITKNAVRNLVIRMPSLERQNEIVKIIESVESKIENLRR 181 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPD---VKMKDSG----IEWVGLVPDHWEVKPFFA 239 + Q L + P+ K SG +G VP W + Sbjct: 182 QNETLE----RIAQTLFKHWFVDFEFPNADGKPYKSSGGAMVRSELGEVPSGWRIGKLRD 237 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + +N + K E I E + + G++++ + Sbjct: 238 ITAVINGRAYKQTEFREEGTPIVRIQNLTGKGQNVYSDLILENEKYISKGDLIYAWSATF 297 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSG-LRQSL 357 + Y K + + + + + ++ G+G + + Sbjct: 298 GPYIWRGVKSIY-------HYHIWKLNCFNPAFKYYLYIHLKNVSDRVKNQGTGSIFTHI 350 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 E ++ +L+P I ++ + D ++ E I L R + ++G Sbjct: 351 TKELMESQEILIPDN---RTIECWHDLAESAFDKIMLNYE-QIATLTNTRDVLLPQLMSG 406 Query: 418 QIDLRG 423 ++ ++ Sbjct: 407 KLRVKP 412 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 68/204 (33%), Gaps = 18/204 (8%) Query: 10 YKDSGV----QWIGAIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTG 61 YK SG +G +P W++ ++ T + GR + + + ++++ +G G Sbjct: 211 YKSSGGAMVRSELGEVPSGWRIGKLRDITAVINGRAYKQTEFREEGTPIVRIQNL-TGKG 269 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + + D + KG ++Y I I + + + P Sbjct: 270 QNVYSDLILENEKYIS-----KGDLIYAWS-ATFGPYIWRGVKSI--YHYHIWKLNCFNP 321 Query: 122 ELLQG-WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 ++ +V+ R++ G+ +H + + + + IP + + + Sbjct: 322 AFKYYLYIHLKNVSDRVKNQGTGSIFTHITKELMESQEILIPDNRTIECWHDLAESAFDK 381 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 I + L+S Sbjct: 382 IMLNYEQIATLTNTRDVLLPQLMS 405 >gi|237721638|ref|ZP_04552119.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 2_2_4] gi|229449434|gb|EEO55225.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 2_2_4] Length = 464 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 59/398 (14%), Positives = 130/398 (32%), Gaps = 29/398 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV-- 78 +PK W + + G+T + + + + S Sbjct: 68 LPKGWTICSLDDLATFGGGKTPSMDNRKYWNNAKHLWITSKDMKFAHIADSLLKISDAAL 127 Query: 79 ---SIFAKGQILYGKLGPYLR---KAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSI 131 +I+ KG +L LR I D + + + + L + + Sbjct: 128 DQMTIYGKGTLLIVTRSGILRHTFPIAILDTEATVNQDVKAISCVLSHIHTYLYYVIKAQ 187 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + +G T+ D+ + +P+PPL+EQ I E+I ID + + Sbjct: 188 EQVILKDYHKDGTTVDSIDFDKFKKLIVPLPPLSEQYRIVEEIEHWFALIDQIEQGKTDL 247 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +K+ K ++ + L P +S IE + + + N Sbjct: 248 QTTIKQIKGKILDLAIHGKLVPQDPNDESAIELLKRINPDFTPCDNRHYTQLPNGWAVCR 307 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYET---------------YQIVDPGEIVFRFI 296 ++ L RN+ +K + + IVD ++ Sbjct: 308 LDQVADVLDNLRKPINSNERNLRIKGKQIDRLYPYYGATGQVGLIDDYIVDGHYLLLGED 367 Query: 297 DL-QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 DK ++++ + + + + + P +L S + + R Sbjct: 368 GAPFLDKNAIKAYSISGKSWVNNHAHILSPKID----FEFLQYSLNQIDYSEYVNGSTRL 423 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 L D++ + +++PP+ EQ I I +++D+++ Sbjct: 424 KLTQTDMRSIRLMLPPLSEQKLIKAKIQTLFSQLDMIM 461 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 65/203 (32%), Gaps = 8/203 (3%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN-------MGLK 276 +P W + L T K + + + I + + + + Sbjct: 64 ENKYLPKGWTICSLDDLATFGGGKTPSMDNRKYWNNAKHLWITSKDMKFAHIADSLLKIS 123 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + + I G ++ + E + TYL ++ Sbjct: 124 DAALDQMTIYGKGTLLIVTRSGILRHTFPIAILDTEATVNQDVKAISCVLSHIHTYLYYV 183 Query: 337 MRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 +++ + + S+ F+ K+L V +PP+ EQ+ I I A ID + + Sbjct: 184 IKAQEQVILKDYHKDGTTVDSIDFDKFKKLIVPLPPLSEQYRIVEEIEHWFALIDQIEQG 243 Query: 396 IEQSIVLLKERRSSFIAAAVTGQ 418 +K+ + + A+ G+ Sbjct: 244 KTDLQTTIKQIKGKILDLAIHGK 266 Score = 46.7 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 56/166 (33%), Gaps = 2/166 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W V + + + + + ++ + + P G + Q Sbjct: 298 QLPNGWAVCRLDQVADVLDNLRKPINSNERNLRIKGKQID--RLYPYYGATGQVGLIDDY 355 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I +L G+ G I ++ + P++ +L Sbjct: 356 IVDGHYLLLGEDGAPFLDKNAIKAYSISGKSWVNNHAHILSPKIDFEFLQYSLNQIDYSE 415 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 G+T + +I + +PPL+EQ LI+ KI ++D ++ Sbjct: 416 YVNGSTRLKLTQTDMRSIRLMLPPLSEQKLIKAKIQTLFSQLDMIM 461 >gi|254303655|ref|ZP_04971013.1| type I site-specific deoxyribonuclease restriction subunit [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148323847|gb|EDK89097.1| type I site-specific deoxyribonuclease restriction subunit [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 378 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 56/392 (14%), Positives = 132/392 (33%), Gaps = 31/392 (7%) Query: 30 IKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 +K G + S + + I ++++ + +GN + + G Sbjct: 10 LKEVATFLNGYAFKPSDWSKEGLPIIRIQNLTGTNRDFNYYNGN-----YNKKYLIENGD 64 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL L + + GI + V+ K++ + + + ++IE G+ Sbjct: 65 ILISWS-ASLGIFLWENMTGILNQHIFKVIFDKNIEIDKIYFLHCMKFLIKKIEKNIHGS 123 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 TM H I PI + Q I +K+I T I+ + E +S Sbjct: 124 TMKHITRPEFEKIKFPIYEIDIQRKISKKLIFITKIIENNKKLLNKMEE---------LS 174 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + + + K+ + +E + + + T N + + + Sbjct: 175 KSLFTKYSKNKKVVNLELEEICEFIKDGTHQTPTYVNTNENGYKFLSSKDVSKGIINWDN 234 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + + +I+ + ++ + I S + Sbjct: 235 TKYISEE----LHKELYKKIAPKKNDILLAKNGTTGIAALVDKEEIFD--IYVSLAILRL 288 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + Y+ + S + + F G+ +L +++K++ + +PPI+ Q + Sbjct: 289 KKEYNPKYILEGINSIETNQQFKKSLKGIGVPNLHLKEIKKVKIPIPPIELQNKFAERVE 348 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +I+ L +IE+SI ++ +S I+ Sbjct: 349 ----KIEKLKFEIEKSIEEAQKLYNSLISKYF 376 >gi|146318350|ref|YP_001198062.1| HsdS [Streptococcus suis 05ZYH33] gi|146320545|ref|YP_001200256.1| HsdS [Streptococcus suis 98HAH33] gi|253751503|ref|YP_003024644.1| type I restriction-modification system, specificity protein [Streptococcus suis SC84] gi|253753404|ref|YP_003026545.1| type I restriction-modification system, specificity protein [Streptococcus suis P1/7] gi|253755767|ref|YP_003028907.1| type I restriction-modification system, specificity protein [Streptococcus suis BM407] gi|145689156|gb|ABP89662.1| putative HsdS [Streptococcus suis 05ZYH33] gi|145691351|gb|ABP91856.1| putative HsdS [Streptococcus suis 98HAH33] gi|251815792|emb|CAZ51398.1| type I restriction-modification system, specificity protein [Streptococcus suis SC84] gi|251818231|emb|CAZ56035.1| type I restriction-modification system, specificity protein [Streptococcus suis BM407] gi|251819650|emb|CAR45408.1| type I restriction-modification system, specificity protein [Streptococcus suis P1/7] gi|319757930|gb|ADV69872.1| putative HsdS [Streptococcus suis JS14] Length = 419 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 47/423 (11%), Positives = 126/423 (29%), Gaps = 37/423 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKD---------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W++ + + + G++ ++ I D+++ + Sbjct: 6 WQIKSLSELGRFSRGKSKHRPRNDKKLFTNGTYPLIQTGDIKNSNLYVTKNSDYYNEFGL 65 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S ++ +G + + + + I + + L + + + + Sbjct: 66 SQSKLWKQGTLCIT-IAANIAETAILSYPMCFPDSVVGFNAHKNESSELFVYYVFELIKK 124 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I+ G+ + + + + + +P Q I + ID I + E L Sbjct: 125 EIQKTSSGSIQDNINIDYLTKLKLKVPNKDYQDRIVNLL----STIDKKILINNQINEEL 180 Query: 196 KEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELNR 246 + + L Y + PD K SG + V +P+ W VK + N Sbjct: 181 EAMAKTLYDYWFVQFDFPDENGKPYKSSGGKMVYNDQLKREIPEGWGVKQLGEICEFRNG 240 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE--------TYQIVDPGEIVFRFIDL 298 N + E+ N+ + + +V I+ + Sbjct: 241 INYEKSETGDTLSKIVNVRNISNSSTFVTTHDLDSITLDRRRIESYLVTDRTILITRSGI 300 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 R + + I + + ++ Y + + + +++ Sbjct: 301 PGATRIVS--DIPVNTIYSGFIIGATVANLNLFYYVFYHLKNIEMLMSNQSAGTIMKNIS 358 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + +++P + Q +N + ++E + L + R + + GQ Sbjct: 359 QTTLSEIRIVIPNKEIQKVFSNEVRSLL----DVIENNLKQNQELTQLRDWLLPMLMNGQ 414 Query: 419 IDL 421 + + Sbjct: 415 VKV 417 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 61/195 (31%), Gaps = 7/195 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDGNSRQSDT 75 IP+ W V + + G E + + + ++ + + D +S D Sbjct: 221 EIPEGWGVKQLGEICEFRNGINYEKSETGDTLSKIVNVRNISNSSTFVTTHDLDSITLDR 280 Query: 76 S--TVSIFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVLPELLQGWLLSID 132 + IL + G I++D + F++ L + + Sbjct: 281 RRIESYLVTDRTILITRSGIPGATRIVSDIPVNTIYSGFIIGATVANLNLFYYVFYHLKN 340 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G M + + I + IP Q + ++ + I+ + + Sbjct: 341 IEMLMSNQSAGTIMKNISQTTLSEIRIVIPNKEIQKVFSNEVRSLLDVIENNLKQNQELT 400 Query: 193 ELLKEKKQALVSYIV 207 +L L++ V Sbjct: 401 QLRDWLLPMLMNGQV 415 >gi|254780039|ref|YP_003058146.1| putative type I R-M system specificity subunit [Helicobacter pylori B38] gi|254001952|emb|CAX30209.1| Putative type I R-M system specificity subunit [Helicobacter pylori B38] Length = 373 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 55/405 (13%), Positives = 118/405 (29%), Gaps = 46/405 (11%) Query: 22 PKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P +W+ V + K + +I + + + ++ K + Sbjct: 7 PLNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYKT 64 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 S KG IL G R I +V D + + Sbjct: 65 KYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWIDNDETLVKNDFLFYAYSNVKW 124 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 E T+ N +P+PPL EQ+ I + + +L ++ + K Sbjct: 125 D---TEHTTILRLYNDNFKNTLIPLPPLNEQIAIANILSDVDRYLYSLDALILKKESVKK 181 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 L+S + + W+ + L + + Sbjct: 182 ALSFELLSQ----------------RKRLKGFNQAWQRVRLGDIANYLTSNLSAEQITQQ 225 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + ++ + N + I R L + I+ Sbjct: 226 GKIKVYDVNNFIGYTNTTFISD---------KPYISIVKDGSVGRVRILPP----KTNIL 272 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 ++ + H + +L +L+ ++D + + F+D K + +PP+ EQ Sbjct: 273 STMGALIANHKTTTEFLFYLLSNFDFKNF---TSGSIIPHIYFKDYKEKTIFLPPLNEQI 329 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 330 AIANILSDLDNEIISLKNKKSQ----FENIKKALNHDLMSAKIRV 370 >gi|229120554|ref|ZP_04249799.1| hypothetical protein bcere0016_8650 [Bacillus cereus 95/8201] gi|228662839|gb|EEL18434.1| hypothetical protein bcere0016_8650 [Bacillus cereus 95/8201] Length = 391 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 56/370 (15%), Positives = 121/370 (32%), Gaps = 11/370 (2%) Query: 52 GLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF 111 G ++ K +D ++ S T + I+Y + Sbjct: 26 GQGVIDRSERKTNNRDFLTKDSTKKTYLLTKYDDIVYNPSNLKYGAIDRNKHGQGVISPI 85 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 V D +P ++ + S + QR EG K + + + + Sbjct: 86 YVTFETDEIPSFIELIVKSENFKQRALQYEEGTVTKRQSVKPESLLCLNVVLPNSKDEQI 145 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL--NPDVKMKDSGIEWVGLVP 229 I ++D I + + LK+ KQ + + K P V+ EW Sbjct: 146 R-IGNFFKQLDDTIALHQQELTTLKQTKQGFLKKMFPKEGESTPKVRFPGFTGEWEQRKL 204 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 D + ++ I + + + L Y++++ G Sbjct: 205 DSIVDRVKSYSLSRDVETIENTGYKYIHYGDIHTKVADIIDESSNLPNIKVGNYELLEKG 264 Query: 290 EIVFRFIDLQNDKRS---LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 ++V + + + + + +A++P +DS +L +L+ S K Sbjct: 265 DLVLADASEDYQGIAAPAIITIDTPYKLVSGLHTIALRPKQVDSLFLYYLINSPIFRKFG 324 Query: 347 YAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 Y G+G+ + ++ + + P ++EQ I N +ID + + + LKE Sbjct: 325 YKTGTGMKVFGISVTNLLKFESVFPLLEEQVKIGNF----FKKIDDTIALHQCKLDALKE 380 Query: 406 RRSSFIAAAV 415 + +F+ Sbjct: 381 TKKAFLQKIF 390 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 54/197 (27%), Gaps = 17/197 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + + YI D+ + + + N Sbjct: 198 EWEQRKLDSIVDRVKSYSLSRDVETIENTGYKYIHYGDIHTKVADIIDESSNLPNIKVGN 257 Query: 78 VSIFAKGQILYGK-------LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + KG ++ + I + + + L+PK V L + S Sbjct: 258 YELLEKGDLVLADASEDYQGIAAPAIITIDTPYKLVSGLHTIALRPKQVDSLFLYYLINS 317 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + G + + P L EQV I +ID I Sbjct: 318 PIFRKFGYKTGTGMKVFGISVTNLLKFESVFPLLEEQVKIGNF----FKKIDDTIALHQC 373 Query: 191 FIELLKEKKQALVSYIV 207 ++ LKE K+A + I Sbjct: 374 KLDALKETKKAFLQKIF 390 >gi|197249026|ref|YP_002149447.1| type I restriction-modification system, endonuclease S subunit [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|197212729|gb|ACH50126.1| type I restriction-modification system, endonuclease S subunit [Salmonella enterica subsp. enterica serovar Agona str. SL483] Length = 382 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 77/397 (19%), Positives = 142/397 (35%), Gaps = 31/397 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI-IYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W++V K + R S D+ IY+GLE ++ + K K Sbjct: 5 QLPEGWQMVKFGDIAKHISKRVEPSETDLKIYVGLEHLDPDSLKI--KRHGVPADVEGQK 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQR 136 + KGQI++GK Y RK +AD+D ICS +V K V+P L ++ S R Sbjct: 63 LLVKKGQIIFGKRRAYQRKVAVADWDCICSAHAMVLEENSKMVIPGFLPFFMQSDIFMNR 122 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 AI EG+ WK + P Q+ + + + + Sbjct: 123 AVAISEGSLSPTIKWKVLAEQVFLFPSKNRQLKMLPIL--------SSCNLASLKNDAAL 174 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 E I + ++ + + E +G V + + E +I Sbjct: 175 ESLLFFRKVIFREHISKLIIRHNVSREKLGDV-------CRISTGKTPPPNEREYWEGDI 227 Query: 257 LSLSYGNIIQKLETRNMG---LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 ++ G+I N G + + E V G ++ I K ++ S + Sbjct: 228 PFITPGDISSDSLYINSGERNITHKGLEKTPSVPKGSVLLTCIGSTIGKAAIASCDLSTN 287 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 I S + + + W+ + ++ K + + + + + V VP ++ Sbjct: 288 QQINS--LICSEKILPEYLIVWIQNNLEVIKKYTGIQ--AVPIINKSTLANIDVDVPFLE 343 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 EQ + V+ D L K+++ V+L S Sbjct: 344 EQLKLVMVVREM----DSLRHKLKKKGVILTNLTKSL 376 >gi|282915752|ref|ZP_06323522.1| type-I specificity determinant subunit [Staphylococcus aureus subsp. aureus D139] gi|283768150|ref|ZP_06341065.1| predicted protein [Staphylococcus aureus subsp. aureus H19] gi|282320381|gb|EFB50721.1| type-I specificity determinant subunit [Staphylococcus aureus subsp. aureus D139] gi|283462029|gb|EFC09113.1| predicted protein [Staphylococcus aureus subsp. aureus H19] Length = 400 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 71/387 (18%), Positives = 141/387 (36%), Gaps = 18/387 (4%) Query: 30 IKRFTKLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + + + +D I I L+ +E TG+ + + + +S + F +LY Sbjct: 26 FGNLATNKSDKFNPQNEDASIDIELDCIEQNTGRLI--KIYNSKEFSSQKNKFNPQNVLY 83 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP--ELLQGWLLSIDVTQRIEAICEGATM 146 GKL PYL K G+CS++ VL+ L + + + + G+ M Sbjct: 84 GKLRPYLNKYYFTKKSGVCSSEIWVLKSTKEDKLLNLFLYYFIQTKRYSDVASKSAGSKM 143 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 ADW + NI + P L EQ I E ++D I + +ELL+++K+ + I Sbjct: 144 PRADWGLVENIRVYFPELCEQQKIGEF----FSKLDRQIELEEQKLELLQQQKKGYMQKI 199 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 ++ L + + EW ++ E ++ + N + Sbjct: 200 FSQELRFKDENGNDYPEWEKKKLKEIAYVYTGNTPSKKENIYWIKGEYVWVTPTDINNSK 259 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + L E Y+ + + ++ I LR +G AV P Sbjct: 260 NIYESEHKLTQEGYKKARQLPENTLLVTCIASIGKNAILRK-----QGSCNQQINAVVPF 314 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + + + + G Q + + L + + +EQ I ++I Sbjct: 315 ENINIDYLYYISDSLSTFMKSIAGKTATQIVNKNTFENLELYLASFEEQNKIADLI---- 370 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAA 413 + ++ L+EK ++ +K R+ + Sbjct: 371 SSLEELIEKQASKLIKMKSRKQGLLQK 397 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 69/190 (36%), Gaps = 12/190 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV------ESGTGKYLPKDGNSRQSDTST 77 W+ +K + TG T ++I +I E V + + + Q Sbjct: 216 EWEKKKLKEIAYVYTGNTPSKKENIYWIKGEYVWVTPTDINNSKNIYESEHKLTQEGYKK 275 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + +L + + AI+ G C+ Q + P + + + + +S ++ + Sbjct: 276 ARQLPENTLLVTCIASIGKNAILRK-QGSCNQQINAVVPFENIN-IDYLYYISDSLSTFM 333 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 ++I + N+ + + EQ KI ++ LI ++ + +K Sbjct: 334 KSIAGKTATQIVNKNTFENLELYLASFEEQ----NKIADLISSLEELIEKQASKLIKMKS 389 Query: 198 KKQALVSYIV 207 +KQ L+ + Sbjct: 390 RKQGLLQKMF 399 >gi|294675507|ref|YP_003576123.1| type I restriction-modification system subunit S [Prevotella ruminicola 23] gi|294473033|gb|ADE82422.1| type I restriction-modification system, S subunit [Prevotella ruminicola 23] Length = 392 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 55/393 (13%), Positives = 127/393 (32%), Gaps = 21/393 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + K G T + I + + D+ + + S Sbjct: 6 WEYKKLGEVAKFVGGGTPSKANEDYYTGNIPWATVRDMVNFNLSKTELCITDQAVKESAT 65 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +I K I+ + ++ I V+ P+ V + + + Sbjct: 66 NIIPKDTIIISTHVGLGKICLLMQDTAINQDLKGVILPQSVD--KMFFAAWYKSIADYVI 123 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + +GAT+ K + ++ +PIPP+ +Q I ++ ++ +I + ++ + Sbjct: 124 SNGKGATVKGVTMKFVNDLKIPIPPINDQQRIVAELDC----LNEMIALKQEQLKEFDKL 179 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP-FFALVTELNRKNTKLIESNIL 257 Q++ + +P K+ + +G + K V + + E L Sbjct: 180 AQSIFYNMFG---DPVTNEKEWDVIELGDKCEVTSFKRVLIEDVVDSGVPFIRGTELMAL 236 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDP-GEIVFRFIDLQNDKRSLRSAQVMERGII 316 S + + E + V G+++ I+ + + L + + Sbjct: 237 SKATKGEKIEFTLFITPEHYEQVKAISGVPAVGDLLIPSINSEGNIWILDTDEPRYYKDG 296 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 ++ V S L ++M LK ++ L ++PP+ Q Sbjct: 297 RVLWVHVNHDAYTSEALKFIMHILLKKTYSVMATGATFAELKLFVLRELKTILPPLALQQ 356 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 I I+ E +++SI ++ S Sbjct: 357 QFAEKIQA----IEAQKELVKKSIAETQQLLDS 385 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 17/166 (10%), Positives = 45/166 (27%), Gaps = 12/166 (7%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRS 304 N NI + +++ ++ + I+ I+ Sbjct: 27 NEDYYTGNIPWATVRDMVNFNLSKTELCITDQAVKESATNIIPKDTIIIST-----HVGL 81 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + +M+ I V V + + + V Sbjct: 82 GKICLLMQDTAINQDLKGVILPQSVDKMFFAAWYKSIADYVISNGKGATVKGVTMKFVND 141 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 L + +PPI +Q I ++ ++ ++ ++ + + S Sbjct: 142 LKIPIPPINDQQRIVAELDC----LNEMIALKQEQLKEFDKLAQSI 183 >gi|21227769|ref|NP_633691.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] gi|20906173|gb|AAM31363.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] Length = 412 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 51/391 (13%), Positives = 112/391 (28%), Gaps = 52/391 (13%) Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 L G + K D + + + +L I+ G Sbjct: 2 LVALYGATIGKLAFLGVDAATNQAVCAIFKNGIFESKFLYYLFFHRKQDLIKEAIGG-AQ 60 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 + + N+ + I PL EQ I KI ++ I E LK +QA++ Sbjct: 61 PNISQTILKNLEVTICPLPEQRAIVSKIEQLFSELENGIANLKLAKEQLKVYRQAVLKKA 120 Query: 207 VTKGLNPDV------------------------------------KMKDSGIEWVGLVPD 230 L + +E + +P Sbjct: 121 FEGELTKKWREQQTDLPDAGGLLEQIRKEKEKAAKKAGKKLKQVKPFTEDELEDLNRLPK 180 Query: 231 HWEVKPFFALVTELNRKNT--KLIESNILSLSYGNIIQK-LETRNMGLKPESYE-TYQIV 286 W L + + ++ L GNI + ++ + E ++ Sbjct: 181 EWNWVKIGNLTLGVEYGTSAKSKESGDVAVLRMGNIQNGRFDWSDLVYTSDKTEIEKYLL 240 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCK 344 +++F + + + I + + + YL + + + Sbjct: 241 SKDDVLFNRTNSPELVGKTAIYKGEKPAIFAGYLIRINQLSELAVADYLNYFLNCHIAKV 300 Query: 345 VFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++ + + ++ E + P + + EQ I I + D + + IE ++ Sbjct: 301 HGNSVKTDGVNQSNINGEKLGNYPFPLCSLPEQQTIVQEIETRLSICDKIEQDIETNLEK 360 Query: 403 LKERRSSFIAAAVTGQI-------DLRGESQ 426 + R S + A G++ ++RG Sbjct: 361 AEALRQSILKKAFEGKLLNERELAEVRGAED 391 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 74/208 (35%), Gaps = 11/208 (5%) Query: 14 GVQWIGAIPKHWKVVPIKRF---TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 ++ + +PK W V I + T S+ D+ + + ++++G + S Sbjct: 171 ELEDLNRLPKEWNWVKIGNLTLGVEYGTSAKSKESGDVAVLRMGNIQNGRFDWSDLVYTS 230 Query: 71 RQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVL----PEL 123 +++ + +K +L+ + + AI +L+ + L Sbjct: 231 DKTEIEKY-LLSKDDVLFNRTNSPELVGKTAIYKGEKPAIFAGYLIRINQLSELAVADYL 289 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 I +G S+ + + +GN P P+ L EQ I ++I D Sbjct: 290 NYFLNCHIAKVHGNSVKTDGVNQSNINGEKLGNYPFPLCSLPEQQTIVQEIETRLSICDK 349 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGL 211 + + +E + +Q+++ L Sbjct: 350 IEQDIETNLEKAEALRQSILKKAFEGKL 377 >gi|146291271|ref|YP_001181695.1| restriction modification system DNA specificity subunit [Shewanella putrefaciens CN-32] gi|145562961|gb|ABP73896.1| restriction modification system DNA specificity domain [Shewanella putrefaciens CN-32] Length = 399 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 62/421 (14%), Positives = 130/421 (30%), Gaps = 49/421 (11%) Query: 21 IPKHWKVVPIKRFTK-------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P WK ++ K +N+ + V G + Sbjct: 2 VPNGWKDGRVRDLIKSLNAGVSVNSEDDGNLNSSYKILKTSCVSKGVFDPNETKSVVEEI 61 Query: 74 DTSTVSIFAKGQ-ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD------VLPELLQG 126 + S + G I+ ++ + +L + + G Sbjct: 62 EISRLKEPVLGDSIIISRMNTPALVGANGYIENGIDNTYLPDRLWQAKPKSNDVNMKWLG 121 Query: 127 WLLSIDVTQRIEAIC---EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + T+ + +M + + NI + IPPL EQ I + + D Sbjct: 122 YWFASSHTRYTLSSTATGTSGSMKNITKSDVLNIKIDIPPLPEQRKIAKIL----STWDK 177 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 I+ R I+ K++K+AL+ ++T + DSG + G K Sbjct: 178 AISTTERLIDNSKQQKKALMQQLLTA---KKRLLDDSGKPFEGEWTKVELGKLLDYKQPT 234 Query: 244 LN--RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + E +I L+ G + E I D +F+D Sbjct: 235 PYLVKSTDYSNEYSIPVLTAGKTFILGYSNENFGIFEEELPAIIFDDFTTASKFVDFPFK 294 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 +S ++ + ++ ++ ++ G Q Sbjct: 295 AKSSAMKILVAKQGVSIKFVYEAMQVLNYPV-------------------GGHQRHWISI 335 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L + +P + EQ I +V+ +E +EQ + LK+ + + + +TG+ + Sbjct: 336 FANLVIGLPSLLEQQKIASVLTNADKE----IELLEQQLADLKQEKKALMQQLLTGKRRV 391 Query: 422 R 422 + Sbjct: 392 K 392 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 62/183 (33%), Gaps = 9/183 (4%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N+ +S G + S ++ I+ R + Sbjct: 33 NSSYKILKTSCVSKGVFDPNETKSVVEEIEISRLKEPVLGDSIIISRMNTPALVGANGYI 92 Query: 308 AQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDV 362 ++ + KP + ++ +L + S + +G +++ DV Sbjct: 93 ENGIDNTYLPDRLWQAKPKSNDVNMKWLGYWFASSHTRYTLSSTATGTSGSMKNITKSDV 152 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + + +PP+ EQ I +++ D + E+ I K+++ + + +T + L Sbjct: 153 LNIKIDIPPLPEQRKIAKILSTW----DKAISTTERLIDNSKQQKKALMQQLLTAKKRLL 208 Query: 423 GES 425 +S Sbjct: 209 DDS 211 >gi|47779388|gb|AAT38617.1| predicted type I site-specific deoxyribonuclease specificity subunit [uncultured gamma proteobacterium eBACHOT4E07] Length = 405 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 49/409 (11%), Positives = 117/409 (28%), Gaps = 25/409 (6%) Query: 24 HWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 WK + + +S + I + ++ G K +T Sbjct: 2 SWKTYKLSELCNVFADGDWIESKDQSPEGIRLLQTGNIGVGVFKEREDKARYVSEETFKR 61 Query: 79 S---IFAKGQILYGKLGPYLRKAIIADFDG-----ICSTQFLVLQPKDVLPELLQGWLLS 130 +G +L +L + + + + ++ V L+ ++ S Sbjct: 62 LNCEEVFEGDLLISRLPEPVGRGCLIPSITSRAITAVDCTIIRVKSDLVDKRYLEYFIQS 121 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 I++ G T K +G I + + L Q I EK+ A ID I+ Sbjct: 122 QQYQTEIQSKVTGTTRQRISRKNLGEISIVLTSLPVQKQIVEKLDAAFSDIDKAISATEM 181 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 IE + ++ + + + + + + Sbjct: 182 NIENAETLFSRILIQSFEEKIEGSIYKTLQDVSIDFSRGKSKHRPRNDPNLFGGHY---- 237 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I + + N + + + + E ++ + + L Sbjct: 238 ---PFIQTGNVANSSKFITHYDKSYNEKGLEQSKLWSKNTVCITIAANIAECGILNFDAC 294 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 II S+ + + SY + +Q++ ++ P Sbjct: 295 FPDSIIG----ITVDQKQTSSEYVFYLLSYFKDFIQSKSKGAAQQNINLGTFEKEKFPFP 350 Query: 371 -PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + Q ++ +N + +++ L + + L +SS + A G+ Sbjct: 351 SSLLVQSELIAELNDVSNQLNRLKSIYSEKLKQLNSLKSSILNQAFRGE 399 >gi|156973427|ref|YP_001444334.1| hypothetical protein VIBHAR_01116 [Vibrio harveyi ATCC BAA-1116] gi|156525021|gb|ABU70107.1| hypothetical protein VIBHAR_01116 [Vibrio harveyi ATCC BAA-1116] Length = 400 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 55/411 (13%), Positives = 121/411 (29%), Gaps = 36/411 (8%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ +W+++ ++ + + R + + P G S Sbjct: 6 DVPEIRFNDFVGNWQLLKLEDVAQFHDERRKPITE----------SAREAGPHPYYGASG 55 Query: 72 QSDTSTVSIFAKGQILYGKLGPYL-----RKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 D IF + IL + G + R +A + VL+ K L Sbjct: 56 IIDYVKDYIFDEEMILLSEDGANIIDRNYRVCFLASGQYWVNNHAHVLKAKQGNNNL--- 112 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L R + G + N+P+ I EQ +I I+ Sbjct: 113 FLCESLERLRYDKYNTGTAQPKINQDVCRNLPVYITDNDEQEIIGNYFQKLDTLINQHQQ 172 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + L K ++ + P+++ +W E Sbjct: 173 KHDKLSNLKKSMQEKMFPKA--GETVPEIRFDGFSGDWDSKPLSKVASNISDGDWIEAEH 230 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGL---KPESYETYQIVDPGEIVFRFIDLQNDKR 303 I + + G ++ + + PG+I+ + + Sbjct: 231 IFPNGKFRIIQTGNIGVGEFLNNEKHAKYFHQRNFDLIKANEIYPGDILISRLAEPAGRA 290 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDV 362 ++ + + DS +L + + + KV SG + + ++ Sbjct: 291 AILPDTGFRMVTAVDVAIVRREECYDSYFLMSYLNTAECLKVVSEGVSGTSHKRISRANL 350 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ + P I EQ I +D L+ + Q I +K + + + Sbjct: 351 VKVNIPFPSIDEQIKIGKY----FENLDGLINQHNQQITKIKNIKQACLDK 397 >gi|315280772|ref|ZP_07869575.1| specificity determinant HsdS [Listeria marthii FSL S4-120] gi|313615581|gb|EFR88923.1| specificity determinant HsdS [Listeria marthii FSL S4-120] Length = 393 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 45/398 (11%), Positives = 131/398 (32%), Gaps = 38/398 (9%) Query: 25 WKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ + ++ G + ++ ++ ++ + DV + G+ + ++ Sbjct: 22 WEQRKLGELAEIVRGASPRPIQDPKWFDNNSEVGWLRISDVTAQNGRINYLEQRISEAGQ 81 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + + +L + I G+ + L K E + Sbjct: 82 EKTRVLKEPHLLLSIAATVGKPVINYVKTGVHDGFLIFLDIKF---EQEFLFQWLEMFRT 138 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + + + + + N + IP + EQ+ KI ++D I R +E + Sbjct: 139 SWQKYGQPGSQVNLNSELVRNQEILIPSMKEQI----KISQLFQQLDNTIALHQRKLEKI 194 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K K A +S + K + +G D WE L+ + + +L + + Sbjct: 195 KALKTAYLSEMFPAEGETKPKRRFAG------FTDDWEQHKLGDLIDKQIKGKAQLEKLS 248 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 +++Y + + ++V I + D + + +G Sbjct: 249 KGTVAYLDTFTLNGGKAFLTDGHE----------DVVETDILILWDGSKAGTVYIGFKGA 298 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + S + + + Y+ ++ + ++ + + + +P EQ Sbjct: 299 LGSTLKGYRTSI--NEQFVYQFLKYNQENIYNNYRTPNIPHVQKDFLDVFKISIPKTVEQ 356 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + ++D + ++ + L+ + +++ Sbjct: 357 AKLGSF----FQQLDKTITIHQRKLQKLQNIKKAYLNE 390 >gi|67921463|ref|ZP_00514981.1| Restriction modification system DNA specificity domain [Crocosphaera watsonii WH 8501] gi|67856575|gb|EAM51816.1| Restriction modification system DNA specificity domain [Crocosphaera watsonii WH 8501] Length = 408 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 64/417 (15%), Positives = 151/417 (36%), Gaps = 38/417 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQS--DT 75 +P++WK + + G I +++++ + S + Sbjct: 10 LPQYWKWSKCQEVIDVRDGTHDTPKYVSSGYPVITSKNLKTSGIDFSNVSYISEADHKEI 69 Query: 76 STVSIFAKGQILYGKLGPYLRKAI--IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S S KG IL +G I I I + L ++ PE + L S + Sbjct: 70 SKRSKVDKGDILLAMIGTIGNPVIVDIEKEFSIKNVALFKLSKSNIYPEYFKYLLDSSII 129 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +++++ G T K + N+ +P+PPL EQ I + + E Sbjct: 130 SRQLDFEQRGGTQKFVSLKVLRNLLIPLPPLEEQKRIAKILDKADEIRRKRKESIRLTDE 189 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L L S + +P + K ++ +G L N K ++L + Sbjct: 190 L-------LRSTFLDMFGDPVINPKGWEVKTLG--------SQIKELKYGTNSKCSELQK 234 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDL---QNDKRSLR 306 +N +++ I + LK + ++ +I + G+++F + + ++ Sbjct: 235 NNNIAVLRIPNIDNEKISWNDLKYTNLDSKEISKLLLKNGDLLFVRSNGNPDYIGRCAIF 294 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWL-----MRSYDLCKVFYAMGSGLRQSLKFED 361 + + + S + + I + A++ ++ + A + ++ ++ Sbjct: 295 EEESNRKAVYASYLIRGRLKSICDFHPAFIRDIIAFPTFRSFLIREARTTAGNYNINIQE 354 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + L ++ PP +Q + ++ T +I+ ++S+ + +S + A G+ Sbjct: 355 LSSLKLICPPQDKQEE---YLD-ITTKINRSFLNKQKSLQESENLFNSLLQKAFKGE 407 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/203 (10%), Positives = 66/203 (32%), Gaps = 13/203 (6%) Query: 22 PKHWKVVPIKR-FTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 PK W+V + +L G S+ +I + + ++++ + + S Sbjct: 206 PKGWEVKTLGSQIKELKYGTNSKCSELQKNNNIAVLRIPNIDNEKISWNDLKYTNLDSKE 265 Query: 76 STVSIFAKGQILYGKLG---PYLRKAIIADF----DGICSTQFLVLQPKDVLPELLQGWL 128 + + G +L+ + Y+ + I + + ++ + + K + Sbjct: 266 ISKLLLKNGDLLFVRSNGNPDYIGRCAIFEEESNRKAVYASYLIRGRLKSICDFHPAFIR 325 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 I + A + ++ + + +E+ + T +I+ + Sbjct: 326 DIIAFPTFRSFLIREARTTAGNYNINIQELSSLKLICPPQDKQEEYLDITTKINRSFLNK 385 Query: 189 IRFIELLKEKKQALVSYIVTKGL 211 + ++ + +L+ L Sbjct: 386 QKSLQESENLFNSLLQKAFKGEL 408 >gi|229553104|ref|ZP_04441829.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus rhamnosus LMS2-1] gi|229313601|gb|EEN79574.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus rhamnosus LMS2-1] Length = 407 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 61/405 (15%), Positives = 134/405 (33%), Gaps = 40/405 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY---IGLEDVESGTGKYLPKDGNSRQSDTS---TV 78 W+ + L T + KD + + ++V + + D + D + Sbjct: 20 WEKRKLIDQLSLLKDGTHGTHKDGNFAFLLSAKNVIQDSIVFDDSDRKISEDDFNDIYAN 79 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGIC----STQFLVLQPKDVLPELLQGWLLSIDVT 134 K +L +G R A+ S L +P P L L + + Sbjct: 80 YHIKKNDVLLTIVGTIGRVALFPRLTVPVAFQRSVAILRTKPTLF-PYFLALELQTPTIQ 138 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +I+A + + + + + IP EQ+ I + T I + + L Sbjct: 139 SKIKARANMSAQAGIYLGDLKKVVISIPKSEEQIEIAMSLNRLTNLIAATQDKLEKLSIL 198 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + Q + W H V R N + Sbjct: 199 QRGFLQHFFAQT-----------------WRFSGYSHVWENHRLGDVATRVRGNDGRMNL 241 Query: 255 NILSLSYG-NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVME 312 IL++S G + + + + + + Y ++ GE+ + + + + + + + Sbjct: 242 PILTISAGKGWLTQEQRFSQNIAGNELKKYTLLSKGELSYNHGNSKLAEYGAVFVLKQFK 301 Query: 313 RGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYA-MGSGLRQ----SLKFEDVKRLP 366 ++ Y + G D ++ +L S + SG R ++ ++ + Sbjct: 302 EALVPRVYHSFNVSGKADPDFIEYLFESGVPNHELRKLISSGARMDGLLNINYDSFMNIS 361 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 VL+P I+EQ I V+ ++ L ++ + L++ + S + Sbjct: 362 VLLPSIEEQNKIARVLE----KLKKLTDETRLRLFNLQQAKKSLL 402 >gi|164551505|gb|ABY60970.1| Sau1hsdS1 [Staphylococcus aureus] Length = 419 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 61/408 (14%), Positives = 139/408 (34%), Gaps = 29/408 (7%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 20 EWEEKKLGDLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVEIHANLNQHVCIIRLKKEYYYNFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRIDTLITERIR 190 ++I G + ++K I N+ + P + E Q I E I +I+ + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGEFISKLDRQIELEEQKLEL 199 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + K Q + S + D + + + ++ Sbjct: 200 LQQQKKGYMQKIFSQELRFKDEEGKDYPDWKSKSIQEIFENKGGTALETEFNFDG----- 254 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 ++S+ +I +N+ + I+ G++ D D + + + Sbjct: 255 --NYKVISIGSYSINSTYNDQNIRVNKNKKTEKYILSKGDLAMVLNDKTKDGKIIGRSIF 312 Query: 311 MERG---IITSAYMAVKPHGIDSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRL 365 +++ I + P + W + + DL K+ M + + + +K + Sbjct: 313 IDKDNQYIYNQRTERLIPFAENDNKFLWFLMNTDLIRNKIKGMMQGATQVYINYSSIKLI 372 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +P ++EQ I + V + + K I LKER+ +F+ Sbjct: 373 SIQLPLLEEQQKIRGFLEV----LSGITTKQLHXIDQLKERKKAFLQK 416 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 54/181 (29%), Gaps = 6/181 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E ++ + I L NI + Sbjct: 10 PELRFPGFEGEWEEKKLGDLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVEIHANLNQHVCIIRLKKEYYY 129 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETAR 388 + +L+ K+F A G R+ L F+++ L + P I +EQ I I+ + Sbjct: 130 NFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGEFISKLDRQ 189 Query: 389 I 389 I Sbjct: 190 I 190 >gi|326802759|ref|YP_004320577.1| type I restriction modification DNA specificity domain protein [Aerococcus urinae ACS-120-V-Col10a] gi|326650965|gb|AEA01148.1| type I restriction modification DNA specificity domain protein [Aerococcus urinae ACS-120-V-Col10a] Length = 396 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 57/398 (14%), Positives = 135/398 (33%), Gaps = 26/398 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + + N S Y+ LE V GT K + ++ + + K Sbjct: 18 DWIQDKLGNISSFNPNAELPSQ--FFYVDLESV-CGTQLVDYKFMSKEEAPSRAKRLAKK 74 Query: 84 GQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G I Y + PY R ++ + D + ST + ++ + + L L + ++ ++ Sbjct: 75 GDIFYQTVRPYQRNNLLFNEDDNEFVFSTGYAQIRTNIINNKFLFYLLQTDKFVLKVLSM 134 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 C G + + + + P + + +++ I IT + IE L+ KQ Sbjct: 135 CTGTSYPAITSAEMSKVIIHYPKKQLEQIKIGELLNRLDFI---ITLEQQKIEKLELLKQ 191 Query: 201 ALVSYIVTKG-LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 L+ + P+++ + W + ++ K + + Sbjct: 192 YLLQNMFADESGYPNLRFRGYTGPWF--------KNKGKNIFKKITEKKQAHLPVLSATQ 243 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 G +++ + ++ Y++V PG+ V Q + Sbjct: 244 DKGMVLRDEFNERLQYDRKNLSNYKVVRPGQFVVHLRSFQGGFAHSNYLGITSPAYTIFD 303 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFD 377 ++ + + Y + + + + G+R +++ F D L + P + EQ Sbjct: 304 FI--NTNEHNDIYWKFYFANDHFILLLEKVTYGIRDGRTINFSDFCTLNINFPSLSEQNK 361 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I ++ +D L+ + L + +++ Sbjct: 362 IAKLLFS----LDSLINLRTTKLENLTSLKQKLLSSLF 395 >gi|294775383|ref|ZP_06740902.1| type I restriction modification DNA specificity domain protein [Bacteroides vulgatus PC510] gi|294450765|gb|EFG19246.1| type I restriction modification DNA specificity domain protein [Bacteroides vulgatus PC510] Length = 425 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 62/425 (14%), Positives = 131/425 (30%), Gaps = 51/425 (12%) Query: 20 AIPKHWKVVPIKRFTKLN-----------TGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 +P W I+ + T T S ++ + ++V G Y ++ Sbjct: 4 EVPSSWVWTNIEELFFVTKLAGFEYTDCLTKDTISSNNEVPIVRAQNVRMG---YFVENT 60 Query: 69 NSRQSDTSTVSI----FAKGQILYGKLGPYLRKAIIADFDGIC----STQFLVLQPKDVL 120 N S+ + + K +L +G + I C + + + Sbjct: 61 NEAISEALSQQLERSALTKKCLLMTFIGAGIGDTCIFPALKRCHLAPNVAKIEPYSNKID 120 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + +L+S + I + I ++ + +PPLAEQ I +I Sbjct: 121 LKYALYYLMSDLGQLGVRGISKSTAQPSLSMATIRSLEIALPPLAEQHRIVAEIEKLFEL 180 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV--------------- 225 ID + + ++K+ K ++ + L P + IE + Sbjct: 181 IDQIEQGKADLQTIIKQTKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHY 240 Query: 226 -GLVPDHWEVKPFFALVT-----------ELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 VP W ++ + I LS ++ + Sbjct: 241 TFDVPSGWITTNLGSIFNVVSAKRILKSDWKHSGVPFYRAREIAKLSIYGLVDNELYISE 300 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 E + + +I+ + ++ + + + I++ Y+ Sbjct: 301 EHYNSLKEKFPVPKASDIMISAVGTIGKCYIVKESDKFYYKDAS-VLCLCNDYQINTKYI 359 Query: 334 AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +MRS + K Y G ++ E K+ + +PP+ EQ I I + D + Sbjct: 360 YHIMRSEYMLKQMYDNSKGTTVDTITIEKAKQYILPLPPLAEQQRIVAKIEETFSIFDGI 419 Query: 393 VEKIE 397 +E Sbjct: 420 QNSLE 424 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 66/200 (33%), Gaps = 2/200 (1%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + IE + V + L + N ++ ++ G ++ + Sbjct: 12 TNIEELFFVTKLAGFEYTDCLTKDTISSNNEVPIVRAQNVRMGYFVENTNEAISEALSQQ 71 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 E + ++ FI + A A + + ID Y + + S Sbjct: 72 LERSALTKK-CLLMTFIGAGIGDTCIFPALKRCHLAPNVAKIEPYSNKIDLKYALYYLMS 130 Query: 340 YDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + S + SL ++ L + +PP+ EQ I I ID + + Sbjct: 131 DLGQLGVRGISKSTAQPSLSMATIRSLEIALPPLAEQHRIVAEIEKLFELIDQIEQGKAD 190 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 ++K+ +S + A+ G+ Sbjct: 191 LQTIIKQTKSKILDLAIHGK 210 >gi|224543619|ref|ZP_03684158.1| hypothetical protein CATMIT_02829 [Catenibacterium mitsuokai DSM 15897] gi|224523445|gb|EEF92550.1| hypothetical protein CATMIT_02829 [Catenibacterium mitsuokai DSM 15897] Length = 381 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 55/398 (13%), Positives = 124/398 (31%), Gaps = 27/398 (6%) Query: 26 KVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + + G + + + I ++D+ D D Sbjct: 2 EYKKLGDIATYINGYAFKPEQRGSEGLPIIRIQDLTGN-----AYDLGYYNGDYPKKIEL 56 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G +L L + + + + V + L ++ Sbjct: 57 NDGDVLISWS-ASLGVYLWNRGKALLNQHIFKVVFDKVEIDKLYFMYAVEYSLDKMSLKT 115 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GATM H K N+ +P P L Q I ++ + I+ + EL Sbjct: 116 HGATMKHITKKDFDNVVIPYPDLDYQKEISYRLTSLKGIIEKYQEQLDLLDEL------- 168 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE-LNRKNTKLIESNILSLS 260 + + V +P+++ K S +++ LV + + + K + I + Sbjct: 169 IKARFVEMFGDPNIEFKYSSVKFNDLVARMTKGPFGSDMKKDLFVPKGEDTYKVYIQINA 228 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 E + + + P + + L+ + ME+G+I+ + Sbjct: 229 IQKNQSLGEYYISKEYFDRKVSRFELFPNDYIITCDGTLGK--YLKLDENMEKGVISPSL 286 Query: 321 MAV--KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFD 377 + + + I+ Y + Y L + + L + + + + VPP++ Q Sbjct: 287 LRLTLQNDKINDKYFENIWDFYMLGLMKKEARNACLVHLPSAKKIGEISIPVPPLELQNQ 346 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + ID +I++S+ +E S + Sbjct: 347 FASFV----QEIDKSRSRIQKSLEASQELFDSLMQEYF 380 >gi|164551503|gb|ABY60969.1| Sau1hsdS2 [Staphylococcus aureus] gi|323438973|gb|EGA96707.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus O11] gi|323441823|gb|EGA99464.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus O46] Length = 399 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 60/403 (14%), Positives = 139/403 (34%), Gaps = 39/403 (9%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ ++V +G + D Sbjct: 20 EWEEKQLGDLTTKIGSGKTPKGGSENYTNKGIPFLRSQNVRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGKFFSKLDRQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I ++ L ++G E+ + F ++ Sbjct: 197 LELLQQQKKGYMQKIFSQEL---RFKDENGEEYPNWENKFIKDIFIFENNRRKPITSSLR 253 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + II + + Y + ++ + + S Sbjct: 254 EKGLYPYYGATGIIDYV------------KEYLFNNEERLLIGEDGAKWGQFETSSFIAN 301 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVP 370 + + + VK + + ++ + + K A +G L ++ + + +P Sbjct: 302 GQYWVNNHAHVVKSNDHNLFFMNYYLN----FKELRAFVTGNAPAKLTHANLCNINLKIP 357 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + EQ + ++ ID + I LLKER+ + Sbjct: 358 CLTEQ----DKVSALLKSIDNKMTNQMNRIELLKERKKGLLQK 396 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 53/181 (29%), Gaps = 6/181 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G+E ++ + I L N+ + Sbjct: 10 PELRFPGLEGEWEEKQLGDLTTKIGSGKTPKGGSENYTNKGIPFLRSQNVRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYY 129 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETAR 388 + +L+ K+F A G R+ L F+++ L + P I +EQ I + + Sbjct: 130 NFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGKFFSKLDRQ 189 Query: 389 I 389 I Sbjct: 190 I 190 >gi|307249506|ref|ZP_07531494.1| Possible type I site-specific deoxyribonuclease [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306858499|gb|EFM90567.1| Possible type I site-specific deoxyribonuclease [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 388 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 58/421 (13%), Positives = 118/421 (28%), Gaps = 64/421 (15%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDV--ESGTG 61 KD V+W + K G T + + ++ + Sbjct: 8 KDCEVEW----------KSLGEVAKYVRGLTYNKTNESDEKAGGYYVLRANNITLSNNQL 57 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFL--VLQ 115 + + T K IL + A I++ F+ V Sbjct: 58 NFDDVKLVKFDTKTKPEQKLYKDDILISAASGSKEHVGKVAFISENMDFYFGGFMGVVRC 117 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 +++LP L L S + + +T+++ + K + +PIPPL Q I + + Sbjct: 118 SQEILPRFLFHILTSSLFKTYLNEVLNSSTINNLNAKVMNEFQIPIPPLEIQEKIVKILD 177 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 T TL + L ++ ++ G + +EW Sbjct: 178 KFTELEATLEATLEAELSLRVKQYDYYRDDLLNFGDD---------VEW----------- 217 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 L E + S N I+ E + E Sbjct: 218 -------------KMLGEVCVRIFSGKNKIKNNEGKYNVYGSTGIIAKTDKKIYEEDLLL 264 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I E + + + D L +L + + + Sbjct: 265 IARVGANAGFVHIATGEYDVSDNTLIIKHKE--DLVILKYLYYVLENMNLNRFANGAGQP 322 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 + +K L +L+PP+ Q I +++ + + + + + I L ++ R + Sbjct: 323 LITAGQLKELKILLPPLSTQQKIVEILDKFDRLTNSISDGLPKEIELRRKQYEYYRERLL 382 Query: 412 A 412 Sbjct: 383 N 383 >gi|289450224|ref|YP_003474823.1| type I restriction modification DNA specificity domain-containing protein [Clostridiales genomosp. BVAB3 str. UPII9-5] gi|289184771|gb|ADC91196.1| type I restriction modification DNA specificity domain protein [Clostridiales genomosp. BVAB3 str. UPII9-5] Length = 396 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 48/409 (11%), Positives = 126/409 (30%), Gaps = 27/409 (6%) Query: 24 HWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ + + ++ G + + + ++ + DV+ + + + S Sbjct: 3 DWENIELGNICEVVRGGSPRPIIDYITDEPDGVNWLKIGDVKETDKFFTHANEKIKPSGI 62 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-T 134 G ++ + R I I + + L + + L+ ++ Sbjct: 63 PKTREVKAGDLILSNSMSFGRAFITLIDGYIHDGWLRLRCDESRLDKEYLYYFLTSNLAQ 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +AI G+ +++ + I + +P L EQ I E + + I + Sbjct: 123 NQFKAIATGSVVNNLKSDTVKAIKIDLPTLGEQKRIAEVLSMFDDK----IKCNEEVNKN 178 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+++ QAL + N + + E+ + + T T + Sbjct: 179 LEQQAQALYREMFVNTTNDQRRTCRAE-EYFDIAIGKTPPRKEHQWFTTNPSDATWVS-- 235 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I + + + + ++V ++ F + Sbjct: 236 -ISDMGSCGTYIIRSSEQLTQEAVDKFNIKVVPSNTVLLSFKLTVGRIAITHGEMITNEA 294 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 I + YL +R D S + ++ + +K +P ++P E Sbjct: 295 IA----HFKTDKAFINEYLYCYLR--DFNYQTMGSTSSIAIAVNSKIIKAMPFVIPADDE 348 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + + + + + L + R + + ++G+ID+ Sbjct: 349 ----ISRFHSVVGPMFEQILNNQLENDSLADLRDTLLPRLMSGEIDVSD 393 >gi|146295063|ref|YP_001185487.1| restriction modification system DNA specificity subunit [Shewanella putrefaciens CN-32] gi|145566753|gb|ABP77688.1| restriction modification system DNA specificity domain [Shewanella putrefaciens CN-32] Length = 401 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 71/371 (19%), Positives = 138/371 (37%), Gaps = 10/371 (2%) Query: 26 KVVPIKRFTKLN--TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + V T + + YIGLE ++SG+ K + + G + + S +F K Sbjct: 5 QTVKFGDICCEVKLTTKDPIADGYERYIGLEHLDSGSLK-IKRWGMIAEDNPSFTRVFKK 63 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G IL+GK PYL+KA IA+FDGICS +V++P + +L + S D + G Sbjct: 64 GHILFGKRRPYLKKAAIAEFDGICSGDIIVMKPDPDVKDLFPFIVQSKDFWEWSVQTSSG 123 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + +K + N + IP + L+ E++ + T +L+K + Sbjct: 124 SLSPRTKFKSLANFELAIPDFNRRKLLLEEVKKSNEVVKTTDLLIDAQEQLIKSQYYKTF 183 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + + ++ + V + + + + + ++ + L++ G Sbjct: 184 KQELGIDDDTTYPLRINSTSNVEIKLLKELLLNKPQNGQFVKKGSGGSVDCSFLNVVDGY 243 Query: 264 IIQKLETRNMGLK--PESYETYQIVDPGEIVFRFIDLQNDKRSLR--SAQVMERGIITSA 319 + + +S + G+I+F L ++ Sbjct: 244 VNSYSTEDRREIISCSQSEFEKYCLKNGDILFNRSSLVKSGIGWPFLVLNDTKQSTFDCH 303 Query: 320 YMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQF 376 + V P I YL S K F +G + ++ +++ PV VP I +Q Sbjct: 304 LIRVNVDPKIILPEYLYIYALSPWARKYFLCVGQTTTMTTISQSEIENFPVPVPSIYKQE 363 Query: 377 DITNVINVETA 387 +I + Sbjct: 364 EIVTTFSNLFT 374 >gi|20091246|ref|NP_617321.1| type I restriction modification enzyme protein S [Methanosarcina acetivorans C2A] gi|19916365|gb|AAM05801.1| type I restriction modification enzyme protein S [Methanosarcina acetivorans C2A] Length = 391 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 58/405 (14%), Positives = 124/405 (30%), Gaps = 29/405 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W PI + TG T ++ + DI ++ +++ T + ++ + Sbjct: 4 WPHQPIISLGTIITGSTPKTSEEHFYGGDIPFVTPAELDQ-TDPIMNAARTLSETGSQES 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + +G ++ +G L K IA + Q + + G+ + R+E Sbjct: 63 RLLPEGTVMVCCIGS-LGKVGIAGRTVASNQQINSVIFDPKIIWPRFGFYACRLLKSRLE 121 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + T+ + G + +P+PPL EQ I + + E Sbjct: 122 VLAPATTVPIVNKSKFGQLEIPVPPLPEQKRIADILDRAEALRAKRRVALEHLD----EL 177 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 QA+ + ++ + K P V + I Sbjct: 178 TQAIFIDMFGDSVSNPMGWKR--------YPLKHCVNHIQIGPFGSLLHKEDYVFGGIPL 229 Query: 259 LSYGNIIQKLETRNMGLKPESYE----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 ++ +I ++ + + G+++ + S Sbjct: 230 INPTHIENGKIVPDVNQSITVQKLAELQLYQLQQGDVIMGRRGEMGRCAIVGSEHNGTLC 289 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIK 373 S ++ + YL + S + K +L V L + +PPI+ Sbjct: 290 GTGSLFIRPDESKAIAMYLQATLSSESMRKHLEGFSLGATLPNLNRGIVGELAISLPPIE 349 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q + ++ I I+ L + S+ + E S A G+ Sbjct: 350 LQKEFSHHIES----IEKLKTTYKSSLTEIDELFLSLQYRAFRGE 390 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 67/203 (33%), Gaps = 17/203 (8%) Query: 22 PKHWKVVPIKRFTK-LNTGRTSE---SGK----DIIYIGLEDVESGTG-KYLPKDGNSRQ 72 P WK P+K + G I I +E+G + + ++ Sbjct: 193 PMGWKRYPLKHCVNHIQIGPFGSLLHKEDYVFGGIPLINPTHIENGKIVPDVNQSITVQK 252 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWL 128 + +G ++ G+ G R AI+ + F+ + LQ L Sbjct: 253 LAELQLYQLQQGDVIMGRRGEMGRCAIVGSEHNGTLCGTGSLFIRPDESKAIAMYLQATL 312 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + + +E GAT+ + + +G + + +PP+ Q I+ L T Sbjct: 313 SSESMRKHLEGFSLGATLPNLNRGIVGELAISLPPIELQKEFS----HHIESIEKLKTTY 368 Query: 189 IRFIELLKEKKQALVSYIVTKGL 211 + + E +L L Sbjct: 369 KSSLTEIDELFLSLQYRAFRGEL 391 >gi|291542117|emb|CBL15227.1| Restriction endonuclease S subunits [Ruminococcus bromii L2-63] Length = 425 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 49/409 (11%), Positives = 123/409 (30%), Gaps = 25/409 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYI-GLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + T + + D ++ E + KD + + + Sbjct: 19 WEQRKLGEISDKVTKKNQDVVVDEVFTNSAEYGIISQRDFFDKDIANT-ENIDGYYVVEP 77 Query: 84 GQILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +Y + G S + V +P +V L+ + + + Sbjct: 78 NDFVYNPRISTTAPFGPIKRNKLERSGAMSPLYYVFRPNNVDLSYLEWFFQTSCWYPFMR 137 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 S L + + +++I +DTLIT R ++ +K+ Sbjct: 138 FNGNSGARSDRFAITDKIFNEMPISLPQDIEEQKRIGMFLTTLDTLITLHQRKLDHVKDL 197 Query: 199 KQALVSYIVTK--GLNPDVKMKDSGIEW-------VGLVPDHWEVKPFFALVTELNRKNT 249 K++++ + K L P+V+ + W + + + + K+ Sbjct: 198 KKSMLQKMFPKNGQLYPEVRFPEFTDAWEQRKLKNILVSLQNNTLSRADLSNETGVAKDV 257 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + I +I ++ K + + G++V Sbjct: 258 HYGDVLIKFGEVLDISKEKLPMITDEKVLTKYKTSFLQNGDVVVADTAEDTTVGKCSEIA 317 Query: 310 VMERGIITSAYMAVKPHGIDST---YLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL 365 + ++ S + ++ YL + + S + G+ S+ ++ Sbjct: 318 ELNDEVVISGLHTIPYRPVEKFATGYLGYYLNSDSYHNQLIPLMQGIKVTSISKSAMQDT 377 Query: 366 PVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ P +EQ I +D L+ ++ + LK + + Sbjct: 378 NIIYPNSKEEQAKIGKY----FITLDNLITLHQRELDHLKLLKKGMLQQ 422 >gi|166711015|ref|ZP_02242222.1| restriction modification system DNA specificity domain [Xanthomonas oryzae pv. oryzicola BLS256] Length = 767 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 56/491 (11%), Positives = 128/491 (26%), Gaps = 91/491 (18%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII----YIGLEDVESGTGKYLPKDGNSRQSDT 75 +P+ W + N+G+T + G++ YI ++ G + + Sbjct: 77 ELPESWCWARFGDIAQHNSGKTLDKGRNSGVPRDYITTSNLYWGRFELSGVRQMLIEEKD 136 Query: 76 STVSIFAKGQILYGKLGPYLR--------------KAIIADFDGICSTQFL-VLQPKDVL 120 +L + G R A F G + + + Sbjct: 137 LARCTAIMNDLLICEGGEAGRAAVWDQEREICFQNHVHRARFLGGINPHYAQRFFERLNY 196 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA--------------- 165 + + + ++ + I + L Sbjct: 197 SGEIAEYRKGVGISNMSSKSLASIPVPLPPVAEQHRIVAKVDELMGLCDQMEARQADADS 256 Query: 166 -------------EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 Q E R+ + KQ L+ V L Sbjct: 257 AHAQLVQALLDSLTQARDAEDFAHSWQRLAEHFHTLFTTESSIDALKQTLLQLAVMGKLV 316 Query: 213 PDVKMKDSGIEWVGLV--------------------------------PDHWEVKPFFAL 240 ++G E + + P W F L Sbjct: 317 QQDPNDETGCELLKRIAEGGSALIASKKVKTSKAHTGLAVQGIKGVRLPATWAWARFDDL 376 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLE----------TRNMGLKPESYETYQIVDPGE 290 + ++ + ++ + +++ + +S + GE Sbjct: 377 INREYPIAYGVLVPGPDVVDGIPFVRIADLDLVAPPAKPEKSISPEVDSQFKRTRIRGGE 436 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I+ + V I + V + +Y+ WL++S + K F Sbjct: 437 ILMGVVGSVGKLGIAPDTWVGAN-IARAICRIVPCGEVSKSYILWLLQSDLMRKQFLGDT 495 Query: 351 SG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + +L ++ +PP+ EQ I ++ A D L ++ ++ + + ++ Sbjct: 496 RTLAQPTLNVGLIRSALTPLPPLAEQQRIVAKVDQLMALCDQLKSRLSEARRVHEHLANA 555 Query: 410 FIAAAVTGQID 420 I+ A+ G+ Sbjct: 556 LISQALNGEKK 566 >gi|46019873|emb|CAE52399.1| putative restriction-modification enzyme type I S subunit [Streptococcus thermophilus] Length = 362 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 47/378 (12%), Positives = 117/378 (30%), Gaps = 27/378 (7%) Query: 36 LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYL 95 + + + DI ++ + DV G+ + + + + + +L Sbjct: 9 IQDPKWFDKESDIGWLRIADVTEQNGRIYHLEQHISKLGQEKTRVLTEPHLLLSIAATVG 68 Query: 96 RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + + G+ + L P E + + + + + + + + + Sbjct: 69 KPVVNYVKTGVHDGFLIFLNPTF---EREFMFQWLEMFRPKWQKYGQPGSQVNLNSELVR 125 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV 215 N + +P EQ I ++D I R ++LLKE+K+ + + K Sbjct: 126 NQEIVLPNYKEQQKIGLF----FKQLDDTIALHQRKLDLLKEQKKGFLQKMFPKNGAKVP 181 Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 +++ +G ++ + + + N L+ G T + Sbjct: 182 ELRFAGFADAWEERKLGKIFNYEQPTKYIVKSTEYDDTFNTPVLTAGKSFLLGYTDEITG 241 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + +V + + S V I S+ M + +S + Sbjct: 242 IKNATVENPVVIFDDF------------TTGSHYVDFPFKIKSSAMKLLSLNDNSDNFYF 289 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + K + + P +EQ I + ++D + Sbjct: 290 MFNTLKNIKYVPQS----HERHWISKFSEFEIYKPSQEEQQKIGSF----FKQLDDTIAL 341 Query: 396 IEQSIVLLKERRSSFIAA 413 ++ + LLKE++ F+ Sbjct: 342 HQRKLDLLKEQKKGFLQK 359 >gi|88195627|ref|YP_500433.1| type I restriction-modification enzyme, S subunit, EcoA family protein [Staphylococcus aureus subsp. aureus NCTC 8325] gi|297207478|ref|ZP_06923914.1| EcoA family type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus ATCC 51811] gi|87203185|gb|ABD30995.1| type I restriction-modification enzyme, S subunit, EcoA family, putative [Staphylococcus aureus subsp. aureus NCTC 8325] gi|296887814|gb|EFH26711.1| EcoA family type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus ATCC 51811] gi|329724465|gb|EGG60973.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus 21189] Length = 399 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 58/403 (14%), Positives = 136/403 (33%), Gaps = 39/403 (9%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 20 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGKFFSKLDRQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I T+ L + + EW + + K Sbjct: 197 LELLQQQKKGYMQKIFTQELRFKDENGEEYPEWENKFIKDIFIFENNRRKPITSSLREKG 256 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + ++ N ++ + + S Sbjct: 257 LYPYYGATGIIDYVKDYLFNNEE---------------RLLIGEDGAKWGQFETSSFIAN 301 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVP 370 + + + VK + + ++ + + K A +G L ++ + + +P Sbjct: 302 GQYWVNNHAHVVKSNDHNLFFMNYYLN----FKELRAFVTGNAPAKLTHANLCNINLKIP 357 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + EQ + ++ ID + I LLKER+ + Sbjct: 358 CLTEQ----DKVSALLKSIDNKMNNQMNRIELLKERKKGLLQK 396 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 51/181 (28%), Gaps = 6/181 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E + + I L NI + Sbjct: 10 PELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYY 129 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETAR 388 + +L+ K+F A G R+ L F+++ L + P I +EQ I + + Sbjct: 130 NFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGKFFSKLDRQ 189 Query: 389 I 389 I Sbjct: 190 I 190 >gi|312114645|ref|YP_004012241.1| restriction modification system DNA specificity domain protein [Rhodomicrobium vannielii ATCC 17100] gi|311219774|gb|ADP71142.1| restriction modification system DNA specificity domain protein [Rhodomicrobium vannielii ATCC 17100] Length = 409 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 61/413 (14%), Positives = 130/413 (31%), Gaps = 29/413 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W V + ++ G +S I + ++++ +G G D + + Sbjct: 3 SGWPFVRLGEVCEVTPGYAFKSQDWSHAGIPVVKIKNI-AGDGTVDLNDVDCIPPTLFSR 61 Query: 79 SI----FAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G I+ G KA + + + L+P DV + S D Sbjct: 62 KLGRFELRDGDIIIAMTGATAGKAGRVRTSRSILLNQRVARLRPNDVDAAFFWALVGSKD 121 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + +GA + I + +P PP+ Q I + A I R I Sbjct: 122 YERIFFRLADGAAQPNMSSSQIEGVLIPCPPITVQRRIGSILRAYDDL----IEVNRRRI 177 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN-TKL 251 +L+E + L P + G +P W L +E+ Sbjct: 178 AVLEEMARRLFEEWFVHFRFPGYQADIPR----GRLPSGWIWSTLGELASEVRDAVLPSD 233 Query: 252 IESNILSLSYGNIIQKLETRNM-GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + + ++ ++ T G E T PG+I+F I K Sbjct: 234 VSPDTPYVGLEHLPRRSTTLGEWGNVDEVTSTKLKFRPGDILFGKIRPYFHKVVWAPCDG 293 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 + + A + + + + S +G + + + PV + Sbjct: 294 IS---SSDAIVIRARSDDLTAIVLSVASSDAFVAHAVQTSNGTKMPRANWPVLVKYPVPL 350 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 PP++ + ++ + L ++ + L R + ++G++ + Sbjct: 351 PPLELREKFSDYVLNGV----QLAATLQAANRRLVASRDLLLPRLISGELSVT 399 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 37/158 (23%), Positives = 59/158 (37%), Gaps = 5/158 (3%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 G +P W + S D Y+GLE + + + TS Sbjct: 207 GRLPSGWIWSTLGELASEVRDAVLPSDVSPDTPYVGLEHLPRRSTTLGE--WGNVDEVTS 264 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQ 135 T F G IL+GK+ PY K + A DGI S+ +V++ + ++ S Sbjct: 265 TKLKFRPGDILFGKIRPYFHKVVWAPCDGISSSDAIVIRARSDDLTAIVLSVASSDAFVA 324 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 G M A+W + P+P+PPL + + Sbjct: 325 HAVQTSNGTKMPRANWPVLVKYPVPLPPLELREKFSDY 362 >gi|187939949|gb|ACD39085.1| type I restriction modification DNA specificity protein [Pseudomonas aeruginosa] Length = 395 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 52/414 (12%), Positives = 133/414 (32%), Gaps = 40/414 (9%) Query: 24 HWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDV-----ESGTGKYLPKDGNSRQSD 74 W +V + + + + + + + +V E L D + + Sbjct: 2 SWPIVKLGEIFDITSSKRVHEIDWRNEGVPFYRAREVAVLAKEGRVDNDLFIDESMYEEF 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + + G +L +G + + D + L+ + + ++ Sbjct: 62 KAKYGVPKVGDLLVTAVGTLGKVYAVQESDRFYFKDASVIWLRARQEVDTSYIQHAMNST 121 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 QR GAT+ +P+PPL EQ I + Sbjct: 122 DVQRFIQNSSGATVGTYTISRANETEIPLPPLPEQKRIAAILDKADAIRRKRQQAIQLAD 181 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + L + + +P K I + + + + + Sbjct: 182 DF-------LRAVFLDMFGDPVTNSKGFPIGTIRDLVATADYGS-------SAKASETYG 227 Query: 253 ESNILSLSYGNIIQKLETRNMGLKP--ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 E IL + +++ + E + +V+ G+++F + + + Sbjct: 228 EYPILRMGNITYQGRIDLEGLKYINLEEKERSKYLVEKGDLLFNRTNSKELVGKTAVYDM 287 Query: 311 MERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPV 367 + I + V+P+ + +S Y++ + S ++ + ++ ++++ +P+ Sbjct: 288 DDPVAIAGYLIRVRPNEMGNSHYISGYLNSAHGKATLRSICKSIVGMANINAQEMQNIPI 347 Query: 368 LVPPIKEQ---FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++P I+ Q ++ V + D ++ L ++ SS A +GQ Sbjct: 348 MLPSIELQRKYQELVVVTKCKLQVFDT-------ALKLTEQLFSSLSYKAFSGQ 394 >gi|154496691|ref|ZP_02035387.1| hypothetical protein BACCAP_00983 [Bacteroides capillosus ATCC 29799] gi|150273943|gb|EDN01043.1| hypothetical protein BACCAP_00983 [Bacteroides capillosus ATCC 29799] Length = 428 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 60/414 (14%), Positives = 150/414 (36%), Gaps = 23/414 (5%) Query: 19 GAIPKHWK-VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 G +P W + K K T + +I+ E+ + + D + Sbjct: 27 GIMPIDWDDSIRAKDVFKNYTDKKHNGELEILASTQENGIVPRSQ-IGIDIQCSDEGVAG 85 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQR 136 ++G + L + + ++GI S + VL+P + ++ + + QR Sbjct: 86 YKKVSQGDFVIS-LRSFQGGIEYSRYEGIVSPAYTVLKPIKSISDVYYQHYFKTSRFIQR 144 Query: 137 IEAICEGATM-SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + G ++ G++ + PP+ EQ I E + ++ D LI + + IE Sbjct: 145 LNSAVYGIRDGKQIGYQDFGDLYIHYPPIDEQKKIAEIL----MQCDKLIELKRQRIEEE 200 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSG--IEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 K KK+ ++ + P + S + + E ++N + Sbjct: 201 KNKKKWILEETMKP---PKGILDSSNKYTGTLEDLVSKIETGISVNSTDDVNSGIDSNHK 257 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + + + + + + T V+ G ++ ++ + Sbjct: 258 FVLKTSAICDGVFIETECKKVVPEDYHRTSCAVEGGTLLVSRMNTPKLVGACAICYKSLP 317 Query: 314 GIITSA--YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVL 368 + + +D +L +++ S + G +++ +D LP+ Sbjct: 318 NVYLPDRLWKVSVKATVDPRWLNYILNSAQYKNLIQERAGGTSNSMKNISQKDFLGLPIS 377 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 P ++Q I + + + ID L++K+EQ + +++ + +TG + ++ Sbjct: 378 PPSYEKQVIIGDTL----SSIDNLIQKLEQEVDAWMQKKKLMMQLLLTGIVRVK 427 >gi|124515159|gb|EAY56670.1| Restriction endonuclease S subunit [Leptospirillum rubarum] Length = 142 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 32/100 (32%), Positives = 55/100 (55%), Gaps = 2/100 (2%) Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + +DS+Y+ +++ + + + + V + +P + EQ I + ++ E Sbjct: 34 NEVDSSYVIYVLTA--GRNELFKYDRTAIPQITVDQVASNRIPIPALSEQLAIASFLDSE 91 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 T+RID L+ + I LLKE RSS I AAVTG+ID+RG + Sbjct: 92 TSRIDTLISESRTFIDLLKEYRSSLITAAVTGKIDVRGFT 131 Score = 36.7 bits (83), Expect = 7.3, Method: Composition-based stats. Identities = 28/124 (22%), Positives = 50/124 (40%), Gaps = 1/124 (0%) Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 + +I + T K + + + + Sbjct: 4 ARGNSIGHVKLIHEPCTTTQTTIYSKNLKQNEVDSSYVIYVLTAGRNELFKYDR-TAIPQ 62 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + + +PIP L+EQ+ I + +ET RIDTLI+E FI+LLKE + +L++ VT Sbjct: 63 ITVDQVASNRIPIPALSEQLAIASFLDSETSRIDTLISESRTFIDLLKEYRSSLITAAVT 122 Query: 209 KGLN 212 ++ Sbjct: 123 GKID 126 >gi|159904437|ref|YP_001548099.1| restriction modification system DNA specificity subunit [Methanococcus maripaludis C6] gi|159885930|gb|ABX00867.1| restriction modification system DNA specificity domain [Methanococcus maripaludis C6] Length = 397 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 60/422 (14%), Positives = 123/422 (29%), Gaps = 34/422 (8%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD 67 ++KD+ IG IP W+V I G V S P Sbjct: 3 DEFKDTE---IGKIPVDWEVKEIGELVTFQRGHDLP------------VNSRKNGIYPVV 47 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 ++ + G+ G ++ +T V + + P+ + + Sbjct: 48 ASNGIVGYHNEYKVENEGLTIGRSGNLGEPFYVSTSFWPLNTTLYVKKFHNSHPKFMYYF 107 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L ++D+ G+ + + I I + +PPL EQ I + + + +I+ + Sbjct: 108 LKTLDLK----KYNSGSAVPSLNRNYIHPIKVAVPPLHEQQKIAQILSSLDDKIENNNQQ 163 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV-----PDHWEVKPFFALVT 242 E + N +++G P W+V + + Sbjct: 164 NKILEETANSIFKEWFVNFNFLDENGLSYFENNGEMEFNEDLGSEIPKGWKVGSIYEISE 223 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + S+ L G+ + R++ S+ T + G +++ + Sbjct: 224 VIYG----APFSSKLFNECGDGYPLIRIRDLKTLNPSFFTTEQHAKGTLIYPGNIVAGMD 279 Query: 303 RSLRS-AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFE 360 R + G + KP + F L Sbjct: 280 AEFRPYFWLGNIGYLNQRVCTFKPKYEWIHNYFIYETIKEPLNFFEKSKVGTTVIHLGKS 339 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 D+ ++VP N D ++E + L R + ++G+I Sbjct: 340 DIDTFKIIVPDEVTLK---NFYITIDPIFDKIIEN-SKQNRYLSNLRDLLLPKLMSGEIR 395 Query: 421 LR 422 L+ Sbjct: 396 LK 397 >gi|269838110|ref|YP_003320338.1| restriction modification system DNA specificity domain-containing protein [Sphaerobacter thermophilus DSM 20745] gi|269787373|gb|ACZ39516.1| restriction modification system DNA specificity domain protein [Sphaerobacter thermophilus DSM 20745] Length = 532 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 61/469 (13%), Positives = 138/469 (29%), Gaps = 77/469 (16%) Query: 21 IPKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W I+ + G ++ + + I ++++ K N Sbjct: 9 LPPGWTWATIRDTGEYINGLAFRKSDWGDEGLPIIRIQNLTD-----PSKPFNRTSRQVD 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ- 135 V I +G IL L G+ + + P + L + L Sbjct: 64 PVYIVHRGDILLSWS-ATLDAFTWRGETGVLNQHIFKVVPDNRLVHSPYLYHLLRHAIDL 122 Query: 136 -RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G+TM H + + +P+ PLAEQ I +I R+D + R Sbjct: 123 LKQSSHLHGSTMKHINRGPFLSFQVPLAPLAEQRRIVAEIEKHFTRLDAAVAALERARAN 182 Query: 195 LKEKK----------------------------------QALVSY------------IVT 208 LK + Q ++ + Sbjct: 183 LKRYRAAVLKAACEGRLVPTEAELARAEGRDYETGEQLLQRILQERRAKWEAEELAKLRA 242 Query: 209 KGLNPDVKMKD--------SGIEWVGLVPDHWEVKPFFA---LVTELNRKNTKLIESNIL 257 KG P + +P+ W + K + Sbjct: 243 KGKEPKDDRWKARYKEPAAPDTSDLPELPEGWVWARLDQLLGSLRNGISKKPDSESGTPI 302 Query: 258 SLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRSLR--SAQVMER 313 + + S + Y ++ G+++F + + + V + Sbjct: 303 LRINAVRPLSVNMEEIRYLSGSVDQYADYVLCQGDLLFTRYNGSPELVGVCGAVRAVDRK 362 Query: 314 GIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLV 369 + + + H S+++ ++ + + + + D++ +P+ + Sbjct: 363 VVYPDKLIRARLASHLCLSSFVQIVLNVGLSREFIARRIRTTAGQSGVSGSDIRSVPLPL 422 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PP+ EQ I + + ++ L +IE ++ + R + + A G+ Sbjct: 423 PPLAEQRRIVAEVERRLSVVEELERQIEANLKRAERLRQAILKRAFAGK 471 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 71/208 (34%), Gaps = 10/208 (4%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE-TRNMGLKPESYE 281 + +P W +N + + L I + ++ + Sbjct: 4 DNSPCLPPGWTWATIRDTGEYINGLAFRKSDWGDEGLPIIRIQNLTDPSKPFNRTSRQVD 63 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRS 339 IV G+I+ + + E G++ V P S YL L+R Sbjct: 64 PVYIVHRGDILLSWSATLD-----AFTWRGETGVLNQHIFKVVPDNRLVHSPYLYHLLRH 118 Query: 340 -YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 DL K + + + V + P+ EQ I I R+D V +E+ Sbjct: 119 AIDLLKQSSHLHGSTMKHINRGPFLSFQVPLAPLAEQRRIVAEIEKHFTRLDAAVAALER 178 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + LK R++ + AA G++ + E++ Sbjct: 179 ARANLKRYRAAVLKAACEGRL-VPTEAE 205 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 69/213 (32%), Gaps = 16/213 (7%) Query: 18 IGAIPKHWKVVPIKRF-TKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + +P+ W + + L G + +S + + V + S D Sbjct: 267 LPELPEGWVWARLDQLLGSLRNGISKKPDSESGTPILRINAVRPLSVNMEEIRYLSGSVD 326 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +G +L+ + + + V+ P ++ L L Sbjct: 327 QYADYVLCQGDLLFTRYNGSPELVGVCGAVRAVDRK--VVYPDKLIRARLASHLCLSSFV 384 Query: 135 QRI-----------EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 Q + I A S I ++P+P+PPLAEQ I ++ ++ Sbjct: 385 QIVLNVGLSREFIARRIRTTAGQSGVSGSDIRSVPLPLPPLAEQRRIVAEVERRLSVVEE 444 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 L + ++ + +QA++ L P Sbjct: 445 LERQIEANLKRAERLRQAILKRAFAGKLVPQDP 477 >gi|194442247|ref|YP_002043768.1| putative type I restriction-modification system S subunit [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|194400910|gb|ACF61132.1| putative type I restriction-modification system, S subunit [Salmonella enterica subsp. enterica serovar Newport str. SL254] Length = 571 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 81/497 (16%), Positives = 159/497 (31%), Gaps = 93/497 (18%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK--LNTGRTSESGKDIIYI-GLEDVE 57 +K K P+ S + +P W+ + + E +I LED+E Sbjct: 83 IKKQKPLPEI--SEEEKPFELPMGWEWTRLGSISNYGFCDKAEPEDVTPETWILELEDIE 140 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 T K + K + + S+ + F++G +LYGKL PYL K I+A+ G+C+T+ + + Sbjct: 141 KVTSKLINKVTFAERPFKSSKNRFSQGDVLYGKLRPYLDKVIVANEPGVCTTEIIPITSY 200 Query: 118 DVL-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + PE L+ L + + + G + + + + P+ EQ+ I ++ Sbjct: 201 GNIYPEFLRLLLKAPNFIIYANSSTHGMNLPRLGTEKAQQAVIELAPIQEQLRIVSRVDK 260 Query: 177 ETVRIDTLITERIRFIELLKE--------------------------------------- 197 D L + ++ ++ Sbjct: 261 LMSLCDQLEQHSLTSLDAHQQLVETLLTTLTDSQNADELAENWARISEHFDTLFTTEASI 320 Query: 198 --KKQALVSYIVTKGLNPDVKMKDSGIEWV------------------------------ 225 KQ ++ V L P + E + Sbjct: 321 AALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKDGKMKKQKPLPPISDEEK 380 Query: 226 -GLVPDHWEVKPFFALVT----ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 +P WE + + + + E + L Y + + + Sbjct: 381 PFELPIGWEWCRLGECINLISGQHLKPDEYEEECHGEMLPYITGPAEFGLISPTYSKYTN 440 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 E I G+I+ K ++ + I+ MA+ ++S YL ++ S Sbjct: 441 EKRAIAAKGDILITCKGAGLGKLNVADTNI----AISRQLMAINVIRMNSEYLKIILDSM 496 Query: 341 DLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV----LVEK 395 F + G G + EDV +++PP +EQ I + I+ + Sbjct: 497 YG--YFQSKGVGIAIPGISREDVMEPLIMLPPFEEQKRIMENLYKLNFFIEDIKFRIKSA 554 Query: 396 IEQSIVLLKERRSSFIA 412 + + L + I Sbjct: 555 QQTQLHLADALTDAAIN 571 >gi|68536333|ref|YP_251037.1| putative DNA restriction-modification system, specificity subunit [Corynebacterium jeikeium K411] gi|68263932|emb|CAI37420.1| putative DNA restriction-modification system, specificity subunit [Corynebacterium jeikeium K411] Length = 407 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 58/419 (13%), Positives = 127/419 (30%), Gaps = 33/419 (7%) Query: 24 HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W I + G + + + +I + DV G+ L D Sbjct: 4 DWIDTTIGELAVVTRGASPRPISSDRWFDDAGKVGWIRIADVNRSNGRELKVTSQRLSED 63 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S F L + + +I F+ L + + + L + + Sbjct: 64 GILRSRFLDSGTLILSIAASVGIPVITQIPACIHDGFVALTSVNADQKFMLYLLKAAEGR 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + + + + +P+ IP AEQ I + + I +L + Sbjct: 124 --LREAGQSGSQMNINSDIVRGLPVKIPADFAEQKAISSALWEKDDLISSLERLISKKQA 181 Query: 194 LLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + + Q L++ G + G +G + R + Sbjct: 182 IKQGMMQELLTGRTRLPGFSASWFSSTWGELALG-----------ISSGATPRRGVAEYW 230 Query: 253 ESNILSLSYGNIIQK---LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 I ++ + + +++ +I G + L+ + Sbjct: 231 NGEIPWVTSTELKRGPVDSIPQSITTAGLRAANLRIWPAGTFLMAITGLEAAGTRGKCGL 290 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVL 368 + + MAV P T + + + + G +QS VK+LP+ Sbjct: 291 LSVAAATNQSCMAVAPGPDLDTEFLFYYYLHYGNDLAFKYVQGTKQQSYTAAIVKKLPIH 350 Query: 369 VPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +P + EQ I V+ D + +E+ + + + + +TG+ L E + Sbjct: 351 LPSDVSEQQAIAQVLRDA----DHEIAALERCLESARNIKQGMMQELLTGRTRLPFEGE 405 >gi|223040252|ref|ZP_03610530.1| restriction modification system specificity subunit [Campylobacter rectus RM3267] gi|222878505|gb|EEF13608.1| restriction modification system specificity subunit [Campylobacter rectus RM3267] Length = 420 Score = 104 bits (258), Expect = 3e-20, Method: Composition-based stats. Identities = 49/404 (12%), Positives = 119/404 (29%), Gaps = 27/404 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W + + K R + ++ + + ++ + K Sbjct: 23 WNIKKLGCLMKPINERAGDKKYVLMSVTSGVGLIPQVEKFGREIAGNS--YKNYYVIRKN 80 Query: 85 QILYGKLGPYLRK------------AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 Y K A I + C +L Sbjct: 81 DFAYNKSSTKEFPEGYISMLKEYEEAAIPNSIFTCFRVIDDEYEPLFFEQLFNTNYHGKW 140 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + IE D K + N+P+ +P L EQ I + + + I + + Sbjct: 141 LRKYIEIGARAHGALSIDTKHLWNMPVAVPKLPEQQKIADCLSSIDDLISAEEKKLLLLN 200 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + K Q L P+ + + + + + Sbjct: 201 DYKKGWMQKLF--PAEGKTVPEWRFPEFKDSEGWEKLNIKKACYPSYSGGTPVTSKKEYY 258 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 +I + G I ++ + + + ++++ G+++ + ++ Sbjct: 259 NGDIPFIRSGEIGKEKTELFLTSEGLDNSSAKMIEKGDVLMALYGANSGDVAISPI---- 314 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-P 371 +G I A + ++ ++ + ++ + G + +L E VK + + P Sbjct: 315 KGAINQAILCLRHK--NNNAFLYHYLAFKKNWIVRTYIQGGQGNLSGEIVKSIELCSPQE 372 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I ++V ID L ++ I LK+ +++ + Sbjct: 373 PDEQNRIAAFLSV----IDELTSNQKEKIEALKQHKTALMQGLF 412 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 59/203 (29%), Gaps = 17/203 (8%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTS-ESGKDIIYIGLEDVESGTGKY 63 +P++KDS + W+ + IK+ +G T S K+ + + SG Sbjct: 222 RFPEFKDS---------EGWEKLNIKKACYPSYSGGTPVTSKKEYYNGDIPFIRSGEIGK 272 Query: 64 LPKDGNSRQS--DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + D S+ + KG +L G I+ G + L L+ K Sbjct: 273 EKTELFLTSEGLDNSSAKMIEKGDVLMALYGANSGDVAISPIKGAINQAILCLRHK---N 329 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVR 180 + I + + + +I + P EQ I + Sbjct: 330 NNAFLYHYLAFKKNWIVRTYIQGGQGNLSGEIVKSIELCSPQEPDEQNRIAAFLSVIDEL 389 Query: 181 IDTLITERIRFIELLKEKKQALV 203 + + Q L Sbjct: 390 TSNQKEKIEALKQHKTALMQGLF 412 >gi|170718633|ref|YP_001783832.1| restriction modification system DNA specificity subunit [Haemophilus somnus 2336] gi|168826762|gb|ACA32133.1| restriction modification system DNA specificity domain [Haemophilus somnus 2336] Length = 471 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 49/461 (10%), Positives = 113/461 (24%), Gaps = 79/461 (17%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ WK + K + K I L D Y N + Sbjct: 12 LPQGWKKYNLFEICK------PKQWKTIAVKDLTD-----TGYPVYGANGVIGYYHKYNH 60 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L G + I+ + + + + + + + I Sbjct: 61 -ENATVLLTCRGATCGEIHISKPYSYINGNAMCMDNLSEKITIEFLYFYLKSI--NLSFI 117 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ G+ + +P+PPL Q I KI ID I + LK+ +Q Sbjct: 118 ISGSAQPQITQVGLKKLEIPVPPLPTQQAIVNKIETLFADIDAGIDRLKTAQKQLKQYRQ 177 Query: 201 AL------------------------------VSYIVTKGLNPDVKMKDSGIEW------ 224 +L + + S +E Sbjct: 178 SLLKNAFNGELTKDWREQNADNLPSSSELLAQIQQAREAHHAKQLADWQSAVEKWEQTRK 237 Query: 225 --------------------VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + +P W + N Sbjct: 238 IGKKPSKPKAQTQAVQFEESLEDLPSGWGTIKINQVANIFTGATPLKSNPNYYINGSIPW 297 Query: 265 IQKLETRNMGLKPES---------YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + N ++ +++ ++ + + Sbjct: 298 VTSGSLNNAFVECADNFVTDLALKETNLKLLPKHTLLIAMYGEGKTRGKCSELLIEATTN 357 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 A + + + S + + G++ +L V + P + EQ Sbjct: 358 QAIAGIVLYENFPISRQFLKFYMFKNYADLRRQSSGGVQPNLNLSLVGNIVFPFPCLTEQ 417 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +I ++ + + D L + + + + + + + +A + Sbjct: 418 TEIVRILESKLSAYDQLATTLSKQLKQAELLKQAVLKSAFS 458 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 55/149 (36%), Gaps = 6/149 (4%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 Y + ++ + + G + M I +L + + Sbjct: 52 IGYYHKYNHENATVLLTCRGATCGEIHISKPYSYING--NAMCMDNLSEKITIEFLYFYL 109 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 +S +L + + + +K+L + VPP+ Q I N I A ID +++++ Sbjct: 110 KSINLSFII---SGSAQPQITQVGLKKLEIPVPPLPTQQAIVNKIETLFADIDAGIDRLK 166 Query: 398 QSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + LK+ R S + A G++ + + Sbjct: 167 TAQKQLKQYRQSLLKNAFNGELT-KDWRE 194 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 65/207 (31%), Gaps = 12/207 (5%) Query: 16 QWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDG 68 + + +P W + I + + TG T I ++ + + + Sbjct: 256 ESLEDLPSGWGTIKINQVANIFTGATPLKSNPNYYINGSIPWVTSGSLNNAFVECADNFV 315 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQF--LVLQPKDVLPELL 124 + + + K +L G K + + +VL + Sbjct: 316 TDLALKETNLKLLPKHTLLIAMYGEGKTRGKCSELLIEATTNQAIAGIVLYENFPISRQF 375 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + + G + + +GNI P P L EQ I + ++ D L Sbjct: 376 LKFYMFKNYADLRRQS-SGGVQPNLNLSLVGNIVFPFPCLTEQTEIVRILESKLSAYDQL 434 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGL 211 T + ++ + KQA++ + L Sbjct: 435 ATTLSKQLKQAELLKQAVLKSAFSARL 461 >gi|325924160|ref|ZP_08185722.1| restriction endonuclease S subunit [Xanthomonas gardneri ATCC 19865] gi|325545356|gb|EGD16648.1| restriction endonuclease S subunit [Xanthomonas gardneri ATCC 19865] Length = 425 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 51/411 (12%), Positives = 113/411 (27%), Gaps = 35/411 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP------KDGNSRQSDTSTVS 79 + + +L G KD G+ + G L + Sbjct: 17 EWKALGSLGELIRG-NGLQKKDFTETGIPAIHYGQIYTLYGLSTTKTKSFVSPEVAKQLR 75 Query: 80 IFAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELLQGWLLS-IDV 133 KG ++ L + + + +L+P + L + D Sbjct: 76 KVDKGDVVITNTSENLEDVGKALVYLGESQAVTGGHATILKPGNCLLGKYFAYFTQTDDF 135 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLIT 186 + +G + + +PIP L Q I + T L T Sbjct: 136 ASQKIKYAKGTKVIDVSATDMAKTFIPIPCPDNPKKSLETQAEIVRILDIFTELTTELTT 195 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 E + K++ +++ + E Sbjct: 196 ELTTELTARKKQYSYYRDRLLS--FEEGYVEWKTLPEMATDFGRGKSKHRPRNDARLYGG 253 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + +I S S+ + N + ++ G + + L Sbjct: 254 DVPFIQTGDIRSASHV-----ITDFNQTYSERGLKQSKLWPKGTLCITIAANIAETSILG 308 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRL 365 +I + S Y+ +L++S+ S + ++ ++L Sbjct: 309 FDACFPDSVIG---FVADSNKTSSGYVEYLLQSFKTKLEEKGKEKSSAQSNINLGTFEQL 365 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + PP++EQ I ++++ A + L E + I L ++ R ++ Sbjct: 366 KLPFPPLEEQVRIVSILDKFDALTNSLTEGLPLEIELRQKQYAYYRDLLLS 416 >gi|198277090|ref|ZP_03209621.1| hypothetical protein BACPLE_03298 [Bacteroides plebeius DSM 17135] gi|198269588|gb|EDY93858.1| hypothetical protein BACPLE_03298 [Bacteroides plebeius DSM 17135] Length = 529 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 70/441 (15%), Positives = 126/441 (28%), Gaps = 74/441 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IPK W+ I G+T ESGK+ Y+ +V + + D Sbjct: 86 EIPKGWEWARINAIGVSQLGKTLDRGKESGKEYPYLCSINVYWDSINLSKIKTFRLRDDE 145 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 KG +L + G Y R I + + L + Sbjct: 146 LPKYKLRKGDLLICEGGDYGRCCIWDRNEDMYYQNALHRVRFHGGLIPSFYKYVFELYRN 205 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + +G T+ H ++ + +I P+P + EQ I +I + L Sbjct: 206 IGYIVGQGQTIKHFTYENMRSILFPVPSIHEQKRIVSRIEEIQPIVKKYQRTEDALKRLN 265 Query: 196 KEKK----QALVSYIVTKGL---------------------------------------- 211 E ++++ + L Sbjct: 266 TEIFDKLKKSILQEAIQGKLVSQITEEGTAQELLKQIKTEKEKLVKKGKLKKSALTDSVI 325 Query: 212 -----NPDVKMKDSGI-----EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL-- 259 N + + E +P W + + + R + Sbjct: 326 YKGDDNKYWEKYGTETICVNDEIPFEIPATWIWVRLDNICSYIQRGKSPKYSPIKKYPVI 385 Query: 260 ------SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN---DKRSLRSAQV 310 G I K + + P SY +++ G++++ L + Sbjct: 386 AQKCNQWAGFCIDKAQFIDPNSLP-SYSEERLLQDGDLMWNSTGLGTLGRMAIYQSALNP 444 Query: 311 MERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLP 366 E + S ++P I S YL + S + V + GS ++ L VK Sbjct: 445 YELAVADSHVTVIRPLKEHILSQYLYYYFASDTVQSVIEDKSDGSTKQKELSTTTVKNYL 504 Query: 367 VLVPPIKEQFDITNVINVETA 387 V +PP +EQ I I T+ Sbjct: 505 VPIPPYREQQRIVEKIKTVTS 525 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 65/212 (30%), Gaps = 15/212 (7%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKN------TKLIESNILSLSYGNIIQKLETR 271 K E +P WE A+ K + + S++ L Sbjct: 77 KCIDEEIPFEIPKGWEWARINAIGVSQLGKTLDRGKESGKEYPYLCSINVYWDSINLSKI 136 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + G+++ + R + + + + G+ + Sbjct: 137 KTFRLRDDELPKYKLRKGDLLICEGG--DYGRCCIWDRNEDMYYQNALHRVRFHGGLIPS 194 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + ++ Y G + +E+++ + VP I EQ I + I I Sbjct: 195 FYKYVFELYRNIGYIVGQGQ-TIKHFTYENMRSILFPVPSIHEQKRIVSRIEEI-QPIVK 252 Query: 392 LVEKIEQSIVLLK-----ERRSSFIAAAVTGQ 418 ++ E ++ L + + S + A+ G+ Sbjct: 253 KYQRTEDALKRLNTEIFDKLKKSILQEAIQGK 284 >gi|322615694|gb|EFY12614.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 315996572] gi|322618755|gb|EFY15644.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-1] gi|322621831|gb|EFY18681.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-3] gi|322627556|gb|EFY24347.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-4] gi|322630863|gb|EFY27627.1| Type I restriction enzyme EcoAI specificity protein (S protein) (S.EcoAI) [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-1] gi|322637919|gb|EFY34620.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-2] gi|322643847|gb|EFY40395.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. NC_MB110209-0054] gi|322659905|gb|EFY56148.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 19N] gi|322661886|gb|EFY58102.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 81038-01] gi|322666368|gb|EFY62546.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. MD_MDA09249507] gi|322672787|gb|EFY68898.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 414877] gi|322676216|gb|EFY72287.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 366867] gi|322680701|gb|EFY76739.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 413180] gi|322684405|gb|EFY80409.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 446600] gi|323194257|gb|EFZ79454.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 609458-1] gi|323197404|gb|EFZ82544.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 556150-1] gi|323201479|gb|EFZ86543.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 609460] gi|323205993|gb|EFZ90955.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 507440-20] gi|323213005|gb|EFZ97807.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 556152] gi|323226181|gb|EGA10398.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. MB110209-0055] gi|323228834|gb|EGA12963.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. MB111609-0052] gi|323236555|gb|EGA20631.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 2009083312] gi|323239945|gb|EGA23992.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 2009085258] gi|323242008|gb|EGA26037.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. 315731156] gi|323247844|gb|EGA31781.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2009159199] gi|323251516|gb|EGA35387.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008282] gi|323258117|gb|EGA41794.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008283] gi|323263740|gb|EGA47261.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008284] gi|323265666|gb|EGA49162.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008285] gi|323270111|gb|EGA53559.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008287] Length = 589 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 68/511 (13%), Positives = 129/511 (25%), Gaps = 103/511 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLE 54 +K K P+ S + +P W+ V G+T KD I ++ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPVGWEWVTFSHLGHFFGGKTPSKMKDEYWGGTIPWVTPK 140 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQF 111 D+++ + ++ + G IL+ LR I + + Sbjct: 141 DMKTNLIVDSEDKVTPLAIE-DGLTKVSPGSILFVARSGILRRIFPVAITSIECTVNQDL 199 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQV-- 168 VL P +++ +E + G T+ + + P IPP AEQ Sbjct: 200 KVLSPFLSEISYYIRLMMNGFERYIVENLTKTGTTVESLLFDDFISHPFMIPPFAEQNRI 259 Query: 169 ---------------------------------------LIREKIIAETVRIDTLITERI 189 + + RI Sbjct: 260 LSTVKKLMSLCDQLEQHSLTSLDAHQQLVETLLTTLTDSQNADALAENWARISEHFDTLF 319 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------------ 219 + KQ ++ V L P + Sbjct: 320 TTEASIDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKDGKIKKQKPLPP 379 Query: 220 -SGIEWVGLVPDHWEVKP-------FFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 S E VP+ WE + + I ++ G+I + Sbjct: 380 ISDKEKPFEVPEGWEWCKFGLISEFINGDRGSNYPNKNEYVVHGIPWINTGHIEKNGTLS 439 Query: 272 NMGLKPESYETYQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + + + G++V+ K + I +S + Sbjct: 440 ITDMNFITEKKFNELRSGKIQSGDLVYCLRGATFGKTAFVKPYESG-AIASSLMIIRPFI 498 Query: 327 GIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 Y+ + S + + +L V PP++EQF I I Sbjct: 499 REMGEYIYNYLISPFGRSQIFRFDNGSAQPNLSANSVMLYAFACPPLQEQFRIHKKITEL 558 Query: 386 TARIDVLV----EKIEQSIVLLKERRSSFIA 412 D L + + L + I Sbjct: 559 FHICDNLKLQTQSAQQTQLHLADALTDAAIN 589 >gi|332686985|ref|YP_004456759.1| type I restriction-modification system, specificity subunit S [Melissococcus plutonius ATCC 35311] gi|332370994|dbj|BAK21950.1| type I restriction-modification system, specificity subunit S [Melissococcus plutonius ATCC 35311] Length = 328 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 63/334 (18%), Positives = 132/334 (39%), Gaps = 17/334 (5%) Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 L+ + AI+ G + F + P + + + D+ + E G+T Sbjct: 2 LFTSRAGIGKTAILLKE-GCTNQGFQSIVPHKERLDSYFIFSKTDDLKKYGEKNGAGSTF 60 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 K I ++ + IP + EQ I + ++D +IT + +ELLK+ KQ + + Sbjct: 61 IEVSGKQISHMSIIIPEIEEQQKIGNFL----KQLDDIITLQQHKLELLKQMKQGYLQKM 116 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 K +++ +G ++ L+R L +++I + YG+I Sbjct: 117 FPKNEEDKPEIRFAGYTGAWEQRKFGDMVERVKS-YSLSRDVETLEDTDIKYVHYGDIHT 175 Query: 267 KLET---RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS---LRSAQVMERGIITSAY 320 K+ + L Y+ Y+ + G+++ + + V + + Sbjct: 176 KVADRVTKLSNLPFIKYDDYEFIQKGDVIVADASEDYKGIATPSVIIEDVGYKLVAGLHT 235 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT 379 +A++P +DS +L +LM S K Y +G+G+ + + ++ P I EQ I Sbjct: 236 IALRPFDMDSVFLYYLMNSNSFRKHGYRVGTGMKVFGISYSNILNFETYFPQIDEQKKIG 295 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +ID + + + LLK+ + ++ Sbjct: 296 ----WMLLKIDDSIALHQHKLELLKQMKQGYLQK 325 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 51/119 (42%), Gaps = 5/119 (4%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LR 354 + + ++A +++ G + ++ PH ++ DL K G+G Sbjct: 1 MLFTSRAGIGKTAILLKEGCTNQGFQSIVPHKERLDSYFIFSKTDDLKKYGEKNGAGSTF 60 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + + + +++P I+EQ I N + ++D ++ + + LLK+ + ++ Sbjct: 61 IEVSGKQISHMSIIIPEIEEQQKIGNFL----KQLDDIITLQQHKLELLKQMKQGYLQK 115 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 59/196 (30%), Gaps = 17/196 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + + DI Y+ D+ + + K N Sbjct: 136 WEQRKFGDMVERVKSYSLSRDVETLEDTDIKYVHYGDIHTKVADRVTKLSNLPFIKYDDY 195 Query: 79 SIFAKGQILYGKLGPYLRKAIIA-------DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 KG ++ + + + + L+P D+ L + S Sbjct: 196 EFIQKGDVIVADASEDYKGIATPSVIIEDVGYKLVAGLHTIALRPFDMDSVFLYYLMNSN 255 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + G + + I N P + EQ +KI ++ID I Sbjct: 256 SFRKHGYRVGTGMKVFGISYSNILNFETYFPQIDEQ----KKIGWMLLKIDDSIALHQHK 311 Query: 192 IELLKEKKQALVSYIV 207 +ELLK+ KQ + + Sbjct: 312 LELLKQMKQGYLQKMF 327 >gi|168262425|ref|ZP_02684398.1| putative type I restriction-modification system, S subunit [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|205348748|gb|EDZ35379.1| putative type I restriction-modification system, S subunit [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] Length = 581 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 74/504 (14%), Positives = 153/504 (30%), Gaps = 97/504 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 +K K P+ S + +P W+ + I +T +D YI + + Sbjct: 83 IKKPKPLPEI--SEEEKPFELPAGWEWIKISEIGHDWGQKTP--DEDFTYIDVGSINKEY 138 Query: 61 GKY-LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQ 115 G P +++ + + I KG ++Y + PYL I + I ST F ++ Sbjct: 139 GIIEEPSILSAKDAPSRARKIVQKGTVIYSTVRPYLLNIAIIESAFSPEPIASTAFAIIH 198 Query: 116 PKD-VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 P + + +L S +E+ G + K + + +PP +EQ I +KI Sbjct: 199 PYTAMNANFIYYYLRSPVFINYVESCQTGVAYPAINDKQFFSGIIAVPPSSEQARITKKI 258 Query: 175 IAETVRIDTLITERIRFIELLKE------------------------------------- 197 D L + ++ ++ Sbjct: 259 KELMSLCDQLEQHSLTSLDAHQQLVETLLTTLTDSQNADELAENWARISKHFDTLFTTEA 318 Query: 198 ----KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 KQ ++ V L P + +E + + K + N+K + Sbjct: 319 SIDALKQTILQLAVMGKLVPQDPNDEP-VEKLLSRAKTHQQKRIENKEIQKNKKIDGVPY 377 Query: 254 SNILS--------------------LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 +I Y N + + + + + F Sbjct: 378 PDIQIPKTSSFILLNELAFITKLAGFEYTNYFSLEDAGEVPVVRAQNVKAFNLKKDNLKF 437 Query: 294 RFID----------------LQNDKRSLRSAQVMERG----IITSAYMAVKPHGIDSTYL 333 D + + + E + + IDS YL Sbjct: 438 ISYDVSKKLNRSALSTECLLMTFIGAGIGDTCIFEENKRWHLAPNVAKIEPFSDIDSHYL 497 Query: 334 AWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV---INVETARI 389 + S+ +F ++ + + SL ++ + V++PP++EQ I + +I Sbjct: 498 NIYLNSFTGRNEIFKSLKATAQPSLSMSTIREIMVILPPLQEQKRIVKKTNELLALCDKI 557 Query: 390 DVLVEKIEQ-SIVLLKERRSSFIA 412 + ++ +Q + L + I Sbjct: 558 NHYIQSAQQTQLHLADALTDAAIN 581 >gi|189424478|ref|YP_001951655.1| restriction modification system DNA specificity domain [Geobacter lovleyi SZ] gi|189420737|gb|ACD95135.1| restriction modification system DNA specificity domain [Geobacter lovleyi SZ] Length = 447 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 72/449 (16%), Positives = 133/449 (29%), Gaps = 70/449 (15%) Query: 23 KHWKVVPIKRFTKLNTGRTS----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +WK I ++ G T G+ + D S T D Sbjct: 2 SNWKRARIGDLCEIIKGETGLASAPPGEYPLVATGADRRSCTTWQFDTDAVCIP------ 55 Query: 79 SIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDV 133 L G L T + PKD L LS Sbjct: 56 --------LVSSTGHGKKTLNYVHYQSGKFALGTILAAVIPKDPSVLTARFLHLYLSHFK 107 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + +GA K I ++ +P+PPL EQ + + I L+TE Sbjct: 108 DTVLVPLMKGAANVSLSMKEIASVKIPVPPLDEQQSLIDLIFRIEDEHQELLTETNHQGV 167 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKD---------------------------------- 219 LLK+ +QAL+ V L + + Sbjct: 168 LLKQLRQALLQEAVAGELTTAWRKQHPVAKGDPQYDAAALLAQIKAEKERLVKEGKIRKE 227 Query: 220 ------SGIEWVGLVPDHWEVKPFFALVTELNRKNTK--LIESNILSLSYGNII-QKLET 270 + + +P+ W + ++ L E + L GNI K++ Sbjct: 228 KPLPPITDEDKPFDLPEGWGWCRLGEVADGFQYGSSVKSLKEGKVPVLRMGNIQCGKIDW 287 Query: 271 RNMGLKPES-YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 N+ ++ V G+++F + + M I + V G Sbjct: 288 SNLVYTNDTGEIRKYRVTNGDLLFNRTNSRELVGKTGLFDGMYEAIFAGYLVRVTMLGGI 347 Query: 330 STYLAW-LMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 S + ++ S + A + + ++ ++ +PP+ EQ I ++ Sbjct: 348 SATYSNGVLNSKFHREWCDANKTDALGQSNINATKLRDYFFPLPPLAEQQAIVARVDSLM 407 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 A ID L +++ + + + + A Sbjct: 408 ATIDELEKQVAERKEQAQLLMQTVLREAF 436 Score = 74.1 bits (180), Expect = 5e-11, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 59/200 (29%), Gaps = 10/200 (5%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ W + G + +S K+ + + + +++ G + + + Sbjct: 241 DLPEGWGWCRLGEVADGFQYGSSVKSLKEGKVPVLRMGNIQCGKIDWSNLVYTNDTGEIR 300 Query: 77 TVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQ---GWLLS 130 G +L+ + + + +LV Sbjct: 301 KYR-VTNGDLLFNRTNSRELVGKTGLFDGMYEAIFAGYLVRVTMLGGISATYSNGVLNSK 359 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + S+ + + + P+PPLAEQ I ++ + ID L + Sbjct: 360 FHREWCDANKTDALGQSNINATKLRDYFFPLPPLAEQQAIVARVDSLMATIDELEKQVAE 419 Query: 191 FIELLKEKKQALVSYIVTKG 210 E + Q ++ G Sbjct: 420 RKEQAQLLMQTVLREAFDVG 439 Score = 70.2 bits (170), Expect = 7e-10, Method: Composition-based stats. Identities = 23/105 (21%), Positives = 46/105 (43%) Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 G I +A + P + + +L + + + M SL +++ + + VPP+ Sbjct: 80 GTILAAVIPKDPSVLTARFLHLYLSHFKDTVLVPLMKGAANVSLSMKEIASVKIPVPPLD 139 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ + ++I L+ + VLLK+ R + + AV G+ Sbjct: 140 EQQSLIDLIFRIEDEHQELLTETNHQGVLLKQLRQALLQEAVAGE 184 >gi|57650596|ref|YP_186689.1| type I restriction-modification enzyme, S subunit, EcoA family protein [Staphylococcus aureus subsp. aureus COL] gi|87161451|ref|YP_494442.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus USA300_FPR3757] gi|151221911|ref|YP_001332733.1| type I restriction modification system, site specificity determination subunit [Staphylococcus aureus subsp. aureus str. Newman] gi|161510022|ref|YP_001575681.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus USA300_TCH1516] gi|294850681|ref|ZP_06791403.1| type I restriction enzyme [Staphylococcus aureus A9754] gi|57284782|gb|AAW36876.1| type I restriction-modification enzyme, S subunit, EcoA family [Staphylococcus aureus subsp. aureus COL] gi|87127425|gb|ABD21939.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus USA300_FPR3757] gi|150374711|dbj|BAF67971.1| type I restriction modification system, site specificity determination subunit [Staphylococcus aureus subsp. aureus str. Newman] gi|160368831|gb|ABX29802.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus USA300_TCH1516] gi|269941282|emb|CBI49677.1| type I restriction-modification system specificity protein [Staphylococcus aureus subsp. aureus TW20] gi|294822479|gb|EFG38926.1| type I restriction enzyme [Staphylococcus aureus A9754] gi|329314487|gb|AEB88900.1| Type I restriction modification system, site specificity determination subunit [Staphylococcus aureus subsp. aureus T0131] Length = 399 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 58/403 (14%), Positives = 136/403 (33%), Gaps = 39/403 (9%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 20 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGKFFSKLDRQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I T+ L + + EW + + K Sbjct: 197 LELLQQQKKGYMQKIFTQELRFKDENGEEYPEWENKFIKDIFIFENNRRKPITSSLREKG 256 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + ++ N ++ + + S Sbjct: 257 LYPYYGATGIIDYVKDYLFNNEE---------------RLLIGEDGAKWGQFETSSFIAN 301 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVP 370 + + + VK + + ++ + + K A +G L ++ + + +P Sbjct: 302 GQYWVNNHAHVVKSNDHNLFFMNYYLN----FKELRAFVTGNAPAKLTHANLCNINLKIP 357 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + EQ + ++ ID + I LLKER+ + Sbjct: 358 CLTEQ----DKVSALLKSIDNKMNNQMNRIELLKERKKGLLQK 396 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 51/181 (28%), Gaps = 6/181 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E + + I L NI + Sbjct: 10 PELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYY 129 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETAR 388 + +L+ K+F A G R+ L F+++ L + P I +EQ I + + Sbjct: 130 NFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGKFFSKLDRQ 189 Query: 389 I 389 I Sbjct: 190 I 190 >gi|315145853|gb|EFT89869.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX2141] Length = 406 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 57/406 (14%), Positives = 129/406 (31%), Gaps = 31/406 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIG---LEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ ++ TG T ++ +D + + + + + + Sbjct: 10 WEHRKVEELGDTFTGLTGKTKEDFGHGDATFVTYINVFSNPITDLKMTESVEIDAKQNQV 69 Query: 82 AKGQILYGKLGPYLRKAIIADFD------GICSTQFLVLQPKDVL-PELLQGWLLSIDVT 134 G I + + ++ ++ +P L P + L S +V Sbjct: 70 EYGDIFFTTSSETPEEVGMSSVWLGNEANVYLNSFCFGYRPVTELAPYYMAFMLRSPNVR 129 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ + +G + + + +I +P+P + EQ + + I + + EL Sbjct: 130 KKFIFLAQGISRYNISKNRVMDIEIPVPNIDEQRKVGQFFKDIDDLITLHQRKLEQLKEL 189 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTK 250 K Q + P ++ D EW +G + H TE + Sbjct: 190 KKTYLQVMFPR--KDERVPKLRFADFEGEWAQRKLGEISTHRSGTAIERYFTEDGK---- 243 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS--- 307 ++S+ K + + ++V GE+ D +D + Sbjct: 244 ---YKVISIGSYGTDSKYVDQGIRAISNEITNARVVHKGELTMVLNDKTSDGAIIGRSLL 300 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWL-MRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + E +I + P + A+ + + KV + G + + + VK L Sbjct: 301 IESEEEYVINQRTEIISPKDDFNVNFAYTTLNNTFRQKVKKIVQGGTQIYVNYPAVKNLM 360 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + P KEQ I + D + + + LK + +++ Sbjct: 361 LDFPSYKEQTKIGTF----FKQFDDTITLHQNKLDQLKTLKKTYLQ 402 >gi|253699078|ref|YP_003020267.1| restriction modification system DNA specificity domain protein [Geobacter sp. M21] gi|251773928|gb|ACT16509.1| restriction modification system DNA specificity domain protein [Geobacter sp. M21] Length = 404 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 65/423 (15%), Positives = 139/423 (32%), Gaps = 35/423 (8%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTG 61 P YK + V G IP+ W ++ L +G SG + Y+ G Sbjct: 7 PGYKQTEV---GVIPEEWDCCMLRDGIVLLSGHHILAHYCNMSGCGVPYLT------GPA 57 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + + + ++ + G IL G ++AD S Q + ++P + Sbjct: 58 DFRNGAIANTKFTNKPATLCSDGDILVTVKGSGSGTIVVADKMYCISRQLMAIRPLEWNS 117 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L LL + + I +P+PPL Q I + + V + Sbjct: 118 IFLYYSLLQNAL---HFKAASAGLIPGLSRSDILEQLVPLPPLPAQNTIADALSDVDVLL 174 Query: 182 DTLITERIRFIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 L + +L + Q L++ G + + +K G +G ++ A+ Sbjct: 175 GALDRLIAKKRDLKQAAMQQLLTGETRLPGFHGEWAVKRLGD--LGTFLKGNGIRKDEAM 232 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + + Y + +++ N + PE + + G+++F Sbjct: 233 --------SGALPCVRYGEIYTHHNNYVKSFNSWISPEVAVSATRLKKGDLLFAGSGETK 284 Query: 301 DKRSLRSAQVME-RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLK 358 ++ A + + + ++ ++ + + + G + Sbjct: 285 EEIGKCVACIDDCDAYAGGDIVILRLAAAHPLFMGYYCNIATVNAQKASRAQGDAVVHIG 344 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + V VP + EQ I V+ A + +EQ + + S + +TG+ Sbjct: 345 AVALSSVLVSVPSVSEQVAIAEVLFDMDAEL----AGLEQRRDKTRSLKQSIMQELLTGK 400 Query: 419 IDL 421 L Sbjct: 401 TRL 403 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 79/202 (39%), Gaps = 12/202 (5%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 VG++P+ W+ + L+ + N+ + + RN + + Sbjct: 13 EVGVIPEEWDCCMLRDGIVLLSGHHILAHYCNMSGCGVPYLTGPADFRNGAIANTKFTNK 72 Query: 284 Q--IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + G+I+ + + + I+ MA++P +S +L + + Sbjct: 73 PATLCSDGDILVTVKGSGSGTIVVA----DKMYCISRQLMAIRPLEWNSIFLYYSLLQNA 128 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 L F A +GL L D+ V +PP+ Q I + ++ DVL+ +++ I Sbjct: 129 LH--FKAASAGLIPGLSRSDILEQLVPLPPLPAQNTIADALSDV----DVLLGALDRLIA 182 Query: 402 LLKERRSSFIAAAVTGQIDLRG 423 ++ + + + +TG+ L G Sbjct: 183 KKRDLKQAAMQQLLTGETRLPG 204 >gi|161528113|ref|YP_001581939.1| restriction modification system DNA specificity subunit [Nitrosopumilus maritimus SCM1] gi|160339414|gb|ABX12501.1| restriction modification system DNA specificity domain [Nitrosopumilus maritimus SCM1] Length = 453 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 65/438 (14%), Positives = 147/438 (33%), Gaps = 40/438 (9%) Query: 20 AIPKHWKVVPIKRF------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD------ 67 IP+ W V + + + + G S K + +++ + + D Sbjct: 20 EIPEDWNYVILDKLTPKNEKSSIRMGPFGSSLKTHELLNSGKIKTLWIENIVNDKFTWKY 79 Query: 68 ---GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQF--LVLQPKDVL 120 + + +L +G + AI+ + G I S+ + L + +L Sbjct: 80 QKFITEEKYEKLKGFTVKPNDVLITMMGTLGKTAIVPEDIGRAIISSHLLKISLDHEKLL 139 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 P+ L +L S V ++I G M + I N+ + P ++EQ I + Sbjct: 140 PKFLYYFLKSNFVYRQIIKESRGLVMGGLNTGIIKNLLIKTPKISEQQKILSILSNVDNL 199 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA- 239 I + + L Q L++ + V + + ++ ++K Sbjct: 200 IYSYEKIIDQTKHLKIGLLQQLLTKGIKHKKFKKVFDRFGNYFEIPDSWEYVKIKKLVDE 259 Query: 240 ---------LVTELNRKNTKLIESNILSLS----YGNIIQKLETRNMGLKPESYETYQIV 286 EL+ K+ I+ I ++ + I + + K Sbjct: 260 KRILEIQDGNHGELHPKSLDFIQKGIPFVTADCLMNDNINYDLCKFLPEKFLKILRIGFA 319 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 +++ + + + + Y + I +L ++ +S+D K Sbjct: 320 KQKDVLLSHKGSVGNVAVVGNKFDRIILSPQTTYYRLSSKII-PKFLYYIFQSFDFQKQL 378 Query: 347 YAMG-SGLRQSLKFEDVKRLPVL-VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 ++ R + + + L + + I+EQ I +V++ + I L K + L K Sbjct: 379 KSLAKQSTRDYIGITNQQNLLIPYISSIEEQEKIISVLSDVDSNISNLELKKKSLESLKK 438 Query: 405 ERRSSFIAAAVTGQIDLR 422 + +TG+I ++ Sbjct: 439 ----GLMQKLLTGKIRVK 452 >gi|253735166|ref|ZP_04869331.1| EcoA family type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus TCH130] gi|253726830|gb|EES95559.1| EcoA family type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus TCH130] Length = 389 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 58/395 (14%), Positives = 129/395 (32%), Gaps = 37/395 (9%) Query: 24 HWKVVPIKRFTKLNTG--RTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ + K+ G +T + + + I ++ +E++++ K + + Sbjct: 20 EWEEKKLGEVAKIYDGTHQTPKYTNEGIKFLSVENIKTLNS---SKYISEEAFEKEFKIR 76 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 G IL ++G I++ + L L L L+ Q Sbjct: 77 PEFGDILMTRIGDIGTPNIVSSNEKFAYYVSLALLKTKNLNSYFLKNLILSSSIQNELWR 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + IG I + P EQ I + +I+ + + K Sbjct: 137 KTLHVAFPKKINKNEIGKIKINYPKKQEQQKIGQFFSKLDRQIELEEQKLELLQQQKKGY 196 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q + S + + G WE + F + N+ + E+ + Sbjct: 197 MQKIFSQELRFK------------DENGNDYPEWEERRFADIFKFHNKLRKPIKENLRVK 244 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR-SAQVMERGIIT 317 SY Y I D ++ RS V + + Sbjct: 245 GSYPYYGATGII--------DYVDDFIFDGNYLLIGEDGANIITRSAPLVYLVNGKFWVN 296 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV-PPIKEQF 376 + + P + + +L + +L + L +++K + V++ ++EQ Sbjct: 297 NHAHILSPLNGN---IQYLYQVAELVNYEKYNTGTAQPKLNIQNLKIISVVISTNLEEQQ 353 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 I + + +++D ++ EQ + LL++R+ + + Sbjct: 354 KIGSFL----SKLDRQIDLEEQKLELLQQRKKALL 384 >gi|227506258|ref|ZP_03936307.1| type I restriction modification DNA specificity protein [Corynebacterium striatum ATCC 6940] gi|227197159|gb|EEI77207.1| type I restriction modification DNA specificity protein [Corynebacterium striatum ATCC 6940] Length = 371 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 53/396 (13%), Positives = 127/396 (32%), Gaps = 40/396 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W +V + L G+ + + + G++ + Sbjct: 8 DWPMVRLGDVCHLKYGKALKKEERVA-----------GEFPVFGSAGSVGSHVEANFVGP 56 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ G+ G + I T F V + + + L D+ + + Sbjct: 57 VSVV-GRKGSAGFVEWSSGNCWIIDTAFGVFPKSEEQVDSRWLYWLLKDLRLG--RLQKH 113 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 A + + +PPL EQ I + + + ++L +E L Sbjct: 114 AAVPGISKADVVEEKFLLPPLDEQRRIAAILDEVDEALFRVNQSLGDLLQLKQELFTDLF 173 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 I + +G + + ++ +N + + ++SY Sbjct: 174 LRI------------ERESTIIGEYLESTQYGT-----SDKANENVGIPILRMGNVSYNG 216 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I + + + L E Y + G+++F + ++ ++ + Y+ Sbjct: 217 EIDLSDLKYVELDASDREKYS-LKAGDLLFNRTNSKDLVGKTAVVPELQEEYTYAGYLIR 275 Query: 324 K--PHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y++ + S K+ + ++ ++KRLP+ + EQ + Sbjct: 276 CRVNDKAVPEYISGFLNSVLGKKILRNTAKAIVGMANINANELKRLPIPQASLDEQQEFA 335 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + T+RID + ++++ LL+E + S A Sbjct: 336 S----LTSRIDDVESQMKRQRKLLQELQESLSTRAF 367 >gi|258451917|ref|ZP_05699935.1| type I restriction-modification enzyme [Staphylococcus aureus A5948] gi|282929312|ref|ZP_06336882.1| type I restriction enzyme, S subunit [Staphylococcus aureus A9765] gi|284024855|ref|ZP_06379253.1| type I restriction-modification enzyme, S subunit, EcoA family protein [Staphylococcus aureus subsp. aureus 132] gi|304380595|ref|ZP_07363269.1| EcoA family type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus ATCC BAA-39] gi|257860427|gb|EEV83257.1| type I restriction-modification enzyme [Staphylococcus aureus A5948] gi|282591836|gb|EFB96886.1| type I restriction enzyme, S subunit [Staphylococcus aureus A9765] gi|304340844|gb|EFM06770.1| EcoA family type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus ATCC BAA-39] gi|315195943|gb|EFU26306.1| type I restriction-modification enzyme, S subunit, EcoA family, putative [Staphylococcus aureus subsp. aureus CGS01] gi|320143744|gb|EFW35520.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus MRSA177] Length = 386 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 58/403 (14%), Positives = 136/403 (33%), Gaps = 39/403 (9%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 7 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 66 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 67 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 126 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 127 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGKFFSKLDRQIELEEQK 183 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I T+ L + + EW + + K Sbjct: 184 LELLQQQKKGYMQKIFTQELRFKDENGEEYPEWENKFIKDIFIFENNRRKPITSSLREKG 243 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + ++ N ++ + + S Sbjct: 244 LYPYYGATGIIDYVKDYLFNNEE---------------RLLIGEDGAKWGQFETSSFIAN 288 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVP 370 + + + VK + + ++ + + K A +G L ++ + + +P Sbjct: 289 GQYWVNNHAHVVKSNDHNLFFMNYYLN----FKELRAFVTGNAPAKLTHANLCNINLKIP 344 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + EQ + ++ ID + I LLKER+ + Sbjct: 345 CLTEQ----DKVSALLKSIDNKMNNQMNRIELLKERKKGLLQK 383 Score = 67.1 bits (162), Expect = 6e-09, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 48/177 (27%), Gaps = 6/177 (3%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LETRNMG 274 G E + + I L NI + + Sbjct: 1 FPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYIS 60 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + G+++ + ++ S + + + Sbjct: 61 KDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFG 120 Query: 335 -WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETARI 389 +L+ K+F A G R+ L F+++ L + P I +EQ I + +I Sbjct: 121 QYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGKFFSKLDRQI 177 >gi|148654896|ref|YP_001275101.1| restriction modification system DNA specificity subunit [Roseiflexus sp. RS-1] gi|148567006|gb|ABQ89151.1| restriction modification system DNA specificity domain [Roseiflexus sp. RS-1] Length = 392 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 55/416 (13%), Positives = 110/416 (26%), Gaps = 45/416 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W +K +N G+ + +P G + S Sbjct: 6 ELPKGWGWKRLKTLVTVNYGKGLSE------------KQRKAGNVPVYGANGVVGFHDTS 53 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL-QPKDVLPELLQGWLLSIDV-TQRI 137 I I+ G+ G T F + P+ + P+ L +L S + + Sbjct: 54 ITKGQTIVIGRKGSAGAVNWSEIACWPIDTTFFIDEFPEILYPQFLYQFLRSQQIDRLQQ 113 Query: 138 EAICEGATMSHADWKGIGNIPM--PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 A G + P LAEQ I ++ + + L Sbjct: 114 SAAIPGLNRDVLYSVEVPIPYPDDPAHSLAEQRRIVARLELLLGETRAMREDIQAMRRDL 173 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + ++ ++ + G +P W K L + Sbjct: 174 AQVMESALAEVFPNP--------------NGEMPKGWGWKSIDDLFELQQGASMSPRRRQ 219 Query: 256 ILSLSYGNIIQKLETRNMGLKP-------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + + E + G+++ Sbjct: 220 GRNPQPFLRTKNILWGEVDTSDVDVMDFTEDEIERLKLRKGDLLICEGGDVGRAAVWEDQ 279 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD--LCKVFYAMGSGLRQSLKFEDVKRLP 366 + + K D + + M++ +L +K Sbjct: 280 LPLVMYQNHIHRLRRKSDDADPKFYVYWMKAAYQLFKIYQGEESRTAIPNLSGRRLKNFL 339 Query: 367 VLVPPIKEQFDITNVINVETARI---DVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 V + EQ I + I D L+ + + I +L+ S +AAA G++ Sbjct: 340 VPTTSLTEQRRIVAYLEHIAEEIRAMDDLLAQDLRDIEVLE---QSILAAAFRGEV 392 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 64/200 (32%), Gaps = 10/200 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 G +PK W I +L G + ++ +++ G D Sbjct: 190 GEMPKGWGWKSIDDLFELQQGASMSPRRRQGRNPQPFLRTKNILWGEVDTSDVDVMDFTE 249 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIAD-FDGICSTQFLVLQPKDVLPELLQGWLLSID 132 D KG +L + G R A+ D + + + + ++ + Sbjct: 250 DEIERLKLRKGDLLICEGGDVGRAAVWEDQLPLVMYQNHIHRLRRKSDDADPKFYVYWMK 309 Query: 133 VTQRIEAICEGA----TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 ++ I +G + + + + N +P L EQ I + I + Sbjct: 310 AAYQLFKIYQGEESRTAIPNLSGRRLKNFLVPTTSLTEQRRIVAYLEHIAEEIRAMDDLL 369 Query: 189 IRFIELLKEKKQALVSYIVT 208 + + ++ +Q++++ Sbjct: 370 AQDLRDIEVLEQSILAAAFR 389 >gi|223043494|ref|ZP_03613540.1| type-I specificity determinant subunit [Staphylococcus capitis SK14] gi|222443283|gb|EEE49382.1| type-I specificity determinant subunit [Staphylococcus capitis SK14] Length = 399 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 64/410 (15%), Positives = 128/410 (31%), Gaps = 26/410 (6%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKY 63 +P++KD W FTK N G + + G + Sbjct: 11 RFPEFKD-----------EWIEKAFGDFTKTNQGLQIAISNRETQYKEGYYFYITNEFLK 59 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + I K IL + G + + + K Sbjct: 60 PNNKIKYYIKNPPNSVIANKDDILMTRTGNTGKVITGVHGAFHNNFFKIKFDNKQYDRLF 119 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + L S + +I ++ +T+ + +I IP EQ +K+ ++D Sbjct: 120 IYELLKSSKINNKILSLAGTSTIPDLNHSDFYSIKSFIPKYEEQ----QKLGIFFSKLDR 175 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 I +ELL+++K+ + I ++ L + + +WV ++ Sbjct: 176 QIELEEEKLELLEQQKRGYMQKIFSQDLRFKDENGNVYPKWVTQKIKELGNVYTGNTPSK 235 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 ++ + N + L+ L E ++ + + ++ I Sbjct: 236 KQSMYWNSNNYIWVTPTDINNKKDLKNSEYMLSDEGFKKARQLPKNTLLITCIASIGKNA 295 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 LR E G A+ P+ + + + G Q + + Sbjct: 296 ILR-----EEGSCNQQINALVPNSDKNVDFLYYAFEKVSKYMKRIAGKTATQIVNKSTFE 350 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + VP +EQ + +N D L+EK I LLK+R+ F+ Sbjct: 351 NISIEVPNFEEQLKVGRFLNS----FDKLIEKQVSKIELLKQRKQGFLQK 396 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 18/178 (10%), Positives = 46/178 (25%), Gaps = 9/178 (5%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+++ + EW+ + + NR+ + Sbjct: 8 PELRFPEFKDEWIEKAFGDFTKTNQGLQIAISNRETQYKEGYYFYITNEFLKPNNKIKYY 67 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + P S I + +I+ + + D + Sbjct: 68 IKNPPNSV----IANKDDILMTRTGNTGKVITGVHGAFHNNFF----KIKFDNKQYDRLF 119 Query: 333 LAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + L++S + ++ L D + +P +EQ + + +I Sbjct: 120 IYELLKSSKINNKILSLAGTSTIPDLNHSDFYSIKSFIPKYEEQQKLGIFFSKLDRQI 177 >gi|283786950|ref|YP_003366815.1| type I restriction modification system HsdS component [Citrobacter rodentium ICC168] gi|282950404|emb|CBG90053.1| putative type I restriction modification system HsdS component [Citrobacter rodentium ICC168] Length = 538 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 55/423 (13%), Positives = 133/423 (31%), Gaps = 44/423 (10%) Query: 27 VVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 V + GR S ++I YI ++ E+ + + Sbjct: 10 TVKLSELLITTKGRKPANVGDRSSVREIPYIDIKAFENNEI--------TSYCSPENAVL 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + +L G + + + ST + P + + + + Sbjct: 62 CNETDVLMVWDGSRSGLVGMGIYGALGSTLVAISIPFIL---PQYIYYFLLSKFDELNNN 118 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G + H D +G I PI ++ Q ++ KI ID T+ + + + Sbjct: 119 TRGMGIPHIDPVYLGEIDFPITSVSNQEILYSKIDQLYNLIDDGFTKTEKALAQISILWS 178 Query: 201 ALVSYIVTKGLNPDVKMKDSG---------------IEWVGLVPDHWEVKPF---FALVT 242 ++ ++ L + + +S E + ++P W ++ Sbjct: 179 LRITEALSGKLTKNWRDSNSQGKPLPVDIISINNQLEETLPVLPSDWRYVKLSSVIESIS 238 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNM----GLKPESYETYQIVDPGEIVFRFIDL 298 K + NI+ N + + Y + + ++ R Sbjct: 239 YGTSKKCTYEPQETGVIRIPNIVNGEICDNDLKFANFTEKEKDKYSLKEDDILIIRSNGS 298 Query: 299 QNDKRSLRSAQVMERGIITSAYM---AVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGL 353 N + + + G + + Y+ + ++ +YL + + S L K A S Sbjct: 299 LNLVGACARVKSKDTGYLFAGYLLRLRINLELVNPSYLKYALESPLLRKQIERIAKSSSG 358 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ E+++ L + + I+EQ I N + ++ ++ + + + + Sbjct: 359 VNNINAEEIRSLIIPICSIEEQLVIVNELENIKYNLEAQQVQLRNLLEKSELTKKEIVKD 418 Query: 414 AVT 416 A + Sbjct: 419 AFS 421 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 74/212 (34%), Gaps = 13/212 (6%) Query: 16 QWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDII----YIGLEDVESGTGKYLPKDGNSR 71 + + +P W+ V + + + TS+ I + ++ +G + Sbjct: 216 ETLPVLPSDWRYVKLSSVIESISYGTSKKCTYEPQETGVIRIPNIVNGEICDNDLKFANF 275 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF--------LVLQPKDVLPEL 123 + IL + L T + L + + V P Sbjct: 276 TEKEKDKYSLKEDDILIIRSNGSLNLVGACARVKSKDTGYLFAGYLLRLRINLELVNPSY 335 Query: 124 LQGWLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L+ L S + ++IE I + +++ + + I ++ +PI + EQ++I ++ ++ Sbjct: 336 LKYALESPLLRKQIERIAKSSSGVNNINAEEIRSLIIPICSIEEQLVIVNELENIKYNLE 395 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 + +E + K+ +V + G Sbjct: 396 AQQVQLRNLLEKSELTKKEIVKDAFSIGFKEM 427 >gi|148264154|ref|YP_001230860.1| restriction modification system DNA specificity subunit [Geobacter uraniireducens Rf4] gi|146397654|gb|ABQ26287.1| restriction modification system DNA specificity domain [Geobacter uraniireducens Rf4] Length = 393 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 51/400 (12%), Positives = 113/400 (28%), Gaps = 24/400 (6%) Query: 28 VPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 VP+ ++ G T D I + ++D++ + S ++ Sbjct: 6 VPLGGLVTISGGGTPSRNNDAYWGGSIPWATVKDLKDTMLSGTQETITPEGLRDSASNLI 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G ++ L K I D + E + ++++ Sbjct: 66 PAGSVIVATR-MGLGKVAINTMDVTINQDL-KAFSCGADLEPRYLLYFLLANASHLDSMG 123 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +GAT+ + ++ +P+PPL EQ I + ELL+ Sbjct: 124 KGATVKGITLDVLKDLSVPLPPLPEQKRIAAILDKADSIRRKRQEAVRLTEELLRSV--- 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVG--LVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + N M +G+ G + I++ + + Sbjct: 181 FLDMFGDPESNNWPMMTIAGVALPGVSAIRTGPFGSQLLHSEFVDEGVAVLGIDNAVANE 240 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 N + + + V PG+++ + + + Sbjct: 241 FRWNERRYISEAKYR-----ELSRYTVRPGDVIITIMGTCGRCAVVPDDIPVAINTKHLC 295 Query: 320 YMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFD 377 + + ++ + ++ + G L +K +P+ +PP+K Q Sbjct: 296 CITLDQTKCLPVFVHAYFLQHCIARRYLEKTAKGAIMDGLNMGIIKDMPIPIPPLKLQEK 355 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 I A I+ L ++ S + A G Sbjct: 356 FACSI----AAIEKLRHTTRSTLAEQDTLFHSLLQRAFNG 391 >gi|257454707|ref|ZP_05619962.1| restriction modification system DNA specificity domain protein [Enhydrobacter aerosaccus SK60] gi|257447888|gb|EEV22876.1| restriction modification system DNA specificity domain protein [Enhydrobacter aerosaccus SK60] Length = 384 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 58/401 (14%), Positives = 117/401 (29%), Gaps = 29/401 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 KV P++ K+ +G +S + I + DV G ++ Sbjct: 4 KVKPLRDLVKITSGFAFKSNLFNTENNGLPLIRIRDVVRGYSD------TFYDAEYKDEY 57 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G L G G + A + + + ++ + IE Sbjct: 58 VIQNGDALIGMDGEF-NLAKWRGGKALLNQRVCKIESTSEELSQGYLIRFLPKALKDIED 116 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H K I NI +P+PPL EQ I + + + ELL Sbjct: 117 KTPFVTVKHLSIKDINNIQIPLPPLTEQKRIAQILDKADELRQKRQQSIEKLDELL---- 172 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + + K S I+ + PF + + + + I + Sbjct: 173 -----QACFLKIFENEKCSMSQIKDLLENEKSIRTGPFGSQLLHSEFVDEGIAVLGIDN- 226 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + N + + R + + V P +++ + + + Sbjct: 227 AVKNTFKWAKPRFITPEKYKQLKRYTVKPKDVIITIMGTCGKCAVVPDKIPLSINTKHLC 286 Query: 320 YMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFD 377 + + + +L + + + G L +K LPV +P I+ Q + Sbjct: 287 CITLDFDKCNPEFLHSYFLLHPISINFLKSRAKGAIMAGLNMSIIKDLPVELPSIEIQNE 346 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +I + K+ + S A G+ Sbjct: 347 FAE----LKTKIGLQKSKLINQLQEQDNLFQSLQQRAFNGE 383 >gi|270296269|ref|ZP_06202469.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270273673|gb|EFA19535.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 523 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 59/436 (13%), Positives = 113/436 (25%), Gaps = 67/436 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 IP W+ I + +G T +I ++ D+ +G S+ Sbjct: 86 EIPNGWQWERIGNIFETTSGSTPLSRNPDYYKNGNINWVRTTDLNNGILNKTEIQITSKA 145 Query: 73 SDTSTVSIFAKGQILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 +SI + + G + K I FD + +QP + Sbjct: 146 IIDYNLSILPQTSVCVAMYGGAGTIGKHCILHFDTTINQSVCAIQPNGFCNMDYIHTFIE 205 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 ++ + + I + +PIPP EQ+ I K+ I + R Sbjct: 206 YQRPFWMDFAAGSRKDPNINQLIIKHCLLPIPPQEEQLRIVTKLNQLYPYIYQYGNSQNR 265 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDSGIEWV--------------------- 225 ++ KE ++++ + L P + + + E + Sbjct: 266 LNQINKEIWHSLKKSILQEAIQGKLVPQITEEGTAQELLEPIRQEKLQLVKEGKLKKSAL 325 Query: 226 -----------------------------GLVPDHWEVKP---FFALVTELNRKNTKLIE 253 +P+ W F + +K IE Sbjct: 326 TDSIIFRGDDNKYFEKIGKTEQDITDEIPFDIPNTWVWVRHNDLFDISGGSQPPKSKFIE 385 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 I+ + + +I G+I+ K Sbjct: 386 REKEGYIRLFQIRDYGSNPQPIYIPLSTASKISQKGDILLARYGASLGKVFYAEYGAY-N 444 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + I Y+ S + ED+ L +PP+ Sbjct: 445 VALAKVIPLYESRLIFQKYIFLYYCSSIYQNEIVNRSRCAQAGFNKEDLNSLLFPLPPLS 504 Query: 374 EQFDITNVINVETARI 389 EQ+ I A I Sbjct: 505 EQYRIVEKYEKAIASI 520 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 62/213 (29%), Gaps = 12/213 (5%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN--MGL 275 K E +P+ W+ + + + + N ++ + N + Sbjct: 77 KCIDEEIPFEIPNGWQWERIGNIFETTSGSTPLSRNPDYYKNGNINWVRTTDLNNGILNK 136 Query: 276 KPESYETYQIVDPGEIVFRFIDLQ-----NDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + I+D + + + + I + A++P+G + Sbjct: 137 TEIQITSKAIIDYNLSILPQTSVCVAMYGGAGTIGKHCILHFDTTINQSVCAIQPNGFCN 196 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 Y + ++ +K + +PP +EQ I +N I Sbjct: 197 MDYIHTFIEYQRPFWMDFAAGSRKDPNINQLIIKHCLLPIPPQEEQLRIVTKLNQLYPYI 256 Query: 390 DVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 + + KE + S + A+ G+ Sbjct: 257 YQYGNSQNRLNQINKEIWHSLKKSILQEAIQGK 289 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 54/167 (32%), Gaps = 3/167 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV- 78 IP W V ++ G K I + + + ST Sbjct: 356 DIPNTWVWVRHNDLFDISGGSQPPKSKFIEREKEGYIRLFQIRDYGSNPQPIYIPLSTAS 415 Query: 79 SIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I KG IL + G L K A++ + + + L ++ + + Q Sbjct: 416 KISQKGDILLARYGASLGKVFYAEYGAYNVALAKVIPLYESRLIFQKYIFLYYCSSIYQN 475 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + + + ++ P+PPL+EQ I EK I + Sbjct: 476 EIVNRSRCAQAGFNKEDLNSLLFPLPPLSEQYRIVEKYEKAIASIMS 522 >gi|148977937|ref|ZP_01814490.1| Restriction endonuclease S subunit [Vibrionales bacterium SWAT-3] gi|145962883|gb|EDK28155.1| Restriction endonuclease S subunit [Vibrionales bacterium SWAT-3] Length = 585 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 67/466 (14%), Positives = 131/466 (28%), Gaps = 89/466 (19%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W + TG+T + + + ++G + + G+ L + Sbjct: 106 LPQGWAWSRLGNAGIGATGKTPSTKQTEFFEGKLPFVGPGQI-TQNGQLLEAEKFLSSEG 164 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +G IL +G + KA +A + Q L+P + + L + + Sbjct: 165 LLHSTEAVQGDILMVCIGGSIGKAALATQTVGFNQQINALRPLIMESDYLYVAVSTNSFY 224 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE----------------- 177 + + G+ + + +PI PL+EQ I K+ Sbjct: 225 EGLLDKATGSATPIINRGKWEELLVPIAPLSEQHRIVAKVDELMTLCDQLEQQTEASIEA 284 Query: 178 ------------------------TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 RI + + KQ ++ V L P Sbjct: 285 HQLLVRTLLDTLTNSADAEELMQNWARISEQFDTLFTTEASIDQLKQTILQLAVMGKLVP 344 Query: 214 DVKMKDS-------------------------------GIEWVGLVPDHWEVKPFFALVT 242 + E +P+ WE L Sbjct: 345 QDPNDEPAEKLLERIAEEKAQLIKDKKIKKQKALPPIADDEKPFELPNGWEWSKLQDLCF 404 Query: 243 ---ELNRKNTKLIESNILSLSYGNIIQKLE-----TRNMGLKPESYETYQIVDPGEIVFR 294 + K E+ LS N+ + + G+I+ Sbjct: 405 KITDGEHSTPKRTETGHYLLSARNVTNDGIILGDVDYVPDFEFARIRNRCDPNIGDILIS 464 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGL 353 + +L + ++A + + YLA ++RS L Sbjct: 465 CSGSVG-RVALVDRDNSYSMVRSAAMIRPCNTNLIKEYLALMLRSTYLQFQMKNRSKQSA 523 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + +L + L ++PP+ EQ I + ++ D L IE S Sbjct: 524 QANLFLGAISNLVGIIPPLSEQERIVSKVSELLVVCDQLKSHIEDS 569 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 58/191 (30%), Gaps = 12/191 (6%) Query: 229 PDHWEVKPFFALVTELNRKNT-----KLIESNILSLSYGNIIQKLETRN--MGLKPESYE 281 P W K + E + + G I Q + L E Sbjct: 107 PQGWAWSRLGNAGIGATGKTPSTKQTEFFEGKLPFVGPGQITQNGQLLEAEKFLSSEGLL 166 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 G+I+ I K +L + V A++P ++S YL + + Sbjct: 167 HSTEAVQGDILMVCIGGSIGKAALATQTVG----FNQQINALRPLIMESDYLYVAVSTNS 222 Query: 342 LCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + + + + L V + P+ EQ I ++ D L ++ E SI Sbjct: 223 FYEGLLDKATGSATPIINRGKWEELLVPIAPLSEQHRIVAKVDELMTLCDQLEQQTEASI 282 Query: 401 VLLKERRSSFI 411 + + + Sbjct: 283 EAHQLLVRTLL 293 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 67/197 (34%), Gaps = 9/197 (4%) Query: 20 AIPKHWKVVPIKRFT-KLNTG--RTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDT 75 +P W+ ++ K+ G T + + Y + +V + D Sbjct: 389 ELPNGWEWSKLQDLCFKITDGEHSTPKRTETGHYLLSARNVTNDGIILGDVDYVPDFEFA 448 Query: 76 STVSIFAK--GQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + G IL G R A++ + + S + +++ E L L S Sbjct: 449 RIRNRCDPNIGDILISCSGSVGRVALVDRDNSYSMVRSAAMIRPCNTNLIKEYLALMLRS 508 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +++ + + ++ I N+ IPPL+EQ I K+ V D L + Sbjct: 509 TYLQFQMKNRSKQSAQANLFLGAISNLVGIIPPLSEQERIVSKVSELLVVCDQLKSHIED 568 Query: 191 FIELLKEKKQALVSYIV 207 A+V V Sbjct: 569 STVTQLHLTDAIVEQAV 585 >gi|213428389|ref|ZP_03361139.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. E02-1180] Length = 381 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 57/325 (17%), Positives = 124/325 (38%), Gaps = 12/325 (3%) Query: 105 GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 + T F + Q L+ +L S D ++ + G + + + + + + +PIPP+ Sbjct: 15 FLLHTLFDLNQLIYFSEYYLKRFLESSDYWNQLSLMSAGNAVQNVNAQKLSTLTVPIPPI 74 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL----NPDVKMKDS 220 AEQ +I EK+ ++D+ + ++LK +QA+++ V+ L + S Sbjct: 75 AEQKIIAEKLDTLLAQVDSTKARLEQIPQILKRFRQAVLAAAVSGLLIGSNKRNHHPLCS 134 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ------KLETRNMG 274 +W +P W V + LV K ++ + Y I LE Sbjct: 135 EWQW-PDLPSTWSVHKYSELVDSRLGKMLDKAKNFGSATKYLGNINVRWFSFDLENLQDI 193 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 L + + G+++ Q + + + A I +L Sbjct: 194 LISDIERRELSLKLGDVLICEGGEPGRCAIWSEPQDIPVIFQKALHRARVKDKIIPEWLV 253 Query: 335 WLMRSY-DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + +++ + + + L + + P+ VPP++EQ +I + A D + Sbjct: 254 YNLKNDSNNISLSQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAWADTIE 313 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ 418 +++ ++ + S +A A G+ Sbjct: 314 KQVNNALNRVNSLTQSILAKAFRGE 338 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 68/212 (32%), Gaps = 9/212 (4%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDG 68 S QW +P W V G+ + K+ Y+G +V + Sbjct: 134 SEWQW-PDLPSTWSVHKYSELVDSRLGKMLDKAKNFGSATKYLGNINVRWFSFDLENLQD 192 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQ 125 G +L + G R AI D I + KD + Sbjct: 193 ILISDIERRELSLKLGDVLICEGGEPGRCAIWSEPQDIPVIFQKALHRARVKDKIIPEWL 252 Query: 126 GWLLSIDVTQRIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + L D + + G T+ H K + N P+ +PPL EQ I ++ DT+ Sbjct: 253 VYNLKNDSNNISLSQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAWADTI 312 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 + + + Q++++ L + Sbjct: 313 EKQVNNALNRVNSLTQSILAKAFRGELTAQWR 344 >gi|308272898|emb|CBX29502.1| hypothetical protein N47_J04830 [uncultured Desulfobacterium sp.] Length = 387 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 58/414 (14%), Positives = 133/414 (32%), Gaps = 40/414 (9%) Query: 24 HWKVVPIKRFT--KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ ++ + + K I Y+ + G + ++ + + + Sbjct: 3 GWRKCKLRDVIASNVQSINKDYPHKTIQYLDTGSITCGKIE-SYQEIMLENTPSRAKRLV 61 Query: 82 AKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQR 136 + I+Y + P R + + ST F V++ L P + +L S ++ + Sbjct: 62 REHDIIYSTVRPIQRHYGFIVNPPANLVVSTGFSVIKTNRELAEPLFIYNFLTSNEIVEV 121 Query: 137 IEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ I +G+T I N+ + +PPL EQ I + + + I R + Sbjct: 122 LDVIADGSTSAYPSLKPSDIENLDILLPPLPEQKAIASVLSSLDGK----IDLLHRQNKT 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+ Q L +E + F V + E+ Sbjct: 178 LEAMAQTLFRQWF--------------VEEAQEDWQDGKFPDEFDYVMGASPPGESYNET 223 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + E R + + + + + + + E+ Sbjct: 224 GVGIPMFQGNAD-FEFRFPKRRIFTTDPKKFAEKYDTLVSVRAPVG-----AQNMANEKC 277 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPI 372 I A + + Y + L K + + S+ D + +++PP Sbjct: 278 CIGRGVAAFRYKRNNGYYTYTYFKMKSLMKEIQSFNDTGTVFGSISKADFEAFEIIIPPS 337 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + + E ID V I L++ R + + ++G++ ++ E++ Sbjct: 338 EL----VDRCQAEIKPIDDKVITNIIQIHTLEKLRDTLLPKLMSGEVQVKYEAK 387 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 44/187 (23%), Gaps = 2/187 (1%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ G + + G + + R T Sbjct: 196 EDWQDGKFPDEFDYVMGASPPGESYNETGVGIPMFQGNADFEFRFPKRRIFTTDPKKFAE 255 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA-IC 141 K L P +A+ + K + + + I++ Sbjct: 256 KYDTLVSVRAPV-GAQNMANEKCCIGRGVAAFRYKRNNGYYTYTYFKMKSLMKEIQSFND 314 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + IPP + +I ++ T I + +L Sbjct: 315 TGTVFGSISKADFEAFEIIIPPSELVDRCQAEIKPIDDKVITNIIQIHTLEKLRDTLLPK 374 Query: 202 LVSYIVT 208 L+S V Sbjct: 375 LMSGEVQ 381 >gi|253991411|ref|YP_003042767.1| type i restriction enzyme ecobi specificity protein (s protein (s.ecobi) [Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949] gi|253782861|emb|CAQ86026.1| type i restriction enzyme ecobi specificity protein (s protein (s.ecobi) [Photorhabdus asymbiotica] Length = 377 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 52/392 (13%), Positives = 118/392 (30%), Gaps = 36/392 (9%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 V + ++ G+ + +G G + + ++ T I G I Sbjct: 4 VCRLVDVCEITMGQAPAGSSYNEKGMGYALIAGAGDFGEMTPHPKKYTTKASKISKVGDI 63 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 + + + +D + L+PK L W + + G+T Sbjct: 64 ILC-IRATIGDLNWSDKEYCLGRGVAGLRPKKELDSK-YLWHYLNTRKSLLSSKGTGSTF 121 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 I ++ + + PL EQ I + + ++L + QA + Sbjct: 122 KQISRSHIESLEIELFPLHEQKRIAAILDKADSIHRKH----EQAVKLADDFLQATFLEM 177 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT------ELNRKNTKLIESNILSLS 260 +P V P HW + T + +E+ I + Sbjct: 178 FG---DPVV------------NPSHWNKYKLKDITTKIGSGATPKGGKSVYVENGISFIR 222 Query: 261 YGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 NI + + + V +I+ + ++ ++ + Sbjct: 223 SLNIHDNKFLHKDLVFINDAQASALNNVEVKKNDILLNITGASVCRCAIVDNNIL-PARV 281 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIK 373 ++ ++ YL ++ S + + R++L + ++ L + +PPI+ Sbjct: 282 NQHVSIIRSEVVNHDYLLHILISPSFKQYLLSIARSAGATREALTKDQIENLSIPIPPIE 341 Query: 374 EQFDITNVINVETARIDVLVEKIEQS-IVLLK 404 Q + ++ +V E S I L Sbjct: 342 LQNKFGIIKKKIKNMVEKMVSASENSLIEALN 373 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 65/177 (36%), Gaps = 13/177 (7%) Query: 22 PKHWKVVPIKRF-TKLNTGRTSESGK------DIIYIGLEDVESGTGKYLP-KDGNSRQS 73 P HW +K TK+ +G T + GK I +I ++ + N Q+ Sbjct: 185 PSHWNKYKLKDITTKIGSGATPKGGKSVYVENGISFIRSLNIHDNKFLHKDLVFINDAQA 244 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQGWLLS 130 K IL G + + I D + + + +++ + V + L L+S Sbjct: 245 SALNNVEVKKNDILLNITGASVCRCAIVDNNILPARVNQHVSIIRSEVVNHDYLLHILIS 304 Query: 131 IDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 Q + +I GAT I N+ +PIPP+ Q ++ ++ Sbjct: 305 PSFKQYLLSIARSAGATREALTKDQIENLSIPIPPIELQNKFGIIKKKIKNMVEKMV 361 >gi|117920473|ref|YP_869665.1| restriction modification system DNA specificity subunit [Shewanella sp. ANA-3] gi|117612805|gb|ABK48259.1| restriction modification system DNA specificity domain [Shewanella sp. ANA-3] Length = 391 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 53/411 (12%), Positives = 125/411 (30%), Gaps = 42/411 (10%) Query: 28 VPIKRFTKLNTGRTSESGKDII--------YIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 V + + G + D I +I ++D + + + Sbjct: 2 VKLGDIFDIARGGSPRPIDDYITDADDGLNWISIKDASNSNKYINSTKLKIKPEGLTKTR 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRI 137 + G L + R I + G +LV P V + L S + QR Sbjct: 62 MVYPGDFLLTNSMSFGRP-YIMNTTGCIHDGWLVLSGNPDKVNSDYFYYLLGSDTLKQRF 120 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + GA + + + + + ++ +P+PPLAEQ I + +L Sbjct: 121 SGLAAGAVVKNLNTELVKSVEVPLPPLAEQKRIAAILDKADAIRRKRQQAIQLADDL--- 177 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 L + + +P K + ++T K+ + +E + Sbjct: 178 ----LRAVFLEMFGDPVTNPKGFQKSKL---------SALADVITGFAFKSAEYVEDSDD 224 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQI-------VDPGEIVFRFIDLQ---NDKRSLRS 307 ++ + L +++ +I ++ G+++ K + Sbjct: 225 AVRLCRGVNTLTGYFEWKDTAFWDSNKINGLHNYKLEAGDVILAMDRPWISSGLKVCVFP 284 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + ++ + YL + S K + ++K + Sbjct: 285 ENERDTYLVQRVARIRSKQPRYTDYLYSSILSPAFEKHCCPTE-TTVPHISPVELKNFEI 343 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 LVP + + +++ +++E ++ + +S A +GQ Sbjct: 344 LVPD----EKSVSKYHDIVSKLRRSKDRMEMNLTEANQIFNSLSQKAFSGQ 390 >gi|32476948|ref|NP_869942.1| polypeptide HsdS [Rhodopirellula baltica SH 1] gi|32447496|emb|CAD79085.1| probable HsdS polypeptide, part of CfrA family [Rhodopirellula baltica SH 1] Length = 411 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 59/414 (14%), Positives = 125/414 (30%), Gaps = 20/414 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 WK P++ G+ ++ K+ + Y+ +V G + Sbjct: 2 SWKSAPLEDVADFRLGKMLDAKKNRGELMPYLANVNVRWGEFDLTDLREMRFEEHEVEKF 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G I+ + G R AI + + ++P D L + + + Sbjct: 62 ELRSGDIVMCEGGEPGRCAIWKNQCENMMIQKAIHRIRPHDCLDNRFLFYSFVDLGKRGV 121 Query: 138 EA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G+T+ H + + + +P+PP+ Q I + + I+ + Sbjct: 122 LSGFFTGSTIKHLPREKLALVHVPVPPIDVQQRIADVLSGYDDLIENNRRRMELLEASAR 181 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK---PFFALVTELNRKNTKLIE 253 + + + G W K + + Sbjct: 182 QLHEEWFVRLRFPGHEHAHFANGVPNGWEQQTIAELVEAGELELQTGPFGTQLKASDYTD 241 Query: 254 SNILSLSYGNI----IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 ++ NI ++ + + + ++ G+IVF + +RS Q Sbjct: 242 VGAPVINVRNIGLGSVRPDKLEFVPEEVAERLHKHVLASGDIVFGRKGAVDRHVLIRSMQ 301 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS--GLRQSLKFEDVKRLPV 367 M I +T ++ R + S SL E + R+ V Sbjct: 302 HGWVQGSDCIRMRSNSERISTTLMSLAFRDERHKEWMLTQCSNKATMASLNQEVLGRIEV 361 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L+P + + + A++D L E L RS + ++G+I + Sbjct: 362 LIPSSNIRKIFLEMASTIFAQMDNL----ESQNERLVAGRSHLLPRLMSGEIPV 411 Score = 41.3 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 61/202 (30%), Gaps = 18/202 (8%) Query: 21 IPKHWKVVPIKRFT-----KLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W+ I +L TG T D +G + + + Sbjct: 205 VPNGWEQQTIAELVEAGELELQTGPFGTQLKASDYTDVGAPVINVRNIGLGSVRPDKLEF 264 Query: 74 DTST------VSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQ 125 + A G I++G+ G R +I + + + ++ Sbjct: 265 VPEEVAERLHKHVLASGDIVFGRKGAVDRHVLIRSMQHGWVQGSDCIRMRSNSERISTTL 324 Query: 126 G---WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + ATM+ + + +G I + IP + + E ++D Sbjct: 325 MSLAFRDERHKEWMLTQCSNKATMASLNQEVLGRIEVLIPSSNIRKIFLEMASTIFAQMD 384 Query: 183 TLITERIRFIELLKEKKQALVS 204 L ++ R + L+S Sbjct: 385 NLESQNERLVAGRSHLLPRLMS 406 >gi|227547713|ref|ZP_03977762.1| EcoA family type I restriction-modification enzyme, S subunit [Corynebacterium lipophiloflavum DSM 44291] gi|227080211|gb|EEI18174.1| EcoA family type I restriction-modification enzyme, S subunit [Corynebacterium lipophiloflavum DSM 44291] Length = 264 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 60/281 (21%), Positives = 122/281 (43%), Gaps = 24/281 (8%) Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + IP+P+PPL Q I + + E +D LI E R + L +K L+ I+ Sbjct: 1 MVNTVDLQQIPIPLPPLETQRRIADYLDKEISEMDALIEEFERLVNDLSNRKLMLIDNII 60 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 K +P++ + G+ +T+ + + IE + LS + IQ Sbjct: 61 YKS-DPELCLAPLGL-------------FLAEPITDGPHETPEFIEEGVPFLSV-DGIQN 105 Query: 268 LETRNMGLKPESYETYQIVDP------GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 E G + S E ++ G+I+ +++ + E I + + Sbjct: 106 GELTFAGCRFISQEDHERFAKKAKPRTGDILMGKAASTGKIALVKTKR--EFNIWSPLAI 163 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 ID +L +++S + + + ++++ D+ R+ + V I +Q I + Sbjct: 164 IRPNASIDPRWLTLVLKSPFSQRQINDLSTFNTQRNIAMGDIPRIRIPVMEIGKQGQIAD 223 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ ETA++D L+E+ + I LK R+++ I VTG+ ++ Sbjct: 224 ELDRETAKMDALIEESTRLIENLKARKNALITEVVTGRKEV 264 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/167 (21%), Positives = 71/167 (42%), Gaps = 4/167 (2%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK--GQILYGKLGPYLRKAII-- 100 + + ++ ++ +++G + S++ G IL GK + A++ Sbjct: 92 EEGVPFLSVDGIQNGELTFAGCRFISQEDHERFAKKAKPRTGDILMGKAASTGKIALVKT 151 Query: 101 ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMP 160 I S ++ + P L L S ++I + T + I I +P Sbjct: 152 KREFNIWSPLAIIRPNASIDPRWLTLVLKSPFSQRQINDLSTFNTQRNIAMGDIPRIRIP 211 Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + +Q I +++ ET ++D LI E R IE LK +K AL++ +V Sbjct: 212 VMEIGKQGQIADELDRETAKMDALIEESTRLIENLKARKNALITEVV 258 >gi|257889087|ref|ZP_05668740.1| type I restriction-modification system DNA specificity subunit [Enterococcus faecium 1,141,733] gi|257825159|gb|EEV52073.1| type I restriction-modification system DNA specificity subunit [Enterococcus faecium 1,141,733] Length = 404 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 63/404 (15%), Positives = 136/404 (33%), Gaps = 31/404 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + + G+ E D + + K++ +G ++ + Sbjct: 16 EGWEQHKLIEVARYRNGKAHEQAIDE---SGKYIVV-NSKFVSTNGRVKKYTNIIIDPLK 71 Query: 83 KGQILYGKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 K ++ + KAI + D + S + + ++ Sbjct: 72 KNELAFVLSDVPNGKAIARTFLVDKEHRYSLNQRIAGITPHKDTDSYFLNVLMNRNPYFL 131 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G ++ + N P EQ I ++D I R ++LLKE Sbjct: 132 KFDNGVGQTNLTKADVENFIGHYPSYEEQQKIGTF----FKQLDDTIALHQRKLDLLKET 187 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+ + + P K + + G + WE + L KN + + Sbjct: 188 KKGFLQKMF-----PKNGAKVPEVRFPGFT-EDWEERKLKELFQPSKNKNNNGLYNQKDI 241 Query: 259 LSYG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 L+ +I K + ES + Y+IV G++++ ++ + + GI Sbjct: 242 LAASLGTELIPKRTFFGLKSTRESVKNYRIVKTGDLIYTKSPIKGFPNGIIRSNKGNVGI 301 Query: 316 ITSAYMAVK-PHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRL--PVLVP 370 + Y I+S+ + + +F + G R ++ D++ L V +P Sbjct: 302 VPPLYCVYTLQKDINSSIIQLYFEDKNRLDFYLFPLVNVGARNNVNITDLEFLEGKVTIP 361 Query: 371 -PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I +++ + ++ + LLKE + F+ Sbjct: 362 KSYEEQSKIVQF----MEQLNTTIALHQRKLDLLKETKKGFLQK 401 >gi|163858305|ref|YP_001632603.1| type I restriction-modification system, S subunit [Bordetella petrii DSM 12804] gi|163262033|emb|CAP44335.1| type I restriction-modification system, S subunit [Bordetella petrii] Length = 797 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 61/491 (12%), Positives = 134/491 (27%), Gaps = 99/491 (20%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVES----GTGKYLPKDGNSRQS 73 +P+ W+ + T + G T K+ + + + ++ R Sbjct: 87 LPQGWEWARLGEITDIIRGITFPASEKTKEPASGRIACLRTANVQKKIEWSDLLYIDRTF 146 Query: 74 DTSTVSIFAKGQILYGK--LGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGW 127 + + + I+ + K + VL+ V P + Sbjct: 147 MSKNSQLVRQDDIVMSMANSRELVGKVAVVSEMPVNEATFGGFLGVLRTHKVAPLYVLHL 206 Query: 128 LLSIDVTQR-IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT--- 183 L + I+A + +++ + +P+PP++EQ I KI R D Sbjct: 207 LNTSYARSSLIDAASQTTNIANISLGKLNPFLVPVPPISEQHRIVAKIDELMARCDELEK 266 Query: 184 --------------------------------------LITERIRFIELLKEKKQALVSY 205 E + E ++A++ Sbjct: 267 LRTAQQGARLTVHAAAIKQLLNVAEPGQHQRAQTFLAEHFGELYTIKGNVAELRKAILQL 326 Query: 206 IVTKGLNPDVKMKDSGIEWVGLV-------------------------------PDHWEV 234 V L P E + + P WE Sbjct: 327 AVMGKLVPQDPNDQPASELLKEIEAEKQRLVQEGKIKKTKPLPPVTEEEKPYALPQGWEW 386 Query: 235 KPFFALV-------TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY---- 283 F L + I + ++ +++ + + Sbjct: 387 VRFGDLTTEISTGPFGSMIHKSDYIVDGVPLVNPSHMVDGKIFHDPSVTVSEIMAKKLDS 446 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++ +IV + ++ +A+ T +++ I Y+ + ++ Sbjct: 447 HRLNTNDIVMARRGEMG-RCAIVTAESDGFLCGTGSFVLRFVDRIYRQYILTIFKTEITR 505 Query: 344 KVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + +L + ++PV +PP EQ I I+ D L ++IE + Sbjct: 506 EFLGGNSVGTTMTNLNHGILNKMPVSLPPHPEQTRIVTKIDELMVMCDALDQQIEATSSK 565 Query: 403 LKERRSSFIAA 413 E ++ I A Sbjct: 566 RTELLNALIHA 576 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 44/278 (15%), Positives = 84/278 (30%), Gaps = 56/278 (20%) Query: 197 EKKQALVSYIVTKGL--NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR-------K 247 ++ +A +V +G P + E +P WE + + K Sbjct: 54 QEIEAEKQQLVKEGQIKKPKPLPPVAEEEKPYALPQGWEWARLGEITDIIRGITFPASEK 113 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESY--ETYQIVDPGEIVFRFIDLQN--DKR 303 + I L N+ +K+E ++ ++ + Q+V +IV + + K Sbjct: 114 TKEPASGRIACLRTANVQKKIEWSDLLYIDRTFMSKNSQLVRQDDIVMSMANSRELVGKV 173 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFED 361 ++ S + ++ H + Y+ L+ + A + ++ Sbjct: 174 AVVSEMPVNEATFGGFLGVLRTHKVAPLYVLHLLNTSYARSSLIDAASQTTNIANISLGK 233 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS-----------IVLL------- 403 + V VPPI EQ I I+ AR D L + I L Sbjct: 234 LNPFLVPVPPISEQHRIVAKIDELMARCDELEKLRTAQQGARLTVHAAAIKQLLNVAEPG 293 Query: 404 -----------------------KERRSSFIAAAVTGQ 418 E R + + AV G+ Sbjct: 294 QHQRAQTFLAEHFGELYTIKGNVAELRKAILQLAVMGK 331 >gi|296132421|ref|YP_003639668.1| restriction modification system DNA specificity domain protein [Thermincola sp. JR] gi|296030999|gb|ADG81767.1| restriction modification system DNA specificity domain protein [Thermincola potens JR] Length = 426 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 58/416 (13%), Positives = 135/416 (32%), Gaps = 34/416 (8%) Query: 26 KVVPIKRFT-KLNTGRTSESGKD------IIYIGLEDVESGTGKYLPK-DGNSRQSDTST 77 + + K+ +G T + GK+ I +I ++ Y + Q+ + Sbjct: 19 NLTRLGNICTKIGSGLTPKGGKNAYKESGISFIRSLNIYDFHFDYTDLAYIDDNQARKLS 78 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQGWLLSIDVT 134 I + IL G + + + + + + +++ Sbjct: 79 NVIVERHDILLNITGASVARCCMVPDNVLPARVNQHVSIVRIDKSKANPYYVLYSLNSPI 138 Query: 135 QRIEAI---CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + GAT + I N + +P L Q I + A I+ Sbjct: 139 NKQRLLTLAQGGATREALTKETISNFEINLPSLTVQNKIAAILSAYDDLIENNTRRIKIL 198 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 E Q + K P + +G +P+ W+VK + + ++ Sbjct: 199 E----EMAQLIYREWFVKFRFPGHEKVRMVESELGPIPEGWKVKTLGEVCNIVMGQSP-- 252 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 ES + + N + ++E Y +D I R Sbjct: 253 -ESKYYNTKGEGLPFHQGVSNFNNRYPTHEVYCTIDKRIAHAGDILFSVRAPVGRINIAD 311 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + I+ A++ ++L + M++ + G + S+ +D+ + V+VP Sbjct: 312 RKLIVGRGLAAIRHIAGLQSFLYYQMKAIFKEEDIIGNG-AIFNSITKQDLLNVKVIVPS 370 Query: 372 IKEQFDITNVINVETAR----IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + ++ + ID L+ + + ++L+ R + ++G++D+ Sbjct: 371 --------DCVDNDFNNKVEHIDQLILNLTRKNLILRRTRDLLLPKLISGELDVED 418 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 57/187 (30%), Gaps = 3/187 (1%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G IP+ WKV + + G++ ES G + + T Sbjct: 228 LGPIPEGWKVKTLGEVCNIVMGQSPESKYYNTKGEGLPFHQGVSNFNNRYPTHEVYCTID 287 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 I G IL+ P R IAD I ++ L + + + Sbjct: 288 KRIAHAGDILFSVRAPVGR-INIADRKLIVGRGLAAIRHIAGLQS--FLYYQMKAIFKEE 344 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + I GA + + + N+ + +P K+ I L + + Sbjct: 345 DIIGNGAIFNSITKQDLLNVKVIVPSDCVDNDFNNKVEHIDQLILNLTRKNLILRRTRDL 404 Query: 198 KKQALVS 204 L+S Sbjct: 405 LLPKLIS 411 >gi|158520839|ref|YP_001528709.1| restriction modification system DNA specificity subunit [Desulfococcus oleovorans Hxd3] gi|158509665|gb|ABW66632.1| restriction modification system DNA specificity domain [Desulfococcus oleovorans Hxd3] Length = 577 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 62/483 (12%), Positives = 133/483 (27%), Gaps = 94/483 (19%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP W V + + GR ++ + + + ++ + Y Sbjct: 101 KIPSGWNVTRLGEVLNVLNGRAYKNHEMLQEGTPLLRVGNLFTSDIWYYS------DLAL 154 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G ++Y + + + + + VT+ Sbjct: 155 EPEKYIDNGDLIYAWSASFGPFIWQGGKVIYHYHIWKLDLFDESCLYKNFLYHYLAAVTE 214 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI-------------- 181 +I+A G M H + + + +PPLAEQ I K+ Sbjct: 215 KIKASGSGIAMIHMTKARMEKLVIMVPPLAEQHRIVTKVDELMALCDRLEQEQSQSIETH 274 Query: 182 -----------------------DTLITERIRFIELLKEKK----QALVSYIVTKGLNPD 214 I + + ++ Q ++ V L P Sbjct: 275 QTLVKTLLAALTTAGDAKACAQTWQQIADHFEILFTTEQSIDHLKQTILQLAVMGKLVPQ 334 Query: 215 VKM-------------------------------KDSGIEWVGLVPDHWEVKPFFALVTE 243 K + E +P+ WE F L+ Sbjct: 335 DPNDEPASVLLEKIDKEKARLIKAGKIKNQTPLPKITEDEKPFDLPEGWEWVRFNQLIEP 394 Query: 244 LNRKNT------KLIESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + +E+ + + G+ KL +++ + + + GEI+ Sbjct: 395 NIPISYGVLVPGPDVENGVPFVRIGDLDLINPPKLPEKSIDKEIDRQYERTRLLGGEILM 454 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG- 352 + + I + I YL WL+++ + F Sbjct: 455 GVVGSIGKLGVAPDSWRGAN-IARAICRIAPTRLILKQYLIWLLQTDLMQSGFIGATRTL 513 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +L ++ +PP+ EQ I ++ A D L E++ Q+ + + + + Sbjct: 514 AQPTLNVGLIRAAATPLPPLAEQHRIVAKVDKLMALCDTLKERLHQAQTIQTQLSDAIVG 573 Query: 413 AAV 415 A+ Sbjct: 574 QAL 576 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 71/202 (35%), Gaps = 9/202 (4%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K + E +P W V ++ LN + K E + + Sbjct: 92 KITEDEKPQKIPSGWNVTRLGEVLNVLNGRAYKNHEMLQEGTPLLRVGNLFTSDIWYYSD 151 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + E + +D G++++ S +I ++ +S + Sbjct: 152 LALEPEKYIDNGDLIYA------WSASFGPFIWQGGKVIYHYHIWKLDLFDESCLYKNFL 205 Query: 338 RSY--DLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 Y + + A GSG + +++L ++VPP+ EQ I ++ A D L + Sbjct: 206 YHYLAAVTEKIKASGSGIAMIHMTKARMEKLVIMVPPLAEQHRIVTKVDELMALCDRLEQ 265 Query: 395 KIEQSIVLLKERRSSFIAAAVT 416 + QSI + + +AA T Sbjct: 266 EQSQSIETHQTLVKTLLAALTT 287 >gi|49482661|ref|YP_039885.1| restriction and modification system specificity protein [Staphylococcus aureus subsp. aureus MRSA252] gi|282903020|ref|ZP_06310913.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus C160] gi|282907409|ref|ZP_06315257.1| type I restriction enzyme S subunit [Staphylococcus aureus subsp. aureus Btn1260] gi|282912640|ref|ZP_06320436.1| type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus WBG10049] gi|282918216|ref|ZP_06325957.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus C427] gi|283959868|ref|ZP_06377309.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus A017934/97] gi|295426966|ref|ZP_06819605.1| type I restriction enzyme [Staphylococcus aureus subsp. aureus EMRSA16] gi|297588823|ref|ZP_06947464.1| EcoA family type I restriction-modification system [Staphylococcus aureus subsp. aureus MN8] gi|49240790|emb|CAG39455.1| putative restriction and modification system specificity protein [Staphylococcus aureus subsp. aureus MRSA252] gi|83776728|gb|ABC46687.1| Sau1hsdS1 [Staphylococcus aureus] gi|282317913|gb|EFB48281.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus C427] gi|282324336|gb|EFB54652.1| type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus WBG10049] gi|282330308|gb|EFB59829.1| type I restriction enzyme S subunit [Staphylococcus aureus subsp. aureus Btn1260] gi|282597479|gb|EFC02438.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus C160] gi|283789460|gb|EFC28287.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus A017934/97] gi|295129418|gb|EFG59045.1| type I restriction enzyme [Staphylococcus aureus subsp. aureus EMRSA16] gi|297577334|gb|EFH96047.1| EcoA family type I restriction-modification system [Staphylococcus aureus subsp. aureus MN8] gi|312436476|gb|ADQ75547.1| EcoA family type I restriction-modification system [Staphylococcus aureus subsp. aureus TCH60] gi|315193172|gb|EFU23571.1| putative restriction and modification system specificity protein [Staphylococcus aureus subsp. aureus CGS00] Length = 410 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 61/407 (14%), Positives = 139/407 (34%), Gaps = 36/407 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESG---TGKYLPKDGNSRQSDTST 77 W+ + + G G + +DV + L N + Sbjct: 20 EWEEKKVGELLEFKNGLNKGKEYFGSGSSIVNFKDVFNNRSLNTNNLTGKVNVNSKELKN 79 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S KG + + + + + + + S L +PK + + + + Sbjct: 80 YS-VEKGDVFFTRTSEVIGEIGYPSVILNDPENTVFSGFVLRGRPKSGIDLINNNFKRYV 138 Query: 132 DVTQRIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 T + ++M+ I + KI ++D I + Sbjct: 139 FFTNSFRKEMITKSSMTTRALTSGSAINKMKVIYPVSAKEQRKIGDFFSKLDRQIELEEQ 198 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 +ELL+++K+ + I ++ L + HWE + E N ++ Sbjct: 199 KLELLQQQKKGYMQKIFSQEL--------RFKDENSEDYPHWENSKIEKYLKERNERSD- 249 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + II+ E + Y++V +I + + + + Sbjct: 250 -KGQMLSVTINSGIIKFSELDRKDNSSKDKSNYKVVRKNDIAYNSMRMWQGASG----RS 304 Query: 311 MERGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLP 366 GI++ AY + P S+ + +++ + F GL +LK++ +K + Sbjct: 305 NYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTSDTWNLKYKQLKNIN 364 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +P ++EQ I + ++D+L+ K + I +L++ + SF+ Sbjct: 365 IDIPVLEEQEKIGDF----FKKMDILISKQKIKIEILEKEKQSFLQK 407 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 67/184 (36%), Gaps = 9/184 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFA 82 HW+ I+++ K R+ + + + + SG K+ D ++ D S + Sbjct: 231 HWENSKIEKYLKERNERSDKGQM----LSVT-INSGIIKFSELDRKDNSSKDKSNYKVVR 285 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I Y + + + ++++GI S + VL P L G+ I Sbjct: 286 KNDIAYNSMRMWQGASGRSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINS 345 Query: 143 G---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + +K + NI + IP L EQ I + + I + + + Sbjct: 346 QGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKIKIEILEKEKQSFL 405 Query: 200 QALV 203 Q + Sbjct: 406 QKMF 409 >gi|298483406|ref|ZP_07001583.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. D22] gi|298270354|gb|EFI11938.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. D22] Length = 470 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 54/404 (13%), Positives = 112/404 (27%), Gaps = 29/404 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +PK W K T + + I +++++G + KD + + Sbjct: 69 EVPKGWVWTTFGNVCKKLTDGSHNPPPKCSNGYTVISAQNIKNGKIVFTDKDRYTDELGF 128 Query: 76 ST----VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 I IL G AI + + + + + V L S Sbjct: 129 QKENPRTQITNGDIILGIIGGSIGNVAIYDLSVPVIAQRSISIIDTYVSNIYCFYLLQST 188 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G + + + +P+PPL+EQ I +I I+ + ++ Sbjct: 189 IFQSLFLEKSIGNAQAGVYLGELDKLYIPLPPLSEQQRIVTEIKRWFALIEQIEFDKADL 248 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV------------GLVPDHWEVKPFFA 239 +K+ K ++ + L P + IE + G P W Sbjct: 249 QTTIKQTKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYPIGWLETILGE 308 Query: 240 LVTELNRK------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 L K +++ + + + E V G+++ Sbjct: 309 LFNHNTGKALNSSNKEGVMKDYLTTSNVYWNKFDFTVIKQMPFKEIELDKCTVTKGDLLV 368 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I + ++P + +Y Sbjct: 369 CEGGDIGRSAIW---NYDYDICIQNHIHRLRPKIDLCVPFYYYTLAYLKENNLIGGKGIG 425 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 L + ++ + +PP+ EQ I I + +D + +E Sbjct: 426 LLGLSSNALHKIEMPLPPLTEQQRIVQKIEELFSVLDNIQNALE 469 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 66/199 (33%), Gaps = 7/199 (3%) Query: 227 LVPDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 VP W F +T+ + + +S NI + + Sbjct: 69 EVPKGWVWTTFGNVCKKLTDGSHNPPPKCSNGYTVISAQNIKNGKIVFTDKDRYTDELGF 128 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERG---IITSAYMAVKPHGIDSTYLAWLMRSY 340 Q +P + + + +I +++ + + Y +L++S Sbjct: 129 QKENPRTQITNGDIILGIIGGSIGNVAIYDLSVPVIAQRSISIIDTYVSNIYCFYLLQST 188 Query: 341 DLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +F G + + ++ +L + +PP+ EQ I I A I+ + Sbjct: 189 IFQSLFLEKSIGNAQAGVYLGELDKLYIPLPPLSEQQRIVTEIKRWFALIEQIEFDKADL 248 Query: 400 IVLLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 249 QTTIKQTKSKILDLAIHGK 267 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 54/169 (31%), Gaps = 4/169 (2%) Query: 19 GAIPKHWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 G P W + NTG+ +++ G Y+ +V + + Sbjct: 295 GHYPIGWLETILGELFNHNTGKALNSSNKEGVMKDYLTTSNVYWNKFDFTVIKQMPFKEI 354 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 KG +L + G R AI IC + + + + + Sbjct: 355 ELDKCTVTKGDLLVCEGGDIGRSAIWNYDYDICIQNHIHRLRPKIDLCVPFYYYTLAYLK 414 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + +G + + I MP+PPL EQ I +KI +D Sbjct: 415 ENNLIGGKGIGLLGLSSNALHKIEMPLPPLTEQQRIVQKIEELFSVLDN 463 >gi|312792864|ref|YP_004025787.1| restriction modification system DNA specificity domain [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180004|gb|ADQ40174.1| restriction modification system DNA specificity domain [Caldicellulosiruptor kristjanssonii 177R1B] Length = 419 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 63/423 (14%), Positives = 150/423 (35%), Gaps = 38/423 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ WK V + K I+ D G P+ Sbjct: 7 KLPEDWKGVELGEVLAYEQ-----PNKYIVKDEQYDKSHGIPVLTPEKTFILGFTQEHQG 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I+ ++ + I F S+ +L+ K L + Sbjct: 62 IYNNIPVIIFDDFTTESRYIAFPFKLK-SSAVKILKSKCNFVNLYYVYNSMQL-----LN 115 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+ +P PPL EQ I E + I+ + ++ + + Sbjct: 116 FKPGSEHKRFWISEYSKFLIPFPPLPEQRKIAEILETIDNAIEKIDAIIEKYKRIKQGLM 175 Query: 200 QALVSYIVT---KGLNPDVKMKDSGIEW-----VGLVPDHWEVKPFFA-----LVTELNR 246 Q L++ V +G + +++D I+ +G +P+ W+++ ++T+ + Sbjct: 176 QDLLTKGVVSEGEGESERWRLRDENIDKFKDSPLGRIPEEWKIRKLDHREITIMITDGSH 235 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPES-------YETYQIVDPGEIVFRFIDLQ 299 + + +E++ + I + K S +++F Sbjct: 236 YSPQPVENSEYYIVNIENIINGKIEFETCKKISPKDYKKLVSNKCNPKYRDVLFTKDGTV 295 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLK 358 +L + +++S + + +DS+YL + + + + K + G + + Sbjct: 296 G--ITLVFSGERNVVLLSSIAIIRPSNCLDSSYLKYSLETEQIKKQIDILIGGSVLKRIV 353 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +D+K L + +PP+ EQ + +++ ++ID ++EK + L+ + + +TG+ Sbjct: 354 LKDIKSLLIFIPPLPEQQRVASIL----SQIDEVIEKEQAYKEKLERIKKGLMEDLLTGK 409 Query: 419 IDL 421 + + Sbjct: 410 VRV 412 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 82/213 (38%), Gaps = 15/213 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPI--KRFTKLNTGRT-----SESGKDIIYIGLEDVESGT 60 ++KDS +G IP+ WK+ + + T + T + + + +E++ +G Sbjct: 202 DKFKDSP---LGRIPEEWKIRKLDHREITIMITDGSHYSPQPVENSEYYIVNIENIINGK 258 Query: 61 GKYLPKDGNSRQSDT---STVSIFAKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQ 115 ++ S + S +L+ K G + + + S+ ++ Sbjct: 259 IEFETCKKISPKDYKKLVSNKCNPKYRDVLFTKDGTVGITLVFSGERNVVLLSSIAIIRP 318 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + L+ L + + ++I+ + G+ + K I ++ + IPPL EQ + + Sbjct: 319 SNCLDSSYLKYSLETEQIKKQIDILIGGSVLKRIVLKDIKSLLIFIPPLPEQQRVASILS 378 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I+ + + + K + L++ V Sbjct: 379 QIDEVIEKEQAYKEKLERIKKGLMEDLLTGKVR 411 >gi|149203575|ref|ZP_01880544.1| Restriction endonuclease S subunits-like protein [Roseovarius sp. TM1035] gi|149142692|gb|EDM30734.1| Restriction endonuclease S subunits-like protein [Roseovarius sp. TM1035] Length = 413 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 59/401 (14%), Positives = 125/401 (31%), Gaps = 29/401 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W + + ++TG+ + + G+Y P + Q Sbjct: 4 VPQGWAQSRLADWLDISTGKLD-----------ANAATENGQY-PFFTCAEQVSRIDTFA 51 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F +L + + G + + +L ++ + I Sbjct: 52 FDCEAVLL----AGNGNFNLHKYTGKFNAYQRTYVLQPHEIDLGFTFVALKSLLPEITKD 107 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+T+ + I + P+PPL EQ I K+ + R T T +L++ + Sbjct: 108 NRGSTIKYLRLGDIADTAAPLPPLPEQRRIVRKLDTLSARSTTARTHLTAIEKLVERYRT 167 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A++ + +G +H E + + + I N L+ Sbjct: 168 AVLEAAFRTAWDAGFDTTIAG------CLEHAETGLVRSKAEQTAGEGYPYIRMNHYDLA 221 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII--TS 318 + + +E YQ + +++F + + + G + + Sbjct: 222 --GRWNDRDLTYVAATSSEFERYQ-LRANDLLFNTRNSAELVGKVAIWPEGKDGYLFNNN 278 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + W M S + + ++ + P VP EQ Sbjct: 279 LLRMRFSADVLPGFAFWQMSSPPFRRYIEGFISATTSVAAIYQRSLMAAPFWVPDTDEQR 338 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +I I A+ID L + +++ LL +A A G Sbjct: 339 EIVRRIETAFAKIDRLKAEAAKALKLLGHLDQRILAKAFAG 379 >gi|164688285|ref|ZP_02212313.1| hypothetical protein CLOBAR_01930 [Clostridium bartlettii DSM 16795] gi|164602698|gb|EDQ96163.1| hypothetical protein CLOBAR_01930 [Clostridium bartlettii DSM 16795] Length = 405 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 64/416 (15%), Positives = 130/416 (31%), Gaps = 29/416 (6%) Query: 23 KHWKVVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 WK V ++ + + +I +++ + Sbjct: 4 SDWKTVKLEEVVDILGDGLHGTPKYSDDGEYYFINGNNLDGKIIVNEKTKRVGLEQYLKY 63 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSIDVTQR 136 +L G + A D I + KDV + ++ LLS Sbjct: 64 KKDLNDRTLLVSINGTLGKVAEYGGEDIILGKSACYFNVKKDVNKKYIKYILLSDIFKHY 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I G T+ + K + P+P + EQ I + +D I + L+ Sbjct: 124 IHNYSTGTTIKNLGLKQMRKFKFPLPNIEEQEKIANIL----SSLDDKIELNNEMNKTLE 179 Query: 197 EKKQALVSYIVTKGLNPD---VKMKDSGIE----WVGLVPDHWEVKPFFALVTELNRKNT 249 E Q++ P+ K SG E +G++P WE+ + + Sbjct: 180 EMAQSIFKRWFIDFEFPNEDGQPYKSSGGEMVESELGMIPKEWEIAQIDDISQVTMGVSP 239 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESY--ETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N ++ + + +KP + E +I G++VF + Sbjct: 240 SSKTYNEDNIGLPLLNGAADFEGKLIKPSKFTSEPKKICKKGDMVFGVRATIGNIVFADK 299 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + RG+ + V+P+ + + + + +LK D+ L V Sbjct: 300 EYALGRGVAS-----VEPNDKVFREFIYYSLDNSMENLINNASGSVFLNLKKADITDLKV 354 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +I N + + + + + LLK++R + V+G+I + Sbjct: 355 CYSD-----EIVKKFNNISRVLIDKIVENDMESELLKQQRDILLPKLVSGEIRITN 405 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 76/206 (36%), Gaps = 11/206 (5%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 YK SG + +G IPK W++ I +++ G + S + +G + Sbjct: 203 YKSSGGEMVESELGMIPKEWEIAQIDDISQVTMGVSPSSKTYNEDNIGLPLLNGAADFEG 262 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 K + + I KG +++G + + + AD + ++P D + Sbjct: 263 KLIKPSKFTSEPKKICKKGDMVFG-VRATIGNIVFADKEYALGRGVASVEPNDKVFR-EF 320 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + + + + G+ + I ++ + I +K + + I Sbjct: 321 IYYSLDNSMENLINNASGSVFLNLKKADITDLKVCYSDE-----IVKKFNNISRVLIDKI 375 Query: 186 TERIRFIELLKEKKQALVSYIVTKGL 211 E ELLK+++ L+ +V+ + Sbjct: 376 VENDMESELLKQQRDILLPKLVSGEI 401 >gi|332310722|gb|EGJ23817.1| HsdS [Listeria monocytogenes str. Scott A] Length = 391 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 47/400 (11%), Positives = 113/400 (28%), Gaps = 31/400 (7%) Query: 25 WKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS--RQSDTSTV 78 W+ + + + YI + D++ + + + S D Sbjct: 9 WEQRKLGEIANSFEYGLNASSKTYDGENKYIRITDIDESSHVFNQDNLTSPDISLDNLNH 68 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +G IL + G K+ + + + L+ Sbjct: 69 YLLEEGDILLARTGASTGKSYCYNKIDGKVFFAGFLIRAKIKHEYNVSFIFQSTLTERYN 128 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I+ + + + + + IP L EQ I + ++D I R +E Sbjct: 129 NFIQVTSQRSGQPGINAQEYARFALYIPKLKEQQKIGDF----FKQLDNTIALHQRKLEK 184 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 +K K A +S + K + G + F + K + Sbjct: 185 IKALKTAYLSEMFPAEGETKPKRRFGG-------FTDDWEQRKFIEIINRLSKTSNSSIL 237 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + +++ K +S + P I++ + +G Sbjct: 238 PKVEYEDIIAEEGRLNKDISNKFDS-RKGILFQPKNILYGKLRPYLKN----WLYPDFKG 292 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK- 373 + + + ++ L++S KV + V +P Sbjct: 293 VAVGDFWVFEAIEATPRFIYNLIQSDSYQKVANDTAGTKMPRSDWTKVSNSSFFIPKESS 352 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I ++D + ++ + L+ + +++ Sbjct: 353 EQKRIGTF----FKQLDDTIALHQRKLQKLQNIKKAYLNE 388 >gi|315149121|gb|EFT93137.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0012] Length = 415 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 60/404 (14%), Positives = 133/404 (32%), Gaps = 24/404 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W++ +K T+ G ++ D+ + + + + GN + ++ Sbjct: 18 EDWELCKLKEITERVKG--NDGRMDLPTLTISAGQGWLNQKDRFSGNIAGKEQKNYTLLL 75 Query: 83 KGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 K ++ Y KL Y + ++ + + + E Sbjct: 76 KNELSYNHGNSKLAKYGAVFSLKTYEEALVPRVYHSFKSTKNSDPDFLEYIFATKKPDKE 135 Query: 139 ------AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + ++ NI + IP + EQ I + +ID IT R + Sbjct: 136 LGKLVSSGARMDGLLNINYDDFSNIKINIPHVHEQKKISNLL----RKIDNTITLHQRKL 191 Query: 193 ELLKEKKQALVSYIV---TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 E LKE K+A + + N K++ + E ++ + + N Sbjct: 192 EQLKELKKAYLQLMFVPTNTKNNKVPKLRFANFEGNWEQCKLIDLATTYIGLVTTMTTNY 251 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + ++ S + + LK E + + + + S + Sbjct: 252 TDQGTLLIRNSDIKEGKFDLNNPIYLKEEFAKQNENRSMKMGDVVTVHTGDIGTSAVITE 311 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 ++ I + ++S YL W S K M +G R + +D + ++ Sbjct: 312 DLDGTIGFATITTRPSKKLNSNYLCWYFNSNIHKKYAKRMSTGDGRSNYNMKDFNKNILV 371 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P I+EQ I +D + + + LK + S++ Sbjct: 372 IPKIEEQQTIGIF----FQNLDNTITLHQNKLDQLKSLKKSYLQ 411 >gi|323223292|gb|EGA07629.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. MB102109-0047] Length = 582 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 66/494 (13%), Positives = 127/494 (25%), Gaps = 99/494 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLE 54 +K K P+ S + +P W+ V G+T KD I ++ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPVGWEWVTFSHLGHFFGGKTPSKMKDEYWGGTIPWVTPK 140 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQF 111 D+++ + ++ + G IL+ LR I + + Sbjct: 141 DMKTNLIVDSEDKVTPLAIE-DGLTKVSPGSILFVARSGILRRIFPVAITSIECTVNQDL 199 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQV-- 168 VL P +++ +E + G T+ + + P IPP AEQ Sbjct: 200 KVLSPFLSEISYYIRLMMNGFERYIVENLTKTGTTVESLLFDDFISHPFMIPPFAEQNRI 259 Query: 169 ---------------------------------------LIREKIIAETVRIDTLITERI 189 + + RI Sbjct: 260 LSTVKKLMSLCDQLEQHSLTSLDAHQQLVETLLTTLTDSQNADALAENWARISEHFDTLF 319 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------------ 219 + KQ ++ V L P + Sbjct: 320 TTEASIDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKDGKIKKQKPLPP 379 Query: 220 -SGIEWVGLVPDHWEVKP-------FFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 S E VP+ WE + + I ++ G+I + Sbjct: 380 ISDKEKPFEVPEGWEWCKFGLISEFINGDRGSNYPNKNEYVVHGIPWINTGHIEKNGTLS 439 Query: 272 NMGLKPESYETYQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + + + G++V+ K + I +S + Sbjct: 440 ITDMNFITEKKFNELRSGKIQSGDLVYCLRGATFGKTAFVKPYESG-AIASSLMIIRPFI 498 Query: 327 GIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 Y+ + S + + +L V PP++EQF I I Sbjct: 499 REMGEYIYNYLISPFGRSQIFRFDNGSAQPNLSANSVMLYAFACPPLQEQFRIHKKITEL 558 Query: 386 TARIDVLVEKIEQS 399 D L + + + Sbjct: 559 FHICDNLKLQTQSA 572 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 62/196 (31%), Gaps = 10/196 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P WE F L K ++ + + K N+ + E Sbjct: 93 SEEEKPFELPVGWEWVTFSHLGHFFGGKTPSKMKDEYWGGTIPWVTPKDMKTNLIVDSED 152 Query: 280 -------YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + V PG I+F +R A + + P + +Y Sbjct: 153 KVTPLAIEDGLTKVSPGSILFVARSGIL-RRIFPVAITSIECTVNQDLKVLSPFLSEISY 211 Query: 333 LAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 LM + + +SL F+D P ++PP EQ I + + + D Sbjct: 212 YIRLMMNGFERYIVENLTKTGTTVESLLFDDFISHPFMIPPFAEQNRILSTVKKLMSLCD 271 Query: 391 VLVEKIEQSIVLLKER 406 L + S+ ++ Sbjct: 272 QLEQHSLTSLDAHQQL 287 >gi|119477798|ref|ZP_01617921.1| putative specificity protein s [marine gamma proteobacterium HTCC2143] gi|119448959|gb|EAW30200.1| putative specificity protein s [marine gamma proteobacterium HTCC2143] Length = 444 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 68/431 (15%), Positives = 139/431 (32%), Gaps = 44/431 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSE------------SGKDIIYIGLEDVESGTGKYLPKDGNS 70 K W I G+ S + + D+++GT + Sbjct: 2 KGWIKKNIGELCDSGGGKVKTGPFGAQLHQSDYSYQGTPVVMPTDIKNGTI--AQERIAR 59 Query: 71 RQSDTSTV---SIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQF--LVLQPKDVLPEL 123 + +KG I+YG+ G R+A++ + +C T + L +V+PE Sbjct: 60 VSDSHVSRLAMHQLSKGDIVYGRRGDIGRQALVKEAESGWLCGTGCLRITLGESEVIPEY 119 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRID 182 L +L ++ I+ GATM + + + +P+ P A Q I A I+ Sbjct: 120 LHLYLKMPEIIGWIQNQAIGATMPNLNTSILRRVPIHFPSSKATQRNIVSLSFAYDDLIE 179 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 + + +E + G K +WV + Sbjct: 180 NIKRRINILESMGEEIYREWFVRFRFPGHKAVEFKKGVPKDWVVGRASLFFEHVKGRSYK 239 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 +T ++L N + + L + + Q+V G++V D+ ++ Sbjct: 240 SEEISDTDDESMPFVTLKSFNRGGGYRSDGLKLYSGKFSSSQVVHEGDVVMAVTDMTQNR 299 Query: 303 RSLRSAQVMER-----GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 + + +I+ + + P + +TYL ++ + +G Sbjct: 300 EVVGRVARVPEMGRRGAVISLDVIKLVPKSVSATYLYSYIKYSGFSHFIKSFANGA---- 355 Query: 358 KFEDVKRLPVLVPPIKEQFDIT-------NVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +V L P + Q I I V + + I L+ R S Sbjct: 356 ---NVLHLK---PDLVTQQVIVVPTQGLREKFEAIVDPIHEQVGLLSKEIDNLEATRDSL 409 Query: 411 IAAAVTGQIDL 421 + ++G++ + Sbjct: 410 LPRLISGKLSV 420 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 62/200 (31%), Gaps = 17/200 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +PK W V F + GR+ +S + + ++ L+ G G Y Sbjct: 217 VPKDWVVGRASLFFEHVKGRSYKSEEISDTDDESMPFVTLKSFNRGGG-YRSDGLKLYSG 275 Query: 74 DTSTVSIFAKGQILYGKL-----GPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELL 124 S+ + +G ++ + + G + S + L PK V L Sbjct: 276 KFSSSQVVHEGDVVMAVTDMTQNREVVGRVARVPEMGRRGAVISLDVIKLVPKSVSATYL 335 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 ++ + I++ GA + H + + +P + + ++ L Sbjct: 336 YSYIKYSGFSHFIKSFANGANVLHLKPDLVTQQVIVVPTQGLREKFEAIVDPIHEQVGLL 395 Query: 185 ITERIRFIELLKEKKQALVS 204 E L+S Sbjct: 396 SKEIDNLEATRDSLLPRLIS 415 >gi|189423911|ref|YP_001951088.1| restriction endonuclease S subunits-like protein [Geobacter lovleyi SZ] gi|189420170|gb|ACD94568.1| restriction endonuclease S subunits-like protein [Geobacter lovleyi SZ] Length = 386 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 71/394 (18%), Positives = 135/394 (34%), Gaps = 28/394 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ V +L+ R+S+ YIGLE ++ + + + + S+F Sbjct: 9 GWQKVKFGDVVRLSKERSSDPLADGYERYIGLEHIDPEDLRV--RRWGNVADGVTFTSVF 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWLLSIDVTQRIE 138 GQ+L+GK Y RK +ADF G+CS V PK +LPELL + Q Sbjct: 67 KPGQVLFGKRRAYQRKVAVADFAGVCSGDIYVLESKDPKKLLPELLPFICQTEAFFQHAV 126 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ +W + + +PPL EQ I E ++A ID L++ R L K Sbjct: 127 GTSAGSLSPRTNWTSLADFEFALPPLEEQRRIVELLLAVEETIDNLVSARSSAQLLFKA- 185 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 AL+ +S E + +E R + + Sbjct: 186 --ALLESF------------NSLPENNKKKIADCYEIQLGKMSSEKARFGSNQKTYIKNN 231 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 E M Y+ + G+++ + Sbjct: 232 NVLWGKFDFGELPQMSFDEREITKYE-LRKGDLLVCEGGEIGRAAIWQDEIPGMLYQKAL 290 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFD 377 + + ++ +R + + +G + L E + +L + P Q Sbjct: 291 HRLRPRTSDDIPEFMFHYLRYCAERGILDGVATGTTIRHLPVEQLSQLALPFPKRAVQEQ 350 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + +++ ++I+ ++ I + +S+ + Sbjct: 351 VASLL----SKIESGNSMLDAKICHSRSLKSAVL 380 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 56/168 (33%), Gaps = 10/168 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P++ I ++ G+ S YI +V G + S Sbjct: 194 LPEN-NKKKIADCYEIQLGKMSSEKARFGSNQKTYIKNNNVLWGKFDFGELPQMSFDERE 252 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADF-DGICST---QFLVLQPKDVLPELLQGWLLSI 131 T KG +L + G R AI D G+ L + D +PE + +L Sbjct: 253 ITKYELRKGDLLVCEGGEIGRAAIWQDEIPGMLYQKALHRLRPRTSDDIPEFMFHYLRYC 312 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 ++ + G T+ H + + + +P P A Q + + Sbjct: 313 AERGILDGVATGTTIRHLPVEQLSQLALPFPKRAVQEQVASLLSKIES 360 >gi|254932531|ref|ZP_05265890.1| HsdS [Listeria monocytogenes HPB2262] gi|293584086|gb|EFF96118.1| HsdS [Listeria monocytogenes HPB2262] Length = 404 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 47/400 (11%), Positives = 113/400 (28%), Gaps = 31/400 (7%) Query: 25 WKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS--RQSDTSTV 78 W+ + + + YI + D++ + + + S D Sbjct: 22 WEQRKLGEIANSFEYGLNASSKTYDGENKYIRITDIDESSHVFNQDNLTSPDISLDNLNH 81 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +G IL + G K+ + + + L+ Sbjct: 82 YLLEEGDILLARTGASTGKSYCYNKIDGKVFFAGFLIRAKIKHEYNVSFIFQSTLTERYN 141 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I+ + + + + + IP L EQ I + ++D I R +E Sbjct: 142 NFIQVTSQRSGQPGINAQEYARFALYIPKLKEQQKIGDF----FKQLDNTIALHQRKLEK 197 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 +K K A +S + K + G + F + K + Sbjct: 198 IKALKTAYLSEMFPAEGETKPKRRFGG-------FTDDWEQRKFIEIINRLSKTSNSSIL 250 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + +++ K +S + P I++ + +G Sbjct: 251 PKVEYEDIIAEEGRLNKDISNKFDS-RKGILFQPKNILYGKLRPYLKN----WLYPDFKG 305 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK- 373 + + + ++ L++S KV + V +P Sbjct: 306 VAVGDFWVFEAIEATPRFIYNLIQSDSYQKVANDTAGTKMPRSDWTKVSNSSFFIPKESS 365 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I ++D + ++ + L+ + +++ Sbjct: 366 EQKRIGTF----FKQLDDTIALHQRKLQKLQNIKKAYLNE 401 >gi|317180610|dbj|BAJ58396.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F32] Length = 401 Score = 103 bits (256), Expect = 7e-20, Method: Composition-based stats. Identities = 58/391 (14%), Positives = 122/391 (31%), Gaps = 24/391 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + G++ K + + + + G + +R + Sbjct: 13 PKGVEFRKLGEVCDFQKGKSITK-KAVTFGKVPVISGGRQPAYYHNEANRSGE------- 64 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G Y D + F V PK + I A Sbjct: 65 ---TIAISSSGVYAGYVSYWDIPVFLADSFSVS-PKQKTLMPKYLFHYLTTQQDAIHATK 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + H K + N +PIPPL Q I + + A T L TE + + + Sbjct: 121 STGGIPHVYSKDLQNFLIPIPPLEIQQEIVKILDAFTELNTELNTELKARKKQYQYYQNM 180 Query: 202 LV------SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 L+ ++ K L P E + + N+K K+ E + Sbjct: 181 LLDFKDTNQNHQDAKMSAKPYPKRLKTLLQTLAPKGVEFRKLGEVCESTNKKTLKISEVS 240 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + G + + GE + + + G Sbjct: 241 EVKNKRMYPVINSGRDLYGYYHDFN------NDGENITIASRGEYAGFINYFNEKFFAGG 294 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + Y + + + +L + +++ ++ + + G +L D++ L + +PP++ Q Sbjct: 295 LCYPYKVKDTNELLTKFLYFYLKTNEIQIMENLVSRGSIPALNKADIETLTIPIPPLEIQ 354 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKER 406 +I +++ + L+ I I K++ Sbjct: 355 QEIVKILDQFSILTTDLLAGIPAEIEARKKQ 385 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 40/113 (35%), Gaps = 8/113 (7%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I S + + S ++ K + YL + + + +G Sbjct: 68 ISSSGVYAGYVSYWDIPVFLADSFSVSPKQKTLMPKYLFHYLTTQQ-DAIHATKSTGGIP 126 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +D++ + +PP++ Q +I +++ T E + LK R+ Sbjct: 127 HVYSKDLQNFLIPIPPLEIQQEIVKILDAFT-------ELNTELNTELKARKK 172 >gi|226947935|ref|YP_002803026.1| restriction modification system DNA specificity domain protein [Clostridium botulinum A2 str. Kyoto] gi|226842884|gb|ACO85550.1| restriction modification system DNA specificity domain protein [Clostridium botulinum A2 str. Kyoto] Length = 395 Score = 103 bits (256), Expect = 7e-20, Method: Composition-based stats. Identities = 54/405 (13%), Positives = 133/405 (32%), Gaps = 28/405 (6%) Query: 26 KVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQ--SDTSTVSI 80 K + + G K + ++++ + + S + + + S Sbjct: 4 KKIKCSEIIDVRDGTHDSPRYQSKGYPLVTSKNIKGNKIDFNNVNFISEEDYNKINMRSA 63 Query: 81 FAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRI 137 G IL +G ++ I + L + + +LL+ D+ ++ Sbjct: 64 VHNGDILMPMIGTIGNPVLVNTNKKFAIKNVALFKLSNNNKVDSKYFYYLLTSDIVKNQL 123 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E G T + I ++ +P+ P+ +Q+ I + ID + EL+K Sbjct: 124 ENRKRGGTQNFVSLSNIRSLEIPLVPIEKQIFISNILDKAKSLIDKRKAQIEDLDELVKS 183 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + LNP +W + +++ + L Sbjct: 184 R---FIEMFGDTKLNPF--------KWEVYRLEEIYYIIDGDRGKNYPKQDEFFERNYCL 232 Query: 258 SLSYGNIIQKLETRNMG----LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 L+ GN+ K + + + + ++V + + Sbjct: 233 FLNAGNVTSKGFCFDKSSFIAKEKDEILRKGKLQREDLVVTTRGTVGNIAYYNDNVPYDN 292 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 I S + ++ + L ++ + + + + ++K + PPI+ Sbjct: 293 IRINSGMVILRKRKEINP-LYFISYFSNKLVYQSLISGTAQPQMPISNMKNANIYYPPIQ 351 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q + +N ++D L ++E+S+ L++ +S + A G+ Sbjct: 352 LQNEFAGFVN----QVDKLKFEMEKSLKELEDNFNSLMQKAFKGE 392 >gi|307824352|ref|ZP_07654578.1| restriction modification system DNA specificity domain protein [Methylobacter tundripaludum SV96] gi|307734732|gb|EFO05583.1| restriction modification system DNA specificity domain protein [Methylobacter tundripaludum SV96] Length = 615 Score = 103 bits (256), Expect = 7e-20, Method: Composition-based stats. Identities = 60/480 (12%), Positives = 132/480 (27%), Gaps = 97/480 (20%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE---------SGKDIIYIGLEDVESGTGKYLPKDGNS 70 +PK W+ V + + G+T SG ++ + D+ + + Sbjct: 130 ELPKGWEWVHLPDVSDYKVGKTPSTKSSVYWTNSGDGFNWVSIADLNHDDSVFETNKQIT 189 Query: 71 RQSDTSTVS--IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 ++ + G IL L K I D + + + P + + Sbjct: 190 DKAVSEVFRSDPAPAGTILMS-FKLTLGKISILDKPAFHNEAIISIYPNQSVFKDF--LF 246 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR-------- 180 + + + + I + +P+PP+AEQ I K+ Sbjct: 247 KVLPARAMAGNSKSAIKGNTLNSESIAALMIPLPPMAEQQRIVAKVDELMALCDQLETQH 306 Query: 181 ---------------------------------IDTLITERIRFIELLKEKKQALVSYIV 207 I + KQ L+ V Sbjct: 307 SNAAEAHEKLVSHLLGTLTQSQNADDFSANWQRIAAYFDILFTTETSIDALKQTLLQLAV 366 Query: 208 TKGLNPDVKMKDSGIEWVGLV-------------------------------PDHWEVKP 236 L P + E + + P+ WE Sbjct: 367 MGKLVPQDPNDEPAGELLKRIQTEKAKLIAEGKIKKDKQLPPITDDEKPFGLPEGWEWIK 426 Query: 237 FFALVTELNRKNTKLIES----NILSLSYGNIIQK------LETRNMGLKPESYETYQIV 286 + + + + ++ GN+ + R++ + + + Sbjct: 427 VSEVAELITSGSRDWAQYLSNEGAKFVTMGNLSRGSYELRLGNMRHVNPPKDGEGSRTKL 486 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 + +++ + + + I + + Y +MRS F Sbjct: 487 EANDLLISITGDVGNLGRI-PEDFGDAYINQHTCLLRFVSQCRNRYFPEVMRSPMAAMQF 545 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 A G++ S + D+ + + +PP+ EQ I ++ A D L +I ++ L ++ Sbjct: 546 NAPQRGIKNSFRLGDLDEMVIPLPPLAEQHRIVAKVDELMALCDQLKTRITEANQLQQKL 605 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 59/196 (30%), Gaps = 11/196 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL--------SLSYGNIIQKLETR 271 S E +P WE + K S +S ++ Sbjct: 123 SVEEKPFELPKGWEWVHLPDVSDYKVGKTPSTKSSVYWTNSGDGFNWVSIADLNHDDSVF 182 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + ++ I + + + + + A +++ P+ S Sbjct: 183 ETNKQITDKAVSEVFRSDPAPAGTILMSFKLTLGKISILDKPAFHNEAIISIYPNQ--SV 240 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + +L + + S ++ + L E + L + +PP+ EQ I ++ A D Sbjct: 241 FKDFLFKVLPARAMAGNSKSAIKGNTLNSESIAALMIPLPPMAEQQRIVAKVDELMALCD 300 Query: 391 VLVEKIEQSIVLLKER 406 L + + ++ Sbjct: 301 QLETQHSNAAEAHEKL 316 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 66/197 (33%), Gaps = 10/197 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKD---GNSRQ 72 +P+ W+ + + +L T + + S + ++ + ++ G+ + + N + Sbjct: 418 LPEGWEWIKVSEVAELITSGSRDWAQYLSNEGAKFVTMGNLSRGSYELRLGNMRHVNPPK 477 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLLS 130 + + +L G I + G + +L+ ++ Sbjct: 478 DGEGSRTKLEANDLLISITGDVGNLGRIPEDFGDAYINQHTCLLRFVSQCRNRYFPEVMR 537 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + + + + +P+PPLAEQ I K+ D L T Sbjct: 538 SPMAAMQFNAPQRGIKNSFRLGDLDEMVIPLPPLAEQHRIVAKVDELMALCDQLKTRITE 597 Query: 191 FIELLKEKKQALVSYIV 207 +L ++ +V + Sbjct: 598 ANQLQQKLADVVVERAI 614 >gi|298375957|ref|ZP_06985913.1| type I restriction-modification system, S subunit [Bacteroides sp. 3_1_19] gi|298266994|gb|EFI08651.1| type I restriction-modification system, S subunit [Bacteroides sp. 3_1_19] Length = 426 Score = 103 bits (256), Expect = 7e-20, Method: Composition-based stats. Identities = 56/411 (13%), Positives = 125/411 (30%), Gaps = 29/411 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLP--KDGNSRQSD 74 + WK PI ++ G T S D I +I D+ + + Sbjct: 23 EGWKRTPILEICEIIGGGTPSSSNDVYWNGDIPWISSSDINENNISEITPTRHITKDAIK 82 Query: 75 TSTVSIFAKGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + I + ++G + K + D S F L + L L +I Sbjct: 83 HSATKLCKAPSIHIVSRVG--VGKVAFSRVDICTSQDFTNLCNINCNYIFLSYLLSTIMK 140 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + E +G ++ I N+ +P+P + EQ I + + + I+ IE Sbjct: 141 QKVQE--TQGTSIKGIASAEIKNLHVPLPEIEEQQRIADCLSSLDDL----ISAVADKIE 194 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L+E K+ L+ + ++ + G K ++T K ++++E Sbjct: 195 TLEEYKKGLMQQLFPAEGKTTPDIRFPEFQNEGKWILLPIKKCNIDILTGYAFKGSEILE 254 Query: 254 SN--------ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 N I R + + Y+++ ++ +L Sbjct: 255 DNNGTPLMRGINITEGVVRHNNDIDRFYSREDHTLSKYRLLCNDLVIAMDGSKVGRNFAL 314 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKR 364 + Q ++ + ++ + S K S + + ++ Sbjct: 315 INKQDEGSLLVQRVARLRADNIDFIMFIYQQIGSDRFKKYIDRINTSSGIPHISLKQIED 374 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + ++ + +D L+ + LK + + Sbjct: 375 FKIWTTRND--KEF-RMVTNCLSSVDELISTETAKLDQLKNHKKGLMQQLF 422 >gi|29294587|ref|NP_808857.1| HsdS protein [Lactococcus lactis subsp. lactis bv. diacetylactis] gi|29170399|emb|CAD79462.2| HsdS protein [Lactococcus lactis subsp. lactis bv. diacetylactis] Length = 405 Score = 103 bits (256), Expect = 7e-20, Method: Composition-based stats. Identities = 63/415 (15%), Positives = 128/415 (30%), Gaps = 44/415 (10%) Query: 20 AIPK--------HWKVVPIKRFTKL-NTGRT---SESGKDIIYIGLEDVESGTGKYLPKD 67 +P+ W+ +K + G+ + D+ Y+ + G Sbjct: 11 KVPELRFKGFTDEWEERKLKDVVEKQIKGKAQFEKLAQGDVEYLDTSRLNGGQALLTN-- 68 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + + IL G + ST L+ + Sbjct: 69 ---------GLKDVSLDDILILWDGSKAGTVYHGFEGALGST----LKAYRTSANSKFVY 115 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I + H + + IP EQ I ++D I Sbjct: 116 QYLKRHQDNIYNNYRTPNIPHVQKDFLNVFTISIPGSDEQAKIGSF----FKKLDDTIAL 171 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN-- 245 R ++LLKE+K+ + + K +++ +G + V Sbjct: 172 HQRKLDLLKEQKKGYLQKMFPKNGAKVPELRFAGFADDWEERKFESLLDKNEGVRRGPFG 231 Query: 246 ---RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 +K+ + ES + N I + E +E + F Sbjct: 232 SALKKDLFVKESPYVVYEQQNAIYDHYETRYNISKEKFEELHKFELIADDFIMSGAGTIG 291 Query: 303 RSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSG-LRQSL-K 358 R + + +++G+ A + + + DS Y +R+ + + SG +L Sbjct: 292 RISKVPKGIKKGVFNQALIRFRINKELTDSEYFLQFIRADFMQRKLTGANSGSAITNLVP 351 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 DVK+ + VP +EQ I + ++D + ++ + LLKE++ F+ Sbjct: 352 MSDVKKWEIKVPIKEEQQRIGSF----FKQLDDTIALHQRKLDLLKEQKKGFLQK 402 >gi|196037267|ref|ZP_03104578.1| type I restriction-modification system specificity determinant [Bacillus cereus NVH0597-99] gi|196031509|gb|EDX70105.1| type I restriction-modification system specificity determinant [Bacillus cereus NVH0597-99] Length = 424 Score = 103 bits (256), Expect = 7e-20, Method: Composition-based stats. Identities = 54/418 (12%), Positives = 126/418 (30%), Gaps = 28/418 (6%) Query: 26 KVVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVS 79 + P+ G + + I + + ++ + + Sbjct: 13 EWKPLGDIGAFINGSGMPKSMFDENGQVGAIHYGHIYTKYQNFVYEPIVKISEKNAEKLK 72 Query: 80 IFAKGQILYGKLGPYLRKA-----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 KG ++ K L + D + + + + L + S ++ Sbjct: 73 KVQKGDLVIAKTSENLDDVMKTVAYLGDEEVVAGGHSAIFKHNQNPKYLTYIFNGSSNLI 132 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G + K + I +PIPPL Q I E I T ++ L E + Sbjct: 133 MQKNRLARGTKVIELSAKHMEKIRIPIPPLEIQEKIVEIIDGFTRYVNGLTAELTAELTA 192 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKLIE 253 K++ + L+ + K S G D + T + Sbjct: 193 RKKQYAYYRDML----LSEEYLNKLSETLGNEGETNDKVIWTTLGEVAKFKYGFTTTAKD 248 Query: 254 S-NILSLSYGNIIQKLETRNMG---LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 N L +I + + + + + +V +++ K S + Sbjct: 249 IGNYRFLRITDITENGILKTENAKFVNDDEVDEDYLVGKDDVLMARTGATYGKTLYISEK 308 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVL 368 + + + S Y +S D K + G + +K++ + Sbjct: 309 INAVYASFLIKIDTDKEKLSSRYYWHFAQSGDYWKQADFLAKGGGQPQFNANVLKKVKLP 368 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAAAV-TGQIDL 421 +P + Q + +++++ + + + + I L K+ R + A G+ D+ Sbjct: 369 IPSLAIQAHVVSILDIFDKLTSDITQGLLKEIELRKKQYVYYREKLL--AFECGEKDV 424 >gi|114319661|ref|YP_741344.1| restriction modification system DNA specificity subunit [Alkalilimnicola ehrlichii MLHE-1] gi|114226055|gb|ABI55854.1| restriction modification system DNA specificity domain protein [Alkalilimnicola ehrlichii MLHE-1] Length = 413 Score = 103 bits (256), Expect = 7e-20, Method: Composition-based stats. Identities = 60/419 (14%), Positives = 147/419 (35%), Gaps = 40/419 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W V + +NT + + + L ++ G + K + + + Sbjct: 5 SWPDVSLGNIFTINTSAVIPNAAPNTEFYHHSLPAWDATGGPTVEKGSSIESNKVN---- 60 Query: 81 FAKGQILYGKLGPYLRKAIIADFDG-----ICSTQFLVLQPKDVLP-ELLQGWLLSIDVT 134 K +L KL P + + + G ST+F+ L+PK + Sbjct: 61 ITKPCVLVSKLNPRKPRVSVLESVGKDERHCASTEFVCLEPKAKEHLRFWGHLFSNKRFA 120 Query: 135 QRIEAICEGATMSH--ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++ + G+T SH + ++ + +P E+ LI + + I + I Sbjct: 121 GHLDRMAIGSTNSHKRFSPGVLLSLRIELPSEPERRLIARILDTLDTQ----IQKTEALI 176 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIE--------WVGLVPDHWEVKPFFALVTEL 244 L++ K+ L+ ++T+G++ + +++ S + +GL+P W + + Sbjct: 177 AKLEKVKEGLLHDLLTRGIDDNGQLRPSPEQAPELYKESPLGLIPREWNAVRLYEMAENH 236 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + + L +S +Y + + + GE V Sbjct: 237 DGQRIPLKKSERKHGTYPYYGASGIIDWVEGYLFEGSYVLLGEDGENVVSRNLP------ 290 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 L + A++ D+ +L ++ D + + + ++ Sbjct: 291 LAFPVTGRFWVNNHAHIYSPKDDCDTRFLVEVLEQKDYSRWVN---GSAQPKITQASLRM 347 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + PP EQ I+N + I+ +++ + I ++ +++ + +TG++ + Sbjct: 348 MWFCKPPTAEQKAISNSLEA----INQQIDEEKIKIAKVRTQKAGVMDDLLTGRVRVTP 402 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 61/204 (29%), Gaps = 21/204 (10%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 YK+S +G IP+ W V + + + G+ K E G Y + Sbjct: 212 YKESP---LGLIPREWNAVRLYEMAENHDGQRIPLKKS---------ERKHGTYPYYGAS 259 Query: 70 SRQSDTSTVSIFAKGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 +F +L G+ G L A + + PK + Sbjct: 260 GIIDWVEGY-LFEGSYVLLGEDGENVVSRNLPLAFPVTGRFWVNNHAHIYSPK---DDCD 315 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +L+ + + G+ + + PP AEQ I + A +ID Sbjct: 316 TRFLVEVLEQKDYSRWVNGSAQPKITQASLRMMWFCKPPTAEQKAISNSLEAINQQIDEE 375 Query: 185 ITERIRFIELLKEKKQALVSYIVT 208 + + L++ V Sbjct: 376 KIKIAKVRTQKAGVMDDLLTGRVR 399 >gi|317010095|gb|ADU80675.1| type I R-M system specificity subunit [Helicobacter pylori India7] Length = 350 Score = 102 bits (255), Expect = 8e-20, Method: Composition-based stats. Identities = 53/375 (14%), Positives = 115/375 (30%), Gaps = 37/375 (9%) Query: 47 DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI 106 +I + + + ++ K + S KG IL G R I Sbjct: 10 EIPFYKIGTFGNTADAFISKKLFL--EYQTKYSFPKKGDILISASGTIGRAVIYDGKPAY 67 Query: 107 CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 +V E L ++ E T+ N +P+PPL E Sbjct: 68 FQDSNIVWI---DNDETLVKNDFLFYAYSNVKWNTEHTTILRLYNDNFRNTLIPLPPLNE 124 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q+ I + + +L ++ + K L+S + + Sbjct: 125 QIAIANILSGLDHYLYSLRALILKKESVKKALSFELLSQ----------------RKRLK 168 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +W+ + + + I N K N G+ Y V Sbjct: 169 GFNQNWQRVRLGDICEIVKGQQINKISL--------NNTDKYPVINGGIDFLGYTNKFNV 220 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 I R ++S + + + +++ L +++SY+ + Sbjct: 221 SKNTIAISEGGTCGYVRFMKSDFWSGGHNYS---LQKISNKVNNLCLYHILKSYE-KDIM 276 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 ++++ + +K +L+PP+ EQ I ++++ I L K Q + Sbjct: 277 KLGVGSGLKNIQLKALKDFEILLPPLNEQSAIADILSALDKEIANLKNKKRQ----FENI 332 Query: 407 RSSFIAAAVTGQIDL 421 + + ++ +I + Sbjct: 333 KKALNHDLMSAKIRV 347 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 63/182 (34%), Gaps = 11/182 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 ++W+ V + ++ G+ + T KY +G + + Sbjct: 172 QNWQRVRLGDICEIVKGQQINKIS----------LNNTDKYPVINGGIDFLGYTNKFNVS 221 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I + G + + + + + + L + + + I + Sbjct: 222 KNTIAISEGGTCGYVRFMKSDFWSGGHNYSLQKISNKVNNLCL-YHILKSYEKDIMKLGV 280 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+ + + K + + + +PPL EQ I + + A I L ++ +F + K L Sbjct: 281 GSGLKNIQLKALKDFEILLPPLNEQSAIADILSALDKEIANLKNKKRQFENIKKALNHDL 340 Query: 203 VS 204 +S Sbjct: 341 MS 342 >gi|282865356|ref|ZP_06274408.1| hypothetical protein SACTEDRAFT_4953 [Streptomyces sp. ACTE] gi|282559829|gb|EFB65379.1| hypothetical protein SACTEDRAFT_4953 [Streptomyces sp. ACTE] Length = 107 Score = 102 bits (255), Expect = 8e-20, Method: Composition-based stats. Identities = 32/98 (32%), Positives = 56/98 (57%), Gaps = 2/98 (2%) Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D+ Y +++ + + +G+ + V++ + PP+ EQ + ++ ETA Sbjct: 10 DAGYFRYVISTDAFYDYLEPLFTGVSVPHVSEWQVRKFKMPFPPLDEQRCMARHLDAETA 69 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +ID L+ + E+ I L +ERRS+ I AAVTGQID+ GE+ Sbjct: 70 KIDTLIAESERFIELARERRSALITAAVTGQIDV-GEA 106 >gi|257424552|ref|ZP_05600981.1| type I restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus 55/2053] gi|257427218|ref|ZP_05603620.1| Sau1hsdS1 [Staphylococcus aureus subsp. aureus 65-1322] gi|257429854|ref|ZP_05606241.1| restriction modification system DNA specificity subunit [Staphylococcus aureus subsp. aureus 68-397] gi|257432558|ref|ZP_05608921.1| restriction and modification system specificity protein [Staphylococcus aureus subsp. aureus E1410] gi|257435462|ref|ZP_05611513.1| restriction modification system specificity subunit [Staphylococcus aureus subsp. aureus M876] gi|282913268|ref|ZP_06321060.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus M899] gi|282922896|ref|ZP_06330586.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus C101] gi|293509256|ref|ZP_06667973.1| hypothetical protein SAZG_02421 [Staphylococcus aureus subsp. aureus M809] gi|293550523|ref|ZP_06673195.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus M1015] gi|257273570|gb|EEV05672.1| type I restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus 55/2053] gi|257276849|gb|EEV08300.1| Sau1hsdS1 [Staphylococcus aureus subsp. aureus 65-1322] gi|257280335|gb|EEV10922.1| restriction modification system DNA specificity subunit [Staphylococcus aureus subsp. aureus 68-397] gi|257283437|gb|EEV13569.1| restriction and modification system specificity protein [Staphylococcus aureus subsp. aureus E1410] gi|257286058|gb|EEV16174.1| restriction modification system specificity subunit [Staphylococcus aureus subsp. aureus M876] gi|282315117|gb|EFB45503.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus C101] gi|282323368|gb|EFB53687.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus M899] gi|290919570|gb|EFD96646.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus M1015] gi|291467895|gb|EFF10404.1| hypothetical protein SAZG_02421 [Staphylococcus aureus subsp. aureus M809] Length = 410 Score = 102 bits (255), Expect = 8e-20, Method: Composition-based stats. Identities = 61/407 (14%), Positives = 139/407 (34%), Gaps = 36/407 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESG---TGKYLPKDGNSRQSDTST 77 W+ + + G G + +DV + L N + Sbjct: 20 EWEEKKVGELLEFKNGLNKGKEYFGSGSSIVNFKDVFNNRSLNTNNLTGKVNVNSKELKN 79 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S KG + + + + + + + S L +PK + + + + Sbjct: 80 YS-VEKGDVFFTRTSEVIGEIGYPSVILNDPENTVFSGFVLRGRPKSGIDLINNNFKRYV 138 Query: 132 DVTQRIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 T + ++M+ I + KI ++D I + Sbjct: 139 FFTNSFRKEMITKSSMTTRALTSGSAINKMKVIYPVSAKEQRKIGDFFNKLDRQIELEEQ 198 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 +ELL+++K+ + I ++ L + HWE + E N ++ Sbjct: 199 KLELLQQQKKGYMQKIFSQEL--------RFKDENSEDYPHWENSKIEKYLKERNERSD- 249 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + II+ E + Y++V +I + + + + Sbjct: 250 -KGQMLSVTINSGIIKFSELDRKDNSSKDKSNYKVVRKNDIAYNSMRMWQGASG----RS 304 Query: 311 MERGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLP 366 GI++ AY + P S+ + +++ + F GL +LK++ +K + Sbjct: 305 NYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTSDTWNLKYKQLKNIN 364 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +P ++EQ I + ++D+L+ K + I +L++ + SF+ Sbjct: 365 IDIPVLEEQEKIGDF----FKKMDILISKQKIKIEILEKEKQSFLQK 407 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 67/184 (36%), Gaps = 9/184 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFA 82 HW+ I+++ K R+ + + + + SG K+ D ++ D S + Sbjct: 231 HWENSKIEKYLKERNERSDKGQM----LSVT-INSGIIKFSELDRKDNSSKDKSNYKVVR 285 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I Y + + + ++++GI S + VL P L G+ I Sbjct: 286 KNDIAYNSMRMWQGASGRSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINS 345 Query: 143 G---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + +K + NI + IP L EQ I + + I + + + Sbjct: 346 QGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKIKIEILEKEKQSFL 405 Query: 200 QALV 203 Q + Sbjct: 406 QKMF 409 >gi|167041820|gb|ABZ06561.1| putative Type I restriction modification DNA specificity domain protein [uncultured marine microorganism HF4000_097M14] Length = 425 Score = 102 bits (255), Expect = 8e-20, Method: Composition-based stats. Identities = 63/424 (14%), Positives = 148/424 (34%), Gaps = 37/424 (8%) Query: 25 WKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTG---KYLPKDGNSRQSDTST 77 W + KLNT + ++++ + DV K + + + + Sbjct: 9 WIKLKFSEIGKLNTSSVDKKIQLNEQNVLLLNYMDVYRNNFISNKINFQKITATSKELES 68 Query: 78 VSIFAKGQILYGKL----GPYLRKAIIADFDGICSTQFLV-----LQPKDVLPELLQGWL 128 KG I + A+I + + K + Sbjct: 69 FK-VNKGDIFFTPSSETPDDIGHSAVIVSELINTLQSYHLVKLKLNDEKLMDLNFRGYVF 127 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITE 187 S ++ + G+T K I + P + +Q I + +D +I + Sbjct: 128 NSENILNQFRLAATGSTRFTISLKEFAKIEVYFPKSIPDQKKIASIL----TSVDDVIEK 183 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 I L++ K+ ++ ++ KG+ + KDS + V E+ +++ K Sbjct: 184 TQSKINKLQDLKKGTINKLLIKGIG-HTEFKDSELGIVPKSWKIMELSKVSKILSSNVDK 242 Query: 248 NTKLIESNILSLSYGNIIQKL-----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 TK E+++L +Y ++ + L +S ++ +++ D Sbjct: 243 KTKENETSVLLCNYMDVYKNLKITREINFMKASAKKSEIDKFLIKKDDVIITKDSETPDD 302 Query: 303 RSLRSAQVMERGIITSAY----MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSL 357 ++ S + Y + +D +L + + + F + +G R L Sbjct: 303 IAISSYVSENFDNVLCGYHLSIIRPNKSVLDGKFLNFFFKLDYMHHRFSILANGTTRFGL 362 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 ++V+ +L+P ++EQ I N+I ++ + I++ + + S + +TG Sbjct: 363 NLKEVENSKILIPELEEQKKIANIICS----LEDKILIIKKKLNKYVFIKKSLMQDLLTG 418 Query: 418 QIDL 421 ++ + Sbjct: 419 KVRV 422 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 80/214 (37%), Gaps = 17/214 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKL----NTGRTSESGKDIIYIGLEDVESGTGKYL 64 ++KDS +G +PK WK++ + + +K+ +T E+ ++ DV Sbjct: 211 EFKDSE---LGIVPKSWKIMELSKVSKILSSNVDKKTKENETSVLLCNYMDVYKNLKITR 267 Query: 65 PKDGNSRQSDTS--TVSIFAKGQILYGKLGPYLRKAIIADF------DGICSTQFLVLQP 116 + + S + K ++ K I+ + + +C +++P Sbjct: 268 EINFMKASAKKSEIDKFLIKKDDVIITKDSETPDDIAISSYVSENFDNVLCGYHLSIIRP 327 Query: 117 KDV--LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + L + + R + G T + K + N + IP L EQ I I Sbjct: 328 NKSVLDGKFLNFFFKLDYMHHRFSILANGTTRFGLNLKEVENSKILIPELEEQKKIANII 387 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + +I + + +++ + K Q L++ V Sbjct: 388 CSLEDKILIIKKKLNKYVFIKKSLMQDLLTGKVR 421 >gi|116629554|ref|YP_814726.1| restriction endonuclease S subunit [Lactobacillus gasseri ATCC 33323] gi|238854087|ref|ZP_04644436.1| restriction endonuclease S subunit [Lactobacillus gasseri 202-4] gi|116095136|gb|ABJ60288.1| Restriction endonuclease S subunit [Lactobacillus gasseri ATCC 33323] gi|238833294|gb|EEQ25582.1| restriction endonuclease S subunit [Lactobacillus gasseri 202-4] Length = 396 Score = 102 bits (255), Expect = 8e-20, Method: Composition-based stats. Identities = 55/408 (13%), Positives = 133/408 (32%), Gaps = 35/408 (8%) Query: 26 KVVPIKRFTKLNTGRTSESG------KDIIYIGLEDV------ESGTGKYLPKDGNSRQS 73 K+ + + +G + + D+ + R Sbjct: 5 KIKLLGEICEFYSGTGFPKKFQGNLEGKYPFYKVGDISKSADENKNFLTKSDNYVDERIV 64 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 T I I++ K+G L+ C VL K +L ++ Sbjct: 65 KTLKGKIVPPKTIVFAKIGEALKLNRRMITSTECLIDNNVLGIKPKNDSILAEYIFYFMK 124 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++E E T+ + I + +P + Q I + + + E Sbjct: 125 FVKLENYSESTTVPSVRKSELEKIKIRVPSIQNQQKIISILENIDKTKKSKTESLKKLNE 184 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L + + V +P++K KD ++ + + K + R +E Sbjct: 185 L-------IKARFVEMFGDPEIKNKDKSLKKLCDICLVNPDKR------KDPRLTNNDLE 231 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVM 311 + + +S + ++T N+ L E + + +++F I ++N K ++ Sbjct: 232 VSFVPMSAVSENGDIDTTNIKLYSEVRKGFTYFSSNDVLFAKITPCMENGKGAIAQNLKN 291 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWL----MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + G ++ + ++P S S+ GS ++ + + ++ V Sbjct: 292 DIGFGSTEFHVLRPLENLSNPYWLYVLTTFDSFRKVAEINMTGSAGQKRVPVKFLENYKV 351 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +PP+ Q + N + ++D +++S+ ++ S + Sbjct: 352 NIPPLSLQNEFANFV----QQVDKSKVAVQKSLDETQKLFDSLMQEYF 395 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 69/195 (35%), Gaps = 9/195 (4%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M+D I+ +G + + + F + +S S L + + Sbjct: 1 MEDIKIKLLGEICEFYSGTGFPKKFQGNLEGKYPFYKVGDISKSADENKNFLTKSDNYVD 60 Query: 277 PESYE--TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + +IV P IVF I R +I + + +KP DS Sbjct: 61 ERIVKTLKGKIVPPKTIVFAKIGEALKLN--RRMITSTECLIDNNVLGIKPKN-DSILAE 117 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 ++ K+ S S++ +++++ + VP I+ Q I +++ ID + Sbjct: 118 YIFYFMKFVKLENYSESTTVPSVRKSELEKIKIRVPSIQNQQKIISILE----NIDKTKK 173 Query: 395 KIEQSIVLLKERRSS 409 +S+ L E + Sbjct: 174 SKTESLKKLNELIKA 188 >gi|329937001|ref|ZP_08286630.1| restriction modification system DNA specificity subunit [Streptomyces griseoaurantiacus M045] gi|329303608|gb|EGG47493.1| restriction modification system DNA specificity subunit [Streptomyces griseoaurantiacus M045] Length = 210 Score = 102 bits (255), Expect = 8e-20, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 74/195 (37%), Gaps = 13/195 (6%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE---------TYQIV 286 + T + + +I ++ G + Q E R + + ++ Sbjct: 6 RMGSGHTPSRSRPDWWSDCHIPWITTGEVKQVREDRIEDVHETREKISDVGLANSAAELH 65 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G + + ++ + YL W +R+ + Sbjct: 66 PKGTVFLCRTASAGYSGVMG----LDMATSQDFVTWTCGPRLLPYYLLWCLRAMRPDLLG 121 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +++ D++ L + +PP++ Q I + I + AR+D L +K+++ LL+ER Sbjct: 122 RLAMGSTHKTIYVPDLQMLRIPLPPMETQEQIVDAIRRQNARVDALTDKVQRQHELLRER 181 Query: 407 RSSFIAAAVTGQIDL 421 R + I AAVTGQ D+ Sbjct: 182 RQALITAAVTGQFDV 196 >gi|118497303|ref|YP_898353.1| type I restriction-modification system, subunit S [Francisella tularensis subsp. novicida U112] gi|194323607|ref|ZP_03057384.1| type I restriction modification DNA specificity domain protein [Francisella tularensis subsp. novicida FTE] gi|118423209|gb|ABK89599.1| type I restriction-modification system, subunit S [Francisella novicida U112] gi|194322462|gb|EDX19943.1| type I restriction modification DNA specificity domain protein [Francisella tularensis subsp. novicida FTE] Length = 406 Score = 102 bits (255), Expect = 8e-20, Method: Composition-based stats. Identities = 52/398 (13%), Positives = 122/398 (30%), Gaps = 25/398 (6%) Query: 24 HWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ IK T L + S D + + +++ +G+ D + D + Sbjct: 21 EWEENNIKALTSLLKDGSHGTHKEASESDYLLLSAKNITNGSINVYEDDRRISEEDYRQI 80 Query: 79 SI---FAKGQILYGKLGPYLRKAIIADFDGICSTQ-FLVLQPKDVLPELLQGWLLSIDVT 134 K ++ +G R A++ + D I + + K+ + + + Sbjct: 81 YRNYHLQKDDLVLTIVGTIGRSALVKEIDKIAFQRSVAFFRFKNHNSKFVYQLFNTPKFL 140 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ + + I + +P EQ I + + I+ L + Sbjct: 141 NELDRRKVVSAQPGIYLGDLAKIKLTLPSKQEQQKIADCLSTWDDSIENLKSLIENKKLY 200 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K Q L S + + + + +G V + NT+L Sbjct: 201 KKGMMQKLFSQEIRFKADNGSDFPEWVEKRLGDVGTVIT-------GKTPSTSNTELWNG 253 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 NI ++ +I I+ G IV+ I + + Sbjct: 254 NIEFITPTDIEGAKYQTRTSRTVTEQTKMNILPIGTIVYTCIGSIG-----KMSLSTLPS 308 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 I ++ + ++ + + + + + + + VP + E Sbjct: 309 ITNQQINSLIVNEQNNNEFVYYSLLNLTPYIQSTQANTTLPIINKTEFSKFKIKVPCLAE 368 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 Q I N ++ I++L +++EQ + + + Sbjct: 369 QTKIANFLSCLDDEIELLEQELEQLQLQ----KKGLMQ 402 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 28/210 (13%), Positives = 67/210 (31%), Gaps = 14/210 (6%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K E+ G ++ L + + + ES+ L LS NI + Sbjct: 12 KLRFKEFSGEWEENNIKALTSLLKDGSHGTHKEASESDYLLLSAKNITNGSINVYEDDRR 71 Query: 278 ESYETY------QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 S E Y + ++V + ++ +++ + + +S Sbjct: 72 ISEEDYRQIYRNYHLQKDDLVLTIVGTIGRSALVK---EIDKIAFQRSVAFFRFKNHNSK 128 Query: 332 YLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 ++ L + + + D+ ++ + +P +EQ I + ++ I+ Sbjct: 129 FVYQLFNTPKFLNELDRRKVVSAQPGIYLGDLAKIKLTLPSKQEQQKIADCLSTWDDSIE 188 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 L IE K + + + +I Sbjct: 189 NLKSLIENK----KLYKKGMMQKLFSQEIR 214 >gi|291614891|ref|YP_003525048.1| restriction modification system DNA specificity domain protein [Sideroxydans lithotrophicus ES-1] gi|291585003|gb|ADE12661.1| restriction modification system DNA specificity domain protein [Sideroxydans lithotrophicus ES-1] Length = 426 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 60/418 (14%), Positives = 132/418 (31%), Gaps = 42/418 (10%) Query: 27 VVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESG-----TGKYLPKDGNSRQSDTSTV 78 V +K F ++ G + ++Y+ ++ +S Y+ + + + Sbjct: 11 VGRLKDFCQVGDGAHASIARQEHGVMYLSAKNFKSSGLDLSNVDYISEGDYEKHFGKTKK 70 Query: 79 SIFA--KGQILYGKLGPYL--RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ KG +L+G +G D G+ S+ ++ + P+ L ++ S Sbjct: 71 AVTTPVKGDVLFGIIGSLGTPYTVKHRDRFGLSSSVAILRPSSGLCPDYLYHFMTSSAFQ 130 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + AI G + + N+P+ + Q I + A ID + Sbjct: 131 SAVHAIKSGVAQGFLSLEMVKNLPLVTHEINVQRKIAAILSAYDELIDNNQHRIALLERM 190 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK---- 250 +E + + G K +P WE+ Sbjct: 191 AEEIYREWFVRMRFHGYEKTTFNKG--------LPSDWEICEIGRKFATCLGGTPSRAEL 242 Query: 251 -LIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLR 306 I ++ G + + E Y +I+ V + SL Sbjct: 243 SYWGGEIPWINSGEVNKLRIVEASEYLTEDGLRYSATKIMPRRTTVIAITGATLGQVSLT 302 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 V S G+ S Y+ +++ ++ + G +Q + + V++ Sbjct: 303 EIAV---CANQSVVGVYDSVGVYSEYIFQYVKT-NIENLIAKQSGGGQQHINKDIVEKEK 358 Query: 367 VLVPPIKE--Q-FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +L+PP Q I I +I L+ + + R + ++G++ + Sbjct: 359 ILLPPPDLIGQYNQIVRPI---FDQIRTLMFSTQG----YTQVRDRLLPRLISGKLSV 409 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 55/190 (28%), Gaps = 7/190 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W++ I R G T G +I +I +V + Sbjct: 216 LPSDWEICEIGRKFATCLGGTPSRAELSYWGGEIPWINSGEVNKLRIVEASEYLTEDGLR 275 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S I + + G L + + + + + + + + Sbjct: 276 YSATKIMPRRTTVIAITGATLGQVSLTEIAVCANQSVVGVYDSVGVYS-EYIFQYVKTNI 334 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + A G H + + + +PP + + +I TL+ + ++ Sbjct: 335 ENLIAKQSGGGQQHINKDIVEKEKILLPPPDLIGQYNQIVRPIFDQIRTLMFSTQGYTQV 394 Query: 195 LKEKKQALVS 204 L+S Sbjct: 395 RDRLLPRLIS 404 >gi|228478285|ref|ZP_04062893.1| restriction modification system DNA specificity domain protein [Streptococcus salivarius SK126] gi|228249964|gb|EEK09234.1| restriction modification system DNA specificity domain protein [Streptococcus salivarius SK126] Length = 405 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 52/397 (13%), Positives = 109/397 (27%), Gaps = 19/397 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 WK +K L G +S K I + + ++ G + + + Sbjct: 17 WKKEKLKNIAPLRGGFAFKSEKFQNVGIPIVRISNI-GFDGTVGGEFEYYSKLSPDEKFV 75 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRI 137 +L G K + D + V ++ + L L + T ++ Sbjct: 76 LKGRSLLLAMSGATTGKIAMLDSEEEYYQNQRVGFFQNNGAVDYDFLSSVLQTKAFTNQL 135 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 A+ + K I + IP E+ + + + + Sbjct: 136 NAVLVAGAQPNISSKEIDSFEFCIPESIEEQSAIGSLFRILEDLLA---SYRDNLANYQS 192 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K ++S + K +++ G + V + + T L K ++ L Sbjct: 193 LKMTMLSKMFPKAGQTVPELRLDGFKGDWEVKE---LGNIVDFYTGLTYKPNDMVSDGTL 249 Query: 258 SLSYGNIIQK-LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 L N+ ++ Q V G+IV + D + E Sbjct: 250 VLRSSNVRDGEFIYKDNVFVNPDIVNCQNVKLGDIVVVVRNGSRDLIGKHALIKSEMPNT 309 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 ++ L+ + + + KR+ P +EQ Sbjct: 310 VIGAFMTGVRYDAPEFINALLDTEKFISEINKNLGSTINQITTGNFKRMKFHFPDKEEQR 369 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + D L+ + I L+ + + Sbjct: 370 AIGSY----FTNFDNLIVAHREKITQLETLKKKLLQD 402 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 53/188 (28%), Gaps = 11/188 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+V + TG T + + + +V G+++ KD D Sbjct: 220 DWEVKELGNIVDFYTGLTYKPNDMVSDGTLVLRSSNVR--DGEFIYKDNVFVNPDIVNCQ 277 Query: 80 IFAKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G I+ G + A+I + + PE + L + Sbjct: 278 NVKLGDIVVVVRNGSRDLIGKHALIKSEMPNTVIGAFMTGVRYDAPEFINALLDTEKFIS 337 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I G+T++ + P EQ I I + + L Sbjct: 338 EINKNL-GSTINQITTGNFKRMKFHFPDKEEQRAIGSYFTNFDNLIVAHREKITQLETLK 396 Query: 196 KEKKQALV 203 K+ Q + Sbjct: 397 KKLLQDMF 404 >gi|27466962|ref|NP_763599.1| specificity determinant HsdS [Staphylococcus epidermidis ATCC 12228] gi|27314504|gb|AAO03641.1|AE016744_44 probable specificity determinant HsdS [Staphylococcus epidermidis ATCC 12228] gi|319740868|gb|ADV68930.1| putative specificity determinant HsdS [Staphylococcus aureus] Length = 400 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 63/397 (15%), Positives = 133/397 (33%), Gaps = 22/397 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + WK + G + ES K+ L ++S + + D ++ Sbjct: 17 EEWKKRKLGEVVNYKNGGSFESLVKNHGVYKLITLKSVNTEGKLCNSGKYIDDKCVETLC 76 Query: 82 AKGQILYGKLGPYL----RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ I + + + + + L PK + L + Sbjct: 77 NDTLVMILSEQAPGLVGMTAIIPNNNEYVLNQRVAALVPKQFIDSQ-FLSKLINRNQKYF 135 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G + + + N P EQ I ++D I +ELL++ Sbjct: 136 SVRSAGTKVKNISKGHVENFNFLSPNYTEQQKIGNF----FSKLDRQIELEEEKLELLEQ 191 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +K+ + I ++ L + +S +W + T + + E N Sbjct: 192 QKRGYIQKIFSQDLRFKDENGNSYPDWSIKKIEDIS--KVNKGFTPNTKNDKYWDELNEN 249 Query: 258 SLSYGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 LS + QK + N G+ + + VD ++ F ++ I Sbjct: 250 WLSIAGMTQKYLYKGNKGITEKGASKHVKVDKDTLIMSFKLTLGKLAIVKEPIYTNEAIC 309 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + K +++ Y+ + + S ++ G+ +L + + + V +P I+EQ Sbjct: 310 ---HFVWKESNVNTEYMYYYLNSINISTFGAQAVKGV--TLNNDAINSIIVKLPVIQEQN 364 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I ++D L+EK + LLK+R+ F+ Sbjct: 365 KIAYF----FNKLDKLIEKQSSKVELLKQRKQGFLQK 397 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 15/168 (8%), Positives = 45/168 (26%), Gaps = 3/168 (1%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK---LIESNILSLSYGNIIQKLETRNMGLKPESYE 281 + W+ + +V N + + ++ ++ + + N G + Sbjct: 12 FPEFDEEWKKRKLGEVVNYKNGGSFESLVKNHGVYKLITLKSVNTEGKLCNSGKYIDDKC 71 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + + ++ ++ A+ P + + + + Sbjct: 72 VETLCNDTLVMILSEQAPGLVGMTAIIPNNNEYVLNQRVAALVPKQFIDSQFLSKLINRN 131 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +++ V+ L P EQ I N + +I Sbjct: 132 QKYFSVRSAGTKVKNISKGHVENFNFLSPNYTEQQKIGNFFSKLDRQI 179 >gi|49257052|dbj|BAD24841.1| hsdS homologue [Staphylococcus aureus] Length = 412 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 65/417 (15%), Positives = 127/417 (30%), Gaps = 38/417 (9%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK---GQ 85 +K K R + + + S K G + + + K Sbjct: 7 RLKELAKYKNERIDTNQ-----LTTSNYISTENLLPNKQGKQKANKLPSSKTVKKYTEND 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 IL + PY +K AD G S + + + + L +L Q + +G Sbjct: 62 ILISNIRPYFKKIWQADNIGGISNDVLNITSSNEKISNDYLYYYLSQDKFFQYMTQTSKG 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 M D + I + +P E + I +D I IE L+E Q L Sbjct: 122 TKMPRGDKEAIMEFEIQVPKNVE---YQNFIRNLGKLLDNKIKINNEIIENLEELSQTLF 178 Query: 204 SYIVTKGLNPDV---KMKDSGIE----WVGLVPDHWEVKPFFALVTELNR----KNTKLI 252 PD K +G E +G +P W VK + +N K Sbjct: 179 KRWFVDFEFPDENGAPYKANGGEMIDSELGKIPKGWIVKSLDEIANYINGLAMQKYPSNK 238 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E ++ + + N +D G+I+F + L Sbjct: 239 EESLPIVKIKELKNGFTDENSNRCTTEIPEKAKIDNGDIIFSWSATL-----LVKMWAGG 293 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS---GLRQSLKFEDVKRLPVLV 369 + + V + + + + F + + + + + +++ Sbjct: 294 KAGLNQHLFKVTSETF--PKWFYYLWTKRYIEYFINIANDKATTMGHINRKHLSHAKIVL 351 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 P Q + N + + E+ I L E R + + ++G+I++ + + Sbjct: 352 PT---QLQLENF-DKIFHNLLEKQLNTEEEIKRLIELRDTLLPKLMSGEIEIPDDVE 404 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 56/177 (31%), Gaps = 12/177 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKY 63 + DS +G IPK W V + G S + + + ++++++G + Sbjct: 201 EMIDSE---LGKIPKGWIVKSLDEIANYINGLAMQKYPSNKEESLPIVKIKELKNG---F 254 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 ++ N ++ + G I++ L K G + + + Sbjct: 255 TDENSNRCTTEIPEKAKIDNGDIIFSWSATLLVKMWAGGKAG-LNQHLFKVTSETFPKWF 313 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 W A + TM H + K + + + +P + + + Sbjct: 314 YYLWTKRYIEYFINIANDKATTMGHINRKHLSHAKIVLPTQLQLENFDKIFHNLLEK 370 >gi|311741899|ref|ZP_07715710.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] gi|311314905|gb|EFQ84811.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] Length = 113 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 31/113 (27%), Positives = 55/113 (48%), Gaps = 1/113 (0%) Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPI 372 + + A WL+ + + F ++ +G +++ + + +PP+ Sbjct: 1 MATSQHFAAWICGDRLLPEYLWLLFTGAMQPYFDSLTNGSTLRTIGMSIIGGFRIPLPPV 60 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 EQ I +T +ID L+ + + I L +ERRS+ I AAVTGQID+RG + Sbjct: 61 SEQVQIVQTARDQTGKIDELMAETARFIELSRERRSALITAAVTGQIDVRGAA 113 >gi|189347937|ref|YP_001944466.1| restriction modification system DNA specificity domain [Chlorobium limicola DSM 245] gi|189342084|gb|ACD91487.1| restriction modification system DNA specificity domain [Chlorobium limicola DSM 245] Length = 438 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 60/436 (13%), Positives = 129/436 (29%), Gaps = 44/436 (10%) Query: 12 DSGVQWIGAIPKHWKVV-----PIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGK 62 DSG W V + L T T ++ K + +++ G Sbjct: 24 DSG---------DWMKVGLTESTLAEVCSLVTDGTHDTPKRVETGYPLVKAKEISGGRID 74 Query: 63 YLPKDGNSRQS--DTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPK 117 + D S Q S G L+ +G L +A + I + P Sbjct: 75 FDNCDQISEQEHLKVIARSKPEFGDTLFAHIGASLGEAAFVNTTREFSIKNVALFKPNPS 134 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIA 176 + L ++S + G+ + + LA Q I + A Sbjct: 135 VIDARYLYYLVVSPAFQSLAKGTRTGSAQPFLGLSQLRGHQIQYHRDLAHQRRISGILSA 194 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 I+ E ++L P + +G++P WEVK Sbjct: 195 YDDLIENRQRRIRILE----EMARSLYREWFVHFRFPGHENHPLVPSSLGVIPQGWEVKK 250 Query: 237 FFALVTELNRK-NTKLIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQIVDPGEIVFR 294 + + R + +E + +I ++ + + GE++F Sbjct: 251 LGDIAESMRRNVSKGKLEERTPYVGLEHIPRQSLALDAWEMATALGSNKLEFKKGEVLFG 310 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL- 353 I K S+ + + + + S + V A +G Sbjct: 311 KIRPYFHKVSVAPFVGL---CSADTIVIRALRPEHYGIVVACVSSDEFVAVASATANGAK 367 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS---F 410 + +++ V++P N+ +A ++ + + I ++ R + Sbjct: 368 MPRANWNVLEKYQVVIPK-------GNLAEKFSALFADIIAQQQTLIFKIQNLRQTRDLL 420 Query: 411 IAAAVTGQIDLRGESQ 426 + ++G++ L+ + Sbjct: 421 LPRLLSGEVKLKETDE 436 >gi|325121240|gb|ADY80763.1| restriction endonuclease S subunits-like protein [Acinetobacter calcoaceticus PHEA-2] Length = 419 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 64/400 (16%), Positives = 128/400 (32%), Gaps = 35/400 (8%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL-PKDGNSRQSDTSTVSIFAKGQILYG 89 +++ I ++ DV G K Q+DT +G IL Sbjct: 35 GNHGEIHPTSADYVENGIPFVMATDVFDGNVYLDKSKKITKEQADTLRKGFSIEGDILLT 94 Query: 90 KLGPYLRKAIIADFDGICS------TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 A + D T + V + ++P+ ++ S + A+C G Sbjct: 95 HKATIGNVAKVPKLDTPYIMLTPQVTYYRVRDYEKLVPDFIKSSFESKKFQNELIALCTG 154 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT + +P P +EQ I + +I L + + + Q L Sbjct: 155 ATRLYIGISEQRKLPFSYPSKSEQTKIASFLSTVDEKISQLNQKHKLLSQYKQGMMQKLF 214 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + +G E+ G V A + K + +E IL ++ N Sbjct: 215 SQQFRFKAD-------NGGEFGGWV---EIKITDVADYVDYRGKTPRKVEDGILLVTAKN 264 Query: 264 IIQKLETRNMGLKP------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 I ++ + + + G+++ + S+ E + Sbjct: 265 IRFGYIDYSISQEYICSDDFDEVMRRGRAEIGDVLITTEAPLGNVASV----DRENIALA 320 Query: 318 SAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVP-PIK 373 + + ++ +L S + + + G Q +K + L + +P I+ Sbjct: 321 QRVIKYRGKKGILNNEFLKQKFLSEEFQSLISSKATGGTVQGIKGSTLHNLEINIPEDIE 380 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I N + A ID +E + + I K+ + + Sbjct: 381 EQTKIANFL----ATIDQKIEVVAKQIEQAKQWKKGLLQQ 416 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 71/216 (32%), Gaps = 16/216 (7%) Query: 213 PDVKMKDSGIEWV-----GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 P ++ K+ W+ GLV + KP E++ + +E+ I + ++ Sbjct: 4 PKLRFKEFDGAWISTNIQGLVDQNILDKPMDGNHGEIHPTSADYVENGIPFVMATDVFDG 63 Query: 268 ----LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS--AYM 321 +++ + + G+I+ + + + Y Sbjct: 64 NVYLDKSKKITKEQADTLRKGFSIEGDILLTHKATIGNVAKVPKLDTPYIMLTPQVTYYR 123 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + ++ S A+ +G R + + ++LP P EQ I + Sbjct: 124 VRDYEKLVPDFIKSSFESKKFQNELIALCTGATRLYIGISEQRKLPFSYPSKSEQTKIAS 183 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ +I L +K + LL + + + + Sbjct: 184 FLSTVDEKISQLNQKHK----LLSQYKQGMMQKLFS 215 >gi|239629951|ref|ZP_04672982.1| type I restriction modification system [Lactobacillus paracasei subsp. paracasei 8700:2] gi|239527563|gb|EEQ66564.1| type I restriction modification system [Lactobacillus paracasei subsp. paracasei 8700:2] Length = 400 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 58/406 (14%), Positives = 135/406 (33%), Gaps = 33/406 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + T + + I +D K+ K + S + Sbjct: 12 WEKRKLGEVVERVTRKNRDLVSTRPLTISAQDGLVDQRKFFSK--TVASKNISNYFLLKA 69 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLL-SIDVTQRI 137 G Y K G+ ST ++V +PK + + L + + Sbjct: 70 GDFAYNKSYSVGYPWGAVKRLDKYPSGVLSTLYIVFKPKKINSQFLVTYFEGTTWYVSVS 129 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + EGA + + +E I +D LI I+ ++ Sbjct: 130 KVASEGARNHGLLNISASDFFDQQLFFPTKKTEQESIGLTIKVLDDLIAATQDKIDAFEQ 189 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK--LIESN 255 K+A + ++ + G + W + +++ KNT+ E+ Sbjct: 190 IKKAFLQHLFDQSW------------RFGEYSELWTSHLLGEITSKVTEKNTENLYHETF 237 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERG 314 S YG + Q+ + + Y +V + V+ I +R ++ G Sbjct: 238 TNSAKYGIVEQQSFFDKLISNEANLTNYYVVRENDFVYNPRISNLAPVGPVRRNKLNRTG 297 Query: 315 IITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLV 369 +++ Y K + +L + + + Y G R ++K + +++PV + Sbjct: 298 VMSPLYYVFKATNAAYPMFLEYFFKGESWYRFMYLNGDTGARSDRFAIKDKVFEQMPVKL 357 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P EQ I ++ ++ ++ + ++ + ++ + S + + Sbjct: 358 PEESEQKKIGALL----QNLETVMNQTQERLQKIRTIKDSLLKSLF 399 >gi|188528305|ref|YP_001910992.1| type I R-M system specificity subunit [Helicobacter pylori Shi470] gi|188144545|gb|ACD48962.1| type I R-M system specificity subunit [Helicobacter pylori Shi470] Length = 375 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 54/403 (13%), Positives = 121/403 (30%), Gaps = 42/403 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P +W+ V + + G Y + + Y S+ I Sbjct: 10 LPLNWQRVRLGDIFFITAGGDLSK---PHYSNTKQSDFNYPIYSNAIEKKGLCGYSSFFI 66 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 I G A D+ + + LVLQPK + + + +++ Sbjct: 67 IKNKSITITARGTI-GVAFFRDYPYVPIGRLLVLQPKISNIDCRFY---AEYINSKVKFN 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 E T+ + +P+PPL EQ I + +D + I + K+ Sbjct: 123 TEQTTIPQLTIPKVALCEIPLPPLNEQNAIANIL----SALDRYLCALDALILKKESVKK 178 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 AL ++++ + +G + + ++ K + + L Sbjct: 179 ALSFELLSQKKRLKGFNQAWQRVRLGDIAEIKRGVRITKNELDVFGKYPV-VSGGVGFLG 237 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 Y N + E I + + + Sbjct: 238 YTNNFNRYE------------------------NTITIAQYGTAGYVNFQKNKFWANDVC 273 Query: 321 MAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + P+ + +L + ++ + + S+ + + +L+PP+ EQ I Sbjct: 274 FCIYPNKDIIKNIFLYYFLKVNQNYLYEISNRNATPYSISKDKILDFEILLPPLNEQIAI 333 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 N+++ I L K Q + + + ++ +I + Sbjct: 334 ANILSDLDNEIASLKNKKRQ----FENIKKALNHDLMSAKIRV 372 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 54/173 (31%), Gaps = 17/173 (9%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT-S 318 Y N Q + + I+ + ++ A + + Sbjct: 35 HYSNTKQSDFNYPIYSNAIEKKGLCGYSSFFIIKNKSITITARGTIGVAFFRDYPYVPIG 94 Query: 319 AYMAVKPHGIDSTYLAW--LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + ++P + + + S KV + L V + +PP+ EQ Sbjct: 95 RLLVLQPKISNIDCRFYAEYINS----KVKFNTEQTTIPQLTIPKVALCEIPLPPLNEQN 150 Query: 377 DITNVINVETARI---DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I N+++ + D L+ K E + + ++ + L+G +Q Sbjct: 151 AIANILSALDRYLCALDALILKKESV-------KKALSFELLSQKKRLKGFNQ 196 >gi|78046066|ref|YP_362241.1| type I site-specific deoxyribonuclease (specificity subunit) [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78034496|emb|CAJ22141.1| type I site-specific deoxyribonuclease (specificity subunit) [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 419 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 52/405 (12%), Positives = 124/405 (30%), Gaps = 27/405 (6%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W +VP++R +++T + ++ E Y KD + + + Sbjct: 23 SWPIVPLERIAARISTKNCNGQVTRVLTNSAEFGVLDQRDYFDKDIAT-AGKVDGYYVVS 81 Query: 83 KGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 KG +Y + + G+ S + V + + + + S + Sbjct: 82 KGDYVYNPRTSAIAPVGPISRNNLGEGVMSPLYTVFCFSEEKTDFYEHYFKSPGWHSYLR 141 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G P + ++ T + +I + R +E LK Sbjct: 142 SAASTGARHDRMSITAGAFMRMPVPSPSREEQQKIADCLTSL-EEVIAAQGRKVEALKVH 200 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+ L+ + +++ P+ W +P ++ + + Sbjct: 201 KRGLMQQLFPLEGEALPRLRFP---EFRDAPE-WAERPLCQVIEVASGQVDPTEAPYCDF 256 Query: 259 LSYGNIIQKLETRNMG-----LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 G + ET ++ + + D ++++ I +K ++ Sbjct: 257 PHVGGENIESETGSLVGLKSAREDGVTSGKYLFDEKDVLYSKIRPILNKVAVPDF----N 312 Query: 314 GIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVP 370 GI ++ ++P D +L +L+RS + G + E + +P Sbjct: 313 GICSADIYPIRPSSSDITRQFLVYLLRSASFVEYATKHSERGKIPKINREALAAYGARLP 372 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I + + D + + +LK + + Sbjct: 373 QQVEQQRIADCLFSV----DTAITAESAQLTVLKTHKQGLMQQLF 413 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 45/208 (21%), Positives = 85/208 (40%), Gaps = 18/208 (8%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGK 62 +P+++D+ P+ W P+ + ++ +G+ + D ++G E++ES TG Sbjct: 220 RFPEFRDA--------PE-WAERPLCQVIEVASGQVDPTEAPYCDFPHVGGENIESETGS 270 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + TS +F + +LY K+ P L K + DF+GICS ++P Sbjct: 271 LVGLKSAREDGVTSGKYLFDEKDVLYSKIRPILNKVAVPDFNGICSADIYPIRPSSSDIT 330 Query: 123 LLQ--GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 L S + E + + + + +P EQ I + + + Sbjct: 331 RQFLVYLLRSASFVEYATKHSERGKIPKINREALAAYGARLPQQVEQQRIADCLFS---- 386 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVT 208 +DT IT + +LK KQ L+ + Sbjct: 387 VDTAITAESAQLTVLKTHKQGLMQQLFP 414 >gi|256825201|ref|YP_003149161.1| hypothetical protein Ksed_13680 [Kytococcus sedentarius DSM 20547] gi|256688594|gb|ACV06396.1| hypothetical protein Ksed_13680 [Kytococcus sedentarius DSM 20547] Length = 354 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 59/352 (16%), Positives = 126/352 (35%), Gaps = 31/352 (8%) Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---------TQRIE 138 + K+ +A DG+ + + V++P+ + +L+ Sbjct: 2 FNKMSIRDGAMGLAREDGLVTYHYEVMRPRPAVEARYVVYLMKSSWFGGELIKRERGIGA 61 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +G + ++ + I IP + Q I + + ET +ID++I + ++ L+E+ Sbjct: 62 GGAKGVRTTEVPFRVLRTIDCYIPTVEGQRAIADFLDRETAQIDSMIEAQNVLMQELRER 121 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 ++A +S + + + VP + + S S Sbjct: 122 QRAAISNTIDSDAS------------LQRVPLRRLITGISQGWSPQCEDTPVDDPSTQWS 169 Query: 259 LSYGNIIQKLETRNMGLK--PESYETYQIV--DPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + R K P E + G+++ + + S Sbjct: 170 VLKVGCVNGGVFRPEQNKMLPGDLEPRPELGLRAGDLLMSRGNTREWVGSAAVVDRDYPT 229 Query: 315 IITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVL 368 ++ S + + S Y+A + + G Q + D++ + Sbjct: 230 LMLSDLLYRVAVDRSLVSSEYVALALSTRKARDEIEIAAKGASHSMQKVSQGDIRSTTIP 289 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 + ++ Q D+ N + T R D ++ ++ I LL+ERR + I AAVTG+ID Sbjct: 290 LRSLQAQADVVNEASAITVRADAMISAAQEVIDLLRERREALITAAVTGRID 341 >gi|227539165|ref|ZP_03969214.1| type I restriction-modification system, subunit S [Sphingobacterium spiritivorum ATCC 33300] gi|227240847|gb|EEI90862.1| type I restriction-modification system, subunit S [Sphingobacterium spiritivorum ATCC 33300] Length = 409 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 50/430 (11%), Positives = 132/430 (30%), Gaps = 51/430 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +WK + + G + + I + + + G G + + D Sbjct: 2 SNWKTYKLGDLIDVKHGFAFKGEFFSDEPTEDILLTPGNFKIGGG-FKTDKFKYYKGDYP 60 Query: 77 TVSIFAKGQILYGKL-----GPYLRKAIIADFDGICST------QFLVLQPKDVLPELLQ 125 + +G IL G L + + + + D+ P+ L Sbjct: 61 KSYVLKEGDILITMTDLSKAGDTLGYSAKIPKHNEVNYLHNQRLGLVQFKSDDIDPDFLY 120 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L + I G+T+ H I + +P ++ ++I +D I Sbjct: 121 WVLRTQPYQYYIVGSATGSTVKHTSPTRICSYEFQVPKDKKKQ---KEIAQILSSLDDKI 177 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + L+ QA+ + ++P+ W K + + Sbjct: 178 ELLQQMNQTLENIAQAIFKEWCCVEED--------------IIPEGWSWKKLIDIANVSS 223 Query: 246 RKN-----------TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 K + LS G I + E E + + G+I+ Sbjct: 224 SKRIFREEYKIGGIPFYRGKEVTQLSNGEAISTELFISEERYNEIKEKFGVPQIGDILIT 283 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGL 353 + + + ++ ++ ++ +++ + + ++ Sbjct: 284 SVGTIGSVWLVDNDSPFYFKDGNVTWVKDYKTVVNGEFVYEWLQTKEAKEQIKSVTIGST 343 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +Q+L ++ L +L+P + + + + +++ I L + R + + Sbjct: 344 QQALTISALRELKILIPDTET----VSKVCNQLGKLNAKRINNLNQIQTLTQTRDTLLPK 399 Query: 414 AVTGQIDLRG 423 ++GQ++++ Sbjct: 400 LMSGQLEIKN 409 Score = 46.7 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 66/197 (33%), Gaps = 13/197 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVES-GTGKYLPKDGNSRQSDT 75 IP+ W + +++ + + I + ++V G+ + + + Sbjct: 206 IPEGWSWKKLIDIANVSSSKRIFREEYKIGGIPFYRGKEVTQLSNGEAISTELFISEERY 265 Query: 76 STVS----IFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLP-ELLQGW 127 + + + G IL +G ++ + F V K V+ E + W Sbjct: 266 NEIKEKFGVPQIGDILITSVGTIGSVWLVDNDSPFYFKDGNVTWVKDYKTVVNGEFVYEW 325 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L + + ++I+++ G+T + + + IP + ++ + + + Sbjct: 326 LQTKEAKEQIKSVTIGSTQQALTISALRELKILIPDTETVSKVCNQLGKLNAKRINNLNQ 385 Query: 188 RIRFIELLKEKKQALVS 204 + L+S Sbjct: 386 IQTLTQTRDTLLPKLMS 402 >gi|254507634|ref|ZP_05119767.1| restriction modification system DNA specificity domain protein [Vibrio parahaemolyticus 16] gi|219549521|gb|EED26513.1| restriction modification system DNA specificity domain protein [Vibrio parahaemolyticus 16] Length = 594 Score = 102 bits (255), Expect = 9e-20, Method: Composition-based stats. Identities = 59/473 (12%), Positives = 135/473 (28%), Gaps = 94/473 (19%) Query: 21 IPKHWKVVPIKRFT-KLNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W+ + + ++ G T + + + + + D+++ + + + Sbjct: 106 VPKGWEWTRLGNLSSDIHYGYTASAKPNSEGVRLLRITDIQNDKVNWGTVPACDITEEKA 165 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSID 132 + IL + G + K+ + + S V + + V + +L S Sbjct: 166 KSYLLENDDILIARTGGTIGKSYLVENIDLQAVFASYLIRVKRVQAVYAPFTKVFLGSQL 225 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK------------------- 173 +++ G + + + + +PP +Q I K Sbjct: 226 YWKQLIENSAGTGQPNVNATALKQLLFIVPPFNQQKRIVAKVDELMALCDQLEQQTEASI 285 Query: 174 ----------------------IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 ++ RI E + + KQ ++ V L Sbjct: 286 EAHQVLVTTLLDTLTNSADADELMQNWERISEHFDTLFTTEESIDQLKQTILQLAVMGKL 345 Query: 212 NPDVKMKD-------------------------------SGIEWVGLVPDHWEVKPFFAL 240 + S E +P WE + Sbjct: 346 VSQDPNDEPASELLKRIAEEKAQLVKEKKIKKQKALPPISEDEKPFELPSGWEWCRVDDV 405 Query: 241 V---TELNRKNTKLIESNILSL--SYGNIIQKLETRNMGLKPESY----ETYQIVDPGEI 291 V K++ +ES+ + + GN + R+ G + + Y E I + ++ Sbjct: 406 VALKHGYAFKSSYFLESSGPYVLTTPGNFYETGGFRDRGDRTKYYDGPLEVEFIFEANDL 465 Query: 292 VFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFY 347 + + + + + P+ Y++W S L Sbjct: 466 IIPLTEQAPGLLGSAAFIPEDGRTYLHNQRLAKLTPYHDAVRKDYISWYFNSPYLRSELA 525 Query: 348 AMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +G + V+ +PP EQ +I I+ + L ++ +S Sbjct: 526 RTCTGTTVRHSSPTKVQVTLFALPPTNEQKNIVERIDSLLSICQQLKARLNES 578 Score = 80.2 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 61/189 (32%), Gaps = 7/189 (3%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS------YGNIIQKLETRNM 273 + E VP WE L ++++ T + N + N T Sbjct: 98 TEQEAPFNVPKGWEWTRLGNLSSDIHYGYTASAKPNSEGVRLLRITDIQNDKVNWGTVPA 157 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 E +++ +I+ K L ++ + + + + + Sbjct: 158 CDITEEKAKSYLLENDDILIARTGGTIGKSYLVENIDLQAVFASYLIRVKRVQAVYAPFT 217 Query: 334 AWLMRS-YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S ++ + ++ +K+L +VPP +Q I ++ A D L Sbjct: 218 KVFLGSQLYWKQLIENSAGTGQPNVNATALKQLLFIVPPFNQQKRIVAKVDELMALCDQL 277 Query: 393 VEKIEQSIV 401 ++ E SI Sbjct: 278 EQQTEASIE 286 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 57/204 (27%), Gaps = 17/204 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W+ + L G +S + V + G + G + D + Sbjct: 392 ELPSGWEWCRVDDVVALKHGYAFKSS-YFLESSGPYVLTTPGNFYETGGFRDRGDRTKYY 450 Query: 80 --------IFAKGQILYGKL----GPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQ 125 IF ++ G A I + + + + L P Sbjct: 451 DGPLEVEFIFEANDLIIPLTEQAPGLLGSAAFIPEDGRTYLHNQRLAKLTPYHDAVRKDY 510 Query: 126 --GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + S + + C G T+ H+ + +PP EQ I E+I + Sbjct: 511 ISWYFNSPYLRSELARTCTGTTVRHSSPTKVQVTLFALPPTNEQKNIVERIDSLLSICQQ 570 Query: 184 LITERIRFIELLKEKKQALVSYIV 207 L A+V V Sbjct: 571 LKARLNESQATQLHLTDAIVEQAV 594 >gi|300933509|ref|ZP_07148765.1| restriction modification system DNA specificity subunit [Corynebacterium resistens DSM 45100] Length = 400 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 51/410 (12%), Positives = 112/410 (27%), Gaps = 43/410 (10%) Query: 23 KHWKVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ V ++ + +G T + + I ++ +++ + + + Sbjct: 2 SDWREVAVEALCSRVTSGGTPSRKRADYYTDEGIPWVKSQELIGARIATTEEHISEAGLE 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ + +L G + + + + + + + Sbjct: 62 RSSAKLLPPDTVLLAMYGANVGQLGWLGVEATVNQAICAMVTDPKEADARFLYYALAGAR 121 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +R+ GA + + I + +P LA Q I + + I+ Sbjct: 122 ERLVGNAHGAAQQNLSQQLIKPFKLAVPALATQQRIGAILRSIDELIENNRRRIEVLE-- 179 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + +A+ K P + +G +P+ W + K K Sbjct: 180 --KMARAIYREWFVKFRYPGHEDVPLVDSALGPIPEGWRAATIGDALELKYGKALKASAR 237 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 ++ + + + P +V R ++ + ++ Sbjct: 238 RGGGVAVVSSAGVVGWHDESFVD---------GPAIVVGRKGNVGSVHWVDGPCWPIDTA 288 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 T L S L + + L E P L+P Sbjct: 289 YFVQ------------TDLPLRFVSEQLRRTAFTNSHAAVPGLSREAAYAQPFLLPD--- 333 Query: 375 QFDITNVINVETARIDVLVEKIE---QSIVLLKERRSSFIAAAVTGQIDL 421 V++ A +D L L E R + VTGQID+ Sbjct: 334 ----VQVLDSFQALVDPLGSHATGLMSQNEKLAEVRDLLLPKLVTGQIDV 379 >gi|332661882|ref|YP_004451352.1| restriction modification system DNA specificity domain-containing protein [Haliscomenobacter hydrossis DSM 1100] gi|332337379|gb|AEE54479.1| restriction modification system DNA specificity domain protein [Haliscomenobacter hydrossis DSM 1100] Length = 404 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 49/411 (11%), Positives = 113/411 (27%), Gaps = 21/411 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +WK I ++ G + G I +I + D S + Sbjct: 3 NWKTYKISDLCEVGRGSSPRPIIDQRFFEGGSIPWIKIADATSSGKYIYYTKEYVNEFGA 62 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ-FLVLQPKDVLPELLQGWLLSIDVT 134 S KG ++ G L + G + K L + I + Sbjct: 63 SFSRYLDKGSLIIAASGVSLGQIKFLGVRGCIHDGWLYISDYKKDLISKDFLYYFLIYYS 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 GA + + + + + N + IP L+ Q I + I+ E Sbjct: 123 AGFHNFSSGAAIQNINTEILRNTLISIPHLSMQNSIASILSNYDDLIEVNNQRIKLLEET 182 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 +E + + G +K +WV FA + +T E Sbjct: 183 ARELYKEWFVRMRFPGWKETKFVKGVPEDWVYDT------CYSFADIKGGGTPSTTNPEY 236 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV--ME 312 +++ + + + + + +F R Sbjct: 237 WEGDINFFTPTDHSNSFFIFETEKKITEKGLRNSSTKMFTKYSTFITARGTVGNICLAGT 296 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + + H + + +L + + ++ K L+P Sbjct: 297 DMAMNQSCFGIVSHNENDCFFTFLFTDEMIKYLKLVANGATFDAITLNTFKNYKALIPNT 356 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + + + +I+ L+ Q L++ R + ++ ++ ++ Sbjct: 357 ELRQLFFERTSPFFYQIENLL----QQNTQLRQIRDRLLPRLISDKLTIKE 403 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 61/192 (31%), Gaps = 9/192 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKY-LPKDGNSRQS 73 +P+ W F + G T + DI + D + + K + Sbjct: 208 VPEDWVYDTCYSFADIKGGGTPSTTNPEYWEGDINFFTPTDHSNSFFIFETEKKITEKGL 267 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+ +F K G + + + F ++ + +L + ++ Sbjct: 268 RNSSTKMFTKYSTFITARGTVGNICLAGTDMAMNQSCFGIV--SHNENDCFFTFLFTDEM 325 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ + GAT N IP + L E+ +I+ L+ + + + Sbjct: 326 IKYLKLVANGATFDAITLNTFKNYKALIPNTELRQLFFERTSPFFYQIENLLQQNTQLRQ 385 Query: 194 LLKEKKQALVSY 205 + L+S Sbjct: 386 IRDRLLPRLISD 397 >gi|329736380|gb|EGG72649.1| type I restriction modification DNA specificity domain protein [Staphylococcus epidermidis VCU045] Length = 418 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 60/418 (14%), Positives = 144/418 (34%), Gaps = 34/418 (8%) Query: 29 PIKRFTKLNTG-------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 +K T G S Y+ + D++ G S + Sbjct: 7 KLKDLTVNGKGEYGIGAPAVKYSPNLYKYLRITDID-DNGFINTNQMKSINDKNEEKYLL 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQ---PKDVLPELLQGWLLSIDVTQR 136 I++ + G K+ + D + FL+ P + P+ L+ + L+ + Sbjct: 66 KANDIVFARTGNSTGKSYFYNSDDGPLVYAGFLIKFSLDPTKLNPKYLRYYTLTNEYKGW 125 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I G+T + + K G++ + +PP Q + + + + ++ + + + L+ Sbjct: 126 INQFSIGSTRKNINAKIFGDMVISLPPRYYQDFVVDILDS----LERKVKINKQMVANLE 181 Query: 197 EKKQALVSYIVTKGLNPD---VKMKDSGIE----WVGLVPDHWEVKPFFALVTELNRKNT 249 E Q L + PD K SG E +G +P W+V + + ++ Sbjct: 182 ELSQTLFKHWFVDFEFPDEDGNPYKSSGGEMIDSELGEIPSDWKVGVLSDMTEIIMGQSP 241 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 K N + + + +N +KP Y + + + Sbjct: 242 KSDTYNNNKVGLPLLNGASDFKNRNIKPTKYTSAPKKIGHNL---DYVFGVRATIGLVTE 298 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVL 368 + I K + + ++ ++ F +GSG ++ +D+K+ ++ Sbjct: 299 LDGEYAIGRGAGLSKNNEENREFIYEILNQAFT--YFERIGSGSVYINISSKDLKQYKLI 356 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +P + + + I + ++ I L R + + ++G++++ + + Sbjct: 357 IPS----KQVLMKYHYQLEPIFSELHNRKEQITSLTNLRDTLLPKLMSGELEIPDDIE 410 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 59/199 (29%), Gaps = 7/199 (3%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 YK SG + +G IP WKV + T++ G++ +S + +G + Sbjct: 205 YKSSGGEMIDSELGEIPSDWKVGVLSDMTEIIMGQSPKSDTYNNNKVGLPLLNGASDFKN 264 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 ++ + ++ I ++G + I K+ Sbjct: 265 RNIKPTKYTSAPKKIGHNLDYVFGVRATIGLVTELDGEYAIGRGA---GLSKNNEENREF 321 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + + E I G+ + K + + IP + ++ + Sbjct: 322 IYEILNQAFTYFERIGSGSVYINISSKDLKQYKLIIPSKQVLMKYHYQLEPIFSELHNRK 381 Query: 186 TERIRFIELLKEKKQALVS 204 + L L+S Sbjct: 382 EQITSLTNLRDTLLPKLMS 400 >gi|120436928|ref|YP_862614.1| type I restriction-modification system DNA specificity subunit [Gramella forsetii KT0803] gi|117579078|emb|CAL67547.1| type I restriction-modification system DNA specificity subunit [Gramella forsetii KT0803] Length = 428 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 58/422 (13%), Positives = 136/422 (32%), Gaps = 33/422 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + WK V + + G + + + G G Y + Sbjct: 3 EGWKFVKLGDIIHIKHGYGFKGEFFVDEPTKNFLLTPGNFAIGGG-YKSDKIKYYDGPIN 61 Query: 77 TVSIFAKGQILYGKL---------GPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQ 125 I +G ++ G + ++ + + + + L+ +V + + Sbjct: 62 EDFILKEGDVIVTMTDLSKQADTLGYSAKIPKDSENTYLHNQRIGLISLKTDEVDLDFIY 121 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTL 184 L + + I + GAT+ H K I + + +P L Q I + I+ Sbjct: 122 WLLRTDYYQRYIASSSSGATVKHTSPKKIYSAKLLVPESLFVQQKIASILSGYDDLIENN 181 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + E + + + G V K++G+ P+ W + L Sbjct: 182 LKRIKLLEEKAQLTYEEWFVRMKFPGHESVVINKETGL------PEGWRITKLNKLSGVN 235 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDK 302 ++ K E +I + + +IV G+I++ + Sbjct: 236 SKNIEKTYEGDIKYIDIKGVSPNSIDSLTEYSIVDAPGRAKRIVKHGDIIWSCVRPNRRS 295 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFED 361 ++ + I ++ + + P + ++YL + + + + G ++K + Sbjct: 296 HAVVW-KPESNWIASTGFCVISPKKLPTSYLYYFLTTNSFVGYLTNLAGGAAYPAVKADH 354 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 K ++VP +I + + + L+ +Q LKE R + + G I++ Sbjct: 355 FKTAEIVVPK----DEIVKAFDEKFEKSLELIWNFKQQNQFLKEARDILLPRLMAGMINV 410 Query: 422 RG 423 Sbjct: 411 ED 412 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 65/183 (35%), Gaps = 11/183 (6%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTGKYLPKDGN 69 K++G +P+ W++ + + + +N+ ++ + DI YI ++ V + L + + Sbjct: 215 KETG------LPEGWRITKLNKLSGVNSKNIEKTYEGDIKYIDIKGVSPNSIDSLTEY-S 267 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + I G I++ + P R + + I ST F V+ PK + L Sbjct: 268 IVDAPGRAKRIVKHGDIIWSCVRPNRRSHAVVWKPESNWIASTGFCVISPKKLPTSYLYY 327 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L + + + GA + +P EK I Sbjct: 328 FLTTNSFVGYLTNLAGGAAYPAVKADHFKTAEIVVPKDEIVKAFDEKFEKSLELIWNFKQ 387 Query: 187 ERI 189 + Sbjct: 388 QNQ 390 >gi|220906631|ref|YP_002481942.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 7425] gi|219863242|gb|ACL43581.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7425] Length = 572 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 82/470 (17%), Positives = 144/470 (30%), Gaps = 92/470 (19%) Query: 24 HWKVVPIKRFTKLN--TGRTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS--TV 78 W+ + + G+T ++ I I ++V G + PK+ S Q+ T Sbjct: 87 GWQWERLGNLARFIDYRGKTPLKTDSGIKLITAKNVRMGFLQDEPKEYISEQTYYEWMTR 146 Query: 79 SIFAKGQILYGKLGPYLRKA-IIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQR 136 +G IL+ P A ++ D + + + LQP L L L S + + Sbjct: 147 GFPRRGDILFTTEAPLGNVAQLLIDERIALAQRIIDLQPFADLYARYLLTALTSPLMQRL 206 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI--------------- 181 + G T + IPMPIPPLAEQ I EK + Sbjct: 207 LNEKATGMTAQGIKSVKLKLIPMPIPPLAEQKRIVEKCDRLLILCDEIEKRQQQRQESLL 266 Query: 182 ----------------------DTLITERIRFIELLKEK----KQALVSYIVTKGLNPDV 215 I + + E +QA++ V L Sbjct: 267 KMNEGAIFQLLTAQNPDDFYYHWQAICNNFDLLYSIPETIPKLRQAILQLAVQGKLVQQS 326 Query: 216 KMKDSGIEWVGL--------VPDHWEVKPFFALVTELNRK-------------------- 247 + S + VG P + K + K Sbjct: 327 FDEKSLKDLVGQIQEERFALNPSEKDQKRIREEFNGIIYKFQQGNIKTLEMPAICFCNFI 386 Query: 248 --------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYE------TYQIVDPGEIVF 293 N L E +I L NI+ S IV PG+I+ Sbjct: 387 TKGTTPASNELLPEGDIPYLKVYNIVNNRIDFFYKPSYISNIVHTTKLKRSIVFPGDILM 446 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + K ++ +E I + + + + + +L + + S+ + G Sbjct: 447 NIVGPPLGKVAIVPDDFLEWNINQALAVFRPVNSVYNKFLYYALSSFATLENVLGETKGT 506 Query: 354 --RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + +L E + L V + + EQ I ++ + D L +K++ + Sbjct: 507 AGQDNLSLEQCRSLRVPLYDLAEQKRIVAKVDALLSLCDALEDKLKAARD 556 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 57/162 (35%), Gaps = 3/162 (1%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++ G + + + + T G+I+F + L + + + Sbjct: 121 NVRMGFLQDEPKEYISEQTYYEWMTRGFPRRGDILFTTEAPLGNVAQLLIDERI--ALAQ 178 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF 376 + + YL + S + ++ +G Q +K +K +P+ +PP+ EQ Sbjct: 179 RIIDLQPFADLYARYLLTALTSPLMQRLLNEKATGMTAQGIKSVKLKLIPMPIPPLAEQK 238 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I + D + ++ +Q L + I +T Q Sbjct: 239 RIVEKCDRLLILCDEIEKRQQQRQESLLKMNEGAIFQLLTAQ 280 >gi|148378679|ref|YP_001253220.1| type I restriction enzyme S subunit [Clostridium botulinum A str. ATCC 3502] gi|153932499|ref|YP_001383063.1| putative type I restriction-modification system, S subunit [Clostridium botulinum A str. ATCC 19397] gi|153934972|ref|YP_001386612.1| putative type I restriction-modification system, S subunit [Clostridium botulinum A str. Hall] gi|148288163|emb|CAL82231.1| type I restriction enzyme S subunit [Clostridium botulinum A str. ATCC 3502] gi|152928543|gb|ABS34043.1| putative type I restriction-modification system, S subunit [Clostridium botulinum A str. ATCC 19397] gi|152930886|gb|ABS36385.1| putative type I restriction-modification system, S subunit [Clostridium botulinum A str. Hall] Length = 386 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 57/404 (14%), Positives = 132/404 (32%), Gaps = 40/404 (9%) Query: 29 PIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + + TG T KDI++I +D+ + + I Sbjct: 6 KLCELGDILTGNTPSKKNGEFYDTKDIMFIKPDDINNNITEIECSKEYISNKAEKKARII 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K +L +G K I + Q + + + + + QR+E+I Sbjct: 66 PKDSLLITCIGSI-GKIAINKEKSAFNQQINSIVHNEKIICSKYLAYVLMINKQRLESIS 124 Query: 142 EGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + + I E Q I + ID + EL+K + Sbjct: 125 NAPVVPIINKTQFSEFEVYIHEEKEIQEKIVNVLDKARSLIDKRKAQIEVLDELVKSR-- 182 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 ++ +K +G + K L +S I ++ Sbjct: 183 ---------FIDMFADLKGEKHLTLGE----------CTNFIDYRGKTPVLSDSGIRIIN 223 Query: 261 ----YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + ++ S+ PG+++F + + + + Sbjct: 224 AKSVGNGFFKYIDEYISEETFNSWMKRGFPVPGDVLFVTEGHTFGNICRIPSDLQKFAMG 283 Query: 317 TSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKE 374 + +++ +LA M++ +G Q ++ +++K++ + +P I+ Sbjct: 284 QRIITIQGNKEILNNAFLAQYMQTISFQIDIDKYKTGSSAQGIRSKELKKILIPIPQIEL 343 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q T+ +N ++D L ++E+S+ L++ +S + A G+ Sbjct: 344 QNQFTDFVN----QVDKLKFEMEKSLKELEDNFNSLMQRAFKGE 383 >gi|19881243|gb|AAM00852.1|AF486551_3 HsdS [Campylobacter jejuni] Length = 380 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 59/403 (14%), Positives = 125/403 (31%), Gaps = 26/403 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 +WK + ++ G++ +S + IG+ ++ G + K T I Sbjct: 2 NNWKKCKLGDIAEITMGQSPKSEFYNFDNIGMPFLQ-GNRTFGRKYPYFDTYCTEYKKIA 60 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 KG+IL+ P A+ D + K+ E L L ++ I Sbjct: 61 KKGEILFSVRAPV-GDINFANNDICIGRGLCSMNAKNGENEFLYYLLH--NLRSVIINNE 117 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ + + I + +P L EQ I + ID I + L+E Q Sbjct: 118 SGSVFGSVNKNDLQTIEILLPLLEEQRQIATIL----SSIDDKIELLHEQNKTLEELAQT 173 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L E+ + D ++ FA ++ I ++S Sbjct: 174 LFLNWFK------------DREFNSTISDFISMQNGFAFKSKDFIDYGNNGVIKIKNISN 221 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G + + ++ G+I+F + K + + + + M Sbjct: 222 GIVDIVNTDKISQNTINEVNNKFNINSGDILFAMTGAEIGKMGIVPSTNKKLWLNQRVGM 281 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + S + ++++ D++ P + +E I + Sbjct: 282 VKERFLGARFLAYIHLTSEFGYDYVINSATGSAQENISATDIENCPFVKLTSEE---IVS 338 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + ++ + I L+ R I + G+I + Sbjct: 339 YSKQLNDFFEKIIFNL-GEIQALENMRDILIPKLLNGEIKITN 380 >gi|312136019|ref|YP_004003357.1| restriction modification system DNA specificity domain-containing protein [Caldicellulosiruptor owensensis OL] gi|311776070|gb|ADQ05557.1| restriction modification system DNA specificity domain protein [Caldicellulosiruptor owensensis OL] Length = 409 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 70/420 (16%), Positives = 140/420 (33%), Gaps = 35/420 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 WK V ++N R G++ +I ++ VE T K S F Sbjct: 3 SEWKEVIFSEVIEINPNRELSKGQEYPFIDMQAVEPYTRKVSNIKFRKYNGSGSK---FK 59 Query: 83 KGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKD--VLPELLQGWLLSIDV 133 G L+ ++ P L G ST+FLV K+ + S ++ Sbjct: 60 NGDTLFARITPCLENGKTAYVKELKNGEKGFGSTEFLVFSGKEGVTDNLFVYYLSRSPEI 119 Query: 134 TQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + G + D + + +PPL EQ I + A + I Sbjct: 120 REYAVKNMIGTSGRQRVDKSCFNELRIKLPPLPEQQKIASILSAFDDK----IELNNEMN 175 Query: 193 ELLKEKKQALVSYIVTKGLNPD---VKMKDSGIE----WVGLVPDHWEVKPFFALVTELN 245 + L+E QA+ + P+ K SG E +G +P W+V ++ + Sbjct: 176 KTLEEIAQAIFKHWFIDFEFPNENGEPYKSSGGEFVDSELGPIPKGWKVVKLREILDNIC 235 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV--DPGEIVFRFIDLQNDKR 303 E L +I+++ K ++ +I+ + + K Sbjct: 236 DSVKPGKEIEGLPYVPIDIVERKSIALKQFKSWEEAKSSLIKFKKDDILLGAMRVYFHKV 295 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ--SLKFED 361 S+ + + R ++ D +Y L+ D K A G ++ Sbjct: 296 SIAPCEGVTRKTC---FVLRPKKRFDLSYTLLLIFQDDTIKFADAHSKGTTMPYAVWDNG 352 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + +P K + ++ ++I + + L + R + + ++G+I + Sbjct: 353 LAEMKIALPTEKIRQRFNELLYPIISKIRDCIFENL----TLSQLRDTLLPKLISGEIRV 408 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 46/203 (22%), Positives = 82/203 (40%), Gaps = 10/203 (4%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLN--TGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG ++ +G IPK WKVV ++ + + + + + Y+ ++ VE + Sbjct: 203 YKSSGGEFVDSELGPIPKGWKVVKLREILDNICDSVKPGKEIEGLPYVPIDIVERKSIAL 262 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-E 122 K S + S++ F K IL G + Y K IA +G+ VL+PK Sbjct: 263 --KQFKSWEEAKSSLIKFKKDDILLGAMRVYFHKVSIAPCEGVTRKTCFVLRPKKRFDLS 320 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSH-ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + D + +A +G TM + G+ + + +P + E + +I Sbjct: 321 YTLLLIFQDDTIKFADAHSKGTTMPYAVWDNGLAEMKIALPTEKIRQRFNELLYPIISKI 380 Query: 182 DTLITERIRFIELLKEKKQALVS 204 I E + +L L+S Sbjct: 381 RDCIFENLTLSQLRDTLLPKLIS 403 >gi|218247761|ref|YP_002373132.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 8801] gi|218168239|gb|ACK66976.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8801] Length = 386 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 54/405 (13%), Positives = 129/405 (31%), Gaps = 38/405 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K + + K G++ + + GKYL G+ + + Sbjct: 7 KFIKLGNLIKFKYGKSLPNRERDP----------DGKYLVF-GSGGKIGLHNSYLTESPV 55 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I+ G+ G + T + V Q L + L+ +++ + AT Sbjct: 56 IVVGRKGSIGSTFYSDNPCWCIDTTYYVDQFSSNLYSKYLYYFLNTL---KLDRLNRAAT 112 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT---LITERIRFIELLKEKKQAL 202 + + +PIP L + RI++ I +E ++ L Sbjct: 113 IPGLSRDDLYTFSIPIPYPNNPKLSLDIQQRIVARIESLFGEIKRNRLLLEQMRLDNDLL 172 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + + + + + ++ + P + N + + Sbjct: 173 LPNALDEVVERLDSKRQTLLDVIQEKPRNGWSPKC---------DNDPNGVPVLKLGAVL 223 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + P + ++ G+I+ + + + I + Sbjct: 224 RFQYNPDEIKRTSLPTDENAHYWLEAGDILISRSNTLDLVGHASIYSGIPYPCIYPDLIM 283 Query: 323 VK---PHGIDSTYLAWLMRSYDLCKVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQF 376 P+ DS +L + ++S ++ SG + +K E V +P + ++EQ Sbjct: 284 RFRVNPNKADSKFLMYWLQSKEVRHYIQTNASGASPTMKKIKQETVCNIPFPIISLEEQS 343 Query: 377 DITNVIN---VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ E +I+ ++E+ EQ+ L+ + + A G+ Sbjct: 344 YFAYHLDAIQQEVNKINRIIEEDEQNFKYLE---QAILEKAFRGE 385 >gi|283778920|ref|YP_003369675.1| restriction modification system DNA specificity domain-containing protein [Pirellula staleyi DSM 6068] gi|283437373|gb|ADB15815.1| restriction modification system DNA specificity domain protein [Pirellula staleyi DSM 6068] Length = 421 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 64/435 (14%), Positives = 138/435 (31%), Gaps = 40/435 (9%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYL 64 P +K + IG IP W + + K+ + + + + +V+S Sbjct: 5 PGFKMTE---IGEIPAEWNAYHLSQLWKVTDCKHVTATFVPEGYPVASIREVQSKFVNLH 61 Query: 65 PKDGNSRQSD---TSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDV 119 + + G ++ + + A ++ +L+ Sbjct: 62 AANHTTPHFYRLLIEGGRDPQAGDLILSRNATVGQIAQVSHSHPKFAMGQDVCLLRKTSP 121 Query: 120 LPELLQGW--LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 S + Q+I I G+T + K I +P P AEQ I + Sbjct: 122 TNSTEFIQAVFQSRIIKQQISDILVGSTFKRINVKQIKAFIVPSPSAAEQRAIAGALSDV 181 Query: 178 TVRIDTLITERIRFIELLKEKKQALVS--YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 I++L + + + Q L++ + K + I W P + + Sbjct: 182 DALIESLEQLIAKKRAIKQGAMQELLTGKRRLPGFSGKWEKKRLQQIAWYQEGPGVQKHQ 241 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 LN N E + + E + + D G+IV Sbjct: 242 FASVGTKLLNGSNISHGELF---------LDQTERYIEDQLANGTYRHFLCDAGDIVIAS 292 Query: 296 IDLQ----NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG 350 + N+K ++ A + + TS + + + ++ M Sbjct: 293 SGISPATLNEKMAIVQASHLPLCMNTSTIRFKANQDLATQAFLFVCLQGNSFRDQIAGMA 352 Query: 351 SG-LRQSLKFEDVKRLPVLVPPIKEQFDITNV---INVETARIDVLVEKIEQSIVLLKER 406 +G + + + ++ +L+P I EQ + +V + E ++D + L+ Sbjct: 353 TGSAQLNFGPSHLNKVELLLPTISEQVAVADVIGSLEHELRKLDD-------RLTKLRLL 405 Query: 407 RSSFIAAAVTGQIDL 421 + + + +TG+I L Sbjct: 406 KQAMMQQLLTGKIRL 420 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 60/211 (28%), Gaps = 13/211 (6%) Query: 224 WVGLVPDHWEVKPFF--ALVTELNRKNTKLIESNILSLSYGNIIQKL------ETRNMGL 275 +G +P W VT+ + S + K Sbjct: 11 EIGEIPAEWNAYHLSQLWKVTDCKHVTATFVPEGYPVASIREVQSKFVNLHAANHTTPHF 70 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E + G+++ + + + + ++ Sbjct: 71 YRLLIEGGRDPQAGDLILSRNATVGQIAQVSHSHPKFAMGQDVCLLRKTSPTNSTEFIQA 130 Query: 336 LMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + +S + + + + + + +K V P EQ I ++ D L+E Sbjct: 131 VFQSRIIKQQISDILVGSTFKRINVKQIKAFIVPSPSAAEQRAIAGALSDV----DALIE 186 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +EQ I + + + +TG+ L G S Sbjct: 187 SLEQLIAKKRAIKQGAMQELLTGKRRLPGFS 217 >gi|260771740|ref|ZP_05880659.1| type I restriction-modification system specificity subunit S [Vibrio metschnikovii CIP 69.14] gi|260613324|gb|EEX38524.1| type I restriction-modification system specificity subunit S [Vibrio metschnikovii CIP 69.14] Length = 405 Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats. Identities = 51/399 (12%), Positives = 110/399 (27%), Gaps = 23/399 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W P+ +L +G T + I +V++G + D + S Sbjct: 18 EWVEKPLNHEVELFSGLTYSPKDIRKQGVFVIRSSNVKNGQI--VQADNVYVNPEVVNCS 75 Query: 80 IFAKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 KG I+ G + A + + + PE + + T Sbjct: 76 NVQKGDIIVVVRNGSRALIGKHAQVNSLMDNTVIGAFMTGVRAGHPEFINALFDTDKFTA 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++E GAT++ + P EQ I I+ + + + Sbjct: 136 QVEKNL-GATINQITNGAFNGMVFMFPEGQEQTAIGNTFQKLDSLINQHQKKHDKLSNIK 194 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K + + P+++ K EWV +H L + L + + Sbjct: 195 KAMLEKMFPK--PGETTPEIRFKGFSGEWVEKPLNHEV-----ELFSGLTYSPKDIRKQG 247 Query: 256 ILSLSYGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + N+ + + V G+I+ + + Sbjct: 248 VFVIRSSNVKNGQIVQADNVYVNPEVVNCSNVQKGDIIVVVRNGSRALIGKHAQVNSLMD 307 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 ++ L + + + + P +E Sbjct: 308 NTVIGAFMTGVRAGHPEFINALFDTDKFTAQVEKNLGATINQITNGAFNGMVFMFPEGQE 367 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I N ++D L+ + +Q I L + + ++ Sbjct: 368 QTAIGN----TFQKLDSLINQHQQQITKLNNIKQACLSK 402 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 21/202 (10%), Positives = 53/202 (26%), Gaps = 10/202 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR- 271 P+++ K EWV +H L + L + + + + N+ + Sbjct: 8 PEIRFKGFSGEWVEKPLNHEV-----ELFSGLTYSPKDIRKQGVFVIRSSNVKNGQIVQA 62 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + V G+I+ + + Sbjct: 63 DNVYVNPEVVNCSNVQKGDIIVVVRNGSRALIGKHAQVNSLMDNTVIGAFMTGVRAGHPE 122 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 ++ L + + + + P +EQ I N ++D Sbjct: 123 FINALFDTDKFTAQVEKNLGATINQITNGAFNGMVFMFPEGQEQTAIGN----TFQKLDS 178 Query: 392 LVEKIEQSIVLLKERRSSFIAA 413 L+ + ++ L + + + Sbjct: 179 LINQHQKKHDKLSNIKKAMLEK 200 >gi|328675907|gb|AEB28582.1| Type I restriction-modification system, specificity subunit S [Francisella cf. novicida 3523] Length = 438 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 53/391 (13%), Positives = 117/391 (29%), Gaps = 29/391 (7%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 ++ G K + YI E+V +L D + + + + ++ Sbjct: 64 GTIKNIHYGDIHTKYKSMFYISDEEV-----PFLSNDIDITKIKDQSYCMVK--DLIIAD 116 Query: 91 LGPYLRKAII---------ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + T + S + ++ Sbjct: 117 ASEDYKDIGKAIEIIDLEDQKLVAGLHTYIARDLNNLTYLGFSGYLMQSYKIRSQMMKYA 176 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G ++ + I + +P L EQ I + + I+ L + K Q Sbjct: 177 TGISVLGLSKTSLSKIKINLPTLPEQQKIADCLSTWDDSIENLKSLIENKKLYKKGMMQK 236 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L S + + + +G + + +E + + S Sbjct: 237 LFSQELRFKADDGSNYPAWVEKKLGEMGNITTGSTPSTKNSEYYGGDKLFVSP-----SD 291 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 N + ++ N L ++ + V G + F I K S Q+ + + Sbjct: 292 INSSRYIKRTNTTLTELGFKKGRKVSKGSVCFVCIGSTIGKVS----QLTQDSLTNQQIN 347 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + + +S + + Y+ K+ G + D RL L P ++EQ I N Sbjct: 348 CITANSNNSNEFTYSLLEYNADKIKLLAGEQAVPQINKSDFSRLKFLTPCLQEQTKIANF 407 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++ +D +E + Q + L+ ++ + Sbjct: 408 LSA----LDDEIELLGQELEQLQLQKKGLMQ 434 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 81/218 (37%), Gaps = 18/218 (8%) Query: 213 PDVKMKDSGIEWV----GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 P ++ K+ EW+ G V ++ F + K I + Y ++ Sbjct: 26 PKLRFKEFSEEWLEKEFGSVYSFFQTNSFSRSLLNYENGTIKNIHYGDIHTKYKSMFYIS 85 Query: 269 E------TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + ++ + ++Y +V I D ++ +++ + ++ ++ + Sbjct: 86 DEEVPFLSNDIDITKIKDQSYCMVKDLIIADASEDYKDIGKAIEIIDLEDQKLVAGLHTY 145 Query: 323 VKPHGIDSTYLA---WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDI 378 + + TYL +LM+SY + +G+ L + ++ + +P + EQ I Sbjct: 146 IARDLNNLTYLGFSGYLMQSYKIRSQMMKYATGISVLGLSKTSLSKIKINLPTLPEQQKI 205 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + ++ I+ L IE K + + + Sbjct: 206 ADCLSTWDDSIENLKSLIENK----KLYKKGMMQKLFS 239 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 55/185 (29%), Gaps = 8/185 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W + + TG T + D +++ D+ S + + + Sbjct: 255 WVEKKLGEMGNITTGSTPSTKNSEYYGGDKLFVSPSDINS-SRYIKRTNTTLTELGFKKG 313 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +KG + + +G + K D + + Q + + L +I+ Sbjct: 314 RKVSKGSVCFVCIGSTIGKVSQLTQDSLTNQQINCITANS-NNSNEFTYSLLEYNADKIK 372 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + + + P L EQ I + A I+ L E + K Sbjct: 373 LLAGEQAVPQINKSDFSRLKFLTPCLQEQTKIANFLSALDDEIELLGQELEQLQLQKKGL 432 Query: 199 KQALV 203 Q + Sbjct: 433 MQGMF 437 >gi|168183360|ref|ZP_02618024.1| putative type I restriction-modification system, S subunit [Clostridium botulinum Bf] gi|237793996|ref|YP_002861548.1| putative type I restriction-modification system, S subunit [Clostridium botulinum Ba4 str. 657] gi|182673511|gb|EDT85472.1| putative type I restriction-modification system, S subunit [Clostridium botulinum Bf] gi|229261943|gb|ACQ52976.1| putative type I restriction-modification system, S subunit [Clostridium botulinum Ba4 str. 657] Length = 386 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 58/404 (14%), Positives = 134/404 (33%), Gaps = 40/404 (9%) Query: 29 PIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + ++ TG T KDI++I +D+ + + I Sbjct: 6 KLCELGEILTGNTPSKKNGEFYDTKDIMFIKPDDINNNITEIECSKEYISNKAEKKARII 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K +L +G K I + Q + + + + + QR+E+I Sbjct: 66 PKDSLLITCIGSI-GKIAINKEKSAFNQQINSIVHNEKIISSKYLAYVLMINKQRLESIS 124 Query: 142 EGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + + I E Q I + ID + F EL+K + Sbjct: 125 NAPVVPIINKTQFSEFEVYIHEEKEIQEKIVNVLDKARSLIDKRKAQIEVFDELVKSR-- 182 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 ++ +K +G + K L +S I ++ Sbjct: 183 ---------FIDMFANLKGEKHLTLGE----------CTNFIDYRGKTPVLSDSGIRIIN 223 Query: 261 ----YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + ++ S+ PG+++F + + + + Sbjct: 224 AKSVGNGFFKYIDEYISEETFNSWMKRGFPVPGDVLFVTEGHTFGNICRIPSDLQKFAMG 283 Query: 317 TSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKE 374 + +++ +LA M++ +G Q ++ +++K++ + +P I+ Sbjct: 284 QRIITIQGNKEILNNAFLAQYMQTISFQIDIDKYKTGSSAQGIRSKELKKILIPIPQIEL 343 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q T+ +N ++D L ++E+S+ L++ +S + A G+ Sbjct: 344 QNQFTDFVN----QVDKLKFEMEKSLKELEDNFNSLMQRAFKGE 383 >gi|86149261|ref|ZP_01067492.1| HsdS [Campylobacter jejuni subsp. jejuni CF93-6] gi|88596768|ref|ZP_01100005.1| HsdS [Campylobacter jejuni subsp. jejuni 84-25] gi|121612440|ref|YP_001001191.1| hypothetical protein CJJ81176_1536 [Campylobacter jejuni subsp. jejuni 81-176] gi|167006083|ref|ZP_02271841.1| HsdS [Campylobacter jejuni subsp. jejuni 81-176] gi|218563144|ref|YP_002344923.1| putative type I restriction enzyme S protein [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|19881204|gb|AAM00820.1|AF486544_3 HsdS [Campylobacter jejuni] gi|19881210|gb|AAM00825.1|AF486545_3 HsdS [Campylobacter jejuni] gi|19881237|gb|AAM00847.1|AF486550_3 HsdS [Campylobacter jejuni] gi|19881285|gb|AAM00887.1|AF486558_3 HsdS [Campylobacter jejuni subsp. jejuni 81-176] gi|19881289|gb|AAM00890.1|AF486559_1 HsdS [Campylobacter jejuni] gi|19881291|gb|AAM00891.1|AF486560_1 HsdS [Campylobacter jejuni] gi|19881293|gb|AAM00892.1|AF486561_1 HsdS [Campylobacter jejuni] gi|19881295|gb|AAM00893.1|AF486562_1 HsdS [Campylobacter jejuni] gi|19881297|gb|AAM00894.1|AF486563_1 HsdS [Campylobacter jejuni] gi|19881301|gb|AAM00896.1|AF486565_1 HsdS [Campylobacter jejuni] gi|19881303|gb|AAM00897.1|AF486566_1 HsdS [Campylobacter jejuni] gi|19881306|gb|AAM00898.1|AF486568_1 HsdS [Campylobacter jejuni] gi|19881308|gb|AAM00899.1|AF486569_1 HsdS [Campylobacter jejuni] gi|85840043|gb|EAQ57301.1| HsdS [Campylobacter jejuni subsp. jejuni CF93-6] gi|87249550|gb|EAQ72509.1| HsdS [Campylobacter jejuni subsp. jejuni 81-176] gi|88191609|gb|EAQ95581.1| HsdS [Campylobacter jejuni subsp. jejuni 84-25] gi|112360850|emb|CAL35651.1| putative type I restriction enzyme S protein [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|284926750|gb|ADC29102.1| putative type I restriction enzyme S protein [Campylobacter jejuni subsp. jejuni IA3902] gi|315926726|gb|EFV06104.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni DFVF1099] gi|315929698|gb|EFV08873.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni 305] Length = 380 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 59/403 (14%), Positives = 125/403 (31%), Gaps = 26/403 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 +WK + ++ G++ +S + IG+ ++ G + K T I Sbjct: 2 NNWKKCKLGDIAEITMGQSPKSEFYNFDNIGMPFLQ-GNRTFGRKYPYFDTYCTEYKKIA 60 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 KG+IL+ P A+ D + K+ E L L ++ I Sbjct: 61 KKGEILFSVRAPV-GDINFANNDICIGRGLCSMNAKNGENEFLYYLLH--NLRSVIINNE 117 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ + + I + +P L EQ I + ID I + L+E Q Sbjct: 118 SGSVFGSVNKNDLQTIEILLPLLEEQRQIATIL----SSIDDKIELLHEQNKTLEELAQT 173 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L E+ + D ++ FA ++ I ++S Sbjct: 174 LFLNWFK------------DREFNSTISDFISMQNGFAFKSKDFIDYGNNGVIKIKNISN 221 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G + + ++ G+I+F + K + + + + M Sbjct: 222 GIVDIVNTDKISQNTINEVNNKFNINSGDILFAMTGAEIGKMGIVPSTNKKLWLNQRVGM 281 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + S + ++++ D++ P + +E I + Sbjct: 282 VKERFLGARFLAYIHLTSEFGYDYVINSATGSAQENISATDIENCPFVKLTSEE---IVS 338 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + ++ + I L+ R I + G+I + Sbjct: 339 YSKQLNDFFEKIIFNL-GEIQTLENMRDILIPKLLNGEIKITN 380 >gi|293417766|ref|ZP_06660388.1| type I restriction modification system specificity protein [Escherichia coli B185] gi|291430484|gb|EFF03482.1| type I restriction modification system specificity protein [Escherichia coli B185] Length = 420 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 63/405 (15%), Positives = 137/405 (33%), Gaps = 49/405 (12%) Query: 26 KVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAK 83 + ++ T + + + E + YI L V+ T K + + + + + Sbjct: 22 EWKTLEDITLRTSNIKWREVIRSYRYIDLTSVDIATKKITETTEITKNNAPSRAQKLVDE 81 Query: 84 GQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIE 138 +++ P ++ + D + + ST + +L+ K LP+ + W+ + D + +E Sbjct: 82 NDVIFATTRPTQQRFCLIDSEYAGEVASTGYCILRAKQDQVLPKWILHWISTSDFKKHVE 141 Query: 139 AICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRF 191 G+ + +PIP LA Q I + + T L E Sbjct: 142 ENQSGSAYPAISDSKVKECLIPIPCPDNPEKSLAIQSEIVQILDKFTALTAELTAELNMR 201 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNT 249 + + L+S K+ +EW +G + + K K+ Sbjct: 202 KKQYNYYRDQLLS------------FKEGEVEWKALGEIGEVRMCKRIL--------KSQ 241 Query: 250 KLIESNILSLSYGNIIQKLETRNM-GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 E I G ++ ++ L E E Y GE++ Sbjct: 242 TSSEGEIPFYKIGTFGKEPDSYISRKLFNEFKEKYSYPKVGEVLISASGTIGRTVIF--- 298 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + + + +L Y + K + G G + L +++++L + Sbjct: 299 -DGRESYFQDSNIVWIENNEKIVLNKYLFYFYKIAKWGISEG-GTIKRLYNDNLRKLMIP 356 Query: 369 VP-------PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 VP + EQ I +++ A + + E + + I L +++ Sbjct: 357 VPFPDSPERSLVEQQKIVKLLDKFDALTNSITEGLPREIELRQKQ 401 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 49/153 (32%), Gaps = 9/153 (5%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHG 327 ET + ++VD +++F + L ++ T + K Sbjct: 62 ETTEITKNNAPSRAQKLVDENDVIFATTRPTQQRFCLIDSEYAGEVASTGYCILRAKQDQ 121 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVP-------PIKEQFDIT 379 + ++ + + D K SG ++ VK + +P + Q +I Sbjct: 122 VLPKWILHWISTSDFKKHVEENQSGSAYPAISDSKVKECLIPIPCPDNPEKSLAIQSEIV 181 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +++ TA L ++ R ++ Sbjct: 182 QILDKFTALTAELTAELNMRKKQYNYYRDQLLS 214 >gi|120435037|ref|YP_860723.1| type I restriction-modification system DNA specificity subunit [Gramella forsetii KT0803] gi|117577187|emb|CAL65656.1| type I restriction-modification system DNA specificity subunit [Gramella forsetii KT0803] Length = 418 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 50/388 (12%), Positives = 122/388 (31%), Gaps = 26/388 (6%) Query: 40 RTSESGKDIIYIGLEDVESGTGKYLPKDGNS----RQSDTSTVSIFAKGQILYGKLGPYL 95 +T + + + + G ++ N + + G + L + Sbjct: 41 KTVSNKNHNSELPILAITQDQGAIPREEINYHVSVSKKSVEGYKVVEVGDFIIS-LRSFQ 99 Query: 96 RKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRIEAICEGATM-SHADWK 152 ++ GICS +++ + K++L + + + + EG +K Sbjct: 100 GGIEYSNHLGICSPAYIILRRKKKNLLNLFYKQYFKTDVFISHLNKNLEGIRDGKMVSYK 159 Query: 153 GIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 I +P P EQ I + I + I I + + Q L N Sbjct: 160 QFSEIKIPQPQTQEQQKIADCIASLDELILGFIEKLEALKRHKRGLMQNLFPQDGLNVPN 219 Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN----IIQKL 268 + EW E + ++ KN + S++ + ++ Sbjct: 220 YRFAEFKNDKEW--------EKTTLGKITNVISNKNKDNKNLPVYSINNKDGFLPQSEQF 271 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + N + Y+I++ + + S + I + + + Sbjct: 272 DDMNSKRRGYDISLYKIIEKNTFAYNPARINVGSIG-YSGNLNNILISSLYVCFKTENIV 330 Query: 329 DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D +L + + K+ G+R L +++ ++ + +P ++EQ I + Sbjct: 331 DDKFLNQYLETPYFLKLVNRNTEGGIRSYLFYKNFSKITISLPSMQEQQKIATCLTSM-- 388 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAV 415 D L+ + I +++ + + Sbjct: 389 --DDLISAQQNKIAQIEQHKKGLLQGLF 414 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 62/187 (33%), Gaps = 7/187 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVES---GTGKYLPKDGNSRQSDTSTVS 79 K W+ + + T + + + + K++ + + + + ++ + R D S Sbjct: 229 KEWEKTTLGKITNVISNKNKD-NKNLPVYSINNKDGFLPQSEQFDDMNSKRRGYDISLYK 287 Query: 80 IFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I K Y + + + I S V + L +L + + Sbjct: 288 IIEKNTFAYNPARINVGSIGYSGNLNNILISSLYVCFKTENIVDDKFLNQYLETPYFLKL 347 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + EG S+ +K I + +P + EQ I + + I + + + K Sbjct: 348 VNRNTEGGIRSYLFYKNFSKITISLPSMQEQQKIATCLTSMDDLISAQQNKIAQIEQHKK 407 Query: 197 EKKQALV 203 Q L Sbjct: 408 GLLQGLF 414 >gi|294101457|ref|YP_003553315.1| restriction modification system DNA specificity domain protein [Aminobacterium colombiense DSM 12261] gi|293616437|gb|ADE56591.1| restriction modification system DNA specificity domain protein [Aminobacterium colombiense DSM 12261] Length = 389 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 55/405 (13%), Positives = 126/405 (31%), Gaps = 23/405 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ V + + G++ +S G + T+ I Sbjct: 4 NEWREVKLGEIVDIEMGQSPKSEFYNTEGLGVPFLQGNKTFGMIYPKFDVFCTNVKKIAI 63 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 + IL P IA ++ K+ L L + ++ Sbjct: 64 QNDILMSVRAPV-GDLNIAQEKICIGRGICAMRMKNRNNLYLFYLLKHN--VKNLKKTES 120 Query: 143 GATMSHADWKGIGNIP-MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + K I + M P L EQ I + + I R + L+E QA Sbjct: 121 GTVFGGVNKKDIMGLSVMWTPNLQEQKTIAATLSCLDDK----IELNNRINKTLEEMAQA 176 Query: 202 LV-SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + S+ V + + DS +G +P W V ++ + K L + Sbjct: 177 IFKSWFVDFEPFQNGEFIDS---ELGKIPKGWRVGTLDEIIELFDSKRIPLSSRKREKMQ 233 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + ++ ++ + + + + + A+ Sbjct: 234 KVYPYYGATSLMDYVDDYIFDGVYVLLGED----GTVIDGKGYPILQYVWGKFWVNNHAH 289 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + +G L L+++ ++ + ++ + ++K + V++P + I Sbjct: 290 VLKGKNGFSEESLYILLKNTNVKSIV---TGAVQLKINQSNLKSVKVIIPSVD---KIAE 343 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 N AR ++ + +L R + + ++G++ + E Sbjct: 344 F-NYLIARFFAEKRRLSEENQILISVRDALLPKLMSGEVRVPIEE 387 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 62/205 (30%), Gaps = 19/205 (9%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 ++ DS +G IPK W+V + +L + K P G Sbjct: 192 EFIDSE---LGKIPKGWRVGTLDEIIELFDSKRIPLSSRK--------REKMQKVYPYYG 240 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPEL 123 + D IF +L G+ G + + VL+ K+ E Sbjct: 241 ATSLMDYVDDYIFDGVYVLLGEDGTVIDGKGYPILQYVWGKFWVNNHAHVLKGKNGFSEE 300 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 LL +++I GA + + ++ + IP + + I Sbjct: 301 SLYILLKN---TNVKSIVTGAVQLKINQSNLKSVKVIIPSVDKIAEFNYLIARFFAEKRR 357 Query: 184 LITERIRFIELLKEKKQALVSYIVT 208 L E I + L+S V Sbjct: 358 LSEENQILISVRDALLPKLMSGEVR 382 >gi|153824634|ref|ZP_01977301.1| type I restriction-modification system, S subunit, putative [Vibrio cholerae MZO-2] gi|149741852|gb|EDM55881.1| type I restriction-modification system, S subunit, putative [Vibrio cholerae MZO-2] Length = 585 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 60/473 (12%), Positives = 121/473 (25%), Gaps = 92/473 (19%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W + + + + +G T + I + ++++ + Sbjct: 100 DLPNGWSWIRLNEYGEWGSGSTPKRSNSEYYDGGIPWFKSGELKADYISESEETITELAL 159 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 ++V G +L G + K I + P L L Sbjct: 160 SETSVRYNNVGDVLVAMYGATIGKTAILSVRATTNQAVCACTPFTGLSN-TYLLTLLKAY 218 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR------------- 180 R+ + G + + I + +P AEQ I K+ Sbjct: 219 KARLIGMGAGGAQPNISREKIITTVIALPSTAEQRRIVAKVDELMALCDQLEQQTEDSIE 278 Query: 181 ----------------------------IDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 I + + KQ ++ V L Sbjct: 279 AHQVLVTTLLDTLTNSADADELMQNWARISEHFDTLFTTEASIDQLKQTILQLAVMGKLV 338 Query: 213 PDVKMKDSGIEWV-------------------------------GLVPDHWEVKPFFALV 241 P + E + +P WE +V Sbjct: 339 PQDPTDEPASELLKRIAEEKAQLVKEKKIKKEKTLPPIAEDEKPFELPSGWEWCRLEDVV 398 Query: 242 TELN-----RKNTKLIESNILSLSYGNIIQKLETRNMGLK---PESYETYQIVDPGEIVF 293 + RK I LS N+ + N + P V+ G+++ Sbjct: 399 DIQSGITKGRKLAGRELKTIPYLSVANVQRGYLILNNVKEIDLPIDELEKYSVEDGDLLI 458 Query: 294 RFIDLQ-NDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVF--YA 348 R+ + + +P +L + K F + Sbjct: 459 TEGGDWDKVGRTAIWRSEVPYMAHQNHVFKARPFLKEQSEAWLEMYLNGPFARKYFAGSS 518 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + S+ ++ + VPP E+ +I++ + D+L+E I S+ Sbjct: 519 KQTTNLASINKTQLRSCLIAVPPRDEKKEISDRVQELIGMCDLLLEGIRASLQ 571 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 59/190 (31%), Gaps = 12/190 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 S E +P+ W R N++ + I G + + + Sbjct: 93 SDEEKPFDLPNGWSWIRLNEYGEWGSGSTPKRSNSEYYDGGIPWFKSGELKADYISESEE 152 Query: 275 LKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 E S + + + G+++ K ++ S R A A P S Sbjct: 153 TITELALSETSVRYNNVGDVLVAMYGATIGKTAILSV----RATTNQAVCACTPFTGLSN 208 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + ++ G + ++ E + + +P EQ I ++ A D Sbjct: 209 TYLLTLLKAYKARLIGMGAGGAQPNISREKIITTVIALPSTAEQRRIVAKVDELMALCDQ 268 Query: 392 LVEKIEQSIV 401 L ++ E SI Sbjct: 269 LEQQTEDSIE 278 >gi|189345678|ref|YP_001942207.1| N-6 DNA methylase [Chlorobium limicola DSM 245] gi|189339825|gb|ACD89228.1| N-6 DNA methylase [Chlorobium limicola DSM 245] Length = 846 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 59/404 (14%), Positives = 118/404 (29%), Gaps = 60/404 (14%) Query: 25 WKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W +V + TG T S G + ++ D+ K + + S Sbjct: 456 WPIVSLDEICTFMTGGTPTSTIAEYYEGGTVPWLVSGDIHGFEIMACEKRITQKAVENSN 515 Query: 78 VSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDV 133 + K +L G A++ C+ + + P + + Sbjct: 516 AKVLPKDSVLIALNGQGKTRGTVALLRMTGATCNQSLVAITPAPPPRAISEFIFWALRSM 575 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I A+ S + + NI +P+PPL Q + +I Sbjct: 576 YSDIRALTGDTERSGLNIPILKNIQIPLPPLEVQKEVVAEI------------------- 616 Query: 194 LLKEKKQALVS--YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 E Q +++ V P + + W + P RK+ Sbjct: 617 ---EGYQNVINGARAVLDNYRPHIPIHP-----------DWPMVPLGEACVVNPRKSEVA 662 Query: 252 IESNILSLSYGNIIQ------KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 +S+ + E ++ E +Y G+++ + + Sbjct: 663 DHVGTTVVSFVPMSDVGEHEMFFELKDTKRLDEVTTSYTYFKDGDVLLAKVTPCFENGKA 722 Query: 306 RSAQVMERGI---ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFE 360 A+ + GI + Y+ + ++ + G+G Q + Sbjct: 723 GIARNLRNGIGFGSSEFYVLRPTGDLLPQWVFMFAATPSFRTWATPQMTGTGGLQRVPRS 782 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARI---DVLVEKIEQSIV 401 V+ + VPP+ Q I I E A + L+ + E+ I Sbjct: 783 VVENYQIPVPPLATQQAIVAEIEAEQALVAANRELIVRFEKKIQ 826 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 74/200 (37%), Gaps = 14/200 (7%) Query: 21 IPKH--WKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP H W +VP+ +N ++ + ++ + DV + KD Sbjct: 637 IPIHPDWPMVPLGEACVVNPRKSEVADHVGTTVVSFVPMSDVGEHEMFFELKDTKRLDEV 696 Query: 75 TSTVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDV-LPELLQGW 127 T++ + F G +L K+ P + + G S++F VL+P LP+ + + Sbjct: 697 TTSYTYFKDGDVLLAKVTPCFENGKAGIARNLRNGIGFGSSEFYVLRPTGDLLPQWVFMF 756 Query: 128 LLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + G + + N +P+PPLA Q I +I AE + Sbjct: 757 AATPSFRTWATPQMTGTGGLQRVPRSVVENYQIPVPPLATQQAIVAEIEAEQALVAANRE 816 Query: 187 ERIRFIELLKEKKQALVSYI 206 +RF + ++ + Sbjct: 817 LIVRFEKKIQSTLARIWGKA 836 >gi|295087102|emb|CBK68625.1| Restriction endonuclease S subunits [Bacteroides xylanisolvens XB1A] Length = 433 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 59/422 (13%), Positives = 128/422 (30%), Gaps = 30/422 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P+ WK + + ++G T S + I +I ++ S + + Sbjct: 13 PQGWKEITLAEVFNTSSGATPLSTEASYYENGTIPWINSGELASPYIYDTTNFISQAGFE 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ I+ +L G KA + + + + P + + Sbjct: 73 NSSTEIYPIDTVLVAMYGATAGKASLLKMEACTNQAICAILPNKDYSSTFLKYSIDTLY- 131 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G+ + + + + +PP + K+++ ID I + Sbjct: 132 DHLVGLSSGSARDNLSQAELKKLKLIMPPTKNEQ---NKLVSILASIDRKIELNQAINQN 188 Query: 195 LKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELN 245 L+ + L Y + P+ K SG E V +P WE K + N Sbjct: 189 LEAMAKQLYDYWFVQFDFPNEEGKPYKSSGGEMVWNEELKREIPALWETKEVADIANVYN 248 Query: 246 RKNTKLIESNILSLSYGNIIQKL------ETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 ++ I K + G + S Y + I + Sbjct: 249 GATPSTVDELNYGGDIVWITPKDLSDQKQKFIYQGERNISQVGYDSCSTHLLPSNTILMS 308 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + A + + P + + Y L ++ + + Sbjct: 309 SRAPIGLLAIAKNELCTNQGFKSFVPKYRNIAIYLYYYLQYHLRQIEQLGAGTTFKEVSR 368 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ED+ + PVL P I ++ + + +I++ L ++R + + GQ+ Sbjct: 369 EDIIKFPVLKPSDN----ILDLWEERVSAFNDKQLEIQKENENLTKQRDELLPLLMNGQV 424 Query: 420 DL 421 + Sbjct: 425 SV 426 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 57/164 (34%), Gaps = 17/164 (10%) Query: 10 YKDSGVQ--WIGA----IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVE 57 YK SG + W IP W+ + + G T + G DI++I +D+ Sbjct: 214 YKSSGGEMVWNEELKREIPALWETKEVADIANVYNGATPSTVDELNYGGDIVWITPKDLS 273 Query: 58 SGTGKYL---PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL 114 K++ ++ + D+ + + IL P IA + + F Sbjct: 274 DQKQKFIYQGERNISQVGYDSCSTHLLPSNTILMSSRAPI-GLLAIAKNELCTNQGFKSF 332 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIP 158 PK + + L + Q IE + G T + I P Sbjct: 333 VPKYRNIAIYLYYYLQYHLRQ-IEQLGAGTTFKEVSREDIIKFP 375 >gi|293400128|ref|ZP_06644274.1| restriction modification system DNA specificity domain protein [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291306528|gb|EFE47771.1| restriction modification system DNA specificity domain protein [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 358 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 59/359 (16%), Positives = 122/359 (33%), Gaps = 29/359 (8%) Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + ++ D S + KG ++Y + + ++ +DGI S + VL K + Sbjct: 19 EGKDNSSEDKSNYKVVRKGDMVYNSMRMWQGANGVSSYDGIVSPAYTVLTAKVSICNEYF 78 Query: 126 -GWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + +G T + + I I + +P +AEQ + ++ RI Sbjct: 79 AALFKNYKLINEFRKNSQGMTSDTWNLKYPQIETIKVYLPEVAEQEKVASMLVTLDKRIA 138 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 T + + + Q + + + ++ S I K + Sbjct: 139 AQATLVEQLKKYKRGVMQRIFRNMSMLSPSGFETVQLSAI-----------FKKISRRNS 187 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 KN + + + K + Y +++ G+ V+ Sbjct: 188 NEEIKNVITNSAEYGLIPQRDFFDKDIAVDGNT-----SNYYVIEHGDFVYNPRKSNTAP 242 Query: 303 RS-LRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS--LK 358 + ERGII+ Y V I+ +YLAW +S + Y GS + + Sbjct: 243 YGPFNRYEREERGIISPLYTCLVLQADIEPSYLAWYFKSDAWYRYIYDNGSQGVRHDRVS 302 Query: 359 FED--VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 D ++ +PV++P + Q I +++ +R + LK R + + Sbjct: 303 MTDGLLRGIPVIIPSKEAQLKIAKLLDCLESRF----QTELSQYESLKSIRVALLQQLF 357 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 58/192 (30%), Gaps = 11/192 (5%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 P ++ V + K++ ++E K++I E + KD +TS + Sbjct: 167 PSGFETVQLSAIFKKISRRNSNEEIKNVITNSAEYGLIPQRDFFDKDIAV-DGNTSNYYV 225 Query: 81 FAKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVL--PELLQGWLLSIDV 133 G +Y + GI S + L + + L + Sbjct: 226 IEHGDFVYNPRKSNTAPYGPFNRYEREERGIISPLYTCLVLQADIEPSYLAWYFKSDAWY 285 Query: 134 TQRIEAICEGATMSHADWKG--IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +G + IP+ IP Q+ I + + R T +++ Sbjct: 286 RYIYDNGSQGVRHDRVSMTDGLLRGIPVIIPSKEAQLKIAKLLDCLESRFQTELSQYESL 345 Query: 192 IELLKEKKQALV 203 + Q L Sbjct: 346 KSIRVALLQQLF 357 >gi|225023390|ref|ZP_03712582.1| hypothetical protein EIKCOROL_00248 [Eikenella corrodens ATCC 23834] gi|224943868|gb|EEG25077.1| hypothetical protein EIKCOROL_00248 [Eikenella corrodens ATCC 23834] Length = 421 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 43/400 (10%), Positives = 109/400 (27%), Gaps = 25/400 (6%) Query: 26 KVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSI 80 + P+ L G + + + I + + G K + + + Sbjct: 20 EWKPLGEVGLLVRGNGLQKKDFTESGVPAIHYGQIYTYYGNQTDKTLSFVSPELAEKLKK 79 Query: 81 FAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VT 134 KG ++ + + + + + +P + + + Sbjct: 80 VDKGDVVITNTSENIEDVGKALLYLGEEQAVTGGHATIFKPSKEIVGKFFVYFTQTEIFD 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +G + + I +PIP L Q I + + T TL + L Sbjct: 140 KAKRKFAKGTKVIDVSATDMAKIQIPIPSLETQQKIVKILDKFTELEATLEATLEAELVL 199 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K + Q ++ L+ D ++ + K + + + Sbjct: 200 RKRQYQYYRDFL----LDFDNQIGGWIADGYKGRLKDVVWKTLGEIAEYSKDRICSDKLN 255 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + N++Q E + + S +I+ I K G Sbjct: 256 EHNYVGVDNLLQNREGKKLSGYVPSEGKMTEYIVNDILIGNIRPYLKKIWQADCTGGTNG 315 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 + + V ++ YL ++ G + + + +PP+ Sbjct: 316 DV--LVIRVTDEKVNPKYLYQVLADDKFFAFNMKHAKGAKMPRGSKAAIMQYKIPIPPLP 373 Query: 374 EQFDITNVINVETARIDVL-------VEKIEQSIVLLKER 406 +Q I +++ + + + +E+ Sbjct: 374 KQEKIVAILDKFDTLTHSISEGLPHEIALRRKQYEYYREQ 413 Score = 69.8 bits (169), Expect = 9e-10, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 61/176 (34%), Gaps = 11/176 (6%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMG----LKPESYETYQIVDPGEIVFRFIDLQNDK 302 + ES + ++ YG I + + PE E + VD G++V + Sbjct: 37 QKKDFTESGVPAIHYGQIYTYYGNQTDKTLSFVSPELAEKLKKVDKGDVVITNTSENIED 96 Query: 303 RSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKF 359 + E +T + + I + + ++ K G + + Sbjct: 97 VGKALLYLGEEQAVTGGHATIFKPSKEIVGKFFVYFTQTEIFDKAKRKFAKGTKVIDVSA 156 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 D+ ++ + +P ++ Q I +++ T L +E +VL K R + Sbjct: 157 TDMAKIQIPIPSLETQQKIVKILDKFTELEATLEATLEAELVLRKRQYQYYRDFLL 212 >gi|18765818|gb|AAL78772.1|AF326621_1 HP790-like protein [Helicobacter pylori] Length = 449 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 49/427 (11%), Positives = 119/427 (27%), Gaps = 39/427 (9%) Query: 22 PKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD-TS 76 PK + + + G ++ K + + + + K + Sbjct: 13 PKGVEFRKLGDIGEFTKGNGLLKSDLQDKGRPVVHYGQIHTQYNLSIDKTISYVNEALFH 72 Query: 77 TVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + IL + + + + + + P+ + + + Sbjct: 73 KLKKAKPNDILIVTTSENVKDVGKSIAWLGNEEVAFSGEMYSYSTNENPKFIIYYFQTYF 132 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + E G + + I +PIPPL Q I + + A T L TE + Sbjct: 133 FQKEKEKKITGTKVMRIHENDLKQITIPIPPLEIQQEIVKILDAFTELNTELNTELNTEL 192 Query: 193 ELLKEKKQALVSYIVTK------------GLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 K++ Q + ++ L K L P+ E + + Sbjct: 193 NARKKQYQYYQNMLLDFNDINQSHKDAKEKLAQKPYPKRLKALLQTLAPNGVEFRKLGEV 252 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMG---------LKPESYETYQIVDPGEI 291 N + + R G + P++ + ++ I Sbjct: 253 CEIKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRILKDSIQHITPKALKGKKLFPKNSI 312 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + L + + +++ K + + + + L + + Sbjct: 313 IISTTATIGEHALLIVDSLANQQFT---FLSKKANCGIALDMKFFFYQCFLLGEWCKKNT 369 Query: 352 --GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE---- 405 S+ K+ +PP++ Q +I +++ + L+ I I K+ Sbjct: 370 NVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQYEY 429 Query: 406 RRSSFIA 412 R + Sbjct: 430 YREKLLT 436 >gi|94991199|ref|YP_599299.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10270] gi|94544707|gb|ABF34755.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10270] Length = 380 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 62/393 (15%), Positives = 122/393 (31%), Gaps = 31/393 (7%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + ++ TG++S I TG+ G++ S + Sbjct: 17 EWEEKELGELASEIGTGKSSTLSDAI-----------TGEKYSILGSTSIIGYSKTYDYC 65 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 IL ++G S + +L I+ + Sbjct: 66 GDFILTARVGANAGNLYKYSGKVKISDN------TVFIKSDYINFLYHFLHRFDIKKLSF 119 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + NI + P L EQ I E +D LI + + + LKE+KQ Sbjct: 120 GTGQPLIKSSELRNILISTPSLPEQEAIGE----LFQTVDQLIQLQRQKLATLKEQKQTF 175 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + +++ G + E+ F+ T + I + Sbjct: 176 LRKMFPAQGQKVPEIRLQGFDGEWEEKKLGEISRMFSGGTPNVGIPEYYNGN-IPFIRSA 234 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 I ++ K S + ++V+ +++ + + L G I A +A Sbjct: 235 EINSDQTELSITDKGLSNSSAKLVEKNTLLYALYGATSGEVGLSRIS----GAINQAILA 290 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + P S+ + G + +L VK L + P + EQ I N Sbjct: 291 IIPEKKYSSLFIKNWLYKQKSSIIKKYLQGGQGNLSGSIVKELTIHFPSLSEQEAIGNF- 349 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +D + + E ++ LK + + + Sbjct: 350 ---FQTLDQQMSQTEDKLIELKALKQTLLNRLF 379 >gi|90961892|ref|YP_535808.1| Type I restriction-modification system specificity subunit [Lactobacillus salivarius UCC118] gi|90821086|gb|ABD99725.1| Type I restriction-modification system specificity subunit [Lactobacillus salivarius UCC118] Length = 401 Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats. Identities = 52/404 (12%), Positives = 131/404 (32%), Gaps = 35/404 (8%) Query: 25 WKVVPIKRFTK-LNTGRTSESGK------DIIYIGLEDVESGTG--KYLPKDGNSRQSDT 75 W+ + G T ++ DI +I D+++ + K ++ + Sbjct: 13 WERKKLGDVANSYINGGTPDTQNKNYWIGDIPWIQSSDLKNDDIWNVNINKYITNKAVND 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S + I + A ++ ++ K+ L ++ Sbjct: 73 SAAKLIPANSIAIVTRVGVGKLAYMSQEYSTSQDFLSLVDIKEDLIFIMYMLYFK---IS 129 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ + +G ++ K + N+ + I + I +D I +++LL Sbjct: 130 KVSSSLQGTSIKGITKKELLNLSISIVNNTAEQNR---IGQVFKILDNSINLHEDYLQLL 186 Query: 196 KEKKQALVSYIVTKG-LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + L+ + + P+++ K +W V ++ T Sbjct: 187 YDFRSFLLQKMFSINDTFPNLRFKQFNDKW---------KYKKLGEVADIVSGGTPDTTK 237 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIV-----DPGEIVFRFIDLQNDKRSLRSAQ 309 + N E N +S + + + + ++A Sbjct: 238 HDYWNGSINWYTPAEVGNKIFVSDSQRKITNIGLENSSAKILPVGTVLFTSRAGIGKTAI 297 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 + E+G + ++ P S L K + G+G + +++ + + Sbjct: 298 LKEKGSTNQGFQSIVPKQKFLDSYFIFSMSNILKKYGESHGAGSTFLEISGKELAKARIS 357 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P I EQ +I+ V+ ++D ++ +Q I LK+ + + Sbjct: 358 LPSITEQKNISKVLF----KLDTIITLQKQEIDNLKKLKQFLLQ 397 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 57/171 (33%), Gaps = 6/171 (3%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 +N +I + ++ K + + I I + + Sbjct: 34 QNKNYWIGDIPWIQSSDLKNDDIWNVNINKYITNKAVNDSAAKLIPANSIAIVTRVGVGK 93 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 A + + + ++++ D ++ +++ + + KV ++ + + +++ L Sbjct: 94 LAYMSQEYSTSQDFLSLVDIKEDLIFIMYMLY-FKISKVSSSLQGTSIKGITKKELLNLS 152 Query: 367 VLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + EQ I +D + E + LL + RS + + Sbjct: 153 ISIVNNTAEQNRIG----QVFKILDNSINLHEDYLQLLYDFRSFLLQKMFS 199 >gi|229195092|ref|ZP_04321867.1| type I restriction-modification enzyme, S subunit [Bacillus cereus m1293] gi|228588321|gb|EEK46364.1| type I restriction-modification enzyme, S subunit [Bacillus cereus m1293] Length = 475 Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats. Identities = 61/440 (13%), Positives = 133/440 (30%), Gaps = 56/440 (12%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP++W K+ +G + K +P G + + T S Sbjct: 26 IPENWISTRFDSVLKIKSGDSLTKAK-----------MNEQGMIPVYGGNGITGTHDKSN 74 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 I+ G++G Y + + + VL + + + + + Sbjct: 75 VETETIVIGRVGYYCGSVHLTSEEAWVTDNAFVLSFPEKIIDKKFIYWNLKHCNLGQYS- 133 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + K IG + + +PP EQ I EK+ +I+ E + ++ Sbjct: 134 -KSSAQPVISGKTIGPVGINVPPYLEQKRIVEKVERLLGKIEEAKALIEEAEETFELRRA 192 Query: 201 ALVSYIVTKGLNPDVK-----------------------------MKDSGI---EWVGLV 228 A+++ L+ + +K I E + Sbjct: 193 AILNKAFRGELSAKWREDNVIVEDASSLLERIQIQKGNSSIKSNTLKIISINKEEEPFEL 252 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-------- 280 P W+ + + + + + Q + ++ L +Y Sbjct: 253 PSGWKWVRLGEISYYVTSGSRDWSKYYSDEGAMFIRTQDINKNSLNLSDVAYVSLPEKVE 312 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 +V+ +I+ K +L + E + S + S Y+ + S Sbjct: 313 GKRSLVEKSDILTTITGANVGKCALVETNIKEAYVSQSVALTKLIEKSISKYIHLSLLSP 372 Query: 341 --DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 ++ R L ED+K + + + PI+EQ I ++ + + Sbjct: 373 CGGGNELEERAYGIGRPVLSLEDIKNIKIPLAPIEEQEVIVQLVETLLNNEKESLGLVSM 432 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 L+ + S + A G+ Sbjct: 433 E-KKLETLKHSILNKAFRGE 451 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 70/219 (31%), Gaps = 12/219 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P WK V + + T + + S + ++I +D+ + S Sbjct: 251 ELPSGWKWVRLGEISYYVTSGSRDWSKYYSDEGAMFIRTQDINKNSLNLSDVAYVSLPEK 310 Query: 75 TSTVS-IFAKGQILYGKLGPYLRKAIIAD---FDGICST--QFLVLQPKDVLPELLQGWL 128 + K IL G + K + + + S L K + + L Sbjct: 311 VEGKRSLVEKSDILTTITGANVGKCALVETNIKEAYVSQSVALTKLIEKSISKYIHLSLL 370 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 +E G + I NI +P+ P+ EQ +I + + + Sbjct: 371 SPCGGGNELEERAYGIGRPVLSLEDIKNIKIPLAPIEEQEVIVQLVETLLNNEKESLGLV 430 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 +L K ++++ L + ++S IE + Sbjct: 431 SMEKKLET-LKHSILNKAFRGELGTNDPTEESAIELLKE 468 >gi|229490942|ref|ZP_04384776.1| restriction modification system DNA specificity domain protein [Rhodococcus erythropolis SK121] gi|229322149|gb|EEN87936.1| restriction modification system DNA specificity domain protein [Rhodococcus erythropolis SK121] Length = 390 Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats. Identities = 53/403 (13%), Positives = 123/403 (30%), Gaps = 32/403 (7%) Query: 28 VPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYL-PKDGNSRQSDTSTVSIFAK 83 VP+ + K+ IY+ L V+ P+ ++ ++ + + + Sbjct: 7 VPLGDICQKVPTWNPAKSSAEKEFIYVDLSSVDQRNKTITSPQVISTSEAPSRARQLLSP 66 Query: 84 GQILYGKLGPYLRKAIIADF---DGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRIE 138 G ++ + P L + ST F V PK + L W+ + + Sbjct: 67 GDVIVSTVRPNLNAVAHVEPEFDQATASTGFTVLRGDPKRIDSRYLSQWVKTPLFVSEMV 126 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 GA+ + + +P+P LA+Q I + + + + +L K Sbjct: 127 RKATGASYPAVSDRIVKASTIPLPDLADQRRIATVLDHADMLRNKRREALAQLSQLTKSI 186 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + +P + +G + +R E +I Sbjct: 187 FR-------DMFGDPTYARDSTHGVRIGDSIR-------VGSGSTPSRSRPDYYEGSIPW 232 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDP---GEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + E+ + G I+ + + +V Sbjct: 233 VKTAEVNNGYIRETSEYVSETACADARLKMYPAGSILIAMYGQGKTRGRVAVLEV--AAT 290 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 A + P T + ++ G + +L + + L +L+PP+ +Q Sbjct: 291 TNQACAVLPPGDTHDTRFLFTQLLMSYERLRDLGRGGNQPNLNAKHIAGLDILLPPLDQQ 350 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + +I+ L + ++ L ++ + A G+ Sbjct: 351 QEF----SRRAKQIEQLESRHRIALDALDSLFAAAQSRAFRGE 389 >gi|298736493|ref|YP_003729019.1| type I restriction enzyme subunit S [Helicobacter pylori B8] gi|298355683|emb|CBI66555.1| type I restriction enzyme, S subunit [Helicobacter pylori B8] Length = 422 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 48/407 (11%), Positives = 120/407 (29%), Gaps = 26/407 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNLEFWKNGTIPWFRMEDLRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCNLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT--LITERI 189 + + + + + D PIPPL Q I + + A T ++ Sbjct: 132 LLGEWCKNNINVSGFASVDMTAFKKYKFPIPPLEVQQEIVKILDAFTELNTELKARKKQY 191 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + + + + + ++ K LVP E + ++ Sbjct: 192 EYYQNMLLDFKDIKQNHKDAKMSTKPYPKRLKTLLQTLVPKGVEFRKLGEVLEYDQPNKY 251 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 ++ ++ +T +G E YQ ++ + + + Sbjct: 252 CVMSKEFDKSYPTPVLTAGKTFILGYTNEKDNIYQASKSSPVII----FDDFTTATQWVD 307 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + ++ + + + + + + G RQ + ++ + + Sbjct: 308 FPFKVKSSAMKILLPKNPTINIRFIFFYMQTIPYNI---SGEHTRQWISR--YSQITIPI 362 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 PP++ Q +I +++ A L+ I I K+ R + Sbjct: 363 PPLEIQQEIVKILDQFLALTTDLLAGIPAEIEARKKQYEYYREKLLT 409 >gi|331085150|ref|ZP_08334236.1| hypothetical protein HMPREF0987_00539 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330407933|gb|EGG87423.1| hypothetical protein HMPREF0987_00539 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 417 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 51/402 (12%), Positives = 121/402 (30%), Gaps = 21/402 (5%) Query: 25 WKVVPIKRFTKLNTGRTS----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST--- 77 W+ + ++ T ++ + +++++G + D + + Sbjct: 16 WEQRKLGDVLEVIKDGTHGTHQDAEDGPFLLSAKNIKNGVIIWDETDRKISEDEYEKIHS 75 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ--FLVLQPKDVLPELLQGWLLSIDVTQ 135 +L +G AI+ D GI + + +++ E L + + + Sbjct: 76 KFKLQNNDVLLTIVGSIGETAILKDISGITFQRSVAFLRPSEELSSEFLYSEIQTPKFQK 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ + + IP ++ +KI + ID L+T R + Sbjct: 136 ELDCRKSTSAQPGIYLGDLSEIPFAYSKDKDEQ---KKIGEYFLNIDNLLTLHQRKCDET 192 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 KE K+ ++ + K +++ G E + Sbjct: 193 KELKKYMLQKMFPKKGEKVPEIRFKGFTDAWEQRKFSECYKMTSGYAFKMSDYCDTGVGL 252 Query: 256 ILSLSYGNIIQKLETRNM-GLKPESYETYQIVDPGEIVFR---FIDLQNDKRSLRSAQVM 311 I S + I + N + ++ +IV I N K + ++ Sbjct: 253 INGESIQHGIINDDNLNYLPESFIQQYSEFLLKESDIVVGLNRPITNGNLKIARIPSKYN 312 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + A V D + L+ L + + + +++P Sbjct: 313 NSLLYQRAGKIVYKIDCDKNFTYVLLSQEILKHTLVEAVGSDQPFISTSKLDNWKMMMPS 372 Query: 372 -IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++EQ I +D L+ ++ LKE + + Sbjct: 373 DMEEQEKIGLY----FTSLDHLITLHQRKCDSLKELKKYMLQ 410 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 16/151 (10%), Positives = 44/151 (29%), Gaps = 8/151 (5%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + + + E + + +++ + + L+ + S Sbjct: 58 WDETDRKISEDEYEKIHSKFKLQNNDVLLTIVGSIGETAILKDISGITFQ--RSVAFLRP 115 Query: 325 PHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVI 382 + S +L +++ K + + + D+ +P EQ I Sbjct: 116 SEELSSEFLYSEIQTPKFQKELDCRKSTSAQPGIYLGDLSEIPFAYSKDKDEQKKIGEY- 174 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ID L+ ++ KE + + Sbjct: 175 ---FLNIDNLLTLHQRKCDETKELKKYMLQK 202 >gi|21283479|ref|NP_646567.1| specificity determinant HsdS [Staphylococcus aureus subsp. aureus MW2] gi|49486626|ref|YP_043847.1| putative restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus MSSA476] gi|21204920|dbj|BAB95615.1| probable specificity determinant HsdS [Staphylococcus aureus subsp. aureus MW2] gi|49245069|emb|CAG43535.1| putative restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus MSSA476] gi|164551508|gb|ABY60971.1| Sau1hsdS2 [Staphylococcus aureus] Length = 399 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 58/403 (14%), Positives = 136/403 (33%), Gaps = 39/403 (9%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 20 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGKFFSKLDRQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I T+ L + + EW + + K Sbjct: 197 LELLQQQKKGYMQKIFTQELRFKDENGEEYPEWENKFIKDIFIFENNRRKPITSSLREKG 256 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + ++ N ++ + + S Sbjct: 257 LYPYYGATGIIDYVKDYLFNNEE---------------RLLIGEDGAKWGQFETSSFIAN 301 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVP 370 + + + VK + + ++ + + K A +G L ++ + + +P Sbjct: 302 GQYWVNNHAHVVKSNDHNLFFMNYYLN----FKELRAFVTGNAPAKLTHANLCNINLKIP 357 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + EQ + ++ ID + I LLKER+ + Sbjct: 358 CLTEQ----DKVSALLKSIDNKMNNQMNRIELLKERKKELLQK 396 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 51/181 (28%), Gaps = 6/181 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E + + I L NI + Sbjct: 10 PELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYY 129 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETAR 388 + +L+ K+F A G R+ L F+++ L + P I +EQ I + + Sbjct: 130 NFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGKFFSKLDRQ 189 Query: 389 I 389 I Sbjct: 190 I 190 >gi|194435174|ref|ZP_03067406.1| putative type I restriction-modification system, S subunit [Shigella dysenteriae 1012] gi|194416592|gb|EDX32729.1| putative type I restriction-modification system, S subunit [Shigella dysenteriae 1012] gi|320179362|gb|EFW54320.1| Type I restriction-modification system, specificity subunit S [Shigella boydii ATCC 9905] Length = 578 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 55/490 (11%), Positives = 126/490 (25%), Gaps = 101/490 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDV 56 +K K P+ S + +P+ W+ V + ++ GR + + + + ++ Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVRVADLMEVINGRAYKKHEMLQTGTPLLRVGNL 140 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + + G ++Y + + + + Sbjct: 141 ------FTSNEWYYSDLQLDENKYINNGDLIYAWSASFGPFIWTGEKVIYHYHIWKLNLF 194 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + +T +I++ G M H + + + +PP+ EQ I KI Sbjct: 195 AEEYSNKYFIHDFLLSITDKIKSQGNGIAMLHMTKEKMEQQIIALPPINEQQQIVRKIRE 254 Query: 177 ET-----------------------------------------VRIDTLITERIRFIELL 195 T RI + Sbjct: 255 LTVLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNAKELAENWARISEHFDTLFTTEASV 314 Query: 196 KEKKQALVSYIVTKGLNPDVKMKD-------------------------------SGIEW 224 KQ ++ V L P + S E Sbjct: 315 DALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLPPISDEEK 374 Query: 225 VGLVPDHWEVKP-------FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 +P+ WE + + I ++ G+I + + Sbjct: 375 PFELPEGWEWCKFGLTSEFINGDRGSNYPNKNEYVSQGIPWINTGHIEKNGTLTVTEMNF 434 Query: 278 ESYETYQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + + + G++V+ K + + I +S + Y Sbjct: 435 ITEGKFNELRSGKIQKGDLVYCLRGATFGKTAFVTPYETG-AIASSLMIIRPFITEMGGY 493 Query: 333 LAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDI---TNVINVETAR 388 + + S Y + +L V PP+ EQ+ I +++ + Sbjct: 494 IYNYLTSPFGRSQIYRFDNGSAQPNLSANSVMLYSFPCPPLTEQYRIFSQVGLLHELCDK 553 Query: 389 IDVLVEKIEQ 398 + ++ +Q Sbjct: 554 LKTRIKTAQQ 563 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 61/202 (30%), Gaps = 13/202 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ ++ G + + I +I +E + + Sbjct: 377 ELPEGWEWCKFGLTSEFINGDRGSNYPNKNEYVSQGIPWINTGHIEKNGTLTVTEMNFIT 436 Query: 72 QSDTSTVS--IFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQG 126 + + + KG ++Y G K + I S+ ++ + + Sbjct: 437 EGKFNELRSGKIQKGDLVYCLRGATFGKTAFVTPYETGAIASSLMIIRPFITEMGGYIYN 496 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L S +I G+ + + P PPL EQ I ++ D L T Sbjct: 497 YLTSPFGRSQIYRFDNGSAQPNLSANSVMLYSFPCPPLTEQYRIFSQVGLLHELCDKLKT 556 Query: 187 ERIRFIELLKEKKQALVSYIVT 208 + AL + + Sbjct: 557 RIKTAQQTQLHLADALTNAAIN 578 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 63/193 (32%), Gaps = 5/193 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE L+ +N + K E + + Sbjct: 93 SEEEKPFELPEGWEWVRVADLMEVINGRAYKKHEMLQTGTPLLRVGNLFTSNEWYYSDLQ 152 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + + ++ G++++ + + I + ++ + S Sbjct: 153 LDENKYINNGDLIYAWSASFGPFIWTGEKVIYHYHIW--KLNLFAEEYSNKYFIHDFLLS 210 Query: 340 YDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + G+G + E +++ + +PPI EQ I I T D L ++ Sbjct: 211 --ITDKIKSQGNGIAMLHMTKEKMEQQIIALPPINEQQQIVRKIRELTVLCDQLEQQSLT 268 Query: 399 SIVLLKERRSSFI 411 S+ ++ + + Sbjct: 269 SLDAHQQLVETLL 281 >gi|317505566|ref|ZP_07963477.1| type I site-specific deoxyribonuclease [Prevotella salivae DSM 15606] gi|315663314|gb|EFV03070.1| type I site-specific deoxyribonuclease [Prevotella salivae DSM 15606] Length = 531 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 49/444 (11%), Positives = 120/444 (27%), Gaps = 74/444 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP W+ I +G+ S K+I +I ++ G + + Sbjct: 86 EIPNGWEWTRIGSVFNHASGKQQSSNKNIGTPQKFITTSNLYWGYFILDNVKIMNFTEEE 145 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVT 134 KG +L + G ++ I FD Q V + + + + + Sbjct: 146 IKRCSATKGDLLVCEGGAGYGRSAIWHFDYDICLQNHVHRLRPYINGICEYVYYFIYLLK 205 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G M + + +P PPL+ Q I ++ I + +L Sbjct: 206 ESNNLTSVGTAMPGLSANRLKGLLLPFPPLSAQKRIVAQLGVLLPLIAKYSDVQNSLDKL 265 Query: 195 ----LKEKKQALVSYIVTKGL---NPDVKMKDSGIEWV---------------------- 225 + K++++ + L +P + +E + Sbjct: 266 NITINDKLKKSILQEAIQGRLVSQDPTDEPASILLERIKAEKVRLVEDGVLKEKVLVSST 325 Query: 226 -------------------------GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 P+ W + + K ++ + + Sbjct: 326 IFKGEDNKYYEQVGSTRLDISEVIPFEEPNGWRWCRLKDICSIFTGATFKKEDATMNGIG 385 Query: 261 YGNIIQKLET--------RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 ++ L E + ++ +IV + + + + Sbjct: 386 IRVWRGGNILPFALINKPDDLYLPNEKVKDNILLKKNDIVTPAVTSLENIGKMARTEYDM 445 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSY-------DLCKVFYAMGSGLRQSLKFEDVKRL 365 ++ + L+ + + + K ++ E + + Sbjct: 446 PHTTVGGFVFIIRPFFSVDTLSQYLLNLLSSPILIEYMKTITNKSGQAFYNIGKERLGQA 505 Query: 366 PVLVPPIKEQFDITNVINVETARI 389 + +PP+ EQ I ++ +I Sbjct: 506 LLPIPPLAEQERIVEKVSQTFDKI 529 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 26/212 (12%), Positives = 60/212 (28%), Gaps = 14/212 (6%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRK------NTKLIESNILSLSYGNIIQKLETR 271 K E +P+ WE ++ + K N + I + + L+ Sbjct: 77 KCIDDEIPFEIPNGWEWTRIGSVFNHASGKQQSSNKNIGTPQKFITTSNLYWGYFILDNV 136 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + E G+++ + ++ + + ++P+ Sbjct: 137 KIMNFTEEEIKRCSATKGDLLVCEGGAGYGRSAIWHFD--YDICLQNHVHRLRPYINGIC 194 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + L +K L + PP+ Q I + V I Sbjct: 195 EYVYYFIYLLKESNNLTSVGTAMPGLSANRLKGLLLPFPPLSAQKRIVAQLGVLLPLI-A 253 Query: 392 LVEKIEQSIVLLK-----ERRSSFIAAAVTGQ 418 ++ S+ L + + S + A+ G+ Sbjct: 254 KYSDVQNSLDKLNITINDKLKKSILQEAIQGR 285 >gi|304380447|ref|ZP_07363125.1| EcoA family type I restriction-modification system [Staphylococcus aureus subsp. aureus ATCC BAA-39] gi|304340965|gb|EFM06887.1| EcoA family type I restriction-modification system [Staphylococcus aureus subsp. aureus ATCC BAA-39] Length = 390 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 58/405 (14%), Positives = 130/405 (32%), Gaps = 39/405 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + T + K + I + +Y K +S+ + ++ Sbjct: 7 EWEEKQLGDLTDRVIRKNKNLESKKPLTISGQLGLIDQTEYFSKSVSSKNLE--NYTLIK 64 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G+ Y K G+ S+ ++ K + + R Sbjct: 65 NGEFAYNKSYSNGYPLGAIKRLTRYDSGVLSSLYICFSIKSEMSKDFMEAYFDSTHWYRE 124 Query: 138 EAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + I + P L EQ I + +I+ + Sbjct: 125 VSGIAVEGARNHGLLNVSVNDFFTILIKYPSLEEQQKIGKFFSKLDRQIELEEQKLELLQ 184 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + K Q + S + + G WE + E N ++ Sbjct: 185 QQKKGYMQKIFSQELRFK------------DENGEDYPDWENSKIEKYLKERNERSD--K 230 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + II+ E + Y++V +I + + + + Sbjct: 231 GQMLSVTINSGIIKFSELDRKDNSSKDKSNYKVVRKNDIAYNSMRMWQGASGKSNY---- 286 Query: 313 RGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLRQ---SLKFEDVKRLPVL 368 GI++ AY + P S+ + +++ + F GL +LK++ +K + + Sbjct: 287 NGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTPDTWNLKYKQLKNINID 346 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P ++EQ I + ++D+L+ K + I +L++ + SF+ Sbjct: 347 IPVLEEQEKIGDF----FKKMDILISKQKMKIEILEKEKQSFLQK 387 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 65/184 (35%), Gaps = 9/184 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFA 82 W+ I+++ K R+ + + + + SG K+ D ++ D S + Sbjct: 211 DWENSKIEKYLKERNERSDKGQM----LSVT-INSGIIKFSELDRKDNSSKDKSNYKVVR 265 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I Y + + + ++++GI S + VL P L G+ I Sbjct: 266 KNDIAYNSMRMWQGASGKSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINS 325 Query: 143 G---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + +K + NI + IP L EQ I + + I + + + Sbjct: 326 QGLTPDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKMKIEILEKEKQSFL 385 Query: 200 QALV 203 Q + Sbjct: 386 QKMF 389 >gi|332364617|gb|EGJ42386.1| type I restriction-modification system specificty subunit [Streptococcus sanguinis SK1059] Length = 386 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 49/398 (12%), Positives = 122/398 (30%), Gaps = 26/398 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + ++ L G +S K I + + ++ G + + + Sbjct: 2 EYKKLQSIAPLRGGFAFKSEKFQNVGIPIVRISNI-GFDGTVGGEFEYYSKLSPDEKFVL 60 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIE 138 +L G K + D + V ++ + L L + T ++ Sbjct: 61 KGRSLLLAMSGATTGKIAMLDSEEEYYQNQRVGYFQNNGAVDYDFLSSVLQTKAFTNQLN 120 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A+ + K I + IP E+ + +D L+ + + Sbjct: 121 AVLVAGAQPNISSKEIDSFEFCIPESIEEQSAIGSL---FRTLDDLLASYKDNLANYQSL 177 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K ++S + K +++ G E V N K+ + Sbjct: 178 KATMLSKMFPKAGQTVPEIRLDGFEGEWEKLK-------LRDVVHTNPKSELPENFKYID 230 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 L + + R ++ G++ ++ + L ++ + ++ Sbjct: 231 LESVVGTRINKIREERKTSAPSRAQRLAKKGDVFYQTVRPYQKNNYLFKLDEIDY-VFST 289 Query: 319 AY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQ 375 Y + + DS +L L+++ +G ++ D+ + + +P +EQ Sbjct: 290 GYAQLRPIFNRCDSDFLLILLQNNRFLSNVLDRCTGTSYPAINVNDLIEILIAIPSYEEQ 349 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + +D L+ ++ I L+ + + Sbjct: 350 QAIGAY----FSNLDSLISAHQEKISQLETLKKKLLQD 383 >gi|218261756|ref|ZP_03476491.1| hypothetical protein PRABACTJOHN_02162 [Parabacteroides johnsonii DSM 18315] gi|218223800|gb|EEC96450.1| hypothetical protein PRABACTJOHN_02162 [Parabacteroides johnsonii DSM 18315] Length = 431 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 55/411 (13%), Positives = 124/411 (30%), Gaps = 29/411 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYL--PKDGNSRQSD 74 + WK P+ ++ G T S DI +I D+ + + Sbjct: 28 EGWKRTPLLEICEIIGGGTPTSSNDVYWNGDIPWISSSDINENNISEITPTRHITKDAIK 87 Query: 75 TSTVSIFAKGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + I + ++G + K + D S F L + L L I Sbjct: 88 NSATKLCKAPSIHIVSRVG--VGKVAFSRVDICTSQDFTNLCNINCNYIFLSYLLSIIMK 145 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + E +G ++ I N+ +P+P + EQ I + + + I+ IE Sbjct: 146 QKVQE--TQGTSIKGIASAEIKNLHVPLPEIEEQQRIADCLSSLDDL----ISAVADKIE 199 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL-VPDHWEVKPFFALVTELNRKNTKLI 252 LKE K+ L+ + ++ + G + + L + L Sbjct: 200 TLKEYKKGLMQQLFPAEGKTIPAIRFPEFQNAGEWMLLPIKKCNIDILTGYAFKGTEILE 259 Query: 253 ESNILSLSYGNII-------QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 +++ + L G I R + + Y+++ ++ +L Sbjct: 260 DNDGIPLMRGINITEGVVRHNNDIDRFYSGEDHTLSKYRLLCNDLVIAMDGSKVGRNFAL 319 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKR 364 + Q ++ + ++ + S K S + + ++ Sbjct: 320 INKQDEGSLLVQRVARLRADNIDFIMFIYQQIGSDRFKKYIDRINTSSGIPHISLKQIED 379 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + ++ + +D L+ + LK + + Sbjct: 380 FKIWTTRND--KEF-RMVTNCLSSVDELISTEIAKLDQLKAHKKGLMQQLF 427 >gi|169347040|ref|ZP_02865982.1| restriction modification system DNA specificity domain [Clostridium perfringens C str. JGS1495] gi|169296723|gb|EDS78852.1| restriction modification system DNA specificity domain [Clostridium perfringens C str. JGS1495] Length = 389 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 64/401 (15%), Positives = 136/401 (33%), Gaps = 33/401 (8%) Query: 30 IKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +K ++++G +S + I I + D+ SG + D I Sbjct: 7 LKNLIEIDSGYAFKSSFFNDNFEGIPIIRIRDINSGIAE------TYYSGDYEEKFIVNN 60 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 +L G G + + + + + ++ + + + IE Sbjct: 61 DDLLIGMDG-NFKIRKWSGGKALLNQRVCRIKSISNKLSNEYLYRILPLELKLIEDKTSF 119 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 T+ H K I NI + IP + Q I + I ID + L + Sbjct: 120 VTVKHLSVKDINNIELIIPDIDIQNKIVKIIDKSQELIDNRKKQIEELDLL-------VK 172 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + P K + + + +K+ + E ++ Sbjct: 173 SKFIEMFGTPIE--KRFIGKTLPEIIAEGRYSLKRGPFGGSLKKDDFIQEGYLVYEQRHA 230 Query: 264 IIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I + + + Y+ V+P +++ + + S + + GII A + Sbjct: 231 IHNDFDYAKYYISKDKYDEMIMFKVEPKDLLVSCSGVTLGRIS-EVPEGAKAGIINQALL 289 Query: 322 AVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSG-LRQSL-KFEDVKRLPVLVPPIKEQFD 377 + + +++ Y L R+ + + G + +VK + L PPI+ Q Sbjct: 290 KITLNQDIMNNIYFMQLFRNEQIQDKLFGFSRGSGIPNFPSMSEVKSMEFLCPPIELQNK 349 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +N ID L ++E+S+ L++ +S + A G+ Sbjct: 350 FADFVN----NIDKLKFEMEKSLKELEDNFNSLMQKAFKGE 386 >gi|218690007|ref|YP_002398219.1| putative type I restriction modification system protein [Escherichia coli ED1a] gi|218427571|emb|CAR08467.2| putative Restriction modification system, type I similar to hsdS (fragment) [Escherichia coli ED1a] Length = 521 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 57/408 (13%), Positives = 138/408 (33%), Gaps = 36/408 (8%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +P+ +G +++ + + + SDTS I K ++ Sbjct: 5 TIPLGDILT-QSGHHRAGNRELPVLSITMKNGLVDQSDKFKKRIASSDTSKYRIVYKNEL 63 Query: 87 LYG-KLGPYLRKAIIADFDGICSTQFLVLQPKD---VLPELLQGWLLSIDVTQRIEAICE 142 + G + + GI S + + + KD L+ +L S + + + + Sbjct: 64 VVGFPIDEGVLGFQTKYPVGIVSPAYGIWKLKDESVCHIPYLERYLRSSEARRLYASRMQ 123 Query: 143 GA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G ++ +P PP+ +Q I + +++ LI +R + ++ L + + Sbjct: 124 GVVARRRSLTKSDFLSLEVPFPPINDQARIANLL----AKVEGLIEQRKQLLQYLDDLLK 179 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 ++ V +P K + +G + V + + Sbjct: 180 SV---FVDMFSDPVKNAKGWELTTIGEL-----------AVDVRYGTSVSAQGGKYKYIR 225 Query: 261 YGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 NI LK + + + G++VF + + E II Sbjct: 226 MNNITPDGYWDFENLKYIDVDNKDLDKYSLQKGDLVFNRTNSKELVGKTAVYDRDETVII 285 Query: 317 TSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 + V+ + + W + S + + + ++ ++++ +P+L PP++ Sbjct: 286 AGYLIRVRFDQQTNPWFVWGHLNSKFGKAKLFNLCRNIIGMANINAQELRAIPILKPPLE 345 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 Q ++ A + + +QS+ L+ A G+++L Sbjct: 346 LQNKFATIVEKAHA----IKFRYQQSLADLETLYDVVSQKAFKGELEL 389 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 73/174 (41%), Gaps = 14/174 (8%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNI-IQKLETRNMGLKPESYETYQIVDPG 289 + P ++T+ E +LS++ N + + + + Y+IV Sbjct: 2 NRMTIPLGDILTQSGHHRAGNRELPVLSITMKNGLVDQSDKFKKRIASSDTSKYRIVYKN 61 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW---LMRSYDLCKVF 346 E+V + D+ L GI++ AY K ++ + +RS + +++ Sbjct: 62 ELV---VGFPIDEGVLGFQTKYPVGIVSPAYGIWKLKDESVCHIPYLERYLRSSEARRLY 118 Query: 347 YAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + G+ R+SL D L V PPI +Q I N++ A+++ L+E+ + Sbjct: 119 ASRMQGVVARRRSLTKSDFLSLEVPFPPINDQARIANLL----AKVEGLIEQRK 168 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/218 (11%), Positives = 55/218 (25%), Gaps = 11/218 (5%) Query: 23 KHWKVVPIKRFT-KLNTGRTSE-SGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVS 79 K W++ I + G + G YI + ++ G + + Sbjct: 194 KGWELTTIGELAVDVRYGTSVSAQGGKYKYIRMNNITPDGYWDFENLKYIDVDNKDLDKY 253 Query: 80 IFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-- 133 KG +++ + D I + + ++ L+ Sbjct: 254 SLQKGDLVFNRTNSKELVGKTAVYDRDETVIIAGYLIRVRFDQQTNPWFVWGHLNSKFGK 313 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + M++ + + + IP+ PPL Q + Sbjct: 314 AKLFNLCRNIIGMANINAQELRAIPILKPPLELQNKFATIVEKAHAIKFRYQQSLADLET 373 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 L Q + P + G P+H Sbjct: 374 LYDVVSQKAFKGELELSRVPIPTQIFFPVS--GEEPEH 409 >gi|256845970|ref|ZP_05551428.1| anti-codon nuclease masking agent [Fusobacterium sp. 3_1_36A2] gi|256719529|gb|EEU33084.1| anti-codon nuclease masking agent [Fusobacterium sp. 3_1_36A2] Length = 592 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 57/404 (14%), Positives = 127/404 (31%), Gaps = 29/404 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTVSI 80 K+V I + G + G K I +V Y K +D Sbjct: 195 KMVKIGDLFEFKNGINKDKGSFGKGTPIINYVNVYKKNKIYFEDLKGLVEASNDELVRYG 254 Query: 81 FAKGQILYGKLGPYLRKAIIAD------FDGICSTQFLVLQPKDVL--PELLQGWLLSID 132 +G + + + + + + + S L +P L PE + + Sbjct: 255 VKRGDVFFTRTSETIEEIGYTSVLLEDIENCVFSGFLLRARPITDLLLPEYCAYCFSTSN 314 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I T + + + I +P+PPL Q I E + + L I Sbjct: 315 IRNTIIKKSTYTTRALTNGTSLSQIEIPLPPLEVQKRIVEVLDNFEKICNDLNIGLPAEI 374 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E +++ + ++++T + K + + + + + K Sbjct: 375 EARQKQYEFYRNFLLTFKIENCTLPKTRQDKTRQDIIKLFMYIFGYIELELGEILKIKNG 434 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I + G +TY ++ R + N + ++ Sbjct: 435 SDYKKF-----NIGNIPVYGSGGIINYIDTYIYDKESVLIPRKGSIGNLFYVDKPFWTVD 489 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 T Y + + YL + + +L K+ +G SL + ++ + +PP+ Sbjct: 490 ----TIFYTVIDKDVVIPKYLYYYLSKMNLEKL---NTAGGVPSLTQTVLNKILIPLPPL 542 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +EQ I ++++ + + E + I ++ R + Sbjct: 543 EEQQRIIDILDRFDKLCNDISEGLPAEIEARQKQYEYYREKLLT 586 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 46/396 (11%), Positives = 113/396 (28%), Gaps = 34/396 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + K+ G K +S KY +G + Sbjct: 13 PNGVEYKELGDIAKVTIGEFVHKDK----------QSENAKYPVYNGGISNTGYYDEYNE 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K +I+ G + + D + + + + + + Sbjct: 63 EKNKIIISARGANAGYINRIFVNYWAGNSCYTINANDKIINWNFLYYVLKNKEKGLLNKQ 122 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + ++ K + +I +P+PPL Q I + T L E + K++ Sbjct: 123 QTGSIPSISKKQVESILVPVPPLEVQDEIVRILDNFTALTAELTAELTAELTARKKQYSW 182 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 Y++ N +K +G + + + + Sbjct: 183 YRDYLLKFE-NKVKMVK------IGDLFEFKNGINKDKGSFGKGTPIINYVN-----VYK 230 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGIITSA 319 N I + + + V G++ F ++ S + +E + + Sbjct: 231 KNKIYFEDLKGLVEASNDELVRYGVKRGDVFFTRTSETIEEIGYTSVLLEDIENCVFSGF 290 Query: 320 YMAVKPHGID--STYLAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +P Y A+ + ++ + R + ++ + +PP++ Q Sbjct: 291 LLRARPITDLLLPEYCAYCFSTSNIRNTIIKKSTYTTRALTNGTSLSQIEIPLPPLEVQK 350 Query: 377 DITNVINVETARIDVL-------VEKIEQSIVLLKE 405 I V++ + L +E ++ + Sbjct: 351 RIVEVLDNFEKICNDLNIGLPAEIEARQKQYEFYRN 386 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 48/149 (32%), Gaps = 7/149 (4%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K N G+ Y + +I+ + + S Y Sbjct: 43 KYPVYNGGISNTGYYDEYNEEKNKIIISARGAN---AGYINRIFVNYWAGNSCYTINAND 99 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 I + + + + +G S+ + V+ + V VPP++ Q +I +++ T Sbjct: 100 KIINWNFLYYVLKNKEKGLLNKQQTGSIPSISKKQVESILVPVPPLEVQDEIVRILDNFT 159 Query: 387 ARIDVLVEKIEQSIVLLKE----RRSSFI 411 A L ++ + K+ R + Sbjct: 160 ALTAELTAELTAELTARKKQYSWYRDYLL 188 >gi|309379402|emb|CBX21969.1| putative recognition subunit of Type I restriction/modification system [Neisseria lactamica Y92-1009] Length = 414 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 48/404 (11%), Positives = 116/404 (28%), Gaps = 37/404 (9%) Query: 26 KVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSI 80 + P+ L G + + + I + + G K + + + Sbjct: 20 EWKPLGEVGLLVRGNGLQKKDFTESGVPAIHYGQIYTYYGNQTDKTLSFVSPELAEKLKK 79 Query: 81 FAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VT 134 KG ++ + + + + + +P + + + Sbjct: 80 VDKGDVVITNTSENIEDVGKALLYLGEEQAVTGGHATIFKPSKEIVGKFFVYFTQTEIFD 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +G + + I +PIPPL Q I + + T TL E + Sbjct: 140 KAKRKFAKGTKVIDVSATDMAKIKIPIPPLEIQQKIVKILDKFTELEATLEAELVLRKRQ 199 Query: 195 LKEKKQALVSYIVTKGLN----PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + + L+ + G ++KD + +G + ++ + + + E N Sbjct: 200 YRYYRDFLLDFDNQIGGGIADGYQCRLKDVVWKTLGEIAEYSKNRICSDKLNEHN----- 254 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + N++Q E + + S +I+ I K Sbjct: 255 -------YVGVDNLLQNREGKKLSGYVPSEGKMTEYIVNDILIGNIRPYLKKIWQADCTG 307 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 G + + V ++ YL ++ G + + + + Sbjct: 308 GTNGDV--LVIRVTDEKVNPKYLYQVLADDKFFAFNMKHAKGAKMPRGSKTAIMQYKIPI 365 Query: 370 PPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKER 406 PPI EQ I +++ + + + +E+ Sbjct: 366 PPIPEQEKIVAILDKFDTLTHSVSEGLPHEIALRRKQYEYYREQ 409 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 62/169 (36%), Gaps = 12/169 (7%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMG----LKPESYETYQIVDPGEIVFRFIDLQNDK 302 + ES + ++ YG I + + PE E + VD G++V + Sbjct: 37 QKKDFTESGVPAIHYGQIYTYYGNQTDKTLSFVSPELAEKLKKVDKGDVVITNTSENIED 96 Query: 303 RSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKF 359 + E +T + + I + + ++ K G + + Sbjct: 97 VGKALLYLGEEQAVTGGHATIFKPSKEIVGKFFVYFTQTEIFDKAKRKFAKGTKVIDVSA 156 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK-ERR 407 D+ ++ + +PP++ Q I +++ T L +E +VL K + R Sbjct: 157 TDMAKIKIPIPPLEIQQKIVKILDKFTE----LEATLEAELVLRKRQYR 201 >gi|126667037|ref|ZP_01738012.1| Restriction modification system DNA specificity domain [Marinobacter sp. ELB17] gi|126628443|gb|EAZ99065.1| Restriction modification system DNA specificity domain [Marinobacter sp. ELB17] Length = 527 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 60/159 (37%), Gaps = 3/159 (1%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-- 320 +R + + G+I+ + + + R + Sbjct: 55 GRFLDKSSRFLTRSKARELNCTFLRAGDILVARMPDPLGRCCIFPLDEDGRYVTVVDICA 114 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +++ ++ +L+ S + A+ SG R+ + ++ +P+ +PP+ EQ I Sbjct: 115 IRFGDSRVNAKFMMYLINSPSIRGKISALQSGSTRKRISRGNLATIPLPLPPLNEQHRIV 174 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I + +D +E ++ + LK R + + A G+ Sbjct: 175 AKIETLFSELDKGIESLKTAREQLKVYRQAVLKHAFEGK 213 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 71/492 (14%), Positives = 141/492 (28%), Gaps = 94/492 (19%) Query: 18 IGAIPKHWKVVPIKRFT---------KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 + + W I+ + + D+ I L D+ G G++L K Sbjct: 5 LNELADGWVECVIEDVVGKGGIFKDGDWVESKDQDPNGDVRLIQLADI--GDGRFLDKSS 62 Query: 69 ---NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD------FDGICSTQFLVLQPKDV 119 ++ + G IL ++ L + I + + + V Sbjct: 63 RFLTRSKARELNCTFLRAGDILVARMPDPLGRCCIFPLDEDGRYVTVVDICAIRFGDSRV 122 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET- 178 + + + S + +I A+ G+T + IP+P+PPL EQ I KI Sbjct: 123 NAKFMMYLINSPSIRGKISALQSGSTRKRISRGNLATIPLPLPPLNEQHRIVAKIETLFS 182 Query: 179 ---------------------------------------------------VRIDTLITE 187 RI Sbjct: 183 ELDKGIESLKTAREQLKVYRQAVLKHAFEGKLTAKWREQNKDKLETPQQLLARIQQERQA 242 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMK------DSGIE--WVGLVPDHWEVKPF-- 237 R + + + K P K S E +P W Sbjct: 243 RYQQKLQEWQVAVKMWEENGKKENKPGKPKKLAALKETSENETRNFPQLPVGWTYVRLGL 302 Query: 238 -FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEI- 291 T K + L N I + LK S+E +++ + G++ Sbjct: 303 LIEEPTYGTSKKCSYDSGQVGVLRIPN-ISHGAIDSSNLKFASFEEHEVKALALAKGDLL 361 Query: 292 -VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL--MRSYDLCKVFYA 348 + + A+ + + ++P+ L + S+ L + + Sbjct: 362 TIRSNGSVSLVGSCALIAEEDTDFLFAGYLIRLRPNHDLVAPFFLLSVLTSHLLRRQIES 421 Query: 349 MGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 ++ +++ L V +P + EQ ++ + + T I V +IE + + Sbjct: 422 AAKSTSGVNNINTGEIQNLIVPLPSMVEQVELLKFLEISTPNIAVAEYEIEVQLKKSEVL 481 Query: 407 RSSFIAAAVTGQ 418 R S + A +G+ Sbjct: 482 RQSILKKAFSGK 493 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 70/210 (33%), Gaps = 13/210 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W V + + T TS+ + + + ++ G S + Sbjct: 290 QLPVGWTYVRLGLLIEEPTYGTSKKCSYDSGQVGVLRIPNISHGAIDSSNLKFASFEEHE 349 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELLQ---G 126 AKG +L + + D D + + + L+P L Sbjct: 350 VKALALAKGDLLTIRSNGSVSLVGSCALIAEEDTDFLFAGYLIRLRPNHDLVAPFFLLSV 409 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + Q A + +++ + I N+ +P+P + EQV + + + T I Sbjct: 410 LTSHLLRRQIESAAKSTSGVNNINTGEIQNLIVPLPSMVEQVELLKFLEISTPNIAVAEY 469 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 E ++ + +Q+++ + L P Sbjct: 470 EIEVQLKKSEVLRQSILKKAFSGKLVPQDP 499 >gi|307824516|ref|ZP_07654741.1| restriction modification system DNA specificity domain protein [Methylobacter tundripaludum SV96] gi|307734500|gb|EFO05352.1| restriction modification system DNA specificity domain protein [Methylobacter tundripaludum SV96] Length = 394 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 68/419 (16%), Positives = 137/419 (32%), Gaps = 54/419 (12%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN 69 YK + V G IP+ W+V + G+ E I G V K++ +G Sbjct: 22 YKQTEV---GVIPEDWEVKTVGSVAAYANGKAHEGS--ISDFGKYIVV--NSKFISTNGK 74 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAI------IADFDGICSTQFLVLQPKDVLPEL 123 ++ S ++ +IL +AI + + + VL+P + L Sbjct: 75 VKKYSDDCFSPTSESEILMVMSDVPNGRAIARCFFVDHNDLYTVNQRICVLRPNQINGRL 134 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRID 182 L + +G ++ + + P+ IPP AEQ I E + V I+ Sbjct: 135 FYYKLNRHPF---YLSFDDGVKQTNLRKNDVLSCPLTIPPTKAEQEAIAEALSDADVFIE 191 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 +L + + + Q + +G + + W VK ++T Sbjct: 192 SLEQLIAKKRHIKQGAMQE----------------RLTGKKRLPGFSGEWGVKRIGDVLT 235 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + K+ +E ++ N + D ++ + Sbjct: 236 IAHGKSQHAVEDRNGIYPILATGGQIGVAN----------CFLYDKPSVLIGRKGTIDR- 284 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + + +V ++ +L + D + A G SL + Sbjct: 285 ---PQYMEQPFWTVDTLFYSVIHKQNNAKFLFYRFCLIDWKQYNEASG---VPSLNARTI 338 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + VP EQ I +++ A I L E + + + + +TG+I L Sbjct: 339 ESIEIKVPFEDEQVAIAAILSDMDAEISAL----EDKLAKTRAIKQGMMRNLLTGRIRL 393 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 45/211 (21%), Positives = 83/211 (39%), Gaps = 11/211 (5%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K VG++P+ WEVK ++ N K + S+ Y + K + N +K Sbjct: 20 KGYKQTEVGVIPEDWEVKTVGSVAAYANGKAHEGSISD--FGKYIVVNSKFISTNGKVKK 77 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI--ITSAYMAVKPHGIDSTYLAW 335 S + + EI+ D+ N + R V + + ++P+ I+ + Sbjct: 78 YSDDCFSPTSESEILMVMSDVPNGRAIARCFFVDHNDLYTVNQRICVLRPNQINGRLFYY 137 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVE 394 + + F + +L+ DV P+ +PP EQ I ++ DV +E Sbjct: 138 KLNRHPFYLSFDDGVK--QTNLRKNDVLSCPLTIPPTKAEQEAIAEALSDA----DVFIE 191 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +EQ I + + + +TG+ L G S Sbjct: 192 SLEQLIAKKRHIKQGAMQERLTGKKRLPGFS 222 >gi|281418674|ref|ZP_06249693.1| restriction modification system DNA specificity domain protein [Clostridium thermocellum JW20] gi|281407758|gb|EFB38017.1| restriction modification system DNA specificity domain protein [Clostridium thermocellum JW20] Length = 504 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 60/394 (15%), Positives = 127/394 (32%), Gaps = 37/394 (9%) Query: 25 WKVVPIKRFT-KLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+V+P+K L TGR + G + I IG E +++ L + +T+ Sbjct: 124 WEVIPLKEVLLSLETGRRPQGGVSNINEGIPSIGGEHIDTDGSLKLDDMKYIPEEFFNTL 183 Query: 79 S--IFAKGQILYGKLGPYLRKAIIAD----FDGICSTQFL--VLQPKDVLPELLQGWLLS 130 + + IL K G K + + + +LP+ L L S Sbjct: 184 TTGVIEDNNILIVKDGATTGKVAYINNLPFEKAAVNEHVFLLKADTEKILPQFLFYVLYS 243 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +I +GA + NI +P+PPL Q + ++ + I Sbjct: 244 EYGQNQILMYKKGAAQGGITRDILDNIQIPLPPLPVQQELVARLDKQQAII--------- 294 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 E+ A+ I+ G++ DS E + + + K Sbjct: 295 ------EQCNAMEKAILEAGID------DSIFEGDWEWVELESLCNDILSGGTPSTKVEA 342 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + +I ++ +I + K + E + I I + + Sbjct: 343 YWKGSIPWITSADI--QGIYEINVRKFITEEAVENSTTKIIPANNIIVATRVGLGKLCLN 400 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 I+ + + Q + + +K + + +P Sbjct: 401 KFDVCISQDCQGLIIKENVIPEFMLFALYNRVQSFKQESQGSTVQGVTKDHLKAIKIPLP 460 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 PI++Q +I + ++++ ++ + E + +K Sbjct: 461 PIEKQQEIVDFLDIQFKALNNIRRLKENAKQTIK 494 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 60/167 (35%), Gaps = 9/167 (5%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ V ++ + +G T + + I +I D++ + K + S Sbjct: 317 DWEWVELESLCNDILSGGTPSTKVEAYWKGSIPWITSADIQGIYEINVRKFITEEAVENS 376 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 T I I+ L K + FD S L K+ + + L V Sbjct: 377 TTKIIPANNIIVAT-RVGLGKLCLNKFDVCISQDCQGLIIKENVIPEFMLFALYNRVQS- 434 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + +G+T+ + I +P+PP+ +Q I + + + ++ Sbjct: 435 FKQESQGSTVQGVTKDHLKAIKIPLPPIEKQQEIVDFLDIQFKALNN 481 >gi|227529076|ref|ZP_03959125.1| type I restriction-modification system specificity subunit [Lactobacillus vaginalis ATCC 49540] gi|227351088|gb|EEJ41379.1| type I restriction-modification system specificity subunit [Lactobacillus vaginalis ATCC 49540] Length = 382 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 62/394 (15%), Positives = 129/394 (32%), Gaps = 34/394 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + + +G+ ++ + G G + S + Sbjct: 20 DWEQRKLGKLVAVKSGKDYKT-----------LNKGDIPVFGTGGYITSVNKS---LSDV 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I G+ G + I+ T F ++ + V + + +++ E Sbjct: 66 NAIGLGRKGTINKPYILKAPFWTVDTLFFLVPTQQVRLNFVYSLIQNVN----WLKYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + K I NI + EQ I + + + + +L K Q L+ Sbjct: 122 TGLPSLSKKNIQNILVFSTNYEEQNNIGNLLNLLEKLLSLQQRKLRQLKQLKKAMLQQLL 181 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL--SLSY 261 L P+V+ D W + ++ + K + IL S + Sbjct: 182 VSK-KDRLTPNVRFSDFSGSW--------KKCKLGEVIQDYTEKTIVENQYPILTSSQQH 232 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G I+Q + Y I+ G +R ND ++RGII+ Y Sbjct: 233 GIILQNEYFSGSRVSKTGNIGYFILPRGYFAYRNRS-DNDTYVFNRNDCIDRGIISRFYP 291 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 KP+ DS +L + + ++ A + L ++ K + P ++EQ I + Sbjct: 292 VFKPYNADSNFLLIRLNNGLRKELSLASEGTGQHVLSLKNFKNIQTQFPNLEEQHKIGDF 351 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I+ ++ L+ ++ +L + + Sbjct: 352 IST----LNSLIALHQRKANILSNLKKFLLQKLF 381 >gi|93007188|ref|YP_581625.1| restriction modification system DNA specificity subunit [Psychrobacter cryohalolentis K5] gi|92394866|gb|ABE76141.1| restriction modification system DNA specificity domain [Psychrobacter cryohalolentis K5] Length = 413 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 50/422 (11%), Positives = 121/422 (28%), Gaps = 34/422 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS-DTST 77 + W+ I K+ G +S + I ++ ++ D + + Sbjct: 3 EDWREYTIDDVAKIINGYAFKSKDFISSGVPIIKIKSLKDKMLVIDNGDFVDKDFLKLNE 62 Query: 78 VSIFAKGQILYGKLGPYLR---------KAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + G ++ + + + + KD + + + Sbjct: 63 KYHIQYDDFVIAMTGSHITLPSSAVGRVAKSRHKEKLLLNQRVGKFKVKDKICDHNFLYY 122 Query: 129 L---SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 ++ IG+I + +PPL Q I + A I+ I Sbjct: 123 FLTTDYFFQNVGLRAKGAGNQANISNGDIGSIKIHLPPLPTQQKIASILSAYYDLIENNI 182 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW-VGLVPDHWEVKPFFALVTEL 244 E+ Q + + P+ + E + + + + +T+ Sbjct: 183 RRIELLE----EQAQLIYEEWFVRKKFPNYENTQIDAETGLPEGWEKKGLDYLCSKITDG 238 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGL-----KPESYETYQIVDPGEIVFRFIDLQ 299 + K + ++ ++ + + E ++ G+I+F I Sbjct: 239 THDSPKQVNHGCYLVTGKHLNKGIIDFESAYQISIEDHEKIRKRSGIEKGDILFSNIGTL 298 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + + + + K + DS +L + K+ ++ Sbjct: 299 GN---IGVVTEDFEYSCKNVVIFKKKNCFDSFLYCYLTNPINKIKLDNQSSGVAQKFYSL 355 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ++R P Q + + I L K+ Q LL+E R + + G I Sbjct: 356 SFIRRFQDFFP----QEPLIKKFDEIVQPIFELKYKLHQQNQLLQEARDILLPRLMMGII 411 Query: 420 DL 421 ++ Sbjct: 412 EV 413 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 71/210 (33%), Gaps = 10/210 (4%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIY----IGLEDVESGT 60 K +P Y+++ + +P+ W+ + T T +S K + + + + + G Sbjct: 203 KKFPNYENTQIDAETGLPEGWEKKGLDYLCSKITDGTHDSPKQVNHGCYLVTGKHLNKGI 262 Query: 61 GKYLPKDGNSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 + S + S KG IL+ +G ++ + ++ + K+ Sbjct: 263 IDFESAYQISIEDHEKIRKRSGIEKGDILFSNIGTLGNIGVVTEDFEYSCKNVVIFKKKN 322 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L +L + +++ G + Q + +K Sbjct: 323 CFDSFLYCYLTNPINKIKLDNQSSGVAQKFYSL----SFIRRFQDFFPQEPLIKKFDEIV 378 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVT 208 I L + + +LL+E + L+ ++ Sbjct: 379 QPIFELKYKLHQQNQLLQEARDILLPRLMM 408 >gi|297528757|ref|YP_003670032.1| restriction modification system DNA specificity domain protein [Geobacillus sp. C56-T3] gi|297252009|gb|ADI25455.1| restriction modification system DNA specificity domain protein [Geobacillus sp. C56-T3] Length = 404 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 62/387 (16%), Positives = 130/387 (33%), Gaps = 37/387 (9%) Query: 45 GKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF 103 I + +++ + + + +++ S G IL+ K+G A + D Sbjct: 38 DDGIPVLQGKNISNFQFNFSDIRYITPQKAQELIRSKVEVGDILFVKIGSIGYSAEVTDL 97 Query: 104 DGI------CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 +G + + + V L WL S V ++ I I Sbjct: 98 NGYPFAIIPANLAKVSIDYSKVDKNYLLFWLRSDTVVNYLKKNASKTAQPALSLGKIKQI 157 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 P+ +P L Q I ++ ID + +L + + VT + Sbjct: 158 PVVMPSLETQKKISAVLLKAQELIDKRKAQIEALDQLTQSVFLEMFGDPVTNKTWERRPL 217 Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ-KLETRNMGLK 276 KD V + + K + + ++ NI K++ N+ Sbjct: 218 KDIA------------------DVRDGTHDSPKYVPNGYPLVTSKNIKNGKIDLSNVNYI 259 Query: 277 PES----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 E VD G+I+ I + + + I A + + + Y Sbjct: 260 SEEDFININKRSKVDVGDIIMPMIGTIGNP--IIVDEQPNFAIKNVALIKFNNPLVVNIY 317 Query: 333 LAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L +L+ S+ L + G ++ L D++ + + +PP Q + ++ +ID Sbjct: 318 LKYLLDSHYLDYILNKNKRGGTQKFLSLTDIRNMEIPLPPRDLQDKFSEIV----KKIDS 373 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +S+ L++ +S + A G+ Sbjct: 374 QKSILHKSLRELEKNFNSLMQRAFKGE 400 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 64/170 (37%), Gaps = 11/170 (6%) Query: 247 KNTKLIESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 K ++ I L N + R + + V+ G+I+F I Sbjct: 32 KVKDYVDDGIPVLQGKNISNFQFNFSDIRYITPQKAQELIRSKVEVGDILFVKIGSIGYS 91 Query: 303 RSLRSAQVMERGII--TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKF 359 + II A +++ +D YL + +RS + S + +L Sbjct: 92 AEVTDLNGYPFAIIPANLAKVSIDYSKVDKNYLLFWLRSDTVVNYLKKNASKTAQPALSL 151 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +K++PV++P ++ Q I+ V+ + L++K + I L + S Sbjct: 152 GKIKQIPVVMPSLETQKKISAVL----LKAQELIDKRKAQIEALDQLTQS 197 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 66/197 (33%), Gaps = 11/197 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQS--DTST 77 K W+ P+K + G + +++++G + S + + + Sbjct: 210 KTWERRPLKDIADVRDGTHDSPKYVPNGYPLVTSKNIKNGKIDLSNVNYISEEDFININK 269 Query: 78 VSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S G I+ +G I+ + I + + V+ L+ L S + Sbjct: 270 RSKVDVGDIIMPMIGTIGNPIIVDEQPNFAIKNVALIKFNNPLVVNIYLKYLLDSHYLDY 329 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + G T I N+ +P+PP Q E +ID+ + + + L Sbjct: 330 ILNKNKRGGTQKFLSLTDIRNMEIPLPPRDLQDKFSEI----VKKIDSQKSILHKSLREL 385 Query: 196 KEKKQALVSYIVTKGLN 212 ++ +L+ L Sbjct: 386 EKNFNSLMQRAFKGELF 402 >gi|281424443|ref|ZP_06255356.1| type I restriction enzyme EcoAI specificity protein [Prevotella oris F0302] gi|281401429|gb|EFB32260.1| type I restriction enzyme EcoAI specificity protein [Prevotella oris F0302] Length = 382 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 44/298 (14%), Positives = 92/298 (30%), Gaps = 9/298 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P++W+ + +G T G +I ++ D+ G +P + Sbjct: 70 EVPENWEWTTLGEIGTWQSGATPSRLRKDYYGGNIPWLKTGDLNDGLITDIPDFITQKAL 129 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + ++V + G IL G + K I F + D E + + + Sbjct: 130 EETSVKLNPIGSILIAMYGATIGKIGILTFPATTNQACCAC--SDYKIEQMYLFYFLLAN 187 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + A+ G + + I MP+PPL EQ I +I ID + + Sbjct: 188 KKVFIAMGGGGAQPNISKEKIAVTFMPLPPLTEQQRIVVEIERWFKLIDAIDQSKAHLQT 247 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKLI 252 + + K ++ + L P + E + + P ++ + I Sbjct: 248 TITQTKSKILDLAIHGKLVPQDPNDEPASELLKRINPKAEIACDNEHSRKLHSKGWVQCI 307 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 +++ ++ G I F K + ++ V Sbjct: 308 LNDVFTIIMGQSPDGNSINEKNGIEFHQGKLFFSSEETIKISFYTTSPIKIAKPNSLV 365 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 64/200 (32%), Gaps = 13/200 (6%) Query: 227 LVPDHWEVKPFFALVTELNRKNT-----KLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 VP++WE + T + NI L G++ L T + Sbjct: 70 EVPENWEWTTLGEIGTWQSGATPSRLRKDYYGGNIPWLKTGDLNDGLITDIPDFITQKAL 129 Query: 282 TYQIVD---PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 V G I+ K + + A A + I+ YL + + Sbjct: 130 EETSVKLNPIGSILIAMYGATIGKIGILTF----PATTNQACCACSDYKIEQMYLFYFLL 185 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + G G + ++ E + + +PP+ EQ I I ID + + Sbjct: 186 ANK-KVFIAMGGGGAQPNISKEKIAVTFMPLPPLTEQQRIVVEIERWFKLIDAIDQSKAH 244 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 + + +S + A+ G+ Sbjct: 245 LQTTITQTKSKILDLAIHGK 264 Score = 37.5 bits (85), Expect = 4.5, Method: Composition-based stats. Identities = 9/73 (12%), Positives = 20/73 (27%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W + + G++ + G+E + + TS + I Sbjct: 301 KGWVQCILNDVFTIIMGQSPDGNSINEKNGIEFHQGKLFFSSEETIKISFYTTSPIKIAK 360 Query: 83 KGQILYGKLGPYL 95 ++ P Sbjct: 361 PNSLVLCVRAPVG 373 >gi|171779404|ref|ZP_02920368.1| hypothetical protein STRINF_01249 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] gi|171282021|gb|EDT47452.1| hypothetical protein STRINF_01249 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] Length = 416 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 52/405 (12%), Positives = 120/405 (29%), Gaps = 25/405 (6%) Query: 24 HWKVVPIKRFTK-----LNTGRTSES-GKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTS 76 W+ + + T + Y+ +V+ G N + Sbjct: 19 DWEQRKLSDIYRDIGNAFVGTATPYYVEEGHFYLESNNVKDGQINHNTEVFINDEFYEKQ 78 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSI-DV 133 G ++ + G A+I + + PK + + Sbjct: 79 KDKWLHTGDMVMVQSGHVGHAAVIPEELDCSAAHALIMFRNPKFKIEPYFLNYQYQTVKA 138 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++IE I G T+ H + + + EQ I +D+LIT R + Sbjct: 139 KKKIENITTGNTIKHILASEMQKFIVDVASYDEQEKIAGF----FSHLDSLITLHQRKLN 194 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LK K+A++ + K +++ SG ++ +E Sbjct: 195 GLKNVKKAMLEKMFPKNGESVPEIRFSGFTDDWEQRKLSDIYRDIGNAFVGT-ATPYYVE 253 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 L N+ N + + + + G++V ++ + Sbjct: 254 EGHFYLESNNVKDGQINHNTEVFINDEFYEKQKDKWLHTGDMVMVQSGHVGH-AAVIPEE 312 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 + I+ +L + ++ K + +G + + ++++ V Sbjct: 313 LDCSAAHALIMFRNPKFKIEPYFLNYQYQTVKAKKKIENITTGNTIKHILASEMQKFIVD 372 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 V EQ I + +D L+ ++ + LK + + + Sbjct: 373 VASYDEQEKIAGF----FSHLDSLITLHQRKLDKLKTVKKAMLEK 413 >gi|24375748|ref|NP_719791.1| type I restriction-modification system, S subunit [Shewanella oneidensis MR-1] gi|24350691|gb|AAN57235.1|AE015859_4 type I restriction-modification system, S subunit [Shewanella oneidensis MR-1] Length = 495 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 69/455 (15%), Positives = 151/455 (33%), Gaps = 62/455 (13%) Query: 21 IPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVE----SGTGKYLPKDGNS 70 IP+ W + + +G ++ D + DV S G Sbjct: 5 IPEGWFSAVLGNAVDVKSGVGFPKKYQGKNSGDYPVYKVGDVSIAVTSKYGGLSEAGHYV 64 Query: 71 RQSDTSTVS--IFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 QS+ + IF +G L+ K+G L + + +G+ + + P + Sbjct: 65 SQSEAEELKGVIFREGTTLFAKIGEAVKLNRRAFVERNGLADNNVMAVVPNYTEMDRFIY 124 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + + T+ I + + PPLAEQ +I +K+ ++++ Sbjct: 125 YFMRTVNLSDVS---RSTTVPSVRKGDIEELVISYPPLAEQKVIADKLDELLGQVESTKA 181 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLN---------------------PDVKMKDSG---- 221 +LK +Q++++ V+ L +K G Sbjct: 182 RLDAIPAILKSFRQSVLAAAVSGKLTEKWRDRNNSEMVHGGELYSLAKKHHLKFYGKKYK 241 Query: 222 ---------IEWVGLVPDHWEVKPFFALVTELNRK---NTKLIESNILSLSYGNIIQKLE 269 +E + + V +E+ ++ I + +I Sbjct: 242 APEPLDLRMLETLPQGWVYGVVSHLVEPGSEIMYGIVQPGPKLDEGIPYVRGTDIQNGQI 301 Query: 270 TRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT-SAYMAVK 324 + +K + + +I+ I K ++ ++ I +A + V Sbjct: 302 LVHQLMKTSPEIAKKYERATLSGNDILLGIIRAT--KVAIVPDELKGANITQGTARLRVF 359 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + YLA + S + ++ G+ L +DV+RLP+ +P +EQ +I + Sbjct: 360 EGVLTYKYLAIYLESPKVQSWLHSNYRGIDMPGLNLKDVRRLPIALPSKEEQTEIVRRVE 419 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 D + ++ + + + S +A A G+ Sbjct: 420 DLFVFADKVEAQVNAAQLRVNNLTQSILAKAFRGE 454 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 27/210 (12%), Positives = 69/210 (32%), Gaps = 11/210 (5%) Query: 18 IGAIPKHWKVVPIKRFTK----LNTGRT---SESGKDIIYIGLEDVESGTGKYLPK-DGN 69 + +P+ W + + + G + + I Y+ D+++G + Sbjct: 251 LETLPQGWVYGVVSHLVEPGSEIMYGIVQPGPKLDEGIPYVRGTDIQNGQILVHQLMKTS 310 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVLPELLQ--G 126 + + + IL G + + + G + L+ + + Sbjct: 311 PEIAKKYERATLSGNDILLGIIRATKVAIVPDELKGANITQGTARLRVFEGVLTYKYLAI 370 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L S V + + G M + K + +P+ +P EQ I ++ V D + Sbjct: 371 YLESPKVQSWLHSNYRGIDMPGLNLKDVRRLPIALPSKEEQTEIVRRVEDLFVFADKVEA 430 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 + + Q++++ L + + Sbjct: 431 QVNAAQLRVNNLTQSILAKAFRGELTAEWR 460 >gi|257437916|ref|ZP_05613671.1| putative toxin-antitoxin system, toxin component [Faecalibacterium prausnitzii A2-165] gi|257199576|gb|EEU97860.1| putative toxin-antitoxin system, toxin component [Faecalibacterium prausnitzii A2-165] Length = 381 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 54/402 (13%), Positives = 127/402 (31%), Gaps = 39/402 (9%) Query: 29 PIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + +N T ++ +I + V G++ + + + F G IL Sbjct: 3 KLGEVCLINPKSCTLRDDTEVSFIPMTKVGEH-GEFDASEIKNYSEVKKGFTNFQNGDIL 61 Query: 88 YGKLGPYLRKA------IIADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIEA 139 + K+ P + + + G ST+F VL+P E L + E Sbjct: 62 FAKITPCMENGKGAIAHNMKNGIGFGSTEFHVLRPDTDKITSEWLYYLTTWKAFRKEAER 121 Query: 140 ICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ + N + +P + Q + + I + + EL Sbjct: 122 NMTGSAGQKRVPKTFLENYVVNLPDIDTQKSENKILRKVDDLIFLRKQQLAKLDEL---- 177 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + V + +V K +G + + R + + I Sbjct: 178 ---VKARFVEMFGDINVNNKKWMTYPLGEL--------CTIVRGGSPRPIERYLGGTIPW 226 Query: 259 LSYGNIIQKLETR----NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + G+ + E + +++ G ++F + + + G Sbjct: 227 IKIGDATTGENIYLNSTKEYIIQEGVKKSRMIKAGSLIFANCGVSLGFARIITFD----G 282 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIK 373 I ++A++ + L + + F +G + +L +K +VPP++ Sbjct: 283 CIHDGWLAMEDIDEKLDKIFLLYSLNQMTEYFRKTAPAGTQPNLNTNIMKMHRQIVPPME 342 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q + + D + ++QS+ L+ + + + Sbjct: 343 MQKAFISFVKCA----DRQKQIVQQSLEKLELMKKALMQEYF 380 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 60/198 (30%), Gaps = 9/198 (4%) Query: 15 VQWIGAIPKH---WKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLP- 65 V+ G I + W P+ + G + G I +I + D +G YL Sbjct: 183 VEMFGDINVNNKKWMTYPLGELCTIVRGGSPRPIERYLGGTIPWIKIGDATTGENIYLNS 242 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 Q + G +++ G L A I FDG +L ++ D + + Sbjct: 243 TKEYIIQEGVKKSRMIKAGSLIFANCGVSLGFARIITFDGCIHDGWLAMEDIDEKLDKIF 302 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +T+ T + + + +PP+ Q + + + Sbjct: 303 LLYSLNQMTEYFRKTAPAGTQPNLNTNIMKMHRQIVPPMEMQKAFISFVKCADRQKQIVQ 362 Query: 186 TERIRFIELLKEKKQALV 203 + + K Q Sbjct: 363 QSLEKLELMKKALMQEYF 380 >gi|317405485|gb|EFV85794.1| type I restriction-modification system [Achromobacter xylosoxidans C54] Length = 801 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 56/491 (11%), Positives = 129/491 (26%), Gaps = 102/491 (20%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG--------KDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P+ W+ V + ++ G T + + +++ + Sbjct: 87 LPQGWEWVRLGELAEIIRGVTYSKSQSNEIRFHDSVELLRANNIQ-DVINFQGTVFVPIS 145 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELLQG- 126 + I G IL A+ + V++P Sbjct: 146 LVSEAQKI-KNGDILIAMSSGSSHLVGKAAQFNANRECTFGAFCAVIRPLYASQFEYFRI 204 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR------ 180 + + + +G + + + + + N+ + PP+ EQ I KI R Sbjct: 205 FSKTPLYRSQTRQEGKGIGIQNLNKEALENLLVAAPPMDEQHRIVAKIDELMARCDKLEK 264 Query: 181 -----------------------------------IDTLITERIRFIELLKEKKQALVSY 205 + E E + E ++A++ Sbjct: 265 LRTAQQEARLTVHAAAIKQLLNIAKPGQHQRAQTFLAEHFGELYTVKENVAELRKAILQL 324 Query: 206 IVTKGLNPDVK-------------------------------MKDSGIEWVGLVPDHWEV 234 V L P + E +P W Sbjct: 325 AVMGKLVPQEPGDQPASKLLQEIEAEKQRLIEGGRIKIPKSLPPVTEEEKPYALPQGWVW 384 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET---------YQI 285 + L + + + +++ P+ + Sbjct: 385 ERLGNLALSSDSGWSPQCLPSARKGQEWGVLKVSAVSWGKFNPDENKALPASQNPRLDCE 444 Query: 286 VDPGEIVFRFIDLQNDK-RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLC 343 V G+ + + RS+ V +++ + + ++ Sbjct: 445 VKSGDFLISRANTDELVARSVVVDDVPPHLMMSDKIVRFTFSCNVNKTFLNIVNGVPYSR 504 Query: 344 KVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + SG +++ E + LPV +PP++EQ I I+ D L ++I+ +I Sbjct: 505 AYYMENASGTSSSMKNVSRETMSLLPVSLPPLQEQRRIVAKIDELKDFCDFLEQQIDAAI 564 Query: 401 VLLKERRSSFI 411 E ++ + Sbjct: 565 SKQVELLNALM 575 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 43/250 (17%), Positives = 69/250 (27%), Gaps = 52/250 (20%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNR-------KNTKLIESNILSLSYGNIIQKLETRNM 273 E +P WE L + N ++ L NI + + Sbjct: 80 EEEKPYSLPQGWEWVRLGELAEIIRGVTYSKSQSNEIRFHDSVELLRANNIQDVINFQGT 139 Query: 274 GLKPESY-ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 P S Q + G+I+ + + R A+ AV S + Sbjct: 140 VFVPISLVSEAQKIKNGDILIAMSSGSSHLVGKAAQFNANRECTFGAFCAVIRPLYASQF 199 Query: 333 LAW--LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + ++ G G+ Q+L E ++ L V PP+ EQ I I+ AR Sbjct: 200 EYFRIFSKTPLYRSQTRQEGKGIGIQNLNKEALENLLVAAPPMDEQHRIVAKIDELMARC 259 Query: 390 DVLVEKIEQS-----------IVLL------------------------------KERRS 408 D L + I L E R Sbjct: 260 DKLEKLRTAQQEARLTVHAAAIKQLLNIAKPGQHQRAQTFLAEHFGELYTVKENVAELRK 319 Query: 409 SFIAAAVTGQ 418 + + AV G+ Sbjct: 320 AILQLAVMGK 329 >gi|99078515|ref|YP_611773.1| restriction modification system DNA specificity subunit [Ruegeria sp. TM1040] gi|99035653|gb|ABF62511.1| type I restriction-modification system; S subunit [Ruegeria sp. TM1040] Length = 387 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 49/397 (12%), Positives = 120/397 (30%), Gaps = 22/397 (5%) Query: 30 IKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + ++ G T + DI + ++D +S + S + Sbjct: 6 LGELVEIRGGGTPDKKVPDYWDGDIPWASVKDFKSTSLASTIDRITQAGVANSATQVIPA 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G I+ KA I + D + L P + + + +E G Sbjct: 66 GNIIVPTRMAV-GKAAINEIDLAINQDLKALIPSQRIDRQ-YLLHALLANAKTLEDQATG 123 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ + ++ +P+PPL EQ I + + L QA+ Sbjct: 124 ATVKGIKLDALRSLQIPLPPLQEQRRIAGILDQADALRRFRTRALDKLGTLG----QAIF 179 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + +PD E + L V + + + + ++ Sbjct: 180 HEMFGAS-SPDHAAW----EKINLSELVLPDDRINYGVVQPGPHDPEGVPIIRVADLASP 234 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 ++ + + ++ + GE++ + A + + Sbjct: 235 VVAFDSIKRIAPSIDAEYGRSRLKGGEVLIGCVGSIGTTIIAPPEFAGANVARAVARVPL 294 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + ++A +RS + F + +L + ++ +++PP + Q + Sbjct: 295 DTSRCEPRFVAEQLRSQRIQNYFTKEVRLVAQPTLNIKQIRETEIILPPKELQVSFVERV 354 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + I+ + ++ +S + A G++ Sbjct: 355 H----EIEAQKAQHAAALTACDVLFASLQSTAFRGEV 387 >gi|200386888|ref|ZP_03213500.1| putative type I restriction-modification system, S subunit [Salmonella enterica subsp. enterica serovar Virchow str. SL491] gi|199603986|gb|EDZ02531.1| putative type I restriction-modification system, S subunit [Salmonella enterica subsp. enterica serovar Virchow str. SL491] Length = 586 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 57/491 (11%), Positives = 134/491 (27%), Gaps = 96/491 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDV 56 +K K P+ S + +P W+ V + + N G T + ++ Sbjct: 83 IKKQKPLPEI--SEEEKPFELPVGWEWVRLGDIGETNIGLTYSPNNIKETGTPVLRSSNI 140 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFL 112 ++G + + S G +L + A+I + Sbjct: 141 QNGILDFTDL-VRVSGMEIKNSSYVEDGDLLICARNGSKTLVGKNALINSLSEPMAFGAF 199 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV---- 168 + + ++ +L S + ++ + T++ + + +P PP EQ Sbjct: 200 MAIFRCSYNNYVKIFLDSPSFRRNLDGVDT-TTINQITQSNLKHTLIPFPPEIEQEKIKN 258 Query: 169 -------------------------------------LIREKIIAETVRIDTLITERIRF 191 +++ RI Sbjct: 259 TVFELISLCDQLEQHSLTSLDAHQQLVETLLTKLTDSQNADELAENWARISEHFDTLFTT 318 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKD-------------------------------S 220 + KQ ++ V L P + S Sbjct: 319 EASIDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKDGKIKKQKPLPPIS 378 Query: 221 GIEWVGLVPDHWEVKPFFALVT----ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 E +P+ WE L + + ++++ L N+ + + + Sbjct: 379 DEEKPFELPEGWEWCCINDLTFVSGGIQKQPKRRPVKNHFPYLRVANVQRGNINIDELER 438 Query: 277 PESYE---TYQIVDPGEIVF--RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DS 330 E T+ + +I+ R +E+ + + + V+ Sbjct: 439 FELESHELTFWSLKKNDILIVEGNGSADEIGRCAIWLAPIEKCVYQNHLIRVRGIMEGYQ 498 Query: 331 TYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++A + S K + + +L ++ + + +PP+ +Q I + I Sbjct: 499 EFIALYLNSPSGIKEMQRLAVTTSGLYNLSVGKIRGIKIPLPPLNQQNLILSKIREYIFI 558 Query: 389 IDVLVEKIEQS 399 D L I+ + Sbjct: 559 CDNLKISIQSA 569 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 23/200 (11%), Positives = 56/200 (28%), Gaps = 5/200 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRN--MG 274 S E +P WE + + E+ L NI + + Sbjct: 93 SEEEKPFELPVGWEWVRLGDIGETNIGLTYSPNNIKETGTPVLRSSNIQNGILDFTDLVR 152 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + V+ G+++ + + + + Y+ Sbjct: 153 VSGMEIKNSSYVEDGDLLICARNGSKTLVGKNALINSLSEPMAFGAFMAIFRCSYNNYVK 212 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + S + + + + ++K + PP EQ I N + + D L + Sbjct: 213 IFLDSPSFRRNLDGVDTTTINQITQSNLKHTLIPFPPEIEQEKIKNTVFELISLCDQLEQ 272 Query: 395 KIEQSIVLLKERRSSFIAAA 414 S+ ++ + + Sbjct: 273 HSLTSLDAHQQLVETLLTKL 292 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 61/200 (30%), Gaps = 20/200 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+ I T ++ G + Y+ + +V+ G + +S Sbjct: 385 ELPEGWEWCCINDLTFVSGGIQKQPKRRPVKNHFPYLRVANVQRGNINIDELERFELESH 444 Query: 75 TSTVSIFAKGQILY----GKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGW 127 T K IL G R AI + + V + E + + Sbjct: 445 ELTFWSLKKNDILIVEGNGSADEIGRCAIWLAPIEKCVYQNHLIRVRGIMEGYQEFIALY 504 Query: 128 LLSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L S + + + + + I I +P+PPL +Q +I I Sbjct: 505 LNSPSGIKEMQRLAVTTSGLYNLSVGKIRGIKIPLPPLNQQ-------NLILSKIREYIF 557 Query: 187 ERIRFIELLKEKKQALVSYI 206 ++ +Q + Sbjct: 558 ICDNLKISIQSAQQTQLHLA 577 >gi|312879438|ref|ZP_07739238.1| restriction modification system DNA specificity domain [Aminomonas paucivorans DSM 12260] gi|310782729|gb|EFQ23127.1| restriction modification system DNA specificity domain [Aminomonas paucivorans DSM 12260] Length = 392 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 68/384 (17%), Positives = 121/384 (31%), Gaps = 32/384 (8%) Query: 24 HWKVVPIKRFTKLNT--GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 WK+V K R E+ +GLE ++ + + TS F Sbjct: 9 GWKMVKFGEVVKNANLAERDPEAHGIERIVGLEHLDPENLHI--RRWDPVSEGTSFTRRF 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIE 138 GQ L+GK Y RK A+F+GICS L +PKD LPELL S Sbjct: 67 VPGQTLFGKRRAYQRKVAYAEFEGICSGDILTFEPKDRKVLLPELLPWICQSNAFFDHAL 126 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ W + N P+PPL EQ I E + A + + + Sbjct: 127 GTSAGSLSPRTSWTALKNFEFPLPPLEEQKRIAEILWAADEAVSAYQEALTLIHITAQTR 186 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + ++ + L S ++ + P+ P ++ +LS Sbjct: 187 LEHTLNTLNCSEL--------SLLDVLSGSPESGCSAP----------PSSNETGHWVLS 228 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 L+ + + + + + G+++ + + + Sbjct: 229 LAALSANGYVRGNLKPVAKTNKMVACTLSKGDLLISRSNTIDLVGFAGIFNEDRPDVSFP 288 Query: 319 AYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPI 372 + P YL ++ S + SG + + + + VP + Sbjct: 289 DTIIRLPVNTQKALPDYLELVLLSNRGRRHMMKTASGTSSSMKKINRKILFEFKFPVPGL 348 Query: 373 KEQFDITNVINVETARIDVLVEKI 396 Q I N + R+ + Sbjct: 349 DTQERIVTEFNEQ-KRLRDAIANH 371 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 46/141 (32%), Gaps = 12/141 (8%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 L R E + PG+ +F K + + GI + + +P Sbjct: 47 NLHIRRWDPVSEGTSFTRRFVPGQTLFGKRRAYQRKVAYAEFE----GICSGDILTFEPK 102 Query: 327 GI---DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 L W+ +S +G + +K +PP++EQ I ++ Sbjct: 103 DRKVLLPELLPWICQSNAFFDHALGTSAGSLSPRTSWTALKNFEFPLPPLEEQKRIAEIL 162 Query: 383 NVETARIDVLVEKIEQSIVLL 403 D V ++++ L+ Sbjct: 163 WAA----DEAVSAYQEALTLI 179 >gi|269978370|gb|ACZ55919.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 412 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 57/406 (14%), Positives = 125/406 (30%), Gaps = 31/406 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +PK + + ++ G+ + + GKY G Sbjct: 12 VPKGVEFRKLGEVCEIIRGKRVTKKEIL----------DKGKYPVVSGGIGFMGYLNEYN 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + I + G + + + PK+ L ++L+ Sbjct: 62 REENTITIAQYGT-AGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSIS 120 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR---IDTLITERIRFIELLKE 197 A I I +PIPPL Q I + + A T ++T + ++ + E Sbjct: 121 NRSAIPYSISSNNIMQITIPIPPLEIQQEIVKILDAFTELNTELNTELNTELKARKKQYE 180 Query: 198 KKQALVSYIVTKGLN-------PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 Q ++ LN K L P E + + N+K K Sbjct: 181 YYQNMLLDFKDIYLNHKDAKMSAKTYPKRLKTLLQTLAPKGVEFRKLGEVCESTNKKTLK 240 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + E + + + G + + GE + + + Sbjct: 241 ISEVSEVKNKGMYPVINSGRDLYGYYHDFN------NDGENITIASRGEYAGFINYFNEK 294 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 G + Y + + + +L + +++ ++ + + G +L D++ L + +P Sbjct: 295 FFAGGLCYPYKVKDTNELLTKFLYFYLKTNEIQIMENLVFRGSIPALNKADIETLTIPIP 354 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 P++ Q +I +++ + L+ I I K+ R + Sbjct: 355 PLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 400 >gi|301801713|emb|CBW34419.1| putative type I RM modification enzyme [Streptococcus pneumoniae INV200] Length = 373 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 44/400 (11%), Positives = 117/400 (29%), Gaps = 39/400 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + L+ Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNLLV-------- 170 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 K E PD + + + + N + + Sbjct: 171 --------------KSRFNEMFEEYPDSVFLDTYIKELRAGKSLAGEENNKNKVLKTGAV 216 Query: 264 IIQKLETRNMGLKPESYE--TYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSA 319 + + P Y V+ G+++ ++ A + + Sbjct: 217 SYDYFNSSEVKNLPIDYIPLDEHKVEIGDVIISRMNTSELVGAAGYVWAINSDNIYLPDR 276 Query: 320 YMAVKPHGIDSTYLAWLM----RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 V + + W + ++ K + SG +++ + ++ V PP+ Q Sbjct: 277 LWKVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRVPFPPLALQ 336 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + + A +D I++S+ L+ + S + Sbjct: 337 NEFADFV----ALVDKSQLAIQKSLEELETLKKSLMQEYF 372 >gi|317485045|ref|ZP_07943927.1| type I restriction modification DNA specificity domain-containing protein [Bilophila wadsworthia 3_1_6] gi|316923580|gb|EFV44784.1| type I restriction modification DNA specificity domain-containing protein [Bilophila wadsworthia 3_1_6] Length = 432 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 53/401 (13%), Positives = 118/401 (29%), Gaps = 32/401 (7%) Query: 25 WKVVPIKRFT---KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ +P+ + T R S S D++ I + +++ G + + + D + Sbjct: 34 WRNLPLSKICHAMTYGTARKSSSEGDVVVIRMGNLQGGEIIWSKLAYTTARDDIEKY-LL 92 Query: 82 AKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLP----ELLQGWLLSIDVT 134 + G IL+ + + +I +L+ D L + Sbjct: 93 SPGDILFNRTNSPELVGKTSIYRGERPAIYAGYLIRLDYDKNIIIGEYLNYVMNSQEERQ 152 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G ++ + K IG +P+PP+ EQ I + ++ + L Sbjct: 153 FCADVRVNGVCQANINAKKIGAFSIPVPPIDEQQYIVSCLNELLPLVEEYGKSQSALHVL 212 Query: 195 LKEKK----QALVSYIVTKGLNPD---VKMKDSGIEWVGL----VPDHWEVKPFFALVTE 243 E +L+ + L P D E +P+ W+ + Sbjct: 213 ETELPGKLRASLLQQAIMGKLVPQLDDEPAVDIDAEEPEEVPFAIPEKWKWVRLRDIGAI 272 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + K + S + + + K S I G + + L Sbjct: 273 FSGATPKTNVTEYWSPAIVPWVTPADLGKNKKKTISCGERSISKKGYLSCSAVLLPKGSV 332 Query: 304 SLR-------SAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLR 354 A ++ P+ S Y+ + + + + Sbjct: 333 VYSSRAPIGHIAITENELATNQGCKSIAPNFEIVLSEYVYYGLIALTP-DIQSRASGTTF 391 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + +PP+ EQ I +N ++ +++ Sbjct: 392 LEISSKKFGETFFPLPPLAEQRRIITRLNELLPYLNSMIKN 432 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 85/227 (37%), Gaps = 12/227 (5%) Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 +L++Y + GL+ + + + + K A+ RK++ + ++ + Sbjct: 9 SLITYAMKGGLSASWRKEHNYSFELWRNL--PLSKICHAMTYGTARKSSSEGDVVVIRMG 66 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + ++ ++ PG+I+F + + I Sbjct: 67 NLQGGEIIWSKLAYTTARDDIEKYLLSPGDILFNRTNSPELVGKTSIYRGERPAIYAGYL 126 Query: 321 MA--VKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + I YL ++M S + + + + ++ + + + VPPI EQ Sbjct: 127 IRLDYDKNIIIGEYLNYVMNSQEERQFCADVRVNGVCQANINAKKIGAFSIPVPPIDEQQ 186 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLK-----ERRSSFIAAAVTGQ 418 I + +N ++ K + ++ +L+ + R+S + A+ G+ Sbjct: 187 YIVSCLNELLPLVEEY-GKSQSALHVLETELPGKLRASLLQQAIMGK 232 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 67/177 (37%), Gaps = 11/177 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYL---PKDGNS 70 IP+ WK V ++ + +G T ++ + ++ D+ K + + + Sbjct: 257 IPEKWKWVRLRDIGAIFSGATPKTNVTEYWSPAIVPWVTPADLGKNKKKTISCGERSISK 316 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + + + KG ++Y P AI + + + P + + Sbjct: 317 KGYLSCSAVLLPKGSVVYSSRAPIGHIAITENELA-TNQGCKSIAPNFEIVLSEYVYYGL 375 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I +T I++ G T K G P+PPLAEQ I ++ ++++I Sbjct: 376 IALTPDIQSRASGTTFLEISSKKFGETFFPLPPLAEQRRIITRLNELLPYLNSMIKN 432 >gi|298248250|ref|ZP_06972055.1| restriction modification system DNA specificity domain protein [Ktedonobacter racemifer DSM 44963] gi|297550909|gb|EFH84775.1| restriction modification system DNA specificity domain protein [Ktedonobacter racemifer DSM 44963] Length = 550 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 55/411 (13%), Positives = 125/411 (30%), Gaps = 32/411 (7%) Query: 25 WKVVPIKRFT-KLNTGRTSESGK---DIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVS 79 W V ++ T L +G + I + +V G + + Sbjct: 8 WPQVRLEEITTDLQSGFAQSPNETNQGIPQLRTNNVSAEGNLDLSDLIRVALPASEQDKY 67 Query: 80 IFAKGQILYGKLGP--YLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + KG I++ ++ K D + + S ++ + + + + Sbjct: 68 LLQKGDIIFNNTNSVEWVGKTAYFDLEEEFVFSNHMTRIRVDESIVNARFLARYLHYLWK 127 Query: 136 RIEAI---CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + + D + +P+PPL EQ I E + + + + + + Sbjct: 128 KGFSRSRSKQWVNQAAIDQSILALFKIPLPPLGEQQRIVEFLQQAEILRELRVVAKEKLK 187 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 ++L Y G+ ++ + P + P + + ++ Sbjct: 188 T----VYRSLFYYHFGSGMPTKQYPITIKLKDLLDEPLVYGYSP--SEIHDIPSGTPVFT 241 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 S I + + + +I+ + + + Sbjct: 242 LSAITDQGL-----NETQIKYTPESDYVGKGDDLKKDDILITRSNTSELVGKVARYRGKP 296 Query: 313 RGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPV 367 +I M + DS Y+ +RS + + G + + D+K + Sbjct: 297 SPVIYPDLMIRINLKNPQDSPYVENYLRSDAMTALIQRKARGTSGSMKKISQGDIKEFAI 356 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L PP + + E ID +E + S+ L+ S + A TGQ Sbjct: 357 LWPPEAARQAF----SREVELIDQQLETLSISLKQLETLFQSLLTCAFTGQ 403 >gi|313674356|ref|YP_004052352.1| restriction modification system DNA specificity domain [Marivirga tractuosa DSM 4126] gi|312941054|gb|ADR20244.1| restriction modification system DNA specificity domain [Marivirga tractuosa DSM 4126] Length = 505 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 63/467 (13%), Positives = 126/467 (26%), Gaps = 76/467 (16%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W + + + +N G++ S G ++ ++ T+ Sbjct: 3 EDWIEIELGKICNINMGQSPPSSTYNDKGEGMPFFQGKAEFTELYPVVKKWCTAPKKTAK 62 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 IL P I + P W + + ++ Sbjct: 63 VNDILISVRAPVGATNKTNIDCAIGRGLAAITYPFGNN----YLWFYLKFIERALDDQGT 118 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G T + + +P PL EQ + KI +D I E L+ +QA+ Sbjct: 119 GTTFKAISGNILKSQKIPFAPLPEQKSLVSKIEQLFSELDNGIANLKSAKEKLEVYRQAV 178 Query: 203 -------------------------VSYIVTKGLNPDVKMKDSG-----IEWV-----GL 227 +S +T G N + + +EW Sbjct: 179 LKKAFEGELTKEWRKKQTELPSAEDLSNQITFGRNKLYENQIKEWQQDLVEWNSKRKQYK 238 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG------------------------- 262 P + + I + + G Sbjct: 239 KPSKPKKLDVPEPPNSDHENKKWNIPKSWIWTQLGVIAFITKLAGFEYTKYVSYSENGDL 298 Query: 263 ----------NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 N ++ + + S+ + GE++ F+ +L Sbjct: 299 PVIKAENAGLNGFKRTNYSKVKSEDVSFLKRSKLLGGELIIVFVGAGTGNVALVPKDQNY 358 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 + +L RS + A + SL +++ PV+ P Sbjct: 359 FLGPNIGMARPYLNVE-PRFLELFFRSNFGKNLMMATAKAVAQPSLSMGTIRQSPVVFPS 417 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++EQ I + I + + L E I +S+ + R S + A +G+ Sbjct: 418 VREQRQIVSEIESRLSVSNKLAESINESLEKSEALRQSILKRAFSGE 464 Score = 44.4 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 57/203 (28%), Gaps = 12/203 (5%) Query: 21 IPKHWKVVPIKRF--------TKLNTGRTSESGKDIIYIGLED-VESGTGKYLPKDGNSR 71 IPK W + + + D+ I E+ +G + S Sbjct: 263 IPKSWIWTQLGVIAFITKLAGFEYTKYVSYSENGDLPVIKAENAGLNGFKRTNYSKVKSE 322 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLL 129 S G+++ +G + D + + +P + Sbjct: 323 DVSFLKRSKLLGGELIIVFVGAGTGNVALVPKDQNYFLGPNIGMARPYLNVEPRFLELFF 382 Query: 130 SIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + A + I P+ P + EQ I +I + + L Sbjct: 383 RSNFGKNLMMATAKAVAQPSLSMGTIRQSPVVFPSVREQRQIVSEIESRLSVSNKLAESI 442 Query: 189 IRFIELLKEKKQALVSYIVTKGL 211 +E + +Q+++ + L Sbjct: 443 NESLEKSEALRQSILKRAFSGEL 465 >gi|289433647|ref|YP_003463519.1| type I restriction-modification system, S subunit, putative [Listeria seeligeri serovar 1/2b str. SLCC3954] gi|289169891|emb|CBH26431.1| type I restriction-modification system, S subunit, putative [Listeria seeligeri serovar 1/2b str. SLCC3954] Length = 429 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 50/415 (12%), Positives = 120/415 (28%), Gaps = 32/415 (7%) Query: 23 KHWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDVES-----GTGKYLPKDGNSRQS 73 W++ + + + + + + K + ++ D+ S +YL Sbjct: 20 NDWELRKLGGLMNITSVKRIHQSDWTDKGVRFLRARDIVSASKGKNPSEYLYISKKLYDE 79 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLS- 130 + G +L +G +I + + Q K + + + Sbjct: 80 HSKISGKVGVGDLLVTGVGSIGIPMLIKHEEPLYFKDGNIIWFQNKKNIDGGFFYYSFNS 139 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + I T+ G P+ +P EQ I ++D I R Sbjct: 140 HSIQKFIRDSAGIGTVGTYTIDSGGKTPIYLPNKKEQQRIGTF----FKQLDNTIALHQR 195 Query: 191 FIELLKEKKQALVSYIVTKGLN--PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 +E +K K A +S + P + +W D+ + + Sbjct: 196 KLEKIKALKTAYLSEMFPAEGETKPKRRFAGFTDDWEQRKLDNSIKVMDGDRGSNYPHDS 255 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRS 304 + L L GN+ + + + Q+ ++ + V + Sbjct: 256 DFFDNGDTLFLDTGNVTKNGFKFDNVKYITKEKDGQLRAGKLEKNDFVLTSRGTLGNVGF 315 Query: 305 LRSAQVMERG---IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKF 359 I ++ + S + +L F + + Sbjct: 316 YDKFVYKRHPKLRINSAMLILRNTDEQLSCSYLHTLLKGNLISDFMRKNQVGSAQPHITK 375 Query: 360 EDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +L + VP +KEQ I + +D + ++ + L+ + +++ Sbjct: 376 SEFLKLDLNVPCDVKEQNKIGDF----FKNLDNTITLHQRKLQKLQNIKKAYLNE 426 >gi|218667989|ref|YP_002426767.1| type I restriction-modification system, S subunit [Acidithiobacillus ferrooxidans ATCC 23270] gi|218520202|gb|ACK80788.1| type I restriction-modification system, S subunit [Acidithiobacillus ferrooxidans ATCC 23270] Length = 383 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 41/364 (11%), Positives = 102/364 (28%), Gaps = 23/364 (6%) Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKL----GPYLRKAIIADFDGICSTQFLVL 114 + K+ + Q + + + G +Y P + G+ S + V Sbjct: 28 DQRDFFDKEIAT-QGNLESYFVVELGSYVYNPRISATAPVGPISKNKVGTGVMSPLYTVF 86 Query: 115 QPKDVLPELLQGWLL---SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 + KD + + + ++ + +P+P+P EQ I Sbjct: 87 KFKDGGNDFYEHYFKTTGWHTYMRQASSTGARHDRMAISSDDFMAMPLPVPTPKEQQKIA 146 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 E + + + + K+ L+ + +++ + G Sbjct: 147 ECLSSVDALMAAQARKVDALKT----HKKGLMQQLFPTEGETQPRLRFPEFQNAGEWNKT 202 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 + ++ L L GN L+ + + D G++ Sbjct: 203 TLGEAATFFNGRAYKQEELLESGKYPVLRVGNFFTNNNWYYSDLELDETK---YCDKGDL 259 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 ++ + + + I + + GID +L + + + Sbjct: 260 LYAWSASFGPRMWHGVKVIYHYHI----WKVEQHSGIDRQFLFITLENETERMKSNSANG 315 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + ++ P EQ I + + + +D L+ Q + LK + + Sbjct: 316 LGLLHITKGTIEGWDTAFPSPPEQHRIASCL----SSLDALITLETQKLEALKTHKKGLM 371 Query: 412 AAAV 415 Sbjct: 372 QQLF 375 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 40/180 (22%), Gaps = 2/180 (1%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + GR + + + + G + + + K Sbjct: 198 EWNKTTLGEAATFFNGRAYKQEELLESGKYPVLRVGNF-FTNNNWYYSDLELDETKYCDK 256 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G +LY + I ++ + L + + G Sbjct: 257 GDLLYAWS-ASFGPRMWHGVKVIYHYHIWKVEQHSGIDRQFLFITLENETERMKSNSANG 315 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + H I P EQ I + + I + K Q L Sbjct: 316 LGLLHITKGTIEGWDTAFPSPPEQHRIASCLSSLDALITLETQKLEALKTHKKGLMQQLF 375 >gi|237755860|ref|ZP_04584456.1| type I restriction/modification specificity protein [Sulfurihydrogenibium yellowstonense SS-5] gi|237691971|gb|EEP60983.1| type I restriction/modification specificity protein [Sulfurihydrogenibium yellowstonense SS-5] Length = 381 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 56/399 (14%), Positives = 116/399 (29%), Gaps = 32/399 (8%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNS 70 ++++G IP+ WK V + + G K I + +++G + S Sbjct: 6 EIEYVGDIPEGWKWVKLGEIADVRDGTHDSPKKVIDGKYLITSKHIKNGKIDFSKAYKIS 65 Query: 71 RQ--SDTSTVSIFAKGQILYGKLGPYLRKAII-ADFDGICSTQFLVLQPKDVLPELLQGW 127 + S K IL+ +G I+ + D L L + + + Sbjct: 66 LDDFEAINKRSKVDKYDILFSMIGTIGEMVIVDFEPDFAIKNVGLFKTGNKDLSKWIYYY 125 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S + I A +G+T + + N P+ +PP E+ I E + + +I+ L + Sbjct: 126 LKSNEAQAEIRASLKGSTQQYITLGDLRNFPILLPPPPERKAIAEVLSSIDDKIELLHRQ 185 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 E+ + GL + K ++ Sbjct: 186 NKTLEEMAMTLFRQWFIEPTKDGLPDGWEEKRLKDVYIFEK-------------GIEPGS 232 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 L I ++ + + L+ + + I + +++ F Sbjct: 233 KNYLKTPGIDTVRFIRVGNMLDNKADVYVKKDLARNSICNFDDLLVSFDGTVGRVSFGLV 292 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAW---LMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 G +S + L + S ++ G + Sbjct: 293 ------GCYSSGIRKIYSKDEIYNKLWLKHQIFISEEIQDEINMHAEGTTILHASSSIDY 346 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 L + PP + I + I + + I L Sbjct: 347 LSFVFPPKE---KIEEY-DKFFDPIYKKILHNKAQIQTL 381 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 72/194 (37%), Gaps = 15/194 (7%) Query: 221 GIEWVGLVPDHWEVKPFFA--LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-- 276 IE+VG +P+ W+ V + + K + ++ +I + K Sbjct: 6 EIEYVGDIPEGWKWVKLGEIADVRDGTHDSPKKVIDGKYLITSKHIKNGKIDFSKAYKIS 65 Query: 277 ---PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 E+ VD +I+F I + + I + + + S ++ Sbjct: 66 LDDFEAINKRSKVDKYDILFSMIGTIGEMVIV---DFEPDFAIKNVGLFKTGNKDLSKWI 122 Query: 334 AWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + ++S + A +Q + D++ P+L+PP E+ I V+ + ID Sbjct: 123 YYYLKSNEAQAEIRASLKGSTQQYITLGDLRNFPILLPPPPERKAIAEVL----SSIDDK 178 Query: 393 VEKIEQSIVLLKER 406 +E + + L+E Sbjct: 179 IELLHRQNKTLEEM 192 >gi|54308989|ref|YP_130009.1| putative Type I restriction enzyme ecoeispecificity protein [Photobacterium profundum SS9] gi|46913419|emb|CAG20207.1| hypothetical Type I restriction enzyme EcoEIspecificity protein (S protein) [Photobacterium profundum SS9] Length = 551 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 59/476 (12%), Positives = 129/476 (27%), Gaps = 98/476 (20%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLP-KDGNSRQ 72 P W +V + + L G S++ I ++ + +G + + Sbjct: 60 PHSWSIVRLGGISTLENGDRSKNYPNKSVLVDSGIPFVNAGHLVNGRIQKSEMTFITDER 119 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 D F G IL+ G + A++ + I S+ +V + +L + L + S Sbjct: 120 FDLLRAGKFKNGDILFCLRGSLGKSALVDGFENGAIASSLVIVRPDESILAKYLMLYFES 179 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK----------------- 173 + I G + + +P PPL EQ I K Sbjct: 180 PMSFRNISQYDNGTAQPNLSATDLAKFIVPTPPLEEQHRIVTKVDELMTLCDQLEQQTES 239 Query: 174 ------------------------IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + R+ + + KQ ++ V Sbjct: 240 SIDAHKTLVEVLLATLTNSTDADELAKNWTRVSEHFDTLFTTERSIDQLKQTVLQLAVMG 299 Query: 210 GLNPDVKMKD-------------------------------SGIEWVGLVPDHWEVKPFF 238 L P + + E +P W Sbjct: 300 KLVPQDPNDEPASKLLKCIVEEKAQLIKDKKIKKQKALPEITDEEKPFELPSGWTWCRLG 359 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET---------YQIVDPG 289 L + + + +++ P + V G Sbjct: 360 DLSLTSDAGWSPKCHPTPREEEHWGVLKVSAVTWNSYNPLENKELPSSLEPREQYEVQDG 419 Query: 290 EIVFRFIDLQN--DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVF 346 + + + + + + ++ +++ + + H L S + Sbjct: 420 DFLISRANTAKLVARAVVVPPKSPKKLMMSDKIIRFQFHKQVDANYINLFNDSSFARNYY 479 Query: 347 YAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 A+ G +++ E ++ L + PP++EQ I + + D L K+ ++ Sbjct: 480 AAVAGGTSSSMKNVSREQIRNLVIAFPPLEEQVKILKMKGQFSELCDELKNKLSKA 535 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 57/168 (33%), Gaps = 7/168 (4%) Query: 244 LNRKNTKLIESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + L++S I ++ G+ IQK E + + G+I+F Sbjct: 82 NYPNKSVLVDSGIPFVNAGHLVNGRIQKSEMTFITDERFDLLRAGKFKNGDILFCLRGSL 141 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQSLK 358 + + I +S + I + YL S + + +L Sbjct: 142 GKSALVDGFENG--AIASSLVIVRPDESILAKYLMLYFESPMSFRNISQYDNGTAQPNLS 199 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 D+ + V PP++EQ I ++ D L ++ E SI K Sbjct: 200 ATDLAKFIVPTPPLEEQHRIVTKVDELMTLCDQLEQQTESSIDAHKTL 247 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 54/205 (26%), Gaps = 18/205 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTG--------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P W + L + T + + + V + L Sbjct: 348 ELPSGWTWCRLGDL-SLTSDAGWSPKCHPTPREEEHWGVLKVSAVTWNSYNPLENKELPS 406 Query: 72 QSDTSTVSIFAKGQILYGKLGP---YLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQ 125 + G L + R ++ + S + + Q + Sbjct: 407 SLEPREQYEVQDGDFLISRANTAKLVARAVVVPPKSPKKLMMSDKIIRFQFHKQVDANYI 466 Query: 126 GWLLSIDVTQRIEAICEGAT---MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + A G T M + + I N+ + PPL EQV I + + D Sbjct: 467 NLFNDSSFARNYYAAVAGGTSSSMKNVSREQIRNLVIAFPPLEEQVKILKMKGQFSELCD 526 Query: 183 TLITERIRFIELLKEKKQALVSYIV 207 L + + + +V V Sbjct: 527 ELKNKLSKAKSIQLVLADTIVGQAV 551 >gi|28377766|ref|NP_784658.1| type Ic restriction-modification system, HsdS subunit [Lactobacillus plantarum WCFS1] gi|28270599|emb|CAD63503.1| type Ic restriction-modification system, HsdS subunit [Lactobacillus plantarum WCFS1] Length = 380 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 60/395 (15%), Positives = 122/395 (30%), Gaps = 44/395 (11%) Query: 25 WKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTST 77 W+ + K+ G T +S ++ + +V +G + + S S+ Sbjct: 19 WEQRKLGELGKIQGGGTPDSGIAEYWDGNVNWFTPTEVSNNGYLESSNRKITSLGLKKSS 78 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + +L + + I + + F L E + + +++ Sbjct: 79 ARLMPASTVLITS-RAGVGRMGILKYPASTNQGFQSLILNSATDEY-FIYSMQPIISKLA 136 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G+T + K + I + IP EQ I + I + + L K Sbjct: 137 NRLASGSTFTEISGKQMEKIEIMIPTTGEQNRISSLMKCINNLIAANEDKLEQLKTLKKL 196 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q + S EW + ++++ + Sbjct: 197 MMQKIFSQ-----------------EWRFKGFTDPWEQRKLGDISKITAGGDIDKDKLST 239 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 Y I L + +SY+ P V D+ + K + S + R ++ Sbjct: 240 RGRYPVIANALTNNGVVGYYDSYKVKG---PAVTVTGRGDVGHAKTRIESFTPIVRLLVV 296 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 SA + + +L + + ++F S L + P EQ Sbjct: 297 SA---------PNFDINFLENAINNIRIFNE--STGVPQLTAPQLGSYEFEYPCSSEQVC 345 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I NV++ +ID L+ E + LKE + + Sbjct: 346 IGNVLH----KIDNLIAANEDKLNQLKELKKYLMQ 376 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 22/210 (10%), Positives = 60/210 (28%), Gaps = 16/210 (7%) Query: 211 LNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 L P V+ K W +G + + + E N + + N Sbjct: 6 LVPKVRFKGFSDPWEQRKLGELGKIQGGGTPDSGIAEYWDGN---VNWFTPTEVSNNGYL 62 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + R + + +++ ++ L+ + ++ + Sbjct: 63 ESSNRKITSLGLKKSSARLMPASTVLITSRAGVGRMGILK-----YPASTNQGFQSLILN 117 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 Y + M+ + + ++++ +++P EQ I + Sbjct: 118 SATDEYFIYSMQPIISKLANRLASGSTFTEISGKQMEKIEIMIPTTGEQNRI----SSLM 173 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 I+ L+ E + LK + + + Sbjct: 174 KCINNLIAANEDKLEQLKTLKKLMMQKIFS 203 >gi|325957309|ref|YP_004292721.1| type I restriction-modification system, S subunit [Lactobacillus acidophilus 30SC] gi|325333874|gb|ADZ07782.1| type I restriction-modification system, S subunit [Lactobacillus acidophilus 30SC] Length = 494 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 66/431 (15%), Positives = 134/431 (31%), Gaps = 62/431 (14%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKD---IIYIGLEDVESGTG--KYLPKDGNSRQS 73 IP +W+ V + + G++ + K+ I + V+ ++ Sbjct: 66 DIPNNWEWVKLGNIVDYVQRGKSPKYDKESNSYPIISQKCVQWDGVHLEFAKHLKEDFWK 125 Query: 74 DTSTVSIFAKGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + + KG +L G ++ D + S ++ K+V + +L Sbjct: 126 ELPSYRFVTKGDLLLNSTGTGTVGRIIKVTESFDKIPVDSHVTIIRLNKNVCNSYILYFL 185 Query: 129 LSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 +S + ++ G T I NI + IPPL EQ I KI +D Sbjct: 186 MSPIIQNNLDDYLTGTTKQKEFGLASIQNIVISIPPLEEQKRIVAKIEKLMPLVDEYAES 245 Query: 188 RIRFIELLKEK----KQALVSYIVTKGLNPDVKMKDSGIEWV------------------ 225 R ++ E KQ+++ Y + L + E + Sbjct: 246 YNRLQKIDNEFEDKLKQSVLQYAMEGKLVKQNPSDEPASELIKKIENEKAELVKEGKIKK 305 Query: 226 -------------GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 +P+ WE +V K + S+ + + + N Sbjct: 306 SKKLPAITDDEKPFDIPNSWEWVRLGDIVQAQIGKTPQRHNSDYWAERDIPWVSISDLTN 365 Query: 273 MGLKPESYE----------TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 L + +IV ++ F LR V I++ Sbjct: 366 GNLTETKEKISSKALKDVFHDRIVAKNTLLMSFKLTIGKVAILRINAVHNEAIVS----I 421 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNV 381 + + + +L + + ++ ++L + +L + +PP+KEQ I Sbjct: 422 IPFIDSEHSLRDYLFVTLPMISQNGDFKDAIKGKTLNKSSLTKLLIPLPPLKEQKRIVAK 481 Query: 382 INVETARIDVL 392 + D+L Sbjct: 482 LREFKRSADIL 492 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 78/214 (36%), Gaps = 15/214 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE-----SNILSLSYGNIIQKLETRNMG 274 + E +P++WE +V + R + + I+S Sbjct: 59 TDEEKPFDIPNNWEWVKLGNIVDYVQRGKSPKYDKESNSYPIISQKCVQWDGVHLEFAKH 118 Query: 275 LKPESYE---TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDS 330 LK + ++ +Y+ V G+++ R ++ + ++ + S + + + Sbjct: 119 LKEDFWKELPSYRFVTKGDLLLNSTGTGTVGRIIKVTESFDKIPVDSHVTIIRLNKNVCN 178 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +Y+ + + S + +G ++ ++ + + +PP++EQ I I Sbjct: 179 SYILYFLMSPIIQNNLDDYLTGTTKQKEFGLASIQNIVISIPPLEEQKRIVAKIEKLMPL 238 Query: 389 IDVLVEKIE--QSIVLLKE--RRSSFIAAAVTGQ 418 +D E Q I E + S + A+ G+ Sbjct: 239 VDEYAESYNRLQKIDNEFEDKLKQSVLQYAMEGK 272 >gi|206896558|ref|YP_002247704.1| type I restriction/modification enzyme [Coprothermobacter proteolyticus DSM 5265] gi|206739175|gb|ACI18253.1| type I restriction/modification enzyme [Coprothermobacter proteolyticus DSM 5265] Length = 678 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 56/380 (14%), Positives = 108/380 (28%), Gaps = 52/380 (13%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+VV ++ + G + + G G + S Sbjct: 338 WEVVSLREICDIQKGTSITKADTV-----------EGNVPVIAGGQEPAYYHNQSNRDGN 386 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I G Y D S + + + + + + + GA Sbjct: 387 IITVSASGAYAGFVNYFDIPIFASDCTTIKSNDEEKALTKYIFYILKSRQEDLYKLQRGA 446 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 H + NI +P+PPL Q + ++ + I E+ A+ Sbjct: 447 GQPHVYPNDLANIQIPLPPLPVQQELVARLDKQQAII---------------EQCNAMEK 491 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 I+ G++ + D +G + + K + I + N Sbjct: 492 TILEAGIDDSIFEGDWEWVELGELIALRNGISISNTLVSNRGKYPVCGSNGIYGYTDNND 551 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + +Y V + I+ Sbjct: 552 KLLFGETIVVGRVGAYCGNVHYYD-----------------VPIWVTDNAIV---VTVTN 591 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + + YL + + S DL K G + + + L V +PPI++Q I + +NV Sbjct: 592 KDKLKTKYLYYFLLSKDLGKYANVTG---QPYISQSIISSLKVPLPPIEKQQKIVDFLNV 648 Query: 385 ETA---RIDVLVEKIEQSIV 401 + I L E +Q+I Sbjct: 649 QFETLTNIRRLKENAKQTIK 668 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 55/161 (34%), Gaps = 12/161 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSIFA 82 W+ V + L G + + + S GKY N T + + Sbjct: 506 DWEWVELGELIALRNGISISNT----------LVSNRGKYPVCGSNGIYGYTDNNDKLLF 555 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 I+ G++G Y D + +V+ + +L +L +++ + Sbjct: 556 GETIVVGRVGAYCGNVHYYDVPIWVTDNAIVVTVTNK-DKLKTKYLYYFLLSKDLGKYAN 614 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + I ++ +P+PP+ +Q I + + + + Sbjct: 615 VTGQPYISQSIISSLKVPLPPIEKQQKIVDFLNVQFETLTN 655 >gi|32477070|ref|NP_870064.1| type I restriction modification enzyme, S subunit [Rhodopirellula baltica SH 1] gi|32447618|emb|CAD79219.1| type I restriction modification enzyme, S subunit [Rhodopirellula baltica SH 1] Length = 393 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 57/409 (13%), Positives = 126/409 (30%), Gaps = 35/409 (8%) Query: 22 PKHWKVVPIKRFTK----LNTGRTSESG---KDIIYIGLEDVESGTGKYLP-KDGNSRQS 73 P W + + + G + Y+ +++ + K + Sbjct: 7 PAGWSLTKLSEICDPNAPIMYGILQPGPVILDGVPYVRPSEIDPDRIRLEDIKRTTPEIA 66 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADF----DGICSTQFLVLQPKDVLPELLQGWLL 129 + S +L +G R A++ + S+ + L K ++ L Sbjct: 67 ERYRRSTLQTEDLLITIVGTLGRIAVVPPELNGANITQSSARIRLNRKTANLRYIRQLLR 126 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S ++ + G + + + ++ +P+PPL+EQ I E + R Sbjct: 127 SPIAIRQYDFHRLGTGVPRLNIHHVRDLQIPLPPLSEQKRIAEILDRAEALRAK----RR 182 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + LL E Q++ + +P K +E + + + K Sbjct: 183 AALALLDELTQSI---FLDMFGDPVSNPKGWPVESLSDLGK-------ITTGGTPSSKKE 232 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + ++ G+ ++ E L + V G I K S + Sbjct: 233 GMFGGTVPFVTPGD-LESDELPKRTLSDHGASEAKTVPAGATFVCCIGATIGKMGQASVR 291 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + + + + + S LK +++ + V Sbjct: 292 SAFNQQLNAIEWSNSVNDDFGLGVLRFFKKLIATW----GASTTLPILKKSSFEKIEIPV 347 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PPI+ Q I + + + I+ L S+ L + +S A G+ Sbjct: 348 PPIESQ-AI--YADRK-SEIEQLRSLHRNSLSELDQLFASLQHRAFRGE 392 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 58/197 (29%), Gaps = 17/197 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 PK W V + K+ TG T S K+ + ++ D+ES Sbjct: 207 PKGWPVESLSDLGKITTGGTPSSKKEGMFGGTVPFVTPGDLESDELP----KRTLSDHGA 262 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVT 134 S G +G + K A + Q + V + G L Sbjct: 263 SEAKTVPAGATFVCCIGATIGKMGQASVRSAFNQQLNAIEWSNSVNDDFGLGVLRF--FK 320 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I T+ I +P+PP+ Q + + I+ L + + Sbjct: 321 KLIATWGASTTLPILKKSSFEKIEIPVPPIESQAIYAD----RKSEIEQLRSLHRNSLSE 376 Query: 195 LKEKKQALVSYIVTKGL 211 L + +L L Sbjct: 377 LDQLFASLQHRAFRGEL 393 >gi|15804920|ref|NP_290962.1| putative restriction modification enzyme S subunit [Escherichia coli O157:H7 EDL933] gi|15834560|ref|NP_313333.1| type I restriction-modification enzyme S subunit [Escherichia coli O157:H7 str. Sakai] gi|168749492|ref|ZP_02774514.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4113] gi|168754917|ref|ZP_02779924.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4401] gi|168760594|ref|ZP_02785601.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4501] gi|168766628|ref|ZP_02791635.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4486] gi|168773942|ref|ZP_02798949.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4196] gi|168781636|ref|ZP_02806643.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4076] gi|168784990|ref|ZP_02809997.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC869] gi|168797919|ref|ZP_02822926.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC508] gi|195937621|ref|ZP_03083003.1| type I restriction-modification enzyme S subunit [Escherichia coli O157:H7 str. EC4024] gi|208808904|ref|ZP_03251241.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4206] gi|208813833|ref|ZP_03255162.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4045] gi|208821430|ref|ZP_03261750.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4042] gi|209396465|ref|YP_002273870.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4115] gi|217325306|ref|ZP_03441390.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. TW14588] gi|254796345|ref|YP_003081182.1| putative restriction modification enzyme S subunit [Escherichia coli O157:H7 str. TW14359] gi|261226705|ref|ZP_05940986.1| putative restriction modification enzyme S subunit [Escherichia coli O157:H7 str. FRIK2000] gi|12519366|gb|AAG59529.1|AE005666_1 putative restriction modification enzyme S subunit [Escherichia coli O157:H7 str. EDL933] gi|13364784|dbj|BAB38729.1| type I restriction-modification enzyme S subunit [Escherichia coli O157:H7 str. Sakai] gi|187770232|gb|EDU34076.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4196] gi|188016149|gb|EDU54271.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4113] gi|189000706|gb|EDU69692.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4076] gi|189357798|gb|EDU76217.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4401] gi|189363994|gb|EDU82413.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4486] gi|189368935|gb|EDU87351.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4501] gi|189375167|gb|EDU93583.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC869] gi|189379418|gb|EDU97834.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC508] gi|208728705|gb|EDZ78306.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4206] gi|208735110|gb|EDZ83797.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4045] gi|208741553|gb|EDZ89235.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4042] gi|209157865|gb|ACI35298.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. EC4115] gi|217321527|gb|EEC29951.1| putative type I restriction-modification system, S subunit [Escherichia coli O157:H7 str. TW14588] gi|254595745|gb|ACT75106.1| putative restriction modification enzyme S subunit [Escherichia coli O157:H7 str. TW14359] gi|320190535|gb|EFW65185.1| Type I restriction-modification system, specificity subunit S [Escherichia coli O157:H7 str. EC1212] gi|320638729|gb|EFX08387.1| putative restriction modification enzyme S subunit [Escherichia coli O157:H7 str. G5101] gi|320644441|gb|EFX13506.1| putative restriction modification enzyme S subunit [Escherichia coli O157:H- str. 493-89] gi|320649759|gb|EFX18283.1| putative restriction modification enzyme S subunit [Escherichia coli O157:H- str. H 2687] gi|320654809|gb|EFX22778.1| putative restriction modification enzyme S subunit [Escherichia coli O55:H7 str. 3256-97 TW 07815] gi|320665588|gb|EFX32634.1| putative restriction modification enzyme S subunit [Escherichia coli O157:H7 str. LSU-61] gi|326345338|gb|EGD69081.1| Type I restriction-modification system, specificity subunit S [Escherichia coli O157:H7 str. 1125] gi|326346808|gb|EGD70542.1| Type I restriction-modification system, specificity subunit S [Escherichia coli O157:H7 str. 1044] Length = 584 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 75/489 (15%), Positives = 141/489 (28%), Gaps = 94/489 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 +K K P+ S + +P+ W+ V I +T KD YI + + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVRISEIGHDWGQKTP--DKDFTYIDVGSINKEY 138 Query: 61 GKYLPKDG-NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQ 115 G +++ + + I +G I+Y + PYL I + + I ST F ++ Sbjct: 139 GIIEELSILSAKDAPSRARKIVQQGTIIYSTVRPYLLNIAIIENEILPEPIASTAFAIIH 198 Query: 116 PKD-VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK- 173 P + + +L S +E G + K + P+PP EQV I K Sbjct: 199 PYTAMDANFIYYYLRSPVFVCYVENCQTGVAYPAINDKQFFSGITPVPPSLEQVRIANKI 258 Query: 174 ----------------------------------------IIAETVRIDTLITERIRFIE 193 + RI Sbjct: 259 KELMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQTAEELAENWARISEYFDTLFTTEA 318 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKD-------------------------------SGI 222 + KQ ++ V L P + S Sbjct: 319 SVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLPPISDE 378 Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNT---------KLIESNILSLSYGNIIQKLETRNM 273 E +P+ WE F ++ + + ++ + E + + Sbjct: 379 EKPFELPEGWEWCLFEDIIDIQSGITKGRNLSNRTLVKVPYLRVANVQRGYLDLTEIKQI 438 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTY 332 + E E YQ+V ++ D R+ + +D + Sbjct: 439 EIPIEEKEKYQVVKGDLLITEGGDWDTVGRTTVWCHDWYIANQNHVFKGRNIGQDVDPYW 498 Query: 333 LAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L M S + F + + S+ ++ PV +PP E I + +++ + Sbjct: 499 LETYMNSPFSRQYFANASKQTTNLASINKTQLRGCPVAIPPSSEAKKIMSKLHIFYKLCE 558 Query: 391 VLVEKIEQS 399 L I+ + Sbjct: 559 ELKNHIQSA 567 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 73/195 (37%), Gaps = 3/195 (1%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET-RNMGLKPE 278 S E +P+ WE + + +K + I S +E + K Sbjct: 93 SEEEKPFELPEGWEWVRISEIGHDWGQKTPDKDFTYIDVGSINKEYGIIEELSILSAKDA 152 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLM 337 +IV G I++ + ++ +++ I ++A+ + P+ +D+ ++ + + Sbjct: 153 PSRARKIVQQGTIIYSTVRPYLLNIAIIENEILPEPIASTAFAIIHPYTAMDANFIYYYL 212 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 RS +G+ ++ + VPP EQ I N I + D L ++ Sbjct: 213 RSPVFVCYVENCQTGVAYPAINDKQFFSGITPVPPSLEQVRIANKIKELMSLCDQLEQQS 272 Query: 397 EQSIVLLKERRSSFI 411 S+ ++ + + Sbjct: 273 LTSLDAHQQLVETLL 287 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 55/202 (27%), Gaps = 13/202 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+ + + +G T + Y+ + +V+ G Sbjct: 383 ELPEGWEWCLFEDIIDIQSGITKGRNLSNRTLVKVPYLRVANVQRGYLDLTEIKQIEIPI 442 Query: 74 DTSTVSIFAKGQILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + KG +L + G + R + I + + Sbjct: 443 EEKEKYQVVKGDLLITEGGDWDTVGRTTVWCHDWYIANQNHVFKGRNIGQDVDPYWLETY 502 Query: 131 IDV----TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 ++ A + ++ + + P+ IPP +E I K+ + L Sbjct: 503 MNSPFSRQYFANASKQTTNLASINKTQLRGCPVAIPPSSEAKKIMSKLHIFYKLCEELKN 562 Query: 187 ERIRFIELLKEKKQALVSYIVT 208 + AL V Sbjct: 563 HIQSAQQTQLHLADALTDAAVN 584 >gi|55820774|ref|YP_139216.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus LMG 18311] gi|55822677|ref|YP_141118.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus CNRZ1066] gi|55736759|gb|AAV60401.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus LMG 18311] gi|55738662|gb|AAV62303.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus CNRZ1066] Length = 406 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 54/399 (13%), Positives = 132/399 (33%), Gaps = 36/399 (9%) Query: 30 IKRFTKLN-----TGRTS----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT--STV 78 +K LN G T E ++ ++ + D Q S Sbjct: 26 LKELVSLNGRIGFRGYTKNDIVERSNGVLTYSPTNIVNNKIVNYKNDTYISQDKYKESPE 85 Query: 79 SIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + IL+ K G L K+ + + Q +V++ V + L +LL+ + + Sbjct: 86 IMVKNNDILFVKTGSTLGKSALVRNLTEPATINPQLIVIKTIHVDSDYLAVYLLTDSIQK 145 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ + G + IG + +PPL EQ I + + + L Sbjct: 146 QVFQVKIGGAVPTLTETEIGKFVVKLPPLPEQTAIGSLFRTLDDLLASYKDNLTNYQSLK 205 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + P++++ EW ++ + + + K+ ++ Sbjct: 206 VTMLSKMFPK--VGQTVPEIRLDGFEGEW-----ENKILSEVTNITMGQSPKSENYTDNP 258 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++ + + + + E ++ + G+I+ D V+ RG+ Sbjct: 259 NDYILVQGNAD-IKDKQVVPRLWTTEVTKMAEIGDIILTVRAPVGDIGKTDYNVVIGRGV 317 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKE 374 + ++ + + + + +G +S+ D+K + +P ++E Sbjct: 318 AA---------IKGNDFIFYTLEKMKMTGFWNKFSTGSTFESISSNDIKEAIIQIPTLEE 368 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I + +D L+ ++ I L+ + + Sbjct: 369 QKAIGAY----FSNLDNLIVAHQEKISQLETLKKKLLQD 403 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 51/180 (28%), Gaps = 5/180 (2%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + T + G++ +S + G K R T + Sbjct: 231 EWENKILSEVTNITMGQSPKSENYTDNPNDYILVQGNADIKDKQVVPRLWTTEVTKMAEI 290 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G I+ P D++ + ++ D + L + +T G Sbjct: 291 GDIILTVRAPV-GDIGKTDYNVVIGRGVAAIKGNDFI----FYTLEKMKMTGFWNKFSTG 345 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T I + IP L EQ I I + + L K+ Q + Sbjct: 346 STFESISSNDIKEAIIQIPTLEEQKAIGAYFSNLDNLIVAHQEKISQLETLKKKLLQDMF 405 >gi|148988248|ref|ZP_01819711.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP6-BS73] gi|147926712|gb|EDK77785.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP6-BS73] gi|301801394|emb|CBW34080.1| type I restriction-modification system S protein [Streptococcus pneumoniae INV200] Length = 427 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 70/426 (16%), Positives = 143/426 (33%), Gaps = 66/426 (15%) Query: 34 TKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ G + KD I +I + D E G ++S + KG Sbjct: 2 VEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGT 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAICEGA 144 L + R I+ I + ++ L + ++LS +V + ++ GA Sbjct: 62 FLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGA 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----Q 200 + + + + +I +P+PPLAEQ I E I + ++D R +L KE + Sbjct: 122 VVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKK 181 Query: 201 ALVSYIVTKGLNPDVKMKDS---------------------------------------G 221 +++ Y + L +S Sbjct: 182 SILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYY 241 Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL------ 275 E +P+ WE + + + R + + + + ++ L Sbjct: 242 EEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDP 301 Query: 276 -KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGID 329 SY+ +++ G++++ L R + + A + V I+ Sbjct: 302 ETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYVWAVADSHVTVIRVLSGVIN 361 Query: 330 STYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 ++ + S + V SG ++ L + +K + +PP+ EQ I + I A Sbjct: 362 CHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFA 421 Query: 388 RIDVLV 393 ID L+ Sbjct: 422 HIDALI 427 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 37 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 93 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 94 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIES 153 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 154 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 191 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 247 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 306 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 307 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYVWAVADSHVTVIRVLSGVINCHFIY 366 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 367 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 426 Query: 185 I 185 I Sbjct: 427 I 427 >gi|148377827|ref|YP_001256703.1| Type I R/M system specificity subunit [Mycoplasma agalactiae PG2] gi|148291873|emb|CAL59264.1| Type I R/M system specificity subunit [Mycoplasma agalactiae PG2] Length = 408 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 55/401 (13%), Positives = 131/401 (32%), Gaps = 26/401 (6%) Query: 25 WKVVPIKRFTKLNT-GRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ + + G T + I +I +ED + + S Sbjct: 19 WEQEKFANIYQFASEGGTPSTSIKKYYENGTIPFIKVEDTVNKYIENGKYFITENGLINS 78 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQ 135 + + + I++ G + I L + PK E + L S + Sbjct: 79 SAWLVPENSIIFTN-GATIGNVAINKIKTATKQGILGIIPKQKYDVEFIYYLLSSKNFQN 137 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + T + + I + +P + + +D+LIT R + L Sbjct: 138 EVNRKITIGTFAMITLSNLDKIKVNLPNYDIERAKISSL---FSHLDSLITLHQRKLSSL 194 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K K L+ + + ++ + WE ++ +KN K + Sbjct: 195 KNLKNRLLDKMFCYEKSQFPSIRFK------EFTNAWEQWKARDILLPYRQKNDKNLALI 248 Query: 256 ILSLSYG-NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 S+S + + E + G K + + +F + + + S+ + G Sbjct: 249 GYSVSNKEGFVDQKEFFDDGGKAVYADKKNSLIISFDMFAYNPSRINVGSIALFKNTING 308 Query: 315 IITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPI 372 +++ Y + + + +S ++ +R +L + + + +P + Sbjct: 309 LVSPIYEVFKVSANSNPDLIYLWFKSECFNEIVANNSNKSVRDTLNLKQFEDNLLNLPVL 368 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I N + +D L+ ++ + LK +++ + Sbjct: 369 QEQNKIAN----LFSHLDSLITLHQRKLNSLKNIKNTLLEK 405 >gi|312278103|gb|ADQ62760.1| Restriction modification system DNA specificity domain [Streptococcus thermophilus ND03] Length = 410 Score = 101 bits (250), Expect = 4e-19, Method: Composition-based stats. Identities = 63/417 (15%), Positives = 134/417 (32%), Gaps = 31/417 (7%) Query: 26 KVVPIKRF-TKLNTGRTSE------SGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTST 77 + + +K+ +G T + I I ++V + + Q++ Sbjct: 3 EWKELSSITSKIGSGLTPRGGNSVYTDNGISLIRSQNVLDMDFSTENLAYIDEVQAEKLK 62 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQGWLLSIDVT 134 I K IL G + + I + + + +++ K+ + L Sbjct: 63 NVIVEKNDILLNITGDSIARCTIVPEEILPARVNQHVSIIRCKNTEQSKYVMYYLQYIKK 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ G T + + IG +P+ I KI ID I + + Sbjct: 123 YLLQISKVGGTRNALTKEAIGKLPIKISDDC------NKISKILDNIDQKIHTNNQINQE 176 Query: 195 LKEKKQALVSYIVTKGLNPDV---KMKDSG------IEWVGLVPDHWEVKPFFALVTELN 245 L+ + L Y + PD K SG E +P+ W V + + N Sbjct: 177 LEAMAKTLYDYWFVQFDFPDQNAKPYKSSGGKMVYHPELKREIPEGWGVDSLWNIANFYN 236 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + Y +I+ E N K I + I Sbjct: 237 GLAMQKYRPDTNEDDYLPVIKIREMMNGFSKDTERARLDIPSEAVVDRGDILFSWSATLE 296 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQSLKFEDVKR 364 E+G + V +++ + ++SY + K + + + +K+ Sbjct: 297 VIIWGKEKGALNQHIFKVTSDTYPKSFIYFELKSYLKVFKAIAELRKTTMGHITQDHLKQ 356 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++VPPI+ + ++ + I + + +E L + R + + GQ+ + Sbjct: 357 AKIVVPPIEL----ISKLDAKLQPIMLKQQILENQNQELTQLRDWLLPMLMNGQVKV 409 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 61/191 (31%), Gaps = 16/191 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQ 72 IP+ W V + G ++ +D + I + ++ +G KD + Sbjct: 218 EIPEGWGVDSLWNIANFYNGLAMQKYRPDTNEDDYLPVIKIREMMNG----FSKDTERAR 273 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 D + ++ +G IL+ L I G + + + L S Sbjct: 274 LDIPSEAVVDRGDILFSWS-ATLEVIIWGKEKGALNQHIFKVTSDTYPKSFIYFELKSYL 332 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + A TM H + + +PP+ + K+ A+ I Sbjct: 333 KVFKAIAELRKTTMGHITQDHLKQAKIVVPPI----ELISKLDAKLQPIMLKQQILENQN 388 Query: 193 ELLKEKKQALV 203 + L + + L+ Sbjct: 389 QELTQLRDWLL 399 >gi|298229449|ref|ZP_06963130.1| putative type I RM modification enzyme [Streptococcus pneumoniae str. Canada MDR_19F] Length = 372 Score = 101 bits (250), Expect = 4e-19, Method: Composition-based stats. Identities = 50/394 (12%), Positives = 109/394 (27%), Gaps = 28/394 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI + L EQ I ++ + I + Sbjct: 120 MKHLTKKYFDNIMVSYTNLREQQRIASELDLLSKLILRRQEQLEELNL------------ 167 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 L + G + D+ + + E L L+ N+ Sbjct: 168 -----LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKNVT 222 Query: 266 QKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + + + + ++ +IV + + I S + Sbjct: 223 KNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSGMV 282 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 ++P + +++ + + L +K++ + +PP+ Q + + Sbjct: 283 ILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFADF 341 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +ID I++S+ L+ + S + Sbjct: 342 VV----QIDKSQLAIQKSLEELETLKKSLMQEYF 371 >gi|297582534|ref|YP_003698314.1| restriction modification system DNA specificity domain-containing protein [Bacillus selenitireducens MLS10] gi|297140991|gb|ADH97748.1| restriction modification system DNA specificity domain protein [Bacillus selenitireducens MLS10] Length = 411 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 56/400 (14%), Positives = 131/400 (32%), Gaps = 20/400 (5%) Query: 25 WKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGT--GKYLPKDGNSRQSDTSTVS 79 W + K + G + G+ I + D+ S ++ S + Sbjct: 18 WLTRNLNEIMKFSNGINAPKEAYGQGRKMISVLDILSEEYLTYDNVRNSVSVSEILEQKN 77 Query: 80 IFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 G +++ + L KA + + + S + + + L+ Sbjct: 78 KVEFGDLVFVRSSEVLNEVGLSKAYLDNEYALYSGFSIRGKKISEYDPIFVERSLNGISR 137 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++IE G+T + + + ++ + +P + EQ I E +D I + + I L Sbjct: 138 RQIERKSGGSTRYNVSQEILNSLFINMPTVQEQQKIGEF----FKNLDDRIALQQQHITL 193 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LKE KQ + + K +++ G V + + + Sbjct: 194 LKESKQGFLQKMFPKDGERVPEVRFDGFSGEWEVLEIKNIAAETYGGGTPKTSISDYWNG 253 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 NI + ++ + K S + I + + A V Sbjct: 254 NIPWIQSSDLKTDVLNLVSPTKFISDAGINNSATKLVPENSIAIVTRVGVGKLALVPYPY 313 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IK 373 + ++++ ID + + + + K + + + ++ + +++P +K Sbjct: 314 ATSQDFLSLSSLKIDLKFALYSLY-LIIKKEVNNLQGTSIKGITKPELLKKKIIIPSNLK 372 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I +D + E+ + LL+E + F+ Sbjct: 373 EQQKIGEF----FKNLDDSIAAHEKELELLQETKKGFLQK 408 >gi|108563887|ref|YP_628203.1| type I R-M system specificity subunit [Helicobacter pylori HPAG1] gi|107837660|gb|ABF85529.1| type I R-M system specificity subunit [Helicobacter pylori HPAG1] Length = 375 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 47/406 (11%), Positives = 106/406 (26%), Gaps = 44/406 (10%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 6 LPLNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYK 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G + I +V E L Sbjct: 64 TKYSFPKKGDILISASGTIGKAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYS 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ+ I + + +L ++ + Sbjct: 121 NVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQIAIANILSDLDHYLYSLDALILKKESVK 180 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K L+S + + W+ + + Sbjct: 181 KALSFELLSQ----------------RKRLKGFNQAWQRVRLGDICEITTGSLDANEMVH 224 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++ + I + Sbjct: 225 YGKYRFYTCAKEYYFIDKYAFDTEAI-------------LISGNGAYVGYVHYYKGKFNA 271 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 Y+ ++ + + + + G + +K +L+PP+ EQ Sbjct: 272 YQRTYVLDNFSEHI-IFIKYFLTMFLQSHIQTNRNEGNTPYIVTATLKDFEILLPPLNEQ 330 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 331 IAIANILSDLDNEIISLKNKKRQ----FESIKKALNHDLMSAKIRV 372 >gi|58427672|gb|AAW76709.1| restriction endonuclease S subunits [Xanthomonas oryzae pv. oryzae KACC10331] Length = 536 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 61/418 (14%), Positives = 141/418 (33%), Gaps = 29/418 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + ++N R+ + GK +I ++ + P+ R+ S + F G Sbjct: 126 WRCTTVGDAFEVNPLRSVQRGKVTPFIPMDLLPVNER--SPERIEKREFTGSGIK-FKNG 182 Query: 85 QILYGKLGPYLRKAIIADFDGI-------CSTQFLV--LQPKDVLPELLQGWLLSIDVTQ 135 L ++ P L A G+ ST+++V +P S D + Sbjct: 183 DTLIARITPCLENGKTAFISGLQDGEVAHGSTEYIVLGGRPNHSDGLFAYYIARSPDFRR 242 Query: 136 RIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 EG + + P+ +PP++EQ +I + +I+ + Sbjct: 243 YAIGQMEGTSGRQRVPSAAVEKYPLALPPISEQRVISRILGGLDDKIELNRRMNQTLEAM 302 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF-FALVTELNRKNTKLIE 253 + ++ + V P M+ S +GL+P W++ + + IE Sbjct: 303 ARALFKS---WFVDFDGVPPDDMQKS---ELGLIPKGWKLSRLGVECSYLSRGISPEYIE 356 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQ 309 + + I+ + + + G+++ + R + Sbjct: 357 DGGVLVINQKCIRDFSIDTSKARRHDPTQRSVEERKIQFGDVLVNSTGVGTLGRVAQVLS 416 Query: 310 VMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + E ++ S V+ TYL GS + L + +P++ Sbjct: 417 LDEPTVVDSHVTVVRAGQRLRHTYLGQWFSDKQSEIQTMGEGSTGQTELSRLKLAHMPII 476 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +P Q + + + + ++ + + S L R + + +TG++ ++ + Sbjct: 477 IPS---QKLLADF-DAIVSPLNSKIALADSSSRSLATLRDALLPKLITGELRVQDAER 530 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 59/194 (30%), Gaps = 7/194 (3%) Query: 18 IGAIPKHWKVVPIK-RFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSD 74 +G IPK WK+ + + L+ G + E +D ++ I + + + + Sbjct: 327 LGLIPKGWKLSRLGVECSYLSRGISPEYIEDGGVLVINQKCIRDFSIDTSKARRHDPTQR 386 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + G +L G R A + D + + V++ L G S Sbjct: 387 SVEERKIQFGDVLVNSTGVGTLGRVAQVLSLDEPTVVDSHVTVVRAGQRLRHTYLGQWFS 446 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 ++ + + ++P+ IP + +I + Sbjct: 447 DKQSEIQTMGEGSTGQTELSRLKLAHMPIIIPSQKLLADFDAIVSPLNSKIALADSSSRS 506 Query: 191 FIELLKEKKQALVS 204 L L++ Sbjct: 507 LATLRDALLPKLIT 520 >gi|73670136|ref|YP_306151.1| type I restriction-modification system specificity subunit [Methanosarcina barkeri str. Fusaro] gi|72397298|gb|AAZ71571.1| type I restriction-modification system specificity subunit [Methanosarcina barkeri str. Fusaro] Length = 446 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 68/421 (16%), Positives = 141/421 (33%), Gaps = 32/421 (7%) Query: 23 KHWKVVPIKRF-TKLNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT--- 75 WK VPIK L G S I++G++++ L + N + D Sbjct: 8 NSWKKVPIKNLYLGLYDGPHATPKPSLSGPIFLGIKNITEDGRLDLSQIRNISEDDFPKW 67 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIAD-FDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + +G I++ A+I F G + +++P + + Sbjct: 68 TKRVLPTEGDIVFSYEATLNLYAMIPKGFRGCLGRRLALIRPDTEIVNPKFLYYSFFGEE 127 Query: 135 QRI---EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 R + + GAT+ N + IP + Q I + I+ Sbjct: 128 WRNTISKNLISGATVDRIPLINFPNFEVSIPIHSIQRKIASILSNYDNLIENNTRRIEIL 187 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + + + K P + + +G +P+ W+V+ LV Sbjct: 188 E----QIAKLVYEEWFVKFRFPGHENVEMVSSELGEIPEGWKVEKLSELVKTQYGYTESA 243 Query: 252 IESNI-------LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 E I ++ + I E + PE + Y + +V R D Sbjct: 244 TEEEIGPKFLRGKDINKQSYISWDEVPFCSISPEVLDKYLLKKGDIVVIRMADP----GK 299 Query: 305 LRSAQVMERGIITSAYMA-VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDV 362 + + + S + I YL + ++S A +G R+S + Sbjct: 300 VGIVETEVNAVFASYLIRLEIIKNIKPYYLFYFLQSDKFQNYVIAASTGTTRKSASAGVI 359 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + +++PP + I + ++++L+ K + L++ R + ++G+ID+ Sbjct: 360 TNIDLIIPPEYLLTLFEDKIGLLRKQLNILINKNQN----LRKTRDLLLPKLISGEIDVS 415 Query: 423 G 423 Sbjct: 416 D 416 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 39/193 (20%), Positives = 71/193 (36%), Gaps = 6/193 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDII---YIGLEDVES-GTGKYLPKDGNSRQS 73 +G IP+ WKV + K G T + ++ I ++ +D+ + S Sbjct: 217 LGEIPEGWKVEKLSELVKTQYGYTESATEEEIGPKFLRGKDINKQSYISWDEVPFCSISP 276 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL--QPKDVLPELLQGWLLSI 131 + + KG I+ ++ + I+ +L+ K++ P L +L S Sbjct: 277 EVLDKYLLKKGDIVVIRMADPGKVGIVETEVNAVFASYLIRLEIIKNIKPYYLFYFLQSD 336 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + A G T A I NI + IPP L +KI +++ LI + Sbjct: 337 KFQNYVIAASTGTTRKSASAGVITNIDLIIPPEYLLTLFEDKIGLLRKQLNILINKNQNL 396 Query: 192 IELLKEKKQALVS 204 + L+S Sbjct: 397 RKTRDLLLPKLIS 409 >gi|313112145|ref|ZP_07797926.1| hypothetical protein PA39016_004130024 [Pseudomonas aeruginosa 39016] gi|310884428|gb|EFQ43022.1| hypothetical protein PA39016_004130024 [Pseudomonas aeruginosa 39016] Length = 277 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 59/300 (19%), Positives = 120/300 (40%), Gaps = 30/300 (10%) Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + ++ G +S+ + +G+I +PPLAEQ I E + D IT + Sbjct: 1 MIKRQFSESGGGTNISNLSQQILGDIAFRLPPLAEQKKIAEIL----STWDQAITTSEQL 56 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +E K++K++L+ ++ SG + + W E Sbjct: 57 LENNKQQKKSLIQQLL------------SGKKRLPGFSTKWRDIRLGEAFQERVEIGFIK 104 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + ++ G +I + ET Y + PG+I + + + L + + Sbjct: 105 LPLLSITAEEG-VIDRDETGRKDTSKSDKSKYLRICPGDIGYNTMRMWQGVSGLSTLE-- 161 Query: 312 ERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPV 367 G+++ AY + P D + ++L + L FY GL +LK+ + ++ Sbjct: 162 --GLVSPAYTVLTPKPEVDPLFASYLFKLPALVHAFYRHSQGLVSDTWNLKYSNFAKIKW 219 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR-GESQ 426 +P ++EQ I V+ A D +E + + LK+ R + + +TG+ ++ E + Sbjct: 220 SIPGVEEQKAIAAVL----ASADREIEILRLQLAGLKQERKALMQQLLTGKRRVKVDEPE 275 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 60/183 (32%), Gaps = 6/183 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + + + + + E + ++ +SD S G Sbjct: 85 WRDIRLGEAFQERVEIGFIK---LPLLSITAEEGVIDRDETGRKDTSKSDKSKYLRICPG 141 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I Y + + + ++ +G+ S + VL PK + L +L + Sbjct: 142 DIGYNTMRMWQGVSGLSTLEGLVSPAYTVLTPKPEVDPLFASYLFKLPALVHAFYRHSQG 201 Query: 145 TM---SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + + + I IP + EQ I + + I+ L + + K Q Sbjct: 202 LVSDTWNLKYSNFAKIKWSIPGVEEQKAIAAVLASADREIEILRLQLAGLKQERKALMQQ 261 Query: 202 LVS 204 L++ Sbjct: 262 LLT 264 >gi|253315067|ref|ZP_04838280.1| putative restriction/modification system specificity protein [Staphylococcus aureus subsp. aureus str. CF-Marseille] Length = 397 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 58/405 (14%), Positives = 130/405 (32%), Gaps = 39/405 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + T + K + I + +Y K +S+ + ++ Sbjct: 14 EWEEKQLGDLTDRVIRKNKNLESKKPLTISGQLGLIDQTEYFSKSVSSKNLE--NYTLIK 71 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G+ Y K G+ S+ ++ K + + R Sbjct: 72 NGEFAYNKSYSNGYPLGAIKRLTRYDSGVLSSLYICFSIKSEMSKDFMEAYFDSTHWYRE 131 Query: 138 EAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + I + P L EQ I + +I+ + Sbjct: 132 VSGIAVEGARNHGLLNVSVNDFFTILIKYPSLEEQQKIGKFFSKLDRQIELEEQKLELLQ 191 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + K Q + S + + G WE + E N ++ Sbjct: 192 QQKKGYMQKIFSQELRFK------------DENGEDYPDWENSKIEKYLKERNERSD--K 237 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + II+ E + Y++V +I + + + + Sbjct: 238 GQMLSVTINSGIIKFSELDRKDNSSKDKSNYKVVRKNDIAYNSMRMWQGASGKSNY---- 293 Query: 313 RGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVL 368 GI++ AY + P S+ + +++ + F GL +LK++ +K + + Sbjct: 294 NGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTSDTWNLKYKQLKNINID 353 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P ++EQ I + ++D+L+ K + I +L++ + SF+ Sbjct: 354 IPVLEEQEKIGDF----FKKMDILISKQKMKIEILEKEKQSFLQK 394 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 66/184 (35%), Gaps = 9/184 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFA 82 W+ I+++ K R+ + + + + SG K+ D ++ D S + Sbjct: 218 DWENSKIEKYLKERNERSDKGQM----LSVT-INSGIIKFSELDRKDNSSKDKSNYKVVR 272 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I Y + + + ++++GI S + VL P L G+ I Sbjct: 273 KNDIAYNSMRMWQGASGKSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINS 332 Query: 143 G---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + +K + NI + IP L EQ I + + I + + + Sbjct: 333 QGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKMKIEILEKEKQSFL 392 Query: 200 QALV 203 Q + Sbjct: 393 QKMF 396 >gi|291285727|ref|YP_003502545.1| putative type I restriction-modification system, S subunit [Escherichia coli O55:H7 str. CB9615] gi|290765600|gb|ADD59561.1| Putative type I restriction-modification system, S subunit [Escherichia coli O55:H7 str. CB9615] gi|320660661|gb|EFX28122.1| putative type I restriction-modification system, S subunit [Escherichia coli O55:H7 str. USDA 5905] Length = 584 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 75/489 (15%), Positives = 141/489 (28%), Gaps = 94/489 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 +K K P+ S + +P+ W+ V I +T KD YI + + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVRISEIGHDWGQKTP--DKDFTYIDVGSINKEY 138 Query: 61 GKYLPKDG-NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQ 115 G +++ + + I +G I+Y + PYL I + + I ST F ++ Sbjct: 139 GIIEELSILSAKDAPSRARKIVQQGTIIYSTVRPYLLNIAIIENEILPEPIASTAFAIIH 198 Query: 116 PKD-VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK- 173 P + + +L S +E G + K + P+PP EQV I K Sbjct: 199 PYTAMDANFIYYYLRSPVFVCYVENCQTGVAYPAINDKQFFSGITPVPPSLEQVRIANKI 258 Query: 174 ----------------------------------------IIAETVRIDTLITERIRFIE 193 + RI Sbjct: 259 KELMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQTAEELAENWARISEYFDTLFTTEV 318 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKD-------------------------------SGI 222 + KQ ++ V L P + S Sbjct: 319 SVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLPPISDE 378 Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNT---------KLIESNILSLSYGNIIQKLETRNM 273 E +P+ WE F ++ + + ++ + E + + Sbjct: 379 EKPFELPEGWEWCLFEDIIDIQSGITKGRNLSNRTLVKVPYLRVANVQRGYLDLTEIKQI 438 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTY 332 + E E YQ+V ++ D R+ + +D + Sbjct: 439 EIPIEEKEKYQVVKGDLLITEGGDWDTVGRTTVWCHDWYIANQNHVFKGRNIGQDVDPYW 498 Query: 333 LAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L M S + F + + S+ ++ PV +PP E I + +++ + Sbjct: 499 LETYMNSPFSRQYFANASKQTTNLASINKTQLRGCPVAIPPSSEAKKIMSKLHIFYKLCE 558 Query: 391 VLVEKIEQS 399 L I+ + Sbjct: 559 ELKNHIQSA 567 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 73/195 (37%), Gaps = 3/195 (1%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET-RNMGLKPE 278 S E +P+ WE + + +K + I S +E + K Sbjct: 93 SEEEKPFELPEGWEWVRISEIGHDWGQKTPDKDFTYIDVGSINKEYGIIEELSILSAKDA 152 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLM 337 +IV G I++ + ++ +++ I ++A+ + P+ +D+ ++ + + Sbjct: 153 PSRARKIVQQGTIIYSTVRPYLLNIAIIENEILPEPIASTAFAIIHPYTAMDANFIYYYL 212 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 RS +G+ ++ + VPP EQ I N I + D L ++ Sbjct: 213 RSPVFVCYVENCQTGVAYPAINDKQFFSGITPVPPSLEQVRIANKIKELMSLCDQLEQQS 272 Query: 397 EQSIVLLKERRSSFI 411 S+ ++ + + Sbjct: 273 LTSLDAHQQLVETLL 287 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 55/202 (27%), Gaps = 13/202 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+ + + +G T + Y+ + +V+ G Sbjct: 383 ELPEGWEWCLFEDIIDIQSGITKGRNLSNRTLVKVPYLRVANVQRGYLDLTEIKQIEIPI 442 Query: 74 DTSTVSIFAKGQILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + KG +L + G + R + I + + Sbjct: 443 EEKEKYQVVKGDLLITEGGDWDTVGRTTVWCHDWYIANQNHVFKGRNIGQDVDPYWLETY 502 Query: 131 IDV----TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 ++ A + ++ + + P+ IPP +E I K+ + L Sbjct: 503 MNSPFSRQYFANASKQTTNLASINKTQLRGCPVAIPPSSEAKKIMSKLHIFYKLCEELKN 562 Query: 187 ERIRFIELLKEKKQALVSYIVT 208 + AL V Sbjct: 563 HIQSAQQTQLHLADALTDAAVN 584 >gi|329117251|ref|ZP_08245968.1| type I restriction modification DNA specificity domain protein [Streptococcus parauberis NCFD 2020] gi|326907656|gb|EGE54570.1| type I restriction modification DNA specificity domain protein [Streptococcus parauberis NCFD 2020] Length = 386 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 47/396 (11%), Positives = 112/396 (28%), Gaps = 34/396 (8%) Query: 24 HWKVVPIKRFTKL-----NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + +++E L +S Y + + + Sbjct: 16 DWEERKLGEIFNYEQPTKYIVKSTEYDDTFNTPVLTAGKSFLLGYTDEITGIKNAT---- 71 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +++ + I S+ +L D + ++ + Sbjct: 72 --VENPVVIF--DDFTTGSHYVDFPFKIKSSAMKLLSLNDNSDNFYFMFNTLKNIKYVPQ 127 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + I P +KI + ++D I R ++LLKE+ Sbjct: 128 SHE----RHWISKFSEFEIYKPSQEEQ------QKIGSFFKQLDDTIALHQRKLDLLKEQ 177 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+ + + K +++ +G + + N Sbjct: 178 KKGFLQKMFPKNGAKVPELRFAGFADDWEERKFSDFTKLSQGLQIAISDRFTEAGPNKEF 237 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + T+ E+ I + +I+ + Sbjct: 238 YITNEFLNPNNTKK--YYIENPSKNVIANTNDILMTRTGNTGKVVT----NTKGAFHNNF 291 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + P I +L +L+ S + K G+ L +D ++ V +P +EQ Sbjct: 292 FKIDYDPKKISKLFLYFLLTSIPIQKEILIRAGTSTIPDLNHKDFYKIKVYLPIFEEQQR 351 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + ++D + ++ + LLKE++ ++ Sbjct: 352 IGSF----FKQLDDTIALHQRKLDLLKEQKKGYLQK 383 >gi|148825521|ref|YP_001290274.1| hypothetical protein CGSHiEE_02145 [Haemophilus influenzae PittEE] gi|148715681|gb|ABQ97891.1| hypothetical protein CGSHiEE_02145 [Haemophilus influenzae PittEE] Length = 383 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 58/388 (14%), Positives = 125/388 (32%), Gaps = 33/388 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + P+ + + SG Y N+ Q + +G+ Sbjct: 7 EWKPLDEVANIVNNARKP-------VKSSSRVSGNIPY--YGANNIQDYVEGYT--HEGE 55 Query: 86 ILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G A + V+ K+ L L+ A Sbjct: 56 FVLIAEDGSASLENYSIQWAVGKFWANNHVHVVNGKEKLNNRFLYHYLTNMNFIPFLA-- 113 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + + IP+PIPPL+ Q I + + A T L +E + L +++ + Sbjct: 114 -GKERAKLTKAKLQQIPIPIPPLSVQTEIVKILDALTALTSELTSELTSELILRQKQYEY 172 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 ++++ ++ G EW K + T N I L Sbjct: 173 YREKLLSEE-----ELGKVGFEW---KTIDEISKKISSGGTPTTSNNGYYDNGTIPWLRT 224 Query: 262 GNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + K E + + + ++ K ++ + + Sbjct: 225 QEVDFKEIWDTNIKITEDALNNSSAKWIPANCVIVAMYGATVGKTAINKIPLTTNQACAN 284 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + Y+ + S + ++GSG + ++ + +K+L V VPPI+EQ+ I Sbjct: 285 --IEINDKLACYRYIFHYLTSKY--EYIKSLGSGSQTNINAQIIKKLKVPVPPIEEQYRI 340 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKER 406 ++++ + + E + +I ++R Sbjct: 341 VSILDKFETLTNSITEGLPLAIEQSQKR 368 >gi|298695075|gb|ADI98297.1| Type I restriction-modification system, specificity subunit S [Staphylococcus aureus subsp. aureus ED133] Length = 392 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 57/397 (14%), Positives = 117/397 (29%), Gaps = 30/397 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ ++ K+N+G+ + ++ G G + Sbjct: 20 EWEEKKLESIIKVNSGKDYK-----------HLDKGDIPVYGTGGYMTSVSEP---LSEI 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + ++ T F K+ + + E Sbjct: 66 DAVGIGRKGTINKPYLLEAPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I I +P EQ I + +I+ + + K Q + Sbjct: 122 TGVPSLSKQTINKINRFVPTNKEQQKIGKFFSKLDRQIELQEQKLELLQQQKKGYMQKIF 181 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + + + V K +ES N Sbjct: 182 SQELRFKDENGNDYPEWENVMLQKVLKDKTEG-IKRGPFGGALKKDIFVESGYAVYEQRN 240 Query: 264 IIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I + + Y+ V P +I+ + +GII A + Sbjct: 241 AIYDISNFRYYINENKYKEMQSFSVQPNDIIMSCSGTIGRLALIPHNYT--KGIINQALI 298 Query: 322 AVKPHGID-STYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + S + MRS + + GS + + +++K +P +P EQ I Sbjct: 299 RFRTNHKIRSEFFLIFMRSNQMQRKILEANPGSAITNLVPVKELKLIPFPLPVKFEQDKI 358 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + I++ I+ +E+ E+ I LK R+ F+ Sbjct: 359 SQFIHI----INRRIEQSEKKIESLKNRKQGFLQKLF 391 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 61/165 (36%), Gaps = 19/165 (11%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRS-----LRSAQVMERGIITSAYMAVKPHGID 329 +K S + Y+ +D G+I S + + + +G I Y+ P Sbjct: 30 IKVNSGKDYKHLDKGDIPVYGTGGYMTSVSEPLSEIDAVGIGRKGTINKPYLLEAPFWTV 89 Query: 330 STYLA----------WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 T +++ + S SL + + ++ VP KEQ I Sbjct: 90 DTLFYCTPKKETDILFILSLFRKINWKVYDESTGVPSLSKQTINKINRFVPTNKEQQKIG 149 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +++D +E EQ + LL++++ ++ + ++ + E Sbjct: 150 KF----FSKLDRQIELQEQKLELLQQQKKGYMQKIFSQELRFKDE 190 >gi|325913553|ref|ZP_08175918.1| hypothetical protein HMPREF0523_1024 [Lactobacillus iners UPII 60-B] gi|325477132|gb|EGC80279.1| hypothetical protein HMPREF0523_1024 [Lactobacillus iners UPII 60-B] Length = 389 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 47/374 (12%), Positives = 117/374 (31%), Gaps = 11/374 (2%) Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST 109 YI +++ G + + + + G IL + PY +K +D CS Sbjct: 25 YITTDNMIPNRGGVVDCESLPKAKRVTRY---EPGDILISNIRPYFKKIWFSDRISGCSN 81 Query: 110 QFLVLQPKDVLPELLQGWLL--SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 +V + D + + + + G M + K I +P + +Q Sbjct: 82 DVIVFRANDENWNKKFLYYVLSQDSFFDFMMSGSNGTKMPRGNKKTIPEFLIPDFDIDKQ 141 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 + I + + A I+ + E + + + G + W Sbjct: 142 IRIADILSAYDSLIENNQKQIKLLEEAAQRLYKEWFVDLHFPGYEDVEIVDGVPEGWKKE 201 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 + + ++ + I LS ++ + + E + + Sbjct: 202 RAECFFKITIGKTPPRAEKQWFVNGNNGIPWLSISDMRDAGTFIFKTREGLTEEAIKKHN 261 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 + I + R A A + Y + +++ + Sbjct: 262 MKIVPPGTIFVSFKLTVGRVAIATTEMCTNEAIAHFYVNDSLQAYTYCYLSNFEYDTL-- 319 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 S + +++ + +K +P ++P DI ++ + I ++ ++ L E R Sbjct: 320 GNTSSISKAVNSKIIKAMPFIMPS----QDIIENFSMIVSPILNEIKAKQEMCNYLSEAR 375 Query: 408 SSFIAAAVTGQIDL 421 + ++G+I++ Sbjct: 376 DRLLPKLMSGEIEV 389 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 66/220 (30%), Gaps = 24/220 (10%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGL 53 H+ Y V+ + +P+ WK + F K+ G+T I ++ + Sbjct: 181 HFPGYE-----DVEIVDGVPEGWKKERAECFFKITIGKTPPRAEKQWFVNGNNGIPWLSI 235 Query: 54 EDVES-GTGKYLPKDG-NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF 111 D+ GT + ++G + I G I + + + IA + + Sbjct: 236 SDMRDAGTFIFKTREGLTEEAIKKHNMKIVPPGTI-FVSFKLTVGRVAIATTEMCTNEAI 294 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 D L +L + + S I +P + I Sbjct: 295 AHFYVNDSLQAYTYCYLSNFEY-------DTLGNTSSISKAVNSKIIKAMPFIMPSQDII 347 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 E I I + L E + L+ +++ + Sbjct: 348 ENFSMIVSPILNEIKAKQEMCNYLSEARDRLLPKLMSGEI 387 >gi|198277087|ref|ZP_03209618.1| hypothetical protein BACPLE_03295 [Bacteroides plebeius DSM 17135] gi|198269585|gb|EDY93855.1| hypothetical protein BACPLE_03295 [Bacteroides plebeius DSM 17135] Length = 475 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 70/445 (15%), Positives = 140/445 (31%), Gaps = 80/445 (17%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IPK W+ I+ ++ G T S + I + ++++G K + D + Sbjct: 30 EIPKGWEWTRIRNISQSYIGLTYSPTDVSSRGTIVLRSSNIQNG--KIVLNDVVRVSKEI 87 Query: 76 STVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S K I+ + A++ D + + K L + + +L S Sbjct: 88 SEKLQVEKNDIIICARNGSAKLVGKSAVVTDVTEPMTFGAFMAICKTALYQYVSIFLQSD 147 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID--------- 182 ++ + T++ + +PIPP EQ I EK+ + I+ Sbjct: 148 LFFSQLRGVSGTTTINQLTQNNFNDFWIPIPPANEQKRIVEKLQNVSPFIERYSKSQETL 207 Query: 183 --------------------------------------TLITERIRFIELLKEKKQALVS 204 I + + + + K+++++ Sbjct: 208 NLMNIQIKEQLKKSILQEAIQGKLVPQIAEEGTAQELLEQIRQEKQKLVKEGKLKKSVLT 267 Query: 205 YIVTKGLNPDVKMKDSGIEWVG-------LVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 V + + + G E + +P W + + + R + Sbjct: 268 DSVIYKGDDNKYWEKYGTETICVNDEIPFEIPATWIWVRLDNICSYIQRGKSPKYSPIKK 327 Query: 258 SL--------SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN---DKRSLR 306 G I K + + P SY +++ G++++ L Sbjct: 328 YPVIAQKCNQWAGFCIDKAQFIDPNSLP-SYSEERLLQDGDLMWNSTGLGTLGRMAIYQS 386 Query: 307 SAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDV 362 + E + S ++P I S YL + S + V + GS ++ L V Sbjct: 387 ALNPYELAVADSHVTVIRPLKEHILSQYLYYYFASDTVQSVIEDKSDGSTKQKELSTTTV 446 Query: 363 KRLPVLVPPIKEQFDITNVINVETA 387 K V +PP +EQ I I T+ Sbjct: 447 KNYLVPIPPYREQQRIVEKIKTVTS 471 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 26/211 (12%), Positives = 62/211 (29%), Gaps = 11/211 (5%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES----NILSLSYGNIIQKLETRNM 273 K E +P WE + + I+ S K+ ++ Sbjct: 21 KCIDEEIPFEIPKGWEWTRIRNISQSYIGLTYSPTDVSSRGTIVLRSSNIQNGKIVLNDV 80 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + V+ +I+ + + +T Y+ Sbjct: 81 VRVSKEISEKLQVEKNDIIICARNGSAKLVGKSAVVTDVTEPMTFGAFMAICKTALYQYV 140 Query: 334 AWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + ++S G+ L + + +PP EQ I + + I+ Sbjct: 141 SIFLQSDLFFSQLRGVSGTTTINQLTQNNFNDFWIPIPPANEQKRIVEKLQNVSPFIERY 200 Query: 393 VEKIEQSIVLL-----KERRSSFIAAAVTGQ 418 K ++++ L+ ++ + S + A+ G+ Sbjct: 201 -SKSQETLNLMNIQIKEQLKKSILQEAIQGK 230 >gi|15923422|ref|NP_370956.1| restriction modification system specificity subunit [Staphylococcus aureus subsp. aureus Mu50] gi|15926110|ref|NP_373643.1| restriction modification system specificity subunit [Staphylococcus aureus subsp. aureus N315] gi|57651318|ref|YP_185367.1| type I restriction-modification system S subunit [Staphylococcus aureus subsp. aureus COL] gi|87159955|ref|YP_493120.1| putative restriction/modification system specificity protein [Staphylococcus aureus subsp. aureus USA300_FPR3757] gi|88194193|ref|YP_498985.1| restriction modification system specificity subunit [Staphylococcus aureus subsp. aureus NCTC 8325] gi|148266893|ref|YP_001245836.1| restriction modification system DNA specificity subunit [Staphylococcus aureus subsp. aureus JH9] gi|150392938|ref|YP_001315613.1| restriction modification system DNA specificity subunit [Staphylococcus aureus subsp. aureus JH1] gi|151220611|ref|YP_001331433.1| type I restriction modification system, site specificity determination subunit [Staphylococcus aureus subsp. aureus str. Newman] gi|156978761|ref|YP_001441020.1| restriction modification system specificity subunit [Staphylococcus aureus subsp. aureus Mu3] gi|161508681|ref|YP_001574340.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus USA300_TCH1516] gi|221141679|ref|ZP_03566172.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus str. JKD6009] gi|255005228|ref|ZP_05143829.2| putative restriction/modification system specificity protein [Staphylococcus aureus subsp. aureus Mu50-omega] gi|257795445|ref|ZP_05644424.1| restriction modification system specificity subunit [Staphylococcus aureus A9781] gi|258413471|ref|ZP_05681746.1| restriction modification system specificity subunit [Staphylococcus aureus A9763] gi|258421405|ref|ZP_05684332.1| type I restriction modification system [Staphylococcus aureus A9719] gi|258436895|ref|ZP_05689235.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus A9299] gi|258444387|ref|ZP_05692721.1| restriction modification system specificity subunit [Staphylococcus aureus A8115] gi|258445599|ref|ZP_05693779.1| restriction modification system specificity subunit [Staphylococcus aureus A6300] gi|258448131|ref|ZP_05696260.1| type I restriction-modification system S subunit [Staphylococcus aureus A6224] gi|258455963|ref|ZP_05703918.1| type I restriction-modification system S subunit [Staphylococcus aureus A5937] gi|269202054|ref|YP_003281323.1| type I restriction-modification system S subunit [Staphylococcus aureus subsp. aureus ED98] gi|282893572|ref|ZP_06301805.1| type I restriction enzyme, S subunit [Staphylococcus aureus A8117] gi|282927466|ref|ZP_06335084.1| type I restriction enzyme, S subunit [Staphylococcus aureus A10102] gi|294850454|ref|ZP_06791184.1| type I restriction enzyme [Staphylococcus aureus A9754] gi|295405682|ref|ZP_06815492.1| type I restriction enzyme [Staphylococcus aureus A8819] gi|297245590|ref|ZP_06929458.1| type I restriction enzyme [Staphylococcus aureus A8796] gi|13700323|dbj|BAB41621.1| probable restriction modification system specificity subunit [Staphylococcus aureus subsp. aureus N315] gi|14246200|dbj|BAB56594.1| probable restriction modification system specificity subunit [Staphylococcus aureus subsp. aureus Mu50] gi|57285504|gb|AAW37598.1| type I restriction-modification system, S subunit, EcoA family, putative [Staphylococcus aureus subsp. aureus COL] gi|87125929|gb|ABD20443.1| putative restriction/modification system specificity protein [Staphylococcus aureus subsp. aureus USA300_FPR3757] gi|87201751|gb|ABD29561.1| restriction modification system specificity subunit, putative [Staphylococcus aureus subsp. aureus NCTC 8325] gi|147739962|gb|ABQ48260.1| restriction modification system DNA specificity domain [Staphylococcus aureus subsp. aureus JH9] gi|149945390|gb|ABR51326.1| restriction modification system DNA specificity domain [Staphylococcus aureus subsp. aureus JH1] gi|150373411|dbj|BAF66671.1| type I restriction modification system, site specificity determination subunit [Staphylococcus aureus subsp. aureus str. Newman] gi|156720896|dbj|BAF77313.1| probable restriction modification system specificity subunit [Staphylococcus aureus subsp. aureus Mu3] gi|160367490|gb|ABX28461.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus USA300_TCH1516] gi|257789417|gb|EEV27757.1| restriction modification system specificity subunit [Staphylococcus aureus A9781] gi|257839718|gb|EEV64187.1| restriction modification system specificity subunit [Staphylococcus aureus A9763] gi|257842829|gb|EEV67251.1| type I restriction modification system [Staphylococcus aureus A9719] gi|257848686|gb|EEV72673.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus A9299] gi|257850646|gb|EEV74594.1| restriction modification system specificity subunit [Staphylococcus aureus A8115] gi|257855549|gb|EEV78484.1| restriction modification system specificity subunit [Staphylococcus aureus A6300] gi|257858646|gb|EEV81520.1| type I restriction-modification system S subunit [Staphylococcus aureus A6224] gi|257862175|gb|EEV84948.1| type I restriction-modification system S subunit [Staphylococcus aureus A5937] gi|262074344|gb|ACY10317.1| type I restriction-modification system S subunit [Staphylococcus aureus subsp. aureus ED98] gi|269940012|emb|CBI48388.1| type I restriction-modification system specificity protein [Staphylococcus aureus subsp. aureus TW20] gi|282590790|gb|EFB95866.1| type I restriction enzyme, S subunit [Staphylococcus aureus A10102] gi|282764258|gb|EFC04385.1| type I restriction enzyme, S subunit [Staphylococcus aureus A8117] gi|285816132|gb|ADC36619.1| Type I restriction-modification system, specificity subunit S [Staphylococcus aureus 04-02981] gi|294822657|gb|EFG39096.1| type I restriction enzyme [Staphylococcus aureus A9754] gi|294969757|gb|EFG45776.1| type I restriction enzyme [Staphylococcus aureus A8819] gi|297177576|gb|EFH36827.1| type I restriction enzyme [Staphylococcus aureus A8796] gi|302750322|gb|ADL64499.1| restriction endonuclease S subunit [Staphylococcus aureus subsp. aureus str. JKD6008] gi|312828928|emb|CBX33770.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus ECT-R 2] gi|315130060|gb|EFT86049.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus CGS03] gi|320139278|gb|EFW31157.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus MRSA131] gi|329313151|gb|AEB87564.1| Restriction modification system DNA specificity domain protein [Staphylococcus aureus subsp. aureus T0131] gi|329725596|gb|EGG62075.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus 21172] gi|329730503|gb|EGG66892.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus 21189] Length = 403 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 58/405 (14%), Positives = 130/405 (32%), Gaps = 39/405 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + T + K + I + +Y K +S+ + ++ Sbjct: 20 EWEEKQLGDLTDRVIRKNKNLESKKPLTISGQLGLIDQTEYFSKSVSSKNLE--NYTLIK 77 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G+ Y K G+ S+ ++ K + + R Sbjct: 78 NGEFAYNKSYSNGYPLGAIKRLTRYDSGVLSSLYICFSIKSEMSKDFMEAYFDSTHWYRE 137 Query: 138 EAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + I + P L EQ I + +I+ + Sbjct: 138 VSGIAVEGARNHGLLNVSVNDFFTILIKYPSLEEQQKIGKFFSKLDRQIELEEQKLELLQ 197 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + K Q + S + + G WE + E N ++ Sbjct: 198 QQKKGYMQKIFSQELRFK------------DENGEDYPDWENSKIEKYLKERNERSD--K 243 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + II+ E + Y++V +I + + + + Sbjct: 244 GQMLSVTINSGIIKFSELDRKDNSSKDKSNYKVVRKNDIAYNSMRMWQGASGKSNY---- 299 Query: 313 RGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVL 368 GI++ AY + P S+ + +++ + F GL +LK++ +K + + Sbjct: 300 NGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTSDTWNLKYKQLKNINID 359 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P ++EQ I + ++D+L+ K + I +L++ + SF+ Sbjct: 360 IPVLEEQEKIGDF----FKKMDILISKQKMKIEILEKEKQSFLQK 400 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 66/184 (35%), Gaps = 9/184 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFA 82 W+ I+++ K R+ + + + + SG K+ D ++ D S + Sbjct: 224 DWENSKIEKYLKERNERSDKGQM----LSVT-INSGIIKFSELDRKDNSSKDKSNYKVVR 278 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I Y + + + ++++GI S + VL P L G+ I Sbjct: 279 KNDIAYNSMRMWQGASGKSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINS 338 Query: 143 G---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + +K + NI + IP L EQ I + + I + + + Sbjct: 339 QGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKMKIEILEKEKQSFL 398 Query: 200 QALV 203 Q + Sbjct: 399 QKMF 402 >gi|302332147|gb|ADL22340.1| type I restriction modification system, site specificity determination subunit, HsdS_1 [Staphylococcus aureus subsp. aureus JKD6159] Length = 411 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 60/401 (14%), Positives = 140/401 (34%), Gaps = 25/401 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ ++ + K+ +I D+ S L DGN V Sbjct: 20 EWEEKKLEDLGLFQKSYSFSRAKEGNGKTKHIHYGDIHSKFKTVLDSDGNIPNIIEKAVF 79 Query: 80 -IFAKGQILYGK----LGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 + KG I++ + +I + + + L + ++ Sbjct: 80 ELIQKGDIVFADASEDYSDLGKAVMIDFEPNSLISGLHTHLFRPLNNAISNFLIFYTKTL 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + I G ++ K + N+ + IP + +KI ++D I + Sbjct: 140 SYKKFIRQQGTGISVLGISKKSLLNLNVLIPRSELEQ---QKIGQFFSKLDRQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I ++ L + + EW + + N + Sbjct: 197 LELLQQQKKGYMQKIFSQELRFKDENGNDYPEWENKRIEDIANVNKGFTPSTNNNEYWDN 256 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + N LS++ N + L N G+ ++ + Y V ++ F +++ Sbjct: 257 NDKNWLSIAGMNQ-KYLYKGNKGISKDAAKNYMKVKNDTLIMSFKLTIGKLAIVKAPLYT 315 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 I + K + I++ ++ + + S ++ G+ +L + + + V +P Sbjct: 316 NEAIC---HFIWKVNKINTEFIYYYLNSLNISTFGVQAVKGV--TLNNDSINSIIVKLPN 370 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +EQ I + ++ + K + LLK+R+ + Sbjct: 371 EEEQNIIAKFLLEVDKTVNNQLVKTK----LLKQRKKGLLQ 407 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 57/186 (30%), Gaps = 10/186 (5%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+++ EW + + + N K + + N Sbjct: 10 PELRFPGFEGEWEEKKLEDLGLFQKSYSFSRAKEGNGKTKHIHYGDIHSKFKTVLDSDGN 69 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--- 329 + E ++++ G+IVF E + S ++ Sbjct: 70 IPNIIEK-AVFELIQKGDIVFADASEDYSDLGKAVMIDFEPNSLISGLHTHLFRPLNNAI 128 Query: 330 STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETA 387 S +L + ++ K G+G+ + + + L VL+P EQ I + Sbjct: 129 SNFLIFYTKTLSYKKFIRQQGTGISVLGISKKSLLNLNVLIPRSELEQQKIGQF----FS 184 Query: 388 RIDVLV 393 ++D + Sbjct: 185 KLDRQI 190 >gi|238923780|ref|YP_002937296.1| putative type I restriction enzyme (specificity subunit) [Eubacterium rectale ATCC 33656] gi|238875455|gb|ACR75162.1| putative type I restriction enzyme (specificity subunit) [Eubacterium rectale ATCC 33656] Length = 425 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 70/422 (16%), Positives = 136/422 (32%), Gaps = 44/422 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P+ + + K G T +S KD +I ++V S L + R Sbjct: 16 PEGVEYKTLGECGKFYGGLTGKSKKDFEDGNSKFITYKNVYSNPALCLDVEDKVRIEPGE 75 Query: 77 TVSIFAKGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQ---PKDVLPELLQG 126 G I++ + I + + ++ + + P +LP+ + Sbjct: 76 RQRTLEYGDIVFTGSSETPDECGISSVVAEIPEENLYLNSFCFIFRFDDPSILLPDFAKH 135 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S ++ +I G T + K +G + P+PPL Q I + + T+ L Sbjct: 136 LFRSSELRYQIGKTASGVTRYNVSKKLMGKVSFPVPPLEVQREIVRVLDSFTLLTAELTA 195 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 E + + + L++ D I +G V A T + Sbjct: 196 ELTARKQQYEFYRDYLLN-----------GNSDYDICNLGDV------CDVVAGGTPSRK 238 Query: 247 KNTKLIESNILSLSYGNIIQKLE----TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + + I L K T + + +++ + + K Sbjct: 239 VSDYWEDGCIPWLGSTVCKNKKNVDEPTEFITELGLEKSSAKMMKKDTTLIALVGATIGK 298 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 + + V I Y + +Y+ + + + GS +L F Sbjct: 299 VAFTTFDVAINQNIAGVYPKDTSKI-NPSYIYYACTTLYPHFLNLTQGSKLAMANLTF-- 355 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTG 417 V+ L + VPPI Q + NV++ + L + I K+ R + + A TG Sbjct: 356 VRGLKISVPPIDVQNHLVNVLDNFESITSDLSIGLPAEIEARKKQYEYYRDALLTYASTG 415 Query: 418 QI 419 +I Sbjct: 416 KI 417 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 66/199 (33%), Gaps = 14/199 (7%) Query: 228 VPDHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQKL---ETRNMGLKPESY 280 P+ E K + N ++Y N+ ++ E Sbjct: 15 CPEGVEYKTLGECGKFYGGLTGKSKKDFEDGNSKFITYKNVYSNPALCLDVEDKVRIEPG 74 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSA---QVMERGIITSAYMAVK---PHGIDSTYLA 334 E + ++ G+IVF D+ + S E + S + P + + Sbjct: 75 ERQRTLEYGDIVFTGSSETPDECGISSVVAEIPEENLYLNSFCFIFRFDDPSILLPDFAK 134 Query: 335 WLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 L RS +L SG R ++ + + ++ VPP++ Q +I V++ T L Sbjct: 135 HLFRSSELRYQIGKTASGVTRYNVSKKLMGKVSFPVPPLEVQREIVRVLDSFTLLTAELT 194 Query: 394 EKIEQSIVLLKERRSSFIA 412 ++ + R + Sbjct: 195 AELTARKQQYEFYRDYLLN 213 >gi|217980317|ref|YP_002364293.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS223] gi|217500954|gb|ACK48926.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS223] Length = 428 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 63/425 (14%), Positives = 134/425 (31%), Gaps = 35/425 (8%) Query: 26 KVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 K + + R + + +I D+ G + S+ G Sbjct: 4 KTYTLGEIASNTSRRFNFVGNEQVCFINTGDILDGHF-LTNERVQSKGLPGQAKKAIQHG 62 Query: 85 QILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGW----LLSIDVTQRI 137 ILY ++ P ++ ++ + D + ST+F+V+ + + + ++ Sbjct: 63 DILYSEIRPGNKRHLLVEGDVDDYVVSTKFMVITCDHDVVLPEYLYLVLTSKECEAEFKV 122 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 A T + + P+ +P L EQ + E I + T +++ ++ + Sbjct: 123 IADSRSGTFPQITFDAVAYYPIELPSLNEQRNVVEIIKSITQKLNVNKDINSTSEDIAQA 182 Query: 198 KKQALVS-----YIVTKGLNPDVKMKDSG--------IEWVGLVPDHWEVKPFFA--LVT 242 ++ G P+ + +GL+P+ W + V Sbjct: 183 IFKSWFVDFDPVKAKMNGEQPEGMDAATASLFPEKLVESELGLIPEGWHIHNTQDLFEVR 242 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETR-----NMGLKPESYETYQIVDPGEIVFRFID 297 + + K E+ ++ +I + L E VD +I+ I Sbjct: 243 DGTHDSPKKAENGYYLVTSKHITKGKIDTSSAYLISELDFEQVNQRSKVDTFDILLTMIG 302 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQS 356 + + V E I W ++S+ + M +Q Sbjct: 303 TVGEVVVVYDNPV-EFAIKNVGLFKTSQKPELVWLFYWHLQSFKMKNYLEVRMAGTTQQY 361 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L + ++ +PVLVP N + + + L E R + ++ Sbjct: 362 LTLKTLRTIPVLVPSQNLLQKF----NELISPLMGKISDNHNQNQSLSEMRDILLPKLLS 417 Query: 417 GQIDL 421 G+IDL Sbjct: 418 GEIDL 422 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 60/195 (30%), Gaps = 8/195 (4%) Query: 18 IGAIPKHWKVVPIKRFTKLNTG--RTSESGKDIIYI-GLEDVESGTGKYLPKDGNSRQ-- 72 +G IP+ W + + ++ G + + ++ Y+ + + G S Sbjct: 223 LGLIPEGWHIHNTQDLFEVRDGTHDSPKKAENGYYLVTSKHITKGKIDTSSAYLISELDF 282 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLL 129 + S IL +G ++ D I + K L L L Sbjct: 283 EQVNQRSKVDTFDILLTMIGTVGEVVVVYDNPVEFAIKNVGLFKTSQKPELVWLFYWHLQ 342 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + +E G T + K + IP+ +P E I +I + Sbjct: 343 SFKMKNYLEVRMAGTTQQYLTLKTLRTIPVLVPSQNLLQKFNELISPLMGKISDNHNQNQ 402 Query: 190 RFIELLKEKKQALVS 204 E+ L+S Sbjct: 403 SLSEMRDILLPKLLS 417 >gi|167854770|ref|ZP_02477548.1| type I restriction enzyme EcoKI subunit R [Haemophilus parasuis 29755] gi|167854068|gb|EDS25304.1| type I restriction enzyme EcoKI subunit R [Haemophilus parasuis 29755] Length = 397 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 50/406 (12%), Positives = 135/406 (33%), Gaps = 24/406 (5%) Query: 21 IPKHWKVVPIKRFTKLNT--GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W + I + + G+ + + ++ G + + +D + V Sbjct: 6 LPEGWNKINITKVFTQISTTGKNIATKDCLSVGKYPVIDQG-----AEYISGYFNDETKV 60 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I++ G + R + DFD I + + + + + + + Sbjct: 61 IPVENKVIVF---GDHTRNFKLIDFDFIVGADGVKIFQPAKDIDPDFFYYQCLSLNLPNK 117 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + P ++Q + +K ++ + + LLK Sbjct: 118 GYHRHFRY-------LKECDFIYPSFSQQQKLAKKFTVLLSQVAEIKQRLEKIPALLKTY 170 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 +Q++++ V L+ + +++G+ V + + + N I Sbjct: 171 RQSVLARAVNGELSAKWR-EENGVSLDSWVYEKAQHICDKVQSGSTPKGNPFEQNGTIPF 229 Query: 259 LSYGNIIQ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 L NI+ + + + E + I P +++ + K ++ + Q E I Sbjct: 230 LKVYNIVNQELNFDYKPQFVTKEQHSQRSITLPNDVLMNIVGPPLGKVAIVTNQYSEWNI 289 Query: 316 ITSA-YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPI 372 + P + + +++R + G+ + ++ + + V VP + Sbjct: 290 NQAITLFRCNPRNLHYKFFYFVLREGRFIREIEHDLKGIVGQINISLSQCRDMIVPVPTL 349 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +EQ IT + + L ++ ++ + + +A G+ Sbjct: 350 EEQNYITQAVEKHLNFANQLEAQVNAALERVNLMTQAILAKGFRGE 395 >gi|163748972|ref|ZP_02156223.1| Restriction endonuclease S subunit [Shewanella benthica KT99] gi|161331348|gb|EDQ02236.1| Restriction endonuclease S subunit [Shewanella benthica KT99] Length = 601 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 55/475 (11%), Positives = 126/475 (26%), Gaps = 98/475 (20%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSD--TS 76 +P W + + G +S + + +++ T + + S Sbjct: 107 ELPSSWAWSRMGDLAQYQKGYAFKSKDYLDSGFMITKIQNLTDNHTQNSVYIAPAKAMES 166 Query: 77 TVSIFAKGQILYGKLGPYLRKAI-----------IADFDGICSTQFLVLQPKDVLPELLQ 125 + + G I+ +G + I + D + + K+ P L Sbjct: 167 KQYLLSDGDIVMTTVGSWFTAPISAVGRSFLISKLFDNSLLNQNAVRISSVKEFDPMYLY 226 Query: 126 GWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE------- 177 + S + +G + I + + +PPLAEQ I K Sbjct: 227 ICVNSPIFKNYLVKEAQGTANQASITQASIKHFLICVPPLAEQHRIVAKADELMTLCDQL 286 Query: 178 ----------------------------------TVRIDTLITERIRFIELLKEKKQALV 203 RI +++ KQ ++ Sbjct: 287 EQQTEESLSAHQTLVEVLLSTLTESKSAEDFQTSWQRIAEYFDLLFTTELSIEKLKQTIL 346 Query: 204 SYIVTKGLNPDVKMKD-------------------------------SGIEWVGLVPDHW 232 V L P + + E +P W Sbjct: 347 QLAVMGKLVPQNPSDEPASVLLEKIAEEKAQLISDKKIKKQKALPAITDEEKPFELPSGW 406 Query: 233 EVKPFFA------LVTELNRKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQI 285 E + + + ++ I+ L N L+ + + + Sbjct: 407 EFERLGNLTSRLGSGSTPRGGQSAYVDKGIIFLRSQNVWNDGLKLDDTAYITDETHDKMV 466 Query: 286 ---VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 V P +++ + + +++ + + +L + S + Sbjct: 467 NTHVFPNDVLLNITGASLGRSIIFPEKLVTANVSQHVTIIRLLEVSMCKFLHLGIMSPLV 526 Query: 343 CKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 K+ + G + L + +++ VPP+ EQ I ++ A + L ++ Sbjct: 527 QKLVWGRQVGMAIEGLSKKVLEQFEFPVPPLAEQQRIVAKVDELMALCEQLKARL 581 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 14/201 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLK 276 + E + +P W L K+ ++S + N+ ++ + Sbjct: 100 TDEEKMFELPSSWAWSRMGDLAQYQKGYAFKSKDYLDSGFMITKIQNLTDNHTQNSVYIA 159 Query: 277 PES--YETYQIVDPGEIVFRFIDLQND------KRSLRSAQVMERGIITSAYMAVKP-HG 327 P ++ G+IV + RS +++ + ++ + + Sbjct: 160 PAKAMESKQYLLSDGDIVMTTVGSWFTAPISAVGRSFLISKLFDNSLLNQNAVRISSVKE 219 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 D YL + S G + S+ +K + VPP+ EQ I + Sbjct: 220 FDPMYLYICVNSPIFKNYLVKEAQGTANQASITQASIKHFLICVPPLAEQHRIVAKADEL 279 Query: 386 TARIDVLVEKIEQSIVLLKER 406 D L ++ E+S+ + Sbjct: 280 MTLCDQLEQQTEESLSAHQTL 300 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 64/198 (32%), Gaps = 12/198 (6%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSES------GKDIIYIGLEDVESGTGKYLPK-DGNSR 71 +P W+ + T L +G T K II++ ++V + K Sbjct: 401 ELPSGWEFERLGNLTSRLGSGSTPRGGQSAYVDKGIIFLRSQNVWNDGLKLDDTAYITDE 460 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQF-LVLQPKDVLPELLQGW 127 D + +L G L ++II S ++ + + + L Sbjct: 461 THDKMVNTHVFPNDVLLNITGASLGRSIIFPEKLVTANVSQHVTIIRLLEVSMCKFLHLG 520 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 ++S V + + G + K + P+PPLAEQ I K+ + L Sbjct: 521 IMSPLVQKLVWGRQVGMAIEGLSKKVLEQFEFPVPPLAEQQRIVAKVDELMALCEQLKAR 580 Query: 188 RIRFIELLKEKKQALVSY 205 A+VS Sbjct: 581 LSDAQTTQLHLADAVVSN 598 >gi|258452440|ref|ZP_05700448.1| type I restriction modification system [Staphylococcus aureus A5948] gi|282924487|ref|ZP_06332157.1| type I restriction enzyme, S subunit [Staphylococcus aureus A9765] gi|284023443|ref|ZP_06377841.1| type I restriction-modification system S subunit [Staphylococcus aureus subsp. aureus 132] gi|257859840|gb|EEV82680.1| type I restriction modification system [Staphylococcus aureus A5948] gi|282592796|gb|EFB97801.1| type I restriction enzyme, S subunit [Staphylococcus aureus A9765] gi|315196674|gb|EFU27020.1| type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus CGS01] gi|320142971|gb|EFW34764.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus MRSA177] Length = 390 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 58/405 (14%), Positives = 130/405 (32%), Gaps = 39/405 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + T + K + I + +Y K +S+ + ++ Sbjct: 7 EWEEKQLGDLTDRVIRKNKNLESKKPLTISGQLGLIDQTEYFSKSVSSKNLE--NYTLIK 64 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G+ Y K G+ S+ ++ K + + R Sbjct: 65 NGEFAYNKSYSNGYPLGAIKRLTRYDSGVLSSLYICFSIKSEMSKDFMEAYFDSTHWYRE 124 Query: 138 EAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + I + P L EQ I + +I+ + Sbjct: 125 VSGIAVEGARNHGLLNVSVNDFFTILIKYPSLEEQQKIGKFFSKLDRQIELEEQKLELLQ 184 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + K Q + S + + G WE + E N ++ Sbjct: 185 QQKKGYMQKIFSQELRFK------------DENGEDYPDWENSKIEKYLKERNERSD--K 230 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + II+ E + Y++V +I + + + + Sbjct: 231 GQMLSVTINSGIIKFSELDRKDNSSKDKSNYKVVRKNDIAYNSMRMWQGASGKSNY---- 286 Query: 313 RGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVL 368 GI++ AY + P S+ + +++ + F GL +LK++ +K + + Sbjct: 287 NGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTSDTWNLKYKQLKNINID 346 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P ++EQ I + ++D+L+ K + I +L++ + SF+ Sbjct: 347 IPVLEEQEKIGDF----FKKMDILISKQKMKIEILEKEKQSFLQK 387 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 66/184 (35%), Gaps = 9/184 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFA 82 W+ I+++ K R+ + + + + SG K+ D ++ D S + Sbjct: 211 DWENSKIEKYLKERNERSDKGQM----LSVT-INSGIIKFSELDRKDNSSKDKSNYKVVR 265 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I Y + + + ++++GI S + VL P L G+ I Sbjct: 266 KNDIAYNSMRMWQGASGKSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINS 325 Query: 143 G---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + +K + NI + IP L EQ I + + I + + + Sbjct: 326 QGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKMKIEILEKEKQSFL 385 Query: 200 QALV 203 Q + Sbjct: 386 QKMF 389 >gi|332289037|ref|YP_004419889.1| EcoKI restriction-modification system protein HsdS [Gallibacterium anatis UMN179] gi|330431933|gb|AEC16992.1| EcoKI restriction-modification system protein HsdS [Gallibacterium anatis UMN179] Length = 390 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 56/394 (14%), Positives = 120/394 (30%), Gaps = 35/394 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + KL GR IG V Y + N+ + T + F + Sbjct: 14 EWKTLGEVAKLQRGRVISKQYLSENIGDYPV------YSSQTANNGEIGTISTFDFDQEA 67 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I + G T L L +L +L + + + G Sbjct: 68 ITWTTDGANAGTVFHRLGKFSI-TNVCGLVNILDLQQLDYKFLFYWLSIEAKKYVYSGMG 126 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + I +PIPPL+ Q I + A T + L +++ Q Sbjct: 127 NPKLMSNQMEKIKIPIPPLSVQKEIARILDAFTAITSE----LTSELTLRQKQYQHYRDK 182 Query: 206 IVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 ++T G +EW +G + + + + + + Sbjct: 183 LLTFG---------DEVEWKTLGEITSPTKNIQWKNNTQAYRYIDLTSVSRENHCI---- 229 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 LET + ++V +++F + +L + + + T + Sbjct: 230 ----LETTEITALNAPSRAQRLVKKDDVIFATTRPTQLRFALINDIYSGQVVSTGYCVLR 285 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + ++ + + + SG ++ VK + + I+EQ I +V+ Sbjct: 286 AKEEVLPKWIYYCISTIKFKNYVEENQSGSAYPAISDAKVKEFRIPILSIQEQKRIVSVL 345 Query: 383 NVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + + L + + + I L ++ R + Sbjct: 346 DKFETLTNSLSDGLPKEIELRQKQYEYYRDLLLN 379 >gi|160945143|ref|ZP_02092369.1| hypothetical protein FAEPRAM212_02662 [Faecalibacterium prausnitzii M21/2] gi|158442874|gb|EDP19879.1| hypothetical protein FAEPRAM212_02662 [Faecalibacterium prausnitzii M21/2] Length = 424 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 56/409 (13%), Positives = 122/409 (29%), Gaps = 22/409 (5%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQS-DT 75 P + + + G + ++ + G + K + Sbjct: 13 PDGVEYKTLGEIAVDIYRGAGITRDQVTVDGTPCVRYGEIYTTYGVWFDKCVSHTDEAKL 72 Query: 76 STVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++ F G +L+ G + + + +V+ + P+ L L + Sbjct: 73 TSKKYFEYGDVLFAITGESVDDIAKCCAYIGHEKCLAGGDIVVLKHNQDPKYLSYVLATT 132 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 D Q+ + + H+ I I +PIPP+ Q I + T I L + Sbjct: 133 DARQQKSKGKVKSKVVHSSVPAIREIKVPIPPIEIQREIVRILDDYTENIVELQNQLTAE 192 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I +++ + ++T + + D E + + D + + Sbjct: 193 ITARQKQYEFYRDKLLTFDVLRGGTI-DFDREILCRIADLGKWSGGKTPSMAEKKYWESG 251 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + S I ++ + + G + + Sbjct: 252 TIPWVSSKDVKQPILSDTIDHITNAAVDEASMTVYPAGSVAIVTRSGILRHTFPVTYIPF 311 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 E + + V GI S Y++ +++Y + + G SL F+ V + VP Sbjct: 312 ETTVNQDIKILVTKEGISSRYVSHALQAYGESIRRTTKKQGGTVDSLDFQKVLAYKIPVP 371 Query: 371 PIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKERRSSFIA 412 P+ Q I NV++ L +E ++ R + Sbjct: 372 PLDVQNRIVNVLDNFEKICSDLNIGLPAEIEARQKQYEY---YRDKLLT 417 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 60/171 (35%), Gaps = 10/171 (5%) Query: 257 LSLSYGNIIQKLETRNMGLKPESYE----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + YG I + E + + + G+++F D + A + Sbjct: 45 PCVRYGEIYTTYGVWFDKCVSHTDEAKLTSKKYFEYGDVLFAITGESVDDIAKCCAYIGH 104 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPP 371 + + V H D YL++++ + D + + ++ + V +PP Sbjct: 105 EKCLAGGDIVVLKHNQDPKYLSYVLATTDARQQKSKGKVKSKVVHSSVPAIREIKVPIPP 164 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA-AAVTG 417 I+ Q +I +++ T I L ++ I ++ R + + G Sbjct: 165 IEIQREIVRILDDYTENIVELQNQLTAEITARQKQYEFYRDKLLTFDVLRG 215 >gi|152979297|ref|YP_001344926.1| restriction modification system DNA specificity subunit [Actinobacillus succinogenes 130Z] gi|150841020|gb|ABR74991.1| restriction modification system DNA specificity domain [Actinobacillus succinogenes 130Z] Length = 382 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 57/408 (13%), Positives = 125/408 (30%), Gaps = 46/408 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS- 76 K W+ + + T + + S +I + ++ L D ++ Sbjct: 6 KGWEYIKLGDIATTVTSGSRDWAKYYSDTGAKFIRMTNLNRNGINLLLDDLKFVNVKSNS 65 Query: 77 ---TVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQ--FLVLQPKDVLPELLQGWLL 129 + IL + I + G + + + P + + L Sbjct: 66 SDGKRTALQANDILMSITAELGKIGFIPENFGEAYINQHTALIRIDPSKAYAKFIAYVLS 125 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + Q I ++ + + + I + + IPPL+EQ+ I E + A I T + Sbjct: 126 SRTMNQTINSLNDAGAKAGLNLPTIRALSLNIPPLSEQIKIAEILSAWDNAIQTTEKQIT 185 Query: 190 RFIELLKEKKQALVS--YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + K Q L+S V+ +K S I +G + K Sbjct: 186 NSQQQKKALIQMLLSGEKRVSGFSGEWKIVKISDICNIGRG--------------RVISK 231 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + + GE V D N ++ Sbjct: 232 QEIEKNQGKYPVYSSQTLNNGVMGYLDSFDFD---------GEFVTWTTDGVN-AGTIFY 281 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 ++ K ++ +LA+++ + V + + + L + + V Sbjct: 282 RNGKFNCTNVCGVLSSKLEQLNLRFLAYILSTVSYKYVSHTLAN---PKLMNGVMGTIEV 338 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +P ++EQ I ++ I+ L ++ + LK + + + Sbjct: 339 KLPQLEEQQKIAEILTTADQEIETL----QRKLECLKLEKRALMQGVF 382 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 69/188 (36%), Gaps = 5/188 (2%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + K I L+ + N++ S + +I+ Sbjct: 26 DWAKYYSDTGAKFIRMTNLNRNGINLLLDDLKFVNVKSNSSDGKRTALQANDILMSITAE 85 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSL 357 + +A + + P + ++A+++ S + + ++ +G + L Sbjct: 86 LGKIGFIPENFGEAYINQHTALIRIDPSKAYAKFIAYVLSSRTMNQTINSLNDAGAKAGL 145 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 ++ L + +PP+ EQ I +++ D ++ E+ I ++++ + I ++G Sbjct: 146 NLPTIRALSLNIPPLSEQIKIAEILSAW----DNAIQTTEKQITNSQQQKKALIQMLLSG 201 Query: 418 QIDLRGES 425 + + G S Sbjct: 202 EKRVSGFS 209 >gi|328545366|ref|YP_004305475.1| Restriction modification system DNA specificity domain protein [polymorphum gilvum SL003B-26A1] gi|326415108|gb|ADZ72171.1| Restriction modification system DNA specificity domain protein [Polymorphum gilvum SL003B-26A1] Length = 298 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 46/323 (14%), Positives = 104/323 (32%), Gaps = 32/323 (9%) Query: 106 ICSTQFLVLQPK-DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 S F+ + + + E + G+T+ + + PPL Sbjct: 2 AVSQHFIAWSCSAKRVLDPWFLYAWMQTQKPFFERMAVGSTIKTIGLPIFKRLTIDFPPL 61 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 EQ I + ++ + + L L+ + L+ + + Sbjct: 62 PEQRRIAAILRTWDEALEKVTALHAAKVRRLDGLAAWLIHDEQAERLHLRDFLSEVSTRN 121 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 G +E + + + + + Y+ Sbjct: 122 RGQQ-----------------------VERVLSVTNSAGFVLAEDQFAHRVASADLSNYK 158 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLC 343 IV G+ + + S+ E G ++ Y+ + G+DS + +RS + Sbjct: 159 IVRRGQYAYNPSRIN--VGSIARLDAWEAGALSPMYVVFQVRDGLDSDFFQHWLRSAEAR 216 Query: 344 KVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + G +R+++ F D+ + + VP I+ Q I+ +N + IE I Sbjct: 217 QRIALAAQGSVRETVSFGDLGSILIPVPTIERQQSISRALNAGREE----IALIEAEIEA 272 Query: 403 LKERRSSFIAAAVTGQIDLRGES 425 L ++ + +TG+ ++ E+ Sbjct: 273 LTRQKRGLMQKLLTGEWRVKLEA 295 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 14/103 (13%), Positives = 33/103 (32%), Gaps = 4/103 (3%) Query: 314 GIITSAYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 ++ ++A D +L M++ +++ KRL + P Sbjct: 1 MAVSQHFIAWSCSAKRVLDPWFLYAWMQTQKPFFE-RMAVGSTIKTIGLPIFKRLTIDFP 59 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P+ EQ I ++ ++ + + L + I Sbjct: 60 PLPEQRRIAAILRTWDEALEKVTALHAAKVRRLDGLAAWLIHD 102 Score = 40.2 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 31/136 (22%), Positives = 51/136 (37%), Gaps = 3/136 (2%) Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQ-GWL 128 +D S I +GQ Y + D G S ++V Q +D L WL Sbjct: 151 SADLSNYKIVRRGQYAYNPSRINVGSIARLDAWEAGALSPMYVVFQVRDGLDSDFFQHWL 210 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + QRI +G+ + +G+I +P+P + Q I + A I + E Sbjct: 211 RSAEARQRIALAAQGSVRETVSFGDLGSILIPVPTIERQQSISRALNAGREEIALIEAEI 270 Query: 189 IRFIELLKEKKQALVS 204 + Q L++ Sbjct: 271 EALTRQKRGLMQKLLT 286 >gi|258592718|emb|CBE69027.1| putative Restriction endonuclease S subunits [NC10 bacterium 'Dutch sediment'] Length = 390 Score = 100 bits (248), Expect = 5e-19, Method: Composition-based stats. Identities = 54/368 (14%), Positives = 112/368 (30%), Gaps = 20/368 (5%) Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQP 116 G L + + T + G+ L ++ + I D I S + + Q Sbjct: 33 QGIVLRDIVSGSEIKTKKQQVCRAGEFLVAEIDAKVGGFGIVPDDLDGAIVSNHYFLFQI 92 Query: 117 KDV-LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 L + + + A + + + +PPL EQ + +I Sbjct: 93 DHTVLDCRFLDFFIRTPTFRDQVAAQGSTNYAAIRPNDVLGYKISLPPLEEQWRLVARIE 152 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 + I + E+ AL+S + S Sbjct: 153 ELAAK----IEQARDLRREAVEEAGALLSAA-------SRNLFVSDGLKAPRGRLEHFAT 201 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIVDPGEIV 292 + + T + S L+ PE + + PG+++ Sbjct: 202 RITKGESPEWQGFTYQELGPVFVRSENVGWGTLDLSRRTCIPEEFHHKLKRSQLQPGDVL 261 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MG 350 + + + A + E + + A ++ +DS +L + S ++ Sbjct: 262 INLVGASIGRSCVVPADLGEANVNQAVAVISPDSRQLDSNFLMHFLISAPAQTTIHSGKV 321 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 R ++ D++ L + VPP+ EQ I ++ A++D L + L S Sbjct: 322 ETARPNISLGDLRNLILPVPPLFEQQRIVAYLDNLWAKVDALKRLQAATNPELGALLPSV 381 Query: 411 IAAAVTGQ 418 + A G+ Sbjct: 382 LDKAFKGE 389 >gi|317501109|ref|ZP_07959315.1| ribosomal protein L10 [Lachnospiraceae bacterium 8_1_57FAA] gi|316897496|gb|EFV19561.1| ribosomal protein L10 [Lachnospiraceae bacterium 8_1_57FAA] Length = 380 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 73/397 (18%), Positives = 128/397 (32%), Gaps = 31/397 (7%) Query: 29 PIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + G+ S + YI E++ G Q T F K +L Sbjct: 6 KLSDICEYAKGKIKVSALDENTYISTENMLPNKGGITKAASLPTQEQTQA---FMKNDVL 62 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGATM 146 + PY +K A FDG CS LV + KD + ++L+ D A +G M Sbjct: 63 VSNIRPYFKKIWYATFDGGCSNDVLVFRAKDGVSSRFLHYVLADDTFFDYSMATSKGTKM 122 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 D K I +P +Q I + ID I + L E+ Q++ + Sbjct: 123 PRGDKKAIMEYEVPELLYEDQCKIAGVLEV----IDEKIDLNTDINKNLLEQAQSIFTQE 178 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 ++S + + + ++ + K E + L + Q Sbjct: 179 FLMFDRIPDGWQESSLLGIADYLNGLAMQKY----------RPKDDEQGLPVLKIKELRQ 228 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 N L S + IV G+++F + L + V Sbjct: 229 GSCDFNSELCSPSIKPEYIVHDGDVIFSWSGSL-----LVDLWCGGTCGLNQHLFKVTSS 283 Query: 327 GIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 D + + + L K A + +K E++ + VL+P + I Sbjct: 284 TYD-KWFYYAWTDHHLQKFAAIAADMATTMGHIKREELSKAEVLIPSQSDYDRIG----G 338 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 A + LV L R + ++GQ+D+ Sbjct: 339 LLAPLYDLVIANRIENRKLASLRDELLPQLMSGQLDV 375 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 51/190 (26%), Gaps = 10/190 (5%) Query: 21 IPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP W+ + G R + + + + ++++ G+ + + Sbjct: 185 IPDGWQESSLLGIADYLNGLAMQKYRPKDDEQGLPVLKIKELRQGSCDF---NSELCSPS 241 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 I G +++ G L G + + W Sbjct: 242 IKPEYIVHDGDVIFSWSGSLLVDLWCGGTCG-LNQHLFKVTSSTYDKWFYYAWTDHHLQK 300 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 A TM H + + + IP ++ I + + E + L Sbjct: 301 FAAIAADMATTMGHIKREELSKAEVLIPSQSDYDRIGGLLAPLYDLVIANRIENRKLASL 360 Query: 195 LKEKKQALVS 204 E L+S Sbjct: 361 RDELLPQLMS 370 >gi|10956198|ref|NP_051027.1| type IC specificity subunit [Streptococcus thermophilus] gi|6137149|gb|AAF04358.1| type IC specificity subunit [Streptococcus thermophilus] Length = 413 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 62/408 (15%), Positives = 148/408 (36%), Gaps = 32/408 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDT-STV 78 W+ + + +L+ + + + D+ + + + + N+ SD+ Sbjct: 17 DWEERKLGKLARLSLELDFQMLNKAVCKGPFYKVSDMNNPGNEVVMMNANNYASDSQIKE 76 Query: 79 SIFAKGQ-----ILYGKLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLS 130 + + +++ K+G AI D I T FL + + + + Sbjct: 77 NKWNPIDPQNSGVVFAKVGA----AIFLDRKRIVDTSFLSDNNMMSYLFDSSWNRYFGKT 132 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + R+ + + + + NI + +P + EQ I ++D +I R Sbjct: 133 LFEKLRLSRFAQVGAIPSFNGSDVENIKVMVPEIEEQQKIGSF----FKQLDEIIALHQR 188 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++LL+E+K+ + + K +++ +G + + + Sbjct: 189 KLDLLEEQKKGFLQKMFPKNGAKVPELRFAGFADDWE--ERKLGEVGNTFTGLSGKTKED 246 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLRSAQ 309 ++Y N+ GL+ + Q V G+++F ++ + S Sbjct: 247 FGHGEGKFITYMNVFSNPVADLDGLESVEIDNKQFQVKAGDVLFTTSSETPEEVGMSSMW 306 Query: 310 VM--ERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL 365 + + + S +P D YLA+++RS + K F + G+ R ++ V Sbjct: 307 LGNADNIYLNSFCFGYRPTIEFDKYYLAFMLRSAPIRKKFQLLAQGISRYNISKNKVMEN 366 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I+EQ ++ ++ + ++ + LLKE++ F+ Sbjct: 367 VYSCPSIEEQ----ELLGAFFNNLNQTIALHQRKLDLLKEQKKGFLQK 410 >gi|164688032|ref|ZP_02212060.1| hypothetical protein CLOBAR_01677 [Clostridium bartlettii DSM 16795] gi|164602445|gb|EDQ95910.1| hypothetical protein CLOBAR_01677 [Clostridium bartlettii DSM 16795] Length = 393 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 56/403 (13%), Positives = 115/403 (28%), Gaps = 38/403 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQ--SDTSTV 78 W + + G E GK I + DV T Y Q Sbjct: 13 EWDEKRLGDVYEFKNGLNKEKEFFGKGIPIVNYMDVNKNTHLYKNTIKGRVQLTKKEIEN 72 Query: 79 SIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQ--GWLLS 130 KG + + + + + D + S L +PK+ L + ++ Sbjct: 73 YSAKKGDLFFTRTSETIDEIGYTAVLLDDIEDAVFSGFILRARPKNELIDFKFSGYCFMT 132 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +V + I T + + + +P L EQ I + +I + Sbjct: 133 REVRKEIIKKSSMTTRALTSGTSLKQVVFYLPSLPEQTKIAHFLCTVDDKIQNQEDKITH 192 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + K Q + S + + EW +++ N + Sbjct: 193 LENIKKGFMQKIFSRKIRFKDDSGEDF----PEWEEKKIKDVFKITRGYVLSANNVEKNI 248 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 E S + L E T+ F + ++ + Sbjct: 249 NKEYIYPVYSSQTKDKGLLGYYNEYLYEDAITWTTDGANAGTVHFRRGKFYCTNVCGVLI 308 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 E G +A + I Y++++ L + + + +P Sbjct: 309 SENGYANK-CIAEMINRISKKYVSYV----------------GNPKLMNNIMAEIKIDLP 351 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +KEQ I++++++ +ID E + LK+ + + Sbjct: 352 CLKEQQKISDILSLLDEKIDTDKET----LEHLKQLKKGLLQQ 390 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 71/218 (32%), Gaps = 10/218 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ K+ EW + I + + ++ + Sbjct: 3 PKLRFKEFCGEWDEKRLGDVYEFKNGLNKEKEFFGKGIPIVNYMDVNKNTHLYKNTIKGR 62 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGIITSAYMAVKPHG--I 328 + L + E Y G++ F D+ + +E + + + +P I Sbjct: 63 VQLTKKEIENYSA-KKGDLFFTRTSETIDEIGYTAVLLDDIEDAVFSGFILRARPKNELI 121 Query: 329 DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + + ++ K S R +K++ +P + EQ I + + Sbjct: 122 DFKFSGYCFMTREVRKEIIKKSSMTTRALTSGTSLKQVVFYLPSLPEQTKIAHFLCTV-- 179 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 D ++ E I L+ + F+ + +I + +S Sbjct: 180 --DDKIQNQEDKITHLENIKKGFMQKIFSRKIRFKDDS 215 >gi|308272573|emb|CBX29177.1| hypothetical protein N47_J01580 [uncultured Desulfobacterium sp.] Length = 435 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 70/407 (17%), Positives = 121/407 (29%), Gaps = 33/407 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P W +V + I + + G + + + Sbjct: 11 LPGGWAIVSFAESCDKIS------LNGIKIKQKQYLTEGKYPVVDQGQALIGGYFDDEKL 64 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G+ Y G + R +F I VL+P L E L + I+ Sbjct: 65 IVPGKPPYVIFGDHTRVKKYINFRFIAGADGVKVLKPFAFLNEKLFFY-----FLHCIKL 119 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 +G + + +P+PPL+EQ I KI +D I + LK + Sbjct: 120 PDKGYARH---LQFLEKTDIPLPPLSEQHRIVAKIEELFSSLDKGIESLKTAQQQLKIYR 176 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 QA++ + L+ ++ G +PD W+ K L N S Sbjct: 177 QAVLKWAFEGKLSNKNIVE-------GELPDGWQNKKINELGRVETGTTPSKKNPNFYSD 229 Query: 260 SYG-------NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 Y N + + GL + + V + I K Sbjct: 230 EYPFYKPTDLNAGNNVVSSTDGLSELGIKEARFVPASSTLVTCIGATIGKTGFIKKGGGF 289 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPP 371 I + + ++ + S D K S L + L ++ Sbjct: 290 NQQINAII---PSKEHNPKFIYYQAVSPDFQKQIQNNASATTLPILNKGKFENLTMVCCL 346 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +EQ I I + D + E IE S+ + R S + A G+ Sbjct: 347 PEEQQTIVAEIESRLSVCDKIEESIEHSLKQAEALRQSILKKAFEGK 393 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 66/205 (32%), Gaps = 8/205 (3%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G +P W+ I ++ TG T + + D+ +G DG + Sbjct: 196 GELPDGWQNKKINELGRVETGTTPSKKNPNFYSDEYPFYKPTDLNAGNNVVSSTDG-LSE 254 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSI 131 L +G + K G + Q ++ K+ P+ + +S Sbjct: 255 LGIKEARFVPASSTLVTCIGATIGKTGFIKKGGGFNQQINAIIPSKEHNPKFIYYQAVSP 314 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 D ++I+ T+ + N+ M EQ I +I + D + Sbjct: 315 DFQKQIQNNASATTLPILNKGKFENLTMVCCLPEEQQTIVAEIESRLSVCDKIEESIEHS 374 Query: 192 IELLKEKKQALVSYIVTKGLNPDVK 216 ++ + +Q+++ L P Sbjct: 375 LKQAEALRQSILKKAFEGKLVPQDP 399 >gi|298256068|ref|ZP_06979654.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae str. Canada MDR_19A] Length = 427 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 70/426 (16%), Positives = 142/426 (33%), Gaps = 66/426 (15%) Query: 34 TKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ G + KD I +I + D E G ++S + KG Sbjct: 2 VEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGT 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAICEGA 144 L + R I+ I + ++ L + ++LS +V + ++ GA Sbjct: 62 FLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGA 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----Q 200 + + + + +I +P+PPLAEQ I E I + ++D R +L KE + Sbjct: 122 VVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKK 181 Query: 201 ALVSYIVTKGLNPDVKMKDS---------------------------------------G 221 +++ Y + L +S Sbjct: 182 SILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYY 241 Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL------ 275 E +P+ WE + + + R + + + + ++ L Sbjct: 242 EEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDP 301 Query: 276 -KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGID 329 SY+ +++ G++++ L R + A + V I+ Sbjct: 302 ETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVIN 361 Query: 330 STYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 ++ + S + V SG ++ L + +K + +PP+ EQ I + I A Sbjct: 362 CHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFA 421 Query: 388 RIDVLV 393 ID L+ Sbjct: 422 HIDALI 427 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 37 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 93 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 94 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIES 153 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 154 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 191 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 247 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 306 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 307 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIY 366 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 367 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 426 Query: 185 I 185 I Sbjct: 427 I 427 >gi|259508262|ref|ZP_05751162.1| conserved hypothetical protein [Corynebacterium efficiens YS-314] gi|259164150|gb|EEW48704.1| conserved hypothetical protein [Corynebacterium efficiens YS-314] Length = 329 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 54/320 (16%), Positives = 113/320 (35%), Gaps = 17/320 (5%) Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 + L P + ++A G+T + + ++P+P+ L Sbjct: 4 AFNQGCKALIPLPGVSRPRFLKYAVESQMSTLQAAGRGSTFTEVSASDVASLPIPVTSLD 63 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 +Q I + + ET ID + E + ++L+ E+ A V P + ++ Sbjct: 64 KQDWIADYLDRETAEIDAMAVELDQAMDLIDERFHAEVEQSFQSLDAPRMPLRS------ 117 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-YQ 284 + E +L+ S + ET + P Y Sbjct: 118 ------QIQSMTTGTSVTAAKFAPAAGEPGVLATSAVFGDELNETAVKSVDPHEYVRLTC 171 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 + ++ ++ N + + + Y+ W RS + Sbjct: 172 PLRINTLLVSRMNTMNLVGKAVTVGRHLPDVYLPDRL-WAVEVDVPRYIYWWTRSQSYRE 230 Query: 345 VFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + G ++L + + + + VPP+ +Q + ++ R L +++++ Sbjct: 231 QIRGLAVGASDSMKTLSQQAFRSITLPVPPVTQQIAVAAQLDEAAERFSALKAELQEAKG 290 Query: 402 LLKERRSSFIAAAVTGQIDL 421 LL+ERR+ I+AAVTGQID+ Sbjct: 291 LLEERRAVLISAAVTGQIDV 310 >gi|313141382|ref|ZP_07803575.1| restriction modification system DNA specificity domain-containing protein [Helicobacter canadensis MIT 98-5491] gi|313130413|gb|EFR48030.1| restriction modification system DNA specificity domain-containing protein [Helicobacter canadensis MIT 98-5491] Length = 417 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 46/401 (11%), Positives = 119/401 (29%), Gaps = 24/401 (5%) Query: 25 WKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + + K+ +S K + + +Y ++ ++ + + + K Sbjct: 23 WGITQLNMLAGKITERNKDDSIKRVFTNSATEGVIDQEEYFDRNIANKNN-LTDYFVVEK 81 Query: 84 GQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G +Y + G+ S + + + K+ + + + L+ I+ Sbjct: 82 GDYVYNPRISTTALVGPISKNKLGIGVMSPLYTIFRFKNKGNDFYEHFFLTNLWHAYIKN 141 Query: 140 ICEGATMSH---ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + +P+P EQ I + + ID LI R ++ L+ Sbjct: 142 LSNTGARHDRITISVDNFMKMPLPYASPEEQQKIADCL----SSIDELIDTESRKLKALE 197 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + K+ L+ + + + + G + E+ T Sbjct: 198 KYKKGLMQKLFPTEGKTLPEWRFPEFQGCGE-----WKYEEIGNIGEVITGKTPSTSDAA 252 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS-AQVMERGI 315 L + + + + T +++ ++ + S+ A + I Sbjct: 253 LWDGDIQFVTPTDITENKYQHHTQRTVVKTPKMKVLPKYTIMYTCIASIGKMALSLYPCI 312 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKE 374 ++ P + + + + + D ++ V V KE Sbjct: 313 TNQQINSIVPKSFYNNEFIYYSLLQKTFLIKAGFANSTLPIINKTDFSKIQVPVILDKKE 372 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q I + + ID ++ + + I L+ + + Sbjct: 373 QEKIAGCL----SEIDTMITEQLKKIERLETHKKGLMQGLF 409 >gi|114567765|ref|YP_754919.1| restriction endonuclease S subunits-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338700|gb|ABI69548.1| Restriction endonuclease S subunits-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 413 Score = 100 bits (248), Expect = 6e-19, Method: Composition-based stats. Identities = 54/423 (12%), Positives = 139/423 (32%), Gaps = 42/423 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK + +++ + G+ +Y+ E + ++ S Sbjct: 3 PKECEKTQLRKIVTIEKGKPPAKQPFFEQNAELYLTPEYLRG-------RNLAEPVLPGS 55 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G + G + A + ST + + + Sbjct: 56 NAVRVKDGDTILLWDGSNAGEFFRAREGVLASTMVRIWHDDTYDN--QYFYYAVKNWELF 113 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++ G+ + H D + +GNI + EQ I + + +D I + I + Sbjct: 114 LKGQTSGSGIPHVDKEILGNIEILKYSKPEQTKIAKIL----STVDEAIEQIEALINKQQ 169 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEW-----VGLVPDHWEVKPFFALV----TELNRK 247 K L+ ++T+G++ ++ +G +P W+V P L+ + + + Sbjct: 170 RIKTGLMQELLTRGIDEYGNIRSEQTHKFKDSPLGRIPVEWDVIPLGDLIEAIDPQPDHR 229 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI------VDPGEIVFRFIDLQND 301 + + I + N + S + ++ V G+ +F I Sbjct: 230 TPQEVSGGIPYIGVSNFNNDGSIDFTNARKVSIKAFKKQQDSFSVSEGDFIFGKIGT--- 286 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFE 360 + S SA + + + W + S + K+ + S + + + Sbjct: 287 -IGMPSRLPTSTQYALSANVILLKPRETPAFFYWWISSPIVSKMVELEIHSTSQAAFGIK 345 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ L + P E+ I V++ + +++ ++ + L +++ + ++G+ Sbjct: 346 KMRTLNLPRPNKDEREKIGKVLDTQ----ELVKLNTKRDLYKLHSLKTALMQDLLSGKKR 401 Query: 421 LRG 423 + Sbjct: 402 VTP 404 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 76/204 (37%), Gaps = 11/204 (5%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNT----GRTSES-GKDIIYIGLEDVES-GTGK 62 ++KDS +G IP W V+P+ + RT + I YIG+ + + G+ Sbjct: 197 KFKDSP---LGRIPVEWDVIPLGDLIEAIDPQPDHRTPQEVSGGIPYIGVSNFNNDGSID 253 Query: 63 YLPKDGNSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + S + ++G ++GK+G + + + ++L Sbjct: 254 FTNARKVSIKAFKKQQDSFSVSEGDFIFGKIGTIGMPSRLPTSTQYALSANVILLKPRET 313 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 P W+ S V++ +E + + K + + +P P E+ I + + + + Sbjct: 314 PAFFYWWISSPIVSKMVELEIHSTSQAAFGIKKMRTLNLPRPNKDEREKIGKVLDTQELV 373 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 + + L Q L+S Sbjct: 374 KLNTKRDLYKLHSLKTALMQDLLS 397 >gi|224417840|ref|ZP_03655846.1| restriction modification system DNA specificity domain [Helicobacter canadensis MIT 98-5491] gi|253827180|ref|ZP_04870065.1| methylase-S type I restriction modification domain containing protein [Helicobacter canadensis MIT 98-5491] gi|253510586|gb|EES89245.1| methylase-S type I restriction modification domain containing protein [Helicobacter canadensis MIT 98-5491] Length = 415 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 46/401 (11%), Positives = 119/401 (29%), Gaps = 24/401 (5%) Query: 25 WKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + + K+ +S K + + +Y ++ ++ + + + K Sbjct: 21 WGITQLNMLAGKITERNKDDSIKRVFTNSATEGVIDQEEYFDRNIANKNN-LTDYFVVEK 79 Query: 84 GQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G +Y + G+ S + + + K+ + + + L+ I+ Sbjct: 80 GDYVYNPRISTTALVGPISKNKLGIGVMSPLYTIFRFKNKGNDFYEHFFLTNLWHAYIKN 139 Query: 140 ICEGATMSH---ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + +P+P EQ I + + ID LI R ++ L+ Sbjct: 140 LSNTGARHDRITISVDNFMKMPLPYASPEEQQKIADCL----SSIDELIDTESRKLKALE 195 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + K+ L+ + + + + G + E+ T Sbjct: 196 KYKKGLMQKLFPTEGKTLPEWRFPEFQGCGE-----WKYEEIGNIGEVITGKTPSTSDAA 250 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS-AQVMERGI 315 L + + + + T +++ ++ + S+ A + I Sbjct: 251 LWDGDIQFVTPTDITENKYQHHTQRTVVKTPKMKVLPKYTIMYTCIASIGKMALSLYPCI 310 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKE 374 ++ P + + + + + D ++ V V KE Sbjct: 311 TNQQINSIVPKSFYNNEFIYYSLLQKTFLIKAGFANSTLPIINKTDFSKIQVPVILDKKE 370 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q I + + ID ++ + + I L+ + + Sbjct: 371 QEKIAGCL----SEIDTMITEQLKKIERLETHKKGLMQGLF 407 >gi|189467554|ref|ZP_03016339.1| hypothetical protein BACINT_03944 [Bacteroides intestinalis DSM 17393] gi|189435818|gb|EDV04803.1| hypothetical protein BACINT_03944 [Bacteroides intestinalis DSM 17393] Length = 376 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 52/383 (13%), Positives = 112/383 (29%), Gaps = 25/383 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + L +T I ++S GKYL N + + Sbjct: 8 WQKIFLGEVCNLYQPKT---------IATSCLDS-NGKYLVYGANGVIGKYNEYNHKFP- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++L G I+ + +V+ PK+ L +L + GA Sbjct: 57 EVLITCRGATCGTINISKPFSWINGNAMVVHPKE-ENLLDFAFLGKAVSAIDYSKVITGA 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + IP L EQ I ++ + +I I L Q++ Sbjct: 116 AQPQITRANLQKVQIVIPTLVEQQTIASELD----AVQEMIDGYKTQITDLDALAQSI-- 169 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + +P K I +G + + K + S+ +L Sbjct: 170 -FLDMFGDPVTNPKGWEIMKIGEISEVTSSKRI-YQSEQTKSGIPFYKISDFPNLIEYGY 227 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + E + + +++ ++ ++ Sbjct: 228 SDTGIFISQAKYEELKSKKLVPNESDLLITSRGTLGLCYIVKDEDCFYFQDGMITWLKNL 287 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + S +L ++ +S +G L +++ ++VPPIK Q + + Sbjct: 288 KSTVLSAFLGFMFQSSLFKNQIEKAQNGSTIAYLSIAMIRKFDMIVPPIKLQQHFVSQVE 347 Query: 384 VETARIDVLVEKIEQSIVLLKER 406 I+ E I + + Sbjct: 348 A----IEKQKELIRDQLAETETL 366 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 70/196 (35%), Gaps = 13/196 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTS----ESGKDIIYIGLED----VESGTGKYLPKDGNSRQS 73 PK W+++ I +++ + + ++ I + + D +E G ++ Sbjct: 181 PKGWEIMKIGEISEVTSSKRIYQSEQTKSGIPFYKISDFPNLIEYGYSDTGIFISQAKYE 240 Query: 74 DTSTVSIFAKG-QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD----VLPELLQGWL 128 + + + +L G I+ D D ++ K+ VL L Sbjct: 241 ELKSKKLVPNESDLLITSRGTLGLCYIVKDEDCFYFQDGMITWLKNLKSTVLSAFLGFMF 300 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S +IE G+T+++ I M +PP+ Q ++ A + + + + Sbjct: 301 QSSLFKNQIEKAQNGSTIAYLSIAMIRKFDMIVPPIKLQQHFVSQVEAIEKQKELIRDQL 360 Query: 189 IRFIELLKEKKQALVS 204 L+ E+ Q S Sbjct: 361 AETETLMAERMQYYFS 376 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 47/166 (28%), Gaps = 15/166 (9%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK-----------RS 304 + G + + + + +V V + N K Sbjct: 8 WQKIFLGEVCNLYQPKTIATSCLDSNGKYLVYGANGVIGKYNEYNHKFPEVLITCRGATC 67 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 I M V P + A+L ++ + + + ++++ Sbjct: 68 GTINISKPFSWINGNAMVVHPKEENLLDFAFLGKAVSAIDYSKVITGAAQPQITRANLQK 127 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + +++P + EQ I + ++ ID + I L S Sbjct: 128 VQIVIPTLVEQQTIASELDAVQEMID----GYKTQITDLDALAQSI 169 >gi|229508129|ref|ZP_04397634.1| type I restriction-modification system specificity subunit [Vibrio cholerae BX 330286] gi|229511632|ref|ZP_04401111.1| type I restriction-modification system specificity subunit [Vibrio cholerae B33] gi|229518771|ref|ZP_04408214.1| type I restriction-modification system specificity subunit [Vibrio cholerae RC9] gi|229607690|ref|YP_002878338.1| type I restriction-modification system specificity subunit [Vibrio cholerae MJ-1236] gi|229343460|gb|EEO08435.1| type I restriction-modification system specificity subunit [Vibrio cholerae RC9] gi|229351597|gb|EEO16538.1| type I restriction-modification system specificity subunit [Vibrio cholerae B33] gi|229355634|gb|EEO20555.1| type I restriction-modification system specificity subunit [Vibrio cholerae BX 330286] gi|229370345|gb|ACQ60768.1| type I restriction-modification system specificity subunit [Vibrio cholerae MJ-1236] Length = 167 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 66/169 (39%), Gaps = 10/169 (5%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + E E Y ++ G++V + + GI++ +Y Sbjct: 1 MTGVTPRSEKNVTMFMAEDYTGSKLCHSGDLVINIMWAWMGALGVS----DRTGIVSPSY 56 Query: 321 MAVKPHG---IDSTYLAWLMRSYDLCKVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKE 374 + YL L++S + + + +G R + + + PP +E Sbjct: 57 GVFREQREGTFVPKYLEMLLKSTKYVEYYNKVSTGLHSSRLRFYGHMLFDMALGFPPYEE 116 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 Q I I+ E +++D + + + LKE +++ I +AVTG+I + Sbjct: 117 QTQIVEYISRECSKVDEAITVQAEQVSKLKEYKTTLINSAVTGKIKVTE 165 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 58/145 (40%), Gaps = 5/145 (3%) Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD---VLPELLQGWL 128 D + + G ++ + ++ ++D GI S + V + + +P+ L+ L Sbjct: 17 AEDYTGSKLCHSGDLVINIMWAWMGALGVSDRTGIVSPSYGVFREQREGTFVPKYLEMLL 76 Query: 129 LSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S + + G + ++ + PP EQ I E I E ++D IT Sbjct: 77 KSTKYVEYYNKVSTGLHSSRLRFYGHMLFDMALGFPPYEEQTQIVEYISRECSKVDEAIT 136 Query: 187 ERIRFIELLKEKKQALVSYIVTKGL 211 + + LKE K L++ VT + Sbjct: 137 VQAEQVSKLKEYKTTLINSAVTGKI 161 >gi|51594887|ref|YP_069078.1| type I restriction enzyme, S subunit [Yersinia pseudotuberculosis IP 32953] gi|51588169|emb|CAH19776.1| putative type I restriction enzyme, S subunit [Yersinia pseudotuberculosis IP 32953] Length = 427 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 47/409 (11%), Positives = 114/409 (27%), Gaps = 31/409 (7%) Query: 25 WKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS- 79 W + + + + + + + DV S + +S Sbjct: 19 WVENNLGELIDIRSAARVHKEQWTEAGVPFFRTSDVVSIYKGQENTKSYISPEVYNGLSE 78 Query: 80 ---IFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 K +L G ++ + + + K L + S Sbjct: 79 KIGKVTKDDLLITGGGSIGIPYLVPNDDPLYFKDADLLWLKNNKKFNGYFLYTFFFSAPF 138 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + I++I T++H + P+ EQ I I+ + + Sbjct: 139 KKHIKSISHTGTIAHYTIEQAKATPINTCYDEEQTQIGNYFQKLDSLINQHQQKHDKLSN 198 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL--------VTELN 245 + K + + P+++ K +W +P + Sbjct: 199 IKKAMLEKMFPK--PGKTIPEIRFKGFSGKW-EEMPFGTCFVNVSNNTLSRADLNYEDGM 255 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 KN + I + +L + + + G+I+ + Sbjct: 256 AKNIHYGDVLIKFGEVLDATNELLPFITNNDVTNKLKHAALRDGDIIIADAAEDSMVGKC 315 Query: 306 RSAQVMERGIITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFED 361 + ++ S + S YL + + S ++ G + S+ Sbjct: 316 TELFNIGEQLVLSGLHTIAVRPTLNFASKYLGYYLNSSSYHDQLLSLMQGTKVLSISKTA 375 Query: 362 VKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 ++ ++ P +EQ +I ++D L+ + +Q I L + + Sbjct: 376 IQNTNIVFPKSAEEQVEIGKY----FQKLDALINQHQQQITKLNNIKQA 420 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 48/174 (27%), Gaps = 13/174 (7%) Query: 248 NTKLIESNILSLSYGN---IIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQN 300 + E+ + + I + E + PE Y E V +++ Sbjct: 38 KEQWTEAGVPFFRTSDVVSIYKGQENTKSYISPEVYNGLSEKIGKVTKDDLLITGGGSIG 97 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKF 359 L + +L S K ++ +G Sbjct: 98 -IPYLVPNDDPLYFKDADLLWLKNNKKFNGYFLYTFFFSAPFKKHIKSISHTGTIAHYTI 156 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 E K P+ +EQ I N ++D L+ + +Q L + + + Sbjct: 157 EQAKATPINTCYDEEQTQIGNY----FQKLDSLINQHQQKHDKLSNIKKAMLEK 206 >gi|77415002|ref|ZP_00791084.1| Type I restriction modification DNA specificity domain protein [Streptococcus agalactiae 515] gi|77158946|gb|EAO70175.1| Type I restriction modification DNA specificity domain protein [Streptococcus agalactiae 515] Length = 497 Score = 99.9 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 72/451 (15%), Positives = 143/451 (31%), Gaps = 63/451 (13%) Query: 5 KAYPQYKD---SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIY---IGLEDVES 58 K Y + D V+ IP W+ V ++ + L+ K Y + +ED+E Sbjct: 47 KPYEKLSDGTIKEVEVPYDIPASWEWVRLRNISSLSFFPNISGDKIPNYSWVLDMEDIEK 106 Query: 59 GTGKYLPKDGNSRQSDT-STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 TG+ + K+ + +S S F+K +LY KL P L+K II+D DG +T+ + ++ Sbjct: 107 ETGRLVRKNYKTEKSSYKSNKVYFSKDTVLYAKLRPNLKKVIISDEDGFATTELIPIKIF 166 Query: 118 DVLPELLQGWLLS-IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + + I G M + + + +P+PPL+EQ I E+I Sbjct: 167 GGISAEYMRYCMISPSYYFNIIKSVYGVKMPRVNATFLNSTLLPLPPLSEQKRIVEQIER 226 Query: 177 ETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGLNPDVKM--------------- 217 ++D + EL K ++++ Y + L P Sbjct: 227 ALEKVDAYSESYNKLQELDKSFPDKLKKSILQYAMQGKLVPQDPNDEPVEVLLEKIQAEK 286 Query: 218 --------------------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE---- 253 K G +P +W + + + K + Sbjct: 287 QKLYEEGKLKKKDLAEIVVTKGDDNSPYGKIPKNWSFLTIKDIFSITTGLSYKKTDLAIT 346 Query: 254 ---SNILSLSYGNIIQKLETRNMGLKPESY--ETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 I+ N + N + + +++ Sbjct: 347 KNGVRIIRGGNINPLSFKILDNDYYIDPKFITSETVYLKRNQLLTPVSTSLEHIGKFARI 406 Query: 309 QVMERGIITSAYMA----VKPHGIDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFED 361 ++ + S YL + + S + + ++ Sbjct: 407 DKDYPNTAAGGFVFQLTPFVSSDVLSKYLLFSLSSPIFYEQLKSITKLSGQALYNIPKTK 466 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + L V + P EQ I+ + +++ L Sbjct: 467 LNELLVPLAPETEQKRISQRVEQLFEKVNQL 497 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 66/208 (31%), Gaps = 12/208 (5%) Query: 221 GIEWVGLVPDHWEVKPFFA-----LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 +E +P WE ++ + +L +N Sbjct: 59 EVEVPYDIPASWEWVRLRNISSLSFFPNISGDKIPNYSWVLDMEDIEKETGRLVRKNYKT 118 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + SY++ ++ + V N K+ + S + + T GI + Y+ + Sbjct: 119 EKSSYKSNKVYFSKDTVLYAKLRPNLKKVIISDE--DGFATTELIPIKIFGGISAEYMRY 176 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 M S G+ + + + +PP+ EQ I I ++D E Sbjct: 177 CMISPSYYFNIIKSVYGVKMPRVNATFLNSTLLPLPPLSEQKRIVEQIERALEKVDAYSE 236 Query: 395 KIEQSIVLLK----ERRSSFIAAAVTGQ 418 + L K + + S + A+ G+ Sbjct: 237 SYNKLQELDKSFPDKLKKSILQYAMQGK 264 >gi|283770884|ref|ZP_06343776.1| type I restriction enzyme S subunit [Staphylococcus aureus subsp. aureus H19] gi|283461031|gb|EFC08121.1| type I restriction enzyme S subunit [Staphylococcus aureus subsp. aureus H19] Length = 392 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 58/397 (14%), Positives = 118/397 (29%), Gaps = 30/397 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ ++ K+N+G+ + ++ G G + Sbjct: 20 EWEEKKLESIIKVNSGKDYK-----------HLDKGDIPVYGTGGYMTSVSEP---LSEI 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + ++ + T F K+ + + E Sbjct: 66 DAVGIGRKGTINKPYLLEEPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I I +P EQ I + +I+ + + K Q + Sbjct: 122 TGVPSLSKQTINKINRFVPTNKEQQKIGKFFSKLDRQIELEEQKLELLQQQKKGYMQKIF 181 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + + + V K +ES N Sbjct: 182 SQELRFKDENGNDYPEWENVMLQKVLKDKTEG-IKRGPFGGALKKDIFVESGYAVYEQRN 240 Query: 264 IIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I + + Y+ V P +I+ + Q +GII A + Sbjct: 241 AIYDISNFRYYINENKYKEMQSFSVQPNDIIMSCSGTIGRLALIP--QNYTKGIINQALI 298 Query: 322 AVKPHGID-STYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + S + MRS + + GS + + +++K +P +P EQ I Sbjct: 299 RFRTNHKIRSEFFLIFMRSNQMQRKILEANPGSAITNLVPVKELKLIPFPLPVKFEQDKI 358 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + I + I+ +E+ E+ I LK R+ F+ Sbjct: 359 SQFILI----INRRIEQSEKKIESLKNRKQGFLQKLF 391 >gi|68248822|ref|YP_247934.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae 86-028NP] gi|229847392|ref|ZP_04467493.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae 7P49H1] gi|68057021|gb|AAX87274.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae 86-028NP] gi|229809718|gb|EEP45443.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae 7P49H1] Length = 390 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 60/388 (15%), Positives = 123/388 (31%), Gaps = 37/388 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + P+ + + SG Y N+ Q + +G+ Sbjct: 18 EWKPLDEVANIVNNARKP-------VKSSSRVSGNIPY--YGANNIQDYVEGYT--HEGE 66 Query: 86 ILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G A + V+ K+ L L+ A Sbjct: 67 FVLIAEDGSASLENYSIQWAVGKFWANNHVHVVNGKEKLNNRFLYHYLTNMNFIPFLA-- 124 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + + IP+PIPPL+ Q I + + A T L +E I + + ++ Sbjct: 125 -GKERAKLTKAKLQQIPIPIPPLSVQTEIVKILDALTALTSELTSELILRQKQYEYYREK 183 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+S ++ G EW K + T N I L Sbjct: 184 LLSE---------EELGKVGFEW---KTIDEISKKISSGGTPTTSNNGYYDNGTIPWLRT 231 Query: 262 GNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + K E + + + ++ K ++ + + Sbjct: 232 QEVDFKEIWDTNIKITEDALNNSSAKWIPANCVIVAMYGATVGKTAINKIPLTTNQACAN 291 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + Y+ + S + ++GSG + ++ + +K+L V VPPI+EQ+ I Sbjct: 292 --IEINDKLACYRYIFHYLTSKY--EYIKSLGSGSQTNINAQIIKKLKVPVPPIEEQYRI 347 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKER 406 ++++ + + E + +I ++R Sbjct: 348 VSILDKFETLTNSITEGLPLAIEQSQKR 375 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 14/129 (10%), Positives = 46/129 (35%), Gaps = 3/129 (2%) Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ + + V + ++ +++ +L + + + Sbjct: 68 VLIAEDGSASLENYSIQWAVGKFWANNHVHVVNGKEKLNNRFLYHYLTNMNFIPFLAGKE 127 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 R L ++++P+ +PP+ Q +I +++ TA L ++ + R Sbjct: 128 ---RAKLTKAKLQQIPIPIPPLSVQTEIVKILDALTALTSELTSELILRQKQYEYYREKL 184 Query: 411 IAAAVTGQI 419 ++ G++ Sbjct: 185 LSEEELGKV 193 >gi|83943084|ref|ZP_00955544.1| Restriction endonuclease S subunit-like protein [Sulfitobacter sp. EE-36] gi|83846092|gb|EAP83969.1| Restriction endonuclease S subunit-like protein [Sulfitobacter sp. EE-36] Length = 497 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 64/382 (16%), Positives = 130/382 (34%), Gaps = 20/382 (5%) Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFL 112 +++ + ST + A+ +L LR +A+ D + Sbjct: 1 MKADRIGDTKDYVTDLGIENSTTRVVAENSLLIVTRSGILRHSLPVALANKDVAFNQDIK 60 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIR 171 L + + L D ++A + G T+ D+ + + P+ I P EQ I Sbjct: 61 ALTLFSGIDPEYVLYHLKADADDILDACAKAGTTVESLDFNRLKSYPLRIAPSLEQRRIV 120 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD----VKMKDSGIEWVGL 227 EK+ T R D E R EL+ + K + T L D K +G+E + Sbjct: 121 EKLDILTGRTDRAHDELSRIPELVAKYKSCFLRLAFTGQLTSDFRGEHSRKGTGVENIPD 180 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSY-------GNIIQKLETRNMGLKPESY 280 + + + + ++++ + Y + E + +G+ P+ Sbjct: 181 SWAVKPLGEISEIQGGVQVGKKRSSSTDLVEVPYLRVANVQRGWLDLEEIKTIGVTPQEK 240 Query: 281 ETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLM 337 E ++ G+I+ D R + I + ++ +++ Sbjct: 241 E-RLLLRMGDILMNEGGDRDKLGRGWVWNNQIADCIHQNHVFRIRLKDSSLPPEFVSHYA 299 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S+ + LPV VPP E +I N I+ A ++ + + Sbjct: 300 NEMGQQYFVDQGTQTTNLASISKRKLAALPVPVPPSDEAVEIVNRIDAAFAWLERISSEQ 359 Query: 397 EQSIVLLKERRSSFIAAAVTGQ 418 + LL E ++ ++ A G+ Sbjct: 360 AAASKLLPELDAAILSKAFRGE 381 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 81/215 (37%), Gaps = 17/215 (7%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTG-----RTSESGK--DIIYIGLEDVESGTGKY 63 K +GV+ IP W V P+ +++ G + S S ++ Y+ + +V+ G Sbjct: 171 KGTGVE---NIPDSWAVKPLGEISEIQGGVQVGKKRSSSTDLVEVPYLRVANVQRGWLDL 227 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVL 120 + G IL + G R + + C Q V + + Sbjct: 228 EEIKTIGVTPQEKERLLLRMGDILMNEGGDRDKLGRGWVWNNQIADCIHQNHVFRIRLKD 287 Query: 121 ----PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 PE + + + ++ + ++ + + +P+P+PP E V I +I A Sbjct: 288 SSLPPEFVSHYANEMGQQYFVDQGTQTTNLASISKRKLAALPVPVPPSDEAVEIVNRIDA 347 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 ++ + +E+ +LL E A++S L Sbjct: 348 AFAWLERISSEQAAASKLLPELDAAILSKAFRGEL 382 >gi|15836900|ref|NP_297588.1| type I restriction-modification system specificity determinant [Xylella fastidiosa 9a5c] gi|9105118|gb|AAF83108.1|AE003883_3 type I restriction-modification system specificity determinant [Xylella fastidiosa 9a5c] Length = 442 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 61/388 (15%), Positives = 130/388 (33%), Gaps = 35/388 (9%) Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG-----PYLRKAII--ADFDGICSTQ 110 G + + G I+ + G P R A+ D S+ Sbjct: 50 DGLLDFGDVVELEVEDRHFASRQLQPGDIIIERSGGGPKQPVGRAALFVPFDDHTYFSSN 109 Query: 111 FL----VLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLA 165 F + P + +L ++ + E + T + + DW+ I +P PL Sbjct: 110 FTTTIRIRDRSLFDPGYVALYLHALYLDGATETLQRATTGIRNLDWREYLRIEVPAHPLQ 169 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 EQ + + + T + K++ +S I T+GL + + + Sbjct: 170 EQQS----LAHLIIGVRTAYRNEQHLSQTFMALKRSALSSIFTRGLRGEAQKDT----EI 221 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY----- 280 GL+P+ W ++P A + ++ + I+ E + Sbjct: 222 GLLPESWGLEPIAAHFSVVSGGTPSRGNPAYWTGGSIPWIKTTEVAYCQITETEEHITPK 281 Query: 281 ----ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 +++ G ++ + + + A M + + + YL Sbjct: 282 GLQDSAAKLLPKGTLLMAMYGQGVTRGKVAILGIEAACNQACAAMVPINNLVHTRYLYHF 341 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEK 395 + ++ + G +Q+L E V+ L PP EQ +I ++I+ +ID Sbjct: 342 L-TWRYEDIRSLAHGGQQQNLNLEMVRDLLFATPPSHVEQDEIVSIIDAIDRKID----L 396 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + +L++ S + +TG+I + Sbjct: 397 HRRKRHVLEDMFKSLLHKLMTGEISVSD 424 Score = 79.8 bits (195), Expect = 9e-13, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 69/204 (33%), Gaps = 13/204 (6%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKY 63 KD+ IG +P+ W + PI + +G T G I +I +V Sbjct: 217 KDTE---IGLLPESWGLEPIAAHFSVVSGGTPSRGNPAYWTGGSIPWIKTTEVAYCQITE 273 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLP 121 + + S + KG +L G K I + C+ + P + L Sbjct: 274 TEEHITPKGLQDSAAKLLPKGTLLMAMYGQGVTRGKVAILGIEAACNQACAAMVPINNLV 333 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGN-IPMPIPPLAEQVLIREKIIAETVR 180 + + I ++ G + + + + + + P EQ I I A + Sbjct: 334 HTRYLYHFLTWRYEDIRSLAHGGQQQNLNLEMVRDLLFATPPSHVEQDEIVSIIDAIDRK 393 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 ID +R ++ K L++ Sbjct: 394 IDLHRRKRHVLEDMFKSLLHKLMT 417 >gi|164551510|gb|ABY60972.1| Sau1hsdS1 [Staphylococcus aureus] Length = 412 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 58/402 (14%), Positives = 137/402 (34%), Gaps = 24/402 (5%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 20 EWEEKQLGDLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 80 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYIFFGQYLLSR 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++I G + ++K I N+ + P + E+ +KI ++D I + Sbjct: 140 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQ---QKIGKFFSKLDRQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +ELL+++K+ + I ++ L + + EW + KP K+ L Sbjct: 197 LELLQQQKKGYLQKIFSQELRFKDENGNDYPEWRFARFKDFMYKPINIRPAINISKSELL 256 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I ++ + +E I D+ K + Sbjct: 257 TVKLHCKGIEKANINRVLKLGATNYYKRFEGQFIYGKQNFFNGAFDIVPKK-----FDGL 311 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 A+ + +++++ R + + V + +P Sbjct: 312 YSSSDVPAFEINTEKIEPNYFISYISRPSFYKSKEKYSTGTGSKRIHENTVLNFSLHLPC 371 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + EQ I + + ++ +E +E+ I L+K+++ + + Sbjct: 372 LNEQLKIASFVCF----LNRKIELLERKIYLIKKQKQALLQQ 409 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 52/181 (28%), Gaps = 6/181 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LET 270 +++ G E ++ + I L NI + Sbjct: 10 PELRFPGFEGEWEEKQLGDLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDL 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + G+++ + ++ S + + Sbjct: 70 VYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYY 129 Query: 331 TYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETAR 388 + +L+ K+F A G R+ L F+++ L + P I +EQ I + + Sbjct: 130 IFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGKFFSKLDRQ 189 Query: 389 I 389 I Sbjct: 190 I 190 >gi|258424532|ref|ZP_05687409.1| restriction modification system specificity subunit [Staphylococcus aureus A9635] gi|257845127|gb|EEV69164.1| restriction modification system specificity subunit [Staphylococcus aureus A9635] Length = 419 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 54/407 (13%), Positives = 114/407 (28%), Gaps = 31/407 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESG---TGKYLPKDGNSRQSDTST 77 W+ + + G G + +DV + L N + Sbjct: 20 EWEEKKVGELLEFKNGLNKGKEYFGSGSSIVNFKDVFNNRSINTNNLTGKVNVNSKELKN 79 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S KG + + + + + + + S L +PK + + + + Sbjct: 80 YS-VEKGDVFFTRTSEVIGEIGYPSVILNDPENTVFSGFVLRGRPKSGIDLINNNFKRYV 138 Query: 132 DVTQRIEAIC---EGATMSHADWKGIGNIPMPIPPL--AEQVLIREKIIAETVRIDTLIT 186 T T N I P+ EQ I + +I+ Sbjct: 139 FFTNSFRKEMITKSSMTTRALTSGTAINRMKVIYPVSAKEQKKIGDFFSKLDRQIELEEQ 198 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + K Q + S + + + + ++ + Sbjct: 199 KLELLQQQKKGYMQKIFSQELRFKDENGNDYPNWRTIELKNILENIVDNRGKTPDNAPSE 258 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K L + + I + + +I+F + + Sbjct: 259 KYPLLEVNALGYYRPAYIKVSKFVSENTYNN---WFREHLKENDILFSTVGNT----GIV 311 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL--CKVFYAMGSGLRQSLKFEDVKR 364 S + +I + ++ + + + M SY K+ ++ S+K K Sbjct: 312 SLMDNYKAVIAQNIVGLRVNNNNLPSFIYYMLSYKGNQKKIKRIQMGAVQPSVKVSQFKF 371 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + LVP EQ + ID LV K I LL++R+ + + Sbjct: 372 IKYLVPIKDEQEKVA----KLLIEIDKLVNKQLIKIELLQQRKKALL 414 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 67/188 (35%), Gaps = 8/188 (4%) Query: 24 HWKVVPIKRFTKL---NTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +W+ + +K + N G+T ++ + + + + Y+ ++ + Sbjct: 231 NWRTIELKNILENIVDNRGKTPDNAPSEKYPLLEVNALGYYRPAYIKVSKFVSENTYNNW 290 Query: 79 SI--FAKGQILYGKLGPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDVTQ 135 + IL+ +G +++ ++ + + + + + LP + L + Sbjct: 291 FREHLKENDILFSTVGNTGIVSLMDNYKAVIAQNIVGLRVNNNNLPSFIYYMLSYKGNQK 350 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +I+ I GA I +P EQ + + +I ++ + + + Sbjct: 351 KIKRIQMGAVQPSVKVSQFKFIKYLVPIKDEQEKVAKLLIEIDKLVNKQLIKIELLQQRK 410 Query: 196 KEKKQALV 203 K +++ Sbjct: 411 KALLKSMF 418 >gi|257083312|ref|ZP_05577673.1| type I restriction endonuclease S subunit [Enterococcus faecalis Fly1] gi|256991342|gb|EEU78644.1| type I restriction endonuclease S subunit [Enterococcus faecalis Fly1] Length = 398 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 53/407 (13%), Positives = 123/407 (30%), Gaps = 43/407 (10%) Query: 23 KHWKVVPIKRFTKL-----NTGRTSESGKDIIYIGLEDV----------ESGTGKYLPKD 67 + W++ + R + + G ++ + E+ + G +L D Sbjct: 14 EDWELCKLGRIFDVHTDFVSNGSFQSLKNNVRFYNDENYAYMIRLQDASNNWRGPWLYTD 73 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQ 125 + D S + IL G + ++ D S+ ++L+ + + Sbjct: 74 KHG--FDFLKKSTVYENDILMSDRGTIGKFFLVPKLDRPMTLSSNAVLLRSSNCNNNFIY 131 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L +ID+ +I+ I +P EQ I + + +ID Sbjct: 132 YMLNTIDIGNQIKKRTTPGVQPMISKTEFKKIITKLPVREEQKKIGDFL----KKIDETF 187 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 T R + LKE K+A + + K++ + E + + + Sbjct: 188 TLHQRKSDQLKELKKAYLQLMFPTKEERVPKLRFADFEGEWELCKLNGILDIIKGTQKSK 247 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + + Y I N+ + + + Sbjct: 248 SELSTNQNNCTPYPVYNGGINPSGYTNIYNREN---------------AITISEGGNSAG 292 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 V E+ + + D+ +L + + S ++ +++ + L Sbjct: 293 FVNFVQEKFFSGGHNYTIVNNVTDTLFLFFYLCSIQ-EEIMRLRVGTGLPNIQKPTLMNL 351 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + EQ I + ID+L+ + + LK + S++ Sbjct: 352 EIQKTTDNEQKSIGLFL----KNIDILISLTQNKLNQLKSLKKSYLQ 394 >gi|257064462|ref|YP_003144134.1| restriction endonuclease S subunit [Slackia heliotrinireducens DSM 20476] gi|256792115|gb|ACV22785.1| restriction endonuclease S subunit [Slackia heliotrinireducens DSM 20476] Length = 416 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 57/417 (13%), Positives = 115/417 (27%), Gaps = 33/417 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 VV + L +G T DI ++ + +++ + + S Sbjct: 4 NVVSLGDVVDLFSGGTPSKKNHEYWGGDIPWVSAKSMDADSINSGVLYITDKGL-ASGSR 62 Query: 80 IFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQ 135 + KG +L+ G L I + + LQ K + WL++ Sbjct: 63 LAEKGTMLFLTRGSGLFSRIPVIWVESPVAFNQDIKCLQAKKPDDARYIYHWLVAQRPVF 122 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 G + + ++ + P + + I + I + Sbjct: 123 SKMLDVTGIGAGKINTDQLLDMEIYWPDVLTRQRITQIADPLIHAIHSNSCTNDYLA--- 179 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK------NT 249 E +AL S+ P E +G +P + P L + + Sbjct: 180 -ESIRALFSHWFVDFA-PFTGEPYVESE-IGRIPSSIRLVPLKDLTKTITKGTTPTTLGY 236 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYE-----TYQIVDPGEIVFRFIDLQNDKRS 304 + E I + +I+ E I++ +++F Sbjct: 237 RFTEHGINYIKGESILDDHSFDYSKFAHIDDETNIALKRSIIENRDLLFTIAGTLGRFAM 296 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVK 363 + + I L S + ++ +L +K Sbjct: 297 AVPEILPANTNQAVGIIRPDVEKIAPEVLLSYFISGWQNDYYSRRVQQAVQANLSLTTLK 356 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 LPV + I E I I +E L + R + + ++G+ID Sbjct: 357 SLPVPM-LIGE-RRI--EYEDLIVPIVHAIESNNAQNRKLTDLRDTLLPKLMSGEID 409 >gi|91216789|ref|ZP_01253753.1| putative specificity protein s [Psychroflexus torquis ATCC 700755] gi|91184950|gb|EAS71329.1| putative specificity protein s [Psychroflexus torquis ATCC 700755] Length = 422 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 67/423 (15%), Positives = 151/423 (35%), Gaps = 29/423 (6%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGN------SRQS 73 PK+WK+ + T K+ +G T GK+ G + S N +Q+ Sbjct: 2 PKNWKIYKLSEVTTKIGSGATPRGGKEAYKKFGTSLIRSQNVLDFKFSINGLAFIDEKQA 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQF-LVLQPKDVLPELLQGWLL 129 + +L G + + + + + +V L + + L Sbjct: 62 SKLDNVTIEENDVLLNITGDSVARVCSVPKEFLPARVNQHVAIVRANILKLDAIYLKYFL 121 Query: 130 SIDVTQRIEA--ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + + GAT + I + + +PPL EQ I + A +I+ + Sbjct: 122 LENTNKNMLLTLASAGATRNALTKIMIEDFRLDLPPLPEQTQIANILSAIDDKIENNLAI 181 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 ++ + + V G + + DS +GL+P WEVK +V + Sbjct: 182 NKTLEDMAMALYK---HWFVDFGPFQEGEFIDS---ELGLIPKGWEVKRLEEVVQVNSNS 235 Query: 248 NTKLIESNILSLSYGNIIQKL---ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 K E I++ +++ E + + + +I+ G+I++ + R Sbjct: 236 IKKDKEPKIINYIDIASVKEGWVEEIKTIKYEDAPSRAKRIISDGDIIWSTVRPNRKSRF 295 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVK 363 L E I+++ ++ + P I +YL + D + +G ++ + + Sbjct: 296 LALG-FSENTIVSTGFVVMSPILISYSYLYLCSCTKDFVDYLVSRATGSSYPAVTGKVFE 354 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +L+P I + ++ + + + L R + + ++G++ L+ Sbjct: 355 EYEILIPE----KAILDRFSIIVEPMFLHSSSNDIENQTLTNLRDTLLPKLISGEVRLKE 410 Query: 424 ESQ 426 + Sbjct: 411 FRE 413 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 74/205 (36%), Gaps = 9/205 (4%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 ++ DS +G IPK W+V ++ ++N+ + + K+ I D+ S ++ + Sbjct: 207 EFIDSE---LGLIPKGWEVKRLEEVVQVNS-NSIKKDKEPKIINYIDIASVKEGWVEEIK 262 Query: 69 NSRQSD--TSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPEL 123 + D + I + G I++ + P + + + I ST F+V+ P + Sbjct: 263 TIKYEDAPSRAKRIISDGDIIWSTVRPNRKSRFLALGFSENTIVSTGFVVMSPILISYSY 322 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L + D + + G++ K + IP A + + + Sbjct: 323 LYLCSCTKDFVDYLVSRATGSSYPAVTGKVFEEYEILIPEKAILDRFSIIVEPMFLHSSS 382 Query: 184 LITERIRFIELLKEKKQALVSYIVT 208 E L L+S V Sbjct: 383 NDIENQTLTNLRDTLLPKLISGEVR 407 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 59/178 (33%), Gaps = 15/178 (8%) Query: 228 VPDHWEVKPFFALVT------ELNRKNTKLIESNILSLSYGNIIQKLETRN----MGLKP 277 +P +W++ + T + + N++ + N + K Sbjct: 1 MPKNWKIYKLSEVTTKIGSGATPRGGKEAYKKFGTSLIRSQNVLDFKFSINGLAFIDEKQ 60 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAW 335 S ++ +++ + R + + V+ + + D+ YL + Sbjct: 61 ASKLDNVTIEENDVLLNITG-DSVARVCSVPKEFLPARVNQHVAIVRANILKLDAIYLKY 119 Query: 336 LMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + A R +L ++ + +PP+ EQ I N+++ +I+ Sbjct: 120 FLLENTNKNMLLTLASAGATRNALTKIMIEDFRLDLPPLPEQTQIANILSAIDDKIEN 177 >gi|258540281|ref|YP_003174780.1| hypothetical protein LC705_02090 [Lactobacillus rhamnosus Lc 705] gi|257151957|emb|CAR90929.1| Putative protein without homology [Lactobacillus rhamnosus Lc 705] Length = 402 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 53/404 (13%), Positives = 123/404 (30%), Gaps = 35/404 (8%) Query: 25 WKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ K N R S ++ + I V K + ++ + Sbjct: 20 WEKRKFGELYKPNKERNESAEFSSENTLSIATMTVNR-------KGNGAAKTSLLKYKVI 72 Query: 82 AKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDVTQR 136 G I + K + R + DGI S +F L+P + + + ++ + + Sbjct: 73 RIGDIAFEGHTSKKFAFGRFVLNDVADGIMSPRFTCLRPIHRQIIQFWKQYIHYEPILRP 132 Query: 137 I--EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + G M+ + + +P + EQ LI + + I + ++ Sbjct: 133 ILIRSTKLGTMMNELVVPDLLKQNIRVPSINEQKLIGKSLSRVDDLIAATQGKLDNLEKI 192 Query: 195 LKEKKQALVSYI--VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + + L +P K K + H K N + Sbjct: 193 KRALLKHLFDQSMRFRGYSDPWEKRKLIDQLSLLKDGTHGTHKD----------GNFAFL 242 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 S + + + + + +++ + V Sbjct: 243 LSAKNVIQDSIVFDDSDRKISEDDFNDIYANYHIKKNDVLLTIVGTIGRVALFPRLTVPV 302 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPP 371 + A + KP +LA +++ + A + + + D+K++ + +P Sbjct: 303 AFQRSVAILRTKPTLF-PYFLALELQTPTIQSKIKARANMSAQAGIYLGDLKKVVISIPK 361 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +EQ +I +N T L+ + + L+ + + + Sbjct: 362 SEEQIEIAMSLNRLT----NLIAATQSKLSSLETLKKALLQGLF 401 >gi|237742575|ref|ZP_04573056.1| restriction modification system DNA specificity subunit [Fusobacterium sp. 4_1_13] gi|229430223|gb|EEO40435.1| restriction modification system DNA specificity subunit [Fusobacterium sp. 4_1_13] Length = 598 Score = 99.9 bits (247), Expect = 7e-19, Method: Composition-based stats. Identities = 58/397 (14%), Positives = 124/397 (31%), Gaps = 30/397 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + K G+T I D+ +G P ++ + Sbjct: 13 PNGVEYKELGEIVKSQRGKTITKE----LIKDGDIPVISGGQKPAYYHNESN-------- 60 Query: 82 AKGQIL-YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG+++ G Y + D S F + K L + + + +I ++ Sbjct: 61 RKGEVITVAGSGAYAGFVMYWDKPIFVSDAFSIECDKSYLN-IKYIYYFLQNNQMKIHSL 119 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 +G + H +K + +P+PPL Q I + T EL E Sbjct: 120 KKGGGVPHVYFKDMQKFLVPVPPLEVQNEIVRILDNFTALTAE--LTAELTAELTAELTA 177 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L + D +K + + D +E K K T +I +++ Sbjct: 178 ELTARKKQYSWYRDYLLKFENKVKMVKIGDLFEFKNGINKDKGSFGKGTPIIN--YVNVY 235 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGIITS 318 N I + + + V G++ F ++ S + +E + + Sbjct: 236 KKNKIYFEDLKGLVEASNDELVRYGVKRGDVFFTRTSETIEEIGYTSVLLEDIENCVFSG 295 Query: 319 AYMAVKPHGID--STYLAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + +P Y A+ + ++ + R + ++ + +PP++ Q Sbjct: 296 FLLRARPITDLLLPEYCAYCFSTSNIRNTIIKKSTYTTRALTNGTSLSQIEIPLPPLEVQ 355 Query: 376 FDITNVINVETARIDVL-------VEKIEQSIVLLKE 405 I V+ + L +E ++ + Sbjct: 356 KRIVEVLGNFEKICNDLNIGLPAEIEARQKQYEFYRN 392 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 56/404 (13%), Positives = 126/404 (31%), Gaps = 29/404 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTVSI 80 K+V I + G + G K I +V Y K +D Sbjct: 201 KMVKIGDLFEFKNGINKDKGSFGKGTPIINYVNVYKKNKIYFEDLKGLVEASNDELVRYG 260 Query: 81 FAKGQILYGKLGPYLRKAIIAD------FDGICSTQFLVLQPKDVL--PELLQGWLLSID 132 +G + + + + + + + S L +P L PE + + Sbjct: 261 VKRGDVFFTRTSETIEEIGYTSVLLEDIENCVFSGFLLRARPITDLLLPEYCAYCFSTSN 320 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I T + + + I +P+PPL Q I E + + L I Sbjct: 321 IRNTIIKKSTYTTRALTNGTSLSQIEIPLPPLEVQKRIVEVLGNFEKICNDLNIGLPAEI 380 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E +++ + ++++T + K + + + + + K Sbjct: 381 EARQKQYEFYRNFLLTFKIENCTLPKTRQDKTRQDIIKLFMYIFGYIELELGEILKIKNG 440 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I + G +TY ++ R + N + ++ Sbjct: 441 SDYKKF-----NIGNIPVYGSGGIINYIDTYIYDKESVLIPRKGSIGNLFYVDKPFWTVD 495 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 T Y + + YL + + +L K+ +G SL + ++ + +P + Sbjct: 496 ----TIFYTVIDKDVVIPKYLYYYLSKMNLEKL---NTAGGVPSLTQTVLNKILISLPSL 548 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +EQ I ++++ + + E + I ++ R + Sbjct: 549 EEQERIVDILDRFDKLCNDISEGLPAEIEARQKQYEYYREKLLT 592 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 48/127 (37%), Gaps = 8/127 (6%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 GE++ + + + + ++ Y+ + +++ + Sbjct: 61 RKGEVI--TVAGSGAYAGFVMYWDKPIFVSDAFSIECDKSYLNIKYIYYFLQNNQMKIHS 118 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV----- 401 G G+ + F+D+++ V VPP++ Q +I +++ TA L ++ + Sbjct: 119 LKKGGGV-PHVYFKDMQKFLVPVPPLEVQNEIVRILDNFTALTAELTAELTAELTAELTA 177 Query: 402 LLKERRS 408 L R+ Sbjct: 178 ELTARKK 184 >gi|69245866|ref|ZP_00603683.1| Restriction modification system DNA specificity domain [Enterococcus faecium DO] gi|257879184|ref|ZP_05658837.1| restriction-modification enzyme type I S subunit [Enterococcus faecium 1,230,933] gi|257881997|ref|ZP_05661650.1| restriction-modification enzyme type I S subunit [Enterococcus faecium 1,231,502] gi|257890014|ref|ZP_05669667.1| restriction-modification enzyme type I S subunit [Enterococcus faecium 1,231,410] gi|258615582|ref|ZP_05713352.1| HsdS protein [Enterococcus faecium DO] gi|260560169|ref|ZP_05832346.1| HsdS protein [Enterococcus faecium C68] gi|293560249|ref|ZP_06676748.1| HsdS protein [Enterococcus faecium E1162] gi|314947718|ref|ZP_07851125.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0082] gi|68195568|gb|EAN10010.1| Restriction modification system DNA specificity domain [Enterococcus faecium DO] gi|257813412|gb|EEV42170.1| restriction-modification enzyme type I S subunit [Enterococcus faecium 1,230,933] gi|257817655|gb|EEV44983.1| restriction-modification enzyme type I S subunit [Enterococcus faecium 1,231,502] gi|257826374|gb|EEV53000.1| restriction-modification enzyme type I S subunit [Enterococcus faecium 1,231,410] gi|260073736|gb|EEW62061.1| HsdS protein [Enterococcus faecium C68] gi|291605793|gb|EFF35228.1| HsdS protein [Enterococcus faecium E1162] gi|313645698|gb|EFS10278.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0082] Length = 414 Score = 99.9 bits (247), Expect = 8e-19, Method: Composition-based stats. Identities = 61/407 (14%), Positives = 132/407 (32%), Gaps = 31/407 (7%) Query: 25 WKVVPIKRFTK----LNTGRTSESGKDIIYIGLED--VESGTGKYLPKDGNSRQSDTSTV 78 W+ + + G + K ++ + V + Sbjct: 18 WEQRKFECLLDKKDGVRRGPFGSALKKEFFVSNSNFVVYEQQNAIYDNYETRYKITEKKY 77 Query: 79 -----SIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSI 131 G + G R + + G+ + + L+ + + ++ I Sbjct: 78 NELIKFKLEPGDFIMSGAGTIGRISRVPKQIKPGVFNQALIRLRINKEITDSEY-FIQFI 136 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + S + + ++KI ++D I + R Sbjct: 137 RADFMQRKLTGANPGSAITNLVPMSEVKKWIVQFPILEEQKKIGNFFKQLDDTIALQQRK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++LLKE K+ + + P K I + G + WE + F + +KNTK Sbjct: 197 LDLLKETKKGFLQKMF-----PKNGAKVPEIRFPGFT-EDWEQRKFKEFSKKTGKKNTKD 250 Query: 252 IESNILSLSY--GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 ++ S+S G I Q + L Y+ V+P E + + + S+ Sbjct: 251 LDFPAYSVSNKAGLISQTEQFDGSRLDDLEKTNYKFVEPNEFAYNP--ARVNVGSIAFNN 308 Query: 310 VMERGIITSAYMA-VKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPV 367 + I++S Y+ +D+ ++ ++S K +R+ L +E+ + Sbjct: 309 LGMTVIVSSLYVVVKMSEDLDNEFILQFIKSPTFIKEVKRNTEGSVREYLFYENFANIKF 368 Query: 368 LVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I ++D + ++ + LLKE + F+ Sbjct: 369 PFTRNKEEQQKIGAF----FKQLDDTIALHQRKLDLLKETKKGFLQK 411 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 68/189 (35%), Gaps = 8/189 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS-DTSTVSIF 81 + W+ K F+K TG+ + D + + + DG+ + + Sbjct: 229 EDWEQRKFKEFSK-KTGKKNTKDLDFPAYSVSNKAGLISQTEQFDGSRLDDLEKTNYKFV 287 Query: 82 AKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + Y + + S +V +D+ E + ++ S + ++ Sbjct: 288 EPNEFAYNPARVNVGSIAFNNLGMTVIVSSLYVVVKMSEDLDNEFILQFIKSPTFIKEVK 347 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG+ + ++ NI P E+ +KI A ++D I R ++LLKE Sbjct: 348 RNTEGSVREYLFYENFANIKFPFTRNKEEQ---QKIGAFFKQLDDTIALHQRKLDLLKET 404 Query: 199 KQALVSYIV 207 K+ + + Sbjct: 405 KKGFLQKMF 413 >gi|255308058|ref|ZP_05352229.1| type I restriction-modification system S subunit [Clostridium difficile ATCC 43255] Length = 366 Score = 99.9 bits (247), Expect = 8e-19, Method: Composition-based stats. Identities = 47/399 (11%), Positives = 118/399 (29%), Gaps = 43/399 (10%) Query: 26 KVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + + + +N G+T ++ + D++ ++ + + + Sbjct: 2 EYIKLNELCYINIGKTPSRNTSDYWGSGNRWLSISDLKEKYILKSKEEITDLAVEKANMK 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + K ++ R AI+ + + + + + +L T Sbjct: 62 LVPKNTVVMSFKLSIGRVAILKEDM--FTNEAIANFQIKNNELITYEYLYYALRTLNFNN 119 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + +I +P + Q + E + I+ + EL Sbjct: 120 TDRAVMGATLNKSKLNDIKIPYFTICIQNKMVEVLNKAQELINKRKEQIEALDEL----- 174 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + S + N KD E +G + + K ++ KN+ + Sbjct: 175 --VKSRFIEMFGNVITNSKDWDTELLGEISNLKAGKNIK--AKDIYEKNSHELYPCYGGN 230 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 ++ + F I Q + A Sbjct: 231 GLRGYVKMYSHKGT-------------------FNLIGRQGALCGNVKYVNGKFYATEHA 271 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + I+S +L + ++ DL ++ + L + + + P+ Q Sbjct: 272 VVVQPKVDINSYWLYFTLKELDLNRL---STGAAQPGLTVGKLNEVEIPKVPVYLQNKFV 328 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + +ID L ++E S+ L+ +S + A G+ Sbjct: 329 DFV----RQIDKLKSRMEDSLKELENNFNSLMQKAFKGE 363 Score = 44.0 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 47/191 (24%), Gaps = 17/191 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W + + L G+ I +D+ L Sbjct: 191 KDWDTELLGEISNLKAGKN---------IKAKDIYEKNSHELYPCYGGNGLRGYVKMYSH 241 Query: 83 KGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 KG L G+ G + + +V+QPK + + L + + Sbjct: 242 KGTFNLIGRQGALCGNVKYVNGKFYATEHAVVVQPKVDINSYWLYFTLKEL---DLNRLS 298 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GA + + +P P+ Q + + + + Sbjct: 299 TGAAQPGLTVGKLNEVEIPKVPVYLQNKFVDFVRQIDKLKSRMEDSLKELENNFN----S 354 Query: 202 LVSYIVTKGLN 212 L+ L Sbjct: 355 LMQKAFKGELF 365 >gi|120555303|ref|YP_959654.1| restriction modification system DNA specificity subunit [Marinobacter aquaeolei VT8] gi|120325152|gb|ABM19467.1| restriction modification system DNA specificity domain [Marinobacter aquaeolei VT8] Length = 374 Score = 99.9 bits (247), Expect = 8e-19, Method: Composition-based stats. Identities = 54/370 (14%), Positives = 116/370 (31%), Gaps = 32/370 (8%) Query: 76 STVSIFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQ--GWL 128 + + G+ Y K + GI S ++ + L + Sbjct: 3 TNYFLLKSGEFAYNKSYSNGYPVGVVRRLKRYDSGILSPLYICFDMSSSEVDELYAEHFF 62 Query: 129 LSIDVTQRIEAICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 S I I + +H ++ +PPL EQ I + + I+ Sbjct: 63 DSQWFIDEINQIAKEGARNHGLLNVGVGEFFDLEFVLPPLPEQQKIAAILSSVDDVIEKT 122 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + + +L Q L++ + P + KDS + G VP WE+ + Sbjct: 123 RAQIDKLKDLKTGMMQELLTKGIGSDGVPHTEFKDSPV---GRVPVSWEIVRLGDVSKVQ 179 Query: 245 NR---KNTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFRFID 297 K+ ++ L N+ + E + + S + + +IV Sbjct: 180 GGFAFKSADATDNGCRWLKIANVGRGTVVWGEKSFLPNEFLSEYSDFALKEADIVVALTR 239 Query: 298 LQNDKRSLRSAQVMERG--IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR- 354 + + ++ V P +L KV + + Sbjct: 240 PVISGELKVAQLMKSDAPSLLNQRVARVIPKL-SRVSREYLFTLLSWRKVANDIEQAIFG 298 Query: 355 ---QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 ++ + ++ L +PP +EQ I + + + RI L + L+ + + + Sbjct: 299 TDPPNVSTKQIESLCYPLPPREEQDLIASSLGAVSNRIRTL----SNKLDQLRGTKEALM 354 Query: 412 AAAVTGQIDL 421 +TG++ + Sbjct: 355 QDLLTGKVRV 364 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 47/227 (20%), Positives = 89/227 (39%), Gaps = 20/227 (8%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYL 64 ++KDS V G +P W++V + +K+ G +S ++ + +V GT + Sbjct: 154 EFKDSPV---GRVPVSWEIVRLGDVSKVQGGFAFKSADATDNGCRWLKIANVGRGTVVWG 210 Query: 65 PKDGNSRQS-DTSTVSIFAKGQILYGKLGPYL----RKAIIADFDG--ICSTQFLVLQPK 117 K + + + I+ P + + A + D + + + + PK Sbjct: 211 EKSFLPNEFLSEYSDFALKEADIVVALTRPVISGELKVAQLMKSDAPSLLNQRVARVIPK 270 Query: 118 --DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 V E L L V IE G + K I ++ P+PP EQ LI + Sbjct: 271 LSRVSREYLFTLLSWRKVANDIEQAIFGTDPPNVSTKQIESLCYPLPPREEQDLIASSL- 329 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI 222 + I ++ L+ K+AL+ ++T + +V K+S + Sbjct: 330 ---GAVSNRIRTLSNKLDQLRGTKEALMQDLLTGKVRVNVDQKESAV 373 >gi|168482751|ref|ZP_02707703.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC1873-00] gi|172043638|gb|EDT51684.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC1873-00] Length = 521 Score = 99.9 bits (247), Expect = 8e-19, Method: Composition-based stats. Identities = 65/438 (14%), Positives = 140/438 (31%), Gaps = 66/438 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L KE ++++ Y + L +S Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 221 ---------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + G +P +W V + + + K + +I + II+ + Sbjct: 323 DISIVSQGDDNSYYGNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINN-KGVRIIRGGNIK 381 Query: 272 NMGLKPESYETYQ----------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + Y + +++ G++ ++ Sbjct: 382 PLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFI 441 Query: 322 A----VKPHGIDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + I S +L + + S K + ++ + L + + P +E Sbjct: 442 FQLTPFESSEIISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEE 501 Query: 375 QFDITNVINVETARIDVL 392 Q IT + +++ L Sbjct: 502 QELITQKVEKLFEKVNQL 519 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 337 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 396 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 397 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 456 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 457 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 516 Query: 182 DTLI 185 + L Sbjct: 517 NQLW 520 >gi|239629953|ref|ZP_04672984.1| restriction modification system DNA specificity domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] gi|239527565|gb|EEQ66566.1| restriction modification system DNA specificity domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] Length = 402 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 54/406 (13%), Positives = 121/406 (29%), Gaps = 39/406 (9%) Query: 25 WKVVPIKRFTK-LNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV-- 78 W+ K G + + K Y+ + D++ T + + S ++ S Sbjct: 20 WEKRKYGDIAKSFQYGLNAPAKKFDGINKYLRITDIDDLTRLFKQESLTSPDTNLSNAST 79 Query: 79 SIFAKGQILYGKLGP-YLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLLSIDVT 134 + +G +L+ + G + DG E L+ Sbjct: 80 YLLKQGDVLFARTGASTGKTYKYRKGDGKVYFAGFLIRADLKPKFDSEFFYQTTLTDSFL 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ + + + K N + +P L+EQ I + I + + Sbjct: 140 DFVKVTSQRSGQPGINSKEYANKAIQVPELSEQQRIGSVLAIYDNLIAATQDKIDALEQA 199 Query: 195 LKEKKQALVSYI--VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 K Q L +P K K S + Sbjct: 200 KKALLQRLFDQSWRFKGYSDPWEKRKVSD--------------------YLCESRIPGSN 239 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS--AQV 310 L+ + + +N Y + ++++ +D + + Sbjct: 240 GLKAKKLTVKLWGKGVVPKNETYSGSIKTKYYVRSANQLIYGKLDFLHAAFGIVPQSLDG 299 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 E I + A+ G + LA +R ++ L + A GS + + + + + Sbjct: 300 WESTIDSPAFDVNTSIGNAAFLLALFLRPNFYLREGIRANGSRKAKRIHEDTFLSMSISA 359 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P KEQ I V++ + ++L+ + + L+ + + + Sbjct: 360 PQRKEQDQIAVVLD----KTELLIAATQSRLSSLELLKKALLQDLF 401 >gi|58616451|ref|YP_195580.1| Type I restriction-modification system (specificity subunit) [Azoarcus sp. EbN1] gi|56315913|emb|CAI10556.1| Type I restriction-modification system (specificity subunit) [Aromatoleum aromaticum EbN1] Length = 408 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 54/402 (13%), Positives = 124/402 (30%), Gaps = 30/402 (7%) Query: 29 PIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + +N + ++ + V+ TG S + F + Sbjct: 7 RLADVCDINPRLPRTHGITDDTLVSFVPMAAVDELTGTIATSQSRSFAEVKKGYTSFREN 66 Query: 85 QILYGKLGPYLRKAI------IADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDVTQRI 137 +L+ K+ P + + G ST+F VL+ VLPE L+ ++ + + Sbjct: 67 DVLFAKITPCMENGKAALAQSLVGGVGFGSTEFHVLRAGPQVLPEWLRYFVRREEFRREA 126 Query: 138 EAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G + +P+P L EQ I + + + ++ R Sbjct: 127 KRNFTGTAGQQRVPTTFLSGAEIPVPSLDEQRRIVDLLSRA----EGIVRLRREAQRKAA 182 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 E AL V +P K + +G + + P + Sbjct: 183 EIIPAL---FVDMFGDPATNPKGWPVTTIGSLSSYTRYGP---RFPDRPYAAEGAHILRT 236 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + Y I + + + + E Y + PG ++ K ++ E I Sbjct: 237 TDMGYSGDIHWSDAPVLPVTVDELEKYH-LRPGTLLVTRTGATIGKIAIFRG-AEEPCIA 294 Query: 317 TSAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKE 374 + + + Y+ + S + ++ + +P+ +PP++ Sbjct: 295 GAYLIEIGFQAQVIPEYILHFLLSAFGQSQLVRGSRAVAQPNINAPTICAIPIPLPPLEI 354 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 Q ++ ++ E ++ + +S +A + Sbjct: 355 QARFAASVD----QLRAAQGLQESAMAKAEAIFNSLLAQVFS 392 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 51/168 (30%), Gaps = 10/168 (5%) Query: 22 PKHWKVVPIKRFTKLNT-GRTSES----GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDT 75 PK W V I + G + + D+ SG + D Sbjct: 200 PKGWPVTTIGSLSSYTRYGPRFPDRPYAAEGAHILRTTDMGYSGDIHWSDAPVLPVTVDE 259 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFL-VLQPKDVLPELLQGWLLSI 131 G +L + G + K I + I + + V+PE + +LLS Sbjct: 260 LEKYHLRPGTLLVTRTGATIGKIAIFRGAEEPCIAGAYLIEIGFQAQVIPEYILHFLLSA 319 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 ++ + + I IP+P+PPL Q + Sbjct: 320 FGQSQLVRGSRAVAQPNINAPTICAIPIPLPPLEIQARFAASVDQLRA 367 >gi|225860524|ref|YP_002742033.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae Taiwan19F-14] gi|225727877|gb|ACO23728.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae Taiwan19F-14] Length = 521 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 66/438 (15%), Positives = 140/438 (31%), Gaps = 66/438 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPLAEQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L KE ++++ Y + L +S Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 221 ---------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + G +P +W V + + + K + +I + II+ + Sbjct: 323 DISIVSQGDDNSYYGNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINN-KGVRIIRGGNIK 381 Query: 272 NMGLKPESYETYQ----------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + Y + +++ G++ ++ Sbjct: 382 PLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFI 441 Query: 322 A----VKPHGIDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + I S +L + + S K + ++ + L + + P +E Sbjct: 442 FQLTPFESSEIISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEE 501 Query: 375 QFDITNVINVETARIDVL 392 Q IT + +++ L Sbjct: 502 QELITQKVEKLFEKVNQL 519 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 337 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 396 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 397 QFISSEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 456 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 457 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 516 Query: 182 DTLI 185 + L Sbjct: 517 NQLW 520 >gi|294620837|ref|ZP_06700041.1| HsdS protein [Enterococcus faecium U0317] gi|291599622|gb|EFF30635.1| HsdS protein [Enterococcus faecium U0317] Length = 413 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 61/407 (14%), Positives = 132/407 (32%), Gaps = 31/407 (7%) Query: 25 WKVVPIKRFTK----LNTGRTSESGKDIIYIGLED--VESGTGKYLPKDGNSRQSDTSTV 78 W+ + + G + K ++ + V + Sbjct: 18 WEQRKFECLLDKKDGVRRGPFGSALKKEFFVSNSNFVVYEQQNAIYDNYETRYKITEKKY 77 Query: 79 -----SIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSI 131 G + G R + + G+ + + L+ + + ++ I Sbjct: 78 NELIKFKLEPGDFIMSGAGTIGRISRVPKQIKPGVFNQALIRLRINKEITDSEY-FIQFI 136 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + S + + ++KI ++D I + R Sbjct: 137 RADFMQRKLTGANPGSAITNLVPMSEVKKWIVQFPILEEQKKIGNFFKQLDDTIALQQRK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++LLKE K+ + + P K I + G + WE + F + +KNTK Sbjct: 197 LDLLKETKKGFLQKMF-----PKNGAKVPEIRFPGFT-EDWEQRKFKEFSKKTGKKNTKD 250 Query: 252 IESNILSLSY--GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 ++ S+S G I Q + L Y+ V+P E + + + S+ Sbjct: 251 LDFPAYSVSNKAGLISQTEQFDGSRLDDLEKTNYKFVEPNEFAYNP--ARVNVGSIAFNN 308 Query: 310 VMERGIITSAYMA-VKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPV 367 + I++S Y+ +D+ ++ ++S K +R+ L +E+ + Sbjct: 309 LGMTVIVSSLYVVVKMSEDLDNEFILQFIKSPTFIKEVKRNTEGSVREYLFYENFANIKF 368 Query: 368 LVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I ++D + ++ + LLKE + F+ Sbjct: 369 PFTRNKEEQQKIGAF----FKQLDDTIALHQRKLDLLKETKKGFLQK 411 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 68/189 (35%), Gaps = 8/189 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS-DTSTVSIF 81 + W+ K F+K TG+ + D + + + DG+ + + Sbjct: 229 EDWEQRKFKEFSK-KTGKKNTKDLDFPAYSVSNKAGLISQTEQFDGSRLDDLEKTNYKFV 287 Query: 82 AKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + Y + + S +V +D+ E + ++ S + ++ Sbjct: 288 EPNEFAYNPARVNVGSIAFNNLGMTVIVSSLYVVVKMSEDLDNEFILQFIKSPTFIKEVK 347 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG+ + ++ NI P E+ +KI A ++D I R ++LLKE Sbjct: 348 RNTEGSVREYLFYENFANIKFPFTRNKEEQ---QKIGAFFKQLDDTIALHQRKLDLLKET 404 Query: 199 KQALVSYIV 207 K+ + + Sbjct: 405 KKGFLQKMF 413 >gi|301170024|emb|CBW29628.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae 10810] Length = 466 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 55/464 (11%), Positives = 136/464 (29%), Gaps = 74/464 (15%) Query: 23 KHWKVVPIKRFT-KLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 K+W+ ++ K+ +G T +I +I +++ +G + Sbjct: 10 KNWQKYSLEEICLKITSGGTPSRQNPKLYKNGNINWIKTKELNNGYIFESEEKITEEAIK 69 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ + IL G + + I + C+ L + + L Sbjct: 70 KSSAKLLPVNTILLAMYGATVGELGILGKEMACNQACCALIIDPKKADYRFIFYLLRLYK 129 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I+++ GA + K I IP L +Q I + + +ID ++ Sbjct: 130 KEIQSLATGAAQQNLSAKTIKEFSFYIPNLEKQKKIADILSELDKKIDLNTQINQTLEQI 189 Query: 195 LKEKKQALV---------SYIVTKGL---------------------------------- 211 + ++ ++ GL Sbjct: 190 AQALFKSWFVDFDPVRAKVQALSDGLSLEQAELAAMQAISGKTPEELTALSQTQPDRYAE 249 Query: 212 --NPDVKMKDSGIEWVG-LVPDHWEVKPFFALVTELNRKNTKLIES----------NILS 258 +E G VP WE+K ++ L + + NI Sbjct: 250 LAETAKAFPCEMVEVDGVEVPKGWEIKALPEIIDFLEGPGIRNWQYTDEEDGIKFINIRC 309 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + G++ + + + ++ +IV +R + + Sbjct: 310 IQNGDLTLTTANKITKEEAFGKYKHFQLEEDDIVVSTSGTLGRFAFVRKEHLPLSLNTSV 369 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP---IKEQ 375 + ++A + + ++ +++ +K++ +LVP ++ Sbjct: 370 IRFRPIKNKSTLGFIAGFVENQLQHELEIRASGSAQRNFGPTHLKQITLLVPDFKLLELH 429 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + + + ++ I +LK+ R + + G+I Sbjct: 430 QKYVSSLFEKRKQL-------LSEIDVLKDTRDLLLPKLLNGEI 466 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 20/172 (11%), Positives = 58/172 (33%), Gaps = 11/172 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +PK W++ + G + ++ I +I + +++G + +++ Sbjct: 268 EVPKGWEIKALPEIIDFLEGPGIRNWQYTDEEDGIKFINIRCIQNGDLTLTTANKITKEE 327 Query: 74 DTSTVSIF--AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW---L 128 F + I+ G R A + S V++ + + + G+ Sbjct: 328 AFGKYKHFQLEEDDIVVSTSGTLGRFAFVRKEHLPLSLNTSVIRFRPIKNKSTLGFIAGF 387 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + +E G+ + + I + +P L ++ + + + Sbjct: 388 VENQLQHELEIRASGSAQRNFGPTHLKQITLLVPDFKLLELHQKYVSSLFEK 439 >gi|327404960|ref|YP_004345798.1| restriction modification system DNA specificity domain-containing protein [Fluviicola taffensis DSM 16823] gi|327320468|gb|AEA44960.1| restriction modification system DNA specificity domain protein [Fluviicola taffensis DSM 16823] Length = 391 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 56/403 (13%), Positives = 114/403 (28%), Gaps = 46/403 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + ++ ++ TG+ S + Y G + Sbjct: 14 EWKTLRDTCEIKTGKGITKND----------SSDSAPYPIISGGKEPMGYFEKFNRRENS 63 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQP----KDVLPELLQGWLLSIDVTQRIEAIC 141 + ++G + + + P K + ++L + E Sbjct: 64 VTISRVGANAGYVSFIVSKFYLNDKCFSVLPIENYKSKIDNKFLFYVLKTNEKSITELQS 123 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 EG + + +G I +PIPPL Q I + T L E + K++ Sbjct: 124 EG-GVPTINTTKVGGIQIPIPPLEIQQKIVAILDVFTELTAELTAELTAELTARKQQYNY 182 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + + ++ +G V D K T I Sbjct: 183 YRVQLFR------FDEIEVELKSLGWVGDVRMCKRILKEQTTEIG--------VIPFYKI 228 Query: 262 GNIIQKLETRNMGLKPESY-ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G ++ + Y Y GE++ + Sbjct: 229 GTFGKEPNAYISKELFDEYRSKYNYPKVGEVLISASGTIGRAVIF----DGHDAYFQDSN 284 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIK 373 + + +L Y + K A G G Q L +++K+ + +P +K Sbjct: 285 IVWIENNESKVLNKYLFYFYQIVKWEIADG-GTIQRLYNDNLKKTKIPIPYPNDPKKSLK 343 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 EQ I ++++ D + E + + I L K+ R ++ Sbjct: 344 EQERIVSILDKFDTLTDSISEGLPKEIELRKKQYEYYRDLLLS 386 >gi|229542811|ref|ZP_04431871.1| restriction modification system DNA specificity subunit [Bacillus coagulans 36D1] gi|229327231|gb|EEN92906.1| restriction modification system DNA specificity subunit [Bacillus coagulans 36D1] Length = 393 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 58/414 (14%), Positives = 133/414 (32%), Gaps = 44/414 (10%) Query: 18 IGAI--PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSD 74 IG I P W + + KL + +D + L ++ G + ++ Q Sbjct: 14 IGKITFPNDWSIYKLSDILKLV--KRPIKMEDQKFYNLVTIKRRFGGMVKRERLRGNQIQ 71 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSI 131 + G + K I I S ++ +L+ K+ L W + + Sbjct: 72 VKSQFSVKSGDFVISKRQISHGACAIVPEKLDGSIVSNEYNILRNKENLDLEFFNWYVQL 131 Query: 132 DVTQRIEAICEGATMSH---ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 QR + + + IP + EQ I + I I+ Sbjct: 132 PFMQRYFYLSSDGVHIEKLLFKLEDWLQRKVCIPEVKEQKKIAKIISTWDKAIELKEKLI 191 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + K Q L+ +G + WE F + + K Sbjct: 192 EQKKKQKKGLMQKLL----------------TGEVRLPGFYGEWEKVSFSDIFIKTKVKK 235 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 ++ + L ++ + + + + + +++ + G IVF R ++ Sbjct: 236 HQIKTNEYLESGKYPVVDQGQKKVTAYSNDEEKVFEVPETGVIVFGD-----HTREIKFI 290 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + + D + + + + +G + + +K + Sbjct: 291 DFDFIIGADGTQVLMTKDDYDVRFYYYHLLIQKIPN------TGYNRHF--KFLKEMIFN 342 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 P +KEQ I+N+++ +D+L + L E++ + +TG++ ++ Sbjct: 343 KPSLKEQKAISNLLSTIDKELDLL----NAELSALNEQKKGLMQLLLTGKVRVK 392 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 73/200 (36%), Gaps = 8/200 (4%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIV 286 P+ W + ++ + R + ++ ++ + ++ V Sbjct: 19 FPNDWSIYKLSDILKLVKRPIKMEDQKFYNLVTIKRRFGGMVKRERLRGNQIQVKSQFSV 78 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G+ V + + ++ ++ + + +D + W ++ + + F Sbjct: 79 KSGDFVISKRQISHGACAIVPEKLDGSIVSNEYNILRNKENLDLEFFNWYVQLPFMQRYF 138 Query: 347 YAMGSGLRQS---LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 Y G+ K ED + V +P +KEQ I +I+ D +E E+ I Sbjct: 139 YLSSDGVHIEKLLFKLEDWLQRKVCIPEVKEQKKIAKIISTW----DKAIELKEKLIEQK 194 Query: 404 KERRSSFIAAAVTGQIDLRG 423 K+++ + +TG++ L G Sbjct: 195 KKQKKGLMQKLLTGEVRLPG 214 >gi|261210084|ref|ZP_05924382.1| restriction endonuclease S subunit [Vibrio sp. RC341] gi|260840849|gb|EEX67391.1| restriction endonuclease S subunit [Vibrio sp. RC341] Length = 420 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 50/412 (12%), Positives = 122/412 (29%), Gaps = 44/412 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVSIFAK 83 W ++ L G+ ++ +++ +G P G + + + Sbjct: 24 WVSKKLEDICSLQAGK---------FVKAASIKNEKSGNLYPCYGGNGLRGFTKSFTHSG 74 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 L G+ G A + +V++PK + L + L + G Sbjct: 75 NYSLIGRQGALCGNINFASGTFHATEHAVVVEPKHGIDNLWLYYELC---RLNLNQFATG 131 Query: 144 ATMSHADWKGIGNIPMPIPPL-AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + +P + EQ I + + I + + K+ L Sbjct: 132 QAQPGLSVDNLYKVDTCVPVVGKEQQKIGACLSSMDNLIVENVKKLESLKL----HKKGL 187 Query: 203 VSYIVTKGLNPDVKMKDS-GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + + ++ + ++W + ++ R + + ++ Y Sbjct: 188 MQKLFPDEGKSAPELGFTCNVKWNKKKFEEVYSLKTTNSLS---RDKLNYDDGLVKNIHY 244 Query: 262 GNIIQKLE-----------TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 G+I K N + + + G++VF D + Sbjct: 245 GDIHTKFSTLFDITKESVPFINAEIALDKVKEESYCQEGDMVFADASEDIDDVGKSIELI 304 Query: 311 MERG-----IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKR 364 G + + K + + +L +S L K G + + + + Sbjct: 305 NLNGEKLLSGLHTILARQKGSYLVKGFGGYLFKSEVLRKQIQKESQGAKVLGISATRISK 364 Query: 365 LPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + V+ P EQ I + + + +D L++ + I +L + + Sbjct: 365 IDVVYPIEQSEQQRIVDCL----SSLDKLIDAQTKKIEILNIYKKGLMQQLF 412 >gi|331007889|ref|ZP_08330972.1| Restriction endonuclease S subunit [gamma proteobacterium IMCC1989] gi|330418302|gb|EGG92885.1| Restriction endonuclease S subunit [gamma proteobacterium IMCC1989] Length = 406 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 69/412 (16%), Positives = 132/412 (32%), Gaps = 50/412 (12%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W + + +K +G T DI ++ +D+++ + + Sbjct: 6 WTTKSLGKLSKFKSGGTPSKSNPKFWGGDIPWVTAKDMKTPLINNSIDKLTTEALNV--A 63 Query: 79 SIFAKGQILYGKLGPYLRK---AIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVT 134 + +L G L K I + + + K++LP L +L S Sbjct: 64 KLAPTNTLLILVRGMTLHKDLPLAITKKELAFNQDIKALTTCKEILPMFLMIYLSSQKHK 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 G D + + P+ P + EQ I + ++ I+ T Sbjct: 124 VLKLVDSAGHGTGRLDTDLLKSFPVNYPSIFEQKKIVDTLVFWDNAIEKTETLIAAKENQ 183 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K Q L P + + E R+ + Sbjct: 184 FKWLTQKLF------------------------KPTASWQSYKLSDLFENRRETKNVDLP 219 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I +I ET E Y + P +I + + + +L + G Sbjct: 220 LISITREKGVIPHSETNRKDNSNEDKSKYLRIRPNDIGYNTMRMWQGVSALSTID----G 275 Query: 315 IITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVP 370 I++ AY KP I + ++A+L ++ + FY GL +LKF + +P Sbjct: 276 IVSPAYTVCKPKKIVNPEFMAFLFKTKPMIHKFYRYSQGLTSDTWNLKFHHFSEVKASIP 335 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSI-VLLKERRSSFIAAAVTGQIDL 421 I+ Q +I +N ID L + I + ++ + +TG+ + Sbjct: 336 DIETQSEIAKSLNSAKKEIDTL-----RKISEKYRIQKRGLMQKMLTGEWQV 382 >gi|284800798|ref|YP_003412663.1| type I restriction enzyme, S subunit [Listeria monocytogenes 08-5578] gi|284993984|ref|YP_003415752.1| type I restriction enzyme, S subunit [Listeria monocytogenes 08-5923] gi|284056360|gb|ADB67301.1| type I restriction enzyme, S subunit [Listeria monocytogenes 08-5578] gi|284059451|gb|ADB70390.1| type I restriction enzyme, S subunit [Listeria monocytogenes 08-5923] Length = 405 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 58/402 (14%), Positives = 129/402 (32%), Gaps = 34/402 (8%) Query: 25 WKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS--RQSDTSTV 78 W+ + + + YI + D++ + + + S D Sbjct: 20 WEQRKLGEIANSFEYGLNASSKTYDGENKYIRITDIDESSHVFNQDNLTSPNISLDKLNH 79 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +G IL + G K+ ++ + L+ Sbjct: 80 YLLEEGDILLARTGASTGKSYYYSKMDGKVFFAGFLIRAKIKQEYNVSFIFQNTLTERYN 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I+ + + + + + IP L EQ I ++D I R ++ Sbjct: 140 NFIQVTSQRSGQPGINAQEYARFALYIPELKEQQKIGVF----FKQLDNAIALHQRKLDA 195 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LK K+ + + ++ I + + + R Sbjct: 196 LKLMKKGFLQQMFP--------KIEADIPEIRFADFDGKWEQRKLGEIFNERSERSADGE 247 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I +I+ + Y++V G+I + + + S G Sbjct: 248 LISVTINSGVIKASKLEKKDNSSFDKSNYKVVKKGDIAYNSMRMWQGASGYSSYD----G 303 Query: 315 IITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVP 370 I++ AY + P ID+ ++A++ + D+ + F GL +LKF + + + +P Sbjct: 304 ILSPAYTVIYPRKDIDTIFIAYMFKKIDMIQTFQRNSQGLTSDTWNLKFPSLSTIKIKIP 363 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ ITN +++ + I +LK+ + +++ Sbjct: 364 ANDEQIKITN----LFQKLEYTSILHQNQIEMLKKVKKAYLQ 401 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 60/183 (32%), Gaps = 5/183 (2%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + R+++ ++I + + K KD + D S + KG Sbjct: 227 WEQRKLGEIFNERSERSADG--ELISVTINSGVIKASKLEKKDNS--SFDKSNYKVVKKG 282 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I Y + + + + +DGI S + V+ P+ + + ++ + Sbjct: 283 DIAYNSMRMWQGASGYSSYDGILSPAYTVIYPRKDIDTIFIAYMFKKIDMIQTFQRNSQG 342 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 S ++ + + T I + + L K+ K+A + Sbjct: 343 LTSDTWNLKFPSLSTIKIKIPANDEQIKITNLFQKLEYTSILHQNQIEML-KKVKKAYLQ 401 Query: 205 YIV 207 + Sbjct: 402 TMF 404 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 16/164 (9%), Positives = 47/164 (28%), Gaps = 5/164 (3%) Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I + + + + + +++ G+I+ K S Sbjct: 47 NKYIRITDIDESSHVFNQDNLTSPNISLDKLNHYLLEEGDILLARTGASTGKSYYYSKMD 106 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLV 369 + A + +++ + + + ++ R + + Sbjct: 107 GKVFFAGFLIRAKIKQEYNVSFIFQNTLTERYNNFIQVTSQRSGQPGINAQEYARFALYI 166 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P +KEQ I ++D + ++ + LK + F+ Sbjct: 167 PELKEQQKIGVF----FKQLDNAIALHQRKLDALKLMKKGFLQQ 206 >gi|224023956|ref|ZP_03642322.1| hypothetical protein BACCOPRO_00673 [Bacteroides coprophilus DSM 18228] gi|224017178|gb|EEF75190.1| hypothetical protein BACCOPRO_00673 [Bacteroides coprophilus DSM 18228] Length = 403 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 61/420 (14%), Positives = 136/420 (32%), Gaps = 39/420 (9%) Query: 23 KHWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDV-------ESGTGKYLPKDGNSR 71 WK V + + + + + + S I + +++ E Y+P + Sbjct: 2 SEWKKVKLGKLCDITSSKRCLASERSNNGIPFYCSKEIILLEKGEEIRDSDYIPIELYLS 61 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC----STQFLVLQPKDVLPELLQGW 127 + + G +L G I D + + D+ + L W Sbjct: 62 IKE--KYGVPITGDLLLTTRGTNGIPYIYKKHDCFYFADGNLSWFKNFKSDLDVKYLYYW 119 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S I++I +G G+ NI + IP + Q I E + I+ Sbjct: 120 FKSDTGKHIIDSIAKGTAQKAIPIDGLRNINISIPSIRVQCKISEILSHYDTLIENY--- 176 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + I+LL+E Q L P + ++ V + E+ F ++++ K Sbjct: 177 -QKQIKLLEESAQRLYKEWFVDLRFPGYENTKI-VDGVPEGWEKKEINEFISILSGYAFK 234 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGL-----KPESYETYQIVDPGEIVFRFIDLQNDK 302 ++ +E + +Q L P + + G+++ Sbjct: 235 SSSFVEDGDYKIVTIKNVQDGFFDGKNLSHIREIPNKMPKHCFLTTGDLLLSLTGNIGRV 294 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFED 361 + + ++ ++ + L RS +L + +G +Q++ Sbjct: 295 CMV----IGNNYLLNQRVAKIESVF--PAFAYCLFRSENLFTSINNLANGAAQQNVSPIK 348 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + L ++V +I + I + + I L E R + ++G+I++ Sbjct: 349 IGTLKIVV-----NNEIISKFEKVVGNIRNQILVLYSQIEELTEARDRLLPKLMSGEIEI 403 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 73/212 (34%), Gaps = 15/212 (7%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGT 60 +P Y+++ + + +P+ W+ I F + +G +S D + +++V+ G Sbjct: 199 RFPGYENTKI--VDGVPEGWEKKEINEFISILSGYAFKSSSFVEDGDYKIVTIKNVQDGF 256 Query: 61 GK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + G +L G R ++ + + + + V + + V Sbjct: 257 FDGKNLSHIREIPNKMPKHCFLTTGDLLLSLTGNIGRVCMVIGNNYLLNQR--VAKIESV 314 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 P S ++ I + GA + IG + + + I K Sbjct: 315 FPAFAYCLFRSENLFTSINNLANGAAQQNVSPIKIGTLKIVVNNE-----IISKFEKVVG 369 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 I I IE L E + L+ +++ + Sbjct: 370 NIRNQILVLYSQIEELTEARDRLLPKLMSGEI 401 >gi|254517360|ref|ZP_05129417.1| restriction modification system DNA specificity domain protein [gamma proteobacterium NOR5-3] gi|219674198|gb|EED30567.1| restriction modification system DNA specificity domain protein [gamma proteobacterium NOR5-3] Length = 570 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 89/476 (18%), Positives = 159/476 (33%), Gaps = 89/476 (18%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W+ + + T G + ++ D + LEDVE GT + L K + Sbjct: 101 ELPLSWQWIALGSCTNY--GYSDKTDGTDLGPDTWVLELEDVEKGTSRLLQKVRFEDRPF 158 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDV 133 S+ S+F G ++YGKL PYL K I+AD G+C+T+ + V + P L+ +L S Sbjct: 159 QSSKSMFEAGDVIYGKLRPYLDKVIVADEGGVCTTEMIPVRGHFGIDPRYLRLFLKSPHF 218 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII------------------ 175 Q + G + P P+PPLAEQ I K+ Sbjct: 219 VQYASSSVHGMNLPRLGTPKAREAPFPLPPLAEQKRIVAKVDELMTLADALEAGTRAGMA 278 Query: 176 -------------------AETVRIDTLITERIRFIELLKEKKQALVSYIVTKG----LN 212 + + + I + +E AL IV G L Sbjct: 279 THETLVRELLAILVNSQDAHDLAQNWSRIETHFDTLFTTEESIDALKQNIVDLGVRGMLC 338 Query: 213 PDVKMKDSGIEW---------------------VGLVPDHWEVKPFFA---LVTELNRKN 248 + +DS + + +P W ++P + + Sbjct: 339 AQDRTEDSSNQKKLRADRDAENFDLDAFEKRAALFRLPPGWTIEPLSRVSSNIVDCPHTT 398 Query: 249 TKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKR 303 K + + + I L+ E +I G+I+++ Sbjct: 399 PKWTDDGEICVKSDQIFAGHLDLSKPNYVSEDTYIERIARLEPREGDILYKREGGIL-GI 457 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDV 362 R + + + + +L ++ S L + G + V Sbjct: 458 GARIPAETKLCLGQRLMLIRANQAVLPPFLELVINSPWLQEFAKQKTTGGAAPRVNMTVV 517 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK--ERRSSFIAAAVT 416 + PV +P I+EQ I ++ L E+ +S+ L E + ++ A+T Sbjct: 518 RAYPVPIPAIREQERILQRVDELF----QLCERASKSLADLAGLEIK---LSDAIT 566 >gi|254303654|ref|ZP_04971012.1| possible type I site-specific deoxyribonuclease specificity subunit [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148323846|gb|EDK89096.1| possible type I site-specific deoxyribonuclease specificity subunit [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 387 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 62/379 (16%), Positives = 126/379 (33%), Gaps = 31/379 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK V + ++ TG T + ++ +I +++ Y+ + + Sbjct: 7 SEWKKVKLVDVCEIITGNTPLKKEKEYWDKDEVPFITPPELKYEGINYITPNIYVSKIGA 66 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 I K I +G L K I D I + Q L K+ +LL + + Sbjct: 67 KQGRIIPKNSICVCCIGS-LGKLGILKEDAITNQQINSLILKNKNVDLLYLYFYLKTIKN 125 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +E+I T+ + I + +P L Q I +K+ ++ I R + L Sbjct: 126 NLESIASSTTVKIINKSSFEKIEISLPNLEIQKKISKKL----ELLENNIDFRKNQLNYL 181 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 KE ++L + + D K +E + ++ KN S Sbjct: 182 KELNKSLFTRMFGDIKTNDKNWKIVKLE------------KYINIIGGYAFKNIDFKSSG 229 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVD--PGEIVFRFIDLQND----KRSLRSAQ 309 I + GNI + E + ++ P +I+ + Sbjct: 230 IPLIRIGNINSGQFKSTNLVFIEENKKFEKFKVFPNDILISLTGTVGKDDYGNACILGDS 289 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVL 368 E + ++ + +M+ ++ K + G+RQ ++ +D+ L + Sbjct: 290 YSEYYLNQRNAKIELTDKMNKNFFLEIMKIKEVKKKLTGISRGIRQANISNKDIYNLSLP 349 Query: 369 VPPIKEQFDITNVINVETA 387 +PPI+ Q + Sbjct: 350 LPPIELQNKFAERVEKIEK 368 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 52/144 (36%), Gaps = 8/144 (5%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 T N+ + + +I+ I I L+ + + I + + +K Sbjct: 53 NYITPNIYVSKIGAKQGRIIPKNSICVCCIGSLGKLGILKEDAITNQQINS---LILKNK 109 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +D YL + +++ + S + + +++ + +P ++ Q I+ + + Sbjct: 110 NVDLLYLYFYLKTIK-NNLESIASSTTVKIINKSSFEKIEISLPNLEIQKKISKKLELLE 168 Query: 387 ARIDVLVEKIEQSIVLLKERRSSF 410 ID + + LKE S Sbjct: 169 NNID----FRKNQLNYLKELNKSL 188 >gi|332142753|ref|YP_004428491.1| type I site-specific deoxyribonuclease [Alteromonas macleodii str. 'Deep ecotype'] gi|327552775|gb|AEA99493.1| type I site-specific deoxyribonuclease [Alteromonas macleodii str. 'Deep ecotype'] Length = 364 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 58/386 (15%), Positives = 122/386 (31%), Gaps = 31/386 (8%) Query: 43 ESGKDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 +S + YI ++D+ + KY D + ++ G Sbjct: 3 KSVTNNRYIQIDDLRNDNLIKYTDDD---------KGTFVEPSDVIIAWDGANAGTIGYG 53 Query: 102 DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPI 161 I ST + + G L + C GAT+ H + ++ +P+ Sbjct: 54 LEGLIGSTLARLKVIIPHIDTNYLGRFLQSKFKEI-RNNCTGATIPHVSKVHLNSLLVPV 112 Query: 162 PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG 221 PPL Q I + L Q++ + + +K S Sbjct: 113 PPLPIQKQIAAVLEKADNLRQQSQQMEQELNSLA----QSVFLDMFGDYRKDAMSLKSS- 167 Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 +G V D + ++ + E +++ +K + +E Sbjct: 168 ---LGEVADVRSGVTKGQKLEGHKLTTVPY---MRVANVQDGYLDLSEIKDITVKAKDFE 221 Query: 282 TYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRS 339 YQ + G+++ D R + + I + V+ S + A+ +++ Sbjct: 222 KYQ-LKAGDVLMTEGGDFDKLGRGAIWSGQIANCIHQNHVFRVRLCDRYISEFFAYYLQT 280 Query: 340 YDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + + F + S+ +K LP+ I +Q +I+ + L E Sbjct: 281 PFVKQYFLKCAKKTTNLASINITQLKGLPIPDESIGKQQSFLRIID----ELKALKEANF 336 Query: 398 QSIVLLKERRSSFIAAAVTGQIDLRG 423 + +S + A G++DL+ Sbjct: 337 EQQEQANAHFNSLMQRAFKGELDLKD 362 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 54/156 (34%), Gaps = 9/156 (5%) Query: 255 NILSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + S++ IQ + RN L K + V+P +++ + ++ Sbjct: 1 MVKSVTNNRYIQIDDLRNDNLIKYTDDDKGTFVEPSDVIIAWDGANAGTIGYGLEGLIGS 60 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + ID+ YL ++S ++ + + L V VPP+ Sbjct: 61 TLARLKVIIPH---IDTNYLGRFLQS-KFKEIRNNCTGATIPHVSKVHLNSLLVPVPPLP 116 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 Q I V+ + D L ++ +Q L S Sbjct: 117 IQKQIAAVLE----KADNLRQQSQQMEQELNSLAQS 148 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 56/196 (28%), Gaps = 17/196 (8%) Query: 30 IKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + + +G T + Y+ + +V+ G + ++ Sbjct: 168 LGEVADVRSGVTKGQKLEGHKLTTVPYMRVANVQDGYLDLSEIKDITVKAKDFEKYQLKA 227 Query: 84 GQILYGKLGPY---LRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G +L + G + R AI + C + F V + E +L + V Q Sbjct: 228 GDVLMTEGGDFDKLGRGAIWSGQIANCIHQNHVFRVRLCDRYISEFFAYYLQTPFVKQYF 287 Query: 138 EAICEGATM-SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + T + + + +P+P + +Q I E Sbjct: 288 LKCAKKTTNLASINITQLKGLPIPDESIGKQQSFLRIIDELKAL----KEANFEQQEQAN 343 Query: 197 EKKQALVSYIVTKGLN 212 +L+ L+ Sbjct: 344 AHFNSLMQRAFKGELD 359 >gi|77543209|gb|ABA87021.1| specificity subunit [Vibrio cholerae] gi|259156528|gb|ACV96472.1| specificity subunit [Vibrio cholerae Mex1] Length = 440 Score = 99.5 bits (246), Expect = 9e-19, Method: Composition-based stats. Identities = 61/422 (14%), Positives = 132/422 (31%), Gaps = 50/422 (11%) Query: 26 KVVPIKRFT-KLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQ---SD 74 + P+++ L TG + Y+ + +++ G +L K Sbjct: 17 EWRPLEKVIHSLKTGLNPRKNFQLNTSDAQGYYVTVREIQDGKIVFLEKTDRVNDRALEL 76 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV----LPELLQGWLLS 130 + S G IL+ G R A+I + + V K + LP L L S Sbjct: 77 INGRSNLEVGDILFSGTGTVGRTAVIEAKPANWNIKEGVYTIKPIQEKILPRFLSHLLNS 136 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDT 183 ++ + G + + + +PIP LA Q I + A T Sbjct: 137 SEIVKDYGKKIVGNPVVSLPMGELKKLLVPIPCPDNPEKSLAIQAEIVRILDAFTAMTAE 196 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 L E + + K++ +++ + +EW + Sbjct: 197 LTAELTAELNMRKKQYNYYRDQLLS--------FDEGDVEW-------KTLGDISDFTYG 241 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDK 302 K + ++ + ++ N KL + S E ++ +++ K Sbjct: 242 YAAKAQESGDARFVRITDINTNGKLSPADHMYVDISEENERYLLKKDDLLMARTGATFGK 301 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFED 361 + +++ P+ ++ Y +S + G + Sbjct: 302 TMIFEEDYPAIYAGFLIKLSLDPNIVNPKYYWHFAQSDLFWDQANKLVSGGGQPQFNANA 361 Query: 362 VKRLPVLVP-------PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSF 410 +K++ + VP + EQ I +++ A L E + + I L ++ R Sbjct: 362 LKQVKLPVPYPSDTAKSLAEQARIVLILDKFDAIASSLSEGLPREIELRQKQYEYYRDLL 421 Query: 411 IA 412 ++ Sbjct: 422 LS 423 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 83/240 (34%), Gaps = 25/240 (10%) Query: 1 MKHYKAYPQYKD---SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLED 55 M+ K Y Y+D S + G + + + + G +++ + D ++ + D Sbjct: 207 MRK-KQYNYYRDQLLSFDE--GDV----EWKTLGDISDFTYGYAAKAQESGDARFVRITD 259 Query: 56 VESGTGKYLPKDGNSRQSDTST-VSIFAKGQILYGKLGPYLRKAIIADFDG-ICSTQFLV 113 + + GK P D + K +L + G K +I + D FL+ Sbjct: 260 INT-NGKLSPADHMYVDISEENERYLLKKDDLLMARTGATFGKTMIFEEDYPAIYAGFLI 318 Query: 114 L---QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP------- 163 P V P+ + S + + G + + + +P+P Sbjct: 319 KLSLDPNIVNPKYYWHFAQSDLFWDQANKLVSGGGQPQFNANALKQVKLPVPYPSDTAKS 378 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE 223 LAEQ I + +L R IEL +++ + +++ +P K E Sbjct: 379 LAEQARIVLILDKFDAIASSLSEGLPREIELRQKQYEYYRDLLLSFPASPAGGPKSHSDE 438 >gi|149920795|ref|ZP_01909258.1| Restriction modification system, type I [Plesiocystis pacifica SIR-1] gi|149818313|gb|EDM77765.1| Restriction modification system, type I [Plesiocystis pacifica SIR-1] Length = 403 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 66/365 (18%), Positives = 124/365 (33%), Gaps = 20/365 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTST 77 G + W+ V + + + YI E +++ + + Sbjct: 2 GELKSGWRRVKFGDVVRQVKDKVPAKESGLSRYIAGEHMDTNDLRLRRWGEINDDYLGPA 61 Query: 78 VSI-FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDV 133 I F GQ+LYG YLRK +A FDG+C+ V++ K LPELL + + Sbjct: 62 FHIRFRPGQVLYGSRRTYLRKVAVAGFDGVCANTTFVVESKSPGILLPELLPFIMTTEAF 121 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + +G+ + ++ + +PPL EQ I + ++ Sbjct: 122 HEHSVRESKGSVNPYVNFSDLAWYEFALPPLEEQGKISRILQRSAEL----QASYADLVQ 177 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + K ++ V + G K+ G + ++ + + + K + Sbjct: 178 VAKTTHRSFVDQTLGYGAQRPCFEKEPPSIRRGWA--YQPIEALCEALVDCLHRTPKYSK 235 Query: 254 SNILSLSYGNIIQKL-ETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + ++ ++ PES T PG+++F + +L Sbjct: 236 AGFPAIRTADVEPGFLRWETARRVPESEYLIQTTRLRPKPGDVLFSREGERMGMAALVPE 295 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLP 366 V I+ M ++P L + S + SG + DV+RL Sbjct: 296 GVS--LCISQRMMHLRPKPNFPANLLMEYLNSSWAQRQILMHKSGSTSPHINVADVRRLM 353 Query: 367 VLVPP 371 V VPP Sbjct: 354 VPVPP 358 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 57/166 (34%), Gaps = 10/166 (6%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI-IQKLETRNMGLKPESYET-- 282 G + W F +V ++ K ++ ++ L R G + Y Sbjct: 2 GELKSGWRRVKFGDVVRQVKDKVPAKESGLSRYIAGEHMDTNDLRLRRWGEINDDYLGPA 61 Query: 283 -YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRS 339 + PG++++ K ++ + + ++ P + L ++M + Sbjct: 62 FHIRFRPGQVLYGSRRTYLRKVAVAGF---DGVCANTTFVVESKSPGILLPELLPFIMTT 118 Query: 340 YDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + G + + F D+ +PP++EQ I+ ++ Sbjct: 119 EAFHEHSVRESKGSVNPYVNFSDLAWYEFALPPLEEQGKISRILQR 164 >gi|77163975|ref|YP_342500.1| restriction modification system DNA specificity subunit [Nitrosococcus oceani ATCC 19707] gi|76882289|gb|ABA56970.1| Restriction modification system DNA specificity domain [Nitrosococcus oceani ATCC 19707] Length = 434 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 56/429 (13%), Positives = 134/429 (31%), Gaps = 38/429 (8%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 +P+++D+G W+ V + +L +G + +G Y Sbjct: 17 RFPEFRDAG---------EWEKVALSTQVELLSGLHLSPDGYTDTGDIPYF-TGPSDYTN 66 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + T + ++ G L G + + + + D + ++ + Sbjct: 67 DLALVSKWTTRSANVGRAGDTLITVKGSGVGELLNLELDEVA-MGRQLMAVRARTAHGEF 125 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + I R+ A+ G + I ++ +P+P EQ I + + + I Sbjct: 126 IFHFLITQRLRLIALASGNLIPGLSRGDILSLKVPVPSHEEQQKIADCLSSLDAL----I 181 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG-LVPDHWEVKPFFALVTEL 244 + ++ LK K+ L+ + + +++ G F Sbjct: 182 AAQTEKLDALKTHKKGLMQQLFPRAGETVPRLRFPKFRDGGRWTSKKMSDVYRFLSTNTY 241 Query: 245 NRKNTKLIESNILSLSYGNIIQKLE-----------TRNMGLKPESYETYQIVDPGEIVF 293 +R + + ++ YG+I K N E + G+IVF Sbjct: 242 SRDKLNYEKGEVKNIHYGDIHTKFSTLFDVTQEYVPYINRTESLERIKDDSYCLEGDIVF 301 Query: 294 RFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYLA---WLMRSYDLCKVFYA 348 + + I++ + + + + +L +S + + Sbjct: 302 ADASEDVEDVGKSIEIVNTGNEKILSGLHTLLARQKNNDLVIGFGGYLFKSGLIREQIKR 361 Query: 349 MGSGLRQ-SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 G + + + ++ V P +EQ I + + + +D L+ + I LK Sbjct: 362 ESQGAKVLGISSGRLSKIKVCFPYEKREQQKIAHCL----SSLDALIAAQAEKIDALKTH 417 Query: 407 RSSFIAAAV 415 + + Sbjct: 418 KKGLMQQLF 426 >gi|320449896|ref|YP_004201992.1| restriction modification system DNA specificity domain-containing protein [Thermus scotoductus SA-01] gi|320150065|gb|ADW21443.1| restriction modification system DNA specificity domain protein [Thermus scotoductus SA-01] Length = 352 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 58/345 (16%), Positives = 117/345 (33%), Gaps = 33/345 (9%) Query: 105 GICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIEAICEG--ATMSHADWKGIGNIPMP 160 G+ S + V + K L + G +IP+P Sbjct: 7 GLVSPVYPVWEVKPDKAYAWFIDPLLRMPNTISAYNRFASGAVNRRRAIRKNDFLSIPIP 66 Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 +PPL EQ I + + R I L++ K++L+ ++ T G + Sbjct: 67 LPPLLEQRAIAHVL----RTVQEAKQATERVIAALRDLKKSLMRHLFTYGPVSIGEQHTV 122 Query: 221 GIEW--VGLVPDHWEVKPFF--------ALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 ++ +G +P HW V + + S + L NI + Sbjct: 123 PLQETEIGPIPAHWRVVRLGELVAKGILWMKNGFPQGKHNRTASGVPHLRPFNITDTGDI 182 Query: 271 RNMGLK---PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK--- 324 +K P ++ V PG+++F + + +I++ ++ Sbjct: 183 TLSQVKYVPPPPEDSPYRVFPGDVIFNNTNSEELVGKTAYFDRNGTFVISNHMTLIRVLS 242 Query: 325 ---PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 S YL WL + + + S+ E +K++ + +PP+ EQ I +V Sbjct: 243 GEVNPYWLSKYLHWLWSKGVFRNLCRRHVN--QASVSLERLKQVTLPLPPLPEQRAIAHV 300 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + D + E L + S + +TG+ ++ ++ Sbjct: 301 LRTV----DRRIAAEEAYARALGDLFKSLLQELMTGRRRVKVAAE 341 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 71/204 (34%), Gaps = 18/204 (8%) Query: 18 IGAIPKHWKVVPIKRFTK---------LNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKD 67 IG IP HW+VV + G+ + + + ++ ++ ++G Sbjct: 129 IGPIPAHWRVVRLGELVAKGILWMKNGFPQGKHNRTASGVPHLRPFNITDTGDITLSQVK 188 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDG--ICSTQF--LVLQPKDVLP 121 + S +F G +++ + K D +G + S + + +V P Sbjct: 189 YVPPPPEDSPYRVF-PGDVIFNNTNSEELVGKTAYFDRNGTFVISNHMTLIRVLSGEVNP 247 Query: 122 ELLQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 L +L + +C + + + + +P+PPL EQ I + R Sbjct: 248 YWLSKYLHWLWSKGVFRNLCRRHVNQASVSLERLKQVTLPLPPLPEQRAIAHVLRTVDRR 307 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 I +L K Q L++ Sbjct: 308 IAAEEAYARALGDLFKSLLQELMT 331 >gi|312870900|ref|ZP_07731005.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 3008A-a] gi|311093590|gb|EFQ51929.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 3008A-a] Length = 378 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 71/397 (17%), Positives = 132/397 (33%), Gaps = 31/397 (7%) Query: 29 PIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + + + YI E++ G Q T F K +L Sbjct: 4 KLSDICEYAKEKIKISALDENTYISTENMLPNKGGITQATSLPVQEHTQA---FMKNDVL 60 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGATM 146 + PY +K A FDG CS LV + K + ++L+ D A +G M Sbjct: 61 VSNIRPYFKKIWFATFDGGCSNDVLVFRAKKGINSRFLHYVLANDSFFNYSMATSKGTKM 120 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 D K I +P QV I + + ID I + + L+E+ Q++ + Sbjct: 121 PRGDKKAIMAYEVPKLSYKYQVKIADILEI----IDNKIELNKKINKNLEEQAQSIFANE 176 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 K + + + + ++ + ES I L + Q Sbjct: 177 FLSLDTLPEGWKQASLIDIADYLNGLAMQKY----------RPTADESGIPVLKIKELRQ 226 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 N L S ++ I+ G+++F + L + V + Sbjct: 227 GCCDDNSELCSPSIKSDYIIHDGDVIFSWSGSL-----LVDFWCGGTCGLNQHLFKVTSN 281 Query: 327 GIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 D + + +Y L K A + +K E++ + VL+P + I Sbjct: 282 IYD-KWFYYSWTNYYLQKFAAIAADMATTMGHIKREELAKSRVLIPSNSDYERIG----G 336 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 A + LV L R + + ++G++D+ Sbjct: 337 LLAPLYNLVISNRIENSKLATIRDTLLPKLMSGEVDV 373 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 51/194 (26%), Gaps = 10/194 (5%) Query: 21 IPKHWKVVPIKRFTKLNTG-----RTSESGK-DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ WK + G + + I + ++++ G + Sbjct: 183 LPEGWKQASLIDIADYLNGLAMQKYRPTADESGIPVLKIKELRQGCCD---DNSELCSPS 239 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + I G +++ G L G + + W Sbjct: 240 IKSDYIIHDGDVIFSWSGSLLVDFWCGGTCG-LNQHLFKVTSNIYDKWFYYSWTNYYLQK 298 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 A TM H + + + IP ++ I + + + E + + Sbjct: 299 FAAIAADMATTMGHIKREELAKSRVLIPSNSDYERIGGLLAPLYNLVISNRIENSKLATI 358 Query: 195 LKEKKQALVSYIVT 208 L+S V Sbjct: 359 RDTLLPKLMSGEVD 372 >gi|192289910|ref|YP_001990515.1| restriction modification system DNA specificity domain [Rhodopseudomonas palustris TIE-1] gi|192283659|gb|ACF00040.1| restriction modification system DNA specificity domain [Rhodopseudomonas palustris TIE-1] Length = 393 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 50/411 (12%), Positives = 110/411 (26%), Gaps = 34/411 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IPK VP+ F ++ G T +I ++ +D+++ + Sbjct: 3 IPK----VPLGEFVEIKGGGTPSKSNAAFWGGNIPWVSPKDMKTWEICDSEDKITAEAVR 58 Query: 75 TSTVSIFAKG-QILYGKLGPYLR--KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S ++ ++ + G I + + + Sbjct: 59 ESATNLIPPNATLIVNRSGILKHTLPVGITRRPVAINQDIKAILVSPR-AHPEYVAHIIK 117 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + T + + + +P+PPL EQ I + Sbjct: 118 AAEPIVLKWVRATTADNFPIDNLRELEIPLPPLDEQRRIAAILDKADALRRKRKRTIELI 177 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 L++ + + + + G F + Sbjct: 178 ECLMQATYRRMFVEQASNSWPKCTVASLARDIRTGPFGSQLLHSEFVDEGIAVLG----- 232 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + N + E R++ + V PG+++ + + + Sbjct: 233 -----IDNVATNEFRWGERRHIPEEKYEKLRRYTVFPGDVLITIMGTCGRCAIVPENIPL 287 Query: 312 ERGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 + + +L + ++ D+ G L +K L + + Sbjct: 288 AINTKHLCCITLDEEKCLPEFLQSTFLQHPDVLLQLGVQAKGAVMPGLNMGIIKSLQISL 347 Query: 370 PPIKEQFDITNVINVETARIDVLVEKI--EQSIVLLKERRSSFIAAAVTGQ 418 PP++ Q D I+ + L+ E LL SS A +GQ Sbjct: 348 PPVQLQRDFVMRISKLRS---TLISSRHWEAEGELL---FSSLQHRAFSGQ 392 >gi|83776730|gb|ABC46688.1| Sau1hsdS1 [Staphylococcus aureus] Length = 419 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 54/407 (13%), Positives = 114/407 (28%), Gaps = 31/407 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESG---TGKYLPKDGNSRQSDTST 77 W+ + + G G + +DV + L N + Sbjct: 20 EWEEKKVGELLEFKNGLNKGKEYFGSGSSIVNFKDVFNNRSINTNNLTGKVNVNSKELKN 79 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S KG + + + + + + + S L +PK + + + + Sbjct: 80 YS-VEKGDVFFTRTSEVIGEIGYPSVILNDPENTVFSGFVLRGRPKSGIDLINNNFKRYV 138 Query: 132 DVTQRIEAIC---EGATMSHADWKGIGNIPMPIPPL--AEQVLIREKIIAETVRIDTLIT 186 T T N I P+ EQ I + +I+ Sbjct: 139 FFTNSFRKEMITKSSMTTRALTSGXAINKMKVIYPVSAKEQKKIGDFFSKLDRQIELEEQ 198 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + K Q + S + + + + ++ + Sbjct: 199 KLELLQQQKKGYMQKIFSQELRFKDENGNDYPNWRTIELKNILENIVDNRGKTPDNAPSE 258 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K L + + I + + +I+F + + Sbjct: 259 KYPLLEVNALGYYRPAYIKVSKFVSENTYNN---WFREHLKENDILFSTVGNT----GIV 311 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL--CKVFYAMGSGLRQSLKFEDVKR 364 S + +I + ++ + + + M SY K+ ++ S+K K Sbjct: 312 SLMDNYKAVIAQNIVGLRVNNNNLPSFIYYMLSYKGNQKKIKRIQMGAVQPSVKVSQFKF 371 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + LVP EQ + ID LV K I LL++R+ + + Sbjct: 372 IKYLVPIKDEQEKVA----KLLIEIDKLVNKQLIKIELLQQRKKALL 414 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 67/188 (35%), Gaps = 8/188 (4%) Query: 24 HWKVVPIKRFTKL---NTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +W+ + +K + N G+T ++ + + + + Y+ ++ + Sbjct: 231 NWRTIELKNILENIVDNRGKTPDNAPSEKYPLLEVNALGYYRPAYIKVSKFVSENTYNNW 290 Query: 79 SI--FAKGQILYGKLGPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDVTQ 135 + IL+ +G +++ ++ + + + + + LP + L + Sbjct: 291 FREHLKENDILFSTVGNTGIVSLMDNYKAVIAQNIVGLRVNNNNLPSFIYYMLSYKGNQK 350 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +I+ I GA I +P EQ + + +I ++ + + + Sbjct: 351 KIKRIQMGAVQPSVKVSQFKFIKYLVPIKDEQEKVAKLLIEIDKLVNKQLIKIELLQQRK 410 Query: 196 KEKKQALV 203 K +++ Sbjct: 411 KALLKSMF 418 >gi|229819003|ref|YP_002880529.1| restriction modification system DNA specificity domain protein [Beutenbergia cavernae DSM 12333] gi|229564916|gb|ACQ78767.1| restriction modification system DNA specificity domain protein [Beutenbergia cavernae DSM 12333] Length = 408 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 54/410 (13%), Positives = 118/410 (28%), Gaps = 39/410 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P + + + G T + + +ED+ + + Sbjct: 13 PDGVEYIELAELFSTRNGYTPPKSDASAWADGTVPWFRMEDIREKGRVLDDSIQHIATTA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLS 130 +F I+ A+I+ S + + + Sbjct: 73 VKGGRLFPANSIIVATSATIGEHALISVPHLSNQRFTSLALKPKYQDRFEIKFIFYYCFV 132 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +D + + ++ + D G+ P PPL Q I + A T L E Sbjct: 133 LD--EWCKNNTTVSSFASVDMVGLKKFKFPAPPLEVQRDIVRILDAFTELEAELEAELEA 190 Query: 191 FIELLKEKKQALVS----YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + AL+S V+ ++ SG G V + Sbjct: 191 RKQQYAHYRDALLSFGGSEAVSWATLSELCTIQSG----GTPKSDNAVYYGGDIPW---- 242 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 I ++ + + R + + + + ++ D G ++ + Sbjct: 243 -------CAISDITSASKYIRRTQRTITPEGLANSSAKVFDAGTLLLSIYASLGEVTITS 295 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + I+ + GI YL + M S ++ G + +L V Sbjct: 296 IPMATNQAIL--GLVPRDGSGILVDYLYYTMLSSK-DRLLAQRQVGSQNNLNKAIVADFR 352 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 V VP + +Q I +++ A ++ L + I ++ R ++ Sbjct: 353 VPVPAMPDQERIVALLDKFDALVNDLSSGLPAEIEARRQQYAHYRDRLLS 402 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 21/187 (11%), Positives = 59/187 (31%), Gaps = 12/187 (6%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKL---ETRNMGLKPESYETYQIVDPGEIVFRF 295 T + + + +I +K + + + + ++ I+ Sbjct: 29 NGYTPPKSDASAWADGTVPWFRMEDIREKGRVLDDSIQHIATTAVKGGRLFPANSIIVAT 88 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-- 353 + + + + + A KP D + ++ + + + + Sbjct: 89 SATIGEHALISVPHLSNQRFTSLAL---KPKYQDRFEIKFIFYYCFVLDEWCKNNTTVSS 145 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI-- 411 S+ +K+ PP++ Q DI +++ T L ++E R + + Sbjct: 146 FASVDMVGLKKFKFPAPPLEVQRDIVRILDAFTELEAELEAELEARKQQYAHYRDALLSF 205 Query: 412 --AAAVT 416 + AV+ Sbjct: 206 GGSEAVS 212 >gi|86742691|ref|YP_483091.1| restriction modification system DNA specificity subunit [Frankia sp. CcI3] gi|86569553|gb|ABD13362.1| restriction modification system DNA specificity domain [Frankia sp. CcI3] Length = 436 Score = 99.5 bits (246), Expect = 1e-18, Method: Composition-based stats. Identities = 59/435 (13%), Positives = 138/435 (31%), Gaps = 47/435 (10%) Query: 15 VQWIGAIPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGN 69 ++ G I ++ + ++ +G T + KD Y+ + +V+ G Sbjct: 10 IESFGEIFPG-RISTVGTEFEIQSGITLSPRRTSGRKDAPYLRVANVQRGRLTLSDVAWL 68 Query: 70 SRQSDTSTVSIFAKGQILY----GKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPEL 123 + G +L R A + C L+P+++ Sbjct: 69 EASARERIRYALDDGDLLVVEGHANPAEIGRCAQVGPESKNCLYQNHLFRLRPRNLEARF 128 Query: 124 LQGWLLSIDVTQRIEAIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 WL S C + + + + +G +P+P+PP +Q I E + A I Sbjct: 129 ALHWLNSSFSQSYWGRNCATSSGLYTINSRQLGALPIPVPPPDKQRKISEILDAADEAIR 188 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 + + ++ + L+ V + G +PD W + L Sbjct: 189 STERLVGKLEQVFDSLRGDLLQEHVIRS---------------GRLPDCWRMDRLDRLSE 233 Query: 243 ELNR---------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + + ++ I + + + ++ ++ Y ++ G+++ Sbjct: 234 ITGGVTLGGVTSAGRSVELPYLRVANVQDGYIDTTDIKTVTVRTSEFDRY-LLQAGDVLM 292 Query: 294 R-FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFY--A 348 D R ++ + + V+ I YL+ S F + Sbjct: 293 TEGGDFDKLGRGAVWDGSIDPCLHQNHIFRVRCDKIRLLPEYLSTYSASTAGRSYFMGIS 352 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + S+ + LPV +PP+ Q I + + + + + L+ + Sbjct: 353 KQTTNLASINKSQLSALPVPLPPLATQKMIIGSLGAA----ERQISSTKAELAKLRLVKQ 408 Query: 409 SFIAAAVTGQIDLRG 423 + + G++ + G Sbjct: 409 GLMDDLLMGRVQVSG 423 >gi|317180553|dbj|BAJ58339.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F32] Length = 411 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 57/393 (14%), Positives = 121/393 (30%), Gaps = 25/393 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 PK + + ++ R K I +G Y+ Sbjct: 13 PKGVEFRKLGEVCEILDNRRIPIAKNKRNPGIYPYYGANGIQDYIDSYIFDGDFV----- 67 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + + + K A + VLQ K+ L + + + Sbjct: 68 LVGEDGSVINKDNT--PIVNWASGKIWVNNHAHVLQTKNELKLKFLYF----YLQTIDVS 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 C T + + + I +PIPPL Q I + A T L TE + + + Sbjct: 122 YCVAGTPPKINQENLKKITIPIPPLEIQQEIVNILDAFTELNTELNTELKARKKQYQYYQ 181 Query: 200 QALV------SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L+ ++ K L P E + + N+K K+ E Sbjct: 182 NMLLDFKDTNQNHQDAKMSAKPYPKRLKTLLQTLAPKGVEFRKLGEVCESTNKKTLKISE 241 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + G + + GE + + + Sbjct: 242 VSEVKNKRMYPVINSGRDLYGYYHDFN------NDGENITIASRGEYAGFINYFNEKFFA 295 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 G + Y + + + +L + +++ ++ + + G +L D++ L + +PP++ Sbjct: 296 GGLCYPYKVKDTNELLTKFLYFYLKTNEIQIMENLVSRGSIPALNKADIETLTIPIPPLE 355 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 Q +I +++ A L+ I I K++ Sbjct: 356 IQQEIVKILDQFLALTTDLLAGIPAEIEARKKQ 388 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 53/132 (40%), Gaps = 13/132 (9%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDK--RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 Y I D ++ +K + + + + A++ + + +L + Sbjct: 55 DYIDSYIFDGDFVLVGEDGSVINKDNTPIVNWASGKIWVNNHAHVLQTKNELKLKFLYFY 114 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +++ D+ +G + E++K++ + +PP++ Q +I N+++ T E Sbjct: 115 LQTIDV----SYCVAGTPPKINQENLKKITIPIPPLEIQQEIVNILDAFT-------ELN 163 Query: 397 EQSIVLLKERRS 408 + LK R+ Sbjct: 164 TELNTELKARKK 175 >gi|291458787|ref|ZP_06598177.1| type I restriction-modification system specificity subunit [Oribacterium sp. oral taxon 078 str. F0262] gi|291418704|gb|EFE92423.1| type I restriction-modification system specificity subunit [Oribacterium sp. oral taxon 078 str. F0262] Length = 385 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 55/390 (14%), Positives = 124/390 (31%), Gaps = 19/390 (4%) Query: 32 RFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-FAKGQILYGK 90 ++ + YI V+ + ++ S ++ KG I++ K Sbjct: 8 ECVEIVGSACKQYDGVKNYISTGAVDVDYIVSDEIERFEFENRPSRANLEVNKGDIIFAK 67 Query: 91 LGPYLRKAIIAD--FDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAICEGATM 146 + + +I D I ST F ++PKD + L + S + + C GAT Sbjct: 68 MQGTKKTLLIDDALSQNIYSTGFCAVRPKDDVLTDRCLYHLVTSEMFLSQKDKNCSGATQ 127 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 G+ I + +P Q I +++ V I + I+ EL + + Sbjct: 128 KAITNAGLEKIFIRVPDYHLQERIADELDKLAVIISKRKNQLIKLDEL-------INARF 180 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 V +P K + + + +NT + + I+ Sbjct: 181 VEMFGDPVNNEKKWSTKALEDA--CRSIVDCPHSTPNYTSENTGFMCIRTSIVKKNRIMW 238 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + + G++V+ ++ S ++ Sbjct: 239 DDIEFIPEEEYKQRTQRKKPEKGDVVYTREGAILGIAAIIDRDCNVALGQRSMLVSPDDK 298 Query: 327 GIDSTYLAWLMRSYDL-CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 S +L+ M M + D+K +++PP+K+Q + + + Sbjct: 299 ICTSEFLSVAMNFDSFLNNALKGMSGSASPHINVGDIKTFKMIMPPVKQQEEFSTFV--- 355 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +I+ + Q++ + +S + Sbjct: 356 -KQIEKSKNIVGQALEETQVLFNSLMQEYF 384 >gi|168490326|ref|ZP_02714525.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP195] gi|183571333|gb|EDT91861.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP195] gi|332073221|gb|EGI83700.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17570] Length = 347 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 54/362 (14%), Positives = 95/362 (26%), Gaps = 37/362 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + G + +D G E + + N I G Sbjct: 2 KKVKLGEVATFINGYAFKP-QDWSSEGREIIRIQNLTKTSTEINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI +P L EQ I ++ + I + L+ Sbjct: 120 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNLLV---------- 169 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 K E G V + + L +N K + + I Sbjct: 170 ------------KSRFNEMFGDVILNEKEWKVSKWNEILTIRNGKNQKQVEDADGKFPIY 217 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 Y IV ++ N +R + Sbjct: 218 GSGGI-------MGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEPVL 267 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I+S YL + + Y+ K+ A+ SL D+ + + +PP+ Q + + + Sbjct: 268 EKINSEYLFYFCQLYNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFVVQV 324 Query: 386 TA 387 Sbjct: 325 DK 326 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 41/142 (28%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + + + IV+ G+I+ + ++ V I Sbjct: 39 TSTEINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 185 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 232 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 233 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 289 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 290 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 320 >gi|227517374|ref|ZP_03947423.1| type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecalis TX0104] gi|227075173|gb|EEI13136.1| type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecalis TX0104] Length = 366 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 69/389 (17%), Positives = 132/389 (33%), Gaps = 33/389 (8%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +K + +GR D ++G ++ GTG Y+ + D I Sbjct: 1 LKEIVDVRSGR------DYKHLGSGNIPVYGTGGYMLSVSEALSYDEDA--------IGI 46 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G+ G I+ T F + + + ++ E + Sbjct: 47 GRKGTINNPYILKAPFWTVDTLFYAIPKNNFDLNFIYSIFR----KINWKSKDESTGVPS 102 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I + + IP +EQ I E +ID IT R ++ LKE K+A + + Sbjct: 103 LSKTTINAVTVYIPSGSEQQRIGEF----FKQIDNTITLHQRKLDQLKELKKAYLQLMFA 158 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 + K+ + ++ V E N K+ K E+ S YG I Q+ Sbjct: 159 STNTKNDKLPKLRFTGFKGYWELCKLSDISDKVKEKN-KHGKFTETLTNSAEYGIINQRF 217 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + ++Y +V + V+ I ++ ++ G+++ Y + H Sbjct: 218 FFDKDISNANNLDSYYVVQNDDFVYNPRISNFAPVGPIKRNKLGRTGVMSPLYYVFRTHS 277 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 ID+ YL + G R ++K +P+ P +EQ I Sbjct: 278 IDNNYLEKYFDTVYWHHFMELNGDTGARADRFAIKDSIFVEMPIPYPSTEEQKKIGIF-- 335 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++D + + + LK + S++ Sbjct: 336 --FKKLDQSITLYKNKLNQLKTLKKSYLQ 362 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/188 (9%), Positives = 55/188 (29%), Gaps = 6/188 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W++ + + + + ++ S ++ + + Sbjct: 179 WELCKLSDISDKVKEKNKHGKFTETLTNSAEYGIINQRFFFDKDISNANNLDSYYVVQND 238 Query: 85 QILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +Y + G+ S + V + + L+ + ++ +E Sbjct: 239 DFVYNPRISNFAPVGPIKRNKLGRTGVMSPLYYVFRTHSIDNNYLEKYFDTVYWHHFMEL 298 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + +I + +P ++KI ++D IT + LK K Sbjct: 299 NGDTGARADRFAIK-DSIFVEMPIPYPSTEEQKKIGIFFKKLDQSITLYKNKLNQLKTLK 357 Query: 200 QALVSYIV 207 ++ + + Sbjct: 358 KSYLQNMF 365 >gi|307259762|ref|ZP_07541482.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306866152|gb|EFM98020.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 489 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 45/434 (10%), Positives = 111/434 (25%), Gaps = 72/434 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W V ++ L GR +I ++ + L + Sbjct: 70 EIPESWVWVRLEDIFHLQAGR---------FISASEIYGEYKESLYPCYGGNGLRGFVKT 120 Query: 80 IFAKGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +G+ + G+ G A+ + +V++ L + + + Sbjct: 121 YNREGKFPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYF---LIQLNLN 177 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I ++ +P+PPL EQ I KI I+ + + L ++ Sbjct: 178 QYATATAQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQF 237 Query: 199 K----QALVSYIVTKGLNPDVKM------------------------------------- 217 ++++ + L Sbjct: 238 PEQLKKSILQAAIQGKLTEQNPNDEPASALIERIKAEKLRLIAEKKLKKPKVISEIIMRD 297 Query: 218 -----------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE---SNILSLSYGN 263 + E +P+ W + + + L GN Sbjct: 298 NLPYEIVNGKERCIADEVPFEIPESWVWVRLGEIGETNIGLTYNPSDVASDGTIVLRSGN 357 Query: 264 IIQKLET--RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I ++ + +++ + + + + Sbjct: 358 IQDGKIDVSSDIVKVNLDIPENKRCYKNDLLICARNGSKKLVGKAAIIDKDGYSFGAFMA 417 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + + Y+ + + S F + + + ++ + +P + EQ I Sbjct: 418 IFRSPF--NKYIYYYLSSPLFRNDFDGVNTTTINQITQSNLNNRLIPLPSLNEQLRIVEK 475 Query: 382 INVETARIDVLVEK 395 I + + L +K Sbjct: 476 IETLFSTLQNLSQK 489 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 60/204 (29%), Gaps = 18/204 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + ++ +P+ W + + E +K + Sbjct: 63 TEQDFPFEIPESWVWVRLEDIFHLQAGRFISASEIYGEYKESLYPCYGGNGLRGFVKTYN 122 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 E F I Q + + A + D+ + + + Sbjct: 123 REGK---------FPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYFLIQ 173 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +L + + + L + + + +PP+ EQ I I I+ + E+ Sbjct: 174 LNLNQY---ATATAQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQY-AEKEEK 229 Query: 400 IVLL-----KERRSSFIAAAVTGQ 418 + L ++ + S + AA+ G+ Sbjct: 230 LTALHQQFPEQLKKSILQAAIQGK 253 >gi|327470620|gb|EGF16076.1| restriction modification system DNA specificity subunit [Streptococcus sanguinis SK330] Length = 405 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 50/420 (11%), Positives = 124/420 (29%), Gaps = 44/420 (10%) Query: 23 KHWKVVPIKRFTKLN-TGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK I ++ +G T + + + ++ + ++ K + Sbjct: 4 SDWKEYKIADLVEIIFSGGTPNTKVNEYWNGSLPWLSSGETKNRYINSTEKTITESGAQN 63 Query: 76 STVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSID 132 S+ + G ++ G + + D + + + + VL + + LS Sbjct: 64 SSTRLALSGDVVMASAGQGYTRGQVSFLNIDTFINQSVIAIRANEKVLDKKFLFYNLSSR 123 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + K + ++ + IP L Q I + + +I+T Sbjct: 124 YEELRAISDSNSIRGSITTKMVKSMNIRIPDLNTQKAIANTLSSIDDKIETSKQINHHLE 183 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 ++ + ++ G G +P W+ KP K Sbjct: 184 QMAQAIFKSWFVDFEPFG---------------GEMPSKWQTKPADCFFDISIGKTPPRK 228 Query: 253 ESN--------ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 E+ + +S ++ ++ N + + E + + I L Sbjct: 229 ENWCFSEDSKDVPWISISDMGKEGLFINKTSEYLTREAIDKFNVKVVPQNTILLSFKLTI 288 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 R A + A K + + + S + ++ + +K Sbjct: 289 GRIAITNCKMSTNEAIAHFKLTNKHALEWLYCFLNNINYAEL-GNTSSIATAINSKIIKS 347 Query: 365 LPVLVP---PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + +P + + I A I + I L+ R+ + ++G+I + Sbjct: 348 MLITMPDSSSLSKFHKIA-------APIFEEIRNNHGEIESLQNLRNILLPKLLSGEIPV 400 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 58/197 (29%), Gaps = 14/197 (7%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESG---------KDIIYIGLEDV--ESGTGKYLPKD 67 G +P W+ P F ++ G+T KD+ +I + D+ E + Sbjct: 202 GEMPSKWQTKPADCFFDISIGKTPPRKENWCFSEDSKDVPWISISDMGKEGLFINKTSEY 261 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 D V + + IL R AI ST + K L+ Sbjct: 262 LTREAIDKFNVKVVPQNTILLSFKLTIGRIAITNCKM---STNEAIAHFKLTNKHALEWL 318 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 ++ E + + + K I ++ + +P + + I E Sbjct: 319 YCFLNNINYAELGNTSSIATAINSKIIKSMLITMPDSSSLSKFHKIAAPIFEEIRNNHGE 378 Query: 188 RIRFIELLKEKKQALVS 204 L L+S Sbjct: 379 IESLQNLRNILLPKLLS 395 >gi|164551500|gb|ABY60968.1| Sau1hsdM1 [Staphylococcus aureus] gi|298693766|gb|ADI96988.1| Type I restriction-modification system, specificity subunit S [Staphylococcus aureus subsp. aureus ED133] Length = 397 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 59/402 (14%), Positives = 115/402 (28%), Gaps = 39/402 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + + K+ +I D+ S L DGN V Sbjct: 20 EWEEKKLGDLGLFQKSYSFSRAKEGNGKTKHIHYGDIHSKFKTVLDSDGNIPNIIEKAVF 79 Query: 80 -IFAKGQILYGK----LGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 + KG I++ + +I + + + L + ++ Sbjct: 80 ELIQKGDIVFADASEDYSDLGKAVMIDFKPNSLISGLHTHLFRPLNNAISNFLIFYTKTL 139 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + I G ++ K + N+ + IP + K ++D I + Sbjct: 140 SYKKFIRQQGTGISVLGISKKSLLNLNVLIPRSELEQQKVGK---FFSKLDRQIELEEQK 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 IELL+++K+ + I ++ L + G WE + K Sbjct: 197 IELLQQQKKGYIQKIFSQEL--------RFKDENGDDYPEWEETTIKEIAQINTGKKDTK 248 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + I P Y+ GE + D + Sbjct: 249 -----------DAITNGSYDFYVRSPIVYKINTFSYEGEAILTVGDGVGVGKVF-HYVNG 296 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + Y L + L + S++ + V + V P Sbjct: 297 KFDYHQRVYKISDFKNYYGLLLFYYFSQNFLKETKKYSAKTSVDSVRKDMVANMKVPRPI 356 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I I ++D ++ +Q I LLK+R+ + + Sbjct: 357 YIEQEKIGQFI----KKVDNKIKIQKQVIELLKQRKKALLQK 394 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 73/217 (33%), Gaps = 10/217 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+++ EW + + N K + + N Sbjct: 10 PELRFPGFEGEWEEKKLGDLGLFQKSYSFSRAKEGNGKTKHIHYGDIHSKFKTVLDSDGN 69 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--- 329 + E ++++ G+IVF + + S ++ Sbjct: 70 IPNIIEK-AVFELIQKGDIVFADASEDYSDLGKAVMIDFKPNSLISGLHTHLFRPLNNAI 128 Query: 330 STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETA 387 S +L + ++ K G+G+ + + + L VL+P EQ + + Sbjct: 129 SNFLIFYTKTLSYKKFIRQQGTGISVLGISKKSLLNLNVLIPRSELEQQKVGKF----FS 184 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 ++D +E EQ I LL++++ +I + ++ + E Sbjct: 185 KLDRQIELEEQKIELLQQQKKGYIQKIFSQELRFKDE 221 >gi|291277043|ref|YP_003516815.1| type I restriction-modification methylase [Helicobacter mustelae 12198] gi|290964237|emb|CBG40086.1| type I restriction-modification methylase [Helicobacter mustelae 12198] Length = 401 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 51/395 (12%), Positives = 115/395 (29%), Gaps = 19/395 (4%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + + G+ S G+Y +G S Sbjct: 13 PHGVEFRKLGEVINICKGKQLNKE----------FLSNYGEYPVMNGGIYASGYWNTYNT 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +I+ + G V++ + + + Sbjct: 63 NSPKIIISQGGASAGYVNYMTSKFWAGAHCYVIESDSKKVNYKFLFYFLKNKESFLIKSQ 122 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GA + + I +P+P+PPL Q I + + T L TE + + + Sbjct: 123 FGAGIPALNKADIETLPIPLPPLEVQREIVKILDTFTELNTELNTELKLRKKQYEYYRNW 182 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+S+ + + + + E + ++S Sbjct: 183 LLSFSDVDASKEGAEQRLRDKSY--PKALKALLLSLCPHGVEFRKLGEVGEFQKGATISK 240 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 N + G + +Y GE + I S + + S + Sbjct: 241 KNAVPGEVPVIAGGRQPAYYHNHANRIGETI--AISSSGAYAGYVSYWNIPVFLSDSFSI 298 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + K + YL + ++ ++ +G + +++ LP+ +PP++ Q +I + Sbjct: 299 SPKKENLIPKYLFYWLQVKQ-DAIYATKSTGGIPHVYSKNLDNLPIPLPPLEVQREIVKI 357 Query: 382 INVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ + + L I I K+ R + Sbjct: 358 LDDFSTLTEDLSSGIPAEIAARKKQYEYYRDKLLT 392 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 53/185 (28%), Gaps = 11/185 (5%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 P E + ++ K + + N G+ Y + Sbjct: 12 CPHGVEFRKLGEVINICKGKQLNKEFLS--------NYGEYPVMNGGIYASGYWNTYNTN 63 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 +I+ + + Y+ + + + Sbjct: 64 SPKIIISQGGAS---AGYVNYMTSKFWAGAHCYVIESDSKKVNYKFLFYFLKNKESFLIK 120 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + +L D++ LP+ +PP++ Q +I +++ T L +++ + R Sbjct: 121 SQFGAGIPALNKADIETLPIPLPPLEVQREIVKILDTFTELNTELNTELKLRKKQYEYYR 180 Query: 408 SSFIA 412 + ++ Sbjct: 181 NWLLS 185 >gi|28867249|ref|NP_789868.1| type I restriction-modification system, S subunit, EcoA family [Pseudomonas syringae pv. tomato str. DC3000] gi|28850483|gb|AAO53563.1| type I restriction-modification system, S subunit, EcoA family [Pseudomonas syringae pv. tomato str. DC3000] Length = 435 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 52/366 (14%), Positives = 122/366 (33%), Gaps = 31/366 (8%) Query: 84 GQ-ILYGKLGPY-----LRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQ 135 G I+ ++ + + + +P + L WL V + Sbjct: 66 GDRIIISRMNTPALVGESGYVTKDEPNLFLPDRLWQTEPSDRPHSQRWLSYWLQHPGVRR 125 Query: 136 RIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I A G +M + + + ++P+P PL EQ I + A ++D + + Sbjct: 126 LIAASATGTSNSMKNISKETVLSLPVPRTPLPEQQKIAAILTAVDDKLDVIFRQIKATQA 185 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIE--WVGLVPDHWEVKPFFALVTELNRKNT-- 249 L + QAL S V ++ + + +G +P W+V V+ L + Sbjct: 186 LKQGLMQALFSRGVGTQDTTGRWIQHTEFKDSELGTIPALWDVGVIADYVSALRSGVSVN 245 Query: 250 ----KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP---GEIVFRFIDLQNDK 302 + I L +++ N E ++ +P G I+ + Sbjct: 246 AEDRMHGDDEIGVLKVSCVLRGGFYPNCHKTVVPEERERVAEPVLQGRIIVSRANTPALV 305 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDST---YLAWLMRSYDLCKVFYAMGSGL---RQS 356 + + +L++ ++S + + +G ++ Sbjct: 306 GESAYVNSAWPNLFLPDKLWQIEPSESPHSIKWLSFYLQSPFVRQEISKAATGTSGSMKN 365 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + + + P+ EQ I +++ T++I+ L K + + + +T Sbjct: 366 ISKPAFLSIRMPLVPLAEQEHIAAILSDVTSKIEALNSKQN----HFQTLKRGLMQKLLT 421 Query: 417 GQIDLR 422 G+ ++ Sbjct: 422 GEWRVK 427 >gi|317014259|gb|ADU81695.1| type I restriction-modification methylase [Helicobacter pylori Gambia94/24] Length = 400 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 52/397 (13%), Positives = 108/397 (27%), Gaps = 23/397 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + G+ + Y + G Y N +D Sbjct: 13 PKGVGFRKLGEVINILKGKQLNKELLLDYGKYPVMNGGI--YASGYWNEYNTDYPK---- 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I+ + G ++ + + + Sbjct: 67 ----IIISQGGASAGYVNYMTSKFWAGAHCYTIELNSEKLNYKFLYYFLKNSQTILMKSQ 122 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GA + + I + +PIPPL Q I + A T L TE + + + Sbjct: 123 FGAGIPALNKADIETLTIPIPPLEIQQEIVTILDAFTELNTELNTELNARKKQYEYYQNM 182 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+ N + E + P +K + KL E Sbjct: 183 LLD------FNDINQSHKDAKEKLVQKPYPKRLKQLLHTLAPKGVGFRKLGEVCDFQKGK 236 Query: 262 GNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + Y I S + + S Sbjct: 237 SITKKAVTFGKVPVISGGRQPAYYHNEANRSGETIAISSSGVYAGYVSYWDIPVFLADSF 296 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ++ K + YL + + + + +G + +D++ + +PP++ Q +I Sbjct: 297 SVSPKQKTLMPKYLFYYLTTQQ-DAIHATKSTGGIPHVYSKDLQNFLIPIPPLEIQQEIV 355 Query: 380 NVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +++ +A L I I K+ R + Sbjct: 356 KILDQFSALTTDLQAGIPAEIKARKKQYEYYREKLLT 392 >gi|291556522|emb|CBL33639.1| Restriction endonuclease S subunits [Eubacterium siraeum V10Sc8a] Length = 380 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 64/394 (16%), Positives = 128/394 (32%), Gaps = 28/394 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + + + +GLE + D SD + +F KG +L Sbjct: 4 VKLGEVAIEHKETCKGNKDGYPIVGLEHLVPEEVTLTAWD---EGSDNTFTKMFRKGNVL 60 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVL--QPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +G+ YL+KA +A FDGICS V+ P +LP LL + + ++ G+ Sbjct: 61 FGRRRAYLKKAAVAPFDGICSGDITVIEAIPDRILPMLLPFIIQNDELFDFAVGKSAGSL 120 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 W+ + N +P + +Q + E + A + EL + S Sbjct: 121 SPRVKWEHLKNYEFELPDMDKQRELAELLWAMDATKKSYQKLIAATDEL-------VKSQ 173 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + + +P K + +G K ++ +N + +++ Sbjct: 174 FMEQFGDPKNNQKGLPVLSIGQFGKAKGGKRLPK------GESYADCATNYPYVRVIDMV 227 Query: 266 QKLETR----NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + ++ ++ + +A + Sbjct: 228 NHSVNIPALVYLTQSTHEKIAKYTISSKDVYISIAGTIGQVGAVPDSIDGANLTENAAKI 287 Query: 322 AVKPH-GIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +D YL W + + + L ++ + VLVPPI+EQ Sbjct: 288 VLDKDSPVDRDYLIWYLSLPAGAEQIEEKTMHTTQPKLALYRIEEIEVLVPPIEEQRSFA 347 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + D ++EQ++ L I+ Sbjct: 348 AFI----RQSDKSKFELEQTLSELTATYKRIISE 377 >gi|327403691|ref|YP_004344529.1| restriction modification system DNA specificity domain-containing protein [Fluviicola taffensis DSM 16823] gi|327319199|gb|AEA43691.1| restriction modification system DNA specificity domain protein [Fluviicola taffensis DSM 16823] Length = 411 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 57/414 (13%), Positives = 121/414 (29%), Gaps = 28/414 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IPK WK V I + G + + + + ++ +G + + + Sbjct: 8 IPKGWKKVKIPKVLFFQEGPGVRNWQFTESGVKLLNVGNINNGKVDLNSTSIHLSDEEAN 67 Query: 77 TVS---IFAKGQILYGKLGPYLRKAI---------IADFDGICSTQFLVLQPKDVLPELL 124 + +G +L G + ST + Sbjct: 68 GKYSHFLVDEGDLLIACSGIVVSNFHNKIAIAEKSHLPLCLNTSTMRFKSIESKIDLNYF 127 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + +L ++ T +++ + G+ + I I + +PPL Q I + + Sbjct: 128 KYYLQTVYFTAQLQKLITGSAQLNFGPSHIKKIDILLPPLETQKRIAQILDDGQALKQK- 186 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 ++ Q++ + + K +E + + + Sbjct: 187 ---TELLLKEYDALAQSIFMDMFGDPVRNPNTWKKVKLEKLC----GVGSSKRVFVEDLV 239 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + + SL G I E + G+++ I Sbjct: 240 ESGVPFYRGTEVGSLGAGLEINPKLFITKKHYEELKTHTGVPKVGDLLLPSICPDGRIFR 299 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + S ++ V I+S YL L++S LK +K+ Sbjct: 300 VISENPFYFKDGRVLWIKVNQEKINSVYLKTLLKSIFYSNYSNIASGSTFAELKIFALKK 359 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +L+P IK Q I ID E +Q + ++ + + A G+ Sbjct: 360 IDLLLPDIKLQNLFAEKIE----LIDKQKELAKQELKESEDLFNCLLQKAFKGE 409 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 74/196 (37%), Gaps = 16/196 (8%) Query: 225 VGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 + +P W+ ++ +N + ES + L+ GNI N S E Sbjct: 5 LDFIPKGWKKVKIPKVLFFQEGPGVRNWQFTESGVKLLNVGNINNGKVDLNSTSIHLSDE 64 Query: 282 ------TYQIVDPGEIVFRFIDLQN----DKRSLRSAQVMERGIITSAY-MAVKPHGIDS 330 ++ +VD G+++ + +K ++ + + TS ID Sbjct: 65 EANGKYSHFLVDEGDLLIACSGIVVSNFHNKIAIAEKSHLPLCLNTSTMRFKSIESKIDL 124 Query: 331 TYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 Y + +++ + +G + + +K++ +L+PP++ Q I +++ A + Sbjct: 125 NYFKYYLQTVYFTAQLQKLITGSAQLNFGPSHIKKIDILLPPLETQKRIAQILDDGQA-L 183 Query: 390 DVLVEKIEQSIVLLKE 405 E + + L + Sbjct: 184 KQKTELLLKEYDALAQ 199 >gi|253576201|ref|ZP_04853532.1| type I restriction-modification system specificity determinant protein [Paenibacillus sp. oral taxon 786 str. D14] gi|251844328|gb|EES72345.1| type I restriction-modification system specificity determinant protein [Paenibacillus sp. oral taxon 786 str. D14] Length = 420 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 61/414 (14%), Positives = 128/414 (30%), Gaps = 35/414 (8%) Query: 35 KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY 94 +N + G I + D+E + K N ++ + S F G L ++ P Sbjct: 2 DINPFYSIRKGTLAKKISMADLE----PFTRKITNYEVAEFNGGSKFKNGDTLVARITPC 57 Query: 95 LRK-------AIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAICEGAT 145 L + D G ST+F+VL+ K+ + + + +S + G + Sbjct: 58 LENGKTAYVNILEKDEIGFGSTEFIVLRGKEGISDNKYVYYLSISPEFRNVAIKSMTGTS 117 Query: 146 -MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 A I +PP+ EQ I + + +I+ I E QAL Sbjct: 118 GRQRAQVDAISKWQFRLPPIKEQKEISALLSSLDDKIELNIAINKNLE----EMAQALFK 173 Query: 205 YIVTKGLNPDV---KMKDSGIE----WVGLVPDHWEVKPFFALVTELNRKNTK-----LI 252 P+ K SG E +GL+P W+V + + K Sbjct: 174 RWFVDFEFPNENGEPYKSSGGEFEESELGLIPKGWKVGRATDIFDVQSGGTPKTSTSEYW 233 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I + + L K + + + + + + A Sbjct: 234 NGEIPFFTPKDCSNSLYV-IETEKTITEDGLNNCNSKLFKTDTVFITARGTVGKVALAGR 292 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + A+ + + + + + + ++ + L +P I Sbjct: 293 DMAMNQSCYALVAKSGYTQKYVFHLTQQLVNVLRKNASGAVFDAITVSTFQNLKTTLPDI 352 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + + + L+ + L++ R + + ++G+I + E Sbjct: 353 EL----VRHFDGLVNGLYSLLLEKANETQTLQQLRDTLLPKLMSGEIRVPVEQD 402 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 59/206 (28%), Gaps = 13/206 (6%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESG 59 YK SG ++ +G IPK WKV + +G T ++ +I + +D + Sbjct: 189 YKSSGGEFEESELGLIPKGWKVGRATDIFDVQSGGTPKTSTSEYWNGEIPFFTPKDCSNS 248 Query: 60 -TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 K + +F + G K +A D + L K Sbjct: 249 LYVIETEKTITEDGLNNCNSKLFKTDTVFITARGTV-GKVALAGRDMAMNQSCYALVAKS 307 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + L+ + + GA N+ +P + + Sbjct: 308 GY-TQKYVFHLTQQLVNVLRKNASGAVFDAITVSTFQNLKTTLPDIELVRHFDGLVNGLY 366 Query: 179 VRIDTLITERIRFIELLKEKKQALVS 204 + E +L L+S Sbjct: 367 SLLLEKANETQTLQQLRDTLLPKLMS 392 >gi|189426561|ref|YP_001953738.1| restriction modification system DNA specificity domain [Geobacter lovleyi SZ] gi|189422820|gb|ACD97218.1| restriction modification system DNA specificity domain [Geobacter lovleyi SZ] Length = 514 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 73/469 (15%), Positives = 134/469 (28%), Gaps = 76/469 (16%) Query: 23 KHWKVVPIKRFT-KLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQS--D 74 W VP+ L TG + G I +G E ++S G L + Sbjct: 8 NGWLTVPLSDLLMSLETGSRPKGGVRGITAGIPSLGGEHLDSNGGFKLDNIRYVPLEFAE 67 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ------FLVLQPKDVLPELLQGWL 128 T G IL K G K D S FL + + + +L Sbjct: 68 LMTRGAINNGDILVVKDGATTGKVSFVDNSFPLSIAVVNEHVFLCRCSSLLNSKYIFFYL 127 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S Q+I GA + + +P+ P AEQ I EK+ +D + E Sbjct: 128 FSNSGNQQILEDFRGAAQGGISQRFADLVKVPLAPAAEQTRIVEKLEELFSDLDAGVAEL 187 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE------------------------- 223 + L + +Q+L+ V L + + K++ E Sbjct: 188 KAAQKKLAQYRQSLLKAAVEGSLTAEWRTKNTPKETGAQLLERILKERRARWEEKQLARF 247 Query: 224 ------------------------WVGLVPDHWEVKPFFALVTELNRKNTKLIES----- 254 + +P+ W + K + Sbjct: 248 KEQAKTPPKGWQDKYPEPVQPDTTNLPELPEGWVWASVDQVGEVFLGKMLDKTKHQTGAM 307 Query: 255 --NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV-- 310 + ++S + E + G+++ Sbjct: 308 LPYLRNISVRWGSIETHDLPEMYYEEDELERYGLASGDVLVCEGGEPGRAAVCGKEHEKL 367 Query: 311 -MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 ++ + ++ + YL L ++ L + F + E LP+ + Sbjct: 368 KYQKALHRVRLFSLYESDLLVFYLEHLAKTGMLEQYF---TGSTIKHFTKESFIALPIPL 424 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PPI EQ +I + + I S+ +R + + +A +GQ Sbjct: 425 PPICEQSEIVEHLKLAIQCAQEQDAAIIHSLTQAAAQRKNILKSAFSGQ 473 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 70/202 (34%), Gaps = 8/202 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD-----IIYIGLEDVESGTGKYLPKDGNSRQ 72 + +P+ W + + ++ G+ + K + Y+ V G+ + + Sbjct: 273 LPELPEGWVWASVDQVGEVFLGKMLDKTKHQTGAMLPYLRNISVRWGSIETHDLPEMYYE 332 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLL 129 D A G +L + G R A+ Q V +LL +L Sbjct: 333 EDELERYGLASGDVLVCEGGEPGRAAVCGKEHEKLKYQKALHRVRLFSLYESDLLVFYLE 392 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + T +E G+T+ H + +P+P+PP+ EQ I E + I Sbjct: 393 HLAKTGMLEQYFTGSTIKHFTKESFIALPIPLPPICEQSEIVEHLKLAIQCAQEQDAAII 452 Query: 190 RFIELLKEKKQALVSYIVTKGL 211 + +++ ++ + L Sbjct: 453 HSLTQAAAQRKNILKSAFSGQL 474 >gi|165976841|ref|YP_001652434.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876942|gb|ABY69990.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 470 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 45/434 (10%), Positives = 111/434 (25%), Gaps = 72/434 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W V ++ L GR +I ++ + L + Sbjct: 51 EIPESWVWVRLEDIFHLQAGR---------FISASEIYGEYKESLYPCYGGNGLRGFVKT 101 Query: 80 IFAKGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +G+ + G+ G A+ + +V++ L + + + Sbjct: 102 YNREGKFPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYF---LIQLNLN 158 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I ++ +P+PPL EQ I KI I+ + + L ++ Sbjct: 159 QYATATAQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQF 218 Query: 199 K----QALVSYIVTKGLNPDVKM------------------------------------- 217 ++++ + L Sbjct: 219 PEQLKKSILQAAIQGKLTEQNPNDEPASALIERIKAEKLRLIAEKKLKKPKVISEIIMRD 278 Query: 218 -----------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE---SNILSLSYGN 263 + E +P+ W + + + L GN Sbjct: 279 NLPYEIVNGKERCIADEVPFEIPESWVWVRLGEIGETNIGLTYNPSDVASDGTIVLRSGN 338 Query: 264 IIQKLET--RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I ++ + +++ + + + + Sbjct: 339 IQDGKIDVSSDIVKVNLDIPENKRCYKNDLLICARNGSKKLVGKAAIIDKDGYSFGAFMA 398 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + + Y+ + + S F + + + ++ + +P + EQ I Sbjct: 399 IFRSPF--NKYIYYYLSSPLFRNDFDGINTTTINQITQSNLNNRLIPLPSLNEQLRIVEK 456 Query: 382 INVETARIDVLVEK 395 I + + L +K Sbjct: 457 IETLFSTLQNLSQK 470 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 60/204 (29%), Gaps = 18/204 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + ++ +P+ W + + E +K + Sbjct: 44 TEQDFPFEIPESWVWVRLEDIFHLQAGRFISASEIYGEYKESLYPCYGGNGLRGFVKTYN 103 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 E F I Q + + A + D+ + + + Sbjct: 104 REGK---------FPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYFLIQ 154 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +L + + + L + + + +PP+ EQ I I I+ + E+ Sbjct: 155 LNLNQY---ATATAQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQY-AEKEEK 210 Query: 400 IVLL-----KERRSSFIAAAVTGQ 418 + L ++ + S + AA+ G+ Sbjct: 211 LTALHQQFPEQLKKSILQAAIQGK 234 >gi|167829998|ref|ZP_02461469.1| restriction modification system DNA specificity domain [Burkholderia pseudomallei 9] gi|167847544|ref|ZP_02473052.1| restriction modification system DNA specificity domain [Burkholderia pseudomallei B7210] Length = 437 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 60/431 (13%), Positives = 137/431 (31%), Gaps = 35/431 (8%) Query: 26 KVVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI--- 80 +V P++ L ++ D Y+ + + G+ + ++ I Sbjct: 4 EVRPLRDLCSLIADCPHSTPVWTDSGYLVIRNQNIKGGRLDLSSPSFTDAEHFAHRIRRA 63 Query: 81 -FAKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVLPELLQGWLLSIDVTQR 136 +G I++ + P +I C L P V L L S V Sbjct: 64 KPREGDIVFTREAPMGEVCMIPKGLECCVGQRQVLLRPDPDVVDGRYLLYALQSPQVQHE 123 Query: 137 I-EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I G+T+S+ + ++ +P P +A Q I + A RID L + Sbjct: 124 IGWNEGTGSTVSNVRIPVLESLKIPTPSIAVQRDIGSVLSALDDRIDLLRQTNATLESIA 183 Query: 196 KEKKQALV-----SYIVTKGLNPDVKMKDS--------GIEWVGLVPDHWEVKPFFALVT 242 + ++ +G PD ++ +G +P W V + Sbjct: 184 QTLFKSWFIDFDPVRAKAEGREPDGMDAETAALFPDSFEDSALGEIPKGWAVSTVGRVAQ 243 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNM-------GLKPESYETYQIVDPGEIVFRF 295 + E + + + + + S V G + Sbjct: 244 CVGGGTPSTKEQKFWEPAIHHWTTPKDLSGIAAPVLLDTERRLSDAGLAKVSSGLLPVGT 303 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + + + A + Y+A+ P G + + ++ + Sbjct: 304 LLMSSRAPIGYLAISQIPLAVNQGYIAMLPGGQLAPEYLYFWCQSNMDAIKQKANGSTFM 363 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + +P+++P + + A+I + + E+ + L+E R++ + + Sbjct: 364 EISKTAFRPIPIVLPSSE----VAACFADLAAKIFERISEGERQRIHLEEIRNTLLPRLI 419 Query: 416 TGQIDLRGESQ 426 +G++ L E++ Sbjct: 420 SGKLRL-PEAE 429 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 56/197 (28%), Gaps = 12/197 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG----------LEDVESGTGKYLPKD 67 +G IPK W V + R + G T + + + L + + + Sbjct: 226 LGEIPKGWAVSTVGRVAQCVGGGTPSTKEQKFWEPAIHHWTTPKDLSGIAAPVLLDTERR 285 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + + + G +L P I+ + ++ + P L + Sbjct: 286 LSDAGLAKVSSGLLPVGTLLMSSRAPI-GYLAISQIPLAVNQGYIAMLPGGQL-APEYLY 343 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I+ G+T IP+ +P + RI + Sbjct: 344 FWCQSNMDAIKQKANGSTFMEISKTAFRPIPIVLPSSEVAACFADLAAKIFERISEGERQ 403 Query: 188 RIRFIELLKEKKQALVS 204 RI E+ L+S Sbjct: 404 RIHLEEIRNTLLPRLIS 420 >gi|294784905|ref|ZP_06750193.1| type I restriction modification DNA specificity family protein [Fusobacterium sp. 3_1_27] gi|294486619|gb|EFG33981.1| type I restriction modification DNA specificity family protein [Fusobacterium sp. 3_1_27] Length = 592 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 46/396 (11%), Positives = 113/396 (28%), Gaps = 34/396 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + K+ G K +S KY +G + Sbjct: 13 PNGVEYKELGDIAKVTIGEFVHKDK----------QSENAKYPVYNGGISNTGYYDEYNE 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K +I+ G + + D + + + + + + Sbjct: 63 EKNKIIISARGANAGYINRIFVNYWAGNSCYTINANDKIINWNFLYYVLKNKEKGLLNKQ 122 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + ++ K + +I +P+PPL Q I + T L E + K++ Sbjct: 123 QTGSIPSISKKQVESILVPVPPLEVQDEIVRILDNFTALTAELTAELTAELTARKKQYSW 182 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 Y++ N +K +G + + + + Sbjct: 183 YRDYLLKFE-NKVKMVK------IGDLFEFKNGINKDKGSFGKGTPIINYVN-----VYK 230 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGIITSA 319 N I + + + V G++ F ++ S + +E + + Sbjct: 231 KNKIYFEDLKGLVEASNDELVRYGVKRGDVFFTRTSETIEEIGYTSVLLEDIENCVFSGF 290 Query: 320 YMAVKPHGID--STYLAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +P Y A+ + ++ + R + ++ + +PP++ Q Sbjct: 291 LLRARPITDLLLPEYCAYCFSTSNIRNTIIKKSTYTTRALTNGTSLSQIEIPLPPLEVQK 350 Query: 377 DITNVINVETARIDVL-------VEKIEQSIVLLKE 405 I V++ + L +E ++ + Sbjct: 351 RIVEVLDNFEKICNDLNIGLPAEIEARQKQYEFYRN 386 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 57/404 (14%), Positives = 127/404 (31%), Gaps = 29/404 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTVSI 80 K+V I + G + G K I +V Y K +D Sbjct: 195 KMVKIGDLFEFKNGINKDKGSFGKGTPIINYVNVYKKNKIYFEDLKGLVEASNDELVRYG 254 Query: 81 FAKGQILYGKLGPYLRKAIIAD------FDGICSTQFLVLQPKDVL--PELLQGWLLSID 132 +G + + + + + + + S L +P L PE + + Sbjct: 255 VKRGDVFFTRTSETIEEIGYTSVLLEDIENCVFSGFLLRARPITDLLLPEYCAYCFSTSN 314 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I T + + + I +P+PPL Q I E + + L I Sbjct: 315 IRNTIIKKSTYTTRALTNGTSLSQIEIPLPPLEVQKRIVEVLDNFEKICNDLNIGLPAEI 374 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E +++ + ++++T + K + + + + + K Sbjct: 375 EARQKQYEFYRNFLLTFKIENCTLPKTRQDKTRQDIIKLFMYIFGYIELELGEILKIKNG 434 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I + G +TY ++ R + N + ++ Sbjct: 435 SDYKKF-----NIGNIPVYGSGGIINYIDTYIYDKESVLIPRKGSIGNLFYVDKPFWTVD 489 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 T Y + + S YL + + +L K+ +G SL + ++ + +P + Sbjct: 490 ----TIFYTVIDKDVVISKYLYYYLSKMNLEKL---NTAGGVPSLTQTVLNKILISLPSL 542 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +EQ I ++++ + + E + I ++ R + Sbjct: 543 EEQERIVDILDRFDKLCNDISEGLPAEIEARQKQYEYYREKLLT 586 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 48/149 (32%), Gaps = 7/149 (4%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K N G+ Y + +I+ + + S Y Sbjct: 43 KYPVYNGGISNTGYYDEYNEEKNKIIISARGAN---AGYINRIFVNYWAGNSCYTINAND 99 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 I + + + + +G S+ + V+ + V VPP++ Q +I +++ T Sbjct: 100 KIINWNFLYYVLKNKEKGLLNKQQTGSIPSISKKQVESILVPVPPLEVQDEIVRILDNFT 159 Query: 387 ARIDVLVEKIEQSIVLLKE----RRSSFI 411 A L ++ + K+ R + Sbjct: 160 ALTAELTAELTAELTARKKQYSWYRDYLL 188 >gi|221231664|ref|YP_002510816.1| type I RM modification enzyme [Streptococcus pneumoniae ATCC 700669] gi|220674124|emb|CAR68643.1| putative type I RM modification enzyme [Streptococcus pneumoniae ATCC 700669] Length = 372 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 50/394 (12%), Positives = 109/394 (27%), Gaps = 28/394 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + G + +D G E + + N I G Sbjct: 2 KKVKLGEVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSTEINYYSGMIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSG-TLGVFQWRGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI +P L EQ I ++ + I + Sbjct: 120 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNL------------ 167 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 L + G + D+ + + E L L+ N+ Sbjct: 168 -----LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKNVT 222 Query: 266 QKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + + + + ++ +IV + + I S + Sbjct: 223 KNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSGMV 282 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 ++P + +++ + + L +K++ + +PP+ Q + + Sbjct: 283 ILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFADF 341 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + A +D I++S+ L+ + S + Sbjct: 342 V----ALVDKSQLAIQKSLEELETLKKSLMQEYF 371 >gi|291004532|ref|ZP_06562505.1| restriction modification system DNA specificity domain-containing protein [Saccharopolyspora erythraea NRRL 2338] Length = 283 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 60/274 (21%), Positives = 104/274 (37%), Gaps = 9/274 (3%) Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV 215 +P P L EQ I + + AET RID L R R +++L+EK V V G Sbjct: 1 MLPFPRVSLEEQRRIADFLDAETTRIDKLSALRERQLDILEEKAMRRVYDTVR-GTGVVG 59 Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 + SG+ W+G VP HW V K + L + ++ + Sbjct: 60 ARRPSGLSWLGSVPVHWRVAAVSHYFEVELGKMLNQERARGDHLRPYLRVANVQWGVVDT 119 Query: 276 K-------PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 P + + PG+++ + ++ S ++ E + + Sbjct: 120 TELAMMDFPPEEQKRYRLQPGDLLVNEGGSWPGRAAVWSGEIEEIYYQKALHRIRPRGME 179 Query: 329 DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + +L + + + + KVF G S L E ++ P + EQ + A Sbjct: 180 STWWLYFCLVAAERMKVFQVQGNSSTMTHLTREQLRPQRFPFPDLAEQEQAVERLKDAEA 239 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + + L ERR + I AAVTG+ D+ Sbjct: 240 KDRQIRRVLSRQQATLAERRQALITAAVTGEFDV 273 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 79/211 (37%), Gaps = 9/211 (4%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLP 65 + SG+ W+G++P HW+V + + ++ G+ Y+ + +V+ G Sbjct: 62 RPSGLSWLGSVPVHWRVAAVSHYFEVELGKMLNQERARGDHLRPYLRVANVQWGVVDTTE 121 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPE 122 + G +L + G + +A + + ++P+ + Sbjct: 122 LAMMDFPPEEQKRYRLQPGDLLVNEGGSWPGRAAVWSGEIEEIYYQKALHRIRPRGMEST 181 Query: 123 LLQGWLLSIDVTQRIEAIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + L ++ + +TM+H + + P P LAEQ E++ + Sbjct: 182 WWLYFCLVAAERMKVFQVQGNSSTMTHLTREQLRPQRFPFPDLAEQEQAVERLKDAEAKD 241 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + R L E++QAL++ VT + Sbjct: 242 RQIRRVLSRQQATLAERRQALITAAVTGEFD 272 >gi|254234631|ref|ZP_04927954.1| hypothetical protein PACG_00495 [Pseudomonas aeruginosa C3719] gi|126166562|gb|EAZ52073.1| hypothetical protein PACG_00495 [Pseudomonas aeruginosa C3719] Length = 416 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 53/413 (12%), Positives = 125/413 (30%), Gaps = 28/413 (6%) Query: 29 PIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + + + G++ G D + DV+ + + + Sbjct: 9 KLDQLGFVGRGKSKHRPRNDPSLYGGDYPFFQTGDVKGAELYLRCFSATYNEKGLAQSKL 68 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + G + + + I G + ++ + +++I Sbjct: 69 WQPGTLCIT-IAANIADTSILSIPGCFPDSVVGFVADPQRSDVFFVKYYLDTLKNAMQSI 127 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G T + + + + +PP+ EQ I ++A I+ E + Sbjct: 128 SHGTTQDNLSLEKLLSFDFWVPPVEEQRKIASVLLAYDDLIENNTRRIEILE----EMAR 183 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF---FALVTELNRKNTKLIESNIL 257 L + P + + +GL+P W V L+T+ K+ E+ + Sbjct: 184 RLYEEWFVQFRFPGHEGVEFKESELGLIPKSWSVVKLEEICDLITDGAHKSPPTAETGMP 243 Query: 258 SLSYGNIIQKL----ETRNMGLKPESYETYQIVDP--GEIVFRFIDLQNDKRSLRSAQVM 311 S ++ + R + P G+++ K + Sbjct: 244 MASVKDMHDWGVDVSKCRKISRSDYDELVRNNCKPMIGDVLVAKDGSYL-KHIFSVEKDQ 302 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP 370 + +++S + + S L L+R + SG + +D ++ ++ P Sbjct: 303 DLVLLSSIAILRPINKSVSDLLVCLLRHPETIARMKGCVSGVAIPRIILKDFRKFQIVFP 362 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 Q + LV+K L+ +R + ++G+ID+ Sbjct: 363 SQDLQEAWLATASPLMRLCRKLVDKN----ANLRAQRDLLLPKLISGEIDVSD 411 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 64/206 (31%), Gaps = 13/206 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYL 64 ++K+S +G IPK W VV ++ L T +S + ++D+ Sbjct: 202 EFKESE---LGLIPKSWSVVKLEEICDLITDGAHKSPPTAETGMPMASVKDMHDWGVDVS 258 Query: 65 P-KDGNSRQSDTSTVSIFAK--GQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKD 118 + + D + G +L K G YL+ + D + S+ ++ Sbjct: 259 KCRKISRSDYDELVRNNCKPMIGDVLVAKDGSYLKHIFSVEKDQDLVLLSSIAILRPINK 318 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + +LL L + R++ G + K + P Q Sbjct: 319 SVSDLLVCLLRHPETIARMKGCVSGVAIPRIILKDFRKFQIVFPSQDLQEAWLATASPLM 378 Query: 179 VRIDTLITERIRFIELLKEKKQALVS 204 L+ + L+S Sbjct: 379 RLCRKLVDKNANLRAQRDLLLPKLIS 404 >gi|317014200|gb|ADU81636.1| type I restriction-modification methylase [Helicobacter pylori Gambia94/24] Length = 404 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 52/389 (13%), Positives = 107/389 (27%), Gaps = 22/389 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + G+ + Y + G Y N +D Sbjct: 13 PKGVGFRKLGEVINILKGKQLNKELLLDYGKYPVMNGGI--YASGYWNEYNTDYPK---- 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I+ + G ++ + + + Sbjct: 67 ----IIISQGGASAGYVNYMTSKFWAGAHCYTIELNSEKLNYKFLYYFLKNSQTILMKSQ 122 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GA + + I + +PIPPL Q I + A T L TE + + + Sbjct: 123 FGAGIPALNKADIETLTIPIPPLEIQQEIVTILDAFTELNTELNTELNARKKQYEYYQNM 182 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+ N + E + P +K + KL E Sbjct: 183 LLD------FNDINQSHKDAKEKLAQKPYPKRLKQLLHTLAPKGVGFRKLGEVCDFQKGK 236 Query: 262 GNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + Y I S + + S Sbjct: 237 SITKKAVTFGKVPVISGGRQPAYYHNEANRSGETIAISSSGVYAGYVSYWDIPVFLADSF 296 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ++ K + YL + + + + +G + +D++ + +PP++ Q +I Sbjct: 297 SVSPKQKTLMPKYLFYYLTTQQ-DAIHATKSTGGIPHVYSKDLQNFLIPIPPLEIQQEIV 355 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRS 408 +++ +A L I I K R+ Sbjct: 356 TILDQFSALTTDLQAGIPAEI---KARKK 381 >gi|254414393|ref|ZP_05028159.1| Type I restriction modification DNA specificity domain protein [Microcoleus chthonoplastes PCC 7420] gi|196178623|gb|EDX73621.1| Type I restriction modification DNA specificity domain protein [Microcoleus chthonoplastes PCC 7420] Length = 411 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 64/420 (15%), Positives = 138/420 (32%), Gaps = 29/420 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W + L G++ G +I D+ + ++ Sbjct: 2 SEWNEFYLSDVGTLARGKSKHRPRWADHLYGGPYPFIQTGDISAANKYINTYRQTYSEAG 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + ++ KG L + + + I + L P +L + + Sbjct: 62 LAQSKLWDKGT-LCITIAANIAEIAILELPACFPDSVLGFIPNPEKVDLNFVFYTLTFLK 120 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 RI+ + G+ + + NI P + +Q I + +I+ L + + Sbjct: 121 ARIQNLAIGSVQENINLGTFKNIKFFFPSVKKQKEIASVLSCLDRKIENLRKQNDTLEAI 180 Query: 195 LKEKKQALVSYIVTKGLNPD---VKMKDSG----IEWVGLVPDHWEVKPFFALVTEL--- 244 Q L + P+ K SG +G +P W V +V Sbjct: 181 A----QTLFKHWFVDFEFPNADGKPYKSSGGAMEPSELGEIPAGWRVGKLGDVVKVNAES 236 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 K+ + E + +S I T + K ++V G++++ + N K Sbjct: 237 ISKSYQHKEIEYVDISSVGIGVLEGTTSYLFKNAPSRARRLVKHGDVIWSGVRP-NRKSY 295 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVK 363 L + E ++++ ++ + P I S+YL + + + SG ++K E + Sbjct: 296 LFISHPPENLVVSTGFITLTPDSIPSSYLYSWVTTESFVEYLTFNASGSAYPAIKAEHFE 355 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 VL+P VI +I + + L + R + ++G++ ++ Sbjct: 356 IADVLLPDKFNLTKFHAVIEPMREKIHQ----NSRQLQTLTKTRDLLLPKLMSGKLRIKP 411 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 70/204 (34%), Gaps = 10/204 (4%) Query: 10 YKDSG--VQ--WIGAIPKHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKY 63 YK SG ++ +G IP W+V + K+N S+S K+I Y+ + V G + Sbjct: 202 YKSSGGAMEPSELGEIPAGWRVGKLGDVVKVNAESISKSYQHKEIEYVDISSVGIGVLE- 260 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVL 120 + + + + G +++ + P + + + + ST F+ L P + Sbjct: 261 GTTSYLFKNAPSRARRLVKHGDVIWSGVRPNRKSYLFISHPPENLVVSTGFITLTPDSIP 320 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 L W+ + + + G+ + + +P I + Sbjct: 321 SSYLYSWVTTESFVEYLTFNASGSAYPAIKAEHFEIADVLLPDKFNLTKFHAVIEPMREK 380 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 I + + L+S Sbjct: 381 IHQNSRQLQTLTKTRDLLLPKLMS 404 >gi|320321657|gb|EFW77756.1| restriction modification system DNA specificity domain [Pseudomonas syringae pv. glycinea str. B076] Length = 567 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 65/486 (13%), Positives = 130/486 (26%), Gaps = 91/486 (18%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P WK + + +N + ++ ++ + + + ++ + Sbjct: 82 ELPAGWKWSSLAQVAFVNPRNAAADSLEVSFVPMTFIGTRFDDQHGQEPRLWGELKQGFT 141 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICST--------QFLVLQPKDVLPELLQGWLLSI 131 FA+G I K+ P + F + + + + P + +L S Sbjct: 142 HFAEGDIGVAKITPCFENSKACVFSNLLNGLGAGTTELHIVRPITGTLDPRYVLAYLKSP 201 Query: 132 DVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID-------- 182 E G + P P+PPLAEQ I K+ D Sbjct: 202 QFLLVGETKMTGTAGQKRLPKDFVEANPFPLPPLAEQHRIIAKVDELMALCDRLEAQQAD 261 Query: 183 ---------------------------------TLITERIRFIELLKEKKQALVSYIVTK 209 + KQ L+ V Sbjct: 262 AESAHTQLVQALLDSLTQASDATDFATNWQRLAEHFHTLFTTEPSIDALKQTLLQLAVMG 321 Query: 210 GLNPDVKMKDSGIEWVGLV-------------------------------PDHWEVKPFF 238 L P + E + + P WE Sbjct: 322 KLVPQDSSDEPASELIKKIESEKYRQVKAGKFKPVKQVNGIEAADKPFQLPATWEWARLA 381 Query: 239 A---LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP-----ESYETYQIVDPGE 290 +T+ IE + LS ++ N E G+ Sbjct: 382 DVAFQITDGAHHTPTYIEFGVPFLSVKDMSGGSLGFNATRYISEDAHEQLTKRCHPQRGD 441 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ I + ++ + ++ +YL L+ S + K Sbjct: 442 LLLTKIGTTG-VPVIVDTDRPFSIFVSVGLIKAPWDHLNVSYLQLLISSPFVKKQSLDGT 500 Query: 351 SGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G+ ++L + + +PP+ EQ I ++ D L ++ Q+ L ++ S+ Sbjct: 501 EGVGNKNLVLRKIANFLIAIPPLAEQHRIVIKVDELMTLCDQLKIRLTQARQLNEQLAST 560 Query: 410 FIAAAV 415 + AV Sbjct: 561 LVEQAV 566 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 65/218 (29%), Gaps = 9/218 (4%) Query: 206 IVTKGLNPDVKMKDSGIE-WVGLVPDHWEVKPFFALVTELNRKN--TKLIESNILSLSYG 262 V + + + + G E +P W+ + R L S + G Sbjct: 60 AVERKIKKKKPLAEVGEEAQPFELPAGWKWSSLAQVAFVNPRNAAADSLEVSFVPMTFIG 119 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS---- 318 + L E + + G+I I + + G+ Sbjct: 120 TRFDDQHGQEPRLWGELKQGFTHFAEGDIGVAKITPCFENSKACVFSNLLNGLGAGTTEL 179 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +D Y+ ++S G+ ++ L + V+ P +PP+ EQ Sbjct: 180 HIVRPITGTLDPRYVLAYLKSPQFLLVGETKMTGTAGQKRLPKDFVEANPFPLPPLAEQH 239 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 I ++ A D L + + + + + + Sbjct: 240 RIIAKVDELMALCDRLEAQQADAESAHTQLVQALLDSL 277 >gi|330969619|gb|EGH69685.1| type I restriction-modification system, S subunit [Pseudomonas syringae pv. aceris str. M302273PT] Length = 432 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 66/407 (16%), Positives = 142/407 (34%), Gaps = 27/407 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W ++ + S SGK + + + G + + + + Sbjct: 5 DIPASWLILDFNEIFS----QVSTSGKKVK--SADVLTEGRFPVVDQGRSFISGYLDDAN 58 Query: 80 IF---AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + K I++ G + R+ DF I + + + + ++ Sbjct: 59 LVVSENKPLIIF---GDHTREIKWIDFPFIPGADGVQILKPHPEMDTRFLYYFLRNLPIE 115 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + + +PPLAEQ I K+ ++DTL LLK Sbjct: 116 SRGYARHFKI-------VKDAAYLVPPLAEQTRIAAKLDELLAQVDTLKACIDGIPSLLK 168 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 +Q++++ V+ L + + A + K + ++S I Sbjct: 169 RFRQSVLAAAVSGRLTDEWRGAVRENSDGQGFSYPVRRLGVIARFIDYRGKTPEKVDSGI 228 Query: 257 LSLSYGNIIQKLETR--NMGLKPESYETYQ---IVDPGEIVFRFIDLQNDKRSLRSAQVM 311 ++ NI +R ++PE+YE++ I G+++ + + + Sbjct: 229 PLITAKNIKSGYISRVPREFIRPEAYESWMTRGIPKVGDVLITTEAPLGNVAVIDITE-- 286 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP 370 + + A G S++ A ++S L + +G + +K +K + + P Sbjct: 287 KFALAQRAICLQFHEGYSSSFAAITLQSSLLQEELARRSTGTTVKGIKASVLKEIGLPAP 346 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 I EQ +I + + A + L K+ ++ + S +A A G Sbjct: 347 SIDEQNEIVHRVEQLFAYAEQLETKVSEAKKRIDHLAQSILAKAFKG 393 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 82/199 (41%), Gaps = 16/199 (8%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 +P W + F + ++++ K+ +++L+ ++ + + G + + Sbjct: 4 NDIPASWLILDFNEIFSQVSTSGKKVKSADVLTEGRFPVVDQGRSFISGYLDD---ANLV 60 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 V + + F D + + + + + + +D+ +L + +R+ + Sbjct: 61 VSENKPLIIFGDHTREIKWIDFPFIPGADGVQ---ILKPHPEMDTRFLYYFLRNLPIESR 117 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 YA F+ VK LVPP+ EQ I ++ A++D L I+ LLK Sbjct: 118 GYAR--------HFKIVKDAAYLVPPLAEQTRIAAKLDELLAQVDTLKACIDGIPSLLKR 169 Query: 406 RRSSFIAAAVTGQIDLRGE 424 R S +AAAV+G L E Sbjct: 170 FRQSVLAAAVSG--RLTDE 186 >gi|168488197|ref|ZP_02712396.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae SP195] gi|183572997|gb|EDT93525.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae SP195] Length = 521 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 65/438 (14%), Positives = 140/438 (31%), Gaps = 66/438 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPLAEQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L K+ ++++ Y + L +S Sbjct: 263 LEQLDKKFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 221 ---------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + G +P +W V + + + K + +I + II+ + Sbjct: 323 DISIVSQGDDNSYYGNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINN-KGVRIIRGGNIK 381 Query: 272 NMGLKPESYETYQ----------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + Y + +++ G++ ++ Sbjct: 382 PLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFI 441 Query: 322 A----VKPHGIDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + I S +L + + S K + ++ + L + + P +E Sbjct: 442 FQLTPFESSEIISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEE 501 Query: 375 QFDITNVINVETARIDVL 392 Q IT + +++ L Sbjct: 502 QELITQKVEKLFEKVNQL 519 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 81/211 (38%), Gaps = 14/211 (6%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLL-----KERRSSFIAAAVTGQ 418 E + + L + + S + A+ G+ Sbjct: 257 AESYNR-LEQLDKKFPDKLKKSILQYAMQGK 286 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 337 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 396 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 397 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 456 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 457 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 516 Query: 182 DTLI 185 + L Sbjct: 517 NQLW 520 >gi|93006186|ref|YP_580623.1| restriction modification system DNA specificity subunit [Psychrobacter cryohalolentis K5] gi|92393864|gb|ABE75139.1| restriction modification system DNA specificity domain [Psychrobacter cryohalolentis K5] Length = 453 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 65/457 (14%), Positives = 135/457 (29%), Gaps = 65/457 (14%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------------GKDIIYIGLEDVESGTGKYLPK-DGN 69 W + KL G + I I ++ + Sbjct: 3 SDWVKTTLGEIVKLGNGIIQTGPFGSQLHASDYVDEGIPVIMPLNIINNKIDLSGIARIT 62 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGW 127 ++ + + K I+Y + G RKA+I + C T L+++P + + + Sbjct: 63 KEDAERLSKHLVKKNDIVYSRRGDVTRKALITELEEGMFCGTGCLLVRPGNSIDARFLTY 122 Query: 128 L-LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S + I GATM + + + +P+ IP L Q I + + +I+ Sbjct: 123 HLSSPINQEWIIRHAVGATMPNLNTGILKRVPLNIPSLDTQKAIAHILGSLDDKIELNRQ 182 Query: 187 ERIRFIELLKEKKQA-------LVSYIVTKG-------------LNPDVKMKDSGI---- 222 + + ++ L+ + G K +S I Sbjct: 183 MNETLEAMAQALFKSWFVDFDPLIDNALAAGNAIPDEFIERAEQRKKIEKKDNSDIQDLF 242 Query: 223 -------EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 E +G +P WE + + K + +S N+ ++ + Sbjct: 243 PDAFEFAEEMGWIPKGWENGILADICSYGKGKINTSELTLENYVSTENMNKEKSGISHAA 302 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 S G+ + I K L S G + + + YL Sbjct: 303 NIASTNQVPKFSVGQTLISNIRPYFKKIWLASFSG---GRSNDVLSFQAHNSVANEYLFN 359 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L+ A G + V +P + + E + + + Sbjct: 360 LLYQDSFFDYMTATSKGTKMPRGDKAAIMSWSVAIPS--------SRLMEEFSELAKPMY 411 Query: 395 KIE-----QSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 Q+I L K R + ++ ++G++ + ++ Sbjct: 412 LANNLRSLQTIELAK-LRDTLLSKLMSGELCIPDAAR 447 Score = 43.6 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 36/148 (24%), Positives = 56/148 (37%), Gaps = 9/148 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G IPK W+ + G+ + S + LE+ S K G S ++ ++ Sbjct: 253 GWIPKGWENGILADICSYGKGKINTSE-----LTLENYVSTENMNKEKSGISHAANIAST 307 Query: 79 SIFAK---GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVT 134 + K GQ L + PY +K +A F G S L Q + + E L L Sbjct: 308 NQVPKFSVGQTLISNIRPYFKKIWLASFSGGRSNDVLSFQAHNSVANEYLFNLLYQDSFF 367 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIP 162 + A +G M D I + + IP Sbjct: 368 DYMTATSKGTKMPRGDKAAIMSWSVAIP 395 >gi|304569708|ref|YP_010923.2| type I restriction-modification enzyme, S subunit [Desulfovibrio vulgaris str. Hildenborough] gi|311233889|gb|ADP86743.1| restriction modification system DNA specificity domain protein [Desulfovibrio vulgaris RCH1] Length = 416 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 56/416 (13%), Positives = 138/416 (33%), Gaps = 26/416 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLP-KDGNSRQS 73 +P+ W+ I + +G +S I + +++ G ++ K ++ Sbjct: 2 VPEGWRADIIGNHISIVSGYPFKSHEYTDNSDGIRLLRGDNIAQGYIRWSGCKRWINKDK 61 Query: 74 DTSTVSIFAKGQILYGKLGPY------LRKAIIADFDGICSTQFLVLQPKD-VLPELLQG 126 ++ + + + D + + L+ KD + ELL+ Sbjct: 62 INVERFALKPADLVIAMDRTWVSSGLKISEIRHEDCPSLLVQRVSRLRSKDSFVQELLKQ 121 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S Q ++++ + H + I P+ +PPL EQ I + D I Sbjct: 122 IFNSFRFEQYVKSVQTETAVPHISAQQIKEFPILLPPLTEQKKIARIL----STWDKAIE 177 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + IE K++K+AL+ ++T + +G + Sbjct: 178 TVDKLIENSKQQKKALMQQLLTGKKRLPGFSGEWKEVRLGDLFQVTIGGTPSRKNNAYWD 237 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + N + + +++ ++ F + + Sbjct: 238 QLKASGNKWVAISDLKNKFLVETNEYITDAGAANSNVKLIPRLTVIMSFKLTIGKRAITK 297 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + I A++ + ID+ + + DL + G +++ + ++ Sbjct: 298 TQCYTNEAIC--AFIPKHKNEIDTNFFYHHLGIIDLVQDVDQAVKG--KTINKSKIMKIR 353 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +P + EQ I I + + ++ I L+KE + + + +TG+ ++ Sbjct: 354 TKLPNLLEQIAIAQRIEAFDLQ---QEDYLKTRIFLVKE-KQALMQQLLTGKRRVK 405 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 59/167 (35%), Gaps = 8/167 (4%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG---IITSA 319 I+ + K + + P ++V S E ++ Sbjct: 46 GYIRWSGCKRWINKDKINVERFALKPADLVIAMDRTWVSSGLKISEIRHEDCPSLLVQRV 105 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDI 378 L + S+ + ++ + + + +K P+L+PP+ EQ I Sbjct: 106 SRLRSKDSFVQELLKQIFNSFRFEQYVKSVQTETAVPHISAQQIKEFPILLPPLTEQKKI 165 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +++ D +E +++ I K+++ + + +TG+ L G S Sbjct: 166 ARILSTW----DKAIETVDKLIENSKQQKKALMQQLLTGKKRLPGFS 208 >gi|313639652|gb|EFS04448.1| restriction modification system DNA specificity subunit [Listeria seeligeri FSL S4-171] Length = 431 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 52/417 (12%), Positives = 124/417 (29%), Gaps = 34/417 (8%) Query: 23 KHWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDVES-----GTGKYLPKDGNSRQS 73 W+ + + + + + + K + ++ D+ S +YL Sbjct: 20 NDWEQRKLGGLMNITSVKRIHQSDWTDKGVRFLRARDIVSASKGKNPSEYLYISKKLYDE 79 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLS- 130 + G +L +G +I + + Q K + + + Sbjct: 80 HSKISGKVGVGDLLVTGVGSIGIPMLIKHEEPLYFKDGNIIWFQNKKNIDGGFFYYSFNS 139 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + I T+ G P+ +P EQ I ++D I R Sbjct: 140 HSIQKFIRDSAGIGTVGTYTIDSGGKTPIYLPNKKEQQRIGTF----FKQLDNTIALHQR 195 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 +E +K K A +S + + + +G V F L+R Sbjct: 196 KLEKIKALKTAYLSEMFPAEGELKPRRRFAGFTDDWEQRKLMSVFEFPVSTNSLSRSQLN 255 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKP---------ESYETYQIVDPGEIVFRFIDLQND 301 I S+ YG+I+ ++ K +++ G+++F Sbjct: 256 YDNGEIKSVHYGDILVNYDSILEIAKDRIPFITNGVIDKYKPNLLENGDLIFADAAEDET 315 Query: 302 KRSLRSAQVMERGIITSA---YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSL 357 I + +A + + + + S + G S+ Sbjct: 316 VGKAVEVDGKTNEYIVAGLHTIVARPRRKMAKFFWGYYINSSIYHNQLLRLMQGTKVASI 375 Query: 358 KFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++++ + P EQ + N ++D + ++ + L+ + +++ Sbjct: 376 SKSNLQKTCIAYPDNFAEQQKLGNF----FKQLDNTITLHQRKLKKLQNIKKAYLNE 428 >gi|257465469|ref|ZP_05629840.1| restriction modification system DNA specificity subunit [Actinobacillus minor 202] gi|257451129|gb|EEV25172.1| restriction modification system DNA specificity subunit [Actinobacillus minor 202] Length = 374 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 48/406 (11%), Positives = 125/406 (30%), Gaps = 55/406 (13%) Query: 24 HWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ V + T + + S +I + ++ L D +++ Sbjct: 7 GWESVRLGDIAITVTSGSRDWAQYYSDTGAKFIRMTNLNRNGITLLLDDLKFVNVQSNSA 66 Query: 79 SI----FAKGQILYGKLGPYLRKAIIADFDG--ICSTQ--FLVLQPKDVLPELLQGWLLS 130 + IL + I + G + + + P + + L S Sbjct: 67 DVKRTSLQANDILISITAELGKIGFIPENFGEAYINQHTALIRIDPNKAHAKFIAYVLSS 126 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + + I ++ + + + I + + +P + EQ+ I E + D I + Sbjct: 127 VAINKTINSLNDAGAKAGLNLPTIKALSLKLPSIEEQIQITETL----STWDNAIQTTEK 182 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 +E +++K+AL+ ++ G + +++ +VT N Sbjct: 183 LLENTRQQKKALMQKLL-----------------NGKDWEETKLQNLCKIVTGKKDVNEG 225 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + ++ + + + N + Sbjct: 226 NDKGIYPFFTCAKEHTYSDSYSFECE-----------------ALLIAGNGVVGQTTYYK 268 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + Y+ + GI+ YL ++ + + G +K + V +P Sbjct: 269 GKFEAYQRTYVLYEFKGINVQYLYQYIKWHLQKDIEREKQHGAMPYIKLGLLTDFVVKLP 328 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 EQ I +++ I+ L ++ + LK + + + + Sbjct: 329 KSNEQQKIAEILSTADQEIETL----QRKLECLKLEKGALMQRLLR 370 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 57/146 (39%), Gaps = 9/146 (6%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + +I+ + +A + + P+ + ++A+++ S Sbjct: 69 KRTSLQANDILISITAELGKIGFIPENFGEAYINQHTALIRIDPNKAHAKFIAYVLSSVA 128 Query: 342 LCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + K ++ +G + L +K L + +P I+EQ IT ++ D ++ E+ + Sbjct: 129 INKTINSLNDAGAKAGLNLPTIKALSLKLPSIEEQIQITETLSTW----DNAIQTTEKLL 184 Query: 401 VLLKERRSSFIAAAVTGQIDLRGESQ 426 ++++ + + + G+ + Sbjct: 185 ENTRQQKKALMQKLLNGK----DWEE 206 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 56/182 (30%), Gaps = 15/182 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIF 81 K W+ ++ K+ TG+ +DV G K P +++ S F Sbjct: 202 KDWEETKLQNLCKIVTGK-------------KDVNEGNDKGIYPFFTCAKEHTYSDSYSF 248 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +L G + + + VL + + + + IE Sbjct: 249 ECEALLIAGNG-VVGQTTYYKGKFEAYQRTYVLYEFKGINVQYLYQYIKWHLQKDIEREK 307 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + M + + + + +P EQ I E + I+TL + Q Sbjct: 308 QHGAMPYIKLGLLTDFVVKLPKSNEQQKIAEILSTADQEIETLQRKLECLKLEKGALMQR 367 Query: 202 LV 203 L+ Sbjct: 368 LL 369 >gi|317179608|dbj|BAJ57396.1| Type I R-M system specificity subunit [Helicobacter pylori F30] Length = 388 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 47/379 (12%), Positives = 115/379 (30%), Gaps = 24/379 (6%) Query: 46 KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFD 104 I +I +D Y + + + +L G A+ Sbjct: 28 NYIPFIQNKDFLGHYINYKTDYFIPNEIAIRFPQILLNEKCLLISISGAIGNVAVFNHSQ 87 Query: 105 -GICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP 162 VL+ K+ + + +L+S + + + ++ + + ++ +P+P Sbjct: 88 DAFTGGAIAVLKFKEKKSLDFVMHFLMSASGQKLLLNGVKSSSHKNLTIADLRDLLIPLP 147 Query: 163 PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI 222 PL EQ+ I + + I + K++L ++++ + Sbjct: 148 PLNEQIAIANILSGLDRYL----CALDALILKKESVKKSLSFELLSQRKRLKGFNQAWQR 203 Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 +G + K T + ++GN ++ + L+ + Sbjct: 204 VRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIG-----TFGNTADAFISKKLFLEYRT--K 256 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 Y G+I+ + + + + +L +Y Sbjct: 257 YSFPKKGDILISASGT----IGKAVIYDGKPAYFQDSNIVWIDNDETLVKNDFLFYAYSN 312 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K L ++ + + +PP+ EQ I NV++ I L K Q Sbjct: 313 VKW--NTEHTTILRLYNDNFRNTLIPLPPLNEQSAIANVLSALDNEIISLKNKKRQ---- 366 Query: 403 LKERRSSFIAAAVTGQIDL 421 + + + ++ +I + Sbjct: 367 FENIKKALNHDLMSAKIRV 385 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 62/181 (34%), Gaps = 11/181 (6%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 I G+ I + + +++ ++ + + Sbjct: 27 PNYIPFIQNKDFLGHYINYKTDYFIPNEIAIRFPQILLNEKCLLISISGAIGNVAVFNHS 86 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 Q G + + +D + +LM + + + S ++L D++ L + Sbjct: 87 QDAFTGGAIAVLKFKEKKSLD-FVMHFLMSASGQKLLLNGVKSSSHKNLTIADLRDLLIP 145 Query: 369 VPPIKEQFDITNVINVETAR---IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +PP+ EQ I N+++ +D L+ K E + S ++ + L+G + Sbjct: 146 LPPLNEQIAIANILSGLDRYLCALDALILKKESV-------KKSLSFELLSQRKRLKGFN 198 Query: 426 Q 426 Q Sbjct: 199 Q 199 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 56/185 (30%), Gaps = 10/185 (5%) Query: 25 WKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ V + K + +I + + + ++ K + S Sbjct: 201 WQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYRTKYS 258 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 KG IL G + I +V E L ++ Sbjct: 259 FPKKGDILISASGTIGKAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYSNVKW 315 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 E T+ N +P+PPL EQ I + A I +L ++ +F + K Sbjct: 316 NTEHTTILRLYNDNFRNTLIPLPPLNEQSAIANVLSALDNEIISLKNKKRQFENIKKALN 375 Query: 200 QALVS 204 L+S Sbjct: 376 HDLMS 380 >gi|149203432|ref|ZP_01880402.1| restriction modification system DNA specificity domain [Roseovarius sp. TM1035] gi|149143265|gb|EDM31304.1| restriction modification system DNA specificity domain [Roseovarius sp. TM1035] Length = 394 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 63/407 (15%), Positives = 132/407 (32%), Gaps = 32/407 (7%) Query: 27 VVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ + G T GK I + ++D ++ + + S +I Sbjct: 4 TIPLGELVSIRGGGTPSRGKKEFWGGPIPWATVKDFKTTSLDSTLESITEDGVRKSATNI 63 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G I+ KA I D + L PK + + + +E+ Sbjct: 64 VPAGSIVVPTRMAV-GKAAINTIDVAINQDLKALLPKGEIDTR-FLLHFLLSKSNFLESQ 121 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 +GAT+ + ++P P L EQ I + + +R + + L E Sbjct: 122 AQGATVKGIKLDLLKSLPFPDLSLNEQRRIAAILDKADA----IRRKREQALNLADEFLM 177 Query: 201 ALVSYIVTKGL-NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + + NP K+ + + PF A + + + + ++ Sbjct: 178 SVFLEMFGDPIENPHNFPKEKVKLHLSKSRAGTQSGPFGAALKKHEYVPEGIPVWGVENV 237 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 Y I K K Y V G+I+ R ++ ER II++ Sbjct: 238 QYNRFIDKPRLFITEDKFNDLLRYS-VQHGDILISRAGTVG--RMCIASTSEERSIISTN 294 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS--------LKFEDVKRLPVLVPP 371 + V T ++ L L+ + L + +K + + +P Sbjct: 295 LIRVALDPASLTAEYFV----SLFSYLPGRVGALKANNKDDAFTFLNPKTLKEIEIPIPD 350 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +Q ++++ R+ + + + + SS A G+ Sbjct: 351 MTQQKRFVSILH----RVQHSIRRQGDQLAGFSDLFSSLSQRAFRGE 393 >gi|302880110|ref|YP_003848674.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] gi|302582899|gb|ADL56910.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] Length = 573 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 64/481 (13%), Positives = 133/481 (27%), Gaps = 95/481 (19%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDT 75 +PK W+ V + GR + + I + + ++ + + Sbjct: 102 ELPKGWEWVRFADLVNVLNGRAYKKEELIDAGTPVLRVGNL------FTSEHWYYSDLIL 155 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 KG +L+ + D + + L + ++ TQ Sbjct: 156 EEDKYCNKGDLLFAWSASFGPFIWDGDKAIYHYHIWKLDLYGGDLLYKRYLYTFLLEQTQ 215 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +I+A G M H + + I + +PPLAEQ I K+ D L + + Sbjct: 216 KIKAAGHGVMMIHMTKEKMEKIVVYLPPLAEQHRIAAKVDELMALCDQLENQHSNAADAH 275 Query: 196 KE-----------------------------------------KKQALVSYIVTKGLNPD 214 ++ KQ L+ V L P Sbjct: 276 EKLVSHLLGTLTQSQSAEDFSANWQRIAAYFDTLFTTDASIDALKQTLLQLAVMGKLVPQ 335 Query: 215 VKMKDSGIEWV-------------GLVPDHWEVKPFFALVTELNRKNTKLI--------- 252 ++ E + G + + P NT Sbjct: 336 DVNEEPASELLKRIHAEKVKLIAEGKMKKDKPLPPITDDEKPFELPNTWQWVKLQEVFDV 395 Query: 253 -----------ESNILSLSYGNIIQKL-ETRNMGLKPE----SYETYQIVDPGEIVFRFI 296 + ++ NI + + ++ E + V G+I+F I Sbjct: 396 RDGTHDTPKYCDIGFPLITSKNISTGILDFSDIKYISEADHLKIKDRSAVKRGDILFAMI 455 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + + I A + + L +L+ + + ++ Sbjct: 456 GSIGNPVIV--NIDTDFSIKNMALFKPYSNNICDMNYLLKYLLIAAVAMR--EQSTGAVQ 511 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + ++ +PP+ EQ I ++ D L +I + L ++ + A Sbjct: 512 SFVSLGIIRNYLYAMPPLAEQHRIIAKVDELMGLCDQLKSRITDASRLQQKLADVLVEQA 571 Query: 415 V 415 V Sbjct: 572 V 572 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 62/182 (34%), Gaps = 7/182 (3%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + E +P WE F LV LN + K E + + + Sbjct: 95 TEDEKPFELPKGWEWVRFADLVNVLNGRAYKKEELIDAGTPVLRVGNLFTSEHWYYSDLI 154 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 E + + G+++F + ++ I + +G D Y +L Sbjct: 155 LEEDKYCNKGDLLFAWSASFGPFI-----WDGDKAIYHYHIWKLDLYGGDLLYKRYLYTF 209 Query: 340 YDLC-KVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + A G G+ + E ++++ V +PP+ EQ I ++ A D L + Sbjct: 210 LLEQTQKIKAAGHGVMMIHMTKEKMEKIVVYLPPLAEQHRIAAKVDELMALCDQLENQHS 269 Query: 398 QS 399 + Sbjct: 270 NA 271 >gi|237743942|ref|ZP_04574423.1| type I restriction system specificity protein [Fusobacterium sp. 7_1] gi|229432973|gb|EEO43185.1| type I restriction system specificity protein [Fusobacterium sp. 7_1] Length = 590 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 53/391 (13%), Positives = 135/391 (34%), Gaps = 17/391 (4%) Query: 26 KVVPIKRFT-KLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSD-TSTVS 79 ++V +K ++ G + + I + ++ + G K + + + Sbjct: 191 EIVKLKDIAIEMYRGNGIKREEVREIGIPCVRYGEIYTDYGISFKKTKSYTDENLITNKK 250 Query: 80 IFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G IL+ G + + +++ P L L + + + Sbjct: 251 YIDYGDILFAITGESVEEIGKSTAYIGKEKCLVGGDVLVMKHKQDPVYLSYVLSTENAQK 310 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + H + IG I +P+PPL Q I E + + L IE Sbjct: 311 QKSKGKIKSKVVHTNATDIGEIEIPLPPLEVQKRIVEVLDNFEKICNDLNIGLPAEIEAR 370 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 +++ + ++++T + K + +K F + + + ++++ Sbjct: 371 QKQYEFYRNFLLTFKIENCTLPKTRQDKTRQDKTRQDIIKLFMYIFGYIELELGEILKIK 430 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 S I + G +TY ++ R + N + ++ Sbjct: 431 NGSDYKKFNIGNIPVYGSGGIINYIDTYIYDKESVLIPRKGSIGNLFYVDKPFWTVD--- 487 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 T Y + + Y+ + + +L K+ +G SL + ++ + +PP++EQ Sbjct: 488 -TIFYTVIDKDIVIPKYIYYYLSKVNLEKL---NTAGGVPSLTQTVLNKILIPLPPLEEQ 543 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKER 406 I ++++ + + E + I +++ Sbjct: 544 QKIVDILDRFDKLCNGISEGLPAEIEARQKQ 574 Score = 96.4 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 50/392 (12%), Positives = 113/392 (28%), Gaps = 33/392 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + +K + G K + +Y +G S Sbjct: 13 PNGVEYKELKDLCIIKKGVQLNKEKLL----------EEAEYPVINGGILPSGYWNDYNV 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + G L+ KD + ++ + Sbjct: 63 KENTITISQGGASAGYVQYIPTKFWAGAHCYYLELKDKNINYRYIYHFIKMKQDKLTSSQ 122 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GA + + K + N+ +P+PPL Q I + T + ++K+ + Sbjct: 123 VGAGIPSVEKKILENLLIPVPPLEVQDEIVRILDNFTALTAE-----LTAELTARKKQYS 177 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + K N +K I + + + + R + I Sbjct: 178 WYRDYLLKFENKIEIVKLKDIAIEMYRGNGIKREEVREIGIPCVRYGEIYTDYGISFKKT 237 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + N +D G+I+F ++ +A + + + + Sbjct: 238 KSYTDENLITNKKY----------IDYGDILFAITGESVEEIGKSTAYIGKEKCLVGGDV 287 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 V H D YL++++ + + K D+ + + +PP++ Q I Sbjct: 288 LVMKHKQDPVYLSYVLSTENAQKQKSKGKIKSKVVHTNATDIGEIEIPLPPLEVQKRIVE 347 Query: 381 VINVETARIDVL-------VEKIEQSIVLLKE 405 V++ + L +E ++ + Sbjct: 348 VLDNFEKICNDLNIGLPAEIEARQKQYEFYRN 379 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 40/140 (28%), Gaps = 3/140 (2%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 N G+ P Y V I + Y + Sbjct: 48 NGGILPSGYWNDYNVKENTITISQGGAS---AGYVQYIPTKFWAGAHCYYLELKDKNINY 104 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + K+ + S++ + ++ L + VPP++ Q +I +++ TA Sbjct: 105 RYIYHFIKMKQDKLTSSQVGAGIPSVEKKILENLLIPVPPLEVQDEIVRILDNFTALTAE 164 Query: 392 LVEKIEQSIVLLKERRSSFI 411 L ++ R + Sbjct: 165 LTAELTARKKQYSWYRDYLL 184 >gi|315171545|gb|EFU15562.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX1342] Length = 404 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 47/402 (11%), Positives = 126/402 (31%), Gaps = 22/402 (5%) Query: 24 HWKVVPIKRFT-KLNTGRTSESG------KDIIYIGLEDVESGTGKYLP--KDGNSRQSD 74 +W++ + F ++ G T ++ + +I D++ + K + + Sbjct: 8 NWELCKVGDFGREIYGGGTPKTSVKEFWSGTLPWIQSSDLKEDKVCDIKAKKHISVKAIQ 67 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ +K I + + ++ S FL + + + L + Sbjct: 68 QSSAKKISKNSIAIVTRVSVGKLV-LMPYEYATSQDFL-SISVLQVDKWFGVYSLYNKLQ 125 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + ++ + + + M PL EQ I + I + + EL Sbjct: 126 SELNSVQGTSIKGITKDELLNKKIMIPKPLKEQSKIGLFLKKIDTTIALHQRKLDQLKEL 185 Query: 195 LKEKKQALV--SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 K Q ++ + + +G + + D + T + + Sbjct: 186 KKAYLQLIIVLNSSENSTVPKLRFANFTGEWELCKLGDELALLKDGTHGTHTDSLVGPYL 245 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 S + I + + + + + + +I+ + + LR+ + + Sbjct: 246 LSAKNIKNGKINITNEDRKISQDEFDRIHSRFSLKKDDILLTIVGSIGEAAILRAPEGI- 304 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPP 371 + ++ I+ +L + + K + + D+ ++P+L P Sbjct: 305 --TFQRSVAYLRSKVINPEFLYTYITGPEFQKELKNRQVVSAQPGIYLGDLDKIPILFPK 362 Query: 372 IK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ I ++D + + + LK + S++ Sbjct: 363 TSREQQKIGTF----FQQLDQAITLHQNKLTQLKFLKKSYLQ 400 >gi|167771561|ref|ZP_02443614.1| hypothetical protein ANACOL_02933 [Anaerotruncus colihominis DSM 17241] gi|167666201|gb|EDS10331.1| hypothetical protein ANACOL_02933 [Anaerotruncus colihominis DSM 17241] Length = 388 Score = 98.7 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 64/402 (15%), Positives = 130/402 (32%), Gaps = 29/402 (7%) Query: 29 PIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + GR + + + I ++++ + + Y N + +G Sbjct: 3 TLGNVATYINGRAFKPSEWEDSGLPIIRIQNLTNFSAPY-----NYSSRELEEKYKVTRG 57 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +L+ L I D + + P + + + + L V + G+ Sbjct: 58 DLLFAWS-ASLGAHIWKGNDAWLNQHIFRVVPSEQIEKKYLYYFLLQVVAELHAKTH-GS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 M H N P+P+P L EQ I KI ++D + E E LK +QA++ Sbjct: 116 GMVHITKGPFMNTPIPVPSLPEQKRIVSKIEELFSKLDASVAELQTAKEKLKVYRQAVLK 175 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN- 263 +P K K +E + P + K KN I ++ Y N Sbjct: 176 EAF----DPVSKEK-ILLEDIIEKPRYGTSKKC-----SYAYKNGFKAVYRIPNICYQNG 225 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I + + G + + +++ ++ R + + + + Y+ Sbjct: 226 SIDHKDIKYAGFSDDELKNLDLIENDLLIIRSNGSVSLVGRSSIVKAEDCDATFAGYLIR 285 Query: 324 ----KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP---PIKEQF 376 KP + S +L + + S+ + + + VP Q Sbjct: 286 LRLKKPSEVLSKFLHYFLESHAARTYIEHVAKSTSGVNNINSNEISNLPVPKCDDFDMQA 345 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I + D + + I+ S+ + R S + A G+ Sbjct: 346 QTVVKIETNLSICDDIQQTIDTSLQQAEALRQSILKQAFEGE 387 >gi|259419409|ref|ZP_05743325.1| RmeS [Silicibacter sp. TrichCH4B] gi|259344650|gb|EEW56537.1| RmeS [Silicibacter sp. TrichCH4B] Length = 400 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 63/398 (15%), Positives = 132/398 (33%), Gaps = 32/398 (8%) Query: 36 LNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG 92 + G + + ++ DV + + + ++ S A G IL+ G Sbjct: 24 IVYGIVQPGPECPGGVPFVQSRDVGGAVDVNVLNRTSQQIAEQYRRSKIALGDILFSLRG 83 Query: 93 PYLRKAIIADFDGICSTQFLVLQPK---DVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + +I + V + + PE ++ L + + I G+T Sbjct: 84 NIGQSSITPAELDGANIARGVARIRVGAKGDPEFVRYVLQGPVLQRLIARNANGSTFREL 143 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + + +P+P L EQ+ I E + ++ L R L + AL+ Sbjct: 144 SIEELRKLPIPDVSLPEQLKIAEILRTWDEALEKLTVLRAAKERRLGALRAALL------ 197 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 +++ G+ H VT K S+ + + Sbjct: 198 ----FGRLRQKGL-------RHNWAPTRLEAVTHELTKRNGTKGLGRESVMGVTKAEGVV 246 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI- 328 Y+ + P + + + S+ E +++ Y+ + Sbjct: 247 PMREQTIAADISRYKRLPPRAFAYNPMRIN--VGSIAMNDRDEAVLVSPDYVVFACNADG 304 Query: 329 -DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 D YL L ++ + GSG +RQ + ++ L + +P + EQ I V+N Sbjct: 305 LDPDYLDHLRKTSWWAHYINSGGSGSVRQRTYYANLAALKLPLPDLDEQKAIAAVLNTAR 364 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 A L+ E+ I + ++ + +TG+ + E Sbjct: 365 A---DLIA-TEREIEAVTRQKRGLMQKLLTGEWQVEEE 398 >gi|332142825|ref|YP_004428563.1| type I site-specific deoxyribonuclease [Alteromonas macleodii str. 'Deep ecotype'] gi|327552847|gb|AEA99565.1| type I site-specific deoxyribonuclease [Alteromonas macleodii str. 'Deep ecotype'] Length = 360 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 57/382 (14%), Positives = 120/382 (31%), Gaps = 31/382 (8%) Query: 47 DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG 105 + YI ++D+ + KY D + ++ G Sbjct: 3 NNRYIQIDDLRNDNLIKYTDDD---------KGTFVEPSDVIIAWDGANAGTIGYGLEGL 53 Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 I ST + + G L + C GAT+ H + ++ +P+PPL Sbjct: 54 IGSTLARLKVIIPHIDTNYLGRFLQSKFKEI-RNNCTGATIPHVSKVHLNSLLVPVPPLP 112 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 Q I + L Q++ + + +K S + Sbjct: 113 IQKQIAAVLEKADNLRQQSQQMEQELNSLA----QSVFLDMFGDYRKDAMSLKSS----L 164 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 G V D + ++ + E +++ +K + +E YQ Sbjct: 165 GEVADVRSGVTKGQKLEGHKLTTVPY---MRVANVQDGYLDLSEIKDITVKAKDFEKYQ- 220 Query: 286 VDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLC 343 + G+++ D R + + I + V+ S + A+ +++ + Sbjct: 221 LKAGDVLMTEGGDFDKLGRGAIWSGQIANCIHQNHVFRVRLCDRYISEFFAYYLQTPFVK 280 Query: 344 KVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + F + S+ +K LP+ I +Q +I+ + L E + Sbjct: 281 QYFLKCAKKTTNLASINITQLKGLPIPDESIGKQQSFLRIID----ELKALKEANFEQQE 336 Query: 402 LLKERRSSFIAAAVTGQIDLRG 423 +S + A G++DL+ Sbjct: 337 QANAHFNSLMQRAFKGELDLKD 358 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 52/152 (34%), Gaps = 9/152 (5%) Query: 259 LSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++ IQ + RN L K + V+P +++ + ++ + Sbjct: 1 MTNNRYIQIDDLRNDNLIKYTDDDKGTFVEPSDVIIAWDGANAGTIGYGLEGLIGSTLAR 60 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + ID+ YL ++S ++ + + L V VPP+ Q Sbjct: 61 LKVIIPH---IDTNYLGRFLQS-KFKEIRNNCTGATIPHVSKVHLNSLLVPVPPLPIQKQ 116 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 I V+ + D L ++ +Q L S Sbjct: 117 IAAVLE----KADNLRQQSQQMEQELNSLAQS 144 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 56/196 (28%), Gaps = 17/196 (8%) Query: 30 IKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + + +G T + Y+ + +V+ G + ++ Sbjct: 164 LGEVADVRSGVTKGQKLEGHKLTTVPYMRVANVQDGYLDLSEIKDITVKAKDFEKYQLKA 223 Query: 84 GQILYGKLGPY---LRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G +L + G + R AI + C + F V + E +L + V Q Sbjct: 224 GDVLMTEGGDFDKLGRGAIWSGQIANCIHQNHVFRVRLCDRYISEFFAYYLQTPFVKQYF 283 Query: 138 EAICEGATM-SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + T + + + +P+P + +Q I E Sbjct: 284 LKCAKKTTNLASINITQLKGLPIPDESIGKQQSFLRIIDELKAL----KEANFEQQEQAN 339 Query: 197 EKKQALVSYIVTKGLN 212 +L+ L+ Sbjct: 340 AHFNSLMQRAFKGELD 355 >gi|332204890|gb|EGJ18955.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA47901] Length = 352 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 51/392 (13%), Positives = 116/392 (29%), Gaps = 44/392 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEILSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + L+ Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITRRKFQLDELNLLV-------- 170 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 K E G V + + L +N K + + Sbjct: 171 --------------KSRFNEMFGDVILNEKEWKVSKWNEILTIRNGKNQKQVEDADGKFP 216 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I Y IV ++ N +R + Sbjct: 217 IYGSGGI-------MGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEP 266 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I+S YL + + Y+ K+ A+ SL D+ + + +PP+ Q + + + Sbjct: 267 VLEKINSEYLFYFCQLYNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFV- 322 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 A++D I++S+ L+ + S + Sbjct: 323 ---AQVDKSQLAIQKSLEELETLKKSLMQEYF 351 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 65/185 (35%), Gaps = 19/185 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 186 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 233 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 234 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 290 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 T+ + NI +P+PPLA Q + ++D + +E L+ K++L Sbjct: 291 AVTIPSLTKSDLLNISIPLPPLALQNEFADF----VAQVDKSQLAIQKSLEELETLKKSL 346 Query: 203 VSYIV 207 + Sbjct: 347 MQEYF 351 >gi|206558820|ref|YP_002229580.1| type I restriction enzyme specificity protein [Burkholderia cenocepacia J2315] gi|198034857|emb|CAR50729.1| type I restriction enzyme specificity protein [Burkholderia cenocepacia J2315] Length = 444 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 69/439 (15%), Positives = 131/439 (29%), Gaps = 42/439 (9%) Query: 22 PKHWKVVPIKRFT-----KLNTGRTSES---GKDIIY--IGLEDVESGTGKYLPKDGNSR 71 P W+ + + TG + + V G + + Sbjct: 9 PAAWERTTLGEVVARGGGSVQTGPFGSQLHASDYVPVGIPSIMPVNIGDNRLIRDGIACI 68 Query: 72 QSDTS---TVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKD--VLPELL 124 + + I KG I+Y + G R+A++ D C T L ++ VLPE Sbjct: 69 TEVDAQRLSKHIVRKGDIIYSRRGDVERRALVRDAEDGWFCGTGCLKVRLGQGVVLPEFA 128 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +L +V + I GATM + + + IP +PPL +Q LI + A +I+ Sbjct: 129 AFYLGHPEVREWIVRHAVGATMPNLNTGIMEAIPFLLPPLPQQELIAATLGALDDKIEQN 188 Query: 185 ITERIRFIELLKEKKQALVSYIVT-----------KGLNPDVKMKDSG---IEWVGLVPD 230 L + +A G+ P +G VP Sbjct: 189 RRTNRELEGLAQAMFKAWFVDFEPVKAKASGKTSFAGMPPAAFAALPDRLTDSPLGQVPQ 248 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 WE++P LV + + + + I D G Sbjct: 249 GWEIRPIGDLVAVKGGGTPSTKVAEYWDEGTHFWATPKDLSGLQDPVLLETSRCITDAGA 308 Query: 291 IVF-------RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + L + +A + ++A+ G + L Sbjct: 309 ECISSGVLQENTVLLSSRAPVGYTALAKVPTAVNQGFIAMTCDGPLPPHYVLHWTRSMLG 368 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 ++ + + + +VP + A + L+E + L Sbjct: 369 EIKSRASGTTFPEISKGAFRPILAIVPS----AVVVQAFESFAACLFDLIEVNVRQRFSL 424 Query: 404 KERRSSFIAAAVTGQIDLR 422 +E R+ + ++G + +R Sbjct: 425 EEMRNYLLPRLLSGAVKVR 443 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 57/200 (28%), Gaps = 12/200 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKY---LPKD 67 +G +P+ W++ PI + G T + + +D+ + Sbjct: 243 LGQVPQGWEIRPIGDLVAVKGGGTPSTKVAEYWDEGTHFWATPKDLSGLQDPVLLETSRC 302 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 ++ + + + +L P A +A + F+ + LP Sbjct: 303 ITDAGAECISSGVLQENTVLLSSRAPVGYTA-LAKVPTAVNQGFIAMTCDGPLP-PHYVL 360 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + I++ G T I +P I+ + + Sbjct: 361 HWTRSMLGEIKSRASGTTFPEISKGAFRPILAIVPSAVVVQAFESFAACLFDLIEVNVRQ 420 Query: 188 RIRFIELLKEKKQALVSYIV 207 R E+ L+S V Sbjct: 421 RFSLEEMRNYLLPRLLSGAV 440 >gi|213971210|ref|ZP_03399328.1| type I site-specific deoxyribonuclease (specificity subunit) [Pseudomonas syringae pv. tomato T1] gi|213924079|gb|EEB57656.1| type I site-specific deoxyribonuclease (specificity subunit) [Pseudomonas syringae pv. tomato T1] Length = 414 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 44/403 (10%), Positives = 106/403 (26%), Gaps = 29/403 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 WK +++ + + R +++ + E +Y K +D Sbjct: 22 GWKETQLQKIARSVSDRAVTGDGDNVLSLSGEHGLVLQSEYFGKKIAGDITD--RYLKLL 79 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + +Y GI S + + + W + Sbjct: 80 RDDFVYNDRTTKASTFGTIKRLSKYSGGIVSPIYKCFRFHTGEDPVFWEWYFESGSHEAQ 139 Query: 138 EAICEGAT----MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + P EQ + E + +D I + R + Sbjct: 140 LGSLVNEGARAGRFNISIRQFLSTTAWRPDEREQQKVAEFL----SSVDDFIAAQARKVT 195 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LK K+ L + + +++ + V + N Sbjct: 196 ALKIYKKGLTQRLFPQESESQPRLRFPEFQNVEEWKVKRLSGMIELISGMHLSPNDYSTV 255 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + + T + +I+ ++ + Sbjct: 256 GEVPYFTGP---SDFTNNLSNVTKWTKRTANVSKAEDILIT---VKGSGVGEIWYSTLPE 309 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPI 372 + MA++ S ++ +++ F +GSG + L + L P + Sbjct: 310 IAMGRQLMAIRSKSGASRFMFQFLQTK--KNHFKDLGSGNMIPGLSRAVILELEASFPNL 367 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I + + +D L+ Q L+ + + Sbjct: 368 PEQQRIADCL----TSLDDLIAAQTQKHEALETYKMGLMQQLF 406 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 52/182 (28%), Gaps = 4/182 (2%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + WKV + +L +G + +G + N + T ++ Sbjct: 228 EEWKVKRLSGMIELISGMHLSPNDYSTVGEVPYF-TGPSDFTNNLSNVTKWTKRTANVSK 286 Query: 83 KGQILYGKLGPYLRKAIIADFDGIC-STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 IL G + + + I Q + ++ K + +L + + + Sbjct: 287 AEDILITVKGSGVGEIWYSTLPEIAMGRQLMAIRSKSGASRFMFQFLQTK--KNHFKDLG 344 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + I + P L EQ I + + + I + Q Sbjct: 345 SGNMIPGLSRAVILELEASFPNLPEQQRIADCLTSLDDLIAAQTQKHEALETYKMGLMQQ 404 Query: 202 LV 203 L Sbjct: 405 LF 406 >gi|225861216|ref|YP_002742725.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae Taiwan19F-14] gi|225726806|gb|ACO22657.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae Taiwan19F-14] Length = 347 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 54/362 (14%), Positives = 96/362 (26%), Gaps = 37/362 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI + L EQ I ++ + I + L+ Sbjct: 120 MKHLTKKYFDNIMVSYTNLREQQRIASELDLLSKLILRRQEQLEELNLLV---------- 169 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 K E G V + + L +N K + + I Sbjct: 170 ------------KSRFNEMFGDVILNEKEWKVSKWNEILTIRNGKNQKQVEDADGKFPIY 217 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 Y IV ++ N +R + Sbjct: 218 GSGGI-------MGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEPVL 267 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I+S YL + + Y+ K+ A+ SL D+ + + +PP+ Q + + + + Sbjct: 268 EKINSEYLFYFCQLYNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFVALV 324 Query: 386 TA 387 Sbjct: 325 DK 326 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 43/142 (30%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V ++EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIMVSYTNLREQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 185 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 232 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 233 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 289 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 290 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 320 >gi|260592891|ref|ZP_05858349.1| type I restriction-modification system, S subunit [Prevotella veroralis F0319] gi|260535180|gb|EEX17797.1| type I restriction-modification system, S subunit [Prevotella veroralis F0319] Length = 429 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 62/418 (14%), Positives = 117/418 (27%), Gaps = 49/418 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W V + + + G + K+ I +I + D E Sbjct: 11 EIPLTWAWVRLNFVSIIARGSSPRPIKEYLTDSLDGINWIKIGDTEKDGMYINSTKEKIT 70 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S + KG L + R I + DG +LV+ P + + L Sbjct: 71 VEGLSKSRLVHKGDFLLTNSMSFGRP-YITNVDGCIHDGWLVISPIGTSFKQKFLYYLLS 129 Query: 132 DVT--QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE-- 187 + GA + + + + P+PP EQ I +K+ +I+T Sbjct: 130 SGYAFSQFAGKVSGAVVKNLNSDKVAEAMFPLPPYNEQQRILDKLDVLVPKINTYGIMSD 189 Query: 188 --RIRFIELLKEKKQALVSYIVTKGLNPDVK--------------------------MKD 219 L + ++++ + L P KD Sbjct: 190 AIYDMNTSLRSKLHKSILQEAIQGKLIPQDPNDEPASVLLQRIKEEKQRLVKEGKLKKKD 249 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + D+ + ++ ++ LS+ + E + Sbjct: 250 VVDSIIYKGDDNKYYEQVDGTAIQIESDYDFPNTWAVVKLSHICRLIDGEKKEGQHICLD 309 Query: 280 ------YETYQIVDPGEIV--FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 T +D G+ V I L + + S V G + S + + Sbjct: 310 AKYLRGKSTGTYLDKGKFVAKGNNIILVDGENSGEVFTVPHDGYMGSTFKQLWISEAMHQ 369 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + + + L E L + +PP +EQ I I++ I Sbjct: 370 PYVLYFIQFYKELLRNSKKGAAIPHLNKEIFYSLLIGIPPYQEQIRIARKIDIIVNEI 427 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 73/221 (33%), Gaps = 26/221 (11%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN-------ILSLSYGNIIQ--- 266 M E +P W + + + I+ I + G+ + Sbjct: 1 MVCIDDEIPFEIPLTWAWVRLNFVSIIARGSSPRPIKEYLTDSLDGINWIKIGDTEKDGM 60 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + E ++V G+ + L N R G I ++ + P Sbjct: 61 YINSTKEKITVEGLSKSRLVHKGDFL-----LTNSMSFGRPYITNVDGCIHDGWLVISPI 115 Query: 327 GID--STYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 G +L +L+ S F SG ++L + V +PP EQ I + ++ Sbjct: 116 GTSFKQKFLYYLLSSGYAFSQFAGKVSGAVVKNLNSDKVAEAMFPLPPYNEQQRILDKLD 175 Query: 384 VETARID------VLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 V +I+ + + S+ + S + A+ G+ Sbjct: 176 VLVPKINTYGIMSDAIYDMNTSLRS--KLHKSILQEAIQGK 214 >gi|225858688|ref|YP_002740198.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae 70585] gi|225720968|gb|ACO16822.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae 70585] Length = 347 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 53/362 (14%), Positives = 96/362 (26%), Gaps = 37/362 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + + N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSTEINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI + L EQ I ++ + I + L+ Sbjct: 120 MKHLTKKYFDNIMVSYTNLREQQRIASELDLLSKLILRRQEQLEELNLLV---------- 169 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 K E G V + + L +N K + + I Sbjct: 170 ------------KSRFNEMFGDVILNEKEWKVSKWNEILTIRNGKNQKQVEDADGKFPIY 217 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 Y IV ++ N +R + Sbjct: 218 GSGGI-------MGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEPVL 267 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I+S YL + + Y+ K+ A+ SL D+ + + +PP+ Q + + + + Sbjct: 268 EKINSEYLFYFCQLYNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFVALV 324 Query: 386 TA 387 Sbjct: 325 DK 326 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 42/142 (29%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + + + IV+ G+I+ + ++ V I Sbjct: 39 TSTEINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V ++EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIMVSYTNLREQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 185 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 232 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 233 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 289 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 290 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 320 >gi|220930106|ref|YP_002507015.1| restriction modification system DNA specificity domain protein [Clostridium cellulolyticum H10] gi|220000434|gb|ACL77035.1| restriction modification system DNA specificity domain protein [Clostridium cellulolyticum H10] Length = 409 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 63/406 (15%), Positives = 137/406 (33%), Gaps = 33/406 (8%) Query: 25 WKVVPIKRFTKLN-TGRTSES------GKDIIYIGLEDVESGT--GKYLPKDGNSRQSDT 75 W+ + + G T + +I +I D+ G K + Sbjct: 17 WEQRTLGEMAEETYGGGTPSTLNKAYWNGNIPWIQSSDLVEHQLFGVSPRKYITESGVCS 76 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S + + I + K F S FL + + + + Sbjct: 77 SAAKLVPENSIAIVT-RVGVGKLATMPFAFATSQDFL-SLSNLKCEIWFFAYSIYKKLQR 134 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I+A+ + + + + EQ I + +D IT R ++ L Sbjct: 135 DIDAVQGTSIKGITKNELLSKSICAPSDILEQTSIGNFL----HLLDDAITLHKRKLDDL 190 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K+ K + + + ++ +G + W+ + +V + RKN L + Sbjct: 191 KDLKHGYLQQMFPQAGESVPLVRFAG------FTEPWQKRTLGDVVECVTRKNKGLKSTL 244 Query: 256 ILSLS-YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK-RSLRSAQVMER 313 +L++S +I + + + + + Y ++ GE + +++ E Sbjct: 245 VLTISAQHGLIAQKDFFDKEVASKDVSNYYLMKNGEFAYNKSYSNGYPWGAVKRLDNYEI 304 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVL 368 G++++ Y+ KP IDS +L + + G R ++ D + Sbjct: 305 GVLSTLYIVFKPTTIDSEFLTQYYETTHWHNEVAQYAAEGARNHGLLNIATSDFFETVLA 364 Query: 369 VP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P EQ I N + +D + EQ + LK+ +S+++ Sbjct: 365 IPTNSNEQTAIGNFFHT----LDRQIIAQEQKLNRLKQLKSAYLQK 406 >gi|304560216|gb|ADM42880.1| conserved hypothetical protein [Edwardsiella tarda FL6-60] Length = 340 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 49/317 (15%), Positives = 109/317 (34%), Gaps = 25/317 (7%) Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL-IREKIIAETVRIDTL 184 + L ++ QR A GAT++ K + N + +P ++ + I +K+ + I L Sbjct: 27 FYQLQSNLVQRQIAETLGATINQITNKDLSNFKIAVPRNKDEYIEISDKLASIDGLIIDL 86 Query: 185 ITERIRFIELLKEKKQALVS---YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 + + Q L++ + L D +K G +P+ WE + Sbjct: 87 KKIVNKKQAIKTATMQQLLTGKTRLPQFALRKDGTVKGYRRSEFGDIPEDWETSTLDNFI 146 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV----------DPGEI 291 ++L+ + + S+G I K + G + G I Sbjct: 147 SKLDAGVSVNSVNEKDIFSHGKNILKTSCVSNGYFYGHEAKSIVPDDINRAKTTPKKGCI 206 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI----DSTYLAWLMRSYDLCKVFY 347 + ++ N L + E + + D+ +LA+++ + Sbjct: 207 IISRMNTPNLVGELGYVERDEPNLYLPDRLWQMNVCQEQIIDNRWLAYILSFPLISNKLK 266 Query: 348 AMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 +G +++ + + L P EQ I +++ I L +Q + + Sbjct: 267 ETATGTSNSMKNISKDSLYSLSFPRPSKDEQTAIAAILSDMDKDIQTL----QQRLDKTR 322 Query: 405 ERRSSFIAAAVTGQIDL 421 + + + +TG+ L Sbjct: 323 QLKQGMMQELLTGKTRL 339 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 14/93 (15%), Positives = 40/93 (43%), Gaps = 5/93 (5%) Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARI 389 Y+ + ++S + + + +D+ + VP E +I++ + A I Sbjct: 24 PYVFYQLQSNLVQRQIAETLGATINQITNKDLSNFKIAVPRNKDEYIEISDKL----ASI 79 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 D L+ +++ + + +++ + +TG+ L Sbjct: 80 DGLIIDLKKIVNKKQAIKTATMQQLLTGKTRLP 112 Score = 45.6 bits (106), Expect = 0.018, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 62/213 (29%), Gaps = 21/213 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKR-FTKLNTGRTSES--GKDI-----IYIGLEDVESGTG 61 Y+ S G IP+ W+ + +KL+ G + S KDI + V +G Sbjct: 125 YRRSE---FGDIPEDWETSTLDNFISKLDAGVSVNSVNEKDIFSHGKNILKTSCVSNGYF 181 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQP 116 + KG I+ ++ L + + + + Sbjct: 182 YGHEAKSIVPDDINRAKTTPKKGCIIISRMNTPNLVGELGYVERDEPNLYLPDRLWQMNV 241 Query: 117 KDVLPELLQGWLLSIDVTQRIEAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 + + + +M + + ++ P P EQ I Sbjct: 242 CQEQIIDNRWLAYILSFPLISNKLKETATGTSNSMKNISKDSLYSLSFPRPSKDEQTAIA 301 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I TL + +L + Q L++ Sbjct: 302 AILSDMDKDIQTLQQRLDKTRQLKQGMMQELLT 334 >gi|40467|emb|CAA35604.1| HsdS polypeptide, part of CfrA family [Citrobacter freundii] Length = 578 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 70/501 (13%), Positives = 137/501 (27%), Gaps = 95/501 (18%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 +K K P+ S +P W+ V + + G++ S G Sbjct: 83 IKKQKPLPEI--SEEDKPFELPAGWEWVRLGEAFYIEMGQSXSSQYYNQSEEGIPFFQGK 140 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + K +R TS + K +L P ++ + ++ Sbjct: 141 ADFGKKYPTARYWCTSPTKLAQKNDVLLSVRAPV-GPTNLSPYHCCIGRGLAAIRCLSDA 199 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 P ++L +R+E + G T I + MPIPPL EQ+ I + I Sbjct: 200 PHEYLLYILKAS-QRRLEELATGTTFVAVSKTDIEPLLMPIPPLNEQIRIVDTIDRLMSL 258 Query: 181 -----------------------------------------IDTLITERIRFIELLKEKK 199 I + K Sbjct: 259 CDQLEQHSLTSLDAHQQLVEILLTTLTDSQNADELAKNWARISEHFDTLFTTEASIDALK 318 Query: 200 QALVSYIVTKGLNPDVKMKD-------------------------------SGIEWVGLV 228 Q ++ V L P + S E + Sbjct: 319 QTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKDGKIKKQKPLPPISDEEKPFEL 378 Query: 229 PDHWEVKPFFALVT----ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYE 281 PD WE L + + I+++ L N+ + + + E Sbjct: 379 PDGWEWCCIDDLTFVSGGIQKQPKRRPIKNHFPYLRVANVQRGNINIDELERFELEPHEL 438 Query: 282 TYQIVDPGEIVF--RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMR 338 T+ + +I+ R +E+ + + + V+ ++A + Sbjct: 439 TFWSLKKNDILIVEGNGSADEIGRCAIWLAPIEKCVYQNHLIRVRGIMDGHQEFIALYLN 498 Query: 339 SYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-VEK 395 S K + + +L ++ + + +PP+ +Q I + I + L + Sbjct: 499 SPSGIKEMQRLAVTTSGLYNLSVGKIRGITIPLPPLNQQNLILSKIREYIFICENLKISL 558 Query: 396 IEQSIVLLKERRSSFIAAAVT 416 L +A A+T Sbjct: 559 QSAQQTQLH------LADALT 573 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 60/202 (29%), Gaps = 13/202 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W+ I T ++ G + Y+ + +V+ G + + Sbjct: 377 ELPDGWEWCCIDDLTFVSGGIQKQPKRRPIKNHFPYLRVANVQRGNINIDELERFELEPH 436 Query: 75 TSTVSIFAKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL----QG 126 T K IL G R AI C Q +++ + ++ Sbjct: 437 ELTFWSLKKNDILIVEGNGSADEIGRCAIWLAPIEKCVYQNHLIRVRGIMDGHQEFIALY 496 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + + + + I I +P+PPL +Q LI KI + L Sbjct: 497 LNSPSGIKEMQRLAVTTSGLYNLSVGKIRGITIPLPPLNQQNLILSKIREYIFICENLKI 556 Query: 187 ERIRFIELLKEKKQALVSYIVT 208 + AL + Sbjct: 557 SLQSAQQTQLHLADALTDAAIN 578 >gi|92112221|ref|YP_572149.1| restriction modification system DNA specificity protein [Chromohalobacter salexigens DSM 3043] gi|91795311|gb|ABE57450.1| restriction modification system DNA specificity protein [Chromohalobacter salexigens DSM 3043] Length = 538 Score = 97.9 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 63/477 (13%), Positives = 138/477 (28%), Gaps = 83/477 (17%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 HW + + ++N + + ++ + +I + V +G+Y D + + F Sbjct: 25 HWLWIEHNQIAEINPKK-PKLDEELSVSFIPMGAVAEESGRYTTDDSKKFEDVKKGYTYF 83 Query: 82 AKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VT 134 + G IL+ K+ P + + + G ST+F V + + + + + Sbjct: 84 SDGDILFAKITPCMENGKVALLSNLTNGVGFGSTEFHVSRLTEAVEKKFYFYFFVSKSFR 143 Query: 135 QRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKII------------------ 175 ++ +A G+ N+ +P+ P EQ I KI Sbjct: 144 KQAQANMAGSAGQLRVTTDYFSNVSVPLCPTREQQRIVTKIEELFSEIDSGVESLKTAQA 203 Query: 176 ----------------AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK--- 216 T + +R E L E+ QA + L Sbjct: 204 KLKTARQSLLKAAFEGKLTEQWRKDNADRQESPEALLERIQAEREAHYQQQLTDWQHQLK 263 Query: 217 -----------------------MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI- 252 + + + +P+ W+ + Sbjct: 264 DWEAAGKEGKKPRKPKVPKALPPLTQQELAELPELPEGWKWINLGNISEISGGITKNQKR 323 Query: 253 ---ESNILSLSYGNII-QKLETRNMGLK--PESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 L N+ KLE ++ + +++ + D+ Sbjct: 324 QSLPQKNPFLRVANVYANKLELDDIHFIGTTPDEAKRAKLKKDDLLIVEGNGSPDQIGRV 383 Query: 307 SAQVM--ERGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR--QSLKFED 361 + E + + + S + S K + S +L Sbjct: 384 AKWDGSIEHCTHQNHLIRSRLASPISADFVLHFLLSATGRKAIKKVASSTSGLYTLSLAK 443 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 V++L + V EQ I + + +++D L + S+ + + S + A G+ Sbjct: 444 VEKLCIPVCSKNEQMMIVDQLESRLSQLDQLERTLTASMKQAEALKQSILKRAFAGR 500 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 25/221 (11%), Positives = 72/221 (32%), Gaps = 13/221 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQ 72 + +P+ WK + + ++++ G T + ++ + +V + + Sbjct: 295 LPELPEGWKWINLGNISEISGGITKNQKRQSLPQKNPFLRVANVYANKLELDDIHFIGTT 354 Query: 73 SDTSTVSIFAKGQILY----GKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVL--PELL 124 D + + K +L G R A + + + +L Sbjct: 355 PDEAKRAKLKKDDLLIVEGNGSPDQIGRVAKWDGSIEHCTHQNHLIRSRLASPISADFVL 414 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 L + + + + + + +P+ EQ++I +++ + ++D L Sbjct: 415 HFLLSATGRKAIKKVASSTSGLYTLSLAKVEKLCIPVCSKNEQMMIVDQLESRLSQLDQL 474 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 ++ + KQ+++ L P + E + Sbjct: 475 ERTLTASMKQAEALKQSILKRAFAGRLVPQDPDDEPASELL 515 >gi|111222733|ref|YP_713527.1| Type I restriction modification enzyme protein S [Frankia alni ACN14a] gi|111150265|emb|CAJ61962.1| Type I restriction modification enzyme protein S [Frankia alni ACN14a] Length = 399 Score = 97.9 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 57/407 (14%), Positives = 125/407 (30%), Gaps = 31/407 (7%) Query: 28 VPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY---LPKDGNSRQSDTSTV 78 P+ F ++ +G T ++ G +I + D+ S K+ + + Sbjct: 7 TPLGEFCEIISGATPKTASEEYWGGEIPWATPRDLGSLNSKFLASTSRAITEAGLRSCAT 66 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + G +L P I + F L P + R++ Sbjct: 67 HVLPAGSVLLTSRAPI-GSVAINARPMATNQGFKSLVPDTSRALPGYLYHWLRCQRSRLQ 125 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ GAT I +P+PPL+EQ I + + R E Sbjct: 126 SLGNGATFKELSKSATARIAVPLPPLSEQKRIEQMLDQADTIRARRRETIARLE----EL 181 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q++ S + NP + + + + + R + Sbjct: 182 AQSIFSVMFG---NPVQNERGWRRVPLSELVVRIDSGRSPVCLDRPARPGEWGVLKLGAV 238 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 S + + + + V PG+++F + + + ++ Sbjct: 239 TS---CVYRAGENKALPPDVAAFSACEVRPGDLLFSRKNTRELVAACALVDATPARLLLP 295 Query: 319 AYM----AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPP 371 + +D YL L+ + + + SG ++ + L + +PP Sbjct: 296 DLIFRLVVEPRSAVDPVYLHRLLTHPEKRRKVQGLASGSSASMPNISKSRLLGLEIELPP 355 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ Q + N + ++ + + S+V E +S A G+ Sbjct: 356 MEVQKEFANRVRA----LERIKVAHQASLVEQDELVASLAHRAFRGE 398 >gi|94267246|ref|ZP_01290822.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] gi|93452076|gb|EAT02762.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] Length = 578 Score = 97.9 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 76/466 (16%), Positives = 138/466 (29%), Gaps = 91/466 (19%) Query: 21 IPKHWKVVPIKRFTKL-NTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +PK W+ V + T T + D + LED+E + K L K + S+ Sbjct: 101 LPKGWEWVRLGDVTNYGVTEKAEPGETSPDTWVLELEDIEKESSKLLQKVFQRDRQFKSS 160 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQR 136 + F +G +LYGKL PYL K +IAD G+C+T+ + ++ L + + Sbjct: 161 KNKFIRGDVLYGKLRPYLDKVLIADASGVCTTEIMPIRAFTGLQSEYLRLSLKTPNFKNY 220 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK----------------------- 173 G + + +PP EQ I EK Sbjct: 221 ATNSTHGMNLPRLGTDKARLALLALPPAPEQSRIVEKVDELMALCDRLEQQTSDQLAAHE 280 Query: 174 ------------------IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV 215 + A R+ T + KQ ++ V L P Sbjct: 281 TLVETLLDTLTRSADATELAANWTRLQTHFDTLFTTESSIDRLKQTILQLAVMGRLVPQD 340 Query: 216 KMKD-------------------------------SGIEWVGLVPDHWEVKPFFALVTEL 244 ++ S E +PD WE F + Sbjct: 341 PNEEPASALLKKIAAEKARLVKEGKIKKTKPLPEISEEEKPFALPDGWEWCRFTDIGELA 400 Query: 245 NRKNTK--------LIESNILSLSYGNI---IQKLETRNMGLKPESYETYQIVDPGEIVF 293 K+ I + G++ +K+ T E ++ G + Sbjct: 401 RGKSKHRPRNDPALYIGGKTPLVQTGDVARADRKITTFTALYNQAGVEQSKLWKAGTLCI 460 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 D L ++ + + Y + +R+ + S Sbjct: 461 TIAANIGDTGILGFDACFPDSVVG---FTPFDDRLKNEYFEYFLRTAK-KNLEEFAPSTA 516 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 ++++ E ++ + V +PP +E I + D + Q+ Sbjct: 517 QKNINLEVLQNVLVPLPPARELVRIVEKTDKLMGLCDQFKASLSQA 562 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 57/195 (29%), Gaps = 10/195 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS--ESGKDIIYIG-------LEDVESGTGKYLPKDGNSR 71 +P W+ +L G++ +YIG DV K Sbjct: 384 LPDGWEWCRFTDIGELARGKSKHRPRNDPALYIGGKTPLVQTGDVARADRKITTFTALYN 443 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 Q+ ++ G + + + I FD + P D + Sbjct: 444 QAGVEQSKLWKAGTLCIT-IAANIGDTGILGFDACFPDSVVGFTPFDDRLKNEYFEYFLR 502 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +E + + + + N+ +P+PP E V I EK D + Sbjct: 503 TAKKNLEEFAPSTAQKNINLEVLQNVLVPLPPARELVRIVEKTDKLMGLCDQFKASLSQA 562 Query: 192 IELLKEKKQALVSYI 206 + A ++ I Sbjct: 563 CQTQHHLTGATMAQI 577 Score = 40.9 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 19/133 (14%), Positives = 45/133 (33%), Gaps = 4/133 (3%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + G++++ + DK + A + T G+ S YL +++ Sbjct: 158 KSSKNKFIRGDVLYGKLRPYLDKVLIADASGV---CTTEIMPIRAFTGLQSEYLRLSLKT 214 Query: 340 YDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + G+ L + + + +PP EQ I ++ A D L ++ Sbjct: 215 PNFKNYATNSTHGMNLPRLGTDKARLALLALPPAPEQSRIVEKVDELMALCDRLEQQTSD 274 Query: 399 SIVLLKERRSSFI 411 + + + + Sbjct: 275 QLAAHETLVETLL 287 >gi|169825230|ref|YP_001692841.1| putative type I restriction enzyme S protein [Finegoldia magna ATCC 29328] gi|167832035|dbj|BAG08951.1| putative type I restriction enzyme S protein [Finegoldia magna ATCC 29328] Length = 410 Score = 97.9 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 47/421 (11%), Positives = 121/421 (28%), Gaps = 41/421 (9%) Query: 27 VVPIKRFTKLNTGRTSESGKDII----YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 V + N G +S + + + D + +D I Sbjct: 5 EVKLGDIITYNKGYAFKSNEYTNTGKMVVRVTDFTLDSIS-DNDSVYLEPNDKYKKFIIN 63 Query: 83 KGQILYGKLGPY--------LRKAIIADFD--GICSTQFLVLQPKDVLPELLQGW--LLS 130 IL +G + + + D + + + P + + Sbjct: 64 TNDILIQTVGSWANNPNSIVGKVVRVPDKCNKAYLNQNIVRIIPNRDFNNTYLYYALKAN 123 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 T + A + I L+EQ I + + + I+ Sbjct: 124 QFSTYCVLRGQGAANQASITLDTIFKFKFRAHLLSEQKRIADILSSYDNLIENNNKRIKL 183 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT---ELNRK 247 ++ + + G +E+ +P WE + K Sbjct: 184 LEQMAENLYKEWFVRFRFPG--------YEDVEFENGIPKGWEEVRLGEFINLASGYAFK 235 Query: 248 NTKLIESNILSLSYGNIIQ-KLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKR 303 + + + + +I K++ N+ E V G+I+ K Sbjct: 236 SDWWTDQGVPVIKIKDIQNGKIDLTNLDYVSEDNAQKAKNFYVGKGDILIALTGATIGKV 295 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFED 361 + + + + KP + Y+ L + + ++ + ++ D Sbjct: 296 GIVTHDNVLVNQRVGKFFIKKPSIKNIGYIYSLFKQNWIQELIVMYSGSNAAQPNISPFD 355 Query: 362 VKRLPVLVPPIKEQFDI-TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 +++ ++ + + NV I + K+ + LL+++R + ++G+++ Sbjct: 356 IEKFKIIY------NKVYVDKFNVIVYPIYDSIIKLYEKNELLEKQRDLLLPRLMSGKLE 409 Query: 421 L 421 + Sbjct: 410 V 410 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 80/211 (37%), Gaps = 7/211 (3%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTG 61 +P Y+D V++ IPK W+ V + F L +G +S + + I ++D+++G Sbjct: 200 RFPGYED--VEFENGIPKGWEEVRLGEFINLASGYAFKSDWWTDQGVPVIKIKDIQNGKI 257 Query: 62 KYLPKDGNSRQSDTSTV-SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 D S + KG IL G + K I D + Q + Sbjct: 258 DLTNLDYVSEDNAQKAKNFYVGKGDILIALTGATIGKVGIVTHDNVLVNQRVGKFFIKKP 317 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 G++ S+ I+ + + S+A I + + + +K Sbjct: 318 SIKNIGYIYSLFKQNWIQELIVMYSGSNAAQPNISPFDIEKFKIIYNKVYVDKFNVIVYP 377 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGL 211 I I + ELL++++ L+ +++ L Sbjct: 378 IYDSIIKLYEKNELLEKQRDLLLPRLMSGKL 408 >gi|159028181|emb|CAO89788.1| hsdS [Microcystis aeruginosa PCC 7806] Length = 406 Score = 97.9 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 68/425 (16%), Positives = 129/425 (30%), Gaps = 49/425 (11%) Query: 21 IPKHWKVVPIKRFTK-----LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +PK W +V + + G S K I+ VESG Y ++ Sbjct: 3 LPKTWSLVALGDIAAHEKGAIRRGPFGGSLKKEIF-----VESGFKVYEQQNAIKDDFQI 57 Query: 76 STVSI------------FAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLP 121 I ++ G + AI+ G+ + + ++P + Sbjct: 58 GNYFIDEDKFREMEGFNVKPHDLIISCAGTIGKVAIVPYEALPGVINQALMRIRPNPEII 117 Query: 122 ELLQ--GWLLSIDVTQRIEAICEGATMSHAD-WKGIGNIPMPIPPLAEQVLIREKIIAET 178 L S + I G+ + + I +P+PPL EQ I + Sbjct: 118 LCRYLKWLLESPKYQRDIFGKSAGSALKNLAAISEIKKCKIPLPPLEEQRRIAAILDKAD 177 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 EL L S + +P K I +G + F Sbjct: 178 GVRRKRKEAIRLTEEL-------LRSTFLEMFGDPVTNPKGWEIVKLGSLVVGQPNNGIF 230 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 E + L G I E+R + E + + + G+I+F L Sbjct: 231 KKNHEYGGDTPV---VWVKELFSGYTIDCSESRTLTPTDEEVKKFGLT-KGDILFCRSSL 286 Query: 299 QNDKRSLRSAQVMERGI----ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-L 353 D + + + ++S +L +L+ L K A + Sbjct: 287 NRDGIGFNNVFDGMDFSALFECHIIRVRLNQKKVNSIFLNYLLHFPGLRKQIIAKANTVT 346 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ ++K++ +PP + Q + + +I K+E + +S + Sbjct: 347 MSTIGQSEIKKIEFYLPPKELQ----DKFEIFLRKIATNRTKLENK--ESENLFNSLLQR 400 Query: 414 AVTGQ 418 A G+ Sbjct: 401 AFRGE 405 >gi|152991445|ref|YP_001357167.1| type I restriction-modification system, S subunit [Nitratiruptor sp. SB155-2] gi|151423306|dbj|BAF70810.1| type I restriction-modification system, S subunit [Nitratiruptor sp. SB155-2] Length = 373 Score = 97.9 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 60/409 (14%), Positives = 116/409 (28%), Gaps = 47/409 (11%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK + ++ + + + + G Y P G S D IF Sbjct: 4 WKEYKLNEIAEIFDHKRIP-------LSTMERQKRKGIY-PYYGASGIIDYIDDFIFDGE 55 Query: 85 QILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +L + G LR A IA + V++ K+ L + Sbjct: 56 YVLISEDGENLRTRQSPIAFIAKGKFWVNNHAHVIKGKNNYLNKLIVYYFKNLNLNPFL- 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 GA + + +IP+ +P EQ I + + +ID LL + Sbjct: 115 --TGAVQPKLNKTTLLSIPIYLPEDMSEQKAIASVLSSFDDKID-----------LLHRQ 161 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q L + IE + + F + + K + E Sbjct: 162 NQTLEQMA-------QTLFRKWFIEEAKEDWEEGFLPDEFDFLMGHSPKGSSFNEYGFGI 214 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 Y + ++ + +L E+ I Sbjct: 215 PMYQGNADFGFRFPKKRIFTTEPKRFAEKFDTLISVRAPVGEQNMAL------EKCCIGR 268 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + Y + L + S+ D ++L +++PPI Sbjct: 269 GLARFRYKLNPNFYSYTYYKLKYLINKIKLFNDEGTVFGSISKGDFQKLEIMIPPID--- 325 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 I + ID + + I LK+ R + + ++G+I ++ Sbjct: 326 -IIEKFQQQVKPIDDKIIQNSLQIQTLKKLRDTLLPKLMSGEIRIKNAE 373 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 20/182 (10%), Positives = 46/182 (25%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + G + + Y + G + + R T Sbjct: 183 EDWEEGFLPDEFDFLMGHSPKGSSFNEYGFGIPMYQGNADFGFRFPKKRIFTTEPKRFAE 242 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K L P + + + I + + L + + E Sbjct: 243 KFDTLISVRAPVGEQNMALEKCCIGRGLARFRYKLNPNFYSYTYYKLKYLINKIKLFNDE 302 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + + IPP+ ++++ +I + +L L Sbjct: 303 GTVFGSISKGDFQKLEIMIPPIDIIEKFQQQVKPIDDKIIQNSLQIQTLKKLRDTLLPKL 362 Query: 203 VS 204 +S Sbjct: 363 MS 364 >gi|157156744|ref|YP_001461441.1| type I restriction modification DNA specificity domain-containing protein [Escherichia coli E24377A] gi|157078774|gb|ABV18482.1| type I restriction modification DNA specificity domain protein [Escherichia coli E24377A] Length = 471 Score = 97.9 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 54/458 (11%), Positives = 135/458 (29%), Gaps = 65/458 (14%) Query: 30 IKRFTK-LNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + ++ G T+ + + ++ + D++ G + + + + G Sbjct: 10 LTDICDDVSYGYTASANEQCIGPKFLRITDIQGGLCNWNAVPYCNIDAKNKSKYNLEIGD 69 Query: 86 ILYGKLG-PYLRKAIIADFDGICSTQF---LVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I+ + G II D + + P + L + + + Sbjct: 70 IVIARTGNSTGENYIIQDDIDSVFASYLIRYRINKSIADPYFVWLNLRTDNWWSYVNGAK 129 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ + A+ K +G+ P+ +P L QV I + +I ++ + ++ Sbjct: 130 TGSAQAGANAKVLGSYPLSLPSLTRQVGISKLFKIINGKIFENTKINQTLEQMAQALFKS 189 Query: 202 LVSY------------------------------------IVTKGLNPDVK--MKDSGI- 222 V + +P+ +K + Sbjct: 190 WFVNFEPVKAKMAVLEAGGSQEDATLAAMTAISGKNADALAVFEREHPEQYAELKATAEL 249 Query: 223 -------EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 +G +P+ W + A + + + + N+ Sbjct: 250 FPLAMQDSELGEIPEGWTLSEIGAQIDIAGGATPSTKTPDFWDNGDIHWTTPKDLSNVKD 309 Query: 276 KPESYETYQIVDPGE-------IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 K + +I G + + + + A I Y+A+K + Sbjct: 310 KILLHTERKITKAGLGKISSGLLPVNTVLMSSRAPVGYLAIAKVPVAINQGYIAMKCNKE 369 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 S S ++ ++ + ++ +P++ PP++ + + Sbjct: 370 LSPEFVLQWCSANMPEIISRASGTTFAEISKKNFNPIPLVKPPLEL----VKNYTKQVSA 425 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I L+E + L E R + + ++G+I L Q Sbjct: 426 IYSLIENTMRENNSLTELRDTLLPKLLSGEITLPEAEQ 463 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 62/206 (30%), Gaps = 15/206 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTG 61 +DS +G IP+ W + I + G T + DI + +D+ + Sbjct: 253 AMQDSE---LGEIPEGWTLSEIGAQIDIAGGATPSTKTPDFWDNGDIHWTTPKDLSNVKD 309 Query: 62 KY---LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 K + + + +L P IA + ++ ++ Sbjct: 310 KILLHTERKITKAGLGKISSGLLPVNTVLMSSRAPV-GYLAIAKVPVAINQGYIAMKCNK 368 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L I + G T + K IP+ PPL +++ A Sbjct: 369 EL-SPEFVLQWCSANMPEIISRASGTTFAEISKKNFNPIPLVKPPLELVKNYTKQVSAIY 427 Query: 179 VRIDTLITERIRFIELLKEKKQALVS 204 I+ + E EL L+S Sbjct: 428 SLIENTMRENNSLTELRDTLLPKLLS 453 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 18/187 (9%), Positives = 51/187 (27%), Gaps = 8/187 (4%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL---KPESYETYQ 284 P + + V+ + L +I L N ++ Sbjct: 4 EPKEYCLTDICDDVSYGYTASANEQCIGPKFLRITDIQGGLCNWNAVPYCNIDAKNKSKY 63 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 ++ G+IV + + + + D ++ +R+ + Sbjct: 64 NLEIGDIVIARTGNSTGENYIIQDDIDSVFASYLIRYRINKSIADPYFVWLNLRTDNWWS 123 Query: 345 VFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 A + + + P+ +P + Q I + I+ + + + L Sbjct: 124 YVNGAKTGSAQAGANAKVLGSYPLSLPSLTRQVGI----SKLFKIINGKIFENTKINQTL 179 Query: 404 KERRSSF 410 ++ + Sbjct: 180 EQMAQAL 186 >gi|301382338|ref|ZP_07230756.1| restriction modification system DNA specificity subunit [Pseudomonas syringae pv. tomato Max13] Length = 424 Score = 97.9 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 44/403 (10%), Positives = 106/403 (26%), Gaps = 29/403 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 WK +++ + + R +++ + E +Y K +D Sbjct: 32 GWKETQLQKIARSVSDRAVTGDGDNVLSLSGEHGLVLQSEYFGKKIAGDITD--RYLKLL 89 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + +Y GI S + + + W + Sbjct: 90 RDDFVYNDRTTKASTFGTIKRLSKYSGGIVSPIYKCFRFHTGEDPVFWEWYFESGSHEAQ 149 Query: 138 EAICEGAT----MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + P EQ + E + +D I + R + Sbjct: 150 LGSLVNEGARAGRFNISIRQFLSTTAWRPDEREQQKVAEFL----SSVDDFIAAQARKVT 205 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LK K+ L + + +++ + V + N Sbjct: 206 ALKIYKKGLTQRLFPQESESQPRLRFPEFQNVEEWKVKRLSGMIELISGMHLSPNDYSTV 265 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + + T + +I+ ++ + Sbjct: 266 GEVPYFTGP---SDFTNNLSNVTKWTKRTANVSKAEDILIT---VKGSGVGEIWYSTLPE 319 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPI 372 + MA++ S ++ +++ F +GSG + L + L P + Sbjct: 320 IAMGRQLMAIRSKSGASRFMFQFLQTK--KNHFKDLGSGNMIPGLSRAVILELEASFPNL 377 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I + + +D L+ Q L+ + + Sbjct: 378 PEQQRIADCL----TSLDDLIAAQTQKHEALETYKMGLMQQLF 416 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 52/182 (28%), Gaps = 4/182 (2%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + WKV + +L +G + +G + N + T ++ Sbjct: 238 EEWKVKRLSGMIELISGMHLSPNDYSTVGEVPYF-TGPSDFTNNLSNVTKWTKRTANVSK 296 Query: 83 KGQILYGKLGPYLRKAIIADFDGIC-STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 IL G + + + I Q + ++ K + +L + + + Sbjct: 297 AEDILITVKGSGVGEIWYSTLPEIAMGRQLMAIRSKSGASRFMFQFLQTK--KNHFKDLG 354 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + I + P L EQ I + + + I + Q Sbjct: 355 SGNMIPGLSRAVILELEASFPNLPEQQRIADCLTSLDDLIAAQTQKHEALETYKMGLMQQ 414 Query: 202 LV 203 L Sbjct: 415 LF 416 >gi|269797186|ref|YP_003311086.1| restriction modification system DNA specificity domain protein [Veillonella parvula DSM 2008] gi|269093815|gb|ACZ23806.1| restriction modification system DNA specificity domain protein [Veillonella parvula DSM 2008] Length = 400 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 53/407 (13%), Positives = 128/407 (31%), Gaps = 21/407 (5%) Query: 26 KVVPIKRFT-KLNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNS---RQSDTSTV 78 + V ++ ++ G +S I +I + ++ S Sbjct: 4 QTVRLQDLCISISDGDHQAPPKSNSGIPFITISNITSMNQLDFSSSMFVPRWYYEKLDIK 63 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICST---QFLVLQPKDVLPELLQGWLLSIDVTQ 135 K ILY +G + + + L + ++P+ L +L Sbjct: 64 RTAQKNDILYSVVGSFGIPVFMKNSIEFVFQRHIALLRPNIEKIVPQYLYYKILDRAFYM 123 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +++ GA + NI + IP + +Q I + + A I+ + E + Sbjct: 124 MADSLAIGAAQRTITLSSLRNIEINIPEVEQQQSIVDILSAYDDLIENNQKQIKLLEETV 183 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + + + G + + W D + + T L + Sbjct: 184 QRLYKEWFIDLRFPGHGNGEIIDGLPLGWHEDTIDTKVNLLNGFAFKSKDLEETGLFKLV 243 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + + + P+ + +D G+++ + + Sbjct: 244 TIKNVQDGYFEGKNVKYLSKIPDKMPRHCHLDEGDLLLSLTGNVGRVCIV----EGNDFL 299 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKE 374 + + Y L RS +L + +G +Q++ + ++ L P Sbjct: 300 LNQRVAKISSET--PAYTYCLFRSNELLVKINNIANGAAQQNVSPIRIGQIKHLFPND-- 355 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I + + I V ++++I+LL+E R + + G+I++ Sbjct: 356 -KLIMDF-ERVSGPILKRVVLMKKNIILLEEARDRLLPKLMNGEIEV 400 >gi|209527350|ref|ZP_03275858.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] gi|209492208|gb|EDZ92555.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] Length = 440 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 62/436 (14%), Positives = 138/436 (31%), Gaps = 44/436 (10%) Query: 27 VVPIKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +P+ + + I I ++++G + S ++ + Sbjct: 7 WIPLSQLCEAIVDCEHKTAPVQDSGIPSIRTTNIKNGRLDLENANLVSEETYKLWTARLE 66 Query: 83 K--GQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ + P I+ +C T + K + P L LL+ ++ + Sbjct: 67 PQPNDLILAREAPVGEVGIVPRGKRVCLGQRTVLIRPDGKKLFPRYLLYLLLTPEMRHEM 126 Query: 138 EAICEGATMSHADWKGIGNI-PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 EG+ + H + I N P PPL EQ I + +I+ + + Sbjct: 127 TCRAEGSVVPHLNMSDIRNFEIPPPPPLDEQKAIAHILGTLDDKIELNQQMNRTLEAIAR 186 Query: 197 EKKQALVSYI----------VTKGLNPDVKMKDSGIEW---VGLVPDHWEVKPFFALVTE 243 ++ G++ ++ + +G +P W + + Sbjct: 187 AIFKSWFIDFDPVRAKMDGRQPVGMDAEMAVLFPDEFEDSPLGQIPKGWTYQAANCIANI 246 Query: 244 LNRKNTKLIESNILSLSYGN--IIQKLETRNMGLKPESYETY-----------QIVDPGE 290 K E SL+ N + + G+ + Y +IV Sbjct: 247 GIGKTPPRKEQAWFSLNLKNIRWVSIRDMGASGVFIRKTKEYLIPDALHKFSIKIVPDNT 306 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ F V I + + S YL + +D ++ Sbjct: 307 VLLSFKLTIGRVVLTDGEMVTNEAI--AHFKLPVYTPFSSEYLYLYLEKFDYNQL--GNT 362 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 S + Q++ + +K +P+L P I N + A I +++ +Q L R + Sbjct: 363 SSIAQAVNSKIIKEMPILNPGAD----ILNTFSCRIASIFRKIKQTQQESETLSSIRDTL 418 Query: 411 IAAAVTGQIDLRGESQ 426 + ++G+I ++ + Sbjct: 419 LPKLLSGEIRVKDAEK 434 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/216 (12%), Positives = 61/216 (28%), Gaps = 21/216 (9%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG---------KDIIYIGLEDV-- 56 +++DS +G IPK W + G+T K+I ++ + D+ Sbjct: 221 DEFEDSP---LGQIPKGWTYQAANCIANIGIGKTPPRKEQAWFSLNLKNIRWVSIRDMGA 277 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + + ++ I +L + + ++ D + + + + Sbjct: 278 SGVFIRKTKEYLIPDALHKFSIKIVPDNTVLLS-FKLTIGRVVLTDGEMVTNEAIAHFKL 336 Query: 117 KDVLP-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 P +L + I +P L I Sbjct: 337 PVYTPFSSEYLYLYLEKFDYNQLGNTSSIAQAVNSK-----IIKEMPILNPGADILNTFS 391 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 I I + + E L + L+ +++ + Sbjct: 392 CRIASIFRKIKQTQQESETLSSIRDTLLPKLLSGEI 427 >gi|16799598|ref|NP_469866.1| hypothetical protein lin0523 [Listeria innocua Clip11262] gi|16412963|emb|CAC95755.1| lin0523 [Listeria innocua Clip11262] Length = 397 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 53/391 (13%), Positives = 123/391 (31%), Gaps = 30/391 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ ++ G+ E +D K++ +G ++ V G Sbjct: 20 WEQRKLRDIANYRNGKAHEQVEDED----GKYTIINSKFISTNGKVQRYTNEQVEPIFDG 75 Query: 85 QILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +I KA + D + + + P + + + + ++ + Sbjct: 76 EIAMVLSDLPNGKALAKLFLVKEDGKYTLNQRIAGITPNENIDPIFLNFRMNRN--NYFL 133 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G T ++ + N P EQ I ++D I R ++ LK Sbjct: 134 KFDSGVTQTNLSKSQVENFIALYPTFDEQYKIGLF----FTQLDDTIALHQRKLDTLKLM 189 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+ L+ + K K++ + + + + E K + + L+ Sbjct: 190 KKGLLQQMFPKRGENIPKIRFDDFDDIWEQ------RILGEFLKESKIKGSNGSLAKKLT 243 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + + + S Y I G+ ++ +D N + ++ Sbjct: 244 VKL--WRKGVVPKEEIYTGSSATQYYIRKTGQFIYGKLDFLNQAFGIIPLELDGYESTLD 301 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQF 376 + I+ T+L + K + +G R + + + +P+ +P EQ Sbjct: 302 SPAFDIEESINETFLLEYVSLARFYKYQGNIANGSRRAKRIHTDTFFEMPIPLPNSNEQQ 361 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERR 407 I + +ID L+ + + L + Sbjct: 362 KIGTF----SRQIDDLIALQQNKLEKLSSLK 388 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 61/186 (32%), Gaps = 9/186 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + WE + + N K + +E + N K + N ++ + E + + G Sbjct: 18 EAWEQRKLRDIANYRNGKAHEQVEDEDGKYTIIN--SKFISTNGKVQRYTNEQVEPIFDG 75 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERG--IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 EI DL N K + V E G + + P+ + + R Sbjct: 76 EIAMVLSDLPNGKALAKLFLVKEDGKYTLNQRIAGITPNE-NIDPIFLNFRMNRNNYFLK 134 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + +L V+ L P EQ+ I ++D + ++ + LK + Sbjct: 135 FDSGVTQTNLSKSQVENFIALYPTFDEQYKIGLF----FTQLDDTIALHQRKLDTLKLMK 190 Query: 408 SSFIAA 413 + Sbjct: 191 KGLLQQ 196 >gi|307708293|ref|ZP_07644760.1| sty sbli [Streptococcus mitis NCTC 12261] gi|307615739|gb|EFN94945.1| sty sbli [Streptococcus mitis NCTC 12261] Length = 385 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 45/399 (11%), Positives = 106/399 (26%), Gaps = 27/399 (6%) Query: 29 PIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYL--PKDGNSRQSDTSTVSI 80 + + +G T ++ DI ++ + D + K + S + Sbjct: 4 KLSDVVTIISGGTPKTSVKEYWDGDIDWLAVADFNTSNRYVSTASKKITELGLNNSNTKM 63 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG ++ G A + + L+ K E + + + Sbjct: 64 LEKGDLIISARGTVGAIAQLTKPMA-FNQSCFGLRGKKNKLETDYLYYWLKNYVDILLNK 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 +G+ + + +I + +P + Q I + +I Sbjct: 123 SQGSVFNTINLSTFDDIKIDLPNIENQRSISNFLTLLDNKIQINNQINQEL--------- 173 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + K L ++ + G K + + +E L+ Sbjct: 174 ----EAMAKTLYDYWFVQFDFPDQNGKPYKSSGGKMVYHPELKREIPEGWGVEKLKYFLT 229 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N ++ + L K +L + + T Sbjct: 230 IKNGKDHKHLQDGKFAVYGSGGIMRTVADYLYSGESILFPRKGTLNNVMYVNEEFWTVDT 289 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 M +++ L ++ S S+ + L ++VP +E +I Sbjct: 290 MFYSEVNKNNSAL-YVFYSVKDIDFNKLNTGTGVPSMTSSILYDLNIIVP--EE--NILE 344 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 N + ++ L + R + + GQ+ Sbjct: 345 KFNTIVKQNYETIKLNNIQNQELNQLRDWLLPMLMNGQV 383 Score = 44.4 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 55/184 (29%), Gaps = 22/184 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W V +K F + G+ + +D + G+ T Sbjct: 214 EIPEGWGVEKLKYFLTIKNGKDHKHLQDGKF--------------AVYGSGGIMRTVADY 259 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +++ IL+ + G + + T F K+ + + ID Sbjct: 260 LYSGESILFPRKGTLNNVMYVNEEFWTVDTMFYSEVNKNNSALYVFYSVKDIDF----NK 315 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + G + +I + + + I EK + I + L + + Sbjct: 316 LNTGTGVPSMTS----SILYDLNIIVPEENILEKFNTIVKQNYETIKLNNIQNQELNQLR 371 Query: 200 QALV 203 L+ Sbjct: 372 DWLL 375 >gi|223934052|ref|ZP_03626004.1| restriction modification system DNA specificity domain protein [Streptococcus suis 89/1591] gi|223897279|gb|EEF63688.1| restriction modification system DNA specificity domain protein [Streptococcus suis 89/1591] Length = 425 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 72/402 (17%), Positives = 149/402 (37%), Gaps = 42/402 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDI--IYIGLEDVESGTG------------KYLPKDGNS 70 WK + + S S + + ++++ G K LP S Sbjct: 20 WKQRKLGEVADFSIKTNSLSRDKLSSYFYEVQNIHYGDILTKYDAILDVCNKELPSIIGS 79 Query: 71 RQSDTSTVSIFAKGQILYGK---LGPYLRKAIIADFDG--ICST-QFLVLQPKDVLPELL 124 SD + + ++G I++ + + +F G + S +V +PK Sbjct: 80 TISDFADA-LLSEGDIVFADAAEDSTVGKAIEVRNFKGKNVVSGLHTIVARPKVSYAPYY 138 Query: 125 QGWLLSI-DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 G+L++ +I + +G +S + + + P L EQ I +D Sbjct: 139 LGYLINSTAYHNQILPLMQGTKVSSISKANLKSTTVVFPTLPEQEAIGSF----FSDLDQ 194 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 LIT R ++ +KE K+AL+ + KG D D W+ + + + Sbjct: 195 LITLHQRKLDDVKELKKALLQKMFPKGNGNDFP-----ELRFPEFTDAWKQRKLGEVAEK 249 Query: 244 LNRKN--TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQN 300 +++KN + +E+ S +G I Q+ ++ Y IV+P + V+ I Sbjct: 250 ISQKNLDRQYVETFTNSAEFGIISQRDFFEKNISSLDNISGYYIVNPDDFVYNPRISNLA 309 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQS 356 ++ ++ G+++ Y + I ++ + + G R + Sbjct: 310 PVGPIKRNKLGRVGVMSPLYTIFRFSDIHLDFVEKYFDTTIWHRYMELNGDSGARSDRFA 369 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 +K K LP+ +P + EQ I + + +D L+ ++ Sbjct: 370 IKDSVFKGLPIPLPTLPEQEAIGSF----FSDLDQLITLHQR 407 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 67/211 (31%), Gaps = 11/211 (5%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + + K + + + + E+ N + + ++ Sbjct: 13 FPGFTDAWKQRKLGEVADFSIKTNSLSRDKLSSYFYEVQ--NIHYGDILTKYDAILDVCN 70 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVK 324 K +G + ++ G+IVF D K + + + + Sbjct: 71 KELPSIIGSTISDF-ADALLSEGDIVFADAAEDSTVGKAIEVRNFKGKNVVSGLHTIVAR 129 Query: 325 PHGID-STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P YL +L+ S + G S+ ++K V+ P + EQ I + Sbjct: 130 PKVSYAPYYLGYLINSTAYHNQILPLMQGTKVSSISKANLKSTTVVFPTLPEQEAIGSF- 188 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ + +KE + + + Sbjct: 189 ---FSDLDQLITLHQRKLDDVKELKKALLQK 216 >gi|163803499|ref|ZP_02197370.1| type I restriction-modification system, S subunit [Vibrio sp. AND4] gi|159172717|gb|EDP57567.1| type I restriction-modification system, S subunit [Vibrio sp. AND4] Length = 463 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 61/432 (14%), Positives = 143/432 (33%), Gaps = 44/432 (10%) Query: 30 IKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV--SIFAKG 84 + L G + ++ + G + R + + K Sbjct: 10 LGELGSLKNGANFNKNDAGDGCPVMSVKQLFRGRYVDTEGLSSIRIGTLKKLDDYLVRKN 69 Query: 85 QILYGKLG----PYLRKAIIADFDGIC-----STQFLVLQPKDVLPELLQGWLLSIDVTQ 135 +L+ + + AI+ D+ C + +F + V P L L S + Sbjct: 70 DLLFARSSLKAEGSGQVAIVNDYPENCIFSGFTIRFRLFDESKVNPLYLYYLLRSAKYRE 129 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I G+ +S+ + IP+ +P Q + + + + + ++ Sbjct: 130 IFVRITTGSVISNLTQATLSKIPVELPNKETQDYVAKILDELDRKNELATATNQTLEQMA 189 Query: 196 KEKKQALVS-----YIVTKGLNPDVK-------MKDSGIE-WVGLVPDHWEVKPFFALV- 241 + ++ G P+ + +E +GL+P+ W V + Sbjct: 190 QAIFKSWFVDFDPVKAKMNGEQPEGMDAATASLFPEKLVESELGLIPEGWPVDQVGNHIE 249 Query: 242 --TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES--YETYQIVDPGEIVFRFID 297 + K+++L ES ++ + + R GLK + Y+ Q+++ G++V D Sbjct: 250 LTKGKSYKSSELQESTTALVTLKSFKRGGGYRMDGLKEYTGTYKPQQVIEAGDLVMSLTD 309 Query: 298 LQ------NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG 350 + + A + + + ++P D+ + LM +Y + + Sbjct: 310 VTQAAEIVGKPALVIEAPQYDTLVASLDVAILRPKETDAKQYFYGLMSTYRFHRYAESFA 369 Query: 351 SGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +G L + + P ++ + A I +E L + R + Sbjct: 370 TGTTVLHLSPKGITTFEFACPS----TELVKKYHEFAAPIFAKIEANILESQELVKLRDT 425 Query: 410 FIAAAVTGQIDL 421 + ++G+I+L Sbjct: 426 LLPKLLSGEIEL 437 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 63/202 (31%), Gaps = 16/202 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +G IP+ W V + +L G++ +S + + L+ + G G Y Sbjct: 232 LGLIPEGWPVDQVGNHIELTKGKSYKSSELQESTTALVTLKSFKRGGG-YRMDGLKEYTG 290 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC----------STQFLVLQPKDVLPEL 123 + G ++ I+ + S +L+PK+ + Sbjct: 291 TYKPQQVIEAGDLVMSLTDVTQAAEIVGKPALVIEAPQYDTLVASLDVAILRPKETDAKQ 350 Query: 124 LQG-WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + E+ G T+ H KGI P E +I+ Sbjct: 351 YFYGLMSTYRFHRYAESFATGTTVLHLSPKGITTFEFACPSTELVKKYHEFAAPIFAKIE 410 Query: 183 TLITERIRFIELLKEKKQALVS 204 I E ++L L+S Sbjct: 411 ANILESQELVKLRDTLLPKLLS 432 >gi|308062170|gb|ADO04058.1| Type I R-M system specificity subunit [Helicobacter pylori Cuz20] Length = 425 Score = 97.6 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 58/414 (14%), Positives = 131/414 (31%), Gaps = 32/414 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYI--GLEDVESGTG------KYLPKDGNSRQS 73 PK + + + G T + ++I + G++ + + + ++ Sbjct: 13 PKGVEFRKLGDIGEYIRGVTYKKNQEINNLECGIKVLRANNITLSNHLNFEDIKVINKNV 72 Query: 74 DTSTVSIFAKGQILY---GKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWL 128 K IL ++ K DFD + V++ ++V + Sbjct: 73 KIRKEQYLKKNDILICAGSGSSEHIGKVAFINTDFDYVFGGFMGVIRIREVNSRFVYHIF 132 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S Q +E T+++ + + N +PIPPL Q I + + A T L TE Sbjct: 133 TSNIFKQYLEKSLNTTTINNLNANILQNFLIPIPPLEIQQEIVKILDAFTELNTELNTEL 192 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKM------KDSGIEWVGLVPDHWEVKPFFALVT 242 + + + L+ + + D KM K L P E + ++ Sbjct: 193 KARKKQYQYYQNMLLDFKDIHSNHKDAKMSAKTYPKRLKTLLQTLAPKGVEFRKLGEVLE 252 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + ++ +T +G E YQ ++ + Sbjct: 253 YDQPNKYCVTSKEFDKSYPTPVLTAGKTFILGYTNEKDNIYQASKSSPVII----FDDFT 308 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + + ++ + + + + + + G RQ + Sbjct: 309 TATQWVDFPFKVKSSAMKILLPKNPTINIRFIFFYMQTIPYNI---SGEHTRQWISR--Y 363 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ + +PP++ Q +I +++ A L+ I I K+ R + Sbjct: 364 SQITIPIPPLEIQQEIVKILDQFLALTTDLLAGIPAEIEARKKQYEYYREKLLT 417 >gi|322628320|gb|EFY25108.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-4] gi|322649127|gb|EFY45568.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. OH_2009072675] Length = 229 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 40/242 (16%), Positives = 84/242 (34%), Gaps = 19/242 (7%) Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + +++K+AL+ ++T + ++G+ + G W + + Sbjct: 1 MTEKLLANSQQQKKALIQQLLT---GKKRLLDENGVRFSGE----WCTCTLSEVAHIIMG 53 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRS 304 + K N L I + + P Y + PG+I+ Sbjct: 54 SSPKSEAYNDNGLGLPLIQGNADIKCRVSCPRVYTSDITKECTPGDILLSVRAPVGTVA- 112 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + I A+K S + + K Y +S+ +D+K Sbjct: 113 ----LSQHKACIGRGISAIKSKRKMSQSFLYQWFLWFEPKWCYLSQGSTFESINSDDIKT 168 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR-G 423 L + VP +EQ I V++ I L E+ + LK + + + +TG+ ++ Sbjct: 169 LKLSVPNFEEQQKIAAVLSAADTEISTL----EKKLACLKNEKKALMQQLLTGKRRVKVD 224 Query: 424 ES 425 E+ Sbjct: 225 EA 226 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 55/191 (28%), Gaps = 6/191 (3%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 GV++ G W + + G + +S + G + R Sbjct: 32 GVRFSGE----WCTCTLSEVAHIIMGSSPKSEAYNDNGLGLPLIQGNADIKCRVSCPRVY 87 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G IL P ++ ++ K + + + Sbjct: 88 TSDITKECTPGDILLSVRAPV-GTVALSQHKACIGRGISAIKSKRKM-SQSFLYQWFLWF 145 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + +G+T + I + + +P EQ I + A I TL + Sbjct: 146 EPKWCYLSQGSTFESINSDDIKTLKLSVPNFEEQQKIAAVLSAADTEISTLEKKLACLKN 205 Query: 194 LLKEKKQALVS 204 K Q L++ Sbjct: 206 EKKALMQQLLT 216 >gi|153947182|ref|YP_001402491.1| type I restriction-modification system, S subunit [Yersinia pseudotuberculosis IP 31758] gi|152958677|gb|ABS46138.1| putative type I restriction-modification system, S subunit [Yersinia pseudotuberculosis IP 31758] Length = 419 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 53/408 (12%), Positives = 128/408 (31%), Gaps = 27/408 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +W + + T + G + ++++ E+++S T K + Sbjct: 18 NWLNFNLSQITDVYDGTHQTPAYTKSGVMFLSAENIKSLTS---TKFISEEAFKKEFKVY 74 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 K +L ++G ++ D L L + ++ Q+ + Sbjct: 75 PKKNDVLMTRIGDVGTANVVETDDDRAYYVTLALLKYKKISPYFLKSSIASPFVQKDIWL 134 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 ++ + I V+ +KI +D LI + + + L K+ Sbjct: 135 RT-LHIAFPKKINMNEIKKVAVNCPPDVVESDKIGQYFKNLDALINQHQQKHDKLSNIKK 193 Query: 201 ALVSYIVTKGLN--PDVKMKDSGIEWVGLVPDHWEVKPFFAL--------VTELNRKNTK 250 A++ + K P+++ K EW +P + KN Sbjct: 194 AMLEKMFPKPGKTIPEIRFKGFSGEW-EEMPFGACFINVSNNTLSRADLNYDDGMAKNIH 252 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + I + +L + + + G+I+ + Sbjct: 253 YGDVLIKFGEVLDATNELLPFITNNDVANKLKHAALRDGDIIIADAAEDSMVGKCTELFN 312 Query: 311 MERGIITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLP 366 + ++ S + S YL + + S ++ G + S+ ++ Sbjct: 313 IGEQLVLSGLHTIAVRPTLTFASKYLGYYLNSSSYHDQLLSLMQGTKVLSISRTAIQNTN 372 Query: 367 VLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ P +EQ +I N ++D L+ + +Q I L + + ++ Sbjct: 373 IVFPKSAEEQVEIGNY----FQKLDALINQHQQQITKLNNIKQACLSK 416 >gi|330971616|gb|EGH71682.1| type I restriction enzyme, S subunit [Pseudomonas syringae pv. aceris str. M302273PT] Length = 198 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 47/199 (23%), Positives = 73/199 (36%), Gaps = 15/199 (7%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK--LNT--GRTSESGKDIIYIGLEDV 56 M + +YP YKDSGV+W+G +P+ W V IKR +N G + DI I + D Sbjct: 1 MS-FPSYPTYKDSGVEWLGEVPQSWSVYSIKRTVDGCINGLWGDEPDGENDIAVIRVADF 59 Query: 57 ESGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLG----PYLRKAII--ADFDGICS 108 E R + G +L K G + ++ +FD I S Sbjct: 60 ERSFSTVGLDKLTYRSITPKERQSRLIKSGDLLIEKSGGGEKTLVGCVVLFTHEFDAITS 119 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIE--AICEGATMSHADWKGIGNIPMPIPPLAE 166 ++P + R+ ++ + + + D + P E Sbjct: 120 NFVARMRPLAEFDSQFLCYAFGNLYHGRVNYPSVKQVTGIQNLDAESYLQERFCFPTRVE 179 Query: 167 QVLIREKIIAETVRIDTLI 185 Q I + ET RID LI Sbjct: 180 QTQIARFLNHETARIDALI 198 Score = 69.8 bits (169), Expect = 9e-10, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 66/193 (34%), Gaps = 13/193 (6%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK---NTKLIESNILSLSYGNIIQKLET 270 KDSG+EW+G VP W V V + E++I + + + T Sbjct: 6 YPTYKDSGVEWLGEVPQSWSVYSIKRTVDGCINGLWGDEPDGENDIAVIRVADFERSFST 65 Query: 271 RNMGL-----KPESYETYQIVDPGEIVFRFIDLQND--KRSLRSAQVMERGIITSAYMAV 323 + +++ G+++ + I ++ + Sbjct: 66 VGLDKLTYRSITPKERQSRLIKSGDLLIEKSGGGEKTLVGCVVLFTHEFDAITSNFVARM 125 Query: 324 KP-HGIDSTYLAWLMRSYDLCKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 +P DS +L + + +V Y Q+L E + P EQ I Sbjct: 126 RPLAEFDSQFLCYAFGNLYHGRVNYPSVKQVTGIQNLDAESYLQERFCFPTRVEQTQIAR 185 Query: 381 VINVETARIDVLV 393 +N ETARID L+ Sbjct: 186 FLNHETARIDALI 198 >gi|317182100|dbj|BAJ59884.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F57] Length = 430 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 50/415 (12%), Positives = 123/415 (29%), Gaps = 34/415 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIMISTTATIGEHALLI-VDSLANQRFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 192 IELLKEKKQALVSYIVTKG--LNPDVKMKDSGIEWVGL--------VPDHWEVKPFFALV 241 ++ K++ + + ++ + K S + P E + ++ Sbjct: 192 LKARKKQYEYYQNMLLDFKGIHSNHKDAKMSAKTYPKRLKSLLQTLAPKGVEFRKLGEVL 251 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + ++ +T +G E YQ ++ + Sbjct: 252 EYDQPNKYCVTSKEFDKSYPTPVLTAGKTFILGYTNEKDNIYQASKSSPVII----FDDF 307 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + + + ++ + + + + + + G RQ + Sbjct: 308 TTATQWVDFPFKVKSSAMKILLPKNPTINIRFIFFYMQTIPYNI---SGEHTRQWISR-- 362 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ + +PP++ Q +I +++ + L+ I I K+ R + Sbjct: 363 YSKITIPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIEARKKQYEYYREKLLT 417 >gi|332297063|ref|YP_004438985.1| restriction modification system DNA specificity domain protein [Treponema brennaborense DSM 12168] gi|332180166|gb|AEE15854.1| restriction modification system DNA specificity domain protein [Treponema brennaborense DSM 12168] Length = 407 Score = 97.6 bits (241), Expect = 4e-18, Method: Composition-based stats. Identities = 59/399 (14%), Positives = 136/399 (34%), Gaps = 22/399 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + + N + Y+ LE V SGT + + + + Sbjct: 16 EDWEEKTLGEVSDFNPKSEIPNI--FKYVDLESV-SGTQLLQYRTETKDSAPSRAQRLAR 72 Query: 83 KGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K I Y + PY + + D D + ST + ++P + L L + + + Sbjct: 73 KNDIFYQTVRPYQKNNFLYDKDDLDFVFSTGYAQIRP-FIDSSFLFTKLQEDEFVKLVLD 131 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 C G + + + N+ + I + + + IDTLIT + E L + K Sbjct: 132 NCTGTSYPAINSNTLENLSVYITTNSIEQTKIGTL---FKNIDTLITSKKAKYEKLLQIK 188 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA----LVTELNRKNTKLIESN 255 ++L+ + + ++ G E+ + ++ K K + + Sbjct: 189 KSLLEKMFPQDGQATPALRFKGFTEDWKEKTMGEIMNITSVKRIHQSDWTNKGIKFLRAR 248 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + SY N K + Y + V +++ + + + + + Sbjct: 249 DIVASYKNEKITDNLFISKQKYDEYTSISGKVKIEDLLVTGVGTIGIPMQIENLEPVYFK 308 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVP-PI 372 + I+ + + + + G+G + E K+ P+++P Sbjct: 309 DGN-IIWFQNSNKINGNFFYYSFCGKKIQYFIKESAGTGTVGTYTIESGKKTPIILPIDK 367 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 EQ I N ++D L+ ++ + L+ + + + Sbjct: 368 AEQTKIGNF----FKQLDTLLSLQKKELDKLQNVKKALL 402 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 55/182 (30%), Gaps = 6/182 (3%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + V++ N K+ + L + Q L+ R ++ +I Sbjct: 18 WEEKTLGEVSDFNPKSEIPNIFKYVDLESVSGTQLLQYRTETKDSAPSRAQRLARKNDIF 77 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 ++ + L ++ + ++ Y ++P S L + V Sbjct: 78 YQTVRPYQKNNFLYDKDDLD-FVFSTGYAQIRPFIDSSFLFTKLQEDEFVKLVLDNCTGT 136 Query: 353 LRQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 ++ ++ L V + EQ I ID L+ + L + + S + Sbjct: 137 SYPAINSNTLENLSVYITTNSIEQTKIG----TLFKNIDTLITSKKAKYEKLLQIKKSLL 192 Query: 412 AA 413 Sbjct: 193 EK 194 >gi|319946656|ref|ZP_08020890.1| putative restriction modification system DNA specificity subunit [Streptococcus australis ATCC 700641] gi|319746704|gb|EFV98963.1| putative restriction modification system DNA specificity subunit [Streptococcus australis ATCC 700641] Length = 386 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 52/415 (12%), Positives = 121/415 (29%), Gaps = 54/415 (13%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + G++ + +I I + G I+ Sbjct: 4 IKLGDIIDFKNGKSVKKSDGVIPIYGGNGILGYTDKSNFSHT----------------IV 47 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G++G Y + + S + PK+ ++L + + G++ Sbjct: 48 VGRVGAYCGSIYVEENSCWVSDNAIAGVPKEGQDLTYLYYVLKSL---NLNSKQIGSSQP 104 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + + +++I ID I + + L+ + L Y Sbjct: 105 LITQS---MLRDMVVDIEINIEKQKRIANSISIIDQKIQINNQINQELEAMAKTLYDYWF 161 Query: 208 TKGLNPDV---KMKDSG------IEWVGLVPDHWEVKPFFALVTELNRKNT------KLI 252 + PD K SG E +P+ W V + + Sbjct: 162 VQFDFPDQNGKPYKSSGGKMVYHPELKLEIPEGWGVDKIEDIAKTGSGGTPKSTNVSYYS 221 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 I ++ G + Q + T E + ++ G I+ K S + + Sbjct: 222 NGEIPWINSGELEQTVITSTSNFITEEGLNNSSAKLFPSGTILVAMYGATAGKVSFLTFE 281 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRS---YDLCKVFYAMGSGLRQSLKFEDVKRLP 366 I + + + + +++ + R +L + +K + Sbjct: 282 ASTNQAICAIMLKDI-------RMRYYLKNVIEDLYQYLVKLSTGSARDNLSQDMIKNIK 334 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 V++P I + + I + K +Q L + R + + GQ+ + Sbjct: 335 VVIPSND----ILDRFYDFSNNIIKEITKKQQENEQLTQLRDWILPMLMNGQVKV 385 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 62/195 (31%), Gaps = 8/195 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 IP+ W V I+ K +G T +S +I +I ++E Sbjct: 190 EIPEGWGVDKIEDIAKTGSGGTPKSTNVSYYSNGEIPWINSGELEQTVITSTSNFITEEG 249 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + S+ +F G IL G K F+ + + KD + + D Sbjct: 250 LNNSSAKLFPSGTILVAMYGATAGKVSFLTFEASTNQAICAIMLKD-IRMRYYLKNVIED 308 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + Q + + G+ + I NI + IP + I E + Sbjct: 309 LYQYLVKLSTGSARDNLSQDMIKNIKVVIPSNDILDRFYDFSNNIIKEITKKQQENEQLT 368 Query: 193 ELLKEKKQALVSYIV 207 +L L++ V Sbjct: 369 QLRDWILPMLMNGQV 383 >gi|299822016|ref|ZP_07053903.1| type I restriction-modification system specificity subunit [Listeria grayi DSM 20601] gi|299816644|gb|EFI83881.1| type I restriction-modification system specificity subunit [Listeria grayi DSM 20601] Length = 376 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 51/392 (13%), Positives = 117/392 (29%), Gaps = 33/392 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ F + +G+ + + SG G D ++ Sbjct: 17 DWEERKFADFIDVKSGKDYK-----------HLNSGPIPVYGTGGYMLSVD---RALSDI 62 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I G+ G + ++ T F + PK + + LSI + E Sbjct: 63 DAIGIGRKGTIDKPYLLKAPFWTVDTLFYAV-PKQNID---LQFSLSIFKKINWKKFDES 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + I ++ +P EQ I ++D I R +EL+K+ KQ + Sbjct: 119 TGVPSLSKTVINSVGAFVPSYEEQQKIGSF----FKQLDETIALHQRKLELIKQLKQGFL 174 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + + ++ + E + + + + + Sbjct: 175 QQMFVREDEKGPVLRFADFESEWEQRKLGALGSVVMNKRIFKEQTSDDGDVPFYKIGTFG 234 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + L E E Y + G+I+ RS+ E ++ Sbjct: 235 -SEPDAYISYELFLEYKEKYPYPEIGDILLSASGSIG--RSVVYEGKDEYFQDSNIIWLK 291 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 +D+ ++ + L + + + L +++ + +P EQ I Sbjct: 292 HDERLDNK----FLKQFYLIVKWQGLEGSTIKRLYNKNILDTNIFLPSPTEQGKIGCF-- 345 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++D ++ + L+ + ++ A Sbjct: 346 --FEKLDTIIALHHNKLEQLQSLKKGYLKALF 375 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 14/97 (14%), Positives = 35/97 (36%), Gaps = 9/97 (9%) Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + L + + + S SL + + VP +EQ I + ++ Sbjct: 97 NIDLQFSLSIFKKINWKKFDESTGVPSLSKTVINSVGAFVPSYEEQQKIGSF----FKQL 152 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 D + ++ + L+K+ + F+ +R + + Sbjct: 153 DETIALHQRKLELIKQLKQGFLQQMF-----VREDEK 184 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 49/186 (26%), Gaps = 10/186 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + + + D+ + + S Y+ + Sbjct: 195 SEWEQRKLGALGSVVMNKRIFKEQTSDDGDVPFYKIGTFGSEPDAYISYELFL--EYKEK 252 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G IL G R + D ++ D + + V + Sbjct: 253 YPYPEIGDILLSASGSIGRSVVYEGKDEYFQDSNIIWLKHDERLDNKFLKQFYLIVKWQG 312 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 EG+T+ K I + + +P EQ I I + + L K Sbjct: 313 L---EGSTIKRLYNKNILDTNIFLPSPTEQGKIGCFFEKLDTIIALHHNKLEQLQSLKKG 369 Query: 198 KKQALV 203 +AL Sbjct: 370 YLKALF 375 >gi|307127561|ref|YP_003879592.1| type I restriction-modification system S subunit [Streptococcus pneumoniae 670-6B] gi|306484623|gb|ADM91492.1| type I restriction-modification system S subunit [Streptococcus pneumoniae 670-6B] Length = 352 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 51/392 (13%), Positives = 115/392 (29%), Gaps = 44/392 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + L+ Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNLLV-------- 170 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 K E G V + + L +N K + + Sbjct: 171 --------------KSRFNEMFGDVILNEKEWKVSKWNEILTIRNGKNQKQVEDADGKFP 216 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I Y IV ++ N +R + Sbjct: 217 IYGSGGI-------MGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEP 266 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I+S YL + + Y+ K+ A+ SL D+ + + +PP+ Q + + + Sbjct: 267 VLEKINSEYLFYFCQLYNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFV- 322 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 A +D I++S+ L+ + S + Sbjct: 323 ---ALVDKSQLAIQKSLEELETLKKSLMQEYF 351 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 64/185 (34%), Gaps = 19/185 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 186 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 233 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 234 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 290 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 T+ + NI +P+PPLA Q + +D + +E L+ K++L Sbjct: 291 AVTIPSLTKSDLLNISIPLPPLALQNEFADF----VALVDKSQLAIQKSLEELETLKKSL 346 Query: 203 VSYIV 207 + Sbjct: 347 MQEYF 351 >gi|283469727|emb|CAQ48938.1| Sau1hsdS1 [Staphylococcus aureus subsp. aureus ST398] Length = 392 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 49/398 (12%), Positives = 105/398 (26%), Gaps = 36/398 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + F T + + ++ + K S + + + Sbjct: 20 EWEEKKLGEFAGKVTQKNVDKKYIETLTNSAELGIISQKDYFDKEISNIDNIKKYYVVEE 79 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +Y + G+ S + V + +++ ++ + S + + Sbjct: 80 NDFVYNPRMSNYAPFGPVNRNKLGKKGVMSPLYTVFKIQNIDLNFIEFYFKSSKWYRFMA 139 Query: 139 AICEGATM---SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + +P+ IP + EQ+ I + +I+ + + Sbjct: 140 LNGDSGARADRFSIKDRTFMEMPLHIPCMDEQIKIGQFFSKLDRQIELEEQKLELLQQQK 199 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K Q + S + + G WE + K Sbjct: 200 KGYMQKIFSQELRFK------------DENGKDYPEWEETTIKEIAQINTGKKDTK---- 243 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + I P Y+ GE + D + + Sbjct: 244 -------DAITNGSYDFYVRSPIVYKINTFSYEGEAILTVGDGVGVGKVF-HYVNGKFDY 295 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 Y L + L + S++ + + + V P EQ Sbjct: 296 HQRVYKISDFKNYYGLLLFYYFSQNFLKETKKYSAKTSVDSVRKDMIANMKVPRPIYIEQ 355 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I I R+D + +Q I LLK+R+ S + Sbjct: 356 KKIGQFI----KRVDNKTKIQKQVIELLKQRKKSLLQK 389 >gi|269978374|gb|ACZ55921.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 431 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 50/417 (11%), Positives = 118/417 (28%), Gaps = 36/417 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIRNGYTPSKNNPEFWKNGTIPWFRMEDLRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR---IDTLITER 188 + + + + + D PIPPL Q I + + A T ++T + Sbjct: 132 LLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV---------PDHWEVKPFFA 239 + + E Q ++ N + + P E K Sbjct: 192 LNARKKQYEYYQNMLLDFNGINQNHKDAKEKLAQKTYPKRLKTLLQTLAPKGVEFKKLGE 251 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 ++ ++ ++ +T +G E YQ ++ Sbjct: 252 VLEYDQPNKYCVMGKEFDKSYPTPVLTAGKTFILGYTNEKDNIYQASKSSPVII----FD 307 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + + + + ++ + + + + + SG Sbjct: 308 DFTTATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFYMQTIPYNI-----SGEHARHWI 362 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +L V +PP++ Q +I +++ + L+ I I K+ R + Sbjct: 363 SRYSQLEVPIPPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 419 >gi|283796925|ref|ZP_06346078.1| restriction modification system DNA specificity domain protein [Clostridium sp. M62/1] gi|291075335|gb|EFE12699.1| restriction modification system DNA specificity domain protein [Clostridium sp. M62/1] Length = 353 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 53/354 (14%), Positives = 119/354 (33%), Gaps = 26/354 (7%) Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 ++ + + + +G + L + +A+ DGI S + +L+ K L Sbjct: 21 RNIHYDDASLANYKKVEQGDFII-HLRSFEGGLEMANEDGIVSPAYTILRCKKPHSSLFY 79 Query: 126 G-WLLSIDVTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + I + ++ + +P ++EQ I + + Sbjct: 80 EAYFHTDEFINHILSKSVEGIRDGRQISYEAFKWLGLPYCDVSEQERIAQ----LFCTLS 135 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I ++ + ++ LK+ K+ L + I S I E+ T Sbjct: 136 HRIEKQQQMVDALKKYKRGLFNQIF------------SAISKSSQCRKLRELVRVSGGKT 183 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + + S ++ + + + + PG ++ K Sbjct: 184 PSMSNSLYWNGDIVWISSKDMKSSRISGSELKITNLALNEMTLYHPGTLLLVARSGIL-K 242 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFED 361 SL A + I A++ HG ++ YL + ++ D QSL + Sbjct: 243 HSLPLAILEVDATINQDIKALQVHGCNAFYLYYAILSQEDTIIRTLVKTGTTVQSLMMDS 302 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + P I +Q I + + A+++ VE E+ + LL + R+ + Sbjct: 303 FLNIEIPTPDIDQQQRIIDKL----AKLEKYVEVQEKELSLLSQMRNGLLQQLF 352 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 52/150 (34%), Gaps = 13/150 (8%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 RN+ S Y+ V+ G+ + + E GI++ AY ++ S Sbjct: 21 RNIHYDDASLANYKKVEQGDFIIHLRSFEG-----GLEMANEDGIVSPAYTILRCKKPHS 75 Query: 331 TYLA--WLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + + + + G+R + + +E K L + + EQ I Sbjct: 76 SLFYEAYFHTDEFINHILSKSVEGIRDGRQISYEAFKWLGLPYCDVSEQERIA----QLF 131 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + +EK +Q + LK+ + + Sbjct: 132 CTLSHRIEKQQQMVDALKKYKRGLFNQIFS 161 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 72/189 (38%), Gaps = 15/189 (7%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 ++ +++ G+T DI++I +D++S + + + ++++ Sbjct: 170 RKLRELVRVSGGKTPSMSNSLYWNGDIVWISSKDMKS--SRISGSELKITNLALNEMTLY 227 Query: 82 AKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G +L L+ I + D + LQ L +LS + T Sbjct: 228 HPGTLLLVARSGILKHSLPLAILEVDATINQDIKALQVHGCNAFYLYYAILSQEDTIIRT 287 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G T+ NI +P P + +Q I +K+ +++ + + + + LL + Sbjct: 288 LVKTGTTVQSLMMDSFLNIEIPTPDIDQQQRIIDKL----AKLEKYVEVQEKELSLLSQM 343 Query: 199 KQALVSYIV 207 + L+ + Sbjct: 344 RNGLLQQLF 352 >gi|207091738|ref|ZP_03239525.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori HPKX_438_AG0C1] Length = 412 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 57/418 (13%), Positives = 125/418 (29%), Gaps = 43/418 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +PK + + ++ G+ + + GKY G Sbjct: 12 VPKGVEFRKLGEVCEIIRGKRVTKKEIL----------DKGKYPVVSGGIGFMGYLNEYN 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + I + G + + + PK+ L ++L+ Sbjct: 62 REENTITIAQYGT-AGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSIS 120 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR---------------IDTLI 185 A I I +PIPPL Q I + + A T ++T + Sbjct: 121 NRSAIPYSISSNNIMQITIPIPPLEIQQEIVKILDAFTELNTELNTELNTELNTELNTEL 180 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLN-------PDVKMKDSGIEWVGLVPDHWEVKPFF 238 ++ + E Q ++ LN K L P E + Sbjct: 181 NTELKARKKQYEYYQNMLLDFKDIYLNHKDAKMSAKTYPKRLKTLLQTLAPKGVEFRKLG 240 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + N+K K+ E + + + G + + GE + Sbjct: 241 EVCESTNKKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFN------NDGENITIASRG 294 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + + G + Y + + + +L + +++ ++ + + G +L Sbjct: 295 EYAGFINYFNEKFFAGGLCYPYKVKDTNELLTKFLYFYLKTNEIQIMENLVFRGSIPALN 354 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 D++ L + +PP++ Q +I +++ + L+ I I K+ R + Sbjct: 355 KADIETLTIPIPPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 412 >gi|148262630|ref|YP_001229336.1| restriction modification system DNA specificity subunit [Geobacter uraniireducens Rf4] gi|146396130|gb|ABQ24763.1| restriction modification system DNA specificity domain [Geobacter uraniireducens Rf4] Length = 385 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 48/403 (11%), Positives = 113/403 (28%), Gaps = 34/403 (8%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + G S K G Y + + +G ++ Sbjct: 7 KRLGDIVNFKRGYDLPSYK-----------RKEGPYPIVSSSGISGYHAEYKAKGEG-LI 54 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G + +T V K P+ + L + + + T+ Sbjct: 55 TGRYGTLGEMYYVNGKYWPHNTALYVTDFKGNYPKYVYFLLKCLGSLKTSDKS----TVP 110 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + +P Q I + + +ID + K Sbjct: 111 GVNRNDLHELLVPYIKPELQKPIADFLFLLESKIDLNNRINSELEAMAKTLYDYWFVQFD 170 Query: 208 TKGLNPDVKMKDSGI-----EWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSL 259 N SG E +P+ W+V + +N + + I ++ L + Sbjct: 171 FPDKNGKPYKSCSGKIVWNKELKREIPEGWKVGSLLDIAEYINGLPCQKYRPIGTDFLYV 230 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 ++ T L I++ G+++F + +G + Sbjct: 231 IKIREMRDGFTSESELVRPDIPQKAIIENGDVLFSWSASLE-----VQIWTGGKGALNQH 285 Query: 320 YMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 V ++ + + K+ + + +K+ +++PPI+ Sbjct: 286 IFKVTSKKYPKSFYYYQLVNYLQHFKMMADNRRTTMGHITQDHLKQSRIVLPPIEL---- 341 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 T + + I + + + L R + + GQ+ + Sbjct: 342 TEKLECKLGPIRTAITSNQLANNTLSSLRDWLLPMLMNGQVKV 384 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 56/196 (28%), Gaps = 10/196 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP+ WKV + + G + I + ++ G + + D Sbjct: 195 EIPEGWKVGSLLDIAEYINGLPCQKYRPIGTDFLYVIKIREMRDG----FTSESELVRPD 250 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 +I G +L+ L I G + + K L++ Sbjct: 251 IPQKAIIENGDVLFSWS-ASLEVQIWTGGKGALNQHIFKVTSKKYPKSFYYYQLVNYLQH 309 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ A TM H + + +PP+ + K+ I + L Sbjct: 310 FKMMADNRRTTMGHITQDHLKQSRIVLPPIELTEKLECKLGPIRTAITSNQLANNTLSSL 369 Query: 195 LKEKKQALVSYIVTKG 210 L++ V G Sbjct: 370 RDWLLPMLMNGQVKVG 385 >gi|328951821|ref|YP_004369155.1| Site-specific DNA-methyltransferase (adenine-specific) [Desulfobacca acetoxidans DSM 11109] gi|328452145|gb|AEB07974.1| Site-specific DNA-methyltransferase (adenine-specific) [Desulfobacca acetoxidans DSM 11109] Length = 896 Score = 97.2 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 71/390 (18%), Positives = 135/390 (34%), Gaps = 27/390 (6%) Query: 17 WIGAIPKHWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ + W+ +P F + R D IY+GLE ++ + Sbjct: 512 WLKR--EEWQRLPFGAFAESINERVEPSDAGDEIYVGLEHLDPQDLHI--RRWGKGSDVI 567 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ--PKDVLPELLQGWLLSIDV 133 T F KG +++G+ Y RK IA FDGICS +V++ P+ VLPE L ++S Sbjct: 568 GTKLRFRKGDLIFGRRRAYQRKLAIAQFDGICSAHAMVVRAKPEVVLPEFLPFLMVSDRF 627 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 R I G+ +WK + P+P + +Q I E + + + Sbjct: 628 MNRAVEISVGSLSPTINWKTLKLEKFPLPSIDQQRRIAEILWEADKVFGKYFSVTKALAK 687 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + +V + E + P + P + R Sbjct: 688 IENALVDIMVRSAAANFESK------PLRELIIGKPQYGANAPAANYRDGMPR------Y 735 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 I + + K + L ES + + G+++ K L S + R Sbjct: 736 VRITDIETKGRLTKQDIV-AVLLDESSQKKYELADGDLLIARTGNTVGKSYLYS-ESDGR 793 Query: 314 GIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVP 370 + + +P+ YL + +S G + ++ + L + +P Sbjct: 794 CVYAGYLVRFRPNREIVLPEYLFRVTQSSYYRNWLENNIRVGAQPNVNGTEYGSLLIPLP 853 Query: 371 PIKEQ-FDITNV--INVETARIDVLVEKIE 397 P+ Q ++++ ++ + L+ I Sbjct: 854 PLSFQSERLSDIKELSSGDQGYEELISAIR 883 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 48/143 (33%), Gaps = 11/143 (7%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA--YMA 322 Q L R G + T G+++F K ++ GI ++ + Sbjct: 552 PQDLHIRRWGKGSDVIGTKLRFRKGDLIFGRRRAYQRKLAIAQFD----GICSAHAMVVR 607 Query: 323 VKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 KP + +L +LM S L ++ ++ +K +P I +Q I + Sbjct: 608 AKPEVVLPEFLPFLMVSDRFMNRAVEISVGSLSPTINWKTLKLEKFPLPSIDQQRRIAEI 667 Query: 382 I---NVETARIDVLVEKIEQSIV 401 + + + V K I Sbjct: 668 LWEADKVFGKYFS-VTKALAKIE 689 >gi|241895014|ref|ZP_04782310.1| type I restriction-modification system specificity subunit [Weissella paramesenteroides ATCC 33313] gi|241871732|gb|EER75483.1| type I restriction-modification system specificity subunit [Weissella paramesenteroides ATCC 33313] Length = 399 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 53/395 (13%), Positives = 121/395 (30%), Gaps = 21/395 (5%) Query: 25 WKVVPIKRFTK-LNTG--RTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV- 78 W+ + ++ + G T + ++ ++ + G K + + ++ S+V Sbjct: 17 WEKRKLLDGSEKIGDGLHGTPKYFEKGNVYFVNGNNFIDGEIKITKETKHVAETAQSSVD 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 IL G A + D E + +L + V Sbjct: 77 QGLTNNTILMSINGTIGNLAYYHGEKISLGKSAAFITVSDFYKEFIYAYLQTKTVHSYFM 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G T+ + K + P+ +P + EQ +KI +ID LIT R ++LLKE Sbjct: 137 NSLTGTTIKNLGLKALRETPLSVPVIFEQ----KKIGRLFKQIDKLITVNQRKVDLLKEL 192 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+ + + K +++ +G ++ + + Sbjct: 193 KKGFLQKMFPKNEENYPQIRFAGYTDAWEKRKLGDIGSVAMNKRIFKSETFDYGDVPFYK 252 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + I E Y G+++ E Sbjct: 253 IGTFGKIADSFITREKFT-EYKAKYPFPKNGDVLISASGS----IGKTVVYHGEDAYFQD 307 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + H + + + + + L +++ + +P + EQ I Sbjct: 308 SNIVWLEHDGQIDNK--FLEQFYKIVRWSGVEGSTIKRLYNKNILNTSISIPNLDEQEKI 365 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ D L+ ++ + LLK+ + + + Sbjct: 366 GELLY----LFDFLITVNQRRVDLLKQEKKALLQK 396 >gi|328946726|gb|EGG40864.1| type I restriction modification DNA specificity family protein [Streptococcus sanguinis SK1087] Length = 402 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 63/413 (15%), Positives = 128/413 (30%), Gaps = 44/413 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLP---KDGNSRQS 73 WK V + + G T + K DI +I +D+ + +Y+ ++ Sbjct: 15 SDWKKVKLSELGTIVGGGTPSTKKEEYYGGDIPWITPKDLANFGERYIEHGSRNITLAGL 74 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + S+ I G IL+ P IA + + F + P + L + L Sbjct: 75 ENSSAKILPVGSILFSSRAPI-GYIAIASNNVSTNQGFKSIIPNSDVDS-LFLYYLLKFN 132 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFI 192 +IE + G T + +I + IP + EQ I + A +I Sbjct: 133 KDKIENMGSGTTFKEVSASIMKSIEVFIPTEIVEQRKISAILGAIDDKI----------- 181 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E + + ++ N + +G + + F + K I Sbjct: 182 ----ENNKKINHHLAAISKNYLKIFHSNNSIKLGDLFELKSGYAFKSKDWVDEGKPVIKI 237 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + ++ ++ K ++E V EIV K + Sbjct: 238 KDIDGITIDITNLNYVKNKSQLAKASNFE----VFGKEIVMALTGATTGKIGVIPKNF-- 291 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKR--LPVL 368 +G + S + W + + + + + +L V L V Sbjct: 292 KGYVNQRVGLFYAKTELSYAVLWSILQQQNIITDLIELSSGSAQANLSPSSVNSYDLNVT 351 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + E I + + L I L E R + + ++G++ + Sbjct: 352 LKDLIELDKIISPLYELF--CFNL-----SEIQRLSELRDTLLPKLLSGELSV 397 >gi|226223148|ref|YP_002757255.1| specificity determinant HsdS [Listeria monocytogenes Clip81459] gi|225875610|emb|CAS04313.1| Putative specificity determinant HsdS [Listeria monocytogenes serotype 4b str. CLIP 80459] Length = 414 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 58/383 (15%), Positives = 128/383 (33%), Gaps = 27/383 (7%) Query: 25 WKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS--RQSDTSTV 78 W+ + + + YI + D++ + + + S D Sbjct: 20 WEQRKLGEIANSFEYGLNASSKTYDGENKYIRITDIDESSHVFNQDNLTSPDISLDNLNH 79 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +G IL + G K+ + + + L+ Sbjct: 80 YLLEEGDILLARTGASTGKSYCYNKIDGKVFFAGFLIRAKIKHEYNVSFIFQSTLTERYN 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I+ + + + + + IP L EQ I + ++D I R ++ Sbjct: 140 NFIQVTSQRSGQPGINAQEYARFALYIPKLKEQQKIGDF----FKQLDDTIALHQRKLDT 195 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--TKLI 252 LK+ K+ L+ + K K++ + + + W + + ++ KN + Sbjct: 196 LKQMKKGLLQQMFPKSEEDVPKIRFADFD------EEWYQRKLGEISDKVIEKNKESTYF 249 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVM 311 E+ S YG I Q+ ++ Y IV + V+ I ++ ++ Sbjct: 250 ETLTNSAEYGIISQREFFNKDISNEKNLNGYYIVRENDFVYNPRISNYAPVGPIKRNKLG 309 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQ---SLKFEDVKRLPV 367 GI++ Y + + ++L + G SG R ++K +K +P+ Sbjct: 310 RIGIVSPLYYVFRTFDTNQSFLEYYFDGTVWHNFMLLNGDSGARADRFAIKDSVLKEMPI 369 Query: 368 LVPPIKEQFDITNVINVETARID 390 + EQ I+ ++ T I+ Sbjct: 370 PYSTLYEQEKISFFLDEITIIIN 392 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 15/164 (9%), Positives = 49/164 (29%), Gaps = 5/164 (3%) Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I + + + + + +++ G+I+ K + Sbjct: 47 NKYIRITDIDESSHVFNQDNLTSPDISLDNLNHYLLEEGDILLARTGASTGKSYCYNKID 106 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLV 369 + A H + +++ + + + ++ R + + Sbjct: 107 GKVFFAGFLIRAKIKHEYNVSFIFQSTLTERYNNFIQVTSQRSGQPGINAQEYARFALYI 166 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P +KEQ I + ++D + ++ + LK+ + + Sbjct: 167 PKLKEQQKIGDF----FKQLDDTIALHQRKLDTLKQMKKGLLQQ 206 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 56/190 (29%), Gaps = 10/190 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + W + + + ES + + E ++ KD ++ ++ + I Sbjct: 225 EEWYQRKLGEISDKVIEKNKESTYFETLTNSAEYGIISQREFFNKDISNEKN-LNGYYIV 283 Query: 82 AKGQILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + +Y + GI S + V + D L+ + Sbjct: 284 RENDFVYNPRISNYAPVGPIKRNKLGRIGIVSPLYYVFRTFDTNQSFLEYYFDGTVWHNF 343 Query: 137 IEAICEGAT---MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + +P+P L EQ I + T+ I+ + + Sbjct: 344 MLLNGDSGARADRFAIKDSVLKEMPIPYSTLYEQEKISFFLDEITIIINLHQNKLKKLSS 403 Query: 194 LLKEKKQALV 203 L K Q + Sbjct: 404 LKKAYLQNMF 413 >gi|168178057|ref|ZP_02612721.1| type I restriction-modification system specificity subunit [Clostridium botulinum NCTC 2916] gi|182671430|gb|EDT83404.1| type I restriction-modification system specificity subunit [Clostridium botulinum NCTC 2916] Length = 377 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 62/403 (15%), Positives = 149/403 (36%), Gaps = 40/403 (9%) Query: 26 KVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + + + ++ G+ + GK ++ G++ K + T ++ Sbjct: 2 EYIKLGELSEFIMGQAPNSQYCNKKGKGTPFVKA-------GQFGVKYPIIDEWTTKSLK 54 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K +L +G K + I + + + L + + +I Sbjct: 55 KALKKDVLICVVGATAGKINLGCDCSIGRSVSAIRCNEKKLD-HVYLYYYLKTWITKIRQ 113 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 +G+ + + + +I +P+ L+EQ I + I+ T+ EL+K + Sbjct: 114 QSQGSAVGVITKEMLNDIIIPVVTLSEQNRIVTILDKAQFLINKRKTQIEALDELVKSR- 172 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + NP K + + +G + +R N++ NI L Sbjct: 173 --FIEMFGDPVKNPMKLPK-TPLSNIGQ----------WKTGGTPSRSNSEYYNGNIPWL 219 Query: 260 SYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 S G + + + E + +I++ G ++ D K ++ + I Sbjct: 220 SSGELNNIYCFNSDEMITELAIKESSAKIIEKGSLLLGMYDTAALKSTINMIECSCNQAI 279 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQ 375 AY + + +++ Y+ + ++ + + G+ +++L VK L +L+P +K Q Sbjct: 280 --AYAKLDENLVNTIYVYYCIQ--IGKDFYKSQQRGVRQKNLNLSMVKELEILMPELKLQ 335 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +N + L ++E+S+ L++ +S + A G+ Sbjct: 336 NQFADFVNQG----NTLKFEMEKSLKELEDNFNSLMQRAFKGE 374 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 53/191 (27%), Gaps = 10/191 (5%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ + TG T +I ++ ++ + + S+ I Sbjct: 190 TPLSNIGQWKTGGTPSRSNSEYYNGNIPWLSSGELNNIYCFNSDEMITELAIKESSAKII 249 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 KG +L G K+ I + C+ + + L + + ++ Sbjct: 250 EKGSLLLGMYDTAALKSTINMIECSCNQAIAYAKLDENLVNTIYVYYCIQIGKDFYKSQQ 309 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + + + + + +P L Q + + + + + Sbjct: 310 RGVRQKNLNLSMVKELEILMPELKLQNQFADFVNQGNTLKFEMEKSLKELEDNFN----S 365 Query: 202 LVSYIVTKGLN 212 L+ L Sbjct: 366 LMQRAFKGELF 376 >gi|313123147|ref|YP_004033406.1| type i site-specific deoxyribonuclease chain s [Lactobacillus delbrueckii subsp. bulgaricus ND02] gi|312279710|gb|ADQ60429.1| Type I site-specific deoxyribonuclease chain S [Lactobacillus delbrueckii subsp. bulgaricus ND02] Length = 471 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 64/415 (15%), Positives = 128/415 (30%), Gaps = 55/415 (13%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W+ V ++ +T++ + + + K + D S + Sbjct: 66 EIPDSWEWVRLEEIAYTIGNKTNQIKEKEVLPKGKFRVVSQSK---EKIIGYYDDESKLL 122 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I++G ++ G T+ + + ++ ++I Sbjct: 123 RVDGDCIVFGDHTALVKYIDFDFIIGADGTKVFKCFKRTDTKFIFYVLEFALQSIEKISG 182 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL----L 195 + N +P+PPLAEQ I +K+ ID E+ Sbjct: 183 YSRHYKY-------LKNKCLPLPPLAEQKRIVDKLDRIMPLIDEYAKSYTHLAEIDSSFN 235 Query: 196 KEKKQALVSYIVTKGLNPDVKMKD-------------------------------SGIEW 224 K++++ Y + L P S E Sbjct: 236 DRMKKSILQYAMEGKLVPQDPSDQPASELLAEIQQEKTQLVKEKKIKKTKPLPEISEDEI 295 Query: 225 VGLVPDHWEVKPFFA-LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP------ 277 + +P+ W K+ K + L + IQ + Sbjct: 296 LYEIPESWVWARLSDVTNYIQRGKSPKYSNDSDLYVLSQKCIQWSGISLEKARSVSSEFW 355 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + E Y+ V G++++ L R + +V + + + IDS YL Sbjct: 356 DKLEDYRFVQSGDLLWNSTGLGTVGRINIVDQEVAGYPVDSHVTIVRSSSLIDSRYLLRY 415 Query: 337 MRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + S + Y GS ++ L E ++++ V +PP+ EQ I + I+ + Sbjct: 416 LMSPVIQFNLSDYLTGSTKQKELGKESIEKILVPIPPLAEQKRIADKIDQIFDIL 470 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 72/207 (34%), Gaps = 23/207 (11%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +PD WE + + K ++ E +L ++ + + + +G + Sbjct: 59 SEDEIPFEIPDSWEWVRLEEIAYTIGNKTNQIKEKEVLPKGKFRVVSQSKEKIIGY-YDD 117 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 VD IVF D +L + I K T + + Sbjct: 118 ESKLLRVDGDCIVF------GDHTALVKYIDFDFIIGADGTKVFKCFKRTDTKFIFYVLE 171 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + L + G + +K + +PP+ EQ I + ++ RI L+++ +S Sbjct: 172 FALQSIEKISGYSRHY----KYLKNKCLPLPPLAEQKRIVDKLD----RIMPLIDEYAKS 223 Query: 400 IVLLKE--------RRSSFIAAAVTGQ 418 L E + S + A+ G+ Sbjct: 224 YTHLAEIDSSFNDRMKKSILQYAMEGK 250 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 62/177 (35%), Gaps = 11/177 (6%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGN 69 S + + IP+ W + T + G++ + + D+ + + ++ Sbjct: 291 SEDEILYEIPESWVWARLSDVTNYIQRGKSPKYSNDSDLYVLSQKCIQWSGISLEKARSV 350 Query: 70 SRQS--DTSTVSIFAKGQILYGKL--GPYLRKAIIADFDG---ICSTQFLVLQPKDVLPE 122 S + G +L+ G R I+ + S +V + Sbjct: 351 SSEFWDKLEDYRFVQSGDLLWNSTGLGTVGRINIVDQEVAGYPVDSHVTIVRSSSLIDSR 410 Query: 123 LLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L +L+S + + G+T + I I +PIPPLAEQ I +KI Sbjct: 411 YLLRYLMSPVIQFNLSDYLTGSTKQKELGKESIEKILVPIPPLAEQKRIADKIDQIF 467 >gi|328947424|ref|YP_004364761.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328447748|gb|AEB13464.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 444 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 57/430 (13%), Positives = 118/430 (27%), Gaps = 38/430 (8%) Query: 16 QWIGAI-PKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDV--ESGTGKYLP 65 + I + P + G T ++ + ++ E+ T + Sbjct: 6 ELINELCPDGVEYRLFFDVCNYIRGITYNKNDEVNNDSYGIEVLRANNITLETNTLNFDD 65 Query: 66 KDGNSRQSDTSTVSIFAKGQILY-----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 S K IL K I AD + V++PK Sbjct: 66 VKIISENVKIKETQWLKKNDILICAGSGSKEHIGKVAYIFADTNITFGGFMAVVRPKIEN 125 Query: 121 PELLQGWL----LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + +T+++ + N +P+PPL Q I + + Sbjct: 126 FSTRFLFHILTSDMFKRHLAKVSAASSSTINNINNDTWKNFQIPVPPLPVQEEIVRILDS 185 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 T L E + + + AL++ +G G P Sbjct: 186 FTELTAELTAELTARRKQYEYYRDALLT----------PPFGSAGSPINGTFPVVKLKDI 235 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY----QIVDPGEIV 292 + ++ + + YG I + Y + + G+I+ Sbjct: 236 ATEFYRGSGIRRDEITAEGVPCVRYGEIYTTYNISFEKCVSHTKLEYVQSPKYFEHGDIL 295 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 F + + A + + V H + YLA ++ + + Sbjct: 296 FAITGENIEDIAKSVAYTGNEKCLAGGDIVVMKHNQNPRYLAHVLATTEARIQKGKGKVK 355 Query: 353 LRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RR 407 + ++ + + +P + Q NV++ A L + I K+ R Sbjct: 356 SKVVHSSIPSIQEIEIPLPSLDVQERWANVLDNFDAICSDLKIGLPAEIDARKKQYEYYR 415 Query: 408 SSFIAAAVTG 417 + A G Sbjct: 416 DLLLTFAERG 425 >gi|332663457|ref|YP_004446245.1| restriction modification system DNA specificity domain-containing protein [Haliscomenobacter hydrossis DSM 1100] gi|332332271|gb|AEE49372.1| restriction modification system DNA specificity domain protein [Haliscomenobacter hydrossis DSM 1100] Length = 390 Score = 97.2 bits (240), Expect = 5e-18, Method: Composition-based stats. Identities = 63/404 (15%), Positives = 132/404 (32%), Gaps = 35/404 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W++V ++ + +G +S K + I + D++ G + + V Sbjct: 3 WEMVKLEELITILSGFAFDSKLFSNQKGVPLIRIRDIKRGFSE------TYYEGKFDAVF 56 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIE 138 + G IL G G + A + D + + + + D + + IE Sbjct: 57 VVKNGDILIGMDGEF-NIAEWSGQDALLNQRVCKINSVDTSRLDKRYLLHFLPQELKFIE 115 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 T+ H K I +I +P+PPLA Q I D L + ++ E Sbjct: 116 DKASFVTVKHLSVKDIKSIQIPLPPLATQKRIAA----ILDAADALRRKDHALLQKYAEL 171 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 QA+ V NP K + +G + E KN E +L Sbjct: 172 AQAI---FVDMFGNPVKNEKGWEVSSMGNIILDIEAGS----SFGGEDKNLDKDELGVLK 224 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 +S +K + I ++ G+ +F + + + + Sbjct: 225 VSAVTSGTFKPQEYKAVKKDRINKKIIKLNKGDFLFSRANTRELVGATCLVDQNYDHLFL 284 Query: 318 SAYMA---VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPP 371 + D ++ ++ ++ +G ++ + +K L +++PP Sbjct: 285 PDKIWKISFHLDKTDPIFIKHILSQKEVRYELNKTATGTSGSMLNISMQKLKELSIVLPP 344 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ Q + +I + +QS + + S + A Sbjct: 345 VELQRNFGKIIQKMSEN----SGFAKQSNMKSETLFQSLLQKAF 384 >gi|170021702|ref|YP_001726656.1| restriction modification system DNA specificity subunit [Escherichia coli ATCC 8739] gi|169756630|gb|ACA79329.1| restriction modification system DNA specificity domain [Escherichia coli ATCC 8739] Length = 585 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 67/489 (13%), Positives = 128/489 (26%), Gaps = 93/489 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 +K K P+ S + +P+ W+ I G +S + G+ V+ G Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWARINDIASFTNGYAFKSS-EFQNSGVGIVKIGD 139 Query: 61 GKYLPKDGNSRQSDTSTVSI--------FAKGQILYGKLGPYLRKAIIADFDGICSTQFL 112 + S S I G ++ G K Sbjct: 140 IDSSGFISTAGMSYVSEKKINVLPEEMRVNPGDMVIAMSGATTGKLGFNKTKSTFLLNQR 199 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV---- 168 V + + + + +I G+ + + I NI +PIPP EQV Sbjct: 200 VGKIVTYSVDKEFIYHYLSTRIEENLSISLGSAIPNISTAQINNIIIPIPPSDEQVKIIA 259 Query: 169 -------------------------------------LIREKIIAETVRIDTLITERIRF 191 E++ RI Sbjct: 260 RVKLLISLCDQLEQQSLTSQDAHQQLVETLLGTLTDSQNAEELAENWARISEHFDTLFTT 319 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKD-----------------------------SGI 222 + KQ ++ V L P + S Sbjct: 320 EASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKPLPPISDE 379 Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNT---------KLIESNILSLSYGNIIQKLETRNM 273 E +P+ WE F ++ + + ++ + E + + Sbjct: 380 EKPFELPEGWEWCLFEDIIDIQSGITKGRNLSNRTLVKVPYLRVANVQRGYLDLTEIKQI 439 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTY 332 + E E YQ+V ++ D R+ + +D + Sbjct: 440 EIPIEEKEKYQVVKGDLLITEGGDWDTVGRTTVWCHDWYIANQNHVFKGRNIGQYVDPYW 499 Query: 333 LAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L M S + F + + S+ ++ PV +PP E I + +++ + Sbjct: 500 LETYMNSPFSRQYFANASKQTTNLASINKTQLRGCPVAIPPSSEAKKIMSKLHIFYKLCE 559 Query: 391 VLVEKIEQS 399 L I+ + Sbjct: 560 ELKNHIQSA 568 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 56/202 (27%), Gaps = 13/202 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+ + + +G T + Y+ + +V+ G Sbjct: 384 ELPEGWEWCLFEDIIDIQSGITKGRNLSNRTLVKVPYLRVANVQRGYLDLTEIKQIEIPI 443 Query: 74 DTSTVSIFAKGQILYGKLGPY---LRKAIIADFDGICSTQ--FLVLQPKDVLPELLQGWL 128 + KG +L + G + R + I + F + Sbjct: 444 EEKEKYQVVKGDLLITEGGDWDTVGRTTVWCHDWYIANQNHVFKGRNIGQYVDPYWLETY 503 Query: 129 LSIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 ++ A + ++ + + P+ IPP +E I K+ + L Sbjct: 504 MNSPFSRQYFANASKQTTNLASINKTQLRGCPVAIPPSSEAKKIMSKLHIFYKLCEELKN 563 Query: 187 ERIRFIELLKEKKQALVSYIVT 208 + AL V Sbjct: 564 HIQSAQQTQLHLADALTDAAVN 585 Score = 45.9 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 70/200 (35%), Gaps = 19/200 (9%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLK 276 S E +P+ WE + + N K+++ S + + G+I G+ Sbjct: 93 SEEEKPFELPEGWEWARINDIASFTNGYAFKSSEFQNSGVGIVKIGDIDSSGFISTAGMS 152 Query: 277 PESYET------YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 S + V+PG++V K + ++ + + +D Sbjct: 153 YVSEKKINVLPEEMRVNPGDMVIAMSGATTGKLGFNKTKST--FLLNQRVGKIVTYSVDK 210 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 ++ + + + ++GS ++ + + + +PP EQ I + + + D Sbjct: 211 EFIYHYLSTRIEENLSISLGS-AIPNISTAQINNIIIPIPPSDEQVKIIARVKLLISLCD 269 Query: 391 VLVEK-------IEQSIVLL 403 L ++ +Q + L Sbjct: 270 QLEQQSLTSQDAHQQLVETL 289 >gi|331002083|ref|ZP_08325602.1| hypothetical protein HMPREF0491_00464 [Lachnospiraceae oral taxon 107 str. F0167] gi|330411177|gb|EGG90593.1| hypothetical protein HMPREF0491_00464 [Lachnospiraceae oral taxon 107 str. F0167] Length = 405 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 71/406 (17%), Positives = 137/406 (33%), Gaps = 36/406 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG---KYLPKDGNSRQSDTS--- 76 + W+ + T KD G+ +++G +YL K N + Sbjct: 14 EDWEQRKLSSLCDKFTDGDWIEAKDQSNSGVRLIQTGNVGVAEYLDKPNNKKWISNDTFE 73 Query: 77 --TVSIFAKGQILYGKLGPYLRKAIIADF----DGICSTQFLVLQPKDVLPELLQGWLLS 130 +G IL +L +A I +V D + L +L S Sbjct: 74 ALNCEEVFEGDILISRLPEPAGRACIIPKLASKMITAVDCTIVRVSNDTSNKYLLQYLSS 133 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + G T G+ N + IP + E I A +D LIT R Sbjct: 134 QKYFDEVNTCLAGGTRQRISRSGLANFDVAIPVKKSEQ---EAIGAYFSNLDHLITLHQR 190 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 +E LK K++++ + K +++ SG + WE + TE + Sbjct: 191 KLEKLKIIKKSMLENLFPKNGENTPRIRFSG------FTEDWEQRKLGECFTERIE--SM 242 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I + + E + Y+ V G+I + + + Sbjct: 243 PDGELISVTINDGVKKFSELGRHDNSNDDKSKYKKVCIGDIAYNSMRMWQGASGYS---- 298 Query: 311 MERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLP 366 GI++ AY + + +S ++A+ + + F G+ +LKF + + Sbjct: 299 YYNGIVSPAYTVLSANYNVNSKFIAYQFKLPKMIHTFKINSQGITSDNWNLKFPVLSYIE 358 Query: 367 VLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + + I+EQ I + +D L+ + + L++ + S + Sbjct: 359 IYISKQIEEQSKIAVFLES----LDHLITLHQSKLEKLQKIKKSML 400 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 61/185 (32%), Gaps = 4/185 (2%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + + ++I + + D + D + D S Sbjct: 224 EDWEQRKLGECFTERIESMPDG--ELISVTINDGVKKFSELGRHDNS--NDDKSKYKKVC 279 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G I Y + + + + ++GI S + VL + + + I Sbjct: 280 IGDIAYNSMRMWQGASGYSYYNGIVSPAYTVLSANYNVNSKFIAYQFKLPKMIHTFKINS 339 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 S + +++Q+ + KI +D LIT +E L++ K+++ Sbjct: 340 QGITSDNWNLKFPVLSYIEIYISKQIEEQSKIAVFLESLDHLITLHQSKLEKLQKIKKSM 399 Query: 203 VSYIV 207 + + Sbjct: 400 LESMF 404 >gi|319952390|ref|YP_004163657.1| restriction modification system DNA specificity domain protein [Cellulophaga algicola DSM 14237] gi|319421050|gb|ADV48159.1| restriction modification system DNA specificity domain protein [Cellulophaga algicola DSM 14237] Length = 427 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 69/416 (16%), Positives = 141/416 (33%), Gaps = 42/416 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS---- 73 W++ + K T + K + I D+ + Y + Sbjct: 23 EWRMSQFGKLYKFYTTNSFSRDKLNYESGKVKNIHYGDIHTKFQSYFYLNNEYVPFVNDD 82 Query: 74 -DTSTVS---IFAKGQILYGK-------LGPYLRKAIIADFDGICSTQFLVLQP--KDVL 120 D S + G ++ +G + I D I + +P K+ Sbjct: 83 LDLSKIKDEAFCKIGDLIIADASEDYADIGKTIEIIDINDEKVIAGLHTFLARPFSKETY 142 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + L S ++ ++I I +G + + + P L EQ I + Sbjct: 143 IGFISYLLKSWNLRKQIMTIAQGTKVLGLSMGRFSQLKLNTPSLPEQQKIASFL----SA 198 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 +D I + + LL++ K+ ++ + + L E G PD E K L Sbjct: 199 VDEKIQQLNKKKTLLEQYKKGVMQQLFSGDL-------RFKDENGGDFPDWEENKQLGTL 251 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP--GEIVFRFIDL 298 ++ +KN I+ I S++ + + GL + F + Sbjct: 252 TYKVGKKNKNNIQYPIYSINNQEGFRPQSEQFDGLDSNDRGYDISLYKIVDAETFAYNPA 311 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SGLRQS 356 + + S+ + ++R I++S Y+ K + +Y K G+RQ Sbjct: 312 RINVGSIGYSYDLKRVIVSSLYVCFKTKDTLEDLFLLAYLDTYSFQKDILRYEEGGVRQY 371 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 L +++ + + +P +EQ I N ++ ID +E + Q I + + + Sbjct: 372 LFYDNFSHIKIPLPTTQEQQKIANYLSA----IDTKIETVNQQINKTQAFKKGLLQ 423 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 31/228 (13%), Positives = 72/228 (31%), Gaps = 18/228 (7%) Query: 211 LNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGN 263 L P ++ K+ EW G + + F KN + + SY Sbjct: 11 LVPKLRFKEFDGEWRMSQFGKLYKFYTTNSFSRDKLNYESGKVKNIHYGDIHTKFQSYFY 70 Query: 264 IIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER-----GIIT 317 + + N L + G+++ + + Sbjct: 71 LNNEYVPFVNDDLDLSKIKDEAFCKIGDLIIADASEDYADIGKTIEIIDINDEKVIAGLH 130 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQF 376 + ++++L++S++L K + G + L +L + P + EQ Sbjct: 131 TFLARPFSKETYIGFISYLLKSWNLRKQIMTIAQGTKVLGLSMGRFSQLKLNTPSLPEQQ 190 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 I + ++ +I L + LL++ + + +G + + E Sbjct: 191 KIASFLSAVDEKIQQL----NKKKTLLEQYKKGVMQQLFSGDLRFKDE 234 >gi|307721264|ref|YP_003892404.1| restriction modification system DNA specificity domain-containing protein [Sulfurimonas autotrophica DSM 16294] gi|306979357|gb|ADN09392.1| restriction modification system DNA specificity domain protein [Sulfurimonas autotrophica DSM 16294] Length = 412 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 51/421 (12%), Positives = 128/421 (30%), Gaps = 44/421 (10%) Query: 20 AIPK--------HWKVVPIKRFTK--LNTGRTSESGK---DIIYIGLEDVES-GTGKYLP 65 +P+ W + +K + G ++ K I ++D+ G Sbjct: 6 KVPELRFAEFSGEWDEKQLIELSKNGFSNGAFNDPKKAGHGYRIINVKDMYIDGRINISN 65 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD------FDGICSTQFLVLQPKDV 119 + + G I + + ++ D + ++P Sbjct: 66 LLRVALDEKEFLKNRVEYGDIFFTRSSLVKEGIAYSNINLNNANDLTFDGHLIRMRPNKQ 125 Query: 120 LPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L + + R + I G T M+ + I ++ + +P EQ I + + Sbjct: 126 NYSPLFLYYNFTTLYARKQFIIRGKTTTMTTIGQEDIASVKIVLPSKLEQEKIAFFLSSV 185 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 +I+ L ++ + K Q + S + + + P+ W K Sbjct: 186 DSKIEQLSKKKTLLEQYKKGVMQKIFSQELRFKDDDES-----------EFPE-WVEKQL 233 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + RK K E+ + + + + + E +V +++ Sbjct: 234 GDFLILTLRKVPKPTENYLAIGIRSHCKGTFQKPDSEPHKIAMEKLFLVKENDLIVSITF 293 Query: 298 LQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 ++ + + G+++ + + +++ + + G Sbjct: 294 AWESAIAIVKKE-DKNGLVSHRFPTYTFDEKIATHEFFKYVIIQKKFRFMLDLISPGGAG 352 Query: 356 S---LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +D L +P IKEQ I N + + +D + + + KE + + + Sbjct: 353 RNRVMSKKDFLTLKWNMPCIKEQTKIANFL----SSLDKKIALTNKELDATKEFKKALLQ 408 Query: 413 A 413 Sbjct: 409 K 409 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 27/219 (12%), Positives = 73/219 (33%), Gaps = 9/219 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+++ + EW F +K + Y + + Sbjct: 8 PELRFAEFSGEWDEKQLIELSKNGFSNGAFNDPKKAGHGYRIINVKDMYIDGRINISNLL 67 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITSAYMAVKPHGID- 329 E V+ G+I F L + + + + + ++P+ + Sbjct: 68 RVALDEKEFLKNRVEYGDIFFTRSSLVKEGIAYSNINLNNANDLTFDGHLIRMRPNKQNY 127 Query: 330 -STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 +L + + K F G + ++ ED+ + +++P EQ I ++ + Sbjct: 128 SPLFLYYNFTTLYARKQFIIRGKTTTMTTIGQEDIASVKIVLPSKLEQEKIAFFLSSVDS 187 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +I+ L + LL++ + + + ++ + + + Sbjct: 188 KIEQL----SKKKTLLEQYKKGVMQKIFSQELRFKDDDE 222 >gi|324115000|gb|EGC08965.1| type I restriction modification DNA specificity domain-containing protein [Escherichia fergusonii B253] Length = 402 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 37/386 (9%), Positives = 103/386 (26%), Gaps = 24/386 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + + TG + D + Y + + + Sbjct: 17 EWSNLGNLCDIFTGGEAPQKHIKGDTPTSDYQ-----YPIYGNGAEIYGYADSYRIGQDA 71 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA----IC 141 + +G + V+ PK + T ++ Sbjct: 72 VTISSIGANTGTIYFRKAFFTPIIRLKVVIPKHSWLLPRYLFHYLSSQTINSKSSSVPNM 131 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + + LA Q I + T L E + + Sbjct: 132 NASDVKKLSIPIPCPNNPE-KSLAIQSEIVRILDKFTALTAELTAELNMRKKQYNYYRDQ 190 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+S P + M G + +G + + + Y Sbjct: 191 LLS--FDNEDVPHLPM---GQKDIGEFIRGGTFQKKDFM--------DAGVGCIHYGQIY 237 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + + + G ++ ++ A + I S+ Sbjct: 238 TYYGTYTKKTKTHISAALAKKCKKAQKGNLIIATTSENDEDVCKAVAWLGSDDIAVSSDA 297 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITN 380 + H ++ Y+++ ++ +G + + +++ ++ + VP + Q I + Sbjct: 298 CIYKHNLNPKYVSYYFQTEQFQNQKRQYITGAKVRRVNADNLSKILIPVPSMAVQERIVS 357 Query: 381 VINVETARIDVLVEKIEQSIVLLKER 406 +++ + + E + + I L +++ Sbjct: 358 ILDKFDTLTNSITEGLPREIELRQKQ 383 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 48/161 (29%), Gaps = 12/161 (7%) Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + G E Y G+ + + ++ + II Sbjct: 38 IKGDTPTSDYQYPIYGNGAEIYGYADSYRIGQDAVTISSIGANTGTIYFRKAFFTPIIRL 97 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------P 371 + K + YL + S + S ++ DVK+L + +P Sbjct: 98 KVVIPKHSWLLPRYLFHYLSSQTI-----NSKSSSVPNMNASDVKKLSIPIPCPNNPEKS 152 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + Q +I +++ TA L ++ R ++ Sbjct: 153 LAIQSEIVRILDKFTALTAELTAELNMRKKQYNYYRDQLLS 193 >gi|51893047|ref|YP_075738.1| type I restriction-modification system specificity determinant protein [Symbiobacterium thermophilum IAM 14863] gi|51856736|dbj|BAD40894.1| type I restriction-modification system specificity determinant protein [Symbiobacterium thermophilum IAM 14863] Length = 400 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 50/383 (13%), Positives = 118/383 (30%), Gaps = 28/383 (7%) Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 +P G + + + ++ G+ G Y + + T F + PK L Sbjct: 20 VPVYGTNGPIGWTNKPLCPFPTVIIGRKGAYRGVHLSPSPCWVIDTAFYIS-PKQPLDIR 78 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + +TQ I + G+ + + +P+ +PPLA Q I + + RI Sbjct: 79 WAYYQ---LLTQDINGMDSGSAIPSTSREEFYRLPVKVPPLAVQKQIADVLGTLDSRIAN 135 Query: 184 LITERIRFIELLKEKKQALVS-----YIVTKGLNPD--------VKMKDSGIEWVGLVPD 230 + + I + + ++ +G +P+ ++ +G +P Sbjct: 136 VQSTNICLESIGQAIFKSWFVDFDPVRAKAEGRDPEGVDEDTAAWFPEEFQDSELGPIPK 195 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 W V ++++ + E + + + + I + G Sbjct: 196 GWRVDTIDSVISCVGGSTPSTKEPAYWNPPEYHWVTPKDLSGQSTPVLLTTERMISEAGL 255 Query: 291 IVFRFIDL-------QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 L + A I ++A+ P G S Y+L Sbjct: 256 KKISSGLLPEGTLLLSSRAPIGYLAITKIPTAINQGFIAMPPAGQLSPEYMLFWSHYNLD 315 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + +++ ++VPP + N + + E+ + L Sbjct: 316 TIKQHANGSTFMEISKAAFRKIKLVVPP----AQLVNRFTQIAQTVLERIAANERYRMQL 371 Query: 404 KERRSSFIAAAVTGQIDLRGESQ 426 R + + + G++ + + Sbjct: 372 VNLRDTLLPRLIAGKLRVPEAEE 394 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 61/206 (29%), Gaps = 15/206 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGT 60 +++DS +G IPK W+V I G T + + + ++ +D+ + Sbjct: 183 EEFQDSE---LGPIPKGWRVDTIDSVISCVGGSTPSTKEPAYWNPPEYHWVTPKDLSGQS 239 Query: 61 GKYLPKDGNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 L + + +G +L P I + F+ + P Sbjct: 240 TPVLLTTERMISEAGLKKISSGLLPEGTLLLSSRAPI-GYLAITKIPTAINQGFIAMPPA 298 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L S I+ G+T I + +PP + Sbjct: 299 GQL-SPEYMLFWSHYNLDTIKQHANGSTFMEISKAAFRKIKLVVPPAQLVNRFTQIAQTV 357 Query: 178 TVRIDTLITERIRFIELLKEKKQALV 203 RI R++ + L L+ Sbjct: 358 LERIAANERYRMQLVNLRDTLLPRLI 383 >gi|269978344|gb|ACZ55906.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 420 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 50/410 (12%), Positives = 119/410 (29%), Gaps = 34/410 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + +F L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFTFLSKKANCDIALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T + Sbjct: 132 LLGEWCKNNINVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTE-----LNA 186 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL-----VPDHWEVKPFFALVTELNR 246 + + Q ++ N S + + P E + ++ Sbjct: 187 RKKQYQYYQNMLLDFNDINQNHKDAKIKSYPKRLKTLLQTLAPKGVEFRKLGEVLEYDQP 246 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + ++ +T +G E YQ ++ + + + Sbjct: 247 NQYCVTSKEFDKSYPTPVLTAGKTFILGYTNEKDNIYQASKNAPVII----FDDFTTATQ 302 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + ++ + + + I + + + G RQ + +L Sbjct: 303 WVDFPFKVKSSAMKILLPKNPIINIRFIFFYMQTIPYNI---SGEHTRQWISR--YSQLA 357 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + +PP++ Q +I +++ +A L+ I I K+ R + Sbjct: 358 IPIPPLEIQQEIVKILDQFSALTTDLLAGIPAEIKARKKQYEYYREKLLT 407 >gi|17988795|ref|NP_541428.1| type I restriction-modification system specificity subunit [Brucella melitensis bv. 1 str. 16M] gi|225686603|ref|YP_002734575.1| type I restriction enzyme specificity protein [Brucella melitensis ATCC 23457] gi|256043714|ref|ZP_05446637.1| Type I restriction enzyme specificity protein [Brucella melitensis bv. 1 str. Rev.1] gi|256111243|ref|ZP_05452274.1| Type I restriction enzyme specificity protein [Brucella melitensis bv. 3 str. Ether] gi|256262258|ref|ZP_05464790.1| type I restriction-modification system protein [Brucella melitensis bv. 2 str. 63/9] gi|260564901|ref|ZP_05835386.1| type I restriction-modification system protein [Brucella melitensis bv. 1 str. 16M] gi|265990136|ref|ZP_06102693.1| predicted protein [Brucella melitensis bv. 1 str. Rev.1] gi|265992756|ref|ZP_06105313.1| predicted protein [Brucella melitensis bv. 3 str. Ether] gi|17984613|gb|AAL53692.1| type i restriction-modification system specificity subunit [Brucella melitensis bv. 1 str. 16M] gi|225642708|gb|ACO02621.1| Type I restriction enzyme specificity protein [Brucella melitensis ATCC 23457] gi|260152544|gb|EEW87637.1| type I restriction-modification system protein [Brucella melitensis bv. 1 str. 16M] gi|262763626|gb|EEZ09658.1| predicted protein [Brucella melitensis bv. 3 str. Ether] gi|263000805|gb|EEZ13495.1| predicted protein [Brucella melitensis bv. 1 str. Rev.1] gi|263091974|gb|EEZ16280.1| type I restriction-modification system protein [Brucella melitensis bv. 2 str. 63/9] gi|326410993|gb|ADZ68057.1| type I restriction enzyme specificity protein [Brucella melitensis M28] gi|326554284|gb|ADZ88923.1| type I restriction enzyme specificity protein [Brucella melitensis M5-90] Length = 407 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 120/416 (28%), Gaps = 38/416 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W+ V I + + + R S DI + + + +T Sbjct: 4 EVPKGWREVRIGQIAREISNRNHASA-DIPVLSMTKHRGFVRSNEYFSKSVHSENTRQYK 62 Query: 80 IFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---- 133 + +GQ Y + + G+ S + V + Sbjct: 63 VVKRGQFAYATIHLDEGSIDYLRNEDAGLISPMYTVFETNSEEINNEIALRQFKRFALSG 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + +PPL EQ I E + A I + I+ Sbjct: 123 RFDPYSNGGVNRRKSILFSDLSAFKFGLPPLTEQRAIAEVLGAAEAA----IAKTEALIK 178 Query: 194 LLKEKKQALV-SYIVTKGLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKL 251 +++ K+AL+ Y V + + W+ G P + Sbjct: 179 AIEQTKKALLKQYFVERQQSLLWSCVAKMGRWLSGGTPATA---------------AEEN 223 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETY-QIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + +I + +I + + E +V PG ++ + RS+ S Sbjct: 224 WKGSIPWVCPKDIKGPSISSTVDHISEDAAKALGMVGPGTLLLVVRGMIL-ARSVPSTIC 282 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 R A P+ + L + + L G + E + PV Sbjct: 283 TVRCAFNQDVKAFVPNEGVAPAFLKLWLDINEHKLLGEIETATHGT-KRFPLERLNEFPV 341 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 V EQ + + R+ ++ L+ R + ++G+I L Sbjct: 342 PVVTRDEQIRLVTLAESSQERLRS----ERDNLSALRSVRDALAQELLSGRIRLPE 393 >gi|49484056|ref|YP_041280.1| type I restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus MRSA252] gi|282904386|ref|ZP_06312274.1| type I restriction-modification enzyme, S subunit, EcoA family [Staphylococcus aureus subsp. aureus C160] gi|282906210|ref|ZP_06314065.1| type I restriction enzyme S subunit [Staphylococcus aureus subsp. aureus Btn1260] gi|282911435|ref|ZP_06319237.1| type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus WBG10049] gi|282919572|ref|ZP_06327307.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus C427] gi|283958566|ref|ZP_06376017.1| type I restriction-modification enzyme, S subunit, EcoA family [Staphylococcus aureus subsp. aureus A017934/97] gi|295428387|ref|ZP_06821016.1| type I restriction enzyme [Staphylococcus aureus subsp. aureus EMRSA16] gi|49242185|emb|CAG40887.1| putative type I restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus MRSA252] gi|282317382|gb|EFB47756.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus C427] gi|282325130|gb|EFB55440.1| type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus WBG10049] gi|282331502|gb|EFB61016.1| type I restriction enzyme S subunit [Staphylococcus aureus subsp. aureus Btn1260] gi|282596004|gb|EFC00968.1| type I restriction-modification enzyme, S subunit, EcoA family [Staphylococcus aureus subsp. aureus C160] gi|283790715|gb|EFC29532.1| type I restriction-modification enzyme, S subunit, EcoA family [Staphylococcus aureus subsp. aureus A017934/97] gi|295127787|gb|EFG57424.1| type I restriction enzyme [Staphylococcus aureus subsp. aureus EMRSA16] gi|315195724|gb|EFU26111.1| putative type I restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus CGS00] Length = 384 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 46/393 (11%), Positives = 111/393 (28%), Gaps = 30/393 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + K+N+G+ + +E G G + Sbjct: 20 EWEEKKLGDLIKVNSGKDYK-----------HLEKGDIPVYGTGGYMTSVSEP---LSEI 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + ++ T F K+ + + E Sbjct: 66 DAVGIGRKGTINKPYLLEAPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I I +P EQ I E I +I+ + + K Q + Sbjct: 122 TGVPSLSKQTINKINRFVPSNKEQQKIGEFFIKLDRQIELEEQKLELLQQQKKGYMQKIF 181 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + + + + + N K + +I + + Sbjct: 182 SQELRFKDENGNDYPNWEEKKIEDI------ASQVYGGGTPNTKIKEFWNGDIPWIQSSD 235 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + K S + ++ I I + + V + ++++ Sbjct: 236 VKVNDLILRQCNKFISKNSIELSSAKLIPANSIAIVTRVGVGKLCLVEFDYATSQDFLSL 295 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVI 382 D Y + + Y + K+ + + + +++ + +P ++EQ I + Sbjct: 296 SSLKYDKLYSLYSLL-YTMKKISANLQGTSIKGITKKELLDSIIKIPHNLEEQQKIGD-- 352 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +ID + + I +LK + + Sbjct: 353 --LFYKIDKYISFNKCKIEILKSLKQGLLQKIF 383 >gi|307825363|ref|ZP_07655582.1| restriction modification system DNA specificity domain protein [Methylobacter tundripaludum SV96] gi|307733538|gb|EFO04396.1| restriction modification system DNA specificity domain protein [Methylobacter tundripaludum SV96] Length = 165 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 56/153 (36%), Gaps = 4/153 (2%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI- 328 + + + ++ +++F + + I + + Sbjct: 12 DKLVYSDDDCEIDKYFLNNNDVLFNRTNSPELVGKTAIYKAEMPAIFAGYLIRIHRKENL 71 Query: 329 -DSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 D+ YL + + S + + S + ++ + +K P+ +PP+KEQ I I+ Sbjct: 72 LDADYLNYFLNSKIAKEYGKTVVISSVNQANINGQKLKSYPIPLPPLKEQQAIVVKISAL 131 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + L +Q + L E + S + A +G+ Sbjct: 132 SEETQRLESIYQQKLAALDELKKSLLHQAFSGE 164 Score = 37.5 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 57/166 (34%), Gaps = 8/166 (4%) Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICST 109 + +++SG + + + + +L+ + + AI Sbjct: 1 MGNIQSGRFVWDKLVYSDDDCEIDKYFL-NNNDVLFNRTNSPELVGKTAIYKAEMPAIFA 59 Query: 110 QFLVLQPKDVL----PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 +L+ + L I + ++ + + + + P+P+PPL Sbjct: 60 GYLIRIHRKENLLDADYLNYFLNSKIAKEYGKTVVISSVNQANINGQKLKSYPIPLPPLK 119 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 EQ I KI A + L + + + L E K++L+ + L Sbjct: 120 EQQAIVVKISALSEETQRLESIYQQKLAALDELKKSLLHQAFSGEL 165 >gi|304383192|ref|ZP_07365665.1| type I site-specific deoxyribonuclease [Prevotella marshii DSM 16973] gi|304335663|gb|EFM01920.1| type I site-specific deoxyribonuclease [Prevotella marshii DSM 16973] Length = 444 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 59/380 (15%), Positives = 125/380 (32%), Gaps = 5/380 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W + L +GR + + + +G + T+ ++ Sbjct: 67 DVPNGWCKTALSEIITLLSGRDLQPTQYNSFEKGIPYITGASNIDNNTIIINRWTTAPIT 126 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I KG +L G + A + G + + + P + + ++ Sbjct: 127 ISHKGDLLITCKGTIGKLAF--NSVGDLHIARQFMSLQFIEPLVSKYLFYCLEERISAIK 184 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + D I N + +PPLAEQ I +I IDT+ + +K+ K Sbjct: 185 QMDNGLIPGIDRSIILNQIIQLPPLAEQYRIVAEIERWFALIDTIEKSKEGLETAIKQTK 244 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + L P + E + + ++ +L R+ + + ++ L Sbjct: 245 SKILDLAIHGKLVPQDPKDEPASEQLRRINPKAKITCDNGHYAQLPREWSVISMQDVCKL 304 Query: 260 SYGNIIQKLETRNMGLKP-ESYETYQIVDPGEIVF--RFIDLQNDKRSLRSAQVMERGII 316 G + N+ +K +++D G+ V ++ L + + S + G Sbjct: 305 KDGIKLDSTPLINLDVKYLRGTSAGKVIDSGKFVTANSYMILVDGENSGEVFKTPIDGYQ 364 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 S + + + + + L + K + V +PP EQ Sbjct: 365 GSTFKLLDIDQNIDEKYILNVINLHRKALRENKVGSAIPHLNKKLFKAISVPLPPYNEQV 424 Query: 377 DITNVINVETARIDVLVEKI 396 I I +D L E + Sbjct: 425 RIVEAIKSTFNLLDALKENL 444 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 69/194 (35%), Gaps = 7/194 (3%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILS--LSYGNIIQKLETRNMGLKPESYETYQ 284 VP+ W ++T L+ ++ + + N + Y ++ + + + Sbjct: 67 DVPNGWCKTALSEIITLLSGRDLQPTQYNSFEKGIPYITGASNIDNNTIIINRWTTAPIT 126 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 I G+++ L V + I + S YL + + + Sbjct: 127 ISHKGDLLITCKGTIGK---LAFNSVGDLHIARQFMSLQFIEPLVSKYLFYCL--EERIS 181 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 M +GL + + + +PP+ EQ+ I I A ID + + E +K Sbjct: 182 AIKQMDNGLIPGIDRSIILNQIIQLPPLAEQYRIVAEIERWFALIDTIEKSKEGLETAIK 241 Query: 405 ERRSSFIAAAVTGQ 418 + +S + A+ G+ Sbjct: 242 QTKSKILDLAIHGK 255 >gi|294339640|emb|CAZ88000.1| putative Type I Restriction modification protein [Thiomonas sp. 3As] Length = 396 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 62/366 (16%), Positives = 124/366 (33%), Gaps = 10/366 (2%) Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 G+ N + + F I+ G+ G + + T + + Sbjct: 26 EGEVPVYGSNGITGTHNAANTFGP-AIIVGRKGSFGKVTWTDVPSFCIDTAYFI---DSR 81 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + WL T ++ E + + + +PP EQ I + +T Sbjct: 82 STKASLRWLYWSLQTLGLDEHSEDTGVPGLSREKAYQAKLKLPPSVEQERISNFLDEKTA 141 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 RID LI E+ R +E L+E +++S G N + V PF + Sbjct: 142 RIDALIAEKERLVEKLEEHWASVIS--TELGANETEGKHAWTTIPLKYVTVARCDGPFGS 199 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID-- 297 +T + + + ++ +G TY V G+I+ + Sbjct: 200 ALTSAHYVDEGARVIRLQNIRFGEFDSTDAAFIDDDYFARELTYHSVLEGDILIAGLGDE 259 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 R+ + + ++ + + + ++A + + RQ Sbjct: 260 KNFVGRACVAPNLGSNALVKADCFRFRVDTKRVLPKFVALQLSATAQRDGGLLSSGSTRQ 319 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + + +P + EQ DI + L+ + I L+E RSS ++AAV Sbjct: 320 RIPLTVTECRLLCLPALAEQIDIVERLERRKREHSTLLHHTAEHIARLREYRSSLVSAAV 379 Query: 416 TGQIDL 421 TGQ+++ Sbjct: 380 TGQLNV 385 >gi|241762636|ref|ZP_04760708.1| restriction modification system DNA specificity domain protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241372774|gb|EER62486.1| restriction modification system DNA specificity domain protein [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 419 Score = 96.8 bits (239), Expect = 7e-18, Method: Composition-based stats. Identities = 61/405 (15%), Positives = 123/405 (30%), Gaps = 32/405 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W T +G + + + G Y P S Sbjct: 4 LPQGWIQTTFADITNQRSGNSKLVKGKLES------QESNGLY-PAFSASGPDVWRDAFE 56 Query: 81 FAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + I+ +G KA A I +T + +P+ V E L L + + Sbjct: 57 YEGDAIIVSAVGARCGKAFRAKGQWSAIANTHIVWPEPQVVETEFLFLLLNDENFWE--- 113 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ + + +PPL EQ I KI + T + L+++ Sbjct: 114 --KGGSAQPFVKVRATFERTINLPPLPEQRRIVAKIDSLTGKSRRARDHLDHIPRLVEKY 171 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 KQA++S D + G + + R + + Sbjct: 172 KQAILSAAFR----ADWPLISVG--------ETIRAVVAGKNLRCEERPPFEHESGVVKV 219 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA-QVMERGIIT 317 + + + + G+++ + ++ + ++ Sbjct: 220 SAVSWGTFDARASKTLPESFTPPENTRIKAGDLLISRANTLELVGAVVIVLECPSNLFLS 279 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVPPIKE 374 + + D +L W +RS D +G + ++L +K + + P E Sbjct: 280 DKVLRLDVEDGDKPWLMWFLRSPDGRAAIEGAATGNQLSMRNLSQAALKSISMPWP-AAE 338 Query: 375 Q-FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q +I + I A I+ L + L+ S +A A G+ Sbjct: 339 QREEIVSRIESAFAWIECLAADAASARKLIDHLDQSMLAKAFKGE 383 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 65/207 (31%), Gaps = 14/207 (6%) Query: 24 HWKVVPIKRFTK-LNTGRT------SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W ++ + + + G+ + + + V GT Sbjct: 183 DWPLISVGETIRAVVAGKNLRCEERPPFEHESGVVKVSAVSWGTFDARASKTLPESFTPP 242 Query: 77 TVSIFAKGQILYGKLGP---YLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + G +L + I+ + S + L L +D L +L S Sbjct: 243 ENTRIKAGDLLISRANTLELVGAVVIVLECPSNLFLSDKVLRLDVEDGDKPWLMWFLRSP 302 Query: 132 DVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 D IE G +M + + +I MP P ++ I +I + I+ L + Sbjct: 303 DGRAAIEGAATGNQLSMRNLSQAALKSISMPWPAAEQREEIVSRIESAFAWIECLAADAA 362 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVK 216 +L+ Q++++ L P Sbjct: 363 SARKLIDHLDQSMLAKAFKGELVPQDP 389 >gi|226225493|ref|YP_002759599.1| putative type I restriction-modification system restriction subunit [Gemmatimonas aurantiaca T-27] gi|226088684|dbj|BAH37129.1| putative type I restriction-modification system restriction subunit [Gemmatimonas aurantiaca T-27] Length = 409 Score = 96.8 bits (239), Expect = 7e-18, Method: Composition-based stats. Identities = 45/403 (11%), Positives = 107/403 (26%), Gaps = 33/403 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + WK V + + T + + I + +D + Q ++ Sbjct: 22 EGWKSVTLGEVSTQVTEIVGDRKLTPVSISAGIGFVPQAEKFGRDISGNQ--YQRYTLVR 79 Query: 83 KGQILYGKLGPY---LRKAIIADFDG-------ICSTQFLVLQPKDVLPELLQGWLLSID 132 G ++ K + G + + Sbjct: 80 DGDFVFNKGNSLKFPQGCVYLLHGWGQVAAPSVFICFRLRDGYSNGFFQNCFEQNQHGRQ 139 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + I + + + + + +P P AEQ I E + + I + R + Sbjct: 140 LKRHITSGARSNGLLNISKETFFGVEIPTPTSAEQQKIAECLSSADEL----IAAQARKV 195 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + LK K+ L+ + + +++ G WE+K + K + Sbjct: 196 DALKTHKKGLMQQLFPREGETQPRLRFPDFRECGE----WELKAVGDVFEVTRGKVLAMT 251 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + S Y Y + + D N + Sbjct: 252 LVKEDASSDAPYPVYSSQTKSKGLAGYYSEYLY---RDAITWTTDGAN---AGDVNFRSG 305 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 T+ + + + + +G+ L ++++ + P Sbjct: 306 PFYCTNVCGVLVNTRGYANACVAALLNGVTRSHVSYVGN---PKLMNGVMEKIEIPFPSP 362 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +EQ I + + +D L+ + LK + + Sbjct: 363 QEQQRIAECL----SSLDALITAESDKLEALKHHKRGLMQQLF 401 >gi|23500571|ref|NP_700011.1| type I restriction-modification system, S subunit [Brucella suis 1330] gi|161620898|ref|YP_001594784.1| Type I restriction enzyme EcoR124II specificity protein [Brucella canis ATCC 23365] gi|254703172|ref|ZP_05165000.1| Type I restriction enzyme EcoR124II specificity protein [Brucella suis bv. 3 str. 686] gi|260567900|ref|ZP_05838369.1| type I restriction-modification system protein [Brucella suis bv. 4 str. 40] gi|261753794|ref|ZP_05997503.1| predicted protein [Brucella suis bv. 3 str. 686] gi|23464208|gb|AAN34016.1| type I restriction-modification system, S subunit [Brucella suis 1330] gi|161337709|gb|ABX64013.1| Type I restriction enzyme EcoR124II specificity protein [Brucella canis ATCC 23365] gi|260154565|gb|EEW89646.1| type I restriction-modification system protein [Brucella suis bv. 4 str. 40] gi|261743547|gb|EEY31473.1| predicted protein [Brucella suis bv. 3 str. 686] Length = 407 Score = 96.8 bits (239), Expect = 7e-18, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 121/416 (29%), Gaps = 38/416 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W+ V I + + + R S DI + + + +T Sbjct: 4 EVPKGWREVRIGQIAREISNRNHASA-DIPVLSMTKHRGFVRSNEYFSKSVHSENTRQYK 62 Query: 80 IFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---- 133 + +GQ Y + + G+ S + V + + Sbjct: 63 VVKRGQFAYATIHLDEGSIDYLRNEDAGLISPMYTVFETNSEEIDNEIALRQFKRFALSG 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + +PPL EQ I E + A I + I+ Sbjct: 123 RFDPYSNGGVNRRKSILFSDLSAFKFGLPPLTEQRAIAEVLGAAEAA----IAKTEALIK 178 Query: 194 LLKEKKQALV-SYIVTKGLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKL 251 +++ K+AL+ Y V + + W+ G P + Sbjct: 179 AIEQTKKALLKQYFVERQQSLLWSCVAKMGRWLSGGTPATA---------------AEEN 223 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETY-QIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + +I + +I + + E +V PG ++ + RS+ S Sbjct: 224 WKGSIPWVCPKDIKGPSISSTVDHISEDAAKALGMVGPGTLLLVVRGMIL-ARSVPSTIC 282 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 R A P+ + L + + L G + E + PV Sbjct: 283 TVRCAFNQDVKAFVPNEGVAPAFLKLWLDINEHKLLGEIETATHGT-KRFPLEHLNEFPV 341 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 V EQ + + R+ ++ L+ R + ++G+I L Sbjct: 342 PVVTRDEQIRLVTLAESSQERLRS----ERDNLSALRSVRDALAQELLSGRIRLPE 393 >gi|269793143|ref|YP_003318047.1| restriction modification system DNA specificity domain-containing protein [Thermanaerovibrio acidaminovorans DSM 6589] gi|269100778|gb|ACZ19765.1| restriction modification system DNA specificity domain protein [Thermanaerovibrio acidaminovorans DSM 6589] Length = 374 Score = 96.4 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 55/406 (13%), Positives = 123/406 (30%), Gaps = 54/406 (13%) Query: 22 PKHWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P K VP+K+ ++ TG + + G+ Y+ +++ ++ + Sbjct: 13 PSGVKYVPLKQIAEVGTGSSDRVNAVDDGEYPFYVRSKNILRSNRYLFDEEAIIIPGEGG 72 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 IF + + + + + + LS + + Sbjct: 73 IGDIFH----------------YVNGKYDLHQRAYRIHLIDPNVNTKFTYYCLSANFKKF 116 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I AT++ I N +P+PPL Q I + T L E + + Sbjct: 117 IIMKAVNATVTSIRKPMIENFQIPLPPLPVQQEIVRILDNFTELTAELTAELTAELTARR 176 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 ++ + +++T G +EW + + + K E Sbjct: 177 KQYEYYRDFLLTFG---------DEVEW---TTLGEVAINLDSKRKPVAKGKRKAGEYPY 224 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 S + S + +V + + + + Sbjct: 225 YGASGIVDYVDDYIFDGDYLLVSEDGANLV-------------ARVTPIAFSASGKIWVN 271 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 A++ D ++ + + DL + + + L E++ ++PV P +E+ Sbjct: 272 NHAHVLEFETYEDRKFIEYYLNMIDLSRFL---STAAQPKLTQENLNKIPVPAPSFEEKE 328 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA-AAVTG 417 I +++ A + L I I ++ R + VTG Sbjct: 329 RIVAILDRFDALCNDLTSGIPAEIEARQKQYEYYRDKLLTFKEVTG 374 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 59/188 (31%), Gaps = 9/188 (4%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + L+ EL K + ++ ++ + G P + I+ +F Sbjct: 1 MSKLDELIAELCPSGVKYVPLKQIAEVGTGSSDRVNAVDDGEYPFYVRSKNILRSNRYLF 60 Query: 294 ----RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-VFYA 348 I + + + + AY +T + S + K + Sbjct: 61 DEEAIIIPGEGGIGDIFHYVNGKYDLHQRAYRIHLIDPNVNTKFTYYCLSANFKKFIIMK 120 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE--- 405 + S++ ++ + +PP+ Q +I +++ T L ++ + ++ Sbjct: 121 AVNATVTSIRKPMIENFQIPLPPLPVQQEIVRILDNFTELTAELTAELTAELTARRKQYE 180 Query: 406 -RRSSFIA 412 R + Sbjct: 181 YYRDFLLT 188 >gi|227523729|ref|ZP_03953778.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus hilgardii ATCC 8290] gi|227089044|gb|EEI24356.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus hilgardii ATCC 8290] Length = 402 Score = 96.4 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 58/392 (14%), Positives = 127/392 (32%), Gaps = 42/392 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDII-----YIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + K + ++ YI D+ + + KY+ K + Sbjct: 18 WEQRKLGEGLKQLKSYSLPRKYEVPESDTEYIHYGDIHTSSRKYVDKSFRLPNIKSGDFQ 77 Query: 80 IFAKGQILYGKLGPYLRKAI-------IADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G I+ ++ I + + ++ K P LS Sbjct: 78 LLQTGDIVLADASEDYKEIAEPMLMKNIKGRKVVSGLHTIAIRLKCGDPVYYLYLFLSPG 137 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G + ++ + + +P EQ I + + I ++ + Sbjct: 138 FRHYVYKVGTGLKVFGINYDKVQKYFLAVPDEKEQKYIGKILFLTDQLIAANQSKLEQLK 197 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L K Q + + EW + + E ++N K Sbjct: 198 RLKKLLMQKIFNQ-----------------EWRFKGFTDPWEQRKLGEIFEERKENPKGQ 240 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +LS++ + I N S + Y++V +I + + + + + Sbjct: 241 TLKMLSVTINSGIVDANVLNRKDNSNSNKSNYKVVHANDIAYNSMRMWQGASGVSN---- 296 Query: 312 ERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPV 367 E GI++ AY +KP D + +L + + + F GL +LK++ +K + V Sbjct: 297 ELGIVSPAYTVLKPRVGLDVRFWGYLFKLTKMLQEFQKNSQGLTSDTWNLKYKQIKSIEV 356 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +P EQ I + ++D + + Sbjct: 357 TMPSKNEQNAI----SQLLQKLDFSIAANLRQ 384 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 66/211 (31%), Gaps = 17/211 (8%) Query: 213 PDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 P ++ K W +G + I +K Sbjct: 7 PKIRFKGFDDPWEQRKLGEGLKQLKSYSLPRKYEVPESDTEY-----IHYGDIHTSSRKY 61 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS---LRSAQVMERGIITSAYMAVKP 325 ++ L +Q++ G+IV + + L + + +A++ Sbjct: 62 VDKSFRLPNIKSGDFQLLQTGDIVLADASEDYKEIAEPMLMKNIKGRKVVSGLHTIAIRL 121 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 D Y +L S Y +G+GL + ++ V++ + VP KEQ I ++ Sbjct: 122 KCGDPVYYLYLFLSPGFRHYVYKVGTGLKVFGINYDKVQKYFLAVPDEKEQKYIGKILF- 180 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 D L+ + + LK + + Sbjct: 181 ---LTDQLIAANQSKLEQLKRLKKLLMQKIF 208 >gi|303250873|ref|ZP_07337066.1| hypothetical protein APP6_1998 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302650288|gb|EFL80451.1| hypothetical protein APP6_1998 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 481 Score = 96.4 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 53/429 (12%), Positives = 116/429 (27%), Gaps = 70/429 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W V ++ L GR + + E G Y GN + T + Sbjct: 70 EIPESWVWVRLEDIFHLQAGRFISASE-------IYGEYKEGLYPCYGGNGLRGFVKTYN 122 Query: 80 IFAKGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +G+ + G+ G A+ + +V++ L + + + Sbjct: 123 --REGKFPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYF---LIQLNLN 177 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I ++ +P+PPL EQ I KI I+ + + L ++ Sbjct: 178 QYATATAQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQF 237 Query: 199 K----QALVSYIVTKGLNPDVKM------------------------------------- 217 ++++ + L Sbjct: 238 PEQLKKSILQAAIQGKLTEQNPNDEPASALIERIKAEKLRLIAEKKLKKPKVISEIIMRD 297 Query: 218 -----------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + E +P+ W + ++ G Sbjct: 298 NLPYEIVNGKERCIADEVPFEIPESWVWVRLSEISKITMGQSPDNK----YLGKEGIEFH 353 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + ++ ES + Y + I L I ++ P Sbjct: 354 QGKSFFSEYIIESSDIYCSLPNKLATPNSILLCVRAPVGIVNITNRELCIGRGLASIDPI 413 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +++ +L + + Y +++ + + + +PP+ EQ I I Sbjct: 414 YVNTIFLYYALFCYKNY-YERKSTGSTFKAISKDIIDNTIIPIPPLNEQIRIVEKIETLF 472 Query: 387 ARIDVLVEK 395 + + L +K Sbjct: 473 STLQNLSQK 481 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 60/204 (29%), Gaps = 18/204 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + ++ +P+ W + + E +K + Sbjct: 63 TEQDFPFEIPESWVWVRLEDIFHLQAGRFISASEIYGEYKEGLYPCYGGNGLRGFVKTYN 122 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 E F I Q + + A + D+ + + + Sbjct: 123 REGK---------FPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYFLIQ 173 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +L + + + L + + + +PP+ EQ I I I+ + E+ Sbjct: 174 LNLNQY---ATATAQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQY-AEKEEK 229 Query: 400 IVLL-----KERRSSFIAAAVTGQ 418 + L ++ + S + AA+ G+ Sbjct: 230 LTALHQQFPEQLKKSILQAAIQGK 253 >gi|229553106|ref|ZP_04441831.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus rhamnosus LMS2-1] gi|229313603|gb|EEN79576.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus rhamnosus LMS2-1] Length = 386 Score = 96.4 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 57/402 (14%), Positives = 128/402 (31%), Gaps = 38/402 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + S I + + T + + +T ++ KG Sbjct: 11 WEKRKFGDLYSKTSEKNDGSFGPDKIISVATMSWKTNVRISSE-----DYLATYNVLRKG 65 Query: 85 QILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI--- 137 I + K + R DGI S F+V +PK + + + R Sbjct: 66 DIAFEGNKSKKFSFGRFVENDIGDGIVSHVFVVFRPKVSPIISYWKYFIHNEFVMRNILR 125 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 ++ + M++ + P EQ I + I ++ L++ Sbjct: 126 KSTIKATMMTNLSSHDFLRQTLCTPSFKEQENIGNFLERLDSL----IAATQGKLDNLEK 181 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K+AL+ ++ + + G + + ++ + N L Sbjct: 182 IKRALLKHLFDQSMRFRGYSDPWEKRKFGEL----------YKPNKERNESAEFSSENTL 231 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S++ + +K G S Y+++ G+I F + + GI++ Sbjct: 232 SIATMTVNRKGN----GAAKTSLLKYKVIRIGDIAFEGHTSKKFAFGRFVLNDVADGIMS 287 Query: 318 SAYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAMGS-GLRQS-LKFEDVKRLPVLVPPIK 373 + ++P ++ L + G + L D+ + + VP I Sbjct: 288 PRFTCLRPIHRQIIQFWKQYIHYEPILRPILIRSTKLGTMMNELVVPDLLKQNIRVPSIN 347 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I + +R+D L+ + + L+ + + + Sbjct: 348 EQKLIGKSL----SRVDDLIAATQSKLSSLETLKKALLQGLF 385 >gi|215489627|ref|YP_002332058.1| predicted type I restriction-modification enzyme S subunit [Escherichia coli O127:H6 str. E2348/69] gi|215267699|emb|CAS12157.1| predicted type I restriction-modification enzyme S subunit [Escherichia coli O127:H6 str. E2348/69] Length = 408 Score = 96.4 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 61/405 (15%), Positives = 130/405 (32%), Gaps = 56/405 (13%) Query: 26 KVVPIKRFT-KLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQ---SD 74 + + L TG + Y+ + ++++G +L K Sbjct: 17 EWKMLGEVIHSLKTGLNPRQNFSLNTLDAQGYYVTVREIQNGKVVFLDKTDRVNDRALKI 76 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVL-PELLQGWLLS 130 + S G IL+ G R A+I + + + + K+ + P L L S Sbjct: 77 INGRSNLEAGDILFSGTGTVGRIAVIEENPINWNIKEGVYTIKPIKEKIAPRFLSYLLQS 136 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDT 183 + + G + + + +PIP LA Q I + T Sbjct: 137 SKIVKDYSKKIVGNPVISLPMGDLKKLLIPIPCPDNPEKSLAIQSEIVRILDKFTALTAE 196 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALV 241 L E + + K++ +++ K+ +EW +G V + Sbjct: 197 LTAELTAELNMRKKQYNYYRDQLLS--------FKEGEVEWKTLGEV---------AVIG 239 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 T + + + + G KL ++ I+ G+ Sbjct: 240 TGNHDTQDAIEHGKYIFYARGREPLKLNVF-------DFDETAIITAGD--------GAG 284 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + + + AY V ++ ++ + +Y + A S SL+ Sbjct: 285 VGKVFHYAKGKYALHQRAYRIVPNAFMNPRFVYHYITAYFFTYIQKASVSSSVTSLRRPM 344 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + P+ VPP +EQ I +++ + + E + + I L +++ Sbjct: 345 FLKFPIPVPPSEEQARIVEILDKFDTLTNSITEGLPREIELRQKQ 389 >gi|301162175|emb|CBW21720.1| putative type I DNA restriction-modification [Bacteroides fragilis 638R] Length = 399 Score = 96.4 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 56/411 (13%), Positives = 122/411 (29%), Gaps = 43/411 (10%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + +L G S + G L N + I Sbjct: 7 KLGEILELQRGYDLPSS-----------QMKKGDILVAGSNGVIGYHNEARSNHP-CITV 54 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G+ G + +T V K P+ L +L ++ + + + + + Sbjct: 55 GRSGSVGKVHYYEQATWAHNTALFVKDFKGNDPKYLYYFLKNLHLDKMFDK--GSSVVPS 112 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 D K + ++ +P + I +ID I + L+ + L Y Sbjct: 113 LDRKVVHSLNVPCHKDIDCQKRIAAI---LSKIDRKIELNCAINQNLEAMAKQLYDYWFV 169 Query: 209 KGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELNRKN------TKLIE 253 + P+ K SG + V +P+ W++ + T + Sbjct: 170 QFDFPNEEGKPYKSSGGKMVWNEKLKREIPEGWDISLIKDIATTYSGGTPKSTNIEYYDN 229 Query: 254 SNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I ++ G + + T+ + + ++ I+ K SL + + Sbjct: 230 GEIAWINSGELNSPIITKTTNYITKCGLENSSAKLYPSNSILVAMYGATAGKVSLLTFE- 288 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 A V P + Y + S R ++ + +K + + +P Sbjct: 289 ---ACSNQAVCGVIPTIENMLYYVYFHISSLYSHFITLSTGSARDNISQDTIKNILLPIP 345 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +I + + + I + Q I L ++R + + GQ+ + Sbjct: 346 T----RNILKLFDEKIGSIYQTIVNNYQQIDSLTKQRDELLPLLMNGQVSV 392 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 68/211 (32%), Gaps = 14/211 (6%) Query: 10 YKDSG--VQWIG----AIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDV 56 YK SG + W IP+ W + IK +G T +S +I +I ++ Sbjct: 181 YKSSGGKMVWNEKLKREIPEGWDISLIKDIATTYSGGTPKSTNIEYYDNGEIAWINSGEL 240 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 S + S+ ++ IL G K + F+ + + P Sbjct: 241 NSPIITKTTNYITKCGLENSSAKLYPSNSILVAMYGATAGKVSLLTFEACSNQAVCGVIP 300 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + L + + + G+ + I NI +PIP L EKI + Sbjct: 301 -TIENMLYYVYFHISSLYSHFITLSTGSARDNISQDTIKNILLPIPTRNILKLFDEKIGS 359 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIV 207 I + + E L++ V Sbjct: 360 IYQTIVNNYQQIDSLTKQRDELLPLLMNGQV 390 >gi|162447451|ref|YP_001620583.1| type I site-specific restriction-modification system, S (specificity) subunit [Acholeplasma laidlawii PG-8A] gi|161985558|gb|ABX81207.1| type I site-specific restriction-modification system, S (specificity) subunit [Acholeplasma laidlawii PG-8A] Length = 419 Score = 96.4 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 69/424 (16%), Positives = 144/424 (33%), Gaps = 33/424 (7%) Query: 25 WKVVPIKRFTKL---NTGRTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W V + G+T +S K I+ + + V++ Y + D + Sbjct: 6 WSKVNLVDCLDKLIDYRGKTPAKSEKGILTLSAKSVKNSNIDYSE--AYTISEDEYKKFM 63 Query: 81 FA----KGQILYGKLGPYLRKAIIADFDGICSTQFL--VLQPKDVLPELLQGWLLSIDVT 134 KG IL P + A + + + L PK + + L +L S Sbjct: 64 VRGIPVKGDILITTEAPMGQVAKLDRDGVAVAQRLLTLRPNPKILDNDYLLYYLQSPIGQ 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++A G+T++ I + +PPL+EQ +I + +D I + + Sbjct: 124 AELKARESGSTVTGIKQAEFRKINIILPPLSEQKVIANIL----SSLDDKIELNNKINKN 179 Query: 195 LKEKKQALVSYIVTKGLNPDVK---MKDSGIE----WVGLVPDHWEVKPFFALVTELNRK 247 L+E Q L P+ + K SG E +GL+P W+V+ Sbjct: 180 LEELAQTLYKRWFVDFDFPNEEGESYKSSGGEMVESELGLIPKGWKVESIGRSSISKLIS 239 Query: 248 NTKL----IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI--DLQND 301 + + I + N+ + + K G + F + + Sbjct: 240 SGINEFNGTKKYIATADVTNLSIRSFVTEIDFKKRPSRANMQPIAGSLWFAKMKDSRKMI 299 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + S S+ ++++ I ++ + + + L Q++ E+ Sbjct: 300 RVSKSSSYLIDKCIFSTGFAGLFAPKYSNYIWTILTTKDFDDTKNNLCNGTTMQAINNEN 359 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + R+ +L+P ++ + I ++ E L + R + + G+I++ Sbjct: 360 INRIRILIPD----NKTLDLFESVSEPIFEKIQFNEIESNKLSKIRDELLPKLMNGEIEV 415 Query: 422 RGES 425 E Sbjct: 416 PIEE 419 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 72/209 (34%), Gaps = 13/209 (6%) Query: 8 PQYKDSGVQW----IGAIPKHWKVVPIKR--FTKLNTGRTSESGKDIIYIGLEDVESGTG 61 YK SG + +G IPK WKV I R +KL + +E YI DV + + Sbjct: 203 ESYKSSGGEMVESELGLIPKGWKVESIGRSSISKLISSGINEFNGTKKYIATADVTNLSI 262 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD------FDGICSTQFLVLQ 115 + + + ++ + G + + K+ + ++ I ST F L Sbjct: 263 RSFVTEIDFKKRPSRANMQPIAGSLWFAKMKDSRKMIRVSKSSSYLIDKCIFSTGFAGLF 322 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + + D +C G TM + + I I + IP L Sbjct: 323 APKYSNYIWTILT-TKDFDDTKNNLCNGTTMQAINNENINRIRILIPDNKTLDLFESVSE 381 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVS 204 +I E + ++ E L++ Sbjct: 382 PIFEKIQFNEIESNKLSKIRDELLPKLMN 410 >gi|124006764|ref|ZP_01691595.1| restriction endonuclease S subunits [Microscilla marina ATCC 23134] gi|123987672|gb|EAY27372.1| restriction endonuclease S subunits [Microscilla marina ATCC 23134] Length = 422 Score = 96.4 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 59/422 (13%), Positives = 117/422 (27%), Gaps = 37/422 (8%) Query: 26 KVVPIKRFTKLN-TGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K + + G T + + II + + + + Y S Sbjct: 3 KWRKLGDLVSYSGKGITPKYVDESSIIVLNQKCIRNHNIDYTLARYTDDTRSISQHKFLQ 62 Query: 83 KGQILYGKLG--PYLRKAII----ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G IL G R A + D I + L+L+ ++ +++ Sbjct: 63 TGDILVNSTGQGTAGRCAFVDKLPQDKKVITDSHILILRFQNHFEAKCLSYVIFSIEELV 122 Query: 137 IEAICEGATMSHADWKGIGNIPMPI-PPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + D + N+ + Q I + + +I + Sbjct: 123 QTFMDGSTGQGELDKVRLFNLMTSLTENKLYQKQIAKVLSDLDAKIALNNQINAELEAMA 182 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSG------IEWVGLVPDHWEVKPFFALVTEL----- 244 K N K SG E VP+ WEVK + Sbjct: 183 KLIYDYWFVQFDFPDAN-GKPYKSSGGKMVYNEELKREVPEGWEVKKISSFAKTSSGGTP 241 Query: 245 -NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQN 300 K NI ++ G + Q + + + ++ G I+ Sbjct: 242 LRSKKEYYHNGNIPWINSGELNQPFIVSSQKFITKEGLNNSSAKVFKKGTILIAMYGATA 301 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 K S + A A+ H YL + + + R +L + Sbjct: 302 GKVSFMDIE----ACTNQAICAIDTHSNLRVYLKLGLETL-YDYLVTLSSGSARDNLSQD 356 Query: 361 DVKRLPVLVPPIKEQFDITNVINVET-ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +K L ++P + + T A ++ ++ + + L R + + GQ+ Sbjct: 357 KIKELKFVIPN----EKLLQQFDKFTKAPLNNILANL-KQNQQLTSLRDWLLPMLMNGQV 411 Query: 420 DL 421 + Sbjct: 412 SV 413 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 65/211 (30%), Gaps = 15/211 (7%) Query: 10 YKDSG------VQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDV 56 YK SG + +P+ W+V I F K ++G T K +I +I ++ Sbjct: 203 YKSSGGKMVYNEELKREVPEGWEVKKISSFAKTSSGGTPLRSKKEYYHNGNIPWINSGEL 262 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 K + S+ +F KG IL G K D + + + Sbjct: 263 NQPFIVSSQKFITKEGLNNSSAKVFKKGTILIAMYGATAGKVSFMDIEACTNQAICAIDT 322 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 L + L + + + G+ + I + IP + A Sbjct: 323 HSNL--RVYLKLGLETLYDYLVTLSSGSARDNLSQDKIKELKFVIPNEKLLQQFDKFTKA 380 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIV 207 I + + + L L++ V Sbjct: 381 PLNNILANLKQNQQLTSLRDWLLPMLMNGQV 411 >gi|62317327|ref|YP_223180.1| HsdS restriction-modification system, S subunit [Brucella abortus bv. 1 str. 9-941] gi|62197520|gb|AAX75819.1| HsdS, type I restriction-modification system, S subunit [Brucella abortus bv. 1 str. 9-941] Length = 407 Score = 96.4 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 121/416 (29%), Gaps = 38/416 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W+ V I + + + R S DI + + + +T Sbjct: 4 EVPKGWREVRIGQIAREISNRNHASA-DIPVLSMTKHRGFVRSNEYFSKSVHSENTRQYK 62 Query: 80 IFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---- 133 + +GQ Y + + G+ S + V + + Sbjct: 63 VVKRGQFAYATIHLDEGSIDYLRNEDVGLISPMYTVFETNSEEIDNEIALRQFKRFALSG 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + +PPL EQ I E + A I + I+ Sbjct: 123 RFDPYSNGGVNRRKSILFSDLSAFKFGLPPLTEQRAIAEVLGAAEAA----IAKTEALIK 178 Query: 194 LLKEKKQALV-SYIVTKGLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKL 251 +++ K+AL+ Y V + + W+ G P + Sbjct: 179 AIEQTKKALLKQYFVERQQSLLWSCVAKMGRWLSGGTPATA---------------AEEN 223 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETY-QIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + +I + +I + + E +V PG ++ + RS+ S Sbjct: 224 WKGSIPWVCPKDIKGPSISSTVDHISEDAAKALGMVGPGTLLLVVRGMIL-ARSVPSTIC 282 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 R A P+ + L + + L G + E + PV Sbjct: 283 TVRCAFNQDVKAFVPNEGVAPAFLKLWLDINEHKLLGEIETATHGT-KRFPLERLNEFPV 341 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 V EQ + + R+ ++ L+ R + ++G+I L Sbjct: 342 PVVTRDEQIRLVTLAESSQERLRS----ERDNLSALRSVRDALAQELLSGRIRLPE 393 >gi|260361455|ref|ZP_05774514.1| restriction endonuclease, S subunit [Vibrio parahaemolyticus K5030] gi|260878068|ref|ZP_05890423.1| restriction endonuclease, S subunit [Vibrio parahaemolyticus AN-5034] gi|260896963|ref|ZP_05905459.1| restriction endonuclease, S subunit [Vibrio parahaemolyticus Peru-466] gi|308088719|gb|EFO38414.1| restriction endonuclease, S subunit [Vibrio parahaemolyticus Peru-466] gi|308090038|gb|EFO39733.1| restriction endonuclease, S subunit [Vibrio parahaemolyticus AN-5034] gi|308111011|gb|EFO48551.1| restriction endonuclease, S subunit [Vibrio parahaemolyticus K5030] Length = 590 Score = 96.4 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 57/479 (11%), Positives = 128/479 (26%), Gaps = 97/479 (20%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLP-KDGNSRQS 73 P HW+ + + + + G+ G D +Y+ + D+++ + + + Sbjct: 104 PLHWETICVGQVAHVLGGKRVPKGYKLSEQPTDFVYLRVTDMKNQSIDESDLRYISEEVF 163 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSID 132 + G + G I S T+ L + +L Sbjct: 164 KQISRYTINTGDVYVTIAGTIGAVGTIPPHLDGMSLTENAAKLVFSGLSKKYLVTVLQSS 223 Query: 133 V-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR----------- 180 T++ I + +PIPPL EQ I +K+ Sbjct: 224 FVTRQFNDAVNQMAQPKLSLNSIKHTCIPIPPLEEQEYIADKVDELMALCDQLEQQTEAS 283 Query: 181 ------------------------------IDTLITERIRFIELLKEKKQALVSYIVTKG 210 I E + + KQ ++ V Sbjct: 284 IEAHQVLVTTLLDTLTNSADADELMQNWARISEHFDTLFTTEESIDQLKQTILQLAVMGK 343 Query: 211 LNPDVKMKDSGIEWV-------------------------------GLVPDHWEVKPFFA 239 L P + E + +P WE Sbjct: 344 LVPQDPSDEPAAELLKRIAEEKAQLVKEKKIKKQKALPPIAEDEKPFELPSGWEWCRLDD 403 Query: 240 LVTELNR-----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ------IVDP 288 + + K I L NI ++ + + ++ ++ P Sbjct: 404 ICFGITSGSTPPKVNFNESEGIPYLKVYNIREQKIDFEYKPQFVDNDCHKTKLARSVLYP 463 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G++V + K ++ E + + Y+ + + Sbjct: 464 GDVVMNIVGPPLGKIAIIPDTYPEWNCNQAITFFRPIVPQLNKYIYTYLTAGSFLDSIEL 523 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +G+ + ++ + + + PP++EQ I N ++ + L ++ + +E + Sbjct: 524 IGTAGQDNISVTKSRSILLPTPPLREQKRIVNKVHELFLLCNSLKMRLRKR----QELK 578 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 68/201 (33%), Gaps = 14/201 (6%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W+ + + +G T + I Y+ + ++ + K Sbjct: 391 ELPSGWEWCRLDDICFGITSGSTPPKVNFNESEGIPYLKVYNIREQKIDFEYKPQFVDND 450 Query: 74 DTSTV---SIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPK-DVLPELLQG 126 T S+ G ++ +GP L K I + C+ +P L + + Sbjct: 451 CHKTKLARSVLYPGDVVMNIVGPPLGKIAIIPDTYPEWNCNQAITFFRPIVPQLNKYIYT 510 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L + IE I A + +I +P PPL EQ I K+ + ++L Sbjct: 511 YLTAGSFLDSIELIGT-AGQDNISVTKSRSILLPTPPLREQKRIVNKVHELFLLCNSLKM 569 Query: 187 ERIRFIELLKEKKQALVSYIV 207 + EL +V V Sbjct: 570 RLRKRQELKLCITDTIVEQAV 590 >gi|225629307|ref|ZP_03787340.1| Type I restriction enzyme specificity protein [Brucella ceti str. Cudo] gi|260167613|ref|ZP_05754424.1| type I restriction-modification system, S subunit [Brucella sp. F5/99] gi|261757036|ref|ZP_06000745.1| type I restriction-modification system protein [Brucella sp. F5/99] gi|225615803|gb|EEH12852.1| Type I restriction enzyme specificity protein [Brucella ceti str. Cudo] gi|261737020|gb|EEY25016.1| type I restriction-modification system protein [Brucella sp. F5/99] Length = 407 Score = 96.4 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 121/416 (29%), Gaps = 38/416 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W+ V I + + + R S DI + + + +T Sbjct: 4 EVPKGWREVRIGQIAREISNRNHASA-DIPVLSMTKHRGFVRSNEYFSKSVHSENTRQYK 62 Query: 80 IFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---- 133 + +GQ Y + + G+ S + V + + Sbjct: 63 VVKRGQFAYATIHLDEGSIDYLRNEDAGLISPMYTVFETNSKEIDNEIALRQFKRFALSG 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + +PPL EQ I E + A I + I+ Sbjct: 123 RFDPYSNGGVNRRKSILFSDLSAFKFGLPPLTEQRAIAEVLGAAEAA----IAKTEALIK 178 Query: 194 LLKEKKQALV-SYIVTKGLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKL 251 +++ K+AL+ Y V + + W+ G P + Sbjct: 179 AIEQTKKALLKQYFVERQQSLLWSCVAKMGRWLSGGTPATA---------------AEEN 223 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETY-QIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + +I + +I + + E +V PG ++ + RS+ S Sbjct: 224 WKGSIPWVCPKDIKGPSISSTVDHISEDAAKALGMVGPGTLLLVVRGMIL-ARSVPSTIC 282 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 R A P+ + L + + L G + E + PV Sbjct: 283 TVRCAFNQDVKAFVPNEGVAPAFLKLWLDINEHKLLGEIETATHGT-KRFPLERLNEFPV 341 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 V EQ + + R+ ++ L+ R + ++G+I L Sbjct: 342 PVVTRDEQIRLVTLAESSQERLRS----ERDNLSALRSVRDALAQELLSGRIRLPE 393 >gi|159026888|emb|CAO89139.1| hsdS [Microcystis aeruginosa PCC 7806] Length = 510 Score = 96.4 bits (238), Expect = 8e-18, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 60/159 (37%), Gaps = 3/159 (1%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI--ITSAY 320 + R + K + G+I+ + + + +++ + + Sbjct: 64 GFFKDQSDRFLTFKKSIELNCTYLQKGDILVARLPDPLGRACIFPLSGIKKFVTVVDVCI 123 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + I+S YL +L+ S SG R+ + ++ ++ + P+ EQ I Sbjct: 124 IRNNSNFINSQYLLYLINSPQTRLEVDKYKSGSTRKRISRKNFAKIQFPIAPLPEQHRIV 183 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I + +D V +++ + LK R + + A G+ Sbjct: 184 EKIEELFSELDNGVASLKKVLEQLKTYRQAVLKWAFEGK 222 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 70/492 (14%), Positives = 136/492 (27%), Gaps = 93/492 (18%) Query: 20 AIPKHWKVVPIKRFT---------KLNTGRTSESGKDIIYIGLEDVESGTGKY-LPKDGN 69 +P W IK + + ++ I L D+ G K + Sbjct: 16 DLPPGWTKSAIKELIGHDGIFCDGDWVESKDQDPNGEVRLIQLADIGDGFFKDQSDRFLT 75 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI------CSTQFLVLQPKDVLPEL 123 ++S + KG IL +L L +A I GI + + + Sbjct: 76 FKKSIELNCTYLQKGDILVARLPDPLGRACIFPLSGIKKFVTVVDVCIIRNNSNFINSQY 135 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK---------- 173 L + S ++ G+T K I PI PL EQ I EK Sbjct: 136 LLYLINSPQTRLEVDKYKSGSTRKRISRKNFAKIQFPIAPLPEQHRIVEKIEELFSELDN 195 Query: 174 ---------------------------------------IIAETVRIDTLITERIRFIEL 194 + ++ + ER R + Sbjct: 196 GVASLKKVLEQLKTYRQAVLKWAFEGKLTEKWRNTHQDSLEDADTLLEQIKAERKRHYQQ 255 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKD-----------SGIEWVGLVPDHWEVKPFFALVTE 243 E + + G +K + + +PD W L++ Sbjct: 256 QLEDWKQALKEWENNGKETKKPIKPQQPKDLPPLTKEELSNLPSLPDGWMWVKVDYLLSL 315 Query: 244 LNR-----------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDP 288 + K ++ S I L NI + + + ++ V Sbjct: 316 DKKGMTTGPFGTLLKKSEHQISGIPVLGIENIGNGVFLPKNKIFITEKKARELSSFEVSG 375 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFY 347 G+I+ + + +++ + I + +L + + Sbjct: 376 GDIIISRSGTVGEICLVPDYFGYSLISTNLIRISLNKNIIIPKFFVFLFLGGGSVREQVK 435 Query: 348 AMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + G R L ++ + P ++EQ I I + D L + +++ + Sbjct: 436 ELCKGSTRDFLNQTILQTIIFPFPSLQEQTQIVQEIESRLSVCDQLEATLTENLDKAEAL 495 Query: 407 RSSFIAAAVTGQ 418 R S + A G+ Sbjct: 496 RQSILKRAFEGK 507 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 80/214 (37%), Gaps = 22/214 (10%) Query: 18 IGAIPKHWKVVPIKRFTKL-NTG-----------RTSESGKDIIYIGLEDVESGTGKYLP 65 + ++P W V + L G ++ I +G+E++ G G +LP Sbjct: 297 LPSLPDGWMWVKVDYLLSLDKKGMTTGPFGTLLKKSEHQISGIPVLGIENI--GNGVFLP 354 Query: 66 KDGNSRQSDTS---TVSIFAKGQILYGKLGPYLRKAIIADFDGI--CSTQFLVLQPKDVL 120 K+ + + + G I+ + G ++ D+ G ST + + + Sbjct: 355 KNKIFITEKKARELSSFEVSGGDIIISRSGTVGEICLVPDYFGYSLISTNLIRISLNKNI 414 Query: 121 PELLQGWLLSI---DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L + V ++++ +C+G+T + + I P P L EQ I ++I + Sbjct: 415 IIPKFFVFLFLGGGSVREQVKELCKGSTRDFLNQTILQTIIFPFPSLQEQTQIVQEIESR 474 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 D L ++ + +Q+++ L Sbjct: 475 LSVCDQLEATLTENLDKAEALRQSILKRAFEGKL 508 >gi|83269308|ref|YP_418599.1| type I restriction-modification system, S subunit [Brucella melitensis biovar Abortus 2308] gi|148558787|ref|YP_001257780.1| type I restriction-modification system, S subunit [Brucella ovis ATCC 25840] gi|189022582|ref|YP_001932323.1| type I restriction-modification system, S subunit [Brucella abortus S19] gi|237816882|ref|ZP_04595874.1| Type I restriction enzyme EcoR124II specificity protein [Brucella abortus str. 2308 A] gi|254690827|ref|ZP_05154081.1| type I restriction-modification system, S subunit [Brucella abortus bv. 6 str. 870] gi|254695865|ref|ZP_05157693.1| type I restriction-modification system, S subunit [Brucella abortus bv. 3 str. Tulya] gi|254698608|ref|ZP_05160436.1| type I restriction-modification system, S subunit [Brucella abortus bv. 2 str. 86/8/59] gi|254700052|ref|ZP_05161880.1| type I restriction-modification system, S subunit [Brucella suis bv. 5 str. 513] gi|254705682|ref|ZP_05167510.1| type I restriction-modification system, S subunit [Brucella pinnipedialis M163/99/10] gi|254710913|ref|ZP_05172724.1| type I restriction-modification system, S subunit [Brucella pinnipedialis B2/94] gi|254712614|ref|ZP_05174425.1| type I restriction-modification system, S subunit [Brucella ceti M644/93/1] gi|254715685|ref|ZP_05177496.1| type I restriction-modification system, S subunit [Brucella ceti M13/05/1] gi|254732055|ref|ZP_05190633.1| type I restriction-modification system, S subunit [Brucella abortus bv. 4 str. 292] gi|256015605|ref|YP_003105614.1| type I restriction-modification system, S subunit [Brucella microti CCM 4915] gi|256029297|ref|ZP_05442911.1| type I restriction-modification system, S subunit [Brucella pinnipedialis M292/94/1] gi|256058985|ref|ZP_05449196.1| type I restriction-modification system, S subunit [Brucella neotomae 5K33] gi|256157492|ref|ZP_05455410.1| type I restriction-modification system, S subunit [Brucella ceti M490/95/1] gi|256253531|ref|ZP_05459067.1| type I restriction-modification system, S subunit [Brucella ceti B1/94] gi|256256009|ref|ZP_05461545.1| type I restriction-modification system, S subunit [Brucella abortus bv. 9 str. C68] gi|260544564|ref|ZP_05820385.1| type I restriction-modification system protein [Brucella abortus NCTC 8038] gi|260756405|ref|ZP_05868753.1| predicted protein [Brucella abortus bv. 6 str. 870] gi|260759837|ref|ZP_05872185.1| predicted protein [Brucella abortus bv. 4 str. 292] gi|260763076|ref|ZP_05875408.1| predicted protein [Brucella abortus bv. 2 str. 86/8/59] gi|260882229|ref|ZP_05893843.1| predicted protein [Brucella abortus bv. 9 str. C68] gi|261216285|ref|ZP_05930566.1| predicted protein [Brucella abortus bv. 3 str. Tulya] gi|261217434|ref|ZP_05931715.1| predicted protein [Brucella ceti M13/05/1] gi|261220661|ref|ZP_05934942.1| predicted protein [Brucella ceti B1/94] gi|261313102|ref|ZP_05952299.1| predicted protein [Brucella pinnipedialis M163/99/10] gi|261318496|ref|ZP_05957693.1| predicted protein [Brucella pinnipedialis B2/94] gi|261320308|ref|ZP_05959505.1| predicted protein [Brucella ceti M644/93/1] gi|261322929|ref|ZP_05962126.1| predicted protein [Brucella neotomae 5K33] gi|261750535|ref|ZP_05994244.1| predicted protein [Brucella suis bv. 5 str. 513] gi|265986294|ref|ZP_06098851.1| predicted protein [Brucella pinnipedialis M292/94/1] gi|265995989|ref|ZP_06108546.1| predicted protein [Brucella ceti M490/95/1] gi|294853393|ref|ZP_06794065.1| type I restriction enzyme [Brucella sp. NVSL 07-0026] gi|297249368|ref|ZP_06933069.1| type I restriction enzyme, S subunit [Brucella abortus bv. 5 str. B3196] gi|82939582|emb|CAJ12562.1| type I restriction-modification system, S subunit [Brucella melitensis biovar Abortus 2308] gi|148370072|gb|ABQ62944.1| type I restriction-modification system, S subunit [Brucella ovis ATCC 25840] gi|189021156|gb|ACD73877.1| type I restriction-modification system, S subunit [Brucella abortus S19] gi|237787695|gb|EEP61911.1| Type I restriction enzyme EcoR124II specificity protein [Brucella abortus str. 2308 A] gi|255998265|gb|ACU49952.1| type I restriction-modification system, S subunit [Brucella microti CCM 4915] gi|260097835|gb|EEW81709.1| type I restriction-modification system protein [Brucella abortus NCTC 8038] gi|260670155|gb|EEX57095.1| predicted protein [Brucella abortus bv. 4 str. 292] gi|260673497|gb|EEX60318.1| predicted protein [Brucella abortus bv. 2 str. 86/8/59] gi|260676513|gb|EEX63334.1| predicted protein [Brucella abortus bv. 6 str. 870] gi|260871757|gb|EEX78826.1| predicted protein [Brucella abortus bv. 9 str. C68] gi|260917892|gb|EEX84753.1| predicted protein [Brucella abortus bv. 3 str. Tulya] gi|260919245|gb|EEX85898.1| predicted protein [Brucella ceti B1/94] gi|260922523|gb|EEX89091.1| predicted protein [Brucella ceti M13/05/1] gi|261292998|gb|EEX96494.1| predicted protein [Brucella ceti M644/93/1] gi|261297719|gb|EEY01216.1| predicted protein [Brucella pinnipedialis B2/94] gi|261298909|gb|EEY02406.1| predicted protein [Brucella neotomae 5K33] gi|261302128|gb|EEY05625.1| predicted protein [Brucella pinnipedialis M163/99/10] gi|261740288|gb|EEY28214.1| predicted protein [Brucella suis bv. 5 str. 513] gi|262550286|gb|EEZ06447.1| predicted protein [Brucella ceti M490/95/1] gi|264658491|gb|EEZ28752.1| predicted protein [Brucella pinnipedialis M292/94/1] gi|294819048|gb|EFG36048.1| type I restriction enzyme [Brucella sp. NVSL 07-0026] gi|297173237|gb|EFH32601.1| type I restriction enzyme, S subunit [Brucella abortus bv. 5 str. B3196] Length = 407 Score = 96.4 bits (238), Expect = 9e-18, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 121/416 (29%), Gaps = 38/416 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W+ V I + + + R S DI + + + +T Sbjct: 4 EVPKGWREVRIGQIAREISNRNHASA-DIPVLSMTKHRGFVRSNEYFSKSVHSENTRQYK 62 Query: 80 IFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---- 133 + +GQ Y + + G+ S + V + + Sbjct: 63 VVKRGQFAYATIHLDEGSIDYLRNEDAGLISPMYTVFETNSEEIDNEIALRQFKRFALSG 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + +PPL EQ I E + A I + I+ Sbjct: 123 RFDPYSNGGVNRRKSILFSDLSAFKFGLPPLTEQRAIAEVLGAAEAA----IAKTEALIK 178 Query: 194 LLKEKKQALV-SYIVTKGLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKL 251 +++ K+AL+ Y V + + W+ G P + Sbjct: 179 AIEQTKKALLKQYFVERQQSLLWSCVAKMGRWLSGGTPATA---------------AEEN 223 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETY-QIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + +I + +I + + E +V PG ++ + RS+ S Sbjct: 224 WKGSIPWVCPKDIKGPSISSTVDHISEDAAKALGMVGPGTLLLVVRGMIL-ARSVPSTIC 282 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 R A P+ + L + + L G + E + PV Sbjct: 283 TVRCAFNQDVKAFVPNEGVAPAFLKLWLDINEHKLLGEIETATHGT-KRFPLERLNEFPV 341 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 V EQ + + R+ ++ L+ R + ++G+I L Sbjct: 342 PVVTRDEQIRLVTLAESSQERLRS----ERDNLSALRSVRDALAQELLSGRIRLPE 393 >gi|269978362|gb|ACZ55915.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 408 Score = 96.4 bits (238), Expect = 9e-18, Method: Composition-based stats. Identities = 60/397 (15%), Positives = 124/397 (31%), Gaps = 28/397 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + G++ K + + + + G + +R + Sbjct: 17 EFRKLGEVCDFQKGKSITK-KAVTFGKVPVISGGRQPAYYHNEANRSGE----------T 65 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I G Y D + F V PK + I A Sbjct: 66 IAISSSGVYAGYVSYWDIPVFLADSFSVS-PKQKTLMPKYLFHYLTTQQDAIHATKSAGG 124 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + H K + N +PIPPL Q I + + A T L TE + + + L+ + Sbjct: 125 IPHVYSKDLQNFLIPIPPLEIQQEIVKILDAFTELNTELNTELKARKKQYEYYQNMLLDF 184 Query: 206 IVTKGLNPDVKM------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + D KM K L P E + + N+K K+ E + + Sbjct: 185 NDINQNHKDAKMSAKPYPKRLKTLLQTLAPKGVEFRKLGDVCESTNKKTLKISEVSEVKN 244 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + G + + GE + + + G + Sbjct: 245 KGMYPVINSGRDLYGYYHDFN------NDGENITIASRGEYAGFINYFNEKFFAGGLCYP 298 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y + + + +L + +++ ++ + + G +L D++ L + +PP++ Q +I Sbjct: 299 YKVKDTNELLTKFLYFYLKTNEIQIMENLVFRGSIPALNKADIETLTIPIPPLEIQQEIV 358 Query: 380 NVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +++ + L+ I I K+ R + Sbjct: 359 KILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 395 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 21/161 (13%), Positives = 49/161 (30%), Gaps = 15/161 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 PK + + + +T + + ++ G+ V + + + Sbjct: 214 PKGVEFRKLGDVCESTNKKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFNNDGEN--- 270 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQR 136 I G Y + V ++L + L +L + ++ Sbjct: 271 ------ITIASRGEYAGFINYFNEKFFAGGLCYPYKVKDTNELLTKFLYFYLKTNEIQIM 324 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + G ++ + I + +PIPPL Q I + + Sbjct: 325 ENLVFRG-SIPALNKADIETLTIPIPPLEIQQEIVKILDQF 364 >gi|15964351|ref|NP_384704.1| putative specificity protein S [Sinorhizobium meliloti 1021] gi|15073528|emb|CAC45170.1| Putative specificity protein S [Sinorhizobium meliloti 1021] Length = 424 Score = 96.0 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 64/424 (15%), Positives = 142/424 (33%), Gaps = 38/424 (8%) Query: 23 KHWKVVPIKRFT-----KLNTGRTSE-------SGKDIIYIGLEDVESGT-GKYLPKDGN 69 K W + + ++ TG S + +D+ G ++ Sbjct: 2 KEWTETTLGQLCDDGGGEIKTGPFGSQLHQSDYSSDGTPVVMPKDILEGRLSEFSVARVG 61 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKD--VLPELLQ 125 S G I+YG+ G R A+I + + +C T L + + P+ L Sbjct: 62 SEHVQRLAQHQLQSGDIVYGRRGDIGRCALITERETGWLCGTGCLRISLGQGAIEPKFLF 121 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L++ I GATM + + + +I + P + Q I + A I Sbjct: 122 YFLINPVTVSWIYNQAVGATMPNLNTGILRSITVRYPDILTQRRIAGILSAYDDL----I 177 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 R I +L++ + L + P + +G+VP W F V Sbjct: 178 EVNQRRIAILEDMARRLFDEWFVRFRYPGHEAVPLVETELGMVPVGWTPGTFRECVDVNP 237 Query: 246 ---RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + + ++ ++ + M +++ G++++ + Sbjct: 238 ETLSPRKAPAHIHYIDIASVSVGRVDAVTTMKFSEAPGRARRVIRNGDVIWSTVRPNRRS 297 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFED 361 +L V + ++ + A++ D ++ R+ G ++ D Sbjct: 298 HALL-LDVASDTVASTGFAALRSRNSDWAWVYEATRTDAFVGFLVGRARGSAYPAVVGAD 356 Query: 362 VKRLPVLVPPIKE----QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + +P++VPP+ Q + + L + + L+ R + ++G Sbjct: 357 FEDVPLIVPPLDLRSTFQIQVG--------PMHELASTLHRQNNKLRAARDLLLPKLISG 408 Query: 418 QIDL 421 +ID+ Sbjct: 409 EIDV 412 Score = 36.3 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 58/192 (30%), Gaps = 6/192 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G +P W + +N S I YI + V G ++ Sbjct: 217 LGMVPVGWTPGTFRECVDVNPETLSPRKAPAHIHYIDIASVSVGRVD-AVTTMKFSEAPG 275 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSID 132 + G +++ + P R + + ST F L+ ++ + + Sbjct: 276 RARRVIRNGDVIWSTVRPNRRSHALLLDVASDTVASTGFAALRSRNSDWAWVYEATRTDA 335 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + G+ ++P+ +PPL + + ++ TL + + Sbjct: 336 FVGFLVGRARGSAYPAVVGADFEDVPLIVPPLDLRSTFQIQVGPMHELASTLHRQNNKLR 395 Query: 193 ELLKEKKQALVS 204 L+S Sbjct: 396 AARDLLLPKLIS 407 >gi|320155756|ref|YP_004188135.1| type I restriction-modification system, DNA-methyltransferase subunit M [Vibrio vulnificus MO6-24/O] gi|319931068|gb|ADV85932.1| type I restriction-modification system, DNA-methyltransferase subunit M [Vibrio vulnificus MO6-24/O] Length = 590 Score = 96.0 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 57/474 (12%), Positives = 127/474 (26%), Gaps = 93/474 (19%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLP-KDGNSRQS 73 P HW+ + + + + G+ G D +Y+ + D+++ + + + Sbjct: 104 PLHWETICVGQVAHVLGGKRVPKGYKLSEQPTDFVYLRVTDMKNQSIDESDLRYISEEVF 163 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSID 132 + G + G I S T+ L + +L Sbjct: 164 KQISRYTINTGDVYVTIAGTIGAVGTIPPHLDGMSLTENAAKLVFSGLSKKYLVTVLQSS 223 Query: 133 V-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR----------- 180 T++ I + +PIPPL EQ I +K+ Sbjct: 224 FVTRQFNDAVNQMAQPKLSLNSIKHTCIPIPPLEEQEYIADKVDELMALCDQLEQQTEAS 283 Query: 181 ------------------------------IDTLITERIRFIELLKEKKQALVSYIVTKG 210 I E + + KQ ++ V Sbjct: 284 IEAHQVLVTTLLDTLTNSADADELMQNWARISEHFDTLFTTEESIDQLKQTILQLAVMGK 343 Query: 211 LNPDVKMKD-------------------------------SGIEWVGLVPDHWEVKPFFA 239 L P + S E +P+ W+ Sbjct: 344 LVPQDPSDEPAAELLKRIAEEKAQLVKEKKIKKQKELPPISEDEKPFELPNGWKWCRLDD 403 Query: 240 LVTELNR-----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ------IVDP 288 + + K I L NI ++ + + ++ ++ P Sbjct: 404 ICFGITSGSTPPKVNFNESEGIPYLKVYNIREQKIDFEYKPQFVDNDCHKTKLARSVLYP 463 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G++V + K ++ E + + ++ + + Sbjct: 464 GDVVMNIVGPPLGKIAIIPDTYPEWNCNQAITFFRPIVPQLNKFIYTYLTAGSFLNSIEL 523 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 +G+ + ++ + + + PP+KEQ I N ++ + L ++ + L Sbjct: 524 IGTAGQDNISVTKSRSILLPTPPLKEQRRIVNKVHELFLLCNSLKMRLRERHEL 577 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 67/201 (33%), Gaps = 14/201 (6%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P WK + + +G T + I Y+ + ++ + K Sbjct: 391 ELPNGWKWCRLDDICFGITSGSTPPKVNFNESEGIPYLKVYNIREQKIDFEYKPQFVDND 450 Query: 74 DTSTV---SIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPK-DVLPELLQG 126 T S+ G ++ +GP L K I + C+ +P L + + Sbjct: 451 CHKTKLARSVLYPGDVVMNIVGPPLGKIAIIPDTYPEWNCNQAITFFRPIVPQLNKFIYT 510 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L + IE I A + +I +P PPL EQ I K+ + ++L Sbjct: 511 YLTAGSFLNSIELIGT-AGQDNISVTKSRSILLPTPPLKEQRRIVNKVHELFLLCNSLKM 569 Query: 187 ERIRFIELLKEKKQALVSYIV 207 EL +V V Sbjct: 570 RLRERHELKLCITDTIVERAV 590 >gi|295397610|ref|ZP_06807686.1| restriction endonuclease S subunits family protein [Aerococcus viridans ATCC 11563] gi|294974148|gb|EFG49899.1| restriction endonuclease S subunits family protein [Aerococcus viridans ATCC 11563] Length = 402 Score = 96.0 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 56/396 (14%), Positives = 130/396 (32%), Gaps = 25/396 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ ++ ++N + Y+ LE V+ Y + + Sbjct: 21 DWEQRRLENVVEINPSSNLPNS--FHYVDLESVKGTELIYSRIEYRDTAPS-RAKRLARN 77 Query: 84 GQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G + + + PY + + + +G + ST + ++P E L +L + ++ Sbjct: 78 GDVFFQLVRPYQKNNYLFNLEGKNYVFSTGYAQMRPSIS-SEYLINYLTTDKFIFQVLNR 136 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G++ + + I + IP + KI I+ IT R ++ L + K+ Sbjct: 137 STGSSYPAINSTDLIKIKIAIPQNELESF---KIGRILELINQTITLHQRKLDQLNQLKE 193 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 +L+ + K++ +G E G + ++ N Sbjct: 194 SLLQQMFPGKGETVPKLRFAGFE--GEWEERKLGDILSERNDQIPETNEY--PLMSFVQG 249 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G + L +S + Y+ + G+ ++ +L+ + +I+ Y Sbjct: 250 KGVTPKGERYNRSFLVKDSEKKYKKTELGDFIYSSNNLETGSIG---FNKTGKAVISPVY 306 Query: 321 MAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQF 376 DS ++ L + G+ + + D + + +P KE+ Sbjct: 307 CIFNSKKAKDSQFIGILSARKEFISEMVRFRQGVVYGQWRIHESDFLNINIRIPNDKEKQ 366 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I ID + ++ + LK + + Sbjct: 367 LII----YLFENIDNTLVLYQRKLDQLKNMKQILLQ 398 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 66/189 (34%), Gaps = 6/189 (3%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + V E+N + + + L + + +R ++ G+ Sbjct: 20 DDWEQRRLENVVEINPSSNLPNSFHYVDLESVKGTELIYSRIEYRDTAPSRAKRLARNGD 79 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + F+ + L + + + + ++ Y ++P + +L + +V Sbjct: 80 VFFQLVRPYQKNNYLFNLE-GKNYVFSTGYAQMRPSISSEYLINYLTTDKFIFQVLNRST 138 Query: 351 SGLRQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 ++ D+ ++ + +P + E F I ++ I+ + ++ + L + + S Sbjct: 139 GSSYPAINSTDLIKIKIAIPQNELESFKIGRILE----LINQTITLHQRKLDQLNQLKES 194 Query: 410 FIAAAVTGQ 418 + G+ Sbjct: 195 LLQQMFPGK 203 >gi|229088747|ref|ZP_04220304.1| Methyltransferase type 11 [Bacillus cereus Rock3-44] gi|228694572|gb|EEL47991.1| Methyltransferase type 11 [Bacillus cereus Rock3-44] Length = 395 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 59/405 (14%), Positives = 138/405 (34%), Gaps = 40/405 (9%) Query: 35 KLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSD--TSTVSIFAKGQILYG 89 + + G + GK I + D+ + NS Q D T + + G +++ Sbjct: 2 EFSNGINAPKENYGKGRKMISVMDILADEPIIYGNIRNSVQVDDKTESKNKVENGDLVFV 61 Query: 90 KLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + + + S + + K L+ +IE G+ Sbjct: 62 RSSEIRDEVGWAKAYRQKEYALYSGFSIRGKKKSDFDAKFIELSLNNSNRGQIERQAGGS 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T + + +I + P + EQ+ I ++D I + + +LK+ KQ + Sbjct: 122 TRFNVSQSILKSIGILEPSIEEQIEIGNF----FEKLDETIALHQQELTILKQTKQGFLQ 177 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG-- 262 + K +++ G + G + + V + K+ + Y Sbjct: 178 KMFPKEGESVPEVRFPG--FTGDWEERKLINNIIEKVLDFRGKSPAKFGMKWGNSGYLVL 235 Query: 263 ---NIIQKLETRNMGLKP------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 N+ + + K E + + ++ G++VF + + + Sbjct: 236 SALNVKNGYIDKLVEAKYGDQMLFERWMGKERLEKGDVVFTTEAPLGNVAQVP----DDN 291 Query: 314 GIITSAYMAVKP---HGIDSTYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLV 369 G I + + D+ +LA L+R+ ++ G + + ++ ++ + Sbjct: 292 GYILNQRVVAFKTSTEKTDNNFLAQLLRNPLFQTRLKENASGGTAKGIGMKEFAKMSATI 351 Query: 370 P-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P ++EQ I N ++D + + + LKE + +F+ Sbjct: 352 PASVEEQTKIGNF----FKQLDETIALHQLELDTLKETKKAFLQK 392 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 62/200 (31%), Gaps = 19/200 (9%) Query: 24 HWKVVPI-KRFTKLN---TGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 W+ + + G++ + + +V++G L + Q Sbjct: 198 DWEERKLINNIIEKVLDFRGKSPAKFGMKWGNSGYLVLSALNVKNGYIDKLVEAKYGDQM 257 Query: 74 DTS---TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ---PKDVLPELLQGW 127 KG +++ P A + D +G Q +V + L Sbjct: 258 LFERWMGKERLEKGDVVFTTEAPLGNVAQVPDDNGYILNQRVVAFKTSTEKTDNNFLAQL 317 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L + R++ G T K + IP E+ KI ++D I Sbjct: 318 LRNPLFQTRLKENASGGTAKGIGMKEFAKMSATIPASVEEQ---TKIGNFFKQLDETIAL 374 Query: 188 RIRFIELLKEKKQALVSYIV 207 ++ LKE K+A + + Sbjct: 375 HQLELDTLKETKKAFLQKMF 394 >gi|294792925|ref|ZP_06758071.1| putative type I restriction-modification system, S subunit [Veillonella sp. 6_1_27] gi|294455870|gb|EFG24234.1| putative type I restriction-modification system, S subunit [Veillonella sp. 6_1_27] Length = 490 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 73/421 (17%), Positives = 136/421 (32%), Gaps = 54/421 (12%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP W+ V +K + S I I D K ++ + + I Sbjct: 67 IPNTWRWVRLKEIVYNRGQKKPTSKFWYIDISSIDNTRQKLKQAINIIDAENAPSRARRI 126 Query: 81 FAKGQILYGKLGPYLRKAIIAD----FDGICSTQFLVL-QPKDVLPELLQGWLLSIDVTQ 135 G ILY + PYL I D F+ I ST + V + L +LLS Q Sbjct: 127 VDVGDILYSTVRPYLHNMCIIDSTSPFESIASTGLAAMTCYNKVYNKYLFYYLLSASFDQ 186 Query: 136 RIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ G + + + +P+PP+ EQ I EKI ID + E Sbjct: 187 YANSLENSKGVAYPAINDERLYKAVIPLPPVDEQKRIVEKIEVIFPLIDRYEGVWHKLNE 246 Query: 194 LLKEK----KQALVSYIVTKGLNPDVKMKDSGIEWV------------------------ 225 L K +++++ + L S E + Sbjct: 247 LNKTFPETLQKSILQEAIQGKLCEQKDEDGSAKELIEKISLEKERLIESGQIKKHKALPA 306 Query: 226 -------GLVPDHWEVKPFFALVTELNRKNTKLIE--SNILSLSYGNIIQKLETRNMGLK 276 +P W + + T K ++ + + I+K + + Sbjct: 307 IQEDEIPFDIPSSWCWERLGNISTYNQTKPKIKAIDLDRLIWVLDLDDIEKNTGKILRYV 366 Query: 277 PESYET----YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDST 331 + + G+I++ + K + +E G+ T + G + Sbjct: 367 KAKDKKVSGEKVVFHKGQILYSKLRPYLKKALIA----LEDGVCTPELVPFDIFGGCNRN 422 Query: 332 YLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 Y+ +++S + + G+ + E + L + +PPI+EQ I + + I Sbjct: 423 YILSVLKSPYVDFGVNSATYGVKMPRVGVETMINLLIPIPPIREQERIVKKFDKSHSLIQ 482 Query: 391 V 391 Sbjct: 483 R 483 >gi|303258214|ref|ZP_07344221.1| type I restriction system specificity protein [Burkholderiales bacterium 1_1_47] gi|302858967|gb|EFL82051.1| type I restriction system specificity protein [Burkholderiales bacterium 1_1_47] Length = 408 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 51/425 (12%), Positives = 122/425 (28%), Gaps = 49/425 (11%) Query: 26 KVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY--LPKDGNSRQSDTST 77 K + ++ G T ++ DI ++ ++D T K + S+ Sbjct: 2 KTYKLTDIAEVIVGGTPKTSVAEYWNGDIPWLSVKDFNKVTRYVLTTEKKISMEGLQKSS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ K I+ G A+I + ++ + + + Sbjct: 62 TNLLKKDDIIISARGTVGALAMIKTPMA-FNQSCYGIRVNAEKVSPAYLFYSLKTKIKAL 120 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +A G+ + + +P L EQ+ + +I+ + L ++ Sbjct: 121 KAASHGSVFDTITLDTLNGLDFELPSLNEQLCASNFLSLLDEKIELNNSINRNLDALARQ 180 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLV------PDHWEVKPFFALVTELNR----- 246 + SG + V P+HWEV F V Sbjct: 181 LYDYWFVQ-FDFPDESGRPYRTSGGKMVWNNRLKRNIPEHWEVVNIFDSVDVQYGFPFST 239 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + +S++ + +I+ E + G+++ + Sbjct: 240 DSFVDQDSDVPVVRIRDILNG---TVSAYSTEQVGEKYRLSTGDLILGMDGNFHM----- 291 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKR 364 + + + + + + +M + + L +D+K Sbjct: 292 NLWCDNKSFLNQRCVRFRQKDNSAVSTLQVMYEIAPYIRAKEQVAKGSTVGHLSDKDLKD 351 Query: 365 LPVLVPPIKEQFDITNVINVE-------TARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 L ++ P +N + I L+ + + I L + R + + G Sbjct: 352 LWIMTP-----------LNNKYFSASSTLNHISNLIIENRREISELTKLRDDLLPILLNG 400 Query: 418 QIDLR 422 Q+ +R Sbjct: 401 QVSIR 405 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 65/194 (33%), Gaps = 14/194 (7%) Query: 21 IPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP+HW+VV I + G + D+ + + D+ +GT + Sbjct: 216 IPEHWEVVNIFDSVDVQYGFPFSTDSFVDQDSDVPVVRIRDILNGTVSAYSTEQVGE--- 272 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI-DV 133 + G ++ G G + D + + + + KD + I Sbjct: 273 ---KYRLSTGDLILGMDG-NFHMNLWCDNKSFLNQRCVRFRQKDNSAVSTLQVMYEIAPY 328 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + E + +G+T+ H K + ++ + P + + + I E + Sbjct: 329 IRAKEQVAKGSTVGHLSDKDLKDLWIMTPLNNKYFSASSTLNHISNLIIENRREISELTK 388 Query: 194 LLKEKKQALVSYIV 207 L + L++ V Sbjct: 389 LRDDLLPILLNGQV 402 >gi|70725064|ref|YP_251978.1| hypothetical protein SH0063 [Staphylococcus haemolyticus JCSC1435] gi|68445788|dbj|BAE03372.1| hsdS [Staphylococcus haemolyticus JCSC1435] Length = 407 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 49/410 (11%), Positives = 135/410 (32%), Gaps = 26/410 (6%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +++ + + S K+++++ D+E G + ILY Sbjct: 4 KLEKLLDSVSIKHPFSKKNVVFLNTSDIEEGNI-LKKEYSKIDDLPGQAKKSIQPNDILY 62 Query: 89 GKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE--- 142 ++ P ++ +F+ + ST+ +VL+ + + + + + + Sbjct: 63 SEIRPKNKRYAYINFECDDYVVSTKLMVLRNINPDLVHSKYLYYFLIDQKTVNYLQNIAE 122 Query: 143 --GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 T + + N+ + +P + +Q+ I + +I+ EL + + Sbjct: 123 SRSGTFPQITFSEVKNLKLDLPSIEKQITIINIMDTLNEKINNNKKIISNLEELSQTSFK 182 Query: 201 ALVSYIVTKGLNPDVKMKDSGIE----WVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + K SG E +G +P +W +K + N L + Sbjct: 183 RWFVDFEFPDED-GNPYKSSGGEMIDSELGEIPKNWSIKTVKEIAESFNSIRKPLSKIER 241 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + ++ I +V +Q + + V + + Sbjct: 242 EKRESIYPYYGATKIIDYVDNYIFDGKYI-----LVGEDGTVQTETGNPFIQYVWGKFWV 296 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 ++ +K I L +++ ++ ++ L +++ + ++ + Sbjct: 297 SNHAHILKGKLISDELLMLYLKNTNVAPYI---TGAVQPKLNKKNLNSIKFVIADKETII 353 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 N I +I +L + L E R + + ++G+I++ + + Sbjct: 354 KFENSIKSYFQKIRIL----NKENKKLIELRDTLLPKLMSGEIEIPDDIE 399 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 65/204 (31%), Gaps = 21/204 (10%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 YK SG + +G IPK+W + +K + K P Sbjct: 198 YKSSGGEMIDSELGEIPKNWSIKTVKEIAESFNSIRKPLSKIE--------REKRESIYP 249 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVL 120 G ++ D IF IL G+ G S +L+ K + Sbjct: 250 YYGATKIIDYVDNYIFDGKYILVGEDGTVQTETGNPFIQYVWGKFWVSNHAHILKGKLIS 309 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 ELL +L + +V GA + K + +I I + I + + Sbjct: 310 DELLMLYLKNTNVAP----YITGAVQPKLNKKNLNSIKFVIADKETIIKFENSIKSYFQK 365 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 I L E + IEL L+S Sbjct: 366 IRILNKENKKLIELRDTLLPKLMS 389 >gi|154507565|ref|ZP_02043207.1| hypothetical protein ACTODO_00044 [Actinomyces odontolyticus ATCC 17982] gi|153797199|gb|EDN79619.1| hypothetical protein ACTODO_00044 [Actinomyces odontolyticus ATCC 17982] Length = 383 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 53/396 (13%), Positives = 110/396 (27%), Gaps = 37/396 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + +L G + + G+ G + S Sbjct: 13 PDGVEYRALGDVAELKRGEAVTRKEVV-----------EGQVPVIAGGREPAYYIDRSNR 61 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I+ G Y D S F ++ + VL + + + I A+ Sbjct: 62 QGETIVIAGSGAYAGFVSFWDEPIFVSDAFSIVVDRSVL-QPRFVYHWLSGRQEAIHALK 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + H K + + P+PPL Q I + T L E + Sbjct: 121 SGGGVPHVYPKDVAKLRCPVPPLEVQREIVRILDQFTTLEAELEAELEARRTQYAHYRTH 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+SY P + + V K T + ++ Sbjct: 181 LLSYESLAARGPVN------VIELQDVGVVRMCKRIHKAETSIQGDIPFFK-----ISTF 229 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G + + K + Y G+++ A + ++ Sbjct: 230 GGTPTSFISAELYGKYKD--KYPYPKKGDLLISAAGTIGQIVRFDGADAYFQD-SNIVWL 286 Query: 322 AVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + YL + + + G + L + + + VPPI+ Q I + Sbjct: 287 EHDESIVLNRYLYYVYLNTRWTTD------GGTIKRLYNNRILQQQICVPPIETQITIAD 340 Query: 381 VINVETARIDVLVEKIEQSI----VLLKERRSSFIA 412 +++ A ++ + + I + R ++ Sbjct: 341 LLDRFDALVNDISSGLPAEIAARRAQYEHYRDRLLS 376 >gi|91785555|ref|YP_560761.1| putative HsdS protein [Burkholderia xenovorans LB400] gi|91689509|gb|ABE32709.1| Putative HsdS protein [Burkholderia xenovorans LB400] Length = 438 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 68/435 (15%), Positives = 133/435 (30%), Gaps = 49/435 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVE-SGTGKYLPKDGNSRQ 72 W V + ++ G +S I + + + + +G ++ Sbjct: 3 SEWTHVRLGELAEVKHGWAFKSDYFKADDEAAGLPIVVAIGNFQYTGGFRFESTQIKRYT 62 Query: 73 SDTSTVSIFAKGQILYGKL-----GPYLRKAIIADFDGICSTQF-----LVLQPKDVLPE 122 + + I G+IL G L +G +VL+ V + Sbjct: 63 GEFPSEYILQPGEILLVMTCQTAGGEILGIPARVPDNGRVYLHNQRLGKVVLKSGRVCSD 122 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L L + + G + H I + +P +AEQ I E + A RI Sbjct: 123 FLYWLFLYPPFNRHLVNSATGTKILHTAPSRIESFEFKLPSVAEQREIAEALDAIDDRIS 182 Query: 183 TLITERIRFIELLKEKKQALVS-----YIVTKGLNPDVK------MKDSGIEW--VGLVP 229 L + + + ++ +G P+ + G E +GLVP Sbjct: 183 LLRETNVTLEAIAQAMFKSWFVDFEPVRAKQEGRAPEGMDEATAALFPDGFEESELGLVP 242 Query: 230 DHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 W + + LN K E L + ++ T+N + I Sbjct: 243 RAWRARSLDSFADYLNGLALQKFPAESEDEYLPVIKIAQLRAGNTQNADRASTKLKAEYI 302 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL--C 343 V G+++F + G + V + + +L + L Sbjct: 303 VRDGDVLFSWSGSLE-----VELWCGGEGALNQHLFKVTSSEV-PKWFYYLATRHHLPEF 356 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + A + ++ + + + VPP + + + A + L + I L Sbjct: 357 REIAAHKATTMGHIQRKHLTEAKIAVPPPE----VLTRLTEFVAPLIELRIENAVRIRSL 412 Query: 404 KERRSSFIAAAVTGQ 418 E R S + ++GQ Sbjct: 413 GELRDSLLPRLISGQ 427 >gi|319788900|ref|YP_004090215.1| restriction modification system DNA specificity domain [Ruminococcus albus 7] gi|315450767|gb|ADU24329.1| restriction modification system DNA specificity domain [Ruminococcus albus 7] Length = 536 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 68/453 (15%), Positives = 134/453 (29%), Gaps = 76/453 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P+ W+ ++ + T T + S + I++ ++V SG + Sbjct: 87 DLPEGWEWARLQSICEPITDGTHKTPTYSDEGFIFLSSKNVTSGHIDWDNIMYIPESLHN 146 Query: 76 STVSIF--AKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLS 130 + K IL K G AI+ S L + + PE L + S Sbjct: 147 ELYARLAPQKNDILLAKNGTTGVAAIVNRDCVFDIYVSLALLRIIGYIISPEYLLSTIAS 206 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + +G + + + I +P+ P+ EQ I K+ D + +++ Sbjct: 207 STIQNYFNSSLKGIGVPNLHLEHIRTTLIPVAPINEQNRIAAKLEQLLSFADNIESDKTD 266 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVK---------------------------------- 216 ++ K ++ + L P Sbjct: 267 LQTTIQLTKSKILDLAIRGKLVPQNPDDEPASVLLDRIRAEKEELIKQGKIKRDKKESVI 326 Query: 217 ---------------MKDSGIEWVGLVPDHWEVKPFFALVT--ELNRKNTKLIESNILSL 259 + E +PD W K+ K ES+ + Sbjct: 327 FKGDDNSYYEKIGDTVTCIDEELPFELPDGWAWVRLQTCCQKEIKRGKSPKYTESSGTLV 386 Query: 260 SYGNIIQKLETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 K + NM L Y + + + V R + Sbjct: 387 FAQKCNTKYDGINMDLALYLDESTLVKYPDDEYMQDKDTVINSTGTGTLGRVGIYRRTDN 446 Query: 313 RGII-----TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 R + + + + I + Y+ ++++ GS ++ LK +K L V Sbjct: 447 RREMPVVPDSHVTVIRTNNEISAEYIYHFLKAHQHELEKLGEGSTNQKELKPLTLKNLIV 506 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 +PP EQ I +I ++ IE+S+ Sbjct: 507 ALPPYAEQERIIEIITAAFE----IMTNIEKSL 535 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 75/215 (34%), Gaps = 10/215 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQK-L 268 PD +K E +P+ WE +T+ K + + LS N+ + Sbjct: 73 PDGTVKCIEDEIPYDLPEGWEWARLQSICEPITDGTHKTPTYSDEGFIFLSSKNVTSGHI 132 Query: 269 ETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + N+ PES +I+ ++ + + ++ A + + Sbjct: 133 DWDNIMYIPESLHNELYARLAPQKNDILLAKNGTTG-VAAIVNRDCVFDIYVSLALLRII 191 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + I YL + S + F + G+ +L E ++ + V PI EQ I + Sbjct: 192 GYIISPEYLLSTIASSTIQNYFNSSLKGIGVPNLHLEHIRTTLIPVAPINEQNRIAAKLE 251 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + D + ++ +S + A+ G+ Sbjct: 252 QLLSFADNIESDKTDLQTTIQLTKSKILDLAIRGK 286 >gi|294677465|ref|YP_003578080.1| type I restriction-modification system RcaSBIV subunit S [Rhodobacter capsulatus SB 1003] gi|294476285|gb|ADE85673.1| type I restriction-modification system RcaSBIV, S subunit [Rhodobacter capsulatus SB 1003] Length = 401 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 67/417 (16%), Positives = 124/417 (29%), Gaps = 43/417 (10%) Query: 24 HWKVVPIKRFT-KLNTGRTSE----SGKDIIYIGLEDVESGTGKY---LPKDGNSRQSDT 75 W+V P+ + G E S I I ++ +G K + + + Sbjct: 4 GWQVKPLHSLALTITDGNWVETKDQSDSGIRLIQTGNIGTGFFKNRCEKSRYIDDATFER 63 Query: 76 STVSIFAKGQILYGKL-GPYLRKAIIAD----FDGICSTQFLVLQPKDVLPELLQGWLLS 130 + G L +L P R II D + +LPE + +S Sbjct: 64 LRCTEVFPGDCLVSRLPDPVGRSCIIPDTGEKMITAVDCTIIRFDRDVLLPEFFIYFSMS 123 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + C G T +G +P+P+PPL EQ I + +D Sbjct: 124 QSYLTAVADACTGTTRQRISRTNLGKLPIPLPPLDEQKRIIAILDETFEGLDRARANAEA 183 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + +E +A + E + W + + + Sbjct: 184 NLADARELFEATLR------------------EELEKNSTDWRECSLSDIGQTVTGSTPR 225 Query: 251 LIES-----NILSLSYGNIIQKL--ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 E+ I + G+ + + GL + + +I+ PG + I K Sbjct: 226 TSETGNTGTFIPFIKPGDFLPDGRLNYESEGLSEKGAASSRILPPGSALMVCIGATIGKA 285 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDV 362 + I + V GI ++ M S + G + Sbjct: 286 GFSDRSIATNQQINA---LVPSVGICGEFVYLQMLSKSFQREVIQNAGQATLPIINKSKW 342 Query: 363 KRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L V +P + Q +T + + L + ++ L R S + A +G+ Sbjct: 343 SALKVRMPHDLSRQEAVTAKMREARNHVSSLEKHFTTTLADLTSLRQSLLQKAFSGE 399 >gi|282907757|ref|ZP_06315597.1| type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus WW2703/97] gi|282328321|gb|EFB58594.1| type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus WW2703/97] Length = 387 Score = 96.0 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 60/403 (14%), Positives = 137/403 (33%), Gaps = 36/403 (8%) Query: 28 VPIKRFTKLNTGRTSES---GKDIIYIGLEDVESG---TGKYLPKDGNSRQSDTSTVSIF 81 + + G G + +DV + L N + S Sbjct: 1 KKVGELLEFKNGLNKGKEYFGSGSSIVNFKDVFNNRSLNTNNLTGKVNVNSKELKNYS-V 59 Query: 82 AKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 KG + + + + + + + S L +PK + + + + T Sbjct: 60 EKGDVFFTRTSEVIGEIGYPSVILNDPENTVFSGFVLRGRPKSGIDLINNNFKRYVFFTN 119 Query: 136 RIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + ++M+ I + KI ++D I + +EL Sbjct: 120 SFRKEMITKSSMTTRALTSGSAINKMKVIYPVSAKEQRKIGDFFSKLDRQIELEEQKLEL 179 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+++K+ + I ++ L + HWE + E N ++ Sbjct: 180 LQQQKKGYMQKIFSQEL--------RFKDENSEDYPHWENSKIEKYLKERNERSD--KGQ 229 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + II+ E + Y++V +I + + + + G Sbjct: 230 MLSVTINSGIIKFSELDRKDNSSKDKSNYKVVRKNDIAYNSMRMWQGASG----RSNYNG 285 Query: 315 IITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVP 370 I++ AY + P S+ + +++ + F GL +LK++ +K + + +P Sbjct: 286 IVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTSDTWNLKYKQLKNINIDIP 345 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++EQ I + ++D+L+ K + I +L++ + SF+ Sbjct: 346 VLEEQEKIGDF----FKKMDILISKQKIKIEILEKEKQSFLQK 384 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 67/184 (36%), Gaps = 9/184 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFA 82 HW+ I+++ K R+ + + + + SG K+ D ++ D S + Sbjct: 208 HWENSKIEKYLKERNERSDKGQM----LSVT-INSGIIKFSELDRKDNSSKDKSNYKVVR 262 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I Y + + + ++++GI S + VL P L G+ I Sbjct: 263 KNDIAYNSMRMWQGASGRSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINS 322 Query: 143 G---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + +K + NI + IP L EQ I + + I + + + Sbjct: 323 QGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKIKIEILEKEKQSFL 382 Query: 200 QALV 203 Q + Sbjct: 383 QKMF 386 >gi|293189230|ref|ZP_06607953.1| type I restriction enzyme specificity protein HsdS [Actinomyces odontolyticus F0309] gi|292821693|gb|EFF80629.1| type I restriction enzyme specificity protein HsdS [Actinomyces odontolyticus F0309] Length = 395 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 58/406 (14%), Positives = 122/406 (30%), Gaps = 45/406 (11%) Query: 22 PKHWKVVPIKRFTKLNTG-RTSES---GKDIIYIGLEDVESGTGKYLPKDGNSR-QSDTS 76 P + P+ L G + K I I + + G D Sbjct: 13 PDGVEYRPLGEIADLQRGAGMPKKLFVDKGIPAIHYGHIFTKYGIQAKCAAAYLAPEDAE 72 Query: 77 TVSIFAKGQILYGKLGPYL-----RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++ G ++ L + D +G+ V++ V L +L + Sbjct: 73 KLTRVFPGDLVVANTSENLEDVGKGVVWLGDVEGVTGGHATVVRSLAVDSVFLSYYLRTE 132 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 D + +G + + I +P+PP+ Q I + T L E Sbjct: 133 DFALKKRKYAQGTKVIELSAANLSKIDIPLPPVEVQREIVRILDQFTTLEAELEAELEAR 192 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + + L+SY P +K +G + T + + Sbjct: 193 QAQYEHYRNHLLSYDSLAARGPVEMVK------LGE---------LAHIATGGRNTSDAV 237 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + L ++ ++ G+ V + Sbjct: 238 DNGTYPFYVRSQVP-------LSLNEYDFDESAVLTAGDGV--------GVGKVFHHVEG 282 Query: 312 ERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + + AY + + S YL +M S + + S++ ++R PV VP Sbjct: 283 KYALHQRAYRIVPNLELLSSRYLYHVMVSQFGRYLESTVFHSSVTSVRKPMLERFPVAVP 342 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSI----VLLKERRSSFIA 412 P++EQ + +V++ A ++ + + I + R ++ Sbjct: 343 PMEEQDRVADVLDRFNALVNDITSGLPAEIAARRAQYEHYRDRLLS 388 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 65/169 (38%), Gaps = 9/169 (5%) Query: 228 VPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNM----GLKPESY 280 PD E +P + ++ I ++ YG+I K + L PE Sbjct: 12 CPDGVEYRPLGEIADLQRGAGMPKKLFVDKGIPAIHYGHIFTKYGIQAKCAAAYLAPEDA 71 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME-RGIITSAYMAVKPHGIDSTYLAWLMRS 339 E V PG++V + + + G+ V+ +DS +L++ +R+ Sbjct: 72 EKLTRVFPGDLVVANTSENLEDVGKGVVWLGDVEGVTGGHATVVRSLAVDSVFLSYYLRT 131 Query: 340 YDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D G + L ++ ++ + +PP++ Q +I +++ T Sbjct: 132 EDFALKKRKYAQGTKVIELSAANLSKIDIPLPPVEVQREIVRILDQFTT 180 >gi|332969662|gb|EGK08678.1| hypothetical protein HMPREF9374_3258 [Desmospora sp. 8437] Length = 281 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 74/201 (36%), Gaps = 9/201 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 E +P++W K K + + GNI T +G + Sbjct: 21 PEAEQPYELPENWVWVRLLDGGAICLDKFRKPVNARQREERKGNIPYYGATGQVGWIDDF 80 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS--TYLAWLM 337 ++V GE F+D K + + + + + +K + +L + + Sbjct: 81 LTNEELVLVGEDGAPFLDPNKSKAYM----ITGKAWVNNHAHILKSNFGSPGNKFLTYYL 136 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 ++ R L ++++P +PP+ EQ I + + +ID E I+ Sbjct: 137 NQFNYNGFV---TGTTRLKLTQGKLRQIPFPLPPLSEQKRIVDRVESLLGKIDEAKELIQ 193 Query: 398 QSIVLLKERRSSFIAAAVTGQ 418 ++ ++RR++ + A G+ Sbjct: 194 EARDSFEQRRAAILDRAFRGE 214 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 29/213 (13%), Positives = 59/213 (27%), Gaps = 22/213 (10%) Query: 20 AIPKHWKVVPI---KRFT------KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 +P++W V + +N + E +I Y G + G + Sbjct: 28 ELPENWVWVRLLDGGAICLDKFRKPVNARQREERKGNIPYYGAT-GQVGWIDDFLTNEEL 86 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 KA + + +L+ P Sbjct: 87 VLVGEDGAPFLDPN----------KSKAYMITGKAWVNNHAHILKSNFGSPGNKFLTYYL 136 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 G T + IP P+PPL+EQ I +++ + +ID Sbjct: 137 NQF--NYNGFVTGTTRLKLTQGKLRQIPFPLPPLSEQKRIVDRVESLLGKIDEAKELIQE 194 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE 223 + ++++ A++ L + + E Sbjct: 195 ARDSFEQRRAAILDRAFRGELTRTWREQHPDAE 227 >gi|229082881|ref|ZP_04215305.1| hypothetical protein bcere0023_54720 [Bacillus cereus Rock4-2] gi|228700419|gb|EEL52981.1| hypothetical protein bcere0023_54720 [Bacillus cereus Rock4-2] Length = 393 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 44/385 (11%), Positives = 102/385 (26%), Gaps = 27/385 (7%) Query: 33 FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG 92 + + + S ++ I E + + + + + G + L Sbjct: 26 IFESISNKNHNSDLPVLAITQEHGAIPRDR-INYNVSVTNKSLENYKVVEIGDFVIS-LR 83 Query: 93 PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGATM-SHAD 150 + + + GICS +++L+ K + E + + Q + EG Sbjct: 84 SFQGGIEYSLYHGICSPAYIILRKKIPIVEQYYKHYFKTNKFIQDLNKDLEGIRDGKMVS 143 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 + +I +P P EQ I + + + I + K Q L+ Sbjct: 144 YSQFSSILLPKPENKEQQKIADFLSSLDDLITAENEKLEALKVNKKGLMQKLL------- 196 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 G W F + + N + + Sbjct: 197 ------------PAEGKTVPEWRFPEFRDCREWDIYRIKDFAKVTTGKKDTQNKVDHGKY 244 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + E + D ++ + Sbjct: 245 PFFVRSQAVEKIDSYTFDCEAILTSGDGVGVGKNFHYINGKFDFHQRVYCIYDFSKSAFG 304 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 ++ + +V S++ + +P+ +P I EQ I++ + + ID Sbjct: 305 KFVFQYFSEHFKNRVMKLSAKNSVDSVRKSMITEMPITMPNIAEQHKISDCL----SSID 360 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAV 415 L+ + + LK + + Sbjct: 361 DLITAQAEKVKTLKLYKKGLMQGLF 385 >gi|254383777|ref|ZP_04999125.1| type I restriction-modification system specificity subunit [Streptomyces sp. Mg1] gi|194342670|gb|EDX23636.1| type I restriction-modification system specificity subunit [Streptomyces sp. Mg1] Length = 403 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 69/385 (17%), Positives = 138/385 (35%), Gaps = 27/385 (7%) Query: 47 DIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF-- 103 ++ +++ + + S +G +L K G L + + Sbjct: 22 GFTFLSTPNIKGREIDFDNVNYITEFRYQESPELKLREGDVLLAKDGNTLGIVNLVKYLP 81 Query: 104 -DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP 162 + V++P + L+ L S I + G + H I +P+P+P Sbjct: 82 RPATVNGSIAVIRPTGIDGAFLRYVLASRVTQAAINMLKGGMGVPHLFQWDINRLPVPVP 141 Query: 163 PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI 222 PL EQ I + + ET RID L R R LL+E+ + ++K Sbjct: 142 PLEEQRRIADFLDVETARIDRLTQLRSRQAGLLEERFGLALDKAFENATYEPTRLKY--- 198 Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 + + P + + P ++ + L++ + S I +L S Sbjct: 199 -LLAVKPRYGVLVPQYSDSGVRFIRVNDLLDLAGRADSLAKIPDELS---------SQYA 248 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + PG+++ + + ++ Q+ + + H + LA + + Sbjct: 249 RTVTRPGDVLLSVVGTMG-RSAVVPPQLAGANVARAVASLRTRHEVSPELLATWLTTPSF 307 Query: 343 CKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID----VLVEKI 396 + + + +L ED+ + P + E+ + + + T+ I L + Sbjct: 308 LRQASDVTGSDTAQPTLGMEDLSNFRLSWP-VDERGR--DELLLVTSTIRRHQRELTGVL 364 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDL 421 E +L ERR + I AAVTGQ D+ Sbjct: 365 EVQRRVLTERRQALITAAVTGQFDV 389 >gi|269115098|ref|YP_003302861.1| Type I restriction enzyme specificity protein [Mycoplasma hominis] gi|268322723|emb|CAX37458.1| Type I restriction enzyme specificity protein [Mycoplasma hominis ATCC 23114] Length = 446 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 60/394 (15%), Positives = 118/394 (29%), Gaps = 31/394 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDII------YIGLEDVESGTGKYLPKDGNSRQSDT 75 P + I L G + ++ D + YI ++ + + Sbjct: 13 PNGVEYKKIGDLGILYNGLSGKNKNDFLNNTNKQYITYLNIFNNLSIDIKGLEKVSVLKN 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIAD------------FDGICSTQFLVLQPKDVLPEL 123 + G IL+ + A + + + ++ Sbjct: 73 EKQNRVLYGDILFTTSSESANECGYASVANDKYFDNNDVYLNSFCFGYRLFNIENYNVNY 132 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + +++ + I G T + + I +PIPPL Q I + T Sbjct: 133 FKYLFKDLNIRKEIIKCVNGVTRFNLSKEQFKRILIPIPPLEIQNQIVNILDKFTELTTE 192 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 L TE + + L+ + K L + M + + Sbjct: 193 LTTELTYRDKQYNYYRNKLLDFDNNKEL-LNKIMNNQQCSNNIVEYKKIGDLGILYNGLS 251 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQN 300 KN L +N ++Y NI L L+ S E V G+I+F Sbjct: 252 GKNKNDFLNNTNKQYITYLNIFNNLSIDIKSLEKVSVLKNEKQNRVLYGDILFTTSSESA 311 Query: 301 DKRSLRSAQVMERGIITSAYMAVKP--------HGIDSTYLAWLMRSYDLCKVFYAMGSG 352 ++ S + Y+ + Y +L + ++ K +G Sbjct: 312 NECGYASVANDKYFDNNDVYLNSFCFGYRLFNIENYNVNYFKYLFKDLNIRKEIIKCVNG 371 Query: 353 -LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 R +L E KR+ + +PP++ Q I +++ Sbjct: 372 VTRFNLSKEQFKRILIPIPPLEIQNKIVEILDKL 405 >gi|323215378|gb|EGA00122.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. MB101509-0077] Length = 552 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 63/473 (13%), Positives = 121/473 (25%), Gaps = 99/473 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLE 54 +K K P+ S + +P W+ V G+T KD I ++ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPVGWEWVTFSHLGHFFGGKTPSKMKDEYWGGTIPWVTPK 140 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQF 111 D+++ + ++ + G IL+ LR I + + Sbjct: 141 DMKTNLIVDSEDKVTPLAIE-DGLTKVSPGSILFVARSGILRRIFPVAITSIECTVNQDL 199 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQV-- 168 VL P +++ +E + G T+ + + P IPP AEQ Sbjct: 200 KVLSPFLSEISYYIRLMMNGFERYIVENLTKTGTTVESLLFDDFISHPFMIPPFAEQNRI 259 Query: 169 ---------------------------------------LIREKIIAETVRIDTLITERI 189 + + RI Sbjct: 260 LSTVKKLMSLCDQLEQHSLTSLDAHQQLVETLLTTLTDSQNADALAENWARISEHFDTLF 319 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------------ 219 + KQ ++ V L P + Sbjct: 320 TTEASIDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKDGKIKKQKPLPP 379 Query: 220 -SGIEWVGLVPDHWEVKP-------FFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 S E VP+ WE + + I ++ G+I + Sbjct: 380 ISDKEKPFEVPEGWEWCKFGLISEFINGDRGSNYPNKNEYVVHGIPWINTGHIEKNGTLS 439 Query: 272 NMGLKPESYETYQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + + + G++V+ K + I +S + Sbjct: 440 ITDMNFITEKKFNELRSGKIQSGDLVYCLRGATFGKTAFVKPYESG-AIASSLMIIRPFI 498 Query: 327 GIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 Y+ + S + + +L V PP++EQF I Sbjct: 499 REMGEYIYNYLISPFGRSQIFRFDNGSAQPNLSANSVMLYAFACPPLQEQFRI 551 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 62/196 (31%), Gaps = 10/196 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P WE F L K ++ + + K N+ + E Sbjct: 93 SEEEKPFELPVGWEWVTFSHLGHFFGGKTPSKMKDEYWGGTIPWVTPKDMKTNLIVDSED 152 Query: 280 -------YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + V PG I+F +R A + + P + +Y Sbjct: 153 KVTPLAIEDGLTKVSPGSILFVARSGIL-RRIFPVAITSIECTVNQDLKVLSPFLSEISY 211 Query: 333 LAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 LM + + +SL F+D P ++PP EQ I + + + D Sbjct: 212 YIRLMMNGFERYIVENLTKTGTTVESLLFDDFISHPFMIPPFAEQNRILSTVKKLMSLCD 271 Query: 391 VLVEKIEQSIVLLKER 406 L + S+ ++ Sbjct: 272 QLEQHSLTSLDAHQQL 287 >gi|260858509|ref|YP_003232400.1| type I restriction-modification enzyme S subunit [Escherichia coli O26:H11 str. 11368] gi|257757158|dbj|BAI28660.1| type I restriction-modification enzyme S subunit [Escherichia coli O26:H11 str. 11368] Length = 589 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 66/514 (12%), Positives = 140/514 (27%), Gaps = 109/514 (21%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLE 54 +K K P+ S + +P+ W+ + G + K+I+ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWTRLINLGIWALGSGFPNVVQGSTDKEILMCKVS 140 Query: 55 DVE-SGTGKYLPKDGNSRQSDTS---TVSIFAKGQILYGKLG---PYLRKAIIADFDGIC 107 D+ G K++ N+ D + + I G I++ K+G ++ I+ I Sbjct: 141 DMNLEGNEKFIFSTKNTISKDLADEYKIKISEPGTIIFPKIGGAIATNKRRILVQDTAID 200 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + + + E L ++D + G ++ + IG+IP+ +P L Q Sbjct: 201 NNCLGIKPCDAISGEWFYLILNTLD----MSKYQSGTSIPAINQSVIGSIPIALPSLKMQ 256 Query: 168 VLIREK-----------------------------------------IIAETVRIDTLIT 186 I + RI Sbjct: 257 EKIVSYVITLMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNAEELAENWARISEHFD 316 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD--------------------------- 219 + KQ ++ V L P + Sbjct: 317 TLFTTEASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKP 376 Query: 220 ----SGIEWVGLVPDHWEVKP-------FFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 S E +P+ WE + + I ++ G+I + Sbjct: 377 LPPISDEEKPFELPEGWEWCKFGLTSEFINGDRGSNYPNKNEYVSQGIPWINTGHIEKNG 436 Query: 269 ETRNMGLKPESYETYQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + + + G++V+ K + I +S + Sbjct: 437 TLTVTEMNFITEGKFNELRSGKIQKGDLVYCLRGATFGKTAFVIPYETG-AIASSLMIIR 495 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDI---T 379 Y+ + S Y + +L V PP+ EQ+ I Sbjct: 496 PFITEMGGYIYNYLTSPFGRSQIYRFDNGSAQPNLSANSVMLYSFPCPPLTEQYRIFSQV 555 Query: 380 NVINVETARIDVLVEKIEQ-SIVLLKERRSSFIA 412 +++ ++ ++ +Q + L + I Sbjct: 556 GLLHELCDKLKTRIKTAQQTQLHLADALTDAAIN 589 >gi|310287613|ref|YP_003938871.1| HsdS-like protein of Type I restriction-modification system [Bifidobacterium bifidum S17] gi|309251549|gb|ADO53297.1| HsdS-like protein of Type I restriction-modification system [Bifidobacterium bifidum S17] Length = 412 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 64/407 (15%), Positives = 131/407 (32%), Gaps = 34/407 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV----SI 80 W+ + + G DI G+ G Y + DT ++ Sbjct: 19 WEQRKLGEIASFSKGSGYSKA-DIRESGIPLFLYG-RMYTQYETRVDSVDTFAAPRPGTL 76 Query: 81 FAKG-QILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW-LLSIDVT 134 ++KG +I+ G R + I V+ P+ ++ L + L Sbjct: 77 YSKGTEIVVPASGESAEDIARASAITREGIALGGDLNVVYPQRMVTPLFLAYGLSHGSSQ 136 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + +G T+ H + + + P + EQ I + I + + + Sbjct: 137 KLLAQKAQGKTVVHIHASDLKGLGIAFPDVTEQQAIGTFFSSLDDLITLHQRKYDKLV-- 194 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK--LI 252 + + + + P I + G D WE + + KN L Sbjct: 195 -------IFKKTMLEKMFPKDGESVPEIRFAGFT-DPWEQRKLGEFSKKNTIKNANGALS 246 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVM 311 E+ S G I Q + + Y +V P + V+ I + ++ Sbjct: 247 ETFTNSAEQGVISQLDYFDHDITNDANISGYYVVQPDDFVYNPRISATAPCGPINRNRLN 306 Query: 312 ERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLP 366 G+++ Y +D TYL ++ + G+ R S+ + +P Sbjct: 307 RAGVMSPLYTVFSVDASMDKTYLEHYFKTSRWHDFMFLEGNTGARSDRFSISDATLFEMP 366 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + P I EQ + + + L+ ++ + LL+ + S + Sbjct: 367 IWCPEISEQIAMAKQLET----TETLITLHQRKLELLRNIKKSLLDK 409 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 60/191 (31%), Gaps = 8/191 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 + ++ + + ES I YG + + ETR + + + Sbjct: 17 DPWEQRKLGEIASFSKGSGYSKADIRESGIPLFLYGRMYTQYETRVDSVDTFAAPRPGTL 76 Query: 287 DPG--EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLC 343 EIV + + SA E + V P + +LA+ + Sbjct: 77 YSKGTEIVVPASGESAEDIARASAITREGIALGGDLNVVYPQRMVTPLFLAYGLSHGSSQ 136 Query: 344 KVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K+ G + D+K L + P + EQ I + +D L+ ++ Sbjct: 137 KLLAQKAQGKTVVHIHASDLKGLGIAFPDVTEQQAIGTF----FSSLDDLITLHQRKYDK 192 Query: 403 LKERRSSFIAA 413 L + + + Sbjct: 193 LVIFKKTMLEK 203 >gi|262373386|ref|ZP_06066665.1| predicted protein [Acinetobacter junii SH205] gi|262313411|gb|EEY94496.1| predicted protein [Acinetobacter junii SH205] Length = 814 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 73/488 (14%), Positives = 142/488 (29%), Gaps = 101/488 (20%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W + T L T T + I +I ++DV T + S + + Sbjct: 101 LPSKWVKAYLGEVTLLITDGTHHTPKYLDSGIPFISVKDVSGKTISFDDCKYISSEEHSE 160 Query: 77 TVSIFAK--GQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + IL ++G R +I DF S L L + L +L S Sbjct: 161 LIKRCKPEINDILLCRIGTLGRATLIDVEKDFSIFVSLGLLKLSKIINYSKYLHLFLHSP 220 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI-------------------- 170 + Q E G+ + + K + +I + +PPL EQ I Sbjct: 221 QALLQFDEVKVGGSHTNKLNLKDLPHIVINLPPLEEQQRIVEKVDELMQLCDQLEQQQNL 280 Query: 171 ---------------------REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 ++ RI + + KQ ++ V Sbjct: 281 SSEAHDQLVDTLLNVLTNSSDVDEFQQNWQRISENFDLLFTTEYSIDQLKQTILQLAVMG 340 Query: 210 GLNPDVK-------------------------------MKDSGIEWVGLVPDHWEVKPFF 238 L ++ S E +P +W Sbjct: 341 KLVKQDPNDEPASELLKQITEEKAKLIKEGKIKKSKPLLEISNEEKQYEIPHNWVWARLD 400 Query: 239 A------LVTELNRKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQ---IVDP 288 + + ++S I L N L ++ E V Sbjct: 401 SLTSKIGAGSTPKGGKEVYVDSGIPFLRSQNVWNDGLALDDVAFISEGTHEKMSGTHVQA 460 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 +++F + +L + + + + +L ++RS + K+ Sbjct: 461 NDLLFNITGGSIGRCALVATDFETANVSQHVTIVRSIDKDLAPFLHLVLRSSYIQKLVMD 520 Query: 349 MGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + G+ R+ L + + + +P + EQ I + + + ID L + S+ L++ + Sbjct: 521 VQVGVSREGLSIGKLSQFLIPLPSLTEQKRIIKKVEILNSIIDSL----QVSLRKLQKTK 576 Query: 408 ----SSFI 411 S I Sbjct: 577 LHLADSLI 584 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 64/179 (35%), Gaps = 8/179 (4%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP-----GEIVFRF 295 +T+ K ++S I +S ++ K + + S E +++ +I+ Sbjct: 117 ITDGTHHTPKYLDSGIPFISVKDVSGKTISFDDCKYISSEEHSELIKRCKPEINDILLCR 176 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGL 353 I + +L + ++ + + S YL + S F +G Sbjct: 177 IGTLG-RATLIDVEKDFSIFVSLGLLKLSKIINYSKYLHLFLHSPQALLQFDEVKVGGSH 235 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 L +D+ + + +PP++EQ I ++ D L ++ S + + + Sbjct: 236 TNKLNLKDLPHIVINLPPLEEQQRIVEKVDELMQLCDQLEQQQNLSSEAHDQLVDTLLN 294 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 67/200 (33%), Gaps = 12/200 (6%) Query: 20 AIPKHWKVVPIKRF-TKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPK-DGNSR 71 IP +W + +K+ G T + GK+ I ++ ++V + + Sbjct: 389 EIPHNWVWARLDSLTSKIGAGSTPKGGKEVYVDSGIPFLRSQNVWNDGLALDDVAFISEG 448 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQF-LVLQPKDVLPELLQGW 127 + + + +L+ G + + + S +V L L Sbjct: 449 THEKMSGTHVQANDLLFNITGGSIGRCALVATDFETANVSQHVTIVRSIDKDLAPFLHLV 508 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S + + + + G + + +P+P L EQ I +K+ ID+L Sbjct: 509 LRSSYIQKLVMDVQVGVSREGLSIGKLSQFLIPLPSLTEQKRIIKKVEILNSIIDSLQVS 568 Query: 188 RIRFIELLKEKKQALVSYIV 207 + + +L+ + Sbjct: 569 LRKLQKTKLHLADSLIVNAL 588 >gi|28897161|ref|NP_796766.1| HsdS polypeptide [Vibrio parahaemolyticus RIMD 2210633] gi|28805370|dbj|BAC58650.1| putative HsdS polypeptide, part of CfrA family [Vibrio parahaemolyticus RIMD 2210633] Length = 583 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 57/479 (11%), Positives = 128/479 (26%), Gaps = 97/479 (20%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLP-KDGNSRQS 73 P HW+ + + + + G+ G D +Y+ + D+++ + + + Sbjct: 97 PLHWETICVGQVAHVLGGKRVPKGYKLSEQPTDFVYLRVTDMKNQSIDESDLRYISEEVF 156 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSID 132 + G + G I S T+ L + +L Sbjct: 157 KQISRYTINTGDVYVTIAGTIGAVGTIPPHLDGMSLTENAAKLVFSGLSKKYLVTVLQSS 216 Query: 133 V-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR----------- 180 T++ I + +PIPPL EQ I +K+ Sbjct: 217 FVTRQFNDAVNQMAQPKLSLNSIKHTCIPIPPLEEQEYIADKVDELMALCDQLEQQTEAS 276 Query: 181 ------------------------------IDTLITERIRFIELLKEKKQALVSYIVTKG 210 I E + + KQ ++ V Sbjct: 277 IEAHQVLVTTLLDTLTNSADADELMQNWARISEHFDTLFTTEESIDQLKQTILQLAVMGK 336 Query: 211 LNPDVKMKDSGIEWV-------------------------------GLVPDHWEVKPFFA 239 L P + E + +P WE Sbjct: 337 LVPQDPSDEPAAELLKRIAEEKAQLVKEKKIKKQKALPPIAEDEKPFELPSGWEWCRLDD 396 Query: 240 LVTELNR-----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ------IVDP 288 + + K I L NI ++ + + ++ ++ P Sbjct: 397 ICFGITSGSTPPKVNFNESEGIPYLKVYNIREQKIDFEYKPQFVDNDCHKTKLARSVLYP 456 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G++V + K ++ E + + Y+ + + Sbjct: 457 GDVVMNIVGPPLGKIAIIPDTYPEWNCNQAITFFRPIVPQLNKYIYTYLTAGSFLDSIEL 516 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +G+ + ++ + + + PP++EQ I N ++ + L ++ + +E + Sbjct: 517 IGTAGQDNISVTKSRSILLPTPPLREQKRIVNKVHELFLLCNSLKMRLRKR----QELK 571 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 68/201 (33%), Gaps = 14/201 (6%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W+ + + +G T + I Y+ + ++ + K Sbjct: 384 ELPSGWEWCRLDDICFGITSGSTPPKVNFNESEGIPYLKVYNIREQKIDFEYKPQFVDND 443 Query: 74 DTSTV---SIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPK-DVLPELLQG 126 T S+ G ++ +GP L K I + C+ +P L + + Sbjct: 444 CHKTKLARSVLYPGDVVMNIVGPPLGKIAIIPDTYPEWNCNQAITFFRPIVPQLNKYIYT 503 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L + IE I A + +I +P PPL EQ I K+ + ++L Sbjct: 504 YLTAGSFLDSIELIGT-AGQDNISVTKSRSILLPTPPLREQKRIVNKVHELFLLCNSLKM 562 Query: 187 ERIRFIELLKEKKQALVSYIV 207 + EL +V V Sbjct: 563 RLRKRQELKLCITDTIVEQAV 583 >gi|269978368|gb|ACZ55918.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 420 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 57/411 (13%), Positives = 116/411 (28%), Gaps = 36/411 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + G++ K + + + + G + +R + Sbjct: 13 PKGVEFRKLGEVCDFQKGKSITK-KAVTFGKVPVISGGRQPAYYHNEANRSGE------- 64 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G Y D + F V PK + I A Sbjct: 65 ---TIAISSSGVYAGYVSYWDIPVFLADSFSVS-PKQKTLMPKYLFHYLTTQQDAIHATK 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + H K + N +PIPPL Q I + + A T L ++ + E Q Sbjct: 121 STGGIPHVYSKDLQNFLIPIPPLEIQQEIVKILDAFTELNTEL-NTELKARKKQYEYYQN 179 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVG-----LVPDHWEVKPFFALVTELNRKNTKLIESNI 256 ++ N S + + L P E K + N Sbjct: 180 MLLDFNDINQNHKDAKIKSYPKRLKTLLQTLAPKGVEFKTLEEVFEIKNGYTPSKNNPEF 239 Query: 257 LSLSYGNIIQKLETRNMG---------LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + R G + P++ + ++ I+ + L Sbjct: 240 WKNGTIPWFRMEDIRENGRILKDSIQHITPKALKGKKLFPKNSIIISTTATIGEHALLIV 299 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRL 365 + + +++ K + + + + L + + S+ K+ Sbjct: 300 DSLANQRFT---FLSKKANCDLALDMKFFFYQCFLLGEWCKNNINVSGFASVDMTAFKKY 356 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +PP++ Q +I +++ A L+ I I K+ R + Sbjct: 357 KFPIPPLEIQQEIVKILDQFLALTTDLLAGIPAEIKARKKQYEYYREKLLT 407 >gi|148988314|ref|ZP_01819761.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP6-BS73] gi|147925995|gb|EDK77069.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP6-BS73] Length = 352 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 50/392 (12%), Positives = 115/392 (29%), Gaps = 44/392 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + L+ Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKLQLDELNLLV-------- 170 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 K E G V + + L +N K + + Sbjct: 171 --------------KSRFNEMFGDVILNEKEWKVSKWNEILTIRNGKNQKQVEDADGKFP 216 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I Y IV ++ N +R + Sbjct: 217 IYGSGGI-------MGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEP 266 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I+S YL + + Y+ K+ A+ SL D+ + + +PP+ Q + + + Sbjct: 267 VLEKINSEYLFYFCQLYNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFVV 323 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++D I++S+ L+ + S + Sbjct: 324 ----QVDKSQLAIQKSLEELETLKKSLMQEYF 351 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 186 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 233 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 234 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 290 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 291 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 321 >gi|269967980|ref|ZP_06182019.1| hypothetical protein VMC_34490 [Vibrio alginolyticus 40B] gi|269827416|gb|EEZ81711.1| hypothetical protein VMC_34490 [Vibrio alginolyticus 40B] Length = 421 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 55/393 (13%), Positives = 127/393 (32%), Gaps = 26/393 (6%) Query: 38 TGRTSESGKDI------IYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAKGQILYGK 90 G S D+ +++ +V ++ K + + S A I+ Sbjct: 35 RGHNYPSTGDLKEQGHTLFLSASNVTKRGFEFNSKQYITLEKSQSMGNGKLALNDIVLTS 94 Query: 91 LGPYLRKAIIAD-------FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G A + + I S ++ + P ++ +L S ++I+ I G Sbjct: 95 RGSIGHIAWYDEIVKQKVPYARINSGMLILRSNDSMCPSIVSQYLKSPIGAKKIDLISFG 154 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + + IP + + + ++DTLI + + + L K+A++ Sbjct: 155 SAQPQLTKASVSKLKITIPENKTEQYLVG---SYFQKLDTLINQHQQKHDKLSNLKKAML 211 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + K ++ G G + K++ + + YG Sbjct: 212 EKMFPKAGETVPAIRFDGFS--GDWQSKTLGSVASFHKGKGLPKSSIQDDGVYSCIHYGE 269 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII-TSAYMA 322 + K + + + V K ++ V + G++ + Sbjct: 270 LFTKYSEVIEMVTGRTNQNDNFFSVSNDVLMPTSDVTPKGLVKPCCVKQSGVVLGGDILV 329 Query: 323 VKPHGIDSTYLAWL--MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 ++P+ + A+L +V + L ++ L + I EQ I N Sbjct: 330 IRPNDQNLIDGAFLSRFIRTREQQVLQNVTGSTVFHLYASSIENLDIAFCSIDEQKAIAN 389 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++D+L+ + Q I LK + + + Sbjct: 390 Y----FQKLDLLISQNNQQITKLKNIKQACLDK 418 Score = 37.9 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 57/192 (29%), Gaps = 12/192 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG--KYLPKDGNSRQSDTSTVSIF 81 W+ + + G+ + G KY + F Sbjct: 233 DWQSKTLGSVASFHKGKGLPKSSIQDDGVYSCIHYGELFTKYSEVIEMVTGRTNQNDNFF 292 Query: 82 A-KGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + +L ++ + + LV++P D I ++ Sbjct: 293 SVSNDVLMPTSDVTPKGLVKPCCVKQSGVVLGGDILVIRPNDQNLIDGAFLSRFIRTREQ 352 Query: 137 I-EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 G+T+ H I N+ + + EQ I ++D LI++ + I L Sbjct: 353 QVLQNVTGSTVFHLYASSIENLDIAFCSIDEQKAIANY----FQKLDLLISQNNQQITKL 408 Query: 196 KEKKQALVSYIV 207 K KQA + + Sbjct: 409 KNIKQACLDKMF 420 >gi|183600210|ref|ZP_02961703.1| hypothetical protein PROSTU_03754 [Providencia stuartii ATCC 25827] gi|188022507|gb|EDU60547.1| hypothetical protein PROSTU_03754 [Providencia stuartii ATCC 25827] Length = 368 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 46/401 (11%), Positives = 114/401 (28%), Gaps = 45/401 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + L+ G+ ++ ++ +P ++ + + Sbjct: 2 SEWQNTTLGDVITLHYGKALKT------------QNRIVGNIPVYSSAGITGYHNEPLVM 49 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 I+ G+ G + + T + VL + + + T +E + E Sbjct: 50 SKGIIIGRKGTVGKVYYSPEPFWCIDTAYYVLPNETKYDFIWLYYQ---LGTIGLEELNE 106 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + + + IP +AEQ I + + +ID L + + Sbjct: 107 DSAVPGLNRTTAYSQDILIPSIAEQKAIASVLSSLDDKIDLLHRQNKTLESM-------- 158 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + +E H +K FA + K + E I + + Sbjct: 159 ----------AETLFRQWFVEEAQDDWVHGTLKDEFAFTMGQSPKGSSFNEEQIGTPMFQ 208 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + T ++ + I A Sbjct: 209 GNADFGFRFPKERVYTTEPTRFAQKLDTLI------SVRAPVGAQNMARSKCCIGRGVAA 262 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + Y + L + S+ D +++ V++PP I + Sbjct: 263 FRHINNPDWYTYTYFKLRCLMDEIKKFNDEGTVFGSISKSDFEKIEVIIPPAS----IIH 318 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + ++ V I L++ R + + ++G++ + Sbjct: 319 NYEIMVKPLNDRVITNCFQIEKLEKLRDTLLPKLMSGEVRV 359 >gi|157159064|ref|YP_001463945.1| putative type I restriction-modification system, S subunit [Escherichia coli E24377A] gi|157081094|gb|ABV20802.1| putative type I restriction-modification system, S subunit [Escherichia coli E24377A] Length = 373 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 50/400 (12%), Positives = 134/400 (33%), Gaps = 44/400 (11%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W + LN G+ + +D +G+ G + ++ + Sbjct: 4 WIKTKLGEIVILNYGKA---------LKAQDRNAGSIPVYSSGGLT---GWHNKALINEQ 51 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I+ G+ G + + T + +L +L + T +E + E + Sbjct: 52 GIIIGRKGTVGKAYLTYGPFWCIDTAYYILPNPSKYD---FVFLFYLLKTLGLEELNEDS 108 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + + +P L EQ I + + +ID L + + + + Sbjct: 109 AVPGLNRDTAYSQEILLPSLPEQKTIASVLSSLDDKIDLLHRQNKTLESMAETLFR---- 164 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + + S + + P +K A ++ +T ++ + G Sbjct: 165 ---QWFILDSTGVSVSIDQIIDFNPKRTLIKSQDATYLDMAGLST------VIFRANGYY 215 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + K ++ + E G ++ ++ ++ Sbjct: 216 RRPFSSGTKFTKRDTLLAR---------ITPCLENGKAAYIDFLDDNETGWGSTEFIVMR 266 Query: 325 PHGIDSTYLAWLM-RSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 P +++++M R+ D + + GS RQ + + +K+ V +P I + Sbjct: 267 PKKEIHPFISYIMCRNPDFKEYAESCMEGSTGRQRVNLDHLKKFNVNLPTEASLRIINEL 326 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ ++ L+ + I L++ R + + ++G++ + Sbjct: 327 LDSFESK---LINN-SKQIDSLEKLRDTLLPKLMSGEVRV 362 >gi|237729543|ref|ZP_04560024.1| type I restriction-modification system specificity subunit [Citrobacter sp. 30_2] gi|226908149|gb|EEH94067.1| type I restriction-modification system specificity subunit [Citrobacter sp. 30_2] Length = 410 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 53/424 (12%), Positives = 134/424 (31%), Gaps = 42/424 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W ++ L GR+ +I +V + + + + Sbjct: 2 SEWVNRKLREVGTLERGRSRHRPRYAFHLYNGPYPFIQTGEVRAASKYINSYENTYSEDG 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ KG + + + + I DFD L P + + Sbjct: 62 LKQSKLWPKGTLCIT-IAANIAELAILDFDACFPDSVLGFLPDTTKTSVDFVFYTLRHYQ 120 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + ++ I EG+ + + NI P PP++EQ I + A +I+ L + + Sbjct: 121 KTLKHIGEGSVQDNINLGTFENIEFPFPPISEQKAIASVLSALDDKINLLHRQNKTLESM 180 Query: 195 LKE-KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + +Q + + A + Sbjct: 181 AETLFRQWFIEEAQAD-----------------WEITTLDCHITVAKGLSYKGAGLTTSD 223 Query: 254 SNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + I S ++++ ++ G+K ++ I+ G+I+ + ++ R + ++ Sbjct: 224 NGIPLFSLNSVLEGGGYKSAGIKYYNGDFKERHIIKHGDIIVANTEQGHEYRLIGYPAII 283 Query: 312 ER-----GIITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDV 362 I T V + + ++ +L+ S D+ + A +G L + + Sbjct: 284 PTTKSKLSIYTHHLFKVSINDDSYLTNYFMYYLLCSKDMHEQVVAATNGSTVNQLSADGL 343 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +R +PP + + + I ++ R + + ++G++ ++ Sbjct: 344 QRPEFKLPP----ECMVKKFTTQITSFWEKISINNSQIKNIESLRDTLLPKLLSGEVRVK 399 Query: 423 GESQ 426 + Sbjct: 400 YAEE 403 >gi|85716965|ref|ZP_01047929.1| type I restriction-modification system, endonuclease S subunit [Nitrobacter sp. Nb-311A] gi|85696244|gb|EAQ34138.1| type I restriction-modification system, endonuclease S subunit [Nitrobacter sp. Nb-311A] Length = 402 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 67/388 (17%), Positives = 131/388 (33%), Gaps = 21/388 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W V + R + Y+GLE ++ + + + ST F Sbjct: 12 GWTRVRFDQIATQINERVDNPAEAGVERYVGLEHLDPDSLRI--RRWGEPTDVESTKLRF 69 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPELLQGWLLSIDVTQRIEA 139 G I++GK Y RK +ADF+GICS +VL+ K VLP+ L ++ S +R + Sbjct: 70 QPGDIIFGKRRVYQRKVAVADFEGICSAHAMVLRAKPGAVLPDFLPFFMQSDLFMERALS 129 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I G+ +W + +PP+ EQ + E + A + + + Sbjct: 130 ISVGSLSPTINWTALAAEEFLLPPIREQSRLVEALSAADKL----AEVQHDLLTRSESVF 185 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--TKLIESNIL 257 +AL + +G P + + + +R + Sbjct: 186 KALFKERIGRGFKPADYQRWEEDDEPNMCFVRLSEVASVDRGRFSHRPRNLPQFFGGPYP 245 Query: 258 SLSYGNI---IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 G++ + + + L E + + PG I + + E Sbjct: 246 FAQTGDVAAARGRDFSASQFLSDEGVQYGKSFPPGTIFLTIAAVIAA----TAISTTETY 301 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 S V +D YL + +R F ++++ E ++ L + P ++ Sbjct: 302 CTDSVVGIVPKDPLDVDYLEYTLRFTRPYLEFEVATQTAQKNINLEVLRPLTIPWPSKED 361 Query: 375 QFDITNVINVETARIDVLVEK--IEQSI 400 + I + + I + + + I Sbjct: 362 RDAIAKELAAAESAIRTIEARQAATKKI 389 >gi|160894141|ref|ZP_02074919.1| hypothetical protein CLOL250_01695 [Clostridium sp. L2-50] gi|156864174|gb|EDO57605.1| hypothetical protein CLOL250_01695 [Clostridium sp. L2-50] Length = 372 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 56/396 (14%), Positives = 117/396 (29%), Gaps = 33/396 (8%) Query: 30 IKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + + G+ + D YI E++ + F G IL Sbjct: 5 LADICEYAKGKVDVAILDADTYISTENMMPNKRGITSATSLPT---VAQTQAFLAGDILV 61 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGATMS 147 + PY +K A+F+G CS LV + K+ + + ++L+ D + +G M Sbjct: 62 SNIRPYFKKIWFAEFNGGCSNDVLVFRAKNGVSKRFLYYVLANDTFFDYSMSTSKGTKMP 121 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 D I +P +Q I + A + I L+++ QA+ + Sbjct: 122 RGDKAAIMKYDVPDFTYEKQEKIAGILDALDKK----IQLNTEINNNLEQQAQAIYQQMF 177 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + W + ++ N + Sbjct: 178 -----------------IDNARSDWAEGTLSDIADITIGQSPSGSSYNEDGTGTIFFQGR 220 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 E G + S Y + I A+ Sbjct: 221 AEF---GFRFPSVRLYTTEPKRMARSNDTLMSVRAPVGDLNVAHTDCCIGRGLAAIHSKS 277 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 +++ + M S + + S+ + +P+L+P I + A Sbjct: 278 NHQSFVLYTMFSLKKQLDVFNGEGTVFGSINRNSLNDMPILIPSDD----ILDEFERIVA 333 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +D+ + I L++ R + + ++G++D+ Sbjct: 334 PMDLTIRNNYDEICRLQDIRDTLLPRLMSGELDVSD 369 Score = 42.5 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 45/182 (24%), Gaps = 2/182 (1%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W + + G++ G ++ + + R T + Sbjct: 183 SDWAEGTLSDIADITIGQSPSGSSYNEDGTGTIFFQGRAEFGFRFPSVRLYTTEPKRMAR 242 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 L P +A D + K + + + Q E Sbjct: 243 SNDTLMSVRAPV-GDLNVAHTDCCIGRGLAAIHSKS-NHQSFVLYTMFSLKKQLDVFNGE 300 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + + ++P+ IP + + I E R ++ L Sbjct: 301 GTVFGSINRNSLNDMPILIPSDDILDEFERIVAPMDLTIRNNYDEICRLQDIRDTLLPRL 360 Query: 203 VS 204 +S Sbjct: 361 MS 362 >gi|291288456|ref|YP_003505272.1| restriction modification system DNA specificity domain protein [Denitrovibrio acetiphilus DSM 12809] gi|290885616|gb|ADD69316.1| restriction modification system DNA specificity domain protein [Denitrovibrio acetiphilus DSM 12809] Length = 405 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 53/420 (12%), Positives = 119/420 (28%), Gaps = 39/420 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + L G+ + K G V G+ + Sbjct: 3 EWKEYKLADLANLRNGK-GLNNKFYTDFGKSGVWGANGQIASTNEVLNSDPV-------- 53 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I+ G++G Y +A+ + + + PK+ +LL + G Sbjct: 54 --IVIGRVGAYCGSIHMAEGNNWVTDNAIQATPKNDTDLNFLYYLLKSL---NVSRAATG 108 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + GIG + P Q + + + +I+ E+ + ++ Sbjct: 109 SAQPLITQSGIGVLECKAPSPKIQKEVASILSSLDDKIELNRKMNETLEEMARAIFKSWF 168 Query: 204 S-----YIVTKGLNPDVKMKDSG----------IEWVGLVPDHWEVKPFFALVTELNRKN 248 + +G P + + +P WEVK + L Sbjct: 169 VDFDPVHAKARGEEPSGMPDEIASLFPSEFVHSEQLNNPIPKGWEVKSLGDVFEALGGGT 228 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-------FIDLQND 301 E + + N+ + + G + L + Sbjct: 229 PSTKEPEYWVNGIYHWATPKDLSNLNEPIILTTERMLTEKGLNKISSGLLPKGTVLLSSR 288 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 A + ++A+K + S Y + ++ + + ++ Sbjct: 289 APIGYVAISETPIAVNQGFIAIKENETFSKYFIYFWCKENIELIIANANGSTFLEISKKN 348 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + P + E I + I L++K L + R S + ++G+I++ Sbjct: 349 FRNINSVFP-VDE--KIISEFTSIVEPIFQLIQKNIIEKNTLTDLRDSLLPRLISGEIEV 405 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 59/195 (30%), Gaps = 13/195 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG----------LEDVESGTGKYLPKDGNS 70 IPK W+V + + G T + + ++ L ++ + Sbjct: 208 IPKGWEVKSLGDVFEALGGGTPSTKEPEYWVNGIYHWATPKDLSNLNEPIILTTERMLTE 267 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + + + KG +L P I++ + F+ ++ + + + Sbjct: 268 KGLNKISSGLLPKGTVLLSSRAPI-GYVAISETPIAVNQGFIAIKENETFSK-YFIYFWC 325 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERI 189 + + I A G+T K NI P + I I E+ Sbjct: 326 KENIELIIANANGSTFLEISKKNFRNINSVFPVDEKIISEFTSIVEPIFQLIQKNIIEKN 385 Query: 190 RFIELLKEKKQALVS 204 +L L+S Sbjct: 386 TLTDLRDSLLPRLIS 400 >gi|261378712|ref|ZP_05983285.1| type I restriction-modification system specificity subunit [Neisseria cinerea ATCC 14685] gi|269144866|gb|EEZ71284.1| type I restriction-modification system specificity subunit [Neisseria cinerea ATCC 14685] Length = 413 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 56/420 (13%), Positives = 127/420 (30%), Gaps = 42/420 (10%) Query: 27 VVPIKRFT-KLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + + +++G T D I ++ E + + + + V Sbjct: 3 EKRLIDISRNISSGITPLRSNDEFWTDGTIPWLKTEQLGEKYIFDTNEHITEKALQEANV 62 Query: 79 SIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 IF + + G I + ++ + + + Sbjct: 63 KIFPENTLSIAMYGEGKTRGNVSILKRPMATNQACCNIELDEGKVSSEYVYYFLKTQYEN 122 Query: 137 IEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + G + + I N + +P L Q I + +D I + L Sbjct: 123 LRGLSSG-IRKNLNTNDIKNFVVRLPKNLKTQQSIAAVL----SALDKKIALNKQINARL 177 Query: 196 KEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELNR 246 +E + L Y + PD K SG E V +P WEVK + + Sbjct: 178 EEMAKTLYDYWFVQFDFPDANGKPYKSSGGEMVFDETLKREIPKGWEVKSLNQVADIVMG 237 Query: 247 KNTKLIESNILSLSYGNI--IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 ++ N+ + R ++ + + G+I+ D Sbjct: 238 QSPDGASYNLEQEGTIFFQGSTDFDWRFPNVRQYTTSPTRFAQKGDILLSVRAPVGDL-- 295 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 I A++ ++++L ++M+ + S+ +D+ Sbjct: 296 ---NIAPFECCIGRGLAALRSKSGNNSFLFYVMKYFKTVFERRNTEGTTFGSITKDDLHS 352 Query: 365 LPVLVPP---IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L ++ P +++ +I ++ D ++ Q L + R + + GQI + Sbjct: 353 LKLVAPADNVLEKYNEIA-------SKYDEMIFIRSQQNHQLTQLRDFLLPMLMNGQISV 405 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 63/201 (31%), Gaps = 8/201 (3%) Query: 10 YKDSGVQWI------GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + + IPK W+V + + + G++ + + G+ + Sbjct: 202 YKSSGGEMVFDETLKREIPKGWEVKSLNQVADIVMGQSPDGASYNLEQEGTIFFQGSTDF 261 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + N RQ TS KG IL P IA F+ L+ K Sbjct: 262 DWRFPNVRQYTTSPTRFAQKGDILLSVRAPV-GDLNIAPFECCIGRGLAALRSKSGNNSF 320 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L +++ T EG T + ++ + P E I Sbjct: 321 LF-YVMKYFKTVFERRNTEGTTFGSITKDDLHSLKLVAPADNVLEKYNEIASKYDEMIFI 379 Query: 184 LITERIRFIELLKEKKQALVS 204 + + +L L++ Sbjct: 380 RSQQNHQLTQLRDFLLPMLMN 400 >gi|322656670|gb|EFY52958.1| Type I restriction enzyme EcoAI specificity protein (S protein) [Salmonella enterica subsp. enterica serovar Montevideo str. CASC_09SCPH15965] Length = 554 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 63/473 (13%), Positives = 121/473 (25%), Gaps = 99/473 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLE 54 +K K P+ S + +P W+ V G+T KD I ++ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPVGWEWVTFSHLGHFFGGKTPSKMKDEYWGGTIPWVTPK 140 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQF 111 D+++ + ++ + G IL+ LR I + + Sbjct: 141 DMKTNLIVDSEDKVTPLAIE-DGLTKVSPGSILFVARSGILRRIFPVAITSIECTVNQDL 199 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQV-- 168 VL P +++ +E + G T+ + + P IPP AEQ Sbjct: 200 KVLSPFLSEISYYIRLMMNGFERYIVENLTKTGTTVESLLFDDFISHPFMIPPFAEQNRI 259 Query: 169 ---------------------------------------LIREKIIAETVRIDTLITERI 189 + + RI Sbjct: 260 LSTVKKLMSLCDQLEQHSLTSLDAHQQLVETLLTTLTDSQNADALAENWARISEHFDTLF 319 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------------ 219 + KQ ++ V L P + Sbjct: 320 TTEASIDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKDGKIKKQKPLPP 379 Query: 220 -SGIEWVGLVPDHWEVKP-------FFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 S E VP+ WE + + I ++ G+I + Sbjct: 380 ISDKEKPFEVPEGWEWCKFGLISEFINGDRGSNYPNKNEYVVHGIPWINTGHIEKNGTLS 439 Query: 272 NMGLKPESYETYQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + + + G++V+ K + I +S + Sbjct: 440 ITDMNFITEKKFNELRSGKIQSGDLVYCLRGATFGKTAFVKPYESG-AIASSLMIIRPFI 498 Query: 327 GIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 Y+ + S + + +L V PP++EQF I Sbjct: 499 REMGEYIYNYLISPFGRSQIFRFDNGSAQPNLSANSVMLYAFACPPLQEQFRI 551 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 62/196 (31%), Gaps = 10/196 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P WE F L K ++ + + K N+ + E Sbjct: 93 SEEEKPFELPVGWEWVTFSHLGHFFGGKTPSKMKDEYWGGTIPWVTPKDMKTNLIVDSED 152 Query: 280 -------YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + V PG I+F +R A + + P + +Y Sbjct: 153 KVTPLAIEDGLTKVSPGSILFVARSGIL-RRIFPVAITSIECTVNQDLKVLSPFLSEISY 211 Query: 333 LAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 LM + + +SL F+D P ++PP EQ I + + + D Sbjct: 212 YIRLMMNGFERYIVENLTKTGTTVESLLFDDFISHPFMIPPFAEQNRILSTVKKLMSLCD 271 Query: 391 VLVEKIEQSIVLLKER 406 L + S+ ++ Sbjct: 272 QLEQHSLTSLDAHQQL 287 >gi|149196780|ref|ZP_01873833.1| Type I restriction-modification system specificity subunit [Lentisphaera araneosa HTCC2155] gi|149139890|gb|EDM28290.1| Type I restriction-modification system specificity subunit [Lentisphaera araneosa HTCC2155] Length = 405 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 59/408 (14%), Positives = 129/408 (31%), Gaps = 43/408 (10%) Query: 25 WKVVPIKRFTKLN---TGRTSESGKDI------IYIGLEDV-ESGTGKYLPKDGNSRQSD 74 WK P+ + G SG D+ +++ +V +SG + ++SD Sbjct: 19 WKESPLMEVADIIDGDRGSNYPSGDDLNTSGHTLFLNASNVTKSGFIFNTNQYIIKKKSD 78 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDG-------ICSTQFLVLQPKDVLPELLQGW 127 + + I+ G A + I S ++ + P + Sbjct: 79 AMGNGMLSLDDIIITSRGSVGNVAWYSGEIHQEIPFARINSGMLIIRCKNMLTPTFITCL 138 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLIT 186 L+S ++I I G+ K + + P EQ I + I+ Sbjct: 139 LMSPLGRRQISTITFGSAQPQLTKKDVSIFTVSFPVDKQEQAKIGKYFQQVDKLINNHQE 198 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + + K + + P+++ K +W D K + V + Sbjct: 199 KHKKLQNIKKAMLKKMFPQAGQS--VPEIRFKGFSGDWEFQTLDEVATKHDNSRVPITAK 256 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + + ++ GE V D ND ++ Sbjct: 257 DRIAGVTPYYGANGIQDYVEGFT-----------------HEGEYVLLAEDGANDLKNYP 299 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 V + + + ++ ++ L +L + + + G R L + L Sbjct: 300 INYVTGKIWVNNHAHVLQGKNYKTSTL-YLKYAISQIDIEPFLVGGGRAKLNASVMMNLG 358 Query: 367 VLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +P I+EQ I + +D L+ + +Q I L+ + + ++ Sbjct: 359 LSLPEKIQEQEKIGSY----FKSLDNLISQHDQQIQKLQNIKQACLSK 402 >gi|16415962|emb|CAC85954.1| AloI restriction modification enzyme [Acinetobacter lwoffii] Length = 1262 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 48/396 (12%), Positives = 115/396 (29%), Gaps = 42/396 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W V + G+ ++ G V G+ + + Sbjct: 903 WPQVKVGSICSFEYGK--PLPEENRVSGPYPVMGSNGRV----------GYHSEYLIKGP 950 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I+ G+ G + + T F + + +L + G Sbjct: 951 AIIIGRKGSAGQVVWEEEDCYPIDTTFYAKTLTSDIDKYFLFHVLKELDLGH---LQGGV 1007 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + +PMP+PP+ Q + + + + + + +L S Sbjct: 1008 GVPGLNRNEAHELPMPLPPIKVQEQMVVDFKKIDADVASAAALVSDSLSRINSEVDSLYS 1067 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 V + IE + + + + + G + Sbjct: 1068 SGVGR----------ISIEEISTNVQYGLNEKMNETGIG-------YKTFRMNEVIDGRM 1110 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + + + + + YQ ++ G+++F + + ++ ++Y+ Sbjct: 1111 VDNGKMKRANISAKEFSKYQ-LNKGDLLFIRSNGSLEHIGRFGLFDLDGEYCYASYLVRI 1169 Query: 325 PHGIDSTYLAWL---MRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 +L M S L K ++ SG ++ +K + V VP + EQ Sbjct: 1170 VADTSKIRPYYLAIIMNSAALRKEVVSLAVKSGGTNNINATKMKSIKVPVPSLDEQAKFI 1229 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I + V + +I R+S+ + + Sbjct: 1230 AKIE----LLQKQVADAQATIDSAAARKSTVMKKYL 1261 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 21/146 (14%), Positives = 39/146 (26%), Gaps = 4/146 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K ++ +I S YG + + + ++ + K S Sbjct: 901 SKWPQVKVGSICSFEYGKPLPEENRVSGPYPVMGSNGRVGYHSEYLIKGPAIIIGRKGSA 960 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 E +L + + G L + L Sbjct: 961 GQVVWEEEDCYPIDTTFYAKTLTSDIDKYFLFHVLKELDLGHLQGGVGVPGLNRNEAHEL 1020 Query: 366 PVLVPPIKEQFDITNVINVETARIDV 391 P+ +PPIK Q + V+ +ID Sbjct: 1021 PMPLPPIKVQEQMV----VDFKKIDA 1042 >gi|328947490|ref|YP_004364827.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328447814|gb|AEB13530.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 493 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 37/253 (14%), Positives = 83/253 (32%), Gaps = 15/253 (5%) Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 ++ I +LL+E ++ +S+ + + E +P+ W Sbjct: 18 KLVPQIASEGNARDLLEEIRKEKLSHGLDFANAKSNPCDITEEEIPFDIPESWCWCRLGE 77 Query: 240 ---LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP----ESYETYQIVDPGEIV 292 V K + + + + YG + + + K + +E + +I+ Sbjct: 78 LGNFVRGSGIKRDETTNTGLPCVRYGEMYTTYKIKFSKTKSFTSKDVFEKCHKIHTNDIL 137 Query: 293 FRFIDLQNDKRSLR-SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +L + + E + P +S +L +L+ S + + Sbjct: 138 MALTGENKWDIALAATYEGTEEIAMGGDLCKFTPINCNSLFLVYLINSPYGIEYKRNTST 197 Query: 352 G-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----- 405 G + + L + +PP+ EQ I I I+ K E + + E Sbjct: 198 GDIIVHTSTTKLGNLLIPLPPLAEQRRIVAAIEKFMPLIEEY-GKKETQLKAINEKIGTL 256 Query: 406 RRSSFIAAAVTGQ 418 + + + AV G+ Sbjct: 257 TKKAILQEAVQGK 269 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 58/427 (13%), Positives = 121/427 (28%), Gaps = 54/427 (12%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGT-GKYLPKDGNSRQSD 74 IP+ W + G + + + + ++ + K+ + + Sbjct: 65 DIPESWCWCRLGELGNFVRGSGIKRDETTNTGLPCVRYGEMYTTYKIKFSKTKSFTSKDV 124 Query: 75 TSTVSIFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 IL G L + P + L + Sbjct: 125 FEKCHKIHTNDILMALTGENKWDIALAATYEGTEEIAMGGDLCKFTPINCNSLFLVYLIN 184 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + G + H +GN+ +P+PPLAEQ I I I+ + Sbjct: 185 SPYGIEYKRNTSTGDIIVHTSTTKLGNLLIPLPPLAEQRRIVAAIEKFMPLIEEYGKKET 244 Query: 190 RFIELLKE----KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + ++ K+A++ V L P + + + + + + F + Sbjct: 245 QLKAINEKIGTLTKKAILQEAVQGKLVPQIAAEGNARDLLEEIRKEKLSHGFANSYGICS 304 Query: 246 RKNTKLIESNILSLSYGNIIQKL--ETRNMGLKPESYETYQIVDPGEIVF---------- 293 K K S++ S S + +K E + + E + GEI Sbjct: 305 EKGKKSKSSDLRSKSQIRVTKKELPEITEDEIPFDIPENWCWCRLGEICKLIDGEKVKEV 364 Query: 294 ---------------RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA---- 334 I + ++ ++ G + K GI + Sbjct: 365 KLPLLDAKYLRGKKDATIVSEGKVANVNDLLILVDGENSGEVFVNKEKGIMGSTFKQLCI 424 Query: 335 -------WLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ++++ ++ K + L + L + +PP+ EQ I I Sbjct: 425 CEKLYLPYILKFIEMHKELLRNSKKGAAIPHLNKDIFFGLLLPLPPLSEQKRIVAAIEKM 484 Query: 386 TARIDVL 392 + L Sbjct: 485 LPLCERL 491 >gi|315171543|gb|EFU15560.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX1342] Length = 407 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 49/400 (12%), Positives = 123/400 (30%), Gaps = 24/400 (6%) Query: 23 KHWKVVPIKRFTKLNT----GRTSES--GKDIIYIGLEDVESGTGK-YLPKDGNSRQSDT 75 + W++ + K G + + Y+ +++ G N + Sbjct: 18 EDWELCKLNNIYKKIRNAFVGTATPYYVKEGNFYLESNNIKDGNINQNTKVFINDEFYEK 77 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ---PKDVLPELLQGWLLSID 132 I+ + G A+I + L++ ++ P L L+ Sbjct: 78 QKDKWLETEDIVMVQSGHVGHTAVIPKELNNTAAHALIMFQERKRETNPYFLNYQFLTDT 137 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++++ I G T+ H + + + + AE+ LI + ++D I R + Sbjct: 138 SKRKLDMITTGNTIKHILASEMKSFEVFVCESAEENLISDF----FRKLDDTIGLHQRKL 193 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + LKE K+A + + +M+ + E + + + + Sbjct: 194 DQLKELKKAYLQVMFPAKDETVPRMRFAYFEGEWEL------CKLGDFLIVPPKIKATID 247 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + L N+ + Y G+ ++ + N ++ ++ Sbjct: 248 NPSDLMTVKLNLGGVYSGASRDTLSLGSTIYYKRFSGQFIYGKQNFFNGSMAIIPKELHG 307 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + ++ R + + + ++ +LVP Sbjct: 308 KATSGDVPSFDIININKDYLFYFISRKSYWKSKEVEATGTGSKRIHEKTLQNFSILVPLK 367 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ I + +ID + + + LK + S++ Sbjct: 368 DEQIRI----STFCEKIDDTITLHQNKLNQLKSLKKSYLQ 403 >gi|282883099|ref|ZP_06291699.1| putative type-1 restriction enzyme specificity protein [Peptoniphilus lacrimalis 315-B] gi|281297076|gb|EFA89572.1| putative type-1 restriction enzyme specificity protein [Peptoniphilus lacrimalis 315-B] Length = 397 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 45/389 (11%), Positives = 117/389 (30%), Gaps = 27/389 (6%) Query: 26 KVVPIKRFTK---LNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQ--SDTSTVS 79 + + T + I YI ++++ G + S+ ++ S Sbjct: 14 EWKKLGEICIDKFWVMPTTPKFIQNGIPYITGKNIKDGKIDFDNVKYISQDDYNNISKNR 73 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL--VLQPKDVLPELLQGWLLSIDVTQRI 137 K IL +G ++ + L K +L + + + Q + Sbjct: 74 DILKNDILVSMIGTIGEIGLVCNSIKFYGQNLYLLRLNKKIILNKFFYHYFSQNKIKQGL 133 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + ++ + + + +PIP L Q I + + T ++ L E ++ + Sbjct: 134 ISKKNSSSQGYIRAGQLEYLEIPIPSLETQEKIVDILDKFTNYVNELQAELQAELQARNK 193 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + + L+ + K S E + + T K + + Sbjct: 194 QYEYYRDML----LSEEYLNKRSS-ELFIKNNNSITKCKLKDIATITRGKRLVRSDLKEI 248 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + S + ++ G + E Sbjct: 249 GKFPVFQNSLKPLGYYYDRNFSGDKACVISAG-------------AAGVIFYREEDFWAA 295 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + I + ++ + + S + + L ++V+ L +LVP ++ Q Sbjct: 296 DDVLVINSDRILNKFIYYFLLSNQ-RLIKTKVRKASVPRLSRDEVENLEILVPSMELQKI 354 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKER 406 I V++ + + + + I +++ Sbjct: 355 IVKVLDKFQSLVIDTKGLLPKEIEKRQKQ 383 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 16/187 (8%), Positives = 60/187 (32%), Gaps = 12/187 (6%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-----PESYETYQIVDPGEI 291 + K I++ I ++ NI + + + + +I Sbjct: 21 ICIDKFWVMPTTPKFIQNGIPYITGKNIKDGKIDFDNVKYISQDDYNNISKNRDILKNDI 80 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-VFYAMG 350 + I + + ++ + + I + + + + + Sbjct: 81 LVSMIGTIGEIGLVCNSIKFYGQ--NLYLLRLNKKIILNKFFYHYFSQNKIKQGLISKKN 138 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV----LLKER 406 S + ++ ++ L + +P ++ Q I ++++ T ++ L +++ + + Sbjct: 139 SSSQGYIRAGQLEYLEIPIPSLETQEKIVDILDKFTNYVNELQAELQAELQARNKQYEYY 198 Query: 407 RSSFIAA 413 R ++ Sbjct: 199 RDMLLSE 205 >gi|270296267|ref|ZP_06202467.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270273671|gb|EFA19533.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 454 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 58/424 (13%), Positives = 124/424 (29%), Gaps = 54/424 (12%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKY--LPKDGNSRQ 72 IP+ W+ V + + G +S + + L ++++ K+ P + Sbjct: 30 EIPQGWEWVRLGNIATIIGGYAYKSQDFINSSNNQVLRLGNIKNDFLKHNASPVYISDDL 89 Query: 73 SDTSTVSIFAKGQILYGKLGPYLR-------KAIIADFDGICST--QFLVLQPKDVLPEL 123 + + IL G + K D + + L +V + Sbjct: 90 ATKTDKFRCHLDDILITMTGTRKKRDYFFSYKVEQNDLNYFINQRVGILRFYISEVSMFM 149 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + + A + + I + +P+PPL+EQ+ I KI ++ Sbjct: 150 IYALKAENTLQNVFQYETGTANQGNLGAENIAKVYIPLPPLSEQLRIVSKIKELIPLVEA 209 Query: 184 LITERIRFIELLKE----KKQALVSYIVTK------------------------GLNPDV 215 + L ++++ + L + Sbjct: 210 YEQTQNELNTLNTSLNELLCKSILQEAIQGKLVLQVAEEGTAQELLERIRQEKLQLVKEG 269 Query: 216 KMKDSGI--EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 K+K S + + D+ + A E+ ++L L + E RN Sbjct: 270 KLKKSALTDSVIYKGDDNKYYERINAQTVEIELPFEYPNNWSVLRLKDICQLIDGEKRNG 329 Query: 274 GLKPES------YETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYMAVKP 325 + V+ G+ V+ ++ S V + G + S + + Sbjct: 330 KGICLDAKYLRGKSSATTVEKGKFVYAGDNIILVDGENSGEVFTVPQDGYMGSTFKQLWL 389 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + + L E LP+ +PP +EQ I IN Sbjct: 390 SSAMWKPYILAFILFYKEDLRNSKRGAAIPHLNKELFYNLPIGIPPYQEQQRIAKRINEL 449 Query: 386 TARI 389 + + Sbjct: 450 SQLL 453 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 74/220 (33%), Gaps = 19/220 (8%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTEL----NRKNTKLIESNILSLSYGNIIQKLETRNM 273 K E +P WE + T + + + SN L GNI N Sbjct: 21 KCIDEEIPFEIPQGWEWVRLGNIATIIGGYAYKSQDFINSSNNQVLRLGNIKNDFLKHNA 80 Query: 274 GLKPESYE-----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER----GIITSAYMAVK 324 S + +I+ + + S +V + I + Sbjct: 81 SPVYISDDLATKTDKFRCHLDDILITMTGTRKKRDYFFSYKVEQNDLNYFINQRVGILRF 140 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 S ++ + +++ + + + +G + +L E++ ++ + +PP+ EQ I + I Sbjct: 141 YISEVSMFMIYALKAENTLQNVFQYETGTANQGNLGAENIAKVYIPLPPLSEQLRIVSKI 200 Query: 383 NVETARIDVLVEKIEQSIVL---LKERR-SSFIAAAVTGQ 418 ++ + + L L E S + A+ G+ Sbjct: 201 KELIPLVEAYEQTQNELNTLNTSLNELLCKSILQEAIQGK 240 >gi|37680389|ref|NP_934998.1| type I restriction-modification system, endonuclease S subunit [Vibrio vulnificus YJ016] gi|37199136|dbj|BAC94969.1| type I restriction-modification system, endonuclease S subunit [Vibrio vulnificus YJ016] Length = 389 Score = 95.2 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 73/395 (18%), Positives = 141/395 (35%), Gaps = 30/395 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W++V K + R + +Y+GLE ++ + K K Sbjct: 5 QLPEGWQMVKFGDIAKHISKRVEPSETELEVYVGLEHLDPDSLKI--KRHGVPSDVAGQK 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQR 136 + KGQI++GK Y RK +AD+D ICS +V PK VLPE L ++ S +R Sbjct: 63 LLVKKGQIIFGKRRAYQRKVAVADWDCICSAHAMVLEANPKTVLPEFLPVFMQSGYFMER 122 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 AI EG+ WK + +PPL Q K+IA +I+ + + Sbjct: 123 AIAISEGSLSPTIKWKVLEQQKFSLPPLELQ----SKLIARLSKIENTYDLSCQVENAAR 178 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 +AL+ K S + + + + + + E + Sbjct: 179 SLYKALLFATFE---------KSSETKKLKSYIRNISSGKSISAASIP----ADVNEFGV 225 Query: 257 LSLSYGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 L +S N N +K + V +++ + + + Sbjct: 226 LKVSAVNNGSFNPGENKLVKGDKISLLKNHVMANDLLMSRANTAELVGDVCIVDKTSTKL 285 Query: 316 ITSA--YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVP 370 + + +L L+R L ++ SG +++ + + + V Sbjct: 286 FLPDKLWKIEPINEHYKLWLFHLLRFLKLNGTLASLSSGTSGSMKNISQKKLLEIDVG-- 343 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 ++ +I V+ ++ +++ + L KE Sbjct: 344 DSEKAQEIGEVLQSAFLCVESSSMRVKAILELYKE 378 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 69/184 (37%), Gaps = 8/184 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQI 285 +P+ W++ F + ++++ + + + L+ + G+ + + Sbjct: 5 QLPEGWQMVKFGDIAKHISKRVEPSETELEVYVGLEHLDPDSLKIKRHGVPSDVAGQKLL 64 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLC 343 V G+I+F K ++ I ++ M P + +L M+S Sbjct: 65 VKKGQIIFGKRRAYQRKVAVA----DWDCICSAHAMVLEANPKTVLPEFLPVFMQSGYFM 120 Query: 344 KVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + A+ G ++K++ +++ +PP++ Q + ++ D+ + + L Sbjct: 121 ERAIAISEGSLSPTIKWKVLEQQKFSLPPLELQSKLIARLSKIENTYDLSCQVENAARSL 180 Query: 403 LKER 406 K Sbjct: 181 YKAL 184 >gi|158522248|ref|YP_001530118.1| restriction modification system DNA specificity subunit [Desulfococcus oleovorans Hxd3] gi|158511074|gb|ABW68041.1| restriction modification system DNA specificity domain [Desulfococcus oleovorans Hxd3] Length = 412 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 63/399 (15%), Positives = 130/399 (32%), Gaps = 18/399 (4%) Query: 33 FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF--AKGQILYGK 90 + I ++ G + S ++ S G ++ + Sbjct: 10 IVDCEHKTAPTQAEGYPSIRTPNIGRGYFLLDGVNRVSEETYRSWTRRAEPKPGDLIMAR 69 Query: 91 LGPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 P A++ C Q + V P L L+ + I A+ G T+ Sbjct: 70 EAPVGNVAMVPAGLRPCLGQRTLLIRPMRSKVFPRYLAYLLIGDQIQNIIHAMTNGVTVP 129 Query: 148 HADWKGIG-NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 H + K + P+PPL Q I + A I+ + E Q L Sbjct: 130 HLNMKDVRSLPLPPLPPLPTQRKIAAILSAYDDLIENNLRRIKILE----EMAQNLYREW 185 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 K P + +G +P+ WEV + + +NR T +++ SL Sbjct: 186 FVKFRFPGWEKARFVDSPLGKIPEEWEVTTINKVTSYINRGVTPKYDASASSLVVNQKCI 245 Query: 267 KLETRNMGLKPESYETYQ---IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + N+ L + V G+I+ + R + + + + + V Sbjct: 246 RDRKLNLSLARQHKSRVMDDKYVVFGDILINSTGVGTLGRVAQVYEDLNDVTVDTHVSIV 305 Query: 324 KPHGIDSTYLAWL-MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 +P D L + + G+ + L+ + + +++PP+K + Sbjct: 306 RPSNGDGIDFLGLALIDLEPHFESLGAGATGQTELRRDRIGETEIVLPPVKMRKQF---- 361 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + LV + L+ R + ++G++D+ Sbjct: 362 SEKVTSLRKLVLNLAARNETLRRTRDLLLPKLISGEVDV 400 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 60/198 (30%), Gaps = 9/198 (4%) Query: 18 IGAIPKHWKVVPIKRFTK-LNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +G IP+ W+V I + T +N G T + + + + + + + Sbjct: 204 LGKIPEEWEVTTINKVTSYINRGVTPKYDASASSLVVNQKCIRDRKLNLSLARQHKSRVM 263 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLS 130 +F G IL G R A + + D T +++P + G L Sbjct: 264 DDKYVVF--GDILINSTGVGTLGRVAQVYEDLNDVTVDTHVSIVRPSNGDGIDFLGLALI 321 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + IG + +PP+ + EK+ + + L Sbjct: 322 DLEPHFESLGAGATGQTELRRDRIGETEIVLPPVKMRKQFSEKVTSLRKLVLNLAARNET 381 Query: 191 FIELLKEKKQALVSYIVT 208 L+S V Sbjct: 382 LRRTRDLLLPKLISGEVD 399 >gi|269978360|gb|ACZ55914.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 430 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 57/414 (13%), Positives = 127/414 (30%), Gaps = 32/414 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYI--GLEDVESGTG------KYLPKDGNSRQS 73 PK + + + G T + ++I + G++ + + + ++ Sbjct: 13 PKGVEFRKLGDIGEYIRGVTYKKNQEINNLECGIKVLRANNITLSNHLNFEDIKVINKNV 72 Query: 74 DTSTVSIFAKGQILY---GKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWL 128 K IL ++ K DFD + V++ ++V + Sbjct: 73 KIRKEQYLKKNDILICAGSGSSEHIGKVAFINTDFDYVFGGFMGVIRIREVNSRFVYHIF 132 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S Q +E T+++ + + N +PIPPL Q I + + A T L TE Sbjct: 133 TSNIFKQYLEKSLNTTTINNLNANILQNFLIPIPPLEIQQEIVKILDAFTELNTELNTEL 192 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKM------KDSGIEWVGLVPDHWEVKPFFALVT 242 + + + L+ + + D KM K L P E + ++ Sbjct: 193 KARKKQYEYYQNMLLDFNDINSTHKDAKMSAKPYPKRLKTLLQTLAPKGVEFRKLGEVLE 252 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + ++ +T +G E YQ ++ + Sbjct: 253 YDQPNKYCVTSKEFDKSYPTPVLTAGKTFILGYTNEKDNIYQASKSYPVII----FDDFT 308 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + + ++ + + + + + G Sbjct: 309 TATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFYMQTIPYNI-----GGEHARHWISRY 363 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +L V +PP++ Q +I +++ + L+ I I K+ R + Sbjct: 364 SQLEVPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQYEYYREKLLT 417 >gi|303250871|ref|ZP_07337064.1| hypothetical protein APP6_1996 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302650286|gb|EFL80449.1| hypothetical protein APP6_1996 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 417 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 52/415 (12%), Positives = 111/415 (26%), Gaps = 63/415 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---STVSIFAKGQILYGKLGPYLRK- 97 + I YI +D G + D S K I++ + G Sbjct: 5 YKGKNGIPYISPKDFYDKNGIDFANAKKVSEEDYFLLSKKFAPQKNDIIFPRYGTIGVVR 64 Query: 98 AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 I + + S ++ + + + + +L S I+ T + K I Sbjct: 65 VIEENIKLLVSYSCACIRVEYINMQYVVAYLNSELAKLEIKKYTNKTTQPNVGLKSIKKF 124 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ----------------- 200 +P+PPL EQ I KI I+ + + L ++ + Sbjct: 125 IIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQFPEQLKKSILQAAIQGKLTE 184 Query: 201 ---------ALVSYIVTKGLNPDVKMK--------------------------DSGIEWV 225 AL+ I + L P + K E Sbjct: 185 QNPNDEPASALIERIKAEKLRPIAEKKLKKPKVISEIIMRDNLPYEIVNGEERCIADEVP 244 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIE---SNILSLSYGNIIQKLET--RNMGLKPESY 280 +P+ W + + + L GNI ++ Sbjct: 245 FEIPESWVWVRLGEIGETNIGLTYNPSDVASDGTIVLRSGNIQDGKIDVSSDIVKVNLDI 304 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + +++ + + + + + + Y+ + + S Sbjct: 305 PENKRCYKNDLLICARNGSKKLVGKAAIIDKDGYSFGAFMAIFRSPF--NKYIYYYLSSP 362 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 F + + + ++ + +P + EQ I I + + L +K Sbjct: 363 LFRNDFDGINTTTINQITQSNLNNRLIPLPSLNEQLRIVEKIETLFSTLQNLSQK 417 Score = 83.7 bits (205), Expect = 6e-14, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 66/182 (36%), Gaps = 16/182 (8%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPGEIVFRFIDLQNDK 302 ++ I +S + K K S E Y ++ +I+F Sbjct: 4 EYKGKNGIPYISPKDFYDKNGIDFANAKKVSEEDYFLLSKKFAPQKNDIIFPRYGTIGVV 63 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFED 361 R + + +++ + ++ I+ Y+ + S + + ++ + Sbjct: 64 RVIEENI---KLLVSYSCACIRVEYINMQYVVAYLNSELAKLEIKKYTNKTTQPNVGLKS 120 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-----KERRSSFIAAAVT 416 +K+ + +PP+ EQ I I I+ + E+ + L ++ + S + AA+ Sbjct: 121 IKKFIIPLPPLNEQKRIVAKIEELLPYIEQY-AEKEEKLTALHQQFPEQLKKSILQAAIQ 179 Query: 417 GQ 418 G+ Sbjct: 180 GK 181 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 56/171 (32%), Gaps = 10/171 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP+ W V + + N G T I + +++ G + D D Sbjct: 246 EIPESWVWVRLGEIGETNIGLTYNPSDVASDGTIVLRSGNIQDGKID-VSSDIVKVNLDI 304 Query: 76 STVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 K +L KA I D DG S + + + + +L S Sbjct: 305 PENKRCYKNDLLICARNGSKKLVGKAAIIDKDGY-SFGAFMAIFRSPFNKYIYYYLSSPL 363 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + I T++ + N +P+P L EQ+ I EKI + Sbjct: 364 FRNDFDGINT-TTINQITQSNLNNRLIPLPSLNEQLRIVEKIETLFSTLQN 413 >gi|295101277|emb|CBK98822.1| Restriction endonuclease S subunits [Faecalibacterium prausnitzii L2-6] Length = 393 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 59/399 (14%), Positives = 125/399 (31%), Gaps = 24/399 (6%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 P+ + E + I DV G + NS F + ILY Sbjct: 8 PVGEVCSSISDTYREKKNMVTLINTSDVLEGRVLNHERVPNS-NLKGQFKKTFQRDDILY 66 Query: 89 GKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 ++ P R+ DF D I ST+ +V++ K + + + + E T Sbjct: 67 SEIRPQNRRFAYVDFSPIDYIASTKLMVIRAKKDVVSPKYLYYFLKNSSTVAELQLLAET 126 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 S + + + + ++E I+ ++ IT + + L+++ Q+ Sbjct: 127 RSGTFPQITFSEVANLTIPVPSLAVQEVIVQTMQCLEDKITCNEQINDNLEQQAQSYFQE 186 Query: 206 IVTKGLNPDVKMKDS---GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + +P+ + G G P + + + I + Sbjct: 187 LFVDNADPEWAIGTISDLGTVVGGSTPSKAKPEYYTESGIAWITPKDLSINKSKFVSHGE 246 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N I +L +N + I+ G ++F A + + Sbjct: 247 NDITELGLKN--------SSAAIMPEGTVLFSSRAPIGY-----IAIAAGEVTTNQGFKS 293 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 V P T + L + + + +K +P ++P + ++ Sbjct: 294 VVPKPEIGTPFVYFFLKNTLPVIEGMASGSTFKEVSGSTMKNVPAVIPDAETLAKFSDF- 352 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 A I +E+ L R + + ++G+ID+ Sbjct: 353 ---CAPIFAQQRILEEQNQSLATLRDNLLPKLMSGEIDV 388 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 54/169 (31%), Gaps = 13/169 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYL---PKDGNSR 71 P+ W + I + G T K I +I +D+ K++ D Sbjct: 194 PE-WAIGTISDLGTVVGGSTPSKAKPEYYTESGIAWITPKDLSINKSKFVSHGENDITEL 252 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ +I +G +L+ P IA + + F + PK + + Sbjct: 253 GLKNSSAAIMPEGTVLFSSRAPI-GYIAIAAGEVTTNQGFKSVVPKPEI-GTPFVYFFLK 310 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + IE + G+T + N+P IP + + Sbjct: 311 NTLPVIEGMASGSTFKEVSGSTMKNVPAVIPDAETLAKFSDFCAPIFAQ 359 >gi|17232094|ref|NP_488642.1| type I site-specific deoxyribonuclease chain S [Nostoc sp. PCC 7120] gi|17133739|dbj|BAB76301.1| type I site-specific deoxyribonuclease chain S [Nostoc sp. PCC 7120] Length = 390 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 48/391 (12%), Positives = 117/391 (29%), Gaps = 28/391 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 WK + + G++ + +G ++ ++ Q I Sbjct: 2 SEWKETTLGEIADIIMGQSPTGETCNNNGQGLPLLNGPTEFGDRNPLPTQFTIDPKKIAE 61 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G +L+ G + AD ++ KD + + + + A+ Sbjct: 62 AGDLLFCVRGSTTGRMNWADQKYAIGRGIASIRAKDGILFQPYIRAIIEKELKSLLAVAT 121 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+T + + N+ + +P ++ I +I L ++ + Q L Sbjct: 122 GSTFPNISKDHLLNLIVQLPSKNIKIYISNLARILDEKIYNLRSQNETLEAI----AQTL 177 Query: 203 VSYIVTKGLNPD---VKMKDSGIEW----VGLVPDHWEVKPFFALVTELNR--------K 247 + P+ K SG +G +P+ W V + + Sbjct: 178 FKHWFIDFEFPNADGKPYKSSGGAMVRSALGYIPEAWSVGKLGQYLNIKHGYAFKGEYIT 237 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK----- 302 + + +++ +++ + Y ++ ++ DL + Sbjct: 238 TEVTEKILLTPVNFKIGGGFNDSKYKYYSADDYSNEYVLRRKDLAITMTDLSKEGDSLGY 297 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 + + + V+ + ID T+L +L+ + SG + Sbjct: 298 PAFIPDIKGKVFLHNQRIGKVENNNIDKTFLYFLLCRREYRSHILGTSSGSTVRHTSPSR 357 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + ++P + I + TA ID + Sbjct: 358 ICEYSFVIPDFEL---IDKFSALATATIDKI 385 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 52/174 (29%), Gaps = 19/174 (10%) Query: 10 YKDSGV----QWIGAIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESG 59 YK SG +G IP+ W V + ++ + G + + I + + + G Sbjct: 195 YKSSGGAMVRSALGYIPEAWSVGKLGQYLNIKHGYAFKGEYITTEVTEKILLTPVNFKIG 254 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKL------GPYLRKAIIADFDGIC---STQ 110 G K D S + + + A I D G + + Sbjct: 255 GGFNDSKYKYYSADDYSNEYVLRRKDLAITMTDLSKEGDSLGYPAFIPDIKGKVFLHNQR 314 Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 ++ ++ L L + I G+T+ H I IP Sbjct: 315 IGKVENNNIDKTFLYFLLCRREYRSHILGTSSGSTVRHTSPSRICEYSFVIPDF 368 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 42/148 (28%), Gaps = 7/148 (4%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N + RN + + +I + G+++F + + + I Sbjct: 37 NGPTEFGDRNPLPTQFTIDPKKIAEAGDLLFCVRGSTTGRMNWA---DQKYAIGRGIASI 93 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 GI + +L + ++ + + L V +P I I Sbjct: 94 RAKDGILFQPYIRAIIEKELKSLLAVATGSTFPNISKDHLLNLIVQLPSKNI--KI--YI 149 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSF 410 + +D + + L+ + Sbjct: 150 SNLARILDEKIYNLRSQNETLEAIAQTL 177 >gi|291556519|emb|CBL33636.1| Restriction endonuclease S subunits [Eubacterium siraeum V10Sc8a] Length = 373 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 65/392 (16%), Positives = 137/392 (34%), Gaps = 36/392 (9%) Query: 29 PIKRFTKLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + +T + +D Y+GLE ++SGT K + KG +L Sbjct: 5 RFDQIAINSTEKKKPVEEDRFTYLGLEHLDSGTLKVTRFGSEVAPIGE--KLVMHKGDVL 62 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG--WLLSIDVTQRIEAICEGAT 145 +GK Y +K IA FDGI S +VL+PK+ + + ++ S I G+ Sbjct: 63 FGKRRAYQKKVAIAPFDGIFSAHGMVLRPKENVIDKDFFPLFISSDYFLDAAIKISVGSL 122 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 +W+ + + +P + Q + E + + ++ EL + S Sbjct: 123 SPTINWRDLKELEFELPDMDSQRKLAEVLWSINDTMEAYKKLISATDEL-------VKSQ 175 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + P K ++ +G + K ++ + + + I Sbjct: 176 FIDMFGAPLSNEKGWPLKRIGDLFS-----------LISRGKQPSYVDHSSVRVVNQACI 224 Query: 266 QKLETRNMGLKPESYETYQI---VDPGEIVFRFIDLQNDKRSLRSAQVMERGII---TSA 319 +K ++ + V I+ R ++ + + + Sbjct: 225 YWDRFNFENVKYHDSQSGKKTLPVKKDCILINSTGTGTLGRCNVFPELTDGYVYVVDSHV 284 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + H +++ + ++ D+ K YA GS + L E + + ++VPP++ Q Sbjct: 285 TVLAESHDVNAYFFKCFLQREDVQKKIYAECVNGSTNQIELSKEKLSDVLLVVPPMERQE 344 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + D Q + L+ +RR Sbjct: 345 QFAAFV----RQSDKSKYNASQVMRLIAQRRK 372 Score = 44.0 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 19/162 (11%), Positives = 40/162 (24%), Gaps = 11/162 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-- 80 K W + I L + S D + + + + S Sbjct: 188 KGWPLKRIGDLFSLISRGKQPSYVDHSSVRVVNQACIYWDRFNFENVKYHDSQSGKKTLP 247 Query: 81 FAKGQILYGKLGP-YLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSID-- 132 K IL G L + + + + + VL + L + Sbjct: 248 VKKDCILINSTGTGTLGRCNVFPELTDGYVYVVDSHVTVLAESHDVNAYFFKCFLQREDV 307 Query: 133 -VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 E + + + ++ + +PP+ Q Sbjct: 308 QKKIYAECVNGSTNQIELSKEKLSDVLLVVPPMERQEQFAAF 349 >gi|294775385|ref|ZP_06740904.1| type I restriction modification DNA specificity domain protein [Bacteroides vulgatus PC510] gi|294450767|gb|EFG19248.1| type I restriction modification DNA specificity domain protein [Bacteroides vulgatus PC510] Length = 370 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 61/379 (16%), Positives = 120/379 (31%), Gaps = 21/379 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P W + I G + T K G + D + Sbjct: 2 LPDGWCLTDIGELLINRDGERKP-------VSSVIRSKQTSKIYDYYGAAGVIDKVDSYL 54 Query: 81 FAKGQILYGKLGPYL-----RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 F + +L G+ G L A A+ + VL + L ++ + + Sbjct: 55 FDERLLLIGEDGANLLSRSKNNAFFAEGRYWVNNHAHVLDAT---DKNLLDFIAIVINSM 111 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +++ G+ + IP+ +PPLAEQ I +I ID + ++ + Sbjct: 112 KLDDYITGSAQPKLSQDNLNKIPIVLPPLAEQQRIIAEIKKWFTLIDQIEQDKADLQTTI 171 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + K ++ + L P + IE + + + Sbjct: 172 ELTKSKILDLAIHGKLIPQDPNDEPAIELLKRINPDFTPCDNGHYTQLPEGWAIC-KMKQ 230 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I S++ G + +ET N + + K ++ + +E Sbjct: 231 ITSITNGKSQKNVETLNGIYPIYGSGGVIGRANQYLCIAGSTIIGRKGTINNPIFVEEHF 290 Query: 316 IT--SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 +A+ I YL + S+D K+ S SL + + + +PP K Sbjct: 291 WNVDTAFGLKANDAILDKYLYYFCLSFDFSKL---DKSTAMPSLTKTSIGNVLIPIPPYK 347 Query: 374 EQFDITNVINVETARIDVL 392 EQ I I++ ++ + Sbjct: 348 EQERIVAKIDMVLDTMNEI 366 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 52/155 (33%), Gaps = 8/155 (5%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME-- 312 + S+ K+ + D ++ RS +A E Sbjct: 24 PVSSVIRSKQTSKIYDYYGAAGVIDKVDSYLFDERLLLIGEDGANLLSRSKNNAFFAEGR 83 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + A++ ++A ++ S L + L +++ ++P+++PP+ Sbjct: 84 YWVNNHAHVLDATDKNLLDFIAIVINSMKLDDYI---TGSAQPKLSQDNLNKIPIVLPPL 140 Query: 373 KEQFDITNVINVETARIDVLV---EKIEQSIVLLK 404 EQ I I ID + ++ +I L K Sbjct: 141 AEQQRIIAEIKKWFTLIDQIEQDKADLQTTIELTK 175 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 62/164 (37%), Gaps = 16/164 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W + +K+ T + G++ + +VE+ G Y P G+ + Sbjct: 218 QLPEGWAICKMKQITSITNGKSQK-----------NVETLNGIY-PIYGSGGVIGRANQY 265 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G + G+ G + + T F + +L + L + LS D Sbjct: 266 LCIAGSTIIGRKGTINNPIFVEEHFWNVDTAFGLKANDAILDKYLYYFCLSFDF----SK 321 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + M IGN+ +PIPP EQ I KI ++ Sbjct: 322 LDKSTAMPSLTKTSIGNVLIPIPPYKEQERIVAKIDMVLDTMNE 365 >gi|307253723|ref|ZP_07535587.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306858799|gb|EFM90848.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 428 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 52/415 (12%), Positives = 111/415 (26%), Gaps = 63/415 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---STVSIFAKGQILYGKLGPYLRK- 97 + I YI +D G + D S K I++ + G Sbjct: 16 YKGKNGIPYISPKDFYDKNGIDFANAKKVSEEDYFLLSKKFAPQKNDIIFPRYGTIGVVR 75 Query: 98 AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 I + + S ++ + + + + +L S I+ T + K I Sbjct: 76 VIEENIKLLVSYSCACIRVEYINMQYVVAYLNSELAKLEIKKYTNKTTQPNVGLKSIKKF 135 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ----------------- 200 +P+PPL EQ I KI I+ + + L ++ + Sbjct: 136 IIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQFPEQLKKSILQAAIQGKLTE 195 Query: 201 ---------ALVSYIVTKGLNPDVKMK--------------------------DSGIEWV 225 AL+ I + L P + K E Sbjct: 196 QNPNDEPASALIERIKAEKLRPIAEKKLKKPKVISEIIMRDNLPYEIVNGEERCIADEVP 255 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIE---SNILSLSYGNIIQKLET--RNMGLKPESY 280 +P+ W + + + L GNI ++ Sbjct: 256 FEIPESWVWVRLGEIGETNIGLTYNPSDVASDGTIVLRSGNIQDGKIDVSSDIVKVNLDI 315 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + +++ + + + + + + Y+ + + S Sbjct: 316 PENKRCYKNDLLICARNGSKKLVGKAAIIDKDGYSFGAFMAIFRSPF--NKYIYYYLSSP 373 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 F + + + ++ + +P + EQ I I + + L +K Sbjct: 374 LFRNDFDGINTTTINQITQSNLNNRLIPLPSLNEQLRIVEKIETLFSTLQNLSQK 428 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 67/188 (35%), Gaps = 16/188 (8%) Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPGEIVFRFI 296 + ++ I +S + K K S E Y ++ +I+F Sbjct: 9 DHKMPQEYKGKNGIPYISPKDFYDKNGIDFANAKKVSEEDYFLLSKKFAPQKNDIIFPRY 68 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ 355 R + + +++ + ++ I+ Y+ + S + + Sbjct: 69 GTIGVVRVIEENI---KLLVSYSCACIRVEYINMQYVVAYLNSELAKLEIKKYTNKTTQP 125 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-----KERRSSF 410 ++ + +K+ + +PP+ EQ I I I+ + E+ + L ++ + S Sbjct: 126 NVGLKSIKKFIIPLPPLNEQKRIVAKIEELLPYIEQY-AEKEEKLTALHQQFPEQLKKSI 184 Query: 411 IAAAVTGQ 418 + AA+ G+ Sbjct: 185 LQAAIQGK 192 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 56/171 (32%), Gaps = 10/171 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP+ W V + + N G T I + +++ G + D D Sbjct: 257 EIPESWVWVRLGEIGETNIGLTYNPSDVASDGTIVLRSGNIQDGKID-VSSDIVKVNLDI 315 Query: 76 STVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 K +L KA I D DG S + + + + +L S Sbjct: 316 PENKRCYKNDLLICARNGSKKLVGKAAIIDKDGY-SFGAFMAIFRSPFNKYIYYYLSSPL 374 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + I T++ + N +P+P L EQ+ I EKI + Sbjct: 375 FRNDFDGINT-TTINQITQSNLNNRLIPLPSLNEQLRIVEKIETLFSTLQN 424 >gi|167760901|ref|ZP_02433028.1| hypothetical protein CLOSCI_03289 [Clostridium scindens ATCC 35704] gi|167661504|gb|EDS05634.1| hypothetical protein CLOSCI_03289 [Clostridium scindens ATCC 35704] Length = 487 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 61/438 (13%), Positives = 137/438 (31%), Gaps = 51/438 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V I F K + + Y GL+ +E + K S + + + G Sbjct: 2 KTVKISSFLKERKIKFKPEVAN--YTGLQRIE--KIDFSGKVYLSPVQTNTDMILVKPGD 57 Query: 86 ILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 ++ + I D + + + L+ +L S + + Sbjct: 58 LVISGINVEKGALAIYTGEEDVLASIHYSAYEFDAEKIDIDYLKWFLKSGIFRKLLLKQT 117 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 K I + +P L +Q + +I I + + + + ++ +Q Sbjct: 118 GRGIKKEIKAKHFLPIEIQLPSLNQQHEVVRQIQGVADYIVEINQQIEQQTKYMEILRQT 177 Query: 202 LVSYIVTKGLNPDVKMKD-------------------------------SGIEWVGLVPD 230 ++ + L + S E ++P Sbjct: 178 ILQQAIEGKLCEQNPSDEPASVLLEKIKAEKERLIVEKKIKKQKTLPPISNAEKPFVLPK 237 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLS-------YGNIIQKLETRNMGLKPESYETY 283 WE ++ E R + + + + I L+ S +Y Sbjct: 238 GWEWCRLGEILYEAPRNGYSPPKVERETNTRVLTLTATTSGILDLQHYKYVEDMISESSY 297 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYD 341 + G+++ + + + ++ V+ +G I M + DS Y+ + ++S Sbjct: 298 LWIKQGDLLIQRSNSLDYVGTVCLCDVVIKGYIYPDLMMKAKVSNEADSHYIVYYLKSPF 357 Query: 342 LCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + F +G + +K V +P+ +PPI EQ I + A + +++ Q Sbjct: 358 ARQYFKDRATGTSNSMKKIKQSVVSEIPIALPPINEQKQIVAKMKELFALNQKMNQELLQ 417 Query: 399 SIVLLKERRSSFIAAAVT 416 + + S + A + Sbjct: 418 AKKYASQLMESVLQEAFS 435 Score = 68.3 bits (165), Expect = 3e-09, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 61/141 (43%), Gaps = 1/141 (0%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + +V PG++V I+++ ++ + + I + ID YL W ++ Sbjct: 46 TNTDMILVKPGDLVISGINVEKGALAIYTGEEDVLASIHYSAYEFDAEKIDIDYLKWFLK 105 Query: 339 SYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S K+ G+++ +K + + + +P + +Q ++ I I + ++IE Sbjct: 106 SGIFRKLLLKQTGRGIKKEIKAKHFLPIEIQLPSLNQQHEVVRQIQGVADYIVEINQQIE 165 Query: 398 QSIVLLKERRSSFIAAAVTGQ 418 Q ++ R + + A+ G+ Sbjct: 166 QQTKYMEILRQTILQQAIEGK 186 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 60/200 (30%), Gaps = 13/200 (6%) Query: 21 IPKHWKVVPIKRFTKL--NTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +PK W+ + G + + + L SG Sbjct: 235 LPKGWEWCRLGEILYEAPRNGYSPPKVERETNTRVLTLTATTSGILDLQHYKYVEDMISE 294 Query: 76 STVSIFAKGQILYGKLGP--YLRKAIIAD--FDGICSTQFLV--LQPKDVLPELLQGWLL 129 S+ +G +L + Y+ + D G ++ + + +L Sbjct: 295 SSYLWIKQGDLLIQRSNSLDYVGTVCLCDVVIKGYIYPDLMMKAKVSNEADSHYIVYYLK 354 Query: 130 SIDVTQRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S Q + G +M + IP+ +PP+ EQ I K+ + E Sbjct: 355 SPFARQYFKDRATGTSNSMKKIKQSVVSEIPIALPPINEQKQIVAKMKELFALNQKMNQE 414 Query: 188 RIRFIELLKEKKQALVSYIV 207 ++ + + ++++ Sbjct: 415 LLQAKKYASQLMESVLQEAF 434 >gi|198284497|ref|YP_002220818.1| restriction modification system DNA specificity protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|218667678|ref|YP_002427161.1| type I restriction-modification system, S subunit [Acidithiobacillus ferrooxidans ATCC 23270] gi|198249018|gb|ACH84611.1| restriction modification system DNA specificity domain [Acidithiobacillus ferrooxidans ATCC 53993] gi|218519891|gb|ACK80477.1| type I restriction-modification system, S subunit [Acidithiobacillus ferrooxidans ATCC 23270] Length = 418 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 56/433 (12%), Positives = 124/433 (28%), Gaps = 61/433 (14%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 WK + +L G + + SG + G + DT ++ Sbjct: 3 NEWKECSLGDVIELKRGYDLPQKERL---------SGDVPLVSSSGVT---DTHAKAMVK 50 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ++ G+ G + + +T V K P + +L +D + Sbjct: 51 GPGVVTGRYGTLGQVFYVRQNFWPLNTTLYVYDFKGNDPRFISYFLREVDFLVYSDK--- 107 Query: 143 GATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 A + + + + IP EQ I + +I+ + + Q+ Sbjct: 108 -AAVPGLNRNHLHQARVRIPTDPTEQRRIAHILGTLDDKIENNRKTAKTLEAMAQAIFQS 166 Query: 202 LVS-----YIVTKGLNPDVKMKDSGI--------------EWVGLVPDHWEVKPFFALVT 242 G +P+ K + +G +P+ W V+ + Sbjct: 167 WFVDFDPVRAKMAGESPESICKRLKLTPEILDLFPDKLVDSELGEIPEGWVVRSLDNIGN 226 Query: 243 ELNR----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 LN K + + L + ++ E IV G+I+F + Sbjct: 227 FLNGLALQKFPSKGQDDALPVIKIAQLRSGNLGGADQASCEIEPQYIVHDGDILFSWSGS 286 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL--MRSYDLCKVFYAMGSGLRQS 356 G + V P +L + D + A + Sbjct: 287 LECAI-----WSGGTGALNQHLFKVTPKSDYPRWLCYFGVHHFLDFFREIAAGKATTMGH 341 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-----VEKIEQSIVLLKERRSSFI 411 ++ + + P + ++V + + ++ +E+ L R + + Sbjct: 342 IQRHHLSDSKLPFPC-------SGTLDVMNKPLSSMFEVMWMKTVEEQ--KLVFLRDTLL 392 Query: 412 AAAVTGQIDLRGE 424 ++G+I + E Sbjct: 393 PKLISGEIRVLDE 405 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 64/200 (32%), Gaps = 15/200 (7%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTG----RTSESGKD--IIYIGLEDVESGTGKYLP 65 DS +G IP+ W V + G + G+D + I + + SG Sbjct: 206 DSE---LGEIPEGWVVRSLDNIGNFLNGLALQKFPSKGQDDALPVIKIAQLRSGNLG--- 259 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL-L 124 + + I G IL+ G L AI + G + + PK P Sbjct: 260 -GADQASCEIEPQYIVHDGDILFSWSGS-LECAIWSGGTGALNQHLFKVTPKSDYPRWLC 317 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + R A + TM H + + +P P ++ + + + + Sbjct: 318 YFGVHHFLDFFREIAAGKATTMGHIQRHHLSDSKLPFPCSGTLDVMNKPLSSMFEVMWMK 377 Query: 185 ITERIRFIELLKEKKQALVS 204 E + + L L+S Sbjct: 378 TVEEQKLVFLRDTLLPKLIS 397 >gi|21229940|ref|NP_635857.1| putative restriction modification system specificity subunit [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66766816|ref|YP_241578.1| putative restriction modification system specificity subunit [Xanthomonas campestris pv. campestris str. 8004] gi|21111451|gb|AAM39781.1| putative restriction modification system specificity subunit [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572148|gb|AAY47558.1| putative restriction modification system specificity subunit [Xanthomonas campestris pv. campestris str. 8004] Length = 430 Score = 94.9 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 46/415 (11%), Positives = 120/415 (28%), Gaps = 41/415 (9%) Query: 25 WKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W++ + F R ++++ + E + L + + + Sbjct: 26 WELKKLSCFLVEQKKRNKNLSFGPQEVLSVSGEHGCVNQIELLGRSYAGVS--LANYHVV 83 Query: 82 AKGQILYGK----LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---- 133 G I+Y K P+ G+ ST + V + S D Sbjct: 84 ETGDIVYTKSPLKRNPFGIIKENKGKPGVVSTLYAVYRTTVFGNPAFLDHYFSGDYNLNS 143 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + P + EQ I + + + I + + Sbjct: 144 YLQPIVRKGAKNDMKVSNAAVLAGEVFAPEVEEQKKIADFLTSLDDLISVQVLKVEALKV 203 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEW-----VG---LVPDHWEVKPFFALVTELN 245 K+ L+ + + +++ +G + D + F + Sbjct: 204 ----HKRGLMQELFPREGEASPRLRFPEFSNASGWTLGKASDIIDVLQGYGFPERLQGGR 259 Query: 246 RKNTKLIESNIL--SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL---QN 300 N + + + + G I+ ++ +++ G VF I N Sbjct: 260 EGNFPFYKVSDISACVDAGGILLDKANNHIDADVLEELRAKLMPIGSTVFAKIGEAIRSN 319 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + +++ + + H ++ ++ L + G+ ++K Sbjct: 320 KRAITSRPCLVDNNVAGVKAITGLAHD---RFVYYMWCQIPLIEY----AGGVVPAVKKS 372 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++++PV P EQ I + ++ A+ + + L+ + + Sbjct: 373 LMEQIPVCYPKFDEQQRIADFLSSLDAK----IAAEFDQLAALRTHKKGLMQQLF 423 >gi|308270631|emb|CBX27243.1| hypothetical protein N47_A12720 [uncultured Desulfobacterium sp.] Length = 393 Score = 94.9 bits (234), Expect = 3e-17, Method: Composition-based stats. Identities = 60/407 (14%), Positives = 133/407 (32%), Gaps = 38/407 (9%) Query: 25 WKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKY-LPKDGNSRQSDTSTVS 79 W++V + +++ + S + + + D+ + + N +S Sbjct: 8 WELVKLGGICEIDPSKRELADIASDTLVSFAEMADLNEKRPYFNFSRKSNLGVLKKGGLS 67 Query: 80 IFAKGQILYGKLGPYLR------KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 F +L K+ P A + G ST+F VL+ + P LL + S Sbjct: 68 YFKDADVLLAKMTPCFENGKSGLVAGCLNGIGFGSTEFFVLRGVKIDPYLLYSIISSDFF 127 Query: 134 TQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G T + N +P+PPL EQ I + I+ + + Sbjct: 128 IDSGKLMMLGTTGRKRLMKDFVANYQIPLPPLEEQKQIAALFQSIETAIEQVEVQEKNLQ 187 Query: 193 ELLKEKKQALVSYI--VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 L + L S T LN + K F + ++ + Sbjct: 188 NLKNQLLCELFSEALQFTNYLNKNDFEK----------------IKFEKIALNISERVEP 231 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPES-YETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + ++ KP+ T + G+I+F K ++ Sbjct: 232 QKTTLDTYVGLEHLDPDNLVIARTGKPDDVIGTKLKIYKGDIIFGKRRAYQRKVAVSHFD 291 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVL 368 + S + I+ +L + M+S + G ++K++ + + Sbjct: 292 GI--ASAHSMILRANEKYIEKEFLPFFMQSDVFMNRAVQISEGSLSPTIKWKTLAAQEFI 349 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +P ++Q + + + D ++++Q LK + ++ + Sbjct: 350 LPKKEKQKE----LTKLFKQFDTTRDQLKQQKTTLKNLKQKLLSEIL 392 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 51/136 (37%), Gaps = 8/136 (5%) Query: 285 IVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ + N K L + + G ++ + ++ ID L ++ S Sbjct: 68 YFKDADVLLAKMTPCFENGKSGLVAGCLNGIGFGSTEFFVLRGVKIDPYLLYSIISSDFF 127 Query: 343 CK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 +G+ R+ L + V + +PP++EQ I I+ +E++E Sbjct: 128 IDSGKLMMLGTTGRKRLMKDFVANYQIPLPPLEEQKQIA----ALFQSIETAIEQVEVQE 183 Query: 401 VLLKERRSSFIAAAVT 416 L+ ++ + + Sbjct: 184 KNLQNLKNQLLCELFS 199 >gi|255262928|ref|ZP_05342270.1| restriction modification system DNA specificity domain protein [Thalassiobium sp. R2A62] gi|255105263|gb|EET47937.1| restriction modification system DNA specificity domain protein [Thalassiobium sp. R2A62] Length = 380 Score = 94.9 bits (234), Expect = 3e-17, Method: Composition-based stats. Identities = 60/397 (15%), Positives = 123/397 (30%), Gaps = 38/397 (9%) Query: 32 RFTKLNTGRTSESGKDIIYIGLEDVESGTG-KYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 + G I V + D T+ +K I+ + Sbjct: 11 EVCDIQGGTQPPKSTFIDEPTDGYVRLLQIQDFKTDKKAVFVPDKQTLKKCSKNDIMIAR 70 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G L K ++ +G + + P + + +L + I + A + Sbjct: 71 YGASLGKI-LSGLEGAYNVALVKTIPDLERLDRAYFAHFLRANAFQSFILNLGGRAAQAG 129 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + + I +P+PPL EQ I + R L QA+ + Sbjct: 130 FNKADLERIKIPLPPLEEQKRIAGILDQADALRRLRTRALDRLNTLG----QAIFHEMFG 185 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 +P + +G V V + + K +E+ L+ N + Sbjct: 186 ---DPTHNF---SLATLGEV----------CDVRDGTHDSPKYVETGYPLLTSKNFSTGV 229 Query: 269 ETRNMGLKPESYETYQIVDP------GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + G K S E Y ++ G+IV I + + I A + Sbjct: 230 LSFD-GAKSISEEDYFKINKRSKVDLGDIVMPMIGTIGSPVVI--EEEAAFAIKNVALIK 286 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 ++++ L+ L ++ G G ++ + D+++L +PP ++Q Sbjct: 287 FVEGSPKASFIQTLLSGVYLERIVKTQGRGGTQKFVSLGDLRKLQFPLPPKEQQEAF--- 343 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + I K+ + + +S A G+ Sbjct: 344 -EGLISEIKKQKSKLCNLVTTQETLFASLQHRAFRGE 379 >gi|257417155|ref|ZP_05594149.1| type I restriction endonuclease S subunit domain-containing protein [Enterococcus faecalis AR01/DG] gi|257158983|gb|EEU88943.1| type I restriction endonuclease S subunit domain-containing protein [Enterococcus faecalis ARO1/DG] Length = 379 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 49/398 (12%), Positives = 114/398 (28%), Gaps = 37/398 (9%) Query: 28 VPIKRFTKLNTGRTSESGKDII------YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 +R + + + YI D+ + + ++ N ++ Sbjct: 2 CKFERIVEKLKSYSLSREVETNEFTGMKYIHYGDIHTKKADKVSENSNIPNIIKKNYALL 61 Query: 82 AKGQILYGKLGPYLRKAI-------IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 G ++ + FD + + L+PK++ P L + + Sbjct: 62 EIGDLILTDASEDYKGIATPAVIRENTSFDIVAGLHTIALRPKNIDPMFLYYLIKAPTFR 121 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G + + + IP E L+ + +D + + EL Sbjct: 122 KYGYKVGTGMKVFGISSSKVLDFTTYIPKNDETKLVSSFLEKIDYALDLHQRKLDQLKEL 181 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K Q + + V P ++ D EW + K + K Sbjct: 182 KKAYLQ--LMFPVKDERVPKLRFADFEEEWEQCKLEDLANKYNNLRIPITASKRIYGNTP 239 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + ++ GE + D ++ + V + Sbjct: 240 YYGANGIQDFVEGYT-----------------HDGEFILVAEDGASNLKDYPVQYVNGKV 282 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + + ++ +LM + + + G R L E + L + P E Sbjct: 283 WVNNHAHVLQAKR-SKADNKFLMNAIKSINIEPFLVGGGRSKLNSEVMMNLEINTPSKDE 341 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 Q I + ++D + + + LK + S++ Sbjct: 342 QLKI----STLCKQLDDITALYQNKLNQLKNLKKSYLQ 375 >gi|283796717|ref|ZP_06345870.1| putative type I restriction modification DNA specificity domain protein [Clostridium sp. M62/1] gi|291075601|gb|EFE12965.1| putative type I restriction modification DNA specificity domain protein [Clostridium sp. M62/1] Length = 436 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 76/430 (17%), Positives = 146/430 (33%), Gaps = 42/430 (9%) Query: 23 KHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDV------ESGTGKYLPKDGNSRQSD- 74 W+++ K LNTG + + + L+ + + GT + D Q+ Sbjct: 10 NGWQILKFSECIKQLNTGLNPRNHFSLGHGSLKYITAKNLTQFGTIDFSKCDFIDEQAKR 69 Query: 75 -TSTVSIFAKGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQGWLL 129 S G IL+ P +I D+D S + + K +LP+ L ++ Sbjct: 70 IIHRRSDIQVGDILFSSRAPIGHCHLICEKPDDYDIGESIFSIRVNRKIILPDYLCLYMA 129 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + G+ + + + + +PP+ EQ I E +ID I Sbjct: 130 SDYFVRMASLHTTGSIIQEIRISDLMDTDVILPPMNEQRRIAEC----FKKIDRKIALNN 185 Query: 190 RFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFAL 240 + + L ++ + L Y T+ PD + SG + V +P W Sbjct: 186 KINDNLAQQLRLLYDYWFTQFDFPDESGKPYRSSGGQMVWSDDAKKEIPASWNSTKMSDA 245 Query: 241 VTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP------G 289 + R N KL I ++ N+ G IV G Sbjct: 246 IEGIRTGLNPRDNFKLGSGTIKYITVKNLRSDGILDFSGCDTIDETARAIVHRRSDVCTG 305 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+F I ++ + + + YL ++S K A Sbjct: 306 DILFASIAPLGRCHLVQELPQDWDINESVFSIRCNKATVTPEYLYMHLQSEAFVKESTAC 365 Query: 350 GSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +G + + ++ + +L+PP+ + + + +T + L K+ + I L + R Sbjct: 366 STGSVFKGIRINTLLDSRMLLPPM----QVVDKFSQQTKPLFSLQYKLNKEIQALTQLRD 421 Query: 409 SFIAAAVTGQ 418 + + GQ Sbjct: 422 WLLPMLMNGQ 431 Score = 37.5 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 67/208 (32%), Gaps = 25/208 (12%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP W + + + TG I YI ++++ S Sbjct: 232 EIPASWNSTKMSDAIEGIRTGLNPRDNFKLGSGTIKYITVKNLRSDGILDFSGCDTI--- 288 Query: 74 DTSTVSIFAK------GQILYGKLGPYLRKAII----ADFDGICSTQFLVLQPKDVLPEL 123 D + +I + G IL+ + P R ++ D+D S + V PE Sbjct: 289 DETARAIVHRRSDVCTGDILFASIAPLGRCHLVQELPQDWDINESVFSIRCNKATVTPEY 348 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA------EQVLIREKIIAE 177 L L S + A G+ + + M +PP+ +Q + + Sbjct: 349 LYMHLQSEAFVKESTACSTGSVFKGIRINTLLDSRMLLPPMQVVDKFSQQTKPLFSLQYK 408 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSY 205 + +T+ ++ + QA +S Sbjct: 409 LNKEIQALTQLRDWLLPMLMNGQATISD 436 >gi|317051875|ref|YP_004112991.1| restriction modification system DNA specificity domain [Desulfurispirillum indicum S5] gi|316946959|gb|ADU66435.1| restriction modification system DNA specificity domain [Desulfurispirillum indicum S5] Length = 527 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 50/412 (12%), Positives = 125/412 (30%), Gaps = 36/412 (8%) Query: 28 VPIKRFTKLNTG-RTSESGK----DIIYIG---LEDVESGTGKYLPKDGNSRQSDTSTVS 79 VPI + G + + I ++ LED+ +G + + ++ + + Sbjct: 4 VPISSIADVTAGQGAPKPDEFSDSGIPFVRAGSLEDLLAGKSESDLELVPAQTAKKRKLK 63 Query: 80 IFAKGQILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ KG IL+ K G + + + +L PK + + +L Sbjct: 64 LYPKGSILFAKSGMSATKDRIYVLQNPAHVVSHLAILTPK---DNVYRDYLRLALKQFPP 120 Query: 138 EAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++ + I + +P+P + +Q I + I + +L Sbjct: 121 SSLIKDPAYPAIGLGEIQSYEIPVPEEIDDQKRIAHLLGKVEGLIARRKQHLQQLDDL-- 178 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 L S + +P K + +G F + + +N+ Sbjct: 179 -----LKSVFLEMFGDPVRNEKGWEKDRIGRSTKVQGGFAFKSKDLVTKGNVRLVKIANV 233 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR----SAQVME 312 + + + + G+++ + + Sbjct: 234 HFENLIW----DDVTFVPNHFIEDYIRFALSEGDLLIALTRPIIKSLDVVKTATVREADL 289 Query: 313 RGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLV 369 ++ I+ + + + GL+ ++ ++ +P+ Sbjct: 290 PCLLNQRVARFVFDKAAINKRFFLQYCYTSFFKNTVDKLCPPGLQPNISTNQIEDIPIYY 349 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 PPI Q ++ +++ L +QS+ L+ + A G++DL Sbjct: 350 PPIDLQNQFATIVE----KVEGLKSHYQQSLTDLESLYGALSQKAFKGELDL 397 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 59/207 (28%), Gaps = 22/207 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDG--NSRQSDT 75 K W+ I R TK+ G +S ++ + + +V + N D Sbjct: 195 KGWEKDRIGRSTKVQGGFAFKSKDLVTKGNVRLVKIANVHFENLIWDDVTFVPNHFIEDY 254 Query: 76 STVSIFAKGQILYGKLGPYLR--------KAIIADFDGICSTQF--LVLQPKDVLPELLQ 125 ++ ++G +L P ++ AD + + + V + Sbjct: 255 IRFAL-SEGDLLIALTRPIIKSLDVVKTATVREADLPCLLNQRVARFVFDKAAINKRFFL 313 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + + ++ +C + I +IP+ PP+ Q + Sbjct: 314 QYCYTSFFKNTVDKLCPPGLQPNISTNQIEDIPIYYPPIDLQNQFATIVEKVEGLKSHYQ 373 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLN 212 L AL L+ Sbjct: 374 QSLTDLESLYG----ALSQKAFKGELD 396 >gi|3057070|gb|AAC38352.1| HsdS subunit [Lactococcus lactis] Length = 395 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 56/398 (14%), Positives = 129/398 (32%), Gaps = 45/398 (11%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV----ESGTGKYLPKD 67 +P+ W+ + + + G T + + G D E G Y+ K Sbjct: 11 KVPELRFKGFTNDWEERKLGELSNIVGGGTPSTSNPEYWDGDIDWYAPAEIGEQSYVSKS 70 Query: 68 GNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + S+ I G +L+ AI+A + F + P + Sbjct: 71 KKTITELGLKKSSARILPVGTVLFTSRAGIGNTAILAKE-ATTNQGFQSIVPDQNKLDSY 129 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + ++ + E G+T K + + + +P L+EQ I +D Sbjct: 130 FIFSRTNELKRYGEVTGAGSTFVEVSGKQMSKMSIMVPELSEQQKIGNF----FKELDNT 185 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I R ++LLKE+K+ + + K +++ +G D WE + + Sbjct: 186 IALHQRKLDLLKEQKKGYLQKMFPKNGAKVPELRFAG------FADDWEERKLGDITKIS 239 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 K + + + Y ++ + + + I G V N + Sbjct: 240 TGK--LDANAMVENGKYDFYTSGIKKYRIDVAAFEGPSITIAGNGATVGYMHLADNKFNA 297 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + V++ ++ +++ + K+ +G + + + Sbjct: 298 YQRTYVLQEFLVDRSFIFSEIGNKLP------------KKIKQEARTGNIPYIVMDMLTE 345 Query: 365 LPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIV 401 L + +P EQ I + ++D + ++ + Sbjct: 346 LKLSIPQNNSEQQKIGSF----FKQLDDTIALHQRKLA 379 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 59/167 (35%), Gaps = 6/167 (3%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N + + +I + I + + K + + + + + + Sbjct: 45 NPEYWDGDIDWYAPA-EIGEQSYVSKSKKTITELGLKKSSARILPVGTVLFTSRAGIGNT 103 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLP 366 A + + + ++ P R+ +L + G+G + + + ++ Sbjct: 104 AILAKEATTNQGFQSIVPDQNKLDSYFIFSRTNELKRYGEVTGAGSTFVEVSGKQMSKMS 163 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++VP + EQ I N +D + ++ + LLKE++ ++ Sbjct: 164 IMVPELSEQQKIGNF----FKELDNTIALHQRKLDLLKEQKKGYLQK 206 >gi|120601902|ref|YP_966302.1| restriction modification system DNA specificity subunit [Desulfovibrio vulgaris DP4] gi|120562131|gb|ABM27875.1| restriction modification system DNA specificity domain [Desulfovibrio vulgaris DP4] Length = 595 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 69/490 (14%), Positives = 141/490 (28%), Gaps = 88/490 (17%) Query: 9 QYKDSGVQWIGAIP-----KHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVES 58 YK + G P HWK V + + G + + I I + D+ Sbjct: 4 SYKPIEIVKEGKNPLLGKADHWKRVYVSEIAMVQNGFAFKSKFFSRDEGIPLIRIRDI-- 61 Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 + + + G +L G G ++ A +G+ + + + + Sbjct: 62 ----LSAETEHKYFGQFDKEYLVHNGDLLIGMDGDFV-AAYWPGKEGLLNQRVCRIVIES 116 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + +L I T+ H K + IP+P+PPL EQ I KI Sbjct: 117 ENYDKKFFFLALQPYLDAIHEKTSSVTVKHLSSKTVNEIPLPLPPLNEQNRIVAKIEELF 176 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +D + + E L +Q+L+ + L + +++ G K Sbjct: 177 SELDAGVENLTKAKEQLGVYRQSLLKHAFEGKLTEAWRKRNADKLESGEALLKRVKKERE 236 Query: 239 ALVTELNRKNTKLIESN------------------------------------ILSLSYG 262 + + K + + G Sbjct: 237 EYFKKQLEQWEKDVAQWEADGKPGKKPTQPKKPKKLAPISEEELKELPELPEGWVWARLG 296 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME---------- 312 N+I + + ++ IV ID + K + S E Sbjct: 297 NLIDPPAYGTSRKSDYNIDGTGVLRIPNIVDGKIDSSDLKYTAFSPGEEEQYRLKAGDLL 356 Query: 313 ----RGIITSAYMAVKPHGIDSTYLA--WLMR-----------------SYDLCKVFYAM 349 G ++ D+ Y+ +L+R S L + Sbjct: 357 TIRSNGSVSLVGQCALIEDDDTRYVYAGYLIRLRTIGLLVSKFLLYCLSSLRLRNQIESK 416 Query: 350 GSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 ++ +++ L V + EQ +++ ++ + IE + ++ + Sbjct: 417 AKSTSGVNNINSQELSSLIVPLCSQLEQNEVSKLLADSLSTAGEQTSMIEIQLEHIRILK 476 Query: 408 SSFIAAAVTG 417 S + A +G Sbjct: 477 QSILDKAFSG 486 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 51/140 (36%), Gaps = 6/140 (4%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM-R 338 ++ +V G+++ + + + G++ + + + + Sbjct: 74 FDKEYLVHNGDLLIGMDGD-----FVAAYWPGKEGLLNQRVCRIVIESENYDKKFFFLAL 128 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 L + S + L + V +P+ +PP+ EQ I I + +D VE + + Sbjct: 129 QPYLDAIHEKTSSVTVKHLSSKTVNEIPLPLPPLNEQNRIVAKIEELFSELDAGVENLTK 188 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 + L R S + A G+ Sbjct: 189 AKEQLGVYRQSLLKHAFEGK 208 >gi|289524551|ref|ZP_06441405.1| type I restriction-modification system specificity determinant [Anaerobaculum hydrogeniformans ATCC BAA-1850] gi|289502210|gb|EFD23374.1| type I restriction-modification system specificity determinant [Anaerobaculum hydrogeniformans ATCC BAA-1850] Length = 113 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 24/69 (34%), Positives = 36/69 (52%) Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 Q+L V PP+ EQ I ++ +TA+ID + I LL+E R+ IA Sbjct: 1 QNLDSRTYLSELVAFPPLPEQTAIVEYLDTQTAKIDAAISAARSEIDLLREYRTRLIADV 60 Query: 415 VTGQIDLRG 423 VTG++D+R Sbjct: 61 VTGKVDVRE 69 >gi|307259764|ref|ZP_07541484.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306866154|gb|EFM98022.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 427 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 52/431 (12%), Positives = 117/431 (27%), Gaps = 67/431 (15%) Query: 27 VVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---ST 77 V ++ + + + + I YI +D G + D S Sbjct: 2 WVRLEDVCQEISDIDHKMPQEYKGKNGIPYISPKDFYDKNGIDFANAKKVSEEDYFLLSK 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 K I++ + G II + + S ++ + + + + +L S Sbjct: 62 KFAPQKNDIIFPRYGTIGVVRIIEENIKLLVSYSCACIRVEYINMQYVVAYLNSELAKLE 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I+ T + K I +P+PPL EQ I KI I+ + + L + Sbjct: 122 IKKYTNKTTQPNVGLKSIKKFIIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQ 181 Query: 197 EKK----QALVSYIVTKGLNPDVKM----------------------------------- 217 + ++++ + L Sbjct: 182 QFPEQLKKSILQAAIQGKLTEQNPNDEPASALIERIKAEKLRLIAEKKLKKPKVISEIIM 241 Query: 218 -------------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + E +P+ W + ++ G Sbjct: 242 RDNLPYEIVNGKERCIADEVPFEIPESWVWVRLSEISKITMGQSPDNK----YLGKEGIE 297 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + ++ ES + Y + I L I +++ Sbjct: 298 FHQGKSFFSEYIIESSDIYCSLPNKLATPNSILLCVRAPVGIVNITNRELCIGRGLASIE 357 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +++ +L + + Y +++ + + + +PP+ EQ I I Sbjct: 358 SIYVNTIFLYYALFCYKNY-YERKSTGSTFKAISKDIIDNTIIPIPPLNEQIRIVEKIET 416 Query: 385 ETARIDVLVEK 395 + + L +K Sbjct: 417 LFSTLQNLSQK 427 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 67/188 (35%), Gaps = 16/188 (8%) Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPGEIVFRFI 296 + ++ I +S + K K S E Y ++ +I+F Sbjct: 16 DHKMPQEYKGKNGIPYISPKDFYDKNGIDFANAKKVSEEDYFLLSKKFAPQKNDIIFPRY 75 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ 355 R + + +++ + ++ I+ Y+ + S + + Sbjct: 76 GTIGVVRIIEENI---KLLVSYSCACIRVEYINMQYVVAYLNSELAKLEIKKYTNKTTQP 132 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-----KERRSSF 410 ++ + +K+ + +PP+ EQ I I I+ + E+ + L ++ + S Sbjct: 133 NVGLKSIKKFIIPLPPLNEQKRIVAKIEELLPYIEQY-AEKEEKLTALHQQFPEQLKKSI 191 Query: 411 IAAAVTGQ 418 + AA+ G+ Sbjct: 192 LQAAIQGK 199 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 35/167 (20%), Positives = 57/167 (34%), Gaps = 10/167 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---S 76 IP+ W V + +K+ G++ ++ Y+G E +E GK + SD Sbjct: 264 EIPESWVWVRLSEISKITMGQSPDNK----YLGKEGIEFHQGKSFFSEYIIESSDIYCSL 319 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + IL P I + + ++ V + + Sbjct: 320 PNKLATPNSILLCVRAPV-GIVNITNRELCIGRGLASIESIYVN--TIFLYYALFCYKNY 376 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 E G+T I N +PIPPL EQ+ I EKI + Sbjct: 377 YERKSTGSTFKAISKDIIDNTIIPIPPLNEQIRIVEKIETLFSTLQN 423 >gi|18765810|gb|AAL78768.1|AF326617_1 HP790-like protein [Helicobacter pylori] Length = 409 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 56/403 (13%), Positives = 128/403 (31%), Gaps = 30/403 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + G++ K + + + + G + +R + Sbjct: 13 PKGVGFRKLGEVCDFQKGKSITK-KAVTFGKVPVISGGRQPAYYHNEANRIGE------- 64 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G Y D + F V PK + I A Sbjct: 65 ---TIAISSSGVYAGYVSYWDIPVFLADSFSVS-PKQKTLMPKYLFYYLTTQQDAIHATK 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + H K + N +PIPPL Q I + + A T L TE + K++ Q Sbjct: 121 SAGGIPHVYSKDLQNFLIPIPPLEIQQEIVKILDAFTELNTELNTELNTELNARKKQYQY 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL------VTELNRKNTKLIESN 255 + ++ N + E + P +K + +++++ Sbjct: 181 YQNMLLD--FNDINQSHKDAKERLVQKPYPKRLKTLLQTLAPKGVGFRKLGEVCEILDNR 238 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK--RSLRSAQVMER 313 + ++ + + Y I D ++ +K + + + Sbjct: 239 RIPIAKNKRNPGIYPYYGANGIQDYIDSYIFDGDFVLVGEDGSVINKDNTPVVNWASGKI 298 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + A++ + + +L + +++ D+ +G + E++K++ + +PP++ Sbjct: 299 WVNNHAHVLQTKNELKLKFLYFYLQTIDV----SYCVAGTPPKINQENLKKITIPIPPLE 354 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q +I +++ + L+ I I K+ R + Sbjct: 355 IQQEIVKILDQFSILTTDLLAGIPAEIKARKKQYEYYREKLLT 397 >gi|108563257|ref|YP_627573.1| HP0790-like protein [Helicobacter pylori HPAG1] gi|107837030|gb|ABF84899.1| HP0790-like protein [Helicobacter pylori HPAG1] Length = 412 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 53/389 (13%), Positives = 122/389 (31%), Gaps = 27/389 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + G++ K + + + + G + +R + I Sbjct: 19 RKLGEVCDFQKGKSITK-KAVTFGKVPVISGGRQPAYYHNEANRSGE----------TIA 67 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G Y D + F V PK + I A + Sbjct: 68 ISSSGVYAGYVSYWDIPVFLADSFSVS-PKQKTLMPKYLFHYLTTQQDAIHATKSTGGIP 126 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 H K + N +PIPPL Q I + + A T L TE + ++ Y Sbjct: 127 HVYSKDLQNFLIPIPPLEIQQEIVKILDAFTELNTELNTELNTELNTELNARKKQYQYYQ 186 Query: 208 TKGLN------PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+ K S + + + + + +++++ + ++ Sbjct: 187 NMLLDFNDINQNHKDAKMSAKTYPKRLKTLLQTLVPKGVEFRKLGEVCEILDNRRIPIAK 246 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ--VMERGIITSA 319 + + Y I D ++ +K + + + A Sbjct: 247 NKRKPGIYPYYGANGIQDYIDSYIFDGDFVLVGEDGSVINKNNTPVVNWASGKIWVNNHA 306 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ++ + + +L + +++ D+ +G + E++K++ + +PP++ Q +I Sbjct: 307 HVLQTKNELKLKFLYFYLQTIDVSYYV----AGTPPKINQENLKKITIPIPPLEIQQEIV 362 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRS 408 +++ +A L+ I I K R+ Sbjct: 363 KILDQFSALTTDLLAGIPAEI---KARKK 388 Score = 44.4 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 45/162 (27%), Gaps = 13/162 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +PK + + ++ R K I +G Y+ Sbjct: 221 VPKGVEFRKLGEVCEILDNRRIPIAKNKRKPGIYPYYGANGIQDYIDSYIFDGDFV---- 276 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + + + K A + VLQ K+ L + + Sbjct: 277 -LVGEDGSVINKNNT--PVVNWASGKIWVNNHAHVLQTKNELKLKFLYF----YLQTIDV 329 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + T + + + I +PIPPL Q I + + + Sbjct: 330 SYYVAGTPPKINQENLKKITIPIPPLEIQQEIVKILDQFSAL 371 >gi|324005094|gb|EGB74313.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 57-2] Length = 584 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 75/510 (14%), Positives = 149/510 (29%), Gaps = 105/510 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLE 54 +K K P+ S + +P+ W+ V G+T KD I ++ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVTFSHLGYFFGGKTPSKMKDEYWGGTIPWVTPK 140 Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQF 111 D+++ + ++ + G IL+ LR I + + Sbjct: 141 DMKTNLIVDSEDKVTPLAIE-DGLTKVSPGSILFVARSGILRRIFPVAITSIECTVNQDI 199 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQV-- 168 VL P +++ IE + G T+ ++ + P IPP AEQ Sbjct: 200 KVLSPFFSDISYYIRLMMNGFERYIIENLTKTGTTVESLLFEDFISHPFMIPPFAEQNRI 259 Query: 169 ---------------------------------------LIREKIIAETVRIDTLITERI 189 E++ RI Sbjct: 260 LSTVKKLMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNAEELAENWARISEHFDTLF 319 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------------ 219 + KQ ++ V L P + Sbjct: 320 TTEASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLPP 379 Query: 220 -SGIEWVGLVPDHWEVKPFFA---LVTELNRKNTKLIESNILSLSYGNIIQKLETRNM-G 274 S E +PD WE +T+ + + ++ I L GN+ + + + + Sbjct: 380 ISDEEKPFELPDGWEWCRLNDLFSFITDGDHQAPPKSDTGIPFLVIGNLNKGIVSFDECK 439 Query: 275 LKPESYETY----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 P Y + G++++ + E + +K Sbjct: 440 YVPIDYYERLDWSRKPCQGDVLYTVTGSYGIPIIV---DNNEPFCVQRHVAILKSCSNTP 496 Query: 331 -TYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 TYL +L S + +G ++++ ++ +P+ VP Q I Sbjct: 497 ITYLRYLFLSKYSYAYAEKIATGIAQKTVPLTGLRLMPIPVP----QHRTLLNIINLIKL 552 Query: 389 IDVLVEKIEQSIVLLKERRSSF-IAAAVTG 417 +D + E ++ I ++ + +A A+TG Sbjct: 553 VDAMSESLKIGIQSAQQ--TQLHLADALTG 580 >gi|393411|emb|CAA52162.1| hsdS [Escherichia coli] Length = 406 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 53/400 (13%), Positives = 116/400 (29%), Gaps = 48/400 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + V + + TG ++ + I V S S F + Sbjct: 17 EWVTLGSMADIGTGSSNRQDESENGIYPFYVRSKNIL------------KSDTFEFDEVA 64 Query: 86 ILYGKLGPYLRKAIIADFDGICST-QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I+ G + + + + + + +S Q I GA Sbjct: 65 IVIPGEGGIGDIFHYVEGKYALHQRAYRIRITTNAVDTKFLYYFMSSSFKQYILTKSVGA 124 Query: 145 TMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 T + +PIP LA Q I + T L E + + K+ Sbjct: 125 TAISIRKPMLEGFKVPIPSPDNPEKSLAIQSEIVRILDTFTALTAELTAELTAELNMRKK 184 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + +++ K+ +EW +G K ++ Sbjct: 185 QYNYYRDQLLS--------FKEGEVEWKTLGE---------IGNFTYGYAAKAMDSGDAR 227 Query: 256 ILSLSYGNIIQKLETRN-MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + ++ N KL N M ++ +D +++ K + Sbjct: 228 FVRITDINKDGKLSKENPMYVELNEENEKYTLDKNDLLMARTGATFGKTMIFEEDYPAVY 287 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVP--- 370 + + I++ Y +S + + G + +K++ V +P Sbjct: 288 AGFLIKLNLNETIINAKYYWHFAQSDFFWEQANKLVSGGGQPQFNANALKQVRVPIPYPS 347 Query: 371 ----PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + EQ I ++++ A + E + + I L +++ Sbjct: 348 HPQKSLDEQGRIVDILDKFDAIAASITEGLPREIELRQKQ 387 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 21/166 (12%), Positives = 55/166 (33%), Gaps = 16/166 (9%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVF----RFIDLQNDKRSLRSAQVMERGIITS 318 + + G+ P + I+ F I + + + + Sbjct: 30 GSSNRQDESENGIYPFYVRSKNILKSDTFEFDEVAIVIPGEGGIGDIFHYVEGKYALHQR 89 Query: 319 AY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP------- 370 AY + + + +D+ +L + M S + S++ ++ V +P Sbjct: 90 AYRIRITTNAVDTKFLYYFMSSSFKQYILTKSVGATAISIRKPMLEGFKVPIPSPDNPEK 149 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + Q +I +++ TA L ++ + + K+ R ++ Sbjct: 150 SLAIQSEIVRILDTFTALTAELTAELTAELNMRKKQYNYYRDQLLS 195 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/229 (14%), Positives = 74/229 (32%), Gaps = 24/229 (10%) Query: 1 MKHYKAYPQYKD---SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE--SGKDIIYIGLED 55 M+ K Y Y+D S + G + + + G ++ D ++ + D Sbjct: 181 MRK-KQYNYYRDQLLSFKE--GEV----EWKTLGEIGNFTYGYAAKAMDSGDARFVRITD 233 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLV 113 + ++ + K +L + G K +I D+ + + + Sbjct: 234 INKDGKLSKENPMYVELNEENEKYTLDKNDLLMARTGATFGKTMIFEEDYPAVYAGFLIK 293 Query: 114 LQPKDVLPELLQGWLL--SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-------L 164 L + + W S ++ + G + + + +PIP L Sbjct: 294 LNLNETIINAKYYWHFAQSDFFWEQANKLVSGGGQPQFNANALKQVRVPIPYPSHPQKSL 353 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 EQ I + + + I ITE + L++K+ ++ P Sbjct: 354 DEQGRIVDILD-KFDAIAASITEGLPREIELRQKQYEYYRDLLFSFPKP 401 >gi|146300449|ref|YP_001195040.1| restriction modification system DNA specificity subunit [Flavobacterium johnsoniae UW101] gi|146154867|gb|ABQ05721.1| restriction modification system DNA specificity domain protein [Flavobacterium johnsoniae UW101] Length = 267 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 57/251 (22%), Positives = 103/251 (41%), Gaps = 18/251 (7%) Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 ++ R IELL EKK+A++ + KGL P+V MKDSGIEW G +P+HW+V L Sbjct: 18 QFCPKKTRLIELLDEKKKAVIIQNIIKGLAPNVAMKDSGIEWFGEIPEHWKVVKLKYLSK 77 Query: 243 ELNRKN---------TKLIESNILSLSYGNIIQKLETRNMGLKPESYE-----TYQIVDP 288 ++ + +L E+ + G+ + N + S+E ++ D Sbjct: 78 NIDTGSTPNGYDIPIEELNENVWNWFTPGDFNEDFNFVNESKRKLSFEVVEDNNVRLYDS 137 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 ++F I K ++ ++ + + S + Sbjct: 138 NSVMFVGIGATLGKIAV----TDTNFYTNQQINIIELNNDINKMFVAYSLSATIKISKML 193 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 S L + + + + +P + EQ + + + KI SI LLKE+R+ Sbjct: 194 ANSATLPILNQQKLGDIQIPIPDLNEQILVVERLENIYFNHFNIANKISTSIELLKEKRT 253 Query: 409 SFIAAAVTGQI 419 + I+A + G+I Sbjct: 254 AIISATINGEI 264 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 47/216 (21%), Positives = 97/216 (44%), Gaps = 13/216 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDII----------YIGLEDVES 58 KDSG++W G IP+HWKVV +K +K ++ +G DI + D Sbjct: 51 AMKDSGIEWFGEIPEHWKVVKLKYLSKNIDTGSTPNGYDIPIEELNENVWNWFTPGDFNE 110 Query: 59 --GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + + + + V ++ +++ +G L K + D + + Q +++ Sbjct: 111 DFNFVNESKRKLSFEVVEDNNVRLYDSNSVMFVGIGATLGKIAVTDTNFYTNQQINIIEL 170 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + ++ + + + + AT+ + + +G+I +PIP L EQ+L+ E++ Sbjct: 171 NNDINKMFVAYS-LSATIKISKMLANSATLPILNQQKLGDIQIPIPDLNEQILVVERLEN 229 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + + IELLKEK+ A++S + +N Sbjct: 230 IYFNHFNIANKISTSIELLKEKRTAIISATINGEIN 265 >gi|312130088|ref|YP_003997428.1| restriction modification system DNA specificity domain [Leadbetterella byssophila DSM 17132] gi|311906634|gb|ADQ17075.1| restriction modification system DNA specificity domain [Leadbetterella byssophila DSM 17132] Length = 390 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 48/386 (12%), Positives = 115/386 (29%), Gaps = 21/386 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + +P W + K T + + G + +++ + Sbjct: 8 LKDVPVEW--KALGGIVKTKTAPSKIKREHYCLSGSNPIIDQGAQFIAGYTDV------N 59 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + K + + G + DF + + + + ++ Sbjct: 60 FPMVEKNEYII--FGDHSEHIKYVDFS-FIQGADGLKILNSKNNNVKYLYYCFLSFYEKE 116 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + T + I P LA Q I + + + L + I+ K+ Sbjct: 117 GSYQRHWTKAKETLIPIPYPNDPEKSLAVQQEIVRVLDGLSEQNKALTAALAQEIDQRKK 176 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + + +V+ K G E VG ++ I Sbjct: 177 QYEYYREELFRFE-GKEVEWKTLGDENVGKFTRGSGLQKKDF--------TEFGIGCIHY 227 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 Y + + + + G++V +D A + E I Sbjct: 228 GQVYTYYNTYTYETKSFVSVDFAKNARKAKTGDLVIATTSENDDDVCKAVAWLGEEDIAV 287 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQF 376 S+ H ++ ++A+ ++ K +G + + D++++ + P I EQ Sbjct: 288 SSDACFYSHSLNPKFVAYYFQTEQFQKQKRKYITGTKVRRVNVNDLEKITIPKPVITEQE 347 Query: 377 DITNVINVETARIDVLVEKIEQSIVL 402 I ++++ +V ++E+ I L Sbjct: 348 RIVHLLDQYDEATKNIVAQLEREIEL 373 Score = 39.8 bits (91), Expect = 0.98, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 41/132 (31%), Gaps = 20/132 (15%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + +V+ E + ++ K S G + + YL + S+ Sbjct: 59 NFPMVEKNEYIIFGDHSEHIKYVDFSFIQGADG-----LKILNSKNNNVKYLYYCFLSFY 113 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVETARIDVLVE 394 + ++ K + +P + Q +I V++ + + L Sbjct: 114 EKE------GSYQRHWTKA--KETLIPIPYPNDPEKSLAVQQEIVRVLDGLSEQNKALTA 165 Query: 395 KIEQSIVLLKER 406 + Q I K++ Sbjct: 166 ALAQEIDQRKKQ 177 >gi|317502418|ref|ZP_07960582.1| restriction endonuclease S subunit [Lachnospiraceae bacterium 8_1_57FAA] gi|316896156|gb|EFV18263.1| restriction endonuclease S subunit [Lachnospiraceae bacterium 8_1_57FAA] Length = 363 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 57/390 (14%), Positives = 111/390 (28%), Gaps = 32/390 (8%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + + G ++ + L DV G++ + + Sbjct: 3 VKLGDVCE--RGTSN--------LKLSDVSEKNGEFSVFGASGYIGSVDFYQQGYP-YVA 51 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 K G + +A++ L PKD + +++ +E GAT+ Sbjct: 52 VVKDGAGIGRAMLCPGKTSVIGTMQYLLPKDNILPKYLFYVVK---YMNLEKYFTGATIP 108 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 H +K N QV I + + + +I + ++LL + +A V Sbjct: 109 HIYFKDYKNEEFNFDFWERQVEIVSVL----SKCEKVIDLCKQELQLLDKLIKA---RFV 161 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + K + + K A L N+ + Sbjct: 162 EMFGDVIHNSKKWQVCLFAEITSSRLGKMLDAKQQTGRNSYPYLANFNVQWFRF-----N 216 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 LE N E + G+++ + + Sbjct: 217 LENLNKMDFDEKDRAEFELREGDLLVCEGGEIGRCAVWHNELQPCFFQKALHRVRCNHQI 276 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I YLAW R F A+ L +K+L V VPP++ Q + Sbjct: 277 ILPDYLAWWFRYNCDYGGFSALAGAKATIAHLPGAKLKQLQVAVPPMELQEQFAVFV--- 333 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 A+ D +++++ + S + Sbjct: 334 -AQTDKSKVAVQKALDEAQLLFDSLMQEYF 362 >gi|6137144|gb|AAF04354.1| restriction modification system specificity subunit [Streptococcus thermophilus] Length = 419 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 55/415 (13%), Positives = 133/415 (32%), Gaps = 30/415 (7%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 +P+ W+ + + R YI Y K Sbjct: 11 EVPELRFKGFTDEWEERKLSSIANVLLERIKIMIDSSSYYISTRWFSGSKNDYFNK--QV 68 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPEL-L 124 D + + G+ Y K G+ ST ++V +P + + + Sbjct: 69 ASRDVTGYFLVKNGEFAYNKSYSNGYPWGAIKRLDKYEMGVLSTLYIVFKPTAINSQFLV 128 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + + + EGA + + + + +++I + ++D Sbjct: 129 SYYETTRWYREVSKNAAEGARNHGLLNISPNDFFNTLLTIPKSAEEQQQIGSFFKQLDDT 188 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 IT R ++LLKE+K+ + + K +++ +G + ++ + Sbjct: 189 ITLHQRKLDLLKEQKKGFLQKMFPKNSAKVPELRFAG---FADDWEERKLSDIADKAVDN 245 Query: 245 NRKNTKLIESNILSLSYG----NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 K + E + + + + ++P E + G+I + Sbjct: 246 RGKTPTISEDESSVIRGCKSRKRCSRLFQVLILAMRPLMTEFAAYIKEGDICVFYCGKYW 305 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLRQSLKF 359 +L Q+ I + DS +L +++ + + ++ S+K Sbjct: 306 FGLALMDTQMKNATIAQNIVAFRANEKYDSKFLYANVIKEGESSNKAHVCDGAVQPSIKV 365 Query: 360 EDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + V ++EQ + +D L+ ++ + LLKE++ F+ Sbjct: 366 SQLVDVDYCVTENMEEQRKLGEY----FLNLDNLITLHQRKLDLLKEQKKGFLQK 416 >gi|323223408|gb|EGA07738.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. MB102109-0047] gi|323225169|gb|EGA09416.1| restriction modification system DNA specificity domain protein [Salmonella enterica subsp. enterica serovar Montevideo str. MB110209-0055] Length = 361 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 62/367 (16%), Positives = 124/367 (33%), Gaps = 32/367 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W ++ + KL G + + K + I ++++ +G+G Y G + Sbjct: 2 VPKGWMLLQVSDICKLQNGNSFKPHEWDTKGLPIIRIQNL-NGSGNYNYFSGVPQD---- 56 Query: 77 TVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + GQ+L+ G I G+ + + + + E L Sbjct: 57 -KWLVEPGQLLFSWAGTKGVSFGPFIWNGPKGVLNQHIYKVFANENVHEHWLYLALLHIT 115 Query: 134 TQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + T+ H K I N + PP+AEQ I + + + I+ + + Sbjct: 116 QKIEAQAHGFKSTLLHVQKKDIDNQFVLTPPVAEQKKISQIL----STWNKAISVTEKLL 171 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 +++K+AL+ ++T + ++G+ + G W + + + K Sbjct: 172 ANSQQQKKALIQQLLT---GKKRLLDENGVRFSGE----WCTCTLSEVAHIIMGSSPKSE 224 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQV 310 N L I + + P Y + PG+I+ Sbjct: 225 AYNDNGLGLPLIQGNADIKCRVSCPRVYTSDITKECTPGDILLSVRAPVGTVA-----LS 279 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + I A+K S + + K Y +S+ +D+K L + VP Sbjct: 280 QHKACIGRGISAIKSKRKMSQSFLYQWFLWFEPKWCYLSQGSTFESINSDDIKTLKLSVP 339 Query: 371 PIKEQFD 377 +EQ Sbjct: 340 NFEEQQK 346 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 73/200 (36%), Gaps = 7/200 (3%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +VP W + + N + K E + L I + N + +V Sbjct: 1 MVPKGWMLLQVSDICKLQNGNSFKPHEWDTKGLPIIRIQNLNGSGNYNYFSGVPQDKWLV 60 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 +PG+++F + + +G++ V + + +L + K+ Sbjct: 61 EPGQLLFSWAGTKGVSFG-PFIWNGPKGVLNQHIYKVFANENVHEHWLYLALLHITQKIE 119 Query: 347 YAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 ++ +D+ VL PP+ EQ I+ +++ + + E+ + + Sbjct: 120 AQAHGFKSTLLHVQKKDIDNQFVLTPPVAEQKKISQILSTW----NKAISVTEKLLANSQ 175 Query: 405 ERRSSFIAAAVTGQIDLRGE 424 +++ + I +TG+ L E Sbjct: 176 QQKKALIQQLLTGKKRLLDE 195 >gi|218665757|ref|YP_002425317.1| type I restriction-modification system, S subunit [Acidithiobacillus ferrooxidans ATCC 23270] gi|218517970|gb|ACK78556.1| type I restriction-modification system, S subunit [Acidithiobacillus ferrooxidans ATCC 23270] Length = 409 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 55/384 (14%), Positives = 115/384 (29%), Gaps = 25/384 (6%) Query: 23 KHWKVVPIKRFTK-LNTGRTSESGK------DIIYIGLEDVESG-TGKYLPKDGNSRQSD 74 W+ + +G T + + + +I + + K + Sbjct: 25 SDWQKTTVGEIASGFLSGGTPSTSRADFWEGENPWITSKWLGDKLELTTGEKFVSEGAVK 84 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSID 132 + I K I++ + K I D + +++ + + L L Sbjct: 85 KTATKIVPKDSIIFAT-RVGVGKVGINRIDLAINQDLAGVLIDNERYDIKFLAYQLGIDS 143 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + Q + GAT+ + I + +PPL EQ I + + I + R I Sbjct: 144 IQQYVAMNKRGATIKGITRDCLEQIRLNLPPLPEQKKIAHIL----STVQRAIEAQERII 199 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + E K+AL+ + T+GL + K + I + + E+ +T K Sbjct: 200 QTTTELKKALMHKLFTEGL-RNEPQKQTEIGPIPESWEVVEIGDLGKCITGSTPKTKVDS 258 Query: 253 ESNILSLSYG-----NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + + + + + PE T + + ++ I K + Sbjct: 259 FYDPPTEDFIAPADLGARRYVYDSEKKISPEGMATIRPIPRNAVMCVCIGSSIGKVGMSY 318 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 E ++ + + SY G L + V Sbjct: 319 R---EESATNQQINSIICGEGRDPEFVYCLLSYRSDYWKSFATFGPVPILSKGRFSTIGV 375 Query: 368 LVP-PIKEQFDITNVINVETARID 390 +P + EQ I + I+ Sbjct: 376 PIPSSLDEQQAIAKPLVSTVKDIE 399 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 58/169 (34%), Gaps = 7/169 (4%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + I S G+ ++ + +T + P + + + K + Sbjct: 53 WEGENPWITSKWLGDKLELTTGEKFVSEGAVKKTATKIVPKDSIIFATRVGVGKVGINRI 112 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPV 367 + + + D +LA+ + + + G + + + ++++ + Sbjct: 113 DLAINQDLAGVLI--DNERYDIKFLAYQLGIDSIQQYVAMNKRGATIKGITRDCLEQIRL 170 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +PP+ EQ I ++++ +E E+ I E + + + T Sbjct: 171 NLPPLPEQKKIAHILSTV----QRAIEAQERIIQTTTELKKALMHKLFT 215 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 65/187 (34%), Gaps = 10/187 (5%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG-----LEDVESGTGKYLP 65 K + IG IP+ W+VV I K TG T ++ D Y + + G +Y+ Sbjct: 224 KQTE---IGPIPESWEVVEIGDLGKCITGSTPKTKVDSFYDPPTEDFIAPADLGARRYVY 280 Query: 66 KDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 +T+ + ++ +G + K ++ + + Q + + Sbjct: 281 DSEKKISPEGMATIRPIPRNAVMCVCIGSSIGKVGMSYREESATNQQINSIICGEGRDPE 340 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDT 183 + L + ++ + I +PIP L EQ I + +++ I+ Sbjct: 341 FVYCLLSYRSDYWKSFATFGPVPILSKGRFSTIGVPIPSSLDEQQAIAKPLVSTVKDIEG 400 Query: 184 LITERIR 190 + Sbjct: 401 FVYADGH 407 >gi|82750028|ref|YP_415769.1| type-I specificity determinant subunit [Staphylococcus aureus RF122] gi|82655559|emb|CAI79953.1| type-I specificity determinant subunit [Staphylococcus aureus RF122] Length = 410 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 51/386 (13%), Positives = 112/386 (29%), Gaps = 24/386 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + G+ + I ++ + G + K + + + Sbjct: 20 EWEEKKLGDVATFAKGKLGAKKDVSQNGVPIILYGELYTKYGAIVSKIFSKTDIPENKLK 79 Query: 80 IFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 I K +L G I + +L P+ + ++ Sbjct: 80 IAKKNDVLIPSSGETAIDIATASCIYLNKGVAVGGDINILTPQKQDDRFI-SLSINGINK 138 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVRIDTLITERIRFIE 193 + +G T+ H I N+ + P EQV I + +I+ + + Sbjct: 139 NELSKYAQGKTVVHLYNNDIKNLKIVFPSEFEEQVRIGDFFSKLDRQIELEEQKLELLQQ 198 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 K Q + S + + + + +G V + + KN Sbjct: 199 QKKGYMQKIFSQELRFKDENGEEYPEWEEKQLGEVAE-------IIGGGPPSTKNKLYWN 251 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 I S I K + E + + I + ++A + + Sbjct: 252 GEINWFSPI-EIGNKTYVYSSQKKITEEGLRKSSAKILPVGTILFTSRAGIGKTAILAKE 310 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPI 372 + ++ P S L + + + +++L + +P I Sbjct: 311 STTNQGFQSIVPRKGVLDSYYVYTISNILKILAEKVSAGSTFSEISKKQMEQLNLNIPMI 370 Query: 373 KEQFDITNVINVETARIDVLVEKIEQ 398 KEQ +I+ ++ D L+E E+ Sbjct: 371 KEQKNISKF----FSKFDNLIEIQER 392 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 54/177 (30%), Gaps = 2/177 (1%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 +++ G E +V F + ++ IL + ++ Sbjct: 10 PELRFPGFEGEWEEKKLGDVATFAKGKLGAKKDVSQNGVPIILYGELYTKYGAIVSKIFS 69 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI-ITSAYMAVKPHGIDSTYL 333 +I +++ + S + +G+ + + P D ++ Sbjct: 70 KTDIPENKLKIAKKNDVLIPSSGETAIDIATASCIYLNKGVAVGGDINILTPQKQDDRFI 129 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARI 389 + + + ++ L D+K L ++ P +EQ I + + +I Sbjct: 130 SLSINGINKNELSKYAQGKTVVHLYNNDIKNLKIVFPSEFEEQVRIGDFFSKLDRQI 186 >gi|330899951|gb|EGH31370.1| Type I restriction-modification system specificity subunit [Pseudomonas syringae pv. japonica str. M301072PT] Length = 441 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 63/441 (14%), Positives = 137/441 (31%), Gaps = 48/441 (10%) Query: 23 KHWKVVPIKRFTK-------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ VP+ + +N +I + + G + Sbjct: 2 SDWRFVPLGDLIESLDAGVSVNAEDRPHGAGEIGVLKTSAISGGEFHAEQNKAVLQSERR 61 Query: 76 STVSIFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 IL ++ A + L+P+D + ++ Sbjct: 62 LIAEPVQADSILVSRMNTPALVGESCYVAEAYPMLFLPDRLWQLKPRDRMQVNMRWLSFV 121 Query: 131 I---DVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + D +E G TM + + + P+ PPL+EQ +I + + I Sbjct: 122 LQSADYRSYVEVHATGTSGTMKNLPKSKMLSFPVLYPPLSEQKIIAQILDTLDTIIRETE 181 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV--------GLVPDHWEVKPF 237 + + L L+ ++T+G++ + +++ S E G +P W Sbjct: 182 SILDKLKALKH----GLLHDLLTRGIDANGELRPSQSEAPQLYKESQWGCIPKEWRQTST 237 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES-------------YETYQ 284 L + + + T + ++ G + Sbjct: 238 RELCSLITKGTTPAANNMWQGSEGVKFLRVDNLSFDGQLDFDASRFQISLGTHRGELSRS 297 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLC 343 I PG+++ + K L + Q+ E I + A +P+ + L WL S Sbjct: 298 ICLPGDVLTNIVGPPLGKLGLVTKQMGEVNINQAIALFRPEPNLLPGFLLLWLGGSPAQT 357 Query: 344 KV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + A + + +L + LP+ ++EQ I + I R+ + Sbjct: 358 WLRKRAKQTSGQVNLTLALCQELPIPKISLEEQQLIVDRIEKMHERL----SVGTSELSK 413 Query: 403 LKERRSSFIAAAVTGQIDLRG 423 L + + + +TG++ + Sbjct: 414 LHHMKYAMMDDLLTGRVRVTP 434 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 73/215 (33%), Gaps = 19/215 (8%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVESGTGK 62 YK+S QW G IPK W+ + L T T+ + + ++ ++++ Sbjct: 220 YKES--QW-GCIPKEWRQTSTRELCSLITKGTTPAANNMWQGSEGVKFLRVDNLSFDGQL 276 Query: 63 YLPKDGNSRQSDTS----TVSIFAKGQILYGKLGPYLRKAI-IADFDGICS----TQFLV 113 T + SI G +L +GP L K + G + Sbjct: 277 DFDASRFQISLGTHRGELSRSICLPGDVLTNIVGPPLGKLGLVTKQMGEVNINQAIALFR 336 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 +P + LL S T + + + + +P+P L EQ LI ++ Sbjct: 337 PEPNLLPGFLLLWLGGSPAQTWLRKRAKQTSGQVNLTLALCQELPIPKISLEEQQLIVDR 396 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I R+ +E + + L++ V Sbjct: 397 IEKMHERLSVGTSELSKLHHMKYAMMDDLLTGRVR 431 >gi|170017257|ref|YP_001728176.1| putative restriction-modification enzyme type I S subunit [Leuconostoc citreum KM20] gi|169804114|gb|ACA82732.1| Putative restriction-modification enzyme type I S subunit [Leuconostoc citreum KM20] Length = 397 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 50/402 (12%), Positives = 117/402 (29%), Gaps = 36/402 (8%) Query: 24 HWKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDV-----ESGTGKYLPKDGNSRQSD 74 W+ + + + + + + I ++ D+ YL Sbjct: 16 DWEERKLGDMMDVTSVKRIHQSDWTNSGIRFLRARDIVSAAKNEEPSDYLYISEEKYNEY 75 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSID 132 + ++G +L +G +I D + I + + + + + + Sbjct: 76 SKISGKVSQGDLLVTGVGSIGVPLLITDDNPIYFKDGNIIWFKNEHKIDGNFFYYSFINN 135 Query: 133 VTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 Q+ G T+ + +P EQ I ++D I R Sbjct: 136 KIQKYIRDVAGIGTVGTYTIDSGKKTRISLPTYDEQNKIGSF----FKQLDNTIALHQRK 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++LLKE+K+ + + K +++ +G D WE + + + Sbjct: 192 LDLLKEQKKGFLQKMFPKNGAKIPELRFAG------FTDDWEERKLGEIFDYEQPTKYIV 245 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + ++ ++ +G E +V V Sbjct: 246 QSTEYDDTFNTPVLTAGKSFLLGYTDEISGIKNATVENPVVIFDDFTTGSHY------VD 299 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 I S+ M + +S ++ + K + + P Sbjct: 300 FPFKIKSSAMKLLSLNDNSDNFYFMFNTLKNIKYVPQS----HERHWISKFSEFEIYKPS 355 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I ++D + +Q + LLK+++ F+ Sbjct: 356 QEEQQKIGPF----FKQLDNTIALHQQKLDLLKQQKKGFLQK 393 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 61/206 (29%), Gaps = 9/206 (4%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVK---PFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 P ++ K +W N L +I+S + Sbjct: 5 TPQIRFKGFTDDWEERKLGDMMDVTSVKRIHQSDWTNSGIRFLRARDIVSAAKNEEPSDY 64 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + E + V G+++ + L + + H I Sbjct: 65 LYISEEKYNEYSKISGKVSQGDLLVTGVGSIG-VPLLITDDNPIYFKDGNIIWFKNEHKI 123 Query: 329 DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + + + K + G + + K+ + +P EQ I + Sbjct: 124 DGNFFYYSFINNKIQKYIRDVAGIGTVGTYTIDSGKKTRISLPTYDEQNKIGSF----FK 179 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAA 413 ++D + ++ + LLKE++ F+ Sbjct: 180 QLDNTIALHQRKLDLLKEQKKGFLQK 205 >gi|298575369|ref|NP_247095.2| Type I restriction-modification enzyme subunit S [Methanocaldococcus jannaschii DSM 2661] gi|2826248|gb|AAB98112.1| type I restriction-modification enzyme 2, S subunit [Methanocaldococcus jannaschii DSM 2661] Length = 343 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 58/331 (17%), Positives = 109/331 (32%), Gaps = 35/331 (10%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFT-KLNTGRTSE-------SGKDIIYIGLEDVESG 59 +K + IG IP+ W++V +K K+ G T + I ++ +ED+ + Sbjct: 6 ENFKKTE---IGEIPEDWEIVELKDVCKKIKAGGTPKTSVEEYYKNGTIPFVKIEDITNS 62 Query: 60 TGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 + + S I K +L+ G A I + + L + PK Sbjct: 63 NKYLTNTKIKITEEGLNNSNAWIVPKNSVLFAMYGSIGETA-INKIEVATNQAILGIIPK 121 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 D + E + + + T + + + + + +P+PPL EQ I + + Sbjct: 122 DNILESEFLYYILAKNKNYYSKLGMQTTQKNLNAQIVKSFKIPLPPLEEQKQIAKIL--- 178 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 +ID I + I L+ K+ L+ ++TKG+ K +G +P+ WEV Sbjct: 179 -TKIDEGIEIIEKSINKLERIKKGLMHKLLTKGIGHSRFKKS----EIGEIPEDWEVFEI 233 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQ-------------KLETRNMGLKPESYETYQ 284 + +S N I R + Sbjct: 234 KDIFEVKTGTTPSTKKSEYWENGEINWITPLDLSRLNEKIYIGSSERKVTKIALEKCNLN 293 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 ++ G I+ L +G Sbjct: 294 LIPKGSIIISTRAPVGYVAVLTVESTFNQGC 324 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 20/205 (9%) Query: 224 WVGLVPDHWEVKPFFALVT------------ELNRKNTKLIESNILSLSYGNIIQKLETR 271 +G +P+ WE+ + E KN + I ++ N Sbjct: 12 EIGEIPEDWEIVELKDVCKKIKAGGTPKTSVEEYYKNGTIPFVKIEDITNSNKYLTNTKI 71 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + + IV ++F + +E + + I + Sbjct: 72 KITEEGLNNSNAWIVPKNSVLFAMYGSIGETAI----NKIEVATNQAILGIIPKDNILES 127 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + + + +++L + VK + +PP++EQ I ++ +ID Sbjct: 128 EFLYYILAKNKNYYSKLGMQTTQKNLNAQIVKSFKIPLPPLEEQKQIAKIL----TKIDE 183 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVT 416 +E IE+SI L+ + + +T Sbjct: 184 GIEIIEKSINKLERIKKGLMHKLLT 208 >gi|284800800|ref|YP_003412665.1| hypothetical protein LM5578_0548 [Listeria monocytogenes 08-5578] gi|284993986|ref|YP_003415754.1| hypothetical protein LM5923_0547 [Listeria monocytogenes 08-5923] gi|284056362|gb|ADB67303.1| hypothetical protein LM5578_0548 [Listeria monocytogenes 08-5578] gi|284059453|gb|ADB70392.1| hypothetical protein LM5923_0547 [Listeria monocytogenes 08-5923] Length = 389 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 52/398 (13%), Positives = 124/398 (31%), Gaps = 34/398 (8%) Query: 25 WKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTVS 79 W+ + + G + GK I + D+ ++ + + Sbjct: 12 WEQRELSSLLSFSNGINAPKEHYGKGRKMISVMDILDEKPVKYEFIRNSVQVDKKIESKN 71 Query: 80 IFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 G I++ + + + S + + + L+ Sbjct: 72 KVEYGDIVFVRSSEVPEEVGWAKAYLEKEYALYSGFSIRGKKINEFNPYFVELTLNSINR 131 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++IE G+T + + +I + +P + EQ KI +++DT I R ++ Sbjct: 132 KQIERKAGGSTRFNVSQTILSSIELLMPEIEEQ----NKIDKFFIQLDTTIALHQRKLDT 187 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LK K+ + + +++ + + ++ E+ + L Sbjct: 188 LKRMKKGFLQQMFPNNEEKVPRLRFADFDEEWEQ----------RMLNEIANRYDNLRVP 237 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 S + E + GE + D ND ++ V + Sbjct: 238 ITASARSSGTTPYYGANGIQDYVEGFT-----HDGEFILVAEDGANDVKNYPVQYVNGKI 292 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + + ++ + +LM + + K+ + G R L + + L V P + Sbjct: 293 WVNNHAHVLQAKE-NKHDNKFLMNAIKILKIEPFLVGGGRAKLNSDVMMTLMVKFPCYEG 351 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 Q I + R+D + + I L + +++ Sbjct: 352 QKKIGTFL----QRLDNTITLHKNKINKLSSLKKTYLQ 385 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 63/164 (38%), Gaps = 6/164 (3%) Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +++ + ++ RN + E+ V+ G+IVF ++ A + Sbjct: 39 KMISVMDILDEKPVKYEFIRNSVQVDKKIESKNKVEYGDIVFVRSSEVPEEVGWAKAYLE 98 Query: 312 ERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + S + + + ++ + S + ++ G R ++ + + +L+ Sbjct: 99 KEYALYSGFSIRGKKINEFNPYFVELTLNSINRKQIERKAGGSTRFNVSQTILSSIELLM 158 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I+EQ I + ++D + ++ + LK + F+ Sbjct: 159 PEIEEQNKI----DKFFIQLDTTIALHQRKLDTLKRMKKGFLQQ 198 >gi|315652288|ref|ZP_07905280.1| type I restriction system specificity protein [Eubacterium saburreum DSM 3986] gi|315485411|gb|EFU75801.1| type I restriction system specificity protein [Eubacterium saburreum DSM 3986] Length = 421 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 48/414 (11%), Positives = 122/414 (29%), Gaps = 36/414 (8%) Query: 11 KDSGVQW--IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 K+ V+W IG IP+ K+ T + ++ + G + +++ Sbjct: 13 KNEKVEWKEIGDIPE----------IKVITVKKKLKKQEYLREGDYPIIDQGQEFIVGYT 62 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 N + G + F + + + ++ Sbjct: 63 NDNDAIIDKYPCVIFGD--------------HTESIKYVDFAFAQGADGIKILKTDEKYI 108 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + I + + + + +PIP + Q I + + T + L E Sbjct: 109 KSRYLYHTILSYYKLEGKYMRHFSLLRKTLIPIPSIKTQEKIVKTLDKFTEYVTELQAEL 168 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV-KPFFALVTELNRK 247 ++ + + + ++++ + K + G + Sbjct: 169 QAELQYRTNQYEYYRNMLLSEEYLNKLSKKLLDVSEGGTNRLCCTTLGDIGKFTRGNGLQ 228 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKP----ESYETYQIVDPGEIVFRFIDLQNDKR 303 + + YG I K + E +E + G+I+ + Sbjct: 229 KSDFASHGKPVIHYGQIYTKYGFETNEVISFVSEELFEKLRKARQGDILMATTSENIEDV 288 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDV 362 I S M + Y+A+ ++ + K +G + + +D+ Sbjct: 289 GKCVVWTGNEEIGFSGDMYSYRTTENPKYIAYYFQTAEFQKQKEKKVTGTKLIRIHGDDM 348 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ + +PP+ Q I +++ A + + + I ++ R + Sbjct: 349 EKFSIHLPPLSLQNKIVEILDKFQAILSETRGLLPKEIEERQKQYEYYREKLLT 402 >gi|161871030|ref|YP_001598938.1| type I restriction enzyme [Neisseria meningitidis 053442] gi|161596583|gb|ABX74243.1| type I restriction enzyme [Neisseria meningitidis 053442] Length = 405 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 47/395 (11%), Positives = 112/395 (28%), Gaps = 31/395 (7%) Query: 26 KVVPIKR---FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + P+ + TG+ K + + G Y + Sbjct: 20 EWKPLGGENGIAIIKTGQAVSKQK---------ISNNIGSYPVINSGKEPLGYIDEWNTE 70 Query: 83 KGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G + + + + V ++ + + ++ Q I A+C Sbjct: 71 NDPIGITTRGAGVGSITWQEGRYFRGNLNYAVTIKNRTELDVRFLYHILLEFEQEIHALC 130 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + + + + +PIPPL Q I + + T L R R ++ Sbjct: 131 TFTGIPALNASNLKKLLIPIPPLETQQKIVKILDKFTELEAEL-ALRKRQYRYYRDFLLD 189 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + I ++KD + +G V T +I Sbjct: 190 FDNQIGGIADGYKGRLKDVVWKTLGEV------FDLKNGYTPSKSNKEYWENGSIPWFRM 243 Query: 262 GNIIQKLETRNMGLKPESY---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 +I + + LK S + ++ I+ + ++ + + + Sbjct: 244 EDIRENGRILDNSLKHISKSAVKGGKLFPAKSIMMSTTATIGEHALIKVNYISNQQLTNF 303 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 +D + + S L + +++K+L + +PP+ EQ I Sbjct: 304 TIKDEFKDALDINFAFYYFFIIAEQSKKLINTSSL-PIISMKELKKLKIPIPPLPEQEKI 362 Query: 379 TNVINVETARIDVL-------VEKIEQSIVLLKER 406 +++ + + + +E+ Sbjct: 363 AAILDKFDTLTHSISEGLPHEIALRRKQYEYYREQ 397 >gi|218698186|ref|YP_002405853.1| Type I restriction enzyme EcoAI specificity protein (S protein) (S.EcoAI) [Escherichia coli 55989] gi|218354918|emb|CAV02125.1| Type I restriction enzyme EcoAI specificity protein (S protein) (S.EcoAI) [Escherichia coli 55989] Length = 578 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 54/487 (11%), Positives = 132/487 (27%), Gaps = 96/487 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDV 56 +K K P+ S + +P+ W+ V + ++ GR + + + + ++ Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVRVADLMEVINGRAYKKHEMLQTGTPLLRVGNL 140 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + + G ++Y + + + + Sbjct: 141 ------FTSNEWYYSDLQLDENKYINNGDLIYAWSASFGPFIWTGEKVIYHYHIWKLNLF 194 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + +T +I++ G M H + + + +PP+ EQ I KI Sbjct: 195 AEEYSNKYFIHDFLLSITDKIKSQGNGIAMLHMTKEKMEQQIIALPPINEQQQIVRKIRE 254 Query: 177 ET-----------------------------------------VRIDTLITERIRFIELL 195 T RI + Sbjct: 255 LTVLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNAEELAENWARISEHFDTLFTTEASV 314 Query: 196 KEKKQALVSYIVTKGLNPDVKMKD-------------------------------SGIEW 224 KQ ++ V L P + S E Sbjct: 315 DTLKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLPPISDEEK 374 Query: 225 VGLVPDHWEVKPFFALVT----ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE-- 278 +P+ WE L + + ++++ L N+ + + + E Sbjct: 375 PFELPEGWEWCRIDDLTFVSGGIQKQPKRRPVKNHFPYLRVANVQRGDINIDKLERFEVE 434 Query: 279 -SYETYQIVDPGEIVF--RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLA 334 + ++ +I+ R +E+ + + + V+ ++A Sbjct: 435 PHELAFWSLEKNDILIVEGNGSADEIGRCAIWHAPIEKCVYQNHLIRVRGIIEGYQEFIA 494 Query: 335 WLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + + +L ++ + + +PP+ +Q I + I + L Sbjct: 495 LYLNSPSGIKEMQRLAVTTSGLYNLSVGKIRGITIPLPPLNQQNLILSRIREYILACENL 554 Query: 393 VEKIEQS 399 + + Sbjct: 555 KTSTQSA 561 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 63/193 (32%), Gaps = 5/193 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE L+ +N + K E + + Sbjct: 93 SEEEKPFELPEGWEWVRVADLMEVINGRAYKKHEMLQTGTPLLRVGNLFTSNEWYYSDLQ 152 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + + ++ G++++ + + I + ++ + S Sbjct: 153 LDENKYINNGDLIYAWSASFGPFIWTGEKVIYHYHIW--KLNLFAEEYSNKYFIHDFLLS 210 Query: 340 YDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + G+G + E +++ + +PPI EQ I I T D L ++ Sbjct: 211 --ITDKIKSQGNGIAMLHMTKEKMEQQIIALPPINEQQQIVRKIRELTVLCDQLEQQSLT 268 Query: 399 SIVLLKERRSSFI 411 S+ ++ + + Sbjct: 269 SLDAHQQLVETLL 281 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 62/202 (30%), Gaps = 13/202 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+ I T ++ G + Y+ + +V+ G + + Sbjct: 377 ELPEGWEWCRIDDLTFVSGGIQKQPKRRPVKNHFPYLRVANVQRGDINIDKLERFEVEPH 436 Query: 75 TSTVSIFAKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL----QG 126 K IL G R AI C Q +++ + ++ Sbjct: 437 ELAFWSLEKNDILIVEGNGSADEIGRCAIWHAPIEKCVYQNHLIRVRGIIEGYQEFIALY 496 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + + + + I I +P+PPL +Q LI +I + + L T Sbjct: 497 LNSPSGIKEMQRLAVTTSGLYNLSVGKIRGITIPLPPLNQQNLILSRIREYILACENLKT 556 Query: 187 ERIRFIELLKEKKQALVSYIVT 208 + AL + Sbjct: 557 STQSAQQTQLHLADALTDAAIN 578 >gi|315651210|ref|ZP_07904240.1| 50S ribosomal protein L10 [Eubacterium saburreum DSM 3986] gi|315486506|gb|EFU76858.1| 50S ribosomal protein L10 [Eubacterium saburreum DSM 3986] Length = 367 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 51/395 (12%), Positives = 127/395 (32%), Gaps = 43/395 (10%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + + G+ + V S G +P G + +++ K +L G Sbjct: 8 LSELVTIKYGKNQKK-----------VHSDDGN-IPIYGTGGLMGYAKTALYDKPSVLIG 55 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + G + + T F + D++ +++S+ + EG T+ Sbjct: 56 RKGTIGKVKYVEHPFWTVDTLFYTIINTDIVTPKYLYYVMSL---IDLNNYNEGTTIPSL 112 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + + + IP + EQ ++ + ID I L+++ +A+ S Sbjct: 113 RTETLNRLEFNIPSIEEQEIVLSCLNP----IDEKIELNNAINNNLEQQAKAIFSKEFLT 168 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE--LNRKNTKLIESNILSLSYGNIIQK 267 +E + + + + + + E+ I L + Q Sbjct: 169 ------------LETLPDGWNQASLIDIADYLNGLAMQKYRPTADETGIPVLKIKELRQT 216 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 N L + ++ I+ G+++F + L + V + Sbjct: 217 CCDDNSELCSPNIKSEYIIQDGDVIFSWSGSL-----LVDFWCGGICGLNQHLFKVTSNK 271 Query: 328 IDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + + + + + A + +K +++ + VL+P + I ++ Sbjct: 272 YNKWFYYAWTKHHLDRFIAVAADKATTMGHIKRDELAKAKVLIPNEADYQRIGALL---- 327 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I L+ L R + + ++G++D+ Sbjct: 328 QPIYDLIISNRIENKKLSSLRDTLLPKLMSGELDV 362 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 56/167 (33%), Gaps = 4/167 (2%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K + S ++++ YG +K+ + + + + + L K ++ Sbjct: 2 KFKRYALSELVTIKYGKNQKKVHSDDGNIPIYGTGGLMGYAKTALYDKPSVLIGRKGTIG 61 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + +E T + D +L L + SL+ E + RL Sbjct: 62 KVKYVEHPFWTVDTLFYTIINTDIVTPKYLYYVMSLIDLNNYNEGTTIPSLRTETLNRLE 121 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P I+EQ ++ ID +E L+++ + + Sbjct: 122 FNIPSIEEQ----EIVLSCLNPIDEKIELNNAINNNLEQQAKAIFSK 164 >gi|126176529|ref|YP_001052678.1| restriction modification system DNA specificity subunit [Shewanella baltica OS155] gi|125999734|gb|ABN63809.1| restriction modification system DNA specificity domain [Shewanella baltica OS155] Length = 363 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 76/393 (19%), Positives = 135/393 (34%), Gaps = 36/393 (9%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + + +DI YIGLE V T + L K + S F KG IL+ Sbjct: 6 LNEVADEIRESFSPTPDEDIPYIGLEHVSQQTLQLLGKGSSLNVE--SNKYKFKKGDILF 63 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G L PY RK IA FDG+CST++ V++PK + L+ + Sbjct: 64 GTLRPYFRKVTIAPFDGVCSTEYSVIRPKKADYTNFVFYFLANEKFIEYATTNSVGARPR 123 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 WK + + E+ I K+ A I+ R I+LL+E + L Sbjct: 124 TKWKLFSDYKVRKTRNQEKFDIGFKLRALDDLIEN----NRRRIQLLEESARLLYQEWFV 179 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 P ++ ++ + VP+ WE KP + T K K ++ Sbjct: 180 HLRFPGHEL----VKVIDGVPEGWEKKPIKQIATLNYGKALKAEVRIPGPFPVYGSSGEV 235 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 S+E + PG +V R ++ + + + + + Sbjct: 236 G---------SHEKALVKGPGIVVGRKGNVGSI------------FWVNTDFYPIDTVYF 274 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 S + L + L V + L + +L+P K ++ Sbjct: 275 ISAEESSLFLYHALQNVQFINTDVAVPGLNRDMAYSREILIPDHKNYQR---FLSEV-QP 330 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I + ++ L + R + ++G++ + Sbjct: 331 IQKQINNLQDYNNKLAQARDLLLPKLMSGELTV 363 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 56/184 (30%), Gaps = 20/184 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W+ PIK+ LN G+ ++ I P G+S + + ++ Sbjct: 195 VPEGWEKKPIKQIATLNYGKALKAEVRIP------------GPFPVYGSSGEVGSHEKAL 242 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 I+ G+ G + T + + + L Q ++ I Sbjct: 243 VKGPGIVVGRKGNVGSIFWVNTDFYPIDTVYFISAEESSL--------FLYHALQNVQFI 294 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + + + IP ++ +I+ L + + Sbjct: 295 NTDVAVPGLNRDMAYSREILIPDHKNYQRFLSEVQPIQKQINNLQDYNNKLAQARDLLLP 354 Query: 201 ALVS 204 L+S Sbjct: 355 KLMS 358 >gi|83647701|ref|YP_436136.1| restriction endonuclease S subunit [Hahella chejuensis KCTC 2396] gi|83635744|gb|ABC31711.1| Restriction endonuclease S subunit [Hahella chejuensis KCTC 2396] Length = 406 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 62/420 (14%), Positives = 140/420 (33%), Gaps = 34/420 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGT---GKYLPKDGNSRQSDTS 76 W V + + G+T + + ++DV+ G + + + Sbjct: 2 SWDRVGLSTVADVFNGKTPSKAEQRDEGFPVLKIKDVDENFKFRGAFQSFVDDEFYAKHK 61 Query: 77 TVSIFAKGQILYGK------LGPYLRKAIIADFDGICSTQFLVLQPKDVL--PELLQGWL 128 I ++ +G A D + + ++LV + K + P+ L WL Sbjct: 62 AKKIQLHDSMILNAAHNSDYVGSKQYCAEEDVVDSVATGEWLVCRAKQGVLSPKFLNFWL 121 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S ++ + +G H K + + +P+PPL Q I + Sbjct: 122 RSEATRFEMKGLVKG---IHLYPKDVARLEIPLPPLETQKQIAAILEKADQLRKDCQQME 178 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 L Q++ + +P K + + + PF + + + ++ Sbjct: 179 QELNNLA----QSVFMDMFG---DPVSNPKGWNKASLRSISTKFNDGPFGSNLKTSHYRD 231 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + ++ G E+ E + PG+IV + N + + Sbjct: 232 SGVQVIRLTNIGTGWFKNDDRAFVSVEHAETLEKFH-CKPGDIVIATLGDPNLRACIIPD 290 Query: 309 QVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRL 365 +V I + + P+ YL + + G R + + + Sbjct: 291 EVPL-AINKADCVHCVPNTKIVRKEYLVEFLNLPSTLRSIENKLHGQTRTRISSGQLAEV 349 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 VL+PP+ EQ N I + + L + V ++ +S + A G+++++ ++ Sbjct: 350 DVLIPPLSEQDKFMNAIWLRDKELKRL----QDQNVAFEDLFNSLMQKAFNGELNIKNKA 405 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 69/205 (33%), Gaps = 18/205 (8%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPK-DGNSRQ 72 PK W ++ + K N G + + I L ++ +G K + + Sbjct: 200 PKGWNKASLRSISTKFNDGPFGSNLKTSHYRDSGVQVIRLTNIGTGWFKNDDRAFVSVEH 259 Query: 73 SDTSTVSIFAKGQILYGKLG-PYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQ--GW 127 ++T G I+ LG P LR II D I + P + + Sbjct: 260 AETLEKFHCKPGDIVIATLGDPNLRACIIPDEVPLAINKADCVHCVPNTKIVRKEYLVEF 319 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L + IE G T + + + + IPPL+EQ I + L + Sbjct: 320 LNLPSTLRSIENKLHGQTRTRISSGQLAEVDVLIPPLSEQDKFMNAIWLRDKELKRLQDQ 379 Query: 188 RIRFIELLKEKKQALVSYIVTKGLN 212 + F +L +L+ LN Sbjct: 380 NVAFEDLFN----SLMQKAFNGELN 400 >gi|16272178|ref|NP_438384.1| type I restriction/modification specificity protein [Haemophilus influenzae Rd KW20] gi|260580902|ref|ZP_05848726.1| type I restriction/modification specificity protein [Haemophilus influenzae RdAW] gi|12229974|sp|P71344|T1SI_HAEIN RecName: Full=Putative type I restriction enzyme specificity protein HI_0216; Short=S protein gi|1573175|gb|AAC21883.1| type I restriction/modification specificity protein (hsdS) [Haemophilus influenzae Rd KW20] gi|260092391|gb|EEW76330.1| type I restriction/modification specificity protein [Haemophilus influenzae RdAW] Length = 385 Score = 93.7 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 64/386 (16%), Positives = 119/386 (30%), Gaps = 35/386 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + P+ + + +G N+ Q + +G+ Sbjct: 18 EWKPLDEVANIVNNARKPVKSSLRV---------SGNIPYYGANNIQDYVEGYT--HEGE 66 Query: 86 ILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G A + V+ K+ L L+ A Sbjct: 67 FVLIAEDGSASLENYSIQWAVGKFWANNHVHVVNGKEKLNNRFLYHYLTNMNFIPFLA-- 124 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + + IP+PIPPL+ Q I + + A T L +E + +Q Sbjct: 125 -GKERAKLTKAKLQQIPIPIPPLSVQTEIVKILDALTALTSELTSELTSELTSELILRQK 183 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 Y K LN D K + +G V K KN +I Sbjct: 184 QYEYYREKLLNIDEMNK---VIELGDVGPVRMCKRIL--------KNQTASSGDIPFYKI 232 Query: 262 GNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G +K + + Y+ Y G+I+ E + Sbjct: 233 GTFGKKPDAYISNELFQEYKQKYSYPKKGDILISASGTIGRTVIF----DGENSYFQDSN 288 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + +L Y + K A G G Q L +++K++ + +PP+KEQ I + Sbjct: 289 IVWIDNDETLVLNKYLYHFYKIAKWGIAEG-GTIQRLYNDNLKKVKISIPPLKEQHRIVS 347 Query: 381 VINVETARIDVLVEKIEQSIVLLKER 406 +++ + + E + +I ++R Sbjct: 348 ILDKFETLTNSITEGLPLAIEQSQKR 373 >gi|323143495|ref|ZP_08078178.1| type I restriction modification DNA specificity domain protein [Succinatimonas hippei YIT 12066] gi|322416780|gb|EFY07431.1| type I restriction modification DNA specificity domain protein [Succinatimonas hippei YIT 12066] Length = 401 Score = 93.7 bits (231), Expect = 6e-17, Method: Composition-based stats. Identities = 43/408 (10%), Positives = 113/408 (27%), Gaps = 31/408 (7%) Query: 29 PIKRFT-KLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 +K K+ +G T + + +I ++ + ++ ++ Sbjct: 5 KLKDICTKIYSGGTPSTKEPKYWGGNIPWLSSSESGKDFIYETDNYISNLALKETSTKYV 64 Query: 82 AKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPK-DVLPELLQGWLLSIDVTQRIE 138 +K ++ G + + + L+ + L + L ++ Sbjct: 65 SKNTVIIATAGEGKTRGQVSYLKIGACINQSLIALETDAKKVDSLFLYYYLKNSYSRIRS 124 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + + IP ++EQ+ I + + ++I + L K Sbjct: 125 LSNATGIRGSLSGARLKELIVFIPDVSEQLKISDLLYKLDLKIQNNKKQIEILETLAKTI 184 Query: 199 KQALVSYIVTKGLNPDVKMKDSG------IEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 K SG E +P+ W + + + K + Sbjct: 185 YDYWFVQ-FDFPNEEGKPYKSSGGKMVWNEELKREIPEGWRCIKLCNIFSFIKGKIPQK- 242 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 L + + + Y + + D + V Sbjct: 243 ------LLEQKEPSLEQYITIDVANGGTPLYCLPALMPYCNSETIMVMDGAASGDVYVGI 296 Query: 313 RGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 G++ S + +K D S +L+ + A + ++ + + +P Sbjct: 297 DGVLGSTFSMLKSKREDISNSYIYLILNSLKKIYKKANTGSTVPHANRKYIENMVIALPN 356 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ++ + I ++ + I L +S + + GQ+ Sbjct: 357 D------CKFLSRKFDEIYAQIKLQKLLIKNLNSLKSFLLPLLMNGQV 398 Score = 42.5 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 69/200 (34%), Gaps = 17/200 (8%) Query: 10 YKDSG--VQWIGA----IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + W IP+ W+ + + G+ + + LE Sbjct: 202 YKSSGGKMVWNEELKREIPEGWRCIKLCNIFSFIKGKIPQKLLEQKEPSLE----QYITI 257 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 +G + + + + + G + DG+ + F +L+ K Sbjct: 258 DVANGGTPLYCLPALMPYCNSETIMVMDGAASGDVYV-GIDGVLGSTFSMLKSKREDISN 316 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 +L+ + + + G+T+ HA+ K I N+ + +P + + + I Sbjct: 317 SYIYLILNSLKKIYKKANTGSTVPHANRKYIENMVIALPNDC------KFLSRKFDEIYA 370 Query: 184 LITERIRFIELLKEKKQALV 203 I + I+ L K L+ Sbjct: 371 QIKLQKLLIKNLNSLKSFLL 390 >gi|49658898|emb|CAF28524.1| putative HsdS-like DNA methylase [Yersinia pseudotuberculosis] Length = 449 Score = 93.7 bits (231), Expect = 6e-17, Method: Composition-based stats. Identities = 71/473 (15%), Positives = 140/473 (29%), Gaps = 93/473 (19%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 G +W+ + + F L G K SG + G S Sbjct: 2 GSEWLD--------ITLGEFLNLKRGYDLPKSKR---------NSGNIPIISSSGFSGNH 44 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 D + ++ G+ G + + +T V K P L +I+ Sbjct: 45 DKP---MVYGPGVVTGRYGTIGEVFYVNESYWPLNTTLYVDDFKGNSPLFCYYLLQTINF 101 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFI 192 + A + + I + +P + Q I + +++ IT + Sbjct: 102 RAYSDK----AAVPGINRNHIHMANIRVPKSVVTQDNIAVVL----KKLEDKITNNLEIN 153 Query: 193 ELLKEKKQALVSYIVTKG------------------------------------------ 210 + L++ QAL + Sbjct: 154 KTLEQITQALFNSWFVDFEPVKAKIAVLEAGGSQEEATLAAMTAISGKDADSLAIFEREH 213 Query: 211 ------LNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 L ++ S ++ +G P+ W V + VTEL R + ++ Sbjct: 214 PEQYTELKATAELFPSAMQESELGETPEGWNVCNIKSSVTELRRGISPKYTEETDGVTVI 273 Query: 263 N--IIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 N I+ + + I + G+++ R + E I Sbjct: 274 NQKCIRNHTINFSLARLHDSKKRTISGRELQVGDVLVNSTGTGTLGRLAPIRYLAETVIA 333 Query: 317 TSAYMAVKPHGIDSTYLAW--LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 S V+ T L+ Y+ GS + L+ E ++ + PP+ Sbjct: 334 DSHVTVVRADTAKITASYLSGLLMKYEQFIESNGSGSTGQTELRKEVLEEIYFPCPPL-- 391 Query: 375 QFDI-TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I + + T R++ + +EQ I +L + R + + ++G+I L Q Sbjct: 392 ---ILGQLFDKFTNRLNAKLSLLEQQITVLSQLRDTLLPKLLSGEITLPESEQ 441 >gi|327383090|gb|AEA54566.1| hypothetical protein LC2W_2234 [Lactobacillus casei LC2W] Length = 608 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 75/402 (18%), Positives = 131/402 (32%), Gaps = 26/402 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ K + + I + ED+ S G+ S F Sbjct: 221 WEKRKFKDL--VVRVNKTSDDSTIPSVEFEDIISKQGRLNKDVRLKINSKQGIY--FEPQ 276 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +L+GKL PYL+ + F G F VL+ + L+ Q + I G Sbjct: 277 DVLFGKLRPYLQNWLFPSFYGRAVGDFWVLRANSSVLSEYLFVLIQSPRFQIVANISSGT 336 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 M +DW + N PIP +EQ KI +D LI + LK+ K + Sbjct: 337 KMPRSDWNTVSNTSFPIPVQSEQ----RKIWQLFNVLDNLIAATQDKLSFLKKMKMFFLQ 392 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 I + +++ G V H+++ + E K L + Sbjct: 393 QIFPTKNHDVPQIRFDG---FTDVWSHYKLGSLMRIDKEQEVKKELLTDIQKGFYVLAMR 449 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRF----------IDLQNDKRSLRSAQVMERG 314 ++ KP V + + +L R L +A + Sbjct: 450 TFSMDGYIDHSKPYWLNHLDNVSDDKFLLPREFAILDADMDANLPKIGRVLLNASSEKYL 509 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIK 373 + G D ++ LMR + + +G + L ++V + +LVP Sbjct: 510 LAAHVRKIQVKSGNDPIFIYALMRGNSVHERLKLEANGSISKRLLDKNVYKQSILVPNRS 569 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I ++ + +Q I +LK+ + S + Sbjct: 570 EQSRIGR----LFFLLETTITLHQQKIKMLKQVKKSCLQNLF 607 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 51/398 (12%), Positives = 115/398 (28%), Gaps = 40/398 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK-------------DGNSR 71 W+ + + T + + + + Y+ + + + Sbjct: 15 WEKRKLGEIFNVVTDYVANGSFKSLRQRVSTYSNPNFAYMIRLQDASNNWKGPWLYTDQQ 74 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLL 129 + G IL +G + ++ D + ++L+ L L Sbjct: 75 SYSFLAKTKLNPGDILMSNVGSVGKFFLVPDLDRPMTLAPNAILLRSMTYSTYFLFQLLQ 134 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + +T+ I + + I +P L E ++ + + +D LI Sbjct: 135 TSSMTESINEKTTPGVQQKINKTDLKKIITNVPTLNESSMVGQML----SLLDNLIAATQ 190 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 I+ L++ K+AL+ + + W K F + K + Sbjct: 191 DKIDALEQAKKALLQRLFDQ-------------SWRFKGYSDPWEKRKFKDLVVRVNKTS 237 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + Q +++ LK S + +P +++F + S Sbjct: 238 DDSTIPSVEFEDIISKQGRLNKDVRLKINS-KQGIYFEPQDVLFGKLRPYLQNWLFPSFY 296 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + ++ + S YL L++S V + V + Sbjct: 297 GR---AVGDFWVLRANSSVLSEYLFVLIQSPRFQIVANISSGTKMPRSDWNTVSNTSFPI 353 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 P EQ I +D L+ + + LK+ + Sbjct: 354 PVQSEQRKI----WQLFNVLDNLIAATQDKLSFLKKMK 387 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 58/150 (38%), Gaps = 7/150 (4%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K + S+ ++PG+I+ + + + + ++ Sbjct: 65 KGPWLYTDQQSYSFLAKTKLNPGDILMSNVGSVGK--FFLVPDLDRPMTLAPNAILLRSM 122 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + +L L+++ + + + G++Q + D+K++ VP + E ++++ Sbjct: 123 TYSTYFLFQLLQTSSMTESINEKTTPGVQQKINKTDLKKIITNVPTLNE----SSMVGQM 178 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +D L+ + I L++ + + + Sbjct: 179 LSLLDNLIAATQDKIDALEQAKKALLQRLF 208 >gi|182412909|ref|YP_001817975.1| restriction modification system DNA specificity subunit [Opitutus terrae PB90-1] gi|177840123|gb|ACB74375.1| restriction modification system DNA specificity domain [Opitutus terrae PB90-1] Length = 437 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 52/409 (12%), Positives = 114/409 (27%), Gaps = 27/409 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + ++T R + K + + + + + K Sbjct: 30 WETQTLGSLVTISTERVGD-NKCVP-MSITSGVGLVSQMEKFGRVIAGDSYKNYLLLKKN 87 Query: 85 QILYGKLGPYLRK------------AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 Y K A + + C + L L G L Sbjct: 88 DFAYNKSATKEYPEGFIARYSGEALAAVPNSIFTCFRINGDSPIPEYLNYLFLGNLHGQW 147 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + IE D + +P+P+P ++KI +D LI + + Sbjct: 148 LRKFIEVGARAHGSLSIDEDDLLALPVPLPAGRSSRAEQQKIAGCLGTLDELIGAESQNL 207 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMK----DSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + LK K+ L+ + + +++ S +W + ++ Sbjct: 208 DALKAHKKGLMRQLFPREGETLPRLRFAEFHSAPKWEMVPLGAIAEIKLGKMLDCQKHTT 267 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 L+ N + M + + + G++V R+ Sbjct: 268 GLLLPYLNNIAIRWNAVDTSNLPEMYFDDHELDRFG-LKAGDVVVCEGG--EPGRAAVWD 324 Query: 309 QVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLP 366 + A V+ + + L + + F + G + L E +L Sbjct: 325 GRLPDLKFQKAVHRVRFNVPFEPHLLVQYLEAIAGTPQFEKLFTGGGIKHLTRETFAKLE 384 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 V + EQ I + + +D L+ +V L+ + + Sbjct: 385 VPLISESEQHRIATCL----SSLDDLIAAQSDRLVALQTHKQGLLQQLF 429 >gi|113866036|ref|YP_724525.1| Type I restriction-modification system specificity subunit [Ralstonia eutropha H16] gi|113524812|emb|CAJ91157.1| Type I restriction-modification system specificity subunit [Ralstonia eutropha H16] Length = 422 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 55/422 (13%), Positives = 134/422 (31%), Gaps = 45/422 (10%) Query: 30 IKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + G+ + Y+ E + + A+G Sbjct: 7 LGDHITVQKGKAPLVTGYVGKGAEPYLSPEYLRG-------RAPADLAKAGPDAVRAAEG 59 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + + G + + + ST + + ++ + ++A G Sbjct: 60 ETILLWDGSNAGEFFRSKVGLVASTMTKISPSSVF--RPAYFFHVAKQAERFLKAQTNGT 117 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + H D + + I + P EQ L+ E + I I LK KQ L+ Sbjct: 118 GIPHVDRELLEGIKVFCPGSTEQQLLAEILDTLDTAIYE----TEAIIAKLKAVKQGLLH 173 Query: 205 YIVTKGLNPDVKMKDSGIE--------WVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 ++T+G++ + +++ E +G +P+ W + P + + T Sbjct: 174 DLLTRGIDANGELRPPQAEAPHLYESSPLGWIPNEWGLAPTATRCHLITKGTTPAANEMW 233 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQI-------------VDPGEIVFRFIDLQNDKR 303 + ++ G T+++ G+++ + K Sbjct: 234 QGGAGIRFLRVDNLSFDGQLDLDASTFRVSLATHKGFLARSRCLEGDVLTNIVGPPLGKL 293 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFED 361 L + ++ E I + + + +L + S + + +L Sbjct: 294 GLVTKEIGEVNINQAIALFRPTEQLLPKFLLIWLSSSISQSWLRNRAKQTSGQVNLTLAL 353 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + LP+ I EQ I + ++ +I E+ I ++ +S + +TG++ + Sbjct: 354 CQELPLPRMTINEQQAIVDRVDAAQEQIWC----EEELIRKMRLEKSGLMDDLLTGRVRV 409 Query: 422 RG 423 + Sbjct: 410 KP 411 >gi|261884856|ref|ZP_06008895.1| type I restriction-modification system, S subunit [Campylobacter fetus subsp. venerealis str. Azul-94] Length = 319 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 85/209 (40%), Gaps = 16/209 (7%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 G +P WEV + + RKNT ++ + + +I++ + + Y + Sbjct: 11 GRIPKEWEVVRLGDVFQRVTRKNTVNSDNVLTISAQNGLIKQENFFTKSVASKDLSNYIL 70 Query: 286 VDPGEIVFRFIDLQND-KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 ++ GE + + + + G++++ Y+ K +S + + L K Sbjct: 71 LEKGEFAYNKSYSSGYPMGATKRLNLYNYGVLSNLYIYFKIKNGNSDFYEQYFEAGLLNK 130 Query: 345 VFYAMGS-GLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVET---ARIDVLVEKI 396 + + G R ++ D + + +PP+KEQ I ++++ + +D L+ + Sbjct: 131 EIHQIAQEGARNHGLLNISVVDFFNILIALPPLKEQEKIADILSTWDMAISNLDELIIQK 190 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDLRGES 425 ++ +++ + ++ +I + + Sbjct: 191 QK-------LKTALMQNLLSAKIRFKEFT 212 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 48/289 (16%), Positives = 95/289 (32%), Gaps = 19/289 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 YK + V G IPK W+VV + + T + + + +++ I ++ + K Sbjct: 4 SYKQTAV---GRIPKEWEVVRLGDVFQRVTRKNTVNSDNVLTISAQNGLIKQENFFTK-- 58 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPEL 123 + D S + KG+ Y K G+ S ++ + K+ + Sbjct: 59 SVASKDLSNYILLEKGEFAYNKSYSSGYPMGATKRLNLYNYGVLSNLYIYFKIKNGNSDF 118 Query: 124 LQGWLLSIDVTQRIEAICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + + + I I + +H NI + +PPL EQ I + + + Sbjct: 119 YEQYFEAGLLNKEIHQIAQEGARNHGLLNISVVDFFNILIALPPLKEQEKIADILSTWDM 178 Query: 180 RIDTLITERIRFIELLKEKKQALVSYI--VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 I L I+ +L Q L+S + P ++K I + Sbjct: 179 AISNLDELIIQKQKLKTALMQNLLSAKIRFKEFTAPWQEVKLGDILDYEQPTKYIVN--- 235 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 K L L Y + + + + + + T Sbjct: 236 NVAYVNDMYKIPVLTAGKSFILGYTDETNGIYDKLPVILFDDFTTDTKF 284 >gi|302346749|ref|YP_003815047.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] gi|302150720|gb|ADK96981.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] Length = 407 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 54/417 (12%), Positives = 126/417 (30%), Gaps = 32/417 (7%) Query: 26 KVVPIKR-FTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + V K G T S++ Y+ + D+ + + I Sbjct: 2 EYVKFKDVIINSQYGYTATETSQTEGTYKYLRITDIVPYYVNFDTVPFCKITEKDVSKYI 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQF------LVLQPKDVLPELLQGWLLSIDVT 134 +G IL + G + GI +T + ++ K VLP ++ L + Sbjct: 62 VKEGDILIARTGATTGYNYVV-PSGISNTVYASYLIRFIVDKKLVLPLFMKYVLKTQSYY 120 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I G+ + K +P L Q I + + I+ R I L Sbjct: 121 GFINNYIGGSAQPGMNAKVFTKFNIPKLSLVTQQKIASILSSYDRLIEN----NTRRIRL 176 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L++ + L + P+ + + + + +K L + K+ +E Sbjct: 177 LEQMAENLYKEWFVRFRFPEHENVEI-VNGLPKGWKTIHIKELAQLKSGYAFKSEWFVEE 235 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDP-----GEIVFRFIDLQNDKRSLRS-- 307 + I + E G++ K S+ Sbjct: 236 GEAVAKIKD-IGNILMDTSNFSYVDKENCIKAKKFLLTTGDLTIALTGATIGKISIVPKH 294 Query: 308 -AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRL 365 + + ++ P + + + + S + ++ E ++++ Sbjct: 295 KGNIYTNQRLGKFFLGDNPMEKLPFLYCLFKQESMVSNIVNLSNSSSAQPNISPEQIEKI 354 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +L DI ++ N + + + LL +R + ++G+++++ Sbjct: 355 KIL-----GNHDIISMYNKTCNPLFSNILALYSQNQLLTRQRDLLLPRLMSGKLEVK 406 Score = 41.3 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 59/196 (30%), Gaps = 13/196 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK WK + IK +L +G +S + ++D+ + +++ Sbjct: 206 LPKGWKTIHIKELAQLKSGYAFKSEWFVEEGEAVAKIKDIGNILMDTSNFSYVDKENCIK 265 Query: 77 TVS-IFAKGQILYGKLGPYLRKAII---ADFDGICSTQ----FLVLQPKDVLPELLQGWL 128 + G + G + K I + + + FL P + LP L + Sbjct: 266 AKKFLLTTGDLTIALTGATIGKISIVPKHKGNIYTNQRLGKFFLGDNPMEKLPFLYCLFK 325 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 V+ + + + + I I + + + I L ++ Sbjct: 326 QESMVSNIVNLSNSSSAQPNISPEQIEKI-KILGNHDIISMYNKTCNPLFSNILALYSQN 384 Query: 189 IRFIELLKEKKQALVS 204 L+S Sbjct: 385 QLLTRQRDLLLPRLMS 400 >gi|121610479|ref|YP_998286.1| restriction modification system DNA specificity subunit [Verminephrobacter eiseniae EF01-2] gi|121555119|gb|ABM59268.1| restriction modification system DNA specificity domain [Verminephrobacter eiseniae EF01-2] Length = 296 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 41/291 (14%), Positives = 94/291 (32%), Gaps = 17/291 (5%) Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G+ + + + I +P+ I +EQ I + + ID LI + ++ Sbjct: 6 QKYFLNSAAGSGVQNLNADIIKQLPILITKYSEQQKIADCL----SSIDQLIAAEAQKLD 61 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE--LNRKNTKL 251 LK K+ L+ + K++ + + + + N + Sbjct: 62 TLKAHKKGLMQQLFPAEGETLPKLRFPEFKDARVWASCDLGSRTIKVGSGITPNGGDKNY 121 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYET----YQIVDPGEIVFRFIDLQNDKRSLRS 307 I + + NI N + ++ ++ ++ Sbjct: 122 INAGRPFIRSQNIDWGELLLNNVAFIDDETHASFVSTKINDSDVFLNITGASIGISAIAD 181 Query: 308 AQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRL 365 ++V+ + + ++ T+L + S + + G RQ L F V+ Sbjct: 182 SRVIGGNVNQHVCIIRLKQKELNPTFLNQYLLSQYGQRQIDSFQAGGNRQGLNFTQVRSF 241 Query: 366 PVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + P ++EQ I + + + ID L+ Q LK + + Sbjct: 242 SIPTPSKMEEQIRIADCL----SSIDELINVQSQKFEALKIHKKGLMQQLF 288 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 32/78 (41%), Gaps = 5/78 (6%) Query: 339 SYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S K F + Q+L + +K+LP+L+ EQ I + + + ID L+ Sbjct: 2 SPTSQKYFLNSAAGSGVQNLNADIIKQLPILITKYSEQQKIADCL----SSIDQLIAAEA 57 Query: 398 QSIVLLKERRSSFIAAAV 415 Q + LK + + Sbjct: 58 QKLDTLKAHKKGLMQQLF 75 >gi|317481746|ref|ZP_07940778.1| type I restriction modification DNA specificity domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316916860|gb|EFV38250.1| type I restriction modification DNA specificity domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 335 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 54/341 (15%), Positives = 121/341 (35%), Gaps = 21/341 (6%) Query: 77 TVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSID 132 T++++ I+ LR + V+Q L + ++ + Sbjct: 9 TLTLYPSDSIVIVARSGILRHTIPVAKLRKPATVNQDIKVIQTVDSCDSSWLLQYFIASN 68 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 T E G T+ D+ + + + +P + EQ I R+D LIT R Sbjct: 69 KTLLREYGKTGTTVESIDFAKMKSTALMVPYIEEQQAIGSF----FSRLDNLITLHQRKY 124 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + L K++++ + K +++ +G E+ A + Sbjct: 125 DKLVIFKKSMLEKMFPKDGESVPEIRFAGFTDPWEQRKLGEIVSIGAGAPPSAFSAGNFL 184 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + L+ + Q + + + G I+F +R + + Sbjct: 185 YVKVDDLNESSHFQFDSAQRVDANTAVKP----IRKGSIIFAKRGAAILGNKVRV--LGK 238 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 I + MA++P G+D+ +L + L ++ + + + ++ PV +P + Sbjct: 239 TAYIDTNMMALEPRGVDADFLWLFINQTGLYRIAD---TSTIPQINNKHIEPYPVDIPNM 295 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I +R+D L+ ++ + LL++ + S + Sbjct: 296 AEQQAIGTF----FSRLDDLITLHQRKLELLQDIKKSLLDK 332 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/187 (21%), Positives = 75/187 (40%), Gaps = 14/187 (7%) Query: 25 WKVVPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + + G S + +Y+ ++D+ + + D R + V Sbjct: 158 WEQRKLGEIVSIGAGAPPSAFSAGNFLYVKVDDLNESS--HFQFDSAQRVDANTAVKPIR 215 Query: 83 KGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG I++ K G K + T + L+P+ V +L + I Sbjct: 216 KGSIIFAKRGAAILGNKVRVLGKTAYIDTNMMALEPRGVD----ADFLWLFINQTGLYRI 271 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + +T+ + K I P+ IP +AEQ I R+D LIT R +ELL++ K+ Sbjct: 272 ADTSTIPQINNKHIEPYPVDIPNMAEQQAIGTF----FSRLDDLITLHQRKLELLQDIKK 327 Query: 201 ALVSYIV 207 +L+ + Sbjct: 328 SLLDKMF 334 >gi|281421788|ref|ZP_06252787.1| putative type I restriction enzyme EcoAI specificity protein [Prevotella copri DSM 18205] gi|281404146|gb|EFB34826.1| putative type I restriction enzyme EcoAI specificity protein [Prevotella copri DSM 18205] Length = 385 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 43/387 (11%), Positives = 111/387 (28%), Gaps = 25/387 (6%) Query: 27 VVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI- 80 ++ T + ++ ++ + + + Sbjct: 2 WCKLEDITSVIGDGLHGTPQYNPNGAYYFVNGNNLSNRQIIIKNNTKRVSEEEYIKYKKP 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + IL G ++ I + ++ E + + S + Sbjct: 62 LNEHTILVSINGTIGNIGTYSNEQIILGKSACYFNITPFLVKEYMCYVIESNYFQKYALL 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+T+ + K I +PIPP++EQ I +I I+ + R +++ K Sbjct: 122 SATGSTIKNVPLKAINEFYVPIPPVSEQKRIVSEIDYLLAFINKVEEGRENLQSIVQSAK 181 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWV----------GLVPDHWEVKPFFALVTELNRKNT 249 ++ + L P ++ E + P + ++ + T + Sbjct: 182 SKILDLAIHGKLVPQDPNEEPASELLKRINPKAEITCDTPQYGKLLKGWCETTLKSLAKE 241 Query: 250 KLIESNILSLSYGNIIQKLET--RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + + G++ + Y V + + Sbjct: 242 VFAGGDKPTEFTKEKTNGNIIPIYSNGVEKDGLYGYTNVARVIEPCLTVSARGTIGFTCI 301 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + I+ + P D Y+ + + + L +K++ + Sbjct: 302 RNIPFVPIVRLITIVPNP-AFDLKYMKFCLDCLLIWSE-----GSSIPQLTVPTIKKMQL 355 Query: 368 LVPPIKEQFDITNVINVETARIDVLVE 394 +PP++EQ I I +++ + E Sbjct: 356 PLPPLQEQHRIVAKIEELFNQLNKIEE 382 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 67/185 (36%), Gaps = 3/185 (1%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQK--LETRNMGLKPESYETYQIVDPGEIVFR 294 +++ + + + GN + + +N + E + P Sbjct: 8 ITSVIGDGLHGTPQYNPNGAYYFVNGNNLSNRQIIIKNNTKRVSEEEYIKYKKPLNEHTI 67 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-L 353 + + ++ + + + SA + Y+ +++ S K +G Sbjct: 68 LVSINGTIGNIGTYSNEQIILGKSACYFNITPFLVKEYMCYVIESNYFQKYALLSATGST 127 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +++ + + V +PP+ EQ I + I+ A I+ + E E +++ +S + Sbjct: 128 IKNVPLKAINEFYVPIPPVSEQKRIVSEIDYLLAFINKVEEGRENLQSIVQSAKSKILDL 187 Query: 414 AVTGQ 418 A+ G+ Sbjct: 188 AIHGK 192 >gi|327490260|gb|EGF22048.1| type I restriction/modification specificity protein [Streptococcus sanguinis SK1058] Length = 392 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 62/394 (15%), Positives = 126/394 (31%), Gaps = 25/394 (6%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + + + + R + Y+ E++ S G S +F KG IL Sbjct: 17 LSQVSSYVSERIRIDEVNLDNYVSTENMISERGGVTKATKLPSGKTIS---VFQKGDILI 73 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQRIEAICEGATMS 147 + PY +K +AD G CS LV++ + + L L S + +G M Sbjct: 74 SNIRPYFKKIWLADKSGGCSNDVLVVRANEKISNRFLYYVLSSDNFFDYAVGTSKGTKMP 133 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 D K I +PI L EQ I E + A +I +E + + Sbjct: 134 RGDKKAIMKYKVPIYSLVEQEKIAEILRAFDKKIILNKQINHHLVEQAYAIYKEVCVNQK 193 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + S G P + + + + + +L Y Sbjct: 194 DDSFVEETIKSISQKVITGKTPSTQNKDYYGGDLPFITIPDMHNNIYCVETLRY------ 247 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K E + + + I+ I + + AV P Sbjct: 248 -----LTQKGEQTQPSKTLPKNSIIVSCIATPG-----LVSLLDRESQTNQQINAVIPSE 297 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 YL + S + +L +++ + P I+++++ +N Sbjct: 298 NQEYYLFLELLSKSKLIIELGSSGSTTYNLNKTQFEKIKISAPSIEKRYE----LNNLLR 353 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + + + + L + R + + ++G+I + Sbjct: 354 PLFLKINQTQHETIKLSQLRDTLLPKLLSGEISV 387 >gi|306825747|ref|ZP_07459086.1| restriction modification system DNA specificity subunit [Streptococcus sp. oral taxon 071 str. 73H25AP] gi|304432108|gb|EFM35085.1| restriction modification system DNA specificity subunit [Streptococcus sp. oral taxon 071 str. 73H25AP] Length = 408 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 63/416 (15%), Positives = 131/416 (31%), Gaps = 31/416 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS--DTSTVS 79 + + + + T ++ K + +++ S T S + + + S Sbjct: 3 EWIKAEEYCISVFDGTHDTPKVTESGYKLVTSKNILSNTLDLNSAYFISEEDFVNINKRS 62 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE----LLQGWLLSIDVTQ 135 + IL+ +G ++ + + + L +L S + Sbjct: 63 KVKQYDILFSMIGTVG--SLYFETSDTIDYAIKNIGVFSCCDKEKAEWLYYYLQSSYARK 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I+ GA +G+ P+P I+ + ID I + + L Sbjct: 121 YIKRYLNGAVQKFLPLRGLREFPVPQFNKELHNRIKILLN-----IDQKIQTNNQINQEL 175 Query: 196 KEKKQALVSYIVTKGLNPDV---KMKDSG------IEWVGLVPDHWEVKPFFALVTELNR 246 + + L Y + PD K SG E +P+ W V + + N Sbjct: 176 EAMAKTLYDYWFVQFDFPDQNGKPYKSSGGKMVYHPELKREIPEGWGVDSLWNIANFYNG 235 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + Y +I+ E N K I + I Sbjct: 236 LAMQKYRPDTNEDDYLPVIKIREMMNSFSKDTEKARLDIPTEAVVERGDILFSWSATLEV 295 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQSLKFEDVKRL 365 ERG + V T++ + ++SY + K + + + +K+ Sbjct: 296 IIWGKERGALNQHIFKVTSDTYPKTFIYFELKSYLKVFKSIAELRKTTMGHITQDHLKQA 355 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++VPPI+ + I+V+ I + +E L + R + + GQ+ + Sbjct: 356 KIVVPPIEL----ISKIDVQLQHIMSQQQILENQNQELTQLRDWLLPMLMNGQVKV 407 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 60/191 (31%), Gaps = 16/191 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQ 72 IP+ W V + G ++ +D + I + ++ + KD + Sbjct: 216 EIPEGWGVDSLWNIANFYNGLAMQKYRPDTNEDDYLPVIKIREMMNS----FSKDTEKAR 271 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 D T ++ +G IL+ L I G + + + L S Sbjct: 272 LDIPTEAVVERGDILFSWS-ATLEVIIWGKERGALNQHIFKVTSDTYPKTFIYFELKSYL 330 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + A TM H + + +PP+ + KI + I + Sbjct: 331 KVFKSIAELRKTTMGHITQDHLKQAKIVVPPI----ELISKIDVQLQHIMSQQQILENQN 386 Query: 193 ELLKEKKQALV 203 + L + + L+ Sbjct: 387 QELTQLRDWLL 397 >gi|229491519|ref|ZP_04385340.1| putative type I restriction-modification system, S subunit [Rhodococcus erythropolis SK121] gi|229321200|gb|EEN87000.1| putative type I restriction-modification system, S subunit [Rhodococcus erythropolis SK121] Length = 416 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 84/424 (19%), Positives = 149/424 (35%), Gaps = 46/424 (10%) Query: 21 IPKHWKVVPIKRFTKL-NTGRTSESGKDIIY--IGLEDVESGTGKYLPKDGNSRQSDTST 77 +P W VP+K T + G E D + + + ++ D+ + Sbjct: 9 LPDSWNWVPLKFSTTFLSRGTAPEYVDDGPVRAVSQAANRATGIDWSRTRFHAHVGDSRS 68 Query: 78 VS-IFAKGQILYGKLGP-YLRKAIIA-----DFDGICSTQFLVLQPKDVL--PELLQGWL 128 + IL G L + D I V + + P L W+ Sbjct: 69 LKGYLYSDDILINSTGTGTLGRIGYFAEGPDDRPCIADGHVTVTRADRNIIEPRFLFYWM 128 Query: 129 LSIDVTQRIEAI--CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 I + + + P+ +PP+ Q I + + ET RID+L Sbjct: 129 SCAPYQDYIYSCLVTGATNQIELNRDQLAGTPVVVPPIHVQRRIVDLLDLETGRIDSLAA 188 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + R + LL+++ + + IV G + P V+P L+ +L R Sbjct: 189 GQQRVLNLLEDRVDSRILEIV-------------GGSRLVD-PSGDAVQPAKRLLAKLAR 234 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLK--PESYETYQIVDPGEIVFRFIDLQNDKRS 304 E I + G + + R+ G + Q V+ G++V +D + Sbjct: 235 ATKATGEV-ITAYRDGQVTSRSIRRSEGYTLAASTDPQGQGVEVGDVVVHGLD----GFA 289 Query: 305 LRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL----KF 359 G + Y P G + + L+R + + R+ + Sbjct: 290 GAIGDSEADGNCSPVYHVCAPADGGNPAFYGRLLRVLAVENYLGLFATSTRERAVDFRSW 349 Query: 360 EDVKRLPVL-VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +P+ V P+ Q +I +I A I L +++ + LL ERR + I AAVTGQ Sbjct: 350 DLFGNIPIPQVEPL-VQHEIGQMI----ASIRPLRKEVVRFNALLAERRLALITAAVTGQ 404 Query: 419 IDLR 422 ID+ Sbjct: 405 IDVT 408 >gi|56476903|ref|YP_158492.1| restriction modification system specificity subunit [Aromatoleum aromaticum EbN1] gi|56312946|emb|CAI07591.1| restriction modification system specificity subunit [Aromatoleum aromaticum EbN1] Length = 424 Score = 93.3 bits (230), Expect = 7e-17, Method: Composition-based stats. Identities = 47/407 (11%), Positives = 119/407 (29%), Gaps = 26/407 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + T R + + I + +D + Q S ++ Sbjct: 23 SWEYTVLGDASTPVTERVGDRKLTPVSISAGIGFVPQAEKFGRDISGNQ--YSLYTLVRD 80 Query: 84 GQILYGKLGPY---LRKAIIADFDGICS--TQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G +Y K G + F+ + K + ++ Sbjct: 81 GDFVYNKGNSLKFPQGCVYQLRGLGEVAAPNVFISFRLKQGFVAEYFQYCFEKNIHGAQL 140 Query: 139 AIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + I +P P EQ I + + +D +I + R +E Sbjct: 141 KKHITSGARSNGLLNVSKDQFYGISIPTPLPDEQQKIADCL----TSLDEVIAAQGRKVE 196 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGI-EWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 +K K+ L+ + + +++ + P + ++ N Sbjct: 197 AVKTYKRGLMQQLFPREGETLPRLRFPEFRDSPKWEPTTLDGLVDLQSGGTPSKINLAFW 256 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPES--YETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 +I +S ++ + + + ++V G ++ + K ++ + Sbjct: 257 NGSIPWVSAKDMKRLFLDDAEDHISAAAVDDGAKLVPAGTVLMLTRGMTLLK-NVPICVL 315 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVL 368 A+ P G + L+ + ++ + L +++K L + Sbjct: 316 RREMSFNQDVKALLPKGETTGLFVALLLLGNKQRLLRMVDIAGHGTGKLNTDELKALKLA 375 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P EQ I + + + +D + + LK + + Sbjct: 376 APKPAEQQRIADFL----SSLDAQIAAEADKLAALKIHKDGLMQQLF 418 >gi|313669545|ref|YP_004049970.1| restriction modification system DNA specificity domain [Sulfuricurvum kujiense DSM 16994] gi|313156742|gb|ADR35417.1| restriction modification system DNA specificity domain [Sulfuricurvum kujiense DSM 16994] Length = 417 Score = 92.9 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 62/421 (14%), Positives = 133/421 (31%), Gaps = 38/421 (9%) Query: 26 KVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGK--YLPKDGNSRQSDTSTVS 79 K + + + G +S + + D + L K S Sbjct: 3 KTIQLGDYICTLKGFAFKSQWYEKDGHPIVKVSDFTENSIDTSKLVKIPFEVAEKYKKYS 62 Query: 80 IFAKGQILYGKLGPY-----------LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + I+ +G + ++ A + ++ K++ L L Sbjct: 63 L-KTNDIVIQTVGSWPSNPASVVGKTIKVPCQAHGSLLNQNAVIIYPDKNIDQSYLYYVL 121 Query: 129 LSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + I +GA + I + +P Q I + V I+ Sbjct: 122 KDQNFKDYIVGTAQGAASQASITLDAIKGFELELPDQEVQQKIASILSTYDVLIENNNRR 181 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF---FALVTEL 244 E ++L K P + + +G +P WEV P + +T+ Sbjct: 182 ITILE----EMARSLYREWFVKFRFPGHEAVEMVDSELGQIPKGWEVSPLENLCSRITDG 237 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE------TYQIVDPGEIVFRFIDL 298 + K+ K + S ++ N K + V +I+ Sbjct: 238 SHKSPKSVLEGFPMASVKDMHDFGLNVNSCRKISKEDFDDLVRNDCKVTANDILIAKDGS 297 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSL 357 K + + ++ +++S M + I S +L++ ++S ++ + SG + Sbjct: 298 YL-KHTFVVEKDLDIALLSSIAMLRPNNKIKSHFLSYCLKSPEVKERMKQCVSGVAIPRI 356 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +D + ++VP I Q N+I L+ + LLK +R + ++G Sbjct: 357 ILQDFRNFKIIVPTIDIQKQWNNLIEDNIQMCWNLI----KQNNLLKTQRDMLLPKLISG 412 Query: 418 Q 418 + Sbjct: 413 K 413 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 67/206 (32%), Gaps = 13/206 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYL 64 + DS +G IPK W+V P++ T + +S K ++D+ Sbjct: 209 EMVDSE---LGQIPKGWEVSPLENLCSRITDGSHKSPKSVLEGFPMASVKDMHDFGLNVN 265 Query: 65 P-KDGNSRQSD--TSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKD 118 + + D IL K G YL+ + + D + S+ ++ Sbjct: 266 SCRKISKEDFDDLVRNDCKVTANDILIAKDGSYLKHTFVVEKDLDIALLSSIAMLRPNNK 325 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + L L S +V +R++ G + + N + +P + Q I Sbjct: 326 IKSHFLSYCLKSPEVKERMKQCVSGVAIPRIILQDFRNFKIIVPTIDIQKQWNNLIEDNI 385 Query: 179 VRIDTLITERIRFIELLKEKKQALVS 204 LI + L+S Sbjct: 386 QMCWNLIKQNNLLKTQRDMLLPKLIS 411 >gi|328542331|ref|YP_004302440.1| Restriction modification system, type I hsdS [polymorphum gilvum SL003B-26A1] gi|326412078|gb|ADZ69141.1| Putative Restriction modification system, type I hsdS [Polymorphum gilvum SL003B-26A1] Length = 390 Score = 92.9 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 57/403 (14%), Positives = 127/403 (31%), Gaps = 29/403 (7%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 VPI T++ G T K DI ++ +D++ S ++ Sbjct: 4 VPIGDVTEVKGGGTPSKRKPEYYQGDIPWVTPKDMKVWDISDAIDKITPEAVADSATNLI 63 Query: 82 AKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 IL L+ I + + L D + I Sbjct: 64 PARSILLVNRSGILKHTLPVGITRREVAINQDLKALICSDR-AHPEYLAHIVKAAEPIIL 122 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 T + + + +P+P L EQ I + R R ++ L Sbjct: 123 KWVRATTADNFPIDSLKELKIPLPTLDEQRRIAGILDQADAL----RRLRSRALDKLNTL 178 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 QA+ + +P K + +G + + +E L + + Sbjct: 179 GQAIFHEMFG---DPATNPKGWPMGVIGDLLKEAKYGSSGKANSEGRG----LPMLRMGN 231 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 ++Y I + +++ L + ++ Y PG+++F + + E I Sbjct: 232 VTYDGRIDLSDLKHIELSDKEFDKYT-TRPGDLLFNRTNSKELVGKTAVVTQAEPMAIAG 290 Query: 319 AYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQ 375 + + + ++ Y++ + S V M + ++ ++ + +P +PP++ Q Sbjct: 291 YLVRGRANARGNTHYISGYLNSTHGKAVLRNMCKNIVGMANINAKEFQSIPTAIPPVEIQ 350 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + A + + S+ + +S A G+ Sbjct: 351 RVYAEKLMSLRAE----EAQFQASLDIASTLFASLQHRAFKGE 389 >gi|325661847|ref|ZP_08150468.1| hypothetical protein HMPREF0490_01204 [Lachnospiraceae bacterium 4_1_37FAA] gi|325471825|gb|EGC75042.1| hypothetical protein HMPREF0490_01204 [Lachnospiraceae bacterium 4_1_37FAA] Length = 379 Score = 92.9 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 53/385 (13%), Positives = 104/385 (27%), Gaps = 40/385 (10%) Query: 26 KVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + + G + + + I ++D+ D D Sbjct: 4 EYKRLGDIASYINGYAFKPEQRGTEGLPIIRIQDLTGN-----AYDLGFYDGDYPEKIEI 58 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G +L L I + + + V + Sbjct: 59 NNGDVLISWS-ASLGVYIWNRGKALLNQHIFKVAFDKVNVNKDYFVFAVKHKLDEMVLKT 117 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GATM H K N +P P L Q + + +I R + I+ L E +A Sbjct: 118 HGATMKHIIKKDFDNTKIPFPSLEMQEETASILKM----VSDIIDTRQQEIKKLDELIRA 173 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + G E +G +T+ K ++ I +S Sbjct: 174 RFVELFENGDYKT--------EKLG---------SVCTKITDGTHKTPTYLDEGITFISA 216 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII----T 317 NI+ + E +I + I L V + + Sbjct: 217 KNIVNGELDFSDVKHISEEEYQEIQKRCQTAIYDILLSKSGSLGAPVIVKTEEKLGLFES 276 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF 376 A + + +L +++ + + F G + L + V+VPPI+EQ Sbjct: 277 LAVIKYDREKLLPEFLCEQLKTDRIQRQFTTGTKGVAIKHLHLGVIAETDVIVPPIEEQR 336 Query: 377 DITNVINV----ETARIDVLVEKIE 397 + + + + + + Sbjct: 337 QFADFVKQVDKSKFQKYNATILSHN 361 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 14/140 (10%), Positives = 45/140 (32%), Gaps = 10/140 (7%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 ++G Y ++ G+++ + + ++ V Sbjct: 40 GNAYDLGFYDGDYPEKIEINNGDVLISWSASLG-----VYIWNRGKALLNQHIFKVAFDK 94 Query: 328 IDSTYLAWLMR-SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ++ ++ + L ++ + + +D + P ++ Q + +++ + + Sbjct: 95 VNVNKDYFVFAVKHKLDEMVLKTHGATMKHIIKKDFDNTKIPFPSLEMQEETASILKMVS 154 Query: 387 ARIDVLVEKIEQSIVLLKER 406 ID +Q I L E Sbjct: 155 DIIDT----RQQEIKKLDEL 170 >gi|330838540|ref|YP_004413120.1| restriction modification system DNA specificity domain protein [Selenomonas sputigena ATCC 35185] gi|329746304|gb|AEB99660.1| restriction modification system DNA specificity domain protein [Selenomonas sputigena ATCC 35185] Length = 443 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 45/359 (12%), Positives = 115/359 (32%), Gaps = 30/359 (8%) Query: 80 IFAKGQIL---YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 +G+++ +GK + + + + + + Sbjct: 83 YLKEGEVVSIPWGKSRDVTDCIKYYKGKFVTADNRIATSNDITKLSNRYLYYWMMSQGKV 142 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I+ G+ + H D + N+ +PIPPLA Q I + + T L + + + L K Sbjct: 143 IDTFYRGSGIKHPDMAKVLNMQIPIPPLAIQNEIVKLLDDFTELTAELTEQLMTELTLRK 202 Query: 197 EKKQALVSYIVTK--------GLNPDVKMKDSGIEWVGLVPDHWE--VKPFFALVTELNR 246 ++ ++ + + I GL+ ++ K + +++ Sbjct: 203 KQYNFYRDSLLNFVRVDDTIVQTDRQTDRQAQRISKFGLLRKTFDVEWKTLGEVSSQICS 262 Query: 247 KNTKLIESNILSLSYGNI-----IQKLETRNMGLKPESY----ETYQIVDPGEIVFRFID 297 T + + I + + G+K + + + ++ Sbjct: 263 GGTPTASNAAFYVGTIPWLRTQEIDWADIYDTGIKISEEALKASSARWIPANCVIVAMYG 322 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 K ++ + + + + + Y+ + S K A G G + ++ Sbjct: 323 ATAAKVAINRIPLTTNQACCN--LKINEEMAEHRYVYHWLCSQY--KTLKAKGQGSQSNI 378 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +++ P+ VPP+ Q I ++++ + L + I K+ R + Sbjct: 379 NKNIIEKYPIPVPPLDVQQKIVSILDRFDTLCNDLTSGLPAEIAARKKQYEHYRDRLLT 437 >gi|325287951|ref|YP_004263741.1| restriction modification system DNA specificity domain-containing protein [Cellulophaga lytica DSM 7489] gi|324323405|gb|ADY30870.1| restriction modification system DNA specificity domain protein [Cellulophaga lytica DSM 7489] Length = 409 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 58/409 (14%), Positives = 133/409 (32%), Gaps = 21/409 (5%) Query: 25 WKVVPIKRFTKLNTGR----TSESGKDIIYIGLEDV----ESGTGKYLPKDGNSRQSDTS 76 W+ + ++ + + + + + + +V ++G + Sbjct: 4 WEEENLSNLFEIKSSKRVLKSDWKTEGVPFYRAREVVKLAQNGFVNNELFISEKLYDQYT 63 Query: 77 TVSIFAK-GQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 F K I+ +G + ++ F ++ + D ++ + Sbjct: 64 KDRGFPKEDDIIISAVGTLGQCYLVKKSDKFYFKDASVLWFEKKSDTDSRFIEYAFKTRL 123 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +I GAT+ N+ +P+PPLAEQ I K+ +ID I + Sbjct: 124 IKNQINKKSSGATVGTLTISTARNLKIPLPPLAEQQRIVAKLDGLFAKIDKAI----GLL 179 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E QAL+ ++ + K + + PF + + + Sbjct: 180 EDNIAHTQALMGSVLDEEFGRLEKYNKPLMTFCKNPKKDMVGGPFGSNLKASEYVDKGYP 239 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + ++ N +K K E ++ + G+IV + K + Sbjct: 240 IIRLQNVDRFNFKEKNIMFVTEEKAEFLSSHSYIS-GDIVMTKLGDPLGKCCVVEDVHGV 298 Query: 313 RGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 + S+ + Y ++ + S K + G R + ++V+ + + Sbjct: 299 DRGVISSDIIRIRIDESKHYKPYVVAGINSEFFIKQLKSKTQGSTRPRVTLKEVRAMQLP 358 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + ++Q I+ ++E Q + LK +SS + A G Sbjct: 359 MLKREDQVIAAKRIDGILELQSKVLETQNQKLNHLKALKSSLLDQAFKG 407 >gi|297562022|ref|YP_003680996.1| restriction modification system DNA specificity domain protein [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296846470|gb|ADH68490.1| restriction modification system DNA specificity domain protein [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 415 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 68/424 (16%), Positives = 149/424 (35%), Gaps = 43/424 (10%) Query: 22 PKHWKVVPIKRFT-----KLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPK-DG 68 P+ WKV + + TG + I + +++ K Sbjct: 10 PQTWKVTTLGELCASGGGNIQTGPFGSQLHAADYVTQGIPSVMPQNIGDNVIKEEGIARI 69 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDV-LPELLQ 125 + + A G I+Y + G ++A++ + +C T L ++P E + Sbjct: 70 APEDAFRLEKYLLAPGDIVYSRRGDIEKRALVRETQRGWLCGTGCLRVRPGVGANSEFIS 129 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L V + I GATM + + K + ++P+ +PPL EQV I + A +I Sbjct: 130 YYLGHPSVREWIVKHAVGATMPNLNTKILSSLPVSVPPLNEQVSIASTLGALDNKITVNK 189 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + LL + + L+ + G D+ + + +E+ + + L Sbjct: 190 QIVSTYESLLATEFEQLIR--IEAGAEQDIALANEFVEFNPKYQKPSDPTSRHVNMAALP 247 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + ++ + + G Q +T + P + Sbjct: 248 TSSARVHTWDFRKPTPGTRFQNGDTLLARITP------------------CLENGKTAFV 289 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCK--VFYAMGSGLRQSLKFEDV 362 E GI ++ ++ ++ + ++L+ R+ + + +G+ RQ + + Sbjct: 290 DFMDDNETGIGSTEFIVMRSLPGVPQHFSYLLARNKRFREHAISNMIGTSGRQRCPADRL 349 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + P E I +V A + L E I L E R + + ++G++ ++ Sbjct: 350 PGFSMKRPDPTELERIGKDSDVAFAHMRSL--DSEAYI--LAELRDTLLPKLISGELRVK 405 Query: 423 GESQ 426 + Sbjct: 406 DAEK 409 >gi|322386250|ref|ZP_08059882.1| type I restriction system specificity protein [Streptococcus cristatus ATCC 51100] gi|321269712|gb|EFX52640.1| type I restriction system specificity protein [Streptococcus cristatus ATCC 51100] Length = 394 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 59/416 (14%), Positives = 139/416 (33%), Gaps = 48/416 (11%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + G++ + G +G +D S S I+ Sbjct: 4 IKLGDVIDFKNGKSIKKSD------------GNIPIYGGNGILGYTDKSNFSH----TIV 47 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G++G Y + + S + PK+ ++L + + G++ Sbjct: 48 VGRVGAYCGSIHVEENLCWVSDNAIAGIPKEGQDLTYLYYVLKSL---NLNSKQIGSSQP 104 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + ++ + + E+ K I ID I + + L+ + L Y Sbjct: 105 LITQSMLKDMVVDVEIDNEKQKRIAKSILI---IDQKIQINNQINQELEAMAKTLYDYWF 161 Query: 208 TKGLNPDV---KMKDSG------IEWVGLVPDHWEVKPFFALVTELNRKN------TKLI 252 + PD K SG E +P+ W V+ ++ + I Sbjct: 162 VQFDFPDQNGKPYKSSGGKMVYNPELKREIPEGWGVEKLKDKLSVSRGISYKTENIKDNI 221 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ----NDKRSLRSA 308 + +++L+ +I + ++ + Y +IV G+++ DL + Sbjct: 222 GTPMINLASIDINRNYKSTGLKYFNGEYLKEKIVSGGDLLIACTDLTRNADIVGSPIIVP 281 Query: 309 QVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL 365 ++ + + + I+ YL +R+ SG L + + Sbjct: 282 FDEQKYVFSMDLAKIDSKVDFINKYYLYSTLRTEHYHNYIKKWASGTNVLHLNLDGMNWY 341 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + VPPI+ Q + + +I + + + +++ ++ L + R + + GQ+ + Sbjct: 342 SISVPPIELQEEYSQIILNFSKKTNKNIQENQE----LTQLRDWLLPMLMNGQVKV 393 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 58/202 (28%), Gaps = 14/202 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE----SGTGKYLPKDGNSRQSDT 75 IP+ W V +K ++ G + ++ IG + Y + Sbjct: 190 EIPEGWGVEKLKDKLSVSRGISYKTENIKDNIGTPMINLASIDINRNYKSTGLKYFNGEY 249 Query: 76 STVSIFAKGQILYGKLGPYLR--------KAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 I + G +L + + S + K + Sbjct: 250 LKEKIVSGGDLLIACTDLTRNADIVGSPIIVPFDEQKYVFSMDLAKIDSKVDFINKYYLY 309 Query: 128 L--LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + I+ G + H + G+ + +PP+ Q + I+ + + + I Sbjct: 310 STLRTEHYHNYIKKWASGTNVLHLNLDGMNWYSISVPPIELQEEYSQIILNFSKKTNKNI 369 Query: 186 TERIRFIELLKEKKQALVSYIV 207 E +L L++ V Sbjct: 370 QENQELTQLRDWLLPMLMNGQV 391 >gi|126667622|ref|ZP_01738591.1| specificity determinant for hsdM and hsdR [Marinobacter sp. ELB17] gi|126627891|gb|EAZ98519.1| specificity determinant for hsdM and hsdR [Marinobacter sp. ELB17] Length = 479 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 62/429 (14%), Positives = 138/429 (32%), Gaps = 58/429 (13%) Query: 38 TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK 97 TG T S + +++G + + N L P + Sbjct: 15 TGFTQISTTGKKVKTKDCLQTGRFPVIDQGQNPVAG------YVDDPDRLINVSDPLIVF 68 Query: 98 AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 F V + +L ++ ++ +K + + Sbjct: 69 GDHTRAVKWVDFSF-VPGADGTKILQPEPYLFPRFAYYQLRSLEIPNKGYSRHFKFLKEL 127 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 + PLAEQ I K+ +++ R +LK +Q++++ V+ L + + Sbjct: 128 KFEVAPLAEQKTIAVKLDTLLAQVENTKARLERIPTILKRFRQSVLAAAVSGRLTEEWRN 187 Query: 218 ----KDSGIEWV-----------------------------------GLVPDHWEVKP-- 236 K S + + G +P+ W P Sbjct: 188 NRTTKSSPKKLLNHFEELRQIAVQDENLRTGKKTKYKPVTIDTYGTPGDLPNSWYWIPVE 247 Query: 237 -FFALVTELNRKNTKLIESNILSLSYGNIIQKL------ETRNMGLKPESYETYQIVDPG 289 VT+ K I + + ++ N+ + E + + G Sbjct: 248 ALATKVTDGVHKKPTYISNGVPFITVKNLTKGNGISFTETNYISTHDHEEFCKRTNPEKG 307 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+ R +R+ + I S + S YL +S + + Sbjct: 308 DILISKDGTLGVVRQIRTDAIF--SIFVSVALVKPADRSMSNYLELAFQSSVVQGQMIGV 365 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G+GL+ + D+++ + VPP++EQ +I + ++ A + + +++ ++ + + S Sbjct: 366 GTGLQ-HIHLIDLRKDLIPVPPLEEQIEIVHQVDQLFAYAERVEQQVNNALARVNKLTQS 424 Query: 410 FIAAAVTGQ 418 +A A G+ Sbjct: 425 ILAKAFRGE 433 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 68/206 (33%), Gaps = 9/206 (4%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKD---GNSR 71 G +P W +P++ T + + +I ++++ G G + Sbjct: 235 GDLPNSWYWIPVEALATKVTDGVHKKPTYISNGVPFITVKNLTKGNGISFTETNYISTHD 294 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL-LQGWLLS 130 + + KG IL K G L D I S V K + L Sbjct: 295 HEEFCKRTNPEKGDILISKDG-TLGVVRQIRTDAIFSIFVSVALVKPADRSMSNYLELAF 353 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + I G + H + +P+PPL EQ+ I ++ + + + Sbjct: 354 QSSVVQGQMIGVGTGLQHIHLIDLRKDLIPVPPLEEQIEIVHQVDQLFAYAERVEQQVNN 413 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVK 216 + + + Q++++ L + Sbjct: 414 ALARVNKLTQSILAKAFRGELTEQWR 439 >gi|323972573|gb|EGB67776.1| type I restriction modification DNA specificity domain-containing protein [Escherichia coli TA007] Length = 300 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 62/163 (38%), Gaps = 6/163 (3%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N ++ E R + + + ++ + G+++ + ++ + Q Sbjct: 61 NKLETNEIRYVTREFHTAQSKTALKAGDLLTVQSGHIG-ETAVVTDQFHGANCHALIVTR 119 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +K D YL + + S + + +D+K+ VL+P + EQ I + Sbjct: 120 LKQEKADPHYLCFYVNSEIGRARMKGLEVGSTILHINTKDLKKFRVLLPSLPEQKKIAQI 179 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 ++ D + E+ + + ++ + + +TG+ L E Sbjct: 180 LSTW----DKAISVTEKLLTNSQRQKKALMQQLLTGKKRLLDE 218 >gi|329913308|ref|ZP_08275914.1| Type I restriction-modification system, specificity subunit S [Oxalobacteraceae bacterium IMCC9480] gi|327545395|gb|EGF30614.1| Type I restriction-modification system, specificity subunit S [Oxalobacteraceae bacterium IMCC9480] Length = 517 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 60/422 (14%), Positives = 121/422 (28%), Gaps = 50/422 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W VP+ L G S K SG Y + G Sbjct: 111 EWAEVPLGDVITLQRGFDLPSQKRKPGKVPIVSSSGVSDYNSEVGVKGPG---------- 160 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ G+ G + +I + +T V P L +ID + Sbjct: 161 --VVTGRYGTIGQVFLIKEDFWPLNTTLWVKNFHGNDPHFASYLLRTIDFRSCSDKS--- 215 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 ++ + + IP+ PPLAEQ I + +I+ + + ++ Sbjct: 216 -SVPGVNRNDLHRIPVLRPPLAEQKSIALILGTLDDKIELNRRMNKTLEAIARALFKSWF 274 Query: 204 SYI--VTKGLNPDVKMKDSG----------------IEWVGLVPDHWEVKPFFALVTELN 245 V ++ + DS +G +P+ WEVKP + +N Sbjct: 275 VDFEPVRAKIDARWQSGDSLPGLPAHLCELFPSRLVDSELGEIPEGWEVKPLDEIAAFIN 334 Query: 246 R----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 K + ++ L + ++ + I+ G+ +F + Sbjct: 335 GLALQKFSATDLADSLPVIKIAELRNGVSHKSDRASRDVPEKYIIKDGDFLFSWSGSL-- 392 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFE 360 L G + V +++ + + A + ++ Sbjct: 393 ---LAKFWTEGEGALNQHLFKVTSEQYPMWFVSHWVHHHLEEFQSIAASKATTMGHIQRG 449 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETA-RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +K + P + A ID + E L R + + V+G++ Sbjct: 450 HLKSAMTVCPDQDTLKKF----DCVMAPLIDEAI-HNELESRSLAALRDTLLPKLVSGEL 504 Query: 420 DL 421 + Sbjct: 505 RV 506 Score = 69.8 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 60/189 (31%), Gaps = 17/189 (8%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLP 65 DS +G IP+ W+V P+ G + + I + ++ +G + Sbjct: 311 DSE---LGEIPEGWEVKPLDEIAAFINGLALQKFSATDLADSLPVIKIAELRNG----VS 363 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + D I G L+ G L K + +G + + + + Sbjct: 364 HKSDRASRDVPEKYIIKDGDFLFSWSGSLLAK-FWTEGEGALNQHLFKVTSEQYPMWFVS 422 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 W+ + A + TM H G++ + +Q +++ ID I Sbjct: 423 HWVHHHLEEFQSIAASKATTMGHIQR---GHLKSAMTVCPDQDTLKKFDCVMAPLIDEAI 479 Query: 186 TERIRFIEL 194 + L Sbjct: 480 HNELESRSL 488 >gi|291527172|emb|CBK92758.1| Restriction endonuclease S subunits [Eubacterium rectale M104/1] Length = 382 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 47/376 (12%), Positives = 105/376 (27%), Gaps = 29/376 (7%) Query: 28 VPIKRFTKLNTGRTSESGK-------DIIYIGLEDVE--SGTGKYLPKDGNSRQSDTSTV 78 V +K L G+T D +I + D+ S + + S + Sbjct: 3 VKLKDIFDLQMGKTPSRSNLEYWNTTDYKWISIADLTKTSKYIFETKEYLSKSAIKDSGI 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + ++ + K I D + + + K V+ + + E Sbjct: 63 KVIPANTVVMS-FKLSIGKTAITKEDMYSNEAIMAFKDKHVINIIPEYIFYLFKYKNWEE 121 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + + I + I + +Q I + +D E EL Sbjct: 122 CSNKAVMGKTLNKATLSEIEVEICSIEKQRQIVNILDKIMSAVDGRKQELQLLDEL---- 177 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + V + K I + + I S Sbjct: 178 ---IKARFVEMFGDLKTNSKMWQIVGFNE------CAVIDTNMIHNFQGYEDYPHIGIDS 228 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + KL + + P I++ I +K +L + + Sbjct: 229 IEK--ETGKLIGYRTISEDGVVSGKYLFTPQHIIYSKIRPNLNKVALPDFDGL--CSADA 284 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFD 377 + VK + Y+ + +R+ A S + + V+ + +PP+ Q Sbjct: 285 YPILVKKEICNREYMGYTLRNKYFLDYILAFSSRTNLPKVNKKQVEGFKLPLPPMGLQNQ 344 Query: 378 ITNVINVET-ARIDVL 392 + ++ ++ D + Sbjct: 345 FADFVHQVDKSKFDTM 360 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 58/153 (37%), Gaps = 4/153 (2%) Query: 25 WKVVPIKRFTKL--NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W++V + N + +D +IG++ +E TGK + S S +F Sbjct: 196 WQIVGFNECAVIDTNMIHNFQGYEDYPHIGIDSIEKETGKLIGYRTISEDGVVSGKYLFT 255 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL--LSIDVTQRIEAI 140 I+Y K+ P L K + DFDG+CS + K + + I A Sbjct: 256 PQHIIYSKIRPNLNKVALPDFDGLCSADAYPILVKKEICNREYMGYTLRNKYFLDYILAF 315 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + K + +P+PP+ Q + Sbjct: 316 SSRTNLPKVNKKQVEGFKLPLPPMGLQNQFADF 348 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 56/184 (30%), Gaps = 11/184 (5%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE----TRNMGLKPESYETYQIVDP 288 + K + + I + ++ + E I D Sbjct: 1 MRVKLKDIFDLQMGKTPSRSNLEYWNTTDYKWISIADLTKTSKYIFETKEYLSKSAIKDS 60 Query: 289 GEIVF--RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G V + + ++A E A MA K + + ++ + Sbjct: 61 GIKVIPANTVVMSFKLSIGKTAITKEDMYSNEAIMAFKDKHVINIIPEYIFYLFKYKNWE 120 Query: 347 YAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + ++L + + V + I++Q I N+++ + +D +Q + LL E Sbjct: 121 ECSNKAVMGKTLNKATLSEIEVEICSIEKQRQIVNILDKIMSAVD----GRKQELQLLDE 176 Query: 406 RRSS 409 + Sbjct: 177 LIKA 180 >gi|229042278|ref|ZP_04190030.1| hypothetical protein bcere0027_3480 [Bacillus cereus AH676] gi|228727069|gb|EEL78274.1| hypothetical protein bcere0027_3480 [Bacillus cereus AH676] Length = 396 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 55/404 (13%), Positives = 129/404 (31%), Gaps = 27/404 (6%) Query: 31 KRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + G ++ + + I + + + +V I ++ Sbjct: 2 GDTADFSKGNGYSKSDLTDEGKPVILYGRLYTRYETVIESVDTFTIEKDKSV-ISKGNEV 60 Query: 87 LYGKLGPYL----RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC- 141 + G R ++++ I +++P + + + +S ++ + Sbjct: 61 IVPASGETSEDISRASVVSKPGIILGGDLNIIRPSNEIDPIFLALTISNGKQKKELSKRA 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +G ++ H + + + P EQ+ I + ++D I + + LK+ KQ Sbjct: 121 QGKSVVHLHNSDLKEVNLLFPKKEEQIKIGKF----FKQLDDTIALHQQELTTLKQTKQG 176 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN-----I 256 + + K + + G V + + E Sbjct: 177 FLQKMFPKEGESVPEFRFPGFTGDWEQRRFENVLNKQDGIRRGPFGSALKKEFFVKDSDY 236 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 N I + E +E + E F R R + +++G+ Sbjct: 237 AVYEQQNAIYDNYETRYNITKEKFEELKNFQLSEGDFILSGAGTIGRISRVPKGIKQGVF 296 Query: 317 TSAYMAV--KPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSL-KFEDVKRLPVLVPPI 372 A + + DS Y +RS ++ + +L +VK+ V+VP Sbjct: 297 NQALIRFKIDENITDSEYFVQWIRSANMQRKLTGANPGSAMTNLVPMSEVKKWDVMVPSK 356 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 EQ I ++D ++ ++ + +LKE + +F+ T Sbjct: 357 NEQIKIGKF----FKQLDEMIALQQRDLDVLKETKKAFLQKMFT 396 >gi|154488696|ref|ZP_02029545.1| hypothetical protein BIFADO_02003 [Bifidobacterium adolescentis L2-32] gi|154082833|gb|EDN81878.1| hypothetical protein BIFADO_02003 [Bifidobacterium adolescentis L2-32] Length = 395 Score = 92.9 bits (229), Expect = 9e-17, Method: Composition-based stats. Identities = 63/406 (15%), Positives = 125/406 (30%), Gaps = 38/406 (9%) Query: 26 KVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKD-----GNSRQSD 74 K V I K +G T S I +IG + GK+L K+ Sbjct: 10 KKVTIGELGKTQSGGTPSSKHPEFFNGSIPWIGTTAL---NGKFLGKNDAVKLITEEAVA 66 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDV 133 S I + I+ G + + K I S + ++ + L Sbjct: 67 KSATKIVPEKSIMVG-IRVGVGKVAINAVPMCTSQDIVSIVGIDEASWNKEYISLALQYK 125 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + A +GAT++ K + I +P P+ EQ + + + ++ + + Sbjct: 126 APLLAAQAQGATIAGITSKTLKAIEIPAIPINEQNRVVDILRKLENQVGFVRKQLCGLDA 185 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L+K + + P +K G P A L Sbjct: 186 LVKSRFVEIFGDFACYETKP--LIKCVDCIEAGKSPKCLAFSRKMAEPGV-------LKL 236 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN-DKRSLRSAQVME 312 S I S Y K R++ L + +V +I+ + RS+ Sbjct: 237 SAISSGVYCENENKALPRSVSLTIDK-----VVHANDILLSRKNTPELVGRSVLVKHTDG 291 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ---SLKFEDVKRLPVLV 369 + + P + + + L ++ G + ++ ++ +L + + Sbjct: 292 NIMFPDIIFRMHPLPPINAMYLSYLLAGPLLHSIQSLAHGSAKSMSNIPKSELAKLSIPI 351 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P + Q + N + +++D +Q I L+ S Sbjct: 352 PALNLQNEFANFV----SQVDKSRFVAQQQIEKLQMLYDSLAQEYF 393 >gi|145637803|ref|ZP_01793452.1| type I restriction/modification specificity protein [Haemophilus influenzae PittHH] gi|145268996|gb|EDK08950.1| type I restriction/modification specificity protein [Haemophilus influenzae PittHH] Length = 464 Score = 92.9 bits (229), Expect = 1e-16, Method: Composition-based stats. Identities = 67/467 (14%), Positives = 139/467 (29%), Gaps = 80/467 (17%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVES-GTGKY-LPKDGNSRQSDTST 77 +P++WK+V + K+ ++ +I D+ S TG + PK + + Sbjct: 11 KLPENWKLVRLGDIAKV-NEKSLTKKSQADFIRYIDISSVSTGAFDTPKLLKKDEIPSRA 69 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKD-VLPELLQGWLLSIDV 133 I + + P L++ + + I ST F V+ + L L + S Sbjct: 70 KRILRNNDFIISTVRPNLKQFSFIEEAQENLIASTGFCVISSNNSKLAWYLYSLITSDLF 129 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 T+ + I +G + K I + +P+P E I + I + + Sbjct: 130 TEYLVKISDGGAYPAFNPKEIEDAIIPLPDKD----NLEFISDTSRFFHKKIQLNTQINQ 185 Query: 194 LLKEKKQALVSYIVTKG--------------------LNPDVKMKDSGIEWV-------- 225 L++ QAL L + E + Sbjct: 186 TLEQIAQALFKSWFVDFDPVRAKVQALSDGLSLEQAELAAMQTISGKTPEELTALSQTQP 245 Query: 226 ----------------------GLVPDHWEVKPFFALVTELNRK-----NTKLIESNILS 258 G VP WE+K L + K N + ++ Sbjct: 246 DRYAELAETAKAFPCEMVEVDGGEVPKGWEMKALSDLGQIICGKTPSKSNKEFFGDDVPF 305 Query: 259 LSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + ++ ++ T N+ + +Y++ + + I I + Sbjct: 306 IKIPDMHNQVFITQTTDNLSVVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQ 365 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ--SLKFEDVKRLPVLVPPI 372 I + + +L ++ + K + SG +L ++ ++ P Sbjct: 366 INS----IIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATLNLNTSTFSKIEIITPS- 420 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +I + + I L E R + + G+I Sbjct: 421 ---KEIIYIFTKKVVSIFEKTLSNSIENKRLTEIRDLLLPRLLNGEI 464 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 46/134 (34%), Gaps = 7/134 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESG-TGKYLPKDGNSR 71 G +PK W++ + ++ G+T G D+ +I + D+ + + + Sbjct: 268 GEVPKGWEMKALSDLGQIICGKTPSKSNKEFFGDDVPFIKIPDMHNQVFITQTTDNLSVV 327 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++ + I + ++ + ++ + E L L Sbjct: 328 GANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQINSIIPDDEQSCEFLYLSLKQP 387 Query: 132 DVTQRIEAICEGAT 145 +T+ ++ + G T Sbjct: 388 SMTKYLKDLASGGT 401 >gi|227517377|ref|ZP_03947426.1| type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecalis TX0104] gi|227075176|gb|EEI13139.1| type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecalis TX0104] Length = 390 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 63/396 (15%), Positives = 129/396 (32%), Gaps = 27/396 (6%) Query: 32 RFTKLNTGRTSES---GKDIIYIGLEDV--ESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 F G E G + + DV G + K + +G I Sbjct: 3 EFYDFKNGLNKEKEFFGSGVPIVNFVDVFHNRGLTPEMLKGRVTLSKKEIKNFEVKQGDI 62 Query: 87 LYGKLGPYLRKAIIADFD------GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + + + + + S L + + V P + Sbjct: 63 FFTRTSETINEIGYPSVMLGVPTDTVFSGFVLRGRARSVDPMDNLFKRYVFFTESFRNEM 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + ++M+ I + KI A +ID I R ++ LKE K+ Sbjct: 123 VKKSSMTTRALTSGTAIKEMYVQYPSSKDEQHKIGAFLAQIDDTIALHQRELDQLKELKK 182 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A + + K++ + E WE FF + + + +N +L S+ LS Sbjct: 183 AYLQLMFPVKDERVPKLRFADFEG------EWEQCKFFDMWEKSSDRNKELKYSSKDVLS 236 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + RN E +TY I+ G+I F ++ ++ GI++ + Sbjct: 237 VAKMTKNPVERNS--SDEYMKTYNILHYGDIAFEGNKSKDYSFGRFVLNNLQDGIVSHVF 294 Query: 321 MAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL---KFEDVKRLPVLVPPIKEQF 376 + KP +D ++ + + K + + +D+ + + +P + EQ Sbjct: 295 IVFKPKVKMDIDFMKVYINNEYFMKHHLVKATTKTLMMTTLNVQDMNKQKLRIPSLNEQE 354 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I +D + + + LK + S++ Sbjct: 355 RIGKF----FKELDHAITLHQNKLTQLKSLKKSYLQ 386 >gi|294341644|emb|CAZ90063.1| putative Type I restriction-modification system (Specificity subunit) [Thiomonas sp. 3As] Length = 393 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 60/408 (14%), Positives = 134/408 (32%), Gaps = 41/408 (10%) Query: 29 PIKRFTKLNT--GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 P++ LN G ++ + ++ + + + K + + + F G + Sbjct: 8 PLREVALLNPRLGEKLDANAFVSFVPMASLSAEDAKVTSVEQRPYAEVSKGYTPFKSGDV 67 Query: 87 LYGKLGPYLRK-----AIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEA 139 L K+ P ++ + G ST+F V++P L +L + E Sbjct: 68 LVAKITPCFENGKISQVLLPETYGFGSTEFHVVRPLPNKSDARYLHHFLRLGTIRIEGER 127 Query: 140 ICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ + + +P+PP+ EQ I + + +L Sbjct: 128 RMTGSGGQRRVPENFLAELSIPLPPVPEQRRIAAILDQADALRAKRREALAQLDKLT--- 184 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 QA+ + + + + +E VT+ ++ K I Sbjct: 185 -QAIFVEMFGDLESNVNGLPVTNLE------------DLCVRVTDGTHQSPKWEPDGIPF 231 Query: 259 LSYGNIIQKLETRNMGL-----KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 L NI+ + + +D G+I+F + + + + Sbjct: 232 LFISNILNGEISYSTEKFISRETYHELTRRCAIDAGDILFTTVGSYGNTAVVSGER---E 288 Query: 314 GIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP 370 +KP+ DS++ A ++ S + + + G ++++ D+K L V P Sbjct: 289 FCFQRHIAHIKPNAEKLDSSFCAAMLESASVRRQIDKVARGVAQKTINLADLKALRVFYP 348 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PI++Q + + + QS+ + + A G+ Sbjct: 349 PIEKQKSF----TTKQGLVKSIKAIQAQSLREFDDLFVTLQHRAFRGE 392 >gi|300727766|ref|ZP_07061150.1| type I restriction-modification system, endonuclease S subunit [Prevotella bryantii B14] gi|299774976|gb|EFI71584.1| type I restriction-modification system, endonuclease S subunit [Prevotella bryantii B14] Length = 375 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 75/382 (19%), Positives = 133/382 (34%), Gaps = 31/382 (8%) Query: 34 TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP 93 +T YIGLE ++S + N I KG IL+GK Sbjct: 10 FNSTAKKTPTESDKEHYIGLEHIDSECLEITRWGSNVAPIGE--KLIMKKGDILFGKRRA 67 Query: 94 YLRKAIIADFDGICSTQFLVLQPKDVLPELLQG--WLLSIDVTQRIEAICEGATMSHADW 151 Y RK IA FDGI S +VL+P + + + ++ S +R I G +W Sbjct: 68 YQRKLAIAPFDGIFSAHGMVLRPNEEVVDKNYFPFFMSSDLFMERAVQISVGGLSPTINW 127 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 K + P+P LAEQ ++ +K+ A + + +E ++ + Sbjct: 128 KDLREQEFPLPSLAEQKVLADKLWAAYRL----KESYKKLLAATEEMVKSQFIEMFYNEK 183 Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 P K+K + + + + +S + L NII Sbjct: 184 YPLQKLKT-------------HIDVIRGVSYKPVDIKEETSDSISVILRSNNIINGQINF 230 Query: 272 NMGLKPESYE--TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKP 325 + + ++ T Q++ G+IV + + TS Sbjct: 231 DDVVYVDNKRVTTEQVLSKGDIVMCGSNGSKKLVGKAAMINTIPSYRTSFGAFCLGIRCK 290 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 I YL+ ++ +V +GSG ++K E + L + +P +++Q + Sbjct: 291 ESILPEYLSVYFQTPKYREVIEFLGSGSNILNIKPEHIYNLEIPIPSLEDQKHFVTIAEQ 350 Query: 385 ETA---RIDVLVEKIEQSIVLL 403 I +E I+ I L Sbjct: 351 ADKSGFEIRKSIEAIDNVIKSL 372 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 11/133 (8%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLM 337 I+ G+I+F K ++ GI ++ M ++P+ D Y + M Sbjct: 49 IGEKLIMKKGDILFGKRRAYQRKLAIAPFD----GIFSAHGMVLRPNEEVVDKNYFPFFM 104 Query: 338 RSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + GL ++ ++D++ +P + EQ + + + L E Sbjct: 105 SSDLFMERAVQISVGGLSPTINWKDLREQEFPLPSLAEQKVLADKLWAAY----RLKESY 160 Query: 397 EQSIVLLKERRSS 409 ++ + +E S Sbjct: 161 KKLLAATEEMVKS 173 >gi|168207082|ref|ZP_02633087.1| putative type I restriction-modification enzyme, S subunit, EcoA family [Clostridium perfringens E str. JGS1987] gi|170661517|gb|EDT14200.1| putative type I restriction-modification enzyme, S subunit, EcoA family [Clostridium perfringens E str. JGS1987] Length = 394 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 47/398 (11%), Positives = 117/398 (29%), Gaps = 30/398 (7%) Query: 24 HWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + + G ++ S I ++ + G+ + + Sbjct: 16 EWEEKKLGSIGEFFKGSGISKSDLSESGKECILYGELYTTYGEVITSIRSKTDISLKNAV 75 Query: 80 IFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ G + + + V +P L + L+ + Sbjct: 76 LSKINDVIIPSSGETAVDIATASCVMKDNVLLGGDLNVFRPNK-DNGLFISYQLNNAKKK 134 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I I +GA++ H + + + + P L EQ I I+ + Sbjct: 135 EIAKIAQGASVVHIYNEQLKKVKVDTPSLQEQEKIANFFSILDELIEEQEGKVKDLELYK 194 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K Q + + + GL WE K + + + Sbjct: 195 KGMMQKIFKQEIRFKDD------------NGLDYPEWEEKKITEIFNITRGQVIAKTSIS 242 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + + +Y GE + D N + + + + Sbjct: 243 PIKIDRSIYPVYSSQTSNYGILGYDSSYDF--DGEFLTWTTDGAN---AGKVFKRNGKFR 297 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 T+ + I + ++ + + L + + + +P ++EQ Sbjct: 298 CTNVCGLLVEKDITKGFANEFIKEILEKETPKHVSYIGNPKLMNGVIGDIKIRIPLLEEQ 357 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + + + ID +VE+ ++++ L+E + S + Sbjct: 358 RKIADFL----SNIDKIVEEEKKNLADLREMKKSLLQQ 391 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 26/211 (12%), Positives = 72/211 (34%), Gaps = 6/211 (2%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ K+ EW + FF ++ + IL ++ T Sbjct: 6 PKLRFKEFSDEW--EEKKLGSIGEFFKGSGISKSDLSESGKECILYGELYTTYGEVITSI 63 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 S + + +++ + S + + ++ +P+ + + Sbjct: 64 RSKTDISLKNAVLSKINDVIIPSSGETAVDIATASCVMKDNVLLGGDLNVFRPNKDNGLF 123 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ + + ++ + E +K++ V P ++EQ I N + +D L Sbjct: 124 ISYQLNNAKKKEIAKIAQGASVVHIYNEQLKKVKVDTPSLQEQEKIANF----FSILDEL 179 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +E+ E + L+ + + +I + Sbjct: 180 IEEQEGKVKDLELYKKGMMQKIFKQEIRFKD 210 >gi|257467222|ref|ZP_05631533.1| putative type I restriction-modification system, specificity determinant; restriction endonuclease [Fusobacterium gonidiaformans ATCC 25563] Length = 422 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 49/394 (12%), Positives = 108/394 (27%), Gaps = 33/394 (8%) Query: 26 KVVPIKRFTKLNTGR---------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + I + + K+ I + E K +D D S Sbjct: 14 EWKKIGDIITKFSEKQRNKVNLKLVYTVSKEYGLISSK--EYWKNKERREDYTVYSEDLS 71 Query: 77 TVSIFAKGQILYGKLGPYLRK--AIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSID 132 +I K Y + + +GI S + + + + ++ S Sbjct: 72 NYNIIKKNMFAYNPARLNIGSIDCLFDREEGILSPMYTIFSIDEEIINSKYLLYFIKSPK 131 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + I E D+ I +PIP L Q I + + T + L E + Sbjct: 132 ILKIINDKKEEGARFRFDFNRWKKIEIPIPSLETQEKIVKILDNFTNYVTELQAELQAEL 191 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + ++ Q ++++G ++ E E+ +V K Sbjct: 192 QARVKQYQYYRDMLLSEG-----YLRKISEERFLKTNSVIEIYKLNEVVEIKRGKRLVKS 246 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 Q E + S + + + E Sbjct: 247 -------------QLSELEKYPVFQNSLIPLGYYKDKNFEGNKTCIISAGAAGDIFYQAE 293 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + P + + + L ++V+++ VL+P + Sbjct: 294 DFWAADDVFVLSPSKKIVDKYLYYFLLSKQEFIKSKVRKASIPRLSRDEVEKIDVLIPSL 353 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + Q I V++ + + + Q I +++ Sbjct: 354 ELQNKIVEVLDKFQSLLSDTKGLLPQEIEQRQKQ 387 >gi|301062613|ref|ZP_07203245.1| type I restriction modification DNA specificity domain protein [delta proteobacterium NaphS2] gi|300443293|gb|EFK07426.1| type I restriction modification DNA specificity domain protein [delta proteobacterium NaphS2] Length = 422 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 55/399 (13%), Positives = 121/399 (30%), Gaps = 25/399 (6%) Query: 30 IKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + + GRT G ++ + D++ + ++ + + + Sbjct: 6 LGDICDIVIGRTPSRSVPEYWGTGYPWVTISDLKEKHIWHTKEEITQNAIEKVKCRLIPR 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G +L+ + K A + + L KD ++ V + + + Sbjct: 66 GTLLFS-FKLTIGKMAFAARNLYTNEAIAGLLIKDPKKLCSDYLFYAMKVAKLLGSNQAV 124 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + L +Q+ I + I T EL L Sbjct: 125 MGKTLNSKSLALIKVPVPEHLEDQLHIATLLSRLEALIATRKDNLRMLDEL-------LK 177 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + NP K +G + + + N +S N Sbjct: 178 SIFLEMFGNPVKNEKTWQTAHLGNLARVERGRFSPRPRNDPKFYNGNFPFIQTRDISRAN 237 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 +L + L + + G +V + + ++ + + Sbjct: 238 --GRLTEYSQTLNDLGIKVSKEFKNGTVVIAIVGATIGETAILQVDTYATDSVIG--ITP 293 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P ID+ YL +L+R V A R ++ +K L +++PP + + Sbjct: 294 LPERIDAVYLEFLLR--FWKPVLKARAPEAARANININTLKPLNIILPP----KHLVSNF 347 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + +++ + +Q++ LKE + A G++DL Sbjct: 348 VLIVQKVESIKSLYQQNLKGLKELYGTLSQKAFKGKLDL 386 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 61/198 (30%), Gaps = 12/198 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 K W+ + ++ GR S + +I D+ G+ Sbjct: 192 KTWQTAHLGNLARVERGRFSPRPRNDPKFYNGNFPFIQTRDISRANGRLTEYSQTLNDLG 251 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 F G ++ +G + + I D + + + P + + L Sbjct: 252 IKVSKEFKNGTVVIAIVGATIGETAILQVDTYATDSVIGITPLPERIDAVYLEFLLRFWK 311 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++A A ++ + + + + +PP +++++ + + ++ Sbjct: 312 PVLKARAPEAARANININTLKPLNIILPPKHLVSNFVLI----VQKVESIKSLYQQNLKG 367 Query: 195 LKEKKQALVSYIVTKGLN 212 LKE L L+ Sbjct: 368 LKELYGTLSQKAFKGKLD 385 >gi|293570792|ref|ZP_06681841.1| type I restriction-modification system, S subunit, putative [Enterococcus faecium E980] gi|291609145|gb|EFF38418.1| type I restriction-modification system, S subunit, putative [Enterococcus faecium E980] Length = 495 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 64/439 (14%), Positives = 130/439 (29%), Gaps = 58/439 (13%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----------GKDIIYIGLEDVESGTGK 62 S + + IP+ W+ + + TG + G YIG +DV Sbjct: 59 SEDEVLFDIPESWEWTRMSNIADMYTGNSIPKTIKENKYSKVGNGYDYIGTKDVGFDYTI 118 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLP 121 +G + K IL G RK I D + P Sbjct: 119 NYD-NGIKIPFEEDKFRNSFKDSILMCIEGGSAGRKIGILDKTVCFGNKLCSFNLIYGEP 177 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L +L S Q G + + I +P+PPL EQ I KI + Sbjct: 178 RFLYYYLQSPLFFQAFRDEMTG-IIGGVSITKLKGIIVPLPPLEEQKRIVAKIEELMPYV 236 Query: 182 DTLITERIRFIELLKEK----KQALVSYIVTKGLNPDVK--------------------- 216 D EL K+ +++++ Y + L + Sbjct: 237 DKYDVAYSEVEELNKKFPEDIQKSILQYAIQGKLVEQREEDGTAEDLYKQIQEEKKKLIK 296 Query: 217 ----------MKDSGIEWVGLVPDHWEVKPFFALVT---ELNRKNTKLIESNILSLSYGN 263 + + E +P++W+ L+ + K + + +S + Sbjct: 297 EGKIKKTKALPEITEDEIPFDIPENWKWVRLGDLLYKLTDGTHSTPKYTATGVPFISVKD 356 Query: 264 IIQKLETRNMGL-----KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 I + + E+ + +I+ + + ++ Sbjct: 357 ISSGEIDFSNTKFISREEHEALYKRCDPERDDILLTKVGTTGIPV-IVDTDKEFSLFVSV 415 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFD 377 A + I + Y +++++ + G+ ++ D+ + + P+ EQ Sbjct: 416 ALLKFNTDLIFNKYFMYVIKAPVVQIQARENTRGVGNKNWVMRDIANTVLPLSPLAEQNR 475 Query: 378 ITNVINVETARIDVLVEKI 396 I I + L++K+ Sbjct: 476 IVEKIEELLPYTNQLIKKV 494 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 29/215 (13%), Positives = 70/215 (32%), Gaps = 20/215 (9%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP-- 277 S E + +P+ WE + + GN + T+++G Sbjct: 59 SEDEVLFDIPESWEWTRMSNIADMYTGNSIPKTIKENKYSKVGNGYDYIGTKDVGFDYTI 118 Query: 278 ----------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 E + ++ K + + + + + Sbjct: 119 NYDNGIKIPFEEDKFRNSFKDSILMCIEGGSAGRKIGI----LDKTVCFGNKLCSFNLIY 174 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET- 386 + +L + ++S + F +G+ + +K + V +PP++EQ I I Sbjct: 175 GEPRFLYYYLQSPLFFQAFRDEMTGIIGGVSITKLKGIIVPLPPLEEQKRIVAKIEELMP 234 Query: 387 --ARIDVLVEKIEQSIVLL-KERRSSFIAAAVTGQ 418 + DV ++E+ ++ + S + A+ G+ Sbjct: 235 YVDKYDVAYSEVEELNKKFPEDIQKSILQYAIQGK 269 >gi|75674466|ref|YP_316887.1| restriction endonuclease S subunits [Nitrobacter winogradskyi Nb-255] gi|74419336|gb|ABA03535.1| restriction endonuclease S subunit [Nitrobacter winogradskyi Nb-255] Length = 444 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 54/440 (12%), Positives = 117/440 (26%), Gaps = 47/440 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W V + + + + Y + +G G + + + G Sbjct: 5 WPTVALGDLLR-RSEHIIPLDPEATYKEVTVRINGKGVVERRQVQGVEIAANRRYQAKSG 63 Query: 85 QILYGKLGPYLRKAIIADFD---GICSTQF--LVLQPKDVLPELLQGWLLSIDVTQRIEA 139 Q + ++ + + + + + F + + L + + + Sbjct: 64 QFIISRIDARHGASGLIPDELDGAVVTNDFPLFDVAEDRLDAAFLGWMSKTASFVELCKR 123 Query: 140 ICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG T + +P+PPL EQ I +I ++ R IE ++ Sbjct: 124 ASEGTTNRVRLSEDRFKALSIPLPPLDEQRRIVARIEELAAKVKEARGLRAAAIEEVEAH 183 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 A++ L V K S E + K Sbjct: 184 WPAILRLAFDGKLVSLVPFKASAQEILKQAATFHANYQETKNNNAYPNKPQISDNGPYAL 243 Query: 259 LSYGNIIQ--------------------------KLETRNMGLKPESYETYQIVDPGEIV 292 + L+T N+ + + PG+ Sbjct: 244 PTGWCWTTLGSVLTHMVDCVNDTPNFSEVDTGLLGLKTTNIRPYRLDLQRRWYMTPGDFA 303 Query: 293 FRFIDLQNDKRSLRSAQVMERGIIT-------------SAYMAVKPHGIDSTYLAWLMRS 339 + + G + + + I S YL + S Sbjct: 304 SWNRRQPPQAGDIVLTREAPVGNVCMLPEGISACLTQRLMLLRAENRVIQSRYLLHFLNS 363 Query: 340 YDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 A G G ++ D + +PP+++Q I ++ +++D + + Sbjct: 364 PCFTDQIAASGRGQTHPHIRVGDAPHFLLPLPPMEQQVKIVAELDALQSKLDSVKALQTE 423 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 + L + + A TG+ Sbjct: 424 TAAELDAMLPAILDKAFTGE 443 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 66/202 (32%), Gaps = 11/202 (5%) Query: 21 IPKHWKVVPIKRF----TKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W + SE ++ + ++ + + Sbjct: 243 LPTGWCWTTLGSVLTHMVDCVNDTPNFSEVDTGLLGLKTTNIRPYRLDLQRRWYMTPGDF 302 Query: 75 TSTVSIFAK--GQILYGKLGPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLL 129 S G I+ + P ++ + C TQ L + + + L +L Sbjct: 303 ASWNRRQPPQAGDIVLTREAPVGNVCMLPEGISACLTQRLMLLRAENRVIQSRYLLHFLN 362 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S T +I A G T H + +P+PP+ +QV I ++ A ++D++ + Sbjct: 363 SPCFTDQIAASGRGQTHPHIRVGDAPHFLLPLPPMEQQVKIVAELDALQSKLDSVKALQT 422 Query: 190 RFIELLKEKKQALVSYIVTKGL 211 L A++ T L Sbjct: 423 ETAAELDAMLPAILDKAFTGEL 444 >gi|60681038|ref|YP_211182.1| putative type I restriction-modification specificity protein [Bacteroides fragilis NCTC 9343] gi|60492472|emb|CAH07242.1| putative type I restriction-modification specificity protein [Bacteroides fragilis NCTC 9343] Length = 457 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 57/437 (13%), Positives = 117/437 (26%), Gaps = 70/437 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IPK W+ +++ T L + + ++ +++ P + + Sbjct: 24 EIPKGWEWCRLRQITSLLGDGIHGTPEYDPNGEYYFVNGNNLQDKKIVIKPDTKKVSREE 83 Query: 75 TSTVSI-FAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSID 132 K +L G D + + + L E + L S Sbjct: 84 YLKYKKNLNKHTVLVSINGTLGNIGFYNDEPIMLGKSACYFNLIVEDLKEYVYILLQSPF 143 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE---KIIAETVRIDTLITERI 189 + G T+ + + N+ +P+PPL EQ I + + + + T Sbjct: 144 FMEYTLKAATGTTIKNVSLMAMNNLLIPLPPLCEQNRIVDRMTILDTKVKQYQKQETCLR 203 Query: 190 RFIELLKEK-KQALVSYIVTKGLNPDV------------------------KMKDS---- 220 + K++++ + L P + K+K S Sbjct: 204 ELNNNIYSILKKSILQDAIQGKLVPQIAEEGTAEELLAEIHKEKERLVKEGKLKKSALTD 263 Query: 221 ----------------------GIEWVGLVPDHWEVKPFFALVTELNRK------NTKLI 252 E + +PD W L K + Sbjct: 264 SIIFKGDDNKYYERIGGKDICIDDEILFEIPDSWVWCRLGFLFNHNTGKALNASNKEGSM 323 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I + + L + +S V G+++ Sbjct: 324 LPYITTSNLYWGQFDLSSVRQMYFKDSEIEKCSVSNGDLLVCEGGDIGRAAIWP---YDT 380 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 I + ++ + T + + + Q L + + + V +PPI Sbjct: 381 PMCIQNHIHKLRSYNQLDTLFYYYIFQAYKYNGYIGGKGIGIQGLSSKALHNMLVPLPPI 440 Query: 373 KEQFDITNVINVETARI 389 EQ IT+ I+ I Sbjct: 441 NEQIRITSKISSLFQFI 457 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 64/215 (29%), Gaps = 18/215 (8%) Query: 218 KDSGIEWVGLVPDHWEVKPFFAL----VTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 K E +P WE + ++ ++ N+ K Sbjct: 15 KCIDEEIPFEIPKGWEWCRLRQITSLLGDGIHGTPEYDPNGEYYFVNGNNLQDKKIVIKP 74 Query: 274 GLKPESYETYQIVDPG-EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 K S E Y + + ++ + SA Y Sbjct: 75 DTKKVSREEYLKYKKNLNKHTVLVSINGTLGNIGFYNDEPIMLGKSACYFNLIVEDLKEY 134 Query: 333 LAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + L++S + A +++ + L + +PP+ EQ I + +D Sbjct: 135 VYILLQSPFFMEYTLKAATGTTIKNVSLMAMNNLLIPLPPLCEQNRIVDR----MTILDT 190 Query: 392 LVEKIEQSIVLLKE--------RRSSFIAAAVTGQ 418 V++ ++ L+E + S + A+ G+ Sbjct: 191 KVKQYQKQETCLRELNNNIYSILKKSILQDAIQGK 225 >gi|297568979|ref|YP_003690323.1| restriction modification system DNA specificity domain protein [Desulfurivibrio alkaliphilus AHT2] gi|296924894|gb|ADH85704.1| restriction modification system DNA specificity domain protein [Desulfurivibrio alkaliphilus AHT2] Length = 458 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 54/459 (11%), Positives = 133/459 (28%), Gaps = 66/459 (14%) Query: 24 HWKVVPIKR----FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK---DGNSRQSDTS 76 W + +K ++G YI + ++ G + + + + Sbjct: 4 EWVRLTLKEAGVSLLDCVHKTPPDAGDGYPYIAIPQMKEGRIDFNANPRLISAADLEEWT 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDV 133 + + ++ + A + + L V P L+ + Sbjct: 64 KKANPQEDDVVLSRRCNPGETAYVPAGVRFALGQNLVLLRSDSSRVYPPFLRWLANGPEW 123 Query: 134 TQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +++ GA I N +PIPP+ EQ I + + +I+ Sbjct: 124 WAQVDKYLNVGAVFDSLRCADIPNFELPIPPIEEQKAIAHILGSLDDKIELNRRMNATLE 183 Query: 193 ELLKEKKQA-------LVSYIVTKG---------------------------LNPDVKMK 218 + + ++ ++ + G + + Sbjct: 184 AMARALFKSWFVDFDPVIDNALAAGNPIPEPLQARAKARKALGDQRKPLPEAIQKQFPSR 243 Query: 219 DSGIEWVGLVPDHWE-------VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 E +G VP+ WE T K + + L+ G + Q + T Sbjct: 244 FVSTEEMGWVPEGWEVSQISQLCTKIQNGGTPRKDKTEYWDDGTVPWLTSGEVRQNIITN 303 Query: 272 NMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + + + G V + + A V E A + P Sbjct: 304 TVNRITNLGLKNSSAKWLPSGATVIAMYGAT----AGQVAFVGEPLTTNQAVCGLIPKEP 359 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + +L + + +Q++ +++ V++PP+ ++ + Sbjct: 360 Y-RFFNYLTLERIVATLANQARGSAQQNISKGIIQQTKVVIPPVVL----GELLEKQVDN 414 Query: 389 I-DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I D ++ + L + R + + ++GQ+ + + Sbjct: 415 IFDKWIKNLNSQ-ETLAKIRDTLLPKLISGQLRIPDAEK 452 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 58/194 (29%), Gaps = 10/194 (5%) Query: 19 GAIPKHWKVVPIKRFT-KLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNS 70 G +P+ W+V I + K+ G T K + ++ +V + Sbjct: 251 GWVPEGWEVSQISQLCTKIQNGGTPRKDKTEYWDDGTVPWLTSGEVRQNIITNTVNRITN 310 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 S+ G + G + + L PK+ P +L Sbjct: 311 LGLKNSSAKWLPSGATVIAMYGATAGQVAFVGEPLTTNQAVCGLIPKE--PYRFFNYLTL 368 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + G+ + I + IPP+ L+ +++ + + + Sbjct: 369 ERIVATLANQARGSAQQNISKGIIQQTKVVIPPVVLGELLEKQVDNIFDKWIKNLNSQET 428 Query: 191 FIELLKEKKQALVS 204 ++ L+S Sbjct: 429 LAKIRDTLLPKLIS 442 >gi|238854454|ref|ZP_04644794.1| type IC specificity subunit [Lactobacillus jensenii 269-3] gi|282932599|ref|ZP_06338020.1| type IC specificity subunit [Lactobacillus jensenii 208-1] gi|313472061|ref|ZP_07812553.1| type I restriction-modification system, specificity subunit [Lactobacillus jensenii 1153] gi|238832947|gb|EEQ25244.1| type IC specificity subunit [Lactobacillus jensenii 269-3] gi|239530090|gb|EEQ69091.1| type I restriction-modification system, specificity subunit [Lactobacillus jensenii 1153] gi|281303295|gb|EFA95476.1| type IC specificity subunit [Lactobacillus jensenii 208-1] Length = 390 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 75/380 (19%), Positives = 129/380 (33%), Gaps = 31/380 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK F+ T ++ D I E++ SG GK + F KG Sbjct: 14 WKNKKFLTFSSKITKNSTSDDIDFPRIEFENIVSGEGKLAQNRSKLNHIK--SGIKFDKG 71 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL+GKL PYL+ +A+F G+ F V++ K +L+ + +++ G Sbjct: 72 DILFGKLRPYLKNWWLAEFPGVAVGDFWVIRAK--DNRYFLYYLIQAPLFEKVSNYTTGT 129 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 M +DW + N +P + EQ I + + + + K L Sbjct: 130 KMPRSDWNYVSNTFFKLPKIDEQEKIGRILDKVDSLLSLQHRKMELENQTSKAIYNYLFD 189 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + K E + L+ KN Sbjct: 190 KNKPFYFKDNKTKKVFLKE-------------LGTTYSGLSGKNKTDFGHGKAKYITYLN 236 Query: 265 IQKLETRNMGLKP--ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITSAY 320 + K N L E + V G+I+F ++ L S + + S Sbjct: 237 VNKNTIANHNLLDLIEIDKKQNEVLNGDILFTISSETPEEVGLASLWPYDDTNIYLNSFC 296 Query: 321 MAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDI 378 +P+ I++ +LA+ +RS + K Y + G+ R +L + V L V VP EQ Sbjct: 297 FGFRPNSKINNLWLAYELRSLKIRKNMYKLAQGISRYNLSKKSVLNLQVDVPSDAEQN-- 354 Query: 379 TNVINVETARIDVLVEKIEQ 398 ++ L+ + Sbjct: 355 ------FDSKFVKLINIQTK 368 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 60/180 (33%), Gaps = 10/180 (5%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGE 290 F + KN+ + + + + NI+ + K ++ D G+ Sbjct: 13 PWKNKKFLTFSSKITKNSTSDDIDFPRIEFENIVSGEGKLAQNRSKLNHIKSGIKFDKGD 72 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I+F + L G+ + ++ + +L +L+++ KV Sbjct: 73 ILFGKLRPYLKNWWLAEF----PGVAVGDFWVIRAKD-NRYFLYYLIQAPLFEKVSNYTT 127 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + V +P I EQ I +++ ++D L+ + + L + + Sbjct: 128 GTKMPRSDWNYVSNTFFKLPKIDEQEKIGRILD----KVDSLLSLQHRKMELENQTSKAI 183 >gi|227893572|ref|ZP_04011377.1| conserved hypothetical protein [Lactobacillus ultunensis DSM 16047] gi|227864624|gb|EEJ72045.1| conserved hypothetical protein [Lactobacillus ultunensis DSM 16047] Length = 406 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 52/406 (12%), Positives = 132/406 (32%), Gaps = 34/406 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + G+++ I I ++ + + K + D + Sbjct: 20 DWEQRKLNDIGNFYYGKSAPKWSVTNNGGIPCIRYGELYTKYSTKIDKILSFTSIDKDKL 79 Query: 79 SIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + ++L ++G + + A + + + + P + +L + + Sbjct: 80 KFSSGHEVLIPRVGEEPLDFAKHASWLSVPNVAIGEMITVFNTKEDPLFIANYLRAKYIV 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + EG +S+ + + IP E+ + + I +I+ LI+ + R ++ Sbjct: 140 KFA-KFVEGGNVSNLYFDRYKYTNIFIPTKKEERSVSKLI----YKINKLISLQQRKMKE 194 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L KQ++ I+ + K W + + N K Sbjct: 195 LNSLKQSISKLILENQSDKIRFCKFKESNW-----------KTYQFGQLYQKTNDKNKNV 243 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 N E + Y I G+IVF + + + G Sbjct: 244 NDNFKIISVAGMDWGQSVTKSSKEYMKPYNITKLGDIVFEGHKNKQHEFGRFIENTLGTG 303 Query: 315 IITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSG---LRQSLKFEDVKRLPVLV 369 +++ + +P D + + + S ++ M + + +L +D+K+ +++ Sbjct: 304 LVSHIFDVYRPKNEISDLNFWKFYINSENIMNRVLRMSTSSARMMNNLNNKDLKKQKIVI 363 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P +E I N++ + + ++LL+ + + Sbjct: 364 PGYEEMKKIGNLLLTLQEN----IGNSQTKLMLLRNIEKALLQDLF 405 >gi|297617309|ref|YP_003702468.1| restriction modification system DNA specificity domain protein [Syntrophothermus lipocalidus DSM 12680] gi|297145146|gb|ADI01903.1| restriction modification system DNA specificity domain protein [Syntrophothermus lipocalidus DSM 12680] Length = 422 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 53/421 (12%), Positives = 131/421 (31%), Gaps = 35/421 (8%) Query: 26 KVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDV---ESGTGKYLPKDGNSRQSDTSTV 78 K+ + + + + + + + +++ G + R Sbjct: 6 KLYKMSELCDITSSKRIYAADYKPEGVPFYRGKEIVEKHQGKLDVSTELFIDRVKFEQIR 65 Query: 79 SIF---AKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + F G +L +G ++ +F +++ L WLLS Sbjct: 66 AKFGTPKAGDLLLTSVGTLGVPYVVRHGEEFYFKDGNLTWFTNFRNLDNRFLYYWLLSPQ 125 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++++ G++ + + + +P Q I + A ID Sbjct: 126 GREQLKKCVIGSSQPAYTIALLKEMEICLPHFPIQRKIAAILSAYDDLIDNNNRRIRILE 185 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E Q + K P + +G +P+ WEVK LV Sbjct: 186 ----EMAQLIYREWFVKFRFPGYEKVRMVDSELGPIPEGWEVKRLSDLVDTQYGYTESAR 241 Query: 253 ESNI-------LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + ++ + I + + + E Y Y++ +V R D + Sbjct: 242 DLPVGPKYLRGTDINKNSYIDWDKVQFCTINDEDYRKYKLKQGDILVIRMADP----GKV 297 Query: 306 RSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDV 362 + + S + +K + YL + + S +G R+S + Sbjct: 298 GIVEQSVEAVFASYLIRLKIRSLSVAPYYLFYFLLSDRYQNYINRASTGTTRKSASASVI 357 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + +++PP +I ++ + + + +L+ R + ++G++++ Sbjct: 358 TDISLVIPP----KEIIDMFEEIIMGYRKFLNILLKQNTVLRRTRDLLLPKLISGELNVE 413 Query: 423 G 423 Sbjct: 414 D 414 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 62/182 (34%), Gaps = 14/182 (7%) Query: 3 HYKAYPQ--YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDII----YIGLEDV 56 + Y + DS +G IP+ W+V + G T ES +D+ Y+ D+ Sbjct: 200 RFPGYEKVRMVDSE---LGPIPEGWEVKRLSDLVDTQYGYT-ESARDLPVGPKYLRGTDI 255 Query: 57 E-SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 + + + + +G IL ++ + I+ +L+ Sbjct: 256 NKNSYIDWDKVQFCTINDEDYRKYKLKQGDILVIRMADPGKVGIVEQSVEAVFASYLIRL 315 Query: 116 PKDVL---PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 L P L +LLS I G T A I +I + IPP + E Sbjct: 316 KIRSLSVAPYYLFYFLLSDRYQNYINRASTGTTRKSASASVITDISLVIPPKEIIDMFEE 375 Query: 173 KI 174 I Sbjct: 376 II 377 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/198 (11%), Positives = 56/198 (28%), Gaps = 8/198 (4%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 MK S + I+ G + E +K Sbjct: 1 MKSSTKLYKMSELCDITSSKRIYAADYKPEGVPFYRGKEIVEKHQGKLDVSTELFIDRVK 60 Query: 277 PESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E + G+++ + +R + + + D+ +L + Sbjct: 61 FEQIRAKFGTPKAGDLLLTSVGTLGVPYVVRHGEEFYFKDGNLTWFTNFRNL-DNRFLYY 119 Query: 336 LMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + S + + + + +K + + +P Q I +++ D L++ Sbjct: 120 WLLSPQGREQLKKCVIGSSQPAYTIALLKEMEICLPHFPIQRKIAAILSA----YDDLID 175 Query: 395 KIEQSIVLLKERRSSFIA 412 + I +L+E I Sbjct: 176 NNNRRIRILEEMAQ-LIY 192 >gi|42528244|ref|NP_973342.1| type I restriction-modification system, S subunit [Treponema denticola ATCC 35405] gi|41819514|gb|AAS13261.1| type I restriction-modification system, S subunit [Treponema denticola ATCC 35405] Length = 532 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 54/451 (11%), Positives = 118/451 (26%), Gaps = 83/451 (18%) Query: 21 IPKHWKVVPIKRFTKLN-TGRTS--ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ--SDT 75 +P+ W + + G+T + + + + S Sbjct: 86 VPEGWAWCRLGEICEFISRGKTPVYTKESQYPVLAQKCNQWDGIRLDKVLFLDPNSLSKW 145 Query: 76 STVSIFAKGQILYGKLGP-YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + I+ G + + I D + F+V + + ++ + Sbjct: 146 TNEYHLQHEDIVINSTGTGTIGRVGIFDIGILGQYPFIVPDSHISVVRCYKVYIHRKYIY 205 Query: 135 QRIEAIC----------EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + K + +PIPPL+EQ I KI A +ID L Sbjct: 206 HIFTSEYLQTKINKVATGSTNQKELPKKVLTEFFIPIPPLSEQQRIVAKIEAIFAQIDLL 265 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPD------------------------------ 214 + +K+ K ++ + L P Sbjct: 266 EQNKADLQTAVKQAKSKILDLAIRGKLVPQDPADEPASVMLEKLHAEKEAKIAAGEIKRG 325 Query: 215 ----VKMKDS-----------------GIEWVGLVPDHWEVKPFFALV-------TELNR 246 K+S E +P++W+ + + Sbjct: 326 KNDSYIYKNSTDNCYYEKFFEKKDLCIDNEIPFELPENWQWTKLGRICDKLVDGDHNPPK 385 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD----PGEIVFRFIDLQNDK 302 + E ++S N + N+ + + + G+I F + Sbjct: 386 GIEEKTEYIMVSSRNINHNTVEDLENVRYLTKEMFDAENLRTNATAGDIFFTSVGSLGR- 444 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFED 361 I +++ + + Y+ + S +G + ++ Sbjct: 445 ---SCIYDGRMNICFQRSVSILNTKVYNKYVKFFFDSNFYQNYVAEHATGTAQMGFYLQE 501 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + +PPI EQ I I +D + Sbjct: 502 MAESFIAIPPISEQKRIVARIEEIFYVLDNI 532 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 71/216 (32%), Gaps = 15/216 (6%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES--------NILSLSYGNIIQKLE 269 KD E VP+ W + ++R T + + G + K+ Sbjct: 76 KDIEDEIPFAVPEGWAWCRLGEICEFISRGKTPVYTKESQYPVLAQKCNQWDGIRLDKVL 135 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII-----TSAYMAVK 324 + + Y + ++ + + ++ + + + Sbjct: 136 FLDPNSLSKWTNEYHLQHEDIVINSTGTGTIGRVGIFDIGILGQYPFIVPDSHISVVRCY 195 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 I Y+ + S L + +G ++ L + + + +PP+ EQ I I Sbjct: 196 KVYIHRKYIYHIFTSEYLQTKINKVATGSTNQKELPKKVLTEFFIPIPPLSEQQRIVAKI 255 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 A+ID+L + +K+ +S + A+ G+ Sbjct: 256 EAIFAQIDLLEQNKADLQTAVKQAKSKILDLAIRGK 291 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 59/173 (34%), Gaps = 9/173 (5%) Query: 20 AIPKHWKVVPIKRFT-KLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P++W+ + R KL G + E + I + ++ T + L + Sbjct: 359 ELPENWQWTKLGRICDKLVDGDHNPPKGIEEKTEYIMVSSRNINHNTVEDLENVRYLTKE 418 Query: 74 DTSTVSI---FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 ++ G I + +G R I IC + + + V + ++ + S Sbjct: 419 MFDAENLRTNATAGDIFFTSVGSLGRSCIYDGRMNICFQRSVSILNTKVYNKYVKFFFDS 478 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + G + + + IPP++EQ I +I +D Sbjct: 479 NFYQNYVAEHATGTAQMGFYLQEMAESFIAIPPISEQKRIVARIEEIFYVLDN 531 >gi|93213410|gb|ABC46685.1| Sau1hsdS1 [Staphylococcus aureus] Length = 419 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 62/407 (15%), Positives = 133/407 (32%), Gaps = 27/407 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + T + K + I + +Y K +S+ + ++ Sbjct: 20 EWEEKKLGDLTDRVIRKNKNLESKKPLTISGQLGLIDQTEYFSKSVSSKNLE--NYTLIK 77 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G+ Y K G+ S+ ++ K + + R Sbjct: 78 NGEFAYNKSYSNGYPLGAIKRLTRYDSGVLSSLYICFSIKSEMSKDFMEAYFDSTHWYRE 137 Query: 138 EAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + I + P L EQ I + ++D I + + Sbjct: 138 VSGIAVEGARNHGLLNVSVNDFFTILIKYPSLEEQQKIGKF----FSKLDRQIELEEQKL 193 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 ELL+++K+ + I ++ L + + +W + K + N + Sbjct: 194 ELLQQQKKGYMQKIFSQELRFKNENGNDYPDWERIKFFDVIDKVIDFRGRTPKKLNMEWS 253 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR----SA 308 + L+LS N+ + N+ K + + Y G +++ L + + Sbjct: 254 DEGYLALSAVNVKKGYIDFNVEAKYGNLDLYTRWMRGNELYKGQVLFTTEAPMGNVAQVP 313 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPV 367 + I +LA L+ S ++ + SG + + +++ RL V Sbjct: 314 DNKGYILSQRTIAFNSNEKITDNFLASLLSSENVYNDLLKLCSGATAKGVSQKNLNRLYV 373 Query: 368 LVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P I EQ +I +I+ LVE + I K ++ F+ Sbjct: 374 TIPHSISEQEEIAEF----FRKINQLVELQKYKIEHTKSQKQVFLQK 416 >gi|308270340|emb|CBX26952.1| hypothetical protein N47_A09810 [uncultured Desulfobacterium sp.] Length = 422 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 67/430 (15%), Positives = 143/430 (33%), Gaps = 48/430 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + + G+ E+ + Y+G + G Y + + Sbjct: 2 SEWVIDQLHNLLDFQKGKKVETSEIQRSGYERYLGAASLVGGHDGYASTRFSVKA----- 56 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 K +L G + G+ S+ L P + + L + L + I Sbjct: 57 ----NKDDVLMLWDGERSGLVG-HNLTGVVSSTVTKLSPNNKIISSLLYYYLLQSF-EWI 110 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G + H + + + P + I +D I + I ++ Sbjct: 111 QNRRTGTGVPHVPKDLMKILKLKYPKENKYQKKVALI---LETVDQAIEKTEALIYKYQQ 167 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIE--------WVGLVPDHWEVKPFFALVTELNR--- 246 K L+ + T+G+ D K++ + +G +P W++ + + + Sbjct: 168 IKAGLMHDLFTRGVTADGKLRPLREQAPELYKETPIGWIPKEWDIVRASDICHPITKGTT 227 Query: 247 -----------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 K+ I LS + + S V PG+I+ Sbjct: 228 PSTFINNANRIKSIPYIRVENLSFNGSLRFDMDSLFVSNIIHNSELARSKVFPGDILMNI 287 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGL 353 + K SL + + E + + H YL + + S K FY + Sbjct: 288 VGPPLGKVSLITDEYEEWNTNQAVSIYRVLHQRYRLYLLYYLLSDFAQKWFYLRSKRTSG 347 Query: 354 RQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +L E L + +P + E I+N+++ +I+ +E + + LK+++S + Sbjct: 348 QVNLTLEMCSNLEMPLPKNEGELASISNILSQIFEKIN--IENNFR--IKLKKQKSGLMN 403 Query: 413 AAVTGQIDLR 422 +TG++ + Sbjct: 404 DLLTGKVQVT 413 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 70/213 (32%), Gaps = 19/213 (8%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK-LNTGRTSES-------GKDIIYIGLEDVESGTG 61 YK++ + W IPK W +V + G T + K I YI +E++ Sbjct: 198 YKETPIGW---IPKEWDIVRASDICHPITKGTTPSTFINNANRIKSIPYIRVENLSFNGS 254 Query: 62 KYLPKD----GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVL 114 D N + S G IL +GP L K + + + + Sbjct: 255 LRFDMDSLFVSNIIHNSELARSKVFPGDILMNIVGPPLGKVSLITDEYEEWNTNQAVSIY 314 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + L + L D Q+ + T + + +P + + I Sbjct: 315 RVLHQRYRLYLLYYLLSDFAQKWFYLRSKRTSGQVNLTLEMCSNLEMPLPKNEGELAS-I 373 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 +I I F LK++K L++ ++ Sbjct: 374 SNILSQIFEKINIENNFRIKLKKQKSGLMNDLL 406 >gi|254779944|ref|YP_003058051.1| putative type I restriction enzyme specificity protein [Helicobacter pylori B38] gi|254001857|emb|CAX30107.1| Putative type I restriction enzyme specificity protein [Helicobacter pylori B38] Length = 362 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 42/401 (10%), Positives = 109/401 (27%), Gaps = 47/401 (11%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P +W+ V + ++ TG + ++ + + Y +++ Sbjct: 6 LPLNWQRVRLGDICEITTG-SLDANEMVHYGKYR-----------FYTCAKEYYFIDKYA 53 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F IL G Y+ + VL + + L++ + I+ Sbjct: 54 FDTEAILISGNGAYVGYVHYYKGKFNAYQRTYVLD-NFSEHIIFVKYFLTMFLQSHIQTN 112 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + + + +PPL EQ+ I + + +L ++ + K Sbjct: 113 RNEGNTPYIVMATLKDFEILLPPLNEQIAIANILSDVDRYLYSLDALILKKESVKKALSF 172 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L+S + + W+ + + Sbjct: 173 ELLSQ----------------RKRLKGFNQAWQRVRLGDICEITTGSLDANEMVHYGKYR 216 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + ++ + I + Y Sbjct: 217 FYTCAKEYYFIDKYAFDTEAI-------------LISGNGAYVGYVHYYKGKFNAYQRTY 263 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + ++ + + + + G + +K +L+PP+ EQ I N Sbjct: 264 VLDNFSEHI-IFVKYFLTMFLQSHIQTNRNEGNTPYIVMATLKDFEILLPPLNEQIAIAN 322 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +++ I L K Q + + + ++ +I + Sbjct: 323 ILSDLDNEIISLKNKKSQ----FENIKKALNHDLMSAKIRV 359 >gi|229164779|ref|ZP_04292611.1| Type I restriction modification system, specificity subunit [Bacillus cereus R309803] gi|228618682|gb|EEK75676.1| Type I restriction modification system, specificity subunit [Bacillus cereus R309803] Length = 269 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 47/272 (17%), Positives = 102/272 (37%), Gaps = 28/272 (10%) Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 +KI A ++ I + IE ++ K+ L+ + TKG+ + +G +P Sbjct: 5 KKITAILSNVEEAIKKTEAVIEQTEKVKKGLMQQLFTKGIGHKDYKQTV----IGEIPRK 60 Query: 232 WEVKPFFALVTELN----RKNTKLIESNILSLSYGNIIQKLETRNMGL----KPESYETY 283 W++ P L+ + K + S + + + L Sbjct: 61 WDIYPLRDLIIGGSQNGLYKPKEYYGSGFGMVHMREMFKGEVLDISALQMVNTSVGENEK 120 Query: 284 QIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRS 339 ++ G+I+F + + + + E S+ + + P+ I +L RS Sbjct: 121 FSLNEGDILFARRSVVYEGAGTPVYVPKHTEPITFESSIIRITPNQDFILPMFLNLYFRS 180 Query: 340 Y----DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++ ++ + + ED+ L V VP + EQ I N + + R E Sbjct: 181 PVGRVNMQRIIRRLAVSG---ISSEDLLGLYVPVPSLDEQKQIVNSLAGVSKR----KEI 233 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDLR-GESQ 426 E+ I L + + + + +TG++ ++ E + Sbjct: 234 EEKKISSLTKVKQGLMQSLLTGKVRVKVDEDE 265 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 8/45 (17%), Positives = 20/45 (44%), Gaps = 4/45 (8%) Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + EQ IT ++ + ++ ++K E I ++ + + T Sbjct: 1 MNEQKKITAIL----SNVEEAIKKTEAVIEQTEKVKKGLMQQLFT 41 Score = 41.3 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 69/217 (31%), Gaps = 17/217 (7%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKR--FTKLNTGRTSESGKDIIYIGLEDVES-GTGKYLPK 66 YK + IG IP+ W + P++ G G+ + G+ L Sbjct: 49 YKQT---VIGEIPRKWDIYPLRDLIIGGSQNGLYKPKEYYGSGFGMVHMREMFKGEVLDI 105 Query: 67 DG---NSRQSDTSTVSIFAKGQILYGKLGPY----LRKAIIADFDGIC----STQFLVLQ 115 + + +G IL+ + + S + Sbjct: 106 SALQMVNTSVGENEKFSLNEGDILFARRSVVYEGAGTPVYVPKHTEPITFESSIIRITPN 165 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 +LP L + S ++ I +S + + + +P+P L EQ I + Sbjct: 166 QDFILPMFLNLYFRSPVGRVNMQRIIRRLAVSGISSEDLLGLYVPVPSLDEQKQIVNSLA 225 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + R + + ++ + Q+L++ V ++ Sbjct: 226 GVSKRKEIEEKKISSLTKVKQGLMQSLLTGKVRVKVD 262 >gi|240949220|ref|ZP_04753564.1| restriction modification system, specificity subunit [Actinobacillus minor NM305] gi|240296336|gb|EER46980.1| restriction modification system, specificity subunit [Actinobacillus minor NM305] Length = 384 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 50/389 (12%), Positives = 111/389 (28%), Gaps = 24/389 (6%) Query: 31 KRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNS--RQSDTSTVSIFAKGQ 85 + G S + +++ S ++ + S +G Sbjct: 2 GDAADVRDGTHSSPNYYETGYPLVTSKNLTEYGLDLSDVSFISLCDFNEINKRSKVDEGD 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G +G ++ L+ + +++ L L S I G T Sbjct: 62 ILLGLIGTIGNPILVDKSGYAIKNVGLIKEKEELKNIFLVQLLKSSTFNNYIFQKNTGNT 121 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + N P + EQ I ++D I R + K A + Sbjct: 122 QKFLSLDTLRNFNFLCPKIEEQTAIGNF----FKQLDETIALHRRNCIKFQNLKTAYLEN 177 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 I + + E L + E RK I ++ Sbjct: 178 IFSTKYIQIQNENKNAWEQRKLGEVGYCQSGIGFPEREQGRKK------GIPFYKVSDMT 231 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 M QI+ V I + + + + ++ ++++ Sbjct: 232 LIGNELIMVTSNNYVSEEQILKNRWKVINSIPAIIFAKVGAALLLDRKRLVLNSFLIDNN 291 Query: 326 HGI---DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVP-PIKEQFDITN 380 + + + ++ + G S +DV+ L V++P +EQ I N Sbjct: 292 TMAYILNEQWDYYFCKTLFDTIYLPQLSQVGALPSFNGKDVENLNVIIPKSKEEQTTIGN 351 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSS 409 ++D + ++ + ++ +++ Sbjct: 352 F----FKQLDETIALHQKELAKYQQIKAA 376 >gi|24215897|ref|NP_713378.1| type I restriction enzyme EcoprrI specificity protein [Leptospira interrogans serovar Lai str. 56601] gi|24197105|gb|AAN50396.1| type I restriction enzyme EcoprrI specificity protein [Leptospira interrogans serovar Lai str. 56601] Length = 411 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 56/408 (13%), Positives = 130/408 (31%), Gaps = 42/408 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTG-------KYLPKDGNSRQSDTST 77 + + + G T + IG + + + + + ++ S Sbjct: 16 EWKTLGEVAEYVRGLTYSKTDESPDNIGYKVIRANNITLPGNLLNFNDIKFINLDTNVSD 75 Query: 78 VSIFAKGQILYG----KLGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELL-QGWLLSI 131 K IL + A I D D V++ KD + L S Sbjct: 76 SKKLYKNDILISAASGSRDHVGKVAFIYSDLDYYFGGFMGVIRCKDEINSRYLFHILASD 135 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + ++ + +T+++ + + +PIPPLA Q+ I + A T L TE Sbjct: 136 IFQKYLDEMLNSSTINNLNSAVMSGFQLPIPPLAVQIEIVRILDAFTELTTELTTELTTE 195 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNT 249 + ++ + + + ++ +EW +G + A + K Sbjct: 196 LTTELTARKKQYN----YYRDQLLSFEEGEVEWKTLGETLVRTKGTNITAGQMKELNK-- 249 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + E + I+ + + + + Sbjct: 250 ---------YGAPLKVFAGGRTVAFVNFEDIPAKDVNREPSIIVKSRGVIEFEYYDKPFS 300 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVL 368 + K GI+ Y+ + ++ + F ++GS ++ + D + + Sbjct: 301 HKNEMWSYHS----KNEGINIKYVYYFLKMNEP--YFRSIGSKMQMPQIATPDTDKFQIP 354 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +PP+ EQ I +++ A + E + + I L ++ R ++ Sbjct: 355 IPPLAEQERIVAILDKFDALTSSISEGLPREIRLRQKQYEYYRELLLS 402 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 50/167 (29%), Gaps = 3/167 (1%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 +EW L V+ T+ + N +++ + + Sbjct: 15 VEWKTLGEVAEYVRGLTYSKTDESPDNIGYKVIRANNITLPGNLLNFNDIKFINLDTNVS 74 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM--AVKPHGIDSTYLAWLMRS 339 + + +I+ D + + +M I+S YL ++ S Sbjct: 75 DSKKLYKNDILISAASGSRDHVGKVAFIYSDLDYYFGGFMGVIRCKDEINSRYLFHILAS 134 Query: 340 YDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 K + S +L + + +PP+ Q +I +++ Sbjct: 135 DIFQKYLDEMLNSSTINNLNSAVMSGFQLPIPPLAVQIEIVRILDAF 181 >gi|302190881|ref|ZP_07267135.1| type I restriction-modification system specificity protein [Lactobacillus iners AB-1] Length = 389 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 43/402 (10%), Positives = 118/402 (29%), Gaps = 30/402 (7%) Query: 29 PIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKD--GNSRQSDTSTVSI 80 + + G T ++ +I ++ ++D + + D S+ + Sbjct: 4 KLSEIMDIIGGGTPKTSNPEYWNGNIPWLSVKDFNNDYRYVYETEKAITQAGLDNSSTKM 63 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + + G A+I F + L+ K L + + L ++ Sbjct: 64 LKRNDSIISARGTVGEMAMIP-FPMAFNQSCYGLRAKKGLVDAEYLYYLIKHNVVVLKKN 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ +I + +P L EQ ++ + +I+ + + + Sbjct: 123 THGSVFDTITHDTFDDIEVELPSLKEQKVVASILRNLDDKIEVNNEINKNLEQQARSLFK 182 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A +P S +W ++ T N L I + Sbjct: 183 AWFVDF-----DPFANTMLS--DWKKGKLKDILKLKRQSIKTGENTTLPYLPIDVIPMRT 235 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + E+ + D +I+ + + + L + R + Sbjct: 236 -------FALTDFKPNAEAQSSLITFDKDDIIIGAMRVYFHRVVLAPCDGITRTTCFT-- 286 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDIT 379 +A + S L + + ++ + + +++P + Sbjct: 287 LAPYNNEYLSFALLCCDQESSIDYAQSTSKGSTMPYAIWEGGLGDMEIIIPTPEIAKKFN 346 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ +I + + L+E R++ + ++ ++D+ Sbjct: 347 EIVLPMLRQIQNSYFENNR----LREIRNALLPRLMSDEVDV 384 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 7/192 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 WK +K KL ++ ++G++ + Y+ ++ + T + D S++ Sbjct: 197 SDWKKGKLKDILKLKR-QSIKTGENTTLPYLPIDVIPMRT--FALTDFKPNAEAQSSLIT 253 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F K I+ G + Y + ++A DGI T L P E L LL D I+ Sbjct: 254 FDKDDIIIGAMRVYFHRVVLAPCDGITRTTCFTLAP--YNNEYLSFALLCCDQESSIDYA 311 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + S + + + I +K + + I L+E + Sbjct: 312 QSTSKGSTMPYAIWEGGLGDMEIIIPTPEIAKKFNEIVLPMLRQIQNSYFENNRLREIRN 371 Query: 201 ALVSYIVTKGLN 212 AL+ +++ ++ Sbjct: 372 ALLPRLMSDEVD 383 >gi|283796927|ref|ZP_06346080.1| type I restriction-modification enzyme, S subunit, EcoA family [Clostridium sp. M62/1] gi|291075337|gb|EFE12701.1| type I restriction-modification enzyme, S subunit, EcoA family [Clostridium sp. M62/1] Length = 374 Score = 92.2 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 55/396 (13%), Positives = 115/396 (29%), Gaps = 34/396 (8%) Query: 30 IKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + NT + ++ ++I + +Y K+ + +T+ I + +Y Sbjct: 2 LSSVFAKNTQKNTDGRITNVICNSAKQGLIPQREYFDKNI-ANSDNTNGYYIIEENDFVY 60 Query: 89 GKL----GPYLRKAII-ADFDGICSTQFLVLQPKDVLPELLQGWLL----SIDVTQRIEA 139 PY + GI S +L + K + W Sbjct: 61 NPRKSADAPYGPISSYKYTEAGIVSPLYLCFRAKKEINPAFFEWYFRSSAWHRYVYMSGD 120 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 +P+ IP EQ I + RI+ + + Sbjct: 121 SGARHDRVSIKDDTFFAMPINIPSAHEQAQIAIFLERIEQRIEMQRALVDSLKKYKRGVV 180 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 A+ S+ ++K S G W V L+ + L ES + Sbjct: 181 AAIFSH----------QLKFSDAT--GNPYPEWTSCTLQDAVDFLDGQRKPL-ESADRAK 227 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 G + + + ++ GE ++ + + + A Sbjct: 228 RQGQYPYYGASGIIDYIDDFIFDEPLLLLGEDGANILNRSTPLCFIA---EGKYWVNNHA 284 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ++ G + +L L+ S D + + L + +R+ + +P +EQ I Sbjct: 285 HVMRPKAGQNIKFLCELLESLDYTRY---NTGTAQPKLNQDKCRRIGLALPVYEEQCHIA 341 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + ++ R D K + + L R + Sbjct: 342 DFLSAFDQRTD----KAQSILDYLLSNRDGLLQQLF 373 >gi|21228841|ref|NP_634763.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] gi|20907364|gb|AAM32435.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] Length = 406 Score = 92.2 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 58/414 (14%), Positives = 140/414 (33%), Gaps = 43/414 (10%) Query: 28 VPIKRFTKLNTG------------RTSESGKDIIYIGLEDVESGTGKYLPK-DGNSRQSD 74 + G ++ + I + +++ + + Sbjct: 6 RTLGDICDEVKGIVQTGPFGSQLHKSDYKDEGIPVVMPKNIIEDKISIEEIARIGKKDVE 65 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDV---LPELLQGWLL 129 + KG I+YG+ G R+A+I +C T + + K+ P L +L Sbjct: 66 RLSQHKLQKGDIVYGRRGDIGRRALIKGEQAGWLCGTGCIKISLKNASILEPSFLYYYLG 125 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 ++ I GATM + + I +IP+ P L Q I + + I+ Sbjct: 126 QPEIVSWIYNQAIGATMPNLNTSIIRSIPITYPSLTTQKKIAYILSSYDDLIENNTRRIE 185 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + + + K P + +G +P W+V + + + + Sbjct: 186 ILE----QMAKLVYEEWFVKFRFPGHENVKMVPSDLGEIPKRWKV-REVSEILKRFKAGK 240 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 K + N+L +I + E +G + + + ++F + + Sbjct: 241 KYTQDNVLEEGLIPVIDQSEKEILGFHNDIADHSASLKNPIMIFGDH-------TCKIKI 293 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 ++E + + + ++ +L+++ K + + +++ V++ Sbjct: 294 LIEPFSVGPNVIPFRSEDYPEIFVFFLIKNLVQTKEYKRH---------WNELQAKRVVL 344 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 P + D NV+N + + +E L++ R + ++G+ID+ Sbjct: 345 PDVPLAMDFVNVVNPLFKQ----ITLLEHKNQNLRKTRDLLLPKLISGEIDVSD 394 >gi|309797884|ref|ZP_07692265.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 145-7] gi|308118492|gb|EFO55754.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 145-7] Length = 415 Score = 92.2 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 64/407 (15%), Positives = 131/407 (32%), Gaps = 53/407 (13%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-----YIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + V + + G++ K I ++ + G + K + + + Sbjct: 17 EWVALNKLATFLKGKSLPKEKITPDGNRYCIHYGELFTHYGPIIDKVCSKTNQAINESIL 76 Query: 81 FAKGQILYGKLGPYLR----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 K +L R + I + I L+++ V I+ ++ Sbjct: 77 SEKNDVLMPTSDVTPRGLATASCIQESGVILGGDILIIRCSGVD--GRYLSNFIINNKKK 134 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERI 189 I + +G+T+ H K IG + +PIP LA Q I + T L E Sbjct: 135 ILQMVKGSTVYHLYAKDIGKLLIPIPCPNNPEKSLAIQSEIVRILDKFTALTAELTAELS 194 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRK 247 + + L+S K+ +EW +G + + K K Sbjct: 195 MRKKQYNYYRDQLLS------------FKEGEVEWKALGEIGEVRMCKRIL--------K 234 Query: 248 NTKLIESNILSLSYGNIIQKLETRNM-GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + E I G ++ ++ L E E Y GE++ Sbjct: 235 SQTSSEGEIPFYKIGTFGKEPDSYISRKLFNEFKEKYSYPKVGEVLISASGTIGRTVIF- 293 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + + + +L Y + K + G G + L +++++L Sbjct: 294 ---DGRESYFQDSNIVWIENNEKIVLNKYLFYFYKIAKWGISEG-GTIKRLYNDNLRKLM 349 Query: 367 VLVP-------PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + VP + EQ I +++ A + + E + + I L +++ Sbjct: 350 IPVPFPDSPERSLVEQQKIVKLLDKFDALTNSITEGLPREIELRQKQ 396 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 54/172 (31%), Gaps = 8/172 (4%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 I + + ++ + + +++ D+ + S Sbjct: 39 TPDGNRYCIHYGELFTHYGPIIDKVCSKTNQAINESILSEKNDVLMPTSDVTPRGLATAS 98 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 I+ + ++ G+D YL+ + + K+ + L +D+ +L + Sbjct: 99 CIQESGVILGGDILIIRCSGVDGRYLSNFIINNK-KKILQMVKGSTVYHLYAKDIGKLLI 157 Query: 368 LVP-------PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P + Q +I +++ TA L ++ R ++ Sbjct: 158 PIPCPNNPEKSLAIQSEIVRILDKFTALTAELTAELSMRKKQYNYYRDQLLS 209 >gi|163801598|ref|ZP_02195496.1| type I restriction-modification system, endonuclease S subunit [Vibrio sp. AND4] gi|159174515|gb|EDP59317.1| type I restriction-modification system, endonuclease S subunit [Vibrio sp. AND4] Length = 382 Score = 92.2 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 71/401 (17%), Positives = 137/401 (34%), Gaps = 39/401 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDI-IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + W++V K + R + D+ IY+GLE ++ + + K + Sbjct: 8 ESWQMVKFGDIAKQISKRVEPNETDLKIYVGLEHLDPDS--LIIKRHGVPSDVKGQKLLV 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRIEA 139 KGQI++GK Y RK +AD D ICS +V P V+PE L ++ S R A Sbjct: 66 NKGQIIFGKRRAYQRKIAVADCDCICSAHAMVLEANPDKVIPEFLPFFMQSDVFMNRAVA 125 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I EG+ WK + + IP + +Q KII + + + E Sbjct: 126 ISEGSLSPTIKWKVLASQNFKIPSVVQQ----RKIIEAGFLLQRIQEQITDLNESAINLS 181 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 +++ + + ++ + K+ + + + Sbjct: 182 NSIIQKSL-----------------NRDKVEVKKLNQLVDMQVGYAFKSKDFSDKGVALM 224 Query: 260 SYGN------IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 N + + Y Y + D I+ + + Sbjct: 225 RGANVGVSKPDWANGKKFLSNEMAKDYSEYLLNDKDIIIAMDRPFTGAGFKVSRLSKSDL 284 Query: 314 GI--ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP 370 + GI YL L+ S + ++ G+ L +++ V V Sbjct: 285 PCLLVQRVGRFHSYKGITQEYLWLLLNSKFVKGYLFSQQKGMDIPHLSRKEILECEVPVL 344 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 EQ +++N I ++ D L E+ I +++ + + + Sbjct: 345 SEDEQNELSNTIGCLLSKCDAL---SEKRI-YVRQIKKTLL 381 >gi|160939418|ref|ZP_02086768.1| hypothetical protein CLOBOL_04311 [Clostridium bolteae ATCC BAA-613] gi|158437628|gb|EDP15390.1| hypothetical protein CLOBOL_04311 [Clostridium bolteae ATCC BAA-613] Length = 366 Score = 92.2 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 54/382 (14%), Positives = 119/382 (31%), Gaps = 31/382 (8%) Query: 46 KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKL----GPYLRKAII- 100 ++I + +Y KD + +T+ I +Y PY + Sbjct: 3 SNVICNSAKQGLIPQREYFDKDI-ANSDNTNGYYIIESNDFVYNPRKSADAPYGPISSYQ 61 Query: 101 ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG----ATMSHADWKGIGN 156 GI S +L + K + L W R + Sbjct: 62 YPEAGIVSPLYLCFRAKREINPLYFEWYFRSSTWHRYIYMSGDSGARHDRVSIKDDVFFA 121 Query: 157 IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 +P+ +P EQ I + A RI+T T + + ++L+S GL +V+ Sbjct: 122 MPINVPSAKEQERISLFLDAIERRIETQRTLVETLKKYKRGVVRSLLS-PEHCGL-KEVQ 179 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 + I +G + + E+ + YG + + Sbjct: 180 WQCDTIGNLGFFIKGAPLSK------------ADISETGTPFILYGELYTTYHEVITSVV 227 Query: 277 PESY---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 ++ E G+++ +++ S S ++ I+ + ID + Sbjct: 228 RKTEAVVEQVHHSMVGDVLIPTSGETSEEISTASCVMLPGVILAGDLNIFRSTKIDGRIM 287 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++++ + ++ ++ ++ + P + Q I ++ I + Sbjct: 288 SYILNHIVNGNIARVAQGKSVVHVQASEISKIKISYPDPETQIRIIKILEA----ISNRI 343 Query: 394 EKIEQSIVLLKERRSSFIAAAV 415 E E + L + RSS + Sbjct: 344 ESCENELNHLTKMRSSLLQQLF 365 >gi|71275993|ref|ZP_00652275.1| Restriction modification system DNA specificity domain [Xylella fastidiosa Dixon] gi|71899061|ref|ZP_00681226.1| Restriction modification system DNA specificity domain [Xylella fastidiosa Ann-1] gi|170731328|ref|YP_001776761.1| hypothetical protein Xfasm12_2285 [Xylella fastidiosa M12] gi|71163226|gb|EAO12946.1| Restriction modification system DNA specificity domain [Xylella fastidiosa Dixon] gi|71731174|gb|EAO33240.1| Restriction modification system DNA specificity domain [Xylella fastidiosa Ann-1] gi|167966121|gb|ACA13131.1| hypothetical protein Xfasm12_2285 [Xylella fastidiosa M12] Length = 425 Score = 92.2 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 62/435 (14%), Positives = 135/435 (31%), Gaps = 54/435 (12%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + N R E G +I + D+ S F G L+ Sbjct: 2 KLSDLIDFNPKRPLEKGVMNPFIEMADLPEVERDVSGIGSRIFNGGGSK---FKNGDTLF 58 Query: 89 GKLGPYLRKAIIADFDGI-------CSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAI 140 ++ P L A G+ ST+F+V+ KD E ++ + + Sbjct: 59 SRITPCLENGKTAKVGGLPNNAVGHGSTEFIVMAAKDSSDEDFVYYVARHPEFRAYAQGR 118 Query: 141 CEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG + W+ I + +P E+ I + + I + + Sbjct: 119 MEGTSGRQRVSWQAIADYEIPDFSSLERNRIGSVLSSIDNLIANNRRVNQVLEAMARALF 178 Query: 200 QALVSYI--VTKGLNPDVKMKDSG----------------IEWVGLVPDHWEVKPFFALV 241 +A V L + +S +G +P+ W+++ ++ Sbjct: 179 KAWCVDFEPVRAKLEGRWQRGESLPGLPAHLYDLFPDRLIESELGEIPEGWQMRSLDSIA 238 Query: 242 TELNR----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 LN K E+ L + ++ T + + IV G+++F + Sbjct: 239 NYLNGLALQKFPPESENEFLPVIKIAQLRTGNTSGADKASKQIKPEYIVVDGDVLFSWSG 298 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG---LR 354 RG + V + + + + F A+ +G Sbjct: 299 SLE-----VEVWNGGRGALNQHLFKVTSEEV--PKWFYFFATRHHLQNFRAIATGKATTM 351 Query: 355 QSLKFEDV--KRLPVLVPPIKEQFDITNVINVETARI-DVLVEKIEQSIVLLKERRSSFI 411 ++ + + R+ V +P E + A + + ++ +QS L + R + + Sbjct: 352 GHIQRKHLTDARIAVALPESME------KFDAVIAPLFNQMISNAQQSRS-LAQLRDTLL 404 Query: 412 AAAVTGQIDLRGESQ 426 ++G++ + + Sbjct: 405 PKLISGELRVPDAER 419 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 54/193 (27%), Gaps = 11/193 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTG----RTSESGKD--IIYIGLEDVESGTGKYLPKDGNSR 71 +G IP+ W++ + G + ++ + I + + +G + Sbjct: 222 LGEIPEGWQMRSLDSIANYLNGLALQKFPPESENEFLPVIKIAQLRTGN----TSGADKA 277 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 I G +L+ G + G + + ++V Sbjct: 278 SKQIKPEYIVVDGDVLFSWSGSLEVEV-WNGGRGALNQHLFKVTSEEVPKWFYFFATRHH 336 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 R A + TM H K + + + + I ++ + + Sbjct: 337 LQNFRAIATGKATTMGHIQRKHLTDARIAVALPESMEKFDAVIAPLFNQMISNAQQSRSL 396 Query: 192 IELLKEKKQALVS 204 +L L+S Sbjct: 397 AQLRDTLLPKLIS 409 >gi|322388273|ref|ZP_08061877.1| type I restriction-modification system specificty subunit [Streptococcus infantis ATCC 700779] gi|321140945|gb|EFX36446.1| type I restriction-modification system specificty subunit [Streptococcus infantis ATCC 700779] Length = 414 Score = 92.2 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 46/380 (12%), Positives = 127/380 (33%), Gaps = 31/380 (8%) Query: 50 YIGLEDVESGTGKYLPK-----DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA--- 101 Y+ + D++ + +L D N + + + +L+ + G + K + Sbjct: 47 YLRITDIDDSSRLFLTDKLSSPDVNFTEEEYENYKL-RINDLLFARTGASVGKTYLYRES 105 Query: 102 -DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMP 160 + L+ Q IE + + + K G+ + Sbjct: 106 DGEVYYAGFLIRARLHDSYDGNFIFQQTLTDKYKQFIEITSQRSGQPGVNGKEYGDWKIG 165 Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 + EQ I + + + + L ++S + K +++ Sbjct: 166 MTSYPEQSAIGSLFRTLDDLLASYKNNLVNYQSLKV----TMLSKMFPKVRQTVPEIRLD 221 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 G E D W+ + + + ++ + + G + +++ + + + Sbjct: 222 GFE------DEWKKAKLKDVAHRVQGNDGRMDLPTLTISASGGWMNQIDRFSANIAGKEQ 275 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLR-SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + Y ++ GE+ + + + K + + E ++ Y + + + + +M S Sbjct: 276 KNYTLLKKGELSYNHGNSKLAKYGVVFELKEYEEALVPKVYHSFRVNQLADAKFIEIMFS 335 Query: 340 YDL--CKVFYAMGSGLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + ++ + SG R ++ F+D + +++P EQ I + +D L+ Sbjct: 336 TKIPDRELGKLVSSGARMDGLLNISFDDFMNIAIIIPTFAEQQAIGIY----FSNLDNLI 391 Query: 394 EKIEQSIVLLKERRSSFIAA 413 + I L+ + + Sbjct: 392 VAHQDKIFQLETLKKKLLQD 411 >gi|327386274|gb|AEA57748.1| Restriction endonuclease S subunit [Lactobacillus casei BD-II] Length = 431 Score = 92.2 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 75/402 (18%), Positives = 131/402 (32%), Gaps = 26/402 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ K + + I + ED+ S G+ S F Sbjct: 44 WEKRKFKDL--VVRVNKTSDDSTIPSVEFEDIISKQGRLNKDVRLKINSKQGIY--FEPQ 99 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +L+GKL PYL+ + F G F VL+ + L+ Q + I G Sbjct: 100 DVLFGKLRPYLQNWLFPSFYGRAVGDFWVLRANSSVLSEYLFVLIQSPRFQIVANISSGT 159 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 M +DW + N PIP +EQ KI +D LI + LK+ K + Sbjct: 160 KMPRSDWNTVSNTSFPIPVQSEQ----RKIWQLFNVLDNLIAATQDKLSFLKKMKMFFLQ 215 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 I + +++ G V H+++ + E K L + Sbjct: 216 QIFPTKNHDVPQIRFDG---FTDVWSHYKLGSLMRIDKEQEVKKELLTDIQKGFYVLAMR 272 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRF----------IDLQNDKRSLRSAQVMERG 314 ++ KP V + + +L R L +A + Sbjct: 273 TFSMDGYIDHSKPYWLNHLDNVSDDKFLLPREFAILDADMDANLPKIGRVLLNASSEKYL 332 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIK 373 + G D ++ LMR + + +G + L ++V + +LVP Sbjct: 333 LAAHVRKIQVKSGNDPIFIYALMRGNSVHERLKLEANGSISKRLLDKNVYKQSILVPNRS 392 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I ++ + +Q I +LK+ + S + Sbjct: 393 EQSRIGR----LFFLLETTITLHQQKIKMLKQVKKSCLQNLF 430 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 68/231 (29%), Gaps = 21/231 (9%) Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +D LI I+ L++ K+AL+ + + W K Sbjct: 1 MLSLLDNLIAATQDKIDALEQAKKALLQRLFDQ-------------SWRFKGYSDPWEKR 47 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 F + K + + Q +++ LK S + +P +++F + Sbjct: 48 KFKDLVVRVNKTSDDSTIPSVEFEDIISKQGRLNKDVRLKINS-KQGIYFEPQDVLFGKL 106 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 S + ++ + S YL L++S V Sbjct: 107 RPYLQNWLFPSFYGR---AVGDFWVLRANSSVLSEYLFVLIQSPRFQIVANISSGTKMPR 163 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + V +P EQ I +D L+ + + LK+ + Sbjct: 164 SDWNTVSNTSFPIPVQSEQRKI----WQLFNVLDNLIAATQDKLSFLKKMK 210 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 4/31 (12%), Positives = 13/31 (41%) Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +D L+ + I L++ + + + Sbjct: 1 MLSLLDNLIAATQDKIDALEQAKKALLQRLF 31 >gi|89093019|ref|ZP_01165970.1| putative type I restriction enzyme, S subunit [Oceanospirillum sp. MED92] gi|89082669|gb|EAR61890.1| putative type I restriction enzyme, S subunit [Oceanospirillum sp. MED92] Length = 394 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 55/398 (13%), Positives = 119/398 (29%), Gaps = 32/398 (8%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 WK + +G+ +I ED+ + G Y GN + TS + Sbjct: 18 EWKKATLASLCSNFRSGK---------FIRSEDI-NKDGAYPVYGGNGLRGYTSEYN--H 65 Query: 83 KGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +G L G+ G ++ + + +Q + L + + + Sbjct: 66 EGSYALIGRQGALCGNMNFSNGKAFFTEHAIAVQANEKNDTLFLYY---KLGSMNLGQYS 122 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + + + EQ I I+ + + L +A Sbjct: 123 GQSAQPGLSVNKLSELETFTAGKVEQTAIGNYFHKLDTLINQHQQKHDKLSNLK----KA 178 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 ++ + K +++ G ++ + + + Sbjct: 179 MLEKMFPKAGETVPEVRFDGFTGNWTTTSLSKIAHVIDPHPSHRAPDAVANGVPFIGIGD 238 Query: 262 GNIIQKLETRNMGLKPE----SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + ++ +N+ + P + V+ G+ + + L S V + Sbjct: 239 VDENGHVDFKNVRIVPYHIYGEHRQRYQVEVGDFAYGRVASIGKIIDLSS-NVDREYTYS 297 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPI-KEQ 375 VKP + S YL M + R+SL +D + L V P +EQ Sbjct: 298 PTMAIVKPVTLYSPYLKGYMNTSVFKGRVDNKTTGSTRKSLGVQDFRELSVCFPEQQEEQ 357 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + ++ +L+ + Q I LK + + + Sbjct: 358 IKIGDY----FLKLGLLINQHNQQITKLKNIKQACLDK 391 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 55/199 (27%), Gaps = 17/199 (8%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + M+ E K A + R + +I + R Sbjct: 1 MAMELKEPEIRFDGFSGEWKKATLASLCSNFRSGKFIRSEDINKDGAYPVYGGNGLRGYT 60 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + +Y ++ + ++ N K A D+ +L Sbjct: 61 SEYNHEGSYALIGRQGALCGNMNFSNGKAFFTE----------HAIAVQANEKNDTLFLY 110 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + S +L + G + L + L EQ I N ++D L+ Sbjct: 111 YKLGSMNLGQY---SGQSAQPGLSVNKLSELETFTAGKVEQTAIGNY----FHKLDTLIN 163 Query: 395 KIEQSIVLLKERRSSFIAA 413 + +Q L + + + Sbjct: 164 QHQQKHDKLSNLKKAMLEK 182 >gi|332983356|ref|YP_004464797.1| restriction modification system DNA specificity domain-containing protein [Mahella australiensis 50-1 BON] gi|332701034|gb|AEE97975.1| restriction modification system DNA specificity domain protein [Mahella australiensis 50-1 BON] Length = 358 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 50/406 (12%), Positives = 123/406 (30%), Gaps = 58/406 (14%) Query: 22 PKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P W+ + + TG +GK ++ VE + D Sbjct: 5 PSDWEKDTVSNVVDITTGCRDTQDNKANGKYPFFVRSPIVERIDVADFDCEAVLTAGDGI 64 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + V+ + + S + + Sbjct: 65 G----------------TGKVYHYVKGKFSAHQRVYVMSNFRNIDGKYFYYFFSKNFFKE 108 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 +E +++ I ++ P ++EQ I + + ID + L Sbjct: 109 VEKYTAKSSVDSVRRAMIADMEFVHPSVSEQREIVKVLSDFDAYIDN--------LSELI 160 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 KK+++ + ++ +++ EW D+ ++ ++ +++ + Sbjct: 161 NKKKSIRDGALVDLISGRTRLEGFDYEW-----DNGKIGDILKILHGKSQRGVESYNGKY 215 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 L G +I K + D ++ + + S I Sbjct: 216 PILGTGGVIGKATEY-------------LCDWECVLIGRKGTIDKPIYMNSP----FWTI 258 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + Y + + ++ + S R SL + ++ +P+ +P +EQ Sbjct: 259 DTLYYSKPVENQCVKFQYYIFCAIPWYDY---TESSGRPSLSRKVIENIPIRIPKYEEQQ 315 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 I +V+ I+ L + ++ I + R + +TG++ L Sbjct: 316 AIASVLTAMDKEIENLEAERDKMI----QIREGAMDDLLTGRVRLT 357 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 41/121 (33%), Gaps = 4/121 (3%) Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + Y+ ID Y + +V S++ + Sbjct: 67 GKVYHYVKGKFSAHQRVYVMSNFRNIDGKYFYYFFSKNFFKEVEKYTAKSSVDSVRRAMI 126 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + + P + EQ +I V++ A ID L E I + K R + ++G+ L Sbjct: 127 ADMEFVHPSVSEQREIVKVLSDFDAYIDNLSELINKK----KSIRDGALVDLISGRTRLE 182 Query: 423 G 423 G Sbjct: 183 G 183 >gi|116871901|ref|YP_848682.1| type I restriction endonuclease S subunit [Listeria welshimeri serovar 6b str. SLCC5334] gi|116740779|emb|CAK19899.1| type I restriction endonuclease S subunit domain protein [Listeria welshimeri serovar 6b str. SLCC5334] Length = 392 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 48/397 (12%), Positives = 126/397 (31%), Gaps = 34/397 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ ++ + G K I + + + + + + +V + Sbjct: 17 WEQRKLEELAAFSKGIGYTKNDLVEKGIPLVLYGRLYTKYETIITEVNTFTKMKDKSV-V 75 Query: 81 FAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSI-DVTQ 135 +++ G + +I I ++QPK L + +S + + Sbjct: 76 SKGNEVVVPSSGETAKDISRASVIGAEGFILGGDLNIIQPKRELNSIFLALTISNGEQQK 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I +G ++ H + + + P EQ I + ++D I R ++ L Sbjct: 136 EIIKRAQGKSVVHLYNTDLKQVKLSYPIFNEQQKIGDF----FKQLDNTIALHQRKLDAL 191 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K K+ L+ + +++ + E+ + + + ++ + Sbjct: 192 KLMKKGLLQQMFANNEEKAPRLRFINFDEEWEQRKLNEIANRYDNLRVPITASARISGTT 251 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + G GE + D N+ ++ V + Sbjct: 252 PYYGANGIQDYVEGFT---------------HDGEFILVAEDGANNVKNYPVQHVNGKIW 296 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + ++ + +LM + + + + G R L + + +L V P +EQ Sbjct: 297 VNNHAHVLQAKE-NKHDNKFLMNAIKIIRFEPFLVGGGRAKLNSDVMMKLIVKFPCYEEQ 355 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I + R++ ++ + I L + +++ Sbjct: 356 KKIGTFL----QRLENVITLHKNKINKLSSLKKTYLQ 388 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 57/181 (31%), Gaps = 8/181 (4%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY--ETYQIVDPGEIVFR 294 A + L+E I + YG + K ET + + + + E+V Sbjct: 25 LAAFSKGIGYTKNDLVEKGIPLVLYGRLYTKYETIITEVNTFTKMKDKSVVSKGNEVVVP 84 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK--VFYAMGSG 352 S S E I+ ++P ++ L S + + Sbjct: 85 SSGETAKDISRASVIGAEGFILGGDLNIIQPKRELNSIFLALTISNGEQQKEIIKRAQGK 144 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 L D+K++ + P EQ I + ++D + ++ + LK + + Sbjct: 145 SVVHLYNTDLKQVKLSYPIFNEQQKIGDF----FKQLDNTIALHQRKLDALKLMKKGLLQ 200 Query: 413 A 413 Sbjct: 201 Q 201 >gi|255324374|ref|ZP_05365492.1| type I site-specific deoxyribonuclease [Corynebacterium tuberculostearicum SK141] gi|255298561|gb|EET77860.1| type I site-specific deoxyribonuclease [Corynebacterium tuberculostearicum SK141] Length = 372 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 48/380 (12%), Positives = 108/380 (28%), Gaps = 38/380 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + V + + G + K G+ V + + L +D Sbjct: 17 EYVKLGDVATVKAGSSVSKQKIAESAGIYPVINSGREPLGFIAEFNSTDP---------- 66 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I G + + ++ + + + Q I +C A Sbjct: 67 IGITTRGAGVGFVSWTEGPHFKGNLNYNVKVNSDIVSDRFLFFTLKEHGQSIRDLCSFAG 126 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + K I + P+PP Q I E++ A I++ + + AL Sbjct: 127 IPALNLKSIKTLAFPLPPREVQDAIVERLDALAALIES------------LDSEIALREK 174 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + +S E + + + + N++ Sbjct: 175 RFEYFREQLLTFDESD---------GVEYVKLGEVAGYSPLRVDSADLNADTFVGVDNLL 225 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + + + + G+++ I K G + +A+ P Sbjct: 226 KDRGGKALSEHGPNTKRSTKYQVGDVLIGNIRPYLRKIW----HATNEGGCSGDVLAIHP 281 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +D+ +L W + + G + +P ++ Q DI + ++ Sbjct: 282 SKVDARFLYWTLFGDEFWHYNNNFSRGGKMPRGDKAAILAYQFPLPSLEVQQDIADKLDT 341 Query: 385 ETARIDVLVEKIEQSIVLLK 404 A ID L K E+ + + Sbjct: 342 MQALIDNL--KKERELRKTQ 359 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 63/184 (34%), Gaps = 12/184 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-----YIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + V + G + ++G++++ G + ++ + Sbjct: 193 EYVKLGEVA----GYSPLRVDSADLNADTFVGVDNLLKDRGGKALSEHGPNTKRSTKYQV 248 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G +L G + PYLRK A +G CS L + P V L L + Sbjct: 249 ---GDVLIGNIRPYLRKIWHATNEGGCSGDVLAIHPSKVDARFLYWTLFGDEFWHYNNNF 305 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G M D I P+P L Q I +K+ ID L ER + ++ Sbjct: 306 SRGGKMPRGDKAAILAYQFPLPSLEVQQDIADKLDTMQALIDNLKKERELRKTQFEYHRE 365 Query: 201 ALVS 204 L++ Sbjct: 366 KLLT 369 >gi|150398839|ref|YP_001322606.1| restriction modification system DNA specificity subunit [Methanococcus vannielii SB] gi|150011542|gb|ABR53994.1| restriction modification system DNA specificity domain [Methanococcus vannielii SB] Length = 392 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 55/397 (13%), Positives = 129/397 (32%), Gaps = 34/397 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ + + K ++ I +G+ + S D Sbjct: 17 PEGVEFKELGEIWK-RAPKSKIGVGKIPLLGVGKI---------ICFTSGSKDYLVNDFL 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G+ ++ G + F +++ + L + E + Sbjct: 67 VDGEYIFVNDGGVADFKYYSGKAYYTDHVFTFGIESELVNVKFVYYFLKDNQFMINEKMF 126 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +G+ + + K + +P+PPL Q I + + T L E +E K++ + Sbjct: 127 QGSGLKNLQKKLFETLKIPLPPLPIQEEIVKILDNFT----ELEAELEAELEARKKQYEY 182 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 ++T G + + +G + + E LS Sbjct: 183 YRDELLTFGDD-------VEFKELGEI-------CLNTNNIKWKENQNTNYEYIDLSSVS 228 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + Q ET+ + QIV+ G+++F + SL +++ + T + Sbjct: 229 RDNNQISETKTINSDNAPSRAQQIVNEGDVIFGTTRPTLKRYSLINSEHHNQICSTGFCV 288 Query: 322 -AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT 379 P + +L +++++ G S+ VK+ + P ++EQ I Sbjct: 289 LRANPKKLLPKFLFFILKTTKFYDYVENNQEGAGYPSISNGKVKKFKIPFPSLQEQNRIV 348 Query: 380 NVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +++ A ++ + + + L K+ R+ + Sbjct: 349 AILDKFDALVNDISIGLPAELELRKKQYEYYRNKLLT 385 >gi|325982847|ref|YP_004295249.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] gi|325532366|gb|ADZ27087.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] Length = 399 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 55/424 (12%), Positives = 139/424 (32%), Gaps = 54/424 (12%) Query: 23 KHWKVVPIKRFT-KLNTG--RTSESGKDIIYIGLEDVESGTGKYLPK------DGNSRQS 73 W+ + K G ++ ++ IG+ + Y ++ Sbjct: 2 SEWRKCKLSEVAVKFAMGPFGSNIKAENFTNIGVPVIRGTNLNYYRYVDGEFVYLTEEKA 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQGWL 128 + S G I+ G + +I + S + + P+ + L + Sbjct: 62 NQLKSSNCFPGDIVVTHRGTLGQVGLIPFGKFDRYVISQSGMKVTVNPEFIDSNFLLYFF 121 Query: 129 LSIDVTQRIEAICEGATMSHADWK--GIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S + + + ++ + +PPL EQ I + + +ID L Sbjct: 122 KSNIGQNELLQHESQVGVPSISNPLTSLKSVSLNLPPLPEQKAIASILSSLDDKIDLLHR 181 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + + L ++++ I+ E + + E Sbjct: 182 QNKTL-------------EAMAETLFRQWFVEEAEIQ--------SENQLILGELIESVS 220 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESY--ETYQIVDPGEIVFRFIDLQNDKRS 304 KL I+ L+ +I + N + +S + + + +I+F I N + + Sbjct: 221 ITHKLQTDTIIFLNTSDIYKGDVLINSQVNVDSLPGQAKKSIQRNDILFSEIRPANGRWA 280 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY----DLCKVFYAMGSGLRQSLKFE 360 E ++++ M ++ G S + + D ++ SG + F+ Sbjct: 281 YIHFDA-EDYVVSTKLMVLRSKGFLSQAFVYFFLTNSQTVDWLQLLAESRSGTFPQITFD 339 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ---SIVLLKERRSSFIAAAVTG 417 ++ L + +P ++++ + ++KI I L+ R + + ++G Sbjct: 340 QLRDLKINIPSK-------SILSNSIEWCESALKKINSNSIQIRTLETLRDTLLPKLMSG 392 Query: 418 QIDL 421 ++ + Sbjct: 393 EVRV 396 >gi|168482748|ref|ZP_02707700.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC1873-00] gi|172043831|gb|EDT51877.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC1873-00] Length = 426 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 67/415 (16%), Positives = 140/415 (33%), Gaps = 64/415 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDS---------------------------------------GIEWVGLVPDHW 232 +S E +P+ W Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDSSYYEEVPCEIPESW 251 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQI 285 E + + + R + + + + ++ L SY+ ++ Sbjct: 252 EWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERL 311 Query: 286 VDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----YMAVKPHGIDSTYLAWLMRSY 340 + G++++ L R ++ G + + V I+ ++ + S Sbjct: 312 LRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIYNFLSSP 371 Query: 341 DLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + V SG ++ L + +K + +PP+ EQ I + I A ID L+ Sbjct: 372 IVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDALI 426 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 246 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 305 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 306 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIY 365 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 366 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 425 Query: 185 I 185 I Sbjct: 426 I 426 >gi|251772360|gb|EES52928.1| restriction modification system DNA specificity domain [Leptospirillum ferrodiazotrophum] Length = 556 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 44/434 (10%), Positives = 117/434 (26%), Gaps = 36/434 (8%) Query: 23 KHWKVVPIKRFTK-LNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + W + ++ G T ++ ++ + D+ G + Sbjct: 122 EEWIECKLSEVCSSIDYGLTASAIDTPVGPHFLRITDIVGGAIDWKSVPYVKITESMFRK 181 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I+ + G ++ + + ++ + L+ K + L Sbjct: 182 FQLNSKDIVIARTGASTGSSMYINNPPPAVFASYLVRLKIKTEFDSRFIAYYLKSSKFWS 241 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 G + + P+ + I +D I R E L+ Sbjct: 242 FIHGVLGDKSAQPNASARTLTQAPLKAPKN-KNSQRTIAHILGTLDDKIELNRRMNETLE 300 Query: 197 EKKQALVSYIVTKGLNPDVKMKD-----------------SGIEWVGLVPDHWEVKPFFA 239 QA+ +P + +G +P W+V Sbjct: 301 AMAQAIFKSWFVD-FDPVWAKMEGRPMGLPKEIEDLFPDSFEDSELGEIPRGWKVATIGE 359 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQIVDPGEIV 292 +V ES + ++ + + + G++ Sbjct: 360 IVNIAGGSTPSTKESTYWENGRHYWATPKDLSSLSTPVLLGTERKITDAGLAQIGSGKLP 419 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + L + A I ++A+ P S + ++ Sbjct: 420 AGTVLLSSRAPIGYLAISEVPVSINQGFIAMLPREEVSNLFILYWAACAHEEIVSRANGS 479 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + + +++ V+ P + + + + + + E+ ++L RSS + Sbjct: 480 TFLEISKANFRQILVIRPTKS----VMELFESNVRPLYLQIVRNERETMILATLRSSLLP 535 Query: 413 AAVTGQIDLRGESQ 426 ++G+I ++ + Sbjct: 536 KLLSGEIRVKDAEK 549 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 64/207 (30%), Gaps = 15/207 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYI-------GLEDVESGT 60 ++DS +G IP+ WKV I + G T + + + +D+ S + Sbjct: 338 DSFEDSE---LGEIPRGWKVATIGEIVNIAGGSTPSTKESTYWENGRHYWATPKDLSSLS 394 Query: 61 GKY---LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 + G +L P I++ + F+ + P+ Sbjct: 395 TPVLLGTERKITDAGLAQIGSGKLPAGTVLLSSRAPI-GYLAISEVPVSINQGFIAMLPR 453 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + L + + + I + G+T I + P + L + Sbjct: 454 EEVSNLFILYWAACA-HEEIVSRANGSTFLEISKANFRQILVIRPTKSVMELFESNVRPL 512 Query: 178 TVRIDTLITERIRFIELLKEKKQALVS 204 ++I E + L L+S Sbjct: 513 YLQIVRNERETMILATLRSSLLPKLLS 539 >gi|238926417|ref|ZP_04658177.1| possible type I site-specific deoxyribonuclease [Selenomonas flueggei ATCC 43531] gi|238885821|gb|EEQ49459.1| possible type I site-specific deoxyribonuclease [Selenomonas flueggei ATCC 43531] Length = 391 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 47/405 (11%), Positives = 119/405 (29%), Gaps = 52/405 (12%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDT- 75 P + + + G + + I + ++ + G + + Sbjct: 13 PDGVEYKKLGEIATNVFRGAGIKRDELTAMGIPCVRYGEIYTTYGIWFDSCVSHTDETFL 72 Query: 76 STVSIFAKGQILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + F G IL+ G + D + +V+ + P+ L L + Sbjct: 73 TNPKYFGHGDILFAITGESVEEIAKSTAYIGHDKCVAGGDIVVLQHEQNPKYLSYVLSTD 132 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ + + H+ I I +P+PPL Q I + + T L E Sbjct: 133 MAQRQKSKGRVKSKVVHSSVPAIKEIVIPVPPLPIQNEIVKMLDNFTELTAELTAELTLR 192 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + + +L++ D+ +EW L + + ++ + Sbjct: 193 KKQYSFYRDSLLN----------FSRDDAEVEWKTLGETTKSISSGKNKIRVVDGEYPVY 242 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + I+ Y T + + +I+ + Sbjct: 243 GSTGII---------------------GYCTNFVYEHAQILVARVGS----VGYVQIADG 277 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + + I+ Y+ + + + + + +K+L + +PP Sbjct: 278 RYDVSDNTLIVDVLSTINMKYIFYYL---GYMNLSRLAHGAGQPLITAGQLKKLIIPIPP 334 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ Q I ++++ L + + I K+ R + Sbjct: 335 LETQAKIVSILDRFDELCHDLTQGLPAEIAARKKQYEYYREKLLT 379 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 66/194 (34%), Gaps = 9/194 (4%) Query: 228 VPDHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 PD E K + T + R K +L I + YG I + ET+ Sbjct: 12 CPDGVEYKKLGEIATNVFRGAGIKRDELTAMGIPCVRYGEIYTTYGIWFDSCVSHTDETF 71 Query: 284 ----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + G+I+F ++ + +A + + + V H + YL++++ + Sbjct: 72 LTNPKYFGHGDILFAITGESVEEIAKSTAYIGHDKCVAGGDIVVLQHEQNPKYLSYVLST 131 Query: 340 YDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + +K + + VPP+ Q +I +++ T L ++ Sbjct: 132 DMAQRQKSKGRVKSKVVHSSVPAIKEIVIPVPPLPIQNEIVKMLDNFTELTAELTAELTL 191 Query: 399 SIVLLKERRSSFIA 412 R S + Sbjct: 192 RKKQYSFYRDSLLN 205 >gi|330000675|ref|ZP_08303788.1| hypothetical protein HMPREF9538_01448 [Klebsiella sp. MS 92-3] gi|328537911|gb|EGF64097.1| hypothetical protein HMPREF9538_01448 [Klebsiella sp. MS 92-3] Length = 490 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 56/382 (14%), Positives = 131/382 (34%), Gaps = 37/382 (9%) Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG-KLGPYLRKAIIADFDGICSTQF 111 +++ K SDTS I K +++ G + + GI S + Sbjct: 1 MKNGLVDQSDKFKKRIA--SSDTSKYRIVYKNELVVGFPIDEGVLGFQTKYPVGIVSPAY 58 Query: 112 LVLQPKD---VLPELLQGWLLSIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAE 166 + + KD L+ +L S + + + +G ++ +P PP+ + Sbjct: 59 GIWKLKDESVCHIPYLERYLRSSEARRLYASRMQGVVARRRSLTKSDFLSLEVPFPPIND 118 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG 226 Q I + +++ LI +R + ++ L + +++ V +P K + +G Sbjct: 119 QARIANLL----AKVEGLIEQRKQLLQYLDDLLKSV---FVDMFSDPVKNAKGWELTTIG 171 Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI- 285 + V + + NI LK + + Sbjct: 172 EL-----------AVDVRYGTSVSAQGGKYKYIRMNNITPDGYWDFENLKYIDVDNKDLD 220 Query: 286 ---VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYD 341 + G++VF + + E II + V+ + + W + S Sbjct: 221 KYSLQKGDLVFNRTNSKELVGKTAVYDRDETVIIAGYLIRVRFDQQTNPWFVWGHLNSKF 280 Query: 342 LCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + + ++ ++++ +P+L PP++ Q ++ A + + +QS Sbjct: 281 GKAKLFNLCRNIIGMANINAQELRAIPILKPPLELQNKFATIVEKAHA----IKFRYQQS 336 Query: 400 IVLLKERRSSFIAAAVTGQIDL 421 + L+ A G+++L Sbjct: 337 LADLETLYDVVSQKAFKGELEL 358 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/218 (11%), Positives = 55/218 (25%), Gaps = 11/218 (5%) Query: 23 KHWKVVPIKRFT-KLNTGRTSE-SGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVS 79 K W++ I + G + G YI + ++ G + + Sbjct: 163 KGWELTTIGELAVDVRYGTSVSAQGGKYKYIRMNNITPDGYWDFENLKYIDVDNKDLDKY 222 Query: 80 IFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-- 133 KG +++ + D I + + ++ L+ Sbjct: 223 SLQKGDLVFNRTNSKELVGKTAVYDRDETVIIAGYLIRVRFDQQTNPWFVWGHLNSKFGK 282 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + M++ + + + IP+ PPL Q + Sbjct: 283 AKLFNLCRNIIGMANINAQELRAIPILKPPLELQNKFATIVEKAHAIKFRYQQSLADLET 342 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 L Q + P + G P+H Sbjct: 343 LYDVVSQKAFKGELELSRVPIPTQIFFPVS--GEEPEH 378 >gi|299142937|ref|ZP_07036063.1| type I restriction enzyme specificity protein [Prevotella oris C735] gi|298575553|gb|EFI47433.1| type I restriction enzyme specificity protein [Prevotella oris C735] Length = 402 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 64/402 (15%), Positives = 128/402 (31%), Gaps = 45/402 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDG--NSRQSDTSTV 78 W+ + F G + GK I +I + D+ + T + Sbjct: 25 EWEEHGLSEFLDFKNGLNPKPEKFGKGIKFISVMDILNNTIITYDSIKACVDANNKEIDN 84 Query: 79 SIFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G +L+ + L + + I + + K L LL Sbjct: 85 YSVKMGDLLFQRSSETLEDVGRANVYMDEKPAIFGGFVIRGKKKGEYNPLFFKNLLETPF 144 Query: 134 -TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++I + GA + +G+ + + P+ EQ I + + RI T Sbjct: 145 SRRKIIPMGAGAQHFNIGQEGLSKVKLYFAPINEQNKIAKILSLLDDRISTQNKIIEDLK 204 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 +L S I + + + +++N Sbjct: 205 KL------------------------KSAIIEIEYSSKTKTSSHIGDFIVQTSKRNKDNA 240 Query: 253 ESNILSL--SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 +LS+ G I Q + N + + Y+IV+ + F + + S+ Sbjct: 241 IRTVLSVSNRQGFIQQSEQFENRCVASDDTSNYKIVERNDFAFNP--ARINVGSIARLIT 298 Query: 311 MERGIITSAYMAVKPHGI-DSTYLAWLMRSY-DLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 E+GI++ Y+ + YL + S ++ + +RQ L +E + +P Sbjct: 299 FEKGIVSPMYICFRTKDYATPEYLDYFFESKLFFTEIQKRLEGSVRQCLSYESLCNIPFP 358 Query: 369 VPPIKEQFDITNVINVETARI----DVLVEKIEQSIVLLKER 406 + I+ Q I + +I D L +Q LL++ Sbjct: 359 LLAIEVQQRIGKQLFTLAQKIKLETDFLEILHKQKQHLLRQM 400 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 68/197 (34%), Gaps = 14/197 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-----P 277 E+ G +H + + I +S +I+ +K Sbjct: 21 EFEGEWEEHGLSEFLD--FKNGLNPKPEKFGKGIKFISVMDILNNTIITYDSIKACVDAN 78 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAW 335 V G+++F+ + + + E+ I ++ + + Sbjct: 79 NKEIDNYSVKMGDLLFQRSSETLEDVGRANVYMDEKPAIFGGFVIRGKKKGEYNPLFFKN 138 Query: 336 LMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L+ + + MG+G + ++ E + ++ + PI EQ I ++ + +D + Sbjct: 139 LLETPFSRRKIIPMGAGAQHFNIGQEGLSKVKLYFAPINEQNKIAKIL----SLLDDRIS 194 Query: 395 KIEQSIVLLKERRSSFI 411 + I LK+ +S+ I Sbjct: 195 TQNKIIEDLKKLKSAII 211 >gi|325981136|ref|YP_004293538.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] gi|325530655|gb|ADZ25376.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] Length = 428 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 44/419 (10%), Positives = 100/419 (23%), Gaps = 38/419 (9%) Query: 29 PIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAK 83 + G + I + + ++ G + Sbjct: 5 RLDSVCDFINGGAWSDTEYAHSGIHVVKVTNLSDGRVTRGDDNYLPFSKYEEYKQHELIS 64 Query: 84 GQILYGKLGP-------YLRKAIIADFDGICST------QFLVLQPKDVLPELLQGWLLS 130 G I+ +G + + + + S V +P V L + Sbjct: 65 GDIVVSTVGSHPTQPGSVVGRVALVSVEFSGSFLNQNAACIRVNKPNLVSQRYLFYLANT 124 Query: 131 IDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + IE+ G+ + + P + EQ I + A I+ Sbjct: 125 VIFKHHIESRARGSANQVRMAIGELKKFEVQYPSITEQKKIAAILSAYDEMIENNQRRIA 184 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 ++ +E + + G K+K VP+ W++ Sbjct: 185 LLEKMTEEIYREWFVRLRFPGHEKVKKVKG--------VPEGWKLVKLEHAFKFTGGGTP 236 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG-------EIVFRFIDLQNDK 302 + N Q + G + L + Sbjct: 237 TKEVNRYWDGGDVNWFTPSNITGANGIFLEQSGEQCTEEGLNNSSAKIFPAYSVMLTSRA 296 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + ++ P+ + G L Sbjct: 297 TIGAVGINLTPACTNQGFITCIPNAQYPLPYLYHWIKLAKPHFELLSGGATFAELTKGTF 356 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 KR+ +L PP + + + + +E ++ L E R + ++G++ + Sbjct: 357 KRIEILTPPESIITEFVRI----ESPLFKAIENHLRANSKLIETRDKLLPRLISGKLSV 411 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 60/194 (30%), Gaps = 12/194 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDG---NS 70 +P+ WK+V ++ K G T G D+ + ++ G +L + G Sbjct: 215 VPEGWKLVKLEHAFKFTGGGTPTKEVNRYWDGGDVNWFTPSNITGANGIFLEQSGEQCTE 274 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + S+ IF ++ + I + F+ P P L + Sbjct: 275 EGLNNSSAKIFPAYSVMLTS-RATIGAVGINLTPACTNQGFITCIPNAQYP-LPYLYHWI 332 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 E + GAT + I + PP + I+ + + Sbjct: 333 KLAKPHFELLSGGATFAELTKGTFKRIEILTPPESIITEFVRIESPLFKAIENHLRANSK 392 Query: 191 FIELLKEKKQALVS 204 IE + L+S Sbjct: 393 LIETRDKLLPRLIS 406 >gi|325695186|gb|EGD37087.1| type I restriction-modification system specificty subunit [Streptococcus sanguinis SK150] Length = 402 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 56/414 (13%), Positives = 130/414 (31%), Gaps = 42/414 (10%) Query: 21 IPK--------HWKVVPIKRFTK-LNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDG 68 IP+ +W +K + + G + + Y+ + D++ + K++ + Sbjct: 7 IPEIRFQNYSDNWGGKTLKDLSDSIEYGLNASATYFDGVHKYVRITDIDDNSRKFISEKV 66 Query: 69 NSRQSDTS---TVSIFAKGQILYGKLGPYLRKAIIADF----DGICSTQFLVLQPKDVLP 121 S + + K +L+ + G + K + + + V Sbjct: 67 TSPDVEFTPELENFKLQKNDLLFARTGASVGKTYLYEEKDGEMYYAGFLIRARIKEAVSA 126 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + L+ + I+ + + + K G + IP + EQ I + Sbjct: 127 DFIFQQTLTEKYKRFIDITSQRSGQPGVNGKEYGEWKLGIPSIQEQSAIGSLFRTLDDLL 186 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 K K +++S + P K I + + Sbjct: 187 ----ATYKENFANYKAFKTSMLSKMF-----PKSGQKVPEI----RLAEFEVEWEEKEFT 233 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE-SYETYQIVDPGEIVFRFIDLQN 300 + R +T S + + Y +I+ N + + PG ++ + Sbjct: 234 KIVKRISTSSDSSQLPKVEYEDIVSGQGRLNKDVSSKFDNRKGIHFKPGYTLYGKLRPYL 293 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + L + G+ + P+G + ++ +L++S KV ++ Sbjct: 294 NNWLLPKFE----GVALGDFWVFNPNGNNPEFIYYLIQSSHYQKVANDTSGTKMPRSDWK 349 Query: 361 DVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 V +P IKEQ I + + +D L+ + I L+ + + Sbjct: 350 SVSTTNFALPSTIKEQVAIGSF----FSNLDTLINSYQDKIYQLEILKKKLLQD 399 >gi|229176527|ref|ZP_04303956.1| Type I restriction-modification system specificity subunit [Bacillus cereus MM3] gi|228606964|gb|EEK64357.1| Type I restriction-modification system specificity subunit [Bacillus cereus MM3] Length = 312 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 38/315 (12%), Positives = 100/315 (31%), Gaps = 11/315 (3%) Query: 104 DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI-EAICEGATMSHADWKGIGNIPMPIP 162 G+ ST ++ +P + + L + + + + EGA Sbjct: 1 MGVLSTLYITFKPTLINSDFLVSYYDTTQWHKEVSMRAAEGARNHGLLNISASEFFDTNL 60 Query: 163 PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI 222 + + + KI ++D I + ++++K+ KQ + + +++ G Sbjct: 61 KVPNKEEEQIKIGNFFKQLDDTIALHQQELDIIKQTKQGFLQKMFPNEGESVPEVRFPGY 120 Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---S 279 T + I +S G + +K + + Sbjct: 121 TGDWEQRKLGNHAEILTGGTPKTQIKEYWEPREIPWMSSGEVNKKRLSSTDNMISTQGFE 180 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL-MR 338 + + V ++ + ++ ++ + A+ P + + Sbjct: 181 NSSARWVKENSVLIALAGQGKTRGTVAINEIP--LTTNQSIAAIVPKDELHFEFIFQNLE 238 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + G G R L + + + ++ P ++EQ I N ++D + ++ Sbjct: 239 KRYEELRLISSGDGTRGGLNKQLISDVEIMSPSVEEQIKIGNF----FKQLDDTIALHQR 294 Query: 399 SIVLLKERRSSFIAA 413 + LKE + +F+ Sbjct: 295 ELDALKETKKAFLQK 309 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 65/193 (33%), Gaps = 13/193 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ + ++ TG T ++ ++I ++ +V +++ + S Sbjct: 123 DWEQRKLGNHAEILTGGTPKTQIKEYWEPREIPWMSSGEVNKKRLSSTDNMISTQGFENS 182 Query: 77 TVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + +L G I + + + PKD L L Sbjct: 183 SARWVKENSVLIALAGQGKTRGTVAINEIPLTTNQSIAAIVPKDELHFEFIFQNLEKRYE 242 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + T + + I ++ + P + EQ+ I ++D I R ++ Sbjct: 243 ELRLISSGDGTRGGLNKQLISDVEIMSPSVEEQIKIGNF----FKQLDDTIALHQRELDA 298 Query: 195 LKEKKQALVSYIV 207 LKE K+A + + Sbjct: 299 LKETKKAFLQKMF 311 >gi|313675494|ref|YP_004053490.1| restriction modification system DNA specificity domain [Marivirga tractuosa DSM 4126] gi|312942192|gb|ADR21382.1| restriction modification system DNA specificity domain [Marivirga tractuosa DSM 4126] Length = 384 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 60/396 (15%), Positives = 127/396 (32%), Gaps = 34/396 (8%) Query: 32 RFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 T T + K + + + D+ + N + KG IL Sbjct: 2 DICTKITDGTHHTPKYTESGVPFFRVTDITASN-NSKKYISNEEHLELIKRCHPEKGDIL 60 Query: 88 YGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 Y K G I+ +F S + K V + L +L + ++ + A Sbjct: 61 YSKNGTIGVGKIVDWDFEFSIFVSLCLIKPNHKIVNTKYLNYFLNTSFALRQALKYSKVA 120 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T+ + I + +P+PPLA Q I + A ++ + Q+L Sbjct: 121 TIKNLHLVEIKKLKVPLPPLAVQERIAAILDAADELRQK----DQALLKKYDDLIQSL-- 174 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + +P K+ ++ +G + K +L + N+ Sbjct: 175 -FLDMFGDPVSNSKNLKVKPLGE---------LCDFYSGKAWKKAELGSYGYKLVRISNL 224 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + L V+ G+++F + + E G++ +K Sbjct: 225 HKP--NFPYWLYEGEMIEKLKVEAGDLLFSWAGV--QASIDVYLYDGETGMLNQHIYNLK 280 Query: 325 PHGIDSTYLAWL-MRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P + L + ++G G+ + LK D+ + VL+P + Sbjct: 281 PKKNSPNKEYLFNLLKLHLRNLRSSLGGGVGQFHLKKSDITSIKVLIPDEATMQ---VFL 337 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + ++ ++ + +I +E S + A G+ Sbjct: 338 DSL-SILNDQKQQAQANIKKSEELFQSLLQKAFKGE 372 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 58/179 (32%), Gaps = 8/179 (4%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE--TYQIVDPGEIV 292 +T+ K ES + +I ++ E E + G+I+ Sbjct: 1 MDICTKITDGTHHTPKYTESGVPFFRVTDITASNNSKKYISNEEHLELIKRCHPEKGDIL 60 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS- 351 + + ++ + +++ YL + + + + Sbjct: 61 YSKNGTIG-VGKIVDWDFEFSIFVSLCLIKPNHKIVNTKYLNYFLNTSFALRQALKYSKV 119 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 ++L ++K+L V +PP+ Q I +++ D L +K + + + S Sbjct: 120 ATIKNLHLVEIKKLKVPLPPLAVQERIAAILDAA----DELRQKDQALLKKYDDLIQSL 174 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 66/202 (32%), Gaps = 16/202 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 KV P+ +G+ + + + + ++ Y + + Sbjct: 190 KVKPLGELCDFYSGKAWKKAELGSYGYKLVRISNLHKPNFPY-----WLYEGEMIEKLKV 244 Query: 82 AKGQILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIE 138 G +L+ G + + G+ + L+PK P + L + + Sbjct: 245 EAGDLLFSWAGVQASIDVYLYDGETGMLNQHIYNLKPKKNSPNKEYLFNLLKLHLRNLRS 304 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ G H I +I + IP A + + + ++ + I+ +E Sbjct: 305 SLGGGVGQFHLKKSDITSIKVLIPDEATMQVFLDSL----SILNDQKQQAQANIKKSEEL 360 Query: 199 KQALVSYIVTKGLNPDVKMKDS 220 Q+L+ L +++ K S Sbjct: 361 FQSLLQKAFKGELVSELESKVS 382 >gi|312863322|ref|ZP_07723560.1| type I restriction modification DNA specificity domain protein [Streptococcus vestibularis F0396] gi|311100858|gb|EFQ59063.1| type I restriction modification DNA specificity domain protein [Streptococcus vestibularis F0396] Length = 409 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 56/401 (13%), Positives = 122/401 (30%), Gaps = 25/401 (6%) Query: 25 WKVVPIKRFTKLNT----GRTSES--GKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTST 77 W+ +K G + K Y+ +V++G Y + N + Sbjct: 19 WECDDLKNIFGTIRNAFVGTATPYYVEKGHFYLESNNVKNGKINYNSQIFINDEFYEKQR 78 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLLSIDVT 134 I+ + G A+I + K+V P L S Sbjct: 79 DKWLKTNDIVMVQSGHVGHTAVIPKELNNTAAHALIVFTDYKKEVNPHFLNYQFQSSSKR 138 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++++ I G T+ H + + M P + EQ I + + + L Sbjct: 139 KKLDLISTGNTIKHILASEMKSFKMDFPTVEEQSAIGSLFRTLDDLLTSYKDNLANYQSL 198 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + P++++ EW + + E+ + Sbjct: 199 KTTMLSKMFPKAGRT--VPEIRLDGFEGEW-----EVVNLGTLIENYDEVISGTSGF--P 249 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 S G +Q + + V G + +R + + + ++ + Sbjct: 250 IATSSRKGLYLQNDYFEGGRTGIDLTLDFHRVPIGYVTYRHMSDDSIFKFNKNNFETDVL 309 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPI 372 + + + D +L + + + L F M G R L ++++ + VP + Sbjct: 310 VSKEYPVFISNDSSDIDFLLYHLNNSRLFLRFSTMQKLGGTRVRLYYKNLITYKIAVPTV 369 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 KEQ I + +D L+ ++ I L+ + + Sbjct: 370 KEQQAIGAY----FSILDNLIATHQEKISQLETLKKKLLQD 406 >gi|146319440|ref|YP_001199152.1| restriction endonuclease S subunit [Streptococcus suis 05ZYH33] gi|146321641|ref|YP_001201352.1| restriction endonuclease S subunit [Streptococcus suis 98HAH33] gi|253752460|ref|YP_003025601.1| type I restriction-modification system S protein [Streptococcus suis SC84] gi|253754286|ref|YP_003027427.1| type I restriction-modification system S protein [Streptococcus suis P1/7] gi|253756220|ref|YP_003029360.1| type I restriction-modification system S protein [Streptococcus suis BM407] gi|145690246|gb|ABP90752.1| Restriction endonuclease S subunit [Streptococcus suis 05ZYH33] gi|145692447|gb|ABP92952.1| Restriction endonuclease S subunit [Streptococcus suis 98HAH33] gi|251816749|emb|CAZ52391.1| type I restriction-modification system S protein [Streptococcus suis SC84] gi|251818684|emb|CAZ56519.1| type I restriction-modification system S protein [Streptococcus suis BM407] gi|251820532|emb|CAR47287.1| type I restriction-modification system S protein [Streptococcus suis P1/7] gi|267026754|gb|ACY78468.1| VirA [Streptococcus suis] gi|292559064|gb|ADE32065.1| Restriction modification system DNA specificity domain protein [Streptococcus suis GZ1] gi|319758864|gb|ADV70806.1| restriction endonuclease S subunit [Streptococcus suis JS14] Length = 401 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 66/398 (16%), Positives = 140/398 (35%), Gaps = 39/398 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDI--IYIGLEDVESGTG------------KYLPKDGNS 70 WK + + S S + + ++++ G K LP S Sbjct: 20 WKQRKLGEVADFSIKTNSLSRDKLSSYFYEVQNIHYGDILTKYDAILDVCNKELPSIIGS 79 Query: 71 RQSDTSTVSIFAKGQILYGK---LGPYLRKAIIADFDG--ICST-QFLVLQPKDVLPELL 124 SD + + ++G I++ + + +F G + S +V +PK Sbjct: 80 TISDFADA-LLSEGDIVFADAAEDSTVGKAIEVRNFKGKNVVSGLHTIVARPKVSYAPYY 138 Query: 125 QGWLLSI-DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 G+L++ +I + +G +S + + + P L EQ I +D Sbjct: 139 LGYLINSTAYHNQILPLMQGTKVSSISKANLKSTTVVFPTLPEQEAIGSF----FSDLDQ 194 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 LIT R ++ +KE K+AL+ + KG D D W+ + + E Sbjct: 195 LITLHQRKLDDVKELKKALLQKMFPKGNGNDFP-----ELRFPEFTDAWKQRKLGEFMKE 249 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 +K + L++ + + ++ S Y I G+ ++ +D N Sbjct: 250 SKILGSKGDIARKLTVRLWG--RGVVSKKEIYSGSSATQYYIRKSGQFIYGKLDFLNQAF 307 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ--SLKFED 361 + ++ + GI+ +L + + + +G R+ + E Sbjct: 308 GIIPPELDGYESTLDSPAFDLLKGINGQFLLEFVSRKEFYYYQGNIANGSRKAKRIHTET 367 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +P+ +P + EQ I + + +D L+ ++ Sbjct: 368 FLGMPISLPTLPEQEAIGSF----FSDLDQLITLHQRK 401 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 67/211 (31%), Gaps = 11/211 (5%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + + K + + + + E+ N + + ++ Sbjct: 13 FPGFTDAWKQRKLGEVADFSIKTNSLSRDKLSSYFYEVQ--NIHYGDILTKYDAILDVCN 70 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVK 324 K +G + ++ G+IVF D K + + + + Sbjct: 71 KELPSIIGSTISDF-ADALLSEGDIVFADAAEDSTVGKAIEVRNFKGKNVVSGLHTIVAR 129 Query: 325 PHGID-STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P YL +L+ S + G S+ ++K V+ P + EQ I + Sbjct: 130 PKVSYAPYYLGYLINSTAYHNQILPLMQGTKVSSISKANLKSTTVVFPTLPEQEAIGSF- 188 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ + +KE + + + Sbjct: 189 ---FSDLDQLITLHQRKLDDVKELKKALLQK 216 >gi|229002234|dbj|BAH57700.1| hypothetical protein [Staphylococcus aureus] gi|238768520|dbj|BAH66832.1| type I restriction-modification system endonuclease [Staphylococcus aureus] Length = 433 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 62/427 (14%), Positives = 137/427 (32%), Gaps = 39/427 (9%) Query: 30 IKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKG 84 K+ G +S K I I ++++ S + + + + + K Sbjct: 8 FGDVAKIKNGYAFKSKEFQEKGIPVIKIKNIISPIVDTKDSQKVSIKTYEKTKGFSLKKN 67 Query: 85 QILYGKLGPYL-------RKAIIADFDGICSTQFLV------LQPKDVLPELLQGWLLSI 131 IL G + K +FD V L L +L Sbjct: 68 DILISLTGSGVNQMSSAVGKVGRIEFDYPALQNQRVGKFELKYSNSADLDFLFYYFLQPK 127 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + A ++ + K I + +P L +Q I + + +I I + Sbjct: 128 ITEYLVRNSTGSANQANINSKLIETVKIPNFSLIKQKSISKFLN----QITRKIETNQKM 183 Query: 192 IELLKEKKQALVSYIVTKGLNPD---VKMKDSGIE----WVGLVPDHWEVKPFFALVTEL 244 I LKE Q L + PD K SG E +G +P +W++ + + Sbjct: 184 IANLKELSQTLFKHWFVDFEFPDEDGNPYKSSGGEMIDSELGKIPSNWKIYKLKDIASHK 243 Query: 245 NRK-NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 N K E + E + I++ ++F ++ + Sbjct: 244 KETFNPKKSEEVTVKHFSLPAYDNEEQAIEEEVNKIKSNKWIINNNCVLFSKMNPDTKRI 303 Query: 304 SLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKF 359 L + + +S ++ ++ P+ ++++ + + A +G RQ +K Sbjct: 304 WLPVIDNKKLNVASSEFVVMESPNNKINSFIYNICLNSQFIDYLKANTTGSTNSRQRVKP 363 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + + E I + ++ + I L + R + + ++G++ Sbjct: 364 TIAVNYKLAI----E-DSIVKKYSEIITPYMEEMKILRSEIGKLTQLRDTLLPKLMSGEL 418 Query: 420 DLRGESQ 426 ++ + + Sbjct: 419 EISDDIE 425 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 54/146 (36%), Gaps = 12/146 (8%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 YK SG + +G IP +WK+ +K + + + ++ Sbjct: 212 YKSSGGEMIDSELGKIPSNWKIYKLKDIASHKKETFNPKKSEE--VTVKHFSLPAYDNEE 269 Query: 66 KDGNSRQSD-TSTVSIFAKGQILYGKLGPYLRKAII----ADFDGICSTQFLVLQ-PKDV 119 + + S I +L+ K+ P ++ + + S++F+V++ P + Sbjct: 270 QAIEEEVNKIKSNKWIINNNCVLFSKMNPDTKRIWLPVIDNKKLNVASSEFVVMESPNNK 329 Query: 120 LPELLQGWLLSIDVTQRIEAICEGAT 145 + + L+ ++A G+T Sbjct: 330 INSFIYNICLNSQFIDYLKANTTGST 355 >gi|15900419|ref|NP_345023.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae TIGR4] gi|148996901|ref|ZP_01824619.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP11-BS70] gi|149005619|ref|ZP_01829358.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP18-BS74] gi|168577282|ref|ZP_02723073.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae MLV-016] gi|169833432|ref|YP_001694005.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae Hungary19A-6] gi|14971978|gb|AAK74663.1| putative type I restriction-modification system, S subunit [Streptococcus pneumoniae TIGR4] gi|147757476|gb|EDK64515.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP11-BS70] gi|147762559|gb|EDK69519.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP18-BS74] gi|168995934|gb|ACA36546.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae Hungary19A-6] gi|183577166|gb|EDT97694.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae MLV-016] Length = 426 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 67/415 (16%), Positives = 140/415 (33%), Gaps = 64/415 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDS---------------------------------------GIEWVGLVPDHW 232 +S E +P+ W Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYEEVPCEIPESW 251 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQI 285 E + + + R + + + + ++ L SY+ ++ Sbjct: 252 EWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERL 311 Query: 286 VDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----YMAVKPHGIDSTYLAWLMRSY 340 + G++++ L R ++ G + + V I+ ++ + S Sbjct: 312 LRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIYNFLSSP 371 Query: 341 DLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + V SG ++ L + +K + +PP+ EQ I + I A ID L+ Sbjct: 372 IVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDALI 426 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 246 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 305 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 306 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIY 365 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 366 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 425 Query: 185 I 185 I Sbjct: 426 I 426 >gi|205372128|ref|ZP_03224944.1| hypothetical protein Bcoam_01225 [Bacillus coahuilensis m4-4] Length = 424 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 54/427 (12%), Positives = 136/427 (31%), Gaps = 33/427 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+V+ I + + S + + + ++ G + + + + Sbjct: 4 NGWEVLAIDDVCTVTDCQHSTAPAVDYETEYRMLRTVNIRDGRLRDIETTKSVTEETYKK 63 Query: 78 VSI---FAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWL---L 129 S+ G ++ + P AI+ D + + L L+ K + + Sbjct: 64 WSVRGYLEDGDVILTREAPMGEVAILKDEEYKFFLGQRMLQLKVKKEIITPEFLYYSLQT 123 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S Q + G+ +S+ + + + +P + Q I + + + + + Sbjct: 124 SSMRHQIMMNEGTGSVVSNIRIPLLKKMQISVPSIKLQKKITLLLESIDSKYNNNNSMIK 183 Query: 190 RFIELLKEKKQALVSYIVTKGLNPD---VKMKDSG----IEWVGLVPDHWEVKPFFALVT 242 E Q L P+ + K SG G +P+ W ++ + Sbjct: 184 GLE----ELSQILFKQWFIDFEFPNEDGMPYKSSGGKMVDSEFGEIPEGWNIEYLSSSTE 239 Query: 243 ELNRKNTKLIESNILS--LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 L+ K ES + + + + ++ + ++ + Sbjct: 240 FLSGGTPKTKESTYWNGDIPFFTPKDVGSSVYTTNTEKTITELGLSKCNSRLYPKNTVFI 299 Query: 301 DKRSLR--SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 R A + + A+K YL +++ L ++ + ++ Sbjct: 300 TARGTVGKVALANRDMAMNQSCFALKSRNECQFYLYGAIKTL-LREIIQGANGAVFNAIN 358 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 D+ RL + +P Q + + + +E + L+ R + + ++G+ Sbjct: 359 LSDLNRLRLAMP----QQGLIDKYEAIAITFFDQMSALEFENINLQILRDTLLPKLLSGE 414 Query: 419 IDLRGES 425 I++ ES Sbjct: 415 IEIPDES 421 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 23/199 (11%), Positives = 69/199 (34%), Gaps = 12/199 (6%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 MK +G E + + + ++ + + +++ + + + Sbjct: 1 MKSNGWEVLAIDDVCTVTDCQHSTAPAVDYETEY---RMLRTVNIRDGRLRDIETTKSVT 57 Query: 277 PESYETYQ---IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 E+Y+ + ++ G+++ + L+ + + VK I +L Sbjct: 58 EETYKKWSVRGYLEDGDVILTREAPMGEVAILKDEEYKFFLGQRMLQLKVKKEIITPEFL 117 Query: 334 AWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + +++ + + +++ +K++ + VP IK Q IT ++ ++ + Sbjct: 118 YYSLQTSSMRHQIMMNEGTGSVVSNIRIPLLKKMQISVPSIKLQKKITLLLESIDSKYNN 177 Query: 392 LVEKIEQSIVLLKERRSSF 410 I L+E Sbjct: 178 ----NNSMIKGLEELSQIL 192 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 65/206 (31%), Gaps = 14/206 (6%) Query: 10 YKDSGVQWI----GAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ES 58 YK SG + + G IP+ W + + T+ +G T ++ + DI + +DV S Sbjct: 210 YKSSGGKMVDSEFGEIPEGWNIEYLSSSTEFLSGGTPKTKESTYWNGDIPFFTPKDVGSS 269 Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 K ++ K + G + A+ + + F + K Sbjct: 270 VYTTNTEKTITELGLSKCNSRLYPKNTVFITARGTVGKVALANRDMAMNQSCFAL---KS 326 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + + + I GA + + + + + +P I Sbjct: 327 RNECQFYLYGAIKTLLREIIQGANGAVFNAINLSDLNRLRLAMPQQGLIDKYEAIAITFF 386 Query: 179 VRIDTLITERIRFIELLKEKKQALVS 204 ++ L E I L L+S Sbjct: 387 DQMSALEFENINLQILRDTLLPKLLS 412 >gi|163784829|ref|ZP_02179613.1| type I restriction-modification system specificity subunit [Hydrogenivirga sp. 128-5-R1-1] gi|159879899|gb|EDP73619.1| type I restriction-modification system specificity subunit [Hydrogenivirga sp. 128-5-R1-1] Length = 80 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 45/77 (58%) Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + GS ++ + E VK L + +PP+ EQ I ++ +T +ID L++K E+ I L+K Sbjct: 4 EKFMTGSAGQKRIPTEFVKNLQIPLPPLHEQQKIAQYLDKKTQQIDQLIQKTEKEIKLIK 63 Query: 405 ERRSSFIAAAVTGQIDL 421 E + I+ AV G+I + Sbjct: 64 EFKEKLISDAVLGKIKV 80 >gi|295114354|emb|CBL32991.1| Restriction endonuclease S subunits [Enterococcus sp. 7L76] gi|315145851|gb|EFT89867.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX2141] Length = 407 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 60/412 (14%), Positives = 129/412 (31%), Gaps = 48/412 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESG----TGKYLPKDGNSRQSDTSTV 78 + W++ + + +G S D + G+ + G TG D Sbjct: 18 EDWELCKLSGVIEKLSGGASIKPTDYLEDGIRTIPKGAVNATGIADLSGSKYISEDFFEK 77 Query: 79 SI---FAKGQILYG---------KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 +I ++ +G +R + + + + + + + L Sbjct: 78 NITSHVHTNNLVTSLRDLVPSAPNMGRIVRIEGDEEQFLMPQGVYKLELFEGMDGDFLIS 137 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + S + I A G+T H NI + +P EQ I ++D IT Sbjct: 138 FSNSDKYRKIISAEKNGSTQVHIRNGEFLNIDINLPSKYEQKKIGAF----FKQLDDTIT 193 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 R ++ LKE K+A + + K +++ + E D W++ + + Sbjct: 194 LHQRKLDQLKELKKAYLQLMFPKKDETVPRVRFADFE------DDWQLCKLGETFSIIMG 247 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRS 304 ++ Y + + +N + P + T + + G+++ + Sbjct: 248 QSPNSENYTENPDDYILVQGNSDMKNNKVVPRIWTTQVTKKAEKGDLILSVRAPVGEIGK 307 Query: 305 LRSAQVMERGII----TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 V+ RG+ DS Y +S+ Sbjct: 308 TDYNVVLGRGVAAVKGNDFIFQQLRKMKDSGYWTRY------------STGSTFESINSN 355 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 D+K + +P EQ I + +D + + + LK + S++ Sbjct: 356 DIKEALINIPNKDEQQKIGD----LFTHLDDAIILNQNKLNQLKSLKKSYLQ 403 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 49/180 (27%), Gaps = 5/180 (2%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W++ + + G++ S + G R T K Sbjct: 232 DWQLCKLGETFSIIMGQSPNSENYTENPDDYILVQGNSDMKNNKVVPRIWTTQVTKKAEK 291 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G ++ P + D++ + ++ D + L + + G Sbjct: 292 GDLILSVRAPV-GEIGKTDYNVVLGRGVAAVKGNDFI----FQQLRKMKDSGYWTRYSTG 346 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T + I + IP EQ I + I + + L K Q + Sbjct: 347 STFESINSNDIKEALINIPNKDEQQKIGDLFTHLDDAIILNQNKLNQLKSLKKSYLQNMF 406 >gi|256810724|ref|YP_003128093.1| restriction modification system DNA specificity domain protein [Methanocaldococcus fervens AG86] gi|256793924|gb|ACV24593.1| restriction modification system DNA specificity domain protein [Methanocaldococcus fervens AG86] Length = 219 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 60/191 (31%), Gaps = 5/191 (2%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 K A T I + +I + + + + + Sbjct: 30 CKKIKAGGTPKTSVKEYYESGTIPFVKIEDITNSNKYLTYTKVKITEKGLNNSNAWIVPK 89 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSG 352 + +A A + + P + + + + + + Sbjct: 90 NSVLFAMYGSIGETAINKIEVATNQAILGIIPKGEVLESEFLYYILAKNKNYYSKLGMQT 149 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +++L + VK + +PPI+EQ I + ID L+E + L++ + + Sbjct: 150 TQKNLNAQIVKTFKIPLPPIEEQKAIAERL----KSIDELIEIKRKEKEQLEKAKKKIMD 205 Query: 413 AAVTGQIDLRG 423 +TG+I ++ Sbjct: 206 LLLTGKIRVKN 216 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 74/194 (38%), Gaps = 11/194 (5%) Query: 21 IPKHWKVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDV--ESGTGKYLPKDGNS 70 +P+ W VV +K K+ G T ++ I ++ +ED+ + Y Sbjct: 17 VPEDWDVVELKDVCKKIKAGGTPKTSVKEYYESGTIPFVKIEDITNSNKYLTYTKVKITE 76 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + S I K +L+ G A I + + L + PK + E + + Sbjct: 77 KGLNNSNAWIVPKNSVLFAMYGSIGETA-INKIEVATNQAILGIIPKGEVLESEFLYYIL 135 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + T + + + + +P+PP+ EQ I E++ + I+ E+ + Sbjct: 136 AKNKNYYSKLGMQTTQKNLNAQIVKTFKIPLPPIEEQKAIAERLKSIDELIEIKRKEKEQ 195 Query: 191 FIELLKEKKQALVS 204 + K+ L++ Sbjct: 196 LEKAKKKIMDLLLT 209 >gi|313605683|gb|EFR83058.1| specificity subunit [Listeria monocytogenes FSL F2-208] Length = 326 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 46/326 (14%), Positives = 99/326 (30%), Gaps = 10/326 (3%) Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G IL+G +G + D L+ + K++L L L S + IE G Sbjct: 2 GDILFGMIGTIGTPVQLIRKDFAIKNVALIKEKKNILNRFLIHLLKSAVFDRYIENENTG 61 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 T I N P L EQ I ++D I R ++ LK K+ L+ Sbjct: 62 GTQKFLSLSKIRNFCFLSPKLEEQDQISLF----FKQLDNAIALHQRKLDALKLMKKGLL 117 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + +++ + + + I + G+ Sbjct: 118 QQMFPNNEEKVPRLRFADFNEKWERCKISSFARNTYGGGTPKTNVPEYWQGRIPWIQSGD 177 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 ++ + K + + I I + + A + + ++++ Sbjct: 178 LLIDSLFNIIPKKHVTGSAVKSSATKCIPANSIAIVTRVGVGKLAFIPFEYTTSQDFLSL 237 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVI 382 +DS + + + L + + + + D+ + P EQ I + Sbjct: 238 SNLRVDSNFGTYSIYIM-LQRELNNIQGSTIKGITKSDLLEKNINKPLNRIEQERIGVSL 296 Query: 383 NVETARIDVLVEKIEQSIVLLKERRS 408 +D ++ + + L + Sbjct: 297 ----KLLDNIITLHQSKLEKLSSLKK 318 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 19/127 (14%), Positives = 47/127 (37%), Gaps = 9/127 (7%) Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 G+I+F I + + I + + + I + +L L++S + Sbjct: 1 MGDILFGMIGT----IGTPVQLIRKDFAIKNVALIKEKKNILNRFLIHLLKSAVFDRYIE 56 Query: 348 A-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 G ++ L ++ L P ++EQ I ++ ++D + ++ + LK Sbjct: 57 NENTGGTQKFLSLSKIRNFCFLSPKLEEQDQI----SLFFKQLDNAIALHQRKLDALKLM 112 Query: 407 RSSFIAA 413 + + Sbjct: 113 KKGLLQQ 119 >gi|313123731|ref|YP_004033990.1| type-i specificity determinant subunit [Lactobacillus delbrueckii subsp. bulgaricus ND02] gi|312280294|gb|ADQ61013.1| Putative type-I specificity determinant subunit [Lactobacillus delbrueckii subsp. bulgaricus ND02] Length = 390 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 56/399 (14%), Positives = 124/399 (31%), Gaps = 38/399 (9%) Query: 24 HWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + + G ++ G I + + + ++ +S S Sbjct: 18 DWEQRKLGDVANFSKGTGYSKSDLKGTGSPIILYGRLYTKYETII-RNVDSFVVPKSGSV 76 Query: 80 IFAKGQILYGKLGPYLRKAII---ADFDGIC--STQFLVLQPKDVLPELLQGWLLSIDVT 134 G+++ G I + GI ++ D+ P L + + Sbjct: 77 FSKGGEVIVPGSGETAEDISIASVVEPAGILLGGDLNIIYPNSDLDPTFLAITISNGKPH 136 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +G ++ H + +I + P L+EQ I + I ++ + L Sbjct: 137 FDMARRAQGKSIVHLHNADLKHISLKTPNLSEQKRISKIFEVLDQTITLHEEKKHQLESL 196 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 Q + + K P V+ + EW +H ++ + T + + Sbjct: 197 KSALLQKMFAN---KNGYPAVRFEGFSNEW-----EHCKLGDVADITTGSRNHQDSVTDG 248 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + +++L ++T I+ PG+ + + Sbjct: 249 KYPFFVRSDKVERLNEY-------DFDTKAILVPGD---------GRIGEIFHYYNGKFA 292 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + Y +GI+ +L L + G SL+ + VP I E Sbjct: 293 LHQRVYKVDNFNGINELFLLGLFKYSFKEHALRLNAQGTVPSLRLPMFTNWSISVPMITE 352 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I +++ + + + LLK+ + S + A Sbjct: 353 QKRIGVF----FQKLEQTISLYDHKLELLKKVKRSMLQA 387 >gi|225856224|ref|YP_002737735.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae P1031] gi|225725320|gb|ACO21172.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae P1031] Length = 426 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 67/415 (16%), Positives = 138/415 (33%), Gaps = 64/415 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDS---------------------------------------GIEWVGLVPDHW 232 +S E +P+ W Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYEEVPCEIPESW 251 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQI 285 E + + + R + + + + ++ L SY+ ++ Sbjct: 252 EWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERL 311 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDSTYLAWLMRSY 340 + G++++ L R + A + V I+ ++ + S Sbjct: 312 LRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIYNFLSSP 371 Query: 341 DLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + V SG ++ L + +K + +PP+ EQ I + I A ID L+ Sbjct: 372 IVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDALI 426 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 246 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 305 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 306 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIY 365 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 366 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 425 Query: 185 I 185 I Sbjct: 426 I 426 >gi|170760858|ref|YP_001787474.1| type IC HsdS subunit [Clostridium botulinum A3 str. Loch Maree] gi|169407847|gb|ACA56258.1| type IC HsdS subunit [Clostridium botulinum A3 str. Loch Maree] Length = 410 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 50/379 (13%), Positives = 128/379 (33%), Gaps = 17/379 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK R + ++ Y + G G + ++ ++ V Sbjct: 15 EWKEKKCSNLFDKIRNRV-DVEENKSYKQIGIRSHGKGIFYKEEVTGKELGNKRVFWVEP 73 Query: 84 GQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + + + R + I S +F + +PK + +L + Sbjct: 74 NVFIVNIVFAWERAVARTTENEIGMIASHRFPMYKPKKEILDLDYITYFFKTNKGKALLE 133 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + + + +V ++KI + + ID I ++ +E LKE K+ Sbjct: 134 LASPGGAGRNKTLGQKEFDNLKIILPKVEEQKKIGSVILLIDKKIEKQQEKVEALKEYKK 193 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 ++ I ++ + + K+ E + + + ++ Sbjct: 194 GIMQKIFSQEI----RFKEDNEEEYPEWEEKKLCSLGETYTGLSGKTKDNFGFGSGKYIT 249 Query: 261 YGNIIQKLETR---NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGI 315 Y N+ + ++ + E E V G+I+F ++ + S + +E Sbjct: 250 YMNVFKNIKINLDMIDFVDIEEDEKQNTVLKGDILFTTSSETPEEVGMASVCDKDIENLY 309 Query: 316 ITSAYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 + S + + + ++ + +RS + + G R +L ++ ++ + VP Sbjct: 310 LNSFCFGFRLNSFEKINYNFITYYLRSPKIRGKISILAQGSTRYNLPKTELMKMMIKVPC 369 Query: 372 IKEQFDITNVINVETARID 390 +EQ I N ++ +++ Sbjct: 370 FEEQQKIANFLSKIDDKLN 388 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 70/200 (35%), Gaps = 10/200 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + + + + R + E+ K + + ++ Sbjct: 13 SGEWKEKKCSNLFDKIRNRVDVEENKSYKQIGIRSHGKGIFYKEEVTGKELGNKRVFWVE 72 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVF 346 VF + +R++ E G+I S + +D Y+ + ++ + Sbjct: 73 PNVFIVNIVFAWERAVARTTENEIGMIASHRFPMYKPKKEILDLDYITYFFKTNKGKALL 132 Query: 347 YAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 G ++L ++ L +++P ++EQ I +VI ID +EK ++ + L Sbjct: 133 ELASPGGAGRNKTLGQKEFDNLKIILPKVEEQKKIGSVI----LLIDKKIEKQQEKVEAL 188 Query: 404 KERRSSFIAAAVTGQIDLRG 423 KE + + + +I + Sbjct: 189 KEYKKGIMQKIFSQEIRFKE 208 >gi|154492482|ref|ZP_02032108.1| hypothetical protein PARMER_02116 [Parabacteroides merdae ATCC 43184] gi|254881867|ref|ZP_05254577.1| restriction modification system DNA specificity subunit [Bacteroides sp. 4_3_47FAA] gi|154087707|gb|EDN86752.1| hypothetical protein PARMER_02116 [Parabacteroides merdae ATCC 43184] gi|254834660|gb|EET14969.1| restriction modification system DNA specificity subunit [Bacteroides sp. 4_3_47FAA] Length = 397 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 52/415 (12%), Positives = 117/415 (28%), Gaps = 44/415 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + WK + + +G + + + +G+ V +L K + Sbjct: 2 EQWKEYKLSDILSIVSGFAYKGEYLGKGESLLLGMGCVSYSEL-FLEKGMRPYAGEFPER 60 Query: 79 SIFAKGQILYGKLG----------PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 G I+ P + + + PK + + Sbjct: 61 YSVEAGDIVLATRQQSDNLPILGMPAIVPQKFKGKKMVFGANLYKVVPKSPEFPIDYIYW 120 Query: 129 --LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + I + G T+ I + P ++ I + + I+ I Sbjct: 121 LLKTPAYIRHIRSCQTGTTVRMITKANIEDYAFMCPCKEQRNQISKLLWD----IEMKIV 176 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 R + L+++ QAL + SG ++ L + Sbjct: 177 LNRRINDNLEQQAQALFDHYFD-----------SGSIYLEDSIMGCLTDIAVYLNGLAMQ 225 Query: 247 KNT-KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K IE ++ L + Q+ +S + I+D +I+F + + Sbjct: 226 KFPATDIERSLPVLKIKELGQRKCDDCSDRCSDSIDADYIIDNEDIIFSWSGTL-----M 280 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + V P + + R K+ + ++ D++ Sbjct: 281 VDVWCGGKCGLNQHLFKVTPLKNYPRWFVYYWTNRHLKKFKLIAKDKAVTMGHIRRGDLE 340 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 V +P +I IN + I L+ R + + ++G+ Sbjct: 341 NAEVAIPTNLNMLEINARINPLF----QSIIDRRLEITKLENIRDALLPKLMSGE 391 >gi|300861381|ref|ZP_07107467.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TUSoD Ef11] gi|300849173|gb|EFK76924.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TUSoD Ef11] Length = 403 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 60/412 (14%), Positives = 129/412 (31%), Gaps = 48/412 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESG----TGKYLPKDGNSRQSDTSTV 78 + W++ + + +G S D + G+ + G TG D Sbjct: 14 EDWELCKLSGVIEKLSGGASIKPTDYLEDGIRTIPKGAVNATGIADLSGSKYISEDFFEK 73 Query: 79 SI---FAKGQILYG---------KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 +I ++ +G +R + + + + + + + L Sbjct: 74 NITSHVHTNNLVTSLRDLVPSAPNMGRIVRIEGDEEQFLMPQGVYKLELFEGMDGDFLIS 133 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + S + I A G+T H NI + +P EQ I ++D IT Sbjct: 134 FSNSDKYRKIISAEKNGSTQVHIRNGEFLNIDINLPSKYEQKKIGAF----FKQLDDTIT 189 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 R ++ LKE K+A + + K +++ + E D W++ + + Sbjct: 190 LHQRKLDQLKELKKAYLQLMFPKKDETVPRVRFADFE------DDWQLCKLGETFSIIMG 243 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRS 304 ++ Y + + +N + P + T + + G+++ + Sbjct: 244 QSPNSENYTENPDDYILVQGNSDMKNNKVVPRIWTTQVTKKAEKGDLILSVRAPVGEIGK 303 Query: 305 LRSAQVMERGII----TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 V+ RG+ DS Y +S+ Sbjct: 304 TDYNVVLGRGVAAVKGNDFIFQQLRKMKDSGYWTRY------------STGSTFESINSN 351 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 D+K + +P EQ I + +D + + + LK + S++ Sbjct: 352 DIKEALINIPNKDEQQKIGD----LFTHLDDAIILNQNKLNQLKSLKKSYLQ 399 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 49/180 (27%), Gaps = 5/180 (2%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W++ + + G++ S + G R T K Sbjct: 228 DWQLCKLGETFSIIMGQSPNSENYTENPDDYILVQGNSDMKNNKVVPRIWTTQVTKKAEK 287 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G ++ P + D++ + ++ D + L + + G Sbjct: 288 GDLILSVRAPV-GEIGKTDYNVVLGRGVAAVKGNDFI----FQQLRKMKDSGYWTRYSTG 342 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T + I + IP EQ I + I + + L K Q + Sbjct: 343 STFESINSNDIKEALINIPNKDEQQKIGDLFTHLDDAIILNQNKLNQLKSLKKSYLQNMF 402 >gi|160946888|ref|ZP_02094091.1| hypothetical protein PEPMIC_00849 [Parvimonas micra ATCC 33270] gi|158447272|gb|EDP24267.1| hypothetical protein PEPMIC_00849 [Parvimonas micra ATCC 33270] Length = 417 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 46/413 (11%), Positives = 123/413 (29%), Gaps = 23/413 (5%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL 64 K Y K+ V+W + + GR I LE+ + Y Sbjct: 3 KIYELLKNEKVEW----------KKLGEVCNIKRGRVISK------IYLEEHKGEFPVYS 46 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + N+ + + F + G Y + + +++PKD +L Sbjct: 47 SQTRNNGEIGRISTYDFDGEFATWTTDGAYAGTVFYRNGKFSVTNICGLIEPKD-NKKLS 105 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +++ + + + G+ + I +PIP + Q I + + T + L Sbjct: 106 VKFIVYWLQIEAKKHVKGGSGNPKLMSNVVERIKIPIPSIETQEKIVKTLDKFTNYVTEL 165 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 +E ++ ++ + ++++ + + L + +L Sbjct: 166 QSELQSELQSRTKQYEYYRDMLLSEEYLNKLSCHLEENRLLKLEWKTLDEISVGSLSYGS 225 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + + + +I P E I++ +I+F K Sbjct: 226 -GASAIDYDGETRYIRITDINDSGGLNKEKASPNVVEAKYILNNEDILFARSGSTVGKNY 284 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVK 363 + + V ++ + + + S G + ++ + Sbjct: 285 IHLINDKCIYAGYLIRLIVNREIALPKFVFYCLNTNRYKIFVDNTKSRGSQPNINAKQYG 344 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + + PI+ Q + +++ + + + + I ++ R + Sbjct: 345 SFKIPIIPIEIQNKVVEILDKFRSLLADTKGLLPKEIEQRQKQYEYYREKLLT 397 >gi|210135697|ref|YP_002302136.1| type I R-M system S protein [Helicobacter pylori P12] gi|210133665|gb|ACJ08656.1| type I R-M system S protein [Helicobacter pylori P12] Length = 402 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 59/419 (14%), Positives = 127/419 (30%), Gaps = 49/419 (11%) Query: 22 PKHWKVVPIKRF---TKLNTGRTSESG-----------KDIIYIGLEDVESGTGKYLPKD 67 P +W+ V + TG + I +I +D Y Sbjct: 11 PSNWQRVRLGDMTTSFTKQTGFDYSASIKPTLIKEQLPNYIPFIQNKDFLGHYINYKTDY 70 Query: 68 GNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVLP-ELL 124 + + + +L G A+ VL+ K+ + + Sbjct: 71 FIPNEIAIRFPQILLNEKCLLISISGAIGNVAVFNHSQDAFIGGAIAVLKFKEKKSLDFV 130 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +L+S + + I + ++ + + ++ +P+PPL EQ+ I + + +L Sbjct: 131 MHFLMSASGQKSLLNIVKSSSHKNLTIADLRDLLIPLPPLNEQIAIANILSDLDHYLYSL 190 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 ++ + K L+S ++K W + +++ Sbjct: 191 DALILKKESVKKALSFELLSQ--------RKRLKGFNQAWQRVRLGDIFFITAGGDLSKP 242 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + NTK + N S + L Y ++ I+ I Sbjct: 243 HYSNTKQSDFNYPIYSNAIDKKGLY---------GYSSFFIIKNKSITITARGTMG---- 289 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAW--LMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + + ++P + + + S KV + L V Sbjct: 290 -VAFFRDYPYVPIGRLLVLQPKISNIDCRFYAEYINS----KVKFNTEQTTIPQLTIPKV 344 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +L+PPI EQ I N+++ I L K Q + + ++ +I + Sbjct: 345 ALCEILLPPINEQIAIANILSALDNEIISLKNKKRQ----FDNIKKALNHDLMSAKIRV 399 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 61/178 (34%), Gaps = 5/178 (2%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 I G+ I + + +++ ++ + + Sbjct: 48 PNYIPFIQNKDFLGHYINYKTDYFIPNEIAIRFPQILLNEKCLLISISGAIGNVAVFNHS 107 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 Q G + + +D + +LM + + + S ++L D++ L + Sbjct: 108 QDAFIGGAIAVLKFKEKKSLD-FVMHFLMSASGQKSLLNIVKSSSHKNLTIADLRDLLIP 166 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +PP+ EQ I N+++ + L I + + + + ++ + L+G +Q Sbjct: 167 LPPLNEQIAIANILSDLDHYLYSLDALILKK----ESVKKALSFELLSQRKRLKGFNQ 220 >gi|298292624|ref|YP_003694563.1| restriction modification system DNA specificity domain protein [Starkeya novella DSM 506] gi|296929135|gb|ADH89944.1| restriction modification system DNA specificity domain protein [Starkeya novella DSM 506] Length = 392 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 54/419 (12%), Positives = 129/419 (30%), Gaps = 46/419 (10%) Query: 24 HWKVVPIKRFTKLNTGRTS---ESGKDIIYIGLEDVESGTG--KYLPKDGNSRQSDTSTV 78 WK + + G G + IG+ D + + + + Sbjct: 4 GWKRRSLADLLEFRNGMNFTQASQGARVKIIGVGDFKDKEVLNDFSETPSITLNGKLNPD 63 Query: 79 SIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW----LLS 130 + +L+ + R +++ S ++ + E+ + + S Sbjct: 64 DLLKNDDLLFVRSNGNKALIGRCVLVSGITEPISFSGFTIRGRVKSDEINHSFASKLVRS 123 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + + G+++++ + +PPL EQ I E + I+ L R Sbjct: 124 PLFKEHLHRMGGGSSINNLSQDTLSEFCFSLPPLPEQRKIAEILRTWDEAIEKLEALRAA 183 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + + +Q L H ++ + ++ + Sbjct: 184 KLRRITSVRQRLFEAAFA---------------------SHNRLQRARDIFEPVSERARP 222 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + G + + R + + +Y++V PG+ V + Sbjct: 223 DLPLLAVMQDIGIVRRDELDRRVAMPDGDTSSYKVVRPGDFVISLRSFEG-----GLEYS 277 Query: 311 MERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPV 367 G+++ AY ++P +S + G+R + + F D +P+ Sbjct: 278 TITGLVSPAYTVLRPTTEVVGDYYRHFFKSRSFIGRLDKLIFGIRDGKQIAFRDFGDMPI 337 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 PP+ EQ T + A + I L ++ + +TG+ + E+ Sbjct: 338 PAPPVSEQKAQTGALGCLEADL----ALENVRIEALTRQKRGLMQKLLTGEWRVNVEAD 392 >gi|119945591|ref|YP_943271.1| restriction modification system DNA specificity subunit [Psychromonas ingrahamii 37] gi|119864195|gb|ABM03672.1| restriction modification system DNA specificity domain [Psychromonas ingrahamii 37] Length = 611 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 63/495 (12%), Positives = 131/495 (26%), Gaps = 104/495 (21%) Query: 21 IPKHWKVVPIKRFTK-LNTGRTSESGKD------IIYIGLEDVESGTGKYLPK-DGNSRQ 72 +P W + T L +G T GK+ +I++ ++V + K + Sbjct: 120 LPGGWAFERLGNLTSRLGSGSTPRGGKNAYVDKGVIFLRSQNVWNDGLKLDDTAYISDET 179 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQ-PKDVLPELLQGWL 128 + +L G L ++ I S V++ + + L + Sbjct: 180 HHKMENTRVFPNDVLLNITGASLGRSTIFPKALVTANVSQHVTVIRLIHPSICQYLHLAI 239 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK--------------- 173 +S V + G + K + P+PPL EQ I K Sbjct: 240 MSPLVQELAWGRQVGMAIEGLSKKVLEQFEFPVPPLEEQHRIVAKVDELMLLCDLFEQKT 299 Query: 174 --------------------------IIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + R+ + + KQ ++ V Sbjct: 300 ESSIDAHKTLVEVLLTTLTDSKNSDELNKNWARVSEFFDILFTTEHSIDQLKQTILQLAV 359 Query: 208 TKGLNPDVKMKD-------------------------------SGIEWVGLVPDHWEVKP 236 L + + + E VP WE Sbjct: 360 MGKLVAQNENDEPASKLLERIAAEKETLIKDKKIKKQKALPPITDEEKPFSVPSGWEWCR 419 Query: 237 FFALVTELNRKNTKLI---ESNILSLSYGNIIQKLET---RNMGLKPESYETYQIVDPGE 290 + ++ + L G+I + + + G+ Sbjct: 420 IYDASLFTEYGTSEKAFEGNDGVPVLKMGDIQSGKVYHGGQKVVPSTIKDLPNLYLKYGD 479 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAY---MAVKPHGIDSTYLAWLMRSYDLCKVF- 346 I++ + + + ++Y + + YL M++ K Sbjct: 480 ILYNRTNSAELVGKTGMFEGDDDIFTFASYLIRIRCDFEKVAPQYLTLSMQTPLFKKTQI 539 Query: 347 --YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV----EKIEQSI 400 + + ++ +K + + +P + EQ+ I N + D L E + + Sbjct: 540 DPHVKQQCGQANVNGTIMKSMLISIPSLSEQYRIVNKVEELMTLCDQLKTRLNESQQSQL 599 Query: 401 VLLKERRSSFIAAAV 415 L + I AV Sbjct: 600 HLA----DALIEQAV 610 >gi|288926746|ref|ZP_06420657.1| type I restriction system specificity protein [Prevotella buccae D17] gi|288336476|gb|EFC74851.1| type I restriction system specificity protein [Prevotella buccae D17] Length = 205 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 64/175 (36%), Gaps = 9/175 (5%) Query: 247 KNTKLIESNILSLSYGNIIQKLET----RNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + + I + YG I T + E + G++V ++ Sbjct: 28 QKKDFTPAGIGCIHYGQIYTYYGTCAKKTKSFVSQELALKARKAKYGDLVIATTSENDED 87 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFED 361 A + + I S H ++ Y+A+ ++ K + +G + + +D Sbjct: 88 VCKAVAWLGDEDIAISGDACFYTHTMNPKYVAYYFQTEQFQKQKRSFITGTKVRRVNTKD 147 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + ++ + VPP+ EQ I +++ + E + + I L ++ R ++ Sbjct: 148 LAKIEIPVPPLAEQQRIVAILDDFDTLTTSISEGLPKEIELRRKQYEYYRDQLLS 202 >gi|293388419|ref|ZP_06632927.1| putative restriction endonuclease S subunit [Enterococcus faecalis S613] gi|312908545|ref|ZP_07767489.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis DAPTO 512] gi|312908985|ref|ZP_07767847.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis DAPTO 516] gi|291082194|gb|EFE19157.1| putative restriction endonuclease S subunit [Enterococcus faecalis S613] gi|310625512|gb|EFQ08795.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis DAPTO 512] gi|311290685|gb|EFQ69241.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis DAPTO 516] Length = 405 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 62/405 (15%), Positives = 155/405 (38%), Gaps = 36/405 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W++ +K T+ G ++ D+ + + + + GN + ++ Sbjct: 18 EDWELCKLKEITERVKG--NDGRMDLPTLTISASQGWLNQKDRFSGNIAGKEQKNYTLLL 75 Query: 83 KGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 K ++ Y KL Y + ++ + + + E Sbjct: 76 KNELSYNHGNSKLAKYGAVFSLKTYEEALVPRVYHSFKSTKNSDPDFLEYIFATKKPDKE 135 Query: 139 ------AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + ++ NI + IP + EQ I + +ID +IT R + Sbjct: 136 LGKLVSSGARMDGLLNINYDDFSNIKINIPHVHEQKKISNLL----RKIDDIITLHQRKL 191 Query: 193 ELLKEKKQALVSYIVTKGLNPDVK-MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + LKE K+A + + + K K ++ G W+ + + + ++K+T Sbjct: 192 DQLKELKKAYLQLMFVSMNTKNNKVPKLRFADFEGD----WKQRKLGDFLEDFSKKSTIE 247 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 E ILS + + E R + S Y+I+D G++V +L ++ + Sbjct: 248 NEYIILSSTNNGM----EIREGRVSGNSNLGYKIIDDGDLVLSPQNLWLGNINI---NNI 300 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM----GSGLRQSLKFEDVKRLPV 367 +G+++ +Y K ++ +L +R+ + + S +R++L+ + ++ + Sbjct: 301 GQGLVSPSYKTFKIIDLNKEFLNPQLRTNKMLDQYKNASTQGASIVRRNLELDLFYQIRI 360 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P +EQ I + +++ + + + +K + +++ Sbjct: 361 FIPKNEEQKQIG----LLFRKLNESISLHQSKLDSIKYLKKAYLQ 401 Score = 44.4 bits (103), Expect = 0.039, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 63/185 (34%), Gaps = 8/185 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + F + + +++ + II + S ++G + I Sbjct: 227 DWKQRKLGDFLEDFSKKSTIENEYII------LSSTNNGMEIREGRVSGNSNLGYKIIDD 280 Query: 84 GQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G ++ +L I + G+ S + + D+ E L L + + + + Sbjct: 281 GDLVLSPQNLWLGNININNIGQGLVSPSYKTFKIIDLNKEFLNPQLRTNKMLDQYKNAST 340 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 S ++ I + +++I +++ I+ ++ +K K+A Sbjct: 341 QGA-SIVRRNLELDLFYQIRIFIPKNEEQKQIGLLFRKLNESISLHQSKLDSIKYLKKAY 399 Query: 203 VSYIV 207 + + Sbjct: 400 LQNMF 404 >gi|291276639|ref|YP_003516411.1| putative type I restriction-modification system S protein [Helicobacter mustelae 12198] gi|290963833|emb|CBG39669.1| putative type I restriction-modification system S protein [Helicobacter mustelae 12198] Length = 435 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 59/419 (14%), Positives = 116/419 (27%), Gaps = 33/419 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 P + IK G + + +I + DV + + Sbjct: 13 PHGVEFKAIKDIAMFRRGSFPQPYTRSKWYGGDNSMPFIQVIDVADTMKLNEKSKQSISK 72 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 KG ++ G K I +D + + + + Sbjct: 73 LAQPKSVFVPKGTVIVTLQGTI-GKVAITQYDSYIDRTIAIFTSYRINIDKKYFAYMLYQ 131 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + GAT+ + + +P+PPL Q I + + T L TE Sbjct: 132 KFAMEKMLARGATLKTITKEEFSDFKIPLPPLEVQREIVKILDTFTELNTELNTELKLRK 191 Query: 193 ELLKEKKQALVS--------YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + + + L+S + L K + L P E + + Sbjct: 192 KQYEYYRNWLLSFGDVDASKEGAEQRLRNKSYPKALKALLLSLCPHGVEFRKLGEVCERS 251 Query: 245 NRKNTKLIES-------NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 N + N S G I + ++ E I++ +V + Sbjct: 252 TGINITAAQMKKLQETFNKTSSQRGIKIFGGGETKVNIRSEDISEKSIINAESVVVKSRG 311 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 + S+ K + +L + + S A L Sbjct: 312 NIGFEYCNEPFSHKNEIWSYSS----KTNEAMIKFLHYYLASKQDYFQRLANEFTKMPQL 367 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 K LP+ +PP++ Q +I +++ + + L I I K+ R + Sbjct: 368 KVSHTDNLPIPLPPLEVQREIVKILDDFSTLTEDLSSGIPAEIAARKKQYEYYRDKLLT 426 >gi|94266628|ref|ZP_01290308.1| hypothetical protein MldDRAFT_3372 [delta proteobacterium MLMS-1] gi|93452747|gb|EAT03290.1| hypothetical protein MldDRAFT_3372 [delta proteobacterium MLMS-1] Length = 348 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 72/196 (36%), Gaps = 4/196 (2%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +P+ W + Y + E + Q+V Sbjct: 4 ELPEGWVSNTLCQFTQSRGSSINPAKFPAEIFELYSVPSYETGVPERVSGMEIGSSKQVV 63 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKV 345 P ++ I+ + ++ + ++Q R I ++ ++ P G++ +L + ++ + Sbjct: 64 VPNSVLLCKINPRINRSWVVASQSDFRQIASTEWIVFPPSEGVEPKFLCFFLKQNAVRDF 123 Query: 346 FYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 SG+ +K +K P V P+ EQ I I AR+D + + L Sbjct: 124 LAQNVSGVGGSLMRVKPSTLKGHPFPVAPLNEQRRIVEKIETLFARLDKGEAALREVQKL 183 Query: 403 LKERRSSFIAAAVTGQ 418 L R S + AAVTGQ Sbjct: 184 LASYRQSVLKAAVTGQ 199 >gi|146281850|ref|YP_001172003.1| type I restriction-modification system, S subunit [Pseudomonas stutzeri A1501] gi|145570055|gb|ABP79161.1| type I restriction-modification system, S subunit [Pseudomonas stutzeri A1501] Length = 527 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 81/483 (16%), Positives = 150/483 (31%), Gaps = 90/483 (18%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG 61 +A P+ +S +I +P W + T++ D + + + G Sbjct: 65 AKRQALPEVCESEQPYI--LPNGWAWGRLGDVTEIL---------DSLRRPVTKQDRKPG 113 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK----AIIADFDGICSTQFLVLQPK 117 Y P G S D + IF + +L G+ G A + VL+PK Sbjct: 114 PY-PYYGASGVVDYVSAYIFDEPLVLVGEDGAKWGVGERTAFSITGKTWVNNHAHVLRPK 172 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + +L+ Q + G T+ + + +I +P+PPLAEQ I K+ Sbjct: 173 R--DAVCDDYLVISLTAQDLSQFITGMTVPKLNQARLTSIGIPLPPLAEQHRIVAKVDEL 230 Query: 178 TVRID-----------------------------------------TLITERIRFIELLK 196 D + Sbjct: 231 MALCDRLEAQQADAESAHALLVQALLHSLTQAADAEDFAASWQRLAEHFHTLFTTESSID 290 Query: 197 EKKQALVSYIVTKGLNPDVK--------MKDSGIEWVGLVPDHWEVKPFF-------ALV 241 KQ L+ V L P ++ +E + Sbjct: 291 ALKQTLLQLAVMGKLVPQDPNDEPASELLQRIAVERLDREGSRRSKSQVELREIDGSEKK 350 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP-------ESYETYQIVDPGEIVFR 294 EL + I+S+S G+ + + G P + Q V+ +V Sbjct: 351 FELPAGWEWVRLQQIVSVSSGDGLVSAKMNTEGSVPVYGGNGVTGHHDRQNVEKETLVIG 410 Query: 295 FIDLQNDKRSL--RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + L SA V + +I + ID ++L WL++ +L + A Sbjct: 411 RVGYYCGSIHLTPASAWVTDNALI----VRFSERNIDKSFLFWLLKGTNLKEQENA---T 463 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + + + + + +PP+ EQ I +N D L ++ Q+ L ++ ++ + Sbjct: 464 AQPVISGRKIYPIVLAIPPLAEQRRIVAKLNQLMVLCDQLKTRLTQARRLNEQLATALVE 523 Query: 413 AAV 415 AV Sbjct: 524 QAV 526 >gi|94266646|ref|ZP_01290324.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] gi|93452717|gb|EAT03266.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] Length = 344 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 64/184 (34%), Gaps = 9/184 (4%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + + K +ES I + N+ + + E +V+ +I+ + Sbjct: 18 LGEFINGVAFKPADWVESGIPIIRIQNLTDPD--KPLNRTEREVEDKYVVEHNDILVSWS 75 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLR 354 + R R + V P+ T + + ++ ++ + Sbjct: 76 ATLDAFR-----WRGPRAYVNQHIFKVVPNPELDTGFVFYALKESIRELVHSEHLHGTTM 130 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + + + P +PP+ EQ I I AR+D + + LL R S + AA Sbjct: 131 KHINRKPFLAHPRALPPLNEQRRIVEKIETLFARLDKGEAALREVQKLLASYRQSVLKAA 190 Query: 415 VTGQ 418 VTGQ Sbjct: 191 VTGQ 194 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 49/333 (14%), Positives = 100/333 (30%), Gaps = 42/333 (12%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W + + + G + + G+ + K N + + Sbjct: 5 DLPTGWVMANVDALGEFINGVAFKPADWVES-GIPIIRIQNLTDPDKPLNRTEREVEDKY 63 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + IL L + + P L + L + + + + Sbjct: 64 VVEHNDILVSWS-ATLDAFRWRGPRAYVNQHIFKVVPNPELDTGFVFYALKESIRELVHS 122 Query: 140 IC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G TM H + K P +PPL EQ I EKI R+D +LL Sbjct: 123 EHLHGTTMKHINRKPFLAHPRALPPLNEQRRIVEKIETLFARLDKGEAALREVQKLLASY 182 Query: 199 KQALVSYIVTKGLNPDVK---------------------------------MKDSGIEWV 225 +Q+++ VT L D + + Sbjct: 183 RQSVLKAAVTGQLTADWRAENAHRLEPGRDLLTRILQTRRDTWQGRGKYKEPTTPDTTNL 242 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESN---ILSLSYGNIIQK-LETRNMGLKPESYE 281 +P+ W + + ++ ++ ++ + L GNI+ L+ RN P+ + Sbjct: 243 PELPEGWVWATVDQVSSSVDYGSSAKCTTDAIGVPVLRMGNIVGGTLDLRNFKYLPDDHS 302 Query: 282 TYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + +++ +++F + Q E Sbjct: 303 EFPKLLLESRDLLFNRTNSAELVGKTAVYQGPE 335 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 8/98 (8%), Positives = 26/98 (26%), Gaps = 6/98 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS 73 + +P+ W + + + +S + + + ++ GT Sbjct: 242 LPELPEGWVWATVDQVSSSVDYGSSAKCTTDAIGVPVLRMGNIVGGTLDLRNFKYLPDDH 301 Query: 74 DTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICST 109 + +L+ + + K + S Sbjct: 302 SEFPKLLLESRDLLFNRTNSAELVGKTAVYQGPESVSN 339 >gi|67920713|ref|ZP_00514232.1| Restriction modification system DNA specificity domain [Crocosphaera watsonii WH 8501] gi|67856830|gb|EAM52070.1| Restriction modification system DNA specificity domain [Crocosphaera watsonii WH 8501] Length = 563 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 74/480 (15%), Positives = 139/480 (28%), Gaps = 94/480 (19%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W+ V + T L T + + ++ ++D+ SG S ++ Sbjct: 85 LPIGWEWVRLDDITLLITDGAHHTPTYRFSGVPFLSVKDISSGFINLANTRFISEETHQK 144 Query: 77 TVSIFAK--GQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + IL K+ +I +F S L + L+ + S Sbjct: 145 LIKRCHPEFNDILLTKVETTGIAKVIDIDIEFSIFVSLALLKFNKSLIYTYYLELLINSP 204 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV------------ 179 V ++ +G + K I N P+PPL EQ I +K+ Sbjct: 205 LVKEKSAKNTQGVGNKNLVLKHIKNFVTPLPPLNEQHRIVKKVAQLMKYCDELENKKTEQ 264 Query: 180 ----------------------------RIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 +I E +K+ +Q ++ V L Sbjct: 265 KKQLILLGETATNKLTKTKEEDFKNNWQQIQENFELIYSTPENIKQLRQTILQLAVMGKL 324 Query: 212 NPDVKMKD-------------------------------SGIEWVGLVPDHWEVKPFFAL 240 P K + + E +P WE + Sbjct: 325 VPQDKSDEPASILLEKIKSEKAKLVKDKKIKKSKPLPPITDDEIPYNLPVGWEWVRLGNI 384 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 V + + + + K E ++T+ F D+ Sbjct: 385 VNFIGGSQPPKKKFIYHEEKGYTRL----IQIRDFKSEEFKTFVPNQYANRPFSKDDVMI 440 Query: 301 DKRSLRSAQVME--RGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMG--SGLR 354 + Q++ G A M P I YL +L++ + K+ A + + Sbjct: 441 GRYGPPVFQILRGLEGTYNVALMKADPIHLLISKDYLYYLLQEPRIQKIVIAESERTAGQ 500 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 ++ E + + +P + EQ I ++ D L +++ Q I E R I A Sbjct: 501 TGVRKELINAFVIGLPSLNEQHRIVKKVDQLMKYCDDLEQQLTQGI----EYRKKLIQTA 556 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 58/192 (30%), Gaps = 8/192 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFAL---VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 + E +P WE + +T+ S + LS +I Sbjct: 77 TDDEIPYNLPIGWEWVRLDDITLLITDGAHHTPTYRFSGVPFLSVKDISSGFINLANTRF 136 Query: 277 PESYETYQIVDPGEIVFRFIDLQ----NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 +++ F I L + + ++ A + I + Y Sbjct: 137 ISEETHQKLIKRCHPEFNDILLTKVETTGIAKVIDIDIEFSIFVSLALLKFNKSLIYTYY 196 Query: 333 LAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L L+ S + + G+ ++L + +K +PP+ EQ I + D Sbjct: 197 LELLINSPLVKEKSAKNTQGVGNKNLVLKHIKNFVTPLPPLNEQHRIVKKVAQLMKYCDE 256 Query: 392 LVEKIEQSIVLL 403 L K + L Sbjct: 257 LENKKTEQKKQL 268 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 62/191 (32%), Gaps = 5/191 (2%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVS 79 +P W+ V + G K I + + ++ + + Sbjct: 372 LPVGWEWVRLGNIVNFIGGSQPPKKKFIYHEEKGYTRLIQIRDFKSEEFKTFVPNQYANR 431 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL---LSIDVTQR 136 F+K ++ G+ GP + + + +G + + P +L + Sbjct: 432 PFSKDDVMIGRYGPPVFQI-LRGLEGTYNVALMKADPIHLLISKDYLYYLLQEPRIQKIV 490 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I A + + I + +P L EQ I +K+ D L + + IE K Sbjct: 491 IAESERTAGQTGVRKELINAFVIGLPSLNEQHRIVKKVDQLMKYCDDLEQQLTQGIEYRK 550 Query: 197 EKKQALVSYIV 207 + Q + ++ Sbjct: 551 KLIQTAIYQLL 561 >gi|18765815|gb|AAL78770.1|AF326620_1 JHP1422-like protein [Helicobacter pylori] Length = 370 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 56/400 (14%), Positives = 123/400 (30%), Gaps = 39/400 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P +W+ V + + G Y + + Y S+ I Sbjct: 7 PSNWQKVRLGDIFFITAGGDLSK---PHYSNTKQSDFNYPIYSNAIEKKGLYGYSSFFII 63 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G A D+ + + LVLQPK + + + +++ Sbjct: 64 KNKSITITSRGTI-GVAFFRDYPYVPIGRLLVLQPKISNIDCRF---YAEYINSKVKFNT 119 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 E T+ + +P+PPL EQ+ I + + +L ++ + K Sbjct: 120 EQTTIPQLTIPKVALCEIPLPPLNEQIAIANVLSDVDRYLYSLDALILKKESVKKALSFE 179 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+S + + W+ +++ + + L + Sbjct: 180 LLSQ----------------RKRLKGFNQAWQRVRLGDILSYEQPTKFLVATTQYLQKGF 223 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I+ +T +G + + Y + V F D D + + + +SA Sbjct: 224 TPILTAGKTFILGYTNDKHGIYTNIP----VIIFDDFTTDSKMV----NFPFKVKSSAIK 275 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + + L ++ + ++ +L+PP+ EQ I N+ Sbjct: 276 ILSLRDNNQADLKYI----YEKLTLLKHQVTDHKRYWIDEFSNFEILLPPLNEQIAIANI 331 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ I L K Q + + + ++ +I + Sbjct: 332 LSDLDNEIIGLKNKKRQ----FENIKKALNHDLMSAKIRV 367 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 57/201 (28%), Gaps = 16/201 (7%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 P +W+ + + + S N Y ++ I+ Sbjct: 6 TPSNWQKVRLGDIFFITAGGDLSKPHYSNTKQSDFNYPIYSNAIEKKGLY-GYSSFFIIK 64 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW--LMRSYDLCKV 345 I + + + ++P + + + S KV Sbjct: 65 NKSITITSRGTIG-----VAFFRDYPYVPIGRLLVLQPKISNIDCRFYAEYINS----KV 115 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + L V + +PP+ EQ I NV++ + L I + + Sbjct: 116 KFNTEQTTIPQLTIPKVALCEIPLPPLNEQIAIANVLSDVDRYLYSLDALILKK----ES 171 Query: 406 RRSSFIAAAVTGQIDLRGESQ 426 + + ++ + L+G +Q Sbjct: 172 VKKALSFELLSQRKRLKGFNQ 192 >gi|325682981|ref|ZP_08162497.1| type I restriction-modification system S subunit [Lactobacillus reuteri MM4-1A] gi|324977331|gb|EGC14282.1| type I restriction-modification system S subunit [Lactobacillus reuteri MM4-1A] Length = 365 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 52/395 (13%), Positives = 126/395 (31%), Gaps = 40/395 (10%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +K G ++ KD+ + +G+Y P G + + + + Sbjct: 2 KLKDVC--IKGTSNIRQKDV---------NDSGRY-PVYGAAGPVGFMNSFQYDEPYVGV 49 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 K G + +A + L PK + + +S +E GAT+ H Sbjct: 50 VKDGAGIGRATYLPSNSSIIGTMQALIPKKNVLPKYLYYAVSS---MHLEKYYSGATIPH 106 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 +K + + EQ II ++ +I+ + + + L E +A V Sbjct: 107 IYFKNYKHERFVLVSKKEQEQ----IIWRFSLLEKMISNKQQQLLKLDELIKA---RFVE 159 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 +P K + + D + I + N+ Sbjct: 160 MFGDPISNKKSWKKRLLNDLVDKIGS------GATPKGGKESYQDHGISFIRSMNVHDGY 213 Query: 269 ETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAV 323 + + IV ++ + + ++ + + + Sbjct: 214 FNYKDLAYINSTQAKQLSNVIVQSQDVFINITGASVARSCIVPDDILPARVNQHVSIIRC 273 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 K ++ ++ L + ++ + G RQ++ + ++ L +++PPI Q + N Sbjct: 274 KSDVLNPIFINNLFLNDSFKRILLSIGLSGGATRQAITKKQLEMLKIILPPISLQNEYAN 333 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ ++D I++S+ ++ S + Sbjct: 334 FVH----QVDKSKVVIQKSLDETQKLYDSLMQEYF 364 >gi|259907262|ref|YP_002647618.1| Type I restriction modification DNA specificity domain protein [Erwinia pyrifoliae Ep1/96] gi|224962884|emb|CAX54365.1| Type I restriction modification DNA specificity domain protein [Erwinia pyrifoliae Ep1/96] Length = 437 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 48/429 (11%), Positives = 123/429 (28%), Gaps = 64/429 (14%) Query: 51 IGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF---DGI 106 I +++ + K + G IL G + + +A Sbjct: 2 IRSQNIYNDGFKNSGLAYITEDAAKKLNNVEVQDGDILLNITGDSVARVCLAPEGHLPAR 61 Query: 107 CSTQ--FLVLQPKDVLPELLQGWLLSIDVTQRIEAICE-GATMSHADWKGIGNIPMPIPP 163 + + K+ ++ +L S + I GAT + I ++ + P Sbjct: 62 VNQHVAIIRPNSKEFDARFIRYFLASPAQQNVLLTIASAGATRNALTKSNIESLLICKPC 121 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL--------------------- 202 L Q I +++ + +I + ++ + ++ Sbjct: 122 LKNQKWIADQLESLDKKIHSNQQINQTLEQMAQALFKSWFVDFEPVKAKIALLEAGGSQQ 181 Query: 203 ------VSYIVTKGLNPDVKMKDSGIE-------------------WVGLVPDHWEVKPF 237 ++ I K + K E +G +P W Sbjct: 182 EATLAAMTAISGKDADSLEVFKHKQPEKYAELKATAELFPSAMQESELGEIPQGWTNSEI 241 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-------VDPGE 290 + E N + N+ K +I + G Sbjct: 242 GEEIDIAGGATPSTKEPKFWENGDINWTTPKDLSNLQDKILIKTDRKITDRGLAKISSGL 301 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + + + + A I Y+A+K + + ++++ ++ Sbjct: 302 LAIDTVLMSSRAPVGYLALTKIPVAINQGYIAMKCNYDLNPEFVLQWCNHNMPEIISRAS 361 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + ++ +P++ P + ++ E + +L+EK + +L++ R + Sbjct: 362 GTTFAEISKKNFNPIPLIKPT----KKMVDIYTREVRSLYLLIEKNVRKTEILQQLRDTL 417 Query: 411 IAAAVTGQI 419 + ++G+I Sbjct: 418 LPKLLSGEI 426 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 59/197 (29%), Gaps = 12/197 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKY---LPKD 67 +G IP+ W I + G T + + DI + +D+ + K + Sbjct: 229 LGEIPQGWTNSEIGEEIDIAGGATPSTKEPKFWENGDINWTTPKDLSNLQDKILIKTDRK 288 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 R + + A +L P + + ++ ++ L Sbjct: 289 ITDRGLAKISSGLLAIDTVLMSSRAPV-GYLALTKIPVAINQGYIAMKCNYDLN-PEFVL 346 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I + G T + K IP+ P + ++ + + I+ + + Sbjct: 347 QWCNHNMPEIISRASGTTFAEISKKNFNPIPLIKPTKKMVDIYTREVRSLYLLIEKNVRK 406 Query: 188 RIRFIELLKEKKQALVS 204 +L L+S Sbjct: 407 TEILQQLRDTLLPKLLS 423 >gi|188496140|ref|ZP_03003410.1| putative type I restriction-modification system, S subunit [Escherichia coli 53638] gi|188491339|gb|EDU66442.1| putative type I restriction-modification system, S subunit [Escherichia coli 53638] Length = 588 Score = 91.4 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 54/494 (10%), Positives = 129/494 (26%), Gaps = 100/494 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK---------DIIYI 51 +K K P+ S + +P+ W+ + T G+T + I ++ Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWARLPDITYYRVGKTPPTKDLSFWETSTTGIPWV 140 Query: 52 GLEDVESGTGKYLPKDGNSR--QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST 109 + D+ S+ Q+D IL + K I D + Sbjct: 141 SISDLNHNGIVNATSKHVSKKAQADIFKYLPIPAETILMS-FKLTVGKTSILKTDAYHNE 199 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV- 168 + + + + + + + + + +P+P EQ Sbjct: 200 AIISINEMKGIHKNY--LFHILPFIVLQGNTKQAIMGHTLNSDSLSMLLLPVPCEKEQCR 257 Query: 169 ----------------------------------------LIREKIIAETVRIDTLITER 188 E++ RI+ Sbjct: 258 ITYKYEELMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNAEELAENWARINEHFDTL 317 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKD----------------------------- 219 + KQ ++ V L P + Sbjct: 318 FTTETSVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLP 377 Query: 220 --SGIEWVGLVPDHWEVKPFFALVT----ELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 S E +P+ WE L + + ++++ L N+ + + Sbjct: 378 PISDEEKPFELPEGWEWCRIDDLTFVSGGIQKQPKRRPVKNHFPYLRVANVQRGDINIDK 437 Query: 274 GLKPE---SYETYQIVDPGEIVF--RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + E + ++ +I+ R +E+ + + + V+ Sbjct: 438 LERFELEPHELAFWSLEKNDILIVEGNGSADEIGRCAIWHAPIEKCVYQNHLIRVRGIIE 497 Query: 329 -DSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ++A + S K + + +L ++ + + +PP+ +Q I + I Sbjct: 498 GYQEFIALYLNSPSGIKEMQRLAVTTSGLYNLSVGKIRGITIPLPPLNQQNLILSRIREY 557 Query: 386 TARIDVLVEKIEQS 399 + L + + Sbjct: 558 ILVCENLKTSTQSA 571 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 57/200 (28%), Gaps = 20/200 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+ I T ++ G + Y+ + +V+ G + + Sbjct: 387 ELPEGWEWCRIDDLTFVSGGIQKQPKRRPVKNHFPYLRVANVQRGDINIDKLERFELEPH 446 Query: 75 TSTVSIFAKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL----QG 126 K IL G R AI C Q +++ + ++ Sbjct: 447 ELAFWSLEKNDILIVEGNGSADEIGRCAIWHAPIEKCVYQNHLIRVRGIIEGYQEFIALY 506 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + + + + I I +P+PPL +Q RI I Sbjct: 507 LNSPSGIKEMQRLAVTTSGLYNLSVGKIRGITIPLPPLNQQ-------NLILSRIREYIL 559 Query: 187 ERIRFIELLKEKKQALVSYI 206 + +Q + Sbjct: 560 VCENLKTSTQSAQQTQLHLA 579 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 58/201 (28%), Gaps = 11/201 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE--------SNILSLSYGNIIQKLETR 271 S E +P+ WE + K + + I +S ++ Sbjct: 93 SEEEKPFELPEGWEWARLPDITYYRVGKTPPTKDLSFWETSTTGIPWVSISDLNHNGIVN 152 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 I I I + +++ + A +++ + + Sbjct: 153 ATSKHVSKKAQADIFKYLPIPAETILMSFKLTVGKTSILKTDAYHNEAIISI--NEMKGI 210 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + +L + + L + + L + VP KEQ IT + D Sbjct: 211 HKNYLFHILPFIVLQGNTKQAIMGHTLNSDSLSMLLLPVPCEKEQCRITYKYEELMSLCD 270 Query: 391 VLVEKIEQSIVLLKERRSSFI 411 L ++ S+ ++ + + Sbjct: 271 QLEQQSLTSLDAHQQLVETLL 291 >gi|296876904|ref|ZP_06900950.1| type I restriction/modification specificity protein [Streptococcus parasanguinis ATCC 15912] gi|296432096|gb|EFH17897.1| type I restriction/modification specificity protein [Streptococcus parasanguinis ATCC 15912] Length = 417 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 48/419 (11%), Positives = 121/419 (28%), Gaps = 41/419 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKD---------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W++ + + G++ ++ + DV++ D Sbjct: 6 WEITSLSELGTFSRGKSKHRPRNDIKLFEGGTYPLVQTGDVKAANLYITKNDSYYNDFGL 65 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 ++ G + + + + I + + L + + + Sbjct: 66 KQSKLWPAGTLCIT-IAANIAETAILSYPMCFPDSIVGFNANPEKSSELFVYYFFEYIKK 124 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I+ G+ + + + + + +P Q I E + ID I + + L Sbjct: 125 EIQKSASGSIQDNINIDYLSKMRIKVPEKDYQDKIVEVL----SSIDKKILLNNQINQEL 180 Query: 196 KEKKQALVSYIVTKGLNPDV---KMKDSG------IEWVGLVPDHWEVKPFFALVTELNR 246 + + L Y + PD K SG E +P+ W V+ +++ Sbjct: 181 EGMAKTLYDYWFVQFDFPDQNGKPYKSSGGKMVYNPELKREIPEGWGVETLRDFESKIIT 240 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET----------YQIVDPGEIVFRFI 296 T ++ I + R + E+ + + G + I Sbjct: 241 GKTPSRANSDNFGGKIPFITIGDIRGNTFIYSTSESLTDLGASVQQNKYLPEGSLCVSCI 300 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + + I + V YL + +++Y A + Sbjct: 301 ATVGEIGFTTEWSHTNQQINS----IVFEDENHRYYLYFALKNYFENAKASAKTGNTFAN 356 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + ED + +++P +I N + + ++ ++ L + R + + Sbjct: 357 MNKEDFSGIRIILPS----KEIKNNFHEISEPYFAQIKCLQGQNQELTQFRDWLLPMLM 411 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 61/196 (31%), Gaps = 9/196 (4%) Query: 20 AIPKHWKVVPIKRF-TKLNTGRTSES------GKDIIYIGLEDVESGTGKY-LPKDGNSR 71 IP+ W V ++ F +K+ TG+T G I +I + D+ T Y + Sbjct: 221 EIPEGWGVETLRDFESKIITGKTPSRANSDNFGGKIPFITIGDIRGNTFIYSTSESLTDL 280 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + +G + + + + + Q + +D + L Sbjct: 281 GASVQQNKYLPEGSLCVSCI-ATVGEIGFTTEWSHTNQQINSIVFEDENHRYYLYFALKN 339 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + G T ++ + + I + +P + E +I L + Sbjct: 340 YFENAKASAKTGNTFANMNKEDFSGIRIILPSKEIKNNFHEISEPYFAQIKCLQGQNQEL 399 Query: 192 IELLKEKKQALVSYIV 207 + L++ V Sbjct: 400 TQFRDWLLPMLMNRQV 415 >gi|283458001|ref|YP_003362608.1| restriction endonuclease S subunit [Rothia mucilaginosa DY-18] gi|283134023|dbj|BAI64788.1| restriction endonuclease S subunit [Rothia mucilaginosa DY-18] Length = 390 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 50/410 (12%), Positives = 118/410 (28%), Gaps = 63/410 (15%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE---------SGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 P + P+ + +G + +I + + D+ + N+ Sbjct: 17 PDGVEYRPLGEIADVTSGYVFPVKYQGNEKSAEDNIPFYKVSDMNLPGNEMFMTSSNNYV 76 Query: 73 SDTST----VSIFAKGQILYGKLGPYL--RKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 S S S+ I++ K+G + K I I + + + V+ Sbjct: 77 SAESAEEMRASLAQPESIIFPKIGAAIATNKKRILTEKSIVDNNVMAVTARSVINSKFLY 136 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 ++ + + + + +P+PP+ Q I E + T L Sbjct: 137 YV--LSGFDLMSWSMGAGAVPSIKKSVVVKHEVPVPPMEVQEAIVEILDKFTNLEAELEA 194 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 E + + +L + P + N Sbjct: 195 ELEARTLQYEYYRDSLFEAL------------------------DCPRVPLDSFAKIKNG 230 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K K + + + + T I G + +L Sbjct: 231 KTYKDFGAGNIPV----YGSGGIMTYVDRSSYDKPTVLIPRKGSL-----------GNLF 275 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + + T Y + + +L + +++ L + +G SL + + ++ Sbjct: 276 YLEEPFWNVDTIFYTEIDEEQVIPKFLYYFLKTAHLEDL---NTAGGVPSLTQKVLNKVL 332 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + VP ++EQ I ++++ A L E + + + R ++ Sbjct: 333 IPVPSLEEQQRIVDILDRFDALTSSLSEGLPAELTARRSQYEYYRDQLLS 382 >gi|312887840|ref|ZP_07747427.1| restriction modification system DNA specificity domain protein [Mucilaginibacter paludis DSM 18603] gi|311299659|gb|EFQ76741.1| restriction modification system DNA specificity domain protein [Mucilaginibacter paludis DSM 18603] Length = 546 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 60/428 (14%), Positives = 135/428 (31%), Gaps = 46/428 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W +P + T + K +G + + V++ Sbjct: 18 SGWLTIPFGESIEKTGTFTKLTSKQYN-------ATGNYPVVDQGETFISGYIDDVNLIY 70 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 KG + G + R +F + + + Sbjct: 71 KGDLPVIIFGDHTRFVKYINFKFAVGADGTKILKPINALNEKFFYYYIKSLNIPSLGYSR 130 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 ++ + + +P+P ++EQ I K+ IDTL + R EL+K+ + L Sbjct: 131 HFSI-------LKTVKIPVPSISEQHRIVAKLDKAFDNIDTLKGKIERIPELIKQFRLQL 183 Query: 203 VSYIVTKGLNPDVKMKDSGIEW-------------------------VGLVPDHWEVKPF 237 + Y ++ L D + + + +P W Sbjct: 184 LDYAISGKLTADWRKNNIQDANDIVKNLKQRTNDSKRLDFFEDIEVTLFDIPKQWTFAYL 243 Query: 238 FALVTELNRKNTKLIES--NILSLSYGNIIQK-LETRNMGLKPESYE-TYQIVDPGEIVF 293 AL ++ + E+ +I L GN+ ++ ++ + E ++ G+++F Sbjct: 244 GALSEKITYGTSVKSENEGDIPVLRMGNLQNGQIDWSDLKYTSDPEELKKYSLNRGDVLF 303 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSG 352 + + + I M + +S YL +L+ S + + + Sbjct: 304 NRTNSPELVGKTSIYESDNQAIYAGYLMKIWNKPELNSYYLNYLLNSAYARNWCWQVKTD 363 Query: 353 L--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + ++ + + + V +PP EQ I +N ++LV + E + + Sbjct: 364 GVSQSNINAQKLSKFVVPLPPPDEQTIIVVKLNKLFESAEILVNQFESLRLKINALPQVL 423 Query: 411 IAAAVTGQ 418 + A G+ Sbjct: 424 LQKAFRGE 431 >gi|164551512|gb|ABY60973.1| Sau1hsdS1 [Staphylococcus aureus] Length = 394 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 68/398 (17%), Positives = 142/398 (35%), Gaps = 34/398 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ K + + L G G +PK + SD + Sbjct: 20 EWEE---KSISSFLKESKIKGSNGSHAKKLTVKLWGKG-VVPKKETFKGSDNTQYYKRKA 75 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI--- 140 GQ++YGKL I + D + + L I + + Sbjct: 76 GQLMYGKLDFLNCAFGIVPDSLNNYESTIDSPSFDFINGDSKFLLERIKLKSFYKKFGDI 135 Query: 141 -CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + ++P+ P EQ+ I E ++D I + + +ELL+++K Sbjct: 136 ANGSRKAKRINQDTFLSLPVFAPKYDEQLRIGEF----FSKLDRQIELQKQKLELLQQQK 191 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + + I ++ L + G HWE + E N ++ + Sbjct: 192 KGYMQKIFSQEL--------RFKDENGEDYPHWENSKIEKYLKERNERSD--KGQMLSVT 241 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 II+ E ++ Y++V +I + + + + GI++ A Sbjct: 242 INSGIIKFSELDRKDNSSKNKSNYKVVRKNDIAYNSMRMWQGASGKSNY----NGIVSPA 297 Query: 320 YMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVPPIKEQ 375 Y + P S+ + +++ + F GL +LK++ +K + + +P ++EQ Sbjct: 298 YTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTSDTWNLKYKQLKNINIDIPVLEEQ 357 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + ++D+L+ K + I +L++ + SF+ Sbjct: 358 EKIGDF----FKKMDILISKQKIKIEILEKEKQSFLQK 391 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + + K Y G++++ +D N + + T + Sbjct: 51 WGKGVVPKKETFKGSDNTQYYKRKAGQLMYGKLDFLNCAFGIVP-DSLNNYESTIDSPSF 109 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ--SLKFEDVKRLPVLVPPIKEQFDITNV 381 DS +L ++ K F + +G R+ + + LPV P EQ I Sbjct: 110 DFINGDSKFLLERIKLKSFYKKFGDIANGSRKAKRINQDTFLSLPVFAPKYDEQLRIGEF 169 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +++D +E +Q + LL++++ ++ + ++ + E Sbjct: 170 ----FSKLDRQIELQKQKLELLQQQKKGYMQKIFSQELRFKDE 208 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 65/184 (35%), Gaps = 9/184 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD-TSTVSIFA 82 HW+ I+++ K R+ + + + + SG K+ D S S + Sbjct: 215 HWENSKIEKYLKERNERSDKGQM----LSVT-INSGIIKFSELDRKDNSSKNKSNYKVVR 269 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I Y + + + ++++GI S + VL P L G+ I Sbjct: 270 KNDIAYNSMRMWQGASGKSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINS 329 Query: 143 G---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + +K + NI + IP L EQ I + + I + + + Sbjct: 330 QGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKIKIEILEKEKQSFL 389 Query: 200 QALV 203 Q + Sbjct: 390 QKMF 393 >gi|113476050|ref|YP_722111.1| restriction modification system DNA specificity subunit [Trichodesmium erythraeum IMS101] gi|110167098|gb|ABG51638.1| restriction modification system DNA specificity domain [Trichodesmium erythraeum IMS101] Length = 402 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 61/412 (14%), Positives = 134/412 (32%), Gaps = 31/412 (7%) Query: 25 WKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS- 76 W+ V ++ K+ T T+ S + I ++ + +++ G ++D + Sbjct: 3 WQRVFVEDVAKIVTKGTTPTSIGFSFSKEGIPFLRVNNIQDGKINLGDVLFIDSKTDQAL 62 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQF-LVLQPKDVLPELLQGWLLSIDV 133 S K ++ G + A+I C+ ++ +V P WL + D Sbjct: 63 ARSRILKKDVIISIAGTIGKTAVIPTNAPAMNCNQALAIIRLHNNVDPYYFNHWLNTGDA 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++I AT+S+ I + +P+PP+ EQ I + E Sbjct: 123 FRQITGSKVTATISNLSLGCIKKLKIPLPPIEEQRRIAAILDQADAIRRKRQQAIALTDE 182 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L L S + +P + K ++ + V + + + + Sbjct: 183 L-------LRSTFLEMFGDPVINPKGWEVKKLEEVALKRKGAIKCGPFGSQLLISEFVKD 235 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVD-----PGEIVFRFIDLQNDKRSLRSA 308 + + + +QK E K + E Y+ + +++ Sbjct: 236 G--IPVYGIDNVQKNEFVWAKPKYITTEKYEQLKSFSIQDEDVLISRTGTVGRTCVAPPD 293 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMG-SGLRQSLKFEDVKRLP 366 +++ + + YL++ + S L + M ++K L Sbjct: 294 IPRSILGPNLLKVSLNTNKMLPKYLSYALNHSNPLIEEIKRMSPGATVAVFNTTNLKALR 353 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +P I Q N T +++ +K + +S + A GQ Sbjct: 354 LTIPHINLQSQFVNF----TENVELTKQKESNYLTESNNLFNSLLQRAFKGQ 401 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 63/208 (30%), Gaps = 22/208 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES------------GKDIIYIGLEDVESGTGKY-LPKDG 68 PK W+V ++ G I G+++V+ + PK Sbjct: 199 PKGWEVKKLEEVALKRKGAIKCGPFGSQLLISEFVKDGIPVYGIDNVQKNEFVWAKPKYI 258 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVL--QPKDVLPEL- 123 + + + +L + G R + I L + +LP+ Sbjct: 259 TTEKYEQLKSFSIQDEDVLISRTGTVGRTCVAPPDIPRSILGPNLLKVSLNTNKMLPKYL 318 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 S + + I+ + GAT++ + + + + IP + Q T ++ Sbjct: 319 SYALNHSNPLIEEIKRMSPGATVAVFNTTNLKALRLTIPHINLQSQFVNF----TENVEL 374 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGL 211 + ++ +L+ L Sbjct: 375 TKQKESNYLTESNNLFNSLLQRAFKGQL 402 >gi|302333477|gb|ADL23670.1| type I restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus JKD6159] Length = 412 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 48/400 (12%), Positives = 125/400 (31%), Gaps = 20/400 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDI---IYIGLEDVESGTGKYLPKDGNSRQS----DTS 76 WK ++ + T + +++ ++ + D Sbjct: 20 EWKEKKLEDTLEFIKDGTHGTHENVNNGPWLLSAKNIKNNKIIISSDDRKISESDYKKIY 79 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ--FLVLQPKDVLPELLQGWLLSIDVT 134 KG +L +G R AI+ + + I + ++ + + Sbjct: 80 KNYKLEKGDLLLTIVGTIGRAAIVKNPNNIAFQRSVAILKTKATYDVGFIFQLFQTKYFK 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + I I + I + E+ KI ++D I + +EL Sbjct: 140 NLLLRKQVVSAQPGLYLGDIRKIKISITNIIEEQ---RKIGIFFSKLDRQIELEEQKLEL 196 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+++K+ + I ++ L + + +W + + N K + Sbjct: 197 LQQQKKGYMQKIFSQELRFKDENGNDYPKWEEKKIEDIASQ--VYGGGTPNTKIKEFWNG 254 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 +I + ++ K S + ++ I I + + V Sbjct: 255 DIPWIQSSDVKVNDLILQQCNKFISKNSIELSSAKLIPANSIAIVTRVGVGKLCLVEFDY 314 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIK 373 + ++++ D Y + + Y + K+ + + + +++ + +P ++ Sbjct: 315 ATSQDFLSLSSLKYDKLYSLYSLL-YTMKKISANLQGTSIKGITKKELLDSIIKIPHNLE 373 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I + +ID + + I +LK + + Sbjct: 374 EQQKIGD----LFYKIDKYISFNKCKIEMLKSLKQGLLKK 409 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 57/180 (31%), Gaps = 5/180 (2%) Query: 213 PDVKMKDSGIEWVGLVPDHW-EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 P+++ + EW + E T N N + S + II + + Sbjct: 10 PELRFPEFEGEWKEKKLEDTLEFIKDGTHGTHENVNNGPWLLSAKNIKNNKIIISSDDRK 69 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + ++ G+++ + +++ + S + D Sbjct: 70 ISESDYKKIYKNYKLEKGDLLLTIVGTIGRAAIVKNPNNI--AFQRSVAILKTKATYDVG 127 Query: 332 YLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARI 389 ++ L ++ + + L D++++ + + I+EQ I + +I Sbjct: 128 FIFQLFQTKYFKNLLLRKQVVSAQPGLYLGDIRKIKISITNIIEEQRKIGIFFSKLDRQI 187 >gi|168485625|ref|ZP_02710133.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC1087-00] gi|183571135|gb|EDT91663.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC1087-00] Length = 426 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 66/415 (15%), Positives = 139/415 (33%), Gaps = 64/415 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +P L+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPSLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDS---------------------------------------GIEWVGLVPDHW 232 +S E +P+ W Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYEEVPCEIPESW 251 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQI 285 E + + + R + + + + ++ L SY+ ++ Sbjct: 252 EWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERL 311 Query: 286 VDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----YMAVKPHGIDSTYLAWLMRSY 340 + G++++ L R ++ G + + V I+ ++ + S Sbjct: 312 LRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIYNFLSSP 371 Query: 341 DLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + V SG ++ L + +K + +PP+ EQ I + I A ID L+ Sbjct: 372 IVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDALI 426 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 65/192 (33%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +P + EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPSLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 54/181 (29%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLE----DVESGTGKYLPKDGNSRQSD 74 IP+ W+ V + T + G++ + IY + + Sbjct: 246 EIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQWSGFSIDLARFIDPETVHS 305 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 306 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIY 365 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 366 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 425 Query: 185 I 185 I Sbjct: 426 I 426 >gi|197121943|ref|YP_002133894.1| restriction modification system DNA specificity domain [Anaeromyxobacter sp. K] gi|196171792|gb|ACG72765.1| restriction modification system DNA specificity domain [Anaeromyxobacter sp. K] Length = 364 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 54/386 (13%), Positives = 116/386 (30%), Gaps = 40/386 (10%) Query: 43 ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKA---- 98 + + ++ +ED+ P + + FA G +L K+ P Sbjct: 2 SDSEYVSFVPMEDLGITQKYLEPTKERCLSDVIGSYTYFADGDVLLAKITPCFENGKLGI 61 Query: 99 --IIADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQRIEAICEG-ATMSHADWKGI 154 + + G S+++ VL+P+ + +L G + + I Sbjct: 62 ARGLVNGIGFGSSEYFVLRPQPSVTSEWLYYFLARSAFRAVGATRMTGAVGHKRVEKEFI 121 Query: 155 GNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 + P+P+PPL EQ I + + ++ I + + Sbjct: 122 ESCPIPVPPLEEQRRITALLDKSFQSLSDALSAAADGIHRADALFDSYRHSVF------- 174 Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + G P + ++S + ++ + Q Sbjct: 175 -------VAQKGERPTTTLD------------RIATNLDSKRVPITKADRRQGAFPYYGA 215 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTY 332 Y + I D ++ RS + + + + A++ Y Sbjct: 216 SGIVDYVSDYIFDGDTLLVSEDGANLLSRSTPIAFSVTGKYWVNNHAHVLKFNDSATQRY 275 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDV 391 + + + S L K + L + + +P+ +P E+ DI + + Sbjct: 276 VEFYLESISLQKYV---TGAAQPKLTQKALNSIPIPLPATPAERADIVKRMQSLESEAQR 332 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTG 417 L E L+E R S + A +G Sbjct: 333 LRALYESKSAALEELRESLLHTAFSG 358 >gi|323935281|gb|EGB31634.1| type I restriction modification DNA specificity domain-containing protein [Escherichia coli E1520] Length = 461 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 71/450 (15%), Positives = 141/450 (31%), Gaps = 65/450 (14%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W + + N G+ + K+ Y+G +V G + S Sbjct: 5 WVHAKLGDYIDSNLGKMLDQNKNKGDFHPYLGNSNVRWGYFDLENLSLMKFEEHESDRYG 64 Query: 81 FAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI- 137 KG ++ + G R AI D + ++P L + Sbjct: 65 IRKGDLIICEGGEPGRCAIWEDDVPNMKIQKALHRVRPLPGLTSEYLYYWFLYFGRTGQL 124 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +A G T+ H K + +P+ IPP+ EQ I + + +I ++ + Sbjct: 125 DAYFTGTTIKHLTGKALSELPIEIPPIDEQKHISMVLGSLDTKIKANRKINKTLEQMSQT 184 Query: 198 KKQA-------LVSYIVTKGLNPDV-----------------KMKDSGIE---------- 223 ++ ++ + G NP K +E Sbjct: 185 LFKSWFVDFDPVIDNALDAG-NPIPEALQTRAELRQKVRNSADFKPLPVEIRSLFPSEFV 243 Query: 224 --WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS------------YGNIIQKLE 269 +G VP W K + T K + GN ++ Sbjct: 244 ETELGWVPKGWHYKNAEEIATISIGKTPPRTQKECFCDKKDSNYAWVSIKDLGNCSVFIK 303 Query: 270 TRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + L ++ +Y + + P + V L + ++ + I Y HGI Sbjct: 304 DSSEYLTSDAVNSYNVKIVPKDAVLLSFKLTIGRIAIAEDILTTNEAIAHFY--NMKHGI 361 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + YL ++ +D + S + ++ + ++++PVLVP I T Sbjct: 362 NKEYLYSYLKIFDYNSL--GSTSSIATAINSKIIRKIPVLVPDGD----ILEKYKKSTDI 415 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I ++ +I L R + + ++G+ Sbjct: 416 IFQKIKFNNGNICNLTALRDTLLPKLISGE 445 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 21/199 (10%), Positives = 52/199 (26%), Gaps = 14/199 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTS----------ESGKDIIYIGLEDVESGTGKYLPKD 67 +G +PK W + ++ G+T + + ++ ++D+ + + Sbjct: 247 LGWVPKGWHYKNAEEIATISIGKTPPRTQKECFCDKKDSNYAWVSIKDLGNCSVFIKDSS 306 Query: 68 GNSRQSDTSTV--SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 ++ I K +L + + IA+ + Sbjct: 307 EYLTSDAVNSYNVKIVPKDAVLLS-FKLTIGRIAIAEDILTTNEAIAHFYNMKHGINKEY 365 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + + + K I IP+ +P ++ +I Sbjct: 366 LYSYLKIFDYNSLGSTSSIA-TAINSKIIRKIPVLVPDGDILEKYKKSTDIIFQKIKFNN 424 Query: 186 TERIRFIELLKEKKQALVS 204 L L+S Sbjct: 425 GNICNLTALRDTLLPKLIS 443 >gi|313896514|ref|ZP_07830065.1| type I restriction modification DNA specificity domain protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312974938|gb|EFR40402.1| type I restriction modification DNA specificity domain protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 376 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 56/386 (14%), Positives = 121/386 (31%), Gaps = 22/386 (5%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + +GLE + G + D S D + F +G IL Sbjct: 4 VTLGEVAMEARETCKGDRSGFPTVGLEHITPGEIRLSEYDVGS---DNTFTKRFHEGDIL 60 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +G+ YL+KA IA F+GICS V++ + P LL + + D G+ Sbjct: 61 FGRRRAYLKKAAIAPFEGICSGDITVIRAIQDKMEPRLLPFVIQNDDFFDFAVGRSAGSL 120 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 W+ + +P + +Q + + + I+ ++E ++ Sbjct: 121 SPRVKWEHLKTYSFELPEMDKQRELADVLW----AIEDTRAAYQELAVAMEELVKSQFVE 176 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + + + + + + + + GNI+ Sbjct: 177 MFGDPILNTHGWQKVSLSALAEIKIGPFGSLLHREDYIVGGHPVVNPSH----VHDGNIV 232 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + K + Y + + ++V Q T + + Sbjct: 233 IDEKLTISETKYKELSAYHLFE-NDVVLGRRGEMGR---CAVVQTSGLLCGTGSMIIRTL 288 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + + +L ++ K+ M G +L + RL ++ PP + Q + Sbjct: 289 GEVRADFLQKIISFPSFKKMLEDMAVGQTMPNLNVPIISRLEIIKPPNEVQNAYYAFVEQ 348 Query: 385 ETARIDVLVEKIEQ----SIVLLKER 406 + E +++ + +L+E Sbjct: 349 VDKSKLTIREILKKNAAMKLAILREY 374 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 50/165 (30%), Gaps = 9/165 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSE---SGKDI----IYIGLEDVESGTGKYLPK-DGNSRQSDT 75 W+ V + ++ G I + V G K + + Sbjct: 187 GWQKVSLSALAEIKIGPFGSLLHREDYIVGGHPVVNPSHVHDGNIVIDEKLTISETKYKE 246 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG-WLLSIDVT 134 + + ++ G+ G R A++ +C T ++++ + + Sbjct: 247 LSAYHLFENDVVLGRRGEMGRCAVVQTSGLLCGTGSMIIRTLGEVRADFLQKIISFPSFK 306 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + +E + G TM + + I + + PP Q + Sbjct: 307 KMLEDMAVGQTMPNLNVPIISRLEIIKPPNEVQNAYYAFVEQVDK 351 >gi|253735334|ref|ZP_04869499.1| possible type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus TCH130] gi|253726741|gb|EES95470.1| possible type I site-specific deoxyribonuclease specificity subunit [Staphylococcus aureus subsp. aureus TCH130] Length = 372 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 53/395 (13%), Positives = 115/395 (29%), Gaps = 46/395 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + K+N+G+ + ++ G G + Sbjct: 20 EWEEKQLGNIIKVNSGKDYK-----------HLDKGDIPVYGTGGYMTSVSEP---LSEI 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + ++ T F K+ + + E Sbjct: 66 DAVGIGRKGTINKPYLLEAPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I I +P EQ I + +I+ + + K Q + Sbjct: 122 TGVPSLSKQTINKINRFVPTNKEQQKIGKFFSKLDRQIELEEQKLELLQQQKKGYMQKIF 181 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + + G W + + T ++ K + S Sbjct: 182 SQELRFK------------DENGNDYPDWTNERLGEVTTVTMGQSPKSVNYTDNSNDTVL 229 Query: 264 IIQKLETRNMGLKPESY--ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I + N + P Y E +++ EI+ + + RG+ + Sbjct: 230 IQGNADIENGLINPRIYTREVTKLIQKDEIILTVRAPVGKLAMAQINACIGRGVCS---- 285 Query: 322 AVKPHGIDSTYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 +L + + + K +S+ D++ + + +P E+ I Sbjct: 286 -----IKGDKFLYYFLEWFATQNKWIRFSQGSTFESISGNDIRNIHIKIPVEDERTKIIK 340 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++N +DVL K + I LK+R+ S + Sbjct: 341 LLNS----LDVLNSKTDLKIQNLKQRKQSLLQKIF 371 >gi|238018338|ref|ZP_04598764.1| hypothetical protein VEIDISOL_00163 [Veillonella dispar ATCC 17748] gi|237864809|gb|EEP66099.1| hypothetical protein VEIDISOL_00163 [Veillonella dispar ATCC 17748] Length = 408 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 68/400 (17%), Positives = 140/400 (35%), Gaps = 27/400 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + +N +T K Y+ LE V GT + + + + + G Sbjct: 22 WEQRKLSEVVTINP-KTELPDK-FKYVDLESV-VGTNLLGFQVIKKENAPSRAQRLASYG 78 Query: 85 QILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + Y + PY R + D D + ST + L+ K L + + + + + C Sbjct: 79 DVFYQTVRPYQRNNYLFENIDKDMVFSTGYAQLRSKL-DSYFLLTLVQNDNFVKVVLDNC 137 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + + +G I + IP E+ +I ID +IT R +E LK K+A Sbjct: 138 TGTSYPAINGSELGKITVQIPSNDEE---ANQIGKVFRGIDNIITLHQRKLEKLKLIKKA 194 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL-- 259 L+ + + + +++ G + ++ F + K L E+ L Sbjct: 195 LLQKLFPQHGSNIPELRFKG---FTDAWEQRKLSEFVDKAVDNRGKTPPLDENGAHPLIE 251 Query: 260 --SYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + G + L + + T + +I+F + + S + E I Sbjct: 252 VAALGGVYPDYSKVEKYLSDDVFNTNLRAYIKKDDILFTTVGSIGLVSLMDSRE--EAAI 309 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLV-PPIK 373 + YL L + D + ++ S+K + + ++ I+ Sbjct: 310 AQNIVAFRAKENFLPEYLYALFSNEDNQYKAKRIAMVAVQPSIKVSQLVNVEYMISTNIE 369 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I + + L+ ++ + +LK + + Sbjct: 370 EQERIGVF----FSSLQSLITLHQRKLDMLKNVKKGLLQK 405 Score = 43.6 bits (101), Expect = 0.068, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 62/169 (36%), Gaps = 6/169 (3%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + + V +N K + + L L + + + ++ G+ Sbjct: 20 DAWEQRKLSEVVTINPKTELPDKFKYVDLESVVGTNLLGFQVIKKENAPSRAQRLASYGD 79 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + ++ + L + + + ++ Y ++ DS +L L+++ + KV Sbjct: 80 VFYQTVRPYQRNNYLFE-NIDKDMVFSTGYAQLRSKL-DSYFLLTLVQNDNFVKVVLDNC 137 Query: 351 SGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 +G ++ ++ ++ V +P E+ N I ID ++ ++ Sbjct: 138 TGTSYPAINGSELGKITVQIPSNDEE---ANQIGKVFRGIDNIITLHQR 183 >gi|237822638|ref|ZP_04598483.1| restriction modification system DNA specificity subunit [Streptococcus pneumoniae CCRI 1974M2] Length = 329 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 43/362 (11%), Positives = 89/362 (24%), Gaps = 35/362 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSG-TLGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI + L EQ I ++ + I + L + S Sbjct: 120 MKHLTKKYFDNIMVSYTNLGEQQRIASELDLLSKLILRRQEQLEELNLL-------VKSR 172 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +P K ++ G + F + + I Sbjct: 173 FNEMFGDPLNNNKKFAVKT-GQQCFKFSSGKFLDKHDRVFEGYPAYGGNGIAW------- 224 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 ++D I+ + + I+ + +K Sbjct: 225 --------------KSRKYLIDNPTIIIGRVGA----YCGNVRTTHGKVWISDNAIYIKE 266 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 L +L+ + + + + ++ ++PP+ Q + + + Sbjct: 267 FKNSDFNLVFLLELMKVIDFSKFADFSGQPKITQKPLENQKYILPPLALQNEFADFVVQV 326 Query: 386 TA 387 Sbjct: 327 DK 328 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 42/142 (29%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIMVSYTNLGEQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 >gi|295401866|ref|ZP_06811830.1| restriction modification system DNA specificity domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294976120|gb|EFG51734.1| restriction modification system DNA specificity domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 397 Score = 91.0 bits (224), Expect = 4e-16, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 134/416 (32%), Gaps = 49/416 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDV-ESGTGKYLPKDGNSRQSDTST 77 +W+ + + K+ G + I + + ESG K D Sbjct: 2 NWRNIKLGEVLKIKHGYAFKGKYFGDKGKYIVLTPGNFRESGGLKLKGDKEKYYLGDFPK 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL----------QPKDVLPELLQGW 127 I KG +L I+ I + + + V+ E + Sbjct: 62 EYILHKGDLLVVMTDLTQECRILGSAAFIDADDVYLHNQRLGKVVDINTELVMKEFVYYL 121 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S V ++ G T+ H I + + IPP+ Q I + + +I Sbjct: 122 FNSKSVRTQLINSSSGTTVHHTSPDRIYEVEVQIPPIKIQEKIVSILKSIDDKIQLNRQM 181 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 E+ + + V G D + +S +G++P++W++ L L+ K Sbjct: 182 NETLEEMAMTLYK---HWFVDFGPFQDGEFVES---ELGVIPNNWKIGQVKDLAKVLSGK 235 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 K+ + + G P + + + + + Sbjct: 236 RPKVKDIGEYPIFGGGG------------PMGVTNEYLYNEPIFITGRVGTIGKVFRVSK 283 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 ++ + + ++L ++++ D + G + + +K + V Sbjct: 284 PCWPSD----NSLVLIPLKAYYYSFLYAVLKNIDFSLI---TGGSTQPLITQTSLKSIKV 336 Query: 368 LVPPIK--EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++PP + EQ+ N + L++K + L E R + ++G+ID+ Sbjct: 337 IIPPEETIEQY------NKQVLTYYSLIDKNDNINKQLSEIRDYLLPRLLSGEIDV 386 Score = 44.4 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 62/187 (33%), Gaps = 18/187 (9%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G IP +WK+ +K K+ +G+ + P G + Sbjct: 213 LGVIPNNWKIGQVKDLAKVLSGKRPK--------------VKDIGEYPIFGGGGPMGVTN 258 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ + + G++G + ++ +++ K L L +ID Sbjct: 259 EYLYNEPIFITGRVGTIGKVFRVSKPCWPSDNSLVLIPLKAYYYSFLYAVLKNIDF---- 314 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 I G+T + +I + IPP ++++ ID + E+ Sbjct: 315 SLITGGSTQPLITQTSLKSIKVIIPPEETIEQYNKQVLTYYSLIDKNDNINKQLSEIRDY 374 Query: 198 KKQALVS 204 L+S Sbjct: 375 LLPRLLS 381 >gi|325498694|gb|EGC96553.1| restriction modification system DNA specificity subunit [Escherichia fergusonii ECD227] Length = 594 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 67/515 (13%), Positives = 137/515 (26%), Gaps = 106/515 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIG 52 +K K P+ S + +P W+ V + FT + G T + I + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPVGWEWVRLGDFTNIIRGITFPGNEKSQFQAPGKIACLR 140 Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK--LGPYLRKAII----ADFDGI 106 +V+ + S + I+ + K + + Sbjct: 141 TANVQEK-IDWDDLIYISDSFVKRDDQYLQEHDIVMSMANSRELVGKVALASLPDNSKFT 199 Query: 107 CSTQFLVLQPKDVLPELLQGWLLSIDVTQR-IEAICEGATMSHADWKGIGNIPMPIPPLA 165 VL+P V L L + IE+ + +++ + +P+ IPP Sbjct: 200 FGGFLSVLRPLVVNEIYLMALLRCETYKSQLIESASQTTNIANISLAKLNPLPVCIPPAK 259 Query: 166 EQVLIREKIIAETVR-----------------------------------------IDTL 184 EQ+ I +K+ I+ Sbjct: 260 EQIHIVKKMNELMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNAEELAENWARINEH 319 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------- 219 + KQ ++ V L P + Sbjct: 320 FDTLFTTEASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQ 379 Query: 220 ------SGIEWVGLVPDHWEVKPFFALVTELNRKNT---------KLIESNILSLSYGNI 264 S E +P+ WE F ++ + + ++ Sbjct: 380 KPLPPISDEEKPFELPEGWEWCLFEDIIDIQSGITKGRNLSNRTLVKVPYLRVANVQRGY 439 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AV 323 + E + + + E E YQ+V ++ D R+ + Sbjct: 440 LDLTEIKQIEIPIEEKEKYQVVKGDLLITEGGDWDTVGRTTVWCHDWYIANQNHVFKGRN 499 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +D +L M S + F + + S+ ++ PV +PP E I + Sbjct: 500 IGQDVDPYWLETYMNSPFSRQYFANASKQTTNLASINKTQLRGCPVAIPPSSEAKKIMSK 559 Query: 382 IN---VETARIDVLVEKIEQ-SIVLLKERRSSFIA 412 ++ + ++ +Q + L + I Sbjct: 560 LHIFYKLCEELKNHIQSAQQTQLHLADALTDAAIN 594 >gi|169825073|ref|YP_001692684.1| putative type I restriction enzyme [Finegoldia magna ATCC 29328] gi|167831878|dbj|BAG08794.1| putative type I restriction enzyme [Finegoldia magna ATCC 29328] Length = 466 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 60/434 (13%), Positives = 139/434 (32%), Gaps = 70/434 (16%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLN-----TGRTSESGKDIIY---------IGLED 55 YKD+ V G IP+ W+V IK T++ G + +++ Y I L D Sbjct: 80 YKDTEV---GIIPESWEVKQIKEVTEIVTDYVANGSFASLAENVKYKDEPDEAVLIRLVD 136 Query: 56 VESG-TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQF 111 + GK++ D ++ + S G+I+ +G + + + Sbjct: 137 YNNDFNGKFVFIDSHAY--EFLGKSKLFGGEIIISNVGAKVGTVFRCPTLKYKMSLAPN- 193 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 V+ + WL + +++I G+ + + +P+PP+ EQ I Sbjct: 194 SVMVKFKENDDFYFHWLRGYNGQSMLKSIVTGSAQPKFNKTNFREMLVPVPPVEEQTKIA 253 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 + + +ID L + L Sbjct: 254 NILNSIDEKID-----------LNNGINKNLEQQAFAI---------------------- 280 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 F + + + + + G + + N+ + E + Sbjct: 281 -----FNEMFVDSIYGENFVGDILTPNRGKGLLSKDAVPGNVPVVAGGLEPATYHNQSNT 335 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 V + + + + + +S + + ++ Y + M ++F A Sbjct: 336 VAPVLTISASGANAGYVNLWNIPVWSSDSSFIDTNMTENVYFWYAMLKSRQSEIFDAQTG 395 Query: 352 GLRQSLKFEDVKRLPV--LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + + + + RLP+ + P +I V + + + ++ + L R + Sbjct: 396 SAQPHIYPKHIARLPMGNIRPD-----EINQY-TVLVSPLFEAIGANKEENLSLASMRDA 449 Query: 410 FIAAAVTGQIDLRG 423 + ++G+ID+ Sbjct: 450 LLPKLMSGEIDVTN 463 >gi|291561048|emb|CBL39848.1| Restriction endonuclease S subunits [butyrate-producing bacterium SSC/2] Length = 371 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 54/401 (13%), Positives = 125/401 (31%), Gaps = 47/401 (11%) Query: 28 VPIKRFTKLNTG---RTSESGKDI------IYIGLEDVESGTGKYLP-KDGNSRQSDTST 77 V + L G + S K+I ++ + ++ + + Sbjct: 4 VKLGDIAVLINGDRGKNYPSQKEIITSGGIPFVNAGHLNGRAIEFEAMNYITPEKYEKLN 63 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWL--LSIDVT 134 F + ILY G +KA+I D G ++ ++++P L + + Sbjct: 64 SGKFQQNDILYCLRGSLGKKALINDNIYGAIASSLVIIRPNLEKVRPQYLMLALETPLIK 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +++ G++ + K + + +P L Q I K+ ++ LI + + L Sbjct: 124 EQLFKFNNGSSQPNLSAKSVKEYKLELPDLFIQDSIISKL----EKVRNLIEDEKQEKLL 179 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L QA V +P K IE + A +T ++ Sbjct: 180 LDNLIQA---RFVEMFGDPITNSKLLPIEKIEER------YFLKAGITTKAEDIHDYLKD 230 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 YG + N+ + + I Q + Sbjct: 231 KYEIPCYGGNGIRGYVENLSYEG--------------CYPIIGRQGALCGNVQYATGKFH 276 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 A + ++ ++ ++++ DL + + L + + + V+V I Sbjct: 277 ATEHAVLVSTLKNDNTMWVYYMLKLMDLYRY---HTGAAQPGLAVKKLNTIDVIVADINL 333 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q ++ +I+ ++++S+ + S + Sbjct: 334 QNQFAAFVH----QINKSKFEVQKSLEKTQLLYDSLMQEYF 370 >gi|114048353|ref|YP_738903.1| restriction modification system DNA specificity subunit [Shewanella sp. MR-7] gi|113889795|gb|ABI43846.1| restriction modification system DNA specificity domain [Shewanella sp. MR-7] Length = 589 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 61/471 (12%), Positives = 120/471 (25%), Gaps = 96/471 (20%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W+ + F + N G T I + +++ G ++ Sbjct: 103 ELPVGWEFARLGVFGETNIGLTYSPNDVGENGIPVLRSSNIQQGKIDLSDLVRVNKDVKE 162 Query: 76 STVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ + A G +L + A I D + + + L ++ +L S Sbjct: 163 SS--LVALGDLLICARNGSKSLVGKTAQIKSLDEPMAFGAFMAVFRSELNNYIELFLNSP 220 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK------------------ 173 + +E + T++ + IPPL EQ I K Sbjct: 221 LFRRNLEGVST-TTINQITQNNLKETVCTIPPLKEQHRIVAKVDELMALCDQLEQRSESQ 279 Query: 174 -----------------------IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 + R+ T + KQ ++ V Sbjct: 280 LAAHQTLVETLLATLTDSTDADELAQNWARLSTHFDTLFTTEASIDALKQTILQLAVMGK 339 Query: 211 LNPDVKMKD-------------------------------SGIEWVGLVPDHWEVKPFFA 239 L P + S E +P W Sbjct: 340 LVPQDPSDEPASTLLARIAAEKARLVKEKKIKKEKPLPALSENEKPFELPLGWAWSRISE 399 Query: 240 LVTELNRKNTKLI----ESNILSLSYGNIIQKLETRNMGLK---PESYETYQIVDPGEIV 292 +++ + L G+I + G+++ Sbjct: 400 SSLFCEYGSSEKTVSELSDGVPVLKMGDIQDGKVILGSHQVVSPKIDDLPNLYLKKGDVL 459 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAY---MAVKPHGIDSTYLAWLMRSYDLCKVF--- 346 + + + ++Y + H I YL M S K Sbjct: 460 YNRTNSAELVGKTGMFDGDDDTYTFASYLIRIRCSIHNIRPEYLTLCMNSPLFRKTQIEP 519 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + + ++ +K + V +PP EQ I I+ D L +++ Sbjct: 520 HIKQQCGQANVNGTLMKSMLVSIPPYHEQVLILQKIHELMTLCDQLKSRLQ 570 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 24/204 (11%), Positives = 62/204 (30%), Gaps = 4/204 (1%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPF----FALVTELNRKNTKLIESNILSLSYGNIIQKL 268 P + + + E +P WE + N + S K+ Sbjct: 89 PKAQPEIAEDEKPFELPVGWEFARLGVFGETNIGLTYSPNDVGENGIPVLRSSNIQQGKI 148 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + ++ + + +V G+++ + + + Sbjct: 149 DLSDLVRVNKDVKESSLVALGDLLICARNGSKSLVGKTAQIKSLDEPMAFGAFMAVFRSE 208 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + Y+ + S + + + + ++K +PP+KEQ I ++ A Sbjct: 209 LNNYIELFLNSPLFRRNLEGVSTTTINQITQNNLKETVCTIPPLKEQHRIVAKVDELMAL 268 Query: 389 IDVLVEKIEQSIVLLKERRSSFIA 412 D L ++ E + + + +A Sbjct: 269 CDQLEQRSESQLAAHQTLVETLLA 292 >gi|293609932|ref|ZP_06692234.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828384|gb|EFF86747.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 370 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 52/395 (13%), Positives = 110/395 (27%), Gaps = 39/395 (9%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + + + G T + V++G + S S I Sbjct: 7 RLDQVCLIRRGSTITKNQ---------VKAGNIPVVAGGKTSTISHNEANR--DAYTITV 55 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G S V + L ++ + + ++ GA H Sbjct: 56 SASGASAGFVNFWQVPIFASDCSTVEVINE-LADINYVYYFLKFKQDYLYSLQAGAAQPH 114 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 K I I +P+PPL EQ I + V + +LL+ + Sbjct: 115 VYAKDIAKIEIPLPPLPEQRRIAAILDQADVLRQKRQQAIEKLDQLLQAT-------FID 167 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 +P K +G + + + + ++ I L N+ Sbjct: 168 MFGDPVSNPKGWDFGCIGDMLESV----------KYGSSDKATLDGEIPILRMNNLTYSG 217 Query: 269 ETRNMGLKPESY---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA-VK 324 E LK + + +V G+I+F + + E + Sbjct: 218 EMDLRDLKYITKAQADEKYLVKEGDILFNRTNSKELVGKTAVYVGPEPMAYAGYLVRGRT 277 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 Y++ + S + +M + ++ ++ + + + +PP EQ Sbjct: 278 KESFAPEYISAFLNSPWGKEKLQSMCKSIVGMANINAKEFQSIVLPIPPENEQM----YF 333 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 I + + + + + S A +G Sbjct: 334 KTRVLAIREKKQLLVNQLNVFETLFKSLQNQAFSG 368 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 74/199 (37%), Gaps = 13/199 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 PK W I + +S+ +I + + ++ L ++ Sbjct: 176 PKGWDFGCIGDMLESVKYGSSDKATLDGEIPILRMNNLTYSGEMDLRDLKYITKAQADEK 235 Query: 79 SIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDV 133 + +G IL+ + + A+ + + +LV + PE + +L S Sbjct: 236 YLVKEGDILFNRTNSKELVGKTAVYVGPEPMAYAGYLVRGRTKESFAPEYISAFLNSPWG 295 Query: 134 TQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++++++C+ M++ + K +I +PIPP EQ+ + I + + Sbjct: 296 KEKLQSMCKSIVGMANINAKEFQSIVLPIPPENEQM----YFKTRVLAIREKKQLLVNQL 351 Query: 193 ELLKEKKQALVSYIVTKGL 211 + + ++L + + L Sbjct: 352 NVFETLFKSLQNQAFSGTL 370 >gi|226954356|ref|ZP_03824820.1| type I restriction modification DNA specificity domain-containing protein [Acinetobacter sp. ATCC 27244] gi|226834892|gb|EEH67275.1| type I restriction modification DNA specificity domain-containing protein [Acinetobacter sp. ATCC 27244] Length = 464 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 68/454 (14%), Positives = 134/454 (29%), Gaps = 62/454 (13%) Query: 22 PKHWKVVPIKRFTK---LNTGRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P W+ +P++ G+T + G I I + V+SG + LP D D + Sbjct: 6 PISWQQIPLEDALDALIDYRGKTPKKVGNGIPLITAKVVKSG--RILPMDEFIADEDYES 63 Query: 78 ---VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSID 132 I G I+ P A I D + + + + L+ K L + S Sbjct: 64 WMVRGIPQVGDIVVTTEAPLGEVAQIKDANVALAQRIVTLRGKVDFLENNFLLFLMQSNF 123 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 V ++EA G+T+ + + +PIPP+ EQ I + + +I Sbjct: 124 VQNQLEARATGSTVKGIKQSELRKVILPIPPINEQKSIGKILSDLDDKIHLNNQINQTLE 183 Query: 193 ELLKEKKQALVSY---------IVTKGLNPDVKMKD-----SGIE--------------- 223 + + ++ G +P+ S E Sbjct: 184 SIAQAIFKSWFIDFEPVRAKIAAKQAGQDPERAAMCAISGKSEAELEQMAKEDFAELQAT 243 Query: 224 -----------WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 +G VP WEV + + + N Sbjct: 244 AALFPDELVESELGEVPRGWEVSTIGEQTQTVGGATPSTKNDEFWDKGNNHWTTPKDLSN 303 Query: 273 MGLKPESYETYQI-------VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + K +I + G + + + + A I Y+A+ P Sbjct: 304 LTDKILLNTDRKITDAGLKKISSGLLPKNTVLMSSRAPVGYLALAKIEVAINQGYIAILP 363 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + S ++ ++ Q + ++ + + P K + + Sbjct: 364 NMKYSAEYLIQWCEANMAEIKGRASGTTFQEISKKNFREISFFCPDDKV---VVSYTKTV 420 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 D + K + L R + + ++G+I Sbjct: 421 KTLYDEITSKA-KENQSLINLRDTLLPKLMSGEI 453 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 55/197 (27%), Gaps = 12/197 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDVESGTGKY---LPKD 67 +G +P+ W+V I T+ G T + D + +D+ + T K + Sbjct: 256 LGEVPRGWEVSTIGEQTQTVGGATPSTKNDEFWDKGNNHWTTPKDLSNLTDKILLNTDRK 315 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + + K +L P +A + + ++ + P Sbjct: 316 ITDAGLKKISSGLLPKNTVLMSSRAPV-GYLALAKIEVAINQGYIAILPNMKY-SAEYLI 373 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I+ G T K I P V + + I + E Sbjct: 374 QWCEANMAEIKGRASGTTFQEISKKNFREISFFCPDDKVVVSYTKTVKTLYDEITSKAKE 433 Query: 188 RIRFIELLKEKKQALVS 204 I L L+S Sbjct: 434 NQSLINLRDTLLPKLMS 450 >gi|83776732|gb|ABC46689.1| Sau1hsdS1 [Staphylococcus aureus] Length = 386 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 59/396 (14%), Positives = 134/396 (33%), Gaps = 42/396 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ ++ K+N+G+ + ++ G G + Sbjct: 20 EWEEKKLEDIIKVNSGKDYK-----------HLDKGDIPVYGTGGYMTSVSEP---LSEI 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + ++ T F K+ + + E Sbjct: 66 DAVGIGRKGTINKPYLLEAPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I I +P EQ I + ++D I + +EL +++K+ + Sbjct: 122 TGVPSLSKQTINKINRFVPTNKEQQKIGKF----FSKLDRQIELEEQKLELFQQQKKGYM 177 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG- 262 I ++ L + G WE K + + RKN L++S Sbjct: 178 QKIFSQEL--------RFKDESGNDYPDWEEKELGEVADRVIRKNKNFESKKPLTISGQL 229 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK-RSLRSAQVMERGIITSAYM 321 +I + E + + ++ E Y ++ GE + +++ + G+++S Y+ Sbjct: 230 GLIDQTEYFSKSVSSKNLENYTLIKNGEFAYNKSYSNGYPLGAIKRLTRYDSGVLSSLYI 289 Query: 322 AVKPHGIDSTYLA--WLMRSYDLCKVFYAMGSGLRQ----SLKFEDVKRLPVLVPPIKEQ 375 S + ++ +V G R ++ D + + P ++EQ Sbjct: 290 CFSIKSEMSKDFMEAYFDSTHWYREVSGIAVEGARNHGLLNISVNDFFTILIKYPSLEEQ 349 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 I + ++D +E EQ + LL++R+ + + Sbjct: 350 RKIGDF----FIKLDRQIELEEQKLELLQQRKKALL 381 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 61/166 (36%), Gaps = 19/166 (11%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRS-----LRSAQVMERGIITSAYMAVKPHGID 329 +K S + Y+ +D G+I S + + + +G I Y+ P Sbjct: 30 IKVNSGKDYKHLDKGDIPVYGTGGYMTSVSEPLSEIDAVGIGRKGTINKPYLLEAPFWTV 89 Query: 330 STYLA----------WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 T +++ + S SL + + ++ VP KEQ I Sbjct: 90 DTLFYCTPKKETDILFILSLFRKINWKVYDESTGVPSLSKQTINKINRFVPTNKEQQKIG 149 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +++D +E EQ + L ++++ ++ + ++ + ES Sbjct: 150 KF----FSKLDRQIELEEQKLELFQQQKKGYMQKIFSQELRFKDES 191 Score = 40.9 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 53/191 (27%), Gaps = 13/191 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + + K + I + +Y K +S+ + ++ Sbjct: 197 DWEEKELGEVADRVIRKNKNFESKKPLTISGQLGLIDQTEYFSKSVSSKNLE--NYTLIK 254 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G+ Y K G+ S+ ++ K + + R Sbjct: 255 NGEFAYNKSYSNGYPLGAIKRLTRYDSGVLSSLYICFSIKSEMSKDFMEAYFDSTHWYRE 314 Query: 138 EAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + I + P L EQ I + I +I+ + Sbjct: 315 VSGIAVEGARNHGLLNISVNDFFTILIKYPSLEEQRKIGDFFIKLDRQIELEEQKLELLQ 374 Query: 193 ELLKEKKQALV 203 + K ++++ Sbjct: 375 QRKKALLKSML 385 >gi|308062694|gb|ADO04582.1| type I R-M system specificity subunit [Helicobacter pylori Cuz20] Length = 303 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 47/332 (14%), Positives = 106/332 (31%), Gaps = 33/332 (9%) Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + I + F L P + + + L + + ++ + G+T Sbjct: 2 TSRASIGDCAILKVVATTNQGFQSLIPLEKINNE-FLYYLILTLKNKLLKLASGSTFLEV 60 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 I N+ +P+PPL EQ+ I + + L ++ + K L+S Sbjct: 61 SPNKIKNLLIPLPPLNEQIAIANILSDLDRYLYALDALILKKEGVKKALSFELLSQ---- 116 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 + + W+ + + + I N K Sbjct: 117 ------------RKRLKGFNQAWQRVRLGDIAEIVKGQQINKISL--------NNTDKYP 156 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 N G+ Y V I R + S + + + ++ Sbjct: 157 VINGGIDFLGYTNKFNVSKNTIAISEGGTCGYVRFMTSNFWSGGHNYS---LQKISNKVN 213 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + L +++SY+ + ++++ + +K +L+PP+ EQ I N+++ I Sbjct: 214 NLCLYHILKSYE-KDIMKLGVGSGLKNIQLKALKDFEILLPPLNEQIAIANILSALDNEI 272 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L K Q + + + ++ +I + Sbjct: 273 ASLKNKKRQ----FENIKKALNHDLMSAKIRV 300 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 40/113 (35%), Gaps = 4/113 (3%) Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + ++ P + + + K+ + +K L + +PP+ Sbjct: 17 ATTNQGFQSLIPLEKINNEFLYYLILTLKNKLLKLASGSTFLEVSPNKIKNLLIPLPPLN 76 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 EQ I N+++ + L I + + + + ++ + L+G +Q Sbjct: 77 EQIAIANILSDLDRYLYALDALILKK----EGVKKALSFELLSQRKRLKGFNQ 125 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 62/180 (34%), Gaps = 11/180 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ V + ++ G+ + T KY +G + +K Sbjct: 127 WQRVRLGDIAEIVKGQQINKIS----------LNNTDKYPVINGGIDFLGYTNKFNVSKN 176 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I + G + + + + + + L + + + I + G+ Sbjct: 177 TIAISEGGTCGYVRFMTSNFWSGGHNYSLQKISNKVNNLCL-YHILKSYEKDIMKLGVGS 235 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + K + + + +PPL EQ+ I + A I +L ++ +F + K L+S Sbjct: 236 GLKNIQLKALKDFEILLPPLNEQIAIANILSALDNEIASLKNKKRQFENIKKALNHDLMS 295 >gi|317177380|dbj|BAJ55169.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F16] Length = 413 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 46/391 (11%), Positives = 119/391 (30%), Gaps = 19/391 (4%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 PK + + + +T + + ++ G+ V + + + Sbjct: 13 PKGVEFRKLGEVCESTNKKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFNNDGEN--- 69 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQR 136 I G Y + V ++L + L +L + ++ Sbjct: 70 ------ITIASRGEYAGFINYFNEKFFAGGLCYPYKVKDTNELLTKFLYFYLKTNEIQIM 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G ++ + I + +PIPPL Q I + + A T L TE ++ K Sbjct: 124 ENLVSCG-SIPALNKADIETLTIPIPPLEIQQEIVKILDAFTELNTELNTELNTELKARK 182 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 ++ Q + ++ + + L + I Sbjct: 183 KQYQYYQNMLLDF---KGINQNHKDAKMSAKPYPKRLKTLLQTLAPKGVEFRKLGEVCEI 239 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + + L+ + ++ I + + ++ Sbjct: 240 IRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVNWQNQKFWA 299 Query: 317 TSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 +V P + YL +++ + + S + S+ ++ ++ + +PP++ Q Sbjct: 300 NDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITIPIPPLEIQ 359 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKER 406 +I +++ + L+ I I K++ Sbjct: 360 QEIVKILDQFSILTTDLLAGIPAEIEARKKQ 390 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 64/180 (35%), Gaps = 9/180 (5%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P E + + N+K K+ E + + + G + + Sbjct: 13 PKGVEFRKLGEVCESTNKKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFN------ND 66 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 GE + + + G + Y + + + +L + +++ ++ + Sbjct: 67 GENITIASRGEYAGFINYFNEKFFAGGLCYPYKVKDTNELLTKFLYFYLKTNEIQIMENL 126 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + G +L D++ L + +PP++ Q +I +++ T L ++ LK R+ Sbjct: 127 VSCGSIPALNKADIETLTIPIPPLEIQQEIVKILDAFTELNTELNTELNTE---LKARKK 183 >gi|84489295|ref|YP_447527.1| type I restriction-modification system subunit [Methanosphaera stadtmanae DSM 3091] gi|84372614|gb|ABC56884.1| predicted type I restriction-modification system subunit [Methanosphaera stadtmanae DSM 3091] Length = 393 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 56/402 (13%), Positives = 123/402 (30%), Gaps = 41/402 (10%) Query: 24 HWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W + + + K + I + ++ K S+ + Sbjct: 18 EWITYKLCDVVTRIIRKNKNLETKRPLTISAKYGLIDQIEFFDKYVASKNLK--GYYLLK 75 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQR 136 KG+ Y K G ST ++ + + + + + S + Sbjct: 76 KGEFAYNKSYSNGFPYGAVKRLDLYNQGAISTLYICFEITNKINSNFLKIYFDSNKWNKE 135 Query: 137 IEAICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I +H N P ++EQ I + + A +I + + + Sbjct: 136 MYKIAVEGARNHGLLNIPINDFFNTKHLFPSISEQEKIADFLSAIDKKIGFMEKKHTLYQ 195 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + K L S +KD I +G P K + Sbjct: 196 NIKKYYSHVLFSNTSDWN---KKNLKDIAIIKMGFTPST---------------KKEEYW 237 Query: 253 ESNILSLSYGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 NI L+ ++ K ++ + + +I+ +V F L+ Sbjct: 238 NGNIKWLAVSDMGSKYISKTKKHITKIAIGKKEIIKKDTLVMSFKLTIGKLGILKEDMYS 297 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 I K I++ ++ + + S +L K G+ +L E + +P+ +P Sbjct: 298 NEAICN---FQWKNKNINTEFMYYYLSSINLKKYGSQAAKGI--TLNKETLNMIPIRIPS 352 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + Q +I N+++ +++ L + I K + + Sbjct: 353 YETQINIVNILSNIDIKLEYL----SKKINYEKRYKKDLLQK 390 >gi|331090314|ref|ZP_08339198.1| hypothetical protein HMPREF1025_02781 [Lachnospiraceae bacterium 3_1_46FAA] gi|330401449|gb|EGG81034.1| hypothetical protein HMPREF1025_02781 [Lachnospiraceae bacterium 3_1_46FAA] Length = 363 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 55/365 (15%), Positives = 103/365 (28%), Gaps = 20/365 (5%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 VP+ +F K + R + +DI + + + +Y K+ D +T I +G Sbjct: 6 VPLGKFIKEYSERN-KGNEDIPVYSVTNSQGFCTEYFGKE--VASQDKTTYKIVPQGYFA 62 Query: 88 YGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAICEGA 144 Y + + I S + V + + + L D+ Q I+A G+ Sbjct: 63 YNPSRINVGSVDWQRYEKRVIVSPLYNVFSVSEGIDRQYLYYFLRSDLGRQMIKAKASGS 122 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + +P + +Q + I E + E + + Sbjct: 123 VRDNLKLDMLKEMTIPDISVEQQKFCSSVLDKLHKLIQMRQQELQKLDEF-------IKA 175 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 V + K + + K A L N+ + Sbjct: 176 RFVEMFGDVIHNSKKWQVCLFAEITSSRLGKMLDAKQQTGRNSYPYLANFNVQWFRF--- 232 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 LE N E + G+++ + + Sbjct: 233 --NLENLNKMDFDEKDRAEFELREGDLLVCEGGEIGRCAVWHNELQPCFFQKALHRVRCN 290 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 I YLAW R F A+ L +K+L V VPP++ Q + Sbjct: 291 HQIILPDYLAWWFRYNCDYGGFSALAGAKATIAHLPGAKLKQLQVAVPPMELQEQFAVFV 350 Query: 383 NVETA 387 Sbjct: 351 AQTDK 355 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 70/173 (40%), Gaps = 11/173 (6%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 P + E + +N + + S++ E + + TY+IV G + Sbjct: 7 PLGKFIKEYSERNKGNEDIPVYSVTNSQGFC-TEYFGKEVASQDKTTYKIVPQGYFAYNP 65 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAV-KPHGIDSTYLAWLMRSYDLCKVFYAMGSG-L 353 + S+ + +R I++ Y GID YL + +RS ++ A SG + Sbjct: 66 SRIN--VGSVDWQRYEKRVIVSPLYNVFSVSEGIDRQYLYYFLRSDLGRQMIKAKASGSV 123 Query: 354 RQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 R +LK + +K + + P I EQ + ++ L++ +Q + L E Sbjct: 124 RDNLKLDMLKEMTI--PDISVEQQK---FCSSVLDKLHKLIQMRQQELQKLDE 171 >gi|296119615|ref|ZP_06838173.1| type I restriction enzyme EcoprrI specificity protein [Corynebacterium ammoniagenes DSM 20306] gi|295967498|gb|EFG80765.1| type I restriction enzyme EcoprrI specificity protein [Corynebacterium ammoniagenes DSM 20306] Length = 371 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 51/398 (12%), Positives = 101/398 (25%), Gaps = 52/398 (13%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ P+ KL G + + G G + S Sbjct: 13 PEGVNFAPLNTVAKLKRGTSITKK-----------QVTEGDIPVVAGGRTAAYFHGESNR 61 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I+ G Y + S F V K L + + + ++ Sbjct: 62 EGETIVIAGSGAYAGYVSWWEGPIFVSDAFSVKPEKRFL-IPRYCYYWLTFQQEILHSLK 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + H K +G + +P+PPL Q +I E + +E + + + Sbjct: 121 SGGGVPHVYAKDVGKLRIPVPPLEIQHVIVEILDDFAHLESEHKAVLESELEARRTQYEY 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + +++ G + W K+ K + S Sbjct: 181 YRTMLLSSG-------------------EDWRWTTLGESFALKAGKSIKSDAISSRVTSD 221 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 +I G Q G T + Sbjct: 222 RHIPCFGGNGIRGFVESHSHNGQFPLIGR---------QGALCGNVNWAEGYFYATEHAI 272 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 PH + A+ M + + L +KR+P +P ++ Q + + Sbjct: 273 VATPHQTVNARWAYHM--LGFLDLNKYATKSAQPGLSVARLKRVPFPLPDLQIQRETAAI 330 Query: 382 INVETARIDVL-------VEKIEQSIVLLKERRSSFIA 412 ++ + I+ L + + R + Sbjct: 331 LDKFGSLINDLNSVLFSEIAARRKQYEY---YRDKLLT 365 >gi|21673510|ref|NP_661575.1| type I restriction system specificity protein [Chlorobium tepidum TLS] gi|21646618|gb|AAM71917.1| type I restriction system specificity protein [Chlorobium tepidum TLS] Length = 444 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 53/439 (12%), Positives = 132/439 (30%), Gaps = 54/439 (12%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTGKYLPKDGNSRQ 72 G +W+G + ++ G++ S + + IG+ + +G ++ P + Q Sbjct: 8 GSEWLGE-----E-------CEIVMGQSPPSETCNTVGIGIP-LLNGPTEFGPHHPSPAQ 54 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 T G IL+ G + AD + ++ K + Sbjct: 55 FTTDVRKRAIPGDILFCVRGSTTGRMNWADQEYAIGRGIAAIRHKFKPELQPFVRAVIEC 114 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + A G+T + + + N+ P EQ I + +I+ + Sbjct: 115 YLPELLAQATGSTFPNVSAQQLSNLKWPELAADEQRAIAYILGTLDDKIELNRKQNETLE 174 Query: 193 ELLKEKKQALVSYI--VTKGLNPDVKMKDSG----------------IEWVGLVPDHWEV 234 + + +A V L + S +G +P+ WE+ Sbjct: 175 AMARALFKAWFVDFEPVRAKLEGRWQRGQSLPGLPAHLYDLFPDCLVDSELGEIPEGWEI 234 Query: 235 KPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLE------TRNMGLKPESYETY 283 F +V + K +I S + + +++ + + Sbjct: 235 GSFADVVEIIGGSTPKTSVSEYWGGDIPWFSVVDTPASSDVFVVQTEKSITQSGLNESSA 294 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +++ G + + + A++ +Y + + + + Sbjct: 295 RLISKGTTIISARGTVGNLAIAGC-----DMTFNQSCYALRSKNSLGSYFVF-LSAQRMV 348 Query: 344 KVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + AM G S + + + + ++PP A + + Sbjct: 349 EQLKAMAHGSVFSTITRQTFEAVQTVLPPENVLQQF----ERSFASLFDEILNNVNESRT 404 Query: 403 LKERRSSFIAAAVTGQIDL 421 L + R + + ++G++ + Sbjct: 405 LAKLRDTLLPKLISGELRV 423 Score = 69.8 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 71/202 (35%), Gaps = 14/202 (6%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY-- 63 DS +G IP+ W++ ++ G T ++ G DI + + D + + + Sbjct: 222 DSE---LGEIPEGWEIGSFADVVEIIGGSTPKTSVSEYWGGDIPWFSVVDTPASSDVFVV 278 Query: 64 -LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 K + S+ + +KG + G IA D + L+ K+ L Sbjct: 279 QTEKSITQSGLNESSARLISKGTTIISARGTV-GNLAIAGCDMTFNQSCYALRSKNSL-G 336 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 +L + + ++++A+ G+ S + + +PP + I Sbjct: 337 SYFVFLSAQRMVEQLKAMAHGSVFSTITRQTFEAVQTVLPPENVLQQFERSFASLFDEIL 396 Query: 183 TLITERIRFIELLKEKKQALVS 204 + E +L L+S Sbjct: 397 NNVNESRTLAKLRDTLLPKLIS 418 >gi|254520682|ref|ZP_05132738.1| conserved hypothetical protein [Clostridium sp. 7_2_43FAA] gi|226914431|gb|EEH99632.1| conserved hypothetical protein [Clostridium sp. 7_2_43FAA] Length = 405 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 60/383 (15%), Positives = 127/383 (33%), Gaps = 31/383 (8%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII---A 101 KD IY + G G + + + V + + + R Sbjct: 37 EKDKIYKQIGIRSHGKGIFYKDEVLGEELGNKRVFWIEPNVFIVNIVFAWERAVARTTEK 96 Query: 102 DFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEAICEGATMSH-ADWKGIGNI 157 + I S +F + +PK L + + I A GA + K N+ Sbjct: 97 EVGMIVSHRFPMYKPKQQKLNLDYITYFFKTKIGQNLLELASPGGAGRNKTLGQKEFDNL 156 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 + IP L EQ I + I+ + K Q + + + Sbjct: 157 KLKIPSLEEQEKIANFLSNVDKIIEEQEGKVKDLELYKKGMMQKIFKQEIRFKDD----- 211 Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 G WE K ++TE KNT ++ +++ G +I ++E Sbjct: 212 -------NGQDYPEWEEKKLSEVLTETKAKNTGDLKVCSVAVKKG-VIDQIEHLGRSFAA 263 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRS-AQVMERGIITSAYMAVKPHGI-----DST 331 + Y++V G++++ + + + E I++ Y +P + Sbjct: 264 KDTSNYKLVKKGDLIYTKSPTGKFPYGIVKQSFLDEDVIVSPLYGVFEPMNYFLGYILHS 323 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARID 390 Y + + + G+ ++ E + +P ++EQ I N + + ID Sbjct: 324 YFYYKENTNNYLHSIVQKGAKNTINISNETFLSKKIRLPINLEEQTKIANFL----SNID 379 Query: 391 VLVEKIEQSIVLLKERRSSFIAA 413 ++E+ + + L++ + + Sbjct: 380 KILEEENKKLEDLRQWKKGLLQQ 402 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 62/177 (35%), Gaps = 9/177 (5%) Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + + + ++ L E + I ++ R+ + Sbjct: 38 KDKIYKQIGIRSHGKGIFYKDEVLGEELGNKRVFWIEPNVFIVNIVFAWERAVARTTEKE 97 Query: 312 ERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLP 366 I++ + KP + Y+ + ++ + G ++L ++ L Sbjct: 98 VGMIVSHRFPMYKPKQQKLNLDYITYFFKTKIGQNLLELASPGGAGRNKTLGQKEFDNLK 157 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + +P ++EQ I N + + +D ++E+ E + L+ + + +I + Sbjct: 158 LKIPSLEEQEKIANFL----SNVDKIIEEQEGKVKDLELYKKGMMQKIFKQEIRFKD 210 >gi|16273200|ref|NP_439438.1| type I restriction/modification specificity protein [Haemophilus influenzae Rd KW20] gi|260581408|ref|ZP_05849222.1| type I restriction/modification specificity protein [Haemophilus influenzae RdAW] gi|1175603|sp|P44152|T1SH_HAEIN RecName: Full=Putative type-1 restriction enzyme HindVIIP specificity protein; Short=S.HindVIIP; AltName: Full=Type I restriction enzyme HindVIIP specificity protein; Short=S protein gi|1574744|gb|AAC22935.1| type I restriction/modification specificity protein (hsdS) [Haemophilus influenzae Rd KW20] gi|260091950|gb|EEW75899.1| type I restriction/modification specificity protein [Haemophilus influenzae RdAW] Length = 459 Score = 90.6 bits (223), Expect = 5e-16, Method: Composition-based stats. Identities = 66/471 (14%), Positives = 150/471 (31%), Gaps = 87/471 (18%) Query: 23 KHWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 WK + + ++ + ++++I DV + + N + Sbjct: 2 SDWKEYSLGDISRNISRRFDFNAYPNVVFINTGDVLNNKFLHCEI-SNVKDLPGQAKKAI 60 Query: 82 AKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQR 136 KG ILY ++ P + + D + + ST+F+V++P + PE L L+S + T+ Sbjct: 61 KKGDILYSEIRPGNGRYLFVDNDLDNYVVSTKFMVIEPNANIVLPEFLFLLLISNETTEY 120 Query: 137 IEAI--CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I T + + ++ + IP Q I + I +I+ ++ Sbjct: 121 FKMIAESRSGTFPQITFDSVSSLSLNIPDKETQQKILDIITPLDDKIELNTQINQTLEQI 180 Query: 195 LKEKKQALV---------SYIVTKGL---------------------------------- 211 + ++ + ++ G+ Sbjct: 181 AQALFKSWFVDFDPVRAKAQALSDGMSLEQAELAAMQAISGKTPEELTALSQTQPDRYAE 240 Query: 212 -------NPDVKMKDSGIEWVG-LVPDHWEVKPFFALVTELNRK-----NTKLIESNILS 258 P ++ G+E G VP WE+K L + K N + ++ Sbjct: 241 LAETAKAFPCEMVEVDGVEVDGVEVPRGWEMKALSDLGQIICGKTPSKSNKEFYGDDVPF 300 Query: 259 LSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + ++ ++ T N+ + +Y++ + + I I + Sbjct: 301 IKIPDMHNQVFITQTTDNLSVVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQ 360 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ--SLKFEDVKRLPVLVPPI 372 I + + +L ++ + K + SG +L ++ ++ P Sbjct: 361 INS----IIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATLNLNTSTFSKIEIITPSK 416 Query: 373 KE----QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + Q + ++ L IE L E R + + G+I Sbjct: 417 EIIYIFQKKVVSIFEK------TLSNSIENK--RLTEIRDLLLPRLLNGEI 459 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 49/140 (35%), Gaps = 8/140 (5%) Query: 14 GVQWIG-AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESG-TGKYLP 65 GV+ G +P+ W++ + ++ G+T G D+ +I + D+ + Sbjct: 257 GVEVDGVEVPRGWEMKALSDLGQIICGKTPSKSNKEFYGDDVPFIKIPDMHNQVFITQTT 316 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + ++ + I + ++ + ++ + E L Sbjct: 317 DNLSVVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQINSIIPDDEQSCEFLY 376 Query: 126 GWLLSIDVTQRIEAICEGAT 145 L +T+ ++ + G T Sbjct: 377 LSLKQPSMTKYLKDLASGGT 396 >gi|322392313|ref|ZP_08065774.1| type I restriction-modification system specificity subunit [Streptococcus peroris ATCC 700780] gi|321144848|gb|EFX40248.1| type I restriction-modification system specificity subunit [Streptococcus peroris ATCC 700780] Length = 384 Score = 90.6 bits (223), Expect = 5e-16, Method: Composition-based stats. Identities = 45/390 (11%), Positives = 116/390 (29%), Gaps = 27/390 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W + + G++ +S + G R T + KG Sbjct: 18 WGNTKLTEKAPIIMGQSPDSKNYTDNPNDYILVQGNADMKNGRVFPRVWTTQVTKLAEKG 77 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ P D+ + ++ D L L + + G+ Sbjct: 78 DLILSVRAPV-GDIGKTDYTVVLGRGVAAIKGNDFL----FYLLSKMKQSNYWARFSTGS 132 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T + I + IP EQ I + + L K + Sbjct: 133 TFESINSGDIRFAEIMIPSPEEQSAIGSLFRNLDDLLACYKDNLANYQSLKATKLSKMFP 192 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 P++++ EW ++ + + + K+ ++ + Sbjct: 193 KAGQT--VPEIRLDGFEGEW-----ENKILSEVTNITMGQSPKSENYTDNPNDYILVQGN 245 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 ++ + + + + E + + G+I+ D V+ RG+ Sbjct: 246 AD-IKDKQVVPRLWTTEVTKTAEIGDIILTVRAPVGDIGKTDYNVVIGRGVAA------- 297 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + ++ + + + + + +G +S+ D+K + +P ++EQ I Sbjct: 298 --IKGNDFIFYTLEKMKMTGFWNRLSTGSTFESISSNDIKEAIIQIPTLEEQQAIGTY-- 353 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ I ++ + + Sbjct: 354 --FSNLDNLINSHQEKITQIETLKKKLLQD 381 >gi|298502302|ref|YP_003724242.1| type I site-specific deoxyribonuclease [Streptococcus pneumoniae TCH8431/19A] gi|298237897|gb|ADI69028.1| possible type I site-specific deoxyribonuclease [Streptococcus pneumoniae TCH8431/19A] Length = 426 Score = 90.6 bits (223), Expect = 5e-16, Method: Composition-based stats. Identities = 67/415 (16%), Positives = 138/415 (33%), Gaps = 64/415 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDS---------------------------------------GIEWVGLVPDHW 232 +S E +P+ W Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYEEVPCEIPESW 251 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQI 285 E + + + R + + + + ++ L SY+ ++ Sbjct: 252 EWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERL 311 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDSTYLAWLMRSY 340 + G++++ L R + A + V I+ ++ + S Sbjct: 312 LRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIYNFLSSP 371 Query: 341 DLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + V SG ++ L + +K + +PP+ EQ I + I A ID L+ Sbjct: 372 IVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDALI 426 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 246 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 305 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 306 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIY 365 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 366 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 425 Query: 185 I 185 I Sbjct: 426 I 426 >gi|28868302|ref|NP_790921.1| type I restriction-modification system, S subunit [Pseudomonas syringae pv. tomato str. DC3000] gi|28851539|gb|AAO54616.1| type I restriction-modification system, S subunit [Pseudomonas syringae pv. tomato str. DC3000] Length = 422 Score = 90.2 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 71/436 (16%), Positives = 140/436 (32%), Gaps = 56/436 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + F + G L D G TV Sbjct: 3 SEWREITFGDFVAIQRGHD-----------LPDQNRKLGSVPILGSFGITGYHDTVKAKG 51 Query: 83 KGQILYGKLGPYLRKAIIAD-FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G + G+ G A D +T V K P+ + ++ D Sbjct: 52 PG-VTIGRSGASFGVAAYTDQDYWPLNTALYVTDFKGNHPKFVFYFMRVFDF----SGFN 106 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ + + + + +P EQ+ I + + A RI L+ + + ++ Sbjct: 107 SGSAQPSLNRNNLYPVSIRVPQPNEQMAISKLLAALDDRIALLVETNTTLESIAQALFKS 166 Query: 202 LVS-----YIVTKGLNPDVKMKDS--------GIEWVGLVPDHWEVKPFFALVTELNRKN 248 GL P+ + +GLVP W ++ + + K+ Sbjct: 167 WFVDFDPVRAKVAGLEPEGMDAATAALFPDNFEESELGLVPTGWIIESIANVAEVVKGKS 226 Query: 249 TKLIE----SNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQ--- 299 K E + ++ + + R G KP SY+ Q+V PG+++ + D+ Sbjct: 227 YKSTELAESHHTALVTLKSFSRGGGFRLDGFKPYTGSYKQTQVVVPGDLIIAYTDVTQAA 286 Query: 300 --NDKRSLRSAQVMERGIITS---AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL- 353 K ++ + ++ S + + YL L R+ +A SG Sbjct: 287 ELIGKPAIVVGVEDYQTLVASLDVGIVRTNNPRVSRQYLYGLFRTELFQSHTFAHTSGTT 346 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL---LKERRSSF 410 L + V P ++ + T + L E+ + +I L + R + Sbjct: 347 VLHLAKDGVGSYKFACPS----QELVQCFSAVT---ETLSERCQNNIDQMRTLTQLRDTL 399 Query: 411 IAAAVTGQIDLRGESQ 426 + ++GQ+ L E++ Sbjct: 400 LPRLISGQLRL-PEAE 414 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 53/204 (25%), Gaps = 18/204 (8%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQ 72 +G +P W + I ++ G++ +S + + L+ G G + Sbjct: 203 LGLVPTGWIIESIANVAEVVKGKSYKSTELAESHHTALVTLKSFSRGGG-FRLDGFKPYT 261 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS------------TQFLVLQPKDVL 120 + G ++ +I + + V Sbjct: 262 GSYKQTQVVVPGDLIIAYTDVTQAAELIGKPAIVVGVEDYQTLVASLDVGIVRTNNPRVS 321 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + L G + A G T+ H G+G+ P + R Sbjct: 322 RQYLYGLFRTELFQSHTFAHTSGTTVLHLAKDGVGSYKFACPSQELVQCFSAVTETLSER 381 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 I + +L L+S Sbjct: 382 CQNNIDQMRTLTQLRDTLLPRLIS 405 >gi|145222996|ref|YP_001133674.1| restriction modification system DNA specificity subunit [Mycobacterium gilvum PYR-GCK] gi|145215482|gb|ABP44886.1| restriction modification system DNA specificity domain [Mycobacterium gilvum PYR-GCK] Length = 442 Score = 90.2 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 62/426 (14%), Positives = 136/426 (31%), Gaps = 31/426 (7%) Query: 24 HWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W +V + ++ G + + ++ + +V + Sbjct: 2 SWPLVALADVAEIQGGIQKQPKRTARDNAFPFLRVANVTARGLALDEVHTIELFDGELER 61 Query: 79 SIFAKGQILY---GKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSID 132 +G +L + +A + D D + + ++P + G L + Sbjct: 62 YRLLRGDLLVVEGNGSASQIGRAAVWDGSITDAVHQNHLIRVRPGFQIDPRFLGHLWNSP 121 Query: 133 VTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + + +T + + I +P+P L EQ I + + R+D +E R Sbjct: 122 LIRDELSRVASSTSGLHTLSVTKLKRITLPLPSLTEQRRIVDLLEDHLSRLDAGRSEVER 181 Query: 191 FIEL-----LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL----- 240 + QAL + + + + +P W + Sbjct: 182 AAAKLAILRERTVIQALTGGAEANREDARLTDVSTADGDLSALPIGWSWSRLGDVADVVG 241 Query: 241 -VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK---PESYETYQIVDPGEIVFR-F 295 VT+ ++K + + L N+ + + K P+S + PG+++ Sbjct: 242 GVTKDSKKQSDPNYVEVPYLRVANVQRGRLNLDEVTKIRVPQSKADALRLRPGDVLLNEG 301 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 D R + I + + ID +L+W + + Sbjct: 302 GDRDKLARGWVWEGQVPDCIHQNHVFRARITDPRIDPYFLSWTANTIGGRWAERNGKQSV 361 Query: 354 R-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 S+ ++R+PV+VPP E I + + D L + I + + S + Sbjct: 362 NLASISLSMIRRMPVIVPPPGEAVRIATELRDSRSDFDRLEKSIRDGMDRALVLKKSLLT 421 Query: 413 AAVTGQ 418 AA +G+ Sbjct: 422 AAFSGR 427 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 59/206 (28%), Gaps = 14/206 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W + + G T +S K ++ Y+ + +V+ G Sbjct: 224 LPIGWSWSRLGDVADVVGGVTKDSKKQSDPNYVEVPYLRVANVQRGRLNLDEVTKIRVPQ 283 Query: 74 DTSTVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQFL---VLQPKDVLPELLQG 126 + G +L + G + + + P L Sbjct: 284 SKADALRLRPGDVLLNEGGDRDKLARGWVWEGQVPDCIHQNHVFRARITDPRIDPYFLSW 343 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +I + ++ I +P+ +PP E V I ++ D L Sbjct: 344 TANTIGGRWAERNGKQSVNLASISLSMIRRMPVIVPPPGEAVRIATELRDSRSDFDRLEK 403 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLN 212 ++ K++L++ + L Sbjct: 404 SIRDGMDRALVLKKSLLTAAFSGRLT 429 >gi|281358282|ref|ZP_06244765.1| restriction modification system DNA specificity domain protein [Victivallis vadensis ATCC BAA-548] gi|281315372|gb|EFA99402.1| restriction modification system DNA specificity domain protein [Victivallis vadensis ATCC BAA-548] Length = 375 Score = 90.2 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 61/395 (15%), Positives = 126/395 (31%), Gaps = 34/395 (8%) Query: 29 PIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + + + Y E++ G G Q V+ +L Sbjct: 8 KLSDYADYSKAKISIAEIDTKCYFSTENMLPNKGGVTEAAGLPTQD---NVTKVLPENVL 64 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKD-VLPELLQGWLLSIDVTQRIEAICEGATM 146 + PY +K A+ S L K+ LP L L S + A +G M Sbjct: 65 VSNIRPYFKKIYFANELAGASNDVLCFVAKNGCLPRYLYYLLSSDSFFDYMMAGAKGTKM 124 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 D I N P+ +P EQ I + A +I+ + E K ++ + Sbjct: 125 PRGDKGQIMNFPVWVPAQNEQSRIVSVLSALDEKIENISKINHNLEEQAKAIFKS---WF 181 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + D + DS +G +P W+V ++ K E L+ + Sbjct: 182 IDFEPFRDGEFVDS---ELGQIPAGWQVGTLKDMLEVRYGK-----EHKKLADGAIPVYG 233 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K ++ + + + E + + + +V Sbjct: 234 SGGLMRHVEKALYNGESVLIPRKGTLNNVMRVTG-----------EFWTVDTMFYSVPRK 282 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + YL ++ DL S+ + + + +++PP + + T Sbjct: 283 TGAAKYLYHILSKLDLT---SMNSGSAVPSMTTDILNAIKIILPP----DKVLKDFDYLT 335 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + +E + + L + R + + ++G+ID+ Sbjct: 336 SFFWESIETKKMEMQKLAQLRDALLPELMSGEIDV 370 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 50/181 (27%), Gaps = 4/181 (2%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + +++ K + S N++ + + V P Sbjct: 2 KNNQLEKLSDYADYSKAKISIAEIDTKCYFSTENMLPNKGGVTEAAGLPTQDNVTKVLPE 61 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ I K + G V +G YL +L+ S A Sbjct: 62 NVLVSNIRPYFKKIYFANELA---GASNDVLCFVAKNGCLPRYLYYLLSSDSFFDYMMAG 118 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G + PV VP EQ I +V++ +I+ + + K Sbjct: 119 AKGTKMPRGDKGQIMNFPVWVPAQNEQSRIVSVLSALDEKIENISKINHNLEEQAKAIFK 178 Query: 409 S 409 S Sbjct: 179 S 179 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 69/201 (34%), Gaps = 25/201 (12%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS +G IP W+V +K ++ G+ + D +P G+ Sbjct: 194 DSE---LGQIPAGWQVGTLKDMLEVRYGKEHKKLADGA--------------IPVYGSGG 236 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +++ +L + G + T F + K + L L + Sbjct: 237 LMRHVEKALYNGESVLIPRKGTLNNVMRVTGEFWTVDTMFYSVPRKTGAAKYLYHILSKL 296 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 D+T ++ G+ + + I + +PP + + T I + Sbjct: 297 DLT----SMNSGSAVPSMTTDILNAIKIILPP----DKVLKDFDYLTSFFWESIETKKME 348 Query: 192 IELLKEKKQALVSYIVTKGLN 212 ++ L + + AL+ +++ ++ Sbjct: 349 MQKLAQLRDALLPELMSGEID 369 >gi|108800742|ref|YP_640939.1| restriction endonuclease S subunits-like protein [Mycobacterium sp. MCS] gi|119869881|ref|YP_939833.1| restriction endonuclease S subunits-like protein [Mycobacterium sp. KMS] gi|108771161|gb|ABG09883.1| Restriction endonuclease S subunits-like protein [Mycobacterium sp. MCS] gi|119695970|gb|ABL93043.1| restriction endonuclease S subunits-like protein [Mycobacterium sp. KMS] Length = 419 Score = 90.2 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 66/420 (15%), Positives = 139/420 (33%), Gaps = 42/420 (10%) Query: 24 HW-KVVPIKRFTK---LNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W + V + + + G + ++ D+ L DV G + R Sbjct: 2 SWAQEVTLAELAEGGLFSDGDWVESKDQDASGDVRLTQLADVGVGEFRDRSDRWMRRDQA 61 Query: 75 TSTVSIFAKGQ-ILYGKL-GPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWL 128 F +G +L ++ P R ++ G + L L +D P + L Sbjct: 62 HRLRCTFLEGDDVLIARMPDPIGRSCLVPSSVGSAVTVVDVAILRLARRDANPRYVMWAL 121 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S ++ A+ G T K + ++ +P+P L EQ I + + R+D + Sbjct: 122 NSPRFHSKVVALQSGTTRKRISRKNLASLTIPLPTLDEQNRIVDLLEDHLSRLDAAESSL 181 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 ++ A + T G + + + Sbjct: 182 RLAMQKADAMTTASLDRQTTAGSRAWRDTTIGAMAELVEYGSSAKC-------------A 228 Query: 249 TKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSL 305 + +S++ L GNI K+ + P + + ++ G++VF + Sbjct: 229 GQAADSDVPVLRMGNIQNGKINWTGLKYLPAGHAEFPKLLLQSGDLVFNRTNSAELVGKS 288 Query: 306 RSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDV 362 + S + V+ ++ + ++ S + ++ S + ++ + Sbjct: 289 AVFEDTRAASFASYLIRVRFGQEVNPAWANMVINSPAGRRYVKSVASQQVGQANVNGTKL 348 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL----KERRSSFIAAAVTGQ 418 K P+ +PP+ EQ + V E++ I L R + +AAA TG+ Sbjct: 349 KAFPLPLPPLDEQCRRVRAHDEVV----VSRERLHHQIADLVVRAAGLRRALLAAAFTGR 404 >gi|257088126|ref|ZP_05582487.1| type I restriction-modification system specificity subunit protein [Enterococcus faecalis D6] gi|256996156|gb|EEU83458.1| type I restriction-modification system specificity subunit protein [Enterococcus faecalis D6] Length = 380 Score = 90.2 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 53/393 (13%), Positives = 112/393 (28%), Gaps = 34/393 (8%) Query: 25 WKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + + + DI + + + ++ ++ Sbjct: 13 WEQCKLGDLGSVAMNKRIFKEQTSESGDIPFYKIGTFGATADAFISRELFET--YKKKYP 70 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G +L G R D +V D L V Sbjct: 71 YPKIGDLLISASGSIGRVVEYKGNDEYFQDSNIVWLKHDDRINNLFLKQFYSIVKWHGL- 129 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG+T+ K I + +P EQ EKI ++D +IT R +E LKE K Sbjct: 130 --EGSTIKRLYNKNILETTIHLPVFDEQ----EKIGTLFKQLDDIITLHQRKLEQLKELK 183 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 +A + + K++ + E + + + + + + Sbjct: 184 KAYLQLMFPTKEERVPKLRFADFEGEWELCKLIGILDIIKGTQKSKSELSTNQNNCTPYP 243 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 Y I N+ + + + V E+ Sbjct: 244 VYNGGINPSGYTNIYNREN---------------AITISEGGNSAGFVNFVQEKFFSGGH 288 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + D+ +L + + S ++ +++ + L + EQ I Sbjct: 289 NYTIVNNVTDTLFLFFYLCSIQ-EEIMRLRVGTGLPNIQKPTLMNLEIQKTTDNEQKFIG 347 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + ID+L+ + + LK + S++ Sbjct: 348 LFL----KNIDILITLTQNKLNQLKSLKKSYLQ 376 >gi|30250445|ref|NP_842515.1| restriction modification system, type I [Nitrosomonas europaea ATCC 19718] gi|30139286|emb|CAD86438.1| Restriction modification system, type I [Nitrosomonas europaea ATCC 19718] Length = 396 Score = 90.2 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 56/404 (13%), Positives = 133/404 (32%), Gaps = 40/404 (9%) Query: 24 HWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-F 81 W+ V + + E+ YI + +++ + + F Sbjct: 9 GWRRVKFGDVVRQCKEKADPETSGLERYIAGDHMDTDDLRLRRWGEIGSGYLGPAFHMRF 68 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWLLSIDVTQRIE 138 GQ+LYG YLRK +ADF+GIC+ V P ++LPE L + + Sbjct: 69 KPGQVLYGSRRTYLRKVAVADFEGICANTTFVLEPHNPNELLPEFLPFLMQTEAFNDFSV 128 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +G+ + ++ + +PP+ EQ + A T + + +L+ Sbjct: 129 KNSKGSVNPYINFSDLAKFEFVLPPIDEQQSAIALLSAATDQCHAVEAAHRAAGRMLQSF 188 Query: 199 KQALVSYIVTKGLNPDVKMKDSGI--EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 K +L+ L + +S + + + P+ P + Sbjct: 189 KDSLL-------LRKTSSLANSFLLGDLLLRSPESGCSAP----------PKDADTGYFV 231 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 L L+ + + ++P S + G+++ + + + + Sbjct: 232 LGLAALSRDGYVSGDFKPVEPTSKMVAAKLSMGDMLISRSNTVDRVGFVGIFSDNRDDVS 291 Query: 317 TSAYMA---VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVP 370 M P + +L L+++ + + +G + + ++ ++ + VP Sbjct: 292 FPDTMMRLQPNPALVHPHFLEALLQTTSAREFLMRIAAGTSASMKKINRANLLQMRLNVP 351 Query: 371 PIKEQFDITNVINVETARIDVLVEKIE------QSIVLLKERRS 408 + Q ++ + + + + + L R+ Sbjct: 352 DLDVQEM---ALDEL-QQFKNAIATQKARWDAARQLTRLIAMRT 391 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 52/132 (39%), Gaps = 6/132 (4%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSY 340 + PG++++ K ++ + + + ++ P+ + +L +LM++ Sbjct: 65 HMRFKPGQVLYGSRRTYLRKVAVADFEGI---CANTTFVLEPHNPNELLPEFLPFLMQTE 121 Query: 341 DLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 G + + F D+ + ++PPI EQ +++ T + + + Sbjct: 122 AFNDFSVKNSKGSVNPYINFSDLAKFEFVLPPIDEQQSAIALLSAATDQCHAVEAAHRAA 181 Query: 400 IVLLKERRSSFI 411 +L+ + S + Sbjct: 182 GRMLQSFKDSLL 193 >gi|257417158|ref|ZP_05594152.1| restriction endonuclease S subunit [Enterococcus faecalis AR01/DG] gi|257158986|gb|EEU88946.1| restriction endonuclease S subunit [Enterococcus faecalis ARO1/DG] Length = 367 Score = 90.2 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 56/392 (14%), Positives = 124/392 (31%), Gaps = 48/392 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIF 81 + W++ ++ + +GR D ++G ++ GTG Y+ + D Sbjct: 18 EDWELCKLEEIVDVRSGR------DYKHLGSGNIPVYGTGGYMLSVSEALSYDEDA---- 67 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G+ G I+ T F + + + ++ Sbjct: 68 ----IGIGRKGTINNPYILKAPFWTVDTLFYTVPKNNFDLNFIYSIFR----KTNWKSKD 119 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 E + I + + IP +EQ I + ++D IT R ++ LKE K+A Sbjct: 120 ESTGVPSLSKTTINAVTVYIPSGSEQQRIGKF----FKQLDDTITLHQRKLDQLKELKKA 175 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + + K++ + E + WE++ + K Sbjct: 176 YLQLMFPVKDERVPKLRFADFE------EEWELRKLGDITKISTGKLDANAM-------- 221 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 +E + Y+I P N + Sbjct: 222 ------VENGKYDFYTSGIKKYRIDVPAFEGPAITIAGNGATVGYMHLADNKFNAYQRTY 275 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITN 380 ++ +D ++L + + K+ +G + + + L + +P EQ I + Sbjct: 276 VLQKFVVDRSFLFSEVGNKLPKKINQEARTGNIPYIVMDMLTELKLSIPQDEAEQSKIGS 335 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +ID + ++ + LK+ ++S++ Sbjct: 336 F----FKQIDKTIALHQKKLEQLKDLKTSYLQ 363 >gi|88812209|ref|ZP_01127460.1| type I restriction-modification system, S subunit [Nitrococcus mobilis Nb-231] gi|88790460|gb|EAR21576.1| type I restriction-modification system, S subunit [Nitrococcus mobilis Nb-231] Length = 577 Score = 90.2 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 65/458 (14%), Positives = 133/458 (29%), Gaps = 84/458 (18%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQS--D 74 +P W+ + L T T + K + +I ++D+ G ++ S++ Sbjct: 102 LPPRWRWSRLGGLALLVTDGTHHTPQYVAKGVPFISVKDISGGQLRFSDTKFISQEEHQT 161 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ + IL ++G + I+ F S + + + L S Sbjct: 162 ISSRCNPERNDILLCRIGTLGKPVIVDTDQPFSLFVSVGLIKTPKSTPITRWTKLVLESP 221 Query: 132 DVTQRIEAICEGATM-SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT------- 183 + + EAI G + + + I + +P+PPLA Q I K+ D Sbjct: 222 LMLGQYEAIKAGGSHTNKLNLGDIPKLMVPLPPLAGQARIVAKVDELMALCDRLEAQQAD 281 Query: 184 ----------------------------------LITERIRFIELLKEKKQALVSYIVTK 209 + KQ L+ V Sbjct: 282 TEAAHTTLVKTLLDTLTQSRSAEDFAANWQRLSAHFDTLFTTEPSIDTLKQTLLQLAVMG 341 Query: 210 GLNPDVKM---------------------KDSGIEWVGLVPDHWEVKPFFALVTE----- 243 L P + + E + +P WE L Sbjct: 342 KLVPQDPSDGPASELLKRLRGRNGNRQVGRRTNEEALPALPAGWECVSVGDLGPIAGGAT 401 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP---ESYETYQIVDPGEIVFRFIDLQN 300 N+ + L I +S ++ + + + +++ G ++ + Sbjct: 402 PNKGDASLWSGTIPWVSPKDMKRSYINDAVDHVSAVAIEKTSLKLIPAGSLLLVVRGMIL 461 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLK 358 S A I A+ + ++ + ++ + ++ G LK Sbjct: 462 -AHSFPVAISQVPLCINQDMKAISLLPEMAEFVLYALQGLKPHILQLIERSSHGT-CKLK 519 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 E + P +PP+ EQ I ++ D L ++ Sbjct: 520 SETLFGHPFPLPPLAEQHRIVAKVDELMVLCDRLKARL 557 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 61/198 (30%), Gaps = 9/198 (4%) Query: 16 QWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGN 69 + + A+P W+ V + + G T G I ++ +D++ + Sbjct: 376 EALPALPAGWECVSVGDLGPIAGGATPNKGDASLWSGTIPWVSPKDMKRSYINDAVDHVS 435 Query: 70 SRQSDTSTVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + + +++ + G +L G + I+ + + + E + Sbjct: 436 AVAIEKTSLKLIPAGSLLLVVRGMILAHSFPVAISQVPLCINQDMKAISLLPEMAEFVLY 495 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L + + + P P+PPLAEQ I K+ V D L Sbjct: 496 ALQGLKPHILQLIERSSHGTCKLKSETLFGHPFPLPPLAEQHRIVAKVDELMVLCDRLKA 555 Query: 187 ERIRFIELLKEKKQALVS 204 + AL Sbjct: 556 RLAHCRIVHGRLADALAQ 573 >gi|291485259|dbj|BAI86334.1| hypothetical protein BSNT_04127 [Bacillus subtilis subsp. natto BEST195] Length = 439 Score = 90.2 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 71/420 (16%), Positives = 141/420 (33%), Gaps = 50/420 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P++W V + G K + E+ G +P G + Q Sbjct: 25 ELPENWIWVKL------LNGYAVCLDKYRKPVNAEERAKRVGN-IPYYGATGQVGWIDDY 77 Query: 80 IFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + +L G+ G P+ KA I + +L+ L Sbjct: 78 LTDEELVLLGEDGVPFLEPFKNKAYIIREKAWVNNHAHILRSNFGSEGNLFLLHYLNQF- 136 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 G T K + IP+P+PPL EQ I EK+ +I+ E Sbjct: 137 -NFNGYVSGTTRLKLTQKKMAIIPVPLPPLNEQKRIAEKVERLLSKIEEAKQLIEEAKET 195 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-----NT 249 + ++ +++ I+ + L+ G +P W L T Sbjct: 196 FELRRASIIRTILKEELS------------NGKLPTGWRNIKVKDLFTIFGGGTPSKAKE 243 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLR 306 + I +S ++ ++ M E + ++ G + R+L Sbjct: 244 EYWNGRIPWISAKDMKTTFISKTMDYITEEGLNNSSAKLAKRGSVAMVVRSGILQ-RTLP 302 Query: 307 SAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 A ++ + I+ +L ++ + Y+ S++FE K Sbjct: 303 VAFLLSECTVNQDLKVFDSGDELINKYFLWYVKGNERNLLHNYSKSGTTVNSIEFEKFKS 362 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK------ERRSSFIAAAVTGQ 418 +L+PP+ V+ + +I+ ++EK + + V+L E +SS ++ A G+ Sbjct: 363 HEILLPPMD-------VLKQKIDKIENVIEKEKSANVMLNLANSIDELKSSILSKAFRGE 415 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 73/191 (38%), Gaps = 9/191 (4%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E +P++W K K + + + GNI T +G + Sbjct: 21 EQPYELPENWIWVKLLNGYAVCLDKYRKPVNAEERAKRVGNIPYYGATGQVGWIDDYLTD 80 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSY 340 ++V GE F++ +K + + E+ + + ++ + + +L + + Sbjct: 81 EELVLLGEDGVPFLEPFKNKAYI----IREKAWVNNHAHILRSNFGSEGNLFLLHYLNQF 136 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + R L + + +PV +PP+ EQ I + ++I+ + IE++ Sbjct: 137 NFNGYV---SGTTRLKLTQKKMAIIPVPLPPLNEQKRIAEKVERLLSKIEEAKQLIEEAK 193 Query: 401 VLLKERRSSFI 411 + RR+S I Sbjct: 194 ETFELRRASII 204 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 25/219 (11%), Positives = 74/219 (33%), Gaps = 11/219 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ 72 G +P W+ + +K + G T K+ I +I +D+++ Sbjct: 215 GKLPTGWRNIKVKDLFTIFGGGTPSKAKEEYWNGRIPWISAKDMKTTFISKTMDYITEEG 274 Query: 73 SDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQP-KDVLPELLQGWL 128 + S+ + +G + L+ + + V +++ + ++ Sbjct: 275 LNNSSAKLAKRGSVAMVVRSGILQRTLPVAFLLSECTVNQDLKVFDSGDELINKYFLWYV 334 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + G T++ +++ + + +PP+ +KI + + Sbjct: 335 KGNERNLLHNYSKSGTTVNSIEFEKFKSHEILLPPMDVLKQKIDKIENVIEK-EKSANVM 393 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 + + E K +++S L + +++ +E + Sbjct: 394 LNLANSIDELKSSILSKAFRGELGTNDPSEENAVELLKE 432 >gi|218281998|ref|ZP_03488310.1| hypothetical protein EUBIFOR_00879 [Eubacterium biforme DSM 3989] gi|218216985|gb|EEC90523.1| hypothetical protein EUBIFOR_00879 [Eubacterium biforme DSM 3989] Length = 402 Score = 90.2 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 50/378 (13%), Positives = 118/378 (31%), Gaps = 35/378 (9%) Query: 30 IKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 KL G + + +I + D+ S + + + Sbjct: 35 FGHVMKLYRGSSPRPIINYVTTDKSGLNWIKIGDMPSTGNRVFFCKERINKEGSKKSRAV 94 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAI 140 KG I+ + + I+ I F++ ++ + + LLS DV + ++ Sbjct: 95 YKGDIILSNSMSFGKPYILEIDGFIHDGWFVIRDYQNYIDKTYLCQLLSSDVVQNQYKST 154 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G + + + ++ +P + EQ I + +I+ I + + Sbjct: 155 AAGGVVKNISSDLVNSVKFHLPSIMEQRKIARFLELIDQKIEVQIKIIDDLLTVKN---- 210 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 GI E+ + + T + S ++ Sbjct: 211 --------------------GISNKLFKLQQIELSNHYLFEYLIEGDKTAVDTSCYKKIT 250 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 Q L + + + + GE++ + N ++ + + + I ++A Sbjct: 251 VKLNNQGLAFSELNREMADTRPFYVRHKGELIIGKQNYFNGSIAIVT-EQFDNCICSNAI 309 Query: 321 MAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 M+ K GI S +L + + + L Y ++ L ++ + P ++ Q I Sbjct: 310 MSFKIKGIYSDFLYYQISNNNYLNSQSYKANGTGQKELSEKEFLNFKIWCPQLEVQQKIV 369 Query: 380 NVINVETARIDVLVEKIE 397 N +I+ + Sbjct: 370 NCFKSLDLKIENEKAILN 387 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 54/163 (33%), Gaps = 8/163 (4%) Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + I + ++ + E + + V G+I+ L + Sbjct: 62 NWIKIGDMPSTGNRVFFCKERINKEGSKKSRAVYKGDIILSNSMSFGKPYILEIDGFIHD 121 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPI 372 G + + ID TYL L+ S + + + G+ +++ + V + +P I Sbjct: 122 GWF---VIRDYQNYIDKTYLCQLLSSDVVQNQYKSTAAGGVVKNISSDLVNSVKFHLPSI 178 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I + ID +E + I L ++ Sbjct: 179 MEQRKIARFLE----LIDQKIEVQIKIIDDLLTVKNGISNKLF 217 >gi|1174557|sp|P19704|T1SA_ECOLX RecName: Full=Type-1 restriction enzyme EcoAI specificity protein; Short=S.EcoAI; AltName: Full=Type I restriction enzyme EcoAI specificity protein; Short=S protein gi|146402|gb|AAA23987.1| EcoA type I restriction-modification enzyme S subunit [Escherichia coli] Length = 589 Score = 90.2 bits (222), Expect = 6e-16, Method: Composition-based stats. Identities = 63/509 (12%), Positives = 133/509 (26%), Gaps = 99/509 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESG 59 +K K P+ S + +P W+ + R ++N + +I +I + + + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPDGWEWTTLTRIAEINPKIDVSDDEQEISFIPMPLISTK 140 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK------AIIADFDGICSTQFLV 113 + + + FA G I K+ P + + + G+ +T+ V Sbjct: 141 FDGSHEFEIKKWKDVKKGYTHFANGDIAIAKITPCFENSKAAIFSGLKNGIGVGTTELHV 200 Query: 114 LQPKDVLPELLQ---GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV-- 168 +P + + + + A N P+P PPL EQ Sbjct: 201 ARPFSDIINRKYLLLNFKSPNFLKSGESQMTGSAGQKRVPRFFFENNPIPFPPLQEQERI 260 Query: 169 ---------------------------------------LIREKIIAETVRIDTLITERI 189 E++ RI Sbjct: 261 IIRFTQLMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNVEELAENWARISEHFDTLF 320 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------------ 219 + KQ ++ V L P + Sbjct: 321 TTEASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLPP 380 Query: 220 -SGIEWVGLVPDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 S E +P+ WE + + K+ + L NI + + Sbjct: 381 ISDEEKPFELPEGWEWCRLGSIYNFLNGYAFKSEWFTSVGLRLLRNANIAHGVTNWKDVV 440 Query: 276 KP----ESYETYQIVDPGEIVFR----FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 S I+ +IV I+ + + + + A + Sbjct: 441 HIPNDMISDFENYILSENDIVISLDRPIINTGLKYAIISKSDLPCLLLQRVAKFKNYANT 500 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + +++L ++SY S + + ++ + P EQ I + ++ Sbjct: 501 VSNSFLTIWLQSYFFINSIDPGRSNGVPHISTKQLEMTLFPLLPQSEQDRIISKMDELIQ 560 Query: 388 RIDVL----VEKIEQSIVLLKERRSSFIA 412 + L + + L + I Sbjct: 561 TCNKLKYIIKTAKQTQLHLADALTDAAIN 589 >gi|237650545|ref|ZP_04524797.1| restriction modification system DNA specificity subunit [Streptococcus pneumoniae CCRI 1974] Length = 338 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 43/362 (11%), Positives = 89/362 (24%), Gaps = 35/362 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI + L EQ I ++ + I + L + S Sbjct: 120 MKHLTKKYFDNIMVSYTNLGEQQRIASELDLLSKLILRRQEQLEELNLL-------VKSR 172 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +P K ++ G + F + + I Sbjct: 173 FNEMFGDPLNNNKKFAVKT-GQQCFKFSSGKFLDKHDRVFEGYPAYGGNGIAW------- 224 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 ++D I+ + + I+ + +K Sbjct: 225 --------------KSRKYLIDNPTIIIGRVGA----YCGNVRTTHGKVWISDNAIYIKE 266 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 L +L+ + + + + ++ ++PP+ Q + + + Sbjct: 267 FKNSDFNLVFLLELMKVIDFSKFADFSGQPKITQKPLENQKYILPPLALQNEFADFVVQV 326 Query: 386 TA 387 Sbjct: 327 DK 328 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 42/142 (29%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIMVSYTNLGEQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 >gi|222055950|ref|YP_002538312.1| Restriction endonuclease S subunit-like protein [Geobacter sp. FRC-32] gi|221565239|gb|ACM21211.1| Restriction endonuclease S subunit-like protein [Geobacter sp. FRC-32] Length = 644 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 51/400 (12%), Positives = 125/400 (31%), Gaps = 43/400 (10%) Query: 21 IPKHWKVVPIKRFT-KLNTGRTSESGKD----IIYIGLEDVE-SGTGKYLPKDGNSRQSD 74 +P+ W+V + L G + G++ I +V G S + Sbjct: 3 LPESWRVATVGNVLLDLQPGFAQKPGEEDDGTTPQIRTHNVTPDGKITLEGIKHISASAK 62 Query: 75 TSTVSIFAKGQILYGKLGP--YLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLLS 130 + G +++ ++ K + + +G + S L+P L Sbjct: 63 ETARYKLMMGDVVFNNTNSEEWVGKTAVFNQEGEYVFSNHMTRLRPHPELVTPEYLAFYL 122 Query: 131 IDVTQRIEAI---CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + + + + K I + + +P L EQ I + + ++ Sbjct: 123 HQLWAIGYSKTRAKRWVSQAGIESKAIASFKLSLPTLPEQHRIIDVLRQAQDL----RSQ 178 Query: 188 RIRFIELLKEKKQALV-SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + ++L E +AL + G + M+ G H + + Sbjct: 179 KEQVLKLSAELAKALFEQHFGIAGASSAWPMEPFG--------KHTTYSKYGPRFPDQQY 230 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 ++ + ++ I+ E + L E + PG +V Sbjct: 231 SDSGIHILRTTDMNNDGTIRWWEAPKLALT-EGQIQEHALKPGTLVVSRSGTIGP---FA 286 Query: 307 SAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVK 363 E + AY+ + Y+ L + + ++ + ++ +++ Sbjct: 287 LFDGQEGRCVAGAYLIEFGLADSVQPEYVRALFATPYVQQMLKKAVRSVAQPNINAPNIQ 346 Query: 364 RLPVLVPPIKEQFDIT----------NVINVETARIDVLV 393 + + VPP++ Q + I ++ID ++ Sbjct: 347 SIKIPVPPLEIQEAFAVQIKQVRAWTSEIVKSASKIDEVI 386 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 55/182 (30%), Gaps = 12/182 (6%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK----PESYETYQIVDPGEIV 292 L +K + + + N+ + G+K + G++V Sbjct: 16 LLDLQPGFAQKPGEEDDGTTPQIRTHNVTPDGKITLEGIKHISASAKETARYKLMMGDVV 75 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 F + + + ++ + P + YLA+ + Sbjct: 76 FNNTNSEEWVGKTAVFNQEGEYVFSNHMTRLRPHPELVTPEYLAFYLHQLWAIGYSKTRA 135 Query: 351 S--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + ++ + + + +P + EQ I +V+ + L + EQ + L E Sbjct: 136 KRWVSQAGIESKAIASFKLSLPTLPEQHRIIDVL----RQAQDLRSQKEQVLKLSAELAK 191 Query: 409 SF 410 + Sbjct: 192 AL 193 >gi|331006907|ref|ZP_08330153.1| Restriction modification system DNA specificity domain containing protein [gamma proteobacterium IMCC1989] gi|330419283|gb|EGG93703.1| Restriction modification system DNA specificity domain containing protein [gamma proteobacterium IMCC1989] Length = 203 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 19/139 (13%), Positives = 57/139 (41%), Gaps = 5/139 (3%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 ++ I++ G+I+F K ++ ++ ++ + + ++ Y+ +++R Sbjct: 55 TFLKRSILEEGDILFTIAGATIGKSAVVTSDLLPANTNQALAIIRLHQTVNKKYVFYILR 114 Query: 339 SYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S + + G + +L + + +P +EQ I +++ A + E + Sbjct: 115 SNHMKEYIEKSAKGSAQPNLNLRQINEFCIPLPSPEEQTRIVAILDKFDALTSSITEGLP 174 Query: 398 QSIVLLKE----RRSSFIA 412 + I L ++ R ++ Sbjct: 175 REIELRQKQYEYYRDLLLS 193 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 65/190 (34%), Gaps = 12/190 (6%) Query: 26 KVVPIKRFTK-LNTGRTSES--GKDIIYIGLE--DVESGTGKYLPKDGNSRQSDTSTVSI 80 + P+ T + G T +S I +I E D L G + SI Sbjct: 2 EWKPLGELTSLITKGTTPKSFESSGISFIKTEAFDGTRINKNKLSYVGETIHRTFLKRSI 61 Query: 81 FAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQR 136 +G IL+ G + K+ + + +++ + + ++ S + + Sbjct: 62 LEEGDILFTIAGATIGKSAVVTSDLLPANTNQALAIIRLHQTVNKKYVFYILRSNHMKEY 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR---IDTLITERIRFIE 193 IE +G+ + + + I +P+P EQ I + I + I + Sbjct: 122 IEKSAKGSAQPNLNLRQINEFCIPLPSPEEQTRIVAILDKFDALTSSITEGLPREIELRQ 181 Query: 194 LLKEKKQALV 203 E + L+ Sbjct: 182 KQYEYYRDLL 191 >gi|167756439|ref|ZP_02428566.1| hypothetical protein CLORAM_01972 [Clostridium ramosum DSM 1402] gi|167703847|gb|EDS18426.1| hypothetical protein CLORAM_01972 [Clostridium ramosum DSM 1402] Length = 388 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 50/410 (12%), Positives = 123/410 (30%), Gaps = 48/410 (11%) Query: 16 QWIGAI-PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + I + P + P+ + +NT K+I+ G V + Y+ N Sbjct: 6 ELINELCPDGVVLKPLFKLVTINTPSIKILSKNILITGDYPVINQGSDYISGYTN----- 60 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++F K + + G + DF + + + + Sbjct: 61 -DKTALFPKNEYII--FGDHTEIIKYVDFPFAQGADGIKILTSKNINCKYLYYCFVNFYK 117 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + N +P PPL Q I + T L E + Sbjct: 118 TTGKYTRHWSA--------AKNTLIPFPPLPVQEEIVRILDNFTELTAELTAELTAELTA 169 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K++ + ++T G + ++ ++ N + E Sbjct: 170 RKKQYEYYRDSLLT----------------FGDDVERKPLREIATIIRGGNFQKKDFTEK 213 Query: 255 NILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I + YG I + + + + + + +I+ + + Sbjct: 214 GIPCIHYGQIYTRYGLSATKTITFIDGDVAKKSKFANTNDIIMAVTSENIEDVCKCVVWL 273 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLV 369 E + S + A+ H ++ +LA+ + K + G + + + + V + Sbjct: 274 GEEKVAISGHTAIIKHNQNAKFLAYYFHTAMFFKDKKKLAHGTKVIEVTPSKLGDIIVPL 333 Query: 370 PPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKERRSSFIA 412 P + EQ I ++++ + + + + ++ R+ + Sbjct: 334 PSLSEQQRIVDILDRFDTLCNDISKGLPAEIAERQKQYEY---YRNKLLT 380 >gi|218282512|ref|ZP_03488762.1| hypothetical protein EUBIFOR_01344 [Eubacterium biforme DSM 3989] gi|218216499|gb|EEC90037.1| hypothetical protein EUBIFOR_01344 [Eubacterium biforme DSM 3989] Length = 365 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 64/389 (16%), Positives = 121/389 (31%), Gaps = 28/389 (7%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 IK ++G+ + KYL ++ ++ T F K I+ Sbjct: 2 KIKDLCSYAPKSRIKAGEAV----------ENAKYLFFTSSADENKRYTDFQFDKEAIIM 51 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G G + ST LVL P + + + +EA +GA + H Sbjct: 52 GTGG--NATLHYYNGKFSVSTDCLVLFPNSKI-KCKYLYYFFKSHMSVLEAGFKGAGLKH 108 Query: 149 ADWKGIGNIP-MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + K I I +P L Q I + T I+ L E F L K + Sbjct: 109 TNKKYIEEINVSKVPDLTTQEKIVSHLDTITENIEKLNRELELFGSLTKA-------RFI 161 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 +P I VG V D + +P I + G I + Sbjct: 162 EMFGDPLDGSAKYPIHQVGEVADTIDPQPSHRTPPIDESGIP-YISIRDCNYKTGRIDFE 220 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + E + G+ V I + + + + Sbjct: 221 GARKVSRKILEEQSKRYTLHDGDFVIGKIGTIGNPVFIPPRDDY-TLSANVVLVQPNNNL 279 Query: 328 IDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + +L + + S + + F A S + + + V+ + V+ P + Q + Sbjct: 280 VSPYFLKYSLESGYVDRQFAEAKNSTSQAAFGIQKVRTIKVMNPDLNIQRKF----DNFV 335 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++D E++++S+ ++ S + Sbjct: 336 KQVDKSREEVKKSLEKTQQLYDSLMQEYF 364 >gi|148993502|ref|ZP_01822993.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP9-BS68] gi|147927871|gb|EDK78892.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP9-BS68] Length = 426 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 66/415 (15%), Positives = 140/415 (33%), Gaps = 64/415 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDS---------------------------------------GIEWVGLVPDHW 232 +S E +P+ W Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYEEVPCEIPESW 251 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQI 285 E + + + R + + + + ++ L SY+ ++ Sbjct: 252 EWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERL 311 Query: 286 VDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----YMAVKPHGIDSTYLAWLMRSY 340 + G++++ L R ++ G + + V I+ ++ + S Sbjct: 312 LRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIYNFLSSP 371 Query: 341 DLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + V SG ++ L + +K + +PP+ EQ I + I A I+ L+ Sbjct: 372 IVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHINALI 426 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 246 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 305 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 306 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIY 365 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI I+ L Sbjct: 366 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHINAL 425 Query: 185 I 185 I Sbjct: 426 I 426 >gi|300862301|ref|ZP_07108380.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TUSoD Ef11] gi|300848252|gb|EFK76010.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TUSoD Ef11] Length = 320 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 74/190 (38%), Gaps = 10/190 (5%) Query: 229 PDHWEVKPFFALVTELNRK-NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-V 286 +H +V+ T L K + ++Y N+ T + + Q V Sbjct: 10 WEHRKVEELGDTFTGLTGKTKEDFGHGDATFVTYINVFSNPITDLKMTESVEIDAKQNQV 69 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGII--TSAYMAVKPHGID-STYLAWLMRSYDLC 343 + G+I F ++ + S + + S +P Y+A+++RS ++ Sbjct: 70 EYGDIFFTTSSETPEEVGMSSVWLGNEANVYLNSFCFGYRPVTELAPYYMAFMLRSPNVR 129 Query: 344 KVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K F + G+ R ++ V + + VP I EQ + ID L+ ++ + Sbjct: 130 KKFIFLAQGISRYNISKNRVMDIEIPVPNIDEQRKVGQF----FKDIDDLITLHQRKLDQ 185 Query: 403 LKERRSSFIA 412 LKE + +++ Sbjct: 186 LKELKKAYLQ 195 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 9/74 (12%), Positives = 18/74 (24%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W++ + + G++ S + G R T K Sbjct: 216 DWQLCKLGETFSIIMGQSPNSENYTENPDDYILVQGNSDMKNNKVVPRIWTTQVTKKAEK 275 Query: 84 GQILYGKLGPYLRK 97 G ++ P Sbjct: 276 GDLILSVRAPVGEI 289 >gi|217971596|ref|YP_002356347.1| restriction modification system DNA specificity domain-containing protein [Shewanella baltica OS223] gi|217496731|gb|ACK44924.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS223] Length = 642 Score = 89.9 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 52/419 (12%), Positives = 126/419 (30%), Gaps = 40/419 (9%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKD----IIYIGLEDVE-SGTGKYLPKDGNSRQS 73 +P+ W I + G + + GK+ I ++ G + + Sbjct: 2 KLPEGWVETTIGNIIDDMQPGFSQKPGKEDGDTTPQIRTHNISPDGKLTLEGIKHVTASN 61 Query: 74 DTSTVSIFAKGQILYGKLGP--YLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLL 129 S KG +++ ++ K + D +G + S L+ L Sbjct: 62 KESERYSLTKGDVVFNNTNSEEWVGKTAVFDQEGEFVFSNHITRLRANSKLITPDFLAAY 121 Query: 130 SIDVTQRIEAI---CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA--ETVRIDTL 184 + + + + + + +P+P L EQ I + + + Sbjct: 122 LQFLWSMGFSKTRAKRWVSQAGIEGSTLALFRIPLPSLPEQERIVDVLQQVGIVAKAKQS 181 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I + + +V + V V V+E Sbjct: 182 IDDH--------------IDNLVRTAYWEHFSEWYTADGLRDPVRISDIVADSQYGVSEA 227 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + K + S++ + + + L + + +++ G+++F + + Sbjct: 228 MSETGKQAILRMNSITTSGWLNLADLKYATLSEKDIKATTLLN-GDLLFNRTNSKELVGK 286 Query: 305 LRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFE 360 + + ++Y+ GI Y+ + S ++ Sbjct: 287 CAIWRGAKEPFSYASYIVRFRMKEGILPEYIWATLNSSYGKYRLMNSAKQAVSMANVSPT 346 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI-AAAVTGQ 418 D+ R+ V +PP+ Q +IN I+ L +++ E + + A+ G+ Sbjct: 347 DLGRITVPLPPLALQEKFAKLIN----HIETLRQEMLNKQDQYSEL-QTLVTQQALLGE 400 >gi|260664494|ref|ZP_05865346.1| type I restriction-modification system S protein [Lactobacillus jensenii SJ-7A-US] gi|260561559|gb|EEX27531.1| type I restriction-modification system S protein [Lactobacillus jensenii SJ-7A-US] Length = 394 Score = 89.9 bits (221), Expect = 8e-16, Method: Composition-based stats. Identities = 58/402 (14%), Positives = 131/402 (32%), Gaps = 33/402 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 WK V + +G T ++G I +I ++ S + + S+ Sbjct: 14 WKKVKLGEIATTYSGGTPKAGNKKYYNGLIPFIRSGEIHSNKTELF---ISEAGLKNSSA 70 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + KG +LY G + I+ +G + L + PK P ++ L + Sbjct: 71 KMVTKGDLLYALYGATSGEVDISKINGAINQAVLAIIPKQYNPYIISLLLSKKKDAILSK 130 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G I I + + +D L++ + R +EL + Sbjct: 131 YLQGGQG------NLSAEIVKSIKLILPSKNEESSLYPLFKVLDNLLSLQQRKLELENKL 184 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+ + Y+ + L P+ K + + +G + + + K + L+ Sbjct: 185 KKQIAFYLYSFTLTPNFKHIEVKNKKLGD---------IVNISNGIMGDSQKKSGNFKLT 235 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL---QNDKRSLRSAQVMERGI 315 K++ G + + + ++ G+I++ I+ ++ + Sbjct: 236 RIETISNGKIDLSRTGYIDQVSDEKKFLEVGDILYSNINSLTHIGKNAIVKEKHLPLVHG 295 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 I + + + I YL L+ + + + S+ ++ L + P + Sbjct: 296 INLFRLHITNNQITPNYLHGLLNLPKYKWWVKSHANPAVNQASINKTELSSLVIKYPDLD 355 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q I N IN A+ + L + + + Sbjct: 356 IQNQI-NNINYSFAQYWDI---QYSKKESLCQLKQFLLQNLF 393 >gi|312869799|ref|ZP_07729941.1| type I restriction modification DNA specificity domain protein [Lactobacillus oris PB013-T2-3] gi|311094645|gb|EFQ52947.1| type I restriction modification DNA specificity domain protein [Lactobacillus oris PB013-T2-3] Length = 487 Score = 89.9 bits (221), Expect = 8e-16, Method: Composition-based stats. Identities = 54/417 (12%), Positives = 122/417 (29%), Gaps = 48/417 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII---YIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP W+ V + L +GR + + + + + + Sbjct: 73 DIPDSWEWVRLGDVINLISGRDIPKKSHLNKPANDSMPYITGASNIDNNGKITITEWVNN 132 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I G +L G + A++ + + Q + L+ L Q + L + + Sbjct: 133 PSVIVKNGTLLLSVKGTIGKVAVLKIPEAHIARQIMGLENIYKLDLEFQKYFLEDYIEEL 192 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + + +P PPL+EQ I KI + + + ++ +L Sbjct: 193 KSKAKSM--IPGISRDDLLSAVIPFPPLSEQSRIAAKIAQLFALLRKVESSIQQYAKLKV 250 Query: 197 EKKQALVSYIVTKGL---NPDVKMKDSGIEWV---------------------------- 225 K ++ L +P + +E + Sbjct: 251 LLKSKVLDLATRGELVEQDPHDEPASVLLEKIKAEKEELIKEKKIKRSKPLAPIAEDEKP 310 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE---- 281 +P WE +V+ K E Y I+ + +N + + + Sbjct: 311 FDIPASWEWVRLGEIVSVKGGKRVPRGEKLTNQKDYKPYIRVADMKNQSVNFQHIKYASK 370 Query: 282 ------TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLA 334 + + + F + S+ +A + + + +T+L Sbjct: 371 AIFDQLSSYTISSHNVYFSIAGIIGKVGSIPQDLDGALLTENAAKLENIGKNLVSNTFLI 430 Query: 335 WLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + S ++ + + L ++ + PP+ EQ I I + +D Sbjct: 431 NALESDEVKNQHKRILSQVAQPKLALTKLRNTVISFPPLAEQSRIATKIAQLSELLD 487 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 58/195 (29%), Gaps = 11/195 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVT-----ELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + E +PD WE ++ ++ +K+ +N + Sbjct: 66 TEDEKPFDIPDSWEWVRLGDVINLISGRDIPKKSHLNKPANDSMPYITGASNIDNNGKIT 125 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + IV G ++ L+ + E I + +D + Sbjct: 126 ITEWVNNPSVIVKNGTLLLSVKGTIGKVAVLK---IPEAHIARQIMGLENIYKLDLEFQK 182 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + D + + + + +D+ + PP+ EQ I I A + + Sbjct: 183 YFL--EDYIEELKSKAKSMIPGISRDDLLSAVIPFPPLSEQSRIAAKIAQLFALLRKVES 240 Query: 395 KIEQSIVLLKERRSS 409 I+ LK S Sbjct: 241 SIQ-QYAKLKVLLKS 254 >gi|153000503|ref|YP_001366184.1| restriction modification system DNA specificity subunit [Shewanella baltica OS185] gi|151365121|gb|ABS08121.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS185] Length = 616 Score = 89.9 bits (221), Expect = 8e-16, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 61/204 (29%), Gaps = 12/204 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN-- 272 S E +P+ WE K + NI + G I Sbjct: 117 SDDEKPFELPNGWEWSRLSETGLGSTGKTPSTKQSSFFDGNIPFIGPGQITPAGIVLKAE 176 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 L PG+I I K ++ ER + P I S Y Sbjct: 177 KFLSQSGLGNSCEALPGDIFMVCIGGSIGKAAIVV----ERSGFNQQINCISPLHIASKY 232 Query: 333 LAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L + + + V + + LPV + P++EQ I ++ + D Sbjct: 233 LYFALSTNSFHSSVLEKATGSATPIINRGKWEELPVPIAPLEEQHRIVAKVDELMSLCDA 292 Query: 392 LVEKIEQSIVLLKERRSSFIAAAV 415 L + E SI + + + A + Sbjct: 293 LEAQTEASISAHQILVETLLNALL 316 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 66/208 (31%), Gaps = 11/208 (5%) Query: 11 KDSGVQWIGAI---PKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTG 61 K S V IG + P W+ + ++ F + G T + DI ++ +V Sbjct: 409 KQSEVNPIGEVVVLPDTWQQILVQDFADIRLGSTPSRAEPSYWSGDIPWVSSGEVAGSII 468 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDV 119 K + + S+ SI K +L +G + + D + + Sbjct: 469 KDTAEKITQLGFEKSSTSIIPKRSLLMAIIGQGKTRGQTALLGIDACTNQNVAAFIFNEE 528 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 W+ + + G + K + + P+PPL EQ I KI Sbjct: 529 FVVPEFVWIWAQSKYEAHRGDGRGGAQPALNGKIVRSFRFPLPPLEEQHRIVAKIDELMA 588 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIV 207 + L T A+V + Sbjct: 589 LCEQLKTRLADSQTTQLHLTDAIVEQAI 616 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 61/207 (29%), Gaps = 12/207 (5%) Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLV---PDHWEVKPFFALVTELNRKNTK-----LI 252 A ++ + + + K S + +G V PD W+ Sbjct: 392 ARIAKEKAQLIKNNKIKKQSEVNPIGEVVVLPDTWQQILVQDFADIRLGSTPSRAEPSYW 451 Query: 253 ESNILSLSYG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 +I +S G I K + + I+ ++ I + Sbjct: 452 SGDIPWVSSGEVAGSIIKDTAEKITQLGFEKSSTSIIPKRSLLMAIIGQGKTRGQTALLG 511 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + A + ++ +S G G + +L + V+ + Sbjct: 512 IDACTNQNVAAFIFNEEFVVPEFVWIWAQSKYEAHRGDGRG-GAQPALNGKIVRSFRFPL 570 Query: 370 PPIKEQFDITNVINVETARIDVLVEKI 396 PP++EQ I I+ A + L ++ Sbjct: 571 PPLEEQHRIVAKIDELMALCEQLKTRL 597 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 67/194 (34%), Gaps = 7/194 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W+ + +TG+T + +I +IG + + G L + QS Sbjct: 124 ELPNGWEWSRLSETGLGSTGKTPSTKQSSFFDGNIPFIGPGQI-TPAGIVLKAEKFLSQS 182 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G I +G + KA I + Q + P + + L L + Sbjct: 183 GLGNSCEALPGDIFMVCIGGSIGKAAIVVERSGFNQQINCISPLHIASKYLYFALSTNSF 242 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G+ + +P+PI PL EQ I K+ D L + I Sbjct: 243 HSSVLEKATGSATPIINRGKWEELPVPIAPLEEQHRIVAKVDELMSLCDALEAQTEASIS 302 Query: 194 LLKEKKQALVSYIV 207 + + L++ ++ Sbjct: 303 AHQILVETLLNALL 316 >gi|331085650|ref|ZP_08334733.1| hypothetical protein HMPREF0987_01036 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406573|gb|EGG86078.1| hypothetical protein HMPREF0987_01036 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 385 Score = 89.9 bits (221), Expect = 8e-16, Method: Composition-based stats. Identities = 67/405 (16%), Positives = 126/405 (31%), Gaps = 40/405 (9%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGT---GKYLPKDGNSRQSDTSTVSIFA--- 82 ++ + T KD G+ V++G G YL K+ ++ T Sbjct: 2 RLEDVCTVFTDGDWIESKDQSEKGIRLVQTGNIGEGIYLEKESRAKYIPEDTFKRLKCTE 61 Query: 83 --KGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQ 135 G IL +L + +A I I + + +P + L ++ S Sbjct: 62 IFPGDILVSRLPEPVGRACIIPEKTERMITAVDCTICRPDEALISKDYLCYFMRSNAYYM 121 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ G T K +GN+ + +P EQ + E++ ID+ E +L Sbjct: 122 RLLGNVTGTTRKRISRKNLGNVELKVPTKEEQKTVVERLDCLVKVIDSRTKELQLLDDL- 180 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + + V NP I + +RK + Sbjct: 181 ------IKARFVEMFGNPR-------INPNKYPTKLIKDTCIVITGNTPSRKVHEYYGDA 227 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQI---VDPGEIVFRFIDLQNDKRSLRSAQVME 312 I + NI+ L + + S + VD G I+ I R Sbjct: 228 IEWIKTDNIVSSLLYPTVASESLSDSGKAVGRAVDAGAILMACIAGSVASIG-RVCITDR 286 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 A+ P D +L L++ L + G+ + ++ +VP Sbjct: 287 EVAFNQQINAIVPKEYDVRFLHALLQISKDYLVEDINMSLKGI---ISKSKLEEKEFIVP 343 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++EQ + + +ID I+ ++ + S + Sbjct: 344 SMEEQVGFADFV----KQIDKSKVAIQAALDKTQLLFDSLMQKYF 384 >gi|78358464|ref|YP_389913.1| restriction endonuclease S subunits-like [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78220869|gb|ABB40218.1| Restriction endonuclease S subunits-like protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 390 Score = 89.9 bits (221), Expect = 8e-16, Method: Composition-based stats. Identities = 72/394 (18%), Positives = 131/394 (33%), Gaps = 27/394 (6%) Query: 24 HWKVVPIKRFTKLNT--GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 WK+V K R E+ +GLE ++ + NS TS F Sbjct: 9 GWKMVKFGEVVKNANLVEREPEANGVEKIVGLEHIDPEN--LHVRRWNSVVDGTSFTRKF 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIE 138 GQ L+GK Y RK A+F+GICS L +PK+ LPELL S Sbjct: 67 VPGQTLFGKRRAYQRKVAYAEFEGICSGDILTFEPKNRKVLLPELLPFICQSDAFFDHAL 126 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ W + + P+PP+ EQ I E + A ++ + L Sbjct: 127 DTSAGSLSPRTSWTALKDFEFPLPPIDEQKRIAEILWAADEAVEQWTEAYRQAELALNST 186 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + ++ + + V +KD G G P R + + Sbjct: 187 RSQILQELSQTEV--CVSLKDVGRWVSGGTPS---------------RSRSDFWNGDFPW 229 Query: 259 LSYGNIIQKLETRNMGLKPES-YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 +S ++ Q + + + ++ + P E + + + A Sbjct: 230 VSPKDMKQDVISDSEEKLTDTALNGRVTILPSESILIVVRGMILAHTFPVALTGREVTFN 289 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQ 375 + P+ S + + ++ A + L + + + + P +Q Sbjct: 290 QDMKGIIPNSDFSAEFVFHWFKDNSTRILQATEESTHGTKRLATDVLYGMQIPKPSPAKQ 349 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 V ++ + E I S +L+ R++ Sbjct: 350 EMAVTVFETFRTKLAEISEHIASSQQMLRSLRNA 383 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 42/131 (32%), Gaps = 8/131 (6%) Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI---DSTYLAWLMRSYDLCK 344 PG+ +F K + + GI + + +P L ++ +S Sbjct: 68 PGQTLFGKRRAYQRKVAYAEFE----GICSGDILTFEPKNRKVLLPELLPFICQSDAFFD 123 Query: 345 V-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 L + +K +PPI EQ I ++ ++ E Q+ + L Sbjct: 124 HALDTSAGSLSPRTSWTALKDFEFPLPPIDEQKRIAEILWAADEAVEQWTEAYRQAELAL 183 Query: 404 KERRSSFIAAA 414 RS + Sbjct: 184 NSTRSQILQEL 194 >gi|319955097|ref|YP_004166364.1| restriction modification system DNA specificity domain protein [Cellulophaga algicola DSM 14237] gi|319423757|gb|ADV50866.1| restriction modification system DNA specificity domain protein [Cellulophaga algicola DSM 14237] Length = 584 Score = 89.5 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 79/212 (37%), Gaps = 11/212 (5%) Query: 218 KDSGIEWVGLVPDHWEVKP---FFALVTELNRKNTKLIESNILSLSYGN--IIQKLETRN 272 K + E +P+ W +T+ + K E + LS N + + ++ Sbjct: 367 KITKEEIPYELPEGWVWCRMIELCQYITDGTHQTPKYTEEGRMFLSAKNVKPFKFMPEKH 426 Query: 273 MGLKPESYETYQIVDP---GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + E +E Y+ +I+ + + +L + ++ + + P+ ++ Sbjct: 427 RFVSEEDFEGYRRNRKPELNDILLTRVGAGIGEATLIDQDLEFAIYVSVGLLKMFPNKLE 486 Query: 330 STYLAWLMRSYDLCKVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 Y+ + S + + G + +L +++ V +PPI+EQ I +N Sbjct: 487 PNYIVMWLNSPEGRQYSSKNTYGKGVSQGNLNLSLIRQFVVSLPPIEEQKAIVEKVNALM 546 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 D L +++QS + S + G+ Sbjct: 547 GLCDTLEHEVQQSQEYSEMLMQSVLREVFEGK 578 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 72/211 (34%), Gaps = 12/211 (5%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALV-TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 K S E +PD W + K+ K E + + +Q + Sbjct: 75 KISKDEIPYELPDSWVWCRLNDICEYIQRGKSPKYTEIPKIPVISQKCVQWSGFDISRAR 134 Query: 277 P------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM---AVKPHG 327 E Y + + G++++ R + ++ +++ + Sbjct: 135 FITEESLEKYVEERFLQKGDLLWNSTGDGTIGRVISYPGTNYEKVVADSHVTVVRGFKNF 194 Query: 328 IDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I + YL S + ++ GS + L VK + PP++EQ +I V+ Sbjct: 195 IITEYLWIFTASPLIQELVVGRVTGSTKQTELGTGTVKSMEFSFPPLEEQKEIVKVVETL 254 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ L + + I L ++ +S + T Sbjct: 255 FKEVEQLEQLTVERINLKEDFVTSALHQLTT 285 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 77/202 (38%), Gaps = 16/202 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII----YIGLEDVESGTGKYLPKDGNSRQ-SD 74 +P+ W + + T T ++ K ++ ++V+ K++P+ D Sbjct: 376 ELPEGWVWCRMIELCQYITDGTHQTPKYTEEGRMFLSAKNVK--PFKFMPEKHRFVSEED 433 Query: 75 TSTVSIFAK---GQILYGKLGPYLRKAIIAD----FDGICSTQFLVLQPKDVLPELLQGW 127 K IL ++G + +A + D F S L + P + P + W Sbjct: 434 FEGYRRNRKPELNDILLTRVGAGIGEATLIDQDLEFAIYVSVGLLKMFPNKLEPNYIVMW 493 Query: 128 LLSIDVTQRIEA--ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L S + Q +G + + + I + +PP+ EQ I EK+ A DTL Sbjct: 494 LNSPEGRQYSSKNTYGKGVSQGNLNLSLIRQFVVSLPPIEEQKAIVEKVNALMGLCDTLE 553 Query: 186 TERIRFIELLKEKKQALVSYIV 207 E + E + Q+++ + Sbjct: 554 HEVQQSQEYSEMLMQSVLREVF 575 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 67/211 (31%), Gaps = 13/211 (6%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQ--SD 74 +P W + + + G++ + + I I + V+ + + Sbjct: 84 ELPDSWVWCRLNDICEYIQRGKSPKYTEIPKIPVISQKCVQWSGFDISRARFITEESLEK 143 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIAD---FDGICSTQFLVL--QPKDVLPELLQGW 127 KG +L+ G R + + V+ ++ E L + Sbjct: 144 YVEERFLQKGDLLWNSTGDGTIGRVISYPGTNYEKVVADSHVTVVRGFKNFIITEYLWIF 203 Query: 128 LLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S + + + G+T + + ++ PPL EQ I + + ++ L Sbjct: 204 TASPLIQELVVGRVTGSTKQTELGTGTVKSMEFSFPPLEEQKEIVKVVETLFKEVEQLEQ 263 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 + I L ++ + + + T N + Sbjct: 264 LTVERINLKEDFVTSALHQLTTNNANQEWTF 294 >gi|169834252|ref|YP_001694345.1| restriction modification system DNA specificity subunit [Streptococcus pneumoniae Hungary19A-6] gi|168996754|gb|ACA37366.1| restriction modification system DNA specificity domain [Streptococcus pneumoniae Hungary19A-6] Length = 340 Score = 89.5 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 43/362 (11%), Positives = 89/362 (24%), Gaps = 35/362 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI + L EQ I ++ + I + L + S Sbjct: 120 MKHLTKKYFDNIMVSYTNLGEQQRIASELDLLSKLILRRQEQLEELNLL-------VKSR 172 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +P K ++ G + F + + I Sbjct: 173 FNEMFGDPLNNNKKFAVKT-GQQCFKFSSGKFLDKHDRVFEGYPAYGGNGIAW------- 224 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 ++D I+ + + I+ + +K Sbjct: 225 --------------KSRKYLIDNPTIIIGRVGA----YCGNVRTTHGKVWISDNAIYIKE 266 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 L +L+ + + + + ++ ++PP+ Q + + + Sbjct: 267 FKNSDFNLVFLLELMKVIDFSKFADFSGQPKITQKPLENQKYILPPLALQNEFADFVVQI 326 Query: 386 TA 387 Sbjct: 327 DK 328 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 42/142 (29%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIMVSYTNLGEQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 >gi|307150616|ref|YP_003886000.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 7822] gi|306980844|gb|ADN12725.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7822] Length = 467 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 60/458 (13%), Positives = 137/458 (29%), Gaps = 60/458 (13%) Query: 25 WKVVPIKRFTK-----LNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 W+VV ++ + G + I I ++ GT ++ + Sbjct: 8 WQVVTLEDIAQKDGHGFVDGPFGSNLPASEYVPFGIPVIRGTNLSLGTTRFKDDEFVFVS 67 Query: 73 SDTSTV---SIFAKGQILYGKLGPYLRKAIIADFDGI------CSTQFLVLQPKDVLPEL 123 +T+ S+ G I++ K G + AII + L + + P Sbjct: 68 EETAKRLERSLCEPGDIIFTKKGTLGQTAIIPFNHKYQKFLLSSNQMKLTVDIQKAEPLF 127 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + ++ S +I E + + + P+ +PPL EQ I + +I+ Sbjct: 128 VYYYVSSFTSRSKIIQDSEATGVPKTNLTYLRKFPIVLPPLPEQKAIAHILGTLDDKIEL 187 Query: 184 LITERIRFIELLKEKKQALV------------SYIVTKGLNPDVKMKDSGIE-WVGLVPD 230 + + ++ +V DS E +GL+P Sbjct: 188 NQQMNQTLEAMARAIFKSWFVDFDPVRAKMEGKQLVGMDEATAALFPDSFEESDLGLIPK 247 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN----------MGLKPESY 280 W V + + + ++ I+ + + + S Sbjct: 248 GWRVSTLDEVTEFVLGGDWGKDLASEQYNQPAYCIRGADIPDLQNAGLGKMPIRYLKASS 307 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG---------IITSAYMAVKPHGIDST 331 + + G IV + + R + + I Sbjct: 308 LKKRSLQAGNIVIEISGGSPTQSTGRPVLITLNLLDRLSYPLVCSNFCRLIFLKEDISPN 367 Query: 332 YLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDV-KRLPVLVPPIKEQFDITNVINVETAR 388 ++ +R F Y G+ ++L ++ ++ +++P Q + V T Sbjct: 368 FIYLWLRWLYASDSFLQYENGTTGIKNLAYKIFSEKYELVLP----QQYVLKVFEKTTQP 423 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + + +L R + + ++GQI ++ + Sbjct: 424 LFKKRDANGLQSEILATIRDTLLPKLMSGQIRVKEAEK 461 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 64/218 (29%), Gaps = 29/218 (13%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGR-------TSESGKDIIYIGLEDV----ESGTGKYLPK 66 +G IPK W+V + T+ G + + + I D+ +G GK + Sbjct: 242 LGLIPKGWRVSTLDEVTEFVLGGDWGKDLASEQYNQPAYCIRGADIPDLQNAGLGKMPIR 301 Query: 67 DGNSRQSDTSTVSIFAKGQILY-----GKLGPYLRKAIIA-------DFDGICSTQFLVL 114 + + G I+ R +I + +CS ++ Sbjct: 302 YLKASSLKKRS---LQAGNIVIEISGGSPTQSTGRPVLITLNLLDRLSYPLVCSNFCRLI 358 Query: 115 QPKDVL-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 K+ + P + WL + + G T + Q + + Sbjct: 359 FLKEDISPNFIYLWLRWLYASDSFLQYENGTT--GIKNLAYKIFSEKYELVLPQQYVLKV 416 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 T + E+L + L+ +++ + Sbjct: 417 FEKTTQPLFKKRDANGLQSEILATIRDTLLPKLMSGQI 454 >gi|260582435|ref|ZP_05850227.1| type I restriction/modification specificity protein [Haemophilus influenzae NT127] gi|260094586|gb|EEW78482.1| type I restriction/modification specificity protein [Haemophilus influenzae NT127] Length = 418 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 57/400 (14%), Positives = 127/400 (31%), Gaps = 47/400 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + P+ + + + G N+ Q + G+ Sbjct: 18 EWKPLDEVANIANNARKPVKS---SLRIS------GNIPYYGANNIQDYVEGYT--HDGE 66 Query: 86 ILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G A + V+ K+ L L+ A Sbjct: 67 FVLIAEDGSASLENYSIQWAVGKFWANNHVHVVNGKEKLNNRFLYHYLTNMNFIPFLA-- 124 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + + IP+PIPPL+ Q I + A T L +E I + + ++ Sbjct: 125 -GKERAKLTKAKLQQIPIPIPPLSVQTEIVRILDALTALTSELTSELILRQKQYEYYREK 183 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+S +++ + ++W+ ++ L+ + E+ + ++ Y Sbjct: 184 LLS-------FDSLELSEGVVQWI-------KLIDLGELIRGNGLQKKDFTETGVPAIHY 229 Query: 262 GNIIQKLETR----NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 G I T + PE + + VD G++V + + + +T Sbjct: 230 GQIYTYYGTFATKTKSFVSPELAKKLKKVDYGDVVITNTSENFEDVGKAMVYLGKEQAVT 289 Query: 318 SA---YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIK 373 I S Y +L ++ G + + D+ ++ + +PP+K Sbjct: 290 GGHATIFKPNHEKILSKYFVYLTQTSFFTNEKRKYAKGTKVIDVSATDMAKIILPIPPLK 349 Query: 374 EQFDITNVINVETARIDVL-------VEKIEQSIVLLKER 406 EQ I ++++ + + +E+ ++ +E Sbjct: 350 EQHRIVSILDKFETLTNSITEGLPLAIEQSQKRYEYYREL 389 >gi|313681904|ref|YP_004059642.1| restriction modification system DNA specificity domain [Sulfuricurvum kujiense DSM 16994] gi|313154764|gb|ADR33442.1| restriction modification system DNA specificity domain [Sulfuricurvum kujiense DSM 16994] Length = 635 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 62/439 (14%), Positives = 137/439 (31%), Gaps = 57/439 (12%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + + + + +V+ + + D + I + Q +YG Sbjct: 6 LGDILTESKVESLNPDPNNRITVRLNVKGVEKRPVKNDT----EGATKYYIRSFNQFIYG 61 Query: 90 KLGPYLRKAIIADFD---GICSTQ--FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 K + I + S+ + PE + + + + +E I GA Sbjct: 62 KQNLFKGAFGIIPKELDGFETSSDLPCFDIDINRCKPEWILYFFKKGNFYKTLEKIARGA 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 K I +P+P + +Q I KI + T + E LLK+ +Q+++ Sbjct: 122 GSKRISPKDFFKIEIPLPSIDQQESILNKISSITNYSIRIEDEIFSQQNLLKKLRQSILQ 181 Query: 205 YIVTKGLNPDVKMKDSGIEWVGL-----------------------------------VP 229 + L + ++S +E V +P Sbjct: 182 EAIEGKLTAQWRKENSDVESVSKLLKKIRDEKERLIKEKKIKKGAEVSPILTNDIPFIIP 241 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-------PESYET 282 +W L+ L +K + I + + + + E + Sbjct: 242 QNWGWCRLGNLLRSLEYGTSKKCFQEKKYNTPILRIPNISSGIINVDDLKFTDLSEKEKA 301 Query: 283 YQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMR 338 ++ +I+ + RS+ + + + ID+ Y+ + +R Sbjct: 302 QYTLENNDILIIRSNGSREIVGRSVLVSNEFQNYGYAGYLIRLRFIGISIDAKYIQYALR 361 Query: 339 SYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + + + ++ ++ L + +PPI+EQ I + A D L ++I Sbjct: 362 SPYIREQIEMPLRTTVGINNINSVEISNLLIPLPPIEEQNVIVEKVENLFAMCDDLEQQI 421 Query: 397 EQSIVLLKERRSSFIAAAV 415 +S + S + A Sbjct: 422 NESKANAEMLMQSVLKEAF 440 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 64/175 (36%), Gaps = 2/175 (1%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K L ++ ++ +E R + E Y I + ++ +L + Sbjct: 13 SKVESLNPDPNNRITVRLNVKGVEKRPVKNDTEGATKYYIRSFNQFIYGKQNLFKGAFGI 72 Query: 306 RSAQVMERGIITSA-YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVK 363 ++ + + + ++ + + + K + G + + +D Sbjct: 73 IPKELDGFETSSDLPCFDIDINRCKPEWILYFFKKGNFYKTLEKIARGAGSKRISPKDFF 132 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ + +P I +Q I N I+ T + ++I LLK+ R S + A+ G+ Sbjct: 133 KIEIPLPSIDQQESILNKISSITNYSIRIEDEIFSQQNLLKKLRQSILQEAIEGK 187 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/201 (11%), Positives = 55/201 (27%), Gaps = 14/201 (6%) Query: 21 IPKHWKVVPIKRFT-KLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP++W + L G + + + + + ++ SG Sbjct: 240 IPQNWGWCRLGNLLRSLEYGTSKKCFQEKKYNTPILRIPNISSGIINVDDLKFTDLSEKE 299 Query: 76 STVSIFAKGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLPELLQGWL 128 IL + G + + + Sbjct: 300 KAQYTLENNDILIIRSNGSREIVGRSVLVSNEFQNYGYAGYLIRLRFIGISIDAKYIQYA 359 Query: 129 LSIDVTQRIEAIC--EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L + + +++ + I N+ +P+PP+ EQ +I EK+ D L Sbjct: 360 LRSPYIREQIEMPLRTTVGINNINSVEISNLLIPLPPIEEQNVIVEKVENLFAMCDDLEQ 419 Query: 187 ERIRFIELLKEKKQALVSYIV 207 + + Q+++ Sbjct: 420 QINESKANAEMLMQSVLKEAF 440 >gi|121595899|ref|YP_987795.1| restriction modification system [Acidovorax sp. JS42] gi|120607979|gb|ABM43719.1| restriction modification system, type I [Acidovorax sp. JS42] Length = 396 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 56/404 (13%), Positives = 133/404 (32%), Gaps = 40/404 (9%) Query: 24 HWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-F 81 W+ V + + E+ YI + +++ + + F Sbjct: 9 GWRRVKFGDVVRQCKEKADPETSGLERYIAGDHMDTDDLRLRRWGEIGSGYLGPAFHMRF 68 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWLLSIDVTQRIE 138 GQ+LYG YLRK +ADF+GIC+ V P ++LPE L + + Sbjct: 69 KPGQVLYGSRRTYLRKVAVADFEGICANTTFVLEPQNPNELLPEFLPFLMQTEAFNDFSV 128 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +G+ + ++ + +PP+ EQ + A T + + +L+ Sbjct: 129 KNSKGSVNPYINFSDLAKFEFVLPPIDEQQSAIALLSAATDQCHAVEAAHRAAGRMLQSF 188 Query: 199 KQALVSYIVTKGLNPDVKMKDSGI--EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 K +L+ L + +S + + + P+ P + Sbjct: 189 KDSLL-------LRKTSSLANSFLLGDLLLRSPESGCSAP----------PKDADTGYFV 231 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 L L+ + + ++P S + G+++ + + + + Sbjct: 232 LGLAALSRDGYVSGDFKPVEPTSKMVAAKLSMGDMLISRSNTVDRVGFVGIFSDNRDDVS 291 Query: 317 TSAYMA---VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVP 370 M P + +L L+++ + + +G + + ++ ++ + VP Sbjct: 292 FPDTMMRLQPNPALVHPHFLEALLQTTSAREFLMRIAAGTSASMKKINRANLLQMRLNVP 351 Query: 371 PIKEQFDITNVINVETARIDVLVEKIE------QSIVLLKERRS 408 + Q ++ + + + + + L R+ Sbjct: 352 DLDVQEM---ALDEL-QQFKNAIATQKARWDAARQLTRLIAMRT 391 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 52/132 (39%), Gaps = 6/132 (4%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSY 340 + PG++++ K ++ + + + ++ P+ + +L +LM++ Sbjct: 65 HMRFKPGQVLYGSRRTYLRKVAVADFEGI---CANTTFVLEPQNPNELLPEFLPFLMQTE 121 Query: 341 DLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 G + + F D+ + ++PPI EQ +++ T + + + Sbjct: 122 AFNDFSVKNSKGSVNPYINFSDLAKFEFVLPPIDEQQSAIALLSAATDQCHAVEAAHRAA 181 Query: 400 IVLLKERRSSFI 411 +L+ + S + Sbjct: 182 GRMLQSFKDSLL 193 >gi|194336529|ref|YP_002018323.1| restriction modification system DNA specificity domain [Pelodictyon phaeoclathratiforme BU-1] gi|194309006|gb|ACF43706.1| restriction modification system DNA specificity domain [Pelodictyon phaeoclathratiforme BU-1] Length = 392 Score = 89.5 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 54/401 (13%), Positives = 119/401 (29%), Gaps = 28/401 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAKG 84 K V +++ T + G++ ES G + K R S G Sbjct: 2 KTVELQQVTTIIAGQSPESSTYNSIADGLPFFQGKADFQDKFPKVRIWCNSAKRKEADPG 61 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL P I + I ++P L + L + ++ G+ Sbjct: 62 DILMSVRAPV-GSVNICNQKCIIGRGLSAIRPDANLNNYFLYYYLKCNEKNVA-SLGTGS 119 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T + + +P+PPL +Q+ + I + + EL L S Sbjct: 120 TFQAITQTTLKRLDVPLPPLDDQIRSATLLSKVENLIFRRREQLKQLDEL-------LKS 172 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + +P + + + R + +++ + Sbjct: 173 VFLEMFGDPVR---------NEMGWEMKRMDEISDSRLGKMRDKKFITGNHLRKYIGNSN 223 Query: 265 IQKLETRNMGLKPESYETYQIV----DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 +Q + L+ ++ + V G+++ RS Sbjct: 224 VQWFRFKLDDLEEMDFDERERVLFALMDGDLLICEGGDIGRCAIWRSNLSECYFQKAIHR 283 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + YL ++M + L F + L E +K + +P ++ Q + Sbjct: 284 VRLHKSQAIPEYLQYVMLFFSLYNGFKNVTCKATISHLTGEKLKETLIPLPSLELQNRFS 343 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ +++ + S + L+ A G++D Sbjct: 344 TIV----KKVEKIKITYTHSFINLESLYGILSQKAFKGELD 380 >gi|325697670|gb|EGD39555.1| hypothetical protein HMPREF9384_0501 [Streptococcus sanguinis SK160] Length = 412 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 56/425 (13%), Positives = 138/425 (32%), Gaps = 41/425 (9%) Query: 23 KHWKVVPIKR----FTKLNTGRTSESGKDII------YIGLEDVESGTGKYLPKDGNSRQ 72 WK + +K F + G +++ ++ +V + + D +++ Sbjct: 2 SEWKFLTLKEAELEFIDGDRGINYPKKSELLLEGDCVFLNTGNVRQNSFDFSNLDFITKE 61 Query: 73 SDTSTVS-IFAKGQILYGKLGPYLRKAIIADFDGICSTQF------LVLQPKDVLPELLQ 125 D + + I+ G A+ + + + + + P + Sbjct: 62 KDNLLRNGKLQRDDIVLTTRGTVGNVALYSQEVPFSNIRINSGMVIIRVNKNFWHPYFVY 121 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + S ++I + G+ + + +P L EQ ++II ID I Sbjct: 122 LFFQSHLFKKQISRLISGSAQPQLPISILETVSIPQLTLDEQ----KEIIFNIKSIDQKI 177 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDV---KMKDSG------IEWVGLVPDHWEVKP 236 + + L+ + L Y + PD K SG E +P+ W V+ Sbjct: 178 QINNQINQELETMAKTLYDYWFVQFDFPDQNGKPYKSSGGKMVYNPELKRQIPEGWGVEK 237 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + + K L ++ + + +E ++ + Sbjct: 238 LGDITICHDSKRVPLSSNDRELVKGEIPYYGATGIMDYVNNYIFEGDYVLMAED----GS 293 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + + + + A++ L L++ + K+ ++ Sbjct: 294 VMTEKGTPILQRISGKNWVNNHAHVLEPIKNHSCKLLMMLLKDVSVMKI---KTGSIQMK 350 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + E++ ++ V P++ F+I + V + L+E+ +Q L + R + + Sbjct: 351 INQENMNKIVVPAIPLELLFEINQKLEVIDKQQLNLIEENKQ----LTQLRDWLLPMLMN 406 Query: 417 GQIDL 421 GQ+ + Sbjct: 407 GQVKV 411 Score = 40.5 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 63/193 (32%), Gaps = 16/193 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W V + T + + + + ++ P G + D Sbjct: 228 QIPEGWGVEKLGDITICHDSKRVPLSSNDRELVKGEI--------PYYGATGIMDYVNNY 279 Query: 80 IFAKGQILYGKLGPYL---RKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 IF +L + G + I+ G + VL+P + L+ + Sbjct: 280 IFEGDYVLMAEDGSVMTEKGTPILQRISGKNWVNNHAHVLEP---IKNHSCKLLMMLLKD 336 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I G+ + + + I +P PL I +K+ + LI E + +L Sbjct: 337 VSVMKIKTGSIQMKINQENMNKIVVPAIPLELLFEINQKLEVIDKQQLNLIEENKQLTQL 396 Query: 195 LKEKKQALVSYIV 207 L++ V Sbjct: 397 RDWLLPMLMNGQV 409 >gi|291320524|ref|YP_003515788.1| type I R/M system specificity subunit [Mycoplasma agalactiae] gi|290752859|emb|CBH40834.1| Type I R/M system specificity subunit [Mycoplasma agalactiae] Length = 410 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 53/403 (13%), Positives = 120/403 (29%), Gaps = 28/403 (6%) Query: 25 WKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ + F G + E+ I + Y + S I Sbjct: 19 WEQEKLGNFGTSTGGSSIENFFNNNGKYKVISIGSFSEDNT-YNDQGLRIDYSPFIKDKI 77 Query: 81 FAKGQILY-----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 K I+ L KA++ + D V + L ++ ++ + Sbjct: 78 LKKDNIVMILNDKSSEAKILGKALLIEKDDEFVYNQRVQKIDINKDRFLSKFIFTLLNSN 137 Query: 136 RIEAIC---EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 E I +G T + +W I +I IP L EQ I I + Sbjct: 138 SREKITLLAQGNTQIYVNWSSISSIEYLIPNLEEQSQISSLFSHLDSLITLHQRKLSSLK 197 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L + + + + + W+ + + N KN LI Sbjct: 198 NLKNRL-------LDKMFCDEKSQFPSIRFKEFTNAWEQWKARGILLPYRQKNDKNLTLI 250 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 ++ + ++ + I+ + + S+ + Sbjct: 251 SYSVSNKEGFVDQKEFFDEGGKAVYADKKNSLIISFDTFAYNPSRIN--VGSIALFKNTI 308 Query: 313 RGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVP 370 G+++ Y + + ++ +S K+ +R +L + + + +P Sbjct: 309 NGLVSPIYEVFKVSANSNPDFIYLWFKSECFNKIVANNSNKSVRDTLNLKQFEDNLLNLP 368 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++EQ I + +D L+ ++ + LK +++ + Sbjct: 369 VLQEQNKIA----KLFSSLDSLITLHQRKLNSLKNIKNTLLEK 407 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 72/208 (34%), Gaps = 12/208 (5%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 L P ++ K+ W ++ + + N K +I S S N Sbjct: 6 LVPKIRFKEFTNAWEQEKLGNFGTSTGGSSIENFFNNNGKYKVISIGSFSEDNTYNDQGL 65 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA---QVMERGIITSAY--MAVKP 325 R + + +I+ IV D ++ + L A + + + + + Sbjct: 66 R---IDYSPFIKDKILKKDNIVMILNDKSSEAKILGKALLIEKDDEFVYNQRVQKIDINK 122 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 S ++ L+ S K+ + + + + + L+P ++EQ I + Sbjct: 123 DRFLSKFIFTLLNSNSREKITLLAQGNTQIYVNWSSISSIEYLIPNLEEQSQI----SSL 178 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ + LK ++ + Sbjct: 179 FSHLDSLITLHQRKLSSLKNLKNRLLDK 206 >gi|194467966|ref|ZP_03073952.1| restriction modification system DNA specificity domain [Lactobacillus reuteri 100-23] gi|194452819|gb|EDX41717.1| restriction modification system DNA specificity domain [Lactobacillus reuteri 100-23] Length = 385 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 54/377 (14%), Positives = 131/377 (34%), Gaps = 30/377 (7%) Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTST---VSIFAKGQILYGKLGPYLRKAIIADFDGIC 107 + +++ + D ++D KG +L +G R AI+ + + + Sbjct: 26 LSAKNIINNRVVITSNDRKISENDFKKIHDKFQLRKGDVLLTIVGTIGRSAILKEANKLT 85 Query: 108 STQ----FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 + + L S + +++ + I + + IP Sbjct: 86 FQRSVAYLRPDENILTSSNFLFSLSKSSNFQNQLKKRTVISAQPGIYLSDIDKLNITIPE 145 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE 223 L E+ +KI + + ID++++ + R +E LK+ K+A++ + + ++ Sbjct: 146 LKEEQ---DKIASIIITIDSILSLQQRKLEQLKQLKKAMLQQLFVNKNSKQPNLRFKN-- 200 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 W+ + ++ + K+ + + G + + ++ + +S Y Sbjct: 201 ----FNGDWKQRKGKSIFYSKSNKDFPELTVLSATQDKGMVPRSSTGIDIKYEKKSLRGY 256 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL---MRSY 340 + V+PG+ + Q A GI++ AY + + SY Sbjct: 257 KKVEPGDFIVHLRSFQG-----GFAYSDLTGIVSPAYTVFTFKQPEMFNNYFWKEKFTSY 311 Query: 341 DLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + ++ + G+R +S+ + D L P EQ I + I+ L+ + Sbjct: 312 NFIQLLKKVTYGVRDGRSISYSDFLTLNEKFPVEVEQTKIAD----LFKTINNLIAFQQN 367 Query: 399 SIVLLKERRSSFIAAAV 415 + L + + Sbjct: 368 KLTQLTALKKHLLQKLF 384 >gi|194364815|ref|YP_002027425.1| restriction modification system DNA specificity domain [Stenotrophomonas maltophilia R551-3] gi|194347619|gb|ACF50742.1| restriction modification system DNA specificity domain [Stenotrophomonas maltophilia R551-3] Length = 364 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 52/365 (14%), Positives = 104/365 (28%), Gaps = 34/365 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W +VP+K L G +G+ + G+ N + V I Sbjct: 8 SGWPLVPLKNIATLKRGYDLP-------VGMRN----KGEVPIYAANGQNGSHDEVKING 56 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G ++ G+ G R +T V+ P + L + + E E Sbjct: 57 PG-VITGRSGTIGRVHYCEGGFWPLNTALYVMDFHGNHPRWVYYMLSAFKL----ERFSE 111 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA + + + + +P+PPL EQ I + +L L Sbjct: 112 GAGVPTLNRNLVHDELIPLPPLPEQKRIAAILDKADAIRRKRQQAIQLADDL-------L 164 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + +P K + + + K + + K T ++ G Sbjct: 165 RAVFLDMFGDPVTNPKGWPRKALRTLGSSITGKTPPSEKAGMWGKGT-------PFVTPG 217 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 ++ + + + E ++ G ++ I K + + V A Sbjct: 218 DLNGCIRSSAREVTEEGLANSRLCRAGGLLVCCIGATIGKVGISESPVT----FNQQINA 273 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + + + V LK + + VPP Q ++ Sbjct: 274 QEWNCEVHDIYGYFVFKICPQLVRDGAIQTTLPILKKSLFDGIEIPVPPRAMQAKFAGIV 333 Query: 383 NVETA 387 A Sbjct: 334 ESTLA 338 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 41/109 (37%), Gaps = 7/109 (6%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I ++ + +A + HG ++ +++ ++ L + Sbjct: 58 GVITGRSGTIGRVHYCEGGFWPLNTALYVMDFHGNHPRWVYYMLSAFKLERFSE---GAG 114 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 +L V + +PP+ EQ I +++ + D + K +Q+I L Sbjct: 115 VPTLNRNLVHDELIPLPPLPEQKRIAAILD----KADAIRRKRQQAIQL 159 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 44/158 (27%), Gaps = 13/158 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 PK W ++ TG+T S GK ++ D+ G + Sbjct: 179 PKGWPRKALRTLGSSITGKTPPSEKAGMWGKGTPFVTPGDL---NGCIRSSAREVTEEGL 235 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + + G +L +G + K I++ + Q + Sbjct: 236 ANSRLCRAGGLLVCCIGATIGKVGISESPVTFNQQI----NAQEWNCEVHDIYGYFVFKI 291 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + +GA + P+ + + + Sbjct: 292 CPQLVRDGAIQTTLPILKKSLFDGIEIPVPPRAMQAKF 329 >gi|307566385|ref|ZP_07628824.1| type I restriction modification DNA specificity domain protein [Prevotella amnii CRIS 21A-A] gi|307344962|gb|EFN90360.1| type I restriction modification DNA specificity domain protein [Prevotella amnii CRIS 21A-A] Length = 399 Score = 89.5 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 56/395 (14%), Positives = 118/395 (29%), Gaps = 22/395 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ I+ + + + +++ G V G + ++ + + Sbjct: 21 DWEKKKIESIISQESSTMAMNKLELLKEGFP-VYGADGLIGYINDFQQKEEYIS------ 73 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 K G + K + L+ KD W+ + T + +G Sbjct: 74 ----MVKDGSGVGKLNLCQKHSSILGTLTALKSKDS-KRYFLKWIYYLLNTLDFSSYVKG 128 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 A + H + I + + IP +EQ I + + + I+ + K Q L+ Sbjct: 129 AGIPHIYYSDIKHKCIYIPSFSEQEKIADCLSSLDDYINATQEKIEILQAHKKGLIQQLL 188 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + P ++ G E+ T + I + Sbjct: 189 PALGKTM--PQKRLPKFGKSKKWSPYSMEEMFKIRNGYTPSKSNPKFWEDGTIPWFRMED 246 Query: 264 IIQKL---ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 I + + E+ + + I+ + + + + Sbjct: 247 IREHGHILSDSIQHITKEAVKGKGLFPANSIIVATTATIGEHALIIVDSLANQRFTFLTK 306 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 ID Y + M D +G S+ KRL V +P +EQ +I Sbjct: 307 RKSFDTQIDMKYFYYYMYIID-EWCKQHTNAGGFASVDMNGFKRLSVSLPSPEEQKEIAE 365 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 A ID L++ +Q +V+L++ + + Sbjct: 366 ----CFASIDDLIDSTKQKLVMLQKHKQGLMQQLF 396 >gi|225076052|ref|ZP_03719251.1| hypothetical protein NEIFLAOT_01084 [Neisseria flavescens NRL30031/H210] gi|224952612|gb|EEG33821.1| hypothetical protein NEIFLAOT_01084 [Neisseria flavescens NRL30031/H210] Length = 387 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 55/397 (13%), Positives = 107/397 (26%), Gaps = 34/397 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP---KDGNSRQSDTSTVS 79 + WK + ++ R + + + GT P + Sbjct: 16 EEWKNKTLGDLGRVEMCRRIFKEQTQPSGEIPFFKIGTFGQEPDAFISSELFEEYRQKYP 75 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +G IL G R + +V E Q I+ Sbjct: 76 YPKQGDILISAAGTIGRTVEFTGENAYFQDSNIVW---LRFDESQITSTFLNITYQNIKW 132 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG+T+ + + + +P L EQ + +I +E ++ K Sbjct: 133 GLEGSTIKRLYNSDLLSAEITVPSLPEQTHLGLFFRRLDSQIAE----SRAVLEKSRQLK 188 Query: 200 QALVSYIVTKGLN--PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +A+++ + P ++ K EW ++ N + K + Sbjct: 189 KAMLAKMFPANGEKIPQIRFKGFEGEWETYQICDLFRITRGNVLATTNLVDNKNEDYCYP 248 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S + L E+ T+ F + ++ + E G Sbjct: 249 VYSSQTKNKGLMGYWKHYLFENAITWTTDGANAGDVNFRSGKFYCTNVCGVLINEDGFAN 308 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQF 376 + S L + +P+L+PP IKEQ Sbjct: 309 QCIAEILNLVTHSYVSY-----------------VGNPKLMNNVMAEIPILIPPTIKEQT 351 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I N ++D + + L + +AA Sbjct: 352 AIGNF----FLQLDETIALQSAEVEKLNRLKKGLLAA 384 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 54/202 (26%), Gaps = 15/202 (7%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ K EW + K I G Q+ + Sbjct: 7 PRLRFKGFTEEWKNKTLGDLGRVEMCRRIF----KEQTQPSGEIPFFKIGTFGQEPDAFI 62 Query: 273 MGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 E Y Y G+I+ E + + D + Sbjct: 63 SSELFEEYRQKYPYPKQGDILISAAGTIGRTV----EFTGENAYFQDSNIVWL--RFDES 116 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + + + + + L D+ + VP + EQ + + R+D Sbjct: 117 QITSTFLNITYQNIKWGLEGSTIKRLYNSDLLSAEITVPSLPEQT----HLGLFFRRLDS 172 Query: 392 LVEKIEQSIVLLKERRSSFIAA 413 + + + ++ + + +A Sbjct: 173 QIAESRAVLEKSRQLKKAMLAK 194 >gi|315026885|gb|EFT38817.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX2137] Length = 377 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 49/392 (12%), Positives = 121/392 (30%), Gaps = 38/392 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + K++TG+ + K VE+G P S + S ++ Sbjct: 18 EEWEQCKAEELCKISTGKGNTQDK---------VENGK---YPFYVRSENIERSNYFLYD 65 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + +L G + + + + + + S++ +R+ ++ Sbjct: 66 QEAVLTVGDGVGTGRVFHYVSGKYNLHQRVYRMYDFNKQISAKYFYYYFSLNFHRRVRSL 125 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 ++ I ++ + P EQ+ I + + IT R +E LKE K+ Sbjct: 126 TAKTSVDSVRLNMIADMEIKYPSELEQLKIFSFLDY----LIKSITLHQRKLEQLKELKK 181 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A + + K++ + E + + + + + + Sbjct: 182 AYLQLMFPTKEERVPKLRFADFEGEWELCKLIGILDIIKGTQKSKSELSTNQNNCTPYPV 241 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 Y I N+ + + + V E+ Sbjct: 242 YNGGINPSGYTNIYNREN---------------AITISEGGNSAGFVNFVQEKFFSGGHN 286 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + D+ +L + + S ++ +++ + L + EQ I Sbjct: 287 YTIVNNVTDTLFLFFYLCSIQ-EEIMRLRVGTGLPNIQKPTLMNLEIQKTTDNEQKFIGL 345 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + ID+L+ + + LK + S++ Sbjct: 346 FL----KNIDILITLTQNKLNQLKSLKKSYLQ 373 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 51/186 (27%), Gaps = 7/186 (3%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIV 286 + +V + K E +S GN K+E S + + Sbjct: 4 EMKKVPRLRFRGFSEEWEQCKAEELCKISTGKGNTQDKVENGKYPFYVRSENIERSNYFL 63 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 E V D R M I + Y + +V Sbjct: 64 YDQEAVLTVGDGVGTGRVFHYVSGKYNLHQRVYRMYDFNKQISAKYFYYYFSLNFHRRVR 123 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 S++ + + + P EQ I + ++ + ++ + LKE Sbjct: 124 SLTAKTSVDSVRLNMIADMEIKYPSELEQLKIFSFLDYLIKS----ITLHQRKLEQLKEL 179 Query: 407 RSSFIA 412 + +++ Sbjct: 180 KKAYLQ 185 >gi|26251229|ref|NP_757269.1| putative restriction modification enzyme S subunit [Escherichia coli CFT073] gi|227885169|ref|ZP_04002974.1| restriction modification enzyme S subunit [Escherichia coli 83972] gi|300980747|ref|ZP_07175162.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 45-1] gi|26111662|gb|AAN83843.1|AE016772_21 Putative restriction modification enzyme S subunit [Escherichia coli CFT073] gi|227837998|gb|EEJ48464.1| restriction modification enzyme S subunit [Escherichia coli 83972] gi|300409162|gb|EFJ92700.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 45-1] gi|307556579|gb|ADN49354.1| type I restriction-modification system, S subunit [Escherichia coli ABU 83972] gi|315293301|gb|EFU52653.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 153-1] Length = 589 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 63/509 (12%), Positives = 132/509 (25%), Gaps = 99/509 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESG 59 +K K P+ S + +P W+ + R ++N + +I +I + + + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPDGWEWTTLTRIAEINPKIDVSDDEQEISFIPMPLISTK 140 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK------AIIADFDGICSTQFLV 113 + + + FA G I K+ P + + + G+ +T+ V Sbjct: 141 FDGSHEFEIKKWKDVKKGYTHFANGDIAIAKITPCFENSKAAIFSGLKNGIGVGTTELHV 200 Query: 114 LQPKDVLPELLQ---GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV-- 168 +P + + + + A N P+P PPL EQ Sbjct: 201 ARPFSDIINRKYLLLNFKSPNFLKSGESQMTGSAGQKRVPRFFFENNPIPFPPLQEQERI 260 Query: 169 ---------------------------------------LIREKIIAETVRIDTLITERI 189 E++ RI Sbjct: 261 IIRFTQLMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNVEELAENWARISEHFDTLF 320 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKD------------------------------ 219 + KQ ++ V L P + Sbjct: 321 TTEASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQKKAQLVKEGKIKKQKPLPP 380 Query: 220 -SGIEWVGLVPDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 S E +P+ WE + + K+ + L NI + + Sbjct: 381 ISDEEKPFELPEGWEWCRLGSIYNFLNGYAFKSEWFTSVGLRLLRNANIAHGVTNWKDVV 440 Query: 276 KP----ESYETYQIVDPGEIVFR----FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 S I+ +IV I+ + + + + A + Sbjct: 441 HIPNDMISDFENYILSENDIVISLDRPIINTGLKYAIISKSDLPCLLLQRVAKFKNYANT 500 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + +++L ++SY S + + ++ + P EQ I + + Sbjct: 501 VSNSFLTIWLQSYFFINSIDPGRSNGVPHISTKQLEMTLFPLLPQSEQDRIISKTDELIQ 560 Query: 388 RIDVL----VEKIEQSIVLLKERRSSFIA 412 + L + + L + I Sbjct: 561 TCNKLKYIIKTAKQTQLHLADALTDAAIN 589 >gi|282909127|ref|ZP_06316945.1| type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus WW2703/97] gi|282327391|gb|EFB57686.1| type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus WW2703/97] Length = 361 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 45/389 (11%), Positives = 109/389 (28%), Gaps = 30/389 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + K+N+G+ + +E G G + + Sbjct: 1 KKLGDLIKVNSGKDYK-----------HLEKGDIPVYGTGGYMTSVSEP---LSEIDAVG 46 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G + ++ T F K+ + + E + Sbjct: 47 IGRKGTINKPYLLEAPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDESTGVP 102 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + I I +P EQ I E I +I+ + + K Q + S + Sbjct: 103 SLSKQTINKINRFVPSNKEQQKIGEFFIKLDRQIELEEQKLELLQQQKKGYMQKIFSQEL 162 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + + + + N K + +I + ++ Sbjct: 163 RFKDENGNDYPNWEEKKIEDI------ASQVYGGGTPNTKIKEFWNGDIPWIQSSDVKVN 216 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 K S + ++ I I + + V + ++++ Sbjct: 217 DLILRQCNKFISKNSIELSSAKLIPANSIAIVTRVGVGKLCLVEFDYATSQDFLSLSSLK 276 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVET 386 D Y + + Y + K+ + + + +++ + +P ++EQ I + Sbjct: 277 YDKLYSLYSLL-YTMKKISANLQGTSIKGITKKELLDSIIKIPHNLEEQQKIGD----LF 331 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +ID + + I +LK + + Sbjct: 332 YKIDKYISFNKCKIEILKSLKQGLLQKIF 360 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 67/193 (34%), Gaps = 15/193 (7%) Query: 24 HWKVVPIKRFTK-LNTGRTSES------GKDIIYIGLEDVESGTGKYL--PKDGNSRQSD 74 +W+ I+ + G T + DI +I DV+ K + + Sbjct: 174 NWEEKKIEDIASQVYGGGTPNTKIKEFWNGDIPWIQSSDVKVNDLILRQCNKFISKNSIE 233 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ + I + K + +FD S FL L L + Sbjct: 234 LSSAKLIPANSIAIVT-RVGVGKLCLVEFDYATSQDFLSLSSLKYD--KLYSLYSLLYTM 290 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++I A +G ++ K + I + + ++KI +ID I+ IE+ Sbjct: 291 KKISANLQGTSIKGITKKE---LLDSIIKIPHNLEEQQKIGDLFYKIDKYISFNKCKIEI 347 Query: 195 LKEKKQALVSYIV 207 LK KQ L+ I Sbjct: 348 LKSLKQGLLQKIF 360 >gi|255284468|ref|ZP_05349023.1| type I restriction-modification system specificity subunit [Bryantella formatexigens DSM 14469] gi|255264978|gb|EET58183.1| type I restriction-modification system specificity subunit [Bryantella formatexigens DSM 14469] Length = 359 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 49/388 (12%), Positives = 115/388 (29%), Gaps = 34/388 (8%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 ++ GR VE+ GKY P G+ + I + ++ G Sbjct: 3 FNDVLEIKNGRNQRR-----------VENPDGKY-PVYGSGGIMGYADDYICSAETVIIG 50 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSIDVTQRIEAICEGATMSH 148 + G + + T F + ++V +P L + D Q + + T+ Sbjct: 51 RKGSINNPIFVDEPFWNVDTAFGLEAKREVLIPRYLYYFCKHFDFKQ----LNKTVTIPS 106 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + I + +P L+ Q I ++ ++ +I + +E L E +A + Sbjct: 107 LTKSDLLKIEIKLPCLSNQQSIVHRL----QSVEQIIDNYYQQLEKLDELVKARFVEMFG 162 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 + + + + + + G ++ Sbjct: 163 DPVENPHGFRKVALSELAEIKIGPFGSLLHKEDYIEGGHPLLNPSH----IVGGKVVPDS 218 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + K + E Y + ++V T + + + Sbjct: 219 KLTISDKKYDELEAYH-LHTDDVVMGRRGEMGR---CAVVTSEGFLCGTGSLLIRTKGEV 274 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + Y+ + K M G +L V + ++ PPI+ Q + A Sbjct: 275 TADYIQKTISFPSFRKTIEDMAVGQTMPNLNVPIVSKFQIIKPPIEVQKRYYEFV----A 330 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++D +++++ + S + Sbjct: 331 QVDKSKIAVQKALDQTQLLFDSLMQKYF 358 >gi|294619474|ref|ZP_06698918.1| restriction modification system DNA specificity protein [Enterococcus faecium E1679] gi|291594301|gb|EFF25731.1| restriction modification system DNA specificity protein [Enterococcus faecium E1679] Length = 400 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 49/410 (11%), Positives = 123/410 (30%), Gaps = 54/410 (13%) Query: 26 KVVPIKRFTKLNT----GRTSESGKDIIYIGLEDVESGT--GKYLPKDGNSRQSDTSTVS 79 + + +++ + E K I + DV K +P + Sbjct: 16 EWKTLDEIGLISSAGVDKKKIEGEKSIKLLNYMDVYRNMYLIKDIPSMIVTAPDKKIEQC 75 Query: 80 IFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLV----LQPKDVLPELLQGWLLS 130 KG I + L + + G+ + ++ P + + L S Sbjct: 76 NVLKGDIFFTPSSEVLNDIGNSAVALENMYGVVYSYHIMRLRLNNPNIITSMFINYMLGS 135 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 V +I +G T + +PIPPL Q I + T L E Sbjct: 136 EFVQNQINKNAKGLTRFGLTKTQWEKLQIPIPPLNVQEEIVRILDTFTELTAELTAELTA 195 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKN 248 + K++ + + ++ +EW +G + ++ + + Sbjct: 196 ELTARKKQYTYYR--------DKLLTFEEGEVEWKPLGELAENHDSMRKPITSGLREIGD 247 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN--DKRSLR 306 ++ + Y I D ++ + + Sbjct: 248 IPYYGASGIV--------------------DYVKDFIFDGDYLLVSEDGANLLARRTPIA 287 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + + + A++ + Y+ + + S DL + L ++ + Sbjct: 288 FSISGKSWVNNHAHVLKFNTYAERKYIEYYLNSIDLTPYI---SGAAQPKLNQRNLNAIH 344 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + P ++++ I ++++ A + E + + I L ++ R+ ++ Sbjct: 345 IPNPSLEDKERIVSILDKFDALTSSITEGLPREIELRQKQYEYYRNMLLS 394 >gi|225155300|ref|ZP_03723793.1| Restriction endonuclease S subunits-like protein [Opitutaceae bacterium TAV2] gi|224803907|gb|EEG22137.1| Restriction endonuclease S subunits-like protein [Opitutaceae bacterium TAV2] Length = 462 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 60/213 (28%), Gaps = 10/213 (4%) Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE-------SNILSLSY 261 +G + +P+ W + S Sbjct: 245 QGRGKYKPPAAPDTTTLPPLPEGWTWANIEQIGQTTTGFTPPKNNAALFGGSIPFFKPSD 304 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 ++ + L + E +I+ I+ I K L Q I + + Sbjct: 305 LDVGYNVREYRDSLTNKGAEYGRILPALSILVTCIGATIGKTGLARVQCTTNQQINA--L 362 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 V I S ++ W + S + S L + LPV +PP+ EQ I Sbjct: 363 TVPNELILSQFVYWYINSPLGQRQIIDNASATTLPILNKSRFEALPVPLPPLTEQTRIVA 422 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + ID L + ++ R S + Sbjct: 423 EVERRLSVIDELETLVTANLTRATHLRQSILQQ 455 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 72/202 (35%), Gaps = 9/202 (4%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSR 71 + +P+ W I++ + TG T I + D++ G + Sbjct: 261 LPPLPEGWTWANIEQIGQTTTGFTPPKNNAALFGGSIPFFKPSDLDVGY-NVREYRDSLT 319 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLL 129 I IL +G + K +A + Q L + + +L + + ++ Sbjct: 320 NKGAEYGRILPALSILVTCIGATIGKTGLARVQCTTNQQINALTVPNELILSQFVYWYIN 379 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S ++I T+ + +P+P+PPL EQ I ++ ID L T Sbjct: 380 SPLGQRQIIDNASATTLPILNKSRFEALPVPLPPLTEQTRIVAEVERRLSVIDELETLVT 439 Query: 190 RFIELLKEKKQALVSYIVTKGL 211 + +Q+++ +G+ Sbjct: 440 ANLTRATHLRQSILQQTFNEGI 461 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 74/192 (38%), Gaps = 9/192 (4%) Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFID 297 V + + E + + N K L + Q++ G+++ Sbjct: 12 CVDNVEKTGPVGREFVYVDIGSINRETKRIEDAKTLLASKAPSRAKQVLKTGDVLVSMTR 71 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QS 356 + + ++ + I ++ + ++ +S +L + +++ + G + Sbjct: 72 PNLNAVAWVPPEL-DGSIGSTGFHVLRAQNTESKFLFYAVQTNSFIEAMCQKVQGALYPA 130 Query: 357 LKFEDVKRLPVLVPP--IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 ++ D+ +PP + +Q I I + R+D V +++ LK R++ + +A Sbjct: 131 VRPRDISSF--CLPPFSLAQQHRIVAEIEKQFTRLDAGVTALKRVQANLKRNRAAVLKSA 188 Query: 415 VTGQIDLRGESQ 426 G++ + E++ Sbjct: 189 CEGRL-VPTEAE 199 >gi|322515485|ref|ZP_08068471.1| type I restriction/modification specificity protein [Actinobacillus ureae ATCC 25976] gi|322118452|gb|EFX90703.1| type I restriction/modification specificity protein [Actinobacillus ureae ATCC 25976] Length = 386 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 59/414 (14%), Positives = 123/414 (29%), Gaps = 48/414 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + ++ + G+ + + G P G+ ++ Sbjct: 2 EEYKLQDLISIKNGKKYD-----------HLNKGNI---PVYGSGGIMTYVDDYLYDGEA 47 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +L + G T + L + P L +L +++ A+ G+T Sbjct: 48 VLLPRKGTLNNIMYSKGKLWTVDTMYYALVNEKADPYYLYAYLSQLNL----SALDSGST 103 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + +IP+ +P Q I + + +D I + L++ + L Y Sbjct: 104 LPSMTSTAYYSIPVKLPNKKNQQKIAQVL----SSLDRKIALNQQINAELEKMAKTLYDY 159 Query: 206 IVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELNRKN------TK 250 + PD K SG E V VP WEVK + + Sbjct: 160 WFVQFDFPDENGNPYKSSGGEMVYHPELKREVPKGWEVKQIKDIAKTGSGGTPKSTIAEY 219 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 +I ++ G + + + ++ I+ K SL S Sbjct: 220 YENGDIPWINSGELNNPFIIATENYISQLGLENSSAKLFPADSILMAMYGATAGKTSLIS 279 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + A A+ P+ + + S + R +L + +K L V Sbjct: 280 FE----ATTNQAICAIMPNDKQLNFYLKIALSDLYQYLVNLSSGSARDNLSQDKIKDLYV 335 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +VP + T + ++ + + R + + GQ+++ Sbjct: 336 VVPSEEMIEKYAQY----TTKFYNKIKINLKETQKFTQLRDFLLPMLMNGQVEV 385 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 70/211 (33%), Gaps = 14/211 (6%) Query: 10 YKDSGVQWI------GAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDV 56 YK SG + + +PK W+V IK K +G T +S DI +I ++ Sbjct: 174 YKSSGGEMVYHPELKREVPKGWEVKQIKDIAKTGSGGTPKSTIAEYYENGDIPWINSGEL 233 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + + + S+ +F IL G K + F+ + + P Sbjct: 234 NNPFIIATENYISQLGLENSSAKLFPADSILMAMYGATAGKTSLISFEATTNQAICAIMP 293 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 D LS + + G+ + I ++ + +P + Sbjct: 294 NDKQLNFYLKIALSDLYQYLV-NLSSGSARDNLSQDKIKDLYVVVPSEEMIEKYAQYTTK 352 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIV 207 +I + E +F +L L++ V Sbjct: 353 FYNKIKINLKETQKFTQLRDFLLPMLMNGQV 383 >gi|283796924|ref|ZP_06346077.1| type I restriction-modification system, S subunit, EcoA family [Clostridium sp. M62/1] gi|291075334|gb|EFE12698.1| type I restriction-modification system, S subunit, EcoA family [Clostridium sp. M62/1] Length = 411 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 72/406 (17%), Positives = 147/406 (36%), Gaps = 35/406 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + F + T + S + D+ I +D Y K D S + Sbjct: 25 WRAEKLSDFAERITRKNSNNETDLPLTISSKDGLVDQISYFNK--TVASKDMSGYYLLRN 82 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G+ Y K G ST ++ K + ++ + S+ + I Sbjct: 83 GEYAYNKSYSVGYDFGSIKRLDRYPMGALSTLYICFALKKHNTDFIKVYFDSLKWYKEIY 142 Query: 139 AIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 I EGA L E + KI + ++ I + ++ LK+ Sbjct: 143 MISAEGARNHGLLNVPTDEFFATEHYLPENTAEQRKIADFLIALERRIDAQQSLVDNLKK 202 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K+ ++ +I + +G EW + +++R+NT + N++ Sbjct: 203 YKRGVMQHIFRQ------LPSRNGAEW--------TCVRLGDIFKKVSRRNTDGVIKNVI 248 Query: 258 SLS--YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERG 314 + S YG I Q+ + + Y +++ G+ V+ + + E+G Sbjct: 249 TNSAEYGLIPQRDFFDKVIAVDGNTANYYVIENGDFVYNPRKSNSAPYGPFNRYTLSEQG 308 Query: 315 IITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS--LKFED--VKRLPVLV 369 II+ Y V I +YLAW +S + Y GS + + D + +PV+ Sbjct: 309 IISPLYTCLVLQADISPSYLAWYFKSDAWYRYIYDNGSQGVRHDRVSMTDDLLMGIPVMY 368 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P +Q +++++ AR+ + ++++ L + R ++ Sbjct: 369 PSHVKQLLYADILDMVEARL----QATQKTLDFLNKMRDGYMRQLF 410 >gi|270293233|ref|ZP_06199444.1| type Ic restriction-modification system, HsdS subunit [Streptococcus sp. M143] gi|270279212|gb|EFA25058.1| type Ic restriction-modification system, HsdS subunit [Streptococcus sp. M143] Length = 383 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 47/392 (11%), Positives = 114/392 (29%), Gaps = 32/392 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W I K++ G + + + + K D Sbjct: 18 WVEKKIADIVKISAGGDVDKERLKQSGKYPVIANA---LTNKGIVGFYDD----YKVKAP 70 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + G + V + +L + + +I G Sbjct: 71 AVTVTGRGDVGYAVARHENFTPV-----VRLLTLQSDSIDMDYLENQINSMKILNESTGV 125 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 +GN + P + EQ I + + + + K ++S Sbjct: 126 PQ--LTAPQLGNYKVYRPEIDEQSAIGSLFRTLDDLLASY----KDNLTNYQSLKATMLS 179 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + K +++ G E WE K LV+ ++RK K + + Sbjct: 180 KMFPKAGQTIPEIRLDGFE------RKWEKKKLIDLVSPVSRKVKKPSDPYYRLSIRSHA 233 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + K + + V ++V ++ S + + +++ + Sbjct: 234 KGTFKQFVDDPKKIAMDNLFEVKENDLVVNITFAWEHAIAVASKE-DDGLLVSHRFPTFV 292 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS---LKFEDVKRLPVLVPPIKEQFDITNV 381 D ++ ++ + + + G L +D ++ ++VP ++EQ I + Sbjct: 293 IDKSDKNFINIYIKREEFRQKLDLLSPGGAGRNRVLNVKDFIKIQMIVPELEEQQAIGSY 352 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ I L+ + + Sbjct: 353 ----FSNLDNLINSCQEKITQLETLKKKLLQD 380 >gi|227892722|ref|ZP_04010527.1| restriction modification system DNA specificity subunit [Lactobacillus ultunensis DSM 16047] gi|227865499|gb|EEJ72920.1| restriction modification system DNA specificity subunit [Lactobacillus ultunensis DSM 16047] Length = 383 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 52/365 (14%), Positives = 110/365 (30%), Gaps = 37/365 (10%) Query: 29 PIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + G +S K I I + +V+ G + ++ + + Sbjct: 5 TLDTVCDVLNGYAFKSKKYVSTGIRIIRINNVQDGYIEDKTPVFYPKEDEHVSKYRLLAD 64 Query: 85 QILYGKLGPYLRKAIIADFD--GICST--QFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L G R AII D + L ++ VL + L +L S + Sbjct: 65 DVLVSLTGNVGRVAIINDEYLPAALNQRVACLRVKNSKVLKKYLFYFLNSKKFRSDCISS 124 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G + K + + IP + EQ + I + + L Sbjct: 125 ANGIAQKNISTKWLKKYKISIPSIEEQRKKVAILSKLESAIKKKNEQINKINLLA----- 179 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 K +E ++ ++T+ ++ + + I + Sbjct: 180 -----------------KARFVEMFAQEQHVSKMSQACFIITDGTHQSPEFVTKGIPFVF 222 Query: 261 YGNIIQKLETRNMGL-----KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 N+I + ++ G+++ + ++S + Sbjct: 223 VSNLINNQLIYDTQKFIDENTYNKLIKRTPIEKGDLLLSIVGSYGHVAVVKSNKKFLFQR 282 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKE 374 AY+ V P+ +DS YL + + G +++L VK L + +PP+ Sbjct: 283 H-IAYIKVNPNLVDSEYLQSELLDSYVQNQIRMEVHGVAQKTLNLSAVKNLTIKLPPLAS 341 Query: 375 QFDIT 379 Q Sbjct: 342 QKKFA 346 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 65/184 (35%), Gaps = 8/184 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL---KPESYETYQIV 286 + + ++ K+ K + + I + N+ + K + + + + Sbjct: 2 QYLTLDTVCDVLNGYAFKSKKYVSTGIRIIRINNVQDGYIEDKTPVFYPKEDEHVSKYRL 61 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 +++ + + A + VK + YL + + S Sbjct: 62 LADDVLVSLTGNVGRVAIINDEYLPAALNQRVACLRVKNSKVLKKYLFYFLNSKKFRSDC 121 Query: 347 YAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + +G ++++ + +K+ + +P I+EQ ++ ++++ ++K + I + Sbjct: 122 ISSANGIAQKNISTKWLKKYKISIPSIEEQRKKVAIL----SKLESAIKKKNEQINKINL 177 Query: 406 RRSS 409 + Sbjct: 178 LAKA 181 >gi|63146884|emb|CAI79467.1| HsdS-type I specificity subunit [Lactobacillus delbrueckii subsp. lactis] Length = 401 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 68/387 (17%), Positives = 116/387 (29%), Gaps = 27/387 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-FA 82 W+ V + + S + + +D+ G G + + TS I F Sbjct: 18 DWEQVKYGEIFQ-RRSKMGVSTPALPSVEYDDINPGMGTL---NKEPKSKGTSKRGIHFN 73 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G +L+GKL PYL+ + A F+G+ F VL + + + + Sbjct: 74 PGDVLFGKLRPYLKNWLFACFEGVAVGDFWVLTSSKIDHGFTYSLIQAPEFQYIANLSSG 133 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 L+EQ I + I ++ + L Q + Sbjct: 134 SKMPRSDWGLVSNARTFIPTNLSEQKSISSVLFGLDTAITLHEEKKRQLERLKSALLQKM 193 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + K P V+ K W + L L + +S I L Sbjct: 194 FAD---KSGYPAVRFKGFDDIWDQEK-----LNSLVRLHRGLTYSPNNVQDSGIRILRSS 245 Query: 263 NIIQKLETRNMGLKPESYETYQI--VDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITS 318 NI+ I V G+I+ + + + E ++ Sbjct: 246 NILDGQFVMTDDDIFVKSSVVNIPTVKDGDILITAANGSIKLVGKHAIISGISENTAVSG 305 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM---GSGLRQSLKFEDVKRLPVLVPPIKEQ 375 +M V I ++ L + + G+G +LK D+ + V VP EQ Sbjct: 306 GFMLVGSSRI-PDFVNSLFDTSWYQRFIRKYVTGGNGSIGNLKKNDLDKQYVKVPTTSEQ 364 Query: 376 FDITNVINVETARIDVLVEKIEQSIVL 402 I ID L+ I I Sbjct: 365 ERIGEF----FREIDQLI--INNQIKH 385 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 60/182 (32%), Gaps = 10/182 (5%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIV 292 + R + + S+ Y +I + T N K + I +PG+++ Sbjct: 19 WEQVKYGEIFQRRSKMGVSTPALPSVEYDDINPGMGTLNKEPKSKGTSKRGIHFNPGDVL 78 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 F + + G+ + + ID + L+++ + + Sbjct: 79 FGKLRPYLKNWLFACFE----GVAVGDFWVLTSSKIDHGFTYSLIQAPEFQYIANLSSGS 134 Query: 353 LRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + V +P + EQ I++V+ +D + E+ L+ +S+ + Sbjct: 135 KMPRSDWGLVSNARTFIPTNLSEQKSISSVLFG----LDTAITLHEEKKRQLERLKSALL 190 Query: 412 AA 413 Sbjct: 191 QK 192 >gi|53805024|ref|YP_113333.1| type I restriction-modification system S subunit [Methylococcus capsulatus str. Bath] gi|53758785|gb|AAU93076.1| type I restriction-modification system, S subunit, EcoA family [Methylococcus capsulatus str. Bath] Length = 416 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 51/426 (11%), Positives = 125/426 (29%), Gaps = 45/426 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + +L G + DV P +S +DT ++ Sbjct: 4 EWKECSLGDVIELKRGYDLPQKDRLP----GDV--------PLVSSSGVTDTHAKAMVKG 51 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI-CE 142 ++ G+ G + + +T V K P + +L +D + Sbjct: 52 PGVVTGRYGTLGQVFYVEQDFWPLNTTLYVRDFKGNDPRFISYFLRDVDFHAYSDKAAVP 111 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G +H + EQ I + +I+ + + + +A Sbjct: 112 GLNRNHLHQAKVRIP----SDPNEQRAIAHILGTLDDKIELNRRQNETLEAMARALFKAW 167 Query: 203 VSYI--VTKGLNPDVKMKDSGIEW----------------VGLVPDHWEVKPFFALVTEL 244 V D + +G +W +G +P+ W V F + + Sbjct: 168 FVDFEPVRAKCRGDRPVAPTGWQWPQHILDLFPDRLVESELGEIPEGWRVFSFGDVAEQG 227 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKR 303 E Y + ES + V G ++ ++ + Sbjct: 228 KGFVNPSREPGERFTHYSLPAFDAGKMPVIEPGESIKSNKTPVPDGAVLVSKLNPHIPRI 287 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGL---RQSLKF 359 L + R + ++ ++ P + + + S + + +G Q +K Sbjct: 288 WLV-GEAGNRAVCSTEFIVWTPKSPAQSAFVYCLASSPEFVGAMCQLVTGTSNSHQRVKP 346 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + ++ + V ++ + + + + +L + R + + ++G++ Sbjct: 347 DQLREIRVF----AGNENVVETFSKTAEPLMDQFLQNTRQSRILAQLRDTLLPKLISGEL 402 Query: 420 DLRGES 425 ++ Sbjct: 403 RVKDAE 408 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 54/138 (39%), Gaps = 11/138 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +G IP+ W+V + G + S G+ + L ++G + + + Sbjct: 208 LGEIPEGWRVFSFGDVAEQGKGFVNPSREPGERFTHYSLPAFDAGKMPVIEPGESIK--- 264 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDV-LPELLQGWLLS 130 S + G +L KL P++ + + G +CST+F+V PK + S Sbjct: 265 -SNKTPVPDGAVLVSKLNPHIPRIWLVGEAGNRAVCSTEFIVWTPKSPAQSAFVYCLASS 323 Query: 131 IDVTQRIEAICEGATMSH 148 + + + G + SH Sbjct: 324 PEFVGAMCQLVTGTSNSH 341 >gi|229148006|ref|ZP_04276345.1| N-6 DNA methylase [Bacillus cereus BDRD-ST24] gi|228635431|gb|EEK91922.1| N-6 DNA methylase [Bacillus cereus BDRD-ST24] Length = 1009 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 64/398 (16%), Positives = 126/398 (31%), Gaps = 45/398 (11%) Query: 27 VVPI-KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 +V + K + K+ ++ L V + G + +D Q+ + K Sbjct: 636 IVRLGKYIIENTKKVKPADDKERKWVTLG-VSNKDGIVINEDLKPEQTKQ-KYFLVNKND 693 Query: 86 ILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIEAI 140 Y + + FD I S ++V + K+ PE L+ L + +I Sbjct: 694 FCYNPYRINVGSIGLNKFDYENQIISGAYVVFRTKEDELNPEYLEKLLKHDSFRAYVNSI 753 Query: 141 CE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + IGN +P+PP+ Q I + I + Sbjct: 754 ANIGKGVRMNLTFDEIGNFELPLPPMEIQ---------------EEIVREYKKISEVLYG 798 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN--I 256 +A++ DS + G P H + + K+ I+ I Sbjct: 799 SKAILDNWDV----------DSTLFTEGNFPLHNIGDLTINSLYGSSEKSDYEIDGYDII 848 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RG 314 + G KL P + G+++ + + E Sbjct: 849 RIGNIGYCSFKLNDLKRVPLPLKKFKNYELKKGDLLIVRSNGNPKLVGKCAIWQDEIPNA 908 Query: 315 IITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + S + + + Y+ + + S G + E +K +P+ +P Sbjct: 909 VYASYLVRFRFNEEAVVPEYIMYYLMSSVGKSYIKPKAGGGTYNFNAERIKEIPIPLPDK 968 Query: 373 KEQFDITNVINVE---TARIDVLVEKIEQSIV-LLKER 406 + Q I + E +R++ L+ K E+ I LLK+ Sbjct: 969 QTQLSIIERVKSEQETVSRVEKLMIKSEERIKSLLKKY 1006 >gi|315187183|gb|EFU20940.1| restriction modification system DNA specificity domain [Spirochaeta thermophila DSM 6578] Length = 554 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 64/425 (15%), Positives = 127/425 (29%), Gaps = 49/425 (11%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKR--FTKLNTGRTSESGKDIIYIGLEDVES 58 M + Y +PK W+ + + G++ Sbjct: 1 MNKQQNY-------------LPKGWQWAKLGDGQIATVVMGQSPPGTTYNEQGQGLPFYQ 47 Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 G + R ++ G IL P + + I + ++ Sbjct: 48 GKADFGDVSPTPRVWCSAPKKTAEPGDILLSVRAPVGPTNLASHRCCIGRGLAAIRGERN 107 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L L + + + +G+T + NI +P+PPL Q I E + Sbjct: 108 AL--TLYLYFWFKHIEPWLSEQGQGSTFKAIGKDILENIIVPLPPLPVQERIVEILQKAD 165 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 + +R +EL ++ AL + +P K E +G + Sbjct: 166 ----EIRRKRKEALELAEKILPALFLEMFG---DPATNPKGWETEPIGSLVHFDTALIKP 218 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + IESN + + + E + S +++ + Sbjct: 219 EPGKTYLYLAPEHIESNTGNYTGPHPTDGREIGSAKYSFTS---------DHVLYCKLRP 269 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL-RQS 356 +K L GI ++ + ++P +LA +R G Sbjct: 270 YLNKVVLPHTS----GICSTELVPLRPGPKLLREFLAIYLRLPFFVATAVQKSQGTKMPR 325 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE---KIEQSIVLLKERRSSFIAA 413 E +K+ ++VPPI Q + L+E K+++ + L ++ Sbjct: 326 FGPELMKQERIIVPPIPLQR-------SFCLQASQLMEASRKLKEGLSLSSSCFDGLLSR 378 Query: 414 AVTGQ 418 A TG+ Sbjct: 379 AFTGE 383 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 45/196 (22%), Positives = 71/196 (36%), Gaps = 6/196 (3%) Query: 22 PKHWKVVPIKRFTKLNTG-RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 PK W+ PI +T E GK +Y+ E +ES TG Y + S Sbjct: 197 PKGWETEPIGSLVHFDTALIKPEPGKTYLYLAPEHIESNTGNYTGPHPTDGREIGSAKYS 256 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEA 139 F +LY KL PYL K ++ GICST+ + L+P L +L Sbjct: 257 FTSDHVLYCKLRPYLNKVVLPHTSGICSTELVPLRPGPKLLREFLAIYLRLPFFVATAVQ 316 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 +G M + + + +PP+ Q + ++ + + L Sbjct: 317 KSQGTKMPRFGPELMKQERIIVPPIPLQRSFC----LQASQLMEASRKLKEGLSLSSSCF 372 Query: 200 QALVSYIVTKGLNPDV 215 L+S T L + Sbjct: 373 DGLLSRAFTGELTAEW 388 >gi|212691981|ref|ZP_03300109.1| hypothetical protein BACDOR_01476 [Bacteroides dorei DSM 17855] gi|212665373|gb|EEB25945.1| hypothetical protein BACDOR_01476 [Bacteroides dorei DSM 17855] Length = 391 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 56/404 (13%), Positives = 128/404 (31%), Gaps = 54/404 (13%) Query: 24 HWKVVPIKRF-TKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ I K+ +G T G+ ++ ++V G G+ L D DT Sbjct: 25 EWEKCTIGELTIKVGSGVTPRGGEAVYKTEGHPFVRSQNV--GLGQLLLDDIAYIDEDTH 82 Query: 77 TVSI---FAKGQILYGKLGPYLRKAIIADFD---GICSTQ-FLVLQPKDVLPELLQGWLL 129 +L G + ++ IA + G + ++ +++ L +LL Sbjct: 83 QRQKNTELQLDDVLLNITGASIGRSAIATKEIAGGNVNQHVCIIRTQDNLISSFLCNFLL 142 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S ++I++ G +++ I +I + IP + EQ I + + RI T Sbjct: 143 SSYGQKQIDSFQAGGNRQGLNFEQIKSIKIAIPTVNEQYKIAQLLQLVEGRIATQNKIIE 202 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 +L +++++ ++ + ++ I V ++ N Sbjct: 203 DLKKL-----KSVITDLLFNSIIDAHTIRLGNI------------AHITNGVGDVQDANI 245 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + IE+ ++ T + + Y + Sbjct: 246 EHIENWYPFFDRSEELKWFPTYSFDKEAVIY-----------------AGEGQSFYPRYY 288 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + Y + S SL+ + ++ + + Sbjct: 289 KGKFALHQRCYAITDFASCILPKYCYYFMSTLNSYFVRNSVGSTVSSLRMDIFQKAEIKL 348 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 PPI +Q I +I+ ++ E + I L+E + ++ Sbjct: 349 PPIPKQQHICKIIDAFCTKL----EVEQSIISTLQELKQFLLSQ 388 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 73/207 (35%), Gaps = 10/207 (4%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+++ + EW + VT + E + S + +L + Sbjct: 15 PNLRFPEFSGEW-EKCTIGELTIKVGSGVTPRGGEAVYKTEGHPFVRSQNVGLGQLLLDD 73 Query: 273 MGLKPESYETYQI---VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + E Q + +++ + ++ + ++ + + + Sbjct: 74 IAYIDEDTHQRQKNTELQLDDVLLNITGASIGRSAIATKEIAGGNVNQHVCIIRTQDNLI 133 Query: 330 STYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 S++L + S K + G RQ L FE +K + + +P + EQ+ I ++ Sbjct: 134 SSFLCNFLLSSYGQKQIDSFQAGGNRQGLNFEQIKSIKIAIPTVNEQYKIAQLL----QL 189 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + + I LK+ + S I + Sbjct: 190 VEGRIATQNKIIEDLKKLK-SVITDLL 215 >gi|18765813|gb|AAL78769.1|AF326619_1 HP848-like protein [Helicobacter pylori] Length = 413 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 57/405 (14%), Positives = 122/405 (30%), Gaps = 31/405 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 PK + + ++ R K I +G Y+ Sbjct: 13 PKGVEFRKLGEMCEILDNRRIPIAKNKRNPGIYPYYGANGIQDYIDSYIFDGDFV----- 67 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + + + K A + VLQ K+ L + + + Sbjct: 68 LVGEDGSVINKDNT--PVVNWASGKIWVNNHAHVLQTKNELKLKFLYF----YLQTIDVS 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 C T + + + I +PIPPL Q I + + A T L TE + + + Sbjct: 122 YCVAGTPPKINQENLKKITIPIPPLEIQQEIVKILDAFTELNTELNTELKARKKQYQYYQ 181 Query: 200 QALVS--------YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 L+ + L K L P + + N+K K+ Sbjct: 182 NMLLDFNDINQSHKDAKERLAQKPYPKRLKTLLQTLAPKGVGFRKLGEVCESTNKKTLKI 241 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 E + + + G + + GE + + + Sbjct: 242 SEVSEVKNKGMYPVINSGRDLYGYYHDFN------NDGENITIASRGEYAGFINYFNEKF 295 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 G + Y + + + +L + +++ ++ + + G +L D++ L + +PP Sbjct: 296 FAGGLCYPYKVKDTNELLTKFLYFYLKTNEIQIMENLVFRGSIPALNKADIETLTIPIPP 355 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ Q +I +++ + L+ I I K+ R + Sbjct: 356 LEIQQEIVTILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 400 >gi|294782549|ref|ZP_06747875.1| type I restriction-modification enzyme S subunit [Fusobacterium sp. 1_1_41FAA] gi|294481190|gb|EFG28965.1| type I restriction-modification enzyme S subunit [Fusobacterium sp. 1_1_41FAA] Length = 371 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 62/372 (16%), Positives = 120/372 (32%), Gaps = 37/372 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK V + ++ TG T ++ +I +++ Y+ + + Sbjct: 7 NEWKKVKLGDVCEVITGNTPLKKIKEYWDKDEVPFITPPELKYEGINYITPNIYVSKIGA 66 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 I K I +G L K I D I + Q L KD +LL + + Sbjct: 67 KQGRIIPKNSICVCCIGS-LGKLGILKEDAITNQQINSLILKDKNVDLLYLYFYLKTIKN 125 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +E+I T+ + I + +P L Q I +K+ ++ I R + L Sbjct: 126 NLESIASSTTVKIINKSSFEKIDINLPSLEIQKKISKKL----ELLENNINFRKSQLNSL 181 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 E ++L + G+ + D +G P + K Sbjct: 182 NELSKSLFTKFNKNGVEKQ--LNDVADIIMGQSPLSQSYNKDKKGLPFYQGKTEFSDIYI 239 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + Y N K +V+ +I+ D ++ Sbjct: 240 KEATVYCNSPIK-----------------VVEENDILMSVRAPVGDV-----NIATQKSC 277 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I ++KP ID YL +L++ +GS +++ ++ L + + +Q Sbjct: 278 IGRGLASIKPKKIDYLYLFYLLKEQKSKIEKIGVGS-TFKAINKNNISTLKISIVEKDKQ 336 Query: 376 FDITNVINVETA 387 I N ++ Sbjct: 337 NKIRNYLSSIEK 348 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 18/147 (12%), Positives = 51/147 (34%), Gaps = 8/147 (5%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 T N+ + + +I+ I I L+ + + I + + +K Sbjct: 53 NYITPNIYVSKIGAKQGRIIPKNSICVCCIGSLGKLGILKEDAITNQQINS---LILKDK 109 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +D YL + +++ + S + + +++ + +P ++ Q I+ + Sbjct: 110 NVDLLYLYFYLKTIK-NNLESIASSTTVKIINKSSFEKIDINLPSLEIQKKISKKLE--- 165 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ + + + L E S Sbjct: 166 -LLENNINFRKSQLNSLNELSKSLFTK 191 >gi|167837439|ref|ZP_02464322.1| Restriction endonuclease S subunits [Burkholderia thailandensis MSMB43] Length = 462 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 58/454 (12%), Positives = 130/454 (28%), Gaps = 76/454 (16%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W + P+ + + G+ ++ G+G N S G Sbjct: 10 WPIKPLGKTLPIEYGKALP----------ANLRDGSGIVPVYGSNGIAGRHSRA--LTSG 57 Query: 85 -QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 +L G+ G + + + T + + L + + +A+ + Sbjct: 58 QTLLIGRKGGAGIAHLSREACWVIDTAYYTVDDSVYDLSFACYLLQFLRL----DALDKS 113 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 T+ P+P EQ +I EK+ +DT + R L+ + +++ Sbjct: 114 TTIPSLSRDDYNATLAPVPTKDEQRIIVEKLDELFSDVDTGVASLSRAYGNLRRYRASVL 173 Query: 204 SYIVTKGL-------------------------------------------------NPD 214 + L + Sbjct: 174 KMALEGRLTVDWRSNNPSTSTGEQLLSRILTARRDEWEREQSERFSAQGKRPPKNWRDKY 233 Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + ++ + +P W L + + + Q + T + Sbjct: 234 AEPAGPDVKNLPELPAGWCWATLQQLTGTITSGSRGWAKYYSNDGPIFIRSQDINTDLLN 293 Query: 275 LKP--------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + +S +V G+++ K + + + E + +A Sbjct: 294 IDSVAHVNPPKDSEGGRTLVRLGDLLITITGANVAKCAEVTCHIDEAYVSQHVALARPVL 353 Query: 327 GIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 S YL + S ++ + L + V + V +P EQ I +I+ Sbjct: 354 PEISRYLHACLTCESQGRRQLLKFAYGAGKPGLNLQQVASVVVPLPTFSEQSQIVQLIDE 413 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + A + ++E +V ++ R S + AA G+ Sbjct: 414 QLAAHTRIEGQLEHDVVRARQLRQSILKAAFEGK 447 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/215 (11%), Positives = 69/215 (32%), Gaps = 11/215 (5%) Query: 14 GVQWIGAIPKHWKVVPIKRF-TKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDG 68 V+ + +P W +++ + +G S I+I +D+ + Sbjct: 240 DVKNLPELPAGWCWATLQQLTGTITSGSRGWAKYYSNDGPIFIRSQDINTDLLNIDSVAH 299 Query: 69 NSRQSDTS-TVSIFAKGQILYGKLGPYLRKAI---IADFDGICSTQFLVLQPKDVLPELL 124 + D+ ++ G +L G + K + S + +P Sbjct: 300 VNPPKDSEGGRTLVRLGDLLITITGANVAKCAEVTCHIDEAYVSQHVALARPVLPEISRY 359 Query: 125 QGWL--LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 +++ GA + + + ++ +P+P +EQ I + I + Sbjct: 360 LHACLTCESQGRRQLLKFAYGAGKPGLNLQQVASVVVPLPTFSEQSQIVQLIDEQLAAHT 419 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 + + + ++ +Q+++ L + Sbjct: 420 RIEGQLEHDVVRARQLRQSILKAAFEGKLTSAEHL 454 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 22/123 (17%), Positives = 47/123 (38%), Gaps = 4/123 (3%) Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + +I +AY V D ++ +L++ ++ S SL +D Sbjct: 67 GAGIAHLSREACWVIDTAYYTVDDSVYDLSFACYLLQ---FLRLDALDKSTTIPSLSRDD 123 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 VP EQ I ++ + +D V + ++ L+ R+S + A+ G++ + Sbjct: 124 YNATLAPVPTKDEQRIIVEKLDELFSDVDTGVASLSRAYGNLRRYRASVLKMALEGRLTV 183 Query: 422 RGE 424 Sbjct: 184 -DW 185 >gi|227114145|ref|ZP_03827801.1| restriction modification system DNA specificity subunit [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 566 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 66/481 (13%), Positives = 124/481 (25%), Gaps = 96/481 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 +K K P+ S + +P+ W+ + T + + + E+ S Sbjct: 83 IKKQKPQPEI--SEDEKPFELPEGWEFCRLGD-------ATINRDAERVPLSSEERSSRQ 133 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQ 115 G+Y G S D +F K +L G+ G A IAD + VL Sbjct: 134 GQY-DYYGASGIIDKIDDFLFDKPLLLIGEDGANLINRTTPIAFIADGRYWVNNHAHVL- 191 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 D + E ++ + +E G + + I + + P EQ I K+ Sbjct: 192 --DGVSEGFLKYVGLYINSINLEQYITGTAQPKMNQAKMNTILLGLAPEKEQQRILSKVD 249 Query: 176 AETVR-----------------------------------------IDTLITERIRFIEL 194 I Sbjct: 250 ILMSLCDQLAQQSLTSLEAHQQLVETLLATLIDSQNAEELAENWARISQHFDTLFTTEAS 309 Query: 195 LKEKKQALVSYIVTKGLNPDV-------------------------KMKDSGIEWVGLVP 229 + KQ ++ V L P K +E +G Sbjct: 310 IDALKQTILQLAVMGKLVPQDANDEPASELLKRIEQEKIQLVKEGKIKKHPPVEPLGEPT 369 Query: 230 DHWEVK-----------PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 + RK + S N + + + Sbjct: 370 SLPHSWLNIVVQDFADIRLGSTPDRSERKYWNGDVPWVSSGEVANEVILDTKEKITSEGF 429 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + ++ G ++ I + + A ++ Y+ + Sbjct: 430 KNSSTSMIPTGSLLMAIIGQGKTRGQTAVLGIDACTNQNVAAFVFNQALVEPEYVWIWAK 489 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 S L G G + +L + V+ + PIKEQ I + + D L +++ Sbjct: 490 SKYLSHRGDGHG-GAQPALNGKKVRSFIFPLAPIKEQQRIVSEVKRLNDICDALKSRLQS 548 Query: 399 S 399 + Sbjct: 549 A 549 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 67/219 (30%), Gaps = 19/219 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGA---IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYI 51 +K + V+ +G +P W + ++ F + G T + + D+ ++ Sbjct: 356 IKKHPP--------VEPLGEPTSLPHSWLNIVVQDFADIRLGSTPDRSERKYWNGDVPWV 407 Query: 52 GLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICST 109 +V + + S S+ S+ G +L +G + + D + Sbjct: 408 SSGEVANEVILDTKEKITSEGFKNSSTSMIPTGSLLMAIIGQGKTRGQTAVLGIDACTNQ 467 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 L E W+ + G + K + + P+ P+ EQ Sbjct: 468 NVAAFVFNQALVEPEYVWIWAKSKYLSHRGDGHGGAQPALNGKKVRSFIFPLAPIKEQQR 527 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I ++ D L + + AL + Sbjct: 528 IVSEVKRLNDICDALKSRLQSAQQTQLHLADALTDAALN 566 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 58/189 (30%), Gaps = 10/189 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE + + L S G + + + Sbjct: 93 SEDEKPFELPEGWEFCRLGDATINRDAERVPLSSEERSS-RQGQYDYYGASGIIDKIDDF 151 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++ GE I+ + + A++ Y+ + S Sbjct: 152 LFDKPLLLIGEDGANLINRTTPIAFIA---DGRYWVNNHAHVLDGVSEGFLKYVGLYINS 208 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +L + + + + + + + P KEQ I + +++ + D L +QS Sbjct: 209 INLEQYI---TGTAQPKMNQAKMNTILLGLAPEKEQQRILSKVDILMSLCDQL---AQQS 262 Query: 400 IVLLKERRS 408 + L+ + Sbjct: 263 LTSLEAHQQ 271 >gi|190150795|ref|YP_001969320.1| restriction-modification enzyme [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|189915926|gb|ACE62178.1| Putative restriction-modification enzyme [Actinobacillus pleuropneumoniae serovar 7 str. AP76] Length = 416 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 52/425 (12%), Positives = 111/425 (26%), Gaps = 72/425 (16%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 V ++ L GR +I ++ + L + +G+ Sbjct: 2 WVRLEDIFHLQAGR---------FISASEIYGEYKESLYPCYGGNGLRGFVKTYNREGKF 52 Query: 87 -LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + G+ G A+ + +V++ L + + + Sbjct: 53 PIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYF---LIQLNLNQYATATA 109 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QA 201 I ++ +P+PPL EQ I KI I+ + + L ++ ++ Sbjct: 110 QPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQFPEQLKKS 169 Query: 202 LVSYIVTKGLNPDVKM-------------------------------------------- 217 ++ + L Sbjct: 170 ILQAAIQGKLTEQNPNDEPASALIERIKAEKLRLIAEKKLKKPKVISEIIMRDNLPYEII 229 Query: 218 ----KDSGIEWVGLVPDHWEVKPFFALVTE------LNRKNTKLIESNILSLSYGNIIQK 267 + E +P++W + + I L G++ Sbjct: 230 NGEERCIADEVPFEIPENWCWVRLGEIGNWGAGATPNRHEPKYYENGTIPWLKTGDLNDG 289 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + T E V + I + +E + + G Sbjct: 290 IITEIPEYITELAIEKTSVKLNPVGSVLIAMYGATIGKLGILNIEATTNQACCACIPYTG 349 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 I + YL + + S + GSG + ++ E + +PP+ EQ I I + Sbjct: 350 IYNKYLFYYLMSQKTELQKRSEGSG-QPNISKEKIVNYLFPLPPLNEQKCIVEKIETLFS 408 Query: 388 RIDVL 392 + L Sbjct: 409 TLQNL 413 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 60/171 (35%), Gaps = 8/171 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQ 72 IP++W V + G T + I ++ D+ G +P+ Sbjct: 243 EIPENWCWVRLGEIGNWGAGATPNRHEPKYYENGTIPWLKTGDLNDGIITEIPEYITELA 302 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + ++V + G +L G + K I + + + P + + L Sbjct: 303 IEKTSVKLNPVGSVLIAMYGATIGKLGILNIEATTNQACCACIPYTGIYNKYLFYYLMSQ 362 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 T+ + EG+ + + I N P+PPL EQ I EKI + Sbjct: 363 KTELQKRS-EGSGQPNISKEKIVNYLFPLPPLNEQKCIVEKIETLFSTLQN 412 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 46/131 (35%), Gaps = 9/131 (6%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 F I Q + + A + D+ + + + +L + + Sbjct: 52 FPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYFLIQLNLNQY---ATAT 108 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-----KERR 407 + L + + + +PP+ EQ I I I+ + E+ + L ++ + Sbjct: 109 AQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQY-AEKEEKLTALHQQFPEQLK 167 Query: 408 SSFIAAAVTGQ 418 S + AA+ G+ Sbjct: 168 KSILQAAIQGK 178 >gi|322513994|ref|ZP_08067069.1| type I restriction/modification specificity protein [Actinobacillus ureae ATCC 25976] gi|322120220|gb|EFX92178.1| type I restriction/modification specificity protein [Actinobacillus ureae ATCC 25976] Length = 386 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 53/388 (13%), Positives = 116/388 (29%), Gaps = 38/388 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + P+ + + + G N+ Q + +G+ Sbjct: 18 EWKPLDEVANIANNVRKPVKS---SLRIS------GNIPYYGANNIQDYVEGYT--HEGE 66 Query: 86 ILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G A + V+ K+ L L+ A Sbjct: 67 FVLIAEDGSASLENYSIQWAVGKFWANNHVHVVNGKEKLNNRFLYHYLTNMNFIPFLA-- 124 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G ++ + IP+PIPPL+ Q I + + A T L +E I + + ++ Sbjct: 125 -GKELAKLTKAKLQQIPIPIPPLSVQTEIVKILDALTALTSELTSELILRQKQYEYYREK 183 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+S ++ G EW L E+ + + NI L Sbjct: 184 LLSE---------EELGKVGFEWRNLG----EICKKVSSGGTPLSTKDEYYNGNIPWLRT 230 Query: 262 GNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + + + + ++ + ++ + Sbjct: 231 QEVQFNEIWDTEVKITQDGLNNSSAKWIPENCVIVAISGATAGRSAINKIALT----TNQ 286 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 ++ + Y + ++G G R L +K P+ +PP+KEQ I Sbjct: 287 HCCNLQIAHEYANYRYVFHWVCKEYEKLKSLGQGARADLNSGIIKNYPIALPPLKEQHRI 346 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKER 406 ++++ + + E + +I ++R Sbjct: 347 VSILDKFETLTNSITEGLPLAIEQSQKR 374 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 13/129 (10%), Positives = 45/129 (34%), Gaps = 3/129 (2%) Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ + + V + ++ +++ +L + + + Sbjct: 68 VLIAEDGSASLENYSIQWAVGKFWANNHVHVVNGKEKLNNRFLYHYLTNMNFIPFLAGKE 127 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 L ++++P+ +PP+ Q +I +++ TA L ++ + R Sbjct: 128 ---LAKLTKAKLQQIPIPIPPLSVQTEIVKILDALTALTSELTSELILRQKQYEYYREKL 184 Query: 411 IAAAVTGQI 419 ++ G++ Sbjct: 185 LSEEELGKV 193 >gi|46449531|gb|AAS96182.1| type I restriction-modification enzyme, S subunit [Desulfovibrio vulgaris str. Hildenborough] Length = 339 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 45/94 (47%), Gaps = 5/94 (5%) Query: 333 LAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L + S+ + ++ + + + +K P+L+PP+ EQ I +++ D Sbjct: 42 LKQIFNSFRFEQYVKSVQTETAVPHISAQQIKEFPILLPPLTEQKKIARILSTW----DK 97 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 +E +++ I K+++ + + +TG+ L G S Sbjct: 98 AIETVDKLIENSKQQKKALMQQLLTGKKRLPGFS 131 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 41/297 (13%), Positives = 99/297 (33%), Gaps = 12/297 (4%) Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 S Q ++++ + H + I P+ +PPL EQ I + D I Sbjct: 44 QIFNSFRFEQYVKSVQTETAVPHISAQQIKEFPILLPPLTEQKKIARIL----STWDKAI 99 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + IE K++K+AL+ ++T + +G + Sbjct: 100 ETVDKLIENSKQQKKALMQQLLTGKKRLPGFSGEWKEVRLGDLFQVTIGGTPSRKNNAYW 159 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + N + + +++ ++ F + Sbjct: 160 DQLKASGNKWVAISDLKNKFLVETNEYITDAGAANSNVKLIPRLTVIMSFKLTIGKRAIT 219 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 ++ I A++ + ID+ + + DL + G +++ + ++ Sbjct: 220 KTQCYTNEAIC--AFIPKHKNEIDTNFFYHHLGIIDLVQDVDQAVKG--KTINKSKIMKI 275 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +P + EQ I I + + ++ I L+KE + + + +TG+ ++ Sbjct: 276 RTKLPNLLEQIAIAQRIEAFDLQ---QEDYLKTRIFLVKE-KQALMQQLLTGKRRVK 328 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 66/194 (34%), Gaps = 15/194 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD----------IIYIGLEDVESGTGKYLPKDGNSRQS 73 WK V + ++ G T + ++ + D+++ + + Sbjct: 133 EWKEVRLGDLFQVTIGGTPSRKNNAYWDQLKASGNKWVAISDLKNKFLVETNEYITDAGA 192 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S V + + ++ + K I + PK + + + Sbjct: 193 ANSNVKLIPRLTVIMS-FKLTIGKRAITKTQCYTNEAICAFIPKHKNEIDTNFFYHHLGI 251 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ + + + I I +P L EQ+ I ++I A ++ + ++ Sbjct: 252 IDLVQDVDQAVKGKTINKSKIMKIRTKLPNLLEQIAIAQRIEAFDLQ----QEDYLKTRI 307 Query: 194 LLKEKKQALVSYIV 207 L ++KQAL+ ++ Sbjct: 308 FLVKEKQALMQQLL 321 >gi|330506384|ref|YP_004382812.1| restriction modification system DNA specificity domain-containing protein [Methanosaeta concilii GP-6] gi|328927192|gb|AEB66994.1| restriction modification system DNA specificity domain protein [Methanosaeta concilii GP-6] Length = 436 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 57/167 (34%), Gaps = 13/167 (7%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYET------YQIVDPGEIVFRFIDLQND 301 + + I + N+ S + PG++VF Sbjct: 54 SRDYSDQGIPVIRGSNLNNGRFLDMNEFVYVSDSKVRKDLSGNLAKPGDLVFTQRGTLGQ 113 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVF-YAMGSGLRQSLK 358 + + +R +++ + M + D +L + S ++ S + Sbjct: 114 VAIIPKEGISDRYVVSQSQMKLTVDDTKADQFFLYYYFSSREVIDRITNFTSSSGVPHIN 173 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 ++ + VPP++ Q I ++++ D L+E + I LL++ Sbjct: 174 LTVLRNFEIPVPPLEIQKSIASILSA----YDDLIENNRRRIQLLEQ 216 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 52/422 (12%), Positives = 117/422 (27%), Gaps = 35/422 (8%) Query: 24 HWKVVPIKRFT-----KLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 W ++ G + I I ++ +G + + Sbjct: 26 SWPRKKLELLAADEPYSFVGGPFGSKLTSRDYSDQGIPVIRGSNLNNGRFLDMNEFVYVS 85 Query: 72 QSDTSTV---SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--------L 120 S ++ G +++ + G + AII S +++V Q + Sbjct: 86 DSKVRKDLSGNLAKPGDLVFTQRGTLGQVAIIPKEG--ISDRYVVSQSQMKLTVDDTKAD 143 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 L + S +V RI + + H + + N +P+PPL Q I + A Sbjct: 144 QFFLYYYFSSREVIDRITNFTSSSGVPHINLTVLRNFEIPVPPLEIQKSIASILSAYDDL 203 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 I+ + + + ++ G M W E+ Sbjct: 204 IENNRRRIQLLEQAARLLYREWFVHLRFPGHEHVRIMDGVPEGWERKT-AFDEMDILSGG 262 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ-IVDPGEIVFRFIDLQ 299 + + + + L E + P + +F Sbjct: 263 TPKTGVPDYWNGDIPFFTPKDSMDYAYALATEKRLTEEGLRNCNSKLYPKDTIFITARGT 322 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 K +L + + A+ + Y + + + + ++ Sbjct: 323 VGKINLA----QTAMAMNQSCYALIGKPPLNQYYLYFALVDGVEQFRSRAVGAVFDAIIR 378 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 E ++P +VP I I ++ + I +L + R + + G+I Sbjct: 379 ETFNQIPFIVPD----DKIIQSFTEHVVPIIKQIDVLSTEIRMLTQARDLLLPRLMNGEI 434 Query: 420 DL 421 + Sbjct: 435 KI 436 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 50/169 (29%), Gaps = 9/169 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLED-VESGTGKYLPKDGNSRQS 73 +P+ W+ + +G T ++ DI + +D ++ K Sbjct: 243 VPEGWERKTAFDEMDILSGGTPKTGVPDYWNGDIPFFTPKDSMDYAYALATEKRLTEEGL 302 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 ++ K I G K +A + L K L + +D Sbjct: 303 RNCNSKLYPKDTIFITARGTV-GKINLAQTAMAMNQSCYALIGKPPLN-QYYLYFALVDG 360 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 ++ + GA + IP +P E ++ +ID Sbjct: 361 VEQFRSRAVGAVFDAIIRETFNQIPFIVPDDKIIQSFTEHVVPIIKQID 409 >gi|332201352|gb|EGJ15422.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA47368] Length = 331 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 42/349 (12%), Positives = 87/349 (24%), Gaps = 24/349 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + G + +D G E + + N I G Sbjct: 2 KKVKLGEVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSTEINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSG-TLGVFQWRGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI +P L EQ I ++ + I + Sbjct: 120 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNL------------ 167 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 L + G + D+ + + E L L+ N+ Sbjct: 168 -----LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKNVT 222 Query: 266 QKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + + + + ++ +IV + + I S + Sbjct: 223 KNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSGMV 282 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 ++P + +++ + + L +K+ P Sbjct: 283 ILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKYFSPFP 330 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 41/142 (28%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + + + IV+ G+I+ + ++ V I Sbjct: 39 TSTEINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWRGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 >gi|322411066|gb|EFY01974.1| Type I restriction-modification system specificity subunit [Streptococcus dysgalactiae subsp. dysgalactiae ATCC 27957] Length = 381 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 48/391 (12%), Positives = 106/391 (27%), Gaps = 28/391 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + ++ G++ S + G R T G Sbjct: 18 WEERKLGEVAEVTMGQSPSSTNYTANPSDYILVQGNADLKNGYVFPRVWTTQITKTADAG 77 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ P A E L L + + + G+ Sbjct: 78 DLIISVRAPVGDVA-----KTAFDVVLGRGVAGIKGNEFLFQTLSKLKKDGYWKRLSTGS 132 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T + + I + + IP L EQ I + R ++LLKE K+ + Sbjct: 133 TFESINSEDIKSTIIQIPSLPEQESIGNFFRQLDDLLT----LHERKLDLLKEHKKTYLR 188 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + ++ E E+ K +K+ ++ Y + Sbjct: 189 LLFPAKGQKVPALRFDSFEGDWEEKKVGEIFKVTRGQVLSATKVSKIKDNKNQYPVYSSQ 248 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 Q +G E + I + + + + + Sbjct: 249 TQNNGL--LGYYSECLFSDAI---------TWTTDGANAGTVNFRKGKFYSTNVNGVLLS 297 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 G + +A ++ S + L + + + +P + EQ I N Sbjct: 298 ESGYANKMVAEILNSVAWKF----VSKVGNPKLMNNVMSEITLSLPSLPEQEAIGNF--- 350 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +D + ++E + L +++ + Sbjct: 351 -FSTLDEEITQVESKLASLNAMKATLLRKIF 380 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 18/182 (9%), Positives = 50/182 (27%), Gaps = 12/182 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS--IF 81 W+ + K+ G+ + K + ++ +Y ++ + Sbjct: 209 DWEEKKVGEIFKVTRGQVLSATK------VSKIKDNKNQYPVYSSQTQNNGLLGYYSECL 262 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I + G + VL + + +L+ + + + Sbjct: 263 FSDAITWTTDGANAGTVNFRKGKFYSTNVNGVLLSESGYANKMVAEILNSVAWKFVSKVG 322 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 M++ + I + +P L EQ I I + ++ + + Sbjct: 323 NPKLMNNV----MSEITLSLPSLPEQEAIGNFFSTLDEEITQVESKLASLNAMKATLLRK 378 Query: 202 LV 203 + Sbjct: 379 IF 380 >gi|91773198|ref|YP_565890.1| restriction modification system DNA specificity subunit [Methanococcoides burtonii DSM 6242] gi|91712213|gb|ABE52140.1| Restriction modification system DNA specificity subunit [Methanococcoides burtonii DSM 6242] Length = 351 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 53/397 (13%), Positives = 129/397 (32%), Gaps = 57/397 (14%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK + LN G++ K + +P ++ + ++ Sbjct: 5 WKKCKLGDVLVLNYGKSLPERKRVE------------GKIPVYSSAGLTGYHNETLVNSE 52 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ G+ G + T + +L + + ++ + T +E + E + Sbjct: 53 GLIIGRKGTVGKIYYSKTPFFCIDTAYYILPEET---KYYLNFIYYLLKTIGLEELNEDS 109 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + + +PPL EQ I + + +ID L + ++ Sbjct: 110 AVPRLNRNTAYSQDILLPPLPEQRAIASVLSSLDDKIDLLHRQNKTLEA---------MA 160 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + + + + +G V K + E GNI Sbjct: 161 ETLFRQWFEEEADEGWEEGTLGDVASFHNGKKRPDDIIE------------------GNI 202 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 +G +S V G + L ++ + I +A +A Sbjct: 203 PIYGGNGILGYSDKSNNEGVTVIIGRVGAYCGSLYIERNPV--------WISDNALVAKP 254 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + S++L +L++S L ++ L +K + +++PP I + Sbjct: 255 INKEHSSFLFFLLKSLQLNEIAE---GSSHPLLTQNLLKSIQIILPPE---HRIEPFVYQ 308 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + ++K + I L++ R + ++ ++G+ + Sbjct: 309 ADTWFNK-IDKNNKQIRTLEKLRDTLLSKLMSGEARV 344 >gi|289550024|ref|YP_003470928.1| Type I restriction-modification system, specificity subunit S [Staphylococcus lugdunensis HKU09-01] gi|289179556|gb|ADC86801.1| Type I restriction-modification system, specificity subunit S [Staphylococcus lugdunensis HKU09-01] Length = 391 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 49/400 (12%), Positives = 110/400 (27%), Gaps = 40/400 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W +K G++ + I ++ + G K + + Sbjct: 19 EWVRKKLKNIASFGKGKSLSKKDISKEGHPCILYGELYTKYGPITTKVYSKTNKLDKKLV 78 Query: 80 IFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 K Q+L G I I ++ PK + ++ Sbjct: 79 YSEKNQVLIPSSGETDIDIATATCINISEKIIIGGDLNIITPKIADGRFISLYINGKGKY 138 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + + + ++EQ I +I+ + + Sbjct: 139 NLAKYAQGKSVVHLYNSDIKKLEFFLPKEISEQEKIGNFFSKLDRQIELEEQKLELLKQQ 198 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K Q + S + + G W K + ++ K + Sbjct: 199 KKGYMQKIFSQEIKFK------------DENGNDYPEWIEKTIEEVTKYISSKKSSNQYI 246 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 +L + ++ + + E Y + ++L+ K S+ Sbjct: 247 ENNTLGSYPVYDAIQEIAKDSQYDMEEPYISILKDGAGVGRLNLRAGKSSVI-------- 298 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-K 373 + P ID +L + M+ + K L ++D + + +P Sbjct: 299 ---GTMGYLLPKYIDIQFLYYRMKLLEFKKYII---GSTIPHLYYKDYSKEKLKIPSSSD 352 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I + ++D +EK + LK+R+ + Sbjct: 353 EQKKIGTSL----KKLDDYIEKQSSKVEFLKQRKQGLLQK 388 >gi|32455448|ref|NP_862562.1| hypothetical protein pSRQ800_03 [Lactococcus lactis] gi|14251229|gb|AAK57812.1|U35629_2 HsdS [Lactococcus lactis] Length = 387 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 51/403 (12%), Positives = 124/403 (30%), Gaps = 42/403 (10%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W++ + ++ G++ S + G R Sbjct: 15 KVPELRFKGFTDEWELRKLGDEVRIVMGQSPNSENYTDDPNDYILVQGNADMKNGRVLPR 74 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 T K ++ P +D + ++ + + L + Sbjct: 75 VWTTQVTKQAEKDDLILSVRAPV-GDIGKTAYDVVIGRGVAAIKGNEFI----FQNLGKM 129 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G+T + I + +P + EQ I ++D I R Sbjct: 130 KSDGYWTRYSTGSTFESINSTDIKEAIISVPAIEEQDKIGSF----FKQLDNTIALHQRK 185 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++LLKE+K+ + + K +++ +G D WE + + K Sbjct: 186 LDLLKEQKKGFLQKMFPKNGAKVPELRFAG------FADDWEERKLGDITKISTGK--LD 237 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + Y ++ + + + I G V N + + V+ Sbjct: 238 ANAMVENGKYDFYTSGIKKYRIDVAAFEGPSITIAGNGATVGYMHLADNKFNAYQRTYVL 297 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP- 370 + ++ +++ + K+ +G + + + L + +P Sbjct: 298 QEFLVDRSFIFSEIGNKLP------------KKIKQEARTGNIPYIVMDMLTELKLSIPQ 345 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 EQ I + ++D + ++ + LLKE++ F+ Sbjct: 346 NNSEQQKIGSF----FKQLDDTIALHQRKLDLLKEQKKGFLQK 384 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 15/205 (7%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 ++ VK K + + G + WE++ V + ++ Y + + Sbjct: 8 IDDSVKKKVPELRFKGFTDE-WELRKLGDEVRIVMGQSPNSENYTDDPNDYILVQGNADM 66 Query: 271 RNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 +N + P + T + + +++ D V+ RG+ Sbjct: 67 KNGRVLPRVWTTQVTKQAEKDDLILSVRAPVGDIGKTAYDVVIGRGVAA--------IKG 118 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + L + +S+ D+K + VP I+EQ I + + Sbjct: 119 NEFIFQNLGKMKSDGYWTRYSTGSTFESINSTDIKEAIISVPAIEEQDKIGSF----FKQ 174 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAA 413 +D + ++ + LLKE++ F+ Sbjct: 175 LDNTIALHQRKLDLLKEQKKGFLQK 199 >gi|313207214|ref|YP_004046391.1| restriction modification system DNA specificity domain [Riemerella anatipestifer DSM 15868] gi|312446530|gb|ADQ82885.1| restriction modification system DNA specificity domain [Riemerella anatipestifer DSM 15868] gi|315022984|gb|EFT36005.1| restriction modification system DNA specificity domain protein [Riemerella anatipestifer RA-YM] gi|325335340|gb|ADZ11614.1| Restriction endonuclease S subunit [Riemerella anatipestifer RA-GD] Length = 365 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 62/396 (15%), Positives = 130/396 (32%), Gaps = 36/396 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + V + K + K I +D+ GKY N + + + K Sbjct: 2 ERVKLIDICK------PKQWKT---ISGKDILEK-GKYPVYGANGKIGFYNEYNH-EKPT 50 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +L G G I F + + + L + + + G Sbjct: 51 LLIGCRGSCGTIHISEPFSYTSGNAMALDGLSSKVDIKFLFYYLK---QRGFDDVMSGGV 107 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 G+ + +P+PPL EQ I EK+ +I + + QAL Sbjct: 108 QKQITKVGLEKVEIPLPPLVEQQAIAEKLDQA----QKIIDLNEAEVARYDKLAQAL--- 160 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + +P K ++ +G V ++ + +I ++ ++ Sbjct: 161 FIDMFGDPVQNPKGWEVKKLGEV------CTNILGGGTPSKSKPEFYIGDIPWVTPKDMK 214 Query: 266 QKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 K ++ + + + P + + I K +L A I A Sbjct: 215 TKFIRNSIDHINKLAIENSSAKLIPVDSILMVIRSGILKHTLPVAINKVSVTINQDMKAF 274 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 P+ + L +++ + +C F + + +++F +K L ++PPI Q + Sbjct: 275 LPNDKITNTL-FMLYFFKVCSYFLLGKVRAVTADNIEFNQIKNLNYILPPITLQNEFAKR 333 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 I +I++L + +Q + K S + + G Sbjct: 334 IE----QIELLKNQAQQELEQSKNLFQSLLQESFKG 365 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 65/196 (33%), Gaps = 14/196 (7%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK W+V + + G T K DI ++ +D+++ + N + Sbjct: 172 PKGWEVKKLGEVCTNILGGGTPSKSKPEFYIGDIPWVTPKDMKTKFIRNSIDHINKLAIE 231 Query: 75 TSTVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ + IL L+ I + P D + L Sbjct: 232 NSSAKLIPVDSILMVIRSGILKHTLPVAINKVSVTINQDMKAFLPNDKITNTLFMLYFFK 291 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + T + ++ I N+ +PP+ Q ++I +I+ L + + Sbjct: 292 VCSYFLLGKVRAVTADNIEFNQIKNLNYILPPITLQNEFAKRI----EQIELLKNQAQQE 347 Query: 192 IELLKEKKQALVSYIV 207 +E K Q+L+ Sbjct: 348 LEQSKNLFQSLLQESF 363 >gi|78189089|ref|YP_379427.1| restriction endonuclease S subunits-like [Chlorobium chlorochromatii CaD3] gi|78171288|gb|ABB28384.1| Restriction endonuclease S subunits-like protein [Chlorobium chlorochromatii CaD3] Length = 428 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 60/430 (13%), Positives = 131/430 (30%), Gaps = 39/430 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 WK +K L GR+ G +I ++ + + + Sbjct: 5 SEWKEYKLKDLGLLQRGRSRHRPRYAFHLYGGKYPFIQTGEIREASKYITKFEKTYSEEG 64 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ KG + + + + I +FD L P D + + + Sbjct: 65 LKQSKLWPKGTLCIT-IAANIAELAILNFDACFPDSVLGFIPNDKIANADFIYYILTHFQ 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + ++ I EG+ + + ++ PIPPL EQ I + + +I+ L + ++ Sbjct: 124 KELKHIGEGSVQDNINLGTFEDLLFPIPPLPEQRAIASVLSSLDDKIELLHRQNATLEKM 183 Query: 195 LKEKKQA--------------LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 + + L+ K K+ + + W++ Sbjct: 184 AETLFRQWFIERKSLNYDSYDLLDEHDLKNQKNHNNQKNHSSDNGEEAIEEWKIGKVSDY 243 Query: 241 -VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + + + +S + L E I+F ++ Sbjct: 244 ALHLKDSIQPQKNQSTFYFHYSIPSFDNDKNPIKELGKEIQSNKYKAPRYCILFSKLNPH 303 Query: 300 NDKR-SLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGL---R 354 DKR L +V + I ++ + V P Y + + D + G Sbjct: 304 KDKRVWLLQNEVEKNAICSTEFQVVLPIKRQYLYFLYGWLTLNDNYNEIASGVGGTSGSH 363 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI---EQSIVLLKERRSSFI 411 Q + + + + E +VI +I L +K + I L R + Sbjct: 364 QRIDPNTIYDFQCPL--VTE-----SVIEKFNIQIKPLFKKQVINQTQIRTLTALRDMLL 416 Query: 412 AAAVTGQIDL 421 ++G++ + Sbjct: 417 PKLMSGEVKV 426 >gi|165976843|ref|YP_001652436.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876944|gb|ABY69992.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 406 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 52/410 (12%), Positives = 111/410 (27%), Gaps = 64/410 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---STVSIFAKGQILYGKLGPYLRK- 97 + I YI +D G + D S K I++ + G Sbjct: 5 YKGKNGIPYISPKDFYDKNGIDFANAKKVSEEDYFLLSKKFAPQKNDIIFPRYGTIGVVR 64 Query: 98 AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 I + + S ++ + + + + +L S I+ T + K I Sbjct: 65 VIEENIKLLVSYSCACIRVEYINMQYVVAYLNSELAKLEIKKYTNKTTQPNVGLKSIKKF 124 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGLNP 213 +P+PPL EQ I KI I+ + + L ++ ++++ + L Sbjct: 125 IIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQFPEQLKKSILQAAIQGKLTE 184 Query: 214 DVKM------------------------------------------------KDSGIEWV 225 + E Sbjct: 185 QNPNDEPASALIERIKAEKLRLIAEKKLKKPKVISEIIMRDNLPYEIVNGKERCIADEVP 244 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 +P+ W + + N L + I ++ ES + Y Sbjct: 245 FEIPESWVWVRLSKITMGQSPDNKYLGKEGIEFHQ-------GKSFFSEYIIESSDIYCS 297 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + I L I +++ +++ +L + + Y Sbjct: 298 LPNKLATPNSILLCVRAPVGIVNITNRELCIGIGLASIESIYVNTIFLYYALFCYKNY-Y 356 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 +++ + + + +PP+ EQ I I + + L +K Sbjct: 357 ERKSTGSTFKAISKDIIDNTIIPIPPLNEQIRIVEKIETLFSTLQNLSQK 406 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 66/182 (36%), Gaps = 16/182 (8%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV------DPGEIVFRFIDLQNDK 302 ++ I +S + K K S E Y ++ +I+F Sbjct: 4 EYKGKNGIPYISPKDFYDKNGIDFANAKKVSEEDYFLLSKKFAPQKNDIIFPRYGTIGVV 63 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFED 361 R + + +++ + ++ I+ Y+ + S + + ++ + Sbjct: 64 RVIEENI---KLLVSYSCACIRVEYINMQYVVAYLNSELAKLEIKKYTNKTTQPNVGLKS 120 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-----KERRSSFIAAAVT 416 +K+ + +PP+ EQ I I I+ + E+ + L ++ + S + AA+ Sbjct: 121 IKKFIIPLPPLNEQKRIVAKIEELLPYIEQY-AEKEEKLTALHQQFPEQLKKSILQAAIQ 179 Query: 417 GQ 418 G+ Sbjct: 180 GK 181 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 35/167 (20%), Positives = 55/167 (32%), Gaps = 13/167 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---S 76 IP+ W V + K+ G++ ++ Y+G E +E GK + SD Sbjct: 246 EIPESWVWVRLS---KITMGQSPDNK----YLGKEGIEFHQGKSFFSEYIIESSDIYCSL 298 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + IL P I I + + + + + Sbjct: 299 PNKLATPNSILLCVRAPVGIVNITNRELCI---GIGLASIESIYVNTIFLYYALFCYKNY 355 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 E G+T I N +PIPPL EQ+ I EKI + Sbjct: 356 YERKSTGSTFKAISKDIIDNTIIPIPPLNEQIRIVEKIETLFSTLQN 402 >gi|150389395|ref|YP_001319444.1| restriction modification system DNA specificity subunit [Alkaliphilus metalliredigens QYMF] gi|149949257|gb|ABR47785.1| restriction modification system DNA specificity domain [Alkaliphilus metalliredigens QYMF] Length = 408 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 48/380 (12%), Positives = 108/380 (28%), Gaps = 19/380 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + T+ G+ K E V + +S + T K Sbjct: 22 WEPCKLSDLTEYKNGK-GHEDKQSTSGKYELVNLNSISIDGGLKHSGKFVDDTTDTLFKN 80 Query: 85 QILYGKL----GPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 ++ G L + + + + + + +L+P + + + Sbjct: 81 DLVMVLSDVGHGDLLGRVALIPENDRFVLNQRVALLRPNRAAD-PQFLFSYINAHQRYFK 139 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A G + + + + IP EQV KI ++D LIT R ++ +K Sbjct: 140 AQGAGMSQLNISKGSVESFTSFIPDKEEQV----KIGKHFKQLDNLITLHQRKLDKIKSM 195 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K+A + + K + G + F+ +T N + ++ Sbjct: 196 KKAYLYEMFPVEGESRPKRRFKGFTDAWEQRKLTDEVELFSGLT--YSPNDIVKDNGTFV 253 Query: 259 LSYGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 L N+ + V G+I+ + + ++ Sbjct: 254 LRSSNVKNGEVVDADNVYVNSEVVNSCNVKNGDIIVVVRNGSRSLIGKHAQIKGDKDKTV 313 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 S ++ L+ + + ++ ++P +EQ Sbjct: 314 IGAFMTGLRSNHSDFVNALLDTPLFKSEIDKNLGATINQITNGMFHQMKFMIPNPEEQDR 373 Query: 378 ITNVINVETARIDVLVEKIE 397 I +D L+ + Sbjct: 374 IG----KLFTGLDNLITLHQ 389 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 73/210 (34%), Gaps = 11/210 (5%) Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 TK L P + K+ + ++ ++ + ++ +I Sbjct: 2 AETKKLIPKRRFKEFQ---NAEAWEPCKLSDLTEYKNGKGHEDKQSTSGKYELVNLNSIS 58 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 ++ G + + +V + + + +R ++ ++P Sbjct: 59 IDGGLKHSGKFVDDTTDTLFKNDLVMVLSDVGHGDLLGRVALIPENDRFVLNQRVALLRP 118 Query: 326 HGI-DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + D +L + ++ + F A G+G+ + ++ V+ +P +EQ I Sbjct: 119 NRAADPQFLFSYINAH--QRYFKAQGAGMSQLNISKGSVESFTSFIPDKEEQVKIGKH-- 174 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++D L+ ++ + +K + +++ Sbjct: 175 --FKQLDNLITLHQRKLDKIKSMKKAYLYE 202 >gi|315038272|ref|YP_004031840.1| restriction modification system DNA specificity domain protein [Lactobacillus amylovorus GRL 1112] gi|312276405|gb|ADQ59045.1| restriction modification system DNA specificity domain protein [Lactobacillus amylovorus GRL 1112] Length = 372 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 52/380 (13%), Positives = 126/380 (33%), Gaps = 19/380 (5%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 + I +I +E + + + K G N + K + K G + K I Sbjct: 2 KEGIPFISVEAIVNNKIDFKRKRGYISNEYNEKCNQKYKPQKNDVYLVKSGSTVGKTAIV 61 Query: 102 D---FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIP 158 + I S + P L L + ++ ++ +G T + + + + Sbjct: 62 ETNIPFNIWSPLAALRPNNATSPYFLFYLLQTDNLQSQVINKSKGGTQPNLSMRLLEHFK 121 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK--GLNPDVK 216 + +P + +++ +I I+ + R + LKE K+ L+S + P ++ Sbjct: 122 IFVPNNIDYQTQIARLLINVDKI---ISLQQRKLNELKEVKKTLLSQLFPSKGQYRPIIR 178 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII-QKLETRNMGL 275 K +W + + N + GN I + + + Sbjct: 179 FKKFTNKWTKRKLGNIAKIIGGGTPSTSNHDYWNGNINWYSPTEIGNNIFVNSSNKKISI 238 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 K + + +++ G+ + ++ S + I Y + Sbjct: 239 KGLNNSSAKLLPGGKTILFTSRAGIGNMAIMLTDGCTNQGFQSWVIDDTKIDI---YFLY 295 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + ++VK+L + +P EQ I N++ +ID + + Sbjct: 296 SLGRLLKHDAIRQASGSTFLEISNKEVKKLLLEIPSFTEQKLIGNML----RKIDDDIVR 351 Query: 396 IEQSIVLLKERRSSFIAAAV 415 ++ I+L+ + + + + Sbjct: 352 QKERIILITKIKKNLLQKLF 371 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 56/186 (30%), Gaps = 7/186 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKY-LPKDGNSRQSDTST 77 W + K+ G T + +I + ++ + K + + + S+ Sbjct: 186 WTKRKLGNIAKIIGGGTPSTSNHDYWNGNINWYSPTEIGNNIFVNSSNKKISIKGLNNSS 245 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G+ + + I DG + F D ++ + L + Sbjct: 246 AKLLPGGKTILFTSRAGIGNMAIMLTDGCTNQGFQSWVIDDTKIDIYFLYSLGRLLKHDA 305 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G+T K + + + IP EQ LI + I I ++ K Sbjct: 306 IRQASGSTFLEISNKEVKKLLLEIPSFTEQKLIGNMLRKIDDDIVRQKERIILITKIKKN 365 Query: 198 KKQALV 203 Q L Sbjct: 366 LLQKLF 371 >gi|332362406|gb|EGJ40206.1| type I restriction-modification system [Streptococcus sanguinis SK1056] Length = 170 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 31/84 (36%), Positives = 52/84 (61%) Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + +L+ +YD+CKVFY G G+RQ + D+ ++ +L+PP EQ I + ++ + A++D Sbjct: 1 MYYLLHTYDICKVFYNFGGGVRQGGTWSDIYKMELLIPPCNEQQKIADYLDKKIAQLDRA 60 Query: 393 VEKIEQSIVLLKERRSSFIAAAVT 416 +E+ I LK+ RSS I VT Sbjct: 61 KRLLEKQIQKLKDYRSSLIYETVT 84 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 34/114 (29%), Positives = 55/114 (48%), Gaps = 1/114 (0%) Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + L + D+ + G W I + + IPP EQ I + + + ++D Sbjct: 1 MYYLLHTYDICKVFYNFGGGVRQGG-TWSDIYKMELLIPPCNEQQKIADYLDKKIAQLDR 59 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 + I+ LK+ + +L+ VTKGL+ V MKDSGI+W+G VP+ W V Sbjct: 60 AKRLLEKQIQKLKDYRSSLIYETVTKGLDKTVPMKDSGIDWIGQVPEGWGVSKL 113 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 15/76 (19%), Positives = 23/76 (30%), Gaps = 5/76 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTK-----LNTGRTSESGKDIIYIGLEDVESGTGKYL 64 KDSG+ WIG +P+ W V +K + + G S L Sbjct: 93 MKDSGIDWIGQVPEGWGVSKLKFTLEKASNNIKVGPFGSSLSGDAIRSSGKWVYNQRNVL 152 Query: 65 PKDGNSRQSDTSTVSI 80 + + S Sbjct: 153 DNNFTETDTFISDAKW 168 >gi|298375506|ref|ZP_06985463.1| HsdS, type I site-specific deoxyribonuclease [Bacteroides sp. 3_1_19] gi|298268006|gb|EFI09662.1| HsdS, type I site-specific deoxyribonuclease [Bacteroides sp. 3_1_19] Length = 409 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 66/412 (16%), Positives = 140/412 (33%), Gaps = 30/412 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLED--VESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + T + S + + D +++ G+ + + + + G Sbjct: 2 TKLSSIADYVTDKISSNDIALREYVTTDCILQNKKGREIATNLPPQSCCLTRYQH---GD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD-VLPELLQGWLLSIDVTQRIEAICEGA 144 +L + PYL+K AD DG S+ LV + K+ P L LL + +G+ Sbjct: 59 VLIANIRPYLKKVWFADIDGGASSDVLVFRAKEGHSPSFLYAVLLQDSFFDYVMQGAKGS 118 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 M D + I MP +E+ + + +D I + + L+ + L Sbjct: 119 KMPRGDKEQILRYEMPTLSCSEESIGTFFLN-----LDQKIRLNEQINQNLEAMAKQLYD 173 Query: 205 YIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELNR-----KNTK 250 Y + PD K SG E V +P WE K + N N + Sbjct: 174 YWFVQFDFPDENGRPYKSSGGEMVWNEKLKRKIPASWENKNIEDIADVYNGATPSTINEQ 233 Query: 251 LIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 +I+ ++ ++ K + G + S Y + I + + + Sbjct: 234 NYGGDIVWITPKDLSDQKQKFVYQGERNISQAGYNSCSTHLLPPNTILMSSRAPIGLLSI 293 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + P + + + + + ++ + + EDV + P+L Sbjct: 294 AKTELCTNQGFKSFVPKAENISTYLYYYLNIHIKQIEQLGTGTTFKEVSREDVLKFPILK 353 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P I ++ + + ++ I++ L ++R + + GQ+ + Sbjct: 354 PS----DAILDLWEKQVSALNNKQFVIQKENEFLTKQRDELLPLLMNGQVSV 401 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 56/164 (34%), Gaps = 17/164 (10%) Query: 10 YKDSGVQ--WIGA----IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVE 57 YK SG + W IP W+ I+ + G T + G DI++I +D+ Sbjct: 189 YKSSGGEMVWNEKLKRKIPASWENKNIEDIADVYNGATPSTINEQNYGGDIVWITPKDLS 248 Query: 58 SGTGKYL---PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL 114 K++ ++ + ++ + + IL P IA + + F Sbjct: 249 DQKQKFVYQGERNISQAGYNSCSTHLLPPNTILMSSRAPI-GLLSIAKTELCTNQGFKSF 307 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIP 158 PK + ++IE + G T + + P Sbjct: 308 VPKA-ENISTYLYYYLNIHIKQIEQLGTGTTFKEVSREDVLKFP 350 >gi|167856385|ref|ZP_02479111.1| Type I restriction-modification system specificity subunit [Haemophilus parasuis 29755] gi|167852491|gb|EDS23779.1| Type I restriction-modification system specificity subunit [Haemophilus parasuis 29755] Length = 236 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 74/195 (37%), Gaps = 14/195 (7%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 WE +P + + KN + ++ + + +I +L+ N + + Y ++ G+ Sbjct: 29 GWENRPLSTVFNRITLKNKENNQNVLTISAQYGLISQLDFFNKSVSAKDITGYYLLHKGD 88 Query: 291 IVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVKPHGIDSTY---LAWLMRSYDLCKVF 346 + ++ ++ ++G++++ Y+ K S Y + Sbjct: 89 FAYNKSYSNGYPYGAIKPLKLYDKGVVSTLYICFKLKESYSNYGNFFEHYFEAGVQNNDI 148 Query: 347 YAMGS-GLRQS--LK---FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + G R L E + VL+P +EQ I + + + +D L+E EQ + Sbjct: 149 GKVAQEGARNHGLLNIGIQEFFNEVNVLIPSFEEQQKIADCL----SSLDELIELQEQKL 204 Query: 401 VLLKERRSSFIAAAV 415 LK+ + + Sbjct: 205 AALKQHKKGLMQQLF 219 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 67/206 (32%), Gaps = 11/206 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ P+ T + E+ ++++ I + + K +++ D + + K Sbjct: 29 GWENRPLSTVFNRITLKNKENNQNVLTISAQYGLISQLDFFNKSVSAK--DITGYYLLHK 86 Query: 84 GQILYGKLGPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGWLLSID----VT 134 G Y K G+ ST ++ + K+ + + Sbjct: 87 GDFAYNKSYSNGYPYGAIKPLKLYDKGVVSTLYICFKLKESYSNYGNFFEHYFEAGVQNN 146 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + EGA GI + L ++KI +D LI + + + Sbjct: 147 DIGKVAQEGARNHGLLNIGIQEFFNEVNVLIPSFEEQQKIADCLSSLDELIELQEQKLAA 206 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDS 220 LK+ K+ L+ + + + S Sbjct: 207 LKQHKKGLMQQLFPSHNDLQASKQAS 232 >gi|117676103|ref|YP_863679.1| restriction modification system DNA specificity subunit [Shewanella sp. ANA-3] gi|117614927|gb|ABK50380.1| restriction modification system DNA specificity domain [Shewanella sp. ANA-3] Length = 405 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 51/398 (12%), Positives = 125/398 (31%), Gaps = 21/398 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ + + + T + G + ++++ E++ S T + K + + Sbjct: 18 EWQSLSLDKITDVYDGTHQTPAYTKNGVMFLSAENIRSLTSQ---KFISEKAFKEEFKVY 74 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 K +L ++G ++ D L L L ++ Q+ + Sbjct: 75 PQKNDVLMTRIGDVGTANVVETDDDKAYYVTLALLKYKQLSPYFLKSSIASPYVQKDIWL 134 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 ++ + I +KI ++D LI + + + L K+ Sbjct: 135 RT-LHIAFPKKINMNEIKQVNVNCPTNSKESDKIGNYFQKLDNLINQYQQKHDKLSNIKK 193 Query: 201 ALVSYIVTKGLN--PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL---IESN 255 A++ + K P+++ K EW +V T + I+ Sbjct: 194 AMLEKMFPKQGETIPEIRFKGFSGEW-DEKELGTDVADIVGGGTPSTSISEFWNGDIDWY 252 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + N+ + + + + + +I+ G + + A + + G Sbjct: 253 SPTEIGSNVYAEGSQKKITALGLNSSSAKILPAG----NTVLFTSRAGIGDMAILTKPGT 308 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + Y + + + + R+ +LVP EQ Sbjct: 309 TNQGFQSFVVKEGFVPYFIYSAGKQIKEYALKHASGSTFLEISGKQLGRMKILVPCETEQ 368 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I N ++D L+++ +Q I L + + ++ Sbjct: 369 TAIGNY----FQKLDALIKQHQQQITKLNNIKQACLSK 402 >gi|241888736|ref|ZP_04776043.1| putative type I restriction enzyme specificity protein [Gemella haemolysans ATCC 10379] gi|241864759|gb|EER69134.1| putative type I restriction enzyme specificity protein [Gemella haemolysans ATCC 10379] Length = 621 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 50/407 (12%), Positives = 115/407 (28%), Gaps = 55/407 (13%) Query: 26 KVVPIKRFTKLNT-----GRTSESGKDIIY---------IGLEDVESGTGKYLPKDGNSR 71 + + ++ T G +++ Y I +D++S K Sbjct: 14 EWKKLGEVVEIVTDYVAAGSFKTIAENVKYLQKEGYAQLIRTKDIKSDFKKVDDFVYVDE 73 Query: 72 QSDTSTVSI-FAKGQILYGKLGPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQ 125 + + K I+ +G I + + + L+ K + L Sbjct: 74 NAFRFLYRVNLDKECIILPNIGNCGEVYYIYPEKLPSDNNVLGPNAIYLRSKTQSNKYLY 133 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + +E I + + + +PIP L Q + E + T + L Sbjct: 134 YLFHEYFFQKSLEKITSKVGQGKFNKTDLKELLIPIPSLETQEKMVEILDKFTSYVTELQ 193 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 +E + + L+S ++ + +VT N Sbjct: 194 SELQSRTKQYTYYRDKLLSEEYLIKATKEM-----------EEDRRLNIVQLEEVVTIKN 242 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K+ K ++ + + + + + + + N Sbjct: 243 GKDWKKLDQGDIPVYGSGGEMGVFVDKYSYDKPTVLIPRKGSIDNVFYLDKPFWNVDTIF 302 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + I Y + + YDL K+ + R SL + +L Sbjct: 303 HTEIDESKLI--------------PKYFYYFIEHYDLNKLSD---NSTRPSLTQSTLNKL 345 Query: 366 PVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKE 405 V +PP+ Q I +++ + +E+ ++ +E Sbjct: 346 KVPLPPLSLQNKIVRILDKFQVLLADTKGLIPVEIEQRQKQYEYYRE 392 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 43/396 (10%), Positives = 106/396 (26%), Gaps = 32/396 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 +V ++ + G+ + G + K Sbjct: 230 NIVQLEEVVTIKNGKDWKKLD-------------QGDIPVYGSGGEMGVFVDKYSYDKPT 276 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +L + G + T F + L + + + + + +T Sbjct: 277 VLIPRKGSIDNVFYLDKPFWNVDTIFHTEIDESKLIPKYFYYFIEHY---DLNKLSDNST 333 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL---ITERIRFIELLKEKKQ-A 201 + + +P+PPL+ Q I + V + I I + E + Sbjct: 334 RPSLTQSTLNKLKVPLPPLSLQNKIVRILDKFQVLLADTKGLIPVEIEQRQKQYEYYREK 393 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVP----------DHWEVKPFFALVTELNRKNTKL 251 L+++ V + S + L D + + + + Sbjct: 394 LLTFDVEYSRTNERTFIISNTYYNILQEAAKYVGIILEDKVIEYRLRDIAEYSSSRISAE 453 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + N+++ R T + +I+ I K Sbjct: 454 ELDTFNYVGVDNLLKDKYGREDSTYVPETGTSIKYEKDDILIGNIRPYLRKIWYSDRTGG 513 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP 370 G + A +DS YL + + G + + +P Sbjct: 514 TNGDV-LAISVKDKKLVDSRYLYHALADERFFEYNIKYSKGAKMPRGDKKKIMEYHFPIP 572 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 P+ Q + ++++ ++ + E + + I +++ Sbjct: 573 PLYVQQHVVSILDKFYTLVNDIKEGLPKEIEQRQKQ 608 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 17/169 (10%), Positives = 57/169 (33%), Gaps = 2/169 (1%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + + + + + + +D I+ I + + Sbjct: 45 QKEGYAQLIRTKDIKSDFKKVDDFVYVDENAFRFLYRVNLDKECIILPNIGNCGEVYYIY 104 Query: 307 SAQVM-ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKR 364 ++ + ++ + ++ + YL +L Y K + S + + D+K Sbjct: 105 PEKLPSDNNVLGPNAIYLRSKTQSNKYLYYLFHEYFFQKSLEKITSKVGQGKFNKTDLKE 164 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 L + +P ++ Q + +++ T+ + L +++ R ++ Sbjct: 165 LLIPIPSLETQEKMVEILDKFTSYVTELQSELQSRTKQYTYYRDKLLSE 213 >gi|54024731|ref|YP_118973.1| putative restriction-modification system specificity determinant [Nocardia farcinica IFM 10152] gi|54016239|dbj|BAD57609.1| putative restriction-modification system specificity determinant [Nocardia farcinica IFM 10152] Length = 394 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 58/411 (14%), Positives = 121/411 (29%), Gaps = 34/411 (8%) Query: 23 KHWKVVPIKRFTKLNTGRT--SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +W +VP+ G+T + + ++ + G G G+ + Sbjct: 2 SNWPLVPLGDILA-QDGQTERIANTESEKFLTIR--LYGKGLVERSIGSGKTPKPFVGYR 58 Query: 81 FAKGQILYGKLGPYLRKAIIADFD---GICS---TQFLVLQPKDVLPELLQGWLLSIDVT 134 GQ +Y ++ + + +CS +F V Q + LL+ Sbjct: 59 VKPGQFVYSRIDARNGAYGVVPDELDGAVCSKDFPKFDVDQQRADENYLLRLVQTRDFYR 118 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + + + +P+PP+ EQ I + R Sbjct: 119 KVQDLSFGATNRQRVKEEEFLRLRIPLPPIEEQRRIAAILDHADALRAKRREALARLD-- 176 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 E Q++ + +P ++ VG D +E + L S Sbjct: 177 --ELTQSI---FIDMFGDPVANERNWPFGTVGDFVDRFEGGKNIVGSGDSTDGYRVLKVS 231 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR-SAQVMER 313 + SLSY K IV G+++F + + + R Sbjct: 232 AVTSLSYRESESKPLPEGYVPPSN-----HIVQRGDLLFSRANTSELVGATALVTETDGR 286 Query: 314 GIITSAYMAVKPHGIDSTYLAW---LMRSYDLCKVFYAMG---SGLRQSLKFEDVKRLPV 367 + K + + L + + SG +++ V +P+ Sbjct: 287 TALPDKLWRFKWKNRTAAVPGYVAALFQRPSFRQTISDRATGSSGSMKNISQSKVLSIPL 346 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP++ Q +D + ++ L +S + A G+ Sbjct: 347 GIPPVELQEKF----ESVRVEVDSMKNSNRIALAELDALFASLQSRAFRGE 393 >gi|227834295|ref|YP_002836002.1| hypothetical protein cauri_2473 [Corynebacterium aurimucosum ATCC 700975] gi|227455311|gb|ACP34064.1| hypothetical protein cauri_2473 [Corynebacterium aurimucosum ATCC 700975] Length = 378 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 52/409 (12%), Positives = 119/409 (29%), Gaps = 53/409 (12%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W +VP+ + + +G T + K +I ++ D+ K S Sbjct: 8 DWPMVPLPKLVQFQSGGTPSTKKPEYYNGEIPWVTSADISESHCIDAKKFITEAAIRNSA 67 Query: 78 VSIFAKGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I G + L ++G + K+ + + S L Sbjct: 68 ACIAQPGSVLLVTRIG--VGKSALVEAPVSFSQDVTNLNDLSEECNARYLLHFLQSARSF 125 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL-L 195 ++ G T+ + ++ +P+PPL EQ I + I ++ + Sbjct: 126 FQSRSRGVTIKGIKRTDLNDLLVPLPPLDEQRRIAAILDEVESAIVAAKSQLSELSAIPF 185 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + +++ ++ + D E +P + Sbjct: 186 WMGDRKFELVALSELVDIRSSLVDPTSEPYMDMP-------------------------H 220 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I + + ++ G+I++ I +K S+ + + Sbjct: 221 IAPNNLSSGSDDFVGVKSAVEDRVTSGKYAFQAGDILYSKIRPYLNKVSIAAY---DGVC 277 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKE 374 Y V + + ++ W +RS + + + + V Sbjct: 278 SADMYALVPRNRTQTDWIVWQLRSSRFLAYAASSSGRASIPKINRKALGAFKV------- 330 Query: 375 QFDITN--VINVETAR--IDVLVEK-IEQSIVLLKERRSSFIAAAVTGQ 418 I V+ + +E + + + LL+E +SS A G+ Sbjct: 331 --QIVEPAVLEQFNREQNVKKTIENSVRKKLYLLQELQSSLSTRAFQGE 377 >gi|307268428|ref|ZP_07549806.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX4248] gi|306515235|gb|EFM83772.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX4248] Length = 407 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 53/405 (13%), Positives = 142/405 (35%), Gaps = 34/405 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W++ +K T+ G ++ D+ + + + + GN + ++ Sbjct: 18 EDWELCKLKEITERVKG--NDGRMDLPTLTISASQGWLNQKDRFSGNIAGKEQKNYTLLL 75 Query: 83 KGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 K ++ Y KL Y + ++ + + + E Sbjct: 76 KNELSYNHGNSKLAKYGAVFSLKTYEEALVPRVYHSFKSTKNSDPDFLEYIFATKKPDKE 135 Query: 139 ------AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + ++ NI + IP + EQ I + +ID I R + Sbjct: 136 LGKLVSSGARMDGLLNINYDDFSNIKINIPHVHEQKKISNLL----RKIDDTIALHQRKL 191 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA--LVTELNRKNTK 250 + LKE K+A + + K +++ + E D W++ + + + + Sbjct: 192 DQLKELKKAYLQLMFPKKDETVPQVRFADFE------DDWQLCKLGDVVEIFDGTHQTPR 245 Query: 251 LIESNILSLSYGNIIQKLETRNMGLK-PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 +S + +S NI + + + E + + G+I+ I D +++ + Sbjct: 246 YTDSGVKFVSVENIATLETKKYITHEAYEKEYSKKRAKKGDILMTRIG---DIGTMKVIE 302 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPV 367 E +K + +L++++ S ++ + + + + ++ ++ + Sbjct: 303 TDEPLAYYVTLALLKAKETNPYFLSFIISSPEIQRNIWKRTLHIAFPKKINLGEINQVEM 362 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +EQ I + +D + + + LK + S++ Sbjct: 363 KITIFEEQDKIGD----LFTNLDDAIILNQNKLNQLKSLKKSYLQ 403 >gi|297587127|ref|ZP_06945772.1| restriction modification system DNA specificity domain protein [Finegoldia magna ATCC 53516] gi|297575108|gb|EFH93827.1| restriction modification system DNA specificity domain protein [Finegoldia magna ATCC 53516] Length = 439 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 64/432 (14%), Positives = 136/432 (31%), Gaps = 43/432 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTS----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD---- 74 ++++ +K + +++ + K I + +++ + + Sbjct: 5 ENFEKYKLKELSDISSSKRIFASEYKEKGIPFYRSKEIIEKQSNKRISNKLFISKERYIE 64 Query: 75 -TSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSI 131 + + + G +L +G ++ + + K + L W S Sbjct: 65 IKNKYGVPSCGDLLLTSVGTLGVPYLVKNEEFYFKDGNLTWFRNLKQINKSYLYYWFFSP 124 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +I + G+T + N + IP A Q I + + +I+ Sbjct: 125 EAKYQITSKQIGSTQKALTISNLNNFEILIPTRAIQEKIVTILKSLDSKIE---INNKII 181 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 L + + S+ V D +S +G++P+ WEVKP L+ Sbjct: 182 SNLESQAQAIFKSWFVDFEPFQDGNFVES---ELGMIPEGWEVKPIGELLDFDIGGGWGK 238 Query: 252 IESNILSLSYGNIIQKLET----------RNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + L +I+ + N ES + + G+I+F + Sbjct: 239 EKPQEKYLIPAYVIRGTDIPDSKFGYFNMDNYRYHTESNLKNRRLQVGDIIFESSGGSTN 298 Query: 302 KRSLRSAQVME--------RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF---YAMG 350 + R V + I S ++ + + + + Y Y + Sbjct: 299 QDLGRMLLVTDELLNEYNNDVICASFCKLIRINDSSIRWFVYNLLEYSYRNKILTKYEVK 358 Query: 351 SGLRQSLKFEDVKR-LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 S + F K + VP K NV I+ L K+ L E R + Sbjct: 359 STGISNFSFTIFKDDFKIAVPDRKTMER---YFNVTGNNIN-LSAKLGIQNTKLAELRDA 414 Query: 410 FIAAAVTGQIDL 421 + + G+ID+ Sbjct: 415 LLPKLMAGEIDV 426 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 17/189 (8%), Positives = 57/189 (30%), Gaps = 6/189 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 ++ K++ ++ + I+ I + Sbjct: 1 MEFKENFEKYKLKELSDISSSKRIFASEYKEKGIPFYRSKEIIEKQSNKRISNKLFISKE 60 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E Y + G+++ + +++ + + + I+ +YL Sbjct: 61 RYIEIKNKYGVPSCGDLLLTSVGTLGVPYLVKNEEFYFKD--GNLTWFRNLKQINKSYLY 118 Query: 335 WLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI---D 390 + S + + +++L ++ +L+P Q I ++ ++I + Sbjct: 119 YWFFSPEAKYQITSKQIGSTQKALTISNLNNFEILIPTRAIQEKIVTILKSLDSKIEINN 178 Query: 391 VLVEKIEQS 399 ++ +E Sbjct: 179 KIISNLESQ 187 Score = 39.8 bits (91), Expect = 0.98, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 8/83 (9%) Query: 18 IGAIPKHWKVVPIKRFTKLNT----GRTSESGKDII---YIGLEDVESGTGKYLPKDGNS 70 +G IP+ W+V PI + G+ K +I I D+ Y D Sbjct: 212 LGMIPEGWEVKPIGELLDFDIGGGWGKEKPQEKYLIPAYVIRGTDIPDSKFGYFNMDNYR 271 Query: 71 RQSDTS-TVSIFAKGQILYGKLG 92 ++++ G I++ G Sbjct: 272 YHTESNLKNRRLQVGDIIFESSG 294 >gi|91772548|ref|YP_565240.1| restriction modification system DNA specificity subunit [Methanococcoides burtonii DSM 6242] gi|91711563|gb|ABE51490.1| Restriction modification system DNA specificity subunit [Methanococcoides burtonii DSM 6242] Length = 511 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 70/205 (34%), Gaps = 6/205 (2%) Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 ++ + + +P+ + L + K+T ES + + Sbjct: 272 PFTETELAELPTLPNGYGWTRLGELHHLKSDKHTGSGESLFYIGLEHISKNQGTLTDEVK 331 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLA 334 G++++ + +K L + E G+ ++ + + D Y Sbjct: 332 IDVINTVKNSFKKGDLLYGKLRPYLNKVYLAN----EDGVCSTDILVFESIPSLDLNYSK 387 Query: 335 WLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + SY SG+ + + ++ P + ++EQ I I + D + Sbjct: 388 YYFLSYKFVNDMTHNSSGVNLPRVSTKYLQEYPFPLFSLEEQQAIVTEIETRLSVCDKVE 447 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ 418 + IE ++ + + R S + A G+ Sbjct: 448 QDIEDNLKIAEALRQSILKKAFEGK 472 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 55/178 (30%), Gaps = 11/178 (6%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPE------SYETYQIVDPGEIVFRFIDLQN 300 K + +IL ++ ++ E + + +++ G ++F Sbjct: 35 KVPEYWGEDILWITPADLSGYSEKYIYKGRKSITHLGLKNSSARLIPKGSVLFSSRAPIG 94 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 A + + P + + + L + Sbjct: 95 Y-----IAIAGNELCTNQGFKTLIPSEALNRDFLYYYLKSIKQLAEGRASGTTFKELSGK 149 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 LP+ VPP+ EQ I + I + +D + ++ + LK R S + A G+ Sbjct: 150 AFAELPLCVPPLPEQRAIVSKIEQLFSELDNGIANLKLAQQQLKVYRQSVLKKAFEGE 207 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 45/201 (22%), Positives = 82/201 (40%), Gaps = 3/201 (1%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 ++ + + +P + + L + + + SG+ + YIGLE + G + Sbjct: 275 ETELAELPTLPNGYGWTRLGELHHLKSDKHTGSGESLFYIGLEHISKNQGTLTDEVKIDV 334 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLS 130 + F KG +LYGKL PYL K +A+ DG+CST LV + L + + LS Sbjct: 335 INTVKNS--FKKGDLLYGKLRPYLNKVYLANEDGVCSTDILVFESIPSLDLNYSKYYFLS 392 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + G + K + P P+ L EQ I +I D + + Sbjct: 393 YKFVNDMTHNSSGVNLPRVSTKYLQEYPFPLFSLEEQQAIVTEIETRLSVCDKVEQDIED 452 Query: 191 FIELLKEKKQALVSYIVTKGL 211 +++ + +Q+++ L Sbjct: 453 NLKIAEALRQSILKKAFEGKL 473 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 44/215 (20%), Positives = 79/215 (36%), Gaps = 14/215 (6%) Query: 16 QWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPK--- 66 + +G W + F ++ +G T ++ G+DI++I D+ + KY+ K Sbjct: 9 EKLGD---DWVKGVLSDFGQVVSGGTPKTKVPEYWGEDILWITPADLSGYSEKYIYKGRK 65 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 S+ + KG +L+ P AI + + F L P + L Sbjct: 66 SITHLGLKNSSARLIPKGSVLFSSRAPIGYIAIAGNEL-CTNQGFKTLIPSEALNR-DFL 123 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + Q E G T K +P+ +PPL EQ I KI +D I Sbjct: 124 YYYLKSIKQLAEGRASGTTFKELSGKAFAELPLCVPPLPEQRAIVSKIEQLFSELDNGIA 183 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG 221 + LK +Q+++ L + + + Sbjct: 184 NLKLAQQQLKVYRQSVLKKAFEGELTRQWREQQTD 218 >gi|293384341|ref|ZP_06630226.1| putative restriction endonuclease S subunit [Enterococcus faecalis R712] gi|291078333|gb|EFE15697.1| putative restriction endonuclease S subunit [Enterococcus faecalis R712] Length = 394 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 62/404 (15%), Positives = 155/404 (38%), Gaps = 36/404 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +W++ +K T+ G ++ D+ + + + + GN + ++ K Sbjct: 8 NWELCKLKEITERVKG--NDGRMDLPTLTISASQGWLNQKDRFSGNIAGKEQKNYTLLLK 65 Query: 84 GQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE- 138 ++ Y KL Y + ++ + + + E Sbjct: 66 NELSYNHGNSKLAKYGAVFSLKTYEEALVPRVYHSFKSTKNSDPDFLEYIFATKKPDKEL 125 Query: 139 -----AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + ++ NI + IP + EQ I + +ID +IT R ++ Sbjct: 126 GKLVSSGARMDGLLNINYDDFSNIKINIPHVHEQKKISNLL----RKIDDIITLHQRKLD 181 Query: 194 LLKEKKQALVSYIVTKGLNPDVK-MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 LKE K+A + + + K K ++ G W+ + + + ++K+T Sbjct: 182 QLKELKKAYLQLMFVSMNTKNNKVPKLRFADFEGD----WKQRKLGDFLEDFSKKSTIEN 237 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E ILS + + E R + S Y+I+D G++V +L ++ + Sbjct: 238 EYIILSSTNNGM----EIREGRVSGNSNLGYKIIDDGDLVLSPQNLWLGNINI---NNIG 290 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM----GSGLRQSLKFEDVKRLPVL 368 +G+++ +Y K ++ +L +R+ + + S +R++L+ + ++ + Sbjct: 291 QGLVSPSYKTFKIIDLNKEFLNPQLRTNKMLDQYKNASTQGASIVRRNLELDLFYQIRIF 350 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P +EQ I + +++ + + + +K + +++ Sbjct: 351 IPKNEEQKQIG----LLFRKLNESISLHQSKLDSIKYLKKAYLQ 390 Score = 43.6 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 63/185 (34%), Gaps = 8/185 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + F + + +++ + II + S ++G + I Sbjct: 216 DWKQRKLGDFLEDFSKKSTIENEYII------LSSTNNGMEIREGRVSGNSNLGYKIIDD 269 Query: 84 GQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G ++ +L I + G+ S + + D+ E L L + + + + Sbjct: 270 GDLVLSPQNLWLGNININNIGQGLVSPSYKTFKIIDLNKEFLNPQLRTNKMLDQYKNAST 329 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 S ++ I + +++I +++ I+ ++ +K K+A Sbjct: 330 QGA-SIVRRNLELDLFYQIRIFIPKNEEQKQIGLLFRKLNESISLHQSKLDSIKYLKKAY 388 Query: 203 VSYIV 207 + + Sbjct: 389 LQNMF 393 >gi|77164710|ref|YP_343235.1| restriction modification system DNA specificity subunit [Nitrosococcus oceani ATCC 19707] gi|76883024|gb|ABA57705.1| Restriction modification system DNA specificity domain [Nitrosococcus oceani ATCC 19707] Length = 547 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 69/199 (34%), Gaps = 5/199 (2%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY----GNIIQKLETRNMGLKPES 279 WV K TK + ++ + G + E + + Sbjct: 9 WVFCRFGDIARIRNGYAFRSSAFKKTKTHDCDVPLIRQSQLIGTAVNIGEAVYLPAEYLE 68 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 +++ G+I+ ++ + T +DS + + S Sbjct: 69 RFAQYVINKGDILIGMSGAIGKVCRYKNGFPALQNQRTGKIEVFDESQMDSRFFGLYLSS 128 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + A G ++ ++ +D++ LP+ +PP EQ I I + +D +E ++ + Sbjct: 129 IEGELIRQAKGMAVQ-NISAKDIEALPLGLPPYNEQQRIVAKIEELFSELDKGIESLKTA 187 Query: 400 IVLLKERRSSFIAAAVTGQ 418 LK R + + A G+ Sbjct: 188 REQLKVYRQAVLKHAFEGK 206 Score = 75.6 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 57/158 (36%), Gaps = 4/158 (2%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + R++ L + Y++ + R N + + ++ + Sbjct: 325 VDSSDLRSIKLDATEIQKYELSRNDLLCIRVNGSPNLVGRMILFKHDNVMAYCDHFIRFR 384 Query: 325 PHG--IDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + +Y+ L + + + + S + ++ + L + + EQ I + Sbjct: 385 FPQGIVLPSYIQMLFDTQTVRRYIELNKVSSAGQNTVSQTTISALAIPYCSLMEQKIIVS 444 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + I + +IE++ LK R S + A +GQ Sbjct: 445 RLEEQLTSISAVKVEIEENFQRLKSLRQSILKKAFSGQ 482 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 66/213 (30%), Gaps = 24/213 (11%) Query: 22 PKHWKVVPIKRFTKLNTG---------RTSESGKDIIYIGL-----EDVESGTGKYLPKD 67 P W ++ G +T D+ I V G YLP + Sbjct: 6 PTGWVFCRFGDIARIRNGYAFRSSAFKKTKTHDCDVPLIRQSQLIGTAVNIGEAVYLPAE 65 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ----FLVLQPKDVLPEL 123 + + KG IL G G + + Q V + Sbjct: 66 Y----LERFAQYVINKGDILIGMSGAIGKVCRYKNGFPALQNQRTGKIEVFDESQMDSRF 121 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 +L SI+ + +G + + K I +P+ +PP EQ I KI +D Sbjct: 122 FGLYLSSIEG--ELIRQAKGMAVQNISAKDIEALPLGLPPYNEQQRIVAKIEELFSELDK 179 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 I E LK +QA++ + L + Sbjct: 180 GIESLKTAREQLKVYRQAVLKHAFEGKLTAQWR 212 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 66/207 (31%), Gaps = 12/207 (5%) Query: 22 PKHWKVVPIKRFTK-LNTGRTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P W + ++ + G SGK I I L D+++ + Sbjct: 282 PNGWISIQLRELFESTQNGLAKRQGTSGKPIPVIRLADIKNQEVDSSDLRSIKLDATEIQ 341 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDV-LPELLQGWLLS 130 ++ +L ++ + C P+ + LP +Q + Sbjct: 342 KYELSRNDLLCIRVNGSPNLVGRMILFKHDNVMAYCDHFIRFRFPQGIVLPSYIQMLFDT 401 Query: 131 IDVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 V + IE A + I + +P L EQ +I ++ + I + E Sbjct: 402 QTVRRYIELNKVSSAGQNTVSQTTISALAIPYCSLMEQKIIVSRLEEQLTSISAVKVEIE 461 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVK 216 + LK +Q+++ + L P Sbjct: 462 ENFQRLKSLRQSILKKAFSGQLVPQDP 488 >gi|131021|sp|P17222|T1SP_ECOLX RecName: Full=Type-1 restriction enzyme EcoprrI specificity protein; Short=S.EcoprrI; AltName: Full=Type I restriction enzyme EcoprrI specificity protein; Short=S protein gi|42512|emb|CAA36526.1| unnamed protein product [Escherichia coli] Length = 401 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 48/386 (12%), Positives = 109/386 (28%), Gaps = 45/386 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + +P+ + L G T K DI + ++D+ Sbjct: 17 EWLPLSKVFNLRNGYTPSKTKKEFWANGDIPWFRMDDIRENGRILGSSLQKISSCAVKGG 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---QGWLLSIDVTQ 135 +F + IL A+I + + +F L K+ + + + + Sbjct: 77 KLFPENSILISTSATIGEHALITVPH-LANQRFTCLALKESYADCFDIKFLFYYCFSLAE 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITER 188 ++ + D G +P P LA Q I + + L E Sbjct: 136 WCRKNTTMSSFASVDMDGFKKFLIPRPCPDNPEKSLAIQSEIVRILDKFSALTAELTAEL 195 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + K++ +++ + + + E + + Sbjct: 196 TAELSMRKKQYNYYRDQLLSFKEDEVEGKRKTLGE-------------IMKMRAGQHISA 242 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 +IE S Y + K E I G + ++ + Sbjct: 243 HNIIERKEESYIYPCFGGNGIRGYVKEKSHDGEHLLIGRQGALCGNVQRMKGQFYATE-- 300 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 A + GI+ + ++ + +L + + L ++ L + Sbjct: 301 ---------HAVVVSVMPGINIDWAFHMLTAMNLNQY---ASKSAQPGLAVGKLQELKLF 348 Query: 369 VPPIKEQFDITNVINVETARIDVLVE 394 VP I+ Q I +++ + + E Sbjct: 349 VPSIERQIYIAAILDKFDTLTNSITE 374 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 62/221 (28%), Gaps = 22/221 (9%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M +EW+ L +V T K +I +I + L+ Sbjct: 11 MDGVEVEWLPLS----KVFNLRNGYTPSKTKKEFWANGDIPWFRMDDIRENGRILGSSLQ 66 Query: 277 PES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 S + ++ I+ + + + + A D +L Sbjct: 67 KISSCAVKGGKLFPENSILISTSATIGEHALITVPHLANQRFTCLALKESYADCFDIKFL 126 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVET 386 + S S+ + K+ + P + Q +I +++ + Sbjct: 127 FYYCFSLA-EWCRKNTTMSSFASVDMDGFKKFLIPRPCPDNPEKSLAIQSEIVRILDKFS 185 Query: 387 ARIDVLVEKIEQSIVLLKE----RRSSFIA---AAVTGQID 420 A L ++ + + K+ R ++ V G+ Sbjct: 186 ALTAELTAELTAELSMRKKQYNYYRDQLLSFKEDEVEGKRK 226 >gi|50122043|ref|YP_051210.1| subunit S of type I restriction-modification system [Pectobacterium atrosepticum SCRI1043] gi|49612569|emb|CAG76019.1| subunit S of type I restriction-modification system [Pectobacterium atrosepticum SCRI1043] Length = 551 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 53/453 (11%), Positives = 136/453 (30%), Gaps = 62/453 (13%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQS--D 74 +P+ W+ V + ++ + G+ + + + + + V+ K + +S Sbjct: 100 ELPEVWEWVRLSDISEYIQRGKGPKYAEHGSVKVVSQKCVQWSGFKLEQSRWITDESIHS 159 Query: 75 TSTVSIFAKGQILYGKLGPYL--RKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + G +L+ G + I + + +++ + + ++ Sbjct: 160 YTKDRFLKDGDVLWNSTGAGGTAGRVIYLPVVKEKLVVDSHVTLIRTVRDNGKFISNYIS 219 Query: 130 SIDVTQRIEAICEGA------TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + QR + + + + + +P PP EQ I +K D Sbjct: 220 TYGIQQRFDPKHSNTLLSGTTNQAELNSSVVNSFLVPFPPQREQERINDKAAELMSLCDQ 279 Query: 184 LITERIRFIELLKEKKQALVSYIVT---------------KGLNPDVKMKDS-------- 220 L + + ++ ++ + L++ +V + + + S Sbjct: 280 LEQQSLTSLDAHQQLVETLLATLVDSQHAEELAENWARISQHFDTLFTTEASIDAIKQTI 339 Query: 221 --------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG---NIIQKLE 269 IE K + E+++ L G KL+ Sbjct: 340 LQLAVMGLLIESAEFSQRSHLKKYLSFGPKNGLSPSEVKYETDVKVLKLGATSYGYLKLQ 399 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM--AVKPHG 327 ++Y + +I+ + + N + +I M Sbjct: 400 ETKYVDIDVKDKSYLFLKKNDILIQRGNSSNFVGCSLLIEEDFDDLIYPDLMMKIRTKDE 459 Query: 328 IDSTYLAWLMRSYDLCKVFYAM---GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + Y + S ++ SG + + V+ +P+ VPP Q + I Sbjct: 460 LLPEYAVLWLSSPFARDFMWSKMTGTSGTMPKISKKVVEEIPIAVPPFAVQNQLVIKIKE 519 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSF-IAAAVT 416 L +++ +++ +A A+T Sbjct: 520 LFLLCGSLTSRLQSV------QKTQLHLADALT 546 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 71/211 (33%), Gaps = 15/211 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALV-TELNRKNTKLIESNILSLSYGNIIQKLETR------N 272 S E +P+ WE + K K E + + +Q + Sbjct: 93 SEDEKPFELPEVWEWVRLSDISEYIQRGKGPKYAEHGSVKVVSQKCVQWSGFKLEQSRWI 152 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQND-KRSLRSAQVMERGIITSAYMAVKPHGIDST 331 SY + + G++++ R + V E+ ++ S ++ + Sbjct: 153 TDESIHSYTKDRFLKDGDVLWNSTGAGGTAGRVIYLPVVKEKLVVDSHVTLIRTVRDNGK 212 Query: 332 YLAWLMRSYDLCKVF-------YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +++ + +Y + + F G+ + L V V PP +EQ I + Sbjct: 213 FISNYISTYGIQQRFDPKHSNTLLSGTTNQAELNSSVVNSFLVPFPPQREQERINDKAAE 272 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + D L ++ S+ ++ + +A V Sbjct: 273 LMSLCDQLEQQSLTSLDAHQQLVETLLATLV 303 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 55/192 (28%), Gaps = 13/192 (6%) Query: 30 IKRFTKL--NTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 +K++ G + K D+ + L G K + K Sbjct: 360 LKKYLSFGPKNGLSPSEVKYETDVKVLKLGATSYGYLKLQETKYVDIDVKDKSYLFLKKN 419 Query: 85 QILY---GKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 IL +I D + ++LPE WL S + Sbjct: 420 DILIQRGNSSNFVGCSLLIEEDFDDLIYPDLMMKIRTKDELLPEYAVLWLSSPFARDFMW 479 Query: 139 AICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G TM K + IP+ +PP A Q + KI + +L + + Sbjct: 480 SKMTGTSGTMPKISKKVVEEIPIAVPPFAVQNQLVIKIKELFLLCGSLTSRLQSVQKTQL 539 Query: 197 EKKQALVSYIVT 208 AL + Sbjct: 540 HLADALTDAALN 551 >gi|89902765|ref|YP_525236.1| restriction endonuclease S subunits-like protein [Rhodoferax ferrireducens T118] gi|89347502|gb|ABD71705.1| Restriction endonuclease S subunits-like [Rhodoferax ferrireducens T118] Length = 412 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 58/429 (13%), Positives = 129/429 (30%), Gaps = 56/429 (13%) Query: 20 AIPKHWKVVPIKRFTKL---------NTGRTSESGKD-----IIYIGLEDVESGTGKYLP 65 +P W +V + + N G G D I ++ D+ +G + Sbjct: 11 QLPDGWSLVTVGQLVNEGVIAKPLDGNHGEIHPKGSDFVSDGIPFVMATDINAGKVDLVN 70 Query: 66 -KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQ--FLVLQPKDV 119 K +Q+D+ +L R AI+ + + + Q + KD Sbjct: 71 CKFITKKQADSLAKGFAIPEDVLLTHKATLGRTAIVGELRTPYIMLTPQVTYYRTIKKDR 130 Query: 120 LPELLQGWLLSIDVTQRIEAIC--EGATMSHADWKGIGNIPMPIPPL-AEQVLIREKIIA 176 L + Q G+T ++ ++P+ +P L EQ I + + Sbjct: 131 LHNRFLKYYFDSPFFQDTLVNHGDSGSTRAYVGITAQRDLPIILPNLVREQESIAAVLAS 190 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 +ID L + + + + +G + P Sbjct: 191 LDDKIDLLHRQNQTLEAIAETLFRQWFVEDAQEGWD--------------ERPLSSIANF 236 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 +K + L + + + + IV+ G+++F + Sbjct: 237 LN---GLACQKYPPTNDLEKLPVLKIRELSSGISETADWATSQVKPGYIVEAGDVIFAWS 293 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ 355 + E+ ++ V + + + + A Sbjct: 294 -----ASLMVKVWDGEKCVLNQHLFKVTSDEFPKWFYLRWCKHHLAEFIAVAASHATTMG 348 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK---IEQSIVLLKERRSSFIA 412 +K D+ VLVPP V+ + ++ L+ K I + L++ R + + Sbjct: 349 HIKRGDLDAAMVLVPPPP-------VLETMSRQMQPLLNKQIAIARQRKTLEKLRDTLLP 401 Query: 413 AAVTGQIDL 421 ++G++ + Sbjct: 402 KLMSGEVRV 410 >gi|325696148|gb|EGD38039.1| type I restriction-modification system specificity determinant [Streptococcus sanguinis SK160] Length = 390 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 63/410 (15%), Positives = 136/410 (33%), Gaps = 33/410 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +WK V + + N T G I +E++E T + + + F Sbjct: 2 NNWKKVKLSDIIEFNPRETLSKGAIAKKIAMENLEPFTRDIPEFEY----LEFRGGTKFR 57 Query: 83 KGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G L ++ P L D G ST+F+V++ K+ + + + L I Sbjct: 58 NGDTLMARITPSLENGKTSKVNLLDEDEVGFGSTEFIVVRSKENISDENFVYYLMIAPNI 117 Query: 136 R---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 R I+++ + + N + PPL EQ+ I + + A +I+ Sbjct: 118 REVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGKTLKALDDKIENNKKINHHLE 177 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E + + + +K I+ V DH F +L + Sbjct: 178 E---------ILQANLEKQLESISIKSKIIDLNLTVSDHVANGSFKSLKDNVKLVEKTDY 228 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + ++ N + E R + + + E++ + + + Sbjct: 229 ALFLRNIDLKNHLNG-ERRYVTESSYEFLKKSRLYGHEVIISNVADVGSVHRVPKMNMPM 287 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 + + + + YL S ++ SG +Q D + L + + Sbjct: 288 VAG-NNVVFLQSENSLLTDYLYVYFNSRLGQHDIMSITSGSAQQKFNKTDFRNLEIPILS 346 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +I + + I ++ I + I L + R++ + ++G+I + Sbjct: 347 DD-------IIKKKISSILHYIDNIHEEIACLMKIRATLLPKLLSGEISV 389 >gi|292492041|ref|YP_003527480.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] gi|291580636|gb|ADE15093.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] Length = 451 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 59/431 (13%), Positives = 140/431 (32%), Gaps = 41/431 (9%) Query: 21 IPKHWKVV---PIKRFTKLNT------GRTSES----GKDIIYIGLEDVESGTGKYLPKD 67 +P W+ V +K + + G + S + + I ++ G +++P Sbjct: 5 LPHGWRFVSVEKLKS-AEARSLAAGPFGSSISSRYFVNEGVPVIRGANLSEGKQRFIPSG 63 Query: 68 GNSRQSDTSTVSI---FAKGQILYGKLGPYLRKAIIADFDGICSTQF------LVLQPKD 118 D + G +++ G + +I S L P Sbjct: 64 FAFITRDKAKEFKGAHVKSGDLVFTCWGTLGQVGLIPRDGPYDSYVISNKQLKLRPDPDI 123 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 E L + S + +R + G+ + + + +P+PPL Q I + A Sbjct: 124 ASSEFLYYYFSSPTLRKRFNDVAIGSAVPGINLGILRRELVPLPPLRMQEKIAAILTAYD 183 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 I+ ++ +E + + G +K W D ++ F Sbjct: 184 DLIEVNKRRIALLEKMAEELYREWFVRLRFPGYQDTRFVKGVPEGW-----DVVSLENFC 238 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-----VDPGEIVF 293 +T+ K ++S L ++ NI + +I + G+I++ Sbjct: 239 ETITDGTHDTPKPVDSGHLLVTGKNIKSNQIDFTGAYFISEQDHREISKRSGLREGDILY 298 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I + + + + + + DS +L ++++ + + AM SG Sbjct: 299 SNIGTIGQTAIVGA---KPDYSVKNVIIFRPRNAHDSLFLFHVLKNPAISEHLLAMASGA 355 Query: 354 -RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +Q + + +L P + ++ + + L+ +L R + Sbjct: 356 SQQFIGLGTARSFNILKPNSIILEEFGKTVSKFFEQRNTLISMN----HILCSSRDLLLP 411 Query: 413 AAVTGQIDLRG 423 ++G++ + Sbjct: 412 RLISGKLSVED 422 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 76/210 (36%), Gaps = 17/210 (8%) Query: 6 AYPQYKDS----GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVE 57 +P Y+D+ GV P+ W VV ++ F + T T ++ K + + + ++++ Sbjct: 212 RFPGYQDTRFVKGV------PEGWDVVSLENFCETITDGTHDTPKPVDSGHLLVTGKNIK 265 Query: 58 SGTGKYLPKDGNSRQS--DTSTVSIFAKGQILYGKLGPYLRKAII-ADFDGICSTQFLVL 114 S + S Q + S S +G ILY +G + AI+ A D + Sbjct: 266 SNQIDFTGAYFISEQDHREISKRSGLREGDILYSNIGTIGQTAIVGAKPDYSVKNVIIFR 325 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 L L + +++ + A+ GA+ + + P + + Sbjct: 326 PRNAHDSLFLFHVLKNPAISEHLLAMASGASQQFIGLGTARSFNILKPNSIILEEFGKTV 385 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVS 204 + +TLI+ L+S Sbjct: 386 SKFFEQRNTLISMNHILCSSRDLLLPRLIS 415 >gi|307638191|gb|ADN80641.1| type I restriction-modification system specificity subunit S [Helicobacter pylori 908] gi|325996786|gb|ADZ52191.1| Type I restriction-modification system specificity subunit S [Helicobacter pylori 2018] gi|325998378|gb|ADZ50586.1| Type I restriction-modification specificity subunit [Helicobacter pylori 2017] Length = 277 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 50/288 (17%), Positives = 104/288 (36%), Gaps = 20/288 (6%) Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + G+T I N+ +P+PPL EQ+ I + + +L ++ + K Sbjct: 1 MASGSTFLEVSPNKIKNLLIPLPPLNEQIAIANILSDVDRYLYSLDALILKKESVKKALS 60 Query: 200 QALVSYIVT-KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 L+S KG N + G +G+ K + + I Sbjct: 61 FELLSQRKRLKGFNQAWQRVRLGD--IGITISGLAGKTKQDFINGNAK--------YITF 110 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGII 316 L+ N + + +K E ++ F + + + +++ + Sbjct: 111 LNVLNNVIIDTSILENVKIYPNEKQNSFKKYDLFFNTSSETPKEVGMCAVLLDDIDQVFL 170 Query: 317 TSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIK 373 S + +DS +L++L+ S K F + G R +L + +++PP+ Sbjct: 171 NSFCFGFRIFDKAVDSLFLSYLINSEIGRKAFENLAQGSTRYNLSKSGFNNVCLILPPLN 230 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ I NV++ + I L K Q + + + ++ +I + Sbjct: 231 EQIAIANVLSDLDSEIISLKNKKRQ----FENIKKALNHDLMSAKIRV 274 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 71/193 (36%), Gaps = 13/193 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDII-----YIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ V + +G ++ +D I YI +V + N + + Sbjct: 77 WQRVRLGDIGITISGLAGKTKQDFINGNAKYITFLNVLNNVIIDTSILENVKIYPNEKQN 136 Query: 80 IFAKGQILYGKLGPYLRKAIIA-------DFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 F K + + ++ + D + S F + L +L++ + Sbjct: 137 SFKKYDLFFNTSSETPKEVGMCAVLLDDIDQVFLNSFCFGFRIFDKAVDSLFLSYLINSE 196 Query: 133 V-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + E + +G+T + G N+ + +PPL EQ+ I + I +L ++ +F Sbjct: 197 IGRKAFENLAQGSTRYNLSKSGFNNVCLILPPLNEQIAIANVLSDLDSEIISLKNKKRQF 256 Query: 192 IELLKEKKQALVS 204 + K L+S Sbjct: 257 ENIKKALNHDLMS 269 >gi|225861220|ref|YP_002742729.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae Taiwan19F-14] gi|298503106|ref|YP_003725046.1| type I restriction-modification system subunit S [Streptococcus pneumoniae TCH8431/19A] gi|225727667|gb|ACO23518.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae Taiwan19F-14] gi|298238701|gb|ADI69832.1| type I restriction-modification system S subunit [Streptococcus pneumoniae TCH8431/19A] Length = 373 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 53/400 (13%), Positives = 124/400 (31%), Gaps = 39/400 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + ++ +G +S + + I + DVE G + Sbjct: 2 KKVKLGEVCEILSGYAFKSSQFNDNKIGLPLIRIRDVERGFSD------TYFEGTYPEEY 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ D + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKITDKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H I +I +P EQ LI +K+ I + R E E Sbjct: 115 KTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLN----TISQIYDFRKIQSEKFNELV 170 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + + G + D+ + + E L L Sbjct: 171 KSRFNEMF-------------GENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFL 217 Query: 260 SYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + N+ + + + + + ++ +IV + + Sbjct: 218 NTKNVTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLR 277 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I S + ++P + +++ + + L +K++ + +PP+ Q Sbjct: 278 INSGMVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQ 336 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + + +ID I++S+ L+ + S + Sbjct: 337 NEFADFVV----QIDKSQLAIQKSLEELETLKKSLMQEYF 372 >gi|117676180|ref|YP_863756.1| restriction modification system DNA specificity subunit [Shewanella sp. ANA-3] gi|117615004|gb|ABK50457.1| restriction modification system DNA specificity domain [Shewanella sp. ANA-3] Length = 411 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 51/420 (12%), Positives = 149/420 (35%), Gaps = 44/420 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W V + G + + K + ++ D+ +G+ + + + ++ Sbjct: 4 SWPTVTLDECASFQEGYVNPTQKKEHYFDGPVKWLRAVDLNNGSIRNTSRTLSEEGFKSA 63 Query: 77 TVS--IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S +F G + K G R I+ D+ + + ++ + L + + + Sbjct: 64 GKSALLFEPGTLAISKSGTIGRIGILEDYM-CGNRAVINIKVDKDKCDNLYIFYVLLMSR 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + IE + G+ + +G++ + +PPL Q I +++ +ID E+ Sbjct: 123 RVIETLAVGSVQKNLYTSALGSLELRLPPLQVQAAIAKQLSDLDKKIDLNTQTNQTLEEM 182 Query: 195 LKEKKQAL----------VSYIVTKGLNPDVKMKDSG---IEWVGLVPDHWEVKPFFALV 241 + ++ ++ +G++ +GL+P+ W+ +V Sbjct: 183 AQAIFKSWFVDFDPVKAKMNGKQPEGMDAATASLFPEKLVESELGLIPEGWDAVQVGDIV 242 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 L K + + + + ++ + +G + + +F Sbjct: 243 QRLKPKK-RYTKKQVEPYGKTPVYEQGASILLGFHNDDAGFDASPEDPVFIFGDHTCITH 301 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + I+S + +K + + + ++ + + R+ Sbjct: 302 LSC-------SKFDISSNVIPLKGSVRPTIWTYYAIQGKQEFQEY-------RRHWSEFI 347 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +K V++PP++ ++ + +++E +++ L++ R + + ++G+I+L Sbjct: 348 IKD--VVLPPVELAEKYAELVTTKY----LMMESLKRQSKELEQLRDTLLPKLLSGEIEL 401 >gi|253689248|ref|YP_003018438.1| restriction modification system DNA specificity domain protein [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251755826|gb|ACT13902.1| restriction modification system DNA specificity domain protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 390 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 53/392 (13%), Positives = 127/392 (32%), Gaps = 32/392 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKY-LPKDGNSRQSDTST 77 W + T + S + + +++ + P+ + Sbjct: 2 SWPTYKLTDLCNKITDGSHNPPPGISESKFLMLSSKNIFDDDINFHNPRYLTKDDFEREN 61 Query: 78 VSI-FAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + G +L +G R A++ D VL+PK + + Sbjct: 62 RRTDVSSGDVLLTIVGTVGRAAVVPDGSPKFTLQRSVAVLKPKHGIITSRFLMYTLRSML 121 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + A G K + ++ + +P + Q I + + R + I+L Sbjct: 122 DVLLAGARGVAQQGIYLKQLHDLDIKVPSVEIQKHIVNVLDKASSLCRK----REQGIKL 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 E +A S + NPD +K+ I + + + T Sbjct: 178 ADEFLRATFSNMFG---NPDNNIKNFPIGTIRD---------LVSSASYGLSSKTSKHSG 225 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETY----QIVDPGEIVFRFIDLQNDKRSLRSAQV 310 L GNI + + + LK + +++ G+++F + + + Sbjct: 226 KYPVLRMGNITYQGDWDLIDLKYIDLDEKAQEKFLLEKGDLLFNRTNSKELVGKTAIFEN 285 Query: 311 MERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPV 367 + V+ + I ++ Y+A + S M + ++ ++++ + + Sbjct: 286 DRDMAFAGYLIRVRTNEIGNNYYIAGYLNSLHGKNTLINMSKSIVGMANINAQEMQNIKI 345 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 L+PP + Q + + +I + +E ++S Sbjct: 346 LIPPKELQDNYEKIYKTVKNKIKIHIESKKES 377 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 62/182 (34%), Gaps = 8/182 (4%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM-----GLKPESYETY 283 P + + + + ES L LS NI + E Sbjct: 4 PTYKLTDLCNKITDGSHNPPPGISESKFLMLSSKNIFDDDINFHNPRYLTKDDFERENRR 63 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 V G+++ + + + A + K I S +L + +RS + Sbjct: 64 TDVSSGDVLLTIVGTVGRAAVVPDGSPKFTLQRSVAVLKPKHGIITSRFLMYTLRS--ML 121 Query: 344 KVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 V A G +Q + + + L + VP ++ Q I NV++ ++ + I+ + Sbjct: 122 DVLLAGARGVAQQGIYLKQLHDLDIKVPSVEIQKHIVNVLDKASSLCRKREQGIKLADEF 181 Query: 403 LK 404 L+ Sbjct: 182 LR 183 >gi|315638033|ref|ZP_07893218.1| restriction modification system S chain-like protein [Campylobacter upsaliensis JV21] gi|315481881|gb|EFU72500.1| restriction modification system S chain-like protein [Campylobacter upsaliensis JV21] Length = 591 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 69/189 (36%), Gaps = 9/189 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP W V + ++ +G T K I ++ + DV++ + Sbjct: 130 IPNSWAWVKLGDICEIVSGGTPSRDKIEYWHNGTIPWVKIADVKNNVVNQTQEFITELGL 189 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + S+ IF KG +LY + L + I + D + L + L + Sbjct: 190 ENSSAKIFKKGTLLYT-IFATLGETAILNIDAATNQAIAALIETYDYDTKFLMYCLM-SM 247 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ G ++ + + N +P+PPL EQ I +K+ + + Sbjct: 248 KDYVNSLGRGVAQNNINQTMLKNFTIPLPPLCEQQEIVKKLDLLVSLANDFAITKENLKR 307 Query: 194 LLKEKKQAL 202 + K ++ + Sbjct: 308 IEKRIEKRI 316 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 71/203 (34%), Gaps = 20/203 (9%) Query: 229 PDHWEVKPFFALVTELNR------KNTKLIESNILSLSYGNIIQKLETRNMGLKPE---S 279 P+ W + ++ K I + ++ + + E Sbjct: 131 PNSWAWVKLGDICEIVSGGTPSRDKIEYWHNGTIPWVKIADVKNNVVNQTQEFITELGLE 190 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + +I G +++ + L + I ++ + D+ +L + + S Sbjct: 191 NSSAKIFKKGTLLYTIFATLGETAILNIDAATNQAIA----ALIETYDYDTKFLMYCLMS 246 Query: 340 YDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV-LVEKIE 397 + ++G G + ++ +K + +PP+ EQ +I +++ + + + K Sbjct: 247 --MKDYVNSLGRGVAQNNINQTMLKNFTIPLPPLCEQQEIVKKLDLLVSLANDFAITKEN 304 Query: 398 -QSIVLLKERR--SSFIAAAVTG 417 + I E+R S + A+ G Sbjct: 305 LKRIEKRIEKRIEKSLLKLALEG 327 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 61/213 (28%), Gaps = 10/213 (4%) Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP 236 + + I + + KE +S P +G + + + Sbjct: 379 KKALCKSQIQMLKKELTKCKEITPLNLSEA------PFTIPNSWAWVKLGDICEMKKGPF 432 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 A+ ++ N + + L + L+ V +I+ Sbjct: 433 GSAITKDMFIPNGNNAVKIYEQKNAIQKSETLGEYYISLEHFEKLKQFEVFENDIIVSCA 492 Query: 297 DLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + R + +GII A M + Y K + Sbjct: 493 GTIGE--IFRIPKNAPKGIINQALMKIKLVNEEWIPYFMIFFDFLIKQKSQENSKGSAIK 550 Query: 356 SLK-FEDVKRLPVLVPPIKEQFDITNVINVETA 387 ++ + +K + +PP++EQ IT +++ Sbjct: 551 NIPPLDILKNFSIPLPPLQEQEYITQILDTLFT 583 >gi|170768570|ref|ZP_02903023.1| type I restriction modification DNA specificity domain protein [Escherichia albertii TW07627] gi|170122674|gb|EDS91605.1| type I restriction modification DNA specificity domain protein [Escherichia albertii TW07627] Length = 456 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 53/430 (12%), Positives = 136/430 (31%), Gaps = 59/430 (13%) Query: 38 TGRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA-KGQILYGKLGPYL 95 G+T + I I + +++G + + + D V +G ++ P Sbjct: 21 RGKTPKKVDNGIPLITAKIIKNGRIQEVNEFIAINDYDDWMVRGLPLEGDVVLTTEAPLG 80 Query: 96 RKAIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAICEGATMSHADWKG 153 A + + + + L+ K + + L L S V +++ G+T++ Sbjct: 81 EVAQLDSRKVALAQRVITLRGKKGILENDYLLYLLQSSFVQNQLDGRASGSTVTGIKQSE 140 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA-------LVSYI 206 + I + +PP++ Q I ++ +ID ++ + ++ ++ Sbjct: 141 LREIILRLPPVSLQKSISHQLKCLDKKIDLNNKINKTLEQMSQTLFKSWFVDFDPVIDNA 200 Query: 207 VTKGLNPDVKMKDSGIE-----------------------------WVGLVPDHWEVKPF 237 + G NP + + E +G VP +W V Sbjct: 201 LDAG-NPIPEALQTRAELRQKVRNSADFKPLPAEIRSLFPNKFEETELGWVPKYWFVTEL 259 Query: 238 FALVTELNRKNTKLIE---------SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 L+T + + I +S + + + + +K E ++ Sbjct: 260 GKLITVKRGGSPRPIHDFLCNKGLPWVKISDATASNSRFINLTKDFIKTEGLNKTVLLKK 319 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G ++ + I ++ + + + K+ Sbjct: 320 GSLILSNSATPG-----LPKFLDIDACIHDGWLHFPKKKRLTDIYLYNLFLEIKEKLISQ 374 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +LK + +K + VP I + + + + + + ++I L R Sbjct: 375 GNGSVFTNLKTDILKDYKIAVPGHD----IISYFDKISRELHNKIHSVTENINTLVALRD 430 Query: 409 SFIAAAVTGQ 418 + + ++G+ Sbjct: 431 TLLPKLISGE 440 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 70/190 (36%), Gaps = 9/190 (4%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK----PESYE 281 G ++ + + K K +++ I ++ I + + Sbjct: 2 GNNYIEMRLEDCMDAIIDYRGKTPKKVDNGIPLITAKIIKNGRIQEVNEFIAINDYDDWM 61 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + G++V + L S +V + + K +++ YL +L++S Sbjct: 62 VRGLPLEGDVVLTTEAPLGEVAQLDSRKVALAQRV--ITLRGKKGILENDYLLYLLQSSF 119 Query: 342 LCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + SG +K +++ + + +PP+ Q I++ + +ID L KI +++ Sbjct: 120 VQNQLDGRASGSTVTGIKQSELREIILRLPPVSLQKSISHQLKCLDKKID-LNNKINKTL 178 Query: 401 VLL-KERRSS 409 + + S Sbjct: 179 EQMSQTLFKS 188 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 64/194 (32%), Gaps = 9/194 (4%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLP-KDGNS 70 +G +PK+W V + + + G + K + ++ + D + +++ Sbjct: 247 LGWVPKYWFVTELGKLITVKRGGSPRPIHDFLCNKGLPWVKISDATASNSRFINLTKDFI 306 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + + KG ++ D D + PK + + L Sbjct: 307 KTEGLNKTVLLKKGSLILSNS-ATPGLPKFLDIDACIHDG-WLHFPKKKRLTDIYLYNLF 364 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +++ +++ + G+ ++ + + + +P + +I ++ Sbjct: 365 LEIKEKLISQGNGSVFTNLKTDILKDYKIAVPGHDIISYFDKISRELHNKIHSVTENINT 424 Query: 191 FIELLKEKKQALVS 204 + L L+S Sbjct: 425 LVALRDTLLPKLIS 438 >gi|331090321|ref|ZP_08339205.1| hypothetical protein HMPREF1025_02788 [Lachnospiraceae bacterium 3_1_46FAA] gi|330401456|gb|EGG81041.1| hypothetical protein HMPREF1025_02788 [Lachnospiraceae bacterium 3_1_46FAA] Length = 359 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 42/388 (10%), Positives = 117/388 (30%), Gaps = 32/388 (8%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + + G ++ + L DV G++ + + Sbjct: 3 VKLGDVCE--RGTSN--------LKLSDVSEKNGEFSVFGASGYIGSVDFYQQGYP-YVA 51 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 K G + +A++ L PKD + +++ +E GAT+ Sbjct: 52 VVKDGAGIGRAMLCPGKTSVIGTMQYLLPKDNILPKYLFYVVK---YMNLEKYFTGATIP 108 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 H +K N QV I + + + +I + ++LL + +A + Sbjct: 109 HIYFKDYKNEEFNFDFWERQVEIVSVL----SKCEKVIDLCKQELQLLDKLIKARFVELF 164 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 ++ + ++ + +G + L K + ++ N Sbjct: 165 GDPVSNSYGLPEATLPDLGEFGRGVSKHRPRNDIKLLGGKYPLIQTGDV-----ANAGLY 219 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + + + ++ D G + ++A + + + + Sbjct: 220 ITSYSSTYSELGLKQSKMWDKGTLCI-----TIAANIAKTAILEFDACFPDSVVGFIANE 274 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + S+ + ++++ + + L V+VP ++Q + + Sbjct: 275 RTNNIFVHYWFSFFQAILESQAPESAQKNINLKILSELKVIVPEKRKQDQFASFV----K 330 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAV 415 D +++++ + S + Sbjct: 331 LTDKSKVAVQKALDEAQLLFDSLMQEYF 358 >gi|94263943|ref|ZP_01287746.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] gi|93455688|gb|EAT05867.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] Length = 414 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 60/413 (14%), Positives = 133/413 (32%), Gaps = 41/413 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W VV ++ +L G + +G+ + + I Sbjct: 20 WAVVELRNIARLKYGENLSGSSMLP---------DGFPVFGANGHIGYYEKPNLFI---D 67 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ G +A + +V++ + V + + G+ Sbjct: 68 SVIVSCRGENSGVINLAPAHSFVTNNSIVIELLAQEIYAGYLFYALQLVPKA--RMVSGS 125 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + +P +EQ I + ID I I+ ++ K L+ Sbjct: 126 AQPQVVINDLQKISVNLPSYSEQQKIAHIL----QTIDRAIERTEALIDKYQQIKAGLMH 181 Query: 205 YIVTKGLNPDVKMKDSGIE--------WVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + T+G+ P+ +++ + +G +P WEVK LV + + + L+ I Sbjct: 182 DLFTRGIGPNGQLRPPRDQAPELYQQTPIGRIPKEWEVKNILDLVEFPSGQVSPLVSPYI 241 Query: 257 LSL-----SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +L R + + + + G+IV+ I K L Sbjct: 242 DMSLVAPDHIERNTGRLMLRETAREQGAISGKYVFESGDIVYSKIRPYLRKAILADFD-- 299 Query: 312 ERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLV 369 GI ++ +K + + ++ ++ + + V Sbjct: 300 --GICSADMYPLKVKQGNDPLFIFGVILGERFSTYAESVSMRSGFPKINRSEFSGFSCAV 357 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 P EQ I+ +I A+I E+ + L++++S + TG++ + Sbjct: 358 PSNNEQMKISEIIESAEAKIKS----NEKLLQKLQKQKSGLMYDLFTGRVQVP 406 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 50/195 (25%), Positives = 82/195 (42%), Gaps = 4/195 (2%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IG IPK W+V I + +G+ S D+ + + +E TG+ + ++ Q Sbjct: 210 IGRIPKEWEVKNILDLVEFPSGQVSPLVSPYIDMSLVAPDHIERNTGRLMLRETAREQGA 269 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE-LLQGWLLSIDV 133 S +F G I+Y K+ PYLRKAI+ADFDGICS L+ K + G +L Sbjct: 270 ISGKYVFESGDIVYSKIRPYLRKAILADFDGICSADMYPLKVKQGNDPLFIFGVILGERF 329 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + E++ + + +P EQ+ I E I + +I + + + Sbjct: 330 STYAESVSMRSGFPKINRSEFSGFSCAVPSNNEQMKISEIIESAEAKIKSNEKLLQKLQK 389 Query: 194 LLKEKKQALVSYIVT 208 L + V Sbjct: 390 QKSGLMYDLFTGRVQ 404 >gi|300853532|ref|YP_003778516.1| type I restriction enzyme, specificity subunit [Clostridium ljungdahlii DSM 13528] gi|300433647|gb|ADK13414.1| type I restriction enzyme, specificity subunit [Clostridium ljungdahlii DSM 13528] Length = 417 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 68/408 (16%), Positives = 138/408 (33%), Gaps = 43/408 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-------IGLEDVESGTGKYLPKD------GNSR 71 W+ + S S D+ Y + DV G+ L + ++ Sbjct: 19 WEQRKFSGIF-IYLQNNSLSRTDLNYEQGSVKNVHYGDVLIKFGEVLDVEKTEIPFISNN 77 Query: 72 QSDTSTVSIFAKGQILYGKL--GPYLRKAIIADFDGICS-----TQFLVLQPKDVLPELL 124 + +TS+ S+ G I+ + K G S K L Sbjct: 78 EFNTSSTSLLRNGDIVIADAAEDETVGKCSEIKGIGCISIVSGLHTIPCRPIKTFETGYL 137 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 ++ S ++ + +G +S + N + P ++ L KI +D+L Sbjct: 138 GYYMNSSAYHDQLLPLIQGTKISSISKSALQNTEIIYPDSEKEQL---KIGQFFQNLDSL 194 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 IT R + L K++++ + + S + + + K Sbjct: 195 ITLHQRKYDKLIIVKKSMLEKMFPIDGS------GSNVPEIRFGGFTDDWKFRKLGDCFS 248 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 R + I I + E + Y+ V G+I + + + Sbjct: 249 ERSESMPDGELISVTINDGIKKFSELGRHDTSNDDKSKYKKVCVGDIAYNSMRMWQGASG 308 Query: 305 LRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFE 360 + GI++ AY + P+ IDS +++L + D+ F G+ +LK++ Sbjct: 309 YSPYE----GIVSPAYTVLAPNNGIDSKCISYLFKRPDMIHTFQVNSQGITSDNWNLKYQ 364 Query: 361 DVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + + +L+P I+EQ I +D L+ ++ + L+ R Sbjct: 365 ALSEIEILIPNDIQEQKYIAEY----FTGLDNLITLHQRKLEKLRNIR 408 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 57/184 (30%), Gaps = 4/184 (2%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + + + ++I + + D + D + D S Sbjct: 237 DWKFRKLGDCFSERSESMPDG--ELISVTINDGIKKFSELGRHDTS--NDDKSKYKKVCV 292 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G I Y + + + + ++GI S + VL P + + +L + Sbjct: 293 GDIAYNSMRMWQGASGYSPYEGIVSPAYTVLAPNNGIDSKCISYLFKRPDMIHTFQVNSQ 352 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 S + + + ++ I +D LIT R +E L+ + + Sbjct: 353 GITSDNWNLKYQALSEIEILIPNDIQEQKYIAEYFTGLDNLITLHQRKLEKLRNIRFSCT 412 Query: 204 SYIV 207 + Sbjct: 413 EKMF 416 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 64/214 (29%), Gaps = 17/214 (7%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + + K SGI + + + + K + + + +G ++ Sbjct: 12 FKGFTDAWEQRKFSGIFI------YLQNNSLSRTDLNYEQGSVKNVHYGDVLIKFGEVLD 65 Query: 267 KLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 +T + + T ++ G+IV + + I S + Sbjct: 66 VEKTEIPFISNNEFNTSSTSLLRNGDIVIADAAEDETVGKCSEIKGIGCISIVSGLHTIP 125 Query: 325 P---HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP-PIKEQFDIT 379 ++ YL + M S + G S+ ++ ++ P KEQ I Sbjct: 126 CRPIKTFETGYLGYYMNSSAYHDQLLPLIQGTKISSISKSALQNTEIIYPDSEKEQLKIG 185 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ ++ L + S + Sbjct: 186 QF----FQNLDSLITLHQRKYDKLIIVKKSMLEK 215 >gi|254429566|ref|ZP_05043273.1| Type I restriction modification DNA specificity domain protein [Alcanivorax sp. DG881] gi|196195735|gb|EDX90694.1| Type I restriction modification DNA specificity domain protein [Alcanivorax sp. DG881] Length = 471 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 57/468 (12%), Positives = 138/468 (29%), Gaps = 82/468 (17%) Query: 26 KVVPIKRFTKLN---TGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS-- 79 K P++ +L T + + +I + +++ +G + S + Sbjct: 4 KTTPLEELCELVVDCPHSTPKWKSEGVIVLRNQNIRNGQLDLSSPSYTDEEGYQSRIKRA 63 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGIC-STQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + G I++ + P +I + C + ++L+PK + W L Q Sbjct: 64 VPQAGDIVFTREAPMGEVCLIPEGLKCCLGQRQVLLRPKKEISGEYLYWALQSPFVQHQI 123 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + EG + ++ + + + Q + I + I + + L++ Sbjct: 124 SWNEGTGTTVSNVRIPV---LKSLEIPRQSEHEQSIANILGALSERIQSNHQINQTLEKI 180 Query: 199 KQALVSYIVTKG------------------------------------------------ 210 QA+ Sbjct: 181 AQAIFKSWFVDFEPVKAKIAALEAGGSEEDALFTAIQAISGKTTDELARLQAEQPDRYAD 240 Query: 211 LNPDVKMKDSGIE--WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS---------- 258 L ++ S +E +G +P+ W+ K E + + Sbjct: 241 LRATAELFPSTLEDSELGGIPEGWDTCQAHERFEITIGKTPPRKEPHWFTEDPKDIKWLS 300 Query: 259 ---LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + GN+ + + + +I PG ++ F S I Sbjct: 301 IKGMGDGNVFSSVTEEYLIADAVAKHNVKICPPGTVLLSFKLTLGRVMICSSEMTTNEAI 360 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 A+ + + + + S+D + S + ++ + +K + +VP Sbjct: 361 ---AHFRINDDSPGTYWTYLWLSSFDYSSL--GSTSSIATAVNSKTIKGMQFVVPNPVL- 414 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 N + I ++ +++ L E R + + +TG+++L Sbjct: 415 ---LNYFESKMEPIFQQIQTTQENSCSLAELRDALLPKLLTGELELPD 459 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 59/211 (27%), Gaps = 20/211 (9%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG---------KDIIYIGLEDVESGT-- 60 DS +G IP+ W ++ G+T KDI ++ ++ + G Sbjct: 254 DSE---LGGIPEGWDTCQAHERFEITIGKTPPRKEPHWFTEDPKDIKWLSIKGMGDGNVF 310 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + + V I G +L L + +I + + + D Sbjct: 311 SSVTEEYLIADAVAKHNVKICPPGTVLLS-FKLTLGRVMICSSEMTTNEAIAHFRINDDS 369 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 P +L + + + ++ ++ Sbjct: 370 PGTYWTYLWLSSFDYSSLGSTSSIATAVNSKT-----IKGMQFVVPNPVLLNYFESKMEP 424 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGL 211 I I L E + AL+ ++T L Sbjct: 425 IFQQIQTTQENSCSLAELRDALLPKLLTGEL 455 >gi|169350756|ref|ZP_02867694.1| hypothetical protein CLOSPI_01529 [Clostridium spiroforme DSM 1552] gi|169292619|gb|EDS74752.1| hypothetical protein CLOSPI_01529 [Clostridium spiroforme DSM 1552] Length = 397 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 59/406 (14%), Positives = 116/406 (28%), Gaps = 43/406 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + K E + I + E L + T+ + Sbjct: 14 DWEQRKLGEIFK------YEQPQAYIVESTDYDEKNNIPVLTAGQSFILGYTNEQFGIKE 67 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--------LPELLQGWLLSIDVTQ 135 R +I D S+ ++ K L + +V Q Sbjct: 68 ---------ASGRNPVIIFDDFTTSSHYVDFPFKVKSSAIKLLSLNNPNDNMHCAYNVLQ 118 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I + ++ +P + EQ I + + +D LIT R + + Sbjct: 119 CIGYLPVSHERHWISIFSKFDVLLP-KSIDEQEQIGQYL----ANLDNLITLHQRKCDEI 173 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K+ K+ ++ + + K++ G E+ A + N + ++ +E+ Sbjct: 174 KKLKKYMLQNMFPQNGEKAPKIRFDGFTDDWEQRKLSEIATMHARIGWQNLRTSEFLENG 233 Query: 256 ILSLSYG-----NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR---SLRS 307 L G I + + + + G I+ L Sbjct: 234 DYMLITGTDFVDGSINYSTCYFVNKERYEQDKNIQIKNGSILITKDGTLGKVALVQGLSM 293 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLP 366 + GI ID+ YL +++ L G + L + P Sbjct: 294 PATLNAGIFN--IEIKNELEIDNKYLFQYLKAPFLLDYVKKRATGGTIKHLNQNILVNFP 351 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 VL P EQ I + +D L+ ++ LK + + Sbjct: 352 VLTPQKLEQTKIGQY----FSNLDNLITLHQRKCDELKNMKKFMLQ 393 >gi|167768050|ref|ZP_02440103.1| hypothetical protein CLOSS21_02594 [Clostridium sp. SS2/1] gi|167710379|gb|EDS20958.1| hypothetical protein CLOSS21_02594 [Clostridium sp. SS2/1] Length = 391 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 55/404 (13%), Positives = 127/404 (31%), Gaps = 33/404 (8%) Query: 28 VPIKRFTKLNTG---RTSESGKDI------IYIGLEDVESGTGKYLP-KDGNSRQSDTST 77 V + L G + S K+I ++ + ++ + + Sbjct: 4 VKLGDIAVLINGDRGKNYPSQKEIITSGGIPFVNAGHLNGRAIEFEAMNYITPEKYEKLN 63 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWL--LSIDVT 134 F + ILY G +KA+I D G ++ ++++P L + + Sbjct: 64 SGKFQQNDILYCLRGSLGKKALINDNIYGAIASSLVIIRPNLEKVRPQYLMLALETPLIK 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +++ G++ + K + + +P L Q I K+ ++ LI + + L Sbjct: 124 EQLFKFNNGSSQPNLSAKSVKEYKLELPDLFIQDSIISKL----EKVRNLIEDEKQEKLL 179 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L QA + + D K + ++ + + Sbjct: 180 LDNLIQARFVELFGDAVYNDKKWETDTVKNLCKEIYGGGTPSKAHP--------EYYKDG 231 Query: 255 NILSLSYGNIIQ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +I +S ++ K + T ++V ++ K +L A Sbjct: 232 DIPWVSAKDMKTDVLKDSQIKINQLGVDNSTARLVPVNSVIMVIRSGIL-KHTLPVAVNK 290 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + P T + + + + +++F +K+ ++VPP Sbjct: 291 VPITVNQDLKVFIPGERILTRFLAVQFKMQEKDILSGVRAVTADNIEFNSLKQRRMIVPP 350 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I Q + RID ++++S+ + S + Sbjct: 351 IDLQQKYLMFLE----RIDKSKFEVQKSLEKTQLLYDSLMQEYF 390 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 53/190 (27%), Gaps = 12/190 (6%) Query: 25 WKVVPIKRFT-KLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ +K ++ G T DI ++ +D+++ K N D S Sbjct: 202 WETDTVKNLCKEIYGGGTPSKAHPEYYKDGDIPWVSAKDMKTDVLKDSQIKINQLGVDNS 261 Query: 77 TVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 T + ++ L+ + + V P + + + Sbjct: 262 TARLVPVNSVIMVIRSGILKHTLPVAVNKVPITVNQDLKVFIPGERILTRFLAVQFKMQ- 320 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + I + T + ++ + M +PP+ Q + + + Sbjct: 321 EKDILSGVRAVTADNIEFNSLKQRRMIVPPIDLQQKYLMFLERIDKSKFEVQKSLEKTQL 380 Query: 194 LLKEKKQALV 203 L Q Sbjct: 381 LYDSLMQEYF 390 >gi|332655466|ref|ZP_08421203.1| restriction modification system, type I [Ruminococcaceae bacterium D16] gi|332515601|gb|EGJ45214.1| restriction modification system, type I [Ruminococcaceae bacterium D16] Length = 393 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 62/400 (15%), Positives = 134/400 (33%), Gaps = 35/400 (8%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY- 88 + F + R ++ ++ + S ++P N+ +D + +GQ Y Sbjct: 8 LGDFIRQVDVRNTDGKEENLL-----GVSVQKMFIPSIANTVGTDFTKYKEVKRGQFTYI 62 Query: 89 ---GKLGPYLRKAIIADFD-GICSTQFLVLQPKDVL---PELLQGWLLSIDVTQRIEAIC 141 + G + A++ D+D G+ S + V + KD PE L W + + Sbjct: 63 PDTSRRGDKIGIALLTDYDEGLVSNIYTVFEVKDENELLPEYLMLWFSRPEFDRYARFKS 122 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ DW + + +P+P + +Q I + I I + R + L E + Sbjct: 123 HGSVREIMDWDEMCKVELPVPSIDKQRSIVK----AYQTITERIELKRRINDNLVELCKT 178 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + D +G + PF + + K ++ + L+ Sbjct: 179 EFMRTFATHPEYRDEQSDWFSHPLGKSLSRVAMGPFGSNI-----KTDCFVDHGVPVLNG 233 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDP----GEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 NI L + E + Q+ + G+IV + +R +I+ Sbjct: 234 DNISGYLLSERSFRYVEDEKASQLKNSIAVSGDIVITHRGTLGQVALVPDKTKFDRYVIS 293 Query: 318 SAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKF--EDVKRLPVLVPPI 372 + + Y+ + + + A + S+ +K L + +PPI Sbjct: 294 QSQFLLACDQCALLPEYVLFYFHTDAGRRKLLANDNTTGVPSIAKPTSYIKALHIPIPPI 353 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + Q + ++ A V + L + + ++ Sbjct: 354 ELQQNWAVLVRATLA----AVADNNLEMEKLTDFAQTLLS 389 >gi|119356951|ref|YP_911595.1| restriction modification system DNA specificity subunit [Chlorobium phaeobacteroides DSM 266] gi|119354300|gb|ABL65171.1| restriction modification system DNA specificity domain [Chlorobium phaeobacteroides DSM 266] Length = 413 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 53/400 (13%), Positives = 115/400 (28%), Gaps = 20/400 (5%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + F ++ G+ + + + L D R +D + Sbjct: 2 KTVKLGTFITISKGKKHTLSEMP---SSQSIRMLGIDDLRNDTLIRMTDDKDGVLACVDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +L G I ST + + + GAT Sbjct: 59 VLIAWDGANAGTIGYGKQGYIGSTISRLRLHDTSKFFAPFIGMFLQSNFSYLRKTATGAT 118 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + H + + +I +P+ +Q+ I + I + + EL L S Sbjct: 119 IPHINRNALESIQVPVFTYGDQICIATLLSKVENLISRRREQLKQLDEL-------LKSV 171 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + +P + K I+ + + + +KN L+ES I + NI Sbjct: 172 FLEMFGDPMINPKKFPIKLLSEFYINSKHGTKCGPFGSALKKNE-LLESGIAVWNMDNIS 230 Query: 266 QKLETRNMGLKPESYETYQIVDP-----GEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 S E +Q + G+I+ ++ + Sbjct: 231 SSGIMILPFRMWVSEEKFQELRAYSVINGDIIISRAGTVGKMCVAKTDGIPAIISTNLIR 290 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + + ++ + G + + L P I+ Q + Sbjct: 291 LRLNSLLLPLYIVSLMTYCNGRVGRLKTGADGTFTHMNTGILDILEFPYPSIELQRQFAD 350 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ +++ + QS+ L+ + A G++D Sbjct: 351 IVE----KVESIKVYYHQSLAELQNLYGTLSQKAFKGELD 386 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 55/179 (30%), Gaps = 10/179 (5%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 + +T K L E S ++ + RN L + + ++ + Sbjct: 1 MKTVKLGTFITISKGKKHTLSEM--PSSQSIRMLGIDDLRNDTLIRMTDDKDGVLACVDD 58 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 V D N ++ + G S + ++ ++S + Sbjct: 59 VLIAWDGAN-AGTIGYGKQGYIGSTISRLRLHDTSKFFAPFIGMFLQSNF--SYLRKTAT 115 Query: 352 G-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G + ++ + V V +Q I ++++ L+ + + + L E S Sbjct: 116 GATIPHINRNALESIQVPVFTYGDQICIA----TLLSKVENLISRRREQLKQLDELLKS 170 >gi|30065580|ref|NP_839751.1| hypothetical protein S4635 [Shigella flexneri 2a str. 2457T] gi|30043844|gb|AAP19563.1| hypothetical protein S4635 [Shigella flexneri 2a str. 2457T] gi|313646315|gb|EFS10777.1| type I restriction enzyme EcoAI specificity [Shigella flexneri 2a str. 2457T] Length = 551 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 47/190 (24%), Positives = 78/190 (41%), Gaps = 2/190 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLED-VESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ I + + S +Y D +E GTG+ + K Sbjct: 363 ELPEGWEWCRIGNIVNIKSELVSPKDYLNLYQVAPDIIEKGTGRVISKRTVKESGVKGPN 422 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 S F KGQI+Y K+ P L K +A+++G+CS L + P L ++LSI +++ Sbjct: 423 SRFYKGQIVYSKIRPSLSKVFLAEYNGLCSADMYPLDC-YINPNYLLKYILSIPFLMQVK 481 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 M + NI + IPP EQ I +KI + + LI+ + + Sbjct: 482 KAENRIKMPKLNSDSFYNIIVAIPPYNEQQAIFDKINSIEAVCNGLISYIGIYHKTQLHL 541 Query: 199 KQALVSYIVT 208 AL + Sbjct: 542 ADALTDAAIN 551 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 56/453 (12%), Positives = 118/453 (26%), Gaps = 65/453 (14%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESG 59 +K K P+ S + +P+ W+ V + ++ +I+ G V Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVHLPDIYCSISESSRKIKSSEILPEGKYPVIEQ 140 Query: 60 TGKYLPKDGNSR------------QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC 107 + +++ N+ D + F + G G + I+ Sbjct: 141 SQEFISGYCNNECLLIKLNNPVIVFGDHTRNIKFIDFDFVVGADGVKILSPILICERFFF 200 Query: 108 ST-------------QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGI 154 F VL + ++ + ++C+ Sbjct: 201 WQLRSFKLDVRGYARHFKVLNSCLFALPPIAEQERIVEKVSSLMSLCDQLEQQSLTSLDA 260 Query: 155 GN-IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 + + ++ RI + KQ ++ V L P Sbjct: 261 HQQLVETLLGTLTDSQNTAELAENWARISEHFDTLFTTEASVDALKQTILQLAVMGKLVP 320 Query: 214 DVKMKD-----------------------------SGIEWVGLVPDHWEVKPFFALVTEL 244 + S E +P+ WE +V Sbjct: 321 QDPNDEPASELLKRIAQEKAQLVKEGKIQKPLPPISDEEKPFELPEGWEWCRIGNIVNIK 380 Query: 245 ---NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 L + ++ ++ + G+IV+ I Sbjct: 381 SELVSPKDYLNLYQVAPDIIEKGTGRVISKRTVKESGVKGPNSRFYKGQIVYSKIRPSLS 440 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 K L G+ ++ + + + L +++ L +V A L + Sbjct: 441 KVFLAEY----NGLCSADMYPLDCYINPNYLLKYILSIPFLMQVKKAENRIKMPKLNSDS 496 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + V +PP EQ I + IN A + L+ Sbjct: 497 FYNIIVAIPPYNEQQAIFDKINSIEAVCNGLIS 529 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 65/192 (33%), Gaps = 15/192 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE + ++ + K+ S IL +I++ + G Sbjct: 93 SEEEKPFELPEGWEWVHLPDIYCSISESSRKIKSSEILPEGKYPVIEQSQEFISGYCNNE 152 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++ V F D + + + + P I + W +RS Sbjct: 153 ---CLLIKLNNPVIVFGDHTRN----IKFIDFDFVVGADGVKILSPILICERFFFWQLRS 205 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + L YA F+ + +PPI EQ I ++ + D L ++ S Sbjct: 206 FKLDVRGYAR--------HFKVLNSCLFALPPIAEQERIVEKVSSLMSLCDQLEQQSLTS 257 Query: 400 IVLLKERRSSFI 411 + ++ + + Sbjct: 258 LDAHQQLVETLL 269 >gi|210610699|ref|ZP_03288580.1| hypothetical protein CLONEX_00770 [Clostridium nexile DSM 1787] gi|210152332|gb|EEA83338.1| hypothetical protein CLONEX_00770 [Clostridium nexile DSM 1787] Length = 405 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 67/408 (16%), Positives = 141/408 (34%), Gaps = 26/408 (6%) Query: 27 VVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQ-SDTSTV 78 +V ++ ++L TG + I I ++++ G+ D S + + Sbjct: 3 IVKLRDISELKTGPFGTQFRASEYVTEGIPVINVKNIGYGSLLVSGLDHVSENTLERLSE 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVL--PELLQGWLLSIDVT 134 +G I++G+ G R +I + + + ++ D + PE + +LL+ V Sbjct: 63 HKLQEGDIVFGRKGSVDRHCLIRKGQDGWMQGSDCIRVRFTDAIVYPEFVSYYLLTDAVK 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +I G+TM+ + +G+I + +P EQ I + ID I+ + + Sbjct: 123 MKINNSAVGSTMASLNTDILGDIDIILPDCEEQKRIALIL----GTIDKKISNNNQINDY 178 Query: 195 LKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 L+E + + +Y + PD K SG + + E+ + + N Sbjct: 179 LEEMAKTIYNYWFIQFDFPDENGKPYKSSGGKMSFCNELNREIPQNWNYTSIGNITICLD 238 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ-V 310 E LS ++ Y I ++ D Q V Sbjct: 239 SERIPLSNQQREGMKGSIPYYGATGIMDYVNRPIFSGNFVLLAEDGSVMDDNGNPILQRV 298 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 I + ++P S L +L+ + ++ + ++ +L Sbjct: 299 SGDVWINNHTHVLQPVKGYSCRLLYLLLKDIPVSIIK--TGSIQMKINQANLNNYNILSI 356 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P + N + +D + +I+Q L + R + + GQ Sbjct: 357 PDAIRTQFINCVE----PLDTKIMQIQQENNNLIQFRDWLLPMLMNGQ 400 >gi|323697973|ref|ZP_08109885.1| restriction modification system DNA specificity domain [Desulfovibrio sp. ND132] gi|323457905|gb|EGB13770.1| restriction modification system DNA specificity domain [Desulfovibrio desulfuricans ND132] Length = 499 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 68/458 (14%), Positives = 134/458 (29%), Gaps = 73/458 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W + + T T + + E +E+ + + T Sbjct: 14 ELPVQWDWAVFQDIFEDLTSSTKKVKQK------EYIENAPLAVVDQGVALIGGATDKFD 67 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + +G + G + R + DF V K + P + +++ Q + Sbjct: 68 LAFEGDLPVIVFGDHTRCVKLVDFP-FVQGADGVKVLKPLSPLSTNLYSYALNTVQLPDR 126 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 +K + P+PPL EQ I +KI A + LL++ + Sbjct: 127 GYSRH------FKFLKATEFPVPPLNEQRRIADKIDALQAKSRRAREALETVGPLLEKFR 180 Query: 200 QALVSYIVTKGLNPDVKMKDSGIE------------------------------------ 223 Q++++ L + + + +E Sbjct: 181 QSVLAAAFRGDLTAEWREQHPDVEPAEKLLERIRVERRARWEEAELAKMRAKGINPKNDK 240 Query: 224 --------------WVGLVPDHWEVKPFFALVTELNRKNTKLI---ESNILSLSYGNIIQ 266 + +P+ W L ++ + E+ + L GNI+ Sbjct: 241 WKAKYKEPEPVDASGLPELPEGWCWAKVEELACDVRYGTSAKTSDDETQMPVLRMGNIVD 300 Query: 267 KLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + LK + + G+I+F + E S + Sbjct: 301 GDLVYD-NLKYLDRSHKDLSELCLHYGDILFNRTNSAELVGKTAMFDSDEDFSFASYLIR 359 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITN 380 V+ I + W + S + S + ++ +K L V +PP EQ ++ Sbjct: 360 VRVLQIVPEVVVWYINSPFGRQWVSQNVSQQVGQANINGSKLKALAVPIPPQDEQVELAR 419 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I A I E I + S +A A G+ Sbjct: 420 KIKQTLAVIKGQRENSIGLIGQVANLDQSILAKAFRGE 457 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 84/229 (36%), Gaps = 20/229 (8%) Query: 3 HYKAYPQYKD------SGVQWIGAIPKHWKVVPIKRF-TKLNTGRTSESGKDI---IYIG 52 +KA +YK+ SG + +P+ W ++ + G ++++ D + Sbjct: 240 KWKA--KYKEPEPVDASG---LPELPEGWCWAKVEELACDVRYGTSAKTSDDETQMPVLR 294 Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIA-DFDGICS 108 + ++ G Y R + G IL+ + + A+ D D + Sbjct: 295 MGNIVDGDLVYDNLKYLDRSHKDLSELCLHYGDILFNRTNSAELVGKTAMFDSDEDFSFA 354 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQ 167 + + ++ ++PE++ ++ S Q + ++ + + + +PIPP EQ Sbjct: 355 SYLIRVRVLQIVPEVVVWYINSPFGRQWVSQNVSQQVGQANINGSKLKALAVPIPPQDEQ 414 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 V + KI I I I + Q++++ L P Sbjct: 415 VELARKIKQTLAVIKGQRENSIGLIGQVANLDQSILAKAFRGELVPQDP 463 >gi|91792595|ref|YP_562246.1| restriction modification system DNA specificity subunit [Shewanella denitrificans OS217] gi|91714597|gb|ABE54523.1| restriction modification system DNA specificity domain [Shewanella denitrificans OS217] Length = 633 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 87/234 (37%), Gaps = 15/234 (6%) Query: 197 EKKQALVSYIVTKG-LNPDVKMKDSG-IEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 E+ A+ + +V +G L + + G E +P+ W+ V+ + E Sbjct: 89 ERIAAVKAQLVKEGKLKKQKPLPEIGDNEKPFELPNGWKWSRLGDFVSIIRGITFPSSEK 148 Query: 255 N-------ILSLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRSL 305 + + + N+ LE ++ SY Q + G+IV + + + Sbjct: 149 HRELAPSRVACIRTTNVQDSLEWDDLLYVDRSYVKREEQYLKLGDIVMSMANSRELVGKV 208 Query: 306 RSA--QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFED 361 + ++P+ D+++L ++R+ A + ++ E Sbjct: 209 SFITHIPVGESSFGGFLSVIRPYQFDASFLMSVLRAPLTKNELIGSASQTTNIANISLEK 268 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + L + VPP++EQ I ++ + D L + E SI + + + A + Sbjct: 269 LNPLVIAVPPLEEQHRIVAKVDELMSLCDALEAQTEASIAAHQTLVETLLNALL 322 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 63/186 (33%), Gaps = 9/186 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + E +P+ E L T + + S + Q ++ + ++ Sbjct: 429 TDEEKPFELPESGEWVRLGDLCTLVTSGSRGWKTYYAESGATFIRSQDIKYDRVEFDDKA 488 Query: 280 Y--------ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDS 330 Y VD G ++ K ++ ++ E + A + + ++ Sbjct: 489 YVKLPETTEGKRTKVDVGNLLMTITGANVAKTAIVEIELDEAYVSQHVALIKLINSVMNK 548 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 WL ++ + G + L +++ L + +PP++EQ I + A D Sbjct: 549 YIHLWLTGAFGGRGLLLECSYGAKPGLNLQNINELIIPIPPLEEQHRIVAKVEELMALCD 608 Query: 391 VLVEKI 396 L ++ Sbjct: 609 KLKARL 614 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 63/198 (31%), Gaps = 10/198 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ + V + L T +T + +I +D++ ++ K Sbjct: 436 ELPESGEWVRLGDLCTLVTSGSRGWKTYYAESGATFIRSQDIKYDRVEFDDKAYVKLPET 495 Query: 75 TSTVS-IFAKGQILYGKLGPYLRKAIIADF---DGICSTQF-LVLQPKDVLPELLQGWLL 129 T G +L G + K I + + S L+ V+ + + WL Sbjct: 496 TEGKRTKVDVGNLLMTITGANVAKTAIVEIELDEAYVSQHVALIKLINSVMNKYIHLWLT 555 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + C + + I + +PIPPL EQ I K+ D L Sbjct: 556 GAFGGRGLLLECSYGAKPGLNLQNINELIIPIPPLEEQHRIVAKVEELMALCDKLKARLS 615 Query: 190 RFIELLKEKKQALVSYIV 207 A+V V Sbjct: 616 DAQTTQLHLTDAIVEQAV 633 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 64/203 (31%), Gaps = 16/203 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 +P WK + F + G T S + + I +V+ + ++ R Sbjct: 121 ELPNGWKWSRLGDFVSIIRGITFPSSEKHRELAPSRVACIRTTNVQ-DSLEWDDLLYVDR 179 Query: 72 QSDTSTVSIFAKGQILYGK------LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 G I+ +G I + V++P L Sbjct: 180 SYVKREEQYLKLGDIVMSMANSRELVGKVSFITHIPVGESSFGGFLSVIRPYQFDASFLM 239 Query: 126 GWLLSIDVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 L + + + + +++ + + + + +PPL EQ I K+ D L Sbjct: 240 SVLRAPLTKNELIGSASQTTNIANISLEKLNPLVIAVPPLEEQHRIVAKVDELMSLCDAL 299 Query: 185 ITERIRFIELLKEKKQALVSYIV 207 + I + + L++ ++ Sbjct: 300 EAQTEASIAAHQTLVETLLNALL 322 >gi|322420368|ref|YP_004199591.1| restriction modification system DNA specificity domain-containing protein [Geobacter sp. M18] gi|320126755|gb|ADW14315.1| restriction modification system DNA specificity domain protein [Geobacter sp. M18] Length = 411 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 60/402 (14%), Positives = 125/402 (31%), Gaps = 31/402 (7%) Query: 24 HWKVVPIKRFTKL----NTGRTSESGKDIIY--IGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ +PI ++ + + + +V +G + ++ Sbjct: 8 DWQRLPIVSLCEVHVDCVNRTAPIVSEPTPFKMLRTTNVRNGYVDAENVRYVTEETYKKW 67 Query: 78 VSIF--AKGQILYGKLGPYLRKAIIADFDGIC-STQFLVLQP--KDVLPELLQGWLLSID 132 +G IL + P I D + + +P K + + L LL D Sbjct: 68 TRRLIPKRGDILLTREAPLGDVGKIRTDDAVFLGQRLYHFRPDPKKLDADFLLYSLLGDD 127 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +I+ G+T+ H + I N+ +P L Q I + A I+ Sbjct: 128 LQSQIKGFGSGSTVEHMRLEDIPNLEFNVPALPIQQRIASILSAYDELIENSQRRIKILE 187 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK-L 251 + L P + + +G +P WEVK + E+ R K Sbjct: 188 ----SMARTLYREWFVHFRFPGHENQPRVASPLGEIPQGWEVKKLGEVAEEMRRNVPKGQ 243 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESY-ETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I+ + +I ++ GE++F I K S+ Sbjct: 244 IDEPTPYVGLEHIPRRSLALAAWETTIELGSNKLEFKKGEVLFGKIRPYFHKVSVAPF-- 301 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 + + Y+ + S A +G ++ +K+ P+++ Sbjct: 302 -DGLCSADTIVIRARRQEHYAYVVMCVSSDAFVAEASATANGAKMPRANWDVLKKHPIVI 360 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQ---SIVLLKERRS 408 P V + ++ I ++ + + I +L+ R Sbjct: 361 PN-------GEVADKFSSLIKDVIVQEQALVFQIQILRRTRD 395 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 40/182 (21%), Positives = 71/182 (39%), Gaps = 12/182 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G IP+ W+V + + + + Y+GLE + + + Sbjct: 216 LGEIPQGWEVKKLGEVAEEMRRNVPKGQIDEPTPYVGLEHIPRRSLALAAWETTIELG-- 273 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VT 134 S F KG++L+GK+ PY K +A FDG+CS +V++ + +S D Sbjct: 274 SNKLEFKKGEVLFGKIRPYFHKVSVAPFDGLCSADTIVIRARRQEHYAYVVMCVSSDAFV 333 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 A GA M A+W + P+ IP E + I +I + + Sbjct: 334 AEASATANGAKMPRANWDVLKKHPIVIP-------NGEVADKFSSLIKDVIVQEQALVFQ 386 Query: 195 LK 196 ++ Sbjct: 387 IQ 388 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 62/181 (34%), Gaps = 6/181 (3%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + + T ++ G + + + + I G+ Sbjct: 18 CEVHVDCVNRTAPIVSEPTPFKMLRTTNVRNGYVDAENVRYVTEETYKKWTRRLIPKRGD 77 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I+ D +R+ + G + P +D+ +L + + DL G Sbjct: 78 ILLTREAPLGDVGKIRTDDAVFLG-QRLYHFRPDPKKLDADFLLYSLLGDDLQSQIKGFG 136 Query: 351 SG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 SG + ++ ED+ L VP + Q I ++++ D L+E ++ I +L+ + Sbjct: 137 SGSTVEHMRLEDIPNLEFNVPALPIQQRIASILSA----YDELIENSQRRIKILESMART 192 Query: 410 F 410 Sbjct: 193 L 193 >gi|254372674|ref|ZP_04988163.1| predicted protein [Francisella tularensis subsp. novicida GA99-3549] gi|151570401|gb|EDN36055.1| predicted protein [Francisella novicida GA99-3549] Length = 374 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 45/366 (12%), Positives = 108/366 (29%), Gaps = 35/366 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W PI + K+ +G+ D ++ + DV P G + ++ Sbjct: 21 EWVEKPISKALKIGSGK------DYKHLNIGDV--------PVYGTGGYMLSVDKYLYDG 66 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + + T F + +P+ + + E Sbjct: 67 ESVCIGRKGTIDKPIFLNGKFWTVDTLFYTHSFNNSIPKFIYSIFQ----KINWKLYNEA 122 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I I + +P L EQ I + + I+T + K Q + Sbjct: 123 SGVPSLSKSTIEKIKINLPTLPEQQKIADCLSTWDEVIETQKSLIEAKKLYKKGMMQKIF 182 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + + + + +G V + + + + R+ + + + + Sbjct: 183 SQELRFKADDGSDFPEWVEKKLGEVSE--CLDNLRKPLNDSERQKMQGNIPYWGANNIMD 240 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I + + + + + + + + Sbjct: 241 YINDYIFDETIVLLAED--------------GGNFSEYRTRPIANLSKGKCWVNNHTHVL 286 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + S +L S + +G G R L ++ ++ + +P + EQ I N ++ Sbjct: 287 REKKNISKN-EFLFYSLVHKNITGYVGGGTRSKLTKSEMLKIGLKLPCLPEQTKIANFLS 345 Query: 384 VETARI 389 I Sbjct: 346 ALDDEI 351 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 16/170 (9%), Positives = 51/170 (30%), Gaps = 9/170 (5%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL--R 306 + + +L G+ + Y + + K ++ Sbjct: 21 EWVEKPISKALKIGSGKDYKHLNIGDVPVYGTGGYMLSVDKYLYDGESVCIGRKGTIDKP 80 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + + + + + ++ + + + A G SL ++++ Sbjct: 81 IFLNGKFWTVDTLFYTHSFNNSIPKFIYSIFQKINWKLYNEASG---VPSLSKSTIEKIK 137 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + +P + EQ I + ++ D ++E + I K + + + Sbjct: 138 INLPTLPEQQKIADCLSTW----DEVIETQKSLIEAKKLYKKGMMQKIFS 183 >gi|168485628|ref|ZP_02710136.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae CDC1087-00] gi|183571190|gb|EDT91718.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae CDC1087-00] Length = 522 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 68/441 (15%), Positives = 133/441 (30%), Gaps = 71/441 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKG--------------LNPDVKMKDSGIEW-------- 224 +L KE ++++ Y + L K E Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 225 ------------------------VGLVPDHWEVKPFFALVTELNRKNTK-----LIESN 255 + +P+ W F +LV K + Sbjct: 323 DISIVSQGDDNSYYGNKDETTSYPIYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTE 382 Query: 256 ILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I +S ++ N + + I G ++ F L Sbjct: 383 IPWVSISDMPISGYVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATH 442 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 II+ + I YL + G ++L + L + + Sbjct: 443 NEAIIS-IFPYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISN 499 Query: 372 IKEQFDITNVINVETARIDVL 392 +E I + +++ ++ L Sbjct: 500 HEEMKRIISKVDLLFQKVSQL 520 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 347 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 406 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 407 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 464 >gi|227356294|ref|ZP_03840682.1| type I restriction modification system methylase [Proteus mirabilis ATCC 29906] gi|227163404|gb|EEI48325.1| type I restriction modification system methylase [Proteus mirabilis ATCC 29906] Length = 469 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 51/468 (10%), Positives = 128/468 (27%), Gaps = 74/468 (15%) Query: 26 KVVPIKRFT-KLNTGRTSESGKD-----IIYIGLEDVESGTGKYLPKDGNSRQS-DTSTV 78 + + ++ G + I ++ +++E K+ S + Sbjct: 6 ETRLLGELCHEITVGFVGTMTNEYIENGIPFLRSKNIEEYDVKWDDMKYVSSAFHKKLSK 65 Query: 79 SIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 S+ G + + G +I + + CS + ++L + ++ + Sbjct: 66 SVLKPGDVAIVRTGKPGTTCVIPNDLREANCSDIVIARVNNELLCPHYLSYFMNAMAHGQ 125 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + A GA H + + +P+P +Q I + + ++ ++ + Sbjct: 126 VNAHIVGAVQQHFNVSSAKKLEIPLPSRVKQTKIVQVLKTLDDKLKLNRQINQTLEQMAQ 185 Query: 197 EKKQALV-------SYIVTKGL-------------------------------------- 211 ++ + G Sbjct: 186 TLFKSWFVDFDPVVDNALDAGFFEQDLAFSDELLRRVEVRKAVRESDNFKPLSEDIRRLF 245 Query: 212 -NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 N + + + G +P W K + + N Sbjct: 246 PNAFEECAEPALGLGGWMPKGWMSKSISDAIFINPKVNLAKDTVAKFVDMKALSTSGYSI 305 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITS--AYMAVKPH 326 + KP + +I+ I N K + S + Sbjct: 306 EEVSEKP--FSGGMKFQNNDILLARITPCLENGKTGIVDFLSENEAGFGSTEFIILRGNK 363 Query: 327 GIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 I +Y+A L R + + +GS RQ ++ + VP + ++++ Sbjct: 364 NIHYSYIACLARYESFRQHVIQSMVGSSGRQRVQNGCFNDYKIAVPSGEVMNRFADIVSP 423 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG-------QIDLRGES 425 ++ + L + R + + ++G +ID+ E+ Sbjct: 424 SFKKL----TQNTNESRSLTKLRDTLLPKLISGELSLSDIKIDIPEET 467 >gi|104774035|ref|YP_619015.1| Type I restriction-modification system, specificity subunit [Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842] gi|103423116|emb|CAI97855.1| Type I restriction-modification system, specificity subunit [Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842] Length = 411 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 55/411 (13%), Positives = 115/411 (27%), Gaps = 43/411 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTS- 76 W+ + T + YI D+ + G S T+ Sbjct: 18 DWEQCKLGDVFSFLKNSTLSRSELNYESGKFKYIHYGDILTKFGDITDTRNFSVPFVTTP 77 Query: 77 ------TVSIFAKGQILY------GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 G ++ +G F + + ++P Sbjct: 78 EKVIRLEKYFLQNGDVVIADTAEDSMVGKVTEIQNPDPFPTVSGLHTIPIRPNKEFAAGF 137 Query: 125 Q-GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ + ++ + +G + + + P EQ LI I ID Sbjct: 138 LGHYMNAPFYHDQLFKLMQGVKVLSLSKSAVIQTKINSPSYCEQRLISRMIN----LIDG 193 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 IT L+ K AL+ + SG V + + Sbjct: 194 TITLHEEKKRQLERLKSALLQKMFAD---------KSGYPPVRFEGFSDKWEQVKYGEIF 244 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDK 302 R + + S+ Y +I + T N K + I +PG+++F + Sbjct: 245 QRRSKMGVSTPTLPSVEYDDINPGMGTLNKEPKSKGISKRGIYFNPGDVLFGKLRPYLKN 304 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + G+ + + ID + L+++ + + V Sbjct: 305 WLFACFE----GVAVGDFWVLTSSKIDHGFTYSLIQTPGFQYIANLSSGSKMPRSDWGLV 360 Query: 363 KRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P EQ I++V+ +D + E + +L + +S + Sbjct: 361 SNARTFIPINHLEQERISSVLFG----LDHAITLYEHKLEILNKIKSFLLQ 407 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 62/212 (29%), Gaps = 15/212 (7%) Query: 213 PDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 P ++ K +W +G V + K I + +G+I Sbjct: 8 PKLRFKGFTDDWEQCKLGDVFSFLKNSTLSRSELNYESGKFKYIHYGDILTKFGDITDTR 67 Query: 269 ETRNMGLKPESY---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM---A 322 + + G++V + + Q + S Sbjct: 68 NFSVPFVTTPEKVIRLEKYFLQNGDVVIADTAEDSMVGKVTEIQNPDPFPTVSGLHTIPI 127 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNV 381 + +L M + + + G++ SL V + + P EQ I+ + Sbjct: 128 RPNKEFAAGFLGHYMNAPFYHDQLFKLMQGVKVLSLSKSAVIQTKINSPSYCEQRLISRM 187 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 IN ID + E+ L+ +S+ + Sbjct: 188 IN----LIDGTITLHEEKKRQLERLKSALLQK 215 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 50/179 (27%), Gaps = 3/179 (1%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ V + + S + + +D+ G G + + S F G Sbjct: 235 WEQVKYGEIFQ-RRSKMGVSTPTLPSVEYDDINPGMGTLNKEPKSKGISKRGIY--FNPG 291 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +L+GKL PYL+ + A F+G+ F VL + + + Sbjct: 292 DVLFGKLRPYLKNWLFACFEGVAVGDFWVLTSSKIDHGFTYSLIQTPGFQYIANLSSGSK 351 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 EQ I + I + ++ Q + Sbjct: 352 MPRSDWGLVSNARTFIPINHLEQERISSVLFGLDHAITLYEHKLEILNKIKSFLLQNMF 410 >gi|291540207|emb|CBL13318.1| Restriction endonuclease S subunits [Roseburia intestinalis XB6B4] Length = 414 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 55/402 (13%), Positives = 123/402 (30%), Gaps = 22/402 (5%) Query: 29 PIKRFTKLNTGRTSESG------KDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIF 81 IK ++ TG+T +G +I++I D+ + K + + Sbjct: 18 KIKDIGRVVTGKTPLTGVNEYYGGNIMFISPSDLHGDYLIEKSEKTITEEGLKSIESNSI 77 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L G +G + + + + Q ++ L + + + +I Sbjct: 78 DGISVLTGCIGWDMGNVAMCNSRCATNQQINAIIDFNHKLVDPRYVYYWLKGKKDYLFSI 137 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 NI +P+P L Q + + + ID I + + + L+E + Sbjct: 138 ASVTRTPILSKSVFENIDIPLPSLKIQERVTKLL----SLIDEKIRKNHQINDYLEEMAK 193 Query: 201 ALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + Y + PD K SG + + + + + + N + L Sbjct: 194 TIYDYWFVQFDFPDENGNPYKSSGGKMIFCKELNRNIPQNWEYTSVGNITKCLDSDRIPL 253 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ-VMERGII 316 S ++ Y I ++ D Q + I Sbjct: 254 SSHQREEMKGTIPYYGATGIMDYVNRPIFSGDFVLLAEDGSVMDDNGNPILQRISGDVWI 313 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + ++P S L +L+ + ++ + ++ +L P + + Sbjct: 314 NNHTHVLQPVNGYSCRLLYLLLKNIPVSMIK--TGSIQLKINQANLNSYNILNIPKEIRT 371 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 N I +I L ++ +L + R + + GQ Sbjct: 372 QFINQIEPMDTKIIQL----QKENNILVQTRDWLLPILMNGQ 409 >gi|260664491|ref|ZP_05865343.1| type IC HsdS subunit [Lactobacillus jensenii SJ-7A-US] gi|260561556|gb|EEX27528.1| type IC HsdS subunit [Lactobacillus jensenii SJ-7A-US] Length = 406 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 54/394 (13%), Positives = 119/394 (30%), Gaps = 39/394 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG---KYLPKDGNSRQSDTSTVS 79 + W+ +K + +G + + D + + + + + + Sbjct: 12 ESWRTEKLKNIGESFSGLSGKKSSDFGHGEAKYITYLNILNNPIIDTKLTDKIEIDNKQH 71 Query: 80 IFAKGQILYGKLGPYLRKAII---------ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + KG I + ++ + + S + + + S Sbjct: 72 LVKKGDIFFTISSETPQEVGLSSVLDTNLNECYLNSFSFGYRLKEISMFDNLFNSYNFRS 131 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +++ + +G + + K + N + P ++EQ I + I + + Sbjct: 132 PNFRRKMYILAQGISRYNISKKAVLNETICFPKISEQKQIGKLIKLMNSLLSLQHRKMEL 191 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + K L + K E + L+ KN Sbjct: 192 ENQTSKAIYNYLFDKNKPFYFKDNKTKKVFLKE-------------LGTTYSGLSGKNKT 238 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKP--ESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + K N L E + V G+I+F ++ L S Sbjct: 239 DFGHGKAKYITYLNVNKNTIANHNLLDLIEIDKKQNEVLNGDILFTISSETPEEVGLASL 298 Query: 309 QVME--RGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKR 364 + + S +P+ I++ +LA+ +RS + K Y + G+ R +L + V Sbjct: 299 WPYDDTNIYLNSFCFGFRPNSKINNLWLAYELRSLKIRKNMYKLAQGISRYNLSKKSVLN 358 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 L V VP EQ ++ L+ + Sbjct: 359 LQVDVPSDAEQN--------FDSKFVKLINIQTK 384 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 59/172 (34%), Gaps = 11/172 (6%) Query: 246 RKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 +K++ ++Y NI+ + + K E +V G+I F + Sbjct: 32 KKSSDFGHGEAKYITYLNILNNPIIDTKLTDKIEIDNKQHLVKKGDIFFTISSETPQEVG 91 Query: 305 LRSAQVMERG-----IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLK 358 L S + Y + D+ + ++ RS + + Y + G+ R ++ Sbjct: 92 LSSVLDTNLNECYLNSFSFGYRLKEISMFDNLFNSYNFRSPNFRRKMYILAQGISRYNIS 151 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + V + P I EQ I ++ L+ + + L + + Sbjct: 152 KKAVLNETICFPKISEQKQIG----KLIKLMNSLLSLQHRKMELENQTSKAI 199 >gi|294794795|ref|ZP_06759930.1| HsdS, type I site-specific deoxyribonuclease [Veillonella sp. 3_1_44] gi|294454157|gb|EFG22531.1| HsdS, type I site-specific deoxyribonuclease [Veillonella sp. 3_1_44] Length = 406 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 54/409 (13%), Positives = 121/409 (29%), Gaps = 34/409 (8%) Query: 26 KVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 K P+ G + + I + ++ +G D Sbjct: 18 KRYPLYDLALWKNGLAFKKIHFSDTGVPVIKIAELNNGISGNTSYTKQIFSDDVH----L 73 Query: 82 AKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 K +L+ G I F G + + P + + + + L + Sbjct: 74 KKEDLLFSWSGNPQTSIDIFKFQLQEGWLNQHIFKVTPNEEIVDRDYFYFLMKYLKPWFT 133 Query: 139 ---AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + H I + + +P L Q I + + ID I L Sbjct: 134 QIASNKQTTGLGHVTIADIKRMSVLVPSLTMQKKIVDVLKP----IDDKIQINTSINNNL 189 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 +++ +AL + + +NP K+ + +G V K Sbjct: 190 EQQAEALFHSLFVEDVNPIW--KEGVLSDLGTVVAGGTPSK---------TKPEYYSRKG 238 Query: 256 ILSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I ++ + + K + + G S + ++ + + A Sbjct: 239 IAWITPKDLSLNKSKFISHGEIDISELGFSKSSAIKMPTGTVLFSSRAPIGYIAIAANEV 298 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + +V P+ T + + + L + + + +K +PV++P + Sbjct: 299 TTNQGFKSVVPNENVGTAFMYYLLRFLLPTIEGMASGSTFKEISGAGMKSVPVVIPDNET 358 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + N I E +E L + R + + + G +D+ G Sbjct: 359 ----IDKFNAFCTPIFQQQEVLEAENSRLVDIRDALLPKLMAGDLDVSG 403 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 53/166 (31%), Gaps = 12/166 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYL---PKDGNSRQSD 74 WK + + G T K I +I +D+ K++ D + Sbjct: 209 WKEGVLSDLGTVVAGGTPSKTKPEYYSRKGIAWITPKDLSLNKSKFISHGEIDISELGFS 268 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ G +L+ P AI A+ + F + P + + + L + Sbjct: 269 KSSAIKMPTGTVLFSSRAPIGYIAIAANEV-TTNQGFKSVVPNENV-GTAFMYYLLRFLL 326 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 IE + G+T G+ ++P+ IP + Sbjct: 327 PTIEGMASGSTFKEISGAGMKSVPVVIPDNETIDKFNAFCTPIFQQ 372 >gi|146294001|ref|YP_001184425.1| restriction modification system DNA specificity subunit [Shewanella putrefaciens CN-32] gi|145565691|gb|ABP76626.1| restriction modification system DNA specificity domain [Shewanella putrefaciens CN-32] Length = 440 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 53/406 (13%), Positives = 125/406 (30%), Gaps = 31/406 (7%) Query: 47 DIIYIGLEDVESGTGKYLPKDGNSRQSDT---STVSIFAKGQILYGKLGPYLRKAIIADF 103 YI + ++ G + + ++D + + + ++ + + A + Sbjct: 33 GFPYIAIPQLKDGHVRVDGTERRISETDFMQWTKKLLPQENDVIVVRRCNSGQSAYVPKG 92 Query: 104 -DGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICE-GATMSHADWKGIGNIPM 159 +VL+ K V P L+ + S + +++ GA + I + Sbjct: 93 VKWAIGQNLVVLRSDGKKVYPPFLRWLVRSDEWWEQVRKYLNVGAVFDSLKCREIPLFEL 152 Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 PIPP+ Q+ I + + RI+ L + + ++ N + Sbjct: 153 PIPPMVAQIEIATVLNSIDARIELLRETNTTLEAIAQALFKSWFVDFDPVHANAGTQAPS 212 Query: 220 SGIEWV------------GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 E G +P+ W + V + + + Sbjct: 213 LPPEIQALFPATFIDSPQGPIPEGWALGTIADAVATVGGATPDTKNGEFWNPAEVAWTSP 272 Query: 268 LETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + + S + + G + + + + A I Y Sbjct: 273 KDLSGLNTPVLLDTERKVSEKGLAKISSGLLPAGTLLMSSRAPIGYLAIAQLPLAINQGY 332 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 +A+ P G ++ + + + + + +++PP++ N Sbjct: 333 IAIPPGGRLPPLYMLFWCRQNMEIIKNRANGSTFMEISKKAFRPIELVLPPVEVIEAFVN 392 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 V D LVE E+ L R + + ++GQ+ L E++ Sbjct: 393 VAQPLF---DRLVEN-EKQAQTLATLRDTLLPRLISGQLRL-PEAK 433 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 53/196 (27%), Gaps = 12/196 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSR 71 G IP+ W + I G T ++ ++ + +D+ L Sbjct: 231 GPIPEGWALGTIADAVATVGGATPDTKNGEFWNPAEVAWTSPKDLSGLNTPVLLDTERKV 290 Query: 72 QSDTSTVS---IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + G +L P IA + ++ + P LP L Sbjct: 291 SEKGLAKISSGLLPAGTLLMSSRAPI-GYLAIAQLPLAINQGYIAIPPGGRLP-PLYMLF 348 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + I+ G+T K I + +PP+ R+ + Sbjct: 349 WCRQNMEIIKNRANGSTFMEISKKAFRPIELVLPPVEVIEAFVNVAQPLFDRLVENEKQA 408 Query: 189 IRFIELLKEKKQALVS 204 L L+S Sbjct: 409 QTLATLRDTLLPRLIS 424 >gi|302668598|ref|YP_003833046.1| type I restriction modification system S subunit HsdS1 [Butyrivibrio proteoclasticus B316] gi|302397562|gb|ADL36464.1| type I restriction modification system S subunit HsdS1 [Butyrivibrio proteoclasticus B316] Length = 388 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 60/399 (15%), Positives = 126/399 (31%), Gaps = 43/399 (10%) Query: 25 WKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS- 79 W+ FT+L + + + + + + DV S + + +S Sbjct: 16 WEQRKFGNFTELKSASRVHKDEWTSEGVPFYRSSDVMSAINGTQNEKAFISEELYEKLSS 75 Query: 80 ---IFAKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVLPELLQGWLLSIDV 133 KG +L G I+ D + + + + + S Sbjct: 76 VSGKLEKGDVLVTGGGSVGNPYIVPDNKPLYTKDADLLWIKNQGRFDAYFIYEFFFSPTF 135 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + +E+I T++H + P+ +P L EQ + + ID LIT R + Sbjct: 136 RKYLESISHVGTIAHYTITQLTETPVSLPSLEEQKKVGDY----FRSIDNLITLHQRKCD 191 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 KE K+ ++ + K +++ +G +V + + L E Sbjct: 192 ETKELKKYMLQKMFPKNGETKPEIRFAGFTGDWEQRKFSDVVEIGSGM-----DYKHLGE 246 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +I G + ++ + + + + DK + A Sbjct: 247 GDIPVYGTGGYMLSVD-------------AALSEEKDAIGIGRKGTIDKPYILRA---PF 290 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + + + G D + + L ++ D K S SL + + P + Sbjct: 291 WTVDTLFYCIPKDGYDLDFTSCLFQNIDWKK---KDESTGVPSLSKVIINNVETAAPSYE 347 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ I + ID L+ ++ KE + + Sbjct: 348 EQRKIGDY----FKGIDNLITLHQRKCDETKELKKYMLQ 382 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 55/204 (26%), Gaps = 10/204 (4%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 ++ SG + + T S + I + Sbjct: 5 PNIRFSGYTDAWEQRKFGNFTELKSASRVHKDEWTSEGVPFYRSSDVMSAINGTQNEKAF 64 Query: 275 LKPESYET----YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + E YE ++ G+++ + + + + D+ Sbjct: 65 ISEELYEKLSSVSGKLEKGDVLVTGGGSVGNPYIVPDNKPLYTKDA-DLLWIKNQGRFDA 123 Query: 331 TYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 ++ S K ++ G + PV +P ++EQ + + I Sbjct: 124 YFIYEFFFSPTFRKYLESISHVGTIAHYTITQLTETPVSLPSLEEQKKVGDY----FRSI 179 Query: 390 DVLVEKIEQSIVLLKERRSSFIAA 413 D L+ ++ KE + + Sbjct: 180 DNLITLHQRKCDETKELKKYMLQK 203 >gi|24115563|ref|NP_710073.1| hypothetical protein SF4364 [Shigella flexneri 2a str. 301] gi|110808124|ref|YP_691644.1| hypothetical protein SFV_4365 [Shigella flexneri 5 str. 8401] gi|24054894|gb|AAN45780.1| orf, conserved hypothetical protein [Shigella flexneri 2a str. 301] gi|110617672|gb|ABF06339.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401] gi|281603673|gb|ADA76657.1| hypothetical protein SFxv_4757 [Shigella flexneri 2002017] Length = 553 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 47/190 (24%), Positives = 78/190 (41%), Gaps = 2/190 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLED-VESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ I + + S +Y D +E GTG+ + K Sbjct: 365 ELPEGWEWCRIGNIVNIKSELVSPKDYLNLYQVAPDIIEKGTGRVISKRTVKESGVKGPN 424 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 S F KGQI+Y K+ P L K +A+++G+CS L + P L ++LSI +++ Sbjct: 425 SRFYKGQIVYSKIRPSLSKVFLAEYNGLCSADMYPLDC-YINPNYLLKYILSIPFLMQVK 483 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 M + NI + IPP EQ I +KI + + LI+ + + Sbjct: 484 KAENRIKMPKLNSDSFYNIIVAIPPYNEQQAIFDKINSIEAVCNGLISYIGIYHKTQLHL 543 Query: 199 KQALVSYIVT 208 AL + Sbjct: 544 ADALTDAAIN 553 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 56/455 (12%), Positives = 118/455 (25%), Gaps = 67/455 (14%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESG 59 +K K P+ S + +P+ W+ V + ++ +I+ G V Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVHLPDIYCSISESSRKIKSSEILPEGKYPVIEQ 140 Query: 60 TGKYLPKDGNSR------------QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC 107 + +++ N+ D + F + G G + I+ Sbjct: 141 SQEFISGYCNNECLLIKLNNPVIVFGDHTRNIKFIDFDFVVGADGVKILSPILICERFFF 200 Query: 108 ST-------------QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGI 154 F VL + ++ + ++C+ Sbjct: 201 WQLRSFKLDVRGYARHFKVLNSCLFALPPIAEQERIVEKVSSLMSLCDQLEQQSLTSLDA 260 Query: 155 GN-IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 + + ++ RI + KQ ++ V L P Sbjct: 261 HQQLVETLLGTLTDSQNTAELAENWARISEHFDTLFTTEASVDALKQTILQLAVMGKLVP 320 Query: 214 DVKMKD-------------------------------SGIEWVGLVPDHWEVKPFFALVT 242 + S E +P+ WE +V Sbjct: 321 QDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLPPISDEEKPFELPEGWEWCRIGNIVN 380 Query: 243 EL---NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 L + ++ ++ + G+IV+ I Sbjct: 381 IKSELVSPKDYLNLYQVAPDIIEKGTGRVISKRTVKESGVKGPNSRFYKGQIVYSKIRPS 440 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 K L G+ ++ + + + L +++ L +V A L Sbjct: 441 LSKVFLAEY----NGLCSADMYPLDCYINPNYLLKYILSIPFLMQVKKAENRIKMPKLNS 496 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + V +PP EQ I + IN A + L+ Sbjct: 497 DSFYNIIVAIPPYNEQQAIFDKINSIEAVCNGLIS 531 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 65/192 (33%), Gaps = 15/192 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE + ++ + K+ S IL +I++ + G Sbjct: 93 SEEEKPFELPEGWEWVHLPDIYCSISESSRKIKSSEILPEGKYPVIEQSQEFISGYCNNE 152 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++ V F D + + + + P I + W +RS Sbjct: 153 ---CLLIKLNNPVIVFGDHTRN----IKFIDFDFVVGADGVKILSPILICERFFFWQLRS 205 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + L YA F+ + +PPI EQ I ++ + D L ++ S Sbjct: 206 FKLDVRGYAR--------HFKVLNSCLFALPPIAEQERIVEKVSSLMSLCDQLEQQSLTS 257 Query: 400 IVLLKERRSSFI 411 + ++ + + Sbjct: 258 LDAHQQLVETLL 269 >gi|120556289|ref|YP_960640.1| restriction modification system DNA specificity subunit [Marinobacter aquaeolei VT8] gi|120326138|gb|ABM20453.1| restriction modification system DNA specificity domain [Marinobacter aquaeolei VT8] Length = 485 Score = 87.2 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 58/472 (12%), Positives = 132/472 (27%), Gaps = 77/472 (16%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W V + G+ + K+ Y+G +V G + Sbjct: 3 SDWPRVRLGDHIDSCLGKMLDKKKNKGIQQPYLGNSNVRWGEFDLSDLAQMKFEESEHER 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G ++ + G R AI D ++ K L + Sbjct: 63 YGITYGDLIVCEGGEPGRCAIWKAELPDMKIQKALHRIRTKSSLNNRYLYYWFYHAGKHG 122 Query: 137 -IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +E G T+ H + + N+ +P+PPL+ Q + E + + +I ++ Sbjct: 123 LLEPYFTGTTIKHLTGRALNNLEIPLPPLSHQEFMAEVLGSLDDKIQLNHQTNQTLEQMA 182 Query: 196 KEKKQALVSYI--------------------------------------------VTKGL 211 + ++ L Sbjct: 183 QAIFKSWFVDFEPVKAKIAALAAGGSEEDALLAAMQAISGKGEAELSRLQTEQPEQYAEL 242 Query: 212 NPDVKMKDSGIE--WVGLVPDHWEVKPFFALVTELNRKNTKL-----IESNILSLSYGNI 264 ++ S ++ +G +P+ WE + L + I + S+ NI Sbjct: 243 RATAELFPSAMQDSELGEIPEGWEASQLGGYLDTLETGSRPKGGVSGITEGVPSVGAENI 302 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEI----------VFRFIDLQNDKRSLRSAQVMERG 314 + K S + + + G + + D + + Sbjct: 303 VGVGNYHYGKEKFVSVDFFNKLKRGIVEHLDCLLYKDGGKPGDFKPRVSMFGCGFPYNKL 362 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIK 373 I ++ + YL +L+ + + DVK + + PP Sbjct: 363 AINEHVFRLRSQRLGQPYLYFLIGHERVLADLRHKGAKAAIPGINQTDVKTVWTVCPP-- 420 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLK--ERRSSFIAAAVTGQIDLRG 423 ++ ++ N + L + +S L+ + R + + ++G++ + Sbjct: 421 --REVLDIFNTIAEK--SLTSILTRSKESLRLSKLRDTLLPKLLSGELSVSD 468 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 27/76 (35%), Gaps = 10/76 (13%) Query: 9 QYKDSGVQWIGAIPKHWKVVPI-KRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGK 62 +DS +G IP+ W+ + L TG + G + + +G E++ G G Sbjct: 252 AMQDSE---LGEIPEGWEASQLGGYLDTLETGSRPKGGVSGITEGVPSVGAENI-VGVGN 307 Query: 63 YLPKDGNSRQSDTSTV 78 Y D Sbjct: 308 YHYGKEKFVSVDFFNK 323 >gi|187930240|ref|YP_001900727.1| restriction modification system [Ralstonia pickettii 12J] gi|187727130|gb|ACD28295.1| restriction modification system, type I [Ralstonia pickettii 12J] Length = 394 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 64/399 (16%), Positives = 139/399 (34%), Gaps = 38/399 (9%) Query: 27 VVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 +V +L+ R+ + D Y+GLE +E G + + ++ +F G Sbjct: 12 LVKFGDVVRLSKARSQDPLADGIERYVGLEHLEPGDLRIRSWGSVADGVTFTS--VFQPG 69 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEAIC 141 Q+L+GK Y RK +ADF G+CS VL+ KD LPELL + Sbjct: 70 QVLFGKRRAYQRKVAVADFSGVCSGDIYVLETKDAQVLLPELLLFICQTDAFFDHAVGTS 129 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ +W + + +PP+ EQ + A T + + + +L+ K + Sbjct: 130 AGSLSPRTNWASLADFEFVLPPIEEQQSAIVLLSAATDQCHAIEAAHLAAGRMLQSFKDS 189 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 ++ Y + NP + + + P+ P +L L+ Sbjct: 190 MLLYNTSAVANPYLL-----SDVLLRSPESGCSAP----------PKDADTGYFVLGLAA 234 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + ++P S + G+++ + + + + M Sbjct: 235 LSRDGYVSGDFKPVEPTSKMVAAKLSKGDMLISRSNTVDRVGFVGIFSDNRDDVSFPDTM 294 Query: 322 ---AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQ 375 P + +L L+++ + + +G + + ++ ++ + VP + Q Sbjct: 295 MRLRPNPALVHPDFLEALLQTTSAREYLMRIAAGTSASMKKINRANLLQMRLNVPDLDAQ 354 Query: 376 FDITNVINVETARIDVLVEKIEQ------SIVLLKERRS 408 + ++ + + + L R+ Sbjct: 355 E---SALDAL-QEFKNAIATQKARWDAALQLTKLIAMRT 389 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 51/152 (33%), Gaps = 10/152 (6%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 L R+ G + + PG+++F K ++ G+ + ++ Sbjct: 45 PGDLRIRSWGSVADGVTFTSVFQPGQVLFGKRRAYQRKVAVADFS----GVCSGDIYVLE 100 Query: 325 PHGI---DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQF-DIT 379 L ++ ++ +G + + ++PPI+EQ I Sbjct: 101 TKDAQVLLPELLLFICQTDAFFDHAVGTSAGSLSPRTNWASLADFEFVLPPIEEQQSAIV 160 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 +++ T + + + +L+ + S + Sbjct: 161 -LLSAATDQCHAIEAAHLAAGRMLQSFKDSML 191 >gi|330907934|gb|EGH36453.1| type 1 restriction-modification system, specificity subunit S [Escherichia coli AA86] Length = 372 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 61/398 (15%), Positives = 123/398 (30%), Gaps = 44/398 (11%) Query: 26 KVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++V + + + +G + I + D+ SG K + + Sbjct: 5 QLVTLGKHIDILSGCAFPSSGFNRNNGVPLIRIRDILSG------KTETYYEGSYDLKYL 58 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG +L G G + R+ D + + + + P + + +I A Sbjct: 59 IKKGDLLVGMDGDFNRE-YWKGTDALLNQRVCKITPNPETLDKNFLYHFLQKELDKIHAT 117 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + T+ H K I +I + +P L EQ I + I + I+ + Sbjct: 118 TDVVTVKHLSVKKIQDIKIRLPSLKEQKRIAAILDKADA-IHQKREQAIKLADDFLRATF 176 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A + NP K + +G + + K+ + E + Sbjct: 177 ATM------YGNPITNPKKWPVHLMGEIIEFK--------GGNQPPKSDFIFEPKQGYIR 222 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 I + P+ I + +++ + G A Sbjct: 223 LVQIRDFKSDKYATYIPQEKAKR-IFEVDDVMIARYGPP-----VFQILRGLSGSYNVAL 276 Query: 321 MAVKPHGIDSTYL-AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFD 377 M P +L++ + V + + + E + + V +PPI Q + Sbjct: 277 MKASPKENIRKGFIFYLLQLPEYHDVVVKNSERTAGQTGVNLELLNKFNVPLPPIYYQDE 336 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLK----ERRSSFI 411 I + ARI+ EKIE S+ L+ + + Sbjct: 337 ILARL----ARIEKFKEKIEISLNHLEMQFLSLQKRLM 370 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 56/185 (30%), Gaps = 4/185 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 PK W V + + G I + + + I Sbjct: 187 PKKWPVHLMGEIIEFKGGNQPPKSDFIFEPKQGYIRLVQIRDFKSDKYATYIPQEKAKRI 246 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 F ++ + GP + + + G + + PK+ + + +LL + ++ Sbjct: 247 FEVDDVMIARYGPPVFQI-LRGLSGSYNVALMKASPKENIRKGFIFYLLQLPEYHDVVVK 305 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + + + + +P+PP+ Q I ++ + + Sbjct: 306 NSERTAGQTGVNLELLNKFNVPLPPIYYQDEILARLARIEKFKEKIEISLNHLEMQFLSL 365 Query: 199 KQALV 203 ++ L+ Sbjct: 366 QKRLM 370 >gi|237654255|ref|YP_002890569.1| restriction modification system DNA specificity domain protein [Thauera sp. MZ1T] gi|237625502|gb|ACR02192.1| restriction modification system DNA specificity domain protein [Thauera sp. MZ1T] Length = 532 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 62/416 (14%), Positives = 131/416 (31%), Gaps = 30/416 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W V + + + + L V S K+ + ST Sbjct: 10 DLPAGWDVASFGELNSFSGSTVNPATRPDEVFELYSVPSFPTKHPEQLPGRAIG--STKQ 67 Query: 80 IFAKGQILYGKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G +L K+ P + + D + I S++++ + ++P + + Sbjct: 68 TVRPGDVLVCKINPRINRVWTVGTRRDHEQIASSEWIGFRSDAMVPRFAKHYFSEPSFRS 127 Query: 136 --RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 E G +++ A + P+ + PLAEQ I +++ A RI Sbjct: 128 LLCSEVSGVGGSLTRAQPSRVAKYPVLVAPLAEQARIADQLEALLARIQACQDRLEAIPA 187 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR------K 247 LLK ++ ++S ++ L + G+ D W + + Sbjct: 188 LLKRFRKLVLSSALSGDLTEVWRA------EQGVGLDTWSARTIADVAEVGTGSTPLRSN 241 Query: 248 NTKLIESNILSLS---YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + E+ ++ + + ++ PG ++ + Sbjct: 242 SNFYAETGTPWVTSAATSRPYIDSADQYVTKAAIDAHRLRVYRPGTLIIAMYGEGKTRGQ 301 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + ++ A + V ++ ++ + S A G G + +L V+ Sbjct: 302 VSELRIDATINQACAAITVDEQQANAAFVKLALLSQYEQTRALAEG-GAQPNLNLSKVRG 360 Query: 365 LPVLVPPIKEQFDITNVINVETA---RIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +P+ +P EQ I + + A ID V L +A A G Sbjct: 361 IPLRLPEGPEQAQIVHRVGELFAFADTIDSRVAAATGKTRKLPSLT---LAKAFRG 413 >gi|191639032|ref|YP_001988198.1| Type I R/M system specificity subunit [Lactobacillus casei BL23] gi|190713334|emb|CAQ67340.1| Type I R/M system specificity subunit [Lactobacillus casei BL23] Length = 426 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 56/417 (13%), Positives = 125/417 (29%), Gaps = 37/417 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK-------------DGNSR 71 W+ + + T + + + + Y+ + + + Sbjct: 20 WEKRKLGEIFNVVTDYVANGSFKSLRQRVSTYSNPNFAYMIRLQDASNNWKGPWLYTDQQ 79 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLL 129 + G IL +G + ++ D + ++L+ L L Sbjct: 80 SYSFLAKTKLNPGDILMSNVGSVGKFFLVPDLDRPMTLAPNAILLRSMTYSTYFLFQLLQ 139 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + +T+ I + + I +P L E ++ + + +D LI Sbjct: 140 TSSMTESINEKTTPGVQQKINKTDLKKIITNVPTLNESSMVGQML----SLLDNLIAATQ 195 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + LK+ K + I + +++ G V H+++ + E K Sbjct: 196 DKLSFLKKMKMFFLQQIFPTKNHDVPQIRFDG---FTDVWSHYKLGSLMRIDKEQEVKKE 252 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF----------IDLQ 299 L + ++ KP V + + +L Sbjct: 253 LLTDIQKGFYVLAMRTFSMDGYIDHSKPYWLNHLDNVSDDKFLLPREFAILDADMDANLP 312 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLK 358 R L +A + + G D ++ LMR + + +G + L Sbjct: 313 KIGRVLLNASSEKYLLAAHVRKIQVKSGNDPIFIYALMRGNSVHERLKLEANGSISKRLL 372 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++V + +LVP EQ I ++ + +Q I +LK+ + S + Sbjct: 373 DKNVYKQSILVPNRSEQSRIGR----LFFLLETTITLHQQKIKMLKQVKKSCLQNLF 425 >gi|254433927|ref|ZP_05047435.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] gi|207090260|gb|EDZ67531.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] Length = 505 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 55/135 (40%), Gaps = 1/135 (0%) Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +++ G+I+ ++ + T +DS + + S + Sbjct: 31 YVINKGDILIGMSGAIGKVCRYKNGFPALQNQRTGKIEVFDESQMDSRFFGLYLSSIEGE 90 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + A G ++ ++ +D++ LP+ +PP EQ I I + +D +E ++ + L Sbjct: 91 LIRQAKGMAVQ-NISAKDIEALPLGLPPYNEQQRIVAKIEELFSELDKGIESLKTAREQL 149 Query: 404 KERRSSFIAAAVTGQ 418 K R + + A G+ Sbjct: 150 KVYRQAVLKHAFEGK 164 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 57/158 (36%), Gaps = 4/158 (2%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + R++ L + Y++ + R N + + ++ + Sbjct: 283 VDSSDLRSIKLDATEIQKYELSRNDLLCIRVNGSPNLVGRMILFKHDNVMAYCDHFIRFR 342 Query: 325 PHG--IDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + +Y+ L + + + + S + ++ + L + + EQ I + Sbjct: 343 FPQGIVLPSYIQMLFDTQTVRRYIELNKVSSAGQNTVSQTTISALAIPYCSLMEQKIIVS 402 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + I + +IE++ LK R S + A +GQ Sbjct: 403 RLEEQLTSISAVKVEIEENFQRLKSLRQSILKKAFSGQ 440 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 66/207 (31%), Gaps = 12/207 (5%) Query: 22 PKHWKVVPIKRFTK-LNTGRTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P W + ++ + G SGK I I L D+++ + Sbjct: 240 PNGWISIQLRELFESTQNGLAKRQGTSGKPIPVIRLADIKNQEVDSSDLRSIKLDATEIQ 299 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDV-LPELLQGWLLS 130 ++ +L ++ + C P+ + LP +Q + Sbjct: 300 KYELSRNDLLCIRVNGSPNLVGRMILFKHDNVMAYCDHFIRFRFPQGIVLPSYIQMLFDT 359 Query: 131 IDVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 V + IE A + I + +P L EQ +I ++ + I + E Sbjct: 360 QTVRRYIELNKVSSAGQNTVSQTTISALAIPYCSLMEQKIIVSRLEEQLTSISAVKVEIE 419 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVK 216 + LK +Q+++ + L P Sbjct: 420 ENFQRLKSLRQSILKKAFSGQLVPQDP 446 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/142 (21%), Positives = 49/142 (34%), Gaps = 6/142 (4%) Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQ----FLVLQPKDVLPELLQGWLLSIDVT 134 + KG IL G G + + Q V + +L SI+ Sbjct: 31 YVINKGDILIGMSGAIGKVCRYKNGFPALQNQRTGKIEVFDESQMDSRFFGLYLSSIEG- 89 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +G + + K I +P+ +PP EQ I KI +D I E Sbjct: 90 -ELIRQAKGMAVQNISAKDIEALPLGLPPYNEQQRIVAKIEELFSELDKGIESLKTAREQ 148 Query: 195 LKEKKQALVSYIVTKGLNPDVK 216 LK +QA++ + L + Sbjct: 149 LKVYRQAVLKHAFEGKLTAQWR 170 >gi|24379345|ref|NP_721300.1| putative type I restriction-modification system, specificity determinant; restriction endonuclease [Streptococcus mutans UA159] gi|24377270|gb|AAN58606.1|AE014930_8 putative type I restriction-modification system, specificity determinant [Streptococcus mutans UA159] Length = 603 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 55/398 (13%), Positives = 124/398 (31%), Gaps = 21/398 (5%) Query: 26 KVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFA 82 + ++ + + + + YI L V+ + K + ++ + I Sbjct: 215 EWKTLEEISVPIKNISWKENSERTYSYIDLSSVDRESKKITDITTITADKAPSRAQRIVK 274 Query: 83 KGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQ-PKDVLPELLQGWLLSIDVTQRIE 138 I++G P LR+ + ICST F V + +VLP + S D +E Sbjct: 275 TDDIIFGTTRPTLRRFAKVPENFNNQICSTGFYVFRASNEVLPSYIYHIFASNDFNSYVE 334 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 GA+ + +P+P L Q I + + + + + K Sbjct: 335 KNQSGASYPAIADSLVKKYKLPVPSLKIQSRIVQVLDNFDTVCND--------LNIGLPK 386 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + L + + G+ V ++ V K + +I Sbjct: 387 EIELRQKQYEYFRDKLLTFTAEGVYTDSTVQYRQDLIRLLQWVFGP-IKVSLGSICSISR 445 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 Q + + S + + + + + Sbjct: 446 GKRLIRSQLNKNGKYPVYQNSLIPLGYFNETNEEANTTFVISAGAAGEIGFSKQPFWKAD 505 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + I+ +L +++ S K+ + L ++ L V +P + Q I Sbjct: 506 DVWTMSSEFINQRFLYYMLLSNQ-SKIKGQVRKASIPRLSKNVIENLTVCLPESEGQSRI 564 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKER----RSSFIA 412 +V++ I+ + E + + I L +++ R ++ Sbjct: 565 VSVLDKFDTLINSISEGLPKEIELRQKQYEYFRDKLLS 602 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 62/202 (30%), Gaps = 2/202 (0%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + + G+EW + + F V + ++ E + ++ + + Sbjct: 6 DMIKDLCPDGVEW-KKLWEVTIWDKKFNSVPKFKQQLVDKYEYLLAKDLSQMVVSGGDIK 64 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + P + T + V GE + I + + I +A + + Sbjct: 65 ILTTSPSNLWTTEYVAGGEFFDKEIVAIPWGGNPIVQYYKGKFITGDNRIARVKNDDELL 124 Query: 332 YLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + K+ + Q V + +PP++ Q ++ +++ T + Sbjct: 125 TKYLYYYLQNNLKLISSFYRGSGIQHPDMSKVLDTKIPIPPLEIQEEVVKILDKFTDYVT 184 Query: 391 VLVEKIEQSIVLLKERRSSFIA 412 L ++ R ++ Sbjct: 185 ELTSELTLRQKQYSFYRDKLLS 206 >gi|291528110|emb|CBK93696.1| Restriction endonuclease S subunits [Eubacterium rectale M104/1] Length = 398 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 55/402 (13%), Positives = 123/402 (30%), Gaps = 22/402 (5%) Query: 29 PIKRFTKLNTGRTSESG------KDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIF 81 IK ++ TG+T +G +I++I D+ + K + + Sbjct: 2 KIKDIGRVVTGKTPLTGVNEYYGGNIMFISPSDLHGDYLIEKSEKTITEEGLKSIESNSI 61 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L G +G + + + + Q ++ L + + + +I Sbjct: 62 DGISVLTGCIGWDMGNVAMCNSRCATNQQINAIIDFNHKLVDPRYVYYWLKGKKDYLFSI 121 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 NI +P+P L Q + + + ID I + + + L+E + Sbjct: 122 ASVTRTPILSKSVFENIDIPLPSLKIQERVTKLL----SLIDEKIRKNHQINDYLEEMAK 177 Query: 201 ALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + Y + PD K SG + + + + + + N + L Sbjct: 178 TIYDYWFVQFDFPDENGNPYKSSGGKMIFCKELNRNIPQNWEYTSVGNITKCLDSDRIPL 237 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ-VMERGII 316 S ++ Y I ++ D Q + I Sbjct: 238 SSHQREEMKGTIPYYGATGIMDYVNRPIFSGDFVLLAEDGSVMDDNGNPILQRISGDVWI 297 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + ++P S L +L+ + ++ + ++ +L P + + Sbjct: 298 NNHTHVLQPVNGYSCRLLYLLLKNIPVSMIK--TGSIQLKINQANLNSYNILNIPKEIRT 355 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 N I +I L ++ +L + R + + GQ Sbjct: 356 QFINQIEPMDTKIIQL----QKENNILVQTRDWLLPILMNGQ 393 >gi|90580557|ref|ZP_01236362.1| probable type I restriction modification system methylase [Vibrio angustum S14] gi|90438215|gb|EAS63401.1| probable type I restriction modification system methylase [Photobacterium angustum S14] Length = 442 Score = 87.2 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 64/442 (14%), Positives = 134/442 (30%), Gaps = 59/442 (13%) Query: 23 KHWKVVPIKRFTK-LNTG--RTSES-GKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTST 77 +W V+ + + + G ++ +S ++D+ + + D Sbjct: 3 SNWLVLTLGDVCERITDGAHKSPKSVDDGKPMASVKDLTRFGVDLSNARKISKNDFDELV 62 Query: 78 VSIFAK--GQILYGKLG--PYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLS 130 G +L K G + S L P+ + L+ + S Sbjct: 63 QQGCKPQVGDVLIAKDGNSALDTVCTVDTEIDAVLLSSVAILRPDPEKLDSNFLKYYFCS 122 Query: 131 IDVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 V ++ GA + + + +PP+ Q I + + ID I Sbjct: 123 PQVIDYLKTNFISGAAIPRVVLRDFRKAEINLPPIETQRKISQYL----SSIDNKIFVNS 178 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE-----------------WVGLVPDHW 232 + + L++ QA+ KM E +GL+PD W Sbjct: 179 KINQTLEQMAQAIFKSWFVDFDPVKAKMNGKQPEGMDAATASLFPEKLVESELGLIPDGW 238 Query: 233 EVKPFFALVTELNR---------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 EVK + K + + Y + LK +Y Sbjct: 239 EVKNVGDFTDTFDYVANGSFAALKANVELYDEPNEVIYVRTTDFNKGFKNDLKYTDEPSY 298 Query: 284 QIVDPGEIVFRFIDLQNDK----RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Q + ++ + N + + S M + G +S Y+ ++ +S Sbjct: 299 QFLSKSKLYGHETIISNVGDVGTVFRAPSWYDMPMTLGSNAMGIVSKGANS-YIYYMFKS 357 Query: 340 YDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI-- 396 + + + SG + ++L V++P + V+ D L K Sbjct: 358 HIGQHLLDGITSGSAQMKFNKTSFRKLRVVLPSKE-------VLAKFEELEDSLWAKHAS 410 Query: 397 -EQSIVLLKERRSSFIAAAVTG 417 ++ + L+ R + + ++G Sbjct: 411 NQKESLHLERLRDTLLPKLLSG 432 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 58/202 (28%), Gaps = 16/202 (7%) Query: 18 IGAIPKHWKVVPIKRFTK----LNTGRT---------SESGKDIIYIGLEDVESGTGKYL 64 +G IP W+V + FT + G + ++IY+ D G L Sbjct: 231 LGLIPDGWEVKNVGDFTDTFDYVANGSFAALKANVELYDEPNEVIYVRTTDFNKGFKNDL 290 Query: 65 PKDGNSRQSDTSTVSIFAKGQIL--YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 S ++ I+ G +G R D + + + K Sbjct: 291 KYTDEPSYQFLSKSKLYGHETIISNVGDVGTVFRAPSWYDMPMTLGSNAMGIVSKGANSY 350 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + I ++ I G+ + + + +P E + + Sbjct: 351 IYYMFKSHI-GQHLLDGITSGSAQMKFNKTSFRKLRVVLPSKEVLAKFEELEDSLWAKHA 409 Query: 183 TLITERIRFIELLKEKKQALVS 204 + E + L L+S Sbjct: 410 SNQKESLHLERLRDTLLPKLLS 431 >gi|260913244|ref|ZP_05919726.1| type I restriction enzyme StySJI specificity protein [Pasteurella dagmatis ATCC 43325] gi|260632831|gb|EEX51000.1| type I restriction enzyme StySJI specificity protein [Pasteurella dagmatis ATCC 43325] Length = 206 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 59/168 (35%), Gaps = 9/168 (5%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + L +S + ++ + + Y +I+F I + +E Sbjct: 39 DISFLPMSLVSEYGQVIGFETRKVYKVKKGYTAFKNKDIIFAKITPCFENGKAALLNDLE 98 Query: 313 RGI---ITSAYMAVKPHGIDSTYLAWLMRSY--DLCKVFYAMGSGLRQSLKFEDVKRLPV 367 G T ++ + + +L + S + GS +Q + + + + Sbjct: 99 NGYGFGSTEFHVIRSQNNCNPNFLFSYLYSDTLLIKGKKSMTGSAGQQRVPAQFFENYII 158 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +PP +EQ I N + + +D L+ + Q I LK + + Sbjct: 159 ALPPPEEQQAIANCL----SSLDSLISEQNQQICRLKTHKKGLMQQLF 202 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 72/207 (34%), Gaps = 19/207 (9%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYL 64 +PQ+KD K W+V +K +N + DI ++ + + S G+ + Sbjct: 6 RFPQFKDC---------KGWEVAELKDIALVNPKKENLPDDLDISFLPMS-LVSEYGQVI 55 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI------IADFDGICSTQFLVLQP-K 117 + + F I++ K+ P + + G ST+F V++ Sbjct: 56 GFETRKVYKVKKGYTAFKNKDIIFAKITPCFENGKAALLNDLENGYGFGSTEFHVIRSQN 115 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + P L +L S + + + G+ + N + +PP EQ I + + Sbjct: 116 NCNPNFLFSYLYSDTLLIKGKKSMTGSAGQQRVPAQFFENYIIALPPPEEQQAIANCLSS 175 Query: 177 ETVRIDTLITERIRFIELLKEKKQALV 203 I + R K Q L Sbjct: 176 LDSLISEQNQQICRLKTHKKGLMQQLF 202 >gi|322372657|ref|ZP_08047193.1| type I restriction-modification system specificty subunit [Streptococcus sp. C150] gi|321277699|gb|EFX54768.1| type I restriction-modification system specificty subunit [Streptococcus sp. C150] Length = 394 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 50/392 (12%), Positives = 115/392 (29%), Gaps = 31/392 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W I K++ G + K V + + D Sbjct: 28 WVENRIADIVKISAGGDVDKIKLKETGQYPVVANS---LTNRGIVGFYDD----YKVKAP 80 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + G + + + + + + E Sbjct: 81 AVTVTGRGDVGYAVARHENFTPIVRLLTLQSENIDVD-------YLENQINSMRILNEST 133 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + +GN + P + EQ I + + + L + Sbjct: 134 GVPQLTAPQLGNYKVYHPEINEQTAIGSLFRNLDDLLASYKDNLANYQSLKATMLAKMFP 193 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 P++++ EW + + + + +S G I Sbjct: 194 KAGQTI--PEMRLDRFEGEW------EIKKFKSISTKRGKSNSKGYDYPAYSVSNQSGLI 245 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 Q + L+ +Y+IV+P E + + + S+ + E I++S Y+ Sbjct: 246 PQSEQFEGSRLENLEKTSYKIVEPNEFAYNP--ARINVGSIAFNDLDETVIVSSLYVIFS 303 Query: 325 -PHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPP-IKEQFDITNV 381 I++ Y ++S + K +R+ L +E+ + + +PP ++EQ I Sbjct: 304 LDKSINNNYALLFIKSPEFNKEVRRNTEGSVREYLFYENFANIRIPIPPSLEEQQAIGAY 363 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ I L+ ++ + Sbjct: 364 ----FSNLDNLINSYQEKISQLETLKNKLLQD 391 >gi|194467964|ref|ZP_03073950.1| type I restriction endonuclease S subunit domain protein [Lactobacillus reuteri 100-23] gi|194452817|gb|EDX41715.1| type I restriction endonuclease S subunit domain protein [Lactobacillus reuteri 100-23] Length = 397 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 59/400 (14%), Positives = 119/400 (29%), Gaps = 31/400 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK----DGNSRQSDTSTVS 79 W+ K S+S KD + + G D + Sbjct: 20 DWEQRKGKSIFY------SKSNKDFPELTVLSATQDKGMIPRSSTGIDIKYEKKSLRGYK 73 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G + L + +D GI S + V K W I+ Sbjct: 74 KIEPGDFVV-HLRSFQGGFAYSDLTGIVSPAYTVFTFKQPEMFNNYFWKEKFTSYNFIQL 132 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + + + + + KI ID LIT + R +E LK + Sbjct: 133 LKKVTYGVRDGRSISYSDFLTLNEKFPVKVEQTKIADLFKIIDNLITLQQRKLEQLKLLE 192 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 +AL + ++ + + W + TE K + E + Sbjct: 193 KALQQKLFPNSFQEKPLLR------ILHGDNSWWNNYIGEVFTERVDKGS--SEKLLSVS 244 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + E++ + Y+ V +I + + L + + GI++ A Sbjct: 245 ITDGVYPFDESKRKNNSSDDKHNYKKVFQNDIAYNSMRLWQGALGVSKYE----GIVSPA 300 Query: 320 YMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVPPIKEQ 375 Y +KP ++ ++ ++ D+ +F GL +LKF ++ + + + Q Sbjct: 301 YTVLKPLPNQNSIFYEFMFKNIDMLHIFQRNSQGLTSDTWNLKFNQLQHIKIKTTNLNSQ 360 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I +I+ L L + + Sbjct: 361 NKIA----KLLIKIEELKNNESNYYHNLMTLKKYLLQKLF 396 >gi|300214618|gb|ADJ79034.1| Type I restriction-modification system specificity subunit [Lactobacillus salivarius CECT 5713] Length = 375 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 49/375 (13%), Positives = 124/375 (33%), Gaps = 28/375 (7%) Query: 47 DIIYIGLEDVESGTG--KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 DI +I D+++ + K ++ + S + I + A ++ Sbjct: 16 DIPWIQSSDLKNDDIWNVNINKYITNKAVNDSAAKLIPANSIAIVTRVGVGKLAYMSQEY 75 Query: 105 GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 ++ K+ L ++ ++ + +G ++ K + N+ + I Sbjct: 76 STSQDFLSLVDIKEDLIFIMYMLYFK---ISKVSSSLQGTSIKGITKKELLNLSISIVNN 132 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG-LNPDVKMKDSGIE 223 + I +D I +++LL + + L+ + + P+++ K + Sbjct: 133 TAEQNR---IGQVFKILDNSINLHEDYLQLLYDFRSFLLQKMFSINDTFPNLRFKQFNDK 189 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 W V ++ T + N E N +S Sbjct: 190 W---------KYKKLGEVADIVSGGTPDTTKHDYWNGSINWYTPAEVGNKIFVSDSQRKI 240 Query: 284 QIV-----DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + + + ++A + E+G + ++ P Sbjct: 241 TNIGLENSSAKILPVGTVLFTSRAGIGKTAILKEKGSTNQGFQSIVPKQKFLDSYFIFSM 300 Query: 339 SYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S L K + G+G + +++ + + +P I EQ +I+ V+ ++D ++ + Sbjct: 301 SNILKKYGESHGAGSTFLEISGKELAKARISLPSITEQKNISKVLF----KLDTIITLQK 356 Query: 398 QSIVLLKERRSSFIA 412 Q I LK+ + + Sbjct: 357 QEIDNLKKLKQFLLQ 371 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 63/186 (33%), Gaps = 8/186 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGK-YLPKDGNSRQSDTST 77 WK + + +G T ++ K I + +V + + + + S+ Sbjct: 190 WKYKKLGEVADIVSGGTPDTTKHDYWNGSINWYTPAEVGNKIFVSDSQRKITNIGLENSS 249 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 I G +L+ + K I G + F + PK + + +S + + Sbjct: 250 AKILPVGTVLFTS-RAGIGKTAILKEKGSTNQGFQSIVPKQKFLDSYFIFSMSNILKKYG 308 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E+ G+T K + + +P + EQ I + + I E +L + Sbjct: 309 ESHGAGSTFLEISGKELAKARISLPSITEQKNISKVLFKLDTIITLQKQEIDNLKKLKQF 368 Query: 198 KKQALV 203 Q + Sbjct: 369 LLQNMF 374 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/178 (10%), Positives = 59/178 (33%), Gaps = 6/178 (3%) Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + + +N +I + ++ K + + I I + Sbjct: 1 MXXTPDTQNKNYWIGDIPWIQSSDLKNDDIWNVNINKYITNKAVNDSAAKLIPANSIAIV 60 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + A + + + ++++ D ++ +++ + + KV ++ + + Sbjct: 61 TRVGVGKLAYMSQEYSTSQDFLSLVDIKEDLIFIMYMLY-FKISKVSSSLQGTSIKGITK 119 Query: 360 EDVKRLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +++ L + + EQ I +D + E + LL + RS + + Sbjct: 120 KELLNLSISIVNNTAEQNRIG----QVFKILDNSINLHEDYLQLLYDFRSFLLQKMFS 173 >gi|149391960|emb|CAL68657.1| restriction-modification enzyme [Pseudomonas putida] Length = 1289 Score = 86.8 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 64/409 (15%), Positives = 143/409 (34%), Gaps = 40/409 (9%) Query: 25 WKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W +PI++ LN ++ + +I ++ + V + + + Sbjct: 903 WPQMPIRQVAVLNPRKSELKGFSASTEISFVEMASVSEDGFITGAVRRKLGEVLKGSYTY 962 Query: 81 FAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKD--VLPELLQGWLLSID 132 FA+ I+ K+ P + +++ G+ S++F V++ V+P+ + G+L + Sbjct: 963 FAEDDIIIAKITPCMENGKCALARGLSNKIGMGSSEFHVIRADKGKVIPDFVFGYLNRAE 1022 Query: 133 VTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 V + E G++ ++ +P+PPL Q I + E ++D + Sbjct: 1023 VRKVAEKSMTGSSGHRRVPESFYADLRIPVPPLKVQSQICD----EFTKVDKAVQSARTK 1078 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 I ++ + LV I K S GL EV Sbjct: 1079 IASTQQSIELLVESIYASTAPRIEIAKLSSNIQYGLSEKMNEV-------------GIGY 1125 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + G ++ + + E + Y ++ G+++F + + + Sbjct: 1126 KIFRMNEIIQGRMVDDGAMKCADISVEEFANY-KLNKGDLLFVRSNGSLEHIGKVGLFDL 1184 Query: 312 ERGIITSAY---MAVKPHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLP 366 E ++Y + YL +M S K V A+ SG ++ +K + Sbjct: 1185 EGDYCYASYLVRIVPDSSKALPQYLVSIMNSPIFRKGMVQLAVKSGGTNNINATKMKSIK 1244 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 V P + EQ + ++ + + + I R+ + + + Sbjct: 1245 VPTPSLAEQEEFVVKVDALGKQ----IADAQAVIDAAPARKEAVMKKYL 1289 >gi|328471221|gb|EGF42123.1| restriction modification system DNA specificity subunit [Vibrio parahaemolyticus 10329] Length = 418 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 70/425 (16%), Positives = 140/425 (32%), Gaps = 51/425 (12%) Query: 23 KHWKVVPIKR-FTKLNTGRT------SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 K WK + L +G + + S D + + V G + Sbjct: 3 KEWKNGRVGELIASLESGISVNGEDGTPSNDDYAVLKVSAVTYGKFNPQASKKITGSELQ 62 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL-------VLQPKDVLPELLQGWL 128 KGQI+ + +FL V P+ + + Sbjct: 63 RAKCNPKKGQIIISRSNTPDLVGASCYVSEDYPNRFLPDKLWQTVPHPEKKVEHKWLAYF 122 Query: 129 ---LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 A +M + + +P+ IPP EQ I + I Sbjct: 123 LASPWARFRLSKLATGTSNSMKNITKSELLTLPVAIPPFLEQKKIASFLECWDNAI---- 178 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 EK +AL++ + G WE + VT N Sbjct: 179 -----------EKTEALIAAKEKQFEWLCQTYFKPGNSTN----SGWEKHKIASFVTVRN 223 Query: 246 RKNTKLIESNILSLSYGNI---IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + E + SL+ N R + + + Y++V P +IVF +L+ Sbjct: 224 EREVPSEEVPLYSLTIENGVTAKTDRYNREFLVIDKGGKKYKVVHPKDIVFNPANLRW-- 281 Query: 303 RSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSL 357 ++ ++V + +++ Y + V + IDS +L + +F M G R ++ Sbjct: 282 GAIARSEVEHKVVLSPIYEVLKVDENKIDSDFLTHALTCSRQIAIFATMVEGTLVERMAV 341 Query: 358 KFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 K + + VP +EQ +I +V+N+ + +++++ + ++ + +T Sbjct: 342 KIDTFLSCHIHVPSSKEEQKNIAHVLNLSKQE----ISLLKKTLEQYRSQKRGLMQKLLT 397 Query: 417 GQIDL 421 G+ + Sbjct: 398 GEWQV 402 >gi|57506131|ref|ZP_00372053.1| type I restriction-modification system S subunit, putative [Campylobacter upsaliensis RM3195] gi|57015615|gb|EAL52407.1| type I restriction-modification system S subunit, putative [Campylobacter upsaliensis RM3195] Length = 544 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 56/435 (12%), Positives = 120/435 (27%), Gaps = 68/435 (15%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESG--TGKYLPKDGNSRQSD 74 IP W V + ++ +G + + I + ++ D +Q Sbjct: 102 IPNSWAWVKLGDICEIISGTSYSKDDLSDEGIRILRGGNINKNSHNIDLFADDVIIKQDL 161 Query: 75 TSTVSIFAKGQIL-YGKLGP--YLRKAIIADFDGICSTQ----FLVLQPKDVLPELLQGW 127 T+ K IL G + K+ +D + ++ K+ + + Sbjct: 162 TNKEKQILKNDILMIASTGSKEIIGKSAFSDVALENTQIGAFLRIIRISKEQNAKYIFHN 221 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L+S I++ G + + + I N +P+PPL EQ I +K+ + Sbjct: 222 LISQIFATHIKSCAGGTNILNIKNEYIENFLIPLPPLCEQQEIVKKLDLLVTLANDFAIT 281 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR- 246 + + K +++L+ + L+ + + + + E E Sbjct: 282 KENLKRIEKRIEKSLLKLALEGSLSKLYRRSSPTLCAFNEINTYNEAIKQKHKNLEKELK 341 Query: 247 ------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 K K E L S +++K + + P + P + + Sbjct: 342 KCEKEFKLEKDKEQKALFKSQIQMLKKELIKCKEITPLNSTEAPFTIPNSWAWVKLGDIC 401 Query: 301 DKR------------------------------------------------SLRSAQVME 312 + S E Sbjct: 402 EIISGEIIDLQEENLPLLDVKYLRSKGDKKLANSGNFANANDRLILMDGENSGEIFITKE 461 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 +G + S ++ + M L + L + +PP+ Sbjct: 462 KGFLGSTLKKLEFSSLSQVEFMDFMLLCYKDFFKGNKKGAAIPHLDRKLFANLLIPLPPL 521 Query: 373 KEQFDITNVINVETA 387 KEQ I +++ Sbjct: 522 KEQEHIVQILDTLFT 536 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/203 (12%), Positives = 63/203 (31%), Gaps = 16/203 (7%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSY--GNIIQKLETRNMGLKPE-------S 279 P+ W + ++ + + + + G I K + + Sbjct: 103 PNSWAWVKLGDICEIISGTSYSKDDLSDEGIRILRGGNINKNSHNIDLFADDVIIKQDLT 162 Query: 280 YETYQIVDPGEIVFRFIDL--QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + QI+ ++ K + + I + ++ Y+ + Sbjct: 163 NKEKQILKNDILMIASTGSKEIIGKSAFSDVALENTQIGAFLRIIRISKEQNAKYIFHNL 222 Query: 338 RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV-LVEK 395 S + G ++K E ++ + +PP+ EQ +I +++ + + K Sbjct: 223 ISQIFATHIKSCAGGTNILNIKNEYIENFLIPLPPLCEQQEIVKKLDLLVTLANDFAITK 282 Query: 396 IE-QSIVLLKERRSSFIAAAVTG 417 + I E S + A+ G Sbjct: 283 ENLKRIEKRIE--KSLLKLALEG 303 >gi|322377800|ref|ZP_08052289.1| type I restriction-modification system specificty subunit [Streptococcus sp. M334] gi|321281223|gb|EFX58234.1| type I restriction-modification system specificty subunit [Streptococcus sp. M334] Length = 418 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 51/411 (12%), Positives = 127/411 (30%), Gaps = 34/411 (8%) Query: 25 WKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVES----GTGKYLPKDGNSRQSDT 75 W + + +G + I + + D+ + + Q Sbjct: 17 WGNYKLGQLGSFKSGIGFPDSQQGGTEGIPFFKVSDMNNIGNETEMRNANNYVTQEQIVK 76 Query: 76 STVSIFAK-GQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++ ++ I++ K+G RK ++ I + K + +I Sbjct: 77 NSWNVVKDTPAIIFAKVGAALMLNRKRLVTKTFLIDNNTMSYSLNKSWDKDFGLTLFQTI 136 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + I I + +P + EQ I + + + Sbjct: 137 YLP----KYAQIGALPSYNASDIATIKVNVPNIQEQSAIGTLFRTLDDLLASYKDNLANY 192 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRK 247 L + P++++ EW + D + V Sbjct: 193 QSLKATMLSKMFPKAGQT--VPEIRLDGFEGEWEKTTLEKSTDRVKSYSLSRDVETNQDT 250 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--- 304 K I + L ++I + + + + + G+++F + Sbjct: 251 GLKYIHYGDIHLGKVSMIDDGNS--IPYIKTDTKLSEFLQQGDLIFADASEDYKGIAEVA 308 Query: 305 LRSAQVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDV 362 + + E+ + +AV+P DS +L ++ ++ K Y +G+G+ + ++ Sbjct: 309 VVVDALSEKIVAGLHTIAVRPQSIFDSIFLYFMFKTQTFRKYGYKVGTGMKVFGISPSNL 368 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + P KEQ I + +D L+ ++ I L+ + + Sbjct: 369 MKYEFYYPDKKEQQAIGFY----FSNLDNLINSHQEKISQLETLKKKLLQD 415 >gi|300925837|ref|ZP_07141685.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 182-1] gi|300418089|gb|EFK01400.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 182-1] Length = 381 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 57/380 (15%), Positives = 118/380 (31%), Gaps = 34/380 (8%) Query: 28 VPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 V I+ K+ TG+T G +I +I ++ + + L + + +T + Sbjct: 2 VSIESVAKVITGKTPPKADPNCFGGNIPFITPSEL-TDSDYLLKPETTLTEKGLATTKLI 60 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K IL +G L K IAD + Q + D G+ + ++ I Sbjct: 61 PKNSILVCCIGS-LGKMAIADLPVATNQQINSVIFDDDKIYYRFGFYALKLLKNDLKKIA 119 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 T++ + + +P PPL EQ I + Sbjct: 120 PSTTVAIINKSRFSELKIPCPPLEEQKRIATILDKADGIHKKREQA-------------- 165 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + ++M + + P V + L Sbjct: 166 -IKLADDFLRAKFLEMFGTPANNIHRFPKGTIRD-LVDSVNYGTSAKASIDSGEYPILRM 223 Query: 262 GNIIQKLETRNMGLKPES----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 GNI + LK + +V G+++F + + + Sbjct: 224 GNITYQGRWDFTDLKYLDLSVKEKDKYLVKEGDLLFNRTNSKELVGKTAVYEEDRPMAFA 283 Query: 318 SAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKE 374 + V+P+ I ++ Y++ + S M + ++ ++++ + +L+PP Sbjct: 284 GYLIRVRPNSIGNNYYISGYLNSIHGKITLMNMCKSIVGMANINAQELQNIEILIPPKHL 343 Query: 375 QFD---ITNVINVETARIDV 391 Q + I I + D Sbjct: 344 QDEYEIIYKKIKKGLSIYDK 363 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 42/137 (30%), Gaps = 8/137 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 L L + T +++ I+ I K ++ V I S Sbjct: 40 DYLLKPETTLTEKGLATTKLIPKNSILVCCIGSLG-KMAIADLPVATNQQINSVIFDDDK 98 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + S + L + PP++EQ I +++ Sbjct: 99 IYY---RFGFYALKLLKNDLKKIAPSTTVAIINKSRFSELKIPCPPLEEQKRIATILD-- 153 Query: 386 TARIDVLVEKIEQSIVL 402 + D + +K EQ+I L Sbjct: 154 --KADGIHKKREQAIKL 168 >gi|187477054|ref|YP_785078.1| type i restriction enzyme EcoR124II specificity protein [Bordetella avium 197N] gi|115421640|emb|CAJ48150.1| type i restriction enzyme EcoR124II specificity protein [Bordetella avium 197N] Length = 406 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 51/408 (12%), Positives = 120/408 (29%), Gaps = 48/408 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 P+ G +G+ ++ V+ G N + Sbjct: 17 DWKPLGEVLNRTKGTKITAGQMRELHKEGGPVKIFAGGKTVAFVNFNDIPEKDIQTVP-- 74 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I+ G + + + KD + + I Sbjct: 75 SIIVKSRGII--EFEYYENPFTHKNEMWAYNAKDRALNIKYVYHFLKLNEPHFHGIGSKM 132 Query: 145 TMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 M +PIP L Q I + A T L TE + Sbjct: 133 QMPQIAIPDTDGFSIPIPCPNNPKRSLEIQAEIVRILDAFTELTAELSTELSARKKQYNY 192 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + L+S G + + + + + + E I Sbjct: 193 YRDQLLS--------------------FGEGVPFLSLAQCCESIADGDHQAPQKTEDGIP 232 Query: 258 SLSYGNI--IQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 ++ N+ +++ N SY ++ + +I++ + + Sbjct: 233 FITISNVSATNQIDFSNTKFVSNSYYDGLDSKRKARTNDILYTVVGSFGIPVHI---DCE 289 Query: 312 ERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVL 368 ++ ++P+ + S Y+ +RS + K + + +G ++++ + R+ + Sbjct: 290 KKFAFQRHIAILRPNPAVVLSKYMYHALRSSAVEKQAHKVAAGAAQKTITLSALNRMLIA 349 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 VP ++EQ I +++ + + E + + I L K+ R ++ Sbjct: 350 VPSLEEQARIVAILDKFDVLTNSIAEGLPREIELRKKQYKHYRDLLLS 397 >gi|323490713|ref|ZP_08095915.1| putative typeI restriction enzyme MjaXP specificity protein [Planococcus donghaensis MPA1U2] gi|323395595|gb|EGA88439.1| putative typeI restriction enzyme MjaXP specificity protein [Planococcus donghaensis MPA1U2] Length = 409 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 52/412 (12%), Positives = 130/412 (31%), Gaps = 37/412 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 WK + + G ++ +I + D+ + K S + Sbjct: 14 EWKQQELSELLEFKNGINADKDSYGHGTKFINVLDILNNDYILSDKIIGSVNATVQQFQT 73 Query: 81 --FAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSI 131 G IL+ + + + F++ K L + Sbjct: 74 YSVTHGDILFLRSSETREDVGKCNVYLDEEKASVFGGFVIRGKKIADYSPFFLKTALNNS 133 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +I + G+T + + + + IP + EQ + ++ + + + + Sbjct: 134 SARNQISSKAGGSTRYNVGQGILSEVTVMIPKIEEQQKVSSFLMLLNRKTEKQQEKIEKL 193 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +L K Q + S + KD G W+ ++ E K + Sbjct: 194 EQLKKGMMQEIFSQELR--------FKDEDGGEFGE----WKSIKLNKILEERKEKCNER 241 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS-AQV 310 +I ++ + Y +V G+IV+ + + + + Sbjct: 242 NLKVHSVAVRAGVINQITHLGRSFAAKDVSNYSVVKYGDIVYTKSPTGDFPFGIIKQSHI 301 Query: 311 MERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAM-GSGLRQSLKFEDV----K 363 E I++ Y +P Y+ + M + +++ G + ++ + K Sbjct: 302 KEDVIVSPLYGIYEPKNFYIGYILHSYFMYKNNTTNYLHSIVQKGAKNTINITNQNFVSK 361 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + + EQ I + + D ++K ++ +++L+E++ F+ Sbjct: 362 NIQLPI-SEVEQKQIADFL----RNTDRKIKKEKEKLMVLEEQKKGFMQRLF 408 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 24/215 (11%), Positives = 73/215 (33%), Gaps = 7/215 (3%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ EW + + + + + L+ I+ + Sbjct: 4 PQLRFDGFDGEWKQQELSELLEFKNGINADKDSYGHGTKFINVLDILNNDYILSDKIIGS 63 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--S 330 + + ++TY + + R + + D E+ + ++ D Sbjct: 64 VNATVQQFQTYSVTHGDILFLRSSETREDVGKCNVYLDEEKASVFGGFVIRGKKIADYSP 123 Query: 331 TYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L + + + G R ++ + + V++P I+EQ +++ + + Sbjct: 124 FFLKTALNNSSARNQISSKAGGSTRYNVGQGILSEVTVMIPKIEEQQKVSSFLM----LL 179 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 + EK ++ I L++ + + + ++ + E Sbjct: 180 NRKTEKQQEKIEKLEQLKKGMMQEIFSQELRFKDE 214 >gi|308062794|gb|ADO04682.1| restriction modification system DNA specificity subunit [Helicobacter pylori Cuz20] Length = 318 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 45/333 (13%), Positives = 105/333 (31%), Gaps = 20/333 (6%) Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + I + F L P + + + L + + ++ + G+T Sbjct: 2 TSRASIGDCAILKVVATTNQGFQSLIPLEKINNE-FLYYLILTLKNKLLKLASGSTFLEV 60 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 I N+ +P+PPL EQ+ I + A + L ++ + K L+S Sbjct: 61 SPNKIKNLLIPLPPLNEQIAIANILSALDRYLYALDALILKKEGVKKALSFELLSQ---- 116 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 ++K W + + + + + S I + N + + Sbjct: 117 ----RKRLKGFNQAWQRVKVKDFGIIITGSTPLTQISEYWNGTISWITP-TDINDNKDIF 171 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + + T +++ ++ I LR G A+ P+ Sbjct: 172 NSERKITQKGLNTIRMIPKNSVLVTCIASIGKNAILRV-----NGACNQQINAIIPNKDF 226 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETAR 388 + + + + + G + + + + VP + EQ I N+++ Sbjct: 227 NADFIYYLMENNKQYLLGKAGVTATYIISKQVFEEIDFFVPKDLNEQSAIANILSALDNE 286 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I L K Q + + + ++ +I + Sbjct: 287 IASLKNKKRQ----FENIKKALNHDLMSAKIRV 315 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 40/113 (35%), Gaps = 4/113 (3%) Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + ++ P + + + K+ + +K L + +PP+ Sbjct: 17 ATTNQGFQSLIPLEKINNEFLYYLILTLKNKLLKLASGSTFLEVSPNKIKNLLIPLPPLN 76 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 EQ I N+++ + L I + + + + ++ + L+G +Q Sbjct: 77 EQIAIANILSALDRYLYALDALILKK----EGVKKALSFELLSQRKRLKGFNQ 125 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 64/186 (34%), Gaps = 8/186 (4%) Query: 25 WKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ V +K F + TG T + I +I D+ + + Q +T+ Sbjct: 127 WQRVKVKDFGIIITGSTPLTQISEYWNGTISWITPTDI-NDNKDIFNSERKITQKGLNTI 185 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + K +L + + AI+ +G C+ Q + P +L+ + + Sbjct: 186 RMIPKNSVLVTCIASIGKNAILR-VNGACNQQINAIIPNKDFNADFIYYLMENNKQYLLG 244 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 AT + L EQ I + A I +L ++ +F + K Sbjct: 245 KAGVTATYIISKQVFEEIDFFVPKDLNEQSAIANILSALDNEIASLKNKKRQFENIKKAL 304 Query: 199 KQALVS 204 L+S Sbjct: 305 NHDLMS 310 >gi|312965803|ref|ZP_07780029.1| type I restriction modification DNA specificity domain protein [Escherichia coli 2362-75] gi|331669720|ref|ZP_08370566.1| putative type I restriction-modification system specificity subunit [Escherichia coli TA271] gi|312289046|gb|EFR16940.1| type I restriction modification DNA specificity domain protein [Escherichia coli 2362-75] gi|331063388|gb|EGI35301.1| putative type I restriction-modification system specificity subunit [Escherichia coli TA271] Length = 372 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 61/398 (15%), Positives = 123/398 (30%), Gaps = 44/398 (11%) Query: 26 KVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++V + + + +G + I + D+ SG K + + Sbjct: 5 QLVTLGKHIDILSGCAFPSSGFNRNNGVPLIRIRDILSG------KTETYYEGSYDLKYL 58 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG +L G G + R+ D + + + + P + + +I A Sbjct: 59 IKKGDLLVGMDGDFNRE-YWKGTDALLNQRVCKITPNPETLDKNFLYHFLQKELDKIHAT 117 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + T+ H K I +I + +P L EQ I + I + I+ + Sbjct: 118 TDVVTVKHLSVKKIQDIKIRLPSLKEQKRIAAILDKADA-IRQKREQAIKLADDFLRATF 176 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A + NP K + +G + + K+ + E + Sbjct: 177 ATM------YGNPITNPKKWPVHLMGEIIEFK--------GGNQPPKSDFIFEPKQGYIR 222 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 I + P+ I + +++ + G A Sbjct: 223 LVQIRDFKSDKYATYIPQEKAKR-IFEVDDVMIARYGPP-----VFQILRGLSGSYNVAL 276 Query: 321 MAVKPHGIDSTYL-AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFD 377 M P +L++ + V + + + E + + V +PPI Q + Sbjct: 277 MKASPKENIRKGFIFYLLQLPEYHDVVVKNSERTAGQTGVNLELLNKFNVPLPPIYYQDE 336 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLK----ERRSSFI 411 I + ARI+ EKIE S+ L+ + + Sbjct: 337 ILARL----ARIEKFKEKIEISLNHLEMQFLSLQKRLM 370 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 56/185 (30%), Gaps = 4/185 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 PK W V + + G I + + + I Sbjct: 187 PKKWPVHLMGEIIEFKGGNQPPKSDFIFEPKQGYIRLVQIRDFKSDKYATYIPQEKAKRI 246 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 F ++ + GP + + + G + + PK+ + + +LL + ++ Sbjct: 247 FEVDDVMIARYGPPVFQI-LRGLSGSYNVALMKASPKENIRKGFIFYLLQLPEYHDVVVK 305 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + + + + +P+PP+ Q I ++ + + Sbjct: 306 NSERTAGQTGVNLELLNKFNVPLPPIYYQDEILARLARIEKFKEKIEISLNHLEMQFLSL 365 Query: 199 KQALV 203 ++ L+ Sbjct: 366 QKRLM 370 >gi|297192314|ref|ZP_06909712.1| restriction modification system DNA specificity subunit [Streptomyces pristinaespiralis ATCC 25486] gi|197719704|gb|EDY63612.1| restriction modification system DNA specificity subunit [Streptomyces pristinaespiralis ATCC 25486] Length = 494 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 52/414 (12%), Positives = 116/414 (28%), Gaps = 25/414 (6%) Query: 21 IPKHWKVVPIKR--FTKLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P+ W + + GR+ + + L + S + +D + Sbjct: 17 LPEGWAWATVGDVLIAPIANGRSVRTEDGGFPVLRLTALRSDKVDLAERKEGEWTADEAA 76 Query: 78 VSIFAKGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQP-KDVLPELLQGWLL 129 + L + L D T V P + + P Sbjct: 77 PFLVRANDFLICRGSGSLDLVGRGALVPEAPDPVAFPDTMIRVRVPVEHMSPRFFTRLWA 136 Query: 130 SIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S V ++IEA + + + +P+PP AEQ I + R+D + Sbjct: 137 SPLVREQIEAAARTTAGIYKVSQPAVRELRIPVPPTAEQHRIAAALDTRMARLDAVDRAV 196 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 L ++A++ L+ + + W Sbjct: 197 TSARRDLAALRKAVL-------LDAVPEPEQWPAHWTATTTGKAGTVELGRARHPDWHTG 249 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR--SLR 306 K+ ++ + + I + + M ++PG+I+ + ++ Sbjct: 250 PKVRPYLRVANVFEDRIDSSDVKVMDFS--GVFGKYRLEPGDILLNEGQSPHLVGRPAMY 307 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKR 364 S + + + R + F + L +K Sbjct: 308 RGIPEGVAFTNSLLRFRASGDVLPGWALLVFRRHLHAGRFMREVRITTNLAHLSGARLKT 367 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + VPP+ EQ + A + +++ R + ++ A +G+ Sbjct: 368 VEFPVPPLDEQRHLVRTTKQRLAAFGRIERGLDRVARHNSAVRRALLSEAFSGR 421 >gi|153811905|ref|ZP_01964573.1| hypothetical protein RUMOBE_02298 [Ruminococcus obeum ATCC 29174] gi|149832039|gb|EDM87124.1| hypothetical protein RUMOBE_02298 [Ruminococcus obeum ATCC 29174] Length = 385 Score = 86.8 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 63/398 (15%), Positives = 134/398 (33%), Gaps = 31/398 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + +F + R SE ++ + S K++P N+ +D + KGQ Sbjct: 6 KQLGQFIRQVDIRNSEGKEENLL-----GVSVQKKFIPSIANTVGTDFKKYKVVKKGQFT 60 Query: 88 Y----GKLGPYLRKAIIADFD-GICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEA 139 Y + G + A++ D++ G+ S + V + K ++PE L W + + Sbjct: 61 YIPDTSRRGDKIGIALLEDYEEGLVSNVYTVFEIIDEKQLIPEYLMLWFSRPEFDRYARF 120 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+ DW + + +P+PP +Q I + I I + + + L + Sbjct: 121 KSHGSVREVMDWDEMCKVELPVPPYEKQEEIVD----GYKTITERIALKQKINDNLANTE 176 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 QA+ V + +G + D + T +T S Sbjct: 177 QAIWVETVINN--------HTVPTALGDLVDFIDGDRGKNYPTFDEFTSTGYCLFLNASN 228 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + P +IV + E I S Sbjct: 229 VTSTGFNFDNCMFVSEEKDKLMNKGHLSPYDIVLTSRGTLGNVALYDKHIKYENVRINSG 288 Query: 320 YMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF 376 + ++P ++ L++S + SG + L +D++++ +P E Sbjct: 289 MLIIRPKTKRLSPYFIYVLLKSSYMKAAIERFKSGSAQPQLPIKDLQKITFEIP---ESD 345 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + ++ + ++ + I LKE + +A Sbjct: 346 TVLVALDRQFLAVEESISINNNEIDNLKELSNVLLAEL 383 >gi|323972574|gb|EGB67777.1| hypothetical protein ERHG_01317 [Escherichia coli TA007] Length = 132 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 27/132 (20%), Positives = 52/132 (39%), Gaps = 13/132 (9%) Query: 303 RSLRSAQVMERGIITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ--- 355 +++ G++T+ Y+ P Y S L + G R Sbjct: 2 GAIKRLNRYPEGVVTTLYICFELTTPKKSCGDYWEHYFESGLLNNSLSQIAHEGGRAHGL 61 Query: 356 -SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 ++K D L V VP +EQ I +V++ I L E+ + LK+ + + + Sbjct: 62 LNVKPSDFFSLKVAVPGFEEQQKIASVLSAADTEISTL----EKKLACLKDEKKALMQQL 117 Query: 415 VTGQIDLR-GES 425 +TG+ ++ E+ Sbjct: 118 LTGKRRVKVDEA 129 >gi|294668321|ref|ZP_06733424.1| hypothetical protein NEIELOOT_00233 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291309639|gb|EFE50882.1| hypothetical protein NEIELOOT_00233 [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 385 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 60/393 (15%), Positives = 121/393 (30%), Gaps = 34/393 (8%) Query: 43 ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 DI + + ++ KD + T S G IL G + I Sbjct: 11 SESGDIPFYKISTFGGIADAFISKDIFEK--YRETYSYPKIGDILISAAGTLGKTVIFDG 68 Query: 103 FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP 162 +V + + L G+T++ I N+ + P Sbjct: 69 KPSYFQDSNIVWVDN--DEKTVINSFLYYFYQTNPWIKTTGSTINRLYNNDIKNLEISFP 126 Query: 163 PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD---VKMKD 219 L +Q I + +D IT + L+E + L Y + PD K Sbjct: 127 DLIKQQSIAAVL----SALDKKITLNKQINARLEEMAKTLYDYWFVQFDFPDANGKPYKS 182 Query: 220 SGIEWVGLV------PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI--IQKLETR 271 SG E V P WEVK + + ++ N+ + R Sbjct: 183 SGGEMVFDETLKRKIPKGWEVKSLNQVADIVMGQSPDGASYNLEQEGTIFFQGSTDFDWR 242 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 ++ + + G+I+ D I A++ +++ Sbjct: 243 FPNVRQYTTSPTRFAQKGDILLSVRAPVGDL-----NISPFECCIGRGLAALRSKSGNNS 297 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP---IKEQFDITNVINVETAR 388 +L ++M+ + S+ +D+ L ++ P +++ +I ++ Sbjct: 298 FLFYVMKYFKTVFERRNTEGTTFGSITKDDLHSLKLVAPADNVLEKYNEIA-------SK 350 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 D ++ Q L + R + + GQ+ + Sbjct: 351 YDEMIFIRSQQSHQLTQLRDFLLPMLMNGQVSV 383 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 52/171 (30%), Gaps = 10/171 (5%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQIVDPGEIVFRFIDLQ 299 + + K +I + E Y ETY G+I+ Sbjct: 1 MCKRILKEETSESGDIPFYKISTFGGIADAFISKDIFEKYRETYSYPKIGDILISAAGTL 60 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 K + + ++ + +++L + ++ K L Sbjct: 61 G-KTVIFDGKPSYFQDSNIVWVDNDEKTVINSFLYYFYQTNPWIK----TTGSTINRLYN 115 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 D+K L + P + +Q I V++ +D + +Q L+E + Sbjct: 116 NDIKNLEISFPDLIKQQSIAAVLSA----LDKKITLNKQINARLEEMAKTL 162 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 64/204 (31%), Gaps = 8/204 (3%) Query: 10 YKDSGVQWI------GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + + IPK W+V + + + G++ + + G+ + Sbjct: 180 YKSSGGEMVFDETLKRKIPKGWEVKSLNQVADIVMGQSPDGASYNLEQEGTIFFQGSTDF 239 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + N RQ TS KG IL P I+ F+ L+ K Sbjct: 240 DWRFPNVRQYTTSPTRFAQKGDILLSVRAPV-GDLNISPFECCIGRGLAALRSKSGNNSF 298 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L +++ T EG T + ++ + P E I Sbjct: 299 LF-YVMKYFKTVFERRNTEGTTFGSITKDDLHSLKLVAPADNVLEKYNEIASKYDEMIFI 357 Query: 184 LITERIRFIELLKEKKQALVSYIV 207 + + +L L++ V Sbjct: 358 RSQQSHQLTQLRDFLLPMLMNGQV 381 >gi|323481350|gb|ADX80789.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis 62] Length = 292 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 53/317 (16%), Positives = 104/317 (32%), Gaps = 33/317 (10%) Query: 97 KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGN 156 A + + +++ ++ +L+ + + G + K + N Sbjct: 6 VAYLTQGKFWLNNHAHIMRMRNGSN----YFLVQVLEKIDYKKYNTGTAQPKLNSKIVKN 61 Query: 157 IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 I + IP + EQ I ++D +I R ++LLKE K+ + + P Sbjct: 62 IELKIPHIEEQQQIGNF----FKQLDDIIALHQRKLDLLKETKKGFLQKMF-----PKNG 112 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 K I + G WE + E + + Sbjct: 113 AKVPEIRFPGFTG-DWEQCKLGDIAKMYQPPTISGSELLDTGYPVFGANGYIGFYSKSNH 171 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 E ++ S A V G S + V+ I+ +L + Sbjct: 172 LED----------QVTISARGEGTGTPSYVKAPVWITG--NSMVINVEDFDINKKFLYAM 219 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + SY L K G + L + + ++P+++P EQF I ++D + Sbjct: 220 LLSYSLKKYI---TGGAQPQLTRDVLLKVPIIIPSYNEQFKIGTF----FKQLDDTIALQ 272 Query: 397 EQSIVLLKERRSSFIAA 413 ++ + LLKE + F+ Sbjct: 273 QRKLDLLKETKKGFLQK 289 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 55/196 (28%), Gaps = 26/196 (13%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ + K+ T + + Y N Sbjct: 114 KVPEIRFPGFTGDWEQCKLGDIAKMYQPPTISGSELL-----------DTGYPVFGANGY 162 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S + Q+ G + +V+ +D + + + Sbjct: 163 IGFYSKSNHLE-DQVTISARGEGTGTPSYVKAPVWITGNSMVINVEDFDINKKFLYAMLL 221 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ G + +P+ IP EQ I ++D I + R Sbjct: 222 SY--SLKKYITGGAQPQLTRDVLLKVPIIIPSYNEQFKIGTF----FKQLDDTIALQQRK 275 Query: 192 IELLKEKKQALVSYIV 207 ++LLKE K+ + + Sbjct: 276 LDLLKETKKGFLQKMF 291 >gi|268323778|emb|CBH37366.1| putative type I restriction enzyme, DNA specificity domain [uncultured archaeon] Length = 323 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 69/193 (35%), Gaps = 2/193 (1%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 + T +N + + ++ N + + E E ++ + Sbjct: 13 ECIINDVSIKIHYGYTAKANENGRGSKYLRITDIQENKVNWDTVPFCEIDDEEIEKFE-L 71 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKV 345 IVF K L V + + S + +K + ID Y+ +S + Sbjct: 72 KENNIVFARTGGTVGKSFLIKNDVPSKAVFASYLIRIKLSNYIDKKYIYLFFQSLNYWSQ 131 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 +GL+ ++ + + +L + + P+ EQ I I +D + ++++ LK Sbjct: 132 IELGKTGLKTNVNAQILSKLKLNLAPLPEQRAIVAKIEQLFCDLDNGMANLKKAQEQLKI 191 Query: 406 RRSSFIAAAVTGQ 418 R + + A G+ Sbjct: 192 YRQAVLKKAFEGE 204 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 42/325 (12%), Positives = 106/325 (32%), Gaps = 26/325 (8%) Query: 21 IPKHWKVVPIKRF-TKLNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP +W+ I K++ G T +E+G+ Y+ + D++ + + Sbjct: 7 IPDNWEECIINDVSIKIHYGYTAKANENGRGSKYLRITDIQENKVNWDTVPFCEIDDEEI 66 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSID 132 + I++ + G + K+ + D + ++ + ++ + + + Sbjct: 67 EKFELKENNIVFARTGGTVGKSFLIKNDVPSKAVFASYLIRIKLSNYIDKKYIYLFFQSL 126 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + ++ + + + + + + PL EQ I KI +D + + Sbjct: 127 NYWSQIELGKTGLKTNVNAQILSKLKLNLAPLPEQRAIVAKIEQLFCDLDNGMANLKKAQ 186 Query: 193 ELLKEKKQALVSYIVT----KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 E LK +QA++ G K + + ++N Sbjct: 187 EQLKIYRQAVLKKAFEGEFTGGTKRWACKKMEAVVELIDG----------DRGPNYPKRN 236 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRS 304 L L LS N+ N + + Q+ ++ G+I+ + Sbjct: 237 DYLYGGYCLFLSTKNVRPDGFEFNETVYISEEKHNQLRKGTLNRGDIILTTRGTIGNVAY 296 Query: 305 LRSAQVMERGIITSAYMAVKPHGID 329 + + I S + + + + Sbjct: 297 YGESVPFDVIRINSGMLILSRNSVH 321 >gi|84385716|ref|ZP_00988747.1| type I restriction-modification system specificity subunit [Vibrio splendidus 12B01] gi|84379696|gb|EAP96548.1| type I restriction-modification system specificity subunit [Vibrio splendidus 12B01] Length = 400 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 56/387 (14%), Positives = 123/387 (31%), Gaps = 23/387 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDII--YIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 WK V + + I +GLE ++S + + + Sbjct: 15 SDWKKVKFGEVVFEPKESVKDPIAEGIEHVVGLEHIDSEDMHLRRSATIEKSTTFTKKFC 74 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIE 138 G +L+G+ YL+KA A+F GICS V++ K+ + P+LL + + Sbjct: 75 I--GDVLFGRRRAYLKKAAQANFKGICSGDITVMRAKEDILEPDLLPFIVNNDKFFDHAI 132 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G +K + + +P +Q + E + ++ + K Sbjct: 133 THSAGGLSPRVKFKDLADYEFYLPAKDKQFELIELLNGALSALNA--------KNISSRK 184 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 ++L+ + LN S + ++ + + A T L Sbjct: 185 VESLLKSFQNQYLNKGYYSNRSLLPDDWVMKNIKDFAKVQAGATPLRSNKDYFDNGTTYW 244 Query: 259 LSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++ + + + ++ ++ N +V Sbjct: 245 VKTLDLNNGEINFSEEKISDKAIQKTSCKVKPINTVLVAMYGGFNQIGRTGILKVEAATN 304 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + V + YL ++ + A+ S ++ +DV+ PV +PP+ Q Sbjct: 305 QAISAIEVDESIVLPEYLLHVLNAKVEYWKKVAISSRKDPNITKDDVENFPVPIPPLSTQ 364 Query: 376 FDITNVINVETARIDVLVEKIEQSIVL 402 + + I L + + I Sbjct: 365 V----HLIKQVNEILNLQKSLN--IEK 385 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 60/170 (35%), Gaps = 10/170 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS-ESGKDI------IYIGLEDVESGTGKYLPKDGNSRQS 73 +P W + IK F K+ G T S KD ++ D+ +G + + + + Sbjct: 208 LPDDWVMKNIKDFAKVQAGATPLRSNKDYFDNGTTYWVKTLDLNNGEINFSEEKISDKAI 267 Query: 74 DTSTVSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++ + +L G + + + I + + ++ + + + Sbjct: 268 QKTSCKVKPINTVLVAMYGGFNQIGRTGILKVEAATNQAISAIEVDESIVLPEYLLHVLN 327 Query: 132 DVTQRIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + + + N P+PIPPL+ QV + +++ Sbjct: 328 AKVEYWKKVAISSRKDPNITKDDVENFPVPIPPLSTQVHLIKQVNEILNL 377 >gi|315919694|ref|ZP_07915934.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313693569|gb|EFS30404.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 402 Score = 86.8 bits (213), Expect = 7e-15, Method: Composition-based stats. Identities = 54/404 (13%), Positives = 128/404 (31%), Gaps = 41/404 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTV 78 WK + + ++ G + I ++ + + + + + + D+S + Sbjct: 23 EWKETTLGKIAEITKGSGISKDQLSEQGSPCILYGELYTKYKSEIINEVYSRTELDSSPL 82 Query: 79 SIFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ G + + + +++ K + L+ Sbjct: 83 VKSKANDVIIPCSGETAIDISTARCVLFNNILLGGDLNIIRLK-YDDGGFFAYQLNGARK 141 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I + +G ++ H + + I + P + EQ KI ID I + + I+ Sbjct: 142 KDIARVAQGVSVVHLYGENLKQIRVYYPNIEEQ----RKITHLLSLIDGRIATQNKIIDK 197 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LK + L+ I+T V T Sbjct: 198 LKSLIKGLIDDIITLECGLLVTF-------------ETLYSKAGEGGTPTTSNMEFYDNG 244 Query: 255 NILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 NI + ++ K N E + ++ I++ + Sbjct: 245 NIPFIKIEDLNNKYLLTNKDCITELGLKKSSAWLIPTNSIIYSNGATIGAISINKYPICT 304 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVP 370 ++GI+ + ID YL + MRS K + + G ++ +D+ + +P Sbjct: 305 KQGILG----IIPNSNIDVEYLYYFMRSSYFQKEVERIVTEGTMKTAYLKDINHIKCPIP 360 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQS-IVLLKERRSSFIAA 413 +Q +I++ ++ L E IE + + ++ ++ Sbjct: 361 DSDKQKEISHALSTL-----SLKEDIENQLLKKYQIQKQYLLSQ 399 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 57/209 (27%), Gaps = 9/209 (4%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + E+ G + + +L E + YG + K ++ Sbjct: 8 DKCNVPHLRFPEFSGE-WKETTLGKIAEITKGSGISKDQLSEQGSPCILYGELYTKYKSE 66 Query: 272 NMGLKPE----SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + +++ S + ++ ++ Sbjct: 67 IINEVYSRTELDSSPLVKSKANDVIIPCSGETAIDISTARCVLFNNILLGGDLNIIRLKY 126 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + A+ + + L E++K++ V P I+EQ I + Sbjct: 127 DDGGFFAYQLNGARKKDIARVAQGVSVVHLYGENLKQIRVYYPNIEEQRKI----THLLS 182 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ID + + I LK I +T Sbjct: 183 LIDGRIATQNKIIDKLKSLIKGLIDDIIT 211 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 59/164 (35%), Gaps = 6/164 (3%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 +I +I +ED+ + S+ + I+Y G + I + Sbjct: 243 NGNIPFIKIEDLNNKYLLTNKDCITELGLKKSSAWLIPTNSIIYSN-GATIGAISINKYP 301 Query: 105 GICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 L + P + E L ++ S + +E I TM A K I +I PIP Sbjct: 302 ICTKQGILGIIPNSNIDVEYLYYFMRSSYFQKEVERIVTEGTMKTAYLKDINHIKCPIPD 361 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 +Q ++I + + ++ + +KQ L+S + Sbjct: 362 SDKQ----KEISHALSTLSLKEDIENQLLKKYQIQKQYLLSQMF 401 >gi|254481808|ref|ZP_05095051.1| Type I restriction modification DNA specificity domain protein [marine gamma proteobacterium HTCC2148] gi|214037937|gb|EEB78601.1| Type I restriction modification DNA specificity domain protein [marine gamma proteobacterium HTCC2148] Length = 386 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 51/368 (13%), Positives = 130/368 (35%), Gaps = 33/368 (8%) Query: 27 VVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 V IK K+ TG+T G DI ++ D+ Q T+ + Sbjct: 5 VRAIKHVAKVATGKTPSRKLDDNFGGDIPFVTPGDL-GLAAYITEAPQTLSQKGAETIKL 63 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 K ++ +G L K IA + + Q + + G+ + ++EA+ Sbjct: 64 IPKNAVMVSCIGT-LGKVAIAGRELATNQQINSVIFDETKVFPKYGYYALGRLKPKMEAL 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 T++ + ++ + +PPL EQ I + R + I+L E + Sbjct: 123 APSTTVAIINKSNFESLEISVPPLEEQKRIAAILDKADNLRRK----RQQAIQLADEFLR 178 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A+ + + K S +G + + + K + ES + ++ Sbjct: 179 AVFLDMFGEMFTTKGYEKASR-RKIGELTSYI----------DYRGKTPEKSESGVPLIT 227 Query: 261 YGNIIQKLET-----RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 N+ + + + + + + +++F + L + E+ + Sbjct: 228 AKNVKKGYISEEPREFIPEENYLEWMSRGLPEKNDVLFTTEAPLGNVALLGNY---EKVV 284 Query: 316 ITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIK 373 + ++++ + +L + + + + SG + ++ +++ + + VP ++ Sbjct: 285 VGQRLISLRSLGKVTQEFLMHALLNRFVQGLIEKRSSGSTVKGIRTKELYEIEIPVPNLE 344 Query: 374 EQFDITNV 381 +Q + + Sbjct: 345 DQKRFSKI 352 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 49/139 (35%), Gaps = 8/139 (5%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + L + ET +++ ++ I K ++ ++ I S Sbjct: 45 YITEAPQTLSQKGAETIKLIPKNAVMVSCIGTLG-KVAIAGRELATNQQINSVIFDETKV 103 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 Y + + A S + + + L + VPP++EQ I +++ Sbjct: 104 F--PKYGYYALGRLKPKMEALAP-STTVAIINKSNFESLEISVPPLEEQKRIAAILD--- 157 Query: 387 ARIDVLVEKIEQSIVLLKE 405 + D L K +Q+I L E Sbjct: 158 -KADNLRRKRQQAIQLADE 175 >gi|324993831|gb|EGC25750.1| type I restriction-modification system specificity determinant [Streptococcus sanguinis SK405] Length = 390 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 63/410 (15%), Positives = 136/410 (33%), Gaps = 33/410 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +WK V + + N T G I +E +E T + + + F Sbjct: 2 NNWKKVKLSDIIEFNPRETLSKGAIAKKIAMEKLEPFTRDIPEFEY----LEYRGGTKFR 57 Query: 83 KGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G L ++ P L D G ST+F+V++ K+ + + + L I + Sbjct: 58 NGDTLMARITPSLENGKTSKVNLLDEDEVGFGSTEFIVVRAKENISDENFVYYLMIAPSI 117 Query: 136 R---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 R I+++ + + N + PPL EQ+ I + + A +I+ Sbjct: 118 REVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGKILKALDDKIENNKKINHHLE 177 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E + + + +K I+ V DH F +L + Sbjct: 178 E---------ILQANLEKQLESISIKSKIIDLNLTVSDHVANGSFKSLKDNVKLVEKTDY 228 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + ++ N + E R + + + E++ + + + Sbjct: 229 ALFLRNIDLKNHLNG-ERRYVTESSYEFLKKSRLYGHEVIISNVADVGSVHRVPKMNMPM 287 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 + + + + YL S ++ SG +Q D + L + + Sbjct: 288 VAG-NNVVFLQSENSLLTDYLYVYFNSRLGQHDIMSITSGSAQQKFNKTDFRNLEIPILS 346 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +I + + I ++ I + I L + R++ + ++G+I + Sbjct: 347 DD-------IIKKKISSILHYIDNIHEEIACLMKIRATLLPKLLSGEISV 389 >gi|149391962|emb|CAL68658.1| restriction-modification enzyme [Thermus scotoductus] Length = 1251 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 51/381 (13%), Positives = 111/381 (29%), Gaps = 49/381 (12%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+V + G+ K G V G+ + Sbjct: 893 WEVRKVGDVCNFEYGKGLPQNKRQP--GPYPVIGSNGRV----------GFHNQYLVEGP 940 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I+ G+ G + T F V + +L + ++ + G Sbjct: 941 AIIVGRKGTAGAVYWEDNNCWPIDTTFYVKLKASDIS---LRYLYLMLQELHLDKLSGGV 997 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + +P+PPL Q I ++ A ++ E ++ KEK QA + Sbjct: 998 GVPGLNRDDVYQQKIPVPPLDVQAQIVDECQAIDAEVEQAEKEVSDCYQIAKEKVQACFA 1057 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 L V + + T+ E + + + GN+ Sbjct: 1058 QGQVTALGTLV------------------------HINRESTDPTQFSEKSFIYVDIGNV 1093 Query: 265 IQKLETRNMGL----KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + K +I G ++ + + + + ++ + Sbjct: 1094 EKGTGVIDYSQVITGKDAPSRARRIAPKGSVIISTVRPNLRGFAFIDRDTAD-CVFSTGF 1152 Query: 321 MAVKPHG----IDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 ++ + + M S DL AMG S+ D++ L + VP ++ Q Sbjct: 1153 AVLESKDESVLKNKSLFYAFMFSDDLMAQMIDAMGKAAYPSINQTDIENLRIRVPDVQAQ 1212 Query: 376 FDITNVINVETARIDVLVEKI 396 + ++ ++ I Sbjct: 1213 EKLIQELDKLETQLQSARAVI 1233 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 44/161 (27%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 K ++ + YG + + + + +V + K + Sbjct: 890 QSKWEVRKVGDVCNFEYGKGLPQNKRQPGPYPVIGSNGRVGFHNQYLVEGPAIIVGRKGT 949 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + + L +L + G L +DV + Sbjct: 950 AGAVYWEDNNCWPIDTTFYVKLKASDISLRYLYLMLQELHLDKLSGGVGVPGLNRDDVYQ 1009 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + VPP+ Q I + A ++ +++ + KE Sbjct: 1010 QKIPVPPLDVQAQIVDECQAIDAEVEQAEKEVSDCYQIAKE 1050 >gi|262403984|ref|ZP_06080539.1| type I restriction-modification system specificity subunit S [Vibrio sp. RC586] gi|262349016|gb|EEY98154.1| type I restriction-modification system specificity subunit S [Vibrio sp. RC586] Length = 391 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 57/402 (14%), Positives = 121/402 (30%), Gaps = 24/402 (5%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYL-PKDGNSRQSDTSTVSIFA 82 V IK K+ TG+T + + G + VE G+ +++ P ++ + + + Sbjct: 2 VSIKSVAKVTTGKTPSKKVEEYFGGHIPFISPVELGSAQFVSPAKQTLTEAGAAQIKLVP 61 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K +L +G L K IAD + Q + + L G+ + +E+I Sbjct: 62 KNSVLVCCIGS-LGKLAIADQTLATNQQINSVTFDEKLVFPKYGYYALSRLKPILESIAP 120 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 T++ + +P+PPL EQ I + E L Sbjct: 121 ATTVAIVSKSKFEELEIPLPPLEEQKRIAAILDKADAIRQKRKQAITLADEF-------L 173 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 S + +P K + + + + + I +++ G Sbjct: 174 RSVFLEMFGDPVTNPKGWSRKEIKEGVSRITSGWSAKGDSRPCGQGEVGV-LKISAVTSG 232 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 K + G+++F + + + + + Sbjct: 233 EFKPKENKFVEKHIIPEGKNLIFPKKGDLLFSRANTRELVAATCIVPKDCDDVFLPDKLW 292 Query: 323 VK---PHGIDSTYLAWLMRSYDLCKVFYAMG---SGLRQSLKFEDVKRLPVLVPPIKEQF 376 + Y L++ + + SG ++ + + PI Q Sbjct: 293 NIELSSEELMPEYFHMLLQDDKFKETLTSQATGSSGSMLNISKQKFETTLAPFAPIDLQM 352 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 N+ ++ S L E+ ++ A +GQ Sbjct: 353 KFKNIYWHLKDNA----ANMKNSEDYLIEQFNALSQKAFSGQ 390 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 62/208 (29%), Gaps = 22/208 (10%) Query: 22 PKHWKVVPIKR-FTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK W IK +++ +G +++ ++ + + V SG K + Sbjct: 188 PKGWSRKEIKEGVSRITSGWSAKGDSRPCGQGEVGVLKISAVTSGEFKPKENKFVEKHII 247 Query: 75 TSTVSIF--AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-------PELLQ 125 ++ KG +L+ + A C FL + ++ PE Sbjct: 248 PEGKNLIFPKKGDLLFSRANTRELVAATCIVPKDCDDVFLPDKLWNIELSSEELMPEYFH 307 Query: 126 GWLLSIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L + + + G+ +M + + P P+ Q+ + Sbjct: 308 MLLQDDKFKETLTSQATGSSGSMLNISKQKFETTLAPFAPIDLQMKFKNIYWHLKDNAAN 367 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGL 211 + E+ AL + L Sbjct: 368 MKNSEDYL----IEQFNALSQKAFSGQL 391 >gi|298502305|ref|YP_003724245.1| type I restriction-modification system subunit S [Streptococcus pneumoniae TCH8431/19A] gi|298237900|gb|ADI69031.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae TCH8431/19A] Length = 522 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 69/441 (15%), Positives = 133/441 (30%), Gaps = 71/441 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPLAEQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKG--------------LNPDVKMKDSGIEW-------- 224 +L KE ++++ Y + L K E Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 225 ------------------------VGLVPDHWEVKPFFALVTELNRKNTK-----LIESN 255 + +P+ W F +LV K + Sbjct: 323 DISIVSQGDDNSYYGNKDETTSYPIYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTE 382 Query: 256 ILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I +S ++ N + + I G ++ F L Sbjct: 383 IPWVSISDMPISGYVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATH 442 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 II+ + I YL + G ++L + L + + Sbjct: 443 NEAIIS-IFPYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISN 499 Query: 372 IKEQFDITNVINVETARIDVL 392 +E I + +++ ++ L Sbjct: 500 HEEMKRIISKVDLLFQKVSQL 520 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 347 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 406 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 407 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 464 >gi|219669965|ref|YP_002460400.1| restriction modification system DNA specificity domain protein [Desulfitobacterium hafniense DCB-2] gi|219540225|gb|ACL21964.1| restriction modification system DNA specificity domain protein [Desulfitobacterium hafniense DCB-2] Length = 406 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 49/408 (12%), Positives = 113/408 (27%), Gaps = 29/408 (7%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G I V +K + L+ ++ YI D+ + +D + Sbjct: 2 GEI-----VKKLKSY-PLSRDVETKERTGYRYIHYGDIHKQIADLIVQDEDLPSIKEGDY 55 Query: 79 SIFAKGQILYGKLGPYLRKAIIA-------DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +G ++ + I + ++P+ L L + Sbjct: 56 IPLNQGDLVLADVSEDYTGIAEPSIILHEPKTKIIAGLHTIAIRPQSATSLYLYYLLHTE 115 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + G + + + + P EQ I I + + Sbjct: 116 RFKKFGSHVGTGLKVFGITFNNLSLFQIKTPSFPEQTAIGNFFRTLDDTITLHKRKLDKL 175 Query: 192 IELLKEKKQALVSYI------VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 EL Q L V + S + + + + Sbjct: 176 KELKNGYLQKLFPQPGEDVPRVRFAGFNEPWEVRSFENILAPAVASNTLSRAELSYEKGS 235 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 KN + + Y +I + + Y+ ++ G+++F Sbjct: 236 IKNIHYGDILVRFGVYIDIARDPIPCIANGRIIDYKNK-LLQEGDVIFADTAEDETVGKA 294 Query: 306 RSAQVMERGIITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFED 361 + + S + + YL + + S+ + G++ SL ++ Sbjct: 295 VEITNISNFQVVSGLHTMAYRPKIKMSPYYLGYYLNSHSFRYQLLPLMQGVKVLSLSRKN 354 Query: 362 VKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + + P + EQ I + + +I L + LK+ +S Sbjct: 355 LSKTLIRYPAVLSEQSQIGDFLRNLDEQIFTLY----NKLGKLKQLKS 398 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 8/172 (4%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--- 304 + I I L ++ L Y ++ G++V + + Sbjct: 20 KERTGYRYIHYGDIHKQIADLIVQDEDLPSIKEGDYIPLNQGDLVLADVSEDYTGIAEPS 79 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVK 363 + + + I +A++P S YL +L+ + K +G+GL + F ++ Sbjct: 80 IILHEPKTKIIAGLHTIAIRPQSATSLYLYYLLHTERFKKFGSHVGTGLKVFGITFNNLS 139 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + P EQ I N +D + ++ + LKE ++ ++ Sbjct: 140 LFQIKTPSFPEQTAIGNF----FRTLDDTITLHKRKLDKLKELKNGYLQKLF 187 >gi|319777746|ref|YP_004137397.1| restriction modification system DNA specificity domain [Mycoplasma fermentans M64] gi|318038821|gb|ADV35020.1| Restriction modification system DNA specificity domain [Mycoplasma fermentans M64] Length = 372 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 52/383 (13%), Positives = 127/383 (33%), Gaps = 26/383 (6%) Query: 46 KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG 105 K++ P G + + I+ G++G I + Sbjct: 6 KELGIFETGSTLIKKIGNFPAFGGNGIITYVNKWNVDEDAIIIGRVGANCGCVNITNKKS 65 Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 + L+ +PK+ + + + G++ +GNI + IP L Sbjct: 66 FVTDNALIFKPKEKNMARFYFYF---LLHLNLNKFHIGSSQPLLTQGILGNIKINIPSLN 122 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 + I + + ID I ++ L+ QA+ + + + K E + Sbjct: 123 KCQKISKILDN----IDNQIERNNSMVQKLQVMGQAIFNRWFLQFEHFKKDNKFKYNEDL 178 Query: 226 G-LVPDHWEVKPFFALVTELNR-----KNTKLIESNILSLSYG---NIIQKLETRNMGLK 276 +P++WEVK + KN + I L+ G N + + K Sbjct: 179 NLKIPENWEVKKIAEICKIFLGGTPSTKNREYWNGEINWLNSGEVANFPIIDSEKTINEK 238 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 +++ G +V ++R + + I + + ++ + + + Sbjct: 239 GLKNSNTKLLKKGTVVISITG------NIRVSYLAIDSCINQSIVGIEENELLKIGYLYP 292 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + + + ++ + ++ L +++PP ++ ++ N T I + +I Sbjct: 293 FLKNKIEFLIRSSTGNCQKHINKNFIENLKIVLPP----KNVLDIFNNLTQNIYAKISQI 348 Query: 397 EQSIVLLKERRSSFIAAAVTGQI 419 L + ++ + + QI Sbjct: 349 SLMTKKLIKFKNKLLPLLINQQI 371 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 65/192 (33%), Gaps = 9/192 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP++W+V I K+ G T + +I ++ +V + K N + Sbjct: 181 KIPENWEVKKIAEICKIFLGGTPSTKNREYWNGEINWLNSGEVANFPIIDSEKTINEKGL 240 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + KG ++ G + D + +V ++ L ++ + + Sbjct: 241 KNSNTKLLKKGTVVISITGNI--RVSYLAIDSCINQS-IVGIEENELLKIGYLYPFLKNK 297 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + G H + I N+ + +PP + +I + + I+ Sbjct: 298 IEFLIRSSTGNCQKHINKNFIENLKIVLPPKNVLDIFNNLTQNIYAKISQISLMTKKLIK 357 Query: 194 LLKEKKQALVSY 205 + L++ Sbjct: 358 FKNKLLPLLINQ 369 >gi|293408034|ref|ZP_06651874.1| conserved hypothetical protein [Escherichia coli B354] gi|291472285|gb|EFF14767.1| conserved hypothetical protein [Escherichia coli B354] Length = 576 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 46/190 (24%), Positives = 79/190 (41%), Gaps = 2/190 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ I + + S +Y + + +E GTG+ + K Sbjct: 388 ELPEGWEWCRIGNIVNIKSELVSPKDYLNLYQVAPDIIEKGTGRVISKRTVKESGVKGPN 447 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 S F KGQI+Y K+ P L K +A+++G+CS L + P L ++LSI +++ Sbjct: 448 SRFYKGQIVYSKIRPSLSKVFLAEYNGLCSADMYPLDC-YINPNYLLKYILSIPFLMQVK 506 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 M + NI + IPP EQ I +KI + + LI+ + + Sbjct: 507 KAENRIKMPKLNSDSFYNIIVAIPPYNEQQAIFDKINSIEAVCNGLISYIGIYHKTQLHL 566 Query: 199 KQALVSYIVT 208 AL + Sbjct: 567 ADALTDAAIN 576 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 69/482 (14%), Positives = 136/482 (28%), Gaps = 98/482 (20%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLE 54 +K K P+ S + +P+ W+ + G + K+I+ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWTRLINLGIWALGSGFPNVVQGSTDKEILMCKVS 140 Query: 55 DVE-SGTGKYLPKDGNSRQSDTS---TVSIFAKGQILYGKLG---PYLRKAIIADFDGIC 107 D+ G K++ N+ D + + I G I++ K+G ++ I+ I Sbjct: 141 DMNLEGNEKFIFSTKNTISKDLADEYKIKISEPGTIIFPKIGGAIATNKRRILVQDTAID 200 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + + + E L ++D + G ++ + IG+IP+ +P L Q Sbjct: 201 NNCLGIKPCDAISGEWFYLILNTLD----MSKYQSGTSIPAINQSVIGSIPIALPSLKMQ 256 Query: 168 VLIREK-----------------------------------------IIAETVRIDTLIT 186 I + RI Sbjct: 257 EKIVSYVITLMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNAEELAENWTRISEHFD 316 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD--------------------------- 219 + KQ ++ V L P + Sbjct: 317 TLFTTEASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKP 376 Query: 220 ----SGIEWVGLVPDHWEVKPFFALVTEL---NRKNTKLIESNILSLSYGNIIQKLETRN 272 S E +P+ WE +V L + ++ ++ Sbjct: 377 LPPISDEEKPFELPEGWEWCRIGNIVNIKSELVSPKDYLNLYQVAPDIIEKGTGRVISKR 436 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + G+IV+ I K L G+ ++ + + + Sbjct: 437 TVKESGVKGPNSRFYKGQIVYSKIRPSLSKVFLAEY----NGLCSADMYPLDCYINPNYL 492 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 L +++ L +V A L + + V +PP EQ I + IN A + L Sbjct: 493 LKYILSIPFLMQVKKAENRIKMPKLNSDSFYNIIVAIPPYNEQQAIFDKINSIEAVCNGL 552 Query: 393 VE 394 + Sbjct: 553 IS 554 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 66/204 (32%), Gaps = 16/204 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI-----ESNILSLSYGNIIQKLETRNMG 274 S E +P+ WE L + + IL ++ + + + Sbjct: 93 SEEEKPFELPEGWEWTRLINLGIWALGSGFPNVVQGSTDKEILMCKVSDMNLEGNEKFIF 152 Query: 275 LKPESYETY-------QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + +I +PG I+F I + R V + I + Sbjct: 153 STKNTISKDLADEYKIKISEPGTIIFPKIGGAI-ATNKRRILVQDTAIDNNCLGIKPCDA 211 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 I + ++ + D+ K ++ + +P+ +P +K Q I + + + Sbjct: 212 ISGEWFYLILNTLDMSKY---QSGTSIPAINQSVIGSIPIALPSLKMQEKIVSYVITLMS 268 Query: 388 RIDVLVEKIEQSIVLLKERRSSFI 411 D L ++ S+ ++ + + Sbjct: 269 LCDQLEQQSLTSLDAHQQLVETLL 292 >gi|293384344|ref|ZP_06630229.1| restriction endonuclease S subunit [Enterococcus faecalis R712] gi|291078336|gb|EFE15700.1| restriction endonuclease S subunit [Enterococcus faecalis R712] Length = 409 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 63/405 (15%), Positives = 137/405 (33%), Gaps = 32/405 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESG------TGKYLPKDGNSRQ 72 + W++ +++ T +G T + +V+ + NS Sbjct: 18 EDWELCKLEKLTDFFSGLTYSPDNVQKDGTFVLRSSNVKDNAIISADNVYVRNEVANSEH 77 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 V + + G + A I + + P+ L L + Sbjct: 78 VQVGDVIVVVRN----GSRSLIGKHAPINREMPNTVIGAFMTGLRSPSPKFLNTLLDTQQ 133 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 I GAT++ + +P ++ EKI + ++D +IT R + Sbjct: 134 FNVEIHKNL-GATINQITTGEFKRMHFIVPTDEDEK---EKIGSLFRQLDDIITLHQRKL 189 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + LKE K+A + + K++ + E G T+ ++ + Sbjct: 190 DQLKELKKAYLQVMFPAKDERVPKLRFADFE--GEWEQCKLGNILTERNTQQSKSKEYPL 247 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 S + ++ E + +S + Y++ + +IV+ +L K + Sbjct: 248 VSFTVEDGVTPKTERYEREQLVRGDKSSKKYKVTELNDIVYNPANL---KFGAIARNHYG 304 Query: 313 RGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPV 367 + + + Y+ + S+Y+ + D G RQS+ E++ + Sbjct: 305 KAVFSPIYITFIVNDKLACSSYVEVFITRKDFISYSLKYQQGTVYERQSVSPENLLNMKF 364 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 L+P KEQ I + ++D ++ I LK + S++ Sbjct: 365 LLPNTKEQEFIGHF----FEKLDCNSNFHKKKITQLKNLKKSYLQ 405 >gi|296535588|ref|ZP_06897769.1| specificity determinant for HsdM and hsdR [Roseomonas cervicalis ATCC 49957] gi|296264104|gb|EFH10548.1| specificity determinant for HsdM and hsdR [Roseomonas cervicalis ATCC 49957] Length = 480 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 75/439 (17%), Positives = 137/439 (31%), Gaps = 52/439 (11%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W + R ++ ++ ++GLE + + P S S Sbjct: 9 LPAGWAHTTLGEVAGEPRARVPADAKSNLPFVGLEHIAPHALR--PHGFGRFGDMRSAAS 66 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 F G +LY ++ PYL K AD +G+ S +FLVL + LL Sbjct: 67 PFTPGDVLYARMRPYLNKVWHADREGVASAEFLVLPRSGRVHPDFLALLLHHRPFVEFAR 126 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 +W I P+ +PP AEQ I + A ++ R E L + + Sbjct: 127 HASSGDRPRVEWADISKYPIALPPRAEQDRIVTAVNALFDEVEAGEASLARAREGLTQFR 186 Query: 200 QALVSYIVT------------------------------KGLNPDVKMKDSGIEWVGLVP 229 +L+ T +GL P G + +P Sbjct: 187 TSLLHAACTGALTADWRTANPTNQTAADLLAEVAAWRAARGLKPLAAASAVGTATLPTLP 246 Query: 230 DHWEVKPFFALVTELNRKNTK-------LIESNILSLSYGNIIQ---KLETRNMGLKPES 279 + W L K+ L + + + G + + ++ + + Sbjct: 247 EGWIWASLPQLGEFGRGKSKHRPRDDARLYGAAMPFIQTGEVSRSRGRITSWSRMYSDFG 306 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++ G + + + S V PH + Y+ M++ Sbjct: 307 VAQSKVWPAGTVCI----TIAANIAASGILTFDACFPDSVVGLVTPHAALARYVELFMQT 362 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 YA ++++ E + + V +PP+ E I ++ V I Sbjct: 363 ARANLEAYAPA-TAQKNINLEILNTVAVPLPPLMEIEAIVRAVDDVHTEAVEPVGTIADG 421 Query: 400 IVLLKERRSSFIAAAVTGQ 418 R S + AA TG+ Sbjct: 422 SA----LRQSILHAAFTGR 436 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 61/207 (29%), Gaps = 14/207 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGN 69 + +P+ W + + + G++ +D + +I +V G+ Sbjct: 242 LPTLPEGWIWASLPQLGEFGRGKSKHRPRDDARLYGAAMPFIQTGEVSRSRGRITSWSRM 301 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + ++ G + + + + I FD +V L Sbjct: 302 YSDFGVAQSKVWPAGTVCIT-IAANIAASGILTFDACF-PDSVVGLVTPHAALARYVELF 359 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 +EA + + + + + +P+PPL E I + + Sbjct: 360 MQTARANLEAYAPATAQKNINLEILNTVAVPLPPLMEIEAIVRAVDDVHTEAVEPVGTIA 419 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVK 216 L +Q+++ T L P Sbjct: 420 DGSAL----RQSILHAAFTGRLVPQDP 442 >gi|169834387|ref|YP_001694008.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae Hungary19A-6] gi|168996889|gb|ACA37501.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae Hungary19A-6] gi|332203679|gb|EGJ17746.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA47368] Length = 522 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 68/441 (15%), Positives = 133/441 (30%), Gaps = 71/441 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKG--------------LNPDVKMKDSGIEW-------- 224 +L KE ++++ Y + L K E Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 225 ------------------------VGLVPDHWEVKPFFALVTELNRKNTK-----LIESN 255 + +P+ W F +LV K + Sbjct: 323 DISIVSQGDDNSYYGNKDETTSYPIYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTE 382 Query: 256 ILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I +S ++ N + + I G ++ F L Sbjct: 383 IPWVSISDMPISGYVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATH 442 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 II+ + I YL + G ++L + L + + Sbjct: 443 NEAIIS-IFPYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISN 499 Query: 372 IKEQFDITNVINVETARIDVL 392 +E I + +++ ++ L Sbjct: 500 HEEMKRIISKVDLLFQKVSQL 520 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 347 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 406 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 407 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 464 >gi|170731318|ref|YP_001776751.1| type I restriction system specificity protein [Xylella fastidiosa M12] gi|167966111|gb|ACA13121.1| type I restriction system specificity protein [Xylella fastidiosa M12] Length = 399 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 37/292 (12%), Positives = 92/292 (31%), Gaps = 21/292 (7%) Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 Q I + +P+PPL Q I + + T L E +E Sbjct: 119 MQMIAYTPQDHARQWI--GTYSKFLIPVPPLEVQRQIVKVLDTFTTLEAELEAELEAELE 176 Query: 194 LLKEKKQALVSYI--VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + + Q + +G + +++ + +G F V + Sbjct: 177 ARRRQYQYYRDALLRFEEGTDAATRVRWVTLGEIG---------SFIRGVGIQKSDFIEF 227 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETY-QIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I + + + + G++V +D + A + Sbjct: 228 GSGCIHYGQIHTHYGTWADKTKSFIRSDFAARLRKANTGDLVIATTSEDDDAVAKAVAWM 287 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 + + S + H I++ Y+++ ++ +G + + + + ++ + V Sbjct: 288 GDEEVAVSTDAYIYRHTINAKYVSYFFQTKFFHSQKKPHITGTKVRRISGDSLAKIRIPV 347 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA--AAV 415 PP++ Q I V++ ++ + + I ++ R + AV Sbjct: 348 PPLEVQARIVAVLDQFDTLVNDITAGLPAEIAARRQQYAYYRDRLLTFKEAV 399 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/193 (11%), Positives = 57/193 (29%), Gaps = 11/193 (5%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIY----IGLEDVESGTGKYLPKDGNSRQSDTSTV-SIF 81 V + G + I + I + + G + K + +SD + Sbjct: 204 WVTLGEIGSFIRGVGIQKSDFIEFGSGCIHYGQIHTHYGTWADKTKSFIRSDFAARLRKA 263 Query: 82 AKGQILYGKLGPYLRKA-----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G ++ + D + ST + + + + + + + + Sbjct: 264 NTGDLVIATTSEDDDAVAKAVAWMGDEEVAVSTDAYIYR-HTINAKYVSYFFQTKFFHSQ 322 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G + + I +P+PPL Q I + ++ + I + Sbjct: 323 KKPHITGTKVRRISGDSLAKIRIPVPPLEVQARIVAVLDQFDTLVNDITAGLPAEIAARR 382 Query: 197 EKKQALVSYIVTK 209 ++ ++T Sbjct: 383 QQYAYYRDRLLTF 395 >gi|167904492|ref|ZP_02491697.1| Restriction endonuclease S subunits [Burkholderia pseudomallei NCTC 13177] Length = 329 Score = 86.4 bits (212), Expect = 8e-15, Method: Composition-based stats. Identities = 44/308 (14%), Positives = 104/308 (33%), Gaps = 29/308 (9%) Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVRIDTLITERIRFIELL 195 + G+TM H + + +P EQ + + + I I L Sbjct: 23 MNKRTHGSTMKHIKRGELREFFVSLPVDGGEQRKLAQILDTLDATIQE----TDAIIAKL 78 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEW--------VGLVPDHWEVKPFFALVTELNRK 247 K KQ L+ ++T G++ + +++ E +G +P W + + Sbjct: 79 KVVKQGLLHDLLTWGIDANGELRPPYSEAPHLYKWSALGWIPKDWTCSALQPWLDGKPKN 138 Query: 248 NTKLIE---SNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQN 300 E + + + + LKP + ++ + G+++ + ++ Sbjct: 139 GYSPQEAGAWTGIQMLGLGCLTADGFQPAQLKPAPRDDRRLCSAFLSEGDLLMSRSNTRD 198 Query: 301 DKRSLRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMRSYDLCKVFYAMG---SGLRQ 355 + + M + +L +++RS L + A SG Sbjct: 199 LVGLAGVYRDVGTPCTYPDLMMRLRPSPETSAEFLQFVLRSPQLRRQIQAQAVGTSGSMV 258 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + V L V +P EQ I + + + + +E I L++ ++ + + Sbjct: 259 KISGKIVSELVVAIPDRTEQEVILSRLLLADRCLTAEIEN----IAKLRQVKAGLMDDLL 314 Query: 416 TGQIDLRG 423 G++ + Sbjct: 315 CGRVRVTP 322 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 11/85 (12%), Positives = 34/85 (40%), Gaps = 5/85 (5%) Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDV 391 L + + + ++ + +K +++ V +P EQ + +++ +D Sbjct: 11 LLFHLLLASVAEMNKRTHGSTMKHIKRGELREFFVSLPVDGGEQRKLAQILDT----LDA 66 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVT 416 +++ + I LK + + +T Sbjct: 67 TIQETDAIIAKLKVVKQGLLHDLLT 91 >gi|75909474|ref|YP_323770.1| restriction modification system DNA specificity subunit [Anabaena variabilis ATCC 29413] gi|75703199|gb|ABA22875.1| Restriction modification system DNA specificity domain protein [Anabaena variabilis ATCC 29413] Length = 557 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 62/462 (13%), Positives = 133/462 (28%), Gaps = 95/462 (20%) Query: 24 HWKVVPIKRFTKLNTGRT------SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + +L +G+ ++ G+ + Y+ G + + + + + Sbjct: 85 GWVDTKLGYLIELVSGQHLGQEEQNDQGEGLPYLT------GPADFGEFNPVATRWTNTV 138 Query: 78 VSIFAKGQILYGKLGPYLRKAII-ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 ++ K IL G + KA I + Q + ++P +L E +LL I ++ Sbjct: 139 KALAKKNDILITVKGAGVGKANILSMEKAAIGRQLMAIRP--ILLEYEFIYLLIISSYEK 196 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA-------------------- 176 +A+ G+T+ K I + + +PPLAEQ I EK Sbjct: 197 FQALSIGSTVPGMGRKDILDFSLGLPPLAEQKRIVEKCDRLLSTCDEIEKRQQQKQESVV 256 Query: 177 -----------------ETVRIDTLITERIRFIELLKEK----KQALVSYIVTKGLN--- 212 E + I + + E +QA++ V L Sbjct: 257 RMNESAIAQLLSSQNPEEFRQHWQRICNNFDLLYSIPETIPKLRQAILQLAVQGKLTRQD 316 Query: 213 ----------------------------PDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 P E + +P W + Sbjct: 317 PNNEPASVLFEKIKFERKRLLGETNFREPKELKPIRDNEILFELPKEWVWTRIGEIFLIS 376 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET------YQIVDPGEIVFRFIDL 298 + ++ + N + + + + + L Sbjct: 377 SGTTPNRTNHKYFEDGTEYWVKTTDLNNETVLNCEEKITKQAVLDCNLKYYPVGTVCVAL 436 Query: 299 QNDKRSLRSAQV--MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 ++ + + +E I S +++ YL ++ + +A + Sbjct: 437 YGGAGTIGKSGLLGIETTINQSVCGIYPNKYVNAKYLHLYIKLIRPLWMNFAASLRKAPN 496 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + V + + P+ EQ I + + D L K++Q Sbjct: 497 INAGVVNNMVFPLAPLAEQKRIVEKCDRLMSLCDTLEAKLKQ 538 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 67/200 (33%), Gaps = 6/200 (3%) Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 Y++ N + + ++ W L+ ++ ++ E N Sbjct: 58 EYLLQNEKNKKNEFIEIKFDFNNKCLPGWVDTKLGYLIELVSGQHLGQEEQNDQGEGLPY 117 Query: 264 IIQ--KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + N + + +I+ ++ + ME+ I M Sbjct: 118 LTGPADFGEFNPVATRWTNTVKALAKKNDILIT---VKGAGVGKANILSMEKAAIGRQLM 174 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 A++P ++ ++ L+ S ++GS + +D+ + +PP+ EQ I Sbjct: 175 AIRPILLEYEFIYLLIISSYEKFQALSIGS-TVPGMGRKDILDFSLGLPPLAEQKRIVEK 233 Query: 382 INVETARIDVLVEKIEQSIV 401 + + D + ++ +Q Sbjct: 234 CDRLLSTCDEIEKRQQQKQE 253 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 51/172 (29%), Gaps = 9/172 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +PK W I +++G T ++ D+ + T + + Sbjct: 359 ELPKEWVWTRIGEIFLISSGTTPNRTNHKYFEDGTEYWVKTTDLNNETVLNCEEKITKQA 418 Query: 73 SDTSTVSIFAKGQILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + G + G + K+ + + + + P + + Sbjct: 419 VLDCNLKYYPVGTVCVALYGGAGTIGKSGLLGIETTINQSVCGIYPNKYVNAKYLHLYIK 478 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + + N+ P+ PLAEQ I EK D Sbjct: 479 LIRPLWMNFAASLRKAPNINAGVVNNMVFPLAPLAEQKRIVEKCDRLMSLCD 530 >gi|194451100|ref|YP_002048348.1| restriction modification system DNA specificity domain [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|194409404|gb|ACF69623.1| restriction modification system DNA specificity domain [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] Length = 380 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 52/381 (13%), Positives = 114/381 (29%), Gaps = 38/381 (9%) Query: 26 KVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++V + + + +G + I + D+ SG K + + Sbjct: 5 QLVTLGKHIDILSGCAFPSSGFNRNNGVPLIRIRDILSG------KTETYYEGSYDLKYL 58 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG +L G G + R+ D + + + + P + + +I A Sbjct: 59 IKKGDLLVGMDGDFNRE-YWKGTDALLNQRVCKITPNPETLDKNFLYHFLQKELDKIHAT 117 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + T+ H K I +I + +P L EQ I + Sbjct: 118 TDVVTVKHLSVKKIQDIKIRLPSLKEQKRIAAILDKADAIRQKREQA------------- 164 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + ++M + + P V + L Sbjct: 165 --IKLADDFLRAKFLEMFGTPANNIHRFPKGTIRD-LVDSVNYGTSAKASIDSGEYPILR 221 Query: 261 YGNIIQKLETRNMGLKPES----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 GNI + LK + +V G+++F + + + Sbjct: 222 MGNITYQGRWDFTDLKYLDLSVKEKDKYLVKEGDLLFNRTNSKELVGKTAVYEEDRPMAF 281 Query: 317 TSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 + V+P+ I ++ Y++ + S M + ++ ++++ + +L+PP Sbjct: 282 AGYLIRVRPNSIGNNYYISGYLNSIHGKITLMNMCKSIVGMANINAQELQNIEILIPPKH 341 Query: 374 EQFD---ITNVINVETARIDV 391 Q + I I + D Sbjct: 342 LQDEYEIIYKKIKKGLSIYDK 362 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 49/133 (36%), Gaps = 10/133 (7%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-ID 329 + SY+ ++ G+++ N R ++ + P+ Sbjct: 44 KTETYYEGSYDLKYLIKKGDLLVGMDGDFN-----REYWKGTDALLNQRVCKITPNPETL 98 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + +L K+ + L + ++ + + +P +KEQ I +++ + Sbjct: 99 DKNFLYHFLQKELDKIHATTDVVTVKHLSVKKIQDIKIRLPSLKEQKRIAAILD----KA 154 Query: 390 DVLVEKIEQSIVL 402 D + +K EQ+I L Sbjct: 155 DAIRQKREQAIKL 167 >gi|315169210|gb|EFU13227.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX1341] Length = 365 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 56/354 (15%), Positives = 124/354 (35%), Gaps = 27/354 (7%) Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQ 125 N + + + K + Y + PY + + D + + ST + ++P L Sbjct: 25 NRDSAPSRAQRLAKKNDVFYQTVRPYQKNNYLFDLPYDNYVFSTGYAQMRPSG-NGYFLL 83 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + R+ G + + + + + +P E+ K +D L+ Sbjct: 84 TLVQEEKFVNRVLERSTGTSYPAINSNDLAKLSVRVPADIEEEQNIGK---FFSNLDNLV 140 Query: 186 TERIRFIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 T R ++ LKE K A + V+ + K ++ G W+ + L Sbjct: 141 TLHQRKLDQLKELKTAYLQVMFVSMKTKNNKVPKLRFADFGGE----WDQRKSKELFIPK 196 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + KN + ++ G + + ++ + + Y++V+ + V Q Sbjct: 197 SEKNQPNLPVLSVTQDSGVVYRDQVGIDIKYDSTTLKNYKVVNKNDFVISLRSFQG---- 252 Query: 305 LRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKF 359 ++GI + AY P D+ + +++ + + G+R +S+ F Sbjct: 253 -GFELSDKKGITSPAYTIFVPKDIKLHDNLFWKTQFKTFQFIEALKTVTFGIRDGKSISF 311 Query: 360 EDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + L + P KEQ I +D + + + LK + S++ Sbjct: 312 TEFGDLKLCFPKNKKEQQKIGKF----FEELDYAISLHQNKLTQLKSLKKSYLQ 361 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 51/189 (26%), Gaps = 12/189 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP----KDGNSRQSDTSTVS 79 W K +S K+ + + V +G D + Sbjct: 183 EWDQRKSKELF------IPKSEKNQPNLPVLSVTQDSGVVYRDQVGIDIKYDSTTLKNYK 236 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSIDVTQRIE 138 + K + L + ++D GI S + + PKD+ L + L Sbjct: 237 VVNKNDFVIS-LRSFQGGFELSDKKGITSPAYTIFVPKDIKLHDNLFWKTQFKTFQFIEA 295 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + ++KI +D I+ + LK Sbjct: 296 LKTVTFGIRDGKSISFTEFGDLKLCFPKNKKEQQKIGKFFEELDYAISLHQNKLTQLKSL 355 Query: 199 KQALVSYIV 207 K++ + + Sbjct: 356 KKSYLQNMF 364 >gi|260171381|ref|ZP_05757793.1| putative type IC restriction-modification system specificity subunit, partial [Bacteroides sp. D2] Length = 404 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 54/404 (13%), Positives = 128/404 (31%), Gaps = 41/404 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTV 78 WK + + ++ G + I ++ + + + + + + D+S + Sbjct: 25 EWKETTLGKIAEITKGSGISKDQLSEQGSPCILYGELYTKYKSEIINEVYSRTELDSSPL 84 Query: 79 SIFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ G + + + +++ K + L+ Sbjct: 85 VKSKANDVIIPCSGETAIDISTARCVLFNNILLGGDLNIIRLK-YDDGGFFAYQLNGARK 143 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I + +G ++ H + + I + P + EQ KI ID I + + I+ Sbjct: 144 KDIARVAQGVSVVHLYGENLKQIRVYYPNIEEQ----RKITHLLSLIDGRIATQNKIIDK 199 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LK + L+ I+T V T Sbjct: 200 LKSLIKGLIDDIITLECGLLVTF-------------ETLYSKAGEGGTPTTSNMEFYDNG 246 Query: 255 NILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 NI + ++ K N E + ++ I++ + Sbjct: 247 NIPFIKIEDLNNKYLLTNKDCITELGLKKSSAWLIPTNSIIYSNGATIGAISINKYPICT 306 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVP 370 ++GI+ + ID YL + MRS K + + G ++ +D+ + +P Sbjct: 307 KQGILG----IIPNSNIDVEYLYYFMRSSYFQKEVERIVTEGTMKTAYLKDINHIKCPIP 362 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQS-IVLLKERRSSFIAA 413 +Q +I++ ++ L E IE + + ++ ++ Sbjct: 363 DSDKQKEISHALSTL-----SLKEDIENQLLKKYQIQKQYLLSQ 401 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 57/209 (27%), Gaps = 9/209 (4%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + E+ G + + +L E + YG + K ++ Sbjct: 10 DKCNVPHLRFPEFSGE-WKETTLGKIAEITKGSGISKDQLSEQGSPCILYGELYTKYKSE 68 Query: 272 NMGLKPE----SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + +++ S + ++ ++ Sbjct: 69 IINEVYSRTELDSSPLVKSKANDVIIPCSGETAIDISTARCVLFNNILLGGDLNIIRLKY 128 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + A+ + + L E++K++ V P I+EQ I + Sbjct: 129 DDGGFFAYQLNGARKKDIARVAQGVSVVHLYGENLKQIRVYYPNIEEQRKI----THLLS 184 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ID + + I LK I +T Sbjct: 185 LIDGRIATQNKIIDKLKSLIKGLIDDIIT 213 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 59/164 (35%), Gaps = 6/164 (3%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 +I +I +ED+ + S+ + I+Y G + I + Sbjct: 245 NGNIPFIKIEDLNNKYLLTNKDCITELGLKKSSAWLIPTNSIIYSN-GATIGAISINKYP 303 Query: 105 GICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 L + P + E L ++ S + +E I TM A K I +I PIP Sbjct: 304 ICTKQGILGIIPNSNIDVEYLYYFMRSSYFQKEVERIVTEGTMKTAYLKDINHIKCPIPD 363 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 +Q ++I + + ++ + +KQ L+S + Sbjct: 364 SDKQ----KEISHALSTLSLKEDIENQLLKKYQIQKQYLLSQMF 403 >gi|163814567|ref|ZP_02205956.1| hypothetical protein COPEUT_00718 [Coprococcus eutactus ATCC 27759] gi|158450202|gb|EDP27197.1| hypothetical protein COPEUT_00718 [Coprococcus eutactus ATCC 27759] Length = 407 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 56/425 (13%), Positives = 122/425 (28%), Gaps = 49/425 (11%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSR 71 IG IP W++ ++ + T + I D+ + + L + Sbjct: 10 IGDIPVDWELQTFDETFRVISNNTLSRENLNNRGGAVRNIHYGDILTKFPEVLDCNEEEI 69 Query: 72 QSDTSTVSIFA------KGQILYG------KLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + + G I+ +G + + D + + + K Sbjct: 70 PYVNELSLLSSSTQLLQDGDIVVADTAEDETVGKVIEVQNLGDSKLVAGLHTIPCRVKKG 129 Query: 120 L--PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 P L ++ S +I G +S I + +PP EQ I + + Sbjct: 130 DFAPGWLGYYMNSDLFHNQILPYITGIKVSSISKGAISETLILVPPFDEQEKIVQSLN-- 187 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 +I L+T + + +K K +S + + + +M+ G + WE + Sbjct: 188 --KIQLLMTSETKVVNKIKLVKNGCLSKMFPQKDDTVPEMRLPG------FTEAWEQRKL 239 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 TE+ + + Y L + + Y V + Sbjct: 240 GDEATEMLAGGDIDKSRVVENGQYPIYANALTNDGIVGYYDD---YYRVKAPAVTVTGRG 296 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 A++ + + H + + K + S L Sbjct: 297 DVGH----AQARIDDFTPVVRLLAIRSEHDV-------YFLENAINKHVVIVESTGVPQL 345 Query: 358 KFEDVKRLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + P +E+ I + +D L+ + + ++ +T Sbjct: 346 TVPQLGNYIISFPTTTEEEIKIGSY----FHNLDHLISLHQCKCDKYSNIKKGMMSDLLT 401 Query: 417 GQIDL 421 G+I L Sbjct: 402 GKIRL 406 >gi|119356723|ref|YP_911367.1| N-6 DNA methylase [Chlorobium phaeobacteroides DSM 266] gi|119354072|gb|ABL64943.1| N-6 DNA methylase [Chlorobium phaeobacteroides DSM 266] Length = 834 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 50/386 (12%), Positives = 108/386 (27%), Gaps = 45/386 (11%) Query: 32 RFTKLNTGRTSESGKD------IIYIGLEDVESGT----GKYLPKDGNSRQSDTSTVSIF 81 ++ +G T +S + + L D+ S + + + R S+ + Sbjct: 465 ELFRVESGGTPKSDVEELWNGGFPWATLADLPSTDFITEIRSTRRTISERGLRESSAKMI 524 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + ++ R AI V+ L + + A Sbjct: 525 PENSVIVSTRATIGRIAINRIPMATNQGFKNVIIEDKSKVISEYVALALTKLVPTMNAWA 584 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G T + +P+PPL Q I KI ID + + Sbjct: 585 TGGTFKEIPKSRFCELEIPLPPLEVQKKIVAKIEGYQKVIDGARAVLDNYRPHIPINPDW 644 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + + V + + + K + L Sbjct: 645 PIVKL-------------------------ETVSTIVRGSSPRPQGDPKYFGGPVPRLMV 679 Query: 262 GNIIQKLETR---NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 +I + L + + GE++ L + G + Sbjct: 680 ADITRDGMYTTPLIDSLTELGAGKSRFMKSGEVIITVSGNPGLPTILAVDACIHDGFVG- 738 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + + YL + + + ++G + ++L + ++ + +PP+ Q I Sbjct: 739 --LRELSNDVVPEYLYFSLLALHSQHGSQSVG-AVFKNLTSDQIREFTISLPPLATQQAI 795 Query: 379 TNVINVETARID---VLVEKIEQSIV 401 I E A ++ L+E+ E I Sbjct: 796 VAEIEAEQALVNANSELIERFENKIQ 821 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 64/194 (32%), Gaps = 14/194 (7%) Query: 21 IP--KHWKVVPIKRFTKLNTGRTSESGKDIIYIG-------LEDVESGTGKYLPKDGNSR 71 IP W +V ++ + + G + D Y G + D+ P + Sbjct: 638 IPINPDWPIVKLETVSTIVRGSSPRPQGDPKYFGGPVPRLMVADITRDGMYTTPLIDSLT 697 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + G+++ G I+A D F+ L+ + + Sbjct: 698 ELGAGKSRFMKSGEVIITVSGNPGLPTILA-VDACIHDGFVGLRELSNDVVPEYLYFSLL 756 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + GA + I + +PPLA Q I +I AE ++ Sbjct: 757 ALHSQHGSQSVGAVFKNLTSDQIREFTISLPPLATQQAIVAEIEAEQALVNA----NSEL 812 Query: 192 IELLKEKKQALVSY 205 IE + K QA ++ Sbjct: 813 IERFENKIQATITR 826 >gi|110003976|emb|CAK98316.1| putative hsds protein typeIrestriction enzyme [Spiroplasma citri] Length = 404 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 59/382 (15%), Positives = 115/382 (30%), Gaps = 26/382 (6%) Query: 37 NTGRTSESGK------DIIYIGLEDVES--GTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +G T + +I ++ ++DV + K + S+ + K ++Y Sbjct: 31 KSGGTPSTKNKDFYNGEISFLSIKDVTNQGKYIFQTEKTITKKGLKNSSAWLVPKNSLIY 90 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 I F + +LL + + T + Sbjct: 91 SIYASVGFPTINKIPLATSQAFFSMEINNLYFSTEYLYYLLLKFKKKELNKFIIKQTQPN 150 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV- 207 K I IP L EQ I ID I + LL+++KQ ++ + Sbjct: 151 LSKKIINQFIFKIPSLQEQTKIVNF----FSIIDRKIELIKEQLSLLEKQKQYYLNNMFA 206 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + P ++ K EW + N KN I + Sbjct: 207 NEKSYPKIRFKGFNDEWKSKKIKELGNIKTGKTPSTKNEKNWLNDVLWITIPDM--TKKY 264 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 L + + + IV I+F I + + + I + Sbjct: 265 LTNSKKKISLMASKKNPIVKEKSILFSCIGTIGNIGITTTITSFNQQINS------ISSI 318 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVET 386 D + + Y+ K+ + + + + + V KEQ I N Sbjct: 319 KDGVEYVYYLFQYNTEKIKSYSSAQTLPMINKNYFENIEIFVSLNYKEQTKIANF----F 374 Query: 387 ARIDVLVEKIEQSIVLLKERRS 408 + ID +E I++ + LL++++ Sbjct: 375 SIIDRKIELIKEQLSLLEKQKQ 396 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 68/190 (35%), Gaps = 14/190 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 WK IK + TG+T + D+++I + D+ K + S + Sbjct: 222 EWKSKKIKELGNIKTGKTPSTKNEKNWLNDVLWITIPDMTKKYLTNSKKKISLMASKKNP 281 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 I + IL+ +G I I S + + + + L T++I Sbjct: 282 --IVKEKSILFSCIGTIGNIGITTT---ITSFNQQINSISSIKDGVEYVYYLFQYNTEKI 336 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 ++ T+ + NI + + ++ KI ID I + LL++ Sbjct: 337 KSYSSAQTLPMINKNYFENIEIFVSLNYKEQ---TKIANFFSIIDRKIELIKEQLSLLEK 393 Query: 198 KKQALVSYIV 207 +KQ ++ + Sbjct: 394 QKQYYLNNMF 403 >gi|34764188|ref|ZP_00145050.1| TYPE I RESTRICTION-MODIFICATION SYSTEM SPECIFICITY SUBUNIT [Fusobacterium nucleatum subsp. vincentii ATCC 49256] gi|27886036|gb|EAA23350.1| TYPE I RESTRICTION-MODIFICATION SYSTEM SPECIFICITY SUBUNIT [Fusobacterium nucleatum subsp. vincentii ATCC 49256] Length = 156 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 56/134 (41%), Gaps = 1/134 (0%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 E +K E + V+ G+I+F + + E+ I+T + A+ H Sbjct: 4 EKTISFVKESLAEKLRKVEKGDIIFAVTSENIEDLCKCVVWLGEKEIVTGGHTAILKHNQ 63 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 +S +LA+ ++ + +G + ++ + + +P ++EQ I ++++ Sbjct: 64 NSKFLAYYFQTEAFHSQKRKLATGTKVMDITATKLEEILIPLPSLEEQQRIVDILDRFDK 123 Query: 388 RIDVLVEKIEQSIV 401 + ++E + I Sbjct: 124 LCNDILEGLPAEIE 137 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 13/112 (11%), Positives = 33/112 (29%), Gaps = 4/112 (3%) Query: 76 STVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + KG I++ + + I + + + + L + + Sbjct: 16 EKLRKVEKGDIIFAVTSENIEDLCKCVVWLGEKEIVTGGHTAILKHNQNSKFLAYYFQTE 75 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + G + + I +P+P L EQ I + + + Sbjct: 76 AFHSQKRKLATGTKVMDITATKLEEILIPLPSLEEQQRIVDILDRFDKLCND 127 >gi|237751051|ref|ZP_04581531.1| restriction endonuclease S [Helicobacter bilis ATCC 43879] gi|229373496|gb|EEO23887.1| restriction endonuclease S [Helicobacter bilis ATCC 43879] Length = 401 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 60/405 (14%), Positives = 137/405 (33%), Gaps = 36/405 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ V + ++N T + ++++E Y K + + + F Sbjct: 20 EQWQEVRLGEVAEINPKETLRKHYLYKKVAMDNLE----PYTKKVYSFGIESFNGGAKFR 75 Query: 83 KGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G L ++ P L + D ST+F+VL+ K + + + L+ Sbjct: 76 NGDTLLARITPCLENGKTAFVDFLQDDEIAFGSTEFIVLREKTTISDKDFLYYLARSKHF 135 Query: 136 R---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 R I+++ + + + + +PPL Q I E + + +ID Sbjct: 136 REVAIKSMTGSSGRERVQIEVLRDFTFLLPPLTIQQKIAEILSSFDDKID---------- 185 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 LL + + L S +T + + +G + D+ ++ T Sbjct: 186 -LLHRQNKTLESLALTLFRHYFIDNPKRDEWELGKLGDYVKIIDNRGKTPPFTTDITPYP 244 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQI--VDPGEIVFRFIDLQNDKRSLRSAQV 310 + +LS +++ + + E+Y+ + + +I+F + + L + Sbjct: 245 LIEVNALSDDSMLINYDIVRKYVIKETYQKWFREHIKQYDILFSTVGSIGEVAML----L 300 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 +G I + + I YL ++ Y ++ ++ S+K + P Sbjct: 301 DNKGCIAQNVIGFRARDISPFYLYEWLK-YMQQEIKEFDIGSVQPSIKVTHFVEKQIYKP 359 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I + + I + + I L+ R + A Sbjct: 360 D----SKILESFDKQMLLITDKISHNAKQIQNLQAMRDILLKAIF 400 >gi|262198181|ref|YP_003269390.1| Restriction endonuclease S subunits-like protein [Haliangium ochraceum DSM 14365] gi|262081528|gb|ACY17497.1| Restriction endonuclease S subunits-like protein [Haliangium ochraceum DSM 14365] Length = 465 Score = 86.4 bits (212), Expect = 9e-15, Method: Composition-based stats. Identities = 58/410 (14%), Positives = 120/410 (29%), Gaps = 52/410 (12%) Query: 16 QWIGA----IPKHWKVVPIKRFTKLNTGR---------TSESGKDIIYIGLEDVESGTGK 62 W G +P+ W+ + TG+ ++ + D+ G+ Sbjct: 27 PWSGKPGAALPEGWRWSSLGALA---TGKARYGVNLPARPYDAGLPRFVRITDI-GDDGR 82 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK-AIIADFDGICSTQFLVLQ----PK 117 S + G + + G + K + DG+C +L P Sbjct: 83 LRDDAPVSLSDPGAADYRLKPGDLAVARSGATVGKSYLYRPEDGVCVPAGYLLCVPLAPS 142 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 P + W S + + A + + + +P+P+PPL EQ + + Sbjct: 143 RCEPAFVAQWAQSRGYRAWLRSAVRTAAQPNVNASELATLPVPVPPLEEQREVARVLALG 202 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKG---LNPDVKMKDSGIEWVGLVPDHWEV 234 + + +L + L+S + + +P + +GL+P W V Sbjct: 203 DALLAHSGRIIDKLGLVLAALVRDLLSRGIGEDGRIRDPARHPELFRETPLGLLPRAWSV 262 Query: 235 KP----FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE--------- 281 AL + ++ G + ++ + Y Sbjct: 263 SEAGELLAALKPAMRSGPFGSELRKSDLVAEGVPLLGIDNVDTDAFVPRYRRFVPPHLFQ 322 Query: 282 --TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 V PG+++ + RS + + + D L+ S Sbjct: 323 ALGRYAVRPGDVMVTVMGTVG--RSCVVPDDIGDALSSKHV---WTLSFDPERYLPLLAS 377 Query: 340 YDLC-------KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + G S++ ++ + VPP+ EQ I V+ Sbjct: 378 LQFNYAPWVHAHLTREAQGGTIASIRSSTLRSTLLPVPPLAEQRAIAEVL 427 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 56/147 (38%), Gaps = 16/147 (10%) Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSY 340 + PG++ K L + + + + Y+ P + ++A +S Sbjct: 99 YRLKPGDLAVARSGATVGKSYLYRPE--DGVCVPAGYLLCVPLAPSRCEPAFVAQWAQSR 156 Query: 341 DLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + + ++ ++ LPV VPP++EQ ++ V+ A D L+ + Sbjct: 157 GYRAWLRSAVRTAAQPNVNASELATLPVPVPPLEEQREVARVL----ALGDALLAHSGRI 212 Query: 400 IVLLKERRSSFIAAAVT------GQID 420 I L ++ + ++ G+I Sbjct: 213 IDKLGLVLAALVRDLLSRGIGEDGRIR 239 >gi|257058613|ref|YP_003136501.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8802] gi|256588779|gb|ACU99665.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8802] Length = 400 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 55/432 (12%), Positives = 132/432 (30%), Gaps = 55/432 (12%) Query: 6 AYPQYKDSGVQWIGAIP--KHWKVV----PIKRFTKLNTGRTSESGKDIIYIGLEDVESG 59 YPQ + V W P ++W+ + + + K + +E+V+ Sbjct: 2 KYPQLDLTKVFWFQEGPGVRNWQFTESGIKLLNVANITNYGNIDLTKTDRCLSIEEVDQK 61 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG----------PYLRKAIIADFDGICST 109 + +G ++ G +T Sbjct: 62 Y----------------KHFLVDEGDLVIASSGISFDTDGFLRTRGAFIQKKHLPLCMNT 105 Query: 110 QFLVLQPKDVLPELLQ--GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + + KD +LL WL S + ++I + G+ + + + + +PPL EQ Sbjct: 106 STIRFKAKDETSDLLFLKYWLDSFEFREQITRLVTGSAQQNFGPSHLKQLKISLPPLEEQ 165 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + + + R +EL Q++ + +P + + Sbjct: 166 KRIAKILTKADK----IRRTRRYALELSDTYLQSVFLEMFG---DPVTNSMGWDVVTISD 218 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 + + ++ + G I K ++ Sbjct: 219 ISQKVTDGTHQPPLFTSTGIPFIFVQH----IVSGKISFKKTNYVSEKTYNELTRNTKIE 274 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKV 345 +I++ + + ++ + +KP+ I+ST+L M + + Sbjct: 275 LHDILYSSVGSFGVAVEI---LTKDKFVFQRHIAHIKPNHKKINSTFLCSQMNTDFVYNQ 331 Query: 346 FYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 G + ++ D+K L ++ PP++ Q ++ + RI ++ ++ L Sbjct: 332 AKKASRGVAQATINLSDIKELKIIYPPLELQEKFAKIV-QKYERIRKQQQEAQRQADHL- 389 Query: 405 ERRSSFIAAAVT 416 S + + Sbjct: 390 --FQSLLHQFFS 399 >gi|194397224|ref|YP_002037524.1| type I restriction-modification system subunit S [Streptococcus pneumoniae G54] gi|194356891|gb|ACF55339.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae G54] Length = 373 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 52/400 (13%), Positives = 124/400 (31%), Gaps = 39/400 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + ++ +G +S + + I + DVE G + Sbjct: 2 KKVKLGEVCEILSGYAFKSSQFNDNKIGLPLIRIRDVERGFSD------TYFEGTYPEEY 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ D + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKITDKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H I +I +P EQ LI +K+ I + R E E Sbjct: 115 KTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLN----TISQIYDFRKIQSEKFNELV 170 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + + G + D+ + + E L L Sbjct: 171 KSRFNEMF-------------GENKIFESIDNLFDXIDGDRGKNYPKSDELFSEEYCLFL 217 Query: 260 SYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + N+ + + + + + ++ +IV + + Sbjct: 218 NTKNVTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLR 277 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I S + ++P + +++ + + L +K++ + +PP+ Q Sbjct: 278 INSGMVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQ 336 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + + ++D I++S+ L+ + S + Sbjct: 337 NEFADFVV----QVDKSQLAIQKSLEELETLKKSLMQEYF 372 >gi|148263100|ref|YP_001229806.1| restriction modification system DNA specificity subunit [Geobacter uraniireducens Rf4] gi|146396600|gb|ABQ25233.1| restriction modification system DNA specificity domain [Geobacter uraniireducens Rf4] Length = 420 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 56/416 (13%), Positives = 122/416 (29%), Gaps = 47/416 (11%) Query: 25 WKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W +V I RF + +G T +I ++ ++ + S+ Sbjct: 3 WPMVEISRFCQTGSGGTPSRNNAGDYYGGNIPWVKSGELNQEFVLNTEERITELAIKESS 62 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 I G IL G + K+ + D + + P + W + + Sbjct: 63 AKIVPAGAILVAMYGATVGKSALLGIDAATNQAICNIIPDPEAADTRYVWYALKNQLPYL 122 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 A G + + I N +P+P L+EQ I E + R + + Sbjct: 123 LAQRVGGAQPNISQQIIKNTQIPLPLLSEQRRIVEILDQADHL----RKLRGEADKKAEL 178 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 AL + + P W P ++ ++ + + E+ Sbjct: 179 ILPALFNKMFGGPAT---------------NPMGWPEMPLRQVIAKVEAGWSAVSEARGC 223 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIV---------DPGEIVFRFIDLQNDKRSLRSA 308 + +++ + ++ ++ G+++F + + + Sbjct: 224 TKDEFGVLKVSAVTSGRFLACEHKAVLVLQTDRGLLTPRRGDLLFSRANTRELVAASCVV 283 Query: 309 QVMERGIITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDV 362 + + + P + YL L + F A SG ++ E + Sbjct: 284 EDDHPNLFLPDKLWRLILHPDRATAMYLKELFWNNGFRDRFRASASGSSGSMLNISQEAM 343 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI-VLLKERRSSFIAAAVTG 417 +PP K Q + + + K + L S+ + A +G Sbjct: 344 LNTIAPIPPFKLQEEYSAKAWSL-----AAIAKERRLAGDALDTLWSNLLQRAFSG 394 >gi|325108023|ref|YP_004269091.1| restriction modification system DNA specificity domain protein [Planctomyces brasiliensis DSM 5305] gi|324968291|gb|ADY59069.1| restriction modification system DNA specificity domain protein [Planctomyces brasiliensis DSM 5305] Length = 621 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 58/413 (14%), Positives = 122/413 (29%), Gaps = 29/413 (7%) Query: 23 KHWKVVPIKRFTK-LNTGRT---SESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTST 77 + W+ V +K L G S +++G++++ + G S Sbjct: 5 EKWRCVSMKELYHGLYDGPHATPKPSDSGPVFLGIKNITDDGHLDLGSIRHISESDYAKW 64 Query: 78 VSIFAK--GQILYGKLGPYLRKAIIADFDGICST---QFLVLQPKDVLPELLQGWLLSID 132 I++ R AII C + + V P+ L + Sbjct: 65 TRRVEPQENDIVFTYEATLNRYAIIPKGFRGCLGRRLALIRPNTEMVDPKFLFLYFFGHT 124 Query: 133 VTQ-RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G+T++ + + +PPL Q I + A I+ Sbjct: 125 WRDLIATKTIIGSTVNRIPLLEFPDFEITLPPLPTQRKIASILSAYDDLIENNTRRIAIL 184 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV---KPFFALVTELNRKN 248 + QAL P + +G +P+ WEV + LV + K+ Sbjct: 185 E----QMAQALYREWFVHFRFPGHENVKLVDSPLGQIPEGWEVEELQSLCKLVMGQSPKS 240 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 E + + + + + + +I+F Sbjct: 241 EFYNEVGDGLPFHQGVTNFGDRYPTHKTFCTVKNR-LAHENDILFSVRAPVGRINIANCE 299 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 V+ RG+ D+ + + G + +S+ D+ L +L Sbjct: 300 IVVGRGVSA------IRRFDDAQIFLFHQLKELFSEEDIMGGGTIFKSVTKHDLTTLKLL 353 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P + + + L E + + +L+ R + ++G++D+ Sbjct: 354 SPSP----KMVELFEQQVQPAFALYENLTKRNEVLRTTRDLLLPKLISGKLDV 402 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 56/187 (29%), Gaps = 3/187 (1%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G IP+ W+V ++ KL G++ +S G + + + T Sbjct: 214 LGQIPEGWEVEELQSLCKLVMGQSPKSEFYNEVGDGLPFHQGVTNFGDRYPTHKTFCTVK 273 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + + IL+ P R IA+ + + V + + + ++ Sbjct: 274 NRLAHENDILFSVRAPVGR-INIANCEIVVGRG--VSAIRRFDDAQIFLFHQLKELFSEE 330 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + G + + + P L +++ + L Sbjct: 331 DIMGGGTIFKSVTKHDLTTLKLLSPSPKMVELFEQQVQPAFALYENLTKRNEVLRTTRDL 390 Query: 198 KKQALVS 204 L+S Sbjct: 391 LLPKLIS 397 >gi|303253789|ref|ZP_07339924.1| Putative restriction-modification enzyme [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302647373|gb|EFL77594.1| Putative restriction-modification enzyme [Actinobacillus pleuropneumoniae serovar 2 str. 4226] Length = 203 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 60/171 (35%), Gaps = 8/171 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQ 72 IP++W V + G T + I ++ D+ G +P+ Sbjct: 30 EIPENWCWVRLGEIGNWGAGATPNRHEPKYYENGTIPWLKTGDLNDGIITEIPEYITELA 89 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + ++V + G +L G + K I + + + P + + L Sbjct: 90 IEKTSVKLNPVGSVLIAMYGATIGKLGILNIEATTNQACCACIPYTGIYNKYLFYYLMSQ 149 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 T+ + EG+ + + I N P+PPL EQ I EKI + Sbjct: 150 KTELQKRS-EGSGQPNISKEKIVNYLFPLPPLNEQKCIVEKIETLFSTLQN 199 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 53/181 (29%), Gaps = 7/181 (3%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTE------LNRKNTKLIESNILSLSYGNIIQKLETR 271 + E +P++W + + I L G++ + T Sbjct: 21 RCIADEVPFEIPENWCWVRLGEIGNWGAGATPNRHEPKYYENGTIPWLKTGDLNDGIITE 80 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 E V + I + +E + + GI + Sbjct: 81 IPEYITELAIEKTSVKLNPVGSVLIAMYGATIGKLGILNIEATTNQACCACIPYTGIYNK 140 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 YL + + S + GSG + ++ E + +PP+ EQ I I + + Sbjct: 141 YLFYYLMSQKTELQKRSEGSG-QPNISKEKIVNYLFPLPPLNEQKCIVEKIETLFSTLQN 199 Query: 392 L 392 L Sbjct: 200 L 200 >gi|302336437|ref|YP_003801644.1| restriction modification system DNA specificity domain protein [Olsenella uli DSM 7084] gi|301320277|gb|ADK68764.1| restriction modification system DNA specificity domain protein [Olsenella uli DSM 7084] Length = 525 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 55/427 (12%), Positives = 120/427 (28%), Gaps = 70/427 (16%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W+ + + G I ++ + +V SG + Sbjct: 88 DLPDGWEWARLGSIVLSVADGDHQPPPQVSSGIPFLVISNVSSGYLNFEDTRFVPESYYE 147 Query: 76 S--TVSIFAKGQILYGKLGPYLRKA-IIADFDGICSTQFLVLQPKDVLPELLQGWLLSI- 131 S +G +LY G Y ++ D +++P +L + L Sbjct: 148 SLGEYRRPMRGDVLYTVTGSYGIVIRVLDDRRFCVQRHIGIIRPNKLLGNHYLSYCLQSG 207 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +++ G + I + +P+PPLAEQ I + +D + + Sbjct: 208 WIRSCADSVATGIAQKTVGLQSIRSFLVPVPPLAEQRRIVVALDELLGLVDEVERSQAEL 267 Query: 192 IELLKEKKQALVSYIVTKGLNPDVK------------------------MKDSGIEW--- 224 LL + ++ + L P ++ +E Sbjct: 268 EGLLDRARAKVLDLAIRGRLVPQDPSDEPAEALLARVREERLLMAADGRLRRRDVEGDSV 327 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 + D+ + F I +G I + ++ + E + Sbjct: 328 IFRGEDNSYYERFGDNRVIPIEGEVFAIPRTWAWSRFGAISNYGSSESVNPEKIDDEAWV 387 Query: 285 I-----------------------------VDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + G++++ + +K + E G Sbjct: 388 LDLEDIEKGSGRILRRVCGGERRSSSVKRPFCAGQLLYSKLRPYLNKVLIAP----EPGY 443 Query: 316 ITSAYM-AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 TS + + +Y+ ++ S G+ L D + + +PP Sbjct: 444 CTSEIIPIELYGTVAPSYIRLVLMSDYFLSYANRCSYGVKMPRLGTRDGQGALLPIPPSH 503 Query: 374 EQFDITN 380 EQ I + Sbjct: 504 EQERIAS 510 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 85/228 (37%), Gaps = 30/228 (13%) Query: 223 EWVGLVPDHWEVKPFFALV---TELNRKNTKLIESNILSLSYGNIIQKLETRNMGL---- 275 E +PD WE ++V + + + + S I L N+ Sbjct: 84 ELPFDLPDGWEWARLGSIVLSVADGDHQPPPQVSSGIPFLVISNVSSGYLNFEDTRFVPE 143 Query: 276 -KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYL 333 ES Y+ G++++ + R + ++P+ + + YL Sbjct: 144 SYYESLGEYRRPMRGDVLYTVTGSYG---IVIRVLDDRRFCVQRHIGIIRPNKLLGNHYL 200 Query: 334 AWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 ++ ++S + ++ +G ++++ + ++ V VPP+ EQ I ++ +D Sbjct: 201 SYCLQSGWIRSCADSVATGIAQKTVGLQSIRSFLVPVPPLAEQRRIVVALDELLGLVDE- 259 Query: 393 VEKIEQSIV-LLKERRSSFIAAAVTGQI---------------DLRGE 424 VE+ + + LL R+ + A+ G++ +R E Sbjct: 260 VERSQAELEGLLDRARAKVLDLAIRGRLVPQDPSDEPAEALLARVREE 307 >gi|219669967|ref|YP_002460402.1| restriction modification system DNA specificity domain protein [Desulfitobacterium hafniense DCB-2] gi|219540227|gb|ACL21966.1| restriction modification system DNA specificity domain protein [Desulfitobacterium hafniense DCB-2] Length = 413 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 52/399 (13%), Positives = 116/399 (29%), Gaps = 20/399 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + K G T DI+++ ++++ + + Sbjct: 20 WEQRELGEDIKFVGGATPFKENPEYWNGDIVWLSSQEIKERFVTSGTYKITKKAVKDNAT 79 Query: 79 SIFAKGQ-ILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + G ++ + G I D + L D Sbjct: 80 KVIKAGTPLIVTRSGILAKRFPISIPTVDVAINQDIKALLYDDERIATDFLIAGLQKNEG 139 Query: 136 RIEAIC--EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I G T+ + M P L EQ I +D IT R ++ Sbjct: 140 FILKHIVKTGTTVQSINLPDFQKFLMAYPMLPEQTAIGNF----FRTLDDTITLHKRKLD 195 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LKE K+A + + + N K++ +G EV + ++ K + Sbjct: 196 KLKELKKAYLQRMFPQAGNDVPKVRFAGFTEPWASRKLGEVAEIVRGASPRPIQDPKWFD 255 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 ++ + + + + L + Sbjct: 256 EKSNVGWLRISDVSVQDGRVHYLEQHISKAGQKKTRVLTQPHLLLSIAASVGKPVINYVN 315 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + ++ + + ++ ++ ++ + Y G + +L + VK + +P + Sbjct: 316 TGVHDGFLIFQNPNFEIEFMFQWLKMFEEQWLKYG-QPGSQINLNSDIVKNQDISIPTKE 374 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ I N +D V +Q + L + + S++ Sbjct: 375 EQKHIGN----LFLNLDNQVFVRQQKLDQLNQLKRSYLQ 409 >gi|317481423|ref|ZP_07940490.1| type I restriction modification DNA specificity domain-containing protein [Bacteroides sp. 4_1_36] gi|316902408|gb|EFV24295.1| type I restriction modification DNA specificity domain-containing protein [Bacteroides sp. 4_1_36] Length = 370 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 49/395 (12%), Positives = 104/395 (26%), Gaps = 57/395 (14%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 WK + +G T S K +I +I ++ + ++S Sbjct: 23 EWKRHKLSEICSFYSGGTPSSSKKEFYNGNIPFIRSGELHKDKTELF---ITEDGLNSSA 79 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G +L G I+ G + L ++ K + +R+ Sbjct: 80 AKLVEIGDLLLALYGATSGDIAISKIKGAINQAILCIRTKQ---NKKFIESVWNKHVERL 136 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + NIP L EQ + I RI T + L+K Sbjct: 137 LQTYLQGGQGNLSADIVKNIPFYFADLEEQDKLANFISLLDERISTQNKIIEKLETLIKG 196 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + ++S L + +S + L ES + Sbjct: 197 IVETVISSQKPNTLIKNCLECNS----------------------------STLQESQVA 228 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + + ++ E I Sbjct: 229 ETGTFPVYGATDISGYTETAGINGESILIIKD----------GSGVGTVKFVSGEYSYIG 278 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + G Y+ + ++ + + F+D + + P Q Sbjct: 279 TLNSLTAKDGYCLKYIYFALQRFSFEPY---KTGMAIPHIYFKDYGKAKIYCPSFSLQTL 335 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I + + I+ +E ++ I+ + +RS ++ Sbjct: 336 IAQKL----SLIENKMEVEKRIILCYQLQRSYLLS 366 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 58/191 (30%), Gaps = 15/191 (7%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNT-----KLIESNILSLSYGNIIQKLETRNMGLKPES 279 W+ + + + + NI + G + + + + Sbjct: 17 FPEFSREWKRHKLSEICSFYSGGTPSSSKKEFYNGNIPFIRSGELHKDKTELFITEDGLN 76 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++V+ G+++ + ++ + I I+S + Sbjct: 77 SSAAKLVEIGDLLLALYGATSGDIAISKIKGAINQAILCIRTKQNKKFIESVWNKH---- 132 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + ++ G + +L + VK +P ++EQ + N I + +D + + Sbjct: 133 --VERLLQTYLQGGQGNLSADIVKNIPFYFADLEEQDKLANFI----SLLDERISTQNKI 186 Query: 400 IVLLKERRSSF 410 I L+ Sbjct: 187 IEKLETLIKGI 197 >gi|239833255|ref|ZP_04681583.1| Type I restriction enzyme EcoEI specificity protein [Ochrobactrum intermedium LMG 3301] gi|239821318|gb|EEQ92887.1| Type I restriction enzyme EcoEI specificity protein [Ochrobactrum intermedium LMG 3301] Length = 865 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 54/410 (13%), Positives = 118/410 (28%), Gaps = 59/410 (14%) Query: 18 IGAIPKHWKVVPIKR--FTKLNTGRTSESG------KDIIYIGLEDVESGTGKY----LP 65 IG + +V + K+ +G T +S I + L D+ + Sbjct: 473 IGK--SGFPMVSLGDEALFKVESGGTPKSDVPEYWDGGIPWATLVDLPASNFITEITGTV 530 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + S+ I +L R AI ++ + Sbjct: 531 RTISEAGLKGSSAKILPANSVLVSSRATIGRIAINRVPLATNQGFKNIVIADEARVLPEY 590 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + +++ G T + + +P+PPL Q I ++ Sbjct: 591 LAFAVTKLVPTMQSWATGGTFAEISKSKFCELEIPLPPLEMQREIVAEV----------- 639 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 E Q ++ L+ EW ++P + Sbjct: 640 -----------EGYQRVIDGA-RAVLDNYRSYIPVDPEW--------PMRPLSEVAQVNP 679 Query: 246 RKNTKLIESNILSLSYGNI------IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 +K+ +S+ + + + + E +Y +++ + Sbjct: 680 KKSELKDTDPSTPVSFVPMAVLNENNVRFDPVEVKTISEVVGSYTYFRESDVLVAKVTPC 739 Query: 300 NDKRSLRSAQVMERGI---ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLR 354 + A+ ++ GI + Y+ +L + + D G+G Sbjct: 740 FENGKAGIARGLKNGIGFGSSEFYVVRANEETLPGWLFHWLTTPDFRARATAKMTGTGGL 799 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID---VLVEKIEQSIV 401 Q + V+ + +P + Q I I E A I+ L+ + E+ I Sbjct: 800 QRVPRAVVEEELIPLPELVVQKSIVAEIEAERALIEGNRDLITRFEKKIE 849 >gi|160887310|ref|ZP_02068313.1| hypothetical protein BACOVA_05328 [Bacteroides ovatus ATCC 8483] gi|156107721|gb|EDO09466.1| hypothetical protein BACOVA_05328 [Bacteroides ovatus ATCC 8483] Length = 409 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 53/404 (13%), Positives = 126/404 (31%), Gaps = 36/404 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W++ I +L +G T I +I +++ + + + ++ Sbjct: 25 EWEMSSIGEQFELYSGNTPSRMNKNQFDGSINWITSGELKEHYISDTKEKISEEAAKNNS 84 Query: 78 VSIFAKGQILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + G + G I + S + K + Sbjct: 85 LKLLPVGTFVIAIYGLEANGVRGTCSITTRESTISQACMAFTSKMDIQNEFLYSWYKKHG 144 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +G + + I + P + EQ +K+I ID I + + IE Sbjct: 145 NIIGIKYAQGTKQQNLSYDIIERFNISYPCMEEQ----KKLIRFISLIDQRIATQNKIIE 200 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LK+ K A+ ++ + + + S I + F + K L Sbjct: 201 DLKKLKSAISKHLFARKDLLETTICLSNIATL------KNGYAFQSGKYNALGKWKILTI 254 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +N+ Y N N+ P + +Q++ G+I+ ++ + Sbjct: 255 TNVPGERYINDEDCNCIINL---PNDIQDHQVLKEGDILISLTGNVGRVSLCKNGDYLLN 311 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPI 372 + + ++ +L ++ S A G G + ++ DV+ + Sbjct: 312 QRVG---LLQLSKNVNREFLYQILSSQRFENSMIACGQGAAQMNIGKGDVESYVLPYSSN 368 Query: 373 KEQFDI---TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +I +++ RI + ++ + + LL ++ + Sbjct: 369 G--NNILWVAKILHSYDERI---INELRR-LTLLTMQKQYLLTQ 406 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 62/199 (31%), Gaps = 5/199 (2%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ + EW + +N+ + I S Sbjct: 15 PHLRFPEFSGEWEMSSIGEQFELYSGNTPSRMNKNQFDGSINWITSGELKEHYISDTKEK 74 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + + + +++ G V L+ + + I+ A MA Sbjct: 75 ISEEAAKNNSLKLLPVGTFVIAIYGLEANGVRGTCSITTRESTISQACMAFTSKMDIQNE 134 Query: 333 LAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + G +Q+L ++ ++R + P ++EQ + I + ID Sbjct: 135 FLYSWYKKHGNIIGIKYAQGTKQQNLSYDIIERFNISYPCMEEQKKLIRFI----SLIDQ 190 Query: 392 LVEKIEQSIVLLKERRSSF 410 + + I LK+ +S+ Sbjct: 191 RIATQNKIIEDLKKLKSAI 209 >gi|238918026|ref|YP_002931540.1| type I restriction-modification system, S subunit, [Edwardsiella ictaluri 93-146] gi|238867594|gb|ACR67305.1| type I restriction-modification system, S subunit, putative [Edwardsiella ictaluri 93-146] Length = 585 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 66/208 (31%), Gaps = 11/208 (5%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 P + S E +P+ W V K++ + + G+I Sbjct: 86 PKALPEISEEEQPFDLPEGWAWGSIGYITEFVNGYAFKSSDFASEGVGIVKIGDIQDGEI 145 Query: 270 TRNMGLKP-----ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + + V G+++ K E + Sbjct: 146 VVDNMSRVSQHVVDGLNENLQVKSGDMLIAMSGATTGKLGFNKTD--EIFYLNQRVGKFI 203 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + +D +L + + + + AMGS ++ + + + + +PP+ EQ I ++ Sbjct: 204 TYLVDKEFLYYPLATKIAENLAKAMGS-AIPNISTKQINEITIALPPLAEQHRIVAKVDE 262 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIA 412 A D L + E + + + +A Sbjct: 263 LMALCDQLEQCSESQLAAHQTLVEALLA 290 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 60/190 (31%), Gaps = 6/190 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTG--KYLPKDGNSRQS 73 +P+ W I T+ G +S + + + + D++ G + + Sbjct: 100 DLPEGWAWGSIGYITEFVNGYAFKSSDFASEGVGIVKIGDIQDGEIVVDNMSRVSQHVVD 159 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G +L G K D I V + L + + Sbjct: 160 GLNENLQVKSGDMLIAMSGATTGKLGFNKTDEIFYLNQRVGKFITYLVDKEFLYYPLATK 219 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 A G+ + + K I I + +PPLAEQ I K+ D L + Sbjct: 220 IAENLAKAMGSAIPNISTKQINEITIALPPLAEQHRIVAKVDELMALCDQLEQCSESQLA 279 Query: 194 LLKEKKQALV 203 + +AL+ Sbjct: 280 AHQTLVEALL 289 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 65/203 (32%), Gaps = 17/203 (8%) Query: 220 SGIEWVGLVPDHW---EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 S E +P W ++ L+T+ + K + +S ++ + Sbjct: 378 SEDEKPFSLPKGWDFAYMQDLCYLITDGTHQTPKYTDDGRPFIS-AQCVKPFRFMPEFCR 436 Query: 277 PESYETYQIVDP------GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 S E YQ+ G+I+ + + ++ + + +++ + + S Sbjct: 437 YVSEEHYQLYIKNRRPEFGDILLSRVGAGIGEAAVIDSCLEFAIYVSTGLLKPNRGAVYS 496 Query: 331 TYLAWLMRSYDLCKVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 YL + S G + +L ++ V +PP KEQ I + Sbjct: 497 KYLELWLNSPIGRGFSERNTLGKGVSQGNLNLSLIRSFIVSLPPKKEQKLIVAKVGEMIT 556 Query: 388 RIDVLV----EKIEQSIVLLKER 406 D L + + L + Sbjct: 557 LCDQLKSCLQTSQQTQLALAESL 579 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 70/201 (34%), Gaps = 16/201 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSD-- 74 +PK W ++ L T T + + +I + V+ +++P+ + Sbjct: 386 LPKGWDFAYMQDLCYLITDGTHQTPKYTDDGRPFISAQCVK--PFRFMPEFCRYVSEEHY 443 Query: 75 --TSTVSIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWL 128 G IL ++G + +A + D ST L V + L+ WL Sbjct: 444 QLYIKNRRPEFGDILLSRVGAGIGEAAVIDSCLEFAIYVSTGLLKPNRGAVYSKYLELWL 503 Query: 129 LSIDVTQRIEAIC--EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S E +G + + + I + + +PP EQ LI K+ D L + Sbjct: 504 NSPIGRGFSERNTLGKGVSQGNLNLSLIRSFIVSLPPKKEQKLIVAKVGEMITLCDQLKS 563 Query: 187 ERIRFIELLKEKKQALVSYIV 207 + ++LV + Sbjct: 564 CLQTSQQTQLALAESLVEGAI 584 >gi|166364730|ref|YP_001657003.1| putative type I restriction enzyme specificity protein [Microcystis aeruginosa NIES-843] gi|166087103|dbj|BAG01811.1| putative type I restriction enzyme specificity protein [Microcystis aeruginosa NIES-843] Length = 388 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 62/411 (15%), Positives = 123/411 (29%), Gaps = 41/411 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 I K W P+ + I +D G P G + Q D+ Sbjct: 6 EITKKWPHRPLSEVVDFLDSKRKP-------ITQKDRVPG---PYPYYGANGQQDSVADY 55 Query: 80 IFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 IF + +L + G + A + + VL+PK + ++ Sbjct: 56 IFDEPLVLLAEDGGHFGDADKTIAYQVEGKCWVNNHAHVLRPKKDVD---IRYICRHLER 112 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G+T NIP+ +PPL EQ I + EL Sbjct: 113 YDVTPFITGSTRGKLTKTAANNIPIALPPLEEQRRIAAILDKADGVRRKRKEAIRLTDEL 172 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L S + +P K + +G E + + + + E Sbjct: 173 -------LKSTFLEMFGDPVTNPKGWEVRELGDCVKDIESG----WSPKCDTRQAEPEEW 221 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +L L N + P+ ++ + G+++ + + Q+ Sbjct: 222 GVLKLGAVTYGHFNPDENKAMLPDDVPRQELEIKTGDLLVTRKNTYELVGASAFVQMTRP 281 Query: 314 GIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVL 368 ++ + GID Y+ + + + G ++ ++ LP Sbjct: 282 KLMLPDLIFRLRLIDGIDPVYVWQTLSQKTMRLKLSGLAGGTAGSMPNISKARLRTLPFP 341 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-KERRSSFIAAAVTGQ 418 VPP Q + N L ++ ++ + + +S + A G+ Sbjct: 342 VPPQLLQLKYREIFNQF-----WLKKEHQKESEEISENLFNSLLQRAFRGE 387 >gi|255011912|ref|ZP_05284038.1| type I restriction-modification system, endonuclease S subunit [Bacteroides fragilis 3_1_12] gi|313149746|ref|ZP_07811939.1| predicted protein [Bacteroides fragilis 3_1_12] gi|313138513|gb|EFR55873.1| predicted protein [Bacteroides fragilis 3_1_12] Length = 375 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 64/398 (16%), Positives = 124/398 (31%), Gaps = 42/398 (10%) Query: 29 PIKRFTKLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 +T + +D Y+GLE ++ + + KG IL Sbjct: 5 RFDEIAINSTQKKKPIEEDRFHYVGLEHIDPECFEIQQYGSEVAPVGE--KLVMKKGDIL 62 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPELLQGWLLSIDVTQRIEAICEGAT 145 +GK Y RK IA DGI S +VL+PK + ++ S + I G Sbjct: 63 FGKRRAYQRKVAIAPCDGIFSAHGMVLRPKTGVIDSSYFPFFISSDTFMETAIRISVGGL 122 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 +WK + +P L EQ + +K+ A + I E++ Sbjct: 123 SPTINWKDLAKQEFELPSLEEQKNLADKLWAAYRLKEAYKKLLIATDEMV---------- 172 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 K IE V + +++ + + + E+ I + N Sbjct: 173 ------------KSQFIEMFENVESYCKLEDLISDTFPGEWGSEPISENTIKVIRTTNFT 220 Query: 266 QKLETRNMGLKPESYETYQIVDP----GEIVFRFIDLQND---KRSLRSAQVMERGIITS 318 + + E ++V G+ + D R + ++ + Sbjct: 221 NEGYLDLTDVVTRDIEPKKVVRKKLKQGDTILERSGGTKDNPVGRVVFFDEIGDYLPNNF 280 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKV----FYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + ++ YL + + + A + Q+L D +++P E Sbjct: 281 TQVLRPKESVNPVYLFYALYNSYNLNKAAMRAMASQTTGIQNLSMSDFMAKFIVLPSRNE 340 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 Q + D +++Q I + + S I Sbjct: 341 QNKF----EQIYHQADKSKFELKQCIENIDKVIKSLIN 374 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 60/155 (38%), Gaps = 8/155 (5%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGE 290 F + +K + E + +I + E + G + ++ G+ Sbjct: 1 MGKYRFDEIAINSTQKKKPIEEDRFHYVGLEHIDPECFEIQQYGSEVAPVGEKLVMKKGD 60 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCK-VFY 347 I+F K ++ GI ++ M ++P IDS+Y + + S + Sbjct: 61 ILFGKRRAYQRKVAIAPCD----GIFSAHGMVLRPKTGVIDSSYFPFFISSDTFMETAIR 116 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 GL ++ ++D+ + +P ++EQ ++ + + Sbjct: 117 ISVGGLSPTINWKDLAKQEFELPSLEEQKNLADKL 151 >gi|290474452|ref|YP_003467332.1| putative restriction-modification system specificity determinant [Xenorhabdus bovienii SS-2004] gi|289173765|emb|CBJ80545.1| Putative restriction-modification system specificity determinant [Xenorhabdus bovienii SS-2004] Length = 407 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 46/419 (10%), Positives = 114/419 (27%), Gaps = 36/419 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDI--------IYIGLEDVESGTGKYLPKDGNSRQSDT 75 W + G T + + + + ++V+ + + Sbjct: 2 SWPQAKLDDVISFIRGVTFKPDDLVEPLSSNSTVVMRTKNVQVEGLEQSDLIAIPSELVK 61 Query: 76 STVSIFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 +G IL + ++ +++ K + + + Sbjct: 62 RKEQALCEGDILISSANSWELVGKASYVPKLNYQATAGGFISIVRAKQRVIDSRYLYHWI 121 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPM---PIPPLAEQVLIREKIIAETVRIDTLITE 187 + + G ++ +G P+PPL EQ I + + + Sbjct: 122 SSPSTQHRIRHCGRQTTNISNLDVGRFKDLEIPLPPLTEQKRIAAILDKAGA----IRRK 177 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 R + I+L E +A+ + +P K + + ++ E Sbjct: 178 RQQAIQLANEFLRAV---FLDMFGDPVTNPKGWEVRPLVDGIKSII--SGWSAKGESYPC 232 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV-DPGEIVFRFIDLQNDKRSLR 306 N +S E + + K + + G+++F + ++ + Sbjct: 233 NEGEYGVLKISAVTSGKFNPQENKFVYEKDIPADKKLVFPKKGDLLFSRANTRDLVAATC 292 Query: 307 SAQVMERGIITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSGL---RQSLKFE 360 + + + YL +L+ + +G ++ Sbjct: 293 IVPKDNNNVFLPDKLWNVKTSENILLPEYLNYLIWEPRFKGKLTSQATGTSGSMLNISKG 352 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + + P + Q ++ ID L S+ SS A +G+I Sbjct: 353 KFETTDAIFPDLPLQKKFRSIYWRVQKYIDSL----NASLDGCDASFSSLSQKAFSGEI 407 Score = 44.4 bits (103), Expect = 0.039, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 60/208 (28%), Gaps = 22/208 (10%) Query: 22 PKHWKVVPIKR-FTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK W+V P+ + +G +++ + + + V SG + Sbjct: 204 PKGWEVRPLVDGIKSIISGWSAKGESYPCNEGEYGVLKISAVTSGKFNPQENKFVYEKDI 263 Query: 75 TSTVSIF--AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-------LPELLQ 125 + + KG +L+ + A + FL + +V LPE L Sbjct: 264 PADKKLVFPKKGDLLFSRANTRDLVAATCIVPKDNNNVFLPDKLWNVKTSENILLPEYLN 323 Query: 126 GWLLSIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + ++ + G +M + P L Q R R+ Sbjct: 324 YLIWEPRFKGKLTSQATGTSGSMLNISKGKFETTDAIFPDLPLQKKFRSIYW----RVQK 379 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGL 211 I ++ +L + + Sbjct: 380 YIDSLNASLDGCDASFSSLSQKAFSGEI 407 >gi|312905316|ref|ZP_07764431.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0635] gi|310631340|gb|EFQ14623.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0635] gi|315162495|gb|EFU06512.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0645] gi|315578595|gb|EFU90786.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0630] Length = 398 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 51/403 (12%), Positives = 121/403 (30%), Gaps = 39/403 (9%) Query: 23 KHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDV------ESGTGKYLPKDGNSRQ 72 + W++ + +++ + S + + D+ + ++P + Sbjct: 18 EDWELCKLGEKVDISSASRVHKHEWSSSGVRFFRSSDIMSAYNGTTNQKAFIPNELYEEL 77 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLL 129 S IL G +++D + + + + L + + Sbjct: 78 IKKSGK--VNLDDILVTGGGSVGVPYLVSDEKPLYFKDADLLWIKNSGVIDGQFLYTFFI 135 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + I++I T+SH P+ +P EQ I +D IT Sbjct: 136 SPFFRKYIKSISHIGTISHYTIVQAKETPIKLPSFKEQGSIGSF----FKYLDDTITLHQ 191 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 R +E LKE K+A + + K++ + E + + + + + Sbjct: 192 RKLEQLKELKKAYLQLMFPTKEERVPKLRFADFEGEWELCKLIGILDIIKGTQKSKSELS 251 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + Y I N+ + + + Sbjct: 252 TNQNNCTPYPVYNGGINPSGYTNIYNREN---------------AITISEGGNSAGFVNF 296 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 V E+ + + D+ +L + + S ++ +++ + L + Sbjct: 297 VQEKFFSGGHNYTIVNNVTDTLFLFFYLCSIQ-EEIMRLRVGTGLPNIQKPTLMNLEIQK 355 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ I + ID+L+ + + LK + S++ Sbjct: 356 TTDNEQKFIGLFL----KNIDILITLTQNKLNQLKSLKKSYLQ 394 >gi|257091257|ref|ZP_05585618.1| predicted protein [Enterococcus faecalis CH188] gi|257000069|gb|EEU86589.1| predicted protein [Enterococcus faecalis CH188] Length = 394 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 51/403 (12%), Positives = 121/403 (30%), Gaps = 39/403 (9%) Query: 23 KHWKVVPIKRFTKLNT----GRTSESGKDIIYIGLEDV------ESGTGKYLPKDGNSRQ 72 + W++ + +++ + S + + D+ + ++P + Sbjct: 14 EDWELCKLGEKVDISSASRVHKHEWSSSGVRFFRSSDIMSAYNGTTNQKAFIPNELYEEL 73 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLL 129 S IL G +++D + + + + L + + Sbjct: 74 IKKSGK--VNLDDILVTGGGSVGVPYLVSDEKPLYFKDADLLWIKNSGVIDGQFLYTFFI 131 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + I++I T+SH P+ +P EQ I +D IT Sbjct: 132 SPFFRKYIKSISHIGTISHYTIVQAKETPIKLPSFKEQGSIGSF----FKYLDDTITLHQ 187 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 R +E LKE K+A + + K++ + E + + + + + Sbjct: 188 RKLEQLKELKKAYLQLMFPTKEERVPKLRFADFEGEWELCKLIGILDIIKGTQKSKSELS 247 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + Y I N+ + + + Sbjct: 248 TNQNNCTPYPVYNGGINPSGYTNIYNREN---------------AITISEGGNSAGFVNF 292 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 V E+ + + D+ +L + + S ++ +++ + L + Sbjct: 293 VQEKFFSGGHNYTIVNNVTDTLFLFFYLCSIQ-EEIMRLRVGTGLPNIQKPTLMNLEIQK 351 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ I + ID+L+ + + LK + S++ Sbjct: 352 TTDNEQKFIGLFL----KNIDILITLTQNKLNQLKSLKKSYLQ 390 >gi|148997029|ref|ZP_01824683.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP11-BS70] gi|147756729|gb|EDK63769.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP11-BS70] Length = 373 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 52/400 (13%), Positives = 124/400 (31%), Gaps = 39/400 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + ++ +G +S + + I + DVE G + Sbjct: 2 KKVKLGEVCEILSGYAFKSSQFNDNKIGLPLIRIRDVERGFSD------TYFEGTYPEEY 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ D + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKITDKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H I +I +P EQ LI +K+ I + R E E Sbjct: 115 KTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLN----TISQIYDFRKIQSEKFNELV 170 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + + G + D+ + + E L L Sbjct: 171 KSRFNEMF-------------GENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFL 217 Query: 260 SYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + N+ + + + + + ++ +IV + + Sbjct: 218 NTKNVTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLR 277 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I S + ++P + +++ + + L +K++ + +PP+ Q Sbjct: 278 INSGMVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQ 336 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + + ++D I++S+ L+ + S + Sbjct: 337 NEFADFVV----QVDKSQLAIQKSLEELETLKKSLMQEYF 372 >gi|317009200|gb|ADU79780.1| type I R-M system S protein [Helicobacter pylori India7] Length = 404 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 44/405 (10%), Positives = 103/405 (25%), Gaps = 40/405 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK + I + GR + + +G ++ + Sbjct: 13 PKGVEFRKIGEICLIKRGRVIAKKILQENGKYPVYSSQTLNNGILGFIDTYDFDGEF--- 69 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + G Y + + + + +L +L I Sbjct: 70 ---------LTWTTDGAYAGSVFYRKGRFSITN--VCGLLQVIQDNILHKYLYYILQITT 118 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G + I +PIPPL Q I + + A T + + Sbjct: 119 PLHVSSGMGNPKLMSAAMQQITIPIPPLEIQQEIVKILDAFTELNTE-----LNARKKQY 173 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + Q ++ + E + +K + + + Sbjct: 174 QYYQNMLLDF-----DGIHSNHKDAKEKLAQKTYPKRLKTLLQTLA--PKGVEFRKLGEV 226 Query: 257 LSLSYGNIIQKLETRNMGLKP-----ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +++ G + K + G P Y + I + Sbjct: 227 INIFKGKQLNKELLLDYGEYPVMNGGIHASGYWNEYNTDYPKIIISQGGASAGYVNYMTS 286 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + Y + + + + +L D++ L + +PP Sbjct: 287 KFWAGAHCYTIELNSEKLNYKFLYYFLKNSQIILMKSQFGAGIPALNKADIETLTIPIPP 346 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ Q +I +++ + L+ I I K+ R + Sbjct: 347 LEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQYEYYREKLLT 391 >gi|262067380|ref|ZP_06026992.1| type I restriction system specificity protein [Fusobacterium periodonticum ATCC 33693] gi|291378943|gb|EFE86461.1| type I restriction system specificity protein [Fusobacterium periodonticum ATCC 33693] Length = 216 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 69/181 (38%), Gaps = 12/181 (6%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIV 292 ++V + E + + YG I K E ++ E + V+ G+I+ Sbjct: 28 IGSIVRGNGLQKRDFTEEGVGCIHYGQIYTKYGMATEKTISFVEESLAEKLRKVEKGDII 87 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 F + + E I+T + A+ H +S +LA+ ++ + +G Sbjct: 88 FAVTSENIEDLCKCVVWLGEEEIVTGGHTAILKHNQNSKFLAYYFQTEAFHSQKRKLATG 147 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLK 404 + ++ + + +PP++EQ I ++++ D + +E ++ + Sbjct: 148 TKVMDVTATKLEEIIIPLPPLEEQQRIVDILDRFNKLCDDISEGLLVEIEARQKQYEYYR 207 Query: 405 E 405 E Sbjct: 208 E 208 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 61/192 (31%), Gaps = 9/192 (4%) Query: 27 VVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSIF 81 V + + G + + + + I + + G K + + + Sbjct: 22 EVRLGDIGSIVRGNGLQKRDFTEEGVGCIHYGQIYTKYGMATEKTISFVEESLAEKLRKV 81 Query: 82 AKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 KG I++ + + + I + + + + L + + + Sbjct: 82 EKGDIIFAVTSENIEDLCKCVVWLGEEEIVTGGHTAILKHNQNSKFLAYYFQTEAFHSQK 141 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G + + I +P+PPL EQ I + + D + + IE ++ Sbjct: 142 RKLATGTKVMDVTATKLEEIIIPLPPLEEQQRIVDILDRFNKLCDDISEGLLVEIEARQK 201 Query: 198 KKQALVSYIVTK 209 + + ++T Sbjct: 202 QYEYYREKLLTF 213 >gi|237738768|ref|ZP_04569249.1| type I restriction-modification system specificity subunit [Fusobacterium sp. 2_1_31] gi|229423871|gb|EEO38918.1| type I restriction-modification system specificity subunit [Fusobacterium sp. 2_1_31] Length = 216 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 70/185 (37%), Gaps = 9/185 (4%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIV 292 ++V + E + + YG I K E ++ E + V+ G+I+ Sbjct: 28 IASIVRGNGLQKRDFTEEGVGCIHYGQIYTKYGMVAEKTISFVEESLAEKLRKVEKGDII 87 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 F + + E I+T + A+ H +S +LA+ ++ + +G Sbjct: 88 FAVTSENIEDLCKCVVWLGEDEIVTGGHTAILKHNQNSKFLAYYFQTEAFHSQKRKLATG 147 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RR 407 + ++ + + +PP++EQ I ++++ + + E + I ++ R Sbjct: 148 TKVMDITATKLEEILISLPPLEEQQRIVDILDRFDRLCNDISEGLPAEIEARQKQYEYYR 207 Query: 408 SSFIA 412 + Sbjct: 208 EKLLN 212 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 49/166 (29%), Gaps = 9/166 (5%) Query: 27 VVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSIF 81 V + + G + + + + I + + G K + + + Sbjct: 22 EVRLGDIASIVRGNGLQKRDFTEEGVGCIHYGQIYTKYGMVAEKTISFVEESLAEKLRKV 81 Query: 82 AKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 KG I++ + + D I + + + + L + + + Sbjct: 82 EKGDIIFAVTSENIEDLCKCVVWLGEDEIVTGGHTAILKHNQNSKFLAYYFQTEAFHSQK 141 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + G + + I + +PPL EQ I + + + Sbjct: 142 RKLATGTKVMDITATKLEEILISLPPLEEQQRIVDILDRFDRLCND 187 >gi|77165283|ref|YP_343808.1| restriction modification system DNA specificity subunit [Nitrosococcus oceani ATCC 19707] gi|254434555|ref|ZP_05048063.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] gi|76883597|gb|ABA58278.1| Restriction modification system DNA specificity domain [Nitrosococcus oceani ATCC 19707] gi|207090888|gb|EDZ68159.1| Type I restriction modification DNA specificity domain protein [Nitrosococcus oceani AFC27] Length = 483 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 66/473 (13%), Positives = 135/473 (28%), Gaps = 81/473 (17%) Query: 24 HWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVE-SGTGKYLPKDGNSRQSDT 75 W +V + + ++ G + K I + + + + SG ++ D Sbjct: 4 EWPLVTLSKLIEIKHGWAFKGKHMAESVIKGPIVVAIGNFDYSGGFRFSSTRIKRYTEDY 63 Query: 76 STVSIFAKGQILYGKL-----GPYLRKAIIADFDGICSTQ------FLVLQPKDVLPELL 124 G +L G L I D +V +PK V L Sbjct: 64 PKEYQLQPGDVLLAMTCQTPGGEILGLPGIIPEDDEVYLHNQRLGKLIVKEPKKVWAPFL 123 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 LS D + + G + H I + IPP+ Q I + + + +I Sbjct: 124 YWVFLSYDFNRYLAGSATGTKILHTSPNKITSYETRIPPINLQQSIANILWSISDKISLN 183 Query: 185 ITERIRFIELLKEKKQALVSYI-------------------------------------- 206 ++ + ++ Sbjct: 184 HQINQILEQMAQAIFKSWFVDFEPVKAKIAALKAGGSQEDALLAAMQAISGKSSEQLTRL 243 Query: 207 ------VTKGLNPDVKMKDSGIE--WVGLVPDHWEVKPFFALVTELNR----KNTKLIES 254 L ++ S ++ +G +P+ W + + N K E+ Sbjct: 244 QAEQPEQYAQLRTTAELFPSAMQDSELGEIPEGWSCRALDDIAKYKNGLALQKFRPENEN 303 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + L + ++K + I+D G++VF + L R Sbjct: 304 DYLPVVKIAQLKKGYADGEEKASPNINPECIIDNGDVVFSWSGSL-----LVDTWCGGRA 358 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPI 372 + V +L + + L + +K E +KR +P Sbjct: 359 ALNQHLFKVTSETH-PKWLYYHFTQHHLEDFQRIAADKAVTMGHIKREHLKRALCAIPC- 416 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 EQ I++ N ++ +E +SI L R + + ++G++ + Sbjct: 417 -EQL-ISDAGNSLRNILEKQIELRLESIT-LSTLRDTLLPKLLSGELSISDAE 466 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 46/178 (25%), Gaps = 14/178 (7%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR-----TSESGKDI-IYIGLEDVESGTGK 62 +DS +G IP+ W + K G E+ D + + ++ G Sbjct: 264 AMQDSE---LGEIPEGWSCRALDDIAKYKNGLALQKFRPENENDYLPVVKIAQLKKGYAD 320 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + I G +++ G L + + + Sbjct: 321 ----GEEKASPNINPECIIDNGDVVFSWSGSLLVDT-WCGGRAALNQHLFKVTSETHPKW 375 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 L + + A + TM H + + IP + + Sbjct: 376 LYYHFTQHHLEDFQRIAADKAVTMGHIKREHLKRALCAIPCEQLISDAGNSLRNILEK 433 >gi|121583503|ref|YP_973929.1| restriction modification system DNA specificity subunit [Polaromonas naphthalenivorans CJ2] gi|120596753|gb|ABM40187.1| restriction modification system DNA specificity domain [Polaromonas naphthalenivorans CJ2] Length = 415 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 59/430 (13%), Positives = 134/430 (31%), Gaps = 51/430 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + F +L G K T P +S SD +V + Sbjct: 3 SEWQFGKLGDFIELKRGYDLPQAK------------RTSGPFPLVSSSGVSDCHSVPMVR 50 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ++ G+ G + + D +T V K P+ + +L ++D + Sbjct: 51 GPGVVTGRYGTIGQVYFVEDDFWPLNTTLYVRDFKGNDPKFISYFLKTVDFFAYSDK--- 107 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 A + + + IP L Q I + RI L + + ++ Sbjct: 108 -AAVPGVNRNHLHEALGAIPDLPTQQEIARTLGVLDDRIALLRETNATLEAIAQALFKSW 166 Query: 203 VS-----YIVTKGLNPDVK------MKDSGIE--WVGLVPDHWEVKPFFALVTELNRKNT 249 +G P+ + G E +GLVP W + + T K Sbjct: 167 FVDFDPVRARMEGRAPEGMDEATAALFPDGFEDSELGLVPKGWATRTMADISTVGIGKTP 226 Query: 250 KLIESNILS-------------LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 E + S + + + + + + + V ++ F Sbjct: 227 PRKEQHWFSEDPSDVRWVSIRDMGAVGVYAAVTSEFLKKEAIEKFNIRRVPDNTVLMSFK 286 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 I + + + + Y+ ++ +D + + S + + Sbjct: 287 MTIGRVAITDGEMTTNEAI--AHFKLAPDAQLSTEYIYLHLKQFDFSTL--SSTSSIADA 342 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + V+ +P+L+P ++ + + A++ + + L R + + ++ Sbjct: 343 VNSKTVREIPILMPSLEGLTAFQSQVAALFAKLKNTEQHAQ----TLVTLRDTLLPRLIS 398 Query: 417 GQIDLRGESQ 426 GQ+ L E++ Sbjct: 399 GQLRL-PEAE 407 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 58/204 (28%), Gaps = 15/204 (7%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDV--ESGT 60 DS +G +PK W + + + G+T + D+ ++ + D+ Sbjct: 199 DSE---LGLVPKGWATRTMADISTVGIGKTPPRKEQHWFSEDPSDVRWVSIRDMGAVGVY 255 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + + + +L + + I D + + + Sbjct: 256 AAVTSEFLKKEAIEKFNIRRVPDNTVLMS-FKMTIGRVAITDGEMTTNEAIAHFKLAPDA 314 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + L + + + K + IP+ +P L + ++ A + Sbjct: 315 QLSTEYIYLHLKQFDFSTLSSTSSIADAVNSKTVREIPILMPSLEGLTAFQSQVAALFAK 374 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 + + L L+S Sbjct: 375 LKNTEQHAQTLVTLRDTLLPRLIS 398 >gi|294624820|ref|ZP_06703480.1| type I restriction-modification system specificity determinant [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292600884|gb|EFF44961.1| type I restriction-modification system specificity determinant [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 389 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 54/414 (13%), Positives = 129/414 (31%), Gaps = 42/414 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ L G+ ++ GKY N T Sbjct: 3 SEWRDTTWGEEISLEYGKAIRGYDEVR-----------GKYRVFGSNGAIGWTENALAEG 51 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G ++ G+ G Y + + T + V+ K + L + + + I + Sbjct: 52 PG-VILGRKGAYRGVRFWREPFWVIDTAYYVVPKKKLDMRWLYYAIKHHKLGE----IDD 106 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+ + + + +P L EQ I + +I+ ++ ++ Sbjct: 107 GSPIPSTTRAAVYVRELTVPSLKEQGEISYVLGVLDDKIELNRRMNQTLEAMVHALFKS- 165 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + V M++S +G +P W+V F + ++ + Y Sbjct: 166 --WFVDFDGVAPEDMQES---ELGFIPKGWQVIAFGDVAQQVKGTVNPMTSPEETFTHYS 220 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + E+ ++ + P E V + R + ++ ++ Sbjct: 221 LPAFDVAQLPVRELGEAIKSNKTPVPNECVLVSKLNPHIPRIWLIGGAGHNAVCSTEFIV 280 Query: 323 VKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDI 378 P ++ +++ S + + +G Q +K E + + V I Sbjct: 281 WMPKKPANSAFVYVLASSSEFNSALRQLVTGTSNSHQRVKPEQLANIRV----------I 330 Query: 379 T---NVINVETARIDVLVEK---IEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I+ +A+ L+EK L + R + + ++G++ ++ + Sbjct: 331 AVNDEAISKFSAQSKPLMEKLLHHRLQSQQLAQLRDTLLPKLISGEVRIKDAER 384 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 51/148 (34%), Gaps = 14/148 (9%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYL 64 ++S +G IPK W+V+ + G + + + L + Sbjct: 176 EDMQESE---LGFIPKGWQVIAFGDVAQQVKGTVNPMTSPEETFTHYSLPAFDVAQLPVR 232 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLP 121 + S + +L KL P++ + + G +CST+F+V PK Sbjct: 233 ELGEAIK----SNKTPVPNECVLVSKLNPHIPRIWLIGGAGHNAVCSTEFIVWMPKKPAN 288 Query: 122 E-LLQGWLLSIDVTQRIEAICEGATMSH 148 + S + + + G + SH Sbjct: 289 SAFVYVLASSSEFNSALRQLVTGTSNSH 316 >gi|238809963|dbj|BAH69753.1| hypothetical protein [Mycoplasma fermentans PG18] Length = 429 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 52/418 (12%), Positives = 119/418 (28%), Gaps = 47/418 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN-SRQSDTSTV 78 IP++W V ++ G K I + + + ++ N Sbjct: 15 EIPENWAWVRHNNIFEIIGGSQPPKSKFIEHEKQGYIRLYQIRDYGENPNPVYIPSKFAF 74 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQR 136 K IL + G + K A+ V + D + + + Q Sbjct: 75 KQSEKNDILLARYGASIGKVFFAENGAYNVALAKVKKMFINDWINKEFMFIFYKSSIYQT 134 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + + + + N+ MPIP L E I K I+ + + +L Sbjct: 135 LVKNNSRSAQAGFNKDDLKNLFMPIPSLNESSRIVSKWNDLNKLINEYENKENQLFKLDS 194 Query: 197 EK----KQALVSYIVTKGLNPDVK-------------------------MKDSGIEWVGL 227 + +++++ Y + L KD ++ Sbjct: 195 KIKDKLQKSILQYAIQGKLVKQDPNDEPASKLLEAIQIEKNELIKEGKIKKDKQESFIFQ 254 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI---IQKLETRNMGLKPESYETYQ 284 D + + V + + I + NI + ++N + + Sbjct: 255 GEDKNYYEKIGSKVINITNEIPFEIPKKWAWVRQKNILKLTKNEASKNGNYPYLEAKVLR 314 Query: 285 IVDPGEIVFRFIDLQNDKRSLRS--------AQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + +I+ + + + + + G + S + +K + + Sbjct: 315 KIIKPKIINNGVLINKGDIVILVDGENSGETFVLDQTGYMGSTFKLLKINNKIDQEYVLM 374 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + L + L + +P IKEQ +I + +ID + Sbjct: 375 LLKFYKELFKKNKKGAAIPHLNIDIFNNLLLAIPNIKEQKEIILKL----KKIDNFIS 428 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 62/210 (29%), Gaps = 8/210 (3%) Query: 216 KMKDSGIEWVGLVPDHWEVKP---FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 +KD E +P++W F ++ +K IE I+ Sbjct: 4 NIKDITEELPFEIPENWAWVRHNNIFEIIGGSQPPKSKFIEHEKQGYIRLYQIRDYGENP 63 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + S ++ + +I+ K + I+ + Sbjct: 64 NPVYIPSKFAFKQSEKNDILLARYGASIGKVFFAE-NGAYNVALAKVKKMFINDWINKEF 122 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + +S + + +D+K L + +P + E I + N I+ Sbjct: 123 MFIFYKSSIYQTLVKNNSRSAQAGFNKDDLKNLFMPIPSLNESSRIVSKWNDLNKLINEY 182 Query: 393 VEKIEQSIVLLKERR----SSFIAAAVTGQ 418 K Q L + + S + A+ G+ Sbjct: 183 ENKENQLFKLDSKIKDKLQKSILQYAIQGK 212 >gi|158521273|ref|YP_001529143.1| restriction modification system DNA specificity subunit [Desulfococcus oleovorans Hxd3] gi|158510099|gb|ABW67066.1| restriction modification system DNA specificity domain [Desulfococcus oleovorans Hxd3] Length = 385 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 52/400 (13%), Positives = 122/400 (30%), Gaps = 26/400 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 WK P+ + + G+ + + +G + ++ TS Sbjct: 7 SSWKTQPLNQLCLVVMGQAPKGDTYNENTLGTPLIAGAADLGLIHPSPKKWTTSPTKTGK 66 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G I+ + + AD L+ + + L + ++ Sbjct: 67 AGDIILC-VRATIGDLNWADSKYCYGRGVCGLRIIEGHDPEFLWFWLMAC-KDHLLSLGR 124 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GAT I N+P+P + EQ I +I R++ + R + ++L Sbjct: 125 GATFKQISKTDIANLPVPALAVDEQRRIVARIKECMERVEEIEGLRAEAMRERGYLLESL 184 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + E G +V + + + + + I+ + Sbjct: 185 IEAEYQ--------------EADGEKVTLADVCAITSSLVDP--RAPQYIDLLHIGGGNI 228 Query: 263 NIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY- 320 + E ++ + + +++ I K + + G+ ++ Sbjct: 229 EAKTSKLVNLKTARAEKLKSSKFTFNDSMVLYNKIRPYLMKVA----RPGFSGLCSADMY 284 Query: 321 -MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + P + YL +L+ S A + + + + +P ++Q I Sbjct: 285 PLFPAPQKLTRDYLFYLLLSRHFTDYVIAGSNRAGMPKVNRKHLFAYKFTLPSTQKQQQI 344 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 T ++ A ++ L + S + R S + A G+ Sbjct: 345 TESLDDAVAAVEELQTDMAASTSEVNALRQSILHKAFAGE 384 >gi|326319450|ref|YP_004237122.1| restriction modification system DNA specificity domain-containing protein [Acidovorax avenae subsp. avenae ATCC 19860] gi|323376286|gb|ADX48555.1| restriction modification system DNA specificity domain protein [Acidovorax avenae subsp. avenae ATCC 19860] Length = 434 Score = 85.6 bits (210), Expect = 2e-14, Method: Composition-based stats. Identities = 59/428 (13%), Positives = 135/428 (31%), Gaps = 28/428 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDT 75 W+ ++ +L TG+T +S G D+ ++ D S + + + + Sbjct: 3 SEWRTYRLQEVGRLVTGKTPKSGVPAFDGDDVPFVSPPDFTGSKWITKTVRSISEAGAQS 62 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S+ +L +G + KA IA + + Q + + + Sbjct: 63 VKGSLIPPRSVLVTCIGSDMGKAAIAASQCVTNQQINAILVDESRFCPEFVYYNLSLRKD 122 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I ++ G+ + G I + P L Q + + RI L + Sbjct: 123 EIRSLAGGSAQPILNKSAFGQIFLEAPCLEVQRTVSAALRPLDDRITLLRETNATLEAIA 182 Query: 196 KEKKQALV-----SYIVTKGLNPDVKMKDS--------GIEWVGLVPDHWEVKPFFALVT 242 + ++ G P+ + + +G+VP W+V ++ Sbjct: 183 QALFKSWFVDFDPVRAKMAGRAPEGMDEATAALFPDALEETELGIVPKGWQVGVLDSIAA 242 Query: 243 ELNRKNTKLIES-NILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQ 299 + IL + N + + + +++ G+++ + Sbjct: 243 LNPESWSTKHHPDRILYVDLANTKANQIEGITEFRFDDAPSRARRVLREGDVIVGTVRPG 302 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLK 358 N + S T + H D + + + + G +++ Sbjct: 303 NGSFARISVDRAGLTGSTGFAVLRAHHLFDQALVYIAATREESIERLAHLADGGAYPAVR 362 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 E V PV++ P K + V N A+ + ++ L + R + + ++GQ Sbjct: 363 PEVVAGTPVVIAPRKVREAFGGVANHLLAQ----IGGNQEQSRYLGDIRDTLLPRLISGQ 418 Query: 419 IDLRGESQ 426 + L + Sbjct: 419 LRLPEAHE 426 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 66/193 (34%), Gaps = 7/193 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G +PK W+V + LN S I+Y+ L + ++ + + + + + Sbjct: 225 LGIVPKGWQVGVLDSIAALNPESWSTKHHPDRILYVDLANTKANQIEGI-TEFRFDDAPS 283 Query: 76 STVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPEL-LQGWLLSI 131 + +G ++ G + P + + ST F VL+ + + + Sbjct: 284 RARRVLREGDVIVGTVRPGNGSFARISVDRAGLTGSTGFAVLRAHHLFDQALVYIAATRE 343 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +R+ + +G + + P+ I P + +I + Sbjct: 344 ESIERLAHLADGGAYPAVRPEVVAGTPVVIAPRKVREAFGGVANHLLAQIGGNQEQSRYL 403 Query: 192 IELLKEKKQALVS 204 ++ L+S Sbjct: 404 GDIRDTLLPRLIS 416 >gi|94263107|ref|ZP_01286925.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] gi|93456478|gb|EAT06592.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] Length = 456 Score = 85.6 bits (210), Expect = 2e-14, Method: Composition-based stats. Identities = 58/457 (12%), Positives = 131/457 (28%), Gaps = 66/457 (14%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV- 78 W+ P+ + L G +S + I +++V++G + + + D +V Sbjct: 4 EWQEKPLGKVFDLVNGYAFKSKDFSSSGVPVIKIKNVKAGY--FSEHNFSYVSPDFLSVR 61 Query: 79 --SIFAKGQILYGKLG--------PYLRKAIIA--DFDGICST---QFLVLQPKDVLPEL 123 + + +L G ++ K + + KDV P Sbjct: 62 HEKLAQRDDLLISMSGNRHDGSPETWVGKVAHFKRNEPFFINQRVGALRAKNTKDVCPRF 121 Query: 124 LQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + L S D +I + ++ K I +P+P + EQ I + + +++ Sbjct: 122 MSYVLSSWDFQHLFISIATSSGGQANISPKQILGTSVPVPHITEQRAIAHILGSLDDKVE 181 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLN----------------------------PD 214 ++ + ++ N P Sbjct: 182 LNRQMNRTLEQMAQALFKSWFIDFDPVVYNTVQAGHPVPERFRVIAERYRQNPEIQTLPQ 241 Query: 215 VKM----KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 + +G +P WE A + ++ N + + + Sbjct: 242 HILDLFPNHFEDSDLGEIPAGWEAMNVGAKFDVIMGQSPPGQSYNEIGQGLPFFQGRRDF 301 Query: 271 RNMGLKPESY--ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 Y E ++ +PG+ + D R + I AV+ Sbjct: 302 GFRYPTQRVYCTEPKRLANPGDTLISVRAPVGDINMARV-----KCCIGRGVAAVRHKSE 356 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + MR+ Y + S+ + +LP + P ++ Sbjct: 357 SRSFTYYSMRALTEQFSSYEGEGTVFGSINKKQFGKLPHVAPDDDL----IDLFESLVGS 412 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 D +E L R + + ++GQ+ + Sbjct: 413 SDGEIEAHIDEADSLSRIRDTLLPKLISGQLRIPDAE 449 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 66/200 (33%), Gaps = 19/200 (9%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII-QKLETRNMGLKPESY---E 281 G + F LV K+ S + + N+ N + Sbjct: 2 GGEWQEKPLGKVFDLVNGYAFKSKDFSSSGVPVIKIKNVKAGYFSEHNFSYVSPDFLSVR 61 Query: 282 TYQIVDPGEIVFRFID------LQNDKRSLRSAQVMERGIITS---AYMAVKPHGIDSTY 332 ++ +++ + + + E I A A + + Sbjct: 62 HEKLAQRDDLLISMSGNRHDGSPETWVGKVAHFKRNEPFFINQRVGALRAKNTKDVCPRF 121 Query: 333 LAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 +++++ S+D +F ++ SG + ++ + + V VP I EQ I +++ +D Sbjct: 122 MSYVLSSWDFQHLFISIATSSGGQANISPKQILGTSVPVPHITEQRAIAHILGS----LD 177 Query: 391 VLVEKIEQSIVLLKERRSSF 410 VE Q L++ + Sbjct: 178 DKVELNRQMNRTLEQMAQAL 197 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 50/193 (25%), Gaps = 5/193 (2%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS +G IP W+ + + + G++ G + + R Sbjct: 253 DSD---LGEIPAGWEAMNVGAKFDVIMGQSPPGQSYNEIGQGLPFFQGRRDFGFRYPTQR 309 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 T + G L P +A ++ K + + Sbjct: 310 VYCTEPKRLANPGDTLISVRAPV-GDINMARVKCCIGRGVAAVRHKSESRSFTY-YSMRA 367 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 Q EG + K G +P P L + + I+ I E Sbjct: 368 LTEQFSSYEGEGTVFGSINKKQFGKLPHVAPDDDLIDLFESLVGSSDGEIEAHIDEADSL 427 Query: 192 IELLKEKKQALVS 204 + L+S Sbjct: 428 SRIRDTLLPKLIS 440 >gi|226952350|ref|ZP_03822814.1| restriction modification system DNA specificity domain protein [Acinetobacter sp. ATCC 27244] gi|226836902|gb|EEH69285.1| restriction modification system DNA specificity domain protein [Acinetobacter sp. ATCC 27244] Length = 401 Score = 85.6 bits (210), Expect = 2e-14, Method: Composition-based stats. Identities = 59/424 (13%), Positives = 123/424 (29%), Gaps = 50/424 (11%) Query: 13 SGVQWIGAIP--KHWKVV----PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPK 66 S V W P ++W+ + + + K ++ E+V+S + Sbjct: 7 SDVYWFQEGPGVRNWQFKESGIKLLNVANITKQGKIDLNKTDRHLSTEEVDSKYQHF--- 63 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLG----------PYLRKAIIADFDGICSTQFLVLQP 116 + +G ++ G + +T + + Sbjct: 64 -------------LIDEGDLVIASSGITNDEDNLLRTKIAFIEKQHLPLCLNTSTIRFKA 110 Query: 117 KDVLPELLQ--GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 KD + +L WL S++ Q+I G + + I + +PPL EQ I + Sbjct: 111 KDGVSDLKFLKHWLNSLEFRQQITKEVTGIAQKNFGPSHLKKIKISLPPLTEQRRIASIL 170 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 + +LL+ + +P K + +VG + + Sbjct: 171 DQADELRQKRQQAIEKLDQLLQAT-------FIDMFGDPVSNPKGWDLRYVGEISES--- 220 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 ++ + + + + + + L E + G+++ Sbjct: 221 -KLGKMLDKKKQSSEIDQYKYLRNANVQWFRFDLSDVFEMEFNEKDRKNCELKFGDVLVC 279 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GL 353 ++ + + I Y WL Y F + Sbjct: 280 EGGEPGRAAIWKNDLENCFFQKALHRVRLDMTQILPEYFVWLFWFYSKNGGFDDHITVAT 339 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 L +K + + +PP+ Q D + I+VL +E S L + SS Sbjct: 340 IAHLTGVKMKAMQIPIPPLSLQEDF----QQKVNEIEVLKTTLENSSKLFESLFSSLQNQ 395 Query: 414 AVTG 417 A G Sbjct: 396 AFNG 399 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 63/200 (31%), Gaps = 14/200 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 PK W + + ++ G+ + K Y+ +V+ Sbjct: 206 PKGWDLRYVGEISESKLGKMLDKKKQSSEIDQYKYLRNANVQWFRFDLSDVFEMEFNEKD 265 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICST----QFLVLQPKDVLPELLQGWLLSI 131 G +L + G R AI + C + L +LPE Sbjct: 266 RKNCELKFGDVLVCEGGEPGRAAIWKNDLENCFFQKALHRVRLDMTQILPEYFVWLFWFY 325 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + AT++H + + +PIPPL+ Q ++K+ I+ L T Sbjct: 326 SKNGGFDDHITVATIAHLTGVKMKAMQIPIPPLSLQEDFQQKVNE----IEVLKTTLENS 381 Query: 192 IELLKEKKQALVSYIVTKGL 211 +L + +L + L Sbjct: 382 SKLFESLFSSLQNQAFNGTL 401 >gi|182683452|ref|YP_001835199.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae CGSP14] gi|182628786|gb|ACB89734.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae CGSP14] Length = 430 Score = 85.6 bits (210), Expect = 2e-14, Method: Composition-based stats. Identities = 60/427 (14%), Positives = 131/427 (30%), Gaps = 63/427 (14%) Query: 29 PIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++ G + KD I +I + D E G ++S + Sbjct: 2 RFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRF 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEA 139 KG L + R I+ I + ++ L + ++LS + V + + Sbjct: 62 VKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLS 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + GA + + + + +I +P+PPL+EQ I E I + ++D R +L KE Sbjct: 122 LISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFP 181 Query: 200 ----QALVSYIVTKGLNPDVKMKDS----------------------------------- 220 ++++ Y + L +S Sbjct: 182 DKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGD 241 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 + G +P +W V + + + K + +I ++ L Y Sbjct: 242 DNSYYGNIPMNWVVIKIKDIFSINTGLSYKKGDLSINKGVRIIRGGNIKPLEFSLLDNDY 301 Query: 281 --------ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGI 328 + +++ G++ ++ + I Sbjct: 302 YIDTQFISSEQVYLKHNQLITPVSTSIEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEI 361 Query: 329 DSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 S +L + + S K + ++ + L + + P +EQ IT + Sbjct: 362 ISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKL 421 Query: 386 TARIDVL 392 +++ L Sbjct: 422 FEKVNQL 428 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 42 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 98 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 99 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIES 158 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 159 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 196 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 74/183 (40%), Gaps = 16/183 (8%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 247 GNIPMNWVVIKIKDIFSINTGLSYKKGDLSINKGVRIIRGGNIKPLEFSLLDNDYYIDTQ 306 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELL 124 S+ ++ K L + + D+DG+ + F+ + +++ + L Sbjct: 307 FISSEQVYLKHNQLITPVSTSIEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFL 366 Query: 125 QGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L S ++++AI + G + + + + +P+ P EQ LI +K+ +++ Sbjct: 367 LFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVN 426 Query: 183 TLI 185 L Sbjct: 427 QLW 429 >gi|311109505|ref|YP_003982358.1| type I restriction enzyme StySPI specificity protein [Achromobacter xylosoxidans A8] gi|310764194|gb|ADP19643.1| type I restriction enzyme StySPI specificity protein [Achromobacter xylosoxidans A8] Length = 400 Score = 85.6 bits (210), Expect = 2e-14, Method: Composition-based stats. Identities = 48/411 (11%), Positives = 120/411 (29%), Gaps = 36/411 (8%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIY--------IGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + + G + + + ++ + S Sbjct: 6 TRRVGDLCEQLRGVSYSKSDATLSNQAGYKAILRANNITKHGLTFDDLVYVPDA-CISER 64 Query: 79 SIFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQ-GWLLSID 132 G ++ L D VL+P ++ + + Sbjct: 65 QFLKAGDVVIAASSGSLDVVGKAARVENDLAAGFGAFCKVLRPNSLVDAGYFAHFFQTSS 124 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 ++I ++ GA +++ + + N+ + +P L EQ I + + + Sbjct: 125 YRRKISSLAAGANINNLRNEHLDNLEIRVPSLPEQRRIADVLDKADALRAQRRAAITKLD 184 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 EL L S + +P + I G + H K + + + Sbjct: 185 EL-------LQSVFIEMFGDPVTNPRGWAI---GSLNAHGSFKNGLNFGKGESGATVRYV 234 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ--NDKRSLRSAQV 310 + + + + + +++F + R + Sbjct: 235 G--VGDFQSKAALDDFSSLAFIELNDLPAEDYFLHDSDLLFVRSNGNRELVGRCMAVYPG 292 Query: 311 MERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 ME+ + + + + STY+A L RS ++ + G G Q++ + + LP+ Sbjct: 293 MEKVTYSGFCIRYRIADVSLQSTYVAHLFRSVPFRRLIFQGGQGANIQNINQQILSGLPI 352 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +P Q ++ +I + + ++ +S A +G+ Sbjct: 353 PIPDEGLQRQFAAIVE----KIGAQKQIMHRAAEKSNALFASLQHLAFSGK 399 >gi|269101868|ref|ZP_06154565.1| putative type I restriction-modification system subunit S [Photobacterium damselae subsp. damselae CIP 102761] gi|268161766|gb|EEZ40262.1| putative type I restriction-modification system subunit S [Photobacterium damselae subsp. damselae CIP 102761] Length = 421 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 58/416 (13%), Positives = 130/416 (31%), Gaps = 36/416 (8%) Query: 30 IKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA----KG 84 + N G+T + I ++ + Y + SD + + F G Sbjct: 12 LSNIVD-NRGKTCPVGDAGLPLIATNCIKEHSL-YPVYEKVRYVSDETYTNWFRGHPQPG 69 Query: 85 QILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +++ G R + D C + V P+ L L S Q+I + Sbjct: 70 DMIFVCKGSPGRVNWVPDPVNFCIAQDMVAIRADTTKVYPKYLFALLRSQASQQKILNMH 129 Query: 142 EGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ + H GN+ +P L Q + + ++I++ ++ + + Sbjct: 130 VGSLIPHFKKGDFGNLYFELPEDLEYQKKVGDAYFDFCLKIESNNQLNQTLEQMAQAIFK 189 Query: 201 ALV------------SYIVTKGLNPDVKMKDSGIE-WVGLVPDHWEVKPFFALVTELNRK 247 + V + + + +E +GL+P+ WEV ++V + + Sbjct: 190 SWFVDFDPVKAKMNGEQPVGMDADTALLFPEKLVESELGLIPEGWEVGSLSSIVDVIMGQ 249 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSL 305 + K N + +E + + T + +++ + + Sbjct: 250 SPKGTTYNDQGEGTPLVNGPVEFGVYHPVAQKWTTAPTKLSKNKDLIVCVRGSTTGRYVV 309 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + + +L +S+ L + S +K Sbjct: 310 SDGE-----YCLGRGVCSIRSDDSPAFANYLFKSH-LNNLLNLTTGSTFPSWSGPTLKNF 363 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 V+VPP Q I + ++ + L R + + ++G+IDL Sbjct: 364 KVVVPP---QSIIGKF-ETIVGNLCSMMAQNTGENESLSLLRDTLLPKLLSGEIDL 415 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 54/187 (28%), Gaps = 3/187 (1%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G IP+ W+V + + G++ + + +G ++ +++ T+ Sbjct: 227 LGLIPEGWEVGSLSSIVDVIMGQSPKGTTYNDQGEGTPLVNGPVEFGVYHPVAQKWTTAP 286 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + ++ G + +++D + L + Sbjct: 287 TKLSKNKDLIVCVRGSTTGRYVVSDGEYCLGRGVC---SIRSDDSPAFANYLFKSHLNNL 343 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G+T + N + +PP + + + E L Sbjct: 344 LNLTTGSTFPSWSGPTLKNFKVVVPPQSIIGKFETIVGNLCSMMAQNTGENESLSLLRDT 403 Query: 198 KKQALVS 204 L+S Sbjct: 404 LLPKLLS 410 >gi|315225318|ref|ZP_07867134.1| type I restriction-modification system S subunit [Capnocytophaga ochracea F0287] gi|314944727|gb|EFS96760.1| type I restriction-modification system S subunit [Capnocytophaga ochracea F0287] Length = 258 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 57/171 (33%), Gaps = 8/171 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP W+ + G T G I ++ ++ +G + + Sbjct: 86 EIPNGWEWCRLGLIGDWGAGATPLRGNIEYYGGKIPWLKTGELNNGLIISTEEYITDKAL 145 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + ++ + G IL G + K IA + P + + L +L++ Sbjct: 146 EECSLRLCNVGDILIAMYGATIGKLGIAGIKLTTNQACCACTPIFIYNKFLFYFLMAN-- 203 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 Q EG + + N P+PPL EQ I EKI I+ Sbjct: 204 KQSFIEQGEGGAQPNISRIKLVNYLFPLPPLKEQQHIVEKIEELIPHIEHH 254 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 54/182 (29%), Gaps = 13/182 (7%) Query: 218 KDSGIEWVGLVPDHWEVKPF-----FALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 K E +P+ WE + R N + I L G + L Sbjct: 77 KCIDEEIPFEIPNGWEWCRLGLIGDWGAGATPLRGNIEYYGGKIPWLKTGELNNGLIIST 136 Query: 273 MGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + + ++ + G+I+ K + + A A P I Sbjct: 137 EEYITDKALEECSLRLCNVGDILIAMYGATIGKLGIA----GIKLTTNQACCACTPIFIY 192 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + +L + + + G + ++ + +PP+KEQ I I I Sbjct: 193 NKFLFYFLMANK-QSFIEQGEGGAQPNISRIKLVNYLFPLPPLKEQQHIVEKIEELIPHI 251 Query: 390 DV 391 + Sbjct: 252 EH 253 >gi|302336434|ref|YP_003801641.1| restriction modification system DNA specificity domain protein [Olsenella uli DSM 7084] gi|301320274|gb|ADK68761.1| restriction modification system DNA specificity domain protein [Olsenella uli DSM 7084] Length = 478 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 67/442 (15%), Positives = 140/442 (31%), Gaps = 73/442 (16%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P W+ + + ++TG + + + + ++ +G + Sbjct: 36 DLPDGWEWARLGSISLGISTGPFGSALHKGDYVSRGVPIVNPANISNGLITPTSFVSEAT 95 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPK---DVLPELLQG 126 + S+ I + G ++ G+ G R A++ D +C T + + ++ LL Sbjct: 96 RERLSSY-ILSLGDLVIGRRGEMGRVAVVGDECVGWLCGTGCFIARCPGGGELSSNLLSL 154 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S +E G TM + + +G + +P+PPLAEQ I + +D + Sbjct: 155 VFSSTYTKAFLEENAIGTTMKNLSREILGEVLVPVPPLAEQRRIVVALDELLGLVDEVER 214 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVK------------------------MKDSGI 222 + LL + ++ + L P ++ + Sbjct: 215 SQAELEGLLDRARAKVLDLAIRGRLVPQDPSDEPAEALLARVREERLLMAADGRLRRRDV 274 Query: 223 E-----WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 E + G ++E V + L E + + +R + Sbjct: 275 EGDSVIFRGEDNSYYEKVGGAEPVCVDSELPFDLPEGWEWARLPSLFVIDPRSRQDDVAL 334 Query: 278 ESYETYQIVDPG-------------------------EIVFRFIDLQNDKRSLRSAQVME 312 S+ +DPG +++F I + R A+ +E Sbjct: 335 VSFAPMASIDPGFTSHVKYEVRPWGEVKRGFTHFEEGDVLFAKISPCFENRKSFVAESLE 394 Query: 313 R---GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPV 367 T + G+ + ++S MG+ +Q +K E + + Sbjct: 395 NKHGAGTTELIVLRCICGMTPWFALCFLKSPTFIDAAKGTFMGTVGQQRVKREFIDSVLF 454 Query: 368 LVPPIKEQFDITNVINVETARI 389 VPP+ EQ I + I Sbjct: 455 PVPPLSEQARIAKSASKLLDSI 476 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 75/229 (32%), Gaps = 28/229 (12%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRK-------NTKLIESNILSLSYGNIIQKLETRNMGL 275 E +PD WE ++ ++ + + ++ NI L T + Sbjct: 32 ELPFDLPDGWEWARLGSISLGISTGPFGSALHKGDYVSRGVPIVNPANISNGLITPTSFV 91 Query: 276 KPESYE--TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + E + I+ G++V + V G S+ L Sbjct: 92 SEATRERLSSYILSLGDLVIGRRGEMGRVAVVGDECVGWLCGTGCFIARCPGGGELSSNL 151 Query: 334 AWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L+ S K F ++L E + + V VPP+ EQ I ++ +D Sbjct: 152 LSLVFSSTYTKAFLEENAIGTTMKNLSREILGEVLVPVPPLAEQRRIVVALDELLGLVDE 211 Query: 392 LVEKIEQSIV-LLKERRSSFIAAAVTGQI---------------DLRGE 424 VE+ + + LL R+ + A+ G++ +R E Sbjct: 212 -VERSQAELEGLLDRARAKVLDLAIRGRLVPQDPSDEPAEALLARVREE 259 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 61/169 (36%), Gaps = 10/169 (5%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS + + +P+ W+ + ++ + + + + ++ G ++ + Sbjct: 301 DSELPF--DLPEGWEWARLPSLFVIDPRSRQDDVALVSFAPMASIDPGFTSHVKYEVRPW 358 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAI------IADFDGICSTQFLV-LQPKDVLPELL 124 + F +G +L+ K+ P + + G +T+ +V + P Sbjct: 359 GEVKRGFTHFEEGDVLFAKISPCFENRKSFVAESLENKHGAGTTELIVLRCICGMTPWFA 418 Query: 125 QGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIRE 172 +L S + G + I ++ P+PPL+EQ I + Sbjct: 419 LCFLKSPTFIDAAKGTFMGTVGQQRVKREFIDSVLFPVPPLSEQARIAK 467 >gi|305665032|ref|YP_003861319.1| DNA-methyltransferase, type I restriction-modification enzyme subunit M [Maribacter sp. HTCC2170] gi|88709784|gb|EAR02016.1| DNA-methyltransferase, type I restriction-modification enzyme subunit M [Maribacter sp. HTCC2170] Length = 707 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 41/291 (14%), Positives = 91/291 (31%), Gaps = 11/291 (3%) Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 D +E I ++ I L EK+ +T+I+ Sbjct: 410 NDNDFKDFLEKIRNKEIGKNSWIIKAHEIDENTCNLTPINPNEEKLDKILSP-NTIISAV 468 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 ++ + Q + + I + ++ +K+ G W F + K Sbjct: 469 SKYSMDFNSELQKIKTNIDSYLSEVNLMLKNEGGIWKEERFGDVCE--FVRGPFGGSLKK 526 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSL 305 + +E I + I + + G+++ + Sbjct: 527 SIFVEKGIAVYEQQHAINNQFEHVRYYINQDKFNEMKRFELKSGDLIMSCSGTMGKVAIV 586 Query: 306 RSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMG-SGLRQSL-KFEDV 362 + E+GII A + + P+ ID +L + M S + +++ + + Sbjct: 587 PKS--FEKGIINQALLKLSPNPSIDVNFLKYWMESKVFKLKIEELSMGAAIKNVASVKIL 644 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 K + V +PPI+ Q I I+ + + + L++ S + Sbjct: 645 KEIMVPIPPIEIQKRIIQRIDSLVNSFEDAILITRNQLNHLEDLGESLLQE 695 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 66/194 (34%), Gaps = 11/194 (5%) Query: 25 WKVVPIKRFTKLNTG-------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 WK + G ++ K I + + +++ N + + Sbjct: 504 WKEERFGDVCEFVRGPFGGSLKKSIFVEKGIAVYEQQHAINNQFEHVRYYINQDKFNEMK 563 Query: 78 VSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSID-VT 134 G ++ G + AI+ GI + L L P + + + Sbjct: 564 RFELKSGDLIMSCSGTMGKVAIVPKSFEKGIINQALLKLSPNPSIDVNFLKYWMESKVFK 623 Query: 135 QRIEAICEGATMSHA-DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +IE + GA + + K + I +PIPP+ Q I ++I + + I + Sbjct: 624 LKIEELSMGAAIKNVASVKILKEIMVPIPPIEIQKRIIQRIDSLVNSFEDAILITRNQLN 683 Query: 194 LLKEKKQALVSYIV 207 L++ ++L+ Sbjct: 684 HLEDLGESLLQETF 697 >gi|37678998|ref|NP_933607.1| restriction endonuclease S subunit [Vibrio vulnificus YJ016] gi|37197740|dbj|BAC93578.1| restriction endonuclease S subunit [Vibrio vulnificus YJ016] Length = 433 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 49/378 (12%), Positives = 119/378 (31%), Gaps = 25/378 (6%) Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK----LGPYLRKAIIADFDG 105 + L D+ ++ + + + + +G +++ + + + + +G Sbjct: 61 FSTLFDITKEYVPFINTEISLDKVKEESY--CQEGDMVFADASEDIDDVGKSIELINLNG 118 Query: 106 I-----CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMP 160 T + D++ S + ++I+ +GA + I NI + Sbjct: 119 EKLLSGLHTILARPKKSDLVKGFGGYLFKSEVMRKQIQKESQGAKVLGISASRISNIEVI 178 Query: 161 IPPLAE-QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 P + Q I + + + I T + K Q L + P+++ Sbjct: 179 YPIDHDEQQKIADCLSSMDDLITTNTKKLELLKLHKKGLLQKLFTA--EGKDIPELRFDG 236 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE- 278 EW + E++ L++ L + S +L L N+ + E Sbjct: 237 FEGEW-----EEVELRKLGDLISGLTYSPDDVRASGLLVLRSSNVQNGKIVYGDNVFVEP 291 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + I +P +I+ + + + T + + L + Sbjct: 292 NIKGANISEPDDILICVRNGSKALIGKNALIPQNVPLSTHGAFMTIFRSKYAQFTFQLFQ 351 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIE 397 + K A S+ + + + +P E+ I + + +D L+ Sbjct: 352 TNAYQKQVDADLGATINSINGKQLLKYKFKIPRSNDEKEKIVKCL----SSLDDLINAQT 407 Query: 398 QSIVLLKERRSSFIAAAV 415 I +LKE + + Sbjct: 408 DKIEVLKEYKKGLMQQLF 425 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 62/202 (30%), Gaps = 17/202 (8%) Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA-------LVTELNRKNTKLIESN 255 +S + K L P +++ S EW + + KN + + Sbjct: 1 MSKLEFKELVP--ELRFSQTEWQKKPFNKLYTLKVTNSLSRDKLNYDDGLVKNIHYGDIH 58 Query: 256 ILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV---M 311 + +I + + N + + + G++VF D + Sbjct: 59 TKFSTLFDITKEYVPFINTEISLDKVKEESYCQEGDMVFADASEDIDDVGKSIELINLNG 118 Query: 312 ERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVL 368 E+ + + +P D +L +S + K G + + + + V+ Sbjct: 119 EKLLSGLHTILARPKKSDLVKGFGGYLFKSEVMRKQIQKESQGAKVLGISASRISNIEVI 178 Query: 369 VP-PIKEQFDITNVINVETARI 389 P EQ I + ++ I Sbjct: 179 YPIDHDEQQKIADCLSSMDDLI 200 Score = 47.9 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 66/204 (32%), Gaps = 20/204 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKD 67 IP+ W+ V +++ L +G T ++ + +V++G Y Sbjct: 228 DIPELRFDGFEGEWEEVELRKLGDLISGLTYSPDDVRASGLLVLRSSNVQNGKIVYGDNV 287 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 +I IL + K + + ST + Sbjct: 288 FVEPNIK--GANISEPDDILICVRNGSKALIGKNALIPQNVPLSTHGAFMTIFRSKYAQF 345 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 L + Q+ GAT++ + K + IP ++ K +D L Sbjct: 346 TFQLFQTNAYQKQVDADLGATINSINGKQLLKYKFKIPRSNDEKEKIVKC---LSSLDDL 402 Query: 185 ITERIRFIELLKEKKQALVSYIVT 208 I + IE+LKE K+ L+ + Sbjct: 403 INAQTDKIEVLKEYKKGLMQQLFP 426 >gi|294780396|ref|ZP_06745763.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis PC1.1] gi|294452525|gb|EFG20960.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis PC1.1] Length = 364 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 49/375 (13%), Positives = 132/375 (35%), Gaps = 29/375 (7%) Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI-------IAD 102 YI D+ + + ++ N ++ G ++ + Sbjct: 3 YIHYGDIHTKKADKVSENSNIPNIIKKNFALLEIGDLILTDASEDYKGIATPAVIRENTS 62 Query: 103 FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP 162 FD + + L+PK++ P L + + + + G + + + IP Sbjct: 63 FDIVAGLHTIALRPKNIDPMFLYYLIKAPTFRKYGYKVGTGMKVFGISSSKVLDFTTYIP 122 Query: 163 PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI 222 E L+ + +ID + R ++ LKE K+A + + K +++ + Sbjct: 123 KNDETKLVSSFL----EKIDYALDLHQRKLDQLKELKKAYLQLMFPKKDETVPQVRFADF 178 Query: 223 EWVGLVPDHWEVKPFFA--LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-PES 279 E D W++ + + + + +S + +S NI + + + E Sbjct: 179 E------DDWQLCKLGDVVEIFDGTHQTPRYTDSGVKFVSVENIATLETKKYITHEAYEK 232 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + + G+I+ I D +++ + E +K + +L++++ S Sbjct: 233 EYSKKRAKKGDILMTRIG---DIGTMKVIETDEPLAYYVTLALLKAKETNPYFLSFIISS 289 Query: 340 YDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 ++ + + + + ++ ++ + + +EQ I + +D + + Sbjct: 290 PEIQRNIWKRTLHIAFPKKINLGEINQVEMKITIFEEQDKIGD----LFTNLDDAIILNQ 345 Query: 398 QSIVLLKERRSSFIA 412 + LK + S++ Sbjct: 346 NKLNQLKSLKKSYLQ 360 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 61/164 (37%), Gaps = 8/164 (4%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS---LRSAQ 309 I + N + + + +++ G+++ + + Sbjct: 1 MKYIHYGDIHTKKADKVSENSNIPNIIKKNFALLEIGDLILTDASEDYKGIATPAVIREN 60 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVL 368 + +A++P ID +L +L+++ K Y +G+G+ + V Sbjct: 61 TSFDIVAGLHTIALRPKNIDPMFLYYLIKAPTFRKYGYKVGTGMKVFGISSSKVLDFTTY 120 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P E +++ + +ID ++ ++ + LKE + +++ Sbjct: 121 IPKNDETKLVSSFLE----KIDYALDLHQRKLDQLKELKKAYLQ 160 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 58/185 (31%), Gaps = 7/185 (3%) Query: 24 HWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W++ + ++ G + + ++ +E++ + K + + Sbjct: 181 DWQLCKLGDVVEIFDGTHQTPRYTDSGVKFVSVENIATLETK--KYITHEAYEKEYSKKR 238 Query: 81 FAKGQILYGKLGPYL-RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 KG IL ++G K I D +L+ K+ P L + S ++ + I Sbjct: 239 AKKGDILMTRIGDIGTMKVIETDEPLAYYVTLALLKAKETNPYFLSFIISSPEIQRNIWK 298 Query: 140 ICEGATMS-HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + I + M I EQ I + I + + L K Sbjct: 299 RTLHIAFPKKINLGEINQVEMKITIFEEQDKIGDLFTNLDDAIILNQNKLNQLKSLKKSY 358 Query: 199 KQALV 203 Q + Sbjct: 359 LQNMF 363 >gi|325110948|ref|YP_004272016.1| restriction modification system DNA specificity domain protein [Planctomyces brasiliensis DSM 5305] gi|324971216|gb|ADY61994.1| restriction modification system DNA specificity domain protein [Planctomyces brasiliensis DSM 5305] Length = 436 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 57/435 (13%), Positives = 128/435 (29%), Gaps = 42/435 (9%) Query: 27 VVPIKRF-----TKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + ++ TG + I + +V G + K S Sbjct: 6 TTTLGELLDNYGGEIKTGPFGTKLRAAEYTPTGVPVISVGEVGYGRLRLHDKTPRVDTSV 65 Query: 75 TST--VSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLL 129 T+ + G I++G+ G R A + D + S V P + L Sbjct: 66 TNRMPEYLLRYGDIVFGRKGAVDRSARVQVDQDGWFLGSDGIRVRLPSTCDSAFIAYQLQ 125 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + G+TM + I IP+ +P + EQ I +++ +I+ Sbjct: 126 VQAHRDWMIQHAAGSTMPSLNEGIIRRIPIVLPSIEEQRAITAVLVSLDDKIEQNRRTGA 185 Query: 190 RFIELLKEKKQALVSYI-----------VTKGLNPDVKMKDSG---IEWVGLVPDHWEVK 235 + EL + + G+ P+ K +G VP+ WEVK Sbjct: 186 KLEELARAVFKGWFVDFEPVKAKAAGATAFPGMLPETFAKLPSRFVDSELGPVPEGWEVK 245 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN-------MGLKPESYETYQIVDP 288 P +VT + + + + + + + Sbjct: 246 PIGDVVTVRGGGTPSTKNESFWTDGTHCWATPKDLSSLQHPVLLSTGRRITTAGVAKISS 305 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G + + L + A + ++A++ +G + + + ++ Sbjct: 306 GLLPIDTVLLSSRAPVGYLALAKVPTAVNQGFIAIECNGPLTPHYVLHWLDSSMEEIKGR 365 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + +P +VP + + + + L+ + + L R Sbjct: 366 ASGTTFAEISKSAFRPIPAIVPTSEMTQAF----DDDVKPLFDLITNLVADSMKLATMRD 421 Query: 409 SFIAAAVTGQIDLRG 423 + ++G + + Sbjct: 422 YLLPRLLSGHVRITP 436 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 60/207 (28%), Gaps = 15/207 (7%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYI-------GLEDVESGTGKY- 63 DS +G +P+ W+V PI + G T + + + +D+ S Sbjct: 232 DSE---LGPVPEGWEVKPIGDVVTVRGGGTPSTKNESFWTDGTHCWATPKDLSSLQHPVL 288 Query: 64 --LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + + + + +L P +A + F+ ++ L Sbjct: 289 LSTGRRITTAGVAKISSGLLPIDTVLLSSRAPV-GYLALAKVPTAVNQGFIAIECNGPL- 346 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + I+ G T + IP +P + + I Sbjct: 347 TPHYVLHWLDSSMEEIKGRASGTTFAEISKSAFRPIPAIVPTSEMTQAFDDDVKPLFDLI 406 Query: 182 DTLITERIRFIELLKEKKQALVSYIVT 208 L+ + ++ + L+S V Sbjct: 407 TNLVADSMKLATMRDYLLPRLLSGHVR 433 >gi|172040944|ref|YP_001800658.1| type I restriction-modification system, specificity subunit [Corynebacterium urealyticum DSM 7109] gi|171852248|emb|CAQ05224.1| type I restriction-modification system, specificity subunit [Corynebacterium urealyticum DSM 7109] Length = 411 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 54/404 (13%), Positives = 121/404 (29%), Gaps = 28/404 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ +++ + +S ++ I D + D + S + Sbjct: 17 EWEEKTVQQISVPVARVNPDSTAPVMMISAADGFINQSEKYSSDNAGKS--LSKYIELHQ 74 Query: 84 GQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G++ Y K+ P+ + + + + + P L V ++E Sbjct: 75 GELAYNHGASKIRPFGSCFELRESAARVPFVYHCFRVPEEHPTFTSYSLNRKSVQSQLER 134 Query: 140 ICEGATMS----HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + G + P L EQ I I+ + + Sbjct: 135 LVSSGARMDGLLNISFPQYGTVTAYFPTLEEQQAIGAIFTNLDAAINQHSKKHQALQQAK 194 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKL 251 Q + P+++++ EW +G + F + Sbjct: 195 TALMQRMFPQ--EGQTVPELRLEGFDGEWKTTTLGELGSFKSGVGFPEREQGGDTGLPFY 252 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE--IVFRFIDLQNDKRSLRSAQ 309 S++ + GN +Q + + + + I I+F + R A Sbjct: 253 KVSDLS--APGNELQLRSANHYVTEEQIVRNHWIPVTAVPAILFAKVGAAVFLGRKRLAT 310 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 ++ D + +++ DL + SG SL + + Sbjct: 311 DTFLLDNNLMAFSLDTKSWDVQFADTYLKTVDLTRF---TQSGALPSLNARHLAEAAATI 367 Query: 370 PP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 PP ++EQ I R+D L+ + I LK+ +++ + Sbjct: 368 PPTLEEQQAIG----AVFTRLDTLIATEAKYIESLKQTKTALLQ 407 >gi|126657630|ref|ZP_01728785.1| type II restriction-modification enzyme [Cyanothece sp. CCY0110] gi|126621086|gb|EAZ91800.1| type II restriction-modification enzyme [Cyanothece sp. CCY0110] Length = 1307 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 56/400 (14%), Positives = 131/400 (32%), Gaps = 34/400 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W + + ++ G T D +++ + ++ + + S V Sbjct: 933 WNLYRLGDIVEVKIGGTPPRENSDYFKGDNLWVSISEMNGQIIIDTKEKITDQGVKDSNV 992 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + KG L + K IA D + L P + + L ++ Sbjct: 993 KLIPKGTTLLS-FKLSIGKTAIAGKDLYTNEAIAGLIP--LDKNQVLDLFLFHIFNAKLI 1049 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + G + + + I+E+I+ ID + I+ KEK Sbjct: 1050 NLENVGLNTFGKSLNSGFLKKDVKIPLPPLEIQEEIVKACQAIDEEFEKVETMIKKEKEK 1109 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + L + + P ++ + +P + + S+ Sbjct: 1110 IEKLANQ--QYEMYPKYQLG-----NLSSMPQYGANEKAING----------NKISDYRY 1152 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + +I + N E E I++ G+ +F K L +Q + I Sbjct: 1153 IRITDINEDGSLNNDFKTAEKIEDKYILEDGDFLFARSGNTVGKTFLYQSQ-YGKAIFAG 1211 Query: 319 AYMAV--KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQ 375 + I YL + +S K + +G + ++ + L + +P +++Q Sbjct: 1212 YLIRFKLMQDRILPKYLEIVTKSSIYKKWIEDVQTGSSQPNINGQIYSSLEIPLPELQKQ 1271 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I + ++ + +V +++ + I + +R+ S I + Sbjct: 1272 QKIISEVD----KCEVKIKESQTIINSIAKRKESVIYKYL 1307 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 18/185 (9%), Positives = 49/185 (26%), Gaps = 4/185 (2%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 L P+ K +W N K + I Sbjct: 920 LTPNKKNITFNTKWNLYRLGDIVEVKIGGTPPRENSDYFKGDNLWVSISEMNGQIIIDTK 979 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + +++ G + F I + + + + + Sbjct: 980 EKITDQGVKDSNVKLIPKGTTLLSFKLSIGKTAIAGKDLYTNEAI--AGLIPLDKNQVLD 1037 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR-LPVLVPPIKEQFDITNVINVETARI 389 +L + + + + + +SL +K+ + + +PP++ Q +I Sbjct: 1038 LFLFHIFNAKLINLENVGLNTFG-KSLNSGFLKKDVKIPLPPLEIQEEIVKACQAIDEEF 1096 Query: 390 DVLVE 394 + + Sbjct: 1097 EKVET 1101 >gi|258539029|ref|YP_003173528.1| type I restriction enzyme, specificity protein [Lactobacillus rhamnosus Lc 705] gi|257150705|emb|CAR89677.1| Type I restriction enzyme, specificity protein [Lactobacillus rhamnosus Lc 705] Length = 393 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 56/396 (14%), Positives = 109/396 (27%), Gaps = 34/396 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + D + + E G+ N Q + G Sbjct: 20 WEQRKVSELAD---------RYDNHRVPITASERVAGRTPYYGANGIQDHVEGFT--HDG 68 Query: 85 Q-ILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + IL + G D + VLQ K+ +L++ IE Sbjct: 69 EFILVAEDGANDLQNYPVQYVDGKVWVNNHAHVLQAKEE--TADNKFLMNALKHTNIEPY 126 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G + + + I +P L EQ+ I + IT R +ELLK KQ Sbjct: 127 LVGGGRAKLNADVMMKIDFKVPTLPEQIQIGKFFDNLDHL----ITLHQRKLELLKRLKQ 182 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS-- 258 + + + +++ G E+ T K Sbjct: 183 GYLQKLFPQNGENVPELRFKGYSDAWEKRKLGEISDIRGGGTPSTSKPEYWDGEIDWYAP 242 Query: 259 --LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + + + L + I+F + L + G Sbjct: 243 AEIGTQRYVSGSRRQITNLGLNKSSATMLPANKTILFTSRAGIGNAAILTKS-----GAT 297 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + ++ Y + K + + + ++ + +P KEQ Sbjct: 298 NQGFQSIVVEPATDVYFLYSEIPEIKRKAIRLAAGSTFLEISGKSLSKIQIWLPSFKEQS 357 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I + +ID L+ + LLK+ + + + Sbjct: 358 RIGH----LFLQIDNLIAATQHKENLLKKIKQACLQ 389 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 48/129 (37%), Gaps = 5/129 (3%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 GE + D ND ++ V + + + ++ + +LM + + Sbjct: 66 HDGEFILVAEDGANDLQNYPVQYVDGKVWVNNHAHVLQAKEETADN-KFLMNALKHTNIE 124 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + G R L + + ++ VP + EQ I + +D L+ ++ + LLK Sbjct: 125 PYLVGGGRAKLNADVMMKIDFKVPTLPEQIQIGKFFD----NLDHLITLHQRKLELLKRL 180 Query: 407 RSSFIAAAV 415 + ++ Sbjct: 181 KQGYLQKLF 189 >gi|325980940|ref|YP_004293342.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] gi|325530459|gb|ADZ25180.1| restriction modification system DNA specificity domain [Nitrosomonas sp. AL212] Length = 575 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 69/474 (14%), Positives = 130/474 (27%), Gaps = 98/474 (20%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P+ W+ V L GR ++ + Y+ +E ++ G + D QS+ Sbjct: 102 ELPEGWEWVRNGFLFTLRKGRIPKNLSENNIGLPYLDIEALDRGVVRRYTDDDKCPQSNE 161 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S IL G I+ GI + V+ + L+ + Sbjct: 162 S--------DILVVCDGSRSG-LILDGKVGIIGSTLSVIDTPVFIQS--FVRLIFKQGYE 210 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE------------------ 177 R+ A +GA + H D + + + PPLAEQ I K+ Sbjct: 211 RLNATMKGAAIPHLDTQKLAFGVIGFPPLAEQHRIVAKVDKLMTLCDQLETQHNNAAKAY 270 Query: 178 -----------------------TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 RI + KQ L+ V L P Sbjct: 271 EKLVSHLLDTLTQSQNAEDFGANWQRIAAHFDTLFTTETSIDALKQTLLQLAVMGKLVPQ 330 Query: 215 VKMKD-------------------------------SGIEWVGLVPDHWEVKPFFALVTE 243 + + E +P WE + Sbjct: 331 DPNDEPASELLKRIQAEKARLVAEGKIKKDKPLPPITEEEKPFKLPRGWEWVRLGTITEI 390 Query: 244 LNRKNTKL------IESNILSLSYGNIIQKLETRNMGLKPESYE----TYQIVDPGEIVF 293 K + + + ++ + +S + I+ +I Sbjct: 391 KGGKRVSNGFQLLTQPTPHIYIRVSDMKDGSIDDSDLRYIDSEMHGKISRYIITKDDIYI 450 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSG 352 + K + + + + +A + GI+ +L + S F+ Sbjct: 451 TIVGATIGKCGVVPEKFDQMNLTENAARLIPLRGIEKIFLYKCLDSPICQSQFFDKTKQV 510 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 Q + + + +P EQ I ++ D L +I Q+ L K+ Sbjct: 511 GVQKMALNRLASTIIFLPSRAEQIRIITKVDELMILCDQLKSRITQASQLQKKL 564 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 71/200 (35%), Gaps = 12/200 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLP-KDGNSR 71 +P+ W+ V + T++ G+ +G IYI + D++ G+ + +S Sbjct: 374 KLPRGWEWVRLGTITEIKGGKRVSNGFQLLTQPTPHIYIRVSDMKDGSIDDSDLRYIDSE 433 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAII----ADFDGICSTQFLVLQPKDVLPELLQGW 127 + I K I +G + K + D + ++ + + L Sbjct: 434 MHGKISRYIITKDDIYITIVGATIGKCGVVPEKFDQMNLTENAARLIPLRGIEKIFLYKC 493 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S + + + + + + +P AEQ+ I K+ + D L + Sbjct: 494 LDSPICQSQFFDKTKQVGVQKMALNRLASTIIFLPSRAEQIRIITKVDELMILCDQLKSR 553 Query: 188 RIRFIELLKEKKQALVSYIV 207 + +L K+ +V + Sbjct: 554 ITQASQLQKKLADVVVEQAI 573 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 56/187 (29%), Gaps = 8/187 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE RK + ++ + + R + + Sbjct: 95 SDEEKPFELPEGWEWVR--NGFLFTLRKGRIPKNLSENNIGLPYLDIEALDRGVVRRYTD 152 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + + +I+ ++ + GII S + +++ + + Sbjct: 153 DDKCPQSNESDILVVCDGSRSGLI-----LDGKVGIIGSTLSVIDTPVFIQSFVRLIFKQ 207 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 ++ M L + + + PP+ EQ I ++ D L + + Sbjct: 208 -GYERLNATMKGAAIPHLDTQKLAFGVIGFPPLAEQHRIVAKVDKLMTLCDQLETQHNNA 266 Query: 400 IVLLKER 406 ++ Sbjct: 267 AKAYEKL 273 >gi|86146743|ref|ZP_01065063.1| type I restriction-modification system, S subunit, EcoA family protein [Vibrio sp. MED222] gi|85835393|gb|EAQ53531.1| type I restriction-modification system, S subunit, EcoA family protein [Vibrio sp. MED222] Length = 417 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 53/428 (12%), Positives = 126/428 (29%), Gaps = 55/428 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W + +L G K I G + + V I + Sbjct: 3 SEWIQSELGDVIELKRGYDLPKTKRI-----------DGNVPVISSSGHSGFHNEVKIKS 51 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G ++ G+ G + I + +T V K P + +L ++ + Sbjct: 52 PG-VVTGRYGTIGQVFYIEEDFWPLNTTLYVKDFKGNDPLFIYYYLKTVSYKDYTDK--- 107 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 ++ L + + K+ +D IT + + L++ QAL Sbjct: 108 ----GAVPGVNRNDLHRAKVLLPKCPKYQNKLAIHLRDLDRKITLNNQINQTLEQMAQAL 163 Query: 203 --------------VSYIVTKGLNPDVKMKDSG--------IEWVGLVPDHWEVKPFFAL 240 ++ KG++ +K+ +GL+P+ W + Sbjct: 164 FKSWFVDFDPVKAKMNGAQPKGMDAPFLLKEVASLFPEKLVESELGLIPEGWSQGVIADI 223 Query: 241 VT---ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + K + + + + L+ + +I++ G+ + + Sbjct: 224 AKLNAKSWTKKNQPEQVHYVDLANTKNGVIETVTSYDFSEAPSRARRILNSGDTIVGTVR 283 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD--LCKVFYAMGSGLRQ 355 N + + ++ + + P T +L + D + + G Sbjct: 284 PGNRSFAF-IGDTEQPLTGSTGFAVLSPKEECWTSFVYLATTNDDSIDEYARLADGGAYP 342 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL--LKERRSSFIAA 413 ++K V P ++P + + + + L + R + + Sbjct: 343 AIKPVVVADTPCVIPTKDVAQKFWQLTEAMLKK------AHQNRLENEVLAKLRDTLLPK 396 Query: 414 AVTGQIDL 421 ++G+IDL Sbjct: 397 LLSGEIDL 404 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 60/193 (31%), Gaps = 7/193 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G IP+ W I KLN ++ + + Y+ L + ++G + + S Sbjct: 208 LGLIPEGWSQGVIADIAKLNAKSWTKKNQPEQVHYVDLANTKNGVIETVTSYDFSEAPS- 266 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPK-DVLPELLQGWLLSI 131 I G + G + P R + ST F VL PK + + + Sbjct: 267 RARRILNSGDTIVGTVRPGNRSFAFIGDTEQPLTGSTGFAVLSPKEECWTSFVYLATTND 326 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 D + +G + + P IP + A + E Sbjct: 327 DSIDEYARLADGGAYPAIKPVVVADTPCVIPTKDVAQKFWQLTEAMLKKAHQNRLENEVL 386 Query: 192 IELLKEKKQALVS 204 +L L+S Sbjct: 387 AKLRDTLLPKLLS 399 >gi|315038270|ref|YP_004031838.1| specificity determinant HsdS [Lactobacillus amylovorus GRL 1112] gi|312276403|gb|ADQ59043.1| putative specificity determinant HsdS [Lactobacillus amylovorus GRL 1112] Length = 402 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 55/406 (13%), Positives = 139/406 (34%), Gaps = 38/406 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + +G + E I + GKY+ + + ++ + Sbjct: 20 DWEQRKLDDTITHKSGTSIEKYFSSKGLYKVISIGS-YGSNGKYIDQGIRAAANEKTNSH 78 Query: 80 IFAKGQILYGKLGPYLRKAI------IADFDGICSTQ--FLVLQPKDVLPELLQGWLLSI 131 + KG++ K I + + + + + L P+ +L Sbjct: 79 LIKKGELSMVLNDKTGGKIIGRVLLIEKNNQYVVNQRSEIIKLSTSLWDPQFAFTYLNGP 138 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +++ I +G T ++ ++ + + + + EQ ++I ++D +I + R Sbjct: 139 -FRKKVLRIMQGGTQNYVNFSSVKKLTASLTSVKEQ----KEIGTLFQKLDNIIILQQRK 193 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ L++ KQ L +++ + ++ +G E + W+ + ++ Sbjct: 194 LKELQQVKQTLSQFLLNGNTHTRPTLRLNGFEDI------WKENKLKDIAKISMGQSPSS 247 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 N I + N G++P + E ++ + +I+ + Sbjct: 248 NNYNKKGNGKILIQGNADIDNGGIRPRVWTTEITKLANKNDILLTVRAPVGELAITNREV 307 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 V+ RGI + L++ +S+ D+K+L V + Sbjct: 308 VIGRGIAA--------IKGNKFIYNLLVQKNKEHFWDRISSGSTFKSISSNDIKQLKVYI 359 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P KE+ I +V+ +ID + I ++ + + + Sbjct: 360 PSEKEETLIASVLETIKNKID----FQNERINVINKLKKYLLTNLF 401 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 66/209 (31%), Gaps = 11/209 (5%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ K +W D + + ++S+ K + Sbjct: 10 PVLRFKGFTDDWEQRKLDDTITHKSGTSIEKYFSSKGLYK---VISIGSYGSNGKYIDQG 66 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG--IITS--AYMAVKPHGI 328 + ++ GE+ D K R + + ++ + + Sbjct: 67 IRAAANEKTNSHLIKKGELSMVLNDKTGGKIIGRVLLIEKNNQYVVNQRSEIIKLSTSLW 126 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 D + + KV M G + + F VK+L + +KEQ +I + Sbjct: 127 DPQFAFTYLNGPFRKKVLRIMQGGTQNYVNFSSVKKLTASLTSVKEQKEIG----TLFQK 182 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +D ++ ++ + L++ + + + G Sbjct: 183 LDNIIILQQRKLKELQQVKQTLSQFLLNG 211 >gi|116871899|ref|YP_848680.1| type I restriction endonuclease S subunit [Listeria welshimeri serovar 6b str. SLCC5334] gi|116740777|emb|CAK19897.1| type I restriction endonuclease S subunit domain protein [Listeria welshimeri serovar 6b str. SLCC5334] Length = 402 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 61/401 (15%), Positives = 133/401 (33%), Gaps = 35/401 (8%) Query: 25 WKVVP-IKRFTKLNTGRTSESG------KDIIYIGLEDVESGTG--KYLPKDGNSRQSDT 75 W+ + G T ++ +I +I D+ K + Sbjct: 20 WEQRKVLDYAIHTYGGGTPKTNVPEYWSGEIPWIQSSDLSISNLFNIIPKKHITELAIKS 79 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S I + F+ S FL + + + + + + Sbjct: 80 SATKFIPANSIAIVSRVGVGKLV-FMPFEYTTSQDFL-SLSNLQVDSNFGVYSIYMMLQR 137 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + I + + EQ I ++D I R +E L Sbjct: 138 ELNNIQGTSIKGITKSDLLEKKINKPSSREEQQKIGSF----FKQLDNTIALHQRKLEAL 193 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K K+ L+ + K + K ++ G WE + + E + ++ Sbjct: 194 KLMKKGLLQQMFPK--SEADIPKIRFADFDGK----WEQRKLGDVFNERSERSA--DGEL 245 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I +I+ + Y++V G+I + + + S GI Sbjct: 246 ISVTINSGVIKASKLEKKDNSSFDKSNYKVVKKGDIAYNSMRMWQGASGYSSY----NGI 301 Query: 316 ITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVPP 371 ++ AY + P I++ ++A++ + D+ + F GL +LKF + ++ + +P Sbjct: 302 LSPAYTVIYPIKNINAMFIAYIFKKNDMIQTFQRNSQGLTSDTWNLKFPSLSKIKIKIPT 361 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +EQ ITN++ +++ + I LK+ + +++ Sbjct: 362 NEEQIKITNLL----RKLEYTSTFHQNKIERLKKLKKAYLQ 398 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 16/197 (8%), Positives = 54/197 (27%), Gaps = 6/197 (3%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 + G + + I + ++ + K Sbjct: 12 RFKGFSEAWEQRKVLDYAIHTYGGGTPKTNVPEYWSGEIPWIQSSDLSISNLFNIIPKKH 71 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + + I I + + + + + ++++ +DS + + + Sbjct: 72 ITELAIKSSATKFIPANSIAIVSRVGVGKLVFMPFEYTTSQDFLSLSNLQVDSNFGVYSI 131 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKI 396 L + + + + D+ + P +EQ I + ++D + Sbjct: 132 YMM-LQRELNNIQGTSIKGITKSDLLEKKINKPSSREEQQKIGSF----FKQLDNTIALH 186 Query: 397 EQSIVLLKERRSSFIAA 413 ++ + LK + + Sbjct: 187 QRKLEALKLMKKGLLQQ 203 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 67/182 (36%), Gaps = 7/182 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + R+++ ++I + + K KD + D S + KG Sbjct: 224 WEQRKLGDVFNERSERSADG--ELISVTINSGVIKASKLEKKDNS--SFDKSNYKVVKKG 279 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-SIDVTQRIEAICEG 143 I Y + + + + ++GI S + V+ P + + ++ D+ Q + +G Sbjct: 280 DIAYNSMRMWQGASGYSSYNGILSPAYTVIYPIKNINAMFIAYIFKKNDMIQTFQRNSQG 339 Query: 144 AT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 T + + + I + IP EQ+ I + + R +L K Q Sbjct: 340 LTSDTWNLKFPSLSKIKIKIPTNEEQIKITNLLRKLEYTSTFHQNKIERLKKLKKAYLQT 399 Query: 202 LV 203 + Sbjct: 400 MF 401 >gi|56419916|ref|YP_147234.1| type I restriction-modification system specificity protein [Geobacillus kaustophilus HTA426] gi|56379758|dbj|BAD75666.1| type I restriction-modification system specificity protein [Geobacillus kaustophilus HTA426] Length = 391 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 55/406 (13%), Positives = 132/406 (32%), Gaps = 34/406 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ V + + TG+ + + + VE G + + + D F Sbjct: 2 SEWREVSLGEILEFKTGKLNSN---------QAVEGGKYPFFTCSPTTLRID---RYSFD 49 Query: 83 KGQILYGKLGPYL-RKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAI 140 +L + V+ PKD +L + + + + Sbjct: 50 TEAVLLAGNNANGIYAVKYYKGKFDAYQRTYVITPKDWETVDLRYMYYQIKLIGETLTQQ 109 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G + +I + +PP++ Q I + + +I+ + E+ + Sbjct: 110 SLGTATKFLTLSLLNSIKINLPPISIQRKIATILGSIDDKIELNLKMNQTLEEMAMTLYK 169 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + V G D + +S +G++P W + +V ++N + L Sbjct: 170 ---HWFVDFGPFQDGEFVES---ELGMIPKGWSICELEEIVEKINERVKAGEHLFSLPYV 223 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVD--PGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 +++ + G K S +V G+I+F + K + + R Sbjct: 224 PIDVLNQKSLMINGYKHGSEAKSSLVKFYKGDILFGAMRPYFHKVCIAPFDGITRTTC-- 281 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + K + + +A + R + +E + ++ V++PP+ Sbjct: 282 FVLRPKNNDYYAFVVATIFREETIDYANSHSKGSTIPYADWETLSKMKVILPPL------ 335 Query: 379 TNVINVETAR---IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + L+ + + L + R + ++G+ID+ Sbjct: 336 -QYLKEYNEKVVPLFKLMIQNFLNNEELVKTRDYLLPRLLSGEIDV 380 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 74/190 (38%), Gaps = 5/190 (2%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G IPK W + ++ + R + Y+ ++ + + Sbjct: 188 LGMIPKGWSICELEEIVEKINERVKAGEHLFSLPYVPIDVLNQKSLMI--NGYKHGSEAK 245 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVT 134 S++ F KG IL+G + PY K IA FDGI T VL+PK+ + Sbjct: 246 SSLVKFYKGDILFGAMRPYFHKVCIAPFDGITRTTCFVLRPKNNDYYAFVVATIFREETI 305 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +G+T+ +ADW+ + + + +PPL EK++ + ++ Sbjct: 306 DYANSHSKGSTIPYADWETLSKMKVILPPLQYLKEYNEKVVPLFKLMIQNFLNNEELVKT 365 Query: 195 LKEKKQALVS 204 L+S Sbjct: 366 RDYLLPRLLS 375 >gi|323340692|ref|ZP_08080944.1| type I site-specific deoxyribonuclease [Lactobacillus ruminis ATCC 25644] gi|323091815|gb|EFZ34435.1| type I site-specific deoxyribonuclease [Lactobacillus ruminis ATCC 25644] Length = 236 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 59/170 (34%), Gaps = 7/170 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP WK V + +G T G ++ ++ D+ G +P+ Sbjct: 66 DIPDSWKWVRLGMCGSWGSGATPSRTHPEYYGGNVPWLKTGDLNDGIITEIPEFVTELAL 125 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + ++V + G +L G + K I + D + P + + L Sbjct: 126 EKTSVRLNPVGSVLMAMYGATIGKLGILNIDATTNQACCACIPYTGIYNKYLFYYLM-AH 184 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + EG + + I P +PPLAEQ I EK+ + Sbjct: 185 RRSFIKMGEGGAQPNISKEKIVITPFALPPLAEQKRIVEKLEQLLPLCER 234 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 52/182 (28%), Gaps = 12/182 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + E +PD W+ + +R + + N+ L G++ + T Sbjct: 59 TDDEKNFDIPDSWKWVRLGMCGSWGSGATPSRTHPEYYGGNVPWLKTGDLNDGIITEIPE 118 Query: 275 LKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 E + ++ G ++ K + + A A P+ Sbjct: 119 FVTELALEKTSVRLNPVGSVLMAMYGATIGKLGILNID----ATTNQACCACIPYTGIYN 174 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + G + ++ E + P +PP+ EQ I + + Sbjct: 175 KYLFYYLMAHRRSFIKMGEGGAQPNISKEKIVITPFALPPLAEQKRIVEKLEQLLPLCER 234 Query: 392 LV 393 L Sbjct: 235 LK 236 >gi|331654121|ref|ZP_08355121.1| HsdS specificity protein of type I restriction-modification system [Escherichia coli M718] gi|331047503|gb|EGI19580.1| HsdS specificity protein of type I restriction-modification system [Escherichia coli M718] Length = 201 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 60/189 (31%), Gaps = 10/189 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 P + N I + I + + K T ++V Sbjct: 15 WFPKSIGNSCQTFSGGTPSSTNKTYYGGEIPFIRSAEIQKYKTELYLTKKGLENSTAKMV 74 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G+++ + SL G I A + ++ ++ +L+ + + Sbjct: 75 KKGDVLVALYGANSGDVSLSKI----NGAINQAILCLRHESNNAFLYQYLIHKKEW--II 128 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 G + +L E +K + + P EQ I + + V +ID + I ++++ Sbjct: 129 TTFLQGGQGNLSGEIIKSIKIFFPQPVEQQKIADFLLVLDDKIDA----QTKKIDIIRKH 184 Query: 407 RSSFIAAAV 415 + + Sbjct: 185 KKGLMQQLF 193 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 57/185 (30%), Gaps = 12/185 (6%) Query: 25 WKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W I + +G T S G +I +I +++ + + ST Sbjct: 15 WFPKSIGNSCQTFSGGTPSSTNKTYYGGEIPFIRSAEIQKYK---TELYLTKKGLENSTA 71 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + KG +L G ++ +G + L L + I + I Sbjct: 72 KMVKKGDVLVALYGANSGDVSLSKINGAINQAILCL---RHESNNAFLYQYLIHKKEWII 128 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + I +I + P EQ I + ++ +ID + + K Sbjct: 129 TTFLQGGQGNLSGEIIKSIKIFFPQPVEQQKIADFLLVLDDKIDAQTKKIDIIRKHKKGL 188 Query: 199 KQALV 203 Q L Sbjct: 189 MQQLF 193 >gi|295692970|ref|YP_003601580.1| type i restriction-modification system, s subunit [Lactobacillus crispatus ST1] gi|295031076|emb|CBL50555.1| Type I restriction-modification system, S subunit [Lactobacillus crispatus ST1] Length = 230 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 52/165 (31%), Gaps = 6/165 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP W+ V + G T G DI ++ D+ G + + Sbjct: 54 DIPNGWEWVRLGDIGAWAAGATPSRKHSEYYGGDIPWLKTGDLNDGIVEETSEKITELGV 113 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+V I G IL G + K I + + Q + + Sbjct: 114 KNSSVKINKPGNILIAMYGATIGKLGIVGKKELVTNQACCGCTPYKGIYNQYLFYYLLSS 173 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 +R+ + G + + I P+PP +EQ + KI Sbjct: 174 RKRLINLGSGGAQPNISKQKIEKFAFPLPPQSEQSRVTAKIEQLL 218 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 55/175 (31%), Gaps = 11/175 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + E +P+ WE + +RK+++ +I L G++ + Sbjct: 47 TDDEKPFDIPNGWEWVRLGDIGAWAAGATPSRKHSEYYGGDIPWLKTGDLNDGIVEETSE 106 Query: 275 LKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 E + +I PG I+ K + + + A P+ Sbjct: 107 KITELGVKNSSVKINKPGNILIAMYGATIGKLGIVG---KKELVTNQACCGCTPYKGIYN 163 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + ++ G + ++ + +++ +PP EQ +T I Sbjct: 164 QYLFYYLLSSRKRLINLGSGGAQPNISKQKIEKFAFPLPPQSEQSRVTAKIEQLL 218 >gi|78189486|ref|YP_379824.1| restriction endonuclease S subunits-like [Chlorobium chlorochromatii CaD3] gi|78171685|gb|ABB28781.1| Restriction endonuclease S subunits-like protein [Chlorobium chlorochromatii CaD3] Length = 386 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 50/398 (12%), Positives = 129/398 (32%), Gaps = 34/398 (8%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + + + TG+ + Y Y N D + + G + Sbjct: 6 LGKLVDIKTGK-LDVNAGTEYGKYPFFTCAKTVY---RINQYAFDNEAILVAGNGDL--- 58 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKD-VLPELLQGWLLSIDVTQRIEAICEGATMSH 148 + V++ K+ L + + + G + + Sbjct: 59 -------NVKYFKGKFNAYQRTYVIENKEVNLLSMKYLYYFMETYMIHLRNGAIGGIIKY 111 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + +P+PPL +Q I + +++ LI +R + ++ L + +++ + Sbjct: 112 IKIDHLTKAEIPLPPLDDQKRIAHLL----GKVERLIAQRKQHLQQLDQLLKSVFLEMFG 167 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 + + H E+ + + ++ Sbjct: 168 -FFDKTYTNWTIDT-----LTSHTEIVSGITKGKKYKTDELIEVPYMRVANVQDEHFVLD 221 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-- 326 E + + + + Y+++ ++ D R +E I + V+ + Sbjct: 222 EIKTISVTKNEIKQYRLLAGDLLLTEGGDPDKLGRGAVWQNQIENCIHQNHIFRVRVNDK 281 Query: 327 -GIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I+ YL+ L+ S F+ + S+ +K+ P+++PPI+ Q ++ Sbjct: 282 SRINPDYLSALIGSPYGKSYFFRSAKQTTGIASINSTQLKKFPIVIPPIELQNRFATIVE 341 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +++ + +QS+ L+ ++ A G++DL Sbjct: 342 ----KVESIKTHYQQSLNNLETLYNALSQKAFKGELDL 375 >gi|149369906|ref|ZP_01889757.1| type I restriction-modification system specificity determinant protein [unidentified eubacterium SCB49] gi|149356397|gb|EDM44953.1| type I restriction-modification system specificity determinant protein [unidentified eubacterium SCB49] Length = 415 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 51/407 (12%), Positives = 115/407 (28%), Gaps = 35/407 (8%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +K ++ GR + + G G R D S ++ IL Sbjct: 5 KLKDLLEIKNGRDYK-----------HLSEGDIPVYGSGGLMRYVDES---LYQGESILL 50 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 + G + + T + + K + +L +E + G + Sbjct: 51 PRKGTLSNIQYVNESFWTVDTIYYSIIDK---SKTEPYYLYRYLTLLDLEHLNSGTGVPS 107 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + +IP+ +P L+ Q I + + +I+ + K Sbjct: 108 MTFGAYYDIPIKLPNLSTQKQIAKVLSDLDAKIEVNNNINQELEAMAKTLYDYWFVQFDF 167 Query: 209 KGLNPDVKMKDSGIEWVGL-----VPDHWEVKPFFALVTELNRKNTKLIE--SNILSLSY 261 +N + G +P+ W F + + + + Sbjct: 168 PDVNGNPYKSSGGAMVFNEALKREIPEGWGDGVFEDVANIIGGSTPSKADSANFTTEDGI 227 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGE-------IVFRFIDLQNDKRSLRSAQVMERG 314 I K + N G K + Y + + G + + L + A E Sbjct: 228 PWITPKDLSNNKGKKYITRGEYDVTEQGIKKSSLKLMPSGTVLLSSRAPIGYLAIARETV 287 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + + +P ++ + + + G + + +K + ++ PP Sbjct: 288 TTNQGFKSFEPKSYFTSEFLYYQIKNKIPLIEARSGGSTFKEVSASTLKTIKIITPP--- 344 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + I +E+ L E R + + GQ+ + Sbjct: 345 -EKVIKIYQTTAKPIFNKQNLLEKENQKLSELRDWLLPMLMNGQVTV 390 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 37/126 (29%), Gaps = 13/126 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYL----PKD 67 IP+ W + + G T I +I +D+ + GK D Sbjct: 191 EIPEGWGDGVFEDVANIIGGSTPSKADSANFTTEDGIPWITPKDLSNNKGKKYITRGEYD 250 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + S++ + G +L P IA + F +PK + Sbjct: 251 VTEQGIKKSSLKLMPSGTVLLSSRAPI-GYLAIARETVTTNQGFKSFEPKSYFTSEFLYY 309 Query: 128 LLSIDV 133 + + Sbjct: 310 QIKNKI 315 >gi|254426284|ref|ZP_05040000.1| Type I restriction modification DNA specificity domain protein [Synechococcus sp. PCC 7335] gi|196187698|gb|EDX82664.1| Type I restriction modification DNA specificity domain protein [Synechococcus sp. PCC 7335] Length = 409 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 53/417 (12%), Positives = 126/417 (30%), Gaps = 32/417 (7%) Query: 21 IPKHWKVVPIKRFTK-----LNTG---RTSESG----KDIIYIGLEDVESGTGKYLPKDG 68 +P WK+V + + G + + ++ + Sbjct: 3 LPHDWKLVSLSEIASSEKGAIRRGPFGGSLKKSMFVESGFKVYEQQNAIRDDFQIGHYFI 62 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQP--KDVLPELL 124 N + ++ G R AI+ D G+ + + ++P +L L Sbjct: 63 NDEKYKEMEGFSVKPRDLIISCAGTIGRIAIVPDSAEPGVINQALMRIRPDTNVILVRYL 122 Query: 125 QGWLLSIDVTQRIEAICEGATMSHAD-WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + L S + I G+ + + I +P+PP+ EQ I + Sbjct: 123 KWLLESPTYQRDIFGKSAGSALKNLAAIGEIKKCKIPLPPIKEQRRIAAILDKADAVRRK 182 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 +LL+ + + + K S + + + PF + + Sbjct: 183 RKEAIALTEDLLRSVFLDFMESV------SNDCRKVSFKDVTLESRNSFVNGPFGSNLLT 236 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 ++ + I + G + ++ + + V PG+++ + Sbjct: 237 SELQSEGVPVIYIRDIREG-VYNRVSQAFVTKEKAKELAACNVFPGDVLIAKVGDPPGTA 295 Query: 304 SLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFY-AMGSGLRQSLKFE 360 ++ GI+T + ++ + ++A + S + R Sbjct: 296 AIYPLSSP-NGIVTQDVVRMRLDLENATPEFIAAYINSQIGKHTLKPIIVEATRSRFPLG 354 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 K L V +PP+++Q + + +I + + + S + A G Sbjct: 355 AFKNLVVTLPPLEDQQRF----SKQYKKIRHIQNFLHCTCEQENNLFHSLLQRAFRG 407 >gi|225022500|ref|ZP_03711692.1| hypothetical protein CORMATOL_02540 [Corynebacterium matruchotii ATCC 33806] gi|224944739|gb|EEG25948.1| hypothetical protein CORMATOL_02540 [Corynebacterium matruchotii ATCC 33806] Length = 383 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 50/411 (12%), Positives = 107/411 (26%), Gaps = 47/411 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W V + + R E I + D + +Y K D + Sbjct: 2 SEWPTVKLGEVAEQTNHRVGELDVPIYSVTKYDGFVPSSEYFKKRVF--SMDIRKYKLCT 59 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G Y + I+ S + + + + + Sbjct: 60 AGDFAYATIHLDEGSIGISPVKCGISPMYTTFRLNSLNISPDYLLRYLKSSRALTQYLTL 119 Query: 143 G----ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G + + + +P PPL EQ I + T I + Sbjct: 120 GSGSAERRKSIKFTDLRKMEIPFPPLGEQNRIVGILGKTTGAISS--------------- 164 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 V K + K++ S + + + + +N Sbjct: 165 --------VQKQIEQAKKLRSSIVGMASKMAVELRAISEYFDINPRQPRNIPDNAPTSFV 216 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV-------- 310 + E + Y + G+I+ I + A++ Sbjct: 217 PMANLDETFGISPITSRFSEHKKGYTYFENGDILLAKITPCFENGKSAIAKLSTQIGHGS 276 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVL 368 E ++ + + +A +++ K + GS ++ + + L V Sbjct: 277 TEFHVLRHKNHMHQDVCLSPLLVAAILKQPSFLKPAENFMRGSAGQKRIPVSYIASLKVP 336 Query: 369 VPPIKEQFDITN--VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 V + I+ ++ L+ + + LL+E + S A G Sbjct: 337 V------LKSVDLEKIDQSLEIVEALLNLYHRKLSLLQELQKSLATRAFAG 381 >gi|229089987|ref|ZP_04221239.1| hypothetical protein bcere0021_8230 [Bacillus cereus Rock3-42] gi|228693334|gb|EEL47043.1| hypothetical protein bcere0021_8230 [Bacillus cereus Rock3-42] Length = 385 Score = 84.8 bits (208), Expect = 3e-14, Method: Composition-based stats. Identities = 55/365 (15%), Positives = 112/365 (30%), Gaps = 22/365 (6%) Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPK 117 +PK+ + S+ + I GQ +YGKL + I ++ + + Sbjct: 26 VVPKNEIYQGSEATKYYIRKAGQFIYGKLDFLHQAFGIIPDKLDGYESTLDSPAFDIADN 85 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L+ + + +P+ +P + EQ I + Sbjct: 86 LNSSFFLEHVSRKQFYLYQGTIANGSRKAKRIHSETFFEMPLIVPTMEEQKKIGDF---- 141 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 ++D I + + LK+ KQ + + K +++ G V Sbjct: 142 FKKVDQTIALHQQELTTLKQTKQGFLQKMFPKDGESVPEIRFPGFTGDWEEYKFENVLNK 201 Query: 238 FALVTELNRKNTKLIESN-----ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + + E N I + E + + E Sbjct: 202 QDGIRRGPFGSALKKEFFVKDSNYAVYEQQNAIYDNYETRYNITKEKFTELKNFQLSEGD 261 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMRSYDLCKVFYAM- 349 F R R + +++G+ A + D Y +RS ++ + Sbjct: 262 FILSGAGTIGRISRVPKGIKQGVFNQALIRFKIDEDITDPEYFIQWIRSENMQRKLTGAN 321 Query: 350 -GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 GS + + +VK+ V+VP EQ I ++D + + + LKE + Sbjct: 322 PGSAITNLVPMSEVKKWDVMVPSKNEQIKIGEF----FKQLDDTITLHQSELDALKETKK 377 Query: 409 SFIAA 413 +F+ Sbjct: 378 AFLQK 382 >gi|15900422|ref|NP_345026.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae TIGR4] gi|14971981|gb|AAK74666.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae TIGR4] Length = 522 Score = 84.8 bits (208), Expect = 3e-14, Method: Composition-based stats. Identities = 68/441 (15%), Positives = 132/441 (29%), Gaps = 71/441 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKG--------------LNPDVKMKDSGIEW-------- 224 +L KE ++++ Y + L K E Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 225 ------------------------VGLVPDHWEVKPFFALVTELNRKNTK-----LIESN 255 + +P+ W F +LV K + Sbjct: 323 DISIVSQGDDNSYYGNKDETTSYPIYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTE 382 Query: 256 ILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I +S ++ N + + I G ++ F L Sbjct: 383 IPWVSISDMPISGYVTNARESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATH 442 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 II+ + I YL + G ++L + L + + Sbjct: 443 NEAIIS-IFPYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISN 499 Query: 372 IKEQFDITNVINVETARIDVL 392 +E I +++ ++ L Sbjct: 500 HEEMKRIIFKVDLLFQKVSQL 520 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 347 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNARESISK 406 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 407 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 464 >gi|268318424|ref|YP_003292142.1| restriction modification system DNA specificity domain protein [Rhodothermus marinus DSM 4252] gi|262335958|gb|ACY49754.1| restriction modification system DNA specificity domain protein [Rhodothermus marinus DSM 4252] Length = 237 Score = 84.8 bits (208), Expect = 3e-14, Method: Composition-based stats. Identities = 22/201 (10%), Positives = 53/201 (26%), Gaps = 9/201 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-------PES 279 +P W + + + + + + L + Sbjct: 36 DLPYGWHWVRLEEIFEVQQGASMSPKRRAGRNPKPFLRTKNVLWGTVDLSLIDEMDFTDK 95 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + PG+++ + + K ++ + + M++ Sbjct: 96 EIEKLRLQPGDLLVCEGGDVGRTAIWEGQLPLVLYQNHIHRLRAKDAEVEPRFFMYWMQA 155 Query: 340 YD--LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 A +L +K +PP+ EQ I + +I L E Sbjct: 156 AYQVFLAYQGAESRTAIPNLSGRRLKNFNAPLPPLSEQRRIVAHLEAVQEKIRALKAAQE 215 Query: 398 QSIVLLKERRSSFIAAAVTGQ 418 ++ LK + + A G+ Sbjct: 216 ETDEELKRLEQAILDRAFRGE 236 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 64/202 (31%), Gaps = 10/202 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W V ++ ++ G + ++ ++V GT D Sbjct: 36 DLPYGWHWVRLEEIFEVQQGASMSPKRRAGRNPKPFLRTKNVLWGTVDLSLIDEMDFTDK 95 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE---LLQGWLLSI 131 G +L + G R AI + Q + + + E + + Sbjct: 96 EIEKLRLQPGDLLVCEGGDVGRTAIWEGQLPLVLYQNHIHRLRAKDAEVEPRFFMYWMQA 155 Query: 132 DVTQR--IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + + + N P+PPL+EQ I + A +I L + Sbjct: 156 AYQVFLAYQGAESRTAIPNLSGRRLKNFNAPLPPLSEQRRIVAHLEAVQEKIRALKAAQE 215 Query: 190 RFIELLKEKKQALVSYIVTKGL 211 E LK +QA++ L Sbjct: 216 ETDEELKRLEQAILDRAFRGEL 237 >gi|27365369|ref|NP_760897.1| Restriction endonuclease S subunit [Vibrio vulnificus CMCP6] gi|27361516|gb|AAO10424.1| Restriction endonuclease S subunit [Vibrio vulnificus CMCP6] Length = 560 Score = 84.8 bits (208), Expect = 3e-14, Method: Composition-based stats. Identities = 61/466 (13%), Positives = 124/466 (26%), Gaps = 100/466 (21%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W+ L G K + +G++L N T + Sbjct: 101 LPEGWQACYFGDIYSLVYGDNLPKAK----------RTESGEFLVYGSNG-SVGTHNLFS 149 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 ++ G+ G + + + ++ P + + L ++ + + I Sbjct: 150 VGSPCLVIGRKGSAGAINLSDQPCWVTDVAYSLIPPVGISLKYCFLHLQTLGLDSLGKGI 209 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR-------------------- 180 G + + + IPP EQ I K+ Sbjct: 210 KPG-----LNRNEANALVVCIPPSDEQHRIVAKVDELMALCDQLEQQTEASIEAHQLLVT 264 Query: 181 ---------------------IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 I E + + KQ ++ V L P + Sbjct: 265 TLLDTLTNSADADELMQNWARISEHFDTLFSTEESIDQLKQTILQLAVMGKLVPQDPSDE 324 Query: 220 SGIEWV-------------------------------GLVPDHWEVKPFFALVTELNRKN 248 E + +P WE + + Sbjct: 325 PAAELLKRIADEKAQLVKDKKIKKQKALPPIAEDEKPFELPSGWEWCRIQDVALFTTSGS 384 Query: 249 ----TKLIESNILSLSYGNIIQK------LETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 +S L ++ GN+ + R + + ++ +++ Sbjct: 385 RDWAKYYSDSGALFVTMGNLSRGSYELRLDNLRFVRPPKGGEGSRTKLEARDLLISITGD 444 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + L + E I + Y MRS F A G++ S + Sbjct: 445 VGNL-GLIPEEFGEAYINQHTCLLRFMPECQGKYFPDFMRSPLAKYQFDAPQRGIKNSFR 503 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI-EQSIVLL 403 DV + + +PP+ EQ IT ++ + + L ++ E I L Sbjct: 504 LSDVGEMHLPLPPLNEQVRITEKVSDLLSICERLKVRLRESQITQL 549 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 61/198 (30%), Gaps = 10/198 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSR--- 71 +P W+ I+ T + + S +++ + ++ G+ + + Sbjct: 363 ELPSGWEWCRIQDVALFTTSGSRDWAKYYSDSGALFVTMGNLSRGSYELRLDNLRFVRPP 422 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLL 129 + + + +L G +I + G + +L+ + Sbjct: 423 KGGEGSRTKLEARDLLISITGDVGNLGLIPEEFGEAYINQHTCLLRFMPECQGKYFPDFM 482 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + + +G + +P+PPL EQV I EK+ + L Sbjct: 483 RSPLAKYQFDAPQRGIKNSFRLSDVGEMHLPLPPLNEQVRITEKVSDLLSICERLKVRLR 542 Query: 190 RFIELLKEKKQALVSYIV 207 A+V V Sbjct: 543 ESQITQLHLTDAIVERAV 560 >gi|325107545|ref|YP_004268613.1| type I restriction system, specificity protein HsdS [Planctomyces brasiliensis DSM 5305] gi|324967813|gb|ADY58591.1| putative type I restriction system, specificity protein HsdS [Planctomyces brasiliensis DSM 5305] Length = 199 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 35/158 (22%), Positives = 66/158 (41%), Gaps = 12/158 (7%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + S E + +V G+I + + + + E +++ AY+ + Sbjct: 38 VPRDSLDRKMETNLSDEEHLLVRRGDIAYNMMRMWQGASGVAH----EDCLVSPAYVVLN 93 Query: 325 P-HGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITN 380 P IDS + ++ + K F G+ R L +ED +P VP +EQ I Sbjct: 94 PTELIDSRFASYFFKHPHTLKQFRDFSHGIAEDRLRLYYEDFSAIPTRVPDKEEQARIAR 153 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I +++D+L +Q + LL +RR +TG+ Sbjct: 154 FIEACDSQLDLL----KQKVELLGQRREGLSNRLLTGE 187 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 45/156 (28%), Gaps = 6/156 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +HW + + + + + + + + + Sbjct: 4 EHWTPRLMGELFEKRSEQ---GIAGLPIMSVTIERGLVPRDSLDRKMETNLSDEEHLLVR 60 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---TQRIEA 139 +G I Y + + + +A D + S ++VL P +++ + R + Sbjct: 61 RGDIAYNMMRMWQGASGVAHEDCLVSPAYVVLNPTELIDSRFASYFFKHPHTLKQFRDFS 120 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 ++ IP +P EQ I I Sbjct: 121 HGIAEDRLRLYYEDFSAIPTRVPDKEEQARIARFIE 156 >gi|153805909|ref|ZP_01958577.1| hypothetical protein BACCAC_00149 [Bacteroides caccae ATCC 43185] gi|149130586|gb|EDM21792.1| hypothetical protein BACCAC_00149 [Bacteroides caccae ATCC 43185] Length = 361 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 44/364 (12%), Positives = 105/364 (28%), Gaps = 31/364 (8%) Query: 30 IKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +K F ++TG T D ++ ED+++ ++ +T I Sbjct: 18 LKSFADVSTGGTPSKANLEYWNGDKPWVSAEDMKNKY--VYDTCEKVTEAGYATCKIIPV 75 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++Y G I + + + D + + + + I+ + G Sbjct: 76 DTLMYVCRGSI-GVMAINKIECATNQSICRAKCHDNVCNVEFLYHALMYQKDNIKKMGTG 134 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + + +PP EQ+ + + + Sbjct: 135 TSFKSLNQTSFSELKIELPPYNEQMKFVSIAQQADKSEFVGCKSQFIEMFGNQNTNDKGW 194 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + + K + K S +G P + + + I +S Sbjct: 195 TESLVK-----DEFKLS----MGKTPARNNPECWDNGTHKWVS---------ISDMSSYT 236 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + + + V G I+ F I+ A+ Sbjct: 237 RYTGDTSEYITDYAIADSGIKAVPKGTIIMSFKLSIGRTAITSEDLYTNEAIM--AFAGF 294 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + +L +L+ + + G Q+L E + +++PPI+ Q + ++ N Sbjct: 295 DEKKFNIDFLHFLIANKNWLLGAKQAVKG--QTLNKESIGNAKIIIPPIEAQEEFASIYN 352 Query: 384 VETA 387 Sbjct: 353 QADK 356 Score = 37.1 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 46/166 (27%), Gaps = 10/166 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGK--YLPKDGNSRQS 73 K W +K KL+ G+T ++ + D+ S T + Sbjct: 192 KGWTESLVKDEFKLSMGKTPARNNPECWDNGTHKWVSISDMSSYTRYTGDTSEYITDYAI 251 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + KG I+ + + I D + + D + I Sbjct: 252 ADSGIKAVPKGTIIMS-FKLSIGRTAITSEDLYTNEAIMAFAGFDEKKFNIDFLHFLIAN 310 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + + IGN + IPP+ Q Sbjct: 311 KNWLLGAKQAVKGQTLNKESIGNAKIIIPPIEAQEEFASIYNQADK 356 >gi|327390241|gb|EGE88584.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA04375] Length = 348 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 57/368 (15%), Positives = 112/368 (30%), Gaps = 48/368 (13%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + ++ +G +S + + I + DVE G + Sbjct: 2 KKVKLGEVCEILSGYAFKSSQFNDNKIGLPLIRIRDVERGFSD------TYFEGTYPEEY 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ D + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKITDKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H I +I +P EQ LI +K+ I + R E E Sbjct: 115 KTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLN----TISQIYDFRKIQSEKFNELV 170 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + + + + + K S + ++T N KN K +E Sbjct: 171 KSRFNEMFGDVILNEKEWKVS---------------KWNEILTIRNGKNQKQVEDADGKF 215 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 Y IV ++ N +R Sbjct: 216 PIYGSGG----------IMGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG-- 263 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + I+S YL + + Y+ K+ A+ SL D+ + + +PP+ Q + Sbjct: 264 -LEPVLEKINSEYLFYFCQLYNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFA 319 Query: 380 NVINVETA 387 + + Sbjct: 320 DFVVQVDK 327 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 186 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 233 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 234 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 290 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 291 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 321 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 38/137 (27%), Gaps = 6/137 (4%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + +Y ++ G+++ + + ++ +K Sbjct: 42 FSDTYFEGTYPEEYLIKNGDLLITMDGS-----FILKKWEGDLALLNQRVCKIKITDKSV 96 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 K + L +K + ++P EQ I +N + Sbjct: 97 DEGYISWLIPKFLKEIEDKTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLNTISQIY 156 Query: 390 DVLVEKIEQSIVLLKER 406 D + E+ L+K R Sbjct: 157 DFRKIQSEKFNELVKSR 173 >gi|293400127|ref|ZP_06644273.1| type I restriction-modification system [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291306527|gb|EFE47770.1| type I restriction-modification system [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 345 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 48/361 (13%), Positives = 104/361 (28%), Gaps = 27/361 (7%) Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKD 118 + D S + G+ Y K G ST ++ K Sbjct: 2 SYFNKTVASKDMSGYYLLKNGEFAYNKSYSVGYDFGSIKRLDRYPMGALSTLYICFVLKR 61 Query: 119 VLPELLQGWLLS-IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + ++ + S + EGA L E + KI Sbjct: 62 HESDFIKAYFDSLKWYREIYMISAEGARNHGLLNVPTEEFFDTKHYLPENTDEQRKIANF 121 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 + ID I + ++ LK+ K+ L++ + + G + + Sbjct: 122 LIAIDKKIAAQQSLVDNLKKYKRGLLNKVFSN--------------INGNIYPTVYLSEV 167 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGEIVFRFI 296 + L + + + L L NI L + + + V +++ Sbjct: 168 ADFLQGLTYSPSDVSVAGYLVLRSSNIQNGVLSFDDCVYVDKKVDESLQVKCDDVIMCVR 227 Query: 297 DLQNDKRSLRSAQVMERGIIT--SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + + T + M ++ D+ +L +VF MG+ Sbjct: 228 NGSKKLVGKTALIPNNMAMTTWGAFMMIIRSKLNDTYIFHYLNSQMFFSQVFKDMGTATI 287 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + + + +PP + I + + DV ++ E + L E + + + Sbjct: 288 NQITKGILNECKLPLPPETARKQI----SKMLSSFDVKIQNAEICLTTLVELKKALLQQL 343 Query: 415 V 415 Sbjct: 344 F 344 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 51/155 (32%), Gaps = 11/155 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK-RSLRSAQVMERGIITSAYMAVKPHG 327 N + + Y ++ GE + S++ G +++ Y+ Sbjct: 2 SYFNKTVASKDMSGYYLLKNGEFAYNKSYSVGYDFGSIKRLDRYPMGALSTLYICFVLKR 61 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVP-PIKEQFDITNV 381 +S ++ S + Y + + G R ++ E+ +P EQ I N Sbjct: 62 HESDFIKAYFDSLKWYREIYMISAEGARNHGLLNVPTEEFFDTKHYLPENTDEQRKIANF 121 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + ID + + + LK+ + + + Sbjct: 122 LIA----IDKKIAAQQSLVDNLKKYKRGLLNKVFS 152 >gi|13786606|ref|NP_112723.1| hypothetical protein pCD4_p4 [Plasmid pCD4] gi|13676629|gb|AAK38201.1|AF306799_4 HsdS [Plasmid pCD4] Length = 394 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 56/409 (13%), Positives = 133/409 (32%), Gaps = 47/409 (11%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ K K + R+ + E ++ G Sbjct: 15 KVPELRFPGFTDDWEERKAKEMIKTHHFRSYL-AEPNDVGNYEVIQQGDKPIAGYANGEP 73 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 V++F G P I D I S E + Sbjct: 74 FEYFYDVTLF--GDHTVSLFKPTKPFFIATDGVKIISA-----------DEFDGRYFYVT 120 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + I + KI ++D I R Sbjct: 121 LERYKPASQGYKRHFTILKNEDIWFTTNKDEQV--------KIGTFFKQLDDTIALHQRK 172 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++LLKE+K+ + + K +++ +G D WE + + + + ++ Sbjct: 173 LDLLKEQKKGYLQKMFPKNGAKVPELRFAG------FADDWEQRKLKDVTERVRSNDGRM 226 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ-V 310 + + + + + + + + + Y ++ GE+ + + + K + + Sbjct: 227 DLPTLTMSASSGWLDQKDRFSGDISGKEKKNYTLLKKGELSYNHGNSKLAKYGVVFSLTN 286 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDL--CKVFYAMGSGLRQ----SLKFEDVKR 364 E ++ Y + K S M S L ++ + SG R ++ ++D Sbjct: 287 YEEALVPRVYHSFKALENTSADFIEYMFSTKLPDRELGKLVSSGARMDGLLNINYDDFMN 346 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + +P +EQ +++ ++D + ++ + LLKE++ F+ Sbjct: 347 IHISIPNYEEQI----LMSTFFRKLDDTIALHQRKLDLLKEQKKGFLQK 391 >gi|265982955|ref|ZP_06095690.1| LOW QUALITY PROTEIN: restriction endonuclease S [Brucella sp. 83/13] gi|264661547|gb|EEZ31808.1| LOW QUALITY PROTEIN: restriction endonuclease S [Brucella sp. 83/13] Length = 345 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 51/346 (14%), Positives = 102/346 (29%), Gaps = 46/346 (13%) Query: 88 YGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIEAICE 142 + P ++ + + + ST + VL+ K LP+ + WL + +E Sbjct: 16 FATTRPTQQRYCLIGDEYSGEVASTGYCVLRAKKDKVLPKWILHWLATTTFKTYVEENQS 75 Query: 143 GATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELL 195 G+ + +P+P LA Q + A T +LL Sbjct: 76 GSAYPAISDAKVREFEIPVPCPDNPEKSLAIQAEFVRILDAFTELTARKKQYNYYREQLL 135 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + D +EW +G V + Sbjct: 136 R--------------------FDDGEVEWKALGEVAELVRGNGLQKKDFTETGVPAIHYG 175 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + PE + VD G++V + + ER Sbjct: 176 QIYTCYGLST-----TETKSYVSPELARRLRKVDRGDVVITNTSENIEDVGKALVYLGER 230 Query: 314 GIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVP 370 +T + + + + Y A+ ++ G + + D+ ++ + VP Sbjct: 231 QAVTGGHATILKPGNCLLGKYFAYFTQTDTFASDKRRYAKGTKVIDVSATDMAKILIPVP 290 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 P+ EQ I +++ A + E + I L K+ R ++ Sbjct: 291 PLAEQAHIVTILDKFDALTHSISEGLPHEISLRKQQYTHYRDRLLS 336 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 60/202 (29%), Gaps = 15/202 (7%) Query: 19 GAIPKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 G + + + +L G + + + I + + G + + + Sbjct: 140 GEV----EWKALGEVAELVRGNGLQKKDFTETGVPAIHYGQIYTCYGLSTTETKSYVSPE 195 Query: 75 T-STVSIFAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELLQGWL 128 + +G ++ + + + + +L+P + L + Sbjct: 196 LARRLRKVDRGDVVITNTSENIEDVGKALVYLGERQAVTGGHATILKPGNCLLGKYFAYF 255 Query: 129 LSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 D +G + + I +P+PPLAEQ I + ++ Sbjct: 256 TQTDTFASDKRRYAKGTKVIDVSATDMAKILIPVPPLAEQAHIVTILDKFDALTHSISEG 315 Query: 188 RIRFIELLKEKKQALVSYIVTK 209 I L K++ +++ Sbjct: 316 LPHEISLRKQQYTHYRDRLLSF 337 >gi|189462165|ref|ZP_03010950.1| hypothetical protein BACCOP_02847 [Bacteroides coprocola DSM 17136] gi|189431138|gb|EDV00123.1| hypothetical protein BACCOP_02847 [Bacteroides coprocola DSM 17136] Length = 462 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 48/382 (12%), Positives = 125/382 (32%), Gaps = 29/382 (7%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDG----NSRQSDTSTVSIFAKGQILYGKLGPYLRKAII 100 I Y D+ + ++ P + S KG IL +G + + Sbjct: 70 NDGIPYYRGGDIYNSFIEFSPNPLRIPRYVYELSIMRRSHLKKGDILMSIVGAIIGNISL 129 Query: 101 --ADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 + + CS + +++PK+ + +L Q+I+ G+ + + + Sbjct: 130 VSTNNNATCSCKLAIIRPKNNISSEYLATYLRCKYGQQQIQKFRRGSGQTGIILEDFDQL 189 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 +P + I + L E + + + Sbjct: 190 LVPDLSNNIKEQISSFVKQSYAYSLKSRQLYSEAESYLLE-CLGMTDFAANPDAYNVKTL 248 Query: 218 KDSGIEWVGLV-----PDHWEVKPFFALVTELNR--KNTKLIESNILSLSYGNIIQKLET 270 K+S ++ P + + + + I+ + G + +E Sbjct: 249 KESFLDTGRFDAEYYLPKYEDYCRLVQSYSNGYELLGDACNIKDANYTPETGVRYKYIEL 308 Query: 271 RNMGLKPESY------------ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 N+G E ++V G+++ I+ + +L + E + ++ Sbjct: 309 ANIGKSGEIIGCDIQNGENLPTRARRMVHQGDVIVSSIEGSLESCALVTED-YEGALCST 367 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFD 377 + ++ ++S L L +S + ++ SG ++ ++++LP+ + + Q + Sbjct: 368 GFYVLQSSKMNSETLLTLFKSLPIQQLMKKGCSGTILTAISKPELEKLPIPIIRQEVQDE 427 Query: 378 ITNVINVETARIDVLVEKIEQS 399 I + A ++ +E + Sbjct: 428 IAQHVRKSFALRKEAMKLLENA 449 >gi|308178070|ref|YP_003917476.1| type I restriction-modification system specificity subunit [Arthrobacter arilaitensis Re117] gi|307745533|emb|CBT76505.1| type I restriction-modification system specificity subunit [Arthrobacter arilaitensis Re117] Length = 417 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 49/147 (33%), Gaps = 7/147 (4%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA--WLMRS 339 + + G+++F + + A + L W +RS Sbjct: 64 SRSQLADGDVLFSIAGALGRSTVVEPDWLPANTNQALAIIRPSRKRGLVRPLYLLWALRS 123 Query: 340 YDL-CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + ++ + +L + V + +P + EQ I ++ A + L + + Sbjct: 124 PTVGKRINEINVQAAQANLSLQQVGEFEIPIPNLAEQEAIAAALDDVDALVKSLKRIVAK 183 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGES 425 + + + + +TG+ L G + Sbjct: 184 KLDV----KQGMMQELLTGRTRLPGFT 206 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 69/423 (16%), Positives = 142/423 (33%), Gaps = 37/423 (8%) Query: 27 VVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS--TV 78 V + + G T S + ++ +E E + K+ + + Sbjct: 8 VRALSEL--ITKGTTPTSIGRNFTANGVRFLKVETFEEDGTYVVGKEAFIDEETHRQLSR 65 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVL----PELLQGWLLSID 132 S A G +L+ G R ++ + +++P P L L S Sbjct: 66 SQLADGDVLFSIAGALGRSTVVEPDWLPANTNQALAIIRPSRKRGLVRPLYLLWALRSPT 125 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 V +RI I A ++ + +G +PIP LAEQ I + + +L + + Sbjct: 126 VGKRINEINVQAAQANLSLQQVGEFEIPIPNLAEQEAIAAALDDVDALVKSLKRIVAKKL 185 Query: 193 ELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ + Q L++ G D + G DH AL E + + L Sbjct: 186 DVKQGMMQELLTGRTRLPGFTGDWRNVTLG--------DHVAYVRSVALSREQLDQGSPL 237 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESY--ETYQIVDPGEIVFRFI--DLQNDKRSLRS 307 + + ++ T + S+ + PG++VF D +S+ Sbjct: 238 RYLHYGDIHTRKSVRLDATSEFMPRAASHLASGAGRLIPGDLVFADASEDPDGVGKSVEI 297 Query: 308 AQVMERGIIT--SAYMAVKPHGIDSTYLAWLMRS-YDLCKVFYAMGSGLRQ-SLKFEDVK 363 + V G++ A + + ++ + +G + + + Sbjct: 298 SDVPPEGVVPGLHTIAARFDKSVLADGFKAYIQFIPAFRAALLRLAAGTKVLATTRSYIS 357 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 L + +P EQ I V+ A I+ L E+ + + + + +TG+ L Sbjct: 358 SLKLPLPGADEQHAIAQVLEDADAEIEAL----ERRLESARAVKVGMMQELLTGRTRLPT 413 Query: 424 ESQ 426 + + Sbjct: 414 KEE 416 >gi|312115847|ref|YP_004013443.1| restriction modification system DNA specificity domain protein [Rhodomicrobium vannielii ATCC 17100] gi|311220976|gb|ADP72344.1| restriction modification system DNA specificity domain protein [Rhodomicrobium vannielii ATCC 17100] Length = 367 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 55/398 (13%), Positives = 124/398 (31%), Gaps = 39/398 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+V P+ F +L +G+ ++ +I G+ + +G + + + Sbjct: 4 GWEVRPLGDFIELISGQHIDAVDYNIDGHGVGYI-TGPSDFGRDEPLISKWTEKPKRFAD 62 Query: 83 KGQILYGKLGPYLRKAI-IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G IL G + K + S Q + ++ K + + L L + ++ Sbjct: 63 PGDILLTVKGSGVGKINRLRRGRVAISRQIMAVRAKGIDADFLHLLLGAHG--AHFASLA 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GA + + + N+ +P+PP+ EQ I + + + ++ + Sbjct: 121 NGAAIPGISREHVTNLQIPLPPMDEQTRIVAILDEAFAGLSRARANAEANLADARKLLEV 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 ++ + G + + + +K+ + LS Sbjct: 181 TIAERLKSG-------------------NGDWQQCLVENSYRRTKIPSKVQCKDYLSEGR 221 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I+ + G + + V+ +VF R L+ + Sbjct: 222 YPIVSQEADFISGYWEDDAD-LVRVERPIVVFGD-----HTRHLKYIDFDFVVGADGTQL 275 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITN 380 I+ + + +RS L G G + +K+ + P + Q I + Sbjct: 276 LAPISQIEPKFYYYALRSIPL------AGKGYARHFS--HLKKETIWFPADLASQRAIAD 327 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + I L + R S + A +G+ Sbjct: 328 TLEEIEVHIADLARAYIAQSGSINSLRQSLLQKAFSGE 365 >gi|330957221|gb|EGH57481.1| type I restriction-modification system specificity determinant [Pseudomonas syringae pv. maculicola str. ES4326] Length = 399 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 43/408 (10%), Positives = 112/408 (27%), Gaps = 57/408 (13%) Query: 26 KVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSI 80 + + +L G + + K + I + + G K + SD + Sbjct: 17 EWQALSELGELVRGSGLQKKDFTEKGVPAIHYGQIYTYYGLSTSKTKSFVSSDLARQLRK 76 Query: 81 FAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELLQGWLLS-IDVT 134 +G ++ + + + + +L+P L + + Sbjct: 77 VNQGDVVITNTSENFKDVGKALVYLGEQQAVTGGHATILRPGSCLLGKYFAYFTQTSEFF 136 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITE 187 +G + + I +PIP L Q I + A T L TE Sbjct: 137 AEKRKYAKGIKVIDVSATDMAKIRIPIPCPDNPKKSLEIQSEIVRMLDAFTELTAGLTTE 196 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + + +++ + +++ + +EW L + V Sbjct: 197 LTTELSIREKQYNYYCNQLLS--------FEKQEVEWKTLENITTSIASGRNKVRATEGA 248 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + ++ + ++ + + Sbjct: 249 VPVYGSTGVIGFTSEAAYSG---------------------NVLLVARVGAN---AGRVN 284 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 A + + + + + + +L + G + + +K L V Sbjct: 285 AVAGNFDVSDNTLIVRPNEAWNVRFAFHQLTHMNLNQY---AVGGGQPLVTGGLLKSLKV 341 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 +PP+ EQ I +++ + + E + + L ++ R + Sbjct: 342 QLPPLSEQERIATILDKFDTLTNSISEGLPRETALRQKQYQYYRDLLL 389 >gi|328947120|ref|YP_004364457.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328447444|gb|AEB13160.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 384 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 44/406 (10%), Positives = 113/406 (27%), Gaps = 37/406 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K + +K+ TG+ + + + G Y + + + + Sbjct: 4 KYIKLKKIATYPTGKLNSNAAE-----------KDGIYPFFTCSHDIYRINNYAYDGEYV 52 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT-QRIEAICEGA 144 +L G + + ++QP D + SI + + +++ G Sbjct: 53 LLGGNNATGDFPIFYYNGKFNAYQRTYLIQPIDTNQFDTRYLFYSIGLKLKLMQSNAAGT 112 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + NI + PL Q I + A I + E L Sbjct: 113 ATRFLTQPILDNINIEYRPLPTQQKIASILSAYDDLIQNYKKQIEALQTAASE----LYK 168 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL------- 257 + P + +P+ W + + K + Sbjct: 169 EWFVRFRFPGWQNAKFE----NGIPEGWSICRLKDFGKVITGKTPPTEKEEYYGGDVMFV 224 Query: 258 --SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 +GN+ + + + Y+ Q + I+ I + + Sbjct: 225 KTPDMHGNMFVQSTSEYLSKLGCEYQKAQYLPENSIMVSCIGT----GGITAINAYPANT 280 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + +L + + + + +L ++L V+ P Sbjct: 281 NQQINSIILKDKKYLPWLYFTISNMKETIEMFGNTGTTMTNLSKGKFEKLKVVKPEHS-- 338 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I + + + ++ + + I L ++R + ++G++++ Sbjct: 339 --IIQTFENKVSPLFEQIKNLNKQITNLTQQRDLLLPRLMSGKLEV 382 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 59/191 (30%), Gaps = 8/191 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIY------IGLEDVESG-TGKYLPKDGNSRQS 73 IP+ W + +K F K+ TG+T + K+ Y + D+ + + + Sbjct: 188 IPEGWSICRLKDFGKVITGKTPPTEKEEYYGGDVMFVKTPDMHGNMFVQSTSEYLSKLGC 247 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + I+ +G I + + Q + KD + +S Sbjct: 248 EYQKAQYLPENSIMVSCIG-TGGITAINAYPANTNQQINSIILKDKKYLPWLYFTISNMK 306 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 G TM++ + + P + K+ +I L + + Sbjct: 307 ETIEMFGNTGTTMTNLSKGKFEKLKVVKPEHSIIQTFENKVSPLFEQIKNLNKQITNLTQ 366 Query: 194 LLKEKKQALVS 204 L+S Sbjct: 367 QRDLLLPRLMS 377 >gi|170724867|ref|YP_001758893.1| restriction modification system DNA specificity subunit [Shewanella woodyi ATCC 51908] gi|169810214|gb|ACA84798.1| restriction modification system DNA specificity domain [Shewanella woodyi ATCC 51908] Length = 612 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 72/200 (36%), Gaps = 5/200 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P WE + +L +K E + + N LK Sbjct: 115 SEDEKPFELPKGWEWTRLQDIGHDLGQKTPD-CEFTYIDVGAINKELGFVEEPSILKASD 173 Query: 280 YETY--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWL 336 + ++V +++ + ++ + I ++A+ + P G++S+Y+ Sbjct: 174 APSRARKLVKRNTVIYSTVRPYLLNIAVIGNDLSPEPIASTAFAIIHPLLGMNSSYIYRY 233 Query: 337 MRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 +RS ++ +G+ ++ + + +PP +EQ I ++ D L + Sbjct: 234 LRSPCFINYVESVQTGIAYPAINDKQFFNGIIAIPPTEEQHRIVAKVDELMILCDALEAQ 293 Query: 396 IEQSIVLLKERRSSFIAAAV 415 E S + + A + Sbjct: 294 TEASKSAHQTLVEILLGALL 313 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 73/194 (37%), Gaps = 12/194 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFAL---VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 + E +P+ WE F + +T+ K IE I LS ++ Sbjct: 409 TDEEKPFELPNGWEWARFVDIAYLITDGAHHTPKYIEHGIPFLSVKDMSDGKLNFGDTRF 468 Query: 277 PESYETYQIVD-----PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + ++ G+++ I L + ++ A + + ID Sbjct: 469 ISEEQHKDLIKRCNPQKGDLLLTKIGTTG-VPVLINTDKEFSIFVSVALIKFSTNEIDGN 527 Query: 332 YLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 +L+ L++S + K G+ ++L + + P+L PP+ EQ I ++ A D Sbjct: 528 FLSLLVKSPLVKKQSQEGTQGVGNKNLVLKTISNFPLLFPPLNEQHRIVAKVDELMALCD 587 Query: 391 VLVEKIE--QSIVL 402 L ++ Q+I L Sbjct: 588 QLKARLSDAQTIQL 601 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 65/197 (32%), Gaps = 9/197 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQ--S 73 +P W+ L T + K I ++ ++D+ G + S + Sbjct: 416 ELPNGWEWARFVDIAYLITDGAHHTPKYIEHGIPFLSVKDMSDGKLNFGDTRFISEEQHK 475 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLS 130 D KG +L K+G +I +F S + ++ L + S Sbjct: 476 DLIKRCNPQKGDLLLTKIGTTGVPVLINTDKEFSIFVSVALIKFSTNEIDGNFLSLLVKS 535 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 V ++ + +G + K I N P+ PPL EQ I K+ D L Sbjct: 536 PLVKKQSQEGTQGVGNKNLVLKTISNFPLLFPPLNEQHRIVAKVDELMALCDQLKARLSD 595 Query: 191 FIELLKEKKQALVSYIV 207 + A+V + Sbjct: 596 AQTIQLHLTDAIVEQAI 612 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 70/191 (36%), Gaps = 10/191 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD--TST 77 +PK W+ ++ +T + YI + + + ++ + + SD + Sbjct: 122 ELPKGWEWTRLQDIGHDLGQKTP--DCEFTYIDVGAI-NKELGFVEEPSILKASDAPSRA 178 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQP-KDVLPELLQGWLLSID 132 + + ++Y + PYL + D I ST F ++ P + + +L S Sbjct: 179 RKLVKRNTVIYSTVRPYLLNIAVIGNDLSPEPIASTAFAIIHPLLGMNSSYIYRYLRSPC 238 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +E++ G + K N + IPP EQ I K+ + D L + Sbjct: 239 FINYVESVQTGIAYPAINDKQFFNGIIAIPPTEEQHRIVAKVDELMILCDALEAQTEASK 298 Query: 193 ELLKEKKQALV 203 + + L+ Sbjct: 299 SAHQTLVEILL 309 >gi|317502424|ref|ZP_07960588.1| type I site-specific deoxyribonuclease [Lachnospiraceae bacterium 8_1_57FAA] gi|316896162|gb|EFV18269.1| type I site-specific deoxyribonuclease [Lachnospiraceae bacterium 8_1_57FAA] Length = 359 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 44/358 (12%), Positives = 117/358 (32%), Gaps = 20/358 (5%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 VP+ +F K + R + +DI + + + +Y K+ D +T I +G Sbjct: 6 VPLGKFIKEYSERN-KGNEDIPVYSVTNSQGFCTEYFGKE--VASQDKTTYKIVPQGYFA 62 Query: 88 YGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAICEGA 144 Y + + I S + V + + + L D+ Q I+A G+ Sbjct: 63 YNPSRINVGSVDWQRYEKRVIVSPLYNVFSVSEGIDRQYLYYFLRSDLGRQMIKAKASGS 122 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + +P + +Q + ++ LI R + ++ L E +A Sbjct: 123 VRDNLKLDMLKEMTIPDISVEQQKFCSSVLD----KLHKLIQMRQQELQKLDEFIKARFV 178 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + ++ + ++ + +G + L K + ++ N Sbjct: 179 ELFGDPVSNSYGLPEATLPDLGEFGRGVSKHRPRNDIKLLGGKYPLIQTGDV-----ANA 233 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + + + ++ D G + ++A + + + Sbjct: 234 GLYITSYSSTYSELGLKQSKMWDKGTLCI-----TIAANIAKTAILEFDACFPDSVVGFI 288 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + S+ + ++++ + + L V+VP ++Q + + Sbjct: 289 ANERTNNIFVHYWFSFFQAILESQAPESAQKNINLKILSELKVIVPEKRKQDQFASFV 346 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 70/173 (40%), Gaps = 11/173 (6%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 P + E + +N + + S++ E + + TY+IV G + Sbjct: 7 PLGKFIKEYSERNKGNEDIPVYSVTNSQGFC-TEYFGKEVASQDKTTYKIVPQGYFAYNP 65 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAV-KPHGIDSTYLAWLMRSYDLCKVFYAMGSG-L 353 + S+ + +R I++ Y GID YL + +RS ++ A SG + Sbjct: 66 SRIN--VGSVDWQRYEKRVIVSPLYNVFSVSEGIDRQYLYYFLRSDLGRQMIKAKASGSV 123 Query: 354 RQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 R +LK + +K + + P I EQ + ++ L++ +Q + L E Sbjct: 124 RDNLKLDMLKEMTI--PDISVEQQK---FCSSVLDKLHKLIQMRQQELQKLDE 171 >gi|206890601|ref|YP_002249484.1| restriction endonuclease S subunit [Thermodesulfovibrio yellowstonii DSM 11347] gi|206742539|gb|ACI21596.1| restriction endonuclease S subunit [Thermodesulfovibrio yellowstonii DSM 11347] Length = 404 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 53/368 (14%), Positives = 106/368 (28%), Gaps = 32/368 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P W + ++ + D + + +P G + Q I Sbjct: 20 LPNGWVWTRLGEVVEILDNKRIPVNTDEREKRISG--KSPSELIPYYGATGQVGWIDDYI 77 Query: 81 FAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 F + +L G+ G P KA I + VL+ + + Sbjct: 78 FDEELVLLGEDGAPFFEPTKNKAYIIRGKSWVNNHAHVLRGINGVILNSFICHYLNIFDY 137 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 G T + + IP+P+PPL EQ I KI R++ + + + Sbjct: 138 H--GYVTGTTRLKLNQSSMQQIPIPLPPLNEQKRIVAKIEELFTRLEAGVEALKKVKAQI 195 Query: 196 KEKKQALVSYIVTKGLN---PDVKMKDSGI-------------EWVGLVPDHWEVKPFFA 239 + +QA++ Y L + SG + +P+ W Sbjct: 196 RRYRQAVLKYAFEGKLTNSSSCHSEQRSGEGISEIVTQPSVANNDLPELPEGWRWVKLGE 255 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + ++ N + + K E + P + I L Sbjct: 256 AAEIIMGQSPPSKTYNTVRIGLPFYQGKAEFGLIYPIP---SKWCSKPKKIAEKNDILLS 312 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLK 358 + E I A++ G+ +L ++ + +G+G ++ Sbjct: 313 IRAPVGPTNICFETSCIGRGLAAIRFGGLYKFLFYYL---RNVEREISKIGTGSTFSAIS 369 Query: 359 FEDVKRLP 366 + L Sbjct: 370 KSQISNLK 377 Score = 75.6 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 22/190 (11%), Positives = 61/190 (32%), Gaps = 30/190 (15%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + G +++ L+ + + + + E + + + E + Sbjct: 24 WVWTRLGEVVEILDNKRIPVNTDEREKRISGKSPSELIPYYGATGQVGWIDDYIFDEELV 83 Query: 316 I-------------TSAYMAVKPHGIDST--------------YLAWLMRSYDLCKVFYA 348 + AY+ +++ ++ + +D Sbjct: 84 LLGEDGAPFFEPTKNKAYIIRGKSWVNNHAHVLRGINGVILNSFICHYLNIFDYHGYV-- 141 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 R L ++++P+ +PP+ EQ I I R++ VE +++ ++ R Sbjct: 142 -TGTTRLKLNQSSMQQIPIPLPPLNEQKRIVAKIEELFTRLEAGVEALKKVKAQIRRYRQ 200 Query: 409 SFIAAAVTGQ 418 + + A G+ Sbjct: 201 AVLKYAFEGK 210 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 14/90 (15%), Positives = 30/90 (33%), Gaps = 2/90 (2%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLE-DVESGTGKYLPKDGNSRQSDTS 76 + +P+ W+ V + ++ G++ S K + + G ++ + + Sbjct: 241 LPELPEGWRWVKLGEAAEIIMGQSPPS-KTYNTVRIGLPFYQGKAEFGLIYPIPSKWCSK 299 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGI 106 I K IL P I + I Sbjct: 300 PKKIAEKNDILLSIRAPVGPTNICFETSCI 329 >gi|308190008|ref|YP_003922939.1| type I site-specific deoxyribonuclease [Mycoplasma fermentans JER] gi|307624750|gb|ADN69055.1| type I site-specific deoxyribonuclease [Mycoplasma fermentans JER] Length = 392 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 48/402 (11%), Positives = 114/402 (28%), Gaps = 38/402 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P ++ +K TK+ G + ++ + + + Sbjct: 13 PDGYEWKDLKDITKIYVGGDLPKKS---FSETKNENFNVKILTNGSFGNNIKGYTDSYVV 69 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G + + +++ + P + + + Sbjct: 70 PGNSITISARGTIGYCEYQNE------PFYPIIRLLAIHPTWHNSKFIYYFLKNLNISPS 123 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + + K I NI +P+ PL Q I E + +E ++ Sbjct: 124 SKSGIPQLTRKHIENIKIPLIPLKIQEKIVEILERF--------RILEAELEARGKQFDF 175 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 ++ ++ K ++ +G K + V R I L Sbjct: 176 WINKLLNFSN--FNKNNSKELQSIGCFISGLRSKNKDSFVDGNQR--------YISYLDV 225 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER------GI 315 N + N +K E ++ G+++F D+ S ++ Sbjct: 226 FNNKEINHLPNNFVKIFDDENQNNLNYGDVIFCGSSENFDETGYASVYTIKSDEKVYLNS 285 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKE 374 + + + + D + +G R +L E + ++ + +PP+K Sbjct: 286 FSFIFRFKDNELFLPKFSKYFFNCKDFRDLLLKCINGVTRFNLSKEKMSKIKIPIPPLKT 345 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVL----LKERRSSFIA 412 Q I ++++ + + + I L K R + Sbjct: 346 QNKIVSILDKLSEYSQEINSGLPAEIELRSKQFKYYRDQLLN 387 >gi|153807717|ref|ZP_01960385.1| hypothetical protein BACCAC_01999 [Bacteroides caccae ATCC 43185] gi|149129326|gb|EDM20540.1| hypothetical protein BACCAC_01999 [Bacteroides caccae ATCC 43185] Length = 383 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 48/387 (12%), Positives = 120/387 (31%), Gaps = 44/387 (11%) Query: 30 IKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +K F ++TG T D ++ ED+++ ++ +T I Sbjct: 18 LKSFADVSTGGTPSKANLEYWNGDKPWVSAEDMKNKY--VYDTCEKVTEAGYATCKIIPV 75 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++Y G I + + + D + + + + I+ + G Sbjct: 76 DTLMYVCRGSI-GVMAINKIECATNQSICRAKCHDNVCNVEFLYHALMYQKDNIKKMGTG 134 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + + +PP EQ+ + K K Sbjct: 135 TSFKSLNQTSFSELKIELPPYNEQMKFVSI-----------------AQQADKSKFGDFK 177 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + NP + + ++ +G + K + + + Y Sbjct: 178 SQFIEMFGNPLSLNQKNELKRLGEC--CILNPRRPNIALCDTDKVSFIPMPAVSEDGYLV 235 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYM 321 + E + + + + +++F I ++N K ++ G+ ++ + Sbjct: 236 DMTDEEYGKVK------KGFTYFENNDVLFAKITPCMENGKGAIVHGLTNGIGMGSTEFH 289 Query: 322 AVKPHG--IDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 ++P +L L R + G+G ++ + + V +P ++EQ Sbjct: 290 VLRPINGISSPYWLLALTRMPIFRERAAKNMSGTGGQKRVSASYLDHFMVGLPAMEEQRR 349 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLK 404 + D I++++V L Sbjct: 350 F----EAIYRQADKSESVIQKALVYLN 372 >gi|331266255|ref|YP_004325885.1| type I restriction-modification system S subunit, putative [Streptococcus oralis Uo5] gi|326682927|emb|CBZ00544.1| type I restriction-modification system S subunit, putative [Streptococcus oralis Uo5] Length = 358 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 41/396 (10%), Positives = 112/396 (28%), Gaps = 49/396 (12%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + K +T + + + Y N S + K + Sbjct: 2 KLIDVCKPKQWKTISTNELV-----------KDGYPVFGANGIIGYFSDYNH-EKPTLCI 49 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G + T + ++ +L + + G+ Sbjct: 50 TCRGATCGTVNKSLPYSYV-TGNSMALDDLDESVIMIDFLYYFLQYRGFNDVITGSAQPQ 108 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + + I +P + Q I + I I + + +L+ Sbjct: 109 ITRQSLSKIIIPDFDITIQKEIAQTIYDLEHLILIRNKQIEKLADLV------------- 155 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN----ILSLSYGNI 264 K E G + ++ + K +L+ N + ++ + Sbjct: 156 ---------KSRFNEMFGDIFENPQS-KLEDHTELNPNKREELLNFNGDVSFIPMANVSE 205 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI---ITSAYM 321 K+ + + + +++ I + + GI T ++ Sbjct: 206 NGKINLSINRNIDDVRKGFTFFKDNDVIVAKITPCFENGKGAPLFGLLNGIGFGSTEFHV 265 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +++ +L + + + GSG ++ + + + + +PP+ Q + Sbjct: 266 LRPKNTVNTVWLYHVTMLSEFRREGERKMTGSGGQRRIPKDFISNFKLNIPPLSLQNEFA 325 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + A++D I++S+ L+ + S + Sbjct: 326 EFV----AQVDKSQLAIQKSLEELETLKKSLMQEYF 357 >gi|298292637|ref|YP_003694576.1| Restriction endonuclease S subunits-like protein [Starkeya novella DSM 506] gi|296929148|gb|ADH89957.1| Restriction endonuclease S subunits-like protein [Starkeya novella DSM 506] Length = 506 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 57/453 (12%), Positives = 125/453 (27%), Gaps = 56/453 (12%) Query: 16 QWIGAIPKHWKVVPIKRFTKLN---TGRTSESGKDIIYIGLEDVESG--TGKYLPKDGNS 70 W +P+ W VP+K + G T + + + + ++L Sbjct: 16 PW--ELPEGWAWVPLKMLSNFIGRGRGPTYVEAGGVPVVNQKCIRWHRLEPRHLKLTSRD 73 Query: 71 RQSDTSTVSIFAKGQILYGKLGP-YLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQG 126 G +L+ G + +A+I D + +++P + P L Sbjct: 74 AFDRLPPELYIRAGDLLWNSTGTGTIGRALIYDGSIAELTVDSHVTIVRPSSIDPAYLGH 133 Query: 127 WLLSIDVTQ-RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 ++ + V ++ + + +P+ PLAEQ I +I I Sbjct: 134 FVETSRVQHLVVDGHVGSTNQQELPRSFVEELIVPLAPLAEQRRIVARIDGLFAEIAEGE 193 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--------------------- 224 L ++A++ VT L D + ++ E Sbjct: 194 AALEEARRGLDTFRRAVLKAAVTGELTKDWRERNPVAETGHDLVARMRGSMSKNKRLRTA 253 Query: 225 ------VGLVPDHWEVK--------PFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 + +PD W + + ++ + + + Sbjct: 254 WTPRTDLPELPDTWAWCAVHEAGDVQLGRQRAPQHHTGAHMRPYLRVANVLEDRLDLSDV 313 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + M PE +ET+ + G+++ + + G + Sbjct: 314 KLMNFTPEEFETFA-LKAGDVLLNEGQAPDLLGRPAMYRGEIEGCCFQKTLLRFRASELV 372 Query: 331 TY------LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 M S + + + L + +PP E I ++ Sbjct: 373 DENFALLVFRHYMHSGRFKR--ESRITTNIGHLTQVRFVEMEFPIPPPAEVAVILRRVSE 430 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 A + ++ + S + AA G Sbjct: 431 ALAASADTLAMLDAEAADAARLKQSILKAAFEG 463 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 82/213 (38%), Gaps = 10/213 (4%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKN-TKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 + +P+ W P L + R +E+ + + I+ LK S + Sbjct: 14 DEPWELPEGWAWVPLKMLSNFIGRGRGPTYVEAGGVPVVNQKCIRWHRLEPRHLKLTSRD 73 Query: 282 TYQIVDP------GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + + P G++++ R+L + + S V+P ID YL Sbjct: 74 AFDRLPPELYIRAGDLLWNSTGTGTIGRALIYDGSIAELTVDSHVTIVRPSSIDPAYLGH 133 Query: 336 LMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + + + +GS +Q L V+ L V + P+ EQ I I+ A I Sbjct: 134 FVETSRVQHLVVDGHVGSTNQQELPRSFVEELIVPLAPLAEQRRIVARIDGLFAEIAEGE 193 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 +E++ L R + + AAVTG++ + + Sbjct: 194 AALEEARRGLDTFRRAVLKAAVTGELT-KDWRE 225 >gi|262371155|ref|ZP_06064476.1| restriction modification system DNA specificity subunit [Acinetobacter johnsonii SH046] gi|262313885|gb|EEY94931.1| restriction modification system DNA specificity subunit [Acinetobacter johnsonii SH046] Length = 369 Score = 84.5 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 50/395 (12%), Positives = 107/395 (27%), Gaps = 42/395 (10%) Query: 30 IKRFTKLNTGRTSESGKD------IIYIGLEDVESG-TGKYLPKDGNSRQSDTSTVSIFA 82 + G T + I + ++D+ G T + + S ++ Sbjct: 8 LGELVDFKGGGTPSRNVEEYWDNSIPWATVKDLNEGITLTQTQEFISELGLKNSASNLIT 67 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 KG I+ + I I V ++ + I ++ + Sbjct: 68 KGTIIIPTRMALGKVVISEIDVAINQDLKAVSVKDKEKLDVKYLLRFLESYKENIASMGK 127 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GAT+ + I +P+PPLA Q I + + +LL+ Sbjct: 128 GATVKGITLDQLKAIKVPLPPLAAQRRIASILDQADELRQKRQQAIEKLDQLLQAT---- 183 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + +P K + +G V + + E N S Sbjct: 184 ---FIDMFGDPVSNSKKWTEKTLGEV-VVFNTGKLDSNAAEENGIYPFFTCSRTPFAINT 239 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 E + + + ++ + + + Sbjct: 240 YAFDI----------------------EALLLAGNNAAGQYWVKHYKGKFNAYQRTYVLT 277 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 +K YL ++++ + GS + L +K + + +PP+ Q Sbjct: 278 IKDSLCTYGYLRYVLQFLLGFLQRMSKGSSTKY-LTLSILKPIKIPIPPLDLQRKFIQFY 336 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 ++ +L+ I K + A G Sbjct: 337 ENIDSQNQLLM---RNEIEFSK-LFFTLQNQAFNG 367 >gi|313887160|ref|ZP_07820856.1| type I restriction modification DNA specificity domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312923389|gb|EFR34202.1| type I restriction modification DNA specificity domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 382 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 56/387 (14%), Positives = 118/387 (30%), Gaps = 27/387 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + R ++ D + + ++ N+ +D S I Q Sbjct: 7 KRLGDYIQPVDIRNKDNAVDKLV-----GLTIDKAFIDSVANTIGTDLSKYKIIEAEQFA 61 Query: 88 -----YGKLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G I S + V+ ++LP L W + + Sbjct: 62 CSLMQVSRDGKMPIAMYAGGEKAILSPAYSMFEVIDKSELLPSYLMMWFRRSEFDREASF 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G W + +PIP + EQ I + I I R L+E Sbjct: 122 YAVGGVRGSLLWDDFLDFRLPIPDIEEQQEIVA----QYEAITRRIALNERICANLEETA 177 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 QAL + + +G++PD + + + K + E + + Sbjct: 178 QALYNKMFVQGIDPDNLPDGWRMGAIEEFGEVVTGKTPSSRYPEHFGD---YMPFVTPAE 234 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 G + R + ++ +++ G+++ I K ++ +V+ I S Sbjct: 235 FQGEKFIRTAERKLSIEGVKALEKKVIREGDVMVTCIGSDMGKAAISDTEVVTNQQINS- 293 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 S YL + ++ + GS L + + V PP Sbjct: 294 --IRTYDNTFSEYLYYTLKGMKEILMGLGSGSSTMPLLSKRSFEVVEVPYPPTDL----I 347 Query: 380 NVINVETARIDVLVEKIEQSIVLLKER 406 + + ++E+ + +L+E Sbjct: 348 QCFSQTVKPLSTIIERKSKEKDILREM 374 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 51/148 (34%), Gaps = 9/148 (6%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQND-KRSLRSAQVMERGIITSAYMAV--- 323 Y+I++ + + + D K + E+ I++ AY Sbjct: 37 FIDSVANTIGTDLSKYKIIEAEQFACSLMQVSRDGKMPIAMYAGGEKAILSPAYSMFEVI 96 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + +YL R + + G+R SL ++D + +P I+EQ +I Sbjct: 97 DKSELLPSYLMMWFRRSEFDREASFYAVGGVRGSLLWDDFLDFRLPIPDIEEQQEIVAQY 156 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSF 410 T R + E+ L+E + Sbjct: 157 EAITRR----IALNERICANLEETAQAL 180 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 61/170 (35%), Gaps = 7/170 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVE-SGTGKYLPKDGNSRQS 73 +P W++ I+ F ++ TG+T S G + ++ + + + + + Sbjct: 194 LPDGWRMGAIEEFGEVVTGKTPSSRYPEHFGDYMPFVTPAEFQGEKFIRTAERKLSIEGV 253 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + +G ++ +G + KA I+D + + + Q ++ D + L Sbjct: 254 KALEKKVIREGDVMVTCIGSDMGKAAISDTEVVTNQQINSIRTYDNTFSEYLYYTLKGMK 313 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + +TM + + +P PP + + + I+ Sbjct: 314 EILMGLGSGSSTMPLLSKRSFEVVEVPYPPTDLIQCFSQTVKPLSTIIER 363 >gi|149003726|ref|ZP_01828571.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP14-BS69] gi|147758288|gb|EDK65289.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP14-BS69] Length = 520 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 63/415 (15%), Positives = 133/415 (32%), Gaps = 66/415 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS-------------------------- 220 +L KE ++++ Y + L +S Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 221 -------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 E +P+ WE + + + R + + + + Sbjct: 323 DISIVSQGDDNSYYEEVPCEIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQ 382 Query: 268 LETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA- 319 ++ L SY+ +++ G++++ L R + A Sbjct: 383 WSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVAD 442 Query: 320 ----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVL 368 + V I+ ++ + S + V SG ++ L + +K + Sbjct: 443 SHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIP 497 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 >gi|323477546|gb|ADX82784.1| type I restriction-modification system specificity subunit [Sulfolobus islandicus HVE10/4] Length = 232 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 79/227 (34%), Gaps = 26/227 (11%) Query: 217 MKDSG--IEWVGLVPDHWEVKPFFALVTELNR-------KNTKLIESNILSLSYGNIIQK 267 MK S G P +WEVK + K+ I L L+ +I + Sbjct: 1 MKMSDYVETEFGEFPKNWEVKRLSEIAELQRGLGYSGKEKSKDEIPDGYLFLTLNSIKKG 60 Query: 268 LETRNMG---LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV--------MERGII 316 + G +K + + V G+IV DL ND + S + E+ + Sbjct: 61 GGLKEDGWTWIKSDRLKERHFVREGDIVIVNTDLSNDGSLIGSPAIVHFPEWYKKEKAVF 120 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDV-KRLPVLVPPIKE 374 + + + + + + + + L + +PP++E Sbjct: 121 SLDIFKLLLKVSNVDVNFLFYYLIFVQPLARKYHTGTTVWRINVDSWARDLLIPLPPLEE 180 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 Q I ++ + ID +E + + LK + + A ++G+I + Sbjct: 181 QKKIVKML----SIIDNKIEVETRYLEYLKRLKEKLLTALMSGRIRV 223 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 63/207 (30%), Gaps = 21/207 (10%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTGKYLPKDGNS 70 G PK+W+V + +L G + +++ L ++ G G Sbjct: 12 GEFPKNWEVKRLSEIAELQRGLGYSGKEKSKDEIPDGYLFLTLNSIKKGGGLKEDGWTWI 71 Query: 71 RQSDTSTVSIFAKGQILY-----GKLGPYLRKA-------IIADFDGICSTQFLVLQPKD 118 + +G I+ G + + S L K Sbjct: 72 KSDRLKERHFVREGDIVIVNTDLSNDGSLIGSPAIVHFPEWYKKEKAVFSLDIFKLLLKV 131 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGI-GNIPMPIPPLAEQVLIREKIIAE 177 ++ + I V G T+ + ++ +P+PPL EQ I + + Sbjct: 132 SNVDVNFLFYYLIFVQPLARKYHTGTTVWRINVDSWARDLLIPLPPLEEQKKIVKMLSII 191 Query: 178 TVRIDTLITERIRFIELLKEKKQALVS 204 +I+ L ++ AL+S Sbjct: 192 DNKIEVETRYLEYLKRLKEKLLTALMS 218 >gi|293369057|ref|ZP_06615655.1| type I restriction modification DNA specificity domain protein [Bacteroides ovatus SD CMC 3f] gi|292635863|gb|EFF54357.1| type I restriction modification DNA specificity domain protein [Bacteroides ovatus SD CMC 3f] Length = 374 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 56/365 (15%), Positives = 124/365 (33%), Gaps = 26/365 (7%) Query: 49 IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR---KAIIADFDG 105 ++I +D++ + D +I+ KG +L LR I D + Sbjct: 13 LWITSKDMKFAHIADSLLKISDAALDQM--TIYGKGTLLIVTRSGILRHTFPIAILDTEA 70 Query: 106 ICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 + + + L + + + + +G T+ D+ + +P+PPL Sbjct: 71 TVNQDVKAISCVLSHIHTYLYYVIKAQEQVILKDYHKDGTTVDSIDFDKFKKLIVPLPPL 130 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 +EQ I E+I ID + + +K+ K ++ + L P +S IE Sbjct: 131 SEQYRIVEEIEHWFALIDQIEQGKTDLQTTIKQIKGKILDLAIHGKLVPQDPNDESAIEL 190 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-- 282 + + + N ++ L RN+ +K + + Sbjct: 191 LKRINPDFTPCDNRHYTQLPNGWAVCRLDQVADVLDNLRKPINSNERNLRIKGKQIDRLY 250 Query: 283 -------------YQIVDPGEIVFRFIDL-QNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 IVD ++ DK ++++ + + + + + P Sbjct: 251 PYYGATGQVGLIDDYIVDGHYLLLGEDGAPFLDKNAIKAYSISGKSWVNNHVHILSPKID 310 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +L S + + R L D+ + +++PP+ EQ I I ++ Sbjct: 311 ----FEFLQYSLNQIDYSEYVNGSTRLKLTQTDMCSIRLMLPPLSEQKLIKAKIQTLFSQ 366 Query: 389 IDVLV 393 +D+++ Sbjct: 367 LDMIM 371 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 59/176 (33%), Gaps = 1/176 (0%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 ++ + ++ S + + + + + I G ++ Sbjct: 1 MDNRKYWNNAKHLWITSKDMKFAHIADSLLKISDAALDQMTIYGKGTLLIVTRSGILRHT 60 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDV 362 + E + TYL +++++ + + S+ F+ Sbjct: 61 FPIAILDTEATVNQDVKAISCVLSHIHTYLYYVIKAQEQVILKDYHKDGTTVDSIDFDKF 120 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 K+L V +PP+ EQ+ I I A ID + + +K+ + + A+ G+ Sbjct: 121 KKLIVPLPPLSEQYRIVEEIEHWFALIDQIEQGKTDLQTTIKQIKGKILDLAIHGK 176 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 56/166 (33%), Gaps = 2/166 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W V + + + + + ++ + + P G + Q Sbjct: 208 QLPNGWAVCRLDQVADVLDNLRKPINSNERNLRIKGKQID--RLYPYYGATGQVGLIDDY 265 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I +L G+ G I ++ + P++ +L Sbjct: 266 IVDGHYLLLGEDGAPFLDKNAIKAYSISGKSWVNNHVHILSPKIDFEFLQYSLNQIDYSE 325 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 G+T + +I + +PPL+EQ LI+ KI ++D ++ Sbjct: 326 YVNGSTRLKLTQTDMCSIRLMLPPLSEQKLIKAKIQTLFSQLDMIM 371 >gi|119512203|ref|ZP_01631293.1| type I site-specific deoxyribonuclease [Nodularia spumigena CCY9414] gi|119463169|gb|EAW44116.1| type I site-specific deoxyribonuclease [Nodularia spumigena CCY9414] Length = 318 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 34/245 (13%), Positives = 76/245 (31%), Gaps = 19/245 (7%) Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKD-----SGIEWVGLVPDHWEVKPFFALVTE 243 + E + ++ A + + K+K + +PD W L Sbjct: 29 KQRREKWEAEQLAKMQAQGKTPKDDSWKLKYKEPVAPDTSELPELPDGWVWATLPQLGEL 88 Query: 244 LNRKNTK-------LIESNILSLSYGNIIQK---LETRNMGLKPESYETYQIVDPGEIVF 293 K+ L + G++ + E + ++ G + Sbjct: 89 NRGKSKHRPRNDPKLYGGQYPFIQTGDVRSANGVIHGYTQTYSEEGLKQSRLWSKGTLCI 148 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + L I+ + + ++ + +R+ YA Sbjct: 149 TIAANIAETAILGFDACFPDSIVG---FISNSNNCEINWIEFFIRTAKENLERYAPA-TA 204 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++++ E + L V +P EQ I + + + D L + ++ +I + R S + Sbjct: 205 QKNINVEILSDLAVPLPSWAEQSKIVEELELIFSVTDQLEKTVDTNIKRAERLRQSILKQ 264 Query: 414 AVTGQ 418 A TGQ Sbjct: 265 AFTGQ 269 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 72/216 (33%), Gaps = 9/216 (4%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGN 69 + +P W + + +LN G++ ++ +I DV S G Sbjct: 70 LPELPDGWVWATLPQLGELNRGKSKHRPRNDPKLYGGQYPFIQTGDVRSANGVIHGYTQT 129 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + +++KG + + + + I FD + E+ Sbjct: 130 YSEEGLKQSRLWSKGTLCIT-IAANIAETAILGFDACFPDSIVGFISNSNNCEINWIEFF 188 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + +E + + + + ++ +P+P AEQ I E++ D L Sbjct: 189 IRTAKENLERYAPATAQKNINVEILSDLAVPLPSWAEQSKIVEELELIFSVTDQLEKTVD 248 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 I+ + +Q+++ T L P + + + Sbjct: 249 TNIKRAERLRQSILKQAFTGQLVPQDPNDEPAEKLL 284 >gi|309800156|ref|ZP_07694342.1| type I restriction-modification enzyme, S subunit [Streptococcus infantis SK1302] gi|308116203|gb|EFO53693.1| type I restriction-modification enzyme, S subunit [Streptococcus infantis SK1302] Length = 227 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 19/178 (10%), Positives = 56/178 (31%), Gaps = 12/178 (6%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNM----GLKPESYETYQIVDPGEIVFRFIDLQND 301 + I + N+ IV+ +++ Sbjct: 53 GGRESYVNEGIALIRSMNVYDGKFIFKDLAYLTNVQAEKLNNVIVESDDVLLNITGASVS 112 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMG---SGLRQSL 357 + + ++ + + + S L+ + + + +G RQ++ Sbjct: 113 RCCIVPQIILPARVNQHVSIIRCKKHLLSPIFLNQLLITSEFKSLLQKIGESSGATRQAI 172 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ L + +PP+ Q + + + A++D I++S+ L+ + S + Sbjct: 173 TKNQIEELTIPIPPLSLQNEFADFV----AQVDKSQLAIQKSLEELETLKKSLMQEYF 226 >gi|308189805|ref|YP_003922736.1| type I site-specific deoxyribonuclease [Mycoplasma fermentans JER] gi|307624547|gb|ADN68852.1| type I site-specific deoxyribonuclease [Mycoplasma fermentans JER] Length = 395 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 46/411 (11%), Positives = 111/411 (27%), Gaps = 52/411 (12%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDII-----YIGLEDVESGTGKYLPKDGNSRQSDTS 76 P ++ V ++ G ++ KD + Y+ +V + + S+ Sbjct: 13 PNGYEWVKLENIATFINGLKCKTKKDFLDGNEHYVSYLNVFNNQEIDFLPTSKVKISNNE 72 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICS----------TQFLVLQPKDVLPELLQG 126 + G +++ + A + S LP+ + Sbjct: 73 NQNCLQIGDVIFSGSSENFEETGYASVFNLISENKIYLNSFCFAIRFNNKNLFLPKFSKY 132 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S ++ G T + +I +P+ PL Q I E + RI Sbjct: 133 LFNSEIFRNQLVKCINGVTRFNLSKVKFASIKVPLIPLKIQEKIVEILERF--RILEAEL 190 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + EL KQ ++ +K + + Sbjct: 191 KAELKAELEARGKQF-----------------------------NFWLKKIYGNIDSKYI 221 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + ++ NI + + + + + + I K Sbjct: 222 TKLENLDINIETGKLNANKKNENGKYLFFTCDEKPYRINEYAFDAESILISGNGSKLGHI 281 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKR 364 S + Y+ + + ++ ++ GS + ++ Sbjct: 282 SYYEGKFNAYQRTYVLTSKDVNINLKYLYYFLKHNFKDYISSIHFGSSSVPYITLPILQE 341 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL----LKERRSSFI 411 + +PP++ Q I ++++ + + + I L K R + Sbjct: 342 YKLKLPPLEIQNKIVSILDKLSEYSQEINSGLPAEIELRSKQFKYYRDQLL 392 >gi|322379269|ref|ZP_08053655.1| methylase [Helicobacter suis HS1] gi|321148306|gb|EFX42820.1| methylase [Helicobacter suis HS1] Length = 272 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 29/235 (12%), Positives = 71/235 (30%), Gaps = 8/235 (3%) Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 R + +E A + + + +++++ +G V + + Sbjct: 37 RECKKEQEDLHARLQNLPLEKALKELRVRGVEFVELGEVCSVVDYVANGSFKILSCHVQY 96 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYE--TYQIVDPGEIVFRFIDLQNDKRSLRS 307 E + + + + + SY + P ++V I + Sbjct: 97 LHAEDYAILVRLKDFSNGWRPPFVYINEYSYHFLKKTKLRPNDVVMCNIGSVGVCFKVPD 156 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLP 366 S + + S ++ + +S K ++ + G D K L Sbjct: 157 LGQPMSLATNSILIRPCDSRLLSNFMFYFFKSKIFQKAITSITTQGAHPKFNKTDFKTLK 216 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV-----LLKERRSSFIAAAVT 416 + +PP+ Q I +++ T L ++ + L R+ FI + Sbjct: 217 IPLPPLFIQERIVTILDCLTELTAELTAELTAELTAELTAELTARKKHFITILMR 271 >gi|37528150|ref|NP_931495.1| Type I restriction enzyme specificity protein HsdS [Photorhabdus luminescens subsp. laumondii TTO1] gi|36787587|emb|CAE16692.1| Type I restriction enzyme specificity protein HsdS [Photorhabdus luminescens subsp. laumondii TTO1] Length = 365 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 57/396 (14%), Positives = 132/396 (33%), Gaps = 64/396 (16%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ +++ K+ G+ D ++ +P G+ + + Sbjct: 18 WE--SLEQVAKIKHGK--------------DWKNLNAGDIPVYGSGGIMGYVDTYSYNQP 61 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLV-LQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 +L + G I T + + K ++P+ L ++ +ID+ A+ G Sbjct: 62 TVLIPRKGSITNIFYIESPFWNVDTIYYTEIDAKKIIPKFLYYFIKTIDL----LALDTG 117 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 A + I +PIP L Q I + A T L E + + + L+ Sbjct: 118 AGRPSLTQAILNKIQIPIPSLNIQTEIVRILDAFTELTAKLTAELTARQKQYEYYRDQLL 177 Query: 204 SYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 S +++ +EW +G + T N +++ Sbjct: 178 S------------FEENEVEWKTLGE---------IATIGTGSRNTNEAVLDGQYPFFVR 216 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY- 320 + +++ ++ I+ G+ V + + + AY Sbjct: 217 SQEPRAIDSF-------EFDETAIITAGDGV--------GVGKVFHYVSGKYALHQRAYR 261 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + V+ +S +L + +R+ + SL+ ++ P+ +PP+ EQ I Sbjct: 262 IVVRDDRFNSKFLFYYIRNNFAHYLTKVSVHASVTSLRKPMFEKYPIPIPPLVEQDRIVA 321 Query: 381 VINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +++ + E + + I L ++ R ++ Sbjct: 322 ILDKFDTLTSSISEGLPREIELRQKQYEYYRDLLLS 357 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 64/210 (30%), Gaps = 26/210 (12%) Query: 2 KHYKAYPQYKD---SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVES 58 K Y Y+D S + + + + + TG + + E V Sbjct: 164 ARQKQYEYYRDQLLSFEE--NEV----EWKTLGEIATIGTGSRNTN---------EAVLD 208 Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQP 116 G + + R D+ F + I+ G + K + + ++ Sbjct: 209 GQYPFFVRSQEPRAIDS---FEFDETAIITAGDGVGVGKVFHYVSGKYALHQRAYRIVVR 265 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 D + + + + + A+++ P+PIPPL EQ I + Sbjct: 266 DDRFNSKFLFYYIRNNFAHYLTKVSVHASVTSLRKPMFEKYPIPIPPLVEQDRIVAILDK 325 Query: 177 E---TVRIDTLITERIRFIELLKEKKQALV 203 T I + I + E + L+ Sbjct: 326 FDTLTSSISEGLPREIELRQKQYEYYRDLL 355 >gi|86137461|ref|ZP_01056038.1| type I restriction system specificity protein [Roseobacter sp. MED193] gi|85825796|gb|EAQ45994.1| type I restriction system specificity protein [Roseobacter sp. MED193] Length = 395 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 59/402 (14%), Positives = 118/402 (29%), Gaps = 46/402 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + T + TG++ K + S G Y + Sbjct: 17 EWKQLGELTNVKTGQSVNKNK---------IASNPGPYAVINSGREPLGFIDEWNTDDDP 67 Query: 86 ILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I G + + + + V D E + L ++ I A+C Sbjct: 68 IGVTTRGAGVGSITWQEGKYFRGNLNYAVSIKSDAKLETRYLYHLLLEKQADIHALCTFD 127 Query: 145 TMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + + + +PIP LA Q I + + T L E + Sbjct: 128 GIPALNAGKLKGLVIPIPCPDDPEKSLAIQAEIVRILDSFTELTAELTAELKARKQQYNH 187 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + L+S D +EW K +V N K + + + Sbjct: 188 YRDQLLS------------FDDGDVEW----------KTLGDVVDFQNAKPHEKLVTPDG 225 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG--I 315 ++ ++ + ++ DL N + ++ V E G Sbjct: 226 DVALLTAGYISTDGRSARFVKTTDVLTPAFKNDVAMVMSDLPNGRALAKTFFVDEDGRYA 285 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKE 374 ++ +S +L + K SG + LK + + + V VP +E Sbjct: 286 ANQRVCLLRVKDPESFSSKFLHYVMNRNKQLLRYDSGYDQTHLKKDWILGVKVPVPSAEE 345 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q I +++ L E + + I L ++ R ++ Sbjct: 346 QNRIVTILDKFNTLTASLSEGLPREIKLRQQQYEYYRDLLLS 387 >gi|313678344|ref|YP_004056084.1| type I restriction modification system, S subunit [Mycoplasma bovis PG45] gi|312950546|gb|ADR25141.1| type I restriction modification system, S subunit [Mycoplasma bovis PG45] Length = 359 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 45/362 (12%), Positives = 100/362 (27%), Gaps = 21/362 (5%) Query: 49 IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF--DGI 106 + +D +Y G + +D I + Y + + G+ Sbjct: 6 YSVTNKDGFVNQNEYFDDGGKAVFADKKNSLIVSINTFAYNPSRINVGSLALYKHSEMGL 65 Query: 107 CSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 S + V Q + + + + + + + P A Sbjct: 66 VSPIYEVFQINNNNNPDFFLLWFRSEAFKNIVSTNSNKSVRDTLNLSQFESESVNFPNFA 125 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 EQ I I + + L L+ + + ++ Sbjct: 126 EQSKISSLFTHLDSLITLHQRKLLSLKNLKSR----LLDRMFCDEKSQFPSIRFKEFTNA 181 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 E F+ + K+ + ++ + K ++ Sbjct: 182 WEQEKLGECSKFYNGL-TSVSKSDFGVGKDLYIDYLNVFNNTFSQFSELKKFKNSSRQNY 240 Query: 286 VDPGEIVFRFIDLQNDKRS----LRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMR 338 V +++ D+ + + + S + V+ + D Y+ + R Sbjct: 241 VQFKDVILTISSETPDEVAMSSVINWKNDYKNVAFNSFCILVRFNQLEKYDVNYIGYFFR 300 Query: 339 SYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKI 396 S M G+ R ++ +K+ L PP I EQ I NV+ +D L+ Sbjct: 301 SNSFRTQAMLMAQGISRFNINQTALKKTLFLFPPNIYEQQKIGNVLY----YLDSLITLH 356 Query: 397 EQ 398 ++ Sbjct: 357 QR 358 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 53/158 (33%), Gaps = 6/158 (3%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + + E + G K + + F + + + SL + E G+ Sbjct: 6 YSVTNKDGFVNQNEYFDDGGKAVFADKKNSLIVSINTFAYNPSRINVGSLALYKHSEMGL 65 Query: 316 ITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIK 373 ++ Y + + + + RS + +R +L + V P Sbjct: 66 VSPIYEVFQINNNNNPDFFLLWFRSEAFKNIVSTNSNKSVRDTLNLSQFESESVNFPNFA 125 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 EQ I + +D L+ ++ ++ LK +S + Sbjct: 126 EQSKI----SSLFTHLDSLITLHQRKLLSLKNLKSRLL 159 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 52/174 (29%), Gaps = 19/174 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + +K G TS S D +YI +V + T + + S Sbjct: 182 WEQEKLGECSKFYNGLTSVSKSDFGVGKDLYIDYLNVFNNTFSQFSELKKFKNSSRQNYV 241 Query: 80 IFAKGQILYGKLGPYLRKAII-------ADFDGICSTQFLV----LQPKDVLPELLQGWL 128 F ++ + + D+ + F + Q + + + Sbjct: 242 QFK--DVILTISSETPDEVAMSSVINWKNDYKNVAFNSFCILVRFNQLEKYDVNYIGYFF 299 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGN-IPMPIPPLAEQVLIREKIIAETVRI 181 S + + +G + + + + + + P + EQ I + I Sbjct: 300 RSNSFRTQAMLMAQGISRFNINQTALKKTLFLFPPNIYEQQKIGNVLYYLDSLI 353 >gi|227550180|ref|ZP_03980229.1| possible type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecium TX1330] gi|257897502|ref|ZP_05677155.1| type I restriction-modification system specificity subunit [Enterococcus faecium Com12] gi|227180696|gb|EEI61668.1| possible type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecium TX1330] gi|257834067|gb|EEV60488.1| type I restriction-modification system specificity subunit [Enterococcus faecium Com12] Length = 209 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 69/212 (32%), Gaps = 12/212 (5%) Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + L P K + + G + V ++ T + + Sbjct: 1 MQKLFPKNGSKFPQLRFAGFA--DAWEQRKLGEVADIIGGGTPSTNVSEYWNGDIDWYSP 58 Query: 268 LETRNMGLKPESYETY-----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 +E N ES + Q + + + +A + + G + + Sbjct: 59 VEIGNQIYIDESQKKITGLGLQKSSAHILPVGTVLFTSRAGIGNTAILAKEGCTNQGFQS 118 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + P+ R+ +L + G+G + + + ++P+ VP I+EQ I Sbjct: 119 IVPYKDLLNSYFIFSRTSELKRYGEINGAGSTFIEVSGKQMAKMPISVPSIEEQQKIGTF 178 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++D + ++ + L+E + ++ Sbjct: 179 ----FKQLDDTITLHQRKLEKLQELKKGYLQK 206 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 67/191 (35%), Gaps = 12/191 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLP---KDGNSRQSDTST 77 W+ + + G T + + G VE G Y+ K S+ Sbjct: 24 WEQRKLGEVADIIGGGTPSTNVSEYWNGDIDWYSPVEIGNQIYIDESQKKITGLGLQKSS 83 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 I G +L+ AI+A G + F + P L + + ++ + Sbjct: 84 AHILPVGTVLFTSRAGIGNTAILAKE-GCTNQGFQSIVPYKDLLNSYFIFSRTSELKRYG 142 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E G+T K + +P+ +P + EQ I ++D IT R +E L+E Sbjct: 143 EINGAGSTFIEVSGKQMAKMPISVPSIEEQQKIGTF----FKQLDDTITLHQRKLEKLQE 198 Query: 198 KKQALVSYIVT 208 K+ + + Sbjct: 199 LKKGYLQKMFC 209 >gi|323380338|gb|ADX52606.1| restriction modification system DNA specificity domain protein [Escherichia coli KO11] Length = 388 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 61/383 (15%), Positives = 118/383 (30%), Gaps = 24/383 (6%) Query: 28 VPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 V + ++ G + +S + +I + D E + + Sbjct: 2 VKLGEIFTISRGGSPRPIQDYITDSCNGVNWIMIGDTEPNSKYIRHTAKKIKFEGVKKSR 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRI 137 G +L + R I+ D +G +LVL PK+ + +L S I Sbjct: 62 KVYPGDLLLTNSMSFGRPYIL-DVEGCIHDGWLVLSPKNNQIHIDYFYHYLNSPTAKIII 120 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 GA + + + + N+ +P PP AEQV I + + Sbjct: 121 SNKAAGAVVKNLNSDIVRNLEIPFPPFAEQVRIASTLDKADGIRQKREQAIKLADDF--- 177 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 L + + +P K ++ + H + N + L ++ Sbjct: 178 ----LRATFLEMFGDPVQNPKGWNVKPLADQIIHANNGISRRRKEDTNEGDIVLRLQDVH 233 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 G K R + E D + + R+ +E Sbjct: 234 Y--SGITFDKELNRIKLVDKEKQIARVEYDDLLFIRVNGNPNYVGRTAVFKSYIEPVYHN 291 Query: 318 SAYMAVK-PHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + +K + S +L +L+ S K+ S + ++ + + +L PPI+ Sbjct: 292 DHLIRIKLDNEYQSDFLCYLINSPFSRKLIAQQIKTSAGQHTISQDGILKLMFYRPPIEL 351 Query: 375 QFDITNVINVETARIDVLVEKIE 397 Q N I + I +K E Sbjct: 352 QEKFIN-IKNKIESIFYRKDKHE 373 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 59/159 (37%), Gaps = 8/159 (5%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 ++ + I+ + + +K E + + V PG+++ Sbjct: 22 YITDSCNGVNWIMIGDTEPNSKYIRHTAKKIKFEGVKKSRKVYPGDLLLTNSMSFGRPYI 81 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVK 363 L + G + ++ K + I Y + S + +G ++L + V+ Sbjct: 82 LDVEGCIHDGWL---VLSPKNNQIHIDYFYHYLNSPTAKIIISNKAAGAVVKNLNSDIVR 138 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 L + PP EQ I + ++ + D + +K EQ+I L Sbjct: 139 NLEIPFPPFAEQVRIASTLD----KADGIRQKREQAIKL 173 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 50/195 (25%), Gaps = 14/195 (7%) Query: 22 PKHWKVVPIKR-FTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK W V P+ N G + +D I + L+DV + + + D Sbjct: 193 PKGWNVKPLADQIIHANNGISRRRKEDTNEGDIVLRLQDVHYSGITFDKELNRIKLVDKE 252 Query: 77 TVS-IFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQGWLL 129 +L+ ++ + + ++ + +L+ Sbjct: 253 KQIARVEYDDLLFIRVNGNPNYVGRTAVFKSYIEPVYHNDHLIRIKLDNEYQSDFLCYLI 312 Query: 130 SIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + I A GI + PP+ Q Sbjct: 313 NSPFSRKLIAQQIKTSAGQHTISQDGILKLMFYRPPIELQEKFINIKNKIESIFYRKDKH 372 Query: 188 RIRFIELLKEKKQAL 202 F + + ++ Sbjct: 373 EDLFASISNKLIHSI 387 >gi|307264086|ref|ZP_07545683.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|306870564|gb|EFN02311.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 414 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 44/427 (10%), Positives = 112/427 (26%), Gaps = 72/427 (16%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 V ++ L GR +I ++ + L + +G+ Sbjct: 2 WVRLEDIFHLQAGR---------FISASEIYGEYKESLYPCYGGNGLRGFVKTYNREGKF 52 Query: 87 -LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + G+ G A+ + +V++ L + + + Sbjct: 53 PIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYF---LIQLNLNQYATATA 109 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QA 201 I ++ +P+PPL EQ I KI I+ + + L ++ ++ Sbjct: 110 QPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQFPEQLKKS 169 Query: 202 LVSYIVTKGLNPDVKM-------------------------------------------- 217 ++ + L Sbjct: 170 ILQAAIQGKLTEQNPNDEPASALIERIKAEKLRLIAEKKLKKPKVISEIIMRDNLPYEIV 229 Query: 218 ----KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE---SNILSLSYGNIIQKLET 270 + E +P+ W + + + L GNI Sbjct: 230 NGKERCIADEVPFEIPESWVWVRLGEIGETNIGLTYNPSDVASDGTIVLRSGNIQDGKID 289 Query: 271 --RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + +++ + K+ + A ++++ + Sbjct: 290 VSSDIVKVNLDIPENKRCYKNDLLICARN--GSKKLVGKAAIIDKDGYSFGAFMTIFRSP 347 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + Y+ + + S F + + + ++ + +P + EQ I I + Sbjct: 348 FNKYIYYYLSSPLFRNDFDGINTTTINQITQSNLNNRLIPLPSLNEQLRIVEKIETLFST 407 Query: 389 IDVLVEK 395 + L +K Sbjct: 408 LQNLSQK 414 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 46/131 (35%), Gaps = 9/131 (6%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 F I Q + + A + D+ + + + +L + + Sbjct: 52 FPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYFLIQLNLNQY---ATAT 108 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-----KERR 407 + L + + + +PP+ EQ I I I+ + E+ + L ++ + Sbjct: 109 AQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQY-AEKEEKLTALHQQFPEQLK 167 Query: 408 SSFIAAAVTGQ 418 S + AA+ G+ Sbjct: 168 KSILQAAIQGK 178 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 56/171 (32%), Gaps = 10/171 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP+ W V + + N G T I + +++ G + D D Sbjct: 243 EIPESWVWVRLGEIGETNIGLTYNPSDVASDGTIVLRSGNIQDGKID-VSSDIVKVNLDI 301 Query: 76 STVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 K +L KA I D DG S + + + + +L S Sbjct: 302 PENKRCYKNDLLICARNGSKKLVGKAAIIDKDGY-SFGAFMTIFRSPFNKYIYYYLSSPL 360 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + I T++ + N +P+P L EQ+ I EKI + Sbjct: 361 FRNDFDGINT-TTINQITQSNLNNRLIPLPSLNEQLRIVEKIETLFSTLQN 410 >gi|300870776|ref|YP_003785647.1| putative restriction endonuclease type I S subunit [Brachyspira pilosicoli 95/1000] gi|300688475|gb|ADK31146.1| putative restriction endonuclease type I, S subunit [Brachyspira pilosicoli 95/1000] Length = 386 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 52/401 (12%), Positives = 115/401 (28%), Gaps = 43/401 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + + G +D++ + G + Sbjct: 13 PDGVEYKKLINVCDIKRGERITK---------KDIKENEMFPIISGGQFPMGMYDKFNR- 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I G + L L PK L + + + Sbjct: 63 DENTITIASYGS-AGYVDYQTKKFWANDVCLCLYPKIKLLNK-FLYYYLKFKQDFLYSKT 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 A +H I + +P+PP+ Q I + T + E ++ Sbjct: 121 TNAIPNHIPTDIIKELLIPLPPIEIQKEIVGILDTFTK--------YQDLLNRELELRKK 172 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 Y K L + ++ +E + + + K K + S I ++ Sbjct: 173 QYEYYNNKLLTFNDNVEYKTLEELCD-------------IVDYRGKTPKKVNSGIFLITA 219 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEI--VFRFIDLQNDKRSLRSAQVMERGIITSA 319 NI + + Y + + + + + E + Sbjct: 220 KNIRKGYIDYEKSKEYVDINDYPNIMHRGLPQIGDVLITTEAPLGYVAQIDRENVALAQR 279 Query: 320 YMAVKPHGID---STYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + +P S YL +++ + K+ G + +K + +L + VPP++EQ Sbjct: 280 VIKYRPKDKSLLSSYYLKYILLGKEFQDKLLINATGGTVKGIKGSKLHKLTIPVPPLEEQ 339 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 I N+++ A + + + I + K+ R + Sbjct: 340 ERIVNILDKFDALCNDITRGLPAEIEMRKKQYEYYRDKLLT 380 >gi|160886161|ref|ZP_02067164.1| hypothetical protein BACOVA_04168 [Bacteroides ovatus ATCC 8483] gi|156108046|gb|EDO09791.1| hypothetical protein BACOVA_04168 [Bacteroides ovatus ATCC 8483] Length = 383 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 48/387 (12%), Positives = 120/387 (31%), Gaps = 44/387 (11%) Query: 30 IKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +K F ++TG T D ++ ED+++ ++ +T I Sbjct: 18 LKSFADVSTGGTPSKANLEYWNGDKPWVSAEDMKNKY--VYDTCEKVTEAGYATCKIIPV 75 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++Y G I + + + D + + + + I+ + G Sbjct: 76 DTLMYVCRGSI-GVMAINKIECATNQSICRAKCHDNVCNVEFLYHALMYQKDNIKKMGTG 134 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + + +PP EQ+ + K K Sbjct: 135 TSFKSLNQTSFSELKIELPPYNEQMKFVSI-----------------AQQADKSKFGDFK 177 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + NP + + ++ +G + K + + + Y Sbjct: 178 SQFIEMFGNPLSLNQKNELKRLGEC--CILNPRRPNIALCDTDKVSFIPMPAVSEDGYLV 235 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYM 321 + E + + + + +++F I ++N K ++ G+ ++ + Sbjct: 236 DMTDEEYGKVK------KGFTYFENNDVLFAKITPCMENGKGAIVHGLTNGIGMGSTEFH 289 Query: 322 AVKPHG--IDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 ++P +L L R + G+G ++ + + V +P ++EQ Sbjct: 290 VLRPINGISSPYWLLALTRMPIFRERAAKNMSGTGGQKRVSASYLDHFMVGLPAMEEQRR 349 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLK 404 + D I++++V L Sbjct: 350 F----EAIYRQADKSKSVIQKALVYLN 372 >gi|322387160|ref|ZP_08060770.1| type I restriction-modification system [Streptococcus infantis ATCC 700779] gi|321141689|gb|EFX37184.1| type I restriction-modification system [Streptococcus infantis ATCC 700779] Length = 521 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 62/441 (14%), Positives = 127/441 (28%), Gaps = 68/441 (15%) Query: 21 IPKHWKVVPIKRFT-----KLNTGR----------TSESGKDIIYIGLEDVESGTGKYLP 65 IPK W +V + + G + + T Y Sbjct: 82 IPKGWAIVYLPDICALEDGSIKRGPFGSSITKSMFVPKGEHTYKVYEQGNAIRKTIDYGD 141 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPEL 123 G IL G I ++G+ + L L + + Sbjct: 142 YWLKESDYIRLKNFSIKAGDILISCAGTIGEIFQIPSNYYNGVINQALLKLTLNSDIIDS 201 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGI--GNIPMPIPPLAEQVLIREKIIAETVRI 181 + + ++ G+ + + +PM +PPLAEQ I E I + ++ Sbjct: 202 QYFKWMFTSLINTLKEHSIGSAIKNLASIKFLKYEVPMLLPPLAEQQRIVEVIESALEKV 261 Query: 182 DTLITERIRFIELLK---------------------------------EKKQALVSYIVT 208 D + +L K EK +A + Sbjct: 262 DEYAESYNQLQKLDKIFPDKLKKSILQYAMQGKLVEQDPNDEPVEVLLEKIRAEKQKLFE 321 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 +G +K S + G +++ P +++ L+ + ++I S + Sbjct: 322 EGKIKKKDLKISIVSQ-GDDNSYYKQLPRNWMLSTLDSVSNLYTGNSINSTEKKKYFSGV 380 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL---------------RSAQVMER 313 + N + I I L K S + + + Sbjct: 381 DGINYIATKDVNFDNTINYDNGIRIPDNYLSKFKISYFNSVLLCLEGGSAGRKIGLLKQD 440 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + + ++ +L + ++S F SG+ + ++ + + V P Sbjct: 441 VCFGNKLCNLSFYYGENKFLYYFLQSPQFLSDFQKNKSGIIGGVSKNNLGNILIPVLPRN 500 Query: 374 EQFDITNVINVETARIDVLVE 394 EQ IT I++ ++ L E Sbjct: 501 EQMRITQGIDLLFQKVSQLSE 521 Score = 67.1 bits (162), Expect = 6e-09, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 67/176 (38%), Gaps = 12/176 (6%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 E GN I+K ES + G+I+ + + S Sbjct: 121 EHTYKVYEQGNAIRKTIDYGDYWLKESDYIRLKNFSIKAGDILISCAGTIGEIFQIPSN- 179 Query: 310 VMERGIITSAYMAV--KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK-RLP 366 G+I A + + IDS Y W+ S +++GS ++ + +K +P Sbjct: 180 -YYNGVINQALLKLTLNSDIIDSQYFKWMFTSLINTLKEHSIGSAIKNLASIKFLKYEVP 238 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIE--QSIVLL--KERRSSFIAAAVTGQ 418 +L+PP+ EQ I VI ++D E Q + + + + S + A+ G+ Sbjct: 239 MLLPPLAEQQRIVEVIESALEKVDEYAESYNQLQKLDKIFPDKLKKSILQYAMQGK 294 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 57/174 (32%), Gaps = 11/174 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTGKYLPKDGNS 70 +P++W + + + L TG + S + I YI +DV Sbjct: 346 QLPRNWMLSTLDSVSNLYTGNSINSTEKKKYFSGVDGINYIATKDVNFDNTINYDNGIRI 405 Query: 71 RQSDTSTVSIFAKGQILYG-KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + S I +L + G RK + D + L + L +L Sbjct: 406 PDNYLSKFKISYFNSVLLCLEGGSAGRKIGLLKQDVCFGNKLCNLSFYYGENKFLYYFLQ 465 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 S + G + +GNI +P+ P EQ+ I + I ++ Sbjct: 466 SPQFLSDFQKNKSG-IIGGVSKNNLGNILIPVLPRNEQMRITQGIDLLFQKVSQ 518 >gi|300821374|ref|ZP_07101522.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 119-7] gi|331680404|ref|ZP_08381063.1| type I restriction enzyme EcoR124II specificity protein (S protein)(S.EcoR124II) [Escherichia coli H591] gi|300526263|gb|EFK47332.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 119-7] gi|331071867|gb|EGI43203.1| type I restriction enzyme EcoR124II specificity protein (S protein)(S.EcoR124II) [Escherichia coli H591] Length = 388 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 63/394 (15%), Positives = 116/394 (29%), Gaps = 54/394 (13%) Query: 26 KVVPIKRFTKLNTGRTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + +T + + ++G++++ + G + + + G Sbjct: 17 EWKAVGDIAGYSTTKVDADKLDATSFVGVDNLLADKGGRIDATYQPNTARLTAY---EPG 73 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQ-----FLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 IL G + PYL+K +A+ +G CS L K + PE L L S Sbjct: 74 DILLGNIRPYLKKVWMAENNGGCSGDVLAIRILADCKKIISPEYLYYALSSDSFFSYSMQ 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFI 192 +GA M I N +PIP LA Q I + T L E Sbjct: 134 HAKGAKMPRGSKDAILNYQIPIPCPSAPEKSLAIQSEIVRILDKFTALTAELTAELNMRK 193 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + + L++ +G +M+ I Sbjct: 194 KQYNYYRDQLLN---LEGRENTREMRIGDI----------------YDFQYGTGNTIPKS 234 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I+ + N P V I + Q Sbjct: 235 GGQYPVYGSNGIVGSHDKYNSEDSP--------------VIGHIGA--YAGIVNWGQGKH 278 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 K + Y +L+ L S + + + + VLVPP+ Sbjct: 279 FVTYNGVICRHKSKEVLQKYAYYLLL---LQDFGSKSNSASQPFVSYNILNAPIVLVPPL 335 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +EQ I +++ + + E + + I L +++ Sbjct: 336 QEQARIVEILDKFDTLTNSITEGLPREIELRQKQ 369 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 19/136 (13%), Positives = 40/136 (29%), Gaps = 9/136 (6%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERG-IITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 +PG+I+ I K + G ++ +A I YL + + S Sbjct: 70 YEPGDILLGNIRPYLKKVWMAENNGGCSGDVLAIRILADCKKIISPEYLYYALSSDSFFS 129 Query: 345 VFYAMGSGL-RQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVETARIDVLVEKI 396 G + + + +P + Q +I +++ TA L ++ Sbjct: 130 YSMQHAKGAKMPRGSKDAILNYQIPIPCPSAPEKSLAIQSEIVRILDKFTALTAELTAEL 189 Query: 397 EQSIVLLKERRSSFIA 412 R + Sbjct: 190 NMRKKQYNYYRDQLLN 205 >gi|189467610|ref|ZP_03016395.1| hypothetical protein BACINT_04000 [Bacteroides intestinalis DSM 17393] gi|189435874|gb|EDV04859.1| hypothetical protein BACINT_04000 [Bacteroides intestinalis DSM 17393] Length = 389 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 55/417 (13%), Positives = 132/417 (31%), Gaps = 52/417 (12%) Query: 23 KHWKVVPIKRFTKL---NTGRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQS-DTST 77 + WK + G+T + I I + V++G + + + + D Sbjct: 2 EQWKQDRLIDILDTLIDYRGKTPNKVERGIPLITAKIVKNGRIETPTEFLPAEEYRDWMV 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQ 135 G ++ P A + D + + + L+ K+ L+ +L+S Sbjct: 62 RGYPQVGDVVLTTEAPLGEVAQLKDDKIALAQRIVCLRGKEDALDNTYLKYFLMSNIGQY 121 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R++A G T++ + + + P Q I + + + I R + L Sbjct: 122 RLKARETGTTVTGIKQSELKEVLIDYPNYELQQKIASILSSLDSK----IELNRRINDNL 177 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 +++ QA ++ ++ + N + E+N K ++ Sbjct: 178 EQQAQAWLNELLDRYANSTTV--------------------LIHEIAEINPKRNLSKGTS 217 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL---QNDKRSLRSAQVME 312 + N+ N G + Y G+ + I + E Sbjct: 218 AKCIEMANLPTTGSFPN-GWIEKEYNGGMKFCNGDTLIARITPCLENGKTAFINFLDKNE 276 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLV 369 ++ Y+ + S+ + + R++D GS RQ + + + + + V Sbjct: 277 IAYGSTEYIVISAKSNYSSSFFYFLARNHDFVDYAVKNMNGSSGRQRVSGDTISKYRIPV 336 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR-----SSFIAAAVTGQIDL 421 P + + T + + L+ R + + ++G++ + Sbjct: 337 IPRE-------KLESFTNHAE--IALKTIKNNSLQNMRLSMTRDALLPKLMSGELKV 384 >gi|315453693|ref|YP_004073963.1| Type II restriction-modification enzyme [Helicobacter felis ATCC 49179] gi|315132745|emb|CBY83373.1| Type II restriction-modification enzyme [Helicobacter felis ATCC 49179] Length = 1627 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 58/420 (13%), Positives = 129/420 (30%), Gaps = 44/420 (10%) Query: 27 VVPIKRFTKLNTGRTSESGKD------IIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVS 79 +V ++ K G T I ++ + D E + + S V Sbjct: 958 IVKLETCGKFLMGGTPSRKNPQYWNGTIKWLTIGDYAEYQSITDTKERITEAGLQASNVK 1017 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-SIDVTQRIE 138 + KG ++ + + + I + + + + + P +++ E Sbjct: 1018 LVPKGAVVVS-IYATIGRVGILEGEMTTNQAIVSIIPNQDFRARYLMYVIGYYKFQLLDE 1076 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLI-REKIIAETVRIDTLITERIRF------ 191 I + IP P + EQ++ K+ T + I Sbjct: 1077 VITTSQKNINLGILQNMRIPKPPLQVQEQIITECAKVEKRTQELQEGIQSYQNLILAVLG 1136 Query: 192 -----------IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 K A + + +K+ + D WE+ + Sbjct: 1137 VCGVAKDPKTPPIEQILKTLATLKLELEPTDPKLEALKNLVQDLPNPPADGWEMAKLGDI 1196 Query: 241 VTELNR------KNTKLIESNILSLSYGNIIQKLE---TRNMGLKPESYETYQIVDPGEI 291 K +++ + I+ + + V P ++ Sbjct: 1197 CDFRRGPFGGSLKKEIFVKNGYKVYEQQHAIKNDFEIGNYFITQEKFDSMKSFEVIPNDL 1256 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAM 349 + + S ++GII A + ++ +S L ++ + + A Sbjct: 1257 IVSCSGTIGKIAIVPSNA--KQGIINQALLRLRLKNGRTNSKTLKIILDNLNNPFNERAH 1314 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI---NVETARIDVLVEKIE-QSIVLLKE 405 G L+ E +K++ + +PP++ Q I +V+ E AR+D + +E + +LKE Sbjct: 1315 GVALKNVANIEVLKQIQIPLPPLEAQEQIMSVLTQIEQEIARLDDEIASLEGKEQEILKE 1374 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 62/442 (14%), Positives = 139/442 (31%), Gaps = 72/442 (16%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLE-------DVESGTGKYLPKDGNSRQSDTS 76 W++ + G S K I++ + + D+ Sbjct: 1187 GWEMAKLGDICDFRRGPFGGSLKKEIFVKNGYKVYEQQHAIKNDFEIGNYFITQEKFDSM 1246 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ G + AI+ GI + L L+ K+ ++ ++ Sbjct: 1247 KSFEVIPNDLIVSCSGTIGKIAIVPSNAKQGIINQALLRLRLKNGRTNSKTLKIILDNLN 1306 Query: 135 QRIEAICEGATMSHA-DWKGIGNIPMPIPPLAEQVLIREKIIAETVRID----------- 182 G + + + + + I +P+PPL Q I + I Sbjct: 1307 NPFNERAHGVALKNVANIEVLKQIQIPLPPLEAQEQIMSVLTQIEQEIARLDDEIASLEG 1366 Query: 183 ------------------------TLITERIRFIELLKEKKQALVSYIVTK-GLNPDVKM 217 +I E++ + LKE AL+++ + K GL P + Sbjct: 1367 KEQEILKEFLRLSRERERERSQKPKVILEKLERAKALKESYLALLTHALEKAGLKPTL-- 1424 Query: 218 KDSGIEWVGLVP-DHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYGNIIQKLETR 271 + ++ + P W++ A+ +RK + + L +S + ++ T Sbjct: 1425 -ATLLDNLPTPPASGWDLVKLGAVCQILIGGTPSRKKPEYFKGTHLWVSIAEMDGQVITN 1483 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS- 330 + V ++ + L + K S+ + + + T+ +A + Sbjct: 1484 TKEKITDEAIKVSNVK---LIPKGTTLLSFKLSIGKVALAGKDLYTNEAIAGLIPKDNQV 1540 Query: 331 -TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK-RLPVLVPPIKEQFDITNVINVETAR 388 + + + + +SL + + + + +PP++EQ I +V Sbjct: 1541 LDRFLFALFKGGAINLDLKGNNAFGKSLNSQTLNDEVKIPLPPLQEQEQIVDV------- 1593 Query: 389 IDVLVEKIEQSIVLLKERRSSF 410 + KIEQ L+ S Sbjct: 1594 ----IAKIEQERTALENAMKSL 1611 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 59/171 (34%), Gaps = 13/171 (7%) Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFRF 295 + +RKN + I L+ G+ + + + ++V G +V Sbjct: 969 MGGTPSRKNPQYWNGTIKWLTIGDYAEYQSITDTKERITEAGLQASNVKLVPKGAVVVSI 1028 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 L A +++ P+ + Y ++ + + ++ Sbjct: 1029 YATIGRVGIL-----EGEMTTNQAIVSIIPNQDFRARYLMYVIGYYKFQLLDEVITTSQK 1083 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 ++ ++ + + PP++ Q I E A+++ +++++ I + Sbjct: 1084 NINLGILQNMRIPKPPLQVQEQII----TECAKVEKRTQELQEGIQSYQNL 1130 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 59/189 (31%), Gaps = 8/189 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDI------IYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W +V + ++ G T K +++ + +++ + S Sbjct: 1437 SGWDLVKLGAVCQILIGGTPSRKKPEYFKGTHLWVSIAEMDGQVITNTKEKITDEAIKVS 1496 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 V + KG L + K +A D + L PKD + L Sbjct: 1497 NVKLIPKGTTLLS-FKLSIGKVALAGKDLYTNEAIAGLIPKDNQVLDRFLFALFKGGAIN 1555 Query: 137 IEAICEGATMSHADWKGIGN-IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ A + + + + + +P+PPL EQ I + I L Sbjct: 1556 LDLKGNNAFGKSLNSQTLNDEVKIPLPPLQEQEQIVDVIAKIEQERTALENAMKSLKGQQ 1615 Query: 196 KEKKQALVS 204 + + ++ Sbjct: 1616 EATLKKYLN 1624 >gi|90411352|ref|ZP_01219364.1| Restriction modification system, type I [Photobacterium profundum 3TCK] gi|90327881|gb|EAS44212.1| Restriction modification system, type I [Photobacterium profundum 3TCK] Length = 418 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 56/388 (14%), Positives = 123/388 (31%), Gaps = 19/388 (4%) Query: 45 GKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFAKGQILYGKLGPYLRK----AI 99 + + I E + S + ++ KG I++ + G Sbjct: 33 DEGVRVIPAEAIFSDGLNPTTFNHITLEKAQDLKRYRLQKGDIVFARRGAQACGRSALVG 92 Query: 100 IADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGN 156 + I T + VL+ V+PE L + S ++ GATM + + + + Sbjct: 93 DKEEGSIAGTGLIYLRVLKKDLVVPEYLHLAVSSAKSLAWLKTHAIGATMPNLNNSVLCS 152 Query: 157 IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 +P+ +P +QV + I I + + L+E QA+ K Sbjct: 153 LPLNLPSYEKQVEVVN----GYYPIIKKIRVNTKLNQTLEEITQAIFKSWFVDFDPVKAK 208 Query: 217 MKDSGIEWVGLVPDHWEVKPFFA-LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 M +E + + + S+ + ++ G I K + Sbjct: 209 MNGEQLEGMDEATASLFPEKLVESEFGVIPEGWEVKAFSDWVKITKGKNITKKTIVEGDV 268 Query: 276 --KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + + I + + + I S + +L Sbjct: 269 PVVAGGLKPAYFHNTHNVEGPAITISASGANAGFINLYYESIWASDSSYISKAATPLFFL 328 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++ ++ K++ + + D +RL V+VP + + V V Sbjct: 329 QYVALKFNQKKIYDMQTGAAQPHIYPRDFERLMVVVPSDELCQK----LEEIFTSFFVTV 384 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +Q + L + R + + ++G+I+L Sbjct: 385 SNYKQQNIELSKLRDTLLPKLLSGEIEL 412 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 50/186 (26%), Gaps = 13/186 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G IP+ W+V + K+ G+ + G G + + Sbjct: 235 GVIPEGWEVKAFSDWVKITKGKNITKKTIV-----------EGDVPVVAGGLKPAYFHNT 283 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I G + S + K P ++ ++I Sbjct: 284 HNVEGPAITISASGANAGFINLYYESIWASDSSYI--SKAATPLFFLQYVALKFNQKKIY 341 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + GA H + + + +P + E + V + + I +L Sbjct: 342 DMQTGAAQPHIYPRDFERLMVVVPSDELCQKLEEIFTSFFVTVSNYKQQNIELSKLRDTL 401 Query: 199 KQALVS 204 L+S Sbjct: 402 LPKLLS 407 >gi|310778706|ref|YP_003967039.1| restriction modification system DNA specificity domain protein [Ilyobacter polytropus DSM 2926] gi|309748029|gb|ADO82691.1| restriction modification system DNA specificity domain protein [Ilyobacter polytropus DSM 2926] Length = 500 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 65/195 (33%), Gaps = 3/195 (1%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 P W P V +R + + G L Y Sbjct: 1 MNDNKPIEWIKVPLIECVGIYDRYRKPISKIERDKRVSGKNCNDLFKYYGATGLAGYIDD 60 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +++ ++ + ++ + + + ++ +L + ++ Sbjct: 61 YLLEGEYVILGEDGAPFLDSLKSKSYLVSGKFWVNNHAHILKSYFNNKFLLHYLNQFNYK 120 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 R L +K++P++VPP+ EQ ++ + I + +D +E ++++ L Sbjct: 121 NYV---SGTTRLKLNQTSMKKIPIIVPPLAEQEEVVSRIESLFSELDNGIENLKRAQKQL 177 Query: 404 KERRSSFIAAAVTGQ 418 K R S + A G+ Sbjct: 178 KLYRQSILRDAFEGK 192 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 55/465 (11%), Positives = 127/465 (27%), Gaps = 70/465 (15%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P W VP+ + K + G + + + Sbjct: 6 PIEWIKVPLIECVGIYDRYRKPISKIERDKRVSGKNCN--DLFKYYGATGLAGYIDDYLL 63 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 ++ G+ G ++ + + ++ + +LL + Sbjct: 64 EGEYVILGEDGAPFLDSLKSKSYLVSGKFWVNNHAHILKSYFNNKFLLHYLNQFNYKNYV 123 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV-----------------RIDTL 184 G T + + IP+ +PPLAEQ + +I + Sbjct: 124 SGTTRLKLNQTSMKKIPIIVPPLAEQEEVVSRIESLFSELDNGIENLKRAQKQLKLYRQS 183 Query: 185 ITERIRFIELLKEKKQA--------------LVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 I +L +E +++ + V N + K+ EW Sbjct: 184 ILRDAFEGKLTEEWRRSNPDKVEDPEVLVEKIKEARVEYYENQLEEWKERVEEWKSRGEV 243 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNII------------------------- 265 + L N +++GN Sbjct: 244 GKKPSRPSKLKEFTLSTNKMKNIQGWTWMAFGNTFTESPQNGIYKPANLYGEGTKIIRID 303 Query: 266 --------QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 K +++ L E E Y++ + ++ R + + + E + Sbjct: 304 NFYDGVINSKKTFKSLKLTEEEVEKYKLTNNNILINRVNSIDYLGKCGLCQNIDESTVFE 363 Query: 318 SAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 S M + + S ++ + S + S+ DV + + + Sbjct: 364 SNIMKITVDNKNIVSKFITLYLTSRIGISELRKNAKHAVNQASINQTDVSNVLAPICSLD 423 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 EQ ++ ++ + + + L + S+ + R S + A +G+ Sbjct: 424 EQNEVIKIVEEKLSICENLERTLRSSLKRSELLRQSILNKAFSGK 468 >gi|94266803|ref|ZP_01290467.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] gi|93452525|gb|EAT03114.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] Length = 603 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 51/191 (26%), Gaps = 12/191 (6%) Query: 229 PDHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 P W + T R T+ + I G ++ + E Sbjct: 102 PAGWAYCRLNEIGTWGSGATPKRGITEYYDGGIPWFKSGELVGDFISSAEETITERALKE 161 Query: 284 QIVD---PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 V PG+++ K S+ A A P Sbjct: 162 TSVRLNLPGDVLIAMYGATIGKASILKC----HATTNQAVCACTPFSGILNTYLLNFLKA 217 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 G + ++ E + + +PP+ EQ I ++ A D L ++ + Sbjct: 218 SKRHFTSMGAGGAQPNISKEKIIAVVFPLPPLAEQHRIVEKVDELMALCDRLEQQTSDQL 277 Query: 401 VLLKERRSSFI 411 + + + Sbjct: 278 AAHETLVETLL 288 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 57/190 (30%), Gaps = 7/190 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W + +G T + I + ++ + R Sbjct: 101 LPAGWAYCRLNEIGTWGSGATPKRGITEYYDGGIPWFKSGELVGDFISSAEETITERALK 160 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++V + G +L G + KA I + P + Sbjct: 161 ETSVRLNLPGDVLIAMYGATIGKASILKCHATTNQAVCACTPFSGILN-TYLLNFLKASK 219 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + ++ G + + I + P+PPLAEQ I EK+ D L + + Sbjct: 220 RHFTSMGAGGAQPNISKEKIIAVVFPLPPLAEQHRIVEKVDELMALCDRLEQQTSDQLAA 279 Query: 195 LKEKKQALVS 204 + + L+ Sbjct: 280 HETLVETLLD 289 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 56/194 (28%), Gaps = 15/194 (7%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K S E +PD WE + + + + S +++ Sbjct: 375 KISEEEKPFTLPDGWEWCRLGEIANQSEAGWSPKCDDVPKSGKEWGVLKVSAVTWGKFLS 434 Query: 278 ESYET---------YQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYMAVKPH 326 + + V P + + + + V I++ + ++ Sbjct: 435 DENKRLPQHLEPRRKHEVKPNDFLISRANTAELVARSVVVPEDVPSHLIMSDKIIRIEFS 494 Query: 327 GIDSTYLAWLMR-SYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + L S + + G +++ E ++ L V +PP EQ I + Sbjct: 495 PLVFPGYINLFNASSVARAYYARVAGGTSSSMKNVSREQIQALCVPLPPYPEQLRILRKM 554 Query: 383 NVETARIDVLVEKI 396 + + L + Sbjct: 555 DKVVHLCEQLKAHL 568 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/236 (14%), Positives = 75/236 (31%), Gaps = 18/236 (7%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT-------GRTSESGKDIIYIGL 53 +K K P K S + +P W+ + + +SGK+ + + Sbjct: 367 IKKTKPLP--KISEEEKPFTLPDGWEWCRLGEIANQSEAGWSPKCDDVPKSGKEWGVLKV 424 Query: 54 EDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADF---DGIC 107 V G + + L + R ++ + I Sbjct: 425 SAVTWGKFLSDENKRLPQHLEPRRKHEVKPNDFLISRANTAELVARSVVVPEDVPSHLIM 484 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT---MSHADWKGIGNIPMPIPPL 164 S + + ++ ++ + V + A G T M + + I + +P+PP Sbjct: 485 SDKIIRIEFSPLVFPGYINLFNASSVARAYYARVAGGTSSSMKNVSREQIQALCVPLPPY 544 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 EQ+ I K+ + L R + + +A+ + +T L D + + Sbjct: 545 PEQLRILRKMDKVVHLCEQLKAHLGRASQTRQRFAEAVANNTITSCLARDSLFRQT 600 >gi|160889099|ref|ZP_02070102.1| hypothetical protein BACUNI_01520 [Bacteroides uniformis ATCC 8492] gi|156861566|gb|EDO54997.1| hypothetical protein BACUNI_01520 [Bacteroides uniformis ATCC 8492] Length = 385 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 66/390 (16%), Positives = 133/390 (34%), Gaps = 39/390 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WK IK + SGK I + L+ +ES TG+ + K ++ S S F Sbjct: 16 KGWKTAKIKDVAPEMPSKEQLSGK-IWLLNLDMIESNTGRIIEKVYEDVENALSVQS-FD 73 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIEAI 140 +G +L+ KL PYL K +I D G+ +T+ + L+P+ + L I Sbjct: 74 EGNVLFSKLRPYLNKVVIPDEPGMATTELVPLRPEPSKLHKVFLSHLLRGNQFVNYANDI 133 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G M + N +PP+ +Q+ Sbjct: 134 AGGTKMPRMPLTELRNFDCILPPMDKQLEFVFIAEQVDKSKFGD---------------- 177 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 S + NP + + ++ +G + K + + + Sbjct: 178 -FKSQFIEMFGNPLSLNQKNELKRLGEC--CILNPRRPNIALCDTDKVSFIPMPAVSEDG 234 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITS 318 Y + E + + + + +++F I ++N K ++ G+ ++ Sbjct: 235 YLVDMTDEEYGKVK------KGFTYFENNDVLFAKITPCMENGKGAIVHGLTNGIGMGST 288 Query: 319 AYMAVKPHG--IDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + ++P +L L R + G+G ++ + + V +P ++E Sbjct: 289 EFHVLRPINGISSPYWLLALTRMPIFRERAAKNMSGTGGQKRVSASYLDHFMVGLPAMEE 348 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLK 404 Q + D I++++V L Sbjct: 349 QRRF----EAIYRQADKSKSVIQKALVYLN 374 >gi|237807925|ref|YP_002892365.1| restriction modification system DNA specificity domain-containing protein [Tolumonas auensis DSM 9187] gi|237500186|gb|ACQ92779.1| restriction modification system DNA specificity domain protein [Tolumonas auensis DSM 9187] Length = 371 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 51/403 (12%), Positives = 116/403 (28%), Gaps = 43/403 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W +V + + +T + + +G Y + + Sbjct: 2 SWPIVKLHDICRPRQRKTIAASSLLDSGYSVYGANGKIGYYSEFTHEFP----------- 50 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ G I++ + + + D L + +L + + E + G Sbjct: 51 -TLMITCRGATCGNVHISEPRSYINGNAMAIDDIDPL-IVDLKYLYYFFLKRGFEDVISG 108 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + +G+ + +P+PPL EQ I + E L Sbjct: 109 SAQPQITGQGLTKVEIPLPPLEEQKRIAAILDKADAIRQKRQQAIELADEF-------LR 161 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + +P +K +E + + K+ I + GN Sbjct: 162 SVFLDMFGDPVTNLKGWEVESL---------SSLIHVQGGYAFKSADFGTEGIPVVKIGN 212 Query: 264 IIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQN---DKRSLRSAQVMERGII 316 +K + + G+++ + + + Sbjct: 213 ANKKGFTAESIDFVQPTHPEKLKQYELFSGDLLMSLTGTVGKDDYGNITEVTEEYNKYYL 272 Query: 317 TSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIK 373 +++K I+ YL + + + G+RQ ++ D+ +L V VP + Sbjct: 273 NQRVAKISIKSKKINKEYLKYCLSHQAMKNELIKNNRGVRQANISNSDIYQLVVPVPELN 332 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +Q + I+ ++E V E +S A + Sbjct: 333 DQ----DFFCDIVKNIEKQKNRLEGFYVESNELFASLSAELFS 371 >gi|329963226|ref|ZP_08300963.1| type I restriction modification DNA specificity domain protein [Bacteroides fluxus YIT 12057] gi|328528922|gb|EGF55862.1| type I restriction modification DNA specificity domain protein [Bacteroides fluxus YIT 12057] Length = 407 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 62/404 (15%), Positives = 132/404 (32%), Gaps = 42/404 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTS-TV 78 W P+ F G ++ G+ +I + D+ + Y + D Sbjct: 25 EWSSQPLTDFMNFKNGLNPDAKRFGRGTKFISVMDILNNQYICYDNIRASVELQDGDIET 84 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQGWL-LSID 132 G IL+ + L A+ I + + K L +L S Sbjct: 85 YGVDYGDILFQRSSETLEDVGRANVYLDSKPAIFGGFVIRGKSKGNYNPLFFRYLLASPT 144 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +RI GA + G+ + + IP L+EQ I + + RI T Sbjct: 145 ARKRIIVKGAGAQHFNIGQDGLSKVSLDIPRLSEQEKIGKLLQCVDARIATQ-------- 196 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + EK Q+L+ + ++ + +K + T Sbjct: 197 NKIIEKLQSLIKGL----IDDIITLKCGQLVAF-----ETLYSKAGEGGTPTTSNTEFYD 247 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 +I + ++ K + N E + ++ I++ + Sbjct: 248 NGSIPFIKIDDLRNKYLSANKDYITELGLKKSSAWLIPTHSIIYSNGATIGAISINKYPV 307 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVL 368 ++GI+ V ID +L + M+S K + + G ++ +D+ + Sbjct: 308 CTKQGILG----IVPNTNIDVEFLYYFMQSSYFQKEVERVVTEGTMKTAYLKDINHIKCP 363 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQS-IVLLKERRSSFI 411 +P + Q +I++ ++V L E +E+ + + ++ + Sbjct: 364 IPDLDRQKEISHFLSVL-----SLKEDVERQLLQKYQIQKQYLL 402 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 71/207 (34%), Gaps = 8/207 (3%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P+++ + EW + S + L+ I + Sbjct: 15 PNLRFPEFSGEWSSQPLTDFMNFKNGLNPDAKRFGRGTKFISVMDILNNQYICYDNIRAS 74 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDS 330 + L+ ETY VD G+I+F+ + + + + I ++ + Sbjct: 75 VELQDGDIETYG-VDYGDILFQRSSETLEDVGRANVYLDSKPAIFGGFVIRGKSKGNYNP 133 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + +L+ S K G+G + ++ + + ++ + +P + EQ I ++ AR Sbjct: 134 LFFRYLLASPTARKRIIVKGAGAQHFNIGQDGLSKVSLDIPRLSEQEKIGKLLQCVDAR- 192 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + I L+ I +T Sbjct: 193 ---IATQNKIIEKLQSLIKGLIDDIIT 216 >gi|153955213|ref|YP_001395978.1| Type I restriction enzyme, specificity subunit [Clostridium kluyveri DSM 555] gi|146348071|gb|EDK34607.1| Type I restriction enzyme, specificity subunit [Clostridium kluyveri DSM 555] Length = 382 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 49/387 (12%), Positives = 111/387 (28%), Gaps = 44/387 (11%) Query: 25 WKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ +K G + ++ + D+ + + Sbjct: 20 WEQRKLKDVAYYIRGSFPQPYTNPDFYDEENGKPFVQVADIGFDLRLNPDTKAHISKIAE 79 Query: 76 STVSIFAKGQILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 G+I+ G + + I +D L+ + + + Sbjct: 80 PKSRFVEAGKIVVALQGSIEKSIGRTAITQYDAYFDRTILIFEEYKFPIDKQYFAQVIKK 139 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + GAT+S + + + + +P + EQ I IT R + Sbjct: 140 LFEIEKERAWGATISTITKEHLNDFIIGVPKIEEQNKIGLFFRNLDNL----ITLHQRKL 195 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 LK++K+ L+ + K +++ G D WE + +V + ++ K + Sbjct: 196 NHLKDEKKGLLQKMFPKKGENFPELRFPG------FTDPWEQRKLKNIVDVKSGRDYKHL 249 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + + K + I + + + Sbjct: 250 SEGKIPVYGTGGYMLSVNEALSYKED----------------AIGIGRKGTIDKPYILRA 293 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 P ++ + + + K S SL + + VL+P Sbjct: 294 PFWTVDTLFYAVPENNNNLNFVYDI--FQNIKWKQKDESTGVPSLSKTAINNVDVLIPDY 351 Query: 373 KEQFDITNVINVETARIDVLVEKIEQS 399 KEQ I + ID L+ ++ Sbjct: 352 KEQKQIGDF----FQDIDNLITLHQRE 374 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 20/207 (9%), Positives = 56/207 (27%), Gaps = 11/207 (5%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 +P + K + ++ F T + + + + + G ++ Sbjct: 13 FPGFTDPWEQRKLKDV-------AYYIRGSFPQPYTNPDFYDEENGKPFVQVADIGFDLR 65 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + V+ G+IV + + + + + Sbjct: 66 LNPDTKAHISKIAEPKSRFVEAGKIVVALQGSIEKSIGRTAITQYDAYFDRTILIFEEYK 125 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + ++ E + + VP I+EQ I Sbjct: 126 FPIDKQYFAQVIKKLFEIEKERAWGATISTITKEHLNDFIIGVPKIEEQNKIGLF----F 181 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ ++ + LK+ + + Sbjct: 182 RNLDNLITLHQRKLNHLKDEKKGLLQK 208 >gi|254787776|ref|YP_003075205.1| type I restriction-modification system specificity subunit [Teredinibacter turnerae T7901] gi|237684679|gb|ACR11943.1| putative type I restriction-modification system specificity subunit [Teredinibacter turnerae T7901] Length = 401 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 61/414 (14%), Positives = 129/414 (31%), Gaps = 30/414 (7%) Query: 24 HWKVVPIKRF-TKLNTGRT---SESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTV 78 W +K F + G +S ++G+++V E+G+ S + Sbjct: 2 SWITASLKDFACTVFDGPHATPKDSESGHSFLGIKNVSENGSLDLSDPKFISDEEFPKWT 61 Query: 79 SIFA--KGQILYGKLGPYLRKAIIADFDGICST---QFLVLQPKDVLPELLQGWLLSIDV 133 K +++ R AII + C + + ++P L + LS Sbjct: 62 RRVKPKKNDVVFSYEATLHRYAIIPEGFDGCLGRRMGLVRVDEGKLVPRYLLYYFLSPLW 121 Query: 134 TQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + GAT++ K + + P L Q I E + + I+ Sbjct: 122 RAYADTKVIIGATVNRLPIKDFPDFQISAPDLHHQQRIVEILASYDDLIENNRRRIQLLE 181 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E + Q ++ G ++K G + K T+ Sbjct: 182 ESARLLYQEWFVHLRFPG---HEQVKTIDGVPEGWDKTTADKVMDVLSGGTPKTKVTEFW 238 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + I + + + S ++ + S Sbjct: 239 DGEIPFFTPKDAKGLFTYNTEKTITDLGLSKCNGRLYPKYTVFITARGT----VGKLSFA 294 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVL 368 + S Y V I +L +++ + F A SG ++ + K +P L Sbjct: 295 QRPMAMNQSCYALVTKGEISQEFLYSSLKAS--IEQFKARASGAVFDAIVVDTFKNIPFL 352 Query: 369 VPPIKEQFDITNVINVETARIDVL-VEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 VP + + T + ++ID L ++ + L + R + ++G++ + Sbjct: 353 VPSSSLRDEFTEQVKDVFSQIDNLSIQNM-----KLAQARDLLLPKLMSGELTV 401 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 50/169 (29%), Gaps = 8/169 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W + + +G T ++ +I + +D + K Sbjct: 209 VPEGWDKTTADKVMDVLSGGTPKTKVTEFWDGEIPFFTPKDAKGLFTYNTEKTITDLGLS 268 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ K + G + + + + L K + + Sbjct: 269 KCNGRLYPKYTVFITARGTVGKLSFAQRPMAM-NQSCYALVTKGEI-SQEFLYSSLKASI 326 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ +A GA NIP +P + + E++ +ID Sbjct: 327 EQFKARASGAVFDAIVVDTFKNIPFLVPSSSLRDEFTEQVKDVFSQIDN 375 >gi|148988251|ref|ZP_01819714.1| phosphoglycerate kinase [Streptococcus pneumoniae SP6-BS73] gi|147926715|gb|EDK77788.1| phosphoglycerate kinase [Streptococcus pneumoniae SP6-BS73] Length = 522 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 69/441 (15%), Positives = 132/441 (29%), Gaps = 71/441 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPLAEQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKG--------------LNPDVKMKDSGIEW-------- 224 +L KE ++++ Y + L K E Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 225 ------------------------VGLVPDHWEVKPFFALVTELNRKNTK-----LIESN 255 + +P+ W F +LV K + Sbjct: 323 DISIVSQGDDNSYYGNKDETTSYPIYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTE 382 Query: 256 ILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I +S ++ N + + I G ++ F L Sbjct: 383 IPWVSISDMPISGYVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATH 442 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 II+ + I YL + G ++L + L + + Sbjct: 443 NEAIIS-IFPYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISN 499 Query: 372 IKEQFDITNVINVETARIDVL 392 +E I +++ ++ L Sbjct: 500 HEEMKRIIFKVDLLFQKVSQL 520 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 347 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 406 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 407 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 464 >gi|257880783|ref|ZP_05660436.1| type I restriction-modification system DNA specificity subunit [Enterococcus faecium 1,230,933] gi|257815011|gb|EEV43769.1| type I restriction-modification system DNA specificity subunit [Enterococcus faecium 1,230,933] Length = 217 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 67/192 (34%), Gaps = 11/192 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET-RNMGLKPESYE 281 E++G + E ++ + ++ G I K E+ + + + Sbjct: 31 EFIGEDVSDGDWIQ-----KEHIHESGEYRIVQTGNIGIGRYIDKPESAKYLNQESFDEL 85 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 ++PG+I+ + + + + + S +L M S + Sbjct: 86 KANEINPGDILISRLADPAGRALILPFTSSKMVTAVDVAIIRPNKNFISHFLVTRMNSSE 145 Query: 342 LCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 SG + L ++++++ + VP I+EQ I ++D + EQ + Sbjct: 146 TLNDISKQVSGTSHKRLSRKNLEKIELNVPNIEEQEKIG----QLFKKLDEAIAGHEQKL 201 Query: 401 VLLKERRSSFIA 412 +E + + + Sbjct: 202 ATYQELKKALLQ 213 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 67/200 (33%), Gaps = 22/200 (11%) Query: 24 HWKVVPIKRFT-------KLNTGRTSESGKDIIYIGLEDVESGTGKYLPK-----DGNSR 71 W++ +K F + + ++ G G+Y+ K N Sbjct: 23 DWELKELKEFIGEDVSDGDWIQKEHIHESGEYRIVQTGNI--GIGRYIDKPESAKYLNQE 80 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADF----DGICSTQFLVLQPKDVLPELLQGW 127 D + G IL +L +A+I F ++ K+ + L Sbjct: 81 SFDELKANEINPGDILISRLADPAGRALILPFTSSKMVTAVDVAIIRPNKNFISHFLVTR 140 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + S + I G + K + I + +P + EQ EKI ++D I Sbjct: 141 MNSSETLNDISKQVSGTSHKRLSRKNLEKIELNVPNIEEQ----EKIGQLFKKLDEAIAG 196 Query: 188 RIRFIELLKEKKQALVSYIV 207 + + +E K+AL+ + Sbjct: 197 HEQKLATYQELKKALLQRMF 216 >gi|219855644|ref|YP_002472766.1| hypothetical protein CKR_2301 [Clostridium kluyveri NBRC 12016] gi|219569368|dbj|BAH07352.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 390 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 49/387 (12%), Positives = 111/387 (28%), Gaps = 44/387 (11%) Query: 25 WKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ +K G + ++ + D+ + + Sbjct: 28 WEQRKLKDVAYYIRGSFPQPYTNPDFYDEENGKPFVQVADIGFDLRLNPDTKAHISKIAE 87 Query: 76 STVSIFAKGQILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 G+I+ G + + I +D L+ + + + Sbjct: 88 PKSRFVEAGKIVVALQGSIEKSIGRTAITQYDAYFDRTILIFEEYKFPIDKQYFAQVIKK 147 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + GAT+S + + + + +P + EQ I IT R + Sbjct: 148 LFEIEKERAWGATISTITKEHLNDFIIGVPKIEEQNKIGLFFRNLDNL----ITLHQRKL 203 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 LK++K+ L+ + K +++ G D WE + +V + ++ K + Sbjct: 204 NHLKDEKKGLLQKMFPKKGENFPELRFPG------FTDPWEQRKLKNIVDVKSGRDYKHL 257 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + + K + I + + + Sbjct: 258 SEGKIPVYGTGGYMLSVNEALSYKED----------------AIGIGRKGTIDKPYILRA 301 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 P ++ + + + K S SL + + VL+P Sbjct: 302 PFWTVDTLFYAVPENNNNLNFVYDI--FQNIKWKQKDESTGVPSLSKTAINNVDVLIPDY 359 Query: 373 KEQFDITNVINVETARIDVLVEKIEQS 399 KEQ I + ID L+ ++ Sbjct: 360 KEQKQIGDF----FQDIDNLITLHQRE 382 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 20/207 (9%), Positives = 56/207 (27%), Gaps = 11/207 (5%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 +P + K + ++ F T + + + + + G ++ Sbjct: 21 FPGFTDPWEQRKLKDV-------AYYIRGSFPQPYTNPDFYDEENGKPFVQVADIGFDLR 73 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + V+ G+IV + + + + + Sbjct: 74 LNPDTKAHISKIAEPKSRFVEAGKIVVALQGSIEKSIGRTAITQYDAYFDRTILIFEEYK 133 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + ++ E + + VP I+EQ I Sbjct: 134 FPIDKQYFAQVIKKLFEIEKERAWGATISTITKEHLNDFIIGVPKIEEQNKIGLF----F 189 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ ++ + LK+ + + Sbjct: 190 RNLDNLITLHQRKLNHLKDEKKGLLQK 216 >gi|126090307|ref|YP_001041762.1| restriction modification system DNA specificity subunit [Shewanella baltica OS155] gi|125999938|gb|ABN64007.1| restriction modification system DNA specificity domain [Shewanella baltica OS155] Length = 349 Score = 83.7 bits (205), Expect = 6e-14, Method: Composition-based stats. Identities = 47/369 (12%), Positives = 115/369 (31%), Gaps = 33/369 (8%) Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ 110 I E + T ++ + +QSD +G I+ + G A+I + Sbjct: 5 IRPEQINRKTEIFINDEFYHKQSD----KWLREGDIVMVQSGHVGHTAVIPPELNNIAAH 60 Query: 111 FLVLQ--PKDVLPELLQGWLLSIDVTQRIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQ 167 L++ PK+ + + + + + I G T+ H + + M EQ Sbjct: 61 ALIMFTDPKEEVSPYFLNFQFQTENIKTKLSEITTGNTIKHILSSEMKDFEMFFTDFEEQ 120 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I I+ + + + + + + + P I + G Sbjct: 121 TAIGNTFQKLDSLINQHQQKHDKL---------SNIKKAMLEKMFPKQGETIPEIRFKGF 171 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 + WE K ++ ++ S + + + +N + P + + Sbjct: 172 SGE-WEEKELGSVTQITMGQSPSGENYTNNSNDFILVQGNADLKNGFVVPRVWTSEVTKT 230 Query: 288 P--GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 G ++F + V+ RG+ + ++ ++ Sbjct: 231 ATQGALIFSVRAPVGEVGKTNYDVVLGRGVAA---------INANEFIFQQLKKLKSDNY 281 Query: 346 FYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 ++ ++ ++ + V EQ I N ++D L+ + +Q I L Sbjct: 282 WHKVSAGSTFDAISSTELDSTLIWVSSDSEQTAIGNY----FQKLDTLINQHQQQITKLN 337 Query: 405 ERRSSFIAA 413 + + ++ Sbjct: 338 NIKQACLSK 346 >gi|333030655|ref|ZP_08458716.1| restriction modification system DNA specificity domain protein [Bacteroides coprosuis DSM 18011] gi|332741252|gb|EGJ71734.1| restriction modification system DNA specificity domain protein [Bacteroides coprosuis DSM 18011] Length = 386 Score = 83.7 bits (205), Expect = 6e-14, Method: Composition-based stats. Identities = 49/404 (12%), Positives = 112/404 (27%), Gaps = 53/404 (13%) Query: 26 KVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + I + G T I + +ED+ G+ L S Sbjct: 14 EWKTIDTLFNIKNGYTPSKKQKEFWTNGTIPWFRMEDIRI-NGRILRDSIQHVSSSAIRG 72 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 ++ ++ A+I + ++ L K L + Sbjct: 73 NLIPANSLIMSTTATLGEHALILEPFLTNQQITSFSLKEPYKGKLNVKFLFYYFFHFGEW 132 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITER 188 I + +++ + +PIP LA Q I + + L E Sbjct: 133 CINNANKNGSLAIIGVNKLKKYKIPIPCPNNPEKSLAIQQKIVGILDTFSELTAELTAEL 192 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + ++ L++ ++ +EW KP + K Sbjct: 193 TARKKQYEYYREQLLT------------FEEDEVEW----------KPLGEVAELKRGKT 230 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 ++ N I G + +Y + GE + L Sbjct: 231 ----------ITAKNKIDGDIPVISGGQQPAYYNAKFNRKGETITIAGSGAYAGHVLYW- 279 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 E ++ A+ I +T + ++ + +D+++L + Sbjct: 280 --DEPIFVSDAFSIKPNISILNTKYVFYFLMKYQNWIYGLKKGVGVPHVYPKDLEKLFIP 337 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P +K Q I +++ + L ++ + R ++ Sbjct: 338 IPTLKIQQKIVGILDTFSELTAELTAELTARKKQYEYYRDLLLS 381 >gi|238809803|dbj|BAH69593.1| hypothetical protein [Mycoplasma fermentans PG18] Length = 393 Score = 83.7 bits (205), Expect = 6e-14, Method: Composition-based stats. Identities = 54/405 (13%), Positives = 116/405 (28%), Gaps = 42/405 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P ++ V ++ ++ + I + + GK+ N Q D IF Sbjct: 13 PDGYEWVKLEDAVEIFDNKR---------IPIAQNKRIKGKFPYYGANGIQ-DYVNDFIF 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 IL G+ G + P V + E Sbjct: 63 DGEYILIGEDGSVIDGL---------------NHPILNYATGKFWVNNHSHVIKAKEEFL 107 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR-IDTLITERIRFIELLKEKKQ 200 I +I PP + + +I + I I E + +L+ + + Sbjct: 108 NRFIYHFLSILDISDIVRGTPPKMTKGNLLTILIPKIPLKIQEKIVEILERFRILEAELK 167 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A + V K + ++ + N +S Sbjct: 168 AELE--VRGKQFDFWINKLLNFTNFDKNNSKELQSIGCFISGLRSKNKDSFVNGNQRYVS 225 Query: 261 YGNIIQKLETR---NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER---- 313 Y ++ E N +K E ++ G+++F D+ S ++ Sbjct: 226 YLDVFNNKEINYLPNNFVKIFDDENQNDLNYGDVIFCGSSENFDETGYASVYTIKNDEKV 285 Query: 314 --GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP 370 + + + + + D + +G R +L E + ++ + +P Sbjct: 286 YLNSFSFIFRFKDNNLFLPKFSKYFFNCKDFRDLLLKCINGVTRFNLSKEKMSKIKIPIP 345 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVL----LKERRSSFI 411 PI+ Q I ++++ + + + I L K R + Sbjct: 346 PIETQNKIVSILDKLSEYSQEINSGLPAEIELRSKQFKYYRDQLL 390 >gi|146294609|ref|YP_001185033.1| restriction modification system DNA specificity subunit [Shewanella putrefaciens CN-32] gi|145566299|gb|ABP77234.1| restriction modification system DNA specificity domain [Shewanella putrefaciens CN-32] Length = 420 Score = 83.7 bits (205), Expect = 6e-14, Method: Composition-based stats. Identities = 53/423 (12%), Positives = 113/423 (26%), Gaps = 35/423 (8%) Query: 27 VVPIKRFTKLNTGR--TSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF-- 81 + I ++ G T + + ++ + ++ G + S Sbjct: 6 ITTIGGIAEIYDGPHATPKKLEQGPYFLSISSLDKGRLELNKSAFLSEDDFKKWTKRVTP 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQ----FLVLQPKDVLPELLQGWLLSIDVTQRI 137 +G +L+ A++ C + + + K L +L Sbjct: 66 QEGDLLFSYETRLGEAALMPAGVRACLGRRMGLLRLNKAKVTPEYALYAYLSPAFQQTIK 125 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 GAT+ + N P+ IPP+ EQ + + + +I+ + K Sbjct: 126 ANTLTGATVDRIALNDLPNFPIRIPPIEEQKKVAKLLSGIDKKIELNNHINAELEAMAKT 185 Query: 198 KKQALVSYIVTKGLNPDV---KMKDSGIEWVG------LVPDHWEVKPFFALVTELNRKN 248 +P K SG + V +P+ W VK + + Sbjct: 186 LYDYWFVQFDFPDDSPQRKGKPYKSSGGKMVYNPILKREIPEGWGVKKLSEIAMTGSGGT 245 Query: 249 ------TKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQ 299 I ++ G + + E + ++ I+ Sbjct: 246 PLSSNPEFYENGTIPWINSGELNSQFIVSTSNFITELGLEKSSAKLCPKNTILMAMYGAT 305 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 K S A + P + + R +L Sbjct: 306 AGKVSFIDF----PATTNQAICTINPFDQEMNVYLKFTLERLYQYLINLSSGSARDNLSQ 361 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + +K L V++P + T L+ + L R + + GQ+ Sbjct: 362 DKIKSLDVVIPAPSALTQF----HEFTKSKMELILTNLKENQELTSLRDWLLPMLMNGQV 417 Query: 420 DLR 422 ++ Sbjct: 418 TVK 420 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 58/195 (29%), Gaps = 8/195 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQ 72 IP+ W V + +G T S I +I ++ S Sbjct: 224 EIPEGWGVKKLSEIAMTGSGGTPLSSNPEFYENGTIPWINSGELNSQFIVSTSNFITELG 283 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + S+ + K IL G K DF + + P D + + L Sbjct: 284 LEKSSAKLCPKNTILMAMYGATAGKVSFIDFPATTNQAICTINPFDQEMNVYLKFTLERL 343 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G+ + I ++ + IP + E ++ I T + E Sbjct: 344 YQ-YLINLSSGSARDNLSQDKIKSLDVVIPAPSALTQFHEFTKSKMELILTNLKENQELT 402 Query: 193 ELLKEKKQALVSYIV 207 L L++ V Sbjct: 403 SLRDWLLPMLMNGQV 417 >gi|90579611|ref|ZP_01235420.1| putative type I restriction-modification system specificity subunit [Vibrio angustum S14] gi|90439185|gb|EAS64367.1| putative type I restriction-modification system specificity subunit [Vibrio angustum S14] Length = 363 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 55/402 (13%), Positives = 116/402 (28%), Gaps = 48/402 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W +V + G + Y+ L + Y + ++ST Sbjct: 2 SWPIVELGSVVSFVGGSQPPKSTFKFEPEDDYVRLLQIR----DYKSDKNLTFIPESSTK 57 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID--VTQR 136 K ++ G+ GP + + + +G + + P + + + + L D Sbjct: 58 KFCKKDDVMIGRYGPPVFQI-LRGLEGAYNVALMKAVPSEKVDKDYLYYFLKQDKLFRLI 116 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 A S D + + PM +PPL EQ I + Sbjct: 117 DSLSQRTAGQSGIDMDALKSYPMLLPPLEEQKRIAAILDKAD------------------ 158 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 A+ D ++ ++ G +P + P + + + Sbjct: 159 ----AIRQKRKQAIELADEFLRSVFLDMFGDIPAGFSKYPLV-GCRGSVKAASGKSSKGV 213 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 +S S +I G E+ + +V G + + V + I+ Sbjct: 214 ISDSSTDIPIYGGNGINGYATEALYSKPVVIVGRVGQQCGITTLTDG---PCWVTDNAIV 270 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 D+ YLA ++ L + + + + +PPI EQ Sbjct: 271 ---LEITDLKKYDAAYLAHALKHSPLRDSVKRLD---LPFVNQSMILDYKIPLPPISEQK 324 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ + ++ I + ++ A +GQ Sbjct: 325 KFGSIRKNLLKHL----SLQQKGIGISEDNFQVLSQQAFSGQ 362 >gi|319775913|ref|YP_004138401.1| putative type I restriction-modification system [Haemophilus influenzae F3047] gi|317450504|emb|CBY86721.1| Putative type I restriction-modification system [Haemophilus influenzae F3047] Length = 418 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 45/357 (12%), Positives = 118/357 (33%), Gaps = 16/357 (4%) Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 V+ G K L + + + V + + + + Sbjct: 59 VDGGNVKLLTTNESDIWTTEELVQNNISEGEIIAIPWGGNPIVQYYKGKFVTADNRIATS 118 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + + + + I + G+ + H + + +PIPPL+ Q I + + Sbjct: 119 NNTKILDNKFLYYFLLSKLDVISSFYRGSGIKHPSMYHVLEMLIPIPPLSVQTEIVKILD 178 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 T L +E + L +++ + +++ +E G V ++ Sbjct: 179 TLTELTSELTSELTSELILRQKQYEYYREKLLSF----------DSLELSGGVVQWIKLI 228 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR----NMGLKPESYETYQIVDPGEI 291 L+ + E+ + ++ YG I T + PE + + G++ Sbjct: 229 DLGELIRGNGLQKKDFTETGVPAIHYGQIYTYYGTFATKTKSFVSPELAKKLKKAKYGDV 288 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMG 350 + + + A +P+ ++ YL +++++ K Sbjct: 289 LIAGTSENLKDVMKPLGWLGSEIAFSGDMFAFRPNKRVNTKYLTYILQTERFYKFKEKYA 348 Query: 351 SGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 G + +K ++ + +P +EQ I ++++ + + E + +I ++R Sbjct: 349 QGTKVIRVKADNFLNYEIPLPTFEEQHRIVSILDKFETLTNSITEGLPLAIEQRQKR 405 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 59/185 (31%), Gaps = 7/185 (3%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 F V + + + S I+ + + T + + I Sbjct: 28 WDKRFNAVEKEKQPKVIKYHYYLASELKPLIVDGGNVKLLTTNESDIWTTEELVQNNISE 87 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGS 351 I + + + +A + D+ +L + + S + GS Sbjct: 88 GEIIAIPWGGNPIVQYYKGKFVTADNRIATSNNTKILDNKFLYYFLLSKLDVISSFYRGS 147 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RR 407 G+ + V + + +PP+ Q +I +++ T L ++ ++L ++ R Sbjct: 148 GI-KHPSMYHVLEMLIPIPPLSVQTEIVKILDTLTELTSELTSELTSELILRQKQYEYYR 206 Query: 408 SSFIA 412 ++ Sbjct: 207 EKLLS 211 >gi|15611482|ref|NP_223133.1| type I restriction enzyme specificity subunit [Helicobacter pylori J99] gi|4154940|gb|AAD05986.1| TYPE I RESTRICTION ENZYME (SPECIFICITY SUBUNIT) [Helicobacter pylori J99] Length = 409 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 62/403 (15%), Positives = 114/403 (28%), Gaps = 26/403 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ---S 73 W+ +K K+ G T + I +I +D+ + G+Y+ K S Sbjct: 2 SEWQTFCLKDLGKIVGGATPPTNNPKNYGNKIAWITPKDLSTLQGRYIKKGSRSISRLGF 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K IL+ P IA+ + F + P + + L Sbjct: 62 KSCSCVLLPKHAILFSSRAPI-GYVAIAEKRLCTNQGFKSIIPNKKI-YFEFLYYLLKYY 119 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFI 192 I I G T +G + IPP EQ I + +I+ Sbjct: 120 KDNISNIGGGTTFKEVSGATLGLFQVKIPPTYYEQQKIAHTLSILDQKIENNHKINELLH 179 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKM----KDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 ++L+ + N K + + + + + + Sbjct: 180 KILELLYEQYFVRFDFLDENNKPYQTSGGKMKFSKELNRLIPNDFKVKTLGELITWISGS 239 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LR 306 +I G I +N +Y TY + + D+ DK Sbjct: 240 QPPKSCHIYEYKEGYI---RFIQNRDYSSNNYVTYIPISKNNKICYQYDIMMDKYGEAGS 296 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRL 365 ++ + + Y+ + S + K + R SL + L Sbjct: 297 VRFGLQGAYNVALSKISVLNQSMQEYIRSYLNSKPIKKYLSNACMASTRASLNENHIYSL 356 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +PPI I + K QS L R Sbjct: 357 MLPIPPINLLQK----YEKIAKNIITAIIKNNQSTQTLTALRD 395 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 51/190 (26%), Gaps = 2/190 (1%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVS 79 IP +KV + +G I + Y + + + Sbjct: 220 IPNDFKVKTLGELITWISGSQPPKSCHIYEYKEGYIRFIQNRDYSSNNYVTYIPISKNNK 279 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I + I+ K G + + E ++ +L S + + + Sbjct: 280 ICYQYDIMMDKYGE-AGSVRFGLQGAYNVALSKISVLNQSMQEYIRSYLNSKPIKKYLSN 338 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 C +T + + I ++ +PIPP+ + I L Sbjct: 339 ACMASTRASLNENHIYSLMLPIPPINLLQKYEKIAKNIITAIIKNNQSTQTLTALRDFLL 398 Query: 200 QALVSYIVTK 209 L+ V Sbjct: 399 PLLLKQQVKP 408 >gi|254447694|ref|ZP_05061160.1| type I restriction-modification system S subunit [gamma proteobacterium HTCC5015] gi|198263037|gb|EDY87316.1| type I restriction-modification system S subunit [gamma proteobacterium HTCC5015] Length = 448 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 48/410 (11%), Positives = 113/410 (27%), Gaps = 52/410 (12%) Query: 40 RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI 99 RT + + + +G G + K + + + I Y Sbjct: 24 RTPSAIDTYAFDCEAVLLAGNGDFNLKYYKGKFNAYQRTYVIEP--IQISLKFLYYLTVS 81 Query: 100 IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI-------EAICEGATMSHADWK 152 + + + K + +L ++ RI A+C+ +D Sbjct: 82 QIERITENNRGSAIRYLKLNDILMPFVYLPPVEEQHRIVQKVDELMALCDRLEQQTSDQL 141 Query: 153 GIG-NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 + + Q ++ R+ + + + KQA++ V L Sbjct: 142 EAHETLVDTLLGTLTQSENATELADNWARLAAHFDTLFTTEQSIDKLKQAILQLAVMGRL 201 Query: 212 NPDVKMKDSGIEWVGL-------------------------------VPDHWEVKPFFAL 240 + +E + P W + Sbjct: 202 VEQEAADEPAVEVIKRAQERKSQLLSEKLIKKQKELPDITEPEKPFSTPPLWAYARLDDI 261 Query: 241 VTELNR-----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE------TYQIVDPG 289 TE+ K+ I L NI + + + + + PG Sbjct: 262 CTEVTSGSTPPKSEFSEAFGIPYLKVYNIRSQRVDFDYKPQYVTENYHRTTLKRSQLLPG 321 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++V + K ++ E + + ++ +++ K + Sbjct: 322 DVVMNIVGPPLGKTAIIPDDHPEWNCNQAIVRFRPIEIELNQFIHLYLKAGIFLKTIELI 381 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 G+ + ++ + + + +PP EQ I ++ A D L E++ Q+ Sbjct: 382 GTAGQDNISVTKSRSIVIPLPPKAEQQRIVQKVDELMALCDQLKERLNQA 431 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 67/196 (34%), Gaps = 12/196 (6%) Query: 25 WKVVPIKRFT-KLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS-- 76 W + ++ +G T + I Y+ + ++ S + K ++ Sbjct: 253 WAYARLDDICTEVTSGSTPPKSEFSEAFGIPYLKVYNIRSQRVDFDYKPQYVTENYHRTT 312 Query: 77 -TVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S G ++ +GP L K I + C+ + +P ++ L Sbjct: 313 LKRSQLLPGDVVMNIVGPPLGKTAIIPDDHPEWNCNQAIVRFRPIEIELNQFIHLYLKAG 372 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + A + +I +P+PP AEQ I +K+ D L + Sbjct: 373 IFLKTIELIGTAGQDNISVTKSRSIVIPLPPKAEQQRIVQKVDELMALCDQLKERLNQAC 432 Query: 193 ELLKEKKQALVSYIVT 208 + + +A+V + Sbjct: 433 KTRCQLAEAVVENALN 448 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 13/82 (15%), Positives = 31/82 (37%) Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 S + + + ++ + LK D+ V +PP++EQ I ++ A Sbjct: 71 SLKFLYYLTVSQIERITENNRGSAIRYLKLNDILMPFVYLPPVEEQHRIVQKVDELMALC 130 Query: 390 DVLVEKIEQSIVLLKERRSSFI 411 D L ++ + + + + Sbjct: 131 DRLEQQTSDQLEAHETLVDTLL 152 >gi|293388422|ref|ZP_06632930.1| restriction endonuclease S subunit [Enterococcus faecalis S613] gi|312908542|ref|ZP_07767486.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis DAPTO 512] gi|312908988|ref|ZP_07767850.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis DAPTO 516] gi|291082197|gb|EFE19160.1| restriction endonuclease S subunit [Enterococcus faecalis S613] gi|310625509|gb|EFQ08792.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis DAPTO 512] gi|311290688|gb|EFQ69244.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis DAPTO 516] Length = 398 Score = 83.3 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 63/404 (15%), Positives = 137/404 (33%), Gaps = 32/404 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESG------TGKYLPKDGNSRQS 73 +W++ +++ T +G T + +V+ + NS Sbjct: 8 NWELCKLEKLTDFFSGLTYSPDNVQKDGTFVLRSSNVKDNAIISADNVYVRNEVANSEHV 67 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 V + + G + A I + + P+ L L + Sbjct: 68 QVGDVIVVVRN----GSRSLIGKHAPINREMPNTVIGAFMTGLRSPSPKFLNTLLDTQQF 123 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I GAT++ + +P ++ EKI + ++D +IT R ++ Sbjct: 124 NVEIHKNL-GATINQITTGEFKRMHFIVPTDEDEK---EKIGSLFRQLDDIITLHQRKLD 179 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LKE K+A + + K++ + E G T+ ++ + Sbjct: 180 QLKELKKAYLQVMFPAKDERVPKLRFADFE--GEWEQCKLGNILTERNTQQSKSKEYPLV 237 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 S + ++ E + +S + Y++ + +IV+ +L K + + Sbjct: 238 SFTVEDGVTPKTERYEREQLVRGDKSSKKYKVTELNDIVYNPANL---KFGAIARNHYGK 294 Query: 314 GIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVL 368 + + Y+ + S+Y+ + D G RQS+ E++ + L Sbjct: 295 AVFSPIYITFIVNDKLACSSYVEVFITRKDFISYSLKYQQGTVYERQSVSPENLLNMKFL 354 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +P KEQ I + ++D ++ I LK + S++ Sbjct: 355 LPNTKEQEFIGHF----FEKLDCNSNFHKKKITQLKNLKKSYLQ 394 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 59/191 (30%), Gaps = 8/191 (4%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET--RNMGLKPESYET 282 + +++ + L + + L N+ N+ ++ E + Sbjct: 5 FNYNWELCKLEKLTDFFSGLTYSPDNVQKDGTFVLRSSNVKDNAIISADNVYVRNEVANS 64 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + IV ++ + A+M +L L+ + Sbjct: 65 EHVQVGDVIVVVRNGSRSLIGKHAPINREMPNTVIGAFMTGLRSP-SPKFLNTLLDTQQF 123 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + + + KR+ +VP E+ I + ++D ++ ++ + Sbjct: 124 NVEIHKNLGATINQITTGEFKRMHFIVPTDEDEKEKIGS----LFRQLDDIITLHQRKLD 179 Query: 402 LLKERRSSFIA 412 LKE + +++ Sbjct: 180 QLKELKKAYLQ 190 >gi|192361761|ref|YP_001984091.1| type I restriction system specificity protein [Cellvibrio japonicus Ueda107] gi|190687926|gb|ACE85604.1| type I restriction system specificity protein [Cellvibrio japonicus Ueda107] Length = 398 Score = 83.3 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 52/415 (12%), Positives = 128/415 (30%), Gaps = 35/415 (8%) Query: 23 KHWKVVPIKR-----FTKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNS 70 W+ + + L TG S I ++D+ + Sbjct: 2 SEWREFTLGKLIDDGIADLQTGPFGTMLKASEYSDVGTPVIAVQDIGENRLIHNKFVYVE 61 Query: 71 RQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGW 127 + T + +G I++G+ G R+A I + + + + L+ + + + Sbjct: 62 QNIVTRLSRYKVKEGDIIFGRKGAVERRARIRKDEDGWLQGSDCIRLRFNSRINSIFISY 121 Query: 128 LL-SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S + + GATM + + +P+ +PP+ EQ I + + + +ID L Sbjct: 122 QFGSKSYREWMIQNSTGATMPSLNQSVLKLLPIRLPPIEEQKAIADILSSFDDKIDLLHR 181 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + + + + ++ + G P ++ Sbjct: 182 QNKTLESMAETLFRQWFVEDAQEDWEEKGLLELVDLVGGG--------TPKTSINEYWCG 233 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 L +I + G I + +N+ + +++ V L Sbjct: 234 DIPWLSGGDIATHHKGFISR--SEKNITQIGLENSSAKLLTKLATVISARGTVGKHCLLA 291 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 S + + + P + Y +L+ + + ++ A + ++ + Sbjct: 292 S-----EMTFSQSNYGILPKIKNCYYFTYLLIGHIVEELQSAAYGSVFDTITTATFRDAT 346 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P + I +Q I +L++ R + + G+I + Sbjct: 347 FKTPSEEL---IFAF-EEVVKGYFEKKLFNQQQIHILEKIRDGLLPKLMNGEITV 397 >gi|85711748|ref|ZP_01042804.1| type I restriction-modification system, S subunit, EcoA family protein [Idiomarina baltica OS145] gi|85694363|gb|EAQ32305.1| type I restriction-modification system, S subunit, EcoA family protein [Idiomarina baltica OS145] Length = 663 Score = 83.3 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 61/465 (13%), Positives = 154/465 (33%), Gaps = 70/465 (15%) Query: 23 KHWKVVPIKRFT-KLNTGRTSESGK-------DIIYIGLEDVES-GTGKYLPKDGNSRQS 73 +W V + + K+ +G T + GK +I I ++V + G + + Sbjct: 3 SNWPKVRLGDYCIKIGSGATPKGGKKVYLDEGEISLIRSQNVYNEGFSSSGLVYITNSAA 62 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQ--FLVLQPKDVLPELLQGWL 128 D + IL G + + +A + + + + + P + P ++ +L Sbjct: 63 DKLRNVEVQERDILINITGDSVARVCMAPREYLPARVNQHVAIIRVDPTEFNPNFVRYFL 122 Query: 129 LSIDVTQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + + + I GAT + I N+ + P L +Q I +++ + +I Sbjct: 123 STSEQQRLLLTIASAGATRNALTKSQIENLEIIKPNLEKQAAIAQQLSSLEDKIKVNNQV 182 Query: 188 RIRFIELLKEKKQAL--------------------------------------VSYIVTK 209 ++ + ++ ++ + T Sbjct: 183 NQTLEQIAQAIFKSWFVDFEPVKAKINALAAGGSQEDALLAAMQAISGKDKAQLTQLQTD 242 Query: 210 GLNPDVKMKDSGI--------EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 +++ + +G +P+ W ++PF + E + Y Sbjct: 243 SPEHYNQLRTTAELFPSAMQDSELGEIPEGWFLEPFSNIARLDTTSVKPAKEPEKIWEHY 302 Query: 262 -GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + L + V P I+ ++ + L I ++ + Sbjct: 303 SIPAFDDGMSPAFDLGVDIKSNKYRVFPASILVSKLNPHFPRTWLPDVFDSNAAICSTEF 362 Query: 321 MAVKPHGIDSTYLAW-LMRSYDLCKVFYAM---GSGLRQSLKFEDVKRLPVLVPPIKEQF 376 M P + +++S +G RQ + + V + VL+P +E Sbjct: 363 MQFVPIKPNQRAFVAGVVKSESFQNGIMMRVTGSTGSRQRAQPKQVAEMEVLLPS-EELR 421 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +I +V+ +++ I +++ L + R + + ++G++ + Sbjct: 422 NIYSVL--IAPQLESQASNIREALN-LADVRDTLLPKLLSGELQV 463 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 40/121 (33%), Gaps = 14/121 (11%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKL-NTGRTSESGKDIIY--IGLEDVESGTGKYLP 65 +DS +G IP+ W + P +L T + I+ + + G Sbjct: 260 AMQDSE---LGEIPEGWFLEPFSNIARLDTTSVKPAKEPEKIWEHYSIPAFDDGMSPAFD 316 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLP 121 + + S IL KL P+ + + D ICST+F+ P Sbjct: 317 LGVDIK----SNKYRVFPASILVSKLNPHFPRTWLPDVFDSNAAICSTEFMQFVPIKPNQ 372 Query: 122 E 122 Sbjct: 373 R 373 >gi|319777021|ref|YP_004136672.1| anti-codon nuclease masking agent (prrb) [Mycoplasma fermentans M64] gi|318038096|gb|ADV34295.1| Anti-codon nuclease masking agent (PrrB) [Mycoplasma fermentans M64] Length = 397 Score = 83.3 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 57/407 (14%), Positives = 131/407 (32%), Gaps = 42/407 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P ++ V ++ ++ + I + + GK+ N Q D IF Sbjct: 13 PDGYEWVKLEDAVEIFDNKR---------IPIAQNKRIKGKFPYYGANGIQ-DYVNDFIF 62 Query: 82 AKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 IL G+ G + + + ++ ++ L + +L +D++ Sbjct: 63 DGEYILIGEDGSVIDGLNHPILNYATGKFWVNNHSHVIKAKEEFLNRFIYHFLSILDISD 122 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + T + I +P PL Q I E + + L E +E+ Sbjct: 123 IVRG-----TPPKMTKGNLLTILIPKIPLKIQEKIVEILERFRILEAELKAELKAELEVR 177 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 ++ ++ ++ K ++ +G K + V R + L N Sbjct: 178 GKQFDFWINKLLNFTNFDKNNSK--ELQSIGCFISGLRSKNKDSFVNGNQRYVSYLDVFN 235 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER-- 313 ++Y N +K E ++ G+++F D+ S ++ Sbjct: 236 NKEINY--------LPNNFVKIFDDENQNDLNYGDVIFCGSSENFDETGYASVYTIKNDE 287 Query: 314 ----GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 + + + + + D + +G R +L E + ++ + Sbjct: 288 KVYLNSFSFIFRFKDNNLFLPKFSKYFFNCKDFRDLLLKCINGVTRFNLSKEKMSKIKIP 347 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVL----LKERRSSFI 411 +PPI+ Q I ++++ + + + I L K R + Sbjct: 348 IPPIETQNKIVSILDKLSEYSQEINSGLPAEIELRSKQFKYYRDQLL 394 >gi|218673271|ref|ZP_03522940.1| putative type I restriction enzyme specificity subunit [Rhizobium etli GR56] Length = 239 Score = 83.3 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 23/123 (18%), Positives = 49/123 (39%), Gaps = 1/123 (0%) Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQ 355 K + +++ ++ + +L WL+ S +G Sbjct: 1 ISTGLKVARVTSKDAGCLLVQRVTRFRASEFLTQDFLWWLLSSQTFLSHSLQRATGSDLP 60 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +D+ P+ +PP++EQ I I A+ID L E+ +++ L+ + +A A Sbjct: 61 HISGDDISTCPIPIPPLEEQHKIARRIESAFAKIDRLAEEARRALQLVGRLDEAILAKAF 120 Query: 416 TGQ 418 G+ Sbjct: 121 RGE 123 >gi|223983262|ref|ZP_03633455.1| hypothetical protein HOLDEFILI_00735 [Holdemania filiformis DSM 12042] gi|223964755|gb|EEF69074.1| hypothetical protein HOLDEFILI_00735 [Holdemania filiformis DSM 12042] Length = 359 Score = 83.3 bits (204), Expect = 7e-14, Method: Composition-based stats. Identities = 54/391 (13%), Positives = 119/391 (30%), Gaps = 36/391 (9%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + K + G+ +S VE+ G+Y P G+ + + + Sbjct: 2 LKTFKDILIIKNGKNQKS-----------VENPEGQY-PIYGSGGIIGFANNYLCEGNTV 49 Query: 87 LYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + G+ G + T F LV ++P+ L + L D + + T Sbjct: 50 VIGRKGSINNPIFVDKPFWNVDTAFGLVTDRSKMIPKYLYYFCLHFDF----NRLNKAVT 105 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + I + +P L Q + +K+ + I L ++++ + L + + Sbjct: 106 LPSLTKSDLLKIEIDVPDLVVQFKVVDKL-QKVELIINLKKQQLQKFDDL------IRAR 158 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 V + K ++ + E I + L + Sbjct: 159 FVEMFGDIKSNSKKWEQVYLKDISYLISGGTPSRAKPEYFEGEIPWISTVALGKTEIGFE 218 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 +E S ++ ++F + I++ + Sbjct: 219 DAIEYITKDAIENSATK--LIPANSLLFGIRVGVGKVSKNVVPMCTNQDIVS---ITNID 273 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + +L +L+ + F G Q +K E +K + V +K Q D Sbjct: 274 DNFNLVFLKYLL--DEYLDFFNGQKRGATIQGIKSETLKNILVPKVNLKLQNDF----EQ 327 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++D ++QS+ +E S + Sbjct: 328 FVNQVDKSKLAVQQSLDKTQELFDSLMQKYF 358 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 60/200 (30%), Gaps = 12/200 (6%) Query: 15 VQWIGAIPKH---WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLP 65 V+ G I + W+ V +K + L +G T K +I +I + + Sbjct: 160 VEMFGDIKSNSKKWEQVYLKDISYLISGGTPSRAKPEYFEGEIPWISTVALGKTEIGFED 219 Query: 66 --KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + + S + +L+G + + K + + + D L Sbjct: 220 AIEYITKDAIENSATKLIPANSLLFG-IRVGVGKVSKNVVPMCTNQDIVSITNIDDNFNL 278 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + L + GAT+ + + NI +P L Q + + Sbjct: 279 VFLKYLLDEYLDFFNGQKRGATIQGIKSETLKNILVPKVNLKLQNDFEQFVNQVDKSKLA 338 Query: 184 LITERIRFIELLKEKKQALV 203 + + EL Q Sbjct: 339 VQQSLDKTQELFDSLMQKYF 358 >gi|218514809|ref|ZP_03511649.1| type I restriction-modification system, S subunit [Rhizobium etli 8C-3] Length = 283 Score = 83.3 bits (204), Expect = 8e-14, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 76/219 (34%), Gaps = 11/219 (5%) Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF---FALVTELNRKNTKLIESNILSLSY 261 + ++ E + +P W +V K+ + I L Sbjct: 65 HALSSRRGARTNTHQVNFEAIADIPTSWADGIIAIGSEMVVGFAFKSEWFRAAGIKLLRG 124 Query: 262 GNI----IQKLETRNMGLKPESYETYQIVDPGEIVFRF---IDLQNDKRSLRSAQVMERG 314 NI I + + + + +++ +IV + K + + Q Sbjct: 125 ANIAPGAINWSDLKCLDTSIADEFSKYLIEEDDIVLAMDRPVISTGLKVARVTCQDAGCL 184 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 ++ + ++L WL+ S +G + +D+ P+ +PP + Sbjct: 185 LVQRVTRFRATEFVTQSFLWWLLNSQMFLSHSLQRATGSDLPHISGDDIATCPIPIPPKE 244 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ +I I A+ID L + ++++ L+ + + +A Sbjct: 245 EQHEIVRRIESAFAKIDRLAAEAKRALELVGKLDEAILA 283 >gi|15839330|ref|NP_300018.1| type I restriction-modification system specificity determinant [Xylella fastidiosa 9a5c] gi|9107979|gb|AAF85526.1|AE004080_8 type I restriction-modification system specificity determinant [Xylella fastidiosa 9a5c] Length = 412 Score = 83.3 bits (204), Expect = 8e-14, Method: Composition-based stats. Identities = 46/366 (12%), Positives = 118/366 (32%), Gaps = 40/366 (10%) Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 ++ G+ G Y + + T + ++ PK L + + ++ +G+ Sbjct: 56 VVLGRKGAYRGVEFCHESFWVIDTAYYLV-PKTDLDMRWLYYAVKHYKLGEVD---DGSP 111 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + + + +PP EQ I + + +I+ + + +A Sbjct: 112 IPSTTRAAVYMLELDVPPKHEQHAIAKILGTLDDKIELNRRTNETLEAMARALFKAWCVD 171 Query: 206 I-------------------VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + L + EW G +P+ W V + L R Sbjct: 172 FEPVRAKLEGRWQRGESLPGLPAHLYDLFPARLIESEW-GEIPEGWRVDSLGKVAVHLRR 230 Query: 247 --KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + +++ + + + G+ GEI+F + K Sbjct: 231 SVQPSEIKDETSYIALEHMPKRCIALAEWGVANGIESNKYEFKQGEILFGKLRPYFHKVG 290 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM---RSYDLCKVFYAMGSGL-RQSLKFE 360 + G+ ++ + + P I T+ +++ S + A +G + Sbjct: 291 VAPVD----GVCSTDIVVIAP--ILPTWFGFVLVHVSSDAFVEYTNAGSTGTKMPRTSWS 344 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ + PV++P I + I +++ E L + R + + ++G++ Sbjct: 345 EMAQYPVVLPHEDVAVAFNQHIQALSEEI--IIKIHESR--SLVQLRDTLLPKLISGELR 400 Query: 421 LRGESQ 426 + + Sbjct: 401 VPDAER 406 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 46/192 (23%), Positives = 72/192 (37%), Gaps = 6/192 (3%) Query: 16 QWIGAIPKHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +W G IP+ W+V + + S + YI LE + + Sbjct: 208 EW-GEIPEGWRVDSLGKVAVHLRRSVQPSEIKDETSYIALEHMPKRCIAL--AEWGVANG 264 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE-LLQGWLLSID 132 S F +G+IL+GKL PY K +A DG+CST +V+ P + + S Sbjct: 265 IESNKYEFKQGEILFGKLRPYFHKVGVAPVDGVCSTDIVVIAPILPTWFGFVLVHVSSDA 324 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + A G M W + P+ +P V + I A + I I E + Sbjct: 325 FVEYTNAGSTGTKMPRTSWSEMAQYPVVLPHEDVAVAFNQHIQALSEEIIIKIHESRSLV 384 Query: 193 ELLKEKKQALVS 204 +L L+S Sbjct: 385 QLRDTLLPKLIS 396 >gi|160939176|ref|ZP_02086527.1| hypothetical protein CLOBOL_04070 [Clostridium bolteae ATCC BAA-613] gi|158438139|gb|EDP15899.1| hypothetical protein CLOBOL_04070 [Clostridium bolteae ATCC BAA-613] Length = 375 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 52/396 (13%), Positives = 111/396 (28%), Gaps = 27/396 (6%) Query: 26 KVVPIKRFTKLNTGRTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 V + + G+ + ++ D+++ + + + Sbjct: 2 NQVELGTILHMEKGKKPQKQSKEIEDGFLPYVDIKAFEKGIIDSYASPE-----KCVLCD 56 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G +L G A + ST L D L + + T + + Sbjct: 57 DGDLLIVCDGSRSGLTGRAIKGVVGST--LSKISADGLTREYLRYFIQSKYT-LLNTQKK 113 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G H + + + + IP L EQ I +I +D + + L +QA+ Sbjct: 114 GTGTPHLNAQILKQSKLIIPSLPEQERIVARIEELFSELDKAVETLKTTKQQLAVYRQAV 173 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + + +G KN L E I +++ Sbjct: 174 LKEAFSCA-DTFEPFGSIMTSRLGK--------------MLDKEKNVGLPEQYIRNINVR 218 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 L + G+++ + Sbjct: 219 WFSFDLSDLLKMRIETKEIEKYSIKYGDLIICEGGEPGRCAVWDRNDSIFYQKALHRVRF 278 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 S Y G+G+ + L + + ++PV + I +Q + I Sbjct: 279 KNGENPKLYMYYLWFISQTGELEKYFTGTGI-KHLTGQSLLKVPVPIISISKQNTVVLKI 337 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + + + + IEQS+ + R S + A G+ Sbjct: 338 ESQLSVCNQIEKMIEQSLQQAEAMRQSILKQAFEGR 373 >gi|110644783|ref|YP_672513.1| type I restriction enzyme EcoEI specificity protein [Escherichia coli 536] gi|110346375|gb|ABG72612.1| type I restriction enzyme EcoEI specificity protein [Escherichia coli 536] Length = 568 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 60/482 (12%), Positives = 131/482 (27%), Gaps = 96/482 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 +K K P+ S + + +P+ W+ V ++ G K S + Sbjct: 83 IKKQKPLPEI--SEEEKLFELPEGWEWVRFGNIYEMEYGNNLPQEK----------RSNS 130 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR-KAIIADFDGICSTQFLVLQPKDV 119 G+Y N + I I+ G+ G + + + P + Sbjct: 131 GEYNVYGSNGVVGTHNEACI-KSPCIIIGRKGSAGALNLSNQPACWVTDVAYSTIPPIAM 189 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADW---------------KGIGNIPMPIPPL 164 + E + ++ + + + I G + A + + L Sbjct: 190 VLEFVFIQFHTLGLDKLGKGIKPGLNRNDAYSLVIAIPPRSEQKAIVSKVNELMSLCDQL 249 Query: 165 AEQV---------------------LIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +Q E++ RI + KQ ++ Sbjct: 250 EQQSLTSLDAHQQLVETLLGTLADSQNAEELAENWARISEHFDTLFTTEASVDALKQTIL 309 Query: 204 SYIVTKGLNPDVK-------------------------------MKDSGIEWVGLVPDHW 232 V L P + S E +P+ W Sbjct: 310 QLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLLPISDEEKPFELPNGW 369 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI------- 285 E L+ ++ + S + +++ +++ + + Sbjct: 370 EWCRLGELIDSIDAGWSPACSSEPAAPGEWGVLKTTAVQSLEYREYENKALPKNKAPRPQ 429 Query: 286 --VDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSY 340 V G+I+ +N E +I+ + I Y++ + Sbjct: 430 LEVKAGDILITRAGPKNRVGISCLVENTRENLMISDKIIRFHLISEDISEKYISLCLNYG 489 Query: 341 DLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 SG+ + ++ + +K P+ +P EQ IT+ IN L +I+ Sbjct: 490 FTSTYLENSKSGMAESQMNISQDILKMAPIAIPTTHEQLKITDKINEMMDYFITLKSQIQ 549 Query: 398 QS 399 + Sbjct: 550 SA 551 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 62/192 (32%), Gaps = 16/192 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E + +P+ WE F + N + + + + Sbjct: 93 SEEEKLFELPEGWEWVRFGNIYEMEYGNNLPQEKRS--------NSGEYNVYGSNGVVGT 144 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + I P I+ R +L + + AY + P + ++ + Sbjct: 145 HNEACIKSPCIIIGRKGSA----GALNLSNQPACWVTDVAYSTIPPIAMVLEFVFIQFHT 200 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +G G++ L D L + +PP EQ I + +N + D L ++ S Sbjct: 201 LG----LDKLGKGIKPGLNRNDAYSLVIAIPPRSEQKAIVSKVNELMSLCDQLEQQSLTS 256 Query: 400 IVLLKERRSSFI 411 + ++ + + Sbjct: 257 LDAHQQLVETLL 268 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 50/205 (24%), Gaps = 16/205 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQ 72 +P W+ + S + + V+S + + Sbjct: 364 ELPNGWEWCRLGELIDSIDAGWSPACSSEPAAPGEWGVLKTTAVQSLEYREYENKALPKN 423 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQGW 127 G IL + GP R I + + S + + Sbjct: 424 KAPRPQLEVKAGDILITRAGPKNRVGISCLVENTRENLMISDKIIRFHLISEDISEKYIS 483 Query: 128 LLSIDVTQRIE----AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L + + + P+ IP EQ+ I +KI T Sbjct: 484 LCLNYGFTSTYLENSKSGMAESQMNISQDILKMAPIAIPTTHEQLKITDKINEMMDYFIT 543 Query: 184 LITERIRFIELLKEKKQALVSYIVT 208 L ++ + AL + + Sbjct: 544 LKSQIQSAQQTQLHLADALTNAAIN 568 >gi|326561038|gb|EGE11403.1| putative type I restriction enzyme HindVIIP specificity protein [Moraxella catarrhalis 7169] gi|326564413|gb|EGE14641.1| putative type I restriction enzyme HindVIIP specificity protein [Moraxella catarrhalis 12P80B1] gi|326567425|gb|EGE17540.1| putative type I restriction enzyme HindVIIP specificity protein [Moraxella catarrhalis BC1] gi|326569344|gb|EGE19404.1| putative type I restriction enzyme HindVIIP specificity protein [Moraxella catarrhalis BC8] Length = 454 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 51/459 (11%), Positives = 127/459 (27%), Gaps = 82/459 (17%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + +LN R + GK ++ + + + + S F G I Sbjct: 5 QTRLDEIAELNPTRALKKGKMTSFVEMASLPTNSRDIENIAQKEFSGSGSK---FKNGDI 61 Query: 87 LYGKLGPYLRKA-------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSI--DVTQRI 137 L+ ++ P L + D ST+F+VL ++ + + + Sbjct: 62 LFARITPCLENGKTAKVAGLQHDEIAHGSTEFIVLSAREPEFDEDYLYYFCRLSEFRNYA 121 Query: 138 EAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++ EG + W+ + P + + + +I + + Sbjct: 122 KSRMEGTSGRQRVSWQALAEFEFDFPDKEIRKKAADMLKIFDDKIQLNTQTNQTLEAIAQ 181 Query: 197 EKKQALVSYI-------------------------VTKGLNPDVKMKD------------ 219 ++ V G + Sbjct: 182 AIFKSWFVDFDPVRAKAAALSEGKSEYEANLAAMSVICGKDTSELNDTEYKALWQIAEAF 241 Query: 220 ----SGIEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYG----NIIQKL 268 G VP WE + + N K+ + S I + G I+ Sbjct: 242 PSELVENIEFGEVPKGWENTTLSEICSMQNGYAFKSNEWTGSGIPVIKIGSVKPMIVDIE 301 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + E + ++ G+IV + + + ++ ++ P + Sbjct: 302 SNGFVSEENEHIRSDFLLKQGDIVVGLTGYVGEVGRIPA---GDKAMLNQRVAKFVPKKL 358 Query: 329 DSTYLAWLM-----RSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 D + R + G + ++ ++ P+ + ++ + Sbjct: 359 DHELSYYNFVYCLARQRTFKEYAELNAKGSAQANISTRELLNYPICLASLE--------V 410 Query: 383 NVETA-RIDVL---VEKIEQSIVLLKERRSSFIAAAVTG 417 + +I+ L + Q +L++ R + ++G Sbjct: 411 HKFFEIKINELLYKILTNSQESKVLEKTRDLLLPKLLSG 449 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 55/174 (31%), Gaps = 11/174 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 G +PK W+ + + G +S + I I + V+ S +++ Sbjct: 252 GEVPKGWENTTLSEICSMQNGYAFKSNEWTGSGIPVIKIGSVKPMIVDIESNGFVSEENE 311 Query: 75 -TSTVSIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPKDVLP-----ELLQGW 127 + + +G I+ G G I + + + PK + + Sbjct: 312 HIRSDFLLKQGDIVVGLTGYVGEVGRIPAGDKAMLNQRVAKFVPKKLDHELSYYNFVYCL 371 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + E +G+ ++ + + N P+ + L KI +I Sbjct: 372 ARQRTFKEYAELNAKGSAQANISTRELLNYPICLASLEVHKFFEIKINELLYKI 425 >gi|191170640|ref|ZP_03032192.1| type I restriction enzyme EcoEI specificity protein [Escherichia coli F11] gi|300992647|ref|ZP_07179961.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 200-1] gi|52420939|emb|CAH55818.1| putative restriction modification enzyme S subunit [Escherichia coli] gi|190908864|gb|EDV68451.1| type I restriction enzyme EcoEI specificity protein [Escherichia coli F11] gi|300305268|gb|EFJ59788.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 200-1] gi|324014076|gb|EGB83295.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 60-1] Length = 568 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 60/482 (12%), Positives = 131/482 (27%), Gaps = 96/482 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 +K K P+ S + + +P+ W+ V ++ G K S + Sbjct: 83 IKKQKPLPEI--SEEEKLFELPEGWEWVRFGNIYEMEYGNNLPQEK----------RSNS 130 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR-KAIIADFDGICSTQFLVLQPKDV 119 G+Y N + I I+ G+ G + + + P + Sbjct: 131 GEYNVYGSNGVVGTHNEACI-KSPCIIIGRKGSAGALNLSNQPACWVTDVAYSTIPPIAM 189 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADW---------------KGIGNIPMPIPPL 164 + E + ++ + + + I G + A + + L Sbjct: 190 VLEFVFIQFHTLGLDKLGKGIKPGLNRNDAYSLVIAIPPRSEQKAIVSKVNELMSLCDQL 249 Query: 165 AEQV---------------------LIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +Q E++ RI + KQ ++ Sbjct: 250 EQQSLTSLDAHQQLVETLLGTLADSQNAEELAENWARISEHFDTLFTTEASVDALKQTIL 309 Query: 204 SYIVTKGLNPDVK-------------------------------MKDSGIEWVGLVPDHW 232 V L P + S E +P+ W Sbjct: 310 QLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLLPISDEEKPFELPNGW 369 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI------- 285 E L+ ++ + S + +++ +++ + + Sbjct: 370 EWCRLGELIDSIDAGWSPACSSEPAAPGEWGVLKTTAVQSLEYREYENKALPKNKAPRPQ 429 Query: 286 --VDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSY 340 V G+I+ +N E +I+ + I Y++ + Sbjct: 430 LEVKAGDILITRAGPKNRVGISCLVENTRENLMISDKIIRFHLISEDISEKYISLCLNYG 489 Query: 341 DLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 SG+ + ++ + +K P+ +P EQ IT+ IN L +I+ Sbjct: 490 FTSTYLENSKSGMAESQMNISQDILKMAPIAIPTTHEQLKITDKINEMMDYFITLKSQIQ 549 Query: 398 QS 399 + Sbjct: 550 SA 551 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 62/192 (32%), Gaps = 16/192 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E + +P+ WE F + N + + + + Sbjct: 93 SEEEKLFELPEGWEWVRFGNIYEMEYGNNLPQEKRS--------NSGEYNVYGSNGVVGT 144 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + I P I+ R +L + + AY + P + ++ + Sbjct: 145 HNEACIKSPCIIIGRKGSA----GALNLSNQPACWVTDVAYSTIPPIAMVLEFVFIQFHT 200 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +G G++ L D L + +PP EQ I + +N + D L ++ S Sbjct: 201 LG----LDKLGKGIKPGLNRNDAYSLVIAIPPRSEQKAIVSKVNELMSLCDQLEQQSLTS 256 Query: 400 IVLLKERRSSFI 411 + ++ + + Sbjct: 257 LDAHQQLVETLL 268 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 50/205 (24%), Gaps = 16/205 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQ 72 +P W+ + S + + V+S + + Sbjct: 364 ELPNGWEWCRLGELIDSIDAGWSPACSSEPAAPGEWGVLKTTAVQSLEYREYENKALPKN 423 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQGW 127 G IL + GP R I + + S + + Sbjct: 424 KAPRPQLEVKAGDILITRAGPKNRVGISCLVENTRENLMISDKIIRFHLISEDISEKYIS 483 Query: 128 LLSIDVTQRIE----AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L + + + P+ IP EQ+ I +KI T Sbjct: 484 LCLNYGFTSTYLENSKSGMAESQMNISQDILKMAPIAIPTTHEQLKITDKINEMMDYFIT 543 Query: 184 LITERIRFIELLKEKKQALVSYIVT 208 L ++ + AL + + Sbjct: 544 LKSQIQSAQQTQLHLADALTNAAIN 568 >gi|327474703|gb|EGF20108.1| hypothetical protein HMPREF9391_0217 [Streptococcus sanguinis SK408] Length = 388 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 53/411 (12%), Positives = 125/411 (30%), Gaps = 47/411 (11%) Query: 29 PIKRFTKLNTGR---------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K + + G+ + Y+ + D+ + +SD Sbjct: 2 RYKNLSDFSIGKGTYGISASAVGKDDNLPTYLRITDINDDGTINFASLKSVDRSDADKYR 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQ---PKDVLPELLQGWLLSIDVT 134 + I++ + G ++ D FL+ P+ +P+ ++ + S + Sbjct: 62 L-QPNDIVFARTGGSTGRSYFYDGKDGEFVFAGFLIKFSIDPQKCIPKFIKYYCQSREYY 120 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G+T + + K +P+P PL +Q LI + + +I Sbjct: 121 NWVASFNTGSTRGNINAKTFEKMPIPDLPLEQQQLIVDILSPIDDKI------------- 167 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 E + + ++V N S +G + + F + K I+ Sbjct: 168 --ENNKKINHHLVAISKNYLKIFYSSNSIKLGDIFELKSGYAFKSKDWVDEGKPVIKIKD 225 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + ++ ++ K ++E V EIV K + G Sbjct: 226 IDGITIDITNLNYVKNKSQLSKASNFE----VFGKEIVMALTGATTGKIGVIPKNF--NG 279 Query: 315 IITSAYMAVKPHGIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKR--LPVLVP 370 + S + W + + + + + +L V L V Sbjct: 280 YVNQRVGLFYAKTELSYAVLWSILQQQNIITDLIKLSSGSAQANLSPFSVNSYDLNVTFK 339 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + E ++ + + L I L + R + + ++G++ + Sbjct: 340 DLIE-------LDKVLSPLYELFCFNLSEIQRLSKLRDTLLPKLLSGELSV 383 Score = 46.7 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 51/146 (34%), Gaps = 9/146 (6%) Query: 28 VPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA- 82 + + +L +G +S + I ++D++ T + +S S S F Sbjct: 194 IKLGDIFELKSGYAFKSKDWVDEGKPVIKIKDIDGITIDITNLNYVKNKSQLSKASNFEV 253 Query: 83 -KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQG-WLLSIDVTQRIE 138 +I+ G K + +F+G + + + K L + L ++ + Sbjct: 254 FGKEIVMALTGATTGKIGVIPKNFNGYVNQRVGLFYAKTELSYAVLWSILQQQNIITDLI 313 Query: 139 AICEGATMSHADWKGIGNIPMPIPPL 164 + G+ ++ + + + + Sbjct: 314 KLSSGSAQANLSPFSVNSYDLNVTFK 339 >gi|167750090|ref|ZP_02422217.1| hypothetical protein EUBSIR_01058 [Eubacterium siraeum DSM 15702] gi|167656963|gb|EDS01093.1| hypothetical protein EUBSIR_01058 [Eubacterium siraeum DSM 15702] Length = 377 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 55/403 (13%), Positives = 127/403 (31%), Gaps = 39/403 (9%) Query: 30 IKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI----F 81 + + + G I ++ G G+ + + I Sbjct: 3 LNDICEFIVDCPHTTAPDEGAGYPLIRTPNI--GKGRLVLNGVHRVSEKVYRQRIQRGMP 60 Query: 82 AKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +++ + P AI+ + + +C T L V P+ L ++L+ ++ Sbjct: 61 QDNDLIFAREAPAGNVAIVKNGEKVCLGQRTVLLRPDKSKVCPDYLVYYILAPAQQYKLL 120 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 GAT++H + I N+P+ +PPL Q ++ + A I+ + I+LL+E Sbjct: 121 GTANGATVAHVNLPVIRNMPVELPPLEVQEIVAGYLSAYDNLIEN----NQKQIKLLEEA 176 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q L P + V VP+ W + K + Sbjct: 177 AQRLYKEWFVDLRFPGYE----DTPIVDGVPEGWADGTLGDIAVFKRGKTITKAQ----- 227 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + N+ + E + I + + ++ + S Sbjct: 228 ---------VSDGNIPVVAGGLEPAYYHNKANTTAPLITVSASGANAGFTRLYNIDVFAS 278 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + + + K+ + + +D+ L + VP Sbjct: 279 DCSYIDSNSTPFLLFVYCFLKTNAMKLNSLQKGSAQPHVYAKDLNALVLSVPSEGVLTAF 338 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +++ RI +L ++ + + R + ++G+I++ Sbjct: 339 CGIVSPYFERIRLL----QRENEIAAQARDRMLPKLMSGEIEV 377 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 59/187 (31%), Gaps = 15/187 (8%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG-- 289 + + + + + NI + N + Q + G Sbjct: 1 MILNDICEFIVDCPHTTAPDEGAGYPLIRTPNIGKGRLVLNGVHRVSEKVYRQRIQRGMP 60 Query: 290 ---EIVFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYDLCK 344 +++F + +++ E+ + + + YL + + + Sbjct: 61 QDNDLIFAREAPAGNVAIVKN---GEKVCLGQRTVLLRPDKSKVCPDYLVYYILAPAQQY 117 Query: 345 VFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 +G + ++ +PV +PP++ Q + ++ D L+E ++ I LL Sbjct: 118 KLLGTANGATVAHVNLPVIRNMPVELPPLEVQEIVAGYLSA----YDNLIENNQKQIKLL 173 Query: 404 KERRSSF 410 +E Sbjct: 174 EEAAQRL 180 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 55/199 (27%), Gaps = 15/199 (7%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 +P Y+D+ + + +P+ W + G+T + G Sbjct: 189 RFPGYEDTPI--VDGVPEGWADGTLGDIAVFKRGKTITKA-----------QVSDGNIPV 235 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 G + + I G + + D S + P LL Sbjct: 236 VAGGLEPAYYHNKANTTAPLITVSASGANAGFTRLYNIDVFASDCSYIDSNST--PFLLF 293 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + ++ ++ +G+ H K + + + +P + RI L Sbjct: 294 VYCFLKTNAMKLNSLQKGSAQPHVYAKDLNALVLSVPSEGVLTAFCGIVSPYFERIRLLQ 353 Query: 186 TERIRFIELLKEKKQALVS 204 E + L+S Sbjct: 354 RENEIAAQARDRMLPKLMS 372 >gi|268592728|ref|ZP_06126949.1| type I restriction enzyme EcoEI specificity protein [Providencia rettgeri DSM 1131] gi|291311502|gb|EFE51955.1| type I restriction enzyme EcoEI specificity protein [Providencia rettgeri DSM 1131] Length = 593 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 62/496 (12%), Positives = 131/496 (26%), Gaps = 99/496 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESG 59 +K K P+ S + +P+ W+ + +N + + +I ++ + + + Sbjct: 83 IKKQKPLPEI--SEDEKPFELPEGWEWTSLNEIALINPKIEVTNDEQEISFVPMPCISTR 140 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLV 113 ++ + FA G I K+ P + + + G+ +T+ V Sbjct: 141 FDGTHDQEIKKWGEVKKGYTHFADGDIALAKITPCFENSKAVIFEGLKNGVGVGTTELHV 200 Query: 114 LQPKDVLPELLQGWLL---SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 +P L L +T + A N P+P PP EQ I Sbjct: 201 ARPLSSELNLQYILLNIKAPHYLTIGELQMTGSAGQKRVPRSFFENYPIPFPPKTEQARI 260 Query: 171 RE-----------------------------------------KIIAETVRIDTLITERI 189 E ++ RI+ Sbjct: 261 VETFSELMSLCDQLEQQSLTSLEAHQQLVETLLATLTDSQNEKELAENWSRINQHFDTLF 320 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV------------------------ 225 + KQ ++ V L P + E + Sbjct: 321 TTEASIDALKQTILQLAVMGKLVPQDPNDEPASELLKRIEQEKARLVKQGKIKKQKPLPP 380 Query: 226 -------GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 +P WE L + E + +++ P Sbjct: 381 ISDEEKPFELPQGWEWCRLGNLAHNSEAGWSPQCEVSPRVDDNWGVLKISSVTWSEFNPN 440 Query: 279 SYET---------YQIVDPGEIVFRFIDL-QNDKRSLRSAQVMERGIITSAYMA--VKPH 326 + V + + + RS+ ++ S + Sbjct: 441 ENKALPKHLEPKIEYEVKARDFLISRANTADLVARSVVVPDSPPNHLMLSDKIIRFQFSK 500 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 +D+ Y+ + S + + G +++ V L V +P EQ +I + Sbjct: 501 LVDANYINLVNNSKYSRTYYSEVAGGTSSSMKNVSRIQVSSLLVALPSYNEQLNIVEKVR 560 Query: 384 VETARIDVLVEKIEQS 399 T + L +++ + Sbjct: 561 NLTLLCEHLKSRLQSA 576 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 60/202 (29%), Gaps = 9/202 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE---TRNMGLK 276 S E +P+ WE + + E I + I + + + + Sbjct: 93 SEDEKPFELPEGWEWTSLNEIALINPKIEVTNDEQEISFVPMPCISTRFDGTHDQEIKKW 152 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 E + Y G+I I + + ++ G+ + S Sbjct: 153 GEVKKGYTHFADGDIALAKITPCFENSKAVIFEGLKNGVGVGTTELHVARPLSSELNLQY 212 Query: 337 MR------SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + Y GS ++ + + P+ PP EQ I + + D Sbjct: 213 ILLNIKAPHYLTIGELQMTGSAGQKRVPRSFFENYPIPFPPKTEQARIVETFSELMSLCD 272 Query: 391 VLVEKIEQSIVLLKERRSSFIA 412 L ++ S+ ++ + +A Sbjct: 273 QLEQQSLTSLEAHQQLVETLLA 294 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 21/205 (10%), Positives = 58/205 (28%), Gaps = 16/205 (7%) Query: 20 AIPKHWKVVPIKRFTKLNT-------GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P+ W+ + + + + + + V + Sbjct: 389 ELPQGWEWCRLGNLAHNSEAGWSPQCEVSPRVDDNWGVLKISSVTWSEFNPNENKALPKH 448 Query: 73 SDTSTVSIFAKGQILYGKLGP---YLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQG 126 + L + R ++ D + S + + Q ++ Sbjct: 449 LEPKIEYEVKARDFLISRANTADLVARSVVVPDSPPNHLMLSDKIIRFQFSKLVDANYIN 508 Query: 127 WLLSIDVTQRIEAICEGAT---MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + ++ + G T M + + ++ + +P EQ+ I EK+ T+ + Sbjct: 509 LVNNSKYSRTYYSEVAGGTSSSMKNVSRIQVSSLLVALPSYNEQLNIVEKVRNLTLLCEH 568 Query: 184 LITERIRFIELLKEKKQALVSYIVT 208 L + + AL + Sbjct: 569 LKSRLQSAQQTQFHLADALTDAALN 593 >gi|291527176|emb|CBK92762.1| Restriction endonuclease S subunits [Eubacterium rectale M104/1] Length = 396 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 49/409 (11%), Positives = 129/409 (31%), Gaps = 37/409 (9%) Query: 28 VPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESG------TGKYLPKDGNSRQSDT 75 +K +G + + + + D+ + + +S + Sbjct: 3 KKLKDVCMFYSGTGFPIQYQGQTKGEYPFYKVGDIANNAIAGKIYLELCNNYISSDVAKM 62 Query: 76 STVSIFAKGQILYGKLGPYLR--KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 I K +++ K+G L+ + I D + + + PK + + ++ Sbjct: 63 IKGCILPKDTVVFAKIGEALKLNRRAITSCDCLIDNNAMGIAPKLDSLRIQYFYFCMKNL 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++ + E T+ + + +P L EQ E+I + +I +R + + Sbjct: 123 K--MQTLAESTTVPSVRKTVLEKYEIEVPSLVEQ----EEIEKKLTLTQKIIEKRRQELS 176 Query: 194 LLKEKKQALVSYIV-TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L E +A + NP K + E VG + + + + Sbjct: 177 YLDEIIKARFVEMFGDPATNPFNWDKINISEVVGDKVSNGFFAKRDDYADD--GNVSVMG 234 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS---LRSAQ 309 + I++ Y T E +E V G+++F L + + Sbjct: 235 VAYIVNRMYSQWQDLPRTNGTDKDIEKFE----VKYGDMLFCRSSLVAEGIGKASIVPED 290 Query: 310 VMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLP 366 V + + + + Y+ + A + ++ + + + Sbjct: 291 VPQNTLFECHVIRLPLDLSKCVPEYMQVFSTMEYFRRQIIAQSKTATMTTIGQDGILKAD 350 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +L+PP+ +Q + ++ + +++++ + S + Sbjct: 351 ILLPPMSKQREFYAFVHQV----NKSKVAVQKALDETQILFDSLMQKYF 395 >gi|241895463|ref|ZP_04782759.1| type I site-specific deoxyribonuclease specificity subunit [Weissella paramesenteroides ATCC 33313] gi|241871437|gb|EER75188.1| type I site-specific deoxyribonuclease specificity subunit [Weissella paramesenteroides ATCC 33313] Length = 410 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 60/387 (15%), Positives = 129/387 (33%), Gaps = 15/387 (3%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ +K + + P + + S Sbjct: 15 DWEKRRLKSMGDFRRVSVDPQKTPNTLFTEYSMPAYDNNKTP-NIVLGSTIHSNRLQIGD 73 Query: 84 GQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L KL ++ G + S++F+ + + L+ LLS T+ +E I Sbjct: 74 NVLLINKLNVRQKRVWYVKRAGNNAVSSSEFMPFTSESLKLSFLKQLLLSDKSTKFMENI 133 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G + S +I + ++KI +D LIT R ++LLK+KK Sbjct: 134 SSGTSNSQ-KRITPLDISNYLIEKPTDAREQDKIGDFFETLDNLITVNQRKVDLLKKKKT 192 Query: 201 ALVSYIVTK--GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + K NP+++ K W K ++ N I Sbjct: 193 GYLQKLFPKNGQNNPELRFKGFTDAWEKRRLGDVVNKVKSYSLSHDVECNESTGYKYIHY 252 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR---SLRSAQVMERGI 315 I + + L Y + +++ ++ A E + Sbjct: 253 GDIHTGIADIINKKSVLPNIKPNQYDTLSVNDLIVADASEDYQGIASPAVIQALPDENLV 312 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKE 374 +A++P ++T+L L+ + + Y G+GL + + ++ + +P KE Sbjct: 313 AGLHTIALRPQATNATFLYHLLHTGNFKHFGYRTGTGLKVFGISWPNLSKFEFNLPSQKE 372 Query: 375 QFDITNVINVETARIDVLVEKIEQSIV 401 Q ++ +++ +D L+ ++ + Sbjct: 373 QDEVVDLL----RLLDNLIVVNQRKVD 395 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 62/190 (32%), Gaps = 8/190 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 D WE + ++ N L Y + + + ++ Sbjct: 14 DDWEKRRLKSMGDFRRVSVDPQKTPNTLFTEYSMPAYDNNKTPNIVLGSTIHSNRLQIGD 73 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ KR + + +S +M + ++L L+ S K + Sbjct: 74 NVLLINKLNVRQKRVWYVKRAGNNAVSSSEFMPFTSESLKLSFLKQLLLSDKSTKFMENI 133 Query: 350 GSGL---RQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 SG ++ + D+ + P EQ I + +D L+ ++ + LLK+ Sbjct: 134 SSGTSNSQKRITPLDISNYLIEKPTDAREQDKIGDF----FETLDNLITVNQRKVDLLKK 189 Query: 406 RRSSFIAAAV 415 +++ ++ Sbjct: 190 KKTGYLQKLF 199 >gi|160914348|ref|ZP_02076567.1| hypothetical protein EUBDOL_00356 [Eubacterium dolichum DSM 3991] gi|158433821|gb|EDP12110.1| hypothetical protein EUBDOL_00356 [Eubacterium dolichum DSM 3991] Length = 504 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 55/414 (13%), Positives = 130/414 (31%), Gaps = 47/414 (11%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP +W+ + K+ G + + I + + D++ + + + + Sbjct: 88 EIPDNWEWKSWGEVSYKIQYGYNAPAKDTGVIKMVRITDIQDNQVLWDSVPFCNIKENEI 147 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF-----LVLQPKDVLPELLQGWLLSI 131 + IL+ + G + K+ + + S V ++ P+ L+ ++ + Sbjct: 148 PDYLLHNFDILFARTGGTVGKSFLVENINEDSVFAGYLIRTVYNYNEINPKYLKYFMETS 207 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +++ + + + + + +PIPPL EQ I K+ I+ + Sbjct: 208 LYWSQLKKGTIATAQPNCNGQTLSKMILPIPPLQEQHRIVAKLQELEPLIEKYRIAEEQL 267 Query: 192 IELL----KEKKQALVSYIVTKGLNPDVKM------------------------KDSGIE 223 EL + K++++ Y + L P K E Sbjct: 268 HELNSNIKDQLKKSILQYAIEGKLVPQDPNDEPASVLLERIREEKQQLIKEGKIKKDKNE 327 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL--------SLSYGNIIQKLETRNMGL 275 + D+ + F ++ + + N + L+ GN+ + + + Sbjct: 328 SIIFRRDNSYYEKFGNTEFCIDDEIKCSVPINWILTRQKNLCWLNNGNLSKGEILPYLEV 387 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRS---LRSAQVMERGIITSAYMAVKPHGIDSTY 332 K ++ ++ RG + S + ++ + Sbjct: 388 KVLRGNKEAETKDSGVIVTRGTNVILVDGENSGEVMKIKYRGYMGSTFKILQTSNFVNEK 447 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ++ + K + L E + +PPI EQ I IN+ T Sbjct: 448 YVDIIFQCNRIKYKHNKKGAAIPHLDKELFNNTLIFLPPITEQQRILEKINLIT 501 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 77/215 (35%), Gaps = 17/215 (7%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNR------KNTKLIESNILSLSYGNIIQKLETR 271 + E +PD+WE K + + ++ K+T +I+ ++ N + Sbjct: 79 RCIEDELPFEIPDNWEWKSWGEVSYKIQYGYNAPAKDTGVIKMVRITDIQDNQVLWDSVP 138 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGID 329 +K Y ++ +I+F K L + E + + + I+ Sbjct: 139 FCNIKENEIPDY-LLHNFDILFARTGGTVGKSFLVE-NINEDSVFAGYLIRTVYNYNEIN 196 Query: 330 STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL + M + + + + + + ++ + +PP++EQ I + Sbjct: 197 PKYLKYFMETSLYWSQLKKGTIATAQPNCNGQTLSKMILPIPPLQEQHRIVAKLQELEPL 256 Query: 389 IDVLVEKIEQSIVLLK-----ERRSSFIAAAVTGQ 418 I+ E+ + L + + S + A+ G+ Sbjct: 257 IEKYR-IAEEQLHELNSNIKDQLKKSILQYAIEGK 290 >gi|260664492|ref|ZP_05865344.1| conserved hypothetical protein [Lactobacillus jensenii SJ-7A-US] gi|260561557|gb|EEX27529.1| conserved hypothetical protein [Lactobacillus jensenii SJ-7A-US] Length = 376 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 68/393 (17%), Positives = 137/393 (34%), Gaps = 33/393 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK F+ T ++ D I E++ SG GK + F KG Sbjct: 14 WKNKKFLTFSSKITKNSTSDDIDFPRIEFENIVSGEGKLAQNRSKLNHIK--SGIKFDKG 71 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL+GKL PYL+ +A+F G+ F V++ K +L+ + +++ G Sbjct: 72 DILFGKLRPYLKNWWLAEFPGVAVGDFWVIRAK--DNRYFLYYLIQAPLFEKVSNYTTGT 129 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 M +DW + N +P + EQ I + + + L K Q + Sbjct: 130 KMPRSDWNYVSNTFFKLPKIDEQEKIGRILDKVDSLLSLQQRKLELISALEKGLGQIIKQ 189 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 G+ + + ++ N L N+ Sbjct: 190 QNNKYGIT----------------------FSLNNFLEIPPQIQARIKNKNQLLTVKLNL 227 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 Y I GE++F ++ N +L + + + ++ ++K Sbjct: 228 QGIARGVQRDTLSLGSTKYFIRHTGELIFGKQNIFNGSIALITKE-FDGLATSNDVPSLK 286 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV-LVPPIKEQFDITNVI 382 I+ +L +L+++ D K + +G + + D+ +L + ++P K Q I + + Sbjct: 287 ISNINPQFLFYLLKNPDFWKHTELIATGTGSKRVHIHDLLKLHIKIIPDAKYQAKIVS-L 345 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +I + + I + K+ + Sbjct: 346 SRNFEKIVLNQQIIVKECEKTKQF---LLQNLF 375 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 63/189 (33%), Gaps = 16/189 (8%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGE 290 F + KN+ + + + + NI+ + K ++ D G+ Sbjct: 13 PWKNKKFLTFSSKITKNSTSDDIDFPRIEFENIVSGEGKLAQNRSKLNHIKSGIKFDKGD 72 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I+F + L G+ + ++ + +L +L+++ KV Sbjct: 73 ILFGKLRPYLKNWWLAEF----PGVAVGDFWVIRAKD-NRYFLYYLIQAPLFEKVSNYTT 127 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + V +P I EQ I +++ ++D L+ ++ + L+ Sbjct: 128 GTKMPRSDWNYVSNTFFKLPKIDEQEKIGRILD----KVDSLLSLQQRKLELISALEKGL 183 Query: 411 IAAAVTGQI 419 GQI Sbjct: 184 ------GQI 186 >gi|326802763|ref|YP_004320581.1| type I restriction modification DNA specificity domain protein [Aerococcus urinae ACS-120-V-Col10a] gi|326650196|gb|AEA00379.1| type I restriction modification DNA specificity domain protein [Aerococcus urinae ACS-120-V-Col10a] Length = 334 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 58/346 (16%), Positives = 114/346 (32%), Gaps = 24/346 (6%) Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 S+ + G+ G + T F + + +I+ + Sbjct: 5 NKSLSDIDAVGLGRKGTIDNPIYLKAPFWTVDTLFYITTQQQNNIMFFYYLFKTINWKKY 64 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 EA + + I +I + +P EQ KI +D +IT + IE L+ Sbjct: 65 NEAS----GVPSLSKQTIYSISVKVPNTFEQ----SKISKLFYSLDRIITLEQQKIEKLE 116 Query: 197 EKKQALVSYIVTKG-LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-NTKLIES 254 KQ L+ + P V+ K +W + ++ + L+ K + Sbjct: 117 LLKQYLLQNMFADESGYPRVRFKGYNNKW-----ERSKLNTISDSYSGLSGKTKEDFGKG 171 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + Y N+ + G + V G+ +F ++ + S + Sbjct: 172 EARYIEYKNVFDNPVAKLDGTDAIDIDYKQNEVKKGDFLFTTSSETPEEVGMSSLWDYDL 231 Query: 314 GII---TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 I + + IDS YLA+ RS + + G+ R ++ L + V Sbjct: 232 NNIYLNSFCFGVRIKEKIDSYYLAYYFRSPEFRSRVMKLAQGISRYNISKNKFCELKISV 291 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P +E I ++ T L++ E + K +S + + Sbjct: 292 PSYEEGVRIGRLLKSTT----DLIDLEENKLKEFKLIKSKLLQSLF 333 >gi|240172538|ref|ZP_04751197.1| type I restriction-modification system specificity determinant [Mycobacterium kansasii ATCC 12478] Length = 66 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 26/61 (42%), Positives = 39/61 (63%) Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 + V+VPP EQ I N ++ +T ++D L+ + E+ I L +ERRS+ I A VTGQID Sbjct: 1 MLADFDVVVPPADEQASILNYLDQQTTKVDTLIAESERFIELARERRSALITAVVTGQID 60 Query: 421 L 421 + Sbjct: 61 V 61 >gi|257790529|ref|YP_003181135.1| N-6 DNA methylase [Eggerthella lenta DSM 2243] gi|257474426|gb|ACV54746.1| N-6 DNA methylase [Eggerthella lenta DSM 2243] Length = 799 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 73/200 (36%), Gaps = 11/200 (5%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 +P++ + S W+G +P+ W L + K + + G + Q Sbjct: 595 FAMLPDPEICYEASET-WLGDIPESWSALRIGDLFELRSTKVSDEDYRPLSVTKKGIVPQ 653 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K +++ ++V G+ D+R+ + + T + Sbjct: 654 LDSV----AKSDNHANRKLVKEGDFAINSRS---DRRNSCGFSPYDGSVSTITTVLFPRQ 706 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I S Y L + + FY G G+ + + D+K++ + PPI EQ I + ++ Sbjct: 707 PIVSRYFDLLFDTPRFAEEFYRWGHGIDSDIWTTNWSDMKKIVIPCPPISEQKRIVDYLS 766 Query: 384 VETARIDVLVEKIEQSIVLL 403 E +I ++ I + Sbjct: 767 DELKQIRSARASVQAEIENI 786 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 62/170 (36%), Gaps = 9/170 (5%) Query: 17 WIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+G IP+ W + I +L + T S +D + + G D ++ + + Sbjct: 611 WLGDIPESWSALRIGDLFELRS--TKVSDEDYRPLSVT----KKGIVPQLDSVAKSDNHA 664 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG-WLLSIDVTQ 135 + +G + +DG ST VL P+ + + + Sbjct: 665 NRKLVKEGDFAINSRSDRRNSCGFSPYDGSVSTITTVLFPRQPIVSRYFDLLFDTPRFAE 724 Query: 136 RIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 G + + +W + I +P PP++EQ I + + E +I + Sbjct: 725 EFYRWGHGIDSDIWTTNWSDMKKIVIPCPPISEQKRIVDYLSDELKQIRS 774 >gi|154490802|ref|ZP_02030743.1| hypothetical protein PARMER_00719 [Parabacteroides merdae ATCC 43184] gi|154088550|gb|EDN87594.1| hypothetical protein PARMER_00719 [Parabacteroides merdae ATCC 43184] Length = 384 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 63/396 (15%), Positives = 126/396 (31%), Gaps = 39/396 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W + +LN+G + + + +G G + + Sbjct: 20 KTWNKKTLDELVQLNSG-----------MDYKHLCNGNIPVYGTGGYMLSVNAALSY--D 66 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I G+ G + I+ T F + K + I E Sbjct: 67 KDAIGIGRKGTINKPYILKAPFWTVDTLFYAIPRKYNN----LQFCNCIFQRIDWLKYDE 122 Query: 143 GATMSHADWKGIGNIPMPI-PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + I +I + P EQ I + I T + + ++ Q+ Sbjct: 123 STGVPSLSKNIINSIEVNCAPSYDEQQKIASYFQSLDSLIQTTSKKLVSLKQIKDASLQS 182 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + P V+ K EW E PF + + E ++T E +LS + Sbjct: 183 MFPQ--EGETVPKVRFKGFEGEW--------EKIPFGSFLKESYERSTVENEDILLSSAI 232 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + E + S Y+ + ++ +L + E G+I+ AY Sbjct: 233 TGVYLNSELFG-HQRGASNIGYKKIKKNMLILSTQNL--HLGNANVNLRFEHGLISPAYK 289 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLVPPIKEQFD 377 + I +L ++ F + R+++ ++D+ + VL+P EQ Sbjct: 290 VYEIVNISPLFLQQWIKMDSTKVFFLNATTAGASLCRKNIVWDDLYKQIVLIPSKNEQVK 349 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + +D + Q + LK+ +++ + Sbjct: 350 IGLF----FSNLDKQISLQTQRLEKLKQIKAACLDK 381 >gi|299137474|ref|ZP_07030656.1| restriction modification system DNA specificity domain protein [Acidobacterium sp. MP5ACTX8] gi|298600879|gb|EFI57035.1| restriction modification system DNA specificity domain protein [Acidobacterium sp. MP5ACTX8] Length = 499 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 50/393 (12%), Positives = 119/393 (30%), Gaps = 29/393 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + F + GR+ G P G++ I I+ Sbjct: 2 KQLGTFCEFKYGRSLPEKH------------RQGGNFPVYGSNGIVGWHHEPITNGPTII 49 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G + T + V Q + ++L + ++A+ + A + Sbjct: 50 IGRKGSAGALQYSSMSCCPIDTTYYVDQSCTSVNLRWLFFMLQML---DLDALNKHAAVP 106 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + +P EQ I + L L S + Sbjct: 107 GLNRNDAYEKELLLPSSDEQKKIAALLDMADALRHQRQESLQLAETL-------LRSCFL 159 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 +P K+ I + + + PF + + + ++ + + + G++ Sbjct: 160 NIFGDPVSNSKNWPIVPLSELAVKFSDGPFGSNLKTEHYRDNGIRVWRLQDIGIGSLKNS 219 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPH 326 + + PG+++ + N + ++ + + E + K Sbjct: 220 GIAYISPQHYANLPKHH-CAPGDVIVGTLGEPNLRAAIVPSTIPESLNKADCVQIRAKKG 278 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ++ WL+ + +++ G R + ++ L V +PPI Q + + ++ Sbjct: 279 VALPEFICWLLNMPGTLALAHSLVLGETRARISMGRLRTLNVPLPPIGLQREFSQTLSRI 338 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 A L E + + +S A G+ Sbjct: 339 LA----LKELVLAQSPEVDYLFASIQQRAFRGE 367 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 44/279 (15%), Positives = 85/279 (30%), Gaps = 21/279 (7%) Query: 23 KHWKVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPK-DGNSRQS 73 K+W +VP+ K + G + I L+D+ G+ K + + Sbjct: 170 KNWPIVPLSELAVKFSDGPFGSNLKTEHYRDNGIRVWRLQDIGIGSLKNSGIAYISPQHY 229 Query: 74 DTSTVSIFAKGQILYGKLG-PYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWL 128 A G ++ G LG P LR AI I + + + LPE + L Sbjct: 230 ANLPKHHCAPGDVIVGTLGEPNLRAAIVPSTIPESLNKADCVQIRAKKGVALPEFICWLL 289 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 ++ G T + + + +P+PP+ Q + + Sbjct: 290 NMPGTLALAHSLVLGETRARISMGRLRTLNVPLPPIGLQREFSQTLSRILAL----KELV 345 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + ++ LN + + +E G P +P + Sbjct: 346 LAQSPEVDYLFASIQQRAFRGELNLNRSTLANEVESPG--PTSVPERPTAEGRFKRPGSF 403 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 E L + I ++ E + Y+I+ Sbjct: 404 VAPPEIEAQMLELEDRIDYGPGDSISW-SEDFFKYRILS 441 >gi|262375746|ref|ZP_06068978.1| predicted protein [Acinetobacter lwoffii SH145] gi|262309349|gb|EEY90480.1| predicted protein [Acinetobacter lwoffii SH145] Length = 333 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 42/344 (12%), Positives = 103/344 (29%), Gaps = 32/344 (9%) Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS---IDVTQRIEAICEGA 144 + G A+I + + L++ +P + IE G Sbjct: 1 MVQSGHVGHAAVIPEELNNSAAHALIMFSDYKVPTNPYFLNYQLQTNNAKSAIEKFTTGN 60 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T+ H + EQ+ + E +D I + + + K+A++ Sbjct: 61 TIRHILSSDMKEFLGFFTNFDEQLKVGEF----FQNLDQSIALHEKKLAQTQNFKKAMLE 116 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + K + +++ + + W ++ + + Y + Sbjct: 117 KMFPKQGSKRPEIR------LNSFREDWYSSKLTDYISIKHGYAFNGEFFSDKETDYCLL 170 Query: 265 IQKLETRNMGLKPESYETY-------QIVDPGEIVFRFIDLQNDK-----RSLRSAQVME 312 G K E + Y I+ +++ DL + +L + + Sbjct: 171 TPGNFMIGGGFKAEKFIYYKGGVPKNYILKENDLIVTMTDLSKESDTLGLPALLPSIEGK 230 Query: 313 RGIITS--AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 + + + ++ +L +L+++ K +G + + + Sbjct: 231 ILLHNQRLGLITFENLELEKEFLFYLLQTKSYHKYIVLSATGTTVKHTSPSKILGFTCKI 290 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P EQ I +ID + +Q + LK + +F+ Sbjct: 291 PEPTEQKKIGGF----FKKIDEKINLHQQQLQTLKNLKQAFLEK 330 >gi|146319104|ref|YP_001198816.1| Type I restriction enzyme EcoKI specificity protein (S protein) [Streptococcus suis 05ZYH33] gi|145689910|gb|ABP90416.1| Type I restriction enzyme EcoKI specificity protein (S protein) [Streptococcus suis 05ZYH33] gi|292558740|gb|ADE31741.1| Type I restriction modification DNA specificity domain protein [Streptococcus suis GZ1] Length = 442 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 54/439 (12%), Positives = 123/439 (28%), Gaps = 66/439 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGKYLPKDGNSRQ 72 IP+ W+ V + G+ G ++ Y+ + D++ GT K Sbjct: 4 DIPESWEWVRLGAIVTAKGGKRIPKGYNLQEEDNGHPYLRVTDMKDGTIKPTNIKFAPDN 63 Query: 73 -SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWL 128 + I G I+ + + ++ + + L L Sbjct: 64 VYTIIRNYTISSTDIYVTIAGTIGDVGIVPENFNNALLTENALKLMLTESINKMFLAHLL 123 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S V ++ + + + + +P+PPLAEQ I +I +++ Sbjct: 124 KSPLVQKQFKEVYNQVAQPKLSIRSTNSTIIPLPPLAEQKRIVAQIERALEQVEVYAESY 183 Query: 189 IRFIELLKEKK----QALVSYIVTKGLNPDVKM--------------------------- 217 + EL + ++++ Y + L Sbjct: 184 NKLQELDRAFPDKLKKSILQYAMQGKLVAQDPNDEPVEVLLEMIRAEKQKLYEEGKLKKK 243 Query: 218 --------KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL----SLSYGNII 265 K G +P +W + + + + K + I+ + G I Sbjct: 244 DLAEIMVEKGDDNSPYGKIPRNWTLLSVKDIFSITTGLSYKKTDLAIIQRGVRIIRGGNI 303 Query: 266 QKLETRNMGLKPESYETYQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + L + + Y + ++V + + Sbjct: 304 EPLAYKLLDNDYYIESKYITSESVYLKRNQLVTPVSSSLEHIGKFARIDKNYSDTVAGGF 363 Query: 321 MA----VKPHGIDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIK 373 + S YL + S K + ++ + L + + P + Sbjct: 364 VFQLTPFISSDTLSNYLLLCLSSPLFYKQLQSVTKLSGQALYNIPKTKLNDLRIALAPEQ 423 Query: 374 EQFDITNVINVETARIDVL 392 EQ I+N + ++++L Sbjct: 424 EQERISNKVGQLFQKVNLL 442 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 72/210 (34%), Gaps = 22/210 (10%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET---- 282 +P+ WE A+VT K + + ++ + ++ +KP + + Sbjct: 4 DIPESWEWVRLGAIVTAKGGKRIPKGYNLQEEDNGHPYLRVTDMKDGTIKPTNIKFAPDN 63 Query: 283 -YQIVDPGEI----VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 Y I+ I ++ I + + +A + I+ +LA L+ Sbjct: 64 VYTIIRNYTISSTDIYVTIAGTIGDVGIVPENFNNALLTENALKLMLTESINKMFLAHLL 123 Query: 338 RSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +S + K F + L + +PP+ EQ I I + VE Sbjct: 124 KSPLVQKQFKEVYNQVAQPKLSIRSTNSTIIPLPPLAEQKRIVAQIERALEQ----VEVY 179 Query: 397 EQSIVLLKE--------RRSSFIAAAVTGQ 418 +S L+E + S + A+ G+ Sbjct: 180 AESYNKLQELDRAFPDKLKKSILQYAMQGK 209 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 32/87 (36%), Gaps = 6/87 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP++W ++ +K + TG + + + I ++E K L D Sbjct: 260 GKIPRNWTLLSVKDIFSITTGLSYKKTDLAIIQRGVRIIRGGNIEPLAYKLLDNDYYIES 319 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAI 99 ++ S++ K L + L Sbjct: 320 KYITSESVYLKRNQLVTPVSSSLEHIG 346 >gi|330833401|ref|YP_004402226.1| restriction endonuclease S subunit [Streptococcus suis ST3] gi|329307624|gb|AEB82040.1| restriction endonuclease S subunit [Streptococcus suis ST3] Length = 252 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 67/211 (31%), Gaps = 11/211 (5%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + + K + + + + E+ N + + ++ Sbjct: 13 FPGFTDAWKQRKLGEVADFSIKTNSLSRDKLSSYFYEVQ--NIHYGDILTKYDAILDVCN 70 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVK 324 K +G + ++ G+IVF D K + + + + Sbjct: 71 KELPSIIGSTISDF-ADALLSEGDIVFADAAEDSTVGKAIEVRNFKGKNVVSGLHTIVAR 129 Query: 325 PHGID-STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P YL +L+ S + G S+ ++K V+ P + EQ I + Sbjct: 130 PKVSYAPYYLGYLINSTAYHNQILPLMQGTKVSSISKANLKSTTVVFPTLPEQEAIGSF- 188 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ + +KE + + + Sbjct: 189 ---FSDLDQLITLHQRKLDDVKELKKALLQK 216 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 81/208 (38%), Gaps = 28/208 (13%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDII---YIGLEDVESGTG------------KYLPKDGN 69 WK + + +T+ +D + + ++++ G K LP Sbjct: 20 WKQRKLGEVADFSI-KTNSLSRDKLSSYFYEVQNIHYGDILTKYDAILDVCNKELPSIIG 78 Query: 70 SRQSDTSTVSIFAKGQILYGK---LGPYLRKAIIADFDG--ICST-QFLVLQPKDVLPEL 123 S SD + + ++G I++ + + +F G + S +V +PK Sbjct: 79 STISDFADA-LLSEGDIVFADAAEDSTVGKAIEVRNFKGKNVVSGLHTIVARPKVSYAPY 137 Query: 124 LQGWLLSI-DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 G+L++ +I + +G +S + + + P L EQ I +D Sbjct: 138 YLGYLINSTAYHNQILPLMQGTKVSSISKANLKSTTVVFPTLPEQEAIGSF----FSDLD 193 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKG 210 LIT R ++ +KE K+AL+ + KG Sbjct: 194 QLITLHQRKLDDVKELKKALLQKMFPKG 221 >gi|163756219|ref|ZP_02163334.1| hypothetical protein KAOT1_06767 [Kordia algicida OT-1] gi|161323831|gb|EDP95165.1| hypothetical protein KAOT1_06767 [Kordia algicida OT-1] Length = 553 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 61/472 (12%), Positives = 132/472 (27%), Gaps = 81/472 (17%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQ------ 72 IP+ W I F G++++ K+ G + S + + Sbjct: 84 KIPETWISKNITEFYYTIGGKSNQIKSKNYNERGKYPIVSQGKNKIDGYSDDESKLLKLA 143 Query: 73 ------SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 D + F + G G I+ +DGI S F + L Sbjct: 144 KPVVVFGDHTRQVKFIDFDFIIGADGTK----ILNPYDGIDSQFFYLHISFFDLSNKGYA 199 Query: 127 WLL---------------SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 ++ + +E + + + A L Sbjct: 200 RHYSLLKLKAFCLPPLEEQKEIVRVVETLFKEVEQLEQLTVKRIGLKEDFVTSALHQLTT 259 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE-------- 223 + E + +K+ ++ ++ V L + ++ + E Sbjct: 260 KNANQEWKFLQEHFKSFFNETTNIKKLRETVLQLAVQGKLTANWRVNNPDTEDASQLLKQ 319 Query: 224 ---------------------------WVGLVPDHWEVKPFFALVTELNRK-------NT 249 VP+ W F L T +N + Sbjct: 320 IQEEKAQLIAAKKIKKEKVLPPITKDEIPYEVPEGWVWVNFGDLATFINGDRGKNYPNKS 379 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-----VDPGEIVFRFIDLQNDKRS 304 + ++ + ++ G+I + + + + I + G++V+ K + Sbjct: 380 EYVDEGVAWINTGHINPDGTLSESKMNYITEDKFDILRGGKIQDGDLVYCLRGATFGKTA 439 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVK 363 S + I +S + S ++ + S + + + +L VK Sbjct: 440 YVSP-FKKGAIASSLMIIRAYIQDSSGFIYRFLISPEGKRQLLRFDNGSAQPNLSANKVK 498 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +PP++EQ I +N D L E+++QS + S + Sbjct: 499 LYAFPLPPLEEQKAIVAKVNALMELCDKLEEEVQQSQAYSTQLMQSCLREVF 550 >gi|253569685|ref|ZP_04847094.1| type I R/M system specificity subunit [Bacteroides sp. 1_1_6] gi|251840066|gb|EES68148.1| type I R/M system specificity subunit [Bacteroides sp. 1_1_6] Length = 402 Score = 82.9 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 58/356 (16%), Positives = 112/356 (31%), Gaps = 31/356 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTS 76 W+ I + G T ++ I + ++ ++ + + S Sbjct: 25 EWETKSINDLADVIGGGTPDTTVKSYWDGGIQWFTPSEIGKNKFVDASLRTITEDGLNNS 84 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + IL ++ + F L K + + L + Sbjct: 85 SAKLLPPNTILLSSRATIGECSLSLRECA-TNQGFQSLVSKKCN--VDFLYYLIQTKKKD 141 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G+T + I + +P EQ I + ID I + + IE LK Sbjct: 142 LIRKSCGSTFLEISANEVRKIQVSVPSDVEQQKIAGLL----SLIDKRIATQNKIIEDLK 197 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + K A+V ++ K++D G V+ ++ Sbjct: 198 KLKSAIVEMLLCNQNGESFKLRDVG----------CFVRGLTYANEDVTENKAATTVIRA 247 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 +L+YGN + K E + P T QI+ G+IV + + S G Sbjct: 248 NNLNYGNNVDKDEVVYVNKTP---TTSQILRKGDIVICMANGSSSLVGKNSYYPFNDGQS 304 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM---GSGLRQSLKFEDVKRLPVLV 369 T + WLM+S ++ Y G+G +L +D+ + + Sbjct: 305 TIGAFCGIYRTSYPF-VKWLMQSQRYKRLVYQSLQGGNGAIANLNGDDILNMSFPL 359 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 51/152 (33%), Gaps = 5/152 (3%) Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + I K + + L+ + + + I L + + + Sbjct: 57 WFTPSEIGKNKFVDASLRTITEDGLNNSSAKLLPPNTILLSSRATIGECSLSLRECATNQ 116 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + ++ + +L +L+++ + + +V+++ V VP EQ I Sbjct: 117 GFQSLVSKKCNVDFLYYLIQTKK-KDLIRKSCGSTFLEISANEVRKIQVSVPSDVEQQKI 175 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + ID + + I LK+ +S+ Sbjct: 176 A----GLLSLIDKRIATQNKIIEDLKKLKSAI 203 >gi|260592890|ref|ZP_05858348.1| type I restriction-modification system, S subunit [Prevotella veroralis F0319] gi|260535179|gb|EEX17796.1| type I restriction-modification system, S subunit [Prevotella veroralis F0319] Length = 506 Score = 82.9 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 57/432 (13%), Positives = 120/432 (27%), Gaps = 71/432 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W++ + I + + + + K G S D Sbjct: 85 EIPQGWELARFGSV-------MYNRDSERIPLSVAE-RNKLTKIYDYYGASGVIDKVDKY 136 Query: 80 IFAKGQILYGKLGPYLRK-----AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 +F K +L G+ G L A IA + VL D + + ++ + Sbjct: 137 LFNKDLLLIGEDGANLINRSKPIAYIATGKYWVNNHAHVL---DCIDSIFMQYICLYINS 193 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI-- 192 + G + + + +I + +PP EQ I +K+ + R Sbjct: 194 ISLVDYVTGTAQPKMNQEKMNSILLVLPPHNEQKRILQKVDKIQPLYVRYEKNKSRLEAL 253 Query: 193 --ELLKEKKQALVSYIVTKGLNPDVK------------------------MKDSGI---- 222 L +++++ + L P +K I Sbjct: 254 TKTLYTNLRKSILQEAMQGKLIPQDPNDEPASVLLQRIREERLKLVKDGKLKKKDIVDSL 313 Query: 223 ----------------------EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 E +PD W + + ++ + Sbjct: 314 IFKGDDNKYYEQVGKSITEITEEIPFSIPDSWTWSRLSGVAKIIMGQSPDGNDVFEAEKE 373 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI-ITSA 319 K S +P I L + + +++R I I Sbjct: 374 DNAYEFHQGKIYFTEKYISPSGKWCKNPPRIANIGSLLVCIRAPIGDVNIVQRQIAIGRG 433 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 A+ + T + + +++ + +K L + +PP+ EQ I+ Sbjct: 434 LAAIIGYAKIKTDFLYYWILAHKKNLIEKGTGSTFKAITLDVLKDLIIPIPPLAEQKRIS 493 Query: 380 NVINVETARIDV 391 + I + +I+ Sbjct: 494 SRIELLYNKIEN 505 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 67/207 (32%), Gaps = 17/207 (8%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 E +P WE+ F +++ + E LS++ N + K+ Sbjct: 77 CIDDEIPFEIPQGWELARFGSVMYNRDS------ERIPLSVAERNKLTKIYDYYGASGVI 130 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + + ++ RS + + + A++ I Y+ Sbjct: 131 DKVDKYLFNKDLLLIGEDGANLINRSKPIAYIATGKYWVNNHAHVLDCIDSIFMQYICLY 190 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + S L + + E + + +++PP EQ I ++ + V EK Sbjct: 191 INSISLVDYV---TGTAQPKMNQEKMNSILLVLPPHNEQKRILQKVDKI-QPLYVRYEKN 246 Query: 397 EQSIVLL-----KERRSSFIAAAVTGQ 418 + + L R S + A+ G+ Sbjct: 247 KSRLEALTKTLYTNLRKSILQEAMQGK 273 >gi|207092914|ref|ZP_03240701.1| putative type I restriction enzyme (specificity subunit) [Helicobacter pylori HPKX_438_AG0C1] Length = 307 Score = 82.9 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 45/316 (14%), Positives = 104/316 (32%), Gaps = 20/316 (6%) Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 V+ + + L + +E+ +G+ ++ L EQ+ I Sbjct: 3 FVVFENPKIDLNYLYYFLCYIEKEWLESGQQGSQVNLNVDLIKNKEVFYPKDLNEQIAIA 62 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT-KGLNPDVKMKDSGIEWVGLVPD 230 + + +L ++ + K L+S KG N + G +G+ Sbjct: 63 NILSDVDHYLYSLDALILKKESIKKALSFELLSQRKRLKGFNQAWQKVKLGD--IGITIS 120 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 K + + I L+ N + + +K E + Sbjct: 121 GLVGKTKQDFINGNAK--------YITFLNVLNNVIIDTSMLENVKIYPNEKQNSFKKYD 172 Query: 291 IVFRFIDLQNDKRSLRSA--QVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVF 346 + F + + + +++ + S + +D +L++L+ S K F Sbjct: 173 LFFNTSSETPKEVGMCAVLLDDIDQVFLNSFCFGFRIFDKAVDGLFLSYLINSEIGRKAF 232 Query: 347 YAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + G R +L + +++PP+ EQ I N+++ + I L K Q Sbjct: 233 ENLAQGSTRYNLSKSGFNNICLILPPLNEQIAIANILSALDSEIISLKNKKRQ----FDN 288 Query: 406 RRSSFIAAAVTGQIDL 421 + + ++ +I + Sbjct: 289 IKKALNHDLMSAKIRV 304 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 72/193 (37%), Gaps = 13/193 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDII-----YIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ V + +G ++ +D I YI +V + N + + Sbjct: 107 WQKVKLGDIGITISGLVGKTKQDFINGNAKYITFLNVLNNVIIDTSMLENVKIYPNEKQN 166 Query: 80 IFAKGQILYGKLGPYLRKAIIA-------DFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 F K + + ++ + D + S F + L +L++ + Sbjct: 167 SFKKYDLFFNTSSETPKEVGMCAVLLDDIDQVFLNSFCFGFRIFDKAVDGLFLSYLINSE 226 Query: 133 V-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + E + +G+T + G NI + +PPL EQ+ I + A I +L ++ +F Sbjct: 227 IGRKAFENLAQGSTRYNLSKSGFNNICLILPPLNEQIAIANILSALDSEIISLKNKKRQF 286 Query: 192 IELLKEKKQALVS 204 + K L+S Sbjct: 287 DNIKKALNHDLMS 299 >gi|260906088|ref|ZP_05914410.1| restriction endonuclease S subunits-like protein [Brevibacterium linens BL2] Length = 383 Score = 82.9 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 55/356 (15%), Positives = 122/356 (34%), Gaps = 36/356 (10%) Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ + G +L K+ P++R++ + + S++++V + + P+ L +L+S Sbjct: 50 SSKQVVQPGDVLISKIVPHIRRSAVIPKLAGRRQLASSEWIVFRNQSFDPKYLVHFLMSD 109 Query: 132 DVTQRIEAICEGATMSHAD--WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + G S + + +I P+PPL EQ I + Sbjct: 110 VFHHQFLNTVAGVGGSLLRARPQYVRSIMAPLPPLDEQRRIAAILDKADAIRQKRRQATT 169 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 L + Q + ++ +S +G V K ++ KN Sbjct: 170 HLETLAQSIFQTMF----------GSRLAESSSTTIGDVAQLQGGKSLSSIDDSAATKNR 219 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRS 307 L S++ S ++ K P+ Y G+++ + + Sbjct: 220 VLKISSVTSGTFKPWESK-------PVPDDYSPPLSHFSHKGDLLISRANTSELVGASAL 272 Query: 308 AQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMG---SGLRQSLKFEDV 362 V +G++ + I+ Y L+R+ + M G +++ + Sbjct: 273 VHVEPQGLLLPDKIWRFDWLIETQPEYFFHLLRTKAIRGRISNMATGSGGSMKNISKPKL 332 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + EQ + + ++DVL K ++S + +S + A G+ Sbjct: 333 LSVQIPRIESNEQREFV----KQVRKVDVLRAKFDESNA--DQLFASLQSRAFRGE 382 >gi|260655882|ref|ZP_05861351.1| putative type-I specificity determinant subunit [Jonquetella anthropi E3_33 E1] gi|260629498|gb|EEX47692.1| putative type-I specificity determinant subunit [Jonquetella anthropi E3_33 E1] Length = 415 Score = 82.9 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 56/401 (13%), Positives = 121/401 (30%), Gaps = 25/401 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ I + G G I + + +G +D + + Sbjct: 18 EWEERKINDVANFSKGNGYSKGDLKGSGTPIILYGRLYTKYQ--FEIEGVDTFADIRSGA 75 Query: 80 IFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDV 133 +F+KG + R A I + +L P + P + + + Sbjct: 76 VFSKGNEVIVPASGETAEDIARAAAILKSGILLGGDLNILHPFTFMNPSFVALVISNGPP 135 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + +G ++ H I + + P AEQ +KI ++D LI R Sbjct: 136 QKELARKAQGKSIVHIHNSDIQEVTVRYPDRAEQ----DKISRTFSKLDHLIALHERKYS 191 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L K+ ++ + K ++ +G EV ++ TK Sbjct: 192 KLMNVKKFMLEKMFPKDSAKVPALRFAGFSGEWEKRKLGEVMKVTSVKRIHQSDWTKEGV 251 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + + + + G++ + + V Sbjct: 252 RFLRARDIVAASKNETINDCLFISKEKYEECSLVSGKVSINDLLVTGVGTIGVPFLVRNL 311 Query: 314 GII----TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVL 368 I + ID +L + S + G + E ++ P++ Sbjct: 312 APIYFKDGNIIWFKNEGKIDGEFLLYSFSSSSIQNFIATTSGLGTVGTYTIETGEKTPII 371 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +P I+E+ I + + +D L+ Q + L+ + S Sbjct: 372 LPSIQEEKKIGQFL----SYLDHLLSLHRQELERLQNVKKS 408 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 61/186 (32%), Gaps = 8/186 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQI 285 + ++ L S + YG + K + G+ + + Sbjct: 17 DEWEERKINDVANFSKGNGYSKGDLKGSGTPIILYGRLYTKYQFEIEGVDTFADIRSGAV 76 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGII--TSAYMAVKPHGIDSTYLAWLMRSYDLC 343 G V + + R+A +++ GI+ + ++ +++A ++ + Sbjct: 77 FSKGNEVIVPASGETAEDIARAAAILKSGILLGGDLNILHPFTFMNPSFVALVISNGPPQ 136 Query: 344 KVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K G + D++ + V P EQ I + +++D L+ E+ Sbjct: 137 KELARKAQGKSIVHIHNSDIQEVTVRYPDRAEQDKI----SRTFSKLDHLIALHERKYSK 192 Query: 403 LKERRS 408 L + Sbjct: 193 LMNVKK 198 >gi|304310388|ref|YP_003809986.1| Type I restriction-modification system specificity subunit [gamma proteobacterium HdN1] gi|301796121|emb|CBL44327.1| Type I restriction-modification system specificity subunit [gamma proteobacterium HdN1] Length = 403 Score = 82.9 bits (203), Expect = 1e-13, Method: Composition-based stats. Identities = 42/406 (10%), Positives = 119/406 (29%), Gaps = 23/406 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W +P+ + ++ GR+ ++ +I DV+ + ++ Sbjct: 2 SWPKLPLDQLGYVSRGRSRHRPRNDPSLYGGSYPFIQTGDVKHANFRISDHTATYSEAGL 61 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ K + + + + ++ + + + + + Sbjct: 62 AQSRLWPKDTLCIT-IAANIADTALLGYEACFPDSIIGFIADEEKADPRFVKYYFDIIQR 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ + +GAT + + + + + PP+ Q + E + A I T + Sbjct: 121 ELQMVSQGATQDNLSQEKLLSFGIACPPVEVQRKVAEVLSAYDDLIATNQRRIALLEDAA 180 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + + ++ G + V + V K L E+ Sbjct: 181 RRLYREWFVHLRFPGHESVAVK-----DGVPEGWCKRSMTSVADFVNGFAFKPEHLGEAG 235 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + + + ++ + G+++F + L + + Sbjct: 236 LPVVKIPELRSGITSKTPYNLGHIVPQRNHITTGDVLFSWSATL-----LVNEWGEGPAL 290 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + V P L + ++ Q ++ + +LVP Sbjct: 291 LNQHLFKVIPRNELHKRLVRFAVEAAIPELIGHAVGATMQHIRRSALDNHLMLVPDDTTS 350 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + D ++ Q+ L K R + ++GQ+D+ Sbjct: 351 VAFAAQADPMM---DAVLNLTAQNRELTKA-RDLLLPKLMSGQLDV 392 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 56/188 (29%), Gaps = 9/188 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ W + G + + + + ++ SG P + Sbjct: 205 VPEGWCKRSMTSVADFVNGFAFKPEHLGEAGLPVVKIPELRSGITSKTPYNLGHI---VP 261 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + G +L+ L + + + + P++ L + L + + + + Sbjct: 262 QRNHITTGDVLFSWS-ATLLVNEWGEGPALLNQHLFKVIPRNELHKRLVRFAVEAAIPEL 320 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I GATM H + N M +P V + + L + + Sbjct: 321 I-GHAVGATMQHIRRSALDNHLMLVPDDTTSVAFAAQADPMMDAVLNLTAQNRELTKARD 379 Query: 197 EKKQALVS 204 L+S Sbjct: 380 LLLPKLMS 387 >gi|77920514|ref|YP_358329.1| restriction endonuclease S subunit [Pelobacter carbinolicus DSM 2380] gi|77546597|gb|ABA90159.1| restriction endonuclease S subunit [Pelobacter carbinolicus DSM 2380] Length = 394 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 75/401 (18%), Positives = 134/401 (33%), Gaps = 31/401 (7%) Query: 24 HWKVVPIKRFTKLNT--GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W++V K + R E+ +GLE ++ + NS TS F Sbjct: 9 GWEMVKFGEVVKNSNLVERDPEANAIERIVGLEHIDPENLHI--RRWNSVADGTSFTRKF 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIE 138 GQ L+GK Y RK A+F+GICS L +PK+ L ELL S Sbjct: 67 VPGQTLFGKRRAYQRKVAFAEFEGICSGDILTFEPKNAKILLAELLPFICQSDAFFDYAL 126 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ W+ + + P+PPL EQ I E + A ++ + ++ Sbjct: 127 DTSVGSLSPRTSWRALKDFEFPLPPLDEQKRIAEILWAADEAVEKYASLTSDLNAYVETL 186 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + ++ + S +G PF +L+ + ++ + Sbjct: 187 IENNITSTNVTSV--LGDYCPSDGIKIG---------PFGSLLHAEDYQSEGVPVVMPAD 235 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + G I ++ R K + Y++ + + R DL KR+L A T Sbjct: 236 IEKGVIQEEKVARISEEKALELQNYRLSENDILFPRRGDLT--KRALVLAHQENWLCGTG 293 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFD 377 GI+ + W + S + +L +K++P +P Sbjct: 294 TIRVRLKEGINPRAVFWAVTSSSTNRWLDRFSVGTTMPNLNATTIKKIPFHLPE------ 347 Query: 378 ITNVINVETARIDVLVEKIEQSI---VLLKERRSSFIAAAV 415 + ++ ++ L + I V Sbjct: 348 -GSKAGEFLGLLERTKSLHANAVVHHQKLVALKKHLIGNLV 387 >gi|159038424|ref|YP_001537677.1| restriction endonuclease S subunits-like protein [Salinispora arenicola CNS-205] gi|157917259|gb|ABV98686.1| Restriction endonuclease S subunits-like protein [Salinispora arenicola CNS-205] Length = 412 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 55/413 (13%), Positives = 126/413 (30%), Gaps = 24/413 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W V + +++ G+ +S +++ Y+G V+ G Sbjct: 5 WPVSTVGEQFEVHLGKMLDSARNVGFPKPYVGNRAVQWGWIDLSAVGVAPLTQSDIRRFR 64 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICST--QFLVLQPKDVLP-ELLQGWLLSIDVTQRI 137 G +L + G R AI D C ++PK+ L+ L Sbjct: 65 LRNGDLLVCEGGEIGRGAIWRDQLSECYYQKALHRMRPKNGYDVRLMLALLEYWSTGGVF 124 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +++H +P+P+P AEQ I E I I L + + + Sbjct: 125 PNYVTQTSIAHLPRDKFIEMPLPLPSAAEQARIGEVIQDVNDLIHALRRMIAKKQAIRQG 184 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIE-WVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 +Q L++ G S E +G + + + + Sbjct: 185 LRQQLLT-----GRTRLPGYSGSWREVSLGRYVSYVNTVALSRAQLDGESPV-RYVHYGD 238 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI--DLQNDKRSLRSAQVMERG 314 + ++ + G++VF + D +S+ V + G Sbjct: 239 IHARDSPMLDAAREALPRASSTLLRNAGRLKVGDLVFADVSEDPDGVGKSVEVTSVPDVG 298 Query: 315 IIT--SAYMAVKPHGIDSTYLAWLMRS-YDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVP 370 ++ A + + ++ + + + G + + + + + +P Sbjct: 299 VVPGLHTIAARFEKAVLADGFKAYLQFVPSFRETLHRLVVGTKVLATTRSLISSITLTLP 358 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + EQ I +V+ + + + ++ + + + G+ L G Sbjct: 359 NVDEQRAIASVLTDADRE----IAVLRVRLAKARDVKQGMMQELLAGRTRLPG 407 >gi|332366398|gb|EGJ44149.1| hypothetical protein HMPREF9389_0052 [Streptococcus sanguinis SK355] Length = 408 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 46/389 (11%), Positives = 122/389 (31%), Gaps = 25/389 (6%) Query: 47 DIIYIGLED-VESGTGKYLPKDGNSRQSDTSTVS----IFAKGQILYGKLGPYLRKAIIA 101 + + ++ +E +GK + + S + + + +L +G ++ Sbjct: 30 GVPFYRSKEVIEISSGKNISEQLFISSEKYSEIKSKFPVPQENDVLITAVGTIGEILVVK 89 Query: 102 DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPI 161 D + L+ + +L + + + + + Sbjct: 90 DPNFYFKDGNLIWLRNINFDIIDIDYLYYFFKSDLFQKTIRYNNIGAVQKALTIDFLKTV 149 Query: 162 PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV---KMK 218 + + K+I+ ID I + + L+ + L Y + PD K Sbjct: 150 KITLPSLDNQRKLISVLKSIDKKIQINSQINQELEAMAKTLYDYWFVQFDFPDQNGKPYK 209 Query: 219 DSG------IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 SG E +P+ W V+ + + K L ++ + Sbjct: 210 SSGGKMVYHPELKREIPEGWGVEKLGDITICHDSKRVPLSSNDRELVKGEIPYYGATGIM 269 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + ++ ++ + + + + + A++ Sbjct: 270 DYVNDYIFDGDYVLMAED----GSVMTEKGTPILQRISGKNWVNNHAHVLEPIKNHSCKL 325 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 L L++ + K+ ++ + E++ ++ V P+K F+I + V + L Sbjct: 326 LMMLLKDVSVMKI---KTGSIQMKINQENMNKIVVPAIPLKLLFEINQKLEVIEKQQLNL 382 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +E+ +Q L + R + + GQ+ + Sbjct: 383 IEENKQ----LTQLRDWLLPMLMNGQVKV 407 >gi|261885495|ref|ZP_06009534.1| restriction and modification enzyme CjeI [Campylobacter fetus subsp. venerealis str. Azul-94] Length = 727 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 56/395 (14%), Positives = 118/395 (29%), Gaps = 36/395 (9%) Query: 26 KVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K+V + ++ TG T + G D + D+ +G + + + Sbjct: 343 KLVKLGEICEILTGSTPSTQKKEFYGSDFPFYRPADLING-RNVNSSEVMVSKLGYESQR 401 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K IL +G R +I +L + + E L + Q + Sbjct: 402 ALPKKSILVSCIGTIGRVGMIEKSGIFNQQINALLPNNNYISEFLFYLFDTNFFKQLLIQ 461 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ + NI +P+PPL Q I ++ + E+ + I + E+ Sbjct: 462 QTHNTTVPIINKSKFSNIKIPLPPLEIQEKIAKEC--------EEVEEKFKTIRMSIEEY 513 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++L+ I+ KG + DS +E G + + Sbjct: 514 KSLIKEILIKG----CVITDSRLEIGGGYEQDLAQIVNDLPSPQNYGLSEWESVKLTNKD 569 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 I +++ +++ + + + +P + + + S+ + Sbjct: 570 FILKIGKRVLDKDLTQDGINVFSANVKEPFGKINKDLIKDFSLDSVLWGIDGDWMTG--- 626 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFY---------AMGSGLRQSLKFEDVKRLPVLVP 370 T ++RS G + E + L + +P Sbjct: 627 -FVKANEPFYPTDHCGVLRSKSHKAKILEFALFEVGAKFGFSRQNRASIERISNLTLSLP 685 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 P++ Q I I I L + L+ Sbjct: 686 PLEAQEKIVKAIEFCEGEISNL----NNELKTLEN 716 >gi|159027726|emb|CAO89595.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 1193 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 47/403 (11%), Positives = 107/403 (26%), Gaps = 42/403 (10%) Query: 25 WKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W +K ++ G + E + ++ + D + + Sbjct: 761 WNTSKLKDLFNISRGGSPRPINNYLTEDDNGVNWLKIGDTKEVDKYIYKTRQKIKPEGAK 820 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ-FLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ + + I+ I + L L + + Q Sbjct: 821 FSRKVIEDDLILSNSMSFGKPFIMKITAYIHDGWLLFRSITNQASKDYLYIVLGTNLIYQ 880 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF---- 191 + G + + + + +I +PIPP Q I K+ E + Sbjct: 881 LFKKQTIGGVVENLNIDLVKHIKVPIPPKEIQDKIVAKMDDAYAAKKQKELEAQQLLESI 940 Query: 192 -----------------IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 + +S + +P + + Sbjct: 941 DDYLLGELGIELPEPEENTIKNRIFIRNLSEVSGDRFDPLYYFSNIYKSLEKSAFKLDYI 1000 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY--------ETYQIV 286 A + + + + R Y I+ Sbjct: 1001 SRITAYMKTGFASGKQDQSKDDQGIIQIRPTNINNAREFVFNKNVYIPHFELLKRKEDIL 1060 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCK 344 EI+F + Q + ++ V ID YLA + Y + Sbjct: 1061 QKDEILFNNTNSQELVGKSILFNLEGFYFCSNHITRVGVKKGKIDPQYLAHIFNLYQHQQ 1120 Query: 345 VFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 VF+ + + + + E + +L + +PP+++Q +I+ IN Sbjct: 1121 VFFKICTNWNNQSGVNVEVLGQLKIPLPPLEKQIEISEHINAI 1163 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 68/220 (30%), Gaps = 14/220 (6%) Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 R + + ++ + G K+KD G P K Sbjct: 737 WGRLDPHFHKIEFKMIEQQIENGKWNTSKLKDLFNISRGGSPRPINNYLTEDDNGVNWLK 796 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 E + + + +KPE + + V +++ ++ Sbjct: 797 IGDTKEVD----------KYIYKTRQKIKPEGAKFSRKVIEDDLILSNSMSFGKPFIMKI 846 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLP 366 + G + + YL ++ + + ++F G+ ++L + VK + Sbjct: 847 TAYIHDGWL---LFRSITNQASKDYLYIVLGTNLIYQLFKKQTIGGVVENLNIDLVKHIK 903 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 V +PP + Q I ++ A + +Q + + + Sbjct: 904 VPIPPKEIQDKIVAKMDDAYAAKKQKELEAQQLLESIDDY 943 >gi|317014950|gb|ADU82386.1| typeI R-M system specificity subunit [Helicobacter pylori Gambia94/24] Length = 207 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 65/198 (32%), Gaps = 11/198 (5%) Query: 229 PDHWEVKPFFA--LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQI 285 P W+ + + + + + S NI+ + + + Sbjct: 13 PKAWQKVRLGDIAHIFDGTHQTPQYTHYGVAFFSVENIVSDKPVKFISQQDYLTATNQNR 72 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + +I+ I S I + + + +S YL + ++S K Sbjct: 73 PEYNDILLTRIGTIG--VSKVVNWNYPFSIYVTLAVIKQSKYFNSYYLHYFIQSNFFQKE 130 Query: 346 FYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + ++K+ V++PP+ EQ I NV++ I L K Q Sbjct: 131 LKNNSLLQAIPCKINMNELKKCEVILPPLNEQIAIANVLSDVDNEIISLKNKKRQ----F 186 Query: 404 KERRSSFIAAAVTGQIDL 421 + + + ++ +I + Sbjct: 187 ENIKKALNHDLMSAKIRV 204 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 63/190 (33%), Gaps = 8/190 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +PK W+ V + + G + + + +E++ S K + + Sbjct: 12 LPKAWQKVRLGDIAHIFDGTHQTPQYTHYGVAFFSVENIVSD--KPVKFISQQDYLTATN 69 Query: 78 VSIFAKGQILYGKLGPYL-RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + IL ++G K + ++ V++ + + + Q+ Sbjct: 70 QNRPEYNDILLTRIGTIGVSKVVNWNYPFSIYVTLAVIKQSKYFNSYYLHYFIQSNFFQK 129 Query: 137 IEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 A + + + +PPL EQ+ I + I +L ++ +F + Sbjct: 130 ELKNNSLLQAIPCKINMNELKKCEVILPPLNEQIAIANVLSDVDNEIISLKNKKRQFENI 189 Query: 195 LKEKKQALVS 204 K L+S Sbjct: 190 KKALNHDLMS 199 >gi|303326056|ref|ZP_07356499.1| type I restriction-modification system, S subunit [Desulfovibrio sp. 3_1_syn3] gi|302863972|gb|EFL86903.1| type I restriction-modification system, S subunit [Desulfovibrio sp. 3_1_syn3] Length = 302 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 47/294 (15%), Positives = 91/294 (30%), Gaps = 14/294 (4%) Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +V Q I E I IP L EQ + + + +D I Sbjct: 11 NIKFMYEVLQTISYCTETHERHWISKFAPMPIK--IPQLREQQKVADCL----SSLDQRI 64 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 ++ LK K L+ + ++ G L Sbjct: 65 NAETEKLDALKAHKNGLLKQLFPLEGETLPALRFPEFRDAGEWKKADFGNIAKFLSGGTP 124 Query: 246 RKNTK-LIESNILSLSYGNIIQKLETRN--MGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 K+ +I +S ++ ++ K +I G ++ K Sbjct: 125 SKDVCDYWGGDIPWISASSMHNTKIEKSDCNITKLAVSNGARIAPKGTLLLLVRGSMLHK 184 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFED 361 R L ++ V I YL + + + + + + +G+ L +D Sbjct: 185 RILLGISEIDVSFNQDVKALVLNDDITELYLMYFLMASESKLLATVVQTGIGAGKLDTDD 244 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + P+++P EQ I+N + + +D L+ Q I LLK+ + + Sbjct: 245 LNNFPIMMPSPIEQQRISNCL----SSLDELIAAQTQKINLLKDHKKGLMQQLF 294 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 73/214 (34%), Gaps = 24/214 (11%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESG 59 +P+++D+G WK K +G T DI +I + + Sbjct: 97 RFPEFRDAG---------EWKKADFGNIAKFLSGGTPSKDVCDYWGGDIPWISASSMHNT 147 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK---AIIADFDGICSTQFLVLQP 116 + + + I KG +L G L K I++ D + L Sbjct: 148 KIEKSDCNITKLAVS-NGARIAPKGTLLLLVRGSMLHKRILLGISEIDVSFNQDVKALVL 206 Query: 117 KDVLPELLQGWL-LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 D + EL + ++ + + G D + N P+ +P EQ I + Sbjct: 207 NDDITELYLMYFLMASESKLLATVVQTGIGAGKLDTDDLNNFPIMMPSPIEQQRISNCLS 266 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + I + + I LLK+ K+ L+ + + Sbjct: 267 SLDEL----IAAQTQKINLLKDHKKGLMQQLFPR 296 >gi|222036088|emb|CAP78833.1| (Q83II4) hypothetical protein [Escherichia coli LF82] gi|312948974|gb|ADR29801.1| type I restriction enzyme EcoEI specificity protein [Escherichia coli O83:H1 str. NRG 857C] Length = 568 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 60/482 (12%), Positives = 131/482 (27%), Gaps = 96/482 (19%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 +K K P+ S + + +P+ W+ V ++ G K S + Sbjct: 83 IKKQKPLPEI--SEEEKLFELPEGWEWVRFGNIYEMEYGNNLPQEK----------RSNS 130 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR-KAIIADFDGICSTQFLVLQPKDV 119 G+Y N + I I+ G+ G + + + P + Sbjct: 131 GEYNVYGSNGVVGTHNEACI-KSPCIIIGRKGSAGALNLSNQPACWVTDVAYSTIPPIAM 189 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADW---------------KGIGNIPMPIPPL 164 + E + ++ + + + I G + A + + L Sbjct: 190 VLEFVFIQFHTLGLDKLGKGIKPGLNRNDAYSLVIAIPPRSEQKAIVSKVNELMSLCDQL 249 Query: 165 AEQV---------------------LIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +Q E++ RI + KQ ++ Sbjct: 250 EQQSLTSLDAHQQLVETLLGTLADSQNAEELAENWARISEHFDTLFTTEASVDALKQTIL 309 Query: 204 SYIVTKGLNPDVK-------------------------------MKDSGIEWVGLVPDHW 232 V L P + S E +P+ W Sbjct: 310 QLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLLPISDEEKPFELPNGW 369 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI------- 285 E L+ ++ + S + +++ +++ + + Sbjct: 370 EWCRLGELIDSIDAGWSPACSSEPAAPGEWGVLKTTAVQSLEYREYENKALPKNKAPRPQ 429 Query: 286 --VDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSY 340 V G+I+ +N E +I+ + I Y++ + Sbjct: 430 LEVKAGDILITRAGPKNRVGISCLVENTRENLMISDKIIRFHLISEDISEKYISLCLNYG 489 Query: 341 DLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 SG+ + ++ + +K P+ +P EQ IT+ IN L +I+ Sbjct: 490 FTSTYLENSKSGMAESQMNISQDILKMAPIAIPTTHEQLKITDKINEIMDYFITLKSQIQ 549 Query: 398 QS 399 + Sbjct: 550 SA 551 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 62/192 (32%), Gaps = 16/192 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E + +P+ WE F + N + + + + Sbjct: 93 SEEEKLFELPEGWEWVRFGNIYEMEYGNNLPQEKRS--------NSGEYNVYGSNGVVGT 144 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + I P I+ R +L + + AY + P + ++ + Sbjct: 145 HNEACIKSPCIIIGRKGSA----GALNLSNQPACWVTDVAYSTIPPIAMVLEFVFIQFHT 200 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +G G++ L D L + +PP EQ I + +N + D L ++ S Sbjct: 201 LG----LDKLGKGIKPGLNRNDAYSLVIAIPPRSEQKAIVSKVNELMSLCDQLEQQSLTS 256 Query: 400 IVLLKERRSSFI 411 + ++ + + Sbjct: 257 LDAHQQLVETLL 268 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 50/205 (24%), Gaps = 16/205 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQ 72 +P W+ + S + + V+S + + Sbjct: 364 ELPNGWEWCRLGELIDSIDAGWSPACSSEPAAPGEWGVLKTTAVQSLEYREYENKALPKN 423 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQGW 127 G IL + GP R I + + S + + Sbjct: 424 KAPRPQLEVKAGDILITRAGPKNRVGISCLVENTRENLMISDKIIRFHLISEDISEKYIS 483 Query: 128 LLSIDVTQRIE----AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L + + + P+ IP EQ+ I +KI T Sbjct: 484 LCLNYGFTSTYLENSKSGMAESQMNISQDILKMAPIAIPTTHEQLKITDKINEIMDYFIT 543 Query: 184 LITERIRFIELLKEKKQALVSYIVT 208 L ++ + AL + + Sbjct: 544 LKSQIQSAQQTQLHLADALTNAAIN 568 >gi|258546307|ref|ZP_05706541.1| restriction modification system DNA specificity domain protein [Cardiobacterium hominis ATCC 15826] gi|258518451|gb|EEV87310.1| restriction modification system DNA specificity domain protein [Cardiobacterium hominis ATCC 15826] Length = 391 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 63/414 (15%), Positives = 118/414 (28%), Gaps = 52/414 (12%) Query: 26 KVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 K+ + G++ S DI G + + KY + Sbjct: 6 KIKFLSDLIDFKNGKSIKPSSGDIPIYGGNGILGYSEKYNYNNI---------------- 49 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ G++G Y S + + K +L+ + G+ Sbjct: 50 -LIIGRVGAYCGSIHYHKEKCWVSDNAIAGEVKSDYSIDYLYYLMKSL---NLNDRQVGS 105 Query: 145 TMSHADWKGIGNIPMPIPPL-AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + NI + I Q I + +D I + L+E + L Sbjct: 106 SQPLLTQGVLNNISVKIYESSQTQQSIAAVL----SALDKKIALNKQINARLEEMAKTLY 161 Query: 204 SYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELNRKN------ 248 Y + PD K SG E V +P WEVK + + + Sbjct: 162 DYWFVQFDFPDANGKPYKSSGGEMVFDETLKREIPKGWEVKSLGEIASTSSGGTPTSTIQ 221 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKP---ESYETYQIVDPGEIVFRFIDLQNDKRSL 305 NI ++ G + + ++V I+ K SL Sbjct: 222 EYYKGGNIPWINSGELNNNFIVHTDNFITQTGMDNSSAKLVSEKSILLAMYGATAGKTSL 281 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 S + I S P ++ + R +L + +KRL Sbjct: 282 ISFKTTTNQAICSIL----PKDMNHRVYIKSYLDNMYLYLVQLSSGSARDNLSQDKIKRL 337 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +++P I + + T +E + L + R + + GQ+ Sbjct: 338 HLVIPESG----ILEIFSKVTEDFYKKIETNLKQSHHLTQLRDFLLPMLMNGQV 387 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 71/212 (33%), Gaps = 14/212 (6%) Query: 10 YKDSGVQWI------GAIPKHWKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDV 56 YK SG + + IPK W+V + ++G T G +I +I ++ Sbjct: 178 YKSSGGEMVFDETLKREIPKGWEVKSLGEIASTSSGGTPTSTIQEYYKGGNIPWINSGEL 237 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + + D S+ + ++ IL G K + F + + P Sbjct: 238 NNNFIVHTDNFITQTGMDNSSAKLVSEKSILLAMYGATAGKTSLISFKTTTNQAICSILP 297 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 KD+ + ++ + + G+ + I + + IP + + Sbjct: 298 KDMNHR-VYIKSYLDNMYLYLVQLSSGSARDNLSQDKIKRLHLVIPESGILEIFSKVTED 356 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 +I+T + + +L L++ V Sbjct: 357 FYKKIETNLKQSHHLTQLRDFLLPMLMNGQVF 388 >gi|291296825|ref|YP_003508223.1| restriction modification system DNA specificity domain-containing protein [Meiothermus ruber DSM 1279] gi|290471784|gb|ADD29203.1| restriction modification system DNA specificity domain protein [Meiothermus ruber DSM 1279] Length = 419 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 49/430 (11%), Positives = 117/430 (27%), Gaps = 48/430 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + +L G Y+P +S +D ++ Sbjct: 4 EWKECALGEVIELKRGYDLPQQDRRP------------GYVPIVSSSGVTDYHAEAMVKG 51 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ G+ G + +T V K P + +L +D ++ Sbjct: 52 PGVVTGRYGTLGEVFYVEQDFWPLNTTLYVRDFKGNDPRFISYFLRGLDFFAYVDK---- 107 Query: 144 ATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 A + + + + +P + EQ I + +I+ + + ++ Sbjct: 108 AAVPGINRNHLHQARVIVPTDVGEQRAIAHILGTLDDKIELNRRMSETLEAMARALFKSW 167 Query: 203 VSYI-------------------VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 + L + E +G +P+ W VK L Sbjct: 168 FVDFDPVRAKMEGRWQRGQSLPGLPAHLYDLFPDRLVDSE-LGEIPEGWGVKSIGDLAEV 226 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR-------FI 296 + K + + + + + + +I D G + Sbjct: 227 VGGSTPKTECAEFWDGGTHHWVTPKDLSGLSMPVLLDTERKITDAGLAQISSGLLPRGTV 286 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 L + A + ++A+KP S ++ Sbjct: 287 LLSSRAPIGYLAIAEVPVAVNQGFIAMKPRQGVSNLFLLRWARAAHDEILSHANGSTFLE 346 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + + V+ PP I + + + + V + L R + + ++ Sbjct: 347 ISKASFRPIRVVTPPTP----IMDAFDQFSRPMYGKVVENALESRTLAALRDALLPKLIS 402 Query: 417 GQIDLRGESQ 426 G+I ++ + Sbjct: 403 GEIRVKDAER 412 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 61/203 (30%), Gaps = 15/203 (7%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKY- 63 DS +G IP+ W V I ++ G T ++ G ++ +D+ + Sbjct: 205 DSE---LGEIPEGWGVKSIGDLAEVVGGSTPKTECAEFWDGGTHHWVTPKDLSGLSMPVL 261 Query: 64 --LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + + + +G +L P IA+ + F+ ++P+ + Sbjct: 262 LDTERKITDAGLAQISSGLLPRGTVLLSSRAPI-GYLAIAEVPVAVNQGFIAMKPRQGVS 320 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L + I + G+T I + PP + ++ Sbjct: 321 N-LFLLRWARAAHDEILSHANGSTFLEISKASFRPIRVVTPPTPIMDAFDQFSRPMYGKV 379 Query: 182 DTLITERIRFIELLKEKKQALVS 204 E L L+S Sbjct: 380 VENALESRTLAALRDALLPKLIS 402 >gi|160939419|ref|ZP_02086769.1| hypothetical protein CLOBOL_04312 [Clostridium bolteae ATCC BAA-613] gi|158437629|gb|EDP15391.1| hypothetical protein CLOBOL_04312 [Clostridium bolteae ATCC BAA-613] Length = 430 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 54/417 (12%), Positives = 113/417 (27%), Gaps = 38/417 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS--- 76 W T + D++ I DV G L N+ S Sbjct: 25 WCSHKFSSVFSFLQNNTLSRAELDATGDVLDIHYGDVLIKYGSILDATDNTIPHIISGHE 84 Query: 77 --TVSIFAKGQILYG------KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GW 127 G I+ +G + I+ + +P + + Sbjct: 85 STNYDYLQDGDIIVADTAEDETVGKTIELLNISGRKIEAGLHTVPCRPLFPFASMYLGYY 144 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNI-PMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + +++ + +G + I +AEQ I + R Sbjct: 145 MNTPHYHKQLVPLMQGIKVLSISKGNISKTEISSPQTIAEQEKISRFLYLLDQRAAAQSK 204 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + L I +G + + F N Sbjct: 205 IIDALKKYKRGLSDTLFDRTAQSPSCK--------IVKLGDAFELLQNNTFSRDDLTTNP 256 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKP----ESYETYQIVDPGEIVFRFIDLQNDK 302 + + I + + YG + E +KP + + + G+IVF Sbjct: 257 SSVQNIHYGDVLVKYGAVTNISEYTPPYIKPTINLQKFVATSYLRDGDIVFADTAEDYSV 316 Query: 303 RSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLK 358 + S + YLA+ S + Y + G S+ Sbjct: 317 GKATEIAGANGLAVLSGLHTIPCRPLMKFHPMYLAYYFNSSLFRRQIYPLVQGTKVSSIS 376 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + V P +EQ I +++ +D+ + E+++ L R++ + Sbjct: 377 KGELVKTSVYAPTEREQRRIASMLY----LLDLRITFEEKTVNALTNTRTALLQQLF 429 >gi|238921300|ref|YP_002934815.1| hypothetical protein NT01EI_3443 [Edwardsiella ictaluri 93-146] gi|238870869|gb|ACR70580.1| conserved hypothetical protein [Edwardsiella ictaluri 93-146] Length = 323 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 51/304 (16%), Positives = 113/304 (37%), Gaps = 16/304 (5%) Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL-IREKIIAETVRIDTL 184 + L ++ QR A GAT++ K + N + +P ++ + I +K+ + I L Sbjct: 27 FYQLQSNLVQRQIAETLGATINQITNKDLSNFKIAVPRNKDEYIEISDKLASIDGLIIDL 86 Query: 185 ITERIRFIELLKEKKQALVS---YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 + + Q L++ + L D +K +G +P+ W ++ F L Sbjct: 87 KKIVNKKQAIKTATMQQLLTGKTRLPQFALREDGTVKGYKKSELGEIPEDWSIENFSTLA 146 Query: 242 T-ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQ 299 T R N + + L + +II + N + + + P +I+F + Sbjct: 147 TLRNERINPRTKDIECLCIELEHIISEYGQLNGFTETSGTSSIKNVFSPNDILFGKLRSY 206 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 K + + G+ ++ +K ++ +++ + Sbjct: 207 LKKYW----KATQSGVCSTEIWVLKTELHKAIPEFIFQTVKTDRFVQTASEAYGTHMPRA 262 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 ++ +K L V P I+EQ I +++ I L Q + ++ + + +TG Sbjct: 263 DWKIIKELQVATPSIEEQIAIATILSDMDKEIQTL----HQRLDKTRQLKQGMMQELLTG 318 Query: 418 QIDL 421 + L Sbjct: 319 KTRL 322 Score = 73.3 bits (178), Expect = 8e-11, Method: Composition-based stats. Identities = 54/199 (27%), Positives = 85/199 (42%), Gaps = 10/199 (5%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDI--IYIGLEDVESGTGKYLPKD 67 YK S +G IP+ W + L R + KDI + I LE + S G+ Sbjct: 125 YKKSE---LGEIPEDWSIENFSTLATLRNERINPRTKDIECLCIELEHIISEYGQL--NG 179 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQ 125 +S ++F+ IL+GKL YL+K A G+CST+ VL+ + +PE + Sbjct: 180 FTETSGTSSIKNVFSPNDILFGKLRSYLKKYWKATQSGVCSTEIWVLKTELHKAIPEFIF 239 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + + Q G M ADWK I + + P + EQ+ I + I TL Sbjct: 240 QTVKTDRFVQTASEAY-GTHMPRADWKIIKELQVATPSIEEQIAIATILSDMDKEIQTLH 298 Query: 186 TERIRFIELLKEKKQALVS 204 + +L + Q L++ Sbjct: 299 QRLDKTRQLKQGMMQELLT 317 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 14/93 (15%), Positives = 40/93 (43%), Gaps = 5/93 (5%) Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARI 389 Y+ + ++S + + + +D+ + VP E +I++ + A I Sbjct: 24 PYVFYQLQSNLVQRQIAETLGATINQITNKDLSNFKIAVPRNKDEYIEISDKL----ASI 79 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 D L+ +++ + + +++ + +TG+ L Sbjct: 80 DGLIIDLKKIVNKKQAIKTATMQQLLTGKTRLP 112 >gi|119471837|ref|ZP_01614170.1| Restriction endonuclease S subunits-like protein [Alteromonadales bacterium TW-7] gi|119445327|gb|EAW26616.1| Restriction endonuclease S subunits-like protein [Alteromonadales bacterium TW-7] Length = 440 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 59/404 (14%), Positives = 127/404 (31%), Gaps = 19/404 (4%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAKGQILYG 89 +++ + ++ D+++G Y K Q++ G +L Sbjct: 35 GNHGEIHPKGDDFVESGVPFVMASDIKNGQINYETCKYIKPEQAEGLRKGFAKSGDVLLT 94 Query: 90 KLGPYLRKAIIADFDGICS------TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC-E 142 R A++ + T + VL ++ L+ + S + +E Sbjct: 95 HKATIGRTALVNNKSYQYIMLTPQVTYYRVLDHNELSNLYLKYYFDSSLFQKTLELWSGS 154 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+T ++ +P+ +PP+ +Q I + A I+ + KE + Sbjct: 155 GSTRAYLGITAQHKLPVILPPIEKQKKIASTLAAYDNLIENSRKRISTIENITKEVYREW 214 Query: 203 VSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K W +G + D K + I+ Sbjct: 215 FVRFRFPEYKTSSFKKGIPASWEIKTLGDLCDVTSSKRIYQEDYVP-EGVPFFRSKEIIQ 273 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 S G + + + E E + G+I+ + LR Sbjct: 274 KSNGLEPKDILYISDEKYTEIKEKFGSPKSGDILLTSVGTLGISYQLRDDDKFYFKDGNL 333 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 ++ ++ +L + + S + +Q+ +K++ VL+P I+ Sbjct: 334 IWLKALDQEVN-KFLKFWLNSPVGKAALLETTIGSSQQAFTISGLKKVKVLLPNIEL--- 389 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 IT N +A + + Q I +L + S I V+G+ + Sbjct: 390 ITEF-NKFSAPLKEQCYNLHQQIKILNNTKLSLIERLVSGEQKV 432 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 59/181 (32%), Gaps = 12/181 (6%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET----YQIVDPGEI 291 P E++ K +ES + + +I + + G++ Sbjct: 32 PLDGNHGEIHPKGDDFVESGVPFVMASDIKNGQINYETCKYIKPEQAEGLRKGFAKSGDV 91 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYDLCK--VFY 347 + + + + Y + + + + YL + S K + Sbjct: 92 LLTHKATIGRTALVNNKSYQYIMLTPQVTYYRVLDHNELSNLYLKYYFDSSLFQKTLELW 151 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + R L +LPV++PPI++Q I + + A D L+E + I ++ Sbjct: 152 SGSGSTRAYLGITAQHKLPVILPPIEKQKKIASTL----AAYDNLIENSRKRISTIENIT 207 Query: 408 S 408 Sbjct: 208 K 208 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 64/217 (29%), Gaps = 26/217 (11%) Query: 6 AYPQYKDS----GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVE 57 +P+YK S G IP W++ + + + + + + + +++ Sbjct: 219 RFPEYKTSSFKKG------IPASWEIKTLGDLCDVTSSKRIYQEDYVPEGVPFFRSKEII 272 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAK-------GQILYGKLGPYLRKAIIAD---FDGIC 107 + PKD + + + G IL +G + D F Sbjct: 273 QKSNGLEPKDILYISDE--KYTEIKEKFGSPKSGDILLTSVGTLGISYQLRDDDKFYFKD 330 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + + + L+ WL S + G++ G+ + + +P + Sbjct: 331 GNLIWLKALDQEVNKFLKFWLNSPVGKAALLETTIGSSQQAFTISGLKKVKVLLPNIELI 390 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + L + + LVS Sbjct: 391 TEFNKFSAPLKEQCYNLHQQIKILNNTKLSLIERLVS 427 >gi|324992017|gb|EGC23939.1| type I restriction enzyme specificity protein [Streptococcus sanguinis SK405] Length = 408 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 45/406 (11%), Positives = 115/406 (28%), Gaps = 33/406 (8%) Query: 24 HWKVVPIKRFTKLNTGRTS-----------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 W + + S + I ++ D+++G ++ Sbjct: 17 SWSIKKLSDTQTCFKDGNYGEAYPKETDLTTSTQGIPFLRGSDLDNGKLTLTNARYITKS 76 Query: 73 SDTSTVS-IFAKGQILYGKLGPYL--RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 VS + I+ G ++Q +++ + + Sbjct: 77 KHNELVSGHLIEDDIVIAVRGSLGSLGYVSPESVGWNINSQLAIIRTRKIEIIGNYLIQF 136 Query: 130 SIDVT--QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + I + G + K + NI +PIP + EQ I + + Sbjct: 137 LLSNRGGKEISSHITGTALKQLPIKQLKNIKVPIPKIDEQSAIGSLFRTLDDLLASYKVN 196 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + L + P++++ EW + + + ++ Sbjct: 197 LANYQSLKATMLSKMFPKAGQT--VPEIRLDGFEGEWGNAIINDYVTLLNGRAF----KQ 250 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + L + GN + L+ E + + ++++ + + Sbjct: 251 DELLNGGKYRVVRVGNFNTNEKWYYSNLELEENK---YANKDDLLYLWATNFGPEIWKEE 307 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + I + ID YL + + D ++ + ++ Sbjct: 308 KIIFHYHIWKLEF---DRSIIDRNYLYYWLE-KDKKRIQQNTNGSTMVHVTKSMMENREF 363 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 L P +EQ I + + +D L+ ++ I L+ + + Sbjct: 364 LFPMFREQQAIGSY----FSNLDNLINSHQEKISQLETLKKKLLQD 405 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 26/208 (12%), Positives = 65/208 (31%), Gaps = 11/208 (5%) Query: 213 PDVKMKDSGIEW-VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 P ++ K W + + D ++ + + G+ + + Sbjct: 7 PKIRFKKFNDSWSIKKLSDTQTCFKDGNYGEAYPKETDLTTSTQGIPFLRGSDLDNGKLT 66 Query: 272 NMGLKPESYETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + + G +IV + V A + + Sbjct: 67 LTNARYITKSKHNELVSGHLIEDDIVIAVRGSLGSLGYVSPESVGWNINSQLAIIRTRKI 126 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I YL + S K + +G + L + +K + V +P I EQ I + Sbjct: 127 EIIGNYLIQFLLSNRGGKEISSHITGTALKQLPIKQLKNIKVPIPKIDEQSAIGS----L 182 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ + ++ + +++ ++ Sbjct: 183 FRTLDDLLASYKVNLANYQSLKATMLSK 210 >gi|269966771|ref|ZP_06180846.1| hypothetical protein VMC_22760 [Vibrio alginolyticus 40B] gi|269828631|gb|EEZ82890.1| hypothetical protein VMC_22760 [Vibrio alginolyticus 40B] Length = 371 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 49/362 (13%), Positives = 114/362 (31%), Gaps = 25/362 (6%) Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 Y+ N + S + ++ G I++ + + L D Sbjct: 26 YVVYGANGKIGYYSEYTH-ENPTVMITCRGATCGNVHISEPKAYINGNAMALDDVDPE-R 83 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + +L + + + G+ KG+ + +P+PPL Q I E + Sbjct: 84 VDINYLRYCLIDRGFRDVISGSAQPQITGKGLSKVQIPLPPLETQKQIAEVLEKADQLRK 143 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 L + + +P K W VT Sbjct: 144 DCQQMEQELNSLAQSV-------FIDMFGDPVTNPKG----WDLKPLSSLGEVKGGLQVT 192 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF--RFIDLQN 300 N + ++ Y + ++ E + + + E +++ G+++F + Sbjct: 193 SKRAANPISVPYLRVANVYRDHLELDEVKEIRVTENELE-RVLLEKGDVLFVEGHGNANE 251 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLR--QSL 357 R+ + + + + + +P Y++ + S + M +L Sbjct: 252 VGRTAVWNDEVAQCVHQNHLIRFRPGADVRPEYVSAFVNSASGKRQLLKMSKTTSGLNTL 311 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV-LLKERRSSFIAAAVT 416 ++K + VLVPP+ EQ D + A+ + + + L + ++ + A Sbjct: 312 STSNIKSIQVLVPPLLEQDDFLAFLASCKAQ-----QVVNDQLSVELDQNFNALMQKAFK 366 Query: 417 GQ 418 G+ Sbjct: 367 GE 368 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 44/144 (30%), Gaps = 9/144 (6%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 K Y Y +P ++ + + G A V P Sbjct: 24 DGYVVYGANGKIGYYSEYTHENP-TVMITCRGATCGNVHISEPKAYINGNAM-ALDDVDP 81 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 +D YL + + V + + + + ++ + +PP++ Q I V+ Sbjct: 82 ERVDINYLRYCLIDRGFRDVI---SGSAQPQITGKGLSKVQIPLPPLETQKQIAEVLE-- 136 Query: 386 TARIDVLVEKIEQSIVLLKERRSS 409 + D L + +Q L S Sbjct: 137 --KADQLRKDCQQMEQELNSLAQS 158 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 58/204 (28%), Gaps = 17/204 (8%) Query: 22 PKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK W + P+ ++ G + + + + Y+ + +V + + Sbjct: 171 PKGWDLKPLSSLGEVKGGLQVTSKRAANPISVPYLRVANVYRDHLELDEVKEIRVTENEL 230 Query: 77 TVSIFAKGQILY----GKLGPYLRKAIIADFDG-ICSTQFLVLQPKDVLPELLQGWLLSI 131 + KG +L+ G R A+ D L+ Sbjct: 231 ERVLLEKGDVLFVEGHGNANEVGRTAVWNDEVAQCVHQNHLIRFRPGADVRPEYVSAFVN 290 Query: 132 D---VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 Q ++ + ++ I +I + +PPL EQ + Sbjct: 291 SASGKRQLLKMSKTTSGLNTLSTSNIKSIQVLVPPLLEQDDFLAFL----ASCKAQQVVN 346 Query: 189 IRFIELLKEKKQALVSYIVTKGLN 212 + L + AL+ LN Sbjct: 347 DQLSVELDQNFNALMQKAFKGELN 370 >gi|149005622|ref|ZP_01829361.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP18-BS74] gi|147762562|gb|EDK69522.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP18-BS74] Length = 522 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 68/441 (15%), Positives = 132/441 (29%), Gaps = 71/441 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKG--------------LNPDVKMKDSGIEW-------- 224 +L KE ++++ Y + L K E Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDL 322 Query: 225 ------------------------VGLVPDHWEVKPFFALVTELNRKNTK-----LIESN 255 + +P+ W F +LV K + Sbjct: 323 DISIVSQGDDNSYYGNKDETTSYPIYKIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTE 382 Query: 256 ILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I +S ++ N + + I G ++ F L Sbjct: 383 IPWVSISDMPISGYVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATH 442 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 II+ + I YL + G ++L + L + + Sbjct: 443 NEAIIS-IFPYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISN 499 Query: 372 IKEQFDITNVINVETARIDVL 392 +E I +++ ++ L Sbjct: 500 HEEMKRIIFKVDLLFQKVSQL 520 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 46/130 (35%), Gaps = 17/130 (13%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESG 59 YP YK IP+ W+ + G+T + +I ++ + D+ SG Sbjct: 345 YPIYK---------IPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISG 395 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + + + + I KG +L + K I D + + + P Sbjct: 396 YVTNTRESISKLALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYAN 454 Query: 120 LPELLQGWLL 129 +++ +L+ Sbjct: 455 KENIIRDYLM 464 >gi|153807715|ref|ZP_01960383.1| hypothetical protein BACCAC_01997 [Bacteroides caccae ATCC 43185] gi|160886163|ref|ZP_02067166.1| hypothetical protein BACOVA_04170 [Bacteroides ovatus ATCC 8483] gi|160889101|ref|ZP_02070104.1| hypothetical protein BACUNI_01522 [Bacteroides uniformis ATCC 8492] gi|149129324|gb|EDM20538.1| hypothetical protein BACCAC_01997 [Bacteroides caccae ATCC 43185] gi|156108048|gb|EDO09793.1| hypothetical protein BACOVA_04170 [Bacteroides ovatus ATCC 8483] gi|156861568|gb|EDO54999.1| hypothetical protein BACUNI_01522 [Bacteroides uniformis ATCC 8492] Length = 376 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 71/397 (17%), Positives = 128/397 (32%), Gaps = 37/397 (9%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI--FAKGQ 85 V K Y D + + G D I F GQ Sbjct: 4 VKFGDVVKDVKINIDRLNNPYEYYVAGDHMDSEDLTIHRKGCFTTDDVGPAFIRVFKPGQ 63 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVL---QPKDVLPELLQGWLLSIDVTQRIEAICE 142 ILYG YL+K +ADF+G+C+ V P LL +LS D T A + Sbjct: 64 ILYGSRRTYLKKIAVADFEGVCANTTFVFETKDPHAFEQRLLPFIMLSKDFTTWSIAKSK 123 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+T + + + + +PPL EQ ++ +K+ A + + E ++ Sbjct: 124 GSTNPYVLFSDLADFEFELPPLEEQKVLVDKLWAAYRL----KEAYKKLLVATDEMVKSQ 179 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + N I G+ P + ++ L+ + G Sbjct: 180 FIEMYYNTHNKQTLESVCPIMNKGITPKYV-------------ESSSVLVINQACIHWDG 226 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG-IITSAYM 321 + ++ N + +I++ G+++ R + I ++ Sbjct: 227 QRLGNIKYHNEEIPV----RKRILESGDVLLNATGNGTLGRCCVFICPSDNNTYINDGHV 282 Query: 322 AVKPHGI---DSTYLAWLMRSYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 L + D Y GS + + F D+K++ V VP + EQ Sbjct: 283 IALSTDRAVILPEVLNTYLSLNDTQAEIYRQYVTGSTNQVDIVFSDIKKMKVPVPSMDEQ 342 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 V+ + D +++Q I + + S I Sbjct: 343 ILFVEVL----TQADKSKFELKQCIENIDKVIKSLIN 375 >gi|29294593|ref|NP_808862.1| type I restriction/modification system specificity subunit [Lactococcus lactis subsp. lactis bv. diacetylactis] gi|29170405|emb|CAD79593.1| HsdD protein [Lactococcus lactis subsp. lactis bv. diacetylactis] Length = 412 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 52/409 (12%), Positives = 116/409 (28%), Gaps = 76/409 (18%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV----ESGTGKYLPKD 67 +P+ W+ + + + G T + + G D E G +Y+ K Sbjct: 62 KVPELRFKGFTDDWEERKLGELSNIVGGGTPSTSNSEYWDGDIDWYAPAEIGEQRYVSKS 121 Query: 68 GNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + S+ I G +L+ AI+ + F + P + Sbjct: 122 KKTITELGLKKSSARILPVGTVLFTSRAGIGNTAILGKE-ATTNQGFQSIVPNPNKLDSY 180 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + ++ + E G+T K + + + +P L + + + Sbjct: 181 FIYSRTNELKRYGEVTGAGSTFVEISGKQMSKMSIMVPELRFAGFADDWEERKLSSMTNY 240 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + + K L++ ++ +K SG F + Sbjct: 241 KNGKSHEDKQSTSGKLELIN---LNSISISGGLKHSG--------------KFIDEADDT 283 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 +K+ L + ++ + L PE L Sbjct: 284 LQKDD-------LVMILSDVGHGDLLGRVALYPEDD--------------RFVLNQRVAL 322 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 LR + + + S A + + + + F Sbjct: 323 LRPNTIADPQFLFSYINAHQ---------YYFKAQGAGMSQLNISKGSVENFISF----- 368 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 V + I+EQ I + ++D + ++ + LLKE++ F+ Sbjct: 369 --VPI--IEEQKKIGSF----FKQLDETIALHQRKLDLLKEQKKGFLQK 409 >gi|315038769|ref|YP_004032337.1| type I restriction-modification system S subunit [Lactobacillus amylovorus GRL 1112] gi|312276902|gb|ADQ59542.1| type I restriction-modification system S subunit [Lactobacillus amylovorus GRL 1112] Length = 361 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 48/377 (12%), Positives = 104/377 (27%), Gaps = 41/377 (10%) Query: 30 IKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +K + TG T DI ++ + + G + +S I Sbjct: 5 LKNIGTIITGNTPSKKNSKYWNSNDICFVKPDVIGDGVDNVNQSNEYISNYASSKARIVG 64 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K IL +G R I++ + Q + P + ++L ++ A+ Sbjct: 65 KNTILITCIGNIGRVGIVSKEKIAFNQQISAIVPNCKINFRYLAYVLLFS-KSKLNAMAN 123 Query: 143 GATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 A + + + N + I L Q I E + E L Sbjct: 124 SAVVPIINKTQLENFKIKIDSNLEHQAQIVEALDKIEEIKRIQDKEIKYLDTL------- 176 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + + V +P + KD + +G + A ++ ++ + Sbjct: 177 IKARFVEMFGDPIINTKDLSLVSLGKL------CTLKAGEFTAAKEIHANKDNINRYPCF 230 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G + N L + +L + G + Sbjct: 231 GGNGVRGYVDNYT-----------------HDGNYSLIGRQGALCGNVQLTAGKFRNTEH 273 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 A+ WL L K+ + L + + ++ + + + Q + + Sbjct: 274 AILVKPNVQVNYYWLFMLLKLEKLNRFSSGAAQPGLAVKTLNKIFIPIADLNLQNEFASF 333 Query: 382 INVETAR--IDVLVEKI 396 ++ L+ K Sbjct: 334 AQQVDKSKVVNNLIMKY 350 >gi|254435960|ref|ZP_05049467.1| hypothetical protein NOC27_3023 [Nitrosococcus oceani AFC27] gi|207089071|gb|EDZ66343.1| hypothetical protein NOC27_3023 [Nitrosococcus oceani AFC27] Length = 339 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 42/310 (13%), Positives = 97/310 (31%), Gaps = 27/310 (8%) Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + I R+ A+ G + I ++ +P+P EQ I + + + Sbjct: 30 FIFHFLITQRLRLIALASGNLIPGLSRGDILSLKVPVPSHEEQQKIADCLSSLDAL---- 85 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG-LVPDHWEVKPFFALVTE 243 I + ++ LK K+ L+ + + +++ G F Sbjct: 86 IAAQTEKLDALKTHKKGLMQQLFPRAGETVPRLRFPKFRDGGRWTSKKMSDVYRFLSTNT 145 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLE-----------TRNMGLKPESYETYQIVDPGEIV 292 +R + + ++ YG+I K N E + G+IV Sbjct: 146 YSRDKLNYEKGEVKNIHYGDIHTKFSTLFDVTQEYVPYINRTESLERIKDDSYCLEGDIV 205 Query: 293 FRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYLA---WLMRSYDLCKVFY 347 F + + I++ + + + + +L +S + + Sbjct: 206 FADASEDVEDVGKSIEIVNTGNEKILSGLHTLLARQKNNDLVIGFGGYLFKSGLIREQIK 265 Query: 348 AMGSGLRQ-SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G + + + ++ V P +EQ I + + + +D L+ + I LK Sbjct: 266 RESQGAKVLGISSGRLSKIKVCFPYEKREQQKIAHCL----SSLDALIAAQAEKIDALKT 321 Query: 406 RRSSFIAAAV 415 + + Sbjct: 322 HKKGLMQQLF 331 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 34/87 (39%), Gaps = 5/87 (5%) Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + + L + A G+ L L D+ L V VP +EQ I + + + Sbjct: 27 HGEFIFHFLITQRLRLIALASGN-LIPGLSRGDILSLKVPVPSHEEQQKIADCL----SS 81 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAV 415 +D L+ + + LK + + Sbjct: 82 LDALIAAQTEKLDALKTHKKGLMQQLF 108 >gi|327540221|gb|EGF26810.1| type I restriction-modification system S subunit [Rhodopirellula baltica WH47] Length = 603 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 70/201 (34%), Gaps = 10/201 (4%) Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY- 261 + Y+ ++GL +K E VP +W + L +++ T + N+ + Sbjct: 82 IEYLKSRGLKKGKSLKPPAPEEF-QVPANWTLTHLNDLAYQVHYGYTASADENLRDVRML 140 Query: 262 ------GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 N++ ++ E Y++ D +++ K L + Sbjct: 141 RITDIQNNMVNWQTVPGCEIEEEKVAQYELAD-NDLLIARTGGTIGKTYLIQGVSVRSVF 199 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKE 374 + + + + YL + A +G + ++ +K L V VPP+ E Sbjct: 200 ASYLIRVIPSKLVCAEYLKRFLECPFYWGQLRAKSAGTGQPNVNATSLKSLIVPVPPLAE 259 Query: 375 QFDITNVINVETARIDVLVEK 395 Q I + + + D L + Sbjct: 260 QRRIVSKVEGLMSLCDTLESQ 280 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 74/230 (32%), Gaps = 19/230 (8%) Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLV---PDHWEVKPFFAL-----VTELNRKNTKL 251 Q+L+S K L K S IE P W V + + Sbjct: 379 QSLISE---KKLKKQW--KFSNIEDDDEPFPIPQSWAWCRILDTAERVTVGHVGSMKDEY 433 Query: 252 IESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 ++ I L N + L + + + + + PG+++ N + Sbjct: 434 VDEGIPFLRTLNVRALRYEPLGLKFISPEFHASLAKSALAPGDVLVVRSG--NVGTTCVV 491 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + + + P +D YLA M S V + V +P+ Sbjct: 492 PDSLPEANCSDLVIVKVPIAVDPNYLAIYMNSAAKVHVEAGTVGVALTHFNTKSVAAMPL 551 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +PP EQ I + ++V +++D L ++ ++ I + G Sbjct: 552 SLPPKAEQKRIVSKVSVLLSQLDELSARLRSRQSTTDALLTALIHQILEG 601 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 78/192 (40%), Gaps = 8/192 (4%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W + + +++ G T+ + + D+ + + D+++ + G + + Sbjct: 105 QVPANWTLTHLNDLAYQVHYGYTASADENLRDVRMLRITDIQNNMVNWQTVPGCEIEEEK 164 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGI----CSTQFLVLQPKDVLPELLQGWLLSI 131 A +L + G + K + + S V+ K V E L+ +L Sbjct: 165 VAQYELADNDLLIARTGGTIGKTYLIQGVSVRSVFASYLIRVIPSKLVCAEYLKRFLECP 224 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ A G + + + ++ +P+PPLAEQ I K+ DTL ++R Sbjct: 225 FYWGQLRAKSAGTGQPNVNATSLKSLIVPVPPLAEQRRIVSKVEGLMSLCDTLESQRRSR 284 Query: 192 IELLKEKKQALV 203 + + ++++ Sbjct: 285 ESVRERASRSVL 296 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 73/198 (36%), Gaps = 14/198 (7%) Query: 21 IPKHWKVVPIKRFTKLNT----GRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP+ W I + T G + + I ++ +V + +Y P + Sbjct: 405 IPQSWAWCRILDTAERVTVGHVGSMKDEYVDEGIPFLRTLNVRA--LRYEPLGLKFISPE 462 Query: 75 TS---TVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLL 129 S A G +L + G ++ D + CS +V P V P L + + Sbjct: 463 FHASLAKSALAPGDVLVVRSGNVGTTCVVPDSLPEANCSDLVIVKVPIAVDPNYLAIY-M 521 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + +EA G ++H + K + +P+ +PP AEQ I K+ ++D L Sbjct: 522 NSAAKVHVEAGTVGVALTHFNTKSVAAMPLSLPPKAEQKRIVSKVSVLLSQLDELSARLR 581 Query: 190 RFIELLKEKKQALVSYIV 207 AL+ I+ Sbjct: 582 SRQSTTDALLTALIHQIL 599 >gi|330978668|gb|EGH77949.1| restriction modification system DNA specificity domain [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 603 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 62/473 (13%), Positives = 136/473 (28%), Gaps = 79/473 (16%) Query: 21 IPKHWKVVPIKRF-TKLN----TGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSD 74 +PK+W+ V + ++ +G+ + G + ++ +V+ G + D Sbjct: 19 LPKNWERVSLGEISANISPGFASGKHNSDGSGVPHLRPMNVDRDGQIDLSVVKSVAESKD 78 Query: 75 TSTVSIFAKGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 G IL+ + S LQ + + L Sbjct: 79 VE----LKSGDILFNNTNSAELVGKTAVVSHRETGFAFSNHMTRLQLESGIASSFVARQL 134 Query: 130 SIDVTQRIEAICEGATMSHADWKG---IGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 ++ A IP +PP AEQ+ I K+ +D + Sbjct: 135 HFLWMSGYMKYRCTNHVNQASISSKTLANTIPFFLPPSAEQIRIVAKLEELLTDLDAGVA 194 Query: 187 ERIRFIELLKEKKQALVSYIVTKG-LNPDVKMKDSGIEWV-------------------- 225 E + LK+ +Q+L+ +G L + + + E Sbjct: 195 ELKTAQKKLKQYRQSLLKSAGWEGMLTAEWRAQHKPTETGAQLLQRILTERRASWEAKQL 254 Query: 226 ------GLVPDHWEVKPFFALVTELNRKNTKLIESNI---------------------LS 258 G P K + +L + Sbjct: 255 AKFKDQGKAPPKDWQKKYPEPAQANTSDLPELPAGWVWASVEQISEIQGGIQKQPSRAPK 314 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVD---------PGEIVFRFIDLQNDKRSLRSAQ 309 ++ ++ LK + ++ G+++ + + + Sbjct: 315 VNKYPFLRVANVARGKLKLDDIHEIELFPGELERLALVAGDVLIVEGNGSLTEIGRCALW 374 Query: 310 VM--ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRL 365 + + + V+P G+ S +L + S+ + + +L + ++ Sbjct: 375 DGSVTNAVHQNHLIRVRPIGVVSQFLETWLNSFGGIDKLTKLAATTSGLYTLSVGKISKV 434 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PV + P EQ V+ +D + + S+ +R + + AA GQ Sbjct: 435 PVPIAPRTEQEAAMKVLVESLLALDFQEQSVSLSLKQSTAQRQNILRAAFAGQ 487 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 69/206 (33%), Gaps = 12/206 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 + +P W +++ +++ G + ++ + +V G K Sbjct: 283 LPELPAGWVWASVEQISEIQGGIQKQPSRAPKVNKYPFLRVANVARGKLKLDDIHEIELF 342 Query: 73 SDTSTVSIFAKGQILY----GKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQG 126 G +L G L R A+ + + + ++P V+ + L+ Sbjct: 343 PGELERLALVAGDVLIVEGNGSLTEIGRCALWDGSVTNAVHQNHLIRVRPIGVVSQFLET 402 Query: 127 WLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 WL S ++ + + + I +P+PI P EQ + ++ + +D Sbjct: 403 WLNSFGGIDKLTKLAATTSGLYTLSVGKISKVPVPIAPRTEQEAAMKVLVESLLALDFQE 462 Query: 186 TERIRFIELLKEKKQALVSYIVTKGL 211 ++ ++Q ++ L Sbjct: 463 QSVSLSLKQSTAQRQNILRAAFAGQL 488 >gi|217975328|ref|YP_002360079.1| restriction modification system DNA specificity domain-containing protein [Shewanella baltica OS223] gi|217500463|gb|ACK48656.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS223] Length = 401 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 56/379 (14%), Positives = 116/379 (30%), Gaps = 29/379 (7%) Query: 48 IIYIGLEDVESGTGKY---LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 YI L ++ + L ++ +S ++ + + G +L + P L + + Sbjct: 29 FKYIDLGSLDKDKKEICLDLVQEISSSEAPSRARQLVKTGDVLISTVRPNLNGIAVVPKE 88 Query: 105 ---GICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPM 159 ST F VL+ + L+ W+ S + GA+ K I + + Sbjct: 89 LDGATASTGFCVLRANEEKLDSTYLRYWVESTTFVSDMVNKSTGASYPAVSDKIINDSEL 148 Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 P+PPL Q I + L + + +P K Sbjct: 149 PLPPLETQKQIAAVLEKADQLRKDCKLLEQELNSLAQSV-------FIEMFGDPVTNPKG 201 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + +G + K + L +N+ + + M + Sbjct: 202 WKTQMLGSISKVQLGKMLSSASKIGINSKKYLRNANVKWRNIEIH----DLLEMDFTDKE 257 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA---WL 336 E +Q++ G+++ R +E A V+ + +T + Sbjct: 258 IEKFQLI-TGDLLVCEGGEIG--RCAIWIGQVEDCYYQKALHRVRLNPDLATAEYIQEYF 314 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 L + + + L E + +L V +PPI+ Q + I +E Sbjct: 315 FWMAKLGGLISSTNEVTFKHLTAEKMNKLVVPLPPIETQRKF----KTIYSSIQSELEHN 370 Query: 397 EQSIVLLKERRSSFIAAAV 415 + + + S + A Sbjct: 371 AKQMAQTEMVFQSLMQKAF 389 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 62/200 (31%), Gaps = 13/200 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK WK + +K+ G+ S I Y+ +V+ + Sbjct: 199 PKGWKTQMLGSISKVQLGKMLSSASKIGINSKKYLRNANVKWRNIEIHDLLEMDFTDKEI 258 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICST----QFLVLQPKDVLPELLQGWLLSID 132 G +L + G R AI C + L P E +Q + + Sbjct: 259 EKFQLITGDLLVCEGGEIGRCAIWIGQVEDCYYQKALHRVRLNPDLATAEYIQEYFFWMA 318 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + T H + + + +P+PP+ Q K I + + + + Sbjct: 319 KLGGLISSTNEVTFKHLTAEKMNKLVVPLPPIETQ----RKFKTIYSSIQSELEHNAKQM 374 Query: 193 ELLKEKKQALVSYIVTKGLN 212 + Q+L+ LN Sbjct: 375 AQTEMVFQSLMQKAFNDELN 394 >gi|91217330|ref|ZP_01254290.1| Restriction endonuclease S subunits [Psychroflexus torquis ATCC 700755] gi|91184438|gb|EAS70821.1| Restriction endonuclease S subunits [Psychroflexus torquis ATCC 700755] Length = 574 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 65/194 (33%), Gaps = 6/194 (3%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR-NMGLKPESYETY 283 G + F + +KN + + I + G+I KL GL Sbjct: 87 NGWIWSRVRDSGFTQTGSTPPKKNPENYGNYIPFIGPGDISNKLMRYPTEGLSELGISVG 146 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +++ ++ I K ++ V I + + P +S Sbjct: 147 RLIPEDSLMMVCIGGSIGKCNINEIDVSCNQQINTITPILIPTIYIKAVC----QSPFFQ 202 Query: 344 -KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 V + + LP+ +PP++EQ +I V+ + I+ L + + I L Sbjct: 203 SNVLDKSSGSATPIINKGKWESLPIPIPPLEEQKEIVKVVEILFKEIEQLEQLTSERIAL 262 Query: 403 LKERRSSFIAAAVT 416 ++ +S + T Sbjct: 263 KEDFVTSVLNQLST 276 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 70/210 (33%), Gaps = 11/210 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI--------LSLSYGNIIQKLETR 271 + E +P W ++ + ++ G I + Sbjct: 364 TEDEIPYELPVGWVWCRLGDASKQITDGEHQTPPRIASGRKLLSAKNVRDGFINYENCDY 423 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + + G+++ + + S+ + + + + A + + G++ Sbjct: 424 ISEIHYQKSIKRCNPEIGDLLIVSVGGTIGRVSMVTKNISFALVRSVAMVKNQ--GLEPD 481 Query: 332 YLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 YL W+M S L + + G + L ++K + P++EQ I +N D Sbjct: 482 YLRWVMNSPLLKDIIESKKRGGAQPCLYLGEIKDFTFPIAPLEEQKAIVEKVNALMELCD 541 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 L +++ S + S + G+I Sbjct: 542 GLEQEVRHSQEQSELLMKSCLREVFEGKIK 571 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 72/193 (37%), Gaps = 8/193 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W ++ TG T G I +IG D+ + +Y + + + Sbjct: 84 DLPNGWIWSRVRDSGFTQTGSTPPKKNPENYGNYIPFIGPGDISNKLMRYPTEGLS--EL 141 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + + ++ +G + K I + D C+ Q + P + ++ S Sbjct: 142 GISVGRLIPEDSLMMVCIGGSIGKCNINEIDVSCNQQINTITPILIPTIYIKAVCQSPFF 201 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G+ + ++P+PIPPL EQ I + + I+ L I Sbjct: 202 QSNVLDKSSGSATPIINKGKWESLPIPIPPLEEQKEIVKVVEILFKEIEQLEQLTSERIA 261 Query: 194 LLKEKKQALVSYI 206 L ++ ++++ + Sbjct: 262 LKEDFVTSVLNQL 274 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 67/200 (33%), Gaps = 8/200 (4%) Query: 20 AIPKHWKVVPIKRFTK-LNTG--RTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W + +K + G +T + ++V G Y D S Sbjct: 371 ELPVGWVWCRLGDASKQITDGEHQTPPRIASGRKLLSAKNVRDGFINYENCDYISEIHYQ 430 Query: 76 STVSIFAK--GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV--LPELLQGWLLSI 131 ++ G +L +G + + + + + V K+ P+ L+ + S Sbjct: 431 KSIKRCNPEIGDLLIVSVGGTIGRVSMVTKNISFALVRSVAMVKNQGLEPDYLRWVMNSP 490 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + IE+ G I + PI PL EQ I EK+ A D L E Sbjct: 491 LLKDIIESKKRGGAQPCLYLGEIKDFTFPIAPLEEQKAIVEKVNALMELCDGLEQEVRHS 550 Query: 192 IELLKEKKQALVSYIVTKGL 211 E + ++ + + + Sbjct: 551 QEQSELLMKSCLREVFEGKI 570 >gi|239637507|ref|ZP_04678480.1| type I RM system specificity subunit HsdIB [Staphylococcus warneri L37603] gi|239596902|gb|EEQ79426.1| type I RM system specificity subunit HsdIB [Staphylococcus warneri L37603] Length = 380 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 57/395 (14%), Positives = 128/395 (32%), Gaps = 43/395 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 WK + + + +G + G + + D+ + + D + T Sbjct: 19 EWKNNELGKLLSIISGHSPSYYSEGSEYPLYKVNDLNNNSKFQNYSDLYVEKKHTPLNKK 78 Query: 81 FAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I++ K G L K I + G T + L+ DV + + + + + Sbjct: 79 V----IIFPKRGAAILLNKIRIINTPGYIDTNLMGLEFNDVNDTEFYYYAI---LREGLY 131 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I + +T+ + K I + P E+ + +I+ + + E K Sbjct: 132 RIADTSTIPQINNKHILPYKIYSPSYIEKNKLGNFFSKLDQQIELEEQKLAKLEEQKKGY 191 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q + S + + G WE + ++ K + + + Sbjct: 192 MQKIFSQEMRFK------------DENGNDYPDWEETTLKNITNYISSKKSSNQYNERNN 239 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + ++ L+ + E Y + ++L+ K S+ Sbjct: 240 SKGYPVYDAIQEIGKDLQYDMEEPYISILKDGAGAGRLNLRAGKSSVI-----------G 288 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFD 377 ++ + +D +L + M+ + K L ++D + +L+P EQ Sbjct: 289 TMGYIQANNVDIQFLYYRMKLLNFRKFII---GSTIPHLYYKDYSKEKILIPTSNDEQKK 345 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I + I ID L++ + LK+R+ + Sbjct: 346 IGHFI----LNIDKLIDNKTLKLDYLKQRKQGLLQ 376 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 60/179 (33%), Gaps = 8/179 (4%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + E + L N + + + ++ I+F + Sbjct: 35 HSPSYYSEGSEYPLYKVNDLNNNSKFQNYSDLYVEKKHTPLNKKVIIFPKRGAAILLNKI 94 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 R G I + M ++ + ++ T + + ++ + + + + Sbjct: 95 RIINTP--GYIDTNLMGLEFNDVNDTEFYYY--AILREGLYRIADTSTIPQINNKHILPY 150 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 + P E+ + N +++D +E EQ + L+E++ ++ + ++ + E Sbjct: 151 KIYSPSYIEKNKLGNF----FSKLDQQIELEEQKLAKLEEQKKGYMQKIFSQEMRFKDE 205 >gi|254464815|ref|ZP_05078226.1| type I site-specific deoxyribonuclease chain S [Rhodobacterales bacterium Y4I] gi|206685723|gb|EDZ46205.1| type I site-specific deoxyribonuclease chain S [Rhodobacterales bacterium Y4I] Length = 576 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 62/194 (31%), Gaps = 20/194 (10%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESN-------ILSLSYGNIIQKLETRNMG---LKPE 278 P W + + + I++ + + G+ + + +K E Sbjct: 85 PASWHWCYLDDVAAIARGGSPRPIKAYLADGSDGVPWIKIGDSTRGSIYIDRTAERIKAE 144 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW--L 336 ++V PG+++ + +E I + P + S Sbjct: 145 GLSKSRLVVPGDLLLS----NSMSFGFPYITNIEGCIHDGWLVIRTPDQLMSKLFLHALF 200 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + + A + Q+L + V++L V +PPI EQ I ++ A D L + Sbjct: 201 LSEHAKQSFAEAASGAVVQNLNADKVRKLTVPLPPIAEQHRIVAKVDELMALCDRLEQVR 260 Query: 397 EQSIVLLKERRSSF 410 +E R Sbjct: 261 RSR----EELRDKL 270 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 49/179 (27%), Gaps = 13/179 (7%) Query: 246 RKNTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + + L NI + + + V +++ Sbjct: 391 GGKSTYADEGTPFLRSQNIYDDGLRLDDVVFINDETNKKMRRTQVKGKDLLLNITGGSIG 450 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + + + + YL L RS + +G R L Sbjct: 451 RCARIPDDFAGANVSQHVAIIRTAAAGTEDYLHLLCRSPFFQEYVIGEQTGAGRGGLPKN 510 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLV----EKIEQSIVLLKERRSSFIAAAV 415 + R+PV +PP+ EQ I ++ D L I LL + + A+ Sbjct: 511 RMDRIPVPLPPLTEQHRILAKVDALMTLCDRLETALTTTDTTRIRLL----DALLHEAL 565 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 63/195 (32%), Gaps = 9/195 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 IP W + + G + + + +I + D G+ + Sbjct: 84 IPASWHWCYLDDVAAIARGGSPRPIKAYLADGSDGVPWIKIGDSTRGSIYIDRTAERIKA 143 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S + G +L + I I ++ P ++ +L L + Sbjct: 144 EGLSKSRLVVPGDLLLSNSMSFGFPYITNIEGCIHDGWLVIRTPDQLMSKLFLHALFLSE 203 Query: 133 V-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 Q GA + + + + + +P+PP+AEQ I K+ D L R Sbjct: 204 HAKQSFAEAASGAVVQNLNADKVRKLTVPLPPIAEQHRIVAKVDELMALCDRLEQVRRSR 263 Query: 192 IELLKEKKQALVSYI 206 EL + A ++ + Sbjct: 264 EELRDKLTAASLARL 278 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 56/200 (28%), Gaps = 12/200 (6%) Query: 22 PKHWKVVPIKRFT-------KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQS 73 P W + G+++ + + ++ +++ + N + Sbjct: 368 PLGWSWARVGTIALQTGSGSTPRGGKSTYADEGTPFLRSQNIYDDGLRLDDVVFINDETN 427 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQF-LVLQPKDVLPELLQGWLL 129 + +L G + + D S ++ + L Sbjct: 428 KKMRRTQVKGKDLLLNITGGSIGRCARIPDDFAGANVSQHVAIIRTAAAGTEDYLHLLCR 487 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + + GA + IP+P+PPL EQ I K+ A D L T Sbjct: 488 SPFFQEYVIGEQTGAGRGGLPKNRMDRIPVPLPPLTEQHRILAKVDALMTLCDRLETALT 547 Query: 190 RFIELLKEKKQALVSYIVTK 209 AL+ + Sbjct: 548 TTDTTRIRLLDALLHEALEP 567 >gi|257092508|ref|YP_003166149.1| Restriction endonuclease S subunits-like protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257045032|gb|ACV34220.1| Restriction endonuclease S subunits-like protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 403 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 50/410 (12%), Positives = 119/410 (29%), Gaps = 35/410 (8%) Query: 28 VPIKRFTKLNTGRTSESGKDIIY--------IGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 V + G T + + + ++V++ G + Sbjct: 7 VELSDVAAFIRGITFKPEDVVPVDTPGAAACMRTKNVQT-ELDLCDVWGIPQSFVRREDQ 65 Query: 80 IFAKGQILYGKLGPYLRKAIIA---------DFDGICSTQFLVLQPKDVLPELLQGWLLS 130 G +L + F G S L P V P L W S Sbjct: 66 YLIPGDVLVSSANSWNLVGKCCLVPSLPWRSTFGGFIS--VLRANPAKVDPRYLFRWFAS 123 Query: 131 IDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + +S+ + + + +P L EQ I E + Sbjct: 124 DRTQATVRSFGQQTTNISNLNVGRCLKLKLHLPALPEQRRIAEILDKADALRAKRRAALA 183 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + L + + +P K + + + PF + + + + + Sbjct: 184 QLDALTQSI-------FLDMFGDPATNPKGWPCAQLCTLGTKFSDGPFGSNLKSDHYRAS 236 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + ++ G + + + ++ + PG+++ + N + ++ Sbjct: 237 GVRVVRLQNIGVGEFLGADAAYISEDHFRNLKKHECL-PGDVLVGTLGDPNLRACIQPRW 295 Query: 310 VMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPV 367 + + S ++ +L+ ++ + G R + ++ L + Sbjct: 296 LSVALNKADCVQIRPDERTATSEFVCFLLNQPGTQRMAQDLMHGQTRIRISMGRLRSLAI 355 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 VPPI Q D + A ++ L ++ L +S A G Sbjct: 356 PVPPIGLQRDF----TQQVAAMETLKTAHRAALAQLDALFASLQHRAFLG 401 >gi|296121476|ref|YP_003629254.1| restriction modification system DNA specificity domain protein [Planctomyces limnophilus DSM 3776] gi|296013816|gb|ADG67055.1| restriction modification system DNA specificity domain protein [Planctomyces limnophilus DSM 3776] Length = 620 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 58/411 (14%), Positives = 119/411 (28%), Gaps = 32/411 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P W + TG+ + + V +G + + + + F Sbjct: 5 PNGWTTDALSNLVTFKTGKLNSN---------AAVSNGAYPFFTCSQETLR---TNTFAF 52 Query: 82 AKGQILYGKLGPYL-RKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 +L + V+ P+D + + + +++ Sbjct: 53 DTECVLLAGNNANGIYPLKYFHGRFDAYQRTYVVTPQDCTRLNTRFLYYSMWPLLEHLQS 112 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I GA + + + P Q I + A I+ E Sbjct: 113 ISTGAATKFLTLTILNGLQLTFPSEPVQRKIAGILSAYDDLIENNTRRIAILE----EMA 168 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 QA+ P + +G +P+ W+VK ++ + T N Sbjct: 169 QAIYREWFVHFRFPGHENTLLVDSPLGKIPEGWQVKRLDSICERITSGGTPRTNVNEYWD 228 Query: 260 SYGNIIQKLETRNMGLK-PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME------ 312 + ET N + E T + V F + + + Sbjct: 229 GDIPWLSSGETGNTFITETEKKITQEGVTNSSTRFARSGCTVIASAGQGKTRGQTSMLCL 288 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVP- 370 I + +AV G +T F + R SL + + L +++P Sbjct: 289 DCYINQSTIAVTADGKQTTDSFLFFDLVQRYDQFRQISDGSSRGSLTTKLIADLEIILPQ 348 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P Q +V T + +E I + +L++ R + ++G++D+ Sbjct: 349 PFLIQK-----FDVLTTPVVKHIENILRKNKILRKTRDLLLPKLISGELDV 394 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 24/204 (11%), Positives = 66/204 (32%), Gaps = 13/204 (6%) Query: 18 IGAIPKHWKVVPIKRFTK-LNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNS 70 +G IP+ W+V + + + +G T + DI ++ + + K Sbjct: 194 LGKIPEGWQVKRLDSICERITSGGTPRTNVNEYWDGDIPWLSSGETGNTFITETEKKITQ 253 Query: 71 RQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 S+ G + G + + D + + + + Sbjct: 254 EGVTNSSTRFARSGCTVIASAGQGKTRGQTSMLCLDCYINQSTIAVTADGKQTTDSFLFF 313 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + I +G++ + + + Q + +K T + I Sbjct: 314 DLVQRYDQFRQISDGSSRGSLTT----KLIADLEIILPQPFLIQKFDVLTTPVVKHIENI 369 Query: 189 IRFIELLKEKKQALVSYIVTKGLN 212 +R ++L++ + L+ +++ L+ Sbjct: 370 LRKNKILRKTRDLLLPKLISGELD 393 >gi|327480085|gb|AEA83395.1| restriction modification system DNA specificity domain protein [Pseudomonas stutzeri DSM 4166] Length = 562 Score = 82.1 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 81/239 (33%), Gaps = 17/239 (7%) Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW------VGLVPDHWEVKPFFALVTEL 244 + L +E+ + L+S + + K S + + + ++ + Sbjct: 326 MVRLRQERSEWLLSKQDSAPECKTMLRKLSSLSEASPPFPLPDSWQAVHLIDCSRMLVDC 385 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRN-----MGLKPESYETYQIVDPGEIVFRFIDLQ 299 + K I + NI + + E + +PG+I+F Sbjct: 386 HNKTAPYASEGIPIIRTSNIRNREFRFDDLKYVNDETYEYWSRRCPPEPGDIMFTREAPM 445 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG---SGLRQS 356 + + + + M V+P L+ + + A + Sbjct: 446 GEAAIIP---DGAKFCLGQRTMLVRPMHDYIDNRYLLITLTEPHLLERASTDAIGSTVKH 502 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 L+ DV++L + +PP+ EQ I ++ D L ++ Q+ L ++ S+ + AV Sbjct: 503 LRVGDVEQLNIPLPPLAEQHRIVAKVDQLMVLCDQLRTRLTQARQLNEQLASALVEQAV 561 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 57/189 (30%), Gaps = 12/189 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W + + GR + + + + ++ + Sbjct: 82 ELPAGWAWARLSNVVNVLNGRAYKKEELLDAGTPVLRVGNL------FTSNHWYHSNLTL 135 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSIDVT 134 G +L+ I I L + ++ T Sbjct: 136 EEDKYCNPGDLLFAWS-ASFGPFIWQGERSIYHYHIWKLDFYAQGQLSKHYLYNFLLEQT 194 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 Q I+A G +M H + + + +P+PPLAEQ I K+ D L ++ Sbjct: 195 QEIKAAGHGVSMVHMTKEKMEKLVVPVPPLAEQHRIVAKVDELMALCDRLEAQQADAENA 254 Query: 195 LKEKKQALV 203 + QAL+ Sbjct: 255 HAQLVQALL 263 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 63/190 (33%), Gaps = 13/190 (6%) Query: 227 LVPDHWEVKPFFALVT---ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +P W +V K +L+++ L GN+ + L E + Sbjct: 82 ELPAGWAWARLSNVVNVLNGRAYKKEELLDAGTPVLRVGNLFTSNHWYHSNLTLEEDK-- 139 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +PG+++F + ER I + + +L Sbjct: 140 -YCNPGDLLFAWSASFGPFI-----WQGERSIYHYHIWKLDFYAQGQLSKHYLYNFLLEQ 193 Query: 344 -KVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + A G G+ + E +++L V VPP+ EQ I ++ A D L + + Sbjct: 194 TQEIKAAGHGVSMVHMTKEKMEKLVVPVPPLAEQHRIVAKVDELMALCDRLEAQQADAEN 253 Query: 402 LLKERRSSFI 411 + + + Sbjct: 254 AHAQLVQALL 263 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 70/196 (35%), Gaps = 9/196 (4%) Query: 21 IPKHWKVVPIKR----FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W+ V + + + + I I ++ + ++ + ++ Sbjct: 366 LPDSWQAVHLIDCSRMLVDCHNKTAPYASEGIPIIRTSNIRNREFRFDDLKYVNDETYEY 425 Query: 77 TVSIF--AKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSI 131 G I++ + P AII D C T + + L L Sbjct: 426 WSRRCPPEPGDIMFTREAPMGEAAIIPDGAKFCLGQRTMLVRPMHDYIDNRYLLITLTEP 485 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +R G+T+ H + + +P+PPLAEQ I K+ V D L T + Sbjct: 486 HLLERASTDAIGSTVKHLRVGDVEQLNIPLPPLAEQHRIVAKVDQLMVLCDQLRTRLTQA 545 Query: 192 IELLKEKKQALVSYIV 207 +L ++ ALV V Sbjct: 546 RQLNEQLASALVEQAV 561 >gi|322689707|ref|YP_004209441.1| restriction-modification system specificity subunit [Bifidobacterium longum subsp. infantis 157F] gi|320461043|dbj|BAJ71663.1| restriction-modification system specificity subunit [Bifidobacterium longum subsp. infantis 157F] Length = 385 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 59/394 (14%), Positives = 127/394 (32%), Gaps = 35/394 (8%) Query: 25 WKVVPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + + G S + +Y+ ++D+ + + D R + V Sbjct: 19 WEQRKLGEIVSIGAGAPPSAFSAGNFLYVKVDDLNESS--HFQFDSAQRVDANTAVKPIR 76 Query: 83 KGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG I++ K G K + T + L+P+ V +L + I Sbjct: 77 KGSIIFAKRGAAILGNKVRVLGKTAYIDTNMMALEPRGVD----ADFLWLFINQTGLYRI 132 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + +T+ + K I P+ IP +AEQ I R+D LIT R + L K+ Sbjct: 133 ADTSTIPQINNKHIEPYPVDIPNMAEQQAIGTF----FSRLDDLITLHQRKYDKLVIFKK 188 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 +++ + K +++ +G E+ + + R L+ Sbjct: 189 SMLEKMFPKDGESVPEIRFAGFTDPWEQRKLGELFDYEQPQPYIVRGTEYDDSFPTPVLT 248 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G + +G E ++ + +V I Sbjct: 249 AGQ------SFVLGYTNEKQGIKMASPEHPVIIFDDFTTSSHFVDFPFKVKSSAI---KL 299 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDIT 379 + ++ D + ++++ V + L+P E I Sbjct: 300 LTLRDKNEDIHFAYQVLQNIAYTPV-------SHERHWISKFATFATLMPECKSEMQAIG 352 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + +D L+ ++ + LL++ + S + Sbjct: 353 HF----MSNLDGLITLHQRKLELLQDIKKSLLDK 382 >gi|270157705|ref|ZP_06186362.1| putative type I restriction-modification system S subunit [Legionella longbeachae D-4968] gi|269989730|gb|EEZ95984.1| putative type I restriction-modification system S subunit [Legionella longbeachae D-4968] Length = 437 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 61/437 (13%), Positives = 133/437 (30%), Gaps = 56/437 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 +V + + + D + +G Y P G S D+ IF Sbjct: 6 EVKKLIDLVNFENNKRIP-------LKDSDRKKRSGIY-PYYGASGIIDSIDDFIFDGEY 57 Query: 86 ILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L + G L+ A A + +L K++ +L + Sbjct: 58 LLISEDGENLKTRKTPIAFKACGKFWVNNHAHILSEKEI---GTLDYLKYYFSQFSVLPY 114 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 GA K + I +P P ++ I + + T +I+ + + + Sbjct: 115 ITGAAQPKLSKKNLEIIEIPFPNKITRLKINAILNSLTRKIELNKKINQTLESIAQTIFK 174 Query: 201 ALVSYIVTKGLNPDV--------------------KMKDSGIEW--VGLVPDHWEVKPFF 238 + + + S E GL+P W++ Sbjct: 175 SWFVDFDPVHAKANASSEDEYDTIAKELGISREILDLFPSEFEESDQGLIPKGWKINNLS 234 Query: 239 ALVTELNRKNTKLIESNI-----LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 +T + K+ K E ++L + + Y+ Q+V PGE++ Sbjct: 235 NYITVVKGKSYKSSELQPSTTALVTLKSFHRGGGYRLDGLKPYTGKYKAEQLVKPGELII 294 Query: 294 RFIDLQ------NDKRSLRSAQVMERGIITSA--YMAVKPHGIDSTYLAWLMRSYDLCKV 345 + D+ + +E + + + + + YL ++ Sbjct: 295 AYTDVTQNADVIGKPAVIIKNSNIENLVASLDVGIIRIIKNHFQQGYLYNYFKTDLFQNY 354 Query: 346 FYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 SG L + +L PP I + + +++ Q I +L+ Sbjct: 355 ILGYTSGTTVLHLSKNWLIDHMILTPP----SQIIDRFEKLSTHFFQMIDANFQEIEILE 410 Query: 405 ERRSSFIAAAVTGQIDL 421 + ++ + ++G+ID+ Sbjct: 411 KSKNELLPKLLSGEIDV 427 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 54/202 (26%), Gaps = 17/202 (8%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIY----IGLEDVESGTGKYLPKDGNSRQSD 74 G IPK WK+ + + + G++ +S + + L+ G G Y Sbjct: 222 GLIPKGWKINNLSNYITVVKGKSYKSSELQPSTTALVTLKSFHRGGG-YRLDGLKPYTGK 280 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC------------STQFLVLQPKDVLPE 122 + G+++ +I I + + Sbjct: 281 YKAEQLVKPGELIIAYTDVTQNADVIGKPAVIIKNSNIENLVASLDVGIIRIIKNHFQQG 340 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L + + I G T+ H + + + PP + ID Sbjct: 341 YLYNYFKTDLFQNYILGYTSGTTVLHLSKNWLIDHMILTPPSQIIDRFEKLSTHFFQMID 400 Query: 183 TLITERIRFIELLKEKKQALVS 204 E + E L+S Sbjct: 401 ANFQEIEILEKSKNELLPKLLS 422 >gi|282601270|ref|ZP_06257961.1| putative HsdS [Subdoligranulum variabile DSM 15176] gi|282569616|gb|EFB75151.1| putative HsdS [Subdoligranulum variabile DSM 15176] Length = 283 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 111/306 (36%), Gaps = 30/306 (9%) Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHA--DWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + + + +G + + +PP+ EQ I + ++ Sbjct: 1 MFNLMMQLPHYAKLFYLMSDGVHIEKLLFKTNDWLERKLAMPPIGEQKRIAAILTSQDKF 60 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 ID K Q L+ +G + + W+++P ++ Sbjct: 61 IDLKEKRLAEKQRQKKYLVQQLI----------------TGKKRLPGFQGEWQLQPLRSV 104 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMG-LKPESYETYQIVDPGEIVFRFIDLQ 299 + E + K +E ++LS I K E + L + Y+I G+I + +L Sbjct: 105 LKERKSYSPKGLEYPHVTLSTEGIFPKSERYDRDHLVKNEDKEYKITHLGDICYNPANL- 163 Query: 300 NDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQ 355 K + I + Y+ + + YLA + +D G R Sbjct: 164 --KFGVICENTFGDAIFSPIYVTFEVSDKVCKEYLANYLMRWDFINAVRKYEEGTVYERM 221 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++K ED + + +P + EQ I V++ ID+L + IEQ K+++ + + + Sbjct: 222 AVKPEDFLKYVIRLPSLDEQNAIAKVLSTADREIDLLRQDIEQE----KQKKKALMQLLL 277 Query: 416 TGQIDL 421 TG + + Sbjct: 278 TGIVRV 283 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 38/95 (40%), Gaps = 7/95 (7%) Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQS---LKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +M+ K+FY M G+ K D + +PPI EQ I ++ + Sbjct: 1 MFNLMMQLPHYAKLFYLMSDGVHIEKLLFKTNDWLERKLAMPPIGEQKRIAAILTSQDKF 60 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ID E+ + + ++ + +TG+ L G Sbjct: 61 ID----LKEKRLAEKQRQKKYLVQQLITGKKRLPG 91 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 60/189 (31%), Gaps = 5/189 (2%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W++ P++ K + + + + + +D + D I Sbjct: 95 EWQLQPLRSVLKERKSYSPKGLEYPHVTLSTEGIFPKSERYDRDHLVKNEDKE-YKITHL 153 Query: 84 GQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAIC 141 G I Y F D I S ++ + D + + +L+ D + Sbjct: 154 GDICYNPANLKFGVICENTFGDAIFSPIYVTFEVSDKVCKEYLANYLMRWDFINAVRKYE 213 Query: 142 EG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG + + +P L EQ I + + ID L + + + K Sbjct: 214 EGTVYERMAVKPEDFLKYVIRLPSLDEQNAIAKVLSTADREIDLLRQDIEQEKQKKKALM 273 Query: 200 QALVSYIVT 208 Q L++ IV Sbjct: 274 QLLLTGIVR 282 >gi|330991917|ref|ZP_08315866.1| Putative type-1 restriction enzyme MjaXP specificity protein [Gluconacetobacter sp. SXCC-1] gi|329760938|gb|EGG77433.1| Putative type-1 restriction enzyme MjaXP specificity protein [Gluconacetobacter sp. SXCC-1] Length = 423 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 57/431 (13%), Positives = 119/431 (27%), Gaps = 54/431 (12%) Query: 29 PIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + + G +S I + G G + + Sbjct: 2 KLADVIDIRHGFAFRGEFFSDSPTGFILATPGNFAIGGG-FRSGKAKYYNGPVPDEYCLS 60 Query: 83 KGQILYGKLGP--------YLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDV 133 +G I+ Y + + + + + PK + + + + Sbjct: 61 EGDIIVTMTDLSKDADTLGYSASVPASANTFLHNQRIGKIVPKGNINLRFIYWLMRTPAY 120 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I A G+T+ H I + P L +Q I + +ID Sbjct: 121 RDEILASYTGSTVKHTSPSRILSFQFDCPSLEDQGRIASILDILDNKIDLNCRTNETLEA 180 Query: 194 LLKEKKQALVSYIVTKG------------LNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 + + Q + V G L P++ P+ W V+P + Sbjct: 181 IARALFQ---DWFVGFGPTRAKMAGQAAYLAPEIWKLFPDRLDDEEKPEGWTVEPVDNVA 237 Query: 242 TELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + LN + E L ++ T + +V+ G+I+F + Sbjct: 238 SFLNGLALQKYPAGEGAFLPAIKIAQLRSESTHSADRVSVGIPCEYVVEEGDILFSWSGS 297 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY--DLCKVFYAMGSGLRQS 356 K ERG + V ++ ++ Y D + + Sbjct: 298 LLCK-----FWNGERGALNQHLFKVTSGRFPDWFIFEWIQHYMPDFQAIAESKA-TTMGH 351 Query: 357 LKFEDVKRLPVLVPPIKEQFD----ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++ + V +P I + I +++ + L E R + Sbjct: 352 IQRHHLTESLVTIPSSCVMKQADLIIGSHIRK--------IKENHKESRNLSELRDLLLP 403 Query: 413 AAVTGQIDLRG 423 ++G+I +R Sbjct: 404 RLMSGEIRIRD 414 Score = 47.9 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 54/188 (28%), Gaps = 10/188 (5%) Query: 22 PKHWKVVPIKRFTKLNTG---RTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P+ W V P+ G + +G+ + I + + S + + + Sbjct: 225 PEGWTVEPVDNVASFLNGLALQKYPAGEGAFLPAIKIAQLRSESTHSADRVSVGIPCE-- 282 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + +G IL+ G L K G + + + W+ + Sbjct: 283 --YVVEEGDILFSWSGSLLCK-FWNGERGALNQHLFKVTSGRFPDWFIFEWIQHYMPDFQ 339 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 A + TM H + + IP I + +I E EL Sbjct: 340 AIAESKATTMGHIQRHHLTESLVTIPSSCVMKQADLIIGSHIRKIKENHKESRNLSELRD 399 Query: 197 EKKQALVS 204 L+S Sbjct: 400 LLLPRLMS 407 >gi|268323492|emb|CBH37080.1| type I restriction-modification system, subunit S [uncultured archaeon] gi|268326508|emb|CBH40096.1| putative type I restriction-modification system, subunit S [uncultured archaeon] Length = 386 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 55/399 (13%), Positives = 120/399 (30%), Gaps = 49/399 (12%) Query: 30 IKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS---IFAKGQ 85 +K N G+T + I I + + L + +T G Sbjct: 22 LKNVVD-NRGKTCPTADSGIPLIATNCIVNNYLYPLYEKVRYVTEETYKTWFRDHPRPGD 80 Query: 86 ILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 +++ G R A + D C + + + P+ L L S + Q IE++ Sbjct: 81 MIFVLKGTPGRIAWVPDPIDFCVAQDMVAIRADERKIFPKYLFAVLRSDSIQQEIESLHV 140 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+ + H ++ +PI Q I + I+LL + + L Sbjct: 141 GSLIPHFKKGDFNDLIIPIVEPKLQ-----------EFIGNQYFDFSVKIDLLHRQNKTL 189 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + + +E + + L+ K + + Sbjct: 190 EAMA-------ETLFRQWFVEEADEGWEEGRLGDVIELIYGKGLKKEIRTGTGYPVIGSS 242 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 ++ Y + +V+ IV L I T+ Y+ Sbjct: 243 GVVG-------------YHSEFLVEGPGIVIGRKGTLGKVIYL---WDNFFPIDTTYYIK 286 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 K Y +L+++ + L + + + P+++ Sbjct: 287 SKVESAGLLYEYFLLKTLNFE---EMNSDSAVPGLNRDIALSTEIKIAPLEKLNKFNQFT 343 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + ID L E + I L++ R + + ++G++ + Sbjct: 344 STF---IDKLKENT-KQIRTLEKLRDTLLPKLMSGEVRI 378 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 57/189 (30%), Gaps = 15/189 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + +L G+ + + +GTG P G+S + + Sbjct: 207 EGWEEGRLGDVIELIYGKGLKKE----------IRTGTG--YPVIGSSGVVGYHSEFLVE 254 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 I+ G+ G + + D T + + K + + + T E + Sbjct: 255 GPGIVIGRKGTLGKVIYLWDNFFPIDTTYYI---KSKVESAGLLYEYFLLKTLNFEEMNS 311 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + + + I PL + + ++ + +L L Sbjct: 312 DSAVPGLNRDIALSTEIKIAPLEKLNKFNQFTSTFIDKLKENTKQIRTLEKLRDTLLPKL 371 Query: 203 VSYIVTKGL 211 +S V Sbjct: 372 MSGEVRIQF 380 >gi|227505723|ref|ZP_03935772.1| type I restriction-modification system specificity subunit [Corynebacterium striatum ATCC 6940] gi|227197691|gb|EEI77739.1| type I restriction-modification system specificity subunit [Corynebacterium striatum ATCC 6940] Length = 382 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 48/408 (11%), Positives = 115/408 (28%), Gaps = 47/408 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K+V +K + + G T+ + + ++ + D+ + T Y + + Sbjct: 3 KIVSLKEVCESDYGVTASATEQPTGTHFLRITDIVNFT-DYSGVPFVDIDDEDRRKKLLK 61 Query: 83 KGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELL---QGWLLSIDVTQRI 137 + I+ + G + + + + ++ + +PK + + Sbjct: 62 QNDIVVARTGATVGASHLFRGTEPTVFASYLVRFRPKTSDVDPVFVSYVLNSPAWKQFIF 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + + + +P + EQ I + A +I L Sbjct: 122 ANAHSKSAQPNLSAAAMMDFQFSLPEIREQQKIASVLKALDDKIAANSRIIKIATHLNIN 181 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 LV VTK L ++ + + K L E I Sbjct: 182 ----LVEKAVTKELE--------------------HLQNLADITMGSSPKGEFLNEEGIG 217 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + + E G+I+F + + + ER + Sbjct: 218 EPFFQGVRDFGELFPSERVFAEKAVRTA-QEGDILFA------VRAPIGEVNIAERPCVI 270 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 +A L +L++ + Y + + D+ + V Sbjct: 271 GRGIAAIRGKQSHLGLFYLLKGHPELWETYQSSGTVFAGINKSDLHNAVIPV------LR 324 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + + + I + +L R + ++G+I + GE+ Sbjct: 325 DSEKLEQQLTPIHERAMHALRENQVLARTRDELLPLLMSGRITV-GEA 371 >gi|228984124|ref|ZP_04144310.1| Type Ic restriction-modification system, HsdS subunit [Bacillus thuringiensis serovar tochigiensis BGSC 4Y1] gi|228775652|gb|EEM24032.1| Type Ic restriction-modification system, HsdS subunit [Bacillus thuringiensis serovar tochigiensis BGSC 4Y1] Length = 352 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 45/374 (12%), Positives = 112/374 (29%), Gaps = 28/374 (7%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKA--I 99 + +++ L T K + + + +Y + L Sbjct: 2 NPKDENLELWSLTVENGLTPKTERYNREFLVKKEDKFKAVSNNEFIYNPMNMTLGAVDLN 61 Query: 100 IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPM 159 + S ++ ++ K+ G L + ++ + ++ + Sbjct: 62 LTGKKVAVSGYYITMKTKENYDNNYFGVWLKTPLAIKMYKLYATGSLVE-RQRVQFPTLS 120 Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 I L + ++KI A ++D IT + I +LK+ KQA + + K +++ Sbjct: 121 QIKTLVPSLEEQKKIGALFKQLDDTITLHQQEITVLKQTKQAFLQKMFPKEGKSVPEVRF 180 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 G + +++ L + +KL+++ + + Sbjct: 181 PGFTGEWEL---RKIREIGDLSAGGDINKSKLVDNEKYPVLANALTNDGIVGYYDEYKIE 237 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 I G++ N +R + G + +L Sbjct: 238 APAVTITGRGDVGHAKARHINFTPVVRLLVLKADG----------------FDVDFLENC 281 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + +F + S L + + P +EQ I ++D + + Sbjct: 282 INTRNIF--VESTGVPQLTVPQLGTYEISFPSFREQTKIGRF----FKQLDDTISLHQSE 335 Query: 400 IVLLKERRSSFIAA 413 I L++ + +F+ Sbjct: 336 IEALQKTKKAFLQK 349 >gi|118578797|ref|YP_900047.1| restriction modification system DNA specificity subunit [Pelobacter propionicus DSM 2379] gi|118501507|gb|ABK97989.1| restriction modification system DNA specificity domain protein [Pelobacter propionicus DSM 2379] Length = 590 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 56/506 (11%), Positives = 128/506 (25%), Gaps = 114/506 (22%) Query: 20 AIPKHWKVVPIKR-FTKLNTG--RTSESGK--DIIYIGLEDV------------------ 56 +P+ W+ V + K+ G + + + D +YI +++ Sbjct: 86 ELPQGWEWVRLGEAMLKITDGTHHSPPNNEKGDFLYISAKNIKDDGVLISNATYVTEEVH 145 Query: 57 ---------ESGTGKYLPKDGNSRQSDTSTVSI----------------FAKGQILYGKL 91 E G Y+ + + + +L+ Sbjct: 146 DEIFSRCDPEYGNILYIKDGATTGIVTINDLKEPFSMLSSVALLKQPHQVDNRYLLFTLR 205 Query: 92 GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADW 151 P+ + A G+ T+ + + D + L V + + + + Sbjct: 206 SPFFYGEMRAGMTGVAITRVTLKKLHDAIIPLPPLSEQHRIVARIDQLMARCDELEKLRK 265 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIA----------------------ETVRIDTLITERI 189 + +Q+L + + E Sbjct: 266 EREEKRLAVHAAAIKQLLDSNFASSRLRVSQDSSSLRAFVPSCETGGAFDFLAKHFGELY 325 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV--------------------- 228 E + E ++A++ V L P E + + Sbjct: 326 TVKENVAELRKAILQLAVMGRLVPQDPNDPPASELLREIEKEKVSREGAKTRRKETKLPP 385 Query: 229 ----------PDHWEVKPFFALVTELNRKNTK--------LIESNILSLSYGNIIQKLET 270 P WE ++ + G++ + T Sbjct: 386 IKPEKVPYQLPKGWEWVRLGDAGAFERGRSKHRPRNDKRLFEHGTYPFVQTGDVSRSKAT 445 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR-SAQVMERGIITSAYMAVKPHGID 329 N + SY + + + ++ + + I + +A Sbjct: 446 ENQIMTCTSYYNDFGLKQSRLWEKGTLCITIAANIAETGFLGMDACIPDSVVAFLGVNKS 505 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 L + + + S ++++ + L +PP+ EQ I I+ A Sbjct: 506 LEKLVKVFIDVAKGDLEHFAPSTAQKNINLGIINELLFPLPPLNEQHRIVARIDQLMALC 565 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAV 415 D L EQ I +R+ + A + Sbjct: 566 DTL----EQRIDAATVKRTELLGAVM 587 Score = 76.4 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 64/190 (33%), Gaps = 11/190 (5%) Query: 223 EWVGLVPDHWEVKPFFA---LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 E +P WE +T+ + E I+ + Sbjct: 82 EVPYELPQGWEWVRLGEAMLKITDGTHHSPPNNEKGDFLYISAKNIKDDGVLISNATYVT 141 Query: 280 YETYQIV------DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 E + + + G I++ ++ + +++S + +PH +D+ YL Sbjct: 142 EEVHDEIFSRCDPEYGNILYIKDGATTGIVTINDLKEP-FSMLSSVALLKQPHQVDNRYL 200 Query: 334 AWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + +RS A +G + + + + +PP+ EQ I I+ AR D L Sbjct: 201 LFTLRSPFFYGEMRAGMTGVAITRVTLKKLHDAIIPLPPLSEQHRIVARIDQLMARCDEL 260 Query: 393 VEKIEQSIVL 402 + ++ Sbjct: 261 EKLRKEREEK 270 >gi|282932598|ref|ZP_06338019.1| type I restriction modification DNA specificity domain protein [Lactobacillus jensenii 208-1] gi|281303294|gb|EFA95475.1| type I restriction modification DNA specificity domain protein [Lactobacillus jensenii 208-1] Length = 405 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 45/410 (10%), Positives = 126/410 (30%), Gaps = 34/410 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG---KYLPKDGNSRQSDTSTVS 79 + W+ +K + +G + + D + + + + + + Sbjct: 12 ESWRTEKLKNIGESFSGLSGKKSSDFGHGEAKYITYLNILNNPIIDTKLTDKIEIDNKQH 71 Query: 80 IFAKGQILYGKLGPYLRKAII---------ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + KG I + ++ + + S + + + S Sbjct: 72 LVKKGDIFFTISSETPQEVGLSSVLDTNLNECYLNSFSFGYRLKEISMFDNLFNSYNFRS 131 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +++ + +G + + K + N + P ++EQ I + I + + Sbjct: 132 PNFRRKMYILAQGISRYNISKKAVLNETICFPKISEQKQIGKLIKLMNSLLSLQQRKLEL 191 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 +L K+ L S+ +T K + + ++ + + + K Sbjct: 192 ENKLKKQIAFYLYSFTLTP------NFKHIEV-------KNKKLGDIVDISNGIMGDSQK 238 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL---QNDKRSLRS 307 + L+ K++ G + + + ++ G+I++ I+ ++ Sbjct: 239 KSGNFKLTRIETISNGKIDLSRTGYIDQVSDEKKFLEVGDILYSNINSLTHIGKNAIVKE 298 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRL 365 + I + + + I YL L+ + + + S+ ++ L Sbjct: 299 KHLPLVHGINLFRLHITNNQITPNYLHGLLNLPKYKWWVKSHANPAVNQASINKTELSSL 358 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + P + Q I N IN A+ + L + + + Sbjct: 359 VIKYPDLDIQNQI-NNINYSFAQYWDI---QYSKKESLCQLKQFLLQNLF 404 >gi|154488694|ref|ZP_02029543.1| hypothetical protein BIFADO_02001 [Bifidobacterium adolescentis L2-32] gi|154082831|gb|EDN81876.1| hypothetical protein BIFADO_02001 [Bifidobacterium adolescentis L2-32] Length = 392 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 58/377 (15%), Positives = 116/377 (30%), Gaps = 38/377 (10%) Query: 26 KVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKD-----GNSRQSD 74 K V I K +G T S I +IG + GK+L K+ Sbjct: 11 KKVTIGELGKTQSGGTPSSKHPEFFNGSIPWIGTTAL---NGKFLGKNDAVKLITEEAVA 67 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL-VLQPKDVLPELLQGWLLSIDV 133 S I + I+ G + + K I S + ++ + L Sbjct: 68 KSATKIVPEKSIMVG-IRVGVGKVAINAVPMCTSQDIVSIVGIDEASWNKEYISLALQYK 126 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + A +GAT++ K + I +P P+ EQ + + + ++ + + Sbjct: 127 APLLAAQAQGATIAGITSKTLKAIEIPAIPINEQNRVVDILRKLENQVGFVRKQLCGLDA 186 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L + S V + +K W + + + + I Sbjct: 187 L-------VKSRFVEMFGD----LKSDTNGWPIKPFETFAIIDTHMANDLTPYLDMPHIG 235 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + G + G+ Y P +++ I +K +L Sbjct: 236 IDSIESGTGRLSGYRTVAEDGIISGKYP----FTPEHLIYSKIRPSLNKVALPDFS---- 287 Query: 314 GIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVP 370 G+ +S + P + YLA +MRS + + + + + + + +P Sbjct: 288 GVCSSDAYPILPIAGECNRVYLAEVMRSAYFLEYILPLSGRAQMPKVNKKALSGFSMPLP 347 Query: 371 PIKEQFDITNVINVETA 387 PI+ Q + Sbjct: 348 PIELQQQFAAFVAQVDK 364 >gi|108935909|sp|P10485|T1S1_ECOLX RecName: Full=Type-1 restriction enzyme EcoR124II specificity protein; Short=S.EcoR124II; AltName: Full=Type I restriction enzyme EcoR124II specificity protein; Short=S protein gi|84310051|emb|CAB37630.2| unnamed protein product [Escherichia coli] Length = 404 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 61/400 (15%), Positives = 118/400 (29%), Gaps = 50/400 (12%) Query: 26 KVVPIKRFTK-------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + +P+ TK L + I + ++ Y + Q+ + V Sbjct: 17 EWLPLGEITKYEQPTKYLVKAKDYHDTYTIPVLTAG--KTFILGYTNETHGIYQASKAPV 74 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 IF + DFD + + + + L ++ T E Sbjct: 75 IIF----------DDFTTANKWVDFDFKAKSSAMKMVTSCDDNKTLLKYVYYWLNTLPSE 124 Query: 139 AICEGATMSHADWKGIGNIPMPIPP-----LAEQVLIREKIIAETVRIDTLITERIRFIE 193 IP+P P LA Q I + T L E + Sbjct: 125 FAEGDHKRQWISNYSQKKIPIPCPDNPEKSLAIQSEIVRILDKFTALTAELTAELNMRKK 184 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKL 251 + L+S K+ +EW +G + ++ T K Sbjct: 185 QYNYYRDQLLS------------FKEGEVEWKTLGEI------GKWYGGGTPSKNKIEFW 226 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 +I +S ++ + L + E + + +++ I DK L SA Sbjct: 227 ENGSIPWISPKDMGRTLVDSSEDYITEEAVLHSSTKLIPANSIAIVVRSSILDKV-LPSA 285 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLP 366 + + AV PH + M + G S+ + + Sbjct: 286 LIKVPATLNQDMKAVIPHENILVKYIYHMIGSRGSDILRAAKKTGGSVASIDSKKLFSFK 345 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + VP I EQ I +++ + + E + + I L +++ Sbjct: 346 IPVPNINEQQRIVEILDKFDTLTNSITEGLPREIELRQKQ 385 Score = 43.6 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 54/184 (29%), Gaps = 11/184 (5%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 P + + + ++ +T +G E++ YQ I+F Sbjct: 18 WLPLGEITKYEQPTKYLVKAKDYHDTYTIPVLTAGKTFILGYTNETHGIYQASKAPVIIF 77 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 N + + + Y+ + + + +A G Sbjct: 78 DDFTTANK---WVDFDFKAKSSAMKMVTSCDDNKTLLKYVYYWLNTLPSE---FAEGDHK 131 Query: 354 RQSLKFEDVKRLPVLVP-----PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 RQ + K++P+ P + Q +I +++ TA L ++ R Sbjct: 132 RQWISNYSQKKIPIPCPDNPEKSLAIQSEIVRILDKFTALTAELTAELNMRKKQYNYYRD 191 Query: 409 SFIA 412 ++ Sbjct: 192 QLLS 195 >gi|226225586|ref|YP_002759692.1| type I restriction-modification system restriction subunit [Gemmatimonas aurantiaca T-27] gi|226088777|dbj|BAH37222.1| type I restriction-modification system restriction subunit [Gemmatimonas aurantiaca T-27] Length = 445 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 52/442 (11%), Positives = 120/442 (27%), Gaps = 45/442 (10%) Query: 24 HWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + + G + + I + + G G + Sbjct: 4 EWRECSLGELIDIKHGFAFQGEFIRDESRGDILLTPGNFSIGGG-FKSDKFKYFDGPVPG 62 Query: 78 VSIFAKGQILYGKLG----------PYLRKAIIADFDGICSTQ---FLVLQPKDVLPELL 124 + A+ +L P A + + + LV + + L Sbjct: 63 DFVLAEADLLVTMTDLSKQSDTLGLPAFVPARSDGRRYLHNQRLGKILVKDQQAIDSRFL 122 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 L S D + A G T+ H + I P L EQ I + +I+ Sbjct: 123 HYLLCSADYRNEVLASATGTTVKHTSPERIKRFRFSRPLLDEQRAIAHILGTLDDKIELN 182 Query: 185 ITERIRFIELLKEKKQALVSYI--VTKGLNPDVKMKDSGIEWV----------GLVPDHW 232 E+ + ++ V + I + G +P++W Sbjct: 183 RRMSETLEEMARALFKSWFVDFDPVRAKADGRHHCLPQPIAELFPDSFEGSEMGEIPNNW 242 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI------- 285 E+K L + E + + + + Sbjct: 243 ELKTIGDLADVVGGGTPSTKEPTFWEDGTHAWATPKDLSGLSVPVLLETERYVTSLGLSQ 302 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + G + + L + A I ++A+KP S L S+ ++ Sbjct: 303 IGSGLLPRGTVLLSSRAPIGYLAVAETPVAINQGFIAMKPKAGVSNLFLLLWASFAHDQI 362 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD-ITNVINVETARIDVLVEKIEQSIVLLK 404 + + + +P++ P I + + + + ++ L Sbjct: 363 VSRANGSTFLEISKANFRPIPMVAP-----RACIMDAFDRLARPLYERIVACAKASRTLT 417 Query: 405 ERRSSFIAAAVTGQIDLRGESQ 426 R + + ++G++ ++ + Sbjct: 418 ALRDTLLPKLISGELRVKDAER 439 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 56/196 (28%), Gaps = 12/196 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYI-------GLEDVESGTGKY---LPKDG 68 G IP +W++ I + G T + + + +D+ + + Sbjct: 236 GEIPNNWELKTIGDLADVVGGGTPSTKEPTFWEDGTHAWATPKDLSGLSVPVLLETERYV 295 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 S + +G +L P +A+ + F+ ++PK + L L Sbjct: 296 TSLGLSQIGSGLLPRGTVLLSSRAPI-GYLAVAETPVAINQGFIAMKPKAGVSN-LFLLL 353 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + +I + G+T IPM P RI Sbjct: 354 WASFAHDQIVSRANGSTFLEISKANFRPIPMVAPRACIMDAFDRLARPLYERIVACAKAS 413 Query: 189 IRFIELLKEKKQALVS 204 L L+S Sbjct: 414 RTLTALRDTLLPKLIS 429 >gi|77415025|ref|ZP_00791100.1| Type I restriction modification DNA specificity domain protein [Streptococcus agalactiae 515] gi|77158925|gb|EAO70161.1| Type I restriction modification DNA specificity domain protein [Streptococcus agalactiae 515] Length = 385 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 29/209 (13%), Positives = 67/209 (32%), Gaps = 12/209 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGL 275 +E +P+ W + + + K NI ++ ++ ++ + Sbjct: 77 EVEVPYEIPESWNWVKLRNIGSITSGGTPKSSEPSYYGGNITWITPADMGKQQNNKFFAK 136 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + + + R+ V E ++ P +D +L Sbjct: 137 SSKKITELGLQKSSAQLISKNSIVYSSRAPIGHINIVTEDYTTNQGCKSITPLLVDLIFL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 WL++ + + + + + +PP+ EQ I I A++D Sbjct: 197 YWLLQ-FRTKDIILRSSGTTFKEISASGFGDTLLPLPPLAEQKRIVAQIEKALAKVDEYA 255 Query: 394 EKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 256 ESYNKLQQLDKEFPDKLKKSILQYAMQGK 284 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 79/212 (37%), Gaps = 17/212 (8%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV----ESGTGKY 63 V+ IP+ W V ++ + +G T +S + +I +I D+ + Sbjct: 77 EVEVPYEIPESWNWVKLRNIGSITSGGTPKSSEPSYYGGNITWITPADMGKQQNNKFFAK 136 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 K S+ + +K I+Y P I+ + + + P V L Sbjct: 137 SSKKITELGLQKSSAQLISKNSIVYSSRAPIGHINIVTEDY-TTNQGCKSITPLLVD--L 193 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + L T+ I G T G G+ +P+PPLAEQ I +I ++D Sbjct: 194 IFLYWLLQFRTKDIILRSSGTTFKEISASGFGDTLLPLPPLAEQKRIVAQIEKALAKVDE 253 Query: 184 LITERIRFIELLKEKK----QALVSYIVTKGL 211 + +L KE ++++ Y + L Sbjct: 254 YAESYNKLQQLDKEFPDKLKKSILQYAMQGKL 285 >gi|56707658|ref|YP_169554.1| hypothetical protein FTT_0523 [Francisella tularensis subsp. tularensis SCHU S4] gi|110670129|ref|YP_666686.1| hypothetical protein FTF0523 [Francisella tularensis subsp. tularensis FSC198] gi|56604150|emb|CAG45156.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis SCHU S4] gi|110320462|emb|CAL08539.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC198] Length = 408 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 56/388 (14%), Positives = 124/388 (31%), Gaps = 32/388 (8%) Query: 44 SGKDIIYIGLEDVES---------GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY 94 S +I ++ ++D ES G G Y+ + N ++ + + K+ Sbjct: 9 SKANIEWVKIQDKESYPILGVRGQGQGVYINRIANGKELTMKKYQKSEPYHLFFCKVRTV 68 Query: 95 LRKAI-----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + A+ + Q+L + +LPE L+ L +T + GA H Sbjct: 69 KGQWGVVYPEYANSYASSNMQYLKIDLDKILPEYLEMLLKLKKITDIWDKNAIGADGRHF 128 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL--VSYIV 207 K + + +P+PP+ Q I + + + L + +++ A + Sbjct: 129 PLKILLTLQIPLPPIEIQKQIVQAYEDKINLANQLEQRAEKLEAKIEKYLYAKLGIEQAQ 188 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT-------------KLIES 254 + + +K E + + + Sbjct: 189 EQKQDKKGLLKFVRFEQLQRWDTDFFKQKEGYSSKYETVSYEDLFVSLNNGIAARNYASD 248 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I L +I + Y+ +++ G ++ + L + Sbjct: 249 GIRYLKVSDIKDNYINNDKPFYVNKYKESDLIEKGTLLITRKGTVGNSYYL--DKDGSFV 306 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 + ++ ++ YL+ + S + K + +G SL +K + + +PP+K Sbjct: 307 ASSEIFIIKLNDKVNGNYLSEINLSSFVKKQYREKSTGTIMPSLSQPKLKSILIPLPPLK 366 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIV 401 Q I I I L ++ EQ+ Sbjct: 367 IQNHIAVRIQKLKDYIKALEQQAEQNRE 394 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 47/141 (33%), Gaps = 2/141 (1%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQ-NDKRSLRSAQVMERGIITSAYMAVKPH 326 R K + + YQ +P + F + + Y+ + Sbjct: 37 YINRIANGKELTMKKYQKSEPYHLFFCKVRTVKGQWGVVYPEYANSYASSNMQYLKIDLD 96 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I YL L++ + ++ G + + + L + +PPI+ Q I + Sbjct: 97 KILPEYLEMLLKLKKITDIWDKNAIGADGRHFPLKILLTLQIPLPPIEIQKQIVQAYEDK 156 Query: 386 TARIDVLVEKIEQSIVLLKER 406 + L ++ E+ +++ Sbjct: 157 INLANQLEQRAEKLEAKIEKY 177 >gi|330941025|gb|EGH43947.1| restriction modification system DNA specificity domain [Pseudomonas syringae pv. pisi str. 1704B] Length = 293 Score = 81.8 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 64/198 (32%), Gaps = 10/198 (5%) Query: 227 LVPDHWEVKPFFA---LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY--- 280 +P WE +T+ IE + LS ++ N Sbjct: 96 QLPATWEWARLADVAFQITDGAHHTPTYIEFGVPFLSVKDMSGGSLGFNATRYISEEAHE 155 Query: 281 --ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 G+++ I + ++ + ++ +YL L+ Sbjct: 156 QLTKRCHPQRGDLLLTKIGTTG-VPVIVDTDRPFSIFVSVGLIKAPWDHLNVSYLQLLIS 214 Query: 339 SYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S + K G+ ++L + + +PP+ EQ I ++ D L ++ Sbjct: 215 SPFVKKQSLDGTEGVGNKNLVLRKIANFLIAIPPLAEQRRIVIKVDELMTLCDQLKIRLT 274 Query: 398 QSIVLLKERRSSFIAAAV 415 Q+ L ++ S+ + AV Sbjct: 275 QARQLNEQLASTLVEQAV 292 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 67/197 (34%), Gaps = 9/197 (4%) Query: 20 AIPKHWKVVPIKRF-TKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W+ + ++ G + ++ ++D+ G+ + S ++ Sbjct: 96 QLPATWEWARLADVAFQITDGAHHTPTYIEFGVPFLSVKDMSGGSLGFNATRYISEEAHE 155 Query: 76 STVSIFAK--GQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 G +L K+G I+ F S + + LQ + S Sbjct: 156 QLTKRCHPQRGDLLLTKIGTTGVPVIVDTDRPFSIFVSVGLIKAPWDHLNVSYLQLLISS 215 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 V ++ EG + + I N + IPPLAEQ I K+ D L + Sbjct: 216 PFVKKQSLDGTEGVGNKNLVLRKIANFLIAIPPLAEQRRIVIKVDELMTLCDQLKIRLTQ 275 Query: 191 FIELLKEKKQALVSYIV 207 +L ++ LV V Sbjct: 276 ARQLNEQLASTLVEQAV 292 >gi|32455436|ref|NP_862551.1| hypothetical protein pSRQ900_03 [Lactococcus lactis] gi|14251234|gb|AAC98712.2| HsdS [Lactococcus lactis] Length = 396 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 56/408 (13%), Positives = 128/408 (31%), Gaps = 43/408 (10%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ K K ++ R+ +G E ++ G + Sbjct: 15 KVPELRFKGFTDEWEERKFKDILKTHSFRSYLAGVSEN-GEYEVIQQGDKPIVGYSDGEP 73 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +D V++F G P + D I S +L + Sbjct: 74 FTDYKDVTLF--GDHTVSLYKPKSPFFVATDGVKILSA-----------DNFEGNYLYTT 120 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + E + + + +KI + ++D I Sbjct: 121 LERYKPEPQGYKRHFTILKNQDVWFTENMEEQ--------QKIGSFFKQLDDTIALHQHK 172 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++LLKE+K+ + + K +++ +G + + ++ + N K Sbjct: 173 LDLLKEQKKGFLQKMFPKNGAKVPELRFAGFD---DDWEQRKLGDLAEIKDSARIPNIKW 229 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIV----DPGEIVFRFIDLQNDKRSLRS 307 + + L ++ + + L Y Y + G+++F Sbjct: 230 QKEGVPYLRSSDLSSEHIKDGLFLSLADYMKYDKITGSPKKGDLIFASGGDIGLAIYKHD 289 Query: 308 AQVMERGIITSAYMAV-KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRL 365 + + + Y+ K +D +L + S + K +G + + L Sbjct: 290 SLPIYVQGGSILYVKTSKCENLDGLFLKYSFASPKVKKYIRNASTGTSLKHFVLKPANAL 349 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P+ P + EQ I +++ D + ++ + LLKE++ F+ Sbjct: 350 PMSYPDLIEQEKIGSLLMQM----DRTITLHQRKLDLLKEQKKGFLQK 393 >gi|227541297|ref|ZP_03971346.1| restriction modification system DNA specificity subunit [Corynebacterium glucuronolyticum ATCC 51866] gi|227182848|gb|EEI63820.1| restriction modification system DNA specificity subunit [Corynebacterium glucuronolyticum ATCC 51866] Length = 332 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 52/350 (14%), Positives = 111/350 (31%), Gaps = 25/350 (7%) Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 ++S IF G +L+ + + K I + + +Q D EL + + Sbjct: 2 NSSAAKIFPAGTLLFS-IFATVGKCSILEIKAATNQAIAGIQISDSNVELPYLYHYLSYL 60 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +IE+ +G ++ + K + + +P+PPL EQ I + Sbjct: 61 RPQIESRAKGVAQNNINLKTLKQLEIPLPPLEEQRRIATILEKANSL--------RNAPP 112 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + +VS V L E + + +++ + + I Sbjct: 113 RTEVHINNIVSQFVENRL-------LRSNEKFVKLSELCDIQSGITKGRKTKKALAAKIP 165 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +S + + + + + E E Y + ++ D R + Sbjct: 166 YLAVSNVKDGYLDLSKVKEIEVTNEEIEKYALHKGDILLTEGGDPDKLGRGCLWNDEIPN 225 Query: 314 GIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVL 368 + + V+ I + L ++ S +L F + S+ + + Sbjct: 226 CLHQNHIFRVRLKDKQAIPANVLMAILSSKELKSYFLKSAKQTTGIASINRTQLSNASIP 285 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + I I+ + L+ +LL E S A A TG+ Sbjct: 286 ILDNET---IAE-IDCLLFMCEKLMATNTSRTLLLDELIQSLSARAFTGE 331 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 60/201 (29%), Gaps = 19/201 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + + +G T I Y+ + +V+ G ++ Sbjct: 136 KFVKLSELCDIQSGITKGRKTKKALAAKIPYLAVSNVKDGYLDLSKVKEIEVTNEEIEKY 195 Query: 80 IFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVL------PELLQGWLLS 130 KG IL + G R + D C Q + + + L+ Sbjct: 196 ALHKGDILLTEGGDPDKLGRGCLWNDEIPNCLHQNHIFRVRLKDKQAIPANVLMAILSSK 255 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +++ + ++ + + N +PI +I + L+ Sbjct: 256 ELKSYFLKSAKQTTGIASINRTQLSNASIPILD----NETIAEIDCLLFMCEKLMATNTS 311 Query: 191 FIELLKEKKQALVSYIVTKGL 211 LL E Q+L + T L Sbjct: 312 RTLLLDELIQSLSARAFTGEL 332 >gi|224456728|ref|ZP_03665201.1| hypothetical protein FtultM_02792 [Francisella tularensis subsp. tularensis MA00-2987] gi|282158820|gb|ADA78211.1| hypothetical protein NE061598_02955 [Francisella tularensis subsp. tularensis NE061598] Length = 401 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 56/388 (14%), Positives = 124/388 (31%), Gaps = 32/388 (8%) Query: 44 SGKDIIYIGLEDVES---------GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY 94 S +I ++ ++D ES G G Y+ + N ++ + + K+ Sbjct: 2 SKANIEWVKIQDKESYPILGVRGQGQGVYINRIANGKELTMKKYQKSEPYHLFFCKVRTV 61 Query: 95 LRKAI-----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + A+ + Q+L + +LPE L+ L +T + GA H Sbjct: 62 KGQWGVVYPEYANSYASSNMQYLKIDLDKILPEYLEMLLKLKKITDIWDKNAIGADGRHF 121 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL--VSYIV 207 K + + +P+PP+ Q I + + + L + +++ A + Sbjct: 122 PLKILLTLQIPLPPIEIQKQIVQAYEDKINLANQLEQRAEKLEAKIEKYLYAKLGIEQAQ 181 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT-------------KLIES 254 + + +K E + + + Sbjct: 182 EQKQDKKGLLKFVRFEQLQRWDTDFFKQKEGYSSKYETVSYEDLFVSLNNGIAARNYASD 241 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I L +I + Y+ +++ G ++ + L + Sbjct: 242 GIRYLKVSDIKDNYINNDKPFYVNKYKESDLIEKGTLLITRKGTVGNSYYL--DKDGSFV 299 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 + ++ ++ YL+ + S + K + +G SL +K + + +PP+K Sbjct: 300 ASSEIFIIKLNDKVNGNYLSEINLSSFVKKQYREKSTGTIMPSLSQPKLKSILIPLPPLK 359 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIV 401 Q I I I L ++ EQ+ Sbjct: 360 IQNHIAVRIQKLKDYIKALEQQAEQNRE 387 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 47/141 (33%), Gaps = 2/141 (1%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQ-NDKRSLRSAQVMERGIITSAYMAVKPH 326 R K + + YQ +P + F + + Y+ + Sbjct: 30 YINRIANGKELTMKKYQKSEPYHLFFCKVRTVKGQWGVVYPEYANSYASSNMQYLKIDLD 89 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I YL L++ + ++ G + + + L + +PPI+ Q I + Sbjct: 90 KILPEYLEMLLKLKKITDIWDKNAIGADGRHFPLKILLTLQIPLPPIEIQKQIVQAYEDK 149 Query: 386 TARIDVLVEKIEQSIVLLKER 406 + L ++ E+ +++ Sbjct: 150 INLANQLEQRAEKLEAKIEKY 170 >gi|220932853|ref|YP_002509761.1| restriction modification system DNA specificity domain protein [Halothermothrix orenii H 168] gi|219994163|gb|ACL70766.1| restriction modification system DNA specificity domain protein [Halothermothrix orenii H 168] Length = 565 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 50/316 (15%), Positives = 100/316 (31%), Gaps = 17/316 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLP---KDGN 69 +P+ W+ V + ++ G T ++ +I ++ D+ KY+ ++ Sbjct: 81 ELPESWEWVRLGNIGRIVGGGTPKTKVHAYWENGNIAWLTPADLNGLKSKYISRGRRNIT 140 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 S+ + KG +L+ P IA D + F P + + L Sbjct: 141 KLGLQNSSAKLLPKGSVLFSSRAPI-GYVAIAQNDLATNQGFKSCVPYIMDMNQYIYYFL 199 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 D + + G T K + N P+PPL EQ I K+ D L Sbjct: 200 MYDAKRINDNA-SGTTFKEVSGKEVANFIFPLPPLNEQKRIVNKLDELMTFCDQLEVSLE 258 Query: 190 RFIELLKEKKQALVSYIV----TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + +++ + I + L+ ++ ++ V P++ L + Sbjct: 259 KKANAKQLVSKSISNRIQKSKSKEELDKNITFIIRNLKEVYTTPENLNDLKDIILQLAIQ 318 Query: 246 RKNTKLIESNILSLSYGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 K + + I K + R K + + EI F R Sbjct: 319 GKLVPQDPDDEPASVLIEKINKEKERLIKEKKIRKTKPLPPIKEAEIPFELPKGWEWVRL 378 Query: 305 LRSAQVMERGIITSAY 320 + +R + Sbjct: 379 GEIMIINQRNKLNDNL 394 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 51/186 (27%), Gaps = 7/186 (3%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 E +P+ WE + + K + + + K S Sbjct: 75 EEEIPFELPESWEWVRLGNIGRIVGGGTPKTKVHAYWENGNIAWLTPADLNGLKSKYISR 134 Query: 281 ETYQIVDPGE-------IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 I G + + + A + + P+ +D Sbjct: 135 GRRNITKLGLQNSSAKLLPKGSVLFSSRAPIGYVAIAQNDLATNQGFKSCVPYIMDMNQY 194 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + YD ++ + + ++V +PP+ EQ I N ++ D L Sbjct: 195 IYYFLMYDAKRINDNASGTTFKEVSGKEVANFIFPLPPLNEQKRIVNKLDELMTFCDQLE 254 Query: 394 EKIEQS 399 +E+ Sbjct: 255 VSLEKK 260 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 73/222 (32%), Gaps = 10/222 (4%) Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK--NTKLIESNILSL 259 L+ + P +K E +P WE ++ R N L S + Sbjct: 345 LIKEKKIRKTKPLPPIK--EAEIPFELPKGWEWVRLGEIMIINQRNKLNDNLEVSFVPMK 402 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS- 318 + + L E + Y ++V I + R + + G Sbjct: 403 LIEDGYLSKHSHKKKLWKEVKKGYTHFKENDLVVAKITPCFENRKSAIMKNLYSGYGAGT 462 Query: 319 ---AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 + ID + +++++ + + +G +Q ++ + ++ + +PP+ Sbjct: 463 TELHVLTSYLKEIDMKFFLYIVKAKNFINQGVSTFTGTAGQQRIRKDFIENFVIGLPPLN 522 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ I I+ A ++L +I ++ + S Sbjct: 523 EQKQIVKKIDKLMALCNLLENQINKNRNNSELLMKSLQRKLF 564 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 72/216 (33%), Gaps = 11/216 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT 60 ++ K P K++ + + +PK W+ V + +N ++ ++ ++ +E G Sbjct: 351 IRKTKPLPPIKEAEIPF--ELPKGWEWVRLGEIMIINQRNKLNDNLEVSFVPMKLIEDGY 408 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--------YLRKAIIADFDGICSTQFL 112 + + F + ++ K+ P ++ G L Sbjct: 409 LSKHSHKKKLWKEVKKGYTHFKENDLVVAKITPCFENRKSAIMKNLYSGYGAGTTELHVL 468 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIR 171 K++ + + + + + + G I N + +PPL EQ I Sbjct: 469 TSYLKEIDMKFFLYIVKAKNFINQGVSTFTGTAGQQRIRKDFIENFVIGLPPLNEQKQIV 528 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 +KI + L + + + ++L + Sbjct: 529 KKIDKLMALCNLLENQINKNRNNSELLMKSLQRKLF 564 >gi|126208116|ref|YP_001053341.1| putative type I restriction system specificity protein [Actinobacillus pleuropneumoniae L20] gi|126096908|gb|ABN73736.1| putative type I restriction system specificity protein [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 413 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 43/413 (10%), Positives = 121/413 (29%), Gaps = 34/413 (8%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLED-VESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + G++ S D+ + D + +G ++ + Q KG +L Sbjct: 3 KLGDIADIVMGQSPSSS-DVNMERIGDPLLNGPTEFTSFYPSPVQYTEKGKKFAEKGDLL 61 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 + G + AD ++ K+ P L+ D +RI G+T Sbjct: 62 FCVRGSTTGRINFADQKYAIGRGLAAIRGKNGYPT-KFIELILKDCLERILQSATGSTFP 120 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV---- 203 + + ++ + L E + I + + +I ++ + ++ Sbjct: 121 NVSQAMLLDLDIGDFSLPEAIKIADILGIIDHKIHLNTQTNQTLEQIAQAIFKSWFVDFE 180 Query: 204 -SYIVTKGLNPDVKMKDSGI--EWVG-----LVPDHWEVKPFFALVTELNRKNTKLIESN 255 +G N V SG E + ++ ++ + ++ + Sbjct: 181 PVKAKMQGGNLAVMEAISGKNSEELHRLQTENPTEYQKLWAIADAFPDEIGEDGIPVGWE 240 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV-----------FRFIDLQNDKRS 304 + L I +N+ +K Y + I+ + + Sbjct: 241 NVYLKDVCNIVYG--KNLPIKKLQEFGYPVFGGNGIIGFYEKFLYEEPHTLVSCRGAASG 298 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + ++ + S ++ + + + + + ++ Sbjct: 299 KVMYSQPYSFVTNNSLVIEHSKSFLS--YFYIYEALRIQTLVELTTGSAQPQMTIANMNP 356 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + +++P I N+ + + + + L++ R + + G Sbjct: 357 VQIILPT----DKIHNLYTSQVKYLYEKIYRNNLENEQLEKIRDELLPKLLNG 405 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 60/198 (30%), Gaps = 18/198 (9%) Query: 18 IGA--IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IG IP W+ V +K + G+ K + +G + K Sbjct: 230 IGEDGIPVGWENVYLKDVCNIVYGKNLPIKKLQEFGYPVFGGNGIIGFYEK--------- 280 Query: 76 STVSIFAKGQILYGKLGPYLRKAII-ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ + L G K + + + + ++ K L ++ Sbjct: 281 ---FLYEEPHTLVSCRGAASGKVMYSQPYSFVTNNSLVIEHSKSFLS---YFYIYEALRI 334 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 Q + + G+ + + + +P L ++ +I E + ++ Sbjct: 335 QTLVELTTGSAQPQMTIANMNPVQIILPTDKIHNLYTSQVKYLYEKIYRNNLENEQLEKI 394 Query: 195 LKEKKQALVSYIVTKGLN 212 E L++ + ++ Sbjct: 395 RDELLPKLLNGDLCNTMD 412 >gi|326334515|ref|ZP_08200726.1| type I restriction enzyme EcoR124II specificity protein [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325693284|gb|EGD35212.1| type I restriction enzyme EcoR124II specificity protein [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 395 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 52/398 (13%), Positives = 115/398 (28%), Gaps = 39/398 (9%) Query: 27 VVPIKRFTKLNT-----GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ + ++S L ++ Y + ++ S V IF Sbjct: 22 WKPLGEIAEYEQPTKYLVKSSNYKDIYPTPVLTAGKTFILGYTDETEGIYKASISPVIIF 81 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + DFD + + + E+L ++ T E I Sbjct: 82 ----------DDFTTANKWVDFDFKAKSSAMKMITSKNEKEVLLKYIYYWLNTLPSELIE 131 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 IP +PPL+ Q I + T + + E + Sbjct: 132 GDHKRQWISNYANKKIP--LPPLSVQQEIVRILDKFTQL-----EAELDCRKRQYEYYRN 184 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + G ++ K +G V + Sbjct: 185 KLLTFNEIGGGTEIVWKT-----LGEVGTFIRGNGLQKKDLITSGVPAIHYGQIYTYYGI 239 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY- 320 +E + E+ E + VD G+++ D A ++ +T + Sbjct: 240 S-----VEQTISFVSRETAEGLRKVDYGDVIITNTSENIDDVGKAVAYCVKEQGVTGGHA 294 Query: 321 -MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDI 378 + I YL + ++ + G + + D+ ++ + +PP+ EQ I Sbjct: 295 TIFKPSEKIIGKYLVYYTQTTEFSNQKRKYAKGTKVIDISANDLTKITIPLPPLSEQQRI 354 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +++ ++ + E + + I L ++ R ++ Sbjct: 355 ATILDKFDTLVNSISEGLPKEIALRRKQYEYYRERLLS 392 >gi|294339002|emb|CAZ87347.1| putative type I restriction-modification (R-M) system HsdS [Thiomonas sp. 3As] gi|294341829|emb|CAZ90258.1| putative type I restriction-modification (R-M) system HsdS [Thiomonas sp. 3As] Length = 428 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 57/437 (13%), Positives = 134/437 (30%), Gaps = 51/437 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS-DTS 76 W P+ + T + + + ++ +V SR D Sbjct: 4 EWVPRPLSEVAREITVGFVGTMADQYVAEGVPFLRSLNVRPFEIDLGDVKYISRDFHDRL 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S G ++ + G A+++D + CS +V ++ L + ++ Sbjct: 64 RKSALRPGDVVIVRTGKPGTCAVVSDALPEANCSDVVIVRCGEE-LNPHFLSYWVNAMAA 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + A GA H + + +P+PPL+ Q + ++A RID L + Sbjct: 123 SHVTAHTVGAVQQHFNVASAKLLRLPVPPLSVQDEVLAPLLAIDRRIDLLRQTNATLEAI 182 Query: 195 LKEKKQALV-----SYIVTKGLNPDVK------MKDSGIEW--VGLVPDHWEVKPFFALV 241 + ++ +G P+ + S E +G +P W V+ + Sbjct: 183 AQALFKSWFIDFDPVRAKAEGREPEGMDAATAALFPSEFEESALGEIPKGWGVRSLDSFA 242 Query: 242 TE----LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 +K + L + ++ T + IV G+++F + Sbjct: 243 NYLNGLAMQKFPPESDEEYLPVIKIAQLRAGNTSGADRASSRLKPDYIVRDGDVLFSWSG 302 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC---KVFYAMGSGLR 354 G + V + + + + + A + Sbjct: 303 SLE-----VELWCGGNGALNQHLFKVTSSKV--PKWFYYLATKQFLPTFREIAAHKATTM 355 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL--KER---RSS 409 ++ + V +P + V++ + + + + I L +E R + Sbjct: 356 GHIQRVHLMEASVAMPAPE-------VLDAL-SPLMRSI-LERRVIGALHARELAAVRDA 406 Query: 410 FIAAAVTGQIDLRGESQ 426 + ++G++ L + Sbjct: 407 LLPRLISGKLRLPEAEE 423 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 41/144 (28%), Gaps = 11/144 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTG----RTSESGKD--IIYIGLEDVESGTGKYLPKDGNSR 71 +G IPK W V + F G + + + I + + +G + Sbjct: 226 LGEIPKGWGVRSLDSFANYLNGLAMQKFPPESDEEYLPVIKIAQLRAGN----TSGADRA 281 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S I G +L+ G L + +G + + V Sbjct: 282 SSRLKPDYIVRDGDVLFSWSGS-LEVELWCGGNGALNQHLFKVTSSKVPKWFYYLATKQF 340 Query: 132 DVTQRIEAICEGATMSHADWKGIG 155 T R A + TM H + Sbjct: 341 LPTFREIAAHKATTMGHIQRVHLM 364 >gi|217033075|ref|ZP_03438541.1| hypothetical protein HPB128_179g1 [Helicobacter pylori B128] gi|298737197|ref|YP_003729727.1| putative type I restriction enzyme specificity subunit [Helicobacter pylori B8] gi|216945196|gb|EEC23883.1| hypothetical protein HPB128_179g1 [Helicobacter pylori B128] gi|298356391|emb|CBI67263.1| putative type I restriction enzyme (specificity subunit) [Helicobacter pylori B8] Length = 252 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 51/160 (31%), Gaps = 5/160 (3%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N I +K ++ + ++ D L + ++ Sbjct: 94 NSIDIDGNLKNTMKRVNFYDNSLKQDDIVMVLSDVAHGDFLGLCAVIPSNDYVLNQRMGR 153 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 ++ L + K F G G + +L + ++ + +PP+ EQ I N+ Sbjct: 154 LRIRNDCINILFLRLYINANQKYFKMQGQGSSQLNLSKKAIEDFEIPLPPLNEQIAIANI 213 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ I L K Q + + ++ +I + Sbjct: 214 LSALDHEIASLKNKKRQ----FDNIKKALNHDLMSAKIRV 249 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 69/190 (36%), Gaps = 16/190 (8%) Query: 25 WKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVS 79 W+ V + + G E+ D I L ++ G K K N + Sbjct: 61 WQRVRLGDICEFGNGEAYETLIVENGDFKLISLNSIDIDGNLKNTMKRVNFYDNS----- 115 Query: 80 IFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + I+ A+I D + + + L+ ++ +L L Sbjct: 116 -LKQDDIVMVLSDVAHGDFLGLCAVIPSNDYVLNQRMGRLRIRNDCINILFLRLYINANQ 174 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + +G++ + K I + +P+PPL EQ+ I + A I +L ++ +F + Sbjct: 175 KYFKMQGQGSSQLNLSKKAIEDFEIPLPPLNEQIAIANILSALDHEIASLKNKKRQFDNI 234 Query: 195 LKEKKQALVS 204 K L+S Sbjct: 235 KKALNHDLMS 244 >gi|302331824|gb|ADL22017.1| restriction endonuclease S subunit, HsdS [Staphylococcus aureus subsp. aureus JKD6159] Length = 410 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 58/430 (13%), Positives = 135/430 (31%), Gaps = 60/430 (13%) Query: 26 KVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDG-NSRQSDTSTVSIF 81 + + +++G + G ++ +DV G + Sbjct: 4 ETFNLTDLYTISSGLSKNRKYFGTGTPFLTFKDVFDNLILPNEFSGQVITEEKEREKYSV 63 Query: 82 AKGQILYGKLGPYLR-----KAIIADFDGICSTQFLV------LQPKDVLPELLQGWLLS 130 KG + + + D+ F +LP + S Sbjct: 64 KKGDLFLTRTSEKQNELGISAVALKDYKNATFNGFTKRLRPNKYCENKLLPVFAAFYFRS 123 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + ++ ++ +T + + + I + + IP L Q+ I ++A + + Sbjct: 124 NNFRNQVNSMSIMSTRASLNNEMISKLKITIPSLQNQMKISHILLALLKK----EKINQK 179 Query: 191 FIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIE----WVGLVPDHWEVKPFFALVTE 243 I L+E Q L PD K SG E +G +P W+V + Sbjct: 180 IIANLEELSQTLFKRWFVDFEFPDENGNPYKSSGGEMVDSELGKIPRSWKVDELGNYIKI 239 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + K +N K + I+ +IV D +++ Sbjct: 240 KSGKRP---------------------KNKVDKEDIENVVPIIGASKIVGYTNDYLYNEK 278 Query: 304 SLRSAQVMERGIITSAYMAVKPHGID----STYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + +V G+I P S + + + + + L Sbjct: 279 IIIIGRVGTHGVIQRFSTRTWPSDNTFVITSDFESIIYQVLKSIDYISLNRGSTQPLLSQ 338 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI---VLLKERRSSFIAAAVT 416 +D+K V++P +++ + + +++ ++Q I L + R + + ++ Sbjct: 339 KDIKNTKVVMPTN------ATLLSKYQKKNNHILKMMDQKIIENKKLTQLRDTLLPKLMS 392 Query: 417 GQIDLRGESQ 426 G+I++ + + Sbjct: 393 GEIEIPDDIE 402 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 64/199 (32%), Gaps = 15/199 (7%) Query: 10 YKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 YK SG + +G IP+ WKV + + K+ +G+ ++ D ED+E+ +P Sbjct: 209 YKSSGGEMVDSELGKIPRSWKVDELGNYIKIKSGKRPKNKVDK-----EDIEN----VVP 259 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 G S+ + ++ + I+ G++G + + F++ D + Q Sbjct: 260 IIGASKIVGYTNDYLYNEKIIIIGRVGTHGVIQRFSTRTWPSDNTFVI--TSDFESIIYQ 317 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 ++ + + + Q + +I Sbjct: 318 VLKSIDYISLNRGSTQPLLSQKDIKNTKVVMPTNATLLSKYQKKNNHILKMMDQKIIENK 377 Query: 186 TERIRFIELLKEKKQALVS 204 LL + + Sbjct: 378 KLTQLRDTLLPKLMSGEIE 396 >gi|157737949|ref|YP_001490633.1| Type I restriction-modification system specificity determinant [Arcobacter butzleri RM4018] gi|157699803|gb|ABV67963.1| Type I restriction-modification system specificity determinant [Arcobacter butzleri RM4018] Length = 448 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 64/413 (15%), Positives = 120/413 (29%), Gaps = 33/413 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K +++V I F K N + + Y + G +L + T Sbjct: 30 KDFELVKIGTFLKRNKTQIIV-DDNTTYKRVTIKLYNNGVFLRDTEIGKNIGTKKQFSIK 88 Query: 83 KGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRI 137 +GQ L K+ IA + I + FL + P+ L + Q Sbjct: 89 EGQFLLSKIDARNGAFGIATNEVDGAIITADFLAFDIDTSKINPDFLVLITTTKKFMQFA 148 Query: 138 EAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++ G T D N +P+P L Q I E + + + ++ Sbjct: 149 QSASSGTTGRQRIDESKFLNTKIPLPKLDIQKQIVENYQNKINLASEQGQKAENLEKNIE 208 Query: 197 EKKQALV-----------SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 E + V K + + G E + L+ N Sbjct: 209 EYLYTELGIIKLEILTKDESSVLKFVTFKDMINCWGYENNNQIKIESTKYKVLKLINICN 268 Query: 246 RKN---------TKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVF 293 + I I + G + L E +I ++ Sbjct: 269 IGSGGTPSRNYPNYYINGTIPWIKTGEVRDALILNTEESITEEALQNSNAKIYPKDSLIV 328 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + + + + + +LM + K+ Sbjct: 329 AMYGATAGRTAKLGIEASTNQACAILHNFDLNKININFIWFYLMTQLENFKLL--TSGSA 386 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL-LKE 405 + +L + +K + +PPI+ Q I N I + I +L E+ E++ L L+E Sbjct: 387 QPNLNADKIKNYQIPIPPIEMQNKIANNIEILKNEIKILNEQSEKNKKLALEE 439 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 39/134 (29%), Gaps = 3/134 (2%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 K + + G+ + ID +N + + +V I + Sbjct: 77 KNIGTKKQFSIKEGQFLLSKIDARNGAFGIATNEVDGAIITADFLAFDIDTSKINPDFLV 136 Query: 336 LMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 L+ + F + G+ RQ + + +P + Q I + Sbjct: 137 LITTTKKFMQFAQSASSGTTGRQRIDESKFLNTKIPLPKLDIQKQIVENYQNKINLASEQ 196 Query: 393 VEKIEQSIVLLKER 406 +K E ++E Sbjct: 197 GQKAENLEKNIEEY 210 >gi|303253790|ref|ZP_07339925.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302647374|gb|EFL77595.1| Type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 2 str. 4226] Length = 302 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 62/211 (29%), Gaps = 13/211 (6%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S ++ +P W L + K E + + I + + + K S Sbjct: 63 SQQDFPFEIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYIS 122 Query: 280 YETYQIVDPGE-------IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 I + G + I + A + ++ + + Sbjct: 123 KGNRNITENGLRSSSTRLLSKNSIVYSSRAPIGYIAITETELCTNQGFKSIDLYNKEIVD 182 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + Y ++ + + + +PP+ EQ I I I+ Sbjct: 183 YLYYSLIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQY 242 Query: 393 VEKIEQSIVLL-----KERRSSFIAAAVTGQ 418 + E+ + L ++ + S + AA+ G+ Sbjct: 243 -AEKEEKLTALHQQFPEQLKKSILQAAIQGK 272 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 79/211 (37%), Gaps = 16/211 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPK---DGN 69 IPK W V + ++ G T ++ +D I +I D++ +GKY+ K + Sbjct: 70 EIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYISKGNRNIT 129 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 +S+ + +K I+Y P I + + + F + + + + Sbjct: 130 ENGLRSSSTRLLSKNSIVYSSRAPI-GYIAITETELCTNQGFKSIDLYNKE-IVDYLYYS 187 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I T I++ G T GN +P+PPL EQ I KI I+ + Sbjct: 188 LIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQYAEKEE 247 Query: 190 RFIELLKEKK----QALVSYIVTKGLNPDVK 216 + L ++ ++++ + L Sbjct: 248 KLTALHQQFPEQLKKSILQAAIQGKLTKQDP 278 >gi|119510903|ref|ZP_01630026.1| type I restriction-modification system, M subunit, putative [Nodularia spumigena CCY9414] gi|119464431|gb|EAW45345.1| type I restriction-modification system, M subunit, putative [Nodularia spumigena CCY9414] Length = 471 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 56/156 (35%), Gaps = 10/156 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTG 61 KDSG++W+G IP HW+V+ +K TK+ G+ + ++ +I DV + Sbjct: 283 MKDSGIEWLGKIPNHWEVIKVKHLTKILRGKFTHRPRNDPRFYDGQYPFIQTGDVANANK 342 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + ++ + F G ++ + + I +F+ + P + Sbjct: 343 FIMEYTQTLNENGYAVSKEFPSGTLVMT-IAANIGDMAILNFNACFPDSIVGFLPSKMTD 401 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 + + L + ++ + + Sbjct: 402 -IFFLYHLFSSMKKQFFRTYAITLCNPLSFVSFAFF 436 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 44/168 (26%), Positives = 63/168 (37%), Gaps = 14/168 (8%) Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKP-------FFAL 240 + IE L EK+ AL+S+ VTKGL+P V MKDSGIEW+G +P+HWEV Sbjct: 254 QEYNIEKLDEKRTALISHAVTKGLDPSVPMKDSGIEWLGKIPNHWEVIKVKHLTKILRGK 313 Query: 241 VTELNRKNTKLIESNILSLSYG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 T R + + + + G N + + L Y + G +V Sbjct: 314 FTHRPRNDPRFYDGQYPFIQTGDVANANKFIMEYTQTLNENGYAVSKEFPSGTLVMTIAA 373 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 D L I+ + D +L L S Sbjct: 374 NIGDMAILNFNACFPDSIVG----FLPSKMTDIFFLYHLFSSMKKQFF 417 >gi|300114418|ref|YP_003760993.1| restriction modification system DNA specificity subunit [Nitrosococcus watsonii C-113] gi|299540355|gb|ADJ28672.1| restriction modification system DNA specificity subunit [Nitrosococcus watsonii C-113] Length = 557 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 57/158 (36%), Gaps = 4/158 (2%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 I R++ L + Y++ + R N + + ++ + Sbjct: 338 IDGSNLRSIKLDDIEIQKYELSRNDLLCIRVNGSPNLVGRMILFKHDNVMAYCDHFIRFR 397 Query: 325 PHG--IDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + +Y+ L + + + + S + ++ + L + + EQ I + Sbjct: 398 FSQGVVLPSYIQMLFDTQIVRRYIELNKVSSAGQNTVSQTTISALAIPYCSLMEQKIIVS 457 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + I + +IE+++ L+ R S + A +GQ Sbjct: 458 RLEEQLTAISAVKAEIERNLQRLESLRQSILKKAFSGQ 495 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 2/91 (2%) Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + YL + +S + + GS ++LKF D R+ V +PP+ EQ I I + Sbjct: 131 YLNNYLRYFYKSGKVVRY--QAGSNNLRNLKFNDYLRISVPLPPLNEQQRIVAKIEELFS 188 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +D +E ++ + LK R + + A G+ Sbjct: 189 ELDKGIESLKTAREQLKVYRQAVLKHAFEGK 219 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 66/207 (31%), Gaps = 12/207 (5%) Query: 22 PKHWKVVPIKRFTKL-NTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P W + ++ + G SGK I I L DV++ Sbjct: 295 PNGWISIQLRELFESAQNGLAKREGISGKPIPVIRLADVKNQEIDGSNLRSIKLDDIEIQ 354 Query: 78 VSIFAKGQILYGKLGPYLRKAI------IADFDGICSTQFLVLQPK-DVLPELLQGWLLS 130 ++ +L ++ + C + VLP +Q + Sbjct: 355 KYELSRNDLLCIRVNGSPNLVGRMILFKHDNVMAYCDHFIRFRFSQGVVLPSYIQMLFDT 414 Query: 131 IDVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 V + IE A + I + +P L EQ +I ++ + I + E Sbjct: 415 QIVRRYIELNKVSSAGQNTVSQTTISALAIPYCSLMEQKIIVSRLEEQLTAISAVKAEIE 474 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVK 216 R ++ L+ +Q+++ + L P Sbjct: 475 RNLQRLESLRQSILKKAFSGQLVPQDP 501 >gi|158335388|ref|YP_001516560.1| type I restriction-modification system S subunit [Acaryochloris marina MBIC11017] gi|158305629|gb|ABW27246.1| type I restriction-modification system S subunit [Acaryochloris marina MBIC11017] Length = 573 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 69/210 (32%), Gaps = 16/210 (7%) Query: 221 GIEWVGLVPDHWEVKP---FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK- 276 +E +P W + + + K E+ ++ +I + K Sbjct: 371 EVEQPFQIPRSWTWVRVETICTHIVDCLHRTPKYQENGYPAIRTSDIQPGKILVDQARKV 430 Query: 277 ----PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 ++ I G+I + N + E + ID + Sbjct: 431 GIEEYQTQTQRLIPQEGDIFYSREG--NFGIAAVVPPQCEICLSQRMMQFRVASNIDPYF 488 Query: 333 LAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 +W+M + + +G + +K+ +PP EQ I ++ D Sbjct: 489 FSWVMNAPVIFNQALNDAAGMTVPHVNIRSLKQFVFPLPPFAEQKRIVIKVDQLMTFCDN 548 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L + + + +++ +AAAV GQ+++ Sbjct: 549 LEAHLHE-----TQEKATALAAAVVGQLEV 573 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 68/200 (34%), Gaps = 14/200 (7%) Query: 227 LVPDHWEVKPFFAL---VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-----KPE 278 +P +W + L V + + K I + NI + + E Sbjct: 89 DIPSNWSIAHLIDLSLLVVDCHNKTAPTTFEGIPLIRTTNIRNRQFRFHGMKYVDQDTYE 148 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + +PG+I+F + + + G + + V +D ++ + Sbjct: 149 FWSRRCFPEPGDIIFTREAPMGEATIIPDGMKVCLG-QRTMLIRVFEQFVDRNFVLLALT 207 Query: 339 SYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 L K + G + L+ +DV+++ + +PP+ EQ I ++ A D Sbjct: 208 EPGLIKRLASNAVGMTVKHLRVKDVEQICLPLPPLAEQKRIVAKVDELMAMCDRYEVSKC 267 Query: 398 QSIVLLKERR----SSFIAA 413 L + R + + A Sbjct: 268 DRNTLRTKMRASANDALMNA 287 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 76/224 (33%), Gaps = 13/224 (5%) Query: 20 AIPKHWKVVPIKR----FTKLNTGRTSESGKDIIYIGLEDVESGTGKY--LPKDGNSRQS 73 IP +W + + + + + I I ++ + ++ + Sbjct: 89 DIPSNWSIAHLIDLSLLVVDCHNKTAPTTFEGIPLIRTTNIRNRQFRFHGMKYVDQDTYE 148 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLS 130 S G I++ + P II D +C T + + + V + L Sbjct: 149 FWSRRCFPEPGDIIFTREAPMGEATIIPDGMKVCLGQRTMLIRVFEQFVDRNFVLLALTE 208 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +R+ + G T+ H K + I +P+PPLAEQ I K+ D + Sbjct: 209 PGLIKRLASNAVGMTVKHLRVKDVEQICLPLPPLAEQKRIVAKVDELMAMCDRYEVSKCD 268 Query: 191 FIELLKEK----KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 L + AL++ + LN + E + P+ Sbjct: 269 RNTLRTKMRASANDALMNAETDESLNTAWEFVQEHWECLIQEPE 312 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 8/195 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTG---RTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP+ W V ++ RT + I D++ G + Sbjct: 377 QIPRSWTWVRVETICTHIVDCLHRTPKYQENGYPAIRTSDIQPGKILVDQARKVGIEEYQ 436 Query: 76 STVSIF--AKGQILYGKLGPYLRKAIIADFDGIC-STQFLVLQPKDVLPELLQGWLLS-I 131 + +G I Y + G + A++ IC S + + + + W+++ Sbjct: 437 TQTQRLIPQEGDIFYSREGNFGIAAVVPPQCEICLSQRMMQFRVASNIDPYFFSWVMNAP 496 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + G T+ H + + + P+PP AEQ I K+ D L Sbjct: 497 VIFNQALNDAAGMTVPHVNIRSLKQFVFPLPPFAEQKRIVIKVDQLMTFCDNLEAHLHET 556 Query: 192 IELLKEKKQALVSYI 206 E A+V + Sbjct: 557 QEKATALAAAVVGQL 571 >gi|121610071|ref|YP_997878.1| restriction modification system DNA specificity subunit [Verminephrobacter eiseniae EF01-2] gi|121554711|gb|ABM58860.1| restriction modification system DNA specificity domain [Verminephrobacter eiseniae EF01-2] Length = 416 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 52/425 (12%), Positives = 122/425 (28%), Gaps = 39/425 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W I K+ G + + + + G G + K Sbjct: 3 SEWATATIGDVAKIKHGFAFKGEFFTDEVTPNVLVTPGNFAIGGGFQIGK-PKYYAGPLP 61 Query: 77 TVSIFAKGQILYGKLG------PYLRKAIIADFDGICS------TQFLVLQPKDVLPELL 124 +G+++ A + G+ V + + L Sbjct: 62 DDYALTEGEVVVTMTDLSKASDTLGYAAKVPSVPGVTYWHNQRIGLLQVTDKQRACKDWL 121 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + + + G T+ H I + +PPL EQ I E + + RID L Sbjct: 122 HYLMRTHEYRAWVVGSASGTTVKHTSPSRIESFSFKLPPLEEQRAIAETLGSLDDRIDNL 181 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + ++ + V M++S +GL+P W + F + L Sbjct: 182 RQTNATLEAIAAALFKS---WFVDFDGVSATDMRES---ELGLIPKGWRIGSFDEAIEIL 235 Query: 245 NRKNT-----KLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVFRFIDL 298 ++ S + + + K + Q + + Sbjct: 236 GGGTPKTSIADYWSGDVPWFSVVDAPGSGQVFVLDTEKKITALGLQNCSAKLLPEMTTII 295 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSL 357 + A + + A++P + + + + G ++ Sbjct: 296 SARGTVGKVAMTGVPMAMNQSCYALRPRQQSGEAFVY-FSTLRFVEHLQRIAHGAVFDTI 354 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-VEKIEQSIVLLKERRSSFIAAAVT 416 + K++ +PP + I + ++ + + + +I L R + + ++ Sbjct: 355 TRDSFKQVTTCLPPDEV---IAGFAEIANPLLERIRINGQQAAI--LAALRDALLPRLIS 409 Query: 417 GQIDL 421 GQ+ + Sbjct: 410 GQLRV 414 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 58/204 (28%), Gaps = 14/204 (6%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY 63 ++S +G IPK W++ ++ G T ++ D+ + + D + Sbjct: 211 MRESE---LGLIPKGWRIGSFDEAIEILGGGTPKTSIADYWSGDVPWFSVVDAPGSGQVF 267 Query: 64 ---LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 K + + + + + G K + + L+P+ Sbjct: 268 VLDTEKKITALGLQNCSAKLLPEMTTIISARGTV-GKVAMTGVPMAMNQSCYALRPRQQ- 325 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + ++ + ++ I GA + +PP E R Sbjct: 326 SGEAFVYFSTLRFVEHLQRIAHGAVFDTITRDSFKQVTTCLPPDEVIAGFAEIANPLLER 385 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 I + L L+S Sbjct: 386 IRINGQQAAILAALRDALLPRLIS 409 >gi|91787818|ref|YP_548770.1| restriction endonuclease S subunits-like protein [Polaromonas sp. JS666] gi|91697043|gb|ABE43872.1| Restriction endonuclease S subunits-like protein [Polaromonas sp. JS666] Length = 451 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 62/438 (14%), Positives = 133/438 (30%), Gaps = 44/438 (10%) Query: 24 HWKVVPIK----RFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ +K + + + YI + +++G S + Sbjct: 6 SWQRQTLKAAGISLIDCDHRTPPAANEGYPYIAIPQLKNGHVSLDGVRRISPEDYLEWTK 65 Query: 80 IFAK--GQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ + A+I + L K V P+ L+ L D Sbjct: 66 KLKPQTHDVIVVRRCNSGDSALIPPGLECAIGQNLVILRSDGKTVQPQFLRWLLNGPDWW 125 Query: 135 QRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +++ GA + I N + IPP+ +Q I + A RI L Sbjct: 126 EQVSKFINVGAVFDSLRCRDIPNFELTIPPIDDQREIAIVLDALDDRIALLREINTTLEA 185 Query: 194 LLKEKKQALVSYI----------VTKGLN-PDVKMKDSGIEW--VGLVPDHWEVKPFFAL 240 + + ++ V +G++ + G E +GLVP W V Sbjct: 186 IAQALFKSWFVDFDPVRAKMEGRVPEGMDEATAALFPDGFEESELGLVPRGWTVDRLDTW 245 Query: 241 VTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 ++ + I + S+ +I++ + K S++ + + G ++ Sbjct: 246 LSVLETGRRPKGGVGGISDGVPSIGAESIVRIGQFDFGKTKYVSHDFFANMKSGALISHD 305 Query: 296 IDLQNDKRSLRSA----------QVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCK 344 + L D ER I ++ + +L + + S + Sbjct: 306 VLLYKDGGKPGVFLPRVSMFGDDFPFERCGINEHVFRMRLKAPFNQPFLYFWLWSDAVMH 365 Query: 345 VFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 G + DV+ + VP + N + + + + + L Sbjct: 366 ELKHRGGKAAIPGINQSDVREQELSVPNAS----VLNRFDELVSPLVGRIFSNAKQAQTL 421 Query: 404 KERRSSFIAAAVTGQIDL 421 R + + ++GQ+ L Sbjct: 422 ATLRDTLLPRLISGQLRL 439 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 57/202 (28%), Gaps = 16/202 (7%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 S + W + N I L G++ R Sbjct: 1 MSSDVSWQRQTLKAAGISLIDCDHRTPPAANEGYPYIAIPQLKNGHVSLDGVRRISPEDY 60 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + +++ D + G + + +L WL+ Sbjct: 61 LEWTKKLKPQTHDVIVVRRCNSGDSALIPPGLECAIG-QNLVILRSDGKTVQPQFLRWLL 119 Query: 338 RSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 D + + SL+ D+ + +PPI +Q +I V++ R Sbjct: 120 NGPDWWEQVSKFINVGAVFDSLRCRDIPNFELTIPPIDDQREIAIVLDALDDR------- 172 Query: 396 IEQSIVLLKERRSSF--IAAAV 415 I LL+E ++ IA A+ Sbjct: 173 ----IALLREINTTLEAIAQAL 190 >gi|332076949|gb|EGI87411.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17545] Length = 424 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 60/423 (14%), Positives = 132/423 (31%), Gaps = 64/423 (15%) Query: 34 TKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ G + KD I +I + D E G ++S + KG Sbjct: 2 VEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGT 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGA 144 L + R I+ I + ++ L + ++LS + V + ++ GA Sbjct: 62 FLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGA 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----Q 200 + + + + +I +P+PPL+EQ I E I + ++D R +L KE + Sbjct: 122 VVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKK 181 Query: 201 ALVSYIVTKGLNPDVKMKDS-----------------------------------GIEWV 225 +++ Y + L +S + Sbjct: 182 SILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYY 241 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIE-------SNILSLSYGNIIQKLETRNMGLKPE 278 G +P +W V + + + K + I+ N ++ N Sbjct: 242 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNINPLEFSLLDNDYYIDT 301 Query: 279 SY--ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTY 332 + + +++ G++ ++ + I S + Sbjct: 302 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 361 Query: 333 LAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 L + + S K + ++ + L + + P +EQ IT + ++ Sbjct: 362 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 421 Query: 390 DVL 392 + L Sbjct: 422 NQL 424 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 37 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 93 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 94 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIES 153 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 154 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 191 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 72/182 (39%), Gaps = 17/182 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I ++ L D Sbjct: 242 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNINPLEFSLLDNDYYIDT 301 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 302 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 361 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 362 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 421 Query: 182 DT 183 + Sbjct: 422 NQ 423 >gi|282917066|ref|ZP_06324824.1| hypothetical protein SATG_00559 [Staphylococcus aureus subsp. aureus D139] gi|282319553|gb|EFB49905.1| hypothetical protein SATG_00559 [Staphylococcus aureus subsp. aureus D139] Length = 370 Score = 81.4 bits (199), Expect = 3e-13, Method: Composition-based stats. Identities = 52/372 (13%), Positives = 106/372 (28%), Gaps = 26/372 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ ++ K+N+G+ + ++ G G + Sbjct: 20 EWEEKKLESIIKVNSGKDYK-----------HLDKGDIPVYGTGGYMTSVSEP---LSEI 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + ++ T F K+ + + E Sbjct: 66 DAVGIGRKGTINKPYLLEAPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I I +P EQ I + +I+ + + K Q + Sbjct: 122 TGVPSLSKQTINKINRFVPTNKEQQKIGKFFSKLDRQIELEEQKLELLQQQKKGYMQKIF 181 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + + + V K +ES N Sbjct: 182 SQELRFKDENGNDYPEWENVMLQKVLKDKTEG-IKRGPFGGALKKDIFVESGYAVYEQRN 240 Query: 264 IIQKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I + + Y+ V P +I+ + Q +GII A + Sbjct: 241 AIYDISNFRYYINENKYKEMQSFSVQPNDIIMSCSGTIGRLALIP--QNYTKGIINQALI 298 Query: 322 AVKPHGID-STYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + S + MRS + + GS + + +++K +P +P EQ I Sbjct: 299 RFRTNHKIRSEFFLIFMRSNQMQRKILEANPGSAITNLVPVKELKLIPFPLPVKFEQDKI 358 Query: 379 TNVINVETARID 390 + I + RI+ Sbjct: 359 SQFILIINRRIE 370 >gi|163844961|ref|YP_001622616.1| hypothetical protein BSUIS_B0834 [Brucella suis ATCC 23445] gi|163675684|gb|ABY39794.1| Hypothetical protein, conserved [Brucella suis ATCC 23445] Length = 391 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 61/401 (15%), Positives = 112/401 (27%), Gaps = 48/401 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W+ V I + + + R S DI + + + +T Sbjct: 4 EVPKGWREVRIGQIAREISNRNHASA-DIPVLSMTKHRGFVRSNEYFSKSVHSENTRQYK 62 Query: 80 IFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV---- 133 + +GQ Y + + G+ S + V + + Sbjct: 63 VVKRGQFAYATIHLDEGSIDYLRNEDAGLISPMYTVFETNSEEIDNEIALRQFKRFALSG 122 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + +PPL EQ I E + A I + I+ Sbjct: 123 RFDPYSNGGVNRRKSILFSDLSAFKFGLPPLTEQRAIAEVLGAAEAA----IAKTEALIK 178 Query: 194 LLKEKKQALV-SYIVTKGLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTKL 251 +++ K+AL+ Y V + + W+ G P + Sbjct: 179 AIEQTKKALLKQYFVERQQSLLWSCVAKMGRWLSGGTPATA---------------AEEN 223 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETY-QIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + +I + +I + + E +V PG ++ + RS+ S Sbjct: 224 WKGSIPWVCPKDIKGPSISSTVDHISEDAAKALGMVGPGTLLLVVRGMIL-ARSVPSTIC 282 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 R A P+ + L + + L G + E + PV Sbjct: 283 TVRCAFNQDVKAFVPNEGVAPAFLKLWLDINEHKLLGEIETATHGT-KRFPLEHLNEFPV 341 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 V EQ LV E S L+ R Sbjct: 342 PVVTRDEQIR--------------LVTLAESSQERLRSERQ 368 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 78/195 (40%), Gaps = 12/195 (6%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSL-SYGNIIQKLETRNMGLKPESYETYQI 285 VP W + E++ +N + +LS+ + ++ E + + E+ Y++ Sbjct: 4 EVPKGWREVRIGQIAREISNRNHASADIPVLSMTKHRGFVRSNEYFSKSVHSENTRQYKV 63 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLC 343 V G+ + I L S+ + + G+I+ Y + + ID+ + + L Sbjct: 64 VKRGQFAYATIHLDE--GSIDYLRNEDAGLISPMYTVFETNSEEIDNEIALRQFKRFALS 121 Query: 344 KVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 F +G R+S+ F D+ +PP+ EQ I V+ + + K E I Sbjct: 122 GRFDPYSNGGVNRRKSILFSDLSAFKFGLPPLTEQRAIAEVLGAA----EAAIAKTEALI 177 Query: 401 VLLKERRSSFIAAAV 415 +++ + + + Sbjct: 178 KAIEQTKKALLKQYF 192 >gi|255690133|ref|ZP_05413808.1| putative type I restriction-modification system specificity determinant [Bacteroides finegoldii DSM 17565] gi|260624417|gb|EEX47288.1| putative type I restriction-modification system specificity determinant [Bacteroides finegoldii DSM 17565] Length = 379 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 68/403 (16%), Positives = 131/403 (32%), Gaps = 46/403 (11%) Query: 28 VPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST---VSIFAK 83 V + +NT + + Y+G E +ES + L + + T F K Sbjct: 4 VKFEDVATRVNTREDRLNTSLLYYVGGEHIESN--EMLVQGCGLIKGSTIGPMFYCGFKK 61 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWLLSIDVTQRIEAI 140 G IL P+LRKA + +FDGICS + V K +L E L + S D E Sbjct: 62 GDILLVSRNPHLRKASMVEFDGICSEKTFVLGTKDSKVLLQEFLALVMQSDDFWNYCEEH 121 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G +W + +P + EQ I +K+ + ++ + E++ Sbjct: 122 KSGGVNYFLNWSTLAKYEFYLPSIQEQKEIADKVWSAYRLKESYKKLLVATDEMV----- 176 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 K IE V + +++ + + + E+ I + Sbjct: 177 -----------------KSQFIEMFENVESYCKLEDLVSDTFPGEWGSEPISENAIKVIR 219 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDP----GEIVFRFIDLQND---KRSLRSAQVMER 313 N + + E ++V G+ + D R + ++ + Sbjct: 220 TTNFTNEGYLDLTDVVTRDIEPKKVVRKKLKQGDTILERSGGTKDNPVGRVVFFDEIGDY 279 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF----YAMGSGLRQSLKFEDVKRLPVLV 369 + ++ YL + + + A + Q+L D +++ Sbjct: 280 LPNNFTQVLRPKESVNPVYLFYALYNSYNLNKAAMRAMASQTTGIQNLSMSDFMAKFIVL 339 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 P EQ + D +++Q I + + S I Sbjct: 340 PSRNEQNKF----EQIYRQADKSKFELKQCIENIDKVIKSLIN 378 >gi|210611277|ref|ZP_03288832.1| hypothetical protein CLONEX_01022 [Clostridium nexile DSM 1787] gi|210152041|gb|EEA83048.1| hypothetical protein CLONEX_01022 [Clostridium nexile DSM 1787] Length = 410 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 57/418 (13%), Positives = 136/418 (32%), Gaps = 33/418 (7%) Query: 26 KVVPIKRFTKLN---TGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + IK + T + + I + ++++G + + + Sbjct: 3 ECRTIKELCSVVVDCPHSTPTWTAEGKIVVRSNNIKNGRIDFSSPSYTDDEHFQQRIKRA 62 Query: 82 AK--GQILYGKLGPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLLSIDVTQR 136 G I+ + P +I + C Q L P+ L L S V + Sbjct: 63 TPQGGDIIITREAPMGEVGMIPEGIVCCLGQRMVLLRANPEICDNYYLLYSLQSRYVQHQ 122 Query: 137 I-EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I + G T+S+ + + +P PL++Q + + + I + + L Sbjct: 123 ISWSEGTGTTVSNLRIPHLEQLKIPYLPLSKQRQVSSVLRCLEGK----IEQNRVINDNL 178 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 +++ ++L + + + + + + K + + ++ Sbjct: 179 QQQAKSLFKKWFIDNPDAALWQEGTFSDLIEKTISGDWGKDTPSGNNTEM--VYCIRGAD 236 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ------ 309 I + GN K + + P++Y + Q+V+ +V +A Sbjct: 237 IPEVRTGN---KGKMPTRYILPKNYASKQLVNGDIVVEISGGSPTQSTGRAAAISAPLLA 293 Query: 310 -VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRL- 365 + + T+ A+KP S Y+ + VF++ G+ ++L Sbjct: 294 RYDKGMVCTNFCKALKPITGYSMYVYHYWQYLYDQGVFFSYENGTTGIKNLDISGFLETE 353 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 P+ + P + + + + L R + + ++G+ID+ G Sbjct: 354 PISIAP----EKLVKKFDTFCQAVFSKIYANGLENEQLALVRDTLLPKLMSGEIDVSG 407 >gi|160915334|ref|ZP_02077546.1| hypothetical protein EUBDOL_01342 [Eubacterium dolichum DSM 3991] gi|158432725|gb|EDP11014.1| hypothetical protein EUBDOL_01342 [Eubacterium dolichum DSM 3991] Length = 420 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 55/414 (13%), Positives = 130/414 (31%), Gaps = 47/414 (11%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP +W+ + K+ G + + I + + D++ + + + + Sbjct: 4 EIPDNWEWKSWGEVSYKIQYGYNAPAKDTGVIKMVRITDIQDNQVLWDSVPFCNIKENEI 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF-----LVLQPKDVLPELLQGWLLSI 131 + IL+ + G + K+ + + S V ++ P+ L+ ++ + Sbjct: 64 PDYLLHNFDILFARTGGTVGKSFLVENINEDSVFAGYLIRTVYNYNEINPKYLKYFMETS 123 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +++ + + + + + +PIPPL EQ I K+ I+ + Sbjct: 124 LYWSQLKKGTIATAQPNCNGQTLSKMILPIPPLQEQHRIVAKLQELEPLIEKYRIAEEQL 183 Query: 192 IELL----KEKKQALVSYIVTKGLNPDVKM------------------------KDSGIE 223 EL + K++++ Y + L P K E Sbjct: 184 HELNSNIKDQLKKSILQYAIEGKLVPQDPNDEPASVLLERIREEKQQLIKEGKIKKDKNE 243 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL--------SLSYGNIIQKLETRNMGL 275 + D+ + F ++ + + N + L+ GN+ + + + Sbjct: 244 SIIFRRDNSYYEKFGNTEFCIDDEIKCSVPINWILTRQKNLCWLNNGNLSKGEILPYLEV 303 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRS---LRSAQVMERGIITSAYMAVKPHGIDSTY 332 K ++ ++ RG + S + ++ + Sbjct: 304 KVLRGNKEAETKDSGVIVTRGTNVILVDGENSGEVMKIKYRGYMGSTFKILQTSNFVNEK 363 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ++ + K + L E + +PPI EQ I IN+ T Sbjct: 364 YVDIIFQCNRIKYKHNKKGAAIPHLDKELFNNTLIFLPPITEQQRILEKINLIT 417 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 66/204 (32%), Gaps = 13/204 (6%) Query: 227 LVPDHWEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 +PD+WE K + + ++ + + + ++ E+ Sbjct: 4 EIPDNWEWKSWGEVSYKIQYGYNAPAKDTGVIKMVRITDIQDNQVLWDSVPFCNIKENEI 63 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSY 340 ++ +I+F K L + + I+ YL + M + Sbjct: 64 PDYLLHNFDILFARTGGTVGKSFLVENINEDSVFAGYLIRTVYNYNEINPKYLKYFMETS 123 Query: 341 DLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + + + + ++ + +PP++EQ I + I+ E+ Sbjct: 124 LYWSQLKKGTIATAQPNCNGQTLSKMILPIPPLQEQHRIVAKLQELEPLIEKYR-IAEEQ 182 Query: 400 IVLLK-----ERRSSFIAAAVTGQ 418 + L + + S + A+ G+ Sbjct: 183 LHELNSNIKDQLKKSILQYAIEGK 206 >gi|118475739|ref|YP_892534.1| restriction and modification enzyme CjeI [Campylobacter fetus subsp. fetus 82-40] gi|118414965|gb|ABK83385.1| restriction and modification enzyme CjeI [Campylobacter fetus subsp. fetus 82-40] Length = 1285 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 51/389 (13%), Positives = 111/389 (28%), Gaps = 24/389 (6%) Query: 26 KVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K+V + ++ TG T + G D + D+ +G + + + Sbjct: 901 KLVKLGEICEILTGSTPSTQKKEFYGSDFPFYRPADLING-RNVNSSEVMVSKLGYESQR 959 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K IL +G R +I +L + + E L + Q + Sbjct: 960 ALPKKSILVSCIGTIGRVGMIEKSGIFNQQINALLPNNNYISEFLFYLFDTNFFKQLLIQ 1019 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE---TVRIDTLITERIRFIELLK 196 T+ + NI +P+PPL Q I ++ I I E I+ + Sbjct: 1020 QTHNTTVPIINKSKFSNIKIPLPPLEAQEKIVKECEEVEEKFKTIRMSIEEYKSLIKEIL 1079 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 K + + G + + + + +P + +++ Sbjct: 1080 IKSCVITDASLEIGGGYEQNL----AQILNDLPSPQNYG-LSEWESVKLTNKDFILKIGK 1134 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 L + + +K + + + + + + + E Sbjct: 1135 RVLDKDLTQDGINVFSANVKEPFGKINKDLIKDFSLDSVLWGIDGDWMTGFVKANEPFYP 1194 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 T ++ + L + + F + E + L + +PP++ Q Sbjct: 1195 TDHCGVLRSKSHKAKILEFALFEVGAKFGFSR-----QNRASIERISNLTLSLPPLEAQE 1249 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKE 405 I I I L + L+ Sbjct: 1250 KIVKAIEFCEGEISNL----NNELKTLEN 1274 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 32/298 (10%), Positives = 86/298 (28%), Gaps = 26/298 (8%) Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 +L+ + + + + IP + A + + + + Sbjct: 814 DNPEKLCFLVRRAFILNDDFEKQKLKDPYVSFSQNLQIPANLSEFAFTTPLLKCLDFTSA 873 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 + + I I + LNP K ++ +G + + Sbjct: 874 KFNKAINLNIASSKNGVN-------------LNPFEGSKFKLVK-LGEICE-----ILTG 914 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 +K + + + + + + + YE+ + + I+ I Sbjct: 915 STPSTQKKEFYGSDFPFYRPADLINGRNVNSSEVMVSKLGYESQRALPKKSILVSCIGTI 974 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLK 358 + + + + I + + S +L +L + ++ + + Sbjct: 975 GRVGMIEKSGIFNQQIN----ALLPNNNYISEFLFYLFDTNFFKQLLIQQTHNTTVPIIN 1030 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE--RRSSFIAAA 414 + + +PP++ Q I + + IE+ L+KE +S I A Sbjct: 1031 KSKFSNIKIPLPPLEAQEKIVKECEEVEEKFKTIRMSIEEYKSLIKEILIKSCVITDA 1088 >gi|315612675|ref|ZP_07887587.1| HsdS family protein [Streptococcus sanguinis ATCC 49296] gi|315315262|gb|EFU63302.1| HsdS family protein [Streptococcus sanguinis ATCC 49296] Length = 388 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 56/410 (13%), Positives = 118/410 (28%), Gaps = 41/410 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + + G K + GKY + N + + G Sbjct: 5 EECILGDLVEFQRGYDLPKSKFV-----------EGKYPVQSSNGILGYHNEYKVEGPG- 52 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I G+ G +I + +T V + K E + L +D+ + G Sbjct: 53 ITIGRSGTVGNPHLIRENFFPHNTSLFVKEFKGNDIEYIYYLLQYLDL--GNQKSGSGVP 110 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + I +Q I ID I + + L+ + L Y Sbjct: 111 TMNRNHLHPIKIRAYRDKTCQQRTI-----KILSLIDKKIQINNQINQELEVMAKTLYDY 165 Query: 206 IVTKGLNPDV---KMKDSG------IEWVGLVPDHWEVKPFFALVTELNRK-NTKLIESN 255 + PD K SG E +P W V+ +L+ N Sbjct: 166 WFVQFDFPDQNGKPYKSSGGKMVYNPELKREIPVRWGVEKLSSLLEIGRETINPMKTPKE 225 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + L IV+ +++ ++ ++ + E I Sbjct: 226 EFKYYSIPEYDVSGSFSYELGETIRSNKFIVEKSDLLVSKLNPWFNRV---VYNLEENAI 282 Query: 316 ITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPP 371 ++ ++ K + + +L + S + + +G + + + + + Sbjct: 283 SSTEFIVWKTFNRFEKNFLYQVATSKEFIEYCTRFTTGTSNSHKRVSPDIMVGFQIPF-- 340 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 E+ I T I V + L + R + + GQ+ + Sbjct: 341 --EKTYI-QKFGEITDSIRTQVLQNNVQNQELTQLRDWLLPMLMNGQVKV 387 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 62/200 (31%), Gaps = 20/200 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP W V + ++ +T + I DV L + Sbjct: 196 EIPVRWGVEKLSSLLEIGRETINPMKTPKEEFKYYSIPEYDVSGSFSYELGETIR----- 250 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQP-KDVLPELLQGWLLSID 132 S I K +L KL P+ + + + + I ST+F+V + L S + Sbjct: 251 -SNKFIVEKSDLLVSKLNPWFNRVVYNLEENAISSTEFIVWKTFNRFEKNFLYQVATSKE 309 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + G + P + Q+ + I + I I ++ Sbjct: 310 FIEYCTRFTTGTS-------NSHKRVSPDIMVGFQIPFEKTYIQKFGEITDSIRTQVLQN 362 Query: 193 ELLKEKKQALVSYIVTKGLN 212 + ++ L +++ +N Sbjct: 363 NVQNQELTQLRDWLLPMLMN 382 >gi|308182609|ref|YP_003926736.1| Type I restriction/modification specificity protein [Helicobacter pylori PeCan4] gi|308064794|gb|ADO06686.1| Type I restriction/modification specificity protein [Helicobacter pylori PeCan4] Length = 416 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 70/413 (16%), Positives = 128/413 (30%), Gaps = 39/413 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ---S 73 W+ +K K+ G T + I +I +D+ + G+Y+ K S Sbjct: 2 SEWQTFCLKDLGKIVGGATPSTNNPKNYGNKIAWITPKDLSTLQGRYIKKGSRSISRLGF 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K IL+ P IA+ + F + P + + L Sbjct: 62 KSCSCVLLPKHAILFSSRAPI-GYVAIAEKRLCTNQGFKSIIPNKKI-YFEFLYYLLKYH 119 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFI 192 I I G T + + IPP EQ I + +I+ Sbjct: 120 KDNISNIGGGTTFKEVSGATLSLFEVKIPPTYYEQQKIARTLSVLDQKIENNHKINELLH 179 Query: 193 ELLKEKKQALVSYI-VTKGLNPDVK-----MKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 ++L+ + G N + MK S E L+P+ +EVK LV + Sbjct: 180 KILELLYEQYFVRFDFLDGNNKPYQTSGGKMKFS-KELNRLIPNDFEVKTLGELVDIFSG 238 Query: 247 KNTKLIESNILSLSYGNIIQK---------LETRNMGLKPESYETYQIVDPGEIVFRFID 297 + + + Y I K T N+ P+ Y +++P I+ Sbjct: 239 YSFQSNTYSNNKNDYILITNKNVQHSLVDLSITTNLLFLPKKLPKYCLLEPTNILITLTG 298 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SGLRQ 355 + S + I+ V P + + L+R+ + +Q Sbjct: 299 HIGRCALVFS----KNCILNQRVGVVLPKEKELNPFYYSLIRNPLFSAILQRNAIGSSQQ 354 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +L D ++ + I + I L+ Q+ L R Sbjct: 355 NLSPIDTLKIQIPF-----NHKIIKQYSKTCENIIKLLVSNMQTTQTLTALRD 402 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 52/158 (32%), Gaps = 8/158 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRT------SESGKDIIYIGLEDVESGTGKY-LPKDGNSRQS 73 IP ++V + + +G + S + D I I ++V+ + + Sbjct: 220 IPNDFEVKTLGELVDIFSGYSFQSNTYSNNKNDYILITNKNVQHSLVDLSITTNLLFLPK 279 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSID 132 + IL G R A++ + I + + V+ PK+ L + + Sbjct: 280 KLPKYCLLEPTNILITLTGHIGRCALVFSKNCILNQRVGVVLPKEKELNPFYYSLIRNPL 339 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 + ++ G++ + I +P + Sbjct: 340 FSAILQRNAIGSSQQNLSPIDTLKIQIPFNHKIIKQYS 377 >gi|153805906|ref|ZP_01958574.1| hypothetical protein BACCAC_00146 [Bacteroides caccae ATCC 43185] gi|149130583|gb|EDM21789.1| hypothetical protein BACCAC_00146 [Bacteroides caccae ATCC 43185] Length = 370 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 70/393 (17%), Positives = 137/393 (34%), Gaps = 33/393 (8%) Query: 27 VVPIKRFTK--LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-FAK 83 +V + K I Y+G E ++S K + F Sbjct: 3 IVKFSEVAHRAYTREDRFNTEK-IYYVGGEHIDSCELYVTKKGVIKGSTIGPMFYCGFTA 61 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL---PELLQGWLLSIDVTQRIEAI 140 GQIL+ P+L+K IADFDGICS + V++ KD E L + S D E Sbjct: 62 GQILFVTRNPHLKKCSIADFDGICSEKTFVIETKDESILTQEYLAIIMQSDDFWNYCEEN 121 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G +W + + +PP+ +Q+ I +K + + + + +LL ++ Sbjct: 122 KSGGVNYFLNWSTLADYEFELPPIKQQLEIAQK-------VMSAYRLKQSYKKLLDATRE 174 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + S + NP K W+ + E+ K + +L+L Sbjct: 175 MVKSQFIEMFGNPVTNTK------------GWKTAKIKDVAPEMPSKEQLSGKIWLLNLD 222 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + E+ + Q D G ++F + +K + Sbjct: 223 MIESNTGRIIEKVYEDVENALSVQSFDEGNVLFSKLRPYLNKVVIP--DEPGMATTELVP 280 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +P + +L+ L+R + G + +++ ++PP+ +Q + Sbjct: 281 LRPEPSKLHKVFLSHLLRGNQFVNYANDIAGGTKMPRMPLTELRNFDCILPPMDKQLEFV 340 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++D ++ +SI + + S I Sbjct: 341 ----FIAEQVDKSEFELRKSIDAIDQVIKSLIN 369 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/185 (23%), Positives = 78/185 (42%), Gaps = 8/185 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WK IK + SGK I + L+ +ES TG+ + K ++ S S F Sbjct: 192 KGWKTAKIKDVAPEMPSKEQLSGK-IWLLNLDMIESNTGRIIEKVYEDVENALSVQS-FD 249 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIEAI 140 +G +L+ KL PYL K +I D G+ +T+ + L+P+ + L I Sbjct: 250 EGNVLFSKLRPYLNKVVIPDEPGMATTELVPLRPEPSKLHKVFLSHLLRGNQFVNYANDI 309 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G M + N +PP+ +Q+ ++D E + I+ + + + Sbjct: 310 AGGTKMPRMPLTELRNFDCILPPMDKQLEFV----FIAEQVDKSEFELRKSIDAIDQVIK 365 Query: 201 ALVSY 205 +L++ Sbjct: 366 SLINN 370 >gi|322387157|ref|ZP_08060767.1| type I site-specific deoxyribonuclease chain S [Streptococcus infantis ATCC 700779] gi|321141686|gb|EFX37181.1| type I site-specific deoxyribonuclease chain S [Streptococcus infantis ATCC 700779] Length = 395 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 67/384 (17%), Positives = 123/384 (32%), Gaps = 33/384 (8%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKFFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + IPPLAEQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIAIPPLAEQQRIVEVIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDS--------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 +S E L + K + + E + Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLEISIVSQGDDNSYYEEVPNTWQLFK 251 Query: 264 IIQKLETRNMGLKPESYETYQIVD------------PGEIVFRFIDLQNDKR--SLRSAQ 309 + L+ N + Y V G V + S Sbjct: 252 LKNLLQLDNGTKQQNERLIYWDVKTLRGIKDAEFKEKGNKVHSKDTVILVDGENSGELFI 311 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + G + S + + S L + + L K L V + Sbjct: 312 IPHDGYMGSTFKKIHYLEAGSKKYIDLYIDSKKELLKNSKTGSAIPHLNKTLFKELIVAL 371 Query: 370 PPIKEQFDITNVINVETARIDVLV 393 PPI+EQ I++ I ++I+ L+ Sbjct: 372 PPIQEQKRISSKITQIFSQINRLI 395 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 10/192 (5%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY---ETYQIVDPGEI 291 ++ + + I + S + +N+ ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKFFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I VI ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIAIPPLAEQQRIVEVIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 54/166 (32%), Gaps = 12/166 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W++ +K +L+ G ++ + I + G + Sbjct: 242 EVPNTWQLFKLKNLLQLDNGTKQQNERLIYW-----------DVKTLRGIKDAEFKEKGN 290 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G + I DG + F + + + + + ++ Sbjct: 291 KVHSKDTVILVDGENSGELFIIPHDGYMGSTFKKIHYLEAGSKKYIDLYIDSK-KELLKN 349 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 G+ + H + + + +PP+ EQ I KI +I+ LI Sbjct: 350 SKTGSAIPHLNKTLFKELIVALPPIQEQKRISSKITQIFSQINRLI 395 >gi|297581880|ref|ZP_06943801.1| restriction endonuclease S subunit [Vibrio cholerae RC385] gi|297533974|gb|EFH72814.1| restriction endonuclease S subunit [Vibrio cholerae RC385] Length = 306 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 72/202 (35%), Gaps = 13/202 (6%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + KP ++ K++ + I + N + E G K + E + G Sbjct: 17 EEILEKPLDGNHGNIHPKSSDYVGYGIPFVMANNFVNG-EVDLSGCKFITKERADRLQKG 75 Query: 290 -----EIVFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYDL 342 +I+ + + Y + +D+ ++ S Sbjct: 76 FALTGDILLTHKGTVGSTAIVGELNTDYIMLTPQVTYYRVRDANRLDNRFIRHYFDSSSF 135 Query: 343 CKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 +F ++ G G R L LP++ PP+ EQ I ++ D L+ +++ IV Sbjct: 136 QSLFASLAGGGTRAYLGIVKQLELPIVKPPVDEQRAIAQALSDV----DALLATLDEVIV 191 Query: 402 LLKERRSSFIAAAVTGQIDLRG 423 ++ + + + +TG+ L G Sbjct: 192 KKRDLKQAAMQQLLTGKTRLPG 213 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 43/254 (16%), Positives = 84/254 (33%), Gaps = 40/254 (15%) Query: 21 IPKHWKVVPIKRFTKL---------NTGRTSESGKD-----IIYIGLEDVESGTGKYLP- 65 IP W +V +K+ + N G D I ++ + +G Sbjct: 2 IPDDWDIVSVKQLVEEEILEKPLDGNHGNIHPKSSDYVGYGIPFVMANNFVNGEVDLSGC 61 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP--------- 116 K ++D G IL G AI+ G +T +++L P Sbjct: 62 KFITKERADRLQKGFALTGDILLTHKGTVGSTAIV----GELNTDYIMLTPQVTYYRVRD 117 Query: 117 -KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + ++ + S ++ G T ++ +P+ PP+ EQ I + + Sbjct: 118 ANRLDNRFIRHYFDSSSFQSLFASLAGGGTRAYLGIVKQLELPIVKPPVDEQRAIAQALS 177 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVS----------YIVTKGLNPDVKMKDSGIEWV 225 + TL ++ +L + Q L++ V K L+ +++ G Sbjct: 178 DVDALLATLDEVIVKKRDLKQAAMQQLLTGKTRLPGVSGEWVVKRLDAIAEIRSGGTPST 237 Query: 226 GLVPDHWEVKPFFA 239 G P W+ + Sbjct: 238 GE-PSFWDGDILWC 250 Score = 39.8 bits (91), Expect = 0.95, Method: Composition-based stats. Identities = 13/78 (16%), Positives = 29/78 (37%), Gaps = 10/78 (12%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTG-KYLPKDGNSRQ---S 73 W V + ++ +G T +G+ DI++ D+ + G KYL + Sbjct: 217 EWVVKRLDAIAEIRSGGTPSTGEPSFWDGDILWCTPTDITALNGHKYLRETSRLISLLGL 276 Query: 74 DTSTVSIFAKGQILYGKL 91 + S+ + ++ Sbjct: 277 NASSAEMIPAQSVVMTSR 294 >gi|220930104|ref|YP_002507013.1| restriction modification system DNA specificity domain protein [Clostridium cellulolyticum H10] gi|220000432|gb|ACL77033.1| restriction modification system DNA specificity domain protein [Clostridium cellulolyticum H10] Length = 385 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 50/397 (12%), Positives = 109/397 (27%), Gaps = 30/397 (7%) Query: 31 KRFTKLN-TGRTSES------GKDIIYIGLEDVESGT--GKYLPKDGNSRQSDTSTVSIF 81 + G T + +I +I D+ G K +S + Sbjct: 2 GEMAEETYGGGTPSTLNKAYWNGNIPWIQSSDLVEHQLFGVSPRKYITESGVCSSAAKLV 61 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + K F S FL + + + + I+A+ Sbjct: 62 PENSIAIVT-RVGVGKLATMPFAFATSQDFL-SLSNLKCEIWFFAYSIYKKLQRDIDAVQ 119 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + + + + EQ I + +D IT R ++ LK+ K Sbjct: 120 GTSIKGITKNELLSKSICAPSDILEQTSIGNFL----HLLDDAITLHKRKLDDLKDLKHG 175 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-----NTKLIESNI 256 + + + ++ +G + W+ + + + N NI Sbjct: 176 YLQQMFPQAGESVPLVRFAG------FTEPWQKRTLGDVAEIVGGGTPDTANPAYWNGNI 229 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 S I + K I + A + Sbjct: 230 EWFSPTEIGTETYASISHKKISELGLKNSSAKMLTGGSTILFTSRAGIGDMAILTRPAAT 289 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +++ Y + M S + +++ R+ + +P KEQ Sbjct: 290 NQGFQSLEIRKTFDVYFIYSMGSKIKEYALKNASGSTFLEISGKNLGRMKLRIPTFKEQT 349 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I N +D + +Q + LK+ + +++ Sbjct: 350 AIGNF----FRNLDDQITAQKQKLSQLKQLKFAYLQK 382 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 55/186 (29%), Gaps = 8/186 (4%) Query: 25 WKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTST 77 W+ + ++ G T ++ +I + ++ K + S+ Sbjct: 200 WQKRTLGDVAEIVGGGTPDTANPAYWNGNIEWFSPTEIGTETYASISHKKISELGLKNSS 259 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G + + I + F L+ + + + + + + Sbjct: 260 AKMLTGGSTILFTSRAGIGDMAILTRPAATNQGFQSLEIRKTFD-VYFIYSMGSKIKEYA 318 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G+T K +G + + IP EQ I +I + + +L Sbjct: 319 LKNASGSTFLEISGKNLGRMKLRIPTFKEQTAIGNFFRNLDDQITAQKQKLSQLKQLKFA 378 Query: 198 KKQALV 203 Q ++ Sbjct: 379 YLQKML 384 >gi|328947485|ref|YP_004364822.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328447809|gb|AEB13525.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 267 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 69/193 (35%), Gaps = 6/193 (3%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE-- 278 E +P++W +V +K S I S N+ QKL + + PE Sbjct: 73 EDEIPFEIPENWCWCRLGEIVYNNGQKIPDKEFSYIDIGSIDNLHQKLNDKENFVSPEQA 132 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA-VKPHGIDSTYLAWLM 337 +IV G+I++ + + + I ++ + I++ YL + + Sbjct: 133 PSRARKIVKKGDIIYATVRPYLHNMCIIDKDFEKEPIASTGFAVLACYPQINNQYLFYYL 192 Query: 338 RSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 S ++ + + + + +PP+ EQ I + ID + Sbjct: 193 LSPSFDNYANDTENSKGVAYPAINDDKLYKGVIPLPPLAEQKRIVRALEAILPVIDEYRK 252 Query: 395 KIEQSIVLLKERR 407 K E+ L+ R+ Sbjct: 253 KEEELARLILSRK 265 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/184 (22%), Positives = 64/184 (34%), Gaps = 11/184 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS--DTST 77 IP++W + N G+ K+ YI + +++ K K+ + Sbjct: 79 EIPENWCWCRLGEIV-YNNGQKIP-DKEFSYIDIGSIDNLHQKLNDKENFVSPEQAPSRA 136 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWL---LS 130 I KG I+Y + PYL I D D I ST F VL + + S Sbjct: 137 RKIVKKGDIIYATVRPYLHNMCIIDKDFEKEPIASTGFAVLACYPQINNQYLFYYLLSPS 196 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 D +G + + +P+PPLAEQ I + A ID + Sbjct: 197 FDNYANDTENSKGVAYPAINDDKLYKGVIPLPPLAEQKRIVRALEAILPVIDEYRKKEEE 256 Query: 191 FIEL 194 L Sbjct: 257 LARL 260 >gi|83815971|ref|YP_445228.1| type I restriction-modification system, S subunit, putative [Salinibacter ruber DSM 13855] gi|83757365|gb|ABC45478.1| type I restriction-modification system, S subunit, putative [Salinibacter ruber DSM 13855] Length = 408 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 70/194 (36%), Gaps = 12/194 (6%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LETRNMGLKPES 279 W+ + D V + +L + L LS N+ +K + + Sbjct: 4 WIRRILDDLPVDFIDGDRSSRYPTRDELKDEGFLFLSTKNVTKKGLRLDDLDFVSPSKFE 63 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLM 337 + P +I+ K +L + + G+I + + ++ ++L + M Sbjct: 64 EIKKGRLRPNDILITTRGSIG-KVALFESPKYKTGLINAQLLILRSDDESLSPSFLYYTM 122 Query: 338 RSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +S K SG + + D+K + + VPP+ Q I N++ +D +E Sbjct: 123 KSSSFQKRLKNYASGSAQPQIPVRDLKEIEIEVPPLTIQHRIANILGA----LDDKIELN 178 Query: 397 EQSIVLLKERRSSF 410 + L+E + Sbjct: 179 RRMNETLEEMAQTL 192 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 52/416 (12%), Positives = 113/416 (27%), Gaps = 53/416 (12%) Query: 25 WKVVPIKRF-TKLNTGRTSE--------SGKDIIYIGLEDVESGTGKYLPKDGNSRQ-SD 74 W + G S + +++ ++V + D S + Sbjct: 4 WIRRILDDLPVDFIDGDRSSRYPTRDELKDEGFLFLSTKNVTKKGLRLDDLDFVSPSKFE 63 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVL--PELLQGWLL 129 IL G + A+ G+ + Q L+L+ D P L + Sbjct: 64 EIKKGRLRPNDILITTRGSIGKVALFESPKYKTGLINAQLLILRSDDESLSPSFLYYTMK 123 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S +R++ G+ + + I + +PPL Q I + A +I+ Sbjct: 124 SSSFQKRLKNYASGSAQPQIPVRDLKEIEIEVPPLTIQHRIANILGALDDKIELNRRMNE 183 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 E+ + V G D G+E + R Sbjct: 184 TLEEMAQTLYYHYFDGSVEGG--------DIGLEELVE---------------IKPRMPV 220 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV--FRFIDLQNDKRSLRS 307 + + + ++ + K E + V+ ++ + Sbjct: 221 PDDDEVLTYVGMADVEPNRMSVTDYGKKEYTSGRRFVNHDTLMARITPSLENGKTAFVDF 280 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAM--GSGLRQSLKFEDVKR 364 E ++ + ++ S + R + + GS RQ ++ + Sbjct: 281 LDDGEMAFGSTEFTVMRAREGTSPCFVYCCARDERFREYAISTMTGSSGRQRVQENLLGE 340 Query: 365 LPVLVPPIKE---QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 E Q + + + + L+ L E R + ++G Sbjct: 341 YGF------EDFDQSRM-DQFHNRVEPLFKLIRSNTSENQTLAETRDYLLPKLISG 389 >gi|261339077|ref|ZP_05966935.1| hypothetical protein ENTCAN_05289 [Enterobacter cancerogenus ATCC 35316] gi|288318912|gb|EFC57850.1| type I restriction/modification specificity protein [Enterobacter cancerogenus ATCC 35316] Length = 460 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 55/444 (12%), Positives = 135/444 (30%), Gaps = 54/444 (12%) Query: 25 WKVVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST-V 78 W+ V + ++ +I ++ + K + Sbjct: 5 WREVSLGEISEKIGDGIHGTPVYNDSGKYYFINGSNLSDNSIKITDTTKRVAHDEFLKHR 64 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRI 137 +L G A+ + D I + KD + + ++LS + I Sbjct: 65 KELGDNTVLVSINGTIGNTALFNNEDIILGKSACYINLKDCISKYFILYILSGYLFQEYI 124 Query: 138 EAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G+T+ + K + + +P +Q I R + ++ + Sbjct: 125 QRCSTGSTIKNVSLKMMRDFRFLMPESKEDQEKAVHIIQKLDERRRLNNVQNKTLEQMSQ 184 Query: 197 EKKQA-------LVSYIVTKGLNPDVKMKDS----------------------------- 220 ++ ++ + G NP + S Sbjct: 185 TLFKSWFVDFDPVIDNALDAG-NPIPEALQSRAKLRQKIRNSADFKPLPADVRALFPAEF 243 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 +G VP W+ K LV + ++ L+ G+I + K E Sbjct: 244 EETELGWVPKDWQPKSMHDLVESASITYPLSKTDKVIFLNTGDIEKGSFLHQNYSKTEGL 303 Query: 281 --ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + + G+I+F I +N + + + + + T + + I+ +++ Sbjct: 304 PGQAKKSIKKGDILFSEIRPENKRYAFVHFESDDYVVSTKLMVLRAKNEINPLLPYFIIT 363 Query: 339 SYDLCKVFYAMG---SGLRQSLKFEDVKRLPVLVPPIKEQFDITN-VINVETARIDVLVE 394 D K + SG + F++++ + ++P I IN + Sbjct: 364 LEDNTKKLQRVAELRSGTFPQITFKELEFINFIMPNND---RIMELFINNYLTPAYNKII 420 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 ++ + + R + + ++G+ Sbjct: 421 ATKKINMNITTLRDTLLPKLISGE 444 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 65/196 (33%), Gaps = 10/196 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +G +PK W+ + + + +I++ D+E G+ + Sbjct: 248 LGWVPKDWQPKSMHDLVESASITYPLSKTDKVIFLNTGDIEKGSF-LHQNYSKTEGLPGQ 306 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWL----- 128 KG IL+ ++ P ++ F D + ST+ +VL+ K+ + LL ++ Sbjct: 307 AKKSIKKGDILFSEIRPENKRYAFVHFESDDYVVSTKLMVLRAKNEINPLLPYFIITLED 366 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + + E N MP ++ I + +I Sbjct: 367 NTKKLQRVAELRSGTFPQITFKELEFINFIMPNNDRIMELFINNYLTPAYNKIIATKKIN 426 Query: 189 IRFIELLKEKKQALVS 204 + L L+S Sbjct: 427 MNITTLRDTLLPKLIS 442 >gi|213619072|ref|ZP_03372898.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. E98-2068] Length = 165 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 35/140 (25%), Positives = 57/140 (40%), Gaps = 5/140 (3%) Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAW 335 PE YQ + +IV S + + S + KP + YL Sbjct: 29 PEDVSKYQ-LQDRDIVISRAGSVGF--SFLVQNPPSQVVFASYLIRFKPVNYFSEYYLKR 85 Query: 336 LMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + S D M +G Q++ + + L V +PPI EQ I ++ A++D Sbjct: 86 FLESSDYWNQLSLMSAGNAVQNVNAQKLSTLTVPIPPIAEQKIIAEKLDTLLAQVDSTKA 145 Query: 395 KIEQSIVLLKERRSSFIAAA 414 ++EQ +LK R + +AAA Sbjct: 146 RLEQIPQILKRFRQAVLAAA 165 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 63/163 (38%), Gaps = 3/163 (1%) Query: 47 DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG- 105 D+ ++ D+ G + + + I+ + G ++ + Sbjct: 3 DVKFLRTTDITKGAVDWSSVPYCMDAPEDVSKYQLQDRDIVISRAGSVGFSFLVQNPPSQ 62 Query: 106 --ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 S L+ +L S D ++ + G + + + + + + +PIPP Sbjct: 63 VVFASYLIRFKPVNYFSEYYLKRFLESSDYWNQLSLMSAGNAVQNVNAQKLSTLTVPIPP 122 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 +AEQ +I EK+ ++D+ + ++LK +QA+++ Sbjct: 123 IAEQKIIAEKLDTLLAQVDSTKARLEQIPQILKRFRQAVLAAA 165 >gi|223984080|ref|ZP_03634234.1| hypothetical protein HOLDEFILI_01526 [Holdemania filiformis DSM 12042] gi|223963955|gb|EEF68313.1| hypothetical protein HOLDEFILI_01526 [Holdemania filiformis DSM 12042] Length = 405 Score = 81.0 bits (198), Expect = 4e-13, Method: Composition-based stats. Identities = 57/402 (14%), Positives = 122/402 (30%), Gaps = 34/402 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK + + RT E +D + L G G+ R + G Sbjct: 26 WKSIAFGDLVHEYSDRTKEENEDTL---LSAAIEGMFLNTELFGHQRGASNKGYKKIKHG 82 Query: 85 QILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ++ +L A + G+ S + +L+ W+ S + Sbjct: 83 TMVLSTQNLHLGNANVNQRFEHGMVSPAYKTYDIIGCSVDLIAQWIKSDAAKRFFYNATT 142 Query: 143 ---GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + +W + + +P EQ + + + + RI+ + + Sbjct: 143 VGASVCRRNVEWDTLYEQSLYLPCRDEQEKVAKFLALLSNRINKQQQFVAALKKYKRGVI 202 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 Q + + +G ++ + N + S Sbjct: 203 QHIFRHSFAQGNTEWTCVRLGDV----------------FKKVSRRNTNGMVKNVITNSA 246 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITS 318 YG + Q+ + Y +++ G+ V+ + E+GII+ Sbjct: 247 EYGLVPQREFFEKDIAVDGNTANYYVIEEGDFVYNPRKSNTAPYGPFNRYSLSEKGIISP 306 Query: 319 AY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS--LKFED--VKRLPVLVPPIK 373 Y V I +YLAW +S + Y GS + + D + +PV+ P Sbjct: 307 LYTCLVLQADIYPSYLAWYFKSDAWHRYIYDNGSQGVRHDRVSMTDDLLMGIPVMFPDRT 366 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q +++ +I+ ++ ++ LL + + Sbjct: 367 RQLIYAEMLD----KIEKRLQAAQKEYELLVSMKVGCVQQLF 404 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 81/212 (38%), Gaps = 17/212 (8%) Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 GL K++ G + + W+ F LV E + + + E +LS + + Sbjct: 9 NGLEKCPKLRFPGFD------EPWKSIAFGDLVHEYSDRTKEENEDTLLSAAIEGMFLNT 62 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 E + S + Y+ + G +V +L + Q E G+++ AY G Sbjct: 63 ELFG-HQRGASNKGYKKIKHGTMVLSTQNL--HLGNANVNQRFEHGMVSPAYKTYDIIGC 119 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +A ++S + FY + R++++++ + + +P EQ + + Sbjct: 120 SVDLIAQWIKSDAAKRFFYNATTVGASVCRRNVEWDTLYEQSLYLPCRDEQEKVAKFL-- 177 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 A + + K +Q + LK+ + I Sbjct: 178 --ALLSNRINKQQQFVAALKKYKRGVIQHIFR 207 >gi|63146890|emb|CAI79473.1| HsdS-type I specificity subunit [Lactobacillus delbrueckii subsp. lactis] Length = 387 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 55/396 (13%), Positives = 126/396 (31%), Gaps = 35/396 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + T + ++ K + E + D S S I + Sbjct: 18 DWEQRKLGDVCEPITD-SIDTQKYPNEVFAEYSMPAFDASMKPDIVLGSSMNSVRKIITR 76 Query: 84 GQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L KL ++ + + +CS +F+ L V L S T+ +E Sbjct: 77 PCLLVNKLNVRKKRIWYVKKPNKNAVCSAEFIPLHSDTVDLTFLNQVAKSETFTRYLENH 136 Query: 141 CEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G + + + + IP + EQ + I +D IT L+ Sbjct: 137 SSGSSNSQKRITPRSLMLSKLHIPTIEEQ----KLIGKIFESLDHTITLHEEKKRQLERL 192 Query: 199 KQALVSYIVTKG-LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K AL+ + P V+ + EW E + +V + + +L + Sbjct: 193 KSALLQKMFADESGYPVVRFEGFSDEW--------EQRKLKDVVEKQIKGKAQLEKLAPG 244 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + Y + + N G + + ++ + Sbjct: 245 EVEYLDTSR----LNGGQAILTNGLKDVTLDDILILWDGSKAGTVYHGFEGALGST---- 296 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + +S ++ ++ + ++ + ++ + + + VP EQ Sbjct: 297 ---LKAYRTSANSKFVYQYLKRHQ-DNIYNNYRTPNIPHVQKDFLNVFTISVPVSDEQEK 352 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + ++D ++ ++ + LLKE++ F+ Sbjct: 353 IGSF----FKQLDDTIDLHQRKLDLLKEQKKGFLQK 384 >gi|68535975|ref|YP_250680.1| putative DNA restriction-modification system, specificity subunit [Corynebacterium jeikeium K411] gi|68263574|emb|CAI37062.1| putative DNA restriction-modification system, specificity subunit [Corynebacterium jeikeium K411] Length = 408 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 75/392 (19%), Positives = 147/392 (37%), Gaps = 25/392 (6%) Query: 46 KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY--LRKAIIAD- 102 ++ +I LE+V T K + + + F +G +L K+ P + + D Sbjct: 27 DEVTFIPLENV-WPTNKADDFQIVPWEKRLTGYTPFRRGDLLLPKVTPTVTHGRTMFTDT 85 Query: 103 --FDGICSTQFLVLQPKD-VLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIP 158 G+ ST+ ++ + P L L+ + A +G + + + + Sbjct: 86 ATELGVASTEVYTVRARPGTDPRWLAYLLVGTEFLGLAGASVQGTGGLKRISTQFVESYL 145 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGLNPD 214 +P EQ I + + ET ID + T+ + LL E++ ++ + G P Sbjct: 146 LPDASSEEQRAIADYLDRETAEIDAMTTDLDKMEALLTERRATTVRSTMDRAAEFGRIPL 205 Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + + G P K + + S++ + + + Sbjct: 206 GYVAQT---VSGATPSTSIAKYWADSAESGIHWVSIGDMSSVPVV-------LETQKYVS 255 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + ++ PG ++F S I + + +L Sbjct: 256 TEGRKTARLKVAGPGTVLFAMYGATLGAVSRLGVDACWNQAILGVF--PHESRLSPEFLE 313 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + G+ + +L E VK LP+ +PP++ Q I+ ++ +TA ID ++ Sbjct: 314 SALIALKPSLEALHRGN-TQNNLNAEQVKGLPIPLPPLEVQEAISQELSEKTAEIDAMLA 372 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I + LL ERR++ IAAAVTGQID+ + Sbjct: 373 DITELRDLLAERRAAVIAAAVTGQIDIPAAEE 404 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 70/187 (37%), Gaps = 18/187 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESG----------KDIIYIGLEDVESGTGKY-LPKD 67 G IP + + +G T + I ++ + D+ S K Sbjct: 201 GRIP-------LGYVAQTVSGATPSTSIAKYWADSAESGIHWVSIGDMSSVPVVLETQKY 253 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 ++ T+ + + G +L+ G L D + L + P + Sbjct: 254 VSTEGRKTARLKVAGPGTVLFAMYGATLGAVSRLGVDACWNQAILGVFPHESRLSPEFLE 313 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I + +EA+ G T ++ + + + +P+P+PPL Q I +++ +T ID ++ + Sbjct: 314 SALIALKPSLEALHRGNTQNNLNAEQVKGLPIPLPPLEVQEAISQELSEKTAEIDAMLAD 373 Query: 188 RIRFIEL 194 +L Sbjct: 374 ITELRDL 380 >gi|308179091|ref|YP_003918497.1| type I restriction-modification system specificity subunit [Arthrobacter arilaitensis Re117] gi|307746554|emb|CBT77526.1| type I restriction-modification system specificity subunit [Arthrobacter arilaitensis Re117] Length = 393 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 52/408 (12%), Positives = 117/408 (28%), Gaps = 38/408 (9%) Query: 29 PIKRFTK-LNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + + G+ +E+ I +E + G + + I G Sbjct: 5 TLGDVFERITNGKNVRQNETDGGIRITRIETISMGIVDPTRVGYAGLEHSDNEKWILRDG 64 Query: 85 QILYGKLGP---YLRKAIIAD--FDGICSTQFLVLQPKD--VLPELLQGWLLSIDVTQRI 137 IL + + A+ + + + L L+P V + + ++ Sbjct: 65 DILMSHINSPVHVGKCALYTNDLPEMVHGMNLLRLEPNKSLVDSSYAVRYFRTPAFRAQL 124 Query: 138 EA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I + + K + +I + +P L EQ I + L Sbjct: 125 RKFINQAVNQASISVKNLKSIEIALPQLEEQRRIAGILDKADALRGKRRKAIAHLDVLG- 183 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 Q++ + + ++D+ + +V V K + + Sbjct: 184 ---QSIFHEMFAGLSGDALTLRDASLRFV------SGRNMVGTGVNAHPTKKVLKVNA-- 232 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR--SAQVMERG 314 + + + + + + V+ G+++ D + V Sbjct: 233 ---ASSGEFDGSQVKPLPMNYDP-PAAHRVEVGDLIVTRASGTKDLIGVATLVDSVPSET 288 Query: 315 IITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVP 370 + V P + + Y +L RS K SG ++ + +++P Sbjct: 289 YLPDKLWKAVVNPRLLLAEYFRFLTRSTTYRKYVSNAASGAAGVSNISQAKLLDFQLVLP 348 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 PI+ Q + A I+ L + L S A G+ Sbjct: 349 PIESQQAFADR----MAAIESLKMTYRAQLADLDALFLSLQDRAFKGE 392 >gi|315453994|ref|YP_004074264.1| putative type I restriction-modification system [Helicobacter felis ATCC 49179] gi|315133046|emb|CBY83674.1| putative type I restriction-modification system [Helicobacter felis ATCC 49179] Length = 247 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 67/184 (36%), Gaps = 3/184 (1%) Query: 221 GIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 G+EW +G + + + + +L+ + + + PE Sbjct: 38 GVEWVELGEIGEFVRGSGLTKADLHPDNPSGELVGAIHYGEIHTFYNTHTAQTKSFVSPE 97 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + V G+I+ A + + I+T + A+ H + YLA+ Sbjct: 98 LAKKLKPVYCGDIILTTTSEDLKGLCKAVAWLGDSQIVTGGHAAIFRHHQNPKYLAYWFH 157 Query: 339 SYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + D K + G + + +K D+ R + +PP+ Q I +++ A L E I Sbjct: 158 TKDFIKQKRKIAYGTKVTEVKPSDLARCIIPLPPLAIQAKIVEILDQFNALTTDLQEGIP 217 Query: 398 QSIV 401 I Sbjct: 218 AEIE 221 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 64/208 (30%), Gaps = 14/208 (6%) Query: 22 PKHWKVVPIKRFTKLNTGR---------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 P+ + V + + G + SG+ + I ++ + + + + Sbjct: 36 PQGVEWVELGEIGEFVRGSGLTKADLHPDNPSGELVGAIHYGEIHTFYNTHTAQTKSFVS 95 Query: 73 SDTSTV-SIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + + G I+ + I + + P+ L W Sbjct: 96 PELAKKLKPVYCGDIILTTTSEDLKGLCKAVAWLGDSQIVTGGHAAIFRHHQNPKYLAYW 155 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + D ++ I G ++ + +P+PPLA Q I E + L Sbjct: 156 FHTKDFIKQKRKIAYGTKVTEVKPSDLARCIIPLPPLAIQAKIVEILDQFNALTTDLQEG 215 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDV 215 IE +++ Q ++ ++ + Sbjct: 216 IPAEIEAREKQYQHYLNTLLNFKESACQ 243 >gi|240128340|ref|ZP_04741001.1| hypothetical protein NgonS_06871 [Neisseria gonorrhoeae SK-93-1035] gi|268686737|ref|ZP_06153599.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-93-1035] gi|268627021|gb|EEZ59421.1| conserved hypothetical protein [Neisseria gonorrhoeae SK-93-1035] Length = 405 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 66/422 (15%), Positives = 138/422 (32%), Gaps = 43/422 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 H+K I+ N G + + ++ + + + F Sbjct: 2 NHFKKQQIQNIADFNPREQLAKGALAKSVPMAMLKEFQRQITGYEIKAFNGGAK----FR 57 Query: 83 KGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKD-VLPELLQGWLLSIDVT 134 G L K+ P L + ST+F+VL+ K+ PE L + +S D Sbjct: 58 NGDTLLAKITPCLENGKTAFVDILDDGEVAFGSTEFIVLRAKNETNPEFLYYFAISPDFR 117 Query: 135 QRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +R EG + + + + +PIP Q I + +D I + Sbjct: 118 KRAIECMEGTSGRQRVNENALKTLELPIPEPQIQQSIAAVL----SALDKKIALNKQINA 173 Query: 194 LLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTEL 244 L+E + L Y + PD K SG + V +P WEV+ + + Sbjct: 174 RLEEMAKTLYDYWFVQFDFPDANGKPYKSSGGDMVFDETLKREIPKGWEVRSLNQVADIV 233 Query: 245 NRKNTKLIESNILSLSYGNI--IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 ++ N+ + R ++ + + G+I+ D Sbjct: 234 MGQSPDGASYNLEQEGTIFFQGSTDFDWRFPNVRQYTTSPTRFAQKGDILLSVRAPVGDL 293 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 I A++ ++++L ++M+ + S+ +D+ Sbjct: 294 -----NISPFECCIGRGLAALRSKSGNNSFLFYVMKYFKTVFERRNTEGTTFGSITKDDL 348 Query: 363 KRLPVLVPP---IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 L ++ P +++ +I ++ D ++ Q L + R + + GQ+ Sbjct: 349 HSLKLVAPADNVLEKYNEIA-------SKYDEMIFIGSQQNHQLTQLRDFLLPMLMNGQV 401 Query: 420 DL 421 + Sbjct: 402 SV 403 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 64/204 (31%), Gaps = 8/204 (3%) Query: 10 YKDSGV-----QWIG-AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + + IPK W+V + + + G++ + + G+ + Sbjct: 200 YKSSGGDMVFDETLKREIPKGWEVRSLNQVADIVMGQSPDGASYNLEQEGTIFFQGSTDF 259 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + N RQ TS KG IL P I+ F+ L+ K Sbjct: 260 DWRFPNVRQYTTSPTRFAQKGDILLSVRAPV-GDLNISPFECCIGRGLAALRSKSGNNSF 318 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L +++ T EG T + ++ + P E I Sbjct: 319 LF-YVMKYFKTVFERRNTEGTTFGSITKDDLHSLKLVAPADNVLEKYNEIASKYDEMIFI 377 Query: 184 LITERIRFIELLKEKKQALVSYIV 207 + + +L L++ V Sbjct: 378 GSQQNHQLTQLRDFLLPMLMNGQV 401 >gi|282933735|ref|ZP_06339090.1| type I restriction modification DNA specificity domain protein [Lactobacillus jensenii 208-1] gi|281302114|gb|EFA94361.1| type I restriction modification DNA specificity domain protein [Lactobacillus jensenii 208-1] Length = 412 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 51/401 (12%), Positives = 117/401 (29%), Gaps = 37/401 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDGNSRQS---DTST 77 WK V + ++ G T + + G E G YL + S+ Sbjct: 38 WKKVKLGDVAEIIGGGTPSTSNLEYWDGNINWFTPTEVGKTIYLHESQRKLSELGLKKSS 97 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G IL+ II + + F +QP + + + LS + + Sbjct: 98 ARLLNPGAILFTSRAGIGNTGIIINPSA-TNQGFQSIQPNKNIIDSYFIFCLSSRLKRYA 156 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G+T + + + I EQ I I + + + +L K Sbjct: 157 LKHSAGSTFTEISGSEMKKAKIRICAKNEQNKISTCIKSLDSLLSLQQRKLELEKQLKKF 216 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q ++S P+++ D W + ++++ TK Sbjct: 217 CLQNILSD---NKKCPNLRFHDFSTNWKKVKVGDIFTVTRGKVLSKDKISKTKDHIMKYP 273 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S + L E T+ R + ++ + + G + Sbjct: 274 VYSSQTLNNGLLGYYHDYLFEDAITWTTDGANAGTVRLRAGKFYGTNVNGVLLSKNGYVN 333 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV-PPIKEQF 376 A ++ + + L ++ + + P ++EQ Sbjct: 334 DA----NAEALNQIAWKY-------------VSKVGNPKLMNNVMQNIMFSIAPSVEEQV 376 Query: 377 DITN--VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I+ +++ ++ +I + +I + + + + Sbjct: 377 IISKLFILHSKSLKI------YQANINVYTQLKQFLLQNLF 411 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 68/209 (32%), Gaps = 15/209 (7%) Query: 204 SYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + L P V+ + W +G V + E N + Sbjct: 18 THADEQRLYPKVRFRGFDEPWKKVKLGDVAEIIGGGTPSTSNLEYWDGNINWFTPTEVGK 77 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + +GLK + ++++PG I+F + + + +G + Sbjct: 78 TIYLHESQRKLSELGLK---KSSARLLNPGAILFTSRAGIGNTGIIINPSATNQGFQS-- 132 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 I +Y + + S + ++K+ + + EQ I+ Sbjct: 133 --IQPNKNIIDSYFIFCLSSRLKRYALKHSAGSTFTEISGSEMKKAKIRICAKNEQNKIS 190 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRS 408 I +D L+ ++ + L K+ + Sbjct: 191 TCI----KSLDSLLSLQQRKLELEKQLKK 215 >gi|320536547|ref|ZP_08036572.1| type I restriction modification DNA specificity domain protein [Treponema phagedenis F0421] gi|320146602|gb|EFW38193.1| type I restriction modification DNA specificity domain protein [Treponema phagedenis F0421] Length = 444 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 48/366 (13%), Positives = 109/366 (29%), Gaps = 30/366 (8%) Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVL 114 + S G I Y + I S ++V Sbjct: 65 NNQTGIFDAYIESGSKIKQKYKRMENGWIAYNPYRVNIGSIGIKKKEHKYEFISPAYVVF 124 Query: 115 -QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 +LPE L + ++ I G+ + ++ + + +P+P L+EQ Sbjct: 125 SCQNSLLPEYLFLTMKTLKFNSIIRDNTTGSVRQNLSYENLKTLQIPLPTLSEQQ---AL 181 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 I ++ + KE ++ L+ + + KDS +E+V Sbjct: 182 IDTYNAKLQQAEDLEKLAEQKKKEIEEYLLQELGIEEHENQSVKKDSYLEFVRFKDIERW 241 Query: 234 VKPFFALVTELNRKNTKLIES-------------------NILSLSYGNIIQKLETRNMG 274 + N + +I + +I + + Sbjct: 242 DCYNNKNKGHSSFYNEVPLSKILIEKPQYGAAYKAKDKASDIRYIRITDITEDGSLTDTF 301 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTY 332 + ++ +++ + + K L + + I + + +D Y Sbjct: 302 ASADQFKEQYLLNQYDFLIARSGATVGKTFLYE-EKYGKAIFAGYLIRFILNKSMVDPYY 360 Query: 333 LAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + +SY K + ++ + P+++PP+ Q I I+ + RI Sbjct: 361 ILVYTKSYIYKKWIQNNMRVSGQPNINSQQYMDSPIILPPLDIQNRIVAHISEQKERIKE 420 Query: 392 LVEKIE 397 L ++ E Sbjct: 421 LKQQAE 426 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 67/186 (36%), Gaps = 4/186 (2%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + + + E + K + N + + + + Y+ ++ G Sbjct: 32 SRFPIVTLNEHIKEESTKYNISDPQTNYGMLGVNNQTGIFDAYIESGSKIKQKYKRMENG 91 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 I + + ++ + I + + + + YL M++ + Sbjct: 92 WIAYNPYRVNIGSIGIKKKEHKYEFISPAYVVFSCQNSLLPEYLFLTMKTLKFNSIIRDN 151 Query: 350 GSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +G +RQ+L +E++K L + +P + EQ + + N + + + L + EQ ++E Sbjct: 152 TTGSVRQNLSYENLKTLQIPLPTLSEQQALIDTYNAKLQQAEDLEKLAEQKKKEIEEY-- 209 Query: 409 SFIAAA 414 + Sbjct: 210 -LLQEL 214 >gi|258654734|ref|YP_003203890.1| Restriction endonuclease S subunits-like protein [Nakamurella multipartita DSM 44233] gi|258557959|gb|ACV80901.1| Restriction endonuclease S subunits-like protein [Nakamurella multipartita DSM 44233] Length = 411 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 59/167 (35%), Gaps = 7/167 (4%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS---AQVMERGIITSA 319 N I++ +G++ + G+I+F + +R+ + G + + Sbjct: 49 NEIREAGIARIGVEDAHRLRRHALREGDIIFSRRGDVGRRSLVRTREAGWLCGTGCLAAR 108 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + + + +L + + G +L + LPV +P EQ I Sbjct: 109 FGSDRTTVNPAYVADYLGGTSAQAWLVDNAVGGTMPNLNTSILSALPVWLPSKLEQDRIV 168 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + ID I+ I + + + +TG+ L G ++ Sbjct: 169 AALEDVRKVIDS----IQHLIAKRQAIKQGMMQHLLTGRTRLPGFNE 211 Score = 80.2 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 47/354 (13%), Positives = 117/354 (33%), Gaps = 25/354 (7%) Query: 81 FAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQP----KDVLPELLQGWLLSIDVT 134 +G I++ + G R++++ + +C T L + V P + +L Sbjct: 72 LREGDIIFSRRGDVGRRSLVRTREAGWLCGTGCLAARFGSDRTTVNPAYVADYLGGTSAQ 131 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G TM + + + +P+ +P EQ I + ID++ + + Sbjct: 132 AWLVDNAVGGTMPNLNTSILSALPVWLPSKLEQDRIVAALEDVRKVIDSIQHLIAKRQAI 191 Query: 195 LKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + Q L++ G N G + + L +L Sbjct: 192 KQGMMQHLLTGRTRLPGFNEAWSETTLGAVARFSKGAGLPKAALTSSGSTLCIHYGELFT 251 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + + + E +++ D+ + SA Sbjct: 252 FYGPEIRQ--VFSRTTPTGRVVVSEDL---------DVLMPTSDVTPRGLAKASAIHGAG 300 Query: 314 GIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 ++ + ++P ++A +R + +V + L D++ + +P Sbjct: 301 VVLGGDILIIRPDKAHAHGPFVAHAIRHHA-DQVLQLVRGSTVYHLYATDMRNFALSLPS 359 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + EQ I + +++ L E++ ++ + ++ + +TG L E+ Sbjct: 360 VNEQRAIAGALLDADRQLEALEERLMKA----RAFKTGMMQRLLTGHTRLPTEA 409 >gi|60681330|ref|YP_211474.1| putative type I restriction enzyme, partial [Bacteroides fragilis NCTC 9343] gi|60492764|emb|CAH07538.1| putative type I restriction enzyme, partial [Bacteroides fragilis NCTC 9343] Length = 372 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 60/393 (15%), Positives = 128/393 (32%), Gaps = 52/393 (13%) Query: 24 HWKVVPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 WK I + + + + + + I + LE + GTG+ L ++ Q F Sbjct: 26 EWKKDIIGNVISVKSEKYNPHSNRTEFICVELESISQGTGELLETFNSAEQKSIKNK--F 83 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G +L+GKL PYLRK + F+G+CS++ V++ + P L ++ + Sbjct: 84 SPGTVLFGKLRPYLRKFYLPYFEGVCSSEIWVMRSNKIEPAFLYSFIQTPYFISLANQSS 143 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G I R KI ID I + + I L+ + Sbjct: 144 GSK----MPRADWGLIETSKIAYPPNSAERVKIGKFLKLIDERIATQNKIIAHLESLIKG 199 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-NTKLIESNILSLS 260 L + ++ +W+ ++T K L E NI Sbjct: 200 LTNQLLI-------------------PNSNWQPTTIGQVLTINPGKDYKHLKEGNIPVYG 240 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G + + + ++ + I + + Sbjct: 241 TGGYMLSVNDYLYDGESVCIGRKGTINK-----------------PIFLTGKFWTIDTLF 283 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + + +L ++ D K A G SL ++++ + +P + Q I Sbjct: 284 YTSNFNSLLPRFGYYLFKTIDWLKYNEASG---VPSLSKVSIEKIHISLPSLAIQNSICR 340 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +++ ++ E + + +++ + Sbjct: 341 LLDSIYDKL----ALEESVLNNHQTQKAFILQQ 369 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 19/190 (10%), Positives = 54/190 (28%), Gaps = 6/190 (3%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 P+ + ++ K+ K + + ++ + L Q Sbjct: 20 FPEFSGEWKKDIIGNVISVKSEKYNPHSNRTEFICVELESISQGTGELLETFNSAEQKSI 79 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVME-RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 + + + LR + G+ +S ++ + I+ +L +++ + Sbjct: 80 KNKFSPGTVLFGKLRPYLRKFYLPYFEGVCSSEIWVMRSNKIEPAFLYSFIQTPYFISLA 139 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + ++ + PP E+ I + ID + + I L+ Sbjct: 140 NQSSGSKMPRADWGLIETSKIAYPPNSAERVKIGKFL----KLIDERIATQNKIIAHLES 195 Query: 406 RRSSFIAAAV 415 + Sbjct: 196 LIKGLTNQLL 205 >gi|154500307|ref|ZP_02038345.1| hypothetical protein BACCAP_03974 [Bacteroides capillosus ATCC 29799] gi|150271039|gb|EDM98313.1| hypothetical protein BACCAP_03974 [Bacteroides capillosus ATCC 29799] Length = 305 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 39/309 (12%), Positives = 99/309 (32%), Gaps = 7/309 (2%) Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 P D + ++ + + + I +G + W+ + I P PP Q I + Sbjct: 3 FIPYDGISDVRFVKYCFDMLQRDCKQISQGTAQDNLSWQKLSTIEFPAPPFETQRRIADI 62 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + A I+ + E + + + G + W D + Sbjct: 63 LSAYDDLIENNRKQIKLLEEATQRLYKEWFVDLRFPGYEHTKIVDGVPEGWKKSRADTFF 122 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG-EIV 292 ++ + I +S ++ + + E IV IV Sbjct: 123 NITIGKTPPRAEQQWFTDAKKGIPWVSISDM--GDTSAFIFDTSEELTADAIVKHNVTIV 180 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 L + K ++ + + T+ +A S + S Sbjct: 181 PAGTVLLSFKLTVGRVSITGADMCTNEAIAHFRIADPSNREYAYCYLKNYHYDTLGSTSS 240 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +++ + +K +P ++P I + + + ++ +Q I+ L++ R + Sbjct: 241 ISKAVNSKIIKAMPFVMPN----HAIMDEFSEHCRPLLEQIKTKQQVILNLQQARDRLLP 296 Query: 413 AAVTGQIDL 421 ++G++++ Sbjct: 297 KLMSGEVEV 305 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 55/198 (27%), Gaps = 14/198 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDV--ESGTGKYLPKDGN 69 +P+ WK F + G+T K I ++ + D+ S ++ Sbjct: 109 VPEGWKKSRADTFFNITIGKTPPRAEQQWFTDAKKGIPWVSISDMGDTSAFIFDTSEELT 168 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + V+I G +L + + I D + + P + Sbjct: 169 ADAIVKHNVTIVPAGTVLLS-FKLTVGRVSITGADMCTNEAIA--HFRIADPSNREYAYC 225 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + K I +P +P A E +I T + Sbjct: 226 YLKNYHYDTLGSTSSISKAVNSKIIKAMPFVMPNHAIMDEFSEHCRPLLEQIKTKQQVIL 285 Query: 190 RFIELLKEKKQALVSYIV 207 + L+S V Sbjct: 286 NLQQARDRLLPKLMSGEV 303 >gi|327472744|gb|EGF18171.1| type I restriction enzyme, S subunit [Streptococcus sanguinis SK408] Length = 433 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 50/437 (11%), Positives = 127/437 (29%), Gaps = 48/437 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD---------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 W++ + + G++ ++ + DV++ D Sbjct: 5 SWEITSLSELGAFSRGKSKHRPRNDAKLFEGGKYPLVQTGDVKAANLYITKNDSYYNDFG 64 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ G + + + + I + + L + + Sbjct: 65 LKQSKLWPAGTLCIT-IAANIAETAILSYPMCFPDSIVGFNANPEKSSELFVYYFFEYIK 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I+ G+ + + + + + +P Q I E + ID I + + Sbjct: 124 KEIQKSASGSIQDNINIDYLSKMRIKVPEKKYQDKIVELL----SSIDKKILLNNQINQE 179 Query: 195 LKEKKQALVSYIVTKGLNPDV---KMKDSGIEWVG------LVPDHWEVKPFFALVTELN 245 LK + L Y + PD K SG + V +P+ W V F + +++ Sbjct: 180 LKAMAKTLYDYWFVQFDFPDQNGNPYKSSGGKMVYNPDLKREIPEGWGVTTFSSWISDNK 239 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---------SYETYQIVDPGEIVFRFI 296 + S + I+ + + + +++ +IV Sbjct: 240 TGDWGKETSQGNYTLEVDCIRGADINGLSGNGKTDMPTRFILEKNKNKLLTDFDIVIEIS 299 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGI------DSTYLAWL------MRSYDLCK 344 + + R + E + + + + + + Sbjct: 300 GGSPTQSTGRIVGISENVLNRFDLPLICSNFCKAVSLKEQETFYNFVYEWKNLYDNGVLF 359 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + SG++ L V + PPI + ++ +I +L+ + L Sbjct: 360 SWEGKTSGIKNLLFDSFVTNYHIAQPPIGLMEQFFDYVSSVDRKIQLLL----KQNQELT 415 Query: 405 ERRSSFIAAAVTGQIDL 421 + R + + GQ+ + Sbjct: 416 QLRDWLLPMLMNGQVKV 432 >gi|255322117|ref|ZP_05363264.1| type I restriction-modification system, S subunit [Campylobacter showae RM3277] gi|255300815|gb|EET80085.1| type I restriction-modification system, S subunit [Campylobacter showae RM3277] Length = 290 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 13/88 (14%), Positives = 35/88 (39%), Gaps = 4/88 (4%) Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + ++ K S+ ++ + +PP+ EQ I +++ I++ Sbjct: 1 MIHIFKTNTFFKQVKNDLGATINSINNGNLLNFKIPLPPLDEQKKIAEILSTWDEAINLT 60 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQID 420 + IE K+ + + + +T +I Sbjct: 61 INLIESK----KQFKKALMQNLLTAKIR 84 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 51/290 (17%), Positives = 99/290 (34%), Gaps = 30/290 (10%) Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 GAT++ + + N +P+PPL EQ I E + I+ I + K Sbjct: 15 KNDLGATINSINNGNLLNFKIPLPPLDEQKKIAEILSTWDEAINLTINLIESKKQFKKAL 74 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q L++ + D W+ ++ E K+ E +S Sbjct: 75 MQNLLTAKIRFPQFKDE----------------WKETKLGKILKEHKIKSDNKSEVFSVS 118 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV-MERGIIT 317 + G II ++E E Y +V P ++V+ + + + I++ Sbjct: 119 VHKG-IINQIEHLGRSFSAEDTSNYNLVKPFDLVYTKSPTGDFPFGIIKQNLNPFNVIVS 177 Query: 318 SAYMAVKP-HGIDSTYLAWLMRS-----YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP- 370 Y +P + T L + S L + ++ + +LVP Sbjct: 178 PLYGVFEPINKFLGTLLHYFFESSIRTNNYLKPIIQKGAKNTI-NISNDTFLSRSILVPI 236 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 + EQ I V+ D + + + LK+++ + + G+I Sbjct: 237 NLDEQQKIAEVLMA----CDDEINLLNLKLENLKKQKQGLMQKLLKGEIR 282 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 73/220 (33%), Gaps = 22/220 (10%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 +PQ+KD WK + + K + ++ ++ ++ + + ++L Sbjct: 84 RFPQFKD-----------EWKETKLGKILKEHKIKS-DNKSEVFSVSVHKGIINQIEHLG 131 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVL 120 + + DTS ++ ++Y K I F+ I S + V +P + Sbjct: 132 RSFS--AEDTSNYNLVKPFDLVYTKSPTGDFPFGIIKQNLNPFNVIVSPLYGVFEPINKF 189 Query: 121 PELLQGWLLSIDVT--QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L + + ++ I + + + + I ++ Sbjct: 190 LGTLLHYFFESSIRTNNYLKPIIQKGAKNTINISNDTFLSRSILVPINLDEQQKIAEVLM 249 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 D I +E LK++KQ L+ ++ + K Sbjct: 250 A-CDDEINLLNLKLENLKKQKQGLMQKLLKGEIRTCYVKK 288 >gi|118443819|ref|YP_878480.1| type I restriction-modification system specificity subunit [Clostridium novyi NT] gi|118134275|gb|ABK61319.1| type I restriction-modification system specificity subunit, putative [Clostridium novyi NT] Length = 401 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 59/413 (14%), Positives = 123/413 (29%), Gaps = 32/413 (7%) Query: 28 VPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVS 79 + + +++ G+ G D YI D+ G K + + + +T Sbjct: 2 IKLGEISEIKGGKRLPKGCDFVEQETKYKYIRARDIGEGKIKCDELQYIDEKTYETIKNY 61 Query: 80 IFAKGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + + +G + I D + + + K+ L +L Q Sbjct: 62 TVSTNDVCITIVGANIGDIGIVSEELDGANLTENAVKITKLKNYDSSFLLYYLSMDKSKQ 121 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ + GA I I +P + Q + I I+ + E Sbjct: 122 EMQTLAAGAAQPKLGIYKIKEILVPKVDINIQKKVVNIISKYDYLIENNLKRIKLLEESA 181 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK-LIES 254 + + G E+V VP W +V+ L E Sbjct: 182 ELIYKEWFVNFRFPGYEKC--------EFVDGVPKGWSKVHLSEIVSTQYGFTESALNED 233 Query: 255 NILSLSYGNIIQKLETRNMG-----LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + G I K N ++ + + +I+ + K + Sbjct: 234 TGVKYLRGKDINKTSYINWSSVPWCKIEDNQKDKYALKKHDILVIRM-ADPGKVGIVEED 292 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 + + + I YL + + S + +G R+S + + + +L Sbjct: 293 IEAVFASYLIRININNDNIKPYYLFYFLNSDFYQQFISQSSTGATRKSANAKLITDVDIL 352 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P + + + VL+ + Q LKE R I + G+I++ Sbjct: 353 MPE----KKVIEQFETKITDLRVLLNNLLQQNQKLKEARDILIPKLIMGEIEV 401 Score = 41.3 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 43/137 (31%), Gaps = 7/137 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTS 76 +PK W V + G T + Y+ +D+ + + + + Sbjct: 206 VPKGWSKVHLSEIVSTQYGFTESALNEDTGVKYLRGKDINKTSYINWSSVPWCKIEDNQK 265 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL---PELLQGWLLSIDV 133 K IL ++ + I+ + +L+ + P L +L S Sbjct: 266 DKYALKKHDILVIRMADPGKVGIVEEDIEAVFASYLIRININNDNIKPYYLFYFLNSDFY 325 Query: 134 TQRIEAICEGATMSHAD 150 Q I GAT A+ Sbjct: 326 QQFISQSSTGATRKSAN 342 >gi|148984625|ref|ZP_01817893.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP3-BS71] gi|147923016|gb|EDK74131.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP3-BS71] gi|301799880|emb|CBW32456.1| putative type I RM modification enzyme [Streptococcus pneumoniae OXC141] Length = 368 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 40/339 (11%), Positives = 83/339 (24%), Gaps = 24/339 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + G + +D G E + + N I G Sbjct: 2 KKVKLGEVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSTEINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWRGRSAVLNQHIFKVVLDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI +P L EQ I ++ + I + Sbjct: 120 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNL------------ 167 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 L + G + D+ + + E L L+ N+ Sbjct: 168 -----LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKNVT 222 Query: 266 QKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + + + + ++ +IV + + I S + Sbjct: 223 KNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSGMV 282 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 ++P + +++ + + L Sbjct: 283 ILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPIT 320 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 41/142 (28%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + + + IV+ G+I+ + ++ V I Sbjct: 39 TSTEINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWRGRSAVLNQHIFKVVLDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 >gi|183603438|ref|ZP_02716813.2| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC3059-06] gi|183576866|gb|EDT97394.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CDC3059-06] Length = 424 Score = 80.6 bits (197), Expect = 5e-13, Method: Composition-based stats. Identities = 61/424 (14%), Positives = 135/424 (31%), Gaps = 66/424 (15%) Query: 34 TKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ G + KD I +I + D E G ++S + KG Sbjct: 2 VEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGT 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGA 144 L + R I+ I + ++ L + ++LS + V + ++ GA Sbjct: 62 FLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGA 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----Q 200 + + + + +I +P+PPL+EQ I E I + ++D R +L KE + Sbjct: 122 VVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKK 181 Query: 201 ALVSYIVTKGLNPDVKMKDS-----------------------------------GIEWV 225 +++ Y + L +S + Sbjct: 182 SILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYY 241 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ- 284 G +P +W V + + + K + +I + II+ + + + Y Sbjct: 242 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINN-KGVRIIRGGNIKPLEFSLLDNDYYID 300 Query: 285 ---------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDST 331 + +++ G++ ++ + I S Sbjct: 301 TQFISSEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISK 360 Query: 332 YLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +L + + S K + ++ + L + + P +EQ IT + + Sbjct: 361 FLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEK 420 Query: 389 IDVL 392 ++ L Sbjct: 421 VNQL 424 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 37 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 93 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 94 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIES 153 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 154 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 191 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 73/182 (40%), Gaps = 17/182 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 242 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 301 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 302 QFISSEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 361 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 362 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 421 Query: 182 DT 183 + Sbjct: 422 NQ 423 >gi|291547734|emb|CBL20842.1| Restriction endonuclease S subunits [Ruminococcus sp. SR1/5] Length = 414 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 49/414 (11%), Positives = 111/414 (26%), Gaps = 33/414 (7%) Query: 29 PIKRFTKLNTGRTSES-----GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFA 82 + + G + + + + E G K + Sbjct: 5 KLGEILSVKHGWAFKGEYFAEDGEQSILTPGNFFEKGGFKPNNGKERYYTGTYPKEYLCH 64 Query: 83 KGQILYGKL----GPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLLSIDVTQ 135 KG ++ G A++ + + Q + K + ++ V + Sbjct: 65 KGDLIVAMTQQAEGLLGSTALVPENNKYLHNQRIGLITCDEKRLNKLFAYYLFMTKSVRE 124 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++E G + H + I ++ + IP + Q I + + +I + Sbjct: 125 QLERSSSGTKVKHTSPEKIYDVEVEIPDVISQQKIANLLWSIDEKIANNNAINDNLEQQA 184 Query: 196 KE-KKQALVSYIVTKGLNPDVKMKDSGIEWVG----LVPDHWEVKPFFALVTEL----NR 246 K + + + W +P W + + + + Sbjct: 185 KLLYNYWFIQFNFPDENGNPYHSSGGQLVWNKNLQQEIPQDWRSGNLYDIADYINGIACQ 244 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K E + L + + T + + I+ G+I+F + Sbjct: 245 KYRPFDEEHSLPVVKIREMNGGITNDTERVSSTIPAKNIISSGDILFSWSASLE-----V 299 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKR 364 + V P S+ +L S L + E ++ Sbjct: 300 IMWYGVDAGLNQHIFKVVPKSYFSSEYVYLQLSEYLIHFIKIAEARKTTMGHITSEHLQD 359 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +++PP I + I + ++I L + R I + GQ Sbjct: 360 SHIILPPAN----IIKNFSEYVRPIYQMKKQIANETSELIKLRDWLIPMLMNGQ 409 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 58/192 (30%), Gaps = 12/192 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP+ W+ + G R + + + + ++ G + ++ Sbjct: 221 EIPQDWRSGNLYDIADYINGIACQKYRPFDEEHSLPVVKIREMNGGITNDTERVSSTIP- 279 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 +I + G IL+ L + D + + PK LS + Sbjct: 280 ---AKNIISSGDILFSWS-ASLEVIMWYGVDAGLNQHIFKVVPKSYFSSEYVYLQLSEYL 335 Query: 134 TQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 I+ A TM H + + + + +PP E + + E I Sbjct: 336 IHFIKIAEARKTTMGHITSEHLQDSHIILPPANIIKNFSEYVRPIYQMKKQIANETSELI 395 Query: 193 ELLKEKKQALVS 204 +L L++ Sbjct: 396 KLRDWLIPMLMN 407 >gi|311110800|ref|ZP_07712197.1| putative type I restriction modification DNA specificity domain protein [Lactobacillus gasseri MV-22] gi|311065954|gb|EFQ46294.1| putative type I restriction modification DNA specificity domain protein [Lactobacillus gasseri MV-22] Length = 391 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 46/385 (11%), Positives = 105/385 (27%), Gaps = 27/385 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG------KDIIYIGLEDV------ESGTGKYLPKDGNSRQS 73 K+ + + +G + + D+ + R Sbjct: 5 KIKLLGEICEFYSGTGFPKKFQGNLEGKYPFYKVGDISKSADENKNFLTKSDNYVDERIV 64 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 T I I++ K+G L+ C VL K +L ++ Sbjct: 65 KTLKGKIVPPKTIVFAKIGEALKLNRRMITSTECLIDNNVLGIKPKNDSILAEYIFYFMK 124 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 ++E E T+ + I + +P + Q I + +ID + ++ Sbjct: 125 FVKLENYSESTTVPSVRKSELEKIKIRVPSIQNQQKIISIL----EKIDKTKKSKTESLK 180 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L E +A + + K K S IE + I Sbjct: 181 KLNELIKARFVEMFGDPQDSKSKWKKSTIEK-------CCTLKSGKTLPRNIENEGGNIP 233 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + T + + I G ++F ++ + Sbjct: 234 YVKVKDMNSLENTTYITTSTRFVSDKTANKSIFPVGTVIFPKRGG---AIGTNKKRLTKV 290 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 I + +L +++ + + +D+ L + +PP+ Sbjct: 291 PICADLNIMGVIPDNTRISSYYLFEYFNMVDLNTLNNGSSVPQINNKDINPLNINIPPLS 350 Query: 374 EQFDITNVINVET-ARIDVLVEKIE 397 Q + N ++ ++ + +V + Sbjct: 351 LQNEFANFVHQVDKSKFENIVYLNK 375 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 70/195 (35%), Gaps = 9/195 (4%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M+D I+ +G + + + F + +S S L + + Sbjct: 1 MEDIKIKLLGEICEFYSGTGFPKKFQGNLEGKYPFYKVGDISKSADENKNFLTKSDNYVD 60 Query: 277 PESYE--TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + +IV P IVF I R +I + + +KP DS Sbjct: 61 ERIVKTLKGKIVPPKTIVFAKIGEALKLN--RRMITSTECLIDNNVLGIKPKN-DSILAE 117 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 ++ K+ S S++ +++++ + VP I+ Q I +++ +ID + Sbjct: 118 YIFYFMKFVKLENYSESTTVPSVRKSELEKIKIRVPSIQNQQKIISILE----KIDKTKK 173 Query: 395 KIEQSIVLLKERRSS 409 +S+ L E + Sbjct: 174 SKTESLKKLNELIKA 188 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 57/169 (33%), Gaps = 6/169 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESG-----KDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTV 78 WK I++ L +G+T +I Y+ ++D+ S Y+ T+ Sbjct: 204 WKKSTIEKCCTLKSGKTLPRNIENEGGNIPYVKVKDMNSLENTTYITTSTRFVSDKTANK 263 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 SIF G +++ K G + ++ + +L + Sbjct: 264 SIFPVGTVIFPKRGGAIGTNKKRLTKVPICADLNIMGVIPDNTRISSYYLFEYFNMVDLN 323 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + G+++ + K I + + IPPL+ Q + I Sbjct: 324 TLNNGSSVPQINNKDINPLNINIPPLSLQNEFANFVHQVDKSKFENIVY 372 >gi|295697502|ref|YP_003590740.1| restriction endonuclease S subunits [Bacillus tusciae DSM 2912] gi|295413104|gb|ADG07596.1| restriction endonuclease S subunits [Bacillus tusciae DSM 2912] Length = 206 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 65/203 (32%), Gaps = 7/203 (3%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-E 281 E +P+ W + R Y R L Sbjct: 3 EGPYKLPEGWRWVRLGEVCQCERRTVDPRRSPKATFYLYSIPAYDESQRPQRLDGSQIGS 62 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG-IITSAYM--AVKPHGIDSTYLAWLMR 338 + ++ PG +F ++ + + + + + + ++ +M P+ +D YL L+ Sbjct: 63 SKVVIGPGVCLFSKLNPRIPRAWVVAGVPQDGMPVASTEFMPLRPNPNVLDLDYLGKLLM 122 Query: 339 SYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + +G RQ LK + + +PP+ EQ I + +I Sbjct: 123 TEWFVSQVRLDVTGATGSRQRLKPGVILNALIPLPPLDEQGRIVAHLEAVQEKIRAFKSA 182 Query: 396 IEQSIVLLKERRSSFIAAAVTGQ 418 ++ L+ S + A G+ Sbjct: 183 QSETDQELRRLEQSMLDKAFRGE 205 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 77/206 (37%), Gaps = 20/206 (9%) Query: 20 AIPKHWKVVPIKRFT-----KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W+ V + ++ R+ ++ + I D + Sbjct: 7 KLPEGWRWVRLGEVCQCERRTVDPRRSPKATFYLYSIPAYDESQRPQRLDGSQI------ 60 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQ--GW 127 S+ + G L+ KL P + +A + D + ST+F+ L+P + +L Sbjct: 61 GSSKVVIGPGVCLFSKLNPRIPRAWVVAGVPQDGMPVASTEFMPLRPNPNVLDLDYLGKL 120 Query: 128 LLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L++ ++ GAT I N +P+PPL EQ I + A +I Sbjct: 121 LMTEWFVSQVRLDVTGATGSRQRLKPGVILNALIPLPPLDEQGRIVAHLEAVQEKIRAFK 180 Query: 186 TERIRFIELLKEKKQALVSYIVTKGL 211 + + + L+ +Q+++ L Sbjct: 181 SAQSETDQELRRLEQSMLDKAFRGEL 206 >gi|328947425|ref|YP_004364762.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328447749|gb|AEB13465.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 195 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 60/183 (32%), Gaps = 8/183 (4%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNI---IQKLETRNMGLKPESYETYQIVDPGEIVF 293 A T K I +S G + + + K + ++V +V Sbjct: 9 LCAGATPSTSKPEYWENGTISWMSSGEVNLGQVYQTEKKITKKGFENCSTKMVPKNTVVV 68 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + ++ ++ S + +DS YL + ++S + G G Sbjct: 69 ALAGQGKTRGTVAITRISL-CTNQSLCSILTKDFVDSYYLYFYLKSQYQRLRAISSGEGT 127 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSS 409 R L ++ + +PP+ Q I +++ + + + + I K+ R Sbjct: 128 RGGLSLRILRDFELPLPPLSVQQRIVKILDRFDSLCNDISSGLPAEIEARKKQYEYYRDK 187 Query: 410 FIA 412 ++ Sbjct: 188 LLS 190 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 47/163 (28%), Gaps = 9/163 (5%) Query: 30 IKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + +L G T + K I ++ +V G K + + + + Sbjct: 3 LGEIGELCAGATPSTSKPEYWENGTISWMSSGEVNLGQVYQTEKKITKKGFENCSTKMVP 62 Query: 83 KGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 K ++ G I + + KD + + L + Sbjct: 63 KNTVVVALAGQGKTRGTVAITRISLCTNQSLCSILTKDFVDSYYLYFYLKSQYQRLRAIS 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 T + + + +P+PPL+ Q I + + + Sbjct: 123 SGEGTRGGLSLRILRDFELPLPPLSVQQRIVKILDRFDSLCND 165 >gi|254432101|ref|ZP_05045804.1| HsdS, type I site-specific deoxyribonuclease [Cyanobium sp. PCC 7001] gi|197626554|gb|EDY39113.1| HsdS, type I site-specific deoxyribonuclease [Cyanobium sp. PCC 7001] Length = 361 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 75/227 (33%), Gaps = 7/227 (3%) Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 +QA+++ + L + + +P + ++ N ++ Sbjct: 7 RFRQAVLAAATSGELTREWREARGIESLPRKIPLGEVIHEMRNGLSPKPSLNPPGVKILR 66 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ-NDKRSLRSAQVMERGI 315 + I + R + L + + ++ G+++F + + +A + Sbjct: 67 IGAVRPGTIDWTDHRYLELSDKDLAAF-RLEAGDLIFTRYNGTLEFVGACANATSIPDVY 125 Query: 316 ITSA---YMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVP 370 + + Y+ S ++ S ++ + D+K + +P Sbjct: 126 VYPDKLIRVRCDTSRALPAYVEISFSSVEIRDHIEGLVKSSAGQKGISGTDLKNIFFPLP 185 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 I+EQ +I + + D L ++ + L+ + +A A G Sbjct: 186 SIEEQIEIVHQVQALFTLADQLESRLSAARKLVDRLTPALLAKAFRG 232 >gi|309972662|gb|ADO95863.1| Probable DNA specificity protein of restriction modification system [Haemophilus influenzae R2846] Length = 396 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 51/391 (13%), Positives = 130/391 (33%), Gaps = 30/391 (7%) Query: 26 KVVPIKRFTKLNT-GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + P+K+ + G+ + + D G Y N + + + F Sbjct: 18 EWKPLKKVCNFISTGKLNANAMDE-----------NGIYPFFTCNEKPYKINNYA-FDME 65 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-SIDVTQRIEAICEG 143 IL G + + V+ D ++ + + + I + Sbjct: 66 AILISGNGSQVGHLNYFKGKFNAYQRTYVIGEFDNNTLVMYLYHYLNFKLRDYITINSKK 125 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 ++ + + +PIPPL+ Q+ I + + A T L +E + L +++ + Sbjct: 126 GSVPYITLPMLEKFEIPIPPLSVQIEIVKILDALTALTSELTSELTSELILRQKQYEYYR 185 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 ++++ ++ G EW L + + I Sbjct: 186 EKLLSEE-----ELGKVGFEWKTLGDVAKIQRGASPRPISQYITDDPNGIPWIKIGDTSL 240 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + +E + E E +I+ G+ V L+ + + G + ++ Sbjct: 241 DSKYIENTAQKITIEGAEKSRILKSGDFVMSNSMSYGRPYILKISGAIHDGWAS---ISN 297 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + ++S +L + + S + + + S +L + +K L + +P + Q +I + Sbjct: 298 FGNILNSDFLYYYLSSNTVQSYWNGKINSSSVSNLNSDIIKSLSIPIPTLNIQIEIAKTL 357 Query: 383 NVETARIDVL-------VEKIEQSIVLLKER 406 + + + +E+ ++ +E Sbjct: 358 DKFETLTNSITKGLPLAIEQSQKRYEYYREL 388 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 20/182 (10%), Positives = 56/182 (30%), Gaps = 7/182 (3%) Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + +K I + L+ + + KP Y ++ + Sbjct: 19 WKPLKKVCNFISTGKLNANAMDENGIYPFFTCNEKPYKINNYAFDMEAILI---SGNGSQ 75 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 L + + + + YL + + G + Sbjct: 76 VGHLNYFKGKFNAYQRTYVIGEFDNNTLVMYLYHYLNFKLRDYITINSKKGSVPYITLPM 135 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTG 417 +++ + +PP+ Q +I +++ TA L ++ ++L ++ R ++ G Sbjct: 136 LEKFEIPIPPLSVQIEIVKILDALTALTSELTSELTSELILRQKQYEYYREKLLSEEELG 195 Query: 418 QI 419 ++ Sbjct: 196 KV 197 >gi|94265772|ref|ZP_01289507.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] gi|93453707|gb|EAT04088.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] Length = 579 Score = 80.2 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 67/466 (14%), Positives = 134/466 (28%), Gaps = 91/466 (19%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVE-SGTGKYLPKDGNSRQS 73 +P W + ++ G + DI++ + D+ G + + N+ Sbjct: 101 LPAGWALTNLENIGYWAVGNGFPKKEQGLSNLDILFCKVSDMNLPGNHRKIVGTANTVSK 160 Query: 74 DTSTV---SIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 +T+ I G +++ K+G ++ +IA I + + + E L + Sbjct: 161 ETAQKLRLHIHPPGTVIFPKIGGAIATNKRRLIARPTAIDNNCLGITPSCGITSEYLLLF 220 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK-------------- 173 L +ID+ G ++ +G IP+ +PPLAEQ I EK Sbjct: 221 LTTIDMQ----RYQVGTSVPALSQSTLGKIPVHLPPLAEQHRIVEKVDELMALCDRLEQQ 276 Query: 174 ---------------------------IIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 + + R+ T + KQ ++ Sbjct: 277 TSDQLAAHETLVETLLDTLTRSADATELDSNWTRLQTHFDTLFTTESSIDHLKQTILQLA 336 Query: 207 VTKGLNPDVKMK-----------------------------DSGIEWVGLVPDHWEVKPF 237 V L P S + VP +W + Sbjct: 337 VMGRLVPQDPNDEPASTLLKKIAAEKARLVKEGKLKKPKPLPSVGDEPFSVPANWTWQSL 396 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLET-RNMGLKPESYETYQIVDPGEIVFRFI 296 L + I ++ E + G+ Sbjct: 397 GGLGYTQTGSTPSKSNKSYFGNFIPFIKPGDIIHGHVNYTHEGLSKEGRNNLGKWAGPSS 456 Query: 297 DLQNDKRSLRSAQVMERGI-ITSAYMAVKPHGIDSTYLAWL-MRSYDLCKV-FYAMGSGL 353 L ++ +++R A+ P+ D + + + S + S Sbjct: 457 ILMVCIGTIGKCALIDRDCTFNQQINAISPYLTDMSGYLMISLSSRYFQNEAWDRSSSTT 516 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 L + +PV +PP+ EQ I + A + + ++I Q+ Sbjct: 517 ISILNKGKWEDIPVPIPPLAEQHRIVEKTDELMALCNQIKDRINQA 562 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 64/193 (33%), Gaps = 11/193 (5%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY-- 283 G + E ++A+ +K L +IL ++ R + + Sbjct: 104 GWALTNLENIGYWAVGNGFPKKEQGLSNLDILFCKVSDMNLPGNHRKIVGTANTVSKETA 163 Query: 284 -----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 I PG ++F I + R I + GI S YL + Sbjct: 164 QKLRLHIHPPGTVIFPKIGGAI-ATNKRRLIARPTAIDNNCLGITPSCGITSEYLLLFLT 222 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + D+ + +L + ++PV +PP+ EQ I ++ A D L ++ Sbjct: 223 TIDMQRY---QVGTSVPALSQSTLGKIPVHLPPLAEQHRIVEKVDELMALCDRLEQQTSD 279 Query: 399 SIVLLKERRSSFI 411 + + + + Sbjct: 280 QLAAHETLVETLL 292 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 61/194 (31%), Gaps = 9/194 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P +W + TG T I +I D+ G Y + + + Sbjct: 387 VPANWTWQSLGGLGYTQTGSTPSKSNKSYFGNFIPFIKPGDIIHGHVNYTHEGLSKEGRN 446 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDV 133 IL +G + A+I D D + Q + P S Sbjct: 447 NLG-KWAGPSSILMVCIGTIGKCALI-DRDCTFNQQINAISPYLTDMSGYLMISLSSRYF 504 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 T+S + +IP+PIPPLAEQ I EK + + + + Sbjct: 505 QNEAWDRSSSTTISILNKGKWEDIPVPIPPLAEQHRIVEKTDELMALCNQIKDRINQADQ 564 Query: 194 LLKEKKQALVSYIV 207 + + + + + Sbjct: 565 IRQHLSETVAIQAL 578 >gi|307710634|ref|ZP_07647067.1| type I restriction modification DNA specificity domain protein [Streptococcus mitis SK564] gi|307618577|gb|EFN97720.1| type I restriction modification DNA specificity domain protein [Streptococcus mitis SK564] Length = 383 Score = 80.2 bits (196), Expect = 6e-13, Method: Composition-based stats. Identities = 45/395 (11%), Positives = 106/395 (26%), Gaps = 38/395 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + + +++ + T K + + Sbjct: 19 EWKQHKLGEVFEQTVEYVDPYEQNLELWSVTVESGLTPKEERYNREFLVKKSDKFKKLYP 78 Query: 84 GQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +I+Y + + + S ++ ++ K LS R+ I Sbjct: 79 EEIVYNPMNITIGAVGFNNAGKKVAVSGYYVTMKMKSKFSNKFFSAWLSCPKAIRLYKIY 138 Query: 142 EGAT---MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + + +I P EQ I + + + + Sbjct: 139 STGSLIERQRVQFPTLSDIKDYFPTFDEQSAIGSLFRTLDDLLSSY----KDNLTNYQAL 194 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 K ++S + K +++ +G E EV + R L +I Sbjct: 195 KATMLSKMFPKAGQTVPEIRLNGFEEDWERKSLSEVCTINSG-----RDYKHLKNGDIPV 249 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 G + + + +D ++ + + Sbjct: 250 YGTGGYMLSVNEKLSDEDAIGIGRKGTIDKPYLL-----------------SAPFWTVDT 292 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + G D YL + + K S SL + ++++ P +EQ I Sbjct: 293 LFYVICKVGYDLNYLFLIFQKIRWKKFDE---STGVPSLSKKTIEKVVSKFPSYEEQCAI 349 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ I L+ + + Sbjct: 350 GLY----FSDLDNLINYYQEKISQLETLKKKLLQD 380 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 53/181 (29%), Gaps = 18/181 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W+ + +N+GR + +++G G + + Sbjct: 220 EDWERKSLSEVCTINSGRDYK-----------HLKNGDIPVYGTGGYMLSVNE---KLSD 265 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 + I G+ G + +++ T F V+ +L I R + E Sbjct: 266 EDAIGIGRKGTIDKPYLLSAPFWTVDTLFYVICKVGYD----LNYLFLIFQKIRWKKFDE 321 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + K I + P EQ I I+ + + L K+ Q + Sbjct: 322 STGVPSLSKKTIEKVVSKFPSYEEQCAIGLYFSDLDNLINYYQEKISQLETLKKKLLQDM 381 Query: 203 V 203 Sbjct: 382 F 382 >gi|240047535|ref|YP_002960923.1| putative type-1 restriction enzyme MjaXP specificity protein [Mycoplasma conjunctivae HRC/581] gi|239985107|emb|CAT05100.1| Putative type-1 restriction enzyme MjaXP specificity protein [Mycoplasma conjunctivae] Length = 415 Score = 80.2 bits (196), Expect = 7e-13, Method: Composition-based stats. Identities = 55/404 (13%), Positives = 114/404 (28%), Gaps = 23/404 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK V I K+ TG+T ++ GK + +D + K K ++ Sbjct: 2 NEWKKVKISEIGKVVTGKTPKTSNSSFYGGKTPFFTPSDDWSTKYIKNTNKYLTEDGKNS 61 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 SI K I +G + A+ + ++ + + + Sbjct: 62 VKGSIIPKNAICVSCIGSIGKVAMTSSETVTNQQINSIIVNETKYDIDFIYYAMLELGKV 121 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + P L EQ I + +I+ + Sbjct: 122 LNLHSGSSTVVPIISKNTFSEYKLACPKLDEQKKISNVLSIIDKKIEINRQINDNLEKQT 181 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV------PDHWEVKPFFALVTELNRKNT 249 K + N K SG E V P W ++ K Sbjct: 182 KLLYDYWFTQFDFPDEN-GNPYKSSGGEMVFNEELKRYIPKGWSIETLANNTISRIIKPG 240 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP--GEIVFRFIDLQNDKRSLRS 307 I + +I K + + ++ E+ + P + F + + Sbjct: 241 VNIFKEKTYFATADINNKEISSGNKVLYQNRESRANMQPIKSSVWFAKMKNSVKHLFITD 300 Query: 308 AQVM--ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKR 364 GI+++ + ++ Y++ + S + G ++S+ D+ Sbjct: 301 NMDFMINEGILSTGFCGLECEKNSFEYISSFINSSYFEMAKDILSHGATQESVNNNDLNF 360 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +L+P + T I + + S + E R Sbjct: 361 INILIPD----RRTLLNFHKITKPIYEQITENICSNRKITELRD 400 >gi|34764189|ref|ZP_00145051.1| TYPE I RESTRICTION-MODIFICATION SYSTEM SPECIFICITY SUBUNIT [Fusobacterium nucleatum subsp. vincentii ATCC 49256] gi|27886037|gb|EAA23351.1| TYPE I RESTRICTION-MODIFICATION SYSTEM SPECIFICITY SUBUNIT [Fusobacterium nucleatum subsp. vincentii ATCC 49256] Length = 373 Score = 80.2 bits (196), Expect = 7e-13, Method: Composition-based stats. Identities = 46/379 (12%), Positives = 107/379 (28%), Gaps = 34/379 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKY--LPKDGNSRQSDT 75 P + + + K+I+ + D+ +PK + Sbjct: 13 PNGVEYKELGELGIFENIGVDKKINVNEKEILLLNYTDIYKNNYIDSSIPKMIVTANDKK 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIAD------FDGICSTQFLVL---QPKDVLPELLQG 126 + I A + S + P V + Sbjct: 73 IENCSVEECDIFITPTSETKEDIGHASVILETIPNCCYSYHIMRYRLINPNRVTASFIMY 132 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S D+ ++I +G T + N+ +P P + Q I + + T ++ L Sbjct: 133 LFYSQDLKRQILKYAQGLTRYGLSKEKFSNLLIPFPNIRIQEEIVKVLDDYTKSVEELKG 192 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + K++ Y++ N +K +G + + Sbjct: 193 KLNEELTARKKQYSWYRDYLLKFE-NKVETVK------LGSIGKVSMCRRIL-------- 237 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K+ I I G +K + K Y+ +V + + Sbjct: 238 KSETNIVGGIPFFKIGTFGKKEDAYISIEKFNEYKEKYSYPKKGMVLISTSGTIGRTIVF 297 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + ++ + + YL + ++ G + L E++++ Sbjct: 298 DGKPAYYQDSNIVWIDNNEEKVLNKYLYYFYQTSPWKIDM----GGTIERLYNENIEKTI 353 Query: 367 VLVPPIKEQFDITNVINVE 385 + +PP++ Q I V++ Sbjct: 354 IPLPPLEVQKRIVGVLDNF 372 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 56/167 (33%), Gaps = 10/167 (5%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVME 312 N + N I + + + V+ +I + + + Sbjct: 47 NYTDIYKNNYIDSSIPKMIVTANDKKIENCSVEECDIFITPTSETKEDIGHASVILETIP 106 Query: 313 RGIITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 + M P+ + ++++ +L S DL + G R L E L + Sbjct: 107 NCCYSYHIMRYRLINPNRVTASFIMYLFYSQDLKRQILKYAQGLTRYGLSKEKFSNLLIP 166 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 P I+ Q +I V++ T ++ L K+ + + K+ R + Sbjct: 167 FPNIRIQEEIVKVLDDYTKSVEELKGKLNEELTARKKQYSWYRDYLL 213 >gi|317506902|ref|ZP_07964674.1| hypothetical protein HMPREF9336_01045 [Segniliparus rugosus ATCC BAA-974] gi|316254830|gb|EFV14128.1| hypothetical protein HMPREF9336_01045 [Segniliparus rugosus ATCC BAA-974] Length = 330 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 55/352 (15%), Positives = 111/352 (31%), Gaps = 33/352 (9%) Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVT 134 ST F+ +L+GKL P L K +F+G+CST + ++P L +LL + Sbjct: 2 STKFRFSPEHVLFGKLRPNLGKISRPEFEGVCSTDIIPIRPGKHLDRNYLAHFLLQPSMI 61 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + GA + + +P+P L+EQ I + Sbjct: 62 DYAASRTSGANLPRLSPDLLAKFLIPLPSLSEQRRIAAILDQADALRSRRRQVLNHL--- 118 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 A ++ V K ++ V V +T Sbjct: 119 ------ATLTGSVFHDTFGGHTYKTLRLDEVAAVSSGITKGRKTNE-------STTPTPY 165 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 +S I+ + + + Y + D ++ D R + Sbjct: 166 LAVSNVQAGCIKLDLVKEIPATSAEIQRYALQDGDLVLTEGGDPDKLGRGTVWRSQLALC 225 Query: 315 IITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVP 370 + + V+P YL+ + S + F + S+ ++ PV +P Sbjct: 226 LHQNHVFKVRPDKHIVLPDYLSECLASSESRAYFLRSAKQTTGIASINMTQLRAAPVPMP 285 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLL----KERRSSFIAAAVTGQ 418 P+++Q + + + + ++ + E +S + A G+ Sbjct: 286 PMRDQLR---FLERKMS-----IASKHAALQHIMATHDELFASLQSRAFRGE 329 Score = 43.2 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 52/196 (26%), Gaps = 11/196 (5%) Query: 26 KVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 K + + +++G +T+ES Y+ + +V++G K S Sbjct: 136 KTLRLDEVAAVSSGITKGRKTNESTTPTPYLAVSNVQAGCIKLDLVKEIPATSAEIQRYA 195 Query: 81 FAKGQILYGKLGP---YLRKAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G ++ + G R + +C ++P + Sbjct: 196 LQDGDLVLTEGGDPDKLGRGTVWRSQLALCLHQNHVFKVRPDKHIVLPDYLSECLASSES 255 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R + + + + P+ + + I + + Sbjct: 256 RAYFLRSAKQTTGIASINMTQLRAAPVPMPPMRDQLRFLERKMS-IASKHAALQHIMATH 314 Query: 196 KEKKQALVSYIVTKGL 211 E +L S L Sbjct: 315 DELFASLQSRAFRGEL 330 >gi|294647359|ref|ZP_06724952.1| type I restriction modification DNA specificity domain protein [Bacteroides ovatus SD CC 2a] gi|294809020|ref|ZP_06767742.1| type I restriction modification DNA specificity domain protein [Bacteroides xylanisolvens SD CC 1b] gi|292637318|gb|EFF55743.1| type I restriction modification DNA specificity domain protein [Bacteroides ovatus SD CC 2a] gi|294443745|gb|EFG12490.1| type I restriction modification DNA specificity domain protein [Bacteroides xylanisolvens SD CC 1b] Length = 427 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 58/418 (13%), Positives = 126/418 (30%), Gaps = 31/418 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 +V+ I + + + +I I DVE+G +I K Sbjct: 12 QVLKINQIVRTISETHKFDKDKLIAINTSDVENGVMGNGTLTFVDELKGQFKKTIV-KDD 70 Query: 86 ILYGKLGPYLRKAII----ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV----TQRI 137 IL+ ++ P R+ D + ST+ +VL+ + +L + + + Sbjct: 71 ILFSEIRPANRRFAKVTTKNTKDYVVSTKLMVLRKYNEDVDLEYFYYCLTNQPFLDILQR 130 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 A + + + PIPP++EQ I I +I + K+ Sbjct: 131 RAENRIGSFPQITFDLLSEYAFPIPPISEQKRISSVISTLDKKIALNRQINQNLEAMAKQ 190 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGI--------EWVGLVPDHWEVKPFFALVTELNRKNT 249 N G +++ + + + + + K+ Sbjct: 191 LYDYWFVQFDFPNENGRPYKSFGGKMVWNEKQRKYIPEYWEVKSLSNWLEIKSGFPFKSE 250 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESY-----ETYQIVDPGEIVFRFIDLQNDKRS 304 + +Q E G + + Y + G+ + Sbjct: 251 TYKPIGRYKIITIKNVQDGELVTSGCDYVNDIPSRAKDYISLQIGDRLISLTGNCGRLCV 310 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVK 363 + E ++ + I Y + S + V + +G + +L ++ Sbjct: 311 VCE----ENLLLNQRVGLLCCDAIYLEYFYNFLNSGTMRTVIDNLANGAAQANLSPVELC 366 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + +PPI I N + I + + Q I L ++R + + GQ+ + Sbjct: 367 KTDCFIPPID----ILLSYNRKVNAIRKAIVQNNQEISQLAKQRDELLPLLMNGQVSV 420 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 64/193 (33%), Gaps = 6/193 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP------KDGNSRQSD 74 IP++W+V + + ++ +G +S + + + N S Sbjct: 226 IPEYWEVKSLSNWLEIKSGFPFKSETYKPIGRYKIITIKNVQDGELVTSGCDYVNDIPSR 285 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 G L G R ++ + + + + + +L + E +L S + Sbjct: 286 AKDYISLQIGDRLISLTGNCGRLCVVCEENLLLNQRVGLLCCDAIYLEYFYNFLNSGTMR 345 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I+ + GA ++ + IPP+ + K+ A I E + + Sbjct: 346 TVIDNLANGAAQANLSPVELCKTDCFIPPIDILLSYNRKVNAIRKAIVQNNQEISQLAKQ 405 Query: 195 LKEKKQALVSYIV 207 E L++ V Sbjct: 406 RDELLPLLMNGQV 418 >gi|221231341|ref|YP_002510493.1| type I restriction-modification system S protein [Streptococcus pneumoniae ATCC 700669] gi|220673801|emb|CAR68303.1| putative type I restriction-modification system S protein [Streptococcus pneumoniae ATCC 700669] Length = 426 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 61/424 (14%), Positives = 135/424 (31%), Gaps = 66/424 (15%) Query: 34 TKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ G + KD I +I + D E G ++S + KG Sbjct: 2 VEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGT 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGA 144 L + R I+ I + ++ L + ++LS + V + ++ GA Sbjct: 62 FLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGA 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----Q 200 + + + + +I +P+PPL+EQ I E I + ++D R +L KE + Sbjct: 122 VVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKK 181 Query: 201 ALVSYIVTKGLNPDVKMKDS-----------------------------------GIEWV 225 +++ Y + L +S + Sbjct: 182 SILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYY 241 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ- 284 G +P +W V + + + K + +I + II+ + + + Y Sbjct: 242 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINN-KGVRIIRGGNIKPLEFSLLDNDYYID 300 Query: 285 ---------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDST 331 + +++ G++ ++ + I S Sbjct: 301 TQFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISK 360 Query: 332 YLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +L + + S K + ++ + L + + P +EQ IT + + Sbjct: 361 FLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEK 420 Query: 389 IDVL 392 ++ L Sbjct: 421 VNQL 424 Score = 77.1 bits (188), Expect = 6e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 37 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 93 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 94 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIES 153 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 154 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 191 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 242 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 301 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 302 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 361 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 362 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 421 Query: 182 DTLI 185 + L Sbjct: 422 NQLW 425 >gi|317182159|dbj|BAJ59943.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F57] Length = 396 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 58/392 (14%), Positives = 116/392 (29%), Gaps = 31/392 (7%) Query: 22 PKHWKVVPIKRFTKLN-------TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + + + N TG+ + + ++ Y + N Q+ Sbjct: 13 PKGVEFRKLGEVLEYNQPNKYCVTGKEFDESYPTPVLTAG--KTFILGYTNEKDNIYQAS 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ I + + S+ +L PK+ + + Sbjct: 71 KSSPVIIF-DDF-------TTATQWVDFPFKVKSSAMKILLPKNPTINIRFIFFYM---Q 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I T I +PIPPL Q I + + A T L TE + Sbjct: 120 TIPYNISGEHTRQWISRYS--KITIPIPPLEIQQEIVKILDAFTELNTELNTELKARKKQ 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + L+ KG+N + KD+ I+ V + Sbjct: 178 YQYYQNMLLD---FKGINSNH--KDAKIKTYPKRLKTLLQTLAPKGVEFRKLGEVCDFQK 232 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 K+ + G +P Y I + S + Sbjct: 233 GKSITKKAVTFGKVPVISGGRQPAYYHNEANRSGETIAISSSGVY---AGYVSYWDIPVF 289 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + S ++ K + YL + + + +G + +D++ + +PP++ Sbjct: 290 LADSFSVSPKQKTLMPKYLFHYLTTQQ-DAIHATKSTGGIPHVYSKDLQNFLIPIPPLEI 348 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKER 406 Q +I +++ A L+ I I K++ Sbjct: 349 QQEIVKILDQFLALTTDLLAGIPAEIEARKKQ 380 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 59/181 (32%), Gaps = 18/181 (9%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P E + ++ + ++ +T +G E YQ Sbjct: 13 PKGVEFRKLGEVLEYNQPNKYCVTGKEFDESYPTPVLTAGKTFILGYTNEKDNIYQASKS 72 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFY 347 ++ + + + + ++ + + + I+ ++ + M++ Sbjct: 73 SPVII----FDDFTTATQWVDFPFKVKSSAMKILLPKNPTINIRFIFFYMQTIPY----N 124 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 G RQ + ++ + +PP++ Q +I +++ T E + LK R+ Sbjct: 125 ISGEHTRQWISR--YSKITIPIPPLEIQQEIVKILDAFT-------ELNTELNTELKARK 175 Query: 408 S 408 Sbjct: 176 K 176 >gi|317179714|dbj|BAJ57502.1| Type I R-M system specificity subunit [Helicobacter pylori F30] Length = 197 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 58/197 (29%), Gaps = 17/197 (8%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G + K T + ++GN ++ + L+ + Y Sbjct: 15 LGDIGKPCMCKRVMKHQTTRYGEIPFYKIG-----TFGNTADAFISKKLFLEYRT--KYS 67 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 G+I+ + + + + +L +Y K Sbjct: 68 FPKKGDILISASGT----IGKAVIYDGKPAYFQDSNIVWIDNDETLVKNDFLFYAYSNVK 123 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 L ++ + + +PP+ EQ I NV++ I L K Q + Sbjct: 124 W--NTEHTTILRLYNDNFRNTLIPLPPLNEQSAIANVLSALDNEIISLKNKKRQ----FE 177 Query: 405 ERRSSFIAAAVTGQIDL 421 + + ++ +I + Sbjct: 178 NIKKALNHDLMSAKIRV 194 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 59/189 (31%), Gaps = 10/189 (5%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 6 LPLNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYR 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G + I +V E L Sbjct: 64 TKYSFPKKGDILISASGTIGKAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYS 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ I + A I +L ++ +F + Sbjct: 121 NVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQSAIANVLSALDNEIISLKNKKRQFENIK 180 Query: 196 KEKKQALVS 204 K L+S Sbjct: 181 KALNHDLMS 189 >gi|256841221|ref|ZP_05546728.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|256737064|gb|EEU50391.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 388 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 54/405 (13%), Positives = 120/405 (29%), Gaps = 53/405 (13%) Query: 26 KVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 ++ + + +N R + + +I + V G + + F Sbjct: 13 ELKRLGQCCIINPRRPNIALCDTDKVSFIPMPAVSED-GYLVDMADEEYGKVKKGFTYFE 71 Query: 83 KGQILYGKLGPYLRK------AIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVT 134 +L+ K+ P + + + G+ ST+F VL+P + + P L Sbjct: 72 NNDVLFAKITPCMENGKGAIAYGLTNGIGVGSTEFHVLRPINGISSPYWLLTLTRMPIFR 131 Query: 135 QRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID-----TLITER 188 +R G + + + +P + EQ I Sbjct: 132 ERAAKNMSGTGGQKRVSASYLNHFMVGLPAIEEQRRFEAIYRQADKSKFGDFKSQFIEMF 191 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + P++WE + + K Sbjct: 192 GTVENNTHNFPIMTIGEFANCFAGATPSTSH---------PEYWENGRIRWMSSGEVHK- 241 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 ++ ++R L +S T + ++ I Q R + Sbjct: 242 --------------GHVEDTDSRITELGYKSASTRMVPIHSIVI--AIAGQGKTRGTVAI 285 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 ++ S V ++ +YL ++ L + R L + ++++PV+ Sbjct: 286 TEVDLCTNQSLCAIVPDERVNYSYLYHNLQGRYLELRGLSGDVNGRGGLNLKIIQKIPVI 345 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK-----ERRS 408 +PPI++Q ++ + D I++++V L E R Sbjct: 346 LPPIEKQQQFASI----AQQADKSKSVIQKALVYLNDIQSDELRK 386 >gi|261392482|emb|CAX50031.1| putative type I restriction-modification system S protein [Neisseria meningitidis 8013] Length = 385 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 61/378 (16%), Positives = 118/378 (31%), Gaps = 26/378 (6%) Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST 109 YI +++ + S V+ F KG IL + PYL+K A FDG CS Sbjct: 26 YISTDNILQNKQGI--ECAASLPIQGGKVTAFKKGDILLANIRPYLKKIWYAQFDGGCSA 83 Query: 110 QFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 L ++ + + D +G M D I +P+ Q Sbjct: 84 DVLAIRANAKIDSHFLFYALFRDDFFIHAMKGSKGTKMPRGDKTQIMEFKIPVFAPQTQQ 143 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWV 225 I + +D I + L+E + L Y + PD K SG E V Sbjct: 144 SITTVL----SALDKKIALNKQINARLEEMAKTLYDYWFVQFDFPDANGKPYKSSGGEMV 199 Query: 226 GLVPDHWEVKPFFALV--TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 E+ + V K ++ + + ++ + + + Sbjct: 200 FDETLKREIPKGWESVELQSCLAKVPSTVKISNKDIKDFGKYPVIDQSQDFICGFTDDEK 259 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 I++P + F D R ++ + + + YL + + Sbjct: 260 SILNPQDAHIIFGD---HTRIVKLVNFKYARGADGTQVILSNNERMPNYLFYQI-----I 311 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 G + + +K P+++P ++N +I + + L Sbjct: 312 NQIDLSSYGYARHF--KFLKEFPIILPDKDISRKYYEIVNYFFIKIRNNI----KQNHHL 365 Query: 404 KERRSSFIAAAVTGQIDL 421 + R + + GQ+ + Sbjct: 366 TQLRDFLLPMLMNGQVSV 383 >gi|75909705|ref|YP_324001.1| restriction modification system DNA specificity subunit [Anabaena variabilis ATCC 29413] gi|75703430|gb|ABA23106.1| Restriction modification system DNA specificity domain protein [Anabaena variabilis ATCC 29413] Length = 405 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 51/413 (12%), Positives = 112/413 (27%), Gaps = 48/413 (11%) Query: 30 IKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAKG 84 + + G + I + + ++ GT + SD + Sbjct: 8 LADTCEFINGGAWSDKEYVEAGIPVVKVTNMVDGTIETNNLSYLPLSSSDKYKKHLLFVN 67 Query: 85 QILYGKLGP--------YLRKAIIADFD--GICSTQFLVLQPKDVL---PELLQGWLLSI 131 ++ +G R +++ + + ++ K P+ L +I Sbjct: 68 DLVVTTVGSHPTQPGSVVGRTSVVPQHFDGAFLNQNAVCIRVKCKNLISPKFLIYISKTI 127 Query: 132 DVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 IE+ G+ + PPL Q I + A I+ Sbjct: 128 LFKHHIESRARGSANQVRMALGELKKFTFKFPPLPVQKKIAAILSAYDDLIENNNRRIAI 187 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++ +E + + G K P+ WE K F Sbjct: 188 LEKMAEEIYREWFVRLRFPGHEQVKFNKGI--------PESWERKRFDEFCLLQRG---- 235 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 L +I ++Y V+P I + Sbjct: 236 ------YDLPDTQVIPGQYPVIASTSIKTYHNQFKVNPPVITTGRSGSL----GIILFIN 285 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + + + +G + + ++ K+ +L + L + VP Sbjct: 286 SQAWPLNTTLFVKNFYGNSPYLIYYTLK---FLKLENFNSGAGVPTLNRNHLGGLYMSVP 342 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 P Q + + I + + E + +S L E R + ++G++ + Sbjct: 343 PKSLQNNFNDKIAILFKQ----KELLSKSKNALIEIRDRLLTRLISGKLSVED 391 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 57/184 (30%), Gaps = 16/184 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP+ W+ F L G + I G+Y P ++ Sbjct: 217 IPESWERKRFDEFCLLQRGYDLPDTQVIP-----------GQY-PVIASTSIKTYHNQFK 264 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 I G+ G I +T V P L+ L + + E Sbjct: 265 VNPPVITTGRSGSLGIILFINSQAWPLNTTLFVKNFYGNSPYLIYYTLKFLKL----ENF 320 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 GA + + +G + M +PP + Q +KI + + L + IE+ Sbjct: 321 NSGAGVPTLNRNHLGGLYMSVPPKSLQNNFNDKIAILFKQKELLSKSKNALIEIRDRLLT 380 Query: 201 ALVS 204 L+S Sbjct: 381 RLIS 384 >gi|256810495|ref|YP_003127864.1| N-6 DNA methylase [Methanocaldococcus fervens AG86] gi|256793695|gb|ACV24364.1| N-6 DNA methylase [Methanocaldococcus fervens AG86] Length = 1068 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 43/423 (10%), Positives = 123/423 (29%), Gaps = 47/423 (11%) Query: 27 VVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS--- 76 + + + + G I I +++++ + + Sbjct: 640 ITTLGKIAHVFDGPFGSELKNEEYVDSGIPLIRVQNIKDNRLVLTRDNTVYISVEKHQKL 699 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF----LVLQPKDVLPELLQGWLLSID 132 S G ++ K G A++ + + + + ++ +++ PE L ++ S Sbjct: 700 KRSEVLPGDVVVTKTGWLGNAAVVPEEVKKANIRADIAGIRIKSEEISPEYLAIYISSNI 759 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI------------------ 174 + + G+T + + + + +PP Q I + + Sbjct: 760 GKKLCYRLSSGSTRDRIIIENLRKLKIIVPPKDIQEKIVQIMENAYKLKKQKEKEAEELL 819 Query: 175 ----IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + I E + + + + + N + G Sbjct: 820 NSIDDYVLKELGIEIPEIEESKIFIVDFNDIIKNKRLDAEFNQEKYKILMDAVEKGKYKT 879 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGN------IIQKLETRNMGLKPESYETYQ 284 K F + + + + I + + + + + + + Sbjct: 880 VEVGKVFKYIKKGIEVGSNAYTKEGIPFIRVSDIDDYKIHFENADKKINPKLYKELKDKY 939 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 G++++ + II+ + +K + Y ++ S L K Sbjct: 940 KPQVGDLLYSKDGTIGF---CVMVEEDRDFIISGGILRLKVKDNINPYYIKVILSTKLLK 996 Query: 345 VF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + L+ + K+L + +PP + Q I + + L ++ ++ I Sbjct: 997 TLAEQRSIGAVIKHLREVEFKKLKIPLPPKEIQDKIAEEVKRRIKKAQQLKKESKKVIEE 1056 Query: 403 LKE 405 K+ Sbjct: 1057 AKK 1059 >gi|95929208|ref|ZP_01311952.1| restriction modification system DNA specificity domain [Desulfuromonas acetoxidans DSM 684] gi|95134706|gb|EAT16361.1| restriction modification system DNA specificity domain [Desulfuromonas acetoxidans DSM 684] Length = 474 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 67/468 (14%), Positives = 142/468 (30%), Gaps = 86/468 (18%) Query: 24 HWKVVPIKRFT-----KLNTGRTSESGKDIIYI-----GLEDVESGTGKYLPKDGNSRQS 73 W+V + + TG Y+ + + G + + Sbjct: 12 SWEVATLGDVCRRGGGDVQTGPFGSQLHAADYVPVGIPSIMPMNIGDNRISEEGIARITP 71 Query: 74 DTS---TVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQP---KDVLPELLQ 125 + + + + G I+Y + G R+A++ + + +C T L ++ V P Sbjct: 72 EDARRLSKYLVRTGDIVYSRRGDVERRALVREPEDGWLCGTGCLRVRFGEKSVVHPPYAA 131 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L V + I +GATM + + + +P +P + EQ + + A +I+ Sbjct: 132 YYLGHPSVREWIVRHAQGATMPNLNTSILSALPFVLPSIEEQEQVASVLTALDDKIELNR 191 Query: 186 TERIRFIELLKEKKQALV---------SYIVTKGLNPDVKMKD--SG------------- 221 ++ + ++ G +P+ SG Sbjct: 192 QINQTLEQIAQTIFKSWFIDFEPVKAKIEAKAAGRDPERAAMCAISGKLEPELDQLPPEQ 251 Query: 222 ----------------IEWVGLVPDHWEVKPFFALVTELNR----KNTKLIESNILSLSY 261 +GL+P WEVK + LN K E++ L + Sbjct: 252 YQQLAATAALFPDALVESELGLIPVGWEVKSLDQVANYLNGLALQKFPPESETDWLPVIK 311 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 ++K +T + IVD G+++F + +G + Sbjct: 312 IAQLKKGDTEGADRASSKLKPVYIVDDGDVLFSWSGSLT-----VDIWTGGQGALNQHLF 366 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITN 380 V + + + A ++ + + + P + Sbjct: 367 KVTSVNYPKWFYLHWTKFHLARFQNIAADKAVTMGHIQRKHLTEALCVAPEK-------S 419 Query: 381 VINVETARIDVLVEKIEQSIVL------LKERRSSFIAAAVTGQ--ID 420 I+ + L+ Q I L L R + + ++G+ ID Sbjct: 420 GIDSFDSLFSSLLA---QEIELRIVSRSLSFLRDTLLPKLLSGELCID 464 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 41/138 (29%), Gaps = 11/138 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTG----RTSESGKD--IIYIGLEDVESGTGKYLPKDGNSR 71 +G IP W+V + + G + + + I + ++ G + + Sbjct: 271 LGLIPVGWEVKSLDQVANYLNGLALQKFPPESETDWLPVIKIAQLKKGD----TEGADRA 326 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S V I G +L+ G L I G + + + W Sbjct: 327 SSKLKPVYIVDDGDVLFSWSGS-LTVDIWTGGQGALNQHLFKVTSVNYPKWFYLHWTKFH 385 Query: 132 DVTQRIEAICEGATMSHA 149 + A + TM H Sbjct: 386 LARFQNIAADKAVTMGHI 403 >gi|91205219|ref|YP_537574.1| restriction endonuclease S subunits [Rickettsia bellii RML369-C] gi|157827443|ref|YP_001496507.1| restriction endonuclease S subunits [Rickettsia bellii OSU 85-389] gi|91068763|gb|ABE04485.1| Restriction endonuclease S subunits [Rickettsia bellii RML369-C] gi|157802747|gb|ABV79470.1| Restriction endonuclease S subunits [Rickettsia bellii OSU 85-389] Length = 245 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 45/269 (16%), Positives = 93/269 (34%), Gaps = 32/269 (11%) Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 MP+PPL EQ I + I + I L ++ L ++ LNP Sbjct: 1 MPLPPLPEQQKIANILRVWDKA----IEKVSTLISLNEKFFNNLAKKLLKNCLNPQYLAW 56 Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 +G + + + I++K + + Sbjct: 57 CP--VTLGEIFTERRETTLNKMELLSITGSE-------------GIVKKDSLKKRDTSNK 101 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWL 336 Y ++ PG++ + + + + S GI++ AY P+ I++ ++ +L Sbjct: 102 DKSKYLLIYPGDLGYNTMRMWQGVCGISSLS----GIVSPAYTICIPNSSAINTQFIYFL 157 Query: 337 MRSYDLCKVFYAMGSGLRQ---SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + FY GL LKF + + +P I+ Q N++ + Sbjct: 158 FKLPKMINEFYRYSQGLVDDTLGLKFSYFAEIKINIPTIEYQNQTANIL----LNYKNQI 213 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 K + L+ ++ I +TG+ ++ Sbjct: 214 SKYKNYKKALQSQKQGLIQKLLTGEWRVK 242 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 49/184 (26%), Gaps = 7/184 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W V + T + + G E + K ++ D S + G Sbjct: 56 WCPVTLGEIFTERRETTLNKMELLSITGSEGIVKKDS---LKKRDTSNKDKSKYLLIYPG 112 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + Y + + I+ GI S + + P + L E Sbjct: 113 DLGYNTMRMWQGVCGISSLSGIVSPAYTICIPNSSAINTQFIYFLFKLPKMINEFYRYSQ 172 Query: 145 TMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + I + IP + Q ++ +I + + Q Sbjct: 173 GLVDDTLGLKFSYFAEIKINIPTIEYQNQTANILLNYKNQISKYKNYKKALQSQKQGLIQ 232 Query: 201 ALVS 204 L++ Sbjct: 233 KLLT 236 >gi|322691669|ref|YP_004221239.1| hypothetical protein BLLJ_1480 [Bifidobacterium longum subsp. longum JCM 1217] gi|320456525|dbj|BAJ67147.1| conserved hypothetical protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 363 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 40/404 (9%), Positives = 98/404 (24%), Gaps = 64/404 (15%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + T L G + G + ST + Sbjct: 2 SWRETTLGEITDLKRGFDLPKS-----------QRLQGDVPVYSSSGITGSNSTAA-VEG 49 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ G+ G +T + P + L +I Sbjct: 50 PCVITGRYGTIGEVFFSGGPCWPLNTALYSTEFNGNNPRFIYYLLQTIPWQ----GYTTA 105 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + + P+ IP A Q I E + + +I L Sbjct: 106 SAVPGVNRNHVNLCPVKIPDRATQDAIVEVLDSIVDKIALNNRLNDYLANLC-------- 157 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 E + + + ++ + +S + Sbjct: 158 -------------------ETIASRYCNDRNSRLRDICYQVADHVDYDNANQETYVSTES 198 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 ++Q R + + G+ + I K + G + + Sbjct: 199 LMQNKGGRQLASSLPATGKITRYKAGDTLISNIRPYFKKIWYAPFE----GTCSGDVIVF 254 Query: 324 KPHGIDSTYLAW-LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + + + +R G + + V + Sbjct: 255 RANDPSNAPYLHACLRQDSFFDYVMQGAKGTKMPRGDKKQMMEFKV-----------ASS 303 Query: 382 INVET-ARIDVLVEK---IEQSIVLLKERRSSFIAAAVTGQIDL 421 + + +D +++ + V L+ R + + ++G+ID+ Sbjct: 304 CSTKDLILLDSAIKQRSDNDSETVKLQALRDTLLPKLMSGEIDV 347 >gi|255601534|ref|XP_002537701.1| conserved hypothetical protein [Ricinus communis] gi|223515425|gb|EEF24682.1| conserved hypothetical protein [Ricinus communis] Length = 265 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 45/278 (16%), Positives = 101/278 (36%), Gaps = 29/278 (10%) Query: 157 IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 + + +PP+ EQ I + + D I R +++K+AL++ ++ Sbjct: 1 MKIALPPVQEQRRIADIL----STWDQAIIVTERLCANSQQRKRALMTSLL--------- 47 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 SG D W F ++ + + RKN+ + + +I + + N + Sbjct: 48 ---SGRRRFPSFEDKWRYVDFDSIFSRVLRKNSSNNNNVLTISGEHGLISQRDYFNKSVA 104 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDK-RSLRSAQVMERGIITSAYMAVKPHGI---DSTY 332 + Y + + + +++ E GI++S Y+ + D + Sbjct: 105 GANLTGYTFLQRFDFAYNKSYSSGYPLGAIKPLLAYETGIVSSLYLCFRLREDVDADFDF 164 Query: 333 LAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + + + G R ++ D +L + +P +EQ I VINV A Sbjct: 165 FRHYFEAGFMNQEIEGIAQEGARNHGLLNVSVNDFFKLRLHIPSAQEQRRIAEVINVAEA 224 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 ++ E + L + + + +TG+ + Sbjct: 225 E----QKRHEAQLQSLCLEKLALMQQLLTGKRCVSPPE 258 >gi|303255270|ref|ZP_07341341.1| putative type I RM modification enzyme [Streptococcus pneumoniae BS455] gi|301801717|emb|CBW34423.1| putative type I RM modification enzyme [Streptococcus pneumoniae INV200] gi|302597739|gb|EFL64814.1| putative type I RM modification enzyme [Streptococcus pneumoniae BS455] Length = 368 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 45/366 (12%), Positives = 97/366 (26%), Gaps = 24/366 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M H K NI +P L EQ I ++ + I + Sbjct: 120 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNL------------ 167 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 L + G + D+ + + E L L+ N+ Sbjct: 168 -----LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKNVT 222 Query: 266 QKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + + + + ++ +IV + + I S + Sbjct: 223 KNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSGMV 282 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 ++P + +++ + + L +K++ + +PP+ Q + + Sbjct: 283 ILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFADF 341 Query: 382 INVETA 387 + Sbjct: 342 VAQVDK 347 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 42/142 (29%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 >gi|224023387|ref|ZP_03641753.1| hypothetical protein BACCOPRO_00080 [Bacteroides coprophilus DSM 18228] gi|224016609|gb|EEF74621.1| hypothetical protein BACCOPRO_00080 [Bacteroides coprophilus DSM 18228] Length = 404 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 50/410 (12%), Positives = 117/410 (28%), Gaps = 29/410 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K + + + + ++ G Y P G D IF Sbjct: 5 KTYKLGDIVNVLDYKRIP-------LSSTQRQNKKGIY-PYYGAQGIIDYIDDYIFDGEY 56 Query: 86 ILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L + G L+ A IA + +++ + ++ + + Sbjct: 57 LLIAEDGENLKSQKQHVAQIATGKYWVNNHAHIVESNGLCD---IRYVCYLLNRMDLSGY 113 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ + + I + +P L EQ I E + +I+ + + + Sbjct: 114 ITGSAQPKLNQANLLKIEIKLPSLKEQYKIAEFLHLFDGKIELNRRINENLEQQAQALFK 173 Query: 201 ALVSYIVTKGLNPD------VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + K+ I ++G +P E + E+ Sbjct: 174 SWFVDFEPFKNGKFVDSELGMIPKELNIRYIGDIPHTIECGRRPKGGATDKGVPSIGAEN 233 Query: 255 NILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 Y K + L Y+++ + + N E Sbjct: 234 IKGLGIYDYSKTKYIPKEFALTTNRGKINGYELLIYKDGGKPGYFIPNYTIFGDGFPFDE 293 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPP 371 I + + + + + M++ + ++G + +DV+ LP+ Sbjct: 294 MFINEHVFKLNLLNKEYNIFAYFYMQTPFIMNQLNSIGGKAAIPGINTKDVESLPIF--- 350 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 E I N+ I + K + L + R + ++G++ + Sbjct: 351 SYENNKIKEFGNIVLPMIKR-ILKNCRENARLAQLRDILLPKLMSGELKI 399 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 63/180 (35%), Gaps = 11/180 (6%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + + +V L+ K L + + + + ++ ++ + Sbjct: 3 NIKTYKLGDIVNVLDYKRIPLSSTQRQNKKGIYPYYGAQGIIDYIDDYIFDGEYLLIAED 62 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 +L++ K+ + + + A++ D Y+ +L+ DL Sbjct: 63 ----GENLKSQKQHVAQIATGKYWVNNHAHIVESNGLCDIRYVCYLLNRMDLSGYI---T 115 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + L ++ ++ + +P +KEQ+ I ++ D +E + L+++ + Sbjct: 116 GSAQPKLNQANLLKIEIKLPSLKEQYKIAEFLH----LFDGKIELNRRINENLEQQAQAL 171 >gi|319951809|ref|YP_004163076.1| restriction modification system DNA specificity domain protein [Cellulophaga algicola DSM 14237] gi|319420469|gb|ADV47578.1| restriction modification system DNA specificity domain protein [Cellulophaga algicola DSM 14237] Length = 507 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 54/369 (14%), Positives = 123/369 (33%), Gaps = 17/369 (4%) Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 S G+Y + + + + F + +++G G + ++ + + K Sbjct: 24 STKGEYPFLTSSQKITKRTDSPQFFEECLVFGNGGS--ANIHYLNEPFATTSHCYIAERK 81 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 D + + +E +GA + + K I + +PI P+ Q I + Sbjct: 82 DKKVNIRFVYYYLSGNLHILERGFKGAGLKNISSKYIATLDIPILPIENQNKIVALLDKA 141 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 + + + EL L + + +P + K +E + ++ PF Sbjct: 142 SALVQKREKSIAQLDEL-------LRAQFLDMFGDPVMANKHDSVE-LKFYLKKIQIGPF 193 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + + + + N +++ I E K S Y + + G+I+ Sbjct: 194 GSQLHRKDYIKGGIPLVNPVNIIDNKIFPDNEITLTEEKYNSLPNYHLTE-GDIIMARRG 252 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQS 356 + + S Y+ + S +L + ++ + G ++ Sbjct: 253 EMGRCGLITEIENNWFCGTGSLYLRP-KNIEHSVFLLLALTEFNTIEYLNRNAKGVTMKN 311 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 L + +P++ E I N I E + QS L++ +S + A Sbjct: 312 LNKTIISNIPII---KCEDHLILEF-NSIYYSIQAQKETLIQSRTELEDLLNSLLQEAFK 367 Query: 417 GQIDLRGES 425 G+I++ E Sbjct: 368 GKIEVSKEE 376 >gi|26554275|ref|NP_758209.1| type I restriction-modification system S subunit [Mycoplasma penetrans HF-2] gi|26454284|dbj|BAC44613.1| type I restriction-modification system S subunit [Mycoplasma penetrans HF-2] Length = 519 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 61/439 (13%), Positives = 123/439 (28%), Gaps = 69/439 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP +W V +K + LN G ES I + + D + S Sbjct: 83 EIPNNWTWVRLKNISNLNGGYAFESNLFLSHGIRVVRISDFDDKGILENEIKRTKYFSRL 142 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICS--TQFLVLQPKDVLPELLQGWLLSIDV 133 I IL G + K I ++ S Q + +L +L+ Sbjct: 143 DPYKI-ELNDILMCMTGGTVGKNCIIEYINEDSYINQRIAKITSIILNSKFLHHVLNSSY 201 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I + +T + I +P PP+ Q I I + I+ + + Sbjct: 202 IISIINNSKTSTNDNISMDLIKEFLIPCPPIFTQNKIVNFIGQISSFIEKYSELKNKLQT 261 Query: 194 LLKEKKQALVSYIV---------------------------------------------- 207 L ++ K +L + I Sbjct: 262 LDQKFKLSLKNSIFKYAIEGKLVKQNLNDEPASELVKKIYEEKQKLISEGKIKKDKNESY 321 Query: 208 -----TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + I+ +P W + + K K + + + Sbjct: 322 IFKDNNCYYEKVSNFEPKKIDVPFGIPKTWHWIKLSNICELILGKTPKRSINTNWNSNDI 381 Query: 263 NIIQKLETRNMGLKPESYE---------TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 N + + +++G + E + + E + L + S+ + Sbjct: 382 NWVTISDMKDLGKIFSTKEYITNEAFKNEFTRISKKESLLMSFKLTIGRTSILEIDAVHN 441 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 I + +L + + ++ + G ++ E + + V +PPI Sbjct: 442 EAIVTINPYYDKDYAIRDFLFYTLGTFVSFIEKTSAIKGS--TINKEKMINMLVSLPPIN 499 Query: 374 EQFDITNVINVETARIDVL 392 EQ I I+ + I+ + Sbjct: 500 EQRRIIKSISKIHSLINSI 518 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 65/194 (33%), Gaps = 6/194 (3%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 IE +P++W + ++ + I + + K N + + Sbjct: 78 IEVPFEIPNNWTWVRLKNISNLNGGYAFESNLFLSHGIRVVRISDFDDKGILENEIKRTK 137 Query: 279 SYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + ++ +I+ K + + E I + ++S +L + Sbjct: 138 YFSRLDPYKIELNDILMCMTGGTVGKNCIIEY-INEDSYINQRIAKITSIILNSKFLHHV 196 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + S + + + ++ + +K + PPI Q I N I ++ I+ E Sbjct: 197 LNSSYIISIINNSKTSTNDNISMDLIKEFLIPCPPIFTQNKIVNFIGQISSFIEKYSELK 256 Query: 397 EQSIVLLKERRSSF 410 + L ++ + S Sbjct: 257 NKLQTLDQKFKLSL 270 >gi|260660507|ref|ZP_05861422.1| restriction modification system DNA specificity subunit [Lactobacillus jensenii 115-3-CHN] gi|260548229|gb|EEX24204.1| restriction modification system DNA specificity subunit [Lactobacillus jensenii 115-3-CHN] Length = 388 Score = 79.8 bits (195), Expect = 8e-13, Method: Composition-based stats. Identities = 47/398 (11%), Positives = 115/398 (28%), Gaps = 31/398 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQ--SDTSTV 78 WK V + + + G I +++E+GT + S++ + + Sbjct: 14 WKKVKLGQIADVRDGTHESPKYVSQNGYPLITSKNLENGTINFDDISYISKKDYEEINKR 73 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 S+ K IL+G +G AI+ L+ ++ L + S + Sbjct: 74 SLVEKNDILFGMIGTIGNVAIVKKSGFAIKNVALIKSNSEIPSINLIQIIQSDIFKKYTN 133 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G + I + +E +LI + + + +L K Sbjct: 134 RLNSGNSQKFISLGDIRKFDFKMASKSENMLISKLFKKVDTLLSLQQRKLELEKQLKKFC 193 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q ++S P+++ D W + ++++ TK Sbjct: 194 LQNILSD---NKKCPNLRFHDFSTNWKKVKVGDIFTVTRGKVLSKDKISKTKDHIMKYPV 250 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 S + L E T+ R + ++ + + G + Sbjct: 251 YSSQTLNNGLLGYYHDYLFEDAITWTTDGANAGTVRLRAGKFYGTNVNGVLLSKNGYVND 310 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV-PPIKEQFD 377 A ++ + + L ++ + + P ++EQ Sbjct: 311 A----NAEALNQIAWKY-------------VSKVGNPKLMNNVMQNIMFSIAPSVEEQV- 352 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +I+ ++ + +I + + + + Sbjct: 353 ---IISKLFILHSKSLKIYQANINVYTQLKQFLLQNLF 387 >gi|319778989|ref|YP_004129902.1| Type I restriction-modification system, specificity subunit S [Taylorella equigenitalis MCE9] gi|317109013|gb|ADU91759.1| Type I restriction-modification system, specificity subunit S [Taylorella equigenitalis MCE9] Length = 387 Score = 79.8 bits (195), Expect = 9e-13, Method: Composition-based stats. Identities = 51/406 (12%), Positives = 128/406 (31%), Gaps = 45/406 (11%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + ++ G+ + V S G +P G +T +++ K +L G Sbjct: 8 LSELAEIKYGKNQKK-----------VLSEDGN-IPIYGTGGLFGYATTALYDKPSVLIG 55 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + G + + T F + D++ +++S+ EG T+ Sbjct: 56 RKGTIRKVKYVEHPFWTVDTLFYTIINTDIVIPKYLYYVMSL---IDFNNYDEGTTIPSL 112 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + + + IP EQ ++ + +++ + +K A +S Sbjct: 113 RTETLNRLEFDIPSKEEQEIVLSCLNPIDEKVELNNAINNNLEQQIKTICTAWLSA---- 168 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 + + + V + K T+L ESNI + N +K Sbjct: 169 --------CAPSSDVILEGWSKISLSSIADFVGGYSYKRTELTESNIAMATIKNFDRKGG 220 Query: 270 TRNMGLK----PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM---- 321 + G K + Q + + + DL + + +A+++ S + Sbjct: 221 FKLDGYKEIVPSNKLKDSQYAELFDTLVAHTDLTQNAEIIGNAELVMNTNGYSDIVFSMD 280 Query: 322 ----AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP-PIKEQ 375 + +A +++ +G L + + + +P + Sbjct: 281 LVKVVPNKKHVSKFLIAAILQDKKFKAHCLGYVNGTTVLHLSKKALPEYQLYLPADLS-- 338 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + ++ + + + L+ R + + ++G+ID+ Sbjct: 339 --VLKPLDELVTALYQRISANIEETTKLETLRDTLLPKLMSGEIDV 382 >gi|218247750|ref|YP_002373121.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 8801] gi|218168228|gb|ACK66965.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8801] Length = 238 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 27/216 (12%), Positives = 79/216 (36%), Gaps = 11/216 (5%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNM 273 + KDS + + + + + + I L N++ ++ ++ Sbjct: 21 HQFKDSVLGRIPVEWEVKLLDKLLIEKRYGISTSLSEDPKGIPVLRMNNLVDGEVDFTDI 80 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV---KPHGIDS 330 + ++ G+++F + + + + + ++Y+ +D Sbjct: 81 KYSERNDAKKLTLNKGDVLFNRTNSVDYVGRTAIYRDSNKVVSFASYLVRLVTDNAMLDP 140 Query: 331 TYLAWLMRSYDLCKVFYAMGS-GLRQ-SLKFEDVKRLPVLVPP-IKEQFDITNVINVETA 387 YL + + + + G++Q ++ ++ RL + +P I EQ I I+ T Sbjct: 141 EYLNLWLNDKNNQIRVKQLATIGVQQANVNPTNLGRLLLAIPKKITEQKKIVKKISSCTN 200 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + K + ++ L+ + + +TG++ + Sbjct: 201 FL----HKTQTNLTKLRSIKIGLMQDLLTGKVRVTE 232 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 83/213 (38%), Gaps = 18/213 (8%) Query: 9 QYKDSGVQWIGAIPKHWKVVPI-KRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYL 64 Q+KDS +G IP W+V + K + G ++ +D I + + ++ G + Sbjct: 22 QFKDS---VLGRIPVEWEVKLLDKLLIEKRYGISTSLSEDPKGIPVLRMNNLVDGEVDFT 78 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFL----VLQPK 117 + R + KG +L+ + R AI D + + S V Sbjct: 79 DIKYSERND--AKKLTLNKGDVLFNRTNSVDYVGRTAIYRDSNKVVSFASYLVRLVTDNA 136 Query: 118 DVLPELLQGWLLSIDVTQRIEAICE-GATMSHADWKGIGNIPMPIP-PLAEQVLIREKII 175 + PE L WL + R++ + G ++ + +G + + IP + EQ I +KI Sbjct: 137 MLDPEYLNLWLNDKNNQIRVKQLATIGVQQANVNPTNLGRLLLAIPKKITEQKKIVKKIS 196 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + T + T + + Q L++ V Sbjct: 197 SCTNFLHKTQTNLTKLRSIKIGLMQDLLTGKVR 229 >gi|197336491|ref|YP_002157416.1| restriction modification system S subunit [Vibrio fischeri MJ11] gi|197315194|gb|ACH64642.1| restriction modification system S subunit [Vibrio fischeri MJ11] Length = 426 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 66/430 (15%), Positives = 141/430 (32%), Gaps = 43/430 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + PI + N RT + G + +I + + + + + + F G Sbjct: 4 DIAPITTLVEFNPSRTIKKGTVVPFIEMASLPTSHRDI---GIIAEKEFNGGGAKFKNGD 60 Query: 86 ILYGKLGPYLRKAIIADF-------DGICSTQFLVLQPKDVLPELLQGWL--LSIDVTQR 136 L+ ++ P L A G ST+F+V+ K + + + Sbjct: 61 TLFARITPCLENGKTAQVQGLPEGTFGFGSTEFIVMSAKLPEYDKDYVYYLARLPEFRIY 120 Query: 137 IEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + EG + W+ + PP + + +I + ++ Sbjct: 121 AQTHMEGTSGRQRVPWQSLAKFEYRFPPKEGRKSAASFLKMLDKKIASNTAMNQTLEKIA 180 Query: 196 KEKKQALVSYIVT----------KGLNPDVK-MKDSGIEW--VGLVPDHWEVKPFFALVT 242 ++ GL+P+++ + S E VG++P W+V+ Sbjct: 181 LRIFKSWFIDFDPVKANKEGVAFDGLSPEIQALFPSEFEESEVGVIPKGWKVQSLSKTAN 240 Query: 243 E----LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 +K + + + L + ++ T+ + + I+ G+ +F + Sbjct: 241 FLNGLACQKYPPVSQDDALPVIKIAEMRSGYTQKTNEASSTVNSKYIIKSGDFLFSWSGS 300 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSL 357 GI+ V + A + + + A + + Sbjct: 301 LTT-----CYWGHSIGILNQHLFKVTSDIYPQWFYAHWVNYHLGEFIRIAADKATTMGHI 355 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETA-RIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 K + VLVP DI + A I+ L++ E + L+ + R F+ ++ Sbjct: 356 KRGHLDEAKVLVPS----QDILVAGSRVIAPLINKLIQNQENTRSLI-DIRDRFLPKLIS 410 Query: 417 GQIDLRGESQ 426 GQI + GE+Q Sbjct: 411 GQITV-GEAQ 419 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 62/208 (29%), Gaps = 14/208 (6%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT------SESGKDIIYIGLEDVESGTGK 62 ++++S V G IPK WKV + + G + I + ++ SG Sbjct: 217 EFEESEV---GVIPKGWKVQSLSKTANFLNGLACQKYPPVSQDDALPVIKIAEMRSGY-- 271 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + N S ++ I G L+ G L GI + + Sbjct: 272 --TQKTNEASSTVNSKYIIKSGDFLFSWSGS-LTTCYWGHSIGILNQHLFKVTSDIYPQW 328 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 W+ A + TM H + + +P V I ++ Sbjct: 329 FYAHWVNYHLGEFIRIAADKATTMGHIKRGHLDEAKVLVPSQDILVAGSRVIAPLINKLI 388 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKG 210 I++ L+S +T G Sbjct: 389 QNQENTRSLIDIRDRFLPKLISGQITVG 416 >gi|227544651|ref|ZP_03974700.1| type I site-specific restriction-modification system, S (specificity) subunit [Lactobacillus reuteri CF48-3A] gi|300909432|ref|ZP_07126893.1| type I restriction/modification specificity protein [Lactobacillus reuteri SD2112] gi|227185376|gb|EEI65447.1| type I site-specific restriction-modification system, S (specificity) subunit [Lactobacillus reuteri CF48-3A] gi|300893297|gb|EFK86656.1| type I restriction/modification specificity protein [Lactobacillus reuteri SD2112] Length = 385 Score = 79.5 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 55/402 (13%), Positives = 116/402 (28%), Gaps = 40/402 (9%) Query: 29 PIKRFTKLN---TGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + G+T + DI+ + +++++G L K + + Sbjct: 5 RLGDVLTIIMDYRGKTPKKLGLDWTEDKNDIVALSAKNLKNGELINLDKSHYGNSALYNK 64 Query: 78 VSI---FAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVL--PELLQGWLLSI 131 + G IL P ++ I S + +L+P + + P L ++ S Sbjct: 65 WMKDGDISVGDILMTSEAPLGELFLVDKPIKAILSQRIFLLRPNNSIVLPWYLYFYMSSK 124 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + R+ G T+ K + NI + +P L+ Q I K++ I I Sbjct: 125 NFQNRLNGHATGTTVIGIKQKELRNIEIELPSLSIQKSIVRKLVP----ISKKIEINKEI 180 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 L E + S N K + G P + + + + + Sbjct: 181 NANLLELITLIWSRYSQNISNKVPLKKIAKDIVTGKTPSTKIKANYGSDIPFVKIPDMHN 240 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +++ L + + + I+ I S Sbjct: 241 KVFI-----------DETLQSLSLLGADSQKNKYLPANSIMVSCIGTPGLVSLTGSIAQT 289 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + I + ++ +RS G ++L D +L V+VP Sbjct: 290 NQQINSLVL-----DEKFIYWVFLELRSLSNKIGNLGSGGTTIKNLNKSDFSKLEVVVPD 344 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + N I + L + + + Sbjct: 345 NDI---LLDKFNSIAKPIFESIHTNSFETNKLNQLKKRLLHK 383 >gi|315634180|ref|ZP_07889469.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393] gi|315477430|gb|EFU68173.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393] Length = 475 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 61/411 (14%), Positives = 129/411 (31%), Gaps = 40/411 (9%) Query: 31 KRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFA 82 K + G+ G D I YI ED+++G Y + Sbjct: 48 KNIIVVKGGKRLPEGHDFLNNKSGIPYIRAEDIKNGFVDYTNSPTISLLTHREIKAYQTE 107 Query: 83 KGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +L +G + I F+ + + L K + PE L +L S IE Sbjct: 108 YNDVLMTIVGNSIGDIGIVKFNLDICNLTENAVRLITKKIKPEYLFSFLESKFGQNYIER 167 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK- 198 G + I I +PI Q I + + ++ LL + Sbjct: 168 NKVGTAQPKLSIERIRKIKIPIVSSEFQDEIESLVSSAFEKLQKSKETYQAAQNLLLDHL 227 Query: 199 --------KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 QA+ + ++ E+ + + K + Sbjct: 228 GLKDFNPPAQAVNVKSFSDSFGRSGRL---DAEFYQEKYEGYLKKIQAYPYGCEPIRTAC 284 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPE------------SYETYQIVDPGEIVFRFIDL 298 ++ + Q +E N+G E + V +++ ++ Sbjct: 285 KLKDANYTPKDNQTYQYIELSNIGNLGEITGASLDLGCNLPSRARRKVSKNDVIVSSVEG 344 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSL 357 ++ S + + + ++ + V I+S L L +S + ++ SG ++ Sbjct: 345 SLASCAIVS-EQYHQALCSTGFYVVSSEKINSETLLILFKSESIQQLLKQGCSGTILTAI 403 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINV---ETARIDVLVEKIEQSIVLLKE 405 ++ +P+ + Q I ++I + L+EK ++++ L E Sbjct: 404 NKDEFLNIPLPLVDANIQTQIADLIRQSNYLRIKSKGLLEKAKKAVELAIE 454 >gi|217034275|ref|ZP_03439692.1| hypothetical protein HP9810_885g6 [Helicobacter pylori 98-10] gi|216943247|gb|EEC22712.1| hypothetical protein HP9810_885g6 [Helicobacter pylori 98-10] Length = 197 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 59/197 (29%), Gaps = 17/197 (8%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G + K T + ++GN ++ + L+ ++ Y Sbjct: 15 LGDIGKPCMCKRVMKHQTTRYGEIPFYKIG-----TFGNTADAFISKKLFLEYKT--KYS 67 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 G+I+ + + + + +L +Y K Sbjct: 68 FPKKGDILISASGT----IGKAVIYDGKPAYFQDSNIVWIDNDETLVKNDFLFYAYSNVK 123 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 L ++ + + +PP+ EQ I N+++ I L K Q + Sbjct: 124 W--NTEHTTILRLYNDNFRNTLIPLPPLNEQSAIANILSALDNEITSLKNKKRQ----FE 177 Query: 405 ERRSSFIAAAVTGQIDL 421 + + ++ +I + Sbjct: 178 NIKKALNHDLMSTKIRV 194 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 59/189 (31%), Gaps = 10/189 (5%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 6 LPLNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYK 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G + I +V E L Sbjct: 64 TKYSFPKKGDILISASGTIGKAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYS 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ I + A I +L ++ +F + Sbjct: 121 NVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQSAIANILSALDNEITSLKNKKRQFENIK 180 Query: 196 KEKKQALVS 204 K L+S Sbjct: 181 KALNHDLMS 189 >gi|237744714|ref|ZP_04575195.1| type I restriction-modification system specificity subunit [Fusobacterium sp. 7_1] gi|229431943|gb|EEO42155.1| type I restriction-modification system specificity subunit [Fusobacterium sp. 7_1] Length = 346 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 44/369 (11%), Positives = 107/369 (28%), Gaps = 39/369 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVE--SGTGKYLPKDGNSRQS 73 WK + + L G+T + +I + D+ + + Sbjct: 7 SEWKKIKLGDIFILQMGKTPLRENKLYWDKGNYNWISISDMNFSEKYISFTKEKITDFAI 66 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + I K ++ + K I + D + + PK+ + S+ Sbjct: 67 KKSGIKIIPKNTVIMS-FKLSIGKVKIVNEDIYSNEAIMAFIPKEDIFIDENFLYHSLKS 125 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + E I + + I + +P L Q I + + Sbjct: 126 VRWNEGINKAVKGLTLNKNLISQKEIFLPDLTTQKEITNNLDTIDNLL------------ 173 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 E ++ ++Y+ G + V +GIE + + + + Sbjct: 174 ---ELRKKQLNYLKELGKSLFVTFNKNGIEK--------RLDDIADISMGQSPLSQSYNI 222 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 Y + + +IV+ +I+ D ++ Sbjct: 223 DKKGLPFYQGKTEFGDIYIKEPIIYCNSPIKIVEKNDILMSVRAPVGD-----VNIATQK 277 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 I +++ +D YL +L++ K+ +++ ++ L + + + Sbjct: 278 SCIGRGLASIRAKKVDYLYLFYLLK-ERKIKIEKMGVGSTFKAINKNNISSLQIPIIEMS 336 Query: 374 EQFDITNVI 382 +Q I + Sbjct: 337 KQNRIKKYL 345 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 59/190 (31%), Gaps = 15/190 (7%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ----KLETRNMGLKPESYETYQI 285 W+ + K N I + + E + I Sbjct: 7 SEWKKIKLGDIFILQMGKTPLRENKLYWDKGNYNWISISDMNFSEKYISFTKEKITDFAI 66 Query: 286 VDPGEIVF--RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH---GIDSTYLAWLMRSY 340 G + + + + V E A MA P ID +L ++S Sbjct: 67 KKSGIKIIPKNTVIMSFKLSIGKVKIVNEDIYSNEAIMAFIPKEDIFIDENFLYHSLKSV 126 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + GL +L + + + +P + Q +ITN ++ ID L+E ++ + Sbjct: 127 RWNEGINKAVKGL--TLNKNLISQKEIFLPDLTTQKEITNNLDT----IDNLLELRKKQL 180 Query: 401 VLLKERRSSF 410 LKE S Sbjct: 181 NYLKELGKSL 190 >gi|298384310|ref|ZP_06993870.1| type I restriction-modification system, S subunit [Bacteroides sp. 1_1_14] gi|298262589|gb|EFI05453.1| type I restriction-modification system, S subunit [Bacteroides sp. 1_1_14] Length = 329 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 42/315 (13%), Positives = 98/315 (31%), Gaps = 33/315 (10%) Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVL 169 ++L+P ++ + S +G ++ +P+PPL EQ Sbjct: 1 MVILRPINIYAKFYLYLFKSQWYIDEGTKYFKGVVGQQRVHKGIFTDLHIPLPPLVEQQR 60 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 I +I ID + ++ +K+ K ++ + L P + IE + + Sbjct: 61 IVTEIEKWFALIDQIEQGKVNLQTTIKQIKSKILDLAIHGKLVPQDPNDEPSIELLQRIN 120 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------------- 275 + ++ + +++ S K + Sbjct: 121 PDFTPCDNGHYPFDVPNEWKWCKMNDLCSFLSRGKSPKYSEDDKTYPVFAQKCNLKEGGI 180 Query: 276 -----------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV---MERGII--TSA 319 +++ + G+++ R+ + ++ + Sbjct: 181 SLEQAKFLDPSTINKWDSKYKLQTGDVLVNSTGTGTVGRTRLFDESCLGKYPFVVPDSHV 240 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + I+S Y+ M S + + GS ++ L ++ L +PPIKEQ Sbjct: 241 SVVRTYEEINSEYVFAYMSSQLIQQYIEDNLAGSTNQKELYIGVLENLYFPLPPIKEQQR 300 Query: 378 ITNVINVETARIDVL 392 I I + +D + Sbjct: 301 IVQKIEKLFSILDNI 315 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 20/101 (19%), Positives = 41/101 (40%), Gaps = 2/101 (1%) Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFD 377 + ++P I + + +L +S G+ +Q + L + +PP+ EQ Sbjct: 1 MVILRPINIYAKFYLYLFKSQWYIDEGTKYFKGVVGQQRVHKGIFTDLHIPLPPLVEQQR 60 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I I A ID + + +K+ +S + A+ G+ Sbjct: 61 IVTEIEKWFALIDQIEQGKVNLQTTIKQIKSKILDLAIHGK 101 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 63/181 (34%), Gaps = 17/181 (9%) Query: 20 AIPKHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLE----DVESGTGKYLPKDGN--SRQ 72 +P WK + + G++ + +D + +++ G S Sbjct: 134 DVPNEWKWCKMNDLCSFLSRGKSPKYSEDDKTYPVFAQKCNLKEGGISLEQAKFLDPSTI 193 Query: 73 SDTSTVSIFAKGQILYGKLGP-------YLRKAIIADFDGIC--STQFLVLQPKDVLPEL 123 + + G +L G ++ + + + S +V +++ E Sbjct: 194 NKWDSKYKLQTGDVLVNSTGTGTVGRTRLFDESCLGKYPFVVPDSHVSVVRTYEEINSEY 253 Query: 124 LQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + ++ S + Q IE G+T + N+ P+PP+ EQ I +KI +D Sbjct: 254 VFAYMSSQLIQQYIEDNLAGSTNQKELYIGVLENLYFPLPPIKEQQRIVQKIEKLFSILD 313 Query: 183 T 183 Sbjct: 314 N 314 >gi|307704328|ref|ZP_07641245.1| type I restriction modification DNA specificity domain protein [Streptococcus mitis SK597] gi|307622088|gb|EFO01108.1| type I restriction modification DNA specificity domain protein [Streptococcus mitis SK597] Length = 390 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 57/405 (14%), Positives = 111/405 (27%), Gaps = 33/405 (8%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 IPK W I K++ G + K + + K Sbjct: 5 DIPKIRFYSYQGSWTENRIADIVKISAGGDVDKVKLKESGKYPVIANA---LTNKGIVGF 61 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 D + G + + K + Sbjct: 62 YED----YKVKAPAVTVTGRGDVGYAVARHENFTPIVRLLTLQSEKIDVD-------YLE 110 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + E + +GN + P + EQ I + + Sbjct: 111 NQINSMRILNESTGVPQLTAPQLGNYKVYYPEIEEQSAIGSLFRTLDDLLASY----KDN 166 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + + K ++S + K +++ +G + V E+ F Sbjct: 167 LANYQSLKATMLSKMFPKDGQTVPEIRLNGFDGEWEVQSLKELARFSKGNGYTKSDLVNS 226 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 IL Q + + E+ E I GE+V + S S Sbjct: 227 GNEIILYGQLYTNYQTVISMVNTFVLETREKSVISKGGEVVVPASGESAEDISRASVIEK 286 Query: 312 ERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 II + P + +DS +LA + + K G L+ D++++ + Sbjct: 287 SGVIIGGDLNIIYPDENKVDSIFLALTISNGSQQKELIKRAQGKSVVHLRNNDLEKVVLH 346 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I+EQ I + +D L ++ I L+ + + Sbjct: 347 YPSIEEQQAIGAY----FSNLDNLFNFHQEKISQLETLKKKLLQD 387 >gi|260910285|ref|ZP_05916960.1| type I restriction-modification system [Prevotella sp. oral taxon 472 str. F0295] gi|260635587|gb|EEX53602.1| type I restriction-modification system [Prevotella sp. oral taxon 472 str. F0295] Length = 446 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 69/423 (16%), Positives = 125/423 (29%), Gaps = 53/423 (12%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 IP+ W V + L+ G+T + + I+ I + Y+ + S Sbjct: 24 EIPESWCFVRLGDICNYLHRGKTPKYGNQKILPIIAQKCNHWNQLYIDRCLFSDTDYILK 83 Query: 78 VS---IFAKGQILYGKLGPYLRKAIIADFDGIC---------STQFLVLQPKDVLPELLQ 125 KG I+ G D + S +V K V + Sbjct: 84 YKEEQFLQKGDIIINSTGGGTVGRTGYIDDSVFDKFDKFVADSHVTVVRSTKLVSHRYIY 143 Query: 126 GWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +LLS + IE C G+T I + +PIPP+ EQ I +K+ + + Sbjct: 144 LYLLSPYIQIGIEERCTGSTNQIELRTTTISDYLVPIPPVEEQKRIVKKVESMLPIVTRY 203 Query: 185 ITERIRFIELLKEKK----QALVSYIVTKGLNPDVKMKDSG------------------- 221 + L ++++ + L P + Sbjct: 204 QKLQSNLEHLNSTLFPLIKKSILQEAIQGKLVPQDPNDEPASVLLQRIKEEKQRLVKEGK 263 Query: 222 ---------IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 + + G ++E A+ E + E LS + + + N Sbjct: 264 LKKKDVVDSVIYKGDDNKYYEQVDGIAVPIESDYDFPSTWEVVRLSHICRLMDGEKKEGN 323 Query: 273 MGLKPESY----ETYQIVDPGEIV--FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 Y T +D G+ V I L + + S V G + S + + Sbjct: 324 HVCLDAKYLRGKSTGTYLDKGKFVAKGNNIILVDGENSGEVFTVPHDGYMGSTFKQLWVS 383 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + + + + L + L + +PP +EQ I N I Sbjct: 384 SRMYLPYVLYIIQFYKNLLRNSKKGAAIPHLNKDIFYSLLIGIPPYQEQERIANAIGELY 443 Query: 387 ARI 389 A + Sbjct: 444 APL 446 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 67/219 (30%), Gaps = 20/219 (9%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL-------SLSYGNIIQKLETR 271 E +P+ W + L+R T + + + + Sbjct: 16 CIDEEIPFEIPESWCFVRLGDICNYLHRGKTPKYGNQKILPIIAQKCNHWNQLYIDRCLF 75 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR----SAQVMERGIITSAY-MAVKPH 326 + Y+ Q + G+I+ R+ ++ + S + Sbjct: 76 SDTDYILKYKEEQFLQKGDIIINSTGGGTVGRTGYIDDSVFDKFDKFVADSHVTVVRSTK 135 Query: 327 GIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + Y+ + S + GS + L+ + V +PP++EQ I + Sbjct: 136 LVSHRYIYLYLLSPYIQIGIEERCTGSTNQIELRTTTISDYLVPIPPVEEQKRIVKKVES 195 Query: 385 ETARIDVLVEKIEQSIVLLKE-----RRSSFIAAAVTGQ 418 I +K++ ++ L + S + A+ G+ Sbjct: 196 ML-PIVTRYQKLQSNLEHLNSTLFPLIKKSILQEAIQGK 233 >gi|227365079|ref|ZP_03849108.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] gi|227069883|gb|EEI08277.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] Length = 338 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 49/365 (13%), Positives = 116/365 (31%), Gaps = 36/365 (9%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +V +K G ++ KD+ + +G+Y P G + + + + Sbjct: 2 IVKLKDVC--IKGTSNIRQKDV---------NDSGRY-PVYGAAGPVGFMNSFQYDEPYV 49 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 K G + +A + L PK + + +S +E GAT+ Sbjct: 50 GVVKDGAGIGRATYLPSNSSIIGTMQALIPKKNVLPKYLYYAVSS---MHLEKYYSGATI 106 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 H +K + + EQ II ++ +I+ + + + L E +A Sbjct: 107 PHIYFKNYKHERFVLVSKKEQEQ----IIWRFSLLEKMISNKQQQLLKLDELIKA---RF 159 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 V +P K + + D + I + N+ Sbjct: 160 VEMFGDPISNKKSWKKRLLNDLVDKIGS------GATPKGGKESYQDHGISFIRSMNVHD 213 Query: 267 KLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYM 321 + + IV ++ + + ++ + + + Sbjct: 214 GYFNYKDLAYINSTQAKQLSNVIVQSQDVFINITGASVARSCIVPDDILPARVNQHVSII 273 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 K ++ ++ L + ++ + G RQ++ + ++ L +++PPI Q + Sbjct: 274 RCKSDVLNPIFINNLFLNDSFKRILLSIGLSGGATRQAITKKQLEMLKIILPPISLQNEY 333 Query: 379 TNVIN 383 N ++ Sbjct: 334 ANFVH 338 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 43/153 (28%), Gaps = 26/153 (16%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + + + + + II + + + YL + + S Sbjct: 37 GFMNSFQYDEPYVGVVKDGAGIGRATYLPSNSSIIGTMQALIPKKNVLPKYLYYAVSSMH 96 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 L K + + F++ K ++ KEQ I + ++ ++ +Q ++ Sbjct: 97 LEKYY---SGATIPHIYFKNYKHERFVLVSKKEQEQII----WRFSLLEKMISNKQQQLL 149 Query: 402 LLKER-------------------RSSFIAAAV 415 L E + + V Sbjct: 150 KLDELIKARFVEMFGDPISNKKSWKKRLLNDLV 182 >gi|242280198|ref|YP_002992327.1| restriction modification system DNA specificity domain protein [Desulfovibrio salexigens DSM 2638] gi|242123092|gb|ACS80788.1| restriction modification system DNA specificity domain protein [Desulfovibrio salexigens DSM 2638] Length = 415 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 42/392 (10%), Positives = 100/392 (25%), Gaps = 30/392 (7%) Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF-DGICST 109 I L + G + + + G I G+ G + +D +T Sbjct: 22 IDLPQSQRRVGDIPILGSSGVTGYHNESKVAGPG-ITVGRSGASIGVVTYSDIDFWPLNT 80 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 V + P +L D G+ + + + IPPL Q Sbjct: 81 ALYVKDFRGNHPRFAYYFLKQFDFK----RYNSGSAQPSLNRNFVHPTKIRIPPLKTQQA 136 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALV------------SYIVTKGLNPDVKM 217 I + +ID + + ++ V Sbjct: 137 IAHILGTIDEKIDLNRRMNETLEAMAQAIFKSWFVDFDPVKAKARGEQPVGMDAETAALF 196 Query: 218 KDSGI-EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR----- 271 DS +G +P W V V + Sbjct: 197 PDSFEPSGLGEIPRGWRVSEVGKEVIVKGGATPSTKNPLFWDGGSFCWATPKDLSALESP 256 Query: 272 --NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + + E + + + + + A + ++A++ Sbjct: 257 VLLDTARKITEEGVNRISSKLLPKGTLLMSSRAPVGYLAIAEIDTAVNQGFIAMECSKSL 316 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + ++ + Q + ++ K + ++VP +E + N + + Sbjct: 317 NCMYMLFWCKENMETIKSNANGSTFQEISKKNFKPISIIVP--EE--LVLNKFESAISPL 372 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + L R + + ++G++ + Sbjct: 373 YRKIVSNLKERQTLTSLRDTLLPNLISGELSV 404 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 69/207 (33%), Gaps = 15/207 (7%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG----------LEDVE 57 ++ SG +G IP+ W+V + + + G T + + + G L +E Sbjct: 198 DSFEPSG---LGEIPRGWRVSEVGKEVIVKGGATPSTKNPLFWDGGSFCWATPKDLSALE 254 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 S + + + + KG +L P IA+ D + F+ ++ Sbjct: 255 SPVLLDTARKITEEGVNRISSKLLPKGTLLMSSRAPV-GYLAIAEIDTAVNQGFIAMECS 313 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L + + + I++ G+T K I + +P I Sbjct: 314 KSLN-CMYMLFWCKENMETIKSNANGSTFQEISKKNFKPISIIVPEELVLNKFESAISPL 372 Query: 178 TVRIDTLITERIRFIELLKEKKQALVS 204 +I + + ER L L+S Sbjct: 373 YRKIVSNLKERQTLTSLRDTLLPNLIS 399 >gi|159897810|ref|YP_001544057.1| restriction modification system DNA specificity subunit [Herpetosiphon aurantiacus ATCC 23779] gi|159890849|gb|ABX03929.1| restriction modification system DNA specificity domain [Herpetosiphon aurantiacus ATCC 23779] Length = 398 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 55/198 (27%), Positives = 85/198 (42%), Gaps = 4/198 (2%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP W V + + G + D + LED+E T L + + S Sbjct: 74 QIPPSWIWVSLDDIVVYDAGSKHDPNNLDPDSWLLELEDIEKNTSVILGQFLVKERKPKS 133 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQ 135 + F K ILYGKL PYL K I+A G C+T+ +VL+PK L P +Q +L S Sbjct: 134 NKASFQKNDILYGKLRPYLNKVIVAHTSGFCTTEIVVLRPKLELSPFYIQNFLKSPFFVS 193 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + G M +P+PPLAEQ I K+ D L ++ L Sbjct: 194 YVNQHSYGTKMPRLGTLDGKKASIPLPPLAEQQRIVAKVAQLMALCDQLEQQQTSREALR 253 Query: 196 KEKKQALVSYIVTKGLNP 213 ++ +Q+ + ++++ P Sbjct: 254 QQVQQSAIKQLLSELARP 271 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 29/229 (12%), Positives = 73/229 (31%), Gaps = 9/229 (3%) Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + K+Q +++ V +N + W+ + D + ++ + N Sbjct: 45 YNMFKPLIKEQQMLNIDVRSSINKEHTKFQIPPSWIWVSLDDIV---VYDAGSKHDPNNL 101 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + + + + + +I++ + +K + Sbjct: 102 DPDSWLLELEDIEKNTSVILGQFLVKERKPKSNKASFQKNDILYGKLRPYLNKVIVAHTS 161 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 G T+ + ++P S + ++S G L D K+ + Sbjct: 162 ----GFCTTEIVVLRPKLELSPFYIQNFLKSPFFVSYVNQHSYGTKMPRLGTLDGKKASI 217 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +PP+ EQ I + A D L ++ L ++ + S I ++ Sbjct: 218 PLPPLAEQQRIVAKVAQLMALCDQLEQQQTSREALRQQVQQSAIKQLLS 266 >gi|294670044|ref|ZP_06735001.1| type I restriction system specificity protein [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291308165|gb|EFE49408.1| type I restriction system specificity protein [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 394 Score = 79.5 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 52/386 (13%), Positives = 116/386 (30%), Gaps = 39/386 (10%) Query: 26 KVVPIKR---FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + P+ + TG+ K + + G Y + Sbjct: 20 EWKPLGGENGIAIIKTGQAVSKQK---------ISNNIGSYPVINSGKEPLGYIDEWNTE 70 Query: 83 KGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G + + + + V ++ + + ++ Q I A+C Sbjct: 71 NDPIGITTRGAGVGSITWQEGRYFRGNLNYAVTIKDRTELDVRFLYHILLEFEQEIHALC 130 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + + + + +PIPPL Q I + + T TL ++L K++ Sbjct: 131 TFTGIPALNASNLKKLLIPIPPLEIQQKIVKILDKFTELEATLEATLEAELQLRKQQYNY 190 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 +++ DV K + N K + S Sbjct: 191 YRDFLLNFAGREDVLFK-----------------KLSEVTNFQNGKGHEKDISESGKFIV 233 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT---- 317 N K + N + S + + +I+ DL N K ++ V + T Sbjct: 234 VN--SKFISTNGQVLKYSDKQLVPLFEDDILIVMSDLPNGKVLSKTYLVKQNNKFTLNQR 291 Query: 318 -SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + ++++ + + + +L+ E + + + +PP+ EQ Sbjct: 292 IGRITVKNKSELLPKFVSYFLDRTRQLTKYDNKVD--QTNLRKEQILEVFIPIPPLSEQA 349 Query: 377 DITNVINVETARIDVLVEKIEQSIVL 402 I +++ L + + + I L Sbjct: 350 RIVAILDKFDTLTTSLSDGLPREIAL 375 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 64/189 (33%), Gaps = 10/189 (5%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + + N + N I N G +P Y + I Sbjct: 21 WKPLGGENGIAIIKTGQAVSKQKISNNIGSYPVINSGKEPLGYIDEWNTENDPIGITTRG 80 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQS 356 + + + RG + A +D +L ++ + + +A+ + + Sbjct: 81 AGVGSITWQEGRYF-RGNLNYAVTIKDRTELDVRFLYHIL--LEFEQEIHALCTFTGIPA 137 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 L ++K+L + +PP++ Q I +++ T L +E + L K+ R + Sbjct: 138 LNASNLKKLLIPIPPLEIQQKIVKILDKFTELEATLEATLEAELQLRKQQYNYYRDFLLN 197 Query: 413 AAVTGQIDL 421 G+ D+ Sbjct: 198 --FAGREDV 204 >gi|257413933|ref|ZP_04744714.2| phosphoribosylformylglycinamidine synthase [Roseburia intestinalis L1-82] gi|257201766|gb|EEV00051.1| phosphoribosylformylglycinamidine synthase [Roseburia intestinalis L1-82] Length = 463 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 54/419 (12%), Positives = 123/419 (29%), Gaps = 59/419 (14%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W + T + G + + Y N + Sbjct: 52 EVPEGWCWCRLPVITTDIFAGGDKPDVYET-CLTESC---KIPIYSNGMENEGLYGYTNK 107 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + I G + + K++ V + + Sbjct: 108 PRVTEPSITISARGTIGFCCVRETPFVPIVRLITITPSKEIN------LYYLKTVFESLI 161 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG+++ GI +PIPP+ EQ+ ++ K+ I + E+ LL+ Sbjct: 162 ETGEGSSIPQLTVPGIKPKLIPIPPVNEQIRLQNKLNQILNYIVNISFEKDELQNLLQIV 221 Query: 199 KQALVSYIVTKGLNPDVK----------------------------------MKDSGIEW 224 K +++ + L P K + Sbjct: 222 KSKILNLAIRGKLVPQNPNDEPASVLLNRIHDEKEELIKQGNIKRDKKESVIFKGDDNSY 281 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKL----IESNILSLSYGNIIQKLETRNMGLKPESY 280 G+ ++ + E + LS N+ N + + Sbjct: 282 YGISLPTGWSWTILKDISFSISDGSHNPPSNKEFGVPLLSAANVNNNSILINNASRWITN 341 Query: 281 ETYQI------VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E ++I ++ G+++ + + + E + + +KP I YL Sbjct: 342 EEWEIENQRTDIEIGDVLLTIVGSIGRSAVV---ETNEHFALQRSVAVIKPCLISPFYLM 398 Query: 335 WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + ++ + K G ++ + + + V +PP+ EQ I I++ ++D++ Sbjct: 399 RIFQAPQIQKWINDNSKGTAQKGIYLNALSIMTVPIPPLDEQLRIVKQISIFFEQLDLI 457 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/231 (12%), Positives = 67/231 (29%), Gaps = 12/231 (5%) Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVK--MKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + Q L+++I +K E VP+ W + T++ Sbjct: 13 CMDSTFYFQWQYLITHINNLHYEKFQDGTVKCIEDEIPFEVPEGWCWCRLPVITTDIFAG 72 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 K K+ + G++ E Y I + Sbjct: 73 GDKPDVYETCLTESC----KIPIYSNGMENEGLYGYTNKPRVTEPSITISARGTIGFCCV 128 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + I+ + + +++ + L +K + Sbjct: 129 RETPFVPIVRLITITPSKEINL-----YYLKT-VFESLIETGEGSSIPQLTVPGIKPKLI 182 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP+ EQ + N +N I + + ++ LL+ +S + A+ G+ Sbjct: 183 PIPPVNEQIRLQNKLNQILNYIVNISFEKDELQNLLQIVKSKILNLAIRGK 233 >gi|323497666|ref|ZP_08102682.1| HsdA [Vibrio sinaloensis DSM 21326] gi|323317249|gb|EGA70244.1| HsdA [Vibrio sinaloensis DSM 21326] Length = 443 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 50/445 (11%), Positives = 120/445 (26%), Gaps = 62/445 (13%) Query: 28 VPIKRFTKLNTG-------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 V I L+ G + + + + D+ +GT + + T I Sbjct: 5 VSIGDVVSLSQGFAVNSKSKHLMGDSGLPLLRITDLINGT-----EAQYLTKETAPTKCI 59 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL--LSIDVTQRIE 138 K +I++ + G + +G+ + P + L + + + Sbjct: 60 AQKHEIIFTRTGQVG--LVFRGREGVVHNNCFKVIPNEDLVTHDYIYWTLKQPHIIKLAN 117 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ G+ + +I + + P Q + + ++ ++ + Sbjct: 118 SVASGSVQKDLNHSAFKSIDIDLIPKTVQEQNCQILNRIEEKLILNTQINQTLEQMAQVL 177 Query: 199 KQA-------LVSYIVTKG--------------------------LNPDVKMKDSGIEWV 225 ++ ++ + G + ++ S E Sbjct: 178 FKSWFVDFDPVIDNALDAGSDIPEVFESRVERRKAVRESADFKPLPDDVRRLFPSEFEES 237 Query: 226 --GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 G VP WE +T + N + + K + Sbjct: 238 ESGWVPKGWETSTAGQELTVKGGSTPSTKNPDFWDGGNINWTSPKDLSDNDTKIMFETSR 297 Query: 284 QIVDPGEIVFR-------FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 +I D G + + + A I Y+A+ S Sbjct: 298 KITDAGLAKITSGLLPRETVLMSSRAPVGYLALTKIPVAINQGYIAIPESRRLSQEYILY 357 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + + G + + K + +LVPP I + + Sbjct: 358 WLDSQMDMIKGLSGGTTFAEISKKTFKSISILVPPCP----IVEAFSKNVEVYLNKISSN 413 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDL 421 LL +SS + ++G++++ Sbjct: 414 VGESSLLATVQSSLLPKLISGELEI 438 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 55/196 (28%), Gaps = 12/196 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSR 71 G +PK W+ + + G T + +I + +D+ K + + Sbjct: 240 GWVPKGWETSTAGQELTVKGGSTPSTKNPDFWDGGNINWTSPKDLSDNDTKIMFETSRKI 299 Query: 72 QSDTSTVS---IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + + +L P + + ++ + L Sbjct: 300 TDAGLAKITSGLLPRETVLMSSRAPV-GYLALTKIPVAINQGYIAIPESRRL-SQEYILY 357 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 I+ + G T + K +I + +PP + + +I + + E Sbjct: 358 WLDSQMDMIKGLSGGTTFAEISKKTFKSISILVPPCPIVEAFSKNVEVYLNKISSNVGES 417 Query: 189 IRFIELLKEKKQALVS 204 + L+S Sbjct: 418 SLLATVQSSLLPKLIS 433 >gi|228475404|ref|ZP_04060123.1| Sau1hsdS1 [Staphylococcus hominis SK119] gi|228270587|gb|EEK12019.1| Sau1hsdS1 [Staphylococcus hominis SK119] Length = 394 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 62/406 (15%), Positives = 119/406 (29%), Gaps = 47/406 (11%) Query: 23 KHWKVVPIKRFTKL-NTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + W I K+ +G + S + + I ++++ ++ Sbjct: 18 EDWNERTISDSIKILKSGLSRELSTTDIGLPVIRANNLQNYNLVLDDIKYWFKEDPKGAK 77 Query: 79 ---SIFAKGQILYG------KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 K IL K+G D I +T L K+ Sbjct: 78 TENYYLEKNDILVNFINSEAKMGTSCIIKSDFKRDTIYTTNILRYVTKETYDSYFHYIYT 137 Query: 130 SIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 ++ I + IP IP EQ I + ++D I Sbjct: 138 QTYNYKKWIKIITKPAVNQASFTTVDFKKIPYYIPEFNEQKKIGDF----FSKLDRQIEL 193 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + ++LL+++K+ + I ++ L + G WE+K + Sbjct: 194 EEKKLDLLEQQKKGYMQKIFSQEL--------RFKDENGNDYPEWEIKKLMQIAKVKTGS 245 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + L ++ I+ PGE + L Sbjct: 246 KNVQDNIQDGKYKFFDR----SVEVKYLNTFDFDETAIIYPGE----------GSKFLPR 291 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + + AY + ++ + S +SL+ +L V Sbjct: 292 YFSGKYSLHQRAYSIYDININNNYLYYY--LSLQNNHFLKYAVGSTVKSLRMSGFDKLKV 349 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +VP EQ I + +D +EK + LLK+R+ SF+ Sbjct: 350 MVPKNSEQEKIGSF----FKNLDEFIEKQANKVELLKKRKQSFLQK 391 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 59/150 (39%), Gaps = 9/150 (6%) Query: 280 YETYQIVDPGEIVFRFIDL---QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 ++ +I+ FI+ ++S + T+ V DS + Sbjct: 77 KTENYYLEKNDILVNFINSEAKMGTSCIIKSDFKRDTIYTTNILRYVTKETYDSYFHYIY 136 Query: 337 MRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 ++Y+ K + + S D K++P +P EQ I + +++D +E Sbjct: 137 TQTYNYKKWIKIITKPAVNQASFTTVDFKKIPYYIPEFNEQKKIGDF----FSKLDRQIE 192 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 E+ + LL++++ ++ + ++ + E Sbjct: 193 LEEKKLDLLEQQKKGYMQKIFSQELRFKDE 222 >gi|228472518|ref|ZP_04057278.1| type I restriction modification DNA specificity domain protein [Capnocytophaga gingivalis ATCC 33624] gi|228275931|gb|EEK14687.1| type I restriction modification DNA specificity domain protein [Capnocytophaga gingivalis ATCC 33624] Length = 233 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 60/173 (34%), Gaps = 3/173 (1%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET--RNMGLKPESYETYQ 284 +P+ W ++ + + + N + ++ + Q T + + + + Sbjct: 60 KLPEGWVWCQGNQILNTMKSQKPSGEKFNYIDIASIDNRQNKITEVKTIAVTEAPSRASR 119 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 V G+ +F + + + T Y+ + YL +LM S + + Sbjct: 120 KVKFGDTLFSMVRPYLKNIAFVDEEYSNCIASTGFYVCSPNETLFPKYLFYLMVSDYVVQ 179 Query: 345 VFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 G S+ ED+ +PP+ EQ I I A D + +++ Sbjct: 180 GLNKHMKGDNSPSINNEDITNFIFPLPPLAEQHRIVEKIESFFASFDQIEKEL 232 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 40/169 (23%), Positives = 65/169 (38%), Gaps = 6/169 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 +P+ W + + K YI + +++ K K ++ + Sbjct: 60 KLPEGWVWCQGNQILNTMKSQKPSGEK-FNYIDIASIDNRQNKITEVKTIAVTEAPSRAS 118 Query: 79 SIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVL-PELLQGWLLSIDVT 134 G L+ + PYL+ D + I ST F V P + L P+ L ++S V Sbjct: 119 RKVKFGDTLFSMVRPYLKNIAFVDEEYSNCIASTGFYVCSPNETLFPKYLFYLMVSDYVV 178 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 Q + +G + + I N P+PPLAEQ I EKI + D Sbjct: 179 QGLNKHMKGDNSPSINNEDITNFIFPLPPLAEQHRIVEKIESFFASFDQ 227 >gi|254037299|ref|ZP_04871376.1| conserved hypothetical protein [Escherichia sp. 1_1_43] gi|226840405|gb|EEH72407.1| conserved hypothetical protein [Escherichia sp. 1_1_43] Length = 273 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 52/156 (33%), Gaps = 10/156 (6%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITS 318 + N E ++ + V G++ + + Q E Sbjct: 40 FYNYFTPDELGDLVQSNDKERENCSVKRGDVFLTRTSETMHELGMSCVALQDYENATFNG 99 Query: 319 AYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKE 374 ++PH Y+ + +RS + A + R SL E + RL + PP +E Sbjct: 100 FCKRLRPHQNSELVPEYVGYYLRSTKFRQSMLAFSTMSTRASLNNEMIGRLEISYPPEEE 159 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 Q +I V+ +D + Q L++ + Sbjct: 160 QIEIARVL----KNLDDKITLNRQINQTLEQMAQAL 191 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 64/197 (32%), Gaps = 13/197 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +WK + + +G + + ++ +DV + +D + Sbjct: 3 SNWKTTKLLDHYDIRSGLSKPAKDFGSGHPFLTFKDVFYNYFTPDELGDLVQSNDKEREN 62 Query: 80 I-FAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLV----LQPKDVLPELLQGWLL 129 +G + + + + D++ F Q +++PE + +L Sbjct: 63 CSVKRGDVFLTRTSETMHELGMSCVALQDYENATFNGFCKRLRPHQNSELVPEYVGYYLR 122 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S Q + A +T + + + IG + + PP EQ+ I + +I Sbjct: 123 STKFRQSMLAFSTMSTRASLNNEMIGRLEISYPPEEEQIEIARVLKNLDDKITLNRQINQ 182 Query: 190 RFIELLKEKKQALVSYI 206 ++ + ++ Sbjct: 183 TLEQMAQALFKSWFVDF 199 >gi|260913245|ref|ZP_05919727.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] gi|260632832|gb|EEX51001.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] Length = 238 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 75/190 (39%), Gaps = 13/190 (6%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 W+ K L +E ++KNT + G I +L +++ ++ Y++V P Sbjct: 23 SGWDKKILGELFSERSKKNTPEKTVLAATQDRGVIPYELMEKSVIRDRKNLSGYKLVLPK 82 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVF-Y 347 + V + GII+ AY+ + P Y + + Sbjct: 83 DFVISLRSFEG-----GFEYSEYEGIISPAYVVLYPKIKICNYFFRIYFKQERFIQQIQN 137 Query: 348 AMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 ++ + LR +S+ ++ L + +P I EQ I + + + +D L+E EQ + LK+ Sbjct: 138 SLNNSLRDGKSISYKQASTLSIALPEITEQQKIADCL----SSLDELIELQEQKLAALKQ 193 Query: 406 RRSSFIAAAV 415 + + Sbjct: 194 HKKGLMQQLF 203 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 66/203 (32%), Gaps = 14/203 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY--LPKDGNSRQSDTSTVSI 80 W + + + + + + G Y + K + + S + Sbjct: 23 SGWDKKILGELFSERSKKNTPEKT----VLAATQDRGVIPYELMEKSVIRDRKNLSGYKL 78 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG-WLLSIDVTQRIEA 139 + L + ++++GI S ++VL PK + + Q+I+ Sbjct: 79 VLPKDFVIS-LRSFEGGFEYSEYEGIISPAYVVLYPKIKICNYFFRIYFKQERFIQQIQN 137 Query: 140 ICEGATMS--HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + +K + + +P + EQ I + + + I + + + LK+ Sbjct: 138 SLNNSLRDGKSISYKQASTLSIALPEITEQQKIADCLSSLDEL----IELQEQKLAALKQ 193 Query: 198 KKQALVSYIVTKGLNPDVKMKDS 220 K+ L+ + + + S Sbjct: 194 HKKGLMQQLFPSHNDLQASKQAS 216 >gi|87309190|ref|ZP_01091327.1| probable type I restriction modification system methylase [Blastopirellula marina DSM 3645] gi|87288181|gb|EAQ80078.1| probable type I restriction modification system methylase [Blastopirellula marina DSM 3645] Length = 460 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 51/457 (11%), Positives = 135/457 (29%), Gaps = 58/457 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 WK + +++G + + G ++ +DV + + Sbjct: 3 SEWKTAFLTDLYDISSGLSKSAKFFGSGHPFVAFKDVMYNYFLPNELSQLVQSTKEEQQK 62 Query: 80 I-FAKGQILYGKLGPYLR------KAIIADFDGICSTQFLVLQPKDV---LPELLQGWLL 129 +G + + + A+ D + L+PK +PE + +L Sbjct: 63 CSVNRGDVFLTRTSETMNELGMSSVAVKDYEDATFNGFTKRLRPKPDTTIVPEFVAYYLR 122 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + A +T + + + I + + P + EQ +I + +I+ Sbjct: 123 SPKFRSEMRAFSTMSTRASLNNEMISRLKISFPSVLEQRVIGGVLKTLDDKIELNRQMNE 182 Query: 190 RFIELLKEKKQA-------LVSYIVTKG---------------------------LNPDV 215 + + Q+ ++ + G + Sbjct: 183 TLESMARALFQSWFVDFDPVIDKALAAGNPIPEPLQARAETRRALANSTQPLPAHIQKLF 242 Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 E +G +P+ W+V P +T R + L + + + + Sbjct: 243 PDAFQFDEEMGWIPEGWKVTPVGEAITINPRVS--LKKGAVAKYVDMKSLPTSGFAINEV 300 Query: 276 KPESYETYQIVDPGEIVFRFIDL---QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + + +++ I + E G ++ ++ ++P T Sbjct: 301 IEKEFSGGAKFLNADVLMARITPCLENGKAGVVDYLDDDEPGFGSTEFIVLRPKNEIGTP 360 Query: 333 LAWLMRSYDLCK---VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + + + V +GS RQ ++ + +P + + + Sbjct: 361 FIAALVRDENFRAHCVSNMVGSSGRQRVQNSCFDSYFLCLPSKP---PLLTSYHKTCSTF 417 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + K++ L + R + + ++G+I + + Sbjct: 418 FARITKLKLETNSLTKLRDTLLPKLLSGEIRIPDAEK 454 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 51/133 (38%), Gaps = 12/133 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G IP+ WKV P+ +N + + G Y+ ++ + + + + S Sbjct: 253 GWIPEGWKVTPVGEAITINPRVSLKKGAVAKYVDMKSLPTSGFAINE----VIEKEFSGG 308 Query: 79 SIFAKGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKDVL-PELLQGWLLS 130 + F +L ++ P L + D G ST+F+VL+PK+ + + + Sbjct: 309 AKFLNADVLMARITPCLENGKAGVVDYLDDDEPGFGSTEFIVLRPKNEIGTPFIAALVRD 368 Query: 131 IDVTQRIEAICEG 143 + + G Sbjct: 369 ENFRAHCVSNMVG 381 >gi|291460947|ref|ZP_06026048.2| putative type I restriction modification DNA specificity domain protein [Fusobacterium periodonticum ATCC 33693] gi|291379863|gb|EFE87381.1| putative type I restriction modification DNA specificity domain protein [Fusobacterium periodonticum ATCC 33693] Length = 487 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 59/460 (12%), Positives = 132/460 (28%), Gaps = 87/460 (18%) Query: 21 IPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP +W V +K +K + G K + ++ ++ + + + Sbjct: 34 IPSNWVWVGLKYISKKIFAGG----DKPENFSKMKTDKNIFPIFSNGIDKDGLYGYTDEA 89 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + + G I ++ + + + + Sbjct: 90 KVLEKALTISARGTIGFTKIREANFTPIIRLIVI------ILKDRILYEFLDYYFKYNSL 143 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+++ + +P+ PL EQ I EK+ + ++ +K Sbjct: 144 EGVGSSIPQLTVPIVNEKIIPLSPLEEQKRIVEKLDFLFEKTKKAKEIIEEIKIDIENRK 203 Query: 200 QALVSYIVTKGLNPDVKM--KDSGIEWV-------------------------------- 225 +++ L + K S ++ + Sbjct: 204 ISILDRAFKGTLTSKWRNENKTSDVKELLKSINEEKIKKWEKDCLQAEKDGNKKPKKPII 263 Query: 226 --------------GLVPDHWEVKPFFAL-----VTELNRKNTKLIESNILSLSYG---- 262 +PD W + + K ++ NI G Sbjct: 264 KEVKDMIVPVDKQPYKLPDSWVWVRLGEISKLSGGSGFPEKYQGFLDKNIPFYKVGSLKN 323 Query: 263 ---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 N + + + ++ I+F I R R A + E I + Sbjct: 324 IDDNFYIENSENYIDDDILTEIKAKLFPANTIIFAKIG--EAIRLNRRAILKENSCIDNN 381 Query: 320 YMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 MA+ + Y+ + ++ DL K A S++ ++ L +PP++EQ +I Sbjct: 382 LMALVSNSSCYFRYVYFWLKKEDLYKYAQA---TTVPSIRQSTLEELEFPLPPLEEQEEI 438 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ + + E +E+SI + A G+ Sbjct: 439 VRALDEVLENENKVKELLEKSI----------LHKAFKGE 468 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 56/172 (32%), Gaps = 15/172 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W V + +KL+ G K+I + + +++ + ++ + Sbjct: 279 KLPDSWVWVRLGEISKLSGGSGFPEKYQGFLDKNIPFYKVGSLKNIDDNFYIENSENYID 338 Query: 74 D----TSTVSIFAKGQILYGKLGPYLR--KAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 D +F I++ K+G +R + I + + L + Sbjct: 339 DDILTEIKAKLFPANTIIFAKIGEAIRLNRRAILKENSCIDNNLMALVS---NSSCYFRY 395 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + + T+ + + P+PPL EQ I + Sbjct: 396 VYFWLKKEDLYKYAQATTVPSIRQSTLEELEFPLPPLEEQEEIVRALDEVLE 447 >gi|189423706|ref|YP_001950883.1| restriction modification system DNA specificity domain [Geobacter lovleyi SZ] gi|189419965|gb|ACD94363.1| restriction modification system DNA specificity domain [Geobacter lovleyi SZ] Length = 422 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 61/424 (14%), Positives = 126/424 (29%), Gaps = 35/424 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQ--SDTSTV 78 W+ V + + T T ++ + I + ++ + + + ++ + Sbjct: 6 WQYVRGENYCSKVTDGTHDTPEQVERGKYLITSKHIKGDEIDFDSAYFITEEDFNEINKR 65 Query: 79 SIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S + ++ +G Y I D D L ++ + L +L S Sbjct: 66 SKVDQWDVIISMIGAYCGFCFIESNSDIDYAIKNVGLFKTGNEINAKWLYYYLNSSVGKA 125 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++A G+T + + +P+ P + I + + ID I R Sbjct: 126 HLDAAKSGSTQPYIALGALRELPILTPKDEITKKKIVNVLDS----IDKKIRNNNRINAE 181 Query: 195 LKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELN 245 L+ + L Y + PD K SG + V +P W L + Sbjct: 182 LEAMAKTLYDYWFVQFDFPDATGKPYKSSGGKMVYNTTLKREIPVGWNDGTLDDLGQIVG 241 Query: 246 RKNT-KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG-------EIVFRFID 297 ESN + I + N G K + + D G + + Sbjct: 242 GSTPSTKKESNFTASGTPWITPNDLSDNQGYKFITRGAQDVSDSGIKDASLKKYPAGTVL 301 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 L + A E + + P S+ + L + + + Sbjct: 302 LSSRAPIGYMAIAREELTTNQGFKSFIPTNDYSSAFIYYTLKNSLKTIVQHASGSTFKEV 361 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +K + + +P + A + +EQ L + R + + G Sbjct: 362 SGAVLKTVKICLPASG----VVEQFTNAVAPTFKRQDLLEQENQHLTQLRDWLLPMLMNG 417 Query: 418 QIDL 421 Q+ + Sbjct: 418 QVTV 421 Score = 50.6 bits (119), Expect = 6e-04, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 40/126 (31%), Gaps = 12/126 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTG-KYLPK---DG 68 IP W + ++ G T + K+ +I D+ G K++ + D Sbjct: 223 EIPVGWNDGTLDDLGQIVGGSTPSTKKESNFTASGTPWITPNDLSDNQGYKFITRGAQDV 282 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + +++ + G +L P AI + + F P + + Sbjct: 283 SDSGIKDASLKKYPAGTVLLSSRAPIGYMAIAREEL-TTNQGFKSFIPTNDYSSAFIYYT 341 Query: 129 LSIDVT 134 L + Sbjct: 342 LKNSLK 347 >gi|291320527|ref|YP_003515791.1| type I R/M system specificity subunit [Mycoplasma agalactiae] gi|290752862|emb|CBH40837.1| Type I R/M system specificity subunit [Mycoplasma agalactiae] Length = 508 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 52/350 (14%), Positives = 123/350 (35%), Gaps = 26/350 (7%) Query: 80 IFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVL--PELLQGWLLSID 132 + Y + + + +G+ S ++ + K+ + + D Sbjct: 2 LIKNDDFAYNRSISGEKIFGVIRKLENYENGVISPVYIAFRLKNKHVTDSVFLQYYYLTD 61 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + I + ++ L + +I +D+LI R + Sbjct: 62 IWHKEAKNIVFKGARQLLNVSINDFFDMKLIISPNYLEQHRIGRLLSNLDSLIALHQRKL 121 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 LK K L+ + + ++ + WE + L + KNT Sbjct: 122 SSLKNLKNRLLDKMFCDEKSQFPSIRFK------EFTNAWEQEKLKNLTDRIIEKNTHSQ 175 Query: 253 ESNILSLSYG-NIIQKLETRNMGLKPESYETYQIVDPGEIVFRF-IDLQNDKRSLRSAQV 310 S +L++S +I + + N + ++ + Y ++ + + I + +R + Sbjct: 176 SSRVLTISQHQGLIDQNDFFNHRVASKNLKNYLLIKNDDFAYNRSISGEKIFGVIRKLEN 235 Query: 311 MERGIITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQ--SLKFEDVKR 364 E G+I+ Y+A H DS +L + + K + G RQ ++ D Sbjct: 236 YENGVISPVYIAFRLKNKHVTDSVFLQYYYLTDIWHKEAKNIVFKGARQLLNVSINDFFD 295 Query: 365 LPVLV-PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +++ P EQ I ++ + +D L+ ++ + LK ++ + Sbjct: 296 MKLIISPNYLEQHRIGRLL----SNLDSLIALHQRKLSSLKNLKNRLLDK 341 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 55/138 (39%), Gaps = 12/138 (8%) Query: 284 QIVDPGEIVFRF-IDLQNDKRSLRSAQVMERGIITSAYMAV---KPHGIDSTYLAWLMRS 339 ++ + + I + +R + E G+I+ Y+A H DS +L + + Sbjct: 1 MLIKNDDFAYNRSISGEKIFGVIRKLENYENGVISPVYIAFRLKNKHVTDSVFLQYYYLT 60 Query: 340 YDLCKVFYAMG-SGLRQ--SLKFEDVKRLPVLV-PPIKEQFDITNVINVETARIDVLVEK 395 K + G RQ ++ D + +++ P EQ I ++ + +D L+ Sbjct: 61 DIWHKEAKNIVFKGARQLLNVSINDFFDMKLIISPNYLEQHRIGRLL----SNLDSLIAL 116 Query: 396 IEQSIVLLKERRSSFIAA 413 ++ + LK ++ + Sbjct: 117 HQRKLSSLKNLKNRLLDK 134 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 45/382 (11%), Positives = 102/382 (26%), Gaps = 37/382 (9%) Query: 25 WKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ +K T + T ++ I + + + + Sbjct: 155 WEQEKLKNLTDRIIEKNTHSQSSRVLTISQHQGLIDQNDFF--NHRVASKNLKNYLLIKN 212 Query: 84 GQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQR 136 Y + + + +G+ S ++ + K+ + + D+ + Sbjct: 213 DDFAYNRSISGEKIFGVIRKLENYENGVISPVYIAFRLKNKHVTDSVFLQYYYLTDIWHK 272 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I + ++ L + +I +D+LI R + LK Sbjct: 273 EAKNIVFKGARQLLNVSINDFFDMKLIISPNYLEQHRIGRLLSNLDSLIALHQRKLSSLK 332 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 K L+ + + ++ V +L + I ++ Sbjct: 333 NLKNRLLDKMFCDEKSQFPSIRFKEFTNAWEQ----------WKVGDLIKSAKVNICRSV 382 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + +I++ G + ET ++F L K I Sbjct: 383 VKYGKYEVIEQGIQSVFGYSNNTNETPYWDYEPIVLFGDHTLSIYKPK------SPFFIA 436 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + A + YL + + Y Y S +SL + EQ Sbjct: 437 SDGVKAYYSLRTNGYYLFYSLERYKPLSDGYKRYSSTLKSLNMWITEN-------DVEQS 489 Query: 377 DITNVINVETARIDVLVEKIEQ 398 I + +D L+ ++ Sbjct: 490 KI----SSLFTLLDSLITLHQR 507 >gi|321222502|gb|EFX47574.1| Type I restriction-modification system, specificity subunit S [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] Length = 199 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 58/170 (34%), Gaps = 6/170 (3%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + E + + +++F I + + G Sbjct: 18 FMPMAGVPTTYLGKCNFETKKWSEVKKGFTQFQNDDVIFAKITPCFENGKAVVIKEFPNG 77 Query: 315 IITS----AYMAVKPHGIDSTYLAWLMRSYDL--CKVFYAMGSGLRQSLKFEDVKRLPVL 368 + I+ +L L+++ D GS + + E ++ V Sbjct: 78 YGAGSTEYYVLRSINGLINPHWLFALVKTKDFLTNGALNMSGSVGHKRVTKEFLENYGVP 137 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 VPP+ EQ I ++ A++D ++EQ +LK R S I AAV GQ Sbjct: 138 VPPLAEQKVIAEKLDTLLAQVDSTKARLEQIPQILKRFRQSVIVAAVNGQ 187 >gi|260436989|ref|ZP_05790805.1| type I restriction-modification system, S subunit [Butyrivibrio crossotus DSM 2876] gi|292810610|gb|EFF69815.1| type I restriction-modification system, S subunit [Butyrivibrio crossotus DSM 2876] Length = 266 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 74/209 (35%), Gaps = 11/209 (5%) Query: 218 KDSGIEWVGLVPDHWEVKPFFA--LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 K E VP+ W + V + ++ I ++ N+++ + Sbjct: 12 KCIEEEIPFEVPEGWAWCRLNSIVDVRDGTHDTPTYVDKGIPLITSKNLVEGGIDYSNVK 71 Query: 276 KPESYETYQI-----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + I V+ G+I+F I + + ++ I A S Sbjct: 72 YISEKDAISINERSGVNIGDILFAMIGTIGNPSMVTEDILI--SIKNVALFKFTFSKNLS 129 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + Y + GL+ + ++ V VPP++EQ I +++ +I Sbjct: 130 NHFVMYFLDYAQEDMKNKPSGGLQPFVSLNFLRTYLVPVPPVEEQQRIVSILADSINKIR 189 Query: 391 VLVEKIEQSIVL-LKERRSSFIAAAVTGQ 418 ++ ++ + +K+ +S + A+ G+ Sbjct: 190 N-IDILKNELTASVKKAKSKILDLAIRGK 217 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 59/203 (29%), Gaps = 6/203 (2%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ W + + G K I I +++ G Y S + S Sbjct: 21 EVPEGWAWCRLNSIVDVRDGTHDTPTYVDKGIPLITSKNLVEGGIDYSNVKYISEKDAIS 80 Query: 77 --TVSIFAKGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVLPELLQGWLLSIDV 133 S G IL+ +G +++ + I L Sbjct: 81 INERSGVNIGDILFAMIGTIGNPSMVTEDILISIKNVALFKFTFSKNLSNHFVMYFLDYA 140 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ G + +P+PP+ EQ I + +I + + Sbjct: 141 QEDMKNKPSGGLQPFVSLNFLRTYLVPVPPVEEQQRIVSILADSINKIRNIDILKNELTA 200 Query: 194 LLKEKKQALVSYIVTKGLNPDVK 216 +K+ K ++ + L P Sbjct: 201 SVKKAKSKILDLAIRGKLVPQDP 223 >gi|302879961|ref|YP_003848525.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] gi|302582750|gb|ADL56761.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] Length = 401 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 54/374 (14%), Positives = 123/374 (32%), Gaps = 21/374 (5%) Query: 26 KVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVSIF 81 ++ + + TG + + IY+ + V++ + ++ + + + Sbjct: 3 ELATLGAVVEKTTGTRNPTKAPNDSFIYVDVAAVDNTQKIIFGARNILGNAAPSRARKLI 62 Query: 82 AKGQILYGKLGPYLRKAIIA--DFDG-ICSTQFLV-LQPKDVLPELLQGWLLSIDVTQRI 137 G IL + P L + D DG I ST F V VLPE L +++S + Sbjct: 63 RTGDILVSTVRPNLNAVALVTADLDGQIASTGFCVLRATTKVLPEYLFYFVISRKFVDAL 122 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 ++ GA + +P+P + EQ I + + + + EL+ Sbjct: 123 SSLVAGALYPAVSDSQVLAQSLPLPSIVEQRRIVDILSRAGGIVKLRREAEKKSAELIPA 182 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + +P K + + V + + + + + + Sbjct: 183 L-------FLDMFGDPATNPKGWPVVMLPDVLAYPFKNGLYLPKEKYAPEESGEGVEMVH 235 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGI 315 K L E + +++ L D + E + Sbjct: 236 MSDAFYGEVKRGGLRRVLAEEKQIRDYGLSKNDLLVARRSLTYDGAAKLCGIPASDEPLL 295 Query: 316 ITSAYMAVKPH--GIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPP 371 S+++ + P + + YL + + + + V + + + ++PV+VPP Sbjct: 296 FESSFIRLIPDSGKVRTEYLLYYLNDENTRRAHVLSRISGITISGINQAAMNQIPVMVPP 355 Query: 372 IKEQFDITNVINVE 385 + +Q D ++ Sbjct: 356 LPKQGDFVERVSEV 369 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 50/135 (37%), Gaps = 11/135 (8%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+I+ + + +L +A + + T + + YL + + S Sbjct: 59 RKLIRTGDILVSTVRPNLNAVALVTADLDGQIASTGFCVLRATTKVLPEYLFYFVISRKF 118 Query: 343 CKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 ++ +G ++ V + +P I EQ I ++++ + K+ + Sbjct: 119 VDALSSLVAGALYPAVSDSQVLAQSLPLPSIVEQRRIVDILSRAGG-----IVKLRREAE 173 Query: 402 LLKERRSS-FIAAAV 415 +S+ I A Sbjct: 174 K----KSAELIPALF 184 >gi|169823775|ref|YP_001691386.1| type I restriction-modification enzyme specificity subunit [Finegoldia magna ATCC 29328] gi|167830580|dbj|BAG07496.1| type I restriction-modification enzyme specificity subunit [Finegoldia magna ATCC 29328] Length = 375 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 59/408 (14%), Positives = 121/408 (29%), Gaps = 46/408 (11%) Query: 22 PKHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 PK W+ V + ++ G + GK G + Sbjct: 5 PKDWEEVKLVDIPIQIKKGDLITKKE-----------IANGKIPVIAGGKSPAYYCNRYN 53 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 I G + S + + + E + L + I + Sbjct: 54 REGTTITVSASGANAGYVNLFYGQIFASDCSTIEEDRSYCIE--YIYYLMAKEQENIYKL 111 Query: 141 CEGATMSHADWKGIGNIPMPI-PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G H K I + + + EQ I E ++ I+ +E L EKK Sbjct: 112 QTGGAQPHVHPKDIKKLEIIYSRNIEEQKSIAETLMTFDRHIEN--------LEKLIEKK 163 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + + V + ++ EW +G + K F+ T N K Sbjct: 164 KMIRDGAVEDLMTGKTRLDGFDGEWEKLLLGDIFKINMCKRVFSYQTVKNGK-------- 215 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I G +K + Y+ Y G+ + K + + + Sbjct: 216 IPFFKIGTFGKKADAYISEELFNQYKHLYPYPSKGDSLISASGSIG-KIVVYNGENSYYQ 274 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIK 373 ++ + +D ++L + +R++ + L + + +P IK Sbjct: 275 DSNIVWLKTNLNIVDKSFLYFYLRTFPWKI----TEGTTIKRLYNNIILETEINLPTDIK 330 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 EQ I +++ I+ L ++ + + + +TG++ L Sbjct: 331 EQQAIASILTSMDEEIENLEKEKAKIEKIKA----GAMDDLLTGRVRL 374 >gi|148549813|ref|YP_001269915.1| restriction modification system DNA specificity subunit [Pseudomonas putida F1] gi|148513871|gb|ABQ80731.1| restriction modification system DNA specificity domain [Pseudomonas putida F1] Length = 561 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 57/485 (11%), Positives = 112/485 (23%), Gaps = 95/485 (19%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W I G + I ++++ ++ N + + Sbjct: 82 ELPDGWAWCRIVDTGNYINGLAFKPSDWSSTGRPIIRIQNLSGRNAEF-----NRTEREV 136 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG-------------ICSTQFLVLQPK----- 117 + G IL L I G I S Q+L K Sbjct: 137 DASVVVNPGDILVSWS-ATLDTFIWRGEQGVLNQHIFRVTPSKIVSVQYLYWLLKWAIKV 195 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEG------ATMSHADWKGIGNIPMPIPPLA------ 165 E G +++ A G + + + Sbjct: 196 LADSEHAHGLVMAHINRGPFLAQPIGLPPLTEQNKIVVKIAELMALCDRLEARQADADSA 255 Query: 166 ------------EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 Q R+ + KQ L+ V L P Sbjct: 256 HAQLVQALLGSLTQASDAADFAQSWQRLAEHFHTLFTTESSIDALKQTLLQLAVMGKLVP 315 Query: 214 DVKMKDSGIEWVGLVPDHWEV-------KPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + E + V + K L + N G I Sbjct: 316 QDSRDEPASELLKRVSEEKARLVAEGKLKKQKPLGDVAISDIPFDVPDNWAWSRIGEIAL 375 Query: 267 KLETRNMGLKPESYETYQIVDPGEI------------------------------VFRFI 296 E + + ++ G+I ++ Sbjct: 376 NTEYGLSEKTFDLQDGVPVLKMGDIQEGRVLLGGQMAVSKNTEGLPGLYLETEDLLYNRT 435 Query: 297 DLQNDKRSLRSAQVMERGIITSAY---MAVKPHGIDSTYLAWLMRSYDLCKVFYA---MG 350 + ++Y + YL M + + Sbjct: 436 NSAELVGKTGVFLGQAGEYSFASYLIRIRCLKELFSPLYLNISMNAPGFRETQINPHLKQ 495 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + ++ +K + V VPP+ EQ I ++ A + L ++ Q++ + + S+ Sbjct: 496 QCGQANVNGTIMKNMLVSVPPLPEQHRIVAKVDQLMALCEQLKTRLNQALQVHEHLASAL 555 Query: 411 IAAAV 415 + AV Sbjct: 556 VEQAV 560 Score = 68.3 bits (165), Expect = 3e-09, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 65/200 (32%), Gaps = 12/200 (6%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNM 273 M+ + E +PD W +N K + + + N+ + N Sbjct: 72 MEVADSEQPFELPDGWAWCRIVDTGNYINGLAFKPSDWSSTGRPIIRIQNLSGRNAEFNR 131 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + +V+PG+I+ + + E+G++ V P I S Sbjct: 132 --TEREVDASVVVNPGDILVSWSATLDTFI-----WRGEQGVLNQHIFRVTPSKIVSVQY 184 Query: 334 AWLMRSYDLCKVFYA-MGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + + + + + G + + P+ +PP+ EQ I I A D Sbjct: 185 LYWLLKWAIKVLADSEHAHGLVMAHINRGPFLAQPIGLPPLTEQNKIVVKIAELMALCDR 244 Query: 392 LVEKIEQSIVLLKERRSSFI 411 L + + + + + Sbjct: 245 LEARQADADSAHAQLVQALL 264 >gi|315030633|gb|EFT42565.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX4000] Length = 385 Score = 79.1 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 47/391 (12%), Positives = 118/391 (30%), Gaps = 23/391 (5%) Query: 34 TKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + G + I + + + + + G+++ Sbjct: 2 ASFSKGNGYSKADLIEEGHPLILYGRLYTKYETIIESVDTFAKL-QDKSILSKGGEVIVP 60 Query: 90 KLGPYLR---KAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 G +A + D G+ ++ ++ P L + + + + +G Sbjct: 61 SSGESAEDISRASVVDVAGVVLGGDLNIIKTNSELNPTFLALTISNGSQQKEMSKRAQGK 120 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 ++ H + I + P + EQ+ I I+ + + EL K Q + Sbjct: 121 SIVHLHNSDLKEINLLYPKIEEQIYIGLFFKKLEDTINLHQRKLDQLKELKKAYLQVMF- 179 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 V P ++ D EW + + + K E+ + + ++ Sbjct: 180 -PVKDERAPKLRFADFEGEWEQCKLEDYATYRRGSFPQPYGNKKWYDGENAMPFVQVIDV 238 Query: 265 IQKLETRNMGLKPESY---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 ++L + S V G++V + ++R ++ Sbjct: 239 TEQLSLVKDTKQKISKLAQSKSVFVSAGKVVVTLQGSIGRVAITQYNSYIDRTLL---VF 295 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 D + A+ ++ G +++ E + V P +EQ N Sbjct: 296 ESYEKETDEYFWAYTIQQ-KFEIEKRKAPGGTIKTITKEALSSFEVNFPEYEEQQKNGNF 354 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +D ++ ++ + LK + S++ Sbjct: 355 L----KNLDNILTLDQKKLDQLKSLKKSYLQ 381 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 19/189 (10%), Positives = 50/189 (26%), Gaps = 10/189 (5%) Query: 24 HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ ++ + G + + ++ + DV + Sbjct: 197 EWEQCKLEDYATYRRGSFPQPYGNKKWYDGENAMPFVQVIDVTEQLSLVKDTKQKISKLA 256 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S + G+++ G R I ++ LV + + + Sbjct: 257 QSKSVFVSAGKVVVTLQGSIGR-VAITQYNSYIDRTLLVFESYEKETDEYFWAYTIQQKF 315 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G T+ + + + + P EQ + + + + L Sbjct: 316 EIEKRKAPGGTIKTITKEALSSFEVNFPEYEEQQKNGNFLKNLDNILTLDQKKLDQLKSL 375 Query: 195 LKEKKQALV 203 K Q + Sbjct: 376 KKSYLQNMF 384 >gi|255690849|ref|ZP_05414524.1| HsdS, type I site-specific deoxyribonuclease [Bacteroides finegoldii DSM 17565] gi|260623570|gb|EEX46441.1| HsdS, type I site-specific deoxyribonuclease [Bacteroides finegoldii DSM 17565] Length = 271 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 73/211 (34%), Gaps = 10/211 (4%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN--ILSLSYGNIIQKLETRNMGL 275 K E +P WE L+ +K + + L GNI + + Sbjct: 11 KCIDEEIPFEIPQGWEWCRLSLLIYPPKYGTSKKSVPSGLLPVLRMGNIQDGEIVFDKLV 70 Query: 276 KPESYE--TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + ++ G+++F + + I + ++P I+S YL Sbjct: 71 YSNDLDDNKKLLLQYGDLLFNRTNSAELVGKTAIFRGQRNAIFAGYLILLRPIFINSEYL 130 Query: 334 AWLMRSYDLCKVFYAMGS-GLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 L+ + + + G++Q ++ E + L + VP + E I + I Sbjct: 131 NLLLNTPYARDYCNEVKTIGVQQCNINAEKISNLLIPVPNLHETVAIVEKVKNIALPIIK 190 Query: 392 LVEKIEQSIVLLKER----RSSFIAAAVTGQ 418 E ++ L +E R S + A+ G+ Sbjct: 191 YGEFYQKLKHLNRELPIIIRKSILQEAIQGK 221 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 77/218 (35%), Gaps = 13/218 (5%) Query: 20 AIPKHWKVVPIKRFT---KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP+ W+ + K T + S + + + +++ G + K S D + Sbjct: 20 EIPQGWEWCRLSLLIYPPKYGTSKKSVPSGLLPVLRMGNIQDGEIVF-DKLVYSNDLDDN 78 Query: 77 TVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G +L+ + + AI +L+L + LL+ Sbjct: 79 KKLLLQYGDLLFNRTNSAELVGKTAIFRGQRNAIFAGYLILLRPIFINSEYLNLLLNTPY 138 Query: 134 --TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 E G + + + I N+ +P+P L E V I EK+ + I + Sbjct: 139 ARDYCNEVKTIGVQQCNINAEKISNLLIPVPNLHETVAIVEKVKNIALPIIKYGEFYQKL 198 Query: 192 IELLKE----KKQALVSYIVTKGLNPDVKMKDSGIEWV 225 L +E +++++ + L P + + + E + Sbjct: 199 KHLNRELPIIIRKSILQEAIQGKLVPQIAEEGTARELL 236 >gi|225619349|ref|YP_002720575.1| type1 restriction modification enzyme [Brachyspira hyodysenteriae WA1] gi|225214168|gb|ACN82902.1| type1 restriction modification enzyme [Brachyspira hyodysenteriae WA1] Length = 479 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 53/425 (12%), Positives = 120/425 (28%), Gaps = 53/425 (12%) Query: 29 PIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + + +G +S I I + ++ D + + K Sbjct: 45 KLGEYFDIFSGFAFKSEDYIEDGIPVIRISNISDNFNINNMVFVPDEYLDKYSNFVLKKN 104 Query: 85 QILYGKLGPYLRK--AIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAI 140 IL G K + D + + + L+ + L + V ++ Sbjct: 105 DILVSLTGDGKLKSDLVFEDNKYLLNQRVGCLRSIKEVNILFFYYVINYCNLVDKQFYWF 164 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK- 199 G T + NI +P+ +Q I I +I +L I +++ E Sbjct: 165 SNGKTQLNISPFDFLNIKIPLIDKQKQDEIVSLIEPIENKIKSLKETIIPEQKIINEVFA 224 Query: 200 ---------------------QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 Q + S I + K Sbjct: 225 REFGFDENLYNEFGKGMTAGTQIADNKTFKVFNTDFSDFSKSDIMRFSTRFHNTPTKKLM 284 Query: 239 ALVTELN--------------RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 ++ +++ + E I + N+ N E Sbjct: 285 NILNDIDTIKVKNIIFEYEKGIQPNYNTEGEIHVIKIQNLKNSYIDFNDSEYILEGEYNL 344 Query: 285 IVD----PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 I D + + + + + E I T ++ + + + RS Sbjct: 345 ISDSKKLKYDDIILCVTGKISLGKIDLYNYEEDAITTVDNFIIRITNYNKLFFVYFFRSI 404 Query: 341 DLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA---RIDVLVEK 395 G+ + L++++++ + P+K+Q +I + I+ + +I++ +EK Sbjct: 405 LGYFQIERDFTGTTNQIHLRWKEIENFKIPNIPLKKQQEIVDEIDNKIKEQQKINIQIEK 464 Query: 396 IEQSI 400 I Sbjct: 465 ERNKI 469 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 55/172 (31%), Gaps = 15/172 (8%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY---ETYQIVDPGEIVFR 294 F + + K+ IE I + NI NM P+ Y + ++ +I+ Sbjct: 50 FDIFSGFAFKSEDYIEDGIPVIRISNISDNFNINNMVFVPDEYLDKYSNFVLKKNDILVS 109 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-L 353 K L + + + + K FY +G Sbjct: 110 LTGDGKLKSDLVFEDNKYLLNQRVGCLRSIKEVNILFFYYVINYCNLVDKQFYWFSNGKT 169 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + ++ D + + + ++Q +I + L+E IE I LKE Sbjct: 170 QLNISPFDFLNIKIPLIDKQKQDEIVS-----------LIEPIENKIKSLKE 210 >gi|187736396|ref|YP_001878508.1| restriction modification system DNA specificity domain [Akkermansia muciniphila ATCC BAA-835] gi|187426448|gb|ACD05727.1| restriction modification system DNA specificity domain [Akkermansia muciniphila ATCC BAA-835] Length = 386 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 54/398 (13%), Positives = 116/398 (29%), Gaps = 50/398 (12%) Query: 25 WKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ + + G T S I + +++ + L +D + + + Sbjct: 19 WERRKLGDLAEFRRGLTYSPRDISTSGIRVLRSSNIDEDSF-VLAEDDVYVKETAVCIPL 77 Query: 81 FAKGQILY----GKLGPYLRKAIIADFDG-ICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 KG IL G + A+I D G + F++L + + + + Sbjct: 78 VEKGDILITAANGSSRLVGKHALIIDDKGKMVHGGFMLLAHPYTHSAFVNALMHAPWYSS 137 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I G + + I +EQ E+I + +D LIT R E L Sbjct: 138 FIRTNVAGGNGAIGNLNKSDLEEQDIAATSEQEQ--ERIGSLFASLDHLITLHQRKYEKL 195 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K++++ + K +++ +G + ++ V + + Sbjct: 196 LNIKKSMLDKMFPKNGELFPEVRFAG---FTDAWERQKLGDLVESVPFKQYIASPEPDGK 252 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + G + + G+ E Y I + + Sbjct: 253 FEIIQQG--SEPIIGYGNGIPCEDYAKITIFGDHTVSIYK-------------PQKPFFV 297 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV----LVPP 371 T + +D + +L+ Y Y + + P Sbjct: 298 ATDGTRLLTARVLDGDFFYFLLERYKPIPEGYKRHYT------------ILIERYGCFPS 345 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +EQ I ID L+ ++ + L+ + + Sbjct: 346 HREQKLIAIF----FRNIDHLITLHQRKLEKLQNIKKA 379 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 54/147 (36%), Gaps = 8/147 (5%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG-IITSAYMAVKPHGID 329 + E+ +V+ G+I+ + + + + ++G ++ +M + Sbjct: 63 EDDVYVKETAVCIPLVEKGDILITAANGSSRLVGKHALIIDDKGKMVHGGFMLLAHPYTH 122 Query: 330 STYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 S ++ LM + A G+G +L D++ + +EQ I + Sbjct: 123 SAFVNALMHAPWYSSFIRTNVAGGNGAIGNLNKSDLEEQDIAATSEQEQERIGS----LF 178 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAA 413 A +D L+ ++ L + S + Sbjct: 179 ASLDHLITLHQRKYEKLLNIKKSMLDK 205 >gi|327183902|gb|AEA32349.1| restriction modification system DNA specificity domain-containing protein [Lactobacillus amylovorus GRL 1118] Length = 370 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 44/378 (11%), Positives = 109/378 (28%), Gaps = 17/378 (4%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + +K ++ G++ +S G + R + + K Sbjct: 2 EYKHLKNIAQITMGQSPKSETYNNKKEGLPFFQGNADFGEISPKVRIWCSVPKKVAHKND 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL P IAD + + +++ + ++ ++ G+T Sbjct: 62 ILISVRAPI-GALNIADTECCIGRGLAAISVRNIKDRD-YIFNALKAKSEYLKNRGTGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + ++ +PI ++ + + + + E K L+ Sbjct: 120 FKAINKNILEDVEIPIVSTEKRDIEIKVLNKL--------NIVKKQKEKELSKLDTLIKA 171 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + K+ + + + + V E + T + + N Sbjct: 172 RFVEMFGDIKNNKNYNYKPISDLTNVVSGGTPKRDVKEYWDRGTI---PWVKTTELKNNK 228 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + ++V I+ + +A + + A + P Sbjct: 229 VNSTEEYITKTGLQNSSAKLVPSHTILIAMYGQGKTRG--MTAYLEKEAATNQACACILP 286 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ++ W ++ G + +L +K P+L PPI Q + I+ Sbjct: 287 SSKINSEYLWQYLIMSYEELRNLAKGGNQPNLNSRMIKDFPILDPPISLQNKFVSFIHQV 346 Query: 386 TAR--IDVLVEKIEQSIV 401 ++ L+ K SI Sbjct: 347 DKSKVVNNLIMKYIISID 364 >gi|303267739|ref|ZP_07353550.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS457] gi|303270111|ref|ZP_07355814.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS458] gi|302640356|gb|EFL70800.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS458] gi|302642728|gb|EFL73064.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS457] Length = 268 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 13 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 72 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 73 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 132 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 133 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEY 192 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 193 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 222 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 82/214 (38%), Gaps = 13/214 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 19 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 78 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 79 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 138 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPLAEQ I E I + ++D R Sbjct: 139 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNR 198 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS 220 +L KE ++++ Y + L +S Sbjct: 199 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDES 232 >gi|317182737|dbj|BAJ60521.1| Type I R-M system specificity subunit [Helicobacter pylori F57] Length = 193 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 57/197 (28%), Gaps = 17/197 (8%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G V K T + ++GN ++ + L+ ++ Y Sbjct: 11 LGDVGKPCMCKRVMKHQTTRYGEVPFYKIG-----TFGNTADAFISKKLFLEYKT--KYS 63 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 G+I+ + + + + +L +Y K Sbjct: 64 FPKKGDILISASGTIGRAVI----YDGKPAYFQDSNIVWIDNDETLVKNDFLFYAYSNVK 119 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 L + + + +PP+ EQ I N+++ I L K Q + Sbjct: 120 W--NTEHTTILRLYNNNFRNTLIPLPPLNEQSAIANILSGLDNEIASLKNKKRQ----FE 173 Query: 405 ERRSSFIAAAVTGQIDL 421 + + + +I + Sbjct: 174 NIKKALNHDLMNAKIRV 190 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 58/189 (30%), Gaps = 10/189 (5%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + ++ + + + ++ K Sbjct: 2 LPLNWQRVRLGDVGKPCMCKRVMKHQTTRYGEVPFYKIGTFGNTADAFISKKLFL--EYK 59 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G R I +V E L Sbjct: 60 TKYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYAYS 116 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ I + I +L ++ +F + Sbjct: 117 NVKWNTEHTTILRLYNNNFRNTLIPLPPLNEQSAIANILSGLDNEIASLKNKKRQFENIK 176 Query: 196 KEKKQALVS 204 K L++ Sbjct: 177 KALNHDLMN 185 >gi|301055838|ref|YP_003794049.1| type I restriction modification enzyme protein S [Bacillus anthracis CI] gi|300378007|gb|ADK06911.1| type I restriction modification enzyme protein S [Bacillus cereus biovar anthracis str. CI] Length = 369 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 48/398 (12%), Positives = 120/398 (30%), Gaps = 47/398 (11%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQS--DTSTVSI 80 ++ + + G + +++++G + S + + S Sbjct: 11 ELKKCEDIIDVRDGTHDSPKYVENGYPLVTSKNIKNGKLDLENINYISTEDFQKVNKRSK 70 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLLSIDVTQRI 137 KG ++ +G ++ F K + + +LLS +++ Sbjct: 71 VDKGDVIMPMIGTIGNPLLVETDREFAIKNVALFKFQDNKFIFNKFFYYFLLSDLCKKQL 130 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G T S + + N+ +P+PPL +Q I + I+ + EL Sbjct: 131 NGSKRGGTQSFVSLRDLRNLKVPLPPLEQQKEIVMVLDKVQGLIEKRKEAIAKLDEL--- 187 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + S NP K + + + + ++ Sbjct: 188 ----IESVFYDMFGNPITNPKKWETTRLDNIVVLQRGYDLPIKSRNEMGEVEIWGSNGVV 243 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + + G IV + + + Sbjct: 244 GVH---------------------NEAKIIGGGIVTGRSGSIGNVYYTYK----DFWALN 278 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + + +G + YL +L++ ++L + +L + + + P+ +Q + Sbjct: 279 TTLFSKETYGNNIVYLKYLLQYFNLKRFLN---GTGVPTLNRNVIHKEQIYKIPLNKQEE 335 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +I +I+ + E S++ L+E S+ + A Sbjct: 336 FAGII----KQIERTKSQFESSLIRLEESFSALVQRAF 369 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 68/191 (35%), Gaps = 13/191 (6%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK-----LETRNMGLKP 277 + + + + V + + K +E+ ++ NI Sbjct: 3 DNIMEIRYELKKCEDIIDVRDGTHDSPKYVENGYPLVTSKNIKNGKLDLENINYISTEDF 62 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWL 336 + VD G+++ I + L E I A + + + + + Sbjct: 63 QKVNKRSKVDKGDVIMPMIGTIGNP--LLVETDREFAIKNVALFKFQDNKFIFNKFFYYF 120 Query: 337 MRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + S K G + + D++ L V +PP+++Q +I V++ ++ L+EK Sbjct: 121 LLSDLCKKQLNGSKRGGTQSFVSLRDLRNLKVPLPPLEQQKEIVMVLD----KVQGLIEK 176 Query: 396 IEQSIVLLKER 406 +++I L E Sbjct: 177 RKEAIAKLDEL 187 >gi|313500656|gb|ADR62022.1| HsdS [Pseudomonas putida BIRD-1] Length = 576 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 66/496 (13%), Positives = 137/496 (27%), Gaps = 103/496 (20%) Query: 20 AIPKHWKVVPIKRFTK---------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 +P W L G + + ++ + D++ + P+ S Sbjct: 83 ELPTTWIWTSFDDLINPEYPIAYGVLVPG--PDVADGVPFVRIADLDLVAPPHKPEKSIS 140 Query: 71 RQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ--FLVLQPKDVLPELLQGW 127 + D + G+IL G +G + I + + + P + + W Sbjct: 141 PEVDRQYERTRIRGGEILMGVVGSIGKLGIAPESWAGANIARAICRVVPSVHVSKDYIIW 200 Query: 128 LLSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK------------- 173 LL D + ++ + I + P+PPLAEQ I K Sbjct: 201 LLQSDLMRKQFLGDTRTLAQPTLNVGLIRSAAAPLPPLAEQHRIVAKVEELMALCDRLEA 260 Query: 174 ----------------IIAETVRID------------TLITERIRFIELLKEKKQALVSY 205 + + T ID + K+ L+ Sbjct: 261 QQADAESAHVQLVQAMLDSLTQAIDAADFATSWQRLAEHFHTLFTNEFAIDALKKTLLQL 320 Query: 206 IVTKGLNPDVKMKDSGIEWV-------------------------------GLVPDHWEV 234 V L P +S E + +P W+ Sbjct: 321 AVMGKLVPQDVTDESASELLKRIEGEKQRLVDEGLMKKQKPLVESTSGQIKPALPSSWKW 380 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY----------Q 284 P + T ++ + N + K + + Sbjct: 381 VPLLDITTGMDSGWSPACLGNCSPSDDVWGVLKTTAVQVMSYLQHENKELPSHLEPRPEA 440 Query: 285 IVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYD 341 G+I+F N + +I+ + P + ++A + + + Sbjct: 441 ETKVGDILFTRAGPMNRVGISCLVESTRPKLMISDKIIRFHPVELGVYGRFVALCLNAGE 500 Query: 342 LCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 K SG+ + ++ E ++ P+ + P++EQ I ++ D L ++I Sbjct: 501 TAKYLEQAKSGMAASQVNISQEKLRLAPIPLAPLREQHRIVKKVDQLMKLCDTLKQQINV 560 Query: 399 SIVLLKERRSSFIAAA 414 + E + +A Sbjct: 561 ARSKQTELLDTLMAQV 576 >gi|225351803|ref|ZP_03742826.1| hypothetical protein BIFPSEUDO_03404 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225157050|gb|EEG70389.1| hypothetical protein BIFPSEUDO_03404 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 151 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 59/150 (39%), Gaps = 8/150 (5%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH- 326 + G S +Y+ + G+I F + GI++ + ++P Sbjct: 3 FNSTGNGADESSLPSYKRLRLGDIAFEGHANKEFAYGRFVLNDAGNGIMSPRFTCLRPIV 62 Query: 327 GIDSTYLAWLMRSYDLCK--VFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVIN 383 + ++ + + S ++ + + + SG + L +D +LVP + EQ I + Sbjct: 63 EQEYSFWKYFIHSEEVMRPILVNSTKSGTMMNELVVKDFLEQEILVPSLPEQRQIGAFFD 122 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ ++ + LL+ + S + Sbjct: 123 C----LDSLITLHQRKLELLRNIKKSMLDK 148 >gi|237723419|ref|ZP_04553900.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. D4] gi|229438209|gb|EEO48286.1| type I restriction enzyme EcoAI specificity protein [Bacteroides dorei 5_1_36/D4] Length = 242 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 77/199 (38%), Gaps = 7/199 (3%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIES--NILSLSYGNIIQKLETRNMGLKPESYET-- 282 VP+ W +V EL ++ S I L GNI L S + Sbjct: 13 EVPESWVWCRLDDIVCELKYGTSEKSSSVGKIAVLRMGNITNVGTIDYSNLVYSSNDEDI 72 Query: 283 -YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 ++ +++F + + + I + +KP I YL +M S Sbjct: 73 EQYSLEKNDLLFNRTNSSEWVGKTAIYKEEQPAIYAGYLIRIKPLLISPDYLNTVMNSGY 132 Query: 342 LCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 Y + + + ++ + + +L + +PP+KEQ I ++ + ID++ Sbjct: 133 YRDWCYDVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVAEMDKWISLIDIVKNGKGDL 192 Query: 400 IVLLKERRSSFIAAAVTGQ 418 + ++K+ +S + A+ G+ Sbjct: 193 LTVIKQAKSKILDLAIHGK 211 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 34/230 (14%), Positives = 82/230 (35%), Gaps = 10/230 (4%) Query: 20 AIPKHWKVVPIKRF-TKLNTGRTSESGK--DIIYIGLEDVES-GTGKYLPKDGNSRQSDT 75 +P+ W + +L G + +S I + + ++ + GT Y +S D Sbjct: 13 EVPESWVWCRLDDIVCELKYGTSEKSSSVGKIAVLRMGNITNVGTIDYSNLVYSSNDEDI 72 Query: 76 STVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S+ K +L+ + + AI + +L+ ++ +++ Sbjct: 73 EQYSL-EKNDLLFNRTNSSEWVGKTAIYKEEQPAIYAGYLIRIKPLLISPDYLNTVMNSG 131 Query: 133 VT--QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + S+ + + + + +PIPPL EQ I ++ ID + + Sbjct: 132 YYRDWCYDVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVAEMDKWISLIDIVKNGKGD 191 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 + ++K+ K ++ + L P + IE + + + Sbjct: 192 LLTVIKQAKSKILDLAIHGKLVPQDPNDEPAIELLKRINSDFTPCDNGHY 241 >gi|183603427|ref|ZP_02964389.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae CDC0288-04] gi|183575012|gb|EDT95540.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae CDC0288-04] Length = 432 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 42 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 98 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 99 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIES 158 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 159 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 196 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 63/432 (14%), Positives = 128/432 (29%), Gaps = 71/432 (16%) Query: 29 PIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++ G + KD I +I + D E G ++S + Sbjct: 2 RFSTLVEIIRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRF 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEA 139 KG L + R I+ I + ++ L + ++LS + V + + Sbjct: 62 VKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLS 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + GA + + + + +I +P+PPL+EQ I E I + ++D R +L KE Sbjct: 122 LISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFP 181 Query: 200 ----QALVSYIVTKGLNPDVKMKDSGIEWV------------------------------ 225 ++++ Y + L +S + Sbjct: 182 DKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGD 241 Query: 226 ----------------GLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNI 264 +P+ W F +LV K + I +S ++ Sbjct: 242 DNSYYGNKDETTSYPIYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDM 301 Query: 265 IQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N + + I G ++ F L II+ + Sbjct: 302 PISGYVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IF 360 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 I YL + G ++L + L + + +E I + Sbjct: 361 PYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIIS 418 Query: 381 VINVETARIDVL 392 +++ ++ L Sbjct: 419 KVDLLFQKVSQL 430 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 257 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 316 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 317 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 374 >gi|89898852|ref|YP_521323.1| restriction modification system DNA specificity subunit [Rhodoferax ferrireducens T118] gi|89343589|gb|ABD67792.1| restriction modification system DNA specificity domain [Rhodoferax ferrireducens T118] Length = 447 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 53/455 (11%), Positives = 123/455 (27%), Gaps = 62/455 (13%) Query: 23 KHWKVVPIKRFTK-----LNTGRTSES--GKDIIYIGLEDVESGT--GKYLPKDGNSRQS 73 W + L G + KD + G+ + G+++ D + Sbjct: 2 SEWIETTVGEIAASSRNALVGGPFGSNLVSKDYVDQGVPVIRGQNMGGRWVAGDFACVST 61 Query: 74 DTS---TVSIFAKGQILYGKLGPYLRKAIIADFDGIC-----STQFLVLQPKDVLPELLQ 125 + + + + G I++ + G + A++ D S L + P+ L Sbjct: 62 EKAAALSANTARPGDIVFTQRGTLGQVALVPDSPYETYVVSQSQMKLTVDPEKADSLFLY 121 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTL 184 S + I + H + + + P+ +P + Q I ++ +I+ Sbjct: 122 YLFSSPIQQEYIRQNSIQVGVPHTNLGILRDTPVVLPKSVDVQKDIARQLGTLDDKIELN 181 Query: 185 ITERIRFIELLKEKKQALVSYI----------------VTKGLNPDV---KMKDSGIEWV 225 + + Q+ GL P + + Sbjct: 182 RRMNETLEAMARAIFQSWFVDFDPVRAKASGESADSICQRLGLTPKLLALFPDSFEDSEL 241 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 G +P W + L E + + ++ Sbjct: 242 GEIPSGWMIGSIGTLANVTGGSTPNTKEPKYWDDGVHCWATPKDLSRLSSPVLLETERKV 301 Query: 286 VDPGEIVFRF-------IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 D G + L + + ++A+ P+ S Y Sbjct: 302 SDDGLAQIGSGLLKPGAVLLSSRAPIGYRVINEVPVAVNQGFIAMTPNSGVSKYFLLYWA 361 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL------ 392 + ++ + + + +PV+ P + + D Sbjct: 362 EWAHDEIVSRANGSTFLEISKANFRPIPVVRPT-----------DALFEKFDQYVGPLYK 410 Query: 393 -VEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + EQ LL +R S + ++G++ + E + Sbjct: 411 RIVSNEQEKQLLVAQRDSLLPKLLSGEVMVAAEEE 445 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 64/210 (30%), Gaps = 19/210 (9%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYI-------GLEDVESGT 60 ++DS +G IP W + I + G T + + + +D+ + Sbjct: 234 DSFEDSE---LGEIPSGWMIGSIGTLANVTGGSTPNTKEPKYWDDGVHCWATPKDLSRLS 290 Query: 61 GKYLPKDGNSRQSDT---STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 L + D + G +L P + I + + F+ + P Sbjct: 291 SPVLLETERKVSDDGLAQIGSGLLKPGAVLLSSRAPIGYRV-INEVPVAVNQGFIAMTPN 349 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + + I + G+T I P + + EK Sbjct: 350 SGVSKYFLLYWAEWA-HDEIVSRANGSTFLEISKANFRPI----PVVRPTDALFEKFDQY 404 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIV 207 + I + +LL ++ +L+ ++ Sbjct: 405 VGPLYKRIVSNEQEKQLLVAQRDSLLPKLL 434 >gi|237807984|ref|YP_002892424.1| restriction modification system DNA specificity domain-containing protein [Tolumonas auensis DSM 9187] gi|237500245|gb|ACQ92838.1| restriction modification system DNA specificity domain protein [Tolumonas auensis DSM 9187] Length = 437 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 55/396 (13%), Positives = 119/396 (30%), Gaps = 32/396 (8%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDG-NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD- 102 + + ++ D ++D SI +G I+ G + +II++ Sbjct: 37 SSGVPVLKGGNLHGAYVDDSDCDFLTEEKADELKSSIAFEGDIVITHRGTIGQVSIISED 96 Query: 103 ---FDGICSTQF--LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADW--KGIG 155 + S + L K V P + +L S ++ + + + Sbjct: 97 AKYPRYVVSQSQLKISLDRKKVNPYYVNYYLRSHLGQHQLLSFASQVGVPAIAQASTSVK 156 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS-----YIVTKG 210 I +P PPL Q I E I + +I ++ + ++ G Sbjct: 157 QIRVPCPPLDIQNKIVEFIRSVDKKIANNTQTNQTLEQMAQAIFKSWFVDFDPVKAKMNG 216 Query: 211 LNPDVKMKDSG--------IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 P+ + + +GL+P+ W ++P + +N + K E Sbjct: 217 EQPEGMDEATAALFPDKLVESELGLIPEGWNIQPLSDVSRVINGRAYKNSEFREKGTPIV 276 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 I +++D G+++F + + I Sbjct: 277 RIQNLTGAGKTVYSDIDLPQDKLIDHGDLIFAWSATFG-----PYLWRGPKSIYHYHIWK 331 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 ++ M L + G+G + L ++ ++VP V Sbjct: 332 MEVDENKFGKYLLFMHLARLTEYLKNQGTGSIFTHLTKGIMESQKLVVPFEGVVQAFAKV 391 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + +ID L + L+ R + ++G Sbjct: 392 VTPLFVQIDAL----HKQNKTLESLREILLPKLLSG 423 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 68/191 (35%), Gaps = 11/191 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +G IP+ W + P+ +++ GR ++ K + ++++ +G GK + D Sbjct: 239 LGLIPEGWNIQPLSDVSRVINGRAYKNSEFREKGTPIVRIQNL-TGAGKTVYSDI----- 292 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 D + G +++ + I ++ + ++ + Sbjct: 293 DLPQDKLIDHGDLIFAWS-ATFGPYLWRGPKSIYHYHIWKMEVDENKFGKYLLFMHLARL 351 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 T+ ++ G+ +H + + + +P + + V+ID L + Sbjct: 352 TEYLKNQGTGSIFTHLTKGIMESQKLVVPFEGVVQAFAKVVTPLFVQIDALHKQNKTLES 411 Query: 194 LLKEKKQALVS 204 L + L+S Sbjct: 412 LREILLPKLLS 422 >gi|325686955|gb|EGD28979.1| type I restriction/modification specificity protein [Streptococcus sanguinis SK72] Length = 382 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 52/406 (12%), Positives = 112/406 (27%), Gaps = 40/406 (9%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG---KYLPKDGNSRQSDTSTVSIFAKG 84 + + +++ + + + G+ + T + D S KG Sbjct: 4 IRLGEIGRISMCKRILKSQTNEFSGIPFYKISTFGGTPTVYIDEKVYHEYKEKYSYPKKG 63 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G + I D +V + L + G+ Sbjct: 64 DILISAAGTIGKTVIFDGEDSYFQDSNIVWIEN--DESKVTNQFLYYFLQTNPFITTNGS 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T+ + + +P P +Q KI +D I + + L+ + L Sbjct: 122 TIKRLYNDNLRDTTIPNVPSIQQQ---NKITDILGTLDKKIQINNQINQELEAMAKTLYD 178 Query: 205 YIVTKGLNPDV---KMKDSG------IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 Y + PD K SG E +P+ W V+ +T N K+ K ++ Sbjct: 179 YCFVQFDFPDQNGKPYKSSGGKMVYSPELKREIPEGWGVEKLSHFLTIKNGKDHKHLQDG 238 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 ++ I + T + ++ + ++ + + Sbjct: 239 KFAVYGSGGIMRTVT-------------------DYLYSGESILFPRKGTLNNVMYVNEK 279 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + ++ S S+ + L ++VP Sbjct: 280 FWTVDTMFYSEVNKNNSALYVFYSVKDIDFNKLNTGTGVPSMTSSILYDLNIIVPEAN-- 337 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N R ++ L + R + + GQ+ + Sbjct: 338 --ILEKFNTIVKRNYETIKLNNIQNQELTQLRDWLLPMLMNGQVKV 381 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 55/184 (29%), Gaps = 22/184 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W V + F + G+ + +D + G+ T T Sbjct: 210 EIPEGWGVEKLSHFLTIKNGKDHKHLQDGKF--------------AVYGSGGIMRTVTDY 255 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +++ IL+ + G + + T F K+ + + ID Sbjct: 256 LYSGESILFPRKGTLNNVMYVNEKFWTVDTMFYSEVNKNNSALYVFYSVKDIDF----NK 311 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + G + +I + + + I EK R I + L + + Sbjct: 312 LNTGTGVPSMTS----SILYDLNIIVPEANILEKFNTIVKRNYETIKLNNIQNQELTQLR 367 Query: 200 QALV 203 L+ Sbjct: 368 DWLL 371 >gi|219872006|ref|YP_002476381.1| type I site-specific deoxyribonuclease S subunit, restriction modification system [Haemophilus parasuis SH0165] gi|219692210|gb|ACL33433.1| type I site-specific deoxyribonuclease S subunit, restriction modification system [Haemophilus parasuis SH0165] Length = 332 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 39/372 (10%), Positives = 111/372 (29%), Gaps = 50/372 (13%) Query: 53 LEDVESGTGKYLPKDGNSRQSDTS----TVSIFAKGQILYGKLGPYLRKAIIADFDG--I 106 + ++ L D ++ + IL + I + G Sbjct: 1 MTNLNRNGITLLLDDLKFVNIQSNSADGKRTSLQANDILISITTELGKIGFIPENFGEAY 60 Query: 107 CSTQ--FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 + + + P + + L S + + I ++ + + + I + + +P + Sbjct: 61 INQHTALIRIDPNKAHAKFIAYVLSSATMNKTINSLNDAGAKAGLNLPTIKALSLKLPSI 120 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 EQ+ I E + D I + +E +++K+AL+ ++ Sbjct: 121 EEQIQIAETL----STWDNAIQTTEKLLENSRQQKKALMQRLL----------------- 159 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 + K ++L ++ + +I + + E++ Sbjct: 160 -----KGNNWLQTDLAELAVISKGSQLNKNTLSDNGQYAVINGGIEPSGYTDKFNTESHT 214 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 I + E+ A+ + + + + Y+ Sbjct: 215 I----------TISEGGNSCGYIGFQKEKFWCGGHCYAL-SNLRINCLFLYQLLKYNEEN 263 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + +++ + ++ + P I EQ I +++ I+ L ++ + L Sbjct: 264 IMRLRVGSGLPNIQKKALESFSLSYPQDISEQQKIAEILSTADQEIETL----QRKLECL 319 Query: 404 KERRSSFIAAAV 415 K + + + Sbjct: 320 KLEKGALMQRVF 331 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 54/137 (39%), Gaps = 5/137 (3%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + +I+ + +A + + P+ + ++A+++ S Sbjct: 29 KRTSLQANDILISITTELGKIGFIPENFGEAYINQHTALIRIDPNKAHAKFIAYVLSSAT 88 Query: 342 LCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + K ++ +G + L +K L + +P I+EQ I ++ D ++ E+ + Sbjct: 89 MNKTINSLNDAGAKAGLNLPTIKALSLKLPSIEEQIQIAETLSTW----DNAIQTTEKLL 144 Query: 401 VLLKERRSSFIAAAVTG 417 ++++ + + + G Sbjct: 145 ENSRQQKKALMQRLLKG 161 >gi|320352394|ref|YP_004193733.1| hypothetical protein Despr_0256 [Desulfobulbus propionicus DSM 2032] gi|320120896|gb|ADW16442.1| hypothetical protein Despr_0256 [Desulfobulbus propionicus DSM 2032] Length = 113 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 28/110 (25%), Positives = 46/110 (41%), Gaps = 12/110 (10%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTK--LNTGRTSES---GKDIIYIGLEDVESG 59 AYP+YKDSGVQW+G +P+HW++ PIK + G + + ++ E + +G Sbjct: 4 PAYPRYKDSGVQWLGEVPEHWEIRPIKAIVSTPVTDGPHETPEIFDEGVPFVSAEAISNG 63 Query: 60 TGKYLPKDGNSRQSDTSTV---SIFAKGQI----LYGKLGPYLRKAIIAD 102 + G D G I ++ L R A++ Sbjct: 64 KINFNKIRGYISAEDHRKYSRKYRPEFGDIQSSAIFTWLNLVQRPAVLRW 113 >gi|271968781|ref|YP_003342977.1| Restriction endonuclease S subunits-like protein [Streptosporangium roseum DSM 43021] gi|270511956|gb|ACZ90234.1| Restriction endonuclease S subunits-like protein [Streptosporangium roseum DSM 43021] Length = 402 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 63/420 (15%), Positives = 127/420 (30%), Gaps = 40/420 (9%) Query: 22 PKHWKVVPIKRFT----KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P+ W ++ + + E+ YI + ++ G S Sbjct: 2 PRGWPLLELSKVGVQVHDCEHRTPPEAETGYPYIAIPNIVDGRLDLTQVRLISTSDLEEW 61 Query: 78 VSIFAK--GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 ++ + G A+I D Q LV+ + + ++ Sbjct: 62 NRRTKPIADDVIITRRGRVGDSAVIPDDLECAIGQNLVILRSSGMDVNQKYLRWAVRGKY 121 Query: 136 RI----EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 I G+ + + I + +P+PP+ Q++I E + A +I Sbjct: 122 WESEVERLINVGSIFDSLNVRDIARMRIPVPPMQFQLVIAEVLGALDDKIAANKRTAATA 181 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV-GLVPDHWEVKPFFALVTELNRKNTK 250 +EL K S + D+ W+ G P E Sbjct: 182 LELASAKY----SAAAAMSADWCTVTLDAAARWLSGGTPKTSE---------------PD 222 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 +I +S ++ + E + +IV G ++F Sbjct: 223 YWNGDIPWISALSLKSPWIDDSDRKLTEVGARSGTRIVPSGSVIFVVRGSSLKTEFRVGI 282 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRS--YDLCKVFYAMGSGLRQSLKFEDVKRLP 366 E + IDS L +RS ++ + G L + + +L Sbjct: 283 TQREVAFGQDCKALIAAESIDSHVLFHAIRSRTPEIMAMVDETSIGA-GRLSTDLISKLD 341 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + VP K Q N E +D + + ++ +L R + + ++G++ +R Q Sbjct: 342 IRVP--KHQK---NKTADELRSLDEVAARCQKESRILAALRDTLLPQLMSGKLCVRDAEQ 396 >gi|168485836|ref|ZP_02710344.1| putative type I restriction-modification system, S subunit [Streptococcus pneumoniae CDC1087-00] gi|183571015|gb|EDT91543.1| putative type I restriction-modification system, S subunit [Streptococcus pneumoniae CDC1087-00] Length = 373 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 41/396 (10%), Positives = 117/396 (29%), Gaps = 31/396 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++P + +++ + + L +K++ + +PP+ Q + Sbjct: 282 MVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFA 340 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + A++D I++S+ L+ + S + Sbjct: 341 DFV----AQVDKSQLAIQKSLEELETLKKSLMQEYF 372 >gi|262165310|ref|ZP_06033047.1| type I restriction-modification system specificity subunit S [Vibrio mimicus VM223] gi|262025026|gb|EEY43694.1| type I restriction-modification system specificity subunit S [Vibrio mimicus VM223] Length = 498 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 61/183 (33%), Gaps = 4/183 (2%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLI---ESNILSLSYGN-IIQKLETRNMGLKPE 278 +++ +P W + + E+ I L N K++ ++ + Sbjct: 96 DYLFDIPSGWSWERLGNVGETNIGLTYSPKDAGETGIPVLRSANIQKGKIDLSDLVRVQK 155 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + +V+ G+++ + + + + + Y+ + Sbjct: 156 EVKYSVLVEVGDLLICARNGSKALVGKTAQICELKEPMAFGAFMAIFRSCINDYIEVFLN 215 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 S K + + + +++ +PP++EQ I ++ A D L ++ E Sbjct: 216 SPVYRKNLEGVSTTTINQITQSNLRSTICPIPPVEEQHRIVAKVDELMALCDQLEQQTED 275 Query: 399 SIV 401 S+ Sbjct: 276 SLD 278 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 63/192 (32%), Gaps = 11/192 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP W + + N G T I + +++ G ++ Sbjct: 100 DIPSGWSWERLGNVGETNIGLTYSPKDAGETGIPVLRSANIQKGKIDLSDLVRVQKEVKY 159 Query: 76 STVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S + G +L + A I + + + + + + ++ +L S Sbjct: 160 S--VLVEVGDLLICARNGSKALVGKTAQICELKEPMAFGAFMAIFRSCINDYIEVFLNSP 217 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +E + T++ + + PIPP+ EQ I K+ D L + Sbjct: 218 VYRKNLEGVST-TTINQITQSNLRSTICPIPPVEEQHRIVAKVDELMALCDQLEQQTEDS 276 Query: 192 IELLKEKKQALV 203 ++ + + L+ Sbjct: 277 LDAHQVLVETLL 288 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 15/118 (12%), Positives = 34/118 (28%), Gaps = 8/118 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P+ W+ G+ + K+ + Y+ +V+ + Sbjct: 384 ELPEGWEWCRFGDVAISRLGKMLDKSKNLGNPLPYLRNTNVQWHRFDLEDIKRMKIEDAE 443 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G +L + G R AI D ST+ + + + + Sbjct: 444 KEEFLVLPGDLLICEGGEPGRCAIWKDD----STEMYFQKAYIEQEHWVAAYPSIYNF 497 >gi|145589315|ref|YP_001155912.1| restriction modification system DNA specificity subunit [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145047721|gb|ABP34348.1| restriction modification system DNA specificity domain protein [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 556 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 73/193 (37%), Gaps = 5/193 (2%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESG-----TGKYLPKDGNSRQSDT 75 +P W ++ +++TG+T ++ YIG + L D + Sbjct: 74 VPSGWVWKSLREVGRVSTGKTPDTRNSNFYIGTTPFIGPGQLSMNHRILKSDKFISKEAE 133 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 SI G IL +G + K+ IA + Q +Q D E + L + + Sbjct: 134 LNTSIALPGSILMVCIGGSIGKSAIATHRVAFNQQINAIQTTDCNVEFIHMCLRAKFFLE 193 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ + G+ + NI +PIPP+ +Q I EK+ A D L + ++ Sbjct: 194 RVHQLSSGSATPIINKSRWENIQIPIPPIGQQNKIVEKVNALMQLCDQLERDALKKEIFH 253 Query: 196 KEKKQALVSYIVT 208 +S ++ Sbjct: 254 DNLVIHFMSLLLR 266 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 58/196 (29%), Gaps = 14/196 (7%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI-----------IQK 267 S + + +P+ W L + N +K S ++ G I Sbjct: 349 QSDKKGLHQIPESWSWIRLSELASFENGDRSKNYPSRDQFVAAGMAFINAGHLQEEGIDY 408 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + ++ + G+I+F +++ + I +S + Sbjct: 409 SNMNFIDVETYDNLRSGKIKEGDILFCLRGSLGKFAIVKNGETG--AIASSLVIIRPFAP 466 Query: 328 IDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 YL S + +L D+ V +PP+ EQ I + Sbjct: 467 EIVDYLGIYFSSTLAKDQILKFDNGTAQPNLAGADLGHFQVPLPPLSEQKAIVASLKRLL 526 Query: 387 ARIDVLVEKIEQSIVL 402 A D L E ++ L Sbjct: 527 ALCDQLSESFSKARQL 542 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 66/202 (32%), Gaps = 17/202 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII-----------YIGLEDVESGTGKYLPKD- 67 IP+ W + + G + K+ +I ++ Y + Sbjct: 357 QIPESWSWIRLSELASFENG---DRSKNYPSRDQFVAAGMAFINAGHLQEEGIDYSNMNF 413 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQ 125 + D +G IL+ G + AI+ + + I S+ ++ + + L Sbjct: 414 IDVETYDNLRSGKIKEGDILFCLRGSLGKFAIVKNGETGAIASSLVIIRPFAPEIVDYLG 473 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + S +I G + +G+ +P+PPL+EQ I + D L Sbjct: 474 IYFSSTLAKDQILKFDNGTAQPNLAGADLGHFQVPLPPLSEQKAIVASLKRLLALCDQLS 533 Query: 186 TERIRFIELLKEKKQALVSYIV 207 + +L + V + Sbjct: 534 ESFSKARQLECMLADSFVDQAL 555 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 68/216 (31%), Gaps = 21/216 (9%) Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLV--------PDHWEVKPFFALVTELNRK-----N 248 ++ V+ L S E + P W K + K N Sbjct: 40 ILQLAVSGRLTTAANSLRSANENLSDQSKAEPFIVPSGWVWKSLREVGRVSTGKTPDTRN 99 Query: 249 TKLIESNILSLSYGN--IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + G + ++ + + E+ I PG I+ I K ++ Sbjct: 100 SNFYIGTTPFIGPGQLSMNHRILKSDKFISKEAELNTSIALPGSILMVCIGGSIGKSAIA 159 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRL 365 R A++ + ++ +R+ + + + SG + + + Sbjct: 160 ----THRVAFNQQINAIQTTDCNVEFIHMCLRAKFFLERVHQLSSGSATPIINKSRWENI 215 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLV-EKIEQSI 400 + +PPI +Q I +N D L + +++ I Sbjct: 216 QIPIPPIGQQNKIVEKVNALMQLCDQLERDALKKEI 251 >gi|323441772|gb|EGA99415.1| hypothetical protein SAO46_2327 [Staphylococcus aureus O46] Length = 227 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 24/218 (11%), Positives = 69/218 (31%), Gaps = 14/218 (6%) Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q + S + D D + +G + + ++ ++ + Sbjct: 17 YMQKIFSQELRFKDENDEDYPDWKEKKLGDITE-------QSMYGIGASATRFDSKNIYI 69 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL-RSAQVMERGII 316 ++ + + P+ + +I+F K + + + + Sbjct: 70 RITDIDEKSRKLNYQNLTTPDELNNKYKLKRNDILFARTGASTGKSYIHKEEKDIYNYYF 129 Query: 317 TSAYMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + + ++ + S V + + E+ +LP+++P E Sbjct: 130 AGFLIKFEIDEQNNPLFIYQFTLTSKYNKWVKVMSVRSGQPGINSEEYAKLPLVLPNKLE 189 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 Q I ++ R D +E +Q I +L++++ + Sbjct: 190 QQKIAEFLD----RFDQQIELEKQKIEILQQQKKGLLQ 223 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 66/190 (34%), Gaps = 11/190 (5%) Query: 24 HWKVVPIKRFTKLNT---GRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 WK + T+ + G ++ IYI + D++ + K ++ + + Sbjct: 38 DWKEKKLGDITEQSMYGIGASATRFDSKNIYIRITDIDEKSRKLNYQNLTTPDELNNKYK 97 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL------VLQPKDVLPELLQGWLLSIDV 133 + + IL+ + G K+ I + + + P + + L+ Sbjct: 98 L-KRNDILFARTGASTGKSYIHKEEKDIYNYYFAGFLIKFEIDEQNNPLFIYQFTLTSKY 156 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ + + + + +P+ +P EQ I E + +I+ + + Sbjct: 157 NKWVKVMSVRSGQPGINSEEYAKLPLVLPNKLEQQKIAEFLDRFDQQIELEKQKIEILQQ 216 Query: 194 LLKEKKQALV 203 K Q++ Sbjct: 217 QKKGLLQSMF 226 >gi|150391749|ref|YP_001321798.1| restriction modification system DNA specificity subunit [Alkaliphilus metalliredigens QYMF] gi|149951611|gb|ABR50139.1| restriction modification system DNA specificity domain [Alkaliphilus metalliredigens QYMF] Length = 383 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 54/395 (13%), Positives = 115/395 (29%), Gaps = 28/395 (7%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 IK K G T + + + I + + SD +I ++ Sbjct: 14 IKNIYKRVKG-TPITAEKMHKIKSATGTIRVFAGGATEIKANVSDLPNANIINVPVVIVQ 72 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 G I + + + L +V A ++ S Sbjct: 73 SRGVI--DFIYCNEPCTFKNEMWGYTSAGAYEVKFLFYYLKHNVDYFRNAGDGRSSFSQI 130 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 +P+ P EQ I + I + L EKK+A+ + Sbjct: 131 SLPVTEEYKIPLIPSNEQQAIASVLSDFDEHITN--------LTELIEKKKAIRDGALED 182 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 ++ ++ EW + + L I + Sbjct: 183 LVSGRTRLDGFDGEW--------VNVKLSDFAQINPS-SPLPESFKYVDLESVKGISLVN 233 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 R + ++ G+I F+ + L ++ ++ + S A + Sbjct: 234 WRVESKETAPSRAKRLAQHGDIFFQTVRPYQRNNYL--YELPDKDFVFSTGYAQIRTENN 291 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVP-PIKEQFDITNVINVETA 387 + +L L+R +G ++ + + + VP I EQ I +++ Sbjct: 292 AGFLFLLLRQDVFVNEVIDNCTGTSYPAINPSKLADINIYVPVDICEQQAIASILTSMDE 351 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 I+ L + + I + R + +TG++ L+ Sbjct: 352 EIESLETEKSKMI----QIREGAMDELLTGRVRLK 382 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 75/192 (39%), Gaps = 8/192 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W V + F ++N +S + Y+ LE V+ G + + + + + Sbjct: 196 EWVNVKLSDFAQINP--SSPLPESFKYVDLESVK-GISLVNWRVESKETAPSRAKRLAQH 252 Query: 84 GQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G I + + PY R + D D + ST + ++ + L L + Sbjct: 253 GDIFFQTVRPYQRNNYLYELPDKDFVFSTGYAQIRTE-NNAGFLFLLLRQDVFVNEVIDN 311 Query: 141 CEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 C G + + + +I + +P + EQ I + + I++L TE+ + I++ + Sbjct: 312 CTGTSYPAINPSKLADINIYVPVDICEQQAIASILTSMDEEIESLETEKSKMIQIREGAM 371 Query: 200 QALVSYIVTKGL 211 L++ V + Sbjct: 372 DELLTGRVRLKI 383 >gi|313123733|ref|YP_004033992.1| restriction endonuclease s subunits-like protein [Lactobacillus delbrueckii subsp. bulgaricus ND02] gi|312280296|gb|ADQ61015.1| Restriction endonuclease S subunits-like protein [Lactobacillus delbrueckii subsp. bulgaricus ND02] Length = 381 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 57/394 (14%), Positives = 126/394 (31%), Gaps = 28/394 (7%) Query: 31 KRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + G ++ G I + + + ++ +S S G++ Sbjct: 2 GDVANFSKGTGYSKSDLKGTGSPIILYGRLYTKYETII-RNVDSFVVPKSGSVFSKGGEV 60 Query: 87 LYGKLGPYLRKAII---ADFDGIC--STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G I + GI ++ D+ P L + + + Sbjct: 61 IVPGSGETAEDISIASVVEPAGILLGGDLNIIYPNSDLDPTFLAITISNGKPHFDMARRA 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +G ++ H + +I + P L+EQ I + I ++ + L Q Sbjct: 121 QGKSIVHLHNADLKHISLKTPNLSEQKRISKIFEVLDQTITLHEEKKQQLKCLKSALLQK 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + + G +PDV+ K W + + N K T + L Sbjct: 181 MFANKNKSG-DPDVRFKGFDERW---------ERHILNDLAIFNPKGTLPTSFEYVDLGS 230 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 ++ + + + ++ G++ ++ + L + + S Sbjct: 231 VIGVEMISHKTISKFDAPSRAQRLAQVGDLFYQTVRPYQQNNYL--FDNKDNAYVFSTGY 288 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVP-PIKEQFDIT 379 A ID +L L+++ + +G ++ +D+ ++ V +P KEQ I Sbjct: 289 AQLRPLIDGYFLLCLVQTKSFVRKVMNACTGTSYPAINSQDLAQIGVNIPINSKEQRLIG 348 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 N ID L+ +Q + L + S + Sbjct: 349 N----LYKVIDNLITLYQQKLDDLNTIKQSLLQK 378 >gi|303255883|ref|ZP_07341922.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS455] gi|302597154|gb|EFL64261.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS455] Length = 264 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 3 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 62 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 63 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 122 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 123 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEY 182 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 183 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 212 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 82/214 (38%), Gaps = 13/214 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 9 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 68 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 69 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 128 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPLAEQ I E I + ++D R Sbjct: 129 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNR 188 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS 220 +L KE ++++ Y + L +S Sbjct: 189 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDES 222 >gi|251771131|gb|EES51714.1| DNA polymerase, beta domain protein region [Leptospirillum ferrodiazotrophum] Length = 545 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 56/444 (12%), Positives = 136/444 (30%), Gaps = 42/444 (9%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKY-LPKDG 68 SG++W IP ++ V + + G S + I + ++ Y +P Sbjct: 106 SGMEW-QKIP--FERVLLG---PIRNGIYKPSNFHGRGTKIINMGELFKYPRMYSVPMKR 159 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPE 122 S KG +++ + A + + ++P Sbjct: 160 VDLSLSEGDRSNILKGDLIFARRSLVPAGAGKCSIVLEVQEPTTFESSIIRVRPDQTKSH 219 Query: 123 --LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 L + S ++ I +S K + + +P PPL+EQ I + + Sbjct: 220 SLFLFYYFNSPVGLHSLDTIRRQVAVSGITGKDLARLEVPNPPLSEQRAIAHILGTLDDK 279 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVT---------KGLNPDV---KMKDSGIEWVGLV 228 I+ + + ++ GL ++ +G + Sbjct: 280 IELNRRMNETLEAMAQAIFKSWFVDFDPVRAKMEGRETGLPKEIEDLFPDSFEDSELGEI 339 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P W V+ + +++ Q + + + Sbjct: 340 PRGWRVRSTGEAFELNPSEKLSKGKNSPYLDMSAIPTQG--SWPESPIYRPFVSGSKFRN 397 Query: 289 GEIVFRFIDL---QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM---RSYDL 342 G+ +F I ++ + + G ++ ++ ++P +L+ ++ Sbjct: 398 GDTLFARITPCLENGKTAYIQCLEEEQVGWGSTEFIVIRPKAPFPKEFGYLLARDNAFRE 457 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + G+ RQ ++ + + +L P I V + L++ + I Sbjct: 458 HAIQSMSGTSGRQRVQLDSIAAFKILQPE----ARILKAFEVIIRQWFELIKVNSEFIAG 513 Query: 403 LKERRSSFIAAAVTGQIDLRGESQ 426 + R + + +TG+I + G + Sbjct: 514 FNQMRDALLPKLLTGEIRVSGPEK 537 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 69/207 (33%), Gaps = 17/207 (8%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 K SG+EW + + + P + K + ++ G + + + Sbjct: 99 KMWKKLGSGMEWQKIPFERVLLGPIRNGI----YKPSNFHGRGTKIINMGELFKYPRMYS 154 Query: 273 MGLKPESYE----TYQIVDPGEIVFRFIDLQNDKRSLRSA--QVMERGIITSAYMAVKPH 326 + +K + G+++F L S +V E S+ + V+P Sbjct: 155 VPMKRVDLSLSEGDRSNILKGDLIFARRSLVPAGAGKCSIVLEVQEPTTFESSIIRVRPD 214 Query: 327 GI--DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 S +L + S + + +D+ RL V PP+ EQ I +++ Sbjct: 215 QTKSHSLFLFYYFNSPVGLHSLDTIRRQVAVSGITGKDLARLEVPNPPLSEQRAIAHILG 274 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSF 410 +D +E + L+ + Sbjct: 275 T----LDDKIELNRRMNETLEAMAQAI 297 >gi|323358028|ref|YP_004224424.1| restriction endonuclease S subunits [Microbacterium testaceum StLB037] gi|323274399|dbj|BAJ74544.1| restriction endonuclease S subunits [Microbacterium testaceum StLB037] Length = 392 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 53/412 (12%), Positives = 126/412 (30%), Gaps = 41/412 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ VP+ + G T + G + + ++++S T Sbjct: 3 WETVPLGEVARFVRGVTYKPGDVVANGADGVACLRTKNIQS-TLDLTDLVCVRSDLKHRV 61 Query: 78 VSIFAKGQILYGKLGPY---LRKA-IIADFDGICSTQFL---VLQPKDVLPELLQGWLLS 130 + +L + R AD +G+ F+ +P+D+ P W S Sbjct: 62 EQRVQEDDVLVSSANSWHLVGRAVQAGADAEGMLIGGFIGGLRFKPEDISPRYGYYWFSS 121 Query: 131 IDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + ++ + + +S+ + +P+P+PPL EQ I + T +T Sbjct: 122 PVIQAKVRSFGQQTTNISNLNVDRTLRLPIPLPPLPEQRRIVAILDEADALRTTAVTATE 181 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 R + + + L P + I + + Sbjct: 182 RVDDARA---------ALFEHLFPSAGEDLTTIGALIESTQYGTSGK--------AGGTG 224 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA- 308 + + +L+ I + + + + + E Y +V G+++F + Sbjct: 225 RFPILRMGNLTARGRIDLRDMKYIDIPDQEVEKY-LVRKGDVLFNRTNSAELVGKTAVYR 283 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLP 366 + + R + Y+A + S + M + ++ +V+ + Sbjct: 284 EDVPRAYAGYLVRLRASDEFIAEYIAGYLNSVHGKRTLRRMAKSIVGMANINAREVQTIR 343 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + P ++ ++ + + E L +S A G+ Sbjct: 344 LPAPSAEKMHAYKAFVDESWSN----TARFESRARELDSLFASLQHRAFRGE 391 >gi|260903739|ref|ZP_05912061.1| type i restriction enzyme EcoR124II specificity protein [Brevibacterium linens BL2] Length = 395 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 52/390 (13%), Positives = 113/390 (28%), Gaps = 23/390 (5%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 P+ N G + K G + G D +I I+ Sbjct: 19 RPLGSLGTRNKGTAMTASKMKTIGGGGPIRVFAGGQTVADVAEDA--IPAKNIVRVPSII 76 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G F + V + + +LL+ ++ A + Sbjct: 77 VKSRGHIGFSYYERPFTHKTELWSYTIDAPGVDQKFVYYYLLTQVEKLQVLARATSVKLP 136 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + +P+PP Q I + T L E L Sbjct: 137 QLSVRDTDTLNVPMPPFEVQREIVRVLDKFTQLEAELEAELDARRTQYDYYAGEL----- 191 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 L D ++ I V + +P +T + ++ + + Sbjct: 192 ---LTIDEGVRRVRIGDVATIVRGASPRPIQKFITSDPEGVPWIKIGDVPADG-----KY 243 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + + E + V PG+ V + + G + ++ Sbjct: 244 ITSTAQRVTIEGAAKSRRVLPGDFVLSNSMSFGRPYVSQIEGCIHDGWLA---ISAFEDS 300 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + YL +L+RS + + F G ++L + V+ + + VPP EQ + ++++ Sbjct: 301 FERDYLYYLLRSTPVQEEFARRAGAGTVKNLNADIVRSVVIPVPPRAEQKRVIDLLDHFD 360 Query: 387 ARIDVLV----EKIEQSIVLLKERRSSFIA 412 A ++ + ++ + R + Sbjct: 361 ALVNDIRIGLPAELAARRKQYEYYRDRLLT 390 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 22/198 (11%), Positives = 60/198 (30%), Gaps = 9/198 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 I + + V I + G + + + +I + DV + Sbjct: 194 IDEGVRRVRIGDVATIVRGASPRPIQKFITSDPEGVPWIKIGDVPADGKYITSTAQRVTI 253 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW-LLSI 131 + G + + R + I + +D + L S Sbjct: 254 EGAAKSRRVLPGDFVLSNSMSFGRPYVSQIEGCIHDGWLAISAFEDSFERDYLYYLLRST 313 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 V + T+ + + + ++ +P+PP AEQ + + + ++ + Sbjct: 314 PVQEEFARRAGAGTVKNLNADIVRSVVIPVPPRAEQKRVIDLLDHFDALVNDIRIGLPAE 373 Query: 192 IELLKEKKQALVSYIVTK 209 + +++ + ++T Sbjct: 374 LAARRKQYEYYRDRLLTF 391 >gi|148544103|ref|YP_001271473.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri DSM 20016] gi|184153473|ref|YP_001841814.1| hypothetical protein LAR_0818 [Lactobacillus reuteri JCM 1112] gi|325682357|ref|ZP_08161874.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM4-1A] gi|148531137|gb|ABQ83136.1| restriction modification system DNA specificity domain [Lactobacillus reuteri DSM 20016] gi|183224817|dbj|BAG25334.1| conserved hypothetical protein [Lactobacillus reuteri JCM 1112] gi|324978196|gb|EGC15146.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM4-1A] Length = 375 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 44/398 (11%), Positives = 112/398 (28%), Gaps = 40/398 (10%) Query: 30 IKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGKYLPKDGN-SRQSDTSTVSIF 81 + ++ G+ G + Y+ + D + + + + Sbjct: 6 LGDIAEIKGGKRMPKGTRLQQEKNQHPYLRITDYDGKSFDRNSIRYVPDEVFEKISNYTV 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +G I +G I + ++ + V + + +L S+ +++ Sbjct: 66 TEGDIFLSIVGTIGIATTIDKEYDNANLTENAVKIIPDESVNSKYILYFLQSMLGQRQMN 125 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G+T K I I + +P L Q + + +I LL Sbjct: 126 ELSVGSTQKKLPIKNIKKIKILLPNLEIQNKVVSNLQILDKKIALNNQINDNLDALLTNI 185 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + G + + + + E ++ Sbjct: 186 FKKYM-------------------INDGFEKSNLTQIANYKNGLAMQKYRPNSNEESLPV 226 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 L + Q + + + IV+ G+I+F + L ++ + Sbjct: 227 LKIKELNQGNTDDSSDRCSANLDNSVIVNTGDIIFSWSGTL-----LVKNWTGDKAGLNQ 281 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFD 377 V + + ++ + + L A G +K D+K V +P Sbjct: 282 HLFKVTSNKYPAWFIYEWTKYHLLRFQAIAAGKATTMGHIKRSDLKSSLVYIPS----QL 337 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + A I + + L + + + + Sbjct: 338 FLAKMDSQLAPIYSQRLNLIKENQQLSKLKQTLLKKYF 375 Score = 40.2 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 58/189 (30%), Gaps = 10/189 (5%) Query: 21 IPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 I ++ + + G R + + + + + ++++ G + ++ Sbjct: 191 INDGFEKSNLTQIANYKNGLAMQKYRPNSNEESLPVLKIKELNQGN---TDDSSDRCSAN 247 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 I G I++ G L K D G + + + W + Sbjct: 248 LDNSVIVNTGDIIFSWSGTLLVKNWTGDKAG-LNQHLFKVTSNKYPAWFIYEWTKYHLLR 306 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + A + TM H + + + IP + ++ + LI E + +L Sbjct: 307 FQAIAAGKATTMGHIKRSDLKSSLVYIPSQLFLAKMDSQLAPIYSQRLNLIKENQQLSKL 366 Query: 195 LKEKKQALV 203 + + Sbjct: 367 KQTLLKKYF 375 >gi|331671459|ref|ZP_08372257.1| putative toxin-antitoxin system, toxin component [Escherichia coli TA280] gi|331071304|gb|EGI42661.1| putative toxin-antitoxin system, toxin component [Escherichia coli TA280] Length = 457 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 44/449 (9%), Positives = 117/449 (26%), Gaps = 57/449 (12%) Query: 25 WKVVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST-V 78 W V + ++ T + + +I ++ + K + Sbjct: 5 WIEVSLGEISEKIGDGIHGTPTYNNSGNYYFINGSNLIDNSIKITETTKCVDHDEYLKHR 64 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRI 137 + +L G A+ + + I + KD + + ++LS + I Sbjct: 65 KKLSNNTVLVSINGTIGNTALYNNENIILGKSACYINLKDNISKHFILYVLSGYLFQEYI 124 Query: 138 EAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G+T+ + K + + +P EQ + I + ++ + Sbjct: 125 QRCSTGSTIKNVSLKMMRDFRFLMPESKEEQEKVVRIIQKIDELKRLNNAQNQTLEQMSQ 184 Query: 197 EKKQA-------LVSYIVTKGLNPDVKMKDSGIE-------------------------- 223 ++ ++ + G NP + S E Sbjct: 185 ALFKSWFVDFDPVIDNALDAG-NPIPETLQSRAELRQNVRNSTDFKPLPAEIRSLFPSEF 243 Query: 224 ---WVGLVPDHWEVKPFFALVT------ELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 +G VP WE + F + +RK T + + ++ + Sbjct: 244 EETELGWVPKGWESETFDSFCDLIQSGGTPSRKETSFWDGGTIKWLSSGEVKGKIILDTK 303 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 K + + + + ++ A + + Sbjct: 304 EKITDIGLLNSSSKLWEKYTTVVAMYGATAGEVCIIGDKMAANQACCGLYSKIF--PFFV 361 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + ++ +Q+L + + P I + + + Sbjct: 362 YNFVCNKANELASKATGSAQQNLNKLIISTTKFICPSND----IITIFEDNVTPLFMKWF 417 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 L R + + ++G++ + Sbjct: 418 SNSSENNTLIALRDTLLPKLISGELSVED 446 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 55/195 (28%), Gaps = 11/195 (5%) Query: 18 IGAIPKHWKVVPIKRFTK-LNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGN 69 +G +PK W+ F + +G T + I ++ +V+ + Sbjct: 248 LGWVPKGWESETFDSFCDLIQSGGTPSRKETSFWDGGTIKWLSSGEVKGKIILDTKEKIT 307 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 S+ ++ K + G + I + L K + Sbjct: 308 DIGLLNSSSKLWEKYTTVVAMYGATAGEVCIIGDKMAANQACCGLYSKIF---PFFVYNF 364 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + G+ + + I P + + + ++ + +E Sbjct: 365 VCNKANELASKATGSAQQNLNKLIISTTKFICPSNDIITIFEDNVTPLFMKWFSNSSENN 424 Query: 190 RFIELLKEKKQALVS 204 I L L+S Sbjct: 425 TLIALRDTLLPKLIS 439 >gi|282881941|ref|ZP_06290586.1| type I restriction-modification system methyltransferase subunit [Peptoniphilus lacrimalis 315-B] gi|281298216|gb|EFA90667.1| type I restriction-modification system methyltransferase subunit [Peptoniphilus lacrimalis 315-B] Length = 983 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 48/394 (12%), Positives = 111/394 (28%), Gaps = 39/394 (9%) Query: 26 KVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 ++V + G + + I ++ + K +S Sbjct: 602 EMVKLGNIATFIRGISFPKKAQKDQADDLLNVITTRAAQADGIDF-KKVVYIEKSYAKPD 660 Query: 79 SIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + K IL R + + + + + ++ +L I + Sbjct: 661 KMVFKEDILISLANSLELVGRVTYVDENYKDATFGAFMGVIRVNYQKVHPMYLFHILNSI 720 Query: 136 RIEAICE-----GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + +S+ ++ +GN+ +P+P L Q+ I +++ I Sbjct: 721 EAKKYFRAVAKTTTNISNITFEDLGNLVLPLPRLDYQLKIIDELNRYQEMIVGAKKIVNN 780 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++ L + + + + L + G P + + Sbjct: 781 YLPKLPSYEIVVSTSLNDSELFEIMS---------GGTPSTKNP--------DYWGGDIS 823 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I L R + K + +++ G IV ++ Sbjct: 824 WITLADLPQEDYVTTIDKSVRTITKKGLDNSSAKMLPVGAIVVSTRATIGRVGIVKHPLA 883 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 +G + KP + +LA L++ F A + + + ++ V +P Sbjct: 884 TNQGFKN--VIIKKPDVVIPEFLALLLKEKTEEMEFLA-SGATFKEISKFNFGKIKVELP 940 Query: 371 PIKEQFDITNVI---NVETARIDVLVEKIEQSIV 401 + EQ I I ++E E I Sbjct: 941 SLDEQKRILVKIHEEESFVKPAKKVIEVFEDKID 974 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 52/144 (36%), Gaps = 6/144 (4%) Query: 266 QKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY---M 321 ++ + + +SY ++V +I+ + + + A+ + Sbjct: 642 DGIDFKKVVYIEKSYAKPDKMVFKEDILISLANSLELVGRVTYVDENYKDATFGAFMGVI 701 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDIT 379 V + YL ++ S + K F A+ ++ FED+ L + +P + Q I Sbjct: 702 RVNYQKVHPMYLFHILNSIEAKKYFRAVAKTTTNISNITFEDLGNLVLPLPRLDYQLKII 761 Query: 380 NVINVETARIDVLVEKIEQSIVLL 403 + +N I + + + L Sbjct: 762 DELNRYQEMIVGAKKIVNNYLPKL 785 >gi|300865293|ref|ZP_07110106.1| hypothetical protein OSCI_1610001 [Oscillatoria sp. PCC 6506] gi|300336694|emb|CBN55256.1| hypothetical protein OSCI_1610001 [Oscillatoria sp. PCC 6506] Length = 304 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 31/228 (13%), Positives = 76/228 (33%), Gaps = 15/228 (6%) Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN- 255 +KQ L+ ++ ++ +G WE K L N N Sbjct: 78 RRKQELLQTYKRGVMHKIFSLEIRFKGAIGSEFPDWEEKRLDELGEFKNGFNADKSSFGD 137 Query: 256 -ILSLSYGNIIQKLETRNMGLKPESYETY----QIVDPGEIVFRFIDLQNDKRS--LRSA 308 + ++ +I K E + + L+ + + G+++F ++ + Sbjct: 138 GVEFVNLMDIFGKSEIKKIPLERVQISSKQVEQYKIKKGDVLFVRSSVKREGVGQPCLVN 197 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRL 365 E + + + + + +L + S + K ++ S ++ E + + Sbjct: 198 DDFEDTVYSGFIIRFREKSSELCHLYKKYCFSSLEFRKELLSLATSSANTNINQESLSAI 257 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + P KEQ IT + +I+ L + I ++ + + Sbjct: 258 ILFYPCKKEQEKITGFLTAMDRKIETL----SRQIDQTEQFKKGLLQK 301 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 43/106 (40%), Gaps = 6/106 (5%) Query: 320 YMAVKPHGIDSTYLAWLMRS--YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 ++ + + + +R L ++ G+G ++L ++ L + +P I EQ Sbjct: 3 FLPKQNRASLKFVILFFLRERGKYLLELASPGGAGRNKTLGQQNFAGLEITLPKIAEQEK 62 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 I + + R+ ++ + LL+ + + + +I +G Sbjct: 63 IASFLGAVDRRL----AQLRRKQELLQTYKRGVMHKIFSLEIRFKG 104 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 70/196 (35%), Gaps = 16/196 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESG-TGKYLPKDGNSRQSDTSTVS 79 W+ + + G ++ G + ++ L D+ K +P + S Sbjct: 112 DWEEKRLDELGEFKNGFNADKSSFGDGVEFVNLMDIFGKSEIKKIPLERVQISSKQVEQY 171 Query: 80 IFAKGQILYGKL-----GPYLRKAIIADFDGICSTQFLVLQ---PKDVLPELLQGWLLSI 131 KG +L+ + G + DF+ + F++ ++ + S+ Sbjct: 172 KIKKGDVLFVRSSVKREGVGQPCLVNDDFEDTVYSGFIIRFREKSSELCHLYKKYCFSSL 231 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + ++ + ++ + + + I + P EQ I + +D I R Sbjct: 232 EFRKELLSLATSSANTNINQESLSAIILFYPCKKEQEKITGFL----TAMDRKIETLSRQ 287 Query: 192 IELLKEKKQALVSYIV 207 I+ ++ K+ L+ + Sbjct: 288 IDQTEQFKKGLLQKMF 303 >gi|295091337|emb|CBK77444.1| Restriction endonuclease S subunits [Clostridium cf. saccharolyticum K10] Length = 397 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 45/406 (11%), Positives = 109/406 (26%), Gaps = 58/406 (14%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---STV 78 P + + ++ G+ ++ G+Y + Sbjct: 13 PDGVEYRKVGDIANISRGKVMSKDF---------LKENAGEYPVYSSQTENEGKLGSINT 63 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 ++ + + G + V+ ++ + + + Sbjct: 64 YMYDGEYLTWTTDGANAGTVFFRSGKFSVTNVCGVIDNTSEDVDIKYLYYVLN--REAPS 121 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G + I +P+PPL Q I + + T+ L E + + Sbjct: 122 YVNSGMGNPKLMSNVMARISLPVPPLEIQREIVRVLDSFTLLTAELTAELTARKKQYEFY 181 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + L+S G + E +G + K + + ++ Sbjct: 182 RDKLLS--FDIG---------TRFEKLGDTCNMKAGKAILSA------RISEKPSKITPY 224 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 +G + ++ E F I Q + + Sbjct: 225 KCFGGNGVRGYVSDVSHHGE--------------FPIIGRQGALCGNVNYATGDFYATEH 270 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 A + +L L+ + +L + G + L ++++ L VP + Q + Sbjct: 271 AVVVESKGAYLQRFLYHLLTAMNLNQY---KSQGAQPGLAVKNLENLIAPVPKLDVQERL 327 Query: 379 TNVINVETARIDVL-------VEKIEQSIVLLKERRSSFIAAAVTG 417 V++ + L +E ++ R + A TG Sbjct: 328 VRVLDNFESICTDLNIGLPAEIEARQKQYEY---YRDLLLTFAETG 370 >gi|295135270|ref|YP_003585946.1| hypothetical protein ZPR_3434 [Zunongwangia profunda SM-A87] gi|294983285|gb|ADF53750.1| conserved hypothetical protein [Zunongwangia profunda SM-A87] Length = 383 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 56/380 (14%), Positives = 114/380 (30%), Gaps = 44/380 (11%) Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKL---GPYLRKAII--ADFDGICSTQFLVL---Q 115 +P N ++ S I GQ ++ Y + + I S + V Sbjct: 1 MPSVANVVGTNLSRYLIVEPGQFACNRMHVGRDYRIPVALSEKEKPFIVSPAYDVFEIKD 60 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 P +LPE L W + + + W +I +P+P + +Q I Sbjct: 61 PSILLPEYLMMWFRRAEFDRNAWFYTDADVRGGLAWDAFCSIELPVPSIEKQREIAR--- 117 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS---------GIEWVG 226 E + I + L+E QAL + P+ + K E Sbjct: 118 -EYNVVKNRIKLNEEINQKLEETAQALYKHWFVDFEFPNTEGKPYKSFGGKLIYNEELDR 176 Query: 227 LVPDHWEVKPFFALVTELNR-------KNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 +P+ W + + K + + + K + Sbjct: 177 EIPEGWIASSIDEICDIQDGDRGKNYPKKEEFSDDGYCLFLNAGNVTKSGFDFSNNSFVN 236 Query: 280 YETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E +++ G ++V + E I S + ++ + S +L Sbjct: 237 KEKDELLRKGKLKRKDVVMTTRGTVGNIGYYNDKLDFENVRINSGMVILR-NPKISFFLY 295 Query: 335 WLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR---ID 390 M+S ++ + + L D+KR+ +P +N+I ++ + Sbjct: 296 TKMKSAEMKDLIMNHLSGSAQPQLPITDIKRMEFPLP-----RKGSNLIEKFNSKVTPLQ 350 Query: 391 VLVEKIEQSIVLLKERRSSF 410 ++ I L + S Sbjct: 351 NSIDDKNLQIRYLNQL-QSL 369 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 23/200 (11%), Positives = 60/200 (30%), Gaps = 15/200 (7%) Query: 20 AIPKHWKVVPIKRFTKLN---TGRTSESGKDI------IYIGLEDVESGTGKYLPKDGNS 70 IP+ W I + G+ ++ +++ +V + + Sbjct: 177 EIPEGWIASSIDEICDIQDGDRGKNYPKKEEFSDDGYCLFLNAGNVTKSGFDFSNNSFVN 236 Query: 71 RQSDTSTVS-IFAKGQILYGKLGPYLRKAIIADFDGICSTQF---LVLQPKDVLPELLQG 126 ++ D + ++ G D + + +V+ + L Sbjct: 237 KEKDELLRKGKLKRKDVVMTTRGTVGNIGYYNDKLDFENVRINSGMVILRNPKISFFLYT 296 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + S ++ I G+ I + P+P + EK ++ + I Sbjct: 297 KMKSAEMKDLIMNHLSGSAQPQLPITDIKRMEFPLPRKGS--NLIEKFNSKVTPLQNSID 354 Query: 187 ERIRFIELLKEKKQALVSYI 206 ++ I L + + +S + Sbjct: 355 DKNLQIRYLNQLQSLFLSKM 374 >gi|110639720|ref|YP_679930.1| type I site-specific deoxyribonuclease S subunit [Cytophaga hutchinsonii ATCC 33406] gi|110282401|gb|ABG60587.1| type I site-specific deoxyribonuclease S subunit [Cytophaga hutchinsonii ATCC 33406] Length = 354 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 54/392 (13%), Positives = 113/392 (28%), Gaps = 60/392 (15%) Query: 23 KHWKVVPIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + W+ + ++ G+ ++ K+ + GL G G + S Sbjct: 8 EEWEEKTLGEICEMQAGKFVSASEIKEQHFDGLFPCYGGNGLRGYTKSYNYDGKYS---- 63 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 L G+ G A+ + +V+ P + + + +LL+ + Sbjct: 64 ------LIGRQGALCGNVNFANGKFHATEHAVVVTPLNGINTVWMFYLLTNL---NLNQF 114 Query: 141 CEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + + IP + EQ I + RI T L+K Sbjct: 115 ATGMAQPGLSVQNLEKVESTIPKAIDEQEKIASFLTLIDGRISTQNKIIKELELLIKSIS 174 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 Q + H +L + K + I S++LS Sbjct: 175 QIIFHG-------------------------HRYKFKKASLGSICTIKKGEQINSSVLSE 209 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 S N G+ P Y + I + Sbjct: 210 S-----GLYAVMNGGITPSGYYSQYNCVGNTISISEGGNS---CGYVQFNDKKFWSGGHC 261 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y + + S + + + +++ +D+++ V P I +Q+ I+ Sbjct: 262 YTLSEINAEISNKYLYYFMKFSENLIMSLRVGSGLPNIQKKDLEKFNVAFPEINQQYQIS 321 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 +++ T +I + K ++S I Sbjct: 322 KFLDLLTEKI---------QVE--KSLKTSLI 342 >gi|148998186|ref|ZP_01825655.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP11-BS70] gi|147755829|gb|EDK62873.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP11-BS70] Length = 364 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 82/214 (38%), Gaps = 13/214 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPLAEQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS 220 +L KE ++++ Y + L +S Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDES 296 >gi|57242480|ref|ZP_00370418.1| Type I restriction modification DNA specificity domain, putative [Campylobacter upsaliensis RM3195] gi|57016765|gb|EAL53548.1| Type I restriction modification DNA specificity domain, putative [Campylobacter upsaliensis RM3195] Length = 185 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 56/162 (34%), Gaps = 5/162 (3%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPES-YETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N + ++ + + + LK + Q V G+++ + ++ Sbjct: 18 DNYEFMNYIDIASVSKEIGVIEKMKFLKSDFPSRARQRVFKGDLLISSLSGSQKAIAIVK 77 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLP 366 T ++ +L L+R++ ++ SG S+ ++ L Sbjct: 78 NDEKNLIASTGFFIISNAADCLKEFLMDLLRTHFFQELLMRESSGAIMASINQKEFLNLK 137 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +PP+ EQ I I+ A +++ LL+ + Sbjct: 138 IPLPPLTEQERIAKEISQRKA---NAKALKQEAKELLENAKK 176 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 39/181 (21%), Positives = 68/181 (37%), Gaps = 5/181 (2%) Query: 28 VPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 V + ++NT ++ + + YI + V G + KG + Sbjct: 2 VRLGEIARVNTKLENIDNYEFMNYIDIASVSKEIGVIEKMKFLKSDFPSRARQRVFKGDL 61 Query: 87 LYGKLGPYLRKAIIADFDG---ICSTQ-FLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 L L + I D I ST F++ D L E L L + + + Sbjct: 62 LISSLSGSQKAIAIVKNDEKNLIASTGFFIISNAADCLKEFLMDLLRTHFFQELLMRESS 121 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA M+ + K N+ +P+PPL EQ I ++I L E +E K++ + + Sbjct: 122 GAIMASINQKEFLNLKIPLPPLTEQERIAKEISQRKANAKALKQEAKELLENAKKEVEQI 181 Query: 203 V 203 + Sbjct: 182 I 182 >gi|167767087|ref|ZP_02439140.1| hypothetical protein CLOSS21_01605 [Clostridium sp. SS2/1] gi|167711062|gb|EDS21641.1| hypothetical protein CLOSS21_01605 [Clostridium sp. SS2/1] Length = 425 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 47/403 (11%), Positives = 111/403 (27%), Gaps = 21/403 (5%) Query: 30 IKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKGQ 85 + +++G +S G ++ + V + D G Sbjct: 25 LSELYDMSSGISSTKEQSGHGAPFVSFKTVFNNYFLPEELPDLMDTNEKEQETYSIKMGD 84 Query: 86 ILYGKLGPYLR-----KAIIADFDGICSTQFLVL----QPKDVLPELLQGWLLSIDVTQR 136 + + + + ++ G + F+ + V P+ + + S + Sbjct: 85 VFITRTSETIDELAMSCVAVKNYPGATYSGFIKRLRPKTARIVYPKYMAFYFRSELFRKA 144 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + + + + +P EQV I + + + +I E+ Sbjct: 145 VTNNAFMTLRASFNKDIFTFLDIYLPDYHEQVKIGDMLYSIECKIQKNKKINDYLEEMAN 204 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 N K SG E + + + + N + Sbjct: 205 TIYDYWFVQFDFPDEN-GRPYKSSGGEMTFCKELNQNIPQNWGYTSVGNITVCFDSDRIP 263 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ-VMERGI 315 LS ++ Y I ++ D Q + Sbjct: 264 LSNHQRQEMKGTIPYYGATGIMDYVNCAIFSGDFVLLAEDGSVMDDNGNPILQRISGDVW 323 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I + ++P S L +L+ + ++ + ++ +L P + Sbjct: 324 INNHTHVLQPVNGYSCRLLYLLLKDIPVSMIK--TGSIQMKINQANLNSYNILNIPDGIR 381 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 N I ID + +I++ LK+ R+ + + GQ Sbjct: 382 SRFINQIE----PIDTKIIQIQKENDNLKQIRNWLLPMLMNGQ 420 >gi|315222637|ref|ZP_07864526.1| type I restriction modification DNA specificity domain protein [Streptococcus anginosus F0211] gi|315188323|gb|EFU22049.1| type I restriction modification DNA specificity domain protein [Streptococcus anginosus F0211] Length = 357 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 63/389 (16%), Positives = 123/389 (31%), Gaps = 46/389 (11%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 V + I +GLE + ++ S F+KG + Sbjct: 5 TVKLGDIAIEAKSSNKGDKTGIRIVGLEHLTPSNVTLSSWSDDTEN---SFTKEFSKGDV 61 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICEGA 144 L+G+ YL+KA +A FDGICS V++ V P+LL + + + G+ Sbjct: 62 LFGRRRAYLKKAAVAPFDGICSGDITVIRAIEDKVDPDLLPFIIQNDFLFDFAVGKSAGS 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 W + + +P + EQ + E + + + + EL+ Sbjct: 122 LSPRVKWTHLKEFAIELPSMPEQSKLAETLWSINETKNAYEDLINKTDELV--------- 172 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 K IEW G + E+ + ++E ++ ++ G Sbjct: 173 -------------KSQFIEWFGNEKNTAELGECAFIEKGKIITRDNVVEGDVPVVAAG-- 217 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 ++P Y G I + + + + Sbjct: 218 ----------IEPSCYHNESNRMAGIITVSASGAN--AGYVNYWNMPIFASDCNTVLTKD 265 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + +D +L +R+ GSG + + +D++ + V VP + Q + Sbjct: 266 TNKLDEVFLYHRLRTMQEEIFLMQRGSG-QPHVYAKDLEHIIVPVPNMDAQIRFSAFAEQ 324 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAA 413 D ++++I L IA Sbjct: 325 S----DKSKFALQEAIKDLDALSKKIIAE 349 >gi|26991425|ref|NP_746850.1| type I restriction-modification system, S subunit [Pseudomonas putida KT2440] gi|24986497|gb|AAN70314.1|AE016672_5 type I restriction-modification system, S subunit [Pseudomonas putida KT2440] Length = 576 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 68/496 (13%), Positives = 141/496 (28%), Gaps = 103/496 (20%) Query: 20 AIPKHWKVVPIKRFTK---------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 +P W L G + + ++ + D++ + P+ S Sbjct: 83 ELPTTWIWTSFDDLINPEYPIAYGVLVPG--PDVADGVPFVRIADLDLVAPPHKPEKSIS 140 Query: 71 RQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ--FLVLQPKDVLPELLQGW 127 + D + G+IL G +G + I + + + P + + W Sbjct: 141 PEVDRQYERTRIRGGEILMGVVGSIGKLGIAPESWAGANIARAICRVVPSVHVSKDYIIW 200 Query: 128 LLSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK------------- 173 LL D + ++ + I + P+PPLAEQ I K Sbjct: 201 LLQSDLMRKQFLGDTRTLAQPTLNVGLIRSAAAPLPPLAEQHRIVAKVEELMALCDRLEA 260 Query: 174 ----------------IIAETVRID------------TLITERIRFIELLKEKKQALVSY 205 + + T ID + K+ L+ Sbjct: 261 QQADAESAHVQLVQAMLDSLTQAIDAADFATSWQRLAEHFHTLFTNEFAIDALKKTLLQL 320 Query: 206 IVTKGLNPDVKMKDSGIEWV-------------------------------GLVPDHWEV 234 V L P +S E + +P W+ Sbjct: 321 AVMGKLVPQDVTDESASELLKRIEGEKQRLVNEGLMKKQKPLVESTSGQIKPALPSSWKW 380 Query: 235 KPFFALVTELNRKNTK---------LIESNILSLSYGNIIQKLETRNMGLKPESYET-YQ 284 P + T ++ + +L + ++ L+ N L Sbjct: 381 VPLLDITTGMDSGWSPACLGNSSPSDDVWGVLKTTAVQVMSYLQHENKELPSHLEPRPEA 440 Query: 285 IVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYD 341 G+I+F N + +I+ + P + ++A + + + Sbjct: 441 ETKVGDILFTRAGPMNRVGISCLVESTRPKLMISDKIIRFHPVELGVYGRFVALCLNAGE 500 Query: 342 LCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 K SG+ + ++ E ++ P+ + P++EQ I ++ D L ++I Sbjct: 501 TAKYLEQAKSGMAASQVNISQEKLRLAPIPLAPLREQHRIVTKVDQLMKLCDTLKQQINV 560 Query: 399 SIVLLKERRSSFIAAA 414 + E + +A Sbjct: 561 ARSKQTELLDTLMAQV 576 >gi|329114036|ref|ZP_08242800.1| Putative type-1 restriction enzyme MjaXP specificity protein [Acetobacter pomorum DM001] gi|326696575|gb|EGE48252.1| Putative type-1 restriction enzyme MjaXP specificity protein [Acetobacter pomorum DM001] Length = 439 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 55/431 (12%), Positives = 129/431 (29%), Gaps = 38/431 (8%) Query: 25 WKVVPIKRFTKLNTGR--TSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W + I + G T ++ I++G++ ++ G + + Sbjct: 12 WPLSTISAVADVFDGPHATPKTIDHGAIFLGIDSLDHGRLNLSSTRHVTNEDFKKWTKRV 71 Query: 82 AK--GQILYGKLGPYLRKAIIADFDGICST---QFLVLQPKDVLPELLQGWLLSIDVTQR 136 G I++ AII + C + + + + +S ++ Sbjct: 72 KPEAGDIVFSYETRLGEVAIIPEGLVCCLGRRMALIRTDRSVLNEKFFLYYFMSPQFQEQ 131 Query: 137 IEAI-CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I GAT+ K + + +PPL EQ I + + +ID + Sbjct: 132 IRKNTINGATVDRIPLKEFPSFKLELPPLDEQHTIASILGSLDDKIDLNRRTNETLEAMA 191 Query: 196 KEKKQALV-----SYIVTKGLNPD--VKMKDSGIEWVGL--VPDHWEVKPFFA-LVTELN 245 + + + G P ++ + + + P+ W+ P + + Sbjct: 192 RALFRDWFVDFGPTRAKMAGEAPYLAPELWELFPDRLDDEGNPEGWQSWPLADLAILSKS 251 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 N K ++ E V I+ ++ + + + Sbjct: 252 SINPAQFSDEYFLHFSLPAFDKGMMPDLVKGEEIKSGKFSVSSNSILLSKLNPETPRVWM 311 Query: 306 RSAQVMERG-IITSAYMAVKPHGIDSTYLAWL-MRSYDLCKVFYAMGSGLRQSLKFEDVK 363 +A I ++ +M + P D L + S + M +G +S + Sbjct: 312 VTAHEEPYQRICSTEFMVLNPLQKDWLALIYCACLSQPFRETLQGMVTGTSKSHQRVQ-- 369 Query: 364 RLPVLVPPIKE-QFDITNVINVETARID-------VLVEKIEQSIVLLKERRSSFIAAAV 415 P+ Q + + ++ + D + L + R + + Sbjct: 370 -------PLAVMQTHLLHATDILMRQFDLTAQPLLAKMNFNRNESNTLAQLRDLLLPKLM 422 Query: 416 TGQIDLRGESQ 426 +G+I +R + Sbjct: 423 SGEISIRDAEK 433 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 55/151 (36%), Gaps = 13/151 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 P+ W+ P+ L+ + S + ++ L + G L K + S Sbjct: 234 PEGWQSWPLADLAILSKSSINPAQFSDEYFLHFSLPAFDKGMMPDLVKGEEIKSGKFS-- 291 Query: 79 SIFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLP-ELLQGWLLSID 132 + IL KL P + + + ICST+F+VL P L+ LS Sbjct: 292 --VSSNSILLSKLNPETPRVWMVTAHEEPYQRICSTEFMVLNPLQKDWLALIYCACLSQP 349 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 + ++ + G + SH + + + + Sbjct: 350 FRETLQGMVTGTSKSHQRVQPLAVMQTHLLH 380 >gi|29349951|ref|NP_813454.1| putative type I restriction enzyme specificity protein [Bacteroides thetaiotaomicron VPI-5482] gi|29341862|gb|AAO79648.1| putative type I restriction enzyme specificity protein [Bacteroides thetaiotaomicron VPI-5482] Length = 381 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 70/194 (36%), Gaps = 10/194 (5%) Query: 223 EWVGLVPDHWEVKPFFALV---TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 E+ G +H + + R + L +++ + +I R E Sbjct: 11 EFSGEWEEHTLSEYLEFKNGLNPDAKRIGSGLPFISVMDILSEGVINYDNIRGKVNATEK 70 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLM 337 V G+++F+ + + + R I ++ D + +L+ Sbjct: 71 EIECFGVKDGDLLFQRSSETLEDVGRANVYMDNRTAIYGGFVIRGRKIGNYDPLFFKYLL 130 Query: 338 RSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + K MG+G + ++ E + ++ + P I+EQ I + + ID + Sbjct: 131 ATPLARKRTCRMGAGAQHFNIGQEGLSKISLYFPSIEEQRKIAEFL----SLIDERIATQ 186 Query: 397 EQSIVLLKERRSSF 410 + I LK+ +S+ Sbjct: 187 NKIIEDLKKLKSAI 200 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 44/401 (10%), Positives = 107/401 (26%), Gaps = 48/401 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDV-ESGTGKYLPKDGNSRQ-SDTSTV 78 W+ + + + G ++ G + +I + D+ G Y G Sbjct: 15 EWEEHTLSEYLEFKNGLNPDAKRIGSGLPFISVMDILSEGVINYDNIRGKVNATEKEIEC 74 Query: 79 SIFAKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPK--DVLPELLQGWLLSID 132 G +L+ + R + D F++ K + P + L + Sbjct: 75 FGVKDGDLLFQRSSETLEDVGRANVYMDNRTAIYGGFVIRGRKIGNYDPLFFKYLLATPL 134 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +R + GA + +G+ I + P + EQ I E + RI T Sbjct: 135 ARKRTCRMGAGAQHFNIGQEGLSKISLYFPSIEEQRKIAEFLSLIDERIATQNKIIEDLK 194 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 +L ++ + + K I +G + + +K+ Sbjct: 195 KLKSAISLNVLHS------DKWEQFKIKDIAQIG-------RGRVISSIEIGQQKSPTY- 240 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 S + E + + Sbjct: 241 ----PVYSSQTSNDGIMGYLDDYMFEGEYISW------------TTDGANAGTVFYRNGK 284 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + D+ +++ ++ V + + L + + + +P + Sbjct: 285 FNCTNVCGLLKLRKEFDTHFVSLVLAEATKKYVSINLAN---PKLMNNTMGNIQIRLPKL 341 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I ++ + L + ++ ++ Sbjct: 342 EEQKRI----SIVFRVLQRLWTVHNSLLTEYTKQEQYLLSQ 378 >gi|193070100|ref|ZP_03051046.1| HsdS protein [Escherichia coli E110019] gi|218561524|ref|YP_002394437.1| type I restriction-modification system (hsdS-like) [Escherichia coli S88] gi|4210350|emb|CAA10700.1| HsdS protein [Escherichia coli] gi|192956553|gb|EDV87010.1| HsdS protein [Escherichia coli E110019] gi|218368293|emb|CAR06111.1| type I restriction-modification system (hsdS-like) [Escherichia coli S88] Length = 463 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 58/445 (13%), Positives = 126/445 (28%), Gaps = 50/445 (11%) Query: 20 AIPKHWKVVPIKRFTK---LNTGRTSES---GKDIIYIGLEDVESGTGKYLPK-DGNSRQ 72 +P W + TK ++ G I I + ++++G + Sbjct: 7 KLPLGWNCKKLVDCTKEGNISYGIVQPGQHQEDGIGIIRVNNIQNGNIYIDDVLKVSHEI 66 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLS 130 + G++L +G AI + V++P D + L Sbjct: 67 ESKFAKTRLEGGEVLLTLVGSTGISAITTKALQGWNVARAVAVIKPCDEISAEWIHICLQ 126 Query: 131 IDVTQRI-EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 T+ ++ + K + IP+PIPP E+V + + RI+ I Sbjct: 127 SPFTKYFLDSRANTTVQKTLNLKDVKEIPLPIPPHEERVSLEKIYFNFENRINLNIKINK 186 Query: 190 RFIELLKEKKQALVSYI---VTKGLNPDVK------------------------------ 216 E+ + ++ V L+ Sbjct: 187 ILEEMSQNLFKSWFVDFDPVVDNALDAGNPIPEALQSRAELRQKVRNSADFKPLPAEIRS 246 Query: 217 MKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + S E +G +P W++K + N + + Y +++ + R Sbjct: 247 LFPSEFEETELGWMPKGWQIKSLDHIANFQNGLALQKFRPKNMEDDYLPVLKIADLRAGQ 306 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-YMAVKPHGIDSTYL 333 + + I D ++ + + + Y S Y Sbjct: 307 ITNDERARTDISDSCKVYDGDMIFSWSGTLMIDIWTGGNAALNQHLYKVTSKKYPQSFYF 366 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 W ++ + + +K D+ L+P N++ A+I Sbjct: 367 MWTIQHLSRFQHIAEAKAVTMGHIKKGDLSNSFCLIPTSSLITKYDNIVGGYLAKIKNQR 426 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ 418 Q + R + + ++G+ Sbjct: 427 LLNNQ----MTALRDTLLPKLISGE 447 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 54/194 (27%), Gaps = 12/194 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNS 70 +G +PK W++ + G + + + + D+ +G + Sbjct: 257 LGWMPKGWQIKSLDHIANFQNGLALQKFRPKNMEDDYLPVLKIADLRAGQI----TNDER 312 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 ++D S G +++ G + + + + K W + Sbjct: 313 ARTDISDSCKVYDGDMIFSWSGTLMIDI-WTGGNAALNQHLYKVTSKKYPQSFYFMWTIQ 371 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + A + TM H + N IP + + +I + Sbjct: 372 HLSRFQHIAEAKAVTMGHIKKGDLSNSFCLIPTSSLITKYDNIVGGYLAKIKNQRLLNNQ 431 Query: 191 FIELLKEKKQALVS 204 L L+S Sbjct: 432 MTALRDTLLPKLIS 445 >gi|282933739|ref|ZP_06339094.1| type I restriction modification DNA specificity domain protein [Lactobacillus jensenii 208-1] gi|281302118|gb|EFA94365.1| type I restriction modification DNA specificity domain protein [Lactobacillus jensenii 208-1] Length = 401 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 50/398 (12%), Positives = 134/398 (33%), Gaps = 18/398 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK V ++ ++ G +++ ++ + + + G+ ++ KG Sbjct: 14 WKKVKLEEISERVNG--NDNRFNLPVLTISAKTGWMTQEDRFSGDISGKQKKNYTLLHKG 71 Query: 85 QILYG----KLGPYLRKAIIADFDGICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRIE 138 ++ Y K+ Y + ++ K+ P ++ + DV +++ Sbjct: 72 ELSYNHGNSKVAKYGAVFSLQNYSEALIPHVYHSFKIIKETTPVFIENFFKKKDVNKQLR 131 Query: 139 AICEGATMSH-ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + +++++ ++I +++L++ + R +EL+K+ Sbjct: 132 KYISSSARMDGLLNISYSDFMKVHLFISQKISETKQIDKIFEILNSLLSLQQRKLELMKQ 191 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + L+ + T+ +P++ +K + W + T R N I + Sbjct: 192 LYRYLLENLNTEKKHPNIFIKGNYSHWNKVKLSDLGEIRTGKTPTPSVRSNYTNIGMPFV 251 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + + + T + + + QI G I+ I + E+ Sbjct: 252 TPTEIVDLYNYNT-SRFISNSGLKKAQIAPKGSILVTCIASIGKNTCVFK----EKVAFN 306 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 AV P+ + + ++ + Q + +D + +VP + EQ D Sbjct: 307 QQINAVTPNSFNDSTFLAFKSLQWSKRIDCLTANTAMQIINKKDFSNIETMVPNLNEQKD 366 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I+ + + L K + L+ + + Sbjct: 367 ISKIWLKSYS----LTYKYSDAKKLIIRLKKFLLQNLF 400 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 60/186 (32%), Gaps = 6/186 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS- 79 HW V + ++ TG+T + IG+ V L SR S + Sbjct: 216 SHWNKVKLSDLGEIRTGKTPTPSVRSNYTNIGMPFVTPTEIVDLYNYNTSRFISNSGLKK 275 Query: 80 --IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 I KG IL + + + + Q + P + S+ ++RI Sbjct: 276 AQIAPKGSILVTCIASIGKNTCVFKEKVAFNQQINAVTPNSFNDSTFLAFK-SLQWSKRI 334 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + M + K NI +P L EQ I + + + I L K Sbjct: 335 DCLTANTAMQIINKKDFSNIETMVPNLNEQKDISKIWLKSYSLTYKYSDAKKLIIRLKKF 394 Query: 198 KKQALV 203 Q L Sbjct: 395 LLQNLF 400 >gi|169350755|ref|ZP_02867693.1| hypothetical protein CLOSPI_01528 [Clostridium spiroforme DSM 1552] gi|169292618|gb|EDS74751.1| hypothetical protein CLOSPI_01528 [Clostridium spiroforme DSM 1552] Length = 647 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 49/386 (12%), Positives = 105/386 (27%), Gaps = 37/386 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + T L+ + + Y I E K + G + +D +I + Sbjct: 188 DWEQRKLGDCTFLSGKKNKNNLNLEPYAITNEHGFIPQNKAHDEFGYMKDTDRRAYNIVS 247 Query: 83 KGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQP-KDVLPELLQGWLLSIDVTQRIEA 139 K Y + + I S+ + V Q V L W + D I Sbjct: 248 KNSFAYNPARINIGSIGYYKGTENVIISSLYEVFQTVDSVYDPFLWQWFKTKDFQNWIIR 307 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + EG+ + + + + +P L EQ+ I A I + + + + Sbjct: 308 LQEGSVRLYFYYDKLCECIIRMPKLEEQIKIANYFEALDNLITLHQWKCMISRKNIVYAW 367 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 +G + + + + + + Sbjct: 368 ---------------------EQRKLGKIFVSMQNNTLSRADLSYDSGVAMNVHYGDILV 406 Query: 260 SYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 +G ++ R + E + G+I+ + + Sbjct: 407 KFGEVLDIKSERLPMIVDETVLDKYKSSFLKNGDIIIADTAEDETVGKCTEIAGLSDEYV 466 Query: 317 TSAYMAVKPHGIDST---YLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP-P 371 S + + YL + M S + G+ S+ ++ ++ P Sbjct: 467 ISGLHTIPYRPLQKFAFGYLGYYMNSTSYHNQLLPLMQGIKVTSISKVSLQNTVIIYPKS 526 Query: 372 IKEQFDITNVINVETARIDVLVEKIE 397 EQ I +D L+ + Sbjct: 527 KVEQAAIGKY----FYNLDNLITLHQ 548 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/385 (11%), Positives = 100/385 (25%), Gaps = 48/385 (12%) Query: 25 WKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVE--SGTGKYLPKDGNSRQSDTST 77 W+ + ++G + + I + + D+ + + + + Sbjct: 5 WEQRKLGEIGSASSGVGFPNSEQGGKEGIPFYKVSDMNLEGNEIEMTVSNNYVTKEQIAR 64 Query: 78 VSIFAKGQI---LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + K+G + + + + Sbjct: 65 KKWSPLNDVPAMYFAKVGAAVMLNRKRLCRFPFLFDNNTMAYSLNKEYWDINFAKAEFAK 124 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + + + + + +I + IP L EQ I I + + Sbjct: 125 IDLTKLVQVGALPSYNANDVESIKIMIPSLFEQSKIGNYFDELDRLITLHQRKIL----- 179 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+ D +G L + N+ N L Sbjct: 180 ----------------LDKYFLTIDWEQRKLGD---------CTFLSGKKNKNNLNLEPY 214 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I + K +K Y IV + + + S+ + E Sbjct: 215 AITNEHGFIPQNKAHDEFGYMKDTDRRAYNIVSKNSFAYNP--ARINIGSIGYYKGTENV 272 Query: 315 IITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPI 372 II+S Y + W ++ D + +R ++ + + +P + Sbjct: 273 IISSLYEVFQTVDSVYDPFLWQWFKTKDFQNWIIRLQEGSVRLYFYYDKLCECIIRMPKL 332 Query: 373 KEQFDITNVINVETARIDVLVEKIE 397 +EQ I N +D L+ + Sbjct: 333 EEQIKIANYFEA----LDNLITLHQ 353 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 52/148 (35%), Gaps = 15/148 (10%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + G + E + V +I + ND ++ A+V ++ + Sbjct: 35 FYKVSDMNLEGNEIEMTVSNNYVTKEQIARKKWSPLNDVPAMYFAKVGAAVMLNRKRLCR 94 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYA-----------MGSGLRQSLKFEDVKRLPVLVPPI 372 P D+ +A+ + F + G S DV+ + +++P + Sbjct: 95 FPFLFDNNTMAYSLNKEYWDINFAKAEFAKIDLTKLVQVGALPSYNANDVESIKIMIPSL 154 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSI 400 EQ I N + +D L+ ++ I Sbjct: 155 FEQSKIGNYFD----ELDRLITLHQRKI 178 >gi|126452645|ref|YP_001064383.1| restriction endonuclease S subunits [Burkholderia pseudomallei 1106a] gi|242315787|ref|ZP_04814803.1| type I site-specific deoxyribonuclease [Burkholderia pseudomallei 1106b] gi|126226287|gb|ABN89827.1| : Restriction endonuclease S subunits [Burkholderia pseudomallei 1106a] gi|242139026|gb|EES25428.1| type I site-specific deoxyribonuclease [Burkholderia pseudomallei 1106b] Length = 315 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 44/307 (14%), Positives = 102/307 (33%), Gaps = 30/307 (9%) Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G++ + + + P EQ I E + I + + Sbjct: 11 RNAVGSSYPALNDSDVRRFLIFAAPYREQEKIAEILDTLDTAIRETVVIIAKLKL----V 66 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIE--------WVGLVPDHWEVKPFFALVTELNRKNTK 250 K+ L+ ++T+G++ + +++ E +G +P WEV ++++EL + + Sbjct: 67 KRGLLHDLLTRGIDNNGELRPPPSEAPDLYIQSSLGWMPKEWEVVRLESVLSELGQGWSP 126 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQI---------VDPGEIVFRFIDLQND 301 + + I++ G + I V G+I+ N Sbjct: 127 DCPAESAGANEWGILKTTSIVWDGYNENENKRLPISLKPRPALEVASGDILITRAGPMNR 186 Query: 302 KRSLRSAQVMER--GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQS 356 + + I Y S Y A + S SG+ + + Sbjct: 187 VGVVAHVFGTRKKLMISDKMYRLRLLKSEVSAYFALALASTYAQDAISRTISGMAESQTN 246 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + ++ L + P EQ +I + + R+ S+ L++++S + + Sbjct: 247 ISQSVIRNLAIFRPKATEQGEIVERVRILDERL----AGEALSLHKLQKQKSGLVDDLLL 302 Query: 417 GQIDLRG 423 G++ + Sbjct: 303 GRVRVTP 309 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 29/80 (36%), Gaps = 4/80 (5%) Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 M S + +L DV+R + P +EQ I +++ +D + + Sbjct: 1 MSSAVTAQAVRNAVGSSYPALNDSDVRRFLIFAAPYREQEKIAEILDT----LDTAIRET 56 Query: 397 EQSIVLLKERRSSFIAAAVT 416 I LK + + +T Sbjct: 57 VVIIAKLKLVKRGLLHDLLT 76 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 32/96 (33%), Gaps = 7/96 (7%) Query: 18 IGAIPKHWKVVPIKRF-TKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNS 70 +G +PK W+VV ++ ++L G + + + + + Sbjct: 101 LGWMPKEWEVVRLESVLSELGQGWSPDCPAESAGANEWGILKTTSIVWDGYNENENKRLP 160 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI 106 A G IL + GP R ++A G Sbjct: 161 ISLKPRPALEVASGDILITRAGPMNRVGVVAHVFGT 196 >gi|218510288|ref|ZP_03508166.1| putative Type I restriction enzyme ecoeispecificity protein [Rhizobium etli Brasil 5] Length = 472 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 32/244 (13%), Positives = 63/244 (25%), Gaps = 54/244 (22%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG------NIIQKLETRNMGLKPESY 280 P W + N +K S +S G + G+ S Sbjct: 83 EEPKGWCWVTANDVWEFENGDRSKNYPSRDHFISDGVPFVNAGHLMNERVSFDGMNYISE 142 Query: 281 ETYQIV-----DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E + + G+ ++ + I +S + +L Sbjct: 143 EKFNNLSGGKLRKGDQIYCLRGSLGKHA--VYSFDRPAAIASSLVILRPMLSESVPFLKL 200 Query: 336 LMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + S + + +L ++K + +PP+ EQ+ I ++ A D L Sbjct: 201 YLSSDIAFSMLKRYDNGTAQPNLSSANLKLFEIPLPPLAEQYRIVAKVDELMALCDELEA 260 Query: 395 ---KIEQSIVLL-------------------------------------KERRSSFIAAA 414 + E L K+ R + + A Sbjct: 261 ARTEREAKRDRLAASSVARLNNPDPETFRDDARFALDALQALTARPNQIKQLRQTILNLA 320 Query: 415 VTGQ 418 V G+ Sbjct: 321 VRGK 324 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 55/173 (31%), Gaps = 11/173 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLP-KDGNSRQ 72 PK W V + G S++ + ++ + + + + + Sbjct: 85 PKGWCWVTANDVWEFENGDRSKNYPSRDHFISDGVPFVNAGHLMNERVSFDGMNYISEEK 144 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + KG +Y G + A+ I S+ ++ L+ +L S Sbjct: 145 FNNLSGGKLRKGDQIYCLRGSLGKHAVYSFDRPAAIASSLVILRPMLSESVPFLKLYLSS 204 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ G + + +P+PPLAEQ I K+ D Sbjct: 205 DIAFSMLKRYDNGTAQPNLSSANLKLFEIPLPPLAEQYRIVAKVDELMALCDE 257 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 10/77 (12%), Positives = 24/77 (31%), Gaps = 4/77 (5%) Query: 21 IPKHWKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP W + + ++ S + + + +++ G+ + SD Sbjct: 373 IPSSWTWGRVGDAVLFTQYGTSQKSHVSQSGVPVLTMGNIQDGSVIWGNDKRIPESSDDL 432 Query: 77 TVSIFAKGQILYGKLGP 93 K +LY + Sbjct: 433 PALYLKKFDLLYNRTNS 449 >gi|254416678|ref|ZP_05030429.1| Type I restriction modification DNA specificity domain protein [Microcoleus chthonoplastes PCC 7420] gi|196176644|gb|EDX71657.1| Type I restriction modification DNA specificity domain protein [Microcoleus chthonoplastes PCC 7420] Length = 272 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 41/165 (24%), Positives = 71/165 (43%), Gaps = 2/165 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P W+ V + + ES + +I + +E GTG+ L + TS Sbjct: 83 ELPYGWEWVRFDSVATIQSNLVKPESYSNYPHIAPDKIEKGTGRLLDCNTIQEDGVTSPK 142 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 F GQILY K+ P L KA++ DF+G+CS ++ + L ++L+ + + Sbjct: 143 HFFFSGQILYSKIRPNLSKAVVIDFEGLCSADMYPIKA-YIYTRYLHFYILTGTFLELVV 201 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + + + N +P+PPL EQ I K+ D Sbjct: 202 GYDNRLAIPKVNQQQLNNTVVPVPPLPEQHRIVAKVDRLMSFCDE 246 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 61/189 (32%), Gaps = 10/189 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTEL---NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 E +P WE F ++ T + + +I +L N + Sbjct: 79 EMPFELPYGWEWVRFDSVATIQSNLVKPESYSNYPHIAPDKIEKGTGRLLDCNTIQEDGV 138 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 G+I++ I K + + G+ ++ +K + +++ Sbjct: 139 TSPKHFFFSGQILYSKIRPNLSKAVVIDFE----GLCSADMYPIKAYIYTRYLHFYILTG 194 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 L V + + + V VPP+ EQ I ++ + D L K+ QS Sbjct: 195 TFLELVVGYDNRLAIPKVNQQQLNNTVVPVPPLPEQHRIVAKVDRLMSFCDELEAKLTQS 254 Query: 400 I---VLLKE 405 I L E Sbjct: 255 ISDREKLME 263 >gi|331002082|ref|ZP_08325601.1| hypothetical protein HMPREF0491_00463 [Lachnospiraceae oral taxon 107 str. F0167] gi|330411176|gb|EGG90592.1| hypothetical protein HMPREF0491_00463 [Lachnospiraceae oral taxon 107 str. F0167] Length = 375 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 39/375 (10%), Positives = 103/375 (27%), Gaps = 44/375 (11%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLED-------VESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + G T + + + G D E + + S+ + Sbjct: 2 GEVANIVGGGTPSTSNEKYWDGNIDWYAPAEIGEQIYAFWSIRKITEEGLKHSSAKLLPA 61 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + + + + DG + F + D L + + + ++ E + G Sbjct: 62 FKTVLFTSRAGIGNMAVLQKDGATNQGFQSIVCNDCL-VPYFVFSMGFQIKKKAERVAAG 120 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T S K + ++ + + EQ+ I + I ++ Sbjct: 121 STFSEISGKQLCDLEIMVTTDKEQLKIGSYFQSLDHLITLHQSKS--------------- 165 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 ++ +G + + F K+ I GN Sbjct: 166 --FKCFFVDVACCTLSWEQRKLGEIGSVAMCRRIF--------KHQTTESGEIPFFKIGN 215 Query: 264 IIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + E ++ Y + G ++ + ++ Sbjct: 216 FGGTPDAFISKDLFEDFKAKYPYPEKGAVLISASGSIGRTVVFTGKDEYFQD--SNIVWL 273 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + I +++L L + + L +++ + ++P + EQ I++ + Sbjct: 274 KHDNSITNSFLYHLYSIVRWVGIE----GTTIKRLYNDNILKTEAIIPLVSEQQKISDYL 329 Query: 383 NVETARIDVLVEKIE 397 + D L+ + Sbjct: 330 DAV----DHLITLHQ 340 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 44/152 (28%), Gaps = 4/152 (2%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N K + NI + I +++ K F+ + + Sbjct: 17 NEKYWDGNIDWYAPAEIGEQIYAFWSIRKITEEGLKHSSAKLLPAFKTVLFTSRAGIGNM 76 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 A + + G + ++ + Y + M K + + + L + Sbjct: 77 AVLQKDGATNQGFQSIVCNDCLVPYFVFSMGFQIKKKAERVAAGSTFSEISGKQLCDLEI 136 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +V KEQ I + +D L+ + Sbjct: 137 MVTTDKEQLKIGSY----FQSLDHLITLHQSK 164 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 54/194 (27%), Gaps = 12/194 (6%) Query: 24 HWKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + + R +I + + + ++ KD + + Sbjct: 179 SWEQRKLGEIGSVAMCRRIFKHQTTESGEIPFFKIGNFGGTPDAFISKDLF--EDFKAKY 236 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 KG +L G R + D +V D + L V Sbjct: 237 PYPEKGAVLISASGSIGRTVVFTGKDEYFQDSNIVWLKHDNSITNSFLYHLYSIVRWVG- 295 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG T+ I IP ++EQ I + + A I E I Sbjct: 296 --IEGTTIKRLYNDNILKTEAIIPLVSEQQKISDYLDAVDHLITLHQLEPYYLIFKAIAY 353 Query: 199 KQALVSYIVTKGLN 212 + ++ Y K N Sbjct: 354 R--IIDYAEYKFFN 365 >gi|331650404|ref|ZP_08351476.1| type I restriction-modification system, S subunit, EcoA family [Escherichia coli M605] gi|331040798|gb|EGI12956.1| type I restriction-modification system, S subunit, EcoA family [Escherichia coli M605] Length = 440 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 57/444 (12%), Positives = 114/444 (25%), Gaps = 66/444 (14%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W + F L G K G + G S D + Sbjct: 3 SEWINTTLGEFITLKRGYDLPKSKR---------NDGNIPVISSSGYSGTHDVP---MVK 50 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI-C 141 ++ G+ G + D +T V K P + L +ID + Sbjct: 51 GPGVVTGRYGTIGEVFYVVDDFWPINTTLYVSDFKGNSPLFVYYLLQTIDFHAYSDKAAV 110 Query: 142 EGATMSHADWKGIG---------NIPMPIPPLAEQVLIREKIIAETVRIDTLITER---- 188 G +H I I + + +++ + +KI ++ I + Sbjct: 111 PGINRNHVHMANIRVPKSVLEQEKIASILKKIEDRIHVNQKINDILEQMAQAIFKSWFVD 170 Query: 189 -------------IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV---------- 225 E A++S TK L + + Sbjct: 171 YEPVNAKLDVLESGGSEEEALCAAMAVISGKDTKALTAFKDEHPNEYSELKTIANLFPDA 230 Query: 226 ------GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 G +P W + V + E S + N K + Sbjct: 231 MTESEFGSIPLGWYLSEIGNEVKVVGGATPSTKEPAFWSNGSIFWATPKDLSNKKDKVLN 290 Query: 280 YETYQIVDPGEIVF-------RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 +I G I L + A I Y+A++ + + Sbjct: 291 TTERKITSLGVSKISSGVQPENTIILSSRAPVGYLAITKIPVAINQGYIAMQCNKVLPPE 350 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 ++ + ++ + ++ + + V+VP + T +I Sbjct: 351 FVLQWATHSMQEITIRSSGSTFAEISKKNFRTINVVVPSSELLMLYGKY----TRKIYDQ 406 Query: 393 VEKIEQSIVLLKERRSSFIAAAVT 416 + LKE ++S + ++ Sbjct: 407 INSKINESSKLKELKNSLLPKLLS 430 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 64/199 (32%), Gaps = 12/199 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKY---LPKDG 68 G+IP W + I K+ G T + + I + +D+ + K + Sbjct: 237 GSIPLGWYLSEIGNEVKVVGGATPSTKEPAFWSNGSIFWATPKDLSNKKDKVLNTTERKI 296 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 S + + + I+ P I + ++ +Q VLP Sbjct: 297 TSLGVSKISSGVQPENTIILSSRAPV-GYLAITKIPVAINQGYIAMQCNKVLPPEFVLQW 355 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + + G+T + K I + +P +L + +I++ I E Sbjct: 356 ATHSMQEITIRS-SGSTFAEISKKNFRTINVVVPSSELLMLYGKYTRKIYDQINSKINES 414 Query: 189 IRFIELLKEKKQALVSYIV 207 + EL L+S Sbjct: 415 SKLKELKNSLLPKLLSNAF 433 >gi|9507688|ref|NP_053007.1| hypothetical protein pNZ4000_02 [Lactococcus lactis subsp. cremoris] gi|2895543|gb|AAC64329.1| unknown [Lactococcus lactis subsp. cremoris] Length = 310 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 61/170 (35%), Gaps = 6/170 (3%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 ++ + GN + + E+ Q D + I + + Sbjct: 41 HGTPNYSDNGDVFFINGNNLVNGKIVITKETKLVTESNQSKDDKLLNMDTILMSINGTIG 100 Query: 306 RSAQ-VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVK 363 A ER ++ + + D ++ +++ + F + ++L + ++ Sbjct: 101 NLAWYNNERVMLGKSAAYLTVSNFDKKFIFSYLQTSTIKNYFLNNLTGTTIKNLGLKTIR 160 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + VP ++EQ I + ++D + ++ + LLKE++ ++ Sbjct: 161 DTTLFVPTLEEQQKIGSF----FKQLDDTIALHQRKLDLLKEQKKGYLQK 206 >gi|194397904|ref|YP_002037176.1| Type I restriction modification DNA specificity domain [Streptococcus pneumoniae G54] gi|194357571|gb|ACF56019.1| Type I restriction modification DNA specificity domain [Streptococcus pneumoniae G54] Length = 364 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 80/210 (38%), Gaps = 12/210 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRISTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEY 256 Query: 393 VEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 257 AESYNRLEQLDKEFPDKLKKSILQYAMQGK 286 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 83/214 (38%), Gaps = 13/214 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V I ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRISTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKK----QALVSYIVTKGLNPDVKMKDS 220 +L KE ++++ Y + L +S Sbjct: 263 LEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDES 296 >gi|237753063|ref|ZP_04583543.1| conserved hypothetical protein [Helicobacter winghamensis ATCC BAA-430] gi|229375330|gb|EEO25421.1| conserved hypothetical protein [Helicobacter winghamensis ATCC BAA-430] Length = 466 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 51/441 (11%), Positives = 120/441 (27%), Gaps = 65/441 (14%) Query: 29 PIKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGKYLPKDGNSRQSDTST---V 78 P+K F K+ +G+ G+ Y+ ++D++S + S D T Sbjct: 33 PLKNFVKIKSGKRIPKGRSYANTTTAYKYLRVDDLDSEILEIDIDKLKSIDKDIFTLLER 92 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 ++ G + I + + + ++ + L + + + Sbjct: 93 YEIYNDEVALSIAGTIGKVFIFHN---ATNNRVILTENCVKLQAQDNLLPKFLSLILKTN 149 Query: 139 AICEGATMSHADWKGIGN--------IPMPIPPLAEQVLIREKIIAETVRIDT------- 183 + + IPPL+ Q I + + Sbjct: 150 FLQSQMKRQYIQTTIPKLAIERIKELQIPSIPPLSTQQHIIDLMDKAYKAKQEKENKAKE 209 Query: 184 --------------LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 +I L A +S + + + K L+ Sbjct: 210 LLDSIDSYLLEELGIILPLRANNTLDSRIYTAKISALSGSRFDANYHQKYYRDLEKSLLS 269 Query: 230 DHWEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 + + +L+ + + I + +I + K S ++ Sbjct: 270 SPYPLVNLASLINNFKKGIEVGSSEYSQNKEIPFIRVSDITNNGIDFDNVQKFISASLFE 329 Query: 285 IVDPGEIVFRF--IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + + A V II+ + ++ + + Sbjct: 330 NLKAYKPKQNELLYSKDGTVGICLEADVSRDYIISGGILRLELKAEVDKDFLCFLLGSYM 389 Query: 343 CKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 VF S + + L + +L + +PP+ Q I N + ++++ L + E Sbjct: 390 INVFANRVSIGAVIKHLNIGEFLKLKIPLPPLAIQTQIANRL--KSSKFQALSLEKEA-- 445 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 + A +ID+ Sbjct: 446 -------KEILHKA---KIDV 456 >gi|311033110|ref|ZP_07711200.1| putative restriction-modification enzyme type I S subunit [Bacillus sp. m3-13] Length = 393 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 59/405 (14%), Positives = 134/405 (33%), Gaps = 50/405 (12%) Query: 25 WKVVPIKRFT-------KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + + + +I + ++ + Sbjct: 20 WEQRKLGDLLAYEQPTKYIVKSTYYDDSFEIPVLTAG----------QSFILGYTNEENG 69 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + + + + + S+ +L K + + ++ Sbjct: 70 IKEVSDEDPVIIFDDFTTGSHYVDFPFKVKSSAMKLLSLKSGDEDFYFIYNTLKNIKYVP 129 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 ++ + ++ +P KI + D LIT R ++LL E Sbjct: 130 QSHE----RHWISKFSLFDVAVPSSDEQ------AKIGGYFKQFDNLITLHQRNLKLLNE 179 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K++L+ + P I + G D WE + ++ E K E +L Sbjct: 180 TKKSLLQKMF-----PKDGANVPEIRFEGFT-DAWEQRRLDKILKERKVKQKITEEFPLL 233 Query: 258 SLSYGNII-----QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + G + +K R+ K + +TY + +IV+ N K Sbjct: 234 AFASGQGVIDRSERKTNNRDFLTKDATKKTYLLTKYDDIVYN---PSNLKYGAIDRNKHG 290 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLV 369 +G+I+ Y+ + I +++ +++S + + G RQ++K E + L V++ Sbjct: 291 QGVISPIYVTFETDEI-PSFIELIVKSKNFKQRALQYEEGTVTKRQAVKPEHLLCLNVVL 349 Query: 370 P-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P EQ I N ++D ++ ++ + LK + S + Sbjct: 350 PNSKDEQIKIGNF----FKQLDDMITLHQRELHSLKNLKKSLLQQ 390 >gi|311113526|ref|YP_003984748.1| type I restriction modification enzyme protein S [Rothia dentocariosa ATCC 17931] gi|310945020|gb|ADP41314.1| type I restriction modification enzyme protein S [Rothia dentocariosa ATCC 17931] Length = 367 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 53/369 (14%), Positives = 112/369 (30%), Gaps = 24/369 (6%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLE-DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + KL GR + G ++ + K + ++ Sbjct: 2 RLGDLVKLYKGRKPLEIVNEPIEGYRRSLQISDLRPGAKPRYCPADKKE--LLAVPNDVI 59 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G + ST ++ ++ L + + A +GAT+ Sbjct: 60 IAWDGANAGTTSHGLKGSVGSTLMVLRIQQEELIDTAYLGHFIASKQSYLRAKTKGATIP 119 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 H D + ++ +P+PPL EQ I + ++ R + L E + Sbjct: 120 HLDRVILESLDVPLPPLEEQKRIVAILDKA----KSIQEAREHQLTTLDELLISFFKDSF 175 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 P +K+ G P + V E N + + L G Sbjct: 176 HAEDYPHKPLKEIATVLSGGTP--------RSSVQEYWNGNIEWVTPADLGQHEGIYFSS 227 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + ++ G ++ +G + V Sbjct: 228 SSRKITD-TGLKNSSAVLLPIGSVMMSSRAPIGHLAINTVPMATNQGFKS----IVPGEE 282 Query: 328 IDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 I + YL + ++S+ K ++G + + V+ + V VPPI++Q + ++ Sbjct: 283 ITNLYLLFWLKSH--MKYIQSLGVGATFKEISKRGVENIKVPVPPIRKQNRFSRKVSKII 340 Query: 387 ARIDVLVEK 395 ++ L+ K Sbjct: 341 SQ-QTLIHK 348 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 42/126 (33%), Gaps = 12/126 (9%) Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 P +++ + + G + ID+ YL + S Sbjct: 55 PNDVIIAWDGAN--AGTTSHGLKGSVGSTLMVLRIQQEELIDTAYLGHFIASK--QSYLR 110 Query: 348 AMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 A G L ++ L V +PP++EQ I +++ + + E E + L E Sbjct: 111 AKTKGATIPHLDRVILESLDVPLPPLEEQKRIVAILD----KAKSIQEAREHQLTTLDEL 166 Query: 407 RSSFIA 412 I+ Sbjct: 167 ---LIS 169 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 65/191 (34%), Gaps = 15/191 (7%) Query: 28 VPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYL---PKDGNSRQSDTSTV 78 P+K + +G T S +I ++ D+ G Y + S+ Sbjct: 183 KPLKEIATVLSGGTPRSSVQEYWNGNIEWVTPADLGQHEGIYFSSSSRKITDTGLKNSSA 242 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + G ++ P I + F + P + + L + I+ Sbjct: 243 VLLPIGSVMMSSRAPI-GHLAINTVPMATNQGFKSIVPGEEITN-LYLLFWLKSHMKYIQ 300 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ GAT +G+ NI +P+PP+ +Q + +I + T + +E K Sbjct: 301 SLGVGATFKEISKRGVENIKVPVPPIRKQNRFSR----KVSKIISQQTLIHKSLENDKSL 356 Query: 199 KQALVSYIVTK 209 ++ S Sbjct: 357 FLSIQSRFFNY 367 >gi|149189421|ref|ZP_01867706.1| restriction modification system DNA specificity domain [Vibrio shilonii AK1] gi|148836779|gb|EDL53731.1| restriction modification system DNA specificity domain [Vibrio shilonii AK1] Length = 589 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 65/185 (35%), Gaps = 13/185 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESN---ILSLSYGNIIQKLETRNMGLKPESYETY 283 +P W + L T+L + + S N+ + + + Y Sbjct: 105 ELPQSWAIARLGNLCTKLTDGSHNPAKDFGSGYPMFSSQNVHFRSIDFTSPSRYVDEDNY 164 Query: 284 QI------VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++P +++ + R+ ++ + ++ DS +L + + Sbjct: 165 LKEHARTQIEPRDVLLTIVGTLG--RAAVVPNDAPEFVLQRSVAVLQTKI-DSDFLTYFL 221 Query: 338 RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S K F G G ++ + + +PV VP ++EQ I ++ D L ++ Sbjct: 222 ASPTCIKYFEENGKGTAQKGIYLGKLSLMPVFVPSLEEQHRIVAKVDELMTLCDQLEQQT 281 Query: 397 EQSIV 401 E SI Sbjct: 282 EASIA 286 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 46/467 (9%), Positives = 110/467 (23%), Gaps = 89/467 (19%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLP---KDGNSRQ 72 +P+ W + + T + KD ++V + + Sbjct: 105 ELPQSWAIARLGNLCTKLTDGSHNPAKDFGSGYPMFSSQNVHFRSIDFTSPSRYVDEDNY 164 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSI 131 + +L +G R A++ + Q V + + + S Sbjct: 165 LKEHARTQIEPRDVLLTIVGTLGRAAVVPNDAPEFVLQRSVAVLQTKIDSDFLTYFLASP 224 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID--------- 182 + E +G + +P+ +P L EQ I K+ D Sbjct: 225 TCIKYFEENGKGTAQKGIYLGKLSLMPVFVPSLEEQHRIVAKVDELMTLCDQLEQQTEAS 284 Query: 183 --------------------------------TLITERIRFIELLKEKKQALVSYIVTKG 210 E + + KQ ++ V Sbjct: 285 IAAHQVLVTTLLGTLTNSANAEELMQNWQLVAEHFDTLFTTEESIDQLKQTILQLAVMGK 344 Query: 211 LNPDVKMKDSGIEWV----------------------------GLVPDHWEVKPFFALVT 242 L P + + + + + + L Sbjct: 345 LVPQDQNDEPASKLLERIAEEKAQLIKEKKIKKQKALPPIADDEKPFELPSGWEWCRLGD 404 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG------EIVFRFI 296 + S G + + Y + ++ + Sbjct: 405 LCKLVTSGSRGWKEYYASSGATFIRSQDIKYDRLDFDERAYVQLPKSTEGKRTKVDVGNL 464 Query: 297 DLQNDKRSLRSAQVMER----GIITSA--YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + ++ V+E ++ + + + WL S + Sbjct: 465 LMTITGANVGKVAVVEDPIEEAYVSQHVALIKLIDDVLIDYLHVWLTGSMGGRGLLLQSS 524 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 G + L +++ L + +P + E + + A + L + I+ Sbjct: 525 YGAKPGLNLQNINELLIPLPTMLELNRVVLKVREMLAISEQLKDYIK 571 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 61/198 (30%), Gaps = 10/198 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W+ + KL T + + +I +D++ + + Sbjct: 392 ELPSGWEWCRLGDLCKLVTSGSRGWKEYYASSGATFIRSQDIKYDRLDFDERAYVQLPKS 451 Query: 75 TSTVS-IFAKGQILYGKLGPYLRKAIIAD---FDGICSTQF-LVLQPKDVLPELLQGWLL 129 T G +L G + K + + + S L+ DVL + L WL Sbjct: 452 TEGKRTKVDVGNLLMTITGANVGKVAVVEDPIEEAYVSQHVALIKLIDDVLIDYLHVWLT 511 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + + I + +P+P + E + K+ + L Sbjct: 512 GSMGGRGLLLQSSYGAKPGLNLQNINELLIPLPTMLELNRVVLKVREMLAISEQLKDYIK 571 Query: 190 RFIELLKEKKQALVSYIV 207 + +A+V + Sbjct: 572 SYQTTQLYLTEAIVEQAI 589 >gi|301633597|gb|ADK87151.1| type I restriction modification DNA specificity domain protein [Mycoplasma pneumoniae FH] Length = 373 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 47/382 (12%), Positives = 101/382 (26%), Gaps = 38/382 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K IK +++ G+ ++D Y N+ + Sbjct: 4 KTYKIKDICEISRGKAITKKY------IKDNPGQYPVYSSTTANNGEIGRIKDYDLDGEY 57 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + + G Y + S V K E+ +L + + + Sbjct: 58 VTWTTDGIYAGTVFYRNEKFNASQHCGV--LKLKNNEISAKFLTYALGMEAPKFVNNACP 115 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + + I + PPL Q I + T L + ++ Sbjct: 116 IPNLNLSRTEEIELDFPPLQIQQKIATILDTFTELSAELRERKKQYAFYRDYL------- 168 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA-LVTELNRKNTKLIESNILSLSYGNI 264 LN + K G ++ E+ + S + N Sbjct: 169 -----LNQENIRKIYGANIPFETFQVKDICEIRRGRAITKAYIRNNPGENPVYSAATTND 223 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + ++ E N + + + + Sbjct: 224 GELGRIKDCDFDGEYI---------------TWTTNGYAGVVFYRNGKFNASQDCGVLKV 268 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + T + + K + + S R L + + + + PP++ Q I +++ Sbjct: 269 KNKKICTKFLSFLLKIEAPKFVHNLAS--RPKLSQKVMAEIELSFPPLEIQEKIADILFA 326 Query: 385 ETARIDVLVEKIEQSIVLLKER 406 + LVE I I L K++ Sbjct: 327 FEKLCNDLVEGIPAEIELRKKQ 348 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 14/174 (8%), Positives = 44/174 (25%), Gaps = 4/174 (2%) Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + K+ I + + + ++ ++ Sbjct: 2 QIKTYKIKDICEISRGKAITKKYIKDNPGQYPVYSSTTANNGEIGRIKDYDLDGEYVTWT 61 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 D + S + V + +L + + + + +L Sbjct: 62 TDGIYAGTVFYRNEKFNASQHCGVLKLKNNEISAKFLTYALGMEAPKFVNNACPIPNLNL 121 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + + PP++ Q I +++ T L ++ + R + Sbjct: 122 SRTEEIELDFPPLQIQQKIATILDTFTE----LSAELRERKKQYAFYRDYLLNQ 171 >gi|319758540|gb|ADV70482.1| type I restriction-modification system, S subunit [Streptococcus suis JS14] Length = 299 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 66/212 (31%), Gaps = 20/212 (9%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGLK 276 +E +PD WE L + K + NI ++ ++ ++ + Sbjct: 78 VEVPYEIPDSWEWVRLRNLGVITSGGTPKSSESTYYDGNITWITPADMGKQQNNKLFATS 137 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + + + R+ V +V P ++ ++ Sbjct: 138 SKKITELGVQKSSAQLISKNSIVYSSRAPIGHINIVNYDFTTNQGCKSVTPILVNLDFMY 197 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 W+++ + + + + + +PP+ EQ I I + VE Sbjct: 198 WILQ-FRTKDIILRSSGTTFKEISASGFGDTLLPLPPLAEQKRIVAHIERALEQ----VE 252 Query: 395 KIEQSIVLLKE--------RRSSFIAAAVTGQ 418 +S L+E + S + A+ G+ Sbjct: 253 VYAESYNKLQELDRAFPDKLKKSILQYAMQGK 284 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 44/226 (19%), Positives = 81/226 (35%), Gaps = 24/226 (10%) Query: 5 KAYPQY-----KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGL 53 K Y + K V + IP W+ V ++ + +G T +S + +I +I Sbjct: 65 KPYEKLADGTVKKVEVPY--EIPDSWEWVRLRNLGVITSGGTPKSSESTYYDGNITWITP 122 Query: 54 EDV----ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST 109 D+ + K S+ + +K I+Y P I ++D + Sbjct: 123 ADMGKQQNNKLFATSSKKITELGVQKSSAQLISKNSIVYSSRAPI-GHINIVNYDFTTNQ 181 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 + P V L + + T+ I G T G G+ +P+PPLAEQ Sbjct: 182 GCKSVTPILVN--LDFMYWILQFRTKDIILRSSGTTFKEISASGFGDTLLPLPPLAEQKR 239 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 I I +++ + EL + ++++ Y + L Sbjct: 240 IVAHIERALEQVEVYAESYNKLQELDRAFPDKLKKSILQYAMQGKL 285 >gi|301794018|emb|CBW36416.1| putative type I RM modification enzyme [Streptococcus pneumoniae INV104] Length = 373 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 41/396 (10%), Positives = 116/396 (29%), Gaps = 31/396 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKLQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++P + +++ + + L +K++ + +PP+ Q + Sbjct: 282 MVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFA 340 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + A +D I++S+ L+ + S + Sbjct: 341 DFV----ALVDKSQLAIQKSLEELETLKKSLMQEYF 372 >gi|152986197|ref|YP_001351360.1| type I restriction-modification system subunit S [Pseudomonas aeruginosa PA7] gi|150961355|gb|ABR83380.1| type I restriction-modification system, S subunit, putative [Pseudomonas aeruginosa PA7] Length = 547 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 69/192 (35%), Gaps = 3/192 (1%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET-RNMGLKPESYE 281 E VP +WE A+ + +K + I + N ++ + + + Sbjct: 78 EKPFDVPTNWEWVRVAAVGHDWGQKTPDKAFTYIDVGAVDNAAGRISAPQVLMAEDAPSR 137 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS-TYLAWLMRSY 340 ++V PG +++ I ++ + I ++A+ + P+ Y +RS Sbjct: 138 ARKVVRPGTVIYSTIRPYLLNVAVIEEAYEQEPIASTAFAIIHPYLEMPARYFLCYLRSP 197 Query: 341 DLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + ++ G+ ++ + +PP+ EQ I ++ A D L + + Sbjct: 198 VFVRYVESVQMGIAYPAINDGQFFSGLIPLPPLAEQHRIVAKVDELMALCDRLEARQADA 257 Query: 400 IVLLKERRSSFI 411 + + + Sbjct: 258 DSAHAQLVQALL 269 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 74/190 (38%), Gaps = 8/190 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL-PKDGNSRQSDTSTV 78 +P +W+ V + +T + YI + V++ G+ P+ + + + Sbjct: 82 DVPTNWEWVRVAAVGHDWGQKTPDKA--FTYIDVGAVDNAAGRISAPQVLMAEDAPSRAR 139 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQ-GWLLSIDV 133 + G ++Y + PYL + + I ST F ++ P +P +L S Sbjct: 140 KVVRPGTVIYSTIRPYLLNVAVIEEAYEQEPIASTAFAIIHPYLEMPARYFLCYLRSPVF 199 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + +E++ G + + +P+PPLAEQ I K+ D L + Sbjct: 200 VRYVESVQMGIAYPAINDGQFFSGLIPLPPLAEQHRIVAKVDELMALCDRLEARQADADS 259 Query: 194 LLKEKKQALV 203 + QAL+ Sbjct: 260 AHAQLVQALL 269 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 42/119 (35%), Gaps = 3/119 (2%) Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 +L N + + + A++ Y+ + DL + Sbjct: 431 NLINRSTPIAFMARGKYWVNNHAHVLDGVSEALLLYVQLYFNAIDLKPYV---TGTAQPK 487 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + + +L+PP EQ I ++ A D L ++ Q+ + + S+ + AV Sbjct: 488 MNQAKMNSIVLLLPPEAEQHRIVAKVDQLMALCDQLKARLNQARQVHEHLASALVEQAV 546 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 66/201 (32%), Gaps = 20/201 (9%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 D+ V +P W + + T G + I E G K G S Sbjct: 361 DTEVT----VPAGWSLSTVGEVTICRDG------ERIPVSQAE--REGRAKTYDYYGASG 408 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRK-----AIIADFDGICSTQFLVLQPKDVLPELLQG 126 D +F K +L G+ G L A +A + VL D + E L Sbjct: 409 VIDKIDGYLFDKPLLLVGEDGANLINRSTPIAFMARGKYWVNNHAHVL---DGVSEALLL 465 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 ++ ++ G + + +I + +PP AEQ I K+ D L Sbjct: 466 YVQLYFNAIDLKPYVTGTAQPKMNQAKMNSIVLLLPPEAEQHRIVAKVDQLMALCDQLKA 525 Query: 187 ERIRFIELLKEKKQALVSYIV 207 + ++ + ALV V Sbjct: 526 RLNQARQVHEHLASALVEQAV 546 >gi|48243740|gb|AAT40844.1| putative type I restiction/modification specificity protein [Haemophilus influenzae] Length = 371 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 59/387 (15%), Positives = 118/387 (30%), Gaps = 42/387 (10%) Query: 26 KVVPIKRFTKLNT-----GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 K +P+ ++ + L ++ Y +D + S V I Sbjct: 7 KWIPLGDVADYEQPTKYLVNSTVYNDNYPTPVLTAGKTFILGYTNEDEGIYFASKSPVII 66 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F + DFD + + + L ++ T Sbjct: 67 F----------DDFTTANKWVDFDFKAKSSAMKMITSKNEKFALLKYIYYWLNTLPNNQT 116 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 IP IPPL+ Q I + + A T L +E I + + ++ Sbjct: 117 DGDHKRQWISNYANKLIP--IPPLSVQTEIVKILDALTTLTSELTSELILRQKQYEYYRE 174 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L++ + + + +G V K KN +I Sbjct: 175 KLLN-----------IDEMNKVTELGDVGPVRMCKRIL--------KNQTANSGDIPFYK 215 Query: 261 YGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 G +K + + Y+ Y G+I+ E + Sbjct: 216 IGTFGKKPDAYISNELFQEYKQKYSYPKKGDILISASGTIGRTVIF----DGENSYFQDS 271 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + +L Y + K A G G Q L +++K++ + +PP+KEQ I Sbjct: 272 NIVWIDNDETLVLNKYLYHFYKIAKWGIAEG-GTIQRLYNDNLKKVKISIPPLKEQHRIV 330 Query: 380 NVINVETARIDVLVEKIEQSIVLLKER 406 ++++ + + E + +I ++R Sbjct: 331 SILDKFETLTNSITEGLPLAIEQSQKR 357 >gi|325066641|ref|ZP_08125314.1| type I restriction/modification specificity protein [Actinomyces oris K20] Length = 287 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 36/309 (11%), Positives = 83/309 (26%), Gaps = 32/309 (10%) Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 T F + + P+ + ++ +E + + + + + + +P PP++ Q Sbjct: 2 DTIFYTQIGEQLEPKFFYYYFQTL----HLERMNQAGGVPSLTQRTLNELKIPTPPISIQ 57 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + + T L E + +++ T N + + Sbjct: 58 WEIVKILDQFTELEAELEAELGVRKQQYSH----YLNHFFTSNANTRTR-------TLRD 106 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 V K T + + L E E Y Sbjct: 107 VGPVRMCKRITKNQTSQQGGVPFYKIRTFGGTA-------DAYISRELYNEYKEQYHFPK 159 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 PG I+ + + + + +L Y + Sbjct: 160 PGSILISAAGTIGR----AVPYDGKDAYFQDSNIVWIENDETLVLNRYLFYFYKVANW-- 213 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV----LL 403 G + L + + + +PP+ EQ I + ++ ++ L + I Sbjct: 214 KTDDGTIKRLYNDRLLNTAIPIPPLSEQHRIVDCLDKFDTLVNDLTSGLPAEIEARRRQY 273 Query: 404 KERRSSFIA 412 + R + Sbjct: 274 EYYRDRLLT 282 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 52/188 (27%), Gaps = 10/188 (5%) Query: 27 VVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 ++ + + + + + + Y+ ++ Sbjct: 101 TRTLRDVGPVRMCKRITKNQTSQQGGVPFYKIRTFGGTADAYISREL--YNEYKEQYHFP 158 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G IL G R D +V E L + Sbjct: 159 KPGSILISAAGTIGRAVPYDGKDAYFQDSNIVWI---ENDETLVLNRYLFYFYKVANWKT 215 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + T+ + N +PIPPL+EQ I + + ++ L + IE + + + Sbjct: 216 DDGTIKRLYNDRLLNTAIPIPPLSEQHRIVDCLDKFDTLVNDLTSGLPAEIEARRRQYEY 275 Query: 202 LVSYIVTK 209 ++T Sbjct: 276 YRDRLLTF 283 >gi|208435397|ref|YP_002267063.1| typeI R-M system specificity subunit [Helicobacter pylori G27] gi|208433326|gb|ACI28197.1| typeI R-M system specificity subunit [Helicobacter pylori G27] Length = 212 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 15/130 (11%), Positives = 43/130 (33%), Gaps = 6/130 (4%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGS 351 I + + + + P+ + +L + ++ + + Sbjct: 59 NTITIAQYGTAGYVNFQKNKFWANDVCFCIYPNKDIIKNIFLYYFLKVNQNYLYEISNRN 118 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 S+ + + + +PP+ EQ I N+++ I L K Q + + + Sbjct: 119 ATPYSISKDKILDFEIPLPPLNEQIAIANILSDVDHEIISLKNKKRQ----FENIKKALN 174 Query: 412 AAAVTGQIDL 421 ++ +I + Sbjct: 175 HDLMSAKIRV 184 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 66/209 (31%), Gaps = 11/209 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P +W+ V + ++ G + ++ V G G + +R Sbjct: 7 PSNWQRVRLGDICEIKRGVRITKNELDVFGKYPVVSGGVGFLGYTNNFNR---------- 56 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + G + F + KD++ + + L ++ E Sbjct: 57 YENTITIAQYGTAGYVNFQKNKFWANDVCFCIYPNKDIIKNIFLYYFLKVNQNYLYEISN 116 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 AT I + +P+PPL EQ+ I + I +L ++ +F + K Sbjct: 117 RNATPYSISKDKILDFEIPLPPLNEQIAIANILSDVDHEIISLKNKKRQFENIKKALNHD 176 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 L+S + L K P Sbjct: 177 LMS-AKIRVLKKLTPQKSRTNPLHKETPK 204 >gi|228994624|ref|ZP_04154448.1| hypothetical protein bpmyx0001_53020 [Bacillus pseudomycoides DSM 12442] gi|228765109|gb|EEM13839.1| hypothetical protein bpmyx0001_53020 [Bacillus pseudomycoides DSM 12442] Length = 405 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 65/381 (17%), Positives = 141/381 (37%), Gaps = 29/381 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTVS 79 W + + + N G ++ K +I + D+ + K + T + Sbjct: 15 WSSIKLDELLEFNNGINADKNSYGKGRKFINVLDILNNEHIVYENIKGSVEVDAKTENNN 74 Query: 80 IFAKGQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPK--DVLPELLQGWLLSID 132 G IL+ + + + + F++ K + P L+ L + Sbjct: 75 KVEYGDILFLRSSETREDVGKCSVYLDEKEYCLFGGFVIRGKKIAEYEPYFLKLNLETPL 134 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +I + G+T + + ++ + IP + EQ I + + + I + + I Sbjct: 135 IRHQIGSKSGGSTRFNVSQSILSSVEIKIPSINEQKKISKFMD----LFNKKIQLQQQKI 190 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 +LL+E+K+ + + K +++ +G D WE + F ++TE K Sbjct: 191 DLLQEQKKGFLQKMFPKAGEKQPQVRFAG------FTDDWEQREFGEIITERREKTKIEN 244 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E +LS + + E + S Y + +++ +L + Q E Sbjct: 245 EDTLLSSAIDGMYLNSELF-SHFRGASNIGYLKIRKNDMILSAQNL--HLGNCNINQRFE 301 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM----GSGLRQSLKFEDVKRLPVL 368 GII+ AY +D+ ++ ++ + F S R+++++ + + Sbjct: 302 HGIISPAYKVYSLVNVDAAFMHAWIKKDSTKQFFEKATTEGASVCRKNIEWGTLYSQKIY 361 Query: 369 VPPIKEQFDITNVINVETARI 389 +P EQ I + NV RI Sbjct: 362 IPIYSEQQKIGELFNVLDKRI 382 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 57/163 (34%), Gaps = 8/163 (4%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 N+L + I + E V+ G+I+F + S + E+ Sbjct: 45 NVLDILNNEHIVYENIKGSVEVDAKTENNNKVEYGDILFLRSSETREDVGKCSVYLDEKE 104 Query: 315 IITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVP 370 ++ I +L + + + G R ++ + + + +P Sbjct: 105 YCLFGGFVIRGKKIAEYEPYFLKLNLETPLIRHQIGSKSGGSTRFNVSQSILSSVEIKIP 164 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I EQ I+ ++ + ++ +Q I LL+E++ F+ Sbjct: 165 SINEQKKISKFMD----LFNKKIQLQQQKIDLLQEQKKGFLQK 203 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 50/185 (27%), Gaps = 8/185 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ +T +D + D + + R + K Sbjct: 223 DWEQREFGEIITERREKTKIENEDTLLSSAIDGMYLNSELF---SHFRGASNIGYLKIRK 279 Query: 84 GQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 ++ +L I GI S + V +V + W+ Q E Sbjct: 280 NDMILSAQNLHLGNCNINQRFEHGIISPAYKVYSLVNVDAAFMHAWIKKDSTKQFFEKAT 339 Query: 142 E---GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + +W + + + IP +EQ I E RI + + K Sbjct: 340 TEGASVCRKNIEWGTLYSQKIYIPIYSEQQKIGELFNVLDKRIQLQQQKLELLQKQKKGF 399 Query: 199 KQALV 203 Q + Sbjct: 400 MQQMF 404 >gi|257440125|ref|ZP_05615880.1| ribosomal protein L10 [Faecalibacterium prausnitzii A2-165] gi|257197477|gb|EEU95761.1| ribosomal protein L10 [Faecalibacterium prausnitzii A2-165] Length = 387 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 61/414 (14%), Positives = 125/414 (30%), Gaps = 50/414 (12%) Query: 29 PIKRFTK--LNTGRTSESGKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKGQ 85 + K L+T E I Y+ + G + + + + G Sbjct: 2 RLGDCGKTNLHTYSDKEKWSLIRYLDTGSITEGRIDEIQTLYPGVDKIPSRARRKASVGD 61 Query: 86 ILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAI 140 IL+ + P + I + + + ST F V+ + P + +L V + ++AI Sbjct: 62 ILFSTVRPNQKHYGIIEAGTENLLVSTGFTVVTVDTTIADPYFIYYYLTQSSVIESLQAI 121 Query: 141 --CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +T I +I + +P L Q I + + I + L + Sbjct: 122 AEQSTSTYPSIKPSDIEDIELDLPELETQKKIGSTLRMLDRK----IALNEEINDNLYAQ 177 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE-----LNRKNTKLIE 253 +A+ + +P W + + + + E Sbjct: 178 AKAIFDNHFI---------------NIDAIPAGWRKGNLLDIANYLNGLAMQKFRPQGHE 222 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + L + Q + L S + I+ G++VF + L Sbjct: 223 IGLPVLKIKELRQGSCDDSSELCSLSIKPEYIIHNGDVVFSWSGSL-----LVDIWCGGT 277 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPP 371 + V D + +L ++ L + + +K E++ + VL+P Sbjct: 278 CGLNQHLFKVTSDVYD-KWFYYLWTAHHLARFIAIAADKATTMGHIKREELAKAEVLIPC 336 Query: 372 IKEQFDITNV--INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + + N I L+ L R + +TG+ID+ Sbjct: 337 EE------DYTSFNSIMQPIFELIISNRIESRKLAALRDELLPKLMTGEIDISD 384 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 50/190 (26%), Gaps = 10/190 (5%) Query: 21 IPKHWKVVPIKRFTKLNTG----RTSESGKDI--IYIGLEDVESGTGKYLPKDGNSRQSD 74 IP W+ + G + G +I + ++++ G+ + Sbjct: 192 IPAGWRKGNLLDIANYLNGLAMQKFRPQGHEIGLPVLKIKELRQGSCDDSSE---LCSLS 248 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 I G +++ G L G + + W Sbjct: 249 IKPEYIIHNGDVVFSWSGSLLVDIWCGGTCG-LNQHLFKVTSDVYDKWFYYLWTAHHLAR 307 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 A + TM H + + + IP + + I + E + L Sbjct: 308 FIAIAADKATTMGHIKREELAKAEVLIPCEEDYTSFNSIMQPIFELIISNRIESRKLAAL 367 Query: 195 LKEKKQALVS 204 E L++ Sbjct: 368 RDELLPKLMT 377 >gi|91773783|ref|YP_566475.1| restriction modification system DNA specificity subunit [Methanococcoides burtonii DSM 6242] gi|91712798|gb|ABE52725.1| Restriction modification system DNA specificity subunit [Methanococcoides burtonii DSM 6242] Length = 391 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 47/398 (11%), Positives = 108/398 (27%), Gaps = 41/398 (10%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + + I + +D + +Q+ + + Q Sbjct: 20 KKLGEVFIEVNEKVGQRNLETYSITAGQGFVSQKEKFGRDISGQQN--AKYTALQVNQFA 77 Query: 88 YGKLGPYLRKAII-----ADFDGICSTQFLVL------QPKDVLPELLQGWLLSIDVTQR 136 Y K K D F+ +L + L + Q Sbjct: 78 YNKGNSKKYKYGCVYLNTTDKQIAVPNVFISFKLIDNEMSSVFYAKLFENHYLDKGLRQI 137 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I + + + + K + + +P EQ I + + ++ L ++ + K Sbjct: 138 ISSSARMDGLLNVNKKYFFQLKIIVPTTPEQHKIAIFLTSVDEKLQALKKKKELLEQYKK 197 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 Q L S + + D + +G + A K + + Sbjct: 198 GAMQKLFSQELRFKQDDGSAFPDWEEKKLGDI------FDIKAGGDIDKSKVSDIKTGLY 251 Query: 257 LSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 Y N +K S + + G + +N +R ++ + Sbjct: 252 RYPIYSNSEKEKGLFGYSNSYSISEKCLTVTGRGRLGIAHARFENFYPIVRLLVLIPKIP 311 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + ++ +A+ S L + V P EQ Sbjct: 312 ANV-----------------VFYENIINQLNFAIESTGVPQLTSPQISSYKVHYPSFTEQ 354 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + + + ID +E + + I +E + + Sbjct: 355 EKIADFL----SSIDGSIENVGKQIEASQEWKKGLLQK 388 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 60/189 (31%), Gaps = 12/189 (6%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + E+N K + + + + E + + Y + + + Sbjct: 21 KLGEVFIEVNEKVGQRNLETYSITAGQGFVSQKEKFGRDISGQQNAKYTALQVNQFAYNK 80 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + K ++ I + + S + A L ++ L K + S Sbjct: 81 GNSKKYKYGCVYLNTTDKQIAVPNVFISFKLIDNEMSSVFYAKLFENHYLDKGLRQIISS 140 Query: 353 LRQ-----SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + ++ + +L ++VP EQ I + ++ L ++ LL++ + Sbjct: 141 SARMDGLLNVNKKYFFQLKIIVPTTPEQHKIAIFLTSVDEKLQAL----KKKKELLEQYK 196 Query: 408 SSFIAAAVT 416 + + Sbjct: 197 KGAMQKLFS 205 >gi|195867489|ref|ZP_03079493.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 9 str. ATCC 33175] gi|195660965|gb|EDX54218.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 9 str. ATCC 33175] Length = 405 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 49/398 (12%), Positives = 123/398 (30%), Gaps = 20/398 (5%) Query: 32 RFTKLNTGRTSESGKD----------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 +++ +GR ++ K+ I ++ ++++ + + + N + S V + Sbjct: 10 DISEIISGRGPKNVKNLQDFASQHGKINWLLVKNLINNSINNDFEKYNLDEEKHSLVKL- 68 Query: 82 AKGQILYGKLGPYLRKAIIADFDG-ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 K +++Y AI +D + F + P + + + I ++ Sbjct: 69 NKNELVYSMYATPGIVAINEFYDNLYINQSFCKIIPNENICLKKFLFYWLIKNKNYALSL 128 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL---ITERIRFIELLKE 197 G T S+ + I N + +PP+ EQ I I + Sbjct: 129 SSGTTQSNLNINKIRNFVIYLPPIEEQNAIISIIEPHEKLFIKYSNLVDISSVENTKKDV 188 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--TKLIESN 255 + + K + ++ ++ + A + E + K+ Sbjct: 189 DNLISIIEPLEKSIKTINLLQTKIGLFIEKTFNFINNNLANADLIEFSLKDLLNIKRGLP 248 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I N + K Y + I + + + Sbjct: 249 ITEKDLLNNPGNYPLISASSKNNGIFGYFNDYMYDGKNITISMNGNAGCIFYQIGKFSAN 308 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 ++ + + + + ++ R L +++ VL+P ++ Q Sbjct: 309 SDVLVLSNSNKNLTNIDYIYYLLKTKEKEIQNLAIGTTRFRLGNSVIEKFKVLLPNMEIQ 368 Query: 376 FDITNVINVETARIDVLVEKIEQSIV--LLKERRSSFI 411 + + ++ + V KIE+++ LLK + I Sbjct: 369 KEFSKIVEPLL-NLSTKVNKIEKNLNECLLKIVKKLII 405 >gi|29349948|ref|NP_813451.1| putative typeI restriction enzyme MjaXP specificity protein [Bacteroides thetaiotaomicron VPI-5482] gi|29341859|gb|AAO79645.1| putative Type I restriction enzyme MjaXP specificity protein [Bacteroides thetaiotaomicron VPI-5482] Length = 428 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 56/406 (13%), Positives = 127/406 (31%), Gaps = 42/406 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTS 76 W I + G T ++ G DI + ++ ++ + + D S Sbjct: 50 EWNKYTINDLATVVGGGTPDTTVKSYWGGDIQWFTPSEIGKNKYVDFSKRTITRDGLDNS 109 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + IL +I ++ + F L K + + L + Sbjct: 110 SAKLLPLHTILLSSRATVGECSIASNEC-TTNQGFQSLIAKQCN--IDFLYYLIQTKKKD 166 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G+T I I + +P EQ I + + ID I + + I+ LK Sbjct: 167 LIRNACGSTFLEISANEIRKIKVAVPVQNEQEQIAKLL----SLIDERIATQNKIIDKLK 222 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + L + GL K + V ++ + + Sbjct: 223 SLIKGLPHKMAEIGLQK-----------------GCWEKVLLSTVLVERKELNSELYTVH 265 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI- 315 +I ++E + Y + G+I++ + + +E+ + Sbjct: 266 SVSVSEGVINQIEYLGRSFAAKDTSNYHVARYGDIIYTKSPTGDFPYGIVKQSYIEQPVA 325 Query: 316 ITSAYMAVKPHGIDS--TYLAWLMRSYDLCKVFY-AMGSGLRQSLKFED--VKRLPVLVP 370 I+ Y P ++ + M S Y + G + ++ + + +P Sbjct: 326 ISPLYGVYSPTSFETGVYLHYYFMSSVLAKNYLYPLIQKGAKNTINISNQRFLENRIALP 385 Query: 371 PIKE-QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + +I + ++D +EK ++ L ++RS + Sbjct: 386 LKQTDRHNIARALITIQKKLD--IEKC--AMDSLTKQRSYLLQQLF 427 >gi|315148960|gb|EFT92976.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX4244] Length = 387 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 60/399 (15%), Positives = 118/399 (29%), Gaps = 42/399 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W++ + + R S +I + + P S DT T + Sbjct: 18 EDWELCKLSTEFEKVNERNDGSLGKEHWISVAKMYFQN----PDKVQSNNIDTRTY-VMR 72 Query: 83 KGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G I + K + R G+ S F + + K + I+ Sbjct: 73 TGDIAFEGHPNKEFKFGRFVANDIGTGVVSELFPIYRHKQEYDNYYWKNAIQIERVMGPI 132 Query: 139 AICE----GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 G + + D + IP L EQ I + +IDT I R ++ Sbjct: 133 FAKSITSSGNSSNKLDPNHFLRQQVFIPKLEEQSKIGLFL----KKIDTTIALHQRKLDQ 188 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LKE K+A + + K++ + E WE + K Sbjct: 189 LKELKKAYLQVMFPVKDERVPKLRLADFEG------EWEQCKLGDITKISTGKLDANAM- 241 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 +E + Y+I P N + Sbjct: 242 -------------VENGKYDFYTSGIKKYRIDVPAFEGPAITIAGNGATVGYMHLADNKF 288 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIK 373 ++ +D ++L + + K+ +G + + + L + +P Sbjct: 289 NAYQRTYVLQEFVVDRSFLFSEVGNKLPKKINQEARTGNIPYIVMDMLTELKLSIPQDEA 348 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ I + +ID + + + LK+ ++S++ Sbjct: 349 EQSKIGSF----FKQIDKTIALHQNKLEQLKDLKTSYLQ 383 >gi|303255274|ref|ZP_07341345.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS455] gi|302597743|gb|EFL64818.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS455] Length = 300 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 67/202 (33%), Gaps = 15/202 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S +++ + + + IE + N I++L T+ E Sbjct: 107 SKSQYLRDHSTGATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDE- 165 Query: 280 YETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 V+ G+++ ++ A + + V + + W + Sbjct: 166 ----HKVEIGDVIISRMNTSELVGAAGYVWAINSDNIYLPDRLWKVILNDRVNPVFLWKL 221 Query: 338 ----RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++ K + SG +++ + ++ V PP+ Q + + + A +D Sbjct: 222 ITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRVPFPPLALQNEFADFV----ALVDKSQ 277 Query: 394 EKIEQSIVLLKERRSSFIAAAV 415 I++S+ L+ + S + Sbjct: 278 LAIQKSLEELETLKKSLMQEYF 299 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 29/253 (11%), Positives = 73/253 (28%), Gaps = 8/253 (3%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + + E E ++ Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITK---RKFQLDEHKVEIGDVII 175 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + T L + +PD V + E L + + Sbjct: 176 SRMNTSELVGAAGYVWAINSDNIYLPDRLWKVILNDRVNPVFLWKLITNEKTKLKIKRIS 235 Query: 264 IIQKLETRNMGLK 276 +N+ Sbjct: 236 SGTSGSMKNISKS 248 >gi|3335668|gb|AAC78319.1| restriction-modification enzyme MpuUVI S subunit [Mycoplasma pulmonis] Length = 399 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 45/370 (12%), Positives = 110/370 (29%), Gaps = 19/370 (5%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVDENIAKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + + +P L Q I + I + E +K +++ Sbjct: 116 AYGYLKNYNINDFEVNLPNLKTQSAIIKIIEPKEDLFFRHKNLVRIDSEENTKKDLSILI 175 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 I+ L + D I H+ F I + G I Sbjct: 176 KIIEP-LEKQINAFDELILSEQKSLQHYLNYFFGKFYQIEPSLFHDYKLEKIAKIRRGKI 234 Query: 265 IQKLETRNM---------GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I + + K Y + + I + I Sbjct: 235 INSFDLKENPGDYPVISSNTKNNGIFGYLNSYMYDGEYITISADGAYAGTVFLNNGKFSI 294 Query: 316 ITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 ++ + ++ + +L + ++ + ++ R S++ + + + +P ++ Sbjct: 295 TNVCFILLLNDKVNLLTKFLFYYLKKNENIIQKKSIVGSSRPSVREYTLSEIAIKIPSLE 354 Query: 374 EQFDITNVIN 383 Q I +I Sbjct: 355 IQSAILGIIE 364 >gi|325989582|ref|YP_004249281.1| type I restriction-modification system, specificity protein, probable fragment [Mycoplasma suis KI3806] gi|323574667|emb|CBZ40320.1| Type I restriction-modification system, specificity protein, probable fragment [Mycoplasma suis] Length = 390 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 55/409 (13%), Positives = 112/409 (27%), Gaps = 36/409 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W++V + + +++ G + IG ++V L + N Sbjct: 5 WELVTLDKLGRISKGIQKHKPNHDKKLFCFGKVPLIGCKEVSDSRLTVLKSNRNYNFYGL 64 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL---SID 132 +F K + + G + + + F+ S+ P + + Sbjct: 65 LQSKLFPKNTVCVVETGSLVTDSALLKFEACLSSDLYGFIPFSKISTPTFIKYCLDAPKN 124 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + T H + + P PPL Q I E + + +D + Sbjct: 125 KRKLKNLASLYITQPHLTLSKLFQVKFPKPPLEIQQKIGEILSRYDLILDNHERQIELLK 184 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L +L K PD + S +P+ W F L K Sbjct: 185 NLKA----SLFKEWFIKLRFPDYEKYSSE----NGIPEGWRKIRFGDLTEIQIGKKPASH 236 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + L +V G + Sbjct: 237 SELLDGLGKYPFFTCSTKTKNSYTFSYDFPSLLVSAG------------GAYHCKFYDGK 284 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 T ++ S + + L K+ S ++L + +K + +L+P Sbjct: 285 FEASTHVLVSKLKFRKFSYLILEALNLVHLPKLQRFTFSVAIKNLSPQKLKEIEILIPD- 343 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N I +EK+E + +E + + + + +I + Sbjct: 344 ---QKILEKFNNFWKNIHSKIEKLELKMQKYEEIKKKLLDSLFSQEIQV 389 Score = 43.6 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 64/203 (31%), Gaps = 16/203 (7%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGK 62 + Y +Y S IP+ W+ + T++ G+ S +++ D Sbjct: 199 RFPDYEKY-SSE----NGIPEGWRKIRFGDLTEIQIGKKPASHSELL-----DGLGKYPF 248 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + S + +L G Y D ST LV + K Sbjct: 249 FTCSTKTKNSYTFS----YDFPSLLVSAGGAY--HCKFYDGKFEASTHVLVSKLKFRKFS 302 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L L++ +++ + + + + I + IP +I+ Sbjct: 303 YLILEALNLVHLPKLQRFTFSVAIKNLSPQKLKEIEILIPDQKILEKFNNFWKNIHSKIE 362 Query: 183 TLITERIRFIELLKEKKQALVSY 205 L + ++ E+ K+ +L S Sbjct: 363 KLELKMQKYEEIKKKLLDSLFSQ 385 >gi|265759583|ref|ZP_06090997.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_33FAA] gi|263233385|gb|EEZ19059.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_33FAA] Length = 237 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 74/197 (37%), Gaps = 7/197 (3%) Query: 229 PDHWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESYE---TY 283 P+ WE +V EL ++ L I L GNI L S Sbjct: 11 PNGWEWCNLEDIVCELKYGTSEKSLSVGKIAVLRMGNITNVGTIDYSNLVYSSNNEDIKL 70 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++ +++F + + + I + ++P I S YL +M S Sbjct: 71 YSLEKDDLLFNRTNSSEWVGKTAIYKKEQPAIYAGYLIRIRPILIFSDYLNTVMNSSYYR 130 Query: 344 KVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 Y + + + ++ + + +L + +PP+KEQ I + + ID + E Sbjct: 131 NWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLIDTIKNSKEDLQT 190 Query: 402 LLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 191 TIKQAKSKILNLAIHGK 207 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 80/214 (37%), Gaps = 10/214 (4%) Query: 21 IPKHWKVVPIKRF-TKLNTGRTSESGK--DIIYIGLEDVES-GTGKYLPKDGNSRQSDTS 76 +P W+ ++ +L G + +S I + + ++ + GT Y +S D Sbjct: 10 LPNGWEWCNLEDIVCELKYGTSEKSLSVGKIAVLRMGNITNVGTIDYSNLVYSSNNEDIK 69 Query: 77 TVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S+ K +L+ + + AI +L+ ++ +++ Sbjct: 70 LYSL-EKDDLLFNRTNSSEWVGKTAIYKKEQPAIYAGYLIRIRPILIFSDYLNTVMNSSY 128 Query: 134 TQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + S+ + + + + +PIPPL EQ I ++ IDT+ + Sbjct: 129 YRNWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLIDTIKNSKEDL 188 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 +K+ K +++ + L P + IE + Sbjct: 189 QTTIKQAKSKILNLAIHGKLVPQDPNDEPAIELL 222 >gi|308184242|ref|YP_003928375.1| Type I restriction/modification specificity protein [Helicobacter pylori SJM180] gi|308060162|gb|ADO02058.1| Type I restriction/modification specificity protein [Helicobacter pylori SJM180] Length = 413 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 58/410 (14%), Positives = 116/410 (28%), Gaps = 35/410 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGT-GKYLPKDGNSRQSDT 75 W+ +K K+ TG+T ++ ++I D+ P+ + + Sbjct: 2 SEWQTFCLKDLGKIVTGKTPKTSNLDFFNGKYMFITPNDLHGTYRIIKTPRTLSDSGLKS 61 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + IL G +G + D + Q + + + + Sbjct: 62 IQNNTIDNTSILVGCIGDVGMVRMCFDKCA-TNQQINSITDIKDFCNPYYLYYYLSNKKE 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI--- 192 + I + I + +P + Q I + +I+ Sbjct: 121 LFKNIALSTVVPIIPKTIFQEIEVLLPNIETQQKIARTLSVLDQKIENNHKINELLHKIL 180 Query: 193 ---ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + + KMK S E L+P+ +EVK LV + + Sbjct: 181 ELLYEQYFVRFDFLDENNKPYQTSGGKMKFS-KELNRLIPNDFEVKTLGELVDIFSGYSF 239 Query: 250 KLIESNILSLSYGNIIQK---------LETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + + Y I K T N+ P+ Y +++P I+ Sbjct: 240 QSNTYSNNKNDYILITNKNVQHSLVDLSVTTNLLFLPKKLPKYCLLEPTNILITLTGHIG 299 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SGLRQSLK 358 + S + I+ V P + + L+R+ + +Q+L Sbjct: 300 RCALVFS----KNCILNQRVGVVLPKEKELNPFYYSLIRNPLFSAILQRNAIGSSQQNLS 355 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 D ++ + I + I L+ QS L R Sbjct: 356 PIDTLKIQIPF-----NHKIIKQYSKTCENIIKLLVSNMQSTQTLTALRD 400 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 8/158 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKY-LPKDGNSRQS 73 IP ++V + + +G + + D I I ++V+ + + Sbjct: 218 IPNDFEVKTLGELVDIFSGYSFQSNTYSNNKNDYILITNKNVQHSLVDLSVTTNLLFLPK 277 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSID 132 + IL G R A++ + I + + V+ PK+ L + + Sbjct: 278 KLPKYCLLEPTNILITLTGHIGRCALVFSKNCILNQRVGVVLPKEKELNPFYYSLIRNPL 337 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 + ++ G++ + I +P + Sbjct: 338 FSAILQRNAIGSSQQNLSPIDTLKIQIPFNHKIIKQYS 375 >gi|206603919|gb|EDZ40399.1| Putative Type I Restriction modification system, S subunit [Leptospirillum sp. Group II '5-way CG'] Length = 360 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 66/395 (16%), Positives = 131/395 (33%), Gaps = 39/395 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W VP++ + ++ LE +E TG L K+ + + S+ F G Sbjct: 3 WPAVPLREIAPPKASTQPFPDSFVWHLSLEQIEGDTGAVLAKNYDYSGNVGSSTFYFDTG 62 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFL--VLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 +LY KL PYL K ++ D GI +T+ + PK + P L +L S + ++ Sbjct: 63 NVLYSKLRPYLNKVVVPDEPGIATTELIPLRPDPKVLNPRYLAFYLRSPNFVKQASHHVA 122 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G M +P+PPL+EQ I E + + + ++ + Sbjct: 123 GTKMPRVVMDWFWKHKIPLPPLSEQKRIVEILDEADRIRRSRREANQKAERIIPALFLKM 182 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 V+ P W K F + + N KL L Sbjct: 183 FGDPVSN-------------------PKGWPTKLFADIFRDTTAGNKKLQSKQFLEFGRI 223 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 ++ + +++ G + Y+ P + + D + + Sbjct: 224 AVVDQGQSQIAGYTDDVALAYKGTFP-------VIVFGDHTRIFKFVDHPFVLGADGVRV 276 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + A+ C++ +G + + +K + P Q N Sbjct: 277 LITKPRYNPLFAYW-----HCQLLNMPIAGYSRHF--KFLKEKFFICPDKGLQDRFANFA 329 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 ++ T++ + E + ++ S + A +G Sbjct: 330 SIVTSQ----ISVFENAADRVERLFSVMLDRAFSG 360 >gi|171920601|ref|ZP_02931852.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 13 str. ATCC 33698] gi|171903311|gb|EDT49600.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 13 str. ATCC 33698] Length = 409 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 48/392 (12%), Positives = 122/392 (31%), Gaps = 24/392 (6%) Query: 29 PIKRFTKLNTGRTSESGKD----------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 I +++ +GR ++ K+ I ++ ++++ + + + N + S V Sbjct: 7 KILDISEIISGRGPKNVKNLQDFASQHGKINWLLVKNLINNSINNDFEKYNLDEEKHSLV 66 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG-ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + K +++Y AI +D + F + P + + + I Sbjct: 67 KL-NKNELVYSMYATPGIVAINEFYDNLYINQSFCKIIPNENICLKKFLFYWLIKNKNYA 125 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL---ITERIRFIEL 194 ++ G T S+ + I N + +PP+ EQ I I + Sbjct: 126 LSLSSGTTQSNLNINKIRNFVIYLPPIEEQNAIISIIEPHEKLFIKYSNLVDISSVENTK 185 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN--TKLI 252 + + K +N +K + D + + + T Sbjct: 186 KDVDNLISIIEPIEKVINNIKNIKIKIESLINKYFDFLYSDLEDSNFKKYILGDLFTINR 245 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I S N I + K Y + F I Q + Sbjct: 246 GQIINSKYIYNNIGPYPVVSSNTKNNGIFGYINSYMYDGEFITISADGAYAGTVFLQNGK 305 Query: 313 RGIITSAYMAVKPHG----IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 I ++ +K ++ ++ ++++ + R +++ +K + + Sbjct: 306 FSITNVCFILMKNKDIDFKFNNKFVYYILKKEQEINRLKSQVGSSRPAVREYSLKEIKIN 365 Query: 369 VPPIKEQF---DITNVINVETARIDVLVEKIE 397 +P ++ Q I + + + + + + + Sbjct: 366 LPNMEIQEEFSKIVEPLLNLSTKANRIEKILN 397 >gi|189499173|ref|YP_001958643.1| N-6 DNA methylase [Chlorobium phaeobacteroides BS1] gi|189494614|gb|ACE03162.1| N-6 DNA methylase [Chlorobium phaeobacteroides BS1] Length = 775 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 50/364 (13%), Positives = 108/364 (29%), Gaps = 53/364 (14%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W +V + ++ G + G G + S +G Sbjct: 453 WPMVELVEVAEILKGSAITKKD-----------TKHGNIPVIAGGQEPAYYHNKS-NREG 500 Query: 85 QIL-YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ G Y S + + + + + + I +G Sbjct: 501 DVITVSASGAYAGFVNYFTIPIFASDCSTIQTKDENIVSTRYLFSILKAKQEDIYEFQQG 560 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET------VRIDTLITERIRFIELLKE 197 H K + I +P+PPL Q I ++ +I +I ++ Sbjct: 561 GGQPHVYPKDLKTIKIPLPPLEIQEQIVAELDGYAGIIAGAKQIAQNWKPKIEIDPEWEK 620 Query: 198 KKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K +S VTKG P + ++SGI ++ Sbjct: 621 VKLGEISDRVTKGTTPTTNGFQFQESGINFI----------------------------- 651 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I S+ G + + ++ + + +I+F S+ S+ + Sbjct: 652 KIESIDDGGYFIREKLAHINQECNESLKRSQLKENDILFSIAGALGRVASIESSILPAN- 710 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIK 373 + + +DS YL ++RS + + + G + +L V + +P ++ Sbjct: 711 TNQALAIISPKKELDSKYLEQVLRSDLIQNQIFGLKVGVAQSNLSLAQVSDFEIPLPSLE 770 Query: 374 EQFD 377 + Sbjct: 771 IKNK 774 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 47/166 (28%), Gaps = 2/166 (1%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S I G + T +L+E + + + N+ + Sbjct: 427 SKIAENGDYNLSGDRYRVATDYTNAKWPMVELVEVAEILKGSAITKKDTKHGNIPVIAGG 486 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 E + I + I S ++ + +L Sbjct: 487 QEPAYYHNKSNREGDVITVSASGAYAGFVNYFTIPIFASDCSTIQTKDENIVSTRYLFSI 546 Query: 340 YDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + ++ G + + +D+K + + +PP++ Q I ++ Sbjct: 547 LKAKQEDIYEFQQGGGQPHVYPKDLKTIKIPLPPLEIQEQIVAELD 592 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 54/157 (34%), Gaps = 12/157 (7%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQ 72 I W+ V + + + G T + I +I +E ++ G K + Q Sbjct: 613 EIDPEWEKVKLGEISDRVTKGTTPTTNGFQFQESGINFIKIESIDDGGYFIREKLAHINQ 672 Query: 73 SDTSTVSI--FAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWL 128 ++ + IL+ G R A I + ++ PK L + Sbjct: 673 ECNESLKRSQLKENDILFSIAGALGRVASIESSILPANTNQALAIISPKKELDSKYLEQV 732 Query: 129 LSID-VTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 L D + +I + G S+ + + +P+P L Sbjct: 733 LRSDLIQNQIFGLKVGVAQSNLSLAQVSDFEIPLPSL 769 >gi|323971896|gb|EGB67120.1| type I restriction modification DNA specificity domain-containing protein [Escherichia coli TA007] Length = 390 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 48/395 (12%), Positives = 106/395 (26%), Gaps = 54/395 (13%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL-----PKDGNSRQSDTSTVSI 80 + + + K G +G+ ++ + D I Sbjct: 17 EWQTLGKVLKRTKGTKITAGQ------MKALHKDNAPLKIFAGGKTVAFVDFKDIPEKDI 70 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + I+ G + D + + + + + I Sbjct: 71 NREPSIIVKSRGII--EFEYYDKPFSHKNEMWSYHSNNDAISIKYIYYFLKINEGYFQKI 128 Query: 141 CEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIE 193 M +PIP LA Q I + T L E + Sbjct: 129 GGKMQMPQIATPDTDKFEVPIPCPDNPEKSLAIQSEIVRILDKFTALTAELTAELTAELN 188 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKL 251 + K++ +++ K+ +EW +G V + T + + Sbjct: 189 MRKKQYNYYRDQLLS--------FKEGEVEWKTLGEV---------AVIGTGNHDTQDAI 231 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + G KL + + Sbjct: 232 EHGKYIFYARGREPLKLNVFDFDETA---------------IITAGDGAGVGKVFHYAKG 276 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + AY V ++ ++ + +Y + A S SL+ + P+ VPP Sbjct: 277 KYALHQRAYRIVPNAFMNPRFVYHYITAYFFTYIQKASVSSSVTSLRRPMFLKFPIPVPP 336 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +EQ I +++ + + E + + I L +++ Sbjct: 337 SEEQARIVEILDKFDTLTNSITEGLPREIELRQKQ 371 >gi|227510762|ref|ZP_03940811.1| possible type I site-specific deoxyribonuclease specificity subunit [Lactobacillus brevis subsp. gravesensis ATCC 27305] gi|227189764|gb|EEI69831.1| possible type I site-specific deoxyribonuclease specificity subunit [Lactobacillus brevis subsp. gravesensis ATCC 27305] Length = 304 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 21/103 (20%), Positives = 42/103 (40%), Gaps = 8/103 (7%) Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLVPPI 372 + Y + H +D YL S + G R ++K +D ++P+ +P + Sbjct: 2 SPLYYIFRAHNVDGMYLEKYFSSTKWHRFMELNGDTGARADRFAIKDKDFVQMPIPLPNL 61 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +EQ I + D L+ ++ + LLKE + ++ Sbjct: 62 EEQSKIARFLENV----DNLIAANQRKLDLLKELKQGYLQKLF 100 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 46/298 (15%), Positives = 98/298 (32%), Gaps = 20/298 (6%) Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT---MSHADWKGIGNIPMPIPPL 164 S + + + +V L+ + S + +E + K +P+P+P L Sbjct: 2 SPLYYIFRAHNVDGMYLEKYFSSTKWHRFMELNGDTGARADRFAIKDKDFVQMPIPLPNL 61 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 EQ I + I R ++LLKE KQ + + + + +++ +G Sbjct: 62 EEQSKIARFLENVDNL----IAANQRKLDLLKELKQGYLQKLFPQNGSKFPQLRFAGFAD 117 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYE 281 +V T + Q+ +++ E Sbjct: 118 AWEPRKLGDVANIVGGGTPSTSILEYWNGNIDWYAPAEIGEQRYVSKSQKTITELGLKKS 177 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + I+ G I+F L S R + ++ P + Sbjct: 178 SATILPVGTILFTSRAGIGKTAILAS-----RAATNQGFQSIVPRTEMLNSYFIFSETSK 232 Query: 342 LCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 L K G+G + + ++++P+++P +KEQ I ++D L+ ++ Sbjct: 233 LKKYGEITGAGSTFVEVSGKQMEKMPIILPILKEQEIIGKF----FKQLDKLIAANQR 286 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 65/186 (34%), Gaps = 8/186 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV----ESGTGKYLPKDGNSRQS---DTST 77 W+ + + G T + + G D E G +Y+ K + S+ Sbjct: 119 WEPRKLGDVANIVGGGTPSTSILEYWNGNIDWYAPAEIGEQRYVSKSQKTITELGLKKSS 178 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 +I G IL+ + AI+A + F + P+ + + + + + Sbjct: 179 ATILPVGTILFTSRAGIGKTAILA-SRAATNQGFQSIVPRTEMLNSYFIFSETSKLKKYG 237 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E G+T K + +P+ +P L EQ +I + I + + EL K Sbjct: 238 EITGAGSTFVEVSGKQMEKMPIILPILKEQEIIGKFFKQLDKLIAANQRKVEKLKELKKG 297 Query: 198 KKQALV 203 Q + Sbjct: 298 YMQKMF 303 >gi|331018717|gb|EGH98773.1| type I restriction-modification system specificity determinant [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 347 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 47/337 (13%), Positives = 112/337 (33%), Gaps = 26/337 (7%) Query: 107 CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 S + + ++P L + S+D +++A+ +T + + + +PP+ Sbjct: 12 TSLTYYRVDQTKLIPLYLAAFFSSVDFQNQLKAVMGLSTRNQVPITAQRKLNVVVPPIEN 71 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS-----YIVTKGLNPDVK----- 216 Q I + + RI L + + ++ +GL P+ Sbjct: 72 QRYIADTLGTLDDRISMLREINTTLEAIAQALFKSWFVDFDPVRAKAEGLEPEGMDAATA 131 Query: 217 --MKDSGIE-WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 DS E +GLVP W + ++ I + Sbjct: 132 ALFPDSFEELELGLVPSGWGCGVLGDVADTTRKQIQPSAMKAETLYVGLEHIPRQSLGLD 191 Query: 274 GLKPES--YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + G+I+F + K + G+ ++ + D Sbjct: 192 SWASTDGLESAKSCFEKGDILFGKLRPYFHKIVIAPF----AGVCSTDILVCNAKVADYY 247 Query: 332 YLA-WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + S +L + +G + ++D+ +++PP++ + ++V++ +I Sbjct: 248 GFVAMQLFSTELVAYADRLSNGAKMPRVNWKDLSDYALVIPPVEVAAEYSDVVHPLFEQI 307 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 E L + R + + ++GQ+ L E++ Sbjct: 308 TA--NVHEAK--TLGQLRDTLLPRLISGQLRL-PEAE 339 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 47/190 (24%), Positives = 75/190 (39%), Gaps = 5/190 (2%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G +P W + + S + +Y+GLE + + S Sbjct: 143 LGLVPSGWGCGVLGDVADTTRKQIQPSAMKAETLYVGLEHIPRQSLGLDS--WASTDGLE 200 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE-LLQGWLLSIDVT 134 S S F KG IL+GKL PY K +IA F G+CST LV K + L S ++ Sbjct: 201 SAKSCFEKGDILFGKLRPYFHKIVIAPFAGVCSTDILVCNAKVADYYGFVAMQLFSTELV 260 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + GA M +WK + + + IPP+ + + +I + E +L Sbjct: 261 AYADRLSNGAKMPRVNWKDLSDYALVIPPVEVAAEYSDVVHPLFEQITANVHEAKTLGQL 320 Query: 195 LKEKKQALVS 204 L+S Sbjct: 321 RDTLLPRLIS 330 >gi|148927731|ref|ZP_01811170.1| restriction modification system DNA specificity domain [candidate division TM7 genomosp. GTL1] gi|147886922|gb|EDK72453.1| restriction modification system DNA specificity domain [candidate division TM7 genomosp. GTL1] Length = 335 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 42/323 (13%), Positives = 95/323 (29%), Gaps = 13/323 (4%) Query: 100 IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPM 159 + + V +L ++ ++ G T + + I + Sbjct: 20 YKSKAYLVQGKIWVNNHAHILLARNNKYVKYALNYVDYQSYVTGTTRLKLNQSALKRIII 79 Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 P P EQ I KI ID + K +Q+++ + K ++ Sbjct: 80 PFPDENEQKRIVAKIEELFSEIDNAESAITTASGYYKSYEQSIIDSLFAKYEAEAEMVEF 139 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 I + + + + + + E + + + E Sbjct: 140 GDIAEIKGGITKGRKLRGMPIGETPYLRVANVQD---------GYLYLDEIKTINVTAEE 190 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLM 337 Y +++ + D R +E I + + Y+++ Sbjct: 191 LRKYSLMNGDILFTEGGDKDKLGRGTIWHGEIELCIHQNHIFRARVDSGQFVPEYISYAT 250 Query: 338 RSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++ F + SL +K L + P+ +Q +I I + + I ++ Sbjct: 251 KTTRARDYFLSKAKQTTNLASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKE 310 Query: 396 IEQSIVLLKERRSSFIAAAVTGQ 418 + + K R S +A A G+ Sbjct: 311 LIVAHHRSKALRQSILAKAFKGE 333 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 69/200 (34%), Gaps = 14/200 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 ++V ++ G T + Y+ + +V+ G + ++ Sbjct: 135 EMVEFGDIAEIKGGITKGRKLRGMPIGETPYLRVANVQDGYLYLDEIKTINVTAEELRKY 194 Query: 80 IFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G IL+ + G R I +C Q + + + + + ++ T R Sbjct: 195 SLMNGDILFTEGGDKDKLGRGTIWHGEIELCIHQNHIFRARVDSGQFVPEYISYATKTTR 254 Query: 137 IEAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + ++ + + N+ +P PLA+Q I E I+ + I + E I Sbjct: 255 ARDYFLSKAKQTTNLASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKELIVA 314 Query: 192 IELLKEKKQALVSYIVTKGL 211 K +Q++++ L Sbjct: 315 HHRSKALRQSILAKAFKGEL 334 >gi|71906941|ref|YP_284528.1| restriction modification system DNA specificity subunit [Dechloromonas aromatica RCB] gi|71846562|gb|AAZ46058.1| Restriction modification system DNA specificity domain [Dechloromonas aromatica RCB] Length = 285 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 38/288 (13%), Positives = 86/288 (29%), Gaps = 17/288 (5%) Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 A+ ++ +G + + +P + +Q I + I+ E + Sbjct: 3 HGAASQANVSPSQVGGLEIVLPNIEQQRRIASILSTYDDLIENNTRRIAILE----EMAR 58 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR------KNTKLIES 254 + P + G++P+ W++ + K+ + Sbjct: 59 RIYEEWFVHFRFPLHEQVKMVESEFGVIPEGWKITSLGEAFNIVLGGTPSRNKSEYWDQG 118 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I ++ G + T + I + S E Sbjct: 119 TIPWINSGKVNDLRITTPSEYITDLGLKKSAAKLMPAATTVIAITGATLGQVSYLCTEMS 178 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 S G S Y+ L+++ + + G +Q + E V + +++PP Sbjct: 179 ANQSVVGVFDASGKYSEYIYRLIQN-RIMAIIQHASGGAQQHINKEIVNDVVLVLPPDD- 236 Query: 375 QFDITNVINVETAR-IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + TA I L+ + L+ R + V+G++D+ Sbjct: 237 ----VLSLFNNTALPIGELINTLLHKNANLRTTRDLLLPKLVSGELDV 280 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 57/201 (28%), Gaps = 12/201 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSR 71 G IP+ WK+ + + G T K I +I V + Sbjct: 84 GVIPEGWKITSLGEAFNIVLGGTPSRNKSEYWDQGTIPWINSGKVNDLRITTPSEYITDL 143 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S + + G L + + + V + L Sbjct: 144 GLKKSAAKLMPAATTVIAITGATLGQVSYLCTEMSANQSV-VGVFDASGKYSEYIYRLIQ 202 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + I G H + + + ++ + +PP + + I LI + Sbjct: 203 NRIMAIIQHASGGAQQHINKEIVNDVVLVLPP----DDVLSLFNNTALPIGELINTLLHK 258 Query: 192 IELLKEKKQALVSYIVTKGLN 212 L+ + L+ +V+ L+ Sbjct: 259 NANLRTTRDLLLPKLVSGELD 279 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 14/60 (23%), Positives = 31/60 (51%), Gaps = 4/60 (6%) Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 A G+ + ++ V L +++P I++Q I ++++ D L+E + I +L+E Sbjct: 1 MAHGAASQANVSPSQVGGLEIVLPNIEQQRRIASILST----YDDLIENNTRRIAILEEM 56 >gi|332969663|gb|EGK08679.1| restriction modification system DNA specificity protein [Desmospora sp. 8437] Length = 241 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 60/181 (33%), Gaps = 17/181 (9%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 I S + + + + E + Y + +++F I + + A Sbjct: 21 DDNICSFVPMAAVDDFTGSISVLEKRPFGEVKKGYTYFEENDVLFAKITPCMENGKVAVA 80 Query: 309 ----QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDV 362 G + I + L+RS K A+ +G +Q + + Sbjct: 81 KGLINNFGFGTTEFHVIRCSHLNIHPRLVYHLVRSDFFRKQAKAVMTGAVGQQRVPKLFL 140 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ----SIV-LLKERRSSFIAAAVTG 417 + P VPP EQ +I V++ D+L+ + E I L S + A G Sbjct: 141 EGYPFPVPPFDEQEEIVKVVD------DLLMHEYETFTTLEIEGHLNSLTQSILTQAFRG 194 Query: 418 Q 418 + Sbjct: 195 E 195 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 69/210 (32%), Gaps = 14/210 (6%) Query: 29 PIKRFTKLNTGR----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + T +N G+ ++ + V+ TG + + F + Sbjct: 2 RLGELTVINPGKPRKLEYPDDNICSFVPMAAVDDFTGSISVLEKRPFGEVKKGYTYFEEN 61 Query: 85 QILYGKLGPYLRKAIIADFDGICST--------QFLVLQPKDVLPELLQGWLLSIDVTQR 136 +L+ K+ P + +A G+ + + ++ P L+ + S ++ Sbjct: 62 DVLFAKITPCMENGKVAVAKGLINNFGFGTTEFHVIRCSHLNIHPRLVYHLVRSDFFRKQ 121 Query: 137 IEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +A+ G + P P+PP EQ I + + + T L Sbjct: 122 AKAVMTGAVGQQRVPKLFLEGYPFPVPPFDEQEEIVKVVDDLLMHEYETFTTLEIEGHL- 180 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 Q++++ L ++S +E + Sbjct: 181 NSLTQSILTQAFRGELGTHDPAEESALELL 210 >gi|227487584|ref|ZP_03917900.1| restriction-modification system specificity determinant [Corynebacterium glucuronolyticum ATCC 51867] gi|227092402|gb|EEI27714.1| restriction-modification system specificity determinant [Corynebacterium glucuronolyticum ATCC 51867] Length = 384 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 48/402 (11%), Positives = 102/402 (25%), Gaps = 28/402 (6%) Query: 23 KHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + W V + +D ++ L+ + G I Sbjct: 4 EQWPTVKLGTLLSPVGVAERITQPEDETFVTLK-LHGGGAVPRNIGAGKTPKPFIGFRI- 61 Query: 82 AKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR-- 136 Q +Y ++ I + S F V +++ + + ++ Sbjct: 62 RTNQFIYSRIDARNGAFAIVPKALDGAVVSKDFPVFSIGELVESRYLAYFCTTPSFEKLV 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++ +P+PPL EQ I K+ I + Sbjct: 122 QVKSSGATNRQRIKEDLFLSLEIPLPPLEEQRRIARKLSLNQSTILRIQKSIEMLENFRV 181 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + + L +G D + + L+ Sbjct: 182 QSAVRMFESARQTLL-------------LGDFCDTFGGTSLPTESPFKGEDSGILLMRVS 228 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 S GN + T + + + G + R A Sbjct: 229 DMNSVGNELFINSTVSWSDDDSFMKRNFVAPAGSTILPKRGASISTNKKRLAVRPTYLDP 288 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + + + +++DL + L +D+ L + +P Q Sbjct: 289 NLMGVLPDSTVLKGVCMYYWFKTFDLNSI---TSGSSVPQLNKKDLTPLQIPIPDPNTQD 345 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + ++Q I L +E +SS A TG+ Sbjct: 346 MFV----KLFNQTLAIERHLQQQIALARELQSSLSTRAFTGE 383 >gi|257463920|ref|ZP_05628306.1| restriction endonuclease S subunit [Fusobacterium sp. D12] gi|317061447|ref|ZP_07925932.1| type I restriction-modification enzyme [Fusobacterium sp. D12] gi|313687123|gb|EFS23958.1| type I restriction-modification enzyme [Fusobacterium sp. D12] Length = 392 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 72/198 (36%), Gaps = 5/198 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +PD W F E K K + S + G I T +G + Sbjct: 19 SKEEQPYEIPDSWVWGYMFFAFAECLDKYRKPVNSAERANRIGKIPYYGATGQVGWIDDF 78 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++V GE F+DL +K + + + + + +K L +L+ Sbjct: 79 LTDDELVLVGEDGAPFLDLLKNKAYM----IQGKAWVNNHAHILKSFYGHFGNL-YLLNY 133 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 ++ + R L + +P+ +PP KEQ I I+ + E I++ Sbjct: 134 LNIFDFSKYVNGTTRLKLTQSKLAEIPIPIPPKKEQQRIVEKIDSLFEKTKKAKELIQEV 193 Query: 400 IVLLKERRSSFIAAAVTG 417 ++ R+ S + A G Sbjct: 194 KEEIEMRKISILNKAFRG 211 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 10/116 (8%), Positives = 27/116 (23%), Gaps = 6/116 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTG---RTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP WK V ++ + T + + + +I ++ ++ Sbjct: 277 KIPDTWKWVKLENIITILGDGLHGTPKYNENGEYYFINGSNLSFKNIVINSSTKKVSTAE 336 Query: 75 TSTVSI-FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + I G + + I + + + Sbjct: 337 YKKYKKNLNERTIFLSINGTLGKTGFYNNEKIILGKSVCYINLCNNCNKKFYSLFF 392 >gi|23452745|gb|AAN33143.1| putative type I specificity subunit HsdS [Campylobacter jejuni] Length = 404 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 51/409 (12%), Positives = 111/409 (27%), Gaps = 35/409 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESG--TGKYLPKDGNSRQSDT 75 P + + +L+ + K++ + DV + K +P + Sbjct: 13 PNGVEFKNLWEIGELSNTGVDKKIRENQKEVFLLNFLDVMNNHYINKNIPSMKVTASEAE 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAII--------ADFDGICSTQFLVLQPKDVLPELLQGW 127 K + + + + + + + P L+ Sbjct: 73 IQKCNILKNDLFITPSSENINEIGFASVAIEDMPNVCYSYHIMRFRIFNRQINPYFLRYC 132 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S ++ ++I +G T N+ +PIPPL Q I + + T L E Sbjct: 133 FDSENLRKQILKNAQGITRFGLTQPKWKNLQIPIPPLEIQEEIVKILDTFTELEAELEAE 192 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + + L+S E++ +E+K + Sbjct: 193 LEARRRQYEYYRNKLLS-----------------FEYLKTNGGGYELKMLGEICERQKGI 235 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N E +++ G+I + Q + + D Sbjct: 236 NITAGEMEKIAIQNGDIRIFAGGKTFIDTKMELLQEQNILKKTSIIVKSRGYVDFEYYAK 295 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + + + + +L + + + L D R + Sbjct: 296 PFTHKNELWSYSLNPDTKDINLKFIFYYLKNKVEYFQKIARANAVKIPQLAVADTDRFQI 355 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +PP+ Q I N+++ A L I I K+ R+ + Sbjct: 356 PIPPLATQEKIVNILDQFHALTTDLQSGIPAEIEARKKQYEYYRNQLLT 404 >gi|313678341|ref|YP_004056081.1| type I restriction modification system, S subunit [Mycoplasma bovis PG45] gi|312950575|gb|ADR25170.1| putative type I restriction modification system, S subunit [Mycoplasma bovis PG45] Length = 385 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 48/393 (12%), Positives = 113/393 (28%), Gaps = 33/393 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ ++ ++ S GK+ D N T F Sbjct: 19 WEQKKLRNVVSYHSSVMIASD-----------VKKYGKFDVYDPNKIVGKTDAE-PFRSD 66 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I K G R ++ I ST + P + ++ + G+ Sbjct: 67 YISIVKDGDAGRIRLLPKNTMILST---MGALIAKDPFKIDFLYYMLNAINDLARERNGS 123 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + H +K G +P EQ I I + + L Sbjct: 124 IIPHIYFKDYGQNIYNLPSTPEQSKISSLFTRLDSLITLHQRKLLSLKNLKSRL------ 177 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + + + + + W+V E +++ + ++ Sbjct: 178 -LDRMFCDEKSQFPSIRFKEFTNTWEQWKVGDLITERIEFTKESNEFPLMAFVANEGVVA 236 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAV 323 + R+ ++ + Y++ G+ ++ +L R A I+ Y + Sbjct: 237 KGERYDRSSLVRDIYNKIYKVTKYGDFIYSSNNLD---RGSIGANKYGNACISPVYSIFK 293 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITN 380 + D ++ ++ + G+ + + + + P I EQ I Sbjct: 294 CTNSSDHNFIKNILSRHSFVNKLLKYRQGVVYGQLKIHESIFLNINLNSPSILEQNKIGK 353 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +D L+ ++ + LK +++ + Sbjct: 354 I----FYNLDSLITLHQRKLNSLKNIKNTLLDK 382 >gi|194398404|ref|YP_002037175.1| type I restriction-modification system subunit S [Streptococcus pneumoniae G54] gi|194358071|gb|ACF56519.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae G54] Length = 432 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 42 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 98 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 99 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIES 158 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 159 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 196 Score = 68.3 bits (165), Expect = 3e-09, Method: Composition-based stats. Identities = 63/428 (14%), Positives = 128/428 (29%), Gaps = 71/428 (16%) Query: 33 FTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ G + KD I +I + D E G ++S + KG Sbjct: 6 LVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKG 65 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEG 143 L + R I+ I + ++ L + ++LS + V + ++ G Sbjct: 66 TFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISG 125 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK---- 199 A + + + + +I +P+PPL+EQ I E I + ++D R +L KE Sbjct: 126 AVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLK 185 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWV---------------------------------- 225 ++++ Y + L +S + Sbjct: 186 KSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSY 245 Query: 226 ------------GLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKL 268 +P+ W F +LV K + I +S ++ Sbjct: 246 YGNKDETTSYPIYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISG 305 Query: 269 ETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 N + + I G ++ F L II+ + Sbjct: 306 YVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYAN 364 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 I YL + G ++L + L + + +E I + +++ Sbjct: 365 KENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDL 422 Query: 385 ETARIDVL 392 ++ L Sbjct: 423 LFQKVSQL 430 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 257 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 316 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 317 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 374 >gi|157415312|ref|YP_001482568.1| hypothetical protein C8J_0992 [Campylobacter jejuni subsp. jejuni 81116] gi|157386276|gb|ABV52591.1| hypothetical protein C8J_0992 [Campylobacter jejuni subsp. jejuni 81116] gi|315932187|gb|EFV11130.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni 327] Length = 1190 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 46/404 (11%), Positives = 125/404 (30%), Gaps = 31/404 (7%) Query: 27 VVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVS 79 +V +K G T DI ++ + D + + S Sbjct: 801 LVKLKICGDFFMGGTPSRKNINYWNGDIKWLTISDYSNHQVIMDTKEKITREGFKNSNAK 860 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG ++ + + + I D + + + P + + + ++ Sbjct: 861 MIQKGAVVVS-IYATIGRVGILGEDMTTNQAIVAIIPNEEFINKYLMYAI-DYFKFQLYN 918 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + + N+ +P PPL Q I + + +TL + L+K Sbjct: 919 EVITTSQQNINLGILQNMVIPKPPLEIQKQIVAECEKIEEQYNTLSLSIKEYQNLIKAML 978 Query: 200 QA--LVSYIVTKGLNP------DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 Q ++ LN ++ + E++ + + +L+ L Sbjct: 979 QKCGIIEDNQEYELNSILDKINNLCKINLDSEFLSSFNKTIKEYALSNPIFKLSIGKRVL 1038 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + + E + Y + V ID + + Sbjct: 1039 NNELLENGQIPVYSANVLEVFGFVNKEILQDY----DNDSVLWGIDGDWMVGFIPKNKKF 1094 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 ++ Y+++++ + F + + +K L V +P Sbjct: 1095 YPTDHCGVLRVDDTKI-NAKYISFILNEAGKKQGFSR-----KLRASIDRIKALRVKLPS 1148 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ Q I ++I+ +I+ + + + + L++ + + + Sbjct: 1149 LEFQDQIADIID----KIEKKINEYKIELDRLEKEKEKILQKYL 1188 >gi|303270059|ref|ZP_07355777.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS458] gi|302640405|gb|EFL70834.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS458] Length = 195 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 56/161 (34%), Gaps = 11/161 (6%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITS 318 + E +N+ + + V+ G+++ ++ A + + Sbjct: 39 SYDYFNSSEVKNLPIDYIPLDE-HKVEIGDVIISRMNTSELVGAAGYVWAINSDNIYLPD 97 Query: 319 AYMAVKPHGIDSTYLAWLM----RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 V + + W + ++ K + SG +++ + ++ V PP+ Sbjct: 98 RLWKVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRVPFPPLAL 157 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q + + + A +D I++S+ L+ + S + Sbjct: 158 QNEFADFV----ALVDKSQLAIQKSLEELETLKKSLMQEYF 194 >gi|84489291|ref|YP_447523.1| hypothetical protein Msp_0480 [Methanosphaera stadtmanae DSM 3091] gi|84372610|gb|ABC56880.1| HsdS [Methanosphaera stadtmanae DSM 3091] Length = 405 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 43/401 (10%), Positives = 99/401 (24%), Gaps = 51/401 (12%) Query: 24 HWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W + + + K + I + ++ K S+ + Sbjct: 42 EWITYKLCDVVTRIIRKNKNLETKRPLTISAKYGLIDQIEFFDKYVASKNLK--GYYLLK 99 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQR 136 KG+ Y K G ST ++ + + + + + S + Sbjct: 100 KGEFAYNKSYSNGFPYGAVKRLDLYNQGAISTLYICFEITNKINSNFLKIYFDSNKWNKE 159 Query: 137 IEAICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I +H N P ++EQ I + + A +I + E + Sbjct: 160 MYKIAVEGARNHGLLNIPINDFFNTKHLFPSISEQEKIADFLSAIDKKIGFMEKEINKQS 219 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + MK + D+ + K Sbjct: 220 K----------------------YMKKIRENILNDNSDNSNKVQLKEICIINKGKQLNKT 257 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 K N G P + V I + V + Sbjct: 258 NMI--------NDGKYYVLNGGKTPSGFTNSWNVPENTISISEGGNS---CGFVNYNVQK 306 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 Y L + + K+ +++ +D+++ + +P Sbjct: 307 FYCGGHCYYLTNISDEIDPLLLYHCLKMNENKIMNLRVGSGLPNIQKKDLEKYKLYIPTK 366 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I I++ ++ ++ + LK+ + + Sbjct: 367 NH-----EKITYLLNNINLKIDLNKEKLNHLKQFKKGLLQK 402 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 71/207 (34%), Gaps = 8/207 (3%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS-YGNIIQKLETRNM 273 K+ W +VT + RKN L L++S +I ++E + Sbjct: 26 QCNKNIPELRFPEFEGEWITYKLCDVVTRIIRKNKNLETKRPLTISAKYGLIDQIEFFDK 85 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER--GIITSAYMAVKPHGIDST 331 + ++ + Y ++ GE + + I T + I+S Sbjct: 86 YVASKNLKGYYLLKKGEFAYNKSYSNGFPYGAVKRLDLYNQGAISTLYICFEITNKINSN 145 Query: 332 YLAWLMRSYDLCKVFYAMG-SGLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +L S K Y + G R ++ D L P I EQ I + ++ Sbjct: 146 FLKIYFDSNKWNKEMYKIAVEGARNHGLLNIPINDFFNTKHLFPSISEQEKIADFLSAID 205 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAA 413 +I + ++I + +K+ R + + Sbjct: 206 KKIGFMEKEINKQSKYMKKIRENILND 232 >gi|297519230|ref|ZP_06937616.1| specificity determinant for hsdM and hsdR [Escherichia coli OP50] Length = 278 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 50/149 (33%), Gaps = 6/149 (4%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 V P +++ N + E ++ ++ Sbjct: 66 NTSTYYSGQIPEGYWVYPEDLIVGMDGDFN-----ATIWCSEPALLNQRVCKIEVQEDKY 120 Query: 331 TYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + A S + L ++ + +PP+ EQ I ++ A++ Sbjct: 121 NKRFFYHALPGYLSAINANTSSVTVKHLSSRTLQDTLLPLPPLAEQKIIAEKLDTLLAQV 180 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 D ++EQ +LK R + +AAAVTG+ Sbjct: 181 DSTKARLEQIPQILKRFRQAVLAAAVTGR 209 >gi|331087341|ref|ZP_08336409.1| hypothetical protein HMPREF0987_02712 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330408367|gb|EGG87842.1| hypothetical protein HMPREF0987_02712 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 410 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 46/413 (11%), Positives = 115/413 (27%), Gaps = 59/413 (14%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 P ++ ++ L + K + Y G ++ Y+ Sbjct: 13 PDGVEIHYLEDCCNLLDKKRKPITKAFREAGEYPYYGANGIQDYVANYIFDG-------- 64 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 T + + + K G A + +++ K+ + + +L T Sbjct: 65 -TYVLVGEDGSVITKEGT--PVVTWAKGKIWVNNHAHIIEEKEGV---MLRYLYHYLQTI 118 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + ++ G + + + +PPL Q I + + T+ L E + Sbjct: 119 DVTSLIHG-NIPKLTGGDFKALKIAVPPLEVQREIVRVLDSFTLLTAELTAELTARKKQY 177 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + L++ G + +K + ++ Sbjct: 178 NFYRDKLLT--------------------FGKDTLNCRLKEICD-ICLGLTATPNYTDAG 216 Query: 256 ILSLSYGNIIQKLETRNMGLK-----PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + +S N N E + G+++F + + Sbjct: 217 VKFISAQNTSNDFLDLNNVKYISEADFEKATSNAKPQKGDLLFTRVGSNLGHPVVVETDE 276 Query: 311 MERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVL 368 ++ ++ ++ YL M + G + +L +K + Sbjct: 277 DLCIFVSLGFLRIRNKEQVIIGYLKHWMNTDLFWSQVRKNVHGAAKVNLNTGWLKEFNIS 336 Query: 369 VPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKERRSSFIAAA 414 +PP++ Q I +V++ + L +E ++ R + A Sbjct: 337 LPPLETQERIVHVLDNFESICTDLNIGLPAEIEARQKQYEY---YRDLLLTFA 386 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 54/190 (28%), Gaps = 9/190 (4%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E + + L K K I ++ T Sbjct: 6 ELIREYCPDGVEIHYLEDCCNLLDKKRKPITKAFREAGEYPYYGANGIQDYVANYIFDGT 65 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 Y +V V + + A++ + G+ YL +++ D+ Sbjct: 66 YVLVGEDGSVITKEGTPVVTWAKGKIW-----VNNHAHIIEEKEGVMLRYLYHYLQTIDV 120 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + G L D K L + VPP++ Q +I V++ T L ++ Sbjct: 121 TSLIH----GNIPKLTGGDFKALKIAVPPLEVQREIVRVLDSFTLLTAELTAELTARKKQ 176 Query: 403 LKERRSSFIA 412 R + Sbjct: 177 YNFYRDKLLT 186 >gi|19698526|gb|AAL93190.1| type I restriction enzyme S protein [Campylobacter jejuni] Length = 409 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 51/409 (12%), Positives = 111/409 (27%), Gaps = 35/409 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESG--TGKYLPKDGNSRQSDT 75 P + + +L+ + K++ + DV + K +P + Sbjct: 13 PNGVEFKNLWEIGELSNTGVDKKIRENQKEVFLLNFLDVMNNHYINKNIPSMKVTASEAE 72 Query: 76 STVSIFAKGQILYGKLGPYLRKAII--------ADFDGICSTQFLVLQPKDVLPELLQGW 127 K + + + + + + + P L+ Sbjct: 73 IQKCNILKNDLFITPSSENINEIGFASVAIEDMPNVCYSYHIMRFRIFNRQINPYFLRYC 132 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S ++ ++I +G T N+ +PIPPL Q I + + T L E Sbjct: 133 FDSENLRKQILKNAQGITRFGLTQPKWKNLQIPIPPLEIQEEIVKILDTFTELEAELEAE 192 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + + L+S E++ +E+K + Sbjct: 193 LEARRRQYEYYRNKLLS-----------------FEYLKTNGGGYELKMLGEICERQKGI 235 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N E +++ G+I + Q + + D Sbjct: 236 NITAGEMEKIAIQNGDIRIFAGGKTFIDTKMELLQEQNILKKTSIIVKSRGYVDFEYYAK 295 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + + + + +L + + + L D R + Sbjct: 296 PFTHKNELWSYSLNPDTKDINLKFIFYYLKNKVEYFQKIARANAVKIPQLAVADTDRFQI 355 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +PP+ Q I N+++ A L I I K+ R+ + Sbjct: 356 PIPPLATQEKIVNILDQFHALTTDLQSGIPAEIEARKKQYEYYRNQLLT 404 >gi|301801396|emb|CBW34082.1| putative type I restriction-modification system S protein [Streptococcus pneumoniae INV200] Length = 432 Score = 77.1 bits (188), Expect = 6e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 42 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 98 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 99 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIES 158 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 159 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 196 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 64/432 (14%), Positives = 127/432 (29%), Gaps = 71/432 (16%) Query: 29 PIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++ G + KD I +I + D E G ++S + Sbjct: 2 RFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRF 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEA 139 KG L + R I+ I + ++ L + ++LS + V + + Sbjct: 62 VKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLS 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + GA + + + + +I +P+PPLAEQ I E I + ++D R +L KE Sbjct: 122 LISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFP 181 Query: 200 ----QALVSYIVTKGLNPDVKMKDSGIEWV------------------------------ 225 ++++ Y + L +S + Sbjct: 182 DKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGD 241 Query: 226 ----------------GLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNI 264 +P W F +LV K + I +S ++ Sbjct: 242 DNSYYGNKDETTSYPIYEIPKAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDM 301 Query: 265 IQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N + + I G ++ F L II+ + Sbjct: 302 PISGYVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IF 360 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 I YL + G ++L + L + + +E I + Sbjct: 361 PYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIIS 418 Query: 381 VINVETARIDVL 392 +++ ++ L Sbjct: 419 KVDLLFQKVSQL 430 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IPK W+ + G+T + +I ++ + D+ SG + + Sbjct: 257 IYEIPKAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 316 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 317 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 374 >gi|303262771|ref|ZP_07348709.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS292] gi|302636093|gb|EFL66590.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS292] Length = 197 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 56/161 (34%), Gaps = 11/161 (6%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITS 318 + E +N+ + + V+ G+++ ++ A + + Sbjct: 41 SYDYFNSSEVKNLPIDYIPLDE-HKVEIGDVIISRMNTSELVGAAGYVWAINSDNIYLPD 99 Query: 319 AYMAVKPHGIDSTYLAWLM----RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 V + + W + ++ K + SG +++ + ++ V PP+ Sbjct: 100 RLWKVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRVPFPPLAL 159 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q + + + A +D I++S+ L+ + S + Sbjct: 160 QNEFADFV----ALVDKSQLAIQKSLEELETLKKSLMQEYF 196 >gi|148927367|ref|ZP_01810898.1| restriction modification system DNA specificity domain [candidate division TM7 genomosp. GTL1] gi|147887266|gb|EDK72727.1| restriction modification system DNA specificity domain [candidate division TM7 genomosp. GTL1] Length = 335 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 42/323 (13%), Positives = 94/323 (29%), Gaps = 13/323 (4%) Query: 100 IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPM 159 + + V +L ++ + G T + + I + Sbjct: 20 YKSKAYLVQGKIWVNNHAHILLARNNKYVKYALNYVDYQRYVTGTTRLKLNQSALKRIII 79 Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 P P EQ I KI ID + K +Q+++ + K ++ Sbjct: 80 PFPDENEQKRIVAKIEELFSEIDNAESAITTASGYYKSYEQSIIDSLFAKYEAEAEMVEF 139 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 I + + + + + + E + + + E Sbjct: 140 GDIAEIKGGITKGRKLRGMPIGETPYLRVANVQD---------GYLYLDEIKTINVTAEE 190 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLM 337 Y +++ + D R +E I + + Y+++ Sbjct: 191 LRKYSLMNGDILFTEGGDKDKLGRGTIWHGEIELCIHQNHIFRARVDSGQFVPEYISYAT 250 Query: 338 RSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++ F + SL +K L + P+ +Q +I I + + I ++ Sbjct: 251 KTTRARDYFLSKAKQTTNLASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKE 310 Query: 396 IEQSIVLLKERRSSFIAAAVTGQ 418 + + K R S +A A G+ Sbjct: 311 LIVAHHRSKALRQSILAKAFKGE 333 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 69/200 (34%), Gaps = 14/200 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 ++V ++ G T + Y+ + +V+ G + ++ Sbjct: 135 EMVEFGDIAEIKGGITKGRKLRGMPIGETPYLRVANVQDGYLYLDEIKTINVTAEELRKY 194 Query: 80 IFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G IL+ + G R I +C Q + + + + + ++ T R Sbjct: 195 SLMNGDILFTEGGDKDKLGRGTIWHGEIELCIHQNHIFRARVDSGQFVPEYISYATKTTR 254 Query: 137 IEAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + ++ + + N+ +P PLA+Q I E I+ + I + E I Sbjct: 255 ARDYFLSKAKQTTNLASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKELIVA 314 Query: 192 IELLKEKKQALVSYIVTKGL 211 K +Q++++ L Sbjct: 315 HHRSKALRQSILAKAFKGEL 334 >gi|265763427|ref|ZP_06091995.1| type I restriction endonuclease S subunit [Bacteroides sp. 2_1_16] gi|263256035|gb|EEZ27381.1| type I restriction endonuclease S subunit [Bacteroides sp. 2_1_16] Length = 355 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 60/391 (15%), Positives = 139/391 (35%), Gaps = 51/391 (13%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V ++ +N ++ IYI LE VE G + + ++ ++ + + IL Sbjct: 2 VSLQDIATINP-KSDPLQNTFIYIDLEAVEKGELRKI-QEIMREEAPSRAQRVIDNNDIL 59 Query: 88 YGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + + PY + I + + ST + ++ + LP + L + + +++ C Sbjct: 60 FQCVRPYQKNNYIHRILNTSNQQWVASTGYAQIRTTE-LPNYIYHLLNTDEFNRKVMVRC 118 Query: 142 EGATMSHADWKGIGNIPMPI-PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G++ + + + I + P EQ+ I + RI T +L + Sbjct: 119 TGSSYPAINSEDLATIHLYYTPDKKEQLKISRLLDLLDKRIATQNKIIEDLKKLKSAISE 178 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L +K S + + +V L +S + Sbjct: 179 RLF-----------KSVKGSTV----------LLSDLCDIVKGKQINGENLSDSGNYYVM 217 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDP-GEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 G N ++ + + + G + F + + ++ Sbjct: 218 NGGTEPSGYYDNYNVEASTISISEGGNSCGYVQFNTSPFWSGGHCYSIQNIADK------ 271 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 +D+ YL ++S + + +GSGL +++ +D+ ++VP I+ Q I+ Sbjct: 272 --------VDNMYLYHYLKSNEDAIMKLRIGSGL-PNIQKKDLAMFKIIVPKIEWQIKIS 322 Query: 380 NVINVET--ARIDVLVEK--IEQSIVLLKER 406 ++ A I+ ++ +Q + LL++ Sbjct: 323 TFLSSLERKAEIEERIQNVMQKQKLYLLQQM 353 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 64/180 (35%), Gaps = 7/180 (3%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + + T + + + L + + + + + +++D +I+F Sbjct: 1 MVSLQDIATINPKSDPLQNTFIYIDLEAVEKGELRKIQEIMREEAPSRAQRVIDNNDILF 60 Query: 294 RFIDLQNDKRSL-RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + R + + S A Y+ L+ + + + +G Sbjct: 61 QCVRPYQKNNYIHRILNTSNQQWVASTGYAQIRTTELPNYIYHLLNTDEFNRKVMVRCTG 120 Query: 353 LR-QSLKFEDVKRLPVLV-PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 ++ ED+ + + P KEQ I+ +++ +D + + I LK+ +S+ Sbjct: 121 SSYPAINSEDLATIHLYYTPDKKEQLKISRLLD----LLDKRIATQNKIIEDLKKLKSAI 176 >gi|307353815|ref|YP_003894866.1| restriction modification system DNA specificity domain-containing protein [Methanoplanus petrolearius DSM 11571] gi|307157048|gb|ADN36428.1| restriction modification system DNA specificity domain protein [Methanoplanus petrolearius DSM 11571] Length = 413 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 64/355 (18%), Positives = 128/355 (36%), Gaps = 28/355 (7%) Query: 81 FAKGQILYGKLGPYLRKAIIA--DFDGICSTQFL--VLQPKDVLPELLQGWLLSIDVTQR 136 ++ G + +II D GI S L + K +LP+ L+ + + + Sbjct: 73 IKTDDLIISCSGTLGKVSIIQKNDPSGIISQALLLLRVDKKKILPKYLKYFFNTKEGYNA 132 Query: 137 IEAICEGATMSHADWK-GIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I + G+ + + I IP+ +PPL Q I + I +D I R ++L Sbjct: 133 IVSRSSGSVQVNISKRADIEQIPIRLPPLIIQTKIVDII----SALDNKIELNTRMNKVL 188 Query: 196 KEKKQALVSYIVTKGLNPD---VKMKDSGIE----WVGLVPDHWEVKPFFALVTELNRKN 248 ++ AL + PD K SG + +G VP+ WE F + N K+ Sbjct: 189 EDIAHALFHRWFVEFEFPDAEGKPYKSSGGKMVGSEMGSVPEGWESLSFKDFLKIRNEKS 248 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + G I + + S +I+ ++VF + ++ Sbjct: 249 NDPAIPEYSVTNLG--IYPRDEKYKKKLSSSSSKNKIIHKFDLVFGMSREILNWGVMKDE 306 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSY--DLCKVFYAMGSGLRQSLKFEDVKRLP 366 G+ ++ + + ++ +L M+SY + + + + Sbjct: 307 IG---GVSSAYNVFIIDKEVNPLFLESFMKSYLPYFKDIIKPSAREGQ-GIDKAALFSKN 362 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + +PP I + I +V E+ L E R + + ++G+I++ Sbjct: 363 IYLPPKD----ILDQYYDMENTILSVVRNFEKENENLIEIRDTLLPKLMSGEIEV 413 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 50/139 (35%), Gaps = 4/139 (2%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 I + I + + E +++ + +++ ++ Sbjct: 41 IPVYEQQHAISESRDFRFFISEEKFQSMRRFAIKTDDLIISCSGTLGKVSIIQKNDPSGI 100 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKF-EDVKRLPVLVPP 371 + V I YL + + + + SG ++ ++ D++++P+ +PP Sbjct: 101 ISQALLLLRVDKKKILPKYLKYFFNTKEGYNAIVSRSSGSVQVNISKRADIEQIPIRLPP 160 Query: 372 IKEQFDITNVINVETARID 390 + Q I ++I+ +I+ Sbjct: 161 LIIQTKIVDIISALDNKIE 179 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/127 (18%), Positives = 49/127 (38%), Gaps = 8/127 (6%) Query: 10 YKDSGVQWIGA----IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 YK SG + +G+ +P+ W+ + K F K+ ++++ I + ++ G Sbjct: 213 YKSSGGKMVGSEMGSVPEGWESLSFKDFLKIRNEKSNDPA--IPEYSVTNL--GIYPRDE 268 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 K S +S I K +++G L ++ D G S+ + V + L Sbjct: 269 KYKKKLSSSSSKNKIIHKFDLVFGMSREILNWGVMKDEIGGVSSAYNVFIIDKEVNPLFL 328 Query: 126 GWLLSID 132 + Sbjct: 329 ESFMKSY 335 >gi|182683453|ref|YP_001835200.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae CGSP14] gi|225856227|ref|YP_002737738.1| type I restriction enzyme [Streptococcus pneumoniae P1031] gi|225858346|ref|YP_002739856.1| type I restriction enzyme [Streptococcus pneumoniae 70585] gi|182628787|gb|ACB89735.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae CGSP14] gi|225722066|gb|ACO17920.1| type I restriction enzyme [Streptococcus pneumoniae 70585] gi|225726255|gb|ACO22107.1| type I restriction enzyme [Streptococcus pneumoniae P1031] Length = 516 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 72/206 (34%), Gaps = 10/206 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE ++ + + I + S + +N+ Sbjct: 77 EIDVPYDIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQ 136 Query: 281 ---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++V ++F + ++ ++ +I S V ++ TYL + + Sbjct: 137 APSRARKLVSQNSVLFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYL 194 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + +G ++ + L + +PP+ EQ I I ++D E Sbjct: 195 LSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESY 254 Query: 397 EQSIVLLKE----RRSSFIAAAVTGQ 418 + L KE + S + A+ G+ Sbjct: 255 NRLEQLDKEFPDKLKKSILQYAMQGK 280 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 68/436 (15%), Positives = 128/436 (29%), Gaps = 67/436 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 IP W+ V IK E I D + Y + + Q+ + Sbjct: 83 DIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRAR 142 Query: 79 SIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ +L+ + PYL+ + I ST F+VL L +LLS + Sbjct: 143 KLVSQNSVLFSTVRPYLKNIAVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFIN 201 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ G + + + + +PPL+EQ I E I + ++D R +L Sbjct: 202 RVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLD 261 Query: 196 KEKK----QALVSYIVTKG--------------LNPDVKMKDSGIEW------------- 224 KE ++++ Y + L K E Sbjct: 262 KEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIV 321 Query: 225 -------------------VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLS 260 + +P+ W F +LV K + I +S Sbjct: 322 SQGDDNSYYGNKDETTSYPIYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVS 381 Query: 261 YGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 ++ N + + I G ++ F L II Sbjct: 382 ISDMPISGYVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAII 441 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + I YL + G ++L + L + + +E Sbjct: 442 S-IFPYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMK 498 Query: 377 DITNVINVETARIDVL 392 I + +++ ++ L Sbjct: 499 RIISKVDLLFQKVSQL 514 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 341 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 400 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 401 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 458 >gi|78484678|ref|YP_390603.1| restriction modification system DNA specificity subunit [Thiomicrospira crunogena XCL-2] gi|78362964|gb|ABB40929.1| Type I restriction enzyme, S subunit [Thiomicrospira crunogena XCL-2] Length = 371 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 54/399 (13%), Positives = 123/399 (30%), Gaps = 38/399 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 KV+ + + L G+ ++ + G+ + G + N+ + Sbjct: 4 KVIELSKALNLKNGKALKNTSN----GIYQIFGSNGVIGTTELNN-----------NENA 48 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 ++ G++G Y ++ S +V +PK P+ + + + + GA Sbjct: 49 LIIGRVGAYCGSIELSQEKFWASDNTIVAEPK---PDNVLHYWYYRLKSFPLRKFAGGAA 105 Query: 146 MSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + EQ I + I+ E + Q Sbjct: 106 QPLLTQNTLKPLKIAAHTDYLEQDKIANILKVYDDLIENNNRRIALLEESARLLYQEWFV 165 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-NTKLIESNILSLSYGN 263 ++ G + VP+ WE L+ E+ N + I+S + + Sbjct: 166 HLRFPGHEHCKI--------IDGVPEGWERTRLEDLIEEIKEAVNPESIDSETPYIGLEH 217 Query: 264 IIQKLETRNMGLKPESY-ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + ++ T + E G+I+F I K + + + A + Sbjct: 218 MPRRSITLSEWETVEKVTSKKYRYYSGDIIFGKIRPYFHKVGFA---ITDGVTSSDAVVV 274 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + YL + S + ++ + V VP ++ Sbjct: 275 RSKDISNYQYLLMYLSSDFFISLASKTVKEGSKMPRADWKYLMTTDVQVPSDFLLKSFSD 334 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ++ ++ L + +Q LK+ R + + G+I Sbjct: 335 SVDKILKQLKTLSVQNKQ----LKKARDILLPRLMNGEI 369 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 74/188 (39%), Gaps = 6/188 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W+ ++ + + + YIGLE + + + + TS Sbjct: 181 VPEGWERTRLEDLIEEIKEAVNPESIDSETPYIGLEHMPRRSITLSE--WETVEKVTSKK 238 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR-- 136 + G I++GK+ PY K A DG+ S+ +V++ KD+ LS D Sbjct: 239 YRYYSGDIIFGKIRPYFHKVGFAITDGVTSSDAVVVRSKDISNYQYLLMYLSSDFFISLA 298 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + EG+ M ADWK + + +P + + ++ TL + + + Sbjct: 299 SKTVKEGSKMPRADWKYLMTTDVQVPSDFLLKSFSDSVDKILKQLKTLSVQNKQLKKARD 358 Query: 197 EKKQALVS 204 L++ Sbjct: 359 ILLPRLMN 366 >gi|327390251|gb|EGE88592.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA04375] Length = 427 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 59/158 (37%), Gaps = 8/158 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 37 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 93 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 94 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIES 153 Query: 385 ETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 ++D E + L KE + S + A+ G+ Sbjct: 154 ALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 191 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 64/427 (14%), Positives = 128/427 (29%), Gaps = 71/427 (16%) Query: 34 TKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ G + KD I +I + D E G ++S + KG Sbjct: 2 VEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGT 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGA 144 L + R I+ I + ++ L + ++LS + V + ++ GA Sbjct: 62 FLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGA 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----Q 200 + + + + +I +P+PPLAEQ I E I + ++D R +L KE + Sbjct: 122 VVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKK 181 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWV----------------------------------- 225 +++ Y + L +S + Sbjct: 182 SILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYY 241 Query: 226 -----------GLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLE 269 +P+ W F +LV K + I +S ++ Sbjct: 242 GNKDETTSYPIYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGY 301 Query: 270 TRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 N + + I G ++ F L II+ + Sbjct: 302 VTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANK 360 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I YL + G ++L + L + + +E I + +++ Sbjct: 361 ENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLL 418 Query: 386 TARIDVL 392 ++ L Sbjct: 419 FQKVSQL 425 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 252 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 311 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 312 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 369 >gi|283782191|ref|YP_003372946.1| restriction modification system DNA specificity subunit [Pirellula staleyi DSM 6068] gi|283440644|gb|ADB19086.1| restriction modification system DNA specificity subunit [Pirellula staleyi DSM 6068] Length = 517 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 65/413 (15%), Positives = 135/413 (32%), Gaps = 43/413 (10%) Query: 25 WKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ + + TG + + + ++G + L S+ I Sbjct: 3 WRRIKVGDLLARKTGTVNPDKSPSERFSLYSIPAFDNGAPEEL-----LGSEIGSSKQIL 57 Query: 82 AKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G +L K+ P++R+ + I S +++V + V L L+ + + Sbjct: 58 QPGDVLLSKIVPHIRRCWVVGKTLSTHRMIGSGEWIVFRTHRVDAGYLSKVLVGDEFHKA 117 Query: 137 IEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 G ++ A + I +P+PPL EQ I + R ++L Sbjct: 118 FLQTVAGVGGSLKRARPAAVAEIEIPVPPLDEQRRIAAVLDKADALRRQ----RQESLQL 173 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 ++ Q++ + + + +E + + R + Sbjct: 174 TEKLLQSVFLSMFGDPVGNPKNLPTDDLENLAKLERGKFTPR--------PRNDPSYYSG 225 Query: 255 NILSLSYGNIIQ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + G+I + ++ L + + PG +V + + ++ V Sbjct: 226 DFPFIQTGDITRSKGRITGWTQTLNEKGIRVSREFQPGTVVIAIVGATLGETAIVETPVY 285 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 I + P S +L +L+R + + R +L E ++ LP L P Sbjct: 286 CPDSIIG--VTPYPTKATSEFLEFLLRLWKPR-LKELAPDAARANLNLERLRPLPALAPE 342 Query: 372 IKEQF---DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + Q I + T +D LL + SS A G++DL Sbjct: 343 LDLQQEFSRIARDLRQLT--LDKTENG-----KLLDKLFSSLQQRAFRGELDL 388 >gi|254448613|ref|ZP_05062072.1| type I restriction-modification system, endonuclease S subunit [gamma proteobacterium HTCC5015] gi|198261802|gb|EDY86088.1| type I restriction-modification system, endonuclease S subunit [gamma proteobacterium HTCC5015] Length = 379 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 65/396 (16%), Positives = 133/396 (33%), Gaps = 36/396 (9%) Query: 23 KHWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI- 80 WK++ ++ + + IY+GLE +E ++L + D + Sbjct: 11 SGWKLLRFGDLARNISKREDPATTDEKIYVGLEHIE---PRHLRVNRFGSPGDVIGQKLK 67 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRIE 138 F G I++GK Y RKA +A+F GICS +V + + P L + + Sbjct: 68 FNAGDIIFGKRRAYQRKAAVANFSGICSAHAMVLRENNEFIFPGFLIHLMHTDVFMNTAI 127 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I EG+ WK + P+PP Q + + + + I++ I Sbjct: 128 RISEGSLSPTIKWKILAEQKFPVPPKNIQSTLLDSL-TKIEEIESSIF------------ 174 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 VT LN + S + +++ +TE + + ++ Sbjct: 175 -------AVTSSLNTLLASYKSKHMPIRAKAKQAKIEKIGNFLTESKIPGSTGDVAKKIT 227 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + G Y + PG+ ++ +D N ++ ++ T Sbjct: 228 VKL---YGLGAIAKDGASGSVNTKYFLRKPGQFIYSKLDFLNGAFAIIPEELEGYESTTD 284 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQF 376 + + +L + + + F G R + + + + Q Sbjct: 285 LPCFDVKDTLHAEWLLHFVDRTEFYESFTHSAKGGRKAKRISPQAFLSCEFPYVGPETQS 344 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + + I L+EK ++ + L+E I+ Sbjct: 345 EHLSAIKKIVTE-KHLMEKKQKIMFRLREM---LIS 376 >gi|146319105|ref|YP_001198817.1| type I restriction-modification system, S subunit [Streptococcus suis 05ZYH33] gi|253752153|ref|YP_003025294.1| type I restriction-modification system S protein [Streptococcus suis SC84] gi|253753979|ref|YP_003027120.1| type I restriction-modification system S protein [Streptococcus suis P1/7] gi|253755914|ref|YP_003029054.1| type I restriction-modification system S protein [Streptococcus suis BM407] gi|145689911|gb|ABP90417.1| type I restriction-modification system, S subunit [Streptococcus suis 05ZYH33] gi|251816442|emb|CAZ52078.1| type I restriction-modification system S protein [Streptococcus suis SC84] gi|251818378|emb|CAZ56206.1| type I restriction-modification system S protein [Streptococcus suis BM407] gi|251820225|emb|CAR46645.1| type I restriction-modification system S protein [Streptococcus suis P1/7] gi|292558741|gb|ADE31742.1| type I restriction-modification system, S subunit [Streptococcus suis GZ1] Length = 522 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 70/461 (15%), Positives = 142/461 (30%), Gaps = 78/461 (16%) Query: 5 KAYPQY-----KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGL 53 K Y + K V + IP W+ V ++ + +G T +S + +I +I Sbjct: 65 KPYEKLADGTVKKVEVPY--EIPDSWEWVRLRNLGVITSGGTPKSSESTYYDGNITWITP 122 Query: 54 EDV----ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST 109 D+ + K S+ + +K I+Y P I ++D + Sbjct: 123 ADMGKQQNNKLFATSSKKITELGVQKSSAQLISKNSIVYSSRAPI-GHINIVNYDFTTNQ 181 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 + P V L + + T+ I G T G G+ +P+PPLAEQ Sbjct: 182 GCKSVTPILVN--LDFMYWILQFRTKDIILRSSGTTFKEISASGFGDTLLPLPPLAEQKR 239 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGLNPDVKMKDS----- 220 I I +++ + EL + ++++ Y + L + Sbjct: 240 IVAHIERALEQVEVYAESYNKLQELDRAFPDKLKKSILQYAMQGKLVAQDPNDEPVEVLL 299 Query: 221 -------------------------------------GIEWVG-------LVPDHWEVKP 236 E +G +P W Sbjct: 300 EMIRAEKQKLYEEGKLKKKDLAEIMVEKGDDNSPYGKNKENIGFSNSTLFKLPSSWCYVK 359 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNI-IQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 F LV K E N + I + K + Y + ++ ++ Sbjct: 360 FGGLVLFNIGKTPPRSEPNYWGDDIPWVSISDMSNNGHIFKTKEYLSDFAINQKKVKIAS 419 Query: 296 IDL--QNDKRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSG 352 + K ++ + A +++ P+ ++ +LMR L Sbjct: 420 AGTLLMSFKLTIGKVALEVPASHNEAIISIFPYGDKENIIRDYLMRFLPLISTTGNSKDA 479 Query: 353 LR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 ++ ++L + L + + +E DI +++ ++ L Sbjct: 480 IKGKTLNSTSISGLLIPISNYREMKDIVTKVDLLFEKVAQL 520 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 66/212 (31%), Gaps = 20/212 (9%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGLK 276 +E +PD WE L + K + NI ++ ++ ++ + Sbjct: 78 VEVPYEIPDSWEWVRLRNLGVITSGGTPKSSESTYYDGNITWITPADMGKQQNNKLFATS 137 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + + + R+ V +V P ++ ++ Sbjct: 138 SKKITELGVQKSSAQLISKNSIVYSSRAPIGHINIVNYDFTTNQGCKSVTPILVNLDFMY 197 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 W+++ + + + + + +PP+ EQ I I + VE Sbjct: 198 WILQ-FRTKDIILRSSGTTFKEISASGFGDTLLPLPPLAEQKRIVAHIERALEQ----VE 252 Query: 395 KIEQSIVLLKE--------RRSSFIAAAVTGQ 418 +S L+E + S + A+ G+ Sbjct: 253 VYAESYNKLQELDRAFPDKLKKSILQYAMQGK 284 >gi|254422455|ref|ZP_05036173.1| Type I restriction modification DNA specificity domain protein [Synechococcus sp. PCC 7335] gi|196189944|gb|EDX84908.1| Type I restriction modification DNA specificity domain protein [Synechococcus sp. PCC 7335] Length = 430 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 49/415 (11%), Positives = 116/415 (27%), Gaps = 27/415 (6%) Query: 27 VVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF--- 81 P+ + T ++ I + + G+ + + + +T I Sbjct: 18 TKPLSKLCHSITDCHHSTPKYTSAGKIVIRNFNIKNGRLILDNVSFTDEETYQARIARSK 77 Query: 82 -AKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G ++ + P II + C L+ ++++ + + + Q+ Sbjct: 78 PEPGDLIITREAPMGEICIIPEGIECCLGQRMVLIKPDENIIDNNYLLYAILSEYVQKQI 137 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ + + IP Q I + A +ID + K Sbjct: 138 LKSNNTGSIVSNLRIPDLEDLQIPIKEPQSQIAGILSALDAKIDLNNRINAELEAMAKTI 197 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVG------LVPDHWEVKPFFALVTELNRKNT--- 249 N K SG + V +P+ W+ L + Sbjct: 198 YDYWFVQFDFPDEN-GKPYKSSGGKMVYNKTLRREIPEKWKAGTLEDLGKIVGGSTPSTK 256 Query: 250 ---KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF--RFIDLQNDKRS 304 E+ I ++ ++ + + + I D ++ + L + Sbjct: 257 VEANFSENGIPWIAPNDLSNNVGNKYITKGSLDVTLEGIKDASLKLYPKGTVLLSSRAPI 316 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 A + + + P+ ST + + + + + +K Sbjct: 317 GYMAIARNKLTTNQGFKSFIPNNKFSTEFVFYAVKNSMKAIIQYASGSTFKEVSGTILKT 376 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + V +PP I + + +EQ L + R + + GQ+ Sbjct: 377 INVCLPPPD----IADGYTNHMRSTFSRQDFLEQENQQLTQLRDWLLPMLMNGQV 427 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 73/210 (34%), Gaps = 21/210 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVESGTG-KYLPK---DG 68 IP+ WK ++ K+ G T S I +I D+ + G KY+ K D Sbjct: 231 EIPEKWKAGTLEDLGKIVGGSTPSTKVEANFSENGIPWIAPNDLSNNVGNKYITKGSLDV 290 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 +++ ++ KG +L P AI + + F P + + Sbjct: 291 TLEGIKDASLKLYPKGTVLLSSRAPIGYMAIARNKL-TTNQGFKSFIPNNKFSTEFVFYA 349 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + I G+T + I + +PP + T + + + + Sbjct: 350 VKNSMKAII-QYASGSTFKEVSGTILKTINVCLPPP-------DIADGYTNHMRSTFSRQ 401 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 + ++ Q L +++ +N V MK Sbjct: 402 DFLEQENQQLTQ-LRDWLLPMLMNGQVTMK 430 >gi|332076950|gb|EGI87412.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17545] Length = 516 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 72/206 (34%), Gaps = 10/206 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE ++ + + I + S + +N+ Sbjct: 77 EIDVPYDIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQ 136 Query: 281 ---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++V ++F + ++ ++ +I S V ++ TYL + + Sbjct: 137 APSRARKLVSQNSVLFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYL 194 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + +G ++ + L + +PP+ EQ I I ++D E Sbjct: 195 LSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESY 254 Query: 397 EQSIVLLKE----RRSSFIAAAVTGQ 418 + L KE + S + A+ G+ Sbjct: 255 NRLEQLDKEFPDKLKKSILQYAMQGK 280 Score = 69.8 bits (169), Expect = 9e-10, Method: Composition-based stats. Identities = 68/436 (15%), Positives = 127/436 (29%), Gaps = 67/436 (15%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 IP W+ V IK E I D + Y + + Q+ + Sbjct: 83 DIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRAR 142 Query: 79 SIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ +L+ + PYL+ + I ST F+VL L +LLS + Sbjct: 143 KLVSQNSVLFSTVRPYLKNIAVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFIN 201 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ G + + + + +PPL+EQ I E I + ++D R +L Sbjct: 202 RVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLD 261 Query: 196 KEKK----QALVSYIVTKG--------------LNPDVKMKDSGIEW------------- 224 KE ++++ Y + L K E Sbjct: 262 KEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIV 321 Query: 225 -------------------VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLS 260 + +P+ W F +LV K + I +S Sbjct: 322 SQGDDNSYYGNKDETTSYPIYKIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVS 381 Query: 261 YGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 ++ N + + I G ++ F L II Sbjct: 382 ISDMPISGYVTNTRESISKLALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAII 441 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + I YL + G ++L + L + + +E Sbjct: 442 S-IFPYANKENIIRDYLMIFLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMK 498 Query: 377 DITNVINVETARIDVL 392 I +++ ++ L Sbjct: 499 RIIFKVDLLFQKVSQL 514 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 46/130 (35%), Gaps = 17/130 (13%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESG 59 YP YK IP+ W+ + G+T + +I ++ + D+ SG Sbjct: 339 YPIYK---------IPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISG 389 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + + + + I KG +L + K I D + + + P Sbjct: 390 YVTNTRESISKLALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYAN 448 Query: 120 LPELLQGWLL 129 +++ +L+ Sbjct: 449 KENIIRDYLM 458 >gi|315639045|ref|ZP_07894214.1| type I specificity subunit HsdS [Campylobacter upsaliensis JV21] gi|315480873|gb|EFU71508.1| type I specificity subunit HsdS [Campylobacter upsaliensis JV21] Length = 470 Score = 76.8 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 51/441 (11%), Positives = 118/441 (26%), Gaps = 65/441 (14%) Query: 29 PIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTST---V 78 P+K F K+ +G+ G Y+ ++D++S + S D T Sbjct: 33 PLKNFVKIKSGKRIPKGRSYANTTTTYKYLRVDDLDSEILEIDIDKLKSIDKDIFTLLER 92 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 ++ G + I + S + ++ + L + + + + Sbjct: 93 YEIHNDEVALSIAGTIGKVFIFHN---TTSNRVILTENCVKLQAQDNLLPKFLSLILKTD 149 Query: 139 AICEGATMSHADWKGIGN--------IPMPIPPLAEQVLIREKIIAETVRIDT------- 183 + + IPPL+ Q I + + Sbjct: 150 FLQSQMKRQYIQTTIPKLAIERIKELQIPSIPPLSTQQHIIDLMDKAYKAKQEKENKAKE 209 Query: 184 --------------LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 +I L +S + + + K L+ Sbjct: 210 LLDSIDSYLLEELGIILPLRANNTLDSRIYTQKISALSGSRFDANYHQKYYRDLEKSLLS 269 Query: 230 DHWEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 + + +L+ + + I + +I + K S ++ Sbjct: 270 SPYPLVNLASLINNFKKGIEVGSSEYSQNKEIPFIRVSDITNNGIDFDNVQKFISASLFE 329 Query: 285 IVDPGEIVFRF--IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + + A V II+ + ++ + + Sbjct: 330 NLKAYKPKQNELLYSKDGTVGICLEADVSRDYIISGGILRLELKAEVDKDFLCFLLGSYM 389 Query: 343 CKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 VF S + + L + L + +PP+ Q I N + ++++ L + E Sbjct: 390 INVFANRVSIGAVIKHLNIGEFLNLKIPLPPLAIQTQIANRL--KSSKFQALSLEKEA-- 445 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 + A +ID+ Sbjct: 446 -------KEILNKA---KIDV 456 >gi|331678990|ref|ZP_08379662.1| putative type I restriction-modification system specificity subunit [Escherichia coli H591] gi|331073055|gb|EGI44378.1| putative type I restriction-modification system specificity subunit [Escherichia coli H591] Length = 360 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 55/391 (14%), Positives = 118/391 (30%), Gaps = 45/391 (11%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + ++ G+ ++ + V +G+ G D ++ + I+ Sbjct: 6 VKLGDVINVHYGKALKAD--------QRVSNGSVHVFGSSGIVGNHD---KTLCSYPTII 54 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G D I T + V +L +L I + ++ Sbjct: 55 IGRKGSVGAITWAPDGGWIIDTAYYVEI--KDNNKLDLRYLFYILSGIDLTKKTITTSIP 112 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + + +PP EQ I + + + I + I+ + A + Sbjct: 113 GLNRDDLYDTFIKLPPFEEQKRIVDLLD-KAEGIRQKREQAIKLADDFLRATFATM---- 167 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 NP K + +G + + K+ + E + I Sbjct: 168 --YGNPITNPKKWPVHLMGEIIEFK--------GGNQPPKSDFIFEPKQGYIRLVQIRDF 217 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + P+ I + +++ + G A M P Sbjct: 218 KSDKYATYIPQEKAKR-IFEVDDVMIARYGPP-----VFQILRGLSGSYNVALMKASPKE 271 Query: 328 IDSTYL-AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +L++ + V + + + E + + V +PPI Q +I + Sbjct: 272 NIRKGFIFYLLQLPEYHDVVVKNSERTAGQTGVNLELLNKFNVPLPPIYYQDEILARL-- 329 Query: 385 ETARIDVLVEKIEQSIVLLK----ERRSSFI 411 ARI+ EKIE S+ L+ + + Sbjct: 330 --ARIEKFKEKIEISLNHLEMQFLSLQKRLM 358 Score = 43.2 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 56/185 (30%), Gaps = 4/185 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 PK W V + + G I + + + I Sbjct: 175 PKKWPVHLMGEIIEFKGGNQPPKSDFIFEPKQGYIRLVQIRDFKSDKYATYIPQEKAKRI 234 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 F ++ + GP + + + G + + PK+ + + +LL + ++ Sbjct: 235 FEVDDVMIARYGPPVFQI-LRGLSGSYNVALMKASPKENIRKGFIFYLLQLPEYHDVVVK 293 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + + + + +P+PP+ Q I ++ + + Sbjct: 294 NSERTAGQTGVNLELLNKFNVPLPPIYYQDEILARLARIEKFKEKIEISLNHLEMQFLSL 353 Query: 199 KQALV 203 ++ L+ Sbjct: 354 QKRLM 358 >gi|301633326|gb|ADK86880.1| type I restriction modification DNA specificity domain protein [Mycoplasma pneumoniae FH] Length = 429 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 49/411 (11%), Positives = 108/411 (26%), Gaps = 38/411 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K IK + GR G+ V S + G D G+ Sbjct: 4 KTYKIKDICDIKRGRVISKLDIKKDPGVFPVYSAATNNDGEFGRINSYDFD-------GE 56 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + Y + + +L+ K+ + + Sbjct: 57 YVTWTADGYGGAVFYRNGKFSITNLCGLLKVKNKEISSKY-LAHILKLEAPKFTNRVFKN 115 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT-----------------LITER 188 K + IP+ PPL Q I + T Sbjct: 116 RPKLTHKTMAEIPIDFPPLKIQEKIATILDTFTELSAELSAELSAELSAELSAELSAELS 175 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI--EWVGLVPDHWEVKPFFALVTELNR 246 L + A +S ++ L+ ++ + S E + + + ++ Sbjct: 176 AELSAELSAELSAELSAELSAELSAELSAELSAELRERRKQYDFYRDYLLNQENIRKIYG 235 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID--------- 297 N I + N +++ + + P + Y + I+ Sbjct: 236 ANIPFETFQIRDICEINRGREINEKYLRENPGEFPVYSSATTNGGLIGKINDYDFHGEYV 295 Query: 298 --LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + E+ + ++ + +L + L + + Sbjct: 296 TWTTGGAHAGNVFYRNEKFSCSQNCGLLEVKNKNKFSSKFLCFALKLQSKKFVNYASAIP 355 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 L + + + + PP++ Q I +++ + L E I I L K++ Sbjct: 356 VLTIKRIAEIELSFPPLEIQEKIADILFAFEKLCNDLTEGIPAEIELRKKQ 406 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 53/190 (27%), Gaps = 16/190 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV---SIFA 82 + I+ ++N GR I + + G++ + F Sbjct: 241 ETFQIRDICEINRGRE---------INEKYLRENPGEFPVYSSATTNGGLIGKINDYDFH 291 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 + + G + + CS +L+ K+ + ++ + + Sbjct: 292 GEYVTWTTGGAHAGNVFYRNEKFSCSQNCGLLEVKNKNKFSSKFLCFALKLQSKKFVNYA 351 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKI---IAETVRIDTLITERIRFIELLKEKK 199 + K I I + PPL Q I + + + I I + + Sbjct: 352 S-AIPVLTIKRIAEIELSFPPLEIQEKIADILFAFEKLCNDLTEGIPAEIELRKKQLDYY 410 Query: 200 QALVSYIVTK 209 Q + V Sbjct: 411 QNFLFNWVQN 420 >gi|91775529|ref|YP_545285.1| restriction modification system DNA specificity subunit [Methylobacillus flagellatus KT] gi|91709516|gb|ABE49444.1| restriction modification system DNA specificity domain [Methylobacillus flagellatus KT] Length = 408 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 87/247 (35%), Gaps = 19/247 (7%) Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 ++ +++L+E K+A + + ++GL + + +GL+P+ W + L + Sbjct: 153 QQSSLLDMLQELKRATLGELFSRGLRAEAQK----ETEIGLMPESWSPRTILELCEIWSG 208 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + + + + K R + + V+ G + + R + Sbjct: 209 GTPRKSVTEYWNGDIPWVSGKDLKRPALDDAIDHVSAAGVEAGSRLAPEGAVLLLVRGMG 268 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAW---LMRS------YDLCKVFYAMGSGLRQSL 357 A+ + +I A + T + +RS L G +L Sbjct: 269 LAKDLPVAVINRAMAFNQDVKALVTRGEYSGQFLRSAIYAGKERLLSQIVPSAHGTM-TL 327 Query: 358 KFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 DV+ V P E DI +++ +ID + +L+E S + +T Sbjct: 328 NLNDVETFKVACPSDPDEAKDIVTILHTLDRKID----LHQTKCEVLEELFESLLRKLMT 383 Query: 417 GQIDLRG 423 G+I + Sbjct: 384 GEIAVSD 390 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 63/205 (30%), Gaps = 15/205 (7%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYL 64 K++ IG +P+ W I ++ +G T DI ++ +D++ Sbjct: 183 KETE---IGLMPESWSPRTILELCEIWSGGTPRKSVTEYWNGDIPWVSGKDLKRPALD-D 238 Query: 65 PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK---AIIADFDGICSTQFLVLQPKDVLP 121 D S + + +G +L G L K + + + L + Sbjct: 239 AIDHVSAAGVEAGSRLAPEGAVLLLVRGMGLAKDLPVAVINRAMAFNQDVKALVTRGEYS 298 Query: 122 ELLQGWLLSIDVTQRIEAI-CEGATMSHADWKGIGNIPMPIPPLAEQ-VLIREKIIAETV 179 + + + I + + + P ++ I + Sbjct: 299 GQFLRSAIYAGKERLLSQIVPSAHGTMTLNLNDVETFKVACPSDPDEAKDIVTILHTLDR 358 Query: 180 RIDTLITERIRFIELLKEKKQALVS 204 +ID T+ EL + + L++ Sbjct: 359 KIDLHQTKCEVLEELFESLLRKLMT 383 >gi|15645090|ref|NP_207260.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori 26695] gi|2313566|gb|AAD07524.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori 26695] Length = 365 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 55/403 (13%), Positives = 115/403 (28%), Gaps = 56/403 (13%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ---S 73 W+ +K K+ G T + I +I +D+ + G+Y+ K S Sbjct: 2 SEWQTFCLKDLGKIVGGATPPTNNPKNYGNKISWITPKDLSTLQGRYIKKGSRSISRLGF 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K IL+ P IA+ + F + P + + L Sbjct: 62 KSCSCVLLPKHAILFSSRAPI-GYVAIAEKRLCTNQGFKSIIPNKKI-YFEFLYYLLKYH 119 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFI 192 + EG T+ +G + IPP EQ I + +I+ Sbjct: 120 KNNFINMGEGTTIKGIYNIALGLFKVKIPPTYYEQQKIARTLSILDQKIENNHKINELL- 178 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 H + + KN KL Sbjct: 179 --------------------------------------HTLAYKIYEYYFKYKPKNAKLE 200 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + I + +++ + + + P I+ N + + Sbjct: 201 QIIIENPKSNIMVKNAQKTQDKYPFFTSGDNILSYPKAIIDGRNCFLNTGGNAGIKFYVG 260 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + ++ + + S YL L+ S + L+ +K+ P+ +P + Sbjct: 261 KASYSTDTWCICANEF-SDYLYLLLSSIKNHINQSFFQGTSLKHLQKNLLKKYPIYMPSV 319 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 E N + L+ ++ L++ R + + Sbjct: 320 HEIKKF----NQIMMPLLTLISINTRTSKKLEQIRDFLLPLLL 358 >gi|307747955|gb|ADN91225.1| Type I restriction modification enzyme [Campylobacter jejuni subsp. jejuni M1] Length = 1279 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 46/404 (11%), Positives = 125/404 (30%), Gaps = 31/404 (7%) Query: 27 VVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVS 79 +V +K G T DI ++ + D + + S Sbjct: 890 LVKLKICGDFFMGGTPSRKNINYWNGDIKWLTISDYSNHQVIMDTKEKITREGFKNSNAK 949 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG ++ + + + I D + + + P + + + ++ Sbjct: 950 MIQKGAVVVS-IYATIGRVGILGEDMTTNQAIVAIIPNEEFINKYLMYAI-DYFKFQLYN 1007 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + + N+ +P PPL Q I + + +TL + L+K Sbjct: 1008 EVITTSQQNINLGILQNMVIPKPPLEIQKQIVAECEKIEEQYNTLSLSIKEYQNLIKAML 1067 Query: 200 QA--LVSYIVTKGLNP------DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 Q ++ LN ++ + E++ + + +L+ L Sbjct: 1068 QKCGIIEDNQEYELNSILDKINNLCKINLDSEFLSSFNKTIKEYALSNPIFKLSIGKRVL 1127 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + + E + Y + V ID + + Sbjct: 1128 NNELLENGQIPVYSANVLEVFGFVNKEILQDY----DNDSVLWGIDGDWMVGFIPKNKKF 1183 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 ++ Y+++++ + F + + +K L V +P Sbjct: 1184 YPTDHCGVLRVDDTKI-NAKYISFILNEAGKKQGFSR-----KLRASIDRIKALRVKLPS 1237 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ Q I ++I+ +I+ + + + + L++ + + + Sbjct: 1238 LEFQDQIADIID----KIEKKINEYKIELDRLEKEKEKILQKYL 1277 >gi|256833460|ref|YP_003162187.1| restriction modification system DNA specificity domain-containing protein [Jonesia denitrificans DSM 20603] gi|256686991|gb|ACV09884.1| restriction modification system DNA specificity domain protein [Jonesia denitrificans DSM 20603] Length = 384 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 52/158 (32%), Gaps = 5/158 (3%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++S+G + K E+ + G++++ L + + + Sbjct: 41 NMSHGRFVSGDFVYVSQEKFEADLSRNSAQGGDLIYTQRGTLGQVALLPPGKELHVISQS 100 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + D Y+ + S S + + L + +P I EQ Sbjct: 101 QMRLRIDEAKADPLYVYYASTSPHFLWQIDNRAISTGVPHINLGILGDLEIPLPSIAEQR 160 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 I + +D +E +++ + ++ + AAA Sbjct: 161 AIAATLGA----LDDKIESNRRAVTIAEQLGDALFAAA 194 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 53/392 (13%), Positives = 116/392 (29%), Gaps = 56/392 (14%) Query: 46 KDIIYIGLEDVESG---TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 + I ++ G +G ++ ++D S S G ++Y + G + A++ Sbjct: 32 SGVPVIRGANMSHGRFVSGDFVYVSQEKFEADLSRNS-AQGGDLIYTQRGTLGQVALLPP 90 Query: 103 FDGIC----STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIP 158 + S L + P + S +I+ + H + +G++ Sbjct: 91 GKELHVISQSQMRLRIDEAKADPLYVYYASTSPHFLWQIDNRAISTGVPHINLGILGDLE 150 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 +P+P +AEQ I + A +I++ +L A S D+ M Sbjct: 151 IPLPSIAEQRAIAATLGALDDKIESNRRAVTIAEQLGDALFAAAASESRLLSDVADITM- 209 Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 G P ++ + TR+ G++ Sbjct: 210 -------GSSPKGADLNEDGDGLPFYQG-----------------------TRDFGVRFP 239 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 S + + + I A+ + L + MR Sbjct: 240 SLRVWTTAPVRTAAKSDTLMSVRAPVGELNRASADCCIGRGVAAIHSDTH-PSTLYYAMR 298 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPV------LVPPIKEQFDITNVINVETARIDVL 392 S + + S+ DV + + ++ + + ID Sbjct: 299 SSSSAWEKFQGEGTVFASVNKTDVHGAEIRWVGDGAL--LE--------LEDKLRAIDAR 348 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 +E +E L R + + ++G++ + E Sbjct: 349 IESLESETQRLTALRDALLPELLSGRMRVPAE 380 >gi|145641327|ref|ZP_01796906.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae R3021] gi|145273870|gb|EDK13737.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae 22.4-21] Length = 428 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 63/438 (14%), Positives = 126/438 (28%), Gaps = 75/438 (17%) Query: 23 KHWKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVES-GTGKYLPKDGNSRQSD 74 WK +K + TG+T S + ++ +D+ T + + D Sbjct: 2 SDWKCYQLKNLGVIKTGKTPPSSCKDAFSNTGVPFVTPKDMNGVKTIFKTERYLSKIGLD 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + K I +G + KA + D + + Q L + + Sbjct: 62 LVKNYLVPKNSIAVSCIGSDMGKAYLLSEDSVTNQQINTLIVNK-NHNFEFIYYKLSIMQ 120 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +++I G+ + I + +P L Q EK+ +I ++ Sbjct: 121 DYLKSIAGGSATPILNKSHFSEIEIELPDLDTQNNFVEKLKYLDKKIQLNTQINQTLEQI 180 Query: 195 LKEKKQALV---------SYIVTKGL---------------------------------- 211 + ++ ++ GL Sbjct: 181 AQALFKSWFVDFDPVCAKVQALSDGLSLEQAELAAMQAISGKTPEELTALSQTQPERYAE 240 Query: 212 --NPDVKMKDSGIEWVG-LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 +E G P WE PF + E K L + S++ II + Sbjct: 241 LAETAKAFPCEMVEVDGVEAPKGWETIPFKDFIKEKKEKVGSLKNTPEYSVTNNGIIPRS 300 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + N L + ++ ++VF + + V + G ++SAY Sbjct: 301 QKFNKQLSKNPEKNK-LLHKTDLVFGMSREILNWGIM----VDDIGSVSSAYHVYSIDKN 355 Query: 329 DSTYLA---WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV- 384 +L +M + S QS E +K V+VP N + Sbjct: 356 VINHLYLKMMMMNKFQYFNELIRPSSREGQSFDKELLKEKTVIVPS--------NFLLDH 407 Query: 385 ---ETARIDVLVEKIEQS 399 + ++ + I++ Sbjct: 408 FLYKLELLNHQINTIKKK 425 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 21/199 (10%), Positives = 46/199 (23%), Gaps = 18/199 (9%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 ++K+ G+ G P F I R Sbjct: 4 WKCYQLKNLGVIKTGKTPPSSCKDAFSNTGVPFVTPKDMNGVKTIFK----------TER 53 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + +V I I K L + E + + + + Sbjct: 54 YLSKIGLDLVKNYLVPKNSIAVSCIGSDMGKAYL----LSEDSVTNQQINTLIVNKNHNF 109 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + S + G L + + +P + Q + + +D Sbjct: 110 EFIYYKLSIMQDYLKSIAGGSATPILNKSHFSEIEIELPDLDTQNNFVEKL----KYLDK 165 Query: 392 LVEKIEQSIVLLKERRSSF 410 ++ Q L++ + Sbjct: 166 KIQLNTQINQTLEQIAQAL 184 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 52/173 (30%), Gaps = 5/173 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK W+ +P K F K + Y +G K + + Sbjct: 261 PKGWETIPFKDFIKEKKEKVGSLKNTPEY---SVTNNGIIPRSQKFNKQLSKNPEKNKLL 317 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 K +++G L I+ D G S+ + V + L ++ ++ Q + Sbjct: 318 HKTDLVFGMSREILNWGIMVDDIGSVSSAYHVYSIDKNVINHLYLKMMMMNKFQYFNELI 377 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ + + + + + + + ++ I + + Sbjct: 378 RPSSREGQSFDKE--LLKEKTVIVPSNFLLDHFLYKLELLNHQINTIKKKQKH 428 >gi|3335660|gb|AAC78315.1| restriction-modification enzyme MpuUII S subunit [Mycoplasma pulmonis] Length = 369 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 44/365 (12%), Positives = 108/365 (29%), Gaps = 19/365 (5%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVDENIAKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + + +P L Q I + I + E +K +++ Sbjct: 116 AYGYLKNYNINDFEVNLPNLKTQSAIIKIIEPKEDLFFRHKNLVRIDSEENTKKDLSILI 175 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 I+ L + D I H+ F I + G I Sbjct: 176 KIIEP-LEKQINAFDELILSEQKSLQHYLNYFFGKFYQIEPSLFHDYKLEKIAKIRRGKI 234 Query: 265 IQKLETRNM---------GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I + + K Y + + I + I Sbjct: 235 INSFDLKENPGDYPVISSNTKNNGIFGYLNSYMYDGEYITISADGAYAGTVFLNNGKFSI 294 Query: 316 ITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 ++ + ++ + +L + ++ + ++ R S++ + + + +P ++ Sbjct: 295 TNVCFILLLNDKVNLLTKFLFYYLKKNENIIQKKSIVGSSRPSVREYTLSEIAIKIPSLE 354 Query: 374 EQFDI 378 Q I Sbjct: 355 IQSAI 359 >gi|237739317|ref|ZP_04569798.1| predicted protein [Fusobacterium sp. 2_1_31] gi|229422925|gb|EEO37972.1| predicted protein [Fusobacterium sp. 2_1_31] Length = 504 Score = 76.8 bits (187), Expect = 7e-12, Method: Composition-based stats. Identities = 59/458 (12%), Positives = 136/458 (29%), Gaps = 72/458 (15%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP +W + + + ++ Y+ ++ + G S + Sbjct: 34 IPSNWVWTRYDVLFSDIS-KNEKKIEEKNYLENGEIAIVSQGKDKIVGYSDILEVKPYKE 92 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 ++ G + +F + + + + L ++ Sbjct: 93 ELP--LII--FGDHTLNVKYIEFPFYIGADGVKVLKTTDIIIPKFLFYLLNNLKTFSLIN 148 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + P+ PL EQ I EK+ +I ++ +K Sbjct: 149 TGYRRHYPI----LKKLFFPLSPLNEQKRIVEKLDFLFEKIKRAKEIIEEIKIDIENRKI 204 Query: 201 ALVSYIVTKGLNPDVK--MKDSGIEWV--------------------------------- 225 +++ L + K S ++ + Sbjct: 205 SILDRAFKGTLTSKWRSENKISDVKELLKSINEEKIKKWEEDCLQAEKDGNKKPKKPTIT 264 Query: 226 -------------GLVPDHWEVKPFFALVTELNRKNTKLIE-----SNILSLSYGNIIQK 267 +PD W +V K I+ I + + Sbjct: 265 EVKDMIVPVDEQPYKLPDSWVWVRLGDIVEINPNKIKINIDENELVDFIPMKNVSENSPE 324 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYMAVKP 325 + N + Y +I+F I ++N K ++ S + G ++ + ++ Sbjct: 325 IIENNFEKFKNLQKGYSQFIENDILFAKITPCMENGKTAIVSNLKEKIGYGSTEFHVLRS 384 Query: 326 HGIDSTYLAW-LMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 I S L + ++ K + GS + + E ++ P +PP++EQ +I V+ Sbjct: 385 TKIISNKLLYNFLKQQRFRKDAKYNMTGSVGFRRVPTEFMRSYPFPLPPLEEQQEIVRVL 444 Query: 383 NVETARIDVLVEK--IEQSIVLLKERRSSFIAAAVTGQ 418 + + + E +E+ I +L+ S + A G+ Sbjct: 445 DEVLENENKVKELLELEERIDILE---KSILHKAFKGE 479 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 83/217 (38%), Gaps = 12/217 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGR---TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P W V + ++N + + + + +I +++V + + + + ++ Sbjct: 279 KLPDSWVWVRLGDIVEINPNKIKINIDENELVDFIPMKNVSENSPEIIENNFEKFKNLQK 338 Query: 77 TVSIFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLV-LQPKDVLPELLQGWLL 129 S F + IL+ K+ P + + + G ST+F V K + +LL +L Sbjct: 339 GYSQFIENDILFAKITPCMENGKTAIVSNLKEKIGYGSTEFHVLRSTKIISNKLLYNFLK 398 Query: 130 SIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + G+ + + + P P+PPL EQ I + + + E Sbjct: 399 QQRFRKDAKYNMTGSVGFRRVPTEFMRSYPFPLPPLEEQQEIVRVLDEVLEN-ENKVKEL 457 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 + E + +++++ L +S +E + Sbjct: 458 LELEERIDILEKSILHKAFKGELGTQNSSDESAMELL 494 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 73/195 (37%), Gaps = 10/195 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E +P +W + L +++++ K+ E N L I+ + + + +G Sbjct: 29 EQPYTIPSNWVWTRYDVLFSDISKNEKKIEEKNYLENGEIAIVSQGKDKIVGYSDILEVK 88 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + I+F L ++ + + I +L +L+ + Sbjct: 89 PYKEELPLIIFGDHTLN-----VKYIEFPFYIGADGVKVLKTTDIIIPKFLFYLLNNLKT 143 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + +G R+ + +K+L + P+ EQ I ++ +I E IE+ + Sbjct: 144 FSLIN---TGYRRH--YPILKKLFFPLSPLNEQKRIVEKLDFLFEKIKRAKEIIEEIKID 198 Query: 403 LKERRSSFIAAAVTG 417 ++ R+ S + A G Sbjct: 199 IENRKISILDRAFKG 213 >gi|294793171|ref|ZP_06758317.1| putative toxin-antitoxin system, toxin component [Veillonella sp. 6_1_27] gi|294456116|gb|EFG24480.1| putative toxin-antitoxin system, toxin component [Veillonella sp. 6_1_27] Length = 374 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 50/407 (12%), Positives = 116/407 (28%), Gaps = 49/407 (12%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + +K N + G IG++ ++ + S + F Sbjct: 3 EWVMKKLKDIADFNPRESLAKGTVAKKIGMDKLQ----PFCRDVLGYDLEQFSGGTKFRN 58 Query: 84 GQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G + ++ P L + G ST+++V + K+ + E +L+ + + Sbjct: 59 GDTIMARITPCLENGKTAKVSILDDGEVGFGSTEYIVFRAKNSVDEDFIYYLVCSPLVRE 118 Query: 137 I--EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +++ + + N+ + +P EQ I + + +I + Sbjct: 119 PAIKSMVGSSGRQRVQTDVVQNLEIMVPDYEEQRRISGLLKSLDDKIALNNAINNNLAQQ 178 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K Q + G P W+ + + K + Sbjct: 179 AKTIYQTWFEKFILS---------------NGSCPPTWKRGILADIANITSGKRPPKKST 223 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + I T +G E+ T +I+ G + + + Sbjct: 224 KK--QNGFEIPLLGATSIVGFTNEANYTNKILVIGRV---GTHGIVQRINFPCWASDNTL 278 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 +ITS + + + + D+ ++ +L+P Sbjct: 279 VITSEL------------YEYTFQILQKINYHAMNRGSTQPLITQADMNKVDILIPD--- 323 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I ++ E L E R + ++G++D+ Sbjct: 324 -NQILTEFESIVGQLMKKYETNLMENTKLAELRDYLLPCLLSGELDV 369 >gi|331017720|gb|EGH97776.1| restriction modification system DNA specificity domain protein [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 381 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 56/194 (28%), Positives = 85/194 (43%), Gaps = 5/194 (2%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G +PK WK + + T + SE + Y+GLE + + + + Sbjct: 176 LGQVPKGWKFGILGDIAQTVTRKATVSEFNDQLNYVGLEHIPRKSLSLI--NWGCADGLA 233 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE-LLQGWLLSIDVT 134 S+ S+F+K IL+GKL PY K +IA DG+CST LV QPK ++ L S + Sbjct: 234 SSKSVFSKTDILFGKLRPYFHKVVIAPIDGVCSTDVLVCQPKVNDYYGIVLMHLFSESLI 293 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + GA M WK + PM IPP + I+ I + I + I+L Sbjct: 294 SYANRLSNGAKMPRVSWKDLAAYPMCIPPSDIAMSFNSVILPMVGEIISNIEQIQTVIQL 353 Query: 195 LKEKKQALVSYIVT 208 + L+S V Sbjct: 354 RETLLPKLISGEVR 367 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 51/365 (13%), Positives = 119/365 (32%), Gaps = 28/365 (7%) Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSIDV 133 + + G++L +G + A+ + + V + E + L S Sbjct: 12 SRTRLKGGEVLLTLVGSVGQVAVASKKLKGFNVARAVAVIHPIDSVEAEWIALCLRSPLS 71 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + + K + +P+P PP +E+ I + A I L Sbjct: 72 KHLLGSRANTTVQTTINLKDLRELPIPFPPESERKEITAALGALDSCIAVLHETNATLQS 131 Query: 194 LLKEKKQALV-----------SYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFAL 240 + + ++ S + + + E +G VP W+ + Sbjct: 132 IAQTIFKSWFVDFNPVHAKSESRAPSYIDTGTADLFPNDFESSALGQVPKGWKFGILGDI 191 Query: 241 VTELNRK--NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + RK ++ + + L N G + + +I+F + Sbjct: 192 AQTVTRKATVSEFNDQLNYVGLEHIPRKSLSLINWGCADGLASSKSVFSKTDILFGKLRP 251 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL-MRSYDLCKVFYAMGSGL-RQS 356 K + G+ ++ + +P D + + + S L + +G Sbjct: 252 YFHKVVIAPID----GVCSTDVLVCQPKVNDYYGIVLMHLFSESLISYANRLSNGAKMPR 307 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + ++D+ P+ +PP +VI I + IE I + + R + + ++ Sbjct: 308 VSWKDLAAYPMCIPPSDIAMSFNSVILPMVGEI---ISNIE-QIQTVIQLRETLLPKLIS 363 Query: 417 GQIDL 421 G++ L Sbjct: 364 GEVRL 368 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 56/134 (41%), Gaps = 6/134 (4%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++ + + GE++ + ++ S ++ + + + +++ ++A + Sbjct: 8 DAKYSRTRLKGGEVLLTLVGSVGQ-VAVASKKLKGFNVARAVAVIHPIDSVEAEWIALCL 66 Query: 338 RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 RS + + + + ++ +D++ LP+ PP E+ +IT + +D + + Sbjct: 67 RSPLSKHLLGSRANTTVQTTINLKDLRELPIPFPPESERKEITAALGA----LDSCIAVL 122 Query: 397 EQSIVLLKERRSSF 410 ++ L+ + Sbjct: 123 HETNATLQSIAQTI 136 >gi|255690847|ref|ZP_05414522.1| type I restriction-modification system, S subunit [Bacteroides finegoldii DSM 17565] gi|260623571|gb|EEX46442.1| type I restriction-modification system, S subunit [Bacteroides finegoldii DSM 17565] Length = 251 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 54/173 (31%), Gaps = 12/173 (6%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIE----------SNILSLSYGNIIQKLETRNMGLK 276 P +W + + E + + ++ N Sbjct: 80 ETPKNWVWTRLSHIANIYTGNSISETEKKSKFTDVIGRYYIGTKDVDFNNRIIYDNGIAI 139 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 P+ YE + P + I+ + R + + + P Y+ + Sbjct: 140 PKQYEPDFRLAPNNSILMCIEGGSAGRKIAILN--QDVCFGNKLCCFSPFVGIGKYMYYY 197 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 ++S ++F +G+ + VK + + +PPIKEQ I I ++ Sbjct: 198 LQSPSFFELFNLNKTGIIGGVSIAKVKEILIPLPPIKEQQRIVAQIEKLFEQL 250 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 53/171 (30%), Gaps = 11/171 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 PK+W + + TG + + YIG +DV+ + Sbjct: 82 PKNWVWTRLSHIANIYTGNSISETEKKSKFTDVIGRYYIGTKDVDFNNRIIYDNGIAIPK 141 Query: 73 SDTSTVSIFAKGQILYGKL-GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + IL G RK I + D + P + + + +L S Sbjct: 142 QYEPDFRLAPNNSILMCIEGGSAGRKIAILNQDVCFGNKLCCFSPFVGIGKYMYYYLQSP 201 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + G + + I +P+PP+ EQ I +I ++ Sbjct: 202 SFFELFNLNKTG-IIGGVSIAKVKEILIPLPPIKEQQRIVAQIEKLFEQLR 251 >gi|317013880|gb|ADU81316.1| Type I restriction/modification specificity protein [Helicobacter pylori Gambia94/24] Length = 414 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 57/409 (13%), Positives = 117/409 (28%), Gaps = 33/409 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGT-GKYLPKDGNSRQSDT 75 W+ +K K+ TG+T ++ ++I D+ P+ + + Sbjct: 2 SEWQTFCLKDLGKIVTGKTPKTSNLDFFNGKYMFITPNDLHGTYRIIKTPRTLSDSGLKS 61 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + IL G +G + D + Q + + + + Sbjct: 62 IQNNTIDNISILVGCIGDVGMVRMCFDKCA-TNQQINSITDIKDFCNPYYLYYYLSNKKE 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + I + I + +P + Q I + +I+ ++L Sbjct: 121 LFKNIALSTVVPIIPKTIFQEIEVLLPNIETQQKIARTLSILDQKIENNHKINELLHKIL 180 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSG-----IEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + + N G E L+P+ +EVK LV + + + Sbjct: 181 ELLYEQYFVRFDFSDENNKPYQTSGGKMKFSKELNRLIPNDFEVKTLGELVDIFSGYSFQ 240 Query: 251 LIESNILSLSYGNIIQK---------LETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + Y I K T N+ P+ Y +++P I+ Sbjct: 241 SNTYSNNKNDYILITNKNVQHSLIDLSITTNLLFLPKKLPKYCLLEPTNILITLTGHIGR 300 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SGLRQSLKF 359 + S + I+ V P + + L+R+ + +Q+L Sbjct: 301 CALVFS----KNCILNQRVGVVLPKEKELNPFYYSLIRNPLFSAILQRNAIGSSQQNLSP 356 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 D ++ + I + I L+ QS L R Sbjct: 357 IDTLKIQIPF-----NHKIIKQYSKTCENIIKLLVSNMQSTQTLTALRD 400 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 8/158 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKY-LPKDGNSRQS 73 IP ++V + + +G + + D I I ++V+ + + Sbjct: 218 IPNDFEVKTLGELVDIFSGYSFQSNTYSNNKNDYILITNKNVQHSLIDLSITTNLLFLPK 277 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSID 132 + IL G R A++ + I + + V+ PK+ L + + Sbjct: 278 KLPKYCLLEPTNILITLTGHIGRCALVFSKNCILNQRVGVVLPKEKELNPFYYSLIRNPL 337 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 + ++ G++ + I +P + Sbjct: 338 FSAILQRNAIGSSQQNLSPIDTLKIQIPFNHKIIKQYS 375 >gi|108563200|ref|YP_627516.1| putative type I restriction-modification enzyme specificity subunit S [Helicobacter pylori HPAG1] gi|107836973|gb|ABF84842.1| putative type I restriction-modification enzyme specificity subunit S [Helicobacter pylori HPAG1] Length = 444 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 46/434 (10%), Positives = 122/434 (28%), Gaps = 58/434 (13%) Query: 22 PKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + + + T + + + ++ Sbjct: 13 PKGVEFRKLGEVLEYDQPNQYCVTSKEFDKSYPTPVLTAG----------KTFILGYTNE 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + +K + + + S+ +L K+ + + Sbjct: 63 KDNIYQASKNAPVIIFDDFTTATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFYM---Q 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR-----------IDT 183 I + +PIPPL Q I + + A T ++T Sbjct: 120 IIPYNIGGEHARQWISRYSQ--LEVPIPPLEIQQEIVKILDAFTELNTELNTELNTELNT 177 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLN-------PDVKMKDSGIEWVGLVPDHWEVKP 236 + ++ + E Q ++ N K L P E + Sbjct: 178 ELNTELKARKKQYEYYQNMLLDFNDINSNHKDAKMSAKPYPKRLKTLLQTLAPKGVEFRK 237 Query: 237 FFALVTELNR-------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + ++ + + ++ N Q ++ E + G Sbjct: 238 LGDIGEFYGGLVGKSKKSFSQGNKFYVPYINVFNNPQLDLNALESVQIGDKEKQNTIQLG 297 Query: 290 EIVFRFID------LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +++F + + + + + + + + ++L +R Y+ Sbjct: 298 DVLFTGSSENLEDCAMSCVVTQKIEKDIYLNSFCFGFRFFDKNLFNPSFLKHFLRDYNFR 357 Query: 344 KVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K + +G R ++ + + ++ + +PP++ Q +I +++ +A L+ I I Sbjct: 358 KNISKVANGVTRFNVSKQLLSQITIPIPPLEIQQEIVKILDQFSALTTDLLAGIPAEIKA 417 Query: 403 LKE----RRSSFIA 412 K+ R + Sbjct: 418 RKKQYEYYREKLLT 431 >gi|257784005|ref|YP_003179222.1| restriction modification system DNA specificity domain-containing protein [Atopobium parvulum DSM 20469] gi|257472512|gb|ACV50631.1| restriction modification system DNA specificity domain protein [Atopobium parvulum DSM 20469] Length = 386 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 53/398 (13%), Positives = 106/398 (26%), Gaps = 31/398 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + V + F G+ ++ K + + + + +G Sbjct: 10 ERVRLTSFVS-AAGKRNKGAKCTDVYSVTNSHGFVPSTEYFSKEVFSKELEAYRLVERGM 68 Query: 86 ILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAIC 141 + Y + + + + S ++V P L +L S +I Sbjct: 69 LAYNPSRINVGSIALQESADRVVVSPLYVVFSVDTRHLAPGYLLRFLKSKPGLNQIAFRS 128 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G S+ + + + MP+P + Q + +I+ L+K + Sbjct: 129 SGTVRSNLKFDALSLLEMPLPSIDVQEKRLVVLSRLEKQIEARGEFIASLDTLVKSRFIE 188 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + + N ++ G P RK + I + Sbjct: 189 MFGDPIALNSNKKSRLDSFAKIITGNTPS---------------RKKPEYYGDYIEWIKT 233 Query: 262 GNIIQKLETRN--MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 NI L + +I G ++ I + Sbjct: 234 DNITSTPVLTKAAESLSEDGASAGRIAPSGSVLMSCIAGSVKSIGKVAIADRPVAFNQQI 293 Query: 320 YMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + GI + YL W++ LC G+ L + R VPP Q + Sbjct: 294 NAIIPADGILTEYLYWMLSLSKDYLCSDINMQLKGI---LNKTALSRKMFCVPPPSLQQE 350 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + ++D L E+ L+ S Sbjct: 351 FATFV----RQVDKLRVVAEEQKKKLQTLYDSLAQEYF 384 >gi|94970783|ref|YP_592831.1| restriction modification system S subunit [Candidatus Koribacter versatilis Ellin345] gi|94552833|gb|ABF42757.1| restriction modification system S subunit [Candidatus Koribacter versatilis Ellin345] Length = 436 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 60/410 (14%), Positives = 124/410 (30%), Gaps = 42/410 (10%) Query: 45 GKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD- 102 + + +I D+++ + N T I A G IL G + A++ D Sbjct: 36 KRGVAFIRAADMDASDVLFDTASRINDVARKRITKGIGAPGDILLSHKGTVGKVALVPDD 95 Query: 103 -FDGICSTQ--FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS--HADWKGIGNI 157 +CS Q F D L L + A G T + + Sbjct: 96 APPFVCSPQTTFWRTLKGDRLDRRYLHAYLRSPYFHQQLASRAGETDMAPYVSLTSQRGL 155 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT---KGLNPD 214 + +P + Q I + A +I + ++ + Q+ K L Sbjct: 156 HVLMPDIDIQRRIGSIVGALDAKISVERKIKGTLADIARALFQSWFVDFDPVRAKSLGSS 215 Query: 215 VKMKDS---------GIEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYG 262 + S +G +P W V + LN + E+ L + Sbjct: 216 SSLPASLESLFPDTFEESELGQIPSGWTVGSLDQIAHFLNGLALQRFPPNENGSLPVIKI 275 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 ++ T L + + IV G+++F + +G + Sbjct: 276 AQLKAGNTEGADLASPNLDPGYIVQDGDVLFSWSGSLECVV-----WSGGKGALNQHLFK 330 Query: 323 VKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 V + + R D + A + ++ + +L+P + Sbjct: 331 VTSKDYPKWFFYLWIHRHLDEFRRIAAAKATTMGHIQRYHLSEAKILLP-----HK--KL 383 Query: 382 INVETARIDVLVEKI-----EQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 ++ I L+E I + I L R + ++G++ + +++ Sbjct: 384 LDAADRIIGPLIESINVRAVQSKI--LGRIRDLLLPKLISGELAIEDDAE 431 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 53/192 (27%), Gaps = 10/192 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +G IP W V + + G + I + +++G + + Sbjct: 235 LGQIPSGWTVGSLDQIAHFLNGLALQRFPPNENGSLPVIKIAQLKAGN----TEGADLAS 290 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + I G +L+ G L + + G + + KD W+ Sbjct: 291 PNLDPGYIVQDGDVLFSWSGS-LECVVWSGGKGALNQHLFKVTSKDYPKWFFYLWIHRHL 349 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 R A + TM H + + +P I I+ + Sbjct: 350 DEFRRIAAAKATTMGHIQRYHLSEAKILLPHKKLLDAADRIIGPLIESINVRAVQSKILG 409 Query: 193 ELLKEKKQALVS 204 + L+S Sbjct: 410 RIRDLLLPKLIS 421 >gi|32455758|ref|NP_862217.1| restriction modification system subunit S [Lactobacillus delbrueckii subsp. lactis] gi|6469512|gb|AAF13313.1|AF109691_6 type I S-subunit [Lactobacillus delbrueckii subsp. lactis] Length = 389 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 52/394 (13%), Positives = 124/394 (31%), Gaps = 33/394 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + T + ++ K + E + D S S I + Sbjct: 21 WEQRKLGDVCEPITD-SIDTQKYPNEVFAEYSMPAFDASMKPDIVLGSSMNSVRKIITRP 79 Query: 85 QILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +L KL ++ + + +CS +F+ L V L S T+ +E Sbjct: 80 CLLVNKLNVRKKRIWYVKKPNKNAVCSAEFIPLYSDTVDLTFLNQVAKSETFTRYLENHS 139 Query: 142 EG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + + + IP + EQ LI + + I ++ + L Sbjct: 140 SGSSNSQKRITPRSLMLSKLHIPTIEEQKLIGKIFESLDHTITLHEEKKRQLECLKSALL 199 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 Q + + K P V+ + W E + +V + + +L + + Sbjct: 200 QKMFAD---KSGYPVVRFEGFDKAW--------EERKLKDVVEKQIKGKAQLEKLAPGEV 248 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 Y + + N G + + ++ + Sbjct: 249 EYLDTSR----LNGGQAILTNGLKDVTLDDILILWDGSKAGTVYHGFEGALGST------ 298 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +S ++ ++ + ++ + ++ + + + VP EQ I Sbjct: 299 -LKAYRTSANSKFVYQYLKRHQ-DNIYNNYRTPNIPHVQKDFLNVFTISVPVSDEQEKIG 356 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + ++D + ++ + LLKE++ F+ Sbjct: 357 SF----FKQLDDTIAFHQRKLDLLKEQKKGFLQK 386 >gi|226223150|ref|YP_002757257.1| specificity determinant HsdS [Listeria monocytogenes Clip81459] gi|225875612|emb|CAS04315.1| Putative specificity determinant HsdS [Listeria monocytogenes serotype 4b str. CLIP 80459] Length = 400 Score = 76.4 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 67/184 (36%), Gaps = 7/184 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNII--QKLETRNMGLKPESYETYQIVDPG 289 WE + LV + K + + +L+ S I Q+ N + E+ Y ++ G Sbjct: 19 WEQRKLGDLVVDYVEKTSVQNQFPMLTSSQQKGIVLQEDYFANRQVTTENNIGYFVLPRG 78 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 FR ND +++RGII+ Y DS + + + ++ Sbjct: 79 YFTFRSRS-DNDVFVFNRNDIIDRGIISYFYPVFTLKSADSDFFLRRINNGIQRQLSIQA 137 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + L + K + + P EQ I + ++D + ++ + LK+ + Sbjct: 138 EGTGQHVLSLKKFKNIVAMFPSEGEQKKIGSF----FKQLDDTIALHQRKLDTLKQMKKG 193 Query: 410 FIAA 413 + Sbjct: 194 LLQQ 197 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 49/398 (12%), Positives = 114/398 (28%), Gaps = 30/398 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + +TS + + + + + + + +G Sbjct: 19 WEQRKLGDLVVDYVEKTSVQNQFPMLTSSQQKGIVLQEDYFANRQVTTENNIGYFVLPRG 78 Query: 85 QILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + + GI S + V K ++ + +++ Sbjct: 79 YFTFRSRSDNDVFVFNRNDIIDRGIISYFYPVFTLKSA-DSDFFLRRINNGIQRQLSIQA 137 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 EG K NI P EQ I ++D I R ++ LK+ K+ Sbjct: 138 EGTGQHVLSLKKFKNIVAMFPSEGEQKKIGSF----FKQLDDTIALHQRKLDTLKQMKKG 193 Query: 202 LVSYIVTKGLN--PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 L+ + K P ++ D EW E + K + + + + Sbjct: 194 LLQQMFPKSEEDVPKIRFADFDEEWY-QRKLGEEFEKINERNDGSFGKTHWISVAKMYFV 252 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 N L ++ G+I F + K A + GI++ Sbjct: 253 EP----------NKVLSNNIDTRTYVMRKGDIAFEGHSNTDFKFGRFVANDIGPGIVSEL 302 Query: 320 YMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGLRQS----LKFEDVKRLPVLVPPIKE 374 + + D+ Y ++ + Y+ + L + + + +E Sbjct: 303 FPVYRHKTNYDNNYWKNAIQLEHIMAPIYSKSITSSGNSSNKLDSKHFLNQKIYIADFEE 362 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 Q I ++ ++D + + + + +++ Sbjct: 363 QEKIGSI----FKQLDNTIILYQNKLNKFDILKKAYLQ 396 Score = 37.9 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 52/189 (27%), Gaps = 9/189 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W + + R S +I + + ++ + + + + Sbjct: 216 EEWYQRKLGEEFEKINERNDGSFGKTHWISVAKMY-----FVEPNKVLSNNIDTRTYVMR 270 Query: 83 KGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 KG I + + R GI S F V + K + ++ Sbjct: 271 KGDIAFEGHSNTDFKFGRFVANDIGPGIVSELFPVYRHKTNYDNNYWKNAIQLEHIMAPI 330 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + ++ K + +EKI + ++D I + Sbjct: 331 YSKSITSSGNSSNKLDSKHFLNQKIYIADFEEQEKIGSIFKQLDNTIILYQNKLNKFDIL 390 Query: 199 KQALVSYIV 207 K+A + + Sbjct: 391 KKAYLQTMF 399 >gi|307268430|ref|ZP_07549808.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX4248] gi|306515237|gb|EFM83774.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX4248] Length = 389 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 47/402 (11%), Positives = 129/402 (32%), Gaps = 37/402 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDII------YIGLEDVESGTGKYLPKDGNSRQSDTST 77 +W++ +R + + + YI D+ + + ++ N Sbjct: 8 NWELCKFERIFEKVKSYSLSREVETNEFTGMKYIHYGDIHTKKADKVSENSNIPNIIKKN 67 Query: 78 VSIFAKGQILYGKLGPYLRKAI-------IADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 ++ G ++ + FD + + L+PK++ P L + + Sbjct: 68 FALLEIGDLILTDASEDYKGIATPAVIRENTSFDIVAGLHTIALRPKNIDPMFLYYLIKA 127 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + G + + + IP E L+ + +ID + R Sbjct: 128 PTFRKYGYKVGTGMKVFGISSSKVLDFTTYIPKNDETKLVSSFL----EKIDYALDLHQR 183 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++ LKE K+A + + K +++ + E + ++ + + K Sbjct: 184 KLDQLKELKKAYLQLMFPKKDETVPQVRFANFEENWEL------CKLENIIEKQIKGKAK 237 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + S+ Y + R G KP + V +I+ + + K Sbjct: 238 VENLCNGSVEYLDA-----NRLNGGKPIYTKALPDVSERDIIILWDGSKAGKVYY----- 287 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 +G++ S A + ++ + + ++ + + P+ + Sbjct: 288 GFKGVLGSTLKAYQLKECANSQFIYQQLLDNQNNIYNNYRTPNIPHVVKNFSSIFPIWMT 347 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +EQ + +++ + +D + + + + S++ Sbjct: 348 SFEEQSQMADIL----SNLDNRIILQQNLTDTMISLKKSYLQ 385 Score = 40.5 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 59/184 (32%), Gaps = 15/184 (8%) Query: 23 KHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI- 80 ++W++ ++ + G+ +E++ +G+ +YL + + T ++ Sbjct: 217 ENWELCKLENIIEKQIKGKAK----------VENLCNGSVEYLDANRLNGGKPIYTKALP 266 Query: 81 -FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++ I+ G K F G+ + Q K+ + +D I Sbjct: 267 DVSERDIIILWDGSKAGKVYY-GFKGVLGSTLKAYQLKECANS-QFIYQQLLDNQNNIYN 324 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + H P+ + EQ + + + RI I L K Sbjct: 325 NYRTPNIPHVVKNFSSIFPIWMTSFEEQSQMADILSNLDNRIILQQNLTDTMISLKKSYL 384 Query: 200 QALV 203 Q + Sbjct: 385 QNMF 388 >gi|258651343|ref|YP_003200499.1| restriction modification system DNA specificity domain-containing protein [Nakamurella multipartita DSM 44233] gi|258554568|gb|ACV77510.1| restriction modification system DNA specificity domain protein [Nakamurella multipartita DSM 44233] Length = 400 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 52/401 (12%), Positives = 114/401 (28%), Gaps = 36/401 (8%) Query: 37 NTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST---VSIFAKGQILYGKLG 92 N GR+ + I V+ + + ++ T S G I++ G Sbjct: 19 NRGRSCPTEAEGFPLIATNCVKDDSLYPVFENVRYVSQATYRDWFRSHPEPGDIVFVCKG 78 Query: 93 PYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWL--LSIDVTQRIEAICEGATMSHA 149 R A++ D + + L+ + + + +V RIE + G + H Sbjct: 79 SPGRIAMVPDPVPFCIAQDMVALRANSRIVNPHYLYYALKNQEVRARIENMHVGTMIPHF 138 Query: 150 DWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 G + + + L++Q+ I E + A +I EL E + Sbjct: 139 KKGDFGKLHLDVHVRLSDQMAIAEVLGALDDKIAGNSKMASTAGELATE---CFRDVSID 195 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 + K + I G + + + + Sbjct: 196 ATFDETTFEKVAAIGGGGTP----------STKVPGYWDGPIAWATPTDLTALPGPYLER 245 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 R++ L + G I+ A + ++ V P Sbjct: 246 TARSITLSGLDNCASALFPRGAILMTSRATIG-----AFAIAQRPVAVNQGFIVVVPEDP 300 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + + + + L + LPV VP V+ R Sbjct: 301 QMKWWLFHTMRDRVDEFISHANGATFLELSRGRFRSLPVRVPA-------GRVLRAFDER 353 Query: 389 IDVLVEKIEQSI---VLLKERRSSFIAAAVTGQIDLRGESQ 426 ++ + ++ L E R + + ++G++ ++ + Sbjct: 354 VEAIHAVARHALVENTELAELRDTLLPHLMSGRLRVKDAEK 394 >gi|88854448|ref|ZP_01129115.1| type I restriction-modification system specificity determinant [marine actinobacterium PHSC20C1] gi|88816256|gb|EAR26111.1| type I restriction-modification system specificity determinant [marine actinobacterium PHSC20C1] Length = 388 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 45/396 (11%), Positives = 109/396 (27%), Gaps = 47/396 (11%) Query: 30 IKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF-AKG 84 + + G+ + I ++ + G + + D + F G Sbjct: 21 LGDVGTVVRGKRFVKDDMQDAGVPCIHYGEIYTKYGVSATESFSFVSEDRAKTLRFAEPG 80 Query: 85 QILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 ++ G + + I + P + + S +I Sbjct: 81 DVILVSAGEAIEDIGKSVAWLGDEPIAIHDACYAFSSAMDPRFVSYFFASRGFRDQIRQK 140 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + +S + + + +P+PPL Q I + T L E E Sbjct: 141 ISSSKISSISTRAVASARIPVPPLEVQREISRILDDFTELEAELEAELEARREQYVAYSG 200 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L++ SG + + E + A+ + + +N S Sbjct: 201 TLLN------------FGHSGQVRRAPMGEVAEFRRGSAITARQTKPGVIPVVANGPKPS 248 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + +V L S + + Sbjct: 249 LFHNVSNRTGET------------------VVIARSGAY---AGLVSYWDQPIFLTDAFS 287 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + ++ +R+ G+G+ ++ ++V++ + VP I EQ I Sbjct: 288 IHPDLEILRPRFVYHWLRTEQASLHSMKKGAGV-PHVRVKEVEQRFIPVPTIAEQVRILE 346 Query: 381 VINVETARIDVLV----EKIEQSIVLLKERRSSFIA 412 +++ A ++ L ++ + R + Sbjct: 347 ILDNFDALVNDLSIGLPAELAARRTQYEYYRDKLLT 382 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 55/166 (33%), Gaps = 5/166 (3%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYE 281 G H + +V + ++ + + YG I K + + + Sbjct: 13 GREVQHMALGDVGTVVRGKRFVKDDMQDAGVPCIHYGEIYTKYGVSATESFSFVSEDRAK 72 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 T + +PG+++ + A + + I +D ++++ S Sbjct: 73 TLRFAEPGDVILVSAGEAIEDIGKSVAWLGDEPIAIHDACYAFSSAMDPRFVSYFFASRG 132 Query: 342 LCKVFYAMGSGLRQSLKFED-VKRLPVLVPPIKEQFDITNVINVET 386 S + S V + VPP++ Q +I+ +++ T Sbjct: 133 FRDQIRQKISSSKISSISTRAVASARIPVPPLEVQREISRILDDFT 178 >gi|261491603|ref|ZP_05988186.1| putative type I restiction/modification specificity protein [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261312729|gb|EEY13849.1| putative type I restiction/modification specificity protein [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 197 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 54/162 (33%), Gaps = 10/162 (6%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I G +K E Y+ Y G I+ E Sbjct: 37 IPFYKIGTFGKKPNAYISRELFEDYKQKYSYPRKGNILISASGTIGRTVIF----DGEDS 92 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + + + +L Y + A G G Q L +++K+L + VPP+ E Sbjct: 93 YFQDSNIVWIENDESQVLDKFLFYLYQIADWNIAEG-GTIQRLYNDNLKKLKIPVPPLSE 151 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q I N+++ + + + E + + I L +E R + Sbjct: 152 QQKIVNILDKFDSLTNSITEGLPKEIKLRREQYGYYREQLLN 193 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 56/188 (29%), Gaps = 9/188 (4%) Query: 27 VVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + + + + DI + + Y+ ++ + S Sbjct: 11 WKSLGEIGEARMCKRILKEQTSNVGDIPFYKIGTFGKKPNAYISRELF--EDYKQKYSYP 68 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 KG IL G R I D +V + L I Sbjct: 69 RKGNILISASGTIGRTVIFDGEDSYFQDSNIVWIEN--DESQVLDKFLFYLYQIADWNIA 126 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 EG T+ + + +P+PPL+EQ I + +++ + I+L +E+ Sbjct: 127 EGGTIQRLYNDNLKKLKIPVPPLSEQQKIVNILDKFDSLTNSITEGLPKEIKLRREQYGY 186 Query: 202 LVSYIVTK 209 ++ Sbjct: 187 YREQLLNF 194 >gi|149007168|ref|ZP_01830832.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP18-BS74] gi|225856551|ref|YP_002738062.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae P1031] gi|147761206|gb|EDK68173.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP18-BS74] gi|225725686|gb|ACO21538.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae P1031] Length = 373 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 40/396 (10%), Positives = 116/396 (29%), Gaps = 31/396 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKGLITKRKLQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++P + +++ + + L +K++ + +PP+ Q + Sbjct: 282 MVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFA 340 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + ++D I++S+ L+ + S + Sbjct: 341 DFVV----QVDKSQLAIQKSLEELETLKKSLMQEYF 372 >gi|321310235|ref|YP_004192564.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802079|emb|CBY92725.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 216 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 57/177 (32%), Gaps = 8/177 (4%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQK--LETRNMGLKPESYETYQIVDPGEIVFRF 295 + KN+ +S + NI + P+++ +++ G+IV Sbjct: 22 CEMHLGTAFKNSFYRDSGFPIVKTSNIQGGLVITDNLKYCNPDNHLDSEVIKYGDIVMAK 81 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LR 354 + + S + P + M G Sbjct: 82 DGSCGK---VGINLTSQEFFFDSHVVKFVPDEEILIGGYLYHCLLNFQSEIEGMAKGSTI 138 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + ++ +++RL + VP ++ Q I ++ L ++++Q + +E + + Sbjct: 139 RGIRKSELERLKIPVPSLETQTRIAETLDKFQELKQELKQELKQELK--QELKQELL 193 >gi|268603246|ref|ZP_06137413.1| type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae PID1] gi|268587377|gb|EEZ52053.1| type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae PID1] Length = 398 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 49/395 (12%), Positives = 103/395 (26%), Gaps = 29/395 (7%) Query: 26 KVVPIKRF------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + P+ TK+ G+ E KD + + T + D D Sbjct: 11 EWKPLGEVLVRTKGTKITAGQMKEMHKDNAPLKIFAG-GKTFALVDFD------DVPDKD 63 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I + I+ G + D + + + + Sbjct: 64 IHREPSIIVKSRGII--EFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQENYFRN 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I M N +PIP L Q I + + T TL + L K + Sbjct: 122 IGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAELALRKRQY 181 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + + L+ D ++ + + K + + + + Sbjct: 182 RYYRDLL----LDFDNQIGGGIADGYQCRLKNVVWKTLGEVAEYSKNRICSDKLNEHNYV 237 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 N++Q E + + S +I+ I K G + Sbjct: 238 GVDNLLQNREGKKLSGYVPSEGKMTEYIVNDILIGNIRPYLKKIWQADCTGGTNGDV--L 295 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDI 378 + V ++ YL ++ G + + + +PP+ EQ I Sbjct: 296 VIRVTDEKVNPKYLYQVLADDKFFAFNMKHAKGAKMPRGSKAAIMQYKIPIPPLPEQEKI 355 Query: 379 TNVINVETARIDVL-------VEKIEQSIVLLKER 406 ++ + + + +E+ Sbjct: 356 VAILGKFDTLTHSVSEGLPHEIALRRKQYEYYREQ 390 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 44/126 (34%), Gaps = 6/126 (4%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 59 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 116 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L +E + L Sbjct: 117 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAELAL 176 Query: 403 LK-ERR 407 K + R Sbjct: 177 RKRQYR 182 >gi|227431499|ref|ZP_03913543.1| possible type I site-specific deoxyribonuclease specificity subunit [Leuconostoc mesenteroides subsp. cremoris ATCC 19254] gi|227352745|gb|EEJ42927.1| possible type I site-specific deoxyribonuclease specificity subunit [Leuconostoc mesenteroides subsp. cremoris ATCC 19254] Length = 209 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 60/167 (35%), Gaps = 6/167 (3%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N + + +I + I + + K + + + + + + Sbjct: 45 NPEYWDGDIDWYAPA-EIGEQSYVSKSKKTITELGLKKSSARILPVGTVLFTSRAGIGNT 103 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLP 366 A + + + ++ P R+ +L + G+G + + + ++ Sbjct: 104 AILAKEATTNQGFQSIVPDQNKLDSYFIFSRTNELKRYGEVTGAGSTFVEVSGKQMSKMS 163 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++VP + EQ I + ++D + ++ + LLKE++ F+ Sbjct: 164 IMVPELSEQQKIGSF----FKQLDDTIALHQRKLDLLKEQKKGFLQK 206 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 72/203 (35%), Gaps = 20/203 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV----ESGTGKYLPKD 67 +P+ W+ + + + G T + + G D E G Y+ K Sbjct: 11 KVPELRFKGFTDDWEERKLGELSNIVGGGTPSTSNPEYWDGDIDWYAPAEIGEQSYVSKS 70 Query: 68 GNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + S+ I G +L+ AI+A + F + P + Sbjct: 71 KKTITELGLKKSSARILPVGTVLFTSRAGIGNTAILAKE-ATTNQGFQSIVPDQNKLDSY 129 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + ++ + E G+T K + + + +P L+EQ I ++D Sbjct: 130 FIFSRTNELKRYGEVTGAGSTFVEVSGKQMSKMSIMVPELSEQQKIGSF----FKQLDDT 185 Query: 185 ITERIRFIELLKEKKQALVSYIV 207 I R ++LLKE+K+ + + Sbjct: 186 IALHQRKLDLLKEQKKGFLQKMF 208 >gi|254518117|ref|ZP_05130173.1| type I restriction-modification system specificity subunit [Clostridium sp. 7_2_43FAA] gi|226911866|gb|EEH97067.1| type I restriction-modification system specificity subunit [Clostridium sp. 7_2_43FAA] Length = 377 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 52/401 (12%), Positives = 115/401 (28%), Gaps = 34/401 (8%) Query: 28 VPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + T+L S I+ I + + S++ + +G Sbjct: 4 KKVSEVTELIKRGVSPKYVEDDGILVINQKCIRDNRVDLSLARLTSKEKKITEEKFLNEG 63 Query: 85 QILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 IL G + + T +++P + G+ + ++ + Sbjct: 64 DILINSTGTGTLGRTAQINNINESITVDTHITIMRPSKDVNAKFLGYFIRLNESLITSMG 123 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + NI + +P Q I + + A I+ + + + + Sbjct: 124 KGATNQIELSATDLANIEIYLPGKNIQDKIVKILSAYDNLIENNLKRIKLLEKSAELLYK 183 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 G E++ VP+ W+ K L+ + RK+ E + L Sbjct: 184 EWFINFRFPG--------YEEYEFLNGVPNGWKKKKVGELILKFKRKSKVKKE-DYLEAG 234 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 II + ++ G + V I + + G + Sbjct: 235 EIPIIDQSKSFIGGYTDNEDAKEESVP------AIIFGDHTRIVKYIDFPFASGADGTQL 288 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + Y W +++ DL Y F+ +K + +P + Sbjct: 289 IYSNSTEVSQQYFYWAIKNIDLSNYSYTR--------HFKYLKDEEIYIPSKTVMEKFSE 340 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ +I L L E R + + G+I++ Sbjct: 341 IVGYNFKQITNL----RNQNNKLIEARDILLPKLIIGEIEV 377 >gi|57242479|ref|ZP_00370417.1| putative type I specificity subunit HsdS [Campylobacter upsaliensis RM3195] gi|57016764|gb|EAL53547.1| putative type I specificity subunit HsdS [Campylobacter upsaliensis RM3195] Length = 467 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 49/441 (11%), Positives = 116/441 (26%), Gaps = 65/441 (14%) Query: 29 PIKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGKYLPKDGNSRQSDTST---V 78 P+K F K+ +G+ G+ Y+ ++D++S + S D T Sbjct: 33 PLKNFVKIKSGKRIPKGRSYANTTTAYKYLRVDDLDSEILEIDIDKLKSIDKDIFTLLER 92 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 ++ G + I + + + ++ + L + + + Sbjct: 93 YEIYNDEVALSIAGTIGKVFIFHN---ATNNRVILTENCVKLQAQDNLLPKFLSLILKTN 149 Query: 139 AICEGATMSHADWKGIGN--------IPMPIPPLAEQVLIREKIIAETVRIDT------- 183 + + IPPL+ Q I + + Sbjct: 150 FLQSQMKRQYIQTTIPKLAIERIKELQIPSIPPLSTQQHIIDLMDKAYKAKQEKENKAKE 209 Query: 184 --------------LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 +I L A +S + + + K L Sbjct: 210 LLDSIDSYLLEELGIILPLRANNTLETRIYTAKISALSGSRFDANYHQKYYRDLEKSLFS 269 Query: 230 DHWEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 + + +L+ + + I + +I + K S ++ Sbjct: 270 SPYPLVNLASLINNFKKGIEVGSSEYSQNKEIPFIRVSDITNNGIDFSNVQKFISASLFE 329 Query: 285 IVDPGEIVFRF--IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + + A II+ + ++ + + Sbjct: 330 NLKAYKPKENELLYSKDGTVGICLEADTSCDYIISGGILRLELKAEVDKDFLCFLLGSYI 389 Query: 343 CKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 VF S + + L + L + +PP+ Q I + + + ++ L + E Sbjct: 390 MNVFANRVSIGAVIKHLNIGEFLNLKIPLPPLALQTQIASRL--KNSKFQALSLEKEA-- 445 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 + A +ID+ Sbjct: 446 -------KEILHKA---KIDV 456 >gi|297516523|ref|ZP_06934909.1| specificity determinant for hsdM and hsdR [Escherichia coli OP50] Length = 151 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 19/99 (19%), Positives = 44/99 (44%), Gaps = 1/99 (1%) Query: 321 MAVKPHGIDSTYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ++ + I +L + S + + + R++L +D+K V +P I+EQ +I Sbjct: 10 ISPEYKIIVPMFLHIWLSSPVMQTWLVQSSKEVARKTLNLKDLKNAFVPLPSIEEQHEIV 69 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + A D + +++ ++ + S +A A G+ Sbjct: 70 RRVEQLFAYADSIEKQVNNALARVNNLTQSILAKAFRGE 108 >gi|313898078|ref|ZP_07831617.1| conserved domain protein [Clostridium sp. HGF2] gi|312957106|gb|EFR38735.1| conserved domain protein [Clostridium sp. HGF2] Length = 376 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 61/387 (15%), Positives = 118/387 (30%), Gaps = 45/387 (11%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG-KYLPKDGNSRQSDTSTVSIFAKGQI 86 I +L + I + ++DV K + +DTS + Sbjct: 6 CRIGDCVELYNEVS-----GIPNLTVDDVSGVNREKEFFEPSKQVGNDTSKYKVVPPNYF 60 Query: 87 LYGKLGPYLRKA-----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + K + S + V K+ P L + + + +R Sbjct: 61 ACNLMHVGRDKVLPIAMNHTKLNKYVSPAYTVFCIKENTPLLKDYFFMMLKSEERDRYFW 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 S D L E L ++ T + +L Sbjct: 121 FHTDSSVRDGMTWDAFCDLEFSLPELELQQKYSDIYTAMCLNQQSYEAGLEDL------- 173 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 +V G ++ + ++ NRKN L + S+ Sbjct: 174 ---QVVCHG-------------YIDELRKTLVHHKLGNYISLCNRKNANLK-FGVESVRG 216 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVF-RFIDLQNDKRSLRSAQVMERGIITSAY 320 +I +K + S + Y I++P E + +K SL E I +S+Y Sbjct: 217 ISIEKKFIQTKADMSGVSLKPYTIIEPDEFAYVTVTSRNGEKISLAHNNSDETFICSSSY 276 Query: 321 MAVKPHGID---STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + + + +YLA L + + R++ + ++ + + +P + Q Sbjct: 277 VVFRVNDKNVLLPSYLAMLFGRGEFDRYARFHSWGSTRETFDWNEMCDVEIPIPDVSIQR 336 Query: 377 DITNVINVETARIDVLV--EKIEQSIV 401 DI N+ A ID EK++ I Sbjct: 337 DIVNIYE---AYIDRREINEKLKAQIQ 360 >gi|295426378|ref|ZP_06819031.1| possible type I restriction enzyme S protein [Lactobacillus amylolyticus DSM 11664] gi|295063937|gb|EFG54892.1| possible type I restriction enzyme S protein [Lactobacillus amylolyticus DSM 11664] Length = 393 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 48/401 (11%), Positives = 112/401 (27%), Gaps = 23/401 (5%) Query: 28 VPIKRFTKLNTGRTSES------GKDIIYIGLED--VESGTGKYLPKDGNSRQSDTSTVS 79 V I+ K+ TG+T + G+ +I D ++ G KY + + ++ + Sbjct: 3 VKIENIGKVVTGKTPSTSNSANFGEGYSFITPADLHIDDGVVKYTKRTITQKGFNSIKNN 62 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIE 138 + IL G +G + + + + Q + L + + Sbjct: 63 TISGLSILVGCIGWDMGNVALVNGKCATNQQINSITDINYELYNPYYIYYWLKLHKNFLF 122 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + NI + IP + Q + ID+ I + L+ Sbjct: 123 KLANVTRTPILKKSDFENIEIEIPNIKVQNTTAGLL----RTIDSKIANNNAISKELESM 178 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + +Y + PD K + + + E NT Sbjct: 179 AKTIYNYWFLQFEFPDKDGKP----YKSNGGKMVWNEQLKQEIPEGWEVNTLKEILKENI 234 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 S + + E + + P ++ ++ Sbjct: 235 KSKVKVKEAAEIGKVPFFTSGEAILFVDKPIVSGLNCYLNTGGNAGIK--WFYGDASYST 292 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 ++ L ++++ + + L+ ++ + +P Sbjct: 293 DTWSLTCDSDMKYLLPFILKGIEPSMDKKFFQGTGLKHLQKNLLRNYIITIPDKNTIDRF 352 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ++N + L + Q LK R + + GQ+ Sbjct: 353 KKIVNNSFKQQSKLFNENLQ----LKSMRDFLLPMLMNGQV 389 >gi|30250415|ref|NP_842485.1| restriction modification system, type I [Nitrosomonas europaea ATCC 19718] gi|30181210|emb|CAD86408.1| Restriction modification system, type I [Nitrosomonas europaea ATCC 19718] Length = 547 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 46/409 (11%), Positives = 114/409 (27%), Gaps = 37/409 (9%) Query: 28 VPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDT---STVS 79 P+++ ++ G+ + + ++ ++ +T + Sbjct: 7 TPLRQLAIVSAGQAAPKSDEFSDYGTPFVRAGSLDRLLSGEPESGLELVSEETARRRKLK 66 Query: 80 IFAKGQILYGKLG--PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + +G +L+ K G + + + L PK +L Sbjct: 67 TYPRGTVLFAKSGMSATKDRVYVLQNPAHVVSHLATLIPKSG---THIDYLRLALKHFPP 123 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPL-AEQVLIREKIIAETVRIDTLITERIRFIELLK 196 ++ + I + +P P Q+ I + I + +L Sbjct: 124 SSLIKDPAYPAIGLGDIEDFKIPTPDSSDAQIRIAHLLGKVEGLIAQRKQHLQQLDDL-- 181 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 L S + +P E P T ++ I Sbjct: 182 -----LKSVFLEMFGDPVRN------EKGWDKPALTAFGKISTGNTPPRSESVNYDGDFI 230 Query: 257 LSLSYGNIIQK---LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + NI + L + V G ++ I + + Sbjct: 231 EWIKTDNITGDAVCVTPSTEHLSEIGARKARTVTSGALLVACIAGSVESIGRAALTDRTV 290 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPI 372 ++ YL L + + + G+++ L D +++ ++ P Sbjct: 291 SFNQQINAIQPGKDVNPLYLYGLFKLS--RSYIQSHATKGMKKILTKGDFEKITMIKPSF 348 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + Q + +++ + + +QS+ L+ + A G++DL Sbjct: 349 EMQNRFAVI----FEKVESIKSRYKQSLADLETLYGALSQQAFKGELDL 393 Score = 43.6 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 59/200 (29%), Gaps = 15/200 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDII-------YIGLEDVESGTGKYLPKDGNSRQSDT 75 K W + F K++TG T + + +I +++ P + + Sbjct: 198 KGWDKPALTAFGKISTGNTPPRSESVNYDGDFIEWIKTDNITGDAVCVTPSTEHLSEIGA 257 Query: 76 STVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 G +L + + +A + D + Q +QP + L + L Sbjct: 258 RKARTVTSGALLVACIAGSVESIGRAALTDRTVSFNQQINAIQPGKDVN-PLYLYGLFKL 316 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 I++ I M P Q +++++ + + + Sbjct: 317 SRSYIQSHATKGMKKILTKGDFEKITMIKPSFEMQNRFA----VIFEKVESIKSRYKQSL 372 Query: 193 ELLKEKKQALVSYIVTKGLN 212 L+ AL L+ Sbjct: 373 ADLETLYGALSQQAFKGELD 392 >gi|301156218|emb|CBW15689.1| unnamed protein product [Haemophilus parainfluenzae T3T1] Length = 450 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 49/458 (10%), Positives = 121/458 (26%), Gaps = 74/458 (16%) Query: 23 KHWKVVPIKRFTKLNTGRTS-------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK ++ G + G+ + ++ + D +++ + Sbjct: 2 SDWKEYKFSELCDISRGASPRPIHEYITDGEGMPWVKIADATKSNSRFIEDTAERIKLSG 61 Query: 76 STVSI-FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ +G ++ +A I L+ K + L ++ Sbjct: 62 VKKSVEVFEGDLILSNSATPGLPKFMAINACIHDGWMLLRNFK--NITKEFAYWLLLNER 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G + + N + IP + EQ I + +I ++ Sbjct: 120 NNLVKQGTGTVFINLKTDILRNHIVKIPSIEEQNKIVSILNGIEDKIQLNTQINQTLEQI 179 Query: 195 LKEKKQALV---------SYIVTKGL---------------------------------- 211 + ++ ++ GL Sbjct: 180 AQALFKSWFVDFDPVRAKVQALSDGLSLEQAELAAMQAISGKTPEELTALSQTQPDRYAE 239 Query: 212 -------NPDVKMKDSGIEWV-----GLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 P ++ G+E + + +K + + + K + +S Sbjct: 240 LVEIAKAFPCEMVEVDGVEVLKGWEVKELGSLMTIKRGGSPRPIKDFISDKGLNWVKISD 299 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + L + +K E +++ G ++ L ++ I Sbjct: 300 ATAEDNPFLFSTKEYIKSEGLSKTVLLNKGSLILSNSATP----GLPRFLELDACIHDGW 355 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + YL + + + + +LK + VK +VP + Sbjct: 356 LYFSDIKSLTQEYLYFFFLNIR-NDLVAQGNGSVFTNLKTDIVKAQKAIVPD----ERVI 410 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + + I L+ + + LKE R + + G Sbjct: 411 YYFDKQVKSIMNLIRYNTANSISLKETRDLLLPKLLNG 448 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 57/204 (27%), Gaps = 17/204 (8%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVE-SGTGKYLPK 66 GV+ + K W+V + + G + S K + ++ + D Sbjct: 256 GVEVL----KGWEVKELGSLMTIKRGGSPRPIKDFISDKGLNWVKISDATAEDNPFLFST 311 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + S + KG ++ + I K + Sbjct: 312 KEYIKSEGLSKTVLLNKGSLILSNSATPGLPRFLELDACIHDGWLYFSDIKSL--TQEYL 369 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + +++ + A G+ ++ + +P + + I LI Sbjct: 370 YFFFLNIRNDLVAQGNGSVFTNLKTDIVKAQKAIVPDE----RVIYYFDKQVKSIMNLIR 425 Query: 187 ERIRFIELLKEKKQALVSYIVTKG 210 LKE + L+ ++ G Sbjct: 426 YNTANSISLKETRDLLLPKLLNGG 449 >gi|15900770|ref|NP_345374.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae TIGR4] gi|149010479|ref|ZP_01831850.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] gi|14972361|gb|AAK75014.1| putative type I restriction-modification system, S subunit [Streptococcus pneumoniae TIGR4] gi|147764960|gb|EDK71889.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] Length = 373 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 40/396 (10%), Positives = 116/396 (29%), Gaps = 31/396 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++P + +++ + + L +K++ + +PP+ Q + Sbjct: 282 MVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFA 340 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + ++D I++S+ L+ + S + Sbjct: 341 DFVV----QVDKSQLAIQKSLEELETLKKSLMQEYF 372 >gi|15829150|ref|NP_326510.1| restriction-modification enzyme subunit S1B [Mycoplasma pulmonis UAB CTIP] gi|14090094|emb|CAC13852.1| RESTRICTION-MODIFICATION ENZYME SUBUNIT S1B [Mycoplasma pulmonis] Length = 369 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 44/365 (12%), Positives = 108/365 (29%), Gaps = 19/365 (5%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVNENIVKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + + +P L Q I + I + E +K +++ Sbjct: 116 AYGYLKNYNINDFEVNLPNLKIQSAIIKIIEPKEDLFFRHKNLVRIDSEENTKKDLSILI 175 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 I+ L + D I H+ F I + G I Sbjct: 176 KIIEP-LEKQINAFDELILSEQKSLQHYLNYFFGKFYQIEPSLFHDYKLEKIAKIRRGKI 234 Query: 265 IQKLETRNM---------GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I + + K Y + + I + I Sbjct: 235 INSFDLKENPGDYPVISSNTKNNGIFGYLNSYMYDGEYITISADGAYAGTVFLNNGKFSI 294 Query: 316 ITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 ++ + ++ + +L + ++ + ++ R S++ + + + +P ++ Sbjct: 295 TNVCFILLLNDKVNLLTKFLFYYLKKNENIIQKKSIVGSSRPSVREYTLSEIAIKIPSLE 354 Query: 374 EQFDI 378 Q I Sbjct: 355 IQSAI 359 >gi|237753062|ref|ZP_04583542.1| type I restriction/modification specificity protein [Helicobacter winghamensis ATCC BAA-430] gi|229375329|gb|EEO25420.1| type I restriction/modification specificity protein [Helicobacter winghamensis ATCC BAA-430] Length = 185 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 59/162 (36%), Gaps = 5/162 (3%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPES-YETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N + ++ + + + LK + Q V G+++ + ++ Sbjct: 18 DNYEFMNYIDIASVSKEIGVIEKMKFLKSDFPSRARQRVFKGDLLISSLSGSQKAIAIVE 77 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLP 366 + T ++ +L L+R++ ++ SG S+ ++ L Sbjct: 78 SDEKNLIASTGFFIISNVANCLKEFLMDLLRTHFFQELLMRESSGAIMASINQKEFLNLK 137 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +PP+ EQ I I+ AR L ++ + LL+ + Sbjct: 138 IPLPPLIEQERIAKEISQRKARAKALKQEAK---ELLESAKK 176 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 38/181 (20%), Positives = 70/181 (38%), Gaps = 5/181 (2%) Query: 28 VPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 V + ++NT ++ + + YI + V G + KG + Sbjct: 2 VRLGEVARVNTKLENIDNYEFMNYIDIASVSKEIGVIEKMKFLKSDFPSRARQRVFKGDL 61 Query: 87 LYGKLGPYLRKAII---ADFDGICSTQ-FLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 L L + I + + I ST F++ + L E L L + + + Sbjct: 62 LISSLSGSQKAIAIVESDEKNLIASTGFFIISNVANCLKEFLMDLLRTHFFQELLMRESS 121 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA M+ + K N+ +P+PPL EQ I ++I R L E +E K++ + + Sbjct: 122 GAIMASINQKEFLNLKIPLPPLIEQERIAKEISQRKARAKALKQEAKELLESAKKEVEHI 181 Query: 203 V 203 + Sbjct: 182 I 182 >gi|225351807|ref|ZP_03742830.1| hypothetical protein BIFPSEUDO_03408 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225157054|gb|EEG70393.1| hypothetical protein BIFPSEUDO_03408 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 199 Score = 76.4 bits (186), Expect = 9e-12, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 52/170 (30%), Gaps = 13/170 (7%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 +N L L GN L+ E G++++ + Sbjct: 39 YSQNELLSSGKYPVLRVGNFYTNDSWYYSNLELEDKN---YAYEGDLLYTWSATFGPHI- 94 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + I V+ A+ + D ++ + ++ Sbjct: 95 ----WHGNKVIYHYHIWKVQLEAALEKLFAFQLLERDKERILSDKNGSTMVHITKTGIEN 150 Query: 365 LPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 VL+P ++EQ I + R+D L+ ++ + LL+ + S + Sbjct: 151 TSVLMPCSVEEQRRIGAFFD----RLDSLITLHQRKLELLRNIKKSMLDK 196 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 50/183 (27%), Gaps = 6/183 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + GR + + + G Y + + +G Sbjct: 22 WEQRKLGEVAHFINGRAYSQNELLSSGKYPVLRVGNF-YTNDSWYYSNLELEDKNYAYEG 80 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +LY I I +Q + L +L LL D + + Sbjct: 81 DLLYTWS-ATFGPHIWHGNKVIYHYHIWKVQLEAALEKLFAFQLLERDKERILSDKNGST 139 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + EQ I IT R +ELL+ K++++ Sbjct: 140 MVHITKTGIENTSVLMPCSVEEQRRIGAFFDRLDSL----ITLHQRKLELLRNIKKSMLD 195 Query: 205 YIV 207 + Sbjct: 196 KMF 198 >gi|240169984|ref|ZP_04748643.1| polypeptide HsdS [Mycobacterium kansasii ATCC 12478] Length = 409 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 63/405 (15%), Positives = 126/405 (31%), Gaps = 23/405 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD-----IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +W+V + + G+ + GK + Y+ +V+ G D Sbjct: 2 NWQVRQLGEIAETALGKMLDKGKQKGLPQVPYLRNVNVQWGRVDTDDLLTMELADDERER 61 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICST--QFLVLQPKDVLPELLQGWLLSIDVTQR 136 + G +L + G R AI + ++P L +LL Sbjct: 62 FGVSAGDLLVCEGGEIGRSAIWHGQADYIAYQKALHRIRPGKSLDVRFLRYLLEHYSLNG 121 Query: 137 IEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 A + G+T++H + + +P+P+PPL EQ I + I R++ L Sbjct: 122 TLAGLATGSTIAHLPQQQLRRVPVPVPPLNEQCRIVDLIEDHLSRLEAGQRWLSVGERKL 181 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + A +S + + +G V + K ++ L N Sbjct: 182 EAFWLAALSA-------SRRALVGAQFRTIGDVAETTLGKML-DAKRQVGSPTPYLRNIN 233 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + L + +S V PG+++ Sbjct: 234 VRWGEF-----DLSDVQLTPLTDSEVQRFDVRPGDVMACEGGEPGRCAVWCRPVGEVAFQ 288 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + V+ G T LM + + + L E ++ + + VP + Sbjct: 289 KALHRIRVRNPGEVLTSFLALMLEEAIRSGRCNRMFTGTTIKHLPQEKLRVIEIPVPALH 348 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q + + + L + + + RSS + AA +G+ Sbjct: 349 TQRQAVDCLAELVGAQERLRAALANAAARIAAMRSSLLTAAFSGR 393 >gi|25026814|ref|NP_736868.1| putative restriction enzyme subunit S [Corynebacterium efficiens YS-314] gi|23492093|dbj|BAC17068.1| putative restriction enzyme subunit S [Corynebacterium efficiens YS-314] Length = 409 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 58/424 (13%), Positives = 130/424 (30%), Gaps = 45/424 (10%) Query: 21 IPKHWKVVPIKR-FTKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P W V ++ TK TG + S + + V + Sbjct: 8 VPDGWTQVHVRDLITKKFTGPSPTCDERPIASDDEWGLLKTTAVTWDGWREEAHKVPPAS 67 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKD--VLPELLQ 125 + G +L K GP R ++ + S + + L+P+ VLP++L Sbjct: 68 YWGNESIEVRAGDVLITKAGPRHRVGVVVHVRSTRPHLMVSGKMVGLRPRTSVVLPQILA 127 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPI--PPLAEQVLIREKIIAETVRIDT 183 G L + V + + A G S ++ + + P + EQ+ I + A +I Sbjct: 128 GLLSTKVVQEYLNARTTGMAESQTNFADEALLSAELVLPTMPEQLRIARILDAIDEQIAA 187 Query: 184 LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 + + LV + P + + I Sbjct: 188 SRRILSKLRLEAEGVLDRLVQELSPADFVPLADLCTADI-----------------CYGI 230 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + +L++ + + ++ V PG+++ Sbjct: 231 VQSGVFVPGGVPVLAIRDLDGDFETGVHLTSRSIDAQYRRSRVAPGDVLLSIKGTIGKVG 290 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYL--AWLMRSYDLCKVFYAMGSGLRQSLKFED 361 + G I+ ++ +L+ ++ A+ R + Sbjct: 291 IVP---DTYNGNISREIARIRFSARTDPAFARYYLLSREAQRRLDLAVVGTTRAEVSIHV 347 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ-SIVLLKERRSSFIAAAVTGQID 420 +K+ P I+ Q ++ V+ R ++ E+ ++ L+ R ++G++ Sbjct: 348 LKKFAFPSPAIQYQRNVARVMTALQER-----QESERIALTKLQAMRRGLFEDLLSGRVR 402 Query: 421 LRGE 424 + E Sbjct: 403 VPAE 406 >gi|332674318|gb|AEE71135.1| type I R-M system specificity subunit [Helicobacter pylori 83] Length = 179 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 60/164 (36%), Gaps = 11/164 (6%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S+ +++ ++ +T I D I R L + I++ Sbjct: 27 SVEQITQQGEIKVYDVNNFIGYTDTTFISDKPYISIVKDGSVGRVRILPP----KTNILS 82 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + H + +L +L+ ++D + + F+D K + +PP+ EQ Sbjct: 83 TMGALIANHRTTTEFLFYLLSNFDFKNF---TSGSIIPHIYFKDYKEKTIFLPPLNEQNA 139 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 140 IANILSDLDNEIASLKNKKRQ----FENIKKALNHDLMSAKIRV 179 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 39/184 (21%), Positives = 66/184 (35%), Gaps = 15/184 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P +W+ V + T + + +E + + G+ D N+ T T I Sbjct: 6 LPLNWQRVRLGDIANYLTSK----------LSVEQI-TQQGEIKVYDVNNFIGYTDTTFI 54 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 K I K G R I+ I ST ++ E L L + D Sbjct: 55 SDKPYISIVKDGSVGRVRILPPKTNILSTMGALIANHRTTTEFLFYLLSNFDFK----NF 110 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ + H +K + +PPL EQ I + I +L ++ +F + K Sbjct: 111 TSGSIIPHIYFKDYKEKTIFLPPLNEQNAIANILSDLDNEIASLKNKKRQFENIKKALNH 170 Query: 201 ALVS 204 L+S Sbjct: 171 DLMS 174 >gi|139438849|ref|ZP_01772309.1| Hypothetical protein COLAER_01313 [Collinsella aerofaciens ATCC 25986] gi|133775560|gb|EBA39380.1| Hypothetical protein COLAER_01313 [Collinsella aerofaciens ATCC 25986] Length = 493 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 56/201 (27%), Positives = 84/201 (41%), Gaps = 6/201 (2%) Query: 21 IPKHWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W + +D +I ++++ GTGK L S Sbjct: 50 IPESWAWARFSEVIGIAARLVDPLKYQDFPHIAPDNIQKGTGKLLFCHSVKADEVKSANH 109 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +F+ GQILY K+ P LRKA+IA FDG+CS L K PE + LLS T+ Sbjct: 110 LFSAGQILYSKIRPALRKAVIAPFDGLCSADMYPLNTKLQ-PEYVLTVLLSNFFTEETLK 168 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE----LL 195 M + K + I +P+PPLAEQ I E++ ++ E L Sbjct: 169 GDTRVKMPKTNQKSLNVILVPVPPLAEQRRIVERVNELMPLVEEYGELEDAREELDAALP 228 Query: 196 KEKKQALVSYIVTKGLNPDVK 216 +++++ V GL P Sbjct: 229 GRLRKSVLQLAVQGGLVPQDP 249 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 67/208 (32%), Gaps = 13/208 (6%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ---KLETRNMG 274 K E +P+ W F ++ R L + ++ NI + KL + Sbjct: 40 KCIADEVPFGIPESWAWARFSEVIGIAARLVDPLKYQDFPHIAPDNIQKGTGKLLFCHSV 99 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E + G+I++ I K + G+ ++ + L Sbjct: 100 KADEVKSANHLFSAGQILYSKIRPALRKAVIAPFD----GLCSADMYPLNTKLQPEYVLT 155 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L+ ++ + + + + V VPP+ EQ I +N ++ Sbjct: 156 VLLSNFFTEETLKGDTRVKMPKTNQKSLNVILVPVPPLAEQRRIVERVNELMPLVEEY-G 214 Query: 395 KIEQSIVLLKE-----RRSSFIAAAVTG 417 ++E + L R S + AV G Sbjct: 215 ELEDAREELDAALPGRLRKSVLQLAVQG 242 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 53/185 (28%), Gaps = 14/185 (7%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI---ESNILSLSYGNIIQKLETRNMGL 275 E +P+ WE ++ T + R + E + N Sbjct: 308 CIEDEIPFEIPESWEWARLESVTTYIQRGKSPKYSTVEKYPVIAQKCNQWSGFSVEKARF 367 Query: 276 KP----ESYETYQIVDPGEIVFRFIDLQN---DKRSLRSAQVMERGIITSAY--MAVKPH 326 Y +I+ G++++ L + + S + + Sbjct: 368 IDPATVSKYADERILKDGDLLWNSTGLGTLGRMAVYDSAKNRYGWAVADSHVTVIRTRED 427 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +D + + SG ++ L E V+ + VPP+ EQ I ++ Sbjct: 428 WLDHRFAFAYFAGPSVQSEIEDQASGSTKQKELAQETVRNYLIPVPPLAEQRRIVYEVDW 487 Query: 385 ETARI 389 + Sbjct: 488 LFKIL 492 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 61/176 (34%), Gaps = 17/176 (9%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP+ W+ ++ T + G++ + + I + +G + K + S Sbjct: 316 EIPESWEWARLESVTTYIQRGKSPKYSTVEKYPVIAQKC-NQWSGFSVEKARFIDPATVS 374 Query: 77 TV---SIFAKGQILYGKLG-PYLRKAIIAD------FDGICSTQFLVLQPKDV--LPELL 124 I G +L+ G L + + D + + V++ ++ Sbjct: 375 KYADERILKDGDLLWNSTGLGTLGRMAVYDSAKNRYGWAVADSHVTVIRTREDWLDHRFA 434 Query: 125 QGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + V IE G+T + + N +P+PPLAEQ I ++ Sbjct: 435 FAYFAGPSVQSEIEDQASGSTKQKELAQETVRNYLIPVPPLAEQRRIVYEVDWLFK 490 >gi|161507539|ref|YP_001577493.1| Type I restriction-modification system specificity subunit [Lactobacillus helveticus DPC 4571] gi|160348528|gb|ABX27202.1| Type I restriction-modification system specificity subunit [Lactobacillus helveticus DPC 4571] Length = 356 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 48/390 (12%), Positives = 101/390 (25%), Gaps = 56/390 (14%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + F + +G+ + + SG+ G D ++ Sbjct: 19 NDWEERKLGDFIDVKSGKDYK-----------HLNSGSIPVYGTGGYMLSVD---RALSD 64 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 I G+ G + ++ T F + PK + + LSI + E Sbjct: 65 IDAIGIGRKGTIDKPYLLKAPFWTVDTLFYAV-PKQNID---LQFSLSIFKKINWKKFDE 120 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + I ++ +P EQ I I + F E + L Sbjct: 121 STGVPSLSKTVINSVGASVPSYEEQQKIGSFFKQLDKTIALHQRKLESFQFTYHEIIRRL 180 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 L + + + E +L + Sbjct: 181 FLKKAKWQLTKLSDL----------------------VTILDKNRKPVKKEDRLLGDTPY 218 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 ++ G GE + D N V + + + Sbjct: 219 YGANGIQDYISGFT----------HKGEFILIAEDGANSLTEYPIYFVKGQIWVNNHAHV 268 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 +K + S +L + R L +D++ + + +P EQ I Sbjct: 269 LKVNRDVSP--LFLALALKQINYSKYTVGSSRNKLNLKDLENIAIFIPDNNEQQKIGQFY 326 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + + ++ I +K+ + + Sbjct: 327 SNYLNYLR----INKKRIQYMKQFKQFLLQ 352 >gi|240117543|ref|ZP_04731605.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae PID1] Length = 407 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 49/395 (12%), Positives = 103/395 (26%), Gaps = 29/395 (7%) Query: 26 KVVPIKRF------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + P+ TK+ G+ E KD + + T + D D Sbjct: 20 EWKPLGEVLVRTKGTKITAGQMKEMHKDNAPLKIFAG-GKTFALVDFD------DVPDKD 72 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I + I+ G + D + + + + Sbjct: 73 IHREPSIIVKSRGII--EFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQENYFRN 130 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I M N +PIP L Q I + + T TL + L K + Sbjct: 131 IGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAELALRKRQY 190 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + + L+ D ++ + + K + + + + Sbjct: 191 RYYRDLL----LDFDNQIGGGIADGYQCRLKNVVWKTLGEVAEYSKNRICSDKLNEHNYV 246 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 N++Q E + + S +I+ I K G + Sbjct: 247 GVDNLLQNREGKKLSGYVPSEGKMTEYIVNDILIGNIRPYLKKIWQADCTGGTNGDV--L 304 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDI 378 + V ++ YL ++ G + + + +PP+ EQ I Sbjct: 305 VIRVTDEKVNPKYLYQVLADDKFFAFNMKHAKGAKMPRGSKAAIMQYKIPIPPLPEQEKI 364 Query: 379 TNVINVETARIDVL-------VEKIEQSIVLLKER 406 ++ + + + +E+ Sbjct: 365 VAILGKFDTLTHSVSEGLPHEIALRRKQYEYYREQ 399 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 44/126 (34%), Gaps = 6/126 (4%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 68 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 125 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L +E + L Sbjct: 126 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAELAL 185 Query: 403 LK-ERR 407 K + R Sbjct: 186 RKRQYR 191 >gi|168464570|ref|ZP_02698473.1| restriction modification system DNA specificity domain [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195632638|gb|EDX51092.1| restriction modification system DNA specificity domain [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 464 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 54/152 (35%), Gaps = 7/152 (4%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G+I + + + + + G+IVF + + E+ I++ + M Sbjct: 50 GDIFSESDFVFVSPDKANELQRNMAFRGDIVFTQRGTLGQVALIPEDSLYEKYIVSQSQM 109 Query: 322 A--VKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 V P D+ ++ R+ + + G + +K + +PP+ EQ I Sbjct: 110 KLTVNPKQADAYFIYTYFRTNEAKALIENNAIVGGVPHINLGILKEFKLRLPPLSEQKRI 169 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + + ID + Q L++ + Sbjct: 170 ----SEVSKSIDNKINLNRQINQTLEQMSQTL 197 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 55/455 (12%), Positives = 134/455 (29%), Gaps = 72/455 (15%) Query: 25 WKVVPIKRF-----TKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 W V I G S + I ++ + D Sbjct: 5 WTTVSINDIKLPEKYSCVGGPFGSSLSQKHYVDSGVPVIRGTNLAGDI--FSESDFVFVS 62 Query: 73 SDTST---VSIFAKGQILYGKLGPYLRKAIIAD----FDGICSTQFL--VLQPKDVLPEL 123 D + ++ +G I++ + G + A+I + I S + + PK Sbjct: 63 PDKANELQRNMAFRGDIVFTQRGTLGQVALIPEDSLYEKYIVSQSQMKLTVNPKQADAYF 122 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + + + IE + H + + + +PPL+EQ I E + +I+ Sbjct: 123 IYTYFRTNEAKALIENNAIVGGVPHINLGILKEFKLRLPPLSEQKRISEVSKSIDNKINL 182 Query: 184 LITERIRFIELLKEKKQA-------LVSYIVTKGLNPDVKMKDSGIE------------- 223 ++ + ++ ++ + G NP + S E Sbjct: 183 NRQINQTLEQMSQTLFKSWFVDFDPVIDNALDAG-NPIPEALQSRAELRQKVRNSADFKP 241 Query: 224 ----------------WVGLVPDHWEVKPFFALVTELNR--KNTKLIESNILSLSYGNII 265 +G VP W ++ F + + K+ + Sbjct: 242 LPADIRTLFPAEFEETELGWVPKGWRIESFSEIAQLVKENVKSEDISSEVHYVGLEHLER 301 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK- 324 + + N G + + G+++F + K ++ GI ++ + + Sbjct: 302 KHIFITNYGNGRDVSSNKSAFNKGDLLFGKLRPYFHKVAITPFS----GICSTDILVFRA 357 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + +A + + + G R + +D+ + +++P +I Sbjct: 358 KEKYYKSLMAMYVFTDEFVAYANLRSIGTRMPRAEAKDLLKYRIVLPN----KNILEKFE 413 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + L R + + ++G+ Sbjct: 414 LLLKNYWSKGQLNNDESKHLTTLRDTLLPKLISGE 448 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 44/190 (23%), Positives = 73/190 (38%), Gaps = 5/190 (2%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G +PK W++ +L ++ Y+GLE +E ++ GN R + Sbjct: 259 LGWVPKGWRIESFSEIAQLVKENVKSEDISSEVHYVGLEHLERKHI-FITNYGNGR-DVS 316 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL-LQGWLLSIDVT 134 S S F KG +L+GKL PY K I F GICST LV + K+ + + ++ + + Sbjct: 317 SNKSAFNKGDLLFGKLRPYFHKVAITPFSGICSTDILVFRAKEKYYKSLMAMYVFTDEFV 376 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 G M A+ K + + +P + + E L Sbjct: 377 AYANLRSIGTRMPRAEAKDLLKYRIVLPNKNILEKFELLLKNYWSKGQLNNDESKHLTTL 436 Query: 195 LKEKKQALVS 204 L+S Sbjct: 437 RDTLLPKLIS 446 >gi|330684146|gb|EGG95895.1| type I restriction modification DNA specificity domain protein [Staphylococcus epidermidis VCU121] Length = 388 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 61/393 (15%), Positives = 126/393 (32%), Gaps = 41/393 (10%) Query: 30 IKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + G+ I +I ++ + + K + D + + + +K + Sbjct: 25 LGSIACFSKGKLGSKKDISQNGIPFILYGELYTKYNAIIEKVYSKIAIDKNNLKVASKNE 84 Query: 86 ILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L G I + + +L PK+ + L+ + Sbjct: 85 VLIPSSGETSIDIATASCIDINEEVAIGGDINILTPKN-VDGRFISLSLNGVNKLELSKY 143 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 +G T+ H I + + +P E+ +KI ++D I R +ELL+++K+ Sbjct: 144 AQGKTVVHLYNNDIKKLKLSLPINFEEQ---QKIGDFFSKLDHQIELEERKLELLEQQKK 200 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + I ++ L + G WE+K + + Sbjct: 201 GYMQKIFSQEL--------RFKDENGNNYPEWEIKELMQIAKVKTGRKNVQDNIQDGKYK 252 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + L ++ I+ PGE + L + + AY Sbjct: 253 FFDR----SVEVKYLNTFDFDETAIIYPGE----------GSKFLPRYFSGKYSLHQRAY 298 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + ++ + S +SL+ +L V+VP EQ I + Sbjct: 299 SIYDININNNYLYYY--LSLQNNHFLKYAVGSTVKSLRMSGFDKLKVMVPKNSEQEKIGS 356 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +D +EK + LLK R+ SF+ Sbjct: 357 F----FKNLDEFIEKQSDKVELLKLRKQSFLQK 385 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 72/196 (36%), Gaps = 12/196 (6%) Query: 237 FFALVTELNRK---NTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGE 290 ++ K + ++ I + YG + K + + ++ E Sbjct: 25 LGSIACFSKGKLGSKKDISQNGIPFILYGELYTKYNAIIEKVYSKIAIDKNNLKVASKNE 84 Query: 291 IVFRFIDLQNDKRSLRS-AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ + + S + E I + P +D +++ + + ++ Sbjct: 85 VLIPSSGETSIDIATASCIDINEEVAIGGDINILTPKNVDGRFISLSLNGVNKLELSKYA 144 Query: 350 GSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 L D+K+L + +P +EQ I + +++D +E E+ + LL++++ Sbjct: 145 QGKTVVHLYNNDIKKLKLSLPINFEEQQKIGDF----FSKLDHQIELEERKLELLEQQKK 200 Query: 409 SFIAAAVTGQIDLRGE 424 ++ + ++ + E Sbjct: 201 GYMQKIFSQELRFKDE 216 >gi|304570622|ref|YP_830484.2| restriction modification system DNA specificity subunit [Arthrobacter sp. FB24] Length = 412 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 61/411 (14%), Positives = 137/411 (33%), Gaps = 37/411 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 HW +V +L G+ + G+ +G + D+ +F Sbjct: 24 HWPLVRSSELFELRYGKA---------LVASGRRPGSVPVYGTNGQTGSHDSP---LFRG 71 Query: 84 GQILYGKLGP-YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ++ G+ G +L + + T + + DV + + + + + Sbjct: 72 PGLILGRKGAGHLGVHWTDNDYWVIDTAYSLSPRDDVDLKFAYYLIKHVGL----NHLKH 127 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + G P+PP+A Q I + +D +I R I+LL+E A+ Sbjct: 128 GTSNPSLTRDAFGAQYFPLPPVATQGAIATTL----SALDDMIDSNRRKIDLLEELGAAI 183 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 V + L+ + +G V E + + ++ S+ Sbjct: 184 VEQRLH--LDAYGFPEYERGRRLGDVLRVLETGSRPKGGAAPSG--SGVVSLGAESIQSA 239 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRF-----IDLQNDKRSLRSAQVMERGIIT 317 + +++ + + ++ +++ + + +E I Sbjct: 240 GVCTTNVFKHIPEEFAARMKRGHLEEEDVLVYKDGGRPGNFIPHVSAFGYGFPVEEAAIN 299 Query: 318 SA-YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQ 375 Y GI L WL+RS + + G+G L + + LP+ + + E Sbjct: 300 EHVYRVRSSDGISQALLYWLLRSPWMDQEMRKRGTGVAIPGLNSSNFRDLPLPI--LTET 357 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 V+N + + + ++ L R+ + +TG+I + E++ Sbjct: 358 D--VEVLNDRLSPVLASMLRLGTESGRLAALRNVLLPELLTGRIRV-PEAE 405 >gi|238918946|ref|YP_002932460.1| type I restriction-modification system, S subunit [Edwardsiella ictaluri 93-146] gi|238868514|gb|ACR68225.1| type I restriction-modification system, S subunit [Edwardsiella ictaluri 93-146] Length = 461 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 48/449 (10%), Positives = 124/449 (27%), Gaps = 61/449 (13%) Query: 27 VVPIKRFTKLN---TGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + +L T E + I + +++ +G ++ + + Sbjct: 7 TTKLADLCELVVDCPHSTPEWTDSGFIVLRNQNIRNGVLDLSSPSFTNKDGFLNRIKRAK 66 Query: 83 K--GQILYGKLGPYLRKAIIADFDGIC-STQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G ++ + P +I C + ++L+P+ + W L Q + Sbjct: 67 PQEGDLVITREAPMGEVCLIPAGLECCLGQRQVLLRPRKGVSGYYLFWALQSPYVQHQIS 126 Query: 140 ICEGATMSHADWKGIGNIPMPIPPL-AEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 EG + ++ + + IP L + + + + +I ++ + Sbjct: 127 WNEGTGTTVSNIRIPILKELNIPRLLDSEDAVASCLNSLANKITLNRQINQTLEQMAQAL 186 Query: 199 KQALVSYI---VTKGLNPDVKMKDSGIEW------------------------------- 224 ++ V L+ ++S + Sbjct: 187 FKSWFVDFDPVVDNALDAGFFEQNSELSEELLRRAEQRKAVREQPDFKPLPAETRQLFPA 246 Query: 225 ------------VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 G VP W A ++ N + K + Sbjct: 247 AFEACEEPSLGLGGWVPKGWSGSSVGAEFNLTMGQSPASSTYNDIGDGIPFFQGKTDF-- 304 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 G + S Y + ++ I A + ++ Sbjct: 305 -GFRFPSNRIYCSSPKRMANKHDTLVSVRAPVGDINLAADKCAIGRGVAAARHGSGSVSF 363 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + +++ + + S+ +D K +PV+ + ++ A +D Sbjct: 364 TYYTLKNLSKYFSVFNGEGTVFGSINQKDFKSIPVV----SVTTRLVAEFDLFCAHLDSR 419 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +E E ++ L R + + ++G++ L Sbjct: 420 IEVNENEVIALSNLRDTLLPKLISGELRL 448 >gi|226198243|ref|ZP_03793814.1| type I restriction-modification system specificity determinant protein [Burkholderia pseudomallei Pakistan 9] gi|225929763|gb|EEH25779.1| type I restriction-modification system specificity determinant protein [Burkholderia pseudomallei Pakistan 9] Length = 277 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 24/209 (11%), Positives = 68/209 (32%), Gaps = 12/209 (5%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM-------GLKP 277 +G +P W V + + E + + + + + Sbjct: 66 LGEIPKGWAVSTVGRVAQCVGGGTPSTKEQKFWEPAIHHWTTPKDLSGIAAPVLLDTERR 125 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 S V G + + + + A + Y+A+ P G + + Sbjct: 126 LSDAGLAKVSSGLLPVGTLLMSSRAPIGYLAISQIPLAVNQGYIAMLPGGQLAPEYLYFW 185 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 ++ + + + +P+++P + + A+I + + E Sbjct: 186 CQSNMDAIKQKANGSTFMEISKTAFRPIPIVLPSSE----VAACFADLAAKIFERISEGE 241 Query: 398 QSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + + L+E R++ + ++G++ L E++ Sbjct: 242 RQRIHLEEIRNTLLPRLISGKLRL-PEAE 269 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 56/197 (28%), Gaps = 12/197 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG----------LEDVESGTGKYLPKD 67 +G IPK W V + R + G T + + + L + + + Sbjct: 66 LGEIPKGWAVSTVGRVAQCVGGGTPSTKEQKFWEPAIHHWTTPKDLSGIAAPVLLDTERR 125 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + + + G +L P I+ + ++ + P L + Sbjct: 126 LSDAGLAKVSSGLLPVGTLLMSSRAPI-GYLAISQIPLAVNQGYIAMLPGGQL-APEYLY 183 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I+ G+T IP+ +P + RI + Sbjct: 184 FWCQSNMDAIKQKANGSTFMEISKTAFRPIPIVLPSSEVAACFADLAAKIFERISEGERQ 243 Query: 188 RIRFIELLKEKKQALVS 204 RI E+ L+S Sbjct: 244 RIHLEEIRNTLLPRLIS 260 >gi|116609660|gb|ABK02384.1| restriction modification system DNA specificity domain protein [Arthrobacter sp. FB24] Length = 390 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 61/411 (14%), Positives = 137/411 (33%), Gaps = 37/411 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 HW +V +L G+ + G+ +G + D+ +F Sbjct: 2 HWPLVRSSELFELRYGKA---------LVASGRRPGSVPVYGTNGQTGSHDSP---LFRG 49 Query: 84 GQILYGKLGP-YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ++ G+ G +L + + T + + DV + + + + + Sbjct: 50 PGLILGRKGAGHLGVHWTDNDYWVIDTAYSLSPRDDVDLKFAYYLIKHVGL----NHLKH 105 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + G P+PP+A Q I + +D +I R I+LL+E A+ Sbjct: 106 GTSNPSLTRDAFGAQYFPLPPVATQGAIATTL----SALDDMIDSNRRKIDLLEELGAAI 161 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 V + L+ + +G V E + + ++ S+ Sbjct: 162 VEQRLH--LDAYGFPEYERGRRLGDVLRVLETGSRPKGGAAPSG--SGVVSLGAESIQSA 217 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRF-----IDLQNDKRSLRSAQVMERGIIT 317 + +++ + + ++ +++ + + +E I Sbjct: 218 GVCTTNVFKHIPEEFAARMKRGHLEEEDVLVYKDGGRPGNFIPHVSAFGYGFPVEEAAIN 277 Query: 318 SA-YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQ 375 Y GI L WL+RS + + G+G L + + LP+ + + E Sbjct: 278 EHVYRVRSSDGISQALLYWLLRSPWMDQEMRKRGTGVAIPGLNSSNFRDLPLPI--LTET 335 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 V+N + + + ++ L R+ + +TG+I + E++ Sbjct: 336 D--VEVLNDRLSPVLASMLRLGTESGRLAALRNVLLPELLTGRIRV-PEAE 383 >gi|225858684|ref|YP_002740194.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae 70585] gi|225721300|gb|ACO17154.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae 70585] Length = 373 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 40/396 (10%), Positives = 116/396 (29%), Gaps = 31/396 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKLQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++P + +++ + + L +K++ + +PP+ Q + Sbjct: 282 MVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFA 340 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + ++D I++S+ L+ + S + Sbjct: 341 DFVV----QVDKSQLAIQKSLEELETLKKSLMQEYF 372 >gi|254463147|ref|ZP_05076563.1| hypothetical protein RB2083_3738 [Rhodobacterales bacterium HTCC2083] gi|206679736|gb|EDZ44223.1| hypothetical protein RB2083_3738 [Rhodobacteraceae bacterium HTCC2083] Length = 443 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 45/385 (11%), Positives = 105/385 (27%), Gaps = 22/385 (5%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII- 100 ++ + + G P+D + G +++ K+ I Sbjct: 57 FSPDTEVTLLTIR----FDGSIEPRDPTRICDVKGKLFRVHPGDVVFSKIDVRNGAIGIA 112 Query: 101 ---ADFDGICS--TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + S ++V Q K + + + I + + Sbjct: 113 PNDIKNMCVTSEFPVYIVNQDKTDPDYIKLLFRTDAFMKLLNSMISGASGRKRIQPSQLE 172 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK--------QALVSYIV 207 +P+P + QV + + V + L+ + L + Q+ S Sbjct: 173 KAKVPLPSNSAQVKVADYWRTGDVAKNALVLKLESLTRDLGKWMEGQTVDFTQSCKSRFF 232 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 G + + + + E E YG + Sbjct: 233 VAGYEATQQWDMKAGRAAHFLLSNPDFVRLGDYTEECTESVKPWDEPEKKFPVYGVNNKN 292 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 N ++ F N + QV I + Y + G Sbjct: 293 GVFLNKYQTGNTFNAPYKRIEKNWFFHNPTRANVGSLGKVPQVSNEAITSPEYQVWRLTG 352 Query: 328 -IDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ++A L+R+ + + G++Q + + ++ + + + P+KEQ + Sbjct: 353 GFLPEFMALLIRTDYFLSLVDFNRVGGVKQRMYYSNLADIRLPMVPLKEQQRVAEDYTKL 412 Query: 386 TARIDVLVEKIEQSIVLLKERRSSF 410 A I + + + L+ + Sbjct: 413 LAEI--AEARSDLKLRKLEIEKMIL 435 >gi|256853726|ref|ZP_05559091.1| predicted protein [Enterococcus faecalis T8] gi|256710669|gb|EEU25712.1| predicted protein [Enterococcus faecalis T8] Length = 186 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 46/120 (38%), Gaps = 8/120 (6%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGL 353 I + + S I ++ + + + ++ SY L K G Sbjct: 71 TISARGEGTGTPSYVKAPVWITGNSMVINVEDFDINKKFLYAMLLSYSLKKYI---TGGA 127 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + L + + ++P+++P EQF I ++D + ++ + LLKE + F+ Sbjct: 128 QPQLTRDVLLKVPIIIPSYNEQFKIGTF----FKQLDDTIALQQRKLDLLKETKKGFLQK 183 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 55/196 (28%), Gaps = 26/196 (13%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ + K+ T + + Y N Sbjct: 8 KVPEIRFPGFTGDWEQCKLGDIAKMYQPPTISGSELL-----------DTGYPVFGANGY 56 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S + Q+ G + +V+ +D + + + Sbjct: 57 IGFYSKSNHLE-DQVTISARGEGTGTPSYVKAPVWITGNSMVINVEDFDINKKFLYAMLL 115 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ G + +P+ IP EQ I ++D I + R Sbjct: 116 SY--SLKKYITGGAQPQLTRDVLLKVPIIIPSYNEQFKIGTF----FKQLDDTIALQQRK 169 Query: 192 IELLKEKKQALVSYIV 207 ++LLKE K+ + + Sbjct: 170 LDLLKETKKGFLQKMF 185 >gi|91217496|ref|ZP_01254455.1| specificity determinant HsdS-like protein [Psychroflexus torquis ATCC 700755] gi|91184381|gb|EAS70765.1| specificity determinant HsdS-like protein [Psychroflexus torquis ATCC 700755] Length = 347 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 66/163 (40%), Gaps = 6/163 (3%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG--IITSAYMAV 323 K +R+ ++ + + + +IV D+ N K + + E + A+ Sbjct: 35 SKFISRDGKVRKNTRKQMFPLFEEDIVMVMSDVPNGKALAKCYIIEENNKYSLNQRICAI 94 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + + +L + + + F + + +L+ D+ P+ PP+ EQ I +++ Sbjct: 95 RTTEFNIGFLYYQLNRHSYFLAFNNGEN--QSNLRKGDILNCPLWKPPLSEQKQIVAILD 152 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I+ IE++IV KE S + A + + D G + Sbjct: 153 KAFTAIEQAKANIEKNIVNAKELFQSKLNAIFSQKGD--GWEE 193 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 44/358 (12%), Positives = 106/358 (29%), Gaps = 24/358 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ + K G+ E D+ K++ +DG R++ + + Sbjct: 4 EMTTLGESCKFFNGKAHEKDIDVE----GAFVVVNSKFISRDGKVRKNTRKQMFPLFEED 59 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ---GWLLSIDVTQRIEAICE 142 I+ KA+ + + ++ + Q + ++ A Sbjct: 60 IVMVMSDVPNGKALAKCYIIEENNKYSLNQRICAIRTTEFNIGFLYYQLNRHSYFLAFNN 119 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G S+ I N P+ PPL+EQ I + I+ + I KE Q+ Sbjct: 120 GENQSNLRKGDILNCPLWKPPLSEQKQIVAILDKAFTAIEQAKANIEKNIVNAKELFQSK 179 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 ++ I ++ + + + I + T +++ L S Sbjct: 180 LNAIFSQKGDGWEERQIKDI-----------TTKIGSGATPRGGQSSYKESGISLIRSMN 228 Query: 263 NIIQKLETRNMGLKPESYETY---QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + ++ +++ + + + + Sbjct: 229 VHDDGFRDKKLAFIDDEQANKLSNVTIEENDVLLNITGASVARCCIVDKHFLPARVNQHV 288 Query: 320 YMAVKPHGIDSTYLAWL-MRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKE 374 + GI S + S + + +G RQ++ ++ + P I E Sbjct: 289 SIIRLKEGIMSNKFLHFALTSKETKSLLLGIGEQGATRQAITKVQIENFKIAFPSIIE 346 Score = 39.8 bits (91), Expect = 0.95, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 37/116 (31%), Gaps = 11/116 (9%) Query: 24 HWKVVPIKRF-TKLNTGRTSE------SGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDT 75 W+ IK TK+ +G T I I +V + G + Q++ Sbjct: 190 GWEERQIKDITTKIGSGATPRGGQSSYKESGISLIRSMNVHDDGFRDKKLAFIDDEQANK 249 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQGWL 128 + + +L G + + I D + + +++ K+ + Sbjct: 250 LSNVTIEENDVLLNITGASVARCCIVDKHFLPARVNQHVSIIRLKEGIMSNKFLHF 305 >gi|330937290|gb|EGH41301.1| restriction modification system DNA specificity domain protein [Pseudomonas syringae pv. pisi str. 1704B] Length = 381 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 60/214 (28%), Positives = 90/214 (42%), Gaps = 15/214 (7%) Query: 8 PQYKDSGVQWI----------GAIPKHWKVVPIKRFTKLNTGR--TSESGKDIIYIGLED 55 P Y D+G + G +PK WK + + T + SE + Y+GLE Sbjct: 156 PSYIDTGTADLFPNDFESSAVGQVPKGWKFGILGDIAQTVTRKATVSEFNDQLNYVGLEH 215 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 + + + + S+ S+F+K IL+GKL PY K +IA DG+CST LV Q Sbjct: 216 IPRKSLSLI--NWGCADGLASSKSVFSKTDILFGKLRPYFHKVVIAPIDGVCSTDVLVCQ 273 Query: 116 PKDVLPE-LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 PK ++ L S + + GA M WK + PM IPP + I Sbjct: 274 PKVNDYYGIVLMHLFSESLISYANRLSNGAKMPRVSWKDLAAYPMCIPPSDIAMSFNSVI 333 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + I + I + I+L + L+S V Sbjct: 334 LPMVGEIISNIEQIQTVIQLRETLLPKLISGEVR 367 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 52/365 (14%), Positives = 119/365 (32%), Gaps = 28/365 (7%) Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSIDV 133 + + G++L +G + A+ + + V + E + L S Sbjct: 12 SRTRLKGGEVLLTLVGSVGQVAVASKKLKGFNVARAVAVIHPIDSVEAEWIALCLRSPLS 71 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + + K + +P+P PP +E+ I + A I L Sbjct: 72 KHLLGSRANTTVQTTINLKDLRELPIPFPPESERKEITAALGALDSCIAVLHETNATLQS 131 Query: 194 LLKEKKQALV-----------SYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFAL 240 + + ++ S + + + E VG VP W+ + Sbjct: 132 IAQTIFKSWFVDFNPVHAKSESRAPSYIDTGTADLFPNDFESSAVGQVPKGWKFGILGDI 191 Query: 241 VTELNRK--NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + RK ++ + + L N G + + +I+F + Sbjct: 192 AQTVTRKATVSEFNDQLNYVGLEHIPRKSLSLINWGCADGLASSKSVFSKTDILFGKLRP 251 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL-MRSYDLCKVFYAMGSGL-RQS 356 K + G+ ++ + +P D + + + S L + +G Sbjct: 252 YFHKVVIAPID----GVCSTDVLVCQPKVNDYYGIVLMHLFSESLISYANRLSNGAKMPR 307 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + ++D+ P+ +PP +VI I + IE I + + R + + ++ Sbjct: 308 VSWKDLAAYPMCIPPSDIAMSFNSVILPMVGEI---ISNIE-QIQTVIQLRETLLPKLIS 363 Query: 417 GQIDL 421 G++ L Sbjct: 364 GEVRL 368 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 56/134 (41%), Gaps = 6/134 (4%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++ + + GE++ + ++ S ++ + + + +++ ++A + Sbjct: 8 DAKYSRTRLKGGEVLLTLVGSVGQ-VAVASKKLKGFNVARAVAVIHPIDSVEAEWIALCL 66 Query: 338 RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 RS + + + + ++ +D++ LP+ PP E+ +IT + +D + + Sbjct: 67 RSPLSKHLLGSRANTTVQTTINLKDLRELPIPFPPESERKEITAALGA----LDSCIAVL 122 Query: 397 EQSIVLLKERRSSF 410 ++ L+ + Sbjct: 123 HETNATLQSIAQTI 136 >gi|20090947|ref|NP_617022.1| type I site-specific deoxyribonuclease [Methanosarcina acetivorans C2A] gi|19916030|gb|AAM05502.1| type I site-specific deoxyribonuclease [Methanosarcina acetivorans C2A] Length = 290 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 47/302 (15%), Positives = 96/302 (31%), Gaps = 20/302 (6%) Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + + IE T+ H K + I +P+PPL Q I + Sbjct: 1 MPDFAYRSLVKILKDIEDRTAFVTVKHLSAKQLNTIKIPVPPLETQQKIVSILKKAEET- 59 Query: 182 DTLITERIRFIELLKEKKQALVSY-IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 + E Q L+ + +P V K+ + V + + Sbjct: 60 -------KKLRAQADELTQKLLQSVFLEMFGDPVVNPKNWKEIKLKDVSE---IVSGVTK 109 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 +L K T + ++ + E + + + P E Y + ++ D Sbjct: 110 GRKLAGKPTVFVPYLRVANVQDGYLDLTEIKEIEVLPSDVEKYALQGGDILLTEGGDPDK 169 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMG--SGLRQS 356 R + + I + V+ + YL+ L+ S F + S Sbjct: 170 LGRGAVWNRQIPTCIHQNHIFRVRVNRECLVPEYLSMLIGSTYGKMYFLKSAKQTTGIAS 229 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + +K P L+ + Q +++ +I+ +QS + S + A T Sbjct: 230 INSTQLKNFPALIASLDLQLRFAEMVH----QIEKTTVSQQQSSFKINNLFDSLMQKAFT 285 Query: 417 GQ 418 G+ Sbjct: 286 GE 287 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 61/205 (29%), Gaps = 18/205 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 PK+WK + +K +++ +G T + + Y+ + +V+ G Sbjct: 89 PKNWKEIKLKDVSEIVSGVTKGRKLAGKPTVFVPYLRVANVQDGYLDLTEIKEIEVLPSD 148 Query: 76 STVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQ--FLVLQPKDVLPELLQGWLL 129 G IL + G F V ++ L L+ Sbjct: 149 VEKYALQGGDILLTEGGDPDKLGRGAVWNRQIPTCIHQNHIFRVRVNRECLVPEYLSMLI 208 Query: 130 SIDVTQRI--EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + ++ + ++ + + N P I L Q+ E +I+ Sbjct: 209 GSTYGKMYFLKSAKQTTGIASINSTQLKNFPALIASLDLQLRFAE----MVHQIEKTTVS 264 Query: 188 RIRFIELLKEKKQALVSYIVTKGLN 212 + + + +L+ T L Sbjct: 265 QQQSSFKINNLFDSLMQKAFTGELF 289 >gi|315930741|gb|EFV09751.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni 305] Length = 782 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 46/404 (11%), Positives = 124/404 (30%), Gaps = 31/404 (7%) Query: 27 VVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVS 79 +V +K G T DI ++ + D + + S Sbjct: 393 LVKLKICGDFFMGGTPSRKNINYWNGDIKWLTISDYSNRQVIMDTKEKITREGFKNSNAK 452 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG ++ + + + I D + + + P + + + ++ Sbjct: 453 MIQKGAVVVS-IYATIGRVGILGEDMTTNQAIVAIIPNEEFINKYLMYAI-DYFKFQLYN 510 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + + N+ +P PPL Q I + + +TL + L+K Sbjct: 511 EVITTSQQNINLGILQNMVIPKPPLEIQKQIVAECEKIEEQYNTLSLSIKEYQNLIKAML 570 Query: 200 QA--LVSYIVTKGLNP------DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 Q ++ LN ++ + E++ + + +L+ L Sbjct: 571 QKCGIIEDNQEYELNSILDKINNLCKINLDSEFLSSFNKTIKEYALSNPIFKLSIGKRVL 630 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + + E + Y + V ID + + Sbjct: 631 NNELLENGQIPVYSANVLEVFGFVNKEILQDY----DNDSVLWGIDGDWMVGFIPKNKKF 686 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 ++ Y+++++ + F + + +K L V +P Sbjct: 687 YPTDHCGVLRVDDTKI-NAKYISFILNEAGKKQGFSR-----KLRASIDRIKALRVKLPS 740 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ Q I ++ T +I+ + + + + L++ + + + Sbjct: 741 LEFQDQIADI----TDKIEKKINEYKIELDRLEKEKEKILQKYL 780 >gi|269978348|gb|ACZ55908.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 409 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 50/402 (12%), Positives = 111/402 (27%), Gaps = 29/402 (7%) Query: 22 PKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + + + T + + + ++ Y + N Q+ Sbjct: 13 PKGVEFRKLGEVLEYDQPNQYCVTSKEFDKSYPTPVLTAG--KTFILGYTNEKDNIYQAS 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ I + + S+ +L K+ + + Sbjct: 71 KSSPVIIF-DDF-------TTATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFYM---Q 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I T + +PIPPL Q I + + A T L TE + Sbjct: 120 TIPYNISGEHTRQWISRYSQ--LEVPIPPLEIQQEIVKILDAFTELNTELNTELNTELNA 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K++ Q + + ++ E + Sbjct: 178 RKKQYQYYQNMFLDFNDINQNHKDAKMSAKPYPKRLKTLLQTLAPKGVEFRKLGEVCDFQ 237 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 S++ + G + +Y + GE + I S + Sbjct: 238 KGKSITKKAVTFGKVPVISGGRQPAYYHNEANRSGETI--AISSSGVYAGYVSYWDIPVF 295 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + S ++ K + YL + + G+ + +D++ + +PP++ Sbjct: 296 LADSFPVSPKQKTLMPKYLFHYLTTQQDAIHATKSAGGI-PHVYSKDLQNFLIPIPPLEI 354 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q +I +++ + L+ I I K+ R + Sbjct: 355 QQEIVKILDQFSLLTTDLLAGIPAEIEARKKQYEYYREKLLT 396 >gi|281177458|dbj|BAI53788.1| conserved hypothetical protein [Escherichia coli SE15] Length = 415 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 52/417 (12%), Positives = 113/417 (27%), Gaps = 31/417 (7%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF-AKGQIL 87 IK G + I V + D S+ F + I+ Sbjct: 7 KIKDVCDFVGGSQPPKSQFIYVSKPGYVRLIQTRDYKTDAFPTYIPISSTKKFCDEFDIM 66 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID--VTQRIEAICEGAT 145 G+ GP + + G + L + PK+ + + L D + Sbjct: 67 IGRYGPPIFQIC-RGLKGAYNVALLKVIPKEGVSRDFLYYFLKQDSVFQYVDKLSARTGG 125 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + D + P+ IP E +EK++ ID I R L+ + L Y Sbjct: 126 QTGVDLVSLKEYPVRIPEEIE---CQEKLVTILSVIDKKIALNNRINTELEAMAKTLYDY 182 Query: 206 IVTKGLNPD---VKMKDSGIEW------VGLVPDHWEVKPFFALVTELNR--------KN 248 + PD K SG + +P W + + Sbjct: 183 WFVQFDFPDANGKPYKTSGGKMEYNATLKREIPAGWNDSILGKFIELDRGVTYSKEDVRT 242 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPES--YETYQIVDPGEIVFRFIDLQNDKRSLR 306 ++ + + ++ ++ P S + ++ + Sbjct: 243 QDDKDTIGILRATNVTGNNVDIDDLVFIPSSRVNVNQMLNKFDILIVMSSGSKEHVGKNG 302 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL 365 ++ + + P ++ ++S G +L + Sbjct: 303 VYYFEKKHAFGAFCSKITPVRKYRYFINTFLQSKWFKSYINNQCLGTNINNLTNTHITNC 362 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 ++ P + + + I + Q L + R + + GQ+ ++ Sbjct: 363 EIICPTPD----VVALFENKMMPIYNKLASNTQENSHLIQLRDWLLPLLMNGQVTVK 415 Score = 41.3 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 47/188 (25%), Gaps = 14/188 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 IP W + +F +L+ G T I + +V Sbjct: 213 EIPAGWNDSILGKFIELDRGVTYSKEDVRTQDDKDTIGILRATNVTGNNVDIDDLVFIPS 272 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELLQG 126 + K IL + + P + Sbjct: 273 SRVNVN-QMLNKFDILIVMSSGSKEHVGKNGVYYFEKKHAFGAFCSKITPVRKYRYFINT 331 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L S I C G +++ I N + P L K++ ++ + Sbjct: 332 FLQSKWFKSYINNQCLGTNINNLTNTHITNCEIICPTPDVVALFENKMMPIYNKLASNTQ 391 Query: 187 ERIRFIEL 194 E I+L Sbjct: 392 ENSHLIQL 399 >gi|261494963|ref|ZP_05991432.1| putative type I restiction/modification specificity protein [Mannheimia haemolytica serotype A2 str. OVINE] gi|261309372|gb|EEY10606.1| putative type I restiction/modification specificity protein [Mannheimia haemolytica serotype A2 str. OVINE] Length = 184 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 53/161 (32%), Gaps = 8/161 (4%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I G +K E Y+ + + + + Sbjct: 24 IPFYKIGTFGKKPNAYISRELFEDYKQKYSYPRKGNILISASGTIGRTVIFDGEDSYFQD 83 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 ++ + +L +L + D G Q L +++K+L + VPP+ EQ Sbjct: 84 SNIVWIENDESQVLDKFLFYLYQIADW----NIAEGGTIQRLYNDNLKKLKIPVPPLSEQ 139 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 I N+++ + + + E + + I L +E R + Sbjct: 140 QKIVNILDKFDSLTNSITEGLPKEIKLRREQYGYYREQLLN 180 Score = 44.4 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 53/167 (31%), Gaps = 4/167 (2%) Query: 43 ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 + DI + + Y+ ++ + S KG IL G R I Sbjct: 19 SNVGDIPFYKIGTFGKKPNAYISRELF--EDYKQKYSYPRKGNILISASGTIGRTVIFDG 76 Query: 103 FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP 162 D +V + L I EG T+ + + +P+P Sbjct: 77 EDSYFQDSNIVWIEN--DESQVLDKFLFYLYQIADWNIAEGGTIQRLYNDNLKKLKIPVP 134 Query: 163 PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 PL+EQ I + +++ + I+L +E+ ++ Sbjct: 135 PLSEQQKIVNILDKFDSLTNSITEGLPKEIKLRREQYGYYREQLLNF 181 >gi|330879394|gb|EGH13543.1| type I restriction-modification system subunit S [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 782 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 66/189 (34%), Gaps = 5/189 (2%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--Q 284 VP +WE A+ + +K + + + + L E + + Sbjct: 82 EVPTNWEWVRVAAVGHDWGQKTPD-QAFTYIDVGAVDNAAGTISTPQVLMAEDAPSRARK 140 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS-TYLAWLMRSYDLC 343 +V G +++ I ++ + I+++A+ + P+ Y +RS Sbjct: 141 VVRSGTVIYSTIRPYLLNVAVIDKAYEQEPIVSTAFAIIHPYLEMPARYFLCYLRSPVFV 200 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + ++ G+ ++ + +PP+ EQ I ++ A + L + + Sbjct: 201 RYVESVQIGIAYPAINDGQFFSGLIPLPPLAEQHRIVAKVDELMALCERLEAQQADADSA 260 Query: 403 LKERRSSFI 411 + + + Sbjct: 261 HTQLVQALL 269 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 74/191 (38%), Gaps = 8/191 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY-LPKDGNSRQSDTSTV 78 +P +W+ V + +T + YI + V++ G P+ + + + Sbjct: 82 EVPTNWEWVRVAAVGHDWGQKTPDQA--FTYIDVGAVDNAAGTISTPQVLMAEDAPSRAR 139 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQ-GWLLSIDV 133 + G ++Y + PYL + D I ST F ++ P +P +L S Sbjct: 140 KVVRSGTVIYSTIRPYLLNVAVIDKAYEQEPIVSTAFAIIHPYLEMPARYFLCYLRSPVF 199 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + +E++ G + + +P+PPLAEQ I K+ + L ++ Sbjct: 200 VRYVESVQIGIAYPAINDGQFFSGLIPLPPLAEQHRIVAKVDELMALCERLEAQQADADS 259 Query: 194 LLKEKKQALVS 204 + QAL+ Sbjct: 260 AHTQLVQALLD 270 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 68/201 (33%), Gaps = 14/201 (6%) Query: 229 PDHWEVKPFFALVTELNR---KNTKLIESNILSL--SYGNIIQKLETRNMGLKPESY--- 280 P+ WE L++ + + +S + + GN +K R+ G + + Y Sbjct: 367 PEAWEWCRVSDLISIKHGYAFSSAYFCDSASPYVLTTPGNFHEKGGFRDRGSRTKYYRGP 426 Query: 281 -ETYQIVDPGEIVFRFI--DLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAW 335 + ++ G+++ + + + + S Y+ Sbjct: 427 VDKEFALEAGDLIVAMTEQAAGLLGSPAIVPNDGKVYLHNQRLGKIIFDSEIVFSRYIFH 486 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + L +G+ + + + +PP+ EQ I ++ D L Sbjct: 487 YFNTAYLRTCVADSSTGMKVKHTSPGKIGAVFFPIPPLAEQHRIAAKVDQLMDLCDELKT 546 Query: 395 KIEQSIVLLKERRSSFIAAAV 415 ++ Q+ L ++ S+ + A+ Sbjct: 547 RLIQARQLNEKLASTMVEHAL 567 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 63/204 (30%), Gaps = 17/204 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W+ + + G S V + G + K G + + Sbjct: 366 VPEAWEWCRVSDLISIKHGYAFSSAYFCDSAS-PYVLTTPGNFHEKGGFRDRGSRTKYYR 424 Query: 81 --------FAKGQILYGKL---GPYLRKAIIADFDGICSTQF-----LVLQPKDVLPELL 124 G ++ L I DG ++ + V + Sbjct: 425 GPVDKEFALEAGDLIVAMTEQAAGLLGSPAIVPNDGKVYLHNQRLGKIIFDSEIVFSRYI 484 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + + + G + H IG + PIPPLAEQ I K+ D L Sbjct: 485 FHYFNTAYLRTCVADSSTGMKVKHTSPGKIGAVFFPIPPLAEQHRIAAKVDQLMDLCDEL 544 Query: 185 ITERIRFIELLKEKKQALVSYIVT 208 T I+ +L ++ +V + + Sbjct: 545 KTRLIQARQLNEKLASTMVEHALD 568 >gi|320536227|ref|ZP_08036273.1| type I restriction modification DNA specificity domain protein [Treponema phagedenis F0421] gi|320146929|gb|EFW38499.1| type I restriction modification DNA specificity domain protein [Treponema phagedenis F0421] Length = 637 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 49/409 (11%), Positives = 118/409 (28%), Gaps = 32/409 (7%) Query: 27 VVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 I G +GK ++ V G D + I K Sbjct: 212 WDTIGNICTRQKGINITAGKMKELHKDGAPVRIFAGGSTFADIEIKDIGEEN--IIRKNS 269 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I+ G + +F + L + L +V + G Sbjct: 270 IIVKSRGNIDFEFYEKEFSHKNEMWSYSSKDDKELNIKFLYYYLKNNVKYFRDNAITG-K 328 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + N +P P + Q I + + + I +++ + Sbjct: 329 LPQISIGVTDNYKIPKPHIFVQNQIVKVLDKFQELLTNTTGLLPEEISKRQKQYEYYRER 388 Query: 206 IV----------------------TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 ++ + K G+E + Sbjct: 389 LLTFNSKSDNTHTHTHTHTHTLGNHFFDTLNEAAKIVGVELESKAEWKTLGEIGIFTNGF 448 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETR----NMGLKPESYETYQIVDPGEIVFRFIDLQ 299 K+ + + ++ YG+I K + + E+ + V G++V Sbjct: 449 GMPKSMFDVNGEVGAIHYGHIYTKYNQFVLKPIVKISKENALKLKQVTHGDLVIARTSEN 508 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQ-SL 357 + + +T + AV H + Y++++ + + G++ + Sbjct: 509 IEDVMKTIVYLGNDNAVTGGHAAVYSHNQNPKYMSYVFNGASYFINQKNKLARGVKVIEI 568 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 D+ ++ + +P I Q + ++++ A I+ + E + + I L +++ Sbjct: 569 STTDMNKIKIPLPSIFVQEHVVSILDKFDALINNISEGLPKEIELRQKQ 617 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 47/394 (11%), Positives = 111/394 (28%), Gaps = 34/394 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + K+ + KD P G + D IF Sbjct: 16 EWKELGEVVKILDSQRKPISKD----------KREAGNYPYYGANGILDYVNDYIFDGVF 65 Query: 86 ILYGKLGPYLRK-----AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +L G+ G + K A+ + VL + L + +T + Sbjct: 66 LLMGEDGSVINKDKSPVLHWAEGKIWVNNHAHVLAENKEIVLLRFVYFF---LTTTDVST 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 T + + + +I +PIP L Q I + + T + L + ++ ++ Sbjct: 123 IVRGTPPKINQQSLRSIQIPIPSLETQEKIVKILDQFTNYVTELQVKLRTELQARTKQYN 182 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN-TKLIESNILSL 259 + L+ + K S + + T N T + Sbjct: 183 YYRDML----LSEEYLNKLSEKIDLLEDKKEIVWDTIGNICTRQKGINITAGKMKELHKD 238 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 I + ++ + I+ I+ + + + + + Sbjct: 239 GAPVRIFAGGSTFADIEIKDIGEENIIRKNSIIVKSRGNIDFEFYEKEFSHKNEMW---S 295 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y + ++ +L + +++ ++ +G + + P I Q I Sbjct: 296 YSSKDDKELNIKFLYYYLKN-NVKYFRDNAITGKLPQISIGVTDNYKIPKPHIFVQNQIV 354 Query: 380 NVINVETARIDVL-------VEKIEQSIVLLKER 406 V++ + + K ++ +ER Sbjct: 355 KVLDKFQELLTNTTGLLPEEISKRQKQYEYYRER 388 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 15/195 (7%), Positives = 58/195 (29%), Gaps = 11/195 (5%) Query: 26 KVVPIKRFTKLNTG-RTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVS 79 + + G +S ++ I + + +++ K + + + Sbjct: 434 EWKTLGEIGIFTNGFGMPKSMFDVNGEVGAIHYGHIYTKYNQFVLKPIVKISKENALKLK 493 Query: 80 IFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 G ++ + + + + + + V + + + Sbjct: 494 QVTHGDLVIARTSENIEDVMKTIVYLGNDNAVTGGHAAVYSHNQNPKYMSYVFNGASYFI 553 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G + + I +P+P + Q + + I+ + + IEL Sbjct: 554 NQKNKLARGVKVIEISTTDMNKIKIPLPSIFVQEHVVSILDKFDALINNISEGLPKEIEL 613 Query: 195 LKEKKQALVSYIVTK 209 +++ + +++ Sbjct: 614 RQKQYEYYREHLLNF 628 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 15/158 (9%), Positives = 49/158 (31%), Gaps = 14/158 (8%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 GN + + + GE + N +S + + + Sbjct: 42 GNYPYYGANGILDYVNDYIFDGVFLLMGE----DGSVINKDKSPVLHWAEGKIWVNNHAH 97 Query: 322 A--VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++ + + + D+ + G + + ++ + + +P ++ Q I Sbjct: 98 VLAENKEIVLLRFVYFFLTTTDVSTIVR----GTPPKINQQSLRSIQIPIPSLETQEKIV 153 Query: 380 NVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAA 413 +++ T + L K+ + + R ++ Sbjct: 154 KILDQFTNYVTELQVKLRTELQARTKQYNYYRDMLLSE 191 >gi|171920127|ref|ZP_02931536.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 1 str. ATCC 27813] gi|171902492|gb|EDT48781.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 1 str. ATCC 27813] Length = 299 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 71/198 (35%), Gaps = 9/198 (4%) Query: 229 PDHWEVKPFFAL---VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 P++W + ++ + K++K S I + + K N + E E + Sbjct: 33 PNNWIWVKLNNISNVISGYSFKSSKYTSSGIRIIRISDFDSKEVDNNEPIFYEYNEKFNS 92 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + + ++ ++ Y+ +L+ + + + Sbjct: 93 YKIENNDIILAMTGGTVGKNIIIKKANDYYLNQRVARIRTFNVNYNYIYYLINTTYIQGL 152 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + ++ +D+ L + +PP+ EQ I + IN+ I ++IEQ + L+ Sbjct: 153 INDSKNSTNDNISLKDINNLLIPLPPLDEQQRIVDKINLLEFFIKQY-DEIEQKLSKLEN 211 Query: 406 -----RRSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 212 EFPEKLKKSVLQYAMQGK 229 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 82/266 (30%), Gaps = 6/266 (2%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP +W V + + + +G + +S K I I + D +S + Sbjct: 32 IPNNWIWVKLNNISNVISGYSFKSSKYTSSGIRIIRISDFDSKEVDNNEPIFYEYNEKFN 91 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQ 135 + I I+ G + K II V + + + ++ Q Sbjct: 92 SYKI-ENNDIILAMTGGTVGKNIIIKKANDYYLNQRVARIRTFNVNYNYIYYLINTTYIQ 150 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + +T + K I N+ +P+PPL EQ I +KI I + +L Sbjct: 151 GLINDSKNSTNDNISLKDINNLLIPLPPLDEQQRIVDKINLLEFFIKQYDEIEQKLSKLE 210 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 E + L ++ + + +D + + + + +K Sbjct: 211 NEFPEKLKKSVLQYAMQGKLIKQDPNDDSIKDLLKQIHKEKQKLYKEGKLKKKDLEESII 270 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYE 281 S + LK + Sbjct: 271 YKSDDKSYYEKIGNNEPKKLKNLPFN 296 >gi|320177259|gb|EFW52266.1| Type I restriction-modification system, specificity subunit S [Shigella dysenteriae CDC 74-1112] Length = 360 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 55/386 (14%), Positives = 118/386 (30%), Gaps = 41/386 (10%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + ++ G+ ++ + V +G+ G D ++ + I+ Sbjct: 6 VKLGDVINVHYGKALKAD--------QRVSNGSVHVFGSSGIVGNHD---KTLCSYPTII 54 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G I T + V +L +L I + ++ Sbjct: 55 IGRKGSVGAITWAPSGGWIIDTAYYVEI--KDNNKLDLRYLFYILSGIDLTKKTITTSIP 112 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + + +PP EQ I + + + I + I+ + A + Sbjct: 113 GLNRDDLYDTFIKLPPFEEQKRIVDLLD-KAEGIRQKREQSIKLADDFLRATFATM---- 167 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 NP K + +G + + K+ + E + I Sbjct: 168 --YGNPITNPKKWPVHLMGEIIEFK--------GGNQPPKSDFIFEPKQGYIRLVQIRDF 217 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + P+ I + +++ + G A M P Sbjct: 218 KSDKYATYIPQEKAKR-IFEVDDVMIARYGPP-----VFQILRGLSGSYNVALMKASPKE 271 Query: 328 IDSTYL-AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +L++ + V + + + E + + V +PPI Q +I + + Sbjct: 272 NIRKGFIFYLLQLPEYHDVVVKNSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRL-- 329 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSF 410 ARI+ EKIE S+ L+ + S Sbjct: 330 --ARIEKFKEKIEISLNHLEIQFLSL 353 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 57/185 (30%), Gaps = 4/185 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 PK W V + + G I + + + I Sbjct: 175 PKKWPVHLMGEIIEFKGGNQPPKSDFIFEPKQGYIRLVQIRDFKSDKYATYIPQEKAKRI 234 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 F ++ + GP + + + G + + PK+ + + +LL + ++ Sbjct: 235 FEVDDVMIARYGPPVFQI-LRGLSGSYNVALMKASPKENIRKGFIFYLLQLPEYHDVVVK 293 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + + + + +P+PP+ Q I +++ + + Sbjct: 294 NSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRLARIEKFKEKIEISLNHLEIQFLSL 353 Query: 199 KQALV 203 ++ L+ Sbjct: 354 QKRLM 358 >gi|90961893|ref|YP_535809.1| Type I restriction-modification system specificity subunit [Lactobacillus salivarius UCC118] gi|90821087|gb|ABD99726.1| Type I restriction-modification system specificity subunit [Lactobacillus salivarius UCC118] Length = 372 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 60/375 (16%), Positives = 117/375 (31%), Gaps = 46/375 (12%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + K+N+GR + + SG+ G + Sbjct: 23 WERKELNNILKINSGRDYKQ-----------LNSGSIPVYGTGGYMLSV---NDKLSDTD 68 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + G+ G + + T F ++ + + I+ + E+ Sbjct: 69 AVGIGRKGTIDKPLYLKAPFWTVDTLFYCTSKENSDVKFIYLLFQIINWKRYDES----T 124 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I NI +P + EQ + I +D + R E L K+AL+ Sbjct: 125 GVPSLSKNTISNIKTYVPKIKEQ----DYISKLFFSLDNTLQLHERKYEELTLIKKALLQ 180 Query: 205 YIVT--KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + G P+V+ K+ W + + + K K I + Sbjct: 181 KLFPKKDGFKPEVRYKNFNDAWEQRKLGEVVERFDNLRIPVTSSKREKGITPYYGANGIQ 240 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + +Q GE V D ND ++ V + + + Sbjct: 241 DYVQGYT-----------------HDGEFVLVAEDGANDLQNYPVHYVNGKVWVNNHAHV 283 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 ++ L +L+ + K+ + G R L + + +LP+ VP EQ + Sbjct: 284 LQGKNKMVDNL-FLVNAIKQIKIETYLVGGSRAKLNADVMMKLPIKVPTFNEQQRLGKY- 341 Query: 383 NVETARIDVLVEKIE 397 AR+D L+ + Sbjct: 342 ---FARLDSLITLHQ 353 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 15/104 (14%), Positives = 34/104 (32%), Gaps = 7/104 (6%) Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + + D ++ L + + + S SL + + VP Sbjct: 87 PFWTVDTLFYCTSKENSDVKFIYLLFQIINWKRYDE---STGVPSLSKNTISNIKTYVPK 143 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 IKEQ + I+ +D ++ E+ L + + + Sbjct: 144 IKEQ----DYISKLFFSLDNTLQLHERKYEELTLIKKALLQKLF 183 >gi|34540359|ref|NP_904838.1| hypothetical protein PG0545 [Porphyromonas gingivalis W83] gi|34396671|gb|AAQ65737.1| hypothetical protein PG_0545 [Porphyromonas gingivalis W83] Length = 701 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 52/397 (13%), Positives = 129/397 (32%), Gaps = 27/397 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS-DTSTVSIFAKG 84 KVV K +N G S+SG + ++ + +++ R D + + + Sbjct: 38 KVVR-KGIFNVNAGNFSDSG--VPFVRISNLKGMKINTTDIVCIPRAIHDDNHKTALVRN 94 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRIEAICE 142 I+ K + D V + L +L + ++++ Sbjct: 95 DIILSKTAIPAASIVSIDECNTSQDTVAVKLALNSKLNSPYLVTFLNTKYGMEQMKKRFS 154 Query: 143 GATMSHADWKGIGN-IPMPIPPLAEQVLIREKIIAETVRIDTLITER-----IRFIELLK 196 G H + N + +P+ Q+ ++E + I+ L Sbjct: 155 GNVQMHLNLDECRNELLVPVLSAEIQMQVKELFELSMQKSTEGISLYSSAESYLLACLGM 214 Query: 197 EKKQALVSYIVTKGL----------NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + A + K L + + + + V P + T + Sbjct: 215 QDFVANIDAYNVKTLKESFLESGRIDAEYYLPKYEDYINAVSAYTGGVAPLGEVCTIKDS 274 Query: 247 KNTKLIESNILSLSYGNIIQKLETR---NMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 T + + NI + + + +IV G+++ I+ Sbjct: 275 NYTPECDMKYRYIELANIGKSGDITGCLYENGEDLPTRARRIVTQGDVIVSSIEGSLSSC 334 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDV 362 +L + ++ + ++ + V+ + I+ L L +S + ++ SG + ++ Sbjct: 335 ALIT-DDYDQSLCSTGFYVVRSNQINPETLLTLFKSLPIQQLLKKACSGTILTGIGKQEF 393 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +++P+ + + Q +I + A E +E++ Sbjct: 394 EKIPIPLIRPEVQEEIAQHVQRSFALRKEASELLEKA 430 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 63/441 (14%), Positives = 137/441 (31%), Gaps = 70/441 (15%) Query: 27 VVPIKRFTKLN-TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 V P+ + + T E YI L ++ N T I +G Sbjct: 262 VAPLGEVCTIKDSNYTPECDMKYRYIELANIGKSGDITGCLYENGEDLPTRARRIVTQGD 321 Query: 86 ILYGKL-GPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ++ + G A+I D +CST F V++ + PE L S+ + Q ++ C Sbjct: 322 VIVSSIEGSLSSCALITDDYDQSLCSTGFYVVRSNQINPETLLTLFKSLPIQQLLKKACS 381 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT-----LITERIRFIELLKE 197 G ++ + IP+P+ Q I + + + + Sbjct: 382 GTILTGIGKQEFEKIPIPLIRPEVQEEIAQHVQRSFALRKEASELLEKAKLSVEYAIETG 441 Query: 198 KKQALV------SYIVTKGLNPDVKMKDSGI----------------------------- 222 +L+ + + L + +K+ GI Sbjct: 442 GGNSLIYSGLLNTLAKYERLAMWLLLKELGIVDESPNRQRVVTTEKRLSESFFTSGRLDA 501 Query: 223 -------EWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQ-KLETR 271 +++ K +V + E+ I + ++ + +ET Sbjct: 502 EYYQPKYDYLDAQFSSIPTKRLGDIVNIHKSIEPGSDAYQENGIPFVRVADLSKFGIETS 561 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP---HGI 328 ++ L +Y T I+ + + IITS + + Sbjct: 562 SICLDSSTYSTAPRPRKNTILLSKDGS----VGIAYKMEEDADIITSGAILHLSMKGKEL 617 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET- 386 YL ++ S + G + Q K ++ ++ + + P+ Q ++++++ Sbjct: 618 LPDYLTLVLNSPIVRMQAERDAGGSIIQHWKPSEISQVIIPMLPVYIQQKLSDLVSKSFA 677 Query: 387 ------ARIDVLVEKIEQSIV 401 A ++ +EQ+I Sbjct: 678 FRRESKALLERAKAMVEQAIE 698 >gi|223938811|ref|ZP_03630699.1| N-6 DNA methylase [bacterium Ellin514] gi|223892509|gb|EEF58982.1| N-6 DNA methylase [bacterium Ellin514] Length = 811 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 56/387 (14%), Positives = 104/387 (26%), Gaps = 52/387 (13%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 VV +K L G S + ++ G Y +S I G+ Sbjct: 448 VVRLKDVCSLTKGTHSST------------KTQRGPYPLIVTAKEPLSSSDYEI--DGEA 493 Query: 87 LYGKL-------GPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIE 138 + + L + A + VLQPKD +L+ ++ Sbjct: 494 VCVPMISSTGHGRATLSRIHFASGKFAVANLLAVLQPKDADVLITRFLYLVLDLQKDKVA 553 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + +GA + + +P+P LA Q I L E Sbjct: 554 ELMKGAANVSMKVEDLAEFQIPLPSLATQKEIV----------------------LEIEG 591 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q +++ L+ +W D + + + Sbjct: 592 YQKVINGA-RAVLDHYRPHITIHPDWPICCLDDVASLRSGTTPDTTRGDYYVGDVNFVKT 650 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 N I ++ + + G ++ + + V T Sbjct: 651 SEINNCIINSSVTHISREAVRDYGLTVFPKGTVLMAMYGQGKTRGQVAYLNVP--ACTTQ 708 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFD 377 A+ P+ L + G G L +K + +PP+ Q Sbjct: 709 NAAAITPNEC-VEPLYLYLYFLGQYDRLRKHGIDGHISHLNLTYLKTFEIPLPPLATQQA 767 Query: 378 ITNVINVETARI---DVLVEKIEQSIV 401 I + I E A + L+ + E+ I Sbjct: 768 IVSEIEAEQALVAANRELITRFEKKIQ 794 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 72/197 (36%), Gaps = 13/197 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W + + L +G T ++ + D+ ++ ++ + + Sbjct: 615 DWPICCLDDVASLRSGTTPDTTRGDYYVGDVNFVKTSEINNCIINSSVTHISREAVRDYG 674 Query: 78 VSIFAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 +++F KG +L G + + + + P + + E L +L + Sbjct: 675 LTVFPKGTVLMAMYGQGKTRGQVAYLNVPACTTQNAAAITPNECV-EPLYLYLYFLGQYD 733 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ +SH + + +P+PPLA Q I +I AE + I Sbjct: 734 RLRKHGIDGHISHLNLTYLKTFEIPLPPLATQQAIVSEIEAEQALVAA----NRELITRF 789 Query: 196 KEKKQALVSYIVTKGLN 212 ++K QA ++ I +G N Sbjct: 790 EKKIQATLARIWGEGDN 806 >gi|2581811|gb|AAC25973.1| specificity (S) subunit homolog [Mycoplasma pulmonis] Length = 369 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 44/365 (12%), Positives = 108/365 (29%), Gaps = 19/365 (5%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYINEKFSTTSNCFILKVDENIAKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + + +P L Q I + I + E +K +++ Sbjct: 116 RYGYLKNYNINDFEVNLPNLKIQSAIIKIIEPKEDLFFRHKNLVRIDSEENTKKDLSILI 175 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 I+ L + D I H+ F I + G I Sbjct: 176 KIIEP-LEKQINAFDELILSEQKSLQHYLNYFFGKFYQIEPSLFHDYKLEKIAKIRRGKI 234 Query: 265 IQKLETRNM---------GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I + + K Y + + I + I Sbjct: 235 INSFDLKENPGDYPVISSNTKNNGIFGYLNSYMYDGEYITISADGAYAGTVFLNNGKFSI 294 Query: 316 ITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 ++ + ++ + +L + ++ + ++ R S++ + + + +P ++ Sbjct: 295 TNVCFILLLNDKVNLLTKFLFYYLKKNENIIQKKSIVGSSRPSVREYTLSEIAIKIPSLE 354 Query: 374 EQFDI 378 Q I Sbjct: 355 IQSAI 359 >gi|240016162|ref|ZP_04722702.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae FA6140] Length = 411 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 49/395 (12%), Positives = 102/395 (25%), Gaps = 25/395 (6%) Query: 26 KVVPIKRF------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + P+ TK+ G+ E KD + + T + D D Sbjct: 20 EWKPLGEVLVRTKGTKITAGQMKEMHKDNAPLKIFAG-GKTFALVDFD------DVPDKD 72 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I + I+ G + D + + + + Sbjct: 73 IHREPSIIVKSRGII--EFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQENYFRN 130 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I M N +PIP L Q I + + T TL +E + Sbjct: 131 IGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEATLEAELALR 190 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + Y L+ D ++ + + K + + + + Sbjct: 191 KRQYRYYRDLLLDFDNQIGGGIADGYQCRLKNVVWKTLGEVAEYSKNRICSDKLNEHNYV 250 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 N++Q E + + S +I+ I K G + Sbjct: 251 GVDNLLQNREGKKLSGYVPSEGKMTEYIVNDILIGNIRPYLKKIWQADCTGGTNGDV--L 308 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDI 378 + V ++ YL ++ G + + + +PP+ EQ I Sbjct: 309 VIRVTDEKVNPKYLYQVLADDKFFAFNMKHAKGAKMPRGSKAAIMQYKIPIPPLPEQEKI 368 Query: 379 TNVINVETARIDVL-------VEKIEQSIVLLKER 406 ++ + + + +E+ Sbjct: 369 VAILGKFDTLTHSVSEGLPHEIALRRKQYEYYREQ 403 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 42/127 (33%), Gaps = 5/127 (3%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 68 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 125 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L +E ++ Sbjct: 126 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEATLEA 185 Query: 403 LKERRSS 409 R Sbjct: 186 ELALRKR 192 >gi|329118871|ref|ZP_08247567.1| type I site-specific deoxyribonuclease [Neisseria bacilliformis ATCC BAA-1200] gi|327465062|gb|EGF11351.1| type I site-specific deoxyribonuclease [Neisseria bacilliformis ATCC BAA-1200] Length = 484 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 64/467 (13%), Positives = 126/467 (26%), Gaps = 75/467 (16%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVES-GTGKYLP------------- 65 +P W+ + + K+ G+T + I D+ S TGK+ Sbjct: 11 KLPSGWQFIRLGDIAKI-NGKTLTKKSALTDIRYIDISSTSTGKFEEPTLIKIEDAPSRA 69 Query: 66 -----KDGNSRQSDTSTVSIF----AKGQILYGKLGPYLRKA----IIADFDGICSTQFL 112 + + + F G L G + A + + ++ Sbjct: 70 KRTLTNNDIIISTVRPNLKQFAFIEEAGSNLIASTGFCVISADSEKLAWYLYALITSDIF 129 Query: 113 VLQPKDVLPELLQGWLLSIDVTQ---------------------RIEAICEGATMSHADW 151 V I++ + T + Sbjct: 130 TAHLVAVADGAAYPAFNPIEIEDAVIALPPENYLDVIVDVTRAIHKKIHLNTQTNQTLEQ 189 Query: 152 KGIGNIPMPIPPLAEQVLIREKI-------IAETVRIDTLITERIRFIELLKEKKQALVS 204 + AET + L + L + A Sbjct: 190 TAQALYKSWFVDFEPTRAKAAVLAAGGSQEEAETAAMSALSGHPPAALAALARQNPARHQ 249 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE-----SNILSL 259 + T L I+ G VP WEVK + + K+ K E + +++L Sbjct: 250 QLAT--LAAAFPSALVSIDSYGEVPAGWEVKKVGDIAKVIKGKSYKSSELESSKTALVTL 307 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ------NDKRSLRSAQVMER 313 N + +Y+ Q V G+++ + D+ + S E Sbjct: 308 KSFNRGGGYRLDGLKEYTGTYKPEQEVFAGDLIIAYTDVTQAADVIGKPAMVMSDNRYEH 367 Query: 314 GIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPP 371 II+ V+P+ Y + M + + +G L + V + VP Sbjct: 368 LIISLDVGVVRPNNSVYKYFLYCMAMTVAFQAHTQSFCTGTTVLHLGKDAVPSFEIAVPN 427 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + A+I+ + + V L+ R + + + G+ Sbjct: 428 EFLLKKFAEISESIFAKINENI----KQSVRLQNVRDTLLPKLLNGE 470 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 59/201 (29%), Gaps = 16/201 (7%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIY----IGLEDVESGTGKYLPKDGNSRQSD 74 G +P W+V + K+ G++ +S + + L+ G G Y Sbjct: 269 GEVPAGWEVKKVGDIAKVIKGKSYKSSELESSKTALVTLKSFNRGGG-YRLDGLKEYTGT 327 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL-----------VLQPKDVLPEL 123 G ++ +I + S V V Sbjct: 328 YKPEQEVFAGDLIIAYTDVTQAADVIGKPAMVMSDNRYEHLIISLDVGVVRPNNSVYKYF 387 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L +++ ++ C G T+ H + + + +P E + +I+ Sbjct: 388 LYCMAMTVAFQAHTQSFCTGTTVLHLGKDAVPSFEIAVPNEFLLKKFAEISESIFAKINE 447 Query: 184 LITERIRFIELLKEKKQALVS 204 I + +R + L++ Sbjct: 448 NIKQSVRLQNVRDTLLPKLLN 468 >gi|237751855|ref|ZP_04582335.1| restriction modification system DNA specificity subunit [Helicobacter winghamensis ATCC BAA-430] gi|229376753|gb|EEO26844.1| restriction modification system DNA specificity subunit [Helicobacter winghamensis ATCC BAA-430] Length = 203 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 17/156 (10%), Positives = 57/156 (36%), Gaps = 8/156 (5%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL-----QNDKRSLRSAQ 309 I+ + + ++ G+I+ + + Sbjct: 42 PIIKIKNVANGDVNLNDVVFYPYSKQLEKFLIKYGDILVSLTGNHPQAQSQVVGQISKYK 101 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPV 367 + ++ + + +L +L+++ + + + SG + ++ +D++ L + Sbjct: 102 YKQFALLNQRVAKIVTKDAEQDFLYYLLKTNKIHNILASHSSGSANQANISSKDIENLTI 161 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 +PP+ Q I +++ +ID L+ + +++ L Sbjct: 162 PLPPLTIQQKIAEILSSFDDKID-LLHRQNKTLESL 196 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 64/190 (33%), Gaps = 19/190 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + W+ V + ++ G +S + + I +++V +G Sbjct: 8 EQWQEVRLGEVAEIVNGYAFKSKEFLNIQQRDSLPIIKIKNVANGDVNLNDVVFYPYSKQ 67 Query: 75 TSTVSIFAKGQILYGKLGPYLRK---------AIIADFDGICSTQFLVLQPKDVLPELLQ 125 + G IL G + + + + + + KD + L Sbjct: 68 LEK-FLIKYGDILVSLTGNHPQAQSQVVGQISKYKYKQFALLNQRVAKIVTKDAEQDFLY 126 Query: 126 GWLLSIDVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 L + + + + G+ ++ K I N+ +P+PPL Q I E + + +ID L Sbjct: 127 YLLKTNKIHNILASHSSGSANQANISSKDIENLTIPLPPLTIQQKIAEILSSFDDKIDLL 186 Query: 185 ITERIRFIEL 194 + L Sbjct: 187 HRQNKTLESL 196 >gi|209554488|ref|YP_002284453.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 10 str. ATCC 33699] gi|209541989|gb|ACI60218.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 10 str. ATCC 33699] Length = 372 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 55/393 (13%), Positives = 122/393 (31%), Gaps = 43/393 (10%) Query: 32 RFTKLNTGRTSESGKD----------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 +++ +GR ++ K+ I ++ ++++ + + + N + S V + Sbjct: 10 DISEIISGRGPKNVKNLQDFASQHGKINWLLVKNLINNSINNDFEKYNLDEEKHSLVKL- 68 Query: 82 AKGQILYGKLGPYLRKAIIADFDG-ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 K +++Y AI +D + F + P + + + I ++ Sbjct: 69 NKNELVYSMYATPGIVAINEFYDNLYINQSFCKIIPNENICLKKFLFYWLIKNKNYALSL 128 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G T S+ + I N + +PP+ EQ I I I I I L EK Sbjct: 129 SSGTTQSNLNINKIRNFVIYLPPIEEQNAIISIIEPLEKSI-KTINLLQTKIGLFIEKTF 187 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 ++ + + +KD GL I Sbjct: 188 NFINNNLANADLIEFSLKDLLNIKRGLP---------------------------ITEKD 220 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N + K Y + I + + + Sbjct: 221 LLNNPGNYPLISASSKNNGIFGYFNDYMYDGKNITISMNGNAGCIFYQIGKFSANSDVLV 280 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 ++ + + + + ++ R L +++ VL+P ++ Q + + Sbjct: 281 LSNSNKNLTNIDYIYYLLKTKEKEIQNLAIGTTRFRLGNSVIEKFKVLLPNMEIQKEFSK 340 Query: 381 VINVETARIDVLVEKIEQSIV--LLKERRSSFI 411 ++ + V KIE+++ LLK + I Sbjct: 341 IVEPLL-NLSTKVNKIEKNLNECLLKIVKKLII 372 >gi|57505322|ref|ZP_00371251.1| anti-codon nuclease masking agent (prrB) [Campylobacter upsaliensis RM3195] gi|57016458|gb|EAL53243.1| anti-codon nuclease masking agent (prrB) [Campylobacter upsaliensis RM3195] Length = 396 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 47/400 (11%), Positives = 118/400 (29%), Gaps = 37/400 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P + + + + +S +G + + + D Sbjct: 20 PNGVEFKELGELWE----KAPKSKMGANQAKNLSKNNGNICFTSGETHYFIDDY-----L 70 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G+ L+ L I + T + + + L + Sbjct: 71 VDGEFLF--LNDGGTADIKYNSGKAYYTDHIFAFTSQKICVKFLYYFLKDKQEAINKTCF 128 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +G + + I P+P+PPL Q I E + A T L E +E ++ + Sbjct: 129 QGTGLKNLQKNKIEKFPIPLPPLEIQYKIVEILDAFTELEAELEAELEAELETRLKQYEY 188 Query: 202 LVSYIVTKGLNPDVKMKDSGI---EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 +++++ + K + I + +G + A + K + I Sbjct: 189 YRNFLLSYDELENRTAKLNEILKFKTLGELGIRNAGTKITAHQMQALHK----ENAPIRI 244 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + G+ I ++ R+ + +++ + + + + + S Sbjct: 245 FAGGSTIADVDYRD-------------LPKKDVIDKPSIICKVRGYIGFEYYDKPFSHKS 291 Query: 319 AYMAVKPHGIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + + + + + S LK ++ + +PP+ Q Sbjct: 292 EFWSYTIEKNANQKFIYYFLVNQQEYFQQIAKANSVKIPQLKVKNTDNFQIPLPPLAVQN 351 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +I +++ + L I I K+ R ++ Sbjct: 352 EIVEILDKFDTLTNDLTNGIPAEIEARKKQYEYYRERLLS 391 >gi|299144867|ref|ZP_07037935.1| putative type I restriction enzyme S protein [Bacteroides sp. 3_1_23] gi|298515358|gb|EFI39239.1| putative type I restriction enzyme S protein [Bacteroides sp. 3_1_23] Length = 420 Score = 75.6 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 53/416 (12%), Positives = 120/416 (28%), Gaps = 29/416 (6%) Query: 31 KRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQS-----DTSTVSIFA 82 + G + + YI L K+ S+ + D + + Sbjct: 2 GEILDVTRGASLSGEYYATEGEYIRLTCGNFDYQNNCFKENKSKDNLYYVGDFKSEFLME 61 Query: 83 KGQIL-------YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 +G I+ G LG + ++ + + + + S V Q Sbjct: 62 EGDIITPLTEQAIGLLGSTAIIPESGKYIQSQDVAKIICKEDLLDKDFAFYLISSALVKQ 121 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ A + + H I + + IP L+EQ I + + + +I+ ++ Sbjct: 122 QLSAAAQQTKIRHTSPDKIKDCTVWIPELSEQKRIGKLLRSIDRKIELNRAINQNLEAMM 181 Query: 196 KEKKQALVSYI--VTKGLNPDVK---MKDSGIEWVGLVPDHWEVKPFFALVTELNR---K 247 K + +G P E +P W + K Sbjct: 182 KLLYDYWFVQFDFLNEGGKPYKASGGKMVWNEELKREIPQGWGNMSIGDYAPCKSGYAFK 241 Query: 248 NTKLIESNILSLSYGNIIQKLETR--NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + + GNI + + +T + ++V K ++ Sbjct: 242 SKDFGCKGLPVIKIGNIQENYTLDMADSQCIDLFNKTLFLAKRYDLVIAMTGATIGKFAI 301 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + P L + Y ++F + ++ E + + Sbjct: 302 SQRNYWVNQRVGRFDLGDSPLLRLGFLFNSLKQEYFREQIFQIACGCAQPNISGEQIDSI 361 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +L P + N N + L + I L ++R+ + + GQ+ + Sbjct: 362 LLLKPN----NTVLNQFNKICKSLLELQSENYLQIEELTKQRNELLPLLMNGQVSV 413 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 63/211 (29%), Gaps = 14/211 (6%) Query: 10 YKDSG--VQWIGA----IPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESG 59 YK SG + W IP+ W + I + +G +S K + I + +++ Sbjct: 202 YKASGGKMVWNEELKREIPQGWGNMSIGDYAPCKSGYAFKSKDFGCKGLPVIKIGNIQEN 261 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL---QP 116 D T+ + + ++ G + K I+ + + + Sbjct: 262 YT-LDMADSQCIDLFNKTLFLAKRYDLVIAMTGATIGKFAISQRNYWVNQRVGRFDLGDS 320 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + L L ++I I G + + I +I + P + + Sbjct: 321 PLLRLGFLFNSLKQEYFREQIFQIACGCAQPNISGEQIDSILLLKPNNTVLNQFNKICKS 380 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + E L++ V Sbjct: 381 LLELQSENYLQIEELTKQRNELLPLLMNGQV 411 >gi|256617144|ref|ZP_05473990.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200] gi|256596671|gb|EEU15847.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200] gi|295113428|emb|CBL32065.1| Restriction endonuclease S subunits [Enterococcus sp. 7L76] Length = 186 Score = 75.6 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 46/120 (38%), Gaps = 8/120 (6%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGL 353 I + + S I ++ + + + ++ SY + K G Sbjct: 71 TISARGEGTGTPSYVKAPVWITGNSMVINVEDFDINKKFLYAMLLSYSIKKYI---TGGA 127 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + L + + ++P+++P EQF I ++D + ++ + LLKE + F+ Sbjct: 128 QPQLTRDVLLKVPIIIPSYNEQFKIGTF----FKQLDDTIALQQRKLDLLKETKKGFLQK 183 Score = 41.7 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 55/196 (28%), Gaps = 26/196 (13%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ + K+ T + + Y N Sbjct: 8 KVPEIRFPGFTGDWEQCKLGDIAKMYQPPTISGSELL-----------DTGYPVFGANGY 56 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S + Q+ G + +V+ +D + + + Sbjct: 57 IGFYSKSNHLE-DQVTISARGEGTGTPSYVKAPVWITGNSMVINVEDFDINKKFLYAMLL 115 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 I+ G + +P+ IP EQ I ++D I + R Sbjct: 116 SY--SIKKYITGGAQPQLTRDVLLKVPIIIPSYNEQFKIGTF----FKQLDDTIALQQRK 169 Query: 192 IELLKEKKQALVSYIV 207 ++LLKE K+ + + Sbjct: 170 LDLLKETKKGFLQKMF 185 >gi|324994848|gb|EGC26761.1| hypothetical protein HMPREF9392_1664 [Streptococcus sanguinis SK678] Length = 387 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 63/413 (15%), Positives = 130/413 (31%), Gaps = 46/413 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +WK V + + N T G I +E +E T + + + F Sbjct: 2 NNWKKVKLSDIIEFNPRETLSKGAIAKKIAMEKLEPFTRDIPEFEY----LEYRGGTKFR 57 Query: 83 KGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G L ++ P L D G ST+F+V++ K+ + + + L I + Sbjct: 58 NGDTLMARITPSLENGKTSKVNLLDEDEVGFGSTEFIVVRAKENISDENFVYYLMIAPSI 117 Query: 136 R---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 R I+++ + + N + PPL EQ+ I + + A +I Sbjct: 118 REVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGKILKALDDKI----------- 166 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E + + ++V N S +G + + F + K I Sbjct: 167 ----ENNKKINHHLVAISKNYLKIFYSSNSIKLGDIFELKSGYAFKSKDWVDEGKPVIKI 222 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + ++ ++ K ++E V EIV K + Sbjct: 223 KDIDGITIDITNLNYVKNKSQLSKASNFE----VFGKEIVMALTGATTGKIGVIPKNF-- 276 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKR--LPVL 368 G + S + W + + + + + +L V L V Sbjct: 277 NGYVNQRVGLFYAKTELSYAVLWSILQQQNIITDLIKLSSGSAQANLSPFSVNSYDLNVT 336 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + E ++ + + L I L + R + + ++G++ + Sbjct: 337 FKDLIE-------LDKVLSPLYELFCFNLSEIQRLSKLRDTLLPKLLSGELSV 382 >gi|261837871|gb|ACX97637.1| type I restriction enzyme S protein [Helicobacter pylori 51] Length = 365 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 58/408 (14%), Positives = 114/408 (27%), Gaps = 56/408 (13%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ---S 73 W+ +K K+ G T + I +I +D+ + G Y+ K S Sbjct: 2 SEWQTFCLKDLGKIVGGATPSTNNPKNYGNKIAWITPKDLSTLQGCYIKKGNRSISRLGF 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K IL+ P IA + F + P + + L Sbjct: 62 KSCSCVLLPKHAILFSSRAPI-GYVAIAKKRLCTNQGFKSIIPNKKI-YFEFLYYLLKYH 119 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFI 192 I I G T +G + IPP EQ I + +I+ Sbjct: 120 KDNISNIGGGTTFKEVSGATLGLFKVKIPPTYYEQQKIARTLSILDQKIENNHKINELL- 178 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 H + + KN KL Sbjct: 179 --------------------------------------HTLAYKIYEYYFKYKPKNAKLE 200 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + I + +++ + + + P I+ N + + Sbjct: 201 QIIIENPKSSIMVKNAQKTQDKYLFFTSGDNILSYPQAIIDGRNCFLNTGGNADIKFYVG 260 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + ++ + + S YL L+ S + L+ +K+ P+ +P Sbjct: 261 KASYSTDTWCICANEF-SDYLYLLLSSIKTHINQSFFQGTSLKHLQKNLLKKYPIYMPSA 319 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 E +I L+ ++ L++ R + +T Q+ Sbjct: 320 HEIKKFNQIIMPLL----TLISINTRTSKKLEQIRDFLLPLLLTQQVK 363 >gi|320185255|gb|EFW60032.1| Type I restriction-modification system, specificity subunit S [Shigella flexneri CDC 796-83] Length = 360 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 55/386 (14%), Positives = 118/386 (30%), Gaps = 41/386 (10%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + ++ G+ ++ + V +G+ G D ++ + I+ Sbjct: 6 VKLGDVINVHYGKALKAD--------QRVSNGSVHVFGSSGIVGNHD---KTLCSYPTII 54 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G I T + V +L +L I + ++ Sbjct: 55 IGRKGSVGAITWAPSGGWIIDTAYYVEI--KDNNKLDLRYLFYILSGIDLTKKTITTSIP 112 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + + +PP EQ I + + + I + I+ + A + Sbjct: 113 GLNRDDLYDTFIKLPPFEEQKRIVDLLD-KAEGIRQKREQSIKLADDFLRATFATM---- 167 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 NP K + +G + + K+ + E + I Sbjct: 168 --YGNPITNPKKWPVHLMGEIIEFK--------GGNQPPKSDFIFEPKQGYIRLVQIRDF 217 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + P+ I + +++ + G A M P Sbjct: 218 KSDKYATYIPQEKAKR-IFEVDDVMIARYGPP-----VFQILRGLSGSYNVALMKASPKE 271 Query: 328 IDSTYL-AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +L++ + V + + + E + + V +PPI Q +I + + Sbjct: 272 NIRKGFIFYLLQLPEYHDVVVKNSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRL-- 329 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSF 410 ARI+ EKIE S+ L+ + S Sbjct: 330 --ARIEKFKEKIEISLNHLEIQFLSL 353 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 57/185 (30%), Gaps = 4/185 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 PK W V + + G I + + + I Sbjct: 175 PKKWPVHLMGEIIEFKGGNQPPKSDFIFEPKQGYIRLVQIRDFKSDKYATYIPQEKAKRI 234 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 F ++ + GP + + + G + + PK+ + + +LL + ++ Sbjct: 235 FEVDDVMIARYGPPVFQI-LRGLSGSYNVALMKASPKENIRKGFIFYLLQLPEYHDVVVK 293 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + + + + +P+PP+ Q I +++ + + Sbjct: 294 NSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRLARIEKFKEKIEISLNHLEIQFLSL 353 Query: 199 KQALV 203 ++ L+ Sbjct: 354 QKRLI 358 >gi|257090553|ref|ZP_05584914.1| conserved hypothetical protein [Enterococcus faecalis CH188] gi|307290748|ref|ZP_07570647.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0411] gi|312903691|ref|ZP_07762865.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0635] gi|256999365|gb|EEU85885.1| conserved hypothetical protein [Enterococcus faecalis CH188] gi|306498196|gb|EFM67714.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0411] gi|310632883|gb|EFQ16166.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0635] gi|315030977|gb|EFT42909.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX4000] gi|315578705|gb|EFU90896.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0630] Length = 198 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 46/120 (38%), Gaps = 8/120 (6%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGL 353 I + + S I ++ + + + ++ SY L K G Sbjct: 83 TISARGEGTGTPSYVKAPVWITGNSMVINVEDFDINKKFLYAMLLSYSLKKYI---TGGA 139 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + L + + ++P+++P EQF I ++D + ++ + LLKE + F+ Sbjct: 140 QPQLTRDVLLKVPIIIPSYNEQFKIGTF----FKQLDDTIALQQRKLDLLKETKKGFLQK 195 Score = 41.3 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 55/196 (28%), Gaps = 26/196 (13%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ + K+ T + + Y N Sbjct: 20 KVPEIRFPGFTGDWEQCKLGDIAKMYQPPTISGSELL-----------DTGYPVFGANGY 68 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S + Q+ G + +V+ +D + + + Sbjct: 69 IGFYSKSNHLE-DQVTISARGEGTGTPSYVKAPVWITGNSMVINVEDFDINKKFLYAMLL 127 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ G + +P+ IP EQ I ++D I + R Sbjct: 128 SY--SLKKYITGGAQPQLTRDVLLKVPIIIPSYNEQFKIGTF----FKQLDDTIALQQRK 181 Query: 192 IELLKEKKQALVSYIV 207 ++LLKE K+ + + Sbjct: 182 LDLLKETKKGFLQKMF 197 >gi|187731881|ref|YP_001882947.1| putative type I restriction-modification system specificity subunit [Shigella boydii CDC 3083-94] gi|187428873|gb|ACD08147.1| putative type I restriction-modification system specificity subunit [Shigella boydii CDC 3083-94] Length = 360 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 56/386 (14%), Positives = 115/386 (29%), Gaps = 41/386 (10%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + ++ G+ ++ + V +G+ G D ++ + I+ Sbjct: 6 VKLGDVINVHYGKALKAD--------QRVSNGSVHVFGSSGIVGNHD---KTLCSYPTII 54 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G I T + V +L +L I + ++ Sbjct: 55 IGRKGSVGAITWAPSGGWIIDTAYYVEI--KDNNKLDLRYLFYILSGIDLTKKTITTSIP 112 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + + +PP EQ I + + + I + I+ + A + Sbjct: 113 GLNRDDLYDTFIKLPPFEEQKRIVDLLD-KAEGIRQKREQSIKLADDFLRATFATM---- 167 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 NP K P H K+ + E + I Sbjct: 168 --YGNPITNPKKW--------PVHLMGDIIEFKGGNQPPKSDFIFEPKQGYIRLVQIRDF 217 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + P+ I + +++ + G A M P Sbjct: 218 KSDKYATYIPQEKAKR-IFEVDDVMIARYGPP-----VFQILRGLSGSYNVALMKASPKE 271 Query: 328 IDSTYL-AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +L++ + V + + + E + + V +PPI Q +I + + Sbjct: 272 NIRKGFIFYLLQLPEYHDVVVKNSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRL-- 329 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSF 410 ARI+ EKIE S+ L+ + S Sbjct: 330 --ARIEKFKEKIEISLNHLEIQFLSL 353 Score = 43.2 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 57/185 (30%), Gaps = 4/185 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 PK W V + + G I + + + I Sbjct: 175 PKKWPVHLMGDIIEFKGGNQPPKSDFIFEPKQGYIRLVQIRDFKSDKYATYIPQEKAKRI 234 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 F ++ + GP + + + G + + PK+ + + +LL + ++ Sbjct: 235 FEVDDVMIARYGPPVFQI-LRGLSGSYNVALMKASPKENIRKGFIFYLLQLPEYHDVVVK 293 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + + + + +P+PP+ Q I +++ + + Sbjct: 294 NSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRLARIEKFKEKIEISLNHLEIQFLSL 353 Query: 199 KQALV 203 ++ L+ Sbjct: 354 QKRLM 358 >gi|261367887|ref|ZP_05980770.1| type I restriction-modification enzyme, S subunit, EcoA family [Subdoligranulum variabile DSM 15176] gi|282570698|gb|EFB76233.1| type I restriction-modification enzyme, S subunit, EcoA family [Subdoligranulum variabile DSM 15176] Length = 380 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/354 (11%), Positives = 102/354 (28%), Gaps = 26/354 (7%) Query: 73 SDTSTVSIFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 D IF +L + G L+ A IA + ++Q + + Sbjct: 43 IDYVNDYIFDGTYLLIAEDGENLKSQKQNIAQIAKGKFWVNNHAHIVQTNERCD---LRY 99 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L + + + G+ + + + +P + EQ + A +I+ Sbjct: 100 LHYLINSMDLSGYITGSAQPKLSQANLNAVTLQLPIIDEQEKTVAILGALDDKIELNNKI 159 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + +K + N + I P Sbjct: 160 NDNLQKQVKAIYHVMFVDTPNAARNTCRADECFDISIGKTPPRKEPEWF----------- 208 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + +S+S + + + +V L + K ++ Sbjct: 209 SECSKDCVWVSISDMGASGLYIADSSEYLTQDAVQKFNIR---VVPDNTVLLSFKLTVGR 265 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + + + T+ +A S + ++ + +K +P Sbjct: 266 VAITDGEVTTNEAIAHFKTDKPEINEYLYCYLKAFNFETMGSTSSIATAVNSKIIKAMPF 325 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++P KE + A L+++ ++ L + R + + ++G+IDL Sbjct: 326 VIPDDKE----LEKFHAIAAPCFALIKENQRENKRLAKIRDNLLPKLMSGEIDL 375 >gi|307262545|ref|ZP_07544185.1| hypothetical protein appser12_20800 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306867757|gb|EFM99593.1| hypothetical protein appser12_20800 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 160 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 45/149 (30%), Gaps = 1/149 (0%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + I L G++ + T E V + I + Sbjct: 10 NRHEPKYYENGTIPWLKTGDLNDGIITEIPEYITELAIEKTSVKLNPVGSVLIAMYGATI 69 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 +E + + GI + YL + + S + GSG + ++ E + Sbjct: 70 GKLGILNIEATTNQACCACIPYTGIYNKYLFYYLMSQKTELQKRSEGSG-QPNISKEKIV 128 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVL 392 +PP+ EQ I I + + L Sbjct: 129 NYLFPLPPLNEQKCIVEKIETLFSTLQNL 157 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 50/139 (35%), Gaps = 1/139 (0%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 I ++ D+ G +P+ + ++V + G +L G + K I + + Sbjct: 19 NGTIPWLKTGDLNDGIITEIPEYITELAIEKTSVKLNPVGSVLIAMYGATIGKLGILNIE 78 Query: 105 GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 + P + + L T+ + EG+ + + I N P+PPL Sbjct: 79 ATTNQACCACIPYTGIYNKYLFYYLMSQKTELQKRS-EGSGQPNISKEKIVNYLFPLPPL 137 Query: 165 AEQVLIREKIIAETVRIDT 183 EQ I EKI + Sbjct: 138 NEQKCIVEKIETLFSTLQN 156 >gi|281422289|ref|ZP_06253288.1| putative type I restriction modification DNA specificity domain protein [Prevotella copri DSM 18205] gi|281403610|gb|EFB34290.1| putative type I restriction modification DNA specificity domain protein [Prevotella copri DSM 18205] Length = 1297 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 53/377 (14%), Positives = 112/377 (29%), Gaps = 52/377 (13%) Query: 47 DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG- 105 D YI + D+ D ++ I +G +L+ + G KA + Sbjct: 964 DYRYIRITDINEDG---TLNDDWKTVAEVEKQYILKEGDVLFARSGATAGKAFYYKNEYG 1020 Query: 106 -ICSTQFLVLQ---PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPI 161 +L+ V+P + L S + +E G + + + + +P+ Sbjct: 1021 KALYAGYLIRFRFDESKVIPLFVYNLLCSKEYNDWVEKTKGGTARQNINSQQYCSFEIPL 1080 Query: 162 PPLAEQVLIR---EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 PP+ Q I EK+ V + I L E Q+ + + L+ V Sbjct: 1081 PPMDIQKKIVEECEKVNNRMVELLQQIQYNEERKLHLFEDAQSKANRALR--LDSAVFNI 1138 Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 G + ++ + ++ N S Sbjct: 1139 SIGRRVLKKEVVDTGRFDIYSANVFESFGKSEHSVLNDFSQPS----------------- 1181 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 V ID + Q+ + + + S YL + ++ Sbjct: 1182 -------------VLWGIDGDWMVNFIGKDQLFCPTDHCGVIRVLNENEVLSRYLVYPLQ 1228 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + F E ++ L + VP I+ Q ++ + ++ID + K +Q Sbjct: 1229 KEGEKQRFSRANRA-----STERIRSLIIQVPSIEVQKEVVEKL----SKIDEEISKAKQ 1279 Query: 399 SIVLLKERRSSFIAAAV 415 + + + + + Sbjct: 1280 YVANASSAKQAILDKYL 1296 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 50/156 (32%), Gaps = 6/156 (3%) Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + + +I + + E I+ G+++F K + + Sbjct: 963 MDYRYIRITDINEDGTLNDDWKTVAEVEKQYILKEGDVLFARSGATAGKAFYYKNEYGKA 1022 Query: 314 GIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPP 371 + ++ L+ S + G RQ++ + + +PP Sbjct: 1023 LYAGYLIRFRFDESKVIPLFVYNLLCSKEYNDWVEKTKGGTARQNINSQQYCSFEIPLPP 1082 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + Q I E +++ + ++ Q I +ER+ Sbjct: 1083 MDIQKKIVE----ECEKVNNRMVELLQQIQYNEERK 1114 >gi|331681326|ref|ZP_08381963.1| putative type I restriction-modification system, S subunit [Escherichia coli H299] gi|331081547|gb|EGI52708.1| putative type I restriction-modification system, S subunit [Escherichia coli H299] Length = 465 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 51/435 (11%), Positives = 115/435 (26%), Gaps = 59/435 (13%) Query: 38 TGRTSESGKDI------IYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKGQILYGK 90 G+ D +++ ++V ++ N + G I+ Sbjct: 20 RGKNYPKHNDFMENGYCLFLSAKNVTKSGFQFQETLFINETKDRELRAGKLKYGDIVLTT 79 Query: 91 LGPYLRKAIIADF----DGICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRIEAICEGA 144 G A + ++ ++++ K P+ L L S + ++I + G+ Sbjct: 80 RGTVGNVAYYDNNNPYKHIRINSGMIIIRADNKLWNPKFLYFILKSELLKEQIINLISGS 139 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA--- 201 + + I +P+ + Q I I +++ I ++ + ++ Sbjct: 140 AVPQLPARDIRKFILPVINRSLQNKITNIISDINDKVNLNIEINQTLEKMSQTLFKSWFV 199 Query: 202 ----LVSYIVTKGLNPDVKMKDSGIE-----------------------------WVGLV 228 ++ + G NP + S E +G V Sbjct: 200 DFDPVIDNALDAG-NPIPEALQSRAELRQKVRNCADFKPLPAEIRSLFPSEFEETELGWV 258 Query: 229 PDHWEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 P W + K N + L + ++ K+ N K + Sbjct: 259 PKGWSFTALKNFGKIICGKTPTKSNKNYYGEDFLFIKIPDMHGKVFVTNSHDKLSKLGSE 318 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + L S + V YL + M Sbjct: 319 SQSNKIIPHGSICVSCIATVGLVSINAQDCHTNQQINSIVPNSPHYRNYLYFSMLEKYKI 378 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 A G ++ + L+P + + T + E + L Sbjct: 379 FHDLASGGSATLNMNTSVFSNIATLMPN----NLVLKQFHKITEPWFEAILLNEYKLTSL 434 Query: 404 KERRSSFIAAAVTGQ 418 R + + ++G+ Sbjct: 435 ASLRDTLLPKLISGE 449 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 66/179 (36%), Gaps = 9/179 (5%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFR 294 + N + L LS N+ + L + ++ + G+IV Sbjct: 19 DRGKNYPKHNDFMENGYCLFLSAKNVTKSGFQFQETLFINETKDRELRAGKLKYGDIVLT 78 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + I S + ++ + +L ++++S L + + SG Sbjct: 79 TRGTVGNVAYYDNNNPYKHIRINSGMIIIRADNKLWNPKFLYFILKSELLKEQIINLISG 138 Query: 353 -LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-KERRSS 409 L D+++ + V Q ITN+I+ ++++ +E I Q++ + + S Sbjct: 139 SAVPQLPARDIRKFILPVINRSLQNKITNIISDINDKVNLNIE-INQTLEKMSQTLFKS 196 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 57/194 (29%), Gaps = 8/194 (4%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 +G +PK W +K F K+ G+T G+D ++I + D+ D S+ Sbjct: 255 LGWVPKGWSFTALKNFGKIICGKTPTKSNKNYYGEDFLFIKIPDMHGKVFVTNSHDKLSK 314 Query: 72 QSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 S + I G I + + I D + Q + P + + Sbjct: 315 LGSESQSNKIIPHGSICVSCI-ATVGLVSINAQDCHTNQQINSIVPNSPHYRNYLYFSML 373 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + G+ + + NI +P + I + Sbjct: 374 EKYKIFHDLASGGSATLNMNTSVFSNIATLMPNNLVLKQFHKITEPWFEAILLNEYKLTS 433 Query: 191 FIELLKEKKQALVS 204 L L+S Sbjct: 434 LASLRDTLLPKLIS 447 >gi|328947970|ref|YP_004365307.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328448294|gb|AEB14010.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 212 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 70/217 (32%), Gaps = 19/217 (8%) Query: 214 DVKMKDSGIEWVGLVPDH---WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 K +E G P W+VK T N N E+ + L K Sbjct: 1 MNIFKSEFVEMFGENPVESGKWKVKKLGDCGTFKNGMNYSPSENGVDILCLNVSDFKDNY 60 Query: 271 RNMGLK-------PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG--IITSAYM 321 + K E + + +IVF + A + + + Sbjct: 61 KIQDCKTLSSISLNEEPSSEYYLQNDDIVFVRSNGNKKLVGRCVALYPNDCKVLFSGFCI 120 Query: 322 AVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +++ +L +++ + G+ ++ +L + + L + VPP+ Q Sbjct: 121 RFRKSTDNLNTDFLLHFLKTDLTREQLKGKGANIQ-NLNQQILANLHLPVPPLDLQNQFA 179 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + +ID ++Q I L+E S + + Sbjct: 180 AFV----QQIDKSKFVVKQQITDLQELLDSKMQEYFS 212 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 61/204 (29%), Gaps = 14/204 (6%) Query: 15 VQWIGAIPKH---WKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDG 68 V+ G P WKV + G + DI+ + + D + K Sbjct: 9 VEMFGENPVESGKWKVKKLGDCGTFKNGMNYSPSENGVDILCLNVSDFKDNYKIQDCKTL 68 Query: 69 NSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVL 120 +S + S+ I++ + + D + S + + Sbjct: 69 SSISLNEEPSSEYYLQNDDIVFVRSNGNKKLVGRCVALYPNDCKVLFSGFCIRFRKSTDN 128 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 R + +GA + + + + + N+ +P+PPL Q + Sbjct: 129 LNTDFLLHFLKTDLTREQLKGKGANIQNLNQQILANLHLPVPPLDLQNQFAAFVQQIDKS 188 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 + + ELL K Q S Sbjct: 189 KFVVKQQITDLQELLDSKMQEYFS 212 >gi|332704541|ref|ZP_08424629.1| restriction modification system protein with DNA specificity domain [Desulfovibrio africanus str. Walvis Bay] gi|332554690|gb|EGJ51734.1| restriction modification system protein with DNA specificity domain [Desulfovibrio africanus str. Walvis Bay] Length = 442 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 62/409 (15%), Positives = 131/409 (32%), Gaps = 33/409 (8%) Query: 44 SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST----VSIFAKGQILYGKLGPYLRKAI 99 I I ++ G+ L +D + +I +G +++ G + + Sbjct: 36 QSTGIPLIRGSNLSEAVGQRLVEDEYVFMPEEKAAEFPRAIAIRGDLVFTCWGTIGQVGL 95 Query: 100 IAD----FDGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIEAICEGATMSHADWKG 153 I + S + + L P L S + I+ + G+++ + Sbjct: 96 IDKRARFDRYLVSNKQMKLSPDPAKADSLFLYYLFSSPQIRATIKNLGIGSSVPGFNLGQ 155 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS-----YIVT 208 + +I P+PPL+EQ I + A +I+ L + A Sbjct: 156 LRSIRFPLPPLSEQSRISRVLGALDDKIEQNQQAVRALERLAQAIFCAWFVDFEPIKAKV 215 Query: 209 KGLNPDVKMKDSGIE---------WVGLVPDHWEVKPFFALVT-ELNRKNTKLIESNILS 258 G M + +G VP+ W+V L T + + + Sbjct: 216 AGATSFPSMPQPVFDALSIRLIDSKIGPVPEGWKVGTVSDLATLSKTQIKPQDYPDELFD 275 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + L +V G ++ ++ + + L +R I ++ Sbjct: 276 YFSIPAFDTGKRAFLELGKAIKSNKFVVVEGCVLLSKLNPRIPRIWLPPPPNGKRQITST 335 Query: 319 AYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKE 374 ++ P ID YL + + SG Q ++ D+ V+VPP Sbjct: 336 EFLVFVPCSSIDRHYLYCQFQQSSFRENLAQGASGTSSSHQRVRPNDLLGKAVIVPPKPI 395 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + + ++I+ + L E R + ++G++ +R Sbjct: 396 RMEFAHLIDPLFSFA----SACLLESTKLAEMRDYLLPKLLSGEVTMRD 440 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 64/202 (31%), Gaps = 14/202 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IG +P+ WKV + L+ + + Y + ++G +L + Sbjct: 241 IGPVPEGWKVGTVSDLATLSKTQIKPQDYPDELFDYFSIPAFDTGKRAFLELGKAIK--- 297 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELL-QGWLL 129 S + +G +L KL P + + + I ST+FLV P + Sbjct: 298 -SNKFVVVEGCVLLSKLNPRIPRIWLPPPPNGKRQITSTEFLVFVPCSSIDRHYLYCQFQ 356 Query: 130 SIDVTQRIEAICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + G + + + +PP ++ I + E Sbjct: 357 QSSFRENLAQGASGTSSSHQRVRPNDLLGKAVIVPPKPIRMEFAHLIDPLFSFASACLLE 416 Query: 188 RIRFIELLKEKKQALVSYIVTK 209 + E+ L+S VT Sbjct: 417 STKLAEMRDYLLPKLLSGEVTM 438 >gi|227326888|ref|ZP_03830912.1| putative restriction modification system DNA specificity domain [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 522 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 42/398 (10%), Positives = 110/398 (27%), Gaps = 27/398 (6%) Query: 34 TKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAKGQILYGKLG 92 + G I Y+ +++ G + + + D S+ G ++ + G Sbjct: 62 FEFLRGIQFNHTSGIPYVRTQNLMDGYIDFSDGIYVDLKCKDMVAKSLCETGDLIVCRKG 121 Query: 93 PYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGATMSHA 149 + + S + +L S R G Sbjct: 122 KVGAASAVSADIHGAAISENVTRFRLDKSYDADFLATFLNSNHGRMRFLREATGVIQKWI 181 Query: 150 DWKGIGNIP---------MPIPPLAEQVLI----REKIIAETVRIDTLITERIRFIELLK 196 + + + I I Q +++ ++ I + Sbjct: 182 NNEKLRQIRVIRIDSSAEKYIGGKVRQAEKLRAWAKRLEVRLALLENKIPISKHVVRE-A 240 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + +A +SY+ L+ + D + + + ++ Sbjct: 241 KHSKATLSYLTENRLDARYYANKHLDLYAQFTDDFESLGSICSKFKYGASIAANYVNTDG 300 Query: 257 LSLSYGN-----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 L GN I K + + E ++ +I+ + S Sbjct: 301 LPFIRGNALSPNRINKDDIVYLNRSLEDEGNNYCIEEDDILITRSGTVGVAAHVTSEYAK 360 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVP 370 + Y++W + S+ + F + +G Q ++ E++ + + Sbjct: 361 YWYGSFIIKCTLSNKLYLPAYVSWYLNSWVGQQQFRRLENGAVQLNINIEELSSIAIWKA 420 Query: 371 PIKEQFDITNVINVETA--RIDVLVEKIEQSI-VLLKE 405 + Q +I ++ + + + L+ +++ L E Sbjct: 421 SQEFQNEIQQLLFEQISAVNLYKLLANTAKALVEALIE 458 >gi|254372941|ref|ZP_04988430.1| conserved hypothetical protein [Francisella tularensis subsp. novicida GA99-3549] gi|151570668|gb|EDN36322.1| conserved hypothetical protein [Francisella novicida GA99-3549] Length = 445 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 43/383 (11%), Positives = 115/383 (30%), Gaps = 27/383 (7%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 + K+ +G+ G G Y+ ++ + + + K+ + Sbjct: 53 IKDSKEYKILGVR--TYGKGVYINREVYGSSLKMRVYQKAKENHLFWCKVDTKNGAFGVV 110 Query: 102 -----DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGI-- 154 + + F + + + LQ + S + + +++ G T Sbjct: 111 KKEQSNSIASSNMAFAEIDITKIDMDFLQLFFKSEEFQKYLDSFVVGTTNRKYIKFDELL 170 Query: 155 GNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL--VSYIVTKGLN 212 + +P+PP+ Q I + + + L + +++ A + + + + Sbjct: 171 HKVEIPLPPIEVQKQIVQAYEDKINLANQLEQRAEKLEAKIEKYLYAKLGIQQALEQKQD 230 Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT-------------KLIESNILSL 259 ++ E + + + I L Sbjct: 231 KKGLLRFVRFEQLQRWDTDFFKQKEGYSSKYETVSYEDLFVSLNNGIAARNYASDGIRYL 290 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 +I + Y+ +++ G ++ + + + Sbjct: 291 KVSDIKDNYINNDKPFYVNKYKESDLIEKGTLLITRKGTVGNSYYF--DKEGSFVASSEI 348 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDI 378 ++ I+ YL+ + S + K + +G SL +K + + +PP++ Q I Sbjct: 349 FIIKLNDKINGNYLSEINLSSFVKKQYREKSTGTIMPSLSQPKLKSILIPLPPLEIQNHI 408 Query: 379 TNVINVETARIDVLVEKIEQSIV 401 I I +L ++ EQ+ Sbjct: 409 AMRIQKLKDYIKILKQQAEQNRE 431 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 56/143 (39%), Gaps = 4/143 (2%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPH 326 R + YQ + + +D +N + + ++ A+ + Sbjct: 72 YINREVYGSSLKMRVYQKAKENHLFWCKVDTKNGAFGVVKKEQSNSIASSNMAFAEIDIT 131 Query: 327 GIDSTYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVK-RLPVLVPPIKEQFDITNVIN 383 ID +L +S + K + G+ R+ +KF+++ ++ + +PPI+ Q I Sbjct: 132 KIDMDFLQLFFKSEEFQKYLDSFVVGTTNRKYIKFDELLHKVEIPLPPIEVQKQIVQAYE 191 Query: 384 VETARIDVLVEKIEQSIVLLKER 406 + + L ++ E+ +++ Sbjct: 192 DKINLANQLEQRAEKLEAKIEKY 214 >gi|325270621|ref|ZP_08137219.1| type I restriction-modification system specificity determinant [Prevotella multiformis DSM 16608] gi|324987016|gb|EGC19001.1| type I restriction-modification system specificity determinant [Prevotella multiformis DSM 16608] Length = 401 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 54/402 (13%), Positives = 113/402 (28%), Gaps = 31/402 (7%) Query: 29 PIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + + +S Y+ + + K + ++ + KG +L Sbjct: 2 KLSQIAEYVEDKISSSQITLEEYVTTDSILQN--KQGKAVATNLPPTVCPLTHYLKGDVL 59 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQRIEAICEGATM 146 + PYL+K A+ +G S LV + K LL +G+ M Sbjct: 60 VANIRPYLKKVWYANINGGASADVLVFRAKQGNDSTFLYALLLQDSFFAYAMKGAKGSKM 119 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 D I +P L EQ I + II T ++ + K+ Sbjct: 120 PRGDKDQIMRYELPTFTLHEQKNIGKLIIDITNKLSLNRAVNHNLEAMAKQLYDYWFVQF 179 Query: 207 VTKGLNPDVKMKDSGIEWVG------LVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 N K SG + +P W+ + K Sbjct: 180 DFPDEN-GKPYKSSGGKMGWNEKLKREIPQGWKDCKIKDFMRIFTGKKDVSKAVP----- 233 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + PE+ + + + G V + R + + + Sbjct: 234 -------GNYKFFSCAPEAITSNEYIYDGYAVLVSGNGSYTGR-VGFYRGKFDLYQRTYA 285 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + ++ + +R + D+ E I Sbjct: 286 CVLDEEVRNVSFFYYTLRYLFQPIYSGGKHGSSIPYIVLGDLADFRFAF---NE--TICK 340 Query: 381 -VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ T D + +++ I L ++R + + GQ+ + Sbjct: 341 KFVDTVTPMFDEQLLRLQ-EIEKLTKQRDELLPLLMNGQVKV 381 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 46/160 (28%), Gaps = 20/160 (12%) Query: 10 YKDSG--VQWIG----AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + W IP+ WK IK F ++ TG+ +DV Sbjct: 189 YKSSGGKMGWNEKLKREIPQGWKDCKIKDFMRIFTGK-------------KDVSKAVPGN 235 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PE 122 + ++ TS I+ +L G Y + + + + Sbjct: 236 YKFFSCAPEAITSNEYIYDGYAVLVSGNGSYTGRVGFYRGKFDLYQRTYACVLDEEVRNV 295 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP 162 + L G+++ + + + Sbjct: 296 SFFYYTLRYLFQPIYSGGKHGSSIPYIVLGDLADFRFAFN 335 >gi|299142940|ref|ZP_07036066.1| type I restriction-modification enzyme, S subunit, EcoA family [Prevotella oris C735] gi|298575556|gb|EFI47436.1| type I restriction-modification enzyme, S subunit, EcoA family [Prevotella oris C735] Length = 384 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 58/393 (14%), Positives = 137/393 (34%), Gaps = 44/393 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTV 78 W+ + L+ G I ++ + + + + + D + + Sbjct: 9 EWQEKRLSDIADLSKGIGISKDQLSADGEPCILYGELYTKYKSETIKEVISKTNIDNTKL 68 Query: 79 SIFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ G + + D + +++ + L+ Sbjct: 69 VKSKANDVIIPCSGETAEEIATARCVLKDDILLGGDLNIIRLHG-YDGSFMSYQLNGKRK 127 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + +G ++ H + + NI P L EQ I + RI T + Sbjct: 128 YDIAKVAQGVSVVHLYGEHLKNIKTINPSLNEQKKIANLLSLLDERISTQNKIIDKL--- 184 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 Q+L+ I + L D M ++ E + + K + Sbjct: 185 -----QSLIKGISNRLLYADNSM----------------SIRIEEMLIERSERTKKNNQY 223 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 +LS + I + + + + ++ Y+I+ +IV +L ++ + G Sbjct: 224 EVLSSTVNGIFSQRDYFSKDIASDNNVGYKIIHLHDIVLSPQNLWM--GNINFNDKFDIG 281 Query: 315 IITSAYMAV-KPHGIDSTYLAWLMRS----YDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 I++ +Y G D Y+A L+++ Y+ V S +R++L +E ++L + Sbjct: 282 IVSPSYKVFSIADGFDKKYVAALLKTHHALYNYMLVSEQGASIVRRNLNYEAFEQLVFKI 341 Query: 370 PPIKEQFDITNVINVETARIDV---LVEKIEQS 399 P + +Q +I + I++ +R++ L++ Sbjct: 342 PSLNKQREIGHAISLLKSRLENANLLIKTYNSQ 374 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 15/155 (9%), Positives = 48/155 (30%), Gaps = 4/155 (2%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 Y + + +++ ++ + + + ++ Sbjct: 46 YTKYKSETIKEVISKTNIDNTKLVKSKANDVIIPCSGETAEEIATARCVLKDDILLGGDL 105 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 ++ HG D +++++ + + L E +K + + P + EQ I N Sbjct: 106 NIIRLHGYDGSFMSYQLNGKRKYDIAKVAQGVSVVHLYGEHLKNIKTINPSLNEQKKIAN 165 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + +D + + I L+ + Sbjct: 166 LL----SLLDERISTQNKIIDKLQSLIKGISNRLL 196 >gi|256826765|ref|YP_003150724.1| hypothetical protein Ccur_03150 [Cryptobacterium curtum DSM 15641] gi|256582908|gb|ACU94042.1| hypothetical protein Ccur_03150 [Cryptobacterium curtum DSM 15641] Length = 182 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/183 (20%), Positives = 68/183 (37%), Gaps = 12/183 (6%) Query: 230 DHWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 WE + ++ KN E+ S G + Q + ES Y +V+ Sbjct: 3 SPWEQRKLGDFASKKTSKNNSLAFSETFTNSAERGVVSQLDYFDHDVTNAESIGGYYVVE 62 Query: 288 PGEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAV-KPHGIDSTYLAWLMRSYDLCKV 345 P + V+ I + + ++ G+++ Y +D YL R+ K Sbjct: 63 PDDFVYNPRISVTAPVGPINRNRLGRTGVMSPLYTVFETDESVDKCYLEHFFRTRIWHKF 122 Query: 346 FYAMGSGL----RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + G+ R S+ E +P+ P +EQ I + + ID L+ ++ + Sbjct: 123 MFLEGNSGARSDRFSIGDETFFEMPIACPLFEEQRAIASYLES----IDSLITLHQRKLK 178 Query: 402 LLK 404 LLK Sbjct: 179 LLK 181 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 16/167 (9%), Positives = 37/167 (22%), Gaps = 11/167 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + F T + + + E Y D + Sbjct: 5 WEQRKLGDFASKKTSKNNSLAFSETFTNSAERGVVSQLDYFDHDVT-NAESIGGYYVVEP 63 Query: 84 GQILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQGWLL----SIDVT 134 +Y + + G+ S + V + + + + Sbjct: 64 DDFVYNPRISVTAPVGPINRNRLGRTGVMSPLYTVFETDESVDKCYLEHFFRTRIWHKFM 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + +P+ P EQ I + + I Sbjct: 124 FLEGNSGARSDRFSIGDETFFEMPIACPLFEEQRAIASYLESIDSLI 170 >gi|496156|gb|AAA65631.1| restriction modification enzyme subunit S1A [Mycoplasma pulmonis] gi|3335658|gb|AAC78314.1| restriction-modification enzyme MpuUI S subunit [Mycoplasma pulmonis] Length = 401 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 45/374 (12%), Positives = 115/374 (30%), Gaps = 25/374 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVNENIVKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL-- 202 + I + + +P L Q I + I + E +K ++ Sbjct: 116 AYGYLKNYNINDFEVNLPNLKIQSAIIKIIEPKEDLFFRHKNLVRIDSEENTKKDLSILI 175 Query: 203 -VSYIVTKGLNPDVKMKDSGIEWVGLV------------PDHWEVKPFFALVTELNRKNT 249 + + K +N ++ S + + P ++ + L+ K Sbjct: 176 KIIEPLEKQINAFDELILSEQKSLQHYLNYFLNKLASINPSIFKNYKLGEIAKILSGKTP 235 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + + + ++ I G I+F L + Sbjct: 236 STAKKELWKKEIPFFGPGDLDNMVPKRFITFNEKMIKRSGTILFSSAATIGKVGILDNLS 295 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + I + + + + +L +L++ F + ++K + + + + Sbjct: 296 WFNQQITS---IEANNNYVMDKFLFFLLKKISSKIKFENSSGTIFPTIKKKYFENFTLEI 352 Query: 370 PPIKEQFDITNVIN 383 P +K Q I +I Sbjct: 353 PNLKTQSAILGIIE 366 Score = 44.4 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 58/183 (31%), Gaps = 14/183 (7%) Query: 29 PIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + K+ +G+T + K+I + G D+++ +PK + Sbjct: 222 KLGEIAKILSGKTPSTAKKELWKKEIPFFGPGDLDN----MVPKRFITFNEKMIK----R 273 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G IL+ + I+ + + + + + +LL ++ Sbjct: 274 SGTILFSSAATIGKVGILDNLSWFNQQITSIEANNNYVMDKFLFFLLKKISSKIKFENSS 333 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G K N + IP L Q I I +I+ L ++ + + L Sbjct: 334 GTIFPTIKKKYFENFTLEIPNLKTQSAILGIIEPLHKKINLLKQKKKLLEKRFIYYQNHL 393 Query: 203 VSY 205 + Sbjct: 394 IKE 396 >gi|329903167|ref|ZP_08273389.1| Type I restriction-modification system, specificity subunit S [Oxalobacteraceae bacterium IMCC9480] gi|327548462|gb|EGF33134.1| Type I restriction-modification system, specificity subunit S [Oxalobacteraceae bacterium IMCC9480] Length = 441 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 55/405 (13%), Positives = 131/405 (32%), Gaps = 29/405 (7%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 I I ++ + +S + +I G +++ + G + +I+ Sbjct: 35 DYGIPVIRGANMGEKWVGGDFVYVSREKSIQLSQNIAKPGDLVFTQRGTLGQVSIVPKHK 94 Query: 105 GIC-----STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPM 159 C S L + P + L S + + I + H + + P+ Sbjct: 95 HDCYVVSQSQMKLTVDPLKADVDFLYYLFKSPEQLEYIRNAAIQTGVPHTNLGILKKTPI 154 Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI----------VTK 209 IP L Q + A RI L + + ++ V + Sbjct: 155 KIPALLVQQQAAFILSALDDRITLLRETNTTLEAIAQALFKSWFVDFDPVRAKQEGRVPE 214 Query: 210 GLNPDVK--MKDSGIE-WVGLVPDHWEVKPFFALVTELNRKNTKLIESN-ILSLSYGNII 265 G++ DS E +GL+P W L T + +L + N Sbjct: 215 GMDAATAALFPDSFEESELGLLPRGWSFGTLADLAELNPESWTTKVHPKTVLYIDLANTK 274 Query: 266 QKLETRNMGLKPESYETY--QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + +++ G+ + + ++ + ++ + + Sbjct: 275 NNEIDVTTEYVFDEAPSRARRVLRTGDSIIGTVRP-GNRSFAYIYRAARNLTGSTGFAVL 333 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +P I + ++ + + + A G +++ E V + + VP + I Sbjct: 334 RPKVIKNAEFIFIAATQNSSIDYLAHIADGGAYPAVRPEVVANIELTVPHEEV---IAAF 390 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + A + ++ + + +I L R + + ++GQ+ L E++ Sbjct: 391 -HDIVAPLSSMIGENQLTIQTLVTLRDTLLPRLISGQLRL-PEAE 433 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 22/168 (13%), Positives = 54/168 (32%), Gaps = 9/168 (5%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIVDPGEIVFRFIDLQNDKRS 304 + ++ I + N+ +K + + I PG++VF Sbjct: 30 SKDYVDYGIPVIRGANMGEKWVGGDFVYVSREKSIQLSQNIAKPGDLVFTQRGTLGQVSI 89 Query: 305 LRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDV 362 + + + S + V P D +L +L +S + + + Sbjct: 90 VPKHKHDCYVVSQSQMKLTVDPLKADVDFLYYLFKSPEQLEYIRNAAIQTGVPHTNLGIL 149 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 K+ P+ +P + Q I + +D + + ++ L+ + Sbjct: 150 KKTPIKIPALLVQQQ-AAFI---LSALDDRITLLRETNTTLEAIAQAL 193 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 60/193 (31%), Gaps = 7/193 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G +P+ W + +LN + K ++YI L + ++ + + Sbjct: 233 LGLLPRGWSFGTLADLAELNPESWTTKVHPKTVLYIDLANTKNNEIDVTTEYVFDEA-PS 291 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLP-ELLQGWLLSI 131 + G + G + P R ST F VL+PK + E + Sbjct: 292 RARRVLRTGDSIIGTVRPGNRSFAYIYRAARNLTGSTGFAVLRPKVIKNAEFIFIAATQN 351 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + I +G + + NI + +P + + + I Sbjct: 352 SSIDYLAHIADGGAYPAVRPEVVANIELTVPHEEVIAAFHDIVAPLSSMIGENQLTIQTL 411 Query: 192 IELLKEKKQALVS 204 + L L+S Sbjct: 412 VTLRDTLLPRLIS 424 >gi|315222640|ref|ZP_07864529.1| type I restriction modification DNA specificity domain protein [Streptococcus anginosus F0211] gi|315188326|gb|EFU22052.1| type I restriction modification DNA specificity domain protein [Streptococcus anginosus F0211] Length = 339 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 55/357 (15%), Positives = 110/357 (30%), Gaps = 33/357 (9%) Query: 34 TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP 93 + Y+GLE ++S + + I KG +L+GK Sbjct: 11 FNSTEKKKPVDEDKHTYLGLEHLDSDSIYITRYGADVAPKG--DKLIMKKGDVLFGKRRA 68 Query: 94 YLRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIEAICEGATMSHADW 151 Y +K IA FDGI S +VL+PK+ + + ++ S I G+ +W Sbjct: 69 YQKKVAIAPFDGIFSAHGMVLRPKEDVIDKDFFPMFIKSDYFLDAAIKISVGSLSPTINW 128 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 + + + +P L EQ + E + I + + + I E ++ + T Sbjct: 129 RDLKELKFELPSLEEQRKLAEVLW----AIYDMKDKYKKLILATDELVKSQFIEMFTD-- 182 Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + D +G PD + K + Sbjct: 183 VKKGILSDMATIIMGQSPDGKTYNDTGDGMAFYQGKTEF-----------------GDLY 225 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + +I +++ + E I A++P +T Sbjct: 226 IREATTWTTAPSRIAIANDVLMSVRAPVG-----STNIATEECCIGRGLAAIRPIEEKTT 280 Query: 332 YLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + + MG +++ + V +LP+ + I+ Q + Sbjct: 281 TMFIIYAMRVIEDTIANMGVGSTFKAINKDQVHKLPIPLANIELQNQFVELAEQSDK 337 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 58/184 (31%), Gaps = 15/184 (8%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 +K + + + G I+ G+++F Sbjct: 11 FNSTEKKKPVDEDKHTYLGLEHLDSDSIYITRYGADVAPKGDKLIMKKGDVLFGKRRAYQ 70 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKV-FYAMGSGLRQSL 357 K ++ GI ++ M ++P D + ++S L ++ Sbjct: 71 KKVAIAPFD----GIFSAHGMVLRPKEDVIDKDFFPMFIKSDYFLDAAIKISVGSLSPTI 126 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVI---NVETARIDVLV----EKIEQS-IVLLKERRSS 409 + D+K L +P ++EQ + V+ + L+ E ++ I + + + Sbjct: 127 NWRDLKELKFELPSLEEQRKLAEVLWAIYDMKDKYKKLILATDELVKSQFIEMFTDVKKG 186 Query: 410 FIAA 413 ++ Sbjct: 187 ILSD 190 >gi|241762570|ref|ZP_04760644.1| DNA polymerase beta domain protein region [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241372831|gb|EER62528.1| DNA polymerase beta domain protein region [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 527 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 52/424 (12%), Positives = 128/424 (30%), Gaps = 33/424 (7%) Query: 25 WKVVPIKRFT----KLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W V + +++ G + + + + DV G K K Q ++ Sbjct: 109 WPSVRLDSILVPTERISYGVVQPGKESLNGVPIVRVSDVRDGMIK-TEKPLKISQEVENS 167 Query: 78 V--SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSID 132 + G++L +G AI+ + + + + +Q L + Sbjct: 168 YLRTRLTGGELLLSIVGTVGETAIVPESLKGWNIARAIARIPVREDIGARWVQLALKTET 227 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + Q I + + + + +P+P P ++ I + + +ID Sbjct: 228 IKQLINSKLNTTVQPTLNLRDVFELPVPFPSKEKRSSILNILGSLDDKIDLNRRTNETLE 287 Query: 193 ELLKEKKQALV-----SYIVTKGLNPD--VKMKDSGIEWVGLV--PDHWEVKPFFALVTE 243 + + + + G P ++ + + + P+ W+ Sbjct: 288 AMARALFRDWFVDFGPTRAKMAGEAPYLAPELWELFPDRLDDEGKPEGWKNSQIGKQFDI 347 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 ++ N+ K + + Y P + + F L + + Sbjct: 348 TMGQSPPGYTYNLDGNGKPFYQGKADFGTIFPTRRMYCAA----PNRMAYTFDSLVSVRA 403 Query: 304 SLRSAQVM-ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + E I A++ Y +M+S + + S+ + Sbjct: 404 PVGEVNLSAEECCIGRGLAAIRHPQNLPYYTYLVMKSLRKIFFSFEDNGTVFGSINKKQF 463 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 ++L V ++ + NV I + E L + R + ++G+I +R Sbjct: 464 EKLGV----LE--SKVENVFEKRVDPIFKKIITNEAESYTLAQLRDLLLPKLMSGEISIR 517 Query: 423 GESQ 426 + Sbjct: 518 NAEK 521 >gi|194336314|ref|YP_002018108.1| restriction modification system DNA specificity domain [Pelodictyon phaeoclathratiforme BU-1] gi|194308791|gb|ACF43491.1| restriction modification system DNA specificity domain [Pelodictyon phaeoclathratiforme BU-1] Length = 412 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 50/416 (12%), Positives = 123/416 (29%), Gaps = 33/416 (7%) Query: 26 KVVPIKRF-TKLNTGRTSESGKDII------YIGLEDV-ESGTGKYLPKDGNSRQSDTST 77 K V ++ +K+ +G T G ++ ++ +++ + + N +Q++ Sbjct: 3 KFVKLRSITSKIGSGATPRGGNNVYSEQGVAFVRSQNILDMSFSEKGLVFINDQQAEKLK 62 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQGWLLSIDVT 134 IL G + ++ I + S +++ KD + L Sbjct: 63 GVTVENDDILLNITGDSIARSCIVPTTILPARVSQHVSIIRCKDRKSAPYVNYYLHYLKP 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ G T + + + N+ + +P +D I R Sbjct: 123 HLLQICRVGGTRNALTKEAVENLYINLPCDYNARAKV------LSALDAKIECNNRINAE 176 Query: 195 LKEKKQALVSYIVTKGLNPDV---KMKDSGIEWVG------LVPDHWEVKPFFALVTELN 245 L+ + L Y + PD K SG + V +P W T Sbjct: 177 LEAMAKTLYDYWFVQFNFPDHNGHPYKSSGGKMVYNPTLKRQIPAGWHYSTIGETFTTHL 236 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI---VDPGEIVFRFIDLQNDK 302 + + N + E + + P +++ + + + Sbjct: 237 GGTPSRDKDEYWTPCEVNWLSSAENPGTFVVDPDERISYLGLQNSPAKLLPQGTVILSIV 296 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 R LR++ + + + + + + ++ ++ + + + Sbjct: 297 RHLRASILGIEAATNQSVVGIVETSMFKHCFIYPYLVREIPRLMVLRTGAQQPHINKGVL 356 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + VP I A + + ++ Q L + R + + GQ Sbjct: 357 DESLLAVPDKST---IEAY-TRLAAPLFLQMKNYHQQNRELTQLRDWLLPILMNGQ 408 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 25/212 (11%), Positives = 59/212 (27%), Gaps = 24/212 (11%) Query: 10 YKDSGVQWIGA----------IPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIG 52 YK SG G IP W I + G T KD + ++ Sbjct: 202 YKSSG----GKMVYNPTLKRQIPAGWHYSTIGETFTTHLGGTPSRDKDEYWTPCEVNWLS 257 Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL 112 + + + S + +G ++ + +A I + + Sbjct: 258 SAENPGTFVVDPDERISYLGLQNSPAKLLPQGTVILSIVRHL--RASILGIEAATNQSV- 314 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 V + + + + + R+ + GA H + + + +P + Sbjct: 315 VGIVETSMFKHCFIYPYLVREIPRLMVLRTGAQQPHINKGVLDESLLAVPDKSTIEAYTR 374 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVS 204 +++ + +L L++ Sbjct: 375 LAAPLFLQMKNYHQQNRELTQLRDWLLPILMN 406 >gi|161870102|ref|YP_001599272.1| hypothetical protein NMCC_1141 [Neisseria meningitidis 053442] gi|161595655|gb|ABX73315.1| conserved hypothetical protein [Neisseria meningitidis 053442] Length = 385 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 61/378 (16%), Positives = 114/378 (30%), Gaps = 26/378 (6%) Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST 109 YI +++ + S V+ F KG IL + PYL+K A FDG CS Sbjct: 26 YISTDNILQNKQGI--ECAASLPIQGGKVTAFKKGDILLANIRPYLKKIWYAQFDGGCSA 83 Query: 110 QFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 L ++ + D +G M D I +P+ L Q Sbjct: 84 DVLAIRANAKTDSHFLFYALFRDDFFIHAMKGAKGTKMPRGDKTQIMEFKIPVFDLKTQQ 143 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWV 225 I + +D I + L+E + L Y + PD K SG + V Sbjct: 144 SIAAVL----SALDKKIALNKQINARLEEMAKTLYDYWFVQFDFPDANGKPYKSSGGDMV 199 Query: 226 GLVPDHWEVKPFFALV--TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 E+ + + K + + ++ + + + Sbjct: 200 FDETLKREIPKGWGSIELQSCLAKIPNTTKILNKDIKDFGKYPVVDQSQDFICGFTNDEK 259 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 I++P + F D R ++ + + + YL + + Sbjct: 260 SILNPQDAHIIFGD---HTRIVKLVNFQYARGADGTQVILSNNERMPNYLFYQI-----I 311 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 G + + +K +++P I+ N V V + L Sbjct: 312 NQIDLSSYGYARHF--KFLKEFKIILPSKD----ISQKYNEIANTFFVKVRNNLKQNHHL 365 Query: 404 KERRSSFIAAAVTGQIDL 421 + R + + GQ+ + Sbjct: 366 TQLRDFLLPMLMNGQVSV 383 >gi|82546468|ref|YP_410415.1| type I restriction-modification system specificity subunit [Shigella boydii Sb227] gi|81247879|gb|ABB68587.1| putative type I restriction-modification system specificity subunit [Shigella boydii Sb227] Length = 356 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 55/386 (14%), Positives = 118/386 (30%), Gaps = 41/386 (10%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + ++ G+ ++ + V +G+ G D ++ + I+ Sbjct: 2 VKLGDVINVHYGKALKAD--------QRVSNGSVHVFGSSGIVGNHD---KTLCSYPTII 50 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G I T + V +L +L I + ++ Sbjct: 51 IGRKGSVGAITWAPSGGWIIDTAYYVEI--KDNNKLDLRYLFYILSGIDLTKKTITTSIP 108 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + + +PP EQ I + + + I + I+ + A + Sbjct: 109 GLNRDDLYDTFIKLPPFEEQKRIVDLLD-KAEGIRQKREQSIKLADDFLRATFATM---- 163 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 NP K + +G + + K+ + E + I Sbjct: 164 --YGNPITNPKKWPVHLMGEIIEFK--------GGNQPPKSDFIFEPKQGYIRLVQIRDF 213 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + P+ I + +++ + G A M P Sbjct: 214 KSDKYATYIPQEKAKR-IFEVDDVMIARYGPP-----VFQILRGLSGSYNVALMKASPKE 267 Query: 328 IDSTYL-AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +L++ + V + + + E + + V +PPI Q +I + + Sbjct: 268 NIRKGFIFYLLQLPEYHDVVVKNSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRL-- 325 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSF 410 ARI+ EKIE S+ L+ + S Sbjct: 326 --ARIEKFKEKIEISLNHLEIQFLSL 349 Score = 43.2 bits (100), Expect = 0.077, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 57/185 (30%), Gaps = 4/185 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 PK W V + + G I + + + I Sbjct: 171 PKKWPVHLMGEIIEFKGGNQPPKSDFIFEPKQGYIRLVQIRDFKSDKYATYIPQEKAKRI 230 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 F ++ + GP + + + G + + PK+ + + +LL + ++ Sbjct: 231 FEVDDVMIARYGPPVFQI-LRGLSGSYNVALMKASPKENIRKGFIFYLLQLPEYHDVVVK 289 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + + + + +P+PP+ Q I +++ + + Sbjct: 290 NSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRLARIEKFKEKIEISLNHLEIQFLSL 349 Query: 199 KQALV 203 ++ L+ Sbjct: 350 QKRLI 354 >gi|149185163|ref|ZP_01863480.1| hypothetical protein ED21_18957 [Erythrobacter sp. SD-21] gi|148831274|gb|EDL49708.1| hypothetical protein ED21_18957 [Erythrobacter sp. SD-21] Length = 388 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 43/395 (10%), Positives = 106/395 (26%), Gaps = 39/395 (9%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + + + + + + + + G I Sbjct: 10 RKLSHYFTHSKRK---GRAGLPLMSVTMHDGLVRRDSLDRKTDSALKDEEHLLVEPGDIA 66 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID---VTQRIEAICEGA 144 Y + + +AD S + V++PK+ + D + Sbjct: 67 YNMMRMWQGALGLADEAANVSPAYGVMRPKNTVDPRFAKHWFKSDRGLYMLWAFSYGLTE 126 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 IP+ P +Q+ + + +I+ LI R + +AL+ Sbjct: 127 DRLRLYPAEFLEIPVSWPEFLDQIQTADALD----QIERLILLSHRLAGAKGRRYRALIQ 182 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + + + V ++K + + L Sbjct: 183 RLSSNHAGARTE--------------------LGDFVARSSQKASVDSAPTSIELDNVEG 222 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 E +I++ + +K A + Sbjct: 223 QSGR-LIGATPTKELQGARATFQTADILYCKLRPYLNK--FHYADRPGLASTEFWVLRAD 279 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + +L L+++++ +E V+ P+ +P EQ N++ Sbjct: 280 RDVCEQRFLFHLIQTHEFAAEANRPTGSRMPRADWEVVQGAPLPLPSKDEQ---ANLLLP 336 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAV--TG 417 A + +I + LL+ ++ S + + TG Sbjct: 337 LDAAHSDWLAEIRRG-ELLQIKKRSLMQRLLPDTG 370 >gi|119715343|ref|YP_922308.1| restriction modification system DNA specificity subunit [Nocardioides sp. JS614] gi|119536004|gb|ABL80621.1| restriction modification system DNA specificity domain [Nocardioides sp. JS614] Length = 225 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 67/180 (37%), Gaps = 5/180 (2%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR----NMGLKPESYETYQIVDPGEI 291 + ++S + S+ YG I T + ++PE + ++ PG++ Sbjct: 23 QLGEFIRGRRFTKADYVDSGLGSIHYGEIYTDYGTTASSVHRFVRPELKGSLRLARPGDL 82 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 V + A + + + + H +D T++++ ++ + + S Sbjct: 83 VIAATGENVQEVCKAVAWLGDEEVAIHDDCYIFRHQMDPTFVSYFFQTAHFHEQKARLAS 142 Query: 352 -GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + ++ R+ PP++ Q +I +V++ A L ++E + R + Sbjct: 143 ESKLARVSGANLARIVAPAPPLEVQREIVSVLDKFRALEAELKAELEARREQYRYYRDAL 202 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 57/191 (29%), Gaps = 11/191 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDG-NSRQSDTS 76 P+ ++P+ + + GR + I ++ + G R Sbjct: 13 PQGVPLMPLGQLGEFIRGRRFTKADYVDSGLGSIHYGEIYTDYGTTASSVHRFVRPELKG 72 Query: 77 TVSIFAKGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++ + G ++ G A + D + + + + P + + + Sbjct: 73 SLRLARPGDLVIAATGENVQEVCKAVAWLGDEEVAIHDDCYIFR-HQMDPTFVSYFFQTA 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++ + + ++ + I P PPL Q I + L E Sbjct: 132 HFHEQKARLASESKLARVSGANLARIVAPAPPLEVQREIVSVLDKFRALEAELKAELEAR 191 Query: 192 IELLKEKKQAL 202 E + + AL Sbjct: 192 REQYRYYRDAL 202 >gi|283834998|ref|ZP_06354739.1| type I restriction enzyme EcoAI specificity protein [Citrobacter youngae ATCC 29220] gi|291069285|gb|EFE07394.1| type I restriction enzyme EcoAI specificity protein [Citrobacter youngae ATCC 29220] Length = 571 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 57/491 (11%), Positives = 119/491 (24%), Gaps = 81/491 (16%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKL--NTGRTSESGKD----IIYIGLE 54 +K K P+ S + +P W+ V + ++ + + Y G Sbjct: 83 IKKTKPLPEI--SEEEKPFELPVGWEWVRLGEIVEVLDYMRKPISKDERTQGIYPYYGAS 140 Query: 55 DVESGTGKYL-PKDGNSRQSDTSTVSIFAKGQILYG-KLGPYLRKAIIADFDGICSTQFL 112 + Y+ D + K K ++ F I + +FL Sbjct: 141 GIVDHVSDYIFDDKLVLVGEDGAKWRKGDKTAFCISGKSWVNNHAHVLKVFKSIITNEFL 200 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ----- 167 V + Q I + L +Q Sbjct: 201 VNYLTISDLAHFITGTTVPKLNQAKLISIPVIISPIKTQININAKIEQLMSLCDQLEQHS 260 Query: 168 --------------------VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 +++ RI + KQ ++ V Sbjct: 261 LTSLDAHQQLVETLLTTLTGSQNADELAENWARISEHFDTLFTTEASIDALKQTILQLAV 320 Query: 208 TKGLNPDVKMKD-------------------------------SGIEWVGLVPDHWEVKP 236 L P + S E +P WE Sbjct: 321 MGKLVPQDPNDEPASELLKRIAQEKTQLVKDGKIKKQKPLPPISDEEKPFELPSGWEWCR 380 Query: 237 FFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE----TYQIVDPG 289 ++ LN K+ + + L NI + + + ++ Sbjct: 381 LGSIFNFLNGYAFKSEWFSPAGLRLLRNANIAHGVTNWKDVVYIPNEMRDDFENYVLSEN 440 Query: 290 EIVFR----FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 +IV I+ ++R + + + A + + +T+L ++SY Sbjct: 441 DIVISLDRPIINTGLKYATIRKSDLPCLLLQRVAKFKNYANTVSNTFLTTWLKSYFFINS 500 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV----EKIEQSIV 401 S + + ++ + EQ I + N + + L + + Sbjct: 501 IDPGRSNGVPHISTKQLEMTLFPLLSQSEQDRIISKANELISICEKLKYHIQTTQQTQLH 560 Query: 402 LLKERRSSFIA 412 L + I Sbjct: 561 LADALTDAAIN 571 >gi|86158750|ref|YP_465535.1| type I restriction-modification system specificity subunit [Anaeromyxobacter dehalogenans 2CP-C] gi|85775261|gb|ABC82098.1| type I restriction-modification system specificity subunit [Anaeromyxobacter dehalogenans 2CP-C] Length = 404 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 56/383 (14%), Positives = 113/383 (29%), Gaps = 23/383 (6%) Query: 46 KDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAK--GQILYGKLGPYLRKAIIAD 102 + +++G+ +V E G + G +++ R AII + Sbjct: 31 EGPVFLGISNVTEDGHLDLSSIRHIAEDDFPKWTRRVEPRAGDLVFTYEATLNRYAIIPN 90 Query: 103 -FDGICSTQFLVLQPK--DVLPELLQGWLLSIDVTQRIEAIC-EGATMSHADWKGIGNIP 158 F G + +++P V P L + + + + + GAT+ P Sbjct: 91 GFRGCLGRRMALIRPNLARVDPRFLHYYFFTPEWREVVRKNTLAGATVDRLPLTKFPEFP 150 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 + +P L+EQ I + A ID +AL P + Sbjct: 151 VRVPSLSEQRRIARVLAAYDGLIDNSKRRIGVLE----RMARALYREWFVLFRYPGAQTT 206 Query: 219 DSGIEWVGLVPDHWEVKP---FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 +G VP W ++ + K+ E + I R+ Sbjct: 207 SRMSTRIGRVPRDWVLRSPKEIAEVQYGFPFKSALFSEDSAAGTPVVRIRDIPVGRSETY 266 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E + + G+++ + R + +P G S Sbjct: 267 TTEPAASRYEIQNGDVLVGMDGDFHMCI-----WSSGRALQNQRVARFRPSGEWSALHLL 321 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 L + + + A+ L ++ + + PP + I + Sbjct: 322 LALTAPVQALNRAIIGTTVAHLGDSHIRGILLGEPPPP----VLARAKEVFEPIGREIAT 377 Query: 396 IEQSIVLLKERRSSFIAAAVTGQ 418 ++Q I L+ R + + GQ Sbjct: 378 LQQRIRNLRATRDLLLPRLMAGQ 400 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 15/107 (14%), Positives = 35/107 (32%), Gaps = 14/107 (13%) Query: 18 IGAIPKHWKVVPIKRFTKLNTG-------RTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 IG +P+ W + K ++ G + +S + + D+ G + Sbjct: 213 IGRVPRDWVLRSPKEIAEVQYGFPFKSALFSEDSAAGTPVVRIRDIPVG------RSETY 266 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 ++ G +L G G + I + + + + +P Sbjct: 267 TTEPAASRYEIQNGDVLVGMDGDF-HMCIWSSGRALQNQRVARFRPS 312 >gi|327390254|gb|EGE88595.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA04375] Length = 345 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 58/345 (16%), Positives = 113/345 (32%), Gaps = 58/345 (16%) Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 I ST F+VL L +LLS + R+ G + + + + +PPL+ Sbjct: 2 IASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLS 60 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGLNPDVKMKDS- 220 EQ I E I + ++D R +L KE ++++ Y + L +S Sbjct: 61 EQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESV 120 Query: 221 --------------------------------------GIEWVGLVPDHWEVKPFFALVT 242 E +P+ WE + + Sbjct: 121 EVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYEEVPCEIPESWEWVRLNDITS 180 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQIVDPGEIVFRF 295 + R + + + + ++ L SY+ +++ G++++ Sbjct: 181 YIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNS 240 Query: 296 IDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 L R + A + V I+ ++ + S + V Sbjct: 241 TGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKA 300 Query: 351 SGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 SG ++ L + +K + +PP+ EQ I + I A ID L+ Sbjct: 301 SGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDALI 345 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 43/109 (39%), Gaps = 5/109 (4%) Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIK 373 +I S V ++ TYL + + S + +G ++ + L + +PP+ Sbjct: 1 MIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLS 60 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 EQ I I ++D E + L KE + S + A+ G+ Sbjct: 61 EQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 109 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 165 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 224 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 225 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIY 284 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 285 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 344 Query: 185 I 185 I Sbjct: 345 I 345 >gi|194246615|ref|YP_002004254.1| Type I restriction-modification system methyltransferase subunit [Candidatus Phytoplasma mali] gi|193806972|emb|CAP18407.1| Type I restriction-modification system methyltransferase subunit [Candidatus Phytoplasma mali] Length = 925 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 53/396 (13%), Positives = 120/396 (30%), Gaps = 34/396 (8%) Query: 28 VPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKY--LPKDGNSRQSDTSTVS 79 + + + G + I + + D+ K + +T+ Sbjct: 540 IKLSDVVNIQKGNNPPKDEKAYIEGKIPFFKVSDIAKFHIKLNLSESVHKINPAYKTTLK 599 Query: 80 IFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 +F K +L G + +A+I+ + ST + ++L +L + Sbjct: 600 LFKKNSLLIPTTGESCKLNHRALISKDSYVAST---ITVLTCDENKILPLFLFYCLLFVD 656 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + D + NI +P+P + EQ I + +I I+ + Sbjct: 657 MGNFVKNDFYPGVDSQMFKNILIPLPTIKEQEKIIKNLIPYNKIIEQSKKIYANWRP--- 713 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + V K +++K+ G+ + + I+S Sbjct: 714 --------HFVIKKEWKSLRLKEISSIIQGVSIKKFISFEIDNTKKIDKENKVEFIKSGQ 765 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG-- 314 + ++K N LK + ++ +++ + R + Sbjct: 766 VRGLDKFNLKKRHYSNENLKIPENK---LLQNEDLILNKQGIGTAGRICFFKSSLFNNST 822 Query: 315 --IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVP 370 + I+ YL + M + M G + + ++ L + P Sbjct: 823 TINTCGYIIRANKQIINPRYLLYFMSGIIGFSELHNMAIGTTGQIQIPITKIENLIIKFP 882 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 ++EQ I +NVE I + I +K+ Sbjct: 883 SLEEQEKIIQSLNVEYELIKNQKKIIHILNQKIKQY 918 Score = 42.9 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 25/206 (12%), Positives = 59/206 (28%), Gaps = 24/206 (11%) Query: 21 IPKHWKVVPIKRFTKLNTG--------------RTSESGKDIIYIGLEDVES-GTGKYLP 65 I K WK + +K + + G + + + +I V Sbjct: 717 IKKEWKSLRLKEISSIIQGVSIKKFISFEIDNTKKIDKENKVEFIKSGQVRGLDKFNLKK 776 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGK--LGPYLRKAIIADFDGICST-----QFLVLQPKD 118 + ++ + ++ K +G R ST +++ K Sbjct: 777 RHYSNENLKIPENKLLQNEDLILNKQGIGTAGRICFFKSSLFNNSTTINTCGYIIRANKQ 836 Query: 119 VLPELLQGWLL--SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 ++ + + I ++ I N+ + P L EQ I + + Sbjct: 837 IINPRYLLYFMSGIIGFSELHNMAIGTTGQIQIPITKIENLIIKFPSLEEQEKIIQSLNV 896 Query: 177 ETVRIDTLITERIRFIELLKEKKQAL 202 E I + +K+ +++ Sbjct: 897 EYELIKNQKKIIHILNQKIKQYCESI 922 >gi|17231117|ref|NP_487665.1| hypothetical protein alr3625 [Nostoc sp. PCC 7120] gi|17132758|dbj|BAB75324.1| alr3625 [Nostoc sp. PCC 7120] Length = 353 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 58/190 (30%), Gaps = 11/190 (5%) Query: 220 SGIEWVGLVPDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 + E +P+ W +T+ K E + LS N+ N Sbjct: 145 TQNEIEYTIPNTWCWARLANICEFITDGTHYTPKYTEHGRIFLSSQNVKPFSFMPNNHKF 204 Query: 277 PESYETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + +I+ + + ++ ++ ++ + ID Sbjct: 205 VSEEAYQGYIKNRKPEFEDILLTRVGAGIGEAAVIDQKLEFAIYVSLGLLRPFKEFIDPY 264 Query: 332 YLAWLMRSYDLCKVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL + S K G + +L ++ V VPP+ EQ I + Sbjct: 265 YLVIWLNSPIGTKHSQKNTYGKGVSQGNLNLGLIRGFVVSVPPLAEQKRIVEKCDRLMFL 324 Query: 389 IDVLVEKIEQ 398 D L K++Q Sbjct: 325 CDTLEAKLKQ 334 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 34/174 (19%), Positives = 56/174 (32%), Gaps = 12/174 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS--ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD---- 74 IP W + + T T + I L ++P + + Sbjct: 153 IPNTWCWARLANICEFITDGTHYTPKYTEHGRIFLSSQNVKPFSFMPNNHKFVSEEAYQG 212 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIAD----FDGICSTQFLVLQPKDVLPELLQGWLLS 130 IL ++G + +A + D F S L + + P L WL S Sbjct: 213 YIKNRKPEFEDILLTRVGAGIGEAAVIDQKLEFAIYVSLGLLRPFKEFIDPYYLVIWLNS 272 Query: 131 IDVTQRIEA--ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 T+ + +G + + + I + +PPLAEQ I EK D Sbjct: 273 PIGTKHSQKNTYGKGVSQGNLNLGLIRGFVVSVPPLAEQKRIVEKCDRLMFLCD 326 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 9/42 (21%), Positives = 19/42 (45%) Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + V + V +PP+ EQ I + + D + ++ +Q Sbjct: 2 ISGGKVYPIVVCLPPLTEQKRIVEKCDRLLSTCDEIEKRQQQ 43 >gi|15829147|ref|NP_326507.1| restriction modification enzyme subunit S1A [Mycoplasma pulmonis UAB CTIP] gi|14090091|emb|CAC13849.1| RESTRICTION MODIFICATION ENZYME SUBUNIT S1A [Mycoplasma pulmonis] Length = 368 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 44/359 (12%), Positives = 110/359 (30%), Gaps = 28/359 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVNENIVKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + + +P L Q I + I +I+ + Q + Sbjct: 116 AYGYLKNYNINDFEVNLPNLKIQSAIIKIIEPLEKQINAFDELILSE--------QKSLQ 167 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + + LN + P ++ + L+ K + + Sbjct: 168 HYLNYFLNKLASI----------NPSIFKNYKLGEIAKILSGKTPSTAKKELWKKEIPFF 217 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + ++ I G I+F L + + I + + Sbjct: 218 GPGDLDNMVPKRFITFNEKMIKRSGTILFSSAATIGKVGILDNLSWFNQQITS---IEAN 274 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + + +L +L++ F + ++K + + + +P +K Q I +I Sbjct: 275 NNYVMDKFLFFLLKKISSKIKFENSSGTIFPTIKKKYFENFTLEIPNLKTQSAILGIIE 333 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 58/183 (31%), Gaps = 14/183 (7%) Query: 29 PIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + K+ +G+T + K+I + G D+++ +PK + Sbjct: 189 KLGEIAKILSGKTPSTAKKELWKKEIPFFGPGDLDN----MVPKRFITFNEKMIK----R 240 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G IL+ + I+ + + + + + +LL ++ Sbjct: 241 SGTILFSSAATIGKVGILDNLSWFNQQITSIEANNNYVMDKFLFFLLKKISSKIKFENSS 300 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G K N + IP L Q I I +I+ L ++ + + L Sbjct: 301 GTIFPTIKKKYFENFTLEIPNLKTQSAILGIIEPLHKKINLLKQKKKLLEKRFIYYQNHL 360 Query: 203 VSY 205 + Sbjct: 361 IKE 363 Score = 41.3 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 35/142 (24%), Gaps = 3/142 (2%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K + + I + ++ ++ Sbjct: 31 YNLYSSKTKDQGIFGKINSYDFNGEYILITTHGAYAGTVKYVNEKFSTTSNCFILKVNEN 90 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 I T + LK ++ V +P +K Q I +I Sbjct: 91 IVKTKFLSYLLLLQEKTFNDMAIGSAYGYLKNYNINDFEVNLPNLKIQSAIIKIIEPLEK 150 Query: 388 RI---DVLVEKIEQSIVLLKER 406 +I D L+ ++S+ Sbjct: 151 QINAFDELILSEQKSLQHYLNY 172 >gi|332880948|ref|ZP_08448618.1| type I restriction modification DNA specificity domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332681122|gb|EGJ54049.1| type I restriction modification DNA specificity domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 977 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 58/399 (14%), Positives = 128/399 (32%), Gaps = 49/399 (12%) Query: 26 KVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTV 78 ++V + G + I + +++ SG + + + + + Sbjct: 600 EIVDFSDIATITRGVNYQRAQQTTYKTSNIILPADNITLSGELEVIKEIYIDQSIILAPE 659 Query: 79 SIFAKGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLPELLQGWL-LS 130 +G I + + LP+ L +L S Sbjct: 660 KQLRQGDIFICMSSGSKEHVGKVAFIDQDTKYYAGGFMGIIRTSTSRCLPQYLFFYLLKS 719 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + I+ + +GA +++ I +I +P+P + Q I +++ I + Sbjct: 720 LKYREEIKLLTQGANINNISS-TINSIKIPLPSVEVQQKIVDELDGYRKIIFGAQSIVSN 778 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + L + K + +K S I + F++ E Sbjct: 779 YEPHLPKFKTGNI-------------VKLSDICEINR----------FSVNPEREYGEES 815 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 +I S++ G + G K + ++ G+I+ + S Sbjct: 816 FTYIDISSVTSGTGKVDTSQKIKG-KDAPSRARRGMNKGDILMSTVRPNLKAFSYVDFDT 874 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLV 369 + ++ + + P ++ YL + + + AM + S+ D++ L ++ Sbjct: 875 KG-FVASTGFAVLTPKNVNGKYLLYALLDDFVGNQLSDAMSKAMYPSVNKSDLENLDIIC 933 Query: 370 PPIKEQFDITNVINVETARIDVLVE-------KIEQSIV 401 P I+EQ + I E + I E KIEQ I Sbjct: 934 PSIEEQNEAVIQIERELSFIKSSEEIVSIFTKKIEQKIN 972 >gi|322510788|gb|ADX06102.1| putative type I restriction modification DNA specificity domain protein [Organic Lake phycodnavirus 1] Length = 316 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 55/326 (16%), Positives = 115/326 (35%), Gaps = 40/326 (12%) Query: 79 SIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKD-VLPELLQGWLLSIDVTQ 135 KG IL G K I + + + + + K V+ + + W L D+ Sbjct: 2 FEIQKGNILIALSGATTGKIGIYNLEYKSYLNQRVGKITEKTGVIQKYIYYWYLCCDIES 61 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + +G + I NI +PIPPL +Q I + + + + E+I+ ++ L Sbjct: 62 TVLKMAQGTAQPNISTNNISNIKIPIPPLEKQEEIVKYLDFIYEKANKTSQEKIKELKTL 121 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 E LN ++ ++ +G V + + R + E Sbjct: 122 NEFC-----------LNTQKMFGENVVKTLGEV---------CSRIKGEKRNSKDGKEIG 161 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + L Y +I+ L E I+ + G Sbjct: 162 LYPLYYCSILGYLYLDTFDYTGEG-----------IIINKTNGSGKAMIYFGNDKYNVGK 210 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 T + + I +L+ + L + ++ ++S+ ED+ ++ + +PP++ Q Sbjct: 211 TTLHFKSKSNIIITKYIYYYLLHNIPLIEKYFK--GANQKSIVEEDLFKIKIPIPPLETQ 268 Query: 376 FDITNVINVETARIDVLVEKIEQSIV 401 +I D L++++E+ I Sbjct: 269 QEIVEY----CEYNDTLIKQLEKEIE 290 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 13/120 (10%), Positives = 40/120 (33%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + G I+ K + + + + K I W + V Sbjct: 4 IQKGNILIALSGATTGKIGIYNLEYKSYLNQRVGKITEKTGVIQKYIYYWYLCCDIESTV 63 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + ++ ++ + + +PP+++Q +I ++ + + ++ + + L E Sbjct: 64 LKMAQGTAQPNISTNNISNIKIPIPPLEKQEEIVKYLDFIYEKANKTSQEKIKELKTLNE 123 >gi|15828902|ref|NP_326262.1| restriction modification enzyme subunit S2A [Mycoplasma pulmonis UAB CTIP] gi|14089845|emb|CAC13604.1| RESTRICTION MODIFICATION ENZYME SUBUNIT S2A [Mycoplasma pulmonis] Length = 395 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 49/365 (13%), Positives = 105/365 (28%), Gaps = 15/365 (4%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + + K+ G + + K I+ + + K N+ + KG I Sbjct: 3 IYKLGEIAKIVGGNSKFTEKYIL-----NNQGIYSVISSKTSNNGIYGCINTFQYEKG-I 56 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 K G Y + ++ L+ + + + + I++I G+T Sbjct: 57 TISKDGVYAGTIFYQEKPFSITSHAFYLEITNKNVLEKYLFYFLKNKQEHIQSITYGSTR 116 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 + + IP L Q I + I + E +K +++ I Sbjct: 117 DSLTKTDFSDFVVSIPSLETQSAIIKIIEPKEDLFFRHKNLVRIDSEENTKKDLSILIKI 176 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN--- 263 + L + D I H+ L + +I + GN Sbjct: 177 IEP-LEKQINAFDELIFSEQKSLQHYLNYFLNKLASINPSIFKNYKLGDITKIVSGNPKF 235 Query: 264 ---IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG--IITS 318 I+K E + S+ + + S+ + I S Sbjct: 236 TKSYIEKNEGVYPVISSSSFNNGVYGYINTFDYEKGITISKDGSVGNIFYQSNCFSINAS 295 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 A + + + + + + + + D+ ++ V +P +K Q I Sbjct: 296 AMLIQPVENMILEKYLFYLLRSKEKNIKQVFSGSVIKHIYPRDIVKIKVDLPTLKTQSAI 355 Query: 379 TNVIN 383 +I Sbjct: 356 LGIIE 360 >gi|86151110|ref|ZP_01069326.1| type II restriction-modification enzyme [Campylobacter jejuni subsp. jejuni 260.94] gi|85842280|gb|EAQ59526.1| type II restriction-modification enzyme [Campylobacter jejuni subsp. jejuni 260.94] Length = 1279 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 46/404 (11%), Positives = 124/404 (30%), Gaps = 31/404 (7%) Query: 27 VVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVS 79 +V +K G T DI ++ + D + + S Sbjct: 890 LVKLKICGDFFMGGTPSRKNINYWNGDIKWLTISDYSNRQVIMDTKEKITREGFKNSNAK 949 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG ++ + + + I D + + + P + + + ++ Sbjct: 950 MIQKGAVVVS-IYATIGRVGILGEDMTTNQAIVAIIPNEEFINKYLMYAI-DYFKFQLYN 1007 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + + N+ +P PPL Q I + + +TL + L+K Sbjct: 1008 EVITTSQQNINLGILQNMVIPKPPLEIQKQIVAECEKIEEQYNTLSLSIKEYQNLIKAML 1067 Query: 200 QA--LVSYIVTKGLNP------DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 Q ++ LN ++ + E++ + + +L+ L Sbjct: 1068 QKCGIIEDNQEYELNSILDKINNLCKINLDSEFLSSFNKTIKEYALSNPIFKLSIGKRVL 1127 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + + E + Y + V ID + + Sbjct: 1128 NNELLENGQIPVYSANVLEVFGFVNKEILQDY----DNDSVLWGIDGDWMVGFIPKNKKF 1183 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 ++ Y+++++ + F + + +K L V +P Sbjct: 1184 YPTDHCGVLRVDDTKI-NAKYISFILNEAGKKQGFSR-----KLRASIDRIKALRVKLPS 1237 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ Q I ++ T +I+ + + + + L++ + + + Sbjct: 1238 LEFQDQIADI----TDKIEKKINEYKIELDRLEKEKEKILQKYL 1277 >gi|297380618|gb|ADI35505.1| type I R-M system specificity subunit [Helicobacter pylori v225d] Length = 204 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 64/200 (32%), Gaps = 12/200 (6%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI------IQKLETRNMGLKPESYET 282 P +W+ + + + IE+ + N+ +R + + Sbjct: 7 PLNWQRVRLGDIAEIKRGASPRPIENPKWFCANSNVGWVRISDISKNSRFLYKTAQKLSK 66 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 I + + + + I ++ + ID YL + + Y Sbjct: 67 KGIEKSRLVKQNSLIMSMCATIGKPIITKIDTCIHDGFVVFENPKIDLNYLYYFL-CYIE 125 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + + G + +L + +K V P + EQ I N+++ I L K Q Sbjct: 126 KEWLESGQQGSQVNLNVDLIKNKEVFYPKDLNEQIAIANILSALDNEITSLKNKKRQ--- 182 Query: 402 LLKERRSSFIAAAVTGQIDL 421 + + + ++ +I + Sbjct: 183 -FENIKKALNHDLMSAKIRV 201 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 61/194 (31%), Gaps = 13/194 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P +W+ V + ++ G + ++ ++ + D+ + Sbjct: 6 LPLNWQRVRLGDIAEIKRGASPRPIENPKWFCANSNVGWVRISDISKNSRFLYKTAQKLS 65 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + + + ++ + I I + PK L + Sbjct: 66 KKGIEKSRLVKQNSLIMSMCATIGKPIITKIDTCIHDGFVVFENPKIDLN---YLYYFLC 122 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIR 190 + + + + + + I N + P L EQ+ I + A I +L ++ + Sbjct: 123 YIEKEWLESGQQGSQVNLNVDLIKNKEVFYPKDLNEQIAIANILSALDNEITSLKNKKRQ 182 Query: 191 FIELLKEKKQALVS 204 F + K L+S Sbjct: 183 FENIKKALNHDLMS 196 >gi|317181779|dbj|BAJ59563.1| Type I restriction enzyme specificity subunit [Helicobacter pylori F57] Length = 390 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 60/395 (15%), Positives = 116/395 (29%), Gaps = 29/395 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ---S 73 W+ +K K+ G T + I +I +D+ + G+Y+ K S Sbjct: 2 SEWQTFCLKDLGKIVGGATPSTNNPKNYGNKIAWITPKDLSTLQGRYIKKGSRSISRLGF 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K IL+ P IA + F + P + + L Sbjct: 62 KSCSCVLLPKHAILFSSRAPI-GYVAIAKKRLCTNQGFKSIIPNKKI-YFEFLYYLLKYH 119 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I + G T +G + IPP + +KI +D I + E Sbjct: 120 KDNISNMGVGTTFKDISKPALGLFKVKIPPTYYEQ---QKIARTLSILDQKIENNHKINE 176 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LL + + L + D K + + ++ Sbjct: 177 LLHKILELLYEQYFVRFDFLDENNKPYQTSGGKMKFSKELNRLIPNDFEVKTLGELTQLK 236 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + ++ + K P ETYQ I+ + + Sbjct: 237 VGNKNANHSSNQGKYPFFTCSNNPLRCETYQFEGKHIIISGNGNFY-------VTHYDGK 289 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 AV P+ + L +L + + + + D++ + +++P +K Sbjct: 290 FDAYQRTYAVSPNNPNHYVLIYLFVKSYTNYLKLQSRGSIIKFITKSDIENIKIVLPNLK 349 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 NV+ ++E QS L R Sbjct: 350 TYAKWNNVL--------KMIENNNQSTQTLTALRD 376 >gi|160939417|ref|ZP_02086767.1| hypothetical protein CLOBOL_04310 [Clostridium bolteae ATCC BAA-613] gi|158437627|gb|EDP15389.1| hypothetical protein CLOBOL_04310 [Clostridium bolteae ATCC BAA-613] Length = 174 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 44/133 (33%), Gaps = 8/133 (6%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 G+ + + E+ I +M+++ + ++ + + + L Sbjct: 49 RCYAYKGDTLLVC---KGSGSGAVVRLTQEKAHIARQFMSLRANEKMTSDFCYYL-TGFL 104 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 +GL + + V V +PP+ EQ I +++D + E + Sbjct: 105 SDRIKRNATGLIEGIDRGTVLNQTVFLPPLHEQKKIARF----FSKLDFTITAHENMLDT 160 Query: 403 LKERRSSFIAAAV 415 L R+ + Sbjct: 161 LINERTGLMQRLF 173 >gi|223983260|ref|ZP_03633453.1| hypothetical protein HOLDEFILI_00733 [Holdemania filiformis DSM 12042] gi|223964753|gb|EEF69072.1| hypothetical protein HOLDEFILI_00733 [Holdemania filiformis DSM 12042] Length = 342 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 51/367 (13%), Positives = 107/367 (29%), Gaps = 46/367 (12%) Query: 29 PIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTVSI 80 + ++ +G T + DI +I ++ T + + + + Sbjct: 6 KLGDICEIVSGTTPNTSCSKYWNGDINWITPAELSDDTIIINESVRKITRQAVIDTGLKS 65 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F G ++ P K IA + C+ F L + + + + TQ + ++ Sbjct: 66 FPPGTVILSSRAPI-GKVAIAGREMYCNQGFKNLICSERINNI-YLYWFLKRNTQYLNSL 123 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 GAT + +I + +P + EQ+ E + + +I R R + L + + Sbjct: 124 GRGATFKEISKSIVSDIQISLPLIEEQIKRAENL----RKCWNVIILRKRELCKLDDLIK 179 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A V NP K+ ++ V V N + I+ Sbjct: 180 A---RFVEMFGNPITNNKNFVVKKVIEVVKLQRGHDLPIQNRIQNSTIPVWGSNGIVGYH 236 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G++ + +L S II Sbjct: 237 NEAKSNSGIITGRSGTL-----------GKVYYYAHPFWPLNTTLYSINTYNNNII---- 281 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 YL +L+ Y+L + +L + ++ P+ Q + + Sbjct: 282 -----------YLKYLLEFYELQRF---ASGTGVPTLNRNEFHNEMIIDVPLDLQNEFAD 327 Query: 381 VINVETA 387 + Sbjct: 328 FVKQVDK 334 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 11/161 (6%), Positives = 44/161 (27%), Gaps = 4/161 (2%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 +K +I ++ + N ++ + + + L + + A Sbjct: 24 SKYWNGDINWITPAELSDDTIIINESVRKITRQAVIDTGLKSFPPGTVILSSRAPIGKVA 83 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + + + + + + + + V + + Sbjct: 84 IAGREMYCNQGFKNLICSERINNIYLYWFLKRNTQYLNSLGRGATFKEISKSIVSDIQIS 143 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +P I+EQ + + ++ ++ + L + + Sbjct: 144 LPLIEEQIKRAENL----RKCWNVIILRKRELCKLDDLIKA 180 >gi|291559576|emb|CBL38376.1| Restriction endonuclease S subunits [butyrate-producing bacterium SSC/2] Length = 422 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 53/415 (12%), Positives = 116/415 (27%), Gaps = 32/415 (7%) Query: 30 IKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKGQ 85 + +++G +S G ++ + V + D G Sbjct: 9 LSELYDMSSGISSTKEQSGHGAPFVSFKTVFNNYFLPEELPDLMDTNEKEQETYSIKMGD 68 Query: 86 ILYGKLGPYLR-----KAIIADFDGICSTQFLVL----QPKDVLPELLQGWLLSIDVTQR 136 + + + + ++ G + F+ + V P+ + + S + Sbjct: 69 VFITRTSETIDELAMSCVAVKNYPGATYSGFIKRLRPKTARIVYPKYMAFYFRSELFRKA 128 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + + + + +P EQV I + + + +I E L+ Sbjct: 129 VTNNAFMTLRASFNKDIFTFLDIYLPDYHEQVKIGDMLYSIECKIQKNKKINDYLEEQLQ 188 Query: 197 EKKQALVSYIVTKGLN-----PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR----- 246 + + + ++P W+VKP + + N Sbjct: 189 LLYDYWFTQFNFPDDDGQPYKASNGLMVWNENINHIIPAGWQVKPMGTICSFRNGINYNK 248 Query: 247 ---KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 NT N+ ++S + + P V I+ + R Sbjct: 249 NVEGNTTYKIINVRNISSSTLFLDESNFDEICLPRQQGDKYCVSDESIIIARSGIPGATR 308 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 L + I + P+ L G + +++ E +K Sbjct: 309 ILCNPSS--NIIFCGFIICCTPYNNTLQNYLTLYLKQFEGSSATQTGGSILKNVSQETLK 366 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L V +PP + N N + I L+ + V L R + + GQ Sbjct: 367 NLLVPIPP----QSLLNQFNDSVSHIYNLIIGNIKENVQLTTLRDWLLPMLMNGQ 417 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 55/191 (28%), Gaps = 7/191 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSR--QSD 74 IP W+V P+ G + I + ++ S T + + Sbjct: 225 IPAGWQVKPMGTICSFRNGINYNKNVEGNTTYKIINVRNISSSTLFLDESNFDEICLPRQ 284 Query: 75 TSTVSIFAKGQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + I+ + G P + + I F++ L Sbjct: 285 QGDKYCVSDESIIIARSGIPGATRILCNPSSNIIFCGFIICCTPYNNTLQNYLTLYLKQF 344 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 G+ + + + + N+ +PIPP + + + I I E ++ Sbjct: 345 EGSSATQTGGSILKNVSQETLKNLLVPIPPQSLLNQFNDSVSHIYNLIIGNIKENVQLTT 404 Query: 194 LLKEKKQALVS 204 L L++ Sbjct: 405 LRDWLLPMLMN 415 >gi|317178236|dbj|BAJ56025.1| Type I R-M system specificity subunit [Helicobacter pylori F16] Length = 178 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 61/164 (37%), Gaps = 11/164 (6%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S+ K++ ++ +T I D I R L + I++ Sbjct: 23 SVEQITQQGKIKVYDVNNFIGYTDTTFISDKPYISIVKDGSVGRVRILPP----KTNILS 78 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + H + +L +L+ ++D S + + F+D K + +PP+ EQ Sbjct: 79 TMGALIANHRTTTEFLFYLLSNFDFKNF---TSSSIIPHIYFKDYKEKTIFLPPLNEQSA 135 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I N+++ I L K Q + + + ++ +I + Sbjct: 136 IANILSALDNEIISLKNKKRQ----FENIKKALNHDLMSAKIRV 175 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 65/184 (35%), Gaps = 15/184 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P +W+ V + T + +E + + GK D N+ T T I Sbjct: 2 LPLNWQRVRLGDIANYLTSN----------LSVEQI-TQQGKIKVYDVNNFIGYTDTTFI 50 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 K I K G R I+ I ST ++ E L L + D Sbjct: 51 SDKPYISIVKDGSVGRVRILPPKTNILSTMGALIANHRTTTEFLFYLLSNFDFK----NF 106 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + H +K + +PPL EQ I + A I +L ++ +F + K Sbjct: 107 TSSSIIPHIYFKDYKEKTIFLPPLNEQSAIANILSALDNEIISLKNKKRQFENIKKALNH 166 Query: 201 ALVS 204 L+S Sbjct: 167 DLMS 170 >gi|308183007|ref|YP_003927134.1| type I R-M system S protein [Helicobacter pylori PeCan4] gi|308065192|gb|ADO07084.1| type I R-M system S protein [Helicobacter pylori PeCan4] Length = 406 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 50/404 (12%), Positives = 110/404 (27%), Gaps = 35/404 (8%) Query: 22 PKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + + T + + + ++ Y + N Q+ Sbjct: 13 PKGVGFRKLGEVLEYDQPNKYCVTSKEFDKSYPTPVLTAG--KTFILGYTNEKDNIYQAS 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ I + + S+ +L PK+ + + Sbjct: 71 KSSPVIIF-DDF-------TTATQWVDFPFKVKSSAMKILLPKNPTINIRFIFFYM---Q 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + +PIPPL Q I + + A T L TE + Sbjct: 120 TIPYNIGGEHARHWISRYSQ--LEVPIPPLEIQQEIVKILDAFTELNTELNTELNTRKKQ 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + L+ N + E + P +K + KL E Sbjct: 178 YQYYQNMLLD------FNDINQSHKDAKEKLAQKPYPKRLKTLLQTLAPKGVGFRKLGEV 231 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI--DLQNDKRSLRSAQVME 312 + + + + + + I S + Sbjct: 232 CDFQKGKSITKKAVTFGKVPVISGGRQPAYYHNEANRSGETIAISSSGVYAGYVSYWDIP 291 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + S ++ K + YL + + + +G + +D++ + +PP+ Sbjct: 292 VFLADSFSVSPKQKTLMPKYLFHYLTTQQ-DAIHATKSTGGIPHVYSKDLQNFLIPIPPL 350 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + Q +I +++ + L+ I I K+ R + Sbjct: 351 EIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 394 >gi|309804934|ref|ZP_07698993.1| conserved domain protein [Lactobacillus iners LactinV 09V1-c] gi|308165747|gb|EFO67971.1| conserved domain protein [Lactobacillus iners LactinV 09V1-c] Length = 376 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 52/385 (13%), Positives = 109/385 (28%), Gaps = 47/385 (12%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + +L T S+ + + T + +P N + +D S + + ++ Sbjct: 7 KLGELIELVTETNSDLKYQENDVRGMTI---TKEIIPTKANVKNTDLSKFLVVHPNEFIF 63 Query: 89 GKL--------GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G + K LP+ L + + Sbjct: 64 NPRTHGKKIGFGYNNSNKAFLISWNNIAFSLSEYGRKLALPKYLFLHFNRSEWDRAACFS 123 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G++ W + ++ + +P LA Q A Sbjct: 124 SWGSSTEVFSWNALCDMDIDLPSLAIQQKYVNVYNAMVSN-------------------- 163 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 +GL D+ IE + P ++ ++ N E+ + Sbjct: 164 ---QKAYERGLEDLKLTCDAYIEDL------RRQIPCESIGPYIDSVNENNSENAYTHVQ 214 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 ++ Y +V G I + + S+ E +++ Y Sbjct: 215 GVESGGSFIDTRANMQGVDIGKYTVVRKGNIAYNPSRINI--GSIALYNSDEPCVVSPMY 272 Query: 321 MAVKPHGID---STYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 K D YL + + +Y +R + F ++ + +P I+ Q Sbjct: 273 SVFKVTDTDKVSPEYLMLWFNRTEFQRYTWYYAAGSVRDTFDFNLMQEVEFPIPSIETQK 332 Query: 377 DITNVINVETARIDVLVEKIEQSIV 401 DI N++ R + EK++ I Sbjct: 333 DIVNILTAYNKR-KSINEKLKAQIK 356 >gi|52548299|gb|AAU82148.1| conserved hypothetical protein [uncultured archaeon GZfos11A10] Length = 411 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 64/420 (15%), Positives = 127/420 (30%), Gaps = 38/420 (9%) Query: 24 HWKVVPIKRF-----TKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSR 71 W+ V + F + TG + K I + +V G + + Sbjct: 2 SWRKVQLAEFLDDGGIDIRTGPFGTQLKAADYTPKGTPVINVRNVGYGDLRPEKLEFVPD 61 Query: 72 QSDTS-TVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELL--Q 125 Q + I I++G+ G R +++ S + D + Sbjct: 62 QVVSRLPKHILETRDIVFGRKGAVDRHLFVSESETGWMQGSDCIRLRVLTDAIHPAFLSF 121 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L + ATM+ + IG IP+ +P A Q I + A I+ Sbjct: 122 ALRLPSHKQWMLTQCSNKATMASLNQDVIGRIPINLPDPATQDEIATILSAYDDLIENNR 181 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + + ++ G + VP+ WE K + + Sbjct: 182 RRIQLLEQAARLLYREWFVHLRFPG--------HEHVAITDGVPEGWEKKKIAEVCETVG 233 Query: 246 R-----KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 K ++ E +I + +I + + + + E ++V L Sbjct: 234 GGTPSTKVSEYWEGDITWIVPSDITKNDCLALLDSERKITEMGLRKSSAKMVPAETILMT 293 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKF 359 + S+ +M+ + T+ D + L + G + Sbjct: 294 SRASVGFFALMDFEVCTNQGFISIIPHEDELRMYLLFNLMSRVTEIRSNAKGTTYPEISK 353 Query: 360 EDVKRLPVLVPPI---KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + + V+VP E + I + R+ L ++E + LL R + AVT Sbjct: 354 GRFRGMDVVVPSKPLVSEFMRFASDIIQQVRRLKRLTLQLEAARNLLLPR---LMNGAVT 410 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 57/196 (29%), Gaps = 11/196 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGT---GKYLPKDGNSR 71 +P+ W+ I + G T + DI +I D+ + Sbjct: 216 VPEGWEKKKIAEVCETVGGGTPSTKVSEYWEGDITWIVPSDITKNDCLALLDSERKITEM 275 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ + IL + + DF+ + F+ + P + + + L Sbjct: 276 GLRKSSAKMVPAETILMTS-RASVGFFALMDFEVCTNQGFISIIPHEDELRMYLLFNLMS 334 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 VT+ I + +G T + + +P ++ L ++ Sbjct: 335 RVTE-IRSNAKGTTYPEISKGRFRGMDVVVPSKPLVSEFMRFASDIIQQVRRLKRLTLQL 393 Query: 192 IELLKEKKQALVSYIV 207 L++ V Sbjct: 394 EAARNLLLPRLMNGAV 409 >gi|30248401|ref|NP_840471.1| restriction modification system, type I [Nitrosomonas europaea ATCC 19718] gi|30138287|emb|CAD84295.1| Restriction modification system, type I [Nitrosomonas europaea ATCC 19718] Length = 446 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 56/436 (12%), Positives = 134/436 (30%), Gaps = 43/436 (9%) Query: 25 WKVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKD-GNSRQSDT 75 W ++ K+ G S I I + + + +D+ Sbjct: 5 WPYKRVEEIALKVAMGPFGSSIKVETFTDTGIPIISGQHLRDAELTDSEFNFITEEHADS 64 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADF----DGICSTQ--FLVLQPKDVLPELLQGWLL 129 + +G +++ G + A I + + S + +L +LPE + + Sbjct: 65 LKNANVQRGDVIFTHAGNIGQVAFIPNHSKYQRYVISQRQFYLRCDTSIILPEFVVYYFK 124 Query: 130 SIDVTQRIEAICEGATMSHA--DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S + ++ A + + I +P+P + EQ ++ I A V+I Sbjct: 125 SPEGQHKLLANANQVGVPSIARPSSYLKTIEVPVPSIEEQQVVVRNIKALDVKIRANRRI 184 Query: 188 RIRFIELLKEKKQALVSY---------IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 + + ++ + +G +P + L D + Sbjct: 185 NQTLEAMAQAVFKSWFVDFDPVKARIAAIEQGQDPLRAAMRAISGKTDLELDQMPREHHD 244 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 L + ES + ++ G ++++ ++ ++ + V+ + Sbjct: 245 QLAATAALFPDTMQESELGAIPKGWQVKRVGDLIELAYGKALKSTDRQEGAVPVYGSGGI 304 Query: 299 QNDK----RSLRSAQVMERGIITSAYMAVKPHGIDSTYLA---------WLMRSYDLCKV 345 + V +G + S Y P T + + + Sbjct: 305 TGCHNEALVPHGAIIVGRKGTVGSLYWEDDPFYPIDTTFYVKPKAVPMTYCFYAMQTLGL 364 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 L E+V RL ++ P + N + A+I ++ E + L E Sbjct: 365 NKMNTDAAVPGLNRENVYRLELVKPSTP----VLNAFDGLVAQIRKTMQANETTGQSLAE 420 Query: 406 RRSSFIAAAVTGQIDL 421 R + + ++G++ + Sbjct: 421 LRDTLLPKLLSGELSV 436 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 56/194 (28%), Gaps = 21/194 (10%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +GAIPK W+V + +L G+ + D + G +P G+ + Sbjct: 262 LGAIPKGWQVKRVGDLIELAYGKA---------LKSTDRQEGA---VPVYGSGGITGCHN 309 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G I+ G+ G D T F V + Sbjct: 310 EALVPHGAIIVGRKGTVGSLYWEDDPFYPIDTTFYVKPKAVPMTYCFYAMQTLGLNKMNT 369 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +A G + + + + +I + + L E Sbjct: 370 DAAVPGLNRENVYR---------LELVKPSTPVLNAFDGLVAQIRKTMQANETTGQSLAE 420 Query: 198 KKQALVSYIVTKGL 211 + L+ +++ L Sbjct: 421 LRDTLLPKLLSGEL 434 >gi|219851731|ref|YP_002466163.1| restriction modification system DNA specificity domain protein [Methanosphaerula palustris E1-9c] gi|219545990|gb|ACL16440.1| restriction modification system DNA specificity domain protein [Methanosphaerula palustris E1-9c] Length = 205 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 66/199 (33%), Gaps = 21/199 (10%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G P+ W F + + E + + + G+K E+ Sbjct: 26 IGCYPERWREGRFDEFILLQRGYDITKDEQHDGIVPV--------VSSSGIKSFHNESRA 77 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 PG ++ R L + ++ G + ++ +L+ + +L Sbjct: 78 N-GPGVVIGRKGTLGKVFYVDCPYWPHD-----TSLWVKDFKGNNPKFVYYLLTTLNLKS 131 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + +L V L + +P I++Q I ++ + ID + L+ Sbjct: 132 L---DTGTSNPTLNRNYVHALKIAMPNIEDQKIIVEIL----SSIDKKTATEQSRKEALE 184 Query: 405 ERRSSFIAAAVTGQIDLRG 423 +S + +T +I ++ Sbjct: 185 ILFASLLHDLMTVKIRVKN 203 >gi|218281983|ref|ZP_03488301.1| hypothetical protein EUBIFOR_00870 [Eubacterium biforme DSM 3989] gi|218217039|gb|EEC90577.1| hypothetical protein EUBIFOR_00870 [Eubacterium biforme DSM 3989] Length = 383 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 45/399 (11%), Positives = 113/399 (28%), Gaps = 49/399 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 + W I +L++G T + + I +I ++++ + Sbjct: 17 EDWCTSTIGENFRLSSGLTPSTKEKAYFNNGIIPWINSGELKNKYISFTENKLTLDAVKK 76 Query: 76 STVSIFAKGQILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++I+ ++ G KA I D S + + ++ Sbjct: 77 HNLTIYPMDTMVIAIYGLEAAGVRGKASITKMDSTISQSCMAFNSLGNVLTQFMYYVYKK 136 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +G + + + + P EQ+ I + +I+T + Sbjct: 137 EAQILGTRYAQGTKQQNLSSDLLSSYKLLYPSKEEQLKIVNFLSLIDEKIETQSKIINDY 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 L K ++ + S I +G + +KN Sbjct: 197 KLLKKYITKSFIKQ-------KGTSYLLSEIAELG-------RGRVISSAEISKQKNPIY 242 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + + + G + G + + Sbjct: 243 PVYSSQTSNNGVMGYLDNYDYEGE-----------------YITWTTDGANAGTVYYRNG 285 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + +G D+ Y++ ++ Y V + + L + + + +P Sbjct: 286 KFNCTNVCGILKIKNGYDAYYISNILNCYTKKYVSTNLAN---PKLMNNVMANIKINLPS 342 Query: 372 IKEQFDITNVINVETA--RIDVLV--EKIEQSIVLLKER 406 I+ Q +N++ +I+ + ++Q + LLK Sbjct: 343 IERQKYFSNILKAIEYRVKIEQDIKLNLVKQKVFLLKNM 381 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 56/179 (31%), Gaps = 8/179 (4%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ---IVDPGE 290 + +T ++ I ++ G + K + I Sbjct: 27 NFRLSSGLTPSTKEKAYFNNGIIPWINSGELKNKYISFTENKLTLDAVKKHNLTIYPMDT 86 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 +V L+ +++ I+ + MA G T + + + + Sbjct: 87 MVIAIYGLEAAGVRGKASITKMDSTISQSCMAFNSLGNVLTQFMYYVYKKEAQILGTRYA 146 Query: 351 SGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 G +Q+L + + +L P +EQ I N + + ID +E + I K + Sbjct: 147 QGTKQQNLSSDLLSSYKLLYPSKEEQLKIVNFL----SLIDEKIETQSKIINDYKLLKK 201 >gi|297379659|gb|ADI34546.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori v225d] Length = 363 Score = 74.4 bits (181), Expect = 4e-11, Method: Composition-based stats. Identities = 42/400 (10%), Positives = 105/400 (26%), Gaps = 52/400 (13%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKY-LPKDGNSRQSDT 75 W+ +K K+ TG+T ++ ++I D+ P+ + + Sbjct: 2 SEWQTFCLKDLGKIVTGKTPKTSNLDFFNGKYMFITPNDLHGTYRVIKTPRTLSDSGLKS 61 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + IL G +G + D + Q + + + + Sbjct: 62 IQNNTIDNTSILVGCIGDVGMVRMCFDKCA-TNQQINSITDIKDFCNPYYLYYYLSNKKE 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + I + I + +P + Q I + +I+ Sbjct: 121 LFKNIALSTVVPIIPKTTFQEIEVLLPNIETQQKIARTLSILDQKIENNHKINELL---- 176 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 H + + KN L + Sbjct: 177 -----------------------------------HTLAYKIYEYYFKYKPKNANLEQII 201 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I + +++ + + + P I+ N + + + Sbjct: 202 IENPKSSIMVKNAQKTQDKYPFFTSGDNILFYPKAIIDGRNCFLNTGGNAGIKFYVGKAS 261 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 ++ + + S YL L+ + + L+ +K+ P+ +P + E Sbjct: 262 YSTDTWCICANEF-SDYLYLLLSNIKTHINQSFFQGTSLKHLQKNLLKKYPIYMPSVHEI 320 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +I L+ ++ L++ R + + Sbjct: 321 KKFNQIIMPLL----TLISINTRTSKKLEQIRDFLLPLLL 356 >gi|308189188|ref|YP_003933319.1| type I restriction-modification system specificty subunit [Pantoea vagans C9-1] gi|308059698|gb|ADO11870.1| type I restriction-modification system specificty subunit [Pantoea vagans C9-1] Length = 378 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 57/409 (13%), Positives = 120/409 (29%), Gaps = 54/409 (13%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPK 66 +P+ W+ + + + + + D+ + + ++ K Sbjct: 6 KVPEIFFKRFGREWENLTLGDLGSVAMNKRIFKHQTTIAGDVPFFKIGTFGKQPDAFISK 65 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + G +L G R + +V D Sbjct: 66 --ALFNEYKAKYPYPVAGDLLLSASGSIGRVVEYKGEEHYYQDSNIVWLKHDGKINNSFL 123 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + V + EG+T+ K I + + P EQ+ I ++DTLI Sbjct: 124 KVFYSMVKW---SGLEGSTIQRLYNKNILDTEISTPERQEQIAIGNY----FQKLDTLIN 176 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLN--PDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + + + L K+AL+ + K P+++ K EW + L Sbjct: 177 QHQQKHDKLSSIKKALLEKMFPKEGETIPEIRFKGFSGEWKEVTLSSVIDVRSGKDYKHL 236 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + N + + S + + + + + + I+ Sbjct: 237 GKGNIPVYGTGGYMHSVDSALSNDKDAIGIGRKGTIDKPYILRA---------------- 280 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + + + AV + +L L + D K S SL + Sbjct: 281 -------PFWTVDTLFYAVPLTSFNLDFLFCLFQKIDWKKHDE---STGVPSLSKIAINN 330 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +PV EQ I N ++D L+ + +Q I L + + ++ Sbjct: 331 VPVYATNELEQTAIGNY----FQKLDALINQHQQQITKLNNIKQACLSK 375 >gi|156932819|ref|YP_001436735.1| hypothetical protein ESA_00615 [Cronobacter sakazakii ATCC BAA-894] gi|156531073|gb|ABU75899.1| hypothetical protein ESA_00615 [Cronobacter sakazakii ATCC BAA-894] Length = 483 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 62/479 (12%), Positives = 136/479 (28%), Gaps = 81/479 (16%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 WK V + F G+ + K+ Y+G +V G + + Sbjct: 3 SEWKQVRLGDFIDSCLGKMLDQKKNKGAFHPYLGNSNVRWGEFDFSNLAEMKFEDTEHER 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 KG ++ + G R AI D + ++ L + + Sbjct: 63 YALKKGDLVVCEGGEPGRCAIWEDEIPNMKIQKALHRIRTLPGLVTKYLYYWFLLAGKTG 122 Query: 137 I-EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 E G T+ H + + ++ + +PP+ + + + +I ++ Sbjct: 123 SLEPYFTGTTIKHLTGRSLADLTITLPPVKHKEKCALVLGSLDRKITHNKKINQTLEQMA 182 Query: 196 KEKKQALV------------------------------------SYIVTKGLNPDVK--M 217 + ++ + V + +P+ + Sbjct: 183 QALFKSWFVDFEPVKAKMTVLEAGGSQEDATLAAMSAISGKDADTLAVFEREHPEQYAEL 242 Query: 218 KDS--------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 K + +G +P+ WE +PF L++ + ES+ II+ + Sbjct: 243 KATAELFPSAMQESELGEIPEGWEFQPFGELLSHTIGGDWGKDESDDKHKMPVRIIRGTD 302 Query: 270 TRNMG----------LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI---- 315 N+ E + ++ G+IV + + RS V + Sbjct: 303 IPNIKSCQDSNVPFRYVEEKKLKTRSLNAGDIVIEVSGGSPTQPTGRSIYVTNEILKRLS 362 Query: 316 -----ITSAYMAVKPHGIDSTYLAWLMRSYD--LCKVFYAMGSGLRQSLKFEDV-KRLPV 367 + + L + Y S + + + + V Sbjct: 363 LPVEPASFCRLFRPKSKELGMVLGLYLERIYQDGKTWLYQNQSTGISNFQTKVFLENEMV 422 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 V P + I + T L+ + L + R + + ++G+I L Q Sbjct: 423 AVAPSE----ILKLFYKTTLPFVKLM--HSSENIKLTQLRDTLLPKLLSGEITLPEAEQ 475 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 12/99 (12%), Positives = 24/99 (24%), Gaps = 13/99 (13%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDV-ESGTGKYLPKDGN 69 +G IP+ W+ P G + + I D+ + + Sbjct: 258 LGEIPEGWEFQPFGELLSHTIGGDWGKDESDDKHKMPVRIIRGTDIPNIKSCQDSNVPFR 317 Query: 70 SRQSDTSTVSIFAKGQILY-----GKLGPYLRKAIIADF 103 + G I+ P R + + Sbjct: 318 YVEEKKLKTRSLNAGDIVIEVSGGSPTQPTGRSIYVTNE 356 >gi|110834690|ref|YP_693549.1| type I restriction-modification system, S subunit [Alcanivorax borkumensis SK2] gi|110647801|emb|CAL17277.1| type I restriction-modification system, S subunit [Alcanivorax borkumensis SK2] Length = 391 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 55/128 (42%), Gaps = 7/128 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + ++P +++ I + + + R I + ++ + +L S Sbjct: 52 SSKNCIEPRDVLLSKIVPHIRRCWVVPEKGGYRQIGSGEWIIFRDERFYPGFLKHYFTSE 111 Query: 341 DLCKVFYAMGSGLRQSL---KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + F +G+ SL + V+R+ + +PP++EQ I +++ + D + K + Sbjct: 112 LFHRQFMNTVAGVGGSLVRARPAGVERIEIPLPPLEEQKRIATILD----KADAIRRKRQ 167 Query: 398 QSIVLLKE 405 Q+I L +E Sbjct: 168 QAIQLAEE 175 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 56/378 (14%), Positives = 130/378 (34%), Gaps = 34/378 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W +VP + G + + + + + + L S+ + Sbjct: 2 SWPLVPASEIM-VKRGGSLNPAKFPDETFELLSIPAFDKNKPEIL-----KGAEIGSSKN 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 +L K+ P++R+ + G I S ++++ + + P L+ + S + Sbjct: 56 CIEPRDVLLSKIVPHIRRCWVVPEKGGYRQIGSGEWIIFRDERFYPGFLKHYFTSELFHR 115 Query: 136 RIEAICEGATMSHAD--WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G S G+ I +P+PPL EQ I + + +R + I+ Sbjct: 116 QFMNTVAGVGGSLVRARPAGVERIEIPLPPLEEQKRIATILDKADA----IRRKRQQAIQ 171 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 L +E +A+ + +P K ++ + + + +EL+ L Sbjct: 172 LAEEFLRAV---FLDMFGDPVTNPKGWKVKKIDDLCEVQGGLQVSKKRSELSISAPYLRV 228 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF--RFIDLQNDKRSLRSAQVM 311 +N+L N + E + + L Y+ + +++ + RS + Sbjct: 229 ANVL----RNRLYLGEIKEINLTQAEYD-RVRLKRDDVLIVEGHGNPNEIGRSALWTGEI 283 Query: 312 ERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPV 367 + + + + V+ I ++ + S + ++ VK + Sbjct: 284 DGMVHQNHLIRVRVKSKEIRPRFVNDYINSPGGRVQMMKASNTTSGLNTISTGIVKSTEI 343 Query: 368 LVPPIKEQFDITNVINVE 385 +VPPI Q +V++ Sbjct: 344 IVPPIYLQDKYMSVVSKF 361 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 61/204 (29%), Gaps = 18/204 (8%) Query: 22 PKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK WKV I ++ G + SE Y+ + +V + Sbjct: 192 PKGWKVKKIDDLCEVQGGLQVSKKRSELSISAPYLRVANVLRNRLYLGEIKEINLTQAEY 251 Query: 77 TVSIFAKGQILY----GKLGPYLRKAIIA-DFDGICSTQFLVL---QPKDVLPELLQGWL 128 + +L G R A+ + DG+ L+ + K++ P + ++ Sbjct: 252 DRVRLKRDDVLIVEGHGNPNEIGRSALWTGEIDGMVHQNHLIRVRVKSKEIRPRFVNDYI 311 Query: 129 LSIDVT-QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S Q ++A + ++ + + + +PP+ Q + + Sbjct: 312 NSPGGRVQMMKASNTTSGLNTISTGIVKSTEIIVPPIYLQDKYMSVVSKFEDVLVKSRMH 371 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 +L+ L Sbjct: 372 EGGID----SSLFSLIKKAFKGNL 391 >gi|145221398|ref|YP_001132076.1| restriction modification system DNA specificity subunit [Mycobacterium gilvum PYR-GCK] gi|145213884|gb|ABP43288.1| restriction modification system DNA specificity domain [Mycobacterium gilvum PYR-GCK] Length = 368 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 59/400 (14%), Positives = 119/400 (29%), Gaps = 41/400 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W VPI F + + + + Y N + ST + Sbjct: 3 WPEVPIDSFCR-----------PKQWPTISQSQLTPTGYPVYGANGQIGWYSTYNH-ESE 50 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +L G ++ T + + +L+ + +R+ G+ Sbjct: 51 TVLITCRGATCGTVNVSPPKSYV-TGNAMALDSLDEARIHLRYLVHVLTPERLRRSITGS 109 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + I +P+PPLA+Q I + R+ EL + ++ + Sbjct: 110 AQPQITRESLKAITVPLPPLADQRRIAAILDQADRLRSHRHGLLRRYSELKRAGFASMFA 169 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 I + G +G + R++ L + + Sbjct: 170 GISSSG-------------KLGDYGEVQGGLQVSRK-----RESLPLERPYLRVANIYRG 211 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM--ERGIITSAYMA 322 L E+ ++PG+++F ++ + + + + Sbjct: 212 KLDLGEVKTIRVTEAESMRVRLEPGDLLFVEGHANPNEVGRVAEWNGSVPDCLHQNHLIR 271 Query: 323 VKPHG--IDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDI 378 V+ ++ TY S D F G ++ ++ P+ VPPI Q + Sbjct: 272 VRLDRSAVEPTYAEAWFNSRDGSMHFQRAGKTTSGLNTINASQLRAAPLPVPPISLQREY 331 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 V N ID + L+ E S + A +GQ Sbjct: 332 VTVANA----IDNHLRDQTMQSELVDELFVSLQSRAFSGQ 367 >gi|262166149|ref|ZP_06033886.1| type I restriction-modification system specificity subunit S [Vibrio mimicus VM223] gi|262025865|gb|EEY44533.1| type I restriction-modification system specificity subunit S [Vibrio mimicus VM223] Length = 423 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 46/406 (11%), Positives = 102/406 (25%), Gaps = 29/406 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W I + ++ + + G + Sbjct: 20 DWTQQRIGSILTDISRPVQLLDNELYQL-VTVKRRNEGVVPRSVVKGKNILVKNYFAIKA 78 Query: 84 GQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G L K I + + S ++LV+ + + ++ + Sbjct: 79 GDFLISKRQVVHGANGIVPESLDNAVVSNEYLVVTDNQKITAKFWSTISKRPEIKKLYFI 138 Query: 140 ICEGATMSHA--DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G + D + IP L EQ I + +D I L++ Sbjct: 139 SSYGVDIEKLVFDITDWKERYILIPELNEQQKITDF----FQNLDQQIELHQDKHRKLQQ 194 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN---------RKN 248 K+A++ + + +++ +G + D E+ + + N Sbjct: 195 LKKAMLDKMFPRAGKKVPELRFAGFDSDWETKDFQEIFIYIRNNSLSRAELNDDAGLGMN 254 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + + + G+IVF Sbjct: 255 VHYGDVLVKFGEILDFTLEKVPFITNGGAVEKMMPNRLQDGDIVFADAAEDLTVGKCCEI 314 Query: 309 QVMERGIITSAYM---AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKR 364 + + + YL + + S + G S+ +K Sbjct: 315 NKLGSQPLFAGLHTIAVRPKKAFAPKYLGYFLNSNLYHDQLLTLIQGTKVSSISKSSIKE 374 Query: 365 LPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 V P EQ I ++ L+ +Q I L+ + + Sbjct: 375 TQVYYPKDAAEQAKIGEY----FHNLERLIVIQQQKINKLENIKQA 416 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 17/141 (12%), Positives = 51/141 (36%), Gaps = 7/141 (4%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 K + Y + G+ + + + + + + + I + + + Sbjct: 66 KNILVKNYFAIKAGDFLISKRQVVHGANGIVPESLDNAVVSNEYLVVTDNQKITAKFWST 125 Query: 336 LMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + ++ K+++ G+ + D K +L+P + EQ IT+ +D Sbjct: 126 ISKRPEIKKLYFISSYGVDIEKLVFDITDWKERYILIPELNEQQKITDF----FQNLDQQ 181 Query: 393 VEKIEQSIVLLKERRSSFIAA 413 +E + L++ + + + Sbjct: 182 IELHQDKHRKLQQLKKAMLDK 202 >gi|56418879|ref|YP_146197.1| type I restriction-modification system S subunit [Geobacillus kaustophilus HTA426] gi|56378721|dbj|BAD74629.1| type I restriction-modification system S subunit [Geobacillus kaustophilus HTA426] Length = 346 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 47/363 (12%), Positives = 114/363 (31%), Gaps = 26/363 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + + TG+ + D GKY + T S + +L Sbjct: 4 VSLGSLVNIRTGKLDANASDP-----------EGKYPFFTCSRETLKIDTYS-YDCECVL 51 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G K FD +++ + + + ++ + G + Sbjct: 52 VAGNGDLNVKYYNGKFDAY-QRTYIIESIDKNILNVKYLYYFMQLYVSKLRQMSIGGVIK 110 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + +P+P + +Q I + + ID + +L K S + Sbjct: 111 YIKLNYLTDAKIPLPNIEKQNKIVKVLEKAQELIDKRKAQIKALDQLTK-------SLFL 163 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + + I +G V + P + + + I + ++ Sbjct: 164 EMFGDLKNNRYNWPIAELGDVCISIKDGPHVSPKYTQKGIPFISVNNIINNKWDFTNVKY 223 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + E + + G+I++ + + + + A + + Sbjct: 224 ISETD----YEIFAKRCKPEKGDILYTKGGTTGFAKYI-DIDIKFMNWVHLAVLKYDKNI 278 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +D +L ++ S+ G+ L +K++ VLVPP++ Q +++ Sbjct: 279 MDGIFLTHMLNSHFCYAQSQKYTRGIANRDLVLSQMKKIKVLVPPLERQKKFVSIVEKVP 338 Query: 387 ARI 389 +RI Sbjct: 339 SRI 341 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 36/106 (33%), Gaps = 4/106 (3%) Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 ++ + + I + + + K+ G+ + +K + Sbjct: 60 VKYYNGKFDAYQRTYIIESIDKNILNVKYLYYFMQLYVSKLRQMSIGGVIKYIKLNYLTD 119 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + +P I++Q I V+ + L++K + I L + S Sbjct: 120 AKIPLPNIEKQNKIVKVLE----KAQELIDKRKAQIKALDQLTKSL 161 >gi|15645467|ref|NP_207641.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori 26695] gi|2313983|gb|AAD07897.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori 26695] Length = 298 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 104/296 (35%), Gaps = 14/296 (4%) Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 ++++ I++ G + + I +PIPPL Q I + + A T L Sbjct: 1 MFCFENLNIQNDIKSKSFGGIVKSISMNDLQQITIPIPPLEIQQEIVKILDAFTELNTEL 60 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFAL 240 TE + + + L+ + + D K+K L P E + + Sbjct: 61 NTELKARKKQYEYYQNMLLDFNDINQNHKDAKIKTYPKRLKTLLHTLAPKGVEFRKLGEV 120 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 N+K K+ E + + + G + + GE + + Sbjct: 121 CESTNKKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFN------NDGENITIASRGEY 174 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + G + Y + + + +L + +++ ++ + + G +L Sbjct: 175 AGFINYFNEKFFAGGLCYPYKVKDTNELLTKFLYFYLKTNEIQIMENLVFRGSIPALNKA 234 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 D++ L + +PP++ Q +I +++ +A L+ I I K+ R + Sbjct: 235 DIETLTIPIPPLEIQQEIVKILDQFSALTTDLLAGIPAEIKARKKQYEYYREKLLT 290 Score = 43.2 bits (100), Expect = 0.088, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 50/164 (30%), Gaps = 15/164 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 PK + + + +T + + ++ G+ V + + + Sbjct: 109 PKGVEFRKLGEVCESTNKKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFNNDGEN--- 165 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQR 136 I G Y + V ++L + L +L + ++ Sbjct: 166 ------ITIASRGEYAGFINYFNEKFFAGGLCYPYKVKDTNELLTKFLYFYLKTNEIQIM 219 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + G ++ + I + +PIPPL Q I + + + Sbjct: 220 ENLVFRG-SIPALNKADIETLTIPIPPLEIQQEIVKILDQFSAL 262 >gi|184200170|ref|YP_001854377.1| type I restriction enzyme S protein [Kocuria rhizophila DC2201] gi|183580400|dbj|BAG28871.1| type I restriction enzyme S protein [Kocuria rhizophila DC2201] Length = 398 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 49/122 (40%), Gaps = 1/122 (0%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYD 341 I++ +++F +R + + A + P +D YL + +R + Sbjct: 68 RSILEQDDLLFSIAGTIGRVARVRPSDLPGNTNQAVAIIRPNPEKVDRDYLYYCLRDTER 127 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + + + ++Q+L +V + + +P + EQ I + +I+ E++ Sbjct: 128 IARARTRVVQSVQQNLSLAEVSNIELPLPSLPEQRAIAATLGALDDKIESNRRLAERASA 187 Query: 402 LL 403 L+ Sbjct: 188 LI 189 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 51/387 (13%), Positives = 126/387 (32%), Gaps = 40/387 (10%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS--TVSIFAKGQILYGKLGPYLRKAIIA- 101 I ++ +E + K + + SI + +L+ G R A + Sbjct: 33 SSGINFVKVESITEAGTLNHSKLAFIDEQTHAMLARSILEQDDLLFSIAGTIGRVARVRP 92 Query: 102 -DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIP 158 D G + +++P + + D + A + + + NI Sbjct: 93 SDLPGNTNQAVAIIRPNPEKVDRDYLYYCLRDTERIARARTRVVQSVQQNLSLAEVSNIE 152 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 +P+P L EQ I + A +I++ R L+ L++ T+ L P + Sbjct: 153 LPLPSLPEQRAIAATLGALDDKIESNRRLAERASALIDASASQLLARTSTEVL-PLADL- 210 Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 +E+ L + + ++ + Q + + Sbjct: 211 ---VEFNRLSVNPHSTDTLR-----------------YIDIASVSSGQIDSVQELTWNEA 250 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLM 337 + V G++++ + N +L + ++ + + P S+ L ++ Sbjct: 251 PSRARRGVSDGDVIYSTVRPGNRAFALIV-DPTPGSVASTGFAVMSPSVRLGSSMLTSVV 309 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPP---IKEQFDITNVINVETARIDVLV 393 ++ + ++ G ++ + + V+VP + EQ +T + V Sbjct: 310 GAHKFAEYLESVAHGSAYPAVGIQAMGNYSVVVPKEAVVAEQ------FEADTMPLRRRV 363 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQID 420 + L R + + ++G++ Sbjct: 364 AQARAESERLAALRDTLLPELLSGRVR 390 >gi|289422996|ref|ZP_06424816.1| type I restriction system specificity protein [Peptostreptococcus anaerobius 653-L] gi|289156570|gb|EFD05215.1| type I restriction system specificity protein [Peptostreptococcus anaerobius 653-L] Length = 200 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 68/185 (36%), Gaps = 9/185 (4%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIV 292 +V N + IE+ + YG + + + E + +I G+IV Sbjct: 12 IATIVRGGNFQKKDFIENGRPCIHYGQMYTHFGIAADKTLTFVNEEVFAKSKIAKSGDIV 71 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + +A + + I S + A+ H + +L++ S + G Sbjct: 72 MAVTSENVEDVCSCTAWIGDEDIAISGHTAIISHNQNPKFLSYYFHSVMFFNQKKKLAHG 131 Query: 353 LRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RR 407 + + + + +++P I++Q + ++++ + + + E + I ++ R Sbjct: 132 TKVIEVTPSKLGDIVIMLPTIEKQNRMVSILDRFDSLCNSISEGLPAEIEARQKQYEFYR 191 Query: 408 SSFIA 412 ++ Sbjct: 192 DKLLS 196 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 49/166 (29%), Gaps = 11/166 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSD-TSTVSIFA 82 V + + G + I I + + G K + + I Sbjct: 7 VKLGEIATIVRGGNFQKKDFIENGRPCIHYGQMYTHFGIAADKTLTFVNEEVFAKSKIAK 66 Query: 83 KGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G I+ A I D D S + + P+ L + S+ + Sbjct: 67 SGDIVMAVTSENVEDVCSCTAWIGDEDIAISGHTAI-ISHNQNPKFLSYYFHSVMFFNQK 125 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + G + +G+I + +P + +Q + + ++ Sbjct: 126 KKLAHGTKVIEVTPSKLGDIVIMLPTIEKQNRMVSILDRFDSLCNS 171 >gi|86750171|ref|YP_486667.1| restriction modification system DNA specificity subunit [Rhodopseudomonas palustris HaA2] gi|86573199|gb|ABD07756.1| Restriction modification system DNA specificity domain [Rhodopseudomonas palustris HaA2] Length = 411 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 55/415 (13%), Positives = 127/415 (30%), Gaps = 57/415 (13%) Query: 26 KVVPIKRFTKLN-TGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAK 83 + P+ T+ + S++ YI L V+ T + + + + + + + Sbjct: 17 EWEPLGEVTQPTANIKWSQADGVYQYIDLTSVDIKTKRVTEASEITAETAPSRAQKLVKE 76 Query: 84 GQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIE 138 +++ P ++ + D + + ST + VL+ K LP+ + WL + + +E Sbjct: 77 NDVIFATTRPAQQRYCLIDSELAGNVASTGYCVLRAKKDQVLPKWILHWLGTTEFKNYVE 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRF 191 GA + +PIP LA Q I + T L Sbjct: 137 ENQSGAAYPAISDGKVKAFKIPIPCPDDPEKSLAIQGEIVRILDTFTELTAELTAGLAAE 196 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + K++ ++T + +EW +++ + Sbjct: 197 LAQRKKQYSHYRDQLLTFNED--------EVEW-----KTLGDIATLRRGRVMSKGYLRD 243 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 S + + + GE V D N + Sbjct: 244 NAGVYPVYSSQTANNGMIGQIDTFDFD----------GEYVSWTTDGANAGTVFYRNEKF 293 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL------ 365 + +D +L++ + + V+ MG+ L V+++ Sbjct: 294 SITNVCGVIKENGTCPLDLKFLSFWLSTEAKKHVYSGMGN---PKLMSHQVEKIPIPIPF 350 Query: 366 ----PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + ++ Q + +++ A L E + + I L ++ R ++ Sbjct: 351 PDDPKI---SLEAQKRVAAILDKLDALTTSLTEILPREIELREKQYAYYRDQLLS 402 >gi|239906158|ref|YP_002952897.1| type I restriction enzyme S protein [Desulfovibrio magneticus RS-1] gi|239796022|dbj|BAH75011.1| type I restriction enzyme S protein [Desulfovibrio magneticus RS-1] Length = 427 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 61/422 (14%), Positives = 128/422 (30%), Gaps = 32/422 (7%) Query: 23 KHWKVVP----IKRFTK---LNTGRTSESGKDIIYI-GLEDVESGTGKYLPKDGNSRQ-- 72 W V + T E D ++ +D+ +G + + Sbjct: 14 SEWAVRRRWFCLAELADGIFDCPHSTPELTADGPFLVRSQDIRTGFVDISKLAHVAEKTF 73 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKA--IIADFDGICSTQFLVLQPKDV--LPELLQGWL 128 D + + +G ILY + G Y A I + ++++PK L+ WL Sbjct: 74 LDRVSKATPEEGDILYSREGTYFGIAAEIPKGLRVCLGQRMVLIRPKRSRLASRFLRYWL 133 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S +++ + +G + I IP+P PL EQ I + + +ID Sbjct: 134 NSGILSRHLHGFRDGTVAERLNMPTIRAIPVPDFPLKEQQAIAAILGSLDDKIDLNRRIN 193 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + + + V G P + ++ + + Sbjct: 194 ETLEAMARAIFK---DWFVDFG--PTRAKMEGRAPYLAQEIWNLFPDALDDEGKPV--GW 246 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKP----ESYETYQIVDPGEIVFRFIDLQNDKRS 304 + L G ++K + G P Y + + Sbjct: 247 EYRPVGDFAELRGGKQLEKEKIAACGAIPVFGGAGIMGYTDSYNADGFVIAVGRVGAYCG 306 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 A I +A + + + +L +R D+ + + + DV Sbjct: 307 QFFAHRGRAWINNNASLIRQRDQCNGEWLYCALRHADIDVIKK---GAAQPFVSNTDVAN 363 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 LP++ P + ++ + V E I L + R + ++G+I ++ Sbjct: 364 LPIIWPG----HATLSTLSKILVPLMVKAEHNNAEIDSLAQTRDFLLPKLMSGEIRVKDA 419 Query: 425 SQ 426 + Sbjct: 420 EK 421 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 56/183 (30%), Gaps = 14/183 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P W+ P+ F +L G+ E K I G V G G D + Sbjct: 243 PVGWEYRPVGDFAELRGGKQLEKEK-IAACGAIPVFGGAGIMGYTDSYNADGFV------ 295 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G++G Y + + +++ +D L I+ Sbjct: 296 ----IAVGRVGAYCGQFFAHRGRAWINNNASLIRQRDQCNGEWLYCALRHADIDVIKK-- 349 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GA + N+P+ P A + + ++ V+ + E + Sbjct: 350 -GAAQPFVSNTDVANLPIIWPGHATLSTLSKILVPLMVKAEHNNAEIDSLAQTRDFLLPK 408 Query: 202 LVS 204 L+S Sbjct: 409 LMS 411 >gi|254374066|ref|ZP_04989548.1| predicted protein [Francisella novicida GA99-3548] gi|151571786|gb|EDN37440.1| predicted protein [Francisella novicida GA99-3548] Length = 417 Score = 74.1 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 64/419 (15%), Positives = 131/419 (31%), Gaps = 46/419 (10%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + + + E + S T +++P N+ +D S I KGQ Sbjct: 6 KKLGNYIQQVSIKNKELEVSNLL-----GVSITKEFIPSIANTVGTDMSKYKIVQKGQFA 60 Query: 88 Y----GKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEAI 140 Y + G + A++ D I ST + V + D LPE L W + + + Sbjct: 61 YGPVTSRNGDKISVALLEDDSAIVSTSYTVFEIIDKTKLLPEYLMMWFRREEFDRYARYM 120 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+T W+ + ++ +PIP + +Q I E I I + + L+E Q Sbjct: 121 SHGSTREVFGWEEMCDVELPIPSIEKQREIVA----EYYAITNRIKLNEQLNQKLEETAQ 176 Query: 201 ALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELNR---KN 248 A+ PD K +G E V +P W+V + K+ Sbjct: 177 AIYKEWFVDFEFPDENGKPYKSNGGEMVWCEELEKEIPKGWKVSKVGDEICYKKGYAFKS 236 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYE-----TYQIVDPGEIVFRFIDLQ---- 299 + E+ + + N+ K + + +I+ + Sbjct: 237 AEYSENGVGIVRVSNLTDKSVDISDCYYINEKNLTSKYEQHRLKTNDIIIATVGSWASNP 296 Query: 300 ---NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--R 354 K + +A YL + + + + G + Sbjct: 297 ASVVGKVVKVPVIANNFLLNQNAVCIRTKDYRIQEYLHQHLITKKYSEYVVSGAQGSANQ 356 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 S+ + ++P I + + ++ ++ Q L + ++ Sbjct: 357 ASVTLNHLFEYKFIIPD----SVIIDKACDTFSMVNKIINNFAQENSYLHGLKEILLSK 411 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 12/81 (14%), Positives = 26/81 (32%), Gaps = 6/81 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IPK WKV + G + S + + + ++ + + ++ T Sbjct: 212 EIPKGWKVSKVGDEICYKKGYAFKSAEYSENGVGIVRVSNLTDKSVDISDCYYINEKNLT 271 Query: 76 STV--SIFAKGQILYGKLGPY 94 S I+ +G + Sbjct: 272 SKYEQHRLKTNDIIIATVGSW 292 >gi|315225321|ref|ZP_07867136.1| conserved hypothetical protein [Capnocytophaga ochracea F0287] gi|314944715|gb|EFS96749.1| conserved hypothetical protein [Capnocytophaga ochracea F0287] Length = 260 Score = 74.1 bits (180), Expect = 5e-11, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 56/171 (32%), Gaps = 8/171 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQS 73 IPK W+ V + + G T I+++ ++ +G + + Sbjct: 92 EIPKDWRWVRMGQIGDWGAGSTPPRSNPNYYNGKILWLKTGELNNGIVFDTEEKITEKAF 151 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 ++ I G +L G + K IAD + + P + + + Sbjct: 152 QECSLRINKVGNVLIAMYGATIGKLAIADKELTTNQACCGCSPYLINN--WYLFYFLMAS 209 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 ++ EG + + +P+PPL EQ I I I+ Sbjct: 210 REQFIKRGEGGAQPNISRVKLVEHLIPLPPLYEQQRIVNTIQNIFRCIEKN 260 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 51/181 (28%), Gaps = 13/181 (7%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNM 273 E +P W + R N IL L G + + Sbjct: 84 CIDEEIPFEIPKDWRWVRMGQIGDWGAGSTPPRSNPNYYNGKILWLKTGELNNGIVFDTE 143 Query: 274 GLKPESYETYQIV---DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 E + G ++ K ++ ++ A P+ I++ Sbjct: 144 EKITEKAFQECSLRINKVGNVLIAMYGATIGKLAIADKELT----TNQACCGCSPYLINN 199 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 YL + + + G + ++ + + +PP+ EQ I N I I+ Sbjct: 200 WYLFYFLMASREQ-FIKRGEGGAQPNISRVKLVEHLIPLPPLYEQQRIVNTIQNIFRCIE 258 Query: 391 V 391 Sbjct: 259 K 259 >gi|121608536|ref|YP_996343.1| restriction modification system DNA specificity subunit [Verminephrobacter eiseniae EF01-2] gi|121553176|gb|ABM57325.1| restriction modification system DNA specificity domain [Verminephrobacter eiseniae EF01-2] Length = 403 Score = 74.1 bits (180), Expect = 5e-11, Method: Composition-based stats. Identities = 41/237 (17%), Positives = 76/237 (32%), Gaps = 24/237 (10%) Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 +E K+A + + T+GL + + +GLVP+ W F L + + + Sbjct: 162 QELKRAAMRELFTRGLRGEAQK----ETEIGLVPESWVEVVFAELGEIVTGTTPPTKDRD 217 Query: 256 ILSLSYGNIIQKLETRN--------MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 I + + + + + G I K Sbjct: 218 YYDDGTIPFISPGDIDHGFPIASTQKHITDSGLAVSRALPAGTTCVVCIGSTIGKVG--R 275 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 V+ V G D YL+ L+ +Y V A L ++L + Sbjct: 276 TTVVSSATNQQINAIVPGVGYDPNYLSHLL-TYRADIVRNAASPSPVPILSKGTFEKLML 334 Query: 368 LV---PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P EQ +I +++ +D + Q +L+E S + +TG I + Sbjct: 335 FTSTNP--DEQTEIAAILDT----LDRKIALHRQKRAVLEELFKSLLHKLMTGAIRV 385 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 57/206 (27%), Gaps = 12/206 (5%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKY 63 K++ IG +P+ W V ++ TG T + I +I D++ G Sbjct: 183 KETE---IGLVPESWVEVVFAELGEIVTGTTPPTKDRDYYDDGTIPFISPGDIDHG-FPI 238 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + S + G +G + K + Q + V + Sbjct: 239 ASTQKHITDSGLAVSRALPAGTTCVVCIGSTIGKVGRTTVVSSATNQQINAIVPGVGYDP 298 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRID 182 L + + + + + + Q I + +I Sbjct: 299 NYLSHLLTYRADIVRNAASPSPVPILSKGTFEKLMLFTSTNPDEQTEIAAILDTLDRKIA 358 Query: 183 TLITERIRFIELLKEKKQALVSYIVT 208 +R EL K L++ + Sbjct: 359 LHRQKRAVLEELFKSLLHKLMTGAIR 384 >gi|261496173|ref|ZP_05992579.1| DNA methylase-type I restriction-modification system [Mannheimia haemolytica serotype A2 str. OVINE] gi|261308125|gb|EEY09422.1| DNA methylase-type I restriction-modification system [Mannheimia haemolytica serotype A2 str. OVINE] Length = 478 Score = 74.1 bits (180), Expect = 5e-11, Method: Composition-based stats. Identities = 49/414 (11%), Positives = 113/414 (27%), Gaps = 43/414 (10%) Query: 30 IKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI---F 81 + + + +G T + ++ + D+ + D + I Sbjct: 45 LNEVSLIKSGTTPTDRDDNLKEGVVLLKTNDIRNNLLNKYSSVDYFISEDINEKMISSQL 104 Query: 82 AKGQILYGKLGPYLRKAIIADF------DGICSTQFLV-----LQPKDVLPELLQGWLLS 130 +G +L +G L + + K++LP L +L S Sbjct: 105 KEGDVLVNIVGATLEVVGRVAYVSSTFPKANITQAMSFVRLKSKYNKELLPTYLFAFLQS 164 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +I + + + +G I +P+ L Q + + I + Sbjct: 165 SYGKIQINRNARPTGQYNLNNEELGAIKVPLIDLETQKQVDKIIKQSNDFVQKSTQSYQE 224 Query: 191 FIELLKEK--------------KQALVSYIVTKG-LNPDVKMKDSGIEWVGLVPDHWEVK 235 LL E ++L + G L+ + W + + Sbjct: 225 AETLLLENLGLRAFQADSNPVNVKSLKESFLQTGRLDAEYYQTKYEQYWNLIQSQDYVFI 284 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE-SYETYQIVDPGEIVFR 294 + + + + + NI N E G+I+ Sbjct: 285 R-DEYLHITQKPDWTKPMYQYIEIGDVNIGDGSYQTNWIETQELPANAKTQAQTGDILIS 343 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSG 352 + ++ + + + + + ++ L L+RS G Sbjct: 344 TVRPYRGAVTIIGENDQDLVVSGAFTVLRRKENSVFNNEVLKVLLRSELYKDWLLQFNVG 403 Query: 353 LR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 +K D+ LP+ + Q I I + + L ++ + + K Sbjct: 404 TSYPVIKDNDILNLPIPKISGEIQEKIAEYI----RQSNDLRQQAQNLLAQAKN 453 >gi|308179939|ref|YP_003924067.1| type I restriction-modification system specificity subunit [Lactobacillus plantarum subsp. plantarum ST-III] gi|308045430|gb|ADN97973.1| type I restriction-modification system specificity subunit [Lactobacillus plantarum subsp. plantarum ST-III] Length = 297 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 105/293 (35%), Gaps = 23/293 (7%) Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +L + + + I + +T+ + K I + +P L EQ + + +I + I Sbjct: 18 YFLYFLISEENLSKIADTSTIPQINNKHIIPYTIYLPCLMEQQRLGKVLILLSNLIAANE 77 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + + L K Q + S + + K W + + + Sbjct: 78 DKLEQLKTLKKLMMQKIFSQ--------EWRFKGFTDPWEQRKLKWFLRVSKLKNIDGIF 129 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 KN+ L ++ ++ + +S Y I+D ++V+ L+++ + Sbjct: 130 DKNSVLS-----VSGEFGVVNQIAFQGRSFAGKSILNYGILDHNDVVYTKSPLKSNPYGI 184 Query: 306 RSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSY-DLCKVFYAMGSGLRQS---LKFE 360 + + G++++ Y P + S L + + + + ++ ++ E Sbjct: 185 IKTNLGKAGVVSTLYAVYAPLKTVYSPILGYYFNLDTRVNNYLRPLVNKGAKNDMKVRDE 244 Query: 361 DVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 V V +P I+ Q I N + ID L+ E + LKE + + Sbjct: 245 AVLEGKVCIPDSIETQKRICN----LFSLIDNLIAANEDKLNQLKELKKYLMQ 293 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 37/103 (35%), Gaps = 7/103 (6%) Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + MA+ P D +L +L+ +L K+ + + + + + +P + Sbjct: 1 MFMDTNMMALTPVETDLYFLYFLISEENLSKIAD---TSTIPQINNKHIIPYTIYLPCLM 57 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 EQ + + L+ E + LK + + + Sbjct: 58 EQQR----LGKVLILLSNLIAANEDKLEQLKTLKKLMMQKIFS 96 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 59/190 (31%), Gaps = 8/190 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAK 83 W+ +K F +++ + + D + E G + G S + I Sbjct: 108 WEQRKLKWFLRVSKLKNIDGIFDKNSVLSVSGEFGVVNQIAFQGRSFAGKSILNYGILDH 167 Query: 84 GQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT--QRI 137 ++Y K PY G+ ST + V P + + G+ ++D + Sbjct: 168 NDVVYTKSPLKSNPYGIIKTNLGKAGVVSTLYAVYAPLKTVYSPILGYYFNLDTRVNNYL 227 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + + + + + ID LI + LKE Sbjct: 228 RPLVNKGAKNDMKVRDEAVLEGKVCIPDSIETQKRICN-LFSLIDNLIAANEDKLNQLKE 286 Query: 198 KKQALVSYIV 207 K+ L+ + Sbjct: 287 LKKYLMQNMF 296 >gi|258509975|ref|YP_003175638.1| HsdS [Lactobacillus rhamnosus Lc 705] gi|257152816|emb|CAR91787.1| Restriction modification system DNA specificity domain [Lactobacillus rhamnosus Lc 705] Length = 199 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 20/127 (15%), Positives = 49/127 (38%), Gaps = 5/127 (3%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 GE + D ND ++ V + + + ++ + +LM + + Sbjct: 75 HNGEFILVAEDGANDLKNYPIQYVNGKAWVNNHAHVLQGKKTITDN-KFLMNAIKNFNIE 133 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + G R L + + +L +L+P EQ I + + +D + ++ + L+E Sbjct: 134 PFLVGGGRAKLNADVMMKLNILLPTFVEQEKIGS----LFSLLDKTIALHQRKLEKLQEL 189 Query: 407 RSSFIAA 413 + ++ Sbjct: 190 KKGYLQK 196 >gi|114566066|ref|YP_753220.1| type I restriction-modification system (specificity subunit) [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114337001|gb|ABI67849.1| type I restriction-modification system (specificity subunit) [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 263 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 27/255 (10%), Positives = 79/255 (30%), Gaps = 23/255 (9%) Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 PL Q I + + + + L + L + + +P K Sbjct: 25 YRPLETQKQIAKTLDTVSELLAILKQQLAELDNL-------IKTTFYDMFGDPVTNEKGW 77 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ + + + ++ + + + + +I + + P Sbjct: 78 EIKTIAEIAE--------QKLSYGSGASAIEYDGITRYIRITDINDNGSLNDDIVSPSET 129 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMR 338 ++ G+I+F K +LR + R I + + P + Y+ + + Sbjct: 130 SAKYNLNDGDILFARSGATVGK-TLRYRRSFGRCIYAGYLIRLVPKKALVLPDYIYYFTK 188 Query: 339 SYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + + + ++ + + VPP+ Q ++ + + ++ Sbjct: 189 TDYYKGFIESNMKTVAQPNINAQQYGTFKICVPPLNLQTQFAEIV----TKTEEQKALVQ 244 Query: 398 QSIVLLKERRSSFIA 412 ++I + S ++ Sbjct: 245 KAINETQYLFDSLMS 259 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 57/161 (35%), Gaps = 13/161 (8%) Query: 23 KHWKVVPIKRFTK----LNTGRTSESGKDI-IYIGLEDVESGTGKYLPKDGNSRQSDTST 77 K W++ I + +G ++ I YI + D+ S+TS Sbjct: 75 KGWEIKTIAEIAEQKLSYGSGASAIEYDGITRYIRITDINDNGSLNDDIVS---PSETSA 131 Query: 78 VSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKD--VLPELLQGWLLSID 132 G IL+ + G + K + I + + L PK VLP+ + + + Sbjct: 132 KYNLNDGDILFARSGATVGKTLRYRRSFGRCIYAGYLIRLVPKKALVLPDYIYYFTKTDY 191 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 IE+ + + + + G + +PPL Q E Sbjct: 192 YKGFIESNMKTVAQPNINAQQYGTFKICVPPLNLQTQFAEI 232 >gi|332829958|gb|EGK02586.1| hypothetical protein HMPREF9455_00836 [Dysgonomonas gadei ATCC BAA-286] Length = 657 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 61/191 (31%), Gaps = 3/191 (1%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + + + R + ++ + R + IV G+ Sbjct: 1 MAKEYFIKDFLKRIKRPIELNGDEEYKLVTIKMNHNGVILRERKKGCDIKSNMYIVHEGD 60 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + ID +N + + + + I S L + S G Sbjct: 61 FILSGIDARNGAFGIIPPGLDGAIVTNDFWYFDLEEDIISKELFLEITSTGWFDEICKKG 120 Query: 351 S-GLRQSLK--FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 S G Q ++ + + +P EQ I I ++ +L ++IE LL++ + Sbjct: 121 SDGTTQRIRLQKDKFFNQKIWLPEKDEQKIILEKIRSFKSKFKILSKQIEYQQELLQKFK 180 Query: 408 SSFIAAAVTGQ 418 + + A+ G+ Sbjct: 181 QAILQDAIQGK 191 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 58/439 (13%), Positives = 125/439 (28%), Gaps = 56/439 (12%) Query: 30 IKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 IK F K + ++ + ++ +G K G S + I +G + Sbjct: 7 IKDFLKRIKRPIELNGDEEYKLVTIKMNHNGVILRERKKGCDI---KSNMYIVHEGDFIL 63 Query: 89 GKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSI--DVTQRIEAICEG 143 + I I + F ++ + ++ + + +G Sbjct: 64 SGIDARNGAFGIIPPGLDGAIVTNDFWYFDLEEDIISKELFLEITSTGWFDEICKKGSDG 123 Query: 144 ATMS-HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 T N + +P EQ +I EKI + + L + ELL++ KQA+ Sbjct: 124 TTQRIRLQKDKFFNQKIWLPEKDEQKIILEKIRSFKSKFKILSKQIEYQQELLQKFKQAI 183 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGL----------------------------------- 227 + + L D + ++ +E Sbjct: 184 LQDAIQGKLTADWREQNPDVESASELLKRIKAEKTKLIKEKKIKKEKPLPPIKGDKISYA 243 Query: 228 VPDHWEVKPFFALV------TELNRKNTKLIESNILSLSYGNIIQKL-ETRNMGLKPESY 280 +P+ W + + + S I + N+ ++ ES Sbjct: 244 LPEDWTWCYLGDICSKTGSGSTPRGGKSVYTSSGIKFIRSQNVYDSSLILEDIVFISEST 303 Query: 281 ETYQ---IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 V +++ + E I + I ++ + ++ Sbjct: 304 HKSMSGTKVIANDLLLNITGGSIGRCCQVPNDFDEANINQHVAIIRVIQPILNSIIHMII 363 Query: 338 RSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S K+ A R+ L + +P+ +PP EQ I + + + L +I Sbjct: 364 CSPYFQNKIIEAQTGAGREGLPKNKMDIIPIPLPPFIEQQIIVEKVESLLGKCNQLSVEI 423 Query: 397 EQSIVLLKERRSSFIAAAV 415 E + + + Sbjct: 424 ENQRKYSQYLQKALFNEVF 442 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 69/199 (34%), Gaps = 12/199 (6%) Query: 21 IPKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W + G++ + I +I ++V + S + Sbjct: 244 LPEDWTWCYLGDICSKTGSGSTPRGGKSVYTSSGIKFIRSQNVYDSSLILEDIVFISEST 303 Query: 74 DTS-TVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQF-LVLQPKDVLPELLQGWL 128 S + + +L G + + D + ++ + +L ++ + Sbjct: 304 HKSMSGTKVIANDLLLNITGGSIGRCCQVPNDFDEANINQHVAIIRVIQPILNSIIHMII 363 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S +I GA + IP+P+PP EQ +I EK+ + + + L E Sbjct: 364 CSPYFQNKIIEAQTGAGREGLPKNKMDIIPIPLPPFIEQQIIVEKVESLLGKCNQLSVEI 423 Query: 189 IRFIELLKEKKQALVSYIV 207 + + ++AL + + Sbjct: 424 ENQRKYSQYLQKALFNEVF 442 >gi|298375509|ref|ZP_06985466.1| restriction endonuclease S subunit [Bacteroides sp. 3_1_19] gi|298268009|gb|EFI09665.1| restriction endonuclease S subunit [Bacteroides sp. 3_1_19] Length = 267 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 44/225 (19%), Positives = 73/225 (32%), Gaps = 14/225 (6%) Query: 10 YKDSGVQ--WIG----AIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDV 56 YK SG + W IPK W + IK F +G T +S +I +I ++ Sbjct: 31 YKSSGGEMVWNKKLKREIPKGWNISLIKDFATTYSGGTPKSTNIEYYNNGEIAWINSGEL 90 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 S + S+ ++ IL G K + F+ + + P Sbjct: 91 NSPIITKTTNYITKCGLENSSAKLYPSNSILVAMYGATAGKVSLLTFEACSNQAVCGIIP 150 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + L + + + G+ + I NI +PIP L EKI + Sbjct: 151 -TIENMLYYVYFHISSLYSHFITLSTGSARDNISQNTIKNILLPIPTRNILKLFDEKIGS 209 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG 221 I + E L++ V+ + V K G Sbjct: 210 IYQMIVNNYQQIDSLAMQRDELLPLLMNGQVSVNSDLSVYKKRRG 254 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 73/250 (29%), Gaps = 26/250 (10%) Query: 190 RFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFAL 240 + L + L Y + P+ K SG E V +P W + Sbjct: 1 MLNQNLTAMAKQLYDYWFVQFDFPNEEGKPYKSSGGEMVWNKKLKREIPKGWNISLIKDF 60 Query: 241 VTELNRKN------TKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEI 291 T + I ++ G + + T+ + + ++ I Sbjct: 61 ATTYSGGTPKSTNIEYYNNGEIAWINSGELNSPIITKTTNYITKCGLENSSAKLYPSNSI 120 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + K SL + + A + P + Y + S Sbjct: 121 LVAMYGATAGKVSLLTFE----ACSNQAVCGIIPTIENMLYYVYFHISSLYSHFITLSTG 176 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 R ++ +K + + +P +I + + + I ++ Q I L +R + Sbjct: 177 SARDNISQNTIKNILLPIPT----RNILKLFDEKIGSIYQMIVNNYQQIDSLAMQRDELL 232 Query: 412 AAAVTGQIDL 421 + GQ+ + Sbjct: 233 PLLMNGQVSV 242 >gi|91201732|emb|CAJ74792.1| unknown protein [Candidatus Kuenenia stuttgartiensis] Length = 137 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 44/132 (33%), Gaps = 1/132 (0%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 I+ +IV K L + + + I YL + + S + + Sbjct: 3 ILSENDIVIARTGGTIGKSFLIKDIPVRSLFASYLIRVIPSKNIFPEYLKYFLESPEYWE 62 Query: 345 VFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 Y + ++ + L V + P+ EQ I ++ A I L +++ + Sbjct: 63 QLYDAAWGAGQPNVNGTSLSNLIVSLSPLAEQQAIVERVDKLMAMIGELEKQVSERKEQS 122 Query: 404 KERRSSFIAAAV 415 + S + A Sbjct: 123 EMLMQSVLREAF 134 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/136 (20%), Positives = 58/136 (42%), Gaps = 4/136 (2%) Query: 79 SIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 +I ++ I+ + G + K+ + S V+ K++ PE L+ +L S + Sbjct: 2 TILSENDIVIARTGGTIGKSFLIKDIPVRSLFASYLIRVIPSKNIFPEYLKYFLESPEYW 61 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +++ GA + + + N+ + + PLAEQ I E++ I L + E Sbjct: 62 EQLYDAAWGAGQPNVNGTSLSNLIVSLSPLAEQQAIVERVDKLMAMIGELEKQVSERKEQ 121 Query: 195 LKEKKQALVSYIVTKG 210 + Q+++ KG Sbjct: 122 SEMLMQSVLREAFAKG 137 >gi|327390066|gb|EGE88410.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA04375] Length = 220 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 59/186 (31%), Gaps = 10/186 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + + ++++ N L N + Y IV Sbjct: 44 EMFGDVILNEKEWKVSKWNEILTIRNGKNQKQVEDADGKFPIYGSGGIMGYAKDWIVKKN 103 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ N +R + I+S YL + + Y+ K+ A+ Sbjct: 104 SVIIGRKGNINKPILVRENFWNVDTAFG---LEPVLEKINSEYLFYFCQLYNFEKLNKAV 160 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 SL D+ + + +PP+ Q + + + ++D I++S+ L+ + S Sbjct: 161 ---TIPSLTKSDLLNISIPLPPLALQNEFADFVV----QVDKSQLAIQKSLEELETLKKS 213 Query: 410 FIAAAV 415 + Sbjct: 214 LMQEYF 219 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 54 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 101 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 102 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 158 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 159 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 189 >gi|288937352|ref|YP_003441411.1| restriction modification system DNA specificity domain protein [Klebsiella variicola At-22] gi|288892061|gb|ADC60379.1| restriction modification system DNA specificity domain protein [Klebsiella variicola At-22] Length = 430 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 45/430 (10%), Positives = 122/430 (28%), Gaps = 56/430 (13%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W + F +L G G+ + G + D + Sbjct: 5 WIECELGDFIELKRGYDLPKSTR---------NEGSIPIISSSGFT---DFHDKPMVKGP 52 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA----- 139 ++ G+ G + +T V+ K + L +I + Sbjct: 53 GVVTGRYGTIGEVFYSEEDFWPLNTTLYVVDFKGNDRLFVYYLLQTISYADYTDKAAVPG 112 Query: 140 -----ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL---------- 184 + + + + L ++V + ++I ++ Sbjct: 113 VNRNHLHKAKVKVPISLDIQQKVAAQLYQLEKRVALSKQINQTLEQMSQTLFKSWFVDFD 172 Query: 185 --ITERIRFIELLKEKKQA-------LVSYIVTKGLNPDV---KMKDSGIEWVGLVPDHW 232 I + + E Q+ + + K L D+ D +G +P +W Sbjct: 173 PVIDNALDAGNSIPEALQSRAELRQKVRNNADFKPLPADIRALFPDDFEETELGWIPKNW 232 Query: 233 EVKPFFAL--VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 ++ F + + + N K+ + + + + + G + + G+ Sbjct: 233 HIRDFSDIAVLIKNNIKSDDICDDIHYIGLEHLERKHIFITSYGNGSDVSSNKSAFNKGD 292 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYAM 349 ++F + K ++ GI ++ + + + +A + + Sbjct: 293 LLFGKLRPYFHKVAITPFS----GICSTDILVFRAKEKFYKSLMAMYSFTDEFVAYANLR 348 Query: 350 GSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +G R + +D+ + +++P I + + L R Sbjct: 349 STGTRMPRAEAKDLLKYKIILPNKD----ILEKFELLLEDYWAKGQLNNNENDHLTALRD 404 Query: 409 SFIAAAVTGQ 418 + + ++G+ Sbjct: 405 TLLPKLISGE 414 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 45/190 (23%), Positives = 67/190 (35%), Gaps = 5/190 (2%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G IPK+W + L DI YIGLE +E + + Sbjct: 225 LGWIPKNWHIRDFSDIAVLIKNNIKSDDICDDIHYIGLEHLERKHIFITSYG--NGSDVS 282 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL-LQGWLLSIDVT 134 S S F KG +L+GKL PY K I F GICST LV + K+ + + + + + Sbjct: 283 SNKSAFNKGDLLFGKLRPYFHKVAITPFSGICSTDILVFRAKEKFYKSLMAMYSFTDEFV 342 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 G M A+ K + + +P + + E L Sbjct: 343 AYANLRSTGTRMPRAEAKDLLKYKIILPNKDILEKFELLLEDYWAKGQLNNNENDHLTAL 402 Query: 195 LKEKKQALVS 204 L+S Sbjct: 403 RDTLLPKLIS 412 >gi|119510902|ref|ZP_01630025.1| putative type I restriction enzyme specificity protein [Nodularia spumigena CCY9414] gi|119464430|gb|EAW45344.1| putative type I restriction enzyme specificity protein [Nodularia spumigena CCY9414] Length = 60 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 26/54 (48%), Positives = 36/54 (66%) Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +PP EQ IT+ +N E +I KI+++I LLKE R+S I AVTG+ID+R Sbjct: 2 IPPFNEQLQITDFLNKEMQKIYQQKAKIKEAIELLKEYRTSLITNAVTGKIDVR 55 >gi|146321306|ref|YP_001201017.1| type I restriction-modification system, S subunit [Streptococcus suis 98HAH33] gi|145692112|gb|ABP92617.1| type I restriction-modification system, S subunit [Streptococcus suis 98HAH33] Length = 253 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 72/210 (34%), Gaps = 22/210 (10%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET---- 282 +P+ WE A+VT K + + ++ + ++ +KP + + Sbjct: 4 DIPESWEWVRLGAIVTAKGGKRIPKGYNLQEEDNGHPYLRVTDMKDGTIKPTNIKFAPDN 63 Query: 283 -YQIVDPGEI----VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 Y I+ I ++ I + + +A + I+ +LA L+ Sbjct: 64 VYTIIRNYTISSTDIYVTIAGTIGDVGIVPENFNNALLTENALKLMLTESINKMFLAHLL 123 Query: 338 RSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +S + K F + L + +PP+ EQ I I + VE Sbjct: 124 KSPLVQKQFKEVYNQVAQPKLSIRSTNSTIIPLPPLAEQKRIVAQIERALEQ----VEVY 179 Query: 397 EQSIVLLKE--------RRSSFIAAAVTGQ 418 +S L+E + S + A+ G+ Sbjct: 180 AESYNKLQELDRAFPDKLKKSILQYAMQGK 209 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 68/207 (32%), Gaps = 15/207 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGKYLPKDGNSRQ 72 IP+ W+ V + G+ G ++ Y+ + D++ GT K Sbjct: 4 DIPESWEWVRLGAIVTAKGGKRIPKGYNLQEEDNGHPYLRVTDMKDGTIKPTNIKFAPDN 63 Query: 73 -SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWL 128 + I G I+ + + ++ + + L L Sbjct: 64 VYTIIRNYTISSTDIYVTIAGTIGDVGIVPENFNNALLTENALKLMLTESINKMFLAHLL 123 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S V ++ + + + + +P+PPLAEQ I +I +++ Sbjct: 124 KSPLVQKQFKEVYNQVAQPKLSIRSTNSTIIPLPPLAEQKRIVAQIERALEQVEVYAESY 183 Query: 189 IRFIELLKEKK----QALVSYIVTKGL 211 + EL + ++++ Y + L Sbjct: 184 NKLQELDRAFPDKLKKSILQYAMQGKL 210 >gi|332202401|gb|EGJ16470.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA41317] Length = 286 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 75/199 (37%), Gaps = 9/199 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPL+EQ I E I + ++D R Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEYAESYNR 262 Query: 191 FIELLKEKKQALVSYIVTK 209 +L K+ L + Sbjct: 263 LEQLDKKFPDKLKNLFFNM 281 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 75/201 (37%), Gaps = 15/201 (7%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV- 391 +++ S + F ++ SG ++L + V + + +PP+ EQ I I ++D Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEY 256 Query: 392 ------LVEKIEQSIVLLKER 406 L + ++ LK Sbjct: 257 AESYNRLEQLDKKFPDKLKNL 277 >gi|296277302|ref|ZP_06859809.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus MR1] Length = 212 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 67/212 (31%), Gaps = 4/212 (1%) Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+ + + + G W K ++ N++ E +L+ S Sbjct: 2 LLQQQKKCYIQKIFSQELRFKDEEGNYYKGWNKKQLKDVLEFSNKRTINENEYPVLTSSR 61 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 +I + + + P + + +++ GII+ Y Sbjct: 62 QGLILQSDYYKDRKTFAESNIGYFILPKNHITYRSRSDDGIFKFNLNLMIDVGIISKYYP 121 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 K + YL + + + L +D++ + +P +EQ I + Sbjct: 122 VFKGIDANQYYLTLHLNYQLKKEYIKYATGTSQLVLSQKDLQNIKTKLPSYEEQQKIGDF 181 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + ID LVEK + LK R+ + Sbjct: 182 ----FSEIDRLVEKQSSKVGRLKVRKKELLQK 209 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 53/184 (28%), Gaps = 5/184 (2%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W +K + + RT + + Y KD + I Sbjct: 30 KGWNKKQLKDVLEFSNKRTINENEYPVLTSSRQGLILQSDY-YKDRKTFAESNIGYFILP 88 Query: 83 KGQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K I Y G + + GI S ++ + + L+ + + Sbjct: 89 KNHITYRSRSDDGIFKFNLNLMIDVGIIS-KYYPVFKGIDANQYYLTLHLNYQLKKEYIK 147 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + K + NI +P EQ I + ++ ++ R KE Sbjct: 148 YATGTSQLVLSQKDLQNIKTKLPSYEEQQKIGDFFSEIDRLVEKQSSKVGRLKVRKKELL 207 Query: 200 QALV 203 Q + Sbjct: 208 QKMF 211 >gi|319428579|gb|ADV56653.1| restriction modification system DNA specificity domain protein [Shewanella putrefaciens 200] Length = 383 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 67/372 (18%), Positives = 123/372 (33%), Gaps = 25/372 (6%) Query: 26 KVVPIKRFTKLN--TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + V + T + + YIGLE ++SG+ K + + G + + S +F K Sbjct: 5 QTVKFGDICREVKLTTKDPIADGYERYIGLEHLDSGSLK-IKRWGVIAEDNPSFTRVFKK 63 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP--ELLQGWLLSIDVTQRIEAIC 141 G IL+GK PYL+KA IA+FDGICS +V++P + L+ + S + + Sbjct: 64 GHILFGKRRPYLKKAAIAEFDGICSGDIIVMEPTNSFIAASLIPNIVQSELMWEWAIKTS 123 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ +K + + + + AEQ+ + + + Sbjct: 124 SGSLSPRTKFKLLAELDITLMSNAEQIRKIKVFNKFDDVERLQYDVGDSLNLVWNTLYRE 183 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR-KNTKLIESNILSLS 260 S S I G + D V V + I +S Sbjct: 184 FYS--------------SSDIAPNGKLRDVIHVLQPGKSVKSASTAARKTQIGVLKVSAV 229 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G + E + + + E + D +++ + Q + T Sbjct: 230 SGGFYKPSENKLVTQESEIEKLQICPDKSDLLITRANTPQLVGDSCIVQDKFENVFTPDK 289 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCK--VFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQ 375 + L L K + + +G +++ + + V +P EQ Sbjct: 290 IWRAQVVPGVDKYWLLQLLQYLRKSGMLGKVATGTSNSMKNISQSKMLDIDVYIPTALEQ 349 Query: 376 FDITNVINVETA 387 I VI Sbjct: 350 EKIGRVIKCLMQ 361 >gi|217033266|ref|ZP_03438697.1| hypothetical protein HP9810_9g19 [Helicobacter pylori 98-10] gi|216944207|gb|EEC23632.1| hypothetical protein HP9810_9g19 [Helicobacter pylori 98-10] Length = 390 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 59/395 (14%), Positives = 116/395 (29%), Gaps = 29/395 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ---S 73 W+ +K K+ G T + I +I +D+ + G+Y+ K S Sbjct: 2 SEWQTFCLKDLGKIVGGATPSTNNPKNYGNKIAWITPKDLSTLQGRYIKKGSRSISRLGF 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K IL+ P IA + F + P + + L Sbjct: 62 KSCSCVLLPKHAILFSSRAPI-GYVAIAKKRLCTNQGFKSIIPNKKI-YFEFLYYLLKYH 119 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I I G T +G + IPP + +KI +D I + E Sbjct: 120 KDNISNIGGGTTFKEVSGATLGLFKVKIPPTYYEQ---QKIARTLSILDQKIENNHKINE 176 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LL + + L + D K + + ++ Sbjct: 177 LLHKILELLYEQYFVRFDFLDENNKPYQTSGGKMKFSKELNRLIPNDFEVKTLGELTQLK 236 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + ++ + K P ETYQ I+ + + Sbjct: 237 VGNKNANHSSNQGKYPFFTCSNNPLRCETYQFEGKHIIISGNGNFYVTHYDGKFDAYQRT 296 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 ++ P+ + L +L + + + + D++ + +++P +K Sbjct: 297 YVVN-------PNNPNHYVLIYLFVKSYTNYLKLQSRGSIIKFITKSDIENIKIVLPNLK 349 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 NV+ ++E QS L R Sbjct: 350 TYAKWNNVL--------KMIENNNQSTQTLTALRD 376 >gi|315639046|ref|ZP_07894215.1| type I restriction modification DNA specificity domain protein [Campylobacter upsaliensis JV21] gi|315480874|gb|EFU71509.1| type I restriction modification DNA specificity domain protein [Campylobacter upsaliensis JV21] Length = 191 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 58/165 (35%), Gaps = 7/165 (4%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYE--TYQIVDPGEIVFRFIDLQNDKRSL 305 T + + + + + + ES++ V +IVF I K +L Sbjct: 21 YTYKKGRAYIRIKDLSFKEDISLNSAVFIDESFKPTNEVRVKENDIVFATIGATIGKANL 80 Query: 306 RSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVK 363 + ++ I + + + ++ +S + + + + + + Sbjct: 81 VTQELAGSFISNNTSKFSIFNQLAYPAFCTYIFQSNFFQEFIKQNTTITAQPKITNKCIL 140 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 L + +PP+ EQ I I+ AR L ++ + LL+ + Sbjct: 141 NLKIPLPPLTEQERIAKEISQRKARAKALKQEAK---ELLESAKK 182 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 59/187 (31%), Gaps = 11/187 (5%) Query: 28 VPIKRFT-----KLNTGRTSESGKDIIYIGLEDVESG-TGKYLPKDGNSRQSDTSTVSIF 81 V + L + + K YI ++D+ + Sbjct: 2 VRLGEVGIMQNGSLISEKLYTYKKGRAYIRIKDLSFKEDISLNSAVFIDESFKPTNEVRV 61 Query: 82 AKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + I++ +G + KA + +T + + P S + Sbjct: 62 KENDIVFATIGATIGKANLVTQELAGSFISNNTSKFSIFNQLAYPAFCTYIFQSNFFQEF 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 I+ K I N+ +P+PPL EQ I ++I R L E +E K Sbjct: 122 IKQNTTITAQPKITNKCILNLKIPLPPLTEQERIAKEISQRKARAKALKQEAKELLESAK 181 Query: 197 EKKQALV 203 ++ + ++ Sbjct: 182 KEVEHII 188 >gi|301162157|emb|CBW21702.1| putative type I restriction enzyme specificity protein [Bacteroides fragilis 638R] Length = 368 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 48/395 (12%), Positives = 109/395 (27%), Gaps = 47/395 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTS 76 W+ I + G T ++ I + ++ ++ + + S Sbjct: 9 EWETKSINDLADVIGGGTPDTTVKSYWDGGIQWFTPSEIGKNKFVDASLRTITEDGLNNS 68 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + IL ++ + F L K+ + + L + Sbjct: 69 SAKLLPPNTILLSSRATIGECSLSLRECA-TNQGFQSLVSKNCN--VDFLYYLIQTKKKD 125 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + G+T + I + +P EQ I E + ID I + + IE LK Sbjct: 126 LIRKSCGSTFLEISANEVRKIQVSVPSDVEQQKIAELL----SLIDKRIATQNKIIEDLK 181 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 K A+ ++ G + K I +G + + +KN Sbjct: 182 LLKSAISLNVLHSG--TWKQFKIKDIAQIG-------RGRVISSIEISQQKNPTY----- 227 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 S + E + + Sbjct: 228 PVYSSQTSNDGIMGYLDDYMFEGEYISW------------TTDGANAGTVFYRDGKFNCT 275 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +G D T+ L+ + K L + + + +P +EQ Sbjct: 276 NVCGLLKLLNGFD-THFVSLILAEATKKYVSINL--ANPKLMNNIMGNIQICLPEFEEQK 332 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 I ++ ++ L++ + + +++ + Sbjct: 333 RI----SIVFKKLQELLDVQKILLNQYSKQKQCLL 363 >gi|213616106|ref|ZP_03371932.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. E98-2068] Length = 126 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 14/81 (17%), Positives = 33/81 (40%) Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + + + L + + P+ VPP++EQ +I + A D + +++ Sbjct: 3 NDSNNISLSQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVN 62 Query: 398 QSIVLLKERRSSFIAAAVTGQ 418 ++ + S +A A G+ Sbjct: 63 NALNRVNSLTQSILAKAFRGE 83 >gi|288870250|ref|ZP_06409683.1| putative type II restriction-modification enzyme [Clostridium hathewayi DSM 13479] gi|288867882|gb|EFD00181.1| putative type II restriction-modification enzyme [Clostridium hathewayi DSM 13479] Length = 889 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 54/395 (13%), Positives = 117/395 (29%), Gaps = 48/395 (12%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSI 80 V + ++ G T + + + +++ SG + K Sbjct: 520 VRLGELVQIIKGVTYSKEDQVYNETNNVILTADNITNSGDFDVVKKVFLRADLTIDGTKK 579 Query: 81 FAKGQILYG----KLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDV 133 + I + A I+ + F+ + +DV + L L S Sbjct: 580 LKQNDIFMCFSSGSKSHVGKSAYISYNTEYFAGGFMGVLRCKSEDVSMKYLWAILSSNQF 639 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I G +++ + +I +P+PPL Q I +I +I + Sbjct: 640 RHIISQESTGININNLS-ANLADIKIPLPPLDVQKKIVAEIEEIDREESYIIEQVDALRY 698 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + V G ++ + + + + Sbjct: 699 S--------ILSAVKNGA------------------AGEPLEKLGVVASYSQDRISCAEL 732 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 S+ + N++Q +E + T G I+ I K L Sbjct: 733 SSDTYVGVDNLLQNMEGKGSSQFVPKSGTAIAYSKGNILLSNIRPYLKKIWLADNDGGSS 792 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPI 372 G + + + I S YL +L+ + + + G+ V V +P + Sbjct: 793 GDV--LVLKMDDTKISSKYLYYLLATDEFFEYEMQHIKGVKMPRADKASVLNYNVPIPSL 850 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +Q +I I +I+ + + + LK+++ Sbjct: 851 FKQQEIVAEIE----KIESEITTRKMRLEDLKKQK 881 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 17/141 (12%), Positives = 43/141 (30%), Gaps = 14/141 (9%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM---AVKPHGIDSTYLAW 335 + + + + +I F + +M K + YL Sbjct: 573 TIDGTKKLKQNDIFMCFSSGSKSHVGKSAYISYNTEYFAGGFMGVLRCKSEDVSMKYLWA 632 Query: 336 LMRSYDLCKVFYAMGSGLRQSLK--FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++ S + +G+ ++ ++ + + +PP+ Q I I + + Sbjct: 633 ILSSNQFRHIISQESTGI--NINNLSANLADIKIPLPPLDVQKKIVAEIEEIDRE-ESYI 689 Query: 394 EKIEQSIVLLKERRSSFIAAA 414 I + R S ++A Sbjct: 690 ------IEQVDALRYSILSAV 704 >gi|153811192|ref|ZP_01963860.1| hypothetical protein RUMOBE_01584 [Ruminococcus obeum ATCC 29174] gi|149832690|gb|EDM87774.1| hypothetical protein RUMOBE_01584 [Ruminococcus obeum ATCC 29174] Length = 380 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 63/385 (16%), Positives = 125/385 (32%), Gaps = 43/385 (11%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +L + + +++ +P GN + D S + Y Sbjct: 7 KFGELIELTNEKNANGLYGEDDAIGVNIDK---IIMPMRGNLEKKDFSNFHLVPPRHFAY 63 Query: 89 GKLGPYLRKAIIAD----FDGICSTQFLVLQ---PKDVLPELLQGWLLSIDVTQRIEAIC 141 G D F + ++ K +L L +L + + E I Sbjct: 64 NPRGSRKLGIGFNDTEKTFIITFNDNVFRIKETAKKKILDTYLFMYLCRKEWDRYAEFIS 123 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G++ DW + +PP+ Q + A Sbjct: 124 WGSSTEVFDWNIFCEEEIFLPPIQIQQKYVDVYNAMLEN--------------------- 162 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 +GL+ + D+ IE + H + + KN L+ + ++ Sbjct: 163 --QKSYERGLDDLKLVCDAYIEELRKELPHK---KLGNYIALCDEKNDDLV-YGLDAVRG 216 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVF-RFIDLQNDKRSLRSAQVMERGIITSAY 320 +I ++ ++ S + Y +V P E + +K SL E I +S+Y Sbjct: 217 ISIEKRFIYTKANMEGVSLKPYAVVKPDEFAYVTVTSRNGEKISLARNNSDETYICSSSY 276 Query: 321 MAVKPHGID---STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQF 376 + K + YL+ L + + R++ +E++K + + +P I+ Q Sbjct: 277 IVFKVDDTNTLLPAYLSMLFERSEFNRYSRFNSWGSARETFDWEEMKNVLIPIPNIEIQQ 336 Query: 377 DITNVINVETARIDVLVEKIEQSIV 401 DI N+ R D + EK++ I Sbjct: 337 DIVNIFEAYNTRRD-INEKLKAQIK 360 >gi|326572828|gb|EGE22813.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis CO72] Length = 221 Score = 73.7 bits (179), Expect = 6e-11, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 57/184 (30%), Gaps = 11/184 (5%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIV 292 + T I L + E + + + ++ Sbjct: 37 KISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKITEPGVKNSSAKWIPANCVI 96 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + V + Y+ + + + + ++G+G Sbjct: 97 IAMYGATVGRVGINKIPMTTNQACAN--IEVNEEIAEYRYVYYCLANQY--EYIKSLGTG 152 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + ++ + VK+L + +PP+ Q I +++ + E + + I L ++ R Sbjct: 153 SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQKQYEYYRE 212 Query: 409 SFIA 412 + Sbjct: 213 QLLN 216 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 20/192 (10%), Positives = 68/192 (35%), Gaps = 9/192 (4%) Query: 26 KVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + K +++G T + + +I ++ ++V S+ Sbjct: 27 EWRALGEVAKKISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKITEPGVKNSS 86 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G + + I + ++ + + E + + + I Sbjct: 87 AKWIPANCVIIAMYGATVGRVGINKIPMTTNQACANIEVNEEIAEYRYVYYCLANQYEYI 146 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +++ G + ++ + + + + +PIPPL+ Q I + ++ + I+L ++ Sbjct: 147 KSLGTG-SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQK 205 Query: 198 KKQALVSYIVTK 209 + + ++ Sbjct: 206 QYEYYREQLLNF 217 >gi|153869116|ref|ZP_01998801.1| conserved hypothetical protein [Beggiatoa sp. PS] gi|152074332|gb|EDN71197.1| conserved hypothetical protein [Beggiatoa sp. PS] Length = 472 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 44/403 (10%), Positives = 106/403 (26%), Gaps = 31/403 (7%) Query: 35 KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY 94 + K+ Y+ + +V Y + + + I+ + P Sbjct: 54 SFVNIKNLSLNKNFNYLEISNVSLAGLGYTTNEIDYLNIPDRATYVLKNHDIVISTVRPN 113 Query: 95 LRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWK 152 + + ++ F VL+ + + + + + Sbjct: 114 RNAVALIRQGKRLVGTSGFTVLRIDKLSSYYVFAFCKTKYFITHLMRKNTATMYPAVSDN 173 Query: 153 GIGNIPMPIPPLAEQVLIREKIIAET-----VRIDTLITERIRFIELLKEKKQALVSYIV 207 + N + +P Q I + +I E + EL + Q + Sbjct: 174 DVLNSIILVPSATFQAKIESIVKLAYQKLEKSQILYTQAESLLLQELGLDNWQPPILETT 233 Query: 208 TKGLNPDVKMKDSG-IEWVGLVPDHWEVKPFFALVTELNRK---------------NTKL 251 L+ ++ + I+ + + Sbjct: 234 ELKLSQILEDNPTFRIDSEYFQTKYLHNIRLIKSYPNGSITLGEVIKSITGGATPLGANY 293 Query: 252 IESNILSLS----YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 E I L N I + + K ++ + +++F + + Sbjct: 294 FEKGIPFLRVQNIKPNYIDDSDLVYISKKDDAKLKRSKLKENDVLFSITGSYGNAAVVTK 353 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLP 366 S + + + A + S G R +L +E +K+ Sbjct: 354 EFAGCNINQHSVKLTLTGKTFSPYFFAVFLNSRVGRLQSDKYIVGITRPALDYESIKKFE 413 Query: 367 VLVPPIKEQFDITNVINVETARIDV---LVEKIEQSIVLLKER 406 + + P K Q I ++I L+E + ++ L E+ Sbjct: 414 IPLVPYKFQRKIEDLIKAAYKNSKAGKMLLELAKHAVELAIEQ 456 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 55/162 (33%), Gaps = 10/162 (6%) Query: 28 VPIKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQSD-TSTVSIF 81 + + K TG + G K I ++ +++++ S++ D S Sbjct: 273 ITLGEVIKSITGGATPLGANYFEKGIPFLRVQNIKPNYIDDSDLVYISKKDDAKLKRSKL 332 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFL----VLQPKDVLPELLQGWLLSIDVTQRI 137 + +L+ G Y A++ C+ L K P +L S + Sbjct: 333 KENDVLFSITGSYGNAAVVTKEFAGCNINQHSVKLTLTGKTFSPYFFAVFLNSRVGRLQS 392 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + G T D++ I +P+ P Q I + I A Sbjct: 393 DKYIVGITRPALDYESIKKFEIPLVPYKFQRKIEDLIKAAYK 434 >gi|5712706|gb|AAD47617.1| HsdS variable domain [Lactococcus lactis] Length = 166 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 54/153 (35%), Gaps = 8/153 (5%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 N+ + +I + G I + + ++V G+I++ + + + Sbjct: 22 GNSSYYKGDIPFIRSGEINSDKTELFLTEAGLKSSSAKMVSVGDILYALYGATSGEVGIS 81 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 G I A +A+KP +++ K+ G + +L VK L Sbjct: 82 QI----NGAINQAILAIKPCDGYNSHFLMQWLKLKKQKIIDQYLQGGQGNLSGSIVKNLV 137 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + VP +EQ I ++D + ++ Sbjct: 138 LKVPNFEEQKKIGAF----FKQLDDTITLHQRK 166 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 57/164 (34%), Gaps = 10/164 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + T +G T +G DI +I ++ S + + + S+ Sbjct: 1 DWEERKLGELTTSFSGGTPSAGNSSYYKGDIPFIRSGEINSDKTELFLTEAGLKS---SS 57 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + + G ILY G + I+ +G + L ++P D L + + I Sbjct: 58 AKMVSVGDILYALYGATSGEVGISQINGAINQAILAIKPCDGYNSHFLMQWLKLKKQKII 117 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + +G + + N+ + +P EQ I I Sbjct: 118 DQYLQG-GQGNLSGSIVKNLVLKVPNFEEQKKIGAFFKQLDDTI 160 >gi|262279999|ref|ZP_06057784.1| type-1 restriction enzyme EcoEI specificity protein [Acinetobacter calcoaceticus RUH2202] gi|262260350|gb|EEY79083.1| type-1 restriction enzyme EcoEI specificity protein [Acinetobacter calcoaceticus RUH2202] Length = 815 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 62/200 (31%), Gaps = 6/200 (3%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVT---ELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 S E ++P W +V+ + E+ L GN + Sbjct: 93 MISEDEKPFIIPPSWSWSRLNWIVSILGDGIHGTPIYEENTGLYFVNGNNLNNGNIIIKH 152 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT--SAYMAVKPHGIDSTY 332 + + ++ R I + + A II SA + Sbjct: 153 ETKTVSQESFNKNKKDLNLRSILVSINGTIGNVAFYNNEPIILGKSACYFNLIEPNSMAF 212 Query: 333 LAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + ++ S + +G ++L + + PV +PP++EQ I ++ D Sbjct: 213 MKIVLNSPYFYQYANKEATGSTIKNLSLASMNKFPVPLPPLEEQKSIVAKVDELMQLCDQ 272 Query: 392 LVEKIEQSIVLLKERRSSFI 411 L ++ S + + I Sbjct: 273 LEKQQSLSSEAHDQLVDTLI 292 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 52/488 (10%), Positives = 123/488 (25%), Gaps = 94/488 (19%) Query: 21 IPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP W + + E + ++ ++ +G + Q Sbjct: 103 IPPSWSWSRLNWIVSILGDGIHGTPIYEENTGLYFVNGNNLNNGNIIIKHETKTVSQESF 162 Query: 76 STVSI-FAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSIDV 133 + IL G A + I + + ++ L S Sbjct: 163 NKNKKDLNLRSILVSINGTIGNVAFYNNEPIILGKSACYFNLIEPNSMAFMKIVLNSPYF 222 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMP----------------IPPLAEQVLIREKIIAE 177 Q G+T+ + + P+P + L +Q+ ++ + +E Sbjct: 223 YQYANKEATGSTIKNLSLASMNKFPVPLPPLEEQKSIVAKVDELMQLCDQLEKQQSLSSE 282 Query: 178 -------------------------TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 I +++ KQ ++ V L Sbjct: 283 AHDQLVDTLIKVLINSSDVDEFQQNWQSISENFDLLFTTEYSVEQLKQTILQLAVMGKLV 342 Query: 213 PDVKMKDSGIEWVGLVPD------------------------------------HWEVKP 236 + E + + + + + Sbjct: 343 KQDTNDEPASELLEKIAEKKAKLVQEGTITKKAKTYSLQIADINLPRNWSLVNINQIIWD 402 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI--VDPGEIVFR 294 A + + + PE + G+I+ Sbjct: 403 LDAGWSPACHPYPAGSNKWGVLKTTFVQCNYFIENENKELPEELSPRVESELKSGDILVT 462 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL---CKVFYAMGS 351 N + ++ S + + + + ++ S + S Sbjct: 463 RAGPYNRVGVACYIDNIRPKLMISDKIIRISYDKVNLFGPFIALSINFGITKIYLKDNQS 522 Query: 352 GL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK-ERR 407 G+ + ++ + ++ P+L+PP+ EQ I +N I+ L + ++ + K Sbjct: 523 GMAESQVNISQDKLRSAPILLPPLSEQKRIVEKVNQLFFMIEQL-QILQGKLQRTKLHLA 581 Query: 408 SSFIAAAV 415 S IA A+ Sbjct: 582 DSLIANAL 589 >gi|257078399|ref|ZP_05572760.1| type IC HsdS subunit [Enterococcus faecalis JH1] gi|256986429|gb|EEU73731.1| type IC HsdS subunit [Enterococcus faecalis JH1] Length = 383 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 67/157 (42%), Gaps = 11/157 (7%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYM 321 + + + + + + + Y ++ E+ + + + K + S + E ++ Y Sbjct: 27 GWLNQKDRFSGNIAGKEQKNYTLLLKNELSYNHGNSKLAKYGAVFSLKTYEEALVPRVYH 86 Query: 322 AVKP-HGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQ----SLKFEDVKRLPVLVPPIKEQ 375 + K D +L ++ + K + SG R ++ ++D + + +P + EQ Sbjct: 87 SFKSTKNSDPDFLEYIFATKKPDKELGKLVSSGARMDGLLNINYDDFSNIKINIPHVHEQ 146 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I+N++ +ID + ++ + LKE + +++ Sbjct: 147 KKISNLL----RKIDDTIALHQRKLDQLKELKKAYLQ 179 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 52/397 (13%), Positives = 137/397 (34%), Gaps = 34/397 (8%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG- 89 K T+ G ++ D+ + + + + GN + ++ K ++ Y Sbjct: 2 KEITERVKG--NDGRMDLPTLTISASQGWLNQKDRFSGNIAGKEQKNYTLLLKNELSYNH 59 Query: 90 ---KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE------AI 140 KL Y + ++ + + + E + Sbjct: 60 GNSKLAKYGAVFSLKTYEEALVPRVYHSFKSTKNSDPDFLEYIFATKKPDKELGKLVSSG 119 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + ++ NI + IP + EQ I + +ID I R ++ LKE K+ Sbjct: 120 ARMDGLLNINYDDFSNIKINIPHVHEQKKISNLL----RKIDDTIALHQRKLDQLKELKK 175 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA--LVTELNRKNTKLIESNILS 258 A + + K +++ + E D W++ + + + + +S + Sbjct: 176 AYLQLMFPKKDETVPQVRFADFE------DDWQLCKLGDVVEIFDGTHQTPRYTDSGVKF 229 Query: 259 LSYGNIIQKLETRNMGLK-PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 +S NI + + + E + + G+I+ I D +++ + E Sbjct: 230 VSVENIATLETKKYITHEAYEKEYSKKRAKKGDILMTRIG---DIGTMKVIETDEPLAYY 286 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQ 375 +K + +L++++ S ++ + + + + ++ ++ + + +EQ Sbjct: 287 VTLALLKAKETNPYFLSFIISSPEIQRNIWKRTLHIAFPKKINLGEINQVEMKITIFEEQ 346 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I + +D + + + LK + S++ Sbjct: 347 DKIGD----LFTNLDDAIILNQNKLNQLKSLKKSYLQ 379 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 58/185 (31%), Gaps = 7/185 (3%) Query: 24 HWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W++ + ++ G + + ++ +E++ + K + + Sbjct: 200 DWQLCKLGDVVEIFDGTHQTPRYTDSGVKFVSVENIATLETK--KYITHEAYEKEYSKKR 257 Query: 81 FAKGQILYGKLGPYL-RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 KG IL ++G K I D +L+ K+ P L + S ++ + I Sbjct: 258 AKKGDILMTRIGDIGTMKVIETDEPLAYYVTLALLKAKETNPYFLSFIISSPEIQRNIWK 317 Query: 140 ICEGATMS-HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + I + M I EQ I + I + + L K Sbjct: 318 RTLHIAFPKKINLGEINQVEMKITIFEEQDKIGDLFTNLDDAIILNQNKLNQLKSLKKSY 377 Query: 199 KQALV 203 Q + Sbjct: 378 LQNMF 382 >gi|213023061|ref|ZP_03337508.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] Length = 121 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 14/76 (18%), Positives = 32/76 (42%) Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + L + + P+ VPP++EQ +I + A D + +++ ++ Sbjct: 3 ISLSQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVNNALNR 62 Query: 403 LKERRSSFIAAAVTGQ 418 + S +A A G+ Sbjct: 63 VNSLTQSILAKAFRGE 78 >gi|326559471|gb|EGE09894.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis 7169] Length = 221 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 57/184 (30%), Gaps = 11/184 (5%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIV 292 + T I L + E + + + ++ Sbjct: 37 KISSSGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKITEPGVKNSSAKWIPANCVI 96 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + V + Y+ + + + + ++G+G Sbjct: 97 IAMYGATVGRVGINKIPMTTNQACAN--IEVNEEIAEYRYVYYCLANQY--EYIKSLGTG 152 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + ++ + VK+L + +PP+ Q I +++ + E + + I L ++ R Sbjct: 153 SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQKQYEYYRE 212 Query: 409 SFIA 412 + Sbjct: 213 QLLN 216 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 19/192 (9%), Positives = 67/192 (34%), Gaps = 9/192 (4%) Query: 26 KVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + K +++ T + + +I ++ ++V S+ Sbjct: 27 EWRALGEVAKKISSSGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKITEPGVKNSS 86 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G + + I + ++ + + E + + + I Sbjct: 87 AKWIPANCVIIAMYGATVGRVGINKIPMTTNQACANIEVNEEIAEYRYVYYCLANQYEYI 146 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +++ G + ++ + + + + +PIPPL+ Q I + ++ + I+L ++ Sbjct: 147 KSLGTG-SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQK 205 Query: 198 KKQALVSYIVTK 209 + + ++ Sbjct: 206 QYEYYREQLLNF 217 >gi|238651120|ref|YP_002916978.1| putative type I restriction enzyme S subunit [Rickettsia peacockii str. Rustic] gi|238625218|gb|ACR47924.1| putative type I restriction enzyme S subunit [Rickettsia peacockii str. Rustic] Length = 311 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 41/331 (12%), Positives = 89/331 (26%), Gaps = 30/331 (9%) Query: 88 YGKLGPYLRKAIIADFDG------ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI- 140 K G K D + + + + + L L S + +I Sbjct: 1 MCKDGALTGKVCFVDDKILPQIGVMVNEHVYIFRGNIIHQSYLFYCLNSDIIQNQINKNL 60 Query: 141 -CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + I +I +P+ L +Q I E++ + I+ + + K Sbjct: 61 AYNKGAQPGLNREHINSIYIPLLSLEKQQKIIEELNSYQKIIEGAKQIIDNWHPYFEINK 120 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 Q + +N S +H E I+ +I + Sbjct: 121 QWEIVKFGDIVINKLKSNILS--------LEHKEYTTLIVGKKGKMININTAIKGDIPVI 172 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + G I Sbjct: 173 ASGLGFSPYSHNQYNFNGN--------------IITISSSGAYAGYIWYHNSPMWTSDCN 218 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + T + + ++ + + +D++ L + +PP++EQ + Sbjct: 219 VIYSINEKLLLTKYLYYILKSQQNIIYQMQAGSGQPHVYLKDLEDLQIPIPPLEEQQKMV 278 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSF 410 ++ ++ID L I+Q LK +S Sbjct: 279 TELDNNQSKIDNLKNYIKQFENKLKTTLNSL 309 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 56/195 (28%), Gaps = 10/195 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR-------- 71 I K W++V S + Y L + G + Sbjct: 117 EINKQWEIVKFGDIVINKLKSNILSLEHKEYTTLIVGKKGKMININTAIKGDIPVIASGL 176 Query: 72 --QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + F I G Y + S ++ + L + + Sbjct: 177 GFSPYSHNQYNFNGNIITISSSGAYAGYIWYHNSPMWTSDCNVIYSINEKLLLTKYLYYI 236 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I + G+ H K + ++ +PIPPL EQ + ++ +ID L Sbjct: 237 LKSQQNIIYQMQAGSGQPHVYLKDLEDLQIPIPPLEEQQKMVTELDNNQSKIDNLKNYIK 296 Query: 190 RFIELLKEKKQALVS 204 +F LK +L Sbjct: 297 QFENKLKTTLNSLWQ 311 >gi|212691987|ref|ZP_03300115.1| hypothetical protein BACDOR_01482 [Bacteroides dorei DSM 17855] gi|212665379|gb|EEB25951.1| hypothetical protein BACDOR_01482 [Bacteroides dorei DSM 17855] Length = 394 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 65/406 (16%), Positives = 142/406 (34%), Gaps = 43/406 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDV-ESGTGKYLPKDGNSR-QSDTSTV 78 W+ P+ F G ++ G +I + D+ + Y + Q Sbjct: 9 EWEKYPLTDFMSFKNGMNPDAKRFGSGTKFISVMDILNNQYICYDNIRASVELQEGDLDT 68 Query: 79 SIFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGW-LLSID 132 G I++ + L + I + + K L + L S Sbjct: 69 YGVNYGDIVFQRSSETLEDVGQANVYLDCKPAIFGGFVIRGKSKGNYNPLFLRYLLASPT 128 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +RI GA + G+ + + +P + EQ + + + ID I + + I Sbjct: 129 ARKRIIVKGAGAQHFNISQDGLSKVVIDVPNIDEQEKVGKLLQC----IDERIATQNKII 184 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + L+ ++ + KGLN + W++ L+TE KN+ Sbjct: 185 DKLQSLISGIIQNAIQKGLND----------------NTWKMIYLSKLLTERKEKNSNGY 228 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS-AQVM 311 E N +S+S G +I ++E Y +V G+IV+ N + + Sbjct: 229 EVNSVSVSEG-VINQIEYLGRSFAASDTSKYNVVRYGDIVYTKSPTGNFPYGIIKQSFQK 287 Query: 312 ERGIITSAYMAVKPHGIDSTYL--AWLMRSYDLCKVFY-AMGSGLRQSLKFED--VKRLP 366 ++ Y +P+ ++ + + S + G + ++ + Sbjct: 288 HPVAVSPLYGVYEPYSNEAGCFLHYYFLSSIVTTNYLSPLIQKGAKNTINISNQTFLNNM 347 Query: 367 VLVPPIKEQFD-ITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 V P + I ++ +++ +E++ S+VLL+++++ + Sbjct: 348 VPYPKEEVGIRPIAALLRNVQIKLN--IERL--SLVLLEQQKTYLL 389 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 65/191 (34%), Gaps = 7/191 (3%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 P + + + R + +++ + I R E V Sbjct: 12 KYPLTDFMSFKNGMNPDAKRFGSGTKFISVMDILNNQYICYDNIRASVELQEGDLDTYGV 71 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCK 344 + G+IVF+ + + + + I ++ + +L +L+ S K Sbjct: 72 NYGDIVFQRSSETLEDVGQANVYLDCKPAIFGGFVIRGKSKGNYNPLFLRYLLASPTARK 131 Query: 345 VFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 G+G + ++ + + ++ + VP I EQ + ++ ID + + I L Sbjct: 132 RIIVKGAGAQHFNISQDGLSKVVIDVPNIDEQEKVGKLLQC----IDERIATQNKIIDKL 187 Query: 404 KERRSSFIAAA 414 + S I A Sbjct: 188 QSLISGIIQNA 198 >gi|238852889|ref|ZP_04643292.1| putative restriction-modification enzyme [Lactobacillus gasseri 202-4] gi|238834481|gb|EEQ26715.1| putative restriction-modification enzyme [Lactobacillus gasseri 202-4] Length = 907 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 39/374 (10%), Positives = 100/374 (26%), Gaps = 15/374 (4%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG----QI 86 T +T K I + S K+ + I Sbjct: 542 GNITDYGNEKTIPLHKIAILKNGTSITSSKIKHGNI-PVIAGGREPAYYHNEENRSEPTI 600 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 + G Y D S F + + L + L ++I + G+ Sbjct: 601 TVSQSGAYAGFVSYHDKPIFASDCFTITAKPNSGYSTLDLYYLLKKKQKQIYSFATGSIQ 660 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 H K + + +P Q + ++ + + + L E +Q+L S I Sbjct: 661 KHVYAKDMEDFKIPDKGQELQ-----VVNNLIASFESEVQRQRQLENELTELQQSLFSDI 715 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 N + + + K ++ ++ Sbjct: 716 DKVYKNSQKVDQSISMLEDNELVKVMGGKRIPKEYDRAPFPTCHYYPGVKDFENFTINLK 775 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + ++ ++ + + +A+ Sbjct: 776 TSDCIDDVVF--EKIKRYVLKENDVFVSAAGTIGKVGMAPKVKGGTISLTENAHRIRVID 833 Query: 327 GI--DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 +L ++++S + ++ + L E +K + + + I EQ ++ + Sbjct: 834 QTKLIPRFLMYILKSQSIQDAMNSLVTKTGTPKLSIESLKNIEIPILKITEQQELIKKWD 893 Query: 384 VETARIDVLVEKIE 397 +I+ + +I Sbjct: 894 QLNTKINDIYSQIN 907 >gi|326572177|gb|EGE22173.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis BC7] Length = 221 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 57/184 (30%), Gaps = 11/184 (5%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIV 292 + T I L + E + + + ++ Sbjct: 37 KISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKIIEPGVKNSSARWIPANCVI 96 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + V + Y+ + + + + ++G+G Sbjct: 97 IAMYGATVGRVGINKIPMTTNQACAN--IEVNEEIAEYRYVYYCLANQY--EYIKSLGTG 152 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + ++ + VK+L + +PP+ Q I +++ + E + + I L ++ R Sbjct: 153 SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQKQYEYYRE 212 Query: 409 SFIA 412 + Sbjct: 213 QLLN 216 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 20/192 (10%), Positives = 68/192 (35%), Gaps = 9/192 (4%) Query: 26 KVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + K +++G T + + +I ++ ++V S+ Sbjct: 27 EWRALGEVAKKISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKIIEPGVKNSS 86 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G + + I + ++ + + E + + + I Sbjct: 87 ARWIPANCVIIAMYGATVGRVGINKIPMTTNQACANIEVNEEIAEYRYVYYCLANQYEYI 146 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +++ G + ++ + + + + +PIPPL+ Q I + ++ + I+L ++ Sbjct: 147 KSLGTG-SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQK 205 Query: 198 KKQALVSYIVTK 209 + + ++ Sbjct: 206 QYEYYREQLLNF 217 >gi|251791790|ref|YP_003006511.1| Restriction endonuclease S subunits-like protein [Dickeya zeae Ech1591] gi|247540411|gb|ACT09032.1| Restriction endonuclease S subunits-like protein [Dickeya zeae Ech1591] Length = 436 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 46/320 (14%), Positives = 99/320 (30%), Gaps = 30/320 (9%) Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 L + P + +L SI + + + P A Q+ I Sbjct: 85 LWVDPALADTRYVYYYLRSIQIKEAGYSRHFKFLKEV---DIPIPFKDGSPDFAYQIRIV 141 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK--GLNPDVKMKDSGIEWVGLVP 229 + I + +LLK + V G + K G P Sbjct: 142 HLLGKVEELITQRKHHLQQLDDLLKSVFLEMFGDPVRNEKGWDKIPFSKLLADIESGKSP 201 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQIVDP 288 + + E +L L + ET N L + T V Sbjct: 202 KCEARQ-------------AESNEWGVLKLGAVTRCKFDETENKALPNDVIPSTRDEVKA 248 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSA----YMAVKPHGIDSTYLAWLMRSYDLCK 344 G+++F + + ++ ++ + I+ ++ L+ K Sbjct: 249 GDLLFSRKNTYELVAACAYVFSTRPKLLMPDLIFRFIFKQDVDINPIFMWKLLTCDSQRK 308 Query: 345 VFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 ++ +G ++ ++K + + PP+ Q ++ +++ + + +QS+ Sbjct: 309 AIQSLAAGAAGSMPNISKTNLKSVRLPKPPLSLQNQFATIVE----KVESIKSRYQQSLA 364 Query: 402 LLKERRSSFIAAAVTGQIDL 421 L+ SS A G++DL Sbjct: 365 DLEVLYSSLSQRAFKGELDL 384 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 59/207 (28%), Gaps = 21/207 (10%) Query: 23 KHWKVVPIKRF-TKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 K W +P + + +G++ + + + L V Sbjct: 181 KGWDKIPFSKLLADIESGKSPKCEARQAESNEWGVLKLGAVTRCKFDETENKALPNDVIP 240 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGI--------CSTQFLVLQPKDVLPELLQGW 127 ST G +L+ + Y A A +F+ Q D+ P + Sbjct: 241 STRDEVKAGDLLFSRKNTYELVAACAYVFSTRPKLLMPDLIFRFIFKQDVDINPIFMWKL 300 Query: 128 LLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L + I+++ GA M + + ++ +P PPL+ Q + Sbjct: 301 LTCDSQRKAIQSLAAGAAGSMPNISKTNLKSVRLPKPPLSLQNQFATIVEKVESIKSRYQ 360 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLN 212 L +L L+ Sbjct: 361 QSLADLEVLYS----SLSQRAFKGELD 383 >gi|315036578|gb|EFT48510.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0027] Length = 207 Score = 73.3 bits (178), Expect = 8e-11, Method: Composition-based stats. Identities = 23/203 (11%), Positives = 65/203 (32%), Gaps = 11/203 (5%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY---GNIIQKLE 269 P ++ + +W + K E+++ + G+ ++ +E Sbjct: 9 PRLRFRGFSEDWELCKLGQVANYRRGSFPQPYGNKEWYDGENSMPFVQVVDVGDNLRLVE 68 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + + V G++V + ++R ++ +D Sbjct: 69 DTKQKISELAQPKSVFVKEGKVVVTLQGSIGRVAITQYPAYVDRTLL---IFESYKAEMD 125 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 Y A++++ G +++ E + + P I+EQ + ++ Sbjct: 126 EYYFAYVIQQL-FEYEKTRAPGGTIKTVTKEALSDFTISFPSIEEQKK----LGKFFEQL 180 Query: 390 DVLVEKIEQSIVLLKERRSSFIA 412 D + + + L E + S++ Sbjct: 181 DDTITLHQNKLEQLNELKKSYLQ 203 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 60/194 (30%), Gaps = 14/194 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 + W++ + + G + + ++ + DV + Sbjct: 18 EDWELCKLGQVANYRRGSFPQPYGNKEWYDGENSMPFVQVVDVGDNLRLVEDTKQKISEL 77 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 +G+++ G R I + L+ + + + + Sbjct: 78 AQPKSVFVKEGKVVVTLQGSIGR-VAITQYPAYVDRTLLIFESYKAEMDEYYFAYVIQQL 136 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + G T+ + + + + P + EQ +K+ ++D IT +E Sbjct: 137 FEYEKTRAPGGTIKTVTKEALSDFTISFPSIEEQ----KKLGKFFEQLDDTITLHQNKLE 192 Query: 194 LLKEKKQALVSYIV 207 L E K++ + + Sbjct: 193 QLNELKKSYLQNMF 206 >gi|269978338|gb|ACZ55903.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 412 Score = 73.3 bits (178), Expect = 8e-11, Method: Composition-based stats. Identities = 46/403 (11%), Positives = 113/403 (28%), Gaps = 28/403 (6%) Query: 22 PKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + + + T + + + ++ Y + N Q+ Sbjct: 13 PKGVEFRKLGEVLEYDQPNKYCVTSKEFDKSYPTPVLTAG--KTFILGYTNEKDNIYQAS 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ +I + + S+ +L K+ + + Sbjct: 71 KSSPAIIFDD--------FTTATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFYM---Q 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + +PIPPL Q I + + A T L TE + Sbjct: 120 TIPYNIGGEHARHWISRYSQ--LEVPIPPLEIQQEIVKILDAFTELNTELNTELNTELNT 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 ++ Y L+ + + + P + L + Sbjct: 178 ELNARKKQYQYYQNMLLDFNDINSNHKDAKIKSYPKRLKTL-LHTLAPKGVEFRKLGEVC 236 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I+ + L+ + ++ I + + ++ Sbjct: 237 EIIRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVNWQNQKF 296 Query: 315 IITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 +V P + YL +++ + + S + S+ ++ ++ + +PP++ Sbjct: 297 WANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITIPIPPLE 356 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q +I +++ + L+ I I K+ R + Sbjct: 357 IQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 399 >gi|325832827|ref|ZP_08165558.1| type I restriction modification DNA specificity domain protein [Eggerthella sp. HGA1] gi|325485825|gb|EGC88287.1| type I restriction modification DNA specificity domain protein [Eggerthella sp. HGA1] Length = 397 Score = 73.3 bits (178), Expect = 8e-11, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 62/175 (35%), Gaps = 10/175 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP+ W+ + + + T + + ++++ ++++ G SR++ Sbjct: 223 EIPEGWEWARLGSLLSVISDGTHKTPEYTNDGVLFLSVQNISKGFFDLSRVKHISRETHK 282 Query: 76 STVSIFAK--GQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 G IL ++G + I+ +F S L + + ++ Sbjct: 283 GLCKRVRPQNGDILLCRIGTLGKPIIVDVDYEFSIFVSLGLLRPINRSLAEWIVNCLDSP 342 Query: 131 IDVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E + G + I + +PIPPL EQ I E+I V I Sbjct: 343 MGFNWIQEVKVGGGTHTFKINLGDIPSFLVPIPPLVEQRRIAERISELDVLITNQ 397 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 62/394 (15%), Positives = 111/394 (28%), Gaps = 79/394 (20%) Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVL-PELLQGWLLS 130 G +LY + PYL I D I ST F + D + L +L+S Sbjct: 10 RARKPVKLGDVLYSTVRPYLHNMCIVDRKFSLPPIASTGFAAMVCLDGISNGYLLNYLMS 69 Query: 131 IDVTQRIEA--ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 D +G + K + +P+PPLAEQ I E++ + Sbjct: 70 PDFDTYANRTDNSKGVAYPAINDKHLYAALVPVPPLAEQRRIAERVSELMPLVGEHGKLE 129 Query: 189 IRFI----ELLKEKKQALVSYIVTKGLNPDVK---------------------------- 216 L + +++++ V L P Sbjct: 130 DEREALDASLPERLRKSVLQMAVEGKLVPQDPSEEPASVLLDRIREERAHLIKEKKIKAP 189 Query: 217 -----------------------MKDSGIEWVGLVPDHWEVKPFFA---LVTELNRKNTK 250 E +P+ WE + ++++ K + Sbjct: 190 KGGESVIYLGSDGRRYEKRGKGEPVCIDDEIPFEIPEGWEWARLGSLLSVISDGTHKTPE 249 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP-----GEIVFRFIDLQNDKRSL 305 +L LS NI + + + G+I+ I + Sbjct: 250 YTNDGVLFLSVQNISKGFFDLSRVKHISRETHKGLCKRVRPQNGDILLCRIGTLGKPIIV 309 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQ-SLKFEDV 362 E I S + + + ++ + S +G G + D+ Sbjct: 310 DV--DYEFSIFVSLGLLRPINRSLAEWIVNCLDSPMGFNWIQEVKVGGGTHTFKINLGDI 367 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 V +PP+ EQ I I + +DVL+ Sbjct: 368 PSFLVPIPPLVEQRRIAERI----SELDVLITNQ 397 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 52/142 (36%), Gaps = 10/142 (7%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCK 344 V G++++ + + + I ++ + A+ GI + YL + S D Sbjct: 15 VKLGDVLYSTVRPYLHNMCIVDRKFSLPPIASTGFAAMVCLDGISNGYLLNYLMSPDFDT 74 Query: 345 VFYA--MGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 G+ ++ + + V VPP+ EQ I ++ + K+E Sbjct: 75 YANRTDNSKGVAYPAINDKHLYAALVPVPPLAEQRRIAERVSELMPLVGEH-GKLEDERE 133 Query: 402 LL-----KERRSSFIAAAVTGQ 418 L + R S + AV G+ Sbjct: 134 ALDASLPERLRKSVLQMAVEGK 155 >gi|317012277|gb|ADU82885.1| type I restriction enzyme specificity subunit [Helicobacter pylori Lithuania75] Length = 390 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 59/395 (14%), Positives = 117/395 (29%), Gaps = 29/395 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ---S 73 W+ +K K+ G T + I +I +D+ + G+Y+ K + Sbjct: 2 SEWQTFCLKDLGKIVGGATPPTNNPKNYGNKIAWITPKDLSTLQGRYIKKGSRNISRLGL 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K IL+ P IA+ + F + P + + L Sbjct: 62 KSCSCVLLPKHAILFSSRAPI-GYVAIAEKRLCTNQGFKSIIPNKKI-YFEFLYYLLKYH 119 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I I G T +G + IPP + +KI +D I + E Sbjct: 120 KDNISNIGGGTTFKEVSGATLGLFQVKIPPTYYEQ---QKIARTLSILDQKIENNHKINE 176 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LL + + L + D K + + ++ Sbjct: 177 LLHKILELLYEQYFVRFDFLDENNKPYQTSGGKMKFSKELNRLIPNDFEVKTLGELIQLK 236 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + ++ + K P ETYQ IV + + Sbjct: 237 VGNKNANHSSNQGKYPFFTCSNNPLKCETYQFEGKHIIVSGNGNFYVTHYDGKFDAYQRT 296 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 ++ P+ + L +L + + + + D++ + +++P +K Sbjct: 297 YVVN-------PNNPNHYVLIYLFVKSYTNYLKLQSRGSIIKFITKSDIENIKIVLPNLK 349 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 NV+ ++E QS L R Sbjct: 350 TYTKWNNVL--------KMIENNMQSTQTLTALRD 376 >gi|291044249|ref|ZP_06569958.1| type I restriction-modification system specificity determinant [Neisseria gonorrhoeae DGI2] gi|291011143|gb|EFE03139.1| type I restriction-modification system specificity determinant [Neisseria gonorrhoeae DGI2] Length = 354 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 40/342 (11%), Positives = 87/342 (25%), Gaps = 16/342 (4%) Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 D I + I+ G + D + + + + Sbjct: 13 DDVPDKDIHREPSIIVKSRGII--EFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKT 70 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 I M N +PIP L Q I + + T TL + Sbjct: 71 QENYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAEL 130 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L K + + + L+ D ++ + + K + + Sbjct: 131 ALRKRQYRYYRDLL----LDFDNQIGGGIADGYQCRLKNVVWKTLGEVAEYSKNRICSDK 186 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + N++Q E + + S +I+ I K Sbjct: 187 LNEHNYVGVDNLLQNREGKKLSGYVPSEGKMTEYIVNDILIGNIRPYLKKIWQADCTGGT 246 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPP 371 G + + V ++ YL ++ G + + + +PP Sbjct: 247 NGDV--LVIRVTDEKVNPKYLYQVLADDKFFAFNMKHAKGAKMPRGSKAAIMQYKIPIPP 304 Query: 372 IKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKER 406 + EQ I ++ + + + +E+ Sbjct: 305 LPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQYEYYREQ 346 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 44/126 (34%), Gaps = 6/126 (4%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 15 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 72 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L +E + L Sbjct: 73 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAELAL 132 Query: 403 LK-ERR 407 K + R Sbjct: 133 RKRQYR 138 >gi|257440745|ref|ZP_05616500.1| type I restriction-modification system specificity subunit [Faecalibacterium prausnitzii A2-165] gi|257196806|gb|EEU95090.1| type I restriction-modification system specificity subunit [Faecalibacterium prausnitzii A2-165] Length = 128 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 24/128 (18%), Positives = 44/128 (34%), Gaps = 2/128 (1%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 + KDS ++WIG IP+ W+VV K + S ++ + + Sbjct: 3 KMKDSAIEWIGEIPEGWEVVKAKYLFAQRNEK-GNSALVLLSPTQKYGVIPQSQLEGVVQ 61 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 +D T G + L + ++++G+CS + VL L Sbjct: 62 VKENTDLRTFKTIHIGDFVIS-LRSFQGGFEFSNYEGVCSPAYQVLHATKDLSNDFFRLS 120 Query: 129 LSIDVTQR 136 I + Sbjct: 121 FQIRWFYQ 128 >gi|294780398|ref|ZP_06745765.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis PC1.1] gi|294452527|gb|EFG20962.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis PC1.1] Length = 371 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 67/157 (42%), Gaps = 11/157 (7%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYM 321 + + + + + + + Y ++ E+ + + + K + S + E ++ Y Sbjct: 27 GWLNQKDRFSGNIAGKEQKNYTLLLKNELSYNHGNSKLAKYGAVFSLKTYEEALVPRVYH 86 Query: 322 AVKP-HGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQ----SLKFEDVKRLPVLVPPIKEQ 375 + K D +L ++ + K + SG R ++ ++D + + +P + EQ Sbjct: 87 SFKSTKNSDPDFLEYIFATKKPDKELGKLVSSGARMDGLLNINYDDFSNIKINIPHVHEQ 146 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I+N++ +ID + ++ + LKE + +++ Sbjct: 147 KKISNLL----RKIDDTIALHQRKLDQLKELKKAYLQ 179 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 48/392 (12%), Positives = 125/392 (31%), Gaps = 36/392 (9%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG- 89 K T+ G ++ D+ + + + + GN + ++ K ++ Y Sbjct: 2 KEITERVKG--NDGRMDLPTLTISASQGWLNQKDRFSGNIAGKEQKNYTLLLKNELSYNH 59 Query: 90 ---KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE------AI 140 KL Y + ++ + + + E + Sbjct: 60 GNSKLAKYGAVFSLKTYEEALVPRVYHSFKSTKNSDPDFLEYIFATKKPDKELGKLVSSG 119 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + ++ NI + IP + EQ I + +ID I R ++ LKE K+ Sbjct: 120 ARMDGLLNINYDDFSNIKINIPHVHEQKKISNLL----RKIDDTIALHQRKLDQLKELKK 175 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A + + K +++ + E + ++ + + K+ S+ Sbjct: 176 AYLQLMFPKKDETVPQVRFANFEENWEL------CKLENIIEKQIKGKAKVENLCNGSVE 229 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 Y + R G KP + V +I+ + + K +G++ S Sbjct: 230 YLDA-----NRLNGGKPIYTKALPDVSERDIIILWDGSKAGKVYY-----GFKGVLGSTL 279 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 A + ++ + + ++ + + P+ + +EQ + + Sbjct: 280 KAYQLKECANSQFIYQQLLDNQNNIYNNYRTPNIPHVVKNFSSIFPIWMTSFEEQSQMAD 339 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++ + +D + + + + S++ Sbjct: 340 IL----SNLDNRIILQQNLTDTMISLKKSYLQ 367 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 59/184 (32%), Gaps = 15/184 (8%) Query: 23 KHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI- 80 ++W++ ++ + G+ +E++ +G+ +YL + + T ++ Sbjct: 199 ENWELCKLENIIEKQIKGKAK----------VENLCNGSVEYLDANRLNGGKPIYTKALP 248 Query: 81 -FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++ I+ G K F G+ + Q K+ + +D I Sbjct: 249 DVSERDIIILWDGSKAGKVYY-GFKGVLGSTLKAYQLKECANS-QFIYQQLLDNQNNIYN 306 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + H P+ + EQ + + + RI I L K Sbjct: 307 NYRTPNIPHVVKNFSSIFPIWMTSFEEQSQMADILSNLDNRIILQQNLTDTMISLKKSYL 366 Query: 200 QALV 203 Q + Sbjct: 367 QNMF 370 >gi|262377419|ref|ZP_06070642.1| predicted protein [Acinetobacter lwoffii SH145] gi|262307649|gb|EEY88789.1| predicted protein [Acinetobacter lwoffii SH145] Length = 457 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 55/451 (12%), Positives = 136/451 (30%), Gaps = 66/451 (14%) Query: 30 IKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT----STV 78 + K+ G+ D YI + D+ +YLP++G D + Sbjct: 6 LGDIVKIKGGKRLPKSSQLQVIKNDHPYIRVRDMGE---RYLPRNGLEYVPDNVFPSISR 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVLPELLQGWLLSIDVTQ 135 I ++ +G +I+ ++ S + + + L +LLS + Sbjct: 63 YIVNTNDLILSIVGTVGLVSIVDEYFNNASLTENCVKLTGLDEKDAKYLYYYLLSQYGKE 122 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 I+A GA I I + + I + +I + Sbjct: 123 EIKARTVGAVQPKLPLYNIEKIQIRWFDKLIREKIVTCLSTLDDKIQLNNQTNQTLESIA 182 Query: 196 KEKKQALVSY---------IVTKGLNPDV------------KMKDSGIE----------- 223 + ++ G +P++ +K E Sbjct: 183 QAIFKSWFIDFEPVRAKIAAKQNGEDPEIAAMCVISGKSEEDLKKMAEEDFAELQATAAL 242 Query: 224 --------WVGLVPDHWEVKPFF---ALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 +G VP W L + K + + LS +T Sbjct: 243 FPDELVESELGEVPRGWFKTDLSILADLNVQSWTKKNCPEKVTYVDLSNTKWGVIQQTEE 302 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + +++ G+ + + N + E ++ + + P + Sbjct: 303 FIFEKAPSRARRVLKIGDTIVGTVRPANGSYA---FIQRENLTGSTGFAVLSPKHKNYAE 359 Query: 333 LAWLMRSYD--LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 +++ + + ++ + G ++ ++ V P ++P I+ + + N+ + Sbjct: 360 FIYIVATDKENIKRLAHLADGGAYPAVSYDTVLNTPCILP-IENKDGVLNLFHKNVKEFY 418 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +L + +L R + + ++G++D+ Sbjct: 419 LLSASKFEENNILASIRDTLLPKLLSGELDV 449 Score = 39.8 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 52/149 (34%), Gaps = 5/149 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G +P+ W + LN ++ + + Y+ L + + G + + + Sbjct: 252 LGEVPRGWFKTDLSILADLNVQSWTKKNCPEKVTYVDLSNTKWGVIQQTEEFIFEKAPS- 310 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGIC-STQFLVLQPKDVLP-ELLQGWLLSIDV 133 + G + G + P + + ST F VL PK E + + Sbjct: 311 RARRVLKIGDTIVGTVRPANGSYAFIQRENLTGSTGFAVLSPKHKNYAEFIYIVATDKEN 370 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP 162 +R+ + +G + + N P +P Sbjct: 371 IKRLAHLADGGAYPAVSYDTVLNTPCILP 399 >gi|253567533|ref|ZP_04844964.1| restriction modification system DNA specificity subunit [Bacteroides sp. 3_2_5] gi|251943647|gb|EES84242.1| restriction modification system DNA specificity subunit [Bacteroides sp. 3_2_5] Length = 233 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 68/211 (32%), Gaps = 14/211 (6%) Query: 10 YKDSG--VQWIG----AIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDV 56 YK SG + W IP+ W + IK +G T +S +I +I ++ Sbjct: 23 YKSSGGKMVWNEKLKREIPEGWDISLIKDIATTYSGGTPKSTNIEYYDNGEIAWINSGEL 82 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 S + S+ ++ IL G K + F+ + + P Sbjct: 83 NSPIITKTTNYITKCGLENSSAKLYPSNSILVAMYGATAGKVSLLTFEACSNQAVCGVIP 142 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + L + + + G+ + I NI +PIP L EKI + Sbjct: 143 -TIENMLYYVYFHISSLYSHFITLSTGSARDNISQDTIKNILLPIPTRNILKLFDEKIGS 201 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIV 207 I + + E L++ V Sbjct: 202 IYQTIVNNYQQIDSLTKQRDELLPLLMNGQV 232 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 63/202 (31%), Gaps = 17/202 (8%) Query: 227 LVPDHWEVKPFFALVTELNRKN------TKLIESNILSLSYGNIIQKLETRNMGLKPE-- 278 +P+ W++ + T + I ++ G + + T+ + Sbjct: 39 EIPEGWDISLIKDIATTYSGGTPKSTNIEYYDNGEIAWINSGELNSPIITKTTNYITKCG 98 Query: 279 -SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + ++ I+ K SL + + A V P + Y + Sbjct: 99 LENSSAKLYPSNSILVAMYGATAGKVSLLTFE----ACSNQAVCGVIPTIENMLYYVYFH 154 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S R ++ + +K + + +P +I + + + I + Sbjct: 155 ISSLYSHFITLSTGSARDNISQDTIKNILLPIPT----RNILKLFDEKIGSIYQTIVNNY 210 Query: 398 QSIVLLKERRSSFIAAAVTGQI 419 Q I L ++R + + GQ+ Sbjct: 211 QQIDSLTKQRDELLPLLMNGQV 232 >gi|218441049|ref|YP_002379378.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7424] gi|218173777|gb|ACK72510.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7424] Length = 238 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 16/160 (10%), Positives = 56/160 (35%), Gaps = 4/160 (2%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + + E ++ ++ D R + I + Sbjct: 78 GYLDLSDVYQIEATEEEINKLKLQFGDLLLTEGGDPDKLGRGSFWKNKISECIHQNHIYR 137 Query: 323 VKPHG--IDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 V+ + +++ + S F A + ++ + +K P++ P ++ Q I Sbjct: 138 VRFNFDEFYPPFISAQIGSPYGKSYFLAHAKQTTGIATINQQVLKNFPLMNPSLEIQKQI 197 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + + ++ L + +++ + + + ++ + A G+ Sbjct: 198 ASTLTEQMQEVERLTQSLQEQLDTINKLPAALLKRAFNGE 237 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 22/202 (10%), Positives = 65/202 (32%), Gaps = 14/202 (6%) Query: 24 HWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +W++ + + G + + + + Y+ + +V+ G + Sbjct: 37 NWEIKKLGDVGNIVAGIPLGNRDSKINTRSVPYLRVANVKDGYLDLSDVYQIEATEEEIN 96 Query: 78 VSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 G +L + G R + + C Q + + + E ++ + + Sbjct: 97 KLKLQFGDLLLTEGGDPDKLGRGSFWKNKISECIHQNHIYRVRFNFDEFYPPFISAQIGS 156 Query: 135 QRIEAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 ++ + ++ + + + N P+ P L Q I + + ++ L Sbjct: 157 PYGKSYFLAHAKQTTGIATINQQVLKNFPLMNPSLEIQKQIASTLTEQMQEVERLTQSLQ 216 Query: 190 RFIELLKEKKQALVSYIVTKGL 211 ++ + + AL+ L Sbjct: 217 EQLDTINKLPAALLKRAFNGEL 238 >gi|306815460|ref|ZP_07449609.1| restriction modification system, type I [Escherichia coli NC101] gi|305851122|gb|EFM51577.1| restriction modification system, type I [Escherichia coli NC101] Length = 443 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 52/415 (12%), Positives = 124/415 (29%), Gaps = 41/415 (9%) Query: 38 TGRTSESGKDI------IYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKGQILYGK 90 G+ D +++ ++V ++ N + G I+ Sbjct: 20 RGKNYPKHNDFMENGYCLFLSAKNVTKSGFQFQETLFINETKDRELRAGKLKYGDIVLTT 79 Query: 91 LGPYLRKAIIADF----DGICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRIEAICEGA 144 G A + ++ ++++ K P+ L L S + ++I + G+ Sbjct: 80 RGTVGNVAYYDNNNPYKHIRINSGMIIIRADNKLWNPKFLYFILKSELLKEQIINLISGS 139 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA--- 201 + + I +P+ + Q I I +++ I ++ + ++ Sbjct: 140 AVPQLPARDIRKFILPVINRSLQNKITNIISDINDKVNLNIEINQTLEKMSQTLFKSWFV 199 Query: 202 ----LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 ++ + G NP + + E V + + KP A + L ++ E+ + Sbjct: 200 DFDPVIDNALDAG-NPIPEALQARAELRQKVRNSTDFKPLPAEIRSLF--PSEFEETELG 256 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN----DKRSLRSAQVMER 313 + G +LE ++ + + ++ V+ + V + Sbjct: 257 WVPGGWETNRLENILELAYGKALKKTERIEGDYPVYGSGGVDGSHNEFLVKGPGIIVGRK 316 Query: 314 GIITSAYMAVKPHGIDSTYLA----------WLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 G + S Y K T + + + L + Sbjct: 317 GTVGSLYWENKDFYPIDTVFYVKPKKYFSLVYCYQLLKTLGLENMNTDAAVPGLNRNNAY 376 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 RL V+ P Q I ++ I L R + + ++G+ Sbjct: 377 RLDVITPT---QTIIAQY-TNIVQTFRYKMDSNNNEIDNLTNLRDTLLPKLISGE 427 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 66/179 (36%), Gaps = 9/179 (5%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFR 294 + N + L LS N+ + L + ++ + G+IV Sbjct: 19 DRGKNYPKHNDFMENGYCLFLSAKNVTKSGFQFQETLFINETKDRELRAGKLKYGDIVLT 78 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + I S + ++ + +L ++++S L + + SG Sbjct: 79 TRGTVGNVAYYDNNNPYKHIRINSGMIIIRADNKLWNPKFLYFILKSELLKEQIINLISG 138 Query: 353 -LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-KERRSS 409 L D+++ + V Q ITN+I+ ++++ +E I Q++ + + S Sbjct: 139 SAVPQLPARDIRKFILPVINRSLQNKITNIISDINDKVNLNIE-INQTLEKMSQTLFKS 196 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 53/187 (28%), Gaps = 16/187 (8%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G +P W+ ++ +L G+ + + I G Y P G+ + Sbjct: 255 LGWVPGGWETNRLENILELAYGKALKKTERI-----------EGDY-PVYGSGGVDGSHN 302 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + I+ G+ G T F V K + + T + Sbjct: 303 EFLVKGPGIIVGRKGTVGSLYWENKDFYPIDTVFYVKPKKY----FSLVYCYQLLKTLGL 358 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E + A + + + + P + ++D+ E L Sbjct: 359 ENMNTDAAVPGLNRNNAYRLDVITPTQTIIAQYTNIVQTFRYKMDSNNNEIDNLTNLRDT 418 Query: 198 KKQALVS 204 L+S Sbjct: 419 LLPKLIS 425 >gi|223984081|ref|ZP_03634235.1| hypothetical protein HOLDEFILI_01527 [Holdemania filiformis DSM 12042] gi|223963956|gb|EEF68314.1| hypothetical protein HOLDEFILI_01527 [Holdemania filiformis DSM 12042] Length = 211 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 64/188 (34%), Gaps = 13/188 (6%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 D WE L T ++ K + + G + + R + + TY+ V Sbjct: 30 DTWEEMIISDLFTPISDKGHSDLTVLTIVQGTGTLPRDSVDRRISYDKSNTNTYKRVVEN 89 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLC-KVFY 347 + + + GI++ AY ++ S + RSY Sbjct: 90 DFILHLRSFEG-----GLEIANSEGIVSPAYTILRASRKISPKFYYAYFRSYWFISNKLR 144 Query: 348 AMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G+R +S+ + + + P + EQ I + V ID+ + E+++ L Sbjct: 145 IAVEGIRDGKSINMDTFWNIKIPYPSLSEQIQIAEYLQV----IDLKLTNAEKTLENLMN 200 Query: 406 RRSSFIAA 413 RS + Sbjct: 201 IRSGLMQQ 208 >gi|298484572|ref|ZP_07002694.1| type I restriction-modification enzyme, S subunit [Bacteroides sp. D22] gi|298269273|gb|EFI10912.1| type I restriction-modification enzyme, S subunit [Bacteroides sp. D22] Length = 184 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 14/170 (8%), Positives = 54/170 (31%), Gaps = 9/170 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN------IIQKLETRNMGLKPESY 280 + +++ + + + + E ++ + N + + K + Sbjct: 2 EEYNRIKIQHICSNICSGGTPKSTIAEYYGGNIPWLNTKEINFCRIYGTEKTITDKGLNN 61 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + + + ++ K ++ + + + + Sbjct: 62 SSAKWIPTDSVIVAMYGATAGKTAIAKIPLTTNQACCNLTIDSAKADY---RFVYYALCN 118 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 D + G +Q+L + +K + P ++EQ I ++++ ++I+ Sbjct: 119 DYAYLASLANGGAQQNLNAQQIKEFEIPFPSLEEQKRIADILSSLDSKIE 168 >gi|153815635|ref|ZP_01968303.1| hypothetical protein RUMTOR_01871 [Ruminococcus torques ATCC 27756] gi|145847066|gb|EDK23984.1| hypothetical protein RUMTOR_01871 [Ruminococcus torques ATCC 27756] Length = 342 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 40/355 (11%), Positives = 108/355 (30%), Gaps = 28/355 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + + G ++ + L DV G++ + + Sbjct: 3 VKLGDVCE--RGTSN--------LKLSDVSEKNGEFSVFGASGYIGSVDFYQQGYP-YVA 51 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 K G + +A++ L PKD + +++ +E GAT+ Sbjct: 52 VVKDGAGIGRAMLCPGKTSVIGTMQYLLPKDNILPKYLFYVVK---YMNLEKYFTGATIP 108 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 H +K N QV I + + + +I + ++LL + +A + Sbjct: 109 HIYFKDYKNEEFNFDFWERQVEIVSVL----SKCEKVIDLCKQELQLLDKLIKARFVELF 164 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 ++ + ++ + +G + L K + ++ N Sbjct: 165 GDPVSNSYGLPEATLPDLGEFGRGVSKHRPRNDIKLLGGKYPLIQTGDV-----ANAGLY 219 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + + + ++ D G + ++A + + + + Sbjct: 220 ITSYSSTYSELGLKQSKMWDKGTLCI-----TIAANIAKTAILEFDACFPDSVVGFIANE 274 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + S+ + ++++ + + L V+VP ++Q + + Sbjct: 275 RTNNIFVHYWFSFFQAILESQAPESAQKNINLKILSELKVIVPEKRKQDQFASFV 329 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 16/111 (14%), Positives = 41/111 (36%), Gaps = 7/111 (6%) Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + +I + + I YL ++++ +L K F + Sbjct: 55 DGAGIGRAMLCPGKTSVIGTMQYLLPKDNILPKYLFYVVKYMNLEKYF---TGATIPHIY 111 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 F+D K + Q +I +V+ ++ + +++ +Q + LL + + Sbjct: 112 FKDYKNEEFNFDFWERQVEIVSVL----SKCEKVIDLCKQELQLLDKLIKA 158 >gi|145630829|ref|ZP_01786607.1| type I restriction/modification specificity protein [Haemophilus influenzae R3021] gi|144983711|gb|EDJ91171.1| type I restriction/modification specificity protein [Haemophilus influenzae R3021] Length = 445 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 54/432 (12%), Positives = 126/432 (29%), Gaps = 68/432 (15%) Query: 47 DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD-G 105 + I I E+ K ++ KG IL +G + + I + + G Sbjct: 19 ETISINSENKYPDYSKISKFVSKDTYNNWFRKGHPKKGDILISTVGANIGRVSIMNENRG 78 Query: 106 ICSTQFL--VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 + + + ++P+ L +L+ + ++ G+ + NI + +P Sbjct: 79 CIAQNLIGLRTDKEKLVPDYLYYFLIKKSTQHTLSSLNIGSAQPSIKVPHLLNILINVPN 138 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS---------YIVTKGLN-- 212 + Q I + + +I+ ++ + ++ ++ GLN Sbjct: 139 IQRQEEIANILSSLDEKIEINTQINQTLEQIAQALFKSWFVDFDPVRAKVQALSDGLNLE 198 Query: 213 ----------------------------------PDVKMKDSGIEWVG-LVPDHWEVKPF 237 +E G VP WE+K Sbjct: 199 QAELAAMQAISGKTPEELTALSQTQPDRYAELAETAKAFPCEMVEVDGVEVPKGWEMKAL 258 Query: 238 FALVTELNRK-----NTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDP 288 L + K N + + + ++ + T N+ L + ++ + + P Sbjct: 259 SDLGQIICGKTPSKSNKEFYGDEVPFIKIPDMHNQAFITQTTDNLSLSGANSQSKKYIPP 318 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 I I + I + + S +L ++ + K Sbjct: 319 KSICVSCIATVGLVSMTSKPSHTNQQINS----IIPNDEQTSEFLYLSLKQPSMTKYLKD 374 Query: 349 MGSGLRQ--SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + SG +L ++ ++ P +I ++ + + V L E Sbjct: 375 LASGGSATLNLNTSTFSKIEIMTPS----KEIIDIFHNKVVYAFEKVLSNSIENKRLAEI 430 Query: 407 RSSFIAAAVTGQ 418 R + + G+ Sbjct: 431 RDLLLPNLLNGE 442 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 57/173 (32%), Gaps = 10/173 (5%) Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 L + +I S + K+ ++ G+I+ + Sbjct: 9 HYLINGYELIETISINSENKYPDYSKISKFVSKDTYNNWFRKGHPKKGDILISTVGANIG 68 Query: 302 KRSLRSAQVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLK 358 + S+ + RG I + + YL + + ++ + S+K Sbjct: 69 RVSIMNEN---RGCIAQNLIGLRTDKEKLVPDYLYYFLIKKSTQHTLSSLNIGSAQPSIK 125 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETAR--IDVLVEKIEQSIVLLKERRSS 409 + + + VP I+ Q +I N+++ + I+ + + + I + S Sbjct: 126 VPHLLNILINVPNIQRQEEIANILSSLDEKIEINTQINQTLEQIA--QALFKS 176 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 14/131 (10%), Positives = 45/131 (34%), Gaps = 7/131 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESG-TGKYLPKDGNSRQ 72 +PK W++ + ++ G+T G ++ +I + D+ + + + Sbjct: 248 EVPKGWEMKALSDLGQIICGKTPSKSNKEFYGDEVPFIKIPDMHNQAFITQTTDNLSLSG 307 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 +++ + I + ++ + ++ + E L L Sbjct: 308 ANSQSKKYIPPKSICVSCIATVGLVSMTSKPSHTNQQINSIIPNDEQTSEFLYLSLKQPS 367 Query: 133 VTQRIEAICEG 143 +T+ ++ + G Sbjct: 368 MTKYLKDLASG 378 >gi|292558144|gb|ADE31145.1| putative HsdS [Streptococcus suis GZ1] Length = 301 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 39/309 (12%), Positives = 94/309 (30%), Gaps = 27/309 (8%) Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + I+ G+ + + + + + +P Q I + ID I Sbjct: 1 MNSIKKEIQKTSSGSIQDNINIDYLTKLKLKVPNKDYQDRIVNLL----STIDKKILINN 56 Query: 190 RFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFAL 240 + E L+ + L Y + PD K SG + V +P+ W VK + Sbjct: 57 QINEELEAMAKTLYDYWFVQFDFPDENGKPYKSSGGKMVYNDQLKREIPEGWGVKQLGEI 116 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE--------TYQIVDPGEIV 292 N N + E+ N+ + + +V I+ Sbjct: 117 CEFRNGINYEKSETGDTLSKIVNVRNISNSSTFVTTHDLDSITLDRRRIESYLVTDRTIL 176 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + R + + I + + ++ Y + + Sbjct: 177 ITRSGIPGATRIVS--DIPVNTIYSGFIIGATVANLNLFYYVFYHLKNIEMLMSNQSAGT 234 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +++ + + +++P + Q +N + ++E + L + R + Sbjct: 235 IMKNISQTTLSEIRIVIPNKEIQKVFSNEVRSLL----DVIENNLKQNQELTQLRDWLLP 290 Query: 413 AAVTGQIDL 421 + GQ+ + Sbjct: 291 MLMNGQVKV 299 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 61/195 (31%), Gaps = 7/195 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDGNSRQSDT 75 IP+ W V + + G E + + + ++ + + D +S D Sbjct: 103 EIPEGWGVKQLGEICEFRNGINYEKSETGDTLSKIVNVRNISNSSTFVTTHDLDSITLDR 162 Query: 76 S--TVSIFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVLPELLQGWLLSID 132 + IL + G I++D + F++ L + + Sbjct: 163 RRIESYLVTDRTILITRSGIPGATRIVSDIPVNTIYSGFIIGATVANLNLFYYVFYHLKN 222 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G M + + I + IP Q + ++ + I+ + + Sbjct: 223 IEMLMSNQSAGTIMKNISQTTLSEIRIVIPNKEIQKVFSNEVRSLLDVIENNLKQNQELT 282 Query: 193 ELLKEKKQALVSYIV 207 +L L++ V Sbjct: 283 QLRDWLLPMLMNGQV 297 >gi|329963223|ref|ZP_08300960.1| type I restriction modification DNA specificity domain protein [Bacteroides fluxus YIT 12057] gi|328528919|gb|EGF55859.1| type I restriction modification DNA specificity domain protein [Bacteroides fluxus YIT 12057] Length = 389 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 46/415 (11%), Positives = 114/415 (27%), Gaps = 68/415 (16%) Query: 24 HWKVVPIKRFTKLN-------------TGRTSESGKDIIYIGLE---DVESGTGKYLPKD 67 W+ + + G+ +I+ G+ DV + Y+ ++ Sbjct: 15 EWETIKVSELLDFYSTNSLSWEQLDYSNGKIKNLHYGLIHKGVPTMVDVACDSLPYIKEE 74 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV---------LQPKD 118 + ++F +G + + A C Q +V Sbjct: 75 SM-----LKSFTLFKEGDVAFADASEDTNDVAKAIEVVNCDNQQIVSGLHTIHGRDNSNR 129 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + S ++I I +G + + + IP EQ I + +I Sbjct: 130 TVIGYKGYAFASDSFHKQIRRIAQGTKVFSISVRNFDEAYIGIPSKEEQTQIAKLLITID 189 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 RI T +L ++ ++ ++ + ++ I Sbjct: 190 KRIATQNKIIEDLKKL-----KSAITDLLFHSIADAHTIRLGKI------------AHIT 232 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 ++ NT+ E ++ T + + Y Sbjct: 233 NGAGDVQDANTEHQEDWYPFFDRSEELKWFPTYSFDKEAVIY-----------------A 275 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + + + Y + + SL+ Sbjct: 276 GEGQSFYPRYYNGKFALHQRCYAITDFASCIIPKYCYHFMNTLNSYFVRNSVGSTVPSLR 335 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +++ + +PPI +Q I +I+ ++ E ++ I +L+E + ++ Sbjct: 336 MDIFQKVEIRLPPIPKQQHICKIIDAFYTKL----EVEQRGISILQELKQFLLSQ 386 >gi|296114042|ref|YP_003627980.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis RH4] gi|295921736|gb|ADG62087.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis RH4] gi|326566110|gb|EGE16267.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis BC1] Length = 209 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 57/184 (30%), Gaps = 11/184 (5%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIV 292 + T I L + E + + + ++ Sbjct: 25 KISSSGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKITEPGVKNSSAKWIPANCVI 84 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + V + Y+ + + + + ++G+G Sbjct: 85 IAMYGATVGRVGINKIPMTTNQACAN--IEVNEEIAEYRYVYYCLANQY--EYIKSLGTG 140 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + ++ + VK+L + +PP+ Q I +++ + E + + I L ++ R Sbjct: 141 SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQKQYEYYRE 200 Query: 409 SFIA 412 + Sbjct: 201 QLLN 204 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 19/192 (9%), Positives = 67/192 (34%), Gaps = 9/192 (4%) Query: 26 KVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + K +++ T + + +I ++ ++V S+ Sbjct: 15 EWRALGEVAKKISSSGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKITEPGVKNSS 74 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G + + I + ++ + + E + + + I Sbjct: 75 AKWIPANCVIIAMYGATVGRVGINKIPMTTNQACANIEVNEEIAEYRYVYYCLANQYEYI 134 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +++ G + ++ + + + + +PIPPL+ Q I + ++ + I+L ++ Sbjct: 135 KSLGTG-SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQK 193 Query: 198 KKQALVSYIVTK 209 + + ++ Sbjct: 194 QYEYYREQLLNF 205 >gi|326567813|gb|EGE17917.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis 12P80B1] gi|326573728|gb|EGE23686.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis O35E] Length = 221 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 57/184 (30%), Gaps = 11/184 (5%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIV 292 + T I L + E + + + ++ Sbjct: 37 KISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKIIEPGVKNSSAKWIPANCVI 96 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + V + Y+ + + + + ++G+G Sbjct: 97 IAMYGATVGRVGINKIPMTTNQACAN--IEVNEEIAEYRYVYYCLANQY--EYIKSLGTG 152 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + ++ + VK+L + +PP+ Q I +++ + E + + I L ++ R Sbjct: 153 SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQKQYEYYRE 212 Query: 409 SFIA 412 + Sbjct: 213 QLLN 216 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 20/192 (10%), Positives = 68/192 (35%), Gaps = 9/192 (4%) Query: 26 KVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + K +++G T + + +I ++ ++V S+ Sbjct: 27 EWRALGEVAKKISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKIIEPGVKNSS 86 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G + + I + ++ + + E + + + I Sbjct: 87 AKWIPANCVIIAMYGATVGRVGINKIPMTTNQACANIEVNEEIAEYRYVYYCLANQYEYI 146 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +++ G + ++ + + + + +PIPPL+ Q I + ++ + I+L ++ Sbjct: 147 KSLGTG-SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQK 205 Query: 198 KKQALVSYIVTK 209 + + ++ Sbjct: 206 QYEYYREQLLNF 217 >gi|239621711|ref|ZP_04664742.1| HsdS variable domain-containing protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239515586|gb|EEQ55453.1| HsdS variable domain-containing protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 232 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 58/160 (36%), Gaps = 8/160 (5%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 N+ I + I ++ + + + ++VD G +++ + + ++ Sbjct: 76 GNSAYYGGEIPFIRSAEIDCDSTELSLTVAGLNNSSAKLVDKGMVLYAMYGATSGEVAIS 135 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 +G I A +A+ + + + G + +L +K L Sbjct: 136 KI----KGAINQAILAMDASDMAANRFIAYWLRRQKKSITETFLQGGQGNLSGAIIKELG 191 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + P + EQ I + + +D L+ ++ + +++R Sbjct: 192 IPQPSLDEQRQIGSF----FSNLDDLITLHQRKRLSIRQR 227 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 54/180 (30%), Gaps = 10/180 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + +G T +G +I +I +++ + S+ Sbjct: 56 WEQRKLGELALTYSGGTPSAGNSAYYGGEIPFIRSAEID---CDSTELSLTVAGLNNSSA 112 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + KG +LY G + I+ G + L + D+ + L E Sbjct: 113 KLVDKGMVLYAMYGATSGEVAISKIKGAINQAILAMDASDMAANRFIAYWLRRQKKSITE 172 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +G + I + +P P L EQ I I +R+ + Sbjct: 173 TFLQG-GQGNLSGAIIKELGIPQPSLDEQRQIGSFFSNLDDLITLHQRKRLSIRQRSPVW 231 >gi|326561268|gb|EGE11627.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis 46P47B1] Length = 217 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 57/184 (30%), Gaps = 11/184 (5%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIV 292 + T I L + E + + + ++ Sbjct: 33 KISSSGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKITEPGVKNSSAKWIPANCVI 92 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + V + Y+ + + + + ++G+G Sbjct: 93 IAMYGATVGRVGINKIPMTTNQACAN--IEVNEEIAEYRYVYYCLANQY--EYIKSLGTG 148 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + ++ + VK+L + +PP+ Q I +++ + E + + I L ++ R Sbjct: 149 SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQKQYEYYRE 208 Query: 409 SFIA 412 + Sbjct: 209 QLLN 212 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 19/192 (9%), Positives = 67/192 (34%), Gaps = 9/192 (4%) Query: 26 KVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + K +++ T + + +I ++ ++V S+ Sbjct: 23 EWRALGEVAKKISSSGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKITEPGVKNSS 82 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G + + I + ++ + + E + + + I Sbjct: 83 AKWIPANCVIIAMYGATVGRVGINKIPMTTNQACANIEVNEEIAEYRYVYYCLANQYEYI 142 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +++ G + ++ + + + + +PIPPL+ Q I + ++ + I+L ++ Sbjct: 143 KSLGTG-SQTNINAQIVKKLKIPIPPLSIQSQIVAILDTFDTLTQSISEGLPKEIKLRQK 201 Query: 198 KKQALVSYIVTK 209 + + ++ Sbjct: 202 QYEYYREQLLNF 213 >gi|323157625|gb|EFZ43731.1| type I restriction enzyme EcoAI specificity [Escherichia coli EPECa14] Length = 399 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 66/204 (32%), Gaps = 16/204 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI-----ESNILSLSYGNIIQKLETRNMG 274 S E +P+ WE L + + IL ++ + + + Sbjct: 93 SEEEKPFELPEGWEWTRLINLGIWALGSGFPNVVQGSTDKEILMCKVSDMNLEGNEKFIF 152 Query: 275 LKPESYETY-------QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + +I +PG I+F I + R V + I + Sbjct: 153 STKNTISKDLADEYKIKISEPGTIIFPKIGGAI-ATNKRRILVQDTAIDNNCLGIKPCDA 211 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 I + ++ + D+ K ++ + +P+ +P +K Q I + + + Sbjct: 212 ISGEWFYLILNTLDMSKY---QSGTSIPAINQSVIGSIPIALPSLKMQEKIVSYVITLMS 268 Query: 388 RIDVLVEKIEQSIVLLKERRSSFI 411 D L ++ S+ ++ + + Sbjct: 269 LCDQLEQQSLTSLDAHQQLVETLL 292 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 81/216 (37%), Gaps = 19/216 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLE 54 +K K P+ S + +P+ W+ + G + K+I+ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWTRLINLGIWALGSGFPNVVQGSTDKEILMCKVS 140 Query: 55 DVE-SGTGKYLPKDGNSRQSDTS---TVSIFAKGQILYGKLG---PYLRKAIIADFDGIC 107 D+ G K++ N+ D + + I G I++ K+G ++ I+ I Sbjct: 141 DMNLEGNEKFIFSTKNTISKDLADEYKIKISEPGTIIFPKIGGAIATNKRRILVQDTAID 200 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + + + E L ++D + G ++ + IG+IP+ +P L Q Sbjct: 201 NNCLGIKPCDAISGEWFYLILNTLD----MSKYQSGTSIPAINQSVIGSIPIALPSLKMQ 256 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 I +I D L + + ++ ++ + L+ Sbjct: 257 EKIVSYVITLMSLCDQLEQQSLTSLDAHQQLVETLL 292 >gi|148825870|ref|YP_001290623.1| type I restriction/modification specificity protein [Haemophilus influenzae PittEE] gi|229846818|ref|ZP_04466925.1| type I restriction/modification specificity protein [Haemophilus influenzae 7P49H1] gi|148716030|gb|ABQ98240.1| type I restriction/modification specificity protein [Haemophilus influenzae PittEE] gi|229810307|gb|EEP46026.1| type I restriction/modification specificity protein [Haemophilus influenzae 7P49H1] gi|309973015|gb|ADO96216.1| Type I restriction enzyme HindVIIP, S protein [Haemophilus influenzae R2846] Length = 467 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 57/470 (12%), Positives = 138/470 (29%), Gaps = 84/470 (17%) Query: 26 KVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS--DTSTVS 79 + +P F L T T +S K + + +++ G S + + S Sbjct: 5 EFIPASEFCDLVTDGTHDSPKKTEFGVKLVTSKNIVGGKLDLTSAYFISESDAQNINKRS 64 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +L +G A+I + L+ + L +L S I+ Sbjct: 65 QVHINDVLLSMIGTVGEVALIEKEPDFVIKNVGLLKNSDPKKAKWLYYYLKSPITQNLIK 124 Query: 139 AICEGATMSHADWKGIGNIPM-------PIPPLAEQVLIREK---IIAETVRIDTLITER 188 G T + + N+P+ + EQ+ +K + + + I + Sbjct: 125 DRLRGTTQQYIPLGELRNLPILKPNSEEHLQNTIEQLSSLDKKIQLNTQINQTLEQIAQA 184 Query: 189 IRFIELLKEKKQALVSYIVTKGL------------------------------------- 211 + + ++ GL Sbjct: 185 LFKSWFVDFDPVRTKVQALSDGLSLEQAELAAMQTISGKTPEELTALSQTQPEHYAELAE 244 Query: 212 ----NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK--------NTKLIESNILSL 259 P ++ G++ V VP WE L + +++ ++ + + + Sbjct: 245 TAKAFPCEMVEVDGVDGV-EVPKGWECFSLRELSSVVSKGTTPKKSSLSSCDSKETVPFI 303 Query: 260 SYGNIIQKLETRNMGL------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 +I + + + + I+ +I+ + + Sbjct: 304 KVKDISESGQILINQVEQIPEKISSTELKRSILHKNDILISIAGTIGRVAIVPNELENAN 363 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 +++ + + +L + + + G++ ++ E V+ + + +P Sbjct: 364 TNQAISFIRLYNDNLVGIISTFLKSRKNQKDILSKVIQGVQANISLEVVRNIKIFLP--- 420 Query: 374 EQFDITNVINVETARIDVLVEK--IEQSIVLLKER-RSSFIAAAVTGQID 420 N + + L+ K I Q LL E+ R + ++G+ID Sbjct: 421 -----INFDHKAILIFNSLLNKQLINQKENLLTEKSRDLLLPQLLSGEID 465 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 57/191 (29%), Gaps = 12/191 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG---------KDIIYIGLEDVESGTGKYLPKDG-- 68 +PK W+ ++ + + + T+ + + +I ++D+ + + Sbjct: 263 EVPKGWECFSLRELSSVVSKGTTPKKSSLSSCDSKETVPFIKVKDISESGQILINQVEQI 322 Query: 69 -NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 S SI K IL G R AI+ + +T + + L+ Sbjct: 323 PEKISSTELKRSILHKNDILISIAGTIGRVAIVPNELENANTNQAISFIRLYNDNLVGII 382 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + + I + + L + +I ++ LI + Sbjct: 383 STFLKSRKNQKDILSKVIQGVQANISLEVVRNIKIFLPINFDHKAILIFNSLLNKQLINQ 442 Query: 188 RIRFIELLKEK 198 + + Sbjct: 443 KENLLTEKSRD 453 >gi|313668372|ref|YP_004048656.1| Type I restriction-modification system DNA methylase [Neisseria lactamica ST-640] gi|313005834|emb|CBN87289.1| putative Type I restriction-modification system DNA methylase [Neisseria lactamica 020-06] Length = 395 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 58/414 (14%), Positives = 124/414 (29%), Gaps = 43/414 (10%) Query: 27 VVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAK 83 + I ++N ++ ++I+Y+ ++ + + + + Sbjct: 4 QIKIGEIAEINANSLTQKDMFQEIMYLDTGNITRNEIDNIQILNITMDKIPSRAKRKVKD 63 Query: 84 GQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 I+Y + P + + I ST F + D + + L Sbjct: 64 KTIIYSTVRPNQEHYGFLENPSDNFIVSTGFSTIDVYDDNTDEKFIYYLLTQKHITDYLH 123 Query: 141 CEGAT----MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 G + I N+ +P L Q I + +D I + L+ Sbjct: 124 TIGENSVSSYPSINPDDIANLKFTVPDLKTQQSIAAVL----SALDKKIALNKQINARLE 179 Query: 197 EKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELNRK 247 E + L Y + PD K SG E V +P W+ LVT K Sbjct: 180 EMAKTLYDYWFVQFDFPDANSKPYKSSGGEMVFDETLKREIPKGWKPFKLSELVTLSTGK 239 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + + + S++T I+ G F Sbjct: 240 EDANFATEQGIYPFFTC----SEKILKCDVYSFDTQAILLAGNGTFSVKRFTG------- 288 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 R ++P + + + + ++ K + + + D++ + V Sbjct: 289 -----RFNAYQRTYVLEPKSKNLYPIVYFVIIDNVIKFTSGSRGSIIKFITRGDIEHIDV 343 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++P E + V+ + E +E+ L + R + + GQ+ + Sbjct: 344 VLPNDIENMRFSEVLYTYLLQA----ELLEKQNYQLTQLRDFLLPMLMNGQVSV 393 >gi|289810317|ref|ZP_06540946.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. AG3] Length = 111 Score = 72.9 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 14/68 (20%), Positives = 31/68 (45%) Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + L + + P+ VPP++EQ +I + A D + +++ ++ + S Sbjct: 1 GTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVNNALNRVNSLTQSI 60 Query: 411 IAAAVTGQ 418 +A A G+ Sbjct: 61 LAKAFRGE 68 >gi|317480921|ref|ZP_07940002.1| type I restriction modification DNA specificity domain-containing protein [Bacteroides sp. 4_1_36] gi|316903006|gb|EFV24879.1| type I restriction modification DNA specificity domain-containing protein [Bacteroides sp. 4_1_36] Length = 373 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 62/163 (38%), Gaps = 7/163 (4%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 ++ + + I R + V+ G+++F+ + + + +R Sbjct: 47 VMDILNNDFITYDCIRTSVEITPEEQVAFAVEKGDMLFQRSSETLEDVGRANVYMDDRPA 106 Query: 316 ITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPI 372 + ++ + + +L+ S K MG+G + ++ + + ++ + P + Sbjct: 107 VFGGFVIRGKKKAEYNPMFFRYLLASPYARKKVIPMGAGAQHFNIGQDGLSKVKLHFPIL 166 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +EQ I +++ I+ + + I LK+ +S+ Sbjct: 167 QEQQKIADLL----RLINERISTQNKIIEDLKKLKSAISKQVF 205 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/309 (10%), Positives = 79/309 (25%), Gaps = 32/309 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLP--KDGNSRQSDTST 77 + W+ + + G + K I +I + D+ + + + Sbjct: 14 EEWEEHYLAEYLDFKNGLNPSANKFGSGIKFISVMDILNNDFITYDCIRTSVEITPEEQV 73 Query: 78 VSIFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 KG +L+ + L + D + + + K + +LL+ Sbjct: 74 AFAVEKGDMLFQRSSETLEDVGRANVYMDDRPAVFGGFVIRGKKKAEYNPMFFRYLLASP 133 Query: 133 V-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +++ + GA + G+ + + P L EQ I + + RI T Sbjct: 134 YARKKVIPMGAGAQHFNIGQDGLSKVKLHFPILQEQQKIADLLRLINERISTQNKIIEDL 193 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +L + + + E G A T + Sbjct: 194 KKLKSAISKQVFAQ-----------------EPNGWSRLDTLFSKGKAGGTPTSTNKEYY 236 Query: 252 IESN----ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 I ++ + ++ + +V ++ Sbjct: 237 NGEIPFLSINDITKQGKYVRYTENHLSQSGLENSSAWVVPKYSLIMSMYASVGLVTINEI 296 Query: 308 AQVMERGII 316 + + Sbjct: 297 PITTSQAMF 305 >gi|42525884|ref|NP_970982.1| type I restriction-modification system, S subunit, putative [Treponema denticola ATCC 35405] gi|41815934|gb|AAS10863.1| type I restriction-modification system, S subunit, putative [Treponema denticola ATCC 35405] Length = 562 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 51/375 (13%), Positives = 108/375 (28%), Gaps = 41/375 (10%) Query: 52 GLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP--YLRKAIIADFDGICST 109 ++ + K D S S+ IL G G I + Sbjct: 2 SSGELNLKRIYSVDKMITQAGFDNSATSLIPPQCILVGLAGQGKTRGTVGINYLSLCINQ 61 Query: 110 QFLVLQPKDVLPELLQGWLL-SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 + P + + + E + + I N P+ +PPL+EQ Sbjct: 62 SICAILPNTNILSSEYLYQYLNSKYLDLRELSMGNGGRGGLNLQLIKNFPILLPPLSEQR 121 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVS-YIVTKGLNPDVKMKDSGIEWVGL 227 I E + I +L + + + Q L++ G N + K G Sbjct: 122 CIAEVLSDTDTYISSLKKLITKKEAIKQGIMQELLTGKKRLPGFNGEWIEKRLGELLEYE 181 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 P ++ + ++ ++ +G E Y Sbjct: 182 QPQ------------------QYIVVNTKYFTQGIPVLTAGKSFILGYTSERAGVYNNPP 223 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 + F D + + + + ++ + + + LM+ Sbjct: 224 ----IILFDDFTTESKLV---DFKFKVKSSAIKILKNTGICNIRIVFELMQMIKFESKD- 275 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 Q ++ V +PP + EQ I N+++ I+ L ++ + ++ Sbjct: 276 ------HQRFWISIFNKIRVKIPPTLAEQTAIANILSDMDQEIEAL----KKKLKKVESI 325 Query: 407 RSSFIAAAVTGQIDL 421 + + +TG I L Sbjct: 326 KQGMMQKLLTGDIRL 340 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 56/145 (38%), Gaps = 4/145 (2%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 ++ P I+ + ++ + + + + + S YL + Sbjct: 24 DNSATSLIPPQCILVGLAGQGKTRGTVGINYLSLCINQSICAILPNTNILSSEYLYQYLN 83 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 S L +MG+G R L + +K P+L+PP+ EQ I V++ I L + I + Sbjct: 84 SKYLDLRELSMGNGGRGGLNLQLIKNFPILLPPLSEQRCIAEVLSDTDTYISSLKKLITK 143 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRG 423 + + + +TG+ L G Sbjct: 144 K----EAIKQGIMQELLTGKKRLPG 164 >gi|315652287|ref|ZP_07905279.1| type I restriction system specificity protein [Eubacterium saburreum DSM 3986] gi|315485410|gb|EFU75800.1| type I restriction system specificity protein [Eubacterium saburreum DSM 3986] Length = 182 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 73/178 (41%), Gaps = 10/178 (5%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQN 300 K + + ++ YG+I + +G+ + E + V+ G++V Sbjct: 1 MPKTMFKDDGEVGAIHYGHIYTRYNMFIDKPVVGISTKDAEKLKKVNKGDLVIARTSENI 60 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQ-SLK 358 D A + E+ ++ + + H + YL++++ + K M G++ L Sbjct: 61 DDVMKTVAYLGEKTVVAGGHSTIFRHKENPKYLSYVLNGADYAIKQKNKMARGVKVIELS 120 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 D++++ + +P ++ Q I ++++ ++ + + + I ++ R ++ Sbjct: 121 TADMEKIKIPLPSLQVQEYIVSILDKFDTLVNDIKSGLPKEIEERQKQYEYYRERLLS 178 Score = 44.0 bits (102), Expect = 0.051, Method: Composition-based stats. Identities = 16/174 (9%), Positives = 55/174 (31%), Gaps = 6/174 (3%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDGNSRQ-SDTSTVSIFAKGQILYGKLGPYLRKA-- 98 + ++ I + + ++ K D + KG ++ + + Sbjct: 6 FKDDGEVGAIHYGHIYTRYNMFIDKPVVGISTKDAEKLKKVNKGDLVIARTSENIDDVMK 65 Query: 99 ---IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + + + + + K+ L + ++ + G + + Sbjct: 66 TVAYLGEKTVVAGGHSTIFRHKENPKYLSYVLNGADYAIKQKNKMARGVKVIELSTADME 125 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 I +P+P L Q I + ++ + + + IE +++ + +++ Sbjct: 126 KIKIPLPSLQVQEYIVSILDKFDTLVNDIKSGLPKEIEERQKQYEYYRERLLSF 179 >gi|317488603|ref|ZP_07947147.1| type I restriction modification DNA specificity domain-containing protein [Eggerthella sp. 1_3_56FAA] gi|316912297|gb|EFV33862.1| type I restriction modification DNA specificity domain-containing protein [Eggerthella sp. 1_3_56FAA] Length = 445 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 62/175 (35%), Gaps = 10/175 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP+ W+ + + + T + + ++++ ++++ G SR++ Sbjct: 271 EIPEGWEWARLGSLLSVISDGTHKTPEYTNDGVLFLSVQNISKGFFDLSRVKHISRETHK 330 Query: 76 STVSIFAK--GQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 G IL ++G + I+ +F S L + + ++ Sbjct: 331 GLCKRVRPQNGDILLCRIGTLGKPIIVDVDYEFSIFVSLGLLRPINRSLAEWIVNCLDSP 390 Query: 131 IDVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E + G + I + +PIPPL EQ I E+I V I Sbjct: 391 MGFNWIQEVKVGGGTHTFKINLGDIPSFLVPIPPLVEQRRIAERISELDVLITNQ 445 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 68/234 (29%), Gaps = 17/234 (7%) Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + LI E+ E L S + E +P+ WE Sbjct: 218 LDRIREERAHLIKEKKIKAPKGGESVIYLGSDGRRYEKRGKGEPVCIDDEIPFEIPEGWE 277 Query: 234 VKPFFA---LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP-- 288 + ++++ K + +L LS NI + + + Sbjct: 278 WARLGSLLSVISDGTHKTPEYTNDGVLFLSVQNISKGFFDLSRVKHISRETHKGLCKRVR 337 Query: 289 ---GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 G+I+ I + E I S + + + ++ + S Sbjct: 338 PQNGDILLCRIGTLGKPIIVDV--DYEFSIFVSLGLLRPINRSLAEWIVNCLDSPMGFNW 395 Query: 346 FY--AMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +G G + D+ V +PP+ EQ I I + +DVL+ Sbjct: 396 IQEVKVGGGTHTFKINLGDIPSFLVPIPPLVEQRRIAERI----SELDVLITNQ 445 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 52/151 (34%), Gaps = 11/151 (7%) Query: 279 SYETYQIVDPGEIVFRFIDLQN---DKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 SY +++ G++++ L + + S + P + Y Sbjct: 53 SYAEERLLVDGDLLWNSTGLGTLGRMAVYDSNQNPYGWAVADSHVTVIRTVPDWLRYEYA 112 Query: 334 AWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET---AR 388 + V SG ++ L E VKR + VPP+ EQ I ++ Sbjct: 113 FLYFAGPSVQSVIEDQASGSTKQKELAQETVKRYLIPVPPLAEQRRIAERVSELMPLVGE 172 Query: 389 IDVLVEKIEQSIVLL-KERRSSFIAAAVTGQ 418 L ++ E L + R S + AV G+ Sbjct: 173 YGKLEDEREALDASLPERLRKSVLQMAVEGK 203 >gi|326568185|gb|EGE18267.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis BC8] Length = 221 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 57/184 (30%), Gaps = 11/184 (5%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIV 292 + T I L + E + + + ++ Sbjct: 37 KISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKIIEPGVKNSSAKWIPANCVI 96 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + V + Y+ + + + + ++G+G Sbjct: 97 IAMYGATVGRVGINKIPMTTNQACAN--IEVNEEIAEYRYVYYCLANQY--EYIKSLGTG 152 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + ++ + VK+L + +PP+ Q I +++ + E + + I L ++ R Sbjct: 153 SQTNINAQIVKKLKIPIPPLSVQSQIVAILDTFDTLTQSISEGLPKEIKLRQKQYEYYRE 212 Query: 409 SFIA 412 + Sbjct: 213 QLLN 216 Score = 69.1 bits (167), Expect = 2e-09, Method: Composition-based stats. Identities = 20/192 (10%), Positives = 68/192 (35%), Gaps = 9/192 (4%) Query: 26 KVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + K +++G T + + +I ++ ++V S+ Sbjct: 27 EWRALGEVAKKISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKIIEPGVKNSS 86 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G + + I + ++ + + E + + + I Sbjct: 87 AKWIPANCVIIAMYGATVGRVGINKIPMTTNQACANIEVNEEIAEYRYVYYCLANQYEYI 146 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +++ G + ++ + + + + +PIPPL+ Q I + ++ + I+L ++ Sbjct: 147 KSLGTG-SQTNINAQIVKKLKIPIPPLSVQSQIVAILDTFDTLTQSISEGLPKEIKLRQK 205 Query: 198 KKQALVSYIVTK 209 + + ++ Sbjct: 206 QYEYYREQLLNF 217 >gi|15828905|ref|NP_326265.1| restriction modification enzyme subunit S2B [Mycoplasma pulmonis UAB CTIP] gi|14089848|emb|CAC13607.1| RESTRICTION MODIFICATION ENZYME SUBUNIT S2B [Mycoplasma pulmonis] Length = 336 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 43/356 (12%), Positives = 104/356 (29%), Gaps = 34/356 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVDENIAKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + + +P L Q I + I + I I ++ Q ++ Sbjct: 116 AYGYLKNYNINDFEVNLPNLKTQSAIIKIIEPLEKQ----INAFDELILSEQKSLQHYLN 171 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 Y K IE P + + K I S Sbjct: 172 YFFG---------KFYQIE-----PSLFHDYKLEKIAKIRRGK-------IINSFDLKEN 210 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + K Y + + I + I ++ + Sbjct: 211 PGDYPVISSNTKNNGIFGYLNSYMYDGEYITISADGAYAGTVFLNNGKFSITNVCFILLL 270 Query: 325 PHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 ++ + +L + ++ + ++ R S++ + + + +P ++ Q I Sbjct: 271 NDKVNLLTKFLFYYLKKNENIIQKKSIVGSSRPSVREYTLSEIAIKIPSLEIQSAI 326 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 35/142 (24%), Gaps = 3/142 (2%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K + + I + ++ ++ Sbjct: 31 YNLYSSKTKDQGIFGKINSYDFNGEYILITTHGAYAGTVKYVNEKFSTTSNCFILKVDEN 90 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 I T + LK ++ V +P +K Q I +I Sbjct: 91 IAKTKFLSYLLLLQEKTFNDMAIGSAYGYLKNYNINDFEVNLPNLKTQSAIIKIIEPLEK 150 Query: 388 RI---DVLVEKIEQSIVLLKER 406 +I D L+ ++S+ Sbjct: 151 QINAFDELILSEQKSLQHYLNY 172 >gi|298482623|ref|ZP_07000807.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. D22] gi|298271086|gb|EFI12663.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. D22] Length = 324 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 41/265 (15%), Positives = 84/265 (31%), Gaps = 8/265 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W+ VP+ LN + +I + V G + + Sbjct: 18 EVPEGWQSVPVSELFCLNPKSEITDATSVGFIPMACVNDGFSGNHQFEERIWKEVKKGYC 77 Query: 80 IFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 F G I K+ P + + G +T+ ++L+P ++ + S Sbjct: 78 HFQNGDIGIAKISPCFENLKSTIFQNLPNNYGAGTTELVILRPLNIHAKFYLYLFKSQWY 137 Query: 134 TQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +G ++ +P+PPLAEQ I +I ID + + Sbjct: 138 ISEGTKYFKGVVGQQRVHKGIFTDLQIPLPPLAEQYRIVAEIEKWFALIDQIEQGKTGLQ 197 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 ++ + K ++ + L P + E + + + + Sbjct: 198 TIVMQTKSKILDLAIHGKLVPQDPNDEPAFELLKRINPDFTPCDNGHYTQLPDG-WAVAP 256 Query: 253 ESNILSLSYGNIIQKLETRNMGLKP 277 + SL G +E N+ +K Sbjct: 257 MQMLCSLIDGEKQNGIERINLDVKY 281 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 57/174 (32%), Gaps = 6/174 (3%) Query: 227 LVPDHWEVKPFFALVTELNRK--NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 VP+ W+ P L + I + + E + Y Sbjct: 18 EVPEGWQSVPVSELFCLNPKSEITDATSVGFIPMACVNDGFSGNHQFEERIWKEVKKGYC 77 Query: 285 IVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 G+I I N K ++ G T+ + ++P I + + +L +S Sbjct: 78 HFQNGDIGIAKISPCFENLKSTIFQNLPNNYGAGTTELVILRPLNIHAKFYLYLFKSQWY 137 Query: 343 CKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 G+ +Q + L + +PP+ EQ+ I I A ID + + Sbjct: 138 ISEGTKYFKGVVGQQRVHKGIFTDLQIPLPPLAEQYRIVAEIEKWFALIDQIEQ 191 >gi|13541295|ref|NP_110983.1| restriction endonuclease S subunit fragment [Thermoplasma volcanium GSS1] gi|14324678|dbj|BAB59605.1| hypothetical protein [Thermoplasma volcanium GSS1] Length = 152 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 30/121 (24%), Positives = 53/121 (43%), Gaps = 1/121 (0%) Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + K + +V + A V + YL + ++S ++ +G Sbjct: 1 MIALNGQGKTKGMVGILKVESTCNQSLAAFNVNERTLHYRYLYYFLKS-KYKQMRGLVGD 59 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 LR L ++ L + VP ++EQF I+N + + I ++ K E+ I LLKE R+S I Sbjct: 60 DLRDGLSLSVLRELRIPVPSLQEQFAISNYSDNQIHVIKNMISKQEKMIELLKEHRASLI 119 Query: 412 A 412 Sbjct: 120 T 120 >gi|315148957|gb|EFT92973.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX4244] Length = 328 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 46/268 (17%), Positives = 100/268 (37%), Gaps = 16/268 (5%) Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + G + +EKI + ++D +I R ++ LKE K+A + + Sbjct: 69 INQITTGEFKRMHFTVPIDEDEKEKIGSLFRQLDDIIALHQRKLDQLKELKKAYLQVMFP 128 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 K++ + E WE FF + + + +N +L S+ LS + + Sbjct: 129 VKDERVPKLRLADFEG------EWEQCKFFDMWEKSSDRNKELKYSSKDVLSVAKMTKNP 182 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-G 327 RN E +TY I+ G+I F ++ ++ GI++ ++ KP Sbjct: 183 VERNS--SDEYMKTYNILHYGDIAFEGNKSKDYSFGRFVLNNLQDGIVSHVFIVFKPKVK 240 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSL---KFEDVKRLPVLVPPIKEQFDITNVINV 384 +D ++ + + K + + +D+ + + +P + EQ I Sbjct: 241 MDIDFMKVYINNEYFMKHHLVKATTKTLMMTTLNVQDMNKQKLRIPSLNEQERIGKF--- 297 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIA 412 +D + + + L + S++ Sbjct: 298 -FKELDHAITLHQNKLTQLNSLKKSYLQ 324 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 38/124 (30%), Gaps = 3/124 (2%) Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G+++ + + E +L L+ + + Sbjct: 4 GDVIVVVRNGSRSLIGKHAPINREMPNTVIGAFMTGLRSPSPKFLKALLDTQQFNVEIHK 63 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + KR+ VP I E I ++D ++ ++ + LKE + Sbjct: 64 NLGATINQITTGEFKRMHFTVP-IDEDEK--EKIGSLFRQLDDIIALHQRKLDQLKELKK 120 Query: 409 SFIA 412 +++ Sbjct: 121 AYLQ 124 >gi|238854453|ref|ZP_04644793.1| type IC HsdS subunit [Lactobacillus jensenii 269-3] gi|238832946|gb|EEQ25243.1| type IC HsdS subunit [Lactobacillus jensenii 269-3] Length = 387 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 44/402 (10%), Positives = 122/402 (30%), Gaps = 34/402 (8%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTG---KYLPKDGNSRQSDTSTVSIFAKGQIL 87 K + +G + + D + + + + + + + KG I Sbjct: 2 KNIGESFSGLSGKKSSDFGHGEAKYITYLNILNNPIIDTKLTDKIEIDNKQHLVKKGDIF 61 Query: 88 YGKLGPYLRKAII---------ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + ++ + + S + + + S + +++ Sbjct: 62 FTISSETPQEVGLSSVLDTNLNECYLNSFSFGYRLKEISMFDNLFNSYNFRSPNFRRKMY 121 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + +G + + K + N + P ++EQ I + I + + +L K+ Sbjct: 122 ILAQGISRYNISKKAVLNETICFPKISEQKQIGKLIKLMNSLLSLQQRKLELENKLKKQI 181 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 L S+ +T K + + ++ + + + K + L+ Sbjct: 182 AFYLYSFTLTP------NFKHIEV-------KNKKLGDIVDISNGIMGDSQKKSGNFKLT 228 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL---QNDKRSLRSAQVMERGI 315 K++ G + + + ++ G+I++ I+ ++ + Sbjct: 229 RIETISNGKIDLSRTGYIDQVSDEKKFLEVGDILYSNINSLTHIGKNAIVKEKHLPLVHG 288 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIK 373 I + + + I YL L+ + + + S+ ++ L + P + Sbjct: 289 INLFRLHITNNQITPNYLHGLLNLPKYKWWVKSHANPAVNQASINKTELSSLVIKYPDLD 348 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q I N IN A+ + L + + + Sbjct: 349 IQNQI-NNINYSFAQYWDI---QYSKKESLCQLKQFLLQNLF 386 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 51/187 (27%), Gaps = 9/187 (4%) Query: 28 VPIKRFTKL---NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + G + + + +E + +G + SD G Sbjct: 202 KKLGDIVDISNGIMGDSQKKSGNFKLTRIETISNGKIDLSRTGYIDQVSDE--KKFLEVG 259 Query: 85 QILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 ILY + + AI+ + + + ++ +L + + + Sbjct: 260 DILYSNINSLTHIGKNAIVKEKHLPLVHGINLFRLHITNNQITPNYLHGLLNLPKYKWWV 319 Query: 142 EGATMSHADWKGIGNIP-MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + I + + I+ +I + E L + KQ Sbjct: 320 KSHANPAVNQASINKTELSSLVIKYPDLDIQNQINNINYSFAQYWDIQYSKKESLCQLKQ 379 Query: 201 ALVSYIV 207 L+ + Sbjct: 380 FLLQNLF 386 >gi|187476894|ref|YP_784918.1| restriction modification system, specificity subunit [Bordetella avium 197N] gi|115421480|emb|CAJ47988.1| restriction modification system, specificity subunit [Bordetella avium 197N] Length = 412 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 65/421 (15%), Positives = 120/421 (28%), Gaps = 50/421 (11%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESGTG 61 +KAY + P W I + E + I ++ G Sbjct: 13 RFKAYSE------------P--WAEEKIGDVLAEKRRPIVLEDDQRYELITVK--RRNEG 56 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKD 118 R + G + K I I S ++LV + Sbjct: 57 VVSRGHLLGRDILVKNYAQLKAGDFVISKRQVVHGATGIVPPALDGAIVSNEYLVAVDSE 116 Query: 119 VLPELLQGWLLS-IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 L + S + ++ G + + IP + I Sbjct: 117 RLRTEFLTIVASLPAMRRKFVLSSYGVDIEKLFFDAADWKKRDIPIPCTKEQTD--ISGY 174 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG------IEWVGLVPDH 231 + +I + ++ KQAL+ + + +++ G IE +G V Sbjct: 175 FQALKHIIEFHQQKHGKIQALKQALLQKMFPRSGAATPELRFKGFSGNWAIERLGQVGRT 234 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 F I +S ++ T N + + + V G++ Sbjct: 235 QSGIGFPDTEQGGKVGTPFFK---ISDMSLAGNENEMLTANNYVNDAQLQRNRWVPIGDV 291 Query: 292 ---VFRFID---LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 VF + + N KR +RS +++ +A + D + L + L K Sbjct: 292 PAVVFAKVGAALMLNRKRMVRSPFLID----NNAMAYIFDSTWDEDFGKALFDTIYLPKY 347 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 G S D++ + V P EQ I +D L+ K + LK Sbjct: 348 AQV---GALPSYNGSDIEGITVHRPKDRLEQKQIGGF----FKLLDTLISKHATQLHKLK 400 Query: 405 E 405 + Sbjct: 401 Q 401 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 48/136 (35%), Gaps = 7/136 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + Y + G+ V + + + + + +AV + + +L + Sbjct: 71 KNYAQLKAGDFVISKRQVVHGATGIVPPALDGAIVSNEYLVAVDSERLRTEFLTIVASLP 130 Query: 341 DLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + + F G+ + D K+ + +P KEQ DI+ + ++E + Sbjct: 131 AMRRKFVLSSYGVDIEKLFFDAADWKKRDIPIPCTKEQTDISGY----FQALKHIIEFHQ 186 Query: 398 QSIVLLKERRSSFIAA 413 Q ++ + + + Sbjct: 187 QKHGKIQALKQALLQK 202 >gi|323700561|ref|ZP_08112473.1| restriction modification system DNA specificity domain [Desulfovibrio sp. ND132] gi|323460493|gb|EGB16358.1| restriction modification system DNA specificity domain [Desulfovibrio desulfuricans ND132] Length = 394 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 45/396 (11%), Positives = 105/396 (26%), Gaps = 32/396 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + ++ + + K Y+ + P +D T+ Sbjct: 22 SGWEEAFLGDLVEIVS--PPKKIKTSRYLR--EGRFPIIDQSPDVQCGWTNDVDTLIDNP 77 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 I++ G + + + + + P + +E+ Sbjct: 78 LPLIVF---GDHTCVLKLINRPFAQGADGIKIFKPKRTPSTEFLYHFLCAHPLEMESYKR 134 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 ++ G AEQ I + +D I + +E+L++ K L Sbjct: 135 HFSILK------GAQIFYPEVEAEQKKIANCL----SSLDEFIANEVSKLEVLRDHKCGL 184 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK---LIESNILSL 259 + + + +++ W LV+ + K+ L Sbjct: 185 MQQLFPQEGQTQPRLRFPEFRNKP----GWSKCKLGDLVSISSGKSPSQYALSSDGRYPF 240 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + +V G ++F + +R V Sbjct: 241 IKVEDLNNCTKYQVNSREYCNDAKGVVSEGALLFPKRGAAIELNKIRITSVGILFDTNLM 300 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + D+T L +L + + + + + V P EQ I Sbjct: 301 AIIPH----DATELEFLFYYLSCVGLSQIADTSTIPQINNKHIIPFIVYKPLRLEQQKIA 356 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + D + E I LK + + Sbjct: 357 DCLTA----TDDSIAAQEAMIDALKTHKRGLMQQLF 388 >gi|225352848|ref|ZP_03743871.1| hypothetical protein BIFPSEUDO_04482 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156319|gb|EEG69888.1| hypothetical protein BIFPSEUDO_04482 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 158 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 65/157 (41%), Gaps = 13/157 (8%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I E+ S Y+IV G++V+ + + GI++ AY+ Sbjct: 7 NGIYPASESDRETNPGASLANYKIVHFGDVVYNSMRMWQGAVDASRYD----GIVSPAYV 62 Query: 322 AVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVP-PIKEQF 376 +P+ + + + A L+R L K + + G Q LKF+D + + +P EQ Sbjct: 63 VARPNSEVYARFFARLLRQPMLLKQYQQVSQGNSKDTQVLKFDDFASIGISMPASENEQR 122 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + R+D L+ ++ + LL+ + S + Sbjct: 123 QIGGFFD----RLDSLITLHQRKLELLRNIKKSMLDK 155 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 52/159 (32%), Gaps = 11/159 (6%) Query: 56 VESGTGKYLPKDGNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL 112 V G Y + + + + I G ++Y + + + +DGI S ++ Sbjct: 3 VSVANGIYPASESDRETNPGASLANYKIVHFGDVVYNSMRMWQGAVDASRYDGIVSPAYV 62 Query: 113 VLQPKDVLPELLQG---WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQV 168 V +P + + + + + +I + +P EQ Sbjct: 63 VARPNSEVYARFFARLLRQPMLLKQYQQVSQGNSKDTQVLKFDDFASIGISMPASENEQR 122 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 I IT R +ELL+ K++++ + Sbjct: 123 QIGGFFDRLDSL----ITLHQRKLELLRNIKKSMLDKMF 157 >gi|111656907|ref|ZP_01407733.1| hypothetical protein SpneT_02001847 [Streptococcus pneumoniae TIGR4] Length = 290 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 67/185 (36%), Gaps = 14/185 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL------- 275 E +P+ WE + + + R + + + + ++ L Sbjct: 106 EVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPE 165 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----YMAVKPHGIDS 330 SY+ +++ G++++ L R ++ G + + V I+ Sbjct: 166 TVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINC 225 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + S + V SG ++ L + +K + +PP+ EQ I + I A Sbjct: 226 HFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAH 285 Query: 389 IDVLV 393 ID L+ Sbjct: 286 IDALI 290 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 110 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 169 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 170 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIY 229 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 230 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 289 Query: 185 I 185 I Sbjct: 290 I 290 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 14/54 (25%), Positives = 23/54 (42%), Gaps = 4/54 (7%) Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 +PP+ EQ I I ++D E + L KE + S + A+ G+ Sbjct: 1 LPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 54 >gi|302553243|ref|ZP_07305585.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM 40736] gi|302470861|gb|EFL33954.1| conserved hypothetical protein [Streptomyces viridochromogenes DSM 40736] Length = 495 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 67/204 (32%), Gaps = 12/204 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLI----ESNILSLSYGNIIQKLETRNMGLKPESYET 282 VP HW V + + ++ E + + I+ + LK S + Sbjct: 213 KVPAHWTVVSLDEITELIEYGSSTKTSESAEVGGVPVLRMGNIKDGKVDPRVLKYISADH 272 Query: 283 ----YQIVDPGEIVFRFIDLQNDKRSLRSA-QVMERGIITSAYMAVKPHG-IDSTYLAWL 336 + G+++F + S + + +D+ ++ + Sbjct: 273 PDAVRYRLQEGDLLFNRTNSFELVGKSAVYRDKFGPMAFASYLIRCRFLPGVDTDWVNLV 332 Query: 337 MRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + S + ++ + + ++ + +P+ +PP EQ I +V+ A L Sbjct: 333 INSSIGRRYVRSVATQQVGQANVNGTKLAAMPIPLPPEGEQRRILDVVETHQAAALRLES 392 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 I Q R + + A G+ Sbjct: 393 GIRQQGAKATRLRRALLTQAFAGR 416 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 72/205 (35%), Gaps = 13/205 (6%) Query: 20 AIPKHWKVVPIKRFTKLN-TGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P HW VV + T+L G ++++ + + + + +++ G S Sbjct: 213 KVPAHWTVVSLDEITELIEYGSSTKTSESAEVGGVPVLRMGNIKDGKVDPRVLKYISADH 272 Query: 74 DTSTVSIFAKGQILYGKLGP---YLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGW 127 + +G +L+ + + A+ D G S V + + Sbjct: 273 PDAVRYRLQEGDLLFNRTNSFELVGKSAVYRDKFGPMAFASYLIRCRFLPGVDTDWVNLV 332 Query: 128 LLSIDVTQRIEA-ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + S + + + + ++ + + +P+P+PP EQ I + + L + Sbjct: 333 INSSIGRRYVRSVATQQVGQANVNGTKLAAMPIPLPPEGEQRRILDVVETHQAAALRLES 392 Query: 187 ERIRFIELLKEKKQALVSYIVTKGL 211 + ++AL++ L Sbjct: 393 GIRQQGAKATRLRRALLTQAFAGRL 417 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 14/113 (12%), Positives = 42/113 (37%), Gaps = 6/113 (5%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHGIDSTYLAWLMRSY 340 +V PG+++F + + ++ ++T + V + ++A+ Sbjct: 32 LVLPGDLLFTRYNGNPEFVGACTSVPDSAPLLTYPDKLIRVRVDRRVVLPEFVAYAFSWE 91 Query: 341 DLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + ++K++ + VP + EQ I + + ++I+ Sbjct: 92 GTRARVREYVKTTAGQAGISGGELKKIELPVPSLAEQRRIVAALEEQISKIES 144 >gi|237750332|ref|ZP_04580812.1| restriction endonuclease S [Helicobacter bilis ATCC 43879] gi|229374226|gb|EEO24617.1| restriction endonuclease S [Helicobacter bilis ATCC 43879] Length = 233 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 62/168 (36%), Gaps = 4/168 (2%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL--ETRNMGLKPESY 280 E +P+ W + + ++ + + + + K + + K S Sbjct: 59 EAPFEIPNSWAWVKGYDIFLPIDNTEPQGDFFKYIDIDSIDNKNKKVKSPKTIETKNASS 118 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + + G+++F + + +L + + I ++ + + +DS +L +LM S Sbjct: 119 RARRPLKYGDVLFSMVRPYLENIALIDEALAD-CIASTGFFVCGTNILDSRFLYYLMTSP 177 Query: 341 DLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + G S+ +D+ +PP+ EQ I ++ Sbjct: 178 YVVYGLNSFMKGDNSPSIVKDDILNFNYPLPPLCEQEHIVQTLDTLFT 225 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 37/165 (22%), Positives = 58/165 (35%), Gaps = 5/165 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY-LPKDGNSRQSDTSTV 78 IP W V L T G YI ++ +++ K PK ++ + + Sbjct: 63 EIPNSWAWVKGYDIF-LPIDNTEPQGDFFKYIDIDSIDNKNKKVKSPKTIETKNASSRAR 121 Query: 79 SIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G +L+ + PYL + D D I ST F V + L + S V Sbjct: 122 RPLKYGDVLFSMVRPYLENIALIDEALADCIASTGFFVCGTNILDSRFLYYLMTSPYVVY 181 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + +G I N P+PPL EQ I + + Sbjct: 182 GLNSFMKGDNSPSIVKDDILNFNYPLPPLCEQEHIVQTLDTLFTL 226 >gi|3335662|gb|AAC78316.1| restriction-modification enzyme MpuUIII S subunit [Mycoplasma pulmonis] Length = 366 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 44/361 (12%), Positives = 106/361 (29%), Gaps = 34/361 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVNENIVKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + + +P L Q I + I + I I ++ Q ++ Sbjct: 116 AYGYLKNYNINDFEVNLPNLKIQSAIIKIIEPLEKQ----INAFDELILSEQKSLQHYLN 171 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 Y K IE P + + K I S Sbjct: 172 YFFG---------KFYQIE-----PSLFHDYKLEKIAKIRRGK-------IINSFDLKEN 210 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + K Y + + I + I ++ + Sbjct: 211 PGDYPVISSNTKNNGIFGYLNSYMYDGEYITISADGAYAGTVFLNNGKFSITNVCFILLL 270 Query: 325 PHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 ++ + +L + ++ + ++ R S++ + + + +P ++ Q I +I Sbjct: 271 NDKVNLLTKFLFYYLKKNENIIQKKSIVGSSRPSVREYTLSEIAIKIPSLEIQSAILGII 330 Query: 383 N 383 Sbjct: 331 E 331 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 35/142 (24%), Gaps = 3/142 (2%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K + + I + ++ ++ Sbjct: 31 YNLYSSKTKDQGIFGKINSYDFNGEYILITTHGAYAGTVKYVNEKFSTTSNCFILKVNEN 90 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 I T + LK ++ V +P +K Q I +I Sbjct: 91 IVKTKFLSYLLLLQEKTFNDMAIGSAYGYLKNYNINDFEVNLPNLKIQSAIIKIIEPLEK 150 Query: 388 RI---DVLVEKIEQSIVLLKER 406 +I D L+ ++S+ Sbjct: 151 QINAFDELILSEQKSLQHYLNY 172 >gi|327459320|gb|EGF05666.1| type I restriction enzyme, S subunit [Streptococcus sanguinis SK1] Length = 319 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 40/318 (12%), Positives = 99/318 (31%), Gaps = 27/318 (8%) Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 G++ ++ + + + I + +P LA Q + Sbjct: 11 NENHNNGYVSNLLSMMNLAQYQGQSAQPGLSVSTLSKIIIKLPDLATQEQCFNVLN---- 66 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV---KMKDSG------IEWVGLVPD 230 ID I + L++ + L Y + PD K SG E +P+ Sbjct: 67 LIDQKIQINNQINRELEDMAKTLYDYWFVQFDFPDQNGKPYKSSGGKMVYNPELKREIPE 126 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ--------KLETRNMGLKPESYET 282 W V+ + N N + S + N+ + T Sbjct: 127 GWRVEKLGDVAKFKNGINYEKTSSGSEKIKIINVRNISSSTIFVNQTDLDEISLENDKST 186 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 IV+ G I+ + R + ++ + + + +A + + + L + Sbjct: 187 NFIVNEGMILITRSGIPGATRLVS--ELEAKTVYSGFIIASEVNDLIFKNLIFYYLKNVE 244 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + +++ + + + +PP ++I+ + ++ +++ Sbjct: 245 EVLKNQSAGTIMKNISQSVLTDMVISLPPQNVLLKFNSIIDNLLEQ----MKNVQRQNQE 300 Query: 403 LKERRSSFIAAAVTGQID 420 L + R + + GQ+ Sbjct: 301 LTQLRDWLLPMLMNGQVK 318 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 72/195 (36%), Gaps = 7/195 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ--S 73 IP+ W+V + K G +TS + I I + ++ S T D + + Sbjct: 123 EIPEGWRVEKLGDVAKFKNGINYEKTSSGSEKIKIINVRNISSSTIFVNQTDLDEISLEN 182 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDG-ICSTQFLVLQPKDVLPELLQGWLLSID 132 D ST I +G IL + G ++++ + + F++ + L + + Sbjct: 183 DKSTNFIVNEGMILITRSGIPGATRLVSELEAKTVYSGFIIASEVNDLIFKNLIFYYLKN 242 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 V + ++ G M + + ++ + +PP + I ++ + + Sbjct: 243 VEEVLKNQSAGTIMKNISQSVLTDMVISLPPQNVLLKFNSIIDNLLEQMKNVQRQNQELT 302 Query: 193 ELLKEKKQALVSYIV 207 +L L++ V Sbjct: 303 QLRDWLLPMLMNGQV 317 >gi|303267712|ref|ZP_07353532.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae BS457] gi|303270082|ref|ZP_07355794.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae BS458] gi|302640384|gb|EFL70819.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae BS458] gi|302642756|gb|EFL73083.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae BS457] Length = 184 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 65/181 (35%), Gaps = 14/181 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPES 279 +P+ WE + + + R + + + + ++ L S Sbjct: 4 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 63 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDSTYLA 334 Y+ +++ G++++ L R + + A + V I+ ++ Sbjct: 64 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYVWAVADSHVTVIRVLSGVINCHFIY 123 Query: 335 WLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S + V SG ++ L + +K + +PP+ EQ I + I A ID L Sbjct: 124 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 183 Query: 393 V 393 + Sbjct: 184 I 184 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 4 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 63 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 64 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYVWAVADSHVTVIRVLSGVINCHFIY 123 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 124 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 183 Query: 185 I 185 I Sbjct: 184 I 184 >gi|313668696|ref|YP_004048980.1| restriction modification system DNA specificity domain [Neisseria lactamica ST-640] gi|313006158|emb|CBN87620.1| putative restriction modification system DNA specificity domain [Neisseria lactamica 020-06] Length = 219 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 61/169 (36%), Gaps = 12/169 (7%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMG----LKPESYETYQIVDPGEIVFRFIDLQNDK 302 + ES + ++ YG I + + PE E + VD G++V + Sbjct: 37 QKKDFTESGVPAIHYGQIYTYYGNQTDKTLSFVSPELAEKLKKVDKGDVVITNTSENIED 96 Query: 303 RSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKF 359 + E +T + + I + + ++ K G + + Sbjct: 97 VGKALLYLGEEQAVTGGHATIFKPSKEIVGKFFVYFTQTEIFDKAKRKFAKGTKVIDVSA 156 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK-ERR 407 D+ ++ + +PP++ Q I +++ T L +E + L K + R Sbjct: 157 TDMAKIQIPIPPLETQKKIVKILDKFTE----LEATLEAELALRKRQYR 201 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 47/166 (28%), Gaps = 11/166 (6%) Query: 26 KVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSI 80 + P+ L G + + + I + + G K + + + Sbjct: 20 EWKPLGEVGLLVRGNGLQKKDFTESGVPAIHYGQIYTYYGNQTDKTLSFVSPELAEKLKK 79 Query: 81 FAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VT 134 KG ++ + + + + + +P + + + Sbjct: 80 VDKGDVVITNTSENIEDVGKALLYLGEEQAVTGGHATIFKPSKEIVGKFFVYFTQTEIFD 139 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + +G + + I +PIPPL Q I + + T Sbjct: 140 KAKRKFAKGTKVIDVSATDMAKIQIPIPPLETQKKIVKILDKFTEL 185 >gi|305431931|ref|ZP_07401098.1| type I restriction-modification system [Campylobacter coli JV20] gi|304445015|gb|EFM37661.1| type I restriction-modification system [Campylobacter coli JV20] Length = 477 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 57/425 (13%), Positives = 141/425 (33%), Gaps = 55/425 (12%) Query: 29 PIKRFTKLNTGR----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + +N + + ++ ++++ SG + ++ + + FA+ Sbjct: 53 KLSNIADINPSKAEINNFSKDAIVTFLSMQNLGSGFIHH--REQGQIVEFENGYTYFAEN 110 Query: 85 QILYGKLGPYLRKAII------ADFDGICSTQFLVLQPK--DVLPELLQGWLLSIDVTQR 136 IL K+ P + + G ST+F V + + L E + +L + + Sbjct: 111 DILIAKITPCMEHGKCAIATDLYNGIGFGSTEFNVFRIRDPRFLTEFVFCYLNRDSIRKI 170 Query: 137 IEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 G + +P+PI P+ Q+ I+ + ++ + E+L Sbjct: 171 ATDNMVGTSGRQRVPTAFYEKLPIPILPIEFQLEIQNLVKDSHKALEESKELYKKAEEIL 230 Query: 196 KEK--------KQALVSYIVTKGLN----PDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 + Q+L++ N +K+S ++ L ++++ K Sbjct: 231 YNELGLDPKNPLQSLLNSKTNNSTNSPNISIRTLKESFLKTGRLDSEYYQSKYEDIEKFI 290 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------------------KPESYETYQ 284 + N NI++ N K + K + Sbjct: 291 KSYSNGYDSFLNIINNKDTNFTPKNNENYNYIELANIGNNGNINEPISDLGKNLPTRARR 350 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 IV G+++ I+ +L + + ++ ++++ + + ++S L + +S + Sbjct: 351 IVSNGDVIISSIEGSLSSCALITQE-FDKHLVSTGFFVLNSKLLNSETLLVMFKSQIFQE 409 Query: 345 VFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVET-------ARIDVLVEKI 396 SG ++ E++ ++ + Q I I +D K+ Sbjct: 410 YLKKFPSGTILCAINKEELSKIFIPKIDPTTQEKIAKYIQESFNLRKKSKQLLDNAKIKV 469 Query: 397 EQSIV 401 E+ I Sbjct: 470 EEQIQ 474 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 62/181 (34%), Gaps = 7/181 (3%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET-RNMGLKPESYETYQIVDP 288 + ++ + N ++ + LS N+ R G E Y Sbjct: 50 SYEKLSNIADINPSKAEINNFSKDAIVTFLSMQNLGSGFIHHREQGQIVEFENGYTYFAE 109 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL----AWLMRSYDLCK 344 +I+ I + A + GI + D +L + + K Sbjct: 110 NDILIAKITPCMEHGKCAIATDLYNGIGFGSTEFNVFRIRDPRFLTEFVFCYLNRDSIRK 169 Query: 345 VF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + +G+ RQ + ++LP+ + PI+ Q +I N++ ++ E +++ + Sbjct: 170 IATDNMVGTSGRQRVPTAFYEKLPIPILPIEFQLEIQNLVKDSHKALEESKELYKKAEEI 229 Query: 403 L 403 L Sbjct: 230 L 230 >gi|148976554|ref|ZP_01813250.1| hypothetical protein VSWAT3_11331 [Vibrionales bacterium SWAT-3] gi|145964130|gb|EDK29387.1| hypothetical protein VSWAT3_11331 [Vibrionales bacterium SWAT-3] Length = 427 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 58/428 (13%), Positives = 131/428 (30%), Gaps = 42/428 (9%) Query: 29 PIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKG 84 +++G +S G ++ V + D + KG Sbjct: 8 RFSDLYSMSSGISSTKEQAGHGAPFLSFSAVFNNYFVPDELADLMDASAKQQETYSIKKG 67 Query: 85 QILYGKLGPY-----LRKAIIADFDGICSTQFLVL----QPKDVLPELLQGWLLSIDVTQ 135 I + + D+ + FL Q P+ + +L S + Sbjct: 68 DIFLTRTSEVVDELAMSSVATQDYPRATYSGFLKRLRPTQNDISYPKYMAFYLRSSLFRK 127 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + + + +P QV + + + ID I R L Sbjct: 128 TMTNNAVMTLRASLNEDIFSYLDLLLPDFDTQVKVGDLL----YAIDQKIEVNARINHEL 183 Query: 196 KEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTE--- 243 + L Y + P+ K SG + + +P WE + ++++ Sbjct: 184 GLMTKTLYDYWFVQFDFPNADGKPYKASGGQMLYNKTLKRDIPVDWEARNLDSILSRSGT 243 Query: 244 --LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP------GEIVFRF 295 R N KL E + ++ +I + S E+ I++ G+++F Sbjct: 244 GLNPRSNFKLGEGSNYYVTIKSIDNGKINLDDKCDRISDESLTIINNRSDLKVGDVLFTS 303 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLR 354 I + ++ + + + S Y L+ ++ + + Sbjct: 304 IQPVGETYFIQEKPTNWNINESVFTLRADTEQVTSEYFYMLLSGQEMKAYTKQSSAGSIH 363 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + ++ +K + +IT + + I IE+ +L E R + Sbjct: 364 KGIRHGVLKEFILPFGG----KEITKEFSKVLSPILKKQALIEKENRVLSETRDWLLPML 419 Query: 415 VTGQIDLR 422 + GQ+ ++ Sbjct: 420 MNGQVTVK 427 Score = 44.4 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 51/190 (26%), Gaps = 19/190 (10%) Query: 10 YKDSGVQWI------GAIPKHWKVVPIKRFTKLN-TGRTSESG-----KDIIYIGLEDVE 57 YK SG Q + IP W+ + + TG S Y+ ++ ++ Sbjct: 208 YKASGGQMLYNKTLKRDIPVDWEARNLDSILSRSGTGLNPRSNFKLGEGSNYYVTIKSID 267 Query: 58 SGTGKYLPKDGNSRQSD---TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ---- 110 +G K + S G +L+ + P I + + Sbjct: 268 NGKINLDDKCDRISDESLTIINNRSDLKVGDVLFTSIQPVGETYFIQEKPTNWNINESVF 327 Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 L + V E L ++ + G+ + +P Sbjct: 328 TLRADTEQVTSEYFYMLLSGQEMKAYTKQSSAGSIHKGIRHGVLKEFILPFGGKEITKEF 387 Query: 171 REKIIAETVR 180 + + + Sbjct: 388 SKVLSPILKK 397 >gi|308061790|gb|ADO03678.1| type I restriction enzyme specificity subunit [Helicobacter pylori Cuz20] Length = 390 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 60/367 (16%), Positives = 112/367 (30%), Gaps = 35/367 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQ---S 73 W+ +K K+ G T + I +I +D+ + G+Y+ K S Sbjct: 2 SEWQTFCLKDLGKIVGGATPSTNNPKNYGNKIAWITPKDLSTLQGRYIKKGSRSISRLGF 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K IL+ P IA + F + P + + L Sbjct: 62 KSCSCVLLPKHAILFSSRAPI-GYVAIAKKRLCTNQGFKSIIPNKKI-YFEFLYYLLKYH 119 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFI 192 I I G T +G + IPP EQ I + +I+ Sbjct: 120 KDNISNIGGGTTFKEVSGATLGLFKVKIPPTYYEQQKIARTLSVLDQKIENNHKINELLH 179 Query: 193 ELLKEKKQALVSYI-VTKGLNPDVK-----MKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 ++L+ + G N + MK S E L+P+ +EVK L Sbjct: 180 KILELLYEQYFVRFDFLDGNNKPYQTSGGKMKFS-KELNRLIPNDFEVKTLGELTQLKVG 238 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 S N L+ E+Y+ + I+ Sbjct: 239 NKNANHS------SNQGKYPFFTCSNNPLRCETYQ----FEGKHIIISGNGNFYVTHYDG 288 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 +R + S P+ + L +L + + + + D++ + Sbjct: 289 KFDAYQRTYVVS------PNNPNHYVLIYLFVKSYTNYLKLQSRGSIIKFITKSDIENIK 342 Query: 367 VLVPPIK 373 +++P +K Sbjct: 343 IVLPNLK 349 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 52/166 (31%), Gaps = 9/166 (5%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 N N + + K +R++ + ++ I+F Sbjct: 28 NYGNKIAWITPKDLSTLQGRYIKKGSRSISRLGFKSCSCVLLPKHAILFSSRAPIGYVA- 86 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 +R + ++ P+ + + Y + G + + + Sbjct: 87 ----IAKKRLCTNQGFKSIIPNKKIYFEFLYYLLKYHKDNISNIGGGTTFKEVSGATLGL 142 Query: 365 LPVLVPP-IKEQFDITNVINVETARID---VLVEKIEQSIVLLKER 406 V +PP EQ I ++V +I+ + E + + + LL E+ Sbjct: 143 FKVKIPPTYYEQQKIARTLSVLDQKIENNHKINELLHKILELLYEQ 188 >gi|148983888|ref|ZP_01817207.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP3-BS71] gi|147924035|gb|EDK75147.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP3-BS71] Length = 305 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 67/185 (36%), Gaps = 14/185 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL------- 275 E +P+ WE + + + R + + + + ++ L Sbjct: 121 EVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPE 180 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----YMAVKPHGIDS 330 SY+ +++ G++++ L R ++ G + + V I+ Sbjct: 181 TVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINC 240 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + S + V SG ++ L + +K + +PP+ EQ I + I A Sbjct: 241 HFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAH 300 Query: 389 IDVLV 393 ID L+ Sbjct: 301 IDALI 305 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 4/69 (5%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSS 409 ++L + V + + +PP+ EQ I I ++D E + L KE + S Sbjct: 1 MKNLNSDKVASILIPLPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKS 60 Query: 410 FIAAAVTGQ 418 + A+ G+ Sbjct: 61 ILQYAMQGK 69 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 125 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 184 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 185 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIY 244 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 245 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 304 Query: 185 I 185 I Sbjct: 305 I 305 >gi|325578337|ref|ZP_08148472.1| restriction endonuclease, S subunits [Haemophilus parainfluenzae ATCC 33392] gi|325160073|gb|EGC72202.1| restriction endonuclease, S subunits [Haemophilus parainfluenzae ATCC 33392] Length = 378 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 51/388 (13%), Positives = 113/388 (29%), Gaps = 49/388 (12%) Query: 26 KVVPIKRFTKLNT-----GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 K +P+ ++ + L ++ Y +D + S V I Sbjct: 18 KWIPLGDVADYEQPTKYLVNSTVYNDNYPTPVLTAGKTFILGYTNEDEGIYFASKSPVII 77 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F + DFD + + + L ++ T I Sbjct: 78 F----------DDFTTANKWVDFDFKAKSSAMKMITSKNEKFALLKYIYYWLNTLPNNQI 127 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 IP IPPL+ Q I + + A T L +E + + ++ Sbjct: 128 DGDHKRQWISNYANKLIP--IPPLSVQTEIVKILDALTALTSELTSELTLRRKQYEYYRE 185 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L+S E +G V W+ + + RK ++ Sbjct: 186 KLLSE-----------------EELGKVGFEWKTLDQISENLDSKRK----------PIT 218 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITS 318 G Y I D ++ R+ + + + + Sbjct: 219 SGLRTSGKIPYYGASGIVDYVEDYIFDGDFLLISEDGANLLARNTPIAFSATGKIWVNNH 278 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 A++ + ++ + + DL + L +++ + + +P I +Q I Sbjct: 279 AHILKFNSYEERRFIEFYLNKIDLTPYI---SGAAQPKLNKKNLNSIKIPIPSIPKQQHI 335 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKER 406 ++++ + + E + +I ++R Sbjct: 336 VSILDKFETLTNSITEGLPLAIEQSQKR 363 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 51/186 (27%), Gaps = 8/186 (4%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 P + + + ++ +T +G E Y I+F Sbjct: 19 WIPLGDVADYEQPTKYLVNSTVYNDNYPTPVLTAGKTFILGYTNEDEGIYFASKSPVIIF 78 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 N + + Y+ + + + ++ G Sbjct: 79 DDFTTANK---WVDFDFKAKSSAMKMITSKNEKFALLKYIYYWLNTLPNNQI-----DGD 130 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + + +PP+ Q +I +++ TA L ++ + R ++ Sbjct: 131 HKRQWISNYANKLIPIPPLSVQTEIVKILDALTALTSELTSELTLRRKQYEYYREKLLSE 190 Query: 414 AVTGQI 419 G++ Sbjct: 191 EELGKV 196 >gi|261492678|ref|ZP_05989228.1| type I restriction-modification system, S subunit, putative [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261495897|ref|ZP_05992321.1| type I restriction-modification system, S subunit, putative [Mannheimia haemolytica serotype A2 str. OVINE] gi|261308441|gb|EEY09720.1| type I restriction-modification system, S subunit, putative [Mannheimia haemolytica serotype A2 str. OVINE] gi|261311664|gb|EEY12817.1| type I restriction-modification system, S subunit, putative [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 111 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 36/104 (34%), Gaps = 4/104 (3%) Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G ++ K + + +GI + +L + + S + Sbjct: 12 GSVLIAMYGATIGKLGILKIAATTNQACCACI---PFNGIYNKFLFYYLMSQKAEFQKKS 68 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 GSG + ++ E + +PPI EQ I I A I+ L Sbjct: 69 EGSG-QPNISKEKIINYLFPLPPIHEQHRIVQKIEQLFAEIEKL 111 >gi|222444447|ref|ZP_03606962.1| hypothetical protein METSMIALI_00058 [Methanobrevibacter smithii DSM 2375] gi|222434012|gb|EEE41177.1| hypothetical protein METSMIALI_00058 [Methanobrevibacter smithii DSM 2375] Length = 186 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 68/168 (40%), Gaps = 4/168 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + +K + + G + ++ + G+++FR Q Sbjct: 18 SRYSKKYDGEKQKIDVLYCKVDEFYTREGDIAKDIDSKYLTQNGDVIFRLSSPQVAISIS 77 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 ++++ E +++S ++ +KP ++ +LA L+ S G+ + +K DV RL Sbjct: 78 ENSEIPEGVVVSSKFVIIKPRDVNPDFLAELLNSNIARNQIQKFSEGIIKQIKKNDVARL 137 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P ++EQ + IN+ I + + ++++I + I Sbjct: 138 KFEIPSLEEQKEYVEYINLINKEIKLQKQLLKENID----LKEGIIQK 181 >gi|326565155|gb|EGE15346.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis 103P14B1] gi|326574649|gb|EGE24585.1| type I restriction modification DNA specificity protein [Moraxella catarrhalis 101P30B1] Length = 209 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 57/184 (30%), Gaps = 11/184 (5%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIV 292 + T I L + E + + + ++ Sbjct: 25 KISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKIIEPGVKNSSAKWIPANCVI 84 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + + V + Y+ + + + + ++G+G Sbjct: 85 IAMYGATVGRVGINKIPMTTNQACAN--IEVNEEIAEYRYVYYCLANQY--EYIKSLGTG 140 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + ++ + VK+L + +PP+ Q I +++ + E + + I L ++ R Sbjct: 141 SQTNINAQIVKKLKIPIPPLSVQSQIVAILDTFDTLTQSISEGLPKEIKLRQKQYEYYRE 200 Query: 409 SFIA 412 + Sbjct: 201 QLLN 204 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 20/192 (10%), Positives = 68/192 (35%), Gaps = 9/192 (4%) Query: 26 KVVPIKRFTK-LNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + + K +++G T + + +I ++ ++V S+ Sbjct: 15 EWRALGEVAKKISSGGTPKTGIPEYYNNGEIPWLRTQEVNFNDIYDTGVKIIEPGVKNSS 74 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G + + I + ++ + + E + + + I Sbjct: 75 AKWIPANCVIIAMYGATVGRVGINKIPMTTNQACANIEVNEEIAEYRYVYYCLANQYEYI 134 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +++ G + ++ + + + + +PIPPL+ Q I + ++ + I+L ++ Sbjct: 135 KSLGTG-SQTNINAQIVKKLKIPIPPLSVQSQIVAILDTFDTLTQSISEGLPKEIKLRQK 193 Query: 198 KKQALVSYIVTK 209 + + ++ Sbjct: 194 QYEYYREQLLNF 205 >gi|237822120|ref|ZP_04597965.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CCRI 1974M2] Length = 186 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 65/185 (35%), Gaps = 14/185 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL------- 275 E +P+ WE + + + R + + + + ++ L Sbjct: 2 EVPCEIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQWSGFSIDLARFIDPE 61 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDS 330 SY+ +++ G++++ L R + A + V I+ Sbjct: 62 TVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINC 121 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + S + V SG ++ L + +K + +PP+ EQ I + I A Sbjct: 122 HFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAH 181 Query: 389 IDVLV 393 ID L+ Sbjct: 182 IDALI 186 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 57/181 (31%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP+ W+ V + T + G++ + + I + + ++ S Sbjct: 6 EIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQWSGFSIDLARFIDPETVHS 65 Query: 77 --TVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 66 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIY 125 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 126 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 185 Query: 185 I 185 I Sbjct: 186 I 186 >gi|310831004|ref|YP_003969647.1| putative DNA N6-adenine methyltransferase [Cafeteria roenbergensis virus BV-PW1] gi|309386188|gb|ADO67048.1| putative DNA N6-adenine methyltransferase [Cafeteria roenbergensis virus BV-PW1] Length = 913 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 51/392 (13%), Positives = 107/392 (27%), Gaps = 39/392 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG- 84 + + G + I GKY G + T G Sbjct: 554 EWKKLGDICDFKRGERITKKEHI----------DNGKYYVIGGGDETKNFKTNKFNRSGF 603 Query: 85 QILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ-RIEAIC 141 + G + I + F + L + L + Sbjct: 604 NCRIARYGGSEKNFIKITNFDYWLHDNAFTLQVKNKDLNIKYISYYLLNYIKNNYYYKKL 663 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + D+ G + +PIP L Q KI I ++ + KE + Sbjct: 664 NNSVPPALDFDGFTKLKIPIPSLEIQEETVNKIELFDGLIKSM----EDLNKKHKEGMKI 719 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + ++ K ++ K + + + +R + E N L +S Sbjct: 720 YMEIMLKKYIDEIEWKKLGDVCEI-------------KIGGTPSRNKEEYWEGNNLWVSV 766 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + + V ++ + L + K S+ + + + T+ + Sbjct: 767 RELNNNIINDTKEKISDLGVNKSNVK---LIPKDTILMSFKLSIGKMGITGKDLYTNEAI 823 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 A + + G+ SL +K + + VP ++ Q Sbjct: 824 AGLITNKLIDKKYLYYYLQNNLIINNNDGAMGNGSLNISKLKIIKIPVPSLETQNKTVEQ 883 Query: 382 INVETARIDVLVEKIEQSIVLLKE-RRSSFIA 412 +N ID ++ + I K+ + I Sbjct: 884 LNF----IDQIISENNNMIKNYKQNIKDILIQ 911 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 58/188 (30%), Gaps = 10/188 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 E + L + + R + +I + Y I ET+N Sbjct: 541 DEEMLKLQEKANCEWKKLGDICDFKRGERITKKEHIDNGKYYVIGGGDETKNF------- 593 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL--AWLMR 338 + R +++ + + +A+ + + +L+ Sbjct: 594 -KTNKFNRSGFNCRIARYGGSEKNFIKITNFDYWLHDNAFTLQVKNKDLNIKYISYYLLN 652 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + + + +L F+ +L + +P ++ Q + N I + I + + ++ Sbjct: 653 YIKNNYYYKKLNNSVPPALDFDGFTKLKIPIPSLEIQEETVNKIELFDGLIKSMEDLNKK 712 Query: 399 SIVLLKER 406 +K Sbjct: 713 HKEGMKIY 720 >gi|496159|gb|AAA65634.1| restriction-modification enzyme subunit S1B [Mycoplasma pulmonis] gi|3335666|gb|AAC78318.1| restriction-modification enzyme MpuUV S subunit [Mycoplasma pulmonis] Length = 336 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 43/356 (12%), Positives = 104/356 (29%), Gaps = 34/356 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVNENIVKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + + +P L Q I + I + I I ++ Q ++ Sbjct: 116 AYGYLKNYNINDFEVNLPNLKIQSAIIKIIEPLEKQ----INAFDELILSEQKSLQHYLN 171 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 Y K IE P + + K I S Sbjct: 172 YFFG---------KFYQIE-----PSLFHDYKLEKIAKIRRGK-------IINSFDLKEN 210 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + K Y + + I + I ++ + Sbjct: 211 PGDYPVISSNTKNNGIFGYLNSYMYDGEYITISADGAYAGTVFLNNGKFSITNVCFILLL 270 Query: 325 PHGID--STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 ++ + +L + ++ + ++ R S++ + + + +P ++ Q I Sbjct: 271 NDKVNLLTKFLFYYLKKNENIIQKKSIVGSSRPSVREYTLSEIAIKIPSLEIQSAI 326 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 35/142 (24%), Gaps = 3/142 (2%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K + + I + ++ ++ Sbjct: 31 YNLYSSKTKDQGIFGKINSYDFNGEYILITTHGAYAGTVKYVNEKFSTTSNCFILKVNEN 90 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 I T + LK ++ V +P +K Q I +I Sbjct: 91 IVKTKFLSYLLLLQEKTFNDMAIGSAYGYLKNYNINDFEVNLPNLKIQSAIIKIIEPLEK 150 Query: 388 RI---DVLVEKIEQSIVLLKER 406 +I D L+ ++S+ Sbjct: 151 QINAFDELILSEQKSLQHYLNY 172 >gi|298229904|ref|ZP_06963585.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae str. Canada MDR_19F] Length = 198 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 65/185 (35%), Gaps = 14/185 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL------- 275 E +P+ WE + + + R + + + + ++ L Sbjct: 14 EVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPE 73 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDS 330 SY+ +++ G++++ L R + A + V I+ Sbjct: 74 TVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINC 133 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + S + V SG ++ L + +K + +PP+ EQ I + I A Sbjct: 134 HFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAH 193 Query: 389 IDVLV 393 ID L+ Sbjct: 194 IDALI 198 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 18 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 77 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 78 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIY 137 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 138 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 197 Query: 185 I 185 I Sbjct: 198 I 198 >gi|282851966|ref|ZP_06261326.1| type I restriction modification DNA specificity domain protein [Lactobacillus gasseri 224-1] gi|282556975|gb|EFB62577.1| type I restriction modification DNA specificity domain protein [Lactobacillus gasseri 224-1] Length = 675 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 39/374 (10%), Positives = 101/374 (27%), Gaps = 15/374 (4%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG----QI 86 T +T K I + S K+ + I Sbjct: 308 GNITDYGNEKTIPLHKIAILKNGTSITSSKIKHGNI-PVIAGGREPAYYHNEENRSEPTI 366 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 + G Y D S F + + L + L ++I + G+ Sbjct: 367 TVSQSGAYAGFVSYHDKPIFASDCFTITAKPNSGYSTLDLYYLLKKKQKQIYSFATGSIQ 426 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 H K + + +P Q + ++ + + + L E +Q+L S I Sbjct: 427 KHVYAKDMEDFKVPDKGQELQ-----VVNNLIAGFESEVQRQRQSENELTELQQSLFSDI 481 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 N + + + K ++ ++ Sbjct: 482 DKVYKNSQKVDQSISMLEDNELVKVMGGKRIPKEYDRAPFPTCHYYPGVKDFENFTINLK 541 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + + ++ ++ + + +A+ Sbjct: 542 TSDCIDDVVF--EKIKRYVLKENDVFVSAAGTIGKVGMAPKVKGGTISLTENAHRIRVID 599 Query: 327 GI--DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 +L ++++S ++ ++ + L E +K + + + I EQ ++ + Sbjct: 600 QTKLIPRFLMYILKSQNIQNAMNSLVTKTGTPKLSIESLKNIEIPILKITEQQELIKKWD 659 Query: 384 VETARIDVLVEKIE 397 +I+ + +I Sbjct: 660 QLNTKINDIYSQIN 673 >gi|307287469|ref|ZP_07567521.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0109] gi|306501515|gb|EFM70814.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0109] Length = 286 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 60/186 (32%), Gaps = 14/186 (7%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVD 287 D WE + V + ++ Y + + +N + P + T + + Sbjct: 16 DDWEERKLGDEVRIVMGQSPNSENYTDDPNDYILVQGNADMKNGRVFPRVWTTQVTKQAE 75 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 +++ D V+ RG+ + L + Sbjct: 76 KDDLILSVRAPVGDIGKTAYDVVIGRGVAA--------IKGNEFIFQNLGKMKSDGYWTR 127 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +S+ D+K + VP I+EQ I + ++D + ++ + LLKE + Sbjct: 128 YSTGSTFESINSTDIKEAIISVPTIEEQNKIGSF----FKQLDNTIALHQRKLDLLKETK 183 Query: 408 SSFIAA 413 F+ Sbjct: 184 KGFLQK 189 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 36/281 (12%), Positives = 78/281 (27%), Gaps = 14/281 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + ++ G++ S + G R T K Sbjct: 17 DWEERKLGDEVRIVMGQSPNSENYTDDPNDYILVQGNADMKNGRVFPRVWTTQVTKQAEK 76 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ P +D + ++ + + L + G Sbjct: 77 DDLILSVRAPV-GDIGKTAYDVVIGRGVAAIKGNEFI----FQNLGKMKSDGYWTRYSTG 131 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T + I + +P + EQ I ++D I R ++LLKE K+ + Sbjct: 132 STFESINSTDIKEAIISVPTIEEQNKIGSF----FKQLDNTIALHQRKLDLLKETKKGFL 187 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + K +++ G ++ P K++K + + + N Sbjct: 188 QKMFPKNGAKVPEIRFPGFTEDWEERKLGDIAPLR---GGFAFKSSKFRNTGVPIVRISN 244 Query: 264 IIQKLET--RNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 I+ E + + I+ V K Sbjct: 245 ILSSGEVGGDFAYYDEQDKDDKYILPDKSAVLAMSGATTGK 285 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 11/79 (13%), Positives = 24/79 (30%), Gaps = 5/79 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + W+ + L G +S K + + + ++ S +G+ + D Sbjct: 208 EDWEERKLGDIAPLRGGFAFKSSKFRNTGVPIVRISNILS-SGEVGGDFAYYDEQDKDDK 266 Query: 79 SIFAKGQILYGKLGPYLRK 97 I + G K Sbjct: 267 YILPDKSAVLAMSGATTGK 285 >gi|87310395|ref|ZP_01092525.1| putative specificity protein s [Blastopirellula marina DSM 3645] gi|87286894|gb|EAQ78798.1| putative specificity protein s [Blastopirellula marina DSM 3645] Length = 396 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 45/394 (11%), Positives = 117/394 (29%), Gaps = 28/394 (7%) Query: 56 VESGTGKYLPKDGNSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFL 112 +++G + + + ++ + ++ + A + + Sbjct: 1 MQNGRIDVATARKITESDFFEWTKKALPQENDVILSRRCNPGETAFVDSKLKCALGQNLV 60 Query: 113 VLQPKDVLPELLQGWLLSIDVTQRIE---AICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 +L+ L L + I GA + +PIPPL EQ Sbjct: 61 LLRADGELVYPPFLRWLVRSPHWWNQVGTFINVGAVFDSLRCADVPKFRLPIPPLPEQKA 120 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI--VTKGLNPDVKMKDSG------ 221 I + A +I+ + + ++ V ++ S Sbjct: 121 IASILGALDDKIELNRRMNETLEAMARALFKSWFVDFDPVRAKMDGRQPPGMSADVAALF 180 Query: 222 ----IEWVGL-VPDHWEVKPFFALVTEL---NRKNTKLIESNILSLSYGNIIQKLETRNM 273 + G VP+ W+V L R N + ++ + + M Sbjct: 181 PDKLVHVNGELVPEGWKVGRLGDLCRINSNTVRANEVSGMIEYVDIASVSEGRSSGPTAM 240 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + G+ ++ + N + L E I ++ + + P+ + YL Sbjct: 241 DFNSAPSRARRKISHGDTIWSCVRP-NRRSFLFVHSPPENRIASTGFAVISPNLLTPCYL 299 Query: 334 AWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + +++ +++ + +L P + I + Sbjct: 300 HYAITTHEFTSYLTNCADGSAYPAVRPDHFSDAELLEPDL---QTIEAF-DEVVWSFRNQ 355 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + E + +L E R + + ++G++ + + Sbjct: 356 IAVNEGASNILAELRDALLPKLLSGELRVADAEK 389 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 52/142 (36%), Gaps = 7/142 (4%) Query: 19 GA-IPKHWKVVPIKRFTKLNT--GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 G +P+ WKV + ++N+ R +E I Y+ + V G P + + + Sbjct: 189 GELVPEGWKVGRLGDLCRINSNTVRANEVSGMIEYVDIASVSEGRSS-GPTAMDFNSAPS 247 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G ++ + P R + + I ST F V+ P + P L + + + Sbjct: 248 RARRKISHGDTIWSCVRPNRRSFLFVHSPPENRIASTGFAVISPNLLTPCYLHYAITTHE 307 Query: 133 VTQRIEAICEGATMSHADWKGI 154 T + +G+ Sbjct: 308 FTSYLTNCADGSAYPAVRPDHF 329 >gi|23466324|ref|NP_696927.1| truncated type I restriction system specificity protein [Bifidobacterium longum NCC2705] gi|23327079|gb|AAN25563.1| truncated type I restriction system specificity protein [Bifidobacterium longum NCC2705] Length = 189 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 58/160 (36%), Gaps = 8/160 (5%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 N+ I + I ++ + + + ++VD G +++ + + ++ Sbjct: 33 GNSAYYGGEIPFIRSAEIDCDSTELSLTVAGLNNSSAKLVDKGMVLYAMYGATSGEVAIS 92 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 +G I A +A+ + + + G + +L +K L Sbjct: 93 KI----KGAINQAILAMDASDMAANRFIAYWLRRQKKSITETFLQGGQGNLSGAIIKELG 148 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + P + EQ I + + +D L+ ++ + +++R Sbjct: 149 IPQPSLDEQRQIGSF----FSNLDDLITLHQRKRLSIRQR 184 Score = 68.3 bits (165), Expect = 3e-09, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 54/180 (30%), Gaps = 10/180 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + +G T +G +I +I +++ + S+ Sbjct: 13 WEQRKLGELALTYSGGTPSAGNSAYYGGEIPFIRSAEID---CDSTELSLTVAGLNNSSA 69 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + KG +LY G + I+ G + L + D+ + L E Sbjct: 70 KLVDKGMVLYAMYGATSGEVAISKIKGAINQAILAMDASDMAANRFIAYWLRRQKKSITE 129 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 +G + I + +P P L EQ I I +R+ + Sbjct: 130 TFLQG-GQGNLSGAIIKELGIPQPSLDEQRQIGSFFSNLDDLITLHQRKRLSIRQRSPVW 188 >gi|325104013|ref|YP_004273667.1| hypothetical protein Pedsa_1278 [Pedobacter saltans DSM 12145] gi|324972861|gb|ADY51845.1| hypothetical protein Pedsa_1278 [Pedobacter saltans DSM 12145] Length = 397 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 58/405 (14%), Positives = 123/405 (30%), Gaps = 34/405 (8%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + + + +D+ L V T K +P N+ +D S I KGQ Sbjct: 4 KKLGDYIQ----QVNNRNRDLQVETLLGVSI-TKKLIPSIANTVGTDMSAYKIVEKGQFA 58 Query: 88 YGKLGPYLR-----KAIIADFDGICSTQFLVL---QPKDVLPELLQGWLLSIDVTQRIEA 139 YG + + S ++V +LPE L W + + Sbjct: 59 YGTITSRNGDKISIALADEYDKALVSQIYIVFEVIDTNLLLPEYLMMWFSRPEFDRYARY 118 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+T DW+ + + +PIP + +Q I + ++ I + E L+ Sbjct: 119 HSHGSTREAFDWEDLCEVELPIPSIEKQREIVA----QYQAVENKIKVNEQICEQLEATA 174 Query: 200 QALVSYIVTKGLNPD---VKMKDSG------IEWVGLVPDHWEVKPFFALVTELNRKNTK 250 Q L P+ K SG E +P+ WEV ++ + K Sbjct: 175 QTLYKQWFVDFEFPNENGEPYKSSGGIMVFNEELEKEIPEGWEVGKLEDIIYYSDTKIAL 234 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + +S +++ + + + + G+I+ I K + Sbjct: 235 KNLTTDNYISTESMLPEKKGVEFISNVPEGNNVTVFEKGDILISNIRPYLKKIWFAN--- 291 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLV 369 + G + + ++ + G + + +L+ Sbjct: 292 KKGGCSNDVLCIRSKEIVYQFFALNILFNDQFFDYVMQGAKGTKMPRGDKDWILEYKILL 351 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 P +I + + + + L + +S + Sbjct: 352 PK----KEILATFSKDIELVSRVKISKTIQNQKLTQLQSFLLNRL 392 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 68/173 (39%), Gaps = 16/173 (9%) Query: 10 YKDSG------VQWIGAIPKHWKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDV--ESGT 60 YK SG + IP+ W+V ++ + + ++ YI E + E Sbjct: 195 YKSSGGIMVFNEELEKEIPEGWEVGKLEDIIYYSDTKIALKNLTTDNYISTESMLPEKKG 254 Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 +++ + + V++F KG IL + PYL+K A+ G CS L ++ K+++ Sbjct: 255 VEFISNVP-----EGNNVTVFEKGDILISNIRPYLKKIWFANKKGGCSNDVLCIRSKEIV 309 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 L + Q + + +GA + I L ++ ++ Sbjct: 310 --YQFFALNILFNDQFFDYVMQGAKGTKMPRGDKDWILEYKILLPKKEILATF 360 >gi|124265199|ref|YP_001019203.1| restriction modification system, type I [Methylibium petroleiphilum PM1] gi|124257974|gb|ABM92968.1| restriction modification system, type I [Methylibium petroleiphilum PM1] Length = 412 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/368 (12%), Positives = 106/368 (28%), Gaps = 31/368 (8%) Query: 81 FAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSID----VT 134 + +++ G I+ + + S+ + L + Sbjct: 45 LSPNDLVFPHRGAIGEVGIVPEDGERYVLSSSLMKLTCDVARAHPDFVYYFFKSAIGRFE 104 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G + I + +PP+ EQV I + A RI L + Sbjct: 105 LLKNSSQVGTPGIGQPLTSLKQIKLRLPPVGEQVAIAAALRALDDRIALLRDTNATLEAI 164 Query: 195 LKEKKQALVS-----YIVTKGLNPDVK------MKDSGIEW--VGLVPDHWEVKPFFALV 241 + ++ ++GL P + G+E +G VP W Sbjct: 165 AQALFKSWFVDFDPVRAKSQGLAPAGMDEATAALFPEGVEESALGPVPRGWRAATLAETF 224 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 ++ G ++ ++ + + G+ + I + Sbjct: 225 EINPSRSLPKDSEAKYLEMAGVPTTGHCAESIAVRA--FGSGTKFRNGDTLLARITPCLE 282 Query: 302 KRSLRSAQV---MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF---YAMGSGLRQ 355 E G ++ ++ ++P Y A+L+ + + F G+ RQ Sbjct: 283 NGKTAFVDFLVEDEIGWGSTEFIVLRPKAPLPDYFAYLLCRHAPFREFAERSMSGTSGRQ 342 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + + + VPP + + + L R + + + Sbjct: 343 RVQNDVLATYRIAVPP----SAVAEAFGALINPLRHAITSNHARGATLGALRDALLPRLI 398 Query: 416 TGQIDLRG 423 +GQ+ L Sbjct: 399 SGQLRLPD 406 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 58/173 (33%), Gaps = 15/173 (8%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE----TYQIVDPGEIVFRFIDLQNDK 302 K+ +++ + + N+ + + + P ++VF + Sbjct: 2 KSDCYVDAGVRVVRGTNLTGGRSFSGEFVFITPEKAVELNSANLSPNDLVFPHRGAIGEV 61 Query: 303 RSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKF 359 + + ER +++S+ M + ++ + +S S + + Sbjct: 62 GIVP--EDGERYVLSSSLMKLTCDVARAHPDFVYYFFKSAIGRFELLKNSSQVGTPGIGQ 119 Query: 360 --EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +K++ + +PP+ EQ I + +D + + + L+ + Sbjct: 120 PLTSLKQIKLRLPPVGEQVAIAAALRA----LDDRIALLRDTNATLEAIAQAL 168 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 65/196 (33%), Gaps = 13/196 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G +P+ W+ + ++N R+ + Y+ + V T + + R + Sbjct: 208 LGPVPRGWRAATLAETFEINPSRSLPKDSEAKYLEMAGV--PTTGHCAESIAVRAFG--S 263 Query: 78 VSIFAKGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKDVLPELL-QGWLL 129 + F G L ++ P L ++ D G ST+F+VL+PK LP+ Sbjct: 264 GTKFRNGDTLLARITPCLENGKTAFVDFLVEDEIGWGSTEFIVLRPKAPLPDYFAYLLCR 323 Query: 130 SIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + E G + + + +PP A I I + Sbjct: 324 HAPFREFAERSMSGTSGRQRVQNDVLATYRIAVPPSAVAEAFGALINPLRHAITSNHARG 383 Query: 189 IRFIELLKEKKQALVS 204 L L+S Sbjct: 384 ATLGALRDALLPRLIS 399 >gi|254505201|ref|ZP_05117352.1| hypothetical protein SADFL11_5241 [Labrenzia alexandrii DFL-11] gi|222441272|gb|EEE47951.1| hypothetical protein SADFL11_5241 [Labrenzia alexandrii DFL-11] Length = 279 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 44/297 (14%), Positives = 87/297 (29%), Gaps = 29/297 (9%) Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + GA+ K + +P+PPL EQ I + Sbjct: 1 MFVKDMVGKSTGASYPAVSDKIVKASSIPLPPLDEQRRISAILDKADSLRQKRKQAIALL 60 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 L Q++ + +P K P + + + + + + Sbjct: 61 DSLT----QSIFLEMFG---DPVSNPKGW--------PQNNSLSDIADIASGITKGRKLR 105 Query: 252 IESN--ILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 E + L+ N+ K + + Y++ ++ D R Sbjct: 106 GEPTRTVPYLAVANVQDKTLKLDIVKTIEATEAEIGRYRLQVDDLLLTEGGDPDKLGRGS 165 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFYAMG--SGLRQSLKFED 361 + I + V+ + L WL+ S + F + S+ Sbjct: 166 LWRGELHEAIHQNHIFRVRLTSNNVHPLYAMWLIGSDYGKRYFLKSAKQTTGIASINKTQ 225 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + LP L+PP K Q + + ++D L+ L SS A +G+ Sbjct: 226 LSNLPFLLPPKKLQQEFADQAQAVKTKLDKLLTCE----DLTNSLFSSLQHRAFSGE 278 Score = 43.6 bits (101), Expect = 0.065, Method: Composition-based stats. Identities = 17/166 (10%), Positives = 43/166 (25%), Gaps = 15/166 (9%) Query: 22 PKHW-KVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK W + + + +G T E + + Y+ + +V+ T K Sbjct: 79 PKGWPQNNSLSDIADIASGITKGRKLRGEPTRTVPYLAVANVQDKTLKLDIVKTIEATEA 138 Query: 75 TSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL- 129 +L + G + + + Sbjct: 139 EIGRYRLQVDDLLLTEGGDPDKLGRGSLWRGELHEAIHQNHIFRVRLTSNNVHPLYAMWL 198 Query: 130 ---SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 +++ + ++ + + N+P +PP Q + Sbjct: 199 IGSDYGKRYFLKSAKQTTGIASINKTQLSNLPFLLPPKKLQQEFAD 244 >gi|254777458|ref|ZP_05218974.1| Type I restriction-modification system (specificity subunit) [Mycobacterium avium subsp. avium ATCC 25291] Length = 392 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 51/400 (12%), Positives = 125/400 (31%), Gaps = 33/400 (8%) Query: 26 KVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + V + L E + IG+ G G + + + A G Sbjct: 4 ERVRVGDVLSLQRRSVDIEPFTEYSLIGVYSF--GKGIFHREPRRGSELGDYRFFSIAPG 61 Query: 85 QILYGKLGPYLRKAIIADFD--GICSTQ---FLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++ + + A G T V + V + + LS + I Sbjct: 62 DLVLSNIQAWEGAIACAQERDAGTIGTHRFLTYVSRDGQVDTAWAKWFFLSEPGMELIRK 121 Query: 140 ICEGATMSH--ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G T+ + + +P+PP+ EQ + ++ + + R L + Sbjct: 122 AAPGTTIRNRTLAIDRFEALEIPLPPIDEQRQVASQLDRLSEVVQLASERRRHGETLFRA 181 Query: 198 KKQALVSYIVTK-GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + S ++ G + + + + P A + Sbjct: 182 LTDSRESKLIAGLGKTGVPARRLADVAEINPRPTRLAADTLVAF---------------V 226 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + + + E Y+ G+++F I + G+ Sbjct: 227 PMAAVDADTGSVSDAEVRSVAELGAGYKQFRRGDVIFARITPCMQNGKSAVFSDRDYGLG 286 Query: 317 TSAYMAVKP-HGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIK 373 ++ + V+P + + + Y+ ++R+ + + G+ +Q + + ++ L V +P + Sbjct: 287 STEFHVVRPGNEVSAEYIHRILRTRAVRLNATEHFTGTAGQQRVPADFLRELLVPIPSRE 346 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +Q +I ++ A +++ L + S + A Sbjct: 347 DQQEIVASLDALRASAGEFRALNQKASALAR----SLLPA 382 >gi|258517328|ref|YP_003193550.1| hypothetical protein Dtox_4260 [Desulfotomaculum acetoxidans DSM 771] gi|257781033|gb|ACV64927.1| hypothetical protein Dtox_4260 [Desulfotomaculum acetoxidans DSM 771] Length = 287 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 67/204 (32%), Gaps = 9/204 (4%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN-MGLKPESYE 281 E P+ W + L+ + + + + + L + + + Sbjct: 17 ERCEKYPNDWVIAKLGNLLERVRMPVKVIANCEYQEIGIRSHGKGLFYKEPIKGSDLGNK 76 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + ++ ++ + ++ +A+ M I+ YL + Sbjct: 77 SVFWIEADCLIINIVFAWEQAVAITTAREKGMIASHRFPMWKSKGNIELNYLLKFFLTPF 136 Query: 342 LCKVFYAMGSGL---RQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIE 397 + G ++L ++ ++ V +P I+EQ I + D +E E Sbjct: 137 GKNLLELASPGGAGRNKTLGQDEFNKILVCIPSNIEEQQKIVKIFTTW----DKAIELKE 192 Query: 398 QSIVLLKERRSSFIAAAVTGQIDL 421 + I+ K ++ + +TG+ L Sbjct: 193 KLILEKKNQKKWLMQNLLTGKKRL 216 Score = 37.5 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 20/189 (10%), Positives = 55/189 (29%), Gaps = 4/189 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P W + + + + + Y + G G + + +V Sbjct: 23 PNDWVIAKLGNLLERVRMPV-KVIANCEYQEIGIRSHGKGLFYKEPIKGSDLGNKSVFWI 81 Query: 82 AKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 ++ + + + I + I S +F + + K + + + Sbjct: 82 EADCLIINIVFAWEQAVAITTAREKGMIASHRFPMWKSKGNIELNYLLKFFLTPFGKNLL 141 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + G + + + ++KI+ D I + + I K + Sbjct: 142 ELASPGGAGRNKTLGQDEFNKILVCIPSNIEEQQKIVKIFTTWDKAIELKEKLILEKKNQ 201 Query: 199 KQALVSYIV 207 K+ L+ ++ Sbjct: 202 KKWLMQNLL 210 >gi|332292348|ref|YP_004430957.1| restriction modification system DNA specificity domain protein [Krokinobacter diaphorus 4H-3-7-5] gi|332170434|gb|AEE19689.1| restriction modification system DNA specificity domain protein [Krokinobacter diaphorus 4H-3-7-5] Length = 465 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 56/391 (14%), Positives = 110/391 (28%), Gaps = 22/391 (5%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 ++G + G Y ++ S V + +++ Sbjct: 5 KFDELFDFAKKSKIKAGDG----------NKEGLYPFYTSSAILSKRIDVFQEERVSLIF 54 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAICEGATMS 147 G G A D ST +V K+ + +E +GA + Sbjct: 55 GTGG--KASAHYVDEQFSTSTDCIVAYKKEDKDLNEKFVFYYLFGNIHILERGFKGAGLK 112 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 H K I N+ +PI P+ Q I + + + ELL+ + + Sbjct: 113 HISKKYIQNLDIPILPIETQNKIVALLDKASALVQKREESIALLDELLRAQFLKMFGKAN 172 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + S + + PF + + + + + N + Sbjct: 173 PQFSVWADVQIKSLVL---DRKNSMRTGPFGSNLKHSEFVEDGPVAVLGIDNAVKNTFEW 229 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 E R + + V P +++ + + A +++ P Sbjct: 230 KERRFITNEKYEELKRYTVFPRDVIITIMGTVGRSAVIPENIPTAINTKHLACLSLDPKK 289 Query: 328 IDSTYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + YLA+ + S + L +K L + PI+ Q Sbjct: 290 CNPYYLAYSIHSNPYLSFQMKAREKGAIMAGLNLTIIKDLKLKDVPIELQNKF----EDI 345 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 I V E + QS L +S + A + Sbjct: 346 YHNIQVQKETLTQSKNELDNLYNSLLQRAFS 376 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 58/174 (33%), Gaps = 10/174 (5%) Query: 45 GKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF 103 + +G+++ T ++ + + + + ++ +G R A+I + Sbjct: 211 DGPVAVLGIDNAVKNTFEWKERRFITNEKYEELKRYTVFPRDVIITIMGTVGRSAVIPEN 270 Query: 104 DGIC----STQFLVLQPKDVLPEL-LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIP 158 L L PK P + ++ +++A +GA M+ + I ++ Sbjct: 271 IPTAINTKHLACLSLDPKKCNPYYLAYSIHSNPYLSFQMKAREKGAIMAGLNLTIIKDLK 330 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + P+ Q K I + L +L+ ++ LN Sbjct: 331 LKDVPIELQ----NKFEDIYHNIQVQKETLTQSKNELDNLYNSLLQRAFSEQLN 380 >gi|307067138|ref|YP_003876104.1| restriction endonuclease S subunit [Streptococcus pneumoniae AP200] gi|306408675|gb|ADM84102.1| Restriction endonuclease S subunit [Streptococcus pneumoniae AP200] Length = 240 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 67/185 (36%), Gaps = 14/185 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL------- 275 E +P+ WE + + + R + + + + ++ L Sbjct: 56 EVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPE 115 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----YMAVKPHGIDS 330 SY+ +++ G++++ L R ++ G + + V I+ Sbjct: 116 TVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINC 175 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + S + V SG ++ L + +K + +PP+ EQ I + I A Sbjct: 176 HFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAH 235 Query: 389 IDVLV 393 ID L+ Sbjct: 236 IDALI 240 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 60 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 119 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 120 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIY 179 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 180 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 239 Query: 185 I 185 I Sbjct: 240 I 240 >gi|294155918|ref|YP_003560302.1| type I restriction-modification system, specificity protein [Mycoplasma crocodyli MP145] gi|291600326|gb|ADE19822.1| type I restriction-modification system, specificity protein [Mycoplasma crocodyli MP145] Length = 417 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 50/394 (12%), Positives = 118/394 (29%), Gaps = 22/394 (5%) Query: 26 KVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ----SDTSTVSI 80 + + T + + K I +E+ T K L +G + S + Sbjct: 16 NIKKLWEVTYWDKKFKNIDKSKQPKTIKYRYLEASTLKDLIVEGGDVKILSTGKFSAYTT 75 Query: 81 FAK-GQIL-----YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 K G L G + + ++ + + + Sbjct: 76 KEKAGDFLAYGEVVSIPGGGSAIIKYTNGYFCTTDNRIMTSRNKDILNNKFLYFYLKLIN 135 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 Q +E GA++ H + + I ++ +PIP + Q I E + L E + Sbjct: 136 QDVENTYRGASIKHPEMRRILDLKIPIPQIEIQNKIVEILDKFEELEAELTAELTAELTA 195 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 ++ ++ KD ++ + V + + + +E+ Sbjct: 196 RYKQYNYYKQLLLDF-----SNRKDVEVKKLWEVTYWDKKFKNIDKSKQPKTIKYRYLEA 250 Query: 255 NILS--LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + L + G ++ L T + + GE+V + ++ Sbjct: 251 STLKDLIVEGGDVKILSTGKFSAYTTKEKAGDFLAYGEVV----SIPGGGSAIIKYTNGY 306 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + M + I + + V A + + + + +P I Sbjct: 307 FCTTDNRIMTSRNKDILNNKFLYFYLKLINKDVGNAYRGAGIKHPDMKTILEFKIPIPSI 366 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +EQ I +++ + + E + + L K++ Sbjct: 367 EEQNKIVEILDKFEIYSNSINEGLPLELELRKKQ 400 >gi|253569551|ref|ZP_04846961.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251841570|gb|EES69651.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 326 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 56/349 (16%), Positives = 110/349 (31%), Gaps = 38/349 (10%) Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ 110 +GLE + K+ D ++ + T F KGQIL+G+ YL+KA IADFDGICS Sbjct: 1 VGLEHLIPQEIKFSGYDVDTENTFT---KTFKKGQILFGRRRAYLKKAAIADFDGICSGD 57 Query: 111 FLVL--QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 V+ P V P LL + + + G W+ + + +PP+ EQ Sbjct: 58 ITVIEAIPGKVDPLLLPFIIQNDKFFDYAVSRSAGGLSPRVKWEHLKDYEFDLPPIEEQR 117 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 ++ +K+ A + K L +M S + Sbjct: 118 ILADKLWAAYR-----------------------LKESYKKLLTATQEMVKSQFIEIFYG 154 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV-- 286 + VK + + + + N + + S E ++V Sbjct: 155 METTPVKDYIDDSFPGEWGTEDKDGNGVKVIRTTNFTNSGKLNLADVVTRSIEDRKVVRK 214 Query: 287 --DPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY----LAWLMR 338 + + N + + + + ++ +D + L + + Sbjct: 215 QIKKYDTILERSGGTADNPVGRVVLFEEDNLFLCNNFTQVLRFKDVDPRFAFYALYYFYQ 274 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + Q+L + + ++Q + Sbjct: 275 TNRTAIRSMGSKTTGIQNLNMSKYLEIGIPNASDEDQKAFVTIAEQADK 323 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 55/143 (38%), Gaps = 8/143 (5%) Query: 269 ETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 E + G ++ T+ G+I+F K ++ + G IT + P Sbjct: 10 EIKFSGYDVDTENTFTKTFKKGQILFGRRRAYLKKAAIADFDGICSGDIT--VIEAIPGK 67 Query: 328 IDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +D L +++++ GL +K+E +K +PPI+EQ + + + Sbjct: 68 VDPLLLPFIIQNDKFFDYAVSRSAGGLSPRVKWEHLKDYEFDLPPIEEQRILADKLWAAY 127 Query: 387 ARIDVLVEKIEQSIVLLKERRSS 409 L E ++ + +E S Sbjct: 128 ----RLKESYKKLLTATQEMVKS 146 >gi|157164035|ref|YP_001466968.1| type I restriction modification DNA specificity domain-containing protein [Campylobacter concisus 13826] gi|112800945|gb|EAT98289.1| type I restriction enzyme [Campylobacter concisus 13826] Length = 382 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 45/383 (11%), Positives = 103/383 (26%), Gaps = 36/383 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ K + K T + + G Y + + Sbjct: 13 PEGVKFDELGVICKSLAKGTLKQEDLV----------DKGAYPVVNSSRDYYGFYDKYNN 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP--ELLQGWLLSIDVTQRIEA 139 G Y D KD + L + ++ Sbjct: 63 EANAFTIASRGEYAGFVKFIDCKFWAGGLCYPYASKDEDYVLTKFIFYFLKSIEKKNMDI 122 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + ++ + + +P+PP+ Q I + + T + EL KK Sbjct: 123 LVARGSIPALNKSDFDKVKIPVPPMEVQREIARIMDSFTSLTEE--LMAKLTEELTARKK 180 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 Q + K ++ +G + A K +K + Sbjct: 181 QYEFYRDFLLSFDELDKNGGCELKTLGEI------CDLIAGRDISKDKVSKEKDIKFKFP 234 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 Y N I +P V + + ++ I Sbjct: 235 IYSNGIGDNALYGFTDEP-------RVMKQCVTISARGT----IGYCALRLDPFYPIVRL 283 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 A+ I + +L + + + + ++ + L V ++ + VP ++ Q + Sbjct: 284 ICAIPKSNITAQFLKYFLDTQKI-----SVPTSGIPQLTIPMVAKIKIPVPSLQTQQKVV 338 Query: 380 NVINVETARIDVLVEKIEQSIVL 402 ++++ ++ + E + + I L Sbjct: 339 DILDKFDTLVNSITEGLPREIEL 361 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 17/145 (11%), Positives = 51/145 (35%), Gaps = 5/145 (3%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 N + + + G+ Y + + + Sbjct: 48 NSSRDYYGFYDKYNNEANAFTIASRGEYAGFVKFIDCKFWAGGLCYP-YASKDEDYVLTK 106 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 ++ + ++S + + + G +L D ++ + VPP++ Q +I +++ T+ + Sbjct: 107 FIFYFLKSIEKKNMDILVARGSIPALNKSDFDKVKIPVPPMEVQREIARIMDSFTSLTEE 166 Query: 392 LVEKIEQSIVLLKE----RRSSFIA 412 L+ K+ + + K+ R ++ Sbjct: 167 LMAKLTEELTARKKQYEFYRDFLLS 191 >gi|260887977|ref|ZP_05899240.1| type I restriction enzyme EcoR124II specificity protein [Selenomonas sputigena ATCC 35185] gi|260862228|gb|EEX76728.1| type I restriction enzyme EcoR124II specificity protein [Selenomonas sputigena ATCC 35185] Length = 124 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 42/122 (34%), Gaps = 8/122 (6%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 K ++ + + + + + Y+ + S K A G G + Sbjct: 1 MYGATAAKVAINRIPLTTNQACCN--LKINEEMAEHRYVYHWLCSQY--KTLKAKGQGSQ 56 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSF 410 ++ +++ P+ VPP+ Q I ++++ + L + I K+ R Sbjct: 57 SNINKNIIEKYPIPVPPLDVQQKIVSILDRFDTLCNDLTSGLPAEIAARKKQYEHYRDRL 116 Query: 411 IA 412 + Sbjct: 117 LT 118 >gi|218283420|ref|ZP_03489439.1| hypothetical protein EUBIFOR_02028 [Eubacterium biforme DSM 3989] gi|218215893|gb|EEC89431.1| hypothetical protein EUBIFOR_02028 [Eubacterium biforme DSM 3989] Length = 201 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 61/178 (34%), Gaps = 8/178 (4%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 K + L++S G +I PG++V+ I Sbjct: 28 MQRPFVWATSKVSDLMDSLYKGYPVGYLIIWKNPDVKLKNGTLSSGKFRFRPGDVVYGKI 87 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ 355 + Q K S + AY+ +GI +L L+++ D K ++ Sbjct: 88 NPQLGKYFYASVDGLTSA---DAYVFNGKNGISQKFLFSLLQTADFFKYSVSVSKRSGMP 144 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +++ L P +EQ I + + +D L+ ++ + L+ + S + Sbjct: 145 KINRDELNAYSFLAPNAEEQNKIGDFL----LELDHLITLHQRELKKLQNIKKSMLEK 198 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 65/199 (32%), Gaps = 23/199 (11%) Query: 15 VQWI--GAI-------PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 + WI G I P W + + G + Y+ + Sbjct: 15 ISWINSGEIAIPEMQRPFVWATSKVSDLMD-----SLYKGYPVGYL----IIWKNPDVKL 65 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL- 124 K+G +S F G ++YGK+ P L K A DG+ S V K+ + + Sbjct: 66 KNGTL----SSGKFRFRPGDVVYGKINPQLGKYFYASVDGLTSADAYVFNGKNGISQKFL 121 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 L + D + ++ + + M + + P EQ I + ++ I Sbjct: 122 FSLLQTADFFKYSVSVSKRSGMPKINRDELNAYSFLAPNAEEQNKIGDFLLELDHLITLH 181 Query: 185 ITERIRFIELLKEKKQALV 203 E + + K + + Sbjct: 182 QRELKKLQNIKKSMLEKMF 200 >gi|325973137|ref|YP_004250201.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323651739|gb|ADX97821.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 295 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 48/142 (33%), Gaps = 10/142 (7%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDS 330 N ++ + +L E + + Y Sbjct: 56 NRNYNFYGLLQSKLFPKNTVCVVETGSLVTDSALLKF---EACLSSDLYGFIPFSKISTP 112 Query: 331 TYLAWLMRSYDLCKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 T++ + + + + + S + L + ++ PP++ Q I ++ +R Sbjct: 113 TFIKYCLDAPKNKRKLKNLASLYITQPHLTLSKLFQVKFPKPPLEIQQKIGEIL----SR 168 Query: 389 IDVLVEKIEQSIVLLKERRSSF 410 D++++ E+ I LLK ++S Sbjct: 169 YDLILDNHERQIELLKNLKASL 190 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/280 (13%), Positives = 73/280 (26%), Gaps = 20/280 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W++V + + +++ G + IG ++V L + N Sbjct: 5 WELVTLDKLGRISKGIQKHKPNHDKKLFCFGKVPLIGCKEVSDSRLTVLKSNRNYNFYGL 64 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL---SID 132 +F K + + G + + + F+ S+ P + + Sbjct: 65 LQSKLFPKNTVCVVETGSLVTDSALLKFEACLSSDLYGFIPFSKISTPTFIKYCLDAPKN 124 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + T H + + P PPL Q I E + + +D + Sbjct: 125 KRKLKNLASLYITQPHLTLSKLFQVKFPKPPLEIQQKIGEILSRYDLILDNHERQIELLK 184 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L +L K PD + S +P+ W F L K Sbjct: 185 NLKA----SLFKEWFIKLRFPDYEKYSSE----NGIPEGWRKIRFGDLTEIQIGKKPASH 236 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + L +V G + Sbjct: 237 SELLDGLGKYPFFTCSTKTKNSYTFSYDFPSLLVSAGGAI 276 Score = 37.9 bits (86), Expect = 3.7, Method: Composition-based stats. Identities = 9/47 (19%), Positives = 20/47 (42%), Gaps = 5/47 (10%) Query: 3 HYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDII 49 + Y +Y S IP+ W+ + T++ G+ S +++ Sbjct: 199 RFPDYEKY-SSE----NGIPEGWRKIRFGDLTEIQIGKKPASHSELL 240 >gi|268610088|ref|ZP_06143815.1| hypothetical protein RflaF_11404 [Ruminococcus flavefaciens FD-1] Length = 385 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 51/397 (12%), Positives = 119/397 (29%), Gaps = 35/397 (8%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + + + +E + I +E G +P N +Q+ + I +Y Sbjct: 6 KFSELIEEISEQNTELKYGLDDIVGVTIEKG---LIPTIANLQQTALNKFYIVKPDTFVY 62 Query: 89 G------KLGPYLRKAIIADFDGICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRIEAI 140 +LG K + F V + + PE L + + + Sbjct: 63 NPRTHGVRLGMGFNKTNYTYITSWNNIAFKVKDDALRILNPEYLWLYFNRSEWDRETNYH 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G++ W N+ + IP + Q + ++ + I I + R + L+ Sbjct: 123 AWGSSTIVFSWNTFLNLEIQIPEKSYQ----DNLVRQYNAIKRRIALKQRINDNLEATLM 178 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + V +S I + + + N+ ++ + Sbjct: 179 TVYKDKVAD---------NSEITTTSPLGSLCKQITDGKHGDCESEDNSGYFFVSVKDII 229 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G I K + ++ G+I+F + S+ + Sbjct: 230 NGCIEYKNARQITRADFSDANKRTNLEVGDILFTNSGTLGRMALITSSYYANITTFQKSV 289 Query: 321 MAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 +KP S+ +L SY+ K+ +++L D++ + P Sbjct: 290 AILKPDTKKISSIFMYLSLSYNKSKIIEFAHGSAQKNLLLSDIRGFEIKYPS-------A 342 Query: 380 NV---INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I+ + ++ + ++ L+E ++ Sbjct: 343 EYRNGIDDLIKPLFERIQNNNEELIKLRELSRILLSQ 379 >gi|60681333|ref|YP_211477.1| putative type IC restriction-modification system specificity subunit, partial [Bacteroides fragilis NCTC 9343] gi|60492767|emb|CAH07541.1| putative type IC restriction-modification system specificity subunit, partial [Bacteroides fragilis NCTC 9343] Length = 376 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 48/150 (32%), Gaps = 4/150 (2%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 Y ++ + +++ S + ++ Sbjct: 46 YTTYKSEVINDVQSKTDIDAKNLVRSKENDVIIPSSGETAIDISTARCVPYDDVLLGGDL 105 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 ++ + D +L++ + + L E +K L V +P +KEQ I + Sbjct: 106 NIIRLYQNDGRFLSYQLNGVRKLDIARVAQGSSVIHLYGESIKSLSVSLPALKEQQKIVS 165 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSF 410 ++ + ID + + I K+ +++ Sbjct: 166 LL----SLIDERIATQNKIIEEYKKLKNAL 191 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 51/399 (12%), Positives = 112/399 (28%), Gaps = 43/399 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESG------TGKYLPKDGNSRQSDTST 77 WK I+ ++ G + ++ G + G + + + D Sbjct: 9 EWKKYFIRDIAEVTKGAGISKEQRSLF-GTPCILYGELYTTYKSEVINDVQSKTDIDAKN 67 Query: 78 VSIFAKGQILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + ++ G + A +D + L + + L+ Sbjct: 68 LVRSKENDVIIPSSGETAIDISTARCVPYDDVLLGGDLNIIRLYQNDGRFLSYQLNGVRK 127 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + +G+++ H + I ++ + +P L EQ I + ID I + + IE Sbjct: 128 LDIARVAQGSSVIHLYGESIKSLSVSLPALKEQQKIVSLL----SLIDERIATQNKIIEE 183 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K+ K AL K +G + D + ++ + L Sbjct: 184 YKKLKNALAELFFA---------KSIEYTSIGEMCDVVMGQSPSSVAYNYTKNGLPL--- 231 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 IQ G+ T I + I L A+ Sbjct: 232 ----------IQGNLDIFEGVTSPRMWTSDITKQCD--IGDIILTVRAPVGDVAKSNMIA 279 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + A+K + + Y K ++ D+ + + V Sbjct: 280 CVGRGVCAIKVKESGCSEYVYQYLLYFKAKWGSIEQGSTFSAISRNDILNINIPVITK-- 337 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + A D + ++ + +++ + Sbjct: 338 -RLIVA--SHLLALFDSEISIEALNLNVYTKQKQYLLTK 373 >gi|298484558|ref|ZP_07002686.1| type I restriction enzyme EcoEI specificity protein [Bacteroides sp. D22] gi|298269286|gb|EFI10919.1| type I restriction enzyme EcoEI specificity protein [Bacteroides sp. D22] Length = 167 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 63/169 (37%), Gaps = 8/169 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQI 285 +P W + N + K + L I + + P++YE+ + Sbjct: 2 QLPKGWTTIKVGDVAIYTNGRAFKPEDWMHEGLPIIRIQNLNDNSASYNRTPKTYESKYL 61 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCK 344 + G+++F + + + V P+ YL + ++ Sbjct: 62 IHNGDLLFAWAASLGTYI-----WNGGKAWLNQHIFKVDPYPFAQKQYLYHVFKAMITEF 116 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + GSG+ + + + + +L+PP++EQ I + + ++DV++ Sbjct: 117 YTQSHGSGMV-HITKKQFENIKLLLPPLEEQKRIVQTLEQISTKLDVIM 164 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 53/163 (32%), Gaps = 3/163 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W + + GR + +D ++ GL + N + Sbjct: 2 QLPKGWTTIKVGDVAIYTNGRAFKP-EDWMHEGLPIIRIQNLNDNSASYNRTPKTYESKY 60 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L+ L I + + P + + + + Sbjct: 61 LIHNGDLLFA-WAASLGTYIWNGGKAWLNQHIFKVDP-YPFAQKQYLYHVFKAMITEFYT 118 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 G+ M H K NI + +PPL EQ I + + + ++D Sbjct: 119 QSHGSGMVHITKKQFENIKLLLPPLEEQKRIVQTLEQISTKLD 161 >gi|294339299|emb|CAZ87655.1| Putative Restriction modification system protein [Thiomonas sp. 3As] Length = 407 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 60/425 (14%), Positives = 124/425 (29%), Gaps = 54/425 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + F +L G L +E G Sbjct: 3 SDWRQSNLGEFVRLQRGHD-----------LTSLEQRPGNVPVMGSAGPNGTHDVARATG 51 Query: 83 KGQILYGKLGPYLRKAIIAD-FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G ++ G+ G + + + +T V P L ++D+ Sbjct: 52 PG-VVIGRSGASIGRVHFSSSDYWPHNTCLYVTDFCGNNPRFAYYLLSTLDL----AKYN 106 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ + I ++P+ IP EQ I E + RID L + + ++ Sbjct: 107 SGSAQPSLNRNFIYSMPVEIPGRREQDEIVEVLQTIDDRIDLLRQTNATLEAIAQALFKS 166 Query: 202 LVS-----YIVTKGLNPDVK------MKDSGIEW--VGLVPDHWEVKPFFALVTELNR-- 246 +G P+ + S E +G +P W V + T LN Sbjct: 167 WFVDFDPVRAKAEGREPEGMDAETAALFPSEFEESELGAIPKGWRVGALDSFATYLNGLA 226 Query: 247 --KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 K L + ++ T + + IV G+++F + Sbjct: 227 LQKYPPESAEEYLPVIKIAQLRAGHTNSADKASAQLKPEYIVRDGDVLFSWSGSLE---- 282 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL--CKVFYAMGSGLRQSLKFEDV 362 G + V + +L + L + A + ++ + Sbjct: 283 -VELWCGGVGALNQHLFKVTS-CKVPKWFYYLATKHFLPGFRDIAAHKATTMGHIQRRHL 340 Query: 363 KRLPVLVPPIKEQFDITNVINVETAR----IDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + +P + V++ + +D V Q+ L+ R S + ++G+ Sbjct: 341 AEARLAMPAL-------AVLDELSPLMGPLLDRRVNGGLQARELV-AIRDSLLPRLISGK 392 Query: 419 IDLRG 423 + ++ Sbjct: 393 LPVKE 397 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 46/147 (31%), Gaps = 14/147 (9%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRT-----SESGKDI-IYIGLEDVESGTGK 62 ++++S +GAIPK W+V + F G ES ++ I + + +G Sbjct: 197 EFEESE---LGAIPKGWRVGALDSFATYLNGLALQKYPPESAEEYLPVIKIAQLRAG--- 250 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + I G +L+ G L + G + + V Sbjct: 251 -HTNSADKASAQLKPEYIVRDGDVLFSWSGS-LEVELWCGGVGALNQHLFKVTSCKVPKW 308 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHA 149 R A + TM H Sbjct: 309 FYYLATKHFLPGFRDIAAHKATTMGHI 335 >gi|308513224|ref|YP_003933627.1| hypothetical protein HMPREF0868_1373 [Clostridiales genomosp. BVAB3 str. UPII9-5] gi|307346930|gb|ADN43914.1| conserved hypothetical protein [Clostridiales genomosp. BVAB3 str. UPII9-5] Length = 165 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 54/163 (33%), Gaps = 5/163 (3%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 Y + + + + ++ +IV + +A + Sbjct: 1 MHYGQMYTHFGIYATEPLKYISEDVAKKSKMAVKNDIVMAVTSENVEDVCKCTAWLGNEN 60 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIK 373 I S + A+ H ++ YL++ + + G + + + + + +P + Sbjct: 61 IAVSGHTAIIHHNQNAKYLSYYFHTAMFFAQKKRLAHGTKVIEVTPNALNDIVIPLPSLA 120 Query: 374 EQFDITNVINVETARIDVLV----EKIEQSIVLLKERRSSFIA 412 +Q I ++++ A + L +IE + R ++ Sbjct: 121 DQERIVSILDRFDALCNDLSRGLPAEIEARRKQYEYYRDKLLS 163 >gi|332202400|gb|EGJ16469.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA41317] Length = 240 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 67/185 (36%), Gaps = 14/185 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL------- 275 E +P+ WE + + + R + + + + ++ L Sbjct: 56 EVPCEIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQWSGFSIDLARFIDPE 115 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----YMAVKPHGIDS 330 SY+ +++ G++++ L R ++ G + + V I+ Sbjct: 116 TVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINC 175 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + S + V SG ++ L + +K + +PP+ EQ I + I A Sbjct: 176 HFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAH 235 Query: 389 IDVLV 393 ID L+ Sbjct: 236 IDALI 240 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 58/181 (32%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGL-EDVESGTGKYLPKDGNSRQSDTST 77 IP+ W+ V + T + G++ + IY + + +G + + Sbjct: 60 EIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQWSGFSIDLARFIDPETVHS 119 Query: 78 VS---IFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 120 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIY 179 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 180 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 239 Query: 185 I 185 I Sbjct: 240 I 240 >gi|15902489|ref|NP_358039.1| type I restriction-modification system S subunit [Streptococcus pneumoniae R6] gi|116515880|ref|YP_815958.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae D39] gi|15458013|gb|AAK99249.1| type I restriction enzyme [Streptococcus pneumoniae R6] gi|116076456|gb|ABJ54176.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae D39] Length = 426 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 123/416 (29%), Gaps = 69/416 (16%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDSGIEWV---------------------------------------------- 225 +S + Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYGNKDETTSYPI 251 Query: 226 GLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGLK 276 +P+ W F +LV K + I +S ++ N + Sbjct: 252 YEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISKL 311 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + I G ++ F L II+ + I YL Sbjct: 312 ALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMIF 370 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I +++ ++ L Sbjct: 371 LPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIIFKVDLLFQKVSQL 424 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 251 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 310 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 311 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 368 >gi|315445329|ref|YP_004078208.1| restriction endonuclease S subunit [Mycobacterium sp. Spyr1] gi|315263632|gb|ADU00374.1| restriction endonuclease S subunit [Mycobacterium sp. Spyr1] Length = 420 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 48/420 (11%), Positives = 114/420 (27%), Gaps = 41/420 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS---ESGKDIIYIGLEDV--ESGTGKYLPKDGNSRQSDT 75 +P +W P+ G G ++ L DV + S Sbjct: 18 LPANWDEAPLAEIGGFKNGINKGADSFGHGFPFVNLMDVFGITRIRDTTTLGLISSSEVE 77 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG------ICSTQFLVLQPKDVL-PELLQGWL 128 +G +L+ + +A + S L + D L Sbjct: 78 RRNYNLREGDVLFVRSSVKPSGVGLATLIARSLPDTVFSGFLLRFRSNDRLANSFKAYLF 137 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKI---IAETVRIDTL 184 R+ + ++ + + +G++ + P EQ I + + ++ L Sbjct: 138 SDAGFRNRVIGASTVSANTNINQRTLGSLSVRFPQSRLEQESIAQALSDADLLIETLERL 197 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I ++ + +++ AL S G V F + T Sbjct: 198 IAKKKAIKHGMMQQQFALPSMA-------------------GECATLGSVANFMSGGTPD 238 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + + ++ T + + + P + Sbjct: 239 RSNAEHWSGNIPWISATTLRQVEVSTSEQHVTSRAVRAGSKMAPLGSTLMLVRGSALHSE 298 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDV 362 +R++ V+ A+ P + ++ + S L + + Sbjct: 299 IRASLVIAPVCFNQDVKALVPLPRMVPKFLTYSIHANTDRLLRLVTSAGNTAGVLDTKVL 358 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 K + VP Q + +V + T + + + ++ + + +TG+ L Sbjct: 359 KAFELWVPRRDVQEHVVSVFDAVTTEL----ALLTAKLEKVRATKQGMMQELLTGRTRLP 414 >gi|149003722|ref|ZP_01828567.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS69] gi|149025495|ref|ZP_01836431.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP23-BS72] gi|147758284|gb|EDK65285.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS69] gi|147929445|gb|EDK80441.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP23-BS72] Length = 426 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 124/416 (29%), Gaps = 69/416 (16%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDSGIEWV---------------------------------------------- 225 +S + Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEDKIKKKDLDISIVSQGDDNSYYGNKDETTSYPI 251 Query: 226 GLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGLK 276 +P+ W F +LV K + I +S ++ N + Sbjct: 252 YEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISKL 311 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + I G ++ F L II+ + I YL Sbjct: 312 ALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMIF 370 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I + +++ ++ L Sbjct: 371 LPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLLFQKVSQL 424 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 251 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 310 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 311 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 368 >gi|260641864|ref|ZP_05413806.2| type I restriction-modification system, S subunit [Bacteroides finegoldii DSM 17565] gi|260624415|gb|EEX47286.1| type I restriction-modification system, S subunit [Bacteroides finegoldii DSM 17565] Length = 353 Score = 71.4 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 64/384 (16%), Positives = 122/384 (31%), Gaps = 59/384 (15%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WK IK + SGK I + L+ +ES TG+ + K ++ S S F Sbjct: 16 KGWKTAKIKDVAPEMPSKEQLSGK-IWLLNLDMIESNTGRIIEKVYEDVENALSVQS-FD 73 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIEAI 140 +G +L+ KL PYL K +I D G+ +T+ + L+P+ + L I Sbjct: 74 EGNVLFSKLRPYLNKVVIPDEPGMATTELVPLRPEPSKLHKVFLSHLLRGNQFVNYANDI 133 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G M + N +PP+ +Q+ Sbjct: 134 AGGTKMPRMPLTELRNFDCILPPMDKQLEFVFIAEQVDK--------------------- 172 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + K IE G + + E Sbjct: 173 -----------SKFGDFKSQFIEMFGGLCQDTPWSDVVTITNGKAYPEEYQEEGAYPICG 221 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G I+ E + + + + N+ + S + + Sbjct: 222 SGGIMCYGEKK-------------LCNGNTTILGRKGNINNPIFMESGYWIVDTAFS--- 265 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + V + + + YD K+ G+ SL +D++++ + +P +++Q + Sbjct: 266 IDVDKAKLHPKFFYYWCCQYDFTKLNK---QGVLPSLTRKDLEKVKMAIPQMRDQLKFVS 322 Query: 381 VINVETARIDVLVEKIEQSIVLLK 404 + D I++++V L Sbjct: 323 IAEQA----DKSKSVIQKALVYLN 342 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 53/171 (30%), Gaps = 6/171 (3%) Query: 221 GIEWVGLV---PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 IE G W+ + E+ K + +L+L + Sbjct: 4 FIEMFGNPVTNTKGWKTAKIKDVAPEMPSKEQLSGKIWLLNLDMIESNTGRIIEKVYEDV 63 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 E+ + Q D G ++F + +K + + +P + +L+ L+ Sbjct: 64 ENALSVQSFDEGNVLFSKLRPYLNKVVIP--DEPGMATTELVPLRPEPSKLHKVFLSHLL 121 Query: 338 RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 R + G + +++ ++PP+ +Q + + Sbjct: 122 RGNQFVNYANDIAGGTKMPRMPLTELRNFDCILPPMDKQLEFVFIAEQVDK 172 >gi|331007826|ref|ZP_08330929.1| Type I restriction-modification system, specificity subunit S [gamma proteobacterium IMCC1989] gi|330418368|gb|EGG92931.1| Type I restriction-modification system, specificity subunit S [gamma proteobacterium IMCC1989] Length = 604 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 64/204 (31%), Gaps = 9/204 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI---IQKLETRNMGLK 276 S E + ++ V + E+ +N + + + I + + Sbjct: 102 SDEEKPFKLLNNGWVWTQLGEIAEIAPRNALDDDMEVGFVPMPRITTSYDGSHEQEVRPW 161 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHGIDSTY 332 + Y G+I I + + ++ G ++ Y Sbjct: 162 GTIKKGYTHFSNGDIALAKITPCFENSKAAVFRGLKNGYGAGTTELHIARPIQDTVNPLY 221 Query: 333 LAWLMRSYDL--CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + +++ GS ++ + P+ +PP+KEQ I +N D Sbjct: 222 ILLYLKAPMFLEKGKSKMTGSAGQKRIPNSYFSGNPLPLPPLKEQHRIVTKVNELMTLCD 281 Query: 391 VLVEKIEQSIVLLKERRSSFIAAA 414 L ++ E SI + + ++A Sbjct: 282 QLEQQQETSITAHQTLVETLLSAL 305 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 58/479 (12%), Positives = 121/479 (25%), Gaps = 98/479 (20%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W + ++ + ++ ++ + + + ++ + + F+ Sbjct: 113 NGWVWTQLGEIAEIAPRNALDDDMEVGFVPMPRITTSYDGSHEQEVRPWGTIKKGYTHFS 172 Query: 83 KGQILY--------GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV- 133 G I R G V P + +L + Sbjct: 173 NGDIALAKITPCFENSKAAVFRGLKNGYGAGTTELHIARPIQDTVNPLYILLYLKAPMFL 232 Query: 134 -------------TQRIEAICEGATMSHADWKGIGNIPMPIPPLA--------------- 165 + + G + K I + L Sbjct: 233 EKGKSKMTGSAGQKRIPNSYFSGNPLPLPPLKEQHRIVTKVNELMTLCDQLEQQQETSIT 292 Query: 166 -EQVLIREKIIAETV------------RIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 Q L+ + A T RI + + K++++ V L Sbjct: 293 AHQTLVETLLSALTNSADNKCFEQAWTRIAENFDTLFTTEHSIDQLKKSILQLAVMGKLV 352 Query: 213 PD---------------------------VKMKDSGIEWVGLVPDHWEVKPFFALVT--- 242 P K K G +P + + Sbjct: 353 PQDSSNEPAEILLQNIAKEKEYLIENKKIRKAKLRGATIEYELPFEVSGNSIWTTIDKIS 412 Query: 243 ---------ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP----- 288 E + ++ + L+ I R+ +K S E + + Sbjct: 413 LRVIDGNYGESYPTKNEFLDEGVPFLTSAAIGLSGNIRHDKVKYISKEKHAELRKAQSST 472 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVF 346 +I+ + L + + I + +D YL M++ K Sbjct: 473 NDILLTNRGARAGAVGLLEDAIYKDCNIGPQLTSIRCLDQYVDPNYLLIYMQTNVFIKFL 532 Query: 347 YAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ-SIVLL 403 SG + + LPV++ P+KEQ I + + + D+L E+I+ I L Sbjct: 533 NEANSGSAMNFVNLAKTVALPVVLHPLKEQKRIVSKVGDLFSLCDLLKEQIKNSQISQL 591 >gi|323698909|ref|ZP_08110821.1| restriction modification system DNA specificity domain [Desulfovibrio sp. ND132] gi|323458841|gb|EGB14706.1| restriction modification system DNA specificity domain [Desulfovibrio desulfuricans ND132] Length = 532 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 60/407 (14%), Positives = 130/407 (31%), Gaps = 29/407 (7%) Query: 25 WKVVPIKRFTKLN-TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + + T R + I+ I + D + K SR DT+ I + Sbjct: 3 WGFRTLDALLDKSGTDRAGKQDLPILSITMSDGLVDQSEKFKKRVASR--DTTKYRIAHR 60 Query: 84 GQILYG-KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG---WLLSIDVTQRIEA 139 +++ G + + GI S + + + K + +L S Q + Sbjct: 61 NELVVGFPIDEGVLGFQTKYPAGIVSPAYDIWKLKSPNDTFIPYLERYLRSNQARQIYAS 120 Query: 140 ICEGA--TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +GA + +P P +Q I + I + +L Sbjct: 121 KMKGAVARRRSLSKVDFLGLEIPFPSFDDQKRIAHLLGKVEGLIARRKQHLQQLDDL--- 177 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 L S + +P E +G + + R + K Sbjct: 178 ----LKSVFLKMFGDPVRNEMGWETELLGEL-----ATIERGRFSPRPRNDPKFYNGAYP 228 Query: 258 SLSYGNIIQ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + G+I + +L L + + D G IV + + ++ Sbjct: 229 FIQTGDISRSNGRLREYTQTLNELGIKVSKKFDVGTIVIAIVGATIGETAILQIPTYAPD 288 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + +S ++ +L+R + + R ++ E ++ LPV+ P K+ Sbjct: 289 SVIGITPKSATKETESVFIEFLLRFWKPV-LRARAPEAARANINIETLRPLPVICPLDKD 347 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + ++ +++ L + +QS+ ++ + A G++DL Sbjct: 348 RERFATIVE----KVEDLKSRYQQSLADMEYLYGALSQKAFNGELDL 390 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 52/199 (26%), Gaps = 14/199 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W+ + + GR S ++ +I D+ G+ + Sbjct: 195 GWETELLGELATIERGRFSPRPRNDPKFYNGAYPFIQTGDISRSNGRLREYTQTLNELGI 254 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK--DVLPELLQGWLLSIDV 133 F G I+ +G + + I + + PK E + L Sbjct: 255 KVSKKFDVGTIVIAIVGATIGETAILQIPTYAPDSVIGITPKSATKETESVFIEFLLRFW 314 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + A A ++ + + + +P+ P ++ + Sbjct: 315 KPVLRARAPEAARANINIETLRPLPVICPLDKDRERFATIVEKVEDLKSRYQQSLADMEY 374 Query: 194 LLKEKKQALVSYIVTKGLN 212 L AL L+ Sbjct: 375 LYG----ALSQKAFNGELD 389 >gi|307246970|ref|ZP_07529034.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306852112|gb|EFM84353.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 244 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 60/184 (32%), Gaps = 15/184 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFALVT-----ELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + ++ +P+ W + + ++ I +S + K Sbjct: 63 TEQDFPFEIPESWVWVRLEDVCQEISDIDHKMPQEYKGKNGIPYISPKDFYDKNGIDFAN 122 Query: 275 LKPESYETYQIV------DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 K S E Y ++ +I+F R + + +++ + ++ I Sbjct: 123 AKKVSEEDYFLLSKKFAPQKNDIIFPRYGTIGVVRIIEENI---KLLVSYSCACIRVEYI 179 Query: 329 DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + Y+ + S + + ++ + +K+ + +PP+ EQ I I Sbjct: 180 NMQYVVAYLNSELAKLEIKKYTNKTTQPNVGLKSIKKFIIPLPPLNEQKRIVAKIEELLP 239 Query: 388 RIDV 391 I+ Sbjct: 240 YIEQ 243 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 59/175 (33%), Gaps = 10/175 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP+ W V ++ + + + + I YI +D G + Sbjct: 70 EIPESWVWVRLEDVCQEISDIDHKMPQEYKGKNGIPYISPKDFYDKNGIDFANAKKVSEE 129 Query: 74 DT---STVSIFAKGQILYGKLGPYLRKAIIADFDG-ICSTQFLVLQPKDVLPELLQGWLL 129 D S K I++ + G II + + S ++ + + + + +L Sbjct: 130 DYFLLSKKFAPQKNDIIFPRYGTIGVVRIIEENIKLLVSYSCACIRVEYINMQYVVAYLN 189 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 S I+ T + K I +P+PPL EQ I KI I+ Sbjct: 190 SELAKLEIKKYTNKTTQPNVGLKSIKKFIIPLPPLNEQKRIVAKIEELLPYIEQY 244 >gi|291526090|emb|CBK91677.1| Restriction endonuclease S subunits [Eubacterium rectale DSM 17629] Length = 367 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 53/368 (14%), Positives = 115/368 (31%), Gaps = 28/368 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V ++ + G ++ I D+ +G Y P G S + + + Sbjct: 3 VKLEDVCE--RGSSN--------IKQSDIIKMSGNY-PIYGASGLAGKVNFYHQEQPYVA 51 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 K G + + + L PK + +++S +E GAT+ Sbjct: 52 VVKDGAGIGRTTLNPAKSSVIGTMQYLIPKKNVLPEYLFYVVS---YMHLEKYYTGATIP 108 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 H +K N + + +Q+ I + + R +I R + + L +A + Sbjct: 109 HIYFKDYKNKEFNLDNIEKQLEIIDVL----GRCKKVIEARKQELVELDSLTKARFVELF 164 Query: 208 --TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + N +K S +G +++ I L G Sbjct: 165 GDIRCNNKLPLVKLSEFVNIG-------SSKRIYANEYVDKGVPFYRSKEIRELGTGMKP 217 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 E E Y + G+I+ I + + + Sbjct: 218 SVELYIKQERYDEIKEKYGVPKKGDILIAAIGATIGYSWIVDTDTPFYYKDGNLIILSIK 277 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + ++ +L + MR + + +L E ++++ V+ P IK Q + ++ Sbjct: 278 NNVNPIFLNYTMRILIEDFKNKDVAGSAQLALTIEKLEKMMVVNPDIKLQNQFADFVHQV 337 Query: 386 T-ARIDVL 392 ++ D + Sbjct: 338 NKSKFDTM 345 Score = 36.7 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 13/75 (17%), Positives = 28/75 (37%), Gaps = 4/75 (5%) Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +L + + F+D K + I++Q +I +V+ R ++E Sbjct: 88 YLFYVVSYMHLEKYYTGATIPHIYFKDYKNKEFNLDNIEKQLEIIDVLG----RCKKVIE 143 Query: 395 KIEQSIVLLKERRSS 409 +Q +V L + Sbjct: 144 ARKQELVELDSLTKA 158 >gi|218550409|ref|YP_002384200.1| restriction modification system DNA specificity domain [Escherichia fergusonii ATCC 35469] gi|218357950|emb|CAQ90594.1| putative restriction modification system DNA specificity domain [Escherichia fergusonii ATCC 35469] Length = 524 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 50/370 (13%), Positives = 98/370 (26%), Gaps = 32/370 (8%) Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVL--QPKDVLPE 122 N + S G +L G K+ + S + PK V Sbjct: 91 INDEDDEKLKRSRLVDGDVLLTITGAKFGKSAVVSAKHLPANISQHSVRFKPDPKKVDAY 150 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L +L S I GAT D+ + ++ +P Q I +K+ Sbjct: 151 FLVAYLNSKTGQVAIWKEAYGATRPAIDFPSVRSLAVPKVLPLAQKYIGDKVRQAEQLRV 210 Query: 183 TLITERIRFIELLKEKKQ-------------------ALVSYIVTKGLNPDVKMKDSGIE 223 + + +L G + Sbjct: 211 WAKRLNSVLQSQIHSVFKGDPKPEKRIGKVISIQQLSSLRLEAEYYGDLELWAELEIKNS 270 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPES 279 P + S I + N+I + + + + Sbjct: 271 PFPNKPLGELSSRIKDGPGGWAVSTSDYRPSGIPVIRSVNLIDGRCELEDCVFISKEKHN 330 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 V PG ++ S + + + I+ YLA + S Sbjct: 331 DLRSHQVKPGGLLLSVRGTIGRAAVFDSEKYSTASLNAAVVTIDCKPTINPYYLAAFLNS 390 Query: 340 YDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITN-VINVETARI--DVLVEK 395 + +G Q ++ ++ +++PPI Q I ++ A I ++L++ Sbjct: 391 EVGRIQSNRIANGAVQLNMNLKETASNLIVIPPINLQETIAATFLSKNRAIILANLLIQS 450 Query: 396 IEQSIVLLKE 405 + + L E Sbjct: 451 AKTLVEALIE 460 >gi|126208661|ref|YP_001053886.1| putative type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae L20] gi|126097453|gb|ABN74281.1| putative type I restriction-modification system, S subunit [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 404 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 60/197 (30%), Gaps = 7/197 (3%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M + I++ + + + ++ N + + + Sbjct: 1 MVVNYIKFNDTEIEFIDGDRGIHYPKKEEFSSSGYCVFLNTGNVTSNGFNFNDLDFITKE 60 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLA 334 + V P +IV + + ++ + I S + ++ + +L Sbjct: 61 KDELLRKGRVIPHDIVLTTRGTVGNVAYVSENELYKNIRINSGMVIIRSDCSKYEPYFLY 120 Query: 335 WLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 RS K GSG + L +K + ++ Q I V++ +D + Sbjct: 121 SFFRSELFKKQCEYNGSGSAQPQLPISALKNISFPNFNLETQQKIAQVLST----LDRKI 176 Query: 394 EKIEQSIVLLKERRSSF 410 +Q L++ + Sbjct: 177 ALNQQISAKLEKMAKTL 193 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 52/402 (12%), Positives = 115/402 (28%), Gaps = 36/402 (8%) Query: 38 TGRTSESGKDII------YIGLEDVESGTGKYLPKDGNSRQSDTSTVS-IFAKGQILYGK 90 G ++ ++ +V S + D +++ D I+ Sbjct: 20 RGIHYPKKEEFSSSGYCVFLNTGNVTSNGFNFNDLDFITKEKDELLRKGRVIPHDIVLTT 79 Query: 91 LGPYLRKAIIADFDGICSTQF------LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 G A +++ + + + + P L + S ++ E G+ Sbjct: 80 RGTVGNVAYVSENELYKNIRINSGMVIIRSDCSKYEPYFLYSFFRSELFKKQCEYNGSGS 139 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + NI P L Q I + + +D I + L++ + L Sbjct: 140 AQPQLPISALKNISFPNFNLETQQKIAQVL----STLDRKIALNQQISAKLEKMAKTLYD 195 Query: 205 YIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALVTELNR--KNTKLIESNILSL 259 Y + PD K SG E V +V + N K + + Sbjct: 196 YWFVQFDFPDENGNPYKSSGGEMVYNPELKRDVPKGWECDFVENYLDKVPNTDKIPSKEI 255 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 I ++ + + +++P + F D R ++ Sbjct: 256 QVKGQIPVIDQSQDYICGFTDNENALLEPIDAHIIFGD---HTRVVKLVNFPYARGADGT 312 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + + +L + M G + ++ +K VL+P I Sbjct: 313 QIIISNNKKLPNFLFYQM-----IAKIDLSNYGYARH--YKFLKESKVLIPT----EYIA 361 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + L + + L + R + + GQ+++ Sbjct: 362 QKYHQTVKPYFDLWKTNLKETQKLTQLRDFLLPMLMNGQVEV 403 >gi|312872265|ref|ZP_07732335.1| conserved domain protein [Lactobacillus iners LEAF 2062A-h1] gi|311092088|gb|EFQ50462.1| conserved domain protein [Lactobacillus iners LEAF 2062A-h1] Length = 378 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 60/388 (15%), Positives = 127/388 (32%), Gaps = 47/388 (12%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + +L ++ I + + K + + + I G+ + Sbjct: 7 KLGELIELLGNTNNDLQYGIEDVR---GVNNLKKMMSTKADLNGRNLGKFQIVYPGEFFF 63 Query: 89 GKLGPYLR-----KAIIADFDGICSTQFLVLQPKDV-----LPELLQGWLLSIDVTQRIE 138 IC+ ++V + K + L E L + + + + Sbjct: 64 NHRTSRNGSKFSITYNYESNPIICTEDYVVFRLKKICENILLKEWLYMYFNRSEFDRFVI 123 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G++ +W + +I + +PPLA Q A Sbjct: 124 TNSWGSSTEFYNWSDVCDIELHLPPLAIQQKYVNVYNAMVAN------------------ 165 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 +GL D+ IE + + + + + R + + ++ + + Sbjct: 166 -----QKAYERGLEDLKLTCDAYIEDLRRKYE------LQEIGSYIERIDERNKDNQLTN 214 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + + L S Y+IV G+I + +N R L E +I+S Sbjct: 215 VKGLTVYKHFIDTKANLTNVSITNYKIVRVGDIGYVPTTNRNGDR-LACGLCNEDCLISS 273 Query: 319 AYMAVKPHGID--STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQ 375 Y ++P S YL R + + R++ F D++ + +PP++ Q Sbjct: 274 IYEVIRPDNSKLRSDYLFLWFRRSEFDRYVRYCSWGSARETFDFRDMEEFSIPIPPLEIQ 333 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLL 403 I ++ V T R D + EK++ I + Sbjct: 334 NSIADIYKVYTERKD-INEKLKAQIKAI 360 >gi|237650526|ref|ZP_04524778.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae CCRI 1974] Length = 184 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 64/181 (35%), Gaps = 14/181 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPES 279 +P+ WE + + + R + + + + ++ L S Sbjct: 4 EIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQWSGFSIDLARFIDPETVHS 63 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDSTYLA 334 Y+ +++ G++++ L R + A + V I+ ++ Sbjct: 64 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIY 123 Query: 335 WLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S + V SG ++ L + +K + +PP+ EQ I + I A ID L Sbjct: 124 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 183 Query: 393 V 393 + Sbjct: 184 I 184 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 57/181 (31%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP+ W+ V + T + G++ + + I + + ++ S Sbjct: 4 EIPESWEWVRLNDITSYIQRGKSPKYSNISIYPVIAQKCNQWSGFSIDLARFIDPETVHS 63 Query: 77 --TVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 64 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIY 123 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI ID L Sbjct: 124 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 183 Query: 185 I 185 I Sbjct: 184 I 184 >gi|237750518|ref|ZP_04580998.1| type I restriction-modification system S subunit [Helicobacter bilis ATCC 43879] gi|229374048|gb|EEO24439.1| type I restriction-modification system S subunit [Helicobacter bilis ATCC 43879] Length = 356 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 62/181 (34%), Gaps = 10/181 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +P+ W + + E + + + T+++ I Sbjct: 176 EIPNSWAWVKLGDICEIFTGDSINATEKEKNFTHQTSGLNYIATKDLANDTSITYENGIK 235 Query: 287 DPGEIVFRF---------IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 P E + F + ++ + + E + + S ++ + + Sbjct: 236 IPDEFLPSFKIAKANSTLLCIEGGSAGRKVGFLKENVCFGNKLCCFENIFAFSKFVFYFL 295 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV-LVEKI 396 +S + K F + +G+ +K E ++ + +PP+KEQ +I +++ + + K Sbjct: 296 QSGEFSKEFNSNINGIIGGVKKESIRHFLIPLPPLKEQQEIVKKLDLLVTLANDFAITKE 355 Query: 397 E 397 Sbjct: 356 N 356 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 55/172 (31%), Gaps = 12/172 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----------DIIYIGLEDVESGTGKYLPKDGN 69 IP W V + ++ TG + + + + YI +D+ + T Sbjct: 176 EIPNSWAWVKLGDICEIFTGDSINATEKEKNFTHQTSGLNYIATKDLANDTSITYENGIK 235 Query: 70 SRQSDTSTVSIFAKGQILYGKL-GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + I L G RK + + + + + +L Sbjct: 236 IPDEFLPSFKIAKANSTLLCIEGGSAGRKVGFLKENVCFGNKLCCFENIFAFSKFVFYFL 295 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 S + ++ + G + + I + +P+PPL EQ I +K+ Sbjct: 296 QSGEFSKEFNSNING-IIGGVKKESIRHFLIPLPPLKEQQEIVKKLDLLVTL 346 >gi|329733095|gb|EGG69432.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus 21193] Length = 204 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 63/191 (32%), Gaps = 4/191 (2%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 + G W K ++ N++ E +L+ S +I + + Sbjct: 15 DEEGNYYKGWNKKQLKDVLEFSNKRTINENEYPVLTSSRQGLILQSDYYKDRKTFAESNI 74 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + P + + +++ GII+ Y K + YL + Sbjct: 75 GYFILPKNHITYRSRSDDGIFKFNLNLMIDVGIISKYYPVFKGIDANQYYLTLHLNYQLK 134 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + L +D++ + +P +EQ I + + ID LVEK + Sbjct: 135 KEYIKYATGTSQLVLSQKDLQNIKTKLPSYEEQQKIGDF----FSEIDRLVEKQSSKVGR 190 Query: 403 LKERRSSFIAA 413 LK R+ + Sbjct: 191 LKVRKKELLQK 201 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 53/184 (28%), Gaps = 5/184 (2%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W +K + + RT + + Y KD + I Sbjct: 22 KGWNKKQLKDVLEFSNKRTINENEYPVLTSSRQGLILQSDY-YKDRKTFAESNIGYFILP 80 Query: 83 KGQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K I Y G + + GI S ++ + + L+ + + Sbjct: 81 KNHITYRSRSDDGIFKFNLNLMIDVGIIS-KYYPVFKGIDANQYYLTLHLNYQLKKEYIK 139 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + K + NI +P EQ I + ++ ++ R KE Sbjct: 140 YATGTSQLVLSQKDLQNIKTKLPSYEEQQKIGDFFSEIDRLVEKQSSKVGRLKVRKKELL 199 Query: 200 QALV 203 Q + Sbjct: 200 QKMF 203 >gi|111224381|ref|YP_715175.1| Type I restriction-modification system, S subunit [Frankia alni ACN14a] gi|111151913|emb|CAJ63634.1| Type I restriction-modification system, S subunit [Frankia alni ACN14a] Length = 443 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 47/410 (11%), Positives = 116/410 (28%), Gaps = 33/410 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYL----PKDGNSRQSD 74 W + G + DI + + D+ SR Sbjct: 31 WTTTSLGSIVTFWPGYAFPEVEQGKISGDIPFFKVGDMSRPGNDVALNSAEHYVTSRTCR 90 Query: 75 TSTVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 G + + K+G R+ +I + + V + L + +I Sbjct: 91 FFGWKPCPAGAVAFAKVGAALLKNRRRLITQDTLLDNNMLAVAPRPGISSRYLYWLMQTI 150 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 D + + + IG+ + I PL+EQ I E + + + + Sbjct: 151 DF----SRFVQDGAVPSVNQNQIGSYKVAIAPLSEQQKITEVLDTVDEAVRSTERLIAKL 206 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNT 249 Q + ++ + + W +G + + N Sbjct: 207 YIERAGIIQERLGEWESRHADNSDGS-SRDVRWVQLGDI---VRETLLGTPLRGRKDGNI 262 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ--NDKRSLRS 307 L++ +S N+ + S + + G+++F + K ++ Sbjct: 263 LLVKMGNISGGMLNM--EHTEHISRSIVGSSIGHLELQHGDLLFNTRNTPDLVGKTAVWP 320 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRL 365 + + + ++ M + +G ++ + D+ + Sbjct: 321 KNLPPAICDNNILRIRFQPEVLPEFVNAYMSWGLGRNRLARLATGTTSVAAIYWRDLCKF 380 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 P+ VP I EQ + + I+ +R+ + + + + + Sbjct: 381 PIPVPAISEQRRLVSGIDYSGSRL----SCEQVELEKFLLIKQGLMDDLL 426 >gi|2408222|gb|AAB70708.1| HsdS [Klebsiella pneumoniae] Length = 439 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 63/443 (14%), Positives = 130/443 (29%), Gaps = 73/443 (16%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK + F +L G G+ + G + D + Sbjct: 5 WKECELGDFIELKRGYDLPKSTR---------NEGSIPIISSSGFT---DFHDKPMVKGP 52 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI-CEG 143 ++ G+ G + +T V+ K P + L +I + G Sbjct: 53 GVVTGRYGTIGEVFYSEEDFWPLNTTLYVVDFKGNDPLFVYYLLQTISYADYTDKAAVPG 112 Query: 144 ATMSHADW--KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK-EKKQ 200 +H + +A Q+ EK + +I+ + + + + Sbjct: 113 VNRNHLHKAKVKVPIYLDIQQKVAAQLYQLEKRVTLGKQINQTLEQMSQTLFKSWFVDFD 172 Query: 201 ALVSYIVTKGLNPDVKMKDSGIE-----------------------------WVGLVPDH 231 ++ + G NP + S E +G VP Sbjct: 173 PVIDNALDAG-NPIPEALQSRAELRQKVRSSADFKPLPADIRALFPAEFEETELGWVPKD 231 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLS------------YGNIIQKLETRNMGLKPES 279 W K + T K + S GN ++ + L ++ Sbjct: 232 WYHKNAEEIATISIGKTPPRNQKECFSHKKDSNYTWVSIKDLGNCNVFIKESSEYLTTDA 291 Query: 280 YETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 Y IV G ++ F S I A+ HG++ YL + Sbjct: 292 VNNYNVKIVPKGAVLLSFKLTIGRIAIAESILTTNEAI---AHFYNMKHGVNKEYLYSYL 348 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE--QFDITNVINVETARIDVLVEK 395 + +D + S + ++ + ++++P+L+P Q+ I+ T I + Sbjct: 349 QHFDYNTL--GSTSSIATAVNSKIIRKIPILLPDTDILHQYKIS------TDIIFKRISF 400 Query: 396 IEQSIVLLKERRSSFIAAAVTGQ 418 ++ L R + + ++G+ Sbjct: 401 NNRNTYDLTALRDTLLPKLISGE 423 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 25/199 (12%), Positives = 54/199 (27%), Gaps = 14/199 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTS----------ESGKDIIYIGLEDVESGT--GKYLP 65 +G +PK W + ++ G+T + + ++ ++D+ + K Sbjct: 225 LGWVPKDWYHKNAEEIATISIGKTPPRNQKECFSHKKDSNYTWVSIKDLGNCNVFIKESS 284 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + + V I KG +L + + IA+ + Sbjct: 285 EYLTTDAVNNYNVKIVPKGAVLLS-FKLTIGRIAIAESILTTNEAIAHFYNMKHGVNKEY 343 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + + + K I IP+ +P + RI Sbjct: 344 LYSYLQHFDYNTLGSTSSIA-TAVNSKIIRKIPILLPDTDILHQYKISTDIIFKRISFNN 402 Query: 186 TERIRFIELLKEKKQALVS 204 L L+S Sbjct: 403 RNTYDLTALRDTLLPKLIS 421 >gi|293363458|ref|ZP_06610215.1| type I restriction modification DNA specificity domain protein [Mycoplasma alligatoris A21JP2] gi|292552978|gb|EFF41731.1| type I restriction modification DNA specificity domain protein [Mycoplasma alligatoris A21JP2] Length = 398 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 48/397 (12%), Positives = 115/397 (28%), Gaps = 26/397 (6%) Query: 22 PKHWKVVPIKRFTKL--NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 P + ++ + T ++ K + + SG L + S+ Sbjct: 13 PNGVEFKKMETLLDYEHSNKYTVKNIKYSNQFKIPVLTSGKTFLLGYTDETENIFFSSKV 72 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 K IL+ +++ S+ +L K+ L + ++ + Sbjct: 73 ---KPIILFDDFTANVKRVDFNFKLK--SSAIKILILKNPNNNLKYFFYWLANLKYKPTE 127 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 K +P+PP+ Q I E + T L E + +++ Sbjct: 128 HARQWI------KVYSQFDIPMPPIEIQNKIVEILDNFTELTAELTAELTAELTARQKQY 181 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + + ++ N + K E +H + +K +L + + Sbjct: 182 KYFRNMLMDYDNNDSLFNKIINKETNKDCREHNFINKDILKNIVSIKKGAQLNKDKFIKG 241 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 SY N G+K + VD I+ + + Sbjct: 242 SYP-------VFNGGVKESGWHNEYNVDENTIIISQGGSLS--GYVNYIDQKFWASAHCF 292 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y+ K + + ++ + L ++ L + +P + Q I Sbjct: 293 YIECKNNSPIINRYLYHFLKNKQKELMNSKEGAGIPGLGKNILEELEIFIPSVYVQEKIV 352 Query: 380 NVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++++ + + I L ++ R+ ++ Sbjct: 353 DILDKMEIYTKDIKTGLPLEIELRQKQYEYYRNLLLS 389 >gi|262374260|ref|ZP_06067536.1| predicted protein [Acinetobacter junii SH205] gi|262310818|gb|EEY91906.1| predicted protein [Acinetobacter junii SH205] Length = 433 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 57/399 (14%), Positives = 125/399 (31%), Gaps = 32/399 (8%) Query: 36 LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI---LYGKL- 91 L G+T++ + + + G+ +++ S+ + + ++ Sbjct: 40 LENGKTAKVEN----LPVGCIAHGSTEFIVLSAKSKDDEDFVYYLARLPDFRSYAISRME 95 Query: 92 GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADW 151 G R+ + + + + + ++L+ I + +I E + Sbjct: 96 GTSGRQRVSWQALAEFNLRLPEKGKRKKIGKILKSLDDKIHLNNQINQTLESIAQTIFKS 155 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 I P+ A+Q ++ A E + + + QA + L Sbjct: 156 WFIDFDPVRAKIAAKQEGKDAELAAMCAISGKSEAEVEQMAKEDFAELQATAT------L 209 Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK------LIESNILSLSYGNII 265 PD + +G VP WE+ A+ + T I LS G Sbjct: 210 FPDELV----ESELGEVPKGWEITNINAVTASIFSGGTPSTKEVTYWNGEIPWLSSGETR 265 Query: 266 QKLETRNMGLKPESYETYQIVDP---GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 K+ E+ G+I+ Q R S +E I S Sbjct: 266 NKIIVSTEKSITETAVKKSSTKLAIFGDILIASAG-QGHTRGQTSFNAIECYINQSIVAL 324 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + +L + + + R SL + + +PV++P Q + + Sbjct: 325 RANDKVSPYWLYYCLEPRYDEMRSVSDSHSSRGSLTTKLLASMPVILPT---QKLVVSF- 380 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + K + I +L + R + + ++G I++ Sbjct: 381 DKVIKPMLAQQVKNAKEIKMLADTRDALLPKLISGDIEV 419 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 59/196 (30%), Gaps = 9/196 (4%) Query: 18 IGAIPKHWKVVPIKRF-TKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNS 70 +G +PK W++ I + +G T + + +I ++ + + K Sbjct: 219 LGEVPKGWEITNINAVTASIFSGGTPSTKEVTYWNGEIPWLSSGETRNKIIVSTEKSITE 278 Query: 71 RQSDTSTVSIFAKGQILYGKL--GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 S+ + G IL G + + + + L+ D + + Sbjct: 279 TAVKKSSTKLAIFGDILIASAGQGHTRGQTSFNAIECYINQSIVALRANDKVSPYWLYYC 338 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 L + ++ K + ++P+ +P V + I + E Sbjct: 339 LEPRYDEMRSVSDSHSSRGSLTTKLLASMPVILPTQKLVVSFDKVIKPMLAQQVKNAKEI 398 Query: 189 IRFIELLKEKKQALVS 204 + L+S Sbjct: 399 KMLADTRDALLPKLIS 414 >gi|307253725|ref|ZP_07535589.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306858801|gb|EFM90850.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 272 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 51/178 (28%), Gaps = 5/178 (2%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 + E +P+ W + ++ G + ++ Sbjct: 100 RCIADEVPFEIPESWVWVRLSEISKITMGQSPDNK----YLGKEGIEFHQGKSFFSEYII 155 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ES + Y + I L I ++ P +++ +L + + Sbjct: 156 ESSDIYCSLPNKLATPNSILLCVRAPVGIVNITNRELCIGRGLASIDPIYVNTIFLYYAL 215 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 Y +++ + + + +PP+ EQ I I + + L +K Sbjct: 216 FCYKNY-YERKSTGSTFKAISKDIIDNTIIPIPPLNEQIRIVEKIETLFSTLQNLSQK 272 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/167 (21%), Positives = 57/167 (34%), Gaps = 10/167 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---S 76 IP+ W V + +K+ G++ ++ Y+G E +E GK + SD Sbjct: 109 EIPESWVWVRLSEISKITMGQSPDNK----YLGKEGIEFHQGKSFFSEYIIESSDIYCSL 164 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + IL P I + + + P V + + Sbjct: 165 PNKLATPNSILLCVRAPV-GIVNITNRELCIGRGLASIDPIYVN--TIFLYYALFCYKNY 221 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 E G+T I N +PIPPL EQ+ I EKI + Sbjct: 222 YERKSTGSTFKAISKDIIDNTIIPIPPLNEQIRIVEKIETLFSTLQN 268 >gi|260767611|ref|ZP_05876547.1| type I restriction-modification system specificity subunit S [Vibrio furnissii CIP 102972] gi|260617511|gb|EEX42694.1| type I restriction-modification system specificity subunit S [Vibrio furnissii CIP 102972] Length = 374 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 40/402 (9%), Positives = 106/402 (26%), Gaps = 33/402 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + L G + +P +S + + Sbjct: 2 SWVECQLGDILTLKRGYDLP------------HSARKSGSVPVVSSSGITGYHNTAKVEG 49 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW-LLSIDVTQRIEAICE 142 ++ G+ G + +T V K P+ + + + Q +A Sbjct: 50 PAVVTGRYGTLGEVYYVEGECWPLNTSLYVQDFKGNRPKFVYYFLQSVLKGMQSDKAAVP 109 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + + + QV + + I I+ E + Q Sbjct: 110 GVNRNDLHARKVKCTKDH----DVQVAVEKIISPYDDLIENNRRRIQLLEESARLLYQEW 165 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 ++ G + V +P+ W+ + K + I + Sbjct: 166 FVHLRFPG--------HEQVNIVDGLPEGWKNMQLTDIAKVNQASLKKGFDEKIEYIDIS 217 Query: 263 NIIQKLETRNMGLKPESY--ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + + +IV +I++ + +L + +R I ++ + Sbjct: 218 CVSTHSISDTTWYEFIDAPGRARRIVQHCDILWSCVRPNRRSHALVW-EPHDRLIASTGF 276 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDIT 379 + + +L + + + G ++ + LVP Sbjct: 277 AVISATEVSPLFLYQSLTTNEYVGYLTNRAGGAAYPAVTARVFEESSTLVPTKNL----V 332 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + + L + R + ++G++ + Sbjct: 333 EQYERQVQDTYTQINILRTQNIKLAQARDLLLPKLMSGELTV 374 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 43/134 (32%), Gaps = 5/134 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ WK + + K+N + + I YI + V + + + Sbjct: 183 LPEGWKNMQLTDIAKVNQASLKKGFDEKIEYIDISCVSTHSISDT-TWYEFIDAPGRARR 241 Query: 80 IFAKGQILYGKLGPYLRK---AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I IL+ + P R I ST F V+ +V P L L + + Sbjct: 242 IVQHCDILWSCVRPNRRSHALVWEPHDRLIASTGFAVISATEVSPLFLYQSLTTNEYVGY 301 Query: 137 IEAICEGATMSHAD 150 + GA Sbjct: 302 LTNRAGGAAYPAVT 315 >gi|329123769|ref|ZP_08252328.1| type I restriction system specificity protein [Haemophilus aegyptius ATCC 11116] gi|327469672|gb|EGF15140.1| type I restriction system specificity protein [Haemophilus aegyptius ATCC 11116] Length = 199 Score = 71.4 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 63/183 (34%), Gaps = 13/183 (7%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETR----NMGLKPESYETYQIVDPGEIV 292 L+ + E+ + ++ YG I T + PE + + G+++ Sbjct: 11 LGELIRGNGLQKKDFTETGVPAIHYGQIYTYYGTFATKTKSFVSPELAKKLKKAKYGDVL 70 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGS 351 + + A +P+ ++ YL +++++ K Sbjct: 71 IAGTSENLKDVMKPLGWLGSEIAFSGDMFAFRPNKRVNTKYLTYILQTERFYKFKEKYAQ 130 Query: 352 GLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLL 403 G + +K ++ + +P +EQ I ++++ + + +E+ ++ Sbjct: 131 GTKVIRVKADNFLNYEIPLPTFEEQHRIVSILDKFETLTNSITEGLPLAIEQRQKRYEYY 190 Query: 404 KER 406 +E Sbjct: 191 REL 193 >gi|257785026|ref|YP_003180243.1| hypothetical protein Apar_1225 [Atopobium parvulum DSM 20469] gi|257473533|gb|ACV51652.1| hypothetical protein Apar_1225 [Atopobium parvulum DSM 20469] Length = 459 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 54/385 (14%), Positives = 113/385 (29%), Gaps = 46/385 (11%) Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVL 114 S + + G + Y + I + G S ++V Sbjct: 65 SNKIGMFDASIKKGKKIKQKYHVVKDGWLAYNPYRINVGSIGIKTPELQGGYISPAYVVF 124 Query: 115 QPKDVL-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 KD L PE L + S I G+ + + +I PIP + EQ I + Sbjct: 125 SCKDTLLPEYLWLMMKSDYFNALINDSTTGSVRQTLRFDKLASIKAPIPTVDEQKEILAQ 184 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG---------LNPDVKMKDSGIE- 223 A + I++ F + L Q+ VS + + P S E Sbjct: 185 YHATLAEAEKNISDGNSFSDGLLFDIQSKVSDLEKDESAAEKPSSIIQPVPFAAMSRWEV 244 Query: 224 -------------WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 P + + L+ K + ES ++ + + I E Sbjct: 245 AYTLKKGKLERVYGSFKCPFKSISELTKESLFGLSLKASLKQESGMIPILRMSNIVNGEI 304 Query: 271 RNMGLKPESYET--------YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 LK Y++ ++ G+ + + + + S + Sbjct: 305 DCSSLKYLPYKSAVTPREPDKWLLRKGDFLINRTNSKELVGKSAVFNLDGDYTYASYIIR 364 Query: 323 VKPHG--IDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + Y+ + + + + + ++ ++ + + +P I EQ I Sbjct: 365 YRFDTSVVLPEYVNIMFMLPLVRIQIDTMSRQTAGQCNINSGEIGSIRIPIPSIPEQQAI 424 Query: 379 TNVI-------NVETARIDVLVEKI 396 + + A+ + L +K Sbjct: 425 IDKYYSTKDGADAFYAKAEELKQKT 449 >gi|108797004|ref|YP_637201.1| type I restriction-modification system specificity subunit [Mycobacterium sp. MCS] gi|119866088|ref|YP_936040.1| type I restriction-modification system specificity subunit [Mycobacterium sp. KMS] gi|108767423|gb|ABG06145.1| type I restriction-modification system specificity subunit [Mycobacterium sp. MCS] gi|119692177|gb|ABL89250.1| type I restriction-modification system specificity subunit [Mycobacterium sp. KMS] Length = 411 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 63/428 (14%), Positives = 127/428 (29%), Gaps = 65/428 (15%) Query: 25 WKVVPIKRFTKLNT----GRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + T G+ + + ++ ++V + K G D Sbjct: 4 WRESVLGDLCTRVTVGHVGKMATEYVPDGVPFLRSQNVR---PFVIDKRGLLYIGDDFNA 60 Query: 79 SI----FAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G ++ + G A++ + C+ ++ + P +L S+ Sbjct: 61 KLRKSALTAGDVVIVRTGYPGTAAVVPEDLDGSNCADLVVITPSDALNPHVLAALFNSVY 120 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G+ H + + + +P AEQ I + I+ LI R + Sbjct: 121 GQHAVSSQLVGSAQQHFNVGSAKTMRVRLPDRAEQDHIAAVL----CSINDLIENNRRRV 176 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR---KNT 249 E+L+ + + K P + +G P WEV F K+ Sbjct: 177 EVLEGMARTIYREWFVKFRYPGNEGVPLVDSALGPAPKGWEVANLFDAADVGFGYSFKSP 236 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + S + I +R E+ + V +++ + Sbjct: 237 RFSNSGPFQVIRIRDIPVGISR--TYTDEAADPRYAVYDDDVLIGMDGDFHMTV-----W 289 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 E + ++P S L + A+ L + ++ + VLV Sbjct: 290 TGEDAWLNQRVTRLRPRLGLSALHLLLAIEEQIKDWNRAIVGTTVAHLGKKHLQLVNVLV 349 Query: 370 PPIKEQFDITNVINV--ETARIDVLVEKIEQSIVLLKERRSSFIA--------------A 413 P I+ A I ++ERR + I Sbjct: 350 PND------AVRIDASVVFAPI-------------MEERR-ALIQSSRRLAALRDLLLPK 389 Query: 414 AVTGQIDL 421 V+GQID+ Sbjct: 390 LVSGQIDV 397 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 63/208 (30%), Gaps = 19/208 (9%) Query: 6 AYPQYKDSGVQW----IGAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDV 56 YP + GV +G PK W+V + + G + +S I + D+ Sbjct: 195 RYPG--NEGVPLVDSALGPAPKGWEVANLFDAADVGFGYSFKSPRFSNSGPFQVIRIRDI 252 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 G + +L G G + D + + L+P Sbjct: 253 PVGISR------TYTDEAADPRYAVYDDDVLIGMDGDFHMTV-WTGEDAWLNQRVTRLRP 305 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + L L + + AI G T++H K + + + +P A ++ Sbjct: 306 RLGLSALHLLLAIEEQIKDWNRAIV-GTTVAHLGKKHLQLVNVLVPNDAVRIDASVVFAP 364 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVS 204 LI R L LVS Sbjct: 365 IMEERRALIQSSRRLAALRDLLLPKLVS 392 >gi|32455521|ref|NP_862273.1| hypothetical protein pRV500_p05 [Lactobacillus sakei] gi|24461248|gb|AAN61995.1|AF438419_5 HsdS [Lactobacillus sakei] Length = 374 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 55/393 (13%), Positives = 109/393 (27%), Gaps = 29/393 (7%) Query: 28 VPIKRFTKLNTGRTSE-SGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 V + + L T + + + Y+ E+++ + G P + + G Sbjct: 4 VKLGDYVSLQTHKVDNLTAVNTSYVSTENLQPNRNGVLFPAASVPSSGKVNFYDV---GD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL--LSIDVTQRIEAICEG 143 IL + PY +K +A G S L + K ++ S + +G Sbjct: 61 ILVSNIRPYFKKIWMAINPGTHSGDVLNFRTKSPKLTQEYLYIVLESDSFFDYVTLTSKG 120 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 M D I + +P L Q + I+A +I +EL + Sbjct: 121 TKMPRGDKDAIMDFEFSLPSLDVQQKLSNTIMALERKILMSKQVNDNLLELADATFKKNY 180 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 V + G P L KN ++ + Sbjct: 181 EQQVGNQKLETLATVKGGKRLPKGAPLTEVKTQHPYLRITDYSKNGVPSVQSMQYI---- 236 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + +++ GE+ + L ++ + +A Sbjct: 237 ----------TEEVFDKISRYVINEGEVFLSIVGTIG-IVDLIDERLDNASLTENAVKIH 285 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPI-KEQFDITNV 381 + YL +RS + + ++ L +K V V Q Sbjct: 286 AQTTAMAHYLYLYLRSDEGRHEIDSRTVGTTQKKLAITRIKDFDVGVISETDLQE----- 340 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + + +V I L + R S + Sbjct: 341 FERTVSPLINMVLANRSEIDTLVQIRDSLLQEL 373 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 43/174 (24%), Gaps = 11/174 (6%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP-GE 290 V+ K L N +S N+ G+ Sbjct: 1 MTKVKLGDYVSLQTHKVDNLTAVNTSYVSTENLQPNRNGVLFPAASVPSSGKVNFYDVGD 60 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I+ I K + G + + K + YL ++ S Sbjct: 61 ILVSNIRPYFKKIWMAINPGTHSGDVLN--FRTKSPKLTQEYLYIVLESDSFFDYVTLTS 118 Query: 351 SGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI-------DVLVEKI 396 G + + +P + Q ++N I +I D L+E Sbjct: 119 KGTKMPRGDKDAIMDFEFSLPSLDVQQKLSNTIMALERKILMSKQVNDNLLELA 172 >gi|75677298|ref|YP_319719.1| hypothetical protein Nwi_3120 [Nitrobacter winogradskyi Nb-255] gi|74422168|gb|ABA06367.1| hypothetical protein Nwi_3120 [Nitrobacter winogradskyi Nb-255] Length = 233 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 71/215 (33%), Gaps = 15/215 (6%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 LNP KD ++ + L + + N + Sbjct: 27 LNPGPVPKDWQVKTI------AREWKLRCLGEITRELTWRHNDRNFGRELVMGVTNSRGI 80 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGI 328 M Y+I+ P + + S+ ++ +++ Y+ P + Sbjct: 81 VPMQTIGSDLTRYKILLPRAFAYNPMR--IKVGSIARLRLPSEVLVSPDYVLFECVPGKL 138 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D +L L +S+ A GSG +R ++D+ L + +P EQ I+ ++N Sbjct: 139 DPDFLNHLRQSHFWDHYINAGGSGSVRMRAYYDDLAALRLKLPGFAEQHRISAMLNTAQG 198 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + + I L + + TG+ ++ Sbjct: 199 E----IALVATEIETLTRQTRGLMQKLPTGERRVK 229 Score = 40.5 bits (93), Expect = 0.56, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 60/195 (30%), Gaps = 14/195 (7%) Query: 19 GAIPKHWKVV------PIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 G +PK W+V ++ ++ T + ++ + V + G + S Sbjct: 30 GPVPKDWQVKTIAREWKLRCLGEITRELTWRHNDRNFGRELVMGVTNSRGIVPMQTIGS- 88 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAI--IADFDGICSTQF--LVLQPKDVLPELLQGW 127 D + I Y + + + + S + P + P+ L Sbjct: 89 --DLTRYKILLPRAFAYNPMRIKVGSIARLRLPSEVLVSPDYVLFECVPGKLDPDFLNHL 146 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S I A G+ A + + + + +P AEQ I + I + TE Sbjct: 147 RQSHFWDHYINAGGSGSVRMRAYYDDLAALRLKLPGFAEQHRISAMLNTAQGEIALVATE 206 Query: 188 RIRFIELLKEKKQAL 202 + Q L Sbjct: 207 IETLTRQTRGLMQKL 221 >gi|269978336|gb|ACZ55902.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 408 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 48/403 (11%), Positives = 116/403 (28%), Gaps = 32/403 (7%) Query: 22 PKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + + + T + + + ++ Y + N Q+ Sbjct: 13 PKGVEFRKLGEVLEYDQPNKYCVTSKEFDKSYPTPVLTAG--KTFILGYTNEKDNIYQAS 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ +I + + S+ +L K+ + + Sbjct: 71 KSSPAIIFDD--------FTTATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFYM---Q 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + +PIPPL Q I + + A T L TE + Sbjct: 120 TIPYNIGGEHARHWISRYSQ--LEVPIPPLEIQQEIVKILDAFTELNTELNTELNTELNA 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K++ Q ++ + + KD+ I+ L + Sbjct: 178 RKKQYQ-YYQNMLLDFNDINSNHKDAKIKSYPKRLKTL----LHTLAPKGVEFRKLGEVC 232 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I+ + L+ + ++ I + + ++ Sbjct: 233 EIIRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVNWQNQKF 292 Query: 315 IITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 +V P + YL +++ + + S + S+ ++ ++ + +PP++ Sbjct: 293 WANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITIPIPPLE 352 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q +I +++ + L+ I I K+ R + Sbjct: 353 IQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 395 >gi|269978332|gb|ACZ55900.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 408 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 48/403 (11%), Positives = 116/403 (28%), Gaps = 32/403 (7%) Query: 22 PKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + + + T + + + ++ Y + N Q+ Sbjct: 13 PKGVEFRKLGEVLEYDQPNKYCVTSKEFDKSYPTPVLTAG--KTFILGYTNEKDNIYQAS 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ +I + + S+ +L K+ + + Sbjct: 71 KSSPAIIFDD--------FTTATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFYM---Q 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + +PIPPL Q I + + A T L TE + Sbjct: 120 TIPYNIGGEHARHWISRYSQ--LEVPIPPLEIQQEIVKILDAFTELNTELNTELNTELNA 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K++ Q ++ + + KD+ I+ L + Sbjct: 178 RKKQYQ-YYQNMLLDFNDINSNHKDAKIKSYPKRLKTL----LHTLAPKGVEFRKLGEVC 232 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I+ + L+ + ++ I + + ++ Sbjct: 233 EIIRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVNWQNQKF 292 Query: 315 IITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 +V P + YL +++ + + S + S+ ++ ++ + +PP++ Sbjct: 293 WANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITIPIPPLE 352 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q +I +++ + L+ I I K+ R + Sbjct: 353 IQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 395 >gi|325681556|ref|ZP_08161080.1| hypothetical protein CUS_4505 [Ruminococcus albus 8] gi|324106755|gb|EGC01047.1| hypothetical protein CUS_4505 [Ruminococcus albus 8] Length = 61 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 16/58 (27%), Positives = 30/58 (51%) Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L +PP++EQ I I+ R + ++ +Q I ++E + S I VTG+ ++ Sbjct: 2 ELLYPMPPVEEQQAIVEHIDSVLERTNAIIADKKQQIETIEEYKKSLIFEYVTGKKEV 59 >gi|284108609|ref|ZP_06386427.1| restriction modification system DNA specificity domain [Candidatus Poribacteria sp. WGA-A3] gi|283829889|gb|EFC34178.1| restriction modification system DNA specificity domain [Candidatus Poribacteria sp. WGA-A3] Length = 264 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 13/107 (12%), Positives = 35/107 (32%), Gaps = 4/107 (3%) Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + ++ S + + ++ + DV + + +P E Sbjct: 150 CTNQGFKSLVCKDGVSNEFLYYLLLTLKPQMIERAIGSTFLEIGKRDVTSIELCIPTYAE 209 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 Q I V++ A + +E+ + + + +TG++ L Sbjct: 210 QCAIATVLSDMDAE----IAVLERRRDKTRAVKQGMMQQLLTGRVRL 252 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 64/194 (32%), Gaps = 11/194 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY---LPKDGNSRQSD 74 W+ + + G T + I + D+ + GKY + + Sbjct: 60 EWETTTVGEVADIRNGATPSTQIGAYWNGPIPWCTPTDITATPGKYLCATERSITAMGLA 119 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ G +L + + IA + F L KD + + L + + Sbjct: 120 NCAASLLPVGALLLCS-RATIGEIKIAVSSVCTNQGFKSLVCKDGVSN-EFLYYLLLTLK 177 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ G+T + + +I + IP AEQ I + I L R + + Sbjct: 178 PQMIERAIGSTFLEIGKRDVTSIELCIPTYAEQCAIATVLSDMDAEIAVLERRRDKTRAV 237 Query: 195 LKEKKQALVSYIVT 208 + Q L++ V Sbjct: 238 KQGMMQQLLTGRVR 251 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 23/61 (37%), Gaps = 4/61 (6%) Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 + +PP EQ I ++ + L I + + + + + +T + L G Sbjct: 2 FQIPLPPPSEQRAIAEALSDVDGLLAALEALIAKK----RAIKQATMQQLLTSKTRLPGF 57 Query: 425 S 425 S Sbjct: 58 S 58 >gi|260102293|ref|ZP_05752530.1| type I restriction-modification system specificity subunit [Lactobacillus helveticus DSM 20075] gi|260083890|gb|EEW68010.1| type I restriction-modification system specificity subunit [Lactobacillus helveticus DSM 20075] Length = 194 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 61/185 (32%), Gaps = 16/185 (8%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPG 289 WE + ++ Y + + +N + P + T + Sbjct: 20 WEQRKLGEEAQLTMGQSPNSENYTKNPDDYILVQGNADMKNGRVVPRVWTTQITKKAEKS 79 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +++ D V+ RG+ + ++ + L + Sbjct: 80 DLILSVRAPVGDIGKTDYDVVLGRGVAA---------IKGNEFIFQQLGKMKLTGYWTRY 130 Query: 350 GSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +G +S+ D+K ++VP +EQ I + ++D L+ ++ + L+E + Sbjct: 131 STGSTFESINSNDIKDAKIMVPVEEEQQKIGSF----FQQLDHLITLHQRKLEKLQELKK 186 Query: 409 SFIAA 413 ++ Sbjct: 187 GYLQK 191 Score = 44.0 bits (102), Expect = 0.052, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 49/179 (27%), Gaps = 5/179 (2%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + +L G++ S + G R T K Sbjct: 20 WEQRKLGEEAQLTMGQSPNSENYTKNPDDYILVQGNADMKNGRVVPRVWTTQITKKAEKS 79 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ P D+D + ++ + + L + +T G+ Sbjct: 80 DLILSVRAPV-GDIGKTDYDVVLGRGVAAIKGNEFI----FQQLGKMKLTGYWTRYSTGS 134 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 T + I + + +P EQ I I + + EL K Q + Sbjct: 135 TFESINSNDIKDAKIMVPVEEEQQKIGSFFQQLDHLITLHQRKLEKLQELKKGYLQKMF 193 >gi|167855557|ref|ZP_02478318.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus parasuis 29755] gi|167853303|gb|EDS24556.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus parasuis 29755] Length = 458 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 53/462 (11%), Positives = 127/462 (27%), Gaps = 79/462 (17%) Query: 29 PIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF- 81 + F ++ G +S + + I + ++ G+ L + F Sbjct: 3 KLGDFVRVQGGYAFKSSELSDDKTGVPVIKIGNITGGSFVDLSNYQSVSFQLFEKTKSFA 62 Query: 82 -AKGQILYGKLGPYLRK---AIIADFDGICSTQFLVLQPKDVLPE---LLQGWLLSIDVT 134 IL G + K + + + + K+ P + + S Sbjct: 63 TKDNDILIAMTGANVGKTSRVPVNSDAYLINQRVGRFLLKEDCPYTSDFIYYVVSSKQAY 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 Q + +GA + K I ++ P I + + +I ++ Sbjct: 123 QYFSRVADGAAQPNISGKTIEDLEFPNIDSRCANKIGNILKSLDDKIQLNTQINQTLEQI 182 Query: 195 LKEKKQALVSYI--------------------------------------------VTKG 210 + ++ + Sbjct: 183 AQTIFKSWFIDFDPVHAKANALASGQTAEQATQAAMAVISGKNTQELHRLQTANPEQYQQ 242 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQK 267 L + SG + G VP WE + + N K++ E I + G++ Sbjct: 243 LWEITEAFPSGFDEEG-VPRGWEQTTLSEVCSMKNGYAFKSSDWTEEGIPVIKIGSVKPM 301 Query: 268 L-ETRNMGLKPESYE---TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + E G E + + ++ G+IV + + ++ Sbjct: 302 IVEVDGNGFVDEEHSVLHSEFLLTEGDIVVGLTGYVGEVGRIP---QGRTAMLNQRVAKF 358 Query: 324 KPHGIDSTYLAWLM-----RSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFD 377 P+ ++ + R G + ++ +++ +++ + Q Sbjct: 359 IPNKLNEQQDYYSFVYCLVRDKSFKAFAETNAKGSAQANISTKELLNYSIILASPEIQMK 418 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 ++I +I LV L + R + ++G+I Sbjct: 419 FESLIKPLLDKI--LVNSGNN--EYLSKVRDLLLPNLLSGEI 456 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 50/172 (29%), Gaps = 11/172 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQ-SDT 75 +P+ W+ + + G +S + I I + V+ + + S Sbjct: 259 VPRGWEQTTLSEVCSMKNGYAFKSSDWTEEGIPVIKIGSVKPMIVEVDGNGFVDEEHSVL 318 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPK-----DVLPELLQGWLL 129 + + +G I+ G G I + + + P + + Sbjct: 319 HSEFLLTEGDIVVGLTGYVGEVGRIPQGRTAMLNQRVAKFIPNKLNEQQDYYSFVYCLVR 378 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 E +G+ ++ K + N + + Q+ I +I Sbjct: 379 DKSFKAFAETNAKGSAQANISTKELLNYSIILASPEIQMKFESLIKPLLDKI 430 >gi|257463921|ref|ZP_05628307.1| hypothetical protein FuD12_08734 [Fusobacterium sp. D12] gi|317061448|ref|ZP_07925933.1| predicted protein [Fusobacterium sp. D12] gi|313687124|gb|EFS23959.1| predicted protein [Fusobacterium sp. D12] Length = 124 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 40/92 (43%), Gaps = 6/92 (6%) Query: 330 STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + R+ + +G +++ V+ + +PP++EQ +I V+ + Sbjct: 13 KNFILYFFRTMNFINYIIKFATGSTIKNVSLNTVRESYIPLPPLEEQQEIVRVLEEVLEK 72 Query: 389 IDVLVEKI--EQSIVLLKERRSSFIAAAVTGQ 418 + E I E+ I LL+ S + A G+ Sbjct: 73 EKKVKELIDLEEKIDLLE---KSILDKAFRGK 101 >gi|317505565|ref|ZP_07963476.1| type I restriction-modification system [Prevotella salivae DSM 15606] gi|315663313|gb|EFV03069.1| type I restriction-modification system [Prevotella salivae DSM 15606] Length = 435 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 64/423 (15%), Positives = 121/423 (28%), Gaps = 53/423 (12%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P W + L+ G++ G D I+ I + Y + S ++ Sbjct: 11 EVPSSWGWCRLGTICNYLHRGKSPRYGNDKILPIMAQKCNQWDRIYTDRCLFSDKAFIEK 70 Query: 78 VS---IFAKGQILYGKLGPYLRKAIIADFDGIC---------STQFLVLQPKDVLPELLQ 125 G ++ G + S +V K V + Sbjct: 71 YKEEQYLQVGDVIVNSTGGGTVGRTGYIEKYVFEKYTKFVADSHVTVVRTNKLVSSRYIY 130 Query: 126 GWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +L+S + +E C G+T I N +P+PP AEQ I EKI ++ Sbjct: 131 YYLISPFIQIGLEERCSGSTNQIELGTASIYNNIIPLPPYAEQKRIIEKIAEVIPVVNRF 190 Query: 185 ITERIRFIELLK----EKKQALVSYIVTKGLNPDVKMK---------------------- 218 ++ +L + ++++ + L P Sbjct: 191 GEKQDFLEKLNQGLKPSLHKSILQEAIQGRLVPQDPNDEPASALLDKIQAEKVRLVKKGI 250 Query: 219 ------DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 + I + G ++E + E + E L+ I + N Sbjct: 251 LKKKDLQTSIIYKGENNKYYEQVGGTSQQIETDYDFPNHWEVVRLAHICRLIDGEKREGN 310 Query: 273 MGLKPESY----ETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYMAVKPH 326 Y T ++ G+ V ++ S V G + S + + Sbjct: 311 FVCLDAKYLRGKSTGNLLCKGKFVRTGDNIILVDGENSGEVFPVPCDGYMGSTFKQLWVS 370 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + + + L E L + +PP KEQ I + + T Sbjct: 371 EAMHLPYVLYFIQFYKDLLRNSKKGAAVPHLNKEIFYSLVIGIPPCKEQMRIAKQVKLLT 430 Query: 387 ARI 389 +I Sbjct: 431 DKI 433 >gi|313159677|gb|EFR59034.1| type I restriction modification DNA specificity domain protein [Alistipes sp. HGB5] Length = 335 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 76/215 (35%), Gaps = 18/215 (8%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVT---ELNRKNTKLIESNILSLSYGNIIQKLE----- 269 K E +P WE ++V+ + + E + GN + + Sbjct: 77 KCIDEEIPFEIPATWEWCRLLSIVSLLGDGIHGTPEYSEGGSVYFINGNNLFDGQILIKP 136 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 K E+ + ++++ ++ + V+ + SA +GI+ Sbjct: 137 DTKTVSKEEAVKHSRLLNESTVLVSINGTIGNIAFYSGENVI---LGKSACYFNLLNGIE 193 Query: 330 STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 Y+ ++++ + + +G +++ ++ + + +PP EQ I + ++ Sbjct: 194 RKYIKIVLQTDYFLEYTKRVATGSTIKNVPLSGMRNVLIPIPPKDEQQVIIDKLSSLKLL 253 Query: 389 IDVLVEKIEQSIVLLKE-----RRSSFIAAAVTGQ 418 I+ + + L + S + A+ G+ Sbjct: 254 IEKF-NIEQSQLNKLNAELRSVLKKSILQEAIQGK 287 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 79/217 (36%), Gaps = 11/217 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT-----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP W+ + L G + +I ++ G P + + Sbjct: 86 EIPATWEWCRLLSIVSLLGDGIHGTPEYSEGGSVYFINGNNLFDGQILIKPDTKTVSKEE 145 Query: 75 TST-VSIFAKGQILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSID 132 + + +L G A + + I + + + ++ L + Sbjct: 146 AVKHSRLLNESTVLVSINGTIGNIAFYSGENVILGKSACYFNLLNGIERKYIKIVLQTDY 205 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + G+T+ + G+ N+ +PIPP EQ +I +K+ + + I+ E+ + Sbjct: 206 FLEYTKRVATGSTIKNVPLSGMRNVLIPIPPKDEQQVIIDKLSSLKLLIEKFNIEQSQLN 265 Query: 193 ELLKEKK----QALVSYIVTKGLNPDVKMKDSGIEWV 225 +L E + ++++ + L P + + + E + Sbjct: 266 KLNAELRSVLKKSILQEAIQGKLLPQITEEGTAQELL 302 >gi|221195889|ref|ZP_03568941.1| HsdS specificity protein of type I restriction-modification system [Atopobium rimae ATCC 49626] gi|221184236|gb|EEE16631.1| HsdS specificity protein of type I restriction-modification system [Atopobium rimae ATCC 49626] Length = 191 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 57/195 (29%), Gaps = 16/195 (8%) Query: 228 VPDHWEVKPFFALVTELNRK------NTKLIESNILSLSYGNIIQKLETRNMGLKPES-- 279 +P WE + ++ N I L +I + E Sbjct: 1 MPSSWEQRKLGEIIQLGGSGGTPSATNPNYYGGEIPFLGIADIEGRDIAHTAKTLTEEGL 60 Query: 280 -YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 IV G + +R + M + I L + Sbjct: 61 RNSAAWIVPAGAVSLAMYASVGKVGIIRQDTATSQAFYN---MVFEDVAIRDFVFTRLEK 117 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + + +G +++L + VK + VP +E I A +D L+ ++ Sbjct: 118 ADAGFEWEPYISTGTQRNLNADKVKAFAIAVPSSREAAKIGRY----FANLDTLITLHQR 173 Query: 399 SIVLLKERRSSFIAA 413 LK+ + S + Sbjct: 174 KSEKLKQLKQSMLEK 188 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 65/193 (33%), Gaps = 11/193 (5%) Query: 22 PKHWKVVPIKRFTKLN-TGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P W+ + +L +G T + G +I ++G+ D+E + K Sbjct: 2 PSSWEQRKLGEIIQLGGSGGTPSATNPNYYGGEIPFLGIADIEGRDIAHTAKTLTEEGLR 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S I G + + II + ++ + + + L D Sbjct: 62 NSAAWIVPAGAVSLAMYASVGKVGIIRQDTATSQAFYNMVFEDVAIRDFVFTRLEKADAG 121 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 E T + + + + +P E KI +DTLIT R E Sbjct: 122 FEWEPYISTGTQRNLNADKVKAFAIAVPSSRE----AAKIGRYFANLDTLITLHQRKSEK 177 Query: 195 LKEKKQALVSYIV 207 LK+ KQ+++ + Sbjct: 178 LKQLKQSMLEKMF 190 >gi|332362407|gb|EGJ40207.1| hypothetical protein HMPREF9393_0205 [Streptococcus sanguinis SK1056] Length = 146 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 50/142 (35%), Gaps = 2/142 (1%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 V G+I+ + V H + + + Sbjct: 5 KNFSVVSGDILLTTRGTIGRIAIVPKDYFEGVLHPCLMKFRVDSHIVQPKLIKYFFNDIT 64 Query: 342 LCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 K S + ++K + + + P++EQ+ I ++ + + +D L++ ++ Sbjct: 65 FVKEQLKFLSNSTTIDVIYSYNLKNIIIPIIPMEEQYGIVEYLDKQCSNVDALIKVKQEQ 124 Query: 400 IVLLKERRSSFIAAAVTGQIDL 421 I + ++R + I VTG+ + Sbjct: 125 IKNINKQRQTLIYDYVTGKRRV 146 >gi|315152756|gb|EFT96772.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0031] Length = 197 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 59/163 (36%), Gaps = 8/163 (4%) Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF----RFIDLQNDKRSLRSAQ 309 ++S+ + K +N+ ++V GE+ + + RSL + Sbjct: 39 YKVISIGSYGLDSKYVDQNIRAVSNEVTDSRVVRNGELTMVLNDKTANGTIIGRSLLIEE 98 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + I + DS + ++ V + G + + + V L + + Sbjct: 99 DNKYVINQRTEIISPKENFDSNFAYTILNGPFRESVKRIVQGGTQIYVNYPAVSNLVLKL 158 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 P ++EQ I ++D + ++ + LLKE + F+ Sbjct: 159 PDVEEQKKIGLF----FKQLDDTIALQQRKLDLLKETKKGFLQ 197 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 72/192 (37%), Gaps = 14/192 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESG-TGKYLPKDGNSRQSDTSTVS 79 + W+ + G E +D Y + G KY+ ++ + ++ + Sbjct: 10 EDWEERKLSEVANHRGGTAIEKYFKEDGKYKVISIGSYGLDSKYVDQNIRAVSNEVTDSR 69 Query: 80 IFAKGQI--LYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G++ + D + + + ++ PK+ +L+ Sbjct: 70 VVRNGELTMVLNDKTANGTIIGRSLLIEEDNKYVINQRTEIISPKENFDSNFAYTILNGP 129 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + ++ I +G T + ++ + N+ + +P + EQ I ++D I + R + Sbjct: 130 FRESVKRIVQGGTQIYVNYPAVSNLVLKLPDVEEQKKIGLF----FKQLDDTIALQQRKL 185 Query: 193 ELLKEKKQALVS 204 +LLKE K+ + Sbjct: 186 DLLKETKKGFLQ 197 >gi|167912944|ref|ZP_02500035.1| restriction modification system DNA specificity domain [Burkholderia pseudomallei 112] Length = 367 Score = 71.0 bits (172), Expect = 4e-10, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 62/193 (32%), Gaps = 10/193 (5%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE--TRNMGLKPESYETYQI 285 ++ ++ + L ++ E +L L NI +++ + + Sbjct: 25 EWENKPLRTLGSFFRGLTYSADEVSEEGLLVLRSSNIQDGSLVLDKDLVFVDKPCPDDLL 84 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + G++ + + + + + ++ K Sbjct: 85 LQDGDVAICMSNGSKALVGKSAEFQNNYDGQLTVGAFCSIFRPSLEFAKLIFQTPRYSKF 144 Query: 346 FY-AMGSGLRQSLKFEDVKRLPVLVP--PIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 A+G G ++LK D++ VP P+ EQ I + + A +D L+ Q + Sbjct: 145 VSIAIGGGNIKNLKNSDLEEFEHPVPRMPL-EQQKIADCL----AFLDELISAENQKLST 199 Query: 403 LKERRSSFIAAAV 415 LK + + Sbjct: 200 LKAHKKGMLQQLF 212 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 41/360 (11%), Positives = 106/360 (29%), Gaps = 39/360 (10%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTG 61 +P+++++G W+ P++ G T S + ++ + +++ G+ Sbjct: 16 RFPEFREAG---------EWENKPLRTLGSFFRGLTYSADEVSEEGLLVLRSSNIQDGSL 66 Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI-------IADFDGICSTQFLVL 114 L KD + G + D + Sbjct: 67 -VLDKDLVFVDKPCPDDLLLQDGDVAICMSNGSKALVGKSAEFQNNYDGQLTVGAFCSIF 125 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREK 173 +P +L+ G + + + P+P + EQ I + Sbjct: 126 RPSLEFAKLIFQTPRYSKFVSI---AIGGGNIKNLKNSDLEEFEHPVPRMPLEQQKIADC 182 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + I+ + + LK K+ ++ + + +++ G Sbjct: 183 LAFLDEL----ISAENQKLSTLKAHKKGMLQQLFPREGEVVPRLRFPAFRKAGAWAKVAA 238 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + F + + L + + + V G+I + Sbjct: 239 GQLFSNRTERGEQGLPIYSVTMTEGLVPRASLDRRIDDI-----AEAGANKAVRRGDIAY 293 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + E +++ AY+ ++P G+D + +L++ + +V A G Sbjct: 294 NMMRMWQGALGVAP----EDCMVSPAYIVLEPQAGVDPVFFYFLLKRPETLQVLTAHSRG 349 >gi|315918352|ref|ZP_07914592.1| type I restriction system specificity protein [Fusobacterium gonidiaformans ATCC 25563] gi|313692227|gb|EFS29062.1| type I restriction system specificity protein [Fusobacterium gonidiaformans ATCC 25563] Length = 241 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 71/168 (42%), Gaps = 6/168 (3%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETR----NMGLKPESYETYQIVDPGEIVFRFIDLQN 300 K +L++ YG+I K + + E+ + + V G +V Sbjct: 59 MPKTMFDNHGEVLAIHYGHIYTKYNIFVKEPIVKVSMENAKNLKKVKKGNLVIAKTSENL 118 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQ-SLK 358 D A + E ++T + A+ HG + YL+++ + K + G++ L Sbjct: 119 DDVMKTVAYLGEDEVVTGGHSAIFRHGANPKYLSYVFNGADYFIKQKNKLAHGVKVIELS 178 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 D+++ +L+PPI Q I ++++ + L + + + I L +++ Sbjct: 179 TTDMEKFQILIPPIHIQEYIVSILDKFDMLTNDLTQGLPREIELRQKQ 226 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 18/194 (9%), Positives = 57/194 (29%), Gaps = 11/194 (5%) Query: 27 VVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSI 80 + + G ++ +++ I + + ++ + + + Sbjct: 44 WKRLGEVGRFENGTGMPKTMFDNHGEVLAIHYGHIYTKYNIFVKEPIVKVSMENAKNLKK 103 Query: 81 FAKGQILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKDVLPEL-LQGWLLSIDVTQ 135 KG ++ K ++ D + + + P+ + + + Sbjct: 104 VKKGNLVIAKTSENLDDVMKTVAYLGEDEVVTGGHSAIFRHGANPKYLSYVFNGADYFIK 163 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + G + + + IPP+ Q I + + + L R IEL Sbjct: 164 QKNKLAHGVKVIELSTTDMEKFQILIPPIHIQEYIVSILDKFDMLTNDLTQGLPREIELR 223 Query: 196 KEKKQALVSYIVTK 209 +++ + + Sbjct: 224 QKQYEYYREKLFDF 237 >gi|217033243|ref|ZP_03438677.1| hypothetical protein HPB128_149g1 [Helicobacter pylori B128] gi|216945022|gb|EEC23753.1| hypothetical protein HPB128_149g1 [Helicobacter pylori B128] Length = 228 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 20/175 (11%), Positives = 64/175 (36%), Gaps = 11/175 (6%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID------LQNDK 302 ++ + + ++ N Q ++ E + G+++F + Sbjct: 46 SQGNKFYVPYINVFNNPQLDLNALESVQIGDKEKQNTIQLGDVLFTGSSENLEDCAMSCV 105 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 + + + + + + + ++L +R Y+ K + +G R ++ + Sbjct: 106 VTQKIEKDIYLNSFCFGFRFFDENLFNPSFLKHFLRDYNFRKNISKVANGVTRFNVSKQL 165 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + ++ + +PP++ Q +I +++ + L+ I I K+ R + Sbjct: 166 LSKITIPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQYEYYREKLLT 220 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 54/171 (31%), Gaps = 15/171 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK + + + G ++ K + Y+ +V + L + + D Sbjct: 19 PKGVEFRKLGDIGEFYGGLVGKNKKSFSQGNKFYVPYINVFNNPQLDLNALESVQIGDKE 78 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD----------FDGICSTQFLVLQPKDVLPELLQG 126 + G +L+ L ++ + F P L+ Sbjct: 79 KQNTIQLGDVLFTGSSENLEDCAMSCVVTQKIEKDIYLNSFCFGFRFFDENLFNPSFLKH 138 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 +L + + I + G T + + + I +PIPPL Q I + + Sbjct: 139 FLRDYNFRKNISKVANGVTRFNVSKQLLSKITIPIPPLEIQQEIVKILDQF 189 >gi|317014806|gb|ADU82242.1| type I R-M system specificity subunit [Helicobacter pylori Gambia94/24] Length = 185 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 17/130 (13%), Positives = 44/130 (33%), Gaps = 6/130 (4%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGS 351 I + + + V P+ + +L + ++ + + Sbjct: 57 NTITIAQYGTAGYVNFQKNKFWANDICFCVYPNKDVIKNIFLYYFLKVNQNYLYEISNRN 116 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 S+ + + +L+PP+ EQ I N+++ I L K Q + + + Sbjct: 117 ATPYSISKDKILDFEILLPPLNEQAAIANILSDVDHEIISLKNKKRQ----FENVKKALN 172 Query: 412 AAAVTGQIDL 421 ++ +I + Sbjct: 173 HDLMSAKIRV 182 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 60/182 (32%), Gaps = 10/182 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 ++WK V + ++ G + ++ V G G + +R Sbjct: 6 QNWKKVRLGDIAEIKRGVRITKNELDVFGKYPVVSGGVGFLGYTNNFNR----------Y 55 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 + I + G + F V KDV+ + + L ++ E Sbjct: 56 ENTITIAQYGTAGYVNFQKNKFWANDICFCVYPNKDVIKNIFLYYFLKVNQNYLYEISNR 115 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 AT I + + +PPL EQ I + I +L ++ +F + K L Sbjct: 116 NATPYSISKDKILDFEILLPPLNEQAAIANILSDVDHEIISLKNKKRQFENVKKALNHDL 175 Query: 203 VS 204 +S Sbjct: 176 MS 177 >gi|327467254|gb|EGF12758.1| type Ic restriction-modification system [Streptococcus sanguinis SK330] Length = 406 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 45/396 (11%), Positives = 104/396 (26%), Gaps = 27/396 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W I ++ G + + + + K D Sbjct: 28 WVEKRIADIVNISAGGDVDKERLKESGKYPVIANA---LTNKGIVGFYDD----YKVKAP 80 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + G + + + + + + E Sbjct: 81 AVTVTGRGDVGYAVARHENFTPIVRLLTLQSENIDVD-------YLENQINSMRILNEST 133 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + +GN + P + EQ I + + + + Sbjct: 134 GVPQLTAPQLGNYKVYHPEIDEQSAIGSLFRTLDDFLASYKDNLANYQSFKATMLSKMFP 193 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 P++++ EW + + L EL+ + L ++S GN+ Sbjct: 194 KAGQS--VPEIRLDGFEGEWRIIKLGDVLSELKSGLSRELSNDDIGLPVIRANNISDGNL 251 Query: 265 IQKLETRNMGLKPES--YETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAY 320 + + + V +I+ FI+ + ++ S + I T+ Sbjct: 252 NLDRDIKYWFKEDPKGANTANYFVKENDILVNFINSEAKMGTAAIVSREPDRETIYTTNI 311 Query: 321 MAVKPHGIDSTYLAWLMRS-YDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFD 377 + + Y +LM ++ + S D K+ L P +EQ Sbjct: 312 LKLTVKEDYYPYFIYLMTFVQSYQNYIKSITKPAVNQASFTTVDFKKYEFLCPAFQEQQS 371 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + +D L+ ++ I L+ + + Sbjct: 372 IGTY----FSNLDSLIAAHQEKISQLETLKKKLLHD 403 >gi|148983885|ref|ZP_01817204.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP3-BS71] gi|147924032|gb|EDK75144.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP3-BS71] gi|301799573|emb|CBW32125.1| putative type I restriction-modification system S protein [Streptococcus pneumoniae OXC141] Length = 424 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 59/411 (14%), Positives = 128/411 (31%), Gaps = 61/411 (14%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + +P+PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIPLPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDS-----------------------------------GIEWVGLVPDHWEVKP 236 +S + G +P +W V Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYGNIPMNWVVIK 251 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY--------ETYQIVDP 288 + + + K + +I ++ L Y + Sbjct: 252 IKDIFSINTGLSYKKGDLSINKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISSEQVYLKH 311 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWLMRSYDLCK 344 +++ G++ ++ + I S +L + + S K Sbjct: 312 NQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNLSSPLFYK 371 Query: 345 VFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + ++ + L + + P +EQ IT + +++ L Sbjct: 372 QLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 422 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIPLPPLSEQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 74/183 (40%), Gaps = 16/183 (8%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 241 GNIPMNWVVIKIKDIFSINTGLSYKKGDLSINKGVRIIRGGNIKPLEFSLLDNDYYIDTQ 300 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELL 124 S+ ++ K L + L D+DG+ + F+ + +++ + L Sbjct: 301 FISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFL 360 Query: 125 QGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L S ++++AI + G + + + + +P+ P EQ LI +K+ +++ Sbjct: 361 LFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVN 420 Query: 183 TLI 185 L Sbjct: 421 QLW 423 >gi|291530636|emb|CBK96221.1| Restriction endonuclease S subunits [Eubacterium siraeum 70/3] Length = 379 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 55/390 (14%), Positives = 108/390 (27%), Gaps = 48/390 (12%) Query: 25 WKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTST 77 W + ++ G T ++ DI + +V + S+ Sbjct: 22 WSTYHLSDIAEVVGGGTPDTTVSSLWNGDIQWFTPTEVGHQKYVSKSARTITQLGLQKSS 81 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G IL + + IA + + F L PK +L Sbjct: 82 AKKLPAGSILLSS-RATIGECSIAQRECTTNQGFQNLIPKKDTNNEFLYYLAQTK-KHHF 139 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G+T I +P EQ I + A RI L KE Sbjct: 140 IKYASGSTFLEISNSEIKKTKCTVPGTEEQTQIAAFLSALDDRIAVQNKIIEDLKVLKKE 199 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 +L+ I+ ++ N + + Sbjct: 200 LNYSLIGRIINGK---------------------SSNCKIEDVIDYEQPTNYIVKSDKYI 238 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++ + +G E+ Y + + + + K ++ I Sbjct: 239 ENGETPVLTANKAFLLGYTIENEGVY---NKSDCIILDDFTLDFKYVNFPFKIKSSAI-- 293 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + I+ Y + +F + S + +V LP+ +P I EQ + Sbjct: 294 --KILTAKKDIELRYFYEYL-------LFLGLTSHEHKRHYISEVAPLPLYLPSIDEQRN 344 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERR 407 +V+N + + ++ E I LK ++ Sbjct: 345 ALSVLNSISKK----IKVEENYISALKAQK 370 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 16/129 (12%), Positives = 36/129 (27%), Gaps = 8/129 (6%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I L + + + + P + + + Sbjct: 88 GSILLSSRATIGECSIAQRECTTNQGFQNLIPKKDTNNEFLYYLAQTKKHHFIKYASGST 147 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS----S 409 + ++K+ VP +EQ I ++ +D + + I LK + S Sbjct: 148 FLEISNSEIKKTKCTVPGTEEQTQIAAFLSA----LDDRIAVQNKIIEDLKVLKKELNYS 203 Query: 410 FIAAAVTGQ 418 I + G+ Sbjct: 204 LIGRIINGK 212 >gi|330941026|gb|EGH43948.1| restriction modification system DNA specificity subunit [Pseudomonas syringae pv. pisi str. 1704B] Length = 280 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 64/218 (29%), Gaps = 9/218 (4%) Query: 206 IVTKGLNPDVKMKDSGIE-WVGLVPDHWEVKPFFALVTELNRKN--TKLIESNILSLSYG 262 V + + + G E +P W+ + R L S + G Sbjct: 60 AVEGKIKKKKPLAEVGEEAEPFELPAGWKWSSLAQVAFVNPRNAAADSLEVSFVPMTFIG 119 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS---- 318 + L E + + G+I I + + G+ Sbjct: 120 TRFDDQHGQEPRLWGELKQGFTHFAEGDIGVAKITPCFENSKACVFSNLLNGLGAGTTEL 179 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +D Y+ ++S G+ ++ L + V+ P +PP+ EQ Sbjct: 180 HIVRPITGTLDPRYVLAYLKSPQFLLVGETKMTGTAGQKRLPKDFVEANPFPLPPLAEQH 239 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 I ++ A D L + + + + + + Sbjct: 240 RIVAKVDELMALCDRLEAQQADAESAHTQLVQALLDSL 277 Score = 53.3 bits (126), Expect = 9e-05, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 63/196 (32%), Gaps = 9/196 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P WK + + +N + ++ ++ + + + ++ + Sbjct: 82 ELPAGWKWSSLAQVAFVNPRNAAADSLEVSFVPMTFIGTRFDDQHGQEPRLWGELKQGFT 141 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICST--------QFLVLQPKDVLPELLQGWLLSI 131 FA+G I K+ P + F + + + + P + +L S Sbjct: 142 HFAEGDIGVAKITPCFENSKACVFSNLLNGLGAGTTELHIVRPITGTLDPRYVLAYLKSP 201 Query: 132 DVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 E G + P P+PPLAEQ I K+ D L ++ Sbjct: 202 QFLLVGETKMTGTAGQKRLPKDFVEANPFPLPPLAEQHRIVAKVDELMALCDRLEAQQAD 261 Query: 191 FIELLKEKKQALVSYI 206 + QAL+ + Sbjct: 262 AESAHTQLVQALLDSL 277 >gi|319758538|gb|ADV70480.1| Type I restriction enzyme EcoKI specificity protein (S protein) [Streptococcus suis JS14] Length = 429 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 54/166 (32%), Gaps = 17/166 (10%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEI----VFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + +K Y I+ I ++ I + + +A Sbjct: 35 KDGTIKPTNIKFAPDNVYTIIRNYTISSTDIYVTIAGTIGDVGIVPENFNNALLTENALK 94 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + I+ +LA L++S + K F + L + +PP+ EQ I Sbjct: 95 LMLTESINKMFLAHLLKSPLVQKQFKEVYNQVAQPKLSIRSTNSTIIPLPPLAEQKRIVA 154 Query: 381 VINVETARIDVLVEKIEQSIVLLKE--------RRSSFIAAAVTGQ 418 I + VE +S L+E + S + A+ G+ Sbjct: 155 QIERALEQ----VEVYAESYNKLQELDRAFPDKLKKSILQYAMQGK 196 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 50/428 (11%), Positives = 116/428 (27%), Gaps = 66/428 (15%) Query: 31 KRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGKYLPKDGNSRQ-SDTSTVSIFA 82 G+ G ++ Y+ + D++ GT K + Sbjct: 2 GAIVTAKGGKRIPKGYNLQEEDNGHPYLRVTDMKDGTIKPTNIKFAPDNVYTIIRNYTIS 61 Query: 83 KGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I G I+ + + ++ + + L L S V ++ + Sbjct: 62 STDIYVTIAGTIGDVGIVPENFNNALLTENALKLMLTESINKMFLAHLLKSPLVQKQFKE 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + +P+PPLAEQ I +I +++ + EL + Sbjct: 122 VYNQVAQPKLSIRSTNSTIIPLPPLAEQKRIVAQIERALEQVEVYAESYNKLQELDRAFP 181 Query: 200 ----QALVSYIVTKGLNPDVKM-----------------------------------KDS 220 ++++ Y + L K Sbjct: 182 DKLKKSILQYAMQGKLVAQDPNDEPVEVLLEMIRAEKQKLYEEGKLKKKDLAEIMVEKGD 241 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL----SLSYGNIIQKLETRNMGLK 276 G +P +W + + + + K + I+ + G I+ L + + Sbjct: 242 DNSPYGKIPRNWTLLSVKDIFSITTGLSYKKTDLAIIQRGVRIIRGGNIEPLAYKLLDND 301 Query: 277 PESYETYQ-----IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHG 327 Y + ++V + ++ Sbjct: 302 YYIESKYITSESVYLKRNQLVTPVSSSLEHIGKFARIDKNYSDTVAGGFVFQLTPFISSD 361 Query: 328 IDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 S YL + S K + ++ + L + + P +EQ I+N + Sbjct: 362 TLSNYLLLCLSSPLFYKQLQSVTKLSGQALYNIPKTKLNDLRIALAPEQEQERISNKVGQ 421 Query: 385 ETARIDVL 392 ++++L Sbjct: 422 LFQKVNLL 429 Score = 39.8 bits (91), Expect = 0.86, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 32/87 (36%), Gaps = 6/87 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP++W ++ +K + TG + + + I ++E K L D Sbjct: 247 GKIPRNWTLLSVKDIFSITTGLSYKKTDLAIIQRGVRIIRGGNIEPLAYKLLDNDYYIES 306 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAI 99 ++ S++ K L + L Sbjct: 307 KYITSESVYLKRNQLVTPVSSSLEHIG 333 >gi|258542518|ref|YP_003187951.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256633596|dbj|BAH99571.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256636655|dbj|BAI02624.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-03] gi|256639708|dbj|BAI05670.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-07] gi|256642764|dbj|BAI08719.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-22] gi|256645819|dbj|BAI11767.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-26] gi|256648872|dbj|BAI14813.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-32] gi|256654916|dbj|BAI20843.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-12] Length = 194 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 54/152 (35%), Gaps = 10/152 (6%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 N G+ P Y G I E+ +V+ + Sbjct: 50 NGGITPSGYTNEANRAAGTITISEGGNS----CGYVDYQREKFWCGGHCYSVERPTLFID 105 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARID 390 +L ++ + +GSGL +++ + ++ LP+ P EQ I V+ Sbjct: 106 FLYQTLKFLQPKIMRLRVGSGL-PNIQKKALETLPLYHPIATNEQKAIAAVLTTADEE-- 162 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + IE + L++ + + + +TG+ ++ Sbjct: 163 --IAAIESDLSRLRQEKKALMQQLLTGKRRVK 192 >gi|60681332|ref|YP_211476.1| putative type I restriction-modification system specificity system, partial [Bacteroides fragilis NCTC 9343] gi|60492766|emb|CAH07540.1| putative type I restriction-modification system specificity system, partial [Bacteroides fragilis NCTC 9343] Length = 209 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 84/204 (41%), Gaps = 10/204 (4%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY--GNIIQKLETR 271 K + + + WE +V R+N I+ + S++ G + Q + Sbjct: 9 YYPKKSQELLKLKGLNSKWEQCFLKDVVENFCRRNKSHIQYPMYSVTNDLGFVPQSEKFE 68 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DS 330 + E +Y++++ G+ + + + S+ + +I+S Y+ +P S Sbjct: 69 ERTMMGEDISSYKVINKGDFAYNP--ARINVGSIAKYEGDNPCMISSLYVCFRPKYNISS 126 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L L++S + + G G+R L F + R+ + +PP++EQ I VI+ I Sbjct: 127 EWLQHLLKSQRMIYNYNLFGEGGVRIYLFFPNFGRIKISIPPLEEQKKIAAVIST----I 182 Query: 390 DVLVEKIEQSIVLLKERRSSFIAA 413 + + + L ++S + Sbjct: 183 EQKISVENFILDKLNTQKSFLLTK 206 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 51/160 (31%), Gaps = 3/160 (1%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ +K + R + +Y D+ ++ D S+ + KG Sbjct: 27 WEQCFLKDVVENFCRRNKSHIQYPMYSVTNDLGFVPQSEKFEERTMMGEDISSYKVINKG 86 Query: 85 QILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAIC 141 Y + D + S+ ++ +PK + L S + Sbjct: 87 DFAYNPARINVGSIAKYEGDNPCMISSLYVCFRPKYNISSEWLQHLLKSQRMIYNYNLFG 146 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 EG + + G I + IPPL EQ I I +I Sbjct: 147 EGGVRIYLFFPNFGRIKISIPPLEEQKKIAAVISTIEQKI 186 >gi|302381020|ref|ZP_07269481.1| type I restriction modification DNA specificity domain protein [Finegoldia magna ACS-171-V-Col3] gi|302311241|gb|EFK93261.1| type I restriction modification DNA specificity domain protein [Finegoldia magna ACS-171-V-Col3] Length = 378 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 48/401 (11%), Positives = 100/401 (24%), Gaps = 49/401 (12%) Query: 22 PKHWKVVPIKRFTKL-----NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P + I+ ++++ + L ++ Y + Q+ Sbjct: 13 PDGVEYKKIEEVANYEQPSKYIVKSTKYDDSYVTPVLTAGQTFILGYTNETDGIFQASKD 72 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I + DF + + + L+ Sbjct: 73 NPVII---------FDDFTGAFKWVDFPFKIKSSAMKIITIKENNMPLRYL-----FHIM 118 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + +P+PPL Q I + + T+ L E + + Sbjct: 119 GNLGFKSDEHKRLWISIYSQLKIPVPPLEVQREIVRILDSFTLLTAELTAELTARKKQYE 178 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + L+ + +MK S + + +L E Sbjct: 179 YYEHNLL------FDDKYKRMKLSDL--------------CTVNQGLQIPISKRLKEPRE 218 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 Y + + E+ + I +I+ E Sbjct: 219 NCYRYITVQFLKNNEDEQYYIENPDKNVICKEDDILVTRTGSTGVIVYGV-----EGCFH 273 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYD-LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + I Y+ +L+RS K+ A G L + L V VP I EQ Sbjct: 274 NNFFKVTPNELIHKKYMYFLLRSKYMYNKMLTAASGGTVPDLPHKKFYALEVPVPTIDEQ 333 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 I ++ + + I ++ R + Sbjct: 334 KHIVEMLEKFNELSKDVSIGLPAEIEARQKQYEYYRDKLLT 374 >gi|291457410|ref|ZP_06596800.1| type I restriction system specificity protein [Bifidobacterium breve DSM 20213] gi|291381245|gb|EFE88763.1| type I restriction system specificity protein [Bifidobacterium breve DSM 20213] Length = 215 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 23/114 (20%), Positives = 39/114 (34%), Gaps = 6/114 (5%) Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 E I A+ ++ +LMR Y + S+ + +K L V Sbjct: 11 NVAFENCCIGRGLAAIHSET--PSFALYLMRFLKPQLEAYNGEGTVFGSINGKALKSLEV 68 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P E A ID L+ E L R+ + ++G+ID+ Sbjct: 69 ALPSHNE----VMQFESFAAPIDALIRSNENETRKLNNLRNYLLPKLMSGEIDV 118 >gi|313678681|ref|YP_004056421.1| type I restriction-modification system, S subunit [Mycoplasma bovis PG45] gi|312950662|gb|ADR25257.1| putative type I restriction-modification system, S subunit [Mycoplasma bovis PG45] Length = 505 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 64/440 (14%), Positives = 132/440 (30%), Gaps = 57/440 (12%) Query: 7 YPQYKDSGVQWIG---AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 Y +++D + I IP +W+ V I + + Y + Sbjct: 68 YEKFEDGREEKIEVPFEIPDNWRWVRINCAYQYIPTGVKKYSGSKKYFSTGSINYDNIT- 126 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLP 121 ++ T I QI+ ++ + II + + ST F Q Sbjct: 127 PEQECLFNGRPTRANRIVYYNQIIEARMINTNKATIIDERLDGQLVSTGFFCYQVVLGEI 186 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 E L+ S + ++C G T + + + I +P+ PL EQ I E I I Sbjct: 187 EYLKIIFDSHYFKKTKNSLCTGTTQKSINDENLSKILVPLAPLEEQRRIVELIHKLDSLI 246 Query: 182 DTL----ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH------ 231 I EL + ++++++Y + L + DS + + Sbjct: 247 GKYSKFEIELSELEEELPTKLEKSIINYAMKGKLVKQDQNNDSVDNLINEIYKEKQKLVE 306 Query: 232 -------------WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----------- 267 E + NI L G+ I K Sbjct: 307 QGKLKKADLNNLIIYKNDNDNSYYENQSTKPYIKLGNIAELYTGDSINKTFKKKFLSAFS 366 Query: 268 ---------------LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + N PE+++ + P + I + + A Sbjct: 367 ELSYISTKDVGFDKEISYDNGVWIPENFKNEYKIAPKNSILLCI--EGGSAGRKMAITKR 424 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + + + + +L++ + +F + G+ + ++K + + + Sbjct: 425 DVAFGNKLCCINSNNLSNKFLSYFFQCDTFKNMFNSKTKGIISGISLSNLKSIEIPIFSG 484 Query: 373 KEQFDITNVINVETARIDVL 392 Q + N +N+ I L Sbjct: 485 TYQEKLINKLNLIGTIIKKL 504 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 16/207 (7%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE-TRNMGLKPESY 280 IE +PD+W + K S + + + Sbjct: 79 IEVPFEIPDNWRWVRINCAYQYIPTGVKKYSGSKKYFSTGSINYDNITPEQECLFNGRPT 138 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 +IV +I+ + N + + ++ ++++ + + + YL + S+ Sbjct: 139 RANRIVYYNQIIEARMINTNKATII--DERLDGQLVSTGFFCYQVVLGEIEYLKIIFDSH 196 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 K ++ +G ++S+ E++ ++ V + P++EQ I +I+ ++D L+ K + Sbjct: 197 YFKKTKNSLCTGTTQKSINDENLSKILVPLAPLEEQRRIVELIH----KLDSLIGKYSKF 252 Query: 400 IVLL--------KERRSSFIAAAVTGQ 418 + L + S I A+ G+ Sbjct: 253 EIELSELEEELPTKLEKSIINYAMKGK 279 >gi|145637386|ref|ZP_01793046.1| type I restriction/modification specificity protein [Haemophilus influenzae PittHH] gi|145269478|gb|EDK09421.1| type I restriction/modification specificity protein [Haemophilus influenzae PittHH] Length = 277 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 46/281 (16%), Positives = 92/281 (32%), Gaps = 27/281 (9%) Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 ++ T IP IPPL+ Q I + + A T L + Sbjct: 9 YIYYWLNTLPNNQTDGDHKRQWISNYANKLIP--IPPLSVQTEIVKILDALTTLTSELTS 66 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 E I + + ++ L++ + + + +G V K Sbjct: 67 ELILRQKQYEYYREKLLN-----------IDEMNKVTELGDVGPVRMCKRIL-------- 107 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSL 305 KN +I G +K + + Y+ Y G+I+ Sbjct: 108 KNQTANSGDIPFYKIGTFGKKPDAYISNELFQEYKQKYSYPKKGDILISASGTIGRTVIF 167 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 E + + + +L Y + K A G G Q L +++K++ Sbjct: 168 ----DGENSYFQDSNIVWIDNDETLVLNKYLYHFYKIAKWGIAEG-GTIQRLYNDNLKKV 222 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + +PP+KEQ I ++++ + + E + +I ++R Sbjct: 223 KISIPPLKEQHRIVSILDKFETLTNSITEGLPLAIEQSQKR 263 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 66/218 (30%), Gaps = 14/218 (6%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLED 55 + K Y Y+ + + I + KV + + + + DI + + Sbjct: 69 ILRQKQYEYYR----EKLLNIDEMNKVTELGDVGPVRMCKRILKNQTANSGDIPFYKIGT 124 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 Y+ + Q S KG IL G R I + +V Sbjct: 125 FGKKPDAYISNELF--QEYKQKYSYPKKGDILISASGTIGRTVIFDGENSYFQDSNIVWI 182 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 L+ L I EG T+ + + + IPPL EQ I + Sbjct: 183 DN--DETLVLNKYLYHFYKIAKWGIAEGGTIQRLYNDNLKKVKISIPPLKEQHRIVSILD 240 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 + + ITE + +K+ ++ NP Sbjct: 241 -KFETLTNSITEGLPLAIEQSQKRYEYYRELLLNFHNP 277 Score = 39.4 bits (90), Expect = 1.3, Method: Composition-based stats. Identities = 9/85 (10%), Positives = 28/85 (32%), Gaps = 8/85 (9%) Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + Y+ + + + G + + + +PP+ Q +I +++ Sbjct: 1 MKNLALLKYIYYWLNTLP-----NNQTDGDHKRQWISNYANKLIPIPPLSVQTEIVKILD 55 Query: 384 VETARIDVLVEKI---EQSIVLLKE 405 T L ++ ++ +E Sbjct: 56 ALTTLTSELTSELILRQKQYEYYRE 80 >gi|237726583|ref|ZP_04557064.1| type I restriction-modification system [Bacteroides sp. D4] gi|229435109|gb|EEO45186.1| type I restriction-modification system [Bacteroides dorei 5_1_36/D4] Length = 189 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 61/187 (32%), Gaps = 16/187 (8%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL---------KP 277 VP+ W L + L+R + + + L+ + L Sbjct: 2 DVPNGWNWCKLNDLCSFLSRGKSPKYSEDDKTYPVFAQKCNLKEGGISLEQARFLDPSTI 61 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM---ERGII--TSAYMAVKPHGIDSTY 332 +++ + G+++ R+ + ++ + + I+S Y Sbjct: 62 NKWDSKYKLQTGDVLVNSTGTGTVGRTRLFDESYLGKYPFVVPDSHVAVVRTYEEINSEY 121 Query: 333 LAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + M S + + GS ++ L ++ L PPI EQ I I + +D Sbjct: 122 VFAYMSSQLIQQYIEDNLAGSTNQKELYIGVLENLYFPFPPINEQQRIVQKIEELFSVLD 181 Query: 391 VLVEKIE 397 + +E Sbjct: 182 NIQNALE 188 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 61/181 (33%), Gaps = 17/181 (9%) Query: 20 AIPKHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLE----DVESGTGKYLPKDGN--SRQ 72 +P W + + G++ + +D + +++ G S Sbjct: 2 DVPNGWNWCKLNDLCSFLSRGKSPKYSEDDKTYPVFAQKCNLKEGGISLEQARFLDPSTI 61 Query: 73 SDTSTVSIFAKGQILYGKLGP-------YLRKAIIADFDGIC--STQFLVLQPKDVLPEL 123 + + G +L G ++ + + + S +V +++ E Sbjct: 62 NKWDSKYKLQTGDVLVNSTGTGTVGRTRLFDESYLGKYPFVVPDSHVAVVRTYEEINSEY 121 Query: 124 LQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + ++ S + Q IE G+T + N+ P PP+ EQ I +KI +D Sbjct: 122 VFAYMSSQLIQQYIEDNLAGSTNQKELYIGVLENLYFPFPPINEQQRIVQKIEELFSVLD 181 Query: 183 T 183 Sbjct: 182 N 182 >gi|313894106|ref|ZP_07827672.1| type I restriction modification DNA specificity domain protein [Veillonella sp. oral taxon 158 str. F0412] gi|313441670|gb|EFR60096.1| type I restriction modification DNA specificity domain protein [Veillonella sp. oral taxon 158 str. F0412] Length = 401 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 42/423 (9%), Positives = 119/423 (28%), Gaps = 50/423 (11%) Query: 26 KVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + +K K TG+ + + + + +Y + Sbjct: 2 EYRKLKTLAKYPTGKLNSNAAVEDGEYPFFTCAHDIYRIDQYSYDGEYVLLGGNNA---- 57 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G + + ++QP + + +++A Sbjct: 58 -SGDF----------PIFYYNGKFDAYQRTYLIQPLSEDTDTKYLYYSIGLKLHQMKANA 106 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + NI + + EQ I + + A I+ + ++LL++ ++ Sbjct: 107 SGTATKFLTQPILNNINIEYRDIEEQKRIADILSAYDNLIEN----NNKRMKLLEQMAES 162 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L + P + + +G +P + + ++ E + L Sbjct: 163 LYKEWFVRFRFPGYEDVEFVGSSLGKLPSTFNIVKIGTVIEYYIGGGWGEEELSELFPEE 222 Query: 262 GNIIQKLETRNMGL----------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 +++ + N+ S + +I F + RS + Sbjct: 223 AYVVRGTDFPNVKYGILDSCPLRYHKSSNYNQRAFKVNDIAFEVSGGTQKQPVGRSILIT 282 Query: 312 ER----------GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 ER + + Y ++ ++ + F+ Sbjct: 283 ERQLDRFNNRLICASFCKLIRCNIKKVSPRYFYHWLQYLYETRIIEQYQLQSTGIINFKF 342 Query: 362 ---VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +++ +++PP I + I ++ + + L +R + ++G+ Sbjct: 343 EYFLRKCNLMIPPKD----IMDKFTESVKPIYDEIDNLAEQNSKLIAQRDMLLPRLMSGK 398 Query: 419 IDL 421 +++ Sbjct: 399 LEV 401 >gi|25028884|ref|NP_738938.1| putative type I restriction-modification system subunit S [Corynebacterium efficiens YS-314] gi|259507946|ref|ZP_05750846.1| type I restriction-modification system subunit S [Corynebacterium efficiens YS-314] gi|23494171|dbj|BAC19138.1| putative type I restriction-modification system subunit S [Corynebacterium efficiens YS-314] gi|259164441|gb|EEW48995.1| type I restriction-modification system subunit S [Corynebacterium efficiens YS-314] Length = 385 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 59/413 (14%), Positives = 121/413 (29%), Gaps = 52/413 (12%) Query: 27 VVPIKRFTKL---NTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS--- 79 + N GRT S I I V+ + + T Sbjct: 6 QRRLTDLLSFIVDNRGRTCPTSETGIPLIATNCVKDDELYPVFEKVRFVDETTYETWFRA 65 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G IL+ G R A++ D C L + P V L L S + Sbjct: 66 HPEPGDILFVCKGSPGRTALVPDPVSFCIAQDMVALRVDPTVVNNRYLYYMLQSQKTRHQ 125 Query: 137 IEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 IE + G + H + + + L EQ I E + A +I E L Sbjct: 126 IENMHVGTMIPHFKKGDFPKLVLSVHADLGEQQAIAEVLGALDDKIAANSACIRLIDEHL 185 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + + + +E +G ++ E + K + + Sbjct: 186 AAEYERTLQQGEV-------------VEELG-------------VIAEFHNKRRIPLSAK 219 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 G + + G E+ +V +V + N + + Sbjct: 220 QRDERPGAVPYYGASGVFGYVNEAIFDEPLV----LVGEDGSVINSDGTPVIQYIWGPSW 275 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + A+K + + L + +R + + ++ + ++KRL + +P + Sbjct: 276 VNNHAHALKGKLVSTELLYYAIRRSQVSTLV---TGAVQPKINMGNLKRLQLALPAPE-- 330 Query: 376 FDITNVINVETARIDVLVEKI--EQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + + E + K L R + + ++G + ++ + Sbjct: 331 ----SRTSTEAIIAAEVAAKRAFTTENRTLVATRDALLPQLMSGNLRVKDAEK 379 >gi|42528240|ref|NP_973338.1| type I restriction-modification system, S subunit, truncation [Treponema denticola ATCC 35405] gi|41819510|gb|AAS13257.1| type I restriction-modification system, S subunit, truncation [Treponema denticola ATCC 35405] Length = 162 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 43/162 (26%), Gaps = 2/162 (1%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + + ++ + + S K+ ++ + T+ I Sbjct: 1 MWCRLGEICSITMGQSPESSFISNNSDGMEFHQGKIHFTEKYIQKANNYTFNITKIA--P 58 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 I L I +V P + + Sbjct: 59 KNAILLCVRAPVGVVNITEREICIGRGLCSVYPKYRIQSEFWFYWLQCQKDTFEQKSTGT 118 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 Q++ E +K + + +PP EQ I I A++D + Sbjct: 119 TFQAISIELIKNILIPLPPSSEQKRIVAKIEELFAQLDSITA 160 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 48/158 (30%), Gaps = 3/158 (1%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS-RQSDTSTVSIFAKGQ 85 + + G++ ES + G + K + I K Sbjct: 2 WCRLGEICSITMGQSPESSFISNNSDGMEFHQGKIHFTEKYIQKANNYTFNITKIAPKNA 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL P I + + + PK + + L E G T Sbjct: 62 ILLCVRAPV-GVVNITEREICIGRGLCSVYPKYRIQSEFWFYWLQCQ-KDTFEQKSTGTT 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + I NI +P+PP +EQ I KI ++D+ Sbjct: 120 FQAISIELIKNILIPLPPSSEQKRIVAKIEELFAQLDS 157 >gi|269978356|gb|ACZ55912.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 226 Score = 70.6 bits (171), Expect = 5e-10, Method: Composition-based stats. Identities = 20/175 (11%), Positives = 64/175 (36%), Gaps = 11/175 (6%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID------LQNDK 302 ++ + + ++ N Q ++ E + G+++F + Sbjct: 40 SQGNKFYVPYVNVFNNPQLDLNALESVQIGDKEKQNTIQLGDVLFTGSSENLEDCAMSCV 99 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 + + + + + + + ++L +R Y+ K + +G R ++ + Sbjct: 100 VTQKIEKDIYLNSFCFGFRFFDKNLFNPSFLKHFLRDYNFRKNISKVANGVTRFNVSKQL 159 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + ++ + +PP++ Q +I +++ + L+ I I K+ R + Sbjct: 160 LSQITIPIPPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 214 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 53/171 (30%), Gaps = 15/171 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK + + G +S K + Y+ +V + L + + D Sbjct: 13 PKGVGFRKLGDIGEFYGGLVGKSKKSFSQGNKFYVPYVNVFNNPQLDLNALESVQIGDKE 72 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD----------FDGICSTQFLVLQPKDVLPELLQG 126 + G +L+ L ++ + F P L+ Sbjct: 73 KQNTIQLGDVLFTGSSENLEDCAMSCVVTQKIEKDIYLNSFCFGFRFFDKNLFNPSFLKH 132 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 +L + + I + G T + + + I +PIPPL Q I + + Sbjct: 133 FLRDYNFRKNISKVANGVTRFNVSKQLLSQITIPIPPLEIQQEIVKILDQF 183 >gi|331085646|ref|ZP_08334729.1| hypothetical protein HMPREF0987_01032 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406569|gb|EGG86074.1| hypothetical protein HMPREF0987_01032 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 375 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 48/374 (12%), Positives = 98/374 (26%), Gaps = 23/374 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + T + G I G+Y+ +G S T S I Sbjct: 3 VKLSDITHYSKGSQINREDLI----------DNGEYIYLNGGINPSGRWTASNVDANTIT 52 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 + G C L + + +R+ AI GA M Sbjct: 53 ISEGGNSSGYINYITEPFWCGAHCYYLFDGPKNTK--YLYYALKSQQERLFAIRSGACMP 110 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + +G ++ +++ T I + + ++ L + + V Sbjct: 111 NIKKADLGKFEFEFDYDEKKQDEIVSVLSSTENIINNRKKELEKLDELIRARFIELFGDV 170 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 G+ K + VG + + I L+ G Sbjct: 171 GTGVFNYETYKLGDVAKVG-------SSHRVFTTEFVESGIPFYRGTEIGELANGQKPSD 223 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + G+++ I + + + + ++ Sbjct: 224 PYYISEEHYVRLASDDTEPKVGDLLMPSICNKGQVWLVDTEEPFYYKDGRVLCISPDRTV 283 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV--- 384 +S +L + MR L + K +K + VLVPPI+ Q + Sbjct: 284 FNSKFLQYFMREKTLIEYPKMGSGSTFAEFKIFLLKDMDVLVPPIELQEQFADFAQATDK 343 Query: 385 -ETARIDVLVEKIE 397 + + + + Sbjct: 344 SKFQKYNATILSHN 357 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 21/141 (14%), Positives = 46/141 (32%), Gaps = 12/141 (8%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 N G+ P T VD I S + E + + Sbjct: 28 YIYLNGGINPSGRWTASNVDANTITISEGGNS----SGYINYITEPFWCGAHCYYLFDGP 83 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE--QFDITNVINVE 385 ++ YL + ++S ++F ++K D+ + E Q +I +V+ Sbjct: 84 KNTKYLYYALKSQQ-ERLFAIRSGACMPNIKKADLGKFEFEF-DYDEKKQDEIVSVL--- 138 Query: 386 TARIDVLVEKIEQSIVLLKER 406 + + ++ ++ + L E Sbjct: 139 -SSTENIINNRKKELEKLDEL 158 >gi|270296268|ref|ZP_06202468.1| type I restriction-modification system S subunit [Bacteroides sp. D20] gi|270273672|gb|EFA19534.1| type I restriction-modification system S subunit [Bacteroides sp. D20] Length = 96 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 37/95 (38%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + ++ + A + + + P Y+ + ++S ++F +G+ Sbjct: 1 MMCIEGGSAGRKIAILNQDVCFGNKLCCFSPFVGIGKYMYYYLQSPSFFELFNLNKTGII 60 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + VK + + +PPIKEQ I I ++ Sbjct: 61 GGVSIAKVKEILIPLPPIKEQQRIVAQIEKLFEQL 95 >gi|254369302|ref|ZP_04985314.1| type I site-specific deoxyribonuclease [Francisella tularensis subsp. holarctica FSC022] gi|157122252|gb|EDO66392.1| type I site-specific deoxyribonuclease [Francisella tularensis subsp. holarctica FSC022] Length = 776 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 52/153 (33%), Gaps = 3/153 (1%) Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 I L +I + Y+ +++ G ++ + L + Sbjct: 612 NYASDGIRYLKVSDIKDNYINNDKPFYVNKYKESDLIEKGTLLITRKGTVGNSYYL--DK 669 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVL 368 + ++ ++ YL+ + S + K + +G SL +K + + Sbjct: 670 DGSFVASSEIFIIKLNDKVNGNYLSEINLSSFVKKQYREKSTGTIMPSLSQPKLKSILIP 729 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 +PP++ Q I I I L ++ EQ+ Sbjct: 730 LPPLEIQNHIAVRIQKLKDYIKALEQQAEQNRE 762 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 61/175 (34%), Gaps = 7/175 (4%) Query: 33 FTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 F LN G + + I Y+ + D++ Y+ D + + KG +L + Sbjct: 601 FVSLNNGIAARNYASDGIRYLKVSDIKDN---YINNDKPFYVNKYKESDLIEKGTLLITR 657 Query: 91 LGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAICEGATMSH 148 G + D + S++ +++ D + + ++ G M Sbjct: 658 KGTVGNSYYLDKDGSFVASSEIFIIKLNDKVNGNYLSEINLSSFVKKQYREKSTGTIMPS 717 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + +I +P+PPL Q I +I I L + + E +A + Sbjct: 718 LSQPKLKSILIPLPPLEIQNHIAVRIQKLKDYIKALEQQAEQNRENALRNFEAEI 772 >gi|327460987|gb|EGF07320.1| type I restriction-modification system specificity determinant [Streptococcus sanguinis SK1057] Length = 352 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 53/394 (13%), Positives = 106/394 (26%), Gaps = 54/394 (13%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + + + + R + Y+ E++ S G S F KG IL Sbjct: 6 LSQVSSYVSERIRIDEVNLDNYVSTENMISERGGVTKATKLPSGKTISA---FQKGDILI 62 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQRIEAICEGATMS 147 + PY +K +A G CS LV++ + + L L S + +G M Sbjct: 63 SNIRPYFKKIWLAGKSGGCSNDVLVVRANEKISNRFLYYVLSSDNFFDYAVGTSKGTKMP 122 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 D K I +PI L EQ I E + A +I Sbjct: 123 RGDKKAIMKYEVPIYSLVEQEKIAEVLRAFDKKII------------------------- 157 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + +H + ++ E K + Sbjct: 158 -----------------LNKQINHHLEQIALSIFKEEFSKKEVTNKLGDFFPVITGKKDA 200 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + S L + + P+ Sbjct: 201 NIAKGGEYPFFSCSQNISYTDNYSFDARAILLAGNGDFNVKIFNGKFEAYQRTYVLIPNN 260 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + Y L + + + + ++ + + KE + + + Sbjct: 261 DEHFGYLYYAIKYFLKDITSGHRGSVIKFITKGQIEHFDIFMTSNKE------KLFLFNS 314 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ + K + I L R + + ++G+I + Sbjct: 315 FVEN-IAKNNKEIDKLTNIRDTLLPKLLSGEISV 347 >gi|253569682|ref|ZP_04847091.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251840063|gb|EES68145.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 393 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 59/406 (14%), Positives = 123/406 (30%), Gaps = 40/406 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 WK + F + + + K + I + + K + S + Sbjct: 9 EWKESVLSDFVERVKRKNKNNLCKLPLTISAQYGLVDQISFFNKVIA--SENMSNYYLLH 66 Query: 83 KGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELL--QGWLLSIDVTQ 135 KG Y K G S+ ++ +P + + S Sbjct: 67 KGDFAYNKSYSSEYPWGAIKRLDCYEQGTLSSLYICFKPYSHVSSDFLTHYFETSKWHQG 126 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 E EGA GI + L + +L +EKI I+ I + + IE Sbjct: 127 ISEIAVEGARNHGLLNVGIQDFFETRHCLPQSLLEQEKIAKFLNLIEERIATQNKIIEKY 186 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + QA++ G+ W+ ++ E KNT Sbjct: 187 ESLIQAIIYQKKAAGIRKG----------------DWQKTELSNVLKERIEKNTNGYIIC 230 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS-AQVMERG 314 +S+S G +I ++E + Y +V G+IV+ + + + + + Sbjct: 231 SVSVSQG-VINQIEYLGRSFAAKETLHYNVVKYGDIVYTKSPTGDFPYGIVKRSYIKDDV 289 Query: 315 IITSAYMAVKP-HGIDSTYLAWLMRSY-----DLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 ++ Y P + L + L + ++ E + + Sbjct: 290 AVSPLYGVYMPVNDYIGVILHFYFMQPSNAFNYLHPLIQKGAKNTI-NITNERFLKNSIP 348 Query: 369 VPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +P + E I N + +ID ++ + ++ + ++ Sbjct: 349 LPKTENEAIYIANTLISIQKKID----MEKKMLWSYEKEKQYLLSK 390 >gi|188524184|ref|ZP_03004248.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 12 str. ATCC 33696] gi|195660056|gb|EDX53436.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 12 str. ATCC 33696] Length = 392 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 42/390 (10%), Positives = 112/390 (28%), Gaps = 11/390 (2%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +V I K+ G T + + ++ +++ + L + SR + I Sbjct: 3 IVNIGSICKIIGGSTPSTKNNNLW--KKEIPFYSLADLLINVASRYISIENNKFIDEPAI 60 Query: 87 LYGKLGPYLRKAIIADFDGICST-QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 L+ + + + + + +VL + + +G+ Sbjct: 61 LFSSTATIGNVCYVEEKCWFNDQIKAFISKDSNVLNTKYLYYWFLNNKHIIKSQANKGSV 120 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL---ITERIRFIELLKEKKQAL 202 S K + N+ + +P + EQ I I + Sbjct: 121 FSSIGIKELVNMKINLPSIEEQNAIISIIEPHEKLFVKYSNLVDISSVENAKKDVDNLIS 180 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + K + ++ ++ + + + E + K+ I+ + + Sbjct: 181 IIEPIEKSIKTINLLQTKIGLFIEKTFNFINDNLVNSDLIEFSLKDLLNIKRGLPITAKD 240 Query: 263 --NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N + K Y + I + + + Sbjct: 241 LLNNPGSYPLISASSKNNGIFGYFNDYMYDGQNITISMNGNAGCIFYQIGKFSANSDVLV 300 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 ++ + + + + ++ R L +++ VL+P I+ Q + Sbjct: 301 LSNSNKNLTNIDYIYYLLKTKEKEIQNLAIGTTRFRLGNSVIEKFKVLLPNIEIQEKFSK 360 Query: 381 VINVETARIDVLVEKIEQSIV--LLKERRS 408 ++ + KIE+++ LLK + Sbjct: 361 IVEPLL-NLSTKANKIEKNLNECLLKIVKK 389 >gi|331666163|ref|ZP_08367044.1| type I restriction enzyme EcoAI specificity protein (S protein)(S.EcoAI) [Escherichia coli TA271] gi|331066374|gb|EGI38251.1| type I restriction enzyme EcoAI specificity protein (S protein)(S.EcoAI) [Escherichia coli TA271] Length = 405 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 63/201 (31%), Gaps = 9/201 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE---TRNMGLK 276 S E +PD WE + + + E I + I K + + Sbjct: 93 SEEEKPFELPDGWEWTTLTRIAEINPKIDVSDDEQEISFIPMPLISTKFDGSHEFEIKKW 152 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHGIDSTY 332 + + Y G+I I + ++ GI I+ Y Sbjct: 153 KDVKKGYTHFANGDIAIAKITPCFENSKAAIFSGLKNGIGVGTTELHVARPFSDIINRKY 212 Query: 333 LAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L +S + K GS ++ + + P+ PP++EQ I + D Sbjct: 213 LLLNFKSPNFLKSGESQMTGSAGQKRVPRFFFENNPIPFPPLQEQERIIIRFTQLMSLCD 272 Query: 391 VLVEKIEQSIVLLKERRSSFI 411 L ++ S+ ++ + + Sbjct: 273 QLEQQSLTSLDAHQQLVETLL 293 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 72/213 (33%), Gaps = 12/213 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESG 59 +K K P+ S + +P W+ + R ++N + +I +I + + + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPDGWEWTTLTRIAEINPKIDVSDDEQEISFIPMPLISTK 140 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK------AIIADFDGICSTQFLV 113 + + + FA G I K+ P + + + G+ +T+ V Sbjct: 141 FDGSHEFEIKKWKDVKKGYTHFANGDIAIAKITPCFENSKAAIFSGLKNGIGVGTTELHV 200 Query: 114 LQPKDVLPELLQ---GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 +P + + + + A N P+P PPL EQ I Sbjct: 201 ARPFSDIINRKYLLLNFKSPNFLKSGESQMTGSAGQKRVPRFFFENNPIPFPPLQEQERI 260 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + D L + + ++ ++ + L+ Sbjct: 261 IIRFTQLMSLCDQLEQQSLTSLDAHQQLVETLL 293 >gi|296277174|ref|ZP_06859681.1| type I restriction-modification system S subunit [Staphylococcus aureus subsp. aureus MR1] Length = 210 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 81/216 (37%), Gaps = 14/216 (6%) Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+ + + + G WE + E N ++ + Sbjct: 2 LLQQQKKGYMQKIFSQELRFKDENGEDYPDWENSKIEKYLKERNERSD--KGQMLSVTIN 59 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 II+ E + Y++V +I + + + + GI++ AY Sbjct: 60 SGIIKFSELDRKDNSSKDKSNYKVVRKNDIAYNSMRMWQGASGKSNY----NGIVSPAYT 115 Query: 322 AVKPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVPPIKEQFD 377 + P S+ + +++ + F GL +LK++ +K + + +P ++EQ Sbjct: 116 VLYPTQNTSSLFIGYKFKTHRMIHKFKINSQGLTSDTWNLKYKQLKNINIDIPVLEEQEK 175 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + ++D+L+ K + I +L++ + SF+ Sbjct: 176 IGDF----FKKMDILISKQKMKIEILEKEKQSFLQK 207 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 62/183 (33%), Gaps = 7/183 (3%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ I+++ K R+ + + I ++ ++ D S + K Sbjct: 31 DWENSKIEKYLKERNERSDKGQMLSVTINSGIIKFSELD----RKDNSSKDKSNYKVVRK 86 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I Y + + + ++++GI S + VL P L G+ I Sbjct: 87 NDIAYNSMRMWQGASGKSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQ 146 Query: 144 ---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + +K + NI + IP L EQ I + + I + + + Q Sbjct: 147 GLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKMKIEILEKEKQSFLQ 206 Query: 201 ALV 203 + Sbjct: 207 KMF 209 >gi|301793726|emb|CBW36113.1| type I restriction-modification system M protein [Streptococcus pneumoniae INV104] Length = 425 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 59/413 (14%), Positives = 131/413 (31%), Gaps = 64/413 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDS-----------------------------------GIEWVGLVPDHWEVKP 236 +S + G +P +W V Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYGNIPMNWVVIK 251 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ----------IV 286 + + + K + +I + II+ + + + Y + Sbjct: 252 IKDIFSINTGLSYKKGDLSINN-KGVRIIRGGNIKPLEFSLLDNDYYIDTQFISSEQVYL 310 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWLMRSYDL 342 +++ G++ ++ + I S +L + + S Sbjct: 311 KHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNLSSPLF 370 Query: 343 CKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 K + ++ + L + + P +EQ IT + +++ L Sbjct: 371 YKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 423 Score = 69.8 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 241 GNIPMNWVVIKIKDIFSINTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 300 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 301 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 360 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 361 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 420 Query: 182 DTLI 185 + L Sbjct: 421 NQLW 424 >gi|184154001|ref|YP_001842342.1| type I restriction-modification system S subunit [Lactobacillus reuteri JCM 1112] gi|183225345|dbj|BAG25862.1| type I restriction-modification system S subunit [Lactobacillus reuteri JCM 1112] Length = 342 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 60/389 (15%), Positives = 126/389 (32%), Gaps = 49/389 (12%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +V +K G ++ KD+ + +G+Y P G + + + + Sbjct: 2 IVKLKDVC--IKGTSNIRQKDV---------NDSGRY-PVYGAAGPVGFMNSFQYDEPYV 49 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 K G + +A + L PK + + +S +E GAT+ Sbjct: 50 GVVKDGAGIGRATYLPSNSSIIGTMQALIPKKNVLPKYLYYAVSS---MHLEKYYSGATI 106 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 H +K + + EQ II ++ +I+ + + + L E +A Sbjct: 107 PHIYFKNYKHERFVLVSKKEQEQ----IIWRFSLLEKMISNKQQQLLKLDELIKA---RF 159 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 V +P + K+ + +G + T + + N GN I+ Sbjct: 160 VEMFGDPIINNKNIKKKKLGDI-----CLLKAGDFTPSKKISPVKTSINKYPCFGGNGIR 214 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 S Q G + F +N + ++ + +E Sbjct: 215 GYVDNYTHQGNYSLIGRQGALCGNVKFATGKFRNTEHAILVSPNIE-------------- 260 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 I+S +L L+ L K+ + L + + + V V + Q + N + Sbjct: 261 -INSRWLFELLN---LEKLNRFRSGAAQPGLAVKTLNEIIVPVADLNSQNEYANFV---- 312 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++D I++S+ ++ S + Sbjct: 313 QQVDKSKVVIQKSLDETQKLYDSLMQEYF 341 >gi|313158258|gb|EFR57660.1| conserved hypothetical protein [Alistipes sp. HGB5] Length = 95 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 37/94 (39%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + ++ + A + + + P Y+ + ++S ++F +G+ Sbjct: 1 MCIEGGSAGRKIAILNQDVCFGNKLCCFSPFVGIGKYMYYYLQSPSFFELFNLNKTGIIG 60 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + VK + + +PPIKEQ I I ++ Sbjct: 61 GVSIAKVKEILIPLPPIKEQQRIVAQIEKLFEQL 94 >gi|139438845|ref|ZP_01772305.1| Hypothetical protein COLAER_01309 [Collinsella aerofaciens ATCC 25986] gi|133775556|gb|EBA39376.1| Hypothetical protein COLAER_01309 [Collinsella aerofaciens ATCC 25986] Length = 520 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 59/203 (29%), Gaps = 13/203 (6%) Query: 227 LVPDHWEVKPFF--ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +P+ W + K S + ++ NI Q E Sbjct: 92 ELPEGWAWARLETVYNFIDYRGKTPHKSPSGVRLMTASNIRQGYIDYTREEYISEDEYAT 151 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG----IDSTYLAWLMRSY 340 + GE + + A + + + D+ ++ S Sbjct: 152 RLSRGETHRGDLLFTTEAPMGYCAICEMKRCSCGQRVITLQNYGTVGPDNALFCQIILSP 211 Query: 341 DLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +G + +K +K L + +PP+ EQ I +N ++ ++E + Sbjct: 212 LFQIQVKDHATGTTAKGIKAAVLKELFLPIPPLAEQRRIVERVNELMPLVEEY-GELEDA 270 Query: 400 IVLLKE-----RRSSFIAAAVTG 417 L R S + AV G Sbjct: 271 REELDAALPGRLRKSVLQLAVQG 293 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 73/209 (34%), Gaps = 12/209 (5%) Query: 20 AIPKHWKVVPIKRFTKLN--TGRTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +P+ W ++ G+T +S + + ++ G Y ++ S + Sbjct: 92 ELPEGWAWARLETVYNFIDYRGKTPHKSPSGVRLMTASNIRQGYIDYTREEYISEDEYAT 151 Query: 77 TVSI--FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK---DVLPELLQGWLLSI 131 +S +G +L+ P AI C + + LQ L +LS Sbjct: 152 RLSRGETHRGDLLFTTEAPMGYCAICEMKRCSCGQRVITLQNYGTVGPDNALFCQIILSP 211 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +++ G T + + +PIPPLAEQ I E++ ++ Sbjct: 212 LFQIQVKDHATGTTAKGIKAAVLKELFLPIPPLAEQRRIVERVNELMPLVEEYGELEDAR 271 Query: 192 IE----LLKEKKQALVSYIVTKGLNPDVK 216 E L +++++ V GL P Sbjct: 272 EELDAALPGRLRKSVLQLAVQGGLVPQDP 300 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 52/135 (38%), Gaps = 8/135 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL-PKDGNSRQSDTSTV 78 IP+ W+ + LN G+ + YI + +++ K N+ + + Sbjct: 367 EIPESWEWRRLGSLV-LNRGQKRPEAR-FAYIDISSIDNVNQKLGQETVINAADAPSRAR 424 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLV-LQPKDVLPELLQGWLLSIDV 133 + AK +LY + PYL A I D D I ST F V LP L +L+S Sbjct: 425 KLVAKNDVLYATVRPYLHNACIVDKDFNIKPIASTGFAVLSCLDGFLPSFLLYFLVSPSF 484 Query: 134 TQRIEAICEGATMSH 148 A +++ Sbjct: 485 DSYANANENAKGVAY 499 >gi|306826263|ref|ZP_07459597.1| type I restriction enzyme specificity protein HsdS [Streptococcus sp. oral taxon 071 str. 73H25AP] gi|304431539|gb|EFM34521.1| type I restriction enzyme specificity protein HsdS [Streptococcus sp. oral taxon 071 str. 73H25AP] Length = 398 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 50/400 (12%), Positives = 113/400 (28%), Gaps = 56/400 (14%) Query: 26 KVVPIKRFTKLNT-----GRTSESGKDIIYI---------GLEDVESGTGKYLPKDGNSR 71 + + T G + + K++ YI D++SG + Sbjct: 13 EWKELWEVCDTVTDFTAAGSFASNAKNVKYIQEASFAQLVRTTDLKSGFKGNNFVYVDEH 72 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQG 126 + + ++ +G I + + L+++ L Sbjct: 73 AFNYLYRVNLDQESLVMPNVGNCGEIYYIEPENLPYENNVLGPNALLVRSSKENNRYLFH 132 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S + I + + + I +PIPP Q I + + T + L + Sbjct: 133 LFQSGQFQNELAKITSNTGQTKYNKTNLKKIRIPIPPQEIQEKIVQILDKFTDYVTELTS 192 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 E + + L+S+ +KD G + Sbjct: 193 ELTSRKKQYSFYRDKLLSFEDEVYQVEWKVLKDVATLKNG-------------------K 233 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 L I G + E + P ++ R + N + Sbjct: 234 DWKTLPSGEIPVYGSGGEMG-----------EFVADHSYDKPTVLIPRKGSISNLFYLEK 282 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + ++ Y + I Y + + + K+ + R SL + ++ Sbjct: 283 AFWNVDTVY----YTEIDDEQIIPKYFYYYLTT---VKLEEMATNPTRPSLTQAILDKIR 335 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + VP ++ Q I V++ + L + + L +++ Sbjct: 336 IPVPSLEIQSRIVQVLDNFDKVCNDLNIGLPRENELRQKQ 375 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 51/150 (34%), Gaps = 2/150 (1%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV-MERGIITSAYMAV 323 + + +Y +D +V + + + + E ++ + V Sbjct: 61 FKGNNFVYVDEHAFNYLYRVNLDQESLVMPNVGNCGEIYYIEPENLPYENNVLGPNALLV 120 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + ++ YL L +S + S + ++K++ + +PP + Q I ++ Sbjct: 121 RSSKENNRYLFHLFQSGQFQNELAKITSNTGQTKYNKTNLKKIRIPIPPQEIQEKIVQIL 180 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + T + L ++ R ++ Sbjct: 181 DKFTDYVTELTSELTSRKKQYSFYRDKLLS 210 >gi|296277375|ref|ZP_06859882.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus MR1] Length = 196 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 51/182 (28%), Gaps = 6/182 (3%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK----LE 269 +++ G E + + I L NI + Sbjct: 1 MPELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLND 60 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + + G+++ + ++ S + + Sbjct: 61 LVYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYY 120 Query: 330 STYLA-WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETA 387 + +L+ K+F A G R+ L F+++ L + P I +EQ I + Sbjct: 121 YNFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGQFFSKLDQ 180 Query: 388 RI 389 +I Sbjct: 181 QI 182 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 61/171 (35%), Gaps = 13/171 (7%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 12 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 71 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 72 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVRI 181 ++I G + ++K I N+ + P + EQ I + +I Sbjct: 132 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGQFFSKLDQQI 182 >gi|319428171|gb|ADV56245.1| restriction modification system DNA specificity domain protein [Shewanella putrefaciens 200] Length = 396 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 41/406 (10%), Positives = 105/406 (25%), Gaps = 55/406 (13%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + G+ + ++ ++ + ++ Sbjct: 17 EWKELGNSINFQRGKRLVKSQLEESGEYAVFQNSMTPLGYYHESNVSAKSA--------- 67 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + G D + Q + + + + + +I + A+ Sbjct: 68 FVIC-AGAAGEIGFSDDSFWAADDVYYAEQSEILNSK--YLYHFLLTQKHKIASQVRRAS 124 Query: 146 MSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + I + +PIP LA Q I + A T L E + Sbjct: 125 IPRLSKSAIEKLIVPIPCPDNPEKSLAIQAEIVRILDAFTAMTAELTAELNMRKKQYNYY 184 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + L+S ++ +EW + + + + + Sbjct: 185 RDQLLS------------FEEGEVEW----------RALSEMAEYSKARISYTELDDSNY 222 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + +++Q + + +P +I+ I K G + Sbjct: 223 VGVESLLQNRAGKIDSTRTPDSGNLTQYNPDDILIGNIRPYLKKIWHADRVGGTNGDV-- 280 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP------- 370 + I+ YL ++ + G + V +P Sbjct: 281 LVVHPTDTAINPRYLYQVLADDKFFEYNMQHAKGAKMPRGNKPKIMEYLVPIPFASNREK 340 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + EQ I +++ + E + + I L ++ R ++ Sbjct: 341 SLSEQERIVTILDKFDTLTSSITEGLPREIELRQKQYEYYRDLLLS 386 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 44/219 (20%), Positives = 74/219 (33%), Gaps = 26/219 (11%) Query: 1 MKHYKAYPQYKD---SGVQWIGAIPKHWKVVPIKRFTKLNT-GRTSESGKDIIYIGLEDV 56 M+ K Y Y+D S + G + + + + + + D Y+G+E + Sbjct: 176 MRK-KQYNYYRDQLLSFEE--GEV----EWRALSEMAEYSKARISYTELDDSNYVGVESL 228 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + + + IL G + PYL+K AD G + LV+ P Sbjct: 229 LQNRAGKIDSTRTPDSGNLTQY---NPDDILIGNIRPYLKKIWHADRVGGTNGDVLVVHP 285 Query: 117 KD--VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQ 167 D + P L L + +GA M + I +PIP L+EQ Sbjct: 286 TDTAINPRYLYQVLADDKFFEYNMQHAKGAKMPRGNKPKIMEYLVPIPFASNREKSLSEQ 345 Query: 168 VLIREKIIAE---TVRIDTLITERIRFIELLKEKKQALV 203 I + T I + I + E + L+ Sbjct: 346 ERIVTILDKFDTLTSSITEGLPREIELRQKQYEYYRDLL 384 >gi|256854681|ref|ZP_05560045.1| predicted protein [Enterococcus faecalis T8] gi|256710241|gb|EEU25285.1| predicted protein [Enterococcus faecalis T8] Length = 211 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 69/189 (36%), Gaps = 11/189 (5%) Query: 231 HWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 +WE+ + ++ KN E+ S YG I Q++ + +Y +V Sbjct: 23 YWELCKLSDISDKVKEKNKHGKFTETLTNSAEYGIINQRVFFDKDISNVNNLNSYYVVQN 82 Query: 289 GEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 + V+ I ++ ++ G+++ Y + H ID+ YL + Sbjct: 83 DDFVYNPRISNFAPVGPIKRNRLGRTGVMSPLYYVFRTHSIDNNYLEKYFDTVYWHHFME 142 Query: 348 AMGSGL----RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 G R ++K +P+ P +EQ I ++D + + + L Sbjct: 143 LNGDTGARADRFAIKDSIFVEMPIPYPSTEEQQKIGIF----FKKLDQSITLYKNKLNQL 198 Query: 404 KERRSSFIA 412 K + +++ Sbjct: 199 KALKKAYLQ 207 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 59/189 (31%), Gaps = 8/189 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W++ + + + + + E + KD ++ + ++ + Sbjct: 24 WELCKLSDISDKVKEKNKHGKFTETLTNSAEYGIINQRVFFDKDISNVNN-LNSYYVVQN 82 Query: 84 GQILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +Y + G+ S + V + + L+ + ++ +E Sbjct: 83 DDFVYNPRISNFAPVGPIKRNRLGRTGVMSPLYYVFRTHSIDNNYLEKYFDTVYWHHFME 142 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + +I + +P ++KI ++D IT + LK Sbjct: 143 LNGDTGARADRFAIK-DSIFVEMPIPYPSTEEQQKIGIFFKKLDQSITLYKNKLNQLKAL 201 Query: 199 KQALVSYIV 207 K+A + + Sbjct: 202 KKAYLQNMF 210 >gi|321310233|ref|YP_004192562.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802077|emb|CBY92723.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 199 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 50/183 (27%), Gaps = 7/183 (3%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + +++ I ++ I + + I+ P Sbjct: 16 EDICKVQNGYSFASGKYRDSGHPIIRIGNIQDVGIQVDDFIYFWDEDYKEDLSRFILKPN 75 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++V K +L + P +D YL + + + Sbjct: 76 DLVITARGSCCGKVALNQTNRSFYLNQGVWRLDPNPEFLDKEYLFHFLLDSNFDFIVVK- 134 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G L K++ + VP + Q +I + +N I+ + ++ + Sbjct: 135 --GTIPRLNVNQFKKIKIPVPSLFTQREIASRLNK-FREIEREINLRDKQYEYYRNY--- 188 Query: 410 FIA 412 I Sbjct: 189 LIN 191 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 54/180 (30%), Gaps = 11/180 (6%) Query: 27 VVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS--TVSI 80 ++ K+ G + SGK I + +++ + + + I Sbjct: 12 ECSLEDICKVQNGYSFASGKYRDSGHPIIRIGNIQDVGIQVDDFIYFWDEDYKEDLSRFI 71 Query: 81 FAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 ++ G K + + + L P + + +D + Sbjct: 72 LKPNDLVITARGSCCGKVALNQTNRSFYLNQGVWRLDPNPEFLDKEYLFHFLLD--SNFD 129 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I T+ + I +P+P L Q I ++ + I+ I R + E + Sbjct: 130 FIVVKGTIPRLNVNQFKKIKIPVPSLFTQREIASRLN-KFREIEREINLRDKQYEYYRNY 188 >gi|307579266|gb|ADN63235.1| type I restriction-modification system specificity determinant [Xylella fastidiosa subsp. fastidiosa GB514] Length = 101 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 5/86 (5%) Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIE 397 ++ + G +Q+L E V+ L PP EQ +I ++I+ +ID Sbjct: 2 TWRYEDIRSLAHGGQQQNLNLEMVRDLLFATPPSHAEQDEIVSIIDAIDRKID----LHR 57 Query: 398 QSIVLLKERRSSFIAAAVTGQIDLRG 423 + +L++ S + +TG+I + Sbjct: 58 RKRHVLEDMSKSLLHKLMTGEISVSD 83 >gi|291320530|ref|YP_003515794.1| type I R/M system specificity subunit [Mycoplasma agalactiae] gi|290752865|emb|CBH40840.1| Type I R/M system specificity subunit [Mycoplasma agalactiae] Length = 395 Score = 70.2 bits (170), Expect = 7e-10, Method: Composition-based stats. Identities = 46/393 (11%), Positives = 109/393 (27%), Gaps = 23/393 (5%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + ++ I Y + ++ ++ + G + +D I + Sbjct: 19 WEQWKARGILLPYRQKNDKNLTLISYSVSNKEGFVDQKEFFDEGGKAVYADKKNSLIISF 78 Query: 84 GQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDV-LPELLQGWLLSIDVTQRIEAI 140 Y + + +G+ S + V + P+ + W S + + Sbjct: 79 DTFAYNPSRINVGSIALFKNTINGLVSPIYEVFKVSANSNPDFIYLWFKSECFNKIVANN 138 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + K + + +P L EQ I + + I + L Sbjct: 139 SNKSVRDTLNLKQFEDNLLNLPVLQEQNKIAKLFSSLDSLITLHQRKHSSLKNLKNR--- 195 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L+ + + ++ + +K +L + Sbjct: 196 -LLDKMFCDEKSQFPSIRFKEFTNAWEQEKLGNLTILNRFPQISAQKLWELNQYFGEVFL 254 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N S E +++ GE++ R+ V I + + Sbjct: 255 LP-----SSDNNNWKCKYSKEIANLINTGEVI-----TIGRARNPNVKYVNGTFISSQNH 304 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + K FY S D P + EQ I Sbjct: 305 IIESKTTDTLLNKFLYFFITKVGKKFYGFES-TYPMFTKIDFLNTKFSFPIVSEQIKIIK 363 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I++ +D L+ ++ + LK +++ + Sbjct: 364 TIDI----LDSLITLHQRKLNSLKNIKNTLLEK 392 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 67/190 (35%), Gaps = 7/190 (3%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG-NIIQKLETRNMGLKPESYETYQI 285 + WE ++ +KN K + S+S + + E + G K + Sbjct: 14 EFTNAWEQWKARGILLPYRQKNDKNLTLISYSVSNKEGFVDQKEFFDEGGKAVYADKKNS 73 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCK 344 + F + + + S+ + G+++ Y + + ++ +S K Sbjct: 74 LIISFDTFAYNPSRINVGSIALFKNTINGLVSPIYEVFKVSANSNPDFIYLWFKSECFNK 133 Query: 345 VF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + +R +L + + + +P ++EQ I + +D L+ ++ L Sbjct: 134 IVANNSNKSVRDTLNLKQFEDNLLNLPVLQEQNKIA----KLFSSLDSLITLHQRKHSSL 189 Query: 404 KERRSSFIAA 413 K ++ + Sbjct: 190 KNLKNRLLDK 199 >gi|332074782|gb|EGI85255.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17570] Length = 240 Score = 70.2 bits (170), Expect = 7e-10, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 67/185 (36%), Gaps = 14/185 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL------- 275 E +P+ WE + + + R + + + + ++ L Sbjct: 56 EVPCEIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPE 115 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKR-SLRSAQVMERGIITSA----YMAVKPHGIDS 330 SY+ +++ G++++ L R ++ G + + V I+ Sbjct: 116 TVHSYQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINC 175 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + S + V SG ++ L + +K + +PP+ EQ I + I A Sbjct: 176 HFIYNFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAH 235 Query: 389 IDVLV 393 I+ L+ Sbjct: 236 INALI 240 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 50/181 (27%), Gaps = 15/181 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-----GKYLPKDGNSRQSD 74 IP+ W+ V + T S +I + + Sbjct: 60 EIPESWEWVRLNDITSYIQRGKSPKYSNIPIYPVIAQKCNQWSGFSIDLARFIDPETVHS 119 Query: 75 TSTVSIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGW 127 + G +++ G R AI + + + V++ + + Sbjct: 120 YQKERLLRDGDLMWNSTGLGTLGRLAIYHENKNPYGWAVADSHVTVIRVLSGVINCHFIY 179 Query: 128 LLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + E K I +P+PPL EQ I +KI I+ L Sbjct: 180 NFLSSPIVQSVIEEKASGSTKQKELLTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHINAL 239 Query: 185 I 185 I Sbjct: 240 I 240 >gi|229547240|ref|ZP_04435965.1| type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecalis TX1322] gi|229307637|gb|EEN73624.1| type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecalis TX1322] Length = 225 Score = 70.2 bits (170), Expect = 7e-10, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 69/189 (36%), Gaps = 11/189 (5%) Query: 231 HWEVKPFFALVTELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 +WE+ + ++ KN E+ S YG I Q++ + +Y +V Sbjct: 37 YWELCKLSDISDKVKEKNKHGKFTETLTNSAEYGIINQRVFFDKDISNVNNLNSYYVVQN 96 Query: 289 GEIVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 + V+ I ++ ++ G+++ Y + H ID+ YL + Sbjct: 97 DDFVYNPRISNFAPVGPIKRNRLGRTGVMSPLYYVFRTHSIDNNYLEKYFDTVYWHHFME 156 Query: 348 AMGSGL----RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 G R ++K +P+ P +EQ I ++D + + + L Sbjct: 157 LNGDTGARADRFAIKDSIFVEMPIPYPSTEEQQKIGIF----FKKLDQSITLYKNKLNQL 212 Query: 404 KERRSSFIA 412 K + +++ Sbjct: 213 KALKKAYLQ 221 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 59/189 (31%), Gaps = 8/189 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W++ + + + + + E + KD ++ + ++ + Sbjct: 38 WELCKLSDISDKVKEKNKHGKFTETLTNSAEYGIINQRVFFDKDISNVNN-LNSYYVVQN 96 Query: 84 GQILYGKLGPYLRKAIIADFD-----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +Y + G+ S + V + + L+ + ++ +E Sbjct: 97 DDFVYNPRISNFAPVGPIKRNRLGRTGVMSPLYYVFRTHSIDNNYLEKYFDTVYWHHFME 156 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + +I + +P ++KI ++D IT + LK Sbjct: 157 LNGDTGARADRFAIK-DSIFVEMPIPYPSTEEQQKIGIFFKKLDQSITLYKNKLNQLKAL 215 Query: 199 KQALVSYIV 207 K+A + + Sbjct: 216 KKAYLQNMF 224 >gi|291542120|emb|CBL15230.1| Restriction endonuclease S subunits [Ruminococcus bromii L2-63] Length = 169 Score = 70.2 bits (170), Expect = 7e-10, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 57/163 (34%), Gaps = 9/163 (5%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRS 304 + ++Y N+ + + Q V G++ F D+ Sbjct: 11 KTKDDFGHGEAKFITYMNVFSNPIADLTMTESIEIDKKQKSVKAGDVFFTTSSETPDEVG 70 Query: 305 LRSA--QVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + S + + + S +P D YLA+++R+ K + G+ R ++ Sbjct: 71 MSSVMPEDADNIYLNSFCFGYRPTEKFDLNYLAYVLRADSFRKEMTFLAQGISRYNISKN 130 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 V + + VP I+EQ + +D L+ ++ + Sbjct: 131 KVMEVCIPVPTIEEQTKVGRY----FRNLDHLITLHQRKQERI 169 >gi|323965458|gb|EGB60913.1| type I restriction modification DNA specificity domain-containing protein [Escherichia coli M863] gi|327250265|gb|EGE61984.1| type I restriction modification DNA specificity domain protein [Escherichia coli STEC_7v] Length = 438 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 56/433 (12%), Positives = 115/433 (26%), Gaps = 54/433 (12%) Query: 25 WKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ V + TK ++G T +D I +I +E G K + + Sbjct: 5 WETVRLGDLTKWSSGGTPNKSEDSYWNGTIPWISASSME-GHLYSDSKLKITEDGLINGS 63 Query: 79 SIFAKGQILYGKLGPYLRK---AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + IL G L + +A + L + + + L Q Sbjct: 64 RLAPANSILLLVRGSILHQKIQVGLATKAVAFNQDVKCLIVNNDMIDPWYLLLWFKAKEQ 123 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + I E + + P+ + I+E I I IT L Sbjct: 124 DLLKIVESTGIGAGKLDTKLLMDYPVEIPPK--EIKEYIRFLGKAIFDKITLNENINYNL 181 Query: 196 KEKKQALVSYIVTKGLNPDVK----------------------------MKDSGIEWVGL 227 ++ Q L +P + K E L Sbjct: 182 EKMSQTLFKSWFVD-FDPVIDNALDVGNQIPEALQARAELRQKVRNSADFKPLPTEIRSL 240 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-- 285 P +E + + + + + + + T+ I Sbjct: 241 FPSEFEETELGWVPKGWEIGKLQDLLILQRGFDLPSTQRNIGLHPIIAASGYNGTHDIAM 300 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 V IV + + + + + + + + Y L++ D Sbjct: 301 VKAPGIVTGRSGVLGNVFLI----LEDFWPLNTTLWVKELKHATPCYGYELLKMIDFSSF 356 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G +L + L L+PP + + + V + ++ L Sbjct: 357 ---NGGSAVPTLNRNHIHNLDYLLPPRNL----IEKFELFSMSLYRQVHEFKKQAQTLTA 409 Query: 406 RRSSFIAAAVTGQ 418 R + + ++G+ Sbjct: 410 LRDTLLPKLISGE 422 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 57/187 (30%), Gaps = 16/187 (8%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G +PK W++ ++ L G S P S + T Sbjct: 250 LGWVPKGWEIGKLQDLLILQRGFDLPST------------QRNIGLHPIIAASGYNGTHD 297 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 +++ I+ G+ G +I + +T V + K P L ID Sbjct: 298 IAMVKAPGIVTGRSGVLGNVFLILEDFWPLNTTLWVKELKHATPCYGYELLKMIDF---- 353 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G+ + + I N+ +PP ++ ++ + L Sbjct: 354 SSFNGGSAVPTLNRNHIHNLDYLLPPRNLIEKFELFSMSLYRQVHEFKKQAQTLTALRDT 413 Query: 198 KKQALVS 204 L+S Sbjct: 414 LLPKLIS 420 >gi|197104448|ref|YP_002129825.1| type I restriction-modification system, S subunit [Phenylobacterium zucineum HLK1] gi|196477868|gb|ACG77396.1| type I restriction-modification system, S subunit [Phenylobacterium zucineum HLK1] Length = 487 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 51/415 (12%), Positives = 114/415 (27%), Gaps = 53/415 (12%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ W + G+ GL+ + +P G++ ++V + Sbjct: 64 PRGWLRARVGDLLDFQYGK-----------GLKASDREDAGPIPVYGSNGVVGFTSVPLT 112 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I+ G+ G + + + P L L ++D+ + + Sbjct: 113 RQPSIIVGRKGSAGALNLCTVPSWTTDVAYFIEVPSYFDFNYLFHALTALDLGTLGKGVK 172 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + + +PP+ EQ I KI D L R A Sbjct: 173 PG-----LSRSDAYALVLAVPPVGEQRRIVAKIDELMALCDELEAARTAREAARDRLAAA 227 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK-------------------------- 235 ++ + T ++ + + Sbjct: 228 SLARLNTPNPGTFQADARFALDALPALTARPGQIAQLRNTILSLAVRGGLSGNPAWSRQA 287 Query: 236 ----PFFALVTELNRKNTKLIESNILSLSYGNI----IQKLETRNMGLKPESYETYQIVD 287 F +L K+ +S + N+ + + + ++ Sbjct: 288 VRLGDFASLQNGYAFKSEWFSKSGTRLVRNANVGHGSLNWSDEVRLPDTMIHEFERFRLN 347 Query: 288 PGEIVFR---FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 G+IV + K + + + + ++ V +D++YL + S Sbjct: 348 EGDIVLSLDRPFIVSGTKVARVAKEDLPALLLQRVGRFVLSKELDASYLFLWINSPHFSA 407 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 S + + V+ + +PP EQ I + D L + + Sbjct: 408 QIDPGRSNGVPHISSKQVETAEIYLPPPAEQRRIAAEVERLMTICDELEASLTAA 462 >gi|209525707|ref|ZP_03274244.1| type I restriction-modification enzyme S subunit [Arthrospira maxima CS-328] gi|209493876|gb|EDZ94194.1| type I restriction-modification enzyme S subunit [Arthrospira maxima CS-328] Length = 125 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 14/111 (12%), Positives = 42/111 (37%), Gaps = 8/111 (7%) Query: 315 IITSAYMAVKPHGIDSTYLAWL--MRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVP 370 + +A + + T ++ + + G+ + ++ +++ V +P Sbjct: 14 LNQNAIIIRSKNFSQETQFFLYNSLKKPEYINHIEKIFRGNANQANITVKELLEFTVAIP 73 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P+ EQ I +V++ + +E+ + + + +TG+ L Sbjct: 74 PLAEQKAIASVLSYMDKE----IAALEKRRAKTEWIKKGMMQELLTGRKRL 120 >gi|70730331|ref|YP_260070.1| type I restriction-modification system, S subunit [Pseudomonas fluorescens Pf-5] gi|68344630|gb|AAY92236.1| type I restriction-modification system, S subunit, putative [Pseudomonas fluorescens Pf-5] Length = 551 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 59/409 (14%), Positives = 130/409 (31%), Gaps = 38/409 (9%) Query: 21 IPKHWKVVPIKRFTKLNTG-RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P WK V + +LN + E + + + TG L + + Sbjct: 3 LPPSWKEVGLLDVCELNPRIQRPEPETPVTFFPKSLITELTGPSLNSSIQAYGETSRQGV 62 Query: 80 IFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 +F G +L G +A + G+ + + P L ++ V + Sbjct: 63 LFKNGDVLIATRGRDAMQATIVSGLVTELGLAQYFLALRAGPQIRPAFLLHFIQQPWVQK 122 Query: 136 RIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 G + N+ +P+PPL EQ + + + + + Sbjct: 123 AALNTNRGTQSQLSIPLSFFKNLSIPVPPLQEQDYLIQLLQ------KASLEPYQDALNK 176 Query: 195 LKEKKQALVSYIVTKGLNP--DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + + AL ++ G ++K S I + +K Sbjct: 177 VIDLSDALALQLLVSGEKAQAWPRVKLSSICEF-------------SPAGAHPKKYQGPS 223 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + S + I E T V +++F + + E Sbjct: 224 RTELFSPRSFDHITGQVEPQRLKLEELPPTCAEVQADDVLFTLNQSFRSRGIAFAVTPDE 283 Query: 313 RG--IITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPV 367 + ++A+ ++P+ + YLA +R L + A + + +RL + Sbjct: 284 YATPLASAAFQVLRPNTKVLLADYLACFIRLSWLRQHVPASVLRSIPGRIYRSFFERLEL 343 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 +PP+ +Q I N++ +E+I ++ + + + A + Sbjct: 344 PLPPLDQQKPIVNLLRKVP------IERINDALETARRLGEAMLTEAFS 386 >gi|240047679|ref|YP_002961067.1| putative type-1 restriction enzyme specificity [Mycoplasma conjunctivae HRC/581] gi|239985251|emb|CAT05264.1| Putative type-1 restriction enzyme specificity [Mycoplasma conjunctivae] Length = 387 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 47/397 (11%), Positives = 113/397 (28%), Gaps = 41/397 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + + G T + K ++++ + D++S K +++ + Sbjct: 19 WRHRKLFEIGTIIAGNTPSTKIAEYYAKKGLMWVNILDIKSDITIDTQKKLSTK--GVAV 76 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + IL + R +I + + + L P ++ S + T+ + Sbjct: 77 AKVVPANSILCSVVAILGRNTLILEKSAL-NQALTALTPSKFYD-PYFLYIDSFNWTKSM 134 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + G+ + NI +P L EQ LI I + E Sbjct: 135 QNLGAGSLFQIVNKTDFSNITTLVPDLEEQQLIGNFFRKL-----NRILNTYQAKITKLE 189 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + ++ LN + + A + + + + Sbjct: 190 SIKNIL-------LNKMFVQPTNQPLIRFKDYNSLWKINILAELASIKKGEQVNRNDFVK 242 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + Y N G++P Y + I+ ++ + Sbjct: 243 NGKYPVW-------NGGIEPSGYYNKFNTEENTILIAEGGSTGFV------NFSKQKFWS 289 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV-PPIKEQF 376 + + TY + + +L+ + R+ + +EQ Sbjct: 290 GGHNYTLQNVKLDTYFLFYNLKNQQDFITSLKLGTALTNLQKHRLSRVFISFSRDFEEQQ 349 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I ID L+ E + ++ ++S + Sbjct: 350 KIA----KLFKNIDNLLNLYELKLQKIEIIKTSLLDK 382 >gi|254360725|ref|ZP_04976873.1| type I site-specific deoxyribonuclease, specificity subunit [Mannheimia haemolytica PHL213] gi|1685099|gb|AAC44667.1| HSDS [Mannheimia haemolytica] gi|153091295|gb|EDN73269.1| type I site-specific deoxyribonuclease, specificity subunit [Mannheimia haemolytica PHL213] Length = 442 Score = 69.8 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 62/443 (13%), Positives = 127/443 (28%), Gaps = 65/443 (14%) Query: 30 IKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +L + + K YI +++ G + + + + FAK IL+ Sbjct: 8 FSDIVELISEKIKIKDLKKENYISTDNMLPNFGGITLAENLPNSA---SCNRFAKKDILF 64 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVL---PELLQGWLLSIDVTQRIEAICEGAT 145 + Y +K +A+F G CS LV++ K+ E L + S D GA Sbjct: 65 SNIRTYFKKVWLAEFSGGCSPDVLVMRSKNTDILLNEYLFLLIRSDDFINFTVISANGAK 124 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV-- 203 M D + IP + Q A +I + + ++ Sbjct: 125 MPRGDKNAMKGFIFNIPSIEYQKKCIANYFAFDQKIQLNTQTNQTLEAIAQAIFKSWFVD 184 Query: 204 ---------------------------------SYIVTKGLNPDVKMKDSGIEWVGLV-- 228 S + ++ ++ G Sbjct: 185 FDPVRAKAAALSEGKSEHEANLAAMSVICGKDTSELNDTEYKALWQIAEAFPSEFGDEGL 244 Query: 229 PDHWEVKPFFALVTELNRKNTKLIE-----------SNILSLSYGNIIQKLETRNMGLKP 277 P W+ L K E I GN + + LK Sbjct: 245 PIGWKFNQADNLFDVGIGKTPPRKESEWFSDNANDTEWISIKDMGNQGLFITESSEYLKA 304 Query: 278 ESYETYQI--VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E+ +T+ I + ++ F I + + + S +L Sbjct: 305 EAVDTFNIKRIPENTVILSFKLTVGRVSITTKETTTNEAI--AHFKIPSSSNLSSEFLYC 362 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++++D + S + ++ + +K + +L P + I +I + + Sbjct: 363 YLKNFDFNNL--GSTSSIATAVNSKMIKEMEILEPSVLVINHFNEYIEGIFNKIKENIIQ 420 Query: 396 IEQSIVLLKERRSSFIAAAVTGQ 418 L + R + ++G+ Sbjct: 421 NNN----LSKIRDKLLPKLLSGE 439 Score = 43.6 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 20/195 (10%), Positives = 53/195 (27%), Gaps = 12/195 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTGKYLPKDGNSR 71 +P WK + G+T + D +I ++D+ + + Sbjct: 244 LPIGWKFNQADNLFDVGIGKTPPRKESEWFSDNANDTEWISIKDMGNQGLFITESSEYLK 303 Query: 72 QS--DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 DT + + ++ + + I + + + + Sbjct: 304 AEAVDTFNIKRIPENTVILS-FKLTVGRVSITTKETTTNEAIAHFKIPSSSNLSSEFLYC 362 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + + K I + + P + E I +I I + Sbjct: 363 YLKNFDFNNLGSTSSIATAVNSKMIKEMEILEPSVLVINHFNEYIEGIFNKIKENIIQNN 422 Query: 190 RFIELLKEKKQALVS 204 ++ + L+S Sbjct: 423 NLSKIRDKLLPKLLS 437 >gi|238855188|ref|ZP_04645509.1| HsdS [Lactobacillus jensenii 269-3] gi|238832217|gb|EEQ24533.1| HsdS [Lactobacillus jensenii 269-3] Length = 373 Score = 69.8 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 44/369 (11%), Positives = 100/369 (27%), Gaps = 46/369 (12%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK V + R K + +I I + + + + + + + Sbjct: 38 WKKVKLGRNVKRIRRKNKNLETNIPLTISAQFGLVDQRDFFGR--VVASENLANYILLKR 95 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G+ Y K +G ST ++ P+++ + L+ + + I Sbjct: 96 GEFAYNKSYSKEAPYGSIKRLEKYNEGALSTLYIAFTPENINSDFLKAFFDTTKWYSHIV 155 Query: 139 AICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +H + + + IP EQ I + + ++ Sbjct: 156 QVSTEGARNHGLLNISPQDFFEMSITIPKSDEQNNISRIYNLMNSLLSLQQRKLELEKQI 215 Query: 195 LKEKKQALVSY-IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 K + + + G +K K + P+ ++ N Sbjct: 216 FYALKTHIFAKDLFFNGQKDMIKYKLKDVS-NMYQPETITATQMSTNGYKVFGAN----- 269 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 Y ++ + + V Sbjct: 270 ------GYIGHYYNFNHKDDAIT----------------ICARGASTGAVNFVPGPVWIT 307 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 G S + + I+ Y + + + +L K G + L E + + V + PI Sbjct: 308 G--NSMVVDIDSKLINQLYFYYYLTTLNLKKYI---TGGAQPQLTKEILNGINVNLIPIN 362 Query: 374 EQFDITNVI 382 Q + N++ Sbjct: 363 IQIKVANIL 371 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 82/214 (38%), Gaps = 19/214 (8%) Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS-YG 262 ++ + L P V+ + W + V + RKN L + L++S Sbjct: 18 THADEQRLYPKVRFRGFDEPW--------KKVKLGRNVKRIRRKNKNLETNIPLTISAQF 69 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYM 321 ++ + + + E+ Y ++ GE + + ++ + G +++ Y+ Sbjct: 70 GLVDQRDFFGRVVASENLANYILLKRGEFAYNKSYSKEAPYGSIKRLEKYNEGALSTLYI 129 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVPPIKEQF 376 A P I+S +L + + + G R ++ +D + + +P EQ Sbjct: 130 AFTPENINSDFLKAFFDTTKWYSHIVQVSTEGARNHGLLNISPQDFFEMSITIPKSDEQN 189 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +I+ + N+ + L+ ++ + L K+ + Sbjct: 190 NISRIYNLM----NSLLSLQQRKLELEKQIFYAL 219 >gi|282934312|ref|ZP_06339582.1| type I restriction-modification system subunit [Lactobacillus jensenii 208-1] gi|281301596|gb|EFA93870.1| type I restriction-modification system subunit [Lactobacillus jensenii 208-1] Length = 372 Score = 69.8 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 44/369 (11%), Positives = 100/369 (27%), Gaps = 46/369 (12%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK V + R K + +I I + + + + + + + Sbjct: 38 WKKVKLGRNVKRIRRKNKNLETNIPLTISAQFGLVDQRDFFGR--VVASENLANYILLKR 95 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G+ Y K +G ST ++ P+++ + L+ + + I Sbjct: 96 GEFAYNKSYSKEAPYGSIKRLEKYNEGALSTLYIAFTPENINSDFLKAFFDTTKWYSHIV 155 Query: 139 AICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +H + + + IP EQ I + + ++ Sbjct: 156 QVSTEGARNHGLLNISPQDFFEMSITIPKSDEQNNISRIYNLMNSLLSLQQRKLELEKQI 215 Query: 195 LKEKKQALVSY-IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 K + + + G +K K + P+ ++ N Sbjct: 216 FYALKTHIFAKDLFFNGQKDMIKYKLKDVS-NMYQPETITATQMSTNGYKVFGAN----- 269 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 Y ++ + + V Sbjct: 270 ------GYIGHYYNFNHKDDAIT----------------ICARGASTGAVNFVPGPVWIT 307 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 G S + + I+ Y + + + +L K G + L E + + V + PI Sbjct: 308 G--NSMVVDIDSKLINQLYFYYYLTTLNLKKYI---TGGAQPQLTKEILNGINVNLIPIN 362 Query: 374 EQFDITNVI 382 Q + N++ Sbjct: 363 IQIKVANIL 371 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 82/214 (38%), Gaps = 19/214 (8%) Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS-YG 262 ++ + L P V+ + W + V + RKN L + L++S Sbjct: 18 THADEQRLYPKVRFRGFDEPW--------KKVKLGRNVKRIRRKNKNLETNIPLTISAQF 69 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYM 321 ++ + + + E+ Y ++ GE + + ++ + G +++ Y+ Sbjct: 70 GLVDQRDFFGRVVASENLANYILLKRGEFAYNKSYSKEAPYGSIKRLEKYNEGALSTLYI 129 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVPPIKEQF 376 A P I+S +L + + + G R ++ +D + + +P EQ Sbjct: 130 AFTPENINSDFLKAFFDTTKWYSHIVQVSTEGARNHGLLNISPQDFFEMSITIPKSDEQN 189 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +I+ + N+ + L+ ++ + L K+ + Sbjct: 190 NISRIYNLM----NSLLSLQQRKLELEKQIFYAL 219 >gi|225860523|ref|YP_002742032.1| type I restriction enzyme specificity protein [Streptococcus pneumoniae Taiwan19F-14] gi|225728021|gb|ACO23872.1| type I restriction enzyme specificity protein [Streptococcus pneumoniae Taiwan19F-14] Length = 426 Score = 69.8 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 66/192 (34%), Gaps = 7/192 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----R 406 G ++ + L + +PP+ EQ I I ++D E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKL 178 Query: 407 RSSFIAAAVTGQ 418 + S + A+ G+ Sbjct: 179 KKSILQYAMQGK 190 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 61/416 (14%), Positives = 124/416 (29%), Gaps = 69/416 (16%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALVSYIVTKGL 211 + + +PPL+EQ I E I + ++D R +L KE ++++ Y + L Sbjct: 132 LLLIALPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGKL 191 Query: 212 NPDVKMKDSGIEWV---------------------------------------------- 225 +S + Sbjct: 192 VEQDPNDESVEVLLEKIRAEKQKLFEEGKIKKKDLDISIVSQGDDNSYYGNKDETTSYPI 251 Query: 226 GLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGLK 276 +P+ W F +LV K + I +S ++ N + Sbjct: 252 YEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISKL 311 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + I G ++ F L II+ + I YL Sbjct: 312 ALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMIF 370 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I + +++ ++ L Sbjct: 371 LPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLLFQKVSQL 424 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 251 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 310 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 311 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 368 >gi|281420897|ref|ZP_06251896.1| type I restriction-modification system, S subunit [Prevotella copri DSM 18205] gi|281405189|gb|EFB35869.1| type I restriction-modification system, S subunit [Prevotella copri DSM 18205] Length = 373 Score = 69.8 bits (169), Expect = 8e-10, Method: Composition-based stats. Identities = 59/403 (14%), Positives = 127/403 (31%), Gaps = 34/403 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + ++ G+ + D Y P G+ ++ Sbjct: 2 EWKEDVLGNVLEVKYGKDHKKLADGQY--------------PVYGSGGLMRYVDSILYDG 47 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 IL + G + T F + D + + + + ++ G Sbjct: 48 PSILIPRKGTLNNIMFVDSPFWTVDTMFWSIINTDKVDPKFLFYSIC---KRDFASMNVG 104 Query: 144 ATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + +I + P +++Q I + +D I + L+E QA+ Sbjct: 105 SAVPSMTVNILNDIQISYPKNISDQRRIASIL----SSLDRKIELNNKINADLEEMAQAI 160 Query: 203 V-SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 ++ V D K DS + + + +V K+ K ++ L Sbjct: 161 FKNWFVDFEPFKDGKFVDSELGMIPEGWKVGSPYEYVKVVYGAPYKSAKFNDNGE-GLPL 219 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I + PE + V+ G+IV D + G++ Sbjct: 220 IRIRDLKDCNPQFYTPEILPQTEYVNMGDIV-----AGMDAEFVPHIWKGNTGLLNQRVC 274 Query: 322 AVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + P S + +L V L D+ + V++PP++ + + Sbjct: 275 KLMPQQTSISNLFVLYLMKPELEFVQSYKTGTTVSHLGKADIDKFVVVLPPLEVVEECSK 334 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 +++ RI + E I L R + + ++G+I++ Sbjct: 335 ILDSILQRIKNI--STESRI--LSTLRDTLLPRLMSGEIEVPE 373 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 71/199 (35%), Gaps = 16/199 (8%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLP 65 DS +G IP+ WKV + K+ G + +++G+ + I + D++ ++ Sbjct: 178 DSE---LGMIPEGWKVGSPYEYVKVVYGAPYKSAKFNDNGEGLPLIRIRDLKDCNPQFYT 234 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + G I+ + I G+ + + L P+ L Sbjct: 235 PEILPQTE------YVNMGDIV-AGMDAEFVPHIWKGNTGLLNQRVCKLMPQQTSISNLF 287 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 L + +++ G T+SH I + +PPL + + + RI + Sbjct: 288 VLYLMKPELEFVQSYKTGTTVSHLGKADIDKFVVVLPPLEVVEECSKILDSILQRIKNIS 347 Query: 186 TERIRFIELLKEKKQALVS 204 TE L L+S Sbjct: 348 TESRILSTLRDTLLPRLMS 366 >gi|325680252|ref|ZP_08159814.1| type I restriction modification DNA specificity domain protein [Ruminococcus albus 8] gi|324108069|gb|EGC02323.1| type I restriction modification DNA specificity domain protein [Ruminococcus albus 8] Length = 366 Score = 69.8 bits (169), Expect = 9e-10, Method: Composition-based stats. Identities = 36/392 (9%), Positives = 86/392 (21%), Gaps = 50/392 (12%) Query: 25 WKVVPIKR-FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ--SDTSTVSIF 81 W+ + ++ G + K VE+G + Sbjct: 19 WEQRKLGDEAIEILAGGDIDKSKT--------VENGKYPIYANALTNDGVVGYYDDYYRV 70 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G +V +I+ + + Sbjct: 71 KAPAVTVTGRGEVGFAQARM-----VDFTPVVRLLAIRSNHDCYFLENAINNHKVVVES- 124 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G NI P + E+ I E + I + + Sbjct: 125 TGVPQLTVPQLSSYNIFFP-KNVEEETRIGEFLHNLDSLITLHQRKLDK----------- 172 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 ++ + L + + V + + + Sbjct: 173 -LNKVKISMLGKMFPKNGADVPEVRFKGFTDSWEQRKLEEVITVGNGMDYKHLSEGDIPV 231 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + + + L + I + + + Sbjct: 232 YGMGGYMLSVDKALSYDKD--------------AIGIGRKGTIDKPYVLKAPFWTVDTLF 277 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 P S + + + S SL ++ V VP + EQ I Sbjct: 278 YCIPKEDYSLDFVYCI--FQNVNWKEKDESTGVPSLSKVNINSTDVKVPALAEQEKIGAY 335 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +++D L+ ++ + L+ + S + Sbjct: 336 ----FSKLDDLITLHQRKLEKLRNIKKSMLEK 363 >gi|332877051|ref|ZP_08444802.1| type I restriction modification DNA specificity domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684941|gb|EGJ57787.1| type I restriction modification DNA specificity domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 203 Score = 69.8 bits (169), Expect = 9e-10, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 77/187 (41%), Gaps = 10/187 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSY--GNIIQKLETRNMGLKPESYETYQIVD 287 + W + NR+N + S++ G Q +K E Y+I++ Sbjct: 19 EQWREMNLGDITENFNRRNKDRSSYPMYSVTNTSGFSPQNEIFDGKEIKDEDISIYKIIE 78 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVF 346 GE + + + S+ + +I+S Y+ +P IDS +L L++S + + Sbjct: 79 KGEFAYNP--ARINVGSIGRYDNEDLCMISSLYICFRPSENIDSDWLLHLLKSDHMIYQY 136 Query: 347 YAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G G+R L + + R+ V +PP++ Q I N +N D + + ++ Sbjct: 137 GLYGEGGVRIYLFYPNFSRIKVSLPPLEVQKRIANTLN----LFDKKICLETNLLNKFQK 192 Query: 406 RRSSFIA 412 ++ ++ Sbjct: 193 QKKHLLS 199 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 64/189 (33%), Gaps = 9/189 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN-SRQSDTSTVSIF 81 + W+ + + T+ R + + + + + DG + D S I Sbjct: 19 EQWREMNLGDITENFNRRNKDRSS-YPMYSVTNTSGFSPQNEIFDGKEIKDEDISIYKII 77 Query: 82 AKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 KG+ Y + D + + S+ ++ +P + + LL D Sbjct: 78 EKGEFAYNPARINVGSIGRYDNEDLCMISSLYICFRPSENIDSDWLLHLLKSDHMIYQYG 137 Query: 140 IC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + EG + + I + +PPL Q I + D I + +++ Sbjct: 138 LYGEGGVRIYLFYPNFSRIKVSLPPLEVQKRIANTLN----LFDKKICLETNLLNKFQKQ 193 Query: 199 KQALVSYIV 207 K+ L+S + Sbjct: 194 KKHLLSMMF 202 >gi|229822391|ref|YP_002883917.1| Restriction endonuclease S subunit [Beutenbergia cavernae DSM 12333] gi|229568304|gb|ACQ82155.1| Restriction endonuclease S subunit [Beutenbergia cavernae DSM 12333] Length = 405 Score = 69.8 bits (169), Expect = 9e-10, Method: Composition-based stats. Identities = 55/388 (14%), Positives = 130/388 (33%), Gaps = 38/388 (9%) Query: 44 SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS---TVSIFAKGQILYGKLGPYLRKAII 100 + + + +D+ G GK+ + G +T+ S+ +G +++ G +I Sbjct: 35 TESGVPVLRGQDI--GVGKHPQRSGTFVAPETARRLARSLVREGDLVFPHRGAIGEVGLI 92 Query: 101 ADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 D + + S+ + V + K L+ + + A G + I Sbjct: 93 GDDEFLLSSSMMKLTVDRSKAEPAFLMYYFRGPGRRELMMRASTVGTPGIAQPLASLREI 152 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 + +P L EQ I E + A +I L +S V + + Sbjct: 153 DLALPSLGEQRAIAEVLGALDDKIAANTKLAATADALA-------MSLFVRSLGSETREY 205 Query: 218 KDSGIEWV---GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + S + + G+ P + + +V G + R Sbjct: 206 EISEVADLVTRGITPSYVDGGSDATMVLGQR-------------CVRGQRVDLGPARWTD 252 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 P ++ +++ PG+++ + + R R E + + + + +T A Sbjct: 253 --PARVKSEKLLSPGDVLINSTGMGSLGRVGRWTYAREATVDSHVTLVRFNDAVVNTTFA 310 Query: 335 -WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + + A GS + L + R+ + VP + + ++ A + Sbjct: 311 GFALLRLEREIEVLAEGSTGQTELPRGSLARMKICVPSNENALPLAETLDALVA----MA 366 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDL 421 E++ L R + + ++G++ + Sbjct: 367 EQVRNEKQALAATRDALLPQLMSGKLTV 394 >gi|240948007|ref|ZP_04752425.1| hypothetical protein AM305_04463 [Actinobacillus minor NM305] gi|240297677|gb|EER48151.1| hypothetical protein AM305_04463 [Actinobacillus minor NM305] Length = 376 Score = 69.8 bits (169), Expect = 9e-10, Method: Composition-based stats. Identities = 57/388 (14%), Positives = 115/388 (29%), Gaps = 46/388 (11%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD-GNSRQSDTSTVSIFAKGQIL 87 + + +I + L+DV DTS I Sbjct: 4 KLGDLIE-----PYTKSCNIHNLTLDDVSGINRDKEFFSPAKQIGVDTSKYKIVPPNYFA 58 Query: 88 YGKLGPYLRKA-----IIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEA 139 + + D I S + + + KD L E L WL S + + Sbjct: 59 CNLMHVGRDIVLPISLNTTNKDKIVSPAYTIFKVKDETLLLSEYLFIWLKSDEKDRYFWL 118 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + W+ + NI + +PP+ Q A Sbjct: 119 FTDSSIRDGLSWEDMCNIELDLPPIEIQQKYVAVYQALLAN------------------- 159 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 GL + D IE + H N K + + + Sbjct: 160 ----QRAYETGLEDLKLVCDGYIEHL----QHHTELQRIGNYLNKEEINNKNGKYTLNDV 211 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN-DKRSLRSAQVMERGIITS 318 +I +K ++ S + Y +V P + + +N +K ++ +++S Sbjct: 212 KGISIQKKFIETKANMENVSLKPYLLVKPEYFAYVTVTSRNSEKITIAHNNSGNTYLVSS 271 Query: 319 AYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQ 375 +Y + + YLA + + R+ + D+ + + +P + Q Sbjct: 272 SYEVFSVNKAQLLPEYLALFFNRSEFDRYARFHSWGSAREVFSWADLCEVKIPIPELPVQ 331 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLL 403 I ++ V R + E+++Q I + Sbjct: 332 QAIVDIYKVLLER-RQINEQLKQQIKQI 358 >gi|261417778|ref|YP_003251460.1| N-6 DNA methylase [Geobacillus sp. Y412MC61] gi|319767409|ref|YP_004132910.1| N-6 DNA methylase [Geobacillus sp. Y412MC52] gi|261374235|gb|ACX76978.1| N-6 DNA methylase [Geobacillus sp. Y412MC61] gi|317112275|gb|ADU94767.1| N-6 DNA methylase [Geobacillus sp. Y412MC52] Length = 634 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 52/130 (40%), Gaps = 4/130 (3%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + + +V G+++ K ++ + I + +D Sbjct: 484 SYEITNNAKIESYLVQEGDVIISVRGA-GIKIAVIPPHEGDILISHNFIGIRPHRHVDPF 542 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDIT-NVINVETARI 389 YL + S + + +G ++ +D++ +P+ V P +EQ +I + + + I Sbjct: 543 YLKIFLESPVGQYLLLSKQAGTNVTILNMKDLENIPIPVRPFEEQKEIIMSYLEEQ-KHI 601 Query: 390 DVLVEKIEQS 399 +++++E+ Sbjct: 602 QDMMKQLEKQ 611 Score = 53.6 bits (127), Expect = 7e-05, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 67/177 (37%), Gaps = 12/177 (6%) Query: 27 VVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGT--GKYLPKDGNSRQSDTSTV 78 V P+KR G I L DV++G L + + + Sbjct: 437 VQPLKRIGTFYRGINISAKDAETENGPYKVIKLSDVQNGEVLIDQLASYEITNNAKIESY 496 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELL-QGWLLSIDVTQ 135 + +G ++ G ++ A+I +G + S F+ ++P + + +L S Sbjct: 497 -LVQEGDVIISVRGAGIKIAVIPPHEGDILISHNFIGIRPHRHVDPFYLKIFLESPVGQY 555 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G ++ + K + NIP+P+ P EQ I + E I ++ + + Sbjct: 556 LLLSKQAGTNVTILNMKDLENIPIPVRPFEEQKEIIMSYLEEQKHIQDMMKQLEKQR 612 >gi|154499003|ref|ZP_02037381.1| hypothetical protein BACCAP_02995 [Bacteroides capillosus ATCC 29799] gi|150271843|gb|EDM99069.1| hypothetical protein BACCAP_02995 [Bacteroides capillosus ATCC 29799] Length = 376 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 49/364 (13%), Positives = 98/364 (26%), Gaps = 44/364 (12%) Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKL----GPYLRKAII-ADFDGICSTQFLVLQP 116 +++P N +D S + +KG + L A+ D I S + + + Sbjct: 35 EFMPSVANVIGTDLSRYKLISKGLFACNPMHVGRDERLPIALYEKDSPAIVSPAYFMFEI 94 Query: 117 KDVL---PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 D E L W + + + +G+ W + I +P+P A Q I E Sbjct: 95 IDRDVLNEEYLMMWFRRPEFDRECWFMTDGSVRGGITWDDLCRIKLPVPSYARQCEIVES 154 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALV--SYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 A T RI E ++ + + +T L M+ E P Sbjct: 155 YRAITDRIALKRAENDNLAAQMRAYFKEYTANNASITGKLKDYSVMQYGYTETATTEPVG 214 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 + + N E ++ G++ Sbjct: 215 PKFLRITDIAQNYIDWNGVPYCP---------------------ISEGNHEKYVLSEGDV 253 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 V + + + + Y + S + Sbjct: 254 VVARTGATVGYAKMVGRNIPDSVFASFLVRIRPIDDEYRYYFGLAITSSEFLDFVQTNAG 313 Query: 352 G-LRQSLKFEDVKRLPVLVP---PIKEQF-DITNVINVETARIDVLVEKIEQSIVLLKER 406 G + + + +P + E I++ + ++E E I L E Sbjct: 314 GSAQPQANPPLLGEFELSIPNKQSLPEFNTKISSFLG--------VIESNETEISKLHEV 365 Query: 407 RSSF 410 + + Sbjct: 366 KDTM 369 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 58/184 (31%), Gaps = 7/184 (3%) Query: 29 PIKRFTKLNTGRTSESGKDI---IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 +K ++ + G T + + ++ + D+ + + ++G Sbjct: 193 KLKDYSVMQYGYTETATTEPVGPKFLRITDIAQNYIDWNGVPYCPISEGNHEKYVLSEGD 252 Query: 86 ILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 ++ + G + A + + S + D + S + ++ Sbjct: 253 VVVARTGATVGYAKMVGRNIPDSVFASFLVRIRPIDDEYRYYFGLAITSSEFLDFVQTNA 312 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ A+ +G + IP KI + I++ TE + E+ + Sbjct: 313 GGSAQPQANPPLLGEFELSIPNKQSLPEFNTKISSFLGVIESNETEISKLHEVKDTMVKM 372 Query: 202 LVSY 205 L S Sbjct: 373 LSSR 376 >gi|313896459|ref|ZP_07830010.1| conserved hypothetical protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312974883|gb|EFR40347.1| conserved hypothetical protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 459 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 44/421 (10%), Positives = 111/421 (26%), Gaps = 47/421 (11%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 V P+ + T + S + V + G + + G + Sbjct: 35 VEPLGKHLIHQTEKIQLSDYPDEDCTILGVSNKVGMF-DAGVKKGKKIKQKYHRVESGWL 93 Query: 87 LYGKLGPYLRKAIIAD---FDGICSTQFLVL-QPKDVLPELLQGWLLSIDVTQRIEAICE 142 Y + I S ++V + ++P+ L + S I+ Sbjct: 94 AYNPYRINVGSIGIKTADLKGDYISPAYVVFSCMETLIPQFLWLMMRSEYFNTLIKDSTT 153 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+ ++ + I PIPP+ EQ I + A + +++ F L Q+ Sbjct: 154 GSVRQTLSYEKLAAIEAPIPPIPEQEQILKVYHATIAAAEKSMSDGDDFSSGLLFDIQST 213 Query: 203 VSYIVTK-------------------------------GLNPDVKMKDSGIEWVGLVPDH 231 VS + + L+ S I + + Sbjct: 214 VSDLKEQDVSTATTSSILQIISYSSVSRWEVAFGLKEGKLDKVYNSFKSPIHTIAELTKE 273 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 ++ + ++ ++ G+ Sbjct: 274 SLFGLSIKASPTQKTGMIPMLRMPNIVDGALDLDDLKYLPRKTATTAREPDKWLLRKGDF 333 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVF--Y 347 + + + + S + + + Y+ L + Sbjct: 334 LINRTNSKELVGKSAVFNLDGDYTYASYVIRYRFDTSIVLPEYVNILFMLPLVRFQIDTM 393 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + + + ++ +++ + + +P I EQ +I + + ++ +E R Sbjct: 394 SRQTAGQCNINSDEIGSIRIPIPSISEQEEII-------KKYYSTKDGADKFYTKAEELR 446 Query: 408 S 408 Sbjct: 447 K 447 >gi|315127912|ref|YP_004069915.1| restriction modification system DNA specificity subunit [Pseudoalteromonas sp. SM9913] gi|315016426|gb|ADT69764.1| restriction modification system DNA specificity subunit [Pseudoalteromonas sp. SM9913] Length = 437 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 53/449 (11%), Positives = 127/449 (28%), Gaps = 71/449 (15%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +W + + G++ T +P G++ D S Sbjct: 2 NWIETTVGEYCPFVYGKSLPKT------------QRTEGDIPVFGSNGCVDYHNKSYVNG 49 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI-EAICE 142 I+ G+ G + + T F V + + L S+ + ++ Sbjct: 50 PGIIIGRKGSVGAVHLSVEPFWPIDTSFYVEKESIDELKFTYYLLKSLGLKGMNSDSAVP 109 Query: 143 GATMSH-----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER--------- 188 G + + + ++ +T + I + Sbjct: 110 GLNRENAHALPIRIPEKIQDREKLGQWISVYDSKIELNRQTNQTLEQIAQAIFKSWFVDF 169 Query: 189 ---------IRFIELLKEKKQALVSYIVTKGLNPD-------------------VKMKDS 220 E + A++S L+ + DS Sbjct: 170 DPVRAKIAANTAGENAQRAAIAVISGKNQAALDQLEQQYPAQYQQLQATADLFPDNLIDS 229 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE------TRNMG 274 G+ + + K + K ES I L ++ ++ + Sbjct: 230 GLGEIPDGWEVVGFKDIIRKYIDNRGKTPPTAESGIPLLEVKHLPDGSIKPSLNTSKYVD 289 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 ++ + ++ +I+ + + + V M + + ++ Sbjct: 290 IETFNSWFRAHLEAEDILISTVGTIG-RICMVPKGVKVAIAQNLLGMRFQREKVSPYFMY 348 Query: 335 WLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + M S A + ++ S+K +D++ + +L PP+ Q + +I + Sbjct: 349 YQMDSLRFRHDIDARLVVTVQASIKRKDLETIDLLAPPVALQNEFEKLILPFIE-----I 403 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ--ID 420 + QSI L R + + ++G+ ID Sbjct: 404 LQSNQSIE-LASTRDALLPKLLSGELSID 431 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 65/183 (35%), Gaps = 13/183 (7%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKL---NTGRTSESGK-DIIYIGLEDVESGTGKYLPKD 67 DSG +G IP W+VV K + N G+T + + I + ++ + G+ K Sbjct: 228 DSG---LGEIPDGWEVVGFKDIIRKYIDNRGKTPPTAESGIPLLEVKHLPDGSIKPSLNT 284 Query: 68 GNSRQSDTSTVS---IFAKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLP 121 +T IL +G R ++ + + + Q + V P Sbjct: 285 SKYVDIETFNSWFRAHLEAEDILISTVGTIGRICMVPKGVKVAIAQNLLGMRFQREKVSP 344 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + S+ I+A + K + I + PP+A Q + I+ + Sbjct: 345 YFMYYQMDSLRFRHDIDARLVVTVQASIKRKDLETIDLLAPPVALQNEFEKLILPFIEIL 404 Query: 182 DTL 184 + Sbjct: 405 QSN 407 >gi|229523505|ref|ZP_04412910.1| type I restriction-modification system specificity subunit S [Vibrio cholerae bv. albensis VL426] gi|229337086|gb|EEO02103.1| type I restriction-modification system specificity subunit S [Vibrio cholerae bv. albensis VL426] Length = 179 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 60/164 (36%), Gaps = 8/164 (4%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQNDKR 303 +K I S ++ ++ E ++ V P + + K Sbjct: 22 KKIADYWGGTIPWASVKDLKSRVLLNTEDSITELGVVKSATNVIPKGTIIVPTRMALGKV 81 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 ++ + + A + V I+ YLA + S G + + + +K Sbjct: 82 AITGCDMAINQDL-KALIIVDNKQINQCYLARFLESKSSFIESEGKG-ATVKGITLDFLK 139 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 L + +PP+ EQ I +++ + D + +K +Q+I L E R Sbjct: 140 SLEIPLPPLDEQKRIAAILD----KADAIRQKRKQAISLADEFR 179 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 52/162 (32%), Gaps = 6/162 (3%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+V + + G T G I + ++D++S S Sbjct: 2 SWQVKTLGELVTIKGGGTPSKKIADYWGGTIPWASVKDLKSRVLLNTEDSITELGVVKSA 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ KG I+ + AI I ++ + + I Sbjct: 62 TNVIPKGTIIVPTRMALGKVAITGCDMAINQDLKALIIVDNKQINQCYLARFLESKSSFI 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 E+ +GAT+ + ++ +P+PPL EQ I + Sbjct: 122 ESEGKGATVKGITLDFLKSLEIPLPPLDEQKRIAAILDKADA 163 >gi|210611274|ref|ZP_03288829.1| hypothetical protein CLONEX_01019 [Clostridium nexile DSM 1787] gi|210152038|gb|EEA83045.1| hypothetical protein CLONEX_01019 [Clostridium nexile DSM 1787] Length = 231 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 29/234 (12%), Positives = 69/234 (29%), Gaps = 16/234 (6%) Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + L+++ QAL + + +P+ ++ I +G V K Sbjct: 10 INDNLEQQAQALFQELFIENADPEW--REGTISDLGTVVGGSTPSK---------SKPEY 58 Query: 251 LIESNILSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 E I ++ + + K + G + + + + + A Sbjct: 59 YTEHGIAWITPKDLSVNKSKFITHGENDITELGLKNSSASIMPEGTVLFSSRAPIGYIAI 118 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + +V P T + L + + + +K +P + Sbjct: 119 AAGEVTTNQGFKSVIPRSAIGTPFVYYFLKNALPTIEGMASGSTFKEVSGSTMKIVPAFI 178 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 P + + I + +E+ L R S + ++G+ID+ Sbjct: 179 PDDET----LARFTEFCSPIFEQQQMLERQNQSLAALRDSLLPKLMSGEIDVSD 228 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 53/169 (31%), Gaps = 13/169 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYL---PKDGNSR 71 P+ W+ I + G T K I +I +D+ K++ D Sbjct: 32 PE-WREGTISDLGTVVGGSTPSKSKPEYYTEHGIAWITPKDLSVNKSKFITHGENDITEL 90 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ SI +G +L+ P IA + + F + P+ + + Sbjct: 91 GLKNSSASIMPEGTVLFSSRAPI-GYIAIAAGEVTTNQGFKSVIPRSAI-GTPFVYYFLK 148 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + IE + G+T + +P IP E + Sbjct: 149 NALPTIEGMASGSTFKEVSGSTMKIVPAFIPDDETLARFTEFCSPIFEQ 197 >gi|311742872|ref|ZP_07716680.1| type I restriction enzyme StySJI specificity protein [Aeromicrobium marinum DSM 15272] gi|311313552|gb|EFQ83461.1| type I restriction enzyme StySJI specificity protein [Aeromicrobium marinum DSM 15272] Length = 382 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 56/397 (14%), Positives = 113/397 (28%), Gaps = 27/397 (6%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 I + G + + L + + D S + + +I + Sbjct: 4 IGDLLVEFKEQ-PGKGDEPTVLTLTERNGFVRQADRFSKRLATEDVSKYKVVRRNEIAFN 62 Query: 90 KLGPYLRKAIIADF--DGICSTQFLVLQPKD-VLPELLQGWLLSIDVTQRIEAICEG--A 144 + +GI S + + +D P + LL+ + + I G Sbjct: 63 PYLLWAGAVAQNTIVDEGIISPLYPTFRVRDGHDPRYVARLLLTPQLIGAYDGIAFGSVP 122 Query: 145 TMSHADWKGIGNIP-MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + N+P +PPL EQ I + L Q++ Sbjct: 123 RRRRSSVHDFLNLPLANVPPLPEQRRIAAILDHADALRAKRRQALSHLNFLT----QSIF 178 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + T+ +P V + D G R G Sbjct: 179 SEMFTREPHPVVALGDIARIRGGKRLPKGASYAIGPTHHPYVRVTDL----------RGG 228 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 IQ + + + +D G+++ ++ + +A + Sbjct: 229 AIQSSNLCFLTPEVQRQIARYTIDEGDVIISIAGSIGLTAAVPATLAGANLTENAAKIVP 288 Query: 324 KP-HGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 K ++LA +++S L +G L +++L V +PP Q + Sbjct: 289 KDGQAYIGSWLARMLQSRSLQDQIAGKVGQVTIGKLALFRIEQLEVPLPPRALQEEFVER 348 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 AR++ + Q +S + A G+ Sbjct: 349 ----AARVEAVTAVARQESAAEDLLFASLQSRAFRGE 381 >gi|294790583|ref|ZP_06755741.1| type I restriction-modification system, S subunit [Scardovia inopinata F0304] gi|294458480|gb|EFG26833.1| type I restriction-modification system, S subunit [Scardovia inopinata F0304] Length = 410 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 66/385 (17%), Positives = 120/385 (31%), Gaps = 27/385 (7%) Query: 25 WKVVPIKR-FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + T + + + D ST Sbjct: 24 WEQRKLGDAMLEKVESVTPLRRNSYALWSVPAYTNSKPELATGDKIQ-----STKQRILD 78 Query: 84 GQILYGKLGPYLRKAIIADFDG------ICSTQFLVLQP--KDVLPELLQGWLLSIDVTQ 135 G IL K+ P + + + D I S ++++ + K + + L +L S Sbjct: 79 GDILLCKINPRINRVWVVDTGNLDTSSPIASLEWIIFRTSGKSMDRQFLVDFLSSPKFRN 138 Query: 136 RIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + G T + +P LAEQ I E +D LI R E Sbjct: 139 FLLSETIGVTGSQKRVQRNSVKEFMFHLPSLAEQSRIGE----LFKTLDNLIAATERKKE 194 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 LL++KKQA + I ++ L K +G V + + F L Sbjct: 195 LLQKKKQAYLQLIFSQHLRFKGFTKPWEQRKLGDVGNLYSGYAFPNSEQGGKNGILFLKV 254 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 S++ I K + + Y ++D I+F + R Sbjct: 255 SDMNLAGNELEITKAKNYVTNKQIAIYGWKPVIDLPAIIFAKVGAAIMLNRKRICTKPFL 314 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + P +D Y + D + G S+ D+ +L VP + Sbjct: 315 LDNNTMAYSPNPMNLDIAYTVSYFHTIDFSSLTRI---GAVPSIAGSDIAKLVAPVPCMS 371 Query: 374 EQFDITNVINVETARIDVLVEKIEQ 398 EQ + +D L++ ++ Sbjct: 372 EQSRVGE----LFKTLDELIKANDR 392 >gi|170718764|ref|YP_001783948.1| N-6 DNA methylase [Haemophilus somnus 2336] gi|168826893|gb|ACA32264.1| N-6 DNA methylase [Haemophilus somnus 2336] Length = 1110 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 55/187 (29%), Gaps = 7/187 (3%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E +G + H E + + ++ + + L Sbjct: 910 EIIGKIAPHIESGKRPSGGVGFIS-SGAYSLGGEHIHKDNGHLELKNIKFVPLTFFHEAE 968 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + G+I+ K +L ++ + + ++ YL ++ S Sbjct: 969 KGKIQKGDILLCKDGALTGKVALVRDELNDIFAMVNEHVFVIRCSQPETQQYLFHVLHSA 1028 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV---LVEKI 396 K+ A +G + L ++K + + +PP+ Q I + +E Sbjct: 1029 MGQKLLKANTTGAAQGGLNSSNLKNIRIPLPPLAIQQQIIAECQKIDQEYETSRMAIETY 1088 Query: 397 EQSIVLL 403 I + Sbjct: 1089 RAKIAQI 1095 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 62/193 (32%), Gaps = 23/193 (11%) Query: 30 IKRFT-KLNTGRTSESGKDIIY-----IGLEDVESGTGKYLPKDGNSRQSD---TSTVSI 80 I + + +G+ G I +G E + G K+ + Sbjct: 912 IGKIAPHIESGKRPSGGVGFISSGAYSLGGEHIHKDNGHLELKNIKFVPLTFFHEAEKGK 971 Query: 81 FAKGQILYGKLGPYLRKAI-----IADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVT 134 KG IL K G K + D + + V++ + L S Sbjct: 972 IQKGDILLCKDGALTGKVALVRDELNDIFAMVNEHVFVIRCSQPETQQYLFHVLHSAMGQ 1031 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + ++A GA + + NI +P+PPLA Q I + I + + Sbjct: 1032 KLLKANTTGAAQGGLNSSNLKNIRIPLPPLAIQQQIIAEC--------QKIDQEYETSRM 1083 Query: 195 LKEKKQALVSYIV 207 E +A ++ I Sbjct: 1084 AIETYRAKIAQIF 1096 >gi|314934936|ref|ZP_07842295.1| probable specificity determinant HsdS [Staphylococcus caprae C87] gi|313652866|gb|EFS16629.1| probable specificity determinant HsdS [Staphylococcus caprae C87] Length = 145 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 58/148 (39%), Gaps = 9/148 (6%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + L N G+ + + VD ++ F ++ I + K Sbjct: 4 KYLYKGNKGITEKGASKHVKVDKDTLIMSFKLTLGKLAIVKEPIYTNEAIC---HFVWKE 60 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 +++ Y+ + + S ++ G+ +L + + + V +P I+EQ I Sbjct: 61 SNVNTEYMYYYLNSINISTFGAQAVKGV--TLNNDAINSIIVKLPVIQEQNKIAYF---- 114 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++D L+EK + LLK+R+ F+ Sbjct: 115 FNKLDKLIEKQSSKVELLKQRKQGFLQK 142 >gi|301048308|ref|ZP_07195339.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 185-1] gi|300299816|gb|EFJ56201.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 185-1] Length = 439 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 63/201 (31%), Gaps = 9/201 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE---TRNMGLK 276 S E +PD WE + + + E I + I K + + Sbjct: 93 SEEEKPFELPDGWEWTTLTRIAEINPKIDVSDDEQEISFIPMPLISTKFDGSHEFEIKKW 152 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHGIDSTY 332 + + Y G+I I + ++ GI I+ Y Sbjct: 153 KDVKKGYTHFANGDIAIAKITPCFENSKAAIFSGLKNGIGVGTTELHVARPFSDIINRKY 212 Query: 333 LAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L +S + K GS ++ + + P+ PP++EQ I + D Sbjct: 213 LLLNFKSPNFLKSGESQMTGSAGQKRVPRFFFENNPIPFPPLQEQERIIIRFTQLMSLCD 272 Query: 391 VLVEKIEQSIVLLKERRSSFI 411 L ++ S+ ++ + + Sbjct: 273 QLEQQSLTSLDAHQQLVETLL 293 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 72/213 (33%), Gaps = 12/213 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESG 59 +K K P+ S + +P W+ + R ++N + +I +I + + + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPDGWEWTTLTRIAEINPKIDVSDDEQEISFIPMPLISTK 140 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK------AIIADFDGICSTQFLV 113 + + + FA G I K+ P + + + G+ +T+ V Sbjct: 141 FDGSHEFEIKKWKDVKKGYTHFANGDIAIAKITPCFENSKAAIFSGLKNGIGVGTTELHV 200 Query: 114 LQPKDVLPELLQ---GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 +P + + + + A N P+P PPL EQ I Sbjct: 201 ARPFSDIINRKYLLLNFKSPNFLKSGESQMTGSAGQKRVPRFFFENNPIPFPPLQEQERI 260 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + D L + + ++ ++ + L+ Sbjct: 261 IIRFTQLMSLCDQLEQQSLTSLDAHQQLVETLL 293 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 4/26 (15%), Positives = 9/26 (34%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG 45 +P+ W+ + G +S Sbjct: 389 ELPEGWEWCRLGSIYNFLNGYAFKSE 414 >gi|146321309|ref|YP_001201020.1| type I restriction-modification system, S subunit [Streptococcus suis 98HAH33] gi|145692115|gb|ABP92620.1| type I restriction-modification system, S subunit [Streptococcus suis 98HAH33] Length = 284 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 75/213 (35%), Gaps = 20/213 (9%) Query: 5 KAYPQY-----KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGL 53 K Y + K V + IP W+ V ++ + +G T +S + +I +I Sbjct: 65 KPYEKLADGTVKKVEVPY--EIPDSWEWVRLRNLGVITSGGTPKSSESTYYDGNITWITP 122 Query: 54 EDVESGTGKYL----PKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST 109 D+ + K S+ + +K I+Y P I ++D + Sbjct: 123 ADMGKQQNDKVFATSSKKITELGVQKSSAQLISKNSIVYSSRAPI-GHINIVNYDFTTNQ 181 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 + P V L + + T+ I G T G G+ +P+PPLAEQ Sbjct: 182 GCKSVTPILVN--LDFMYWILQFRTKDIILRSSGTTFKEISASGFGDTLLPLPPLAEQKR 239 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 I I +++ + EL + L Sbjct: 240 IVAHIERALEQVEVYAESYNKLQELDRAFPDKL 272 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 59/192 (30%), Gaps = 12/192 (6%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGLK 276 +E +PD WE L + K + NI ++ ++ ++ + Sbjct: 78 VEVPYEIPDSWEWVRLRNLGVITSGGTPKSSESTYYDGNITWITPADMGKQQNDKVFATS 137 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + + + R+ V +V P ++ ++ Sbjct: 138 SKKITELGVQKSSAQLISKNSIVYSSRAPIGHINIVNYDFTTNQGCKSVTPILVNLDFMY 197 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 W+++ + + + + + +PP+ EQ I I + VE Sbjct: 198 WILQ-FRTKDIILRSSGTTFKEISASGFGDTLLPLPPLAEQKRIVAHIERALEQ----VE 252 Query: 395 KIEQSIVLLKER 406 +S L+E Sbjct: 253 VYAESYNKLQEL 264 >gi|94263484|ref|ZP_01287296.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] gi|93456122|gb|EAT06265.1| Restriction modification system DNA specificity domain [delta proteobacterium MLMS-1] Length = 439 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 65/202 (32%), Gaps = 14/202 (6%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY-- 63 DS +G IP W V P+ +L G T ++ G +I + + D S + Sbjct: 227 DSE---LGEIPVGWGVKPLSDIIELVGGGTPKTKVPEYWGGNIPWFSVVDAPSDFDVWVI 283 Query: 64 -LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 K D S+ I G + G R A++ + + +QPK Sbjct: 284 ETEKHVTKLGVDNSSTKILPIGTTIISARGTVGRCALVGKPMAM-NQSCYGVQPKR-NYG 341 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L + D ++ G+ + I + +L E + +I Sbjct: 342 PLFINHMLRDQITSLQRSGHGSVFNTITRSTFKTIKIVDCGDRLSMLFDETVEPLLSKIL 401 Query: 183 TLITERIRFIELLKEKKQALVS 204 + E ++ L+S Sbjct: 402 ENLRENKVLMKTRDTLLPKLIS 423 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 59/412 (14%), Positives = 118/412 (28%), Gaps = 64/412 (15%) Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPKD 118 P G S D IF +L + G LR A +A+ + VLQ D Sbjct: 35 YPYYGASGIVDWVDSYIFDGSYLLLAEDGENLRTKSTPIAFLAEGKFWVNNHAHVLQGSD 94 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L + L + I++ G+T + IP+ P + I + Sbjct: 95 DLDTRFFCYALMVA---DIDSYISGSTRPKITQGDMKRIPLYAPEKEIRHAIAHILGTLD 151 Query: 179 VRIDTLITERIRFIELLKEKKQALV-------SYIVTKGL---------------NPDVK 216 +I+ +L + ++ V G NP+++ Sbjct: 152 DKIELNRQMNRTLEQLAQALFKSWFIDFDPVVYNAVQAGHPVPERFQATAERYRQNPEIQ 211 Query: 217 --------MKDSGIE--WVGLVPDHWEVKPFFALVT-----ELNRKNTKLIESNILSLSY 261 + S E +G +P W VKP ++ K + NI S Sbjct: 212 TLPQHILDLFPSHFEDSELGEIPVGWGVKPLSDIIELVGGGTPKTKVPEYWGGNIPWFSV 271 Query: 262 GNIIQKLE-TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + K + + + R A V + + + Sbjct: 272 VDAPSDFDVWVIETEKHVTKLGVDNSSTKILPIGTTIISARGTVGRCALVGKPMAMNQSC 331 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 V+P M + + + + ++ K + I + Sbjct: 332 YGVQPKRNYGPLFINHMLRDQITSLQRSGHGSVFNTITRSTFKTI-----------KIVD 380 Query: 381 VINVETARIDVLVE-KIEQSIVLLKE------RRSSFIAAAVTGQIDLRGES 425 + + D VE + + + L+E R + + ++G++ + Sbjct: 381 CGDRLSMLFDETVEPLLSKILENLRENKVLMKTRDTLLPKLISGELRIPDAE 432 >gi|301633697|gb|ADK87251.1| type I restriction modification DNA specificity domain protein [Mycoplasma pneumoniae FH] Length = 361 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 48/381 (12%), Positives = 95/381 (24%), Gaps = 40/381 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K IK + GR G+ V S + G D G+ Sbjct: 4 KTYKIKDICDITRGRVISKLDIKKDPGVFPVYSAATNNDGEFGRINSYDFD-------GE 56 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + Y + + +L+ K+ + + Sbjct: 57 YVTWTADGYGGAVFYRNGKFSITNLCGLLKVKNKEISSKY-LAHILKLEAPKFTNRVFKN 115 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 K + IP+ PPL Q I + T LL ++ + Sbjct: 116 RPKLTHKTMAEIPIDFPPLKIQEKIATILDTFTELRARKKQYAFYRDYLLNQENIRKIYG 175 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +P + I +N + Sbjct: 176 A--------------------NIPFETFQVKDICEIRRGRAITKAYIRNNPGENPVYSAA 215 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + +K ++ I N + + + + Sbjct: 216 TTNDGELGHIKDCDFDGEYI----------TWTTNGYAGVVFYRNGKFNASQDCGVLKVK 265 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + T L+ + K + + S R L + + + + PP++ Q I +++ Sbjct: 266 NKKICTKFLSLLLKIEAPKFVHNLAS--RPKLSQKVMAEIELSFPPLEIQEKIADILFAF 323 Query: 386 TARIDVLVEKIEQSIVLLKER 406 + LVE I I L K++ Sbjct: 324 EKLCNDLVEGIPAEIELRKKQ 344 >gi|325990097|ref|YP_004249796.1| hypothetical protein Msui07530 [Mycoplasma suis KI3806] gi|323575182|emb|CBZ40846.1| hypothetical protein Msui07530 [Mycoplasma suis] Length = 206 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 66/188 (35%), Gaps = 8/188 (4%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 + +G + + +L+ K+ + + + + R+ L S + Sbjct: 2 DKLGKISSGKPYDRKYEFNPKLHEKSIPFVG--VKEVGQSRLHILESDRHCFLNNLSKKG 59 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 ++ + + +L + + + + + ++ + + S Sbjct: 60 NKLFSKNTVCISIYGSYPGESALLKSDAF--LSTSVFAFSHYENISNPKFIKYCLDSQRK 117 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + +R++L + + PP +EQ I + ++ D L+E E+ I + Sbjct: 118 TFSSISATTTIRKALPTYQLLSIKFPCPPQEEQERIGDTLSA----YDELIENNERQIEV 173 Query: 403 LKERRSSF 410 L+ R++ Sbjct: 174 LQGVRTAI 181 >gi|217425678|ref|ZP_03457169.1| type I restriction modification DNA specificity domain protein [Burkholderia pseudomallei 576] gi|217391354|gb|EEC31385.1| type I restriction modification DNA specificity domain protein [Burkholderia pseudomallei 576] Length = 267 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 42/284 (14%), Positives = 83/284 (29%), Gaps = 26/284 (9%) Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M+ + + +P P + EQ I + + +L + ++ + Q L Sbjct: 1 MASLNQGVLARAKIPFPQIPEQSAIATALSDVDALLSSLEALIAKKHDIKQAAMQQL--- 57 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 L ++ EW + + + N Sbjct: 58 -----LTGKTRLPGFEGEWRHISAGELGYFRGGTGFPIAFQGEREGTYPFYKVSDMNNEG 112 Query: 266 QKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQ---NDKRSLRSAQVMERGIITS 318 K I PG IVF + KR L ++ + + Sbjct: 113 NKTFMVAANNWVSDDARRVIGATVFAPGSIVFAKVGAAVFLERKRILSKPSCIDNNM--A 170 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFD 377 AY+ + A L+ + + + SL + + +P+ VP I EQ Sbjct: 171 AYVIDETKASVPFIHAQLLA----KRFGDLVATTALPSLNGKVLAAMPLYVPSSIAEQIA 226 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I V++ A + +E + + + +TG+ L Sbjct: 227 IAEVLSDMDAEL----AALEARRDKTRLLKQGMMQELLTGKTRL 266 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 26/72 (36%), Gaps = 4/72 (5%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 SL + R + P I EQ I ++ A + L I + + + + Sbjct: 1 MASLNQGVLARAKIPFPQIPEQSAIATALSDVDALLSSLEALIAKKHD----IKQAAMQQ 56 Query: 414 AVTGQIDLRGES 425 +TG+ L G Sbjct: 57 LLTGKTRLPGFE 68 >gi|167854667|ref|ZP_02477447.1| restriction modification system DNA specificity domain [Haemophilus parasuis 29755] gi|167854204|gb|EDS25438.1| restriction modification system DNA specificity domain [Haemophilus parasuis 29755] Length = 164 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 15/140 (10%), Positives = 48/140 (34%), Gaps = 9/140 (6%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 Y I + ++ R+ + + + + ++ + ++ + Sbjct: 25 DYVKDYIFEGDYLLVSEDGANLLARNTPIAFSISGKNWVNNHVHVLKFNTYTERRFVEFY 84 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + + DL + L ++ + + PP +EQ I +++ + + E + Sbjct: 85 LNNIDLTPYI---SGASQPKLNKNNLSNIKIPAPPFEEQQRIVTILDKFETLTNSIAEGL 141 Query: 397 EQSIVLLKE----RRSSFIA 412 + I L ++ R ++ Sbjct: 142 PKEIELRRKQYEYYREKLLS 161 >gi|256832725|ref|YP_003161452.1| restriction modification system DNA specificity domain-containing protein [Jonesia denitrificans DSM 20603] gi|256686256|gb|ACV09149.1| restriction modification system DNA specificity domain protein [Jonesia denitrificans DSM 20603] Length = 398 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 17/110 (15%), Positives = 42/110 (38%), Gaps = 1/110 (0%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 E + + G+++F + + +P I+S +L +++ Sbjct: 75 EVIQRRSKLQAGDVLFSGTGTIGRTALVDQLPGDWNIKEGVYALTPRPDLIESRFLIYVL 134 Query: 338 RSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 S + A S+ ++R+ + VPP++ Q +I +++ T Sbjct: 135 HSSLVRNRILAQADGSTVASISMATLRRIRIPVPPLEVQREIVRILDQFT 184 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 55/410 (13%), Positives = 117/410 (28%), Gaps = 49/410 (11%) Query: 22 PKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKY----------LPKDGNS 70 P ++ I L TG + + G + + + ++ Sbjct: 13 PVGVELREIGDVITALRTGLNPRTNFKLNTPGSANFYVTVRELGGFVIRCSDKTDRVDDA 72 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ----FLVLQPKDVLPELLQG 126 S G +L+ G R A++ G + + L +P + L Sbjct: 73 GLEVIQRRSKLQAGDVLFSGTGTIGRTALVDQLPGDWNIKEGVYALTPRPDLIESRFLIY 132 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L S V RI A +G+T++ + I +P+PPL Q I + T L Sbjct: 133 VLHSSLVRNRILAQADGSTVASISMATLRRIRIPVPPLEVQREIVRILDQFTELEAELEA 192 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 E +E K + ++ G + E + T Sbjct: 193 ELEAELEARKRQYTHYRYSLI-----------------FGDTDNARERVRLKDVSTFKRG 235 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 + G +P +Y D +V Sbjct: 236 TA---------FTKRQARKGQYPVVANGPEPIAYHDEFNRDGEFLVIARSGAY---AGAV 283 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + + + + P +D Y L+ + GSG+ ++ +++ Sbjct: 284 TYWHGPTFLTDAFSIHPDPQHLDLRYAYHLLTAMQTELHGMKAGSGV-PHVRVREIEEQQ 342 Query: 367 VLVPPIKEQFDITNVINVETARIDVLV----EKIEQSIVLLKERRSSFIA 412 V +P + Q +++ ++ ++ + ++ + R + Sbjct: 343 VAIPSLIVQQNVSARLDDFDRLVNDISVGLPAELAARRKQYEYYRDKLLT 392 >gi|309809680|ref|ZP_07703536.1| conserved hypothetical protein [Lactobacillus iners SPIN 2503V10-D] gi|308170040|gb|EFO72077.1| conserved hypothetical protein [Lactobacillus iners SPIN 2503V10-D] Length = 164 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 19/158 (12%), Positives = 51/158 (32%), Gaps = 5/158 (3%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 Y + + + + +I +IV + +A + I S Sbjct: 1 MYTHFGIYATEPLKYISEDVAKKSKIAVKNDIVMAVTSENVEDVCKCTAWLGNENIAVSG 60 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDI 378 + A+ H ++ YL++ + + G + + + + + +P + EQ I Sbjct: 61 HTAIIHHNQNAKYLSYYFHTAMFFAQKKRLAHGTKVIEVTPNALNDIIIPLPSLAEQKRI 120 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 +++ + + + I K+ R + Sbjct: 121 VGILDRFDDFCNDISTGLPAEIEARKKQYEYYRDKLLN 158 >gi|270594534|ref|ZP_06221501.1| type I restriction-modification system S subunit [Haemophilus influenzae HK1212] gi|270318347|gb|EFA29502.1| type I restriction-modification system S subunit [Haemophilus influenzae HK1212] Length = 131 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 8/95 (8%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLP 65 KDSGV+WIG +P+HW+VV +KR K ++G + +I ++ + D KY+ Sbjct: 32 KDSGVEWIGQVPEHWEVVSMKRVVKEHSGNGFPIDLQGNNGNIPFLKVSDFSENQDKYIF 91 Query: 66 KDGNSRQSDTSTVS---IFAKGQILYGKLGPYLRK 97 K NS + I K I+ K+G LRK Sbjct: 92 KWNNSVTNKVIKQKKWNIVPKNSIVTAKIGEALRK 126 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 41/121 (33%), Positives = 57/121 (47%), Gaps = 10/121 (8%) Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + I LLKE KQ L+ VT+GLNPDV +KDSG+EW+G VP+HWEV +V E + Sbjct: 1 MAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWEVVSMKRVVKEHSG 60 Query: 247 --------KNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFI 296 N I +S N + + N + + + + IV IV I Sbjct: 61 NGFPIDLQGNNGNIPFLKVSDFSENQDKYIFKWNNSVTNKVIKQKKWNIVPKNSIVTAKI 120 Query: 297 D 297 Sbjct: 121 G 121 >gi|281424438|ref|ZP_06255351.1| type I restriction enzyme EcoAI specificity protein [Prevotella oris F0302] gi|281401437|gb|EFB32268.1| type I restriction enzyme EcoAI specificity protein [Prevotella oris F0302] Length = 308 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/279 (13%), Positives = 84/279 (30%), Gaps = 18/279 (6%) Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 E + I +P+P+P LAEQ I +I +V IDT+ + +K+ Sbjct: 34 ENLAGSTNQKELYIGVIERLPLPLPSLAEQQRIVSEIERWSVLIDTIEQGKENLETSIKQ 93 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 K ++ + L P + E + + E+ +L+ K N + Sbjct: 94 AKNKILDLAIHGKLVPQDPNDEPASELLKRINPKAEIACDNEHSRKLHSKGWVQCILNDV 153 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV------- 310 + N E ++ +++ + + + + Sbjct: 154 FTIIMGQSPDGNSINEKNGIEFHQGKLFFSQKKLLKSPFYTTSPIKIAKPNSLVLCVRAP 213 Query: 311 -------MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + I ++P+ + A+ + +S+ + Sbjct: 214 VGDINTLDRKICIGRGLCNLQPNSALNLDFAYYSMIQHKVSLENKATGSTFKSVSKNIIC 273 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + +PP+ EQ I I L+ IE + Sbjct: 274 KELFYLPPLAEQKRIVRKIKDLF----TLINLIEIELDK 308 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 40/91 (43%), Gaps = 2/91 (2%) Query: 330 STYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 S Y+ + S GS ++ L ++RLP+ +P + EQ I + I + Sbjct: 16 SEYVYAYVSSLSTQLYLEENLAGSTNQKELYIGVIERLPLPLPSLAEQQRIVSEIERWSV 75 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ID + + E +K+ ++ + A+ G+ Sbjct: 76 LIDTIEQGKENLETSIKQAKNKILDLAIHGK 106 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 47/160 (29%), Gaps = 2/160 (1%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W + + G++ + G+E + K S TS + I Sbjct: 143 KGWVQCILNDVFTIIMGQSPDGNSINEKNGIEFHQGKLFFSQKKLLKSPFYTTSPIKIAK 202 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ++ P D LQP L L + I +E Sbjct: 203 PNSLVLCVRAPV-GDINTLDRKICIGRGLCNLQPNSALN-LDFAYYSMIQHKVSLENKAT 260 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 G+T I +PPLAEQ I KI I+ Sbjct: 261 GSTFKSVSKNIICKELFYLPPLAEQKRIVRKIKDLFTLIN 300 >gi|332289030|ref|YP_004419882.1| Type I restriction modification DNA specificity domain protein [Gallibacterium anatis UMN179] gi|330431926|gb|AEC16985.1| Type I restriction modification DNA specificity domain protein [Gallibacterium anatis UMN179] Length = 361 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 52/375 (13%), Positives = 117/375 (31%), Gaps = 31/375 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ ++ F ++ TG+ + V G + + + F Sbjct: 16 WEKCKLENFVEITTGKLDANAM---------VNDGKYDFYTSGIKKFKINIPA---FTGP 63 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I G + +AD + + VL + + I + ++I A Sbjct: 64 AITIAGNGATVGFMHLADGEFNAYQRTYVLTKFSNSIREFLFYEIGIKLPRKISAEARTG 123 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + N+ + P + EQ I I + I L +AL+ Sbjct: 124 NIPYIVMDMLTNLDVFTPTVPEQQKIGNLFKQLDRLITLHKRKWDDVILLK----KALLQ 179 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + K + +++ D WE + +N K+T N + L Sbjct: 180 KMFPKNGSDFPEIRFP------EFTDAWEKCKLGE-IATINPKSTLPQTFNYVDLESVVG 232 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + L ++ G+I ++ + L E + ++ Y ++ Sbjct: 233 TEMRSYKIEKLYSAPSRAQRLAKYGDIFYQTVRPYQKNNYLFELD-DENYVFSTGYAQIR 291 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIK-EQFDITNVI 382 +L L+++ +G ++ D+K + + + EQ I N Sbjct: 292 SKIY-PYFLFTLIQNDRFVNEVLDNCTGTSYPAINATDLKNITIFISNNPIEQQKIGN-- 348 Query: 383 NVETARIDVLVEKIE 397 ++D L+ + Sbjct: 349 --LFKQLDRLITLHK 361 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 16/146 (10%), Positives = 39/146 (26%), Gaps = 4/146 (2%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + + + I E Y+ K Sbjct: 39 NDGKYDFYTSGIKKFKINIPAFTGPAITIAGNGATVGFMHLADGEFNAYQRTYVLTKFSN 98 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 +L + + K+ +G + + + L V P + EQ I N Sbjct: 99 SIREFLFYEIGIKLPRKISAEARTGNIPYIVMDMLTNLDVFTPTVPEQQKIGN----LFK 154 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAA 413 ++D L+ ++ + + + + Sbjct: 155 QLDRLITLHKRKWDDVILLKKALLQK 180 >gi|317490765|ref|ZP_07949219.1| hypothetical protein HMPREF1023_02919 [Eggerthella sp. 1_3_56FAA] gi|316910133|gb|EFV31788.1| hypothetical protein HMPREF1023_02919 [Eggerthella sp. 1_3_56FAA] Length = 457 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 75/211 (35%), Gaps = 12/211 (5%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 E +P+ WE +V + + S I S N Q+L ++ ++ + Sbjct: 9 CIDDEIPFDIPEGWEWARLGNIVYQRAQLKPTSAFSYIDIGSIDNAHQRLSSKETLIEAD 68 Query: 279 SYETYQI--VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAW 335 + V G++++ + + + I ++ + A+ GI + YL Sbjct: 69 KAPSRARKPVKLGDVLYSTVRPYLHNMCIVDRKFSLPPIASTGFAAMVCLDGISNGYLLN 128 Query: 336 LMRSYDLCKVFYA--MGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S D G+ ++ + + V VPP+ EQ I ++ + Sbjct: 129 YLMSPDFDTYANRTDNSKGVAYPAINDKHLYAALVPVPPLAEQRRIAERVSELMPLVGEH 188 Query: 393 VEKIEQSIVLL-----KERRSSFIAAAVTGQ 418 K+E L + R S + AV G+ Sbjct: 189 -GKLEDEREALDASLPERLRKSVLQMAVEGK 218 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 69/210 (32%), Gaps = 15/210 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD--TST 77 IP+ W+ + S YI + +++ + K+ + Sbjct: 17 DIPEGWEWARLGNIVYQRAQLKPTSA--FSYIDIGSIDNAHQRLSSKETLIEADKAPSRA 74 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVL-PELLQGWLLSID 132 G +LY + PYL I D I ST F + D + L +L+S D Sbjct: 75 RKPVKLGDVLYSTVRPYLHNMCIVDRKFSLPPIASTGFAAMVCLDGISNGYLLNYLMSPD 134 Query: 133 VTQRIEA--ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 +G + K + +P+PPLAEQ I E++ + Sbjct: 135 FDTYANRTDNSKGVAYPAINDKHLYAALVPVPPLAEQRRIAERVSELMPLVGEHGKLEDE 194 Query: 191 FI----ELLKEKKQALVSYIVTKGLNPDVK 216 L + +++++ V L P Sbjct: 195 REALDASLPERLRKSVLQMAVEGKLVPQDP 224 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 49/175 (28%), Gaps = 14/175 (8%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL---SYGNIIQKLETRNMGL 275 E +P+ WE + T + R + N Sbjct: 283 CIDGEIPFEIPEGWEWARLEGITTYIQRGKSPKYSLEKKYPVVAQKCNQWSGFSLERAKF 342 Query: 276 KP----ESYETYQIVDPGEIVFRFIDLQN---DKRSLRSAQVMERGIITSAY--MAVKPH 326 SY +++ G++++ L + + S + P Sbjct: 343 VDPNSVASYAEERLLVDGDLLWNSTGLGTLGRMAVYDSNQNPYGWAVADSHVTVIRTVPD 402 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDIT 379 + Y + V SG ++ L E VKR + VPP+ EQ I Sbjct: 403 WLRYEYAFLYFAGPSVQSVIEDQASGSTKQKELAQETVKRYLIPVPPLAEQRRIA 457 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 39/124 (31%), Gaps = 14/124 (11%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP+ W+ ++ T + G++ + K + + +G L + + + Sbjct: 291 EIPEGWEWARLEGITTYIQRGKSPKYSLEKKYPVVAQKC-NQWSGFSLERAKFVDPNSVA 349 Query: 77 TV---SIFAKGQILYGKL--GPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQG 126 + + G +L+ G R A+ + + V++ Sbjct: 350 SYAEERLLVDGDLLWNSTGLGTLGRMAVYDSNQNPYGWAVADSHVTVIRTVPDWLRYEYA 409 Query: 127 WLLS 130 +L Sbjct: 410 FLYF 413 >gi|126463983|ref|YP_001045096.1| restriction modification system DNA specificity subunit [Rhodobacter sphaeroides ATCC 17029] gi|126105794|gb|ABN78324.1| restriction modification system DNA specificity domain [Rhodobacter sphaeroides ATCC 17029] Length = 575 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 47/140 (33%), Gaps = 4/140 (2%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVF-RFIDLQNDKRSLRSAQVMERGIITSA 319 Y N I + L+ + + + +V R +E+ + + Sbjct: 407 YRNRIDLTNLKKFELQDGEVDKFGLQPFDILVVEGNGSATEIGRCAMWEGQIEQCVHQNH 466 Query: 320 YMAVKPHGID-STYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +P + S Y + S A+ S +L + +P+ +PP+ EQ Sbjct: 467 LIRCRPIDPNLSRYALLYLNSPLGMDEMTELAITSAGLYNLSVGKISTVPLPLPPLAEQH 526 Query: 377 DITNVINVETARIDVLVEKI 396 I ++ +D L + Sbjct: 527 RIVAKVDALMRLLDDLEAAL 546 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 44/122 (36%), Gaps = 4/122 (3%) Query: 278 ESYETYQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E + Y G++ I N K ++ G T+ V+P + Y+ Sbjct: 134 EIKKGYTHFAEGDVGLAKITPCFENGKSAVFRGLTGGFGAGTTELHIVRPIFVSPDYILT 193 Query: 336 LMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++S + + G+ ++ + E P +PP+ EQ I + A +D + Sbjct: 194 YLKSPQFIENGIPRMTGTAGQKRVPTEYFIGTPFPLPPLAEQHRIVAKVEELMALLDRIE 253 Query: 394 EK 395 Sbjct: 254 AA 255 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 56/199 (28%), Gaps = 13/199 (6%) Query: 22 PKHWKVVPIKRFTKLNTG--RTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P W+ ++ + G +T Y+G+ +V Q Sbjct: 367 PPRWRWTNLECLFAITGGIQKTPGRMPKANAFPYLGVGNVYRNRIDLTNLKKFELQDGEV 426 Query: 77 TVSIFAKGQILY----GKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELL--QGWL 128 IL G R A+ + + +P D Sbjct: 427 DKFGLQPFDILVVEGNGSATEIGRCAMWEGQIEQCVHQNHLIRCRPIDPNLSRYALLYLN 486 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + + E A + + I +P+P+PPLAEQ I K+ A +D L Sbjct: 487 SPLGMDEMTELAITSAGLYNLSVGKISTVPLPLPPLAEQHRIVAKVDALMRLLDDLEAAL 546 Query: 189 IRFIELLKEKKQALVSYIV 207 A + + Sbjct: 547 SASSTTRARLLDATLRAAL 565 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 66/200 (33%), Gaps = 7/200 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P +W I ++ +E ++ + + + + + + Sbjct: 82 LPANWAWSNIASLGSVSPRNEAEDDAMASFVPMTLIPTEIRAANGHEPRHWREIKKGYTH 141 Query: 81 FAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 FA+G + K+ P + G +T+ +++P V P+ + +L S Sbjct: 142 FAEGDVGLAKITPCFENGKSAVFRGLTGGFGAGTTELHIVRPIFVSPDYILTYLKSPQFI 201 Query: 135 QRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G + P P+PPLAEQ I K+ +D + R E Sbjct: 202 ENGIPRMTGTAGQKRVPTEYFIGTPFPLPPLAEQHRIVAKVEELMALLDRIEAARAGREE 261 Query: 194 LLKEKKQALVSYIVTKGLNP 213 A ++ + + Sbjct: 262 TRNRLTAATLARLTDPKADA 281 >gi|85716901|ref|ZP_01047866.1| putative specificity protein s [Nitrobacter sp. Nb-311A] gi|85696281|gb|EAQ34174.1| putative specificity protein s [Nitrobacter sp. Nb-311A] Length = 451 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 58/425 (13%), Positives = 118/425 (27%), Gaps = 56/425 (13%) Query: 44 SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY-----GKLGPYLRKA 98 G D + DV S +Y+PK + G IL K P R Sbjct: 35 RGTDFSAVRYGDVSSAPVRYIPKKA-------ADRKTLRPGDILIETAGGTKDQPTGRTV 87 Query: 99 IIA-------DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADW 151 + D C++ L+ L + + + + Sbjct: 88 YLNQRVFDMLDMPATCASFARFLRVNRELVDPNYLYWYLQSIYSTGAMFPYHIQHTGVAR 147 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL-VSYIVTKG 210 + + I A +D I R E L+ QA+ + + V G Sbjct: 148 FQYTDFAAQWRVPVPDREHQLAIAALLSSLDDKIELNRRTNETLEAMAQAIFLDWFVDFG 207 Query: 211 --------------------LNPDVKMKDS--GIEWVGLV--PDHWEVKPFFALVTELNR 246 +PD K + +G P+ W Sbjct: 208 PTRRKIDGATDPVEVMGGLVNDPDRARKLAALFPSELGEDGLPEGWSEGDLGHYAFLNPE 267 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLK---PESYETYQIVDPGEIVFRFIDLQNDKR 303 + + + + + +IV G+ + + N Sbjct: 268 SWSVRNAPHAIEYVDLANTKWGTIELTTVYRWSDAPSRARRIVRGGDTIVGTVRPGN--- 324 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMG-SGLRQSLKFED 361 S ++ ++ + A++P L +L S + + + G +++ + Sbjct: 325 GSYSYVGIDGLTASTGFAALRPKEKTMAPLVYLAATSVENIERLDKLADGGAYPAVRPDV 384 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 V + + P+ I + A + VE ++ +L R + ++G+I L Sbjct: 385 VLATNMPIVPLD----IVDGFASVCAPLITKVEHNKKENRILAATRDLLLPKLMSGEIRL 440 Query: 422 RGESQ 426 R + Sbjct: 441 RDAER 445 >gi|218516410|ref|ZP_03513250.1| Putative restriction-modification enzyme [Rhizobium etli 8C-3] Length = 112 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 17/105 (16%), Positives = 33/105 (31%), Gaps = 6/105 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P+ W + + ++G T G DI ++ D+E P+ Sbjct: 4 LPRGWVETTLGEIGEWSSGGTPSRARPDYYGGDIPWVKTGDLEDRVLLDTPEKITQAGLR 63 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 S+ +F G +L G + K + + P Sbjct: 64 NSSAKLFPSGTLLIAMYGATIGKTALLGIPAATNQACAAFVPSYH 108 >gi|319776232|ref|YP_004138720.1| putative type-1 restriction enzyme HindVIIP specificity protein [Haemophilus influenzae F3047] gi|329123369|ref|ZP_08251933.1| type I restriction/modification specificity protein [Haemophilus aegyptius ATCC 11116] gi|317450823|emb|CBY87045.1| Putative type-1 restriction enzyme HindVIIP specificity protein [Haemophilus influenzae F3047] gi|327470951|gb|EGF16406.1| type I restriction/modification specificity protein [Haemophilus aegyptius ATCC 11116] Length = 448 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 53/459 (11%), Positives = 116/459 (25%), Gaps = 78/459 (16%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 WK + GR ++G+ + ++++ +G GK + D ++ Sbjct: 2 SDWKEYKLGELATFYNGRAYKNGEFKTSGTPIVRIQNL-TGEGKTVYSDLQLDEN----- 55 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G ++Y I I + + + + + ++ ++ Sbjct: 56 KYIENGDLIYAWS-ATFGPYIWRGEKSIYHYHIWKIVCNEKIIDKFYFYYKLKLISDSLK 114 Query: 139 AICEGATMSHADWKGIGNIPM--------------------------------------- 159 G+ H + N + Sbjct: 115 DNGNGSIFIHITKSFMENFKIKIPSLEKQKYISNILSNLDKKIRFNTQINQTLEQIAQAL 174 Query: 160 ---PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 + + + E+ AL Sbjct: 175 FKSWFVDFDPVRAKVQALSEGMSLEQAELAAMQAISGKTPEELTALSQTQPDCYAELAET 234 Query: 217 MKDSGIEWVG----LVPDHWEVKPFFALVTELNRKNTKLIE-----------SNILSLSY 261 K E V VP WE KP L K E I Sbjct: 235 AKAFPCEMVEVDGVEVPKGWEYKPADELFDIGIGKTPLRKETEWFSTNPDDMQWISIKDM 294 Query: 262 GNIIQKLETRNMGLKPESYETYQI--VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 GN + + L ++ + + I + ++ F I + Sbjct: 295 GNSGVFITESSEFLTNQAVDKFNIRKIPENTVLLSFKLTIGRVSITTCETTTNEAI--AH 352 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + + YL + +D + S + ++ + +K + +L+P + Sbjct: 353 FKITDKSFLTTEYLYLFFQQFDFNSL--GSTSSIATAVNSKTIKGIEILIPNEELIKAFQ 410 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I+ A+I L + + L E R + + G+ Sbjct: 411 MKISNIFAQIKNLTIENKN----LVETRDLLLPRLLNGE 445 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 50/175 (28%), Gaps = 12/175 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDG-- 68 +PK W+ P + G+T + D+ +I ++D+ + Sbjct: 249 EVPKGWEYKPADELFDIGIGKTPLRKETEWFSTNPDDMQWISIKDMGNSGVFITESSEFL 308 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 ++ D + + +L + + I + + + D + Sbjct: 309 TNQAVDKFNIRKIPENTVLLS-FKLTIGRVSITTCETTTNEAIAHFKITDKSFLTTEYLY 367 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L + + + K I I + IP + KI +I Sbjct: 368 LFFQQFDFNSLGSTSSIATAVNSKTIKGIEILIPNEELIKAFQMKISNIFAQIKN 422 >gi|304373163|ref|YP_003856372.1| Restriction endonuclease S subunits [Mycoplasma hyorhinis HUB-1] gi|304309354|gb|ADM21834.1| Restriction endonuclease S subunits [Mycoplasma hyorhinis HUB-1] Length = 355 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/367 (10%), Positives = 109/367 (29%), Gaps = 26/367 (7%) Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI--LYGKLGPYLRKAIIADFDGIC 107 ++ ++++ GKY + + T K + L Y + Sbjct: 9 FVSKYEIQNNPGKYPVYSSQTTNNGTMGYISSYKYDLECLTWTTRGYAGVVFYRNEKFSV 68 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 S L++ ++++ ++ I + T + + + + + Sbjct: 69 SNSGLLIFKRNIIYNYRYFL-----FVFQMADIQKSMTAGNIPQFTVEMMKEAVLTYSNN 123 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 + + KI +D +I+ R + LL++ ++AL S I N ++ Sbjct: 124 LNEQRKISQLFYTLDKIISLYERKMSLLEKLQKALFSNIFVLNANNKPLIRFKSFFEFWE 183 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 + ++ ++ + S + + + Sbjct: 184 KNNISDLCKINRGNSKYTINYIQQNVGKFPVYSSQTQNEGISGNISTYDYDGE------- 236 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 + + S + + + +S +A + +T +L L + Sbjct: 237 -----YITWTMDGVNAGTVSYRNGKFNVSSSGVLAPNSNKNINT--KFLFYVLKLMNLNQ 289 Query: 348 AMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + +L + +EQ I + + ID ++++ + L+K Sbjct: 290 ENIGETIPHFTGSMMNKLEITFVKNRQEQNKIAD----LFSNIDSTHAQLKRKLNLIKNI 345 Query: 407 RSSFIAA 413 + S + Sbjct: 346 QKSVLNK 352 >gi|306826264|ref|ZP_07459598.1| type I restriction-modification system [Streptococcus sp. oral taxon 071 str. 73H25AP] gi|304431540|gb|EFM34522.1| type I restriction-modification system [Streptococcus sp. oral taxon 071 str. 73H25AP] Length = 191 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 61/165 (36%), Gaps = 7/165 (4%) Query: 246 RKNTKLIESNILSLSYGN---IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 K E I + G+ + + + E ++V G+ + Sbjct: 19 SKFITESEKGIPWIKIGDVEKDSKYVSKTKERITQAGSEKSRLVYKGDFIMSNSMSFGRP 78 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFED 361 L + G ++ ++ YL + + + + S G Q+L E Sbjct: 79 YILDIDGCIHDGWLS---ISSFEDLCSPDYLYHYLLTDTMQHMMRKNASNGTVQNLNAEI 135 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 V++L +++PP+ +Q +V++ + L E + + I L +++ Sbjct: 136 VRQLIIVLPPLSQQSQAVSVLDNFDTLTNSLSEGLPKEIELRQKQ 180 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 65/189 (34%), Gaps = 9/189 (4%) Query: 30 IKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 K+ G + ES K I +I + DVE + Q+ + + Sbjct: 3 FGAMAKIVRGASPRPISKFITESEKGIPWIKIGDVEKDSKYVSKTKERITQAGSEKSRLV 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 KG + + R I+ I + P+ L +LL+ + + Sbjct: 63 YKGDFIMSNSMSFGRPYILDIDGCIHDGWLSISSFEDLCSPDYLYHYLLTDTMQHMMRKN 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 T+ + + + + + + +PPL++Q + ++L + IEL +++ + Sbjct: 123 ASNGTVQNLNAEIVRQLIIVLPPLSQQSQAVSVLDNFDTLTNSLSEGLPKEIELRQKQYE 182 Query: 201 ALVSYIVTK 209 + Sbjct: 183 YWREQLFKF 191 >gi|304440529|ref|ZP_07400416.1| type I restriction-modification enzyme s subunit [Peptoniphilus duerdenii ATCC BAA-1640] gi|304371007|gb|EFM24626.1| type I restriction-modification enzyme s subunit [Peptoniphilus duerdenii ATCC BAA-1640] Length = 383 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 43/383 (11%), Positives = 105/383 (27%), Gaps = 39/383 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYI--GLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + I ++ + K Y+ GL + K + N + Sbjct: 14 EWKKIGDIKEIKVISPIKKIKKKEYLDEGLYPIIDQGQKLIVGYTNDENATFEKSKYVIF 73 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G + QF+ + + + +L S + I E Sbjct: 74 GD--------------HTESVKYIDFQFVQGADGIKVLKTNEEYLNSRYLYHAILNFYEM 119 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + +PIP + Q I + + T + L E ++ + + ++ Sbjct: 120 KGNYMRHFSLLKKTEIPIPSIETQEKIVKILDNFTEYVTELQVELQARVKQYEYYRDQIL 179 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S E++ + + + + L S Sbjct: 180 SR-----------------EYLCKTSEKIFNNYNNSFEKIKLKDIATITRGRRLVRSDLE 222 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + LKP Y + D + ++ Sbjct: 223 EKGRFPVFQNSLKPLGYYHMNNFSGDKTCLISAGAAGD----IFYAEEDFWAADDVFVID 278 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 ++ +L+ ++ K S + +++K++ +LVP I+ Q I +++ Sbjct: 279 SSSVVNKYIYYYLLNKQNMIKSKVRKAS--IPRISRDEIKKIEILVPTIELQKKIVEILD 336 Query: 384 VETARIDVLVEKIEQSIVLLKER 406 + + + Q I +++ Sbjct: 337 KFQSLVSETKGLLPQEIEQRQKQ 359 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 14/93 (15%), Positives = 36/93 (38%), Gaps = 8/93 (8%) Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++S YL + ++ K Y F +K+ + +P I+ Q I Sbjct: 96 VLKTNEEYLNSRYLYHAILNFYEMKGNYMR--------HFSLLKKTEIPIPSIETQEKIV 147 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +++ T + L +++ + + R ++ Sbjct: 148 KILDNFTEYVTELQVELQARVKQYEYYRDQILS 180 >gi|227893574|ref|ZP_04011379.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus ultunensis DSM 16047] gi|227864626|gb|EEJ72047.1| type I site-specific deoxyribonuclease specificity subunit [Lactobacillus ultunensis DSM 16047] Length = 373 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 57/390 (14%), Positives = 125/390 (32%), Gaps = 32/390 (8%) Query: 35 KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY 94 ++N S + I + + K + + KG Y K Sbjct: 2 RINRKNESLESTLPLTISAQYGLVKQNSFFNK--QVASKNLKNYILLRKGDFAYNKSYSK 59 Query: 95 LRKAI-----IADFDGICSTQFLVLQPKDVLPEL-LQGWLLSIDVTQRIEAICEGATMSH 148 G+ S+ ++ +P + + + + + EGA Sbjct: 60 DSPYGAIKRLNCYPKGVISSLYIAFKPNGINSKFLEIYYESDKWYKEIYKRAAEGARNHG 119 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + + ++ +EKI ++ LI + R + L++ KQAL YI Sbjct: 120 LLNISPHDFFDTLLKISTSKKEQEKIGILLSYVEKLILLQQRKLNDLEQIKQALEDYIFP 179 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 E L+ + + K R E+ + ++ Sbjct: 180 D-----------NNENRKLIFNKNKWKHKKIKDIFEERNIRDGKENLLTVSISKGVVPFN 228 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + Y++V G+I + + + + GI++ AY ++ Sbjct: 229 SMKREINSSSDKSNYKVVKIGDIAYNSMRMWQGACGVSKYD----GIVSPAYTVIRAKEH 284 Query: 329 DSTYLA-WLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVP-PIKEQFDITNVIN 383 ++ + ++ + +F GL +LKF +KR+ VL P KEQ ++ Sbjct: 285 ENALFYFYYFKNERMKFIFQKNSQGLTSDTWNLKFPLLKRITVLTPENEKEQIR----VS 340 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ ++V++ + I L + + Sbjct: 341 KLFNKVSLIVKQTGKEIAYLNLVKKFLLQK 370 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 50/156 (32%), Gaps = 7/156 (4%) Query: 32 RFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKL 91 + R + + I V + K + SD S + G I Y + Sbjct: 201 DIFEERNIRDGKENLLTVSISKGVVPFNSMK----REINSSSDKSNYKVVKIGDIAYNSM 256 Query: 92 GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL---SIDVTQRIEAICEGATMSH 148 + ++ +DGI S + V++ K+ L + + + + + + Sbjct: 257 RMWQGACGVSKYDGIVSPAYTVIRAKEHENALFYFYYFKNERMKFIFQKNSQGLTSDTWN 316 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + I + P ++ + K+ + I Sbjct: 317 LKFPLLKRITVLTPENEKEQIRVSKLFNKVSLIVKQ 352 >gi|167740612|ref|ZP_02413386.1| putative restriction modification system specificity subunit [Burkholderia pseudomallei 14] Length = 392 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 58/158 (36%), Gaps = 9/158 (5%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + ++E S + Y++V+ G++V+ L+ + A + GI+++ Y Sbjct: 68 GCVNQIEHLGRSYAGASVKEYRVVETGDLVYTKSPLKKSPFGVVKANKGKAGIVSTLYAI 127 Query: 323 VKPHGI-DSTYLAWLMRS-YDLCKVFYAMGSGLRQS---LKFEDVKRLPVLVPPIKEQFD 377 +P S Y + + L + ++ + V V+ P ++EQ Sbjct: 128 YRPKEGAHSAYFDYYFSLDHRLNAYLQPLVKKGAKNDMKVNNGVVLSGNVVAPKLEEQKR 187 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I + + +D + + LK ++ + Sbjct: 188 IADCL----TSLDERIAVESSKLDTLKVQKKGLMQRLF 221 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 49/383 (12%), Positives = 123/383 (32%), Gaps = 39/383 (10%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGK 62 +P+++ +G +W++ + F R + + +D++ + E + Sbjct: 24 RFPEFRKAG---------NWEIKKLSEFLIETKQRNRDLKYTPQDVLSVSGELGCVNQIE 74 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKD 118 +L + + G ++Y K P+ GI ST + + +PK+ Sbjct: 75 HLGRSYAGASVKE--YRVVETGDLVYTKSPLKKSPFGVVKANKGKAGIVSTLYAIYRPKE 132 Query: 119 VLPELLQGWLLSIDVTQRIEA----ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + S+D + + + + P L EQ I + + Sbjct: 133 GAHSAYFDYYFSLDHRLNAYLQPLVKKGAKNDMKVNNGVVLSGNVVAPKLEEQKRIADCL 192 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 +D I ++ LK +K+ L+ + + +++ G W+ Sbjct: 193 ----TSLDERIAVESSKLDTLKVQKKGLMQRLFPREGETVPRLRFPEFRDAGE----WQS 244 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN-MGLKPESYETYQIVDPGEIVF 293 + +L+ + E + + + + + K + V+ V Sbjct: 245 RKISSLLVRSVSPVSVDAEEVYQEIGIRSHGNGVFHKELVHGKALGDKRVFWVEENAFVV 304 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI---DSTYLAWLMRSYDLCKV--FYA 348 + ++ E+G+I S + D ++ + + + ++ + Sbjct: 305 NIVFAWEQ--AVAVTSEAEKGMIASHRFPMYKAKDGASDVNFIKYFFLTKEGKELLGIAS 362 Query: 349 MGSGLRQS-LKFEDVKRLPVLVP 370 G R L ++ + L L P Sbjct: 363 PGGAGRNRTLGQKEFENLEFLSP 385 >gi|148993497|ref|ZP_01822988.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP9-BS68] gi|147927866|gb|EDK78887.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP9-BS68] Length = 273 Score = 69.1 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 38/188 (20%), Positives = 74/188 (39%), Gaps = 9/188 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + V + ++ GA + + + + +I +P+PPLAEQ I E I + +++ + I Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVNNIAGRLIY 262 Query: 191 FIELLKEK 198 + L++ Sbjct: 263 YKMLMRNF 270 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 70/180 (38%), Gaps = 8/180 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ S + F ++ SG ++L + V + + +PP+ EQ I I +++ + Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVNNI 256 >gi|332087067|gb|EGI92201.1| type I restriction modification DNA specificity domain protein [Shigella boydii 3594-74] Length = 334 Score = 69.1 bits (167), Expect = 2e-09, Method: Composition-based stats. Identities = 50/371 (13%), Positives = 110/371 (29%), Gaps = 41/371 (11%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + ++ G+ ++ + V +G+ G D ++ + I+ Sbjct: 2 VKLGDVINVHYGKALKAD--------QRVSNGSVHVFGSSGIVGNHD---KTLCSYPTII 50 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G I T + V +L +L I + ++ Sbjct: 51 IGRKGSVGAITWAPSGGWIIDTAYYVEI--KDNNKLDLRYLFYILSGIDLTKKTITTSIP 108 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + + +PP EQ I + + + I + I+ + A + Sbjct: 109 GLNRDDLYDTFIKLPPFEEQKRIVDLLD-KAEGIRQKREQSIKLADDFLRATFATM---- 163 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 NP K + +G + + K+ + E + I Sbjct: 164 --YGNPITNPKKWPVHLMGEIIEFK--------GGNQPPKSDFIFEPKQGYIRLVQIRDF 213 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + P+ I + +++ + G A M P Sbjct: 214 KSDKYATYIPQEKAKR-IFEVDDVMIARYGPP-----VFQILRGLSGSYNVALMKASPKE 267 Query: 328 IDSTYL-AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +L++ + V + + + E + + V +PPI Q +I + + Sbjct: 268 NIRKGFIFYLLQLPEYHDVVVKNSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRL-- 325 Query: 385 ETARIDVLVEK 395 ARI+ EK Sbjct: 326 --ARIEKFKEK 334 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 19/110 (17%), Positives = 39/110 (35%), Gaps = 4/110 (3%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + I + G I V+ + L +L + + Sbjct: 46 YPTIIIGRKGSVGAITWAPSGGWIIDTAYYVEIKDNNKLDLRYLFYILSGIDLTKKTITT 105 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 L +D+ + +PP +EQ I ++++ + + + +K EQSI L Sbjct: 106 SIPGLNRDDLYDTFIKLPPFEEQKRIVDLLD----KAEGIRQKREQSIKL 151 Score = 40.9 bits (94), Expect = 0.43, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 51/156 (32%), Gaps = 4/156 (2%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 PK W V + + G I + + + I Sbjct: 171 PKKWPVHLMGEIIEFKGGNQPPKSDFIFEPKQGYIRLVQIRDFKSDKYATYIPQEKAKRI 230 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 F ++ + GP + + + G + + PK+ + + +LL + ++ Sbjct: 231 FEVDDVMIARYGPPVFQI-LRGLSGSYNVALMKASPKENIRKGFIFYLLQLPEYHDVVVK 289 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 A + + + + +P+PP+ Q I +++ Sbjct: 290 NSERTAGQTGVNLELLNKFNVPLPPIYYQDEILDRL 325 >gi|307067139|ref|YP_003876105.1| restriction endonuclease S subunit [Streptococcus pneumoniae AP200] gi|306408676|gb|ADM84103.1| Restriction endonuclease S subunit [Streptococcus pneumoniae AP200] Length = 249 Score = 69.1 bits (167), Expect = 2e-09, Method: Composition-based stats. Identities = 36/167 (21%), Positives = 64/167 (38%), Gaps = 9/167 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + V + ++ GA + + + + +I +P+PPLAEQ I E I Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIDQL 249 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 8/173 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 +++ S + F ++ SG ++L + V + + +PP+ EQ I I+ Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIDQL 249 >gi|171920743|ref|ZP_02931952.1| restriction-modification enzyme subunit s3b [Ureaplasma urealyticum serovar 13 str. ATCC 33698] gi|195867369|ref|ZP_03079373.1| restriction-modification enzyme subunit s3b [Ureaplasma urealyticum serovar 9 str. ATCC 33175] gi|171903490|gb|EDT49779.1| restriction-modification enzyme subunit s3b [Ureaplasma urealyticum serovar 13 str. ATCC 33698] gi|195660845|gb|EDX54098.1| restriction-modification enzyme subunit s3b [Ureaplasma urealyticum serovar 9 str. ATCC 33175] Length = 373 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 48/392 (12%), Positives = 118/392 (30%), Gaps = 42/392 (10%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + +K G T S + I +G Y+ ++ Sbjct: 3 IKLKDIIYAKRGSTITSNEFKINPGSYPLISASAQNNGVFGYI------------NYYMY 50 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ-RIEAI 140 G I G D S ++ + + + +I+++ Sbjct: 51 EGGHITISMNGNAGCVFYQKDKFSANSDVLVLSNIDNKISNNKFIFYWLKKHENTKIKSL 110 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 C+G T + N+ + +PP+ EQ I I I L T + + Sbjct: 111 CKGTTRLRLSNDDVLNLEINLPPIEEQNAIISIIEPIERIIKNLKTIKYKLET------- 163 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 +++ + ++ + + N ++ + N + + Sbjct: 164 -IMNNFFV-----VFYLFNNEENSNKYKLRNIGKFKGGISTLDKNNYDSGINFINYMDIY 217 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 +I + E IV G+++ ++ + S + + I + + Sbjct: 218 KNFVINDDIKLRLYNASEKDIKSYIVSYGDLLLTASSEIKEEIAFSSVYLSNKQAIFNGF 277 Query: 321 MAVKPHGID---STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF 376 + + + Y A+ RS K + +G R +L +D K + + + + Q Sbjct: 278 SKIYKYDQNILLPIYAAFYFRSEFFRKEVIKLATGYTRFNLSIKDAKNIEISINNFEFQK 337 Query: 377 DITNV------INVETARIDVLVEKIEQSIVL 402 + + ++ + +I+ ++ I Sbjct: 338 KFSKIVEPLLNLSTKANKIEKILNDSLLKITK 369 >gi|124010604|ref|ZP_01695218.1| type I restriction-modification system specificity subunit [Microscilla marina ATCC 23134] gi|123982204|gb|EAY23809.1| type I restriction-modification system specificity subunit [Microscilla marina ATCC 23134] Length = 362 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 11/204 (5%) Query: 225 VGLVPDHWEVKPFFALVTELNRK-------NTKLIESNILSLSYGNIIQKLETRNMGLKP 277 + +P+ W LV ++ +E I ++ +I + + Sbjct: 108 LPNLPEGWGWMKMGNLVKKIQIGPFGSQLHKHDYVEQGIPIINPKHIKDGYIFPSECITK 167 Query: 278 ESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 ++ I++ +I+ + S + S Y+ + ++ A Sbjct: 168 AKVDSLPQYILNMNDIILGRRGEMGRAALISSKENGWFCGTGSLYIRFT-NFFEAKLYAL 226 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 ++ + GSG +L + LP+ V P+ EQ I I + D + Sbjct: 227 ILGERRVIHYLEKKGSGTTMTNLNLGILNNLPIQVIPLPEQHQIVQEIESRLSVCDQVEA 286 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 I+ + + R S + A G+ Sbjct: 287 SIQTGLAKAEALRQSILKKAFEGR 310 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 72/205 (35%), Gaps = 12/205 (5%) Query: 18 IGAIPKHWKVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGN 69 + +P+ W + + K+ G + I I + ++ G + + Sbjct: 108 LPNLPEGWGWMKMGNLVKKIQIGPFGSQLHKHDYVEQGIPIINPKHIKDGYI-FPSECIT 166 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELLQG 126 + D+ I I+ G+ G R A+I + + + +L Sbjct: 167 KAKVDSLPQYILNMNDIILGRRGEMGRAALISSKENGWFCGTGSLYIRFTNFFEAKLYAL 226 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L V +E G TM++ + + N+P+ + PL EQ I ++I + D + Sbjct: 227 ILGERRVIHYLEKKGSGTTMTNLNLGILNNLPIQVIPLPEQHQIVQEIESRLSVCDQVEA 286 Query: 187 ERIRFIELLKEKKQALVSYIVTKGL 211 + + +Q+++ L Sbjct: 287 SIQTGLAKAEALRQSILKKAFEGRL 311 >gi|326202975|ref|ZP_08192842.1| restriction modification system DNA specificity domain [Clostridium papyrosolvens DSM 2782] gi|325987052|gb|EGD47881.1| restriction modification system DNA specificity domain [Clostridium papyrosolvens DSM 2782] Length = 479 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 58/176 (32%), Gaps = 5/176 (2%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE----TYQIVDPGEI 291 K I + ++ ++ + V PG++ Sbjct: 43 KVVDGPFGTQLKVEDYRSEGIPVIRVSDVKTGEIPDEGLVRISPDKQRELKRSRVLPGDV 102 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + ++ +++E I + + + I+ +YL +++S K Y G+ Sbjct: 103 ILTKAGAILGYSAVFPERLVEGNITSHSVTIRCKNNINPSYLKHILKSTIGNKQIYRWGN 162 Query: 352 -GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 R L +VKR+ + VP + Q +I +++ +Q + + Sbjct: 163 KSTRPELNTGEVKRILIPVPDLDIQNEIVALMDSAHVSRKSKENDAQQLLASIDNY 218 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 57/411 (13%), Positives = 125/411 (30%), Gaps = 50/411 (12%) Query: 44 SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 + I I + DV++G + + S G ++ K G L + + Sbjct: 59 RSEGIPVIRVSDVKTGEIPDEGLVRISPDKQRELKRSRVLPGDVILTKAGAILGYSAVFP 118 Query: 103 FD----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIP 158 I S + ++ P L+ L S ++I +T + + I Sbjct: 119 ERLVEGNITSHSVTIRCKNNINPSYLKHILKSTIGNKQIYRWGNKSTRPELNTGEVKRIL 178 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV--------SYIVTKG 210 +P+P L Q I + + V + + + + + + + + + Sbjct: 179 IPVPDLDIQNEIVALMDSAHVSRKSKENDAQQLLASIDNYVLSQLGIQLPQPKENTLAER 238 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT--------------------- 249 K +G + + F A+ + K Sbjct: 239 TFFTPFKKVTGSRFDPKKYSKFYQDLFAAVESCALDKAELRVLITHQASGDWGLDTKEVT 298 Query: 250 ---KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQND- 301 E ++ + + L RN +K ++ V G+++ +D Sbjct: 299 NPNDYTECTVIRATEFDNQYNLNLRNDRIKLRCINNRKLGRMDVQKGDLLIEKSGGSDDQ 358 Query: 302 ---KRSLRSAQVMERGII---TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-- 353 + + ++ G I + +DS Y+ +++ K+ AM S Sbjct: 359 PVGRIGIIDTDILGVGNIAYSNFVHKIRIRDDVDSRYIFQFLKTMHNNKLTDAMQSQTNG 418 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 ++L + + +P +Q +I + AR L + Q I K Sbjct: 419 IRNLIMSEYLHQLIPLPLRSKQEEIAEHVADIRARAKSLQLEAAQEIEEAK 469 >gi|238854456|ref|ZP_04644796.1| type I restriction-modification system, S subunit [Lactobacillus jensenii 269-3] gi|282932601|ref|ZP_06338022.1| type I restriction-modification system, S subunit [Lactobacillus jensenii 208-1] gi|313472060|ref|ZP_07812552.1| type I restriction-modification system, S subunit [Lactobacillus jensenii 1153] gi|238832949|gb|EEQ25246.1| type I restriction-modification system, S subunit [Lactobacillus jensenii 269-3] gi|239530089|gb|EEQ69090.1| type I restriction-modification system, S subunit [Lactobacillus jensenii 1153] gi|281303297|gb|EFA95478.1| type I restriction-modification system, S subunit [Lactobacillus jensenii 208-1] Length = 388 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 54/397 (13%), Positives = 124/397 (31%), Gaps = 29/397 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK V + ++ T + + S + E G + +T + Sbjct: 14 WKKVKLGEISEKITQKNNNSCSQFPVLTNSA-EYGIVYQKDFFDKNIAINTDNYYVVHTE 72 Query: 85 QILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +Y PY + G+ S + + + KD +L + + Sbjct: 73 DFVYNPRISKQAPYGPIRVNHLKTGVMSPLYYIFKIKDDFNIGFFEFLFIGNKWHKFMYQ 132 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + + +P Q + +K+I E I+ I + + E Sbjct: 133 NGDSGARSDRYAIKDKVFNKLPIYIPQKIEEQKLIFE---INHKINSLLYLQQRKLELIS 189 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 AL K G + + + ++ N L Sbjct: 190 AL--------------EKGLGQIIKQQNNKYGITFSLNNFLEIPPQIQARIKNKNQLLTV 235 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N+ Y I GE++F ++ N +L + + + ++ Sbjct: 236 KLNLQGLARGVQRDTLSLGSTKYFIRHTGELIFGKQNIFNGSIALITKE-FDGLATSNDV 294 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV-LVPPIKEQFDI 378 ++K I+ +L +L+++ D K + +G + + D+ +L + ++P K Q I Sbjct: 295 PSLKISNINPQFLFYLLKNPDFWKHTELIATGTGSKRVHIHDLLKLHIKIIPDAKYQAKI 354 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + ++ +I + + I + K+ + Sbjct: 355 VS-LSRNFEKIVLNQQIIVKECEKTKQF---LLQNLF 387 >gi|224418075|ref|ZP_03656081.1| restriction modification enzyme [Helicobacter canadensis MIT 98-5491] gi|253827404|ref|ZP_04870289.1| restriction-modification enzyme [Helicobacter canadensis MIT 98-5491] gi|313141612|ref|ZP_07803805.1| restriction modification enzyme [Helicobacter canadensis MIT 98-5491] gi|253510810|gb|EES89469.1| restriction-modification enzyme [Helicobacter canadensis MIT 98-5491] gi|313130643|gb|EFR48260.1| restriction modification enzyme [Helicobacter canadensis MIT 98-5491] Length = 1322 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 39/419 (9%), Positives = 122/419 (29%), Gaps = 52/419 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++V ++ K+ +T I +++ G Y N + + + Sbjct: 898 ELVKLESICKMYQPKT---------ITAKEILEK-GDYKVYGANGVIGFYNQYNH-KDSE 946 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + G + + + +++ P + + + + + I+++ G+ Sbjct: 947 VAMTCRGATCGAINYTEPNSWITGNAMIITPLEKNLISKKFLVYILPL-SNIKSVITGSA 1005 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID-------------TLITERIRFI 192 + + +P+PPL Q I + + + + I Sbjct: 1006 QPQITRNNLATLKIPLPPLEIQKQIVAECESLESQCNTIEQSIKAYQELIKAILWHCGIT 1065 Query: 193 ELLKEKKQALVSYI--VTKGLNPDVKMKDSG------------IEWVGLVPDHWEV---- 234 + +++ + + L+ ++ K + + P + Sbjct: 1066 TESTKDFDSILMSLAELESKLDFELLGKTKQDSKAFLQNLTNTLNTLPTPPSNGWEKAKL 1125 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--QIVDPGEIV 292 + E + E + + N + T +I ++ Sbjct: 1126 CKICNINQETYNPSNDEGEMLYIDIDSIEKGTGKINFNDKISCRKLPTRARRIARADSVI 1185 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAM 349 + + ++ + T + + + + + ++ M Sbjct: 1186 ISTVRPYLKGFAYLKNEIKDSIFSTGFAILQGKENLVKSQFVYYCFMFSDDLMQQMKIKM 1245 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 S+ ED++ + +PP++ Q I I I + ++ ++ LL+ ++ Sbjct: 1246 PKSSYPSINTEDLESFTIPLPPLEIQTKIAQSIET----IQSQISFLDSALPLLQSQKQ 1300 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 32/304 (10%), Positives = 76/304 (25%), Gaps = 25/304 (8%) Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + + K + E ++ + + + I Q Sbjct: 781 GQEGIHYFMKSGVVENNINYIDTPLFNPNNRFCVNSISFAILSHFVKYLDSKDIDANFLQ 840 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 +R++ + E +A+ LNP +S ++ Sbjct: 841 QFLRQEKNNKNNEFLESARLIDMIDFEKVEFNKAI-------SLNPHSNDSNS-VQSNPF 892 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 +E+ ++ K E + + E Sbjct: 893 ANSKYELVKLESICKMYQPKTITAKEILEKGDYKVYGANGVIGFYNQYNHKDSE------ 946 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 + IT M + P + +L+ L + Sbjct: 947 ---VAMTCRGAT----CGAINYTEPNSWITGNAMIITPLEKNLISKKFLVYILPLSNIKS 999 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + + + ++ L + +PP++ Q I ++ + IEQSI +E Sbjct: 1000 VITGSAQPQITRNNLATLKIPLPPLEIQKQIVAECESLESQCNT----IEQSIKAYQELI 1055 Query: 408 SSFI 411 + + Sbjct: 1056 KAIL 1059 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 77/201 (38%), Gaps = 9/201 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKY-LPKDGNSRQSDTSTVS 79 W+ + + +N + S +++YI ++ +E GTGK + R+ T Sbjct: 1118 NGWEKAKLCKICNINQETYNPSNDEGEMLYIDIDSIEKGTGKINFNDKISCRKLPTRARR 1177 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQGWLLSI---DV 133 I ++ + PYL+ + I ST F +LQ K+ L + + + D+ Sbjct: 1178 IARADSVIISTVRPYLKGFAYLKNEIKDSIFSTGFAILQGKENLVKSQFVYYCFMFSDDL 1237 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 Q+++ ++ + + + + +P+PPL Q I + I +I L + Sbjct: 1238 MQQMKIKMPKSSYPSINTEDLESFTIPLPPLEIQTKIAQSIETIQSQISFLDSALPLLQS 1297 Query: 194 LLKEKKQALVSYIVTKGLNPD 214 +E + + Sbjct: 1298 QKQEVLKKYLFKTFLDRFTKQ 1318 >gi|269978334|gb|ACZ55901.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 420 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 48/414 (11%), Positives = 116/414 (28%), Gaps = 42/414 (10%) Query: 22 PKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + + + T + + + ++ Y + N Q+ Sbjct: 13 PKGVEFRKLGEVLEYDQPNKYCVTSKEFDKSYPTPVLTAG--KTFILGYTNEKDNIYQAS 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ +I + + S+ +L K+ + + Sbjct: 71 KSSPAIIFDD--------FTTATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFYM---Q 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + +PIPPL Q I + + A T L TE + Sbjct: 120 TIPYNIGGEHARHWISRYSQ--LEVPIPPLEIQQEIVKILDAFTELNTELNTELNTELNT 177 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + + + + + + + L L Sbjct: 178 ELNTELNTELNA----RKKQYQYYQNMLLDFNDINSNHKDAKIKSYPKRLKTLLHTLAPK 233 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF-----------RFIDLQNDKR 303 + G + + + + + K + V G I F I + Sbjct: 234 GVEFRKLGEVCEIIRGKRVTKKEILDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGT 293 Query: 304 SLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + ++ +V P + YL +++ + + S + S+ ++ Sbjct: 294 AGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNI 353 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ + +PP++ Q +I +++ + L+ I I K+ R + Sbjct: 354 MQITIPIPPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYREKLLT 407 >gi|282877247|ref|ZP_06286081.1| type I restriction modification DNA specificity domain protein [Prevotella buccalis ATCC 35310] gi|281300633|gb|EFA92968.1| type I restriction modification DNA specificity domain protein [Prevotella buccalis ATCC 35310] Length = 242 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 67/210 (31%), Gaps = 6/210 (2%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W+ ++ GR + + + + + G + + S Sbjct: 14 EIPQGWEWCRMQDVITFVNGRAYKKEELLSRGKYKVLRVGNF-FTNNQWYYSDLELSEDK 72 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G +LY I I ++ + + + ++ A Sbjct: 73 YCYHGDLLYAWS-ASFGPQIWNGDKTIFHYHIWNVKFDTKVLFREYLYYFFLFDKTQVRA 131 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK- 198 G+TM H + + +PIPP+ EQ I + ++ + R L Sbjct: 132 STTGSTMVHVSMENMKPRLIPIPPIDEQKRIVCGVERVLPYVEKYELSQSRKDILDANIK 191 Query: 199 ---KQALVSYIVTKGLNPDVKMKDSGIEWV 225 K++++ + L P + + + E + Sbjct: 192 ESLKKSILQEAIQGKLVPQIVREGTAHELL 221 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 63/207 (30%), Gaps = 11/207 (5%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K E +P WE ++T +N + K E + T N Sbjct: 5 KCIDDEVPFEIPQGWEWCRMQDVITFVNGRAYKKEELLSRGKYKVLRVGNFFTNNQWYYS 64 Query: 278 E-SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + + G++++ + + + I + + Sbjct: 65 DLELSEDKYCYHGDLLYAWSASFGPQIWNGDKTIFHYHIWN----VKFDTKVLFREYLYY 120 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +D +V + + E++K + +PPI EQ I + ++ E Sbjct: 121 FFLFDKTQVRASTTGSTMVHVSMENMKPRLIPIPPIDEQKRIVCGVERVLPYVEKY-ELS 179 Query: 397 EQSIVLL-----KERRSSFIAAAVTGQ 418 + +L + + S + A+ G+ Sbjct: 180 QSRKDILDANIKESLKKSILQEAIQGK 206 >gi|163798236|ref|ZP_02192168.1| putative Type I restriction enzyme MjaXP specificity protein [alpha proteobacterium BAL199] gi|159176484|gb|EDP61067.1| putative Type I restriction enzyme MjaXP specificity protein [alpha proteobacterium BAL199] Length = 310 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 50/326 (15%), Positives = 106/326 (32%), Gaps = 28/326 (8%) Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 + Q + + + L I+ + + + ++ P PPL Sbjct: 2 ATNQQINAVICDPRKADSAFVYYLLDMRAVAIKRLAGAQAVPIVNKSTFEDVTAPFPPLP 61 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 EQ I E + D I + + +++ + S++ +G + Sbjct: 62 EQRKIAEIL----RTWDEAIEKLEALRKANLQRRIWMRSHLF------------TGRTRL 105 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 W ++TE + T E +S+ G +I ++E Y Sbjct: 106 PGYRGEWREVTLGEVLTEHGLQGTGAEEVFSVSVHKG-LINQIEHLGRSFAAAETGHYNR 164 Query: 286 VDPGEIVFRFIDLQNDKRSLRS-AQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYD-L 342 V PG+IV+ + + +++ ++ I++ Y P L L S + Sbjct: 165 VLPGDIVYTKSPTGDFPLGIIKQSKISQQVIVSPLYGVFTPATQALGVILDALFESPIAV 224 Query: 343 CKVFYAMGSGLRQS---LKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQ 398 + + ++ + + +P EQ I V+ V A + IE Sbjct: 225 RNYLHPLVQKGAKNTIAITNRRFLEGKLHLPMEPAEQAAIAEVVEVSQAEL----TAIEA 280 Query: 399 SIVLLKERRSSFIAAAVTGQIDLRGE 424 I L ++ + +TG+ + E Sbjct: 281 EIEALTRQKRGLMQKLLTGEWRVTPE 306 >gi|55821636|ref|YP_140078.1| type I restriction-modification system specificty subunit, truncated [Streptococcus thermophilus LMG 18311] gi|55737621|gb|AAV61263.1| type I restriction-modification system specificty subunit, truncated [Streptococcus thermophilus LMG 18311] Length = 101 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 21/104 (20%), Positives = 47/104 (45%), Gaps = 7/104 (6%) Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + E + + MA++P GID Y + L K+ + + + ++ +L+ Sbjct: 2 LGEDSYMDTNMMALEPKGIDPEYRYTFINKTGLYKIED---TSTIPQINNKHIEPYLLLI 58 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P ++EQ I + ++D + ++ + LLKE++ F+ Sbjct: 59 PSLEEQHKIGSF----FKQLDETIALHQRKLDLLKEQKKGFLQK 98 >gi|296448297|ref|ZP_06890190.1| restriction modification system DNA specificity domain [Methylosinus trichosporium OB3b] gi|296254212|gb|EFH01346.1| restriction modification system DNA specificity domain [Methylosinus trichosporium OB3b] Length = 393 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 44/108 (40%), Gaps = 7/108 (6%) Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 R + + + A++ +D +L ++ +YDL R L +D Sbjct: 76 RGVAYRIEGKSWVNNHAHVLRPKPFMDIRFLCRVLENYDLRPFI---TGSTRAKLTKKDA 132 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +R+ + VPP+ EQ I +++ D L K ++I L + + Sbjct: 133 ERIVIPVPPLDEQRRIAAILDQA----DDLRRKRREAIAKLAKLSTGL 176 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 43/313 (13%), Positives = 86/313 (27%), Gaps = 19/313 (6%) Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVLQ 115 G Y N Q IF + +L + G P A + + VL+ Sbjct: 38 GPYPYYGANGLQGWIDG-FIFDEPLLLLAEDGGHFDDPDRGVAYRIEGKSWVNNHAHVLR 96 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 PK + +L + G+T + K I +P+PPL EQ I + Sbjct: 97 PKPFMDIRFLCRVLENY---DLRPFITGSTRAKLTKKDAERIVIPVPPLDEQRRIAAILD 153 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 + +L V P + S I +G + Sbjct: 154 QADDLRRKRREAIAKLAKLSTGL-------FVELFGTPWISSASSSISDLGSISVFENGD 206 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + + ++ + + ++ + K S + P +++ Sbjct: 207 RSSNYPSGDDILSSGIPFLSTKNIVDDKLDLGSLLFISSSKFASL-SRGKARPHDLIITL 265 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-R 354 + I + I +L + + + +G+G Sbjct: 266 RGTLGS-CCIFDGPFSTAFINAQMMIIRPKTDISPVFLHAYLTLPAIKEHLQQIGNGAAV 324 Query: 355 QSLKFEDVKRLPV 367 L + + LP+ Sbjct: 325 PQLTAKQLAGLPI 337 >gi|257091255|ref|ZP_05585616.1| type I restriction-modification system specificity subunit [Enterococcus faecalis CH188] gi|312905314|ref|ZP_07764429.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0635] gi|257000067|gb|EEU86587.1| type I restriction-modification system specificity subunit [Enterococcus faecalis CH188] gi|310631338|gb|EFQ14621.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0635] gi|315162493|gb|EFU06510.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0645] gi|315578593|gb|EFU90784.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0630] Length = 380 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 54/396 (13%), Positives = 121/396 (30%), Gaps = 37/396 (9%) Query: 25 WKVVPIKRFTKLNTGRTS-----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + + + DI + + + ++ ++ Sbjct: 10 WEQCKLGDLGSVAMNKRIFKEQTSESGDIPFYKIGTFGATADAFISRELFET--YKKKYP 67 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G +L G R D +V D L V Sbjct: 68 YPKIGDLLISASGSIGRVVEYKGNDEYFQDSNIVWLKHDDRINNLFLKQFYSIVKWHGL- 126 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG+T+ K I + +P EQ EKI ++D +IT R +E LKE K Sbjct: 127 --EGSTIKRLYNKNILETTIHLPVFDEQ----EKIGTLFKQLDDIITLHQRKLEQLKELK 180 Query: 200 QALVSYIVTK---GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 +A + + N K++ + E + ++ + + K+ Sbjct: 181 KAYLQAMFVPTNVQNNKVPKLRFANFEGNWEL------CKLENVIDKQIKGKVKVENLCN 234 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 S+ Y + R G KP + V +I+ + + K +G++ Sbjct: 235 GSVEYLDA-----NRLNGGKPIYTKALPDVSERDIIILWDGSKAGKVY-----YGFKGVL 284 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 S A + ++ + + ++ + + P+ + +EQ Sbjct: 285 GSTLKAYQLKECANSQFIYQQLLDNQNNIYNNYRTPNIPHVVKNFSSIFPIWMTSFEEQS 344 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +++ + +D + + + + S++ Sbjct: 345 QMADIL----SNLDNRIILQQNLTDTMISLKKSYLQ 376 Score = 66.4 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 50/168 (29%), Gaps = 11/168 (6%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSL 305 K +I G + E+Y+ Y G+++ Sbjct: 29 KEQTSESGDIPFYKIGTFGATADAFISRELFETYKKKYPYPKIGDLLISASGSIGRVV-- 86 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + D ++ + ++ + + L +++ Sbjct: 87 --EYKGNDEYFQDSNIVWLK--HDDRINNLFLKQFYSIVKWHGLEGSTIKRLYNKNILET 142 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +P EQ I ++D ++ ++ + LKE + +++ A Sbjct: 143 TIHLPVFDEQEKIG----TLFKQLDDIITLHQRKLEQLKELKKAYLQA 186 Score = 38.6 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 58/183 (31%), Gaps = 15/183 (8%) Query: 24 HWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-- 80 +W++ ++ G+ + +E++ +G+ +YL + + T ++ Sbjct: 209 NWELCKLENVIDKQIKGK----------VKVENLCNGSVEYLDANRLNGGKPIYTKALPD 258 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 ++ I+ G K F G+ + Q K+ + +D I Sbjct: 259 VSERDIIILWDGSKAGKVYY-GFKGVLGSTLKAYQLKECANS-QFIYQQLLDNQNNIYNN 316 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + H P+ + EQ + + + RI I L K Q Sbjct: 317 YRTPNIPHVVKNFSSIFPIWMTSFEEQSQMADILSNLDNRIILQQNLTDTMISLKKSYLQ 376 Query: 201 ALV 203 + Sbjct: 377 NMF 379 >gi|257467223|ref|ZP_05631534.1| type I restriction system specificity protein [Fusobacterium gonidiaformans ATCC 25563] Length = 183 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 71/168 (42%), Gaps = 6/168 (3%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETR----NMGLKPESYETYQIVDPGEIVFRFIDLQN 300 K +L++ YG+I K + + E+ + + V G +V Sbjct: 1 MPKTMFDNHGEVLAIHYGHIYTKYNIFVKEPIVKVSMENAKNLKKVKKGNLVIAKTSENL 60 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMGSGLRQ-SLK 358 D A + E ++T + A+ HG + YL+++ + K + G++ L Sbjct: 61 DDVMKTVAYLGEDEVVTGGHSAIFRHGANPKYLSYVFNGADYFIKQKNKLAHGVKVIELS 120 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 D+++ +L+PPI Q I ++++ + L + + + I L +++ Sbjct: 121 TTDMEKFQILIPPIHIQEYIVSILDKFDMLTNDLTQGLPREIELRQKQ 168 Score = 44.0 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 17/174 (9%), Positives = 54/174 (31%), Gaps = 6/174 (3%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT-STVSIFAKGQILYGKL----GPYLR 96 ++ +++ I + + ++ + + + KG ++ K ++ Sbjct: 6 FDNHGEVLAIHYGHIYTKYNIFVKEPIVKVSMENAKNLKKVKKGNLVIAKTSENLDDVMK 65 Query: 97 KAIIADFDGICSTQFLVLQPKDVLPEL-LQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 D + + + P+ + + ++ + G + + Sbjct: 66 TVAYLGEDEVVTGGHSAIFRHGANPKYLSYVFNGADYFIKQKNKLAHGVKVIELSTTDME 125 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + IPP+ Q I + + + L R IEL +++ + + Sbjct: 126 KFQILIPPIHIQEYIVSILDKFDMLTNDLTQGLPREIELRQKQYEYYREKLFDF 179 >gi|255690850|ref|ZP_05414525.1| type I restriction-modification system, S subunit [Bacteroides finegoldii DSM 17565] gi|260623482|gb|EEX46353.1| type I restriction-modification system, S subunit [Bacteroides finegoldii DSM 17565] Length = 331 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 74/209 (35%), Gaps = 13/209 (6%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKN--TKLIESNILSLSYGNIIQKLETRNMGL 275 K E +P WE ++ L+ ++ + S+ + Y + N+ + Sbjct: 77 KCIDEESPFEIPKGWEWSKLSNVIELLSGQDFIPEKYNSSNQGIPYITGASNIVNGNLAI 136 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G+++ K + + I + + + + ++ L++ Sbjct: 137 NRWTETPTVIGKLGDLLIVCKGSGVGKMCICNVDK----IHIARQIQIIRNFSNAISLSY 192 Query: 336 LMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + + G+ + E + L + +PP EQ++I + ID Sbjct: 193 VKSVVEANLQTIISNAQGVIPGISREHILNLLIPLPPTNEQYEIDKKLQEILPFIDRY-A 251 Query: 395 KIEQSIVLLK-----ERRSSFIAAAVTGQ 418 K ++++ L + S + AV G+ Sbjct: 252 KSQEALDKLNVELLGNLKKSILQEAVQGR 280 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 65/196 (33%), Gaps = 4/196 (2%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IPK W+ + +L +G+ K +G + + + + Sbjct: 86 EIPKGWEWSKLSNVIELLSGQDFIPEKYNSSNQGIPYITGASNIVNGNLAINRWTETPTV 145 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I G +L G + K I + D I + + + L ++ + Sbjct: 146 IGKLGDLLIVCKGSGVGKMCICNVDKIHIARQIQIIRNFSNAISLSYVKSVVEANLQTII 205 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE-- 197 + + I N+ +P+PP EQ I +K+ ID + +L E Sbjct: 206 SNAQGVIPGISREHILNLLIPLPPTNEQYEIDKKLQEILPFIDRYAKSQEALDKLNVELL 265 Query: 198 --KKQALVSYIVTKGL 211 K++++ V L Sbjct: 266 GNLKKSILQEAVQGRL 281 >gi|198277089|ref|ZP_03209620.1| hypothetical protein BACPLE_03297 [Bacteroides plebeius DSM 17135] gi|198269587|gb|EDY93857.1| hypothetical protein BACPLE_03297 [Bacteroides plebeius DSM 17135] Length = 157 Score = 68.7 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 21/155 (13%), Positives = 49/155 (31%), Gaps = 8/155 (5%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV---DPGEIVFR 294 + + R N IL L G + + + + G+++ Sbjct: 7 WGAGSTPQRGNVNYYNGKILWLKTGELNNGIVYDTEEKITQKAFQDCSLRMNKIGDVLIA 66 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 K ++ V + A P+ I + Y+ + + + G + Sbjct: 67 MYGATIGKLAI----VGKELTTNQACCGCTPYLIYNWYIFYFLMASR-DSFIKKGEGGAQ 121 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 ++ + + +PP+KEQ+ I I ++ Sbjct: 122 PNISRVKLVEHLIPLPPLKEQYRIVAQIEKLFEQL 156 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 45/146 (30%), Gaps = 2/146 (1%) Query: 37 NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR 96 G + I+++ ++ +G + + ++ + G +L G + Sbjct: 14 QRGNVNYYNGKILWLKTGELNNGIVYDTEEKITQKAFQDCSLRMNKIGDVLIAMYGATIG 73 Query: 97 KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGN 156 K I + + P + + + EG + + Sbjct: 74 KLAIVGKELTTNQACCGCTPYLIYN--WYIFYFLMASRDSFIKKGEGGAQPNISRVKLVE 131 Query: 157 IPMPIPPLAEQVLIREKIIAETVRID 182 +P+PPL EQ I +I ++ Sbjct: 132 HLIPLPPLKEQYRIVAQIEKLFEQLR 157 >gi|328676720|gb|AEB27590.1| Type I restriction-modification system, specificity subunit S [Francisella cf. novicida Fx1] Length = 384 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 49/396 (12%), Positives = 125/396 (31%), Gaps = 33/396 (8%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG--KYLPKDGNSRQSDTSTVSIFAKGQ 85 + + + R +++ ++ +E++ +++P N +D S + K Q Sbjct: 6 KKLGSYIQQVKKRNADN-----FLTVENLRGININKEFMPSVANVTGTDLSKYKVVEKNQ 60 Query: 86 ILYGKL-----GPYLRKAIIADFDGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRI 137 Y + G + + I S +++ + + +LPE L W + + Sbjct: 61 FAYNPMHVGRDGVLPISMLELEQKVIVSPAYVIFEIVDKQILLPEYLMMWFRRSEFDRNA 120 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + +W + +PIP + +Q I E I I + + L+E Sbjct: 121 WFTTDSSVRGGFNWDDFCELELPIPSIEKQREIVA----EYYAITNRIKLNEQLNQKLEE 176 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 QA+ PD K + + + + + + +L Sbjct: 177 TAQAIYKEWFVDFEFPDEDGKP----YKSNGGEMVWCEELEKEIPKGWGVVSLDE---VL 229 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++ YG + LE N+ L + + ++ + ++ + + Sbjct: 230 TIRYGKDYKNLENGNIPLYGSGGIMGYV---NDYLYSGKAILIPRKGSLNNIIYLNQSFW 286 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + S+Y +L + S+ + + L +L P Sbjct: 287 TVDTMFYSIAKSSSYNQYLFHILKSMDFYSLNVGSAVPSMTTKLLNSLRILKPK----DT 342 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + + + I +L+ + + ++ Sbjct: 343 VLDKFEKNITTFFDYKNEKVKEINILELLKETLLSK 378 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 19/116 (16%), Positives = 35/116 (30%), Gaps = 14/116 (12%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IPK W VV + + G+ + ++E+G G+ Sbjct: 215 EIPKGWGVVSLDEVLTIRYGKDYK-----------NLENGNIPL---YGSGGIMGYVNDY 260 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 +++ IL + G + T F + + L L S+D Sbjct: 261 LYSGKAILIPRKGSLNNIIYLNQSFWTVDTMFYSIAKSSSYNQYLFHILKSMDFYS 316 >gi|325973650|ref|YP_004250714.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323652252|gb|ADX98334.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 254 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 15/150 (10%), Positives = 47/150 (31%), Gaps = 8/150 (5%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N ++ + + + ++ + + ++ + Sbjct: 47 NSNLRILSCDRHYNSKGLSQSKLFPKNTVCIVEGGNSSTDTAILKYSSCLSADLHG--FN 104 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 D ++ + + + + + + L + + PP +EQ I + Sbjct: 105 SFEGISDPRFIKYCFDYPKMKEKLMKLAKSTTAQPHLTLSRLLSVKFPCPPQEEQERIGD 164 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSF 410 ++ D L+E E+ I +L+ R++ Sbjct: 165 TLSA----YDELIENNEKQIGVLQAIRTAI 190 >gi|55823564|ref|YP_142005.1| type I restriction-modification system specificty subunit, truncated [Streptococcus thermophilus CNRZ1066] gi|55739549|gb|AAV63190.1| type I restriction-modification system specificty subunit, truncated [Streptococcus thermophilus CNRZ1066] Length = 107 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 21/104 (20%), Positives = 47/104 (45%), Gaps = 7/104 (6%) Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + E + + MA++P GID Y + L K+ + + + ++ +L+ Sbjct: 2 LGEDSYMDTNMMALEPKGIDPEYSYTFINKTGLYKIAD---TSTIPQINNKHIEPYLLLI 58 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P ++EQ I + ++D + ++ + LLKE++ F+ Sbjct: 59 PSLEEQHKIGSF----FKQLDETIALHQRKLDLLKEQKKGFLQK 98 >gi|302062748|ref|ZP_07254289.1| restriction modification system DNA specificity subunit [Pseudomonas syringae pv. tomato K40] Length = 148 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 48/149 (32%), Gaps = 10/149 (6%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + R + + T + +I+ ++ + + MA++ Sbjct: 1 INQRLPNVTKWTKRTANVSKAEDILIT---VKGSGVGEIWYSTLPEIAMGRQLMAIRSKS 57 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 S ++ +++ F +GSG + L + L P + EQ I + + Sbjct: 58 GASRFMFQFLQTK--KNHFKDLGSGNMIPGLSRAVILELEASFPNLPEQQRIADCL---- 111 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +D L+ Q L+ + + Sbjct: 112 TSLDDLIAAQTQKHEALETYKMGLMQQLF 140 >gi|148544648|ref|YP_001272018.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri DSM 20016] gi|148531682|gb|ABQ83681.1| restriction modification system DNA specificity domain [Lactobacillus reuteri DSM 20016] Length = 340 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 59/387 (15%), Positives = 124/387 (32%), Gaps = 49/387 (12%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +K G ++ KD+ + +G+Y P G + + + + Sbjct: 2 KLKDVC--IKGTSNIRQKDV---------NDSGRY-PVYGAAGPVGFMNSFQYDEPYVGV 49 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 K G + +A + L PK + + +S +E GAT+ H Sbjct: 50 VKDGAGIGRATYLPSNSSIIGTMQALIPKKNVLPKYLYYAVSS---MHLEKYYSGATIPH 106 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 +K + + EQ II ++ +I+ + + + L E +A V Sbjct: 107 IYFKNYKHERFVLVSKKEQEQ----IIWRFSLLEKMISNKQQQLLKLDELIKA---RFVE 159 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 +P + K+ + +G + T + + N GN I+ Sbjct: 160 MFGDPIINNKNIKKKKLGDI-----CLLKAGDFTPSKKISPVKTSINKYPCFGGNGIRGY 214 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 S Q G + F +N + ++ + +E I Sbjct: 215 VDNYTHQGNYSLIGRQGALCGNVKFATGKFRNTEHAILVSPNIE---------------I 259 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +S +L L+ L K+ + L + + + V V + Q + N + + Sbjct: 260 NSRWLFELLN---LEKLNRFRSGAAQPGLAVKTLNEIIVPVADLNSQNEYANFV----QQ 312 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAV 415 +D I++S+ ++ S + Sbjct: 313 VDKSKVVIQKSLDETQKLYDSLMQEYF 339 >gi|87300612|ref|ZP_01083454.1| type I restriction system specificity protein [Synechococcus sp. WH 5701] gi|87284483|gb|EAQ76435.1| type I restriction system specificity protein [Synechococcus sp. WH 5701] Length = 351 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 45/343 (13%), Positives = 100/343 (29%), Gaps = 35/343 (10%) Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 S F+ + ++L + G T + + +PPL Sbjct: 2 ATSQDFVNWVCGPNIDPHFLKYVLLAENEALWR-FASGTTHQTIYYPEAKAFHVCLPPLP 60 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI--VTKGLN----------- 212 EQ I + A +I+ ++ + Q+ V L+ Sbjct: 61 EQKAIAAVLGALDDKIELNRRMNATLEKMARALFQSWFVDFDPVRAKLDGQQPVGLDMST 120 Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALV----------TELNRKNTKLIESNILSLSYG 262 + + +G P WEV +++ ++ + + S+ Sbjct: 121 AALFPEHLEDSPLGKKPKGWEVTTLESVLAVLETGGRPKGGVSGITSGVPSIGAESIVSV 180 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID-----LQNDKRSLRSAQVMERGIIT 317 + +T+ + ++ ++ +++ + E I Sbjct: 181 GVFDFGKTKFVPVEFYEGMKRGHIESHDVLLYKDGGRPGEFEPHVSMFGDGFPFEECSIN 240 Query: 318 SAYMAVKPHGIDS-TYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQ 375 ++ +G+ S YL + M S G+G L V+ L VLVPP Sbjct: 241 EHVYRLRSNGLLSQEYLYFWMSSEFALAEMRIKGTGVAIPGLNSTAVRSLGVLVPPKPVM 300 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + A + + + L R + + ++G+ Sbjct: 301 EAF----TKQVAPLVTQILSNAKQSRTLAILRDTLLPKLLSGE 339 Score = 39.8 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 60/205 (29%), Gaps = 18/205 (8%) Query: 18 IGAIPKHWKVVPIKRF-TKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSR 71 +G PK W+V ++ L TG + G + IG E + S K Sbjct: 133 LGKKPKGWEVTTLESVLAVLETGGRPKGGVSGITSGVPSIGAESIVSVGVFDFGKTKFVP 192 Query: 72 QSDTSTVSI--FAKGQILYGKLGPYLRKA---------IIADFDGICSTQFLVLQPKDVL 120 + +L K G + + + L+ +L Sbjct: 193 VEFYEGMKRGHIESHDVLLYKDGGRPGEFEPHVSMFGDGFPFEECSINEHVYRLRSNGLL 252 Query: 121 PELLQGWLLSIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + +S + + G + + + ++ + +PP +++ Sbjct: 253 SQEYLYFWMSSEFALAEMRIKGTGVAIPGLNSTAVRSLGVLVPPKPVMEAFTKQVAPLVT 312 Query: 180 RIDTLITERIRFIELLKEKKQALVS 204 +I + + L L+S Sbjct: 313 QILSNAKQSRTLAILRDTLLPKLLS 337 >gi|67920382|ref|ZP_00513902.1| Restriction modification system DNA specificity domain [Crocosphaera watsonii WH 8501] gi|67857866|gb|EAM53105.1| Restriction modification system DNA specificity domain [Crocosphaera watsonii WH 8501] Length = 193 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 72/192 (37%), Gaps = 11/192 (5%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M EW+ L P + ++ LS+S N + + + Sbjct: 1 MTLYNSEWI-LKPLSELCEIVIGRTPSRSKPEYWGKGYEWLSISDMNEKKYISVTKETIT 59 Query: 277 PE--SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E S +++ +VF F L + I+ P + + YL Sbjct: 60 DEGASLCKDKLLSINTVVFSFKLSIGKVSILDAPMYTNEAIV--GLPIKDPSLLYTDYLY 117 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +++++ D+ G +L + ++++ + +PP++EQ I +++ + D + Sbjct: 118 YVLKTLDVSSKTDRAVMGA--TLNKKKLEQIKIPLPPLEEQKRIAKILD----KADEIRH 171 Query: 395 KIEQSIVLLKER 406 K ++SI L E Sbjct: 172 KRKESIRLTDEL 183 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 49/164 (29%), Gaps = 8/164 (4%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVES-GTGKYLPKDGNSRQSDT 75 W + P+ ++ GRT GK ++ + D+ + + Sbjct: 6 SEWILKPLSELCEIVIGRTPSRSKPEYWGKGYEWLSISDMNEKKYISVTKETITDEGASL 65 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + + +++ + K I D + + L KD + Sbjct: 66 CKDKLLSINTVVFS-FKLSIGKVSILDAPMYTNEAIVGLPIKDPSLLYTDYLYYVLKTLD 124 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + K + I +P+PPL EQ I + + Sbjct: 125 VSSKTDRAVMGATLNKKKLEQIKIPLPPLEEQKRIAKILDKADE 168 >gi|88856339|ref|ZP_01130998.1| hypothetical protein A20C1_00325 [marine actinobacterium PHSC20C1] gi|88814423|gb|EAR24286.1| hypothetical protein A20C1_00325 [marine actinobacterium PHSC20C1] Length = 395 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 59/406 (14%), Positives = 118/406 (29%), Gaps = 38/406 (9%) Query: 23 KHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + W+V + ++N + + Y G+ G YL + + ++ Sbjct: 3 EGWRV--LGDVLAQVNRSVVVADVESVPYAGVRW--YAGGVYLREVADPEGVKAKQLARI 58 Query: 82 AKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQG--WLLSIDVTQR 136 +G ++Y ++ IA + + F + L + L + D Sbjct: 59 REGDVIYNRMWATRASFGIARADVDGCLVTNDFPTFETNTDLALVDFIGLILQTKDFQAE 118 Query: 137 IEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 G T K +I +P L EQ I + + A I E Sbjct: 119 AALRASGTTERRRLKEKDFLSIETWLPSLPEQCRIVDLMGALDEAI-------AVADESH 171 Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 + A ++ + + + S + +R N Sbjct: 172 EAASFAYIAALQDFDGPRHPRREISEV------------LKKAKAGGTPSRLNLDNFGGA 219 Query: 256 ILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 I L G + E S + I G V + K + + Sbjct: 220 IPWLKSGEVNNDNIHTADESLSEFGLSGSSAWIAPAGSTVVAMYGQGDTKGTAGFLRAPM 279 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + I+ L +RS A+G + +L V + VPP Sbjct: 280 SMNQAVIALVPETTLIEPRLLMHAIRSRTGSLRARAIG-AAQPNLSKSIVLSEAIAVPPR 338 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +Q I + ++ +L L+ R++ + A ++G+ Sbjct: 339 DDQASIADYLDAFL----LLCSDAGSYASALRCLRTNLLTALLSGE 380 >gi|294788778|ref|ZP_06754019.1| type I restriction/modification specificity protein [Simonsiella muelleri ATCC 29453] gi|294483260|gb|EFG30946.1| type I restriction/modification specificity protein [Simonsiella muelleri ATCC 29453] Length = 466 Score = 68.3 bits (165), Expect = 3e-09, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 53/159 (33%), Gaps = 3/159 (1%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 I+ L+ GN I + E + G+I+ + + Sbjct: 30 GIPFFRSKEIIELNSGNEITTELFISKERFLEIKNKFGTPSYGDILLTSVGTLGVPYFVN 89 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRL 365 + + + + S YL + S K + + +L +K L Sbjct: 90 YKEEFYFKDGNLTWFRKFNNILRSKYLYYWFSSPVGRKALKEITIGSTQPALTITGLKSL 149 Query: 366 PVLVPPIKEQFDITNVINVETARI--DVLVEKIEQSIVL 402 + +P ++EQ I +++ +++I + + + + I Sbjct: 150 TIHLPTLEEQDYIIEILDHLSSKIHLNTQINQTLEQIAQ 188 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 54/419 (12%), Positives = 120/419 (28%), Gaps = 74/419 (17%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDV---ESGTGKYLPKDGNSRQSDT 75 +WK + ++ + + + I + +++ SG + + Sbjct: 2 SNWKEYKLGELVEITSSKRIMRSEYQEDGIPFFRSKEIIELNSGNEITTELFISKERFLE 61 Query: 76 STVSIFAK--GQILYGKLGPYLRKAII-ADFDGICSTQFLVLQPKDVL---PELLQGWLL 129 G IL +G + + L K + L W Sbjct: 62 IKNKFGTPSYGDILLTSVGTLGVPYFVNYKEEFYFKDGNLTWFRKFNNILRSKYLYYWFS 121 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 S + ++ I G+T G+ ++ + +P L EQ I E + + +I Sbjct: 122 SPVGRKALKEITIGSTQPALTITGLKSLTIHLPTLEEQDYIIEILDHLSSKIHLNTQINQ 181 Query: 190 RFIELLKEKKQALVSYI-------------------------VTKGLNPDVKMKDSG--- 221 ++ + ++ V G P+ S Sbjct: 182 TLEQIAQAMFKSWFVDFDPVHAKVQALSNGLSLEQAELAAMQVISGKTPEELTALSQTQP 241 Query: 222 -----------------IEWVG-LVPDHWEVKPFFALVTELNR---KNTKLIESNILSLS 260 +E G VP W+ + N K+++ +S I + Sbjct: 242 DRYAELAETAKAFPCEMVEVDGIEVPKGWKQTALSEICEMQNGYAFKSSEWTDSGIPVIK 301 Query: 261 YGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 G+I K+ + S + +++ G+IV K A +R ++ Sbjct: 302 IGSIQSKILTVEGNGFVSEDNLSLRSNFVLNDGDIVIGLTGAYVGKVGRMPAN--KRAML 359 Query: 317 TSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAM-----GSGLRQSLKFEDVKRLPVLV 369 I+ S + + + F + ++ +D+ + P+L+ Sbjct: 360 NQRVAKFLAKQINESETFYSFIYMNVIQEEFKNFVDFTAQGSAQPNISTKDILKYPLLL 418 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 54/173 (31%), Gaps = 12/173 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSR-QSD 74 +PK WK + ++ G +S + I I + ++S S Sbjct: 265 EVPKGWKQTALSEICEMQNGYAFKSSEWTDSGIPVIKIGSIQSKILTVEGNGFVSEDNLS 324 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVL-----PELLQGW 127 + + G I+ G G Y+ K + + + + K + + Sbjct: 325 LRSNFVLNDGDIVIGLTGAYVGKVGRMPANKRAMLNQRVAKFLAKQINESETFYSFIYMN 384 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 ++ + ++ +G+ + K I P+ + + + + + Sbjct: 385 VIQEEFKNFVDFTAQGSAQPNISTKDILKYPLLLANNDVHLAFEKLLNKILDK 437 >gi|282878879|ref|ZP_06287644.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310] gi|281299001|gb|EFA91405.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310] Length = 257 Score = 68.3 bits (165), Expect = 3e-09, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 71/220 (32%), Gaps = 22/220 (10%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K E +P WE L + ++ I+ E +N K Sbjct: 5 KCIDDEVPFEIPQGWEWCRLNDLAMYRKGPFGSSLTKSMFVTKSTQSIKVYEQKNAIQKN 64 Query: 278 ESYETYQI------------VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + Y I V P +I+ + L S + GII A M V Sbjct: 65 HTLGDYYISPKKFETMQSFVVKPNDIIVSCAGTIGEIYLLPSDASI--GIINQALMRVSL 122 Query: 326 HGIDS-TYLAWLMRSYDLCKVFYAMGSGLRQSLK-FEDVKRLPVLVPPIKEQFDITNVIN 383 ++ Y L + +++ FE +K + V +PP+ EQ + N Sbjct: 123 FDLNMAEYWQIYFAYMLLNEAQMKGAGSAIKNIPPFEYLKAVLVPIPPLSEQNRLVERYN 182 Query: 384 VETARIDVLVEKIEQSIVLLKE-----RRSSFIAAAVTGQ 418 + + ID E + L + + S + A+ G+ Sbjct: 183 IILSLIDKY-ELEANKLNRLNQNIYDKLKKSVLQEAIQGK 221 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/228 (12%), Positives = 72/228 (31%), Gaps = 17/228 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGR----------TSESGKDIIYIGLEDVESGTGKYLPKDGN 69 IP+ W+ + G ++S + I ++ + Sbjct: 14 EIPQGWEWCRLNDLAMYRKGPFGSSLTKSMFVTKSTQSIKVYEQKNAIQKNHTLGDYYIS 73 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPKDVLPELLQGW 127 ++ +T + I+ G ++ GI + + + D+ Sbjct: 74 PKKFETMQSFVVKPNDIIVSCAGTIGEIYLLPSDASIGIINQALMRVSLFDLNMAEYWQI 133 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGI-GNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + + G+ + + + +PIPPL+EQ + E+ ID Sbjct: 134 YFAYMLLNEAQMKGAGSAIKNIPPFEYLKAVLVPIPPLSEQNRLVERYNIILSLIDKYEL 193 Query: 187 ERIRFIELLKEKK----QALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 E + L + ++++ + L P + + + E + + + Sbjct: 194 EANKLNRLNQNIYDKLKKSVLQEAIQGKLVPQIDSEGTAQELLEQIKE 241 >gi|307256318|ref|ZP_07538101.1| Type i restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306865144|gb|EFM97044.1| Type i restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 353 Score = 68.3 bits (165), Expect = 3e-09, Method: Composition-based stats. Identities = 45/383 (11%), Positives = 103/383 (26%), Gaps = 38/383 (9%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 KD V+W + + T T + + + E+ Sbjct: 8 KDCKVEW----------KSLGEIL-IRTKGTKITAGQMKELHKENAPVKIFAGGRTVAFV 56 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 +D I + I+ G + D + K+ + + Sbjct: 57 DFNDIPQKDINNEPSIIVKSRGII--EFEYYDKSFSHKNEMWSYHSKNENINIKFVYYFL 114 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + I M +PIPPL Q I + + T Sbjct: 115 KQNEPHFQNIGSKMQMPQIATPDTDKYKIPIPPLEIQEKIVKTLDIFTKL--------EA 166 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + L ++ + ++T G + +EW L + + Sbjct: 167 ELSLRVKQYDYYRNELLTFGDD---------VEWKTLGDVAMIIDSLHQTPKYTEYGKSM 217 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + G + + + +IV + + + Sbjct: 218 ---VRVTDIKGGVLNLLNTLKVDDETFAIFTKKYTPQKEDIVMSRVGSYGNVSLVPET-- 272 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLV 369 + V I++ YL ++ S + G G +++L + +K++PV + Sbjct: 273 --GSVCMGQNTVVINPFINNKYLYHILTSNFVKDFIEKNIGGGNQKTLSLKAIKQIPVPI 330 Query: 370 PPIKEQFDITNVINVETARIDVL 392 Q I ++++ + + Sbjct: 331 VNDCLQQKIVDILDKFDRLTNSI 353 >gi|295110202|emb|CBL24155.1| Restriction endonuclease S subunits [Ruminococcus obeum A2-162] Length = 354 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 55/365 (15%), Positives = 113/365 (30%), Gaps = 29/365 (7%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 ++ G +S + DV TG+Y + I Sbjct: 4 KLEDVC--VRGSSS--------LKQSDVIDKTGEYPIYGAAGYIGNVDFYHQDQP-YIAV 52 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 K G + + + L PK+ + +++ +E GAT+ H Sbjct: 53 VKDGAGIGRTSLYPAKSSVIGTMQYLLPKENVLPEYLCYVVK---YMHLEKYFTGATIPH 109 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV- 207 +K + L Q I + RI+ +I+ R + ++ L E +A + Sbjct: 110 IYFKDYKKEEFNLDILDRQKEIVNIL----GRIECVISSRQQELQKLDELIKARFVEMFG 165 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 +NP K + V + P + KP VT+ I+ Y ++ Sbjct: 166 DPYVNPLKWKKLKIKDAVTIEPQNGLYKPQSDYVTDGTGIPILRIDGF-----YDGMVTD 220 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII---TSAYMAVK 324 + E+ ++ +IV ++ + + + M Sbjct: 221 FASLKRLKCSETERQRYLLLEDDIVINRVNSIEYLGKCAHIKELLEDTVYESNMMRMHFD 280 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P + Y+ L+ S + S + S+ +DV + PP+ Q + + + Sbjct: 281 PEYYNPVYICKLLCSQFIYDQIVNHAKKSVNQASINQKDVLDFNIYQPPLDLQNEFADFV 340 Query: 383 NVETA 387 + Sbjct: 341 HQVNK 345 Score = 41.7 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 41/111 (36%), Gaps = 7/111 (6%) Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 S + +I + + + YL ++++ L K F + Sbjct: 55 DGAGIGRTSLYPAKSSVIGTMQYLLPKENVLPEYLCYVVKYMHLEKYF---TGATIPHIY 111 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 F+D K+ + + Q +I N++ RI+ ++ +Q + L E + Sbjct: 112 FKDYKKEEFNLDILDRQKEIVNILG----RIECVISSRQQELQKLDELIKA 158 >gi|90961895|ref|YP_535811.1| Type I restriction-modification system specificity subunit [Lactobacillus salivarius UCC118] gi|90821089|gb|ABD99728.1| Type I restriction-modification system specificity subunit [Lactobacillus salivarius UCC118] Length = 384 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 51/383 (13%), Positives = 110/383 (28%), Gaps = 41/383 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ + ++ + + +I + + ++ + Sbjct: 18 NDWERKKLGEIGSVSMNKRIFKDETSTIGEIPFYKIGTFGGKADAFITRKKYEEYKKKYP 77 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 KG +L G R + +V + L Sbjct: 78 YP--QKGNLLISASGSIGRIIEYNGEEAYYQDSNIVW---LDHDNTILDVFLKPTYEIIK 132 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 EG T+ K I N + P + EQ KI + ++ I R E L Sbjct: 133 WDGIEGTTIKRLYNKNILNTVIYKPTIDEQ----RKIGKLFIILNNTIQLHERKYEELTL 188 Query: 198 KKQALVSYIVT--KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 K+AL+ + G P+V+ K+ W E + ++ ++ Sbjct: 189 IKKALLQKLFPKKDGFKPEVRYKNFNDAW--------EQRKLGEVIISEHKGK------V 234 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + GN G + + V +++ + + G Sbjct: 235 KSIMKGGNTNYLETNYLNGGTAQKVDAIADVSKDDVLILWDGS-----KAGTIYHGFEGA 289 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + S A P S + + + K++ + + + ++ V +P I EQ Sbjct: 290 LGSTLKAYVPKY--SGDFLYQILKKNQDKIYQSYRTPNIPHVIKNFTEKFNVSIPTIIEQ 347 Query: 376 FDITNVINVETARIDVLVEKIEQ 398 +I + ++D L+ + Sbjct: 348 QEIGDF----FKQLDSLITLHRR 366 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 44/170 (25%), Gaps = 11/170 (6%) Query: 247 KNTKLIESNILSLSYGNIIQKLE-TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K+ I G K + E + Y G ++ Sbjct: 39 KDETSTIGEIPFYKIGTFGGKADAFITRKKYEEYKKKYPYPQKGNLLISASGSIGRII-- 96 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 E + + D+T L ++ + + + L +++ Sbjct: 97 --EYNGEEAYYQDSNIVWL--DHDNTILDVFLKPTYEIIKWDGIEGTTIKRLYNKNILNT 152 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + P I EQ I ++ ++ E+ L + + + Sbjct: 153 VIYKPTIDEQRKIG----KLFIILNNTIQLHERKYEELTLIKKALLQKLF 198 >gi|291515465|emb|CBK64675.1| Restriction endonuclease S subunits [Alistipes shahii WAL 8301] Length = 353 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 52/389 (13%), Positives = 102/389 (26%), Gaps = 63/389 (16%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W I + GR + + G G + ++ Sbjct: 23 ERWDTYRIADILCIGNGRDYK-----------HLSKGDIPVFGTGGYMTSVNEC---LYE 68 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G+ G + T F K V+P+ + +I+ E Sbjct: 69 GETTFIGRKGTINKPFYYNGKFWTVDTLFYTHSFKRVIPKFVYCLFQTIN----WLRYNE 124 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + I I + IP L EQ I + + RI T + L+K Q + Sbjct: 125 ASGVPSLSKDTIEKIKVRIPQLDEQKKIAKLLSLLDERIATQNKIIEKLQSLIKGIAQNI 184 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 V + + + +++ L Sbjct: 185 VHR----------------------NKPNVRISQCLECSSSTLQESDVLECGAYPVYGAN 222 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 ++ L+ N S E I+ G S + + + Sbjct: 223 GVVGFLDNYNT-----SNEAIYIIKDGS-----------GVGAVSYVAGKCSATGTLNIL 266 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 G YL +L+ ++ M + F+D + + P EQ + Sbjct: 267 QAKKGFSLRYLYYLLNIFNFEPYKTGMA---IPHIYFKDYGKAQIFCPSYSEQLKYAKFL 323 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFI 411 A ID + + ++ L + + Sbjct: 324 ----ATIDDKLLTEQNVLINLSLLKQYLL 348 >gi|26554274|ref|NP_758208.1| type I restriction-modification system S subunit [Mycoplasma penetrans HF-2] gi|26454283|dbj|BAC44612.1| type I restriction-modification system S subunit [Mycoplasma penetrans HF-2] Length = 415 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 31/254 (12%), Positives = 82/254 (32%), Gaps = 13/254 (5%) Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + + + + K L +Q L E + +I + I Sbjct: 149 DEYDELNINLINLDKIFKLNLKKSIIQYAIEGKLVKQDLNSETVSELVKKISEEKQKLIS 208 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++ K+K ++ + K IE +P++W + N + Sbjct: 209 EGKIKKDKNESFIFEDNNCYYEKINNGKPQNIEVPFEIPENWSWVRLKTISEIYNGNSIS 268 Query: 251 LIE----------SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 E + + N + N P + + ++I +I+ Sbjct: 269 KEEKEKKYTKCSGYDYIGTKDINFDFSINYDNGVYIPLNEKNFKIAPKNKILLCIEGGS- 327 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + + + + + ++ YL + ++SY +F + +G+ + + Sbjct: 328 --AGKKIGITSKDVCFGNKLVCINDFLSNNLYLFYFLQSYYFKNIFNQLTTGIIGGISIQ 385 Query: 361 DVKRLPVLVPPIKE 374 ++K + + +PP +E Sbjct: 386 NLKNIMIPLPPKRE 399 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 54/172 (31%), Gaps = 10/172 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE---------SGKDIIYIGLEDVESGTGKYLPKDGNS 70 IP++W V +K +++ G + YIG +D+ +G Sbjct: 245 EIPENWSWVRLKTISEIYNGNSISKEEKEKKYTKCSGYDYIGTKDINFDFSINYD-NGVY 303 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + I K +IL G K I +C LV + L + L Sbjct: 304 IPLNEKNFKIAPKNKILLCIEGGSAGKKIGITSKDVCFGNKLVCINDFLSNNLYLFYFLQ 363 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + I + + + NI +P+PP E I + + Sbjct: 364 SYYFKNIFNQLTTGIIGGISIQNLKNIMIPLPPKRECEKIIKITHKIISLLR 415 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 45/144 (31%), Gaps = 8/144 (5%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + + V K +L + + ++S +L +L + Sbjct: 42 KEKNKINLKLNDFVIPARGASIGKITLIKDETAT--CTQTTMYMKPFSIVNSKFLFFLFK 99 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 S + + + + + + +PP EQ I I + +D E Sbjct: 100 SIE--SYLFQSSGSAQPQITVNETIEKLIPIPPSNEQNSIYQKIIILNKSVDEYDELNIN 157 Query: 399 SIVLLK----ERRSSFIAAAVTGQ 418 I L K + S I A+ G+ Sbjct: 158 LINLDKIFKLNLKKSIIQYAIEGK 181 >gi|182414825|ref|YP_001819891.1| restriction endonuclease S subunits-like protein [Opitutus terrae PB90-1] gi|177842039|gb|ACB76291.1| Restriction endonuclease S subunits-like protein [Opitutus terrae PB90-1] Length = 388 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 51/351 (14%), Positives = 100/351 (28%), Gaps = 34/351 (9%) Query: 76 STVSIFAKGQILYGKLGPYLRKAII----ADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ST G +L K+ P++R+A + I S++++V + + P ++ L+ Sbjct: 61 STKQAVETGDVLLSKIVPHIRRAWVVGASRGRRMIASSEWIVFRNARIFPGYIRHLLVED 120 Query: 132 DVTQRIEAICEGATMSHAD--WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + G S + I +P+PPLAEQ I E + Sbjct: 121 RFHAKFMSTVSGVGGSLLRARPAHVARIRVPLPPLAEQRRIAEVLDRAEALRAKRRATLA 180 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + L + + +P K +G V Sbjct: 181 QLDSLTQCL-------FLDLFGDPATNPKGWPKTVLGE---------IIEFVGGSQPPRE 224 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 I+ ++ + + +++ + Sbjct: 225 TFTYEPSPDTIRLVQIRDFKSDEFKTYIPRRLARRFFNEDDVMIGRYGPP-----VFQIL 279 Query: 310 VMERGIITSAYMAVKPHGIDSTYL-AWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLP 366 G A M P S L++ L A + + + E +++ P Sbjct: 280 RGLCGSYNVALMKALPKDEVSKDFVFHLLQEQRLHSYVVARSERTAGQTGVNLELLEKYP 339 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 PP Q + + A ++ L S+ L +S A G Sbjct: 340 AFRPPASLQREFARRV----AAVEKLKTTQRASLAELDALFASLQHRAFRG 386 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 12/104 (11%), Positives = 30/104 (28%), Gaps = 2/104 (1%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 PK W + + G + + + + + Sbjct: 201 PKGWPKTVLGEIIEFVGGSQPPRETFTYEPSPDTIRLVQIRDFKSDEFKTYIPRRLARRF 260 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 F + ++ G+ GP + + + G + + PKD + + Sbjct: 261 FNEDDVMIGRYGPPVFQI-LRGLCGSYNVALMKALPKDEVSKDF 303 >gi|148544646|ref|YP_001272016.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri DSM 20016] gi|184153999|ref|YP_001842340.1| restriction endonuclease S subunit [Lactobacillus reuteri JCM 1112] gi|148531680|gb|ABQ83679.1| restriction modification system DNA specificity domain [Lactobacillus reuteri DSM 20016] gi|183225343|dbj|BAG25860.1| restriction endonuclease S subunit [Lactobacillus reuteri JCM 1112] Length = 372 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 44/388 (11%), Positives = 107/388 (27%), Gaps = 37/388 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + T ++ KD +++ GK RQ Sbjct: 2 EYKKFTALFTDVTKTGTKIPKDEYLTTGKNIIIDQGKDSIAGYTDRQKGIFEEVPV---- 57 Query: 86 ILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I++ G + R D VL+ K+ + Sbjct: 58 IVF---GDHTRIVKYIDKPFFLGADGVKVLKSKEKESNYKYLYYALKAAHIPNTGYNRHF 114 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I M P L EQ I + + + T I + L + + + Sbjct: 115 K-------WLKQINMNYPDLNEQKNIVDILDSLTRII-------KVRQKELAFFDKLIKA 160 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 V +P K + + D + I + N+ Sbjct: 161 RFVEMFGDPISNKKSWKKRLLNDLVDKIGS------GATPKGGKESYQDHGISFIRSMNV 214 Query: 265 IQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-A 319 + + IV ++ + + ++ + + Sbjct: 215 HDGYFNYKDLAYINSTQAKQLSNVIVQSQDVFINITGASVARSCIVPDDILPARVNQHVS 274 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + K ++ ++ L + ++ + G RQ++ + ++ L +++PPI Q Sbjct: 275 IIRCKSDVLNPIFINNLFLNDSFKRILLSIGLSGGATRQAITKKQLEMLKIILPPISLQN 334 Query: 377 DITNVINVET-ARIDVLVEKIEQSIVLL 403 + N ++ ++ + +V + + + Sbjct: 335 EYANFVHQVDKSKFENIVYLNKTLLNKI 362 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 49/163 (30%), Gaps = 21/163 (12%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + +S Y G + + D + + +K Sbjct: 29 GKNIIIDQGKDSIAGYTDRQKGIFEEVPVIVFGDHTRIVKYIDKPFFLGADGVKVLKSKE 88 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 +S Y K + +G + K+ +K++ + P + EQ +I ++++ T Sbjct: 89 KESNYKYLY----YALKAAHIPNTGYNRHFKW--LKQINMNYPDLNEQKNIVDILDSLTR 142 Query: 388 RI----------DVLVEKIEQS-----IVLLKERRSSFIAAAV 415 I D L++ I K + + V Sbjct: 143 IIKVRQKELAFFDKLIKARFVEMFGDPISNKKSWKKRLLNDLV 185 >gi|301299372|ref|ZP_07205653.1| type I restriction modification DNA specificity domain protein [Lactobacillus salivarius ACS-116-V-Col5a] gi|300853026|gb|EFK80629.1| type I restriction modification DNA specificity domain protein [Lactobacillus salivarius ACS-116-V-Col5a] Length = 186 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 53/163 (32%), Gaps = 11/163 (6%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES-----YETYQIVDPGEIVFRF 295 V + + K I L+ N+ + VD +I+ Sbjct: 12 VRDGTHDSPKYINEGYPLLTSKNVGDGYINYDDVKYVSENDYVQINKRSKVDVNDILMGM 71 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I + +R + + I A + + L S ++ M G ++ Sbjct: 72 IGTIGNLALIR--EEPDFAIKNVALIKHTSNFDYQFLFQELQTSAISKELLSGMDGGTQK 129 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + ++ L V++P EQ I + + R D L+ ++ Sbjct: 130 FVSLKKIRNLSVMLPSENEQKKIGSYLM----RFDSLIALHQR 168 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 59/185 (31%), Gaps = 6/185 (3%) Query: 25 WKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSD--TSTVS 79 W+ + + G + + ++V G Y S + S Sbjct: 1 WEQRRLGEVADVRDGTHDSPKYINEGYPLLTSKNVGDGYINYDDVKYVSENDYVQINKRS 60 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 IL G +G A+I + L+ + + L L + +++ + Sbjct: 61 KVDVNDILMGMIGTIGNLALIREEPDFAIKNVALIKHTSNFDYQFLFQELQTSAISKELL 120 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + +G T K I N+ + +P EQ I ++ I + + +L K Sbjct: 121 SGMDGGTQKFVSLKKIRNLSVMLPSENEQKKIGSYLMRFDSLIALHQRKLEKLKQLKKFL 180 Query: 199 KQALV 203 Q + Sbjct: 181 LQNMF 185 >gi|154173663|ref|YP_001408732.1| type I restriction-modification system S subunit [Campylobacter curvus 525.92] gi|153792995|gb|EAT99440.2| type I restriction-modification system S subunit [Campylobacter curvus 525.92] Length = 323 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 20/237 (8%), Positives = 65/237 (27%), Gaps = 14/237 (5%) Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 + + E ++ + + + + + + + + + + Sbjct: 85 LVEQNLEDESVEILLQKIGQEKQRLVKDKKLKADKFPQSTIFIGEDNSPYEKIGKETRCI 144 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE-----------SNILSLSYGNIIQKLE 269 E +P W + + + ++ + ++ Sbjct: 145 EDEIPFEIPSSWAWVRLGEICQIYTGDSINQTQKLTKYTNLEDGRCYIATKDVDFDGSID 204 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 N P + ++I ++ + + + P I+ Sbjct: 205 YENGVKIPFNESRFKIAPKNSVLLCVEGGS---AGKKIGYLDCDVCFGNKLCCFNPLLIE 261 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ++ + ++S F SG+ + +K + + +PP+ EQ I I + Sbjct: 262 PKFIYYYLQSQIFIYSFMQKMSGIISGISLNSIKTIVIAIPPLPEQKRIVEKIELLL 318 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 38/174 (21%), Positives = 60/174 (34%), Gaps = 13/174 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDII----------YIGLEDVESGTGKYLPKDGN 69 IP W V + ++ TG + + + YI +DV+ G ++G Sbjct: 151 EIPSSWAWVRLGEICQIYTGDSINQTQKLTKYTNLEDGRCYIATKDVDFD-GSIDYENGV 209 Query: 70 SRQSDTSTVSIFAKGQILYG-KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + S I K +L + G +K D D + P + P+ + +L Sbjct: 210 KIPFNESRFKIAPKNSVLLCVEGGSAGKKIGYLDCDVCFGNKLCCFNPLLIEPKFIYYYL 269 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 S G +S I I + IPPL EQ I EKI + Sbjct: 270 QSQIFIYSFMQKMSG-IISGISLNSIKTIVIAIPPLPEQKRIVEKIELLLPLLK 322 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 31/83 (37%), Gaps = 6/83 (7%) Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + G + + +P+ +PP+ EQ I + + I+ E E+ + Sbjct: 3 KWIEQNKVGGGTHTFKINLGSMYSIPLPLPPLSEQKRIVDKLEEILQLIEKYKEDKEK-L 61 Query: 401 VLLK-----ERRSSFIAAAVTGQ 418 L + + S + AV G+ Sbjct: 62 DELNLSFPSKLKKSILDYAVKGK 84 >gi|262039558|ref|ZP_06012857.1| type-1 restriction enzyme EcoR124II specificity protein [Leptotrichia goodfellowii F0264] gi|261746436|gb|EEY33976.1| type-1 restriction enzyme EcoR124II specificity protein [Leptotrichia goodfellowii F0264] Length = 392 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/388 (9%), Positives = 103/388 (26%), Gaps = 30/388 (7%) Query: 26 KVVPIKRFTK-------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + + + + + ++ + Sbjct: 14 EWKKLGEVIDYEQPTKYIVNSTQYDDKFKTPVLTAG----------QTFILGYTNEIEGI 63 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +K + + + S+ +L+PK+ L + + I Sbjct: 64 YKASKEDPVIIFDDFTASNHWVDFEFKVKSSAMKILKPKNQFVNLRYCYH----YIKTIN 119 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + +PI L Q I + + T + L +E + Sbjct: 120 FDVTEHKRIWIS--KYSQLEVPILSLEIQEKIVKILDKFTNYVTELQSELQSRTKQYNYY 177 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + L+S + LN + D + + + + + K + + Sbjct: 178 RDKLLSE---QYLNKISEKIDKFEDKEYKLRVTTLGEIGEIKMCKRILKEQTSTKGTVPF 234 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 G +K ++ E Y+ V + + + Sbjct: 235 YKIGTFGKKADSFISREIFEEYKKKYSYPKKGEVLISASGTIGRTVIFDGEDCYFQDSNI 294 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 +++ + + YL + + + G + + ++ + + +PPI+ Q + Sbjct: 295 VWLSHNESKVLNKYLYYYYQIVNW----NPSSGGTIKRMYNYNLVNMKIFLPPIEIQDKV 350 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKER 406 V++ + + Q I +++ Sbjct: 351 VKVLDKFQELLKDTKGLLPQEIEQRQKQ 378 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 16/184 (8%), Positives = 54/184 (29%), Gaps = 10/184 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + E K ++ + + ++ +T +G E Y+ Sbjct: 11 EKVEWKKLGEVIDYEQPTKYIVNSTQYDDKFKTPVLTAGQTFILGYTNEIEGIYKASKED 70 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ ++ +V + + K ++ Y +++ + Sbjct: 71 PVIIFDDFTASNHWVDFEFKVKSSAM---KILKPKNQFVNLRYCYHYIKTINFD------ 121 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + + +L V + ++ Q I +++ T + L +++ R Sbjct: 122 -VTEHKRIWISKYSQLEVPILSLEIQEKIVKILDKFTNYVTELQSELQSRTKQYNYYRDK 180 Query: 410 FIAA 413 ++ Sbjct: 181 LLSE 184 >gi|294782724|ref|ZP_06748050.1| type I restriction modification DNA specificity family protein [Fusobacterium sp. 1_1_41FAA] gi|294481365|gb|EFG29140.1| type I restriction modification DNA specificity family protein [Fusobacterium sp. 1_1_41FAA] Length = 387 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 51/401 (12%), Positives = 124/401 (30%), Gaps = 35/401 (8%) Query: 30 IKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 +K K+ G+ ++ K I G + + G++L D + IL Sbjct: 5 LKELIKIKNGKDYKTCKLGSIPVYGTGGIINYVGEFLYNDES----------------IL 48 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 + G + T + K+++ + L + + + G+T+ Sbjct: 49 LPRKGSLSNIRYVNQPFWTVDTMYWTCVNKELVLPKYLYFYLKLL---DLSSRDSGSTLP 105 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + IP + +Q I + + +I + + Sbjct: 106 SMTFDAYYELEVEIPRIKKQKKILDLLNPIEEKIMINNKINDNLFSQISIIYNYWFTQYE 165 Query: 208 TKGLNPDVKMKDSGIEWVG-----LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 N ++G + +P +W V+ + K + + + Sbjct: 166 FPNTNGKSYKSNNGELYYNNIVKKDIPKNWVVETLASNSLSEIIKPGVDLFEEKIYYTTA 225 Query: 263 NIIQKLETRNMGLKPESYETYQIVDP--GEIVFRFIDLQNDKRSLRSA--QVMERGIITS 318 +I+ K T + + E + P + F + L ++E I+++ Sbjct: 226 DIVNKNITNGSIVSYNTKEDRANMQPIPYSVWFAKMKNTIKHLFLAPNMKFIIENSILST 285 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFD 377 +K I Y++ + + G ++++ +D+ + ++VP Sbjct: 286 GLCGLKCKEIAFEYISSYILHPYFENHKDVLSHGATQEAVNNDDLNYIYIIVPE----EK 341 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I + T I + + L R + + GQ Sbjct: 342 ILRQYHNLTKSIFKKIAENMCENKELITIRDFLLPLLMNGQ 382 Score = 36.3 bits (82), Expect = 9.7, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 64/193 (33%), Gaps = 10/193 (5%) Query: 20 AIPKHWKVVPI--KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 IPK+W V + +++ + ++ IY D+ + + + D + Sbjct: 190 DIPKNWVVETLASNSLSEIIKPGV-DLFEEKIYYTTADIVNKNITNGSIVSYNTKEDRAN 248 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGIC------STQFLVLQPKDVLPELLQGWLLSI 131 + + + K+ ++ +A ST L+ K++ E + ++L Sbjct: 249 MQPI-PYSVWFAKMKNTIKHLFLAPNMKFIIENSILSTGLCGLKCKEIAFEYISSYILHP 307 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + GAT + + I + +P + +I + E Sbjct: 308 YFENHKDVLSHGATQEAVNNDDLNYIYIIVPEEKILRQYHNLTKSIFKKIAENMCENKEL 367 Query: 192 IELLKEKKQALVS 204 I + L++ Sbjct: 368 ITIRDFLLPLLMN 380 >gi|10717100|gb|AAG22014.1|AF288037_3 putative HsdS [Streptococcus thermophilus] Length = 402 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 41/410 (10%), Positives = 108/410 (26%), Gaps = 28/410 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + + K++ + + + + Y+ + + + K Sbjct: 4 IRLGEIGKISMCKRILKSQTNEFRNPFYKISTFGGTPTVYIDEKIYREYKEKYSYPK-KK 62 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 L G + I D +V + L + G Sbjct: 63 VIFLISAAGTIGKTVIFDGEDSYFQDSNIVWIEN--DESKVTNQFLYYFLQTNPFITTNG 120 Query: 144 ATMSHADWKGIGNI-PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 +T+ + + +P + +Q I + + +I + K Sbjct: 121 STIKRLYNDNLRDTKIPNVPSIQQQNQITDILGTLDKKIQINNQINQELEAMAKTLYDYW 180 Query: 203 VSYIVTKGLNPDVKMKDSG------IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 N K SG E +P+ W + +L+ N Sbjct: 181 FVQFDFPDQN-GKPYKSSGGKMVYNPELKREIPEGWGAEKLSSLLKIGKETTNPKKFPNE 239 Query: 257 LSLSYGNIIQKLETRNMGLKPESYE-TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 Y + ES + V+ +++ ++ ++ + E I Sbjct: 240 EFKYYSIPEFDTTGTYSLERGESIKSNKFKVEKNDLLVSKLNPWFNRV---IYNLEENAI 296 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRS-YDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPP 371 ++ ++ K + + + + + +G + + + + + Sbjct: 297 ASTEFIVWKTFNRFEKNFLYQVATGKEFIEYCTRFATGTSNSHKRVSPDIMVGFQIPFEK 356 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 Q +I+ I V + + L + R + + GQ+ + Sbjct: 357 THIQ-KFGEIIDS----IRTQVLQNNEQNQELTQLRDWILPMLMNGQVKV 401 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 63/195 (32%), Gaps = 12/195 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP+ W + K+ T+ ++ Y + + ++ TG Y + G S S Sbjct: 210 EIPEGWGAEKLSSLLKIGKETTNPKKFPNEEFKYYSIPEFDT-TGTYSLERGESI---KS 265 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQP-KDVLPELLQGWLLSIDVT 134 K +L KL P+ + I + + I ST+F+V + L + Sbjct: 266 NKFKVEKNDLLVSKLNPWFNRVIYNLEENAIASTEFIVWKTFNRFEKNFLYQVATGKEFI 325 Query: 135 QRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + G + + +P Q E I + ++ + Sbjct: 326 EYCTRFATGTSNSHKRVSPDIMVGFQIPFEKTHIQ-KFGEIIDSIRTQVLQNNEQNQELT 384 Query: 193 ELLKEKKQALVSYIV 207 +L L++ V Sbjct: 385 QLRDWILPMLMNGQV 399 >gi|300113976|ref|YP_003760551.1| restriction modification system DNA specificity domain-containing protein [Nitrosococcus watsonii C-113] gi|299539913|gb|ADJ28230.1| restriction modification system DNA specificity domain protein [Nitrosococcus watsonii C-113] Length = 497 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 55/473 (11%), Positives = 134/473 (28%), Gaps = 83/473 (17%) Query: 26 KVVPIKRFTK----LNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + + + TG + I +E + ++L S Sbjct: 18 QEKKLSELCVGKSGIQTGPFGSQLHKYDYVEQGTPIITVEHLGDNRIEHLNTPYVSDADR 77 Query: 75 TS-TVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQ--GWLL 129 + +G I++ ++G R+A++ + + S + L ++ ++ L + + Sbjct: 78 HRLSKYQIKEGDIVFSRVGSVDRRALVRKQEDGWLFSGRCLRVRVENELIDPAYLSYFFG 137 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITER 188 I +I GATM + K + ++P+ L EQ I + ++ +I Sbjct: 138 LETFKSYIRSIAVGATMPSINTKILSDLPIYYCSDLEEQKEIAKLLLTLDDKIQLNHQIN 197 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW------------------------ 224 ++ + ++ +K G E Sbjct: 198 QTLEQMAQAIFKSWFVD-FEPVKAKIAALKAGGSEEDALLAAMQAISGKSSEQLTRLQAE 256 Query: 225 -----------------------VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 +G +P+ W V L ++ T Sbjct: 257 QPEQYAELRATAEPFPSAMQESELGEIPEGWGVGALQDLCLKVESGGTPKRNIPEYWGGE 316 Query: 262 GNIIQKLETRN---------MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + E R+ + + ++ V + L + + Sbjct: 317 IKWLASGEVRDVIAFGTKEKITKSGLENSSAKLWPKYSTVVAMYGATAGQVCL----LAD 372 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 A + P ++ ++ + + +Q+L V R L+PP Sbjct: 373 TMTTNQACCGLIPKE-NNKAFLFITARNSVSSLADKASGSAQQNLNKGLVSRHASLLPPE 431 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + ++ I ++ + + L E R S + ++G++ + Sbjct: 432 NV---LLAYESITFPLIHAWIQNTHECVQ-LTELRDSLLPKLLSGELSISDAE 480 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 66/201 (32%), Gaps = 13/201 (6%) Query: 18 IGAIPKHWKVVPIKRFT-KLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNS 70 +G IP+ W V ++ K+ +G T + G +I ++ +V + Sbjct: 280 LGEIPEGWGVGALQDLCLKVESGGTPKRNIPEYWGGEIKWLASGEVRDVIAFGTKEKITK 339 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + S+ ++ K + G + + + L PK+ ++ + Sbjct: 340 SGLENSSAKLWPKYSTVVAMYGATAGQVCLLADTMTTNQACCGLIPKE--NNKAFLFITA 397 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + + G+ + + + +PP + I I Sbjct: 398 RNSVSSLADKASGSAQQNLNKGLVSRHASLLPPENVLLAYESI---TFPLIHAWIQNTHE 454 Query: 191 FIELLKEKKQALVSYIVTKGL 211 ++L E + +L+ +++ L Sbjct: 455 CVQL-TELRDSLLPKLLSGEL 474 >gi|225854393|ref|YP_002735905.1| type I restriction-modification enzyme 1, S subunit [Streptococcus pneumoniae JJA] gi|225724268|gb|ACO20121.1| type I restriction-modification enzyme 1, S subunit [Streptococcus pneumoniae JJA] Length = 338 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/364 (9%), Positives = 97/364 (26%), Gaps = 38/364 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + L + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKGLITKRKLQLDELNLL-------VK 171 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S +P K ++ G + F + + I Sbjct: 172 SRFNEMFGDPLNNNKKFAVKT-GQQCFKFSSGKFLDKHDRVFEGYPAYGGNGIAW----- 225 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 ++D I+ + + I+ + + Sbjct: 226 ----------------KSRKYLIDNPTIIIGRVGA----YCGNVRTTHGKVWISDNAIYI 265 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 K L +L+ + + + + ++ ++PP+ Q + + + Sbjct: 266 KEFKNSDFNLVFLLELMKVIDFSKFADFSGQPKITQKPLENQKYILPPLALQNEFADFVA 325 Query: 384 VETA 387 + Sbjct: 326 LVDK 329 >gi|315146003|gb|EFT90019.1| conserved hypothetical protein [Enterococcus faecalis TX2141] Length = 74 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 36/78 (46%), Gaps = 7/78 (8%) Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++ SY L K G + L + + ++P+++P EQF I ++D + Sbjct: 1 MLLSYSLKKYI---TGGAQPQLTRDVLLKVPIIIPSYNEQFKIGTF----FKQLDDTIAL 53 Query: 396 IEQSIVLLKERRSSFIAA 413 ++ + LLKE + F+ Sbjct: 54 QQRKLDLLKETKKGFLQK 71 >gi|167892259|ref|ZP_02479661.1| restriction modification system DNA specificity domain [Burkholderia pseudomallei 7894] gi|167917016|ref|ZP_02504107.1| restriction modification system DNA specificity domain [Burkholderia pseudomallei BCC215] Length = 576 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 68/198 (34%), Gaps = 14/198 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTGKYL---PKDG 68 +P WK V + + G T S + ++ D+ Y+ +D Sbjct: 84 LPSSWKWVRLADVGAIVGGGTPPSEDVDNFTAAGGGVAWVTPADLGKHGSLYVSRGSRDL 143 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + S+ ++ KG +L+ P AI + + F + P + + Sbjct: 144 TEKGLKASSATVMPKGAVLFTSRAPIGYTAIALNE-ISTNQGFKSVVP-YISDCARYVAI 201 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 T IE G T K + +P P+PPLAEQ+ I K+ D L Sbjct: 202 YLQAFTPWIEGKASGTTFREVSGKTVSGLPFPLPPLAEQLRIVAKVDELLAMCDQLEAAN 261 Query: 189 IRFIELLKEKKQALVSYI 206 + + +A + + Sbjct: 262 AEREKSRDQLVRASLQQL 279 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 64/175 (36%), Gaps = 12/175 (6%) Query: 229 PDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYET-- 282 P W F ++T+ + + E+ + L+ GN+ L+ N P+ Y Sbjct: 373 PSGWAWSRLASLFKVITDGDHQPPPRAETGVAFLTIGNVTTGQLDFSNCRFVPQEYFDAI 432 Query: 283 --YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRS 339 ++ G+ ++ + + +L + +KP ID YL L+ S Sbjct: 433 APHRRPTKGDFLYTVVGATYGRPALV--DTDRPFCVQRHIGILKPVSEIDLGYLHLLLSS 490 Query: 340 YDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + ++ + ++ ++ +PP+ EQ I + A D L Sbjct: 491 PFVYEQATRSLTGTAQPTIPLRPLRNFLAPLPPLAEQHRIVAKVGALMALCDQLE 545 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 51/152 (33%), Gaps = 6/152 (3%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 +G++ +R++ K + ++ G ++F +A + Sbjct: 130 KHGSLYVSRGSRDLTEKGLKASSATVMPKGAVLFTSRAPIGY-----TAIALNEISTNQG 184 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +V P+ D + + + + + V LP +PP+ EQ I Sbjct: 185 FKSVVPYISDCARYVAIYLQAFTPWIEGKASGTTFREVSGKTVSGLPFPLPPLAEQLRIV 244 Query: 380 NVINVETARIDVLV-EKIEQSIVLLKERRSSF 410 ++ A D L E+ + R+S Sbjct: 245 AKVDELLAMCDQLEAANAEREKSRDQLVRASL 276 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 56/172 (32%), Gaps = 9/172 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQ--SD 74 +P W + K+ T + + ++ + +V +G + ++ Sbjct: 372 LPSGWAWSRLASLFKVITDGDHQPPPRAETGVAFLTIGNVTTGQLDFSNCRFVPQEYFDA 431 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + KG LY +G + + D +L+P + LLS Sbjct: 432 IAPHRRPTKGDFLYTVVGATYGRPALVDTDRPFCVQRHIGILKPVSEIDLGYLHLLLSSP 491 Query: 133 V-TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ G + + N P+PPLAEQ I K+ A D Sbjct: 492 FVYEQATRSLTGTAQPTIPLRPLRNFLAPLPPLAEQHRIVAKVGALMALCDQ 543 >gi|325661851|ref|ZP_08150472.1| hypothetical protein HMPREF0490_01208 [Lachnospiraceae bacterium 4_1_37FAA] gi|325471829|gb|EGC75046.1| hypothetical protein HMPREF0490_01208 [Lachnospiraceae bacterium 4_1_37FAA] Length = 325 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 46/356 (12%), Positives = 116/356 (32%), Gaps = 45/356 (12%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + ++ GR + VE+ G+Y P G+ + I ++ Sbjct: 2 RFEDVLEIKNGRNQK-----------AVENPGGQY-PIYGSGGIMGYANDYICDAQTVII 49 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G+ G + + T F + +DVL + + + + T+ Sbjct: 50 GRKGNINSPIFVEEAFWNVDTAFGLSANRDVLLPRYLYYFCK---KFDFKRLNKTVTIPS 106 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + I + +P L +Q + +++ V+I+ +IT R + +E L E +A + Sbjct: 107 LTKSDLLKIEIDLPDLEKQHDVVDQL----VKIERIITLRKQELEFLDELIKA---RFIE 159 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 +P + K + + + + + + + Sbjct: 160 MFGDPIINSKHLETKEL----KDVLMLKAGDFTAASEISDDMSEINQYPCYGGNGVRGYV 215 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 N + + G + F +N + +L ++E + Sbjct: 216 SKYNQDGEYSIIGRQGALS-GNVQFASGKFKNTEHALLVTPIVE---------------M 259 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 ++ +L L+ + DL + + L ++++ + ++ PI +Q + + Sbjct: 260 NNIWLNQLLINLDLKRY---QTGAAQPGLSVKNLQEIEIIYVPIDKQNQFASFVEQ 312 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 51/130 (39%), Gaps = 10/130 (7%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y I D ++ N + + + T+ ++ + YL + + Sbjct: 36 YANDYICDAQTVIIGRKGNINSPIFV---EEAFWNVDTAFGLSANRDVLLPRYLYYFCKK 92 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +D ++ + SL D+ ++ + +P +++Q D+ + + +I+ ++ +Q Sbjct: 93 FDFKRLNK---TVTIPSLTKSDLLKIEIDLPDLEKQHDVVDQLV----KIERIITLRKQE 145 Query: 400 IVLLKERRSS 409 + L E + Sbjct: 146 LEFLDELIKA 155 >gi|302347048|ref|YP_003815346.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] gi|302150486|gb|ADK96747.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] Length = 407 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 15/151 (9%), Positives = 48/151 (31%), Gaps = 4/151 (2%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 Y + + +++ ++ + + + ++ Sbjct: 62 YTKYKSETIKEVISKTNIDNTKLVKSKANDVIIPCSGETAEEIATARCVLKDDVLLGGDL 121 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 ++ HG D +++++ + + L E +K + + P + EQ I N Sbjct: 122 NIIRLHGYDGSFMSYQLNGKRKYDIAKVAQGVSVVHLYGEHLKNIKTINPSLNEQKKIAN 181 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFI 411 ++ + +D + + I L+ + Sbjct: 182 LL----SLLDERISTQNKIIDKLESLIKGIM 208 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 46/363 (12%), Positives = 113/363 (31%), Gaps = 39/363 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTV 78 W+ + ++ G I ++ + + + + + D + + Sbjct: 25 EWQEERLSDIADISKGIGISKDQLSADGEPCILYGELYTKYKSETIKEVISKTNIDNTKL 84 Query: 79 SIFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++ G + + D + +++ + L+ Sbjct: 85 VKSKANDVIIPCSGETAEEIATARCVLKDDVLLGGDLNIIRLHG-YDGSFMSYQLNGKRK 143 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + +G ++ H + + NI P L EQ I + +D I+ + + I+ Sbjct: 144 YDIAKVAQGVSVVHLYGEHLKNIKTINPSLNEQKKIANLL----SLLDERISTQNKIIDK 199 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 L+ + ++ + +G N +W ++ E + +NT L + Sbjct: 200 LESLIKGIMVELQKQGQNKG----------------NWRNVLLSKVLKERDERNTNLYQV 243 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM-ER 313 +S+S G +I +++ Y +V G++V+ + E Sbjct: 244 FSVSVSQG-VINQVDYLGRSYAARDTSKYNVVHYGDLVYTKSPTGAYPYGIVKQNFNQEN 302 Query: 314 GIITSAYMAVKPHGID-STYLAWLMRS-----YDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 ++ Y P+ + YL RS L + ++ + V Sbjct: 303 VAVSPLYGVYIPNSLSVGRYLHEYFRSEINTHNYLHPLIQKGAKNTI-NITNQRFLENSV 361 Query: 368 LVP 370 +P Sbjct: 362 PIP 364 >gi|210134990|ref|YP_002301429.1| type I R-M system S protein [Helicobacter pylori P12] gi|210132958|gb|ACJ07949.1| type I R-M system S protein [Helicobacter pylori P12] Length = 432 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 50/425 (11%), Positives = 114/425 (26%), Gaps = 52/425 (12%) Query: 22 PKHWKVVPIKRFTKL-------NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + + T + + + ++ Y + N Q+ Sbjct: 13 PKGVGFRKLGEILEYDQPNQYCVTSKEFDKSYPTPVLTAG--KTFILGYTNEKDNIYQAS 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ I + + S+ +L K+ + + Sbjct: 71 KSSPVIIF-DDF-------TTATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFCM---Q 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + +PIPPL Q I + + A T L TE ++ Sbjct: 120 TIPYNIGGEHARHWISRYSQ--LEVPIPPLEIQQEIVKILDAFTELNTELNTELNTELKA 177 Query: 195 LKEKKQALVSYIVTK------------GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 K++ + + ++ L K L P E K + Sbjct: 178 RKKQYEYYQNMLLDFNDINSNHKDAKEKLTQKTYPKRLKTLLQTLAPKGVEFKTLEEVFE 237 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMG---------LKPESYETYQIVDPGEIVF 293 N + + R G + P++ + ++ I+ Sbjct: 238 IRNGYTPSKNNPEFWKNGTIPWFRMEDLRENGRILKDSIQHITPKALKGKKLFPKNSIII 297 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGS 351 + L + + +++ K + + + + L + + Sbjct: 298 STTATIGEHALLIVDSLANQQFT---FLSKKANCDLALDMKFFFYQCFLLGEWCKNNINV 354 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RR 407 S+ K+ +PP++ Q +I +++ + L+ I I K+ R Sbjct: 355 SGFASVDMSAFKKYKFPIPPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQYEYYR 414 Query: 408 SSFIA 412 + Sbjct: 415 EKLLT 419 >gi|307312950|ref|ZP_07592578.1| restriction modification system DNA specificity domain protein [Escherichia coli W] gi|306907118|gb|EFN37625.1| restriction modification system DNA specificity domain protein [Escherichia coli W] gi|315063581|gb|ADT77908.1| hypothetical protein ECW_m4636 [Escherichia coli W] Length = 355 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 58/346 (16%), Positives = 107/346 (30%), Gaps = 18/346 (5%) Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 S ++ K S G +L + R I+ D +G +LVL P Sbjct: 8 NSKYIRHTAKKIKFEGVKKSRK--VYPGDLLLTNSMSFGRPYIL-DVEGCIHDGWLVLSP 64 Query: 117 KDV--LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 K+ + +L S I GA + + + + N+ +P PP AEQV I + Sbjct: 65 KNNQIHIDYFYHYLNSPTAKIIISNKAAGAVVKNLNSDIVRNLEIPFPPFAEQVRIASTL 124 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 + L + + +P K ++ + H Sbjct: 125 DKADGIRQKREQAIKLADDF-------LRATFLEMFGDPVQNPKGWNVKPLADQIIHANN 177 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 + N + L ++ G K R + E D + Sbjct: 178 GISRRRKEDTNEGDIVLRLQDVHY--SGITFDKELNRIKLVDKEKQIARVEYDDLLFIRV 235 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVF--YAMGS 351 + R+ +E + +K + S +L +L+ S K+ S Sbjct: 236 NGNPNYVGRTAVFKSYIEPVYHNDHLIRIKLDNEYQSDFLCYLINSPFSRKLIAQQIKTS 295 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + ++ + + +L PPI+ Q N I + I +K E Sbjct: 296 AGQHTISQDGILKLMFYRPPIELQEKFIN-IKNKIESIFYRKDKHE 340 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 54/140 (38%), Gaps = 8/140 (5%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + +K E + + V PG+++ L + G + ++ Sbjct: 8 NSKYIRHTAKKIKFEGVKKSRKVYPGDLLLTNSMSFGRPYILDVEGCIHDGWL---VLSP 64 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVI 382 K + I Y + S + +G ++L + V+ L + PP EQ I + + Sbjct: 65 KNNQIHIDYFYHYLNSPTAKIIISNKAAGAVVKNLNSDIVRNLEIPFPPFAEQVRIASTL 124 Query: 383 NVETARIDVLVEKIEQSIVL 402 + + D + +K EQ+I L Sbjct: 125 D----KADGIRQKREQAIKL 140 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 50/195 (25%), Gaps = 14/195 (7%) Query: 22 PKHWKVVPIKR-FTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTS 76 PK W V P+ N G + +D I + L+DV + + + D Sbjct: 160 PKGWNVKPLADQIIHANNGISRRRKEDTNEGDIVLRLQDVHYSGITFDKELNRIKLVDKE 219 Query: 77 TVS-IFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQGWLL 129 +L+ ++ + + ++ + +L+ Sbjct: 220 KQIARVEYDDLLFIRVNGNPNYVGRTAVFKSYIEPVYHNDHLIRIKLDNEYQSDFLCYLI 279 Query: 130 SIDV--TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + I A GI + PP+ Q Sbjct: 280 NSPFSRKLIAQQIKTSAGQHTISQDGILKLMFYRPPIELQEKFINIKNKIESIFYRKDKH 339 Query: 188 RIRFIELLKEKKQAL 202 F + + ++ Sbjct: 340 EDLFASISNKLIHSI 354 >gi|260905624|ref|ZP_05913946.1| type I restriction-modification system, S subunit, putative [Brevibacterium linens BL2] Length = 403 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 59/396 (14%), Positives = 130/396 (32%), Gaps = 42/396 (10%) Query: 44 SGKDIIYIGLEDVESG---TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII 100 S + I + ++ G GK P+ S + +G ++ G+ G R A+I Sbjct: 31 SENGVPLISVGEIGDGRLSIGKKTPRVSEETTERLSEY-LLWRGDVVIGRKGAVERSALI 89 Query: 101 ---ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 D + S V + + S V + + + G TM+ + + + Sbjct: 90 NEDQDGYFLGSDGMRVRFGDSINSTFMAYQFRSDAVRRWLISHASGTTMASMNQAILSKL 149 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK-GLNPDVK 216 P+ +PP Q I E + A +I Q+L+ GL+ Sbjct: 150 PILVPPNRTQQAIAEVLGALDDKIAANERLSSGA--------QSLLQEHFAMLGLDRF-- 199 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 + E+N + ++ + L ++ T + Sbjct: 200 -------------ADTGPFLTVNDLFEVNPRTSRKVTGQSPYLGMKDLPDTSMTVSSWST 246 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER---GIITSAYMAVKPHGIDSTYL 333 E+ + V+ G+++ I + +E GI ++ ++ V+ + + Sbjct: 247 REAKSGARFVN-GDVLLARITPCLENGKAGYVDFLENAEIGIGSTEFIVVRARDPLLSVV 305 Query: 334 AWLM-RSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + + +S G RQ L DV + E + + T+ ++ Sbjct: 306 PFFLTKSERFRDFAIRHMQGTSGRQRLAASDVAGYQLA---EVEDERLNRFGELSTSLLE 362 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + + +S L R + ++G+I ++ + Sbjct: 363 RVRTAVAES-QGLAHTRDELLPLLMSGKISVKDAEK 397 >gi|293498315|ref|ZP_06666169.1| hypothetical protein SCAG_00888 [Staphylococcus aureus subsp. aureus 58-424] gi|291097246|gb|EFE27504.1| hypothetical protein SCAG_00888 [Staphylococcus aureus subsp. aureus 58-424] Length = 209 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 77/195 (39%), Gaps = 14/195 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 + HWE + E N ++ + II+ E + Sbjct: 22 DENSEDYPHWENSKIEKYLKERNERSD--KGQMLSVTINSGIIKFSELDRKDNSSKDKSN 79 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL-AWLMRSYD 341 Y++V +I + + + + GI++ AY + P S+ + +++ Sbjct: 80 YKVVRKNDIAYNSMRMWQGASG----RSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHR 135 Query: 342 LCKVFYAMGSGLR---QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + F GL +LK++ +K + + +P ++EQ I + ++D+L+ K + Sbjct: 136 MIHKFKINSQGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDF----FKKMDILISKQKI 191 Query: 399 SIVLLKERRSSFIAA 413 I +L++ + SF+ Sbjct: 192 KIEILEKEKQSFLQK 206 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 63/183 (34%), Gaps = 7/183 (3%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 HW+ I+++ K R+ + + I ++ ++ D S + K Sbjct: 30 HWENSKIEKYLKERNERSDKGQMLSVTINSGIIKFSELD----RKDNSSKDKSNYKVVRK 85 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I Y + + + ++++GI S + VL P L G+ I Sbjct: 86 NDIAYNSMRMWQGASGRSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQ 145 Query: 144 ---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + +K + NI + IP L EQ I + + I + + + Q Sbjct: 146 GLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKIKIEILEKEKQSFLQ 205 Query: 201 ALV 203 + Sbjct: 206 KMF 208 >gi|329730679|gb|EGG67060.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus 21193] Length = 200 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 77/195 (39%), Gaps = 14/195 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 + HWE + E N ++ + II+ E + Sbjct: 13 DENSEDYPHWESSKIEKYLKERNERSD--KGQMLSVTINSGIIKFSELDRKDNSSKDKSN 70 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL-AWLMRSYD 341 Y++V +I + + + + GI++ AY + P S+ + +++ Sbjct: 71 YKVVRKNDIAYNSMRMWQGASGKSNY----NGIVSPAYTVLYPTQNTSSLFIGYKFKTHR 126 Query: 342 LCKVFYAMGSGLR---QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + F GL +LK++ +K + + +P ++EQ I + ++D+L+ K + Sbjct: 127 MIHKFKINSQGLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDF----FKKMDILISKQKM 182 Query: 399 SIVLLKERRSSFIAA 413 I +L++ + SF+ Sbjct: 183 KIEILEKEKQSFLQK 197 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 63/183 (34%), Gaps = 7/183 (3%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 HW+ I+++ K R+ + + I ++ ++ D S + K Sbjct: 21 HWESSKIEKYLKERNERSDKGQMLSVTINSGIIKFSELD----RKDNSSKDKSNYKVVRK 76 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I Y + + + ++++GI S + VL P L G+ I Sbjct: 77 NDIAYNSMRMWQGASGKSNYNGIVSPAYTVLYPTQNTSSLFIGYKFKTHRMIHKFKINSQ 136 Query: 144 ---ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + + +K + NI + IP L EQ I + + I + + + Q Sbjct: 137 GLTSDTWNLKYKQLKNINIDIPVLEEQEKIGDFFKKMDILISKQKMKIEILEKEKQSFLQ 196 Query: 201 ALV 203 + Sbjct: 197 KMF 199 >gi|217032124|ref|ZP_03437624.1| hypothetical protein HPB128_16g84 [Helicobacter pylori B128] gi|216946272|gb|EEC24880.1| hypothetical protein HPB128_16g84 [Helicobacter pylori B128] Length = 297 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 24/203 (11%), Positives = 58/203 (28%), Gaps = 15/203 (7%) Query: 229 PDHWEVKPFFA----LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE-TY 283 P +W+ + + K+ I G + Y+ Y Sbjct: 7 PSNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFLEYKTKY 66 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 G+I+ + + + + +L +Y Sbjct: 67 SFPKKGDILISASGTIGRAVI----YDGKPAYFQDSNIVWIDNDETLVKNDFLFYTYSHV 122 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 K L ++ + + +PP+ EQ I N+++ + L I + Sbjct: 123 KW--NTEHTTILRLYNDNFRNTLIPLPPLNEQIAIANILSDVDRYLYNLDALILKK---- 176 Query: 404 KERRSSFIAAAVTGQIDLRGESQ 426 + + + ++ + L+G +Q Sbjct: 177 EGVKKALSFELLSQRKRLKGFNQ 199 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 56/190 (29%), Gaps = 10/190 (5%) Query: 21 IPKHWKVVPIKRF-----TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P +W+ V + K + +I + + + ++ K Sbjct: 6 LPSNWQRVRLGDIGKPCMCKRVMKHQTTRYGEIPFYKIGTFGNTADAFISKKLFL--EYK 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + S KG IL G R I +V E L Sbjct: 64 TKYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYTYS 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ E T+ N +P+PPL EQ+ I + + L ++ + Sbjct: 121 HVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQIAIANILSDVDRYLYNLDALILKKEGVK 180 Query: 196 KEKKQALVSY 205 K L+S Sbjct: 181 KALSFELLSQ 190 Score = 40.9 bits (94), Expect = 0.42, Method: Composition-based stats. Identities = 11/106 (10%), Positives = 28/106 (26%), Gaps = 11/106 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ V + ++ G + ++ V G G + +R + Sbjct: 201 WQKVRLGDIAEIKRGVRITKNELDVFGKYPVVSGGVGFLGYTNNFNR----------YEN 250 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 I + G + + P + + + +L Sbjct: 251 TITIAQYG-TAGYVNFQKNKFWANDVCFCIYPNKDIIKNIFLYLFF 295 >gi|329728696|gb|EGG65125.1| type I restriction modification DNA specificity domain protein [Staphylococcus aureus subsp. aureus 21193] Length = 189 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 45/150 (30%), Gaps = 6/150 (4%) Query: 246 RKNTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + I L NI + + + G+++ Sbjct: 25 GGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDEMKNSRTYYGDVLLNITGASIG 84 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLRQSLKFE 360 + ++ S + + + +L+ K+F A G R+ L F+ Sbjct: 85 RTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSRKGKRKIFLAQSGGSREGLNFK 144 Query: 361 DVKRLPVLVPPI-KEQFDITNVINVETARI 389 ++ L + P I +EQ I + +I Sbjct: 145 EIANLKIFTPTIFEEQQKIGQFFSKLDQQI 174 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 61/171 (35%), Gaps = 13/171 (7%) Query: 24 HWKVVPIKRFT-KLNTGRTSE------SGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDT 75 W+ + T K+ +G+T + + K I ++ +++ +G + D Sbjct: 4 EWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIRNGKLNLNDLVYISKDIDDE 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQ-FLVLQPKDVLPELLQGWLLSI 131 S G +L G + + I + + ++ K+ +LLS Sbjct: 64 MKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCIIRLKKEYYYNFFGQYLLSR 123 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA-EQVLIREKIIAETVRI 181 ++I G + ++K I N+ + P + EQ I + +I Sbjct: 124 KGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIGQFFSKLDQQI 174 >gi|261364422|ref|ZP_05977305.1| putative type I restriction modification DNA specificity domain protein [Neisseria mucosa ATCC 25996] gi|288567329|gb|EFC88889.1| putative type I restriction modification DNA specificity domain protein [Neisseria mucosa ATCC 25996] Length = 472 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 56/423 (13%), Positives = 117/423 (27%), Gaps = 57/423 (13%) Query: 28 VPIKRFT-KLNTGRTSES--------GKDIIYIGLED------VESGTGKYLPKDGNSRQ 72 +K F + +G T + I + +++ VE KY+ +D Sbjct: 49 KRLKDFALSMGSGATPSTTNPEFYSDKNGIPLLRVQNLTLNSTVELNNLKYITEDV---H 105 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAI---IADFDGICSTQFLVLQPKDVLPELLQ-GWL 128 + S +L G +F G + +V++ L +L Sbjct: 106 ENMLKRSQVTDQDLLVKITGVGRMAVAAVPPKEFSGNVNQHIVVVRTGSREKSLYLANYL 165 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPM----PIPPLAEQVLIREKIIAETVRIDTL 184 + + G T D+ + +IP+ L ++ + + Sbjct: 166 NLDVIEKLASRRVTGGTRPALDYPALRSIPIIEDIDFSILENAKKQANQLKQQAKTLLNS 225 Query: 185 ITER--IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I L E +L S + T V+M + G+ + + Sbjct: 226 INSYLLGELGITLPETDNSLNSRMFT------VQMSEVGVGRLDSFTYQPRFTKLAETLE 279 Query: 243 ELNR------------------KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 + +N L ++ + + + Sbjct: 280 QCRYAVASLAKVATDIKNGVEIRNYVEEGFRYLRVTDLSEHGLNHSSPKFVDVHGVPEKI 339 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDL 342 ++P ++ + + ++ I++S V+ I YL R Sbjct: 340 RLNPNCLLIARSGSLGLVNVV--TEDIKDAILSSHIFKVELDTTQIYPEYLEAFARCPIG 397 Query: 343 CKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + F G+ + +K V +P Q I I A+ L + EQ + Sbjct: 398 QEQFKQLNNGGVIPEINQSALKTFKVALPDKSTQQKIIAHIRAIKAQAATLQAEAEQLLS 457 Query: 402 LLK 404 K Sbjct: 458 QAK 460 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 48/150 (32%), Gaps = 24/150 (16%) Query: 269 ETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 E N+ E + V +++ + + + + + + Sbjct: 93 ELNNLKYITEDVHENMLKRSQVTDQDLLVKITGVGRMAVAAVPPKEFSGNVNQHIVVVRT 152 Query: 325 PHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 S YLA + + K+ + G R +L + ++ +P++ Sbjct: 153 GSREKSLYLANYLNLDVIEKLASRRVTGGTRPALDYPALRSIPII--------------- 197 Query: 384 VETARID-VLVEKIEQSIVLLKERRSSFIA 412 ID ++E ++ LK++ + + Sbjct: 198 ---EDIDFSILENAKKQANQLKQQAKTLLN 224 >gi|148927587|ref|ZP_01811058.1| hypothetical protein TM7_0305 [candidate division TM7 genomosp. GTL1] gi|147887063|gb|EDK72560.1| hypothetical protein TM7_0305 [candidate division TM7 genomosp. GTL1] Length = 298 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 28/87 (32%), Gaps = 3/87 (3%) Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ Y+ + + D R L +KR+ + P EQ I I + Sbjct: 18 NNKYVKYALNYVDYQSYV---TGTTRLKLNQSALKRIIIPFPDENEQKRIVAKIEELFSE 74 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAV 415 ID I + K S I + Sbjct: 75 IDNAESAITTASGYYKSYEQSIIDSLF 101 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 17/142 (11%), Positives = 39/142 (27%), Gaps = 9/142 (6%) Query: 26 KVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 ++V ++ G T + Y+ + +V+ G + ++ Sbjct: 109 EMVEFGDIAEIKGGITKGRKLRGMPIGETPYLRVANVQDGYLYLDEIKTINVTAEELRKY 168 Query: 80 IFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G IL+ + G R I +C Q + + + + + ++ T R Sbjct: 169 SLMNGDILFTEGGDKDKLGRGTIWHGEIELCIHQNHIFRARVDSGQFVPEYISYATKTTR 228 Query: 137 IEAICEGATMSHADWKGIGNIP 158 I Sbjct: 229 ARDYLSLHLALKVMNCCAKKIW 250 >gi|224457361|ref|ZP_03665834.1| type I restriction-modification system, subunit S [Francisella tularensis subsp. tularensis MA00-2987] gi|254370730|ref|ZP_04986735.1| predicted protein [Francisella tularensis subsp. tularensis FSC033] gi|151568973|gb|EDN34627.1| predicted protein [Francisella tularensis subsp. tularensis FSC033] gi|282159469|gb|ADA78860.1| type I restriction-modification system, subunit S [Francisella tularensis subsp. tularensis NE061598] Length = 225 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 9/61 (14%), Positives = 23/61 (37%) Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + + + + + + +PP+ EQ I ++ +D +E +Q+I Sbjct: 1 MNNLHGVGMKHITKGKFENIQIPLPPLAEQKRIVAKLDSLFENVDKAIELHQQNITNANT 60 Query: 406 R 406 Sbjct: 61 L 61 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 67/240 (27%), Gaps = 17/240 (7%) Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G M H NI +P+PPLAEQ I K+ + +D I + I Sbjct: 1 MNNLHGVGMKHITKGKFENIQIPLPPLAEQKRIVAKLDSLFENVDKAIELHQQNITNANT 60 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + K E+ + LV + N+K ++ Sbjct: 61 LMASTLDKTF----------KKLEGEYSKIALLDVMKISNKTLVPDDNQKYNY-----VV 105 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + +L E + G +++ + +K + I Sbjct: 106 LENIEGNTGRLIDFCETQGKEIKSSKVEFKKGMVLYGKLRPYLNKVWFSEFDDVATTEIL 165 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK--RLPVLVPPIKEQ 375 Y + + S L +V L +K + +PP+ Q Sbjct: 166 PFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNCSGSRMPRLTTAFLKSEEAYIPLPPLPIQ 225 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/125 (26%), Positives = 56/125 (44%), Gaps = 4/125 (3%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + K++ + + Y+ LE++E TG+ + + S+ F KG +LY Sbjct: 82 LLDVMKISNKTLVPDDNQKYNYVVLENIEGNTGRLIDFCETQGKEIKSSKVEFKKGMVLY 141 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEAICEGAT 145 GKL PYL K ++FD + +T+ L P D ++ + LS QR+ C G+ Sbjct: 142 GKLRPYLNKVWFSEFDDVATTEILPFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNCSGSR 201 Query: 146 MSHAD 150 M Sbjct: 202 MPRLT 206 >gi|114568716|ref|YP_755396.1| restriction modification system DNA specificity subunit [Maricaulis maris MCS10] gi|114339178|gb|ABI64458.1| restriction modification system DNA specificity domain [Maricaulis maris MCS10] Length = 383 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 61/405 (15%), Positives = 116/405 (28%), Gaps = 45/405 (11%) Query: 30 IKRFTKLNTGR------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + KL GR ++ + + YI +++V KD ++ Sbjct: 7 LGDIVKLRKGRKAQEVLSAAAAGALPYIQIDEVRGVAPTKYAKDPSAVD--------VGP 58 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD--VLPELLQGWLLSIDVTQRIEAIC 141 + G I ST + + L +A Sbjct: 59 DDLCIVWDGANAGTVGYGLSGAIGSTVARIRFSDHGQWDAAFVGRLLQGKFRQLNDQAQA 118 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GAT+ H D + + +P L EQ I + + +R + L + ++ Sbjct: 119 RGATIPHVDKSKLEQLAIPRIDLDEQRRIAAILDKADA----IRRKREEALALADDFLKS 174 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + L P+ S I+ + + + L+ S Sbjct: 175 TFLEMFGDPLAPEPHGSISTIDTECDLFAGNSLPRGEEFRGQDRG---CLLLKVSDLNSE 231 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 GN Q + ++ E + G IVF + + + ++ M Sbjct: 232 GNETQIVSSKLWVPPNEKLRASMVAPAGSIVFPKRG--GAISTNKKRVLSRPAVLDPNLM 289 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITN 380 V P S +L ++L + L +DV L ++VP P+ Sbjct: 290 GVAPKSGSSISFRYLRNWFELLDLVTISSGSTVPQLNKKDVGPLRIVVPTPVD------- 342 Query: 381 VINVETARIDVLVEKIEQSIVLLKE-------RRSSFIAAAVTGQ 418 R D + E+ + L+ +S A G+ Sbjct: 343 -----LERFDNIYERSAKLREKLRSAWDSSAHLFASLSQRAFRGE 382 >gi|207859651|ref|YP_002246302.1| type I restriction-modification system specificity subunit M [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|1679867|emb|CAA68058.1| Sty SBLI [Salmonella enterica] gi|206711454|emb|CAR35838.1| putative Type I restriction-modification system specificity subunit M [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] Length = 434 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 64/196 (32%), Gaps = 11/196 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY---LPKDG 68 +G +PK W +L G T ++ DI + + D S + Y K Sbjct: 223 LGWMPKGWITTSFNDLIELIGGGTPKTSVEEFWNGDIPWFSVVDAPSESDVYVLTTEKKI 282 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + S+ + KG + G + A++A + + + V+ ++ E + Sbjct: 283 TIEGLNNSSAKLLRKGTTIISARGTVGKCAMVAVPMAMNQSCYGVIGKNNISDE--YIYF 340 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + Q ++ + G+ + NI +P + +I + Sbjct: 341 QLKNAVQTLQQMGHGSVFNTITRDTFKNIKVPFCNEELTNSYSLLVKNYFSKILNNNYQN 400 Query: 189 IRFIELLKEKKQALVS 204 I L L+S Sbjct: 401 IALTNLRDTLLPKLIS 416 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 48/440 (10%), Positives = 123/440 (27%), Gaps = 73/440 (16%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K +P+ F L G K ++ G + + A G Sbjct: 5 KTIPLNEFITLQRGFDLPQDKRVM-----------GDIPVVASTGVVGYHNEEKVLAPG- 52 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 ++ G+ G I +T V K P + L SID G+ Sbjct: 53 VVIGRSGSIGGGQYITTNFWPLNTTLWVKDFKGHHPRFVYYLLRSIDF----SQFNVGSG 108 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA---- 201 + + + I + + + + I +I ++ + ++ Sbjct: 109 VPTLNRNHLSGILVADTSYSYEKEASDIIGILDDKIKLNKELNHTLEQISQTLFKSWFVD 168 Query: 202 ---LVSYIVTKGLNPDVKMKDSGIE-----------------------------WVGLVP 229 ++ + G NP + S E +G +P Sbjct: 169 FDPVIDNALDAG-NPIPEALQSRAELRQKIRNSADFKPLPADIRALFPAEFEETELGWMP 227 Query: 230 DHWEVKPFFALVTELNRKN-----TKLIESNILSLSYGNIIQKLE------TRNMGLKPE 278 W F L+ + + +I S + + + + + ++ Sbjct: 228 KGWITTSFNDLIELIGGGTPKTSVEEFWNGDIPWFSVVDAPSESDVYVLTTEKKITIEGL 287 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + +++ G + + + S Y + + I Y+ + ++ Sbjct: 288 NNSSAKLLRKGTTIISARGTVGKCAMVAVPM----AMNQSCYGVIGKNNISDEYIYFQLK 343 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + + + ++ + K + V ++TN ++ + Sbjct: 344 N-AVQTLQQMGHGSVFNTITRDTFKNIKVPFCN----EELTNSYSLLVKNYFSKILNNNY 398 Query: 399 SIVLLKERRSSFIAAAVTGQ 418 + L R + + ++G+ Sbjct: 399 QNIALTNLRDTLLPKLISGE 418 >gi|312126615|ref|YP_003991489.1| restriction modification system DNA specificity domain-containing protein [Caldicellulosiruptor hydrothermalis 108] gi|311776634|gb|ADQ06120.1| restriction modification system DNA specificity domain protein [Caldicellulosiruptor hydrothermalis 108] Length = 481 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 54/436 (12%), Positives = 125/436 (28%), Gaps = 57/436 (13%) Query: 25 WKVV-PIKRFTK-LNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 WK +++ + G+ D YI + +++ G++ +D + + Sbjct: 38 WKESFKLRQIVSRIRNGKDFSKKVYADYETDTCYIRVNNLK-PMGEFTGEDIIFLRDEEI 96 Query: 77 TVSI---FAKGQILYGKLGPYLRK----------AIIADFDGICSTQFLVLQPKDVLPEL 123 +G L + G I ++ E Sbjct: 97 EKFFNLFIDEGDFLITRSGTVGIAFKFIRHDLPEYIRDKNFMPAGYIIVIKVHNLFDDEY 156 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR-----EKIIAET 178 L+ +L S + EA+ G + + +G +P+ L + ++I Sbjct: 157 LKYFLYSSISRRYFEALACGKSQQNISQADLGKWLVPLQILKNIPVNEIKEKEQEISKLK 216 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +I + S + K + + S + + F Sbjct: 217 TQIKEPKIIVSEVFGKYFKLDLKQYSDLEKKHIFEENLFNLSRATQLRSSLKFHHPRSDF 276 Query: 239 ALVTELNRKN-----------------TKLIESNILSLSYGNIIQ-KLETRNMGLKPESY 280 L K E ++ + N+ ++ + + Sbjct: 277 VLGKLKEFKTVKLKQLLREPVRRGVQPEYKEEGEVMVVKTANLKNSYIDLSEVEYVSSEF 336 Query: 281 ETYQIVDPG----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLA 334 G +++ + + E ++ + V ++ YL Sbjct: 337 FQKNKKKAGIKYLDVLIASTG-TGSIGKVDIWESDEEALVDGHISILRVDQDKVNPRYLT 395 Query: 335 WLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET---ARI 389 + +RS A SG+ + + D+++ +L+P Q I I + +I Sbjct: 396 YYLRSLFGYSQIEANFSGMSNQIEIYPNDIEKFDILLPDKTIQEQIVKEIETKLNAQKKI 455 Query: 390 DVLVEKIEQSIVLLKE 405 +E+++Q I L E Sbjct: 456 AEQIERLKQEIDNLIE 471 >gi|260436988|ref|ZP_05790804.1| type I restriction-modification system, S subunit [Butyrivibrio crossotus DSM 2876] gi|292810611|gb|EFF69816.1| type I restriction-modification system, S subunit [Butyrivibrio crossotus DSM 2876] Length = 257 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 63/201 (31%), Gaps = 4/201 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP +W + NTG+ + GK + YI +V + Sbjct: 14 EIPNNWVWCNLGLLFNHNTGKALNSANSEGKALTYITTSNVYWNRFELNDLKSMPFTDSE 73 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 KG +L + G R AI + I L + + + Sbjct: 74 IEKCTIKKGDLLVCEGGDIGRAAIWNFDNEIRIQNHLHRLRAYDYIQTAFYYYVLYAFKL 133 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + G + + NI +P+PP+ EQ I I + ID + + + + Sbjct: 134 SGKISGNGIGLQGLSSNALHNIIVPVPPIEEQKNIVMSIEKLMLSIDNIESHKNILAICI 193 Query: 196 KEKKQALVSYIVTKGLNPDVK 216 + K ++ + L P Sbjct: 194 ENTKAKILELAIRGKLVPQDP 214 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 65/208 (31%), Gaps = 11/208 (5%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRK------NTKLIESNILSLSYGNIIQKLETR 271 K E +P++W L K + + I + + +L Sbjct: 5 KLCDFESDYEIPNNWVWCNLGLLFNHNTGKALNSANSEGKALTYITTSNVYWNRFELNDL 64 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 +S + G+++ I + ++ + T Sbjct: 65 KSMPFTDSEIEKCTIKKGDLLVCEGGDIGRAAIW---NFDNEIRIQNHLHRLRAYDYIQT 121 Query: 332 YLAWL-MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + + ++ L G GL+ L + + V VPPI+EQ +I I ID Sbjct: 122 AFYYYVLYAFKLSGKISGNGIGLQ-GLSSNALHNIIVPVPPIEEQKNIVMSIEKLMLSID 180 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + ++ ++ + A+ G+ Sbjct: 181 NIESHKNILAICIENTKAKILELAIRGK 208 >gi|126661659|ref|ZP_01732673.1| type I restriction-modification system specificity subunit [Cyanothece sp. CCY0110] gi|126617057|gb|EAZ87912.1| type I restriction-modification system specificity subunit [Cyanothece sp. CCY0110] Length = 383 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 61/395 (15%), Positives = 131/395 (33%), Gaps = 29/395 (7%) Query: 30 IKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK-- 83 ++ +L I ++ G+ S ++ Sbjct: 7 LEDVCELIVDCEHKTAPTQETGYPSIRTPNIGRGSLILDKVKRVSEETYKKWTRRAIPTT 66 Query: 84 GQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 ++ + P AII +C T + V P L LL ++ + ++ Sbjct: 67 DDLILAREAPVGNVAIIPSNLKVCLGQRTVLIRANKNKVFPRYLCYLLLGDEIQGKFFSL 126 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 GAT+ H + K I P + ++KI + D LI + I++L+E Q Sbjct: 127 SNGATVHHLNVKD---IRNLELPKLPPLPTQKKIASILSTYDDLIENNTKRIKILEEMAQ 183 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + K P + +GL+P+ WEVK + + K Sbjct: 184 TIYKEWFVKFRFPGHEQVKMVESELGLIPEGWEVKKLGRIASFKTGKLNSNAAKPDGIYP 243 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + Q++ + S++T IV G + + + Y Sbjct: 244 FFTCSQQIFRTDTY----SFDTECIVLAGN--------NANGIFHIKYFNGKFDVYQRTY 291 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++ + ++ ++ +G + L + + + ++V + Q + Sbjct: 292 VIQTLDKQTASNYYLYFAIKEQLELLKSISTGAATKFLTIKILNNINIIVNSNQIQEQFS 351 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 +VI+ ++ID+L EK + L++ R + Sbjct: 352 DVISTVFSQIDILQEKNQN----LRKTRDLLLPKL 382 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 55/188 (29%), Gaps = 14/188 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G IP+ W+V + R TG+ + + G Y + + T T Sbjct: 208 LGLIPEGWEVKKLGRIASFKTGKLNSNA-----------AKPDGIYPFFTCSQQIFRTDT 256 Query: 78 VSIFAKGQILY--GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S F I+ + +++ + + + Sbjct: 257 YS-FDTECIVLAGNNANGIFHIKYFNGKFDVYQRTYVIQTLDKQTASNYYLYFAIKEQLE 315 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +++I GA K + NI + + Q + I +ID L + + Sbjct: 316 LLKSISTGAATKFLTIKILNNINIIVNSNQIQEQFSDVISTVFSQIDILQEKNQNLRKTR 375 Query: 196 KEKKQALV 203 L+ Sbjct: 376 DLLLPKLI 383 >gi|317483692|ref|ZP_07942647.1| type I restriction modification DNA specificity domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316914864|gb|EFV36331.1| type I restriction modification DNA specificity domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 172 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 55/153 (35%), Gaps = 6/153 (3%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + ++ IL ++ ++ Q + E + P + + + ++ A Sbjct: 24 SNYVDGKILWVTSQDVKQHYIENTTTMISEKGAATLTLYPSDSIVIVARSGILRHTIPVA 83 Query: 309 QVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 ++ + + + S L + + S Y +S+ F +K Sbjct: 84 KLRKPATVNQDIKVIQTVDSCDSSWLLQYFIASNKTLLREYGKTGTTVESIDFAKMKSTA 143 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 ++VP I+EQ I + +R+D L+ ++ Sbjct: 144 LMVPYIEEQQAIGSF----FSRLDNLITLHQRK 172 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 54/168 (32%), Gaps = 13/168 (7%) Query: 25 WKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ ++ G T I+++ +DV+ + + + + +T Sbjct: 1 WEQRKLENLASFGGGHTPSMADASNYVDGKILWVTSQDVKQHYIENTTTMISEKGA--AT 58 Query: 78 VSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDV 133 ++++ I+ LR + V+Q L + ++ + Sbjct: 59 LTLYPSDSIVIVARSGILRHTIPVAKLRKPATVNQDIKVIQTVDSCDSSWLLQYFIASNK 118 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 T E G T+ D+ + + + +P + EQ I I Sbjct: 119 TLLREYGKTGTTVESIDFAKMKSTALMVPYIEEQQAIGSFFSRLDNLI 166 >gi|291543146|emb|CBL16256.1| Restriction endonuclease S subunits [Ruminococcus bromii L2-63] Length = 370 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 52/386 (13%), Positives = 120/386 (31%), Gaps = 48/386 (12%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 I R + +D I + +++P N+ D + K + +Y Sbjct: 7 KIGDLITTVDERNTIGIRDFYGINI------NKEFMPTVANTEGLDERKYKVVRKNRFVY 60 Query: 89 GKLGPYLRKAIIA-----DFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + + I D + S ++ V VLP L+ + + Sbjct: 61 SGMQTGRDECIRISMYTKDKPILVSPAYVTFEVTALSTVLPLYFFLRFLTKEKDRYGAFC 120 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 +G+ S+ DW+ ++ + +P + Q + A Sbjct: 121 SDGSIRSNLDWEVFCDMNIELPSIEIQQKYVDVYNAMLAN-------------------- 160 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 GL+ D+ IE + ++ + + E N + + ++ Sbjct: 161 ---QQSYEHGLDDLKLTCDAYIEELRRKTPCEKIGKYLSECNERN-----NVGLTVNNVR 212 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIV-FRFIDLQNDKRSLRSAQVMERGIITSA 319 ++ + S Y+++ P EI + DK SL E +++S Sbjct: 213 GIATSKEFIDTKANMDGVSLSNYKMIHPNEIAYISDTSRRGDKISLAMNSSDEMYLVSSI 272 Query: 320 --YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQF 376 + YL + + R++ + D+ + + +P I Q Sbjct: 273 STVFRTNKEHLLPEYLFLFYSRTEFDRYARFNSWGSARETFNWNDMCDVKIPIPDITIQK 332 Query: 377 DITN--VINVETARIDVLVEKIEQSI 400 I ++ + +I+ ++ ++I Sbjct: 333 SIAEMYMVYNKRKKINEQLKVQIKNI 358 >gi|205355246|ref|YP_002229047.1| type I restriction-modification system specificity subunit M [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205275027|emb|CAR40113.1| putative Type I restriction-modification system specificity subunit M [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|326630409|gb|EGE36752.1| putative Type I restriction-modification system specificity subunit M [Salmonella enterica subsp. enterica serovar Gallinarum str. 9] Length = 434 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 64/196 (32%), Gaps = 11/196 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY---LPKDG 68 +G +PK W +L G T ++ DI + + D S + Y K Sbjct: 223 LGWMPKGWITTSFNDLIELIGGGTPKTSVEEFWNGDIPWFSVVDAPSESDVYVLTTEKKI 282 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + S+ + KG + G + A++A + + + V+ ++ E + Sbjct: 283 TIEGLNNSSAKLLRKGTTIISARGTVGKCAMVAVPMAMNQSCYGVIGKNNISDE--YIYF 340 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + Q ++ + G+ + NI +P + +I + Sbjct: 341 QLKNAVQTLQQMGHGSVFNTITRDTFKNIKVPFCNEELTNSYSLLVKNYFSKILNNNYQN 400 Query: 189 IRFIELLKEKKQALVS 204 I L L+S Sbjct: 401 IALTNLRDTLLPKLIS 416 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 48/435 (11%), Positives = 125/435 (28%), Gaps = 63/435 (14%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K +P+ F L G K ++ G + + A G Sbjct: 5 KTIPLNEFITLQRGFDLPQDKRVM-----------GDIPVVASTGVVGYHNEEKVLAPG- 52 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI-------- 137 ++ G+ G I +T V K P + L SI +Q Sbjct: 53 VVIGRSGSIGGGQYITTNFWPLNTTLWVKDFKGHHPRFVYYLLRSIYFSQFNVGSGVPTL 112 Query: 138 -EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT---------- 186 G ++ + I L +++ + +++ +I + Sbjct: 113 NRNHLSGILVADTSYSYEKEASDIIGILDDKIKLNKELNHTLEQISQTLFKSWFVDFDPV 172 Query: 187 ---------ERIRFIELLKEKKQALVSYIVTKGLNPDV---KMKDSGIEWVGLVPDHWEV 234 ++ E +Q + + K L D+ + +G +P W Sbjct: 173 IDNALDAGTPIPEALQSRAELRQKIRNSADFKPLPADIRALFPAEFEETELGWMPKGWIT 232 Query: 235 KPFFALVTELNRKN-----TKLIESNILSLSYGNIIQKLE------TRNMGLKPESYETY 283 F L+ + + +I S + + + + + ++ + + Sbjct: 233 TSFNDLIELIGGGTPKTSVEEFWNGDIPWFSVVDAPSESDVYVLTTEKKITIEGLNNSSA 292 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +++ G + + + S Y + + I Y+ + +++ + Sbjct: 293 KLLRKGTTIISARGTVGKCAMVAVPM----AMNQSCYGVIGKNNISDEYIYFQLKN-AVQ 347 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + ++ + K + V ++TN ++ + + L Sbjct: 348 TLQQMGHGSVFNTITRDTFKNIKVPFCN----EELTNSYSLLVKNYFSKILNNNYQNIAL 403 Query: 404 KERRSSFIAAAVTGQ 418 R + + ++G+ Sbjct: 404 TNLRDTLLPKLISGE 418 >gi|291288563|ref|YP_003505379.1| hypothetical protein Dacet_2666 [Denitrovibrio acetiphilus DSM 12809] gi|290885723|gb|ADD69423.1| conserved hypothetical protein [Denitrovibrio acetiphilus DSM 12809] Length = 429 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 69/428 (16%), Positives = 139/428 (32%), Gaps = 55/428 (12%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 I + +L + D+ L + ++P N SD S I KGQ Sbjct: 6 KKIGNYIQLVD----KRNNDLKVNTLLGLTVDKI-FIPSVANIVGSDMSKYKIIKKGQFA 60 Query: 88 YGKL-----GPYLRKAIIADFDGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEA 139 + G + + I S + V + ++LPE L W+ + + Sbjct: 61 CSLMQVRRDGKIPVALLTDFDEAIISQAYPVFKIIDDCELLPEYLMMWMSRSEFDREACF 120 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G +W+ NI +P+P +Q I + E I I + + L+E Sbjct: 121 YAVGGVRGSLEWEDFCNIELPVPNPDKQQQIVD----EYNTIVNRIKLNEQLSQKLEETA 176 Query: 200 QALVSYIVTKGLNPD---------------VKMKDSGIEWVG------LVPDHWEVKPFF 238 Q L + P + SG + V VPD W+ Sbjct: 177 QTLYKHWFVDFEFPITAEYAQSIGKPELEGKPYRSSGGKMVWNNDLDQDVPDEWKYDTLS 236 Query: 239 ALVT------ELNRKNTKLIESNILSLSYGNIIQKLETR----NMGLKPESYETYQIVDP 288 T + +S I + N+ + + V Sbjct: 237 NRCTKIGSGSTPCGGKSAYKKSGISLIRSLNVHDYNFQYRDLAFIDSTQATKLDNVEVKE 296 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVF- 346 +++ + + + V+ + + + V+P + S+YL + + S + Sbjct: 297 KDVLLNITGVSVARCCRVPSNVLPARVNQHVSIVRVEPEKLSSSYLLFTLCSAIYKQKLL 356 Query: 347 -YAMGSGLRQSLKFEDVKRLPVLVP---PIKEQFDITNVINVETARIDVLVE-KIEQSIV 401 + RQ++ D++ +L+P +K +IT+ + + E ++ I+ Sbjct: 357 GSSEAGSTRQAITKGDIEEFEILIPKNDSMKSFEEITDSLICYKENLSAQSEYLLKARIL 416 Query: 402 LLKERRSS 409 LL++ + Sbjct: 417 LLQKMIKA 424 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/142 (21%), Positives = 54/142 (38%), Gaps = 9/142 (6%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQND-KRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + Y+I+ G+ + ++ D K + + II+ AY K Sbjct: 42 NIVGSDMSKYKIIKKGQFACSLMQVRRDGKIPVALLTDFDEAIISQAYPVFKIIDDCELL 101 Query: 333 LAWLM----RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +LM RS + + G+R SL++ED + + VP +Q I + N R Sbjct: 102 PEYLMMWMSRSEFDREACFYAVGGVRGSLEWEDFCNIELPVPNPDKQQQIVDEYNTIVNR 161 Query: 389 IDVLVEKIEQSIVLLKERRSSF 410 ++ EQ L+E + Sbjct: 162 ----IKLNEQLSQKLEETAQTL 179 >gi|167856384|ref|ZP_02479110.1| type I restriction-modification system specificity determinant [Haemophilus parasuis 29755] gi|167852490|gb|EDS23778.1| type I restriction-modification system specificity determinant [Haemophilus parasuis 29755] Length = 166 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 51/160 (31%), Gaps = 9/160 (5%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 +S N++ K + E +I+ I K G Sbjct: 11 YISTENLLSDYGGVTASNKLPTTEKVTAYKKNDILVSNIRPYLKKV---WQADKNGGASN 67 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP-PIKEQ 375 + I+ ++L++ +++ D G+ + PV VP KEQ Sbjct: 68 DIIIIRAKPSINISFLSFAIKNDDFIDYMMKGAKGVKMPRGDLNLISIFPVAVPTSPKEQ 127 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I + + + +D L+ + + I LK + + Sbjct: 128 QAIADCL----SSLDNLINEQNERIGRLKTHKKGLMQQLF 163 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 32/157 (20%), Positives = 55/157 (35%), Gaps = 5/157 (3%) Query: 49 IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS 108 YI E++ S G + + K IL + PYL+K AD +G S Sbjct: 10 NYISTENLLSDYGGVTASNKLPTTEKVTAYK---KNDILVSNIRPYLKKVWQADKNGGAS 66 Query: 109 TQFLVLQPKDVLPELLQGW-LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAE 166 ++++ K + + + + D + +G M D I P+ +P E Sbjct: 67 NDIIIIRAKPSINISFLSFAIKNDDFIDYMMKGAKGVKMPRGDLNLISIFPVAVPTSPKE 126 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 Q I + + + I+ R K Q L Sbjct: 127 QQAIADCLSSLDNLINEQNERIGRLKTHKKGLMQQLF 163 >gi|254875064|ref|ZP_05247774.1| restriction modification system DNA specificity subunit [Francisella tularensis subsp. tularensis MA00-2987] gi|254841063|gb|EET19499.1| restriction modification system DNA specificity subunit [Francisella tularensis subsp. tularensis MA00-2987] Length = 222 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 9/54 (16%), Positives = 22/54 (40%) Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + + + + + +PP+ EQ I ++ +D +E +Q+I Sbjct: 5 GMKHITKGKFENIQIPLPPLAEQKRIVAKLDSLFENVDKAIELHQQNITNANTL 58 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 67/237 (28%), Gaps = 17/237 (7%) Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G M H NI +P+PPLAEQ I K+ + +D I + I Sbjct: 1 MHGVGMKHITKGKFENIQIPLPPLAEQKRIVAKLDSLFENVDKAIELHQQNITNANTLMA 60 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + + K E+ + LV + N+K ++ + Sbjct: 61 STLDKTF----------KKLEGEYSKIALLDVMKISNKTLVPDDNQKYNY-----VVLEN 105 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 +L E + G +++ + +K + I Y Sbjct: 106 IEGNTGRLIDFCETQGKEIKSSKVEFKKGMVLYGKLRPYLNKVWFSEFDDVATTEILPFY 165 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK--RLPVLVPPIKEQ 375 + + S L +V L +K + +PP+ Q Sbjct: 166 PIDNTRLNMIFVKYYFLSSSYLQRVMRNCSGSRMPRLTTAFLKSEEAYIPLPPLPIQ 222 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/125 (26%), Positives = 56/125 (44%), Gaps = 4/125 (3%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + K++ + + Y+ LE++E TG+ + + S+ F KG +LY Sbjct: 79 LLDVMKISNKTLVPDDNQKYNYVVLENIEGNTGRLIDFCETQGKEIKSSKVEFKKGMVLY 138 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEAICEGAT 145 GKL PYL K ++FD + +T+ L P D ++ + LS QR+ C G+ Sbjct: 139 GKLRPYLNKVWFSEFDDVATTEILPFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNCSGSR 198 Query: 146 MSHAD 150 M Sbjct: 199 MPRLT 203 >gi|317481753|ref|ZP_07940783.1| type I restriction modification DNA specificity domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316916801|gb|EFV38193.1| type I restriction modification DNA specificity domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 165 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 55/153 (35%), Gaps = 6/153 (3%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + ++ IL ++ ++ Q + E + P + + + ++ A Sbjct: 11 SNYVDGKILWVTSQDVKQHYIENTTTMISEKGAATLTLYPSDSIVIVARSGILRHTIPVA 70 Query: 309 QVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 ++ + + + S L + + S Y +S+ F +K Sbjct: 71 KLRKPATVNQDIKVIQTVDSCDSSWLLQYFIASNKTLLREYGKTGTTVESIDFAKMKSTA 130 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 ++VP I+EQ I + +R+D L+ ++ Sbjct: 131 LMVPYIEEQQAIGSF----FSRLDNLITLHQRK 159 >gi|15645993|ref|NP_208174.1| restriction modification system S subunit [Helicobacter pylori 26695] gi|2314551|gb|AAD08423.1| restriction modification system S subunit [Helicobacter pylori 26695] Length = 160 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 16/123 (13%), Positives = 43/123 (34%), Gaps = 4/123 (3%) Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + E + ++ P + + + K+ + +K Sbjct: 6 GVILVILKEIATTNQGFQSLIPLEKINNEFLYYLILTLKNKLLKLASGSTFLEVSPNKIK 65 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 L + +PP+ EQ I N+++ + L I + + + + ++ + L+G Sbjct: 66 NLLIPLPPLNEQIAIANILSDLDRYLYNLDALILKK----ESVKKALSFELLSQRKRLKG 121 Query: 424 ESQ 426 +Q Sbjct: 122 FNQ 124 >gi|167761883|ref|ZP_02434010.1| hypothetical protein BACSTE_00226 [Bacteroides stercoris ATCC 43183] gi|167700253|gb|EDS16832.1| hypothetical protein BACSTE_00226 [Bacteroides stercoris ATCC 43183] Length = 402 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 51/411 (12%), Positives = 119/411 (28%), Gaps = 32/411 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQS-----DTST 77 K + + G + + YI L K+ S+ + D Sbjct: 4 KKYKLGEILDVTRGASLSGEFYATEGKYIRLTCGNFDYQNNCFKENKSKDNLYYIGDFKP 63 Query: 78 VSIFAKGQIL-------YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + +G ++ G LG + ++ + + + + + S Sbjct: 64 EFLMEEGDVITPLTEQAIGLLGSTAIIPESGKYIQSQDVAKIICKEELLDKDFAFYLISS 123 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 V Q++ A + + H I + + IP L EQ I + + + +I+ Sbjct: 124 TLVKQQLSAAAQQTKIRHTSPDKIRDCTVWIPELTEQKRIGKLLRSLDRKIELNRAINQN 183 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 ++K+ K SG E + + + T Sbjct: 184 LEAMVKQLYDYWFVQ-FDFPNEEGKPYKSSGGE-------MVWNEKLKRFIPKGWESTTL 235 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPES--YETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 E + + + E+ + + Y + N ++ Sbjct: 236 GNECQMYQPKTLGLSELDESAKYKVYGANGVIGKYHTYNHENSEIAMACRGNSCGTVNRT 295 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + + + I + Y+ ++ + A+ + L E++ + + Sbjct: 296 APFSWITGNAMVIKMIDDLIHNEYIKQALQ---YANIDGAISGSGQPQLTRENLNSIKLC 352 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 P + I + + + I + + E +I L +R + V GQI Sbjct: 353 KPTREL---IICF-SEQVSNIIKMYLQNESNIEELTRQRDELLPLLVNGQI 399 Score = 36.7 bits (83), Expect = 7.8, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 55/194 (28%), Gaps = 19/194 (9%) Query: 10 YKDSGVQ--WIGA----IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + W IPK W+ + ++ +T +GL +++ + KY Sbjct: 209 YKSSGGEMVWNEKLKRFIPKGWESTTLGNECQMYQPKT---------LGLSELDE-SAKY 258 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 N T + +I G + +V K + + Sbjct: 259 KVYGANGVIGKYHTYNH-ENSEIAMACRGNSCGTVNRTAPFSWITGNAMV--IKMIDDLI 315 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ I+ G+ + + +I + P + E++ Sbjct: 316 HNEYIKQALQYANIDGAISGSGQPQLTRENLNSIKLCKPTRELIICFSEQVSNIIKMYLQ 375 Query: 184 LITERIRFIELLKE 197 + E Sbjct: 376 NESNIEELTRQRDE 389 >gi|307245105|ref|ZP_07527198.1| Type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|307254060|ref|ZP_07535907.1| Type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307258516|ref|ZP_07540253.1| Type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306853994|gb|EFM86206.1| Type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306862985|gb|EFM94932.1| Type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306867420|gb|EFM99271.1| Type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 375 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 45/392 (11%), Positives = 109/392 (27%), Gaps = 41/392 (10%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS 70 KD V+W + + T T + + + E+ Sbjct: 8 KDCKVEW----------KSLGEIL-IRTKGTKITAGQMKELHKENAPVKIFAGGRTVAFV 56 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 +D I + I+ G + D + K+ + + Sbjct: 57 DFNDIPQKDINNEPSIIVKSRGII--EFEYYDKSFSHKNEMWSYHSKNENINIKFVYYFL 114 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + I M +PIPPL Q I + + T TL Sbjct: 115 KQNEPHFQNIGSKMQMPQIATPDTDKYKIPIPPLEIQEKIVKTLDIFTKLEATLEATLEA 174 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + L ++ + ++T + + D E + K+ Sbjct: 175 ELSLRVKQYDYYRNELLTFDDDVEFITLDKISENLN--------------SMRKPIKSGL 220 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + I I+ +E E I + G + + + + Sbjct: 221 REKGRIPYYGASGIVDYVEDYIF-----DDEILLISEDGANLIARNTP------IAFSVL 269 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + + A++ ++ ++ + + + DL + L +++ ++P+ Sbjct: 270 GKCWVNNHAHVLKFKTDVERKFVEFYLNNLDLSPFI---SGAAQPKLNKQNLNKIPIPNI 326 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 Q I ++++ + + + + + I L Sbjct: 327 TFATQQKIVDILDKFDRLTNSISDGLPKEIEL 358 >gi|167571301|ref|ZP_02364175.1| restriction modification system DNA specificity domain [Burkholderia oklahomensis C6786] Length = 398 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 55/362 (15%), Positives = 106/362 (29%), Gaps = 31/362 (8%) Query: 81 FAKGQILYGKLGP-----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSIDV 133 G +L G + ++++ L WL D Sbjct: 33 LQDGDVLLNITGDGVTFGRGCLVPSHVLPACVNQHVMLIRTDSTLCHSGYLAAWLALQDS 92 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 IE+ G + I + +P+PPL Q I + A RI+ L Sbjct: 93 KAYIESFNAGGSRRAITKGHIESFNVPLPPLDIQQGIADLAAALNGRIELLRQTNATLES 152 Query: 194 LLKEKKQALV-----SYIVTKGLNPDVK----MKDSGIEW----VGLVPDHWEVKPFFAL 240 + + ++ G P+ K E+ +G +P W+V + + Sbjct: 153 IAQALFKSWFIDFDPVRAKVGGREPECMDAAVAKLFPAEFHESAMGRIPKGWKVGDVYEV 212 Query: 241 VTELNRKN--TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 +KL S L I + PE + + PG+IV Sbjct: 213 AQVTYGAPFASKLFNSEGDGLPLVRIRDLKDEAPGVWTPEVHPKGYRLRPGDIVVGMDGE 272 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 E + KP S + L + + L Sbjct: 273 FR-----AYLWGGEEAWMNQRICVFKPVNGHSAAFVRCAIAAPLAHIEATETATTVIHLG 327 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 D+ R ++VPP + + + + + + +Q+ L + R + + ++G+ Sbjct: 328 KGDIDRFRIVVPPPD----VASAFSAISEPLYERIVAGKQNARTLSKLRDALLPRLISGK 383 Query: 419 ID 420 + Sbjct: 384 LR 385 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 53/192 (27%), Gaps = 14/192 (7%) Query: 19 GAIPKHWKVVPIKRFTKLNTGR------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IPK WKV + ++ G + G + + + D++ G Sbjct: 198 GRIPKGWKVGDVYEVAQVTYGAPFASKLFNSEGDGLPLVRIRDLKD------EAPGVWTP 251 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 G I+ G G + R + + + + V +P + Sbjct: 252 EVHPKGYRLRPGDIVVGMDGEF-RAYLWGGEEAWMNQRICVFKPVNG-HSAAFVRCAIAA 309 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 IEA T+ H I + +PP RI Sbjct: 310 PLAHIEATETATTVIHLGKGDIDRFRIVVPPPDVASAFSAISEPLYERIVAGKQNARTLS 369 Query: 193 ELLKEKKQALVS 204 +L L+S Sbjct: 370 KLRDALLPRLIS 381 >gi|268609387|ref|ZP_06143114.1| putative specificity protein S [Ruminococcus flavefaciens FD-1] Length = 183 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 42/128 (32%), Gaps = 7/128 (5%) Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + ++ G+IVF + D+ S + + I YL + Sbjct: 56 DKERLNKYVLSDGDIVFSRVGSV-DRCSYVDSNHSGWMFSGRCLRVRPYNAIYPLYLYYF 114 Query: 337 MRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + S+ + + + V VP I Q I +++ +I+ Sbjct: 115 FCMESTKRFVRNIAVGATMPSINTKLMGEIEVSVPSIDTQKRIAAILSSIDDKIE----- 169 Query: 396 IEQSIVLL 403 + +I LL Sbjct: 170 LNTAINLL 177 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 60/166 (36%), Gaps = 15/166 (9%) Query: 30 IKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKD---GNSRQSDTSTVS 79 + + TG + +E + G + ++ + + Sbjct: 6 LGSIADIQTGPFGSQLHKEDYVQDGTPIVTVEHL--GNRVFTEQNLPMVSDADKERLNKY 63 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQP-KDVLPELLQGWLLSIDVTQR 136 + + G I++ ++G R + + + S + L ++P + P L + + Sbjct: 64 VLSDGDIVFSRVGSVDRCSYVDSNHSGWMFSGRCLRVRPYNAIYPLYLYYFFCMESTKRF 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + I GATM + K +G I + +P + Q I + + +I+ Sbjct: 124 VRNIAVGATMPSINTKLMGEIEVSVPSIDTQKRIAAILSSIDDKIE 169 >gi|306815514|ref|ZP_07449663.1| restriction modification system DNA specificity domain protein [Escherichia coli NC101] gi|305851176|gb|EFM51631.1| restriction modification system DNA specificity domain protein [Escherichia coli NC101] Length = 300 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 68/197 (34%), Gaps = 20/197 (10%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRN 272 +K +EW+ + + + L++S ++ YG I + + Sbjct: 11 LKGCDVEWI-------SLGNIGKFIRGNGLQKKDLVDSGFPAIHYGQIYTRYGLSADRTF 63 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + PE + +++ ++ A + ++ I+ M + ++ Y Sbjct: 64 NYVSPELANKLRKAQKNDLLLATTSENDEDVVKPLAWLGDKVAISGDMMLFRHEQ-NAKY 122 Query: 333 LAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP-------PIKEQFDITNVINV 384 LA +S +G + + D+ ++ + +P + Q +I +++ Sbjct: 123 LAHFFQSKIFQAQKMKYITGAKVRRVSSGDLAKITIPIPCPDNPEKSLSIQSEIVRILDK 182 Query: 385 ETARIDVLVEKIEQSIV 401 TA L ++ + Sbjct: 183 FTALTAELTAELTAELT 199 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/230 (12%), Positives = 57/230 (24%), Gaps = 27/230 (11%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPK 66 K V+WI + K G + I + + G + Sbjct: 12 KGCDVEWI----------SLGNIGKFIRGNGLQKKDLVDSGFPAIHYGQIYTRYGLSADR 61 Query: 67 DGNSRQSDTSTV-SIFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLP 121 N + + K +L S ++ + + Sbjct: 62 TFNYVSPELANKLRKAQKNDLLLATTSENDEDVVKPLAWLGDKVAISGDMMLFR-HEQNA 120 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKI 174 + L + S + GA + + I +PIP L+ Q I + Sbjct: 121 KYLAHFFQSKIFQAQKMKYITGAKVRRVSSGDLAKITIPIPCPDNPEKSLSIQSEIVRIL 180 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 T L E + + + + + K+ +EW Sbjct: 181 DKFTALTAELTAELTAELTAELTAELTMRKKQYNYYRDQLLSFKEGEVEW 230 >gi|283797287|ref|ZP_06346440.1| type I restriction-modification system, S subunit [Clostridium sp. M62/1] gi|291074955|gb|EFE12319.1| type I restriction-modification system, S subunit [Clostridium sp. M62/1] Length = 448 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 72/219 (32%), Gaps = 20/219 (9%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVT--ELNRKNTKLIESNILSLSYGNIIQKLETRNMGL- 275 E +PD WE LV K+ K + + KL ++ L Sbjct: 13 CIADEVPFEIPDSWEWARLKNLVIKEIKRGKSPKYASDGSVYVFAQKCNVKLGEIDISLA 72 Query: 276 ------KPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGII---TSAYMAVK 324 E Y + + +I+ R + + II + + Sbjct: 73 KFLDMRIFEKYPVEEYMVDEDIIINSTGNGTLGRIGMFRDSDRINDSIIVPDSHVTIIRA 132 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + + YL ++++ Y GS + L+ + L + +PPIKEQ I + Sbjct: 133 CNQLKKDYLFYVLKYYQPFLEKLGEGSTNQTELRPSTIAELFIPIPPIKEQEQIVTKLLE 192 Query: 385 ETARIDVLVEKIEQSIVLLKE-----RRSSFIAAAVTGQ 418 +D L K E ++ + S + A+ G+ Sbjct: 193 VIPMVD-LYGKKENALQAYNTDFPTRLKKSILQEAIQGK 230 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 70/219 (31%), Gaps = 25/219 (11%) Query: 20 AIPKHWKVVPIKRFT--KLNTGRTSESGKDIIY--------IGLEDVESGTGKYLPKDGN 69 IP W+ +K ++ G++ + D + L +++ K+L Sbjct: 21 EIPDSWEWARLKNLVIKEIKRGKSPKYASDGSVYVFAQKCNVKLGEIDISLAKFLDMRIF 80 Query: 70 SRQSDTSTVSIFAKGQILYGKLGP-YLRKAIIADFDGICSTQFLVLQPKDVLPELL---- 124 + + + I+ G L + + + +V + Sbjct: 81 EKYPVEE--YMVDE-DIIINSTGNGTLGRIGMFRDSDRINDSIIVPDSHVTIIRACNQLK 137 Query: 125 --QGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + +E + EG+T + I + +PIPP+ EQ I K++ + Sbjct: 138 KDYLFYVLKYYQPFLEKLGEGSTNQTELRPSTIAELFIPIPPIKEQEQIVTKLLEVIPMV 197 Query: 182 D----TLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 D + + K++++ + L P Sbjct: 198 DLYGKKENALQAYNTDFPTRLKKSILQEAIQGKLVPQDP 236 Score = 40.9 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 10/76 (13%), Positives = 26/76 (34%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W + + +N + ++ + ++ G + ++ S + Sbjct: 295 DIPDSWAWIRMGSLLAVNPRNAVSDDTVVGFMPMPLLQDGFNNDHTFEEKLWKNVKSGFT 354 Query: 80 IFAKGQILYGKLGPYL 95 FA ++ K+ P Sbjct: 355 HFANNDVVIAKITPCF 370 >gi|323189850|gb|EFZ75128.1| type I restriction enzyme specificity protein [Escherichia coli RN587/1] Length = 376 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 48/390 (12%), Positives = 114/390 (29%), Gaps = 58/390 (14%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + + K+ TG+ + + G+Y+ S Sbjct: 17 EWKTLGQTCKIETGKLN-----------ANAAVDDGEYMFFTTAKETSKIDKFRW-DTEA 64 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +L +++ + + ++LS + + +E A Sbjct: 65 LLIAGNANVGEVKHYIGKFEAYQRTYVLTNFDENVSVRFLYFVLSHSLKKYLEERTNSAA 124 Query: 146 MSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 M++ + N P+PIP LA Q I + T L E + Sbjct: 125 MTYIVLSTLENFPIPIPCPGNPQKSLAIQSEIVRILDKFTAVTAELTAELDMRKKQYNYY 184 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEW--VGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + L+S K+ +EW +G V + I Sbjct: 185 RDQLLS------------FKEGEVEWKTLGEVAQFKRGTAITQ---------KQTTPGEI 223 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 ++ G I + + IV S + Sbjct: 224 PVVANGPIPTYFHSESNR------------QGETIVIARSGAY---AGYVSFWNQPIFLT 268 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + + ++ ++++ G+G+ ++ ++ + + +PPI EQ Sbjct: 269 DAFSVHSDLKIVKPKFIYHVLQNKQEHIHAMKKGAGV-PHVRVKEFETYDIPIPPITEQD 327 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKER 406 I ++++ + + E + + I L +++ Sbjct: 328 RIVSILDKFDTLTNSITEGLPREIELRQKQ 357 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 72/212 (33%), Gaps = 22/212 (10%) Query: 1 MKHYKAYPQYKD---SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE 57 M+ K Y Y+D S + G + + + + G + + V Sbjct: 176 MRK-KQYNYYRDQLLSFKE--GEV----EWKTLGEVAQFKRGTAITQKQTTP-GEIPVVA 227 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 +G ++RQ + I+ + G Y + + F V Sbjct: 228 NGPIPTYFHSESNRQGE----------TIVIARSGAYAGYVSFWNQPIFLTDAFSV-HSD 276 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + + + + + I A+ +GA + H K +PIPP+ EQ I + Sbjct: 277 LKIVKPKFIYHVLQNKQEHIHAMKKGAGVPHVRVKEFETYDIPIPPITEQDRIVSILDKF 336 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 +++ R IEL +++ + + + Sbjct: 337 DTLTNSITEGLPREIELRQKQYEYYRDLLFSF 368 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 36/124 (29%), Gaps = 8/124 (6%) Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQ 355 N + + Y+ S + + S+ L K S Sbjct: 67 IAGNANVGEVKHYIGKFEAYQRTYVLTNFDENVSVRFLYFVLSHSLKKYLEERTNSAAMT 126 Query: 356 SLKFEDVKRLPVLVP-------PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + ++ P+ +P + Q +I +++ TA L +++ R Sbjct: 127 YIVLSTLENFPIPIPCPGNPQKSLAIQSEIVRILDKFTAVTAELTAELDMRKKQYNYYRD 186 Query: 409 SFIA 412 ++ Sbjct: 187 QLLS 190 >gi|315038771|ref|YP_004032339.1| type I restriction-modification system subunit S [Lactobacillus amylovorus GRL 1112] gi|312276904|gb|ADQ59544.1| type I restriction-modification system subunit S [Lactobacillus amylovorus GRL 1112] Length = 241 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 22/215 (10%), Positives = 67/215 (31%), Gaps = 5/215 (2%) Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + + V +P + +++ + + V + F + I L+ Sbjct: 32 IKARFVEMFGDPGLVHRENKVCNLENVAEVRSSHRIFTR-EFTKSGVPFYRGTEISLLAN 90 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G + E + G+++ I + + +++ + Sbjct: 91 GKEPIHSYYISKARYDEITKNDSKPKIGDLLMPSICDKGQIWLVNTSKPFYYKDGRVLCI 150 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + ++ + M+ + K +K+L + +PP+K Q + + Sbjct: 151 SPNREKFNTIFFHQYMKMKSQIEYLKIGSGSTFAEFKIFQLKKLKINIPPLKLQNEFASF 210 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + ++D I++S+ + S + + Sbjct: 211 V----QQVDKSKVAIQKSLDETQTLFDSLMQKYFS 241 >gi|158521274|ref|YP_001529144.1| N-6 DNA methylase [Desulfococcus oleovorans Hxd3] gi|158510100|gb|ABW67067.1| N-6 DNA methylase [Desulfococcus oleovorans Hxd3] Length = 1362 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 42/131 (32%), Gaps = 3/131 (2%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + K + E + + G+++ + + V + Sbjct: 519 GQVTKGTSWISEKAIELVKASWKLRAGDVLISKSGTIGKVGIVCNGAVGAVAASGLYVLR 578 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 K ID +L + S + G+ L ++ LPV +PP++ Q + + Sbjct: 579 PKDGRIDPHFLVAYLDSNECRAWLKDRASGGVINHLNKRAIENLPVPIPPLQIQHRVADE 638 Query: 382 INVETARIDVL 392 ++D L Sbjct: 639 FREH--KVDAL 647 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 80/208 (38%), Gaps = 12/208 (5%) Query: 30 IKRFTKLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-FAK 83 + + + G + + YI ++D+E G + + + S Sbjct: 485 LGAIKEESMGEKQPGSLSLTIEPVPYIRIKDIEKGQVTKGTSWISEKAIELVKASWKLRA 544 Query: 84 GQILYGKLGPYLRKAIIADF--DGICSTQFLVLQPK--DVLPELLQGWLLSIDVTQRIEA 139 G +L K G + I+ + + ++ VL+PK + P L +L S + ++ Sbjct: 545 GDVLISKSGTIGKVGIVCNGAVGAVAASGLYVLRPKDGRIDPHFLVAYLDSNECRAWLKD 604 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE--TVRIDTLITERIRFIELLKE 197 G ++H + + I N+P+PIPPL Q + ++ ++ + + + E Sbjct: 605 RASGGVINHLNKRAIENLPVPIPPLQIQHRVADEFREHKVDALNYLMVILSGKGHDSIAE 664 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWV 225 + + + + N + S ++ + Sbjct: 665 WIEKTIKKLPSDMENKRFPLDLSLLDQL 692 >gi|254369156|ref|ZP_04985168.1| predicted protein [Francisella tularensis subsp. holarctica FSC022] gi|157122106|gb|EDO66246.1| predicted protein [Francisella tularensis subsp. holarctica FSC022] Length = 225 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 9/61 (14%), Positives = 23/61 (37%) Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + + + + + + +PP+ EQ I ++ +D +E +Q+I Sbjct: 1 MNNLHGVGMKHITKGKFENIQIPLPPLAEQKRIVAKLDSLFENVDKAIELHQQNITNANT 60 Query: 406 R 406 Sbjct: 61 L 61 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/240 (15%), Positives = 71/240 (29%), Gaps = 17/240 (7%) Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G M H NI +P+PPLAEQ I K+ + +D I + I Sbjct: 1 MNNLHGVGMKHITKGKFENIQIPLPPLAEQKRIVAKLDSLFENVDKAIELHQQNITNANT 60 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + K K+ + + + + + + + Sbjct: 61 LMASTLDKTFKKLEGEYSKIALLDVMKI-----------SNKTLVPDDNQKYNYVGLENI 109 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + G +I ET+ +K E G +++ + L +K + I Sbjct: 110 EGNTGRLIDFCETQGKEIKSSKVE----FKKGIVLYGKLRLYLNKVWFSEFDDVATTEIL 165 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK--RLPVLVPPIKEQ 375 Y + + S L +V L +K + +PP+ Q Sbjct: 166 PFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNYSGSRMPRLTTAFLKSEEAYIPLPPLPIQ 225 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 55/125 (44%), Gaps = 4/125 (3%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + K++ + + Y+GLE++E TG+ + + S+ F KG +LY Sbjct: 82 LLDVMKISNKTLVPDDNQKYNYVGLENIEGNTGRLIDFCETQGKEIKSSKVEFKKGIVLY 141 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEAICEGAT 145 GKL YL K ++FD + +T+ L P D ++ + LS QR+ G+ Sbjct: 142 GKLRLYLNKVWFSEFDDVATTEILPFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNYSGSR 201 Query: 146 MSHAD 150 M Sbjct: 202 MPRLT 206 >gi|89256323|ref|YP_513685.1| hypothetical protein FTL_0976 [Francisella tularensis subsp. holarctica LVS] gi|115314771|ref|YP_763494.1| type I site-specific deoxyribonuclease [Francisella tularensis subsp. holarctica OSU18] gi|167010846|ref|ZP_02275777.1| type I site-specific deoxyribonuclease [Francisella tularensis subsp. holarctica FSC200] gi|254367657|ref|ZP_04983678.1| hypothetical protein FTHG_00927 [Francisella tularensis subsp. holarctica 257] gi|89144154|emb|CAJ79415.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica LVS] gi|115129670|gb|ABI82857.1| type I site-specific deoxyribonuclease [Francisella tularensis subsp. holarctica OSU18] gi|134253468|gb|EBA52562.1| hypothetical protein FTHG_00927 [Francisella tularensis subsp. holarctica 257] Length = 775 Score = 67.1 bits (162), Expect = 6e-09, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 50/138 (36%), Gaps = 3/138 (2%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 I+ N Y+ +++ G ++ + L + + ++ Sbjct: 626 IKDNYINNKPFYVNKYKESDLIEKGTLLITRKGTVGNSYYL--DKDGSFVASSEIFIIKL 683 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 ++ YL+ + S + K + +G SL +K + + +PP++ Q I I Sbjct: 684 NDKVNGNYLSEINLSSFVKKQYREKSTGTIMPSLSQPKLKSILIPLPPLEIQNHIAVRIQ 743 Query: 384 VETARIDVLVEKIEQSIV 401 I L ++ EQ+ Sbjct: 744 KLKDYIKALEQQAEQNRE 761 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 8/175 (4%) Query: 33 FTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 F LN G + + I Y+ + D++ P N + + KG +L + Sbjct: 601 FVSLNNGIAARNYASDGIRYLKVSDIKDNYINNKPFYVNKYKESD----LIEKGTLLITR 656 Query: 91 LGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAICEGATMSH 148 G + D + S++ +++ D + + ++ G M Sbjct: 657 KGTVGNSYYLDKDGSFVASSEIFIIKLNDKVNGNYLSEINLSSFVKKQYREKSTGTIMPS 716 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + +I +P+PPL Q I +I I L + + E +A + Sbjct: 717 LSQPKLKSILIPLPPLEIQNHIAVRIQKLKDYIKALEQQAEQNRENALRNFEAEI 771 >gi|332983074|ref|YP_004464515.1| restriction modification system DNA specificity domain-containing protein [Mahella australiensis 50-1 BON] gi|332700752|gb|AEE97693.1| restriction modification system DNA specificity domain protein [Mahella australiensis 50-1 BON] Length = 215 Score = 67.1 bits (162), Expect = 6e-09, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 67/198 (33%), Gaps = 13/198 (6%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIES----NILSLSYGNIIQKLETR----NMGLKPES 279 PD E + + T N + + + + YG I + E+ Sbjct: 12 CPDGVEYRKLGDVATISRGGNFQKKDCVADGEVPCIHYGQIHTYYNLFVDKTISYISKET 71 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + + + +IV D A + + I + A+ H ++ YL + + S Sbjct: 72 AKKQKFAETNDIVMAVTSENIDDVCKCIAWLGKGKIAVGGHTAIIHHLLNPKYLVYFLSS 131 Query: 340 YDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + G + + + + + + VPP+ Q +I +++ T L + Sbjct: 132 SLFYQQKVKLAHGTKVIEVTPDKLVDIIIPVPPLPVQQEIVRILDNFTELTTELTTDLTA 191 Query: 399 SIVLLKE----RRSSFIA 412 + ++ R ++ Sbjct: 192 ELTARQKQYEYYRDKLLS 209 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 62/198 (31%), Gaps = 10/198 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P + + ++ G + ++ I + + ++ K + +T+ Sbjct: 13 PDGVEYRKLGDVATISRGGNFQKKDCVADGEVPCIHYGQIHTYYNLFVDKTISYISKETA 72 Query: 77 TVSIF-AKGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 F I+ + + I + + P+ L +L S Sbjct: 73 KKQKFAETNDIVMAVTSENIDDVCKCIAWLGKGKIAVGGHTAIIHHLLNPKYLVYFLSSS 132 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 Q+ + G + + +I +P+PPL Q I + T L T+ Sbjct: 133 LFYQQKVKLAHGTKVIEVTPDKLVDIIIPVPPLPVQQEIVRILDNFTELTTELTTDLTAE 192 Query: 192 IELLKEKKQALVSYIVTK 209 + +++ + +++ Sbjct: 193 LTARQKQYEYYRDKLLSF 210 >gi|124009162|ref|ZP_01693844.1| type I site-specific deoxyribonuclease [Microscilla marina ATCC 23134] gi|123985260|gb|EAY25187.1| type I site-specific deoxyribonuclease [Microscilla marina ATCC 23134] Length = 491 Score = 67.1 bits (162), Expect = 6e-09, Method: Composition-based stats. Identities = 58/433 (13%), Positives = 129/433 (29%), Gaps = 55/433 (12%) Query: 27 VVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVE--SGTGKYLPKDGNSRQSDTS 76 V + K GR ++ +I D+ S + + + + Sbjct: 48 VYKLLDVIKTQRGRFGHRPRNDPAFYGGKYPFIQTGDIVKASQSDGKVVYSQTLNEKGVN 107 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 T +F +L + + I D+ + L PKD + + + Sbjct: 108 TSRLFQPN-VLVMTIAANIGDTAILDYPACFPDSLIALYPKDKRLNINYLNVYFKFIKPY 166 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI---- 192 +E + + + + + + +IP+ +PP Q I K + + + + Sbjct: 167 LEKLAPQSAQKNINIQQLSSIPIIVPPEDRQRQICLKYDKVVHVKQQKLEQAQQLLVGIN 226 Query: 193 -----------ELLKEKKQALV----SYIVTKG-LNPDVKMKDSGIEWVGLVPDHWEVKP 236 Q+ + V+ G +P K+ + + ++ Sbjct: 227 HYLLKELGIVLPKKDTSLQSRIFTVPMRAVSGGRFDPKRYDKNVKDLKGAIEGNRFDSTK 286 Query: 237 FFALVTELNRKNTKLIESNILSLSYGNI-----IQKLETRNMGLKPESYETYQI------ 285 L+ + E+ Y + N+ LK + I Sbjct: 287 LKTLIVSSRAGDWGKDENTNPGSEYCRCLVVRATEFDNVYNLKLKNNRVKHRLIHQEKIK 346 Query: 286 ---VDPGEIVFRFIDLQ-----NDKRSLRSAQVMERGIITSAYM---AVKPHGIDSTYLA 334 + P +++ LR +++ I S ++ V + YL Sbjct: 347 EIDIRPNDLLIEKSGGSQDQPVGRVAILREELLVKEQICYSNFIHKIRVNSQKVLPEYLF 406 Query: 335 WLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++ K+ AM S ++L D + +P + +Q I I R L Sbjct: 407 CFLKTVHHIKLTDAMQSQTNGIRNLIMPDYLEQTIPLPNLAKQSAIVAHIQDLRDRAKQL 466 Query: 393 VEKIEQSIVLLKE 405 + Q++ K+ Sbjct: 467 QAEANQALAQAKQ 479 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 15/167 (8%), Positives = 51/167 (30%), Gaps = 8/167 (4%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 R + + G+I++ ++ + ++ + + + Sbjct: 62 FGHRPRNDPAFYGGKYPFIQTGDIVKASQSDGKVVYSQTLNEKGVNTSRLFQPNVLVMTI 121 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKF 359 +A + + +A+ P + + + ++++ Sbjct: 122 AANIGDTAILDYPACFPDSLIALYPKDKRLNINYLNVYFKFIKPYLEKLAPQSAQKNINI 181 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + + +P++VPP Q I + D +V +Q + ++ Sbjct: 182 QQLSSIPIIVPPEDRQRQI-------CLKYDKVVHVKQQKLEQAQQL 221 >gi|225854394|ref|YP_002735906.1| type I restriction enzyme [Streptococcus pneumoniae JJA] gi|303260414|ref|ZP_07346383.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP-BS293] gi|303265060|ref|ZP_07350974.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS397] gi|225722976|gb|ACO18829.1| type I restriction enzyme [Streptococcus pneumoniae JJA] gi|302638449|gb|EFL68915.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP-BS293] gi|302645420|gb|EFL75653.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS397] Length = 195 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 49/161 (30%), Gaps = 19/161 (11%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + +++ + Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSK 153 Query: 388 RI----DVLVEK---------IEQSIVLLKERRSSFIAAAV 415 I + L E I++S+ L+ + S + Sbjct: 154 LILRRQEQLEELNLLVKSQLAIQKSLEELETLKKSLMQEYF 194 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 46/169 (27%), Gaps = 2/169 (1%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 M H K NI +P L EQ I ++ + I + L Sbjct: 120 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNLL 168 >gi|332877500|ref|ZP_08445247.1| type I restriction modification DNA specificity domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684606|gb|EGJ57456.1| type I restriction modification DNA specificity domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 373 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 52/392 (13%), Positives = 97/392 (24%), Gaps = 38/392 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W + + G V+ T + +T S + Sbjct: 8 WLSCTLDSVCDIQ-------------FGTRIVKKQTEAGQYYVYGGGGATFTTKSYNREN 54 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I + + + L L PK L + +I ++ GA Sbjct: 55 AITVSRFALSKECTRFIEGKFFLNDSGLTLHPKTNLLLFQFLKWHVFALNDKIYSLARGA 114 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + D K + + +P EQ I ++ + +I I L Q++ Sbjct: 115 AQKNLDVKRFSKLLIKLPKNNEQCTIATELD----TLQKMIDGYKAQIADLDVLAQSIFV 170 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 D + +G + LS Sbjct: 171 DTFGNVAVNDKCWDIIQMGQLGNFKNGLNYSKGEIGKPLKIIGVGDFQNIKCLS------ 224 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND--KRSLRSAQVMERGIITSAYMA 322 + E ++ +IVF + + R L + + Sbjct: 225 ---SFDNISYINIEDISQEYLLHNEDIVFVRSNGNKNLVGRCLEVFPNSTEVTFSGFCIR 281 Query: 323 VKPHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + K Q++ + + LP+ VPPI Q Sbjct: 282 FRKSVEIINKYLIATLTDIGFKNTHILKSNGIGIQNINQKLLSSLPIPVPPIGMQKKYAA 341 Query: 381 VINVETARIDVLVEKIEQSI----VLLKERRS 408 + I+ E I Q + L+KER Sbjct: 342 QVEA----IEKQKELIRQQLADAETLMKERMQ 369 >gi|156502396|ref|YP_001428461.1| putative N-6 DNA methylase [Francisella tularensis subsp. holarctica FTNF002-00] gi|290952883|ref|ZP_06557504.1| putative N-6 DNA methylase [Francisella tularensis subsp. holarctica URFT1] gi|295313928|ref|ZP_06804493.1| putative N-6 DNA methylase [Francisella tularensis subsp. holarctica URFT1] gi|156252999|gb|ABU61505.1| putative N-6 DNA methylase [Francisella tularensis subsp. holarctica FTNF002-00] Length = 775 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 50/138 (36%), Gaps = 3/138 (2%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 I+ N Y+ +++ G ++ + L + + ++ Sbjct: 626 IKDNYINNKPFYVNKYKESDLIEKGTLLITRKGTVGNSYYL--DKDGSFVASSEIFIIKL 683 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 ++ YL+ + S + K + +G SL +K + + +PP++ Q I I Sbjct: 684 NDKVNGNYLSEINLSSFVKKQYREKSTGTIMPSLSQPKLKSILIPLPPLEIQNHIAVRIQ 743 Query: 384 VETARIDVLVEKIEQSIV 401 I L ++ EQ+ Sbjct: 744 KLKDYIKALEQQAEQNRE 761 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 60/175 (34%), Gaps = 8/175 (4%) Query: 33 FTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 F LN G + + I Y+ + D++ P N + + KG +L + Sbjct: 601 FVSLNNGIAARNYASDGIRYLKVSDIKDNYINNKPFYVNKYKESD----LIEKGTLLITR 656 Query: 91 LGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAICEGATMSH 148 G + D + S++ +++ D + + ++ G M Sbjct: 657 KGTVGNSYYLDKDGSFVASSEIFIIKLNDKVNGNYLSEINLSSFVKKQYREKSTGTIMPS 716 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + +I +P+PPL Q I +I I L + + E +A + Sbjct: 717 LSQPKLKSILIPLPPLEIQNHIAVRIQKLKDYIKALEQQAEQNRENALRNFEAEI 771 >gi|269978342|gb|ACZ55905.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 374 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 50/396 (12%), Positives = 109/396 (27%), Gaps = 67/396 (16%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ + ++ ++ G T + ++GT + + SI Sbjct: 13 PEGVEFKTLEEVFEIKNGYTPSKNNPEFW------KNGTIPWFRMEDIRENGRILKDSI- 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + + + + Sbjct: 66 --------------------------------------------QFYQCFLLGEWCKKNT 81 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + + D PIPPL Q I + + A T L TE ++ K++ Q Sbjct: 82 NVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTELKARKKQYQ- 140 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLV---------PDHWEVKPFFALVTELNRKNTKLI 252 ++ + + KD+ I+ P E K L N Sbjct: 141 YYQNMLLDFNDINSNHKDAKIKSYPKRLKTLLQTLAPKGVEFKKVGELFKRNKGINITAA 200 Query: 253 ESNILSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + L G I + + I++ ++ + + + Sbjct: 201 QMKELHSEIGKVRIFAGGATKADINYKDISKKDIINCESVIIKSRGNIGFEYYNQPFSHK 260 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVP 370 S+ K + + +L + + + A S ++ L D V +P Sbjct: 261 NEIWSYSS----KTNQMLVKFLYYYLSNNQDYFQKLAQSSSVKLPQLSVSDTDEYEVPIP 316 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 P++ Q +I +++ + L+ I I K++ Sbjct: 317 PLEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQ 352 >gi|158522736|ref|YP_001530606.1| restriction modification system DNA specificity subunit [Desulfococcus oleovorans Hxd3] gi|158511562|gb|ABW68529.1| restriction modification system DNA specificity domain [Desulfococcus oleovorans Hxd3] Length = 500 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 56/396 (14%), Positives = 122/396 (30%), Gaps = 30/396 (7%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + ++G + GKY + Q+ G +++ Sbjct: 5 KLNNLFDFLPKSKVKAGDGL----------EDGKYPFYTSSENQAKYLDEFQHEPGCLVF 54 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSIDVTQRIEAICEGATMS 147 G G ST + ++PK + + Q +E+ +GA + Sbjct: 55 GTGG--KASVHFTTSRFATSTDCITIRPKPNAKIDASYVFQYFKGNIQVLESGFKGAGLK 112 Query: 148 HADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 H + +I +P P + +Q I + I + +L L S Sbjct: 113 HISKTYLSDILIPFPKEIDDQKRIAHLLGKVEGLIAQRKQNLQQLDDL-------LKSVF 165 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + +P K E + + + F+ + K + I + Sbjct: 166 LEMFGDPVRNEKGWETERLVEIAS--IERGRFSPRPRNDPKYYNGVHPFIQTGDINRSNG 223 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 +L L + + G IV + + ++ + Sbjct: 224 RLREYTQTLNELGIKVSKEFKVGTIVIAIVGATIGETAILEIPTYAPDSVIGITPKGNNS 283 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 +S ++ +++R + A R ++ E ++ LPV+ P I +V Sbjct: 284 AAESIFIEYILR--FWKPILRAKAPEAARANINIETLRPLPVIRPQSD--DRI--KFSVI 337 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + +I+ + +QS+ L+ + A G++DL Sbjct: 338 STKIEGIKSSYQQSLAELENLYGALSQKAFKGELDL 373 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 63/200 (31%), Gaps = 14/200 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 K W+ + + GR S ++ +I D+ G+ + Sbjct: 177 KGWETERLVEIASIERGRFSPRPRNDPKYYNGVHPFIQTGDINRSNGRLREYTQTLNELG 236 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP--ELLQGWLLSID 132 F G I+ +G + + I + + + PK E + + Sbjct: 237 IKVSKEFKVGTIVIAIVGATIGETAILEIPTYAPDSVIGITPKGNNSAAESIFIEYILRF 296 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + A A ++ + + + P+P + Q R K + +I+ + + + + Sbjct: 297 WKPILRAKAPEAARANINIETL----RPLPVIRPQSDDRIKFSVISTKIEGIKSSYQQSL 352 Query: 193 ELLKEKKQALVSYIVTKGLN 212 L+ AL L+ Sbjct: 353 AELENLYGALSQKAFKGELD 372 >gi|218282510|ref|ZP_03488760.1| hypothetical protein EUBIFOR_01342 [Eubacterium biforme DSM 3989] gi|218216497|gb|EEC90035.1| hypothetical protein EUBIFOR_01342 [Eubacterium biforme DSM 3989] Length = 365 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 46/379 (12%), Positives = 112/379 (29%), Gaps = 27/379 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + TK+ TG+ + S GKY + ++ S + +L Sbjct: 3 VKVGEITKIKTGKLD-----------ANASSADGKYPFFTCSKDPLRINSYS-YDCECVL 50 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G K FD +++ + L + + + G + Sbjct: 51 VAGNGDLNVKYYNGKFDAY-QRTYIIEDNSNGLLYMPYLYHFLEGYIGELRKQSIGGVIK 109 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + ++ + +P + EQ I + I+ + L + + + Sbjct: 110 YIKLGNLTDVLVELPSIVEQKYIVNLMNISLELIELRKKTIDKLDSL-------VKARFI 162 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 +P + + ++ + + I G + Sbjct: 163 EMFGDPYTNPLKWEKLKIKDAVTVEPQNGLYKPQSDYVTDRSGIPILRIDGFYDGIVTDF 222 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK--P 325 + + + Y +++ ++ R ++ + ++E + S M + P Sbjct: 223 ASLKRLKCSETEKQKYLLLEDDIVINRVNSIEYLGKCAHIKGLLEDTVYESNMMRMHFDP 282 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 +S Y+ L+ S + + S+ +DV + PPI Q + I Sbjct: 283 ETYNSVYICKLLCSQFIYDQIVNHAKKAVNQASINQKDVLDFNIYQPPIDLQNQFADFIQ 342 Query: 384 VETAR---IDVLVEKIEQS 399 I + ++E+ Sbjct: 343 QVDKSRFDIKKSIIELERE 361 Score = 41.3 bits (95), Expect = 0.33, Method: Composition-based stats. Identities = 15/105 (14%), Positives = 40/105 (38%), Gaps = 4/105 (3%) Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 ++ + + +G+ + + ++ G+ + +K ++ Sbjct: 59 VKYYNGKFDAYQRTYIIEDNSNGLLYMPYLYHFLEGYIGELRKQSIGGVIKYIKLGNLTD 118 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + V +P I EQ I N++N+ L+E +++I L + Sbjct: 119 VLVELPSIVEQKYIVNLMNISLE----LIELRKKTIDKLDSLVKA 159 >gi|253730781|ref|ZP_04864946.1| EcoA family type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus USA300_TCH959] gi|253725494|gb|EES94223.1| EcoA family type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus USA300_TCH959] Length = 347 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 49/352 (13%), Positives = 105/352 (29%), Gaps = 32/352 (9%) Query: 24 HWKVVPIKRFTKLNTG--RTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W+ + K+ G +T + + + I ++ +E++++ K + + Sbjct: 20 EWEEKKLGEVAKIYDGTHQTPKYTNEGIKFLSVENIKTLNS---SKYISEEAFEKEFKIR 76 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR--IE 138 G IL ++G I++ + L L L L+ Q Sbjct: 77 PEFGDILMTRIGDIGTPNIVSSNEKFAYYVSLALLKTKNLNSYFLKNLILSSSIQNELWR 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + IG I + P EQ I + +I+ + + K Sbjct: 137 KTLHVAFPKKINKNEIGKIKINYPKKQEQQKIGQFFSKLDRQIELEEQKLELLQQQKKGY 196 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 Q + S + + G WE + F + N+ + E+ + Sbjct: 197 MQKIFSQELRFK------------DENGNDYPEWEERRFADIFKFHNKLRKPIKENLRVK 244 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR-SAQVMERGIIT 317 SY Y I D ++ RS V + + Sbjct: 245 GSYPYYGATGII--------DYVDDFIFDGNYLLIGEDGANIITRSAPLVYLVNGKFWVN 296 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + P + + +L + +L + L +++K + V++ Sbjct: 297 NHAHILSPLNGN---IQYLYQVAELVNYEKYNTGTAQPKLNIQNLKIISVVI 345 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 52/151 (34%), Gaps = 5/151 (3%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + + + K I LS NI ++ + + E + G+I+ I Sbjct: 32 IYDGTHQTPKYTNEGIKFLSVENIKTLNSSKYISEEAFEKEFKIRPEFGDILMTRIGDIG 91 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLK 358 + S E+ + +K ++S +L L+ S + + + + Sbjct: 92 TPNIVSSN---EKFAYYVSLALLKTKNLNSYFLKNLILSSSIQNELWRKTLHVAFPKKIN 148 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARI 389 ++ ++ + P +EQ I + +I Sbjct: 149 KNEIGKIKINYPKKQEQQKIGQFFSKLDRQI 179 >gi|229547175|ref|ZP_04435900.1| restriction modification system DNA specificity domain protein [Enterococcus faecalis TX1322] gi|229307705|gb|EEN73692.1| restriction modification system DNA specificity domain protein [Enterococcus faecalis TX1322] Length = 174 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 56/170 (32%), Gaps = 8/170 (4%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 A E + KN L ++I S I KL + N+ + + I+ G+ Sbjct: 12 DHFEYGLNASAIEYDGKNKYLRITDIDDSSRKFIQNKLTSPNINV---EEASNYILTVGD 68 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I+F K + + A DS ++ W + M Sbjct: 69 ILFARTGASVGKTYRYDIKDGKVYFAGFLIRARIKDSFDSEFVYWTTLTDRYNTFIKIMS 128 Query: 351 S-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + ++ +L+P IKEQ I + +ID + ++ Sbjct: 129 QRSGQPGINAKEYSSFNILIPNIKEQQKIGAFL----KKIDDTIALHQRK 174 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 50/168 (29%), Gaps = 10/168 (5%) Query: 24 HWKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS--RQSDTST 77 W++ + E Y+ + D++ + K++ S + ++ Sbjct: 1 DWELCKLGDVADHFEYGLNASAIEYDGKNKYLRITDIDDSSRKFIQNKLTSPNINVEEAS 60 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDV 133 I G IL+ + G + K D E + L+ Sbjct: 61 NYILTVGDILFARTGASVGKTYRYDIKDGKVYFAGFLIRARIKDSFDSEFVYWTTLTDRY 120 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 I+ + + + + K + + IP + EQ I + I Sbjct: 121 NTFIKIMSQRSGQPGINAKEYSSFNILIPNIKEQQKIGAFLKKIDDTI 168 >gi|166366728|ref|YP_001659001.1| type I restriction-modification system [Microcystis aeruginosa NIES-843] gi|166089101|dbj|BAG03809.1| type I restriction-modification system [Microcystis aeruginosa NIES-843] Length = 240 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 65/192 (33%), Gaps = 9/192 (4%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 I P V V + + I I +I + T + E Sbjct: 40 IGEFEQQPLGNFVDVVSRSVNPRSSRYAGQIFEYIDLREVDDIYGYILTLKLNQGNEIGS 99 Query: 282 TYQIVDPGEIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR- 338 T +I+F I L N K +L + V + ++ ++ ++ L +L R Sbjct: 100 TKHRFQKNDILFAKIMPSLANKKIALVTQDVT-NAVASTEFIVLRKKSQAEINLYYLFRA 158 Query: 339 --SYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 S + A +G RQ + + L ++VPP + Q I + + E + L Sbjct: 159 LRSDHFTRQATANVTGATGRQRISPSRLLELQIIVPPEEIQTQIGDAVEQEFT-LRTLAA 217 Query: 395 KIEQSIVLLKER 406 + + L + Sbjct: 218 EQSKKADDLAQL 229 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/169 (22%), Positives = 60/169 (35%), Gaps = 14/169 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + P+ F + + + YI L +V+ G L N ST Sbjct: 44 EQQPLGNFVDVVSRSVNPRSSRYAGQIFEYIDLREVDDIYGYILTLKLNQGNEIGSTKHR 103 Query: 81 FAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDV---LPELLQGWLLSID 132 F K IL+ K+ P L I + + ST+F+VL+ K L L S Sbjct: 104 FQKNDILFAKIMPSLANKKIALVTQDVTNAVASTEFIVLRKKSQAEINLYYLFRALRSDH 163 Query: 133 VTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 T++ A GAT + + + +PP Q I + + E Sbjct: 164 FTRQATANVTGATGRQRISPSRLLELQIIVPPEEIQTQIGDAVEQEFTL 212 >gi|332074786|gb|EGI85259.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17570] Length = 191 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 53/149 (35%), Gaps = 11/149 (7%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 37 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 93 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + ++ YL +++ S + F ++ SG ++L + V + + +PP+ EQ I I Sbjct: 94 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLAEQQRIIEAIES 153 Query: 385 ETARIDV-------LVEKIEQSIVLLKER 406 ++D L + ++ LK Sbjct: 154 ALEKVDEYAESYNRLEQLDKKFPDKLKNL 182 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 37/185 (20%), Positives = 70/185 (37%), Gaps = 9/185 (4%) Query: 34 TKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ G + KD I +I + D E G ++S + KG Sbjct: 2 VEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGT 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGA 144 L + R I+ I + ++ L + ++LS + V + ++ GA Sbjct: 62 FLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGA 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + + + +I +P+PPLAEQ I E I + ++D R +L K+ L + Sbjct: 122 VVKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNRLEQLDKKFPDKLKN 181 Query: 205 YIVTK 209 Sbjct: 182 LFFNM 186 >gi|188518460|ref|ZP_03003947.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 11 str. ATCC 33695] gi|188997991|gb|EDU67088.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 11 str. ATCC 33695] Length = 391 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 47/399 (11%), Positives = 116/399 (29%), Gaps = 38/399 (9%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + +K G T S + I +G Y+ ++ Sbjct: 3 IKLKDIIYAKRGSTITSNEFKINPGSYPLISASAQNNGVFGYINS------------YMY 50 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR-IEAI 140 G I G D S ++ + + + I+++ Sbjct: 51 EGGHITISMNGNAGCVFYQKDKFSANSDVLVLSNIDNKISNNKFIFYWLKKHENTKIKSL 110 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 C+G T + N+ + +PP+ EQ I I +K Sbjct: 111 CKGTTRLRLSNDDVLNLEINLPPIEEQNAIISIIEPHEKLFVKYSNLVDISSVENAKKDV 170 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + I+ +K+ I++ + ++ + + N K L + ++ Sbjct: 171 DNLISIIEPIEKVINNIKN--IKFKIESLVNKYFDFLYSNLEDSNFKKYILGDLFTINRG 228 Query: 261 YGNIIQKLETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 + +E+ K Y + F I Q Sbjct: 229 QIINSKYIESNIGSYPVISSNTKNNGVFGYINSYMYDGEFITISADGAYAGTVFLQNGRF 288 Query: 314 GIITSAYMAVKPHGID----STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 I ++ +K + ID + ++ ++++ + R +++ +K + + + Sbjct: 289 SITNVCFILIKNNDIDFKFSNKFVYYILKKEQEVNKLKSQVGSSRPAVREYSLKEIKINL 348 Query: 370 PPIKEQFDITNV------INVETARIDVLVEKIEQSIVL 402 P I+ Q + + ++ + +I+ ++ I Sbjct: 349 PNIEIQEKFSKIVEPLLNLSTKANKIEKILNDSLLKITK 387 >gi|312278975|gb|ADQ63632.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus ND03] Length = 94 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 42/96 (43%), Gaps = 7/96 (7%) Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + M ++P GID Y + L K+ + + + ++ +L+P ++EQ Sbjct: 3 TNMMVLEPKGIDPEYRYTFINKTGLYKIAD---TSTIPQINNKHIEPYLLLIPSLEEQHK 59 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I + +D + ++ + LLKE++ F+ Sbjct: 60 IGSF----FKHLDETIALHQRKLDLLKEQKKGFLQK 91 >gi|148377831|ref|YP_001256707.1| hypothetical protein MAG_5680 [Mycoplasma agalactiae PG2] gi|148291877|emb|CAL59268.1| Hypothetical protein MAG5680 [Mycoplasma agalactiae PG2] Length = 377 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 18/170 (10%), Positives = 54/170 (31%), Gaps = 11/170 (6%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPES---YETYQIVDPGEIVFRFIDLQNDKRS 304 I + + + K E+ + + I+F + Sbjct: 39 KKYYENGTIPFIKVEDTVNKYIENGKYFITENGLINSSAWLAPENSIIFTNGATIGNVAI 98 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVK 363 + ++GI+ + D ++ +L+ S + + G + ++ Sbjct: 99 NKIKTATKQGILG----IIPKQKYDVEFIYYLLSSKNFQNEVNRKITIGTFAMITLSNLD 154 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ V +P + I+ + +D L+ ++ + LK ++ + Sbjct: 155 KIKVNLPNYDIERA---KISSLFSHLDSLITLHQRKLSSLKNLKNRLLDK 201 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 50/386 (12%), Positives = 104/386 (26%), Gaps = 37/386 (9%) Query: 25 WKVVPIKRFTKLNT-GRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ + + G T + I +I +ED + + S Sbjct: 16 WEQEKFANIYQFASEGGTPSTSIKKYYENGTIPFIKVEDTVNKYIENGKYFITENGLINS 75 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQ 135 + + + I++ G + I L + PK E + L S + Sbjct: 76 SAWLAPENSIIFTN-GATIGNVAINKIKTATKQGILGIIPKQKYDVEFIYYLLSSKNFQN 134 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + T + + I + +P + + +D+LIT R + L Sbjct: 135 EVNRKITIGTFAMITLSNLDKIKVNLPNYDIERAKISSL---FSHLDSLITLHQRKLSSL 191 Query: 196 KEKKQALVSYIVTKG--LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 K K L+ + P ++ K+ W +V + N + + E Sbjct: 192 KNLKNRLLDKMFCDEKSQFPSIRFKEFTNAWEQWKIGDMFSVGRGYVVPKKNIYSNRQGE 251 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 S + L + + + Sbjct: 252 YIYPIYSSQTVNDGLLGYYNKYLTTN--------------SITWTTDGANAGTVFYRKGL 297 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-I 372 T+ + + L S K +G+ L + + + + I Sbjct: 298 FYATNVCGILSQKQFEPNIYLALALSRVSHKHVTKVGN---PKLMNNAMANIDLQITSDI 354 Query: 373 KEQFDITNVINVETARIDVLVEKIEQ 398 KEQ I + +D L+ ++ Sbjct: 355 KEQSKI----SSLFYHLDSLITLHQR 376 >gi|88596084|ref|ZP_01099321.1| type II restriction-modification enzyme [Campylobacter jejuni subsp. jejuni 84-25] gi|88190925|gb|EAQ94897.1| type II restriction-modification enzyme [Campylobacter jejuni subsp. jejuni 84-25] Length = 1365 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 54/472 (11%), Positives = 137/472 (29%), Gaps = 86/472 (18%) Query: 26 KVVPIKRF------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---- 75 ++V +K F K +G + + +G E +++ +G + + Sbjct: 896 ELVRLKDFVLDIQTAKRPSGGVGKYENGALSLGGEHIDNKSGYIKLDNPKYVPIEFYESF 955 Query: 76 --STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP------KDVLPELLQGW 127 I + IL K G K + + I + + + L Sbjct: 956 ALQDKGIVKQFDILICKDGALTGKIAMVRNEFIRKSAMINEHIFLLRCDNIAKQKYLFYI 1015 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK-------------- 173 L S Q +++ G+ + + +I +P Q I + Sbjct: 1016 LHSYSGQQALKSKITGSAQGGINKTNLESILIPNADFEIQKQIVAECEKVEEQYNTIRMS 1075 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--------- 224 I I ++ + + + +++ + D + S IE Sbjct: 1076 IEEYQNLIKAILQKCGIIDDGGGYELNSILENLQKLEFKLDFNLLLSLIEEQISHSEVLV 1135 Query: 225 ----------------------------VGLVPDHWE--------VKPFFALVTELNRKN 248 + P + K Sbjct: 1136 EETQSKERKQDFNAFKNFSKTIQELLQTLSTPPKDGWKRISLKNEQYMELNPSKKEISKL 1195 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + + ++ + ++++ E + Y +I+ I + A Sbjct: 1196 DENMLVSFIEMASVSDKGYIQSKIDRSLNEVRKGYTYFIENDILIAKITPCMENGKCAIA 1255 Query: 309 QVMERGI---ITSAYMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVK 363 + + I T ++ G+DS++L + + ++ + G+ + + + Sbjct: 1256 KNLTNNIGFGSTEFHIFRAKTGLDSSFLFYNLNQQNIREKAALAMTGASGHKRVPISFYE 1315 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 L + +PP++ Q I I + +ID L + L++ + + + Sbjct: 1316 NLTIPLPPLEIQEKIVQNIELVEQQIDFL----NLKLEFLEKEKEKILQKYL 1363 Score = 36.7 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 64/196 (32%), Gaps = 19/196 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK + +K + + + +I + V S G K S Sbjct: 1171 GWKRISLKN--EQYMELNPSKKEISKLDENMLVSFIEMASV-SDKGYIQSKIDRSLNEVR 1227 Query: 76 STVSIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + F + IL K+ P + + G ST+F + + K L + L Sbjct: 1228 KGYTYFIENDILIAKITPCMENGKCAIAKNLTNNIGFGSTEFHIFRAKTGLDSSFLFYNL 1287 Query: 130 SIDVTQRI--EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + A+ + N+ +P+PPL Q I + I +ID L + Sbjct: 1288 NQQNIREKAALAMTGASGHKRVPISFYENLTIPLPPLEIQEKIVQNIELVEQQIDFLNLK 1347 Query: 188 RIRFIELLKEKKQALV 203 + ++ Q + Sbjct: 1348 LEFLEKEKEKILQKYL 1363 >gi|218667559|ref|YP_002425174.1| type I restriction-modification system, S subunit, putative [Acidithiobacillus ferrooxidans ATCC 23270] gi|218519772|gb|ACK80358.1| type I restriction-modification system, S subunit, putative [Acidithiobacillus ferrooxidans ATCC 23270] Length = 561 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 69/206 (33%), Gaps = 15/206 (7%) Query: 18 IGA------IPKHWKVVPIKRFT-KLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKD 67 IG +P+ W+ + ++ G + I ++ ++D+ +G + Sbjct: 356 IGEGEKPHPLPRSWEWARFGDISYQITDGAHHTPTYVNEGIPFLSVKDMSAGRLDFSDTR 415 Query: 68 GNSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPE 122 SR+ + +G +L K+G ++ +F S + + Sbjct: 416 FISREQHEELIKRCFPQRGDLLLTKVGTTGIPILVDTDEEFSIFVSVALIKFPLNHIHGR 475 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L + S V ++ E EG + + I + IPPLAEQ I K+ D Sbjct: 476 YLSLLVSSPLVKRQSEEGTEGIGNKNLVLRKIAAFVLAIPPLAEQHRIVAKVDELMALCD 535 Query: 183 TLITERIRFIELLKEKKQALVSYIVT 208 L A+V V Sbjct: 536 ALKVRLADAQTTQLHLADAIVERAVC 561 Score = 66.4 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 53/440 (12%), Positives = 116/440 (26%), Gaps = 68/440 (15%) Query: 21 IPKHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ------- 72 +P W+ V + ++ + I G V +Y+ + Sbjct: 100 LPLCWEWVRLPEIYLSISPSGSKLLSSAIKDAGTFPVVDQGQRYIAGYTDDAALLINLPG 159 Query: 73 -----SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST-------------QFLVL 114 D +T + + G G + + I+ D F VL Sbjct: 160 PVIVFGDHTTERKYIDFDFVAGADGVKILRPILQDEHFFFRQLQGYRLEERGYARHFKVL 219 Query: 115 QPKDV-LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 LP + + + V + + + + + +V +++ Sbjct: 220 NDNLYALPPIEEQHRIVAKVDELMALGDQLEQQQTDSLAAHQTLVETLLGTLTRVGSQQE 279 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM---------------- 217 A RI + + + KQ ++ V L P Sbjct: 280 FSAAWTRIASHFDTLFTTEASIDQLKQTILQLAVMGKLVPQDPNDEPASVLLGKIAKEKT 339 Query: 218 ---------KDSGIEWVGLVPDHWEVKPFFAL---------VTELNRKNTKLIESNILSL 259 K + +G + + +T+ + I L Sbjct: 340 RLFSAGEIRKQKSLFEIGEGEKPHPLPRSWEWARFGDISYQITDGAHHTPTYVNEGIPFL 399 Query: 260 SYGNIIQKLETRNMGLKP-----ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 S ++ + E G+++ + L Sbjct: 400 SVKDMSAGRLDFSDTRFISREQHEELIKRCFPQRGDLLLTKVGTTGIPI-LVDTDEEFSI 458 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 ++ A + + I YL+ L+ S + + G+ ++L + + +PP+ Sbjct: 459 FVSVALIKFPLNHIHGRYLSLLVSSPLVKRQSEEGTEGIGNKNLVLRKIAAFVLAIPPLA 518 Query: 374 EQFDITNVINVETARIDVLV 393 EQ I ++ A D L Sbjct: 519 EQHRIVAKVDELMALCDALK 538 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 64/192 (33%), Gaps = 15/192 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P WE + ++ +KL+ S I ++ + + G + Sbjct: 92 SEEEKPYALPLCWEWVRLPEIYLSISPSGSKLLSSAIKDAGTFPVVDQGQRYIAGYTDD- 150 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 I PG ++ K G+ ++P D + ++ Sbjct: 151 -AALLINLPGPVIVFGDHTTERKYIDFDFVAGADGV-----KILRPILQDEHFFFRQLQG 204 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 Y L + YA F+ + +PPI+EQ I ++ A D L ++ S Sbjct: 205 YRLEERGYAR--------HFKVLNDNLYALPPIEEQHRIVAKVDELMALGDQLEQQQTDS 256 Query: 400 IVLLKERRSSFI 411 + + + + Sbjct: 257 LAAHQTLVETLL 268 >gi|198282971|ref|YP_002219292.1| restriction modification system DNA specificity protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198247492|gb|ACH83085.1| restriction modification system DNA specificity domain [Acidithiobacillus ferrooxidans ATCC 53993] Length = 563 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 69/206 (33%), Gaps = 15/206 (7%) Query: 18 IGA------IPKHWKVVPIKRFT-KLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKD 67 IG +P+ W+ + ++ G + I ++ ++D+ +G + Sbjct: 358 IGEGEKPHPLPRSWEWARFGDISYQITDGAHHTPTYVNEGIPFLSVKDMSAGRLDFSDTR 417 Query: 68 GNSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPE 122 SR+ + +G +L K+G ++ +F S + + Sbjct: 418 FISREQHEELIKRCFPQRGDLLLTKVGTTGIPILVDTDEEFSIFVSVALIKFPLNHIHGR 477 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L + S V ++ E EG + + I + IPPLAEQ I K+ D Sbjct: 478 YLSLLVSSPLVKRQSEEGTEGIGNKNLVLRKIAAFVLAIPPLAEQHRIVAKVDELMALCD 537 Query: 183 TLITERIRFIELLKEKKQALVSYIVT 208 L A+V V Sbjct: 538 ALKVRLADAQTTQLHLADAIVERAVC 563 Score = 66.4 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 53/440 (12%), Positives = 116/440 (26%), Gaps = 68/440 (15%) Query: 21 IPKHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ------- 72 +P W+ V + ++ + I G V +Y+ + Sbjct: 102 LPLCWEWVRLPEIYLSISPSGSKLLSSAIKDAGTFPVVDQGQRYIAGYTDDAALLINLPG 161 Query: 73 -----SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST-------------QFLVL 114 D +T + + G G + + I+ D F VL Sbjct: 162 PVIVFGDHTTERKYIDFDFVAGADGVKILRPILQDEHFFFRQLQGYRLEERGYARHFKVL 221 Query: 115 QPKDV-LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 LP + + + V + + + + + +V +++ Sbjct: 222 NDNLYALPPIEEQHRIVAKVDELMALGDQLEQQQTDSLAAHQTLVETLLGTLTRVGSQQE 281 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM---------------- 217 A RI + + + KQ ++ V L P Sbjct: 282 FSAAWTRIASHFDTLFTTEASIDQLKQTILQLAVMGKLVPQDPNDEPASVLLGKIAKEKT 341 Query: 218 ---------KDSGIEWVGLVPDHWEVKPFFAL---------VTELNRKNTKLIESNILSL 259 K + +G + + +T+ + I L Sbjct: 342 RLFSAGEIRKQKSLFEIGEGEKPHPLPRSWEWARFGDISYQITDGAHHTPTYVNEGIPFL 401 Query: 260 SYGNIIQKLETRNMGLKP-----ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 S ++ + E G+++ + L Sbjct: 402 SVKDMSAGRLDFSDTRFISREQHEELIKRCFPQRGDLLLTKVGTTGIPI-LVDTDEEFSI 460 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 ++ A + + I YL+ L+ S + + G+ ++L + + +PP+ Sbjct: 461 FVSVALIKFPLNHIHGRYLSLLVSSPLVKRQSEEGTEGIGNKNLVLRKIAAFVLAIPPLA 520 Query: 374 EQFDITNVINVETARIDVLV 393 EQ I ++ A D L Sbjct: 521 EQHRIVAKVDELMALCDALK 540 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 64/192 (33%), Gaps = 15/192 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P WE + ++ +KL+ S I ++ + + G + Sbjct: 94 SEEEKPYALPLCWEWVRLPEIYLSISPSGSKLLSSAIKDAGTFPVVDQGQRYIAGYTDD- 152 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 I PG ++ K G+ ++P D + ++ Sbjct: 153 -AALLINLPGPVIVFGDHTTERKYIDFDFVAGADGV-----KILRPILQDEHFFFRQLQG 206 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 Y L + YA F+ + +PPI+EQ I ++ A D L ++ S Sbjct: 207 YRLEERGYAR--------HFKVLNDNLYALPPIEEQHRIVAKVDELMALGDQLEQQQTDS 258 Query: 400 IVLLKERRSSFI 411 + + + + Sbjct: 259 LAAHQTLVETLL 270 >gi|15603403|ref|NP_246477.1| HsdA [Pasteurella multocida subsp. multocida str. Pm70] gi|12721927|gb|AAK03622.1| HsdA [Pasteurella multocida subsp. multocida str. Pm70] Length = 435 Score = 66.7 bits (161), Expect = 7e-09, Method: Composition-based stats. Identities = 49/445 (11%), Positives = 117/445 (26%), Gaps = 73/445 (16%) Query: 32 RFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKL 91 + + ++ E+ G Y P G S D IF +L + Sbjct: 3 DIVNFLNAKRKP-------LSAKERENRKGIY-PYYGASDIVDYIDDYIFDGRYLLISED 54 Query: 92 GPYLR-----KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 G L+ A IA+ + ++ K + +L + GA Sbjct: 55 GENLKTRKTPIAFIAEGKFWVNNHAHIISGK---DDQTIDYLKYYFSNFDLMPFLTGAVQ 111 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 + I + P ++ + + + + +ID ++ + ++ Sbjct: 112 PKLSKGILEKIEIDFPCYEKRKRVNQFLGSLDNKIDLNTQTNQTLEQIAQAIFKSWFVDF 171 Query: 207 V--------------------------------------------TKGLNPDVKMKDSG- 221 K L + + S Sbjct: 172 DPVKAKVDVLANGGSQADAERAAMQVISGKTDAELTQMQQTQPDAYKTLEKNTALFPSEM 231 Query: 222 -IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP--- 277 +G VP W V V + S + + + + K Sbjct: 232 VESELGNVPKGWGVSTIGDSVQTVGGATPSTTNEEFWSNGHIHWTTPKDLSSAKDKILLN 291 Query: 278 ----ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + + G + + L + A I Y+ + S Y Sbjct: 292 TDRKITEAGLKKISSGLLPINTVLLSSRAPVGYLALTRIPVAINQGYIGIICSDKLSCYY 351 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 +L ++ + + + + VL+P ++ V +++ ++ + Sbjct: 352 VLQWCQANLDEIKGRASGTTFAEINKKTFREMRVLIPN----NELIKVYDLQVEKLYKKI 407 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ 418 + L+ R + + ++G+ Sbjct: 408 TENIIESKALENIRDALLPKLLSGE 432 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 55/197 (27%), Gaps = 12/197 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKY---LPKD 67 +G +PK W V I + G T + + I + +D+ S K + Sbjct: 236 LGNVPKGWGVSTIGDSVQTVGGATPSTTNEEFWSNGHIHWTTPKDLSSAKDKILLNTDRK 295 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + + +L P + + ++ + D L Sbjct: 296 ITEAGLKKISSGLLPINTVLLSSRAPV-GYLALTRIPVAINQGYIGIICSDKL-SCYYVL 353 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 I+ G T + + K + + IP + ++ +I I E Sbjct: 354 QWCQANLDEIKGRASGTTFAEINKKTFREMRVLIPNNELIKVYDLQVEKLYKKITENIIE 413 Query: 188 RIRFIELLKEKKQALVS 204 + L+S Sbjct: 414 SKALENIRDALLPKLLS 430 >gi|257094683|ref|YP_003168324.1| type I restriction-modification enzyme, specificity subunit [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257047207|gb|ACV36395.1| type I restriction-modification enzyme, specificity subunit [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 383 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 18/131 (13%), Positives = 51/131 (38%), Gaps = 7/131 (5%) Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLA 334 +Y+++ ++ G+ V + R ++ + ++ + +D+ +L Sbjct: 50 DTTYKSFHRLNAGDFVISSPKAWEGAVA-RISEEFDGWFLSPVFPTFRADAEKLDTRFLD 108 Query: 335 WLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 W + + + G+ R+S+ + + V +PP+ EQ I ++ + Sbjct: 109 WYCKRDAVWRQLQGKAKGMGARRESVSPDQFLSIEVPLPPLAEQQAIVARLDALAEKTRQ 168 Query: 392 LVEKIEQSIVL 402 VE + ++ Sbjct: 169 -VEAHQDAVEH 178 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 46/358 (12%), Positives = 101/358 (28%), Gaps = 37/358 (10%) Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQF--LV 113 G G + + + G + + + S F Sbjct: 37 GRGLFKRGPIMPLDTTYKSFHRLNAGDFVISSPKAWEGAVARISEEFDGWFLSPVFPTFR 96 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIR 171 + + L + V ++++ +G +I +P+PPLAEQ I Sbjct: 97 ADAEKLDTRFLDWYCKRDAVWRQLQGKAKGMGARRESVSPDQFLSIEVPLPPLAEQQAIV 156 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 ++ A + + V + ++ Sbjct: 157 ARLDALAEKTRQVEAH----------------QDAVEHDAEHLLALRFRDAIANAATRTM 200 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 EV P ++ + L +G K ++PG++ Sbjct: 201 AEVAPLVRREPSIDLNGSYPELGIRSFGKGTFHKPPLSGSEVGTK-----RLYCIEPGDL 255 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL----AWLMRSYDLCKVFY 347 +F ++ + ++ AQ + G S D T + + + + K+ Sbjct: 256 LFS--NVFAWEGAIAIAQPEDAGRFGSHRFITCQVHPDLTTVAFLRYYFLTDEGMLKIGE 313 Query: 348 AMGSGLRQS--LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 A G ++ L E + + V +P + Q + + E A + I ++ L Sbjct: 314 ASPGGAGRNRTLGLEKLMAIEVPLPTLTTQQAF-DRLQAEVAGLKAKHAAIRRASTAL 370 >gi|257438275|ref|ZP_05614030.1| type I restriction-modification enzyme, S subunit, EcoA family [Faecalibacterium prausnitzii A2-165] gi|257199237|gb|EEU97521.1| type I restriction-modification enzyme, S subunit, EcoA family [Faecalibacterium prausnitzii A2-165] Length = 271 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 65/176 (36%), Gaps = 9/176 (5%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQ 299 + I + NI + N + + + + P +I+ Sbjct: 34 PHGGKEAYCLEGISFVRSQNIGDFSFSANGLAHINNEQAKKLSNVELKPNDILLNITGDS 93 Query: 300 NDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + + ++ + + A + K + S+YL + ++ + A R +L Sbjct: 94 VARTCIIDSEYLPARVNQHVAIIRGKKDIVLSSYLLYFLQWKKKYLLQLASAGATRNALT 153 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 +++L + +P I++Q I ++ +I ++ ++ L+++ ++ ++ Sbjct: 154 KSMIEQLEIELPTIEQQRKIAGALD----KIQEKIKLNQKINDNLEQQAAALFSSL 205 >gi|240146117|ref|ZP_04744718.1| type I restriction-modification system, S subunit [Roseburia intestinalis L1-82] gi|257201770|gb|EEV00055.1| type I restriction-modification system, S subunit [Roseburia intestinalis L1-82] Length = 330 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 66/202 (32%), Gaps = 14/202 (6%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN-ILSLSYGNIIQKLETRNMGLK 276 K E VP+ W + L K+ + R + Sbjct: 78 KCIEDEIPFEVPEGWCWCRLRDICMMLAGKSKPADQIKSEYFEGSYPCFGGNGIRGYVDE 137 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 T+ IV + I+ + E ++T+ ++ + + ++ Sbjct: 138 YNQDGTFSIVGRQGALCGNIN-----VATGKFYATEHAVVTTLFVGIDFKWSN-----YI 187 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + + L K + L ++ + V +PP +EQ I N I+ I+V+ ++ Sbjct: 188 LEALRLNKY---ATGAAQPGLSVANILNVFVPIPPTQEQDRIGNNIDKSLKIIEVIEQEK 244 Query: 397 EQSIVLLKERRSSFIAAAVTGQ 418 + +S + A+ G+ Sbjct: 245 TDLQKNIITAKSKILDLAIRGK 266 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 61/196 (31%), Gaps = 13/196 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W ++ + G++ + + I E E + + + Sbjct: 87 EVPEGWCWCRLRDICMMLAGKSKPADQ----IKSEYFEGSYPCFGGNGIRGYVDEYN--- 139 Query: 80 IFAKGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G + G+ G +A + +V + ++L R+ Sbjct: 140 --QDGTFSIVGRQGALCGNINVATGKFYATEHAVVTTLFVGIDFKWSNYILEAL---RLN 194 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 GA I N+ +PIPP EQ I I I+ + E+ + + Sbjct: 195 KYATGAAQPGLSVANILNVFVPIPPTQEQDRIGNNIDKSLKIIEVIEQEKTDLQKNIITA 254 Query: 199 KQALVSYIVTKGLNPD 214 K ++ + L P Sbjct: 255 KSKILDLAIRGKLVPQ 270 >gi|261366731|ref|ZP_05979614.1| putative type I restriction modification system methylase [Subdoligranulum variabile DSM 15176] gi|282571558|gb|EFB77093.1| putative type I restriction modification system methylase [Subdoligranulum variabile DSM 15176] Length = 350 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 49/361 (13%), Positives = 102/361 (28%), Gaps = 55/361 (15%) Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKL----GPYLRKAII-ADFDGICSTQFLVLQP 116 +++P N +D S + +KG + L A+ D I S + + + Sbjct: 35 EFMPSVANVIGTDLSRYKLISKGLFACNPMHVGRDERLPIALYEKDNAAIVSPAYFMFEI 94 Query: 117 KDVL---PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 D E L W + + + +G+ W + I +P+PP Q+ + E Sbjct: 95 IDRDVLNEEYLMMWFRRPEFDRECWFMTDGSVRGGISWDDLCRIQLPVPPYERQLDVVES 154 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 A T RI V + + E Sbjct: 155 YRAITRRIAMKKEINDNL-------------EAVLAASHSKMFFSKDTSE---------- 191 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 L+T N K+ + I ++ + N+ ++ Sbjct: 192 HSKLGELMTFGNGKSRPKTDGPIPVYGGNGVLSYTDHHNI--------------ENAVLI 237 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + ++ + K + + + +F Sbjct: 238 GRVGA----YCGSVYLEQGICWVSDNAIFAKSKITKDEFFDYFL--LKRLNLFNHHVGTG 291 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +Q L E + + V P + EQ + + N + I + + I+ L+E ++ Sbjct: 292 QQLLTQEILNNIEVPKP-VTEQIE---LFNRKATSIFETIFTNSREIIRLQELSDLLLSR 347 Query: 414 A 414 Sbjct: 348 L 348 Score = 36.3 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 46/179 (25%), Gaps = 25/179 (13%) Query: 29 PIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + G++ ++ I V G G D ++ + +L Sbjct: 194 KLGELMTFGNGKSRPKTDGPIP------VYGGNGVLSYTDHHNI-----------ENAVL 236 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G++G Y + S + + K E + + + G Sbjct: 237 IGRVGAYCGSVYLEQGICWVSDNAIFAKSKITKDEFFDYF---LLKRLNLFNHHVGTGQQ 293 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 I I E + I I R I L+E L+S + Sbjct: 294 LLTQ----EILNNIEVPKPVTEQIELFNRKATSIFETIFTNSREIIRLQELSDLLLSRL 348 >gi|322517064|ref|ZP_08069949.1| hypothetical protein HMPREF9425_1226 [Streptococcus vestibularis ATCC 49124] gi|322124324|gb|EFX95832.1| hypothetical protein HMPREF9425_1226 [Streptococcus vestibularis ATCC 49124] Length = 381 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 76/405 (18%), Positives = 145/405 (35%), Gaps = 39/405 (9%) Query: 29 PIKRFTKLNTGRTSESGKDII-YIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + + T + S DI YI +++ ++ G+ + + + T V+ F K I Sbjct: 3 KLSNVSCYVTEKISVDSIDISEYITTDNLLQNKKGRVI-----AEKLPTQKVTRFKKNDI 57 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICEGAT 145 L + PYL+K AD DG S+ LV++P DV+ + + + +G Sbjct: 58 LIANIRPYLKKIWQADIDGGASSDVLVVRPNDVIDYNFLYYALTQDSFFEYVMKGSKGTK 117 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 M D I N +P + EQ+ I + + ID I + + L+ + L Y Sbjct: 118 MPRGDKSQIMNFVIPDLEIDEQIKIGKLL----KSIDQKIQINNQINQELEAMAKTLYDY 173 Query: 206 IVTKGLNPDV---KMKDSG------IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + PD K SG E +P+ W V+ + + K K+ ++I Sbjct: 174 WFVQFDFPDQNGKPYKSSGGKMVYNPELKREIPEGWGVESVGN-LLDKVTKAEKIENNSI 232 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + +I + + G G ++F + RG Sbjct: 233 EFIGEIPVIDQSQKFIAGFTNNE-NALLQAQDGHVIFGDHT----RVVKYINFDYARGAD 287 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + I + L ++ +DL YA F+ +K V+VP Sbjct: 288 GTQVLISNNENISNVLLYHMIEDFDLSNYGYAR--------HFKFLKEKTVIVPD----K 335 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++++ + I ++ L + R + + GQ+ + Sbjct: 336 EVSSKFETQANVIYEKIKNNIFENQELTQLRDWLLPMLMNGQVKV 380 >gi|289644884|ref|ZP_06476932.1| restriction modification system DNA specificity domain protein [Frankia symbiont of Datisca glomerata] gi|289505313|gb|EFD26364.1| restriction modification system DNA specificity domain protein [Frankia symbiont of Datisca glomerata] Length = 211 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 59/204 (28%), Gaps = 16/204 (7%) Query: 225 VGLVPDHWEVKPFFALVTELNRKN----------TKLIESNILSLSYGNIIQKLETRNMG 274 +G VPD W + + + + I + Sbjct: 5 IGPVPDTWHRLLLGDACQVQAGPSGATFRPADRASHGVRMVTPKSIQDDRIVADGCVTIR 64 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + G+IV + + +L + + + + YL Sbjct: 65 PEAADRMKRYALREGDIVCTRVG-NGRRHALAGPEHTGWLLGGACLFLRPHAAVLPRYLN 123 Query: 335 WLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 +R + + + ++ + LP+ +PP + Q I ++++ +D + Sbjct: 124 HYLRQPMVQDWLAQRVTGAVVPTVTAGTLGDLPLALPPWETQHAIADLLDA----LDEKI 179 Query: 394 EKIEQSIVLLKERRSSFIAAAVTG 417 I +E A +TG Sbjct: 180 SAHHAIIRSTEELGRVLAPALLTG 203 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 69/201 (34%), Gaps = 11/201 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGK-YLPKDGN 69 IG +P W + + ++ G + + + + + ++ Sbjct: 5 IGPVPDTWHRLLLGDACQVQAGPSGATFRPADRASHGVRMVTPKSIQDDRIVADGCVTIR 64 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPK-DVLPELLQG 126 +D +G I+ ++G R A+ + L L+P VLP L Sbjct: 65 PEAADRMKRYALREGDIVCTRVGNGRRHALAGPEHTGWLLGGACLFLRPHAAVLPRYLNH 124 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L V + GA + +G++P+ +PP Q I + + A +I Sbjct: 125 YLRQPMVQDWLAQRVTGAVVPTVTAGTLGDLPLALPPWETQHAIADLLDALDEKISAHHA 184 Query: 187 ERIRFIELLKEKKQALVSYIV 207 EL + AL++ V Sbjct: 185 IIRSTEELGRVLAPALLTGAV 205 >gi|312867160|ref|ZP_07727370.1| type I restriction modification DNA specificity domain protein [Streptococcus parasanguinis F0405] gi|311097289|gb|EFQ55523.1| type I restriction modification DNA specificity domain protein [Streptococcus parasanguinis F0405] Length = 381 Score = 66.4 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 52/392 (13%), Positives = 117/392 (29%), Gaps = 44/392 (11%) Query: 18 IGAI---PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + I P WK + K KL+ G+ LE+ + Y + N+ + Sbjct: 4 LEEIQNCPVEWKELGDKNVAKLSRGKVMSKQF------LEENKGEFPVYSSQTANNGEIG 57 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + I + G + +++ +LL ++ Sbjct: 58 RISSFEYDGEYITWTTDGANAGTVFYRKGKFSITNVCGLVEINS--NQLLTKFVYYYLTI 115 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G +G I +PI PL Q I + + T + L +E + Sbjct: 116 STKKYVSSGMGNPKLMSNVMGKIKIPILPLEIQEKIVQILDKMTEYVTELTSELTSELTS 175 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 K++ +++ G + + + L Sbjct: 176 RKKQYSFYRDKLLSFE---------------GEIYQVEWKVLKDVATLKNGKDWKALSSG 220 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I G + E Y P ++ R + N ++ ++ Sbjct: 221 EIPVYGSGGEMG-----------EFVSDYSYDKPTVLIPRKGSISNLFYLEKAFWNVDTI 269 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 Y + + Y + + + K+ + R SL + ++ + VP ++ Sbjct: 270 Y----YTEIDEKLVIPKYFYYYLTT---VKLEEMATNPTRPSLTQAILDKIRIPVPSLEI 322 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKER 406 Q I V++ + L + + I L +++ Sbjct: 323 QSRIIQVLDNFETVCNDLNIGLPKEIELRQKQ 354 >gi|301633427|gb|ADK86981.1| type I restriction modification DNA specificity domain protein [Mycoplasma pneumoniae FH] Length = 379 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 57/384 (14%), Positives = 109/384 (28%), Gaps = 32/384 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---STVSIFA 82 K IK + + GR I + + G Y + F Sbjct: 4 KTYKIKDICEASHGRE---------INTKYLRENQGIYPVYSSATSNEGEMGRIKTYDFD 54 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 + + Y + S+ + K + E+ +L + + + Sbjct: 55 GEYVTWTTRWSYAGSIYYRNGKFSASSNCGI--LKVLNKEINPKFLAYALKKEAKKFVNT 112 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + + IP+ PPL Q I + T EL E L Sbjct: 113 TSAIPILRTQKVVEIPIDFPPLQIQEKIATILDTFTELSAE--LSAELSAELSAELSAEL 170 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 D + P +W+ + + + E+ +K E Sbjct: 171 RERKKQYAFYRDYLL----------NPKNWKEENKYYKLGEIAQKVLVGGEKPADFSKEK 220 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N + K + K E + Y E + + ++ ++ Sbjct: 221 NEVYKYPILSNNSKAEEFLVYSKTFRVEEKSITVSARGTIGAVFYRDFAYLPAVSLICFV 280 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 D +L +R+ K A G L K + VP +K+Q +I ++ Sbjct: 281 P-KEEFDIRFLFHALRAIKFKKQGSATGQ-----LTVAQFKEYGIHVPSLKKQKEIAAIL 334 Query: 383 NVETARIDVLVEKIEQSIVLLKER 406 + + L E I I L K++ Sbjct: 335 DPLYSFFTDLNEGIPAEIELCKKQ 358 >gi|260910284|ref|ZP_05916959.1| type I restriction-modification system S subunit [Prevotella sp. oral taxon 472 str. F0295] gi|260635586|gb|EEX53601.1| type I restriction-modification system S subunit [Prevotella sp. oral taxon 472 str. F0295] Length = 279 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 77/233 (33%), Gaps = 19/233 (8%) Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG------- 226 + + I + + + K++ ++ V + + + G + Sbjct: 49 LEGTAQELHEQIKSEKQSLVKEGKLKKSALTDSVIFKGDDNKYYEQVGKNCIDITDKIPF 108 Query: 227 LVPDHWEVKPFFALVTELNR----------KNTKLIESNILSLSYGNIIQKLETRNMGLK 276 +P++W + K T ++ N + K+ N Sbjct: 109 EIPNNWVWTRLSDVADIYTGNSISETEKNAKYTNVVGRNYIGTKDVGFDNKVFYNNGVAI 168 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 P+ YE + + + ++ + A + + + + P Y+ + Sbjct: 169 PKEYEQNFRIALKNSIL--MCIEGGSAGRKVAILNQDVCFGNKLCCLSPFIEIGKYIYFY 226 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 ++S +F +G+ + VK + + +PP+KEQ I + + AR+ Sbjct: 227 LQSPSFIGMFNQNKAGIIGGVSIAKVKDILIPLPPLKEQCRIIHRLEELYARL 279 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 55/172 (31%), Gaps = 11/172 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI---------IYIGLEDVESGTGKYLPKDGNS 70 IP +W + + TG + + YIG +DV + Sbjct: 109 EIPNNWVWTRLSDVADIYTGNSISETEKNAKYTNVVGRNYIGTKDVGFDNKVFYNNGVAI 168 Query: 71 RQSDTSTVSIFAKGQILYGKL-GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + I K IL G RK I + D + L P + + + +L Sbjct: 169 PKEYEQNFRIALKNSILMCIEGGSAGRKVAILNQDVCFGNKLCCLSPFIEIGKYIYFYLQ 228 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 S G + + +I +P+PPL EQ I ++ R+ Sbjct: 229 SPSFIGMFNQNKAG-IIGGVSIAKVKDILIPLPPLKEQCRIIHRLEELYARL 279 >gi|325973653|ref|YP_004250717.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323652255|gb|ADX98337.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 395 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 19/141 (13%), Positives = 53/141 (37%), Gaps = 8/141 (5%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 ++ ++ + + K +L + + + + P D+ Sbjct: 60 KRHFNYRGLKSNKLFPKNTVCIVRVGGSVGKTALLKRESCLTEHV--YFFSSYPKISDNK 117 Query: 332 YLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 ++ + + ++ + + S + L + +K + PP +EQ I + ++ Sbjct: 118 FIKYCLNFSNISEKIICLSKSSTAQPVLSLQKLKIIKFPCPPQEEQERIGDTLSA----Y 173 Query: 390 DVLVEKIEQSIVLLKERRSSF 410 D L+E E+ I +L+ R++ Sbjct: 174 DELIENNERQIEVLQGIRTAI 194 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 49/402 (12%), Positives = 114/402 (28%), Gaps = 40/402 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSE----------SGKDIIYIGLEDVESGTGKYLPKDGNS 70 I WK+V I + ++ +G+ I + E V + + Sbjct: 4 ISNEWKLVTIDQLGRVESGKPLPCRVEDSHLLFEDGFIPLVDGEAVSNSNLYIRKCKRHF 63 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + +F K + ++G + K + + + + + Sbjct: 64 NYRGLKSNKLFPKNTVCIVRVGGSVGKTALLKRESCLTEHVYFFSSYPKISDNKFIKYCL 123 Query: 131 IDVTQRIEAIC---EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + IC + + I P PP EQ I + + A I+ + Sbjct: 124 NFSNISEKIICLSKSSTAQPVLSLQKLKIIKFPCPPQEEQERIGDTLSAYDELIENNERQ 183 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV--PDHWEVKPFFALVTELN 245 + A+ P+ ++ E PD W+ + + T Sbjct: 184 IEVLQGIRT----AIFKEWFVNFGFPNYLTYEAERERERESSLPDSWQYQKIEEIATITK 239 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + + + +K R ++ I + + Sbjct: 240 GEKSAKLSVKDGKYPFFTSSEKSPERINEYSWDAES---------IFINYTGNFVAQLYR 290 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + + +L L+ + F++ + L+ + L Sbjct: 291 GKFDASDNCWV--------IIPKNKKFLYLLLETIIYSLPFFSSNCFGMKVLRSNLLFGL 342 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 VL+P IK N I + +E ++++I L++ + Sbjct: 343 NVLIPDIKT----LEKFNNICEFIQLKIENLQKNIERLEKIK 380 >gi|315173019|gb|EFU17036.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX1346] Length = 171 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 53/170 (31%), Gaps = 11/170 (6%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 F E + + + + Y+ Y IV Sbjct: 13 FKGFTDEWEERKLGEVYNFQYGQFNNNPDNGGQYPIYGANGIIGGYDEYN--SENAIVIG 70 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + A+ ++S + +L+ S ++ K+ + Sbjct: 71 HMGA--YAGHVLWAEGKHFVTYNGTMGIADKSILNSNFGYYLVVSVNVPKL---TAGSGQ 125 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + + D+ + +L+P I+EQ I + ++D + ++ + LLK Sbjct: 126 PFVSYSDLNGIKILIPTIEEQQKIGSF----FKQLDNTITLHQRKLDLLK 171 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 49/173 (28%), Gaps = 30/173 (17%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGR---TSESGKDIIYIGLEDVESGTGKYLPKDG 68 +P+ W+ + G+ ++G G + G +Y ++ Sbjct: 7 KVPELRFKGFTDEWEERKLGEVYNFQYGQFNNNPDNGGQYPIYGANGIIGGYDEYNSENA 66 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 I+ G +G Y + A+ + + + G+ Sbjct: 67 -----------------IVIGHMGAYAGHVLWAEGKHFVTYNGTMGIADKSILNSNFGYY 109 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L + V + G+ + + I + IP + EQ I I Sbjct: 110 LVVSVNVP--KLTAGSGQPFVSYSDLNGIKILIPTIEEQQKIGSFFKQLDNTI 160 >gi|38234848|ref|NP_940615.1| putative type I restriction/modification system DNA specificity protein [Corynebacterium diphtheriae NCTC 13129] gi|38201112|emb|CAE50836.1| Putative type I restriction/modification system DNA specificity protein [Corynebacterium diphtheriae] Length = 414 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 51/144 (35%), Gaps = 9/144 (6%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV---K 324 ++ + + + + V G++V + + + + + Sbjct: 48 SDSEFVDQRFDKAIGRKTVRLGDVVITTKGTVGRVAEVSKVPNAGLAVYSPQVCYLRSLQ 107 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P + YL +L+ S S + L D K + V +P +++Q I +V+ Sbjct: 108 PSILHQRYLKYLLMSPATKYSISTFASSSDMAPYLSLSDFKSMVVDLPSLEDQRAIADVL 167 Query: 383 NVETARIDVLVEKIEQSIVLLKER 406 +D + + ++ I L ++ Sbjct: 168 GA----LDDKIAENQRVIQLSEQL 187 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 51/402 (12%), Positives = 116/402 (28%), Gaps = 38/402 (9%) Query: 33 FTKLNTGRTSESGK----DIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 +L G ++ + + DV E G + + R G ++ Sbjct: 13 VLELGDGYRTKRSELSQFGYAIVRAGDVVEIGVTASDSEFVDQRFDKAIGRKTVRLGDVV 72 Query: 88 YGKLGPYLRKAIIADFD----GICSTQ--FLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G R A ++ + S Q +L +L + +LL T+ + Sbjct: 73 ITTKGTVGRVAEVSKVPNAGLAVYSPQVCYLRSLQPSILHQRYLKYLLMSPATKYSISTF 132 Query: 142 EGATMS--HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 ++ + ++ + +P L +Q I + + A +I +L Sbjct: 133 ASSSDMAPYLSLSDFKSMVVDLPSLEDQRAIADVLGALDDKIAENQRVIQLSEQLAMTFY 192 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ T+ + D + G P + + + ++ L Sbjct: 193 RS------TEKSESSLTFADVAGIYGGGTPSTKNPDFWDGEIRWATPTDITALKGPWLC- 245 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 R++ + S + + G I+ + Sbjct: 246 --------GTARSITEEGLSKSSGSLHPEGSILMTSRATVGHVAFI-----DAPTTTNQG 292 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ++ + P +L + ++ + +A G L K+LP E Sbjct: 293 FINLVPQEAYRYWLYFQLKQRTSEFIAWANG-ATFLELSRGTFKKLPFQACAETE----L 347 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 N A + V K ++ +L R + + G+I + Sbjct: 348 EKFNSVVAPLMKRVLKAQKENQVLAATRDELLPLLMNGKITV 389 >gi|227529080|ref|ZP_03959129.1| possible restriction modification system DNA specificity subunit [Lactobacillus vaginalis ATCC 49540] gi|227351005|gb|EEJ41296.1| possible restriction modification system DNA specificity subunit [Lactobacillus vaginalis ATCC 49540] Length = 217 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 54/163 (33%), Gaps = 9/163 (5%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R T + + ++ +I + + + + + V P + V K ++ Sbjct: 56 RNWTNDKRNGHIWITPTDINKSIIIDSERYLSDKGWSKARVVPKDSVLITSIASIGKNAI 115 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + I + + +++Y + + + + G + + Sbjct: 116 NAIEAAFNQQINALII-----QNNNSYFVLMAMTREKQRFEALAGQTATPIINKSTLSSF 170 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +P KEQ I N ++D L+ + + L + + Sbjct: 171 TIKLPSKKEQDKIGNF----FKQLDSLITLHQCKLNQLSKMKK 209 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 51/191 (26%), Gaps = 15/191 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 + W + K+ TG T + I+I D+ + + + Sbjct: 31 ETWDQRKLSELGKVFTGNTPSTKDVRNWTNDKRNGHIWITPTDINKSIIIDSERYLSDKG 90 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S + K +L + + AI A + + Sbjct: 91 W--SKARVVPKDSVLITSIASIGKNAINAIEAAFNQQ---INALIIQNNNSYFVLMAMTR 145 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 QR EA+ + + + + +P EQ I I + + Sbjct: 146 EKQRFEALAGQTATPIINKSTLSSFTIKLPSKKEQDKIGNFFKQLDSLITLHQCKLNQLS 205 Query: 193 ELLKEKKQALV 203 ++ K Q + Sbjct: 206 KMKKFYLQKMF 216 >gi|209554404|ref|YP_002284454.1| restriction-modification enzyme subunit s3b [Ureaplasma urealyticum serovar 10 str. ATCC 33699] gi|209541905|gb|ACI60134.1| restriction-modification enzyme subunit s3b [Ureaplasma urealyticum serovar 10 str. ATCC 33699] Length = 406 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 53/412 (12%), Positives = 120/412 (29%), Gaps = 49/412 (11%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + +K G T S + I +G Y+ ++ Sbjct: 3 IKLKDIIYAKRGSTITSNEFKINPGSYPLISASAQNNGVFGYI------------NYYMY 50 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ-RIEAI 140 G I G D S ++ + + + +I+++ Sbjct: 51 EGGHITISMNGNAGCVFYQKDKFSANSDVLVLSNIDNKISNNKFIFYWLKKHENTKIKSL 110 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK- 199 C+G T + N+ + +PP+ EQ I I +K Sbjct: 111 CKGTTRLRLSNDDVLNLEINLPPIEEQNAIISIIEPHEKLFIKYSNLVDISSVENTKKDV 170 Query: 200 -----------------QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 +A+ + T N V E + F ++ Sbjct: 171 DNLISIIEPIERIIKNLKAIKYKLETIMNNFFVVFYLFNNEENSNKYKLRNIGKFKGGIS 230 Query: 243 ELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 L++ N + N + + +I + E IV G+++ Sbjct: 231 TLDKNNYDSGINFINYMDIYKNFVINDDIKLRLYNASEKDIKSYIVSYGDLLLTASSETK 290 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSG-LRQS 356 ++ + S + + I + + + + + Y A+ RS K + +G R + Sbjct: 291 EEIAFSSVYLSNKQAIFNGFSKIYKYDQNILLPIYAAFYFRSEFFRKEVIKLATGYTRFN 350 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNV------INVETARIDVLVEKIEQSIVL 402 L +D K + + + + Q + + ++ + +I+ ++ I Sbjct: 351 LSIKDAKNIEISINNFEFQKKFSKIVEPLLNLSTKANKIEKILNDSLLKITK 402 >gi|218960818|ref|YP_001740593.1| Restriction modification system DNA specificity domain:N-6 DNA methylase:Type I restriction-modification system, M subunit [Candidatus Cloacamonas acidaminovorans] gi|167729475|emb|CAO80386.1| Restriction modification system DNA specificity domain:N-6 DNA methylase:Type I restriction-modification system, M subunit [Candidatus Cloacamonas acidaminovorans] Length = 837 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 48/387 (12%), Positives = 108/387 (27%), Gaps = 59/387 (15%) Query: 26 KVVPIKRFT----KLNTGRTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 V + + +G + + YI + D++ L + S + + Sbjct: 484 DTVRLDEICVKKAQYGSGAAKTDYDGETRYIRITDIDDDG--NLKDNDIVSPSVINEKYL 541 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ---PKDVLPELLQGWLLSIDVTQRI 137 + +L+ + G R I +L+ LP + S I Sbjct: 542 LNEDDLLFARSGSVGRVYIHRQKGRFIFAGYLIRFVLDKNKALPRFIFYLTKSEYYANWI 601 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + T+S+ + + ++ +P+PPL+ Q I ++ Sbjct: 602 IKQSKTGTISNINAQQYSSLRIPLPPLSVQEEIVAELD---------------------- 639 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q ++ K W + E + I+++ Sbjct: 640 SYQKIIDGA-----------KQVVDNWKPHIDIDPEWDSYPYKEIFTTLTAPMKIQTSEY 688 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 S S I + + ++ ++F + S Q + Sbjct: 689 SSSGAYPIIDQSMHEIAGWTDDERALVRIEKPVVIFGDHTCRIKYISKNFCQGAD----- 743 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + YL + + ++ + Y F + + +P + Q Sbjct: 744 GIKILSTTENVIPKYLYYYLLAFPITPQGYNR--------HFSKLVEKEISIPELDVQQI 795 Query: 378 ITNVINVETARID---VLVEKIEQSIV 401 I + I E ++ L+ EQ I Sbjct: 796 IVSRIESEQKLVEANRKLIALFEQKIK 822 >gi|94986115|ref|YP_605479.1| restriction modification system DNA specificity subunit [Deinococcus geothermalis DSM 11300] gi|94556396|gb|ABF46310.1| restriction modification system DNA specificity domain [Deinococcus geothermalis DSM 11300] Length = 417 Score = 66.4 bits (160), Expect = 9e-09, Method: Composition-based stats. Identities = 47/428 (10%), Positives = 121/428 (28%), Gaps = 46/428 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + + G+ GL + E +P G++ ++ Sbjct: 4 EWIDTTVGEIAPFSYGK-----------GLPERERKQTGSVPVYGSNGIVGFHDSALTGG 52 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I+ G+ G T F V L L S+ + E + Sbjct: 53 PTIVIGRKGTVGAVHYSPIPCWPIDTTFFVSDSDRSLVRYSYYLLKSLGL----ENMNAD 108 Query: 144 ATMSHADWKGIG-NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + I + AEQ I + +I+ + + + +A Sbjct: 109 SAVPGLNRDAAHARIVLIPRDKAEQRAIAHILGTLDDKIELNRKQSETLEAMARALFKAW 168 Query: 203 VSYI-------------------VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 + L + E +G +P+ W V F + + Sbjct: 169 FVDFEPVRAKMEGRWQRGQSLPGLPAHLYDLFPDRLVDSE-LGEIPEGWRVFAFGDVAQQ 227 Query: 244 LNRKNTKLIESNILSLSYG-NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 L Y ++ V G ++ ++ + Sbjct: 228 GKGVVNPGNSPQDLFTHYSLPAFDSAHCPSIEPGHAIKSNKTPVPDGAVLVSKLNPHIPR 287 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGL---RQSLK 358 + ++ ++ P ++ + + S + + + +G Q +K Sbjct: 288 VW-HVGTAGPNAVCSTEFIVWAPKAPANSAFLYCLASSPEFSGAMHQLVTGTSNSHQRVK 346 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + ++ + V + I + ++ +++ +QS L + R + + ++G+ Sbjct: 347 PDQLREIRVF---AATENAIEAFSEWVRSPLEKILQNRQQSRT-LAQLRDALLPRLISGE 402 Query: 419 IDLRGESQ 426 + + + Sbjct: 403 LRIADAEK 410 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 61/199 (30%), Gaps = 10/199 (5%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS +G IP+ W+V + G + + + + P Sbjct: 206 DSE---LGEIPEGWRVFAFGDVAQQGKGVVNPGNSPQDLFTHYSLPAFDSAHCP-SIEPG 261 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVLQPKDVLPELL-QGW 127 + S + G +L KL P++ + G +CST+F+V PK Sbjct: 262 HAIKSNKTPVPDGAVLVSKLNPHIPRVWHVGTAGPNAVCSTEFIVWAPKAPANSAFLYCL 321 Query: 128 LLSIDVTQRIEAICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 S + + + + G + + I + E + + +I Sbjct: 322 ASSPEFSGAMHQLVTGTSNSHQRVKPDQLREIRVFAATENAIEAFSEWVRSPLEKILQNR 381 Query: 186 TERIRFIELLKEKKQALVS 204 + +L L+S Sbjct: 382 QQSRTLAQLRDALLPRLIS 400 >gi|302347049|ref|YP_003815347.1| hypothetical protein HMPREF0659_A7328 [Prevotella melaninogenica ATCC 25845] gi|302150605|gb|ADK96866.1| conserved hypothetical protein [Prevotella melaninogenica ATCC 25845] Length = 382 Score = 66.4 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 45/346 (13%), Positives = 93/346 (26%), Gaps = 42/346 (12%) Query: 76 STVSIFAKGQILYGK----LGPYLRKAIIADFDG---ICSTQFLVLQPKDV--LPELLQG 126 + +G I + + + +G IC + + + Sbjct: 55 KNYELCQEGDIAFADASEDTNEVAKAVEFYNLNGKDVICGLHTIHGRDNQHKTIVGYKGY 114 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 S Q+I I +G + + K + IP EQ I + RI T Sbjct: 115 AFSSTAFHQQIRRIAQGTKIYSINSKNFSECYIGIPSKGEQKKIATLLRLIDERISTQNK 174 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + L+K + ++++D E L Sbjct: 175 IIDKLESLIKGICNNYFLKLSHSQEMKSIRLRDILKERNEYCCKDGTFVHGTLSKDGLFP 234 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K + L E + Y+I +I + +L K + Sbjct: 235 KTERW-------------------NRDFLVKEENKKYKITHLDDICYNPANL---KFGVI 272 Query: 307 SAQVMERGIITSAYMAV-KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDV 362 + I + Y+ ++ ++ + + + + G R S+ ED Sbjct: 273 CRNIYGDLIFSPIYVTFEISKKVNIGFIELYLTNRNFIEKIRKFEQGTVYERMSVSPEDF 332 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +P + EQ +I L + + L + Sbjct: 333 LSYKIRIPSLSEQT-------FFYQKIQRLKNCSQNELEHLNLYKK 371 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 41/136 (30%), Gaps = 10/136 (7%) Query: 281 ETYQIVDPGEIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW--- 335 + Y++ G+I F D +++ + + +I + T + + Sbjct: 55 KNYELCQEGDIAFADASEDTNEVAKAVEFYNLNGKDVICGLHTIHGRDNQHKTIVGYKGY 114 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 S + + G S+ ++ + +P EQ I ID + Sbjct: 115 AFSSTAFHQQIRRIAQGTKIYSINSKNFSECYIGIPSKGEQKKIA----TLLRLIDERIS 170 Query: 395 KIEQSIVLLKERRSSF 410 + I L+ Sbjct: 171 TQNKIIDKLESLIKGI 186 >gi|332877050|ref|ZP_08444801.1| hypothetical protein HMPREF9074_00527 [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684940|gb|EGJ57786.1| hypothetical protein HMPREF9074_00527 [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 394 Score = 66.4 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 66/196 (33%), Gaps = 11/196 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQI 285 + + + + K + ++ ++ +K ES ++Y + Sbjct: 17 ELLEFYSTNSLSWEQLDYGNGIIKNLHYGLIHKGLPTMVDISSDLLPYIKSESMPKSYTL 76 Query: 286 VDPGEIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW---LMRSY 340 G++ F D + +++ E+ I++ + D T + + S Sbjct: 77 FLNGDVAFADASEDTNDVAKAVEIVNCDEQQIVSGLHTIHGRDKSDLTVIGYKGYAFASD 136 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 K + G S+ + + V +P EQ I ++ ID+ + + Sbjct: 137 SFHKQIRRIAQGTKVFSINVRNFDEVRVGIPSKDEQIKIAKLLRA----IDLRIATQNKI 192 Query: 400 IVLLKERRSSFIAAAV 415 I LK+ +S+ I Sbjct: 193 IEDLKKLKSAIIDKLF 208 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 61/395 (15%), Positives = 122/395 (30%), Gaps = 61/395 (15%) Query: 24 HWKVVPIKRFTKLNT-----------GRTSESGKDIIYI-----GLEDVESGTGKYLPKD 67 WK+V + + + G I + D+ S Y+ + Sbjct: 9 EWKIVKVSELLEFYSTNSLSWEQLDYGNGIIKNLHYGLIHKGLPTMVDISSDLLPYIKSE 68 Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE----- 122 + + ++F G + + A C Q +V + Sbjct: 69 SMPK-----SYTLFLNGDVAFADASEDTNDVAKAVEIVNCDEQQIVSGLHTIHGRDKSDL 123 Query: 123 ----LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 S ++I I +G + + + + + IP EQ+ I + + Sbjct: 124 TVIGYKGYAFASDSFHKQIRRIAQGTKVFSINVRNFDEVRVGIPSKDEQIKIAKLL---- 179 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 ID I + + IE LK+ K A++ + ++ Sbjct: 180 RAIDLRIATQNKIIEDLKKLKSAIIDKLFDNLEGERCTYRELF----------------- 222 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + K+ + S YG + + + S TY+I+ G+ V Sbjct: 223 -QIVNDRNKDFHFNKVIAASQEYGMVERDTLNLKVQFDESSINTYKIIRTGDYVVYLRSF 281 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY--LAWLMRSYDLCKVFYAMGSGLR-- 354 Q A GI + AY+ ++P+ +Y L + S + G+R Sbjct: 282 QG-----GFAFSELDGICSPAYIILRPNTRILSYGFLRYYFVSQPFINSLRLVTYGIRDG 336 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +S+ E+ +P+ +P Q I I ++ Sbjct: 337 RSINVEEWMDMPISIPDKSIQDQILKTIQSIDNKL 371 >gi|154492483|ref|ZP_02032109.1| hypothetical protein PARMER_02117 [Parabacteroides merdae ATCC 43184] gi|254881868|ref|ZP_05254578.1| type I restriction-modification system specificity protein [Bacteroides sp. 4_3_47FAA] gi|154087708|gb|EDN86753.1| hypothetical protein PARMER_02117 [Parabacteroides merdae ATCC 43184] gi|254834661|gb|EET14970.1| type I restriction-modification system specificity protein [Bacteroides sp. 4_3_47FAA] Length = 248 Score = 66.4 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 92/233 (39%), Gaps = 12/233 (5%) Query: 192 IELLKEKKQALV-SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + L+++ QAL S+ V D + DS +G++P W V + ++ K Sbjct: 20 NDNLEQQAQALFKSWFVDFEPFKDGEFVDS---ELGMIPKGWRVVCLGEVTKQVTEKVGN 76 Query: 251 LIESNILS-LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + +LS ++ G ++ E + ++ Y IV+P + F + + + S+ + Sbjct: 77 REDVTVLSPVNSGELVLSEEYFTKQVFSKNLSKYLIVNP--LSFAYNPARINIGSIGLNE 134 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVL 368 G ++ Y+ K + + R+ G+RQSL ++D + + Sbjct: 135 YDFVGCVSPVYVVFKCEPNYHYFFDFYKRTAVFKDEVALRAIGGVRQSLGYDDFSLIKTI 194 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P + N ++ ++ + + L R S + ++G++ + Sbjct: 195 YPTPD----VVAEFNNLYLKMKEVITRNDIQNNKLTTLRDSLLPKLMSGELKI 243 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 64/204 (31%), Gaps = 14/204 (6%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS +G IPK W+VV + TK T + +D+ + V SG + + Sbjct: 48 DSE---LGMIPKGWRVVCLGEVTKQVTEKVGNR-EDVTVLSP--VNSGELVLSEEYFTKQ 101 Query: 72 --QSDTSTVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGW 127 + S I Y + + DF G S ++V + + + Sbjct: 102 VFSKNLSKYLIVNPLSFAYNPARINIGSIGLNEYDFVGCVSPVYVVFKCEPNYHYFFDFY 161 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + G + I + + + +++ +IT Sbjct: 162 KRTAVFKDEVALRAIGGVRQSLGYDDF----SLIKTIYPTPDVVAEFNNLYLKMKEVITR 217 Query: 188 RIRFIELLKEKKQALVSYIVTKGL 211 L + +L+ +++ L Sbjct: 218 NDIQNNKLTTLRDSLLPKLMSGEL 241 >gi|167752727|ref|ZP_02424854.1| hypothetical protein ALIPUT_00987 [Alistipes putredinis DSM 17216] gi|167659796|gb|EDS03926.1| hypothetical protein ALIPUT_00987 [Alistipes putredinis DSM 17216] Length = 372 Score = 66.4 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 50/368 (13%), Positives = 114/368 (30%), Gaps = 46/368 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + + R + I I S + ++ N + + A Sbjct: 7 KWVRLGDYIEQCDERNHSNKYGIEAIK---GISTSKTFIDTKANLDGVPLQSYKLVAPRY 63 Query: 86 ILY----GKLGPYLRKAIIADFD-GICSTQFLVLQPKDVL---PELLQGWLLSIDVTQRI 137 Y + G + A + S+ + V + + PE L + L + + Sbjct: 64 FAYVPDTSRRGDKVALAFNDSSCTYLISSIYCVFKVSVLDKLSPEYLYLFFLRPEFDRYA 123 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G+ W + N +P+P + EQ + + + L + Sbjct: 124 RYNSWGSAREVFSWGNMCNTMIPLPTITEQQKVVN----AWKAFREIKEQNEAKAAPLMQ 179 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 Q+ + + K ++ +G + + + V + Sbjct: 180 VCQSYIQELKHKY----------PLQEIGPYIEECDERNVDLSVRLSQGIANTKVFQAPK 229 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++ + K IV G+ + +N ++ + + ++ Sbjct: 230 QVALNSKSDK-----------------IVRTGQFGYNRATTRNGEKISIAYRTGADCTVS 272 Query: 318 SAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIK 373 SAY K D +L + + + M G + +F+++ R+ + +PPI+ Sbjct: 273 SAYGVFKITNEDIIEPYFLWMWVSRPEFDRYARYMSKGSAHEFFEFDEMCRVKIPLPPIE 332 Query: 374 EQFDITNV 381 Q I N+ Sbjct: 333 IQRAIVNI 340 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 60/187 (32%), Gaps = 15/187 (8%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + + + + + +N + I ++ + + L ++Y++V P Sbjct: 5 NVKWVRLGDYIEQCDERN-HSNKYGIEAIKGISTSKTFIDTKANLDGVPLQSYKLVAPRY 63 Query: 291 IVFR-FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID---STYLAWLMRSYDLCKVF 346 + + DK +L +I+S Y K +D YL + + Sbjct: 64 FAYVPDTSRRGDKVALAFNDSSCTYLISSIYCVFKVSVLDKLSPEYLYLFFLRPEFDRYA 123 Query: 347 YAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV--------LVEKIE 397 R+ + ++ + +P I EQ + N I L++ + Sbjct: 124 RYNSWGSAREVFSWGNMCNTMIPLPTITEQQKVVNAW-KAFREIKEQNEAKAAPLMQVCQ 182 Query: 398 QSIVLLK 404 I LK Sbjct: 183 SYIQELK 189 >gi|312898353|ref|ZP_07757743.1| type I restriction modification DNA specificity domain protein [Megasphaera micronuciformis F0359] gi|310620272|gb|EFQ03842.1| type I restriction modification DNA specificity domain protein [Megasphaera micronuciformis F0359] Length = 185 Score = 66.4 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 60/166 (36%), Gaps = 12/166 (7%) Query: 25 WKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLP---KDGNSRQSDT 75 W+ + + G T + DI + VE G+ +Y+ + + Sbjct: 19 WEQRKLGEVADIIGGGTPSTSFADYWDGDIDWYSP--VEIGSNRYVSDSIRKITKLGLEK 76 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S+ I G +L+ AI+ G + F + P+ + + + L+ + + Sbjct: 77 SSTKILPVGTVLFTSRAGIGNTAILRKE-GCTNQGFQSIIPRKNILDTYFLYTLTPQLKR 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 E + G+T + + +P+ IP L EQ + + I Sbjct: 136 YGELMGAGSTFVEVSGRQMEKMPLNIPSLEEQKKVGKLFEILDDSI 181 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/151 (12%), Positives = 46/151 (30%), Gaps = 10/151 (6%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 I+ N R + + +I+ G ++F + LR Sbjct: 44 WDGDIDWYSPVEIGSNRYVSDSIRKITKLGLEKSSTKILPVGTVLFTSRAGIGNTAILRK 103 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLP 366 G + ++ P + L + MG+G + ++++P Sbjct: 104 -----EGCTNQGFQSIIPRKNILDTYFLYTLTPQLKRYGELMGAGSTFVEVSGRQMEKMP 158 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + +P ++EQ + +D + + Sbjct: 159 LNIPSLEEQKKVG----KLFEILDDSITLHQ 185 >gi|77415160|ref|ZP_00791180.1| Type I restriction modification DNA specificity domain protein [Streptococcus agalactiae 515] gi|77158789|gb|EAO70080.1| Type I restriction modification DNA specificity domain protein [Streptococcus agalactiae 515] Length = 271 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 5/159 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 IPK W+ V + + LN + K D + +ED+E TG+ + K+ + +S + Sbjct: 110 IPKSWEWVRLGNISSLNFFSSISGDKIPNDSWVLDMEDIEKETGRLVRKNYKTEKSSYKS 169 Query: 78 VSI-FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I F+K ILY KL P L+K II+D +G +T+ L ++ + + + Sbjct: 170 NKISFSKDTILYAKLRPNLKKVIISDENGFATTELLPIKVFGNISLDYIRYCMISPFYYF 229 Query: 137 I-EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 G M + + +P+PPL EQ I KI Sbjct: 230 NIIQSVYGVKMPRVSSGFLNSTLLPLPPLTEQQRIVSKI 268 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 62/208 (29%), Gaps = 12/208 (5%) Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA- 239 + I + + + K+ + +V + + K E +P WE Sbjct: 67 LLEKIKAEKQKLYEEGKLKKKDLEELVVTKGDDNSPYK----EVPYNIPKSWEWVRLGNI 122 Query: 240 ----LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + ++ + +L +N + SY++ +I + + Sbjct: 123 SSLNFFSSISGDKIPNDSWVLDMEDIEKETGRLVRKNYKTEKSSYKSNKISFSKDTILYA 182 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-R 354 N K+ + S + T I Y+ + M S G+ Sbjct: 183 KLRPNLKKVIISDENG--FATTELLPIKVFGNISLDYIRYCMISPFYYFNIIQSVYGVKM 240 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + + +PP+ EQ I + I Sbjct: 241 PRVSSGFLNSTLLPLPPLTEQQRIVSKI 268 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 14/53 (26%), Positives = 22/53 (41%), Gaps = 4/53 (7%) Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 PP+ EQ I I A++ E + L KE + S + A+ G+ Sbjct: 1 PPLXEQKRIVAQIEKALAKVXEYAESYNKLXQLDKEFPDKLKKSILQYAMQGK 53 >gi|269978358|gb|ACZ55913.1| truncated putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 327 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 44/315 (13%), Positives = 87/315 (27%), Gaps = 12/315 (3%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRILKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + QF L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + + D PIPPL Q I + + A T L TE Sbjct: 132 LLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE 191 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 ++ K++ + ++ + + KD+ I+ V Sbjct: 192 LKARKKQYE-YYQNMLLDFNDINQNHKDAKIKSYPKRLKTLLQTLAPKGVEFRKLGEVCE 250 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I N N + G + G+ V D ++ Sbjct: 251 ILDNRRIPIAKNKRKPGIYPYYGANGIQDYIDSYIFDGDFVLVGEDGSVINKNNTPVVNW 310 Query: 312 ERGIITSAYMAVKPH 326 G I M + + Sbjct: 311 ASGKIWVIIMLMCFN 325 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 19/177 (10%), Positives = 53/177 (29%), Gaps = 11/177 (6%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKL---ETRNMGLKPESYETYQIVDPGEIVF 293 T I +I + + + P++ + ++ I+ Sbjct: 27 IKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRILKDSIQHITPKALKGKKLFPKNSIII 86 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-- 351 + L + + +++ K + + + + L + + Sbjct: 87 STTATIGEHALLIVDSLANQQFT---FLSKKANCDLALDMKFFFYQCFLLGEWCKKNTNV 143 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 S+ K+ +PP++ Q +I +++ T L ++ LK R+ Sbjct: 144 SGFASVDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTE---LKARKK 197 >gi|292491019|ref|YP_003526458.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] gi|291579614|gb|ADE14071.1| restriction modification system DNA specificity domain protein [Nitrosococcus halophilus Nc4] Length = 545 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 23/215 (10%), Positives = 59/215 (27%), Gaps = 21/215 (9%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS----------LSYGNIIQKLETRNM 273 +G +P W V L+ EL + ++ +T+ + Sbjct: 334 ELGEIPVGWRVGKLEELIDELETGSRPKGGVGQFFEGVPSIGAESITRIGEYDYSKTKFV 393 Query: 274 GLKPESYETYQIVDPGEIVFRFID-----LQNDKRSLRSAQVMERGIITSAYMA-VKPHG 327 + I+ +++ E + Sbjct: 394 PKEYFEKMRRGIIKDRDVLIYKDGGQPGRFDARISMFGGGFPYETACLNEHVFRLQAKKP 453 Query: 328 IDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 YL + SY + + + D+K L L + VI Sbjct: 454 TYQNYLYLWLSSYPVIEELRFRGAKAAIPGINSGDIKELDFLFMDEEVLEKFDEVIEPLF 513 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +++ + + + +L + R + + ++G++ + Sbjct: 514 SKL----LQNSREMAVLGKLRDTLLPKLMSGELRV 544 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 66/183 (36%), Gaps = 15/183 (8%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE--TYQIVDPGEIVFRFI 296 K E+N + ++ GN + LK E ++ G+++ Sbjct: 16 KHGYAFPGKEITTKETNDVLVTPGNFEIGGGFKASKLKYFEGEVPEEYVLAEGDLIVTMT 75 Query: 297 DLQNDK-----RSLRSAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 DL D +L +R + + VK + S +L WLMRS + Sbjct: 76 DLSRDGDTLGYSALVPKFDGKRLLHNQRIGLVLVKSDEVSSAFLHWLMRSREYRFYVLGS 135 Query: 350 GSG-LRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +G + E +K++ + +P KEQ I ++ + +D +E + L+ Sbjct: 136 ATGSTVRHTSPERIKQIELEIPSDPKEQEAIAEIL----SSLDEKIELNRKQNRTLEAIA 191 Query: 408 SSF 410 + Sbjct: 192 QAL 194 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 56/206 (27%), Gaps = 20/206 (9%) Query: 18 IGAIPKHWKVVPIKRFT-KLNTGRTSESG-----KDIIYIGLEDVESGTGKYLPKDGNSR 71 +G IP W+V ++ +L TG + G + + IG E + + G+Y Sbjct: 335 LGEIPVGWRVGKLEELIDELETGSRPKGGVGQFFEGVPSIGAESI-TRIGEYDYSKTKFV 393 Query: 72 QSDTSTVS---IFAKGQILYGKLGPYLRKA---------IIADFDGICSTQFLV-LQPKD 118 + I +L K G + + K Sbjct: 394 PKEYFEKMRRGIIKDRDVLIYKDGGQPGRFDARISMFGGGFPYETACLNEHVFRLQAKKP 453 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L WL S V + + A + + I + E I Sbjct: 454 TYQNYLYLWLSSYPVIEELRFRGAKAAIPGINSGDIKELDFLFMDEEVLEKFDEVIEPLF 513 Query: 179 VRIDTLITERIRFIELLKEKKQALVS 204 ++ E +L L+S Sbjct: 514 SKLLQNSREMAVLGKLRDTLLPKLMS 539 >gi|323340762|ref|ZP_08081014.1| hypothetical protein HMPREF0542_11445 [Lactobacillus ruminis ATCC 25644] gi|323091885|gb|EFZ34505.1| hypothetical protein HMPREF0542_11445 [Lactobacillus ruminis ATCC 25644] Length = 188 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 15/149 (10%), Positives = 56/149 (37%), Gaps = 9/149 (6%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + + S + + +I+F ++ + ++ + + Sbjct: 43 DDVKYIDGSNFSKLSRSKLFINDIMFTYVGTVGEVAIIKENDRFY--LAPNVSRIRVKSD 100 Query: 328 IDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINV 384 +++ MR+ + +F + + + +L E++++ + +P +EQ + + Sbjct: 101 DSPKFISHYMRTDNFKNKVIFPLIATSSQPALSMENIRKFTINIPINREEQ----DCLAK 156 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ ++ I +K + + + Sbjct: 157 YFDSLDHLITLHQRKIDKIKNMKKAMLDQ 185 >gi|218133859|ref|ZP_03462663.1| hypothetical protein BACPEC_01748 [Bacteroides pectinophilus ATCC 43243] gi|217991234|gb|EEC57240.1| hypothetical protein BACPEC_01748 [Bacteroides pectinophilus ATCC 43243] Length = 357 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 50/358 (13%), Positives = 112/358 (31%), Gaps = 49/358 (13%) Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKL----GPYLRKAIIADF-DGICSTQFL---VL 114 ++P N +D S + KG+ + L A+ + I S + V+ Sbjct: 36 FMPSVANVIGTDLSKYKLITKGKFACNPMHVGRDERLPVALYDEEKPAIVSPAYFMFEVI 95 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + + L W + + +G+ W I + +PIPP+ Q+ I Sbjct: 96 DNSILNEDYLMMWFRRPEFDRICWLHTDGSVRGGITWDDICRLELPIPPIENQLEIVN-- 153 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 I I + + + L + Q L + + + + + L H Sbjct: 154 --SYKAITERIALKQKINDNLDDTAQTLYQKYFESNSDKSSWKQGTVGDVLQLQRGH--- 208 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 + ++ G T +G E ++ G R Sbjct: 209 ------------------DLPRTEMTGGKYPVAGSTGTIGYHDEFTAEAPVIVMG----R 246 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 ++ N + L + ++ + + + ++ +L+++ + G Sbjct: 247 SGNIGNPRLYLCNCWT-----HNTSLYVKQIYEAEPLWVFYLLKNLNYDGFV---GGSAV 298 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +L DV + +PP++ Q + + I E + I L+E + + Sbjct: 299 PTLNRNDVHAYGIAIPPLELQK---SFSQKVMSLIYCKEENL-SEIEKLQELQKIILT 352 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 55/147 (37%), Gaps = 9/147 (6%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAV---KPHGID 329 + Y+++ G+ + + D+R + + I++ AY ++ Sbjct: 42 NVIGTDLSKYKLITKGKFACNPMHVGRDERLPVALYDEEKPAIVSPAYFMFEVIDNSILN 101 Query: 330 STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL R + ++ + +R + ++D+ RL + +PPI+ Q +I N T R Sbjct: 102 EDYLMMWFRRPEFDRICWLHTDGSVRGGITWDDICRLELPIPPIENQLEIVNSYKAITER 161 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAV 415 + ++ L + + Sbjct: 162 ----IALKQKINDNLDDTAQTLYQKYF 184 >gi|290957397|ref|YP_003488579.1| type I restriction protein fragment [Streptomyces scabiei 87.22] gi|260646923|emb|CBG70022.1| putative type I restriction enzyme fragment [Streptomyces scabiei 87.22] Length = 220 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 54/145 (37%), Gaps = 7/145 (4%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++ + PG+++F + K + T ++ G+DS ++ ++ Sbjct: 77 DAISLKAVFRPGDVLFGKLRAYLRKFWFADVAGL---CTTEIWVLRARPGVDSRFVRSIV 133 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + + + V+ L V +PP EQ I +V+ A ID + + Sbjct: 134 ETERFIEAASGAYGTHMPRSDWGTVRSLSVDIPPHDEQRAIASVL----ADIDREISILH 189 Query: 398 QSIVLLKERRSSFIAAAVTGQIDLR 422 + ++ + + +TG+ L Sbjct: 190 ARLAKARDVKQGMMQQLLTGRTRLP 214 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 41/159 (25%), Positives = 74/159 (46%), Gaps = 6/159 (3%) Query: 49 IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS 108 + LE VESG+G+ + K + S ++F G +L+GKL YLRK AD G+C+ Sbjct: 55 PLVELEQVESGSGRLVGK--SQAADAISLKAVFRPGDVLFGKLRAYLRKFWFADVAGLCT 112 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 T+ VL+ + + ++ + + G M +DW + ++ + IPP EQ Sbjct: 113 TEIWVLRARPGVDSRFVRSIVETERFIEAASGAYGTHMPRSDWGTVRSLSVDIPPHDEQR 172 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 I + ID I+ + ++ KQ ++ ++ Sbjct: 173 AIASVL----ADIDREISILHARLAKARDVKQGMMQQLL 207 >gi|328947981|ref|YP_004365318.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328448305|gb|AEB14021.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 185 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 51/168 (30%), Gaps = 9/168 (5%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES----YETYQIVDPG 289 + + E + S N + + N+ + + G Sbjct: 21 CNKLVDGDHNPPKSVEEQTEYIMASSRNINYDRLDDLENVRYLSKEVFKIENNRTKAEKG 80 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I F + + I + V I++ +L + S Sbjct: 81 DIFFTSVGTIGR----SCIYSGDYNICFQRSVTVLNTNINNQFLKYFFDSNFFQTYVIEH 136 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +G + +++ P+ +PP+ EQ I + ++ ++D + + Sbjct: 137 STGTAQMGFYLKEMANSPIAIPPMHEQARIVDKVSELFYQLDQIQNNL 184 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 61/173 (35%), Gaps = 9/173 (5%) Query: 20 AIPKHWKVVPIKRFT-KLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP W+ V + KL G ++ E + I ++ L + Sbjct: 7 EIPDSWRWVKLTSICNKLVDGDHNPPKSVEEQTEYIMASSRNINYDRLDDLENVRYLSKE 66 Query: 74 D---TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + KG I + +G R I + IC + + + ++ + L+ + S Sbjct: 67 VFKIENNRTKAEKGDIFFTSVGTIGRSCIYSGDYNICFQRSVTVLNTNINNQFLKYFFDS 126 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + G K + N P+ IPP+ EQ I +K+ ++D Sbjct: 127 NFFQTYVIEHSTGTAQMGFYLKEMANSPIAIPPMHEQARIVDKVSELFYQLDQ 179 >gi|295087090|emb|CBK68613.1| Restriction endonuclease S subunits [Bacteroides xylanisolvens XB1A] Length = 414 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 48/132 (36%), Gaps = 9/132 (6%) Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 F + Q + + + A + + I S ++ + +L + + Sbjct: 42 HFAIVGRQGALCGCLNIESGKFYATEHAVVVNSYNIISSLFIYHFFTALNLNQY---ATA 98 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-----KER 406 + L ++ + + +PP+ EQ I + I + E+ + + L ++ Sbjct: 99 TAQPGLAVSNIMEVFIPLPPLSEQHRIVSKIEELL-PLVKTYERAQNGLNTLNVSLNEQL 157 Query: 407 RSSFIAAAVTGQ 418 R S + A+ G+ Sbjct: 158 RKSILQEAIQGR 169 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 52/181 (28%), Gaps = 20/181 (11%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +PK WK +K + TG T + + I + ++ K D + Sbjct: 236 DLPKGWKWCRLKDICSIFTGATFKKEEATITKQGIRILRGGNISPFELKIKDDDIFLAKD 295 Query: 74 DTSTVSIFAKGQIL------------YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + + IL ++ + + F I + + Sbjct: 296 KIKEAILLKENDILTPAVTSLENIGKMARVDSDMPDTTVGGFVFIIRLHLINQWFSKYI- 354 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L + R G + + + +PIPPL EQ I +I ++ Sbjct: 355 -LCLLSSPFMIDFMRSITNKSGQAFYNIGKERLSTALLPIPPLVEQHRIVAQIEKLFEQL 413 Query: 182 D 182 Sbjct: 414 R 414 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 26/231 (11%), Positives = 68/231 (29%), Gaps = 22/231 (9%) Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG-------IEWVGLVPDHWE 233 + I + + K++ +S V + + + G E +P W+ Sbjct: 183 LIEQIRLEKLQLVKEGKLKKSALSNSVIYKGDDNKYYEQVGKNINEITEEIAFDLPKGWK 242 Query: 234 VKPFFALVTELNRKNTKLIESNILSL--------SYGNIIQKLETRNMGLKPESYETYQI 285 + + K E+ I + K++ ++ L + + + Sbjct: 243 WCRLKDICSIFTGATFKKEEATITKQGIRILRGGNISPFELKIKDDDIFLAKDKIKEAIL 302 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWLMRSY- 340 + +I+ + + + ++ + S Y+ L+ S Sbjct: 303 LKENDILTPAVTSLENIGKMARVDSDMPDTTVGGFVFIIRLHLINQWFSKYILCLLSSPF 362 Query: 341 --DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 D + ++ E + + +PP+ EQ I I ++ Sbjct: 363 MIDFMRSITNKSGQAFYNIGKERLSTALLPIPPLVEQHRIVAQIEKLFEQL 413 >gi|317180667|dbj|BAJ58453.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F32] Length = 377 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 62/384 (16%), Positives = 121/384 (31%), Gaps = 26/384 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 ++ + L T +S + YI +++ ++ G K+ N Q + F K + Sbjct: 3 KTLQDYATLIND-TIQSNEINHYITTDNMCQNLGGIDTFKNINIPQGKVRS---FQKDDV 58 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 L + Y RK A G CS+ LV + K + L L S T + +G+ M Sbjct: 59 LLSNIRLYFRKVYRAKQKGGCSSDVLVFRAKRIDSATLFAILSSQIFTDYACSGSQGSKM 118 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAE--TVRIDTLITERIRFIELLKEKKQALVS 204 + + + +P + +I+ L+ + + + + + Sbjct: 119 PRGNKTHMMDFKIPTINFTIAKIFNSIQNKIENNHKINELLHKILELLYEQYFVRFDFLD 178 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 KMK S E L+P W V+ + + T S SY Sbjct: 179 ENNKPYQTSGGKMKFS-KELNRLIPSGWSVRFLNHKIVSTYQPKTISKTLLNDSYSYSVY 237 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + I G+ ++ L + + + Sbjct: 238 GGGGIIGRFTEYNHEQSEFIISCRGQCGISYLTLPKSWITG-----------NAMVIRPT 286 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 TYL ++ Y L ++ + +++ +P+L+P N N Sbjct: 287 KSYTSKTYLYHTIKKYKLTNYI---TGSVQPQITRQNLSTMPILIPK----RKTLNKWNN 339 Query: 385 ETARIDVLVEKIEQSIVLLKERRS 408 ++ + L+ QS L R Sbjct: 340 ISSLLWNLIHNNMQSTQTLTALRD 363 >gi|319744172|gb|EFV96542.1| type I restriction/modification specificity protein [Streptococcus agalactiae ATCC 13813] Length = 164 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 16/109 (14%), Positives = 44/109 (40%), Gaps = 3/109 (2%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 +I++ I +N + + T + + + YL +++S + Sbjct: 53 FQKNDILYSEIRPKNRRFAYIDFDSDNYVASTKLMVIRANNRVLPQYLYQILKSEKVINQ 112 Query: 346 FYAMG---SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 ++ SG + F ++ ++ V +P + EQ +I + + + +I+ Sbjct: 113 LQSLAESRSGTFPQITFSELAQIDVYIPELSEQKEIADFLKLFDDKIEN 161 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 64/162 (39%), Gaps = 7/162 (4%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + + ++ + ++ I DV G N + F K ILY Sbjct: 2 KLGDVCDSVSVTFDKTKQQVVLINTSDVLEGEVTNHILVDNKGLKGQFKKT-FQKNDILY 60 Query: 89 GKLGPYLRKAIIADF---DGICSTQFLVLQPKDV-LPELLQGWLLSIDVTQRIEAI--CE 142 ++ P R+ DF + + ST+ +V++ + LP+ L L S V +++++ Sbjct: 61 SEIRPKNRRFAYIDFDSDNYVASTKLMVIRANNRVLPQYLYQILKSEKVINQLQSLAESR 120 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 T + + I + IP L+EQ I + + +I+ Sbjct: 121 SGTFPQITFSELAQIDVYIPELSEQKEIADFLKLFDDKIENN 162 >gi|304320737|ref|YP_003854380.1| hypothetical protein PB2503_05837 [Parvularcula bermudensis HTCC2503] gi|303299639|gb|ADM09238.1| hypothetical protein PB2503_05837 [Parvularcula bermudensis HTCC2503] Length = 204 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 49/132 (37%), Gaps = 9/132 (6%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV---KPHGIDSTYLAW 335 S T + +++F + + R + + + +P + +LAW Sbjct: 61 SKRTPDWLSGDDVIFSARGTRTLAYPI--NDPPARAVCAPQFYVIKVKRPEKLLPAFLAW 118 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR---IDV 391 + F +G Q+++ + ++ LP+ +PP+ EQ I ++ Sbjct: 119 QINQKPAQDYFSRTATGSYIQNIRRKALENLPLAIPPVHEQQVIVEFWRAAQRERAVLNQ 178 Query: 392 LVEKIEQSIVLL 403 L++ Q + L Sbjct: 179 LIQNRNQQLDAL 190 >gi|281357556|ref|ZP_06244043.1| restriction modification system DNA specificity domain protein [Victivallis vadensis ATCC BAA-548] gi|281315813|gb|EFA99839.1| restriction modification system DNA specificity domain protein [Victivallis vadensis ATCC BAA-548] Length = 174 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 51/173 (29%), Gaps = 5/173 (2%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + +++ K + S N++ + + V P Sbjct: 2 KNNQLEKLSDYADYSKAKISIAEIDTKCYFSTENMLPNKGGVTEAAGLPTQDNVTKVLPE 61 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ I K + G V +G YL +L+ S A Sbjct: 62 NVLVSNIRPYFKKIYFANELA---GASNDVLCFVAKNGCLPRYLYYLLSSDSFFDYMMAG 118 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 G + PV VP EQ I +V++ +I+ + KI ++ Sbjct: 119 AKGTKMPRGDKGQIMNFPVWVPAQNEQSRIVSVLSALDEKIEN-ISKINHNLE 170 >gi|329575631|gb|EGG57164.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX1467] Length = 321 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 57/132 (43%), Gaps = 11/132 (8%) Query: 288 PGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKV 345 E+ + + + K + S + E ++ Y + K D +L ++ + K Sbjct: 2 KNELSYNHGNSKLAKYGAVFSLKTYEEALVPRVYHSFKSTKNSDPDFLEYIFATKKPDKE 61 Query: 346 F-YAMGSGLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + SG R ++ ++D + + +P + EQ I+N++ +ID + ++ + Sbjct: 62 LGKLVSSGARMDGLLNINYDDFSNIKINIPHVHEQKKISNLL----RKIDDTIALHQRKL 117 Query: 401 VLLKERRSSFIA 412 LKE + +++ Sbjct: 118 DQLKELKKAYLQ 129 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/265 (13%), Positives = 92/265 (34%), Gaps = 24/265 (9%) Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + ++ NI + IP + EQ I + +ID I R ++ LKE K+A + + Sbjct: 77 NINYDDFSNIKINIPHVHEQKKISNLL----RKIDDTIALHQRKLDQLKELKKAYLQLMF 132 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 K +++ + E + ++ + + K+ S+ Y + Sbjct: 133 PKKDETVPQVRFANFEENWEL------CKLENIIEKQIKGKAKVENLCNGSVEYLDA--- 183 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 R G KP + V +I+ + + K +G++ S A + Sbjct: 184 --NRLNGGKPIYTKALPDVSERDIIILWDGSKAGKVYY-----EFKGVLGSTLKAYQLKE 236 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 ++ + + ++ + + P+ + +EQ + +++ + Sbjct: 237 CANSQFIYQQLLDNQNNIYNNYRTPNIPHVVKNFSSIFPIWMTSFEEQSQMADIL----S 292 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIA 412 +D + + + + S++ Sbjct: 293 NLDNRIILQQNLTDTMISLKKSYLQ 317 Score = 41.3 bits (95), Expect = 0.33, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 60/184 (32%), Gaps = 15/184 (8%) Query: 23 KHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI- 80 ++W++ ++ + G+ +E++ +G+ +YL + + T ++ Sbjct: 149 ENWELCKLENIIEKQIKGKAK----------VENLCNGSVEYLDANRLNGGKPIYTKALP 198 Query: 81 -FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++ I+ G K +F G+ + Q K+ + +D I Sbjct: 199 DVSERDIIILWDGSKAGKVYY-EFKGVLGSTLKAYQLKECANS-QFIYQQLLDNQNNIYN 256 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + H P+ + EQ + + + RI I L K Sbjct: 257 NYRTPNIPHVVKNFSSIFPIWMTSFEEQSQMADILSNLDNRIILQQNLTDTMISLKKSYL 316 Query: 200 QALV 203 Q + Sbjct: 317 QNMF 320 >gi|288804029|ref|ZP_06409441.1| type I restriction-modification system, S subunit [Prevotella melaninogenica D18] gi|288333494|gb|EFC71957.1| type I restriction-modification system, S subunit [Prevotella melaninogenica D18] Length = 168 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 44/132 (33%), Gaps = 5/132 (3%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + +++ L ++V G I L + E + Sbjct: 41 RFVDSSAEYLSEAGKAISRVVPIGSTAVCCIGSIGKAGYL----IKEGTTNQQINCVIPS 96 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 +DS +L +L S + + S + ++ + V +PPI+EQ I + I Sbjct: 97 EAVDSVFLYYLCTSPLFYQELITLSSAVTISIINKSKMENIIVPLPPIEEQKRIVSKIED 156 Query: 385 ETARIDVLVEKI 396 I + E + Sbjct: 157 LFGFIKTIEESL 168 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 51/161 (31%), Gaps = 5/161 (3%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDGNSRQSDTST-VSIF 81 + +K + + TG T Y G + ++ G+++ + Sbjct: 2 LCKLKNISLIITGSTPSKSNSAYYGGKVPFYKPIDLDAGRFVDSSAEYLSEAGKAISRVV 61 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G +G + + V+ + V L S Q + + Sbjct: 62 PIGSTAVCCIGSIGKAGYLIKEGTTNQQINCVIPSEAVDSVFLYYLCTSPLFYQELITLS 121 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 T+S + + NI +P+PP+ EQ I KI I Sbjct: 122 SAVTISIINKSKMENIIVPLPPIEEQKRIVSKIEDLFGFIK 162 >gi|332829721|gb|EGK02367.1| hypothetical protein HMPREF9455_01637 [Dysgonomonas gadei ATCC BAA-286] Length = 183 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 73/173 (42%), Gaps = 8/173 (4%) Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 K+ + ++ G I Q + E +Y+I++ G+ + + + Sbjct: 13 FSFRNKSQEQYPKYSITNDLGFIPQSERFEERNMIYEDISSYKIINKGDFAYNP--ARIN 70 Query: 302 KRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKF 359 S+ + +I+S Y+ +P + S +L +++S + + G G+R L F Sbjct: 71 VGSIAKYEGDNPCMISSLYVCFRPKPNMSSEWLKHVLKSKRMIYNYNLFGEGGVRIYLFF 130 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + R+ + VPP++EQ I +I+ ID + + L ++S ++ Sbjct: 131 PNFGRIKINVPPLEEQERIAIIIST----IDQKISIESLMLNKLNTQKSFLLS 179 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 49/155 (31%), Gaps = 3/155 (1%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 +K + R + Y D+ ++ N D S+ I KG Y Sbjct: 6 LKDVVINFSFRNKSQEQYPKYSITNDLGFIPQSERFEERNMIYEDISSYKIINKGDFAYN 65 Query: 90 KLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC-EGATM 146 + D + S+ ++ +PK + +L + EG Sbjct: 66 PARINVGSIAKYEGDNPCMISSLYVCFRPKPNMSSEWLKHVLKSKRMIYNYNLFGEGGVR 125 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + G I + +PPL EQ I I +I Sbjct: 126 IYLFFPNFGRIKINVPPLEEQERIAIIISTIDQKI 160 >gi|60680964|ref|YP_211108.1| putative type I restriction enzyme specificity protein [Bacteroides fragilis NCTC 9343] gi|60492398|emb|CAH07167.1| putative type I restriction enzyme specificity protein [Bacteroides fragilis NCTC 9343] Length = 447 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 41/286 (14%), Positives = 94/286 (32%), Gaps = 27/286 (9%) Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV----SYIVTKGLNP 213 L +Q E I + ++ ++ K+K ++++ + + Sbjct: 13 WAIQGKLVQQDPNDEPASVLLEHIREEKAKLVKEKKIKKDKNESIIYRGDDNSYYEKIIA 72 Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVT----------ELNRKNTKLIESNILSLSYGN 263 ++K E +P+ WE + + + K + I + N Sbjct: 73 TGEVKCIDEEIPFEIPNGWEWERVGNIFFVTKLAGFEYTKFFTKEAISAFNPIPIVRAQN 132 Query: 264 IIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + N + Q+ ++ ++ FI + A+ A Sbjct: 133 VRMGFFEENKNEAISEMLSNQLKRSALNKKCLLMTFIGAGIGDTCIFPAERKNHLAPNVA 192 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + I Y + + S + A S + SL E +++L + +PP KEQ I Sbjct: 193 KIEPLDDSIFLDYAVFALMSPCGQRGVNAIKKSTAQPSLSMETIRKLLIPIPPFKEQKCI 252 Query: 379 TNVINVET------ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + ++ +++ + +I I +L S + A+ G+ Sbjct: 253 SLKLSEVLPLVEKYSKVQKVQNQINDEINIL--LSKSILQEAIRGK 296 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/347 (10%), Positives = 103/347 (29%), Gaps = 29/347 (8%) Query: 20 AIPKHWKVVPIKRFT-----------KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 IP W+ + K T + I + ++V G + + Sbjct: 86 EIPNGWEWERVGNIFFVTKLAGFEYTKFFTKEAISAFNPIPIVRAQNVRMGFFEENKNEA 145 Query: 69 NSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 S S K +L +G + I + V + + + + + Sbjct: 146 ISEMLSNQLKRSALNKKCLLMTFIGAGIGDTCIFPAERKNHLAPNVAKIEPLDDSIFLDY 205 Query: 128 L----LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 +S + + AI + + I + +PIPP EQ I K+ ++ Sbjct: 206 AVFALMSPCGQRGVNAIKKSTAQPSLSMETIRKLLIPIPPFKEQKCISLKLSEVLPLVEK 265 Query: 184 LITERIRFIELLKEKK----QALVSYIVTKGLNPDVKMKDSGIEWVGLVP-DHWEVKPFF 238 + ++ E ++++ + L P + + + + + + + + Sbjct: 266 YSKVQKVQNQINDEINILLSKSILQEAIRGKLVPQIAEEGTADKLLAEIHKEKERLVKEG 325 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 L + + + + + + + ++ + + DL Sbjct: 326 KLKKAILTDSVIYKGDDNKYYERVGKSEIDISDEIPFEIPQSWSWCRLSSVITLLSGRDL 385 Query: 299 QNDKR--------SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 D+ + A G+ ++ ++ + + Y +L+ Sbjct: 386 TPDRYNSEENGIPYITGASNFYNGVSSTLAVSNIRNRTYTNYEVYLI 432 >gi|302520833|ref|ZP_07273175.1| restriction modification system DNA specificity subunit protein [Streptomyces sp. SPB78] gi|302429728|gb|EFL01544.1| restriction modification system DNA specificity subunit protein [Streptomyces sp. SPB78] Length = 278 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 47/157 (29%), Gaps = 9/157 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + E Y+++ ++ D R + I + V+ Sbjct: 114 DLSTVKEIAASVAEIERYKLLSEDLLLTEGGDPDKLGRGTLWRDELPVCIHQNHVFRVRV 173 Query: 326 HGI---DSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 D YL W+M S F + S+ + P+ VPPI Q D + Sbjct: 174 KTRAEVDPLYLNWVMSSSYGKGYFLRTAKQTTGIASINKTQLGEFPLPVPPIARQKDFRS 233 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 I + + L E +S A +G Sbjct: 234 RIESVQES----QQAHRTHLATLDELFTSLQHRAFSG 266 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 19/66 (28%), Positives = 34/66 (51%), Gaps = 4/66 (6%) Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 GSG ++ + ++ L V VPP+ EQ I +++ ++D L K ++I LL + Sbjct: 1 MTGSGGQRRVPESYLRSLSVPVPPLAEQRHIATLLD----QVDTLRAKRREAIALLDDLA 56 Query: 408 SSFIAA 413 SS + Sbjct: 57 SSLFSD 62 >gi|302035527|ref|YP_003795849.1| hypothetical protein NIDE0137 [Candidatus Nitrospira defluvii] gi|300603591|emb|CBK39921.1| protein of unknown function, putative Type I restriction endonuclease, S subunit [Candidatus Nitrospira defluvii] Length = 389 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 55/404 (13%), Positives = 126/404 (31%), Gaps = 46/404 (11%) Query: 29 PIKRFTKLNTG-----RTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 P+ +++G +T ES ++ Y+ + +V+ G + + Sbjct: 8 PLSDVADISSGITLGRKTKESELTEVPYLRVANVQDGHLLLGDLKMIAATRREAEKWALK 67 Query: 83 KGQILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G +L + G R A + +C Q + + + ++ + +A Sbjct: 68 DGDLLLTEGGDLDKLGRGACWREQLPLCIHQNHIFRVRLPADRYDADFVSFQIGSPYGKA 127 Query: 140 IC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + ++ + + +G P+ PP+AEQ I ++ A+ +D + Sbjct: 128 YFLAHAKKTTGIASINQRVLGAFPLVSPPIAEQHRIAVRLKAQLAEVDRARQAAQAQLRE 187 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + ++V + + N + E I + Sbjct: 188 VARLADSIVLNSIRQHPNDRHDLGSVLNE------------------------VKNGIGA 223 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + Y+ PG + + + + + G Sbjct: 224 AWAEYRVLGATRDGLAPAKEPPGKHAPKYKPAFPGTVFYNPMRILIGSIAFVD-DDDAPG 282 Query: 315 IITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPP 371 I + Y+A K +DS + + +RS + ++ G R+ + F + + +P Sbjct: 283 ITSPDYVALTGKSDKVDSRWFYYWLRSPLGAQCIISLARGAVRERMLFNRLSEGEIELPR 342 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 Q +V E + +E I L +R +A A Sbjct: 343 YPVQQR-ASVALKELKPLRQAIECQLAEIERLPQR---LLAQAF 382 >gi|331266259|ref|YP_004325889.1| type I restriction-modification system, putative [Streptococcus oralis Uo5] gi|326682931|emb|CBZ00548.1| type I restriction-modification system, putative [Streptococcus oralis Uo5] Length = 213 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 56/179 (31%), Gaps = 9/179 (5%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTE------LNRKNTKLIESNILSLSYGNIIQKLETR 271 K E G + L + + S + + +I + + Sbjct: 14 KSRFNEMFGDPVFNEMRWRRCKLKDISVEKLAYGSGASAIDFSGLRYIRITDIDECGNLK 73 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--ID 329 P YE +++ G+I+F K L S + + + + P+ ++ Sbjct: 74 PDKKSPNHYEEKYLLNTGDILFARSGATVGKTFLYSKEKYGPALFAGYLIRLIPNLSLVN 133 Query: 330 STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 ++ + + + + + ++ + L ++PP+ Q + + + Sbjct: 134 PVFVYHFTNTKFYKEFIAKVQNTVAQPNINAKQYSELDFILPPLALQNEFADFVAQVDK 192 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 55/160 (34%), Gaps = 15/160 (9%) Query: 25 WKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVS 79 W+ +K + +G ++ + YI + D++ G K K N + Sbjct: 31 WRRCKLKDISVEKLAYGSGASAIDFSGLRYIRITDIDECGNLKPDKKSPNHYEE----KY 86 Query: 80 IFAKGQILYGKLGPYLRKAIIADF----DGICSTQFLVLQPK--DVLPELLQGWLLSIDV 133 + G IL+ + G + K + + + + L P V P + + + Sbjct: 87 LLNTGDILFARSGATVGKTFLYSKEKYGPALFAGYLIRLIPNLSLVNPVFVYHFTNTKFY 146 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + I + + + K + +PPLA Q + Sbjct: 147 KEFIAKVQNTVAQPNINAKQYSELDFILPPLALQNEFADF 186 >gi|198273274|ref|ZP_03205810.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 4 str. ATCC 27816] gi|198249794|gb|EDY74574.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 4 str. ATCC 27816] Length = 382 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 44/384 (11%), Positives = 109/384 (28%), Gaps = 14/384 (3%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +V I K+ G T + + ++ +++ + L + SR + I Sbjct: 3 IVNIGSICKIIGGSTPSTKNNNLW--KKEIPFYSLADLLINVASRYISIENNKFIDEPAI 60 Query: 87 LYGKLGPYLRKAIIADFDGICST-QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 L+ + + + + + +VL + + +G+ Sbjct: 61 LFSSTATIGNVCYVEEKCWFNDQIKAFISKDSNVLNTKYLYYWFLNNKHIIKSQANKGSV 120 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QA 201 S K + N+ + +P + EQ I I +K + Sbjct: 121 FSSIGIKELVNMKINLPSIEEQNAIISIIEPHEKLFVKYSNLVDISSVENAKKDVDNLIS 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE-VKPFFALVTELNRKNTKLIESNILSLS 260 ++ + + + + + F K S Sbjct: 181 IIEPLDILENKINKLKTVLKKLLINIYDKNCNSHVNLFENNKIYTNKYLNQNLYCDTSCI 240 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 I + N+ L+ + + I+F + +N E + ++ + Sbjct: 241 GELEINFSKMINISLEDKPSRADLSIKNNSIIFSKLLGENKVYC---FLNNENIVFSTGF 297 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT 379 +K + ++ L + S D + +G + D+ ++ P + +I Sbjct: 298 FNIKSNDENNDDLLSFLLSSDFKNQKSMLANGTTMIGINNSDLTKVRCKAPFLN--SNIY 355 Query: 380 NVINVETARIDVLVEKIEQSIVLL 403 + I+ + IV L Sbjct: 356 FTFFNKLNEIENKITLARNKIVNL 379 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 52/152 (34%), Gaps = 2/152 (1%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 + ++ + +N+ N+ + S E + +D I Sbjct: 1 MSIVNIGSICKIIGGSTPSTKNNNLWKKEIPFYSLADLLINVASRYISIENNKFIDEPAI 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + + I A+++ + +++ YL + + A Sbjct: 61 LFSSTATIGNVCYVEEKCWFNDQI--KAFISKDSNVLNTKYLYYWFLNNKHIIKSQANKG 118 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + S+ +++ + + +P I+EQ I ++I Sbjct: 119 SVFSSIGIKELVNMKINLPSIEEQNAIISIIE 150 >gi|19881312|gb|AAM00902.1|AF486570_3 3' truncated HsdS [Campylobacter jejuni subsp. jejuni ATCC 33560] Length = 221 Score = 66.0 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 56/163 (34%), Gaps = 5/163 (3%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 N L + + I K E+ + ++ ++ S + + Sbjct: 47 NFLDVMNNHYINKNIPSMKVTASEAEIQKCNILKNDLFITPSSENINEIGFASVAIEDMP 106 Query: 315 IITSAYMAV----KPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 + +Y + I+ +L + S +L K G R L K L + + Sbjct: 107 NVCYSYHIMRFRIFNRQINPYFLRYCFDSENLRKQILKNAQGITRFGLTQPKWKNLQIPI 166 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 PP++ Q +I +++ T L ++E + R+ ++ Sbjct: 167 PPLEIQEEIVKILDTFTELEAELEAELEARRRQYEYYRNKLLS 209 >gi|304387859|ref|ZP_07370033.1| type I restriction enzyme EcoprrI specificity protein [Neisseria meningitidis ATCC 13091] gi|304338124|gb|EFM04260.1| type I restriction enzyme EcoprrI specificity protein [Neisseria meningitidis ATCC 13091] Length = 198 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 50/171 (29%), Gaps = 4/171 (2%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY---ETYQIVDPGEI 291 T +I +I + + LK S + ++ I Sbjct: 10 FDLKNGYTPSKSNKEYWENGSIPWFRMEDIRENSRILDNSLKHISKSAVKGGKLFPAKSI 69 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + ++ + + + +D + + S Sbjct: 70 MMSTTATIGEHALIKVNYISNQQLTNFTIKDEFKDALDINFAFYYFFIIAEQSKKLINTS 129 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 L + +++K+L + +PP+ EQ I +++ + E + I L Sbjct: 130 SL-PIISMKELKKLKIPIPPLPEQEKIAAILDKFDTLTHSISEGLPHEIAL 179 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 57/192 (29%), Gaps = 9/192 (4%) Query: 27 VVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 + L G T I + +ED+ + + +S Sbjct: 3 WKTLGEVFDLKNGYTPSKSNKEYWENGSIPWFRMEDIRENSRILDNSLKHISKSAVKGGK 62 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICST--QFLVLQPKDVLPELLQGWLLSIDVTQRI 137 +F I+ A+I F + ++ + + ++ Sbjct: 63 LFPAKSIMMSTTATIGEHALIKVNYISNQQLTNFTIKDEFKDALDINFAFYYFFIIAEQS 122 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + +++ K + + +PIPPL EQ I + ++ I L ++ Sbjct: 123 KKLINTSSLPIISMKELKKLKIPIPPLPEQEKIAAILDKFDTLTHSISEGLPHEIALRRK 182 Query: 198 KKQALVSYIVTK 209 + + ++ Sbjct: 183 QYEYYREQLLAF 194 >gi|313620385|gb|EFR91788.1| restriction modification system DNA specificity subunit [Listeria innocua FSL S4-378] Length = 201 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 63/183 (34%), Gaps = 8/183 (4%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 WE + + ++ KN + S G I + N+ +S + Y++V PG+ Sbjct: 21 WEQRKAGNIFMTISDKNHAHLPVLSASQELGMIRRDNIGINIKYNEKSLKNYKLVKPGQF 80 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 V Q + Y + + S + ++ S K + Sbjct: 81 VIHLRSFQGGFAWSYITGITSPAYTILDYKEPQKNV--SKFWKEVLTSPIFIKRLETITY 138 Query: 352 GLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G+R +S+ F D L P + EQ I+ ++D + + L + + Sbjct: 139 GIRDGRSISFADFSTLKFSAPSVDEQRKISAF----FQQLDNNTTIQQNKLEKLISLKEA 194 Query: 410 FIA 412 ++ Sbjct: 195 YLQ 197 Score = 40.5 bits (93), Expect = 0.57, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 51/185 (27%), Gaps = 9/185 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLE-DVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + + + + ++ + + + + Sbjct: 20 EWEQRKAGNIFMTISDKNHA---HLPVLSASQELGMIRRDNIGINIKYNEKSLKNYKLVK 76 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWLLSIDVTQRIEA 139 GQ + L + + GI S + + +P+ + + + L S +R+E Sbjct: 77 PGQFVI-HLRSFQGGFAWSYITGITSPAYTILDYKEPQKNVSKFWKEVLTSPIFIKRLET 135 Query: 140 ICEGATM-SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I G + + P + EQ I + + I L + Sbjct: 136 ITYGIRDGRSISFADFSTLKFSAPSVDEQRKISAFFQQLDNNTTIQQNKLEKLISLKEAY 195 Query: 199 KQALV 203 Q + Sbjct: 196 LQNMF 200 >gi|322514822|ref|ZP_08067841.1| type I restriction-modification system [Actinobacillus ureae ATCC 25976] gi|322119204|gb|EFX91345.1| type I restriction-modification system [Actinobacillus ureae ATCC 25976] Length = 449 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 52/379 (13%), Positives = 112/379 (29%), Gaps = 26/379 (6%) Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVL 114 + + G I Y + + S ++V Sbjct: 69 DNKVGIFDAYISKGKEINQPYKKMETGFIAYNPYRINVGSIGLKTEKHQHQYISPAYVVF 128 Query: 115 QPKDVL-PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ------ 167 + L PE L + + I G+ + + + + +P+P + Q Sbjct: 129 SCQTTLLPEYLFLVFKTNFYNRIIRENTTGSVRQNLSFDNLIKMQIPLPDINTQKALAQA 188 Query: 168 ----VLIREKIIAETVRIDTLITER-----IRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 + +++ + +ID+ I + I+ ++ + + ++ + LN + Sbjct: 189 YQDKMAKADELEKQANQIDSDIEQYLFEQLGIEIQQTQKVQTGKLQFVNFRDLNLWGVVS 248 Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 I + + E+N I + N+ ++ K Sbjct: 249 QDAITAETIFKSNQFKNKPITNFFEINPTTQIPSNQIISFIPMANVSDIYGEISIYDKQT 308 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHGIDSTYLA 334 Y ++++ I + A +E G + K L Sbjct: 309 LKPNYTKFKENDLIWAKITPCMENGKSAIASNLENGFGFGSTEFHVLRAKNKDFSIHLLH 368 Query: 335 WLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN-VETARIDV 391 L+R+ L K+ Y GS +Q + +K L + V ++ Q I I + + D Sbjct: 369 SLLRTSHLRKIATQYFTGSAGQQRVPKSFLKALTLPVLNLEIQTKILTYIQTQKQQQKDS 428 Query: 392 LVEKIEQSIVLLKERRSSF 410 L I L E + Sbjct: 429 LATASAYRIEALMEFEKAI 447 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 71/204 (34%), Gaps = 4/204 (1%) Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 Q + V + +K + +++++ + E N K Sbjct: 4 SNFQTAFLHFVDFSQFNNWNVKQYVNTNLLK--SNFKIEFLAEHLIEQNNKIKPFDFPEK 61 Query: 257 LSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 G + K + Y+ ++ G I + + L++ + + I Sbjct: 62 DFAILGVDNKVGIFDAYISKGKEINQPYKKMETGFIAYNPYRINVGSIGLKTEKHQHQYI 121 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKE 374 + + + YL + ++ ++ +G +RQ+L F+++ ++ + +P I Sbjct: 122 SPAYVVFSCQTTLLPEYLFLVFKTNFYNRIIRENTTGSVRQNLSFDNLIKMQIPLPDINT 181 Query: 375 QFDITNVINVETARIDVLVEKIEQ 398 Q + + A+ D L ++ Q Sbjct: 182 QKALAQAYQDKMAKADELEKQANQ 205 >gi|257440123|ref|ZP_05615878.1| type I restriction system specificity protein [Faecalibacterium prausnitzii A2-165] gi|257197475|gb|EEU95759.1| type I restriction system specificity protein [Faecalibacterium prausnitzii A2-165] Length = 228 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 30/240 (12%), Positives = 69/240 (28%), Gaps = 20/240 (8%) Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS---GIEWVGLVPDHWEVKPFFALV 241 + + L+++ Q+ + +P+ G G P + + + Sbjct: 1 MRVIQTVNDNLEQQAQSYFQELFVDNADPEWTTGTISDLGTVVGGSTPSKAKPEYYTESG 60 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + N I +L RN + I+ G ++F Sbjct: 61 IAWITPKDLSNNKSKFVSHGENDITELGLRN--------SSASIMPEGTVLFSSRAPIGY 112 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 A + +V P T + L + + + Sbjct: 113 -----IAIAAGEVTTNQGFKSVVPKPEIGTPFVYFFLKNTLPVIEGMASGSTFKEVSGST 167 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +K +P ++P + ++ A I +E+ L R + + ++G+ID+ Sbjct: 168 MKNVPAVIPDAETLAKFSDF----CAPIFAQQRILEEQNQSLATLRDNLLPKLMSGEIDV 223 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 54/169 (31%), Gaps = 13/169 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYL---PKDGNSR 71 P+ W I + G T K I +I +D+ + K++ D Sbjct: 29 PE-WTTGTISDLGTVVGGSTPSKAKPEYYTESGIAWITPKDLSNNKSKFVSHGENDITEL 87 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 S+ SI +G +L+ P IA + + F + PK + + Sbjct: 88 GLRNSSASIMPEGTVLFSSRAPI-GYIAIAAGEVTTNQGFKSVVPKPEI-GTPFVYFFLK 145 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + IE + G+T + N+P IP + + Sbjct: 146 NTLPVIEGMASGSTFKEVSGSTMKNVPAVIPDAETLAKFSDFCAPIFAQ 194 >gi|17230969|ref|NP_487517.1| type I restriction-modification enzyme S subunit [Nostoc sp. PCC 7120] gi|17132610|dbj|BAB75176.1| type I restriction-modification enzyme S subunit [Nostoc sp. PCC 7120] Length = 383 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/319 (11%), Positives = 90/319 (28%), Gaps = 18/319 (5%) Query: 92 GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG-ATMSHAD 150 P + +++ G + + LP L + ++ + + + Sbjct: 52 SPIIVDYLLSSATGTANQANIGANTLRELPFPLPPLAEQKRIVEKCDRLLSICDEIEKRH 111 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER---IRFIELLKEKKQALVSYIV 207 + +I Q+L + + E + + +QA++ V Sbjct: 112 QQRQESIVRMNESAIAQLLSSQNPDDFRQHWQRICNNFDLLYSIPETIPKLRQAILQLAV 171 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN---- 263 L + + D +V + +++ K+T I I L N Sbjct: 172 QGKLTNQSSK------EIKKISDTHKVSDYVSILNGYAFKSTWFINDGIRLLRNANVGHG 225 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFR----FIDLQNDKRSLRSAQVMERGIITSA 319 ++ + + + +D +IV I + + + Sbjct: 226 DLRWDDVATISEERAQEFQRFKLDIDDIVISLDRPIISTGLKVARITKNDLPCLLLQRVG 285 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 K + + ++S S + + ++ + P +EQ I Sbjct: 286 KFEFKTDKVIPDFFFLWLQSPIFINAIDPGRSNGVPHISSKSIEAILFNPPSREEQKRIV 345 Query: 380 NVINVETARIDVLVEKIEQ 398 + + D L K++Q Sbjct: 346 EKCDRLMSLCDTLEAKLKQ 364 >gi|256826763|ref|YP_003150722.1| hypothetical protein Ccur_03130 [Cryptobacterium curtum DSM 15641] gi|256582906|gb|ACU94040.1| hypothetical protein Ccur_03130 [Cryptobacterium curtum DSM 15641] Length = 208 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 50/137 (36%), Gaps = 11/137 (8%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRS 339 + Y+ +IV+ +L K + + + + Y+ +++ ++ Sbjct: 76 KKYKETRLDDIVYNPANL---KFGAIARNTLRNAVFSPIYVTFNVDETAAPSFIEKVVTR 132 Query: 340 YDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + G R S+ E++ L V +P + EQ I + +D L+ Sbjct: 133 SRFIQGALRYQQGTVYERMSVSPEELCDLNVTLPYLDEQQYIGSY----FTNLDHLITLH 188 Query: 397 EQSIVLLKERRSSFIAA 413 ++ LK+ + S + Sbjct: 189 QRKCDKLKQLKQSLLEK 205 Score = 46.7 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 50/190 (26%), Gaps = 12/190 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLED---VESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ + + + +D + V T +Y + Sbjct: 23 WEQRKLGDVLTERNIQRA-QSEDFPLVSFTVENGVTPKTERYDREQLVRGDRAAKKYKET 81 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEA 139 I+Y + V + P ++ + Q Sbjct: 82 RLDDIVYNPANLKFGAIARNTLRNAVFSPIYVTFNVDETAAPSFIEKVVTRSRFIQGALR 141 Query: 140 ICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + ++ + +P L EQ I IT R + LK+ Sbjct: 142 YQQGTVYERMSVSPEELCDLNVTLPYLDEQQYIGSYFTNLDHL----ITLHQRKCDKLKQ 197 Query: 198 KKQALVSYIV 207 KQ+L+ + Sbjct: 198 LKQSLLEKMF 207 >gi|331681144|ref|ZP_08381781.1| type I restriction-modification system specificity determinant [Escherichia coli H299] gi|331081365|gb|EGI52526.1| type I restriction-modification system specificity determinant [Escherichia coli H299] Length = 434 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 50/431 (11%), Positives = 123/431 (28%), Gaps = 64/431 (14%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 I + G++S S + TG+Y P G++ S + I+ G Sbjct: 10 IGEHLLIRNGKSSPS------------RAITGEY-PVYGSNGIIGYSDEYNANENTIIIG 56 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 ++G Y I+ + ++ K E + + + G+ Sbjct: 57 RVGSYCGSVYISGKKCWVTDNAIIGTAK---NENESHFWFYLLKKIDLNNYSTGSGQPLI 113 Query: 150 DWKGIGNIPMPIPPLAEQVLIR-EKIIAETVRIDTLITERIRFIELLKEKKQA------- 201 + I I + IP L+E+ + + +I+ + ++ + ++ Sbjct: 114 NQTIINTISVTIPKLSEKRVSIGHFLRHFDQKINLSLNINQSLEQMSQTLFKSWFVDFDP 173 Query: 202 LVSYIVTKG--------------------------LNPDVKMKDSGIEW--VGLVPDHWE 233 ++ + G L + S E +G VP W Sbjct: 174 VIDNALDAGNPIPEALQSRAELRQKVRSSADFKPLLVEIRSLFPSEFEETELGWVPKGWT 233 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 +K + + Q ++ KP Y + +F Sbjct: 234 LKSVAKSININPSIKLPKNKIAKYVDMKSLPTQGYSISDIIEKP--YSGGAKFQNNDTLF 291 Query: 294 RFIDL---QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM---RSYDLCKVFY 347 I + E ++ ++ ++ + ++ L + Sbjct: 292 ARITPCLENGKTGFVDFLDEKETAFGSTEFIVMRGTPQVHYLYVACLARENNFRLHAIQN 351 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 +GS RQ ++ + +P + ++ + + + + L R Sbjct: 352 MVGSSGRQRVQNSCFDSFYIAIPTP----AVMSLFSGKVSSYFDKMYFCNLENKSLTALR 407 Query: 408 SSFIAAAVTGQ 418 + + ++G+ Sbjct: 408 DTLLPKLISGE 418 Score = 38.2 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 62/196 (31%), Gaps = 13/196 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G +PK W + + + +N K Y+ ++ + + + + S Sbjct: 225 LGWVPKGWTLKSVAKSININPSIKLPKNKIAKYVDMKSLPTQG----YSISDIIEKPYSG 280 Query: 78 VSIFAKGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQ--PKDVLPELLQGWL 128 + F L+ ++ P L + ST+F+V++ P+ + Sbjct: 281 GAKFQNNDTLFARITPCLENGKTGFVDFLDEKETAFGSTEFIVMRGTPQVHYLYVACLAR 340 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + I+ + + + + IP A L K+ + ++ E Sbjct: 341 ENNFRLHAIQNMVGSSGRQRVQNSCFDSFYIAIPTPAVMSLFSGKVSSYFDKMYFCNLEN 400 Query: 189 IRFIELLKEKKQALVS 204 L L+S Sbjct: 401 KSLTALRDTLLPKLIS 416 >gi|3806000|gb|AAC69262.1| type I restriction-modification enzyme S subunit homolog [Helicobacter pylori] Length = 159 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 44/137 (32%), Gaps = 1/137 (0%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N I +K ++ + ++ D L + ++ Sbjct: 18 NSIDIDGNLKNTMKRVNFYDNSLKQDDIVMVLSDVAHGDFLGLCAVIPSNDYVLNQRMGR 77 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 ++ L + K F G G + +L + ++ + +PP+ EQ I N+ Sbjct: 78 LRIRNDCINILFLRLYINANQKYFKMQGQGSSQLNLSKKAIEDFEIPLPPLNEQAAIANI 137 Query: 382 INVETARIDVLVEKIEQ 398 ++ I L K Q Sbjct: 138 LSDVDNEIISLKNKKRQ 154 >gi|321310230|ref|YP_004192559.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802074|emb|CBY92720.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 207 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 53/162 (32%), Gaps = 4/162 (2%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN--MGLKPESYETY 283 G ++++ + T ++ K+ +S + NI + PESY Sbjct: 7 GSDLKYFKLGDVCEVCTGVDFKSCSYRDSGFPIIKVRNIQDGQIVTDSLNYCDPESYRDA 66 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +IV G++V K + + YL + S Sbjct: 67 EIVKYGDVVMARAGSSG-KVGINLLDQEFFFDGNLFKFIPNTEMLIGRYLYHFLLS-RQE 124 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ++ + ++ +++L + +P ++ Q I ++ Sbjct: 125 EIQSLVKGSTIPVIRKSALEKLRIPIPSLEVQESIAQTLDKF 166 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 60/184 (32%), Gaps = 6/184 (3%) Query: 29 PIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + ++ TG +S I + +++ G I G Sbjct: 14 KLGDVCEVCTGVDFKSCSYRDSGFPIIKVRNIQDGQI-VTDSLNYCDPESYRDAEIVKYG 72 Query: 85 QILYGKLGPYLRK-AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 ++ + G + + D + P + + + + I+++ +G Sbjct: 73 DVVMARAGSSGKVGINLLDQEFFFDGNLFKFIPNTEMLIGRYLYHFLLSRQEEIQSLVKG 132 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T+ + + +PIP L Q I + + + E R I L ++ + Sbjct: 133 STIPVIRKSALEKLRIPIPSLEVQESIAQTLDKFREIEREIEREIEREISLRDKQYEYYR 192 Query: 204 SYIV 207 +Y++ Sbjct: 193 NYLI 196 >gi|332829720|gb|EGK02366.1| hypothetical protein HMPREF9455_01636 [Dysgonomonas gadei ATCC BAA-286] Length = 363 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 17/162 (10%), Positives = 54/162 (33%), Gaps = 4/162 (2%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + E+ I ++ + + G P + + ++ + ++ + Sbjct: 21 EEWSETEIKNILKIGSGRDYKHLETGNIPVFGTGGYMTSINDFLYDGESVCIGRKGTINK 80 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 +G + + + +L ++ + SL ++++ + Sbjct: 81 PFYLKGKFWTVDTLFYTYSYKNIQPKFLFYIFEQINWLKYNEASGVPSLSKSTIEKILIA 140 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +P +EQ I + + ID +E + I K+ +++ Sbjct: 141 IPKKEEQDKIATFL----SLIDERIETQNKIIEEYKKLKNAL 178 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 55/390 (14%), Positives = 111/390 (28%), Gaps = 51/390 (13%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + W IK K+ +GR + +E+G G ++ Sbjct: 21 EEWSETEIKNILKIGSGRDYK-----------HLETGNIPVFGTGGYMTSI---NDFLYD 66 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 + G+ G + + T F K++ P+ L E Sbjct: 67 GESVCIGRKGTINKPFYLKGKFWTVDTLFYTYSYKNIQPKFLFYIFE----QINWLKYNE 122 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + I I + IP EQ I + ID I + + IE K+ K AL Sbjct: 123 ASGVPSLSKSTIEKILIAIPKKEEQDKIATFL----SLIDERIETQNKIIEEYKKLKNAL 178 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + G + V ++ + + N Sbjct: 179 ------------------AVFFFGTSVKYTSVGEICDVIMGQSPSSAAYNYVNNGLPLIQ 220 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + E S T Q + G+I+ + RGI Sbjct: 221 GNLDISEGTTSPRMWTSEITKQ-CEIGDIILTVRAPVGVVAKSNMIACVGRGIC----AI 275 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 S Y+ ++ Y K ++ +++ L + +P I ++ + + I Sbjct: 276 KVKESKCSEYVYQYLQ-YFKNKWISIEQGSTFSAISRDNI--LSISIPSITKRLTVASHI 332 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIA 412 A D + + + + ++ ++ Sbjct: 333 ---LALFDNKINAEISFLKMYRSQKQFLLS 359 >gi|171920617|ref|ZP_02695518.2| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 13 str. ATCC 33698] gi|171903331|gb|EDT49620.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 13 str. ATCC 33698] Length = 358 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 44/392 (11%), Positives = 104/392 (26%), Gaps = 44/392 (11%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---STVSIFAK 83 + + + G T I + ++ G Y + ++ + K Sbjct: 4 IYKLGSLVNIYKGST--------LITKKYIDENQGIYPVISSKTTENGIYGFINRYDYEK 55 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA--IC 141 +I +G + + + V + + +++ + I Sbjct: 56 NKITMSLIGENAGTFFWQEKNFSLTNNACVFISNKNINYNYKYLFITLKKHEYKIKEFIV 115 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ + + + +P + Q I I I I I L EK Sbjct: 116 IGSARPMISSNHLKLVDVNLPSIEIQDAIISIIEPLEKSI-KTINLLQTKIGLFIEKTFN 174 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 ++ + + +KD GL I Sbjct: 175 FINNNLANADLIEFSLKDLLNIKRGLP---------------------------ITEKDL 207 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 N + K Y + I + + + + Sbjct: 208 LNNPGNYPLISASSKNNGIFGYFNDYMYDGKNITISMNGNAGCIFYQIGKFSANSDVLVL 267 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + + + + + ++ R L +++ VL+P ++ Q + + + Sbjct: 268 SNSNKNLTNIDYIYYLLKTKEKEIQNLAIGTTRFRLGNSVIEKFKVLLPNMEIQKEFSKI 327 Query: 382 INVETARIDVLVEKIEQSIV--LLKERRSSFI 411 + + V KIE+++ LLK + I Sbjct: 328 VEPLL-NLSTKVNKIEKNLNECLLKIVKKLII 358 >gi|315149123|gb|EFT93139.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0012] Length = 314 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 18/123 (14%), Positives = 44/123 (35%), Gaps = 6/123 (4%) Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 ++ +D N + ++ + +DS + + K + + Sbjct: 1 MYGKLDFLNQAFGIVPIELDGYESTVDSPSFDFKPLVDSVFFLEYVSLEKFYKYQGNIAN 60 Query: 352 GLRQ--SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G R+ + E +P+ P KEQ I ++D + ++ + LKE + + Sbjct: 61 GSRKAKRIHVETFFNMPLPTPSYKEQQKIG----TLFKQLDDTITLHQRKLEQLKELKKA 116 Query: 410 FIA 412 ++ Sbjct: 117 YLQ 119 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 28/241 (11%), Positives = 73/241 (30%), Gaps = 17/241 (7%) Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 +KI ++D IT R +E LKE K+A + + + K+ + Sbjct: 87 QKIGTLFKQLDDTITLHQRKLEQLKELKKAYLQLMFVPTNTKNNKVPKLRFANFEENWEL 146 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +++ + K L ++ L + L S I+ G Sbjct: 147 CKLENIIEKQIKGKAKVENLCNGSVEYLDANRLNGGKPIYTKALSDVSERDIIILWDGS- 205 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +G++ S A + ++ + + ++ + Sbjct: 206 ------------KAGKVYYGFKGVLGSTLKAYQLKEYANSQFIYQQLLDNQNNIYNNYRT 253 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + P+ + +EQ + +++ + +D + + + + S++ Sbjct: 254 PNIPHVVKNFSSIFPIWMTSFEEQSQMADIL----SNLDNRIILQQNLTDTMISLKKSYL 309 Query: 412 A 412 Sbjct: 310 Q 310 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 59/184 (32%), Gaps = 15/184 (8%) Query: 23 KHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKD--GNSRQSDTSTVS 79 ++W++ ++ + G+ +E++ +G+ +YL + + T +S Sbjct: 142 ENWELCKLENIIEKQIKGKAK----------VENLCNGSVEYLDANRLNGGKPIYTKALS 191 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++ I+ G K F G+ + Q K+ + +D I Sbjct: 192 DVSERDIIILWDGSKAGKVYY-GFKGVLGSTLKAYQLKEYANS-QFIYQQLLDNQNNIYN 249 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + H P+ + EQ + + + RI I L K Sbjct: 250 NYRTPNIPHVVKNFSSIFPIWMTSFEEQSQMADILSNLDNRIILQQNLTDTMISLKKSYL 309 Query: 200 QALV 203 Q + Sbjct: 310 QNMF 313 >gi|57865902|ref|YP_190014.1| type I restriction-modification system S subunit [Staphylococcus epidermidis RP62A] gi|57636560|gb|AAW53348.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus epidermidis RP62A] Length = 381 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 51/355 (14%), Positives = 110/355 (30%), Gaps = 33/355 (9%) Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + K+ + S + I GQ++YGKL + + + D + Sbjct: 53 IVEKESIFKGSSNTQYYIRKAGQLMYGKLDFLNCAFGLVPTELNNFESTIDSPSFDFIKG 112 Query: 123 LLQGWLLSIDVTQRIEAI----CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + L I + + + ++P+ P + EQ I + Sbjct: 113 DKKFLLERIKMKSFYKKYGDLANGSRKAKRINQNTFLSMPLYAPTINEQKKIGDFFSKLD 172 Query: 179 VRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 +I+ + + + Q + S + + G W K F Sbjct: 173 RQIELEEKKLELLEQQKRGYMQKIFSQQLRFK------------DEKGNDYPKWIFKKFE 220 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 + + K ++ S I + +I + + +G + + ++ Sbjct: 221 EIFKVVPSKKYQIKSSEIEDNASIPVIDQGQNLILGFSNNKEKVFNDFK--NVIIYGDHT 278 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 KRS + + G+ + D +YL ++ F G ++ Sbjct: 279 TVIKRSDKPFIIGGDGV----KLLTSKVDSDISYLYNALQ------YFNVKSEGYKRHFS 328 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 K + I+EQ I N+ ++D +EK + LLK+R+ + Sbjct: 329 ILKNKDFYIST-SIEEQKRIANI----FNKLDKYIEKQFAKVELLKQRKQGLLQK 378 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 45/128 (35%), Gaps = 3/128 (2%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + + K S Y I G++++ +D N L ++ T + Sbjct: 49 WGKGIVEKESIFKGSSNTQYYIRKAGQLMYGKLDFLNCAFGLVPTEL-NNFESTIDSPSF 107 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ--SLKFEDVKRLPVLVPPIKEQFDITNV 381 D +L ++ K + + +G R+ + +P+ P I EQ I + Sbjct: 108 DFIKGDKKFLLERIKMKSFYKKYGDLANGSRKAKRINQNTFLSMPLYAPTINEQKKIGDF 167 Query: 382 INVETARI 389 + +I Sbjct: 168 FSKLDRQI 175 >gi|139438176|ref|ZP_01771729.1| Hypothetical protein COLAER_00717 [Collinsella aerofaciens ATCC 25986] gi|133776373|gb|EBA40193.1| Hypothetical protein COLAER_00717 [Collinsella aerofaciens ATCC 25986] Length = 116 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 37/91 (40%), Gaps = 5/91 (5%) Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 + + + + + D YLA+++RS + + G+ R ++ V L V Sbjct: 8 DQDDVYLNSFCFGYRQDSTFDPHYLAYMLRSSSIRSDLTLLAQGISRFNISKNKVMELSV 67 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 VP EQ I AR+D L+ ++ Sbjct: 68 PVPSAAEQKQIGQY----FARLDSLITLHQR 94 >gi|88811759|ref|ZP_01127013.1| type I restriction-modification system specificity determinant XF2741 [Nitrococcus mobilis Nb-231] gi|88791150|gb|EAR22263.1| type I restriction-modification system specificity determinant XF2741 [Nitrococcus mobilis Nb-231] Length = 421 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 55/419 (13%), Positives = 129/419 (30%), Gaps = 32/419 (7%) Query: 32 RFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKL 91 N E GK++ +I + + S F G L K+ Sbjct: 5 EIVAFNPTTPLEKGKELPFIEMAALPISERDIPTFQYRVAGGSGSK---FRNGDTLLAKI 61 Query: 92 GPYLR-------KAIIADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQRIEAICEG 143 P L + + D G ST+F+V++ ++ E + + G Sbjct: 62 TPCLENGKGGQVRGLPGDGVGHGSTEFIVMRARERSDEQFVYYLSRLPEFRKFAIQQMTG 121 Query: 144 AT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + +W+ + N + + I + +I+ + + + Sbjct: 122 TSGRQRVNWQSLTNFDVADLDGELRESIGATLGVLDDKIELNRRMNETLEAMARAIFKDW 181 Query: 203 V-----SYIVTKGLNPDVKMKDSGI---EWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + +G P + + G + + + E NR+ + S Sbjct: 182 FIDFGPTRAKAEGRAPYLAPDVWDLFAGTLDGEHKPARWLVRPASDLFEFNRRESLRKGS 241 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 L + + E Y++ G+ +F I + + Sbjct: 242 EAPYLDMAALPTIGPVPDAPSIRE-YKSGSKFRDGDTLFARITPCLENGKTAYVFGLGDE 300 Query: 315 II---TSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVL 368 +I ++ ++ ++ ++++ R +G RQ + E +++ P++ Sbjct: 301 VIGAGSTEFIVIRSRPPLPLPASYVLARDPGFRAHAERSMTGTSGRQRVNAEALRQYPIV 360 Query: 369 VPPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 P + ++I+ I + +S L + R + +TG+I LR + Sbjct: 361 APSDSRLWKALGDLIDPMMGGI---IANALESRTLART-RDLLLPKLLTGEIRLRDAEK 415 Score = 40.5 bits (93), Expect = 0.57, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 41/130 (31%), Gaps = 12/130 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P W V P + N + G + Y+ + + + P + + S F Sbjct: 217 PARWLVRPASDLFEFNRRESLRKGSEAPYLDMAALPT----IGPVPDAPSIREYKSGSKF 272 Query: 82 AKGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDV 133 G L+ ++ P L + G ST+F+V++ + LP Sbjct: 273 RDGDTLFARITPCLENGKTAYVFGLGDEVIGAGSTEFIVIRSRPPLPLPASYVLARDPGF 332 Query: 134 TQRIEAICEG 143 E G Sbjct: 333 RAHAERSMTG 342 >gi|284007654|emb|CBA73343.1| restriction modification system DNA specificity domain [Arsenophonus nasoniae] Length = 363 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 59/184 (32%), Gaps = 13/184 (7%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN-MGLKPESYETYQIVD----PGE 290 + N + + + N+ + N G S E + G+ Sbjct: 12 SIISGPFGSNIGQRFFQDVGVPVIRGNNLTTDFKKFNDEGFVFLSEEKANELKADAIRGD 71 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIIT--SAYMAVKPHGIDSTYLAWLMRSYDLCK-VFY 347 I+F + + +R +I+ + + P D Y+ + + S + K + Sbjct: 72 ILFTAAGTIGQVGMIPQSSKYDRYVISNKQLRLRIDPEKADPNYVYYWLASPWIYKTIVD 131 Query: 348 AMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + +K LP+++P I EQ I + + ID ++ L+ Sbjct: 132 RNTGSTVPLINLGIIKTLPIVLPEDIFEQKKI----SKIFSLIDKKIDLNNHINTELEAM 187 Query: 407 RSSF 410 + Sbjct: 188 AKTL 191 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 18/139 (12%), Positives = 38/139 (27%), Gaps = 15/139 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQ 72 IP+ W + + TK+ G T + D I ++ + S + Sbjct: 225 EIPEGWGISRVGSVTKIELGGTPSTKVDSYWENANIPWLSSTETASFPVVSAEQMVTQSG 284 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA--------DFDGICSTQFLVLQPKDVLPELL 124 D S ++ KG ++ + + + F V L + Sbjct: 285 IDNSAATLLPKGTVVISIVRYIRPSIFVMVNKFCRRSKRHFLVHDIFDVALNNQSLNGVD 344 Query: 125 QGWLLSIDVTQRIEAICEG 143 + + I + Sbjct: 345 NNCKTEYNFQLHRQQIDDS 363 Score = 44.4 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 42/300 (14%), Positives = 88/300 (29%), Gaps = 29/300 (9%) Query: 37 NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK--GQILYGKLGPY 94 N G+ + I ++ + K+ + + + G IL+ G Sbjct: 21 NIGQRFFQDVGVPVIRGNNLTTDFKKFNDEGFVFLSEEKANELKADAIRGDILFTAAGTI 80 Query: 95 LRKAIIAD----FDGICS--TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 + +I + S L + P+ P + WL S + + I G+T+ Sbjct: 81 GQVGMIPQSSKYDRYVISNKQLRLRIDPEKADPNYVYYWLASPWIYKTIVDRNTGSTVPL 140 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + I +P+ +P + +KI ID I L+ + L Y Sbjct: 141 INLGIIKTLPIVLPEDIFEQ---KKISKIFSLIDKKIDLNNHINTELEAMAKTLYDYWFV 197 Query: 209 KGLNPD---VKMKDSGIEWVG------LVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + PD K SG + V +P+ W + ++ + Sbjct: 198 QFDFPDANGKPYKTSGGKMVYNSILKREIPEGWGISRVGSVTKIELGGTPSTKVDSYWEN 257 Query: 260 SYGNIIQKLETRNMGLKP---------ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + ET + + ++ G +V + + + Sbjct: 258 ANIPWLSSTETASFPVVSAEQMVTQSGIDNSAATLLPKGTVVISIVRYIRPSIFVMVNKF 317 >gi|329921180|ref|ZP_08277695.1| hypothetical protein HMPREF9210_0068 [Lactobacillus iners SPIN 1401G] gi|328934718|gb|EGG31214.1| hypothetical protein HMPREF9210_0068 [Lactobacillus iners SPIN 1401G] Length = 197 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 55/193 (28%), Gaps = 7/193 (3%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 W + ++ N + + E G + + Y Sbjct: 9 DWIEGSLSDIANITMGQSPSGSSYNEDGIGTIFFQGRAEF---GFRFPTIRLYTTEPKRM 65 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I + I A+ +++ + M S + Sbjct: 66 AYANDILMSVRAPVGDLNVSHNDCCIGRGLAAIHSKTNHQSFVLYTMFSLKKQFNVFNGE 125 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + S+ + +P+L+P EQ + + A +D + I L+ R S Sbjct: 126 GTVFGSINRNSLNDMPILIPD-DEQIE---KFELIVAPMDATIRNNYDEICCLQAVRDSL 181 Query: 411 IAAAVTGQIDLRG 423 + ++G++D+ Sbjct: 182 LPRLMSGELDVSD 194 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 20/181 (11%), Positives = 43/181 (23%), Gaps = 2/181 (1%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W + + G++ G ++ + R T + Sbjct: 9 DWIEGSLSDIANITMGQSPSGSSYNEDGIGTIFFQGRAEFGFRFPTIRLYTTEPKRMAYA 68 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 IL P ++ D + K + + + Q EG Sbjct: 69 NDILMSVRAPV-GDLNVSHNDCCIGRGLAAIHSKT-NHQSFVLYTMFSLKKQFNVFNGEG 126 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + ++P+ IP + + I E + L+ Sbjct: 127 TVFGSINRNSLNDMPILIPDDEQIEKFELIVAPMDATIRNNYDEICCLQAVRDSLLPRLM 186 Query: 204 S 204 S Sbjct: 187 S 187 >gi|312886111|ref|ZP_07745732.1| restriction modification system DNA specificity domain protein [Mucilaginibacter paludis DSM 18603] gi|311301410|gb|EFQ78458.1| restriction modification system DNA specificity domain protein [Mucilaginibacter paludis DSM 18603] Length = 185 Score = 65.6 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 50/133 (37%), Gaps = 6/133 (4%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 L E ++ G+++F +N + + + + + + YL Sbjct: 46 DLMAEGISEKHLLKNGDVLFAAKGTKNFAAVFENHNEASVASTSFFVIRLTGETLLAEYL 105 Query: 334 AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 A + SY + A G S+ + ++ L + VP ++ Q I + ++ Sbjct: 106 ALFLNSYTTQTILKAQAIGTSMPSISKQVLENLEITVPGLEIQKAILQI-----NKLRNK 160 Query: 393 VEKIEQSIVLLKE 405 + ++ I +L+E Sbjct: 161 EKVLKNKIEVLRE 173 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 57/170 (33%), Gaps = 8/170 (4%) Query: 30 IKRFTKLNTGRT--SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 IK T + TG +++Y+ + + + S + G +L Sbjct: 5 IKDITNIQTGLFAKPSGIGEVVYLQSKHFDEYGQLLSILHPDLMAEGISEKHLLKNGDVL 64 Query: 88 YGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G A+ + + S + L + +L E L +L S ++A G Sbjct: 65 FAAKGTKNFAAVFENHNEASVASTSFFVIRLTGETLLAEYLALFLNSYTTQTILKAQAIG 124 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLI--REKIIAETVRIDTLITERIRF 191 +M + + N+ + +P L Q I K+ + + I Sbjct: 125 TSMPSISKQVLENLEITVPGLEIQKAILQINKLRNKEKVLKNKIEVLREK 174 >gi|283795956|ref|ZP_06345109.1| putative type I restriction-modification system specificity protein [Clostridium sp. M62/1] gi|291076601|gb|EFE13965.1| putative type I restriction-modification system specificity protein [Clostridium sp. M62/1] Length = 332 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 39/315 (12%), Positives = 101/315 (32%), Gaps = 21/315 (6%) Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + L L S+ + ++IE G + H + +PIP + Q +I + A + Sbjct: 25 YNKYLLSVLRSVKIQKQIEQTSVGDVIPHFKKSFFDQLLIPIPSMEIQKIIGDYYFAFSE 84 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +I+ + ++ +++ E + + + K Sbjct: 85 KIEINKKINDNLERQAQLLFKSWFVDFEPFNGTMPSELEVVPFEKIVDFQNGYAFKS--K 142 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + + + G I + S ++ G+I+ D++ Sbjct: 143 ELLNEPSSDCYQVFKQGHIARGGGFIPDGTKSWYPKRLASKLGKFVLKKGDILMAMTDMK 202 Query: 300 NDKRSLRSAQV---MERGIITS--AYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMG-SG 352 ++ L + + I+ + + + +L+ S D + SG Sbjct: 203 DNVAILGNTAIMPIDNEYIVNQRVGLLRTNGYKGITYPFIYLLTNSKDFLIDLRSRANSG 262 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI----DVLVEKIEQSIVLLKERRS 408 ++ +L ++K ++P +N + I + + L + R Sbjct: 263 VQVNLSSAEIKASRTILPS--------EKVNTAFSEITLPMFEAIISNQLENQRLAQLRD 314 Query: 409 SFIAAAVTGQIDLRG 423 + + ++G+ID+ Sbjct: 315 TLLPRLMSGEIDVSD 329 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 37/102 (36%), Gaps = 5/102 (4%) Query: 307 SAQVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKR 364 ++ I + + YL ++RS + K G + K + Sbjct: 2 VPDPIDFCIAQDMVALRVNDAKVYNKYLLSVLRSVKIQKQIEQTSVGDVIPHFKKSFFDQ 61 Query: 365 LPVLVPPIKEQFDITNVINVETARID---VLVEKIEQSIVLL 403 L + +P ++ Q I + + +I+ + + +E+ LL Sbjct: 62 LLIPIPSMEIQKIIGDYYFAFSEKIEINKKINDNLERQAQLL 103 Score = 36.7 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 54/207 (26%), Gaps = 21/207 (10%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNS 70 G +P +VVP ++ G +S + + G G + Sbjct: 116 GTMPSELEVVPFEKIVDFQNGYAFKSKELLNEPSSDCYQVFKQGHIARGGGFIPDGTKSW 175 Query: 71 RQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQ---------PK 117 + KG IL AI+ + + +++V Q K Sbjct: 176 YPKRLASKLGKFVLKKGDILMAMTDMKDNVAILGNTAIMPIDNEYIVNQRVGLLRTNGYK 235 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + S D + + + I +P E + Sbjct: 236 GITYPFIYLLTNSKDFLIDLRSRANSGVQVNLSSAEIKASRTILPSEKVNTAFSEITLPM 295 Query: 178 TVRIDTLITERIRFIELLKEKKQALVS 204 I + E R +L L+S Sbjct: 296 FEAIISNQLENQRLAQLRDTLLPRLMS 322 >gi|225351809|ref|ZP_03742832.1| hypothetical protein BIFPSEUDO_03410 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225157056|gb|EEG70395.1| hypothetical protein BIFPSEUDO_03410 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 166 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 52/136 (38%), Gaps = 8/136 (5%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH- 326 + G S +Y+ + G+I F + GI++ + ++P Sbjct: 3 FNSTGNGADESSLPSYKRLRLGDIAFEGHANKEFAYGRFVLNDAGNGIMSPRFTCLRPIV 62 Query: 327 GIDSTYLAWLMRSYDLCK--VFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVIN 383 + ++ + + S ++ + + + SG + L +D +LVP + EQ I + Sbjct: 63 EQEYSFWKYFIHSEEVMRPILVNSTKSGTMMNELVVKDFLEQEILVPSLPEQRQIGAFFD 122 Query: 384 VETARIDVLVEKIEQS 399 +D L+ ++ Sbjct: 123 C----LDSLITLHQRK 134 >gi|327184404|gb|AEA32849.1| N-6 DNA methylase [Lactobacillus amylovorus GRL 1118] Length = 609 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/372 (8%), Positives = 105/372 (28%), Gaps = 12/372 (3%) Query: 44 SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF 103 S II G Y + + S+++ K I+ + + + Sbjct: 221 SKDAIIENRFNRFRYGDITYTKGESAFISNAISSLNQTGKAVIVVSDGPLFQGGKVASFR 280 Query: 104 DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 + + L + + + + Sbjct: 281 KFLVDHDLIETVIALPSSLLSYSIIPINILIINKNKTDSKGQIQFINANQNEWYQTDKHG 340 Query: 164 LAE----QVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 + ++ ++ + + + N + Sbjct: 341 KRILSTLGIQKIVELYHSRASVEGKSAIFANTDYKGTLGIKQYILPSEVQLDNSTYHINR 400 Query: 220 SGIEWVG--LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 S ++ + + + +K + + K + + + + ++ + I + +K Sbjct: 401 SALQNLNTVQLQELVNIKRGYNVTRRNEDKKGRYLTAKVTDITTDHHINDSNLTRINIKT 460 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLA 334 + +++ +I+ + + + A + VK + ++ +L Sbjct: 461 NAES--YLIENNDILISTRGTIGKVAFVNNIKQCTVPNANLAILRVKSSKLNTVNMIWLM 518 Query: 335 WLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + S + + +G ++ +D+ ++P+ V P++ Q A+++ Sbjct: 519 LYLASPLGQFMIQQVATGTAISTISTKDLGKIPIPVLPLEAQNKAVQQFQTVQAKLNAEK 578 Query: 394 EKIEQSIVLLKE 405 +++ I +E Sbjct: 579 AALQKKIEANQE 590 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 61/191 (31%), Gaps = 12/191 (6%) Query: 26 KVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 V ++ + G R + + + D+ + + + Sbjct: 407 NTVQLQELVNIKRGYNVTRRNEDKKGRYLTAKVTDITTDHHINDSNLTRINIKTNAESYL 466 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGIC----STQFLVLQPKDVLPELLQG---WLLSIDV 133 IL G + A + + + L ++ + + +L S Sbjct: 467 IENNDILISTRGTIGKVAFVNNIKQCTVPNANLAILRVKSSKLNTVNMIWLMLYLASPLG 526 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I+ + G +S K +G IP+P+ PL Q ++ +++ + IE Sbjct: 527 QFMIQQVATGTAISTISTKDLGKIPIPVLPLEAQNKAVQQFQTVQAKLNAEKAALQKKIE 586 Query: 194 LLKEKKQALVS 204 +E+ + ++ Sbjct: 587 ANQEELYSSMN 597 >gi|134294390|ref|YP_001118125.1| restriction endonuclease S subunits-like protein [Burkholderia vietnamiensis G4] gi|134137547|gb|ABO53290.1| Restriction endonuclease S subunits-like protein [Burkholderia vietnamiensis G4] Length = 424 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/343 (12%), Positives = 98/343 (28%), Gaps = 34/343 (9%) Query: 47 DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD-- 104 IY + G G Y + + + + ++ K+ + + Sbjct: 20 GTIYRQIGVRLWGEGAYERESIDGADTKYPNFNRIEADDLVVNKIWARNGSVAVVTTELS 79 Query: 105 -GICSTQFLVLQPK--DVLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMP 160 G ST+F K +LP ++ Q + +G + + I +P Sbjct: 80 GGYVSTEFPAYTLKGERILPAWMRLVTKWRGFWQACDEKAQGTSGKNRIKPGEFLAIEIP 139 Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 +PPL EQ I K+ + + L ++ + Sbjct: 140 LPPLPEQRAIVAKLDELSDKTTQLNAYLDTVEADADALIRSYM----------------- 182 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES- 279 G + +E + LV+ + + + + + + Sbjct: 183 ----FGEQANGYEKRKMSELVSLRSTDVAVDNTQEYRFAGVYSFGRGVFASAVKSGSDFA 238 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLM 337 YE V G+ + + + + + +++ + + L Sbjct: 239 YERLSTVKAGDFTYPKLMAWEGALGVVPPEC-DGMVVSPEFPVFTVNTDAVLPEVLDIYF 297 Query: 338 RSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFD 377 R+ + A+ G R+ L+ D + VPP+ Q Sbjct: 298 RTPSVWPELAALSGGTNLRRRRLQPSDFLEYEMSVPPMPVQTK 340 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 16/129 (12%), Positives = 49/129 (37%), Gaps = 5/129 (3%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH- 326 E ++ Y + ++ ++V I +N ++ + ++ G +++ + A Sbjct: 36 YERESIDGADTKYPNFNRIEADDLVVNKIWARNGSVAVVTTELSG-GYVSTEFPAYTLKG 94 Query: 327 -GIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I ++ + + + G + +K + + + +PP+ EQ I ++ Sbjct: 95 ERILPAWMRLVTKWRGFWQACDEKAQGTSGKNRIKPGEFLAIEIPLPPLPEQRAIVAKLD 154 Query: 384 VETARIDVL 392 + + L Sbjct: 155 ELSDKTTQL 163 >gi|241763495|ref|ZP_04761548.1| restriction modification system DNA specificity domain protein [Acidovorax delafieldii 2AN] gi|241367336|gb|EER61667.1| restriction modification system DNA specificity domain protein [Acidovorax delafieldii 2AN] Length = 325 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/323 (13%), Positives = 93/323 (28%), Gaps = 31/323 (9%) Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 +L + IE+ G + I + +P+PPLA Q I + RI L Sbjct: 3 YLTHPTIKSYIESFNAGGSRRAITKAHIESFVVPLPPLATQRAIAALLGGIDDRITLLRE 62 Query: 187 ERIRFIELLKEKKQALVS-----YIVTKGLNPDVK------MKDSGIE--WVGLVPDHWE 233 + + ++ +G P+ + G E +G VP W Sbjct: 63 TNATLEAIAQALFKSWFVDFDPVRAKMEGRTPEGMDEATAALFPDGFETSELGEVPRGWR 122 Query: 234 VKPFFALV-------TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY---ETY 283 V + T K + I G + + + Sbjct: 123 VGCIDDICSTVTNGGTPSRSKTEYWEQGTIPWFKTGEFHDGFLLQPSERITNAALIGSSV 182 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +++ ++ R ++E A + + + Sbjct: 183 KLLPKDAVLMAIYAAPTVGR---LGILVEPATFNQACTGMVARNEVGPWFLFWTLLNGRD 239 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 +Q++ V ++PP + + N+ + I + + + L Sbjct: 240 WFNSRANGAAQQNISKAIVSAYLTVIPPNP----VLDSFNLVASGIHEAIRMNTEKAMTL 295 Query: 404 KERRSSFIAAAVTGQIDLRGESQ 426 R + + ++GQ+ L E+Q Sbjct: 296 STLRDTLLPRLISGQLRL-PEAQ 317 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 53/196 (27%), Gaps = 10/196 (5%) Query: 18 IGAIPKHWKVVPIKRFTK-LNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGN 69 +G +P+ W+V I + G T K I + + G + Sbjct: 114 LGEVPRGWRVGCIDDICSTVTNGGTPSRSKTEYWEQGTIPWFKTGEFHDGFLLQPSERIT 173 Query: 70 SRQSDTSTVSIFAKGQILYGKL-GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + S+V + K +L P + + I + + ++ + + Sbjct: 174 NAALIGSSVKLLPKDAVLMAIYAAPTVGRLGILVEPATFNQACTGMVARNEV-GPWFLFW 232 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 ++ + GA + + IPP I + Sbjct: 233 TLLNGRDWFNSRANGAAQQNISKAIVSAYLTVIPPNPVLDSFNLVASGIHEAIRMNTEKA 292 Query: 189 IRFIELLKEKKQALVS 204 + L L+S Sbjct: 293 MTLSTLRDTLLPRLIS 308 >gi|332673348|gb|AEE70165.1| possible type I R-M system S protein [Helicobacter pylori 83] Length = 236 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 49/166 (29%), Gaps = 11/166 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 35 PKGVEFKTLEEIFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 94 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + QF L K ++ + Sbjct: 95 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCDLALDMKFFFYQCF 153 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + + + + D PIPPL Q I + + Sbjct: 154 LLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDQF 199 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 56/189 (29%), Gaps = 14/189 (7%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG---------LKPES 279 P E K + N + + R G + P++ Sbjct: 35 PKGVEFKTLEEIFEIKNGYTPSKNNPEFWEKGTIPWFRMEDIRENGRILKDSIQHITPKA 94 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + ++ I+ + L + + +++ K + + + + Sbjct: 95 LKGKKLFPKNSIIISTTATIGEHALLIVDSLANQQFT---FLSKKANCDLALDMKFFFYQ 151 Query: 340 YDLCKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 L + + S+ K+ +PP++ Q +I +++ + L+ I Sbjct: 152 CFLLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDQFSILTTDLLAGIP 211 Query: 398 QSIVLLKER 406 I K++ Sbjct: 212 AEIEARKKQ 220 >gi|59801123|ref|YP_207835.1| hypothetical protein NGO0699 [Neisseria gonorrhoeae FA 1090] gi|194098762|ref|YP_002001824.1| hypothetical protein NGK_1199 [Neisseria gonorrhoeae NCCP11945] gi|239999056|ref|ZP_04718980.1| hypothetical protein Ngon3_06205 [Neisseria gonorrhoeae 35/02] gi|240113035|ref|ZP_04727525.1| hypothetical protein NgonM_05611 [Neisseria gonorrhoeae MS11] gi|240115792|ref|ZP_04729854.1| hypothetical protein NgonPID1_06034 [Neisseria gonorrhoeae PID18] gi|240125826|ref|ZP_04738712.1| hypothetical protein NgonSK_06367 [Neisseria gonorrhoeae SK-92-679] gi|260440390|ref|ZP_05794206.1| hypothetical protein NgonDG_04756 [Neisseria gonorrhoeae DGI2] gi|268594900|ref|ZP_06129067.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02] gi|291043687|ref|ZP_06569403.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2] gi|293398985|ref|ZP_06643150.1| type I restriction enzyme, S subunit [Neisseria gonorrhoeae F62] gi|59718018|gb|AAW89423.1| hypothetical protein NGO0699 [Neisseria gonorrhoeae FA 1090] gi|193934052|gb|ACF29876.1| Conserved hypothetical protein [Neisseria gonorrhoeae NCCP11945] gi|268548289|gb|EEZ43707.1| conserved hypothetical protein [Neisseria gonorrhoeae 35/02] gi|291012150|gb|EFE04139.1| conserved hypothetical protein [Neisseria gonorrhoeae DGI2] gi|291610399|gb|EFF39509.1| type I restriction enzyme, S subunit [Neisseria gonorrhoeae F62] gi|317164348|gb|ADV07889.1| hypothetical protein NGTW08_0921 [Neisseria gonorrhoeae TCDC-NG08107] Length = 400 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 61/415 (14%), Positives = 127/415 (30%), Gaps = 38/415 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 H+K I+ N G + + ++ + + + F Sbjct: 2 NHFKKQQIQNIADFNPREQLAKGALAKSVPMAMLKEFQRQITGYEIKAFNGGAK----FR 57 Query: 83 KGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKD-VLPELLQGWLLSIDVT 134 G L K+ P L + ST+F+VL+ K+ PE L + +S D Sbjct: 58 NGDTLLAKITPCLENGKTAFVDILDDGEVAFGSTEFIVLRAKNETNPEFLYYFAISPDFR 117 Query: 135 QRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +R EG + + + + +PIP Q I + +D I + Sbjct: 118 KRAIECMEGTSGRQRVNENALKTLELPIPEPQIQQSIAAVL----SALDKKIALNKQINA 173 Query: 194 LLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALV--TELNRKN 248 L+E + L Y + PD K SG + V E+ + + K Sbjct: 174 RLEEMAKTLYDYWFVQFDFPDANGKPYKSSGGDMVFDETLKREIPKGWGSIELQSCLAKI 233 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + ++ + + + I++P + F D R ++ Sbjct: 234 PNTTKILNKDIKDFGKYPVVDQSQDFICGFTNDEKSILNPQDAHIIFGD---HTRIVKLV 290 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + + + YL + + G + + +K ++ Sbjct: 291 NFQYARGADGTQVILSNNERMPNYLFYQI-----INQIDLSSYGYARHF--KFLKEFKII 343 Query: 369 VPPIKEQFDITNVINVETARI-DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +P + N ++ + L + L + R + + GQ+ +R Sbjct: 344 LPSKDISQKYNEIANTFFVKVRNNLKQNH-----HLTQLRDFLLPMLMNGQVSVR 393 >gi|188585422|ref|YP_001916967.1| N-6 DNA methylase [Natranaerobius thermophilus JW/NM-WN-LF] gi|179350109|gb|ACB84379.1| N-6 DNA methylase [Natranaerobius thermophilus JW/NM-WN-LF] Length = 621 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 64/189 (33%), Gaps = 8/189 (4%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET----RNMGLK 276 E + F+ + K K L N+ + K Sbjct: 417 DYENNTETVSLKSLGTFYRGLNTHAYKTQKSESPTHKILQLSNVENGEIFLENADSYNAK 476 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAW 335 + V PG+++ + + +E +++ ++ +P+ D ++ + Sbjct: 477 ELKNPSSYEVQPGDVIISSRGNSIKIAVIP--EEIENTLLSHNFIGFRPNDNVDPYFIKY 534 Query: 336 LMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 M S K G S LK +D++++ + ++EQ I+N + + ++ Sbjct: 535 FMESPIGIKYLSLYQKGSAVSVLKVKDIEKIYIPKVSLEEQKAISNKLRNADLTLQRKIQ 594 Query: 395 KIEQSIVLL 403 K ++ L Sbjct: 595 KAKEEHKQL 603 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 67/180 (37%), Gaps = 11/180 (6%) Query: 26 KVVPIKRFTKLNTG------RTSESGKDI-IYIGLEDVESGTGKYLPKDG-NSRQSDTST 77 + V +K G +T +S + L +VE+G D N+++ + Sbjct: 423 ETVSLKSLGTFYRGLNTHAYKTQKSESPTHKILQLSNVENGEIFLENADSYNAKELKNPS 482 Query: 78 VSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDV-T 134 G ++ G ++ A+I + + + S F+ +P D + + + + Sbjct: 483 SYEVQPGDVIISSRGNSIKIAVIPEEIENTLLSHNFIGFRPNDNVDPYFIKYFMESPIGI 542 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + +G+ +S K I I +P L EQ I K+ + + I + + Sbjct: 543 KYLSLYQKGSAVSVLKVKDIEKIYIPKVSLEEQKAISNKLRNADLTLQRKIQKAKEEHKQ 602 >gi|240014033|ref|ZP_04720946.1| hypothetical protein NgonD_05193 [Neisseria gonorrhoeae DGI18] gi|240016473|ref|ZP_04723013.1| hypothetical protein NgonFA_04764 [Neisseria gonorrhoeae FA6140] gi|240080595|ref|ZP_04725138.1| hypothetical protein NgonF_04672 [Neisseria gonorrhoeae FA19] gi|240118088|ref|ZP_04732150.1| hypothetical protein NgonPID_06461 [Neisseria gonorrhoeae PID1] gi|240121599|ref|ZP_04734561.1| hypothetical protein NgonPI_07513 [Neisseria gonorrhoeae PID24-1] gi|240123642|ref|ZP_04736598.1| hypothetical protein NgonP_06839 [Neisseria gonorrhoeae PID332] gi|268596720|ref|ZP_06130887.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19] gi|268550508|gb|EEZ45527.1| conserved hypothetical protein [Neisseria gonorrhoeae FA19] Length = 400 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 61/415 (14%), Positives = 127/415 (30%), Gaps = 38/415 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 H+K I+ N G + + ++ + + + F Sbjct: 2 NHFKKQQIQNIADFNPREQLAKGALAKSVPMAMLKEFQRQITGYEIKAFNGGAK----FR 57 Query: 83 KGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKD-VLPELLQGWLLSIDVT 134 G L K+ P L + ST+F+VL+ K+ PE L + +S D Sbjct: 58 NGDTLLAKITPCLENGKTAFVDILDDGEVAFGSTEFIVLRAKNETNPEFLYYFAISPDFR 117 Query: 135 QRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 +R EG + + + + +PIP Q I + +D I + Sbjct: 118 KRAIECMEGTSGRQRVNENALKTLELPIPEPQIQQSIAAVL----SALDKKIALNKQINT 173 Query: 194 LLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALV--TELNRKN 248 L+E + L Y + PD K SG + V E+ + + K Sbjct: 174 RLEEMAKTLYDYWFVQFDFPDANGKPYKSSGGDMVFDETLKREIPKGWGSIELQSCLAKI 233 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + ++ + + + I++P + F D R ++ Sbjct: 234 PNTTKILNKDIKDFGKYPVVDQSQDFICGFTNDEKSILNPQDAHIIFGD---HTRIVKLV 290 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + + + YL + + G + + +K ++ Sbjct: 291 NFQYARGADGTQVILSNNERMPNYLFYQI-----INQIDLSSYGYARHF--KFLKEFKII 343 Query: 369 VPPIKEQFDITNVINVETARI-DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +P + N ++ + L + L + R + + GQ+ +R Sbjct: 344 LPSKDISQKYNEIANTFFVKVRNNLKQNH-----HLTQLRDFLLPMLMNGQVSVR 393 >gi|42528243|ref|NP_973341.1| type I restriction-modification system, S subunit, truncation [Treponema denticola ATCC 35405] gi|41819513|gb|AAS13260.1| type I restriction-modification system, S subunit, truncation [Treponema denticola ATCC 35405] Length = 175 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 55/163 (33%), Gaps = 16/163 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP+ W + G+ VE TG+Y P G+ + I Sbjct: 23 IPESWTWCHFGDVADVINGKNQSQ-----------VEDDTGEY-PIYGSGGIMGYANDYI 70 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 K + G+ G + + T F + VLP L + S D T ++ Sbjct: 71 CPKNCTIIGRKGSINNPIFVEEKFWNVDTAFGLAPSSIVLPRYLFYFCKSFDFT----SL 126 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 T+ I +I P+PP Q I +KI +++ Sbjct: 127 DSSTTLPSLTKTSIRSILFPLPPFVAQQRILDKIDELFSQLEK 169 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 60/187 (32%), Gaps = 17/187 (9%) Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +++ + ++ +E +P+ W F + +N KN +E + Sbjct: 1 MLSCYYEKFGDVTETAVEMFSAIPESWTWCHFGDVADVINGKNQSQVEDDTGEYPIYGSG 60 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 Y I + N+ + + + +A+ Sbjct: 61 G----------IMGYANDYICPKNCTIIGRKGSINNPIFV----EEKFWNVDTAFGLAPS 106 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + YL + +S+D + S SL ++ + +PP Q I + I+ Sbjct: 107 SIVLPRYLFYFCKSFDFTSL---DSSTTLPSLTKTSIRSILFPLPPFVAQQRILDKIDEL 163 Query: 386 TARIDVL 392 ++++ + Sbjct: 164 FSQLEKI 170 >gi|319775885|ref|YP_004138373.1| Restriction modification enzyme [Haemophilus influenzae F3047] gi|329123734|ref|ZP_08252294.1| type I restriction-modification system [Haemophilus aegyptius ATCC 11116] gi|317450476|emb|CBY86693.1| Restriction modification enzyme [Haemophilus influenzae F3047] gi|327469933|gb|EGF15398.1| type I restriction-modification system [Haemophilus aegyptius ATCC 11116] Length = 138 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 46/138 (33%), Gaps = 5/138 (3%) Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + + G+I+ R + + + + + + + Sbjct: 3 DNFIIDERKLQKGDILINSTGEGTAGRVTLFGLDGDFVVDSHITIFRPNEKVLPKFAMYS 62 Query: 337 MRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + A G+ + L + + LVP + EQ I N IN I+ + + Sbjct: 63 LAHIGFKTIERMATGASGQIELNLSTIGNISFLVPDLNEQQSIVNQIN----EIETQISE 118 Query: 396 IEQSIVLLKERRSSFIAA 413 +E+ + ++ + + + Sbjct: 119 LEKVLENSRQEKKAVLDK 136 >gi|307067135|ref|YP_003876101.1| restriction endonuclease S subunit [Streptococcus pneumoniae AP200] gi|306408672|gb|ADM84099.1| Restriction endonuclease S subunit [Streptococcus pneumoniae AP200] Length = 297 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 43/109 (39%), Gaps = 5/109 (4%) Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIK 373 +I S V ++ TYL + + S + +G ++ + L + +PP+ Sbjct: 1 MIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLS 60 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 EQ I I ++D E + L KE + S + A+ G+ Sbjct: 61 EQQRIVEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKSILQYAMQGK 109 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 44/128 (34%), Gaps = 17/128 (13%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESG 59 YP YK IP+ W+ + G+T + +I ++ + D+ SG Sbjct: 168 YPIYK---------IPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISG 218 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + + + + I KG +L + K I D + + + P Sbjct: 219 YVTNTRESISKLALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYAN 277 Query: 120 LPELLQGW 127 +++ + Sbjct: 278 KENIIRDY 285 >gi|325973634|ref|YP_004250698.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|325990086|ref|YP_004249785.1| Restriction modification system DNA (specificity subunit), probably fragment [Mycoplasma suis KI3806] gi|323575171|emb|CBZ40833.1| Restriction modification system DNA (specificity subunit), probably fragment [Mycoplasma suis] gi|323652236|gb|ADX98318.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 251 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 61/192 (31%), Gaps = 15/192 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 + +G F + LN+ I + + P + Sbjct: 39 DKLGSFETGNPWNSKFDISHSLNKNKGIPFVDGGTISQSKLHILGDKFYDPKYLPSKIK- 97 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMRSY 340 I + F + + S+ + G I++ A + + + + Sbjct: 98 --IFPKDTVCFVCVGSYPGESSI----LKTNGCISNNIYAFNSCENISFPKFFKYSLDFS 151 Query: 341 DLCKVFYAMGSGLRQS--LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 D+ K + S L + + PP+ EQ+ I N ++ D L+E E+ Sbjct: 152 DIKKKIFISSSTTTPRKALSRHKLLSIKFPCPPLNEQYLIGNTLSA----YDELIENNER 207 Query: 399 SIVLLKERRSSF 410 I +L+ R+S Sbjct: 208 QIEVLQGIRTSI 219 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 56/198 (28%), Gaps = 13/198 (6%) Query: 22 PKHWKVVPIKRFTKLNTGR----------TSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 P W+ V + + TG + K I ++ + L Sbjct: 30 PPRWEWVTLDKLGSFETGNPWNSKFDISHSLNKNKGIPFVDGGTISQSKLHILGDKFYDP 89 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL--- 128 + S + IF K + + +G Y ++ I +G S + + Sbjct: 90 KYLPSKIKIFPKDTVCFVCVGSYPGESSILKTNGCISNNIYAFNSCENISFPKFFKYSLD 149 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + + + +I P PPL EQ LI + A I+ + Sbjct: 150 FSDIKKKIFISSSTTTPRKALSRHKLLSIKFPCPPLNEQYLIGNTLSAYDELIENNERQI 209 Query: 189 IRFIELLKEKKQALVSYI 206 + + + Sbjct: 210 EVLQGIRTSIFKEWFVNL 227 >gi|299822018|ref|ZP_07053905.1| type I restriction-modification system specificity subunit [Listeria grayi DSM 20601] gi|299816646|gb|EFI83883.1| type I restriction-modification system specificity subunit [Listeria grayi DSM 20601] Length = 203 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 44/129 (34%), Gaps = 5/129 (3%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 GE V D ND + V E+ + + ++ S +LM + + Sbjct: 79 HKGEYVLIAEDGANDLINYPVQYVNEKIWVNNHAHVIQGIDRVSDN-KFLMNAIKSINIE 137 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + G R L + +LPV +P EQ I ++D + I L Sbjct: 138 PFLVGGGRAKLTSNTLMKLPVKIPTFLEQKKIGTF----FQQLDNTITLHHSKIEKLTTL 193 Query: 407 RSSFIAAAV 415 + +++ Sbjct: 194 KKAYLKNLF 202 >gi|229826014|ref|ZP_04452083.1| hypothetical protein GCWU000182_01378 [Abiotrophia defectiva ATCC 49176] gi|229789756|gb|EEP25870.1| hypothetical protein GCWU000182_01378 [Abiotrophia defectiva ATCC 49176] Length = 345 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 41/397 (10%), Positives = 102/397 (25%), Gaps = 62/397 (15%) Query: 27 VVPIKRFTKLN--------TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +V +++ K + E +D + E G Y + Sbjct: 2 IVKLEKVCKRIYAGGDVPKDRYSKEKTEDYKVPIFANAEKDEGLYGYTYEAREKEL---- 57 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I G I + V+ + + E + L + + Sbjct: 58 ------SITVAARGTIGYTVIRREPFFPVVRLITVVPDLEKVSERYLFYAL-----KNCK 106 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G ++ I + + + EQ I +++ I E + EL Sbjct: 107 PQSSGTSIPQLTVPDIKKNTLNLLDIVEQESIADRLDKLNGIIKLRTEEISKLDEL---- 162 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 +K +E G V + ++ + +N+L Sbjct: 163 ------------------IKARFVEMFGDVIRNDKLWKTDSW-------------NNLLR 191 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + G + +E+ + + + K ++ +M Sbjct: 192 IVNGKNQRAIESNDGEYVICGSGGIMGKARDYLTKENSVIVGRKGNINKPILMREKYWNV 251 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + +L + SL D+ + + VP + Q Sbjct: 252 DTAFGIEPNNNHICVEYLYMFCLFFDFNRLNKAVTIPSLTKADLLNIEMPVPDLNIQKRF 311 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ ++D +++++ + S + Sbjct: 312 ATFVH----QVDKSKVAVQKALDETQTLFDSLMQKYF 344 >gi|224538861|ref|ZP_03679400.1| hypothetical protein BACCELL_03757 [Bacteroides cellulosilyticus DSM 14838] gi|224519536|gb|EEF88641.1| hypothetical protein BACCELL_03757 [Bacteroides cellulosilyticus DSM 14838] Length = 186 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 57/163 (34%), Gaps = 9/163 (5%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQND 301 + N + N+I N + + + V G+++ Sbjct: 23 GGKESYLGGNTSLIRSQNVIDFGFLYNGLALINDEQAHGLDNVTVMTGDVLLNITGDSVA 82 Query: 302 KRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + + ++ + A + + + S Y+ + ++ + + G R +L + Sbjct: 83 RCCKVPSNILPARVNQHVAIIRGDNNIVISDYILYYLQYKKPYLLSLSQGGATRNALTKK 142 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 ++ + + +P I EQ I +++ + ID +E + L Sbjct: 143 MIEDIKIPLPSISEQRHIIDLL----SSIDNKIELNRRINDNL 181 >gi|321310224|ref|YP_004192553.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802068|emb|CBY92714.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 206 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 61/180 (33%), Gaps = 5/180 (2%) Query: 230 DHWEVKPFFALVTELNRKN-TKLIESNILSLSYGNIIQKL--ETRNMGLKPESYETYQIV 286 + + + K T + L GNII + + E + V Sbjct: 11 KDVKHLKLKDVCKIIAGKRFTPYTSEGMPVLRSGNIIDGYVVDEDFVYCDREKHPRVDTV 70 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G+I+ + + + + YL + S Sbjct: 71 KYGDILIVRFGSAG-VVGMNLINREFFLDANLSKFSPDSKILHKQYLYHFLLSRQEEIKG 129 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +A G + +++ D++ L + VP +++Q I + ++ L ++++ ++L KE+ Sbjct: 130 WARG-AVIPAIRKSDLEELMIPVPSLEQQQTIASKLDKLVELKRELKRELKRELILRKEQ 188 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 56/181 (30%), Gaps = 4/181 (2%) Query: 29 PIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +K K+ G+ T + + + + ++ G + V G I Sbjct: 17 KLKDVCKIIAGKRFTPYTSEGMPVLRSGNIIDGYV-VDEDFVYCDREKHPRVDTVKYGDI 75 Query: 87 LYGKLGPYLRKAI-IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 L + G + + + + P + + + + I+ GA Sbjct: 76 LIVRFGSAGVVGMNLINREFFLDANLSKFSPDSKILHKQYLYHFLLSRQEEIKGWARGAV 135 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + + +P+P L +Q I K+ L E R + L KE+ Sbjct: 136 IPAIRKSDLEELMIPVPSLEQQQTIASKLDKLVELKRELKRELKRELILRKEQHSYYRKQ 195 Query: 206 I 206 I Sbjct: 196 I 196 >gi|5712710|gb|AAD47619.1| HsdS variable domain [Lactococcus lactis] Length = 170 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 53/165 (32%), Gaps = 8/165 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV----ESGTGKYLPKDGNSRQS---DTS 76 W+ + + G T + + G D E G Y+ K + S Sbjct: 1 DWEERKLGELANIVGGGTPSTSNPEYWDGDIDWYAPAEIGEQSYVSKSKKTITELGLKNS 60 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + I G +L+ AI+A + F + P + + + ++ + Sbjct: 61 SARILPVGTVLFTSRAGIGNTAILAKE-ATTNQGFQSIVPDQNKLDSYFIFSRTNELKRY 119 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 E G+T K + + + +P L+EQ I I Sbjct: 120 GEVTGAGSTFVEVSGKQMSKMSIMVPELSEQQKIGSFFKQLDETI 164 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 16/153 (10%), Positives = 51/153 (33%), Gaps = 6/153 (3%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N + + +I + I + + K + + + + + + Sbjct: 23 NPEYWDGDIDWYAPA-EIGEQSYVSKSKKTITELGLKNSSARILPVGTVLFTSRAGIGNT 81 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLP 366 A + + + ++ P R+ +L + G+G + + + ++ Sbjct: 82 AILAKEATTNQGFQSIVPDQNKLDSYFIFSRTNELKRYGEVTGAGSTFVEVSGKQMSKMS 141 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 ++VP + EQ I + ++D + ++ Sbjct: 142 IMVPELSEQQKIGSF----FKQLDETITLHQRK 170 >gi|300866160|ref|ZP_07110879.1| hypothetical protein OSCI_2700005 [Oscillatoria sp. PCC 6506] gi|300335839|emb|CBN56039.1| hypothetical protein OSCI_2700005 [Oscillatoria sp. PCC 6506] Length = 238 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 77/230 (33%), Gaps = 25/230 (10%) Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 SY + L P+ + K G P + + L K ++ + Sbjct: 10 SYWLCGTLEPNPEGKLITYVDSGGTPSTKNDSYWDGEIPWLTPKEITGFTDSVYVSNTER 69 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I +L N K ++ G ++ +G + Sbjct: 70 TITQLGLNNSAAK--------LLPTGTVMLTKRAPVGAVAINAIPMATNQGFLN----FQ 117 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I YLA+ R+ + A GS L D+ + VPP++EQ I +VI+ Sbjct: 118 CGSKIRPLYLAYWFRTNRVYLDMVANGS-TYPELYKSDLFEFQIAVPPLEEQDAILSVIS 176 Query: 384 ------------VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++A + K+++ L++ R + + ++G +D+ Sbjct: 177 AVQYVSLLGLPLEQSASTPESMIKMQEQNRRLRDIRDAILPNLLSGNLDI 226 >gi|317012655|gb|ADU83263.1| putative type I restriction-modification enzyme specificity subunit S [Helicobacter pylori Lithuania75] Length = 129 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 16/117 (13%), Positives = 47/117 (40%), Gaps = 5/117 (4%) Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKF 359 + + + + + + + ++L +R Y+ K + +G R ++ Sbjct: 3 CVVTQKIEKDIYLNSFCFGFRFFDKNLFNPSFLKHFLRDYNFRKNISKVANGVTRFNVSK 62 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + + ++ + +PP++ Q +I +++ +A L+ I I K+ R + Sbjct: 63 QLLSKITIPIPPLEIQQEIVKILDQFSALTTDLLAGIPAEIKARKKQYEYYREKLLT 119 >gi|260171384|ref|ZP_05757796.1| putative type I restriction endonuclease specificity subunit, partial [Bacteroides sp. D2] gi|315919697|ref|ZP_07915937.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313693572|gb|EFS30407.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 400 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 65/197 (32%), Gaps = 11/197 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQI 285 + D + E + + ++ + +I + + +K + + +++ Sbjct: 17 ELLDFYSTNSLCWEQLEYETNTVQNLHYGLIHVGLPTMIDLSKDKLPNIKEGNMPKNFEL 76 Query: 286 VDPGEIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA---WLMRSY 340 G+I F D +++ + E+ ++ + D T + + S Sbjct: 77 CKNGDIAFADASEDTNEVAKAVEFYDLDEKDVVCGLHTIHGRDNADRTVIGFKGYAFSSD 136 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + G S+ ++ + +P +EQ I I+ + + Sbjct: 137 TFHHQIRRIAQGTKVFSISTKNFSECYIGIPSKEEQTKIV----TLLRLINERIATQNKI 192 Query: 400 IVLLKERRSSFIAAAVT 416 I LK+ +S+ A + Sbjct: 193 IEDLKKLKSAISAKLFS 209 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 46/415 (11%), Positives = 119/415 (28%), Gaps = 49/415 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 + W++ + + + + ++ GL V T L KD + Sbjct: 8 EEWEIYKVSELLDFYSTNSLCWEQLEYETNTVQNLHYGLIHVGLPTMIDLSKDKLPNIKE 67 Query: 75 T---STVSIFAKGQILYGKLGPYLRKAI-------IADFDGICSTQFL--VLQPKDVLPE 122 + G I + + + + D +C + + Sbjct: 68 GNMPKNFELCKNGDIAFADASEDTNEVAKAVEFYDLDEKDVVCGLHTIHGRDNADRTVIG 127 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 S +I I +G + K + IP EQ I + I+ Sbjct: 128 FKGYAFSSDTFHHQIRRIAQGTKVFSISTKNFSECYIGIPSKEEQTKIVTLL----RLIN 183 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I + + IE LK+ K A+ + + ++ ++ I+ K F+ Sbjct: 184 ERIATQNKIIEDLKKLKSAISAKLFSQEPIVWNRLNSYFIKGKAGGTPTSTNKKFYDGDI 243 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 N + + + +I Q + IV ++ Sbjct: 244 PFLSINDITKQGKYIWQTENHISQNGL---------DNSSAWIVPKHSLIMSMYASVGLV 294 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + + + + + Y + + K +G + ++ + V Sbjct: 295 TINQVPIATSQAMFS-MLLKDESLLDYLYYYLSYFKRRHIHKYLE---TGTQSNINADIV 350 Query: 363 KRLPVLVPPIKEQF--DITNVINVETARIDV--LVEKIEQSIVLLKERRSSFIAA 413 +++P + + I +++ ++D L+ + +++ ++ Sbjct: 351 CG--IMIPDYEYRHNIKIASMLQSIDVKLDNESLI------LNQYNQQKQYLLSQ 397 >gi|313113288|ref|ZP_07798894.1| type I restriction modification DNA specificity domain protein [Faecalibacterium cf. prausnitzii KLE1255] gi|310624398|gb|EFQ07747.1| type I restriction modification DNA specificity domain protein [Faecalibacterium cf. prausnitzii KLE1255] Length = 207 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 53/140 (37%), Gaps = 5/140 (3%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 G ++Q+ + + ++ Y ++ G +R D ++++GII+ Sbjct: 58 QQGIVLQEDYFADRQVTTDNNVGYYVLPKGYFTYRSRS-DTDVFVFNRNNIVDKGIISYY 116 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y P DS +L + ++ A ++ L K + V VP EQ I Sbjct: 117 YPVFAPKSCDSNFLLRRLNHGIKKQLSMAAEGTGQKVLAHAKFKNMVVDVPSQSEQEKIG 176 Query: 380 NVINVETARIDVLVEKIEQS 399 ++ +D L+ ++ Sbjct: 177 TILE----ELDTLITLHQRE 192 >gi|307243985|ref|ZP_07526106.1| type I restriction modification DNA specificity domain protein [Peptostreptococcus stomatis DSM 17678] gi|306492635|gb|EFM64667.1| type I restriction modification DNA specificity domain protein [Peptostreptococcus stomatis DSM 17678] Length = 347 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 59/398 (14%), Positives = 131/398 (32%), Gaps = 70/398 (17%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + ++ + R +E ++ + S +Y+P N+ +D + + KGQ Sbjct: 6 KRLGQYIRQVDVRNTEGKEENLL-----GVSVQKRYIPSIANTVGTDFTKYKVVKKGQFT 60 Query: 88 Y----GKLGPYLRKAIIADFD-GICSTQFLVL---QPKDVLPELLQGWLLSIDVTQRIEA 139 Y + G + A++ D+D G+ S + V K ++P+ L W + + Sbjct: 61 YIPDTSRRGDKIGIALLEDYDEGLVSNVYTVFEVIDKKQLIPQYLMLWFSRPEFDRFARF 120 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+ DW + N+ +P+P +Q+ I I I + + + L+E+ Sbjct: 121 KSHGSVREVMDWDEMCNVELPVPTYEKQLEIVN----SYKAIMERIDLKQKINDNLEEQV 176 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 AL + +P+ + G P E+ + Sbjct: 177 YALYKQLTQSH-DPNTVFESIATVQSGKRPVSNEIGT-------------------YPLV 216 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 G I+ + N D I+ + + + + Sbjct: 217 GAGGIMNYINDYN-------------FDEQIIITGRVGTHG-----VIQRFFSKCWASDN 258 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +K + + ++S D + + + D+K LP+ +P I E Sbjct: 259 TLVIKSNYY--EFSYHFLKSVDWDLLNR---GSTQPLVTQTDIKNLPLYLPDISE----- 308 Query: 380 NVINVETARIDVLVEKIE---QSIVLLKERRSSFIAAA 414 + A + +++ + I L + + I + Sbjct: 309 --LTAFEATAEKIMKHQRVLLKEIESLNQLKDMIITSL 344 >gi|94266712|ref|ZP_01290384.1| hypothetical protein MldDRAFT_4054 [delta proteobacterium MLMS-1] gi|93452632|gb|EAT03198.1| hypothetical protein MldDRAFT_4054 [delta proteobacterium MLMS-1] Length = 122 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 14/85 (16%), Positives = 33/85 (38%), Gaps = 2/85 (2%) Query: 336 LMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++ S L A + ++ ++K + +PP++ Q I ++ ++I L Sbjct: 2 ILNSPTLRAKIEREARSTSGVHNINSSEIKAITFDLPPVEVQAKIIERVDEHMSKIGHLE 61 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ 418 + + R S + A G+ Sbjct: 62 AWCQTELTRSAALRQSILKDAFAGR 86 >gi|135208|sp|P19705|T1SE_ECOLX RecName: Full=Type-1 restriction enzyme EcoEI specificity protein; Short=S.EcoEI; AltName: Full=Type I restriction enzyme EcoEI specificity protein; Short=S protein gi|146400|gb|AAA23986.1| EcoE type I restriction modification enzyme S subunit [Escherichia coli] Length = 594 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 65/205 (31%), Gaps = 9/205 (4%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE---TRNM 273 ++ S E +P+ WE + T + E I + I + + + + Sbjct: 90 LRISEDEKPFELPEGWEWITLSEIATINPKIEVTDDEQEISFVPMPCISTRFDGAHDQEI 149 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 E + Y G+I I N K + G+ T+ +P + Sbjct: 150 KKWGEVKKGYTHFADGDIALAKITPCFENSKAVIFKGLKGGVGVGTTELHVARPISSELN 209 Query: 332 YLAWLMR----SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 L+ Y GS ++ + + P+ PP EQ I + Sbjct: 210 LQYILLNIKSPHYLSMGESMMTGSAGQKRVPRSFFENYPIPFPPNTEQARIVGTFSKLMF 269 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIA 412 D L ++ S+ ++ + +A Sbjct: 270 LCDQLEQQSLTSLDAHQQLVETLLA 294 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 46/478 (9%), Positives = 110/478 (23%), Gaps = 98/478 (20%) Query: 20 AIPKHWKVVPIKRFT---------------------------------------KLNTGR 40 +P+ W+ + + ++ G Sbjct: 100 ELPEGWEWITLSEIATINPKIEVTDDEQEISFVPMPCISTRFDGAHDQEIKKWGEVKKGY 159 Query: 41 TSESGKDIIYIGLE-DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ-------ILYGKLG 92 T + DI + E+ T+ + + IL Sbjct: 160 THFADGDIALAKITPCFENSKAVIFKGLKGGVGVGTTELHVARPISSELNLQYILLNIKS 219 Query: 93 PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID----VTQRIEAICEGATMSH 148 P+ + G + + + P ++ + + S Sbjct: 220 PHYLSMGESMMTGSAGQKRVPRSFFENYPIPFPPNTEQARIVGTFSKLMFLCDQLEQQSL 279 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + + E++ RI + KQ ++ V Sbjct: 280 TSLDAHQQLVETLLATLTDSQNAEELAENWARISQYFDTLFTTEASIDALKQTILQLAVM 339 Query: 209 KGLNPDVKMKD-------------------------------SGIEWVGLVPDHWEVKPF 237 L + S E +P WE Sbjct: 340 GKLVSQDPNDEPASELLKRVEQEKVQLVKEGKIKKQKPLPPVSDDEKPFELPIGWEWCRI 399 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET----------YQIVD 287 ++ ++ + + K E V Sbjct: 400 GEIIANMDAGWSPACSPEPSPNEDIWGVLKTTAVQSLEYREQENKTLPNSKLPRPQYEVH 459 Query: 288 PGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSYDLCK 344 G+I+ +N + + +I+ + I + Y++ + Sbjct: 460 DGDILVTRAGPKNRVGVSCLVEKTRSKLMISDKIIRFHLISDDISAKYISLCLNRGVTAD 519 Query: 345 VFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 A SG+ + ++ E+++ P+ +PP Q + + I D L +++ + Sbjct: 520 YLEASKSGMAESQMNISQENLRSAPIALPPTAIQLKVISTIEDFFKVCDQLKSRLQSA 577 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 52/206 (25%), Gaps = 17/206 (8%) Query: 20 AIPKHWKVVPIKR-FTKLNTGRT------SESGKDII-YIGLEDVESGTGKYLPKDGNSR 71 +P W+ I ++ G + +DI + V+S + Sbjct: 389 ELPIGWEWCRIGEIIANMDAGWSPACSPEPSPNEDIWGVLKTTAVQSLEYREQENKTLPN 448 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQG 126 G IL + GP R + + S + + Sbjct: 449 SKLPRPQYEVHDGDILVTRAGPKNRVGVSCLVEKTRSKLMISDKIIRFHLISDDISAKYI 508 Query: 127 WLLSIDVTQRIE----AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L + + + + + P+ +PP A Q+ + I D Sbjct: 509 SLCLNRGVTADYLEASKSGMAESQMNISQENLRSAPIALPPTAIQLKVISTIEDFFKVCD 568 Query: 183 TLITERIRFIELLKEKKQALVSYIVT 208 L + + AL + Sbjct: 569 QLKSRLQSAQQTQLHLADALTDAALN 594 >gi|238923274|ref|YP_002936789.1| putative restriction and modification system specificity protein [Eubacterium rectale ATCC 33656] gi|238874948|gb|ACR74655.1| putative restriction and modification system specificity protein [Eubacterium rectale ATCC 33656] Length = 173 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 17/160 (10%), Positives = 51/160 (31%), Gaps = 12/160 (7%) Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVD----PGEIVFRFIDLQNDKRS--LRS 307 ++ NI L+ Q+++ G+++F ++ + Sbjct: 10 HGFPFINLQNIFGNNVIDVNKLELADATEKQLLEYSLLKGDVLFVRSSVKLEGVGEAALV 69 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRL 365 + +E + + + + ++ + + A + +++ ++ L Sbjct: 70 PETLENTTYSGFIIRFRDEYGLNNDFKKYIFGTQKVRNQIMAQATNSANKNISQGVLENL 129 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 VP EQ I + +D L+ ++ K+ Sbjct: 130 TFEVPSFDEQAKIGEH----FSNLDHLITLHQRQTDFYKK 165 >gi|332083323|gb|EGI88554.1| type I restriction enzyme EcoAI specificity [Shigella boydii 5216-82] gi|332083684|gb|EGI88902.1| type I restriction enzyme EcoAI specificity [Shigella dysenteriae 155-74] Length = 388 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 70/207 (33%), Gaps = 12/207 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDV 56 +K K P+ S + +P+ W+ V + ++ GR + + + + ++ Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVRVADLMEVINGRAYKKHEMLQTGTPLLRVGNL 140 Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 + + G ++Y + + + + Sbjct: 141 ------FTSNEWYYSDLQLDENKYINNGDLIYAWSASFGPFIWTGEKVIYHYHIWKLNLF 194 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + +T +I++ G M H + + + +PP+ EQ I KI Sbjct: 195 AEEYSNKYFIHDFLLSITDKIKSQGNGIAMLHMTKEKMEQQIIALPPINEQQQIVRKIRE 254 Query: 177 ETVRIDTLITERIRFIELLKEKKQALV 203 TV D L + + ++ ++ + L+ Sbjct: 255 LTVLCDQLEQQSLTSLDAHQQLVETLL 281 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 63/193 (32%), Gaps = 5/193 (2%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE L+ +N + K E + + Sbjct: 93 SEEEKPFELPEGWEWVRVADLMEVINGRAYKKHEMLQTGTPLLRVGNLFTSNEWYYSDLQ 152 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + + ++ G++++ + + I + ++ + S Sbjct: 153 LDENKYINNGDLIYAWSASFGPFIWTGEKVIYHYHIW--KLNLFAEEYSNKYFIHDFLLS 210 Query: 340 YDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + G+G + E +++ + +PPI EQ I I T D L ++ Sbjct: 211 --ITDKIKSQGNGIAMLHMTKEKMEQQIIALPPINEQQQIVRKIRELTVLCDQLEQQSLT 268 Query: 399 SIVLLKERRSSFI 411 S+ ++ + + Sbjct: 269 SLDAHQQLVETLL 281 >gi|47459120|ref|YP_015982.1| restriction-modification enzyme mpuUVIII s subunit [Mycoplasma mobile 163K] gi|47458449|gb|AAT27771.1| restriction-modification enzyme mpuUVIII s subunit [Mycoplasma mobile 163K] Length = 380 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 50/394 (12%), Positives = 104/394 (26%), Gaps = 43/394 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 +V+ + ++N GR+ S KDI D Y K N+ + Sbjct: 17 EVIKTDKIFEINKGRSKISKKDI-----SDNHGIYPVYSSKTTNNGILGWINRYDYNDEL 71 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAICEG 143 I G + + VL+ K+ + L + Sbjct: 72 ITLTSEGYAGTAFYHINEKFNVTGDSFVLKVKNKDITNTKFMFYFLQKEAKNPSNLNLLN 131 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + I +P+PP+ Q I + + + L E + + + L Sbjct: 132 NFSGTLTKSNLSKIEIPLPPIQYQDEIVRILNNFSEILLDLKKEFELRKKQYEYYRNKLF 191 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL--NRKNTKLIESNILSLSY 261 ++ + F + K I + Sbjct: 192 --------------------LFSEQTEYVSIDKIFEINKGKSKISKKDISDNPGIYPVYS 231 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + + YE I ++ +T Sbjct: 232 SKTTNNGILGWINRYEDQYEDELI----------TITVGGYAGTVFYHDNKKINVTEGSW 281 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +K ++ + ++ + ++ Y S LK ++++ + +P I+ Q I Sbjct: 282 ILKAFDKNNVNIKFVFYALEIIAKKYVTKSSTMLELKKSSIEKIKIPLPSIEIQNKIVKN 341 Query: 382 INVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 +N I E + I L K+ R + Sbjct: 342 LNFFEILIKDFKEGLPSEINLRKKQYEYYRDKLL 375 >gi|330997667|ref|ZP_08321512.1| conserved domain protein [Paraprevotella xylaniphila YIT 11841] gi|329570195|gb|EGG51935.1| conserved domain protein [Paraprevotella xylaniphila YIT 11841] Length = 464 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 49/425 (11%), Positives = 131/425 (30%), Gaps = 51/425 (12%) Query: 30 IKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + + +N + + +++ +I +E ++ G R + + F +G + Sbjct: 37 LGQLVYINPPVSFDGISDNEEMSFIPMESIDEHNGTIKTLK-TIRFREIKGFTKFQEGDL 95 Query: 87 LYGKLGPYLRK------AIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSIDVTQRI 137 L+ K+ P ++ + + G ST+F VL+PK + + Sbjct: 96 LWAKITPCMQNGKSAIACKLKNGFGCGSTEFFVLRPKSDNILIEYIHYILRDKRVLKSAQ 155 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE------------------KIIAETV 179 + A + ++ +P+ P+ Q I + + + Sbjct: 156 NSFGGSAGQQRVSSSYLKSVKIPLLPIDIQKQIIKQYIQAQEAKQKKDEEAKSLLDSIDS 215 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK-------MKDSGIEWVGLVPDHW 232 + + + ++ + +S ++ +P K + Sbjct: 216 FVLKNMGVALPSKDIYAKVNVVSLSQLIGNRYDPYYHNEYFEEAFKHLKETSNYKLVRLS 275 Query: 233 EVKPFFALVTELNRKNTKLI--ESNILSLSYGNIIQKLETRNMGLKP------ESYETYQ 284 ++ E + + GNI E L + Sbjct: 276 DITVLITSGITPKSGGDDYTDSEHGVAFIRSGNIDIMGEVDFDNLLYIRRNVHNTRMKSS 335 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 V G+I+ + + + + E I + + G + Y+ +++S Sbjct: 336 KVQNGDIMIAIVGATIGQVGIYHSSR-EANINQAIALVRLKDGYNPEYIKEVIKSSIGQL 394 Query: 345 VFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + R ++ E++ + + VP I+ Q ++ + + L ++ + LL Sbjct: 395 NLDRLKRPVARANINLEEISSMLIPVPEIEIQNEMVKSVVSIRQQAKQL---QKEGVKLL 451 Query: 404 KERRS 408 + + Sbjct: 452 ESTKQ 456 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 63/189 (33%), Gaps = 16/189 (8%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 P + + + E + + + + ++ + + Sbjct: 32 YPSFDLGQLVYINPPVSFDGISDNEEMSFIPMESIDEHNGTIKTLKTIRFREIKGFTKFQ 91 Query: 288 PGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS--TYLAWLMRSYDLC 343 G++++ I +QN K ++ G ++ + ++P + Y+ +++R + Sbjct: 92 EGDLLWAKITPCMQNGKSAIACKLKNGFGCGSTEFFVLRPKSDNILIEYIHYILRDKRVL 151 Query: 344 K--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 K GS +Q + +K + + + PI Q I I ++ E Sbjct: 152 KSAQNSFGGSAGQQRVSSSYLKSVKIPLLPIDIQKQI----------IKQYIQAQEAKQK 201 Query: 402 LLKERRSSF 410 +E +S Sbjct: 202 KDEEAKSLL 210 >gi|260437998|ref|ZP_05791814.1| type I restriction-modification system, S subunit [Butyrivibrio crossotus DSM 2876] gi|292809595|gb|EFF68800.1| type I restriction-modification system, S subunit [Butyrivibrio crossotus DSM 2876] Length = 272 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 62/195 (31%), Gaps = 14/195 (7%) Query: 217 MKDSGIEWVGLVPDHWEVKPF------FALVTELNRKNTKLIESNILSLSYGNI--IQKL 268 K + +P +W T +S I L NI KL Sbjct: 78 FKGDDNSYYQDLPSNWINIRLSAISEIITKGTTPRGGKIAYRQSGIGFLRAENIAGYDKL 137 Query: 269 ETRNMGLKPES----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + N+ E Y I+ +I+ + + A + + Sbjct: 138 DLSNLNYVDEESHKNYLKRSILKENDILITIAGTLGRTAIVPQHALPLNSNQAVAIVRLV 197 Query: 325 PHG-IDSTYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + I+ YLA+ + S + + +L +++ + +PP+ EQ I I Sbjct: 198 NNKLINVKYLAYTLNSPIIKSDLLAKSVDMAIPNLSLDNIAECNISLPPLAEQKRIVEAI 257 Query: 383 NVETARIDVLVEKIE 397 A +D + ++E Sbjct: 258 EKIFATLDDIANQVE 272 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 63/180 (35%), Gaps = 17/180 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P +W + + +++ T T+ G I ++ E++ +G K + N Sbjct: 88 DLPSNWINIRLSAISEIITKGTTPRGGKIAYRQSGIGFLRAENI-AGYDKLDLSNLNYVD 146 Query: 73 SDTSTVS----IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-----EL 123 ++ I + IL G R AI+ ++ V + V + Sbjct: 147 EESHKNYLKRSILKENDILITIAGTLGRTAIVPQHALPLNSNQAVAIVRLVNNKLINVKY 206 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L L S + + A + + I + +PPLAEQ I E I +D Sbjct: 207 LAYTLNSPIIKSDLLAKSVDMAIPNLSLDNIAECNISLPPLAEQKRIVEAIEKIFATLDD 266 >gi|229547243|ref|ZP_04435968.1| type I restriction enzyme, specificity subunit [Enterococcus faecalis TX1322] gi|229307640|gb|EEN73627.1| type I restriction enzyme, specificity subunit [Enterococcus faecalis TX1322] Length = 222 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 65/209 (31%), Gaps = 11/209 (5%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 V P ++ D EW + + + K E+ + + ++ + Sbjct: 18 VKDERAPKLRFADFEGEWEQCKLEDYATYRRGSFPQPYGNKKWYDGENAMPFVQVIDVTE 77 Query: 267 KLETRNMGLKPESY---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 +L + S V G++V + ++R ++ Sbjct: 78 QLSLVKDTKQKISKLAQSKSVFVSAGKVVVTLQGSIGRVAITQYNSYIDRTLL---VFES 134 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 D + A+ ++ G +++ E + V P +EQ N + Sbjct: 135 YEKETDEYFWAYTIQQ-KFEIEKRKAPGGTIKTITKEALSSFEVNFPEYEEQQKNGNFL- 192 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIA 412 +D ++ ++ + LK + S++ Sbjct: 193 ---KNLDNILTLDQKKLDQLKSLKKSYLQ 218 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/189 (10%), Positives = 50/189 (26%), Gaps = 10/189 (5%) Query: 24 HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ ++ + G + + ++ + DV + Sbjct: 34 EWEQCKLEDYATYRRGSFPQPYGNKKWYDGENAMPFVQVIDVTEQLSLVKDTKQKISKLA 93 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S + G+++ G R I ++ LV + + + Sbjct: 94 QSKSVFVSAGKVVVTLQGSIGR-VAITQYNSYIDRTLLVFESYEKETDEYFWAYTIQQKF 152 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G T+ + + + + P EQ + + + + L Sbjct: 153 EIEKRKAPGGTIKTITKEALSSFEVNFPEYEEQQKNGNFLKNLDNILTLDQKKLDQLKSL 212 Query: 195 LKEKKQALV 203 K Q + Sbjct: 213 KKSYLQNMF 221 >gi|296270472|ref|YP_003653104.1| restriction modification system DNA specificity domain-containing protein [Thermobispora bispora DSM 43833] gi|296093259|gb|ADG89211.1| restriction modification system DNA specificity domain protein [Thermobispora bispora DSM 43833] Length = 401 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 63/404 (15%), Positives = 137/404 (33%), Gaps = 36/404 (8%) Query: 34 TKLNTGRTSESGKDIIY-IGLEDVESGTGKYL--PKDGNSRQSDTSTVSIFAKGQILYGK 90 + T + G + Y +G +++G + + + ++ K I+ + Sbjct: 17 CEHKTAPAAPRGTEYGYSVGTPCIKNGRLLLDAAKRVDRATYEKWTARAVPQKDDIILTR 76 Query: 91 LGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 P A++ +C T L P + P L LLS + +R+ EG+T+ Sbjct: 77 EAPVGEAALLDGNSRVCLGQRTVLLRPDPLKIDPRFLHYLLLSPALQERMRIRAEGSTVP 136 Query: 148 HADWKGIGNIP-MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 H + I ++ +PPL EQ + + A +I + + LL+ + + L Sbjct: 137 HLNVGDIRSLQLGELPPLREQHVTAAILGALDDKIAVNERIAVTYESLLRLRFEELR--- 193 Query: 207 VTKGLNPDVKMKDSG-IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 V P + S IE+ VP + ++ + ++ E + G Sbjct: 194 VDVEPAPGEGVAVSELIEFNPSVPAPRTTDAVYLDMSSVPTSTARVREWSRREPKSGTRF 253 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 +T + P + + E GI ++ ++ ++ Sbjct: 254 ANNDTVMARITP------------------CLENGKTAFIDFMEDGETGIGSTEFIVMRA 295 Query: 326 HGIDSTYLAWLM-RSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 +L + + RS +GS RQ + + V +P Sbjct: 296 RAGVPVHLPYFLARSPRFRSYAIQNMVGSSGRQRVSASQLAGFTVRLPDPTSMAAFGEAA 355 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + A + L + + L + R + + ++G++ +R + Sbjct: 356 SAAFAHMKSLDAESKN----LAQLRDTLLPKLISGELRVRDAEK 395 >gi|260495160|ref|ZP_05815288.1| LOW QUALITY PROTEIN: type I restriction enzyme specificity protein [Fusobacterium sp. 3_1_33] gi|260197217|gb|EEW94736.1| LOW QUALITY PROTEIN: type I restriction enzyme specificity protein [Fusobacterium sp. 3_1_33] Length = 200 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 54/143 (37%), Gaps = 7/143 (4%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I + G +TY ++ R + N + ++ T Y + Sbjct: 51 NIGNIPVYGSGGIINYIDTYIYDKESVLIPRKGSIGNLFYVDKPFWTVD----TIFYTVI 106 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + Y+ + + +L K+ +G SL + ++ + +PP++EQ I ++++ Sbjct: 107 DKDIVIPKYVYYYLSKVNLEKL---NTAGGVPSLTQTVLNKILIPLPPLEEQQRIVDILD 163 Query: 384 VETARIDVLVEKIEQSIVLLKER 406 + + E + I +++ Sbjct: 164 RFDKLCNDISEGLPAEIEARQKQ 186 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 20/154 (12%), Positives = 44/154 (28%), Gaps = 17/154 (11%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + K+ G + +P G+ + I+ K +L Sbjct: 35 LGEILKIKNGSDYKK--------------FNIGNIPVYGSGGIINYIDTYIYDKESVLIP 80 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + G + T F + K ++ ++ +E + + Sbjct: 81 RKGSIGNLFYVDKPFWTVDTIFYTVIDK---DIVIPKYVYYYLSKVNLEKLNTAGGVPSL 137 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + I +P+PPL EQ I + + + Sbjct: 138 TQTVLNKILIPLPPLEEQQRIVDILDRFDKLCND 171 >gi|302878447|ref|YP_003847011.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] gi|302581236|gb|ADL55247.1| restriction modification system DNA specificity domain [Gallionella capsiferriformans ES-2] Length = 410 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 48/406 (11%), Positives = 120/406 (29%), Gaps = 36/406 (8%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG----Q 85 + + + I + + + G+ + + R ++ +S K Sbjct: 8 LSDLVTYLNRGVAPKYVETGGIRVYNQKCIRGQRVSDGPSRRTQASARLSQVDKELRLFD 67 Query: 86 ILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +L G + + D + +++P + L + IE + Sbjct: 68 VLINSTGVGTLGRVGQIFGLDEPATADSHLTIVRPDPQKVDPLFLGYVLKAYEPEIERLG 127 Query: 142 EGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 EG+T + +G + +P+ Q + ID + R E + + Sbjct: 128 EGSTGQTELSRAKLGELEIPLISRDAQKSASAFL----YAIDKRLNLLRRISEDIDVFAR 183 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L G + D + L + N+ S+ Sbjct: 184 TLFREWFGAGNSEDWPTARLDQHL-------TAHRGLSYKGAGLCESGEGVPMHNLNSVY 236 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR-----SLRSAQVMERGI 315 G + + ++ ++ PG+I+ + ++ R ++ + + GI Sbjct: 237 EGGGYKYPGIK---YYKGEFKERHVLKPGDIIVTNTEQGHEHRLIGFPAVVPSIFGDNGI 293 Query: 316 ITSAYMAVKPHGIDS---TYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPP 371 + + P ++ +L+ + + +G + L ++ +PP Sbjct: 294 FSQHIYRIVPLDSSYLGREFIYYLLMAGHVRDQIIGSTNGSTVNMLAISGLQDSTFSLPP 353 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 DI + + E+ E+ L + R + + G Sbjct: 354 ----QDIVEKFTATVRPLWEMAERNEKESRDLIKLRDLLLPMLIAG 395 Score = 41.3 bits (95), Expect = 0.34, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 46/166 (27%), Gaps = 16/166 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 + W + + + G + + SG+ + L V G G Y + + Sbjct: 196 EDWPTARLDQHLTAHRGLSYKGAGLCESGEGVPMHNLNSVYEGGG-YKYPGIKYYKGEFK 254 Query: 77 TVSIFAKGQILYGK---------LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 + G I+ +G I +GI S + P D + Sbjct: 255 ERHVLKPGDIIVTNTEQGHEHRLIGFPAVVPSIFGDNGIFSQHIYRIVPLDSSYLGREFI 314 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + I S + I + L Q ++ + Sbjct: 315 YYLLMAGHVRDQIIGSTNGSTVNMLAISGLQDSTFSLPPQDIVEKF 360 >gi|163743542|ref|ZP_02150919.1| Restriction modification system, type I [Phaeobacter gallaeciensis 2.10] gi|161383127|gb|EDQ07519.1| Restriction modification system, type I [Phaeobacter gallaeciensis 2.10] Length = 415 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 59/420 (14%), Positives = 118/420 (28%), Gaps = 35/420 (8%) Query: 30 IKRFTK----LNTGRT---SESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIF 81 + F + + G + +I DV G + ++ S+ +I Sbjct: 4 LSEFCEPGSPITYGVVQPGPTDPNGVKFIRGGDVSDGKIAESELRTISAEVSNQYKRTIL 63 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G++L +G A++ + +V + + L +L+S + Sbjct: 64 RGGELLVSLVGNPGEVALVPSHMAGLNIARQVAMVRLSNQINSKFLMYFLMSPMGRSALG 123 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A G+ S + + + + +P + Q I E + +D I R E L+E Sbjct: 124 AQAIGSVQSVINLRDLKRVEVPNIERSTQDKIAEIL----GTLDDKIELNRRMNETLEEM 179 Query: 199 KQALV-SYIVTKGLNP-DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 +AL + V G + MK+ GI P A + Sbjct: 180 ARALFRDWFVEFGPTRRQMAMKEKGIATDPAAIMGHAFPPEKAATLAPLFPTKLGDDGLP 239 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS---AQVMER 313 ++ L + + P + +L V + Sbjct: 240 EGWETRDLRSALTLNYGKSLT---KKARRPGPFNVFGSGGISGTHDTALAKGPSIIVGRK 296 Query: 314 GIITSAYMAVKPHGIDSTYLA--------WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 G + S Y + T + R + + L ++ R Sbjct: 297 GTVGSLYWTREDFYAIDTVFYVTSDYPMVYCHRLLETLGLETMNTDAAVPGLNRDNAYRQ 356 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 + T + D +Q L E R + ++G+I L+ Sbjct: 357 EFAFGGDALIHAYAEFVGNLTEKSDA----NQQENQTLAEMRDLLLPKLMSGEIRLKDAE 412 Score = 39.8 bits (91), Expect = 0.90, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 49/189 (25%), Gaps = 20/189 (10%) Query: 18 IGA--IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +G +P+ W+ ++ LN G++ G + Sbjct: 233 LGDDGLPEGWETRDLRSALTLNYGKSLTKKARRP-----------GPFNVFGSGGISGTH 281 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 T + I+ G+ G + T F V + + + T Sbjct: 282 DTA-LAKGPSIIVGRKGTVGSLYWTREDFYAIDTVFYVT------SDYPMVYCHRLLETL 334 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +E + A + + A E + T + D E E+ Sbjct: 335 GLETMNTDAAVPGLNRDNAYRQEFAFGGDALIHAYAEFVGNLTEKSDANQQENQTLAEMR 394 Query: 196 KEKKQALVS 204 L+S Sbjct: 395 DLLLPKLMS 403 >gi|262383639|ref|ZP_06076775.1| type I restriction-modification system specificity determinant [Bacteroides sp. 2_1_33B] gi|262294537|gb|EEY82469.1| type I restriction-modification system specificity determinant [Bacteroides sp. 2_1_33B] Length = 388 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 48/162 (29%), Gaps = 9/162 (5%) Query: 21 IPKHWKVVPIKRFTKL-NTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQ 72 +P+ W++ I F K +G T ++ +V + + + Sbjct: 200 LPEGWRMGTIGEFCKETKSGGTPNRSNPKYWDKHHYRWLKSGEVANNIIFDTEEYISREG 259 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S+ I G ++ G + D D + + E + + Sbjct: 260 LKGSSAKIIPSGTVVMAMYGATASQVTYLDCDTTTNQACCNMLTATFE-EAAYLYFHCLY 318 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + I+ + G + + I P+ I + K+ Sbjct: 319 QQENIKRLANGGAQENLSQELICAQPILICENTHIYDVFSKL 360 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 44/319 (13%), Positives = 94/319 (29%), Gaps = 24/319 (7%) Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQP 116 +++ N+ +D ST I Q Y + + I S ++V + Sbjct: 42 QFITSIANTTGTDMSTYKIVQPRQFGYVPVTSRNGDKITIALYEGESPCIISQAYVVFEV 101 Query: 117 KDV---LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 D LPE L W + + G+ +W + +P+ + EQ I Sbjct: 102 IDETELLPEYLMMWFRRPEFDRYARFKSHGSAREVFEWSEMCEFLLPVSSIDEQRKIVA- 160 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 E I+ I R I ++E QA+ + ++ + + + +G + Sbjct: 161 ---EYQAIERRIENNRRLIATIEETAQAIYRKMFVDDIDVENLPEGWRMGTIGEFCKETK 217 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY---ETYQIVDPGE 290 + T + + L G + + + +I+ G Sbjct: 218 -----SGGTPNRSNPKYWDKHHYRWLKSGEVANNIIFDTEEYISREGLKGSSAKIIPSGT 272 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 +V + + + A + Y + Sbjct: 273 VVMAMYGATASQVTYLDCDTTTNQACCNMLTATFEE----AAYLYFHCLYQQENIKRLAN 328 Query: 351 SGLRQSLKFEDVKRLPVLV 369 G +++L E + P+L+ Sbjct: 329 GGAQENLSQELICAQPILI 347 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 52/150 (34%), Gaps = 9/150 (6%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRF-IDLQNDKRSLRSAQVMERGIITSAYMAV- 323 ++ T TY+IV P + + DK ++ + II+ AY+ Sbjct: 41 KQFITSIANTTGTDMSTYKIVQPRQFGYVPVTSRNGDKITIALYEGESPCIISQAYVVFE 100 Query: 324 --KPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + YL R + + G R+ ++ ++ + V I EQ I Sbjct: 101 VIDETELLPEYLMMWFRRPEFDRYARFKSHGSAREVFEWSEMCEFLLPVSSIDEQRKIV- 159 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSF 410 E I+ +E + I ++E + Sbjct: 160 ---AEYQAIERRIENNRRLIATIEETAQAI 186 >gi|171057996|ref|YP_001790345.1| restriction modification system DNA specificity subunit [Leptothrix cholodnii SP-6] gi|170775441|gb|ACB33580.1| restriction modification system DNA specificity domain [Leptothrix cholodnii SP-6] Length = 578 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 61/199 (30%), Gaps = 16/199 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E P WE+ LV + + + +++ PE+ Sbjct: 92 SDEEISFDAPRGWELVRLGDLVNASEAGWSPSCAGSPRRAGHWGVLKVSAVSWGKFDPEA 151 Query: 280 YET---------YQIVDPGEIVFRFIDLQNDK-RSLRSAQVMERGIITSAYMAVKPHGID 329 + V G+ + + + RS+ V R +++ + + Sbjct: 152 NKELPADLQPKPEYEVRSGDFLLSRANTEELVARSVVVGAVDPRLMLSDKIIRLDVANPI 211 Query: 330 STYLAWLMRSYD-LCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + A SG +++ E V LP+ +PP+ EQ I + Sbjct: 212 HRGFLNFCNNEKSARTHYAANASGTSSSMKNVSREVVLNLPIALPPLAEQSRIVTRVEEL 271 Query: 386 TARIDVLVEKIEQSIVLLK 404 D L ++ + + Sbjct: 272 MRLCDALES--QRQLETAQ 288 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 69/187 (36%), Gaps = 9/187 (4%) Query: 220 SGIEWVGLVPDHWEVKPF---FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 S + + +P+ W V LV+ + + E + Y + ++ Sbjct: 386 SDKDGLDDLPEGWVVVRLGAIMELVSGQHLGPAEYAEGLDSGIPYLTGPAEFGPQSPSPT 445 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + E I G+I+ ++ + I+ MAV+ G++ +L + Sbjct: 446 RSTVERRAIAIWGDILIT---VKGSGVGKLNVVAHSEIAISRQLMAVRSIGVNDAFLFIV 502 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK- 395 +++ ++ ++G + EDV + +PP+ EQ I + + L ++ Sbjct: 503 LKTLEIKFQMQSVGI-AIPGIGREDVSHSILGLPPLAEQARIVARVTQLRSHCADLRQRL 561 Query: 396 -IEQSIV 401 Q+I Sbjct: 562 SARQAIQ 568 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 74/193 (38%), Gaps = 16/193 (8%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNS 70 + +P+ W VV + +L +G+ I Y+ G ++ P+ + Sbjct: 391 LDDLPEGWVVVRLGAIMELVSGQHLGPAEYAEGLDSGIPYLT------GPAEFGPQSPSP 444 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKA-IIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 +S +I G IL G + K ++A + S Q + ++ V L L Sbjct: 445 TRSTVERRAIAIWGDILITVKGSGVGKLNVVAHSEIAISRQLMAVRSIGVNDAFLFIVLK 504 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 ++++ +++++ G + + + + + +PPLAEQ I ++ L Sbjct: 505 TLEIKFQMQSV--GIAIPGIGREDVSHSILGLPPLAEQARIVARVTQLRSHCADLRQRLS 562 Query: 190 RFIELLKEKKQAL 202 + +AL Sbjct: 563 ARQAIQSHLAEAL 575 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 54/191 (28%), Gaps = 15/191 (7%) Query: 22 PKHWKVVPIKRFTK-LNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P+ W++V + G + + + V G Sbjct: 101 PRGWELVRLGDLVNASEAGWSPSCAGSPRRAGHWGVLKVSAVSWGKFDPEANKELPADLQ 160 Query: 75 TSTVSIFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 G L + D + S + + L + + + Sbjct: 161 PKPEYEVRSGDFLLSRANTEELVARSVVVGAVDPRLMLSDKIIRLDVANPIHRGFLNFCN 220 Query: 130 SIDVTQRIEAIC---EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + + A ++M + + + N+P+ +PPLAEQ I ++ D L + Sbjct: 221 NEKSARTHYAANASGTSSSMKNVSREVVLNLPIALPPLAEQSRIVTRVEELMRLCDALES 280 Query: 187 ERIRFIELLKE 197 +R + Sbjct: 281 QRQLETAQHAQ 291 >gi|261839285|gb|ACX99050.1| type I restriction enzyme specificity subunit [Helicobacter pylori 52] Length = 390 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 64/362 (17%), Positives = 118/362 (32%), Gaps = 18/362 (4%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 ++ + L T +S + YI +++ ++ G K+ N Q + F K + Sbjct: 3 KTLQDYATLIND-TIQSNEINHYITTDNMCQNLGGIDTLKNINIPQEKVRS---FQKDDV 58 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 L + Y RK A G CS+ LV + K + L L S T + +G+ M Sbjct: 59 LLSNIRLYFRKVYRAKQKGGCSSDVLVFRAKHIDSATLFAILSSQIFTDYACSGSQGSKM 118 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAE--TVRIDTLITERIRFIELLKEKKQALVS 204 + + + +P + +I+ L+ + + + + + Sbjct: 119 PRGNKTHMMDFKIPTINFTIAKIFNSIQNKIENNHKINELLHKILELLYEQYFVRFDFLD 178 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 KMK S E L+P+ +EVK L+ + +I G I Sbjct: 179 ENNKPYQTSGGKMKFS-KELNRLIPNDFEVKTLGELI-TWISGSQPPKSCHIYEHKEGYI 236 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVMERGIITSAYMA 322 +N Y TY + + D+ DK ++ + Sbjct: 237 ---RFIQNRDYSSNDYITYIPISKNNKICYQYDIMIDKYGEAGAVRFGLQGAYNVALSKI 293 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIK-EQF--DI 378 + Y+ + S + K + R SL + L + +PPI Q I Sbjct: 294 SVINQSMQEYIRSYLNSKPIKKYLSNACMASTRSSLNENHIYSLMLPIPPINLLQKYEKI 353 Query: 379 TN 380 Sbjct: 354 AK 355 Score = 38.2 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 49/147 (33%), Gaps = 4/147 (2%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVS 79 IP ++V + +G I + Y D + + Sbjct: 201 IPNDFEVKTLGELITWISGSQPPKSCHIYEHKEGYIRFIQNRDYSSNDYITYIPISKNNK 260 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIE 138 I + I+ K G A+ G + + + E ++ +L S + + + Sbjct: 261 ICYQYDIMIDKYGEAG--AVRFGLQGAYNVALSKISVINQSMQEYIRSYLNSKPIKKYLS 318 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLA 165 C +T S + I ++ +PIPP+ Sbjct: 319 NACMASTRSSLNENHIYSLMLPIPPIN 345 >gi|257467465|ref|ZP_05631776.1| type I restriction-modification system DNA specificity subunit [Fusobacterium gonidiaformans ATCC 25563] gi|315918590|ref|ZP_07914830.1| conserved hypothetical protein [Fusobacterium gonidiaformans ATCC 25563] gi|313692465|gb|EFS29300.1| conserved hypothetical protein [Fusobacterium gonidiaformans ATCC 25563] Length = 205 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 65/194 (33%), Gaps = 7/194 (3%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 G P W+ + T + + + +++ GN+I + + + Y Sbjct: 18 GKKPLAWKATTLGNVTTNIRKNIGDKVYPVFSAVNSGNLIFSDDYFTKQVYSKKLNKYIE 77 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 VD + + + ++ G ++ Y+ ++ + + Sbjct: 78 VDTWNFAYNPARINIGSIGINEHNII--GCVSPVYVVFSVQKEYHSFFRFYFKQNFFNLH 135 Query: 346 FYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 SG +RQ+L ++D + V+ P + + + +++ L Sbjct: 136 CKTKASGSVRQTLSYKDFSLIDVVYPN----NEYALKFDTLWKSFYQKILRLKAENKYLS 191 Query: 405 ERRSSFIAAAVTGQ 418 E R S + ++G+ Sbjct: 192 ELRDSLLPKLMSGE 205 >gi|229521081|ref|ZP_04410502.1| type I restriction enzyme EcoR124II specificity protein (S protein) (S.EcoR124II) [Vibrio cholerae TM 11079-80] gi|229341966|gb|EEO06967.1| type I restriction enzyme EcoR124II specificity protein (S protein) (S.EcoR124II) [Vibrio cholerae TM 11079-80] Length = 384 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 28/305 (9%), Positives = 78/305 (25%), Gaps = 13/305 (4%) Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI--IAE 177 P ++ + + + + + M + L + I + Sbjct: 72 NPVIIFDDFTTANKWVDFDFKAKSSAMKMIKSSDESKFMLKYVYYWMNTLPSDLIEGDHK 131 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 I + I +K + + + L+ M + L + Sbjct: 132 RQWISNYCAKNIPIPCPDNPEKSLAIQAEIVRILDAFTAMTAELTAELNLRKKQYNYYR- 190 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE----SYETYQIVDPGEIVF 293 L++ + I ++ G ++ + Y + E Sbjct: 191 DQLLSFEEGEVEWKALGKIAEINTGQKPSEILDTEAEFDYINAGTTRSGYCALSNCEGDT 250 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + + + + + + + + Sbjct: 251 VTTPSRGQGGIGFVGYQNKSFWLGPLCYKIRSIDNKVLINKYLFYILQSKNQLLLGLKKE 310 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RR 407 G ++ D+ +L V VP + EQ I +++ + E + I L ++ R Sbjct: 311 GGVPAVNKSDLSKLEVPVPSVTEQERIVEILDKFDTLTTSIQEGLPCEIELRQKQYEYYR 370 Query: 408 SSFIA 412 ++ Sbjct: 371 DLLLS 375 >gi|160946889|ref|ZP_02094092.1| hypothetical protein PEPMIC_00850 [Parvimonas micra ATCC 33270] gi|158447273|gb|EDP24268.1| hypothetical protein PEPMIC_00850 [Parvimonas micra ATCC 33270] Length = 186 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 66/178 (37%), Gaps = 7/178 (3%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + + + K + + +S + I+ + N + E Y +++ VF Sbjct: 7 IYDGTHQTPKYVNIGVPFVSVQD-IKNIYGTNKYITIEEYNKFKVKPRKNDVFMTRIGDI 65 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLK 358 ++ +T A + + S +L +L+ S K + + Sbjct: 66 GTCAIVKNDDDLAYYVTLALIRPSNDIVLSKFLKYLIESNQGKKELSKRILHNATPIKIN 125 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++ +L +P IK Q I ++++ A ++ + + + + I L ++ R ++ Sbjct: 126 LGEIGKLKFFIPSIKVQEHIVSILDKFNAIVNNISKGLPKEIELRQKQYEYYREKLLS 183 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 64/185 (34%), Gaps = 10/185 (5%) Query: 32 RFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 F ++ G + ++ ++D+++ Y + + K + Sbjct: 3 DFAEIYDGTHQTPKYVNIGVPFVSVQDIKN---IYGTNKYITIEEYNKFKVKPRKNDVFM 59 Query: 89 GKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSIDVTQR-IEAICEGA 144 ++G AI+ + D + + VL + L+ + S + + I A Sbjct: 60 TRIGDIGTCAIVKNDDDLAYYVTLALIRPSNDIVLSKFLKYLIESNQGKKELSKRILHNA 119 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T + IG + IP + Q I + ++ + + IEL +++ + Sbjct: 120 TPIKINLGEIGKLKFFIPSIKVQEHIVSILDKFNAIVNNISKGLPKEIELRQKQYEYYRE 179 Query: 205 YIVTK 209 +++ Sbjct: 180 KLLSF 184 >gi|34581062|ref|ZP_00142542.1| hypothetical type I restriction enzyme S subunit [Rickettsia sibirica 246] gi|28262447|gb|EAA25951.1| hypothetical type I restriction enzyme S subunit [Rickettsia sibirica 246] Length = 216 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 71/217 (32%), Gaps = 5/217 (2%) Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK--NTKLIE 253 Q ++ ++ + +W + + + + L K T ++ Sbjct: 1 MNSYQKIIEGAKQI-IDNWHPYFEINKQWEIVKFGDIVINKLKSNILSLEHKEYTTLIVG 59 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 ++ I+ + Y Q G I+ M Sbjct: 60 KKGKMININTAIKGDIPVIASGRVSPYSHNQYNFNGNIITISSSGAYAGYIWYHNSPMWT 119 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 Y + + YL ++++S GSG + + +D++ L + +PP++ Sbjct: 120 SDCNVIYSIN-EKLLLTKYLYYILKSQQNIIYQKQAGSG-QPHVYLKDLEDLQIPIPPLE 177 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 EQ + ++ ++ID L I+Q LK +S Sbjct: 178 EQQKMVTELDNNQSKIDNLKNYIKQFENKLKTTLNSL 214 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 55/194 (28%), Gaps = 9/194 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESG---------TGKYLPKDGNS 70 I K W++V S + Y L + G G Sbjct: 23 EINKQWEIVKFGDIVINKLKSNILSLEHKEYTTLIVGKKGKMININTAIKGDIPVIASGR 82 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + F I G Y + S ++ + L + + Sbjct: 83 VSPYSHNQYNFNGNIITISSSGAYAGYIWYHNSPMWTSDCNVIYSINEKLLLTKYLYYIL 142 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 I G+ H K + ++ +PIPPL EQ + ++ +ID L + Sbjct: 143 KSQQNIIYQKQAGSGQPHVYLKDLEDLQIPIPPLEEQQKMVTELDNNQSKIDNLKNYIKQ 202 Query: 191 FIELLKEKKQALVS 204 F LK +L Sbjct: 203 FENKLKTTLNSLWQ 216 >gi|167851481|ref|ZP_02476989.1| restriction modification system DNA specificity domain [Burkholderia pseudomallei B7210] Length = 387 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 57/184 (30%), Gaps = 12/184 (6%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESN--ILSLSYGNIIQKLETRNMGLKPES---YETY 283 P+ W L + + ++ + L GNI + + Sbjct: 175 PNGWAWTRLAQLGEKFDYGTSQKTGDGAGVPVLRMGNIQRGQVVFDSMKYLHDQLGELPD 234 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV---KPHGIDSTYLAWLMRSY 340 + G+++F + ++Y+ P+ + Y+ M S Sbjct: 235 LYLREGDLLFNRTNSYELVGKTGLFSAESNRFSFASYLIRVRLIPNLTNPRYVNLYMNSI 294 Query: 341 DLCKVF---YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE-KI 396 + + + + +K + V +PP+ EQ I + A D L + + Sbjct: 295 VCRRTQIEPQIVQQNGQANFNGSKLKHICVPLPPLAEQARIVARVEELRALCDGLRKRLV 354 Query: 397 EQSI 400 +Q I Sbjct: 355 DQQI 358 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 65/198 (32%), Gaps = 12/198 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P W + + + TS+ D + + + +++ G + Q Sbjct: 174 LPNGWAWTRLAQLGEKFDYGTSQKTGDGAGVPVLRMGNIQRGQVVFDSMKYLHDQLGELP 233 Query: 78 VSIFAKGQILYGKLGPY------LRKAIIADFDGICSTQFLVL-QPKDVLPELLQGWLLS 130 +G +L+ + Y + ++ S V P P + ++ S Sbjct: 234 DLYLREGDLLFNRTNSYELVGKTGLFSAESNRFSFASYLIRVRLIPNLTNPRYVNLYMNS 293 Query: 131 IDVTQRIE--AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 I + I + ++ + + +I +P+PPLAEQ I ++ D L Sbjct: 294 IVCRRTQIEPQIVQQNGQANFNGSKLKHICVPLPPLAEQARIVARVEELRALCDGLRKRL 353 Query: 189 IRFIELLKEKKQALVSYI 206 + A+V Sbjct: 354 VDQQICQSRFATAMVQQA 371 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 24/60 (40%), Gaps = 1/60 (1%) Query: 337 MRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + S + K + G+ R+ L + + + +PP EQ I ++ D L + Sbjct: 2 LISAHVQKTVMDVQVGVSREGLSMAKLGQFVIPLPPRSEQARIVAKVDELMRLCDELEAR 61 >gi|325911637|ref|ZP_08174045.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 143-D] gi|325476623|gb|EGC79781.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 143-D] Length = 417 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 53/420 (12%), Positives = 138/420 (32%), Gaps = 32/420 (7%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTS 76 +G I I G+++ K I ++ +++ + K + +Q+ Sbjct: 6 LGKI-----TKKIGSGFTPKGGKSTYCSKGIAFVRSQNILDMQFSKDGLVYISDKQAAKL 60 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLPELLQGWLLSIDV 133 + +L G + +A I D + + +++ + + Sbjct: 61 KNASIESDDVLLNITGDSVARACIMDSKYLPARVNQHVSIIRCDPNKIKSQYLLYYLQYL 120 Query: 134 TQRIEAICE-GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + G+T + I + + +P + +Q I + + + + + Sbjct: 121 KKHLLKMASVGSTRKALTKEEISGLLVELPSIEKQKEITLLLES----VRHKMQINRQIN 176 Query: 193 ELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLV------PDHWEVKPFFALVTE 243 + L + + Y + PD K SG + V P W V+ Sbjct: 177 DNLAAMIKTIYEYWFIQFEFPDENGKPYKSSGGKMVWNEQLKRTIPQGWSVESIINTPLC 236 Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP--GEIVFRFIDLQND 301 K S L+ ++I + E+ E+ + P + F + Sbjct: 237 YPIKPGIKPFSEKTYLATADVIGTSIGTGNPINYETRESRANMQPEINSVWFAKMKSSIK 296 Query: 302 KRSLRSA--QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLK 358 L S+ + I+++ + ++ Y+A + + + + G ++++ Sbjct: 297 HLFLSSSMHDFIHSSILSTGFQGLQCTERSFEYIASFIGNDYFETLKDQLAHGATQEAVN 356 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +D+K + +L+P ++ + + + L+ L+ R + + GQ Sbjct: 357 NDDLKGVKILIPD----NRTLDLYHSASRQNYQLIGSALIENKHLESLRDWLLPMLMNGQ 412 >gi|325973648|ref|YP_004250712.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323652250|gb|ADX98332.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 246 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 67/191 (35%), Gaps = 8/191 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + ++ +G + + +L+ K+ + + + + R+ L S Sbjct: 10 TTLDKLGKISSGKPYDRKYEFNPKLHEKSIPFVG--VKEVGQSRLHILESDRHCFLNNLS 67 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + ++ + + +L + + + + + ++ + + S Sbjct: 68 KKGNKLFSKNTVCISIYGSYPGESALLKSDAF--LSTSVFAFSHYENISNPKFIKYCLDS 125 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + +R++L + + PP EQ I + ++ D L+E E+ Sbjct: 126 QRKTFSSISATTTIRKALPTYQLFSIKFPCPPQGEQERIGDTLSA----YDELIENNERQ 181 Query: 400 IVLLKERRSSF 410 I +L+ R++ Sbjct: 182 IEVLQGVRTAI 192 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 65/201 (32%), Gaps = 13/201 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESG---------KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W++ + + K+++G+ + K I ++G+++V L D + ++ Sbjct: 7 WELTTLDKLGKISSGKPYDRKYEFNPKLHEKSIPFVGVKEVGQSRLHILESDRHCFLNNL 66 Query: 76 STV--SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S +F+K + G Y ++ + D ST + + Sbjct: 67 SKKGNKLFSKNTVCISIYGSYPGESALLKSDAFLSTSVFAFSHYENISNPKFIKYCLDSQ 126 Query: 134 TQRIEAIC-EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +I + +I P PP EQ I + + A I+ + Sbjct: 127 RKTFSSISATTTIRKALPTYQLFSIKFPCPPQGEQERIGDTLSAYDELIENNERQIEVLQ 186 Query: 193 ELLKEKKQ-ALVSYIVTKGLN 212 + + ++ L Sbjct: 187 GVRTAIFKEWFINLRFPNYLT 207 >gi|281424437|ref|ZP_06255350.1| type I restriction enzyme EcoR124II specificity protein [Prevotella oris F0302] gi|281401706|gb|EFB32537.1| type I restriction enzyme EcoR124II specificity protein [Prevotella oris F0302] Length = 272 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 68/207 (32%), Gaps = 15/207 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILS--------LSYGNIIQKLETRNMGLKPE 278 VP+ W + T L+R + L G I K + Sbjct: 4 EVPEGWVWITLGEICTFLSRGKSPKYSEERKFPIFAQKCNLKEGGISLKQARFLDPSTID 63 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII-----TSAYMAVKPHGIDSTYL 333 ++ + G+I+ R+ + + + I S Y+ Sbjct: 64 KWDESYKLKTGDILINSTGTGTAGRTRLFDESFLGAYPFAVPDSHVSVVRTSTKIVSEYV 123 Query: 334 AWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + S GS ++ L ++RLP+ +P + EQ I + I + ID Sbjct: 124 YAYVSSLSTQLYLEENLAGSTNQKELYIGVIERLPLPLPSLAEQQRIVSEIERWSVLIDT 183 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + E +K+ ++ + A+ G+ Sbjct: 184 IEQGKENLETSIKQAKNKILDLAIHGK 210 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 39/262 (14%), Positives = 91/262 (34%), Gaps = 22/262 (8%) Query: 20 AIPKHWKVVPIKRFTKL-NTGRTSESGKDIIY--------IGLEDVESGTGKYLPKDGNS 70 +P+ W + + + G++ + ++ + + + ++L Sbjct: 4 EVPEGWVWITLGEICTFLSRGKSPKYSEERKFPIFAQKCNLKEGGISLKQARFLDPSTID 63 Query: 71 RQSDTSTVSIFAKGQILYGKLGP-YLRKAIIADFDGICSTQFLVLQPK--------DVLP 121 + ++ G IL G + + D + + F V ++ Sbjct: 64 KWDES---YKLKTGDILINSTGTGTAGRTRLFDESFLGAYPFAVPDSHVSVVRTSTKIVS 120 Query: 122 ELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 E + ++ S+ +E G+T I +P+P+P LAEQ I +I +V Sbjct: 121 EYVYAYVSSLSTQLYLEENLAGSTNQKELYIGVIERLPLPLPSLAEQQRIVSEIERWSVL 180 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 IDT+ + +K+ K ++ + L P + E + + E+ Sbjct: 181 IDTIEQGKENLETSIKQAKNKILDLAIHGKLVPQDPNDEPASELLKRINPKAEIACDNEH 240 Query: 241 VTELNRKNTKLIESNILSLSYG 262 +L + + + ++ L G Sbjct: 241 YAQLPKGWSVISMQDVCKLKDG 262 >gi|168575546|ref|ZP_02721482.1| type I restriction enzyme EcoEI specificity protein [Streptococcus pneumoniae MLV-016] gi|307067540|ref|YP_003876506.1| restriction endonuclease S subunit [Streptococcus pneumoniae AP200] gi|183578530|gb|EDT99058.1| type I restriction enzyme EcoEI specificity protein [Streptococcus pneumoniae MLV-016] gi|306409077|gb|ADM84504.1| Restriction endonuclease S subunit [Streptococcus pneumoniae AP200] Length = 195 Score = 64.8 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 49/161 (30%), Gaps = 19/161 (11%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + +++ + Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIMVSYTNLGEQQRIASELDLLSK 153 Query: 388 RI----DVLVEK---------IEQSIVLLKERRSSFIAAAV 415 I + L E I++S+ L+ + S + Sbjct: 154 LILRRQEQLEELNLLVKSQLAIQKSLEELETLKKSLMQEYF 194 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 45/169 (26%), Gaps = 2/169 (1%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 M H K NI + L EQ I ++ + I + L Sbjct: 120 MKHLTKKYFDNIMVSYTNLGEQQRIASELDLLSKLILRRQEQLEELNLL 168 >gi|227511528|ref|ZP_03941577.1| possible type I restriction-modification system specificity subunit [Lactobacillus buchneri ATCC 11577] gi|227085173|gb|EEI20485.1| possible type I restriction-modification system specificity subunit [Lactobacillus buchneri ATCC 11577] Length = 255 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/278 (14%), Positives = 81/278 (29%), Gaps = 28/278 (10%) Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ + G+T + K I + IP + K+ +D I +E Sbjct: 2 KKANKLASGSTFTEISGKSTAKITLYIPNEHSEKEKIAKL---FFNLDNRIAANQSKLEQ 58 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 LK K+ L+ I N + + K W + + K Sbjct: 59 LKRLKKLLMQKIF----NQEWRFKGFTDPWEQRKLKQLVKSRDKDRIPIESGKRQAGKYP 114 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + ++ L ++ D+ S V R Sbjct: 115 YYGATGIVDYVKDYIFEGTYL---------------LLAEDGANILDRTHPISYVVNGRF 159 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + + + T L +L S + + L + V ++ VL P E Sbjct: 160 WVNNHAHTFQSSQ--GTDLTFLAESLERIHYQRYNTGTAQPKLNAKVVGKIEVLCPTSNE 217 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 Q + + I+VL+ ++ + L+ + + Sbjct: 218 QRK----LGKLSYLINVLIAANQRRLDQLQSLKKYLMQ 251 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 7/74 (9%), Positives = 20/74 (27%), Gaps = 5/74 (6%) Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIV 401 K + + ++ + +P E+ I +D + + + Sbjct: 2 KKANKLASGSTFTEISGKSTAKITLYIPNEHSEKEKIA----KLFFNLDNRIAANQSKLE 57 Query: 402 LLKERRSSFIAAAV 415 LK + + Sbjct: 58 QLKRLKKLLMQKIF 71 Score = 37.1 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 51/184 (27%), Gaps = 18/184 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ +K+ +D I +E + GKY P G + D IF Sbjct: 84 WEQRKLKQLV---------KSRDKDRIPIESGKRQAGKY-PYYGATGIVDYVKDYIFEGT 133 Query: 85 QILYGKLGP-----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +L + G + + + + Q +L + Sbjct: 134 YLLLAEDGANILDRTHPISYVVNGRFWVNNHAHTFQSSQGTD---LTFLAESLERIHYQR 190 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + K +G I + P EQ + + V I + L K Sbjct: 191 YNTGTAQPKLNAKVVGKIEVLCPTSNEQRKLGKLSYLINVLIAANQRRLDQLQSLKKYLM 250 Query: 200 QALV 203 Q + Sbjct: 251 QNMF 254 >gi|15893271|ref|NP_360985.1| putative type I restriction enzyme S subunit [Rickettsia conorii str. Malish 7] gi|15620492|gb|AAL03886.1| type I restriction enzyme S subunit-like protein [Rickettsia conorii str. Malish 7] Length = 216 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 72/217 (33%), Gaps = 5/217 (2%) Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK--NTKLIE 253 Q ++ ++ + +W + + + + L K T ++ Sbjct: 1 MNSYQKIIEGAKQI-IDNWHPYFEINKQWEIVKFGDIVINKLKSNILSLEHKEYTTLIVG 59 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 ++ I+ + Y Q G I+ M Sbjct: 60 KKGKMININTAIKGDIPVIASGRVSPYSHNQYNFNGNIITISSSGAYAGYIWYHNSPMWT 119 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 Y + + YL ++++S +GSG + + +D++ L + +PP++ Sbjct: 120 SDCNVIYSIN-EKLLLTKYLYYILKSQQNIIYQKQVGSG-QPHVYLKDLEDLQIPIPPLE 177 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 EQ + ++ ++ID L I+Q LK +S Sbjct: 178 EQQKMVTELDNNQSKIDNLKNYIKQFENKLKTTLNSL 214 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 55/194 (28%), Gaps = 9/194 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESG---------TGKYLPKDGNS 70 I K W++V S + Y L + G G Sbjct: 23 EINKQWEIVKFGDIVINKLKSNILSLEHKEYTTLIVGKKGKMININTAIKGDIPVIASGR 82 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + F I G Y + S ++ + L + + Sbjct: 83 VSPYSHNQYNFNGNIITISSSGAYAGYIWYHNSPMWTSDCNVIYSINEKLLLTKYLYYIL 142 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 I G+ H K + ++ +PIPPL EQ + ++ +ID L + Sbjct: 143 KSQQNIIYQKQVGSGQPHVYLKDLEDLQIPIPPLEEQQKMVTELDNNQSKIDNLKNYIKQ 202 Query: 191 FIELLKEKKQALVS 204 F LK +L Sbjct: 203 FENKLKTTLNSLWQ 216 >gi|256854684|ref|ZP_05560048.1| predicted protein [Enterococcus faecalis T8] gi|256710244|gb|EEU25288.1| predicted protein [Enterococcus faecalis T8] Length = 219 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 65/209 (31%), Gaps = 11/209 (5%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 V P ++ D EW + + + K E+ + + ++ + Sbjct: 15 VKDERAPKLRFADFEGEWEQCKLEDYATYRRGSFPQPYGNKKWYDGENAMPFVQVIDVTE 74 Query: 267 KLETRNMGLKPESY---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 +L + S V G++V + ++R ++ Sbjct: 75 QLSLVKDTKQKISKLAQSKSVFVSAGKVVVTLQGSIGRVAITQYNSYIDRTLL---VFES 131 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 D + A+ ++ G +++ E + V P +EQ N + Sbjct: 132 YEKETDEYFWAYTIQQ-KFEIEKRKAPGGTIKTITKEALSSFEVNFPEYEEQQKNGNFL- 189 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIA 412 +D ++ ++ + LK + S++ Sbjct: 190 ---KNLDNILTLDQKKLDQLKSLKKSYLQ 215 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/189 (10%), Positives = 50/189 (26%), Gaps = 10/189 (5%) Query: 24 HWKVVPIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ ++ + G + + ++ + DV + Sbjct: 31 EWEQCKLEDYATYRRGSFPQPYGNKKWYDGENAMPFVQVIDVTEQLSLVKDTKQKISKLA 90 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S + G+++ G R I ++ LV + + + Sbjct: 91 QSKSVFVSAGKVVVTLQGSIGR-VAITQYNSYIDRTLLVFESYEKETDEYFWAYTIQQKF 149 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + G T+ + + + + P EQ + + + + L Sbjct: 150 EIEKRKAPGGTIKTITKEALSSFEVNFPEYEEQQKNGNFLKNLDNILTLDQKKLDQLKSL 209 Query: 195 LKEKKQALV 203 K Q + Sbjct: 210 KKSYLQNMF 218 >gi|160893878|ref|ZP_02074659.1| hypothetical protein CLOL250_01430 [Clostridium sp. L2-50] gi|156864459|gb|EDO57890.1| hypothetical protein CLOL250_01430 [Clostridium sp. L2-50] Length = 215 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 16/130 (12%), Positives = 42/130 (32%), Gaps = 4/130 (3%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + I A+ +++ + M S + + Sbjct: 87 NDTLMSVRAPVGDLNVAHTDCCIGRGLAAIHSKSNHQSFVLYTMFSLKKQLDVFNGEGTV 146 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 S+ + +P+L+P I + A +D+ + I L++ R + + Sbjct: 147 FGSINRNSLNDMPILIPSDD----ILDEFERIVAPMDLTIRNNYDEICRLQDIRDTLLPR 202 Query: 414 AVTGQIDLRG 423 ++G++D+ Sbjct: 203 LMSGELDVSD 212 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 45/182 (24%), Gaps = 2/182 (1%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W + + G++ G ++ + + R T + Sbjct: 26 SDWAEGTLSDIADITIGQSPSGSSYNEDGTGTIFFQGRAEFGFRFPSVRLYTTEPKRMAR 85 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 L P +A D + K + + + Q E Sbjct: 86 SNDTLMSVRAPV-GDLNVAHTDCCIGRGLAAIHSKS-NHQSFVLYTMFSLKKQLDVFNGE 143 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + + ++P+ IP + + I E R ++ L Sbjct: 144 GTVFGSINRNSLNDMPILIPSDDILDEFERIVAPMDLTIRNNYDEICRLQDIRDTLLPRL 203 Query: 203 VS 204 +S Sbjct: 204 MS 205 >gi|321310218|ref|YP_004192547.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802062|emb|CBY92708.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 207 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 54/160 (33%), Gaps = 9/160 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM--GLKPESYETYQIVD 287 H ++ + + ++ + + S I + GN+ T + E + I+ Sbjct: 14 KHLLLEEVCEICSGISFQGSFRRGSGIPVIKAGNVQDDQITEDNLDYFDSEDHPKAAIIK 73 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKV 345 G++V + + +S P S YL + S + Sbjct: 74 YGDVVIVRKGSPGK---VGINLTDQEFFFSSEIFKFVPKEEVLISRYLYHFLLSQ--QEE 128 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 G+ ++ ++ ++ + +P ++ Q I + ++ Sbjct: 129 IKKGARGIIPGIRKSELGKMRIPIPSLETQERIAHTLDKF 168 >gi|84624914|ref|YP_452286.1| hypothetical protein XOO_3257 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84368854|dbj|BAE70012.1| hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 177 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 51/143 (35%), Gaps = 5/143 (3%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSY 340 + + G+++ + R + + E ++ S V+ TYL Sbjct: 30 EERKIQFGDVLVNSTGVGTLGRVAQVLSLDEPTVVDSHVTVVRAGQRLRHTYLGQWFSDK 89 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 GS + L + +P+L+P Q + + + + ++ + + S Sbjct: 90 QSEIQTMGEGSTGQTELSRLKLAHMPILIPS---QKLLADF-DAIVSPLNSKIALADSSS 145 Query: 401 VLLKERRSSFIAAAVTGQIDLRG 423 L R + + +TG++ ++ Sbjct: 146 RSLATLRDALLPKLITGELRVQD 168 >gi|325973248|ref|YP_004250312.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323651850|gb|ADX97932.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 227 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 49/142 (34%), Gaps = 10/142 (7%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDS 330 + ++ G+I+F S + + + Y + D Sbjct: 55 KKFYNLKGLRQSKLFSKGKILFIRSGNS---AGDSSFLNFDSCLTQNLYSFSSFKEISDP 111 Query: 331 TYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ + +L + + +L + ++ PP+ Q I ++ +R Sbjct: 112 KFVKYCFNFQNLKTKLIVLSKLQTAQPNLTLTKLFQVKFPKPPLDIQQKIGEIL----SR 167 Query: 389 IDVLVEKIEQSIVLLKERRSSF 410 D++++ E+ I LLK + S Sbjct: 168 YDLILDNNEKQIQLLKNLKISL 189 Score = 42.9 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 59/189 (31%), Gaps = 11/189 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD- 74 + W+ V + + + GR + G +I IG E+V P+ Sbjct: 3 EKWEWVTLDKLGNIEAGRQASKLDNSLFEGGNIPLIGGEEVSKSRFSVNPEVKKFYNLKG 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL---LQGWLLSI 131 +F+KG+IL+ + G + +FD + + + + Sbjct: 63 LRQSKLFSKGKILFIRSGNSAGDSSFLNFDSCLTQNLYSFSSFKEISDPKFVKYCFNFQN 122 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 T+ I + + + P PPL Q I E + + +D + Sbjct: 123 LKTKLIVLSKLQTAQPNLTLTKLFQVKFPKPPLDIQQKIGEILSRYDLILDNNEKQIQLL 182 Query: 192 IELLKEKKQ 200 L + Sbjct: 183 KNLKISLFK 191 >gi|317177249|dbj|BAJ55038.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F16] Length = 412 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 54/382 (14%), Positives = 113/382 (29%), Gaps = 23/382 (6%) Query: 43 ESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 ++ K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 24 DNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSLNSIIYSSVRPNQRHFGII 83 Query: 102 DF---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKG 153 + + ST F+V+ + + P L ++ +T ++ I C ++ Sbjct: 84 KEIPKNFLVSTAFIVIDIIDLEKLDPNYLYYYITQDKITHYLQRIAECGTSSYPSITPLD 143 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 NI + + L Q I + +I+ ++L+ + N Sbjct: 144 FLNIKIKLYLLETQQKIARTLSILDQKIENNHKINELLHKILELLYEQYFVRFDFLDENN 203 Query: 214 DVKM----KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 K + + + + + + +I G I Sbjct: 204 KPYQTSGGKMKFSKELNRLIPNDFEVKTLGELITWISGSQPPKSCHIYEHKEGYI---RF 260 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HG 327 +N Y TY + + D+ DK A + ++ + Sbjct: 261 IQNRDYSSNDYITYIPISKNNKICYQYDIMIDKYGEAGAVRFGLQGSYNVALSKISVLNQ 320 Query: 328 IDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 Y+ + S + K + R SL + L + +PPI Sbjct: 321 SMQEYIRSYLNSKPIKKYLSNACMASTRSSLNENHIYSLMLPIPPINLLQK----YEKIA 376 Query: 387 ARIDVLVEKIEQSIVLLKERRS 408 I + K QS L R Sbjct: 377 KNIITAIIKNNQSTQTLTALRD 398 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 68/178 (38%), Gaps = 13/178 (7%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + + I++ + Sbjct: 18 NNYTKEDNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSL---NSIIYSSVR 74 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 + ++ + ++++A++ + +D YL + + + + G+ Sbjct: 75 PNQRHFGIIK-EIPKNFLVSTAFIVIDIIDLEKLDPNYLYYYITQDKITHYLQRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID---VLVEKIEQSIVLLKER 406 S+ D + + + ++ Q I +++ +I+ + E + + + LL E+ Sbjct: 134 SSYPSITPLDFLNIKIKLYLLETQQKIARTLSILDQKIENNHKINELLHKILELLYEQ 191 Score = 38.6 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 55/191 (28%), Gaps = 4/191 (2%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVS 79 IP ++V + +G I + Y D + + Sbjct: 223 IPNDFEVKTLGELITWISGSQPPKSCHIYEHKEGYIRFIQNRDYSSNDYITYIPISKNNK 282 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIE 138 I + I+ K G A+ G + + + E ++ +L S + + + Sbjct: 283 ICYQYDIMIDKYGEAG--AVRFGLQGSYNVALSKISVLNQSMQEYIRSYLNSKPIKKYLS 340 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 C +T S + I ++ +PIPP+ + I L Sbjct: 341 NACMASTRSSLNENHIYSLMLPIPPINLLQKYEKIAKNIITAIIKNNQSTQTLTALRDFL 400 Query: 199 KQALVSYIVTK 209 L+ V Sbjct: 401 LPLLLKQQVKP 411 >gi|296314122|ref|ZP_06864063.1| type I restriction enzyme EcoR124II specificity protein [Neisseria polysaccharea ATCC 43768] gi|296839223|gb|EFH23161.1| type I restriction enzyme EcoR124II specificity protein [Neisseria polysaccharea ATCC 43768] Length = 219 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 50/161 (31%), Gaps = 4/161 (2%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N + + L + ++ I+ Sbjct: 45 NGIYPFCRTSDVGRVHHSINFYQIQDKLNDIGIKGLRLFKKETILLPKSGASTLLNHRVM 104 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + A + + + YL + + +D+ ++ SLK ++ ++ + Sbjct: 105 LTIDSYVSSHLATIYRNEKIVLAKYLFYFLSQFDVNELIPDKSY---PSLKVTEIAKIKI 161 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK-ERR 407 +PP++ Q I +++ T L +E + L K + R Sbjct: 162 PIPPLETQKKIVKILDKFTELEATLEATLEAELALRKRQYR 202 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 58/194 (29%), Gaps = 12/194 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVES--GTGKYLPKDGNSRQSDTST 77 + P+ +++ G ++ + DV + + Sbjct: 20 EWKPLGEIAEVSAGNSAPQNSAFFENGIYPFCRTSDVGRVHHSINFYQIQDKLNDIGIKG 79 Query: 78 VSIFAKGQILYGKLGPY--LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + +F K IL K G L ++ D S+ + + + + Sbjct: 80 LRLFKKETILLPKSGASTLLNHRVMLTIDSYVSSHLATIYRNEKIVLAKYLFYFLSQF-- 137 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + + I I +PIPPL Q I + + T TL + L Sbjct: 138 DVNELIPDKSYPSLKVTEIAKIKIPIPPLETQKKIVKILDKFTELEATLEATLEAELALR 197 Query: 196 KEKKQALVSYIVTK 209 K + + +++ Sbjct: 198 KRQYRYYRDFLLDF 211 >gi|14518366|ref|NP_116849.1| putative hsds of type i restriction-modification system [Microscilla sp. PRE1] gi|14485001|gb|AAK62883.1| MS161, putative HsdS of type I restriction-modification system [Microscilla sp. PRE1] Length = 227 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 59/172 (34%), Gaps = 10/172 (5%) Query: 228 VPDHWEVKPFFA-----LVTELNRKNTKLIESNILSLSYGNIIQKL-ETRNMGLKPESYE 281 +P +W+ V + + ++ + L NI + N+ E + Sbjct: 1 MPQNWKKYKLENVSERVTVGFVGSMAQEYVDKGVPMLRSQNIKPFSLDFDNVKFISEKFH 60 Query: 282 ---TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + ++ ++ ++ + + + I+ +L + Sbjct: 61 AKISKSSLKADDVAIVRTGTPGTACAI-PERIGQMNCSDLVIVTPNLNLINPHFLCYYFY 119 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 S V + ++Q K++ +L+P +KEQ I +V+ +I+ Sbjct: 120 SIASHYVNSQLVGAVQQHFNVGSAKKMEILLPSLKEQDTIVDVLKSIIDKIE 171 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 63/194 (32%), Gaps = 9/194 (4%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS-D 74 P++WK ++ ++ T S K + + ++++ + + S + Sbjct: 2 PQNWKKYKLENVSERVTVGFVGSMAQEYVDKGVPMLRSQNIKPFSLDFDNVKFISEKFHA 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLLSID 132 + S + + G I + G CS +V +++ + Sbjct: 62 KISKSSLKADDVAIVRTGTPGTACAIPERIGQMNCSDLVIVTPNLNLINPHFLCYYFYSI 121 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + GA H + + + +P L EQ I + + + +I+ + Sbjct: 122 ASHYVNSQLVGAVQQHFNVGSAKKMEILLPSLKEQDTIVDVLKSIIDKIELNLQMNRTLE 181 Query: 193 ELLKEKKQALVSYI 206 E+ + Sbjct: 182 EMAMTLYKHWFVDF 195 >gi|210135042|ref|YP_002301481.1| type I R-M system S protein [Helicobacter pylori P12] gi|210133010|gb|ACJ08001.1| type I R-M system S protein [Helicobacter pylori P12] Length = 177 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 41/114 (35%), Gaps = 5/114 (4%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I S + + S ++ K + YL + + + +G Sbjct: 68 ISSSGVYAGYVSYWDIPVFLADSFSVSPKQKTLMPKYLFHYLTTQQ-DAIHATKSTGGIP 126 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + +D++ + +PP++ Q +I +++ T L ++ + LK + Sbjct: 127 HVYSKDLQNFLIPIPPLEIQQEIVKILDAFTE----LNTELNTELKALKSIIKA 176 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 46/162 (28%), Gaps = 12/162 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + G++ K + + + + G + +R + Sbjct: 13 PKGVEFRKLGEVCDFQKGKSITK-KAVTFGKVPVISGGRQPAYYHNEANRSGE------- 64 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G Y D + F V PK + I A Sbjct: 65 ---TIAISSSGVYAGYVSYWDIPVFLADSFSV-SPKQKTLMPKYLFHYLTTQQDAIHATK 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + H K + N +PIPPL Q I + + A T Sbjct: 121 STGGIPHVYSKDLQNFLIPIPPLEIQQEIVKILDAFTELNTE 162 >gi|295090948|emb|CBK77055.1| Restriction endonuclease S subunits [Clostridium cf. saccharolyticum K10] Length = 332 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/315 (12%), Positives = 100/315 (31%), Gaps = 21/315 (6%) Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + L L S+ + ++IE G + H + +PIP + Q +I + A + Sbjct: 25 YNKYLLSVLRSVKIQKQIEQTSVGDVIPHFKKSFFDQLLIPIPSMEIQKIIGDYYFAFSE 84 Query: 180 RIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA 239 +I+ + ++ + + E + + + K Sbjct: 85 KIEINKKINDNLERQAQLLFKSWFVDFEPFNGTMPSEWEVVPFEKIVDFQNGYAFKS--K 142 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 + + + G I + S ++ G+I+ D++ Sbjct: 143 ELLNEPSSDCYQVFKQGHIARGGGFIPDGTKSWYPKRLASKLGKFVLKKGDILMAMTDMK 202 Query: 300 NDKRSLRSAQV---MERGIITS--AYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMG-SG 352 ++ L + + I+ + + + +L+ S D + SG Sbjct: 203 DNVAILGNTAIMPIDNEYIVNQRVGLLRTNGYKGITYPFIYLLTNSKDFLIDLRSRANSG 262 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI----DVLVEKIEQSIVLLKERRS 408 ++ +L ++K ++P +N + I + + L + R Sbjct: 263 VQVNLSSAEIKASRTILPS--------EKVNTAFSEITLPMFEAIISNQLENQRLAQLRD 314 Query: 409 SFIAAAVTGQIDLRG 423 + + ++G+ID+ Sbjct: 315 TLLPRLMSGEIDVSD 329 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 37/102 (36%), Gaps = 5/102 (4%) Query: 307 SAQVMERGIITSAYMAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKR 364 ++ I + + YL ++RS + K G + K + Sbjct: 2 VPDPIDFCIAQDMVALRVNDAKVYNKYLLSVLRSVKIQKQIEQTSVGDVIPHFKKSFFDQ 61 Query: 365 LPVLVPPIKEQFDITNVINVETARID---VLVEKIEQSIVLL 403 L + +P ++ Q I + + +I+ + + +E+ LL Sbjct: 62 LLIPIPSMEIQKIIGDYYFAFSEKIEINKKINDNLERQAQLL 103 Score = 44.8 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 55/207 (26%), Gaps = 21/207 (10%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNS 70 G +P W+VVP ++ G +S + + G G + Sbjct: 116 GTMPSEWEVVPFEKIVDFQNGYAFKSKELLNEPSSDCYQVFKQGHIARGGGFIPDGTKSW 175 Query: 71 RQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQ---------PK 117 + KG IL AI+ + + +++V Q K Sbjct: 176 YPKRLASKLGKFVLKKGDILMAMTDMKDNVAILGNTAIMPIDNEYIVNQRVGLLRTNGYK 235 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + S D + + + I +P E + Sbjct: 236 GITYPFIYLLTNSKDFLIDLRSRANSGVQVNLSSAEIKASRTILPSEKVNTAFSEITLPM 295 Query: 178 TVRIDTLITERIRFIELLKEKKQALVS 204 I + E R +L L+S Sbjct: 296 FEAIISNQLENQRLAQLRDTLLPRLMS 322 >gi|237742976|ref|ZP_04573457.1| restriction modification system DNA specificity subunit [Fusobacterium sp. 7_1] gi|229433638|gb|EEO43850.1| restriction modification system DNA specificity subunit [Fusobacterium sp. 7_1] Length = 337 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 51/365 (13%), Positives = 109/365 (29%), Gaps = 35/365 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + +K + ++ + + GKY + Q+ ++ Sbjct: 2 EYIKVKDILEFKKKSKIKASEGL----------KIGKYNFYTSSREQNKFLDYYEYSNEA 51 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---QGWLLSIDVTQRIEAICE 142 ++ G A I G S ++ + + + IE Sbjct: 52 LIIG----TGGNANIHHSYGKFSVSTDCFVLENKANKFFLLEYIYKYLLKNIHIIENGFR 107 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA + H + + NI +PI L +Q +K+I IDT I + + L ++L Sbjct: 108 GAGLKHISKEYLENIKIPIISLEKQ----KKLIKNLKNIDTFIDKNKQIKNELNFLNKSL 163 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 + + N K ++ + K +++ Sbjct: 164 FTRMFGDIRNNSFNWKQVKLQ--------DVCSSIVRGPFGSSLKKEFFVKNGYKVYEQK 215 Query: 263 NIIQKLETRNMGLKPESYET---YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 N I++ E G+I+ + + E+GII A Sbjct: 216 NAIKQSANLGEYYIDEKKFKELQRFECKVGDIIMSCSGTVGKL--FQLPENSEKGIINQA 273 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + +L + GSG++ +K++ + +PPI+ Q Sbjct: 274 LCKFSLNNKIKST-YFLKYLEKVIGNIELNGSGIKNISSVSYIKKIDINLPPIELQNKFA 332 Query: 380 NVINV 384 + Sbjct: 333 ERVEK 337 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 42/149 (28%), Gaps = 8/149 (5%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G I K + + Y ++ N S V + Sbjct: 23 GLKIGKYNFYTSSREQNKFLDYYEYSNEALIIGTGGNANIHHSYGKFSVSTDCFVLEN-- 80 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + +L+++ + + + + E ++ + + + +++Q Sbjct: 81 KANKFFLLEYIYKYLLKNIHIIE--NGFRGAGLKHISKEYLENIKIPIISLEKQKK---- 134 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSF 410 + ID ++K +Q L S Sbjct: 135 LIKNLKNIDTFIDKNKQIKNELNFLNKSL 163 >gi|317056089|ref|YP_004104556.1| restriction modification system DNA specificity domain-containing protein [Ruminococcus albus 7] gi|315448358|gb|ADU21922.1| restriction modification system DNA specificity domain protein [Ruminococcus albus 7] Length = 167 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 55/158 (34%), Gaps = 13/158 (8%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 G I + RN+ S TY++V + + + GI++ A Sbjct: 16 GQGTIPRDESDRNISYNKASIPTYKLVKENDFIMHLRPFE-----WGLEIATREGIVSPA 70 Query: 320 YMAVKPHGIDSTYLA-WLMRSYDLC-KVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQ 375 Y ++ + RS + + G+R +S+ +D L + P I EQ Sbjct: 71 YTILRNKVELVPEFYRYYFRSSSFIVEKLTGITEGIRDGRSINMDDFWLLEIPYPSIPEQ 130 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I ++ I+ ++ + + +K + + Sbjct: 131 RKIGQFMD----LINRQIQIEKDKLQAIKLVKKGLLQQ 164 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 55/164 (33%), Gaps = 12/164 (7%) Query: 51 IGLEDVESGTGKY----LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGI 106 + L + G G ++ + ++ T + + + L P+ IA +GI Sbjct: 8 LRLTSIIQGQGTIPRDESDRNISYNKASIPTYKLVKENDFIM-HLRPFEWGLEIATREGI 66 Query: 107 CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH---ADWKGIGNIPMPIPP 163 S + +L+ K L + + + + + +P P Sbjct: 67 VSPAYTILRNKVELVPEFYRYYFRSSSFIVEKLTGITEGIRDGRSINMDDFWLLEIPYPS 126 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + EQ I + + I+ I ++ +K K+ L+ + Sbjct: 127 IPEQRKIGQFMD----LINRQIQIEKDKLQAIKLVKKGLLQQMF 166 >gi|260587528|ref|ZP_05853441.1| type I restriction-modification system, S subunit [Blautia hansenii DSM 20583] gi|260541793|gb|EEX22362.1| type I restriction-modification system, S subunit [Blautia hansenii DSM 20583] Length = 297 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 29/244 (11%), Positives = 76/244 (31%), Gaps = 12/244 (4%) Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 I ++ T I + + +L G +K E +P W Sbjct: 12 IRICEKLRYGNTGWILLCKKYSKTFYSLHYEKFADG-----SVKYIEEEIPFELPKGWAW 66 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 F A+ + + + S ++ + + + ++ ++ + Sbjct: 67 TRFSAITINRDSERKPISSSQRTDVAKIYDYYGVSGKIDKIDKYIFDERLLLIGED---- 122 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 +L + + + + A+ YL + + + L K + Sbjct: 123 GANLVTRSKPIAFFAEGQYWVNNHAHCIDATDKFILEYLCFYINAISLEKYV---TGSAQ 179 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + +++ + + +PP EQ ++ +N +D + L + +S + A Sbjct: 180 PKMTQDNMNSILIPLPPYSEQKRMSQRLNEVMYTVDNIEIGKAAIRELASKAKSKILDLA 239 Query: 415 VTGQ 418 + GQ Sbjct: 240 IRGQ 243 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 63/202 (31%), Gaps = 16/202 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W T + I + K G S + D Sbjct: 59 ELPKGWAWTRFSAI-------TINRDSERKPISSSQ-RTDVAKIYDYYGVSGKIDKIDKY 110 Query: 80 IFAKGQILYGKLGPYL-----RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 IF + +L G+ G L A A+ + + + + +L Sbjct: 111 IFDERLLLIGEDGANLVTRSKPIAFFAEGQYWVNNHAHCIDAT---DKFILEYLCFYINA 167 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +E G+ + +I +P+PP +EQ + +++ +D + + EL Sbjct: 168 ISLEKYVTGSAQPKMTQDNMNSILIPLPPYSEQKRMSQRLNEVMYTVDNIEIGKAAIREL 227 Query: 195 LKEKKQALVSYIVTKGLNPDVK 216 + K ++ + L P Sbjct: 228 ASKAKSKILDLAIRGQLVPQNP 249 >gi|332673286|gb|AEE70103.1| type I R-M system S protein [Helicobacter pylori 83] Length = 419 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 61/391 (15%), Positives = 125/391 (31%), Gaps = 34/391 (8%) Query: 43 ESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 ++ K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 24 DNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSLNSIIYSSVRPNQRHFGII 83 Query: 102 DF---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKG 153 + + ST F+V+ + + P L ++ +T ++ I C ++ Sbjct: 84 KEIPKNFLVSTAFIVIDIIDLEKLDPNYLYYYITQDKITHYLQRIAECGTSSYPSITPLD 143 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 NI + + PL Q I + +I+ ++L+ + N Sbjct: 144 FLNIKIKLYPLETQQKIARTLSILDQKIENNHKINELLHKILELLYEQYFVRFDFLDENN 203 Query: 214 DVKMKDSG-----IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK- 267 G E L+P+ +EVK LV + + + + Y I K Sbjct: 204 KPYQTSGGKMKFSKELNRLIPNDFEVKTLGELVDIFSGYSFQSNTYSNNKNDYILITNKN 263 Query: 268 --------LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 T N+ P+ Y +++P I+ + S + I+ Sbjct: 264 VQHSLVDLSITTNLLFLPKKLPKYCLLEPTNILITLTGHIGRCALVFS----KNCILNQR 319 Query: 320 YMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFD 377 V P + + L+R+ + +Q+L D ++ + Sbjct: 320 VGVVLPKEKELNPFYYSLIRNPLFSAILQRKAIGSSQQNLSPIDTLKIQIPF-----NHK 374 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRS 408 I + I L+ QS L R Sbjct: 375 IIKHYSKTCENIIKLLVSNMQSTQTLTALRD 405 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 69/178 (38%), Gaps = 13/178 (7%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + + I++ + Sbjct: 18 NNYTKEDNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSL---NSIIYSSVR 74 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 + ++ + ++++A++ + +D YL + + + + G+ Sbjct: 75 PNQRHFGIIK-EIPKNFLVSTAFIVIDIIDLEKLDPNYLYYYITQDKITHYLQRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID---VLVEKIEQSIVLLKER 406 S+ D + + + P++ Q I +++ +I+ + E + + + LL E+ Sbjct: 134 SSYPSITPLDFLNIKIKLYPLETQQKIARTLSILDQKIENNHKINELLHKILELLYEQ 191 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 8/158 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKY-LPKDGNSRQS 73 IP ++V + + +G + + D I I ++V+ + + Sbjct: 223 IPNDFEVKTLGELVDIFSGYSFQSNTYSNNKNDYILITNKNVQHSLVDLSITTNLLFLPK 282 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSID 132 + IL G R A++ + I + + V+ PK+ L + + Sbjct: 283 KLPKYCLLEPTNILITLTGHIGRCALVFSKNCILNQRVGVVLPKEKELNPFYYSLIRNPL 342 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 + ++ G++ + I +P + Sbjct: 343 FSAILQRKAIGSSQQNLSPIDTLKIQIPFNHKIIKHYS 380 >gi|303267751|ref|ZP_07353556.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS457] gi|302642716|gb|EFL73058.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS457] Length = 175 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 46/169 (27%), Gaps = 2/169 (1%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 8 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 66 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 67 ILISWSG-TLGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 125 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 M H K NI +P L EQ I ++ + I + L Sbjct: 126 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNLL 174 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 15/139 (10%), Positives = 41/139 (29%), Gaps = 10/139 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 45 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 99 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 100 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELD---- 155 Query: 388 RIDVLVEKIEQSIVLLKER 406 + L+ + ++ + L Sbjct: 156 LLSKLILRRQEQLEELNLL 174 >gi|86150444|ref|ZP_01068669.1| dna methylase-type I restriction-modification system [Campylobacter jejuni subsp. jejuni CF93-6] gi|85839039|gb|EAQ56303.1| dna methylase-type I restriction-modification system [Campylobacter jejuni subsp. jejuni CF93-6] Length = 471 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 53/419 (12%), Positives = 128/419 (30%), Gaps = 53/419 (12%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV-SIFAKGQILY 88 + K N+ + + ++ +Y+ + ++ S I K +L Sbjct: 56 LGDNMKFNSRYSQPKYDE-----TSKIKVINSQYIRNEYIDYENAKSGYGKIVPKESVLI 110 Query: 89 GKLG-PYLRKAIIA--DFDGICSTQF--LVLQPKDVLPELLQGWLLSIDV--TQRIEAIC 141 G L + I DFD + +V++ K L L Q I Sbjct: 111 NATGVGTLGRVFINILDFDFSIDSHINVIVVKNKTYLNPYFLAIFLQSYYGQIQIIRYYS 170 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK----- 196 + + +PI P+ Q+ I+ + ++ + E L Sbjct: 171 GTSGQIEIYPRDFNYFKIPIFPMEFQLEIQNLVKDSHKALEESKELYKKAEETLYLELGL 230 Query: 197 -------EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 + + + + +K+S ++ L ++++ K + N Sbjct: 231 DPKNPLQSLLDSKIDHSIKSLNISIRTLKESFLKTGRLDSEYYQSKYEDIEKFIKSYPNG 290 Query: 250 KLIESNILSLSYGNIIQKLETRNMGL-------------------KPESYETYQIVDPGE 290 S+I++ N K + K +IV G+ Sbjct: 291 YDSFSSIINNKDTNFTPKNNENYSYIELANIGNNGNISEPISDLGKNLPTRARRIVSKGD 350 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ I+ +L + + ++ ++++ + + ++ L + +S + Sbjct: 351 VIISSIEGSLSSCALITQE-FDKHLVSTGFFVLNSKLLNGETLLVMFKSQIFQEYLKKFP 409 Query: 351 SGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVET-------ARIDVLVEKIEQSIV 401 SG ++ E++ ++ + Q I I +D K+E+ I Sbjct: 410 SGTILCAINKEELSKILIPKIDSTTQEKIAKYIQESFNLRKKSKQLLDNAKIKVEEQIQ 468 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 61/171 (35%), Gaps = 6/171 (3%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFI 296 + + + N++ + S +I RN + E+ ++ IV ++ Sbjct: 54 EYLGDNMKFNSRYSQPKYDETSKIKVINSQYIRNEYIDYENAKSGYGKIVPKESVLINAT 113 Query: 297 DLQNDKRSLRSAQVMERGIIT--SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL- 353 + R + + I + + + ++ +LA ++SY SG Sbjct: 114 GVGTLGRVFINILDFDFSIDSHINVIVVKNKTYLNPYFLAIFLQSYYGQIQIIRYYSGTS 173 Query: 354 -RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + D + + P++ Q +I N++ ++ E +++ L Sbjct: 174 GQIEIYPRDFNYFKIPIFPMEFQLEIQNLVKDSHKALEESKELYKKAEETL 224 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 3/166 (1%) Query: 41 TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII 100 T ++ ++ YI L ++ + P + T I +KG ++ + L + Sbjct: 306 TPKNNENYSYIELANIGNNGNISEPISDLGKNLPTRARRIVSKGDVIISSIEGSLSSCAL 365 Query: 101 ADFDG---ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 + + ST F VL K + E L S + ++ G + + + + I Sbjct: 366 ITQEFDKHLVSTGFFVLNSKLLNGETLLVMFKSQIFQEYLKKFPSGTILCAINKEELSKI 425 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +P Q I + I ++E+ Q + Sbjct: 426 LIPKIDSTTQEKIAKYIQESFNLRKKSKQLLDNAKIKVEEQIQGKI 471 >gi|325696151|gb|EGD38042.1| type I restriction modification DNA specificity family protein [Streptococcus sanguinis SK160] Length = 193 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 63/179 (35%), Gaps = 12/179 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLP---KDGNSRQS 73 WK V + + G T + K DI +I +D+ + +Y+ ++ Sbjct: 15 SDWKKVKLSELGTIVGGGTPSTKKEEYYGGDIPWITPKDLANFGERYIEHGSRNITLAGL 74 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + S+ I G IL+ P IA + + F + P + L + L Sbjct: 75 ENSSAKILPVGSILFSSRAPI-GYIAIASNNVSTNQGFKSIIPNSDVDS-LFLYYLLKFN 132 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRF 191 +IE + G T + +I + IP + EQ I + A +I+ Sbjct: 133 KDKIENMGSGTTFKEVSASIMKSIEVFIPTEIVEQRKISAILGAIDDKIENNKKINHHL 191 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 16/147 (10%), Positives = 47/147 (31%), Gaps = 2/147 (1%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNM-GLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K + +I ++ ++ E G + + + + I + Sbjct: 37 KKEEYYGGDIPWITPKDLANFGERYIEHGSRNITLAGLENSSAKILPVGSILFSSRAPIG 96 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 A + ++ P+ + + + ++ K+ + + +K + Sbjct: 97 YIAIASNNVSTNQGFKSIIPNSDVDSLFLYYLLKFNKDKIENMGSGTTFKEVSASIMKSI 156 Query: 366 PVLVPP-IKEQFDITNVINVETARIDV 391 V +P I EQ I+ ++ +I+ Sbjct: 157 EVFIPTEIVEQRKISAILGAIDDKIEN 183 >gi|57168922|ref|ZP_00368052.1| type I restriction modification enzyme [Campylobacter coli RM2228] gi|57019758|gb|EAL56444.1| type I restriction modification enzyme [Campylobacter coli RM2228] Length = 1343 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 48/459 (10%), Positives = 122/459 (26%), Gaps = 78/459 (16%) Query: 26 KVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVS- 79 ++V + L G + + + I + ++ + + Sbjct: 892 ELVRLGEVCDLFNGYAFKKTDYVEKSNTLLIRMGNIRPNGEFDAEHKIQYLPDNFNNKYK 951 Query: 80 --IFAKGQILYGKLGPYLRKAII---------ADFDGICSTQF--LVLQPKDVLPELLQG 126 + G ++ I+ + + + + + L + ++ + L+ Sbjct: 952 DYLLNDGDVIIAMTDMGNAMNILGVPTIVKNKNNRNFLLNQRVGKLFNFSEKIIVQYLKY 1011 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK------------- 173 L S +V ++ + G + I + +P+PPL Q I + Sbjct: 1012 ALSSNEVKKQFKLQGYGGLQINLGKTQILSTKIPLPPLEIQKQIVAECEKVEEQYNTLSL 1071 Query: 174 -IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV------- 225 I I ++ + + + K +++ + D + S I+ Sbjct: 1072 SIEEYQKLIKAILQKCGIIEDDQEYKLNSILENLQKLESKLDFNLLFSFIDDFTNARQED 1131 Query: 226 ------------------------GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 G + +RK + NI + Sbjct: 1132 LKKFKEFVKNIKAILGTFSTPPKQGWNKEKLNEIVSIQSGGTPDRKVKEYWNGNINWVKS 1191 Query: 262 GNIIQKLETRNMGLKPESY-----ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + + + +++ + + K + + I Sbjct: 1192 EVCQNCYIYDYQVKEKITELGLQKSSAKLLKKETTLIALVGATIGKIGFLTFESATNQNI 1251 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 T Y + L F +G +K L + +PP++ Q Sbjct: 1252 TGLY---PKNLKILNTKYLYYACMGLYGQFRKLGDFAMA--NSNFIKNLTISLPPLEIQE 1306 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I I + +ID L + L++ + + + Sbjct: 1307 KIVQNIELVEQQIDFL----NLKLEFLEKEKEKILQKYL 1341 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 70/210 (33%), Gaps = 12/210 (5%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + K+S E V L + T+ K+ L+ G + + + Sbjct: 881 DELNPFKNSKFELVRLGEVCDLFNGYAFKKTDYVEKSNTLLIRMGNIRPNGEFDAEHKIQ 940 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQND-----KRSLRSAQVMERGIITSAY--MAVK 324 + + +++ G+++ D+ N ++ + ++ + Sbjct: 941 YLPDNFNNKYKDYLLNDGDVIIAMTDMGNAMNILGVPTIVKNKNNRNFLLNQRVGKLFNF 1000 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I YL + + S ++ K F G GL+ +L + + +PP++ Q I Sbjct: 1001 SEKIIVQYLKYALSSNEVKKQFKLQGYGGLQINLGKTQILSTKIPLPPLEIQKQIVAECE 1060 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + L SI ++ + + Sbjct: 1061 KVEEQYNTL----SLSIEEYQKLIKAILQK 1086 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 65/190 (34%), Gaps = 12/190 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY--LPKDGNSRQSD 74 + W + + +G T + +I ++ E ++ + + Sbjct: 1155 QGWNKEKLNEIVSIQSGGTPDRKVKEYWNGNINWVKSEVCQNCYIYDYQVKEKITELGLQ 1214 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD-VLPELLQGWLLSIDV 133 S+ + K L +G + K F+ + L PK+ + + + + Sbjct: 1215 KSSAKLLKKETTLIALVGATIGKIGFLTFESATNQNITGLYPKNLKILNTKYLYYACMGL 1274 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + + A+ I N+ + +PPL Q I + I +ID L + + Sbjct: 1275 YGQFRKLGD---FAMANSNFIKNLTISLPPLEIQEKIVQNIELVEQQIDFLNLKLEFLEK 1331 Query: 194 LLKEKKQALV 203 ++ Q + Sbjct: 1332 EKEKILQKYL 1341 >gi|315609172|ref|ZP_07884134.1| type I restriction-modification system S subunit [Prevotella buccae ATCC 33574] gi|315249141|gb|EFU29168.1| type I restriction-modification system S subunit [Prevotella buccae ATCC 33574] Length = 254 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 67/215 (31%), Gaps = 20/215 (9%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL-------SLSYGNIIQKLETRNMGL 275 E +P+ W + L+R T + + + + + Sbjct: 2 EIPFEIPESWCFVRLGDICNYLHRGKTPKYGNQKILPIIAQKCNHWNQLYIDRCLFSDID 61 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLR----SAQVMERGIITSAY-MAVKPHGIDS 330 Y+ Q + G+I+ R+ ++ + S + + Sbjct: 62 YILKYKEEQFLQKGDIIINSTGGGTVGRTGYIDDSVFDKFDKFVADSHVTVVRSTKLVSH 121 Query: 331 TYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 Y+ + S + GS + L+ + V +PP++EQ + + Sbjct: 122 RYIYLYLLSPYIQIGIEERCTGSTNQIELRTTTISDYLVPIPPVEEQKRLVKKVESML-P 180 Query: 389 IDVLVEKIEQSIVLLKE-----RRSSFIAAAVTGQ 418 I +K++ ++ L + S + A+ G+ Sbjct: 181 IVTRYQKLQSNLEHLNSTLFPLIKKSILQEAIQGK 215 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 70/216 (32%), Gaps = 19/216 (8%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 IP+ W V + L+ G+T + + I+ I + Y+ + S Sbjct: 6 EIPESWCFVRLGDICNYLHRGKTPKYGNQKILPIIAQKCNHWNQLYIDRCLFSDIDYILK 65 Query: 78 VS---IFAKGQILYGKLGPYLRKAIIADFDGIC---------STQFLVLQPKDVLPELLQ 125 KG I+ G D + S +V K V + Sbjct: 66 YKEEQFLQKGDIIINSTGGGTVGRTGYIDDSVFDKFDKFVADSHVTVVRSTKLVSHRYIY 125 Query: 126 GWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +LLS + IE C G+T I + +PIPP+ EQ + +K+ + + Sbjct: 126 LYLLSPYIQIGIEERCTGSTNQIELRTTTISDYLVPIPPVEEQKRLVKKVESMLPIVTRY 185 Query: 185 ITERIRFIELLKEKK----QALVSYIVTKGLNPDVK 216 + L ++++ + L P Sbjct: 186 QKLQSNLEHLNSTLFPLIKKSILQEAIQGKLVPQDP 221 >gi|317012656|gb|ADU83264.1| type I restriction-modification methylase [Helicobacter pylori Lithuania75] Length = 235 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 18/144 (12%), Positives = 42/144 (29%), Gaps = 6/144 (4%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + N G+ Y D +I+ + + Y Sbjct: 41 YGEYPVMNGGIHASGYWNEYNTDYPKIIISQGGAS---AGYVNYMTSKFWAGAHCYAIEL 97 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + + + + +L D++ L + +PP++ Q +I +++ Sbjct: 98 NSEKLNYKFLYYFLKNSQTILMKSQFGAGIPALNKADIETLTIPIPPLEIQQEIVKILDA 157 Query: 385 ETARIDVLVEKIEQSIVLLKERRS 408 T L ++ LK R+ Sbjct: 158 FTELNTELNTELNTE---LKARKK 178 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 54/188 (28%), Gaps = 10/188 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + + G+ + Y + G + N +D Sbjct: 13 PKGVEFRKLGEVINIFKGKQLNKELLLDYGEYPVMNGGI--HASGYWNEYNTDYPK---- 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I+ + G ++ + + + Sbjct: 67 ----IIISQGGASAGYVNYMTSKFWAGAHCYAIELNSEKLNYKFLYYFLKNSQTILMKSQ 122 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 GA + + I + +PIPPL Q I + + A T L TE ++ K++ Q Sbjct: 123 FGAGIPALNKADIETLTIPIPPLEIQQEIVKILDAFTELNTELNTELNTELKARKKQYQY 182 Query: 202 LVSYIVTK 209 + ++ Sbjct: 183 YQNMLLDF 190 >gi|268599119|ref|ZP_06133286.1| restriction endonuclease S [Neisseria gonorrhoeae MS11] gi|268601470|ref|ZP_06135637.1| restriction endonuclease S [Neisseria gonorrhoeae PID18] gi|268684425|ref|ZP_06151287.1| restriction endonuclease S [Neisseria gonorrhoeae SK-92-679] gi|268583250|gb|EEZ47926.1| restriction endonuclease S [Neisseria gonorrhoeae MS11] gi|268585601|gb|EEZ50277.1| restriction endonuclease S [Neisseria gonorrhoeae PID18] gi|268624709|gb|EEZ57109.1| restriction endonuclease S [Neisseria gonorrhoeae SK-92-679] Length = 375 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 56/376 (14%), Positives = 118/376 (31%), Gaps = 34/376 (9%) Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK-------AIIADFDGICSTQFLVL 114 ++ + + + F G L K+ P L + ST+F+VL Sbjct: 12 EFQRQITGYEIKAFNGGAKFRNGDTLLAKITPCLENGKTAFVDILDDGEVAFGSTEFIVL 71 Query: 115 QPKD-VLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIRE 172 + K+ PE L + +S D +R EG + + + + +PIP Q I Sbjct: 72 RAKNETNPEFLYYFAISPDFRKRAIECMEGTSGRQRVNENALKTLELPIPEPQIQQSIAA 131 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVP 229 + +D I + L+E + L Y + PD K SG + V Sbjct: 132 VL----SALDKKIALNKQINARLEEMAKTLYDYWFVQFDFPDANGKPYKSSGGDMVFDET 187 Query: 230 DHWEVKPFFALV--TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 E+ + + K + + ++ + + + I++ Sbjct: 188 LKREIPKGWGSIELQSCLAKIPNTTKILNKDIKDFGKYPVVDQSQDFICGFTNDEKSILN 247 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 P + F D R ++ + + + YL + + Sbjct: 248 PQDAHIIFGD---HTRIVKLVNFQYARGADGTQVILSNNERMPNYLFYQI-----INQID 299 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI-DVLVEKIEQSIVLLKER 406 G + + +K +++P + N ++ + L + L + Sbjct: 300 LSSYGYARHF--KFLKEFKIILPSKDISQKYNEIANTFFVKVRNNLKQNH-----HLTQL 352 Query: 407 RSSFIAAAVTGQIDLR 422 R + + GQ+ +R Sbjct: 353 RDFLLPMLMNGQVSVR 368 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 57/153 (37%), Gaps = 10/153 (6%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII----TSA 319 ++++ + + G + +++ G+ + I + +++ G + T Sbjct: 9 MLKEFQRQITGYEIKAFNGGAKFRNGDTLLAKITPCLENGKTAFVDILDDGEVAFGSTEF 68 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + + +L + S D K G+ RQ + +K L + +P + Q Sbjct: 69 IVLRAKNETNPEFLYYFAISPDFRKRAIECMEGTSGRQRVNENALKTLELPIPEPQIQQS 128 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 I V++ +D + +Q L+E + Sbjct: 129 IAAVLSA----LDKKIALNKQINARLEEMAKTL 157 >gi|312875148|ref|ZP_07735162.1| conserved hypothetical protein [Lactobacillus iners LEAF 2053A-b] gi|325913058|ref|ZP_08175430.1| hypothetical protein HMPREF0523_0355 [Lactobacillus iners UPII 60-B] gi|311089326|gb|EFQ47756.1| conserved hypothetical protein [Lactobacillus iners LEAF 2053A-b] gi|325477640|gb|EGC80780.1| hypothetical protein HMPREF0523_0355 [Lactobacillus iners UPII 60-B] Length = 227 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 68/208 (32%), Gaps = 16/208 (7%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTK------LIESNILSLSYGNIIQKLETRNMGLKPES 279 G+ P + P L + + T + I + +I+ + Sbjct: 19 GIQPSEMQFIPLQELCKVVTKGTTPTTLGKSFTSTGINFIKAESILDNHSIDSSKFAFID 78 Query: 280 YE-----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E ++ +IVF + ++ + A + + YL Sbjct: 79 EETNALLKRSVIKANDIVFTIAGTLGRFAMVDNSVLPANTNQAVAIIRPDETKVTPAYLY 138 Query: 335 WLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + + ++ +L +K LP+ V +K N + + + L+ Sbjct: 139 SFFIGNWHNEYYSKRIQQAVQANLSLTTIKSLPIAV--LK--NTTMNNYDKLVSPLFALM 194 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + E+ L + R + + ++G++D+ Sbjct: 195 KSNEEENRRLSKLRDTLLPRLMSGELDV 222 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 23/200 (11%), Positives = 57/200 (28%), Gaps = 9/200 (4%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P + +P++ K+ T T+ + I +I E + K + Sbjct: 22 PSEMQFIPLQELCKVVTKGTTPTTLGKSFTSTGINFIKAESILDNHSIDSSKFAFIDEET 81 Query: 75 TS--TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + S+ I++ G R A++ + +T V + ++ +L S Sbjct: 82 NALLKRSVIKANDIVFTIAGTLGRFAMVDNSVLPANTNQAVAIIRPDETKVTPAYLYSFF 141 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + +P + + L+ Sbjct: 142 IGNWHNEYYSKRIQQAVQANLSLTTIKSLPIAVLKNTTMNNYDKLVSPLFALMKSNEEEN 201 Query: 193 ELLKEKKQALVSYIVTKGLN 212 L + + L+ +++ L+ Sbjct: 202 RRLSKLRDTLLPRLMSGELD 221 >gi|303269976|ref|ZP_07355710.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS458] gi|302640493|gb|EFL70906.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS458] Length = 197 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 50/178 (28%), Gaps = 2/178 (1%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 8 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 66 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 67 ILISWSG-TLGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 125 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 M H K NI +P L EQ I ++ + I + L+K + + Sbjct: 126 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNLLVKSQFACEI 183 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 42/142 (29%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 45 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 99 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 100 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELD---- 155 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 156 LLSKLILRRQEQLEELNLLVKS 177 >gi|298253898|ref|ZP_06977485.1| restriction endonuclease S subunit [Gardnerella vaginalis 5-1] gi|297532041|gb|EFH71016.1| restriction endonuclease S subunit [Gardnerella vaginalis 5-1] Length = 380 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 59/385 (15%), Positives = 116/385 (30%), Gaps = 52/385 (13%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLED-VESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + +ED + G + +DTS ++F K + Sbjct: 16 KLGELIEQRREKN---------CNIEDLIIRGVSREGFIKPKQIDADTSIYNVFYKKDFV 66 Query: 88 YGKLGPYLRKAIIA--DFDGICSTQF---LVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 + L + ICS+ + V + +LPE L + + +R Sbjct: 67 FNPARMELNSIALNLNFEKAICSSLYEVFYVTRTDVLLPEYLNLIIKRDEFARRCWFEAI 126 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+ ++ + + +PPLA Q A Sbjct: 127 GSARNYFRVANLSEFYIDLPPLAIQQKYVNVYNAMVAN---------------------- 164 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 +GL+ D+ IE + + + ++N + + + Sbjct: 165 -QKAYERGLDDLKLTCDAYIEDL----STCDWHKIGNYIKRNRKRNQEKKFTKAGVKGFN 219 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N L M L T++I+ + V+ K+ E I++ AY + Sbjct: 220 N--DGLFIEPMRLFSGDISTFKIITKNDFVYNSRINSTIKKLSIVINEAEDVIVSPAYES 277 Query: 323 VKPH------GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 YL S+ +F + GS F+D+ + + +P EQ Sbjct: 278 FYIEKGKELLYPFYLYLLLQRESFARKVLFNSFGSSTIV-FGFDDLSEIEIPIPSFSEQV 336 Query: 377 DITNVINVETARIDVLVEKIEQSIV 401 I N + + EK++ I Sbjct: 337 AIAN-LYKVYKERWSINEKLKAQIK 360 >gi|254493840|ref|ZP_05107011.1| restriction endonuclease S [Neisseria gonorrhoeae 1291] gi|268603803|ref|ZP_06137970.1| restriction endonuclease S [Neisseria gonorrhoeae PID1] gi|268682271|ref|ZP_06149133.1| restriction endonuclease S [Neisseria gonorrhoeae PID332] gi|226512880|gb|EEH62225.1| restriction endonuclease S [Neisseria gonorrhoeae 1291] gi|268587934|gb|EEZ52610.1| restriction endonuclease S [Neisseria gonorrhoeae PID1] gi|268622555|gb|EEZ54955.1| restriction endonuclease S [Neisseria gonorrhoeae PID332] Length = 375 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 56/376 (14%), Positives = 118/376 (31%), Gaps = 34/376 (9%) Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK-------AIIADFDGICSTQFLVL 114 ++ + + + F G L K+ P L + ST+F+VL Sbjct: 12 EFQRQITGYEIKAFNGGAKFRNGDTLLAKITPCLENGKTAFVDILDDGEVAFGSTEFIVL 71 Query: 115 QPKD-VLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIRE 172 + K+ PE L + +S D +R EG + + + + +PIP Q I Sbjct: 72 RAKNETNPEFLYYFAISPDFRKRAIECMEGTSGRQRVNENALKTLELPIPEPQIQQSIAA 131 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVP 229 + +D I + L+E + L Y + PD K SG + V Sbjct: 132 VL----SALDKKIALNKQINTRLEEMAKTLYDYWFVQFDFPDANGKPYKSSGGDMVFDET 187 Query: 230 DHWEVKPFFALV--TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 E+ + + K + + ++ + + + I++ Sbjct: 188 LKREIPKGWGSIELQSCLAKIPNTTKILNKDIKDFGKYPVVDQSQDFICGFTNDEKSILN 247 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 P + F D R ++ + + + YL + + Sbjct: 248 PQDAHIIFGD---HTRIVKLVNFQYARGADGTQVILSNNERMPNYLFYQI-----INQID 299 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI-DVLVEKIEQSIVLLKER 406 G + + +K +++P + N ++ + L + L + Sbjct: 300 LSSYGYARHF--KFLKEFKIILPSKDISQKYNEIANTFFVKVRNNLKQNH-----HLTQL 352 Query: 407 RSSFIAAAVTGQIDLR 422 R + + GQ+ +R Sbjct: 353 RDFLLPMLMNGQVSVR 368 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 57/153 (37%), Gaps = 10/153 (6%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII----TSA 319 ++++ + + G + +++ G+ + I + +++ G + T Sbjct: 9 MLKEFQRQITGYEIKAFNGGAKFRNGDTLLAKITPCLENGKTAFVDILDDGEVAFGSTEF 68 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + + +L + S D K G+ RQ + +K L + +P + Q Sbjct: 69 IVLRAKNETNPEFLYYFAISPDFRKRAIECMEGTSGRQRVNENALKTLELPIPEPQIQQS 128 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 I V++ +D + +Q L+E + Sbjct: 129 IAAVLSA----LDKKIALNKQINTRLEEMAKTL 157 >gi|290953273|ref|ZP_06557894.1| type I restriction-modification system, subunit S [Francisella tularensis subsp. holarctica URFT1] gi|295313479|ref|ZP_06804075.1| type I restriction-modification system, subunit S [Francisella tularensis subsp. holarctica URFT1] Length = 225 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 9/61 (14%), Positives = 23/61 (37%) Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + + + + + + +PP+ EQ I ++ +D +E +Q+I Sbjct: 1 MNNLHGVGMKHITKGKFENIQIPLPPLAEQKCIVAKLDSLFENVDKAIELHQQNITNANT 60 Query: 406 R 406 Sbjct: 61 L 61 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 70/240 (29%), Gaps = 17/240 (7%) Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G M H NI +P+PPLAEQ I K+ + +D I + I Sbjct: 1 MNNLHGVGMKHITKGKFENIQIPLPPLAEQKCIVAKLDSLFENVDKAIELHQQNITNANT 60 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + K K+ + + + + + + + Sbjct: 61 LMASTLDKTFKKLEGEYSKIALLDVMKI-----------SNKTLVPDDNQKYNYVGLENI 109 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + G +I ET+ +K E G +++ + +K + I Sbjct: 110 EGNTGRLIDFCETQGKEIKSSKVE----FKKGIVLYGKLRPYLNKVWFSEFDDVATTEIL 165 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK--RLPVLVPPIKEQ 375 Y + + S L +V L +K + +PP+ Q Sbjct: 166 PFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNYSGSRIPRLTTAFLKSEEAYIPLPPLPIQ 225 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 4/125 (3%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + K++ + + Y+GLE++E TG+ + + S+ F KG +LY Sbjct: 82 LLDVMKISNKTLVPDDNQKYNYVGLENIEGNTGRLIDFCETQGKEIKSSKVEFKKGIVLY 141 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEAICEGAT 145 GKL PYL K ++FD + +T+ L P D ++ + LS QR+ G+ Sbjct: 142 GKLRPYLNKVWFSEFDDVATTEILPFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNYSGSR 201 Query: 146 MSHAD 150 + Sbjct: 202 IPRLT 206 >gi|257458639|ref|ZP_05623773.1| type I restriction-modification system, S subunit [Treponema vincentii ATCC 35580] gi|257443961|gb|EEV19070.1| type I restriction-modification system, S subunit [Treponema vincentii ATCC 35580] Length = 258 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 61/201 (30%), Gaps = 9/201 (4%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 KD E VP+ W ++ + + I+ + + + G++ Sbjct: 14 KDIEDELPFAVPEGWAWCRLPNILISIFAGG----DRPIICEKTQSDKCNIPIYSNGIEN 69 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + + I + I+ + ID +L Sbjct: 70 NGLYGFTNKPVVNVSSITISARGTIGFSCIRYEPFVPIVRLITIIPFTKYIDLVFLKIAF 129 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + +F L +K+ + +PPI EQ I I +ID+L Sbjct: 130 DT-----LFSFSEGSSIPQLTVPTIKQFLIPLPPIAEQKRIVTAIETIFTQIDILETNKA 184 Query: 398 QSIVLLKERRSSFIAAAVTGQ 418 +K+ +S + A+ G+ Sbjct: 185 DLQTAVKQAKSKILDLAIHGK 205 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 58/197 (29%), Gaps = 10/197 (5%) Query: 21 IPKHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W + + G + II + + Y N+ + Sbjct: 24 VPEGWAWCRLPNILISIFAGG----DRPIICEKTQSDKCNIPIYSNGIENNGLYGFTNKP 79 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + I G I + ++ + + + Sbjct: 80 VVNVSSITISARGTIGFSCIRYEPFVPIVRLITIIPFTKYIDLVFLKIAFDTLF-----S 134 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG+++ I +P+PP+AEQ I I +ID L T + +K+ K Sbjct: 135 FSEGSSIPQLTVPTIKQFLIPLPPIAEQKRIVTAIETIFTQIDILETNKADLQTAVKQAK 194 Query: 200 QALVSYIVTKGLNPDVK 216 ++ + L P Sbjct: 195 SKILDLAIHGKLVPQDP 211 >gi|257467463|ref|ZP_05631774.1| putative type I restriction enzyme [Fusobacterium gonidiaformans ATCC 25563] gi|315918588|ref|ZP_07914828.1| type I restriction-modification system specificity subunit [Fusobacterium gonidiaformans ATCC 25563] gi|313692463|gb|EFS29298.1| type I restriction-modification system specificity subunit [Fusobacterium gonidiaformans ATCC 25563] Length = 227 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 17/170 (10%), Positives = 48/170 (28%), Gaps = 6/170 (3%) Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + ++ + ++ + ++ GEI+ + + L Sbjct: 63 YQEPNYAYFVRNTDLKSGTFEVFVDEHSYNFLSKSVLYGGEIIISNVGDVG-RVFLCPKL 121 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 + + + YL + + + G + D K LP+ Sbjct: 122 NKPMTLGNNIILLRPEQDNLQYYLYIWFKWLYGQSLIQGIKGGSAQPKFNKTDFKNLPIY 181 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +PP + + L+ + L R++ + + G+ Sbjct: 182 LPPDDLLQRF----HQSVQPMFELIAENIVENQRLSALRNTLLPKLMNGE 227 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 55/159 (34%), Gaps = 6/159 (3%) Query: 49 IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGI 106 ++ D++SGT + + + + S+ G+I+ +G R + + Sbjct: 70 YFVRNTDLKSGTFEVFVDEH---SYNFLSKSVLYGGEIIISNVGDVGRVFLCPKLNKPMT 126 Query: 107 CSTQFLVLQPKDVLPELL-QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 ++L+P+ + W + I+ I G+ + N+P+ +PP Sbjct: 127 LGNNIILLRPEQDNLQYYLYIWFKWLYGQSLIQGIKGGSAQPKFNKTDFKNLPIYLPPDD 186 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + I I E R L L++ Sbjct: 187 LLQRFHQSVQPMFELIAENIVENQRLSALRNTLLPKLMN 225 >gi|239828718|ref|YP_002951341.1| N-6 DNA methylase [Geobacillus sp. WCH70] gi|239809011|gb|ACS26075.1| N-6 DNA methylase [Geobacillus sp. WCH70] Length = 629 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 51/155 (32%), Gaps = 6/155 (3%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + ++ K S ++ G+++ K ++ + + Sbjct: 475 DDLSSIRFKRNSRIDMYLLRKGDVIVSNRGTTI-KVAVVPENEGNLILSHNFLGIRCKDD 533 Query: 328 IDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ID YL + S + ++ +D+K +PV + + EQ I N I Sbjct: 534 IDPYYLKAYLESPVGMYYLINSQVGTNILTINPKDLKEIPVKLTSLDEQRKIANEIREAV 593 Query: 387 ARIDVLVEKIEQSIV--LLKERRSSFIAAAVTGQI 419 + + EQ LLK I++ +I Sbjct: 594 ITYKEKIRQAEQERNASLLKAYEKMGISSLF--KI 626 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 67/179 (37%), Gaps = 11/179 (6%) Query: 26 KVVPIKRFTK-LNTGRTSESGK------DIIYIGLEDVESGTGKYLP-KDGNSRQSDTST 77 + P+K+ T+ + G S + + L DV+ G +++ Sbjct: 430 NIYPLKKLTEKIFRGMNVSSNSIEEGTGEFKLVKLSDVQDGEILLDDLSSIRFKRNSRID 489 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQG-WLLSIDVT 134 + + KG ++ G ++ A++ + +G I S FL ++ KD + +L S Sbjct: 490 MYLLRKGDVIVSNRGTTIKVAVVPENEGNLILSHNFLGIRCKDDIDPYYLKAYLESPVGM 549 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G + + K + IP+ + L EQ I +I + I + + Sbjct: 550 YYLINSQVGTNILTINPKDLKEIPVKLTSLDEQRKIANEIREAVITYKEKIRQAEQERN 608 >gi|269115097|ref|YP_003302860.1| Type I restriction enzyme specificity protein [Mycoplasma hominis] gi|268322722|emb|CAX37457.1| Type I restriction enzyme specificity protein [Mycoplasma hominis ATCC 23114] Length = 404 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 36/344 (10%), Positives = 98/344 (28%), Gaps = 10/344 (2%) Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ-FLVLQPKDVLPELLQGWLLSID 132 F + IL G + I + + +++L+ + + L + Sbjct: 63 KLIDEYAFDEMAILISGNGSKVGHVNIYNGKFNAYQRTYILLKINHFVLWKYAYFYLKSN 122 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I + + + + N +PIP ++ Q I E + + + I Sbjct: 123 LKNYINVYKLDSGIPYITLPMLQNFVIPIPHISIQNKIVEILDKLETYTKDIQSGLPLEI 182 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + K++ + ++ + + I + + + + + E+ K Sbjct: 183 DQRKKQYEYYRDKLLDFKDLAGGVLSKNYILLLNELYEKIINIIEYKRINEVTINLKKET 242 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 L+ G + + Y G + Sbjct: 243 LEKNKLLNNGKYQVINSGKEIYGTYNQYNN-----EGNAITIAARGAYAGFINYMNDKFW 297 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 G + Y + + Y+ + ++ + + G +L D+ + +P I Sbjct: 298 AGGLCYPYRSKNETSFLTKYIYYWLKYNEEKISNELVAKGSIPALNKIDIDNFFIPIPHI 357 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q I +++ + + I ++ R+ + Sbjct: 358 SIQNKIVEILDKLETYTKDIQSGLPLEIDQRRKQYEYYRNKLLN 401 >gi|332995831|gb|AEF05886.1| type I restriction-modification enzyme, specificity subunit [Alteromonas sp. SN2] Length = 378 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 53/372 (14%), Positives = 122/372 (32%), Gaps = 41/372 (11%) Query: 61 GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD---GICSTQF--LVLQ 115 G ++ + + + + ++ G +Y +L + + S +F L Sbjct: 32 GPFIRETKSGSEISAAKLNKVKAGDFIYSRLFAWQGSFGLVPEVMDGCYVSNEFPLYELD 91 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI---PMPIPPLAEQVLIRE 172 V+PE L W V + +EA C G+T + + +P + +Q I + Sbjct: 92 TSKVIPEYLVYWFGLPHVQKMVEADCSGSTPGTRNRFKEIFFERLDIELPSIEQQKSIVK 151 Query: 173 KIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHW 232 I + I + QA++S K + + Sbjct: 152 SIQLLEQKRSAFI----DLRSTVLADAQAMLSSAFHKII------------------EGA 189 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE-SYETYQIVDPGEI 291 KP + + R+ ++ L + + + + + E ++ V G++ Sbjct: 190 VYKPISEVAPIVRRQIEITVDGEYPELGARSFGKGIFHKPTLIGAELDWQKLYTVHSGDL 249 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVFYAM 349 V I + R + + Y+ P + +LA+ + + + + A Sbjct: 250 VLSNIKAWEGAIAAAGDNDHGR-VGSHRYITCVPAEGVTTANFLAFYLLTQEGIEQVQAA 308 Query: 350 GSGLRQS---LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 G L + ++++ V VP +Q N ++ + + ++ L+ Sbjct: 309 SPGSADRNRTLAMKRLEKIKVPVPDYDKQL----WFNQLQNYVEKIKQAQSENATELEAL 364 Query: 407 RSSFIAAAVTGQ 418 S + A G+ Sbjct: 365 MPSILDKAFKGE 376 >gi|325122266|gb|ADY81789.1| type I restriction-modification system methyltransferase subunit [Acinetobacter calcoaceticus PHEA-2] Length = 1313 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 21/99 (21%), Positives = 36/99 (36%), Gaps = 1/99 (1%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y +Y + PG+I+ + A + + +DS YL + S Sbjct: 530 YNSYMNLQPGDILISRSGTIGKNAIVSEAAAGALAGQGLYVIRPDKNYLDSDYLLAYINS 589 Query: 340 YDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFD 377 F A G Q++ + V +LP+ V P+ Q Sbjct: 590 RACQNWFSAHARGTAIQNINRDTVLKLPIPVLPLPIQRR 628 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 54/169 (31%), Gaps = 14/169 (8%) Query: 30 IKRFTKLNTGRT---------SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + + + GRT + + YI + D+ G + + ++ Sbjct: 477 LSTISSIFLGRTIKAVDLTSAPHNDQAKGYIRISDLAHGKIVRMSRWLKPDA-PYNSYMN 535 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQ----FLVLQPKDVLPELLQGWLLSIDVTQR 136 G IL + G + AI+++ + + + L ++ S Sbjct: 536 LQPGDILISRSGTIGKNAIVSEAAAGALAGQGLYVIRPDKNYLDSDYLLAYINSRACQNW 595 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 A G + + + + +P+P+ PL Q + I T I Sbjct: 596 FSAHARGTAIQNINRDTVLKLPIPVLPLPIQRRAVARYQQSGTDILTFI 644 >gi|319744168|gb|EFV96540.1| type I restriction modification DNA specificity family protein [Streptococcus agalactiae ATCC 13813] Length = 216 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 18/179 (10%), Positives = 48/179 (26%), Gaps = 5/179 (2%) Query: 246 RKNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 +K +I LS + + G + Y+ + I + Sbjct: 42 KKVDDYWNGDIPWLSPKDLSLNPAMFTGRGQNSITELGYKKSSAKLMPRNSILFSSRAPI 101 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + ++ P + + + + + + + +K Sbjct: 102 GYITIAENDISTNQGFKSIIPKPEYPYTFVYELLKQETPSLESSASGSTFKEVSGTHLKN 161 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 + +P I + + + E+ I L E R + ++G+I + Sbjct: 162 HEIRIPSHS---AIIKF-HESVKPLFKTINLNEKEIQKLIEVRDLLLPTLMSGEISVSD 216 >gi|293603339|ref|ZP_06685767.1| type I restriction-modification enzyme [Achromobacter piechaudii ATCC 43553] gi|292818249|gb|EFF77302.1| type I restriction-modification enzyme [Achromobacter piechaudii ATCC 43553] Length = 243 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 86/243 (35%), Gaps = 18/243 (7%) Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 +D LI+E ++ + LK K L+ + K++ S L D W+ + L Sbjct: 3 LDELISEEVQKLSALKIYKNGLMQQLFPHEGEAVPKLRLSKY----LKADDWKKRKVSDL 58 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVFRFIDLQ 299 +T + +E+ + + + + + K + V+P +V + Sbjct: 59 LTRSTKPVDVEVEAAYREIGIRSHGKGIFHKGAVRGKSLGDKRVFWVEPSALVVNIVFAW 118 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYDLCKVF---YAMGSGL 353 ++ E+G+I S + D ++ + + ++ G+G Sbjct: 119 EQ--AIAVTSKAEKGMIASHRFPMYKEKVGKCDVNFIKYFFLTKKGKELLGVASPGGAGR 176 Query: 354 RQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++L + + L P ++EQ +I + D + + I L+ +R + Sbjct: 177 NKTLGQKSFESLEFFTPDCVEEQAEIARCLLSV----DETIAIQTERIDALRSQRKGLMQ 232 Query: 413 AAV 415 Sbjct: 233 HLF 235 >gi|269978350|gb|ACZ55909.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 220 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 49/166 (29%), Gaps = 11/166 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK + ++ ++ G T I + +ED+ + Sbjct: 13 PKGVEFKTLEEVFEIKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRVLKDSIQHITPKA 72 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSI 131 +F K I+ A++ D + + QF L K ++ + Sbjct: 73 LKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQQFTFLSKKANCDLALDMKFFFYQCF 131 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + + + + + D PIPPL Q I + + Sbjct: 132 LLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDQF 177 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 54/185 (29%), Gaps = 12/185 (6%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKL---ETRNMGLKPESYETYQIVDPGEIVF 293 T I +I + + + P++ + ++ I+ Sbjct: 27 IKNGYTPSKNNPEFWKNGTIPWFRMEDIRENGRVLKDSIQHITPKALKGKKLFPKNSIII 86 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-- 351 + L + + +++ K + + + + L + + Sbjct: 87 STTATIGEHALLIVDSLANQQFT---FLSKKANCDLALDMKFFFYQCFLLGEWCKKNTNV 143 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RR 407 S+ K+ +PP++ Q +I +++ + L+ I I K+ R Sbjct: 144 SGFASVDMTAFKKYKFPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQYEYYR 203 Query: 408 SSFIA 412 + Sbjct: 204 EKLLT 208 >gi|323494430|ref|ZP_08099539.1| type I restriction-modification enzyme, specificity subunit [Vibrio brasiliensis LMG 20546] gi|323311360|gb|EGA64515.1| type I restriction-modification enzyme, specificity subunit [Vibrio brasiliensis LMG 20546] Length = 373 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 49/404 (12%), Positives = 120/404 (29%), Gaps = 54/404 (13%) Query: 29 PIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 I+ F L G + + + D + + Q D V I Sbjct: 8 KIRDFCDLVKGNSPTLKTEPGEYPLVVTADFRRSSNDF--------QFDVEAVCIP---- 55 Query: 86 ILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE-AI 140 L G R + + + ++ L + L + + Sbjct: 56 -LVSSTGHGNAAIHRVHYQSGKFALANIMVALIPNNLELCYPKYLYYLLQSSKDHVLVPL 114 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 +G + K I + + +P L Q+ KI +++ + + R I Q Sbjct: 115 MKGTSNVSLKVKDIAEVELYLPTLENQIEAVSKIDEALAKVNEVKSLRHSLIMESNAFLQ 174 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 ++ ++ + + + + + RK I+ L Sbjct: 175 SVFQKVI----------------------EGADYQKMEDVAPVVRRKVEIDIDGEYPELG 212 Query: 261 YGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + ++ V G++V I + + R + + Sbjct: 213 ARSFGKGIFHKPTLNGFELDWQKLYAVHDGDLVISNIKAWEGAIAAAGPKDHGR-VGSHR 271 Query: 320 YMAVKPHG--IDSTYLAWLMRSYDLCKVFYAMGSGLRQS---LKFEDVKRLPVLVPPIKE 374 Y+ P + +L++ + S A G L + ++++ V +P Sbjct: 272 YLTCLPKPGVTTAKFLSFYLLSNQGIAKVQAASPGSADRNRTLAIKRLEKIEVPIPDFDT 331 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 Q + +++ +I + E + L + + A+ G+ Sbjct: 332 QLWFSQLLDGV-EQIKQVQESNRLELEALVP---AILDKAIKGK 371 >gi|326386411|ref|ZP_08208034.1| putative Type I restriction enzyme MjaXP specificity protein [Novosphingobium nitrogenifigens DSM 19370] gi|326209072|gb|EGD59866.1| putative Type I restriction enzyme MjaXP specificity protein [Novosphingobium nitrogenifigens DSM 19370] Length = 255 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 73/203 (35%), Gaps = 12/203 (5%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 HW ++TE +T E +S+ G +I ++E S + Y V PG+ Sbjct: 54 HWREVQLSDVLTEHGEASTGTEEVYSVSVHKG-LINQIEHLGRSFAAASTDHYNRVLPGD 112 Query: 291 IVFRFIDLQNDKRSLRS-AQVMERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFY 347 IV+ + + +QV I++ Y P + L + Sbjct: 113 IVYTKSPTGDFPLGIIKQSQVKHPVIVSPLYGVFTPIRRELGVLLEAHFEAPLAVKNYLN 172 Query: 348 AMGSGLRQS---LKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + ++ + + + +P KEQ I +I +D I++ I L Sbjct: 173 PLVQKGAKNTIAITNKRFLEGKLHLPLDPKEQKAIAAIIETSRRELDA----IDREIAAL 228 Query: 404 KERRSSFIAAAVTGQIDLRGESQ 426 ++ + +TG+ ++ + + Sbjct: 229 TRQKRGLMQKLLTGEWAVQPDLE 251 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 21/55 (38%), Gaps = 4/55 (7%) Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 P+ EQ I ++ ++ L + + R+ VTG+ L G + Sbjct: 2 PLPEQRKIAAILRTWDLGLEKLSALRKAK----ERLRNWLRTQVVTGKRRLPGFA 52 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 16/98 (16%), Positives = 31/98 (31%), Gaps = 8/98 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 HW+ V + G S +++ + + ++L + + +D + Sbjct: 54 HWREVQLSDVLTE-HGEASTGTEEVYSVSVHKGLINQIEHLGRSFAAASTDH--YNRVLP 110 Query: 84 GQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQP 116 G I+Y K I I S + V P Sbjct: 111 GDIVYTKSPTGDFPLGIIKQSQVKHPVIVSPLYGVFTP 148 >gi|315222591|ref|ZP_07864480.1| type I restriction modification DNA specificity domain protein [Streptococcus anginosus F0211] gi|315188277|gb|EFU22003.1| type I restriction modification DNA specificity domain protein [Streptococcus anginosus F0211] Length = 537 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 47/155 (30%), Gaps = 1/155 (0%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 NRK+ + + G + + + + T I+ G+++ Sbjct: 370 NRKDPNGSIGVVNISNIGEYEIDYSSLDHLDEEDRKITNYILQTGDLLIPARGTAIRIAI 429 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVK 363 + + + YL S K+ G ++ ++++ Sbjct: 430 FEEQTYPCIASSNVIVIRATDESLSTIYLKLFFDSPLGRKMLVTRQQGTAVMNISYKELN 489 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + +P I+EQ I E +++ E Sbjct: 490 NIEIPLPSIEEQKSIAEEYTKELEAYKKAIQEAEN 524 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 67/196 (34%), Gaps = 15/196 (7%) Query: 23 KHW------KVVP--IKRFTKLNTGRTSESGKDIIYIGLEDVES---GTGKYLPKDGNSR 71 + W V + + G+ IG+ ++ + Y D Sbjct: 342 EDWIKFQESNVKKQELGTVASIFRGKAINRKDPNGSIGVVNISNIGEYEIDYSSLDHLDE 401 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQ--GW 127 + T I G +L G +R AI + + I S+ +V++ D + + Sbjct: 402 EDRKITNYILQTGDLLIPARGTAIRIAIFEEQTYPCIASSNVIVIRATDESLSTIYLKLF 461 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 S + + +G + + +K + NI +P+P + EQ I E+ E I E Sbjct: 462 FDSPLGRKMLVTRQQGTAVMNISYKELNNIEIPLPSIEEQKSIAEEYTKELEAYKKAIQE 521 Query: 188 RIRFIELLKEKKQALV 203 + QA + Sbjct: 522 AENRWSSTLSRLQARI 537 >gi|195867487|ref|ZP_03079491.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 9 str. ATCC 33175] gi|195660963|gb|EDX54216.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 9 str. ATCC 33175] Length = 362 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 37/383 (9%), Positives = 103/383 (26%), Gaps = 48/383 (12%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---STVSIFAK 83 + + + G T I + ++ G Y + ++ + K Sbjct: 4 IYKLGSLVNIYKGST--------LITKKYIDENQGIYPVISSKTTENGIYGFINRYDYEK 55 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA--IC 141 +I +G + + + V + + +++ + I Sbjct: 56 NKITMSLIGENAGTFFWQEKNFSLTNNACVFISNKNINYNYKYLFITLKKHEYKIKEFIV 115 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ + + + +P + Q I I I+ + +I+ L+ + Sbjct: 116 IGSARPMISSNHLKLVDVNLPSIEIQDAIISIIEPIEKVINNIKNIKIKIESLINKYFDF 175 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L S + + I I S Sbjct: 176 LYSDLEDSNFKKYILGDLFTI----------------------------NRGQIINSKYI 207 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 N I + K Y + F I Q + I ++ Sbjct: 208 YNNIGPYPVVSSNTKNNGIFGYINSYMYDGEFITISADGAYAGTVFLQNGKFSITNVCFI 267 Query: 322 AVKPHG----IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF- 376 +K ++ ++ ++++ + R +++ +K + + +P ++ Q Sbjct: 268 LMKNKDIDFKFNNKFVYYILKKEQEINRLKSQVGSSRPAVREYSLKEIKINLPNMEIQEE 327 Query: 377 --DITNVINVETARIDVLVEKIE 397 I + + + + + + + Sbjct: 328 FSKIVEPLLNLSTKANRIEKILN 350 >gi|157415305|ref|YP_001482561.1| hypothetical protein C8J_0985 [Campylobacter jejuni subsp. jejuni 81116] gi|157386269|gb|ABV52584.1| hypothetical protein C8J_0985 [Campylobacter jejuni subsp. jejuni 81116] gi|307747948|gb|ADN91218.1| Putative uncharacterized protein [Campylobacter jejuni subsp. jejuni M1] gi|315932180|gb|EFV11123.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni 327] Length = 481 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 51/432 (11%), Positives = 120/432 (27%), Gaps = 61/432 (14%) Query: 28 VPIKR-FTKLNTGRTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 I K G + E G I + D+++ + K + Sbjct: 50 KKIGECLLKSQYGISINMNEEGDGIPIYRMNDIDNMLCNFEVKKYALIDKNELQTFRLNY 109 Query: 84 GQILYGKLGPY-----LRKAIIADFDGICSTQFLVLQPKDVLPELLQ---GWLLSIDVTQ 135 G +L+ + Y + + ++ + L + I + Sbjct: 110 GDVLFNRTNSYEFVGRTGIFYNNRENFVFASYLVRLVCNKEILLPEYLTVFLNTHIGKKE 169 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT----LITERIRF 191 ++ + + + I +PI P+ Q+ I+ + ++ Sbjct: 170 IRRRARPSINQANVNPEELKEIKIPIFPMEFQLEIQNLVKDSHKALEESKELYKKAEETL 229 Query: 192 IELLKEKKQALVSYIVTKGLN--------PDVKMKDSGIEWVGLVPDHWEVKPFFALVTE 243 L + + ++ N +K+S ++ L ++++ K Sbjct: 230 YLELGLDPKNPLQSLLDSKTNNPTKSLNISIHTLKESFLKTGRLDSEYYQSKYEDIEKMI 289 Query: 244 LNRK--------------------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 + K + + L L N I+ + E Y Sbjct: 290 RSYKDGFCNLKDLVNDISSGFAFSSDDYQDVGELVLIRINNIKNATLDLSNVIYLKNEAY 349 Query: 284 QI-----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + G+I+ +R ++ + + +S L L+ Sbjct: 350 NLSPKDKIKKGDILISMSGSIGLSCVVRDDIS---AMVNQRILKISIKNFNSDVLVLLLN 406 Query: 339 SYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET-------ARI 389 S+ F +G G++ +L D++ + + Q I I + Sbjct: 407 SFICKMQFERIGTTGGVQTNLSSIDMQNILIPKIDSTTQEKIAKYIQESFNLRKKSKQLL 466 Query: 390 DVLVEKIEQSIV 401 D K+E+ I Sbjct: 467 DNAKIKVEEQIQ 478 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/197 (10%), Positives = 63/197 (31%), Gaps = 6/197 (3%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + MK + + ++ + N++ E + Sbjct: 34 DSFWTMKLIYNNKLNYKKIGECLLKSQYGISINMNE-EGDGIPIYRMNDIDNMLCNFEVK 92 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA---VKPHGI 328 L ++ ++ G+++F + + ++Y+ + Sbjct: 93 KYALIDKNELQTFRLNYGDVLFNRTNSYEFVGRTGIFYNNRENFVFASYLVRLVCNKEIL 152 Query: 329 DSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 YL + ++ K S + ++ E++K + + + P++ Q +I N++ Sbjct: 153 LPEYLTVFLNTHIGKKEIRRRARPSINQANVNPEELKEIKIPIFPMEFQLEIQNLVKDSH 212 Query: 387 ARIDVLVEKIEQSIVLL 403 ++ E +++ L Sbjct: 213 KALEESKELYKKAEETL 229 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 60/185 (32%), Gaps = 9/185 (4%) Query: 28 VPIKRFT-KLNTGRTSESGK-----DIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSI 80 +K +++G S +++ I + ++++ T + + S Sbjct: 297 CNLKDLVNDISSGFAFSSDDYQDVGELVLIRINNIKNATLDLSNVIYLKNEAYNLSPKDK 356 Query: 81 FAKGQILYGKLGPYL-RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 KG IL G + D + + + L + K+ ++L L S + E Sbjct: 357 IKKGDILISMSGSIGLSCVVRDDISAMVNQRILKISIKNFNSDVLVLLLNSFICKMQFER 416 Query: 140 I-CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I G ++ + NI +P Q I + I ++E+ Sbjct: 417 IGTTGGVQTNLSSIDMQNILIPKIDSTTQEKIAKYIQESFNLRKKSKQLLDNAKIKVEEQ 476 Query: 199 KQALV 203 Q + Sbjct: 477 IQGKI 481 >gi|148997025|ref|ZP_01824679.1| type I restriction enzyme EcoEI specificity protein [Streptococcus pneumoniae SP11-BS70] gi|194397487|ref|YP_002037528.1| Type I restriction modification DNA specificity domain [Streptococcus pneumoniae G54] gi|147756725|gb|EDK63765.1| type I restriction enzyme EcoEI specificity protein [Streptococcus pneumoniae SP11-BS70] gi|194357154|gb|ACF55602.1| Type I restriction modification DNA specificity domain [Streptococcus pneumoniae G54] Length = 191 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 49/178 (27%), Gaps = 2/178 (1%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 M H K NI + L EQ I ++ + I + L+K + + Sbjct: 120 MKHLTKKYFDNIMVSYTNLGEQQRIASELDLLSKLILRRQEQLEELNLLVKSQFACEI 177 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 42/142 (29%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIMVSYTNLGEQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 >gi|258513096|ref|YP_003189352.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256634999|dbj|BAI00973.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256638054|dbj|BAI04021.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-03] gi|256641108|dbj|BAI07068.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-07] gi|256644163|dbj|BAI10116.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-22] gi|256647218|dbj|BAI13164.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-26] gi|256650271|dbj|BAI16210.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-32] gi|256653262|dbj|BAI19194.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01-42C] gi|256656315|dbj|BAI22240.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-12] Length = 236 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 51/167 (30%), Gaps = 9/167 (5%) Query: 249 TKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETY---QIVDPGEIVFRFIDLQNDKRS 304 + + S + + +II K+ T + E + +I++ Sbjct: 18 SDYVTSGVPCIMPQDIIDGKISTGKIAYISEENANRLSNFRLAQNDIIYPRRGDITKHAL 77 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVK 363 + S + + + I YL + + + + L V Sbjct: 78 ITSRENGWLCGTGCLRIRLNTSSILPQYLYYYLTLPHVKEWISQNSVGATMPHLNTSLVG 137 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 ++ V P EQ I +++ +D +E ++ L+ + Sbjct: 138 QISVSYPTYDEQHTIASILGS----LDDKIELNRRTNETLEAMARAL 180 >gi|288929353|ref|ZP_06423198.1| putative restriction modification system specificity subunit [Prevotella sp. oral taxon 317 str. F0108] gi|288329455|gb|EFC68041.1| putative restriction modification system specificity subunit [Prevotella sp. oral taxon 317 str. F0108] Length = 388 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 61/407 (14%), Positives = 125/407 (30%), Gaps = 43/407 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W + + + R + + D++ + + + L + I Sbjct: 5 EWVASRLSEYLNESKERNKKGHFNKTDVLSVSGDFGIVNQIELLGRSFAGAS--VLPYHI 62 Query: 81 FAKGQILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G I+Y K PY DGI ST + V KD + S+ Sbjct: 63 VRLGNIVYTKSPLKEYPYGIVKANTGKDGIVSTLYAVYSVKDNANYKFIEYYFSLANRAN 122 Query: 137 IEAIC----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + + P + EQ I + ID I+ + + I Sbjct: 123 RYFKPIVRIGAKHDMKIGNQEVLANQVIFPTVKEQEKIAGFL----SLIDDRISNQNKII 178 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 E LK+ K A++ ++ + +++ D GI GL N + Sbjct: 179 EDLKKLKCAIIENVLNNCHDNKMRLGDVGIYIRGLT----------------YSSNDVVE 222 Query: 253 ESNILSLSYGNIIQK---LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + + NI+ N+ + Q + G+IV + + S Sbjct: 223 QKGTIVMRSNNIVSGGLLDYCNNVVRVNKQILQEQQLQNGDIVICMANGSSALVGKTSFY 282 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM---GSGLRQSLKFEDVKRLP 366 + + + WL ++ + + G+G +L ED+ R+ Sbjct: 283 DGKCLSPITVGAFCGIYRSKMPITKWLFQTNRYHRYIWNSLQGGNGAIANLNGEDILRMS 342 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P I + I + +D+L+E + +++ + Sbjct: 343 FPTPDKST---IGHCI-KLLSSLDLLIENNVSLCSMFSQQKEYLLQQ 385 >gi|307126719|ref|YP_003878750.1| type I restriction enzyme EcoKI specificity protein [Streptococcus pneumoniae 670-6B] gi|306483781|gb|ADM90650.1| type I restriction enzyme EcoKI specificity protein [Streptococcus pneumoniae 670-6B] Length = 324 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 35/91 (38%), Gaps = 5/91 (5%) Query: 333 LAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + + S + +G ++ + L + +PP+ EQ I I ++D Sbjct: 1 MKYYLLSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDE 60 Query: 392 LVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 61 YAESYNRLEQLDKEFPDKLKKSILQYAMQGK 91 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 43/325 (13%), Positives = 100/325 (30%), Gaps = 57/325 (17%) Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ +LLS + R+ G + + + + +PPL+EQ I E I + ++D Sbjct: 1 MKYYLLSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDE 60 Query: 184 LITERIRFIELLKEKK----QALVSYIVTKGLNPDVKMKDS------------------- 220 R +L KE ++++ Y + L +S Sbjct: 61 YAESYNRLEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEG 120 Query: 221 ----------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + G +P +W V + + + K + +I + I Sbjct: 121 KIKKKDLDISIVSQGDDNSYYGNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINN-KGVRI 179 Query: 265 IQKLETRNMGLKPESYETYQ----------IVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I+ + + + Y + +++ G Sbjct: 180 IRGGNIKPLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDG 239 Query: 315 IITSAYMA----VKPHGIDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPV 367 ++ ++ + I S +L + + S K + ++ + L + Sbjct: 240 VVAGGFIFQLTPFESSEIISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLI 299 Query: 368 LVPPIKEQFDITNVINVETARIDVL 392 + P +EQ IT + +++ L Sbjct: 300 PLAPFEEQELITQKVEKLFEKVNQL 324 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 73/182 (40%), Gaps = 17/182 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 142 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 201 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 202 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 261 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 262 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 321 Query: 182 DT 183 + Sbjct: 322 NQ 323 >gi|329955589|ref|ZP_08296497.1| type I restriction modification DNA specificity domain protein [Bacteroides clarus YIT 12056] gi|328525992|gb|EGF53016.1| type I restriction modification DNA specificity domain protein [Bacteroides clarus YIT 12056] Length = 405 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 50/410 (12%), Positives = 108/410 (26%), Gaps = 35/410 (8%) Query: 26 KVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKY-------LPKDGNSRQSDTST 77 K + K+ +G +S + L + + S + Sbjct: 4 KKYKLGDIAKIEISGVDKKSVDGETPVRLCNFVDVYRNWAITQKLSENFMIASAKETEIA 63 Query: 78 VSIFAKGQILYGK----LGPYLRKAIIADFDGICSTQFLVLQPKDVLP----ELLQGWLL 129 KGQ+ K A IAD + + L ++ Sbjct: 64 KCSIHKGQVAITKDSETRDDIGIPAYIADDFDNVLLGYHCALITPNDDVLDGKYLNAFMH 123 Query: 130 SIDVTQRIEAICEGATMSH-ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + + E G+ + + I IP+ +P L Q I + ID I Sbjct: 124 TRYIQKYFENNASGSGQRYTLSNETIFQIPILLPSLEVQKAIGNLL----SNIDRKIELN 179 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + + L++ + L Y + P+ G + + Sbjct: 180 RQINDNLEKMAKQLYDYWFVQFDFPN---------ENGRPYKSSGGAMVWNEKLKREIPK 230 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + L N N G+ P +I ++ + ++ + Sbjct: 231 EWDNCTLEYYLIIKNGRDHKHLGN-GIYPVYGSGGEIRKVDSFIYSGESILMPRKGSLNN 289 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + S ++ S S+ + + ++ Sbjct: 290 IMYVNDAFWSVDTMFYSEMKQPHCAKYVFYSIKDIDFTRWDSGTGVPSMTSSTLYSILLV 349 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P + + ++++K E IV L ++R + + GQ Sbjct: 350 KPDADS----LAKFDEIITPLFLMIKKNEMQIVELTKQRDDLLPLLMNGQ 395 Score = 39.8 bits (91), Expect = 0.85, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 40/133 (30%), Gaps = 20/133 (15%) Query: 10 YKDSG--VQWIG----AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + W IPK W ++ + + GR + G G Y Sbjct: 211 YKSSGGAMVWNEKLKREIPKEWDNCTLEYYLIIKNGRDHK-------------HLGNGIY 257 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + I++ IL + G + D T F + + Sbjct: 258 PVYGSGGEIRKVDS-FIYSGESILMPRKGSLNNIMYVNDAFWSVDTMFYSEMKQPHCAKY 316 Query: 124 LQGWLLSIDVTQR 136 + + ID T+ Sbjct: 317 VFYSIKDIDFTRW 329 >gi|56697573|ref|YP_167941.1| type I restriction-modification system, S subunit [Ruegeria pomeroyi DSS-3] gi|56679310|gb|AAV95976.1| type I restriction-modification system, S subunit [Ruegeria pomeroyi DSS-3] Length = 434 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 55/441 (12%), Positives = 124/441 (28%), Gaps = 53/441 (12%) Query: 27 VVPIKRFTKLNTGRTSESG--------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + I G + + + +++ G + Sbjct: 6 TIRIGDLADGIRGVSYRPEHLQEDFGRDRTVLLRSTNIQDGQLDFTSIQIVPSYLVKPAQ 65 Query: 79 SIFAKGQILYGKLGP----YLRKAIIADFDG---ICSTQFLVLQPKDVLPELLQ-GWLLS 130 S +G ++ + A G V PK Sbjct: 66 S-VGEGDLVVCMSNGSKALVGKAARYKGEYGAPLTVGAFCSVFHPKTESDSAFLRHVFQG 124 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 + I+ I G+ +++ + I + E+ I + + ID I E Sbjct: 125 EQFRRSIDIILSGSAINNLKNSDVEGISIRAHSPTERATIADILD----AIDDAILETDT 180 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSG-IEWVGLVPDHWEVKPFFALVTELNRKNT 249 IE L Q LV + T GL+ +++ + +E + K+ Sbjct: 181 VIEKLLLVHQGLVHDLTTLGLSKSGEIRRADQLEEFHETDLGPLPHSWCVKSIGRMAKDL 240 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV-------------FRFI 296 L + + + ++ L+ N+G T +++D +V F Sbjct: 241 ALGTAARGANDGQDQLRLLKMGNLGWDALDTSTCELIDVDRVVHWKDALLLDGDLLFNTR 300 Query: 297 DLQNDKRSLRSAQVMERGIITSAYM---AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + + ++ + + +D + A M + + +G Sbjct: 301 NTPELVGKTAAYDQDDQRTVCDNNILRIRFPSEEMDGRFAAAYMANGRGKSRLMTLATGT 360 Query: 354 --RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV---LLKERRS 408 ++ + D++ + VPP + R+ V + I + L R Sbjct: 361 TSVAAIYWRDLRDFQLPVPPRE-------EREEIVRRLQVSRDTIRREKESRVKLSNLRE 413 Query: 409 SFIAAAVTGQIDL---RGESQ 426 +TG+ + R ++ Sbjct: 414 GLRDDLLTGRKPVVAIREAAE 434 >gi|259500492|ref|ZP_05743394.1| conserved hypothetical protein [Lactobacillus iners DSM 13335] gi|259168105|gb|EEW52600.1| conserved hypothetical protein [Lactobacillus iners DSM 13335] Length = 227 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 67/208 (32%), Gaps = 16/208 (7%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTK------LIESNILSLSYGNIIQKLETRNMGLKPES 279 G+ P + P L + + T + I + +I+ + Sbjct: 19 GIQPSEMQFIPLQELCKVVTKGTTPTTLGKSFTSTGINFIKAESILDNHSIDSSKFAFID 78 Query: 280 YE-----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E ++ +IVF + ++ + A + + YL Sbjct: 79 EETNALLKRSVIKANDIVFTIAGTLGRFAMVDNSVLPANTNQAVAIIRPDETKVTPAYLY 138 Query: 335 WLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + + ++ +L +K LP+ V +K N + + L+ Sbjct: 139 SFFIGNWHNEYYSKRIQQAVQANLSLTTIKSLPIAV--LK--NTTMNNYEKLVSPLFALM 194 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + E+ L + R + + ++G++D+ Sbjct: 195 KNNEEENRRLSKLRDTLLPRLMSGELDV 222 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 60/196 (30%), Gaps = 13/196 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P + +P++ K+ T T+ + I +I E + K + Sbjct: 22 PSEMQFIPLQELCKVVTKGTTPTTLGKSFTSTGINFIKAESILDNHSIDSSKFAFIDEET 81 Query: 75 TS--TVSIFAKGQILYGKLGPYLRKAIIADFDGICST----QFLVLQPKDVLPELLQGWL 128 + S+ I++ G R A++ + +T + V P L + Sbjct: 82 NALLKRSVIKANDIVFTIAGTLGRFAMVDNSVLPANTNQAVAIIRPDETKVTPAYLYSFF 141 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + + A ++ I ++P+ + + + + E Sbjct: 142 IGNWHNEYYSKRIQQAVQANLSLTTIKSLPIAVLKNTTMNNYEKLVSPLFALMKNNEEEN 201 Query: 189 IRFIELLKEKKQALVS 204 R +L L+S Sbjct: 202 RRLSKLRDTLLPRLMS 217 >gi|229542844|ref|ZP_04431904.1| Restriction endonuclease S subunits-like protein [Bacillus coagulans 36D1] gi|229327264|gb|EEN92939.1| Restriction endonuclease S subunits-like protein [Bacillus coagulans 36D1] Length = 379 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 53/391 (13%), Positives = 112/391 (28%), Gaps = 37/391 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + K + E ++ G + + V++F G Sbjct: 21 WEQRKLGKVVK-THQFRPYLAEPNAEGDFEVIQQGDRPVAGYTNGTPFENYRDVTLF--G 77 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 P I D I S L E + L + Sbjct: 78 DHTVSLYKPTKPFFIATDGVKILSADGL---------EGDFLFSLLERYKPEPQGYKRHF 128 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T I + + + + IT +E +K K A +S Sbjct: 129 T---ILKNQGAWITKNVEEQVKIGAFFKNLDHL-------ITLHQCKLEKMKTLKSAYLS 178 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + K + +G WE + V E N K + L Sbjct: 179 EMFPAEGERVPKRRFAG------FTQAWEQRKLGD-VAEFNPKEELPEIFEYVDLESVVG 231 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + R + + ++ G++ ++ + L + ++ Y ++ Sbjct: 232 TELIAHRKVRKEKAPSRAQRLARKGDLFYQTVRPYQKNNYLFEKPC-NNYVFSTGYAQLR 290 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPI-KEQFDITNVI 382 P D +L L+++ K +G ++ D+ + V VP EQ I Sbjct: 291 P-YGDGYFLLSLVQTEQFVKAVLDRCTGTSYPAINSNDLANMEVYVPSRGDEQILIGR-- 347 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +D L+ ++ + L+ + +++ Sbjct: 348 --LFKSVDHLITLHQRKLEKLQNIKEAYLNE 376 >gi|15617467|ref|NP_258262.1| putative type I S-subunit protein [Lactococcus lactis] gi|15553738|gb|AAL02008.1|AF409136_1 putative type I S-subunit protein [Lactococcus lactis] Length = 217 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 53/158 (33%), Gaps = 9/158 (5%) Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 ++ M Q+ + + + +A ++R I Sbjct: 53 PFYKVSDMNNPGNEVVMMNANNYASDSQLKENKWNPINPQNSGVVFAKVGAAIFLDRKRI 112 Query: 317 TSAYMAVKPHGI----DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPP 371 + + + DS++ + ++ G S DV+ + V++P Sbjct: 113 VDTSFLIDNNMMSYLFDSSWNRYFGKTLFEKLRLSIFAQVGALPSFNGSDVEDIKVMIPE 172 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 EQ I + ++D ++ ++ + LLKE++ + Sbjct: 173 ESEQKMIGD----MFEKLDDIIALHQRKLDLLKEQKKA 206 >gi|15611852|ref|NP_223503.1| type I restrictionenzyme (specificity subunit) [Helicobacter pylori J99] gi|4155365|gb|AAD06377.1| TYPE I RESTRICTIONENZYME (SPECIFICITY SUBUNIT) [Helicobacter pylori J99] Length = 207 Score = 63.7 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 62/180 (34%), Gaps = 9/180 (5%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P + + N+K K+ E + + + G + + Sbjct: 13 PKGVGFRKLGEVCESTNKKTLKISEVSEVKNKGMYPVINSGRELYGYYHDFN------ND 66 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 GE + + + G + Y + + + +L + +++ + + Sbjct: 67 GENITIASRGEYAGFINYFNEKFFAGGLCYPYKVKDTNELLTKFLYFYLKTNETQIMENL 126 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + G +L D++ L + +PP++ Q +I +++ +A L I I K R+ Sbjct: 127 VFRGSIPALNKADIETLTIPIPPLEIQQEIVTILDQFSALTTDLQAGIPAEI---KARKK 183 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 48/164 (29%), Gaps = 15/164 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 PK + + +T + + ++ G+ V + + + + Sbjct: 13 PKGVGFRKLGEVCESTNKKTLKISEVSEVKNKGMYPVINSGRELYGYYHDFNNDGEN--- 69 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQR 136 I G Y + V ++L + L +L + + Sbjct: 70 ------ITIASRGEYAGFINYFNEKFFAGGLCYPYKVKDTNELLTKFLYFYLKTNETQIM 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + G ++ + I + +PIPPL Q I + + Sbjct: 124 ENLVFRG-SIPALNKADIETLTIPIPPLEIQQEIVTILDQFSAL 166 >gi|304409996|ref|ZP_07391615.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS183] gi|307302291|ref|ZP_07582049.1| restriction modification system DNA specificity domain protein [Shewanella baltica BA175] gi|304351405|gb|EFM15804.1| restriction modification system DNA specificity domain protein [Shewanella baltica OS183] gi|306914329|gb|EFN44750.1| restriction modification system DNA specificity domain protein [Shewanella baltica BA175] Length = 373 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 47/132 (35%), Gaps = 2/132 (1%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL-C 343 ++ ++ I Q R + + + + + + +L M+S Sbjct: 240 LLPSKSVLIAMIG-QGKTRGQSAILEIPATTNQNCFAVMPNDTWEPDFLYLWMKSSYQDL 298 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + G + +L + L V P EQ + I IDVL + + ++ + Sbjct: 299 RDLSSDRGGNQSALNGALLNALEVPAPSKPEQQKLVARIQTALTEIDVLEQSSKAALADI 358 Query: 404 KERRSSFIAAAV 415 ++ + +A A Sbjct: 359 EKLPARILAKAF 370 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 21/192 (10%), Positives = 55/192 (28%), Gaps = 10/192 (5%) Query: 28 VPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + +G T G +I ++ +V + ++ ++ + Sbjct: 181 KRLGEHAPTTSGSTPSRGNKQYWQPAEIAWVKTGEVAFAPITATEEAISNLALAECSLKL 240 Query: 81 FAKGQILYGKLGP--YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ-RI 137 +L +G ++ I + + + P D + R Sbjct: 241 LPSKSVLIAMIGQGKTRGQSAILEIPATTNQNCFAVMPNDTWEPDFLYLWMKSSYQDLRD 300 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G S + + + +P P EQ + +I ID L + +++ Sbjct: 301 LSSDRGGNQSALNGALLNALEVPAPSKPEQQKLVARIQTALTEIDVLEQSSKAALADIEK 360 Query: 198 KKQALVSYIVTK 209 +++ Sbjct: 361 LPARILAKAFEN 372 Score = 37.1 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 14/121 (11%), Positives = 38/121 (31%), Gaps = 9/121 (7%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 +I + R ++ + + +L ++S L + Y S Sbjct: 60 YIVFGDHTRIVKFIDFSFVVGADGVRLYKASEKYEPEFLYLFLKSSKLPEDGYGRHS--- 116 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + +K L V ++Q I + + ++ + + + + R+ + A Sbjct: 117 -----KYLKELFVPEISKEKQRQIAARLKAQLGEVETARQAAKVQLSDARLLRTRML-KA 170 Query: 415 V 415 Sbjct: 171 F 171 >gi|319945006|ref|ZP_08019268.1| restriction modification system [Lautropia mirabilis ATCC 51599] gi|319741576|gb|EFV94001.1| restriction modification system [Lautropia mirabilis ATCC 51599] Length = 420 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 53/411 (12%), Positives = 122/411 (29%), Gaps = 47/411 (11%) Query: 46 KDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD- 102 I + + V++ + + ++ +G I+ P I D Sbjct: 9 SGIPVLSAKHVKTDGLVDVQSMRYASTEMYKKWMTVEVQEGDIILTSEAPMGEVFYIQDD 68 Query: 103 FDGICSTQFL--VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMP 160 + + P+ + P+ L WL S + ++I A G+T+ + + + Sbjct: 69 KKYVLGQRVFGLRPNPRLINPKYLAAWLASSEGQRQITARASGSTVQGIRQVELLKLEVD 128 Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY---------IVTKGL 211 +P EQ I + T +I +++ ++ + +G Sbjct: 129 LPSKEEQERIANVRFSLTDKIILNRCINQTLEAMVQAIFKSWFVDFDPVKAKIAAIEQGQ 188 Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 +P + D + L + ES + ++ G ++++ Sbjct: 189 DPLRAAMRAISGKTDAELDQMPREHHDELAATAELFPDAMEESKLGNIPNGWEVKRVGDL 248 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDK----RSLRSAQVMERGIITSAYMAVKPHG 327 ++ ++ V+ + + V +G + S Y P Sbjct: 249 IELAYGKALKSTDRKQGSVPVYGSGGVTGYHNEALVPHGAIIVGRKGTVGSLYWEDGPFF 308 Query: 328 IDSTYLA---------WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 T + + + L E+V RL ++ P I Sbjct: 309 PIDTTFYVKPKVLPMTYCFYAMQTLGLDKMNTDAAVPGLNRENVYRLELVKPSIS----- 363 Query: 379 TNVINVETARIDVLVEKIEQSIVL-------LKERRSSFIAAAVTGQ--ID 420 + D L+ + +++ L E R S + ++G+ ID Sbjct: 364 ------VLSAFDGLIGQTRKAMRANTIASRSLAELRDSLLPKLLSGELAID 408 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 38/134 (28%), Gaps = 12/134 (8%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G IP W+V + +L G+ + D + G+ +P G+ + Sbjct: 233 LGNIPNGWEVKRVGDLIELAYGKA---------LKSTDRKQGS---VPVYGSGGVTGYHN 280 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ G I+ G+ G T F V + Sbjct: 281 EALVPHGAIIVGRKGTVGSLYWEDGPFFPIDTTFYVKPKVLPMTYCFYAMQTLGLDKMNT 340 Query: 138 EAICEGATMSHADW 151 +A G + Sbjct: 341 DAAVPGLNRENVYR 354 >gi|260589480|ref|ZP_05855393.1| putative type I restriction-modification system, specificity determinant [Blautia hansenii DSM 20583] gi|260540048|gb|EEX20617.1| putative type I restriction-modification system, specificity determinant [Blautia hansenii DSM 20583] Length = 414 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 44/356 (12%), Positives = 108/356 (30%), Gaps = 27/356 (7%) Query: 57 ESGTGKYLPKDGNSRQSDTSTVS-IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 E+GT K L + +D + + ++G+I+ G I + Sbjct: 59 ENGTVKILTTSISDLWADEEKTADVLSEGEIVCIPWGGNP-VVQYYKGKFITGDNRIATS 117 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 + + I + G+ + H D + ++ +P+PP+ Q I + Sbjct: 118 LDVKRLSNKYLYYCMQNRLVDISSYYRGSGIKHPDMSKVLDLVIPVPPIEVQSEIVRILD 177 Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVK 235 T L E + ++ + + ++T + + + + P Sbjct: 178 NFTELTAELTAELTAELTARNKQFEYYRTQLLTFS-DEVEMLTLEDVCQIVDCP------ 230 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + K E+ + + N++ + + E + E Sbjct: 231 ----------HTSPKWKENGVPVIRNYNLVNGQIDTSNLSYVDEDEYLTRIKRIEPQEND 280 Query: 296 IDLQNDKRSLRSAQVMERGIITSA----YMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-- 349 I + + + I YL +++ + ++ Sbjct: 281 ILFSREAPIGNVGIIPANFKCCQGQRVVLLRPDQDIIYPRYLMHILQGEIVRNQISSVEG 340 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID-VLVEKIEQSIVLLK 404 + D+++L VP K Q + + ++ A+++ + E + I L K Sbjct: 341 KGATVSNFNISDLRKLKFQVPDKKVQLYLIDKLD-IFAKLNGDIKEGLPAEIKLRK 395 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 61/203 (30%), Gaps = 22/203 (10%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG--------------NIIQKLETRNM 273 P+ P +++ + NT E + Y ++ L T Sbjct: 12 CPEGVAYMPIWSITAWDKKFNTVAKEKQKTIVKYNYFLAADLKKLESENGTVKILTTSIS 71 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 L + +T ++ GEIV + + + I ++ + S Sbjct: 72 DLWADEEKTADVLSEGEIVCIPWGGNPVVQYYKGKFITGDNRIATSLDVKR----LSNKY 127 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + L + + V L + VPPI+ Q +I +++ T L Sbjct: 128 LYYCMQNRLVDISSYYRGSGIKHPDMSKVLDLVIPVPPIEVQSEIVRILDNFTELTAELT 187 Query: 394 EKIEQSIVLLKE----RRSSFIA 412 ++ + + R+ + Sbjct: 188 AELTAELTARNKQFEYYRTQLLT 210 >gi|323491152|ref|ZP_08096340.1| hypothetical protein VIBR0546_04497 [Vibrio brasiliensis LMG 20546] gi|323314617|gb|EGA67693.1| hypothetical protein VIBR0546_04497 [Vibrio brasiliensis LMG 20546] Length = 472 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 64/193 (33%), Gaps = 12/193 (6%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 G EW+ + ++ + K ++ +G L+ + Sbjct: 2 GSEWIDAKLGDYIDSCLGKMLDKNKNKGEFYSYLGNSNVRWGAF--DLDELAQMKFEDHE 59 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRS 339 + G+++ R + I A V+ G+DS +L + Sbjct: 60 HVRYGIKAGDLIVCEGG--EPGRCAIWEDDLPNMKIQKALHRVRTIDGLDSEFLYYWFLF 117 Query: 340 YDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 K+ A + L + +K +P+ +PP+K Q + ++ +I KI + Sbjct: 118 AGKNKLLDAYFTGTTIKHLTGKALKEIPIKIPPLKHQKHVAVLLRGFDKKI-----KINR 172 Query: 399 SI-VLLKERRSSF 410 I L++ + Sbjct: 173 QINQTLEQMAQTL 185 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 56/463 (12%), Positives = 137/463 (29%), Gaps = 76/463 (16%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W + + G+ + K+ Y+G +V G + Sbjct: 3 SEWIDAKLGDYIDSCLGKMLDKNKNKGEFYSYLGNSNVRWGAFDLDELAQMKFEDHEHVR 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 G ++ + G R AI D + ++ D L + + Sbjct: 63 YGIKAGDLIVCEGGEPGRCAIWEDDLPNMKIQKALHRVRTIDGLDSEFLYYWFLFAGKNK 122 Query: 137 -IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++A G T+ H K + IP+ IPPL Q + + +I ++ Sbjct: 123 LLDAYFTGTTIKHLTGKALKEIPIKIPPLKHQKHVAVLLRGFDKKIKINRQINQTLEQMA 182 Query: 196 KEKKQA-------LVSYIVTKG--------------------------LNPDVKMKDSGI 222 + ++ ++ + G + ++ Sbjct: 183 QTLFKSWFVDFDPVIDNALDAGSPIPEVFEARVERRKAVRESADFKPLPDDVRQLFPREF 242 Query: 223 E--WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG------ 274 E +G VP W F L+ + + + II+ + ++ Sbjct: 243 EESELGWVPKGWSFTKFGDLLDKTIGGDWGKDVPDEKHTEQVKIIRGTDIPDLNAGGISS 302 Query: 275 ----LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER------GIITSAYMAVK 324 + ++ +IV + + RS + G++ A + Sbjct: 303 APTRWVESKKLKTRKLEHADIVIEVSGGSPKQPTGRSLLITNDVLSRLGGVVEPASFCRR 362 Query: 325 PHGID-------STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP-VLVPPIKEQF 376 ++ S +L ++ + + + S + + + V++P + Sbjct: 363 FKPVNEKVGLLASEHLKFIYAAGKMWEYQN--QSTGIANFQTKFFLEAEYVMIPNTE--- 417 Query: 377 DITNVINVETARIDVLVEKIEQSIVL-LKERRSSFIAAAVTGQ 418 V+ + + +EK + S + L++ R + + ++G+ Sbjct: 418 ----VLEHYFSFVMSWIEKRQSSTSIGLEKLRDTLLPKLISGE 456 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 27/215 (12%), Positives = 54/215 (25%), Gaps = 29/215 (13%) Query: 4 YKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDII---------YIGLE 54 + ++++S +G +PK W G + GKD+ I Sbjct: 238 FPR--EFEESE---LGWVPKGWSFTKFGDLLDKTIGG--DWGKDVPDEKHTEQVKIIRGT 290 Query: 55 DVESGT-GKYLPKDGNSRQSDTSTVSIFAKGQILY-----GKLGPYLRKAIIADFD---- 104 D+ G +S I+ P R +I + Sbjct: 291 DIPDLNAGGISSAPTRWVESKKLKTRKLEHADIVIEVSGGSPKQPTGRSLLITNDVLSRL 350 Query: 105 -GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 G+ + K V ++ + + E S + Sbjct: 351 GGVVEPASFCRRFKPVNEKVGLLASEHLKFIYAAGKMWEYQNQS--TGIANFQTKFFLEA 408 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + E + + + I +R + EK Sbjct: 409 EYVMIPNTEVLEHYFSFVMSWIEKRQSSTSIGLEK 443 >gi|317178850|dbj|BAJ56638.1| Type I R-M system S protein [Helicobacter pylori F30] Length = 257 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 38/103 (36%), Gaps = 1/103 (0%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I S + + S ++ K + YL + + + +G Sbjct: 68 ISSSGVYAGYVSYWDIPVFLADSFSVSPKQKTLMPKYLFHYLTTQQ-DAIHATKSTGGIP 126 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + +D++ + +PP++ Q +I +++ T L + +Q Sbjct: 127 HVYSKDLQNFLIPIPPLEIQQEIVKILDAFTELNTELKARKKQ 169 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 46/162 (28%), Gaps = 12/162 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + G++ K + + + + G + +R + Sbjct: 13 PKGVEFRKLGEVCDFQKGKSITK-KAVTFGKVPVISGGRQPAYYHNEVNRSGE------- 64 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I G Y D + F V PK + I A Sbjct: 65 ---TIAISSSGVYAGYVSYWDIPVFLADSFSV-SPKQKTLMPKYLFHYLTTQQDAIHATK 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + H K + N +PIPPL Q I + + A T Sbjct: 121 STGGIPHVYSKDLQNFLIPIPPLEIQQEIVKILDAFTELNTE 162 >gi|311110802|ref|ZP_07712199.1| putative type I restriction modification DNA specificity domain protein [Lactobacillus gasseri MV-22] gi|311065956|gb|EFQ46296.1| putative type I restriction modification DNA specificity domain protein [Lactobacillus gasseri MV-22] Length = 288 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 37/265 (13%), Positives = 96/265 (36%), Gaps = 19/265 (7%) Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + ++ T + +K + I + P Q I + +++ +I + Sbjct: 10 NYRYLYYALKNAHIPNTGYNRHFKWLKEITINYPDKNRQNDIVNILD----KLEYIIKMK 65 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 + ++ E +A V +P++K KD ++ + + K + R Sbjct: 66 SQELDKFDELIKA---RFVEMFGDPEIKNKDKSLKKLCDICLVNPDKR------KDPRLT 116 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLR 306 +E + + +S + ++T N+ L E + + +++F I ++N K ++ Sbjct: 117 NNDLEVSFVPMSAVSENGDIDTTNIKLYSEVRKGFTYFSSNDVLFAKITPCMENGKGAIA 176 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWL----MRSYDLCKVFYAMGSGLRQSLKFEDV 362 + G ++ + ++P S S+ GS ++ + + + Sbjct: 177 QNLKNDIGFGSTEFHVLRPLENLSNPYWLYVLTTFDSFRKVAEINMTGSAGQKRVPVKFL 236 Query: 363 KRLPVLVPPIKEQFDITNVINVETA 387 + V +PP+ Q + N + Sbjct: 237 ENYKVNIPPLSLQNEFANFVQQVDK 261 Score = 43.6 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 54/175 (30%), Gaps = 15/175 (8%) Query: 27 VVPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + + +N + + + ++ ++ + V G + + F Sbjct: 96 LKKLCDICLVNPDKRKDPRLTNNDLEVSFVPMSAVSE-NGDIDTTNIKLYSEVRKGFTYF 154 Query: 82 AKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDV 133 + +L+ K+ P + + + G ST+F VL+P P L Sbjct: 155 SSNDVLFAKITPCMENGKGAIAQNLKNDIGFGSTEFHVLRPLENLSNPYWLYVLTTFDSF 214 Query: 134 TQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + E G+ K + N + IPPL+ Q + I Sbjct: 215 RKVAEINMTGSAGQKRVPVKFLENYKVNIPPLSLQNEFANFVQQVDKSKVANIVY 269 >gi|255022639|ref|ZP_05294625.1| hypothetical protein LmonocyFSL_02371 [Listeria monocytogenes FSL J1-208] Length = 261 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 58/168 (34%), Gaps = 10/168 (5%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 KN + I NI ++ G E+Y + +++ G+++F Sbjct: 10 KNVHYGDVLIKYPCILNIKKEEIPYITGGCLEAYNS-NLLENGDLIFADAAEDETVGKAV 68 Query: 307 SAQVMERGIITSAYMAVKPHGIDS---TYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDV 362 + + S + +L + + S + + G + S + ++ Sbjct: 69 EVNGITNENLVSGLHTIVARATTQKAKYFLGYYINSDIYHRQLLRLMQGSKVSAISKGNL 128 Query: 363 KRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 ++ V P I+EQ I + ++D + + + LK+ + Sbjct: 129 QKTDVSFPKDIEEQQKIGSY----FKKLDSTIALHQHKLDTLKQMKKG 172 >gi|257083314|ref|ZP_05577675.1| predicted protein [Enterococcus faecalis Fly1] gi|256991344|gb|EEU78646.1| predicted protein [Enterococcus faecalis Fly1] Length = 374 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 46/393 (11%), Positives = 115/393 (29%), Gaps = 40/393 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W++ + + + G P G + D IF Sbjct: 13 WELCKLGELIESFDSERIPIDSSLRISGQ----------YPYYGATGIIDYIDSYIFDGE 62 Query: 85 QILYGKLGPYL-----RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +L + G + A + + +++ +L+ I Sbjct: 63 YVLLAEDGANIIMRNYPVAYLTQGKFWLNNHAHIMRMVKGDN----QFLVQILEKMNYSK 118 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + I + +P EQ I I + + +L K Sbjct: 119 YNTGTAQPKLNSNIVKRINLRVPIPEEQQKIGTLFKQLDDTITLHQRKLDQLKKLKKAYL 178 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 QA+ V+ + K ++ G W++ ++ + + K+ S+ Sbjct: 179 QAM---FVSMNTKKNKVPKLRFTDFKGE----WKLCKLENIIEKQIKGKAKVENLCNGSV 231 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 Y + R G KP + V +I+ + + K +G++ S Sbjct: 232 EYLDA-----NRLNGGKPIYTKALPDVSERDIIILWDGSKAGKVY-----YGFKGVLGST 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 A + ++ + + ++ + + P+ + +EQ + Sbjct: 282 LKAYQLKECANSQFIYQQLLDNQNNIYNNYRTPNIPHVVKNFSSIFPIWMTSFEEQSQMA 341 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 +++ + +D + + I + + S++ Sbjct: 342 DIL----SNLDNRIILQQNLIDTMISLKKSYLQ 370 >gi|57237937|ref|YP_179185.1| type II restriction-modification enzyme [Campylobacter jejuni RM1221] gi|57166741|gb|AAW35520.1| type II restriction-modification enzyme [Campylobacter jejuni RM1221] gi|315058494|gb|ADT72823.1| Type I restriction-modification system, DNA-methyltransferase subunit M / Type I restriction-modification system, specificity subunit S [Campylobacter jejuni subsp. jejuni S3] Length = 1343 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 48/459 (10%), Positives = 122/459 (26%), Gaps = 78/459 (16%) Query: 26 KVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVS- 79 ++V + L G + + + I + ++ + + Sbjct: 892 ELVRLGEVCDLFNGYAFKKTDYVEKSNTLLIRMGNIRPNGEFDAEHKIQYLPDNFNNKYK 951 Query: 80 --IFAKGQILYGKLGPYLRKAII---------ADFDGICSTQF--LVLQPKDVLPELLQG 126 + G ++ I+ + + + + + L + ++ + L+ Sbjct: 952 DYLLNDGDVIIAMTDMGNAMNILGVPTIVKNKNNRNFLLNQRVGKLFNFSEKIIVQYLKY 1011 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK------------- 173 L S +V ++ + G + I + +P+PPL Q I + Sbjct: 1012 ALSSNEVKKQFKLQGYGGLQINLGKTQILSTKIPLPPLEIQKQIVAECEKVEEQYNTLSL 1071 Query: 174 -IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV------- 225 I I ++ + + + K +++ + D + S I+ Sbjct: 1072 SIEEYQKLIKAILQKCGIIEDDQEYKLNSILENLQKLESKLDFNLLFSFIDDFTNARQED 1131 Query: 226 ------------------------GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 G + +RK + NI + Sbjct: 1132 LKKFKEFVKNIKAILGTFSTPPKQGWNKEKLNEIVSIQSGGTPDRKVKEYWNGNINWVKS 1191 Query: 262 GNIIQKLETRNMGLKPESY-----ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + + + +++ + + K + + I Sbjct: 1192 EVCQNCYVYDYQVKEKITELGLQKSSAKLLKKETTLIALVGATIGKIGFLTFESATNQNI 1251 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 T Y + L F +G +K L + +PP++ Q Sbjct: 1252 TGLY---PKNLKILNTKYLYYACMGLYGQFRKLGDFAMA--NSNFIKNLTISLPPLEIQE 1306 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I I + +ID L + L++ + + + Sbjct: 1307 KIVQNIELVEQQIDFL----NLKLEFLEKEKEKILQKYL 1341 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 70/210 (33%), Gaps = 12/210 (5%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 + K+S E V L + T+ K+ L+ G + + + Sbjct: 881 DELNPFKNSKFELVRLGEVCDLFNGYAFKKTDYVEKSNTLLIRMGNIRPNGEFDAEHKIQ 940 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQND-----KRSLRSAQVMERGIITSAY--MAVK 324 + + +++ G+++ D+ N ++ + ++ + Sbjct: 941 YLPDNFNNKYKDYLLNDGDVIIAMTDMGNAMNILGVPTIVKNKNNRNFLLNQRVGKLFNF 1000 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I YL + + S ++ K F G GL+ +L + + +PP++ Q I Sbjct: 1001 SEKIIVQYLKYALSSNEVKKQFKLQGYGGLQINLGKTQILSTKIPLPPLEIQKQIVAECE 1060 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + L SI ++ + + Sbjct: 1061 KVEEQYNTL----SLSIEEYQKLIKAILQK 1086 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 65/190 (34%), Gaps = 12/190 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY--LPKDGNSRQSD 74 + W + + +G T + +I ++ E ++ + + Sbjct: 1155 QGWNKEKLNEIVSIQSGGTPDRKVKEYWNGNINWVKSEVCQNCYVYDYQVKEKITELGLQ 1214 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD-VLPELLQGWLLSIDV 133 S+ + K L +G + K F+ + L PK+ + + + + Sbjct: 1215 KSSAKLLKKETTLIALVGATIGKIGFLTFESATNQNITGLYPKNLKILNTKYLYYACMGL 1274 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + + A+ I N+ + +PPL Q I + I +ID L + + Sbjct: 1275 YGQFRKLGD---FAMANSNFIKNLTISLPPLEIQEKIVQNIELVEQQIDFLNLKLEFLEK 1331 Query: 194 LLKEKKQALV 203 ++ Q + Sbjct: 1332 EKEKILQKYL 1341 >gi|261401231|ref|ZP_05987356.1| type I restriction modification DNA specificity family protein [Neisseria lactamica ATCC 23970] gi|269208819|gb|EEZ75274.1| type I restriction modification DNA specificity family protein [Neisseria lactamica ATCC 23970] Length = 432 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 57/435 (13%), Positives = 125/435 (28%), Gaps = 48/435 (11%) Query: 27 VVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAK 83 + I ++N ++ ++I+Y+ ++ + + + + Sbjct: 4 QIKIGEIAEINANSLTQKDMFQEIMYLDTGNITRNEIDNIQILNITMDKIPSRAKRKVKD 63 Query: 84 GQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 I+Y + P + + I ST F + D + + L Sbjct: 64 KTIIYSTVRPNQEHYGFLENPSDNFIVSTGFSTIDVYDDNTDEKFIYYLLTQKHVTDYLH 123 Query: 141 CEGAT----MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 G + I N+ +P L Q I + +D I + L+ Sbjct: 124 TIGENSVSSYPSINPDDIANLKFTVPYLKTQQSIAAVL----SALDKKIALNKQINARLE 179 Query: 197 EKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLV------PDHWEVKPFFALVTELNRK 247 E + L Y + PD K SG E V P WEVK + Sbjct: 180 EMAKTLYDYWFVQFDFPDANGKSYKSSGGEMVFDETLKRKIPKGWEVKQISHWIKADKSG 239 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESY---------ETYQIVDPGEIVFRFIDL 298 + + N ++ + + + ++++ P + V Sbjct: 240 DWGKEQQEGNYTVKVNCVRGADINAINSQGNIEAPIRFILAKNEHKLLSPFDFVVEISGG 299 Query: 299 QNDKRSLR-------SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG- 350 + + R + +I S + S + + D+ K G Sbjct: 300 SPTQSTGRLAPISQYVLDRFDLPLICSNFCKAISLKDTSYFYQFAFMWSDIYKNNILFGW 359 Query: 351 ---SGLRQSLKFEDVKRLPV-LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + ++L F++ PP + +I+ + + + L + Sbjct: 360 EGKTSGIKNLLFDNFVNGYFECFPPKEIAEQFFKIIDKNHQE----QQLLLKQNHQLTQL 415 Query: 407 RSSFIAAAVTGQIDL 421 R + + GQ+ + Sbjct: 416 RDFLLPMLMNGQVSV 430 >gi|37678451|ref|NP_933060.1| type I restriction-modification enzyme, specificity subunit [Vibrio vulnificus YJ016] gi|37197191|dbj|BAC93031.1| type I restriction-modification enzyme, specificity subunit [Vibrio vulnificus YJ016] Length = 380 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 51/405 (12%), Positives = 118/405 (29%), Gaps = 40/405 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K ++ + D Y + +G G L + T + + Q Sbjct: 2 KEYTLRDVL-IRQKEAITVEDDAEYKRITIKMNGNGVLLRDEVIGDAIGTKRQFLVSSDQ 60 Query: 86 ILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIEAI 140 + K+ I I + F L ++ + + Sbjct: 61 FVLSKIDARNGAFGIVPKSCDGAIITGNFWAFDVNSELADVKYLDFMSKTPEFKDFCIVA 120 Query: 141 CEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG T + D + + +P LAEQ + KI+ +I+ R + L Sbjct: 121 SEGTTNRKYLDENKFLDKRILLPELAEQKKVVAKILKFKNKIELARKIRNEILSDLYVLL 180 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + ++ + KP + RK + + L Sbjct: 181 NSTFHKLI----------------------EGAVYKPMSKVAPLERRKVEIDVNAEYPEL 218 Query: 260 SYGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + + + PG++VF + ++ + R + + Sbjct: 219 GVRCFGNGTFHKPILNGMDVGTKKLYQMVPGDLVFSNVFAWEGAIAVVKKEDEGR-VGSH 277 Query: 319 AYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMGSGLRQS---LKFEDVKRLPVLVPPIK 373 ++ P + + +L + + + + A G L + ++ + V VP Sbjct: 278 RFITCLPKSGVVTADFLCFYFLTTEGLEKIQAASPGGAGRNRTLGLKKLENIEVPVPDYD 337 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +Q N + ++ + + ++ L+ S + A G+ Sbjct: 338 KQL----WFNQLQSYVEKIKQAQSENATELEALMPSILDKAFKGE 378 >gi|167854666|ref|ZP_02477446.1| HP0790-like protein [Haemophilus parasuis 29755] gi|167854203|gb|EDS25437.1| HP0790-like protein [Haemophilus parasuis 29755] Length = 199 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 15/134 (11%), Positives = 42/134 (31%), Gaps = 5/134 (3%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 Y + + + + +D Y+ + + Sbjct: 55 YHNEYNRNGKTITVAGSGAYAGFIMYWEEPIFVSDAFSIKSDETLLDLKYVYHFLLQHQQ 114 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 GSG+ + +D+ L + +PP+ Q +I +++ T+ L ++ + Sbjct: 115 KIYGMKKGSGV-PHVYPKDLSTLVIPIPPLDVQQEIVRILDAFTSLTAELTAELTAELTS 173 Query: 403 LKER----RSSFIA 412 +++ R + Sbjct: 174 RQKQYQYFRDKLLN 187 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 62/184 (33%), Gaps = 12/184 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + T++ G+T I +D G + G + + Sbjct: 17 EFKSLGDVTEMKRGKT---------ITAKDASGGDIPVIS--GGQKPAYYHNEYNRNGKT 65 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I G Y + + S F + + +L L + + Q+I + +G+ Sbjct: 66 ITVAGSGAYAGFIMYWEEPIFVSDAFSIKSDETLLD-LKYVYHFLLQHQQKIYGMKKGSG 124 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + H K + + +PIPPL Q I + A T L E + +++ Q Sbjct: 125 VPHVYPKDLSTLVIPIPPLDVQQEIVRILDAFTSLTAELTAELTAELTSRQKQYQYFRDK 184 Query: 206 IVTK 209 ++ Sbjct: 185 LLNF 188 >gi|147920565|ref|YP_685638.1| type I restriction modification system, specificity subunit (fragment) [uncultured methanogenic archaeon RC-I] gi|110621034|emb|CAJ36312.1| type I restriction modification system, specificity subunit (fragment) [uncultured methanogenic archaeon RC-I] Length = 194 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 65/198 (32%), Gaps = 16/198 (8%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET----RNMGLKPESYE 281 +P ++ + ++ T + + ++ ++ + +K Sbjct: 4 NEIPKMSIGNLVVSVKSGISSYYTNGGDLVVPMVNIKDLQDGNIITRSVDKVKIKDTKLL 63 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 I+ +IV I QN K ++ ++ I ++ I + + S Sbjct: 64 AKNILSKDDIVVS-IKGQNYKAAVAGSEHEGYAISSNLIAFTLNDRILPEIVEAYLNSPY 122 Query: 342 LCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVI---NVETARIDVLVEKIE 397 + A SG L + + V VPP +Q I + +D E Sbjct: 123 GQRELRARASGSTMPGLNTRTLLEVAVPVPPPDKQASIAGYLRLARERRKLLDR-----E 177 Query: 398 QSIVLLKERRSSFIAAAV 415 Q I L++ +++ I + Sbjct: 178 QMI--LEQLKNTIIGDVM 193 Score = 43.2 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 65/164 (39%), Gaps = 13/164 (7%) Query: 20 AIPKHWKVVPIKR-FTKLNTGRT--SESGKD--IIYIGLEDVESGTGKYLPKDGNSRQS- 73 IPK + I + +G + +G D + + ++D++ G D + Sbjct: 5 EIPK----MSIGNLVVSVKSGISSYYTNGGDLVVPMVNIKDLQDGNIITRSVDKVKIKDT 60 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAI---IADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 +I +K I+ G + A+ + I S +LPE+++ +L S Sbjct: 61 KLLAKNILSKDDIVVSIKGQNYKAAVAGSEHEGYAISSNLIAFTLNDRILPEIVEAYLNS 120 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + + A G+TM + + + + +P+PP +Q I + Sbjct: 121 PYGQRELRARASGSTMPGLNTRTLLEVAVPVPPPDKQASIAGYL 164 >gi|218562667|ref|YP_002344446.1| restriction modification enzyme [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|112360373|emb|CAL35169.1| restriction modification enzyme [Campylobacter jejuni subsp. jejuni NCTC 11168] gi|315927941|gb|EFV07263.1| type I restriction modification DNA specificity domain protein [Campylobacter jejuni subsp. jejuni DFVF1099] Length = 1339 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 47/384 (12%), Positives = 115/384 (29%), Gaps = 24/384 (6%) Query: 52 GLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI--IADFDGICST 109 +++ GK++ + + + IF IL G L K FD Sbjct: 958 RYANIKKHKGKFVSANNHILSVKDKSKIIFDFLYILLEICGQKLYKQGQQYPQFDTNIFY 1017 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 F + P + + + I+ ++ + + E Sbjct: 1018 SFKIPLPPLEIQKQIVAECEKIEEQHNTLSLSIKEYQKLIKAMLQKSGIIEDNQEYELNS 1077 Query: 170 IREKI---------IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM--K 218 I E + I+ I+ +E + K++ ++ Sbjct: 1078 ILENLQKLESKLDFNLLLSLIEEQISHSEVLVEETQSKERKQDFNAFKNFSKTIQELLQT 1137 Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET--RNMGLK 276 S G + + + L + + + ++ K + Sbjct: 1138 LSTPPKDGWKRISLKNEQYMELNPSKKEISKLDENMLVSFIEMASVSDKGYIQSKIDRSL 1197 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI---ITSAYMAVKPHGIDSTYL 333 E + Y +I+ I + A+ + I T ++ G+DS++L Sbjct: 1198 NEVRKGYTYFIENDILIAKITPCMENGKCAIAKNLTNNIGFGSTEFHIFRAKTGLDSSFL 1257 Query: 334 AWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + ++ + G+ + + + L + +PP++ Q I I + +ID+ Sbjct: 1258 FYNLNQQNIREKAALAMTGASGHKRVPISFYENLTIPLPPLEIQEKIVQNIELVEQQIDL 1317 Query: 392 LVEKIEQSIVLLKERRSSFIAAAV 415 L + L++ + + + Sbjct: 1318 L----NLKLEFLEKEKEKILQKYL 1337 >gi|162453797|ref|YP_001616164.1| subunit S of type I restriction-modification system [Sorangium cellulosum 'So ce 56'] gi|161164379|emb|CAN95684.1| subunit S of type I restriction-modification system [Sorangium cellulosum 'So ce 56'] Length = 440 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 56/390 (14%), Positives = 110/390 (28%), Gaps = 34/390 (8%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G +P+ W + +G + + E G+Y Sbjct: 3 GPLPEGWAETTLASICSHRSGSSKLIKGKL------HAEQRPGRYQGFSAAGPDVWCDG- 55 Query: 79 SIFAKGQ-ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 +G+ I+ +G KA A V+ P + ++ +L D Sbjct: 56 -WEHEGEAIVVSAVGTRCGKAFKARGRWSAIANTHVVWPDERAIDVGYLFLHLNDEGFWA 114 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G+ + P +PPL EQ I K A +D R LL+ Sbjct: 115 KG---GSAQPFVKVRETLERPFALPPLPEQRRIVAKAEALLGEVDAAKARLARSSLLLRR 171 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVG--------------------LVPDHWEVKPF 237 +QA+++ + L D++ + G P+ + + Sbjct: 172 LRQAVLAAACSGRLTEDLRAPGAAAPAAGPAEPPPRCSTSAPAAPGASSDGPERPLPRSW 231 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 N + S +S + Y ++ Sbjct: 232 VRCPFGSLVDNHDGRRVPVSSAVRARRRGPYPYYGASGVIDSIDGYLFDGEYLLIAEDGA 291 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 + + + R + + V+P L +L + + + + + L Sbjct: 292 NLLSRNTRVAFAASGRFWVNNHAHVVQPKA--GVVLGYLELLLNSLDLQHHVTGSAQPKL 349 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + +PV VPP +EQ +I A Sbjct: 350 TQAALNGIPVPVPPAEEQAEIVRRAQALFA 379 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 52/168 (30%), Gaps = 11/168 (6%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 G +P+ W + + ++KLI+ + + Q + + +E Sbjct: 3 GPLPEGWAETTLAS-ICSHRSGSSKLIKGKLHAEQRPGRYQGFSAAGPDVWCDGWE---- 57 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 GE + ++ A+ I + + ID YL + Sbjct: 58 -HEGEAIVVSAVGTRCGKAF-KARGRWSAIANTHVVWPDERAIDVGYLFLHLNDEGF--- 112 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 +A G + +K + P +PP+ EQ I +D Sbjct: 113 -WAKGGSAQPFVKVRETLERPFALPPLPEQRRIVAKAEALLGEVDAAK 159 >gi|293400126|ref|ZP_06644272.1| type I restriction-modification system, specificity subunit [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291306526|gb|EFE47769.1| type I restriction-modification system, specificity subunit [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 204 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 46/179 (25%), Positives = 76/179 (42%), Gaps = 7/179 (3%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + + + +SG + I LE +E GTG+ L ++ QS T G +L+ Sbjct: 14 FSEIARRRKEKYSPDSGVEYPCIELEHIEQGTGRLLGNVSSTTQSSIKTA--ARSGDVLF 71 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 GKL PYLRK A+ D +CS++ P + + +L+ + R+ I G M Sbjct: 72 GKLRPYLRKFAFAEQDIVCSSEIWAFIPSEYVIPKYLYYLVQTEHFLRVANISSGTKMPR 131 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 A+W I P IP + Q I + ID I + +L ++ L+ + Sbjct: 132 AEWANIEKEPFDIPCILIQEKIVSIL----EAIDKKICTSGDSLRMLINFREGLLQQLF 186 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 17/129 (13%), Positives = 47/129 (36%), Gaps = 7/129 (5%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G+++F + K + ++ + + + + YL +L+++ +V Sbjct: 65 RSGDVLFGKLRPYLRKFAFAEQDIV---CSSEIWAFIPSEYVIPKYLYYLVQTEHFLRVA 121 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 ++ ++++ P +P I Q I +++ ID + S+ +L Sbjct: 122 NISSGTKMPRAEWANIEKEPFDIPCILIQEKIVSILEA----IDKKICTSGDSLRMLINF 177 Query: 407 RSSFIAAAV 415 R + Sbjct: 178 REGLLQQLF 186 >gi|257457140|ref|ZP_05622317.1| putative DNA methylase-type I restriction-modification system [Treponema vincentii ATCC 35580] gi|257445519|gb|EEV20585.1| putative DNA methylase-type I restriction-modification system [Treponema vincentii ATCC 35580] Length = 440 Score = 63.7 bits (153), Expect = 6e-08, Method: Composition-based stats. Identities = 41/383 (10%), Positives = 104/383 (27%), Gaps = 33/383 (8%) Query: 45 GKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF 103 KD Y + D+E+ K + + + S G++L K+G R I+ Sbjct: 30 EKDYAYMVRTTDLETNNFSDNVKYVSKSTYEFLSKSKVFGGEVLINKIGSPGRTYIMPKL 89 Query: 104 DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 D S + + + + L + + I + + Sbjct: 90 DMPISLGMNLFLLRLKGDVIDENTLYLFLNSTVGKNIIQRKVNGTVPLTIDKKAIRSLYV 149 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK------- 216 R+++ ++ + + L+S + K NP + Sbjct: 150 PVFSHEFRKRLNYLMSDLNN---ASKEANTKYTQAENLLISELGLKNFNPSNEKVSIKTL 206 Query: 217 -------------MKDSGIEWVGLVPDHWEVKPFFALVTELNRKN-----TKLIESNILS 258 E ++ V+ + + +N +I Sbjct: 207 KESFLRTGRIDSEYYQPKYEIFDEKINNIGVEKLENICSLINYGTVPTSPYVKNNKSIPY 266 Query: 259 LSYGNIIQKLETRNMGLK--PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + N+ + E + +I+ + D + Q Sbjct: 267 IKGMNLKNCFIVGDFDEIENTEDLQDKFFTKENDIIISQMGTVGDIGVVTKEQENYLFAS 326 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKE 374 + + + ++ +++ + + +RQ+ +K +P+ + + Sbjct: 327 FTIRARLNDERFNPYFVGAYIQNVAKDFYLHRNIAQASVRQNTDLPTIKNMPIPLVKKEV 386 Query: 375 QFDITNVINVETARIDVLVEKIE 397 Q +I + I E +E Sbjct: 387 QDEIASYIKQSMEYSKKAKELLE 409 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 15/148 (10%), Positives = 43/148 (29%), Gaps = 8/148 (5%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + + + V GE++ I + + + + +K Sbjct: 49 DNVKYVSKSTYEFLSKSKVFGGEVLINKIGSPGRTYIMPKLDMPISLGMNLFLLRLKGDV 108 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ID L + S + +G ++ + ++ L V V + +N Sbjct: 109 IDENTLYLFLNSTVGKNIIQRKVNGTVPLTIDKKAIRSLYVPVFS----HEFRKRLNYLM 164 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + ++ ++ + I+ Sbjct: 165 SDLNNASKEANTKYTQAENL---LISEL 189 >gi|270292638|ref|ZP_06198849.1| type I restriction-modification enzyme, S subunit, EcoA family [Streptococcus sp. M143] gi|270278617|gb|EFA24463.1| type I restriction-modification enzyme, S subunit, EcoA family [Streptococcus sp. M143] Length = 228 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 54/177 (30%), Gaps = 13/177 (7%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNM----GLKPESYETYQIVDPGEIVFRFIDLQ 299 + I + N+ IV+ +++ Sbjct: 56 PRGGRESYVNEGIALIRSMNVYDGKFIFKDLAYLTNVQAEKLNNVIVESDDVLLNITGAS 115 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMG---SGLRQ 355 + + ++ + + + S L+ + + + +G RQ Sbjct: 116 VSRCCIVPQNILPARVNQHVSIIRCKKHLLSPIFLNQLLITSEFKSLLLKIGESSGATRQ 175 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE-RRSSFI 411 ++ ++ L + +PP+ Q + + + A++D E I L + +SS I Sbjct: 176 AITKNQIEELYIPLPPLSLQNEFADFV----AQVDKSQFACEIVIKLWRNSLKSSII 228 >gi|238810192|dbj|BAH69982.1| hypothetical protein [Mycoplasma fermentans PG18] Length = 225 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 66/192 (34%), Gaps = 9/192 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP++W+V I K+ G T + +I ++ +V + K N + Sbjct: 34 KIPENWEVKKIAEICKIFLGGTPSTKNREYWNGEINWLNSGEVANFPIIDSEKTINEKGL 93 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + KG ++ G + D C Q +V ++ L ++ + + Sbjct: 94 KNSNTKLLKKGTVVISITGNIRVSYLAIDS---CINQSIVGIEENELLKIGYLYPFLKNK 150 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + G H + I N+ + +PP + +I + + I+ Sbjct: 151 IEFLIRSSTGNCQKHINKNFIENLKIVLPPKNVLDIFNNLTQNIYAKISQISLMTKKLIK 210 Query: 194 LLKEKKQALVSY 205 + L++ Sbjct: 211 FKNKLLPLLINQ 222 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 82/234 (35%), Gaps = 19/234 (8%) Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG-LVPDHWEVKPFFALVTELNR-----KN 248 ++ QA+ + + + K E + +P++WEVK + KN Sbjct: 1 MQVMGQAIFNRWFLQFEHFKKDNKFKYNEDLNLKIPENWEVKKIAEICKIFLGGTPSTKN 60 Query: 249 TKLIESNILSLSYG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + I L+ G N + + K +++ G +V ++ Sbjct: 61 REYWNGEINWLNSGEVANFPIIDSEKTINEKGLKNSNTKLLKKGTVVISITG------NI 114 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 R + + I + + ++ + + + + + + ++ + ++ L Sbjct: 115 RVSYLAIDSCINQSIVGIEENELLKIGYLYPFLKNKIEFLIRSSTGNCQKHINKNFIENL 174 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +++PP ++ ++ N T I + +I L + ++ + + QI Sbjct: 175 KIVLPP----KNVLDIFNNLTQNIYAKISQISLMTKKLIKFKNKLLPLLINQQI 224 >gi|225854058|ref|YP_002735570.1| type I restriction enzyme [Streptococcus pneumoniae JJA] gi|225722826|gb|ACO18679.1| type I restriction enzyme [Streptococcus pneumoniae JJA] Length = 326 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 35/91 (38%), Gaps = 5/91 (5%) Query: 333 LAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + + S + +G ++ + L + +PP+ EQ I I ++D Sbjct: 1 MKYYLLSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDE 60 Query: 392 LVEKIEQSIVLLKE----RRSSFIAAAVTGQ 418 E + L KE + S + A+ G+ Sbjct: 61 YAESYNRLEQLDKEFPDKLKKSILQYAMQGK 91 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 43/325 (13%), Positives = 100/325 (30%), Gaps = 57/325 (17%) Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ +LLS + R+ G + + + + +PPL+EQ I E I + ++D Sbjct: 1 MKYYLLSDNFINRVNNKSTGTSYPAINDYNFNLLLIALPPLSEQQRIVEAIESALEKVDE 60 Query: 184 LITERIRFIELLKEKK----QALVSYIVTKGLNPDVKMKDS------------------- 220 R +L KE ++++ Y + L +S Sbjct: 61 YAESYNRLEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEEG 120 Query: 221 ----------------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + G +P +W V + + + K + +I + I Sbjct: 121 KIKKKDLDISIVSQGDDNSYYGNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINN-KGVRI 179 Query: 265 IQKLETRNMGLKPESYETYQ----------IVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I+ + + + Y + +++ G Sbjct: 180 IRGGNIKPLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDG 239 Query: 315 IITSAYMA----VKPHGIDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPV 367 ++ ++ + I S +L + + S K + ++ + L + Sbjct: 240 VVAGGFIFQLTPFESSEIISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLI 299 Query: 368 LVPPIKEQFDITNVINVETARIDVL 392 + P +EQ IT + +++ L Sbjct: 300 PLAPFEEQELITQKVEKLFEKVNQL 324 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 142 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 201 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 202 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 261 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 262 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 321 Query: 182 DTLI 185 + L Sbjct: 322 NQLW 325 >gi|303262775|ref|ZP_07348713.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS292] gi|302636097|gb|EFL66594.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS292] Length = 191 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 50/178 (28%), Gaps = 2/178 (1%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSGT-LGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 M H K NI +P L EQ I ++ + I + L+K + + Sbjct: 120 MKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNLLVKSQFACEI 177 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 42/142 (29%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V + EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 >gi|281421787|ref|ZP_06252786.1| type I restriction-modification system, S subunit [Prevotella copri DSM 18205] gi|281404164|gb|EFB34844.1| type I restriction-modification system, S subunit [Prevotella copri DSM 18205] Length = 296 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 56/153 (36%), Gaps = 4/153 (2%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K+ + + T G+I++ + +K + + T Sbjct: 40 KIIQHLNKNERKINGTRHKFQKGQILYSKLRTYLNKVLVAPN---DGFCTTEIMAFGSYG 96 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + Y+ +++RS G G+ L D + +PP+ EQ I N I Sbjct: 97 ILSNNYICYVLRSLYFLDYTLQCGYGVKMPRLSTTDACNGLIPLPPLAEQERIVNEIQRL 156 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + ID++ + +++ ++ + A+ G+ Sbjct: 157 FSIIDIVENGKDGLQTAIQQAKNKILDHAIHGK 189 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 45/203 (22%), Positives = 76/203 (37%), Gaps = 4/203 (1%) Query: 27 VVPIKRFTKL---NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + T + + + LED+E T K + + + T F K Sbjct: 2 WTTVGEITNYGDSVNVQVEDIDNSDWVLELEDIEKDTAKIIQHLNKNERKINGTRHKFQK 61 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICE 142 GQILY KL YL K ++A DG C+T+ + +L ++ S+ Sbjct: 62 GQILYSKLRTYLNKVLVAPNDGFCTTEIMAFGSYGILSNNYICYVLRSLYFLDYTLQCGY 121 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G M N +P+PPLAEQ I +I ID + + +++ K + Sbjct: 122 GVKMPRLSTTDACNGLIPLPPLAEQERIVNEIQRLFSIIDIVENGKDGLQTAIQQAKNKI 181 Query: 203 VSYIVTKGLNPDVKMKDSGIEWV 225 + + + L P + E + Sbjct: 182 LDHAIHGKLVPQDPNDEPASELL 204 >gi|308190351|ref|YP_003923282.1| hypothetical protein MFE_08370 [Mycoplasma fermentans JER] gi|307625093|gb|ADN69398.1| hypothetical protein MFE_08370 [Mycoplasma fermentans JER] Length = 222 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 66/192 (34%), Gaps = 9/192 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP++W+V I K+ G T + +I ++ +V + K N + Sbjct: 31 KIPENWEVKKIAEICKIFLGGTPSTKNREYWNGEINWLNSGEVANFPIIDSEKTINEKGL 90 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + KG ++ G + D C Q +V ++ L ++ + + Sbjct: 91 KNSNTKLLKKGTVVISITGNIRVSYLAIDS---CINQSIVGIEENELLKIGYLYPFLKNK 147 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + G H + I N+ + +PP + +I + + I+ Sbjct: 148 IEFLIRSSTGNCQKHINKNFIENLKIVLPPKNVLDIFNNLTQNIYAKISQISLMTKKLIK 207 Query: 194 LLKEKKQALVSY 205 + L++ Sbjct: 208 FKNKLLPLLINQ 219 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/231 (12%), Positives = 80/231 (34%), Gaps = 19/231 (8%) Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVG-LVPDHWEVKPFFALVTELNR-----KNTKL 251 QA+ + + + K E + +P++WEVK + KN + Sbjct: 1 MGQAIFNRWFLQFEHFKKDNKFKYNEDLNLKIPENWEVKKIAEICKIFLGGTPSTKNREY 60 Query: 252 IESNILSLSYG---NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 I L+ G N + + K +++ G +V ++R + Sbjct: 61 WNGEINWLNSGEVANFPIIDSEKTINEKGLKNSNTKLLKKGTVVISITG------NIRVS 114 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + I + + ++ + + + + + + ++ + ++ L ++ Sbjct: 115 YLAIDSCINQSIVGIEENELLKIGYLYPFLKNKIEFLIRSSTGNCQKHINKNFIENLKIV 174 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 +PP ++ ++ N T I + +I L + ++ + + QI Sbjct: 175 LPP----KNVLDIFNNLTQNIYAKISQISLMTKKLIKFKNKLLPLLINQQI 221 >gi|32263453|gb|AAP78481.1| S.AhdI [Aeromonas hydrophila] Length = 227 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 53/131 (40%), Gaps = 8/131 (6%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLM 337 +I G+I+F + + +K L + E GI + ++ + P +++ Y+ ++ Sbjct: 86 KSRSKIFGLGDILFGRLRPELNKVYLVDGEPSE-GICSGEFIVLAPITSRVNARYVRHII 144 Query: 338 RSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + K + +D+ + V VPP++ Q I + + L ++ Sbjct: 145 ASPFVTKFIEKFRVGASLPRIAADDLLGIKVPVPPLEVQEQIARRLAEMDQELRGLRLRV 204 Query: 397 E----QSIVLL 403 E Q + L Sbjct: 205 EELPSQQLEAL 215 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 46/161 (28%), Positives = 74/161 (45%), Gaps = 8/161 (4%) Query: 30 IKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +++ G + I Y+GLE+V S TG+ + + + S S IF G I Sbjct: 38 LRQLVSEKKGAIDPQKQGERQISYLGLENVRSQTGELVGFEPRAASSIKSRSKIFGLGDI 97 Query: 87 LYGKLGPYLRKAIIADF---DGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAIC 141 L+G+L P L K + D +GICS +F+VL P V ++ + S VT+ IE Sbjct: 98 LFGRLRPELNKVYLVDGEPSEGICSGEFIVLAPITSRVNARYVRHIIASPFVTKFIEKFR 157 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 GA++ + I +P+PPL Q I ++ + Sbjct: 158 VGASLPRIAADDLLGIKVPVPPLEVQEQIARRLAEMDQELR 198 >gi|148927588|ref|ZP_01811059.1| restriction modification system DNA specificity domain [candidate division TM7 genomosp. GTL1] gi|147887064|gb|EDK72561.1| restriction modification system DNA specificity domain [candidate division TM7 genomosp. GTL1] Length = 200 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 54/160 (33%), Gaps = 4/160 (2%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + E + + + E Y +++ + D R +E I + Sbjct: 39 GYLYLDEIKTINVTAEELRKYSLMNGDILFTEGGDKDKLGRGTIWHGEIELCIHQNHIFR 98 Query: 323 VKPH--GIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDI 378 + Y+++ ++ F + SL +K L + P+ +Q +I Sbjct: 99 ARVDSGQFVPEYISYATKTTRARDYFLSKAKQTTNLASLNMTSLKNLQLPSIPLAQQKEI 158 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I + + I +++ + K R S +A A G+ Sbjct: 159 VESIVTKLSEIKSARKELIVAHHRSKALRQSILAKAFKGE 198 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 67/198 (33%), Gaps = 14/198 (7%) Query: 28 VPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 V ++ G T + Y+ + +V+ G + ++ Sbjct: 2 VEFGDIAEIKGGITKGRKLRGMPIGETPYLRVANVQDGYLYLDEIKTINVTAEELRKYSL 61 Query: 82 AKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G IL+ + G R I +C Q + + + + + ++ T R Sbjct: 62 MNGDILFTEGGDKDKLGRGTIWHGEIELCIHQNHIFRARVDSGQFVPEYISYATKTTRAR 121 Query: 139 AIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + ++ + + N+ +P PLA+Q I E I+ + I + E I Sbjct: 122 DYFLSKAKQTTNLASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKELIVAHH 181 Query: 194 LLKEKKQALVSYIVTKGL 211 K +Q++++ L Sbjct: 182 RSKALRQSILAKAFKGEL 199 >gi|307262547|ref|ZP_07544187.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306867759|gb|EFM99595.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 74 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 10/63 (15%), Positives = 23/63 (36%) Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 Y+ + + S F + + + ++ + +PP+ EQ I I + + Sbjct: 10 QYIYYYLSSPLFRNDFDGINTTTINQITQNNLNNRLIPLPPLNEQKRIVEKIEKLFSTLQ 69 Query: 391 VLV 393 L Sbjct: 70 NLE 72 >gi|260440925|ref|ZP_05794741.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae DGI2] Length = 253 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 28/251 (11%), Positives = 68/251 (27%), Gaps = 14/251 (5%) Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE 223 + Q I + + T TL + L K + + + L+ D ++ + Sbjct: 1 METQQKIVKILDKFTELEATLEATLEAELALRKRQYRYYRDLL----LDFDNQIGGGIAD 56 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 + K + + + + N++Q E + + S Sbjct: 57 GYQCRLKNVVWKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLSGYVPSEGKM 116 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 +I+ I K G + + V ++ YL ++ Sbjct: 117 TEYIVNDILIGNIRPYLKKIWQADCTGGTNGDV--LVIRVTDEKVNPKYLYQVLADDKFF 174 Query: 344 KVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEK 395 G + + + +PP+ EQ I ++ + + Sbjct: 175 AFNMKHAKGAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIAL 234 Query: 396 IEQSIVLLKER 406 + +E+ Sbjct: 235 RRKQYEYYREQ 245 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 70/187 (37%), Gaps = 8/187 (4%) Query: 27 VVPIKRFTKLNTGRT-SESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + + R S+ + Y+G++++ ++ GK L G T I Sbjct: 67 WKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLS--GYVPSEGKMTEYIV--N 122 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICE 142 IL G + PYL+K AD G + LV++ + V P+ L L + Sbjct: 123 DILIGNIRPYLKKIWQADCTGGTNGDVLVIRVTDEKVNPKYLYQVLADDKFFAFNMKHAK 182 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA M I +PIPPL EQ I + ++ I L +++ + Sbjct: 183 GAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQYEYY 242 Query: 203 VSYIVTK 209 ++ Sbjct: 243 REQLLAF 249 >gi|329955586|ref|ZP_08296494.1| type I restriction modification DNA specificity domain protein [Bacteroides clarus YIT 12056] gi|328525989|gb|EGF53013.1| type I restriction modification DNA specificity domain protein [Bacteroides clarus YIT 12056] Length = 333 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 55/148 (37%), Gaps = 11/148 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYL---PKDGNS 70 IP W+V + FT++ G T + G DI++I +D+ K++ ++ Sbjct: 49 EIPIDWQVKNLIDFTEIKNGATPSTADEANYGGDIVWITPKDLSDQQSKFVYQGERNITK 108 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + D+ + S+ +L P IA D + F PK + + + Sbjct: 109 QGFDSCSTSMLPINSVLMSSRAPI-GLVSIAKNDVCTNQGFKSFIPKKMEDS-IYLYYYI 166 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIP 158 ++IE + G T + P Sbjct: 167 KHHIKQIEQLGSGTTFKEVSRDDLCKFP 194 Score = 44.8 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 22/206 (10%), Positives = 61/206 (29%), Gaps = 25/206 (12%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL------ETRNMGLKPESY 280 +P W+VK N + I K + G + + Sbjct: 49 EIPIDWQVKNLIDFTEIKNGATPSTADEANYGGDIVWITPKDLSDQQSKFVYQGERNITK 108 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + + + + + + + + + P ++ + + + Sbjct: 109 QGFDSCSTSMLPINSVLMSSRAPIGLVSIAKNDVCTNQGFKSFIPKKMEDSIYLYYYIKH 168 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE--------QFDITNVINVETARIDVL 392 + ++ + + +D+ + P+LV KE Q I + + Sbjct: 169 HIKQIEQLGSGTTFKEVSRDDLCKFPILVVGAKESYRQWAELQNGIA---DKQF------ 219 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQ 418 +++ I +L ++R + + GQ Sbjct: 220 --VLQKEIAILTKQRDELLPLLMNGQ 243 >gi|218263899|ref|ZP_03477847.1| hypothetical protein PRABACTJOHN_03537 [Parabacteroides johnsonii DSM 18315] gi|218222410|gb|EEC95060.1| hypothetical protein PRABACTJOHN_03537 [Parabacteroides johnsonii DSM 18315] Length = 234 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 54/247 (21%), Positives = 99/247 (40%), Gaps = 21/247 (8%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 VV + + + + ++S +D+ +GLE + ++ D N+ D + F KGQ+ Sbjct: 3 VVKLGDVARESRLKWTKSKQDVPIVGLEHLIPDEIRFDAYDINT---DNTFSKRFVKGQV 59 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPK--DVLPELLQGWLLSIDVTQRIEAICEGA 144 L+G+ Y RKA IA+FDGICS V+Q +LPELL + + G+ Sbjct: 60 LFGRRRAYQRKAAIAEFDGICSGDITVIQAIEGKMLPELLPFIIQTPVFFDYANRGSAGS 119 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 W+ + + +PPL EQ ++ +K+ + + +LL + + S Sbjct: 120 LSPRVKWEHLADYEFELPPLEEQKILADKL-------WAAYRLKEAYKKLLVATDEMVKS 172 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + +P K + + L K ++ + + L GNI Sbjct: 173 QFIEMVGDPRNNPKGWPTKRLSE---------LAEYSIGLTYKPEQICDDGTIVLRSGNI 223 Query: 265 IQKLETR 271 + Sbjct: 224 QDGKISF 230 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 15/136 (11%), Positives = 45/136 (33%), Gaps = 3/136 (2%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + ++ + ++I + + G+++F K ++ Sbjct: 16 KWTKSKQDVPIVGLEHLIPDEIRFDAYDINTDNTFSKRFVKGQVLFGRRRAYQRKAAIAE 75 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLP 366 + G IT + + L +++++ L +K+E + Sbjct: 76 FDGICSGDIT--VIQAIEGKMLPELLPFIIQTPVFFDYANRGSAGSLSPRVKWEHLADYE 133 Query: 367 VLVPPIKEQFDITNVI 382 +PP++EQ + + + Sbjct: 134 FELPPLEEQKILADKL 149 Score = 36.3 bits (82), Expect = 9.9, Method: Composition-based stats. Identities = 7/46 (15%), Positives = 16/46 (34%), Gaps = 4/46 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKY 63 PK W + + + G T + I + +++ G + Sbjct: 185 PKGWPTKRLSELAEYSIGLTYKPEQICDDGTIVLRSGNIQDGKISF 230 >gi|886052|gb|AAC44216.1| restriction modification system S subunit [Spiroplasma citri] Length = 294 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 49/300 (16%), Positives = 92/300 (30%), Gaps = 18/300 (6%) Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 F + +LL + + T + K I IP L EQ I Sbjct: 3 FSMEINNLYFSTEYLYYLLLKFKKKELNKFIIKQTQPNLSKKIINQFIFKIPSLQEQTKI 62 Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV-TKGLNPDVKMKDSGIEWVGLVP 229 ID I + LL+++KQ ++ + + P ++ K EW Sbjct: 63 VNF----FSIIDRKIELIKEQLSLLEKQKQYYLNNMFANEKSYPKIRFKGFNDEWKSKKI 118 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + N KN I + L + + + IV Sbjct: 119 KELGNIKTGKTPSTKNEKNWLNDVLWITIPDM--TKKYLTNSKKKISLMASKKNPIVKEK 176 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 I+F I + + + I + D + + Y+ K+ Sbjct: 177 SILFSCIGTIGNIGITTTITSFNQQINS------ISSIKDGVEYVYYLFQYNTEKIKSYS 230 Query: 350 GSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + + + + V KEQ I N + ID +E I++ + LL++++ Sbjct: 231 SAQTLPMINKNYFENIEIFVSLNYKEQTKIANF----FSIIDRKIELIKEQLSLLEKQKQ 286 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 68/190 (35%), Gaps = 14/190 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 WK IK + TG+T + D+++I + D+ K + S + Sbjct: 112 EWKSKKIKELGNIKTGKTPSTKNEKNWLNDVLWITIPDMTKKYLTNSKKKISLMASKKNP 171 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 I + IL+ +G I I S + + + + L T++I Sbjct: 172 --IVKEKSILFSCIGTIGNIGITTT---ITSFNQQINSISSIKDGVEYVYYLFQYNTEKI 226 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 ++ T+ + NI + + ++ KI ID I + LL++ Sbjct: 227 KSYSSAQTLPMINKNYFENIEIFVSLNYKEQ---TKIANFFSIIDRKIELIKEQLSLLEK 283 Query: 198 KKQALVSYIV 207 +KQ ++ + Sbjct: 284 QKQYYLNNMF 293 >gi|167972319|ref|ZP_02554596.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 5 str. ATCC 27817] gi|184209400|gb|EDU06443.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 5 str. ATCC 27817] Length = 393 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 43/392 (10%), Positives = 122/392 (31%), Gaps = 26/392 (6%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + ++ T ++ +I GL + N+ ++ I Sbjct: 6 KLSSVFEIITTGKQKNTFNINLEGLYPL------ISASTANNGIMGYVDNYLYDGQNITI 59 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGATMS 147 ++G + F++ + + ++ +LL ++ ++I +I G T Sbjct: 60 SRVGNAGTTFYHEGKISLTDNCFILSKINKKIAKVKYVFYLLKLNEDKKIRSISHGTTRK 119 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + N+ + +P + Q I I +K + I+ Sbjct: 120 IINKTDLDNLIIYLPSIEIQNAIISIIEPHEKLFVKYSNLVDISSVENAKKDVDNLISII 179 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 +K+ I++ + ++ + + N K L + ++ + Sbjct: 180 EPIEKVINNIKN--IKFKIESLVNKYFDFLYSNLEDSNFKKYILGDLFTINRGQIINSKY 237 Query: 268 LETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 +E+ K Y + F I Q I + Sbjct: 238 IESNIGSYPVISSNTKNNGVFGYINSYMYDGEFITISADGAYAGTVFLQNGRFSITNVCF 297 Query: 321 MAVKPHGID----STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +K + ID + ++ ++++ + R +++ +K + + +P I+ Q Sbjct: 298 ILIKNNDIDFKFSNKFVYYILKKEQEVNKLKSQVGSSRPAVREYSLKEIKINLPNIEIQE 357 Query: 377 DITNV------INVETARIDVLVEKIEQSIVL 402 + + ++ + +I+ ++ I Sbjct: 358 KFSKIVEPLLNLSTKANKIEKILNDSLLKITK 389 >gi|219870606|ref|YP_002474981.1| Type I restriction-modification system, S subunit/Type I restriction modification DNA specificity domain-containing protein [Haemophilus parasuis SH0165] gi|219690810|gb|ACL32033.1| Type I restriction-modification system, S subunit/Type I restriction modification DNA specificity domain-containing protein [Haemophilus parasuis SH0165] Length = 454 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 80/233 (34%), Gaps = 21/233 (9%) Query: 189 IRFIELLKEKKQAL--VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 R E+ Q L ++ +G + + + I + + +V + Sbjct: 241 HRLQTANPEQYQQLWEIAEAFPRGFDEEGVPRGWEITTIDEN---------YNVVMGQSP 291 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K E + +L Y + R + + + +I I+ D Sbjct: 292 KGETYNEESNGTLFYQGRAEFG-WRYPEPRLYTTDPKRIAKKSNILMSVRAPVGDL---- 346 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 +E I A+ ++ + +++ + + S+ +D+K + Sbjct: 347 -NVALEDCCIGRGLAALSHKSNSLSFGLYQIKNLQNEFDVFNGEGTVFGSINQKDLKSIR 405 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 V+ P I + + + D+L+E + + I+ L++ R + ++G++ Sbjct: 406 VINPS----SKIIKLFDDVCSTNDLLIENLSREILSLRKIRDELLPMLLSGEV 454 >gi|238910688|ref|ZP_04654525.1| putative type I restriction-modification system, S subunit [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] Length = 404 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 61/199 (30%), Gaps = 16/199 (8%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTE---------LNRKNTKLIESNILSLSYGNIIQKLET 270 S E +P WE L T + + K I +S +K Sbjct: 93 SEEEKPFELPVGWEWTRLINLGTWALGSGFPNVVQGNSDKEILMCKVSDMNLEGNEKFIV 152 Query: 271 RNMGLKPESYETYQIVD---PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + + PG I+F I + R V E I + + Sbjct: 153 STINTISKDLADEYKIKTSEPGTIIFPKIGGAI-ATNKRRILVQETAIDNNCLGIKPCNA 211 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 I + ++ + D+ K ++ + +P+ +P +K Q I + + + Sbjct: 212 ISGEWFYLILSALDMSKY---QSGTSIPAINQSVIGSIPIALPSLKMQEKILSYVITLMS 268 Query: 388 RIDVLVEKIEQSIVLLKER 406 D L S+ ++ Sbjct: 269 LCDQLELHSLTSLDAHQQL 287 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 74/210 (35%), Gaps = 19/210 (9%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLE 54 +K K P+ S + +P W+ + G S K+I+ + Sbjct: 83 IKKQKPLPEI--SEEEKPFELPVGWEWTRLINLGTWALGSGFPNVVQGNSDKEILMCKVS 140 Query: 55 DVE-SGTGKYLPKDGNSRQSDTSTVSIFA---KGQILYGKLG---PYLRKAIIADFDGIC 107 D+ G K++ N+ D + G I++ K+G ++ I+ I Sbjct: 141 DMNLEGNEKFIVSTINTISKDLADEYKIKTSEPGTIIFPKIGGAIATNKRRILVQETAID 200 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + + + E L ++D + G ++ + IG+IP+ +P L Q Sbjct: 201 NNCLGIKPCNAISGEWFYLILSALD----MSKYQSGTSIPAINQSVIGSIPIALPSLKMQ 256 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKE 197 I +I D L + ++ ++ Sbjct: 257 EKILSYVITLMSLCDQLELHSLTSLDAHQQ 286 >gi|237738544|ref|ZP_04569025.1| restriction modification system DNA specificity subunit [Fusobacterium sp. 2_1_31] gi|229424211|gb|EEO39258.1| restriction modification system DNA specificity subunit [Fusobacterium sp. 2_1_31] Length = 203 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 72/202 (35%), Gaps = 11/202 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK--LETRNMGLKPE 278 I+ + +++ + ++ KN + I + GNI T + +K Sbjct: 4 DIKTNNKNWEIVKLEKYINIIGGYAFKNIDFKSTGIPLIRIGNINSGQFKSTNLVFIKEN 63 Query: 279 SYETYQIVDPGEIVFRFIDLQND----KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 V P +I+ + E + I+ + Sbjct: 64 KKFEKFKVFPNDILISLTGTVGKDDYGNACILGNSYSEYYLNQRNAKIEIIDKINKNFFL 123 Query: 335 WLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 +++ ++ K + G+RQ ++ +D+ L + +PPI+ Q + +I+ L Sbjct: 124 EIIKIKEVKKKLTGISRGIRQANISNKDIYNLSIPLPPIELQNKFAERVE----KIEKLK 179 Query: 394 EKIEQSIVLLKERRSSFIAAAV 415 +IE+SI + + S I+ Sbjct: 180 FEIEKSIEIAQNLYDSLISKYF 201 Score = 44.8 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 61/195 (31%), Gaps = 13/195 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 K+W++V ++++ + G + I I + ++ SG K Sbjct: 10 KNWEIVKLEKYINIIGGYAFKNIDFKSTGIPLIRIGNINSGQFKSTNLVFIKENKKFEKF 69 Query: 79 SIFAKGQILYGKLGPYLR-------KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +F IL G + + + + + ++ D + + ++ I Sbjct: 70 KVF-PNDILISLTGTVGKDDYGNACILGNSYSEYYLNQRNAKIEIIDKINKNFFLEIIKI 128 Query: 132 DVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 ++ I G ++ K I N+ +P+PP+ Q E++ + Sbjct: 129 KEVKKKLTGISRGIRQANISNKDIYNLSIPLPPIELQNKFAERVEKIEKLKFEIEKSIEI 188 Query: 191 FIELLKEKKQALVSY 205 L Sbjct: 189 AQNLYDSLISKYFDN 203 >gi|304373000|ref|YP_003856209.1| Restriction endonuclease S subunits [Mycoplasma hyorhinis HUB-1] gi|304309191|gb|ADM21671.1| Restriction endonuclease S subunits [Mycoplasma hyorhinis HUB-1] Length = 381 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 41/392 (10%), Positives = 112/392 (28%), Gaps = 34/392 (8%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV--SIFA 82 W ++ ++TG + + + ++ GKY + + + Sbjct: 18 WIQGKVEELFFIDTGNSK--------LTKQYIKQNLGKYPVYSSQTENNGIIGYINTYDF 69 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G+ + K + S + L + L + + + + Sbjct: 70 DGEFITWTQDGNAGKVFYRNGRFNASNSGI-----LTLNFPSKYNLKFLFLALIFLNLTK 124 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + I + + + +EKI + +D +I+ R + LL++ ++AL Sbjct: 125 LQIGGTVPHFTASMMRKVIFLIPKNKVEQEKISSIFFTLDKIISLYERKMSLLEKLQKAL 184 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 S I N ++ + ++ ++ + S Sbjct: 185 FSNIFVLNANNKPLIRFKSFFEFWEKNNISDLCKINRGNSKYTINYIQQNVGKFPVYSSQ 244 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + + + + S + + + +S +A Sbjct: 245 TQNEGISGNISTYDYDGE------------YITWTMDGVNAGTVSYRNGKFNVSSSGVLA 292 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNV 381 + +T +L L + + +L + +EQ I + Sbjct: 293 PNSNKNINT--KFLFYVLKLMNLNQENIGETIPHFTGSMMNKLEITFVKNRQEQNKIAD- 349 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + ID ++++ + L+K + S + Sbjct: 350 ---LFSNIDSTHAQLKRKLNLIKNIQKSVLNK 378 >gi|254779130|ref|YP_003057235.1| Type I restriction/modification specificity protein [Helicobacter pylori B38] gi|254001041|emb|CAX28985.1| Type I restriction/modification specificity protein [Helicobacter pylori B38] Length = 419 Score = 63.3 bits (152), Expect = 8e-08, Method: Composition-based stats. Identities = 60/391 (15%), Positives = 122/391 (31%), Gaps = 34/391 (8%) Query: 43 ESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 ++ K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 24 DNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSINSIIYSSVRPNQRHFGII 83 Query: 102 DF---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKG 153 + + ST F+V+ K + P L ++ +T ++ I C ++ Sbjct: 84 KEIPKNFLVSTAFIVIDIIDLKKLDPNYLYYYITQDKITHYLQRIAECGTSSYPSITPLD 143 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 NI + + PL Q I + +I+ ++L+ + N Sbjct: 144 FLNIKIKLYPLETQQKIARTLSVLDQKIENNHKINELLHKILELLYEQYFVRFDFLDENN 203 Query: 214 DVKMKDSGI-----EWVGLVPDHWEVKPFFALVTEL---------NRKNTKLIESNILSL 259 G E L+P+ +EVK LV N Sbjct: 204 KPYQTSGGKMKFSKELNRLIPNDFEVKTLGELVDIFSGYSFQSNTYSNNKNDYMLITNKN 263 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 ++I T N+ P+ Y +++P I+ + S + I+ Sbjct: 264 VQHSLIDLSITTNLLFLPKKLPKYCLLEPTNILITLTGHIGRCALVFS----KNCILNQR 319 Query: 320 YMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFD 377 V P + + L+R+ + +Q+L D ++ + Sbjct: 320 VGVVLPKEKELNPFYYSLIRNPLFSAILQRNAIGSSQQNLSPIDTLKIQIPF-----NHK 374 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRS 408 I + I L+ Q+ L R Sbjct: 375 IIKQYSKTCENIIKLLVSNMQATQTLTTLRD 405 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 69/178 (38%), Gaps = 13/178 (7%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + I I++ + Sbjct: 18 NNYTKEDNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSI---NSIIYSSVR 74 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 + ++ + ++++A++ + +D YL + + + + G+ Sbjct: 75 PNQRHFGIIK-EIPKNFLVSTAFIVIDIIDLKKLDPNYLYYYITQDKITHYLQRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID---VLVEKIEQSIVLLKER 406 S+ D + + + P++ Q I ++V +I+ + E + + + LL E+ Sbjct: 134 SSYPSITPLDFLNIKIKLYPLETQQKIARTLSVLDQKIENNHKINELLHKILELLYEQ 191 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 51/158 (32%), Gaps = 8/158 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKY-LPKDGNSRQS 73 IP ++V + + +G + + D + I ++V+ + + Sbjct: 223 IPNDFEVKTLGELVDIFSGYSFQSNTYSNNKNDYMLITNKNVQHSLIDLSITTNLLFLPK 282 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSID 132 + IL G R A++ + I + + V+ PK+ L + + Sbjct: 283 KLPKYCLLEPTNILITLTGHIGRCALVFSKNCILNQRVGVVLPKEKELNPFYYSLIRNPL 342 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 + ++ G++ + I +P + Sbjct: 343 FSAILQRNAIGSSQQNLSPIDTLKIQIPFNHKIIKQYS 380 >gi|322689711|ref|YP_004209445.1| hypothetical protein BLIF_1529 [Bifidobacterium longum subsp. infantis 157F] gi|320461047|dbj|BAJ71667.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 147 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 20/145 (13%), Positives = 51/145 (35%), Gaps = 6/145 (4%) Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + ++ ++ Q + E + P + + + ++ A++ + + Sbjct: 1 MWVTSQDVKQHYIENTTTMISEKGAATLTLYPSDSIVIVARSGILRHTIPVAKLRKPATV 60 Query: 317 TSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + S L + + S Y +S+ F +K ++VP I+E Sbjct: 61 NQDIKVIQTVDSCDSSWLLQYFIASNKTLLREYGKTGTTVESIDFAKMKSTALMVPYIEE 120 Query: 375 QFDITNVINVETARIDVLVEKIEQS 399 Q I + +R+D L+ ++ Sbjct: 121 QQAIGSF----FSRLDNLITLHQRK 141 >gi|3335664|gb|AAC78317.1| restriction-modification enzyme MpuUIV S subunit [Mycoplasma pulmonis] Length = 398 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 42/368 (11%), Positives = 103/368 (27%), Gaps = 16/368 (4%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVDENIAKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL-- 202 + I + + +P L Q I + I + E +K ++ Sbjct: 116 AYGYLKNYNINDFEVNLPNLKTQSAIIKIIEPKEDLFFRHKNLVRIDSEENTKKDLSILI 175 Query: 203 -VSYIVTKGLNPDVKMKDSGIEWVGLVPDHW------EVKPFFALVTELNRKNTKLIESN 255 + + K +N ++ S + + +++ F N + +S Sbjct: 176 KIIEPLEKQINAFDELILSEQKSLQHYLNYFLNKLASINPSIFKNYKLGQILNLEKGKSK 235 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + I + + + + I + Sbjct: 236 YNAKYVSQNIGIYNLYSSKTRDQGIFGKINSYDFNGEYILITTHGAYAGTVKYVNEKFST 295 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 ++ ++ I T + LK ++ V +P +K Q Sbjct: 296 TSNCFILKVNENIVKTKFLSYLLLLQEKTFNDMAIGSAYGYLKNYNINDFEVNLPNLKIQ 355 Query: 376 FDITNVIN 383 I +I Sbjct: 356 SAILGIIE 363 >gi|268611922|ref|ZP_06145649.1| hypothetical protein RflaF_20746 [Ruminococcus flavefaciens FD-1] Length = 177 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 60/143 (41%), Gaps = 10/143 (6%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYM 321 + + + + + + E Y ++ GE+ + + + K + S + E ++ Y Sbjct: 13 GWLDQKDRFSANIAGKEQENYTLLHKGELSYNHGNSKLAKYGAVFSLRTYEEALVPRVYH 72 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQ----SLKFEDVKRLPVLVPPIKEQF 376 + K D+ Y+ +L + K + SG R ++ +++ + + +P I+EQ Sbjct: 73 SFKVIEADADYIEYLFATKLPDKELGKLISSGARMDGLLNINYDEFMGISISMPSIEEQK 132 Query: 377 DITNVINVETARIDVLVEKIEQS 399 I++ + +D ++ + Sbjct: 133 KISSYL----RSLDSIITLHQHK 151 >gi|228475536|ref|ZP_04060254.1| restriction modification system DNA specificity domain protein [Staphylococcus hominis SK119] gi|228270318|gb|EEK11753.1| restriction modification system DNA specificity domain protein [Staphylococcus hominis SK119] Length = 171 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 43/172 (25%), Positives = 86/172 (50%), Gaps = 6/172 (3%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRK-NTKLIESNILSLSYGNIIQKLETRNMGL 275 MK+SGI+W+G +P +W+V + + LSL+ +I++ + + GL Sbjct: 5 MKNSGIDWIGEIPKNWKVIKTKHAFKSKKNIVKENAKKYDRLSLTMNGVIKRDKEDSHGL 64 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 +PE +ETYQI+ E++F+ IDL+N + +++ GI++ Y+ + + ++ Y + Sbjct: 65 QPEHFETYQIIYKDELIFKLIDLEN----ISTSRGNYTGIVSPVYIRLI-NPDETKYGYY 119 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + +F + SG+R SL ++ + L P E+ I ++ Sbjct: 120 YFYNMWCQHIFNFLSSGVRSSLTANNLLNVSYLKIPFDEKEKIIKILEKRFK 171 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 57/171 (33%), Gaps = 3/171 (1%) Query: 9 QYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDG 68 + K+SG+ WIG IPK+WKV+ K K E+ K + L + + Sbjct: 4 EMKNSGIDWIGEIPKNWKVIKTKHAFKSKKNIVKENAKKYDRLSLT-MNGVIKRDKEDSH 62 Query: 69 NSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + T I K ++++ + ++ GI S ++ L + + Sbjct: 63 GLQPEHFETYQIIYKDELIFKLIDLENISTSRGNYTGIVSPVYIRLI--NPDETKYGYYY 120 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 Q I S + N+ P E+ I + + Sbjct: 121 FYNMWCQHIFNFLSSGVRSSLTANNLLNVSYLKIPFDEKEKIIKILEKRFK 171 >gi|324990376|gb|EGC22314.1| EcoA family type I restriction-modification system [Streptococcus sanguinis SK353] Length = 175 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 67/175 (38%), Gaps = 12/175 (6%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLS-YGNIIQKLETRNMGLKPESYETYQIVDPGE 290 WE + + + RKN L L++S +I + N + + Y ++ GE Sbjct: 4 WEQRKLGEVAERVTRKNKNLESELPLTISAQHGLINQETFFNKKVASKDVSGYYLLKKGE 63 Query: 291 IVFRF-IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 + +++ E G++++ Y+ +P+ IDS +LA S K Sbjct: 64 FAYNKSYSSDYPWGAVKRLNNYEMGVLSTLYIVFRPNSIDSDFLAVYYDSPKWHKEVSMR 123 Query: 350 GS-GLRQ----SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQ 398 + G R ++ +D ++ P EQ I + +D L+ ++ Sbjct: 124 AAEGARNHGLLNISPQDFFDTELIFPVNHPEQAAIGSF----FQELDHLITLQQR 174 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 22/168 (13%), Positives = 45/168 (26%), Gaps = 13/168 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + T + ++ I + + K D S + K Sbjct: 4 WEQRKLGEVAERVTRKNKNLESELPLTISAQHGLINQETFFNKK--VASKDVSGYYLLKK 61 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G+ Y K G+ ST ++V +P + + L + S + + Sbjct: 62 GEFAYNKSYSSDYPWGAVKRLNNYEMGVLSTLYIVFRPNSIDSDFLAVYYDSPKWHKEVS 121 Query: 139 AICEGATMSH----ADWKGIGNIPMPIPPLAE-QVLIREKIIAETVRI 181 +H + + + P Q I I Sbjct: 122 MRAAEGARNHGLLNISPQDFFDTELIFPVNHPEQAAIGSFFQELDHLI 169 >gi|210610697|ref|ZP_03288578.1| hypothetical protein CLONEX_00768 [Clostridium nexile DSM 1787] gi|210152330|gb|EEA83336.1| hypothetical protein CLONEX_00768 [Clostridium nexile DSM 1787] Length = 189 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 55/191 (28%), Gaps = 7/191 (3%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 +P+ W + ++ N Y + + G + Y Sbjct: 1 MPESWTQGVLADIANITMGQSPSGESFNTQGNGYPFYQG---STDFGTIFPAKRMYTDKP 57 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 L E I ++ ++ ++ +L+++ Sbjct: 58 SRYAAVFDTLLSVRAPVGSLNIAYENCCIGRGLASIHGKYDNNIFVRYLLKNNKWYFDNI 117 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 S+ + + +PV++P I +++ I+ + + EQ L+ R Sbjct: 118 NNNGTTFGSITKDYLFEMPVVIPDG---KSIAMF-EQKSSLIERQIYENEQQTRKLQNLR 173 Query: 408 SSFIAAAVTGQ 418 + + GQ Sbjct: 174 DWLLPMLMNGQ 184 >gi|302336435|ref|YP_003801642.1| restriction modification system DNA specificity domain protein [Olsenella uli DSM 7084] gi|301320275|gb|ADK68762.1| restriction modification system DNA specificity domain protein [Olsenella uli DSM 7084] Length = 176 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 52/170 (30%), Gaps = 6/170 (3%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 IE VPD W ++ + + + ++ L Sbjct: 2 IEVPFDVPDSWAWVRLSSICQPQGSHRPTGKLFRYIDIDSIDNVRCKIIEPKLLSTADAP 61 Query: 282 TYQI--VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK--PHGIDSTYLAWLM 337 + V G ++F + +L + + ++ + IDS +L M Sbjct: 62 SRARRAVAKGSVLFSMVRPYLRNIALA-FDEHDGCVASTGFYVCTASSDSIDSEWLFLCM 120 Query: 338 RSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +S G S++ +D+ + V +PP EQ I + Sbjct: 121 KSDYFVNAINVHMRGDNSPSVRKDDMDEMLVPIPPQPEQNRIVREVARLL 170 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 35/169 (20%), Positives = 63/169 (37%), Gaps = 7/169 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL-PKDGNSRQSDTSTV 78 +P W V + + G +GK YI ++ +++ K + PK ++ + + Sbjct: 7 DVPDSWAWVRLSSICQPQ-GSHRPTGKLFRYIDIDSIDNVRCKIIEPKLLSTADAPSRAR 65 Query: 79 SIFAKGQILYGKLGPYLRKAII---ADFDGICSTQFLVLQ--PKDVLPELLQGWLLSIDV 133 AKG +L+ + PYLR + + ST F V + E L + S Sbjct: 66 RAVAKGSVLFSMVRPYLRNIALAFDEHDGCVASTGFYVCTASSDSIDSEWLFLCMKSDYF 125 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 I G + + +PIPP EQ I ++ + + Sbjct: 126 VNAINVHMRGDNSPSVRKDDMDEMLVPIPPQPEQNRIVREVARLLLLLQ 174 >gi|319777297|ref|YP_004136948.1| hypothetical protein MfeM64YM_0573 [Mycoplasma fermentans M64] gi|318038372|gb|ADV34571.1| Conserved Hypothetical Protein [Mycoplasma fermentans M64] Length = 332 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 39/311 (12%), Positives = 83/311 (26%), Gaps = 16/311 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGN-SRQSDTSTV 78 IP++W V ++ G K I + + + ++ N Sbjct: 4 EIPENWAWVRHNNIFEIIGGSQPPKSKFIEHEKQGYIRLYQIRDYGENPNPVYIPSKFAF 63 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQR 136 K IL + G + K A+ V + D + + + Q Sbjct: 64 KQSEKNDILLARYGASIGKVFFAENGAYNVALAKVKKMFINDWINKEFMFIFYKSSIYQT 123 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + + + + N+ MPIP L E I K I+ + + +L Sbjct: 124 LVKNNSRSAQAGFNKDDLKNLFMPIPSLNESSRIVSKWNDLNKLINEYENKENQLFKLDS 183 Query: 197 EK----KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + +++++ Y + L P ++ EL ++ Sbjct: 184 KIKDKLQKSILQYAIQGKLVKQDP---------NDEPASKLLEAIQIEKNELIKEGKIKK 234 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + G E + + E + RF ++ N Sbjct: 235 DKQESFIFQGEDKNYYEKIGSKVINITNEIPFEIPINWAWTRFKNIANLVLGKSPETNNI 294 Query: 313 RGIITSAYMAV 323 Sbjct: 295 NYWKNGVINWF 305 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 58/199 (29%), Gaps = 8/199 (4%) Query: 227 LVPDHWEVKP---FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +P++W F ++ +K IE I+ + S + Sbjct: 4 EIPENWAWVRHNNIFEIIGGSQPPKSKFIEHEKQGYIRLYQIRDYGENPNPVYIPSKFAF 63 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + + +I+ K + I+ ++ +S Sbjct: 64 KQSEKNDILLARYGASIGKVFFAE-NGAYNVALAKVKKMFINDWINKEFMFIFYKSSIYQ 122 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + +D+K L + +P + E I + N I+ K Q L Sbjct: 123 TLVKNNSRSAQAGFNKDDLKNLFMPIPSLNESSRIVSKWNDLNKLINEYENKENQLFKLD 182 Query: 404 KERR----SSFIAAAVTGQ 418 + + S + A+ G+ Sbjct: 183 SKIKDKLQKSILQYAIQGK 201 >gi|223934050|ref|ZP_03626002.1| restriction modification system DNA specificity subunit [Streptococcus suis 89/1591] gi|302024400|ref|ZP_07249611.1| restriction modification system DNA specificity subunit [Streptococcus suis 05HAS68] gi|223897277|gb|EEF63686.1| restriction modification system DNA specificity subunit [Streptococcus suis 89/1591] Length = 156 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 55/141 (39%), Gaps = 12/141 (8%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G I + ++ E+ Y+ V PG+ V Q A G+ + AY Sbjct: 25 GMIRRDEIGIDIKYDKEAVANYKRVLPGQFVIHLRSFQG-----GFAWSEIEGLTSPAYT 79 Query: 322 AVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDI 378 + +S+ ++ S + K + G+R +S+ + D L ++P + EQ I Sbjct: 80 ILDFKEENSSKFWRNVLTSPNFIKKLETVTYGIRDGRSISYSDFSTLNFVIPTLPEQEAI 139 Query: 379 TNVINVETARIDVLVEKIEQS 399 + + +D L+ ++ Sbjct: 140 GSF----FSDLDQLITLHQRK 156 Score = 36.3 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 39/153 (25%), Gaps = 7/153 (4%) Query: 32 RFTKLNTGRTSESGKDIIYIGLE-DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 K + + D+ + ++ + D + + GQ + Sbjct: 2 EIFKFVSDK---GYADLPILSASQELGMIRRDEIGIDIKYDKEAVANYKRVLPGQFVI-H 57 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG--WLLSIDVTQRIEAICEGATMSH 148 L + ++ +G+ S + +L K+ + + Sbjct: 58 LRSFQGGFAWSEIEGLTSPAYTILDFKEENSSKFWRNVLTSPNFIKKLETVTYGIRDGRS 117 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + IP L EQ I I Sbjct: 118 ISYSDFSTLNFVIPTLPEQEAIGSFFSDLDQLI 150 >gi|188024412|ref|ZP_02997072.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 7 str. ATCC 27819] gi|198273451|ref|ZP_03205987.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 4 str. ATCC 27816] gi|225551146|ref|ZP_03772092.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 8 str. ATCC 27618] gi|188018697|gb|EDU56737.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 7 str. ATCC 27819] gi|198249971|gb|EDY74751.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 4 str. ATCC 27816] gi|225378961|gb|EEH01326.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 8 str. ATCC 27618] Length = 358 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 48/392 (12%), Positives = 111/392 (28%), Gaps = 57/392 (14%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + +K G T S + I +G Y+ ++ Sbjct: 3 IKLKDIIYAKRGSTITSNEFKINPGSYPLISASAQNNGVFGYINS------------YMY 50 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR-IEAI 140 G I G D S ++ + + + I+++ Sbjct: 51 EGGHITISMNGNAGCVFYQKDKFSANSDVLVLSNIDNKISNNKFIFYWLKKHENTKIKSL 110 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 C+G T + N+ + +PP+ EQ I I I+ + + + L+ + Sbjct: 111 CKGTTRLRLSNDDVLNLEINLPPIEEQNAIISIIEPIEKVINNIKNIKFKIESLVNKYFD 170 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 L S + + +G + T I S Sbjct: 171 FLYSNLEDSNFKKYI---------LGDLF-------------------TINRGQIINSKY 202 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + I + K Y + F I Q I + Sbjct: 203 IESNIGSYPVISSNTKNNGVFGYINSYMYDGEFITISADGAYAGTVFLQNGRFSITNVCF 262 Query: 321 MAVKPHGID----STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +K + ID + ++ ++++ + R +++ +K + + +P I+ Q Sbjct: 263 ILIKNNDIDFKFSNKFVYYILKKEQEVNKLKSQVGSSRPAVREYSLKEIKINLPNIEIQE 322 Query: 377 DITNV------INVETARIDVLVEKIEQSIVL 402 + + ++ + +I+ ++ I Sbjct: 323 KFSKIVEPLLNLSTKANKIEKILNDSLLKITK 354 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 57/160 (35%), Gaps = 4/160 (2%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K+ + S I + ++ + ++ I + + + Sbjct: 6 KDIIYAKRGSTITSNEFKINPGSYPLISASAQNNGVFGYINSYMYEGGHITISMNGNAGC 65 Query: 307 SAQVMERGIITSAYMA---VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 ++ S + + ++ ++ + ++ ++ K+ R L +DV Sbjct: 66 VFYQKDKFSANSDVLVLSNIDNKISNNKFIFYWLKKHENTKIKSLCKGTTRLRLSNDDVL 125 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 L + +PPI+EQ I ++I I+ ++ I+ I L Sbjct: 126 NLEINLPPIEEQNAIISIIEPIEKVINN-IKNIKFKIESL 164 >gi|260061348|ref|YP_003194428.1| type I restriction system specificity protein [Robiginitalea biformata HTCC2501] gi|88785480|gb|EAR16649.1| type I restriction system specificity protein [Robiginitalea biformata HTCC2501] Length = 275 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 71/206 (34%), Gaps = 14/206 (6%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLP 65 DS +G IPK W+V I L +G T ++ G ++ ++ +D+ + Y+ Sbjct: 63 DSE---LGPIPKGWEVKGILEVADLLSGGTPKTRVSEYWGGNLNWVSAKDIGNEGTIYIS 119 Query: 66 KDGNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + S+ I + ++ G + II+ + + + + + Sbjct: 120 ETEKKISYLGLNNSSAKILPENTVIVVARGSVGKFGIISSPMAMNQSCYGLYSTSEF--S 177 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 +L+ ++ + + G+ + P + + I Sbjct: 178 QGTIYLIISNLIEEFKRKSYGSVFDTITTSTFKTTSVIYPQEKIIFYFNQIVDPLFKMIR 237 Query: 183 TLITERIRFIELLKEKKQALVSYIVT 208 + +TE I +L L+S V Sbjct: 238 SKVTENIMLSDLRDTLLPKLISGEVR 263 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 44/263 (16%), Positives = 83/263 (31%), Gaps = 14/263 (5%) Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSY-IVTKGLNPDVKMKDSGIEWVGLVP 229 I ID I + + L+E AL + V G D + DS +G +P Sbjct: 14 ANDIAGVLSAIDDKIENNLAMNQTLEEMAMALYKHWFVDFGPFQDGEFVDS---ELGPIP 70 Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 WEVK + L+ K S + + K + E Sbjct: 71 KGWEVKGILEVADLLSGGTPKTRVSEYWGGNLNWVSAKDIGNEGTIYISETEKKISYLGL 130 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID------STYLAWLMRSYDLC 343 I +N + V + GII+S + S +L+ S + Sbjct: 131 NNSSAKILPENTVIVVARGSVGKFGIISSPMAMNQSCYGLYSTSEFSQGTIYLIISNLIE 190 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + ++ K V+ P Q I N + ++ ++L Sbjct: 191 EFKRKSYGSVFDTITTSTFKTTSVIYP----QEKIIFYFNQIVDPLFKMIRSKVTENIML 246 Query: 404 KERRSSFIAAAVTGQIDLRGESQ 426 + R + + ++G++ L+ + Sbjct: 247 SDLRDTLLPKLISGEVRLKEFRE 269 >gi|257093458|ref|YP_003167099.1| restriction modification system DNA specificity protein-containing protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257045982|gb|ACV35170.1| restriction modification system DNA specificity domain protein [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 444 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 55/442 (12%), Positives = 137/442 (30%), Gaps = 43/442 (9%) Query: 18 IGAIPKHWKVVPIKRFT---KLNTGRTSESG---KDIIYIGLEDVESGTGKYLPK-DGNS 70 + A+P+ W ++ ++ G + I + + Sbjct: 5 LPALPEGWVYSSLEDCARANSISYGVVQPGSPVTGGVPIIRVNNFRGTRIDLSETMRVAP 64 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWL 128 + A G++L +G + A++ D + V+ P + Sbjct: 65 EIEAKYARTRLAGGEVLLTLVGSVGQVAVVPDALKGFNVARAVAVIDPLQHVSAEWIALC 124 Query: 129 LSIDVTQRIE-AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L ++Q + + + + K + +P+P+PP AE+ I + + A RI L Sbjct: 125 LRSPLSQHLLTSRANTTVQTTINLKDVRALPIPMPPAAERQTITKMVSALDDRITLLRET 184 Query: 188 RIRFIELLKEKKQALVS-----YIVTKGLNPDVKMKDS--------GIEWVGLVPDHWEV 234 + + ++ +G P+ + + +GLVP W Sbjct: 185 NATLEAIAQALFKSWFVDFDPVRAKQEGRAPEGMDEATAALFPDEFEESELGLVPRGWRS 244 Query: 235 KPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETR-NMGLKPESYETYQIVDP 288 F +T + K +I S + + +K + + + Sbjct: 245 CSFIETITVIGGGTPKTSIREYWNGHIPWFSVVDAPAVTDVFVIDTVKHITEQGLRNSST 304 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 + + R A V + + ++ D +Y + + Sbjct: 305 SLLPLGTTIISARGTVGRLALVGREMAMNQSCYGLRGKASD--DYFTYFNTYRIVETLKQ 362 Query: 349 MGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE---QSIVLLK 404 G + ++ + + + V+ P I + ++E+++ + L Sbjct: 363 RTHGSVFDTITRDTLAGVCVVYPN-------GAFITAFERTVSPVMERVKENLKQAQTLA 415 Query: 405 ERRSSFIAAAVTGQIDLRGESQ 426 R + + ++G++ L E++ Sbjct: 416 TLRDTLLPRLISGKLRLS-EAE 436 >gi|302190884|ref|ZP_07267138.1| restriction endonuclease S subunit [Lactobacillus iners AB-1] Length = 234 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 67/208 (32%), Gaps = 16/208 (7%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTK------LIESNILSLSYGNIIQKLETRNMGLKPES 279 G+ P + P L + + T + I + +I+ + Sbjct: 26 GIQPSEMQFIPLQELCKVVTKGTTPTTLGKSFTSTGINFIKAESILDNHSIDSSKFAFID 85 Query: 280 YE-----TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 E ++ +IVF + ++ + A + + YL Sbjct: 86 EETNALLKRSVIKANDIVFTIAGTLGRFAMVDNSVLPANTNQAVAIIRPDETKVTPAYLY 145 Query: 335 WLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + + ++ +L +K LP+ V +K N + + L+ Sbjct: 146 SFFIGNWHNEYYSKRIQQAVQANLSLTTIKSLPIAV--LK--NTTMNNYEKLVSPLFALM 201 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + E+ L + R + + ++G++D+ Sbjct: 202 KNNEEENRRLSKLRDTLLPRLMSGELDV 229 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 60/196 (30%), Gaps = 13/196 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 P + +P++ K+ T T+ + I +I E + K + Sbjct: 29 PSEMQFIPLQELCKVVTKGTTPTTLGKSFTSTGINFIKAESILDNHSIDSSKFAFIDEET 88 Query: 75 TS--TVSIFAKGQILYGKLGPYLRKAIIADFDGICST----QFLVLQPKDVLPELLQGWL 128 + S+ I++ G R A++ + +T + V P L + Sbjct: 89 NALLKRSVIKANDIVFTIAGTLGRFAMVDNSVLPANTNQAVAIIRPDETKVTPAYLYSFF 148 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + + A ++ I ++P+ + + + + E Sbjct: 149 IGNWHNEYYSKRIQQAVQANLSLTTIKSLPIAVLKNTTMNNYEKLVSPLFALMKNNEEEN 208 Query: 189 IRFIELLKEKKQALVS 204 R +L L+S Sbjct: 209 RRLSKLRDTLLPRLMS 224 >gi|332076171|gb|EGI86637.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA41301] Length = 332 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 33/351 (9%), Positives = 94/351 (26%), Gaps = 27/351 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEILSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + ++P + +++ + + L +K+ P Sbjct: 282 MVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKYFSPSP 331 >gi|255023374|ref|ZP_05295360.1| type I restriction endonuclease S subunit [Listeria monocytogenes FSL J1-208] Length = 221 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 53/170 (31%), Gaps = 4/170 (2%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + R+++ ++I + + K KD + D S + KG Sbjct: 35 WEQRKLGEVFNERSERSADG--ELISVTINSGVIKASKLEKKDNS--SFDKSNYKVVKKG 90 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I Y + + + + +DGI S + V+ P+ + + ++ + Sbjct: 91 DIAYNSMRMWQGASGYSSYDGILSPAYTVIYPRKDIDXIFIAYMFKKIDMIQTFQRNSQG 150 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 S ++ + + T I + R L Sbjct: 151 LTSDTWNLKFPSLSTIKIKIPANDEQIKITNLFQKLEYTSILHQNRIEML 200 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 62/159 (38%), Gaps = 12/159 (7%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I +I+ + Y++V G+I + + + S GI Sbjct: 57 ISVTINSGVIKASKLEKKDNSSFDKSNYKVVKKGDIAYNSMRMWQGASGYSSYD----GI 112 Query: 316 ITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVPP 371 ++ AY + P ID ++A++ + D+ + F GL +LKF + + + +P Sbjct: 113 LSPAYTVIYPRKDIDXIFIAYMFKKIDMIQTFQRNSQGLTSDTWNLKFPSLSTIKIKIPA 172 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 EQ ITN +++ + I +LK+ + Sbjct: 173 NDEQIKITN----LFQKLEYTSILHQNRIEMLKKVKKDL 207 >gi|283956448|ref|ZP_06373928.1| LOW QUALITY PROTEIN: hypothetical protein C1336_000250331 [Campylobacter jejuni subsp. jejuni 1336] gi|283792168|gb|EFC30957.1| LOW QUALITY PROTEIN: hypothetical protein C1336_000250331 [Campylobacter jejuni subsp. jejuni 1336] Length = 1080 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 61/176 (34%), Gaps = 5/176 (2%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 F + +RKN +I L+ + + + K + E ++ + I + Sbjct: 899 FFMGGTPSRKNINYWNGDIKWLTISDYSNRQVIMDAKEK-ITREGFKNSNAKMIQKGAVV 957 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 + R + E A +A+ P+ Y +++ + +Q++ Sbjct: 958 VSIYATIGRVGILGEDMTTNQAIVAIIPNKEFINKYLMYAIDYFKFQLYNEVIITSQQNI 1017 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ + + PP++ Q I E +++ I SI ++ + + Sbjct: 1018 NLGILQNMVIPKPPLEIQKQIV----AECEKVEEQYNTIRMSIEEYQKLIKAILQK 1069 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 56/181 (30%), Gaps = 9/181 (4%) Query: 27 VVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVS 79 +V +K G T DI ++ + D + + S Sbjct: 890 LVKLKICGDFFMGGTPSRKNINYWNGDIKWLTISDYSNRQVIMDAKEKITREGFKNSNAK 949 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG ++ + + + I D + + + P + ++ Sbjct: 950 MIQKGAVVVS-IYATIGRVGILGEDMTTNQAIVAIIPNKEFINKYLMYA-IDYFKFQLYN 1007 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + + + N+ +P PPL Q I + + +T+ + +L+K Sbjct: 1008 EVIITSQQNINLGILQNMVIPKPPLEIQKQIVAECEKVEEQYNTIRMSIEEYQKLIKAIL 1067 Query: 200 Q 200 Q Sbjct: 1068 Q 1068 >gi|116620728|ref|YP_822884.1| hypothetical protein Acid_1608 [Candidatus Solibacter usitatus Ellin6076] gi|116223890|gb|ABJ82599.1| hypothetical protein Acid_1608 [Candidatus Solibacter usitatus Ellin6076] Length = 169 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 2/90 (2%) Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +L + + + + G+ + ++ E + L V VP +EQ +I + A Sbjct: 16 QFLKYALLEGESLRRIIMETRGIVGQSNISLEQCRSLIVSVPSSQEQREIIRRVEAFFAL 75 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 D L + + + + S ++ A GQ Sbjct: 76 ADRLEARCTNAKAHVDKLTQSILSKAFRGQ 105 >gi|13508024|ref|NP_109973.1| type I restriction enzyme ecokI specificity protein [Mycoplasma pneumoniae M129] gi|12229982|sp|P75492|T1SG_MYCPN RecName: Full=Putative type-1 restriction enzyme specificity protein MPN_285; AltName: Full=S.MpnORFGP; AltName: Full=Type I restriction enzyme specificity protein MPN_285; Short=S protein gi|1674248|gb|AAB96198.1| type I restriction enzyme ecokI specificity protein [Mycoplasma pneumoniae M129] Length = 306 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 54/180 (30%), Gaps = 11/180 (6%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + ++ N I + N +++ + + P + Y + I+ Sbjct: 104 QENIRKIYGANIPFETFQIRDICEINRGREINEKYLRENPGEFPVYSSATTNGGLIGKIN 163 Query: 298 -----------LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 + E+ + ++ + +L + L Sbjct: 164 DYDFHGEYVTWTTGGAHAGNVFYRNEKFSCSQNCGLLEVKNKNKFSSKFLCFALKLQSKK 223 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + + L + + + + PP++ Q I +++ + L E I I L K++ Sbjct: 224 FVNYASAIPVLTIKRIAEIELSFPPLEIQEKIADILFAFEKLCNDLTEGIPAEIELRKKQ 283 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 53/190 (27%), Gaps = 16/190 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV---SIFA 82 + I+ ++N GR I + + G++ + F Sbjct: 118 ETFQIRDICEINRGRE---------INEKYLRENPGEFPVYSSATTNGGLIGKINDYDFH 168 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 + + G + + CS +L+ K+ + ++ + + Sbjct: 169 GEYVTWTTGGAHAGNVFYRNEKFSCSQNCGLLEVKNKNKFSSKFLCFALKLQSKKFVNYA 228 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKI---IAETVRIDTLITERIRFIELLKEKK 199 + K I I + PPL Q I + + + I I + + Sbjct: 229 S-AIPVLTIKRIAEIELSFPPLEIQEKIADILFAFEKLCNDLTEGIPAEIELRKKQLDYY 287 Query: 200 QALVSYIVTK 209 Q + V Sbjct: 288 QNFLFNWVQN 297 Score = 38.2 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 8/39 (20%), Positives = 18/39 (46%) Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + +P+ PP+K Q I +++ T L ++ + Sbjct: 1 MAEIPIDFPPLKIQEKIATILDTFTELSAELSAELSAEL 39 >gi|291516262|emb|CBK69878.1| Restriction endonuclease S subunits [Bifidobacterium longum subsp. longum F8] Length = 265 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 34/291 (11%), Positives = 83/291 (28%), Gaps = 48/291 (16%) Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + GAT+SH + I N+P+ +P EQ + E + +ID L Sbjct: 1 MLRLANGATVSHINVADIRNMPVQLPSRGEQSKVAELLNVLDDKIDLNNRLNDYLANLC- 59 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 E + + + ++ + Sbjct: 60 --------------------------ETIASRYCNDRNSRLRDICYQVADHVDYDNANQE 93 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 +S +++Q R + + G+ + I K + G Sbjct: 94 TYVSTESLMQNKGGRQLASSLPTTGKITRYKAGDTLISNIRPYFKKIWYAPFE----GTC 149 Query: 317 TSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKE 374 + + + + + +R G + + V Sbjct: 150 SGDVIVFRANDPSNAPYLHACLRQDSFFDYVMQGAKGTKMPRGDKKQMMEFKV------- 202 Query: 375 QFDITNVINVET-ARIDVLVEK---IEQSIVLLKERRSSFIAAAVTGQIDL 421 + + E +D ++++ + I L++ R + + ++G+ID+ Sbjct: 203 ----ASSCSAEDLILLDSVIKQRSDNDSEITKLQKLRDTLLPKLMSGEIDV 249 Score = 40.2 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 34/131 (25%), Gaps = 5/131 (3%) Query: 29 PIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 ++ ++ Y+ E + G + G L Sbjct: 73 RLRDICYQVADHVDYDNANQETYVSTESLMQNKGGRQLASSLPTTGKITRYK---AGDTL 129 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICEGATM 146 + PY +K A F+G CS +V + D + +G M Sbjct: 130 ISNIRPYFKKIWYAPFEGTCSGDVIVFRANDPSNAPYLHACLRQDSFFDYVMQGAKGTKM 189 Query: 147 SHADWKGIGNI 157 D K + Sbjct: 190 PRGDKKQMMEF 200 >gi|282882445|ref|ZP_06291069.1| HsdA [Peptoniphilus lacrimalis 315-B] gi|281297710|gb|EFA90182.1| HsdA [Peptoniphilus lacrimalis 315-B] Length = 207 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 24/211 (11%), Positives = 68/211 (32%), Gaps = 11/211 (5%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 L D S G +PD W + ++ + K L + + + Sbjct: 3 LYKDWFFDFSPFSTEGNLPDSWRIGTVGDIIQFHDSKRVPLSGAERDKMEKIYPYYGATS 62 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + ++ ++ + + N + + + A++ Sbjct: 63 LMDYVDNYLFDGIYLLLGED----GTVVDNLGFPILQYVYGQFWVNNHAHIITGKEDFSV 118 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L L R L + + ++Q + +++K++P ++P + + I Sbjct: 119 EELYLLFR---LTNIKSIVTGAVQQKVSQQNLKKVPAIIPSKES----LRTFDDLIQPIF 171 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + L + R + + ++G++D+ Sbjct: 172 AQIRNLRDENTRLADLRDTLLPRLMSGELDV 202 >gi|295401869|ref|ZP_06811833.1| N-6 DNA methylase [Geobacillus thermoglucosidasius C56-YS93] gi|312110990|ref|YP_003989306.1| N-6 DNA methylase [Geobacillus sp. Y4.1MC1] gi|294976123|gb|EFG51737.1| N-6 DNA methylase [Geobacillus thermoglucosidasius C56-YS93] gi|311216091|gb|ADP74695.1| N-6 DNA methylase [Geobacillus sp. Y4.1MC1] Length = 643 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 71/187 (37%), Gaps = 13/187 (6%) Query: 26 KVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +V I ++ G S G+ I + D+E+G ++ D Q+ Sbjct: 446 NLVQIGDIAEVIRGVNLPSRRQIENTDGELFPVIQIRDIENGEIRFETIDEFPIQTRDVQ 505 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGIC--STQFLV---LQPKDVLPELLQGWLLSID 132 G IL G + A++ ++DG+ S F++ K+V P ++ +L S Sbjct: 506 RVTAQPGDILVSSRGTQQKIAVVPEYDGMILVSNMFIIIRLHSTKEVDPVYVKRFLESPI 565 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 EA G+ + I +I +P+ P+ +Q + ++ I ER + Sbjct: 566 GQYFFEAHQSGSIATVLTPNDIRSIELPLLPIEQQQEMIRQLEEADELIRKAYEERKKKY 625 Query: 193 ELLKEKK 199 +K Sbjct: 626 FDAYQKF 632 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 56/178 (31%), Gaps = 6/178 (3%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G + + + N I + G I + ET + Sbjct: 450 IGDIAEVIRGVNLPSRRQIENTDGELFPVIQIRDIENGEI--RFETIDEFPIQTRDVQRV 507 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDL 342 PG+I+ Q K ++ + + +D Y+ + S Sbjct: 508 TAQPGDILVSSRGTQ-QKIAVVPEYDGMILVSNMFIIIRLHSTKEVDPVYVKRFLESPIG 566 Query: 343 CKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 F A SG + L D++ + + + PI++Q ++ + I E+ ++ Sbjct: 567 QYFFEAHQSGSIATVLTPNDIRSIELPLLPIEQQQEMIRQLEEADELIRKAYEERKKK 624 >gi|188577904|ref|YP_001914833.1| hypothetical protein PXO_02076 [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522356|gb|ACD60301.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 292 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 14/87 (16%), Positives = 36/87 (41%), Gaps = 2/87 (2%) Query: 336 LMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + S + + +L ++ +PV +PP++EQ I ++ A D Sbjct: 5 YLNSPVGMAHMRRLAITTSGLFNLSVGKIRSIPVALPPLEEQSRIVAKVDQLMALCDQFK 64 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ ++ + + ++ I A+ G+ Sbjct: 65 SRLSEARRVHEHLANALIGQALNGEKK 91 >gi|149200914|ref|ZP_01877889.1| Restriction modification system DNA specificity domain [Roseovarius sp. TM1035] gi|149145247|gb|EDM33273.1| Restriction modification system DNA specificity domain [Roseovarius sp. TM1035] Length = 294 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 5/90 (5%) Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQ 375 +A + + +D+ YL + +R G + L ++ + PP EQ Sbjct: 14 NAAKLTDISNDVDARYLMYFLRGATGQAAMANQTGGTSQPKLALYRIEEIRFPCPPRGEQ 73 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKE 405 I ++++ D L+E + I LL+E Sbjct: 74 QAIVSILSA----YDDLIENNRRRIALLEE 99 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 38/115 (33%), Gaps = 14/115 (12%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W+ R +LN G+ ++ E+ P G+S Q T ++ Sbjct: 126 LPEGWERRDFGRVAQLNYGKALKA------------ENRVDGPFPVYGSSGQVGTHDKAL 173 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 I+ G+ G + T + + K+ L L +I Sbjct: 174 VEAPAIVVGRKGNVGSVYWCPENFWPIDTAYFI--SKEQSDYWLYLTLPNIGFQN 226 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 40/314 (12%), Positives = 77/314 (24%), Gaps = 33/314 (10%) Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + L DV L +L + G + I I P PP EQ Sbjct: 14 NAAKLTDISNDVDARYLMYFLRGATGQAAMANQTGGTSQPKLALYRIEEIRFPCPPRGEQ 73 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 I + A I+ R I LL+E + L P + + + Sbjct: 74 QAIVSILSAYDDLIEN----NRRRIALLEEAARLLYREWFVHFRFPGHE-HVPLTDGLPE 128 Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 + + L K ++ + Sbjct: 129 GWERRDFGRVAQLNYGKALKAENRVDGPFPVYGSSGQVG-------------------TH 169 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 +V + K ++ S +L + + Sbjct: 170 DKALVEAPAIVVGRKGNVGSVYWCPENFWPIDTAYFISKEQSDYWLYLTLPNIGFQN--- 226 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 L + V+VP K + + + +I +L ++ L + R Sbjct: 227 --TDSGVPGLNRDFAYSRKVIVPSEKLRREFNLSVQPMLEQIQLLGSYNQK----LAQAR 280 Query: 408 SSFIAAAVTGQIDL 421 + + G+I + Sbjct: 281 DLLLPRLMNGEIAV 294 >gi|157829186|ref|YP_001495428.1| hypothetical protein A1G_07405 [Rickettsia rickettsii str. 'Sheila Smith'] gi|165933913|ref|YP_001650702.1| type I restriction-modification system specificity subunit [Rickettsia rickettsii str. Iowa] gi|157801667|gb|ABV76920.1| hypothetical protein A1G_07405 [Rickettsia rickettsii str. 'Sheila Smith'] gi|165909000|gb|ABY73296.1| type I restriction-modification system specificity subunit [Rickettsia rickettsii str. Iowa] Length = 84 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 42/83 (50%), Gaps = 1/83 (1%) Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + YL ++++S GSG + + +D++ L + +PP++EQ + ++ + Sbjct: 1 MLTKYLYYILKSQQNIIYQKQAGSG-QPHVYLKDLEDLQIPIPPLEEQQKMVTELDNNQS 59 Query: 388 RIDVLVEKIEQSIVLLKERRSSF 410 +ID L I+Q LK +S Sbjct: 60 KIDNLKNYIKQFENKLKTTLNSL 82 >gi|291613558|ref|YP_003523715.1| restriction modification system DNA specificity domain protein [Sideroxydans lithotrophicus ES-1] gi|291583670|gb|ADE11328.1| restriction modification system DNA specificity domain protein [Sideroxydans lithotrophicus ES-1] Length = 815 Score = 62.9 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 56/383 (14%), Positives = 110/383 (28%), Gaps = 51/383 (13%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 P + L G + + G Y N + I+ Sbjct: 462 PFESVCTLEYGSSLPKSE-----------RRDGPYPVLGSNGITG-YHNKFLIEGPAIVI 509 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 G+ G +A+ T + V ++ + + + GA + Sbjct: 510 GRKGSAGEVTYVAENCFPIDTTYYVKPVNPEASDIRYLYQVLKTLKLTDLK--GGAGIPG 567 Query: 149 ADWKGIGN-IPMPIPPLAEQVLIREKIIAETVRI---DTLITERIRFIELLKEKKQALVS 204 + K + +P+PPLA Q I E+I I ++ I L ++ + Sbjct: 568 LNRKDVYEAHQIPLPPLAIQKEIVEEIEGYQKIIDGARQVVENYRPSINLQRDWPVVALG 627 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 +VT G K WVG +P ++ + Sbjct: 628 EVVTTGSG-GTPSKQEANFWVGNIP--------------WVSPKDMKVDFLV-------- 664 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 ++ S ++V G ++ + A +A++ Sbjct: 665 ---DTEDHISEAAISSSATKLVPSGTLLCVVRSGILQH-TFPVALTTRPMAFNQDIVAIQ 720 Query: 325 PH--GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +D YL ++ ++ + + G QS K + +P ++ Q I Sbjct: 721 SDGGKLDIRYLFYIFKAKSNEILAAGIKPGVTVQSFHSGFFKAYQLPLPDLQTQRTIVAE 780 Query: 382 INVETARID---VLVEKIEQSIV 401 I E I+ L+ + E I Sbjct: 781 IEAEQTLINANKQLIARFEAKIQ 803 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 49/172 (28%), Gaps = 11/172 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W VV + +G T + +I ++ +D++ + +S Sbjct: 620 DWPVVALGEVVTTGSGGTPSKQEANFWVGNIPWVSPKDMKVDFLVDTEDHISEAAISSSA 679 Query: 78 VSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPK--DVLPELLQGWLLSID 132 + G +L L + + + +Q + L + Sbjct: 680 TKLVPSGTLLCVVRSGILQHTFPVALTTRPMAFNQDIVAIQSDGGKLDIRYLFYIFKAKS 739 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 I G T+ +P+P L Q I +I AE I+ Sbjct: 740 NEILAAGIKPGVTVQSFHSGFFKAYQLPLPDLQTQRTIVAEIEAEQTLINAN 791 >gi|319896579|ref|YP_004134772.1| restriction modification enzyme [Haemophilus influenzae F3031] gi|317432081|emb|CBY80431.1| Restriction modification enzyme [Haemophilus influenzae F3031] Length = 166 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 48/138 (34%), Gaps = 10/138 (7%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHGIDSTYLAWLM 337 +Y +++ I + A + GI + + +L + + Sbjct: 31 SYTYFRENDVIIAKITPCMENGKCALAIGLSNGIGMGSSEFHVFRANENKVFPFFLFYSL 90 Query: 338 RSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + K GS + + + L + +P + EQ I N IN I+ + + Sbjct: 91 NRESIRKEAERNMTGSSGHRRVPISFYEDLEISLPDLNEQQSIVNQIN----EIETQISE 146 Query: 396 IEQSIVLLKERRSSFIAA 413 +E+ + ++ + + + Sbjct: 147 LEKVLENSRQEKKAVLDK 164 Score = 36.3 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 67/169 (39%), Gaps = 13/169 (7%) Query: 48 IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK------AIIA 101 + ++ + V + D + + F + ++ K+ P + ++ Sbjct: 2 VSFVEMSSVSNFGFIENKIDKTLGSLRKGSYTYFRENDVIIAKITPCMENGKCALAIGLS 61 Query: 102 DFDGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIP 158 + G+ S++F V + + P L L + + E G++ ++ Sbjct: 62 NGIGMGSSEFHVFRANENKVFPFFLFYSLNRESIRKEAERNMTGSSGHRRVPISFYEDLE 121 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + +P L EQ I +I I+T I+E + +E +++K+A++ + Sbjct: 122 ISLPDLNEQQSIVNQINE----IETQISELEKVLENSRQEKKAVLDKWL 166 >gi|308190350|ref|YP_003923281.1| type I site-specific deoxyribonuclease [Mycoplasma fermentans JER] gi|319777747|ref|YP_004137398.1| type i restriction enzyme specificity protein [Mycoplasma fermentans M64] gi|307625092|gb|ADN69397.1| type I site-specific deoxyribonuclease [Mycoplasma fermentans JER] gi|318038822|gb|ADV35021.1| Type I restriction enzyme specificity protein [Mycoplasma fermentans M64] Length = 362 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 43/398 (10%), Positives = 118/398 (29%), Gaps = 41/398 (10%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +V +K GR + V G + + + G + Sbjct: 2 LVKLKDIVTFINGRAYSQPELQDKGKYRIVRVGNFS-GKNEWFYSDMELNEDKYCENGDL 60 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 LY K I I ++ + + + + L + +T + G+ M Sbjct: 61 LY-KWACNFGPEIWKSEKTIFHYHIWKIKWDEKRVDKMFLYYLLMYMTPYWLSSTNGSIM 119 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 H + + + +PPL Q I + + +I+ + R + + ++ Sbjct: 120 IHITKETMEEKIVDLPPLKTQKKISKILENLDKQIEKNLHIVKRLQVMGQAIFDMFLNNA 179 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 D+ ++ ++ K ++ N+ + Sbjct: 180 K----------------------DYENIESLCKIIWGQCPKGNNILSENVSNNLMLYASG 217 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + N + P +I+ + ++ + ++ I M + Sbjct: 218 AGDLENNKILIS--PKAFTDKPIKIIDNRTICMSIAGTVGKIGISDKNIAIGRAMVGFYN 275 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 L + + + + +++ + + + V + + N + Sbjct: 276 EK-KFGLIYFILNKYSSFLKRQSIGAIQKIINKNHLNIVNVPI-----------LTNEKN 323 Query: 387 ARIDVLVE---KIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ L+ K+E++ + L + + I + GQI++ Sbjct: 324 NLLNELITKCMKLEKNTLSLIKLKEKLIPLLINGQIEI 361 >gi|262369880|ref|ZP_06063207.1| sty SBLI [Acinetobacter johnsonii SH046] gi|262314919|gb|EEY95959.1| sty SBLI [Acinetobacter johnsonii SH046] Length = 434 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 55/442 (12%), Positives = 123/442 (27%), Gaps = 73/442 (16%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 VP+ F L G I +G + + A G ++ Sbjct: 7 VPLNEFILLQRGFDLPQSDRI-----------SGDIPVVASTGVAGFHNEYKVDAPG-VV 54 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G I + +T V K + L SID G + Sbjct: 55 IGRSGSIGGGQYIKEKFWPLNTTLWVKDFKGHDARYVYYLLKSIDFH----RFNVGTGVP 110 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY-- 205 + + ++ + + +I + + +I + + ++ Sbjct: 111 TLNRNHLSSVLVKNLGYINEKVIAKTLGDLDDKIHLNNQINQTLESIAQALFKSWFIDFD 170 Query: 206 -------IVTKGLNPDVKMKD-----SGIE--------------------------WVGL 227 +G NP+ S +E +G Sbjct: 171 PVRAKIVAKQEGNNPEFAAMCVISGKSEVELQQMAEDDLAELRATAALFPDELVESELGE 230 Query: 228 VPDHWEVKP---FFALVTELNRKNTKLIESNILSLSYGN------IIQKLETRNMGLKPE 278 VP WEV + K + I L + + + L+ Sbjct: 231 VPKGWEVTRFSNIVEKYIDNRGKTPPIQSEGIPLLEVKHLPEFSLNPDLNTDKKVSLETF 290 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + +++ + + + A + K + ++ ++ + M Sbjct: 291 NTWFRAHLQENDLIMSTVGTIG-RLCIVPANRTLAIAQNILGLRFKLNKVNPLFMYYQMN 349 Query: 339 SYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S A + ++ S+K +D++ + +L P IK Q I + + Sbjct: 350 SAKFRNDVDARLVITVQSSIKRKDLETIDLLQPDIKIQNIFAEKIKPFV------LSQQS 403 Query: 398 QSIVLLKERRSSFIAAAVTGQI 419 + L + R + + ++G+I Sbjct: 404 DESLKLIDIRDALLPKLLSGEI 425 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 54/170 (31%), Gaps = 10/170 (5%) Query: 18 IGAIPKHWKVVPIKRFTKL---NTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +G +PK W+V + N G+T + I + ++ + + Sbjct: 228 LGEVPKGWEVTRFSNIVEKYIDNRGKTPPIQSEGIPLLEVKHLPEFSLNPDLNTDKKVSL 287 Query: 74 DTSTVS---IFAKGQILYGKLGPYLRKAIIA-DFDGICSTQF--LVLQPKDVLPELLQGW 127 +T + ++ +G R I+ + + L + V P + Sbjct: 288 ETFNTWFRAHLQENDLIMSTVGTIGRLCIVPANRTLAIAQNILGLRFKLNKVNPLFMYYQ 347 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + S ++A S K + I + P + Q + EKI Sbjct: 348 MNSAKFRNDVDARLVITVQSSIKRKDLETIDLLQPDIKIQNIFAEKIKPF 397 >gi|238810193|dbj|BAH69983.1| hypothetical protein [Mycoplasma fermentans PG18] Length = 363 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 43/398 (10%), Positives = 118/398 (29%), Gaps = 41/398 (10%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +V +K GR + V G + + + G + Sbjct: 3 LVKLKDIVTFINGRAYSQPELQDKGKYRIVRVGNFS-GKNEWFYSDMELNEDKYCENGDL 61 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 LY K I I ++ + + + + L + +T + G+ M Sbjct: 62 LY-KWACNFGPEIWKSEKTIFHYHIWKIKWDEKRVDKMFLYYLLMYMTPYWLSSTNGSIM 120 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 H + + + +PPL Q I + + +I+ + R + + ++ Sbjct: 121 IHITKETMEEKIVDLPPLKTQKKISKILENLDKQIEKNLHIVKRLQVMGQAIFDMFLNNA 180 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 D+ ++ ++ K ++ N+ + Sbjct: 181 K----------------------DYENIESLCKIIWGQCPKGNNILSENVSNNLMLYASG 218 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + N + P +I+ + ++ + ++ I M + Sbjct: 219 AGDLENNKILIS--PKAFTDKPIKIIDNRTICMSIAGTVGKIGISDKNIAIGRAMVGFYN 276 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 L + + + + +++ + + + V + + N + Sbjct: 277 EK-KFGLIYFILNKYSSFLKRQSIGAIQKIINKNHLNIVNVPI-----------LTNEKN 324 Query: 387 ARIDVLVE---KIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ L+ K+E++ + L + + I + GQI++ Sbjct: 325 NLLNELITKCMKLEKNTLSLIKLKEKLIPLLINGQIEI 362 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 16/161 (9%), Positives = 48/161 (29%), Gaps = 5/161 (3%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE-SYETYQIVDPGE 290 + +VT +N + E + +N + + + G+ Sbjct: 1 MMLVKLKDIVTFINGRAYSQPELQDKGKYRIVRVGNFSGKNEWFYSDMELNEDKYCENGD 60 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 +++++ + + I + + + Y + Sbjct: 61 LLYKWACNFGPEIWKSEKTIFHYHI----WKIKWDEKRVDKMFLYYLLMYMTPYWLSSTN 116 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + E ++ V +PP+K Q I+ ++ +I+ Sbjct: 117 GSIMIHITKETMEEKIVDLPPLKTQKKISKILENLDKQIEK 157 >gi|293572023|ref|ZP_06683035.1| type I restriction-modification system specificity subunit [Enterococcus faecium E980] gi|291607885|gb|EFF37195.1| type I restriction-modification system specificity subunit [Enterococcus faecium E980] Length = 187 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 41/118 (34%), Gaps = 1/118 (0%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 S ++ +I+ K L E + + S Y+ Sbjct: 67 ISNSKLVDLRLEENDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVYC 126 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K+ + G + ++ ++ +L + +PP++EQ +T I + I + Sbjct: 127 FLDSPLYWKLLEKISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRRI 184 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 7/164 (4%) Query: 27 VVPIKRFT-KLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V + + K+ G T + K ++ ++ + D++ G + + + Sbjct: 20 WVYLGSISTKIQYGYTDSAKKQGNVKFLRITDIQEGRVNWSSVPYCDISNSKLVDLRLEE 79 Query: 84 GQILYGKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 IL + G + K+ I++ S + + +L E + +L S + +E Sbjct: 80 NDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVYCFLDSPLYWKLLEK 139 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 I G + + + + +P+PPL EQ + KI I Sbjct: 140 ISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRR 183 >gi|332073217|gb|EGI83696.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17570] Length = 332 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/351 (9%), Positives = 94/351 (26%), Gaps = 27/351 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKLQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + ++P + +++ + + L +K+ P Sbjct: 282 MVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKYFSPSP 331 >gi|327404777|ref|YP_004345615.1| restriction modification system DNA specificity domain-containing protein [Fluviicola taffensis DSM 16823] gi|327320285|gb|AEA44777.1| restriction modification system DNA specificity domain protein [Fluviicola taffensis DSM 16823] Length = 397 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 54/319 (16%), Positives = 97/319 (30%), Gaps = 15/319 (4%) Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR-----KAIIADFDGICSTQFLVLQPK 117 ++P N +D ST + K Q YG + + + I S + V + K Sbjct: 36 FMPSIANIIGTDMSTYKLIRKKQFAYGPVTSRNGDKISIAILDDLDEAIVSQAYTVFEIK 95 Query: 118 DV---LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 D PE L W + + G+ DW + +P+P + +Q I Sbjct: 96 DFNELDPEYLMMWFRRPEFDRYARFKSHGSAREIFDWTEMSETELPVPNIEKQREIVR-- 153 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVGLVPDH 231 E I I+ + I+ L+E QA+ + P K SG + V Sbjct: 154 --EYNTIVNRISLNEQLIQKLEETAQAIYKQWFVEFEFPYENGKPYKSSGGKMVWCEELE 211 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 E+ + + T N LS + + + Y I D + Sbjct: 212 KEIPRGWEVKTLDNFCECLDNLRKPLSGIQRGTKKGVYPYFGAMSIIDYIDSYIYDGVFL 271 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + R A G A GI+ ++ + + Sbjct: 272 LVSEDGANVVDEFGRPATQYVWGKFWLNNHAHILKGINPYSTEFIKLGLSFINASHLVTG 331 Query: 352 GLRQSLKFEDVKRLPVLVP 370 + + ++ + +L P Sbjct: 332 AAQPKINQNNLMSIELLKP 350 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 52/142 (36%), Gaps = 9/142 (6%) Query: 274 GLKPESYETYQIVDPGEIVFRF-IDLQNDKRSLRSAQVMERGIITSAYMAV---KPHGID 329 + TY+++ + + DK S+ ++ I++ AY + +D Sbjct: 42 NIIGTDMSTYKLIRKKQFAYGPVTSRNGDKISIAILDDLDEAIVSQAYTVFEIKDFNELD 101 Query: 330 STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL R + + G R+ + ++ + VP I++Q +I N R Sbjct: 102 PEYLMMWFRRPEFDRYARFKSHGSAREIFDWTEMSETELPVPNIEKQREIVREYNTIVNR 161 Query: 389 IDVLVEKIEQSIVLLKERRSSF 410 + EQ I L+E + Sbjct: 162 ----ISLNEQLIQKLEETAQAI 179 Score = 36.7 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 56/191 (29%), Gaps = 24/191 (12%) Query: 6 AYPQ---YKDSG--VQWIGA----IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV 56 Y YK SG + W IP+ W+V + F + D + L + Sbjct: 190 PYENGKPYKSSGGKMVWCEELEKEIPRGWEVKTLDNFCECL---------DNLRKPLSGI 240 Query: 57 ESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQF 111 + GT K P G D I+ +L + G + G Sbjct: 241 QRGTKKGVYPYFGAMSIIDYIDSYIYDGVFLLVSEDGANVVDEFGRPATQYVWGKFWLNN 300 Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 K + P + L + + GA + + +I + P + V Sbjct: 301 HAHILKGINPYSTEFIKLGLSFIN-ASHLVTGAAQPKINQNNLMSIELLKPGKSVLVEFN 359 Query: 172 EKIIAETVRID 182 + I +I Sbjct: 360 KLIKPLFNQIM 370 >gi|75675445|ref|YP_317866.1| Type I restriction enzyme EcoAI specificity protein [Nitrobacter winogradskyi Nb-255] gi|74420315|gb|ABA04514.1| Type I restriction enzyme EcoAI specificity protein [Nitrobacter winogradskyi Nb-255] Length = 597 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 25/125 (20%), Positives = 46/125 (36%), Gaps = 4/125 (3%) Query: 278 ESYETYQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E + Y G++ I N K ++ G T+ V+P +D Y+ Sbjct: 139 EIKKGYTHFAEGDVGLAKITPCFENGKSTVFRNLTGGIGTGTTELHIVRPLFVDQDYILL 198 Query: 336 LMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++S + + G+ ++ + E P +PP+ EQ I ++ D L Sbjct: 199 FLKSPHFIETGIPRMTGTAGQKRVPTEYFAHSPFPLPPLAEQHRIVAKVDALMGLCDRLK 258 Query: 394 EKIEQ 398 EQ Sbjct: 259 TAREQ 263 Score = 43.6 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 69/200 (34%), Gaps = 7/200 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP +W+ + L+ + + ++ + + + G + + Sbjct: 87 IPSNWRWSQLAEIGVLSPRNEAPDTLEASFVPMPLIAAEYGVANQHEIRPWGEIKKGYTH 146 Query: 81 FAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 FA+G + K+ P + G +T+ +++P V + + +L S Sbjct: 147 FAEGDVGLAKITPCFENGKSTVFRNLTGGIGTGTTELHIVRPLFVDQDYILLFLKSPHFI 206 Query: 135 QRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G + + P P+PPLAEQ I K+ A D L T R + Sbjct: 207 ETGIPRMTGTAGQKRVPTEYFAHSPFPLPPLAEQHRIVAKVDALMGLCDRLKTAREQRET 266 Query: 194 LLKEKKQALVSYIVTKGLNP 213 + A ++ + P Sbjct: 267 VRDRLAAASLARLNAPDPEP 286 Score = 43.2 bits (100), Expect = 0.078, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 69/188 (36%), Gaps = 6/188 (3%) Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 PD +EW ++ + + + + ++ G + ++ Sbjct: 384 TPDELNMPIPVEWAVQSFENLFLF-IDYRGNTPPKTDEGIPLITAKNIRMGYLNREPREF 442 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDS 330 +++ T + G++ F + ++ + E + + +P+ ID+ Sbjct: 443 ISKATFKTWMTRGFPEIGDLFFT---TEAPLANVCLNDIEEPFALAQRAICFQPYAKIDT 499 Query: 331 TYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L + + S + + +G + +K +K LP+ +PP+ EQ I ++ A Sbjct: 500 KFLMFALMSDVMQSLIDKHATGMTAKGIKAAKLKPLPIPIPPLAEQHRIVAKVDELMALC 559 Query: 390 DVLVEKIE 397 D L + Sbjct: 560 DRLEASLT 567 >gi|313143599|ref|ZP_07805792.1| restriction modification enzyme [Helicobacter cinaedi CCUG 18818] gi|313128630|gb|EFR46247.1| restriction modification enzyme [Helicobacter cinaedi CCUG 18818] Length = 1211 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/349 (10%), Positives = 84/349 (24%), Gaps = 17/349 (4%) Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + ++ L +AI D S F P Sbjct: 855 FKETSDYKKLVESKAYKDSKDKDTLTHNAFLAYARAIEKDKLLYFSLSFNQAPIIIKAPN 914 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + EG + Q K+ + Sbjct: 915 DNKEQKRFLGYEWSNRKGDEG-----LKELNSPYLSPLFERDNPQNE--NKLAHLIRQAF 967 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I+ I K L+ + ++ + + + I G + + Sbjct: 968 LEISSPIPQDLSPYAFKAKLIDMLDFSKVDFNKAISLNPINSQGEGKAQNPFENCKYELV 1027 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 +L I + Q G+ + E+ Sbjct: 1028 KLESVCKMYQPQTITAKEILEQGQYKVYGANGVI--GFYDKYNHKDAEVAMTCRGAT--- 1082 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 IT M + P + +L+ L + + + + ++ Sbjct: 1083 -CGTINFTEPESWITGNAMIITPLEKNLILKKFLIYILPLSNIKSVITGAAQPQITRTNL 1141 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 +L + +PP++ Q I E +++ I SI +E + + Sbjct: 1142 SQLKIPLPPLEIQTQIV----AECEKVEEQYNTIRMSIEKYQELIKAIL 1186 >gi|167631093|ref|YP_001681592.1| type i restriction modification DNA specificity domain protein [Heliobacterium modesticaldum Ice1] gi|167593833|gb|ABZ85581.1| type i restriction modification DNA specificity domain protein [Heliobacterium modesticaldum Ice1] Length = 205 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 70/183 (38%), Gaps = 7/183 (3%) Query: 28 VPIKRFTKLNTGRTSESGKDII----YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V +K ++ G++ + + +++ G Y + + Sbjct: 22 VKLKDMAEVFRGKSVLKKDIKPGRIAVLNISNIDDGEINYTDLETIDEEERKVKRYELVD 81 Query: 84 GQILYGKLGPYLRKAII--ADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDVTQRIEAI 140 G ++ G ++ AI D I S +V++P K+VL E ++ + S T I++ Sbjct: 82 GDLVLTCRGTTIKVAIFRQQDRLIIASANVIVIRPQKEVLSEYIKLFFESPVGTSLIKSY 141 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G T+ + + I + +P+ PL +Q + + E + + + +E Sbjct: 142 QRGTTIMNLNHSDIAEMEIPLAPLEQQRQMIDAYRREQTLYRQALQQAEQRWREAREDIY 201 Query: 201 ALV 203 A + Sbjct: 202 AKM 204 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 48/170 (28%), Gaps = 8/170 (4%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGN--IIQKLETRNMGLKPESYE----TYQIVDPG 289 + K+ + ++ N I E L+ E + G Sbjct: 23 KLKDMAEVFRGKSVLKKDIKPGRIAVLNISNIDDGEINYTDLETIDEEERKVKRYELVDG 82 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++V R + + + + S Y+ S + + Sbjct: 83 DLVLTCRGTTIKVAIFRQQDRLIIASANVIVIRPQ-KEVLSEYIKLFFESPVGTSLIKSY 141 Query: 350 G-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 +L D+ + + + P+++Q + + E +++ EQ Sbjct: 142 QRGTTIMNLNHSDIAEMEIPLAPLEQQRQMIDAYRREQTLYRQALQQAEQ 191 >gi|268323780|emb|CBH37368.1| hypothetical protein BSM_08450 [uncultured archaeon] Length = 134 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 34/91 (37%), Gaps = 1/91 (1%) Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ++ +L ++S SG + L +K + +PP Q I N I Sbjct: 1 MEPDFLINYIQSPIFILQHKQKKSGTAQPQLPVGTLKEFEIPLPPKDIQQKINNEIARRI 60 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + + + ++ S+ + R S + A G Sbjct: 61 SICNNIQSTVKDSLQKSEALRQSILKRAFEG 91 >gi|224437132|ref|ZP_03658113.1| type II restriction-modification enzyme [Helicobacter cinaedi CCUG 18818] Length = 1171 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/349 (10%), Positives = 84/349 (24%), Gaps = 17/349 (4%) Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + + ++ L +AI D S F P Sbjct: 815 FKETSDYKKLVESKAYKDSKDKDTLTHNAFLAYARAIEKDKLLYFSLSFNQAPIIIKAPN 874 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + EG + Q K+ + Sbjct: 875 DNKEQKRFLGYEWSNRKGDEG-----LKELNSPYLSPLFERDNPQNE--NKLAHLIRQAF 927 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I+ I K L+ + ++ + + + I G + + Sbjct: 928 LEISSPIPQDLSPYAFKAKLIDMLDFSKVDFNKAISLNPINSQGEGKAQNPFENCKYELV 987 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 +L I + Q G+ + E+ Sbjct: 988 KLESVCKMYQPQTITAKEILEQGQYKVYGANGVI--GFYDKYNHKDAEVAMTCRGAT--- 1042 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 IT M + P + +L+ L + + + + ++ Sbjct: 1043 -CGTINFTEPESWITGNAMIITPLEKNLILKKFLIYILPLSNIKSVITGAAQPQITRTNL 1101 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 +L + +PP++ Q I E +++ I SI +E + + Sbjct: 1102 SQLKIPLPPLEIQTQIV----AECEKVEEQYNTIRMSIEKYQELIKAIL 1146 >gi|219883432|ref|YP_002478592.1| hypothetical protein Cyan7425_5291 [Cyanothece sp. PCC 7425] gi|219867578|gb|ACL47914.1| hypothetical protein Cyan7425_5291 [Cyanothece sp. PCC 7425] Length = 555 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 45/417 (10%), Positives = 118/417 (28%), Gaps = 38/417 (9%) Query: 30 IKRFTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS---TVSIFA 82 +K + T T K + ++ +++ + + + S Sbjct: 42 LKDLCEFITDGTHVTPKYQQKGVKFLSSTNIDPFSIDFDNTNHISESEHLKLGQQKCNPE 101 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G IL K G A+ D CS V + + + + Sbjct: 102 PGDILISKNGRIGTVAVYRDSHQSCSLFVSVALLRYRGNVDIDFITAFSNSSGGWYQFTR 161 Query: 143 GATMSHADWKGIGNIPMPI---------PPLAEQVLIREKIIAETVRIDTLIT------E 187 A + I + + ++V E++ + + + I Sbjct: 162 SAKTGVITNLHLEEIREVLVPEPFKAVQTYIGDKVRQAERLRERSKELASRIQALVQPLH 221 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 ++ K L + L+ S + + V+ Sbjct: 222 IQNALKTPDSKYNRLEGKELQHRLDAKYYNHRSMEVLDACKDESKAINNLMISVSNGFEH 281 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 T + E +L+ ++ P+S E + + +++ Sbjct: 282 RTFVDEGQPYITVSEVSSGRLDLTSVPKIPDSVEVPDKALINSNCVLVVRTGSIGIAVKV 341 Query: 308 AQVMERGIITSAYMAVKPHGIDS-TYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL 365 + E I+S + ++ + +A + S + + + G + + +++ L Sbjct: 342 HEEDEGASISSHLIRLEFQEESTAAAVAAFLNSAAGECLLHKISYGAVQPQVGQDELLNL 401 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS---FIAAAVTGQI 419 P+ I +++ + I + E +I + ++ + + G+I Sbjct: 402 PIP--------RI--ILDN-SEEILQCMNLQEMAIRSAERLTTAAKLLVEGLIEGKI 447 >gi|313668695|ref|YP_004048979.1| restriction modification system DNA specificity domain [Neisseria lactamica ST-640] gi|313006157|emb|CBN87619.1| putative restriction modification system DNA specificity domain [Neisseria lactamica 020-06] Length = 205 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 21/174 (12%), Positives = 47/174 (27%), Gaps = 3/174 (1%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 K + + + + N++Q E + + S Sbjct: 18 KDVVWKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLSGYVPSEGKMTEYIVN 77 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+ I K G + + V ++ YL ++ Sbjct: 78 DILIGNIRPYLKKIWQADCTGGTNGDV--LVIRVTDEKVNPKYLYQVLADDKFFAFNMKH 135 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G + + + +PP+ EQ I +++ + E + I L Sbjct: 136 AKGAKMPRGSKTAIMQYKIPIPPLSEQEKIVAILDKFDTLTHSVSEGLPHEIAL 189 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 71/187 (37%), Gaps = 8/187 (4%) Query: 27 VVPIKRFTKLNTGRT-SESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + + R S+ + Y+G++++ ++ GK L G T I Sbjct: 22 WKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLS--GYVPSEGKMTEYIV--N 77 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICE 142 IL G + PYL+K AD G + LV++ + V P+ L L + Sbjct: 78 DILIGNIRPYLKKIWQADCTGGTNGDVLVIRVTDEKVNPKYLYQVLADDKFFAFNMKHAK 137 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA M I +PIPPL+EQ I + ++ I L +++ + Sbjct: 138 GAKMPRGSKTAIMQYKIPIPPLSEQEKIVAILDKFDTLTHSVSEGLPHEIALRRKQYEYY 197 Query: 203 VSYIVTK 209 ++ Sbjct: 198 CEQLLAF 204 >gi|294101455|ref|YP_003553313.1| restriction modification system DNA specificity domain protein [Aminobacterium colombiense DSM 12261] gi|293616435|gb|ADE56589.1| restriction modification system DNA specificity domain protein [Aminobacterium colombiense DSM 12261] Length = 505 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 65/166 (39%), Gaps = 7/166 (4%) Query: 28 VPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V +K ++ G++ ++ + + +++ G Y D + + Sbjct: 322 VKLKNVAEVFRGKSILKKDLGPGNVAVLNISNIKDGEIDYHDLDTIDEEEHKIKRYELSS 381 Query: 84 GQILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAI 140 G ++ G ++ A+ D I S +V++PK+ + E ++ +L S I++ Sbjct: 382 GDVVLSCRGTSIKSAVFEAQDKTIIASANLVVIRPKEKVKGEFIKIFLESPVGQAMIQSF 441 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 G + + ++ I + +P P+ EQ + E E I Sbjct: 442 QRGTILMNINYADIMEMEIPFLPIYEQQKMIETYCQEFKTYKEAIN 487 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 13/123 (10%), Positives = 32/123 (26%), Gaps = 2/123 (1%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + E + G++V + + K ++ Sbjct: 369 EEEHKIKRYELSSGDVVLSCRGTSIKSAVFEAQDKTIIASANLVVIRPKEKVK-GEFIKI 427 Query: 336 LMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + S + + G ++ + D+ + + PI EQ + E + Sbjct: 428 FLESPVGQAMIQSFQRGTILMNINYADIMEMEIPFLPIYEQQKMIETYCQEFKTYKEAIN 487 Query: 395 KIE 397 E Sbjct: 488 LAE 490 >gi|148827016|ref|YP_001291769.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae PittGG] gi|148718258|gb|ABQ99385.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae PittGG] Length = 459 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 51/470 (10%), Positives = 128/470 (27%), Gaps = 92/470 (19%) Query: 29 PIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF- 81 + + + G +S + I + +V G+ L + V F Sbjct: 3 KLGNYISVQNGYAFKSKDFIKNLSGMPVIKIGNVTGGSFIDLSSYDTISEEIARKVKSFQ 62 Query: 82 -AKGQILYGKLGPYLRKA---IIADFDGICSTQFLVLQPKDVLPE---LLQGWLLSIDVT 134 IL G + K + + + L K+ P + + S Sbjct: 63 TKDDDILIAMTGANVGKVSRIAKGTQPCLINQRVGRLILKEDCPYSSDFIYYLVSSNKSF 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 Q +GA + K I ++ P + + +I ++ Sbjct: 123 QYFSNTADGAAQPNISGKLIEDLEFPDISPKSANKAGKHLKVLDEKIQLNTQINQTLEQI 182 Query: 195 LKEKKQALVS---------YIVTKG-------LNPDVKMKDSGIEWV------------- 225 + ++ +++G L + E + Sbjct: 183 AQALFKSWFVDFDPVRAKVQALSEGMSLEQAELAAMQTISGKTPEELTALSQTQPDRYAE 242 Query: 226 -----------------GLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNII 265 G VP WE + N K+ +E I + G++ Sbjct: 243 LAETAKAFPCEMVEVDGGEVPKGWEKTTLSEICEMQNGYAFKSFDWMEQGIPVIKIGSVK 302 Query: 266 QKL-ETRNMGLKPESYET---YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + E G E Y ++ +I+ + + + ++ ++ Sbjct: 303 PIIVEVEGNGFVSEDYSKLKPDFLLTSSDILVGLTGYVGEVGRIPTGKI---AMLNQRVA 359 Query: 322 AVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-----SGLRQSLKFEDVKRLPVLVPPIKEQ 375 P ID + + + F + ++ +++ + P++ Sbjct: 360 KFLPKEIDKNHCFYNYIYCLARQSQFKEFAEINAKGSAQANISTKELLKFPIIKAN---- 415 Query: 376 FDITNVINVETARIDVLVEKIEQSI------VLLKERRSSFIAAAVTGQI 419 + +++ + + E +E+ + L + R + + G++ Sbjct: 416 ----DKLHILFE--NRVKELLERILWNSQNAETLAKTRDLLLPRLLNGEV 459 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 56/202 (27%), Gaps = 15/202 (7%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 G +PK W+ + ++ G +S + I I + V+ + D Sbjct: 260 GEVPKGWEKTTLSEICEMQNGYAFKSFDWMEQGIPVIKIGSVK--PIIVEVEGNGFVSED 317 Query: 75 TSTVS---IFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPK-----DVLPELLQ 125 S + + IL G G I + + + PK + Sbjct: 318 YSKLKPDFLLTSSDILVGLTGYVGEVGRIPTGKIAMLNQRVAKFLPKEIDKNHCFYNYIY 377 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + E +G+ ++ K + P+ +L ++ RI Sbjct: 378 CLARQSQFKEFAEINAKGSAQANISTKELLKFPIIKANDKLHILFENRVKELLERILWNS 437 Query: 186 TERIRFIELLKEKKQALVSYIV 207 + L++ V Sbjct: 438 QNAETLAKTRDLLLPRLLNGEV 459 >gi|302560832|ref|ZP_07313174.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000] gi|302478450|gb|EFL41543.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000] Length = 321 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 52/349 (14%), Positives = 97/349 (27%), Gaps = 43/349 (12%) Query: 81 FAKGQILYGKLGPYLRKAIIADFDG---ICSTQFLVL--QPKDVLPELLQGWLLSIDVTQ 135 GQI+ KL + + D S ++ V K ++ L + Sbjct: 4 LKTGQIVMSKLNAWEGGLAVVGEDFSDTYVSPEYPVFSVDEKRAQSAYVKHLLAWPRLWG 63 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 R+ +P+P EQ I ++ A RI + + L+ Sbjct: 64 RLTPRGSMVQRKRTTPATFLATCVPLPDPVEQNRIAGRLDAAMHRIAQVDYLKGTSNNLI 123 Query: 196 KEKKQAL---VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 + AL + V K +E P L+ + + Sbjct: 124 LQYADALFRSIKQTAPLAEVLLVDDKFVDVESDSTYPVTGICSFGRDLIRRPVIQGSGTA 183 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 Y + + G+IV ++ ++ + Sbjct: 184 ---------------------------YTRFVQIQAGQIVMSKLNAWEGALAVVGGDFAD 216 Query: 313 RGIITSAYMAV--KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLV 369 ++ Y DS YL L+ +L GS R + + V + Sbjct: 217 T-YVSPEYPVFSLIESAADSEYLEHLLAWPELWARLTPRGSMFRRKRTTPATLLATEVPL 275 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P + EQ I + + E + L R + + AA +G+ Sbjct: 276 PSLSEQRRIAKQL----TLARRVAEGSAAQVEQLATLRRALLDAAFSGR 320 >gi|225550745|ref|ZP_03771694.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 2 str. ATCC 27814] gi|225379899|gb|EEH02261.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 2 str. ATCC 27814] Length = 354 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 51/390 (13%), Positives = 105/390 (26%), Gaps = 50/390 (12%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + +K G T S + I +G Y+ ++ Sbjct: 3 IKLKDIIYAKRGSTITSNEFKINPGSYPLISASAQNNGVFGYINS------------YMY 50 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ-RIEAI 140 G I G D S ++ + + + +I+++ Sbjct: 51 EGGHITISMNGNAGCVFYQKDKFSANSDVLVLSNIDNKISNNKFIFYWLKKHENTKIKSL 110 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 C+G T + N+ + +PP+ EQ I I I I I L EK Sbjct: 111 CKGTTRLRLSNDDVLNLEINLPPIEEQNAIISIIEPIEKSI-KTINLLQTKIGLFIEKTF 169 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 ++ + + +KD GL I + Sbjct: 170 NFINDNLVNSDLIEFSLKDLLNIKRGLP---------------------------ITAKD 202 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N + K Y + I + + + Sbjct: 203 LLNNPGSYPLISASSKNNGIFGYFNDYMYDGKNITISMNGNAGCIFYQIGKFSANSDVLV 262 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 ++ + + + + ++ R L +++ VL+P I+ Q + Sbjct: 263 LSNSNKNLTNIDYIYYLLKTKEKEIQNLAIGTTRFRLGNSVIEKFKVLLPNIEIQEKFSK 322 Query: 381 VINVETARIDVLVEKIEQSIV--LLKERRS 408 ++ + KIE+++ LLK + Sbjct: 323 IVEPLL-NLSTKANKIEKNLNECLLKIVKK 351 >gi|325474568|gb|EGC77754.1| type I restriction-modification system [Treponema denticola F0402] Length = 157 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 55/147 (37%), Gaps = 13/147 (8%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID--------LQNDKRSLRS 307 +G++ + +N + Y I G I+ D + K S+ + Sbjct: 9 WTWSHFGDVADVINGKNQSQVEDDTGEYPIYGSGGIMGYANDYICPENCTIIGRKGSINN 68 Query: 308 AQVMER--GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 +E + +A+ + YL + +S+D + S SL ++R+ Sbjct: 69 PIFVEEKLWNVDTAFGLAPSSIVLPRYLFYFCKSFDFTSL---DSSTTLPSLTKTSIQRI 125 Query: 366 PVLVPPIKEQFDITNVINVETARIDVL 392 +PP+ Q I + I+ +++D + Sbjct: 126 LFPLPPLAAQKRILDKIDELFSQLDKI 152 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 56/163 (34%), Gaps = 16/163 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 IP+ W + G+ VE TG+Y P G+ + I Sbjct: 5 IPESWTWSHFGDVADVINGKNQSQ-----------VEDDTGEY-PIYGSGGIMGYANDYI 52 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + + G+ G + + T F + VLP L + S D T ++ Sbjct: 53 CPENCTIIGRKGSINNPIFVEEKLWNVDTAFGLAPSSIVLPRYLFYFCKSFDFT----SL 108 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 T+ I I P+PPLA Q I +KI ++D Sbjct: 109 DSSTTLPSLTKTSIQRILFPLPPLAAQKRILDKIDELFSQLDK 151 >gi|321310227|ref|YP_004192556.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802071|emb|CBY92717.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 199 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 58/184 (31%), Gaps = 8/184 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN--MGLKPESYETYQ 284 ++ + ++ K++ + + NI +++ Sbjct: 11 ENVRYFRLGDVCKTYAGISFKSSFYRDRGFPIIKTRNIQDNQIVTGDLNYCDLANHKDAM 70 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDL 342 I+ G++V + E + S + P+ YL + S Sbjct: 71 IIKHGDVVMAKDGS--CCGKIGINLTDEEFLFDSHVLQFIPNEKLLIKRYLYHFLLSCQD 128 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 A+GS ++ +++++ + V ++ Q + + ++ I+ + ++ Sbjct: 129 KIRELAVGS-AIPGIRKSELEKIKIPVSSLEVQEKVASTLDK-FREIEREISLRDKQYEY 186 Query: 403 LKER 406 + Sbjct: 187 YRNY 190 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 55/176 (31%), Gaps = 8/176 (4%) Query: 29 PIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + K G + +S + I +++ + + I G Sbjct: 17 RLGDVCKTYAGISFKSSFYRDRGFPIIKTRNIQDNQIVTGDLNYCDLAN-HKDAMIIKHG 75 Query: 85 QILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 ++ K G K I D + + + L P + L + + +I + Sbjct: 76 DVVMAKDGSCCGKIGINLTDEEFLFDSHVLQFIPNEKLLIKRYLYHFLLSCQDKIRELAV 135 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ + + I +P+ L Q + + + I+ I+ R + E + Sbjct: 136 GSAIPGIRKSELEKIKIPVSSLEVQEKVASTLD-KFREIEREISLRDKQYEYYRNY 190 >gi|297571612|ref|YP_003697386.1| restriction modification system DNA specificity domain protein [Arcanobacterium haemolyticum DSM 20595] gi|296931959|gb|ADH92767.1| restriction modification system DNA specificity domain protein [Arcanobacterium haemolyticum DSM 20595] Length = 249 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 57/184 (30%), Gaps = 15/184 (8%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNI------LSLSYGNIIQKLETRNMGLKPESYET 282 PD W F LV K K E +S ++ Q + Sbjct: 68 PDSWRWIRFGDLVEFRMGKTPKRAEQKYWLRGSVPWVSISDMAQGETITSTRESVSDEAI 127 Query: 283 YQIV-----DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 G ++ F L V II+ + V I +YLA+ + Sbjct: 128 SDAFGGVVSPAGTLIMSFKLTIGRCSFLGVDAVHNEAIIS-VFPIVDTWEILPSYLAYAL 186 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + A G +L + + V +PP+ EQ I ++ ID L ++E Sbjct: 187 PIFSSHGDAKAAMKG--NTLNSTSLNLMMVSLPPLAEQERIVAKLDEVLPLIDQL-AELE 243 Query: 398 QSIV 401 + Sbjct: 244 RERE 247 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 65/185 (35%), Gaps = 23/185 (12%) Query: 18 IGA------IPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYL 64 IG +P W+ + + G+T + + + ++ + D+ G Sbjct: 58 IGENDDPFVLPDSWRWIRFGDLVEFRMGKTPKRAEQKYWLRGSVPWVSISDMAQGETITS 117 Query: 65 PKDGNSRQ--SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP----KD 118 ++ S + SD + G ++ + + D + + + + P + Sbjct: 118 TRESVSDEAISDAFGGVVSPAGTLIMS-FKLTIGRCSFLGVDAVHNEAIISVFPIVDTWE 176 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 +LP L L +A +G T + + + + +PPLAEQ I K+ Sbjct: 177 ILPSYLAYALPIFSSHGDAKAAMKGNT---LNSTSLNLMMVSLPPLAEQERIVAKLDEVL 233 Query: 179 VRIDT 183 ID Sbjct: 234 PLIDQ 238 >gi|261380923|ref|ZP_05985496.1| HsdS protein [Neisseria subflava NJ9703] gi|284796176|gb|EFC51523.1| HsdS protein [Neisseria subflava NJ9703] Length = 223 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 71/241 (29%), Gaps = 24/241 (9%) Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN--PDVKMKDSGIEWVGLVPDHWE 233 R+D+ I E +E ++ K+A+++ + P ++ K EW Sbjct: 1 MFFSRLDSQIAESRAVLEKSRQLKKAMLAKMFPANGEKIPKIRFKGFEGEWETYQICDLF 60 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 ++ N + K + S + L E+ T+ Sbjct: 61 RITRGNVLATTNLVDNKNEDYCYPVYSSQTKNKGLMGYWKHYLFENAITWTTDGANAGDV 120 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 F + ++ + E G + S Sbjct: 121 NFRSGKFYCTNVCGVLINEEGFANQGIAEILNLVTHSYVSY-----------------VG 163 Query: 354 RQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 L + +P+L+PP IKEQ I N ++D + + L + + +A Sbjct: 164 NPKLMNNVMAEIPILIPPTIKEQTAIGNF----FRQLDETIALQSAEVEKLNQLKKGLLA 219 Query: 413 A 413 A Sbjct: 220 A 220 >gi|3335670|gb|AAC78320.1| restriction-modification enzyme MpuUVIII S subunit [Mycoplasma pulmonis] Length = 365 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 44/359 (12%), Positives = 106/359 (29%), Gaps = 31/359 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 ++ + + L G++ + K + IG+ ++ S K G D + Sbjct: 2 EIYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTKDQGIFGKINSYDFNGEY----- 56 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 IL G Y + ++ +L+ + + + L + + + G+ Sbjct: 57 -ILITTHGAYAGTVKYVNEKFSTTSNCFILKVDENIAKTKFLSYLLLLQEKTFNDMAIGS 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I + + +P L Q I + I +I+ + Q + Sbjct: 116 AYGYLKNYNINDFEVNLPNLKTQSAIIKIIEPLEKQINAFDELILSE--------QKSLQ 167 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 + + LN + S + ++++ L ++ N K + NI + + Sbjct: 168 HYLNYFLNKLASINPS-------IFKNYKLGQILNLEKGKSKYNAKYVSQNIGIYNLYSS 220 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + + + + I+ I + ++ ++ Sbjct: 221 KTRDQGIFGKINSYDFNGEYIL---------ITTHGAYAGTVKYVNEKFSTTSNCFILKV 271 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I T + LK ++ V +P +K Q I +I Sbjct: 272 NENIVKTKFLSYLLLLQEKTFNDMAIGSAYGYLKNYNINDFEVNLPNLKIQSAILGIIE 330 Score = 40.5 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 35/142 (24%), Gaps = 3/142 (2%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K + + I + ++ ++ Sbjct: 31 YNLYSSKTKDQGIFGKINSYDFNGEYILITTHGAYAGTVKYVNEKFSTTSNCFILKVDEN 90 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 I T + LK ++ V +P +K Q I +I Sbjct: 91 IAKTKFLSYLLLLQEKTFNDMAIGSAYGYLKNYNINDFEVNLPNLKTQSAIIKIIEPLEK 150 Query: 388 RI---DVLVEKIEQSIVLLKER 406 +I D L+ ++S+ Sbjct: 151 QINAFDELILSEQKSLQHYLNY 172 >gi|261492676|ref|ZP_05989226.1| type I site-specific deoxyribonuclease specificity subunit [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261495899|ref|ZP_05992323.1| type I site-specific deoxyribonuclease specificity subunit [Mannheimia haemolytica serotype A2 str. OVINE] gi|261308443|gb|EEY09722.1| type I site-specific deoxyribonuclease specificity subunit [Mannheimia haemolytica serotype A2 str. OVINE] gi|261311662|gb|EEY12815.1| type I site-specific deoxyribonuclease specificity subunit [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 124 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 34/114 (29%), Gaps = 1/114 (0%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 I D I+ + + V + + + Y + + Sbjct: 11 IDKVASYIFDGKFILIGEDGGNFFTKKDVAFIVEGKFWANNHVHVLSVDFNLEKYFCYYL 70 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + +L + G L ++ + + +PPI EQ I I + I+ Sbjct: 71 NALNLPSMGLINGI-AVPKLNQRNLNSILIAIPPISEQHRIVEKIEKLFSEIEK 123 >gi|315225320|ref|ZP_07867135.1| type I restriction enzyme, S subunit [Capnocytophaga ochracea F0287] gi|314944714|gb|EFS96748.1| type I restriction enzyme, S subunit [Capnocytophaga ochracea F0287] Length = 183 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 55/183 (30%), Gaps = 15/183 (8%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE--------TRNMGL 275 + +P W +V K E ++ S+ + ++ L Sbjct: 1 MLLDLPVGWRWCRLKDIVFIFTGATFKKEEVSVESIDIRILRGGNIQPFRLTNRVDDIFL 60 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDST 331 + + ++ +IV + + + + ++ + I S Sbjct: 61 PKDKVKENILLKKNDIVTPAVTSLENIGKMARVEFDLESTTVGGFVFILRQFYCNDIVSK 120 Query: 332 YLAWLMRSYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL L+ S L + ++ ++ + +PP+ EQ I I+ Sbjct: 121 YLLALLSSPVLIDYIKSITNKSGQAFYNISKNRLEMTLLPLPPLAEQQRIVESIDAIFRC 180 Query: 389 IDV 391 I+ Sbjct: 181 IEN 183 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 53/184 (28%), Gaps = 24/184 (13%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P W+ +K + TG T + + DI + +++ D + Sbjct: 4 DLPVGWRWCRLKDIVFIFTGATFKKEEVSVESIDIRILRGGNIQPFRLTNRVDDIFLPKD 63 Query: 74 DTSTVSIFAKGQIL---------YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL 124 + K I+ GK+ + +L+ + Sbjct: 64 KVKENILLKKNDIVTPAVTSLENIGKMA----RVEFDLESTTVGGFVFILRQFYCNDIVS 119 Query: 125 QGW-----LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + + + G + + +P+PPLAEQ I E I A Sbjct: 120 KYLLALLSSPVLIDYIKSITNKSGQAFYNISKNRLEMTLLPLPPLAEQQRIVESIDAIFR 179 Query: 180 RIDT 183 I+ Sbjct: 180 CIEN 183 >gi|282883024|ref|ZP_06291625.1| type I restriction enzyme, HsdS subunit [Peptoniphilus lacrimalis 315-B] gi|281297081|gb|EFA89576.1| type I restriction enzyme, HsdS subunit [Peptoniphilus lacrimalis 315-B] Length = 230 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 53/149 (35%), Gaps = 4/149 (2%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++ G + VD +++F I + + I Sbjct: 70 NVKNGEVNFDNSYYISEQDYLEINKRSKVDIYDLLFTMIGTIGEVAQITEEA---NYAIK 126 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + + I S YL + ++S + G + + ++ + +++P + Q Sbjct: 127 NVGLIKTNNKILSRYLFYYLKSEKIRNYISENKSKGSQVFISLGKLRNMEIILPCQEVQE 186 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKE 405 I ++++ ++ + E + + I L ++ Sbjct: 187 YIVSILDKFEKLVNDVNEGLPKEIDLRQK 215 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 60/187 (32%), Gaps = 6/187 (3%) Query: 29 PIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQS--DTSTVSIFAK 83 + + G + + + ++V++G + S Q + + S Sbjct: 41 KLDAICDVRDGTHNSPKRQLHGKYLVTSKNVKNGEVNFDNSYYISEQDYLEINKRSKVDI 100 Query: 84 GQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 +L+ +G A I + + L+ +L L +L S + I Sbjct: 101 YDLLFTMIGTIGEVAQITEEANYAIKNVGLIKTNNKILSRYLFYYLKSEKIRNYISENKS 160 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + N+ + +P Q I + ++ + + I+L +++ + Sbjct: 161 KGSQVFISLGKLRNMEIILPCQEVQEYIVSILDKFEKLVNDVNEGLPKEIDLRQKEYEYY 220 Query: 203 VSYIVTK 209 ++ Sbjct: 221 REKLLDF 227 >gi|256851079|ref|ZP_05556468.1| type I R/M system specificity subunit [Lactobacillus jensenii 27-2-CHN] gi|256616141|gb|EEU21329.1| type I R/M system specificity subunit [Lactobacillus jensenii 27-2-CHN] Length = 199 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 50/163 (30%), Gaps = 8/163 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDGNSRQS---DTST 77 WK V + ++ G T + + G E G YL + S+ Sbjct: 38 WKKVKLGDVAEIIGGGTPSTSNLEYWDGNINWFTPTEVGKTIYLHESQRKLSELGLKKSS 97 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + G IL+ II + + F +QP + + + LS + + Sbjct: 98 ARLLNPGAILFTSRAGIGNTGIIINPSA-TNQGFQSIQPNKNIIDSYFIFCLSSRLKRYA 156 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 G+T + + + I EQ I I + Sbjct: 157 LKHSAGSTFTEISGSEMKKAKIRICAKNEQNKISTCIKSLDSL 199 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 58/188 (30%), Gaps = 11/188 (5%) Query: 204 SYIVTKGLNPDVKMKDSGIEW----VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + L P V+ + W +G V + E N + Sbjct: 18 THADEQRLYPKVRFRGFDEPWKKVKLGDVAEIIGGGTPSTSNLEYWDGNINWFTPTEVGK 77 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + +GLK + ++++PG I+F + + + +G + Sbjct: 78 TIYLHESQRKLSELGLK---KSSARLLNPGAILFTSRAGIGNTGIIINPSATNQGFQS-- 132 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 I +Y + + S + ++K+ + + EQ I+ Sbjct: 133 --IQPNKNIIDSYFIFCLSSRLKRYALKHSAGSTFTEISGSEMKKAKIRICAKNEQNKIS 190 Query: 380 NVINVETA 387 I + Sbjct: 191 TCIKSLDS 198 >gi|160894144|ref|ZP_02074922.1| hypothetical protein CLOL250_01698 [Clostridium sp. L2-50] gi|156864177|gb|EDO57608.1| hypothetical protein CLOL250_01698 [Clostridium sp. L2-50] Length = 231 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 23/209 (11%), Positives = 62/209 (29%), Gaps = 20/209 (9%) Query: 226 GLVPDHWEVKPFFA------LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 G PD W + K ++ N ++ ++ + + Sbjct: 29 GTKPDDWSDGTIDDLGTEIICGKTPSTKKSEYYGGNTPFITIPDMHGCVYIVSTERYLSD 88 Query: 280 ----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + + + P + I + + I + + GI Y+ Sbjct: 89 AGVASQPKKTLPPNTVCVSCIGTAGLVTLVSEESQSNQQINS----IIPKEGISVYYIYL 144 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVE 394 LM++ +L ++ V++P + Q + + + Sbjct: 145 LMQTLADTINKLGQSGSTIVNLNKTQFGKIQVMIPSELVLQD-----FDSLCRPLFDTIL 199 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ++ + L E R + + ++G++D+ Sbjct: 200 SNQKENINLSELRDALLPKLMSGELDVSD 228 >gi|238923275|ref|YP_002936790.1| restriction modification system DNA specificity domain protein [Eubacterium rectale ATCC 33656] gi|238874949|gb|ACR74656.1| restriction modification system DNA specificity domain protein [Eubacterium rectale ATCC 33656] Length = 173 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 22/135 (16%), Positives = 47/135 (34%), Gaps = 8/135 (5%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + ++PE + V PG+++ + + G + + Sbjct: 37 NYITQTAEKIRPEGLSKTREVHPGDLILSNSMSFGRPYIMAIDGCIHDGWLA---IRDTK 93 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 D +L L+ + + + AM +G +L E V V P ++EQ I + Sbjct: 94 KNFDLKFLCTLLGTDGMLNQYKAMAAGSTVNNLNKELVGGTTVAFPMVEEQIKIGDY--- 150 Query: 385 ETARIDVLVEKIEQS 399 +D L+ ++ Sbjct: 151 -FTTLDHLITLHQRQ 164 Score = 43.6 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 50/165 (30%), Gaps = 9/165 (5%) Query: 34 TKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + G + + ++ + D R S G Sbjct: 2 VTIERGGSPRPIDKFITNDENGLNWVKIGDAPEQGNYITQTAEKIRPEGLSKTREVHPGD 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAICEGA 144 ++ + R I+A I + K + L L + + + +A+ G+ Sbjct: 62 LILSNSMSFGRPYIMAIDGCIHDGWLAIRDTKKNFDLKFLCTLLGTDGMLNQYKAMAAGS 121 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 T+++ + + +G + P + EQ+ I + I + Sbjct: 122 TVNNLNKELVGGTTVAFPMVEEQIKIGDYFTTLDHLITLHQRQHK 166 >gi|189467612|ref|ZP_03016397.1| hypothetical protein BACINT_04002 [Bacteroides intestinalis DSM 17393] gi|189435876|gb|EDV04861.1| hypothetical protein BACINT_04002 [Bacteroides intestinalis DSM 17393] Length = 186 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 15/146 (10%), Positives = 47/146 (32%), Gaps = 3/146 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 I+ G + + E + + G+++ + + Sbjct: 29 SGIPFFRGKEIIEKQKGESVSTELYISKSRYDEIKNKFGVPKEGDMLLTSVGTLGIPYIV 88 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKR 364 ++ + + I+S +L + S A +++L + + + Sbjct: 89 KNETFYFKD--GNLTWFTDFKEINSKFLYYWFLSPIAKNAINAKAIGSTQKALTIDALSK 146 Query: 365 LPVLVPPIKEQFDITNVINVETARID 390 + +P I Q I ++++ ++I+ Sbjct: 147 FEIDIPNIDTQNRIVSILSSLDSKIE 172 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 54/171 (31%), Gaps = 11/171 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTS----ESGKDIIYIGLEDV---ESGTGKYLPKDGNSRQSDT 75 + WK I +++ + I + +++ + G + + D Sbjct: 2 EEWKTYKIGNLCSISSSKRIFAKEYQSSGIPFFRGKEIIEKQKGESVSTELYISKSRYDE 61 Query: 76 STVS--IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSI 131 + +G +L +G I+ + L K++ + L W LS Sbjct: 62 IKNKFGVPKEGDMLLTSVGTLGIPYIVKNETFYFKDGNLTWFTDFKEINSKFLYYWFLSP 121 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 I A G+T + + IP + Q I + + +I+ Sbjct: 122 IAKNAINAKAIGSTQKALTIDALSKFEIDIPNIDTQNRIVSILSSLDSKIE 172 >gi|256851080|ref|ZP_05556469.1| restriction modification DNA specificity domain-containing protein [Lactobacillus jensenii 27-2-CHN] gi|256616142|gb|EEU21330.1| restriction modification DNA specificity domain-containing protein [Lactobacillus jensenii 27-2-CHN] Length = 216 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 75/193 (38%), Gaps = 11/193 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLS-YGNIIQKLETRNMGLKPESYETYQIVDP 288 + W+ V + RKN L + L++S ++ + + + E+ Y ++ Sbjct: 27 EPWKKVKLGRNVKRIRRKNKNLETNIPLTISAQFGLVDQRDFFGRVVASENLANYILLKR 86 Query: 289 GEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 GE + + ++ + G +++ Y+A P I+S +L + Sbjct: 87 GEFAYNKSYSKEAPYGSIKRLEKYNEGALSTLYIAFTPENINSDFLKAFFDTTKWYSHIV 146 Query: 348 AMGS-GLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + G R ++ +D + + +P EQ +I+ + N+ + L+ +Q I Sbjct: 147 QVSTEGARNHGLLNISPQDFFEMSITIPKSDEQNNISRIYNLM----NSLLSLQQQDINT 202 Query: 403 LKERRSSFIAAAV 415 ++ + + Sbjct: 203 TQQLKQFLLQNLF 215 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 55/189 (29%), Gaps = 12/189 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK V + R K + +I I + + + + + + + Sbjct: 29 WKKVKLGRNVKRIRRKNKNLETNIPLTISAQFGLVDQRDFFGR--VVASENLANYILLKR 86 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G+ Y K +G ST ++ P+++ + L+ + + I Sbjct: 87 GEFAYNKSYSKEAPYGSIKRLEKYNEGALSTLYIAFTPENINSDFLKAFFDTTKWYSHIV 146 Query: 139 AICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +H + + + IP EQ I + + +L Sbjct: 147 QVSTEGARNHGLLNISPQDFFEMSITIPKSDEQNNISRIYNLMNSLLSLQQQDINTTQQL 206 Query: 195 LKEKKQALV 203 + Q L Sbjct: 207 KQFLLQNLF 215 >gi|297590649|ref|ZP_06949287.1| type I restriction-modification system specificity subunit [Staphylococcus aureus subsp. aureus MN8] gi|297575535|gb|EFH94251.1| type I restriction-modification system specificity subunit [Staphylococcus aureus subsp. aureus MN8] gi|312437728|gb|ADQ76799.1| type I restriction-modification system specificity subunit [Staphylococcus aureus subsp. aureus TCH60] Length = 208 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 20/170 (11%), Positives = 56/170 (32%), Gaps = 6/170 (3%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K + +I + ++ K S + ++ I I + + Sbjct: 43 KIKEFWNGDIPWIQSSDVKVNDLILRQCNKFISKNSIELSSAKLIPANSIAIVTRVGVGK 102 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 V + ++++ D Y + + Y + K+ + + + +++ Sbjct: 103 LCLVEFDYATSQDFLSLSSLKYDKLYSLYSLL-YTMKKISANLQGTSIKGITKKELLDSI 161 Query: 367 VLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +P ++EQ I + +ID + + I +LK + + Sbjct: 162 IKIPHNLEEQQKIGD----LFYKIDKYISFNKCKIEILKSLKQGLLQKIF 207 Score = 43.6 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 67/193 (34%), Gaps = 15/193 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYL--PKDGNSRQSD 74 +W+ I+ G + + K DI +I DV+ K + + Sbjct: 21 NWEEKKIEDIASQVYGGGTPNTKIKEFWNGDIPWIQSSDVKVNDLILRQCNKFISKNSIE 80 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ + I + K + +FD S FL L L + Sbjct: 81 LSSAKLIPANSIAIVT-RVGVGKLCLVEFDYATSQDFLSLSSLKYD--KLYSLYSLLYTM 137 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++I A +G ++ K + I + + ++KI +ID I+ IE+ Sbjct: 138 KKISANLQGTSIKGITKKE---LLDSIIKIPHNLEEQQKIGDLFYKIDKYISFNKCKIEI 194 Query: 195 LKEKKQALVSYIV 207 LK KQ L+ I Sbjct: 195 LKSLKQGLLQKIF 207 >gi|145635505|ref|ZP_01791205.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae PittAA] gi|145267270|gb|EDK07274.1| putative type I site-specific restriction-modification system, S subunit [Haemophilus influenzae PittAA] Length = 59 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 21/34 (61%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNT 38 + Y YKDSGV+W+G +P HW++ +K+ Sbjct: 2 RRYESYKDSGVEWLGEVPSHWELKRLKQLFVEKN 35 >gi|13508246|ref|NP_110195.1| type I restriction enzyme ecokI specificity protein (hsdS)-like protein [Mycoplasma pneumoniae M129] gi|12229976|sp|P75279|T1SB_MYCPN RecName: Full=Putative type-1 restriction enzyme specificity protein MPN_507; AltName: Full=S.MpnORFBP; AltName: Full=Type I restriction enzyme specificity protein MPN_507; Short=S protein gi|1674010|gb|AAB95983.1| type I restriction enzyme ecokI specificity protein [Mycoplasma pneumoniae M129] Length = 363 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 55/387 (14%), Positives = 112/387 (28%), Gaps = 49/387 (12%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV--SIFAKGQI 86 IK + GR I E +++ +GKY + + + G+ Sbjct: 7 KIKDICDIQRGR---------GITKEYIKNNSGKYPVYSAATTNNGELGFINTYDFAGEY 57 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 + Y + S V K E+ +L + + + + Sbjct: 58 VTWTTNGYAGVVFYRNGKFSASQDCGV--LKVRNKEINAQFLAFALSLKTPQFVHNLGSR 115 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 + K + I + PPL Q I + + L E I+ + L++ Sbjct: 116 PKLNRKVVAEISLDFPPLEVQEKIAHFLKSFNELSSQLKAELIKRQKQYAFYSDYLLN-- 173 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 K S E L + ++ +K E + + Sbjct: 174 ----------PKHSQGEEYKLF-----------KLKDIAKKILVGGEKPSDFQKEKDQVY 212 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K + K + + Y + + ++ ++ KP Sbjct: 213 KYPILSNSRKADDFLGYSKTFRIAEKSITVSARGTIGAVFYRDFSYLPAVSLICFIPKPE 272 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE- 385 + +L +++ K GSG L K V +P +K+Q +I ++ Sbjct: 273 F-NINFLFHALKATKFHKQ----GSGT-GQLTMAQFKEYQVYIPSLKKQQEIAATLDPLY 326 Query: 386 --TAR----IDVLVEKIEQSIVLLKER 406 A I +E ++ + +ER Sbjct: 327 YIFANSNWGIYKEIELRKKQMQYYQER 353 Score = 44.0 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 48/166 (28%), Gaps = 16/166 (9%) Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQ-------IVDPGEIVFRFIDLQNDKR 303 +I + G I K +N K Y ++ + ++ + Sbjct: 6 YKIKDICDIQRGRGITKEYIKNNSGKYPVYSAATTNNGELGFINTYDFAGEYVTWTTNGY 65 Query: 304 SLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + + + + + S + + +GS R L + V Sbjct: 66 AGVVFYRNGKFSASQDCGVLKVRNKEINAQFLAFALSLKTPQFVHNLGS--RPKLNRKVV 123 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + PP++ Q I + + L L+K ++ Sbjct: 124 AEISLDFPPLEVQEKIAHFLKSFNELSSQLKA------ELIKRQKQ 163 >gi|229826009|ref|ZP_04452078.1| hypothetical protein GCWU000182_01373 [Abiotrophia defectiva ATCC 49176] gi|229789751|gb|EEP25865.1| hypothetical protein GCWU000182_01373 [Abiotrophia defectiva ATCC 49176] Length = 345 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 48/365 (13%), Positives = 117/365 (32%), Gaps = 38/365 (10%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V ++ G ++ I DV TG Y + + + Sbjct: 3 VKLEEVC--VRGTSN--------IKQVDVTDKTGDYPIYGASGYIGNVDFYHQENPYVAV 52 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 R + + T ++ +++LP+ L + + + E GAT+ Sbjct: 53 IKDGAGIGRTTLHPAKSSVIGTMQYLIPKENILPKYLFYVVRYMKL----EKYYTGATIP 108 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 H +K + + Q I + + + + +I R + I L +A V Sbjct: 109 HIYFKDYKREEFNLESIEIQAKIVDIL----GKCEKIIEARRKEIISLDNLIKA---RFV 161 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ- 266 + ++ K + +G + + R + ++ + G++ Sbjct: 162 EMFGDININDKKWYSQPLGEL--------CTIVRGGSPRPIESYLGGDVPWIKIGDVTDG 213 Query: 267 ---KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 L + + E + ++V G ++F + + + G I ++A+ Sbjct: 214 ESIYLNSTKEHIIKEGVKKSRLVKAGSLIFANCGVSLGFARIITFD----GCIHDGWLAM 269 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + L + + F A+ +G + +L +K ++PP++ Q + Sbjct: 270 EDIDERIDKVFLLQALNQMTEHFRAIAPAGTQPNLNTAIMKAYKQIIPPMELQKEFIGFC 329 Query: 383 NVETA 387 Sbjct: 330 KQVDK 334 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 57/174 (32%), Gaps = 9/174 (5%) Query: 15 VQWIGAIPKH---WKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLP- 65 V+ G I + W P+ + G + G D+ +I + DV G YL Sbjct: 161 VEMFGDININDKKWYSQPLGELCTIVRGGSPRPIESYLGGDVPWIKIGDVTDGESIYLNS 220 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + + G +++ G L A I FDG +L ++ D + + Sbjct: 221 TKEHIIKEGVKKSRLVKAGSLIFANCGVSLGFARIITFDGCIHDGWLAMEDIDERIDKVF 280 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 +T+ AI T + + + IPP+ Q Sbjct: 281 LLQALNQMTEHFRAIAPAGTQPNLNTAIMKAYKQIIPPMELQKEFIGFCKQVDK 334 Score = 44.4 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 12/76 (15%), Positives = 28/76 (36%), Gaps = 4/76 (5%) Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 +L K+ + F+D KR + I+ Q I +++ + + ++ Sbjct: 87 KYLFYVVRYMKLEKYYTGATIPHIYFKDYKREEFNLESIEIQAKIVDILG----KCEKII 142 Query: 394 EKIEQSIVLLKERRSS 409 E + I+ L + Sbjct: 143 EARRKEIISLDNLIKA 158 >gi|284926281|gb|ADC28633.1| restriction modification enzyme [Campylobacter jejuni subsp. jejuni IA3902] Length = 1364 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 53/449 (11%), Positives = 129/449 (28%), Gaps = 82/449 (18%) Query: 26 KVVPIKRF------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---- 75 ++V +K F K +G + + +G E +++ +G + Sbjct: 895 ELVRLKDFVLDIQTAKRPSGGVGKYENGALSLGGEHIDNKSGYIKLDNPKYVPIKFYESF 954 Query: 76 --STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP------KDVLPELLQGW 127 I + IL K G K + + I + + + L Sbjct: 955 ALQDKGIVKQFDILICKDGALTGKIAMVRNEFIRKSAMINEHIFLLRCDNIAKQKYLFYI 1014 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK-------------- 173 L S Q +++ G+ + + +I +P Q I + Sbjct: 1015 LHSYSGQQALKSKITGSAQGGINKTNLESILIPNADFEIQKQIVAECEKVEEQYNTIRMS 1074 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--------- 224 I I ++ + + + +++ + D + S IE Sbjct: 1075 IEEYQNLIKAILQKCGIIDDGGGYELNSILENLQKLEFKLDFNLLLSLIEEQISHSEVLV 1134 Query: 225 ----------------------------VGLVPDHWE--------VKPFFALVTELNRKN 248 + P + K Sbjct: 1135 EETQSKERKQDFNAFKNFSKTIQELLQTLSTPPKDGWKRISLKNEQYMELNPSKKEISKL 1194 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + + + ++ + ++++ E + Y +I+ I + A Sbjct: 1195 DENMLVSFIEMASVSDKGYIQSKIDRSLNEVRKGYTYFIENDILIAKITPCMENGKCAIA 1254 Query: 309 QVMERGI---ITSAYMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVK 363 + + I T ++ G+DS++L + + ++ + G+ + + + Sbjct: 1255 KNLTNNIGFGSTEFHIFRAKTGLDSSFLFYNLNQQNIREKAALAMTGASGHKRVPISFYE 1314 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVL 392 L + +PP++ Q I I + +ID L Sbjct: 1315 NLTIPLPPLEIQEKIVQNIELVEQQIDFL 1343 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 18/164 (10%), Positives = 52/164 (31%), Gaps = 7/164 (4%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E Y + + + + IV +I+ K ++ + + Sbjct: 929 EHIDNKSGYIKLDNPKYVPIKFYESFALQDKGIVKQFDILICKDGALTGKIAMVRNEFIR 988 Query: 313 R--GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 + I ++ + YL +++ SY + + +G + + +++ + + Sbjct: 989 KSAMINEHIFLLRCDNIAKQKYLFYILHSYSGQQALKSKITGSAQGGINKTNLESILIPN 1048 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + Q I E +++ I SI + + + Sbjct: 1049 ADFEIQKQIV----AECEKVEEQYNTIRMSIEEYQNLIKAILQK 1088 Score = 37.5 bits (85), Expect = 4.8, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 64/196 (32%), Gaps = 19/196 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK + +K + + + +I + V S G K S Sbjct: 1170 GWKRISLKN--EQYMELNPSKKEISKLDENMLVSFIEMASV-SDKGYIQSKIDRSLNEVR 1226 Query: 76 STVSIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + F + IL K+ P + + G ST+F + + K L + L Sbjct: 1227 KGYTYFIENDILIAKITPCMENGKCAIAKNLTNNIGFGSTEFHIFRAKTGLDSSFLFYNL 1286 Query: 130 SIDVTQRI--EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + A+ + N+ +P+PPL Q I + I +ID L + Sbjct: 1287 NQQNIREKAALAMTGASGHKRVPISFYENLTIPLPPLEIQEKIVQNIELVEQQIDFLNLK 1346 Query: 188 RIRFIELLKEKKQALV 203 + ++ Q + Sbjct: 1347 LELLEKEKEKILQKYL 1362 >gi|193067080|ref|ZP_03048049.1| N-6 DNA methylase [Escherichia coli E110019] gi|192959670|gb|EDV90104.1| N-6 DNA methylase [Escherichia coli E110019] Length = 923 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 20/99 (20%), Positives = 36/99 (36%), Gaps = 1/99 (1%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y ++ + PG+I+ + A + + +DS YL + S Sbjct: 138 YNSHVNLQPGDILISRSGTIGKNAVVSEAATGALAGHGLYVIRPDKNYLDSDYLLAYINS 197 Query: 340 YDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFD 377 F A G Q++ + V +LP+ V P+ Q Sbjct: 198 RACQNWFSAHARGTAIQNINRDTVLKLPIPVLPLPIQRR 236 Score = 42.9 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 61/169 (36%), Gaps = 14/169 (8%) Query: 30 IKRFTKLNTGRTSESGKDII---------YIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + + + GRT ++ + YI + D+ G + + S V++ Sbjct: 85 LSTMSSIFAGRTIKAIDLTLAPHDVQAKGYIRISDLAHGRIVRVSRWLKPDVPYNSHVNL 144 Query: 81 FAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQR 136 G IL + G + A++++ + V++P + + L ++ S Sbjct: 145 -QPGDILISRSGTIGKNAVVSEAATGALAGHGLYVIRPDKNYLDSDYLLAYINSRACQNW 203 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 A G + + + + +P+P+ PL Q + I T I Sbjct: 204 FSAHARGTAIQNINRDTVLKLPIPVLPLPIQRRAVARYQQSGTDILTFI 252 >gi|84387340|ref|ZP_00990360.1| putative restriction-modification system methyltransferase [Vibrio splendidus 12B01] gi|84377789|gb|EAP94652.1| putative restriction-modification system methyltransferase [Vibrio splendidus 12B01] Length = 1303 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 55/180 (30%), Gaps = 10/180 (5%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY--------ETYQIVDP 288 + K+ + E + YG I + K ++ ++ + Sbjct: 471 LSKVAQINTGKSIRSTEQTEIENPYGYIRIRDIENFRIQKVTTWLQDDLARAYSHNQLYK 530 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 G I+ + + + + S YL + + + + Sbjct: 531 GNILISKTGTIGKLALVDDRNEGAFAGNNFNVLRINSAKVSSEYLLYYLSTSFCQDWLDS 590 Query: 349 MGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE-TARIDVLVEKIEQSIVLLKER 406 G +Q + + +K LP+L+P ++ Q T I L E +Q+ ER Sbjct: 591 RKRGAVQQHINTDVIKALPILLPSMEMQKRAVAQFEQHGTDVITFLKENSKQADEKAIER 650 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 60/158 (37%), Gaps = 10/158 (6%) Query: 30 IKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + + ++NTG++ S + YI + D+E+ + + + + + K Sbjct: 471 LSKVAQINTGKSIRSTEQTEIENPYGYIRIRDIENFRIQKVTTWLQDDLARAYSHNQLYK 530 Query: 84 GQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPK--DVLPELLQGWLLSIDVTQRIEA 139 G IL K G + A++ D F VL+ V E L +L + +++ Sbjct: 531 GNILISKTGTIGKLALVDDRNEGAFAGNNFNVLRINSAKVSSEYLLYYLSTSFCQDWLDS 590 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 GA H + I +P+ +P + Q + Sbjct: 591 RKRGAVQQHINTDVIKALPILLPSMEMQKRAVAQFEQH 628 >gi|167768053|ref|ZP_02440106.1| hypothetical protein CLOSS21_02597 [Clostridium sp. SS2/1] gi|167710382|gb|EDS20961.1| hypothetical protein CLOSS21_02597 [Clostridium sp. SS2/1] Length = 373 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 49/342 (14%), Positives = 102/342 (29%), Gaps = 39/342 (11%) Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR-----KAIIADFDG 105 I + + GKY N Q D IF +L + G A Sbjct: 21 IPITASDRKEGKYPYYGANGIQ-DYVNDYIFDDELVLLAEDGGNFGSKEKPIAYRVSGKC 79 Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 + VL+PK+ + + L +++ + GAT + + +P+ + Sbjct: 80 WVNNHAHVLKPKEEIDVDYLCYSLMFY---KVDGMINGATRKKLTQTAMKKMKIPLRNIV 136 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 EQ I +++ +I + + + + LL QA V +P K IE + Sbjct: 137 EQKKIVQQLN----KIIEIREKAKKELNLLDNLIQA---RFVEMFGDPITNSKLLPIEKI 189 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 A +T ++ YG + N+ + Sbjct: 190 EER------YFLKAGITTKAEDIHDYLKDKYEIPCYGGNGIRGYVENLSYEG-------- 235 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + I Q + A + ++ ++ ++++ DL + Sbjct: 236 ------CYPIIGRQGALCGNVQYATGKFHATEHAVLVSTLKNDNTMWVYYMLKLMDLYRY 289 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + L + + + V+V I Q ++ Sbjct: 290 ---HTGAAQPGLAVKKLNTIDVIVADINLQNQFAAFVHQINK 328 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 52/161 (32%), Gaps = 6/161 (3%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 L I + K + Y D ++ K + Sbjct: 14 EILDSMRIPITASDRKEGKYPYYGANGIQDYVNDYIFDDELVLLAEDGGNFGSKEKPIAY 73 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 +V + + + +KP + +L S KV + R+ L +K++ + Sbjct: 74 RVSGKCWVNNHAHVLKPKEEI--DVDYLCYSLMFYKVDGMINGATRKKLTQTAMKKMKIP 131 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + I EQ I +N +I + EK ++ + LL + Sbjct: 132 LRNIVEQKKIVQQLN----KIIEIREKAKKELNLLDNLIQA 168 >gi|160945579|ref|ZP_02092805.1| hypothetical protein FAEPRAM212_03108 [Faecalibacterium prausnitzii M21/2] gi|158443310|gb|EDP20315.1| hypothetical protein FAEPRAM212_03108 [Faecalibacterium prausnitzii M21/2] Length = 393 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 47/386 (12%), Positives = 110/386 (28%), Gaps = 37/386 (9%) Query: 45 GKDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII--- 100 + + ++E + + + ++D+ + +G I+ G + I Sbjct: 31 DSGVPVLNGSNLEGFSLSEKAFRYVTEEKADSLNKANAHRGDIVITHRGTLGQIVFIPQD 90 Query: 101 --ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA--DWKGIGN 156 D I +QF V VLPE L + + ++ + + Sbjct: 91 SRYDRYVISQSQFRVRCNDKVLPEYLVYYFHTPIGQYKLLSNASQVGVPALARPSSTFQQ 150 Query: 157 IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 I + +P L+ Q + E I I I + L+++ AL S + + Sbjct: 151 IEVTLPELSIQKRVVEII----TTIQRKIENNQELNDNLEQQAAALFSSLYNRSNTEVRY 206 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 I G P E + + K+ G + + + + Sbjct: 207 TDLIQI-LGGGTPKTGETAYWNGNIAFFTPKD------------VGTPYTFITEKTITEE 253 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 S+ ++ + + S Y V L + Sbjct: 254 GLSHCNSRLYPVNTVFVTARGTVGKVGLSGIPM----AMNQSCYALVGKETH--QLLVYF 307 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR-IDVLVEK 395 + ++ + + ++ D ++ + V ++ +E Sbjct: 308 YTLKAVDRLKHKASGAVFDAITTRDFDSEQIMKLSDDDAKAFLCVAEPMFQEMLNNSIEN 367 Query: 396 IEQSIVLLKERRSSFIAAAVTGQIDL 421 + L R + ++G+ID+ Sbjct: 368 LR-----LSTLRDFLLPKLMSGEIDV 388 >gi|332524591|ref|ZP_08400794.1| restriction endonuclease S subunits-like protein [Rubrivivax benzoatilyticus JA2] gi|332107903|gb|EGJ09127.1| restriction endonuclease S subunits-like protein [Rubrivivax benzoatilyticus JA2] Length = 381 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 51/143 (35%), Gaps = 4/143 (2%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 ++ + + +R + +V +++ ID +N + ++ + Sbjct: 28 YHEVTIKLWGKGIVSRGKVRGSDVVSARNVVRHNQLILSKIDARNGAIGMVPPELDGAIV 87 Query: 316 ITSA--YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPP 371 + P + ++ WL+RS ++ + G R +K E + +PP Sbjct: 88 SNDFPSFEFRDPGRCNPAFIGWLVRSAPFVELCRSASEGTTNRVRIKEERFLAQEIALPP 147 Query: 372 IKEQFDITNVINVETARIDVLVE 394 + Q I ++ T +I + Sbjct: 148 LSHQHAIAASLDALTDKIRQVEA 170 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 48/370 (12%), Positives = 104/370 (28%), Gaps = 37/370 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W V I + + E + Y + G G + + S ++ Sbjct: 4 WPKVSIGDLLRRSDEPA-EIDAAVEYHEVTIKLWGKG-IVSRGKVRGSDVVSARNVVRHN 61 Query: 85 QILYGKLGPYLRKAIIADFD---GICSTQFL---VLQPKDVLPELLQGWLLSIDVTQRIE 138 Q++ K+ + + I S F P P + + S + Sbjct: 62 QLILSKIDARNGAIGMVPPELDGAIVSNDFPSFEFRDPGRCNPAFIGWLVRSAPFVELCR 121 Query: 139 AICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + EG T + + +PPL+ Q I + A T +I + + Sbjct: 122 SASEGTTNRVRIKEERFLAQEIALPPLSHQHAIAASLDALTDKIRQVEAHLDAADAASAD 181 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + ++ + + V + + V ++ I Sbjct: 182 LL-----LSLHHQHAAGRSVRLGDVMDLHEVDEPITPAGTYPQVGVRGFGGGLFAKAAI- 235 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 +Y + + G IV + + A + ++ Sbjct: 236 ----------------SGTDTTYRAFHKLYEGAIVLSQVKGWEGALARCPADLAG-WFVS 278 Query: 318 SAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPI 372 Y +P S YL ++R+ + G+ R+ + E + + +P + Sbjct: 279 PEYRTFRCRPDRAHSEYLGEIVRTQWFWQKLQDATRGVGARRERTRPEQFLNIEMTMPSL 338 Query: 373 KEQFDITNVI 382 +Q I V+ Sbjct: 339 DDQRRIVEVL 348 >gi|265752103|ref|ZP_06087896.1| restriction modification system DNA specificity subunit [Bacteroides sp. 3_1_33FAA] gi|263236895|gb|EEZ22365.1| restriction modification system DNA specificity subunit [Bacteroides sp. 3_1_33FAA] Length = 173 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 54/166 (32%), Gaps = 4/166 (2%) Query: 22 PKHWKVVPIKRFTKLNTGR--TSESGKDII--YIGLEDVESGTGKYLPKDGNSRQSDTST 77 P W + NTG+ S + + I Y+ +V + + Sbjct: 1 PVGWIETILGELFSHNTGKALNSSNKEGIFKDYLTTSNVYWNKFDFTAIKQMPFKESELN 60 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 KG +L + G R AI IC + + + + + + Sbjct: 61 KCTVTKGDLLVCEGGDIGRSAIWNYDYDICIQNHIHRLRPKIDLCVPFYYYTFAYLKENN 120 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 +G + + I MP+PPLAEQ I +KI +D Sbjct: 121 LIGGKGIGLLGLSSNALHKIEMPLPPLAEQQRIVQKIEELFSVLDN 166 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 42/151 (27%), Gaps = 4/151 (2%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K + S Y N + M K ES V G+++ Sbjct: 26 KEGIFKDYLTTSNVYWNKFDFTAIKQMPFK-ESELNKCTVTKGDLLVCEGGDIGRSAIW- 83 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 I + ++P + +Y L + ++ Sbjct: 84 --NYDYDICIQNHIHRLRPKIDLCVPFYYYTFAYLKENNLIGGKGIGLLGLSSNALHKIE 141 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + +PP+ EQ I I + +D + +E Sbjct: 142 MPLPPLAEQQRIVQKIEELFSVLDNIQNALE 172 >gi|303236699|ref|ZP_07323280.1| type I restriction modification DNA specificity domain protein [Prevotella disiens FB035-09AN] gi|302483203|gb|EFL46217.1| type I restriction modification DNA specificity domain protein [Prevotella disiens FB035-09AN] Length = 445 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 41/399 (10%), Positives = 114/399 (28%), Gaps = 34/399 (8%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +VP++ + D + +E + TGK + +D + + +G + Sbjct: 37 LVPLRELIAPKKNVIKKEEYDGLLPIVEKIVFKTGKVVFRDKKATGM---NLYSLQQGDL 93 Query: 87 LYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 L + + + F I ST + L ++ +L+ + + +I G Sbjct: 94 LISNINFHQGATALNTFGEIAASTHYQPYSIN--LNKVDPEFLVMVLRSSYFLSIISGKK 151 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + N + + KI+A E ++ + + Sbjct: 152 AQGIKNESGYNFIGSFSIPLPTLKEQRKIVALYKAKMENAENSASKAEQAEQAINSYLLD 211 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES----------- 254 ++ G + + + H + + + E + ++L + Sbjct: 212 VLDIGKGNEDGEDILSNAYKYMRFVHRKNISRWDVYNEKSIVKSRLYKHTNLLNVVIDKP 271 Query: 255 -------------NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + + +I + + + Y + ++ + + Sbjct: 272 QYGAAYSSQVFDGKMRYIRITDINEDGSLNEEKVSAKGYSDHYLLKENDFLIARSGNTVG 331 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLK 358 K L + + I + K YL + + + ++ Sbjct: 332 KTFLYKNKFG-KAIFAGYLIRFKLDETKVIPEYLLAYTKCALYKEWIKGNMRVSAQPNIN 390 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + P+++P + Q I + + I+ L + + Sbjct: 391 SQQYLDSPIILPSLDVQSKIVEYVGKQKDEINTLRQLAQ 429 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 17/122 (13%), Positives = 43/122 (35%), Gaps = 2/122 (1%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + G+++ I+ +L + + Y + +D +L ++RS + Sbjct: 88 LQQGDLLISNINFHQGATALNTFGEIAASTHYQPYSINL-NKVDPEFLVMVLRSSYFLSI 146 Query: 346 FY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 G++ + + + +P +KEQ I + + + K EQ+ + Sbjct: 147 ISGKKAQGIKNESGYNFIGSFSIPLPTLKEQRKIVALYKAKMENAENSASKAEQAEQAIN 206 Query: 405 ER 406 Sbjct: 207 SY 208 >gi|282878312|ref|ZP_06287105.1| type I restriction modification DNA specificity domain protein [Prevotella buccalis ATCC 35310] gi|281299567|gb|EFA91943.1| type I restriction modification DNA specificity domain protein [Prevotella buccalis ATCC 35310] Length = 195 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 58/186 (31%), Gaps = 14/186 (7%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL---SYGNIIQKLETRNMG 274 + E +P +W + + ++R + N Sbjct: 9 QCIAEEIPFEIPVNWAWVRLDDICSFIHRGKSPKYSLIKKYPVVAQKCNQWSGFSLEKAK 68 Query: 275 LKP----ESYETYQIVDPGEIVFRFIDLQN---DKRSLRSAQVMERGIITSAYMAVKPHG 327 SY+ I+ ++++ L + + + S ++P+ Sbjct: 69 FIEPQSISSYKEEYILQDEDLMWNSTGLGTLGRMAIYYKKLNPYKLAVADSHVTVIRPYK 128 Query: 328 ID--STYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 S YL + S + V + GS ++ L + VK V +PP++EQ I + Sbjct: 129 QHIVSEYLYYYFASNTVQSVIEDKSDGSTKQKELSTKTVKSYLVPLPPMEEQKRIVEKVK 188 Query: 384 VETARI 389 + Sbjct: 189 ELMQLL 194 Score = 43.2 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 53/179 (29%), Gaps = 17/179 (9%) Query: 20 AIPKHWKVVPIKRFTKLN-TGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP +W V + G++ + K + + +G L K S Sbjct: 18 EIPVNWAWVRLDDICSFIHRGKSPKYSLIKKYPVVAQKC-NQWSGFSLEKAKFIEPQSIS 76 Query: 77 TVS---IFAKGQILYGKL--GPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQG 126 + I +++ G R AI + + V++P Sbjct: 77 SYKEEYILQDEDLMWNSTGLGTLGRMAIYYKKLNPYKLAVADSHVTVIRPYKQHIVSEYL 136 Query: 127 WLLSIDVTQR---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + T + + K + + +P+PP+ EQ I EK+ + Sbjct: 137 YYYFASNTVQSVIEDKSDGSTKQKELSTKTVKSYLVPLPPMEEQKRIVEKVKELMQLLK 195 >gi|269978354|gb|ACZ55911.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 205 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 15/117 (12%), Positives = 45/117 (38%), Gaps = 2/117 (1%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSG 352 I + + ++ ++ P + YL +++ + + S Sbjct: 65 NTITIAQYGTAGFVNWQNQKFWANDVCFSLIPKETLINRYLYYVLTNMQNHLYSISNRSA 124 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI-VLLKERRS 408 + S+ ++ ++ + +PP++ Q +I +++ T L ++ + LK R+ Sbjct: 125 IPYSISSNNIMQITIPIPPLEIQQEIVKILDAFTELNTELNTELNTELNTELKARKK 181 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 44/162 (27%), Gaps = 11/162 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + ++ G+ + + GKY G Sbjct: 13 PKGVEFRKLGEVCEIIRGKRVTKKEIL----------DKGKYPVVSGGIGFMGYLNEYNR 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + G + + L PK+ L ++L+ Sbjct: 63 EENTITIAQYG-TAGFVNWQNQKFWANDVCFSLIPKETLINRYLYYVLTNMQNHLYSISN 121 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 A I I +PIPPL Q I + + A T Sbjct: 122 RSAIPYSISSNNIMQITIPIPPLEIQQEIVKILDAFTELNTE 163 >gi|301633575|gb|ADK87129.1| type I restriction modification DNA specificity domain protein [Mycoplasma pneumoniae FH] Length = 363 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 54/387 (13%), Positives = 111/387 (28%), Gaps = 49/387 (12%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV--SIFAKGQI 86 IK + GR I E +++ +GKY + + + G+ Sbjct: 7 KIKDICDIQRGR---------GITKEYIKNNSGKYPVYSAATTNNGELGFINTYDFAGEY 57 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 + Y + S V K E+ +L + + + + Sbjct: 58 VTWTTNGYAGVVFYRNGKFSASQDCGV--LKVRNKEINAQFLAFALSLKTPQFVHNLGSR 115 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 + + I + PPL Q I + + L E I+ + L++ Sbjct: 116 PKLNRNVVAEISLDFPPLEVQEKIAHFLKSFNELSSQLKAELIKRQKQYAFYSDYLLN-- 173 Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 K S E L + ++ +K E + + Sbjct: 174 ----------PKHSQGEEYKLF-----------KLKDIAKKILVGGEKPSDFQKEKDQVY 212 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K + K + + Y + + ++ ++ KP Sbjct: 213 KYPILSNSRKADDFLGYSKTFRIAEKSITVSARGTIGAVFYRDFSYLPAVSLICFIPKPE 272 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE- 385 + +L +++ K GSG L K V +P +K+Q +I ++ Sbjct: 273 F-NIKFLFHALKATKFHKQ----GSGT-GQLTMAQFKEYQVYIPSLKKQQEIAATLDPLY 326 Query: 386 --TAR----IDVLVEKIEQSIVLLKER 406 A I +E ++ + +ER Sbjct: 327 YIFANSNWGIYKEIELRKKQMQYYQER 353 Score = 43.6 bits (101), Expect = 0.068, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 47/166 (28%), Gaps = 16/166 (9%) Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQ-------IVDPGEIVFRFIDLQNDKR 303 +I + G I K +N K Y ++ + ++ + Sbjct: 6 YKIKDICDIQRGRGITKEYIKNNSGKYPVYSAATTNNGELGFINTYDFAGEYVTWTTNGY 65 Query: 304 SLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + + + + + S + + +GS R L V Sbjct: 66 AGVVFYRNGKFSASQDCGVLKVRNKEINAQFLAFALSLKTPQFVHNLGS--RPKLNRNVV 123 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + PP++ Q I + + L L+K ++ Sbjct: 124 AEISLDFPPLEVQEKIAHFLKSFNELSSQLKA------ELIKRQKQ 163 >gi|190606538|ref|YP_001974823.1| hypothetical protein -pVEF3_p54 [Enterococcus faecium] gi|190350308|emb|CAP62660.1| hypothetical protein [Enterococcus faecium] Length = 382 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 53/388 (13%), Positives = 111/388 (28%), Gaps = 44/388 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + TK+ G+ K SG ++ G +S Sbjct: 16 EYKNLVEITKVLRGKRLTRDK----------LSGDERFPVFHGGLDPLGYYGLSNRPANS 65 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 ++ +G +D + S +Q D+L + I + + A Sbjct: 66 VMIINVGASAGTVGYSDVEFWSSDGCYCIQHSDLLDNK-FLYYFLIGQQHLLRSKVRFAG 124 Query: 146 MSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + D I I +PIP L Q I + T L E + K++ Sbjct: 125 IPTLDANVIEKIKIPIPCPDNPEKSLEIQAEIVRILDTFTELTAELTAELTAELTARKKQ 184 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 ++T + +EW KP + + +N Sbjct: 185 YNYYREQLLT--------FEKGEVEW----------KPLGKIADYEQPTKYLVKSTNYSD 226 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 ++ +T +G E Y I+F N + Sbjct: 227 NFDTPVLTAGKTFILGYTDEISGIYSASKSPVIIFDDFTTANK---WVDFDFKAKSSAMK 283 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + Y+ + + + + G + + + +PP+ EQ I Sbjct: 284 MITSKNESKVLLKYIYYWINTLPNDLIV-----GDHKRQWISNYSNKLIPIPPLGEQTRI 338 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKER 406 ++++ A + E + + I L +++ Sbjct: 339 VSILDKFEALTSSITEGLPREIELRQKQ 366 >gi|111657645|ref|ZP_01408377.1| hypothetical protein SpneT_02001155 [Streptococcus pneumoniae TIGR4] Length = 231 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 52/147 (35%), Gaps = 9/147 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +S + KG L + R I+ I + ++ L + ++LS Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSS 202 Query: 132 D-VTQRIEAICEGATMSHADWKGIGNI 157 + V + ++ GA + + + + +I Sbjct: 203 NVVYSQFLSLISGAVVKNLNSDKVASI 229 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 58/155 (37%), Gaps = 8/155 (5%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE F LV + + + I+ + S G K+ G K + Sbjct: 77 EIDVPYDIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINN 136 Query: 281 ETYQIVDPG-----EIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYL 333 +I G + L N R + G I + ++ + ++ YL Sbjct: 137 VKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYL 196 Query: 334 AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPV 367 +++ S + F ++ SG ++L + V + + Sbjct: 197 FYILSSNVVYSQFLSLISGAVVKNLNSDKVASILI 231 >gi|268610918|ref|ZP_06144645.1| type I restriction/modification specificity protein [Ruminococcus flavefaciens FD-1] Length = 238 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 53/186 (28%), Gaps = 12/186 (6%) Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIVFRF 295 + K + + ++ ++ + R++ ++ + + I Sbjct: 56 CGKTPSTKKKEYYGDYMPFITIPDMHNNVYVIATERSLSKMGSDSQSKKTLPANSICVSC 115 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I + + I + + Y+ LM+++ Sbjct: 116 IGTAGLVTLVAVNSQTNQQINS----IIPKERYSPYYIFLLMQTFSEKINRLGQSGSTIV 171 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +L + LVP I + D + + + + L R + + + Sbjct: 172 NLNKAQFGLMEALVPSINDMNDF----DTTVKPLFERILANQYENQRLAALRDTLLPKLM 227 Query: 416 TGQIDL 421 G+ID+ Sbjct: 228 NGEIDV 233 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 59/191 (30%), Gaps = 9/191 (4%) Query: 22 PKHWKVVPIKRFT-KLNTGRTSESGK------DIIYIGLEDVESGTGKY-LPKDGNSRQS 73 P W + I + + G+T + K + +I + D+ + + + S Sbjct: 39 PDDWSIGTISDLSRDIICGKTPSTKKKEYYGDYMPFITIPDMHNNVYVIATERSLSKMGS 98 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 D+ + I +G + + + Q + PK+ L+ Sbjct: 99 DSQSKKTLPANSICVSCIGT-AGLVTLVAVNSQTNQQINSIIPKERYSPYYIFLLMQTFS 157 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G+T+ + + G + +P + + + RI E R Sbjct: 158 EKINRLGQSGSTIVNLNKAQFGLMEALVPSINDMNDFDTTVKPLFERILANQYENQRLAA 217 Query: 194 LLKEKKQALVS 204 L L++ Sbjct: 218 LRDTLLPKLMN 228 >gi|167768810|ref|ZP_02440863.1| hypothetical protein ANACOL_00127 [Anaerotruncus colihominis DSM 17241] gi|167668982|gb|EDS13112.1| hypothetical protein ANACOL_00127 [Anaerotruncus colihominis DSM 17241] Length = 291 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 4/174 (2%) Query: 227 LVPDHWEVKPFFALVTEL----NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 +PD WE F + T R + I+ S + ++ ++ + Sbjct: 117 ELPDGWEWCNFSMIGTTNLGLTYRPTDIEPDGVIVLRSCNIVNDPIDLSDLVRVKTTIRK 176 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 Q +I+ + + + Y+ +RS Sbjct: 177 NQYAQKNDILICARNGSRVLVGKCALISNLGEAASFGAFMAIYRTEYFEYIVQHLRSSFF 236 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 VF S L + +KR V +PP EQ IT +I+ ++ + +++ Sbjct: 237 RSVFDDSNSTAINQLTQDMLKRAVVPLPPASEQRRITEMIDATLFELNQMEKRL 290 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 51/173 (29%), Gaps = 11/173 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +P W+ N G T +I + ++ + D ++ Sbjct: 117 ELPDGWEWCNFSMIGTTNLGLTYRPTDIEPDGVIVLRSCNIVNDPIDL--SDLVRVKTTI 174 Query: 76 STVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 K IL + A+I++ S + + E + L S Sbjct: 175 RKNQYAQKNDILICARNGSRVLVGKCALISNLGEAASFGAFMAIYRTEYFEYIVQHLRSS 234 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + ++ + +P+PP +EQ I E I A ++ + Sbjct: 235 FFRSVFDDS-NSTAINQLTQDMLKRAVVPLPPASEQRRITEMIDATLFELNQM 286 >gi|304436279|ref|ZP_07396260.1| 50S ribosomal protein L10 [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304370727|gb|EFM24371.1| 50S ribosomal protein L10 [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 212 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/205 (11%), Positives = 61/205 (29%), Gaps = 19/205 (9%) Query: 225 VGLVPDHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 + +P W+ + + + + + ++ L + Q N Sbjct: 14 ISDIPKGWQEGYLTDIAEYLNGLAMQKFRPQNEQESLPVLKIKELRQGQCDINSERCSLD 73 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + I+ G+++F + L + V + + Sbjct: 74 IKPQYIIHDGDVIFSWSGSL-----LVDFWCGGICGLNQHLFKVHSKQYAP--WLYYSWT 126 Query: 340 YDLCKVFYAMGS---GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 F AM + +K + ++ VL+P + I + + + Sbjct: 127 KYYLAEFVAMAADKATTMGHIKRDALENARVLIPCSDDYLKI----EEQLQPLYDAIIAH 182 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDL 421 I L R++ + ++G+ID+ Sbjct: 183 RVEIRKLSTLRNTLLPRLMSGEIDV 207 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 57/193 (29%), Gaps = 10/193 (5%) Query: 18 IGAIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 I IPK W+ + + G R + + + ++++ G + Sbjct: 14 ISDIPKGWQEGYLTDIAEYLNGLAMQKFRPQNEQESLPVLKIKELRQGQCDINSERC--- 70 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 D I G +++ G L G + + K P L W Sbjct: 71 SLDIKPQYIIHDGDVIFSWSGSLLVDFWCGGICG-LNQHLFKVHSKQYAPWLYYSWTKYY 129 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 A + TM H + N + IP + + I E++ I E + Sbjct: 130 LAEFVAMAADKATTMGHIKRDALENARVLIPCSDDYLKIEEQLQPLYDAIIAHRVEIRKL 189 Query: 192 IELLKEKKQALVS 204 L L+S Sbjct: 190 STLRNTLLPRLMS 202 >gi|32455520|ref|NP_862272.1| HsdS' [Lactobacillus sakei] gi|24461247|gb|AAN61994.1|AF438419_4 HsdS' [Lactobacillus sakei] Length = 151 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 13/113 (11%), Positives = 38/113 (33%), Gaps = 7/113 (6%) Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + + V ++ + + + G D ++ + Sbjct: 34 YDGFHDAAKVQGPGVITGRSGTLGSVY----FNTTDFWPLNTTLFVSNFKGNDPLFVYYY 89 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +++ DL + +L + ++ V +P + +Q +I N +N+ +I Sbjct: 90 LKTMDLGRY---ATGTTVPTLNRNHLDQIKVNIPDLAQQREIANKLNLFDEKI 139 >gi|217032123|ref|ZP_03437623.1| hypothetical protein HPB128_16g83 [Helicobacter pylori B128] gi|216946271|gb|EEC24879.1| hypothetical protein HPB128_16g83 [Helicobacter pylori B128] Length = 83 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 14/85 (16%), Positives = 33/85 (38%), Gaps = 4/85 (4%) Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 M+ + + S+ + + +L+PP+ EQ I N+++ I L K Sbjct: 1 MKVNQNYLYEISNRNATPYSISKDKILDFEILLPPLNEQIAIANILSALDNEIISLKNKK 60 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDL 421 Q + + + ++ +I + Sbjct: 61 RQ----FENIKKALNHDLMSAKIRV 81 >gi|324994851|gb|EGC26764.1| hypothetical protein HMPREF9392_1667 [Streptococcus sanguinis SK678] Length = 216 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 56/171 (32%), Gaps = 4/171 (2%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-- 282 +GL + ++ ++ L +I LK Sbjct: 10 LGLRYKNLSDFSIGKGTYGISASAVGKDDNLPTYLRITDINDDGTINFASLKSVDRSDAD 69 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYD 341 + P +IVF + + E ++ P ++ + +S + Sbjct: 70 KYRLQPNDIVFARTGGSTGRSYFYDGKDGEFVFAGFLIKFSIDPQKCIPKFIKYYCQSRE 129 Query: 342 LCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + +G R ++ + +++P+ P+++Q I ++++ +I+ Sbjct: 130 YYNWVASFNTGSTRGNINAKTFEKMPIPDLPLEQQQLIVDILSPIDDKIEN 180 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 60/177 (33%), Gaps = 15/177 (8%) Query: 29 PIKRFTKLNTGR---------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K + + G+ + Y+ + D+ + +SD Sbjct: 13 RYKNLSDFSIGKGTYGISASAVGKDDNLPTYLRITDINDDGTINFASLKSVDRSDADKYR 72 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVL--QPKDVLPELLQGWLLSIDVT 134 + I++ + G ++ D + + + P+ +P+ ++ + S + Sbjct: 73 L-QPNDIVFARTGGSTGRSYFYDGKDGEFVFAGFLIKFSIDPQKCIPKFIKYYCQSREYY 131 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + G+T + + K +P+P PL +Q LI + + +I+ Sbjct: 132 NWVASFNTGSTRGNINAKTFEKMPIPDLPLEQQQLIVDILSPIDDKIENNKKINHHL 188 >gi|313678683|ref|YP_004056423.1| type I restriction modification system, S subunit [Mycoplasma bovis PG45] gi|312950104|gb|ADR24699.1| type I restriction modification system, S subunit [Mycoplasma bovis PG45] Length = 438 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 74/242 (30%), Gaps = 16/242 (6%) Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV---SYIVTKGLNPDVKMKDS 220 L +Q + + I + + +L K L+ + ++ Sbjct: 200 LVKQDQNNDSVDNLINEIYKEKQKLVEQGKLKKADLNNLIIYKNDNDNSYYEKFENGREE 259 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNMGL 275 IE +P +W F +V K I +S ++I+ + ++ Sbjct: 260 KIEVPFEIPYNWIWSRFNKVVNFKIGKTPPTNDLSFWNGKIPWVSISDMIKNSKIKSTKK 319 Query: 276 KPE-----SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 SY +V ++ F L V GII+ K + Sbjct: 320 FISKKALSSYFNNNLVKKETLIMSFKLTVGKTSILGIDAVHNEGIISIYPYFDKNNLFRD 379 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + +L A+ G L + + +L + +PP+KEQ I I + Sbjct: 380 FLMLFLPIFSQFGDKKEAIKGGT---LNTKSLSKLLIPIPPLKEQQRIVENITKIQKLLK 436 Query: 391 VL 392 L Sbjct: 437 NL 438 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 60/172 (34%), Gaps = 8/172 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLED-VESGTGKYLPKDGNSRQ 72 IP +W + G+T + I ++ + D +++ K K + + Sbjct: 266 EIPYNWIWSRFNKVVNFKIGKTPPTNDLSFWNGKIPWVSISDMIKNSKIKSTKKFISKKA 325 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-SI 131 + + K + L + K I D + + + + P L + +L+ + Sbjct: 326 LSSYFNNNLVKKETLIMSFKLTVGKTSILGIDAVHNEGIISIYPYFDKNNLFRDFLMLFL 385 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + E + K + + +PIPPL EQ I E I + Sbjct: 386 PIFSQFGDKKEAIKGGTLNTKSLSKLLIPIPPLKEQQRIVENITKIQKLLKN 437 >gi|268609384|ref|ZP_06143111.1| putative type I restriction-modification system specificity subunit [Ruminococcus flavefaciens FD-1] Length = 367 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 49/401 (12%), Positives = 108/401 (26%), Gaps = 43/401 (10%) Query: 23 KHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 WK + L++G+ + + S T +Y GN + TS + F Sbjct: 3 SEWKEYELGNICSRLSSGKGIK----------AAMISDTAEYAVYGGNGIRGYTSDYN-F 51 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G+ G Y + ++ ++L+ + + Sbjct: 52 EGDCAIIGRQGAYCGNVRYFSGKAYMTEHAVIACANSEHNTRYLSYVLTA---MDLGRLS 108 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + K + + +P L Q I I ++ I E L+++ QA Sbjct: 109 GQSAQPGISVKTLSIQKVKMPSLNLQRKIVAVI----SSLEEKIELNAAINENLEQQAQA 164 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L +++ + G P + + + K+ Sbjct: 165 LFKDMISDVQEQVPFTSVIQV-LGGGTPKTGNQEYWNGEIPFFTPKD------------V 211 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 GN +++ ++ + + S Y Sbjct: 212 GNPYVLTTEKSITPLGLDNCNSRLYPVNTVFLTARGTVGKVSLAGVPM----AMNQSCYA 267 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL-VPPIKEQFDITN 380 G+ + + + + + + ++ D V + P EQ N Sbjct: 268 LAGKDGLHQIIVYHYVL-ETVKALKHKASGAVFDAIITRDFDTENVPALSP--EQIK--N 322 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I I + L R + + + G+ID+ Sbjct: 323 YI-AFAEPIYNEILNRSVENQRLATLRDTLLPKLMNGEIDV 362 >gi|257464673|ref|ZP_05629044.1| Type I restriction-modification system, S subunit/Type I restriction modification DNA specificity [Actinobacillus minor 202] gi|257450333|gb|EEV24376.1| Type I restriction-modification system, S subunit/Type I restriction modification DNA specificity [Actinobacillus minor 202] Length = 360 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 64/197 (32%), Gaps = 4/197 (2%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 SG + +G IP W V+ K F + + +D+ + + G K S Sbjct: 163 SGYKNLGEIPIGWNVLTFKDFISESKEKVGSL-EDVPEYSVGN--EGIYPRSEKYNKSLS 219 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG-WLLSI 131 + G +++G L I+ D G S + V + + + ++ + Sbjct: 220 KTPEKNKVVRIGDLVFGMGSKTLNWGIMNDEIGSVSPAYFVYRIFTNINYIYLNKYIKAK 279 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + D + + +P + K+ + + TE + Sbjct: 280 EYDFQNLIKPTSRQGQSVDKEMFLKKEIYVPNEYLLDIYLNKLKEIDSLVYSYTTEVLIL 339 Query: 192 IELLKEKKQALVSYIVT 208 ++ E L+S V Sbjct: 340 EQIRDELLPKLLSGEVF 356 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 48/364 (13%), Positives = 107/364 (29%), Gaps = 60/364 (16%) Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 + L K+ + + + + + ++ GA I + P + Sbjct: 2 AFNQSCYGLNGKENIIDNGFLYYFLKNNIKELKQKTHGAVFDTITRDTFEYIEIYYPDIK 61 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV---------------------- 203 Q I E + +I ++ + ++ Sbjct: 62 RQKEIAEILEDYDQKIQLNTQINQTLEQIAQTIFKSWFIDFDPVHAKANALANGQTLEQA 121 Query: 204 -------------------------SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFF 238 Y + SG + +G +P W V F Sbjct: 122 TQAAMAVISGKNTQELHRLQTANPEQYQQLWEIAEAFPSGFSGYKNLGEIPIGWNVLTFK 181 Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 ++E K L + S+ I + E N L ++ E ++V G++VF Sbjct: 182 DFISESKEKVGSLEDVPEYSVGNEGIYPRSEKYNKSL-SKTPEKNKVVRIGDLVFGMGSK 240 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL---AWLMRSYDLCKVFYAMGSGLRQ 355 + + E G ++ AY + + + YD + + Sbjct: 241 TLNWGIM----NDEIGSVSPAYFVYRIFTNINYIYLNKYIKAKEYDFQNLIKPTSRQGQ- 295 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 S+ E + + VP + ++ + ID LV +++L++ R + + Sbjct: 296 SVDKEMFLKKEIYVPN----EYLLDIYLNKLKEIDSLVYSYTTEVLILEQIRDELLPKLL 351 Query: 416 TGQI 419 +G++ Sbjct: 352 SGEV 355 >gi|160894147|ref|ZP_02074925.1| hypothetical protein CLOL250_01701 [Clostridium sp. L2-50] gi|156864180|gb|EDO57611.1| hypothetical protein CLOL250_01701 [Clostridium sp. L2-50] Length = 230 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/209 (11%), Positives = 62/209 (29%), Gaps = 20/209 (9%) Query: 226 GLVPDHWEVKPFFA------LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 G PD W + K ++ N ++ ++ + + Sbjct: 28 GTKPDDWSDGTIDDLGTEIICGKTPSTKKSEYYGGNTPFITIPDMHGCVYIVSTERYLSD 87 Query: 280 ----YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + + + P + I + + I + + GI Y+ Sbjct: 88 AGVASQPKKTLPPNTVCVSCIGTAGLVTLVSEESQSNQQINS----IIPKEGISVYYIYL 143 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVE 394 LM++ +L ++ V++P + Q + + + Sbjct: 144 LMQTLADTINKLGQSGSTIVNLNKTQFGKIQVMIPSELVLQD-----FDSLCRPLFDTIL 198 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 ++ + L E R + + ++G++D+ Sbjct: 199 SNQKENINLSELRDALLPKLMSGELDVSD 227 >gi|145633240|ref|ZP_01788971.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae 3655] gi|229845101|ref|ZP_04465236.1| type I restriction/modification specificity protein [Haemophilus influenzae 6P18H1] gi|144986086|gb|EDJ92676.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae 3655] gi|229811937|gb|EEP47631.1| type I restriction/modification specificity protein [Haemophilus influenzae 6P18H1] Length = 431 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 62/186 (33%), Gaps = 10/186 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + F LVT+ + K E + ++ NI+ + I Sbjct: 5 EFIPASEFCDLVTDGTHDSPKKTEFGVKLVTSKNIVGGKLDLTSAYFISESDAQNINKRS 64 Query: 290 EIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 ++ + + + + +I + + + +L + ++S + Sbjct: 65 QVHINDVLLSMIGTVGEVALIEKEPDFVIKNVGLLKNSDPKKAKWLYYYLKSPIAQNLIK 124 Query: 348 -AMGSGLRQSLKFEDVKRLPVLVPPIKE--QFDITNVINVETARIDVLVEKIEQSIVLLK 404 + +Q + +++ LP+L P +E Q I + + +D ++ Q L+ Sbjct: 125 DRLRGTTQQYIPLGELRNLPILKPNSEEHLQNTI-----EQLSSLDKKIQLNTQINQTLE 179 Query: 405 ERRSSF 410 + + Sbjct: 180 QIAQAL 185 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 58/439 (13%), Positives = 132/439 (30%), Gaps = 57/439 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS--DTSTVS 79 + +P F L T T +S K + + +++ G S + + S Sbjct: 5 EFIPASEFCDLVTDGTHDSPKKTEFGVKLVTSKNIVGGKLDLTSAYFISESDAQNINKRS 64 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +L +G A+I + L+ + L +L S I+ Sbjct: 65 QVHINDVLLSMIGTVGEVALIEKEPDFVIKNVGLLKNSDPKKAKWLYYYLKSPIAQNLIK 124 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G T + + N+P+ P E + I + +D I + + L++ Sbjct: 125 DRLRGTTQQYIPLGELRNLPILKPNSEEHLQNT---IEQLSSLDKKIQLNTQINQTLEQI 181 Query: 199 KQALVS-------------YIVTKGLNPDVKMKDSGIEWVGLVPD------HWEVKPFFA 239 QAL ++ GL+ + + G P+ + + Sbjct: 182 AQALFKSWFVDFDPVRAKVQALSDGLSLEQAELAAIQAISGKTPEELTALSQTQPDRYTE 241 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVF----- 293 L +++E + ++ G +++++ + + Y + G + Sbjct: 242 LAETAKAFPCEMVEVDGGEVTKGWEVKRIDEVIQKIPVGKKYSSKTAFSEGLVPILDQGR 301 Query: 294 -RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID------------STYLAWLMRSY 340 I NDK ++++ + + ++ D + + + Sbjct: 302 SGVIGYHNDKPGVKASIEDPIIVFANHTCYMRLISYDFSAIQNVFAFKGTECNLYWLYLA 361 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 L K + G D ++VPP + ++I ++ Sbjct: 362 TLGKQEFVEYKGHFP-----DFLIKEIIVPPEELTELFGKYAKENFSKIF----INDREN 412 Query: 401 VLLKERRSSFIAAAVTGQI 419 L + R + + G I Sbjct: 413 SSLAKIRDLLLPKLLNGDI 431 >gi|291569502|dbj|BAI91774.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 255 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/260 (11%), Positives = 80/260 (30%), Gaps = 35/260 (13%) Query: 171 REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW--VGLV 228 + T L E + + +++ ++T ++ +EW +G Sbjct: 1 MRILDTFTALTAELTAELTAELTVRQKQYNYYRDQLLT--------FEEGEVEWKPLGE- 51 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQ 284 + ++ I ++ YG I ++ E + Sbjct: 52 --------IGEFIRGKRFTKADYVDDGIPAIHYGEIYTHYGVAASHTLSQVRAEMAASLC 103 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 +PG++V + + A + + H I+ +++++M++ Sbjct: 104 YAEPGDVVMTGVGETVEDVGKAVAWIGSEKVAIHDDSWAFRHSINPKFVSYVMQTTAFIN 163 Query: 345 VFYAM-GSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVETARIDVLVEKI 396 SG L +K++P+ +P ++EQ I +++ + E + Sbjct: 164 EKAKHVSSGKVNRLLINGIKKVPIPIPYPNDPKKSLEEQAHIVAILDKFDTLTHSISEGL 223 Query: 397 EQSI----VLLKERRSSFIA 412 I + R + Sbjct: 224 PHEIAWRQKQYEYYRDLLLT 243 Score = 44.4 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 28/238 (11%), Positives = 72/238 (30%), Gaps = 26/238 (10%) Query: 3 HYKAYPQYKD---SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLED 55 K Y Y+D + + G + + P+ + G+ I I + Sbjct: 25 RQKQYNYYRDQLLTFEE--GEV----EWKPLGEIGEFIRGKRFTKADYVDDGIPAIHYGE 78 Query: 56 VESGTGKYLPKDGNSRQSDT-STVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQ 110 + + G + +++ +++ G ++ +G + + + Sbjct: 79 IYTHYGVAASHTLSQVRAEMAASLCYAEPGDVVMTGVGETVEDVGKAVAWIGSEKVAIHD 138 Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP------- 163 + P+ + + + ++ GI +P+PIP Sbjct: 139 DSWAFRHSINPKFVSYVMQTTAFINEKAKHVSSGKVNRLLINGIKKVPIPIPYPNDPKKS 198 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG 221 L EQ I + + + I+E + ++K+ ++ + K S Sbjct: 199 LEEQAHIVAILD-KFDTLTHSISEGLPHEIAWRQKQYEYYRDLLLTFPKKEEKQCASD 255 >gi|262065806|ref|ZP_06025418.1| HsdS, type I site-specific deoxyribonuclease [Fusobacterium periodonticum ATCC 33693] gi|291380501|gb|EFE88019.1| HsdS, type I site-specific deoxyribonuclease [Fusobacterium periodonticum ATCC 33693] Length = 180 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 53/149 (35%), Gaps = 7/149 (4%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKP----ESYETYQIVDPGEIVFRFIDLQND 301 K + L NI E LK +S + ++ GE++F + + Sbjct: 30 SKKATSVVGEFPILRMNNITYSGEMNYKDLKYIELSDSEKEKFLLKKGELLFNRTNSKEL 89 Query: 302 KRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLK 358 + + ++P + I S +L + M S + K+ Y + ++ Sbjct: 90 VGKTGLFNLDIPMAFAGYLIKIRPSNLIHSKFLLFFMNSEFMKKLLYNKAKNIVGMANIN 149 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETA 387 ++++ +++PPI+ Q I Sbjct: 150 AKELEDFSIILPPIELQNKFAERIEKIEK 178 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 58/168 (34%), Gaps = 10/168 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTV 78 K+W++ + + G + ++ + + + ++ SG Y Sbjct: 12 KNWEIKKLGEVVQTQYGTSKKATSVVGEFPILRMNNITYSGEMNYKDLKYIELSDSEKEK 71 Query: 79 SIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + KG++L+ + D + + ++P +++ + ++ + Sbjct: 72 FLLKKGELLFNRTNSKELVGKTGLFNLDIPMAFAGYLIKIRPSNLIHSKFLLFFMNSEFM 131 Query: 135 QR--IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 ++ M++ + K + + + +PP+ Q E+I Sbjct: 132 KKLLYNKAKNIVGMANINAKELEDFSIILPPIELQNKFAERIEKIEKL 179 >gi|227547680|ref|ZP_03977729.1| possible type I site-specific deoxyribonuclease specificity subunit [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227211835|gb|EEI79731.1| possible type I site-specific deoxyribonuclease specificity subunit [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 172 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 62/176 (35%), Gaps = 12/176 (6%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLS-YGNIIQKLETRNMGLKPESYETYQIVDPGE 290 WE + + + RKN L++S +I + N + Y ++ GE Sbjct: 1 WEQRKLGEIAERVTRKNENNESDLPLTISAQHGLIDQRLFFNAQVASRDMSGYYLLRQGE 60 Query: 291 IVFRF-IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 + + +++ E+G +++ Y+ + YL + K + Sbjct: 61 FAYNKSTSADSPWGAIKRLTRYEKGCVSTLYICFALLNANPDYLVTYYETNRWHKAVQMI 120 Query: 350 GS-GLRQ----SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQS 399 + G R ++ +D V +P EQ I +R+D L+ ++ Sbjct: 121 AAEGARNHGLLNIAPDDFFDTMVSLPESQAEQQTIGAF----FSRLDSLITLHQRK 172 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 17/151 (11%), Positives = 41/151 (27%), Gaps = 10/151 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + T + + D+ I + + + D S + + Sbjct: 1 WEQRKLGEIAERVTRKNENNESDLPLTISAQHGLIDQRLFF--NAQVASRDMSGYYLLRQ 58 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G+ Y K G ST ++ + P+ L + + + ++ Sbjct: 59 GEFAYNKSTSADSPWGAIKRLTRYEKGCVSTLYICFALLNANPDYLVTYYETNRWHKAVQ 118 Query: 139 AICEGATMSH--ADWKGIGNIPMPIPPLAEQ 167 I +H + + Q Sbjct: 119 MIAAEGARNHGLLNIAPDDFFDTMVSLPESQ 149 >gi|320326659|gb|EFW82707.1| hypothetical protein PsgB076_00487 [Pseudomonas syringae pv. glycinea str. B076] Length = 452 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 18/95 (18%), Positives = 39/95 (41%), Gaps = 6/95 (6%) Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + P +DS +L + ++S L + +L+ +K L V PPI+ Q Sbjct: 98 QVLLRPNPDKVDSRFLLYALQSPYLQRQIGWNEGTGSTVSNLRIPVLKALKVPTPPIETQ 157 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +I++ + ID + + ++ L+ + Sbjct: 158 REISSTLGS----IDDRIALLRETNANLEAIAQAL 188 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 65/424 (15%), Positives = 130/424 (30%), Gaps = 47/424 (11%) Query: 44 SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF--AKGQILYGKLGPYLRKAIIA 101 + + I + ++++ G + K I+ + P I Sbjct: 28 TTEGFIVLRNQNIKGGRLDLAAPSYTDEAHYLGRIRRAAPQKDDIVITREAPMGEVCQIP 87 Query: 102 DFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRI-EAICEGATMSHADWKGIGNI 157 + C L P V L L S + ++I G+T+S+ + + Sbjct: 88 EDLKCCLGQRQVLLRPNPDKVDSRFLLYALQSPYLQRQIGWNEGTGSTVSNLRIPVLKAL 147 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS-----YIVTKGLN 212 +P PP+ Q I + + RI L + + ++ +GL Sbjct: 148 KVPTPPIETQREISSTLGSIDDRIALLRETNANLEAIAQALFKSWFVDFGPVRAKAEGLV 207 Query: 213 PDVK-------MKDSGIE-WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 P+ DS +E GLVP W + PF L+ + + + I Sbjct: 208 PEGMDEVTSGMFPDSFVESEQGLVPKGWRLVPFGELLIHTIGGDWGDETPGEKNYIHVAI 267 Query: 265 IQKLETRNM----------GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 I+ + ++ + + G++V D+ + R+ + E Sbjct: 268 IRGTDIPDLQSGAANRVPLRYTSTKKLATRKLQDGDLVLEVSGGSKDQPTGRALYLTEAL 327 Query: 315 IIT--------SAYMAVKPHGIDSTYLAWLMRSYDL---CKVFYAMGSGLRQSLKFEDV- 362 + S ++P ++ L +Y Y S + + Sbjct: 328 LGQFDCPVAPASFCRLLRPSDRNTGLLLAQHLTYIYGIGKTWEYQNQSTGIANFQTTHFL 387 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 K V VPP + +V+ R+ I L R + + ++GQ+ L Sbjct: 388 KNELVAVPPREVLAVFADVVRSIVDRV------HLSQIQNLASLRDALLPRLISGQLRLP 441 Query: 423 GESQ 426 + Sbjct: 442 VAEE 445 >gi|225026006|ref|ZP_03715198.1| hypothetical protein EUBHAL_00244 [Eubacterium hallii DSM 3353] gi|224956656|gb|EEG37865.1| hypothetical protein EUBHAL_00244 [Eubacterium hallii DSM 3353] Length = 215 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 17/129 (13%), Positives = 36/129 (27%), Gaps = 5/129 (3%) Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + G+I+F K + A D+ ++ + Sbjct: 86 YKLTEGDILFARTGASVGKSYIYKNSDGLVYYAGFLIRARIKEEYDTEFVFQNTLTDRYN 145 Query: 344 KVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 K + + ++ + VP +EQ I ID L+ ++ Sbjct: 146 KYIAVTSQRSGQPGVNAQEYAEFEIKVPKKEEQTKIGTY----FRNIDNLITLHQRKCNQ 201 Query: 403 LKERRSSFI 411 L+ R + Sbjct: 202 LQIIRKYML 210 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 57/191 (29%), Gaps = 10/191 (5%) Query: 23 KHWKVVPIKRFTK-LNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + W+ + G E + YI + D++ T ++L + S + + Sbjct: 24 EDWEQRKLGELASSFEYGLNAAAKEYDGENKYIRITDIDDNTHEFLTDNLTSPDIELTGA 83 Query: 79 --SIFAKGQILYGKLGPYLRKAIIADF----DGICSTQFLVLQPKDVLPELLQGWLLSID 132 +G IL+ + G + K+ I ++ E + L+ Sbjct: 84 DNYKLTEGDILFARTGASVGKSYIYKNSDGLVYYAGFLIRARIKEEYDTEFVFQNTLTDR 143 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I + + + + + +P EQ I I + + Sbjct: 144 YNKYIAVTSQRSGQPGVNAQEYAEFEIKVPKKEEQTKIGTYFRNIDNLITLHQRKCNQLQ 203 Query: 193 ELLKEKKQALV 203 + K + + Sbjct: 204 IIRKYMLKNMF 214 >gi|317488601|ref|ZP_07947145.1| type I restriction modification DNA specificity domain-containing protein [Eggerthella sp. 1_3_56FAA] gi|316912295|gb|EFV33860.1| type I restriction modification DNA specificity domain-containing protein [Eggerthella sp. 1_3_56FAA] Length = 182 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 55/182 (30%), Gaps = 14/182 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL---SYGNIIQKLETRNMGLKP-- 277 E +P+ WE + T + R + N Sbjct: 1 EIPFDIPEGWEWARLEGITTYIQRGKSPKYSLEKKYPVVAQKCNQWSGFSLERAKFVDPN 60 Query: 278 --ESYETYQIVDPGEIVFRFIDLQN---DKRSLRSAQVMERGIITSAY--MAVKPHGIDS 330 SY +++ G++++ L + + S + P + Sbjct: 61 SVASYAEERLLVDGDLLWNSTGLGTLGRMAVYDSNQNPYGWAVADSHVTVIRTVPDWLRY 120 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 Y + V SG ++ L E VKR + VPP+ EQ I +N+ A Sbjct: 121 EYAFLYFAGPSVQSVIEDQASGSTKQKELAQETVKRYLIPVPPLAEQRRIAERLNLILAN 180 Query: 389 ID 390 I+ Sbjct: 181 IN 182 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 62/179 (34%), Gaps = 17/179 (9%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP+ W+ ++ T + G++ + K + + +G L + + + Sbjct: 5 DIPEGWEWARLEGITTYIQRGKSPKYSLEKKYPVVAQKC-NQWSGFSLERAKFVDPNSVA 63 Query: 77 TV---SIFAKGQILYGKLG-PYLRKAIIAD------FDGICSTQFLVLQPKDVLPELLQG 126 + + G +L+ G L + + D + + V++ Sbjct: 64 SYAEERLLVDGDLLWNSTGLGTLGRMAVYDSNQNPYGWAVADSHVTVIRTVPDWLRYEYA 123 Query: 127 --WLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + V IE G+T + + +P+PPLAEQ I E++ I+ Sbjct: 124 FLYFAGPSVQSVIEDQASGSTKQKELAQETVKRYLIPVPPLAEQRRIAERLNLILANIN 182 >gi|293372408|ref|ZP_06618792.1| type I restriction modification DNA specificity domain protein [Bacteroides ovatus SD CMC 3f] gi|292632591|gb|EFF51185.1| type I restriction modification DNA specificity domain protein [Bacteroides ovatus SD CMC 3f] Length = 408 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 54/414 (13%), Positives = 117/414 (28%), Gaps = 34/414 (8%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQS-----DTST 77 K + + G + + YI L K+ S+ + D + Sbjct: 4 KKYKLGEILDVTRGASLSGEYYATEGEYIRLTCGNFDYQNNCFKENKSKDNLYYVGDFKS 63 Query: 78 VSIFAKGQIL-------YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + +G I+ G LG + ++ + + + + S Sbjct: 64 EFLMEEGDIITPLTEQAIGLLGSTAIIPESGKYIQSQDVAKIICKEDLLDKDFAFYLISS 123 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 V Q++ A + + H I + + IP L+EQ I + + + ID I Sbjct: 124 ALVKQQLSAAAQQTKIRHTSPDKIKDCTVWIPKLSEQKRIGKLLRS----IDRKIELNRA 179 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 + L+ + L Y + P+ + K ++ + + KN Sbjct: 180 INQNLEAMAKQLYDYWFVQFDFPNEEGKPYKSSGGKMIWNDRLKREIPVSWNNGTIKNFM 239 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 I + +S + + PE + + + G V + R Sbjct: 240 KIFTGKKDVSKAIP---GKYKFFSCAPEPITSNEFIYDGYAVLVSGNGSYTGR--VGFYK 294 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLV 369 + + Y V + Y +F + D+ Sbjct: 295 GKFDLYQRTYACVLDENQHDISFFYYTLKYLFQPIFSGGRHGSSIPYIVLGDLADFNFAF 354 Query: 370 PPIKEQFDITN--VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + +N + D + + I L ++R + + GQ+ + Sbjct: 355 ------NENVKNMFVNTVKSMFDEQL-LRQCEIEELTKQRDELLPLLMNGQVSV 401 Score = 36.7 bits (83), Expect = 8.1, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 50/183 (27%), Gaps = 20/183 (10%) Query: 10 YKDSGVQWIG------AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + I IP W IK F K+ TG+ +DV Sbjct: 209 YKSSGGKMIWNDRLKREIPVSWNNGTIKNFMKIFTGK-------------KDVSKAIPGK 255 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ-FLVLQPKDVLPE 122 + + TS I+ +L G Y + + + + ++ Sbjct: 256 YKFFSCAPEPITSNEFIYDGYAVLVSGNGSYTGRVGFYKGKFDLYQRTYACVLDENQHDI 315 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + L G+++ + + + + + + ++ Sbjct: 316 SFFYYTLKYLFQPIFSGGRHGSSIPYIVLGDLADFNFAFNENVKNMFVNTVKSMFDEQLL 375 Query: 183 TLI 185 Sbjct: 376 RQC 378 >gi|257458620|ref|ZP_05623755.1| type I restriction-modification system, S subunit [Treponema vincentii ATCC 35580] gi|257444054|gb|EEV19162.1| type I restriction-modification system, S subunit [Treponema vincentii ATCC 35580] Length = 197 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 57/180 (31%), Gaps = 12/180 (6%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + + E I + + Q + L + + ++ G+++F + Sbjct: 22 MQKYRPTSHEQGIFVMKIKELRQGFCDSSSELCSNTVNPFYLIHNGDVIFSWSGSL---- 77 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFED 361 L + V + + + + + L + +K E+ Sbjct: 78 -LVDFWCGGLCGLNQHLFKVTSNKY-AKWFYYCWTKFHLHHFITEAADKATTMGHIKREN 135 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + V++P + + +I L+ I L R + + ++G+ID+ Sbjct: 136 LAKAEVVIPT----KQVYLTVGDLLGQIYNLMIANRIEINTLSALRDTLLPKLMSGEIDV 191 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 45/186 (24%), Gaps = 4/186 (2%) Query: 22 PKHWKVVPIKRFTKLNTG---RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 P W+ + G + I + ++ + + + Sbjct: 2 PDDWQKACLLDIADYTNGLAMQKYRPTSHEQGIFVMKIKELRQGFCDSSSELCSNTVNPF 61 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + G +++ G L G + + W E Sbjct: 62 YLIHNGDVIFSWSGSLLVDFWCGGLCG-LNQHLFKVTSNKYAKWFYYCWTKFHLHHFITE 120 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 A + TM H + + + IP + + + + + E L Sbjct: 121 AADKATTMGHIKRENLAKAEVVIPTKQVYLTVGDLLGQIYNLMIANRIEINTLSALRDTL 180 Query: 199 KQALVS 204 L+S Sbjct: 181 LPKLMS 186 >gi|227889413|ref|ZP_04007218.1| possible type I restriction enzyme S protein [Lactobacillus johnsonii ATCC 33200] gi|227850030|gb|EEJ60116.1| possible type I restriction enzyme S protein [Lactobacillus johnsonii ATCC 33200] Length = 180 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 62/182 (34%), Gaps = 9/182 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR----NMGLKPESYETYQI 285 ++ ++ ++ K+ K S + + N+ + L + I Sbjct: 2 EYIKLGAICDVINGYAFKSKKYSTSGVRIIRITNVQKGYVEDASPVYYPLNTINELKKYI 61 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC-K 344 + G+++ L + A + K I YL +++ + K Sbjct: 62 LYSGDLLISLTGNVGRVAILDKKYLPAFLNQRVACIRPKSDKILKEYLFYMLNTNLFEVK 121 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + ++++ E +K V +P I+ Q + +++ +++ V + + L Sbjct: 122 SINSSKGIAQKNISTEWLKNYVVPLPSIEIQQHLISIL----KKLEKAVRNKKHELRALD 177 Query: 405 ER 406 + Sbjct: 178 KL 179 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 58/178 (32%), Gaps = 9/178 (5%) Query: 26 KVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKY-LPKDGNSRQSDTSTVSI 80 + + + + G +S K + I + +V+ G + P + I Sbjct: 2 EYIKLGAICDVINGYAFKSKKYSTSGVRIIRITNVQKGYVEDASPVYYPLNTINELKKYI 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPK--DVLPELLQGWLLSIDVTQR 136 G +L G R AI+ + + ++PK +L E L L + + Sbjct: 62 LYSGDLLISLTGNVGRVAILDKKYLPAFLNQRVACIRPKSDKILKEYLFYMLNTNLFEVK 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +G + + + N +P+P + Q + + + E +L Sbjct: 122 SINSSKGIAQKNISTEWLKNYVVPLPSIEIQQHLISILKKLEKAVRNKKHELRALDKL 179 >gi|170718765|ref|YP_001783949.1| restriction modification enzyme [Haemophilus somnus 2336] gi|168826894|gb|ACA32265.1| restriction modification enzyme [Haemophilus somnus 2336] Length = 161 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 51/138 (36%), Gaps = 10/138 (7%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHGIDSTYLAW 335 +Y +I+ I + A + I + + +++ +L Sbjct: 23 KSSYTYFQENDIIIAKITPCMENGKCALATELSNHIGMGSSEFHVIRSQSPTLNNAFLFH 82 Query: 336 LMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + ++ + + G+ + + + LPV VP I++Q +I I A+ + + Sbjct: 83 FLNRNEIRQSAEQHMTGASGHRRVPIGFYESLPVPVPSIEKQTEILAQI----AQYEAQI 138 Query: 394 EKIEQSIVLLKERRSSFI 411 EQ I L ++ + + Sbjct: 139 ATCEQKIQSLPAQKQAIL 156 Score = 36.3 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 62/159 (38%), Gaps = 13/159 (8%) Query: 60 TGKYLPKDGNSRQSD--TSTVSIFAKGQILYGKLGPYLRKAII------ADFDGICSTQF 111 Y+ + N + S+ + F + I+ K+ P + ++ G+ S++F Sbjct: 6 NDGYIQQKINRPLGELRKSSYTYFQENDIIIAKITPCMENGKCALATELSNHIGMGSSEF 65 Query: 112 LVLQPKDV--LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 V++ + L +L ++ Q E GA+ + +P + Sbjct: 66 HVIRSQSPTLNNAFLFHFLNRNEIRQSAEQHMTGASGH---RRVPIGFYESLPVPVPSIE 122 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 + +I+A+ + + I + I+ L +KQA++ + Sbjct: 123 KQTEILAQIAQYEAQIATCEQKIQSLPAQKQAILVQYLQ 161 >gi|148827355|ref|YP_001292108.1| putative type I restriction-modification system, specificity determinant; restriction endonuclease [Haemophilus influenzae PittGG] gi|148718597|gb|ABQ99724.1| putative type I restriction-modification system, specificity determinant; restriction endonuclease [Haemophilus influenzae PittGG] Length = 390 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 48/392 (12%), Positives = 110/392 (28%), Gaps = 42/392 (10%) Query: 26 KVVPIKRFTK-------LNTGRTSESGKDIIYIGLE----DVESGTGKYLPKDGNSRQSD 74 + P+ T + + + K Y+ E V+ G K L + + + Sbjct: 18 EWKPLWSITTWDKRFNAVEKEKQPKVIKYHYYLASELKPLIVDGGNVKLLTTNESDIWTT 77 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 V + + + + + + + + Sbjct: 78 EELVQNNISEGEIIAIPWGGNPIVQYYKGKFVTADNRIATSNNTKILDNKFLYYFLLSKL 137 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 I + G+ + H + + +PIPPL+ Q I + + A T L +E I + Sbjct: 138 DVISSFYRGSGIKHPSMYHVLEMLIPIPPLSVQTEIVKILDALTALTSELTSELILRQKQ 197 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + ++ L+S E +G + + K +V K Sbjct: 198 YEYYREKLLSE-----------------EELGKI--GVQWKALGEIVPISRGKR------ 232 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 L S + L+P Y ++ + +E Sbjct: 233 --LIRSQLKDNDQYPVYQNSLQPLGYYDHKNCRAYMTFVIAAGAAGEIG----FSNVEFW 286 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 Y + I + + + + L ++++ + + KE Sbjct: 287 SADDCYYFDCANKILHDKFLYYFLLSNKHLLTNQVRKASVPRLSRVSIEKIKIPIVSFKE 346 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKER 406 Q I +++ + E + +I ++R Sbjct: 347 QERIVAILDKFETLTHSMTEGLPLAIEQSQKR 378 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 68/207 (32%), Gaps = 4/207 (1%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 + S +EW L K F A+ E K K L I+ + + Sbjct: 12 LDGSEVEWKPLWSITTWDKRFNAVEKEKQPKVIKYHYYLASELK-PLIVDGGNVKLLTTN 70 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLA 334 T + + I I + + + +A + D+ +L Sbjct: 71 ESDIWTTEELVQNNISEGEIIAIPWGGNPIVQYYKGKFVTADNRIATSNNTKILDNKFLY 130 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + S + GSG+ + V + + +PP+ Q +I +++ TA L Sbjct: 131 YFLLSKLDVISSFYRGSGI-KHPSMYHVLEMLIPIPPLSVQTEIVKILDALTALTSELTS 189 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++ + R ++ G+I + Sbjct: 190 ELILRQKQYEYYREKLLSEEELGKIGV 216 >gi|304373164|ref|YP_003856373.1| Type I site-specific DNA methyltransferase specificity subunit [Mycoplasma hyorhinis HUB-1] gi|304309355|gb|ADM21835.1| Type I site-specific DNA methyltransferase specificity subunit [Mycoplasma hyorhinis HUB-1] Length = 417 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 51/357 (14%), Positives = 116/357 (32%), Gaps = 26/357 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV--SIFA 82 W+ ++ ++ GRT + +E GK+ + + + Sbjct: 22 WQQCKVRELFEIKRGRTILKKE---------IEENRGKFPVYSSQTENNGELGKINTFDF 72 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G+ L Y + + +L+ + + +++ I Sbjct: 73 DGEYLSWTTDGYAGVIFYRNGKFSLTIHCGLLEKRKSNINYYFAYNSISLISKNYVNIAC 132 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + + I EQ EKI + +D +I+ R I LL++ ++AL Sbjct: 133 --AIPNLGSDVMSGVEFMICSYKEQ----EKISSIFFTLDKIISLYERKISLLEKIEKAL 186 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR-KNTKLIESNILSLSY 261 + + K ++ G + ++ + + T I L+ Sbjct: 187 LDNMFIKENEEKPSIRFLGFNSDWQSWTLEDKGYLYSGLNSKTKVDFTNGNSKYITYLNV 246 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA---QVMERGIITS 318 N + +S E + G+I+F + + SA +V E+ + S Sbjct: 247 FNNFNIDLKEKSLVFIKSDEKQNSIVKGDILFTMSSETYQEVGMSSAVTEEVNEKIYLNS 306 Query: 319 AYMAVKPHGID---STYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVP 370 + + D + A+L R++ + + + G R +L + L + P Sbjct: 307 FCFGYRLNKADFLFPNFSAFLFRNHSVRHKIILQSNGGTSRFNLSKKSFLNLKIKSP 363 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 17/142 (11%), Positives = 42/142 (29%), Gaps = 5/142 (3%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + ++ ++ F L G + S Sbjct: 50 KFPVYSSQTENNGELGKINTFDFDGEYLSWTTDGYAGVIFYRNGKFSLTIHCGLLEKRKS 109 Query: 331 TYLAWL-MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + S L Y + +L + + + ++ KEQ I+++ + Sbjct: 110 NINYYFAYNSISLISKNYVNIACAIPNLGSDVMSGVEFMICSYKEQEKISSI----FFTL 165 Query: 390 DVLVEKIEQSIVLLKERRSSFI 411 D ++ E+ I LL++ + + Sbjct: 166 DKIISLYERKISLLEKIEKALL 187 >gi|282878313|ref|ZP_06287106.1| type I restriction modification DNA specificity domain protein [Prevotella buccalis ATCC 35310] gi|281299568|gb|EFA91944.1| type I restriction modification DNA specificity domain protein [Prevotella buccalis ATCC 35310] Length = 183 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 55/165 (33%), Gaps = 2/165 (1%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET-RNMGLKPESYETYQI 285 +P+ W ++ + + K I + N ++ +++ + + Sbjct: 18 DIPETWSWSRGKSIFLPMESEKPKNDFVYIDVDAVNNKKYIIDNPKHITTENAPSRASRK 77 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + +++F + +L + + T Y+ GI YL W+M S + Sbjct: 78 LHENDVLFSMVRPYLKNIALVTNEYKNAIASTGFYVITPCIGIYPQYLYWMMLSSYIVDG 137 Query: 346 FYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 G S+ ++ +PP EQ + I ++ Sbjct: 138 LNMFMKGDNSPSINNCHIEEYLYPIPPESEQQRVVAQIETLFEQL 182 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 63/168 (37%), Gaps = 7/168 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESG-TGKYLPKDGNSRQSDTSTV 78 IP+ W K + + D +YI ++ V + PK + + + Sbjct: 18 DIPETWSWSRGKSIF--LPMESEKPKNDFVYIDVDAVNNKKYIIDNPKHITTENAPSRAS 75 Query: 79 SIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVL-PELLQGWLLSIDVT 134 + +L+ + PYL+ + + I ST F V+ P + P+ L +LS + Sbjct: 76 RKLHENDVLFSMVRPYLKNIALVTNEYKNAIASTGFYVITPCIGIYPQYLYWMMLSSYIV 135 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + +G + I PIPP +EQ + +I ++ Sbjct: 136 DGLNMFMKGDNSPSINNCHIEEYLYPIPPESEQQRVVAQIETLFEQLH 183 >gi|332076340|gb|EGI86805.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17545] Length = 266 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 49/178 (27%), Gaps = 2/178 (1%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + + G + +D G E + K N I G Sbjct: 2 KKVKLGQVATFINGYAFKP-QDWSSEGKEIIRIQNLTKTSKGINYYSGTIDKKYIVEAGD 60 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL G L + + + + + + Q G+T Sbjct: 61 ILISWSG-TLGVFQWCGRSAVLNQHIFKVVFDKIDIDKSYFKYVVEKGLQDAVKHTHGST 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 M H K NI + L EQ I ++ + I + L+K + + Sbjct: 120 MKHLTKKYFDNIMVSYTNLREQQRIASEMDLLSKLILRRQEQLEELNLLVKSRFNEMF 177 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 43/142 (30%), Gaps = 10/142 (7%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 ++ + + + IV+ G+I+ + ++ V I Sbjct: 39 TSKGINYYSGTIDKKYIVEAGDILISWSGTLG-----VFQWCGRSAVLNQHIFKVVFDKI 93 Query: 329 DSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 D + + L + L + + V ++EQ I + ++ Sbjct: 94 DIDKSYFKYVVEKGLQDAVKHTHGSTMKHLTKKYFDNIMVSYTNLREQQRIASEMD---- 149 Query: 388 RIDVLVEKIEQSIVLLKERRSS 409 + L+ + ++ + L S Sbjct: 150 LLSKLILRRQEQLEELNLLVKS 171 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 17/89 (19%), Positives = 30/89 (33%), Gaps = 12/89 (13%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 185 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 232 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQF 111 K ++ G+ G + ++ + T F Sbjct: 233 KNSVIIGRKGNINKPILVRENFWNVDTAF 261 >gi|293402634|ref|ZP_06646737.1| putative type I restriction-modification enzyme, S subunit [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291303926|gb|EFE45212.1| putative type I restriction-modification enzyme, S subunit [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 206 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 54/186 (29%), Gaps = 7/186 (3%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + ++ N + + E G + S Y Sbjct: 23 KLSDIAEITMGQSPSGSSYNEDGIGTIFFQGRAEF---GFRFPSIRLYTTEPKRMACKND 79 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + + I A+ +++ + M S + + Sbjct: 80 TLMSVRAPVGDFNVAHKDCCIGRGLAAIHSKTNHQSFVHYTMFSLKKQLGVFNGEGTVFG 139 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 S+ + +P+L+P ++ + A +D ++ I L++ R + + Sbjct: 140 SINRNSLNEMPILIPSDEK----LDEFEGIVAPMDAVIRNNYDEICRLEQIRDLLLPKLM 195 Query: 416 TGQIDL 421 +G++D+ Sbjct: 196 SGELDV 201 Score = 40.9 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 45/176 (25%), Gaps = 2/176 (1%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + ++ G++ G ++ + + R T + K L Sbjct: 23 KLSDIAEITMGQSPSGSSYNEDGIGTIFFQGRAEFGFRFPSIRLYTTEPKRMACKNDTLM 82 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 P +A D + K + + + Q EG Sbjct: 83 SVRAPV-GDFNVAHKDCCIGRGLAAIHSKT-NHQSFVHYTMFSLKKQLGVFNGEGTVFGS 140 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + + +P+ IP + + I E R ++ L+S Sbjct: 141 INRNSLNEMPILIPSDEKLDEFEGIVAPMDAVIRNNYDEICRLEQIRDLLLPKLMS 196 >gi|146281041|ref|YP_001171194.1| type I restriction-modification system, S subunit, truncation [Pseudomonas stutzeri A1501] gi|145569246|gb|ABP78352.1| type I restriction-modification system, S subunit, truncation [Pseudomonas stutzeri A1501] Length = 157 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 18/95 (18%), Positives = 35/95 (36%), Gaps = 3/95 (3%) Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + + + + +L + + S S+ D+ P Sbjct: 55 YIHGRFWTVDTMFYTEVSSDASAKFLYYNALTIPFQYY---STSTALPSMTQGDLLNHPC 111 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 +P +EQ I ++ ETARID L+E+ ++ I Sbjct: 112 AIPRREEQAQIARFLDHETARIDGLIEEQQRLIER 146 >gi|324993828|gb|EGC25747.1| hypothetical protein HMPREF9390_0214 [Streptococcus sanguinis SK405] Length = 190 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 56/171 (32%), Gaps = 4/171 (2%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-- 282 +GL + ++ ++ L +I LK Sbjct: 10 LGLRYKNLSDFSIGKGTYGISASAVGKDDNLPTYLRITDINDDGTINFASLKSVDRSDAD 69 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYD 341 + P +IVF + + E ++ P ++ + +S + Sbjct: 70 KYRLQPNDIVFARTGGSTGRSYFYDGKDGEFVFAGFLIKFSIDPQKCIPKFIKYYCQSRE 129 Query: 342 LCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + +G R ++ + +++P+ P+++Q I ++++ +I+ Sbjct: 130 YYNWVASFNTGSTRGNINAKTFEKMPIPDLPLEQQQLIVDILSPIDDKIEN 180 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 60/177 (33%), Gaps = 15/177 (8%) Query: 29 PIKRFTKLNTGR---------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K + + G+ + Y+ + D+ + +SD Sbjct: 13 RYKNLSDFSIGKGTYGISASAVGKDDNLPTYLRITDINDDGTINFASLKSVDRSDADKYR 72 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVL--QPKDVLPELLQGWLLSIDVT 134 + I++ + G ++ D + + + P+ +P+ ++ + S + Sbjct: 73 L-QPNDIVFARTGGSTGRSYFYDGKDGEFVFAGFLIKFSIDPQKCIPKFIKYYCQSREYY 131 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + G+T + + K +P+P PL +Q LI + + +I+ Sbjct: 132 NWVASFNTGSTRGNINAKTFEKMPIPDLPLEQQQLIVDILSPIDDKIENNKKINHHL 188 >gi|153951382|ref|YP_001398801.1| type I restriction modification DNA specificity domain-containing protein [Campylobacter jejuni subsp. doylei 269.97] gi|152938828|gb|ABS43569.1| putative type I restriction modification DNA specificity domain [Campylobacter jejuni subsp. doylei 269.97] Length = 194 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 60/190 (31%), Gaps = 11/190 (5%) Query: 30 IKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + ++ G T G I + ++D+ + + +F Sbjct: 2 LGEIFEIKNGYTPSKANKEFWEGGTIPWFRMDDIRTNGRILSDSLQHITPKALKGGKLFP 61 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP---ELLQGWLLSIDVTQRIEA 139 K I+ A+I D + + +F L K ++ + + Q + Sbjct: 62 KNSIIISTTATIGEHALII-VDSLANQRFTFLSKKVNCDIAIDMKFIYYYCFILGQWCKQ 120 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + D K +PIPPL Q I + L + IE K++ Sbjct: 121 NTNVSGFASVDMKAFKQFQIPIPPLEVQEKIVRILDQFHALTTDLTSGIPAEIEARKKQY 180 Query: 200 QALVSYIVTK 209 + + ++T Sbjct: 181 EYYRNQLLTF 190 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 18/173 (10%), Positives = 47/173 (27%), Gaps = 8/173 (4%) Query: 247 KNTKLIESNILSLSYGNIIQKL---ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 I +I + P++ + ++ I+ + Sbjct: 18 NKEFWEGGTIPWFRMDDIRTNGRILSDSLQHITPKALKGGKLFPKNSIIISTTATIGEHA 77 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + + + ID ++ + S+ + K Sbjct: 78 LIIVDSLANQRFTFLSKKVNCDIAIDMKFIYYYC-FILGQWCKQNTNVSGFASVDMKAFK 136 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + + +PP++ Q I +++ A L I I K+ R+ + Sbjct: 137 QFQIPIPPLEVQEKIVRILDQFHALTTDLTSGIPAEIEARKKQYEYYRNQLLT 189 >gi|303262770|ref|ZP_07348708.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS292] gi|303265059|ref|ZP_07350973.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS397] gi|303267631|ref|ZP_07353469.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS457] gi|302636092|gb|EFL66589.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS292] gi|302642830|gb|EFL73139.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS457] gi|302645419|gb|EFL75652.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS397] Length = 337 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 32/345 (9%), Positives = 93/345 (26%), Gaps = 27/345 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + ++P + +++ + + L +K+ Sbjct: 282 MVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKK 325 >gi|332673407|gb|AEE70224.1| type I restriction modification DNA specificity family protein [Helicobacter pylori 83] Length = 201 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 46/124 (37%), Gaps = 5/124 (4%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSG 352 I + + ++ +V P + YL +++ + + S Sbjct: 65 NTITIAQYGTAGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSA 124 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + S+ ++ ++ + +PP++ Q +I +++ + L+ I I K+ R Sbjct: 125 IPYSISSNNIMQIKIPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIEARKKQYEYYRE 184 Query: 409 SFIA 412 ++ Sbjct: 185 KLLS 188 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 42/156 (26%), Gaps = 11/156 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + ++ G+ + + GKY G Sbjct: 13 PKGVEFRKLGEVCEIIRGKRVTKKEIL----------DKGKYPVVSGGIGFMGYLNEYNR 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + G + + + PK+ L ++L+ Sbjct: 63 EENTITIAQYGT-AGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISN 121 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 A I I +PIPPL Q I + + Sbjct: 122 RSAIPYSISSNNIMQIKIPIPPLEIQQEIVKILDQF 157 >gi|210134632|ref|YP_002301071.1| type I R-M system S protein [Helicobacter pylori P12] gi|210132600|gb|ACJ07591.1| type I R-M system S protein [Helicobacter pylori P12] Length = 393 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 54/387 (13%), Positives = 119/387 (30%), Gaps = 38/387 (9%) Query: 43 ESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 ++ K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 24 DNYKKVYYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSINSIIYSSVRPNQRHFGII 83 Query: 102 DF---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKG 153 + + ST F+V+ K + P L ++ +T ++ I C ++ Sbjct: 84 KEIPKNFLVSTAFIVIDVIDLKKLDPNYLYYYITQDKITHYLQRIAECGTSSYPSITPLD 143 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 NI + + PL Q I + +I+ ++L+ + N Sbjct: 144 FLNIKVKLYPLETQQKIARTLSILDQKIENNHKINELLHKILELLYEQYFVRFDFLDENN 203 Query: 214 DVKMKDSGI-----EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 G E L+P+ +EVK L S+ + Sbjct: 204 KPYQTSGGKMKFSKELNRLIPNDFEVKTLGELTQLKVGNKNANHSSDQGKYPFFTCSNN- 262 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + +E I+ G + + V P+ Sbjct: 263 ---PLKCETYQFEGKHIIISGN------------GNFYVTHYDGKFDAYQRTYVVNPNNP 307 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + L +L + + + + D++ + +++P +K NV+ Sbjct: 308 NHYVLIYLFVKSYTNYLKLQSRGSIIKFITKSDIENIKIVLPNLKTYTKWNNVL------ 361 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++E QS L R + + Sbjct: 362 --KIIENNNQSTQTLTAFRDFLLPLLL 386 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 69/178 (38%), Gaps = 13/178 (7%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + I I++ + Sbjct: 18 NNYTKEDNYKKVYYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSI---NSIIYSSVR 74 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 + ++ + ++++A++ + +D YL + + + + G+ Sbjct: 75 PNQRHFGIIK-EIPKNFLVSTAFIVIDVIDLKKLDPNYLYYYITQDKITHYLQRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID---VLVEKIEQSIVLLKER 406 S+ D + V + P++ Q I +++ +I+ + E + + + LL E+ Sbjct: 134 SSYPSITPLDFLNIKVKLYPLETQQKIARTLSILDQKIENNHKINELLHKILELLYEQ 191 >gi|52549370|gb|AAU83219.1| putative restriction modification enzyme S subunit [uncultured archaeon GZfos27A8] Length = 117 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 29/73 (39%), Gaps = 2/73 (2%) Query: 329 DSTYLAWLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + Y+ +RS + G+ ++ + + P P +EQ I ++ Sbjct: 27 NPDYVLLYLRSPQFLTEGIKRMAGTAGQKRVPRDYFAGSPFPFPSFQEQHRIVTKVDQLM 86 Query: 387 ARIDVLVEKIEQS 399 A D L KIEQS Sbjct: 87 ALCDELEAKIEQS 99 >gi|207108191|ref|ZP_03242353.1| type I R-M system specificity subunit [Helicobacter pylori HPKX_438_CA4C1] Length = 151 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 43/126 (34%), Gaps = 10/126 (7%) Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + +R A ++ + + + L ++S+ + Sbjct: 32 NPFGFAPYIRKAYEHKKEFSNHHQI--ESFFSSNHILTMFLQSHIQTNRNESNT----PY 85 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + +K +L+PP+ EQ I N+++ I L K Q + + + ++ Sbjct: 86 IVMATLKDFEILLPPLNEQIAIANILSGLDHEIISLKNKKRQ----FENIKKALNHDLMS 141 Query: 417 GQIDLR 422 +I + Sbjct: 142 AKIRVT 147 >gi|254367478|ref|ZP_04983504.1| type I restriction-modification system, subunit R [Francisella tularensis subsp. holarctica 257] gi|134253294|gb|EBA52388.1| type I restriction-modification system, subunit R [Francisella tularensis subsp. holarctica 257] Length = 225 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 9/61 (14%), Positives = 22/61 (36%) Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + + + + + + +PP+ EQ I + +D +E +Q+I Sbjct: 1 MNNLHGVGMKHITKGKFENIQIPLPPLAEQKCIVAKLYSLFENVDKAIELHQQNITNANT 60 Query: 406 R 406 Sbjct: 61 L 61 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 70/240 (29%), Gaps = 17/240 (7%) Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G M H NI +P+PPLAEQ I K+ + +D I + I Sbjct: 1 MNNLHGVGMKHITKGKFENIQIPLPPLAEQKCIVAKLYSLFENVDKAIELHQQNITNANT 60 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + K K+ + + + + + + + Sbjct: 61 LMASTLDKTFKKLEGEYSKIALLDVMKI-----------SNKTLVPDDNQKYNYVGLENI 109 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + G +I ET+ +K E G +++ + +K + I Sbjct: 110 EGNTGRLIDFCETQGKEIKSSKVE----FKKGIVLYGKLRPYLNKVWFSEFDDVATTEIL 165 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK--RLPVLVPPIKEQ 375 Y + + S L +V L +K + +PP+ Q Sbjct: 166 PFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNYSGSRIPRLTTAFLKSEEAYIPLPPLPIQ 225 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 4/125 (3%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + K++ + + Y+GLE++E TG+ + + S+ F KG +LY Sbjct: 82 LLDVMKISNKTLVPDDNQKYNYVGLENIEGNTGRLIDFCETQGKEIKSSKVEFKKGIVLY 141 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEAICEGAT 145 GKL PYL K ++FD + +T+ L P D ++ + LS QR+ G+ Sbjct: 142 GKLRPYLNKVWFSEFDDVATTEILPFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNYSGSR 201 Query: 146 MSHAD 150 + Sbjct: 202 IPRLT 206 >gi|315638642|ref|ZP_07893816.1| type I restriction/modification enzyme [Campylobacter upsaliensis JV21] gi|315481266|gb|EFU71896.1| type I restriction/modification enzyme [Campylobacter upsaliensis JV21] Length = 1191 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 52/162 (32%), Gaps = 7/162 (4%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E Y + + IV +I+ K +L + Sbjct: 1024 EHIDNKSGYVKMQTPKYVPMEFYEDFKKADKGIVRKNDILLCKDGALTGKVALVRDEFEN 1083 Query: 313 R--GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 + I ++ + +L +++ S + + +G + L ++K + + Sbjct: 1084 QSVMINEHIFLLRCQNSTTQKFLFFILHSQSGQSILKSKVTGSAQGGLSLSNLKDMKIPK 1143 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 P IK Q I + E +++ I SI +E + + Sbjct: 1144 PDIKIQKQIVS----ECEKVEEQYNTIRMSIEKYQELIRAIL 1181 >gi|148927793|ref|ZP_01811221.1| restriction modification system DNA specificity domain [candidate division TM7 genomosp. GTL1] gi|147886858|gb|EDK72400.1| restriction modification system DNA specificity domain [candidate division TM7 genomosp. GTL1] Length = 413 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 37/120 (30%), Gaps = 5/120 (4%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + + Y + ++ D ++ V + + + + ++ Y+ + + Sbjct: 48 DEIDNYLLDGEFVLLGEDGAPFLDPYKSKAYLVQGKIWVNNHAHILL--ARNNKYVKYAL 105 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 D R L +KR+ + P EQ I I + ID I Sbjct: 106 NYVDYQSYV---TGTTRLKLNQSALKRIIIPFPDENEQKRIVAKIEELFSEIDNAESAIT 162 >gi|71900227|ref|ZP_00682365.1| similar to Restriction endonuclease S subunits [Xylella fastidiosa Ann-1] gi|71730000|gb|EAO32093.1| similar to Restriction endonuclease S subunits [Xylella fastidiosa Ann-1] Length = 320 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 41/329 (12%), Positives = 98/329 (29%), Gaps = 27/329 (8%) Query: 101 ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMP 160 + + +V + L +++ Q + I + +P Sbjct: 5 HGKFFVTDNAVICDSKVEVDIDWAFHLLSVMNLNQYAMKS----AQPVLAVRTIEQVKVP 60 Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ-ALVSYIVTKGLN--PDVKM 217 +PPL Q I + + T L R ++ + + G + + Sbjct: 61 LPPLEVQRQIAKVLDTFTTLEAELEARRRQYQYYRDALLRFGGSTDASGNGEDGAERNQW 120 Query: 218 KDSGIEWVGL-----VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 K +GI W+ P+ E K L+ + + + + ++ +T Sbjct: 121 KPTGINWIDELIAALCPEGVEFKMLGELLDYEQPGKYLVASTAYDNSYWTPVLTAGQTFI 180 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 +G E+ Y ++ + + + + ++ M G + Sbjct: 181 LGYTDETSGIYAASPQEPVII----FDDFTTAFKWVDFPFKAKSSAMKMLTLKAGALDSL 236 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + Y RQ + + + VPP++ Q I V++ ++ + Sbjct: 237 RYVFF---AMQMIAYTPQDHARQWI--GTYSKFLIPVPPLEVQARIVAVLDQFDTLVNDI 291 Query: 393 VEKIEQSIVLLKE----RRSSFIA--AAV 415 + I ++ R + AV Sbjct: 292 TAGLPAEIAARRQQYAYYRDRLLTFKEAV 320 >gi|256854685|ref|ZP_05560049.1| predicted protein [Enterococcus faecalis T8] gi|256710245|gb|EEU25289.1| predicted protein [Enterococcus faecalis T8] Length = 187 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 54/165 (32%), Gaps = 8/165 (4%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 A E + KN L ++I S I KL + N+ + + I+ G+ Sbjct: 30 DHFEYGLNASAIEYDGKNKYLRITDIDDSSRKFIQNKLTSPNINV---EEASNYILTVGD 86 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I+F K + + A DS ++ W + M Sbjct: 87 ILFVRTGASVGKTYRYDIKDGKVYFAGFLIRARIKDSFDSEFVYWTTLTDRYNTFIKIMS 146 Query: 351 S-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + ++ +L+P IKEQ I + +ID + Sbjct: 147 QRSGQPGINAKEYSSFNILIPNIKEQQKIGAFL----KKIDDTIA 187 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 51/169 (30%), Gaps = 10/169 (5%) Query: 23 KHWKVVPIKRFTKLN----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS--RQSDTS 76 + W++ + E Y+ + D++ + K++ S + + Sbjct: 18 EDWELCKLGDVADHFEYGLNASAIEYDGKNKYLRITDIDDSSRKFIQNKLTSPNINVEEA 77 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFD----GICSTQFLVLQPKDVLPELLQGWLLSID 132 + I G IL+ + G + K D E + L+ Sbjct: 78 SNYILTVGDILFVRTGASVGKTYRYDIKDGKVYFAGFLIRARIKDSFDSEFVYWTTLTDR 137 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 I+ + + + + K + + IP + EQ I + I Sbjct: 138 YNTFIKIMSQRSGQPGINAKEYSSFNILIPNIKEQQKIGAFLKKIDDTI 186 >gi|224437133|ref|ZP_03658114.1| putative Type I restriction enzyme EcoR124II specificity protein [Helicobacter cinaedi CCUG 18818] Length = 270 Score = 61.0 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 53/179 (29%), Gaps = 13/179 (7%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY------- 280 P W+ + + E + T N + Sbjct: 78 PPQGWDTIKLGQVCEIIRGITYDKTEQTTEKTQNIVLTADNITLNNTFELSKMIYLKQDF 137 Query: 281 --ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + +I+ +I F + + +M + ++ ++ + + Sbjct: 138 IGDKNKILRKNDIFMCFSSGSLKHIGKVAFIDKDTEYYAGGFMGILRSRFNAKFVFYTIA 197 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + D + +G + + L + +PP++ Q I +V+ +I+ + +E Sbjct: 198 NDDFKQKLENSATGSNINNLSGKINDLKIPLPPLEAQEKIISVVE----KIESTISLLE 252 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 59/171 (34%), Gaps = 12/171 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVE-SGTGKYLPKDGNSRQSD 74 P+ W + + + ++ G T + + I + +++ + T + + Sbjct: 79 PQGWDTIKLGQVCEIIRGITYDKTEQTTEKTQNIVLTADNITLNNTFELSKMIYLKQDFI 138 Query: 75 TSTVSIFAKGQILYG----KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 I K I L + A I + F+ + + + + + Sbjct: 139 GDKNKILRKNDIFMCFSSGSLKHIGKVAFIDKDTEYYAGGFMGILRSRFNAKFVFYTIAN 198 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 D Q++E G+ +++ I ++ +P+PPL Q I + I Sbjct: 199 DDFKQKLENSATGSNINNLS-GKINDLKIPLPPLEAQEKIISVVEKIESTI 248 >gi|317131477|ref|YP_004090791.1| restriction modification system, type I [Ethanoligenens harbinense YUAN-3] gi|315469456|gb|ADU26060.1| restriction modification system, type I [Ethanoligenens harbinense YUAN-3] Length = 193 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 65/192 (33%), Gaps = 19/192 (9%) Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE-----TYQIVDPGEIVFRFI 296 N K ++S + +S GN ++ L + + E + +V G+I+F Sbjct: 4 FGSNIKVETFVDSGVPIIS-GNHLRGLYLDELEYNFITEEHARRLSNSLVRAGDIIFTHA 62 Query: 297 DLQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GL 353 + +I+ Y+ Y+ + S A S Sbjct: 63 GNIGQVALIPDNCDYPYYVISQRQFYLRCDKKKALPEYINYFFHSRVGQGKLLANASQTG 122 Query: 354 RQSL--KFEDVKRLPVLVPPIKEQFDITNVIN--VETARIDVLVEKIEQSIVLLKERRSS 409 S+ +K + V++PPI+ Q ++ + ++ + L R+ Sbjct: 123 VPSIARPSSHLKGISVVLPPIEVQ------LDWFETVRPMLQILNGNNKENKRLVSLRNM 176 Query: 410 FIAAAVTGQIDL 421 + ++G++ + Sbjct: 177 LLPRLMSGELSV 188 >gi|227892232|ref|ZP_04010037.1| possible restriction modification system DNA specificity protein [Lactobacillus salivarius ATCC 11741] gi|227865954|gb|EEJ73375.1| possible restriction modification system DNA specificity protein [Lactobacillus salivarius ATCC 11741] Length = 380 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 50/385 (12%), Positives = 115/385 (29%), Gaps = 33/385 (8%) Query: 38 TGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIFAKGQILYGKLGPYLR 96 + I ++ + + + D S +G IL K G + Sbjct: 22 KKKEYLQEGSYRIINGSNIVDNKIDWSNCGYISKERYDESEEIKLKEGDILITKDGTIGK 81 Query: 97 KAIIAD---FDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADW 151 A++ + S F++ K L +L S + I + G+ + H Sbjct: 82 VAMVNKLDKPSTVASGLFILRNINLKKWDTLYLFYYLQSFKFKEFIYSRTSGSVIPHLYQ 141 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 + + +P L +Q I +KI + +ID +EL E + +++ + L Sbjct: 142 RDFEELMIPELSLKQQKQISQKIHSIQQKIDLNNKINTNLLELGLELI-SNINFENYQSL 200 Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 N V++KD + ++ ++ Sbjct: 201 NKIVEVKD-------------------GTHSSPASTLNGYPLVTSKAIKGTSVDFSQTKN 241 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 +V+ +I+ I L + ++ I + + S Sbjct: 242 ISEADFTEINKRSLVEYHDILISMIGTVGIVH-LVTENPVKYAIKNVGLIKSSDKKLLSP 300 Query: 332 YLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 +L + SY +Q + +++++P+ + DI + + I Sbjct: 301 FLYLYLLSYYGQTYIRKHLSGSTQQFISLTNLRKMPIPISS-----DIPAKLIEKLNTIV 355 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAV 415 + +E L ++ + Sbjct: 356 LQIEHNSNENNTLNSIKNVLLEKYF 380 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 60/170 (35%), Gaps = 6/170 (3%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNII-QKLETRNMGLKPE---SYETYQIVDPGEIVFR 294 + + +K L E + ++ NI+ K++ N G + + G+I+ Sbjct: 15 RIGWKGLKKKEYLQEGSYRIINGSNIVDNKIDWSNCGYISKERYDESEEIKLKEGDILIT 74 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYDLCKVFYAMGSG- 352 + + D+ YL + ++S+ + Y+ SG Sbjct: 75 KDGTIGKVAMVNKLDKPSTVASGLFILRNINLKKWDTLYLFYYLQSFKFKEFIYSRTSGS 134 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + L D + L + +K+Q I+ I+ +ID+ + + L Sbjct: 135 VIPHLYQRDFEELMIPELSLKQQKQISQKIHSIQQKIDLNNKINTNLLEL 184 >gi|167010574|ref|ZP_02275505.1| restriction modification system DNA specificity subunit [Francisella tularensis subsp. holarctica FSC200] Length = 222 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 9/54 (16%), Positives = 21/54 (38%) Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + + + + + +PP+ EQ I + +D +E +Q+I Sbjct: 5 GMKHITKGKFENIQIPLPPLAEQKCIVAKLYSLFENVDKAIELHQQNITNANTL 58 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 70/237 (29%), Gaps = 17/237 (7%) Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G M H NI +P+PPLAEQ I K+ + +D I + I Sbjct: 1 MHGVGMKHITKGKFENIQIPLPPLAEQKCIVAKLYSLFENVDKAIELHQQNITNANTLMA 60 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + + K K+ + + + + + + + + Sbjct: 61 STLDKTFKKLEGEYSKIALLDVMKI-----------SNKTLVPDDNQKYNYVGLENIEGN 109 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G +I ET+ +K E G +++ + +K + I Y Sbjct: 110 TGRLIDFCETQGKEIKSSKVE----FKKGIVLYGKLRPYLNKVWFSEFDDVATTEILPFY 165 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK--RLPVLVPPIKEQ 375 + + S L +V L +K + +PP+ Q Sbjct: 166 PIDNTRLNMIFVKYYFLSSSYLQRVMRNYSGSRIPRLTTAFLKSEEAYIPLPPLPIQ 222 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 4/125 (3%) Query: 30 IKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + K++ + + Y+GLE++E TG+ + + S+ F KG +LY Sbjct: 79 LLDVMKISNKTLVPDDNQKYNYVGLENIEGNTGRLIDFCETQGKEIKSSKVEFKKGIVLY 138 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEAICEGAT 145 GKL PYL K ++FD + +T+ L P D ++ + LS QR+ G+ Sbjct: 139 GKLRPYLNKVWFSEFDDVATTEILPFYPIDNTRLNMIFVKYYFLSSSYLQRVMRNYSGSR 198 Query: 146 MSHAD 150 + Sbjct: 199 IPRLT 203 >gi|160914092|ref|ZP_02076318.1| hypothetical protein EUBDOL_00104 [Eubacterium dolichum DSM 3991] gi|158434014|gb|EDP12303.1| hypothetical protein EUBDOL_00104 [Eubacterium dolichum DSM 3991] Length = 169 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 55/162 (33%), Gaps = 6/162 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP +W+ V I + G+T KDI Y+ +++S Sbjct: 4 EIPDNWEWVHINDIAESYLGKTLNKTKDIGESVPYLCSINIQSDYIDMNTIKIAKFNEAE 63 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL--VLQPKDVLPELLQGWLLSIDV 133 + G +L + G R A+ + L V + + P Q L V Sbjct: 64 KQKYLLQDGDLLICEGGDAGRSAVWNKNKTMYYQNALHRVRFYEKLNPVFYQRVLSFYKV 123 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKII 175 ++ ++ +G T+ H ++ P +++ ++ Sbjct: 124 SKILDNYFKGVTIKHFVQNHYFHLFSLPPLRTHRIVANFRLN 165 >gi|218281984|ref|ZP_03488302.1| hypothetical protein EUBIFOR_00871 [Eubacterium biforme DSM 3989] gi|218217040|gb|EEC90578.1| hypothetical protein EUBIFOR_00871 [Eubacterium biforme DSM 3989] Length = 386 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 71/196 (36%), Gaps = 13/196 (6%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS-YGNIIQKLETRNMGLKPESY 280 + + + ++ + T + RKN + + L++S ++ ++ N + + Sbjct: 14 VPNLRFNNNPYKKYNLYEFATRVTRKNKDNVSNLPLTISAQYGLVDQVSFFNKTVASKDM 73 Query: 281 ETYQIVDPGEIVFRF-IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLM 337 Y ++ GE + +++ + G +++ Y+ K + +S YL Sbjct: 74 SGYYLLKNGEFAYNKSYSNDYPWGAIKRLDLYNMGCLSTLYICFKSNDNIVNSNYLVHYF 133 Query: 338 RSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 S K + G R ++ D VP I+ Q I +++ RI Sbjct: 134 ESPKWHKQVADIAGEGARNHGLLNIAVNDFFNTKHAVPTIENQIKIARFLDLIEERIQTQ 193 Query: 393 VEKIEQSIVLLKERRS 408 ++ I L ++ Sbjct: 194 IKI----IDTLSSQKK 205 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 41/397 (10%), Positives = 94/397 (23%), Gaps = 51/397 (12%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + F T + ++ ++ + + + + D S + G+ Y Sbjct: 29 LYEFATRVTRKNKDNVSNLP-LTISAQYGLVDQVSFFNKTVASKDMSGYYLLKNGEFAYN 87 Query: 90 KLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC--- 141 K G ST ++ + D + + Sbjct: 88 KSYSNDYPWGAIKRLDLYNMGCLSTLYICFKSNDNIVNSNYLVHYFESPKWHKQVADIAG 147 Query: 142 ---EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + N +P + Q+ I + RI T I K+ Sbjct: 148 EGARNHGLLNIAVNDFFNTKHAVPTIENQIKIARFLDLIEERIQTQIKIIDTLSSQKKQI 207 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + L I H + ++ S Sbjct: 208 RNLLFKDI------------------------HKNANCCIQDYVIYEQPQKYIVHSTDYL 243 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + L + + E I + G+ + +K +V + Sbjct: 244 SYGKDYTPVLTANQSFILGYTLEKDGIYEKGDCIIFDDFTNENKYVDFPFKVKSSAL--- 300 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 I T +++ + F S + +V + VP +KEQ Sbjct: 301 --------KILQTKEGLMLKFFYEYLQFLNFESTDHKRHYLSEVAVTDISVPNLKEQT-- 350 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + D + + + + ++ + Sbjct: 351 --FVCKIFTSFDNKLRNEKALLEKYRLQKQFLLNNLF 385 >gi|125973659|ref|YP_001037569.1| restriction modification system DNA specificity subunit [Clostridium thermocellum ATCC 27405] gi|125713884|gb|ABN52376.1| restriction modification system DNA specificity domain [Clostridium thermocellum ATCC 27405] Length = 473 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 52/389 (13%), Positives = 112/389 (28%), Gaps = 31/389 (7%) Query: 44 SGKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 + ++++ ++++ S G +L K G A++ Sbjct: 73 KEQGFPVYRVKNIIDTQILDDDIVYIDAKKQQQLKRSEVLPGDVLITKAGRIGSAAVVPS 132 Query: 103 FDG---ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPM 159 G I S LV K + L +L +T IGN+ + Sbjct: 133 KFGNGNITSHLVLVRLKKTINNYYLVAYLECKYGKVITGRESYKSTRPELTKNEIGNVII 192 Query: 160 PIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ---------ALVSYIVTKG 210 PIP Q I +K+ + + L E Q + S++ + Sbjct: 193 PIPSPEIQKYIGDKVRKAEELREEAKRLKKEAETFLYEMIQLKPLNDFDKDMFSFVNSNY 252 Query: 211 LN------PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 ++ K K +E + K I + N+ Sbjct: 253 IDSERLDSEYYKTKYITLEKLLKSKKVTSFKDIIIESKYGASVPADYTMVGIPFIRGNNL 312 Query: 265 IQK----LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + K + V+ G+I+ + Sbjct: 313 TDNEINIDDIVYLNKKLKDEVKDHHVNTGDILITRSGTVGISAVVDEKCDGFSFGSFMIK 372 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + + Y+A + S+ + +G +Q++ +++ R+ + + + Q I Sbjct: 373 LRIDMRIWNPYYIAAFLNSFWGKWQIERLQNGAVQQNINLQEIGRIIIPIISKENQDKI- 431 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRS 408 I + K QS L++E + Sbjct: 432 ------EELIKNYINKKRQSKQLIQEAKQ 454 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 61/184 (33%), Gaps = 16/184 (8%) Query: 26 KVVPIKRFT-KLNTGRTSESGK---DIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSI 80 KV K + G + + I +I ++ N + D Sbjct: 278 KVTSFKDIIIESKYGASVPADYTMVGIPFIRGNNLTDNEINIDDIVYLNKKLKDEVKDHH 337 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLV----LQPKDVLPELLQGWLLSIDVTQR 136 G IL + G A++ + S + + + P + +L S + Sbjct: 338 VNTGDILITRSGTVGISAVVDEKCDGFSFGSFMIKLRIDMRIWNPYYIAAFLNSFWGKWQ 397 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 IE + GA + + + IG I +PI Q I I ++ + +L++ Sbjct: 398 IERLQNGAVQQNINLQEIGRIIIPIISKENQ-------DKIEELIKNYINKKRQSKQLIQ 450 Query: 197 EKKQ 200 E KQ Sbjct: 451 EAKQ 454 >gi|325912539|ref|ZP_08174927.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 60-B] gi|325478160|gb|EGC81284.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 60-B] Length = 174 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 55/176 (31%), Gaps = 6/176 (3%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M + I +G + E T + + RN+ + Sbjct: 1 MTNWKICTIGDLGMVIGGATPSTKAAENYDGGTIAWITPKDLAGFSGRFISYGERNITKQ 60 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + +++ ++F A + + +V P+ + Sbjct: 61 GLKSCSAKLMPKHTVLFSSRAPIGY-----IAIANQELCTNQGFKSVVPNDDTDYKFLYY 115 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDV 391 + Y+ K+ + + ++ + V VP I+EQ I +V+++ +I+ Sbjct: 116 LLKYNKNKIENLGSGTTFKEVSGSTMRDIEVSVPTSIEEQRKIASVLSLLDDKIEK 171 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 63/172 (36%), Gaps = 13/172 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYL---PKDGNSRQS 73 +WK+ I + G T + G I +I +D+ +G+++ ++ + Sbjct: 3 NWKICTIGDLGMVIGGATPSTKAAENYDGGTIAWITPKDLAGFSGRFISYGERNITKQGL 62 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K +L+ P IA+ + + F + P D + L Sbjct: 63 KSCSAKLMPKHTVLFSSRAPI-GYIAIANQELCTNQGFKSVVPNDDTD-YKFLYYLLKYN 120 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTL 184 +IE + G T + +I + +P + EQ I + +I+ Sbjct: 121 KNKIENLGSGTTFKEVSGSTMRDIEVSVPTSIEEQRKIASVLSLLDDKIEKN 172 >gi|161528118|ref|YP_001581944.1| restriction modification system DNA specificity subunit [Nitrosopumilus maritimus SCM1] gi|160339419|gb|ABX12506.1| restriction modification system DNA specificity domain [Nitrosopumilus maritimus SCM1] Length = 730 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 33/248 (13%), Positives = 84/248 (33%), Gaps = 14/248 (5%) Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP 229 + + + I ++ + +E + Q + LN D+++ ++ + + Sbjct: 477 YAKNLGYDKQGILAKESDFSKILEDFNKFLQTNKGSKFDQNLNSDLRLDENYFQNTSDLG 536 Query: 230 DHWEVKPFFALVTELNR-KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQI 285 + + + KN+KL + L G I+ E E + Sbjct: 537 NQTNMCMLKDIADITIGVKNSKLKKDTKYLLVKGQQIKDFEVDLSNASEVGVEFSIEKYL 596 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLC 343 + G+I + I + + ++ + I S YLA + S Sbjct: 597 LQKGDIAITRSGTVGNVGLC---NKDANVIFSDNIIRIRINSDKIISQYLASFLYSELGQ 653 Query: 344 KVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + +G + + +++++ + + I +Q I N + +I ++ I Sbjct: 654 RQIRQCTTGSTIRGISLSNLEKIQIPLISISKQHKIANDL----KKILDAKSELNHLIKN 709 Query: 403 LKERRSSF 410 L+ ++S Sbjct: 710 LENSKTSL 717 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 55/154 (35%), Gaps = 5/154 (3%) Query: 30 IKRFTKLNTG-RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAKGQIL 87 +K + G + S+ KD Y+ ++ + + + + + S + KG I Sbjct: 544 LKDIADITIGVKNSKLKKDTKYLLVKGQQIKDFEVDLSNASEVGVEFSIEKYLLQKGDIA 603 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVL---QPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + G + + + ++ ++ + L +L S ++I G+ Sbjct: 604 ITRSGTVGNVGLCNKDANVIFSDNIIRIRINSDKIISQYLASFLYSELGQRQIRQCTTGS 663 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 T+ + I +P+ +++Q I + Sbjct: 664 TIRGISLSNLEKIQIPLISISKQHKIANDLKKIL 697 >gi|256960371|ref|ZP_05564542.1| type I restriction endonuclease S subunit [Enterococcus faecalis Merz96] gi|256950867|gb|EEU67499.1| type I restriction endonuclease S subunit [Enterococcus faecalis Merz96] Length = 207 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 71/200 (35%), Gaps = 12/200 (6%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K ++ G T+ ++ + S + ++ E + Sbjct: 11 KLRFADFEGEWEQCKLGNILTERNTQQSKSKEYPLVSFTVEDGVTPKTERYEREQLVRGD 70 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAW 335 +S + Y++ + +IV+ +L K + + + + Y+ + S+Y+ Sbjct: 71 KSSKKYKVTELNDIVYNPANL---KFGAIARNHYGKAVFSPIYITFIVNDKLACSSYVEV 127 Query: 336 LMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + D G RQS+ E++ + L+P KEQ I + ++D Sbjct: 128 FITRKDFISYSLKYQQGTVYERQSVSPENLLNMKFLLPNTKEQEFIGHF----FEKLDCN 183 Query: 393 VEKIEQSIVLLKERRSSFIA 412 ++ I LK + S++ Sbjct: 184 SNFHKKKITQLKNLKKSYLQ 203 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 54/191 (28%), Gaps = 11/191 (5%) Query: 24 HWKVVPIKRFTKLNT-GRTSESGKDIIYIGLED-VESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ + ++ ++ +ED V T +Y + + + Sbjct: 20 EWEQCKLGNILTERNTQQSKSKEYPLVSFTVEDGVTPKTERYEREQLVRGDKSSKKYKVT 79 Query: 82 AKGQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIE 138 I+Y + + S ++ D L ++ D Sbjct: 80 ELNDIVYNPANLKFGAIARNHYGKAVFSPIYITFIVNDKLACSSYVEVFITRKDFISYSL 139 Query: 139 AICEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 +G + + N+ +P EQ E I ++D + I LK Sbjct: 140 KYQQGTVYERQSVSPENLLNMKFLLPNTKEQ----EFIGHFFEKLDCNSNFHKKKITQLK 195 Query: 197 EKKQALVSYIV 207 K++ + + Sbjct: 196 NLKKSYLQNMF 206 >gi|294676868|ref|YP_003577483.1| type I restriction-modification system RcaSBIIIP subunit S [Rhodobacter capsulatus SB 1003] gi|294475688|gb|ADE85076.1| type I restriction-modification system RcaSBIIIP, S subunit [Rhodobacter capsulatus SB 1003] Length = 560 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 54/135 (40%), Gaps = 5/135 (3%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 ++ G++ ++ +S+ T G++ F + ++ + E + Sbjct: 397 NVRMGSLNREPREFISEKTFKSWMTRGFPKLGDLFFT---TEAPLANVCLNDIQEPFALA 453 Query: 318 SAYMAVKPHGIDSTYLAWL-MRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQ 375 + ++P+ ST+ L + + + +G + +K +K LP+ +PP+ EQ Sbjct: 454 QRVICLQPYAEISTHYLMLALCGDVMQSLIDGQATGMTAKGIKASKLKPLPISLPPLAEQ 513 Query: 376 FDITNVINVETARID 390 I ++ +D Sbjct: 514 HRIVAKVDALMRLLD 528 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/232 (11%), Positives = 64/232 (27%), Gaps = 47/232 (20%) Query: 229 PDHWEVKPFFALVT---ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 P W + +++ + + + + + Y + + + + Sbjct: 85 PRGWALTRLGSVIDLLSGQHLQPNEYSSNPAAGIPYITGPSDFAEVGLSISRYALVRKAV 144 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 G+++ K ++ + I+ M++ P +L + ++ L Sbjct: 145 ARGGQLLLTVKGSGVGKTTIC---DLPEVAISRQLMSLAPILWSIRFLE--IITHRLADT 199 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE----------- 394 L + EDV +PP+ EQ I + A +D + Sbjct: 200 LQEQARSLIPGISREDVADFAFPLPPLAEQHRIVAKVEELMALLDRIEAARAGREEGRNR 259 Query: 395 KIEQSIVLL----------------------------KERRSSFIAAAVTGQ 418 ++ L K R + + AV G+ Sbjct: 260 LTAATLARLTDPKADAPAAARFALDTLAPLTTRPDQIKTLRQTILNLAVRGK 311 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 66/195 (33%), Gaps = 7/195 (3%) Query: 20 AIPKHWKVVPIKRFTKLN--TGRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W V + G T + I ++V G+ P++ S ++ S Sbjct: 359 ELPKGWAVQSFENLFLFIDYRGNTPPKTDSGVPLITAKNVRMGSLNREPREFISEKTFKS 418 Query: 77 TVSI-FAK-GQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQGWLLSID- 132 ++ F K G + + P + + + + LQP + L D Sbjct: 419 WMTRGFPKLGDLFFTTEAPLANVCLNDIQEPFALAQRVICLQPYAEISTHYLMLALCGDV 478 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I+ G T + +P+ +PPLAEQ I K+ A +D L Sbjct: 479 MQSLIDGQATGMTAKGIKASKLKPLPISLPPLAEQHRIVAKVDALMRLLDALEAALSASA 538 Query: 193 ELLKEKKQALVSYIV 207 A + + Sbjct: 539 TTRARLLDATLRAAL 553 Score = 43.6 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 63/189 (33%), Gaps = 7/189 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRT--SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +P+ W + + L +G+ G+ + +G + + + Sbjct: 84 VPRGWALTRLGSVIDLLSGQHLQPNEYSSNPAAGIPYI-TGPSDFAEVGLSISRYALVRK 142 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGI-CSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ GQ+L G + K I D + S Q + L P L+ + Sbjct: 143 AVARGGQLLLTVKGSGVGKTTICDLPEVAISRQLMSLAPILWSIRFLEIITHRLA---DT 199 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + + + + + P+PPLAEQ I K+ +D + R E Sbjct: 200 LQEQARSLIPGISREDVADFAFPLPPLAEQHRIVAKVEELMALLDRIEAARAGREEGRNR 259 Query: 198 KKQALVSYI 206 A ++ + Sbjct: 260 LTAATLARL 268 >gi|289450762|ref|YP_003474821.1| type I restriction modification DNA specificity domain-containing protein [Clostridiales genomosp. BVAB3 str. UPII9-5] gi|289185309|gb|ADC91734.1| type I restriction modification DNA specificity domain protein [Clostridiales genomosp. BVAB3 str. UPII9-5] Length = 178 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 55/135 (40%), Gaps = 11/135 (8%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII---TSAYMAVKPHG-ID 329 + E + G+ + I + +++ G I ++ Y+ + D Sbjct: 45 SFELEKFSGGTKFRNGDTIMARITPCLENGKTAKVNILDDGEIGFGSTEYIVFRAKEGTD 104 Query: 330 STYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 YL +L+ S + + + +GS RQ ++ + V+ L + VPPI+EQ I ++ Sbjct: 105 KDYLYYLVCSPLVREPAIKSMVGSSGRQRVQTDVVQGLSIAVPPIEEQRQIGGILRALDD 164 Query: 388 RIDVLVEKIEQSIVL 402 +I+ + I Sbjct: 165 KIE-----LNNEINK 174 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 58/167 (34%), Gaps = 13/167 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W + + N + G I ++ ++ + + S + F G Sbjct: 5 WTIKTLSDIADFNPRESLSKGTLAKKIAMDKLQ----PFCRDVPSFELEKFSGGTKFRNG 60 Query: 85 QILYGKLGPYLR-------KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + ++ P L + G ST+++V + K+ + +L+ + + Sbjct: 61 DTIMARITPCLENGKTAKVNILDDGEIGFGSTEYIVFRAKEGTDKDYLYYLVCSPLVREP 120 Query: 138 --EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 +++ + + + + +PP+ EQ I + A +I+ Sbjct: 121 AIKSMVGSSGRQRVQTDVVQGLSIAVPPIEEQRQIGGILRALDDKIE 167 >gi|167949251|ref|ZP_02536325.1| anticodon nuclease [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 296 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 13/84 (15%), Positives = 29/84 (34%), Gaps = 5/84 (5%) Query: 333 LAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 A+ + G + + D+ +L+P +EQ I + + + ID Sbjct: 86 FAFQFSQKFIKDFVVNKSIGSDQPFISLRDLYAQDILIPKPEEQQIIADCL----SSIDA 141 Query: 392 LVEKIEQSIVLLKERRSSFIAAAV 415 L+ + + LK + + Sbjct: 142 LITAQSEKVNALKAHKKGLMQQLF 165 >gi|325678125|ref|ZP_08157757.1| type I restriction modification DNA specificity domain protein [Ruminococcus albus 8] gi|324110181|gb|EGC04365.1| type I restriction modification DNA specificity domain protein [Ruminococcus albus 8] Length = 248 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 18/144 (12%), Positives = 41/144 (28%), Gaps = 7/144 (4%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFI-DLQNDKRSLRSAQVMERGIITS--AYMAV 323 K Y + G+++ S R + + Sbjct: 42 KYNDERERYYTGEYPHEYLCKKGDLIVAMTEQAAGLLGSTAIVPKDNRYLHNQRIGLITC 101 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 I + +L + + + SG + E + + V +P I Q I + + Sbjct: 102 DEKHITKMFAYYLFMTKSVREQISRTSSGTKVKHTSPEKIYDVEVSLPDIPTQKKIAHFL 161 Query: 383 NVETARIDVLVE---KIEQSIVLL 403 +I ++ ++ + LL Sbjct: 162 WTIDCKIRNNIQINDNLQHQLKLL 185 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 52/191 (27%), Gaps = 13/191 (6%) Query: 29 PIKRFTKLNTGRTSES-----GKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVSIFA 82 + + G + + + + G KY + + + Sbjct: 3 KLGECLTIKHGWAFKGEFFAESGEQSILTPGNFYEAGGFKYNDERERYYTGEYPHEYLCK 62 Query: 83 KGQILYGKL----GPYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLLSIDVTQ 135 KG ++ G AI+ + Q + K + ++ V + Sbjct: 63 KGDLIVAMTEQAAGLLGSTAIVPKDNRYLHNQRIGLITCDEKHITKMFAYYLFMTKSVRE 122 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 +I G + H + I ++ + +P + Q I + +I I L Sbjct: 123 QISRTSSGTKVKHTSPEKIYDVEVSLPDIPTQKKIAHFLWTIDCKIRNNIQINDNLQHQL 182 Query: 196 KEKKQALVSYI 206 K + Sbjct: 183 KLLYDYWFTQF 193 >gi|227523731|ref|ZP_03953780.1| conserved hypothetical protein [Lactobacillus hilgardii ATCC 8290] gi|227089046|gb|EEI24358.1| conserved hypothetical protein [Lactobacillus hilgardii ATCC 8290] Length = 101 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 17/100 (17%), Positives = 33/100 (33%), Gaps = 4/100 (4%) Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 G A T L +L S + + L + V ++ VL P Sbjct: 2 NGRFWVNNHAHTFQSSQGTDLTFLAESLERIHYQRYNTGTAQPKLNAKVVGKIEVLCPTS 61 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 EQ + + I+VL+ ++ + L+ + + Sbjct: 62 NEQRK----LGKLSYLINVLIAANQRRLDQLQSLKKYLMQ 97 >gi|257886406|ref|ZP_05666059.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,501] gi|257822262|gb|EEV49392.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,501] Length = 187 Score = 61.0 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 41/118 (34%), Gaps = 1/118 (0%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 S ++ +I+ K L E + + S Y+ Sbjct: 67 ISNSKLVDLRLEENDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDC 126 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K+ + G + ++ ++ +L + +PP++EQ +T I + I + Sbjct: 127 FLDSPLYWKLLEKISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRRI 184 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 7/164 (4%) Query: 27 VVPIKRFT-KLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V + + K+ G T + K ++ ++ + D++ G + + + Sbjct: 20 WVYLGSISTKIQYGYTDSAKKQGNVKFLRITDIQEGRVNWSSVPYCDISNSKLVDLRLEE 79 Query: 84 GQILYGKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 IL + G + K+ I++ S + + +L E + +L S + +E Sbjct: 80 NDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDCFLDSPLYWKLLEK 139 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 I G + + + + +P+PPL EQ + KI I Sbjct: 140 ISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRR 183 >gi|223043667|ref|ZP_03613711.1| Sau1hsdS1 [Staphylococcus capitis SK14] gi|222442945|gb|EEE49046.1| Sau1hsdS1 [Staphylococcus capitis SK14] Length = 400 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 54/402 (13%), Positives = 108/402 (26%), Gaps = 40/402 (9%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 +P++K + W I + S++ Sbjct: 10 RFPEFK-----------EEWIKQNIGNYLVEYKKYGSQNETHYPVATSSRRGLYMQNEYF 58 Query: 66 KDGNSRQSDTSTVSIFAKGQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE 122 + SI Y + + + S ++ V + Sbjct: 59 EGDREFAKKDVLYSIVPVNYFTYRHMSDDNIFKFNINTFNIPILVSKEYPVFTINNYYSH 118 Query: 123 LLQGW--LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + + + +G T + +K + P EQ I + + Sbjct: 119 NFIFYELNNNNRFEKFCRMQKKGGTRTRLYFKVLKEYKAFFPNYQEQSKIGDFFSKFDYQ 178 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 I+ + + Q + S + + G WE+ + Sbjct: 179 IELEEKKLELLEQQKNGYMQKIFSQELRFK------------DENGNEYPEWELIKLEDI 226 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVFRFIDLQ 299 + E +K +LS I K + N + + Y+I +I + +L Sbjct: 227 LIERKEYASKTENYPHATLSTSGISLKSDRYNRDFLVRDKNKKYKITLMNDICYNPANL- 285 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGL---RQ 355 K + + + I + Y+ + + S L+ D G R Sbjct: 286 --KFGVITRNSIGSVIFSPIYITFEVNNGYSPLFIELLVTRKDFINRVRKYEEGTVYERM 343 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S+K ED +P ++EQ I ID E +E Sbjct: 344 SVKPEDFLNYETKIPCLEEQKKIGLF----FTEIDKCSEILE 381 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 18/174 (10%), Positives = 53/174 (30%), Gaps = 12/174 (6%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 P+ + + + + + R + ++ E +E + Sbjct: 6 TPELRFPEFKEEWIKQNIGNYLVEYKKYGSQNETHYPVATSSRRGLYMQNEYFEGDREFA 65 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM----------AVKPHGIDSTYLAWLM 337 ++++ + + S + + I + + + ++ + + Sbjct: 66 KKDVLYSIVPVNYFTYRHMSDDNIFKFNINTFNIPILVSKEYPVFTINNYYSHNFIFYEL 125 Query: 338 RSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + + + F M G R L F+ +K P +EQ I + + +I Sbjct: 126 NNNNRFEKFCRMQKKGGTRTRLYFKVLKEYKAFFPNYQEQSKIGDFFSKFDYQI 179 >gi|3805988|gb|AAC69256.1| type I restriction enzyme EcoRI specificity protein homolog [Helicobacter pylori] Length = 119 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 17/121 (14%), Positives = 36/121 (29%), Gaps = 9/121 (7%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y G+I+ + + + + +L + Sbjct: 1 KTKYSFPKKGDILISASGTIGRAVI----YDGKPAYFQDSNIVWIDNDETLVKNDFLFYT 56 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA---RIDVLVEKI 396 Y K L ++ + + +PP+ EQ I N+++ +D L+ K Sbjct: 57 YSHVKW--NTEHTTILRLYNDNFRNTLIPLPPLNEQIAIANILSDVDRYLYNLDALILKK 114 Query: 397 E 397 E Sbjct: 115 E 115 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 31/109 (28%), Gaps = 3/109 (2%) Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + S KG IL G R I +V E L Sbjct: 1 KTKYSFPKKGDILISASGTIGRAVIYDGKPAYFQDSNIVWI---DNDETLVKNDFLFYTY 57 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 ++ E T+ N +P+PPL EQ+ I + + Sbjct: 58 SHVKWNTEHTTILRLYNDNFRNTLIPLPPLNEQIAIANILSDVDRYLYN 106 >gi|254670659|emb|CBA06724.1| anti-codon nuclease masking agent [Neisseria meningitidis alpha153] Length = 160 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 14/146 (9%), Positives = 44/146 (30%), Gaps = 10/146 (6%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + + + ++ I+ + + + + + + + Sbjct: 12 DNSLQHISKSAVKGGKLFPANSIIMATSATIGEHALITVPFLANQRFTSLSLKPEFADKL 71 Query: 329 DSTYLAWL-MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 +L + + CK + S+ KR P+ +PP+ EQ I +++ Sbjct: 72 SIYFLYYYCFNLSEWCK--KNTTTSSFASVDMNGFKRFPIPIPPLPEQEKIVAILDKFDT 129 Query: 388 RIDVL-------VEKIEQSIVLLKER 406 + + + +E+ Sbjct: 130 LTHSISEGLPHEIALRRKQYEYYREQ 155 Score = 41.3 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 53/160 (33%), Gaps = 4/160 (2%) Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL 112 +ED+ + +S +F I+ A+I + + +F Sbjct: 1 MEDIRENGRILDNSLQHISKSAVKGGKLFPANSIIMATSATIGEHALIT-VPFLANQRFT 59 Query: 113 VLQPKDVLP---ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 L K + + ++++ + ++ + D G P+PIPPL EQ Sbjct: 60 SLSLKPEFADKLSIYFLYYYCFNLSEWCKKNTTTSSFASVDMNGFKRFPIPIPPLPEQEK 119 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 I + ++ I L +++ + ++ Sbjct: 120 IVAILDKFDTLTHSISEGLPHEIALRRKQYEYYREQLLAF 159 >gi|317473783|ref|ZP_07933064.1| type I restriction modification DNA specificity domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|316910040|gb|EFV31713.1| type I restriction modification DNA specificity domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 1249 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 54/411 (13%), Positives = 118/411 (28%), Gaps = 60/411 (14%) Query: 27 VVPIKRFTKLNTGRTSESGKDII----YIGLEDVESGTGKYLPKDGNSRQSDTS--TVSI 80 + PI + +G + K ++ + + + G D + +T+ Sbjct: 877 LKPIDKLASFQSGLW-KGEKGVLQMTKVLRNTNFKLNNGFLDYGDVAEIEVETTQLATRT 935 Query: 81 FAKGQILYGKLG-----PYLRKAIIAD-------FDGICSTQFLVLQPKDVLPELLQGWL 128 G I+ K G R + + CS + VL +V P L L Sbjct: 936 LQYGDIILEKSGGSDTQAIGRVVLFDKTDNETYSYSNFCS-RIRVLDASEVEPLYLWSVL 994 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + G + + D G I +P+PP+A Q I E+I + + Sbjct: 995 HNFYCKGGTIPLQNGIRLLNIDMNGYSKIKIPVPPIAVQKQIVEEIAKVDTSVSDAMQRI 1054 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 ++ ++ +L + + + Sbjct: 1055 DKYESDIENLLSSL----------------------------NNADSTLNTIAPFATKSI 1086 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + ++ N++Q + + P +I+ I K L Sbjct: 1087 KYGDIESETYITTDNMLQNKLGVLPFEGVANISSITEYKPEDILISNIRPYLKKIWLA-- 1144 Query: 309 QVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKR 364 + G + + ++ Y+ +++R G+ ED+ + Sbjct: 1145 --DKEGGCSKDVLVLRSADTSKYLPKYIFYMLRRDSFFDYVMEGKKGIKMPRGNKEDIMK 1202 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +P I EQ I I + K I + + + + + Sbjct: 1203 YKIPMPNIDEQKRIVAQIETLELE----ITKARTLIENVASEKQAILDKYL 1249 >gi|315634371|ref|ZP_07889658.1| type I restriction/modification specificity protein [Aggregatibacter segnis ATCC 33393] gi|315476961|gb|EFU67706.1| type I restriction/modification specificity protein [Aggregatibacter segnis ATCC 33393] Length = 203 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 13/122 (10%), Positives = 43/122 (35%), Gaps = 3/122 (2%) Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ + + V + ++ +++ +L + + + Sbjct: 68 VLIAEDGSASLENYSIQWAVGKFWANNHVHVVNGKEKLNNRFLYHYLTNMNFIPFL---A 124 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 R L ++++P+ +PP+ Q +I +++ TA L ++ + R Sbjct: 125 GKDRAKLTKAKLQQIPIPIPPLSVQTEIVKILDALTALTSELTSELILRRKQYEYYRERL 184 Query: 411 IA 412 ++ Sbjct: 185 LS 186 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 53/183 (28%), Gaps = 18/183 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + P+ + + +G N+ Q + G+ Sbjct: 18 EWKPLDEVANIVNNARKPVKSSLRV---------SGNIPYYGANNIQDYVEGYT--HDGE 66 Query: 86 ILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + G A + V+ K+ L +L Sbjct: 67 FVLIAEDGSASLENYSIQWAVGKFWANNHVHVVNGKEKLNN---RFLYHYLTNMNFIPFL 123 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + + IP+PIPPL+ Q I + + A T L +E I + + ++ Sbjct: 124 AGKDRAKLTKAKLQQIPIPIPPLSVQTEIVKILDALTALTSELTSELILRRKQYEYYRER 183 Query: 202 LVS 204 L+S Sbjct: 184 LLS 186 >gi|304437972|ref|ZP_07397917.1| conserved hypothetical protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369056|gb|EFM22736.1| conserved hypothetical protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 168 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 50/164 (30%), Gaps = 9/164 (5%) Query: 30 IKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYL--PKDGNSRQSDTSTVSIF 81 + + G T ++ +I ++ ++D + K + S+ + Sbjct: 5 LADIMDIIGGGTPKTNVEEYWDGEIPWLSVKDFNNDNRYVYRAEKTITKLGLENSSTKLL 64 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I+ G A+I + L+ KD + + + L + + Sbjct: 65 RYDDIIISARGTVGEVAMIPYPMA-FNQSCYGLRAKDEIVDSTYLYYLIRYNIRELRRKS 123 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 G+ NI + +P + Q + + +I+ Sbjct: 124 HGSVFDTITRDTFTNIEIDLPNMTIQRKVAIILKEIDDKIECNH 167 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 9/135 (6%), Positives = 41/135 (30%), Gaps = 4/135 (2%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + N + + + +++ +I+ + + Sbjct: 34 VKDFNNDNRYVYRAEKTITKLGLENSSTKLLRYDDIIISARGTVGEVAMIP----YPMAF 89 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 S Y I + + + Y++ ++ + ++ + + + +P + Q Sbjct: 90 NQSCYGLRAKDEIVDSTYLYYLIRYNIRELRRKSHGSVFDTITRDTFTNIEIDLPNMTIQ 149 Query: 376 FDITNVINVETARID 390 + ++ +I+ Sbjct: 150 RKVAIILKEIDDKIE 164 >gi|304383193|ref|ZP_07365666.1| conserved hypothetical protein [Prevotella marshii DSM 16973] gi|304335664|gb|EFM01921.1| conserved hypothetical protein [Prevotella marshii DSM 16973] Length = 163 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 42/130 (32%), Gaps = 7/130 (5%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-G 327 T + L + + +I+ P I I + + G P Sbjct: 39 NTASEYLTTKGRDVSRIIPPNSIAICCIGSIGKVGYI-----EQEGTTNQQINTAIPSLA 93 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNVINVET 386 I YL L S S + S+ ++ + + +PP +EQ I I+ Sbjct: 94 IFPDYLYHLCTSTYFQNSLMEKSSAVTISIVNKSKMEHIKIPLPPKEEQARIIVAIDNLF 153 Query: 387 ARIDVLVEKI 396 +D + E + Sbjct: 154 NALDAVKENL 163 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 11/159 (6%) Query: 32 RFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYL-PKDGNSRQSDTSTVSIFAKG 84 K+ TG T + D+++G + ++ D S I Sbjct: 2 DVAKIVTGSTPSKSNLSYYGGNFPLYKPSDLDAGRHTNTASEYLTTKGRDVS--RIIPPN 59 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQRIEAICEG 143 I +G K + +G + Q P + P+ L S + Sbjct: 60 SIAICCIGSI-GKVGYIEQEGTTNQQINTAIPSLAIFPDYLYHLCTSTYFQNSLMEKSSA 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 T+S + + +I +P+PP EQ I I +D Sbjct: 119 VTISIVNKSKMEHIKIPLPPKEEQARIIVAIDNLFNALD 157 >gi|227550932|ref|ZP_03980981.1| possible type I restriction-modification system specificity subunit [Enterococcus faecium TX1330] gi|257894852|ref|ZP_05674505.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,408] gi|257896562|ref|ZP_05676215.1| type I restriction-modification system specificity subunit [Enterococcus faecium Com12] gi|257900143|ref|ZP_05679796.1| type I restriction-modification system specificity subunit [Enterococcus faecium Com15] gi|293379739|ref|ZP_06625875.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium PC4.1] gi|293554046|ref|ZP_06674645.1| type I restriction-modification system specificity subunit [Enterococcus faecium E1039] gi|293568541|ref|ZP_06679861.1| type I restriction-modification system specificity subunit [Enterococcus faecium E1071] gi|314939011|ref|ZP_07846276.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133a04] gi|314943437|ref|ZP_07850204.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133C] gi|314952726|ref|ZP_07855704.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133A] gi|314991358|ref|ZP_07856836.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133B] gi|227179932|gb|EEI60904.1| possible type I restriction-modification system specificity subunit [Enterococcus faecium TX1330] gi|257831231|gb|EEV57838.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,408] gi|257833127|gb|EEV59548.1| type I restriction-modification system specificity subunit [Enterococcus faecium Com12] gi|257838055|gb|EEV63129.1| type I restriction-modification system specificity subunit [Enterococcus faecium Com15] gi|291588877|gb|EFF20705.1| type I restriction-modification system specificity subunit [Enterococcus faecium E1071] gi|291601791|gb|EFF32044.1| type I restriction-modification system specificity subunit [Enterococcus faecium E1039] gi|292641737|gb|EFF59911.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium PC4.1] gi|313594032|gb|EFR72877.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133B] gi|313595197|gb|EFR74042.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133A] gi|313597809|gb|EFR76654.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133C] gi|313641720|gb|EFS06300.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133a04] Length = 187 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 41/118 (34%), Gaps = 1/118 (0%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 S ++ +I+ K L E + + S Y+ Sbjct: 67 ISNSKLVDLRLEENDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDC 126 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K+ + G + ++ ++ +L + +PP++EQ +T I + I + Sbjct: 127 FLDSPLYWKLLEKISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRRI 184 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 7/164 (4%) Query: 27 VVPIKRFT-KLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V + + K+ G T + K ++ ++ + D++ G + + + Sbjct: 20 WVYLGSISTKIQYGYTDSAKKQGNVKFLRITDIQEGRVNWSSVPYCDISNSKLVDLRLEE 79 Query: 84 GQILYGKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 IL + G + K+ I++ S + + +L E + +L S + +E Sbjct: 80 NDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDCFLDSPLYWKLLEK 139 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 I G + + + + +P+PPL EQ + KI I Sbjct: 140 ISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRR 183 >gi|91217920|ref|ZP_01254873.1| hypothetical protein P700755_01262 [Psychroflexus torquis ATCC 700755] gi|91183897|gb|EAS70287.1| hypothetical protein P700755_01262 [Psychroflexus torquis ATCC 700755] Length = 195 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 15/147 (10%), Positives = 57/147 (38%), Gaps = 4/147 (2%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 L + ++ ++ + G+I+F N +S + ++ ++ Sbjct: 46 NYTYLGDDCYFVDSDTIKSKYYLKTGDILFIGKGTNNFALVFKSIDNLPTIASSALFVLK 105 Query: 324 -KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + ++ ++AW + ++ F +G S+ ++ P+++P ++ Q I + Sbjct: 106 VDKNLVNPDFIAWYINQSEVQNYFKTNEAGTYNTSINKTTLEETPIVLPSLEIQTKIAKI 165 Query: 382 --INVETARIDVLVEKIEQSIVLLKER 406 ++ + + + +++ + + Sbjct: 166 ANLHNQELALSNKIIELKNKLTTTQLL 192 Score = 37.1 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 41/134 (30%), Gaps = 5/134 (3%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI---IA 101 + I L+D E + G IL+ G + I Sbjct: 32 NGGVRVIQLKDFEENYTYLGDDCYFVDSDTIKSKYYLKTGDILFIGKGTNNFALVFKSID 91 Query: 102 DFDGICSTQFLV--LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPM 159 + I S+ V + V P+ + ++ +V + G + + + P+ Sbjct: 92 NLPTIASSALFVLKVDKNLVNPDFIAWYINQSEVQNYFKTNEAGTYNTSINKTTLEETPI 151 Query: 160 PIPPLAEQVLIREK 173 +P L Q I + Sbjct: 152 VLPSLEIQTKIAKI 165 >gi|304387860|ref|ZP_07370034.1| conserved hypothetical protein [Neisseria meningitidis ATCC 13091] gi|304338125|gb|EFM04261.1| conserved hypothetical protein [Neisseria meningitidis ATCC 13091] Length = 197 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 55/163 (33%), Gaps = 5/163 (3%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 + + N + N I N G +P Y + I Sbjct: 21 WKPLGGENGIAIIKTGQAVSKQKISNNIGSYPVINSGKEPLGYIDEWNTENDPIGITTRG 80 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQS 356 + + + RG + A +D +L ++ + + +A+ + + Sbjct: 81 AGVGSITWQEGRYF-RGNLNYAVTIKNRTELDVRFLYHIL--LEFEQEIHALCTFTGIPA 137 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 L ++K+L + +PP++ Q I +++ ++ + ++ Sbjct: 138 LNASNLKKLLIPIPPLETQQKIVKILDK-FTELEAELALRKRQ 179 >gi|294813863|ref|ZP_06772506.1| N-6 DNA methylase [Streptomyces clavuligerus ATCC 27064] gi|326442281|ref|ZP_08217015.1| N-6 DNA methylase [Streptomyces clavuligerus ATCC 27064] gi|294326462|gb|EFG08105.1| N-6 DNA methylase [Streptomyces clavuligerus ATCC 27064] Length = 752 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 63/169 (37%), Gaps = 11/169 (6%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G+ I +T + + + G++V + +A+ + + + Sbjct: 591 SGHRILHGDTGTVPWEEAEAHPRYRLRAGDLVMTRSGTVGRCALVTAAE--DGWLFGTHL 648 Query: 321 MAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFD 377 + ++PH + S YL + +G + + + + LPVL+PP E+ Sbjct: 649 VRIRPHSPVWSDYLLGFLTRPGTQDWIDRRAAGTTGVRHVSAKSLAGLPVLLPPEDERER 708 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I +++ R+D + L E R+ ++G +R + Q Sbjct: 709 IGRLLH----RLDERRRVHTSVVATLDEYRAELADLLLSG--RVRPDDQ 751 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 62/200 (31%), Gaps = 12/200 (6%) Query: 22 PKHWKVVPIKRFTKLNTG-------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS- 73 P W+ + ++ TG T E + + V + + Sbjct: 549 PPGWREATLGELAEITTGPGGKWPEGTGEPSAGVPVVRARHVSGHRILHGDTGTVPWEEA 608 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELL-QGWLLS 130 + G ++ + G R A++ + + T + ++P + G+L Sbjct: 609 EAHPRYRLRAGDLVMTRSGTVGRCALVTAAEDGWLFGTHLVRIRPHSPVWSDYLLGFLTR 668 Query: 131 IDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I+ G T + H K + +P+ +PP E+ I + R + Sbjct: 669 PGTQDWIDRRAAGTTGVRHVSAKSLAGLPVLLPPEDERERIGRLLHRLDERRRVHTSVVA 728 Query: 190 RFIELLKEKKQALVSYIVTK 209 E E L+S V Sbjct: 729 TLDEYRAELADLLLSGRVRP 748 >gi|254390385|ref|ZP_05005602.1| N-6 DNA methylase [Streptomyces clavuligerus ATCC 27064] gi|197704089|gb|EDY49901.1| N-6 DNA methylase [Streptomyces clavuligerus ATCC 27064] Length = 814 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 63/169 (37%), Gaps = 11/169 (6%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 G+ I +T + + + G++V + +A+ + + + Sbjct: 653 SGHRILHGDTGTVPWEEAEAHPRYRLRAGDLVMTRSGTVGRCALVTAAE--DGWLFGTHL 710 Query: 321 MAVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFD 377 + ++PH + S YL + +G + + + + LPVL+PP E+ Sbjct: 711 VRIRPHSPVWSDYLLGFLTRPGTQDWIDRRAAGTTGVRHVSAKSLAGLPVLLPPEDERER 770 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 I +++ R+D + L E R+ ++G +R + Q Sbjct: 771 IGRLLH----RLDERRRVHTSVVATLDEYRAELADLLLSG--RVRPDDQ 813 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 62/200 (31%), Gaps = 12/200 (6%) Query: 22 PKHWKVVPIKRFTKLNTG-------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQS- 73 P W+ + ++ TG T E + + V + + Sbjct: 611 PPGWREATLGELAEITTGPGGKWPEGTGEPSAGVPVVRARHVSGHRILHGDTGTVPWEEA 670 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELL-QGWLLS 130 + G ++ + G R A++ + + T + ++P + G+L Sbjct: 671 EAHPRYRLRAGDLVMTRSGTVGRCALVTAAEDGWLFGTHLVRIRPHSPVWSDYLLGFLTR 730 Query: 131 IDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I+ G T + H K + +P+ +PP E+ I + R + Sbjct: 731 PGTQDWIDRRAAGTTGVRHVSAKSLAGLPVLLPPEDERERIGRLLHRLDERRRVHTSVVA 790 Query: 190 RFIELLKEKKQALVSYIVTK 209 E E L+S V Sbjct: 791 TLDEYRAELADLLLSGRVRP 810 >gi|257889046|ref|ZP_05668699.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,141,733] gi|257825112|gb|EEV52032.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,141,733] Length = 187 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 41/118 (34%), Gaps = 1/118 (0%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 S ++ +I+ K L E + + S Y+ Sbjct: 67 ISNSKLVDLRLEENDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDC 126 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K+ + G + ++ ++ +L + +PP++EQ +T I + I + Sbjct: 127 FLDSPLYWKLLEKISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRRI 184 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 7/164 (4%) Query: 27 VVPIKRFT-KLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V + + K+ G T + K ++ ++ + D++ G + + + Sbjct: 20 WVYLGSISTKIQYGYTDSAKKQGNVKFLRITDIQEGRVNWFSVPYCDISNSKLVDLRLEE 79 Query: 84 GQILYGKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 IL + G + K+ I++ S + + +L E + +L S + +E Sbjct: 80 NDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDCFLDSPLYWKLLEK 139 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 I G + + + + +P+PPL EQ + KI I Sbjct: 140 ISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRR 183 >gi|209528296|ref|ZP_03276756.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] gi|209491261|gb|EDZ91656.1| restriction modification system DNA specificity domain [Arthrospira maxima CS-328] Length = 192 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 44/134 (32%), Gaps = 1/134 (0%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + N G Y + + + G + + Sbjct: 38 DNATDYDYINAGTTRSGYTASSNCEGDTVTTPYRGQGGICYVGYQKTPFWLGPLCYKLRS 97 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + YL + ++S + G+ ++ D+ +L + +PP+ Q +I ++ Sbjct: 98 TDEALLINKYLFYFLQSESDLLLGLKKEGGV-PAVNKSDLAKLEIPIPPLAIQAEIVRIL 156 Query: 383 NVETARIDVLVEKI 396 + TA L ++ Sbjct: 157 DTFTALTAELTAEL 170 >gi|238924761|ref|YP_002938277.1| restriction modification system, type I [Eubacterium rectale ATCC 33656] gi|238876436|gb|ACR76143.1| restriction modification system, type I [Eubacterium rectale ATCC 33656] Length = 364 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 53/368 (14%), Positives = 106/368 (28%), Gaps = 31/368 (8%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V I TK+ TG+ +V S GKY + ST S + +L Sbjct: 3 VKIGDLTKIKTGKLD-----------ANVSSEDGKYPFFTCSKEPLKISTYS-YDCECVL 50 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G K FD +++ + + D + G + Sbjct: 51 VAGNGDLNVKYYNGKFDAY-QRTYIIEANGSGKLYMPYLYYFMEDYIDELRKQAIGGVIK 109 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + + +P + EQ I E + +D E L + + V Sbjct: 110 YIKLANLTDALIELPSVDEQKSIVEILKKVKGILDKRNDEIRELDNL-------IKARFV 162 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 +P + + + E + I+++ + N Q Sbjct: 163 EMFGDPRSNPFGFEKKRLKDTCKVITGNTPSRAIEEYYGDYIEWIKTDNIVSGILNPTQA 222 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 E+ L + + V+ I+ I R AV P Sbjct: 223 TES----LSEKGMNVGRTVEKDSILMACIAGSIASIG-RVCITDRTVAFNQQINAVVPEQ 277 Query: 328 IDSTYLA--WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + +L + M L + G+ L ++ ++PP+ Q ++ + Sbjct: 278 YNILFLYVLFQMSKDYLVEDINMALKGI---LSKSKLEEKEFIIPPMDLQEQFSDFVKQV 334 Query: 386 T-ARIDVL 392 ++ D + Sbjct: 335 DKSKFDTM 342 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 50/167 (29%), Gaps = 11/167 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 P ++ +K K+ TG T G I +I +++ SG + + Sbjct: 172 PFGFEKKRLKDTCKVITGNTPSRAIEEYYGDYIEWIKTDNIVSGILNPTQATESLSEKGM 231 Query: 76 STVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + K IL + + + I D + Q + P+ +L ++L Sbjct: 232 NVGRTVEKDSILMACIAGSIASIGRVCITDRTVAFNQQINAVVPEQYN--ILFLYVLFQM 289 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + A + IPP+ Q + + Sbjct: 290 SKDYLVEDINMALKGILSKSKLEEKEFIIPPMDLQEQFSDFVKQVDK 336 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 11/103 (10%), Positives = 35/103 (33%), Gaps = 1/103 (0%) Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 ++ + + G + + ++ G+ + +K ++ Sbjct: 59 VKYYNGKFDAYQRTYIIEANGSGKLYMPYLYYFMEDYIDELRKQAIGGVIKYIKLANLTD 118 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + +P + EQ I ++ +D ++I + + L + R Sbjct: 119 ALIELPSVDEQKSIVEILKKVKGILDKRNDEI-RELDNLIKAR 160 >gi|47459122|ref|YP_015984.1| type I restriction-modification enzyme s subunit [Mycoplasma mobile 163K] gi|47458451|gb|AAT27773.1| type I restriction-modification enzyme s subunit [Mycoplasma mobile 163K] Length = 378 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 41/387 (10%), Positives = 95/387 (24%), Gaps = 41/387 (10%) Query: 33 FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR----QSDTSTVSIFAKGQILY 88 ++ + E I + L + + I+ Sbjct: 19 IVEIGSLLNYEQPSKYIVESTNYNKENQIPVLTAGKSFILGYTNEKNNIYGASKNNPIII 78 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 + DF + + L + LL+ + + Sbjct: 79 --FDDFTGSFKWVDFPFKIKSSAIKLLTVNSNNALLRYLYHIMTSMNFFSKEHK-----R 131 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 I +P+P + Q I + + + L E + K++ + +++ Sbjct: 132 LYISIYSKIKIPLPSIEIQEKIVKFLDTFSELTAELTAELTAELTARKKQYECYRDNLLS 191 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 + E + + K+ S I + + K Sbjct: 192 FNESTPYVSIGDVFEIIN--------------GKSILTKDYISKISGIYPVYSSQTLNKG 237 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + E+ G + S+ + I + Sbjct: 238 IIGYINKYEHNEESISWTRDGYV----------AGSVSYHFNEKFNISNRGLLKALNKNE 287 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 +T + + K L + ++ V +PPI+ Q I N+++ Sbjct: 288 VNTKFVFYLLEIIAKKHVNKRE--TIPHLTSSKMAKIKVPLPPIEVQNKIVNILDRFETL 345 Query: 389 IDVLVEKIEQSIVLLKE----RRSSFI 411 I L + I K+ R + Sbjct: 346 ISDLTIGLPAEIEARKKQYEYYRDKLL 372 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 46/156 (29%), Gaps = 10/156 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V I ++ G++ + I + Y + N + I Sbjct: 199 VSIGDVFEIINGKSILTKDYI-----SKISGIYPVYSSQTLNKGIIGYINKYEHNEESIS 253 Query: 88 YGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + + G + I + L K+ + +LL I + + T Sbjct: 254 WTRDGYVAGSVSYHFNEKFNISNRGLLKALNKNEVNTKFVFYLLEIIAKKHVNKR---ET 310 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + H + I +P+PP+ Q I + I Sbjct: 311 IPHLTSSKMAKIKVPLPPIEVQNKIVNILDRFETLI 346 >gi|315656946|ref|ZP_07909832.1| type I restriction-modification system [Mobiluncus curtisii subsp. holmesii ATCC 35242] gi|315492467|gb|EFU82072.1| type I restriction-modification system [Mobiluncus curtisii subsp. holmesii ATCC 35242] Length = 111 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 24/115 (20%), Positives = 47/115 (40%), Gaps = 8/115 (6%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + GI++ AY +DS + W +RS F Sbjct: 1 MNKMKAWQGSYGVSLYD----GIVSPAYYTFDLASSVDSEFFNWAIRSKAYIPFFGRDSY 56 Query: 352 GL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 G+ + K + ++ +P+ VPP++EQ I + + ID L+ +++ + LL Sbjct: 57 GIRTDQWDFKVQALRNIPLFVPPVEEQRQIVDYLVQRLKGIDGLITDLDRQVELL 111 >gi|302345834|ref|YP_003814187.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] gi|302149819|gb|ADK96081.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] Length = 238 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 64/178 (35%), Gaps = 8/178 (4%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + + ++ I L + Q + E+ ++ VD G+++F + K Sbjct: 64 MQKYRPTTNDAGIPVLKIKELGQGKVDEHSDQCSENIDSQYKVDNGDVIFSWSGTLMVKI 123 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 E G+ + Y W + + +K +++ Sbjct: 124 WCG----GECGLNQHLFKVTSEKYPKWFYYFWTLHHLKKFIHIAQDKAVTMGHIKRSELE 179 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + VL+P K+ +I +I+ A+I + + + L R + + ++G++++ Sbjct: 180 KSEVLIPSNKKLIEIDKIISPLLAKI---IALQTECLN-LTALRDTLLPKLMSGEVEI 233 >gi|315634372|ref|ZP_07889659.1| type I restriction/modification specificity protein [Aggregatibacter segnis ATCC 33393] gi|315476962|gb|EFU67707.1| type I restriction/modification specificity protein [Aggregatibacter segnis ATCC 33393] Length = 183 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 56/144 (38%), Gaps = 3/144 (2%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG--IITSAYMAV 323 K + N +K + + +I+ DL N K ++ V E + + Sbjct: 25 SKFISTNGAVKKYCNDQLVPLFKEDILIVMSDLPNGKALAKTFFVTEDNKYTLNQRIGRI 84 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 +++ + K +G + +L+ + + + + +PP++EQ I +++ Sbjct: 85 TVKEEVELLPSFVNHFLNRNKQLTKYDNGTDQTNLRKDQILDVVIPIPPLEEQQRIVSIL 144 Query: 383 NVETARIDVLVEKIEQSIVLLKER 406 + + + E + +I ++R Sbjct: 145 DKFETLTNSITEGLPLAIEQSQKR 168 >gi|260905938|ref|ZP_05914260.1| restriction modification system DNA specificity domain [Brevibacterium linens BL2] Length = 388 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 58/407 (14%), Positives = 136/407 (33%), Gaps = 50/407 (12%) Query: 24 HWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + I L G T G+ + + D ++ Q D V Sbjct: 4 GWRKITIGELCTLTKGTTPTQKAIPGQYPLVVTAAD---------SLSSDTYQFDGEAVC 54 Query: 80 IFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV--T 134 I L G L++ A + L+ ++ + ++ L +D Sbjct: 55 IP-----LVSSTGHGHASLKRVHYASGKFAVANIITALEARNGMDVEMKFLWLLLDHGRD 109 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I + +G + + + +PPL EQ I + I + +D +I +R+ Sbjct: 110 EIIVPLMKGTANVSVSQAALASAHVILPPLDEQRRIVDLIES----VDDVIDRALRYTME 165 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 QA + MK + + V F + ++ + ++ Sbjct: 166 CNAVSQARRKDL----------MKATDYVRMDSVATMASGAAFPSSEQGMSVGSIPFVKV 215 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL---QNDKRSLRSAQVM 311 + ++L + + + + ++ G ++F + +R L + Sbjct: 216 SDMNLPGNETHIRRANNYVSREAAARLGAKLWPSGTVIFPKVGAALSTEKRRVLTEVTAI 275 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + ++ + + +L MR+ L G S+ + V+ + Sbjct: 276 DNNVMG---LVPIEGVSLTGFLFAFMRTVKLGLYAQP---GAVPSINQKHVRSIRAPRLS 329 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 I+EQ I + E +D +++ E + L+ R++ + ++G+ Sbjct: 330 IEEQSAIID--EAEC--LDAVMQSSEFQLDRLRNLRANLLTTLLSGE 372 >gi|253577075|ref|ZP_04854397.1| type I restriction-modification system specificity subunit [Paenibacillus sp. oral taxon 786 str. D14] gi|251843569|gb|EES71595.1| type I restriction-modification system specificity subunit [Paenibacillus sp. oral taxon 786 str. D14] Length = 204 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 59/163 (36%), Gaps = 7/163 (4%) Query: 28 VPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V ++ ++ G++ +I + + +++ G + + Sbjct: 21 VKLRDVAEIFRGKSILKQDLKPGNIKVLNISNLDDGEVLLDQLETIDEEERKVKRYEILP 80 Query: 84 GQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G ++ G + A+ + G+ S ++ + + +L S T I++ Sbjct: 81 GDLVMTCRGTVNKLAVFPEAQGMVIASSNMIVIRFKSAIKSHFAKMFLESPVGTALIQSF 140 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 G T+ + + + + +P+ P +Q + E+ I E R Sbjct: 141 QRGTTVMNLNPADVAELELPLVPEDKQHELIEQYIREKERYKE 183 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 40/132 (30%), Gaps = 18/132 (13%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL-A 334 + E + PG++V N + I +S + ++ ++ Sbjct: 68 EEERKVKRYEILPGDLVMTCRGTVNKLAVFP--EAQGMVIASSNMIVIRFKSAIKSHFAK 125 Query: 335 WLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + S + + +L DV L + + P +Q + L+ Sbjct: 126 MFLESPVGTALIQSFQRGTTVMNLNPADVAELELPLVPEDKQHE--------------LI 171 Query: 394 EKIEQSIVLLKE 405 E+ + KE Sbjct: 172 EQYIREKERYKE 183 >gi|309810076|ref|ZP_07703922.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners SPIN 2503V10-D] gi|308169575|gb|EFO71622.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners SPIN 2503V10-D] Length = 416 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 50/405 (12%), Positives = 124/405 (30%), Gaps = 27/405 (6%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS--IFAKGQILY 88 + + I I ++ G+ + ++ + I+ Sbjct: 17 SDIIDCSHSTPVWRDRGIRVIRNFNLNEGSLDFSKGAFVDEKTYLERTKRAVPEAEDIVI 76 Query: 89 GKLGPYLRKAII-ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 + P AII + + ++L+ + + + + G+T+S Sbjct: 77 SREAPMGTVAIIPHNLKCCLGQRLVLLKVNSDICSSSYLLFALMSGFVQNQFNKIGSTVS 136 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + +P+ + I I I + + L + + Y Sbjct: 137 NLTIPELKETKIPLVKNH------KAIGKLLESIANKIQVNKQINDNLAAMIKTIYEYWF 190 Query: 208 TKGLNPD---VKMKDSGIEWVGLV------PDHWEVKPFFALVTELNRKNTKLIESNILS 258 + PD K SG + V P W V+ K S Sbjct: 191 IQFEFPDENGKPYKSSGGKMVWNEQLKRTIPQGWSVESIINTPLCYPIKPGIKPFSEKTY 250 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDP--GEIVFRFIDLQNDKRSLRSA--QVMERG 314 L+ ++I + E+ E+ + P + F + L S+ + Sbjct: 251 LATADVIGTSIGTGNPINYETRESRANMQPEINSVWFAKMKSSIKHLFLSSSMHDFIHSS 310 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIK 373 I+++ + ++ Y+A + + + + G ++++ +D+K + +L+P Sbjct: 311 ILSTGFQGLQCTERSFEYIASFIGNDYFETLKDQLAHGATQEAVNNDDLKGVKILIPD-- 368 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ + + + L+ L+ R + + GQ Sbjct: 369 --NRTLDLYHSASRQNYQLIGSALIENKHLESLRDWLLPMLMNGQ 411 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 52/173 (30%), Gaps = 18/173 (10%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG-----LKPESYETYQIVDPGE 290 + + + + + I + N+ + + G + + + Sbjct: 14 TLCSDIIDCSHSTPVWRDRGIRVIRNFNLNEGSLDFSKGAFVDEKTYLERTKRAVPEAED 73 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 IV + + + + V S+YL + + S + F Sbjct: 74 IVISREAPMGTVAIIPHNL---KCCLGQRLVLLKVNSDICSSSYLLFALMSGFVQNQFNK 130 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 +GS +L ++K + + +K I ++ +I ++ + I Sbjct: 131 IGS-TVSNLTIPELKETKIPL--VKNHKAIGKLLESIANKI-----QVNKQIN 175 >gi|261492505|ref|ZP_05989059.1| type I restriction-modification system, subunit S [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261311868|gb|EEY13017.1| type I restriction-modification system, subunit S [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 454 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 48/454 (10%), Positives = 119/454 (26%), Gaps = 70/454 (15%) Query: 29 PIKRFTKLN-----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + F + G + ++ + S G + + I Sbjct: 3 KLSDFISIKHGFAFKGEFITTEENANCLITPVNFSIGGGFKSDKFKYYTGEIPEKYILQP 62 Query: 84 GQILYGKLG------PYLRKAIIADFDG---ICSTQF--LVLQPKDVLPELLQGWLLSID 132 ++ A++ + G + + + + ++ E L + + + Sbjct: 63 NDLIVTMTDLSKQADTLGYPALVPNISGKKMLHNQRIGLVEFLDNELDKEYLYFLMRTKE 122 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +I + GAT+ H I + P L Q LI + ++ +I Sbjct: 123 YRHQILSTATGATVHHTSPSKILDFEFEKPDLQTQKLIAQYLMILEEKIQLNTQTNQTLE 182 Query: 193 ELLKEKKQALV---------SYIVTKG-------LNPDVKMKDSGIEWVGLV-------- 228 + + ++ + + G L+ IE + Sbjct: 183 AIAQAIFKSWFVDFDPVRAKAQAILDGKTSDEANLSAMAVFSGKAIEDLSQTEYQELWEI 242 Query: 229 -------------PDHWEVKPFFALVTELNRKNTKLIESNIL--------SLSYGNIIQK 267 P W+ L K ES +S ++ + Sbjct: 243 ADAFPSEFGDEGLPIGWKFNQADNLFDVGIGKTPPRKESEWFSDNANDTEWISIKDMGNQ 302 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT---SAYMAVK 324 + E + I + L R + + + + Sbjct: 303 GLFITESSEYLKVEAVDKFNIKRIPENTVILSFKLTVGRVSITTKETTTNEAIAHFKIPS 362 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + S +L ++++D + S + ++ + +K + +L P I Sbjct: 363 SSNLSSEFLYCYLKNFDFNNL--GSTSSIATAVNSKMIKEMKILEPSDLVINHFNEYIEG 420 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +I + + L + R + + G+ Sbjct: 421 IFNKIKENIIQNNN----LTKIRDELLPKLLNGE 450 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 19/195 (9%), Positives = 51/195 (26%), Gaps = 12/195 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTGKYLP--KDGN 69 +P WK + G+T + D +I ++D+ + + Sbjct: 255 LPIGWKFNQADNLFDVGIGKTPPRKESEWFSDNANDTEWISIKDMGNQGLFITESSEYLK 314 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 D + + ++ + + I + + + + Sbjct: 315 VEAVDKFNIKRIPENTVILS-FKLTVGRVSITTKETTTNEAIAHFKIPSSSNLSSEFLYC 373 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + + K I + + P E I +I I + Sbjct: 374 YLKNFDFNNLGSTSSIATAVNSKMIKEMKILEPSDLVINHFNEYIEGIFNKIKENIIQNN 433 Query: 190 RFIELLKEKKQALVS 204 ++ E L++ Sbjct: 434 NLTKIRDELLPKLLN 448 >gi|257891989|ref|ZP_05671642.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,410] gi|260560601|ref|ZP_05832766.1| predicted protein [Enterococcus faecium C68] gi|261209357|ref|ZP_05923734.1| predicted protein [Enterococcus faecium TC 6] gi|294619212|ref|ZP_06698695.1| type I restriction-modification system specificity subunit [Enterococcus faecium E1679] gi|314997568|ref|ZP_07862503.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133a01] gi|257828349|gb|EEV54975.1| type I restriction-modification system specificity subunit [Enterococcus faecium 1,231,410] gi|260073400|gb|EEW61737.1| predicted protein [Enterococcus faecium C68] gi|260076639|gb|EEW64389.1| predicted protein [Enterococcus faecium TC 6] gi|291594555|gb|EFF25949.1| type I restriction-modification system specificity subunit [Enterococcus faecium E1679] gi|313588385|gb|EFR67230.1| type I restriction modification DNA specificity domain protein [Enterococcus faecium TX0133a01] Length = 187 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 41/118 (34%), Gaps = 1/118 (0%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 S ++ +I+ K L E + + S Y+ Sbjct: 67 ISNSKLIDLRLEENDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDC 126 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K+ + G + ++ ++ +L + +PP++EQ +T I + I + Sbjct: 127 FLDSPLYWKLLEKISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRRI 184 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 7/164 (4%) Query: 27 VVPIKRFT-KLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V + + K+ G T + K ++ ++ + D++ G + + + Sbjct: 20 WVYLGSISTKIQYGYTDSAKKQGNVKFLRITDIQEGRVNWSSVPYCDISNSKLIDLRLEE 79 Query: 84 GQILYGKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 IL + G + K+ I++ S + + +L E + +L S + +E Sbjct: 80 NDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDCFLDSPLYWKLLEK 139 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 I G + + + + +P+PPL EQ + KI I Sbjct: 140 ISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMIRRSIRR 183 >gi|148983889|ref|ZP_01817208.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP3-BS71] gi|147924036|gb|EDK75148.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP3-BS71] Length = 213 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 36/116 (31%), Gaps = 8/116 (6%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ V ++ G + KD I +I + D E G + Sbjct: 83 DIPDTWEWVRFSTLVEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIK 142 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 +S + KG L + R I+ I + ++ L + + Sbjct: 143 KSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFY 198 >gi|295087089|emb|CBK68612.1| Type I restriction modification DNA specificity domain. [Bacteroides xylanisolvens XB1A] Length = 366 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 15/131 (11%), Positives = 41/131 (31%), Gaps = 12/131 (9%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + + I A + + + + G + Sbjct: 3 GGNIGSMILITRENYFDMAIKNVALFKQYIYNDVLIKYLYFYLQSQVVSIKNTALGGAQS 62 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ---SIVLLK-----ERR 407 + ++ + +PP+ EQ I + +D ++++ E+ + LK + + Sbjct: 63 FVSLNMLRNYLMPIPPLNEQKKIIE----KFKLLDFVIQQYEKSYCELNNLKHELFPKLK 118 Query: 408 SSFIAAAVTGQ 418 S + A+ G+ Sbjct: 119 KSILQEAIQGK 129 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 47/171 (27%), Gaps = 6/171 (3%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIES--NILSLSYGNIIQKLETRNMGLKPESY 280 E +P W+ + + + I + + Sbjct: 192 EIPFEIPVTWQWVRTKDIFQINPKNIAEDNCISAFIPMEKICATYGSEFSYDKVQWKTIK 251 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA----YMAVKPHGIDSTYLAWL 336 Y G++ F I R + GI + I+ YL + Sbjct: 252 TGYTHFADGDVAFAKITPCFQNRKSAIFHNLPNGIGAGTTELKVLRQFGETINRWYLLFF 311 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + S G+ +Q + ++ +PP++EQ I N I + Sbjct: 312 LESPYFIDEATFKGTANQQRITSGYLENKLFPLPPLQEQNRIENHIKAIAS 362 Score = 44.4 bits (103), Expect = 0.039, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 57/165 (34%), Gaps = 7/165 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W+ V K ++N +E +I +E + + G D ++ + + Sbjct: 196 EIPVTWQWVRTKDIFQINPKNIAEDNCISAFIPMEKICATYGSEFSYDKVQWKTIKTGYT 255 Query: 80 IFAKGQILYGKLGPYLRK------AIIADFDGICSTQFLV-LQPKDVLPELLQGWLLSID 132 FA G + + K+ P + + + G +T+ V Q + + + L Sbjct: 256 HFADGDVAFAKITPCFQNRKSAIFHNLPNGIGAGTTELKVLRQFGETINRWYLLFFLESP 315 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 A + N P+PPL EQ I I A Sbjct: 316 YFIDEATFKGTANQQRITSGYLENKLFPLPPLQEQNRIENHIKAI 360 >gi|148265620|ref|YP_001232326.1| hypothetical protein Gura_3599 [Geobacter uraniireducens Rf4] gi|146399120|gb|ABQ27753.1| hypothetical protein Gura_3599 [Geobacter uraniireducens Rf4] Length = 482 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 52/408 (12%), Positives = 117/408 (28%), Gaps = 43/408 (10%) Query: 35 KLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI---FAKGQILY 88 K++ G + ++ +V + + + G +L Sbjct: 51 KISDGTHFTPSYTENGVPFLSALNVLENSLSLEAGHRFISSEEHDNLYRRCDPQPGDVLL 110 Query: 89 GKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 K+G R A + F S L + + + PE+L ++ S ++ + +G Sbjct: 111 RKVGVGPRWAAVVPEGLPVFSIFVSVALLRPRTELIAPEVLATFINSESGQTQLLRVQKG 170 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI-------------- 189 A+ + I ++ +P+ Q I E I Sbjct: 171 ASQPDLHLEDIRDVFIPLFGQEFQNRIVELHQNSVEVSSKGIASYKNAETLLLNALNLAT 230 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 ++ V G + E L+ + E + N + Sbjct: 231 YTPTTKNTNIKSFKESFVASGRMDAEYYQPMFDEIEELIKSNGEYFKRVEEIQTYNSRGM 290 Query: 250 KLIESNILSLSYGNII-------QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 I ++ +K S E V +I+ + Sbjct: 291 AAIYDETGTVDMITQKHILEAGLNYDNFDKTNIKHFSTEETSFVAENDILIYGTGANIGR 350 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 + ++ + + ++ D Y+A+++ S+ M +G + L +D Sbjct: 351 A--QPYLSEKKAVACQDIIILRV-IEDPVYVAFVINSFIGRLQTEKMRTGSAQPHLYPKD 407 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 V ++ + Q I +I + +QS LL+ + + Sbjct: 408 VAQVLIPFVAKDTQLKI-------REKIISSLALKKQSTALLETAKRA 448 >gi|259501398|ref|ZP_05744300.1| conserved hypothetical protein [Lactobacillus iners DSM 13335] gi|259167147|gb|EEW51642.1| conserved hypothetical protein [Lactobacillus iners DSM 13335] Length = 168 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 52/162 (32%), Gaps = 9/162 (5%) Query: 29 PIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKD--GNSRQSDTSTVSI 80 + + G T ++ +I ++ ++D + + D S+ + Sbjct: 4 KLSEIMDIIGGGTPKTSNPEYWNGNIPWLSVKDFNNDYRYVYETEKAITQAGLDNSSTKM 63 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + + G A+I F + L+ K L + + L ++ Sbjct: 64 LKRNDSIISARGTVGEMAMIP-FPMAFNQSCYGLRAKKGLVDAEYLYYLIKHNVVVLKKN 122 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 G+ +I + +P L EQ ++ + +I+ Sbjct: 123 THGSVFDTITHDTFDDIEVELPSLKEQKVVASILRNLDDKIE 164 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 15/148 (10%), Positives = 45/148 (30%), Gaps = 9/148 (6%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPE-----SYETYQIVDPGEIVFRFIDLQNDK 302 N + NI LS + K + +++ + + + Sbjct: 21 NPEYWNGNIPWLSVKDFNNDYRYVYETEKAITQAGLDNSSTKMLKRNDSIISARGTVGEM 80 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + S Y G+ + + +++ + + ++ + Sbjct: 81 AMIP----FPMAFNQSCYGLRAKKGLVDAEYLYYLIKHNVVVLKKNTHGSVFDTITHDTF 136 Query: 363 KRLPVLVPPIKEQFDITNVINVETARID 390 + V +P +KEQ + +++ +I+ Sbjct: 137 DDIEVELPSLKEQKVVASILRNLDDKIE 164 >gi|212691986|ref|ZP_03300114.1| hypothetical protein BACDOR_01481 [Bacteroides dorei DSM 17855] gi|212665378|gb|EEB25950.1| hypothetical protein BACDOR_01481 [Bacteroides dorei DSM 17855] Length = 147 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 58/139 (41%), Gaps = 10/139 (7%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM--ERGIITSAYMAVKP---HGIDSTYL 333 E + G+++F D+ + + + ++ + S + + + S YL Sbjct: 8 DNEKQNTLLYGDLLFTLSSETPDEVGIGAVYLGESDKYYLNSFCFGLHMTATNKVYSPYL 67 Query: 334 AWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 A+L+ + K Y + G R +L+ D + +P + Q +I +N ++++ Sbjct: 68 AYLVSNSVFRKFIYPLAQGSTRFNLQKNDFMKKKFSLPTFENQKEIARTLNALSSKL--- 124 Query: 393 VEKIEQSIVLLKERRSSFI 411 E + ++ +E++ + Sbjct: 125 -ETERKLLLNYQEQKQYLL 142 >gi|295090945|emb|CBK77052.1| Restriction endonuclease S subunits [Clostridium cf. saccharolyticum K10] Length = 226 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 66/197 (33%), Gaps = 8/197 (4%) Query: 229 PDHWEVKPFFALVTELNRK-NTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQI 285 PD W+ + ++R + K + ++ I+ + + Sbjct: 29 PDEWKNVTLEDITALISRGISPKYADDTDQTVINQKCIRNHIIDLSFARSHRPKVINNKW 88 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + G+++ R+ + + S V+P + + L + ++ Sbjct: 89 LQFGDLLINSTGDGTLGRAAQVWFQPHNLTVDSHVTIVRPAAENMIFYIGLWGTQHEKEI 148 Query: 346 FYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 GS + L + VK + +L+P + N A + + ++ L Sbjct: 149 ESLHTGSTGQTELPRDRVKAIELLLPDKET----LERFNALIAPMAAAIVSNQEENNRLA 204 Query: 405 ERRSSFIAAAVTGQIDL 421 R + + ++G+ID+ Sbjct: 205 SIRDALLPKLMSGKIDV 221 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/191 (10%), Positives = 47/191 (24%), Gaps = 9/191 (4%) Query: 21 IPKHWKVVPIKRF-TKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P WK V ++ ++ G + + D I + + + + Sbjct: 28 VPDEWKNVTLEDITALISRGISPKYADDTDQTVINQKCIRNHIIDLSFARSHRP--KVIN 85 Query: 78 VSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 G +L G + + + +++P G + Sbjct: 86 NKWLQFGDLLINSTGDGTLGRAAQVWFQPHNLTVDSHVTIVRPAAENMIFYIGLWGTQHE 145 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + + + I + +P I I + E R Sbjct: 146 KEIESLHTGSTGQTELPRDRVKAIELLLPDKETLERFNALIAPMAAAIVSNQEENNRLAS 205 Query: 194 LLKEKKQALVS 204 + L+S Sbjct: 206 IRDALLPKLMS 216 >gi|218263888|ref|ZP_03477844.1| hypothetical protein PRABACTJOHN_03534 [Parabacteroides johnsonii DSM 18315] gi|218222438|gb|EEC95088.1| hypothetical protein PRABACTJOHN_03534 [Parabacteroides johnsonii DSM 18315] Length = 394 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 47/383 (12%), Positives = 112/383 (29%), Gaps = 46/383 (12%) Query: 39 GRTSESGKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVSIFAKGQILY-----GKLG 92 G S I I + + G + +G + K Sbjct: 30 GSEPTSENAIKVIRTTNFTNEGHLDLADVVTRDIEPKKVARKKLKQGDTILERSGGTKDN 89 Query: 93 PYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM----S 147 P R + D + + L+PK+ + + + L A+ A+ Sbjct: 90 PVGRVVFFDEIGDYLLNNFTQALRPKESVNPVYLFYALYNSYNINKAAMRAMASQTTGIQ 149 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + + +P EQ + +I + S + Sbjct: 150 NLSMSDFMSKSIVLPSRDEQN--------KFEQIYRQADKSKFGDFK---------SQFI 192 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 NP + + ++ +G + K + + + Y + Sbjct: 193 EMFGNPLSLNQKNELKRLGEC--CILNPRRPNIALCDTDKVSFIPMPAVSEDGYLVDMTD 250 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYMAVKP 325 E + + + + +++F I ++N K ++ + G+ ++ + ++P Sbjct: 251 EEYGKVK------KGFTYFENNDVLFAKITPCMENGKGAIVHGLINGIGMGSTEFHVLRP 304 Query: 326 HG--IDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +L L R + G+G ++ + + V +P I+EQ Sbjct: 305 INGISSPYWLLALTRMPIFRERAAKNMSGTGGQKRVSASYLDHFMVGLPAIEEQRRF--- 361 Query: 382 INVETARIDVLVEKIEQSIVLLK 404 + D I++++V L Sbjct: 362 -EAIYRQADKSKSVIQKTLVYLN 383 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 49/169 (28%), Gaps = 11/169 (6%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 IE G V + +++ + + E+ I + N + + Sbjct: 4 FIEMFGTVESYCKLEDLVSDTFPGEWGSEPTSENAIKVIRTTNFTNEGHLDLADVVTRDI 63 Query: 281 E----TYQIVDPGEIVFRFIDLQND---KRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 E + + G+ + D R + ++ + + ++ YL Sbjct: 64 EPKKVARKKLKQGDTILERSGGTKDNPVGRVVFFDEIGDYLLNNFTQALRPKESVNPVYL 123 Query: 334 AWLMRSYDLCKVF----YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + + A + Q+L D +++P EQ Sbjct: 124 FYALYNSYNINKAAMRAMASQTTGIQNLSMSDFMSKSIVLPSRDEQNKF 172 >gi|281357557|ref|ZP_06244044.1| restriction modification system DNA specificity domain protein [Victivallis vadensis ATCC BAA-548] gi|281315814|gb|EFA99840.1| restriction modification system DNA specificity domain protein [Victivallis vadensis ATCC BAA-548] Length = 229 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 63/198 (31%), Gaps = 23/198 (11%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +G +P W+V ++ K E L+ + K Sbjct: 50 ELGQIPAGWQVGTLKDMLEVRYGK-----EHKKLADGAIPVYGSGGLMRHVEKALYNGES 104 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++ + + + E + + + +V + YL ++ DL Sbjct: 105 VLIPRKGTLNNVMRVTG-----------EFWTVDTMFYSVPRKTGAAKYLYHILSKLDLT 153 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 S+ + + + +++PP + + T+ +E + + L Sbjct: 154 ---SMNSGSAVPSMTTDILNAIKIILPP----DKVLKDFDYLTSFFWESIETKKMEMQKL 206 Query: 404 KERRSSFIAAAVTGQIDL 421 + R + + ++G+ID+ Sbjct: 207 AQLRDALLPELMSGEIDV 224 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 69/201 (34%), Gaps = 25/201 (12%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS +G IP W+V +K ++ G+ + D +P G+ Sbjct: 48 DSE---LGQIPAGWQVGTLKDMLEVRYGKEHKKLADGA--------------IPVYGSGG 90 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +++ +L + G + T F + K + L L + Sbjct: 91 LMRHVEKALYNGESVLIPRKGTLNNVMRVTGEFWTVDTMFYSVPRKTGAAKYLYHILSKL 150 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 D+T ++ G+ + + I + +PP + + T I + Sbjct: 151 DLT----SMNSGSAVPSMTTDILNAIKIILPP----DKVLKDFDYLTSFFWESIETKKME 202 Query: 192 IELLKEKKQALVSYIVTKGLN 212 ++ L + + AL+ +++ ++ Sbjct: 203 MQKLAQLRDALLPELMSGEID 223 >gi|167768809|ref|ZP_02440862.1| hypothetical protein ANACOL_00126 [Anaerotruncus colihominis DSM 17241] gi|167668981|gb|EDS13111.1| hypothetical protein ANACOL_00126 [Anaerotruncus colihominis DSM 17241] Length = 228 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 22/168 (13%), Positives = 56/168 (33%), Gaps = 3/168 (1%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET--RNMGLKPESYETYQ 284 +P W + + ++ N + + + +++ + + Sbjct: 55 ELPVGWVWCRGHSCFESMESTKSQSEFFNYIDIDAIDNRLHRIKAAKHLLVSEAPSRASR 114 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 V G ++F + + +L + TS Y+ + ++ +LM S + Sbjct: 115 AVKNGSVLFSLVRPYLENIALVEERYSHCIASTSFYVCNSNGALLPEFMYFLMISGYMVN 174 Query: 345 VFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 G S+ ++++ +PP+ EQ I +N I+ Sbjct: 175 SLNQYMKGDNSPSISKDNIESWLYPIPPLDEQKVICTKLNTTFTLIEN 222 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 57/169 (33%), Gaps = 6/169 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W + T + YI ++ +++ + S S Sbjct: 55 ELPVGWVWCR-GHSCFESMESTKSQSEFFNYIDIDAIDNRLHRIKAAKHLLVSEAPSRAS 113 Query: 80 I-FAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDV-LPELLQGWLLSIDVT 134 G +L+ + PYL + + I ST F V LPE + ++S + Sbjct: 114 RAVKNGSVLFSLVRPYLENIALVEERYSHCIASTSFYVCNSNGALLPEFMYFLMISGYMV 173 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + +G I + PIPPL EQ +I K+ I+ Sbjct: 174 NSLNQYMKGDNSPSISKDNIESWLYPIPPLDEQKVICTKLNTTFTLIEN 222 >gi|332202396|gb|EGJ16465.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA41317] Length = 190 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 60/175 (34%), Gaps = 3/175 (1%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNNV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G ++ + L + +PP+ EQ I I ++ ++E + L KE Sbjct: 119 GTSYPAINDYNFNLLLIALPPLSEQQRIVEAIEPALEKVMNMLESYNRLEQLDKE 173 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 34/174 (19%), Positives = 66/174 (37%), Gaps = 7/174 (4%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNNVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + + +PPL+EQ I E I ++ ++ R +L KE L + Sbjct: 132 LLLIALPPLSEQQRIVEAIEPALEKVMNMLESYNRLEQLDKEFPDKLKNLFFNM 185 >gi|207108193|ref|ZP_03242355.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori HPKX_438_CA4C1] Length = 158 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 52/145 (35%), Gaps = 5/145 (3%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 N G Y D I + + + G+ Y + + + Sbjct: 3 NSGRDLYGYYHDFNNDGENITIASRGEYAGFINYFNEKFFAGGLCYP-YKVKDTNELLTK 61 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 +L + +++ ++ + + G +L D++ L + +PP++ Q +I +++ + Sbjct: 62 FLYFYLKTNEIQIMENLVFRGSIPALNKADIETLTIPIPPLEIQQEIVKILDQFSLLTTD 121 Query: 392 LVEKIEQSIVLLKE----RRSSFIA 412 L+ I I K+ R + Sbjct: 122 LLAGIPAEIKARKKQYEYYREKLLT 146 >gi|159026847|emb|CAO89098.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 677 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 35/309 (11%), Positives = 90/309 (29%), Gaps = 19/309 (6%) Query: 98 AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 A + GI ++ +V + + I + + I + G Sbjct: 369 AFVPYGTGIKTSLLVVQKLPANNDSCFMAQIKKIGYDVKGQTIYKRNQSGVIARTKSGLP 428 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 + R I E + I + + + + P+ + Sbjct: 429 IVDDDIDDISQSFRSFINGEFAQNSDCIYTVKNTLLNSRLDAEHYL---------PNDQK 479 Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 ++++G P +++++ I + Y + Q + + + Sbjct: 480 LLEHLKYIGAKPLGEITDILRDAADFRLARDSEIRYIAISDVDYRTM-QVVSQQIIKAHE 538 Query: 278 ESYETYQIVDPGEIVFRFIDLQN----DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + G+I+ +L + + G++ +L Sbjct: 539 APSRATYRLYKGDIITAISGASTGTPRQATALITEDEDGAICSNGFSVLRNIRGVEPLFL 598 Query: 334 AWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 MR+ + +G ++ +D+ ++ V +PP EQ I I A I + Sbjct: 599 LVYMRTDLFLRQIKRYMTGHAIPTILVDDLSKVLVPIPPKSEQQRIAKSI----AEIQAI 654 Query: 393 VEKIEQSIV 401 ++ ++ Sbjct: 655 RKEALKASE 663 >gi|269978340|gb|ACZ55904.1| truncated putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 276 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 37/268 (13%), Positives = 86/268 (32%), Gaps = 24/268 (8%) Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVT 208 D PIPPL Q I + + A T L TE ++ K++ Q ++ Sbjct: 1 MDMTAFKKYKFPIPPLEIQQEIVKILDAFTELNTELNTELNTELKARKKQYQ-YYQNMLL 59 Query: 209 KGLNPDVKMKDSGIEWVGLV---------PDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + + KD+ I+ P+ E + + + K E Sbjct: 60 DFNDINSNHKDAKIKSYPKRLKTLLQTLAPEGVEFRKLGEVCEIIRGKRVTKKEI----- 114 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 L+ + ++ I + + ++ Sbjct: 115 --------LDKGKYPVVSGGIGFMGYLNEYNREENTITIAQYGTAGFVNWQNQKFWANDV 166 Query: 320 YMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + P + YL +++ + + S + S+ ++ ++ + +PP++ Q +I Sbjct: 167 CFSAIPKETLINRYLYYVLTNMQNYLYSISNRSAIPYSISSNNIMQITIPIPPLEIQQEI 226 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKER 406 +++ + L+ I I K++ Sbjct: 227 VKILDQFSILTTDLLAGIPAEIKARKKQ 254 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 41/156 (26%), Gaps = 11/156 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ + + ++ G+ + + GKY G Sbjct: 89 PEGVEFRKLGEVCEIIRGKRVTKKEIL----------DKGKYPVVSGGIGFMGYLNEYNR 138 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + G + + PK+ L ++L+ Sbjct: 139 EENTITIAQYGT-AGFVNWQNQKFWANDVCFSAIPKETLINRYLYYVLTNMQNYLYSISN 197 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 A I I +PIPPL Q I + + Sbjct: 198 RSAIPYSISSNNIMQITIPIPPLEIQQEIVKILDQF 233 >gi|284053770|ref|ZP_06383980.1| HsdS [Arthrospira platensis str. Paraca] Length = 238 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 61/192 (31%), Gaps = 16/192 (8%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKL----ETRNMGLKPESYETYQIVDPGEIV 292 + ++ I ++ YG I ++ E + +PG++V Sbjct: 35 IGEFIRGKRFTKADYVDDGIPAIHYGEIYTHYGVAASHTLSQVRAEMAASLCYAEPGDVV 94 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GS 351 + + A + + H I+ +++++M++ S Sbjct: 95 MTGVGETVEDVGKAVAWIGSEKVAIHDDSWAFRHSINPKFVSYVMQTTAFINEKAKHVSS 154 Query: 352 GLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVETARIDVLVEKIEQSI---- 400 G L +K++P+ +P ++EQ I +++ + E + I Sbjct: 155 GKVNRLLINGIKKVPIPIPYPNDPKKSLEEQAHIVAILDKFDTLTHSISEGLPHEIAWRQ 214 Query: 401 VLLKERRSSFIA 412 + R + Sbjct: 215 KQYEYYRDLLLT 226 Score = 42.9 bits (99), Expect = 0.099, Method: Composition-based stats. Identities = 28/238 (11%), Positives = 72/238 (30%), Gaps = 26/238 (10%) Query: 3 HYKAYPQYKD---SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLED 55 K Y Y+D + + G + + P+ + G+ I I + Sbjct: 8 RQKQYNYYRDQLLTFEE--GEV----EWKPLGEIGEFIRGKRFTKADYVDDGIPAIHYGE 61 Query: 56 VESGTGKYLPKDGNSRQSDT-STVSIFAKGQILYGKLGPY----LRKAIIADFDGICSTQ 110 + + G + +++ +++ G ++ +G + + + Sbjct: 62 IYTHYGVAASHTLSQVRAEMAASLCYAEPGDVVMTGVGETVEDVGKAVAWIGSEKVAIHD 121 Query: 111 FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP------- 163 + P+ + + + ++ GI +P+PIP Sbjct: 122 DSWAFRHSINPKFVSYVMQTTAFINEKAKHVSSGKVNRLLINGIKKVPIPIPYPNDPKKS 181 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG 221 L EQ I + + + I+E + ++K+ ++ + K S Sbjct: 182 LEEQAHIVAILD-KFDTLTHSISEGLPHEIAWRQKQYEYYRDLLLTFPKKEEKQCASD 238 >gi|13508082|ref|NP_110031.1| hypothetical protein MPN343 [Mycoplasma pneumoniae M129] gi|12229978|sp|P75435|T1SD_MYCPN RecName: Full=Putative type-1 restriction enzyme specificity protein MPN_343; AltName: Full=S.MpnORFDP; AltName: Full=Type I restriction enzyme specificity protein MPN_343; Short=S protein gi|1674185|gb|AAB96141.1| hypothetical protein MPN_343 [Mycoplasma pneumoniae M129] Length = 330 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 53/180 (29%), Gaps = 13/180 (7%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ-------IVDPGEI 291 + N +I + G I K RN + Y + + Sbjct: 132 RKIYGANIPFETFQVKDICEIRRGRAITKAYIRNNPGENPVYSAATTNDGELGRIKDCDF 191 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 +I + + + + + + T + + K + + Sbjct: 192 DGEYITWTTNGYAGVVFYRNGKFNASQDCGVLKVKNKKICTKFLSFLLKIEAPKFVHNLA 251 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 S R L + + + + PP++ Q I +++ + LVE I I L R+ Sbjct: 252 S--RPKLSQKVMAEIELSFPPLEIQEKIADILFAFEKLCNDLVEGIPAEIEL---RKKQL 306 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 7/47 (14%), Positives = 19/47 (40%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 +L + + + PP++ Q I +++ T L ++ + Sbjct: 13 IPNLNLSRTEEIELDFPPLQIQQKIATILDTFTELSAELSAELSAEL 59 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 50/188 (26%), Gaps = 16/188 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI--FAK 83 + +K ++ GR + + G+ + Sbjct: 142 ETFQVKDICEIRRGRAITK---------AYIRNNPGENPVYSAATTNDGELGRIKDCDFD 192 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G+ + Y + S V K ++ +L + + + + Sbjct: 193 GEYITWTTNGYAGVVFYRNGKFNASQDCGV--LKVKNKKICTKFLSFLLKIEAPKFVHNL 250 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID---TLITERIRFIELLKEKKQ 200 A+ K + I + PPL Q I + + A + I I + + Q Sbjct: 251 ASRPKLSQKVMAEIELSFPPLEIQEKIADILFAFEKLCNDLVEGIPAEIELRKKQLDYYQ 310 Query: 201 ALVSYIVT 208 + V Sbjct: 311 NFLFNWVQ 318 >gi|312872178|ref|ZP_07732251.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2062A-h1] gi|311092262|gb|EFQ50633.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2062A-h1] Length = 178 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 59/134 (44%), Gaps = 7/134 (5%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII---TSAYMAVK-PHGID 329 + E + G+ + I + +++ G I ++ Y+ + G D Sbjct: 45 SFELEKFSGGTKFRNGDTIMARITPCLENGKTAKVNILDDGEIGFGSTEYIVFRAKKGTD 104 Query: 330 STYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 YL +L+ S + + + +GS RQ ++ + V+ L + VP I+EQ I ++ Sbjct: 105 KDYLYYLVCSPLVREPAIKSMVGSSGRQRVQTDVVQGLSIAVPSIEEQRQIGGILRALDD 164 Query: 388 RIDVLVEKIEQSIV 401 +I+ L +I +++ Sbjct: 165 KIE-LNNEINKNLA 177 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 56/167 (33%), Gaps = 13/167 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W + + N + G I ++ ++ + + S + F G Sbjct: 5 WTIKTLSDIADFNPRESLSKGTLAKKIAMDKLQ----PFCRDVPSFELEKFSGGTKFRNG 60 Query: 85 QILYGKLGPYLR-------KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + ++ P L + G ST+++V + K + +L+ + + Sbjct: 61 DTIMARITPCLENGKTAKVNILDDGEIGFGSTEYIVFRAKKGTDKDYLYYLVCSPLVREP 120 Query: 138 --EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 +++ + + + + +P + EQ I + A +I+ Sbjct: 121 AIKSMVGSSGRQRVQTDVVQGLSIAVPSIEEQRQIGGILRALDDKIE 167 >gi|332829722|gb|EGK02368.1| hypothetical protein HMPREF9455_01638 [Dysgonomonas gadei ATCC BAA-286] Length = 164 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 11/159 (6%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I + E + S Y ++ GE+ + +N K ++ Y Sbjct: 7 HGFINQSEKYSNDNAGNSLSKYTLLKQGELAYNRGSSRNKKYGSVFFLNYPNALVPYVYH 66 Query: 322 --AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-----SLKFEDVKRLPVLVPPIKE 374 + D + A+L+ S L K + S + ++ ED + V +P +++ Sbjct: 67 SFRMNSQICDVIFYAYLLNSKLLNKELRKIISSTARMDGLLNISREDFFSIKVPLPKLEK 126 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I+ +N + + + + +++ + Sbjct: 127 QQLISTSLNKLMQK----TKLEKDVVTRYHKQKQYILQQ 161 >gi|121609954|ref|YP_997761.1| restriction modification system DNA specificity subunit [Verminephrobacter eiseniae EF01-2] gi|121554594|gb|ABM58743.1| restriction modification system DNA specificity domain [Verminephrobacter eiseniae EF01-2] Length = 549 Score = 60.2 bits (144), Expect = 7e-07, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 49/133 (36%), Gaps = 11/133 (8%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDSTYLAWLMRS 339 + + I + L Q E I + KP+ D+ Y +L S Sbjct: 167 RFQDRDTLMARITPCLENGKLARFQAPEGEPIGHGSTEFIVIRGKPNVTDNDYAYYLAIS 226 Query: 340 YDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 ++ K + G+ RQ + + + ++ VL+PP+ EQ I +++ +D + Sbjct: 227 SEVRKFAISQMTGTSGRQRVPTDALGKISVLLPPLTEQKAIAHILGT----LDDKIALNR 282 Query: 398 QSIVLLKERRSSF 410 + L+ Sbjct: 283 RMNATLEAIAQVL 295 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 58/446 (13%), Positives = 126/446 (28%), Gaps = 51/446 (11%) Query: 16 QWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 +W G IP LN G ++ + + GT + + Sbjct: 114 EW-GEIP-------FSEAVLLNPATPLVKGVIYPFVEMSAIAVGTRDVKCSEYRNFSGGG 165 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFD-------GICSTQFLVLQ--PKDVLPELLQG 126 S F L ++ P L +A F G ST+F+V++ P + Sbjct: 166 S---RFQDRDTLMARITPCLENGKLARFQAPEGEPIGHGSTEFIVIRGKPNVTDNDYAYY 222 Query: 127 WLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 +S +V + + G + +G I + +PPL EQ I + +I Sbjct: 223 LAISSEVRKFAISQMTGTSGRQRVPTDALGKISVLLPPLTEQKAIAHILGTLDDKIALNR 282 Query: 186 TERIRFIELLKEKKQALVSYI--VTKGLNPDVKMKDSG----------------IEWVGL 227 + + ++ V + + S +G Sbjct: 283 RMNATLEAIAQVLFKSWFVDFDPVRAKMEGRWQRDQSLPGLPADLYDLFPERLVASELGE 342 Query: 228 VPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRNM-GLKPESYE 281 +P+ W + F V + K +I S + + + K + Sbjct: 343 IPEGWAIGSFSEAVEIIGGGTPKTSVSEYWGGDIPWFSVVDTPPSSDVFVVQTEKSITRS 402 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 I + + A++ Y +L Sbjct: 403 GLNGSSARMIAKGTTIISARGTVGNLGIAGRDMTFNQSCYALRGKNGSGDYFVFLSAQCM 462 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE-QFDITNVINVETARIDVLVEKIEQSI 400 + ++ + ++ + + ++PP Q T+ D + S Sbjct: 463 VEQLKVMAHGSVFSTITRQTFDAVRFVLPPEPVLQQ----FERTATSVFDAIFGNGNDSR 518 Query: 401 VLLKERRSSFIAAAVTGQIDLRGESQ 426 L + R + + ++G++ ++ + Sbjct: 519 SLAR-LRGTLLPKLISGELRIQDAER 543 >gi|259506124|ref|ZP_05749026.1| restriction enzyme subunit S [Corynebacterium efficiens YS-314] gi|259166298|gb|EEW50852.1| restriction enzyme subunit S [Corynebacterium efficiens YS-314] Length = 304 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/267 (11%), Positives = 78/267 (29%), Gaps = 28/267 (10%) Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDS 220 +P + EQ+ I + A +I + + LV + P + + Sbjct: 60 LPTMPEQLRIARILDAIDEQIAASRRILSKLRLEAEGVLDRLVQELSPADFVPLADLCTA 119 Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I + +L++ + + ++ Sbjct: 120 DI-----------------CYGIVQSGVFVPGGVPVLAIRDLDGDFETGVHLTSRSIDAQ 162 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL--AWLMR 338 V PG+++ + G I+ ++ +L+ Sbjct: 163 YRRSRVAPGDVLLSIKGTIGKVGIVP---DTYNGNISREIARIRFSARTDPAFARYYLLS 219 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 ++ A+ R + +K+ P I+ Q ++ V+ R ++ E+ Sbjct: 220 REAQRRLDLAVVGTTRAEVSIHVLKKFAFPSPAIQYQRNVARVMTALQER-----QESER 274 Query: 399 -SIVLLKERRSSFIAAAVTGQIDLRGE 424 ++ L+ R ++G++ + E Sbjct: 275 IALTKLQAMRRGLFEDLLSGRVRVPAE 301 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 13/79 (16%), Positives = 33/79 (41%), Gaps = 7/79 (8%) Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 LA L+ + + + A +G+ + + E + +++P + EQ I +++ Sbjct: 17 LPQILAGLLSTKVVQEYLNARTTGMAESQTNFADEALLSAELVLPTMPEQLRIARILDA- 75 Query: 386 TARIDVLVEKIEQSIVLLK 404 ID + + + L+ Sbjct: 76 ---IDEQIAASRRILSKLR 91 >gi|325690781|gb|EGD32782.1| type I restriction/modification enzyme [Streptococcus sanguinis SK115] Length = 191 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 17/141 (12%), Positives = 49/141 (34%), Gaps = 6/141 (4%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 T+ + E I + + S+Y+ ++ Y ++M Sbjct: 52 STFHNIANTEYPVLTISASGANAGYVNLWHVPVWASDSSYI--DSKMTNNVYFWYVMLKR 109 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 +++ + + + + ++ ++P I+ N+ + V + I Sbjct: 110 RQQEIYDSQTGSAQPHIYPKHIE----IMPTIELSKKEINLFTKRVTPLFKTVGNNLEEI 165 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 L+ R S ++ ++G+I + Sbjct: 166 NNLQNLRESLLSKLLSGEISV 186 >gi|315124538|ref|YP_004066542.1| type II restriction-modification enzyme [Campylobacter jejuni subsp. jejuni ICDCCJ07001] gi|315018260|gb|ADT66353.1| type II restriction-modification enzyme [Campylobacter jejuni subsp. jejuni ICDCCJ07001] Length = 960 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 55/449 (12%), Positives = 130/449 (28%), Gaps = 82/449 (18%) Query: 26 KVVPIKRF------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---- 75 ++V +K F K +G + + +G E +++ +G + + Sbjct: 491 ELVRLKDFVLDIQTAKRPSGGVGKYENGALSLGGEHIDNKSGYIKLDNPKYVPIEFYESF 550 Query: 76 --STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP------KDVLPELLQGW 127 I + IL K G K + + I + + + L Sbjct: 551 ALQDKGIVKQFDILICKDGALTGKIAMVRNEFIRKSAMINEHIFLLRCDNIAKQKYLFYI 610 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S Q +++ G+ + + +I +P Q I + + +T+ Sbjct: 611 LHSYSGQQALKSKITGSAQGGINKTNLESILIPNADFEIQKQIVAECEKVEEQYNTIRMS 670 Query: 188 RIRFIELLKEKKQ---------------------------------ALVSYIVTKG--LN 212 + L+K Q +L+ ++ L Sbjct: 671 VEEYQNLIKAILQKCGIIDDGGGYELNSILENLQKLESKLDFNLLLSLIEEQISHSEVLV 730 Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 + + K+ ++ ++ ++ + K I N +K ++ Sbjct: 731 EETQSKERKEDFNAFKNFSKTIQELLQTLSTPPKDGWKRISLKNEQYMELNPSKKEISKL 790 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI---- 328 S+ V + ID ++ +E I+ + +G Sbjct: 791 DENILVSFIEMASVSDKGYIQSKIDRSLNEVRKGYTYFIENDILIAKITPCMENGKCAIA 850 Query: 329 -----------------------DSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVK 363 DS++L + + ++ + G+ + + + Sbjct: 851 KNLTNNIGFGSTEFHIFRAKTGLDSSFLFYNLNQQNIREKAALAMTGASGHKRVPISFYE 910 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVL 392 L + +PP++ Q I I + +ID L Sbjct: 911 NLTIPLPPLEIQEKIVQNIELVEQQIDFL 939 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 17/164 (10%), Positives = 52/164 (31%), Gaps = 7/164 (4%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E Y + + + + IV +I+ K ++ + + Sbjct: 525 EHIDNKSGYIKLDNPKYVPIEFYESFALQDKGIVKQFDILICKDGALTGKIAMVRNEFIR 584 Query: 313 R--GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 + I ++ + YL +++ SY + + +G + + +++ + + Sbjct: 585 KSAMINEHIFLLRCDNIAKQKYLFYILHSYSGQQALKSKITGSAQGGINKTNLESILIPN 644 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + Q I E +++ I S+ + + + Sbjct: 645 ADFEIQKQIV----AECEKVEEQYNTIRMSVEEYQNLIKAILQK 684 Score = 39.0 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 64/196 (32%), Gaps = 19/196 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK + +K + + + +I + V S G K S Sbjct: 766 GWKRISLKN--EQYMELNPSKKEISKLDENILVSFIEMASV-SDKGYIQSKIDRSLNEVR 822 Query: 76 STVSIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + F + IL K+ P + + G ST+F + + K L + L Sbjct: 823 KGYTYFIENDILIAKITPCMENGKCAIAKNLTNNIGFGSTEFHIFRAKTGLDSSFLFYNL 882 Query: 130 SIDVTQRI--EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + A+ + N+ +P+PPL Q I + I +ID L + Sbjct: 883 NQQNIREKAALAMTGASGHKRVPISFYENLTIPLPPLEIQEKIVQNIELVEQQIDFLNLK 942 Query: 188 RIRFIELLKEKKQALV 203 + ++ Q + Sbjct: 943 LELLEKEKEKILQKYL 958 >gi|317131476|ref|YP_004090790.1| restriction modification system DNA specificity domain [Ethanoligenens harbinense YUAN-3] gi|315469455|gb|ADU26059.1| restriction modification system DNA specificity domain [Ethanoligenens harbinense YUAN-3] Length = 178 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 19/133 (14%), Positives = 47/133 (35%), Gaps = 9/133 (6%) Query: 273 MGLKPESYETYQIVDPGEIVFRFIDL---QNDKRSLRSAQVMERGIITSAYMAVK--PHG 327 + + G+ + I + E G ++ ++ ++ P Sbjct: 43 SSYEFSPFHGGSKFRNGDTLMARITPCLENGKTALVNILDQGEVGFGSTEFIVMRARPGI 102 Query: 328 IDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 D ++ +L +S L + +GS RQ ++ + PP++EQ +I ++ Sbjct: 103 SDKDFIYYLAQSPILRDKAIKSMVGSSGRQRVQLSVLNDTKFYAPPLEEQIEIAGILRAL 162 Query: 386 TARI--DVLVEKI 396 +I + + Sbjct: 163 DDKIANNTAINHH 175 >gi|309808306|ref|ZP_07702212.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LactinV 01V1-a] gi|308168453|gb|EFO70565.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LactinV 01V1-a] Length = 166 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 60/172 (34%), Gaps = 10/172 (5%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGE 290 +V ++ K N + ++Y T G+ Sbjct: 1 MKYRLCEIVDITMGQSPKSEFYNTEKKGLPFLQGNRTFGFKYPTFDTYTTVMTKFAKAGD 60 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ + + RG+ + ++ + ++L ++M+ Y + + Sbjct: 61 VIMSVRAPVGELNITPVDMCLGRGVCS-----LRMKNGNQSFLFYMMK-YYVSHLIKKEN 114 Query: 351 SGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIV 401 + S+ +D+ L V +P I+EQ I + + +I+ L +I +++ Sbjct: 115 GTVFGSVNRDDINGLEVDIPDDIEEQKKIARFLEMIDDKIE-LNNEINKNLA 165 >gi|262183025|ref|ZP_06042446.1| hypothetical protein CaurA7_03452 [Corynebacterium aurimucosum ATCC 700975] Length = 295 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 31/79 (39%), Gaps = 5/79 (6%) Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 ++ YL ++S + G + +K D+ L V +PP+ EQ I +++ Sbjct: 24 SEECNARYLLHFLQSARSFFQSRSRGV-TIKGIKRTDLNDLLVPLPPLDEQRRIAAILDE 82 Query: 385 ETARIDVLVEKIEQSIVLL 403 + + + + L Sbjct: 83 V----ESAIVAAKSQLSEL 97 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/106 (24%), Positives = 45/106 (42%), Gaps = 3/106 (2%) Query: 26 KVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 ++V + + + + + D+ +I ++ SG+ ++ TS F Sbjct: 110 ELVALSELVDIRSSLVDPTSEPYMDMPHIAPNNLSSGSDDFVGVKSAVEDRVTSGKYAFQ 169 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 G ILY K+ PYL K IA +DG+CS L P++ W Sbjct: 170 AGDILYSKIRPYLNKVSIAAYDGVCSADMYALVPRNRTQTDWIVWQ 215 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/331 (11%), Positives = 92/331 (27%), Gaps = 44/331 (13%) Query: 95 LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGI 154 + K+ + + S L ++ G T+ + Sbjct: 1 MGKSALVEAPVSFSQDVTNLNDLSEECNARYLLHFLQSARSFFQSRSRGVTIKGIKRTDL 60 Query: 155 GNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL-LKEKKQALVSYIVTKGLNP 213 ++ +P+PPL EQ I + I ++ + + +++ ++ Sbjct: 61 NDLLVPLPPLDEQRRIAAILDEVESAIVAAKSQLSELSAIPFWMGDRKFELVALSELVDI 120 Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 + D E +P +I + + Sbjct: 121 RSSLVDPTSEPYMDMP-------------------------HIAPNNLSSGSDDFVGVKS 155 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 ++ G+I++ I +K S+ + + Y V + + ++ Sbjct: 156 AVEDRVTSGKYAFQAGDILYSKIRPYLNKVSIAAY---DGVCSADMYALVPRNRTQTDWI 212 Query: 334 AWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITN--VINVETAR-- 388 W +RS + + + + V I V+ Sbjct: 213 VWQLRSSRFLAYAASSSGRASIPKINRKALGAFKV---------QIVEPAVLEQFNREQN 263 Query: 389 IDVLVEK-IEQSIVLLKERRSSFIAAAVTGQ 418 + +E + + + LL+E +SS A G+ Sbjct: 264 VKKTIENSVRKKLYLLQELQSSLSTRAFQGE 294 >gi|256851083|ref|ZP_05556472.1| restriction modification system DNA specificity subunit [Lactobacillus jensenii 27-2-CHN] gi|256616145|gb|EEU21333.1| restriction modification system DNA specificity subunit [Lactobacillus jensenii 27-2-CHN] Length = 175 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 49/150 (32%), Gaps = 6/150 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNSRQ--SDTSTV 78 WK V + + + G I +++E+GT + S++ + + Sbjct: 14 WKKVKLGQIADVRDGTHESPKYVSQNGYPLITSKNLENGTINFDDISYISKKDYEEINKR 73 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 S+ K IL+G +G AI+ L+ ++ L + S + Sbjct: 74 SLVEKNDILFGMIGTIGNVAIVKKSGFAIKNVALIKSNSEIPSINLIQIIQSDIFKKYTN 133 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQV 168 + G + I + +E + Sbjct: 134 RLNSGNSQKFISLGDIRKFDFKMASKSENM 163 >gi|32266933|ref|NP_860965.1| type I restriction/modification enzyme [Helicobacter hepaticus ATCC 51449] gi|32262985|gb|AAP78031.1| type I restriction/modification enzyme [Helicobacter hepaticus ATCC 51449] Length = 1164 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 11/112 (9%), Positives = 36/112 (32%), Gaps = 5/112 (4%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 I + E I S + + + + ++ + Sbjct: 1035 TISASGANAGFVNYWNEE--IFASDCTTINSDSKLDIKFIYYVLQFIQKDIYRLARGAAQ 1092 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL---VEKIEQSIVLL 403 + +D++++ + +PP+ Q I + + + +E+ ++ I + Sbjct: 1093 PHVYPKDIEQIKIPLPPLDIQKQIVAECERVEKQYNTIRMSIEEYQKLIKAI 1144 >gi|224538863|ref|ZP_03679402.1| hypothetical protein BACCELL_03759 [Bacteroides cellulosilyticus DSM 14838] gi|224519521|gb|EEF88626.1| hypothetical protein BACCELL_03759 [Bacteroides cellulosilyticus DSM 14838] Length = 209 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 64/194 (32%), Gaps = 17/194 (8%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + E+N K ++ + N+ N G + Y G+ + I Sbjct: 19 LIHEIAEINPKRNLSKGTSAKCIEMANLPTIGSFPN-GWIEKEYNGGMKFRNGDTLIARI 77 Query: 297 DL---QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKV--FYAMG 350 + E ++ Y+ + S+ + + R++D G Sbjct: 78 TPCLENGKTAFINFLDKDEIAYGSTEYIVISAKNNYSSSFFYFLARNHDFVDYAVKNMNG 137 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID-VLVEKIEQSIVLLKE--RR 407 S RQ + + + + + V P +E + + + L + S+ ++ R Sbjct: 138 SSGRQRVSGDTIGKYRIPVIPREE-------LESFMSHAEITLKTIKDNSLQNMRLSMIR 190 Query: 408 SSFIAAAVTGQIDL 421 + + ++G++ + Sbjct: 191 DALLPKLMSGELKV 204 >gi|160887307|ref|ZP_02068310.1| hypothetical protein BACOVA_05325 [Bacteroides ovatus ATCC 8483] gi|156107718|gb|EDO09463.1| hypothetical protein BACOVA_05325 [Bacteroides ovatus ATCC 8483] Length = 354 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 47/387 (12%), Positives = 116/387 (29%), Gaps = 53/387 (13%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 ++ +N +++ +YI LE VE G + + ++ ++ + + K IL+ Sbjct: 4 LQDIAAVNP-KSNPLQNSFVYIDLEAVEKGELRKI-QEVMREEAPSRAQRVIYKNDILFQ 61 Query: 90 KLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + PY + I + + ST + ++ + +P + L + +++ C G Sbjct: 62 CVRPYQKNNYIHKIQSKSNQQWVASTGYAQIRTTE-IPNYIYHLLNTDGFNRKVMVRCTG 120 Query: 144 ATMSHADWKGIGNIPMPIPPL-AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 ++ + + + I + EQ+ I + RI T + EK Q+L Sbjct: 121 SSYPAINSEDLATIRFYLTTDTKEQLKISRLLDLLDERIATQ--------NKIIEKLQSL 172 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK---NTKLIESNILSL 259 + K + + K N ++ Sbjct: 173 I--------------KGIAQHCIKESTSGNTYVKLGDICQITTGKLDANAQVDNGIYPFF 218 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + K+++ + I + + Sbjct: 219 TCAEQPFKIDSFAFDTEAL----------------LISGNGANLGYINYYHGKFNAYQRT 262 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y+ + Y+ W ++ ++ S + + L + +P Q I Sbjct: 263 YVLDIFSE-NIQYIKWALKVLLPKRIAIEKSSSNTPYIVLSTLSDLRLPIPNKSIQCHIA 321 Query: 380 NVINVETARIDVLVEKIEQSIVLLKER 406 ++ ++ + S LK+ Sbjct: 322 KLMQSLERKLSSQIAL-NGSYNRLKQY 347 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 55/180 (30%), Gaps = 7/180 (3%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + + + N + L + + + + + +++ +I+F Sbjct: 1 MASLQDIAAVNPKSNPLQNSFVYIDLEAVEKGELRKIQEVMREEAPSRAQRVIYKNDILF 60 Query: 294 RFIDLQNDKRSLRSAQVMERG-IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + Q + S A Y+ L+ + + +G Sbjct: 61 QCVRPYQKNNYIHKIQSKSNQQWVASTGYAQIRTTEIPNYIYHLLNTDGFNRKVMVRCTG 120 Query: 353 LR-QSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 ++ ED+ + + KEQ I+ +++ +D + + I L+ Sbjct: 121 SSYPAINSEDLATIRFYLTTDTKEQLKISRLLD----LLDERIATQNKIIEKLQSLIKGI 176 >gi|210611275|ref|ZP_03288830.1| hypothetical protein CLONEX_01020 [Clostridium nexile DSM 1787] gi|210152039|gb|EEA83046.1| hypothetical protein CLONEX_01020 [Clostridium nexile DSM 1787] Length = 184 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 59/161 (36%), Gaps = 10/161 (6%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQND 301 ++ I + N++ T + + + S + V G+++ Sbjct: 24 GGKETYCDNGISLVRSQNVLDFEFTDSGLAHINDEQASKLSNVEVIDGDVLINITGDSVA 83 Query: 302 KRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + A + + A + + + S+Y+ + ++ + A R +L Sbjct: 84 RVCKMDAAFLPARVNQHVAIVRGEKDKVLSSYILYYLQMMKGHLLQLASAGATRNALTKG 143 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 +++L + +P I+ Q IT+V++ +I + + I Sbjct: 144 MLEQLELELPDIETQMRITSVLDSFQEKI-----ALNRKIN 179 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 58/168 (34%), Gaps = 16/168 (9%) Query: 28 VPIKRFT-KLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKD---GNSRQSDTST 77 V +K K+ +G T GK+ I + ++V ++ N Q+ + Sbjct: 7 VKLKDICSKIGSGATPRGGKETYCDNGISLVRSQNVL--DFEFTDSGLAHINDEQASKLS 64 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQF-LVLQPKDVLPELLQGWLLSIDV 133 G +L G + + D + + +V KD + + L + Sbjct: 65 NVEVIDGDVLINITGDSVARVCKMDAAFLPARVNQHVAIVRGEKDKVLSSYILYYLQMMK 124 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 ++ GAT + + + + +P + Q+ I + + +I Sbjct: 125 GHLLQLASAGATRNALTKGMLEQLELELPDIETQMRITSVLDSFQEKI 172 >gi|116255298|ref|YP_771131.1| putative type I restriction-modification system specificity subunit [Rhizobium leguminosarum bv. viciae 3841] gi|115259946|emb|CAK03043.1| putative type I restriction-modification system specificity subunit [Rhizobium leguminosarum bv. viciae 3841] Length = 445 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 52/445 (11%), Positives = 118/445 (26%), Gaps = 48/445 (10%) Query: 20 AIPKHWKVVPIKRFT----KLNTGRTSESGKD---IIYIGLEDVESGTGKYLPK-DGNSR 71 IP V + + G D + + + + + Sbjct: 3 EIP----FVALADLCPPKRSITYGIVQPGKPDDIGVPIVRVNNFRGHRLDLTERLCVAPN 58 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL-- 129 + S +L +G + AI + V Sbjct: 59 VEAQYSRSRPQPYDVLISLVGSIGQVAIAGPEISGWNLARAVGLIPTKDRHHALWIFYAL 118 Query: 130 -SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + Q I + + K + P+P P + I + + +I+ Sbjct: 119 QSPEAQQYIRQHANTTVQATFNLKDLTKFPIPYPARQGREQIIGMLGSLDDKIELNRKMN 178 Query: 189 IRFIELLKEKKQALVSYIVTKGL------NPDVKM------------------KDSGIEW 224 + + + +P M G + Sbjct: 179 ETLEAIAQAIFRDWFVEFGPTRRKQDGATDPITIMGGLVQDTERAQALADLFPATLGDDS 238 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIE-SNILSLSYGNIIQKLETRNMGLKPESYETY 283 + + + + KN + + L + ++ T + Sbjct: 239 LPEGWESKSLLEQANWINGAAFKNMHFSDAPDALPVVKIAELKNGVTSGTKFTNTALGER 298 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR---SY 340 + GE++F + + + + AV+ +G+ S +++ Sbjct: 299 YRISDGELLFSWSGNPDTSID-AFVWIGGNAWLNQHIFAVRENGVRSKAALYVLLKALMP 357 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 ++ + + ED+KRL + V P + VI D+LV ++ ++ Sbjct: 358 QFAELARNKQTTGLGHVTKEDMKRLEIAVAPGPVETAFEAVITPLV---DLLVSRLFENR 414 Query: 401 VLLKERRSSFIAAAVTGQIDLRGES 425 L R + ++G+I L G Sbjct: 415 T-LAATRDLLLPKLMSGEIRLSGAE 438 >gi|9507712|ref|NP_053051.1| hypothetical protein pNZ4000_01 [Lactococcus lactis subsp. cremoris] gi|5230679|gb|AAD40958.1| hypothetical protein [Lactococcus lactis subsp. cremoris] Length = 100 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 4/78 (5%) Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + KV + G + + + V L + P I+EQ I + ++D + Sbjct: 24 YLMVPFREKVKRIVQGGTQIYVNYPAVSNLNLEQPEIEEQQKIGSF----FKQLDDTIAL 79 Query: 396 IEQSIVLLKERRSSFIAA 413 ++ + LLKE++ F+ Sbjct: 80 HQRKLDLLKEQKKGFLQK 97 >gi|269115297|ref|YP_003303060.1| Type I restriction enzyme specificity protein [Mycoplasma hominis] gi|268322922|emb|CAX37657.1| Type I restriction enzyme specificity protein [Mycoplasma hominis ATCC 23114] Length = 378 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 104/383 (27%), Gaps = 18/383 (4%) Query: 34 TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP 93 ++ G+ +I L + Y + N F + + G Sbjct: 2 CEIKRGKVYSKE----FIKLN--KGEYPVYSSQSLNDGILGKIDKYDFDGEYVTWTTDGA 55 Query: 94 YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKG 153 Y + +L + +L +L + Q + + + + Sbjct: 56 YAGTVFYRIGRFSITNVCGILSVLNK-SKLNVKYLSTCLSMQTKKFVNKASGNPKLMSNI 114 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 + NI +PIP ++ Q I E + + + + I+ K++ + ++ Sbjct: 115 MENIEIPIPHISIQNKIVEILDKLEIYTKDIQSGLPLEIDQRKKQYEYYRDKLLDFKDLA 174 Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 + + + + + D ++++ R+ + G Sbjct: 175 GGVLSKNYLLLLNELWDKIVNIVECLSISKIFREIKTGKLNANAETPNGKYAFWTCDERP 234 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 L E E+ + + + + H + Y Sbjct: 235 KLIDEYAF-------DEMAILISGNGSKVGHVNIYNGKFNAYQRTYILLKINHFVLWKYA 287 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + ++S + + ++ + +P I Q I +++ A + Sbjct: 288 YFYLKSNLKNYINVYKLDSGIPYITLPMLQNFVIPIPHISIQNKIVEILDKLQAYTKDIQ 347 Query: 394 EKIEQSIVLLKE----RRSSFIA 412 + I K+ R + Sbjct: 348 TGLPLEIDQRKKQYEHYRDKLLN 370 >gi|288801961|ref|ZP_06407402.1| type I restriction enzyme StySJI specificity protein [Prevotella melaninogenica D18] gi|288335396|gb|EFC73830.1| type I restriction enzyme StySJI specificity protein [Prevotella melaninogenica D18] Length = 177 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 52/169 (30%), Gaps = 6/169 (3%) Query: 228 VPDHWEVKPFFALVTELNRKN--TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 +P W + T + + + + + + + Sbjct: 1 MPKTWSNPKIKEVFTINPKNKVLDNINAGFVPMVYIDDGYSGAFKYEKRKWNDIKAGFTH 60 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGII--TSAYMAVKPHGIDSTYLAWLMRSYDLC 343 G+I I + R + + GI T+ + I+ Y + +S Sbjct: 61 FADGDIAVAKISPCLENRKSMILEKLPNGIGAGTTELYIFRSLNINPKYALYCFKSDSFI 120 Query: 344 KVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + +G+ +Q + ++ + +PP+ EQ I I + ++ Sbjct: 121 QQCIGTFNGVVGQQRVARRIIEEIRFPLPPLSEQLRIVTKIEELFSILN 169 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 58/168 (34%), Gaps = 7/168 (4%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK W IK +N + ++ + ++ G + + + F Sbjct: 2 PKTWSNPKIKEVFTINPKNKVLDNINAGFVPMVYIDDGYSGAFKYEKRKWNDIKAGFTHF 61 Query: 82 AKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 A G I K+ P L + + G +T+ + + ++ P+ S Q Sbjct: 62 ADGDIAVAKISPCLENRKSMILEKLPNGIGAGTTELYIFRSLNINPKYALYCFKSDSFIQ 121 Query: 136 RIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + G + I I P+PPL+EQ+ I KI ++ Sbjct: 122 QCIGTFNGVVGQQRVARRIIEEIRFPLPPLSEQLRIVTKIEELFSILN 169 >gi|253576199|ref|ZP_04853530.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] gi|251844326|gb|EES72343.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] Length = 232 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 58/163 (35%), Gaps = 7/163 (4%) Query: 28 VPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V ++ ++ G++ +I + + +++ G + + Sbjct: 49 VKLRDVAEIFRGKSILKQDLKPGNIKVLNISNLDDGEVLLDQLETIDEEERKVKRYEILP 108 Query: 84 GQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G ++ G + A+ + G+ S ++ + + +L S T I++ Sbjct: 109 GDLVMTCRGTVNKLAVFPEAQGMVIASSNMIVIRFKSAIKSHFAKMFLESPVGTALIQSF 168 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 G T+ + + + + +P+ P Q I ++ I E R Sbjct: 169 QRGTTVMNLNPADVAELELPLVPEDRQQEIIQQYIREKERYKE 211 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 42/132 (31%), Gaps = 14/132 (10%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL-A 334 + E + PG++V N + I +S + ++ ++ Sbjct: 96 EEERKVKRYEILPGDLVMTCRGTVNKLAVFP--EAQGMVIASSNMIVIRFKSAIKSHFAK 153 Query: 335 WLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + S + + +L DV L + + P Q +I I + Sbjct: 154 MFLESPVGTALIQSFQRGTTVMNLNPADVAELELPLVPEDRQQEI----------IQQYI 203 Query: 394 EKIEQSIVLLKE 405 + E+ +++E Sbjct: 204 REKERYKEVVRE 215 >gi|297250306|ref|ZP_06864062.2| type I restriction-modification system specificity determinant [Neisseria polysaccharea ATCC 43768] gi|296839222|gb|EFH23160.1| type I restriction-modification system specificity determinant [Neisseria polysaccharea ATCC 43768] Length = 200 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 49/185 (26%), Gaps = 10/185 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 K + + + + N++Q E + + S Sbjct: 13 KDVVWKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLSGYVPSEGKMTEYIVN 72 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+ I K G + + V ++ YL ++ Sbjct: 73 DILIGNIRPYLKKIWQADCTGGTNGDV--LVIRVTDEKVNPKYLYQVLADDKFFAFNMKH 130 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIV 401 G + + + +PP+ EQ IT +++ + + + Sbjct: 131 AKGAKMPRGSKTAIMQYKIPIPPLPEQEKITAILDKFDTLTHSVSEGLPHEIALRRKQYE 190 Query: 402 LLKER 406 +E+ Sbjct: 191 YYREQ 195 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 70/187 (37%), Gaps = 8/187 (4%) Query: 27 VVPIKRFTKLNTGRT-SESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + + R S+ + Y+G++++ ++ GK L G T I Sbjct: 17 WKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLS--GYVPSEGKMTEYIV--N 72 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICE 142 IL G + PYL+K AD G + LV++ + V P+ L L + Sbjct: 73 DILIGNIRPYLKKIWQADCTGGTNGDVLVIRVTDEKVNPKYLYQVLADDKFFAFNMKHAK 132 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA M I +PIPPL EQ I + ++ I L +++ + Sbjct: 133 GAKMPRGSKTAIMQYKIPIPPLPEQEKITAILDKFDTLTHSVSEGLPHEIALRRKQYEYY 192 Query: 203 VSYIVTK 209 ++ Sbjct: 193 REQLLAF 199 >gi|324994850|gb|EGC26763.1| hypothetical protein HMPREF9392_1666 [Streptococcus sanguinis SK678] Length = 191 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 51/156 (32%), Gaps = 9/156 (5%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 E R + + + E++ + + + + + Sbjct: 43 NGERRYVTESSYEFLKKSRLYGHEVIISNVADVGSVHRVPKMNMPMVAG-NNVVFLQSEN 101 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + YL S ++ SG +Q D + L + + +I + Sbjct: 102 SLLTDYLYVYFNSRLGQHDIMSITSGSAQQKFNKTDFRNLEIPILSDD-------IIKKK 154 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + I ++ I + I L + R++ + ++G+I + Sbjct: 155 ISSILHYIDNIHEEIACLMKIRATLLPKLLSGEISV 190 >gi|201068008|ref|ZP_03217849.1| hypothetical protein CJBH_1917c [Campylobacter jejuni subsp. jejuni BH-01-0142] gi|200004412|gb|EDZ04935.1| hypothetical protein CJBH_1917c [Campylobacter jejuni subsp. jejuni BH-01-0142] Length = 90 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 9/83 (10%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDVESGT 60 +KDSG++W+G IP+HW+VV IK TG + + I YI +D++ T Sbjct: 4 FKDSGIEWLGEIPQHWEVVKIKFLAIFYTGDSIKDSEKHKYCFLNNSIPYISTKDIDINT 63 Query: 61 GKYLPKDGNSRQSDTSTVSIFAK 83 +G + + + K Sbjct: 64 NVIDYNNGMFIEKNDANFKRRKK 86 >gi|331087340|ref|ZP_08336408.1| hypothetical protein HMPREF0987_02711 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330408366|gb|EGG87841.1| hypothetical protein HMPREF0987_02711 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 176 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 14/124 (11%), Positives = 45/124 (36%), Gaps = 5/124 (4%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + S + + K I + +L +++++ + G+G Sbjct: 48 IVVVARSGASAGFVSYWNQKIFVTDGFGYEEKSELITTKFLYYVLKNMESELNAMKRGAG 107 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRS 408 + + E + + + +P ++EQ IT++++ + L + I ++ + Sbjct: 108 V-PHISGEMLNSIELPIPLLQEQNRITDILDRFDTLCNDLSTGLPAEIEARQKQYEYYKD 166 Query: 409 SFIA 412 ++ Sbjct: 167 KLLS 170 >gi|313896529|ref|ZP_07830080.1| type I restriction modification DNA specificity domain protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312974953|gb|EFR40417.1| type I restriction modification DNA specificity domain protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 452 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 70/193 (36%), Gaps = 14/193 (7%) Query: 23 KHWKV-------VPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQ 72 + W+ + +K ++ G+ + + + + +V Y D S Sbjct: 258 EDWQRFMEKDSRIKLKEVAQVFRGKNISRKDENGNVGVVTISNVGEYVIDYDGLDHISEV 317 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDV--LPELLQGWL 128 T + G +L G R A+ D+ I S +V++P+ L+ + Sbjct: 318 ERKLTSYLLEDGDVLLTARGTATRSAVFHRQDYPCIASANMVVIRPRQDLLDSTYLKMFF 377 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + + + +G + + ++ + + +P+P + EQ + E+ E + + Sbjct: 378 DSPLGGKILSSAQQGTVVVNLSFRDVQEVEIPLPAIHEQKKLTEEYERELEVYLSTLKAA 437 Query: 189 IRFIELLKEKKQA 201 EK QA Sbjct: 438 EERWNNTLEKLQA 450 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 58/166 (34%), Gaps = 5/166 (3%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 RK+ + + G + + + + E T +++ G+++ Sbjct: 286 RKDENGNVGVVTISNVGEYVIDYDGLDHISEVERKLTSYLLEDGDVLLTARGTATRSAVF 345 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKR 364 + + +DSTYL S K+ + G +L F DV+ Sbjct: 346 HRQDYPCIASANMVVIRPRQDLLDSTYLKMFFDSPLGGKILSSAQQGTVVVNLSFRDVQE 405 Query: 365 LPVLVPPIKEQFDITN----VINVETARIDVLVEKIEQSIVLLKER 406 + + +P I EQ +T + V + + E+ ++ L+ R Sbjct: 406 VEIPLPAIHEQKKLTEEYERELEVYLSTLKAAEERWNNTLEKLQAR 451 >gi|169834416|ref|YP_001694341.1| type I restriction enzyme EcoBI specificity protein (S protein)(S.EcoBI) [Streptococcus pneumoniae Hungary19A-6] gi|168996918|gb|ACA37530.1| type I restriction enzyme EcoBI specificity protein (S protein)(S.EcoBI) [Streptococcus pneumoniae Hungary19A-6] Length = 305 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 93/317 (29%), Gaps = 34/317 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + ++ +G +S + + I + DVE G + Sbjct: 2 KKVKLGEVCEILSGYAFKSSQFNDNKIGLPLIRIRDVERGFSD------TYFEGTYPEEY 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ D + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKITDKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H I +I +P EQ LI +K+ I + R E E Sbjct: 115 KTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLN----TISQIYDFRKIQSEKFNELV 170 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + + G + D+ + + E L L Sbjct: 171 KSRFNEMF-------------GENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFL 217 Query: 260 SYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + N+ + + + + + ++ +IV + + Sbjct: 218 NTKNVTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTIRGTVGNVAYYDELIKYKHLR 277 Query: 316 ITSAYMAVKPHGIDSTY 332 I S + ++P + + Sbjct: 278 INSGMVILRPKTPNHNW 294 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 38/137 (27%), Gaps = 6/137 (4%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + +Y ++ G+++ + + ++ +K Sbjct: 42 FSDTYFEGTYPEEYLIKNGDLLITMDGS-----FILKKWEGDLALLNQRVCKIKITDKSV 96 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 K + L +K + ++P EQ I +N + Sbjct: 97 DEGYISWLIPKFLKEIEDKTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLNTISQIY 156 Query: 390 DVLVEKIEQSIVLLKER 406 D + E+ L+K R Sbjct: 157 DFRKIQSEKFNELVKSR 173 >gi|167975262|ref|ZP_02557539.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 12 str. ATCC 33696] gi|195659926|gb|EDX53306.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 12 str. ATCC 33696] Length = 360 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 44/385 (11%), Positives = 117/385 (30%), Gaps = 45/385 (11%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + ++ T ++ +I GL + N+ ++ I Sbjct: 6 KLSSVFEIITTGKQKNTFNINLEGLYPL------ISASTANNGIMGYVDNYLYDGQNITI 59 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGATMS 147 ++G + F++ + + ++ +LL ++ ++I +I G T Sbjct: 60 SRVGNAGTTFYHEGKISLTDNCFILSRINKKIAKVKYVFYLLKLNEDKKIRSISHGTTRK 119 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + N+ + +P + Q I I I+ + + + L+ + L S + Sbjct: 120 IINKTDLDNLIIYLPSIEIQNAIISIIEPIEKVINNIKNIKFKIESLVNKYFDFLYSNLE 179 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + +G + T I S + I Sbjct: 180 DSNFKKYI---------LGDLF-------------------TINRGQIINSKYIESNIGS 211 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K Y + F I Q I ++ +K + Sbjct: 212 YPVISSNTKNNGVFGYINSYMYDGEFITISADGAYAGTVFLQNGRFSITNVCFILIKNND 271 Query: 328 ID----STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV-- 381 ID + ++ ++++ + R +++ +K + + +P I+ Q + + Sbjct: 272 IDFKFSNKFVYYILKKEQEVNKLKSQVGSSRPAVREYSLKEIKINLPNIEIQEKFSKIVE 331 Query: 382 ----INVETARIDVLVEKIEQSIVL 402 ++ + +I+ ++ I Sbjct: 332 PLLNLSTKANKIEKILNDSLLKITK 356 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 33/73 (45%), Gaps = 1/73 (1%) Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 Y+ +L++ + K+ R+ + D+ L + +P I+ Q I ++I I+ Sbjct: 95 KYVFYLLKLNEDKKIRSISHGTTRKIINKTDLDNLIIYLPSIEIQNAIISIIEPIEKVIN 154 Query: 391 VLVEKIEQSIVLL 403 ++ I+ I L Sbjct: 155 N-IKNIKFKIESL 166 >gi|229587245|ref|YP_002845746.1| Type I restriction/modification enzyme endonuclease S subunit [Rickettsia africae ESF-5] gi|228022295|gb|ACP54003.1| Type I restriction/modification enzyme endonuclease S subunit [Rickettsia africae ESF-5] Length = 200 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 63/198 (31%), Gaps = 5/198 (2%) Query: 196 KEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK--NTKLIE 253 Q ++ ++ + +W + + + + L K T ++ Sbjct: 1 MNSYQKIIEGAKQI-IDNWHPYFEINKQWEIVKFGDIVINKLKSNILSLEHKEYTTLIVG 59 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 ++ I+ + Y Q G I+ M Sbjct: 60 KKGKMININTAIKGDIPVIASGRVSPYSHNQYNFNGNIITISSSGAYAGYIWYHNSPMWT 119 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 Y + + YL ++++S GSG + + +D++ L + +PP++ Sbjct: 120 SDCNVIYSIN-EKLLLTKYLYYILKSQQNIIYQKQAGSG-QPHVYLKDLEDLQIPIPPLE 177 Query: 374 EQFDITNVINVETARIDV 391 EQ + ++ ++ID Sbjct: 178 EQQKMVTELDNNQSKIDN 195 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 48/174 (27%), Gaps = 9/174 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESG---------TGKYLPKDGNS 70 I K W++V S + Y L + G G Sbjct: 23 EINKQWEIVKFGDIVINKLKSNILSLEHKEYTTLIVGKKGKMININTAIKGDIPVIASGR 82 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + F I G Y + S ++ + L + + Sbjct: 83 VSPYSHNQYNFNGNIITISSSGAYAGYIWYHNSPMWTSDCNVIYSINEKLLLTKYLYYIL 142 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 I G+ H K + ++ +PIPPL EQ + ++ +ID Sbjct: 143 KSQQNIIYQKQAGSGQPHVYLKDLEDLQIPIPPLEEQQKMVTELDNNQSKIDNP 196 >gi|268681726|ref|ZP_06148588.1| LOW QUALITY PROTEIN: type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae PID332] gi|268683953|ref|ZP_06150815.1| LOW QUALITY PROTEIN: type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae SK-92-679] gi|268686198|ref|ZP_06153060.1| LOW QUALITY PROTEIN: type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae SK-93-1035] gi|268622010|gb|EEZ54410.1| LOW QUALITY PROTEIN: type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae PID332] gi|268624237|gb|EEZ56637.1| LOW QUALITY PROTEIN: type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae SK-92-679] gi|268626482|gb|EEZ58882.1| LOW QUALITY PROTEIN: type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae SK-93-1035] Length = 206 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 48/185 (25%), Gaps = 10/185 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + K + + + + N++Q E + + S Sbjct: 16 KNVVWKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLSGYVPSEGKMTEYIVN 75 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+ I K G + + V ++ YL ++ Sbjct: 76 DILIGNIRPYLKKIWQADCTGGTNGDV--LVIRVTDEKVNPKYLYQVLADDKFFAFNMKH 133 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIV 401 G + + + +PP+ EQ I ++ + + + Sbjct: 134 AKGAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQYE 193 Query: 402 LLKER 406 +E+ Sbjct: 194 YYREQ 198 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 70/187 (37%), Gaps = 8/187 (4%) Query: 27 VVPIKRFTKLNTGRT-SESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + + R S+ + Y+G++++ ++ GK L G T I Sbjct: 20 WKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLS--GYVPSEGKMTEYIV--N 75 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICE 142 IL G + PYL+K AD G + LV++ + V P+ L L + Sbjct: 76 DILIGNIRPYLKKIWQADCTGGTNGDVLVIRVTDEKVNPKYLYQVLADDKFFAFNMKHAK 135 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA M I +PIPPL EQ I + ++ I L +++ + Sbjct: 136 GAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQYEYY 195 Query: 203 VSYIVTK 209 ++ Sbjct: 196 REQLLAF 202 >gi|284931720|gb|ADC31658.1| type I restriction-modification system specificity (S) subunit domain protein [Mycoplasma gallisepticum str. F] Length = 212 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 19/177 (10%), Positives = 55/177 (31%), Gaps = 3/177 (1%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + + E IL + + + E + G+++ Sbjct: 36 LRGNGLNWDAISQNGKEDCILYGHLYTDYGMIIDKVLYRTNEKLKNPFFSKFGDVLIPGS 95 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + + ++ + +I ++P + L + K+ + + + Sbjct: 96 GHTPNGLARATSIEKDDVLIGGDVNIIRPRKSINGSYLSLCLNSCRNKLIQIIKGSIVRH 155 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + D+K + V V I E+ ++ ID L+ ++ L+ + + + Sbjct: 156 IHNSDIKEIKVHV-SIHEKEQ--ALLVSIFKNIDNLLALHQRKCEKLQNIKEAILEK 209 Score = 40.5 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 58/193 (30%), Gaps = 13/193 (6%) Query: 25 WKVVPIKRFTKLNTGR-----TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 WK V +K G I + + G + K Sbjct: 24 WKQVKLKTLADFLRGNGLNWDAISQNGKEDCILYGHLYTDYGMIIDKVLYRTNEKLKNPF 83 Query: 80 IFAKGQILYGKLGPY----LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G +L G R I D + +++P+ + L Sbjct: 84 FSKFGDVLIPGSGHTPNGLARATSIEKDDVLIGGDVNIIRPRKSIN-GSYLSLCLNSCRN 142 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ I +G+ + H I I + + ++ +++ ID L+ R E L Sbjct: 143 KLIQIIKGSIVRHIHNSDIKEIKVHVSIHEKEQ---ALLVSIFKNIDNLLALHQRKCEKL 199 Query: 196 KEKKQALVSYIVT 208 + K+A++ + Sbjct: 200 QNIKEAILEKMFC 212 >gi|261496909|ref|ZP_05993277.1| type I restriction-modification system, subunit S [Mannheimia haemolytica serotype A2 str. OVINE] gi|261307433|gb|EEY08768.1| type I restriction-modification system, subunit S [Mannheimia haemolytica serotype A2 str. OVINE] Length = 454 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 48/454 (10%), Positives = 119/454 (26%), Gaps = 70/454 (15%) Query: 29 PIKRFTKLN-----TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + F + G + ++ + S G + + I Sbjct: 3 KLSDFISIKHGFAFKGEFITTEENANCLITPVNFSIGGGFKSDKFKYYTGEIPEKYILQP 62 Query: 84 GQILYGKLG------PYLRKAIIADFDG---ICSTQF--LVLQPKDVLPELLQGWLLSID 132 ++ A++ + G + + + + ++ E L + + + Sbjct: 63 NDLIVTMTDLSKQADTLGYPALVPNISGKKMLHNQRIGLVEFLDNELDKEYLYFLMRTKE 122 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +I + GAT+ H I + P L Q LI + ++ +I Sbjct: 123 YRHQILSTATGATVHHTSPSKILDFEFEKPDLQTQKLIAQYLMILEEKIQLNTQTNQTLE 182 Query: 193 ELLKEKKQALV---------SYIVTKG-------LNPDVKMKDSGIEWVGLV-------- 228 + + ++ + + G L+ IE + Sbjct: 183 AIAQAIFKSWFVDFDPVRAKAQAILDGKTSDEANLSAMAVFSGKAIEDLSQTEYQELWEI 242 Query: 229 -------------PDHWEVKPFFALVTELNRKNTKLIESNIL--------SLSYGNIIQK 267 P W+ L K ES +S ++ + Sbjct: 243 ADAFPSEFGDEGLPIGWKFNQADNLFDVGIGKTPPRKESEWFSDNANDTEWISIKDMGNQ 302 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT---SAYMAVK 324 + E + I + L R + + + + Sbjct: 303 GLFITESSEYLKVEAVDKFNIKRIPENTVILSFKLTVGRVSITTKETTTNEAIAHFKIPS 362 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + S +L ++++D + S + ++ + +K + +L P I Sbjct: 363 SSNLSSEFLYCYLKNFDFNNL--GSTSSIATAVNSKMIKEMEILEPSDLVINHFNEYIEG 420 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +I + + L + R + + G+ Sbjct: 421 IFNKIKENIIQNNN----LTKIRDELLPKLLNGE 450 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 19/195 (9%), Positives = 51/195 (26%), Gaps = 12/195 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTGKYLP--KDGN 69 +P WK + G+T + D +I ++D+ + + Sbjct: 255 LPIGWKFNQADNLFDVGIGKTPPRKESEWFSDNANDTEWISIKDMGNQGLFITESSEYLK 314 Query: 70 SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 D + + ++ + + I + + + + Sbjct: 315 VEAVDKFNIKRIPENTVILS-FKLTVGRVSITTKETTTNEAIAHFKIPSSSNLSSEFLYC 373 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + + K I + + P E I +I I + Sbjct: 374 YLKNFDFNNLGSTSSIATAVNSKMIKEMEILEPSDLVINHFNEYIEGIFNKIKENIIQNN 433 Query: 190 RFIELLKEKKQALVS 204 ++ E L++ Sbjct: 434 NLTKIRDELLPKLLN 448 >gi|254831875|ref|ZP_05236530.1| type I restriction enzyme S protein [Listeria monocytogenes 10403S] Length = 370 Score = 59.8 bits (143), Expect = 9e-07, Method: Composition-based stats. Identities = 18/148 (12%), Positives = 48/148 (32%), Gaps = 4/148 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET---YQIVDPGEIVFRFIDLQNDK 302 K+ + + + +++ + E + T V +IVF Sbjct: 25 HKSDYVDSGVAVIMPQNIGSRQVNYEKISYISEEFATTLRRYKVLKNDIVFARRGDVEKH 84 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFED 361 + ++ E + + +++ ++ + + K +L E Sbjct: 85 AFITESEEGELCGTGCFLVRFTSEHVLPEFISLILSTPFVKKWLVLNAVGSNMPNLNTEI 144 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARI 389 +K +P+ P + Q I + I+ +I Sbjct: 145 LKNVPIKFPDLSTQQKILSTISSVEYKI 172 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 53/399 (13%), Positives = 119/399 (29%), Gaps = 55/399 (13%) Query: 25 WKVVPIKRFTKLNTG-------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS- 76 W + K+ TG ++ + I +++ S Y S + T+ Sbjct: 4 WISTSLGEVAKIITGPFGTQLHKSDYVDSGVAVIMPQNIGSRQVNYEKISYISEEFATTL 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFL--VLQPKDVLPELLQGWLLSID 132 K I++ + G + A I + +C T + VLPE + L + Sbjct: 64 RRYKVLKNDIVFARRGDVEKHAFITESEEGELCGTGCFLVRFTSEHVLPEFISLILSTPF 123 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 V + + G+ M + + + + N+P+ P L+ Q +KI++ ++ I + Sbjct: 124 VKKWLVLNAVGSNMPNLNTEILKNVPIKFPDLSTQ----QKILSTISSVEYKIRINTKIN 179 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 L + +A+ + K S I + + + Sbjct: 180 TNLLDMAKAIYMHSF---FGKHENAKISDI-------------LLENSKSNIQVGEAREA 223 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + G I + + +V I + K + Sbjct: 224 RGDYPFFTSGETIYEWDNY-------------LVKDRNIYLNTGGNADVKFYIG------ 264 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 + ++ + + YL + + + L+ VK + +P Sbjct: 265 KAAYSTDTWCISAKNDFTDYLYLFLDAIRPELNQKFFQGTGLKHLQKALVKDKEIYLPS- 323 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 +I N + V ++ L + R + Sbjct: 324 ---KEILTEFNSIVKPMMEQVSFNTRNNQYLSDLRDWLL 359 >gi|86150434|ref|ZP_01068659.1| restriction modification enzyme [Campylobacter jejuni subsp. jejuni CF93-6] gi|85839029|gb|EAQ56293.1| restriction modification enzyme [Campylobacter jejuni subsp. jejuni CF93-6] Length = 699 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 60/452 (13%), Positives = 132/452 (29%), Gaps = 88/452 (19%) Query: 26 KVVPIKRF------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---- 75 ++V +K F K +G + + +G E +++ +G + + Sbjct: 230 ELVRLKDFVLDIQTAKRPSGGVGKYENGALSLGGEHIDNKSGYIKLDNPKYVPIEFYESF 289 Query: 76 --STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP------KDVLPELLQGW 127 I + IL K G K + + I + + + L Sbjct: 290 ALQDKGIVKQFDILICKDGALTGKIAMVRNEFIRKSAMINEHIFLLRCDNIAKQKYLFYI 349 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S Q +++ G+ + + +I +P Q I + + +T+ Sbjct: 350 LHSYSGQQALKSKITGSAQGGINKTNLESILIPNADFEIQKQIVAECEKVEEQYNTIRMS 409 Query: 188 RIRFIELLKEKKQ--------------ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + L+K Q +++ + D + S IE + Sbjct: 410 VEEYQNLIKAILQKCGIIDDGGGYELNSILENLQKLEFKLDFNLLLSLIEEQISHSEVLV 469 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + E ++ L + K + + LK E Y ++P + Sbjct: 470 EETQSKERKEDFNAFKNFSKTIQELLQTLSTPPKDGWKRISLKNE---QYMELNPSKKEI 526 Query: 294 RFIDLQNDKRSLRSAQVMERGIITS----------------------------------- 318 +D + A V ++G I S Sbjct: 527 SKLDENMLVSFIEMASVSDKGYIQSKIDRSLNEVRKGYTYFIENDILIAKITPCMENGKC 586 Query: 319 ----------------AYMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFE 360 ++ G+DS++L + + ++ + G+ + + Sbjct: 587 AIAKNLTNNIGFGSTEFHIFRAKTGLDSSFLFYNLNQQNIREKAALAMTGASGHKRVPIS 646 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + L + +PP++ Q I I + +ID L Sbjct: 647 FYENLTIPLPPLEIQEKIVQNIELVEQQIDFL 678 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 17/164 (10%), Positives = 52/164 (31%), Gaps = 7/164 (4%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E Y + + + + IV +I+ K ++ + + Sbjct: 264 EHIDNKSGYIKLDNPKYVPIEFYESFALQDKGIVKQFDILICKDGALTGKIAMVRNEFIR 323 Query: 313 R--GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 + I ++ + YL +++ SY + + +G + + +++ + + Sbjct: 324 KSAMINEHIFLLRCDNIAKQKYLFYILHSYSGQQALKSKITGSAQGGINKTNLESILIPN 383 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + Q I E +++ I S+ + + + Sbjct: 384 ADFEIQKQIV----AECEKVEEQYNTIRMSVEEYQNLIKAILQK 423 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 64/196 (32%), Gaps = 19/196 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK--------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 WK + +K + + + +I + V S G K S Sbjct: 505 GWKRISLKN--EQYMELNPSKKEISKLDENMLVSFIEMASV-SDKGYIQSKIDRSLNEVR 561 Query: 76 STVSIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + F + IL K+ P + + G ST+F + + K L + L Sbjct: 562 KGYTYFIENDILIAKITPCMENGKCAIAKNLTNNIGFGSTEFHIFRAKTGLDSSFLFYNL 621 Query: 130 SIDVTQRI--EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + A+ + N+ +P+PPL Q I + I +ID L + Sbjct: 622 NQQNIREKAALAMTGASGHKRVPISFYENLTIPLPPLEIQEKIVQNIELVEQQIDFLNLK 681 Query: 188 RIRFIELLKEKKQALV 203 + ++ Q + Sbjct: 682 LELLEKEKEKILQKYL 697 >gi|227508550|ref|ZP_03938599.1| possible type I site-specific deoxyribonuclease specificity subunit [Lactobacillus brevis subsp. gravesensis ATCC 27305] gi|227191882|gb|EEI71949.1| possible type I site-specific deoxyribonuclease specificity subunit [Lactobacillus brevis subsp. gravesensis ATCC 27305] Length = 212 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 59/182 (32%), Gaps = 13/182 (7%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEI 291 + T L K ++ +L L NI N + K + ++ ++ Sbjct: 27 KIGSGKTPLGGKKEYEQKNGVLFLRSQNINNNRIDLNNVAYISSKTDEEMISSSINYNDV 86 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC-KVFYAMG 350 + + ++ V + + G DS +L + SY ++F Sbjct: 87 LLNITGASIGRSAVYR-LVRHANVNQHVCIIRLVDGYDSDFLQLFLSSYYGQIQIFRNQA 145 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G R+ L F + + P + EQ + I + + L+ +++ Sbjct: 146 GGGREGLNFFQIGEMTFKFPTLNEQKRFSEF----FIDIQNTIAANQGK--RLQ-IKNAL 198 Query: 411 IA 412 ++ Sbjct: 199 LS 200 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 54/169 (31%), Gaps = 12/169 (7%) Query: 25 WKVVPIKRF-TKLNTGRTS-------ESGKDIIYIGLEDVESGTGKYLPK-DGNSRQSDT 75 W+ +K +K+ +G+T E ++++ +++ + +S+ + Sbjct: 16 WEQRKLKNITSKIGSGKTPLGGKKEYEQKNGVLFLRSQNINNNRIDLNNVAYISSKTDEE 75 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S +L G + ++ + + +++ D LS Sbjct: 76 MISSSINYNDVLLNITGASIGRSAVYRLVRHANVNQHVCIIRLVDGYDSDFLQLFLSSYY 135 Query: 134 -TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 +I G ++ IG + P L EQ E I I Sbjct: 136 GQIQIFRNQAGGGREGLNFFQIGEMTFKFPTLNEQKRFSEFFIDIQNTI 184 >gi|256960368|ref|ZP_05564539.1| type I restriction endonuclease S subunit [Enterococcus faecalis Merz96] gi|256950864|gb|EEU67496.1| type I restriction endonuclease S subunit [Enterococcus faecalis Merz96] Length = 201 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 81/186 (43%), Gaps = 15/186 (8%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 W+ + + + ++K+T E ILS + + E R + S Y+I+D G+ Sbjct: 23 DWKQRKLGDFLEDFSKKSTIENEYIILSSTNNGM----EIREGRVSGNSNLGYKIIDDGD 78 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM- 349 +V +L ++ + +G+++ +Y K ++ +L +R+ + + Sbjct: 79 LVLSPQNLWLGNINI---NNIGQGLVSPSYKTFKIIDLNKEFLNPQLRTNKMLDQYKNAS 135 Query: 350 ---GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 S +R++L+ + ++ + +P +EQ I + +++ + + + +K Sbjct: 136 TQGASIVRRNLELDLFYQIRIFIPKNEEQKQIG----LLFRKLNESISLHQSKLDSIKYL 191 Query: 407 RSSFIA 412 + +++ Sbjct: 192 KKAYLQ 197 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 63/185 (34%), Gaps = 8/185 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK + F + + +++ + II + S ++G + I Sbjct: 23 DWKQRKLGDFLEDFSKKSTIENEYII------LSSTNNGMEIREGRVSGNSNLGYKIIDD 76 Query: 84 GQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G ++ +L I + G+ S + + D+ E L L + + + + Sbjct: 77 GDLVLSPQNLWLGNININNIGQGLVSPSYKTFKIIDLNKEFLNPQLRTNKMLDQYKNAST 136 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 S ++ I + +++I +++ I+ ++ +K K+A Sbjct: 137 QGA-SIVRRNLELDLFYQIRIFIPKNEEQKQIGLLFRKLNESISLHQSKLDSIKYLKKAY 195 Query: 203 VSYIV 207 + + Sbjct: 196 LQNMF 200 >gi|254493320|ref|ZP_05106491.1| type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae 1291] gi|268594458|ref|ZP_06128625.1| type I restriction-modification system specificity determinant [Neisseria gonorrhoeae 35/02] gi|268598586|ref|ZP_06132753.1| type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae MS11] gi|268600939|ref|ZP_06135106.1| type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae PID18] gi|226512360|gb|EEH61705.1| type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae 1291] gi|268547847|gb|EEZ43265.1| type I restriction-modification system specificity determinant [Neisseria gonorrhoeae 35/02] gi|268582717|gb|EEZ47393.1| type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae MS11] gi|268585070|gb|EEZ49746.1| type I restriction enzyme EcoR124II specificity protein [Neisseria gonorrhoeae PID18] Length = 208 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 48/185 (25%), Gaps = 10/185 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + K + + + + N++Q E + + S Sbjct: 18 KNVVWKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLSGYVPSEGKMTEYIVN 77 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+ I K G + + V ++ YL ++ Sbjct: 78 DILIGNIRPYLKKIWQADCTGGTNGDV--LVIRVTDEKVNPKYLYQVLADDKFFAFNMKH 135 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIV 401 G + + + +PP+ EQ I ++ + + + Sbjct: 136 AKGAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQYE 195 Query: 402 LLKER 406 +E+ Sbjct: 196 YYREQ 200 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 70/187 (37%), Gaps = 8/187 (4%) Query: 27 VVPIKRFTKLNTGRT-SESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + + R S+ + Y+G++++ ++ GK L G T I Sbjct: 22 WKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLS--GYVPSEGKMTEYIV--N 77 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICE 142 IL G + PYL+K AD G + LV++ + V P+ L L + Sbjct: 78 DILIGNIRPYLKKIWQADCTGGTNGDVLVIRVTDEKVNPKYLYQVLADDKFFAFNMKHAK 137 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA M I +PIPPL EQ I + ++ I L +++ + Sbjct: 138 GAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQYEYY 197 Query: 203 VSYIVTK 209 ++ Sbjct: 198 REQLLAF 204 >gi|260439464|ref|ZP_05793280.1| type I restriction-modification system, S subunit [Butyrivibrio crossotus DSM 2876] gi|292808099|gb|EFF67304.1| type I restriction-modification system, S subunit [Butyrivibrio crossotus DSM 2876] Length = 245 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 45/176 (25%), Gaps = 2/176 (1%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 E +PD W + + K + + K+ L Sbjct: 70 CIDDEISFDIPDTWSWTRISTITDITMGSSPKSQDICNDNQYIEFHQGKIYFSKKTLM-- 127 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 Y + + L I ++K G + + Sbjct: 128 KSNQYTRKTTKLAPKQSVLLCVRAPVGELNITDRDICIGRGLASIKSLGNINEEFIFYWL 187 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + ++ + V+ + + +PP+ EQ +I N I ++ L Sbjct: 188 HPYKTYLVNQSTGSTFSAITSDTVRNILIPLPPLMEQKEILNKIQKVFTLLENLET 243 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 52/164 (31%), Gaps = 1/164 (0%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W I T + G + +S + G + K T + Sbjct: 78 DIPDTWSWTRISTITDITMGSSPKSQDICNDNQYIEFHQGKIYFSKKTLMKSNQYTRKTT 137 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 A Q + + + + I D D ++ + E + L + Sbjct: 138 KLAPKQSVLLCVRAPVGELNITDRDICIGRGLASIKSLGNINEEFIFYWLHP-YKTYLVN 196 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 G+T S + NI +P+PPL EQ I KI ++ Sbjct: 197 QSTGSTFSAITSDTVRNILIPLPPLMEQKEILNKIQKVFTLLEN 240 >gi|298528586|ref|ZP_07015990.1| restriction modification system DNA specificity domain protein [Desulfonatronospira thiodismutans ASO3-1] gi|298512238|gb|EFI36140.1| restriction modification system DNA specificity domain protein [Desulfonatronospira thiodismutans ASO3-1] Length = 382 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 52/401 (12%), Positives = 109/401 (27%), Gaps = 36/401 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 H++ P++ TG+ + + D + + + + Sbjct: 3 SHFQQSPLEEIVNFKTGKLNSNAAK------PDGKYPFFTCSQETYRTDTWSFDGEYVLL 56 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G + + + + + + +++I Sbjct: 57 AG----NNAAGVYPLKYFKGKFDVYQRTYAIRSINETKCLTRYVYYALRLQLELMKSIST 112 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + +P+PPL Q I + A I+ + E Q L Sbjct: 113 GVATKFLTMSLLNRAQIPLPPLPIQRKIASILSAYDDLIENNLRRIKILE----EMAQNL 168 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 K P + +G +P+ WE LV +N S+ Sbjct: 169 YREWFVKFRFPGHEKVRLVDSELGKIPEGWEAVKLGNLVKVRKGQNITKKTIVPGSIP-- 226 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 G+KP Y + SL + S Sbjct: 227 -------VVAGGIKPAYYHNTANTQHPVVTISASGANAGFVSL-----YHEYVWASDCSV 274 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV--PPIKEQFDITN 380 + + Y +L +V + + +D+ + V V PP I N Sbjct: 275 IDRSTTEHVYFFYLQLKERQHEVTRLQRGAAQPHVYPKDLMEI-VAVEAPP-----HILN 328 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + E + +V + +L++ R + ++G++D+ Sbjct: 329 SFSAEVYPLLHMVRNLSLKNRILRQTRDLLLPRLISGELDV 369 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 50/193 (25%), Gaps = 16/193 (8%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 DS +G IP+ W+ V + K+ G+ + G G + Sbjct: 188 DSE---LGKIPEGWEAVKLGNLVKVRKGQNITKKTIVP-----------GSIPVVAGGIK 233 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + + + G + S ++ + + +L Sbjct: 234 PAYYHNTANTQHPVVTISASGANAGFVSLYHEYVWASDCSVIDRSTTE--HVYFFYLQLK 291 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + GA H K + I P ++ + L + Sbjct: 292 ERQHEVTRLQRGAAQPHVYPKDLMEIVAVEAPPHILNSFSAEVYPLLHMVRNLSLKNRIL 351 Query: 192 IELLKEKKQALVS 204 + L+S Sbjct: 352 RQTRDLLLPRLIS 364 >gi|224542466|ref|ZP_03683005.1| hypothetical protein CATMIT_01648 [Catenibacterium mitsuokai DSM 15897] gi|224524613|gb|EEF93718.1| hypothetical protein CATMIT_01648 [Catenibacterium mitsuokai DSM 15897] Length = 300 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 50/157 (31%), Gaps = 6/157 (3%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN--DKRSLRSAQVMERGIITSAYMA-- 322 L G E V G+++ + R+ +V ++ + Sbjct: 130 DLSEWKYGAWSEEEAKPFAVTEGDLLVVRGNGSLALVGRAGLVGKVPDQVAYPDTLIRLR 189 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + S +++ S S + D+ + V VPP+ EQ I Sbjct: 190 TIETVVRSAWMSLNWNSELSRNHLEKRARTSAGIYKISQPDIVSVRVPVPPLAEQDRILA 249 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + +I + ++ ++ +R + + AA G Sbjct: 250 EFDTHMKQIGSVEAALDAALKQATAQRKNLLKAAFAG 286 >gi|307287455|ref|ZP_07567507.1| conserved domain protein [Enterococcus faecalis TX0109] gi|306501501|gb|EFM70800.1| conserved domain protein [Enterococcus faecalis TX0109] Length = 73 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 12/66 (18%), Positives = 33/66 (50%), Gaps = 5/66 (7%) Query: 349 MGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + SG + ++ +++ ++P + +EQ I + ++D + ++ + LLKE++ Sbjct: 8 LVSGAQPNVLSKEIDSFNFMIPILVQEQQKIGSF----FKQLDDTIALHQRKLDLLKEQK 63 Query: 408 SSFIAA 413 F+ Sbjct: 64 KGFLQK 69 >gi|194098149|ref|YP_002001197.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae NCCP11945] gi|193933439|gb|ACF29263.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae NCCP11945] gi|317163874|gb|ADV07415.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae TCDC-NG08107] Length = 207 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 48/185 (25%), Gaps = 10/185 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + K + + + + N++Q E + + S Sbjct: 17 KNVVWKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLSGYVPSEGKMTEYIVN 76 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +I+ I K G + + V ++ YL ++ Sbjct: 77 DILIGNIRPYLKKIWQADCTGGTNGDV--LVIRVTDEKVNPKYLYQVLADDKFFAFNMKH 134 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIV 401 G + + + +PP+ EQ I ++ + + + Sbjct: 135 AKGAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQYE 194 Query: 402 LLKER 406 +E+ Sbjct: 195 YYREQ 199 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 70/187 (37%), Gaps = 8/187 (4%) Query: 27 VVPIKRFTKLNTGRT-SESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + + R S+ + Y+G++++ ++ GK L G T I Sbjct: 21 WKTLGEVAEYSKNRICSDKLNEHNYVGVDNLLQNREGKKLS--GYVPSEGKMTEYIV--N 76 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAICE 142 IL G + PYL+K AD G + LV++ + V P+ L L + Sbjct: 77 DILIGNIRPYLKKIWQADCTGGTNGDVLVIRVTDEKVNPKYLYQVLADDKFFAFNMKHAK 136 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA M I +PIPPL EQ I + ++ I L +++ + Sbjct: 137 GAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQYEYY 196 Query: 203 VSYIVTK 209 ++ Sbjct: 197 REQLLAF 203 >gi|300911563|ref|ZP_07129007.1| EcoA family type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus TCH70] gi|300886984|gb|EFK82185.1| EcoA family type I restriction-modification enzyme [Staphylococcus aureus subsp. aureus TCH70] Length = 243 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 33/243 (13%), Positives = 79/243 (32%), Gaps = 24/243 (9%) Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 +KI ++D I + +ELL+++K+ + I T+ L + + EW Sbjct: 21 QKIGKFFSKLDRQIELEEQKLELLQQQKKGYMQKIFTQELRFKDENGEEYPEWENKFIKD 80 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 + + K + + + ++ N + Sbjct: 81 IFIFENNRRKPITSSLREKGLYPYYGATGIIDYVKDYLFNNEE---------------RL 125 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + + S + + + VK + + ++ + + K A + Sbjct: 126 LIGEDGAKWGQFETSSFIANGQYWVNNHAHVVKSNDHNLFFMNYYLN----FKELRAFVT 181 Query: 352 GLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G L ++ + + +P + EQ + ++ ID + I LLKER+ Sbjct: 182 GNAPAKLTHANLCNINLKIPCLTEQ----DKVSALLKSIDNKMNNQMNRIELLKERKKEL 237 Query: 411 IAA 413 + Sbjct: 238 LQK 240 >gi|171920515|ref|ZP_02931799.1| reStriction-modification enzyme mpuuiii s subunit [Ureaplasma parvum serovar 1 str. ATCC 27813] gi|171902420|gb|EDT48709.1| reStriction-modification enzyme mpuuiii s subunit [Ureaplasma parvum serovar 1 str. ATCC 27813] Length = 361 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 42/391 (10%), Positives = 112/391 (28%), Gaps = 48/391 (12%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD---TSTVSIFAK 83 + + + G I + +E G Y + ++ + K Sbjct: 3 IYKLYELVNIYKGSN--------LITKKYIEQNEGIYPVISSKTTENGVYGFINTYDYEK 54 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA--IC 141 +I G + + + LV ++ + L++ + I Sbjct: 55 DKITMSSDGENAGTTFWQEKNFSLTNHALVFIMNKLIKYNYKYLFLTLKKHESKIKELII 114 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+T + +I + +P + EQ I + I I+ + +I+ L+ + Sbjct: 115 SGSTRPSVSLSLLKSINIKLPSIEEQNAIIDIIEPIEKVINNIKNVKIKIESLINKYFDF 174 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L S + + I I S Sbjct: 175 LYSDLKDSNFKKYILGDLFTI----------------------------NRGQIINSKYI 206 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 N I + K Y + F I + + I ++ Sbjct: 207 DNNIGSYPVISSNTKNNEIFGYINSYMYDGEFITISADGAYAGTVFLENGKFSITNVCFI 266 Query: 322 AVKPHG----IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF- 376 +K ++ ++ ++++ + R +++ +K + + +P ++ Q Sbjct: 267 LIKNKDIDFKFNNKFVYYILKKEQEINRLKSQVGSSRPAVREYSLKEIKINLPNMEIQEE 326 Query: 377 --DITNVINVETARIDVLVEKIEQSIVLLKE 405 I + + + + + + + S++ + + Sbjct: 327 FSKIVEPLLNLSTKANKIEKILNDSLLKITK 357 >gi|297571611|ref|YP_003697385.1| restriction modification system DNA specificity domain protein [Arcanobacterium haemolyticum DSM 20595] gi|296931958|gb|ADH92766.1| restriction modification system DNA specificity domain protein [Arcanobacterium haemolyticum DSM 20595] Length = 242 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 65/190 (34%), Gaps = 11/190 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRK-------NTKLIESNILSLSYGNIIQKLETRN 272 + E +PD WE F L E+ ++ ++ NI+ Sbjct: 54 TDDEEYFDIPDTWEWTRFSELAIEVCTGPFGSALHRRDYVDDGTPVINPSNIVGDTFVPT 113 Query: 273 MGLKPESYE--TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + E+ + + G++V + A+ Y + + Sbjct: 114 VFVNEETSARLSSFALAHGDLVIGRRGEMGRSAVVSEAEAGWLCGTGCFYAKSGENHVF- 172 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 Y+A +++ + + Q+L ++RLP+ VP +EQ I + A + Sbjct: 173 DYVALTLKAPSVRAQLSSSSLGTTMQNLNQTTLRRLPLAVPSRREQLRIDAKLGQLKAPM 232 Query: 390 DVLVEKIEQS 399 L + ++ + Sbjct: 233 RSLQQLLQNA 242 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 60/171 (35%), Gaps = 12/171 (7%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 IP W+ ++ TG + I ++ T Sbjct: 61 DIPDTWEWTRFSELAIEVCTGPFGSALHRRDYVDDGTPVINPSNIVGDTFVPTVFVNEET 120 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWL 128 + S+ ++ G ++ G+ G R A++++ + F ++ + + + L Sbjct: 121 SARLSSFALAH-GDLVIGRRGEMGRSAVVSEAEAGWLCGTGCFYAKSGENHVFDYVALTL 179 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 + V ++ + G TM + + + +P+ +P EQ+ I K+ Sbjct: 180 KAPSVRAQLSSSSLGTTMQNLNQTTLRRLPLAVPSRREQLRIDAKLGQLKA 230 >gi|327474706|gb|EGF20111.1| hypothetical protein HMPREF9391_0220 [Streptococcus sanguinis SK408] Length = 274 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 51/156 (32%), Gaps = 9/156 (5%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 E R + + + E++ + + + + + Sbjct: 126 NGERRYVTESSYEFLKKSRLYGHEVIISNVADVGSVHRVPKMNMPMVAG-NNVVFLQSEN 184 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + YL S ++ SG +Q D + L + + +I + Sbjct: 185 SLLTDYLYVYFNSRLGQHDIMSITSGSAQQKFNKTDFRNLEIPILSDD-------IIKKK 237 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + I ++ I + I L + R++ + ++G+I + Sbjct: 238 ISSILHYIDNIHEEIACLMKIRATLLPKLLSGEISV 273 >gi|94266884|ref|ZP_01290541.1| type I restriction enzyme StySPI specificity protein [delta proteobacterium MLMS-1] gi|93452437|gb|EAT03045.1| type I restriction enzyme StySPI specificity protein [delta proteobacterium MLMS-1] Length = 117 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 12/66 (18%), Positives = 30/66 (45%) Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + ++ +K L + VPP +EQ +I ++ ++++ + + + R S + Sbjct: 16 GQANVNGSKLKALAIPVPPAEEQHEILTRMDEHFSKMNTVEGWCQAELTRSASLRQSVLK 75 Query: 413 AAVTGQ 418 A G+ Sbjct: 76 DAFAGR 81 >gi|218133862|ref|ZP_03462666.1| hypothetical protein BACPEC_01751 [Bacteroides pectinophilus ATCC 43243] gi|217991237|gb|EEC57243.1| hypothetical protein BACPEC_01751 [Bacteroides pectinophilus ATCC 43243] Length = 219 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 17/172 (9%), Positives = 49/172 (28%), Gaps = 15/172 (8%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYETYQIVDPGEIVFRFIDLQNDKR 303 N I + G + + + S + +++ ++ + + Sbjct: 56 NNEYWENGTISWVKSGEVHNNITLQTEEYITPLGLSESSTKLLPKDTVLMAMYGVTAGEV 115 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + A + + + + G + +L + Sbjct: 116 GYLAIE----ATTNQAICGMICNSKADAAYLYFSLIQSQAAISRLSNGGAQDNLSKNFID 171 Query: 364 RLPVLVPPIKEQFDITNVINVE-TARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + ++VP + I A I + + I LL+E +++ +A Sbjct: 172 NIKIVVPS-------SEFIEELNLAAIVEQMTLNTKEIALLEELQATALAQL 216 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 17/150 (11%), Positives = 42/150 (28%), Gaps = 9/150 (6%) Query: 21 IPKHWKVVPIKRFT-KLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQ 72 +P +++ + F + +G T + I ++ +V + + Sbjct: 30 LPDDFEIQTVSEFCRETKSGSTPSRTNNEYWENGTISWVKSGEVHNNITLQTEEYITPLG 89 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S+ + K +L G + + + + + I Sbjct: 90 LSESSTKLLPKDTVLMAMYGVTAGEVGYLAIEATTNQAICGMICNSKADA-AYLYFSLIQ 148 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIP 162 I + G + I NI + +P Sbjct: 149 SQAAISRLSNGGAQDNLSKNFIDNIKIVVP 178 >gi|305431924|ref|ZP_07401091.1| restriction modification enzyme [Campylobacter coli JV20] gi|304445008|gb|EFM37654.1| restriction modification enzyme [Campylobacter coli JV20] Length = 258 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 85/241 (35%), Gaps = 18/241 (7%) Query: 157 IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 + + + EQ+ E ++ ET + + + Q L+ + T + + Sbjct: 10 FNLLLSLIEEQISHSEVLVEETQ--SKERKQDFNAFKNFSKTIQELLQTLSTPPKDGWKR 67 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 + +++ L P E+ + + ++ + ++++ Sbjct: 68 ISLKNEQYIELNPSKKEISKLDENMLVS-----------FIEMASVSDKGYIQSKIDRSL 116 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI---ITSAYMAVKPHGIDSTYL 333 E + Y +I+ I + A+ + I T ++ G+DS++L Sbjct: 117 NEVRKGYTYFIENDILIAKITPCMENGKCAIAKNLTNNIGFGSTEFHIFRAKTGLDSSFL 176 Query: 334 AWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + ++ + G+ + + + L + +PP++ Q I I + +ID Sbjct: 177 FYNLNQQNIREKAALAMTGASGHKRVPISFYENLTIPLPPLEIQEKIVQNIELVEQQIDF 236 Query: 392 L 392 L Sbjct: 237 L 237 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 68/194 (35%), Gaps = 15/194 (7%) Query: 24 HWKVVPIKR--FTKLNTGRT----SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 WK + +K + +LN + + + +I + V S G K S Sbjct: 64 GWKRISLKNEQYIELNPSKKEISKLDENMLVSFIEMASV-SDKGYIQSKIDRSLNEVRKG 122 Query: 78 VSIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + F + IL K+ P + + G ST+F + + K L + L+ Sbjct: 123 YTYFIENDILIAKITPCMENGKCAIAKNLTNNIGFGSTEFHIFRAKTGLDSSFLFYNLNQ 182 Query: 132 DVTQRI--EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + A+ + N+ +P+PPL Q I + I +ID L + Sbjct: 183 QNIREKAALAMTGASGHKRVPISFYENLTIPLPPLEIQEKIVQNIELVEQQIDFLNLKLE 242 Query: 190 RFIELLKEKKQALV 203 + ++ Q + Sbjct: 243 LLEKEKEKILQKYL 256 >gi|296454641|ref|YP_003661784.1| restriction endonuclease S subunit [Bifidobacterium longum subsp. longum JDM301] gi|296184072|gb|ADH00954.1| Restriction endonuclease S subunit [Bifidobacterium longum subsp. longum JDM301] Length = 342 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/220 (11%), Positives = 68/220 (30%), Gaps = 17/220 (7%) Query: 200 QALVSYIVTKGLNPDVKMKDS------GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 ++ S V ++ K + G + + + + Sbjct: 122 CSIRSEYVIAFYTFKLQYKCNNSTPAWEQRKFGDCFEFLKSNTLSRAGLNDENGTARNVH 181 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQ---IVDPGEIVFRFIDLQNDKRSLRS--A 308 + + +G+ + + + ++ I+ G+++F Sbjct: 182 YGDILIKFGDCLDGERSDLPFITDDTVLPKFAGSILREGDVIFADTAEDEAAGKCVELRK 241 Query: 309 QVMERGIITSAYMAVKPHGIDST-YLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLP 366 E I + +P T YL + S + + G++ S+ ++ Sbjct: 242 LPKEPTISGLHTIPARPRFFFGTGYLGHYLNSDAYHRQLLPLMQGIKVISVSKAALQDTQ 301 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 V P + EQ I + + ID L+ ++ + +++R Sbjct: 302 VRFPGLSEQAAIGAAL----SEIDNLITLHQRKRLSIRQR 337 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 51/194 (26%), Gaps = 20/194 (10%) Query: 25 WKVVPIKRFTKLNTGRTSES-------------GKDIIYIGLEDVESGTGKYLPKDGNSR 71 W+ + T I I D G LP + Sbjct: 148 WEQRKFGDCFEFLKSNTLSRAGLNDENGTARNVHYGDILIKFGDCLDGERSDLPFITDDT 207 Query: 72 QSDTSTVSIFAKGQILYGKL------GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 SI +G +++ G + + I + +P+ Sbjct: 208 VLPKFAGSILREGDVIFADTAEDEAAGKCVELRKLPKEPTISGLHTIPARPRFFFGTGYL 267 Query: 126 -GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +L S +++ + +G + + + + P L+EQ I + I Sbjct: 268 GHYLNSDAYHRQLLPLMQGIKVISVSKAALQDTQVRFPGLSEQAAIGAALSEIDNLITLH 327 Query: 185 ITERIRFIELLKEK 198 +R+ + Sbjct: 328 QRKRLSIRQRSPVW 341 >gi|317178851|dbj|BAJ56639.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F30] Length = 164 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 49/149 (32%), Gaps = 9/149 (6%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + P++ + ++ I+ + L + + +++ K + Sbjct: 13 DSIQHITPKALKGKKLFPKNSIIISTTATIGEHALLIVDSLANQRFT---FLSKKANCNI 69 Query: 330 STYLAWLMRSYDLCKVFYAMGS--GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + + L + + S+ K+ +PP++ Q +I +++ + Sbjct: 70 ALDMKFFFYQCFLLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDQFST 129 Query: 388 RIDVLVEKIEQSIVLLKE----RRSSFIA 412 L+ I I K+ R ++ Sbjct: 130 LTTDLLAGIPAEIEARKKQYEYYREKLLS 158 Score = 40.5 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 39/131 (29%), Gaps = 4/131 (3%) Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL 112 ++D+ + +F K I+ A++ D + + +F Sbjct: 1 MDDIRENGRILKDSIQHITPKALKGKKLFPKNSIIISTTATIGEHALLI-VDSLANQRFT 59 Query: 113 VLQPKDVLP---ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 L K ++ + + + + + + D PIPPL Q Sbjct: 60 FLSKKANCNIALDMKFFFYQCFLLGEWCKKNTNVSGFASVDMTAFKKYKFPIPPLEIQQE 119 Query: 170 IREKIIAETVR 180 I + + + Sbjct: 120 IVKILDQFSTL 130 >gi|237650541|ref|ZP_04524793.1| type I restriction enzyme EcoBI specificity protein (S protein)(S.EcoBI) [Streptococcus pneumoniae CCRI 1974] gi|237822642|ref|ZP_04598487.1| type I restriction enzyme EcoBI specificity protein (S protein)(S.EcoBI) [Streptococcus pneumoniae CCRI 1974M2] Length = 307 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/319 (13%), Positives = 93/319 (29%), Gaps = 34/319 (10%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + ++ +G +S + + I + DVE G + Sbjct: 2 KKVKLGEVCEILSGYAFKSSQFNDNKIGLPLIRIRDVERGFSD------TYFEGTYPEEY 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ D + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKITDKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H I +I +P EQ LI +K+ I + R E E Sbjct: 115 KTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLN----TISQIYDFRKIQSEKFNELV 170 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 ++ + + G + D+ + + E L L Sbjct: 171 KSRFNEMF-------------GENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFL 217 Query: 260 SYGNIIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + N+ + + + + + ++ +IV + + Sbjct: 218 NTKNVTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLR 277 Query: 316 ITSAYMAVKPHGIDSTYLA 334 I S + ++P + + Sbjct: 278 INSGMVILRPKTPNLNHNW 296 Score = 44.0 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 38/137 (27%), Gaps = 6/137 (4%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + +Y ++ G+++ + + ++ +K Sbjct: 42 FSDTYFEGTYPEEYLIKNGDLLITMDGS-----FILKKWEGDLALLNQRVCKIKITDKSV 96 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 K + L +K + ++P EQ I +N + Sbjct: 97 DEGYISWLIPKFLKEIEDKTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLNTISQIY 156 Query: 390 DVLVEKIEQSIVLLKER 406 D + E+ L+K R Sbjct: 157 DFRKIQSEKFNELVKSR 173 >gi|320527411|ref|ZP_08028593.1| type I restriction modification DNA specificity domain protein [Solobacterium moorei F0204] gi|320132268|gb|EFW24816.1| type I restriction modification DNA specificity domain protein [Solobacterium moorei F0204] Length = 394 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 40/349 (11%), Positives = 110/349 (31%), Gaps = 48/349 (13%) Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV---------LQPKDVLPELLQG 126 + ++ G ++ + S Q +V + ++ Sbjct: 84 TKYALIQNGDLILADASEDRKDVGRPVEMLDISNQKIVSGLHTIHARNKTDLIVNGFKGF 143 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 + S + Q+I I G+ + + M IP EQ +KII ++I+ I Sbjct: 144 YFQSSAMKQQIFKIANGSKIYGISSSAFNELKMFIPEKQEQ----KKIIDLMIKIEERIQ 199 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + + I K + +++ +++K +V + Sbjct: 200 TQSKIISDYNSLKSGVYNWMFK------------------ENNVTFKLKQLAHIVKGVQI 241 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 N +L+ + + G + + + + + + Sbjct: 242 NNDQLLSNGAYYMMNGGTLPSGYLDSYNVSENTISISE------------GGNSCGYVQF 289 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + + G V P +++ YL ++ + + +G+GL +++ +D++ Sbjct: 290 NKERFWSGGHCYTIQNVNPLIVENKYLYHYLKHKEKEIMNLRIGTGL-PNIQKKDLENFT 348 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + VP + Q +D + + + + L++++ + Sbjct: 349 IFVPNLLIQRKNL----ALFEMLDEKICILNEELERLEKQKKYLLRNLF 393 >gi|317014845|gb|ADU82281.1| putative type I restriction enzyme [Helicobacter pylori Gambia94/24] Length = 182 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 17/150 (11%), Positives = 46/150 (30%), Gaps = 4/150 (2%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + +G P ++ + + ++ + G Sbjct: 25 HGRDYKNFKLGNIPVYGSGGYMLSINNFLHNGESVCIGRKGTIDKPIYLNGKFWVVDTLF 84 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + + ++ ++ + K + SL + + + +PP+ EQ I NV++ Sbjct: 85 YSYSFKKSIPKFIFYAFSIIKWSNYNEATGVPSLTKMTISNIEIPLPPLDEQAAIANVLS 144 Query: 384 VETA---RIDVLVEKIEQSIVLLKERRSSF 410 +D L+ + + K R Sbjct: 145 DVDRYLCSLDALI-LTRMLVSVSKSHRKGL 173 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 48/161 (29%), Gaps = 18/161 (11%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +PK W+ V + L GR + + + G G + + Sbjct: 8 LPKTWQKVRLGDILTLKHGRDYK-----------NFKLGNIPVYGSGGYMLSINN---FL 53 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + G+ G + + + T F K +P+ + I Sbjct: 54 HNGESVCIGRKGTIDKPIYLNGKFWVVDTLFYSYSFKKSIPKFIFYAFSIIK----WSNY 109 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 E + I NI +P+PPL EQ I + + Sbjct: 110 NEATGVPSLTKMTISNIEIPLPPLDEQAAIANVLSDVDRYL 150 >gi|238810194|dbj|BAH69984.1| hypothetical protein [Mycoplasma fermentans PG18] Length = 172 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 43/134 (32%), Gaps = 10/134 (7%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 +Y VD I+ + ++ +T + KP + + Sbjct: 38 ITYVNKWNVDEDAIIIGRVGAN----CGCVNITNKKSFVTDNALIFKPKEKNMARFYFYF 93 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + F+ + L + + + +P + + I+ +++ ID +E+ Sbjct: 94 LLHLNLNKFHI--GSSQPLLTQGILGNIKINIPSLNKCQKISKILD----NIDNQIERNN 147 Query: 398 QSIVLLKERRSSFI 411 + L+ + I Sbjct: 148 SMVQKLQSFEQALI 161 >gi|289168438|ref|YP_003446707.1| type I restriction-modification system specificity determinant [Streptococcus mitis B6] gi|288908005|emb|CBJ22845.1| type I restriction-modification system specificity determinant [Streptococcus mitis B6] Length = 185 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 45/122 (36%), Gaps = 2/122 (1%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + G+I+ I K + G + + ++ P + YL +++ Sbjct: 52 YNQGDILIGNIRPYLKKIWFSNQVGGTSGDVLTIQNSITPCMEN-KYLYYILSDDRFFYY 110 Query: 346 FYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 G + + + ++P I EQ I ++++ + L E + + I L + Sbjct: 111 NVQYSKGSKMPRGDKKAIMQYKFILPSITEQKRIVSILDNFNTLTNSLSEGLPKEIELRQ 170 Query: 405 ER 406 ++ Sbjct: 171 KQ 172 Score = 40.5 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 65/185 (35%), Gaps = 7/185 (3%) Query: 29 PIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + R S + Y+G++++ Q +T +G IL Sbjct: 2 KLGAVAEYSQKRISVTDLTPETYVGVDNLLQDRKGKAVATFLPDQGSVTTY---NQGDIL 58 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPK---DVLPELLQGWLLSIDVTQRIEAICEGA 144 G + PYL+K ++ G S L +Q + + L L +G+ Sbjct: 59 IGNIRPYLKKIWFSNQVGGTSGDVLTIQNSITPCMENKYLYYILSDDRFFYYNVQYSKGS 118 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 M D K I +P + EQ I + ++L + IEL +++ + Sbjct: 119 KMPRGDKKAIMQYKFILPSITEQKRIVSILDNFNTLTNSLSEGLPKEIELRQKQYEYWRE 178 Query: 205 YIVTK 209 ++ Sbjct: 179 QLLNF 183 >gi|309386128|gb|ADO66998.1| putative type I restriction-modification system specificity subunit [Enterococcus faecium] Length = 187 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 41/118 (34%), Gaps = 1/118 (0%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 S ++ +I+ K L E + + S Y+ Sbjct: 67 ISNSKLVDLRLEENDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDC 126 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K+ + G + ++ ++ +L + +PP++EQ +T I + I + Sbjct: 127 FLDSPLYWKLLEKISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIKMIRRSIRRI 184 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 7/164 (4%) Query: 27 VVPIKRFT-KLNTGRTSESGK--DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 V + + K+ G T + K ++ ++ + D++ G + + + Sbjct: 20 WVYLGSISTKIQYGYTDSAKKQGNVKFLRITDIQEGRVNWSSVPYCDISNSKLVDLRLEE 79 Query: 84 GQILYGKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 IL + G + K+ I++ S + + +L E + +L S + +E Sbjct: 80 NDILIARTGGTMGKSFLVKEISEESVFASYLIRIRLVEKLLSEYVDCFLDSPLYWKLLEK 139 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 I G + + + + +P+PPL EQ + KI I Sbjct: 140 ISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIKMIRRSIRR 183 >gi|291551221|emb|CBL27483.1| Restriction endonuclease S subunits [Ruminococcus torques L2-14] Length = 374 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 57/143 (39%), Gaps = 7/143 (4%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAVKPHGID--- 329 + Y+++ G+ + + D+R + + I++ AY + Sbjct: 42 NVIGTDLSKYKLITKGKFACNPMHVGRDERLPVALYDEEKPAIVSPAYFMFEVIDNSILK 101 Query: 330 STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL R + ++ + +R + ++D+ RL + +PPI+ Q +I N T R Sbjct: 102 EDYLMMWFRRPEFDRICWLHTDGSVRGGITWDDICRLELPIPPIENQLEIVNSYKAITER 161 Query: 389 IDVLVEKIEQSIVLL-KERRSSF 410 I L +KI ++ + S Sbjct: 162 I-ALKQKINDNLEATAQAYFDSL 183 >gi|167761885|ref|ZP_02434012.1| hypothetical protein BACSTE_00228 [Bacteroides stercoris ATCC 43183] gi|167700255|gb|EDS16834.1| hypothetical protein BACSTE_00228 [Bacteroides stercoris ATCC 43183] Length = 197 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 45/118 (38%), Gaps = 6/118 (5%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLR--SAQVMERGIITSAYMAVKPHGI--DSTYLA 334 + + + G++ D + A + I+ V P+ D YL Sbjct: 60 NEISKFKLKKGQVALTKDSETRDDIGIPTYIADDFDDAILGYHCALVTPNKDILDGRYLN 119 Query: 335 WLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L+ + K F A GSG R +L E + PV + P+ EQ I + + +I+ Sbjct: 120 ALLHTDYAKKYFACNASGSGQRYALSVEALNSFPVPIIPLHEQKQIGEIFSALDKKIE 177 >gi|237752768|ref|ZP_04583248.1| type I restriction-modification system [Helicobacter winghamensis ATCC BAA-430] gi|229376257|gb|EEO26348.1| type I restriction-modification system [Helicobacter winghamensis ATCC BAA-430] Length = 187 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 52/171 (30%), Gaps = 6/171 (3%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 I+ S GN Q + + + + +I+ + + Sbjct: 21 YGIPFYRSKEIIEFSKGNNPQNELFIDENKYNDIANKFGVPQANDILLTSVGTLGIPYLV 80 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKR 364 + + + S +L + +S + ++ +Q+L +K Sbjct: 81 PKDKKFYFKDGNLTWFKNFKNIT-SLFLFYWFKSPQGKEKLDSIAIGSTQQALTIAALKA 139 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + +P I ++N + I +E + I L+ R + A Sbjct: 140 VNIHLPHTD----IIRILNEQLNGIQNKIENNTKQIQNLQAMRDMLLKAIF 186 >gi|225550658|ref|ZP_03771607.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 2 str. ATCC 27814] gi|225379812|gb|EEH02174.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 2 str. ATCC 27814] Length = 355 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/348 (10%), Positives = 101/348 (29%), Gaps = 20/348 (5%) Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICST-QFLVLQPKDVLPELLQGWLLSI 131 + IL+ + + + + + +VL + Sbjct: 6 ISIENNKFIDEPAILFSSTATIGNVCYVEEKCWFNDQIKAFISKDSNVLNTKYLYYWFLN 65 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +G+ S K + N+ + +P + EQ I I Sbjct: 66 NKHIIKSQANKGSVFSSIGIKELVNMKINLPSIEEQNAIISIIEPHEKLFVKYSNLVDIS 125 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 +K + I+ +K+ I++ + ++ + + N K L Sbjct: 126 SVENAKKDVDNLISIIEPIEKVINNIKN--IKFKIESLVNKYFDFLYSNLEDSNFKKYIL 183 Query: 252 IESNILSLSYGNIIQKLETRNMGL-------KPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + ++ + +E+ K Y + F I Sbjct: 184 GDLFTINRGQIINSKYIESNIGSYPVISSNTKNNGVFGYINSYMYDGEFITISADGAYAG 243 Query: 305 LRSAQVMERGIITSAYMAVKPHGID----STYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 Q I ++ +K + ID + ++ ++ + + R +++ Sbjct: 244 TVFLQNGRFSITNVCFILIKNNDIDFKFSNKFVYYIFKKEQEVNKLKSQVGSSRPAVREY 303 Query: 361 DVKRLPVLVPPIKEQFDITNV------INVETARIDVLVEKIEQSIVL 402 +K + + +P I+ Q + + ++ + +I+ ++ I Sbjct: 304 SLKEIKINLPNIEIQEKFSKIVEPLLNLSTKANKIEKILNDSLLKITK 351 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 43/108 (39%), Gaps = 2/108 (1%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + S E + +D I+F + + I A+++ + +++ YL + Sbjct: 4 RYISIENNKFIDEPAILFSSTATIGNVCYVEEKCWFNDQI--KAFISKDSNVLNTKYLYY 61 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + A + S+ +++ + + +P I+EQ I ++I Sbjct: 62 WFLNNKHIIKSQANKGSVFSSIGIKELVNMKINLPSIEEQNAIISIIE 109 >gi|189463336|ref|ZP_03012121.1| hypothetical protein BACCOP_04053 [Bacteroides coprocola DSM 17136] gi|189429955|gb|EDU98939.1| hypothetical protein BACCOP_04053 [Bacteroides coprocola DSM 17136] Length = 185 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 62/171 (36%), Gaps = 5/171 (2%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +PD W + L+ + S S I +L I+ Sbjct: 19 QLPDGWTACRLEQVADILDNLRKPINSSERDSRIRNRQIDELYPYYGATGQVGLIDDYII 78 Query: 287 DPGEIVFRFIDL-QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + ++ DK ++++ + + + + + P +L S + Sbjct: 79 NGNYLLLGEDGAPFLDKNAIKAYSISGKSWVNNHAHILSPKID----FEFLQYSLNQIDY 134 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + R L D++ + +++PP+ EQ I + I + +++D+++E + Sbjct: 135 SEYVNGSTRLKLTQTDMRSIKIMLPPLAEQKRIKSKIQILFSQLDLMMESL 185 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 61/182 (33%), Gaps = 5/182 (2%) Query: 7 YPQYKDSGVQWIG---AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 Y +Y ++ + IG +P W +++ + + + + + Sbjct: 3 YNEYSNNIAERIGHYTQLPDGWTACRLEQVADILDNLRKPINSSERDSRIRNRQID--EL 60 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 P G + Q I +L G+ G I ++ + P++ Sbjct: 61 YPYYGATGQVGLIDDYIINGNYLLLGEDGAPFLDKNAIKAYSISGKSWVNNHAHILSPKI 120 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 +L G+T + +I + +PPLAEQ I+ KI ++D Sbjct: 121 DFEFLQYSLNQIDYSEYVNGSTRLKLTQTDMRSIKIMLPPLAEQKRIKSKIQILFSQLDL 180 Query: 184 LI 185 ++ Sbjct: 181 MM 182 >gi|321310232|ref|YP_004192561.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802076|emb|CBY92722.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 195 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 61/165 (36%), Gaps = 4/165 (2%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + L+ K+T +ES L GN + + L ++++++D + + + Sbjct: 13 ICKIHRGLSFKSTYYLESGTPVLKIGN-VDGGKVIKENLFYCDEKSHKVLDMHRVRYEDV 71 Query: 297 DLQNDKRSLRSAQVMER--GIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGL 353 + N + + A + I++S + P+ + ++ + + Sbjct: 72 VITNLAPAGKVAINLTNLEFILSSHVFKLDPNPEILDRRYLYYFLMNSPRQIEQMLTAAN 131 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + +++ +LVP ++ Q I ++ + L + Q Sbjct: 132 VVRIHMSSLEKFKILVPDLETQRSIVAKLDKFRELREELKMRKRQ 176 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 64/190 (33%), Gaps = 7/190 (3%) Query: 26 KVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVSI 80 K + K++ G + +S + + +V+ G K + + + Sbjct: 6 KECRLGEICKIHRGLSFKSTYYLESGTPVLKIGNVDGGKVIKENLFYCDEKSHKVLDMHR 65 Query: 81 FAKGQILYGKLGPYLRKAI-IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++ L P + AI + + + I S+ L P + + + ++ ++IE Sbjct: 66 VRYEDVVITNLAPAGKVAINLTNLEFILSSHVFKLDPNPEILDRRYLYYFLMNSPRQIEQ 125 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + A + + + +P L Q I K+ + + + R R + K Sbjct: 126 MLTAANVVRIHMSSLEKFKILVPDLETQRSIVAKLD-KFRELREELKMRKRQGVYYRNKI 184 Query: 200 QALVSYIVTK 209 + V Sbjct: 185 MGGLQECVFP 194 >gi|333011300|gb|EGK30714.1| type I restriction enzyme EcoAI specificity domain protein [Shigella flexneri K-272] Length = 377 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 66/192 (34%), Gaps = 15/192 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE + ++ + K+ S IL +I++ + G Sbjct: 93 SEEEKPFELPEGWEWVHLPDIYCSISESSRKIKSSEILPEGKYPVIEQSQEFISGYCNNE 152 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++ V F D + + + + P I + WL+RS Sbjct: 153 ---CLLIKLNNPVIVFGDHTRN----IKFIDFDFVVGADGVKILSPILICERFFFWLLRS 205 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + L YA F+ + +PPI EQ I ++ + D L ++ S Sbjct: 206 FKLDVRGYAR--------HFKVLNSCLFALPPIAEQERIVEKVSSLMSLCDQLEQQSLTS 257 Query: 400 IVLLKERRSSFI 411 + ++ + + Sbjct: 258 LDAHQQLVETLL 269 Score = 36.7 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 70/204 (34%), Gaps = 18/204 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESG 59 +K K P+ S + +P+ W+ V + ++ +I+ G V Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVHLPDIYCSISESSRKIKSSEILPEGKYPVIEQ 140 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + +++ N+ + I++ G + R DFD + V + Sbjct: 141 SQEFISGYCNNECL----LIKLNNPVIVF---GDHTRNIKFIDFDFVVGAD-GVKILSPI 192 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L + L + + + + +PP+AEQ I EK+ + Sbjct: 193 LICERFFFWLLRSFKLDVRGYARHFKV-------LNSCLFALPPIAEQERIVEKVSSLMS 245 Query: 180 RIDTLITERIRFIELLKEKKQALV 203 D L + + ++ ++ + L+ Sbjct: 246 LCDQLEQQSLTSLDAHQQLVETLL 269 >gi|269978324|gb|ACZ55896.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 330 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 40/372 (10%), Positives = 95/372 (25%), Gaps = 46/372 (12%) Query: 50 YIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS 108 +I D+ P+ + + + IL G +G + D + Sbjct: 2 FITPNDLHGTYRIIKTPRTLSDSGLKSIQNNTINNTSILVGCIGDVGMVRMCFDKCA-TN 60 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 Q + + + + + I + I + +P + Q Sbjct: 61 QQINSITDIKDFCNPYYLYYYLSNKKELFKNIAFSTVVPIIPKTIFQEIEVLLPNIETQQ 120 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + +I+ Sbjct: 121 KIARTLSILDQKIENNHKINELL------------------------------------- 143 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 H + + KN KL + I + +++ + + + P Sbjct: 144 --HTLAYKIYEYYFKYKPKNAKLEQIIIENPKSNIMVKNAQKTQDKYPFFTSGDNILSYP 201 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 I+ N + + + ++ + + S YL L+ S Sbjct: 202 KAIIDGRNCFLNTGGNAGIKFYVGKASYSTDTWCICANEF-SDYLYLLLSSIKNHINQSF 260 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + L+ +K+ P+ +P E N + L+ ++ L++ R Sbjct: 261 FQGTSLKHLQKNLLKKYPIYMPSAHEIKKF----NQIMMPLLTLISINTRTSKKLEQIRD 316 Query: 409 SFIAAAVTGQID 420 + +T Q+ Sbjct: 317 FLLPLLLTQQVK 328 >gi|198273534|ref|ZP_03206070.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 4 str. ATCC 27816] gi|198250054|gb|EDY74834.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma urealyticum serovar 4 str. ATCC 27816] Length = 356 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 47/383 (12%), Positives = 110/383 (28%), Gaps = 38/383 (9%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + ++ T ++ +I GL + N+ ++ I Sbjct: 6 KLSSVFEIITTGKQKNTFNINLEGLYPL------ISASTANNGIMGYVDNYLYDGQNITI 59 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGATMS 147 ++G + F++ + + ++ +LL ++ ++I +I G T Sbjct: 60 SRVGNAGTTFYHEGKISLTDNCFILSRINKKIAKVKYVFYLLKLNEDKKIRSISHGTTRK 119 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + N+ + +P + Q I I I I I L EK ++ + Sbjct: 120 IINKTDLDNLIIYLPSIEIQNAIISIIEPIEKSI-KTINLLQTKIGLFIEKTFNFINDNL 178 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + +KD GL I + N Sbjct: 179 VNSDLIEFSLKDLLNIKRGLP---------------------------ITAKDLLNNPGS 211 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + K Y + I + + + ++ Sbjct: 212 YPLISASSKNNGIFGYFNDYMYDGQNITISMNGNAGCIFYQIGKFSANSDVLVLSNSNKN 271 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + + + ++ R L +++ VL+P I+ Q + ++ Sbjct: 272 LTNIDYIYYLLKTKEKEIQNLAIGTTRFRLGNSVIEKFKVLLPNIEIQEKFSKIVEPLL- 330 Query: 388 RIDVLVEKIEQSIV--LLKERRS 408 + KIE+++ LLK + Sbjct: 331 NLSTKANKIEKNLNECLLKIVKK 353 >gi|261491602|ref|ZP_05988185.1| type I restriction-modification system specificity determinant [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261312728|gb|EEY13848.1| type I restriction-modification system specificity determinant [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 187 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 13/118 (11%), Positives = 35/118 (29%), Gaps = 5/118 (4%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 I + + + + + Y+ + S +F Sbjct: 66 TIAGSGAYAGFLMYWNEPIFLGDAFSVKPDLDILITKYVYHFLLSKQ-QWIFNLKKGSGV 124 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +D+ L + +PP++ Q I ++ T L ++ + R + + Sbjct: 125 PHVYPKDLAILEIPIPPLEIQQKIVKTLDKFTE----LEAELALRKKQYQYYRETLLT 178 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 51/155 (32%), Gaps = 12/155 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + P+ +L G+T I +D G + G + + + Sbjct: 16 EWKPLGEVAELKRGKT---------ITAKDKTEGNIPVIS--GGQKPAYYTGEYNREGET 64 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I G Y + + F V +P + + + Q I + +G+ Sbjct: 65 ITIAGSGAYAGFLMYWNEPIFLGDAFSV-KPDLDILITKYVYHFLLSKQQWIFNLKKGSG 123 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + H K + + +PIPPL Q I + + T Sbjct: 124 VPHVYPKDLAILEIPIPPLEIQQKIVKTLDKFTEL 158 >gi|307312926|ref|ZP_07592554.1| putative restriction modification system DNA specificity domain protein [Escherichia coli W] gi|306907094|gb|EFN37601.1| putative restriction modification system DNA specificity domain protein [Escherichia coli W] gi|315063605|gb|ADT77932.1| hypothetical protein ECW_m4660 [Escherichia coli W] gi|320200587|gb|EFW75173.1| hypothetical protein ECoL_02153 [Escherichia coli EC4100B] gi|323380314|gb|ADX52582.1| putative restriction modification system DNA specificity domain protein [Escherichia coli KO11] Length = 508 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/356 (11%), Positives = 96/356 (26%), Gaps = 20/356 (5%) Query: 46 KDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA-- 101 I YI + ++S + + T S +L + G ++ Sbjct: 64 DSIPYISGKVIKSFNIDLDECQRISLDSHKNELTKSALKPTDVLVIRKGDMGNACVVPSE 123 Query: 102 -DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMP 160 + S + P L +L + + + G + + +P+P Sbjct: 124 VNEANCSSEVIYLKMKASSDPYYLVSYLNCDQGQKAFKRLGRGTIIPGVSLLDVPRLPIP 183 Query: 161 IPPLAEQVLIREK------IIAETVRIDTLITERIRFIELLKEKKQALVS----YIVTKG 210 Q I +K + A + T + + + L + AL++ + Sbjct: 184 KVSEFVQKYIGDKVRQAEQLRAWAKLLRTSVDAHLNSLNLPINEPPALLNRVSAQTMEDR 243 Query: 211 LNPDVKMKDSG--IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI-IQK 267 L+P + + +P N L S I + NI Sbjct: 244 LDPRPYRTHYLCLVREIEKLPHDSISTLVELASGCPVSSNDFLENSGIPLVRIRNIGFDD 303 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + G+ + Y+ + + + + RS ++ + P Sbjct: 304 FIGLDTGVSQDVYQDATKYQAKDKMIV-VGMDGIFRSQFFISDELPMLVNQRVAMLSPQN 362 Query: 328 IDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 I L + + + D+ R+ + + + + + Sbjct: 363 IRGELLTHWLNRPEGQMQLNQWAVKTTVEHTSLSDIGRVLIPRLDKSLENKLADYL 418 >gi|256026505|ref|ZP_05440339.1| type I restriction-modification enzyme, S subunit [Fusobacterium sp. D11] gi|289764517|ref|ZP_06523895.1| predicted protein [Fusobacterium sp. D11] gi|289716072|gb|EFD80084.1| predicted protein [Fusobacterium sp. D11] Length = 231 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 63/191 (32%), Gaps = 9/191 (4%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G V + K K +I +++ + + Sbjct: 14 LGDVFNLQMGKTPLRENKLYWNKGKYNW-ISISDMNFSEKYLFSTKEKISDIAIKESGIK 72 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 ++ ++ F + + I+ A++ + ID +L + ++S + Sbjct: 73 LIPKNTVIMSFKLSIGKVKIVNEDIYSNEAIM--AFIPKENFFIDKNFLYYCLKSLKWNE 130 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 GL +L + + + +P + Q +I+N ++ I+ L+E + + LK Sbjct: 131 GINKAVKGL--TLNKNLIAQKEIFLPDLTIQKEISNNLDS----INNLLELRKNQLNYLK 184 Query: 405 ERRSSFIAAAV 415 E S Sbjct: 185 ELNKSLFTRVF 195 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 57/199 (28%), Gaps = 10/199 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVE--SGTGKYLPKDGNSRQS 73 WK V + L G+T +I + D+ + + Sbjct: 7 NEWKKVKLGDVFNLQMGKTPLRENKLYWNKGKYNWISISDMNFSEKYLFSTKEKISDIAI 66 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + + K ++ + K I + D + + PK+ + Sbjct: 67 KESGIKLIPKNTVIMS-FKLSIGKVKIVNEDIYSNEAIMAFIPKENFFIDKNFLYYCLKS 125 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 + E I + + I + +P L Q I + + ++ + E Sbjct: 126 LKWNEGINKAVKGLTLNKNLIAQKEIFLPDLTIQKEISNNLDSINNLLELRKNQLNYLKE 185 Query: 194 LLKEKKQALVSYIVTKGLN 212 L K + I++ N Sbjct: 186 LNKSLFTRVFGDILSNSFN 204 >gi|304436272|ref|ZP_07396256.1| conserved hypothetical protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304370734|gb|EFM24375.1| conserved hypothetical protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 203 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 48/142 (33%), Gaps = 7/142 (4%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + ++ +I+F + + + A + V I Y+ L Sbjct: 62 SRSVIKEKDILFTIAGTLGRFSFIDESLLPANTNQAVAIIRVNQAKIPPEYIYSLFIGNW 121 Query: 342 LCKVFYAM-GSGLRQSLKFEDVKRLPVL-VPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + ++ +L +K LP+ +P + I + I L++ Sbjct: 122 HNNYYVKHIQQAVQANLSLATIKSLPIPMLPDSDMKVYI-----KMVSPIISLMQSYACE 176 Query: 400 IVLLKERRSSFIAAAVTGQIDL 421 L+ R + + ++G++D+ Sbjct: 177 NSRLQTLRDTLLPRLMSGELDV 198 >gi|139438171|ref|ZP_01771724.1| Hypothetical protein COLAER_00712 [Collinsella aerofaciens ATCC 25986] gi|133776368|gb|EBA40188.1| Hypothetical protein COLAER_00712 [Collinsella aerofaciens ATCC 25986] Length = 188 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 51/193 (26%), Gaps = 23/193 (11%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 +P + K +G + K F T Sbjct: 12 FAGFTDPWEQRK------LGELGSVAMCKRIFKEQTTEQGDVPFYKIGTF-------GGT 58 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + L E YQ G+I+ + + ++ Sbjct: 59 PDAFISRELFDEYQRLYQFPKVGDILISAAGTIGRTIVYQGDPAYYQD--SNIVWLQHDE 116 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +D+ +L + ++ + L +D+ + +P EQ I + Sbjct: 117 RLDNGFLLQFLNGKSW----SSLEGSTLKRLYNKDLLNAEIAIPSPDEQHQIGS----TF 168 Query: 387 ARIDVLVEKIEQS 399 AR+D ++ ++ Sbjct: 169 ARLDDIITLHQRE 181 Score = 40.2 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 50/164 (30%), Gaps = 14/164 (8%) Query: 25 WKVVPIKRFTK------LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + + +T+E G D+ + + ++ ++ + + Sbjct: 19 WEQRKLGELGSVAMCKRIFKEQTTEQG-DVPFYKIGTFGGTPDAFISRELF---DEYQRL 74 Query: 79 SIFAK-GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 F K G IL G R + +V E L L + + Sbjct: 75 YQFPKVGDILISAAGTIGRTIVYQGDPAYYQDSNIVW---LQHDERLDNGFLLQFLNGKS 131 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + EG+T+ K + N + IP EQ I I Sbjct: 132 WSSLEGSTLKRLYNKDLLNAEIAIPSPDEQHQIGSTFARLDDII 175 >gi|323158214|gb|EFZ44306.1| Type I restriction modification DNA specificity domain protein [Escherichia coli E128010] Length = 245 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 64/222 (28%), Gaps = 18/222 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + +P+ + L G T K DI + ++D+ Sbjct: 17 EWLPLSKVFNLRNGYTPSKTKKEFWANGDIPWFRMDDIRENGRILGNSLQKISSCAVKGG 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---QGWLLSIDVTQ 135 +F + IL A+I + + +F L K+ + + + + Sbjct: 77 KLFPENSILISTSATIGEHALITVPH-LANQRFTCLALKESYADCFDIKFLFYYCFSLAE 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITER 188 ++ + D G +P P LA Q I + T L E Sbjct: 136 WCRKNTTMSSFASVDMDGFKKFLIPRPCPDNPEKSLAIQSEIVRILDKFTALTAELTAEL 195 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPD 230 + + L+S+ + + +++ G P Sbjct: 196 NMRKKQYNYYRDQLLSFDESSVEWKTLLEACDYVDYRGKTPK 237 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 23/206 (11%), Positives = 54/206 (26%), Gaps = 15/206 (7%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL---ETRNM 273 M +EW+ L +V T K +I +I + Sbjct: 11 MDGVEVEWLPLS----KVFNLRNGYTPSKTKKEFWANGDIPWFRMDDIRENGRILGNSLQ 66 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + + ++ I+ + + + + A D +L Sbjct: 67 KISSCAVKGGKLFPENSILISTSATIGEHALITVPHLANQRFTCLALKESYADCFDIKFL 126 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVET 386 + S S+ + K+ + P + Q +I +++ T Sbjct: 127 FYYCFSLA-EWCRKNTTMSSFASVDMDGFKKFLIPRPCPDNPEKSLAIQSEIVRILDKFT 185 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIA 412 A L ++ R ++ Sbjct: 186 ALTAELTAELNMRKKQYNYYRDQLLS 211 >gi|283954606|ref|ZP_06372124.1| hypothetical protein C414_000240009 [Campylobacter jejuni subsp. jejuni 414] gi|283793798|gb|EFC32549.1| hypothetical protein C414_000240009 [Campylobacter jejuni subsp. jejuni 414] Length = 476 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 57/422 (13%), Positives = 118/422 (27%), Gaps = 55/422 (13%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV-SIFAKGQILY 88 + K N+ + + ++ +Y+ + ++ S I K +L Sbjct: 57 LGDNMKFNSRYSQPKYDE-----TSKMKVINSQYIRNEYIDYENAKSGYGKIVPKESVLI 111 Query: 89 GKLG-PYLRKAIIA--DFDGICSTQF--LVLQPKDVLPELLQGWLLSIDV--TQRIEAIC 141 G L + I DFD + +V++ K L L Q I Sbjct: 112 NATGVGTLGRVFINILDFDFSIDSHINVIVVKNKTYLNPYFLTIFLQSYYGQIQIIRYYS 171 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK-- 199 + + +PI P+ Q+ I+ + ++ + E+L + Sbjct: 172 GTSGQIEIYPRDFNYFKIPILPIEFQLEIQNLVKDSHKALEESKELYKKAEEILYLELGL 231 Query: 200 ------QALVSYIVTK--------------------GLNPDVKMKDSGI-EWVGLVPDHW 232 Q+L+ + L+ + K I E + + + Sbjct: 232 DPKNPLQSLLDSKIDHSTKSLNISIRTLKESFLKTGRLDSEYYQKKYEINEKIIMNKKYT 291 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES-----YETYQIVD 287 + ++ + + I + N+ Q + E Y Sbjct: 292 VLDNLVSITKSIEPGSNLYKNKGIPFIRVANLTQYGLSEADVFLDEKDFFPQYLQILYPK 351 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 I+F ++ + + I YL + S + Sbjct: 352 KDTILFSKDGSIGVAYCVKEDKEVITSGAILHLNIKDKENILPEYLTLFLNSIFVKLQAQ 411 Query: 348 AMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET-------ARIDVLVEKIEQS 399 G + + ED+K++ V + IK Q I I +D K+E+ Sbjct: 412 RDCGGSIISHWRIEDIKKVLVAILDIKTQEKIAKYIQESFNLRKKSKQLLDNAKIKVEEQ 471 Query: 400 IV 401 I Sbjct: 472 IQ 473 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 61/171 (35%), Gaps = 6/171 (3%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFI 296 + + + N++ + S +I RN + E+ ++ IV ++ Sbjct: 55 EYLGDNMKFNSRYSQPKYDETSKMKVINSQYIRNEYIDYENAKSGYGKIVPKESVLINAT 114 Query: 297 DLQNDKRSLRSAQVMERGIIT--SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL- 353 + R + + I + + + ++ +L ++SY SG Sbjct: 115 GVGTLGRVFINILDFDFSIDSHINVIVVKNKTYLNPYFLTIFLQSYYGQIQIIRYYSGTS 174 Query: 354 -RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + D + + PI+ Q +I N++ ++ E +++ +L Sbjct: 175 GQIEIYPRDFNYFKIPILPIEFQLEIQNLVKDSHKALEESKELYKKAEEIL 225 >gi|261494962|ref|ZP_05991431.1| JHP726-like protein [Mannheimia haemolytica serotype A2 str. OVINE] gi|261309371|gb|EEY10605.1| JHP726-like protein [Mannheimia haemolytica serotype A2 str. OVINE] Length = 224 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 13/118 (11%), Positives = 35/118 (29%), Gaps = 5/118 (4%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 I + + + + + Y+ + S +F Sbjct: 66 TIAGSGAYAGFLMYWNEPIFLGDAFSVKPDLDILITKYVYHFLLSKQ-QWIFNLKKGSGV 124 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +D+ L + +PP++ Q I ++ T L ++ + R + + Sbjct: 125 PHVYPKDLAILEIPIPPLEIQQKIVKTLDKFTE----LEAELALRKKQYQYYRETLLT 178 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 51/155 (32%), Gaps = 12/155 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + P+ +L G+T I +D G + G + + + Sbjct: 16 EWKPLGEVAELKRGKT---------ITAKDKTEGNIPVIS--GGQKPAYYTGEYNREGET 64 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I G Y + + F V +P + + + Q I + +G+ Sbjct: 65 ITIAGSGAYAGFLMYWNEPIFLGDAFSV-KPDLDILITKYVYHFLLSKQQWIFNLKKGSG 123 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + H K + + +PIPPL Q I + + T Sbjct: 124 VPHVYPKDLAILEIPIPPLEIQQKIVKTLDKFTEL 158 >gi|207108192|ref|ZP_03242354.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori HPKX_438_CA4C1] Length = 191 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 16/116 (13%), Positives = 44/116 (37%), Gaps = 4/116 (3%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSG 352 I + + ++ +V P + YL +++ + + S Sbjct: 12 NTITIAQYGTAGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSA 71 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + S+ ++ ++ + +PP++ Q +I +++ T L ++ LK R+ Sbjct: 72 IPYSISSNNIMQITIPIPPLEIQQEIVKILDAFTELNTELNTELNTE---LKARKK 124 >gi|150006174|ref|YP_001300918.1| type I restriction endonuclease S subunit [Bacteroides vulgatus ATCC 8482] gi|149934598|gb|ABR41296.1| type I restriction endonuclease S subunit [Bacteroides vulgatus ATCC 8482] Length = 358 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 47/389 (12%), Positives = 116/389 (29%), Gaps = 62/389 (15%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ ++ + +G+ + G + TG S + Sbjct: 24 EWENTELQYIAPNICSGKDKPTSN-----GTVALYGSTGIIGMTRLASYNEEI------- 71 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 +L ++G + I + L++ K+ + + Sbjct: 72 ---VLVARVGANAGQLQITTIPCGVTDNTLIINAKEWNRYIYYYLQHYNL-----NRLVF 123 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G+ + + + +E+ I + RI T +L + L Sbjct: 124 GSGQPLITGSMLKKLKIIYGEESERNKIVNLLCLLDERIATQNKIIEDLKKLKSAISERL 183 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 +K S + + +V L +S + G Sbjct: 184 F-----------KSVKGSTV----------LLSDLCDIVKGKQINGENLSDSGNYYVMNG 222 Query: 263 NIIQKLETRNMGLKPESYETYQIVDP-GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 N ++ + + + G + F + + ++ Sbjct: 223 GTEPSGYYDNYNVEASTISISEGGNSCGYVQFNTSPFWSGGHCYSIQNIADK-------- 274 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +D+ YL ++S + + +GSGL +++ +D+ ++VP I+ Q I+ Sbjct: 275 ------VDNMYLYHYLKSNEDAIMKLRIGSGL-PNIQKKDLAMFKIIVPKIEWQIKISTF 327 Query: 382 INVET--ARIDVLVEK--IEQSIVLLKER 406 ++ A I+ ++ +Q + LL++ Sbjct: 328 LSSLERKAEIEERIQNVMQKQKLYLLQQM 356 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 14/97 (14%), Positives = 41/97 (42%), Gaps = 9/97 (9%) Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 G+ + + + Y+ + ++ Y+L ++ + + + +K+L ++ Sbjct: 92 GVTDNTLIINAKEW--NRYIYYYLQHYNLNRLVF---GSGQPLITGSMLKKLKIIYGEES 146 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 E+ I N++ +D + + I LK+ +S+ Sbjct: 147 ERNKIVNLLC----LLDERIATQNKIIEDLKKLKSAI 179 >gi|308063300|gb|ADO05187.1| Type I restriction/modification specificity protein [Helicobacter pylori Sat464] Length = 423 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 67/193 (34%), Gaps = 16/193 (8%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK-------PESYET 282 DH ++ ++ K TK E + ++ ++ K ++ Sbjct: 3 DHVKLSEVCEILNSNVDKKTKENEQKVKLCNFIDVYNNWAITKYTSKKFMTATATQNEIN 62 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWLMR 338 + G + + + + + Y ++ +L + Sbjct: 63 KFSLKKGYVAITKDSETKNDIGISTYIADNFDNVLLGYHCTLLKPNQKVLNGKFLNAYLS 122 Query: 339 SYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID---VLV 393 S+ K F A GSG R +L + +K L + + I+ Q I +++ +I+ + Sbjct: 123 SFYGRKYFSNCASGSGQRYTLTIDIIKDLTIPLINIETQQKIVRTLSILDQKIENNHKIN 182 Query: 394 EKIEQSIVLLKER 406 E + + + LL E+ Sbjct: 183 ELLHKILELLYEQ 195 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 56/416 (13%), Positives = 115/416 (27%), Gaps = 46/416 (11%) Query: 28 VPIKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGKYLPKDGNSRQS--DTSTV 78 V + ++ + K+ +I + + KY K + + + Sbjct: 5 VKLSEVCEILNSNVDKKTKENEQKVKLCNFIDVYN-NWAITKYTSKKFMTATATQNEINK 63 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDG------ICSTQFLVLQPKDVLPELLQGWLLSID 132 KG + K I+ + + +L+P + Sbjct: 64 FSLKKGYVAITKDSETKNDIGISTYIADNFDNVLLGYHCTLLKPNQKVLNGKFLNAYLSS 123 Query: 133 V---TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I ++ +P+ + Q I + +I+ Sbjct: 124 FYGRKYFSNCASGSGQRYTLTIDIIKDLTIPLINIETQQKIVRTLSILDQKIENNHKINE 183 Query: 190 RFIELLKEKKQALVSYI-VTKGLNPDVK-----MKDSGIEWVGLVPDHWEVKPFFALVTE 243 ++L+ + G N + MK S E L+P+ +EVK LV Sbjct: 184 LLHKILELLYEQYFVRFDFLDGNNKPYQTSGGKMKFS-KELNRLIPNDFEVKTLGELVDI 242 Query: 244 LNRKNTKLIESNILSLSYGNIIQK---------LETRNMGLKPESYETYQIVDPGEIVFR 294 + + + + Y I K T N+ P+ Y +++P I+ Sbjct: 243 FSGYSFQSNTYSNNKNDYILITNKNVQHSLVDLSITTNLLFLPKKLPKYCLLEPTNILIT 302 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMG-SG 352 + S + I+ V P + + L+R+ + Sbjct: 303 LTGHIGRCALVFS----KNCILNQRVGVVLPKEKELNPFYYSLIRNPLFSAILQRNAIGS 358 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +Q+L D ++ + I + I L+ QS L R Sbjct: 359 SQQNLSPIDTLKIQIPF-----NHKIIKQYSKTCENIIKLLVSNMQSTQTLTALRD 409 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 51/158 (32%), Gaps = 8/158 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKY-LPKDGNSRQS 73 IP ++V + + +G + + D I I ++V+ + + Sbjct: 227 IPNDFEVKTLGELVDIFSGYSFQSNTYSNNKNDYILITNKNVQHSLVDLSITTNLLFLPK 286 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV-LPELLQGWLLSID 132 + IL G R A++ + I + + V+ PK+ L + + Sbjct: 287 KLPKYCLLEPTNILITLTGHIGRCALVFSKNCILNQRVGVVLPKEKELNPFYYSLIRNPL 346 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI 170 + ++ G++ + I +P + Sbjct: 347 FSAILQRNAIGSSQQNLSPIDTLKIQIPFNHKIIKQYS 384 >gi|291561051|emb|CBL39851.1| Restriction endonuclease S subunits [butyrate-producing bacterium SSC/2] Length = 393 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 48/345 (13%), Positives = 104/345 (30%), Gaps = 25/345 (7%) Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR-----KAIIADFDG 105 I + + GKY N Q D IF +L + G A Sbjct: 21 IPITASDRKEGKYPYYGANGIQ-DYVNDYIFDDELVLLAEDGGNFGSKEKPIAYRVSGKC 79 Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 + VL+PK+ + + L +++ + GAT + + +P+ + Sbjct: 80 WVNNHAHVLKPKEEIDVDYLCYSLMFY---KVDGMINGATRKKLTQTAMKKMKIPLRNIV 136 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 EQ I +++ +I + + + + LL QA + + D K + ++ + Sbjct: 137 EQKKIVQQLN----KIIEIREKAKKELNLLDNLIQARFVELFGDAVYNDKKWETDTVKNL 192 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ---KLETRNMGLKPESYET 282 + +I +S ++ K + T Sbjct: 193 CKEIYGGGTPSKAHP--------EYYKDGDIPWVSAKDMKTDVLKDSQIKINQLGVDNST 244 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 ++V ++ K +L A + P T + Sbjct: 245 ARLVPVNSVIMVIRSGIL-KHTLPVAVNKVPITVNQDLKVFIPGERILTRFLAVQFKMQE 303 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + + +++F +K+ ++VPPI Q + Sbjct: 304 KDILSGVRAVTADNIEFNSLKQRRMIVPPIDLQQKYLMFLERIDK 348 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 52/161 (32%), Gaps = 6/161 (3%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 L I + K + Y D ++ K + Sbjct: 14 EILDSMRIPITASDRKEGKYPYYGANGIQDYVNDYIFDDELVLLAEDGGNFGSKEKPIAY 73 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 +V + + + +KP + +L S KV + R+ L +K++ + Sbjct: 74 RVSGKCWVNNHAHVLKPKEEI--DVDYLCYSLMFYKVDGMINGATRKKLTQTAMKKMKIP 131 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + I EQ I +N +I + EK ++ + LL + Sbjct: 132 LRNIVEQKKIVQQLN----KIIEIREKAKKELNLLDNLIQA 168 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 51/187 (27%), Gaps = 12/187 (6%) Query: 25 WKVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ +K ++ G T DI ++ +D+++ K N D S Sbjct: 184 WETDTVKNLCKEIYGGGTPSKAHPEYYKDGDIPWVSAKDMKTDVLKDSQIKINQLGVDNS 243 Query: 77 TVSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 T + ++ L+ + + V P + + + Sbjct: 244 TARLVPVNSVIMVIRSGILKHTLPVAVNKVPITVNQDLKVFIPGERILTRFLAVQFKMQE 303 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 I + T + ++ + M +PP+ Q + + Sbjct: 304 KD-ILSGVRAVTADNIEFNSLKQRRMIVPPIDLQQKYLMFLERIDKSKFVIHKFLYCTTH 362 Query: 194 LLKEKKQ 200 K + Sbjct: 363 NTKSIIK 369 >gi|307945077|ref|ZP_07660413.1| type I restriction-modification system specificity subunit [Roseibium sp. TrichSKD4] gi|307770950|gb|EFO30175.1| type I restriction-modification system specificity subunit [Roseibium sp. TrichSKD4] Length = 357 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 19/127 (14%), Positives = 43/127 (33%), Gaps = 11/127 (8%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 S+ V I+ + + + +G + ++ + ++ Sbjct: 47 SWHNEAKVQGPGIIIGRKGTLGS----VHYSDGDYWPHDTTLWSKSLNGNNPRFVYFALK 102 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 L + G +L + LP+ +P Q I ++++ D L+E + Sbjct: 103 CLGLERF---NVGGANPTLNRNHIHGLPIHLPERDAQDRIVSILST----YDDLIENNRR 155 Query: 399 SIVLLKE 405 I LL+E Sbjct: 156 RIALLEE 162 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 44/399 (11%), Positives = 90/399 (22%), Gaps = 49/399 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 KHW ++ L G + G+ + S + + Sbjct: 8 KHWAPAVLQDLVFLQRGFDITKA-----------QQKKGEVPVFSSSGLSSWHNEAKVQG 56 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G I+ G+ G T P + L + + E Sbjct: 57 PG-IIIGRKGTLGSVHYSDGDYWPHDTTLWSKSLNGNNPRFVYFALKCLGL----ERFNV 111 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + I +P+ +P Q I + I+ E + + Sbjct: 112 GGANPTLNRNHIHGLPIHLPERDAQDRIVSILSTYDDLIENNRRRIALLEEAARLLYREW 171 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 ++ G + + + L K +E Sbjct: 172 FVHLRFPG-----HEHIPITDGLPEGWERRTFGKVAELKYGKALKKENRVEGPFPVYGSS 226 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 I+ +V + K ++ S + Sbjct: 227 GIVG-------------------THQKALVEGPTIIIGRKGNVGSVFWSPADFWPIDTVY 267 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P +L + S L + ++ P KE+ Sbjct: 268 FIPKDQADFWLYLALPSAGFQN-----TDAGVPGLNRDFAYSRKLVQP--KERLR--RHF 318 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 N + ++E L + R + + G+I + Sbjct: 319 NEAVEPMFAQRARLEAYNEKLSQARDLLLPRLMNGEITV 357 Score = 45.2 bits (105), Expect = 0.023, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 35/115 (30%), Gaps = 14/115 (12%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W+ + +L G+ + + G + P G+S T ++ Sbjct: 189 LPEGWERRTFGKVAELKYGKALKKENRV-----------EGPF-PVYGSSGIVGTHQKAL 236 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 I+ G+ G T + + PKD L L S Sbjct: 237 VEGPTIIIGRKGNVGSVFWSPADFWPIDTVYFI--PKDQADFWLYLALPSAGFQN 289 >gi|300214619|gb|ADJ79035.1| Type I restriction-modification system specificity subunit [Lactobacillus salivarius CECT 5713] Length = 143 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 15/104 (14%), Positives = 34/104 (32%), Gaps = 7/104 (6%) Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + + D ++ L + + + S SL + + VP Sbjct: 31 PFWTVDTLFYCTSKENSDVKFIYLLFQIINWKRYDE---STGVPSLSKNTISNIKTYVPK 87 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 IKEQ + I+ +D ++ E+ L + + + Sbjct: 88 IKEQ----DYISKLFFSLDNTLQLHERKYEELTLIKKALLQKLF 127 >gi|321310234|ref|YP_004192563.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802078|emb|CBY92724.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 185 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 59/182 (32%), Gaps = 8/182 (4%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM--GLKPESYETYQIVDPGEIVFR 294 + +N K++ + I L + L + + E +V G+IV Sbjct: 9 ICKVYVGVNFKDSDYKKFGIPVLKASGVNDGLTSEEVAFYCSSEKAFNESLVSFGDIVVT 68 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + + + I S + ++ + SG Sbjct: 69 GGASSGKVGI--NLTDINYLPTSKIFKLEPDPSIVSKKYLYYFLLNSSREINSHITSGNA 126 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 +L + ++ VLVP ++ Q I ++ + L + Q + R+ +++ Sbjct: 127 TNLYKSSLLKIRVLVPDLETQDRIVRYLDKFRELREELRMRKSQGVY----YRNKIMSSL 182 Query: 415 VT 416 +T Sbjct: 183 LT 184 >gi|158337895|ref|YP_001519071.1| hypothetical protein AM1_4782 [Acaryochloris marina MBIC11017] gi|158308136|gb|ABW29753.1| conserved hypothetical protein [Acaryochloris marina MBIC11017] Length = 133 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 45/136 (33%), Gaps = 8/136 (5%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG-KDIIYIGLEDVESG 59 M + + Y+DS V W+ +P HW+V + + + + + K ++ + + Sbjct: 1 MLTFPKHETYQDSQVSWLNEVPNHWRVELGRNYLRPKNVKNIGNHVKTVLSLSYGKIV-- 58 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQ 115 K K T I G I+ + + GI ++ +L L Sbjct: 59 -IKPKEKLHGLVPESFETYQIVEPGDIIVRATDLQNDRTSLRIGLVQDHGIITSAYLCLS 117 Query: 116 PKDVLPELLQGWLLSI 131 P + + Sbjct: 118 PSKQIDPRFTYMHMIC 133 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 51/124 (41%), Positives = 71/124 (57%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 +DS + W+ VP+HW V+ + N KN +LSLSYG I+ K + + Sbjct: 6 KHETYQDSQVSWLNEVPNHWRVELGRNYLRPKNVKNIGNHVKTVLSLSYGKIVIKPKEKL 65 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 GL PES+ETYQIV+PG+I+ R DLQND+ SLR V + GIITSAY+ + P Sbjct: 66 HGLVPESFETYQIVEPGDIIVRATDLQNDRTSLRIGLVQDHGIITSAYLCLSPSKQIDPR 125 Query: 333 LAWL 336 ++ Sbjct: 126 FTYM 129 >gi|187476872|ref|YP_784896.1| type I restriction-modification system specificity determinant (partial) [Bordetella avium 197N] gi|115421458|emb|CAJ47964.1| putative type I restriction-modification system specificity determinant (partial) [Bordetella avium 197N] Length = 48 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 21/43 (48%), Positives = 30/43 (69%) Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + ++ E A++D L E++I LL RRS+ IAAAVTG ID+R Sbjct: 3 SFLDREIAKLDKLKPDSERAIALLAARRSALIAAAVTGHIDVR 45 >gi|188527246|ref|YP_001909933.1| hypothetical protein HPSH_02265 [Helicobacter pylori Shi470] gi|188143486|gb|ACD47903.1| hypothetical protein HPSH_02265 [Helicobacter pylori Shi470] Length = 371 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 53/349 (15%), Positives = 111/349 (31%), Gaps = 24/349 (6%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 ++ + L T +S + YI ++ ++ G K+ N Q + F K + Sbjct: 3 KTLQDYATLIND-TIQSNEINHYITTANMCQNLGGIDTFKNINIPQGKVRS---FQKDDV 58 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 L + P+ R+ +A G CS+ LV + K + L L S + G+ Sbjct: 59 LLSNIDPWHRQVYMAKQKGGCSSDVLVFRAKHIDSATLFAILSSQSFINYLCLGSVGSKR 118 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAE--TVRIDTLITERIRFIELLKEKKQALVS 204 D + + +P + +I+ ++ + + + + + Sbjct: 119 KRGDKTHMMDFKIPTINFTIAKIFNSIQNKIENNHKINEILHKILELLYEQYFVRFDFLD 178 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 KMK S E L+P+ +EVK L S Sbjct: 179 ENNKPYQTSGGKMKFS-KELNRLIPNDFEVKTLGELTQLKVGNKNANHS------SNQGK 231 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 N L+ E+Y+ + I+ +R + S Sbjct: 232 YPFFTCSNNPLRCETYQ----FEGKHIIISGNGNFYVTHYDGKFDAYQRTYVVS------ 281 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 P+ + L +L + + + + D++ + +++P +K Sbjct: 282 PNNPNHYVLIYLFVKSYTNYLKLQSRGSIIKFITKSDIEDIKIVLPNLK 330 >gi|315586429|gb|ADU40810.1| type I restriction-modification enzyme, S subunit [Helicobacter pylori 35A] Length = 368 Score = 58.7 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 48/382 (12%), Positives = 113/382 (29%), Gaps = 53/382 (13%) Query: 43 ESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 ++ K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 24 DNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSLNSIIYSSVRPNQRHFGII 83 Query: 102 DF---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKG 153 + + ST F+V+ + + P L ++ +T ++ I C ++ Sbjct: 84 KEIPKNFLVSTAFIVIDIIDLEKLDPNYLYYYITQDKITHYLQRIAECGTSSYPSITPLD 143 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 NI + + PL Q I + +I+ Sbjct: 144 FLNIKIKLYPLETQQKIARTLSILDKKIENNHKINELL---------------------- 181 Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 H + + KN KL + I + +++ + Sbjct: 182 -----------------HTLAYKIYEYYFKYKPKNAKLEQIIIENPKSSIMVKNAQKTQD 224 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + P I+ N + + + ++ + + S YL Sbjct: 225 KYPFFTSGDNILSYPQAIIDGRNCFLNTGGNAGIKFYVGKASYSTDTWCICANEF-SDYL 283 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 L+ S + L+ +K+ P+ +P E +I L+ Sbjct: 284 YLLLSSIKTHINQSFFQGTSLKHLQKNLLKKYPIYMPSAHEIKKFNQIIMPLL----TLI 339 Query: 394 EKIEQSIVLLKERRSSFIAAAV 415 ++ L++ R + + Sbjct: 340 SINTRTSKKLEQIRDFLLPLLL 361 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 18/160 (11%), Positives = 60/160 (37%), Gaps = 10/160 (6%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + + I++ + Sbjct: 18 NNYTKEDNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSL---NSIIYSSVR 74 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 + ++ + ++++A++ + +D YL + + + + G+ Sbjct: 75 PNQRHFGIIK-EIPKNFLVSTAFIVIDIIDLEKLDPNYLYYYITQDKITHYLQRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 S+ D + + + P++ Q I +++ +I+ Sbjct: 134 SSYPSITPLDFLNIKIKLYPLETQQKIARTLSILDKKIEN 173 >gi|300214621|gb|ADJ79037.1| Type I restriction-modification system specificity subunit [Lactobacillus salivarius CECT 5713] Length = 352 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 44/170 (25%), Gaps = 11/170 (6%) Query: 247 KNTKLIESNILSLSYGNIIQKLE-TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K+ I G K + E + Y G ++ Sbjct: 7 KDETSTIGEIPFYKIGTFGGKADAFITRKKYEEYKKKYPYPQKGNLLISASGSIGRII-- 64 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 E + + D+T L ++ + + + L +++ Sbjct: 65 --EYNGEEAYYQDSNIVWL--DHDNTILDVFLKPTYEIIKWDGIEGTTIKRLYNKNILNT 120 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + P I EQ I ++ ++ E+ L + + + Sbjct: 121 VIYKPTIDEQRKIG----KLFIILNNTIQLHERKYEELTLIKKALLQKLF 166 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 44/356 (12%), Positives = 97/356 (27%), Gaps = 32/356 (8%) Query: 43 ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 + +I + + ++ + KG +L G R Sbjct: 11 STIGEIPFYKIGTFGGKADAFITRKKYEEYKKKYPYP--QKGNLLISASGSIGRIIEYNG 68 Query: 103 FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP 162 + +V + L EG T+ K I N + P Sbjct: 69 EEAYYQDSNIVW---LDHDNTILDVFLKPTYEIIKWDGIEGTTIKRLYNKNILNTVIYKP 125 Query: 163 PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI 222 + EQ I + I I + L+ + + L P + Sbjct: 126 TIDEQRKIGKLFIILNNTIQLHERKYEELT---------LIKKALLQKLFPKKDXFKPEV 176 Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 + D WE + ++ ++ + GN G + + Sbjct: 177 RYKNFX-DAWEQRKLGEVIISEHKGK------VKSIMKGGNTNYLETNYLNGGTAQKVDA 229 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 V +++ + + G + S A P S + + + Sbjct: 230 IADVSKDDVLILWDGS-----KAGTIYHGFEGALGSTLKAYVPKY--SGDFLYQILKKNQ 282 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 K++ + + + ++ V +P I EQ +I + ++D L+ ++ Sbjct: 283 DKIYQSYRTPNIPHVIKNFTEKFNVSIPTIIEQQEIGDF----FKQLDSLIALHQR 334 Score = 37.1 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 49/182 (26%), Gaps = 18/182 (9%) Query: 25 WKVVPIKR-FTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ + + G+ G + Y+ + GT + + + Sbjct: 185 WEQRKLGEVIISEHKGKVKSIMKGGNTNYLETNYLNGGTAQKVDAIAD-----------V 233 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +K +L G F+G + PK + + +I Sbjct: 234 SKDDVLILWDGSKAGTI-YHGFEGALGSTLKAYVPKY---SGDFLYQILKKNQDKIYQSY 289 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + H + IP + EQ I + I + + +L K Q Sbjct: 290 RTPNIPHVIKNFTEKFNVSIPTIIEQQEIGDFFKQLDSLIALHQRKLEKLKQLKKFLLQN 349 Query: 202 LV 203 + Sbjct: 350 MF 351 >gi|258646664|ref|ZP_05734133.1| putative type I restriction enzyme [Dialister invisus DSM 15470] gi|260404085|gb|EEW97632.1| putative type I restriction enzyme [Dialister invisus DSM 15470] Length = 420 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 53/388 (13%), Positives = 109/388 (28%), Gaps = 26/388 (6%) Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQ 110 I + E + + + S IL K+ I+ D + Sbjct: 44 IRTLNFERQDFRDELLYVDEDAYNFLEKSKVLPNDILMNKIANPGSVYIMPDLGCPVTCG 103 Query: 111 FLVLQPKDVLP-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 + + + +V I++ G T + I + Q Sbjct: 104 MNLFLIRFNNQVNQRYMYYNMKNVEPYIKSFSHGTTTKTITKDDVRGIEVYFHSKPMQDS 163 Query: 170 IREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD---VKMKDSGIEWVG 226 I + ID I I L + L Y + PD K G E++ Sbjct: 164 IANFL----TLIDDKIQNNKNIIYTLSRTIKLLYDYWFIQFDFPDKDGKPYKSHGGEFIY 219 Query: 227 ------LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 +P+ W +T ++ N L N + ++ E Y Sbjct: 220 SSLLKRNIPEGWTELSLGKRLTFERG--VEIGSDNYLVEKQENSAPFIRVSDLNGSSEIY 277 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRS 339 ++D + + I + D + + G T + L + ++ S Sbjct: 278 AKMDLLDGKLLAPQDICVSLDGTVGKVDYALYGGYSTGIRKVYDEKAEINNSLIFAILTS 337 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + V +G E V + + + +I + + K++ Sbjct: 338 DYIQYVIEKYATGSNILHASEAVNHMDIPY-SKEVYGQFQKLITPMFEK----MIKVKLE 392 Query: 400 IVLLKERRSSFIAAAVTGQI----DLRG 423 L+ ++ + + GQ+ D+R Sbjct: 393 NEKLQNYKNLILPMLMNGQVIFGEDIRD 420 >gi|265763430|ref|ZP_06091998.1| HsdS [Bacteroides sp. 2_1_16] gi|263256038|gb|EEZ27384.1| HsdS [Bacteroides sp. 2_1_16] Length = 424 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 46/388 (11%), Positives = 104/388 (26%), Gaps = 57/388 (14%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTG--------KYLPKDGNSRQS 73 + W++ + + + + + + ++ G + + + Sbjct: 52 EEWEICKVSELLDFYSTNSLSWEQLEYGTKAIMNLHYGLIHVGLPTMVDLTRDNLPNIKE 111 Query: 74 DT--STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL---------QPKDVLPE 122 D + +G + + + + + +V + Sbjct: 112 DNMPKNFELCKEGDVAFADASEDTNEVAKPIEFFDLAGKNIVCGLHTIHGRDNKNKTVIG 171 Query: 123 LLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 S +I I +G + K + IP EQ I + RI Sbjct: 172 FKGYAFSSSAFHNQIRRIAQGTKIYSISTKNFSECFIGIPSKVEQTKIATLLRLIDERIA 231 Query: 183 TLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 T + EK Q+L+ KGL + G ++ + + Sbjct: 232 TQ--------NKIIEKLQSLI-----KGLRVCCMQRVYG--------NNVYLSEIAQIYQ 270 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 +T+L E L II K + N + I + + Sbjct: 271 PQTISSTELTEDGFLVYGANGIIGKYKDYNHETEQI----------------CITCRGNT 314 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + + I +A + D +L + + + + Sbjct: 315 CGMVNYTKPMSWITGNAMVINTDKYQDKVCKRYLYHYLSAYNFNSIISGSGQPQIVRTPL 374 Query: 363 KRLPVLVPPIKEQFDITNVINVETARID 390 ++L + +P I EQ + + +ID Sbjct: 375 EKLKITLPTISEQKQKAIIFDKIQDKID 402 >gi|332673347|gb|AEE70164.1| type I restrictionenzyme [Helicobacter pylori 83] Length = 169 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 12/95 (12%), Positives = 37/95 (38%), Gaps = 1/95 (1%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSG 352 I + + ++ +V P + YL +++ + + S Sbjct: 65 NTITIAQYGTAGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSA 124 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + S+ ++ ++ + +PP++ Q +I +++ T Sbjct: 125 ILYSISSNNIMQIKIPIPPLEIQQEIVKILDAFTE 159 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 45/162 (27%), Gaps = 11/162 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + ++ G+ + + GKY G Sbjct: 13 PKGVEFRKLGEVCEIIRGKRVTKKEIL----------DKGKYPVVSGGIGFMGYLNEYNR 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + G + + + PK+ L ++L+ Sbjct: 63 EENTITIAQYGT-AGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISN 121 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 A + I I +PIPPL Q I + + A T Sbjct: 122 RSAILYSISSNNIMQIKIPIPPLEIQQEIVKILDAFTELNTE 163 >gi|257458426|ref|ZP_05623567.1| type I restriction-modification system, S subunit [Treponema vincentii ATCC 35580] gi|257444174|gb|EEV19276.1| type I restriction-modification system, S subunit [Treponema vincentii ATCC 35580] Length = 185 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 48/171 (28%), Gaps = 9/171 (5%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD--- 287 + + E ++S N + N+ + + + Sbjct: 18 GRICDKLIDGDHNPPKGIEEKTEYIMVSSRNINYNTVADLENVRYLTKEMFEAENLRTNA 77 Query: 288 -PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G+I+F + I +++ I + YL + S Sbjct: 78 TAGDILFTSVGSLGR----SCIYDGSLNICFQRSVSILKTAIYNKYLKFFFDSKFYQNYV 133 Query: 347 YAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +G + +++ + +PPI EQ I I +D + + Sbjct: 134 VEHATGTAQTGFYLQEMAESFIAIPPILEQKRIAAKIEELFNALDKIQNNL 184 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 56/170 (32%), Gaps = 9/170 (5%) Query: 23 KHWKVVPIKRFT-KLNTG-----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 K W+ + R KL G + E + I + ++ T L + Sbjct: 10 KSWQWTKLGRICDKLIDGDHNPPKGIEEKTEYIMVSSRNINYNTVADLENVRYLTKEMFE 69 Query: 77 TVSI---FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 ++ G IL+ +G R I IC + + + + + L+ + S Sbjct: 70 AENLRTNATAGDILFTSVGSLGRSCIYDGSLNICFQRSVSILKTAIYNKYLKFFFDSKFY 129 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + G + + + + IPP+ EQ I KI +D Sbjct: 130 QNYVVEHATGTAQTGFYLQEMAESFIAIPPILEQKRIAAKIEELFNALDK 179 >gi|303260413|ref|ZP_07346382.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP-BS293] gi|302638448|gb|EFL68914.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP-BS293] Length = 357 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/368 (9%), Positives = 103/368 (27%), Gaps = 27/368 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + ++P + +++ + + L +K++ + +PP+ Q + Sbjct: 282 MVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFA 340 Query: 380 NVINVETA 387 + + Sbjct: 341 DFVAQVDK 348 >gi|220911868|ref|YP_002487177.1| hypothetical protein Achl_1095 [Arthrobacter chlorophenolicus A6] gi|219858746|gb|ACL39088.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 401 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 41/422 (9%), Positives = 116/422 (27%), Gaps = 55/422 (13%) Query: 27 VVPIKRFTKLNTGRTSES---GKDIIY---IGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 P+ + ++ G D + + + G G + + + Sbjct: 6 TRPLSSYIRIKHGFAFPGTGFSDDPSFPTLVTPGNFAIGGG-FKGTKTKTYSGEYPPEYK 64 Query: 81 FAKGQILYGKLG------PYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWL-LSID 132 + G ++ AI+ + + + + + LV + + + Sbjct: 65 LSPGDLMVSMTDLSKEGDTLGLPAIVPEGNFLHNQRIGLVEIIDPNVDSRFLSYFLRTDS 124 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 I A G+T+ H IG +P L Q I E + A +I Sbjct: 125 YRAHILATASGSTVRHTSPSRIGAFETCLPSLNAQRSIAEVLGALDDKIAANTRISAISS 184 Query: 193 ELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLI 252 +L + + ++ ++ ++ G + W +A ++ + ++ Sbjct: 185 DLAGLLYDREAARVESQPMSKVLRPILGGTPARSKGEEFWGGARLWASAKDITGADFGVV 244 Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + + + G ++ L Sbjct: 245 T--------------DTAEKITDRAVDTTKAKALPSGSVILTARGTVGTVGRLAV----- 285 Query: 313 RGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + P + + L + ++R+ + K + ++ + L V Sbjct: 286 PASFNQSCYGFVPGLVPAAVLYFGVLRATERAKEI--AHGSVFDTITMKTFDHLSVP--- 340 Query: 372 IKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKERRSSFIAAAVTGQIDLRGE 424 + + E A + + + L R + + ++G++ ++ Sbjct: 341 --------DFNSTELATTEAILGPLMDSITAAVVQNSTLAATRDALLPQLMSGKLRVKDA 392 Query: 425 SQ 426 + Sbjct: 393 EK 394 >gi|149025497|ref|ZP_01836433.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP23-BS72] gi|147929447|gb|EDK80443.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP23-BS72] Length = 166 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 4/69 (5%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSS 409 ++L + V + + +PP+ EQ I I ++D E + L KE + S Sbjct: 1 MKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVDEYAESYNRLEQLDKEFPDKLKKS 60 Query: 410 FIAAAVTGQ 418 + A+ G+ Sbjct: 61 ILQYAMQGK 69 >gi|139438173|ref|ZP_01771726.1| Hypothetical protein COLAER_00714 [Collinsella aerofaciens ATCC 25986] gi|133776370|gb|EBA40190.1| Hypothetical protein COLAER_00714 [Collinsella aerofaciens ATCC 25986] Length = 226 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 18/182 (9%), Positives = 54/182 (29%), Gaps = 11/182 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G + + + + + + + + + + + ++ Sbjct: 34 LGDCFEFLKNNTLSRAGLNGENGTARNVHYGDILIKFDDCLDGERSDLPFITDDTVLPKF 93 Query: 285 ---IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST---YLAWLMR 338 I+ G+++F + + + S + YL + Sbjct: 94 AGSILREGDVIFADTAEDEAAGKCVELRKLPKEPTISGLHTIPARPRFPFGTGYLGHYLN 153 Query: 339 SYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S + + G++ S+ ++ V P + EQ I + + ID L+ + Sbjct: 154 SDAYHRQLLPLMQGIKVISVSKAALQDTQVRFPGLSEQTAIGAAL----SEIDTLITLHQ 209 Query: 398 QS 399 + Sbjct: 210 RE 211 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 24/200 (12%), Positives = 52/200 (26%), Gaps = 22/200 (11%) Query: 23 KHWKVVPIKRFTKLNTGRTSES-------------GKDIIYIGLEDVESGTGKYLPKDGN 69 W+ + + T I I +D G LP + Sbjct: 27 SSWEQRKLGDCFEFLKNNTLSRAGLNGENGTARNVHYGDILIKFDDCLDGERSDLPFITD 86 Query: 70 SRQSDTSTVSIFAKGQILYGKL------GPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 SI +G +++ G + + I + +P+ Sbjct: 87 DTVLPKFAGSILREGDVIFADTAEDEAAGKCVELRKLPKEPTISGLHTIPARPRFPFGTG 146 Query: 124 LQ-GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI- 181 +L S +++ + +G + + + + P L+EQ I + I Sbjct: 147 YLGHYLNSDAYHRQLLPLMQGIKVISVSKAALQDTQVRFPGLSEQTAIGAALSEIDTLIT 206 Query: 182 -DTLITERIRFIELLKEKKQ 200 ++ Q Sbjct: 207 LHQREPPHTMKEGKNVDQHQ 226 >gi|77414974|ref|ZP_00791063.1| type I restriction enzyme S protein (hsdS) [Streptococcus agalactiae 515] gi|77158974|gb|EAO70196.1| type I restriction enzyme S protein (hsdS) [Streptococcus agalactiae 515] Length = 127 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 18/101 (17%), Positives = 36/101 (35%), Gaps = 10/101 (9%) Query: 14 GVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV----ESGTGKY 63 V+ IP+ W V ++ + +G T +S + +I +I D+ + Sbjct: 21 EVEVPYEIPESWNWVKLRNIGSITSGGTPKSSEPSYYGGNITWITPADMGKQQNNKFFAK 80 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 K S+ + +K I+Y P I+ + Sbjct: 81 SSKKITELGLQKSSAQLISKNSIVYSSRAPIGHINIVTEDY 121 >gi|225873159|ref|YP_002754618.1| type I restriction-modification system, S subunit [Acidobacterium capsulatum ATCC 51196] gi|225791214|gb|ACO31304.1| type I restriction-modification system, S subunit [Acidobacterium capsulatum ATCC 51196] Length = 429 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 60/417 (14%), Positives = 126/417 (30%), Gaps = 44/417 (10%) Query: 38 TGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS--IFAKGQILYGKLGPY 94 G+T + I + V+ G PK+ + + + + +L P Sbjct: 21 RGKTPPKTASGVRLITAKVVKGGQILEEPKEFIAEDFYDEWMRRGLPQELDVLLTTEAPL 80 Query: 95 LRKAIIADFDGIC-STQFLVLQPKDV--LPELLQGWLLSIDVTQRIEAICEGATMSHADW 151 AI+ D I + + ++L+ K P L L S ++A G T+ Sbjct: 81 GETAILRDKTRIALAQRIILLRAKREVVDPLFLFYALQSDFAQSELKARASGTTVLGIKQ 140 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 + + +P+ L+ Q+ I + I+ + + L Sbjct: 141 SELRRVRIPLFSLSAQLKIGSILATYDELIENNQRRIRILE----QMARRLYREWFVHFR 196 Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK---- 267 P + +G +P WEV+ L+ ++ +I+ Sbjct: 197 FPGHENHPRVPSPLGEIPQGWEVRNLECLMVHQIGGGWGKDVADDTYTEPAWVIRGTDIP 256 Query: 268 ------LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV---------ME 312 +++ S + + G+IVF + R+ + + Sbjct: 257 GARSAQVDSVPYRYHTLSNLRSRRLQAGDIVFEVSGGSKGQPVGRTLLITPELLSAFGGD 316 Query: 313 RGIITSAYMAVKPHG--IDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFED-VKRLPV 367 + S ++P L Y + S + K+ + + Sbjct: 317 DVMCASFCKRIQPDQTAYGPEMLYLSFLEGYESGEIEQYQVQSTGISNFKWTEYIANTLR 376 Query: 368 LVPPIKEQ---FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +VPP + +I + E A + + L+ R + ++GQI L Sbjct: 377 VVPPDSLRKDFQEIVRPLLREVATLGL-------KSANLRRTRDLLLPRLLSGQIKL 426 >gi|170717888|ref|YP_001784942.1| type I restriction enzyme, S subunit [Haemophilus somnus 2336] gi|168826017|gb|ACA31388.1| putative type I restriction enzyme, S subunit [Haemophilus somnus 2336] Length = 171 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 49/149 (32%), Gaps = 8/149 (5%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR---SLRSAQVM 311 I I + L + E +++ G++V + + Sbjct: 27 YIHYGDIHRGIANILNDISVLPNITGEYSELLSFGDLVVADASEDYYGVAAPCVINCIYE 86 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP 370 + + +A++P+ +L +L+ S + +G+G ++ +++ P Sbjct: 87 QNIVAGLHTIAIRPYKSHHLFLYYLLHSSGFKEYCKKVGTGTKVFAITSKNLLGFESFFP 146 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQS 399 +EQ I +D + ++ Sbjct: 147 HYEEQQKIGAF----FTALDRYITIHQRK 171 Score = 37.1 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 36/149 (24%), Gaps = 7/149 (4%) Query: 43 ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKA---- 98 E YI D+ G L + + G ++ Sbjct: 20 EEKTKTKYIHYGDIHRGIANILNDISVLPNITGEYSELLSFGDLVVADASEDYYGVAAPC 79 Query: 99 ---IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 I + + + + ++P L L S + + + G + K + Sbjct: 80 VINCIYEQNIVAGLHTIAIRPYKSHHLFLYYLLHSSGFKEYCKKVGTGTKVFAITSKNLL 139 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTL 184 P EQ I A I Sbjct: 140 GFESFFPHYEEQQKIGAFFTALDRYITIH 168 >gi|332308207|ref|YP_004436058.1| restriction modification system DNA specificity domain protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332175536|gb|AEE24790.1| restriction modification system DNA specificity domain protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 459 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 62/176 (35%), Gaps = 13/176 (7%) Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP------GEIVFRFI 296 + + K K +E I + + + + S E + G+++ Sbjct: 19 DCDHKTPKAVEMGIPYIGIPQMDNGRINFDAKPRLISEEDFVKWTRKANPTYGDVILSRR 78 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLR 354 + + + G + K + YL ++++S + + Sbjct: 79 CNSGETVYVPKNRRFALG-QNLVLLRPKGDRLFPEYLRYVVKSKEWWDEVAKYLNPGAIF 137 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +SLK D+ + V PP++ Q I +++ I+ +E +Q+ L++ + Sbjct: 138 ESLKCADIPKFMVPEPPVEAQKKIVEILSA----IEDRIELNQQTNQTLEQMAQAL 189 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 62/478 (12%), Positives = 137/478 (28%), Gaps = 83/478 (17%) Query: 2 KHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGT 60 YK KD GV+ I +T ++ + I YIG+ +++G Sbjct: 3 SKYKK-ASLKDLGVELID-----------------CDHKTPKAVEMGIPYIGIPQMDNGR 44 Query: 61 GKYLPKDGNSRQSDTSTVSIFAK---GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 + K + D + A G ++ + + Q LVL Sbjct: 45 INFDAKPRLISEEDFVKWTRKANPTYGDVILSRRCNSGETVYVPKNRRFALGQNLVLLRP 104 Query: 118 DVLPELLQGWLLSIDVTQRI----EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + + + + GA I +P PP+ Q I E Sbjct: 105 KGDRLFPEYLRYVVKSKEWWDEVAKYLNPGAIFESLKCADIPKFMVPEPPVEAQKKIVEI 164 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQA-------LVSYIVTKGLN-------------- 212 + A RI+ ++ + ++ ++ + G Sbjct: 165 LSAIEDRIELNQQTNQTLEQMAQALFKSWFVHFDPVIDNALAAGNEIPDALQHRVEIRKK 224 Query: 213 ----------------PDVKMKDSGIE--------WVGLVPDHWEVKPFFALVTELNRKN 248 ++ S +E G VP W+ K + R + Sbjct: 225 AHALQKQKPNIQPLPEATQRLFPSELEHTDEASIGINGWVPKGWQTKSVDECININPRVS 284 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 L + + + + M + ++Y G+++ I Sbjct: 285 --LPKGTLAKFADMKALPTSGYGIMDVIEKNYTGGAKFQQGDVLLARITPCLQNGKTGIV 342 Query: 309 QVMER----GIITSAYMAVKPHGIDSTYLAWLMRSYDLCK---VFYAMGSGLRQSLKFED 361 M+ G ++ ++ ++ G T + + + + +GS RQ ++ Sbjct: 343 DFMDEDNEIGFGSTEFIVMRRKGGLGTPFISCLARDENFRNHCMQSMVGSSGRQRVQNAC 402 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + +P + ++ N A + + + L R + ++G+I Sbjct: 403 FSAYFLALPTTE---NVLNTFQTIVAPMFTRMTINNEETKSLANLRDLLLPKLISGEI 457 >gi|228472561|ref|ZP_04057321.1| type I restriction enzyme EcoAI specificity protein [Capnocytophaga gingivalis ATCC 33624] gi|228275974|gb|EEK14730.1| type I restriction enzyme EcoAI specificity protein [Capnocytophaga gingivalis ATCC 33624] Length = 222 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/144 (17%), Positives = 52/144 (36%), Gaps = 6/144 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 +P+ W+ G+ YI + +++ + + K + ++ + Sbjct: 65 ELPEGWEWCRGYEILN-PMETQKPIGEMFGYIDIASIDNKNNRIIDAKFISVSEAPSRAS 123 Query: 79 SIFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQP-KDVLPELLQGWLLSIDVT 134 G L+ + PYL+ + + I ST F + P K + P+ L +LS V Sbjct: 124 RKVKFGDTLFSMVRPYLKNIAFVEEEYSNCIASTGFYICSPNKTLYPKFLFYLMLSEYVV 183 Query: 135 QRIEAICEGATMSHADWKGIGNIP 158 + +G + + I N Sbjct: 184 NGLNKYMKGDNSPSINNENITNFF 207 >gi|170079642|ref|YP_001736275.1| Type I restriction modification system, N-6 DNA methylase [Synechococcus sp. PCC 7002] gi|169887311|gb|ACB01020.1| Type I restriction modification system, N-6 DNA Methylase [Synechococcus sp. PCC 7002] Length = 1179 Score = 58.3 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 52/418 (12%), Positives = 117/418 (27%), Gaps = 45/418 (10%) Query: 31 KRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 K T + G + E G+ + ++ + S + IL Sbjct: 739 KHLTLVQYGISIEMNEEGEGTKIYRMNEIHNMLCDIDVLKSAKISSIEIEKYKLKERDIL 798 Query: 88 YGKL-------GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ--GWLLSIDVTQRIE 138 + + L K S V + + ++ + Sbjct: 799 FNRTNSFDLVGRTGLFKISSDREFVFASYLIRVRTDESKILPEYLVAFLNSNLGIWDIKR 858 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL---- 194 S+ + + + + +P+ Q+ ++E ++ I +L Sbjct: 859 RARISINQSNVNSQELAAVKIPLLNREFQLKLKEIFDRAHLKRLESIKTYQEAEDLLLSE 918 Query: 195 -------LKEKKQAL--VSYIVTKG---LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 E+ A+ S G L+ + + + + V+P L+ Sbjct: 919 LGLKDWEPTEETVAVKRFSESFLLGDARLDAEYYQPKYDQAELAIQNCGFSVEPLGMLIE 978 Query: 243 ELNRKNTK--LIESNILSLSYGNIIQKLETRNMGLKP----ESYETYQIVDPGEIVFRFI 296 + E + G+I + +K + + G+I+F Sbjct: 979 PIQNGFDYREYTEEGTPYIRVGDIKNGQINYDSAVKIPITMDDVAKSVGLHTGDILFTRK 1038 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-----STYLAWLMRSYDLCKVFYAMGS 351 + + +V II+S M V+ + Y++ + S Sbjct: 1039 GSFGNSAVVTENEV--DAIISSEIMLVRINEEYKSKLCPEYVSLFLNSKFGYLQVERRVH 1096 Query: 352 GLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVET---ARIDVLVEKIEQSIVLLKE 405 G+ S+ D+ + + + P Q I I A+ L+E + + E Sbjct: 1097 GVAYYSISQPDLAAIKIPLLPTASQNKIVQFIKSSFHSKAQSKQLLEIAKHGVEKAIE 1154 Score = 36.3 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 17/158 (10%), Positives = 47/158 (29%), Gaps = 7/158 (4%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMER 313 N++ ++ + +I+F + + L Sbjct: 762 YRMNEIHNMLCDIDVLKSAKISSIEIEKYKLKERDILFNRTNSFDLVGRTGLFKISSDRE 821 Query: 314 GIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLV 369 + S + V+ I YL + S S + ++ +++ + + + Sbjct: 822 FVFASYLIRVRTDESKILPEYLVAFLNSNLGIWDIKRRARISINQSNVNSQELAAVKIPL 881 Query: 370 PPIKEQFDITNVINVE-TARIDVLVEKIEQSIVLLKER 406 + Q + + + R++ + E +LL E Sbjct: 882 LNREFQLKLKEIFDRAHLKRLESIKTYQEAEDLLLSEL 919 >gi|331006811|ref|ZP_08330072.1| hypothetical protein IMCC1989_746 [gamma proteobacterium IMCC1989] gi|330419379|gb|EGG93784.1| hypothetical protein IMCC1989_746 [gamma proteobacterium IMCC1989] Length = 193 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 27/132 (20%), Positives = 51/132 (38%), Gaps = 7/132 (5%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS---AYMAVKPHGIDSTYLA 334 E + V+ +I FR L N + ++ +I + K + ID YL Sbjct: 54 EELKEKHRVEVNDIAFRSRGLTNTAALI--NAELDNAVIAAPLLRIRIEKKNKIDPAYLC 111 Query: 335 WLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 WL+ + +G Q + ++ L +L+PPI Q I + + ++ Sbjct: 112 WLINQPASQAALLSQSTGTVQRTIGKPALESLELLIPPIDAQIKIVE-LERLALKEQRIM 170 Query: 394 EKIEQSIVLLKE 405 +++ Q L E Sbjct: 171 QELAQKKRQLME 182 >gi|330723204|gb|AEC45574.1| Restriction endonuclease S subunits [Mycoplasma hyorhinis MCLD] Length = 459 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 46/332 (13%), Positives = 111/332 (33%), Gaps = 16/332 (4%) Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI--LYGKLGPYLRKAIIADFDGIC 107 ++ ++++ GKY + + T K + L Y + Sbjct: 9 FVSKYEIQNNPGKYPVYSSQTTNNGTMGYISSYKYDLECLTWTTRGYAGVVFYRNEKFSV 68 Query: 108 STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 S L++ ++++ ++ I + T + + + + + Sbjct: 69 SNSGLLIFKRNIIYNYRYFL-----FVFQMADIQKSMTAGNIPQFTVEMMKEAVLTYSNN 123 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 + + KI +D +I+ R I LL++ ++AL+ + K ++ G Sbjct: 124 LNEQRKISQLFYTLDKIISLYERKISLLEKIEKALLDNMFIKENEEKPSIRFLGFNSDWQ 183 Query: 228 VPDHWEVKPFFALVTELNR-KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 + ++ + + T I L+ N + +S E + Sbjct: 184 SWTLEDKGYLYSGLNSKTKVDFTNGNSKYITYLNVFNNFNIDLKEKSLVFIKSDEKQNSI 243 Query: 287 DPGEIVFRFIDLQNDKRSLRSA---QVMERGIITSAYMAVKPHGID---STYLAWLMRSY 340 G+I+F + + SA +V E+ + S + + D + A+L R++ Sbjct: 244 VKGDILFTMSSETYQEVGMSSAVTEEVNEKIYLNSFCFGYRLNKADFLFPNFSAFLFRNH 303 Query: 341 DLCK--VFYAMGSGLRQSLKFEDVKRLPVLVP 370 + + + G R +L + L + P Sbjct: 304 SVRHKIILQSNGGTSRFNLSKKSFLNLKIKSP 335 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 46/158 (29%), Gaps = 8/158 (5%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + N K + Y ++ + + Sbjct: 9 FVSKYEIQNNPGKYPVYSSQTTNNGTMGYISSYKYDLECLTWTTRGYAGVVFYRNEKFSV 68 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IK 373 + + + + Y ++ + D+ K +M +G E +K + + Sbjct: 69 SNSGLLIFKRNIIYNYRYFLFVFQMADIQK---SMTAGNIPQFTVEMMKEAVLTYSNNLN 125 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 EQ I + +D ++ E+ I LL++ + + Sbjct: 126 EQRKI----SQLFYTLDKIISLYERKISLLEKIEKALL 159 >gi|293401666|ref|ZP_06645808.1| putative restriction modification system DNA specificity subunit [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291304924|gb|EFE46171.1| putative restriction modification system DNA specificity subunit [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 239 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 36/263 (13%), Positives = 81/263 (30%), Gaps = 29/263 (11%) Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMK 218 M +PPL Q I E + +I+ + + +A G + Sbjct: 1 MKLPPLNCQRKIVEILSFIDNKIEENRKINNNLEQQAQAIFKAWFIDFEPFGCS------ 54 Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE 278 +P W V + + S+I + ++ I + GL E Sbjct: 55 ---------IPSDWTVLTLGDVSQMGAGGDKPKNVSSIQTENHPYPI-----YSNGLSDE 100 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 Y + + + + + I+ + + + + YL + Sbjct: 101 GLYGYTDIPKIYEESVTVSARGTIGFVCLRHIPYFPIVRLVTLIPNTNILSAKYLYLYLN 160 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + +Q L D ++ +LVP TN++N + + ++ Sbjct: 161 QQHIIG-----TGTTQQQLTVPDFRKTEILVPIKDVVDAFTNIVNPLFDK----IWANQE 211 Query: 399 SIVLLKERRSSFIAAAVTGQIDL 421 L R + + ++G++D+ Sbjct: 212 ENKYLSTLRDTLLPKLISGKLDV 234 Score = 40.5 bits (93), Expect = 0.50, Method: Composition-based stats. Identities = 19/187 (10%), Positives = 42/187 (22%), Gaps = 15/187 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS- 79 IP W V+ + +++ G + +++ Y + Sbjct: 55 IPSDWTVLTLGDVSQMGAGGDKPKN-------VSSIQTENHPYPIYSNGLSDEGLYGYTD 107 Query: 80 --IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + + G + + + L P + +L + Sbjct: 108 IPKIYEESVTVSARGTIGFVCLRHIPYFPI-VRLVTLIPNTNILSAKYLYL----YLNQQ 162 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 I G T + +P + +I E L Sbjct: 163 HIIGTGTTQQQLTVPDFRKTEILVPIKDVVDAFTNIVNPLFDKIWANQEENKYLSTLRDT 222 Query: 198 KKQALVS 204 L+S Sbjct: 223 LLPKLIS 229 >gi|288929354|ref|ZP_06423199.1| HsdS protein [Prevotella sp. oral taxon 317 str. F0108] gi|288329456|gb|EFC68042.1| HsdS protein [Prevotella sp. oral taxon 317 str. F0108] Length = 347 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 61/386 (15%), Positives = 122/386 (31%), Gaps = 46/386 (11%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 +TG + K I V S T + + +I G Sbjct: 2 GDVCNTSTGNKNTQDKTDDGIYPFYVRSQTVERIN------SWTFDGEAILTAGD----- 50 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHAD 150 G + K + I Q + + S R++ + ++ Sbjct: 51 -GVGVGKVFHHTYGKIGVHQRVYILSDFKCDANYLFHFFSSKFYNRVKRMSAKNSVDSVR 109 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 + I ++P+ +P EQ+ I + RI T +L + L S I Sbjct: 110 KEMITDMPLSLPCCQEQIKIGYMLSILDERIATQNKIIEDLKKLKCAIIEKLYSEIQ--- 166 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 G E++ V K + S G + + Sbjct: 167 ----------GKEYL---------YGQLFEVVNKRNKQMEYSNILSASQEKGMVNRDDLN 207 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 ++ + + TY+IV G+ V Q A + G+ + AY ++P+ + Sbjct: 208 LDIQFERSNINTYKIVRAGDYVIHLRSFQG-----GFAFSDKLGVCSPAYTILRPNCLLE 262 Query: 331 T-YLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 YL++ S+ K + G+R +S+ E+ + V++P + Q ++ Sbjct: 263 YGYLSYYFTSHRFIKSLIIVTYGIRDGRSINIEEWLNMKVIIPSKEYQLHTLKIL----R 318 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAA 413 I+ +E E + L ++ + Sbjct: 319 SIEGKIENEETYTICLSNQKQYLLNQ 344 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 45/155 (29%), Gaps = 6/155 (3%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + GE + D + + G+ Sbjct: 13 NTQDKTDDGIYPFYVRSQTVERINSWTFDGEAILTAGDGVG-VGKVFHHTYGKIGVHQRV 71 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y+ D+ YL S +V S++ E + +P+ +P +EQ I Sbjct: 72 YILSDFKC-DANYLFHFFSSKFYNRVKRMSAKNSVDSVRKEMITDMPLSLPCCQEQIKIG 130 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + +D + + I LK+ + + I Sbjct: 131 ----YMLSILDERIATQNKIIEDLKKLKCAIIEKL 161 >gi|18765820|gb|AAL78773.1|AF326622_1 JHP785-like protein [Helicobacter pylori] Length = 200 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 15/114 (13%), Positives = 44/114 (38%), Gaps = 1/114 (0%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSG 352 I + + ++ +V P + YL +++ + + S Sbjct: 65 NTITIAQYGTAGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSA 124 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + S+ ++ ++ + +PP++ Q +I +++ + L+ I I K++ Sbjct: 125 IPYSISSNNIMQITIPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIKARKKQ 178 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 41/156 (26%), Gaps = 11/156 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + ++ G+ + + GKY G Sbjct: 13 PKGVGFRKLGEVCEIIRGKRVTKKEIL----------DKGKYPVVSGGIGFMGYLNEYNR 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + G + + + PK+ L ++L+ Sbjct: 63 EENTITIAQYG-TAGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISN 121 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 A I I +PIPPL Q I + + Sbjct: 122 RSAIPYSISSNNIMQITIPIPPLEIQQEIVKILDQF 157 >gi|291534511|emb|CBL07623.1| Restriction endonuclease S subunits [Roseburia intestinalis M50/1] Length = 536 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 52/164 (31%), Gaps = 2/164 (1%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 N+K+ + + G+ E + + E ++ G+++ Sbjct: 370 NKKDPAGNIGVVNISNIGDYDIDYECLDHLQEEERKVANYLLQEGDVLLPARGTAIRTAV 429 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVK 363 + ++ YL + S K+ G ++ ++D+ Sbjct: 430 FHEQTYPCIASSNVIVIRPDQKNLNGYYLKIFLDSPIGNKMISGAQQGMTVMNISYKDLN 489 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS-IVLLKER 406 L V +P +++Q + E + V E+ +LK+ Sbjct: 490 VLEVPLPNMEKQKAVVKEYQEELKKYSDTVAAAEKRWNEVLKKL 533 >gi|160945576|ref|ZP_02092802.1| hypothetical protein FAEPRAM212_03105 [Faecalibacterium prausnitzii M21/2] gi|158443307|gb|EDP20312.1| hypothetical protein FAEPRAM212_03105 [Faecalibacterium prausnitzii M21/2] Length = 199 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 55/176 (31%), Gaps = 5/176 (2%) Query: 247 KNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K + I ++ + I K + + G + + + + + Sbjct: 23 KPEYYTNNGIAWITPKDLSINKSKFISHGENDITELGLKNSSATVMPKGTVLFSSRAPIG 82 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 A + +V P+ T + + L + A + + +K + Sbjct: 83 YIAIASNEVTTNQGFKSVIPYSEIGTAFVYFFLKHSLPVIESAASGSTFKEISGSAMKNI 142 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P ++P + N A I + +E+ L R S + ++G ID+ Sbjct: 143 PAIIPDRNT----LDQFNSFCAPIFAQQKILEEQNHSLAMLRDSLLPKLMSGAIDI 194 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 56/194 (28%), Gaps = 12/194 (6%) Query: 25 WKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYL---PKDGNSRQSD 74 W++ I + G T I +I +D+ K++ D Sbjct: 2 WQISTISDLGTVVGGSTPSKTKPEYYTNNGIAWITPKDLSINKSKFISHGENDITELGLK 61 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ ++ KG +L+ P IA + + F + P + + Sbjct: 62 NSSATVMPKGTVLFSSRAPI-GYIAIASNEVTTNQGFKSVIPYSEI-GTAFVYFFLKHSL 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 IE+ G+T + NIP IP + L + L Sbjct: 120 PVIESAASGSTFKEISGSAMKNIPAIIPDRNTLDQFNSFCAPIFAQQKILEEQNHSLAML 179 Query: 195 LKEKKQALVSYIVT 208 L+S + Sbjct: 180 RDSLLPKLMSGAID 193 >gi|227888665|ref|ZP_04006470.1| possible type I restriction modification DNA specificity protein [Lactobacillus johnsonii ATCC 33200] gi|227850778|gb|EEJ60864.1| possible type I restriction modification DNA specificity protein [Lactobacillus johnsonii ATCC 33200] Length = 129 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 15/130 (11%), Positives = 52/130 (40%), Gaps = 4/130 (3%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + +++Y + + +++ + + E Y V +I+F + + + E I Sbjct: 1 MNNITYDGKLDLRDLKSIDIPEKDLEKYS-VKKDDILFNRTNSRELVGKTCVYTIPETMI 59 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM---GSGLRQSLKFEDVKRLPVLVPPI 372 + + V+ + + + + D K + + + ++ +++++ + +PP+ Sbjct: 60 LAGFIIRVRLNELANPLFVSTFLNTDYSKQLFKTICKNASGQSNINATELQKIKIYIPPL 119 Query: 373 KEQFDITNVI 382 Q N + Sbjct: 120 SLQNKFANFV 129 >gi|253567537|ref|ZP_04844966.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] gi|251943639|gb|EES84240.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 225 Score = 58.3 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 21/193 (10%), Positives = 58/193 (30%), Gaps = 11/193 (5%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILS--LSYGNIIQKLETRNMGLKPESYETYQIV 286 P W + ++ N + + + R ++ + + Sbjct: 41 PIGWNNGTLIDIANITMGQSPDGTSYNEIGEGVLFYQGSTDFGMRFPSVRQYTTAPSRFA 100 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 G+I+ I A+ +T+L +++ + Sbjct: 101 KKGDILMSVRAPVG-----AVNIANNDCCIGRGLSALNSKIGSTTHLYYILNDLRIAFDQ 155 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 S+ ED+ LP+++P ++ + + + + + + I L ++ Sbjct: 156 RNAAGTTFGSITKEDLYNLPIVIPA----KEVISAFDKICSPMFDRQMLLGEEIDTLIKQ 211 Query: 407 RSSFIAAAVTGQI 419 R + + GQ+ Sbjct: 212 RDELLPLLLNGQV 224 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 52/175 (29%), Gaps = 8/175 (4%) Query: 10 YKDSG--VQWIGA----IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + W IP W + + G++ + G+ + Sbjct: 23 YKSSGGNMVWNEKLKRNIPIGWNNGTLIDIANITMGQSPDGTSYNEIGEGVLFYQGSTDF 82 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + + RQ T+ KG IL P IA+ D L K Sbjct: 83 GMRFPSVRQYTTAPSRFAKKGDILMSVRAPV-GAVNIANNDCCIGRGLSALNSKIGSTTH 141 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 L ++L+ + G T + + N+P+ IP + Sbjct: 142 LY-YILNDLRIAFDQRNAAGTTFGSITKEDLYNLPIVIPAKEVISAFDKICSPMF 195 >gi|53803795|ref|YP_114321.1| hypothetical protein MCA1886 [Methylococcus capsulatus str. Bath] gi|53757556|gb|AAU91847.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath] Length = 192 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 44/134 (32%), Gaps = 9/134 (6%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + +V G+++FR N + + + ++ YL W + Sbjct: 55 DLKDRHLVQAGDLLFRSRGATNSAALVGDGLGRAVLAAPMLLIRPQTEVVEPAYLQWFIN 114 Query: 339 SYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 +G + + + L V++PP+++Q I V + Sbjct: 115 HPSTQATLAGQAAGTAVKMIGKGVLHHLKVVLPPLEKQRRIVEVAQLALRE--------A 166 Query: 398 QSIVLLKERRSSFI 411 + L+ RR + + Sbjct: 167 ALLEELRGRRKALL 180 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 52/155 (33%), Gaps = 10/155 (6%) Query: 29 PIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + ++ G + S D+ I ++D++ + + D + Sbjct: 4 TLATIAEVRMGYSFRSRLEADAQGDVAVIQMKDIDDANLLHPEGLVRVQMPDLKDRHLVQ 63 Query: 83 KGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G +L+ G A++ D G + Q + V P LQ ++ + Sbjct: 64 AGDLLFRSRGATNSAALVGDGLGRAVLAAPMLLIRPQTEVVEPAYLQWFINHPSTQATLA 123 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 G + + ++ + +PPL +Q I E Sbjct: 124 GQAAGTAVKMIGKGVLHHLKVVLPPLEKQRRIVEV 158 >gi|332877054|ref|ZP_08444805.1| type I restriction modification DNA specificity domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684944|gb|EGJ57790.1| type I restriction modification DNA specificity domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 201 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 38/154 (24%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 F K I +++ + +V +I+ Sbjct: 36 KGFDYGMNAAAKPFDGQHKYIRITDIDESSAAYIDKDVVSPDGELQDSYLVKANDILLAR 95 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 K L + P + L S + + Sbjct: 96 TGASTGKSYLYDNKDGILYFAGFLIRVNIPSDNAYFVFSQLHLSRYRKWIGIMSARSGQP 155 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + ++ P+ +P I+EQ I ++ + RI Sbjct: 156 GVNSQEYSNYPIYLPKIEEQTKIAKLLKLVDERI 189 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 53/165 (32%), Gaps = 7/165 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 WK + K + + K YI + D++ + Y+ KD S + Sbjct: 25 EWKKCTLGEIGKGFDYGMNAAAKPFDGQHKYIRITDIDESSAAYIDKDVVSPDGELQDSY 84 Query: 80 IFAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + IL + G K+ + D + + + + L + Sbjct: 85 LVKANDILLARTGASTGKSYLYDNKDGILYFAGFLIRVNIPSDNAYFVFSQLHLSRYRKW 144 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 I + + + + N P+ +P + EQ I + + RI Sbjct: 145 IGIMSARSGQPGVNSQEYSNYPIYLPKIEEQTKIAKLLKLVDERI 189 >gi|315651214|ref|ZP_07904244.1| conserved hypothetical protein [Eubacterium saburreum DSM 3986] gi|315486510|gb|EFU76862.1| conserved hypothetical protein [Eubacterium saburreum DSM 3986] Length = 182 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 17/146 (11%), Positives = 46/146 (31%), Gaps = 4/146 (2%) Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRS 307 + L +I LK E ++ P +IVF + Sbjct: 27 PFSKDLYTYLRITDIKDDSTLNLQDLKSVEDEKAREYLLKPNDIVFARTGASTGRNYFYD 86 Query: 308 AQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRL 365 E ++ ++ Y+ + +S + +G R ++ + + ++ Sbjct: 87 GTDGEFVYAGFLIKFSIDEKKVNPKYIKYFCQSKQYQDWINSFNTGSTRGNINAQTLGKM 146 Query: 366 PVLVPPIKEQFDITNVINVETARIDV 391 + + K Q + ++++ +I Sbjct: 147 EIPLIERKMQDALVSILSSIDKKIKK 172 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 52/156 (33%), Gaps = 6/156 (3%) Query: 41 TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII 100 S Y+ + D++ +D S + + + + I++ + G + Sbjct: 26 VPFSKDLYTYLRITDIK-DDSTLNLQDLKSVEDEKAREYLLKPNDIVFARTGASTGRNYF 84 Query: 101 ADFD--GICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 D FL+ K V P+ ++ + S I + G+T + + + +G Sbjct: 85 YDGTDGEFVYAGFLIKFSIDEKKVNPKYIKYFCQSKQYQDWINSFNTGSTRGNINAQTLG 144 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + +P+ Q + + + +I Sbjct: 145 KMEIPLIERKMQDALVSILSSIDKKIKKNNEVNNNL 180 >gi|302190883|ref|ZP_07267137.1| putative type I restriction-modification specificity protein [Lactobacillus iners AB-1] Length = 179 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 51/158 (32%), Gaps = 6/158 (3%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP---GEIVFRFIDLQND 301 + ++ N++ K Y Y+ + I+ Sbjct: 21 HGTPKYTENGEYAFVNGNNLVDGEILIKKETKRVDYSQYEKYKKPLTNRTILVSINGTLG 80 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + ++ + G SA +D ++ +++ S + + +G +++ + Sbjct: 81 NVGVYGSEKIILGK--SACYFNVKESVDKDFIYYIVSSPTFKQYLESNATGTTIKNISLK 138 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 ++ +P I EQ I++V+ +I + Sbjct: 139 QMREYTFELPEIGEQKRISSVLRKIDEKIKNNRAINKN 176 >gi|259501396|ref|ZP_05744298.1| conserved hypothetical protein [Lactobacillus iners DSM 13335] gi|259167200|gb|EEW51695.1| conserved hypothetical protein [Lactobacillus iners DSM 13335] Length = 172 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 50/151 (33%), Gaps = 6/151 (3%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP---GEIVFRFIDLQND 301 + ++ N++ K Y Y+ + I+ Sbjct: 21 HGTPKYTENGEYAFVNGNNLVDGEILIKKETKRVDYSQYEKYKKPLTNRTILVSINGTLG 80 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + ++ + G SA +D ++ +++ S + + +G +++ + Sbjct: 81 NVGVYGSEKIILGK--SACYFNVKESVDKDFIYYIVSSPTFKQYLESNATGTTIKNISLK 138 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDV 391 ++ +P I EQ I++V+ +I Sbjct: 139 QMREYTFELPEIGEQKRISSVLRKIDEKIKN 169 >gi|332829723|gb|EGK02369.1| hypothetical protein HMPREF9455_01639 [Dysgonomonas gadei ATCC BAA-286] Length = 372 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 41/136 (30%), Gaps = 10/136 (7%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER-----GIITSAYMAVKPHGIDSTYLAW 335 Y I G+I F + + + + + + K H + + Sbjct: 46 NKYTICKEGDIAFADASEDTNDVAKVVEFLNCNNKSIVCGLHTIHGRDKKHLTIKGFKGY 105 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 S + G S+ ++ L + +P +EQ I +++ +D + Sbjct: 106 AFSSIPFRNQVRRLAQGTKIYSINSKNFDELYIGIPSKEEQAKIAHLL----ILLDERIA 161 Query: 395 KIEQSIVLLKERRSSF 410 + I L+ Sbjct: 162 TQNKIIEKLESLIKGL 177 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 45/355 (12%), Positives = 103/355 (29%), Gaps = 48/355 (13%) Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS----- 130 + +I +G I + C+ + +V + + + Sbjct: 46 NKYTICKEGDIAFADASEDTNDVAKVVEFLNCNNKSIVCGLHTIHGRDKKHLTIKGFKGY 105 Query: 131 ----IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 I ++ + +G + + K + + IP EQ I +I RI T Sbjct: 106 AFSSIPFRNQVRRLAQGTKIYSINSKNFDELYIGIPSKEEQAKIAHLLILLDERIATQNK 165 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + L+K A DHW ++ ++ + Sbjct: 166 IIEKLESLIKGLYSAT-------------------------KRDHWRMQYLRDILEQRKE 200 Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 NT+ +++ G +I ++E + Y +V G+IV+ N + Sbjct: 201 FNTQNYFVFSVAVKEG-LINQIEHMGRSFAAKDTRHYNVVKYGDIVYTKSPTGNFPYGIV 259 Query: 307 SAQVMERGIITSAYMAVK--PHGIDSTYLAWLMRSY-----DLCKVFYAMGSGLRQSLKF 359 + S V + S L S L + ++ Sbjct: 260 KQSFTNIPVAVSPLYGVYKSKNLHLSNILHHYFLSPIKANNYLHSLIQKGAKNTI-NITS 318 Query: 360 EDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + +L+P E I ++ I+ + ++ + ++ ++ + Sbjct: 319 QHFLEKAILLPVDKSEIQTI----SLLLTTINKKIGFEKEVLKKMQIQKVFLLQQ 369 >gi|319896987|ref|YP_004135182.1| type i restriction enzyme [Haemophilus influenzae F3031] gi|317432491|emb|CBY80848.1| putative type I restriction enzyme [Haemophilus influenzae F3031] Length = 437 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/430 (9%), Positives = 121/430 (28%), Gaps = 52/430 (12%) Query: 36 LNTGRTSES---GKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK- 90 L G T G + + ++ S + D ++ +L+ + Sbjct: 14 LRNGVTKPKRVRGSGYKMVNMGEIFSLSFIQNQTMDRVPLTDKEKATTLLQNNDLLFARQ 73 Query: 91 ----LGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSID---VTQRIEAICE 142 G + D + +C + + ++ L + + + I + Sbjct: 74 SLVRDGAGKCSIFLNDNEPVCFESHIIRVRLNQELCYPMFYYYFFSSRLGKNTMDKIIEQ 133 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 GA + + + +P ++Q I + + + +I ++ + ++ Sbjct: 134 GAGAAGIRGSDLAKLEVPYIGYSKQKEIADSLYSFDQKIQLNTQINQTLEQIAQALFKSW 193 Query: 203 V---------SYIVTKGLNPDVKMKDSGIEWVGLVPDH------WEVKPFFALVTELNRK 247 + ++ G++ + + G P+ + + L Sbjct: 194 FVDFDPVRAKAQALSDGMSLEQAELAAIQAISGKTPEELTALSQTQPDRYAELAETAKAF 253 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND------ 301 +++E + + G Q L+ + ++ T +++ G VF + Sbjct: 254 PCEMVEVDGGEVPKGWEYQYLKDICNIVYGKNLPTTKLIKEGYPVFGGNGVIGYYDKFLY 313 Query: 302 ------------KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 + + ++ + S ++ L + Sbjct: 314 ETPQTLVSCRGAASGKVLYSLPYSFVTNNSLVIEHEKSGLS--YFYIYEVLKLQNLTELT 371 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + + ++ + +LVP I V + + + L + R Sbjct: 372 SGSAQPQMTIANMAAVQILVPS----EKINEVCKKYLGTLYNQIYQNNIENETLAQTRDL 427 Query: 410 FIAAAVTGQI 419 + + G+I Sbjct: 428 LLPRLLNGEI 437 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 57/187 (30%), Gaps = 16/187 (8%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G +PK W+ +K + G+ + K I +G Y K Sbjct: 263 GEVPKGWEYQYLKDICNIVYGKNLPTTKLIKEGYPVFGGNGVIGYYDK------------ 310 Query: 79 SIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ Q L G K + + + + + ++ K L ++ + Q + Sbjct: 311 FLYETPQTLVSCRGAASGKVLYSLPYSFVTNNSLVIEHEKSGLS---YFYIYEVLKLQNL 367 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G+ + + + +P + ++ + +I E + Sbjct: 368 TELTSGSAQPQMTIANMAAVQILVPSEKINEVCKKYLGTLYNQIYQNNIENETLAQTRDL 427 Query: 198 KKQALVS 204 L++ Sbjct: 428 LLPRLLN 434 >gi|308190349|ref|YP_003923280.1| hypothetical protein MFE_08350 [Mycoplasma fermentans JER] gi|307625091|gb|ADN69396.1| hypothetical protein MFE_08350 [Mycoplasma fermentans JER] Length = 167 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 43/134 (32%), Gaps = 10/134 (7%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 +Y VD I+ + ++ +T + KP + + Sbjct: 33 ITYVNKWNVDEDAIIIGRVGAN----CGCVNITNKKSFVTDNALIFKPKEKNMARFYFYF 88 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 + F+ + L + + + +P + + I+ +++ ID +E+ Sbjct: 89 LLHLNLNKFHI--GSSQPLLTQGILGNIKINIPSLNKCQKISKILD----NIDNQIERNN 142 Query: 398 QSIVLLKERRSSFI 411 + L+ + I Sbjct: 143 SMVQKLQCFEQALI 156 >gi|160914346|ref|ZP_02076565.1| hypothetical protein EUBDOL_00354 [Eubacterium dolichum DSM 3991] gi|160915332|ref|ZP_02077544.1| hypothetical protein EUBDOL_01340 [Eubacterium dolichum DSM 3991] gi|158432723|gb|EDP11012.1| hypothetical protein EUBDOL_01340 [Eubacterium dolichum DSM 3991] gi|158433819|gb|EDP12108.1| hypothetical protein EUBDOL_00354 [Eubacterium dolichum DSM 3991] Length = 122 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 12/116 (10%), Positives = 37/116 (31%), Gaps = 2/116 (1%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 ++ +++ + + +E+ + + Y+ Sbjct: 8 YVSCEVPQKAMIYKNDLLICARNGSRSLVGKCAIVDIEKASFGAFMTKFSSKF--NPYIK 65 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + S + + + ++++ + +PP +EQ I N IN + +D Sbjct: 66 IFLDSPTFRNQLDNVKTETINQITQKNLQNQLLPLPPFEEQIKIVNTINKIYSILD 121 >gi|289624199|ref|ZP_06457153.1| Type I restriction enzyme specificity protein HsdS [Pseudomonas syringae pv. aesculi str. NCPPB3681] Length = 286 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 50/143 (34%), Gaps = 18/143 (12%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + Y P ++ R + N ++ Y + + YL + M++ Sbjct: 54 DKYSYNKPTVLIPRKGSITNIFYVDVPFWNVDTIY----YTDIDYSRVIPKYLYYFMKTI 109 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVETARIDVLV 393 D+ + R SL +K + + +P +K Q +I ++N + L Sbjct: 110 DMMAL---DTGSGRPSLTQAILKEILIPIPCPDDSKKSLKIQAEIVRILNTFSELTAELT 166 Query: 394 EKIEQSIVLLKE----RRSSFIA 412 K++ + K+ R ++ Sbjct: 167 AKLKAELKARKKQYNYYRDQLLS 189 >gi|298241943|ref|ZP_06965750.1| restriction modification system DNA specificity domain protein [Ktedonobacter racemifer DSM 44963] gi|297554997|gb|EFH88861.1| restriction modification system DNA specificity domain protein [Ktedonobacter racemifer DSM 44963] Length = 790 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 55/149 (36%), Gaps = 9/149 (6%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 ++ +K V+ G+I+ + + + + R + + + ++P D Sbjct: 640 SISVKDFDNAKNAHVEFGDILVTTTGAYLGRACVFDKKDL-RAVASGSVTILRPQFRDDI 698 Query: 332 YLAWL---MRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +L + S + + S + ++ D+ + + +PP+ +Q ++ I V Sbjct: 699 DPFFLTSIINSKLGKDQIFQLQAASASQPYIRRADLGAITIPLPPLSKQKELAQRIKVLL 758 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 LV + + I E + + + Sbjct: 759 TEAQDLVRRAQ-EIE--TEAKKLIVDELL 784 >gi|332749086|gb|EGJ79509.1| type I restriction enzyme EcoAI specificity domain protein [Shigella flexneri K-671] gi|332749353|gb|EGJ79774.1| type I restriction enzyme EcoAI specificity domain protein [Shigella flexneri 4343-70] gi|332749675|gb|EGJ80091.1| type I restriction enzyme EcoAI specificity domain protein [Shigella flexneri 2747-71] gi|332768710|gb|EGJ98889.1| stySKI methylase [Shigella flexneri 2930-71] gi|333009192|gb|EGK28648.1| type I restriction enzyme EcoAI specificity domain protein [Shigella flexneri K-218] gi|333010421|gb|EGK29854.1| type I restriction enzyme EcoAI specificity domain protein [Shigella flexneri VA-6] gi|333022275|gb|EGK41513.1| type I restriction enzyme EcoAI specificity domain protein [Shigella flexneri K-304] Length = 377 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 65/192 (33%), Gaps = 15/192 (7%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPES 279 S E +P+ WE + ++ + K+ S IL +I++ + G Sbjct: 93 SEEEKPFELPEGWEWVHLPDIYCSISESSRKIKSSEILPEGKYPVIEQSQEFISGYCNNE 152 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 ++ V F D + + + + P I + W +RS Sbjct: 153 ---CLLIKLNNPVIVFGDHTRN----IKFIDFDFVVGADGVKILSPILICERFFFWQLRS 205 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + L YA F+ + +PPI EQ I ++ + D L ++ S Sbjct: 206 FKLDVRGYAR--------HFKVLNSCLFALPPIAEQERIVEKVSSLMSLCDQLEQQSLTS 257 Query: 400 IVLLKERRSSFI 411 + ++ + + Sbjct: 258 LDAHQQLVETLL 269 Score = 37.5 bits (85), Expect = 4.5, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 69/204 (33%), Gaps = 18/204 (8%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRF-TKLNTGRTSESGKDIIYIGLEDVESG 59 +K K P+ S + +P+ W+ V + ++ +I+ G V Sbjct: 83 IKKQKPLPEI--SEEEKPFELPEGWEWVHLPDIYCSISESSRKIKSSEILPEGKYPVIEQ 140 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + +++ N+ + I++ G + R DFD + V + Sbjct: 141 SQEFISGYCNNECL----LIKLNNPVIVF---GDHTRNIKFIDFDFVVGAD-GVKILSPI 192 Query: 120 LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV 179 L + + + + + +PP+AEQ I EK+ + Sbjct: 193 LICERFFFWQLRSFKLDVRGYARHFKV-------LNSCLFALPPIAEQERIVEKVSSLMS 245 Query: 180 RIDTLITERIRFIELLKEKKQALV 203 D L + + ++ ++ + L+ Sbjct: 246 LCDQLEQQSLTSLDAHQQLVETLL 269 >gi|323139525|ref|ZP_08074571.1| N-6 DNA methylase [Methylocystis sp. ATCC 49242] gi|322395204|gb|EFX97759.1| N-6 DNA methylase [Methylocystis sp. ATCC 49242] Length = 717 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 41/116 (35%), Gaps = 4/116 (3%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 T +S E Y+++ + + + + + ++GI + Y+ Sbjct: 555 GITNPKTAIGKSPERYKVLRTHYLAYNPMRINIGSIGVVR-DDTQQGITSPDYVVFYCGP 613 Query: 328 ID-STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP-PIKEQFDITN 380 Y+ +RS G +R L FE + ++ + VP I+ Q N Sbjct: 614 DLLPEYVYHYLRSEAGRHEINLKTKGSVRFRLYFEQLSKIKIPVPKDIETQQRFVN 669 >gi|309808293|ref|ZP_07702199.1| conserved hypothetical protein [Lactobacillus iners LactinV 01V1-a] gi|308168440|gb|EFO70552.1| conserved hypothetical protein [Lactobacillus iners LactinV 01V1-a] Length = 235 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 68/181 (37%), Gaps = 11/181 (6%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N + G + S + ++ G+++ D++++ L + Sbjct: 54 NCYQVFKQGHINRGGGFNSSGTKSWYPISKSSALSKYVLHKGDVLMAMTDMKDNVAILGN 113 Query: 308 AQ---VMERGIITSAYMAVKPHGIDSTYLAWLM---RSYDLCKVFYAMG-SGLRQSLKFE 360 V ++ I+ ++ +G ST A++ S + K + SG++ +L Sbjct: 114 TALMAVDDQYIVNQRVGLLRSNGYKSTSYAYIYLLTNSLNFLKDLRSRANSGVQVNLSSS 173 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++K V + + + N T + ++ + L + R + + ++G++D Sbjct: 174 EIKDSSVWIANDEVNEEF----NALTEPLLSMIMTNDIENQKLIDLRDTLLPKLMSGELD 229 Query: 421 L 421 + Sbjct: 230 V 230 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 65/215 (30%), Gaps = 25/215 (11%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNS 70 G +P WK ++ + G +S + + G G + Sbjct: 19 GTVPDDWKQGTLQDIANFSNGYAFKSKELLNTSEPNCYQVFKQGHINRGGGFNSSGTKSW 78 Query: 71 RQSDTS---TVSIFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQ---------PK 117 S + + KG +L AI+ + Q++V Q K Sbjct: 79 YPISKSSALSKYVLHKGDVLMAMTDMKDNVAILGNTALMAVDDQYIVNQRVGLLRSNGYK 138 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + S++ + + + + I + + I + E+ A Sbjct: 139 STSYAYIYLLTNSLNFLKDLRSRANSGVQVNLSSSEIKDSSVWIAN----DEVNEEFNAL 194 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 T + ++I + L + + L+ +++ L+ Sbjct: 195 TEPLLSMIMTNDIENQKLIDLRDTLLPKLMSGELD 229 >gi|239994327|ref|ZP_04714851.1| restriction endonuclease S subunits-like protein [Alteromonas macleodii ATCC 27126] Length = 70 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 26/53 (49%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVES 58 AYP+YK S W+G +P W++ IK + + G + D Y E+ E Sbjct: 6 AYPEYKQSDEDWLGDVPSTWEIKMIKHLSPVKRGASPRPIDDPKYFDDENGEY 58 >gi|5712708|gb|AAD47618.1| HsdS variable domain [Lactococcus lactis] Length = 172 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/167 (11%), Positives = 47/167 (28%), Gaps = 8/167 (4%) Query: 237 FFALVTELNRKNTKLIESNILSLSYG--NIIQKLETRNMGLKPESYETY--QIVDPGEIV 292 + E+ L G KL ++ S + Y V + + Sbjct: 10 ITDFHKQGFYTKESYNENKKYYLLRGTDMTSNKLILKDTPKINASEKDYEDFKVLKDDFL 69 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 ++S G + + ++ + + S ++ + Sbjct: 70 IVRSGTVGTYAIVKSDITAIFGSYLINFRFNQSIVLNEFFGLFYQSSLFKSQLNKIIQKS 129 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 ++ E++K + P I+EQ I ID + ++ Sbjct: 130 SNVNINAENIKSTNIKFPTIEEQQKIGAF----FQSIDDTIALHQRK 172 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 45/166 (27%), Gaps = 8/166 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W+ + T + K + D+ S + Sbjct: 1 DWEERKLSEITDFHKQGFYTKESYNENKKYYLLRGTDMTSNKLILKDTPKINASEKDYED 60 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLV---LQPKDVLPELLQGWLLSIDVTQ 135 K L + G AI+ +L+ VL E + S Sbjct: 61 FKVLKDDFLIVRSGTVGTYAIVKSDITAIFGSYLINFRFNQSIVLNEFFGLFYQSSLFKS 120 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 ++ I + ++ + + + I + + P + EQ I + I Sbjct: 121 QLNKIIQKSSNVNINAENIKSTNIKFPTIEEQQKIGAFFQSIDDTI 166 >gi|269978364|gb|ACZ55916.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 200 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 15/114 (13%), Positives = 44/114 (38%), Gaps = 1/114 (0%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSG 352 I + + ++ +V P + YL +++ + + S Sbjct: 65 NTITIAQYGTAGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISNRSA 124 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + S+ ++ ++ + +PP++ Q +I +++ + L+ I I K++ Sbjct: 125 IPYSISSNNIMQITIPIPPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQ 178 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 42/156 (26%), Gaps = 11/156 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 PK + + ++ G+ + + GKY G Sbjct: 13 PKGVEFKKLGEVCEIIRGKRVTKKEIL----------DKGKYPVVSGGIGFMGYLNEYNR 62 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + G + + + PK+ L ++L+ Sbjct: 63 EENTITIAQYGT-AGFVNWQNQKFWANDVCFSVIPKETLINRYLYYVLTNMQNYLYSISN 121 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 A I I +PIPPL Q I + + Sbjct: 122 RSAIPYSISSNNIMQITIPIPPLEIQQEIVKILDQF 157 >gi|293402585|ref|ZP_06646712.1| putative type I restriction-modification system, S subunit [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291303977|gb|EFE45239.1| putative type I restriction-modification system, S subunit [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 186 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 13/148 (8%), Positives = 48/148 (32%), Gaps = 2/148 (1%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R++ K + + + + + V +++ + Sbjct: 29 RRDMKNEGIPVYEQQHAIYNNRQFRYYIDEIKFNEMKRFQVQTDDLIISCSGTVGRVSII 88 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-VFYAMGSGLRQSL-KFEDVK 363 + + + + YL + S + + ++ ++ K + ++ Sbjct: 89 KEDDPKGIISQALLLLRINTEKVLPLYLKYFFSSREGYNAIISRSSGSVQVNIAKRDVIE 148 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDV 391 ++P+ +PP+ Q I +++ +I+ Sbjct: 149 QIPLKLPPLNCQRKIVEILSFIDNKIEE 176 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 56/183 (30%), Gaps = 16/183 (8%) Query: 24 HWKVVPIKRFTKL---NTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 W + + + G + I + ++ + + Sbjct: 3 EWTNLKLSDVLQEKGYIRGPFGSALKRRDMKNEGIPVYEQQHAIYNNRQF-RYYIDEIKF 61 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAII--ADFDGICSTQ--FLVLQPKDVLPELLQGWLL 129 + ++ G R +II D GI S L + + VLP L+ + Sbjct: 62 NEMKRFQVQTDDLIISCSGTVGRVSIIKEDDPKGIISQALLLLRINTEKVLPLYLKYFFS 121 Query: 130 SIDVTQRIEAICEGATMSHADWKGIG-NIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S + I + G+ + + + IP+ +PPL Q I E + +I+ Sbjct: 122 SREGYNAIISRSSGSVQVNIAKRDVIEQIPLKLPPLNCQRKIVEILSFIDNKIEENRKIN 181 Query: 189 IRF 191 Sbjct: 182 NNL 184 >gi|327183904|gb|AEA32351.1| type I restriction-modification system S subunit [Lactobacillus amylovorus GRL 1118] Length = 344 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 14/129 (10%), Positives = 38/129 (29%), Gaps = 4/129 (3%) Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 L + +L + G + A+ WL L K+ Sbjct: 220 DNYTHDGNYSLIGRQGALCGNVQLTAGKFRNTEHAILVKPNVQVNYYWLFMLLKLEKLNR 279 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + L + + ++ + + + Q + + ++D I++S+ + Sbjct: 280 FSSGAAQPGLAVKTLNKIFIPIADLNLQNEFASF----AQQVDKSKVAIQKSLDETQTLF 335 Query: 408 SSFIAAAVT 416 S + + Sbjct: 336 DSLMQKYFS 344 >gi|221195892|ref|ZP_03568944.1| hypothetical protein ATORI0001_0858 [Atopobium rimae ATCC 49626] gi|221184239|gb|EEE16634.1| hypothetical protein ATORI0001_0858 [Atopobium rimae ATCC 49626] Length = 204 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/187 (10%), Positives = 54/187 (28%), Gaps = 19/187 (10%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPG 289 WE + + ++ + + +N + P + E + G Sbjct: 29 WEQRKLGDIAEVTMGQSPSGTCYTDNPNDAILVQGNADLKNGWVYPRVWTTEITKTASRG 88 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 +++ V+ RG+ I + S ++ Sbjct: 89 DLIMSVRAPVGAMGKTAFDVVLGRGVAG----------IKGDEFLFQALSKIESDGYWTT 138 Query: 350 --GSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 S+ ++++ + P +E+ I + D L+ ++ + LK+ Sbjct: 139 VSAGSTFDSISGDELRNTAINYPSDTEERKRIGYY----FQKFDHLITLHQRKLEKLKQL 194 Query: 407 RSSFIAA 413 + S + Sbjct: 195 KQSMLEK 201 Score = 40.5 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 49/183 (26%), Gaps = 8/183 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + ++ G++ + G R T ++G Sbjct: 29 WEQRKLGDIAEVTMGQSPSGTCYTDNPNDAILVQGNADLKNGWVYPRVWTTEITKTASRG 88 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ P E L L I+ + G+ Sbjct: 89 DLIMSVRAPVGAM-----GKTAFDVVLGRGVAGIKGDEFLFQALSKIESDGYWTTVSAGS 143 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 T + N + P E+ + + IT R +E LK+ KQ+++ Sbjct: 144 TFDSISGDELRNTAINYPSDTEERKRIGYYFQKFDHL---ITLHQRKLEKLKQLKQSMLE 200 Query: 205 YIV 207 + Sbjct: 201 KMF 203 >gi|330869551|gb|EGH04260.1| Type I restriction enzyme specificity protein HsdS [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 287 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 50/143 (34%), Gaps = 18/143 (12%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + Y P ++ R + N ++ Y + + YL + M++ Sbjct: 54 DKYSYNKPTVLIPRKGSITNIFYVDVPFWNVDTIY----YTDIDYSRVIPKYLYYFMKTI 109 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVETARIDVLV 393 D+ + R SL +K + + +P +K Q +I ++N + L Sbjct: 110 DMMAL---DTGSGRPSLTQAILKEILIPIPCPDDSKKSLKIQAEIVRILNTFSELTAELT 166 Query: 394 EKIEQSIVLLKE----RRSSFIA 412 K++ + K+ R ++ Sbjct: 167 AKLKAELKARKKQYNYYRDQLLS 189 >gi|327470622|gb|EGF16078.1| type I restriction modification DNA specificity family protein [Streptococcus sanguinis SK330] Length = 182 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 63/151 (41%), Gaps = 6/151 (3%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + N++ + + ++ + + +I++ + + + Sbjct: 23 SEKWDFVNYLDTGSLTKNVVSEYQEIDLQNDKLPSRARRKISVNDILYSTVRPNQEHYGI 82 Query: 306 RSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMG---SGLRQSLKFE 360 +V+ ++++ + + + DS ++ + + ++ + A+G + S+K Sbjct: 83 VK-EVVPNMLVSTGFTVISVNQELADSDFIYYCLTQREVIEHLQAIGEQSTSAYPSIKPT 141 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDV 391 D++ L + +P + EQ +IT+V+ +I+ Sbjct: 142 DIENLELFLPSLNEQREITSVLRALDDKIEN 172 Score = 43.2 bits (100), Expect = 0.088, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 51/178 (28%), Gaps = 10/178 (5%) Query: 24 HWKVVPIKRFT--KLNTGRTSESGKDIIYIGLEDVESG-TGKYLPKDGNSRQSDTSTVSI 80 WK V + L T SE + Y+ + +Y D + + + Sbjct: 3 EWKKVKLGDICQTNLETYSLSEKWDFVNYLDTGSLTKNVVSEYQEIDLQNDKLPSRARRK 62 Query: 81 FAKGQILYGKLGPYLRKAIIADF---DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 + ILY + P I + + ST F V+ L + + Sbjct: 63 ISVNDILYSTVRPNQEHYGIVKEVVPNMLVSTGFTVISVNQELADSDFIYYCLTQREVIE 122 Query: 138 EAICEGA----TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G I N+ + +P L EQ I + A +I+ Sbjct: 123 HLQAIGEQSTSAYPSIKPTDIENLELFLPSLNEQREITSVLRALDDKIENNRKINHHL 180 >gi|294660607|ref|NP_853466.2| type I restriction-modification system specificity subunit domain-containing protein [Mycoplasma gallisepticum str. R(low)] gi|284812270|gb|AAP57034.2| type I restriction-modification system specificity (S) subunit domain protein [Mycoplasma gallisepticum str. R(low)] gi|284930964|gb|ADC30903.1| type I restriction-modification system specificity (S) subunit domain protein [Mycoplasma gallisepticum str. R(high)] Length = 194 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 53/149 (35%), Gaps = 9/149 (6%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-- 324 K T G GE V D N+ + V + + + ++ Sbjct: 50 KGSTPYYGANGIQDYVKGYTHDGEFVLIAEDGANNLLNYPVQYVSGKIWVNNHAHVLQGK 109 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + +++ + ++ + S D+ + G R L + + + +P I+EQ ++ Sbjct: 110 ENILNNKFFSYSINSIDMEQYI---VGGSRSKLNATTLMDIELKIPSIQEQK----LLGN 162 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ID L+ ++ L+ + + + Sbjct: 163 LFYTIDNLLALHQRKCQKLQNIKEAILEK 191 >gi|329919719|ref|ZP_08276675.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners SPIN 1401G] gi|328937238|gb|EGG33664.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners SPIN 1401G] Length = 175 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 17/168 (10%), Positives = 47/168 (27%), Gaps = 9/168 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-----YQ 284 + W+ + + N ++ I + ++ E + Sbjct: 2 ETWKKIRLGDACKTNMYSYSPKEKWNFVNYLDTGNITDNKIDSIQYIDVVNEKLPSRARR 61 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC- 343 V I++ + + +Q + T + + + + + Sbjct: 62 KVKKDSIIYSTVRPNQHHFGIIKSQPENFLVSTGFAVIDTDSQVLDADFLYYLLTQSTIV 121 Query: 344 ---KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + S+K D++ L + +P I Q I +V+ + Sbjct: 122 ESLNAIAEQSTSAYPSIKPSDIENLEIEIPDIATQKKIADVLFSLDKK 169 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 55/173 (31%), Gaps = 10/173 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYL-PKDGNSRQSDTSTVS 79 + WK + + K N S K + Y+ ++ + D + + + Sbjct: 2 ETWKKIRLGDACKTNMYSYSPKEKWNFVNYLDTGNITDNKIDSIQYIDVVNEKLPSRARR 61 Query: 80 IFAKGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 K I+Y + P I + + ST F V+ + + + L T Sbjct: 62 KVKKDSIIYSTVRPNQHHFGIIKSQPENFLVSTGFAVIDTDSQVLDADFLYYLLTQSTIV 121 Query: 137 IEAI----CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + I N+ + IP +A Q I + + + ++ + Sbjct: 122 ESLNAIAEQSTSAYPSIKPSDIENLEIEIPDIATQKKIADVLFSLDKKMAQNM 174 >gi|240146115|ref|ZP_04744716.1| restriction modification system DNA specificity domain protein [Roseburia intestinalis L1-82] gi|257201768|gb|EEV00053.1| restriction modification system DNA specificity domain protein [Roseburia intestinalis L1-82] Length = 197 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 55/177 (31%), Gaps = 7/177 (3%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRK--NTKLIESNILSLSYGNIIQKLETRNMGL 275 E +P+ W ++ + + S+ + Y E ++ L Sbjct: 21 HCINEEIPFDLPEGWNFIRLKCAWELVSGRDLSPSDYNSDNTGIPYITGASNFENGHVSL 80 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + + G+++ + + E I + ++ +L+ Sbjct: 81 VRFTAVPQVLTYKGDLLLTCKGTIGE---IALNNFGEAHIARQIMAIRNIYNLNVEFLSL 137 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + GL + ED+ L + +PP K Q +I ++ +++ + Sbjct: 138 CIEHA--MSEIKQAAKGLIPGISREDILNLIIPIPPEKHQKEIVRNVHDYLEKLNTI 192 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 53/163 (32%), Gaps = 2/163 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ W + +K +L +GR +G + + + Sbjct: 30 DLPEGWNFIRLKCAWELVSGRDLSPSDYNSDNTGIPYITGASNFENGHVSLVRFTAVPQV 89 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG +L G A+ + G ++ +++ ++ L I+ Sbjct: 90 LTYKGDLLLTCKGTIGEIAL--NNFGEAHIARQIMAIRNIYNLNVEFLSLCIEHAMSEIK 147 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + I N+ +PIPP Q I + +++ Sbjct: 148 QAAKGLIPGISREDILNLIIPIPPEKHQKEIVRNVHDYLEKLN 190 >gi|296277376|ref|ZP_06859883.1| type I restriction-modification system S subunit [Staphylococcus aureus subsp. aureus MR1] Length = 192 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 61/166 (36%), Gaps = 8/166 (4%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYG-NIIQKLETRNMGLKPESYETYQIVDPGE 290 WE K L + RKN L L++S +I + E + + ++ E Y ++ GE Sbjct: 13 WEEKQLGDLTDRVIRKNKNLESKKPLTISGQLGLIDQTEYFSKSVSSKNLENYTLIKNGE 72 Query: 291 IVFRFIDLQNDK-RSLRSAQVMERGIITSAYMAVKPHGIDSTYLA--WLMRSYDLCKVFY 347 + +++ + G+++S Y+ S + ++ +V Sbjct: 73 FAYNKSYSNGYPLGAIKRLTRYDSGVLSSLYICFSIKSEMSKDFMEAYFDSTHWYREVSG 132 Query: 348 AMGSGLRQ----SLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 G R ++ D + + P ++EQ I + +I Sbjct: 133 IAVEGARNHGLLNVSVNDFFTILIKYPSLEEQQKIGKFFSKLDRQI 178 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 45/169 (26%), Gaps = 13/169 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSE-SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + T + K + I + +Y K +S+ + ++ Sbjct: 12 EWEEKQLGDLTDRVIRKNKNLESKKPLTISGQLGLIDQTEYFSKSVSSKNLE--NYTLIK 69 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G+ Y K G+ S+ ++ K + + R Sbjct: 70 NGEFAYNKSYSNGYPLGAIKRLTRYDSGVLSSLYICFSIKSEMSKDFMEAYFDSTHWYRE 129 Query: 138 EAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + I + P L EQ I + +I Sbjct: 130 VSGIAVEGARNHGLLNVSVNDFFTILIKYPSLEEQQKIGKFFSKLDRQI 178 >gi|213961980|ref|ZP_03390245.1| putative restriction modification system DNA specificity domain protein [Capnocytophaga sputigena Capno] gi|213955333|gb|EEB66650.1| putative restriction modification system DNA specificity domain protein [Capnocytophaga sputigena Capno] Length = 190 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 50/148 (33%), Gaps = 9/148 (6%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + +K + T ++ +I+ + ++ + V + Sbjct: 41 NVDAFVKEDEKYTKNLLLANDILLPSKGNRIFATLFQAQWGKAVASSIFYVLRVDTSIVL 100 Query: 330 STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV--INVET 386 TYL ++ + + MG G SL+ ++++ L + +P + Q I + + Sbjct: 101 PTYLVAILNLPQYQQQLWQMGGGSNIFSLRKKELEDLQIPLPSFEVQQQIATFNLLFQQK 160 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + + K E+ + + I Sbjct: 161 NILRQQIIKKERQLH------QAIIQQL 182 >gi|291556525|emb|CBL33642.1| Type I restriction-modification system methyltransferase subunit [Eubacterium siraeum V10Sc8a] Length = 535 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 68/182 (37%), Gaps = 6/182 (3%) Query: 26 KVVPIKRFTKLNTGR---TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K + +K + G+ ++ I + ++ Y D + + I Sbjct: 351 KKLRLKDAATVFRGKAVNAKAESGNVAVINISNITDTGIDYEHLDQIEEEERKVSRYILE 410 Query: 83 KGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPELLQG-WLLSIDVTQRIEA 139 G +L G ++ A+ IC S V++PKD+L +L S + +++ Sbjct: 411 DGDVLVTARGTTVKIAVFEKQPMICIPSANINVIRPKDMLRGAYLKLFLESPVGIKMLQS 470 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + G + + ++K I + +P+ PL Q + E+ I +++ Sbjct: 471 LQRGTVVVNINYKDIIELEVPVLPLEAQDALIEEYNTGLRFYKETIAAAEEGWRGVQQGI 530 Query: 200 QA 201 Q+ Sbjct: 531 QS 532 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 51/155 (32%), Gaps = 6/155 (3%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K + I + + E + + E + I++ G+++ + Sbjct: 370 KAESGNVAVINISNITDTGIDYEHLDQIEEEERKVSRYILEDGDVLVTARGTT---VKIA 426 Query: 307 SAQVMERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVK 363 + I SA + + YL + S K+ ++ G ++ ++D+ Sbjct: 427 VFEKQPMICIPSANINVIRPKDMLRGAYLKLFLESPVGIKMLQSLQRGTVVVNINYKDII 486 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 L V V P++ Q + N + E+ Sbjct: 487 ELEVPVLPLEAQDALIEEYNTGLRFYKETIAAAEE 521 >gi|210630775|ref|ZP_03296599.1| hypothetical protein COLSTE_00484 [Collinsella stercoris DSM 13279] gi|210160371|gb|EEA91342.1| hypothetical protein COLSTE_00484 [Collinsella stercoris DSM 13279] Length = 226 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 19/182 (10%), Positives = 55/182 (30%), Gaps = 11/182 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G + + + + + + +G+ + + + ++ Sbjct: 34 LGDCFEFLKNNTLSRADLNDENGIARNVHYGDILIKFGDCLDGERSDLPFITDDTVLPKF 93 Query: 285 ---IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST---YLAWLMR 338 I+ G+++F + + + S + YL + Sbjct: 94 AGSILREGDVIFADTAEDEAAGKCVELRKLPKEPTISGLHTIPARPRFPFGTGYLGHYLN 153 Query: 339 SYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S + + G++ S+ ++ V P + EQ I ++ ID L+ + Sbjct: 154 SDAYHRQLLPLMQGIKVISVSKAVLQDTQVRFPSLSEQSTIGATLSG----IDDLITLHQ 209 Query: 398 QS 399 + Sbjct: 210 RE 211 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 53/198 (26%), Gaps = 20/198 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-------------DIIYIGLEDVESGTGKYLPKDGN 69 W+ + + T I I D G LP + Sbjct: 27 SSWEQRKLGDCFEFLKNNTLSRADLNDENGIARNVHYGDILIKFGDCLDGERSDLPFITD 86 Query: 70 SRQSDTSTVSIFAKGQILYGKL------GPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 SI +G +++ G + + I + +P+ Sbjct: 87 DTVLPKFAGSILREGDVIFADTAEDEAAGKCVELRKLPKEPTISGLHTIPARPRFPFGTG 146 Query: 124 LQ-GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 +L S +++ + +G + + + + P L+EQ I + I Sbjct: 147 YLGHYLNSDAYHRQLLPLMQGIKVISVSKAVLQDTQVRFPSLSEQSTIGATLSGIDDLIT 206 Query: 183 TLITERIRFIELLKEKKQ 200 E ++ K Q Sbjct: 207 LHQREPPHMMKEGKNANQ 224 >gi|32476970|ref|NP_869964.1| Type I restriction enzyme EcoBI specificity protein [Rhodopirellula baltica SH 1] gi|32447518|emb|CAD79107.1| probable Type I restriction enzyme EcoBI specificity protein [Rhodopirellula baltica SH 1] Length = 385 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 93/273 (34%), Gaps = 22/273 (8%) Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 + I +P+PPL EQ I + R ++L ++ Q++ + NP Sbjct: 1 MEKIEIPLPPLDEQRRIAAVLDKADALRRQ----RQESLQLTEKLLQSVFEEMFG---NP 53 Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 K+ I +G + + V + + + L L + + Sbjct: 54 RENPKNWDIVPLGELVADDDA--INYGVVQPGKDFPSGVPMIRLGDLANPDPTMLNVKRI 111 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 ++ + GE++ + + A+ I + GI + ++ Sbjct: 112 DPTIDASCARSRLAGGEVLVGCVGHTIGVACIAPAEWAGANIARAVARIRVKPGIPAEFI 171 Query: 334 AWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +R+ + F + +L + +K P+L+PP + + + + L Sbjct: 172 LQQIRTPAIQHFFRGERRIVGQPTLNIKQIKETPILLPP--------HKLCDQFVKFYRL 223 Query: 393 V----EKIEQSIVLLKERRSSFIAAAVTGQIDL 421 ++S L++ ++ A G++DL Sbjct: 224 TVDGHSDKQKSTTLVEALFAAIQQRAFRGELDL 256 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 65/203 (32%), Gaps = 16/203 (7%) Query: 22 PKHWKVVPIKRFT----KLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSD 74 PK+W +VP+ +N G + I L D+ + L D Sbjct: 57 PKNWDIVPLGELVADDDAINYGVVQPGKDFPSGVPMIRLGDLANPDPTMLNVKRIDPTID 116 Query: 75 TS-TVSIFAKGQILYGKLGPYLRKAIIADFDGI---CSTQFLVLQPKDVLP-ELLQGWLL 129 S S A G++L G +G + A IA + + ++ K +P E + + Sbjct: 117 ASCARSRLAGGEVLVGCVGHTIGVACIAPAEWAGANIARAVARIRVKPGIPAEFILQQIR 176 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + + + K I P+ +PP + +++ Sbjct: 177 TPAIQHFFRGERRIVGQPTLNIKQIKETPILLPPHKLCDQFVKF----YRLTVDGHSDKQ 232 Query: 190 RFIELLKEKKQALVSYIVTKGLN 212 + L++ A+ L+ Sbjct: 233 KSTTLVEALFAAIQQRAFRGELD 255 >gi|78777139|ref|YP_393454.1| restriction modification system DNA specificity subunit [Sulfurimonas denitrificans DSM 1251] gi|78497679|gb|ABB44219.1| Restriction modification system DNA specificity domain [Sulfurimonas denitrificans DSM 1251] Length = 195 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 61/177 (34%), Gaps = 6/177 (3%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 ++++ L + + + + E + +V G+I+ R Sbjct: 17 LNRKKADMSKDQKLYYSVVSLKSFNEDAVYDNTFADEFISNEQIKEDYLVKQGDILLR-- 74 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGL 353 L+ ++ + E I S + ++ +D ++A + S + + + Sbjct: 75 -LREPNFAVYIDKEYENLIYPSLMVRVKIQDTRLDPHFIAHYLNSTIVRRALSTELSGTT 133 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +K DV ++ + + + +Q I + + ++L I Q KE + Sbjct: 134 IPMIKVADVNKIKIPLINLDKQKKIVEYLKLAHQENELLQNLINQKQKYSKEIFETL 190 Score = 39.0 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 46/160 (28%), Gaps = 13/160 (8%) Query: 28 VPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTG-KYLPKDGNSRQSDTST 77 + + ++ TG K + L+ D Sbjct: 3 IKLNDIAEIKTGLVLNRKKADMSKDQKLYYSVVSLKSFNEDAVYDNTFADEFISNEQIKE 62 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVT 134 + +G IL P I +++ + +V P + +L S V Sbjct: 63 DYLVKQGDILLRLREPNFAVYIDKEYENLIYPSLMVRVKIQDTRLDPHFIAHYLNSTIVR 122 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + + G T+ + I +P+ L +Q I E + Sbjct: 123 RALSTELSGTTIPMIKVADVNKIKIPLINLDKQKKIVEYL 162 >gi|257453342|ref|ZP_05618641.1| type I restriction-modification system specificity subunit [Fusobacterium sp. 3_1_5R] gi|317059873|ref|ZP_07924358.1| conserved hypothetical protein [Fusobacterium sp. 3_1_5R] gi|313685549|gb|EFS22384.1| conserved hypothetical protein [Fusobacterium sp. 3_1_5R] Length = 236 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 82/240 (34%), Gaps = 19/240 (7%) Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 + L+++ QAL NP+ K +G + K + Sbjct: 1 NDNLEQQAQALFKEWFID--NPEKKNWSNGTFSDLIQSTLSGDWGKEVATRNNTEKVYCI 58 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 ++I + GN + + S + ++ G+IV + + R + Sbjct: 59 RGADIPEVKAGNKGKMPIRYILPKNYASKK----LNAGDIVVEISGGSPTQSTGRCTAIS 114 Query: 312 ER--------GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFED 361 E I T+ A+KP S ++ + + VF++ G+ ++L Sbjct: 115 ESLLNRYDSGMICTNFCRAIKPISGYSIFIYYYWQHLYDKGVFFSYENGTTGIKNLDISG 174 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +V P+KE I N I + + L + R + + ++G+ID+ Sbjct: 175 FLETEPIVIPLKE--KILEF-NDYCQTIFNQIFSHGKESEYLVQLRDTLLNKLMSGEIDV 231 >gi|83721596|ref|YP_443256.1| type I restriction-modification system specificity determinant [Burkholderia thailandensis E264] gi|83655421|gb|ABC39484.1| type I restriction-modification system specificity determinant XF2741 [Burkholderia thailandensis E264] Length = 398 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 51/142 (35%), Gaps = 11/142 (7%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-----YMAVKPHGIDS 330 E + G+ + I + + G++ ++ K + DS Sbjct: 16 TREFTGSGTRFQNGDTLIARITPCLENGKTAYISELPEGVVAHGSTEYIVLSGKVNQSDS 75 Query: 331 TYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + +L+RS D + + G+ RQ + V+R +PP+ EQ I ++ Sbjct: 76 LFGYYLVRSPDFRRHAIGHMEGTSGRQRVPSSAVERYSTRLPPLAEQRAIAKILGS---- 131 Query: 389 IDVLVEKIEQSIVLLKERRSSF 410 +D +E + L+ + Sbjct: 132 LDDKIELNRERSETLEAMGRAL 153 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 44/132 (33%), Gaps = 12/132 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ WK++ + N + G+ Y+ + + + P S Sbjct: 191 ELPEGWKLLKASELIEFNPTESLRKGEVAPYLDMASLPTQGSWPDPYVMRPFGSGMR--- 247 Query: 80 IFAKGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSI 131 F G L ++ P L + D G ST+++V++PK +P + Sbjct: 248 -FRNGDTLLARITPCLENGKTAFIQCLPDDVVGWGSTEYIVMRPKGPVPAAFAYLLARND 306 Query: 132 DVTQRIEAICEG 143 + G Sbjct: 307 AFREHAIRSMTG 318 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 54/378 (14%), Positives = 124/378 (32%), Gaps = 39/378 (10%) Query: 79 SIFAKGQILYGKLGPY---LRKAIIADFDGIC----STQFLV--LQPKDVLPELLQGWLL 129 + F G L ++ P + A I++ ST+++V + + Sbjct: 24 TRFQNGDTLIARITPCLENGKTAYISELPEGVVAHGSTEYIVLSGKVNQSDSLFGYYLVR 83 Query: 130 SIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 S D + EG + + +PPLAEQ I + + + +I+ Sbjct: 84 SPDFRRHAIGHMEGTSGRQRVPSSAVERYSTRLPPLAEQRAIAKILGSLDDKIELNRERS 143 Query: 189 IRFIELLKEKKQALVS-----YIVTKGLNPD--VKMKDSGIEWV--GLVPDHWEVKPFFA 239 + + + +G +P ++ D E + +P+ W++ Sbjct: 144 ETLEAMGRALFKDWFVDFGPVRAKQEGRSPYLPREIWDLFPERLDTNELPEGWKLLKASE 203 Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 L+ ++ + E Q ++P + + G+ + I Sbjct: 204 LIEFNPTESLRKGEVAPYLDMASLPTQGSWPDPYVMRP--FGSGMRFRNGDTLLARITPC 261 Query: 300 NDKRSLRSAQVMERGII---TSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGL-- 353 + Q + ++ ++ Y+ ++P G A+L+ R+ + +G Sbjct: 262 LENGKTAFIQCLPDDVVGWGSTEYIVMRPKGPVPAAFAYLLARNDAFREHAIRSMTGTSG 321 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI-----DVLVEKIEQSIVLLKERRS 408 RQ + + V + P + + A I D + E S+ L K R Sbjct: 322 RQRAQGDAVAAYQLAAPLWD------DKLWAVLASIVSLLFDGIRSNSETSVNLAK-MRD 374 Query: 409 SFIAAAVTGQIDLRGESQ 426 + + + G + ++ + Sbjct: 375 NLLPMLIAGALRVKNAER 392 >gi|313605681|gb|EFR83056.1| type I restriction-modification system specificity subunit [Listeria monocytogenes FSL F2-208] Length = 192 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 70/184 (38%), Gaps = 12/184 (6%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNI-IQKLETRNMGLKPESYETYQIVDPGEIV 292 + ++ ++ RKN +L + L++S + I + E N + + Y +V GE Sbjct: 6 QRKLNSITEKITRKNKELESTLPLTISAQDGLIDQNEYFNKIIASRNIRGYFLVKNGEFA 65 Query: 293 FRFIDLQNDKRSLRSAQVMER-GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + + G++++ Y+ KP I+S +L S + + Sbjct: 66 YNKSYSKGYPWGVVKRLDNYNMGVLSTLYIIFKPVKINSDFLTKYFDSTYWYRAVSQFAT 125 Query: 352 -GLRQ----SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G R ++ D + + +P +EQ I +++ ++ + + L Sbjct: 126 EGARNHGLLNIAASDFFEIELNIPLNNEEQKKIGLF----FQQLENIIILHQNKLEKLSI 181 Query: 406 RRSS 409 + + Sbjct: 182 LKKT 185 >gi|320528570|ref|ZP_08029727.1| type I restriction modification DNA specificity domain protein [Solobacterium moorei F0204] gi|320131156|gb|EFW23729.1| type I restriction modification DNA specificity domain protein [Solobacterium moorei F0204] Length = 202 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 20/150 (13%), Positives = 54/150 (36%), Gaps = 3/150 (2%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K + +I +S ++ + + + E + + I + + + Sbjct: 49 KVISYWQGDIPWISSSDLFENNIRDINVSRYITKEAIKCSAAKLCPKKTICIVSRVGVGK 108 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 A E + +M + + +LA L+++ + + +++K + Sbjct: 109 VAVTTEFLCTSQDFMNITHFEGNKYFLAQLIQNKIKSSQLQ---GTSIKGITSKEIKDMR 165 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKI 396 + +P EQ I +N+ RI+ ++ I Sbjct: 166 LFIPSRAEQDKIVKFLNIIDQRIETQIKII 195 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 57/178 (32%), Gaps = 13/178 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLP--KDGNSRQSDTS 76 WK + + G T + DI +I D+ + + + S Sbjct: 29 WKTYKVDNIIESCGGGTPSTKVISYWQGDIPWISSSDLFENNIRDINVSRYITKEAIKCS 88 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + K I + K + S F+ E + +L + + Sbjct: 89 AAKLCPKKTICIVS-RVGVGKVAVTTEFLCTSQDFM----NITHFEGNKYFLAQLIQNKI 143 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +G ++ K I ++ + IP AEQ I + + RI+T I + L Sbjct: 144 KSSQLQGTSIKGITSKEIKDMRLFIPSRAEQDKIVKFLNIIDQRIETQIKIISDYNSL 201 >gi|254304353|ref|ZP_04971711.1| site-specific DNA-methyltransferase (adenine-specific) [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148324545|gb|EDK89795.1| site-specific DNA-methyltransferase (adenine-specific) [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 718 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 43/350 (12%), Positives = 93/350 (26%), Gaps = 15/350 (4%) Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 N + +S+ ILY +RK + + + L ++ Sbjct: 364 INQLKEKGKGISLVKTN-ILYEPKNKNIRKYFVENGYIE---SIIYLPKNMLIDYPFPLA 419 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L+ + + + I I + + I I + Sbjct: 420 LIVFSKENKKIKFIDAYKFCKMEKFKIEFIDNYFKNPKISEIKEQNINIIIDTNVEKIID 479 Query: 188 RIRFIELLKEKKQALVSYIVTKGLN-----PDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I + +KE + IV K N + D ++ + +K Sbjct: 480 LINNQKNIKESFSKKIEDIVEKDYNLVVTENFEILVDILKKFKNEIKFKDIIKNIVRGSQ 539 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDL 298 + K E+ + LS +I L + + I+ Sbjct: 540 KTISKFKSEEETQYIYLSLSDINDGLIEFKNIENYLKEVPKNQEKFFIKNNSILLSKYGS 599 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-- 356 + + + + + + S ++ Sbjct: 600 SPKLAISQIPDDKKVIPSGNFIIIEVDEEKLNPWYLMSYFSSGFGSEKLKETYTEAKNDT 659 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + ++ + + VPPIKEQ I +I+ + +K++ I KE Sbjct: 660 ISIRKLENIEIPVPPIKEQEKIAKEYRESLKKIEEMKKKLKNEIQNSKEI 709 >gi|289168439|ref|YP_003446708.1| restriction endonuclease S subunit [Streptococcus mitis B6] gi|288908006|emb|CBJ22846.1| restriction endonuclease S subunit [Streptococcus mitis B6] Length = 217 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 59/190 (31%), Gaps = 6/190 (3%) Query: 26 KVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQ--SDTSTVSI 80 + + + G K I ++V++G+ + S + + S Sbjct: 16 EWKTLGEVCDVRDGTHDSPNKKAFGKYLITSKNVKNGSINFDSAYFISESDFDNINKRSK 75 Query: 81 FAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +L+ +G A I + L+ +L L +L S I + Sbjct: 76 VDIDDLLFTMIGTVGEIAHITEEPDFAIKNVGLIKTQSRILARYLLHYLQSTYAKDYISS 135 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + + N P+P Q I + + + L + IEL +++ Sbjct: 136 NSSKGSQVFLGLGKLRNFPIPYVEPKIQSRIVQVLDNFDTVCNDLNIGLPKEIELRQKQY 195 Query: 200 QALVSYIVTK 209 + ++T Sbjct: 196 EYFREKLLTF 205 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 22/174 (12%), Positives = 55/174 (31%), Gaps = 9/174 (5%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-----KPESYETYQIVDPGEIVF 293 V + + ++ N+ + ++ VD +++F Sbjct: 24 CDVRDGTHDSPNKKAFGKYLITSKNVKNGSINFDSAYFISESDFDNINKRSKVDIDDLLF 83 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-G 352 I + + + + I + + I + YL ++S + S G Sbjct: 84 TMIGTVGEIAHI--TEEPDFAIKNVGLIKTQS-RILARYLLHYLQSTYAKDYISSNSSKG 140 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + L ++ P+ K Q I V++ + L + + I L +++ Sbjct: 141 SQVFLGLGKLRNFPIPYVEPKIQSRIVQVLDNFDTVCNDLNIGLPKEIELRQKQ 194 >gi|283769289|ref|ZP_06342190.1| type I restriction modification DNA specificity domain protein [Bulleidia extructa W1219] gi|283104099|gb|EFC05481.1| type I restriction modification DNA specificity domain protein [Bulleidia extructa W1219] Length = 174 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 59/172 (34%), Gaps = 13/172 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYL---PKDGNSRQS 73 W I + G T + K I +I +D+ + +G+Y+ ++ Sbjct: 3 EWIECKISDIGTVVGGATPSTKKPENYENGTIAWITPKDLSTFSGRYIQHGERNITKTGL 62 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + + + K +L+ P IA D + F + P + L + L Sbjct: 63 KSCSTQLLPKNTVLFSSRAPI-GYVAIAANDVCTNQGFKSVIPNE-NTNPLFLYYLLKYN 120 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTL 184 +IE + G T + NI + +P Q I + + +I+ Sbjct: 121 KDKIEGMGSGTTFKEVSGNTMKNIVVSVPTDKKVQERISSMLGSIDDKIEEN 172 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 16/133 (12%), Positives = 43/133 (32%), Gaps = 6/133 (4%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 ++ + RN+ + Q++ ++F Sbjct: 44 TFSGRYIQHGERNITKTGLKSCSTQLLPKNTVLFSSRAPIGYVAIAA-----NDVCTNQG 98 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDI 378 + +V P+ + + + Y+ K+ + + +K + V VP K Q I Sbjct: 99 FKSVIPNENTNPLFLYYLLKYNKDKIEGMGSGTTFKEVSGNTMKNIVVSVPTDKKVQERI 158 Query: 379 TNVINVETARIDV 391 ++++ +I+ Sbjct: 159 SSMLGSIDDKIEE 171 >gi|327330732|gb|EGE72478.1| type I restriction enzyme EcoR124II specificity protein [Propionibacterium acnes HL097PA1] Length = 91 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 11/85 (12%), Positives = 30/85 (35%), Gaps = 10/85 (11%) Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-- 392 W+ + K+ + L E +K +P+ +P ++ Q I +V++ ++ + Sbjct: 5 WIFHMLKVMKLSQFATKSAQPGLSVERLKSVPIPIPSLENQKRIASVLDKFDVLVNDINV 64 Query: 393 -----VEKIEQSIVLLKERRSSFIA 412 + + R + Sbjct: 65 GIPAEIAARRKQYEY---YRDKLLT 86 >gi|327398990|ref|YP_004339859.1| hypothetical protein Hipma_0830 [Hippea maritima DSM 10411] gi|327181619|gb|AEA33800.1| hypothetical protein Hipma_0830 [Hippea maritima DSM 10411] Length = 501 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 40/374 (10%), Positives = 99/374 (26%), Gaps = 33/374 (8%) Query: 35 KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY 94 K + + + + +G+ +YL ++ I +G IL+ G Sbjct: 68 KFIRTKAFTPYSFLPDLSI----NGSFEYLRPKDFENAKGKNSQRIIKEGDILFVTGGNV 123 Query: 95 LRKAIIADF--DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWK 152 I + + I S+ L L + + + +L + + + D Sbjct: 124 GEVVIADEILDNSIPSSHILKLFFDNKIKYYILAFLK-NEFCKIQSNFGPIGAIGGLDTF 182 Query: 153 GIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLN 212 + P Q E I + +I + K + + ++ Sbjct: 183 DKDTLLSISIPFPNQKNSDEVIEYVELLTKAIINKEKEIRRKHKLILEKIEKELLENQKP 242 Query: 213 PDVKMKDSGIEWVGLVPD-----HWEVKPFFALVTELNRKNTKLIES------------- 254 + K I + V + ++ + + + +E Sbjct: 243 NKFEYKLPDILEIEKVGRLDTKLYKRNFKYYEFLIQNYKGGFFYLEESDLRGGSTPKQNE 302 Query: 255 ------NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID-LQNDKRSLRS 307 ++ I + N+ + I I+ + + Sbjct: 303 RIFGKGEFTWVTPTFISKYGYLDNIEKIAIKSKKNNIKRNCLILINRGNKEDLIRGFYYD 362 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLP 366 + + G ++ + +L L+ S + G +K + +P Sbjct: 363 YKDLGEGHHNQGCYRIENGNYNLIFLTALLNSQFYRNFVSNLSVGSKMPEIKISQIINIP 422 Query: 367 VLVPPIKEQFDITN 380 P +Q +I Sbjct: 423 FPNFPESKQKEIAE 436 Score = 36.7 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 18/164 (10%), Positives = 53/164 (32%), Gaps = 5/164 (3%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + + L N + + + +I+ G+I+F + Sbjct: 69 FIRTKAFTPYSFLPDLSINGSFEYLRPKDFENAKGKNSQRIIKEGDILFVTGGNVGEV-- 126 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK---FED 361 + + ++++ I +S + + Y+ +++ G L + Sbjct: 127 VIADEILDNSIPSSHILKLFFDNKIKYYILAFLKNEFCKIQSNFGPIGAIGGLDTFDKDT 186 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + + + P K ++ + + T I ++I + L+ E Sbjct: 187 LLSISIPFPNQKNSDEVIEYVELLTKAIINKEKEIRRKHKLILE 230 >gi|325973134|ref|YP_004250198.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323651736|gb|ADX97818.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 289 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 51/147 (34%), Gaps = 9/147 (6%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 N + ++ P + F D L+ + I ++ Sbjct: 51 DSKSNRYFNQQGVNQNKLFPPHTVCFVRCGSVGDCSILKENACLTESIYAFSFFEGIS-- 108 Query: 328 IDSTYLAWLMRSYDL-CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 D ++ + + K+ + + R L F+ ++ + PP +EQ I ++++ Sbjct: 109 -DPKFIKYCFDFPKIKQKILHLSNTTTRNILSFQKLQLIKFPCPPPQEQKLIGDILSA-- 165 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAA 413 D L E ++ I +L R+ I Sbjct: 166 --YDELFENNKRQIEILNRVRT-LIYK 189 >gi|324990381|gb|EGC22319.1| type I restriction-modification system specificity subunit [Streptococcus sanguinis SK353] Length = 191 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 41/108 (37%), Gaps = 6/108 (5%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + YET + G++V + ++ + + + +D Y + Sbjct: 50 TDKLYETSLSLVAGDVVIS---SPSRLATIVGEDNEGKFLTLNFIKVNIKGRLDKFYFLY 106 Query: 336 LMR-SYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 L S D+ + G+G + + ++R+ + +P I+EQ I Sbjct: 107 LFNQSRDVQRQKERELQGTGTSMRIPVKSLERIRIPLPSIEEQEKIGQ 154 >gi|298736618|ref|YP_003729144.1| type I restriction enzyme S protein [Helicobacter pylori B8] gi|298355808|emb|CBI66680.1| type I restriction enzyme S protein [Helicobacter pylori B8] Length = 368 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 47/381 (12%), Positives = 112/381 (29%), Gaps = 53/381 (13%) Query: 44 SGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 + K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 25 NYKKVCYLDTDNITNNKINAFLKIDLTKEKLPSRAKRKCSINSIIYSSVRPNQRHFGIIK 84 Query: 103 F---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKGI 154 + + ST F+V+ + + P L ++ ++ ++ I C ++ Sbjct: 85 EIPKNFLVSTAFIVIDVIDLEKLDPNYLYYYITQDEIIHYLQRIAECGTSSYPSITPLDF 144 Query: 155 GNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 NI + + PL Q I + +I+ Sbjct: 145 LNIKIKLYPLETQQKIARTLSVLDQKIENNHKINELL----------------------- 181 Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 H + + KN KL + I + +++ + Sbjct: 182 ----------------HTLAYKIYEYYFKYKPKNAKLEQIIIENPKSNIMVKNAQKTQDK 225 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + P I+ N + + + ++ + + S YL Sbjct: 226 YPFFTSGDNILSYPKAIIDGRNCFLNTGGNAGIKFYVGKASYSTDTWCICANEF-SDYLY 284 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L+ S + L+ +K+ P+ +P E N + L+ Sbjct: 285 LLLSSIKNHINQSFFQGTSLKHLQKNLLKKYPIYMPSAHE----IKKFNQIMMPLLTLIS 340 Query: 395 KIEQSIVLLKERRSSFIAAAV 415 ++ L++ R + + Sbjct: 341 INTRTSKKLEQIRDFLLPLLL 361 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 61/160 (38%), Gaps = 10/160 (6%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + I I++ + Sbjct: 18 NNYTKEYNYKKVCYLDTDNITNNKINAFLKIDLTKEKLPSRAKRKCSI---NSIIYSSVR 74 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 + ++ + ++++A++ + +D YL + + ++ + G+ Sbjct: 75 PNQRHFGIIK-EIPKNFLVSTAFIVIDVIDLEKLDPNYLYYYITQDEIIHYLQRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 S+ D + + + P++ Q I ++V +I+ Sbjct: 134 SSYPSITPLDFLNIKIKLYPLETQQKIARTLSVLDQKIEN 173 >gi|317010711|gb|ADU84458.1| type I restriction enzyme S protein [Helicobacter pylori SouthAfrica7] Length = 375 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 44/382 (11%), Positives = 112/382 (29%), Gaps = 53/382 (13%) Query: 43 ESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 ++ K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 24 DNYKKVCYLDTDNITNNRINTFLKIDLTKEKLPSRAKRKCSINSIIYSSVRPNQRHFGII 83 Query: 102 DF---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKG 153 + + ST F+V+ + + P L ++ ++ + I C ++ Sbjct: 84 KEIPKNFLVSTAFIVIDVIDLEKLDPNYLYYYITQDEIIHYLHRIAECGTSSYPSITPLD 143 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 NI + + PL Q I + +I+ L Sbjct: 144 FLNIKVKLYPLETQQKIARTLSILDQKIENNHKINELIQTLA------------------ 185 Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 + + KN KL + + + +++ + Sbjct: 186 ---------------------YKIYEYYFKHKPKNAKLEQIILENPKSSIMVKDAQKTQD 224 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + P ++ N + + + ++ + + S YL Sbjct: 225 KYPFFTSGDNILSYPKALIDGRNCFLNTGGNAGIKFYVGKASYSTDTWCICANEF-SDYL 283 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 L+ S + L+ +K+ P+ +P E ++ L+ Sbjct: 284 YLLLSSIKNHINQSFFQGTSLKHLQKNLLKKYPIYMPSKHEIKQFNEIVMPLL----TLI 339 Query: 394 EKIEQSIVLLKERRSSFIAAAV 415 ++ L++ R + + Sbjct: 340 SINTRTSKKLEQIRDFLLPLLL 361 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 67/172 (38%), Gaps = 11/172 (6%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + I I++ + Sbjct: 18 NNYTKEDNYKKVCYLDTDNITNNRINTFLKIDLTKEKLPSRAKRKCSI---NSIIYSSVR 74 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 + ++ + ++++A++ + +D YL + + ++ + + G+ Sbjct: 75 PNQRHFGIIK-EIPKNFLVSTAFIVIDVIDLEKLDPNYLYYYITQDEIIHYLHRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 S+ D + V + P++ Q I +++ +I+ KI + I L Sbjct: 134 SSYPSITPLDFLNIKVKLYPLETQQKIARTLSILDQKIENN-HKINELIQTL 184 >gi|327474705|gb|EGF20110.1| hypothetical protein HMPREF9391_0219 [Streptococcus sanguinis SK408] Length = 204 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 52/131 (39%), Gaps = 7/131 (5%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER---GIITSAYMAVK 324 + Y G+ + I + +++ G ++ ++ V+ Sbjct: 38 FTRDIPEFEYLEYRGGTKFRNGDTLMARITPSLENGKTSKVNLLDEDEVGFGSTEFIVVR 97 Query: 325 --PHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + D ++ +LM + + + + +G+ RQ ++ + VK +L PP+KEQ I Sbjct: 98 AKENISDENFVYYLMIAPSIREVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGK 157 Query: 381 VINVETARIDV 391 ++ +I+ Sbjct: 158 ILKALDDKIEN 168 Score = 44.4 bits (103), Expect = 0.039, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 65/179 (36%), Gaps = 14/179 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +WK V + + N T G I +E +E T + + + F Sbjct: 2 NNWKKVKLSDIIEFNPRETLSKGAIAKKIAMEKLEPFTRDIPEFEY----LEYRGGTKFR 57 Query: 83 KGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G L ++ P L D G ST+F+V++ K+ + + + L I + Sbjct: 58 NGDTLMARITPSLENGKTSKVNLLDEDEVGFGSTEFIVVRAKENISDENFVYYLMIAPSI 117 Query: 136 R---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 R I+++ + + N + PPL EQ+ I + + A +I+ Sbjct: 118 REVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGKILKALDDKIENNKKINHHL 176 >gi|227511533|ref|ZP_03941582.1| conserved hypothetical protein [Lactobacillus buchneri ATCC 11577] gi|227085178|gb|EEI20490.1| conserved hypothetical protein [Lactobacillus buchneri ATCC 11577] Length = 207 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 68/203 (33%), Gaps = 15/203 (7%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQKLETRN 272 M I + + WE + K + + + + YG + K ++ Sbjct: 1 MFYILINAINFLEVAWEQRKLGDWGYFYYGHSAPKWSVVGDGGTPCVRYGELYTKSNSKI 60 Query: 273 MGLKPESYETYQIVD--PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + + + + G V ++ A + + ++V + Sbjct: 61 DHIYSHTNISVKNLKLSKGTEVLIPRVGEDPLDFAHCAWLSIPNVAIGEMISVFNTKQNP 120 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + A+ S + + G +L + + +PV P +KEQ +I T I+ Sbjct: 121 LFTAYSFNSMLKYEFAKRVEGGGVANLYYAYLTNIPVSFPSMKEQTEI-------TQLIE 173 Query: 391 VLVEKI-EQSIVLLKERRSSFIA 412 L+ I L+ +++ ++ Sbjct: 174 NLISLIAANQGKHLQ-IKNALLS 195 Score = 43.6 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 57/186 (30%), Gaps = 8/186 (4%) Query: 25 WKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + + G ++ + ++ + + + + + Sbjct: 16 WEQRKLGDWGYFYYGHSAPKWSVVGDGGTPCVRYGELYTKSNSKIDHIYSHTNISVKNLK 75 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSIDVTQRI 137 + ++L ++G + I + ++ L + + + Sbjct: 76 LSKGTEVLIPRVGEDPLDFAHCAWLSIPNVAIGEMISVFNTKQNPLFTAYSFNSMLKYEF 135 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 EG +++ + + NIP+ P + EQ I + I I + ++ L Sbjct: 136 AKRVEGGGVANLYYAYLTNIPVSFPSMKEQTEITQLIENLISLIAANQGKHLQIKNALLS 195 Query: 198 KKQALV 203 QAL Sbjct: 196 -CQALF 200 >gi|77413788|ref|ZP_00789968.1| type I restriction-modification system, S subunit [Streptococcus agalactiae 515] gi|77160150|gb|EAO71281.1| type I restriction-modification system, S subunit [Streptococcus agalactiae 515] Length = 183 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 7/115 (6%) Query: 5 KAYPQYKD---SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIY---IGLEDVES 58 K Y + D V+ IP W+ V ++ + L+ K Y + +ED+E Sbjct: 65 KPYEKLSDGTIKEVEVPYDIPASWEWVRLRNISSLSFFPNISGDKIPNYSWVLDMEDIEK 124 Query: 59 GTGKYLPKDGNSRQSDT-STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL 112 TG+ + K+ + +S S F+K +LY KL P L+K II+D DG +T+ + Sbjct: 125 ETGRLVRKNYKTEKSSYKSNKVYFSKDTVLYAKLRPNLKKVIISDEDGFATTELI 179 >gi|265752105|ref|ZP_06087898.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_33FAA] gi|263236897|gb|EEZ22367.1| type I restriction enzyme EcoAI specificity protein [Bacteroides sp. 3_1_33FAA] Length = 152 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 50/151 (33%), Gaps = 5/151 (3%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 + I S++ G + +ET N + + K Sbjct: 1 MPEGWAICKMKQITSITNGKSQKNVETLNGIYPIYGSGGVIGRANQYLCIAGSTIIGRKG 60 Query: 304 SLRSAQVMERGIIT--SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFED 361 ++ + +E +A+ I YL + S+D K+ S SL Sbjct: 61 TINNPIFVEEHFWNVDTAFGLKANDAILDKYLYYFCLSFDFSKL---DKSTAMPSLTKTS 117 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + + +PP KEQ I I++ ++ + Sbjct: 118 IGNVLIPIPPYKEQERIVAKIDMVLDTMNEI 148 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 61/162 (37%), Gaps = 16/162 (9%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ W + +K+ T + G++ + +VE+ G Y P G+ + + Sbjct: 2 PEGWAICKMKQITSITNGKSQK-----------NVETLNGIY-PIYGSGGVIGRANQYLC 49 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G + G+ G + + T F + +L + L + LS D + Sbjct: 50 IAGSTIIGRKGTINNPIFVEEHFWNVDTAFGLKANDAILDKYLYYFCLSFDF----SKLD 105 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + M IGN+ +PIPP EQ I KI ++ Sbjct: 106 KSTAMPSLTKTSIGNVLIPIPPYKEQERIVAKIDMVLDTMNE 147 >gi|126173061|ref|YP_001049210.1| restriction modification system DNA specificity subunit [Shewanella baltica OS155] gi|125996266|gb|ABN60341.1| restriction modification system DNA specificity domain [Shewanella baltica OS155] Length = 267 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 56/157 (35%), Gaps = 13/157 (8%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT-----SAY 320 K + + + S+ + +I+ DL N K ++ V E T Sbjct: 106 SKFISTDGLVAKYSHSQICPLFKDDILLVMSDLPNGKALSKTFIVDEDERYTLNQRIGGI 165 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +L + + +G+ + +L+ + + V + P+++Q I Sbjct: 166 TVKDKSEMLPKFLHYYLNRTP---QLLKHDNGVDQTNLRKGQILEVKVPILPLQKQEHIV 222 Query: 380 NVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 ++++ L E + + I L ++ R ++ Sbjct: 223 SILDKFDKLTKSLSEGLPREIELRQKQYEYYRDLLLS 259 Score = 43.2 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 59/194 (30%), Gaps = 31/194 (15%) Query: 5 KAYPQYKD-------SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLED 55 K Y Y+D V+W ++ G+ E +D +I + Sbjct: 57 KQYNYYRDQLLSFEECDVEW----------KTLEEVAHFANGKGHEKDISEDGKFIVV-- 104 Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRK------AIIADFDGICST 109 K++ DG + S + K IL K + D + Sbjct: 105 ----NSKFISTDGLVAKYSHSQICPLFKDDILLVMSDLPNGKALSKTFIVDEDERYTLNQ 160 Query: 110 QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 + + KD L + ++ T ++ G ++ I + +PI PL +Q Sbjct: 161 RIGGITVKDKSEMLPKFLHYYLNRTPQLLKHDNGVDQTNLRKGQILEVKVPILPLQKQEH 220 Query: 170 IREKIIAETVRIDT 183 I + + Sbjct: 221 IVSILDKFDKLTKS 234 >gi|313149744|ref|ZP_07811937.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313138511|gb|EFR55871.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 385 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 45/389 (11%), Positives = 109/389 (28%), Gaps = 37/389 (9%) Query: 29 PIKRFTKL-NTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ + + G T + ++ I + + ++ + I G Sbjct: 10 TLESVCPIMSKGITPKYVESSSVLVINQACIHWDGQRLGNIKYHNEEIPVRK-RILESGD 68 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +L G +G + + P D + G ++++ + + T Sbjct: 69 VLLNATG-----------NGTLGRCCVFICPSDNNTYINDGHVIALSTDRAVILPEVLNT 117 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETV----RIDTLITERIRFIELLKEKKQA 201 + + QV I I + +D I + K K Sbjct: 118 YLSLNDTQAEIYRQYVTGSTNQVDIVFSDIKKMKVPVPSMDEQILFVEVLKQADKSKFGD 177 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 S + NP + + ++ +G + K + + + Y Sbjct: 178 FKSQFIEMFGNPLSLNQKNELKRLGEC--CILNPRRPNIALCDTDKVSFIPMPAVSEDGY 235 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS--- 318 + E + + + + +++F I + + GI Sbjct: 236 LVDMTDEEYGKVK------KGFTYFENNDVLFAKITPCMENGKGAIVHGLTNGIGMGSTE 289 Query: 319 -AYMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + +L L R + G+G ++ + + V +P I+EQ Sbjct: 290 FHVLRLINGISSPYWLLALTRMPIFRERAAKNMSGTGGQKRVSASYLDHFMVGLPAIEEQ 349 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLK 404 + D I++++V L Sbjct: 350 RRF----EAIYKQADKSKSVIQKALVYLN 374 >gi|239620849|ref|ZP_04663880.1| restriction modification system DNA specificity domain-containing protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239516246|gb|EEQ56113.1| restriction modification system DNA specificity domain-containing protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 182 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 56/167 (33%), Gaps = 10/167 (5%) Query: 25 WKVVPIKRFTK-LNTGRT---SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT--STV 78 W+ + + G +E Y+ + D++ T ++ D + +D S Sbjct: 10 WEQRKLGDVASSFDYGLNAAATEYDGQNKYLRITDIDDETHEFSKSDLTTPLADLAMSAD 69 Query: 79 SIFAKGQILYGKLGP-YLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWLLSIDVT 134 + +G +L+ + G + + FDG+ + PE L+ Sbjct: 70 YLLKEGDLLFARTGASVGKTYLYRQFDGMVYFAGFLIRARIGEGADPEFAYQATLTDAYK 129 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + + + + + + +P EQ I + + I Sbjct: 130 KYVAINSQRSGQPGVNAQEYADYQLMLPSKTEQQQIGMTLRSLDDLI 176 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 33/120 (27%), Gaps = 5/120 (4%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 ++ G+++F K L A G D + + Sbjct: 67 SADYLLKEGDLLFARTGASVGKTYLYRQFDGMVYFAGFLIRARIGEGADPEFAYQATLTD 126 Query: 341 DLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 K + + ++ +++P EQ I + +D L+ ++ Sbjct: 127 AYKKYVAINSQRSGQPGVNAQEYADYQLMLPSKTEQQQIGMTL----RSLDDLITLHQRK 182 >gi|116629556|ref|YP_814728.1| restriction endonuclease S subunit [Lactobacillus gasseri ATCC 33323] gi|116095138|gb|ABJ60290.1| Restriction endonuclease S subunit [Lactobacillus gasseri ATCC 33323] Length = 363 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/293 (10%), Positives = 81/293 (27%), Gaps = 15/293 (5%) Query: 106 ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLA 165 T F + + + + ++ T + +K + I + P Sbjct: 69 YVDTPFFLGADGVKVLKCTDKNANYRYLYYALKNAHIPNTGYNRHFKWLKEITINYPDKN 128 Query: 166 EQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWV 225 Q I + +++ +I + + ++ E +A + + K K S IE Sbjct: 129 RQNDIVNILD----KLEYIIKMKSQELDKFDELIKARFVEMFGDPQDSKSKWKKSTIEK- 183 Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 + I + T + + I Sbjct: 184 ------CCTLKSGKTLPRNIENEGGNIPYVKVKDMNSLENTTYITTSTRFVSDKTANKSI 237 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 G ++F ++ + I + +L +++ + Sbjct: 238 FPVGTVIFPKRGG---AIGTNKKRLTKVPICADLNIMGVIPDNTRISSYYLFEYFNMVDL 294 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET-ARIDVLVEKIE 397 + +D+ L + +PP+ Q + N ++ ++ + +V + Sbjct: 295 NTLNNGSSVPQINNKDINPLNINIPPLSLQNEFANFVHQVDKSKFENIVYLNK 347 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 58/169 (34%), Gaps = 6/169 (3%) Query: 25 WKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTV 78 WK I++ L +G+T G +I Y+ ++D+ S Y+ T+ Sbjct: 176 WKKSTIEKCCTLKSGKTLPRNIENEGGNIPYVKVKDMNSLENTTYITTSTRFVSDKTANK 235 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 SIF G +++ K G + ++ + +L + Sbjct: 236 SIFPVGTVIFPKRGGAIGTNKKRLTKVPICADLNIMGVIPDNTRISSYYLFEYFNMVDLN 295 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + G+++ + K I + + IPPL+ Q + I Sbjct: 296 TLNNGSSVPQINNKDINPLNINIPPLSLQNEFANFVHQVDKSKFENIVY 344 >gi|319778993|ref|YP_004129906.1| Type I restriction-modification system, specificity subunit S [Taylorella equigenitalis MCE9] gi|317109017|gb|ADU91763.1| Type I restriction-modification system, specificity subunit S [Taylorella equigenitalis MCE9] Length = 185 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 16/159 (10%), Positives = 52/159 (32%), Gaps = 17/159 (10%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 Y + V+P +V + + + ++ + Sbjct: 36 NGFPVFGGNGIIGKYTDFLYVEPQLLVSCRGAASGNII----ESYPKSFVTNNSLVLEWK 91 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + + L F + + ++++ +P+ +P I + I+ Sbjct: 92 DYRYYEFYKQFLFANPL---FSYSTGSAQPQITIDNIRDVPIPLP-------IFDDISNL 141 Query: 386 TARIDVLVEK-IEQSIV--LLKERRSSFIAAAVTGQIDL 421 TA + + ++++ L R + + ++G++D+ Sbjct: 142 TANLKSISALRYQKTVENSKLALLRDTLLPKLMSGELDV 180 >gi|255021986|ref|ZP_05293994.1| restriction modification system DNA specificity domain [Acidithiobacillus caldus ATCC 51756] gi|254968622|gb|EET26176.1| restriction modification system DNA specificity domain [Acidithiobacillus caldus ATCC 51756] Length = 408 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/120 (25%), Positives = 52/120 (43%), Gaps = 8/120 (6%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCK 344 PG++VF ID +N L + + + ++TS Y P + YL L+R+ Sbjct: 38 YPGDLVFSKIDARNGAVGLIPSSIP-KAVVTSEYPVFTPRADKLRPAYLHHLLRADHFKG 96 Query: 345 VFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQF-DITNVINVET--ARIDVLVEKIEQS 399 SG R+ + E L + VP + EQ IT + T +++ E IE++ Sbjct: 97 ELQRKASGTSGRKRVTPEGFLSLEIPVPSLAEQDVLITAYADALTRAEQLEREAEAIERA 156 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 48/393 (12%), Positives = 113/393 (28%), Gaps = 43/393 (10%) Query: 53 LEDVESGTGKYLPKDGNSRQSDTSTVSIF--AKGQILYGKLGPYLRKAIIAD---FDGIC 107 L D +S T K+ + +++ S+F G +++ K+ + + Sbjct: 7 LGDWQSITIKFSGEVLPRERAEAFKGSMFAAYPGDLVFSKIDARNGAVGLIPSSIPKAVV 66 Query: 108 STQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPL 164 ++++ V P+ P L L + ++ G + +G ++ +P+P L Sbjct: 67 TSEYPVFTPRADKLRPAYLHHLLRADHFKGELQRKASGTSGRKRVTPEGFLSLEIPVPSL 126 Query: 165 AEQVL-IREKIIAETVRIDTLITERIRFIELLKEKKQAL--------------VSYIVTK 209 AEQ + I A T + AL V+ Sbjct: 127 AEQDVLITAYADALTRAEQLEREAEAIERAGWLAFETALGVAPPPPLPDRPVFVARFKDV 186 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNII 265 L M + + G+ + + + L N+ Sbjct: 187 ELWSHEGMLRATVGDQGVRVATCPIVELGTVAAVSYGLQKSPTNRPGTHARPYLRVANVQ 246 Query: 266 QKLETRNMGLK---PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITSAY 320 + + P++ ++ G+I+F + + + E + + Sbjct: 247 RGRLILDKIKTINVPDADMASLRLEVGDILFVEGNGSRAELGRVALWNGEITDCVHQNHI 306 Query: 321 MAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGLRQ--SLKFEDVKRLPVLVPPIKEQF 376 + +P + S F+ G ++ ++ P+ +P I Q Sbjct: 307 IKARPQQSLLLPEFAMAWFNSEAGRDHFFKSGKTTSGLGTINSSVIRTAPIPLPSIAVQK 366 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + + ++ + R S Sbjct: 367 ALISELSAAD-------TSAQAKRSEAATLRQS 392 >gi|319777320|ref|YP_004136971.1| type i restriction-modification system, s subunit [Mycoplasma fermentans M64] gi|318038395|gb|ADV34594.1| Type I restriction-modification system, S subunit [Mycoplasma fermentans M64] Length = 325 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 57/181 (31%), Gaps = 7/181 (3%) Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES-------NILSLSYGNIIQKL 268 +KD E +P++W + K E IL +S + Sbjct: 78 NIKDITEELPFEIPENWMWVRLKNISIINGGFAFKSSEFVSKENGIRILRISDFDERGLK 137 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + K ES ++ IV K L + + + + Sbjct: 138 NNNIVYYKYESKMFDYFLNNKNIVICMTGGTVGKSLLIKELKEKILVNQRVGNIKILNEM 197 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 Y+ +M+S + K+ + ++ E +K + VP I+EQ I N + Sbjct: 198 LPDYVDIVMKSELISKIIRKNKNSTNDNISIELIKLFFIPVPSIEEQLKIIVKYNKLLTQ 257 Query: 389 I 389 + Sbjct: 258 L 258 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 73/202 (36%), Gaps = 10/202 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP++W V +K + +N G +S + I + + D + K +S Sbjct: 89 EIPENWMWVRLKNISIINGGFAFKSSEFVSKENGIRILRISDFDERGLKNNNIVYYKYES 148 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + K I+ G + K+++ + + + ++ + + ++ Sbjct: 149 KMFDYFLNNKN-IVICMTGGTVGKSLLIKELKEKILVNQRVGNIKILNEMLPDYVDIVMK 207 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 ++ +I + +T + + I +P+P + EQ+ I K ++ + Sbjct: 208 SELISKIIRKNKNSTNDNISIELIKLFFIPVPSIEEQLKIIVKYNKLLTQLTLYKNIFLY 267 Query: 191 FIELLKEKKQALVSYIVTKGLN 212 L + ++V K ++ Sbjct: 268 IFPLYIPVNIWYLRFLVNKTIH 289 >gi|238809497|dbj|BAH69287.1| hypothetical protein [Mycoplasma fermentans PG18] Length = 325 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 57/181 (31%), Gaps = 7/181 (3%) Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES-------NILSLSYGNIIQKL 268 +KD E +P++W + K E IL +S + Sbjct: 78 NIKDITEELPFEIPENWMWVRLKNISIINGGFAFKSSEFVSKENGIRILRISDFDERGLK 137 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + K ES ++ IV K L + + + + Sbjct: 138 NNNIVYYKYESKMFDYFLNNKNIVICMTGGTVGKSLLIKELKEKILVNQRVGNIKILNEM 197 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 Y+ +M+S + K+ + ++ E +K + VP I+EQ I N + Sbjct: 198 LPDYVDIVMKSELISKIIRKNKNSTNDNISIELIKLFFIPVPSIEEQLKIIVKYNKLLTQ 257 Query: 389 I 389 + Sbjct: 258 L 258 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 72/202 (35%), Gaps = 10/202 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQS 73 IP++W V +K + +N G +S + I + + D + K +S Sbjct: 89 EIPENWMWVRLKNISIINGGFAFKSSEFVSKENGIRILRISDFDERGLKNNNIVYYKYES 148 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + K I+ G + K+++ + + + ++ + + ++ Sbjct: 149 KMFDYFLNNKN-IVICMTGGTVGKSLLIKELKEKILVNQRVGNIKILNEMLPDYVDIVMK 207 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 ++ +I + +T + + I +P+P + EQ+ I K ++ Sbjct: 208 SELISKIIRKNKNSTNDNISIELIKLFFIPVPSIEEQLKIIVKYNKLLTQLTLYKNIFPY 267 Query: 191 FIELLKEKKQALVSYIVTKGLN 212 L + ++V K ++ Sbjct: 268 IFLLYIPVNIWYLRFLVNKTIH 289 >gi|324990382|gb|EGC22320.1| hypothetical protein HMPREF9388_1537 [Streptococcus sanguinis SK353] Length = 188 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 52/145 (35%), Gaps = 12/145 (8%) Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + +G + L E + Y+ +IV+ +L K + + Sbjct: 50 VEHGVTPKTERYNREFLVREETKKYKYTKYNDIVYNPANL---KFGAIARNKYGEAFFSP 106 Query: 319 AYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIK 373 Y+ + + + ++ ++ S D + G R ++K +D +L + +P Sbjct: 107 IYVTFEANYSNVLPEFIEKILTSNDFIQKALKFQEGTVYERMAVKADDFLKLVIKLPTPP 166 Query: 374 EQFDITNVINVETARIDVLVEKIEQ 398 EQ I + +D L+ ++ Sbjct: 167 EQRAIGSF----FQELDQLITLQQR 187 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 44/163 (26%), Gaps = 7/163 (4%) Query: 25 WKVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + + + ++ +E + + ++ R +T Sbjct: 21 WEQRKLGEVLSERNIQEVPTAQIPLVSFTVEHGVTPKTERYNREFLVR-EETKKYKYTKY 79 Query: 84 GQILYGKLGPYLRKAIIADF-DGICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAI 140 I+Y + + S ++ + PE ++ L S D Q+ Sbjct: 80 NDIVYNPANLKFGAIARNKYGEAFFSPIYVTFEANYSNVLPEFIEKILTSNDFIQKALKF 139 Query: 141 CEG--ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 EG + + +P EQ I I Sbjct: 140 QEGTVYERMAVKADDFLKLVIKLPTPPEQRAIGSFFQELDQLI 182 >gi|325690780|gb|EGD32781.1| hypothetical protein HMPREF9382_0226 [Streptococcus sanguinis SK115] Length = 178 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 53/131 (40%), Gaps = 7/131 (5%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER---GIITSAYMAVK 324 + + G+ + I + +++ G ++ ++ V+ Sbjct: 38 FTRDIPEFEYLEFRGGTKFRNGDTLMARITPSLENGKTSKVNLLDEDEVGFGSTEFIVVR 97 Query: 325 --PHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + D ++ +LM + ++ + + +G+ RQ ++ + VK +L PP+KEQ I Sbjct: 98 AKENISDENFVYYLMIAPNIREVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGK 157 Query: 381 VINVETARIDV 391 ++ +I+ Sbjct: 158 ILKALDDKIEN 168 Score = 44.0 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 64/179 (35%), Gaps = 14/179 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +WK V + + N T G I +E +E T + + + F Sbjct: 2 NNWKKVKLSDIIEFNPRETLSKGAIAKKIAMEKLEPFTRDIPEFEY----LEFRGGTKFR 57 Query: 83 KGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G L ++ P L D G ST+F+V++ K+ + + + L I Sbjct: 58 NGDTLMARITPSLENGKTSKVNLLDEDEVGFGSTEFIVVRAKENISDENFVYYLMIAPNI 117 Query: 136 R---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 R I+++ + + N + PPL EQ+ I + + A +I+ Sbjct: 118 REVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGKILKALDDKIENNKKINHHL 176 >gi|324016949|gb|EGB86168.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 117-3] Length = 484 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 55/171 (32%), Gaps = 10/171 (5%) Query: 242 TELNRKNTKLIESNILSLSYGNIIQK----LETRNMGLKPESYETYQIVDPGEIVFRFID 297 K+ K E + + N + + + Q + G+IV Sbjct: 55 PIQQGKSPKYAEKGLKCIKPKNTNDMLVSIDDIDWIDSSTKDQIQKQKLAYGDIVITRSG 114 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQ 355 R+ E V+P DS Y+ + S+ ++ A GS + Sbjct: 115 SGTIGRA-SIYCYSEEAYTNDHLFVVRPDKADSHYICSFLNSFHGQRLLEAGVSGSTGQL 173 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVI---NVETARIDVLVEKIEQSIVLL 403 +L E +K +P+ P K Q I + + A L ++ I L Sbjct: 174 NLSNEHIKSIPLFRPEHKAQKYIGDKVRQAEQLRAWAKRLEGMADRKIKDL 224 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 41/383 (10%), Positives = 101/383 (26%), Gaps = 25/383 (6%) Query: 36 LNTGRTSE-SGKDIIYIGLED-----VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + G++ + + K + I ++ V ++ D A G I+ Sbjct: 56 IQQGKSPKYAEKGLKCIKPKNTNDMLVSIDDIDWIDSSTK----DQIQKQKLAYGDIVIT 111 Query: 90 K--LGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT- 145 + G R +I + + V++P + +L S + +EA G+T Sbjct: 112 RSGSGTIGRASIYCYSEEAYTNDHLFVVRPDKADSHYICSFLNSFHGQRLLEAGVSGSTG 171 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + I +IP+ P Q I +K+ +K+ + Sbjct: 172 QLNLSNEHIKSIPLFRPEHKAQKYIGDKVRQAEQLRAWAKRLEGMADRKIKDLFHFNLVD 231 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 +T + S + N + + Sbjct: 232 SLTLKPRRMKQQVLSAVSLAPEF--ARAADSQMTFRNSSKLSNFISKCKCGDPIKSEERV 289 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 P T + ++ + + ++ Sbjct: 290 PGPYFYYGASGPIDTHTEFNFNGKYLIIAQDGS----IGCANVADGKFWANNHVWVLKVK 345 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 D + + + C + + E++ + + + I + +I + + + Sbjct: 346 DEYDIESICRFLDKHFPCWK-GVTTGSVVPKVTSENLLNILIPI-DIAKNREIGSKLRLA 403 Query: 386 T---ARIDVLVEKIEQSIVLLKE 405 A L + + L E Sbjct: 404 VTTAAYAKKLTASAKTLVESLIE 426 >gi|328947980|ref|YP_004365317.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328448304|gb|AEB14020.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 162 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 62/173 (35%), Gaps = 18/173 (10%) Query: 13 SGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 S + W ++P +W + + G+ VE+ GKY P G+ Sbjct: 4 SELDW--SLPNNWCLCHFGDIATVINGKNQSK-----------VENPDGKY-PIYGSGGI 49 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + I + G+ G + + T F + + VLP+ L + D Sbjct: 50 MGRADDFICPANCTIIGRKGSINNPIFVEEKFWNVDTAFGLCPSEAVLPKFLYYFCEYFD 109 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 T + T+ I I + +PP+ EQ I +KI+ +D ++ Sbjct: 110 FT----TLDSSTTLPSLTKTNIQQIVLALPPIDEQKRILDKIVELFGILDEIV 158 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 52/155 (33%), Gaps = 7/155 (4%) Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 + + N + + N + + N K Y + I+ + + Sbjct: 7 DWSLPNNWCLCHFGDIATVINGKNQSKVENPDGKYPIYGSGGIMGRADDFICPANCTIIG 66 Query: 303 RSLRSAQVM----ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 R + + + +A+ + +L + +D + S SL Sbjct: 67 RKGSINNPIFVEEKFWNVDTAFGLCPSEAVLPKFLYYFCEYFDFTTL---DSSTTLPSLT 123 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 +++++ + +PPI EQ I + I +D +V Sbjct: 124 KTNIQQIVLALPPIDEQKRILDKIVELFGILDEIV 158 >gi|325680230|ref|ZP_08159792.1| hypothetical protein CUS_5093 [Ruminococcus albus 8] gi|324108047|gb|EGC02301.1| hypothetical protein CUS_5093 [Ruminococcus albus 8] Length = 184 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 69/180 (38%), Gaps = 15/180 (8%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLS-YGNIIQKLETRNMGLKPESYETYQIVDPGE 290 WE + +V + RKN L L++S +I + E + + Y ++ GE Sbjct: 9 WEQRKLSDMVERVTRKNENLESELPLTISAQYGLIDQNEFFDKRIASRDVSGYYLLKKGE 68 Query: 291 IVFRF-IDLQNDKRSLRSAQVMERGIITSAYMAVKPH---GIDSTYLAWLMRSYDLCKVF 346 + +++ E G++++ Y+ IDS +L + K Sbjct: 69 FAYNKSTSSDAPWGAVKRLDRYEMGVLSTLYIVFALKEDGNIDSDFLVSYYDTDCWHKGV 128 Query: 347 YAMGS-GLRQ----SLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSI 400 A+ + G R ++ D + VP +KEQ I A++D L+ ++ + Sbjct: 129 QAIAAEGARNHGLLNITPADYFETVLTVPSDVKEQHQIGTF----FAKLDTLITLHQREL 184 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 51/172 (29%), Gaps = 16/172 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + + T + ++ I + ++ K SR D S + Sbjct: 8 SWEQRKLSDMVERVTRKNENLESELPLTISAQYGLIDQNEFFDKRIASR--DVSGYYLLK 65 Query: 83 KGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPK---DVLPELLQGWLLSIDVT 134 KG+ Y K G+ ST ++V K ++ + L + + Sbjct: 66 KGEFAYNKSTSSDAPWGAVKRLDRYEMGVLSTLYIVFALKEDGNIDSDFLVSYYDTDCWH 125 Query: 135 QRIEAICEGATMSH--ADWKGIGNIPMPIP---PLAEQVLIREKIIAETVRI 181 + ++AI +H + + + EQ I I Sbjct: 126 KGVQAIAAEGARNHGLLNITPADYFETVLTVPSDVKEQHQIGTFFAKLDTLI 177 >gi|269115296|ref|YP_003303059.1| Type I restriction enzyme specificity protein [Mycoplasma hominis] gi|268322921|emb|CAX37656.1| Type I restriction enzyme specificity protein [Mycoplasma hominis ATCC 23114] Length = 393 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 38/336 (11%), Positives = 95/336 (28%), Gaps = 20/336 (5%) Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 + I + G + V+ K + + + ++ Sbjct: 62 NEHSIAISRAGS-AGSVKWVSQKYWATDVCFVVSEKYEVANIKFLYHFLKLRENELKKHI 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + D + + N+ +P+PPL Q I + T L TE + + Sbjct: 121 YGGNLPKLDKQYLWNLKIPLPPLEIQNQIVNILDKFTELTTELTTELTYRDKQYNYYRNK 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+ + K L ++ + + + + E+ KN+ L S Sbjct: 181 LLDFDNNKEL----------LKKIMNNQQYSNNIVEYKKLEEVTLKNSFKQVDAELLSSL 230 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 +++ + + + + VD + + + R + ++ Sbjct: 231 NECKGEVKLLPSSKNYDWFCSIKNVDNFYLNYGEVITFGRARYSNVKYWNGYFLSSNNIT 290 Query: 322 AVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + +L + + S S + + + + +P I Q I Sbjct: 291 IASKDSSILLNKFLYYFLISNSQKFYVE---SSTYRKFENKIFDNFLIPIPHISIQNKIV 347 Query: 380 NVINVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 +++ + + I K+ R + Sbjct: 348 EILDKLETYTRDIQSGLPLEIDQRKKQYEYYRDKLL 383 Score = 45.9 bits (107), Expect = 0.014, Method: Composition-based stats. Identities = 16/126 (12%), Positives = 41/126 (32%), Gaps = 5/126 (3%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 S N +K G P Y + + R + K + + Sbjct: 36 SMMNESEKYPVYGGGTIPTGYYNDFNNEHSIAISRAGSAGSVKWVSQKYWATDVC----- 90 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ++ + + + + + ++ + G L + + L + +PP++ Q I Sbjct: 91 FVVSEKYEVANIKFLYHFLKLRENELKKHIYGGNLPKLDKQYLWNLKIPLPPLEIQNQIV 150 Query: 380 NVINVE 385 N+++ Sbjct: 151 NILDKF 156 >gi|237712396|ref|ZP_04542877.1| type I restriction-modification system specificity determinant [Bacteroides sp. 9_1_42FAA] gi|229453717|gb|EEO59438.1| type I restriction-modification system specificity determinant [Bacteroides sp. 9_1_42FAA] Length = 192 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 60/181 (33%), Gaps = 14/181 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILS--LSYGNIIQKLETRNMGLKPESYETYQ 284 +PD W V L +N E + + N + + ++ E Sbjct: 9 QLPDGWCVVTLKDLCENINGLWKGKKEPFVNVGVIRNANFTKDFKLDYSNIEYIDVEQRT 68 Query: 285 I----VDPGEIVFRFIDLQ-NDKRSLRSAQVMERGIITSAYMAVKPHGID-----STYLA 334 ++ G+++ N+ + G+ + + + + S +L Sbjct: 69 FAKRHLENGDLIVEKSGGSDNNPVGRTILYEGKSGVFSFSNFTMVLRTRNNDIVLSKFLY 128 Query: 335 WLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + + + ++L + +P+ +PP+ EQ I + I +D++ Sbjct: 129 YYILAKYQKGDMRLMQTQTTGLRNLILDKFLSMPIHLPPLSEQKRIIDRIETIFTSLDMI 188 Query: 393 V 393 + Sbjct: 189 M 189 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 60/183 (32%), Gaps = 19/183 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG------KYLPKDGNSRQS 73 +P W VV +K + G GK ++ + + + Y + + Sbjct: 9 QLPDGWCVVTLKDLCENINGL--WKGKKEPFVNVGVIRNANFTKDFKLDYSNIEYIDVEQ 66 Query: 74 DTSTVSIFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVLQPKDVLP----ELL 124 T G ++ K G P R + G+ S + + Sbjct: 67 RTFAKRHLENGDLIVEKSGGSDNNPVGRTILYEGKSGVFSFSNFTMVLRTRNNDIVLSKF 126 Query: 125 QGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + + T + + ++P+ +PPL+EQ I ++I +D Sbjct: 127 LYYYILAKYQKGDMRLMQTQTTGLRNLILDKFLSMPIHLPPLSEQKRIIDRIETIFTSLD 186 Query: 183 TLI 185 ++ Sbjct: 187 MIM 189 >gi|327460989|gb|EGF07322.1| hypothetical protein HMPREF9394_0855 [Streptococcus sanguinis SK1057] Length = 178 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 54/131 (41%), Gaps = 7/131 (5%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER---GIITSAYMAVK 324 + + G+ + I + ++++ G ++ ++ V+ Sbjct: 38 FTRDIPEFEYFEFRGGTKFRNGDTLMARITPSLENGKTSKVNLLDKDEVGFGSTEFIVVR 97 Query: 325 --PHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + D ++ +LM + ++ + + +G+ RQ ++ + VK +L PP+KEQ I Sbjct: 98 AKENISDENFVYYLMIAPNIREVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGK 157 Query: 381 VINVETARIDV 391 ++ +I+ Sbjct: 158 ILKALDDKIEN 168 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 64/179 (35%), Gaps = 14/179 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +WK V + + N T G I +E +E T + + + F Sbjct: 2 NNWKKVKLSDIIEFNPRETLSKGAIAKKIAMEKLEPFTRDIPEFEY----FEFRGGTKFR 57 Query: 83 KGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G L ++ P L D G ST+F+V++ K+ + + + L I Sbjct: 58 NGDTLMARITPSLENGKTSKVNLLDKDEVGFGSTEFIVVRAKENISDENFVYYLMIAPNI 117 Query: 136 R---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 R I+++ + + N + PPL EQ+ I + + A +I+ Sbjct: 118 REVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGKILKALDDKIENNKKINHHL 176 >gi|114330558|ref|YP_746780.1| restriction modification system DNA specificity subunit [Nitrosomonas eutropha C91] gi|114307572|gb|ABI58815.1| restriction modification system DNA specificity domain [Nitrosomonas eutropha C91] Length = 422 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 50/135 (37%), Gaps = 7/135 (5%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLM 337 +V+ +++ + V+ + A + +P +D+ +L + + Sbjct: 62 DELRNVVVEADDVLLNITGDSVARCCQVDPAVLPARVNQHVAIVRPRPETLDARFLRYSL 121 Query: 338 RSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 S + A+ S R +L +++L + P + EQ I +++ +D +E Sbjct: 122 VSPSMQAHLLALASAGATRNALTKGMLEKLVIAAPSVPEQRAIAHILGT----LDDKIEL 177 Query: 396 IEQSIVLLKERRSSF 410 + L+ + Sbjct: 178 NRRRNQTLEAMARAL 192 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 64/208 (30%), Gaps = 25/208 (12%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +G IP+ W++ + F L G++ + +P G+ + Sbjct: 237 LGEIPEGWEIRRVSDFLSLAYGKSLPAKARSP------------GNVPVYGSGGITGVHN 284 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 +++ ++ G+ G T F V QP LP Sbjct: 285 IALIDSEAVIVGRKGTVGSLYWEQSPSYPIDTVFYV-QPLVSLPFCYHLLESLPLRDMNT 343 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 +A G + + + P + EK ++ I + LL + Sbjct: 344 DAAVPGLNRKNVYRLEVVSPPEVL---------LEKFSVLARKLREKIFTAQNELHLLTQ 394 Query: 198 KKQALVSYIVTKGL---NPDVKMKDSGI 222 L+ ++ L + + M+ +GI Sbjct: 395 LHDTLLPKLIAGELRIVDAEKFMERTGI 422 >gi|315609161|ref|ZP_07884128.1| conserved hypothetical protein [Prevotella buccae ATCC 33574] gi|315249152|gb|EFU29174.1| conserved hypothetical protein [Prevotella buccae ATCC 33574] Length = 233 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 65/200 (32%), Gaps = 13/200 (6%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E +P WE+ F +++ + + L + L+ + + Sbjct: 2 EIPFEIPWGWELARFGSVMYNRDSERIPLSVAKRSKLTKIYDYYGASGVIDKVDKYLFNK 61 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 ++ + +L N + + + + A++ I Y+ + S L Sbjct: 62 DLLLIGED----GNNLINRSKPIAYIATGKYWVNNHAHVLDCIDSIFMQYIGLYINSISL 117 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + E + + + +PP EQ I I V+ + V EK + + Sbjct: 118 VDYV---TGTAQPKMNQEKMNSILLPLPPHNEQKRILQKI-VKIQPLFVRYEKNQLRLEA 173 Query: 403 LK-----ERRSSFIAAAVTG 417 L R S + A+ G Sbjct: 174 LTKTLYINLRKSILQEAIQG 193 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 66/206 (32%), Gaps = 20/206 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W++ + I + + S K G S D Sbjct: 6 EIPWGWELARFGSV-------MYNRDSERIPLSVAK-RSKLTKIYDYYGASGVIDKVDKY 57 Query: 80 IFAKGQILYGKLGPYLRK-----AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 +F K +L G+ G L A IA + VL D + + ++ + Sbjct: 58 LFNKDLLLIGEDGNNLINRSKPIAYIATGKYWVNNHAHVL---DCIDSIFMQYIGLYINS 114 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF--- 191 + G + + + +I +P+PP EQ I +KI+ ++R Sbjct: 115 ISLVDYVTGTAQPKMNQEKMNSILLPLPPHNEQKRILQKIVKIQPLFVRYEKNQLRLEAL 174 Query: 192 -IELLKEKKQALVSYIVTKGLNPDVK 216 L +++++ + L P Sbjct: 175 TKTLYINLRKSILQEAIQGHLVPQNP 200 >gi|260887978|ref|ZP_05899241.1| putative type I restriction-modification system [Selenomonas sputigena ATCC 35185] gi|260862229|gb|EEX76729.1| putative type I restriction-modification system [Selenomonas sputigena ATCC 35185] Length = 238 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 37/132 (28%), Gaps = 4/132 (3%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 + GE+V D + + S + Sbjct: 83 YLKEGEVVSIPWGKSRDVTDCIKYYKGKFVTADNRIATSNDITKLSNRYLYYWMMSQGKV 142 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + + V + + +PP+ Q +I +++ T L E++ + L K Sbjct: 143 IDTFYRGSGIKHPDMAKVLNMQIPIPPLAIQNEIVKLLDDFTELTAELTEQLMTELTLRK 202 Query: 405 E----RRSSFIA 412 + R S + Sbjct: 203 KQYNFYRDSLLN 214 >gi|304440528|ref|ZP_07400415.1| type I restriction-modification system specificity determinant [Peptoniphilus duerdenii ATCC BAA-1640] gi|304371006|gb|EFM24625.1| type I restriction-modification system specificity determinant [Peptoniphilus duerdenii ATCC BAA-1640] Length = 203 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 59/173 (34%), Gaps = 2/173 (1%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + T N K + + N+++ + + ++ + G+I+ Sbjct: 18 SLKDITTYSNNKINITELNETNYVGVDNLLKNKLGKVDSKNVPTSGSFNLFREGDILIGN 77 Query: 296 IDLQNDKRSLRSAQVME-RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL- 353 I K + + ++ + + S YL ++ S G Sbjct: 78 IRPYLRKIWISDIEGGASPDVLVIRKKDSFNNNLLSKYLYQVLSSEQFFDYDIKHSKGAK 137 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + +LVPP+ Q + ++++ + I+ + E + + I L +++ Sbjct: 138 MPRGNKAKIMDYEILVPPLYVQEYVVSILDKFDSLINDINEGLPKEIELRQKQ 190 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 75/190 (39%), Gaps = 9/190 (4%) Query: 26 KVVPIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 K + +K T + + + Y+G++++ + + ++F +G Sbjct: 15 KKLSLKDITTYSNNKINITELNETNYVGVDNLLKNKLGKVDSKNVPTSG---SFNLFREG 71 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-----ELLQGWLLSIDVTQRIEA 139 IL G + PYLRK I+D +G S LV++ KD + L L S Sbjct: 72 DILIGNIRPYLRKIWISDIEGGASPDVLVIRKKDSFNNNLLSKYLYQVLSSEQFFDYDIK 131 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 +GA M + I + + +PPL Q + + I+ + + IEL +++ Sbjct: 132 HSKGAKMPRGNKAKIMDYEILVPPLYVQEYVVSILDKFDSLINDINEGLPKEIELRQKQY 191 Query: 200 QALVSYIVTK 209 + ++ Sbjct: 192 EYYREKLLDF 201 >gi|186701639|ref|ZP_02553264.2| type I restriction enzyme S protein [Ureaplasma parvum serovar 6 str. ATCC 27818] gi|186700875|gb|EDU19157.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 6 str. ATCC 27818] Length = 442 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 67/423 (15%), Positives = 133/423 (31%), Gaps = 37/423 (8%) Query: 22 PKHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS---- 76 P + + + K + + V + K K S Sbjct: 13 PNGVEFKKLWEIVNFDKKFKGVPKEKQNEILSFKHVSANELKRYEKCNFGNVKLLSTGLY 72 Query: 77 -TVSIFAKGQ------ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + + + I S + Q L + Sbjct: 73 DGYIKYNENDNNINYGEIIALPSGGSPIIKYYNGYFIDSLNIIFSQKTKKECNLKFIYYF 132 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I IE GA++ H + I + +PIPP++ Q I E + + L TE Sbjct: 133 LIANKMLIEENYRGASVKHPNMIEIIELLIPIPPISIQNKIVEILD----KYTELETELE 188 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR--- 246 +EL ++ + ++ N + K G + + + E K + + Sbjct: 189 TELELRNKQYIYYRNELLDFNKNQVLLKKIIGSDDIESIDSKIEFKKIGDIGNFYSGLSG 248 Query: 247 --KNTKLIESNILSLSYGNIIQKLE---TRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 KN +N ++Y N+ LE + ++ YE V+ G+I+ D Sbjct: 249 KNKNDFFKNANARYITYLNVFNNLEINVDKLENVRISKYEKQNKVEYGDILITISSETPD 308 Query: 302 KRS--------LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG- 352 + + + + + + Y +L + + K +G Sbjct: 309 ECGYVSIANHFIFKEEDIYLNSFCFGFRLHNLKIYNIKYFKYLFKDKNTRKKIIKCVNGV 368 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE---TARIDV-LVEKIEQSIVLLKERRS 408 R +L E+ K + + +PPI Q I +++ T I+ L +IEQ + R+ Sbjct: 369 TRFNLSKEEFKNISIPIPPISIQNKIVEILDKLEVYTKDINTGLPLEIEQRKKQYEYYRN 428 Query: 409 SFI 411 + Sbjct: 429 KLL 431 >gi|34762952|ref|ZP_00143931.1| Adenine-specific methyltransferase [Fusobacterium nucleatum subsp. vincentii ATCC 49256] gi|27887375|gb|EAA24466.1| Adenine-specific methyltransferase [Fusobacterium nucleatum subsp. vincentii ATCC 49256] Length = 556 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 42/350 (12%), Positives = 93/350 (26%), Gaps = 15/350 (4%) Query: 68 GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW 127 N + +S+ ILY +RK + + + L ++ Sbjct: 202 INQLKEKGKGISLVKTN-ILYKPENKNIRKYFVENGYIE---SIIYLPKNMLIDYPFPLA 257 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L+ + + + I I + + I I + Sbjct: 258 LIVFSKKNKKIKFIDAYKFCKIEKFKIEFIDNYFKNPKISEIKEQNINIIIDTNVEKIID 317 Query: 188 RIRFIELLKEKKQALVSYIVTKGLN-----PDVKMKDSGIEWVGLVPDHWEVKPFFALVT 242 I + +KE + IV K N + D ++ + +K Sbjct: 318 LINNQKNIKESFSKKIEDIVEKDYNLVVTENFEILVDILKKFKNEIKFKDIIKNIVRGSQ 377 Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY----ETYQIVDPGEIVFRFIDL 298 + K E+ + LS +I L + + I+ Sbjct: 378 KTISKFKSEEETQYIYLSLSDINDGLIEFKNIENYLKEVPKNQEKFFIKNNSILLSKYGS 437 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-- 356 + + + + + + S ++ Sbjct: 438 SPKLAISQIPDDKKVIPSGNFIIIEVDEEKLNPWYLMSYFSSGFGSEKLKETYTEAKNDT 497 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + ++ + + VPPIKEQ I +I+ + +K++ I +E Sbjct: 498 ISIRKLENIEIPVPPIKEQEKIAKEYRESLKKIEEMKKKLKNEIQNSREI 547 >gi|327490262|gb|EGF22050.1| restriction endonuclease S [Streptococcus sanguinis SK1058] Length = 169 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 52/164 (31%), Gaps = 8/164 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP- 288 ++W+ L ++ K N + + +++ T+ + Sbjct: 2 NNWKKVRLSELADITMGQSPKSDFYNSKGDGLPFLQGNRTFGDKYPTFDTWTTFVTKEAE 61 Query: 289 -GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 G+++ D + + ++ +L +L+R+ + Sbjct: 62 VGDVIMSVRAPVGD-----INITPLKMCLGRGVCGLRHKQGAQEFLYYLLRANK-ENLIN 115 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + S+ D+ L V VP ++EQ I + +I+ Sbjct: 116 RENGTVFGSINKTDISNLEVQVPSLREQIQIGLTLKAIDDKIEN 159 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 45/169 (26%), Gaps = 3/169 (1%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +WK V + + G++ +S G + K T Sbjct: 2 NNWKKVRLSELADITMGQSPKSDFYNSKGDGLPFLQGNRTFGDKYPTFDTWTTFVTKEAE 61 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G ++ P I L+ K + L + + Sbjct: 62 VGDVIMSVRAPV-GDINITPLKMCLGRGVCGLRHKQG--AQEFLYYLLRANKENLINREN 118 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G + I N+ + +P L EQ+ I + A +I+ Sbjct: 119 GTVFGSINKTDISNLEVQVPSLREQIQIGLTLKAIDDKIENNKKINHHL 167 >gi|294647357|ref|ZP_06724950.1| conserved domain protein [Bacteroides ovatus SD CC 2a] gi|294809022|ref|ZP_06767744.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b] gi|292637316|gb|EFF55741.1| conserved domain protein [Bacteroides ovatus SD CC 2a] gi|294443747|gb|EFG12492.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b] Length = 232 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 57/213 (26%), Gaps = 13/213 (6%) Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 G K VG + F + K E + + + + Sbjct: 27 GGEMVWNEKLKRNIPVGWHCGNLFEIAVFTNGLACQKFRPKDDEVPLPVIKIREMHDGIS 86 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + V G+++F + G + V Sbjct: 87 VDTEEVTSN-IPESVKVYNGDVLFSWSASLE-----VMLWAYGLGGLNQHIFKVTSANDF 140 Query: 330 STYLAWLMRSYDLCKVFYAMG---SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 + + D VF M + + +++ + +P DI + Sbjct: 141 PKSFYY-FQLLDYVDVFKKMAEARKTTMGHITQDHLQQSTIAIPDN---KDIADKFEELI 196 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + I + K+++ I ++R + + GQI Sbjct: 197 SPIFKQIVKLQEEISNFIKQRDELLPLLMNGQI 229 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 57/206 (27%), Gaps = 19/206 (9%) Query: 10 YKDSGVQ--WIGA----IPKHWKVVPIKRFTKLNTG----RTSESGKDII--YIGLEDVE 57 YK SG + W IP W + G + ++ I + ++ Sbjct: 23 YKSSGGEMVWNEKLKRNIPVGWHCGNLFEIAVFTNGLACQKFRPKDDEVPLPVIKIREMH 82 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 G + ++ G +L+ L + A G + + Sbjct: 83 DGISVDTEEVTSNIPESVK----VYNGDVLFSWS-ASLEVMLWAYGLGGLNQHIFKVTSA 137 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + P+ + L V + A + ++ + + I +K Sbjct: 138 NDFPKSFYYFQLLDYV--DVFKKMAEARKTTMGHITQDHLQQSTIAIPDNKDIADKFEEL 195 Query: 178 TVRIDTLITERIRFIELLKEKKQALV 203 I I + I +++ L+ Sbjct: 196 ISPIFKQIVKLQEEISNFIKQRDELL 221 >gi|298531140|ref|ZP_07018541.1| conserved hypothetical protein [Desulfonatronospira thiodismutans ASO3-1] gi|298509163|gb|EFI33068.1| conserved hypothetical protein [Desulfonatronospira thiodismutans ASO3-1] Length = 203 Score = 57.1 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 26/122 (21%), Positives = 41/122 (33%), Gaps = 3/122 (2%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + E + K E+Y + PG+I+F Q+ L S S + Sbjct: 48 WRRVNHDELIRIRFKGRKIESYF-LKPGDILFFGRSGQSHSVVLESPVPENTAAAPSFMV 106 Query: 322 AVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDIT 379 YL W + S K F A G Q + ++ L V +P +Q I Sbjct: 107 LRIKDDKTLPHYLNWYLNSDRAQKYFMAEAGGSFQRVVTKSVLENLEVPLPEQNDQERIV 166 Query: 380 NV 381 + Sbjct: 167 RI 168 >gi|291559578|emb|CBL38378.1| Restriction endonuclease S subunits [butyrate-producing bacterium SSC/2] Length = 199 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 43/123 (34%), Gaps = 9/123 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSY 340 +++F + + +I++ + + ID Y + S Sbjct: 71 KCYAYRNDLIFTAAGTIGQVGVIPENSRYTKYVISNKQIRARIDTKKIDLLYAYYWFSSP 130 Query: 341 DLC-KVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQ 398 + + L ++K LP++ P I EQ I +VI+ + +I+ I + Sbjct: 131 WIRAFLIRNNKGSTVPLLTLSEIKDLPIIYPESIDEQKTIISVIDNISKKIE-----INK 185 Query: 399 SIV 401 I Sbjct: 186 KIN 188 >gi|321310219|ref|YP_004192548.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802063|emb|CBY92709.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 190 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 66/184 (35%), Gaps = 16/184 (8%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK--PESYETYQIVDPGEIVFR 294 + + ++ K++ +S I L + + ++ PE +V G++V Sbjct: 9 ICKVYSGVDLKDSDYRKSGIPVLKSSEVSGGFISEDVVFYCNPEKALNGNLVRFGDVVIT 68 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + + V I T + P + YL + + + L ++ + +G Sbjct: 69 RMGG-KCRVGINLTNVDYLPISTIFKLDPNPEIVSREYLYYCLLN-SLQEINSHIANGNV 126 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSF 410 L + ++ + +P ++ Q I +N +++ + + L K RS Sbjct: 127 SKLYKSSLLKVALSIPDLETQARIVEYLNQL--------QELRKELELRKRQGVYYRSKI 178 Query: 411 IAAA 414 + Sbjct: 179 MNNL 182 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 52/182 (28%), Gaps = 6/182 (3%) Query: 30 IKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + K+ +G + I + +V G ++ G Sbjct: 6 LGDICKVYSGVDLKDSDYRKSGIPVLKSSEVSGGFIS-EDVVFYCNPEKALNGNLVRFGD 64 Query: 86 ILYGKLGPYLRK-AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ ++G R + + D + + L P + + ++ Q I + Sbjct: 65 VVITRMGGKCRVGINLTNVDYLPISTIFKLDPNPEIVSREYLYYCLLNSLQEINSHIANG 124 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 +S + + + IP L Q I E + L + + + + L Sbjct: 125 NVSKLYKSSLLKVALSIPDLETQARIVEYLNQLQELRKELELRKRQGVYYRSKIMNNLKE 184 Query: 205 YI 206 Sbjct: 185 CA 186 >gi|218247752|ref|YP_002373123.1| restriction modification system DNA specificity domain-containing protein [Cyanothece sp. PCC 8801] gi|218168230|gb|ACK66967.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 8801] Length = 194 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 64/189 (33%), Gaps = 9/189 (4%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +G + ++ E++ L +S ++ Q++ ++ Sbjct: 7 EIGTLGQLCKIAIGGTPARNNPEYWDIQKETDNLWVSIRDMNQRVINDTAEYISDAGVKN 66 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + + L R A ++ A A+ ID +L + ++ +DL Sbjct: 67 SNAKLQDE--NTVLLSFKLTIGRVAFAGKKLYTNEAIAALATEQIDPNFLYYGLQQWDLL 124 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + G +L + ++ P KEQ I +++ ID +E+ E I Sbjct: 125 QDVDQAIKGA--TLNKVKLNKIEFNYPKDKKEQTQIATILST----IDRAIEQTETLIAK 178 Query: 403 LKERRSSFI 411 + ++ + Sbjct: 179 QQRIKTGLM 187 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 57/191 (29%), Gaps = 17/191 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 + W++ + + K+ G T D +++ + D+ + + Sbjct: 4 EGWEIGTLGQLCKIAIGGTPARNNPEYWDIQKETDNLWVSIRDMNQRVINDTAEYISDAG 63 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S + + +L + + A + L + + P L L D Sbjct: 64 VKNSNAKLQDENTVLLS-FKLTIGRVAFAGKKLYTNEAIAALATEQIDPNFLYYGLQQWD 122 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + Q ++ +GAT++ I Q I ID I + I Sbjct: 123 LLQDVDQAIKGATLNKVKLNKIEFNYPKDKKEQTQ------IATILSTIDRAIEQTETLI 176 Query: 193 ELLKEKKQALV 203 + K L+ Sbjct: 177 AKQQRIKTGLM 187 >gi|160894143|ref|ZP_02074921.1| hypothetical protein CLOL250_01697 [Clostridium sp. L2-50] gi|160894146|ref|ZP_02074924.1| hypothetical protein CLOL250_01700 [Clostridium sp. L2-50] gi|156864176|gb|EDO57607.1| hypothetical protein CLOL250_01697 [Clostridium sp. L2-50] gi|156864179|gb|EDO57610.1| hypothetical protein CLOL250_01700 [Clostridium sp. L2-50] Length = 186 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 21/146 (14%), Positives = 52/146 (35%), Gaps = 4/146 (2%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE--TYQIVDPGEIVFRFIDLQNDKRS 304 K + ++ I + I + + + V+ +++ Sbjct: 28 KRGDMKDNGIPVYEQQHAIYNSRHFRYYIDEQKFNEMKRFQVNTDDLIISCSGTVGKVSI 87 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSL-KFEDV 362 +RS + V + I YL + S D + SG ++ ++ K + Sbjct: 88 IRSDDPKGIISQALLLLRVDQNKILPLYLKYFFTSRDGYNAIVSRSSGSVQVNIAKRNVI 147 Query: 363 KRLPVLVPPIKEQFDITNVINVETAR 388 +++P+++P I+ Q I ++N + Sbjct: 148 EQIPLMLPKIETQRKIVEILNSIDKK 173 >gi|315609158|ref|ZP_07884126.1| type I restriction-modification system S subunit [Prevotella buccae ATCC 33574] gi|315249154|gb|EFU29175.1| type I restriction-modification system S subunit [Prevotella buccae ATCC 33574] Length = 183 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 61/173 (35%), Gaps = 6/173 (3%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E + +P W+ F +V K E + + + + + N ++ E Sbjct: 11 EILFDLPCSWQWVRFGQIVRMSIGKTPARGEVSYWTKATIPWVSISDMTNCEHINKTKEK 70 Query: 283 YQI----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLM 337 + V G + + R++ + A +++ P D L +L Sbjct: 71 ISVAASSVMGGISPVGSLLMSFKLTVGRTSILNIDAYHNEAIISIFPFIDDKYALRDYLF 130 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + ++ ++L + +K L + +PP++EQ I + + A + Sbjct: 131 YTLPFLSNMGNSKDAIKGKTLNSKSLKSLLIPLPPLREQRYIIDRLEELYAHL 183 Score = 47.9 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 64/170 (37%), Gaps = 9/170 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P W+ V + +++ G+T G+ I ++ + D+ + K+ S Sbjct: 15 DLPCSWQWVRFGQIVRMSIGKTPARGEVSYWTKATIPWVSISDMTNCEHINKTKEKISVA 74 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW-LLSI 131 + + I G +L + + I + D + + + P L+ + ++ Sbjct: 75 ASSVMGGISPVGSLLMS-FKLTVGRTSILNIDAYHNEAIISIFPFIDDKYALRDYLFYTL 133 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + K + ++ +P+PPL EQ I +++ + Sbjct: 134 PFLSNMGNSKDAIKGKTLNSKSLKSLLIPLPPLREQRYIIDRLEELYAHL 183 >gi|300819088|ref|ZP_07099291.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 107-1] gi|300528388|gb|EFK49450.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 107-1] Length = 389 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 46/393 (11%), Positives = 101/393 (25%), Gaps = 51/393 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYL-----PKDGNSRQSDTSTVSI 80 + + + K G +G+ ++ + D I Sbjct: 17 EWQTLGKVLKRTKGTKITAGQ------MKALHKDNAPLKIFAGGKTVAFVDFKDIPEKDI 70 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + I+ G + D + + + + + I Sbjct: 71 NREPSIIVKSRGII--EFEYYDKPFSHKNEMWSYHSNNDAISIKYIYYFLKINEGYFQKI 128 Query: 141 CEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIE 193 M +PIP LA Q I + T L E + Sbjct: 129 GGKMQMPQIATPDTDKFEVPIPCPDNPEKSLAIQSEIVRILDKFTELTAELTAELSMRKK 188 Query: 194 LLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 + L+S K+ +E + + + +IE Sbjct: 189 QYNYYRDQLLS------------FKEDEVE-----GKRKTLGEIMKMRAGQHISAHNIIE 231 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 S Y + K E I G + ++ + Sbjct: 232 RKEESYIYPCFGGNGIRGYVKEKSHDGEHLLIGRQGALCGNVQRMKGQFYATE------- 284 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 A + GI+ + ++ + +L + + L ++ L + VP I+ Sbjct: 285 ----HAVVVSVMPGINIDWAFHMLTAMNLNQY---ASKSAQPGLAVGKLQELKLFVPSIE 337 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 Q I +++ + + E + + I L +++ Sbjct: 338 RQIYIAAILDKFDTLTNSITEGLPREIELRQKQ 370 Score = 43.2 bits (100), Expect = 0.087, Method: Composition-based stats. Identities = 15/136 (11%), Positives = 42/136 (30%), Gaps = 12/136 (8%) Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK-VFYAMGSGLR- 354 + R + + ++ M D+ + ++ + + F +G ++ Sbjct: 75 SIIVKSRGIIEFEYYDKPFSHKNEMWSYHSNNDAISIKYIYYFLKINEGYFQKIGGKMQM 134 Query: 355 QSLKFEDVKRLPVLVP-------PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + D + V +P + Q +I +++ T L ++ R Sbjct: 135 PQIATPDTDKFEVPIPCPDNPEKSLAIQSEIVRILDKFTELTAELTAELSMRKKQYNYYR 194 Query: 408 SSFIA---AAVTGQID 420 ++ V G+ Sbjct: 195 DQLLSFKEDEVEGKRK 210 >gi|271498972|ref|YP_003331997.1| restriction modification system DNA specificity domain-containing protein [Dickeya dadantii Ech586] gi|270342527|gb|ACZ75292.1| restriction modification system DNA specificity domain protein [Dickeya dadantii Ech586] Length = 190 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 57/130 (43%), Gaps = 3/130 (2%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + E T +++PGEIV +N + + + +K + YL Sbjct: 54 VVWEQNATPPLLEPGEIVVAARGNRN-VAVVYHGKAPVVATNQFLIINIKTKTVLPEYLC 112 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 WL+ + ++F+ G+ ++ + + ++ + +PPI+ Q +I + + D L+ Sbjct: 113 WLINHPTIQQMFHRSGTNIQL-VTKAALLKVQLPLPPIEVQQNIIG-LQQVWEQEDQLIS 170 Query: 395 KIEQSIVLLK 404 +++ + L+ Sbjct: 171 QLQANRQKLQ 180 >gi|13357656|ref|NP_077930.1| type I restriction enzyme S protein (fragment) [Ureaplasma parvum serovar 3 str. ATCC 700970] gi|170762424|ref|YP_001752182.1| type I restriction modification DNA specificity family protein [Ureaplasma parvum serovar 3 str. ATCC 27815] gi|11357071|pir||F82933 type I restriction enzyme S protein, truncated homolog UU099 [imported] - Ureaplasma urealyticum gi|6899054|gb|AAF30505.1|AE002110_3 type I restriction enzyme S protein (fragment) [Ureaplasma parvum serovar 3 str. ATCC 700970] gi|168828001|gb|ACA33263.1| type I restriction modification DNA specificity family protein [Ureaplasma parvum serovar 3 str. ATCC 27815] Length = 301 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 16/148 (10%), Positives = 40/148 (27%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + S + + L+ + +V Sbjct: 8 INEFCPNGVEFKKLKNIITVAPKSPFGVTKLLKMEKGNYLTITSGKKSFYVDNFLVDGEY 67 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 ND + + + A K + ++ YL + + + ++ Sbjct: 68 IFVNDGGQADIKYNFGKTMYSDHIFAFKVNEYNTKYLYFYLLNISNFINKKLFIGSTLKN 127 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINV 384 L ++ L + +PPI Q I +++ Sbjct: 128 LNKKEFLNLAIPIPPISIQNKIVEILDK 155 >gi|329963239|ref|ZP_08300976.1| conserved domain protein [Bacteroides fluxus YIT 12057] gi|328528935|gb|EGF55875.1| conserved domain protein [Bacteroides fluxus YIT 12057] Length = 467 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 46/395 (11%), Positives = 111/395 (28%), Gaps = 32/395 (8%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 ++ K GKY + + S+F ++ Sbjct: 6 FGEIFSFVPAPKIKAEKG----------RNRGKYPLYTTGQQFPQRTDQSMFNGPALIIS 55 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 K P + + F++ + + + + ++ E Sbjct: 56 KTVPV--SIYYCNGSFSATNDFMIAKANRNCFTQVDPQYVYFYLL-GNLSLLEHEEKKSF 112 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + I I +P+ L Q I + I T + +L K V Sbjct: 113 SRQSIQKIEIPLNSLETQERIIGTLHKIETLIQKRGTNLLLVSKLKKVMFLNFFGDPV-- 170 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 L+ + + + + + E + +L ++ I + + K Sbjct: 171 -LDKGKFLFSTPFHNL-VTIHGGGNYQTQNVPRESKEQLAQLTQTAITRREFDPVQNKRF 228 Query: 270 TRNMGLKPESYETYQIVDPGEIVFR-FIDLQNDKRSLRSAQVMERGIITSAY--MAVKPH 326 +K Y + G+++F L+ + + I + P Sbjct: 229 LHKQFVKDSHY-----IQKGDVLFSRKNSLKLIGSAAYVYDDIANLTIPDTIFRICCNPK 283 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQ---SLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I YL +L+ + K + G S+ + +K+ + P + Q Sbjct: 284 KISGVYLTYLLNDENFNKQLRSYFGGTLPTMSSITTKKLKQFIIPCPDLALQHKF----E 339 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + + +++ + + L++ S +G+ Sbjct: 340 KNILFLRQMEDRMTKQLSRLRQFISIASNDLFSGK 374 >gi|167767085|ref|ZP_02439138.1| hypothetical protein CLOSS21_01603 [Clostridium sp. SS2/1] gi|167711060|gb|EDS21639.1| hypothetical protein CLOSS21_01603 [Clostridium sp. SS2/1] Length = 199 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 43/123 (34%), Gaps = 9/123 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAWLMRSY 340 +++F + + +I++ + + ID Y + S Sbjct: 71 KCYAYRNDLIFTAAGTIGQVGVIPENSRYTKYVISNKQIRARIDTKKIDLLYAYYWFSSP 130 Query: 341 DLC-KVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQ 398 + + L ++K LP++ P I EQ I +VI+ + +I+ I + Sbjct: 131 WIRAFLIRNNKGSTVPLLTLSEIKDLPIIYPESIDEQKTIISVIDNISKKIE-----INK 185 Query: 399 SIV 401 I Sbjct: 186 KIN 188 >gi|294783683|ref|ZP_06749007.1| hypothetical protein HMPREF0400_01677 [Fusobacterium sp. 1_1_41FAA] gi|294480561|gb|EFG28338.1| hypothetical protein HMPREF0400_01677 [Fusobacterium sp. 1_1_41FAA] Length = 627 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 61/175 (34%), Gaps = 4/175 (2%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 L + + T I + +++ G I + + PE E + I + Sbjct: 449 QISLDELKDLRSHEETPYIYLTLSNINDGFIEYENIEDYLKKIPEKQEKFCI-KNNVFLI 507 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGS 351 I K + + I + + ++ + + YLA + KV Sbjct: 508 SKIGNPPYKFVVAQIPENRKIIASGNFAIIEVNEKKLNPWYLAAFFTTDIGVKVLKKAYI 567 Query: 352 GL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G+ SL + ++ + + VP I+EQ I I + + ++ I +KE Sbjct: 568 GVNFSSLSIKKLEEIAIPVPSIEEQNRIAQRYIDAITEIKNMKKDLKDKIQAVKE 622 >gi|269978352|gb|ACZ55910.1| truncated putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 264 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 19/139 (13%), Positives = 46/139 (33%), Gaps = 8/139 (5%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + I++ ++ + + + S+ K + + Sbjct: 56 TKADINYKDISKKDIINCESVIIKSRGNIGFEYYNQPFSHKNEIWSYSS----KTNQMLV 111 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L + + + A S ++ L D V VPP++ Q +I +++ T Sbjct: 112 KFLYYYLSNNQDYFQKLAQSSSVKLPQLSVSDTDEYEVPVPPLEIQQEIVKILDAFTELN 171 Query: 390 DVLVEKIEQSIVLLKERRS 408 L ++ LK R+ Sbjct: 172 TELNTELNTE---LKARKK 187 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 60/189 (31%), Gaps = 3/189 (1%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 PK + I K N G + + ++ + + G D N + D S I Sbjct: 13 PKGVEFKKIGELFKRNKGINITAAQMKELHSDIGKIRIFAGGATKADINYK--DISKKDI 70 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 ++ G + F + +L + L +L + + A Sbjct: 71 INCESVIIKSRGNIGFEYYNQPFSHKNEIWSYSSKTNQMLVKFLYYYLSNNQDYFQKLAQ 130 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + +P+PPL Q I + + A T L TE ++ K++ + Sbjct: 131 SSSVKLPQLSVSDTDEYEVPVPPLEIQQEIVKILDAFTELNTELNTELNTELKARKKQYE 190 Query: 201 ALVSYIVTK 209 + ++ Sbjct: 191 YYQNMLLDF 199 >gi|261839395|gb|ACX99160.1| hypothetical protein HPKB_0560 [Helicobacter pylori 52] Length = 214 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 53/140 (37%), Gaps = 10/140 (7%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDK--RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 Y I D ++ +K + + + + A++ + + +L + Sbjct: 73 DYIDSYIFDGDFVLVGEDGSVINKDNTPVVNWASGKIWVNNHAHVLQTKNELKLKFLYFY 132 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +++ D+ +G + E++K++ + + P++ Q +I +++ + L+ I Sbjct: 133 LQTIDV----SYCVAGTPPKINQENLKKITIPILPLEIQQEIVKILDQFSVLTTDLLAGI 188 Query: 397 EQSIVLLKE----RRSSFIA 412 I K+ R + Sbjct: 189 PAEIEARKKQYEYYREKLLT 208 Score = 41.7 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 43/158 (27%), Gaps = 13/158 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 PK + + ++ R K I +G Y+ Sbjct: 31 PKGVEFRKLGEVCEILDNRRIPIAKNKRNPGIYPYYGANGIQDYIDSYIFDGDFV----- 85 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + + + K A + VLQ K+ L + + + Sbjct: 86 LVGEDGSVINKDNT--PVVNWASGKIWVNNHAHVLQTKNELKLKFLYF----YLQTIDVS 139 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 C T + + + I +PI PL Q I + + Sbjct: 140 YCVAGTPPKINQENLKKITIPILPLEIQQEIVKILDQF 177 >gi|255011910|ref|ZP_05284036.1| restriction endonuclease S subunit [Bacteroides fragilis 3_1_12] Length = 368 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 45/379 (11%), Positives = 105/379 (27%), Gaps = 36/379 (9%) Query: 38 TGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYL 95 G T + ++ I + + ++ + I G +L G Sbjct: 3 KGITPKYVESSSVLVINQACIHWDGQRLGNIKYHNEEIPVRK-RILESGDVLLNATG--- 58 Query: 96 RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 +G + + P D + G ++++ + + T + Sbjct: 59 --------NGTLGRCCVFICPSDNNTYINDGHVIALSTDRAVILPEVLNTYLSLNDTQAE 110 Query: 156 NIPMPIPPLAEQVLIREKIIAETV----RIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 + QV I I + +D I + K K S + Sbjct: 111 IYRQYVTGSTNQVDIVFSDIKKMKVPVPSMDEQILFVEVLKQADKSKFGDFKSQFIEMFG 170 Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 NP + + ++ +G + K + + + Y + E Sbjct: 171 NPLSLNQKNELKRLGEC--CILNPRRPNIALCDTDKVSFIPMPAVSEDGYLVDMTDEEYG 228 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS----AYMAVKPHG 327 + + + + +++F I + + GI + + Sbjct: 229 KVK------KGFTYFENNDVLFAKITPCMENGKGAIVHGLTNGIGMGSTEFHVLRLINGI 282 Query: 328 IDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 +L L R + G+G ++ + + V +P I+EQ Sbjct: 283 SSPYWLLALTRMPIFRERAAKNMSGTGGQKRVSASYLDHFMVGLPAIEEQRRF----EAI 338 Query: 386 TARIDVLVEKIEQSIVLLK 404 + D I++++V L Sbjct: 339 YKQADKSKSVIQKALVYLN 357 >gi|167989005|ref|ZP_02570676.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 7 str. ATCC 27819] gi|225551422|ref|ZP_03772368.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 8 str. ATCC 27618] gi|188018714|gb|EDU56754.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 7 str. ATCC 27819] gi|225379237|gb|EEH01602.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 8 str. ATCC 27618] Length = 379 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 42/382 (10%), Positives = 110/382 (28%), Gaps = 18/382 (4%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + ++ T ++ +I GL + N+ ++ I Sbjct: 6 KLSSVFEIITTGKQKNTFNINLEGLYPL------ISASTANNGIMGYVDNYLYDGQNITI 59 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGATMS 147 ++G + F++ + + ++ +LL ++ ++I +I G T Sbjct: 60 SRVGNAGTTFYHEGKISLTDNCFILSKINKKIAKVKYVFYLLKLNEDKKIRSISHGTTRK 119 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK----QALV 203 + + N+ + +P + Q I I +K +++ Sbjct: 120 IINKTDLDNLIIYLPSIEIQNAIISIIEPHEKLFVKYSNLVDISSVENAKKDVDNLISII 179 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWE-VKPFFALVTELNRKNTKLIESNILSLSYG 262 + + + + + F K S Sbjct: 180 EPLDILENKINKLKTVLKKLLINIYDKNCNSHVNLFENNKIYTNKYLNQNLYCDTSCIGE 239 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 I + N+ L+ + + I+F + +N E + ++ + Sbjct: 240 LEINFSKMINISLEDKPSRADLSIKNNSIIFSKLLGENKVYC---FLNNENIVFSTGFFN 296 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +K + ++ L + S D + +G + D+ ++ P + +I Sbjct: 297 IKSNDENNDDLLSFLLSSDFKNQKSMLANGTTMIGINNSDLTKVRCKAPFLN--SNIYFT 354 Query: 382 INVETARIDVLVEKIEQSIVLL 403 + I+ + IV L Sbjct: 355 FFNKLNEIENKITLARNKIVNL 376 >gi|166368339|ref|YP_001660612.1| Type I restriction enzyme EcoEI M protein [Microcystis aeruginosa NIES-843] gi|166090712|dbj|BAG05420.1| Type I restriction enzyme EcoEI M protein homolog [Microcystis aeruginosa NIES-843] Length = 677 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 34/309 (11%), Positives = 88/309 (28%), Gaps = 19/309 (6%) Query: 98 AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNI 157 A + GI ++ +V + + I + + I + G Sbjct: 369 AFVPYGTGIKTSLLVVQKLPANHDSCFMAQIKKIGYDVKGQTIYKRNESGVIARTKSGLP 428 Query: 158 PMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM 217 + R I E + I + + + + P+ + Sbjct: 429 IVDDDIDDISQSFRSFINGEFAQNSDCIYTVKNTLLNSRLDAEHYL---------PNDQK 479 Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 ++ +G P +++++ I + Y + Q + + + Sbjct: 480 LLEHLKSIGAKPLGEIADILREAADFRLARDSEIRYIAISDVDYRTM-QVVSQQIIKAHE 538 Query: 278 ESYETYQIVDPGEIVFRFIDLQN----DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + G+I+ +L + + G++ +L Sbjct: 539 APSRATYRLYKGDIITAISGASTGTPRQATALITEDEDGAICSNGFSVLRNIQGVEPLFL 598 Query: 334 AWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 MR+ + +G ++ +D+ ++ V +PP EQ I A I + Sbjct: 599 LVYMRTDFFLRQIKRYMTGHAIPTILVDDLSKVLVPIPPQSEQQRIA----KSMAEIQAI 654 Query: 393 VEKIEQSIV 401 ++ ++ Sbjct: 655 RKEALKASE 663 >gi|283769286|ref|ZP_06342189.1| hypothetical protein HMPREF9013_1471 [Bulleidia extructa W1219] gi|283104103|gb|EFC05483.1| hypothetical protein HMPREF9013_1471 [Bulleidia extructa W1219] Length = 236 Score = 56.7 bits (135), Expect = 8e-06, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 78/218 (35%), Gaps = 24/218 (11%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIE------SNILSLSYGNIIQKLETRNMGLKPES 279 G PD WE + + K E + + I + N G+ Sbjct: 20 GTAPDDWEQGTLQDIADFSSGYAFKSKELLNTPAPDCYHVFKQGHINRGGGFNSGVTKSW 79 Query: 280 YE-------TYQIVDPGEIVFRFIDLQNDKRSLRSAQ---VMERGIITSAYMAVKPH--- 326 Y + ++ G+++ D++++ L + + ++ I+ ++ + Sbjct: 80 YPISKCASLSKYVLHKGDVLMAMTDMKDNVAILGNTALMTIDDQYIVNQRVGLLRSNGYK 139 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 Y+ L S+D K + SG++ +L ++K PV + + + N Sbjct: 140 CTSYAYIYLLTNSFDFLKNLRSRANSGVQVNLSSAEIKASPVWIASDEVNKEF----NSL 195 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 T + ++ + L R + + ++G++D+ Sbjct: 196 TEPLLSMIMANDIENQKLLGLRDTLLPRLMSGELDVSD 233 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 51/204 (25%), Gaps = 21/204 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDII--------YIGLEDVESGTGKYLPKDGNSRQS 73 P W+ ++ ++G +S + + + G G + Sbjct: 23 PDDWEQGTLQDIADFSSGYAFKSKELLNTPAPDCYHVFKQGHINRGGGFNSGVTKSWYPI 82 Query: 74 DTS---TVSIFAKGQILYGKLGPYLRKAIIADFD-GICSTQFLVLQ---------PKDVL 120 + + KG +L AI+ + Q++V Q K Sbjct: 83 SKCASLSKYVLHKGDVLMAMTDMKDNVAILGNTALMTIDDQYIVNQRVGLLRSNGYKCTS 142 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 + S D + + + + I P+ I Sbjct: 143 YAYIYLLTNSFDFLKNLRSRANSGVQVNLSSAEIKASPVWIASDEVNKEFNSLTEPLLSM 202 Query: 181 IDTLITERIRFIELLKEKKQALVS 204 I E + + L L+S Sbjct: 203 IMANDIENQKLLGLRDTLLPRLMS 226 >gi|309800162|ref|ZP_07694348.1| HsdS [Streptococcus infantis SK1302] gi|308116209|gb|EFO53699.1| HsdS [Streptococcus infantis SK1302] Length = 233 Score = 56.7 bits (135), Expect = 8e-06, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 80/234 (34%), Gaps = 13/234 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDT--S 76 K V + K+ TG T DI +I +D ++ K + S+ + Sbjct: 2 KKVKLGDLGKIITGNTPSKKLLEFYNSNDIPFIKPDDFKTIDEISSSKGNKNYISEKARN 61 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I K +L +G + K +I+D + + Q + P +++ +L + + Sbjct: 62 NARIVPKNSVLVTCIG-IIGKVMISDSELSFNQQINAIVPNELILSKYLAYL-LLYNKPK 119 Query: 137 IEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 ++ I + + + + Q I + + I + + L+ Sbjct: 120 LDFISNAPVVPIINKTQFSEFEVTFHEDIDVQEKIIQNLENLDNHILKRRHQSKLLLNLV 179 Query: 196 KEKKQALVSYIVTKGL-NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 K + + V + +K+ GI G P E + L+++N Sbjct: 180 KSRFNEMFGDPVLNEMGWEKHALKEFGIWKSGGTPKRNEEDFLEDIFLGLHQEN 233 >gi|301162154|emb|CBW21699.1| putative type I restriction endonuclease [Bacteroides fragilis 638R] Length = 417 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 18/191 (9%), Positives = 55/191 (28%), Gaps = 11/191 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY-ETYQIVDP 288 D + E + + ++ + ++ + + +K + + +++ Sbjct: 34 DFYSTNSLSWEQLEYDTNAMMNLHYGLIHVGLPTMVDLAKDKLPNIKENNMPKNFELCKE 93 Query: 289 GEIVFRFI--DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA---WLMRSYDLC 343 G++ F D +++ + + ++ + T + + S Sbjct: 94 GDVAFADASEDTNEVAKTVEFFNLAGKNVVCGLHTIHGRDNKHKTVVGFKGYAFSSAAFH 153 Query: 344 KVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + G S+ ++ + +P EQ I ID + + I Sbjct: 154 NQIRRIAQGTKIYSISTKNFFECYIGLPSKPEQSKIA----TLLRLIDERIATQNKIIEK 209 Query: 403 LKERRSSFIAA 413 + I Sbjct: 210 YESLIKGIIYQ 220 >gi|225352859|ref|ZP_03743882.1| hypothetical protein BIFPSEUDO_04493 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156308|gb|EEG69877.1| hypothetical protein BIFPSEUDO_04493 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 175 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 58/145 (40%), Gaps = 13/145 (8%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I E+ S Y+IV G++V+ + + GI++ AY+ Sbjct: 7 NGIYPASESDRETNPGASLANYKIVHFGDVVYNSMRMWQGAVDASRYD----GIVSPAYV 62 Query: 322 AVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVP-PIKEQF 376 +P+ + + + A L+R L K + + G Q LKF+D + + +P EQ Sbjct: 63 VARPNSEVYARFFARLLRQPMLLKQYQQVSQGNSKDTQVLKFDDFASIGISMPASENEQR 122 Query: 377 DITNVINVETARIDVLVEKIEQSIV 401 I + R+D L+ ++ Sbjct: 123 QIGGFFD----RLDSLITLHQRKYD 143 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 18/141 (12%), Positives = 40/141 (28%), Gaps = 7/141 (4%) Query: 56 VESGTGKYLPKDGNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL 112 V G Y + + + + I G ++Y + + + +DGI S ++ Sbjct: 3 VSVANGIYPASESDRETNPGASLANYKIVHFGDVVYNSMRMWQGAVDASRYDGIVSPAYV 62 Query: 113 VLQPKDVLPELLQG---WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQV 168 V +P + + + + + +I + +P EQ Sbjct: 63 VARPNSEVYARFFARLLRQPMLLKQYQQVSQGNSKDTQVLKFDDFASIGISMPASENEQR 122 Query: 169 LIREKIIAETVRIDTLITERI 189 I I + Sbjct: 123 QIGGFFDRLDSLITLHQRKYD 143 >gi|319744170|gb|EFV96541.1| type I site-specific deoxyribonuclease chain S [Streptococcus agalactiae ATCC 13813] Length = 199 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 53/191 (27%), Gaps = 12/191 (6%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + + + + K + I + + +I PG Sbjct: 15 KYQNLSDIARITMGQSPKGETYNDDKIGLPLLNGATDFRNSISPSKWTSD--PRKIARPG 72 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 E VF + + RG ++ + + DL + + Sbjct: 73 EYVFGVRATIGLTTKIFKEYAIGRGTGSAKPI------SNIFDEYLFFALEDLFDYYANL 126 Query: 350 GSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 GSG ++ D V++P + + + + L+ I L E R Sbjct: 127 GSGTVYINISKSDFDSFKVILPIKD--QFLVDF-HKTVQPLFNLIFNNNAEIQKLSELRD 183 Query: 409 SFIAAAVTGQI 419 + + G+I Sbjct: 184 CLLPKLLPGEI 194 >gi|240047664|ref|YP_002961052.1| hypothetical protein MCJ_005500 [Mycoplasma conjunctivae HRC/581] gi|239985236|emb|CAT05249.1| PUTATIVE Uncharacterized protein MJ1218 [Mycoplasma conjunctivae] Length = 262 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 48/168 (28%), Gaps = 11/168 (6%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 N L +I+ ++ E Y + D +++ + + + + Sbjct: 16 NCNWLVHKLVDIVSYHTSKLTFSDVERKGRYPLYDANKVIGKTNKFFMKDDYIAIVKDGD 75 Query: 313 RGI-----ITSAYMAVKPHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRL 365 G SA++A + + + S + + F+D + Sbjct: 76 VGRPRFLPKNSAFIATMCALTSKNFDIYFIYSLLKLNFPIENMKVGTTIYHIYFKDYGNI 135 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P ++ Q I ID + + + + + +A Sbjct: 136 QYYFPSLEVQQKIA----KVFKNIDNFINLYKIKLEKISVIKQFLLAK 179 >gi|149915112|ref|ZP_01903640.1| type I restriction-modification system specificity determinant XF2741 [Roseobacter sp. AzwK-3b] gi|149810833|gb|EDM70672.1| type I restriction-modification system specificity determinant XF2741 [Roseobacter sp. AzwK-3b] Length = 345 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 41/307 (13%), Positives = 97/307 (31%), Gaps = 29/307 (9%) Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+T + + N+ +P EQ I + A +I+ E+ + Sbjct: 42 NAATGSTFPNVSKDQLHNLEVPDHSPFEQEEIASILGALDNKIELNRQTAATLEEMARAL 101 Query: 199 KQALVS-----YIVTKGLNPDVKMKDSGIEWV-----GLVPDHWEVKPFFALVTELNRKN 248 ++ +GL P + + + G +P+ W L+ R+ Sbjct: 102 YRSWFVDFDPVKAKAEGLAPAFMDEATAALFPDRFGEGGLPEGWTAGTLGDLIEFNPRER 161 Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + ++ + G+ + I + Sbjct: 162 ITKGADVPYLDMKALPTSGMIADPAYQR--TFTSGTKFREGDTLLARITPCLENGKTAMV 219 Query: 309 QVM---ERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKVFYA--MGSGLRQSLKFEDV 362 + E G ++ ++ ++ + L + + R D A GS RQ + + Sbjct: 220 DDLLGAEVGWGSTEFIVMRSKPGVPSALPYCVARDPDFRDEAIATMNGSSGRQRADAKSI 279 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE---QSIVLLKERRSSFIAAAVTGQI 419 +L VPP V+ + ++ +I + L R + + ++G++ Sbjct: 280 SQLKCAVPP-------VMVLTSFGQQTAPMIARIHAFGRENQTLAALRDTLLPKLMSGEL 332 Query: 420 DLRGESQ 426 + GE++ Sbjct: 333 RV-GEAR 338 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 49/131 (37%), Gaps = 12/131 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W + + N G D+ Y+ ++ + + P + Q ++ + Sbjct: 141 LPEGWTAGTLGDLIEFNPRERITKGADVPYLDMKALPTSGMIADP----AYQRTFTSGTK 196 Query: 81 FAKGQILYGKLGPY---LRKAIIAD----FDGICSTQFLVLQPKDVLPEL-LQGWLLSID 132 F +G L ++ P + A++ D G ST+F+V++ K +P D Sbjct: 197 FREGDTLLARITPCLENGKTAMVDDLLGAEVGWGSTEFIVMRSKPGVPSALPYCVARDPD 256 Query: 133 VTQRIEAICEG 143 A G Sbjct: 257 FRDEAIATMNG 267 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 11/66 (16%), Positives = 25/66 (37%), Gaps = 4/66 (6%) Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + A ++ + + L V EQ +I +++ +D +E Q+ L+ Sbjct: 40 LLNAATGSTFPNVSKDQLHNLEVPDHSPFEQEEIASILGA----LDNKIELNRQTAATLE 95 Query: 405 ERRSSF 410 E + Sbjct: 96 EMARAL 101 >gi|183508621|ref|ZP_02689854.2| type I restriction enzyme S protein [Ureaplasma parvum serovar 14 str. ATCC 33697] gi|182676080|gb|EDT87985.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 14 str. ATCC 33697] Length = 297 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 16/148 (10%), Positives = 39/148 (26%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + S + + L+ + +V Sbjct: 8 INEFCPNGVEFKKLKNIITVAPKSPFGVTKLLKMEKGNYLTITSGKKSFYVDNFLVDGEY 67 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 ND + + + A K + + YL + + + ++ Sbjct: 68 IFVNDGGQADIKYNFGKTMYSDHIFAFKVNEYNIKYLYFYLLNISNFINKKLFIGSTLKN 127 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINV 384 L ++ L + +PPI Q I +++ Sbjct: 128 LNKKEFLNLAIPIPPISIQNKIVEILDK 155 >gi|237726586|ref|ZP_04557067.1| type I restriction-modification system specificity determinant [Bacteroides sp. D4] gi|229435112|gb|EEO45189.1| type I restriction-modification system specificity determinant [Bacteroides dorei 5_1_36/D4] Length = 184 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 23/181 (12%), Positives = 59/181 (32%), Gaps = 14/181 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILS--LSYGNIIQKLETRNMGLKPESYETYQ 284 +PD W V L +N E + + N + + ++ E Sbjct: 1 QLPDGWCVVTLKDLCENINGLWKGKKEPFVNVGVIRNANFTKDFKLDYSNIEYIDVEQRT 60 Query: 285 I----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS------AYMAVKPHGIDSTYLA 334 ++ G+++ ++ R+ + + S A + S +L Sbjct: 61 FAKRHLENGDLIVEKSGGSDNNPVGRTILYEGKSGVFSFSNFTMALRTRNNDIVLSKFLY 120 Query: 335 WLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + + + ++L + + + +PP+ EQ I + I +D++ Sbjct: 121 YYILAKYQKGDMRLMQTQTTGLRNLILDKFLSMLIHLPPLSEQKRIIDRIETIFTSLDMI 180 Query: 393 V 393 + Sbjct: 181 M 181 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 59/183 (32%), Gaps = 19/183 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG------KYLPKDGNSRQS 73 +P W VV +K + G GK ++ + + + Y + + Sbjct: 1 QLPDGWCVVTLKDLCENINGL--WKGKKEPFVNVGVIRNANFTKDFKLDYSNIEYIDVEQ 58 Query: 74 DTSTVSIFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVLQPKDVLP----ELL 124 T G ++ K G P R + G+ S + + Sbjct: 59 RTFAKRHLENGDLIVEKSGGSDNNPVGRTILYEGKSGVFSFSNFTMALRTRNNDIVLSKF 118 Query: 125 QGWLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + + T + + ++ + +PPL+EQ I ++I +D Sbjct: 119 LYYYILAKYQKGDMRLMQTQTTGLRNLILDKFLSMLIHLPPLSEQKRIIDRIETIFTSLD 178 Query: 183 TLI 185 ++ Sbjct: 179 MIM 181 >gi|313472058|ref|ZP_07812550.1| type I restriction-modification system, S subunit, EcoA family [Lactobacillus jensenii 1153] gi|313449060|gb|EEQ69088.2| type I restriction-modification system, S subunit, EcoA family [Lactobacillus jensenii 1153] Length = 345 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 82/214 (38%), Gaps = 19/214 (8%) Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS-YG 262 ++ + L P V+ + W + V + RKN L + L++S Sbjct: 18 THADEQRLYPKVRFRGFDEPW--------KKVKLGRNVKRIRRKNKNLETNIPLTISAQF 69 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYM 321 ++ + + + E+ Y ++ GE + + ++ + G +++ Y+ Sbjct: 70 GLVDQRDFFGRVVASENLANYILLKRGEFAYNKSYSKEAPYGSIKRLEKYNEGALSTLYI 129 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVPPIKEQF 376 A P I+S +L + + + G R ++ +D + + +P EQ Sbjct: 130 AFTPENINSDFLKAFFDTTKWYSHIVQVSTEGARNHGLLNISPQDFFEMSITIPKSDEQN 189 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +I+ + N+ + L+ ++ + L K+ + Sbjct: 190 NISRIYNLM----NSLLSLQQRKLELEKQIFYAL 219 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 35/334 (10%), Positives = 85/334 (25%), Gaps = 43/334 (12%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK V + R K + +I I + + + + + + + Sbjct: 38 WKKVKLGRNVKRIRRKNKNLETNIPLTISAQFGLVDQRDFFGR--VVASENLANYILLKR 95 Query: 84 GQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G+ Y K +G ST ++ P+++ + L+ + + I Sbjct: 96 GEFAYNKSYSKEAPYGSIKRLEKYNEGALSTLYIAFTPENINSDFLKAFFDTTKWYSHIV 155 Query: 139 AICEGATMSH----ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + +H + + + IP EQ I + + ++ Sbjct: 156 QVSTEGARNHGLLNISPQDFFEMSITIPKSDEQNNISRIYNLMNSLLSLQQRKLELEKQI 215 Query: 195 LKEKKQALVSY-IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIE 253 K + + + G +K K + P+ ++ N Sbjct: 216 FYALKTHIFAKDLFFNGQKDMIKYKLKDVS-NMYQPETITATQMSTNGYKVFGAN----- 269 Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 Y ++ + + V Sbjct: 270 ------GYIGHYYNFNHKDDAIT----------------ICARGASTGAVNFVPGPVWIT 307 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 G S + + I+ Y + + + +L + Sbjct: 308 G--NSMVVDIDSKLINQLYFYYYLTTLNLKNILQ 339 >gi|224284021|ref|ZP_03647343.1| type I restriction-modification system DNA specificity subunit [Bifidobacterium bifidum NCIMB 41171] gi|313141179|ref|ZP_07803372.1| restriction modification system DNA specificity domain-containing protein [Bifidobacterium bifidum NCIMB 41171] gi|313133689|gb|EFR51306.1| restriction modification system DNA specificity domain-containing protein [Bifidobacterium bifidum NCIMB 41171] Length = 201 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 62/175 (35%), Gaps = 8/175 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 + + F N+ N +L ++ + + ++ Y IV Sbjct: 33 DPWEQRKFVDFVEASGIRNKDNLQLESYSVSNDRGFVPQDEQFENGGTMRDADKTAYWIV 92 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKV 345 +PG + + + S+ + I++S Y + D +L +S K Sbjct: 93 EPGSFAYNP--ARINVGSIGYQSTRKNVIVSSLYEVLKTDRSCDDRFLWHWFKSSLFTKQ 150 Query: 346 FYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + G+R F+ +++ + +P + EQ I + ++D L+ ++ Sbjct: 151 IEMLQEGGVRLYFFFDKLQKSEIWMPNVDEQRIIG----QQFDQLDSLITLHQRK 201 >gi|197302010|ref|ZP_03167073.1| hypothetical protein RUMLAC_00740 [Ruminococcus lactaris ATCC 29176] gi|197298958|gb|EDY33495.1| hypothetical protein RUMLAC_00740 [Ruminococcus lactaris ATCC 29176] Length = 1196 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 66/181 (36%), Gaps = 15/181 (8%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLS-YGNIIQKLETRNMGLKPESYETYQIVDP 288 + WE + LV + RKN L+ L++S +I + E + + + Y +++ Sbjct: 4 NDWEQRKLVDLVDRVTRKNQDLVSELPLTISAQYGLIDQNEFFDKRVASKDVSGYYLIEN 63 Query: 289 GEIVFRF-IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS----TYLAWLMRSYDLC 343 GE + +++ + G++++ Y+ + +++ + Sbjct: 64 GEFAYNKSTSTDAPWGAIKRLDRYKNGVLSTLYIVFGIKENNPVDSDFLVSYYSTNLWHK 123 Query: 344 KVFYAMGSGLRQ----SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQ 398 + G R ++ D +++P I+EQ I ++ L+ + Sbjct: 124 GIHEIAAEGARNHGLLNIAPADFFETKLMIPQDIEEQKKIGKY----FEELERLITLHHR 179 Query: 399 S 399 Sbjct: 180 K 180 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 53/181 (29%), Gaps = 12/181 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIY-IGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 W+ + T + + ++ I + ++ K D S + Sbjct: 4 NDWEQRKLVDLVDRVTRKNQDLVSELPLTISAQYGLIDQNEFFDKR--VASKDVSGYYLI 61 Query: 82 AKGQILYGKLGPYLRKAIIAD-----FDGICSTQFLVLQPKDVLPEL----LQGWLLSID 132 G+ Y K +G+ ST ++V K+ P + + ++ Sbjct: 62 ENGEFAYNKSTSTDAPWGAIKRLDRYKNGVLSTLYIVFGIKENNPVDSDFLVSYYSTNLW 121 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 E EGA + + + + ++KI ++ LIT R Sbjct: 122 HKGIHEIAAEGARNHGLLNIAPADFFETKLMIPQDIEEQKKIGKYFEELERLITLHHRKQ 181 Query: 193 E 193 Sbjct: 182 N 182 >gi|269978330|gb|ACZ55899.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 330 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 38/367 (10%), Positives = 92/367 (25%), Gaps = 46/367 (12%) Query: 50 YIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS 108 +I D+ P+ + + + IL G +G + D + Sbjct: 2 FITPNDLHGTYRIIKTPRTLSDSGLKSIQNNTIDNTSILVGCIGDVGMVRMCFDKCA-TN 60 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 Q + + + + + I + I + +P + Q Sbjct: 61 QQINSITDIKDFCNPYYLYYYLSNKKELFKNIALSTVVPIIPKTIFQEIEILLPNIKTQQ 120 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + +I+ Sbjct: 121 KIARTLSILDQKIENNHKINELL------------------------------------- 143 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 H + + KN KL + I + +++ + + + P Sbjct: 144 --HNLAHKVYEYYFKYKPKNAKLEQIIIENPKSNIMVKNAQKTQDKYPFFTSGDNILSYP 201 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 I+ N + + + ++ + + S YL L+ S Sbjct: 202 KAIIDGRNCFLNTGGNAGIKFYVGKASYSTDTWCICANEF-SDYLYLLLSSIKNHINQSF 260 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + L+ +K+ P+ +P E +I L+ ++ L++ R Sbjct: 261 FQGTSLKHLQKNLLKKYPIYMPSAHEIKKFNQIIMPLL----TLISINTRTSKKLEQIRD 316 Query: 409 SFIAAAV 415 + + Sbjct: 317 FLLPLLL 323 >gi|310287617|ref|YP_003938875.1| truncated HsdS specificity protein of Type I restriction-modification system [Bifidobacterium bifidum S17] gi|309251553|gb|ADO53301.1| truncated HsdS specificity protein of Type I restriction-modification system [Bifidobacterium bifidum S17] Length = 168 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 16/111 (14%), Positives = 39/111 (35%), Gaps = 6/111 (5%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + +L A++ + + + P L + + + + Sbjct: 3 TRSGILRHTLPVAELRKPSTVNQDIRVILPQGECCGEWLLQFFISHNKELLLEFGKTGTT 62 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 +S+ F +K + + +P EQ I + A++D L+ ++ LK Sbjct: 63 VESVDFGKIKDMLLYMPSTVEQQQIGDF----FAKLDSLITLHQRKRQWLK 109 >gi|93007190|ref|YP_581627.1| N-6 DNA methylase [Psychrobacter cryohalolentis K5] gi|92394868|gb|ABE76143.1| N-6 DNA methylase [Psychrobacter cryohalolentis K5] Length = 600 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 51/150 (34%), Gaps = 16/150 (10%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII---- 316 G+I + N+ + + P +I+F + Sbjct: 446 IGDISVPTKEANISESERAKNQTGFLQPNDIIFILKGSAGKLGIVPEDVPTTGDRCWMVN 505 Query: 317 -TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKE 374 ++ + ++ L ++S + G ++ +++K++PV+VP ++E Sbjct: 506 RSAIVIRTISDKVNPKVLYAYLKSDIGQTQISGLIKGATIPNISLKELKQIPVIVPSLEE 565 Query: 375 Q-FDITNVINVETARIDVLVEKIEQSIVLL 403 + I ID E +++I L Sbjct: 566 REQAIAC--------IDKSRE-TQKAIQKL 586 >gi|227890486|ref|ZP_04008291.1| possible type I RM system S subunit [Lactobacillus johnsonii ATCC 33200] gi|227848957|gb|EEJ59043.1| possible type I RM system S subunit [Lactobacillus johnsonii ATCC 33200] Length = 171 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 19/164 (11%), Positives = 47/164 (28%), Gaps = 8/164 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-----KPESYE 281 P+ + + L ++ + Sbjct: 8 KYPEKALENYINFITSGSRGWAKYLTPKGKAWFLTIKNVKNSHIVINNIQSVEPPDSKEA 67 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA--YMAVKPHGIDSTYLAWLMRS 339 V G+++ + S I + + I+ Y ++ + + Sbjct: 68 QRTKVKEGDLLISITADLGRTGVVSSDIASHGTYINQHLTCIRLNTEFINPVYASYFLET 127 Query: 340 YDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + F +G++ L F+ +K L +++PPIK Q + + Sbjct: 128 VAGKRQFNSKNQNGVKAGLNFDAIKSLKIIIPPIKRQNSFVSFV 171 >gi|323939694|gb|EGB35898.1| type I restriction modification DNA specificity domain-containing protein [Escherichia coli E482] Length = 249 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 63/216 (29%), Gaps = 26/216 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + +P+ + L G T K DI + ++D+ Sbjct: 17 EWLPLSKVFNLRNGYTPSKTKKEFWANGDIPWFRMDDIRENGRILGNSLQKISSCAVKGG 76 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL---QGWLLSIDVTQ 135 +F + IL A+I + + +F L K+ + + + + Sbjct: 77 KLFPENSILISTSATIGEHALITVPH-LANQRFTCLALKESYADCFDIKFLFYYCFSLAE 135 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITER 188 ++ + D G +P P LA Q I + + L E Sbjct: 136 WCRKNTTMSSFASVDMDGFKKFLIPRPCPDNPEKSLAIQSEIVRILDKFSELTAELTAEL 195 Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 + + K++ ++ +S +EW Sbjct: 196 TAELNMRKKQYNYYRDQLL--------SFDESSVEW 223 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 22/210 (10%), Positives = 57/210 (27%), Gaps = 19/210 (9%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL---ETRNM 273 M +EW+ L +V T K +I +I + Sbjct: 11 MDGVEVEWLPLS----KVFNLRNGYTPSKTKKEFWANGDIPWFRMDDIRENGRILGNSLQ 66 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + + ++ I+ + + + + A D +L Sbjct: 67 KISSCAVKGGKLFPENSILISTSATIGEHALITVPHLANQRFTCLALKESYADCFDIKFL 126 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIKEQFDITNVINVET 386 + S S+ + K+ + P + Q +I +++ + Sbjct: 127 FYYCFSLA-EWCRKNTTMSSFASVDMDGFKKFLIPRPCPDNPEKSLAIQSEIVRILDKFS 185 Query: 387 ARIDVLVEKIEQSIVLLKE----RRSSFIA 412 L ++ + + K+ R ++ Sbjct: 186 ELTAELTAELTAELNMRKKQYNYYRDQLLS 215 >gi|291529889|emb|CBK95474.1| Type I restriction modification DNA specificity domain [Eubacterium siraeum 70/3] Length = 174 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 52/133 (39%), Gaps = 10/133 (7%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAV---KPHGID 329 + Y+++ + + + D+R + E I++ AY ++ Sbjct: 42 NVIGTDLSKYKLITKDKFACNPMHVGRDERLPVALYTEDEPAIVSPAYFMFEIIDNSILN 101 Query: 330 STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL R + ++ + +R + ++D+ R+ V VPP+ EQ +I T R Sbjct: 102 EDYLMMWFRRPEFDRLCWLRTDGSVRGGITWDDICRMKVPVPPLDEQIEIVQSYQAITDR 161 Query: 389 IDVLVEKIEQSIV 401 I +++ I Sbjct: 162 I-----ALKKQIN 169 >gi|225352854|ref|ZP_03743877.1| hypothetical protein BIFPSEUDO_04488 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225352864|ref|ZP_03743887.1| hypothetical protein BIFPSEUDO_04498 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156304|gb|EEG69873.1| hypothetical protein BIFPSEUDO_04498 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156314|gb|EEG69883.1| hypothetical protein BIFPSEUDO_04488 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 173 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 31/143 (21%), Positives = 58/143 (40%), Gaps = 13/143 (9%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I E+ S Y+IV G++V+ + + GI++ AY+ Sbjct: 7 NGIYPASESDRETNPGASLANYKIVHFGDVVYNSMRMWQGAVDASRYD----GIVSPAYV 62 Query: 322 AVKPH-GIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVP-PIKEQF 376 +P+ + + + A L+R L K + + G Q LKF+D + + +P EQ Sbjct: 63 VARPNSEVYARFFARLLRQPMLLKQYQQVSQGNSKDTQVLKFDDFASIGISMPASENEQR 122 Query: 377 DITNVINVETARIDVLVEKIEQS 399 I + R+D L+ ++ Sbjct: 123 QIGGFFD----RLDSLITLHQRK 141 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 18/141 (12%), Positives = 40/141 (28%), Gaps = 7/141 (4%) Query: 56 VESGTGKYLPKDGNSRQS---DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFL 112 V G Y + + + + I G ++Y + + + +DGI S ++ Sbjct: 3 VSVANGIYPASESDRETNPGASLANYKIVHFGDVVYNSMRMWQGAVDASRYDGIVSPAYV 62 Query: 113 VLQPKDVLPELLQG---WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQV 168 V +P + + + + + +I + +P EQ Sbjct: 63 VARPNSEVYARFFARLLRQPMLLKQYQQVSQGNSKDTQVLKFDDFASIGISMPASENEQR 122 Query: 169 LIREKIIAETVRIDTLITERI 189 I I + Sbjct: 123 QIGGFFDRLDSLITLHQRKYC 143 >gi|186701786|ref|ZP_02971464.1| restriction modification enzyme subunit s2a [Ureaplasma parvum serovar 6 str. ATCC 27818] gi|186701064|gb|EDU19346.1| restriction modification enzyme subunit s2a [Ureaplasma parvum serovar 6 str. ATCC 27818] Length = 380 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 46/388 (11%), Positives = 108/388 (27%), Gaps = 24/388 (6%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD---TSTVSIFAK 83 + + + G I + +E G Y + ++ + K Sbjct: 3 IYKLYELVNIYKGSN--------LITKKYIEQNKGIYPVISSKTTENGVYGFINTYDYEK 54 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA--IC 141 +I G + + + LV ++ + L++ + I Sbjct: 55 DKITMSSDGENAGTTFWQEKNFSLTNHALVFIMNKLIKYNYKYLFLTLKKHESKIKDLII 114 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL-----ITERIRFIELLK 196 G+T + +I + +P + EQ I I I+ + Sbjct: 115 SGSTRPGVSLNLLKSINIKLPSIEEQDAIISIIEPIEKLFVKYSNLVDISSVENVKRDID 174 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + V + +K+ + ++ F K K Sbjct: 175 NLISIIKPLDVLENKINKLKITLKKLLTNLYDKNYNSHVNLFENNKIYTNKYLKQNLYCD 234 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 S I + N+ L+ + + I+F + +N E + Sbjct: 235 TSCIGELEINFSKMINISLEDKPSRADLSIKNNSIIFSKLLGENKVYC---FLNNENIVF 291 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQ 375 ++ + +K + ++ L + S D + +G + D+ ++ P + Sbjct: 292 STGFFNIKSNNENNDDLLSFLLSSDFKNQKSMLANGTTMIGINNSDLTKIRCKAPFLN-- 349 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLL 403 +I + I+ + IV L Sbjct: 350 SNIYFTFFNKLNEIENKITLTRNKIVYL 377 >gi|295110204|emb|CBL24157.1| Restriction endonuclease S subunits [Ruminococcus obeum A2-162] Length = 303 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 41/287 (14%), Positives = 83/287 (28%), Gaps = 24/287 (8%) Query: 29 PIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQS---DTSTV 78 +K L G+T K+ +I + D+ + TGKY+ + D S + Sbjct: 4 KLKDIFDLQMGKTPSRNHTEYWNTKEHKWISIADL-TKTGKYISETKECLSDCAIDDSGI 62 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + ++ + AI + + + K V L + E Sbjct: 63 KVIPANTVVMSFKLSIGKTAITVEDMYS-NEAIMAFHDKHVAEILPEYIYYMFKYKNWDE 121 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + + + + + I L EQ I + + + + TE +L Sbjct: 122 GSNKAVMGKTLNKATLSEVEIDICSLEEQREIVKVLDKMMTVLGSRETELSLLDDL---- 177 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + V + KD I + K A R+ L +N+ Sbjct: 178 ---IKARFVEMFGDVIHNSKDWPIYTFSEITSSRLGKMLDAKKQTGKRRYPYLANTNVKW 234 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + +LE N E+ + G+++ Sbjct: 235 FRF-----ELENLNQMDFDEAERVEFELKDGDLLVCEGGEIGRCAVW 276 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 50/184 (27%), Gaps = 25/184 (13%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY--------- 283 + K + + I + G + Sbjct: 1 MKYKLKDIFDLQMGKTPSRNHTEYWNTKEHKWISIADLTKTGKYISETKECLSDCAIDDS 60 Query: 284 --QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 +++ +V F I+ A+ I Y+ ++ + + Sbjct: 61 GIKVIPANTVVMSFKLSIGKTAITVEDMYSNEAIM--AFHDKHVAEILPEYIYYMFKYKN 118 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV----------ETARIDV 391 + G ++L + + + + ++EQ +I V++ E + +D Sbjct: 119 WDEGSNKAVMG--KTLNKATLSEVEIDICSLEEQREIVKVLDKMMTVLGSRETELSLLDD 176 Query: 392 LVEK 395 L++ Sbjct: 177 LIKA 180 >gi|240047663|ref|YP_002961051.1| hypothetical protein MCJ_005490 [Mycoplasma conjunctivae HRC/581] gi|239985235|emb|CAT05248.1| PUTATIVE Uncharacterized protein MJ1218 [Mycoplasma conjunctivae] Length = 138 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 17/106 (16%), Positives = 36/106 (33%), Gaps = 6/106 (5%) Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 ++ + D ++ L+ SY + + + + F++ + Sbjct: 34 FLPTNTAFCSTMSALTSKNNFDIYFIYSLLSSYFPIESI--ISGTTIKHIYFKNYGQFEY 91 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 VP IKEQ I ID L+ E + ++ + S + Sbjct: 92 FVPSIKEQQKIA----KVFENIDNLLNLYELKLQKIEMIKKSLLDK 133 >gi|68250155|ref|YP_249267.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae 86-028NP] gi|68058354|gb|AAX88607.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae 86-028NP] Length = 421 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 53/196 (27%), Gaps = 11/196 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTG---RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W+V + G + D ++ + + Y D ++ + Sbjct: 233 EVPKGWEVKALDEIANYQNGLALQKFRPEDDEPFLPVVKIAQLRQGYADGDEKAKAN-IK 291 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I G +++ G L I + + K+ + + Sbjct: 292 PECIIDNGDVIFSWSGSLL-VDIWCGGKAALNQHLFKVSSKEYPKWFYYFYTKHHLTEFQ 350 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 A + TM H + + +P I I L+ Sbjct: 351 RIAYDKAVTMGHIKREHLSAAKCIVPNDEL------LANKTLENILEKIIFNRLENFNLQ 404 Query: 197 EKKQALVSYIVTKGLN 212 + L+ ++ LN Sbjct: 405 NTRDLLLPRLLNGELN 420 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 11/104 (10%), Positives = 35/104 (33%), Gaps = 7/104 (6%) Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + + + G ++ +L++S D +L + + + Sbjct: 63 YIEKDFFPLNTTLYVKDFKGHYPRFIYYLLKSIDFTSF---NVGTGVPTLNRNHLSSILI 119 Query: 368 LVPPIKEQFDITNVINVETARI--DVLVEKIEQSIVLLKERRSS 409 I+++ +I N++ +I + + + + I + S Sbjct: 120 SDLGIEKEKEIANILGSLDQKIQLNTQINQTLEQIA--QALFKS 161 Score = 40.2 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 54/445 (12%), Positives = 120/445 (26%), Gaps = 84/445 (18%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 +P+ F L G S K I +P ++ + ++ Sbjct: 4 IPLNEFITLQRGFDLPSNKRI------------SGSVPVVASTGIAGYHNEIKVKAPGVV 51 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G+ G I +T V K P + L SID T + G + Sbjct: 52 IGRSGSIGGGQYIEKDFFPLNTTLYVKDFKGHYPRFIYYLLKSIDFT----SFNVGTGVP 107 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS--- 204 + + +I + + ++ I + + +I ++ + ++ Sbjct: 108 TLNRNHLSSILISDLGIEKEKEIANILGSLDQKIQLNTQINQTLEQIAQALFKSWFVDFD 167 Query: 205 ------YIVTKGL------------------------------------NPDVKMKDSGI 222 ++ GL + Sbjct: 168 PVRAKVQALSDGLSLEQAELAAIQAISGKTPEELTALSQTQPERYAELAETAKAFPCEMV 227 Query: 223 EWVG-LVPDHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 E G VP WEVK + N K + L + +++ Sbjct: 228 EVDGVEVPKGWEVKALDEIANYQNGLALQKFRPEDDEPFLPVVKIAQLRQGYADGDEKAK 287 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + + I+D G+++F + L + + V + + Sbjct: 288 ANIKPECIIDNGDVIFSWSGSL-----LVDIWCGGKAALNQHLFKVSSKEY-PKWFYYFY 341 Query: 338 RSY---DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV-INVETARIDVLV 393 + + ++ Y +K E + +VP + + N + +I + Sbjct: 342 TKHHLTEFQRIAYDKAV-TMGHIKREHLSAAKCIVPNDEL---LANKTLENILEKI--IF 395 Query: 394 EKIEQSIVLLKERRSSFIAAAVTGQ 418 ++E L+ R + + G+ Sbjct: 396 NRLENF--NLQNTRDLLLPRLLNGE 418 >gi|313112143|ref|ZP_07797924.1| hypothetical protein PA39016_004130022 [Pseudomonas aeruginosa 39016] gi|310884426|gb|EFQ43020.1| hypothetical protein PA39016_004130022 [Pseudomonas aeruginosa 39016] Length = 180 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 6/132 (4%) Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++ G+I N+K +L + Q ++ K + YL WL+ Sbjct: 53 PLLQSGDIAVIARG-DNNKAALFTGQQPVVATSQFFIVSTKKQDVLPEYLCWLINLPQSQ 111 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD-ITNVINVETARIDVLVEKIEQSIVL 402 + GS ++ + + + + +PP+ Q I + D L+ +++ + Sbjct: 112 RSLERSGSAIQA-ISKASLLDMRIPLPPLATQQKLIA--LQALWDEEDELIARLQTNREQ 168 Query: 403 -LKERRSSFIAA 413 L+ I Sbjct: 169 MLQGIYQHLIKD 180 >gi|315641377|ref|ZP_07896452.1| type I restriction enzyme specificity protein [Enterococcus italicus DSM 15952] gi|315482870|gb|EFU73391.1| type I restriction enzyme specificity protein [Enterococcus italicus DSM 15952] Length = 152 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 49/132 (37%), Gaps = 13/132 (9%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS---AYMAVKPHG 327 + + + + G+++F + + + G I S K Sbjct: 29 YGDEKLYRKWMSGRELKKGQVLFTTEAPMGNVAQVP----DDNGYILSQRTVAFETKEDM 84 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVE 385 + + +LA L++S + A+ SG + + + +K L + VP I EQ I + Sbjct: 85 MTNDFLAVLLKSPLVFNNLSALSSGGTAKGVSQKSLKGLSITVPLDIDEQQKIGSF---- 140 Query: 386 TARIDVLVEKIE 397 ++D + + Sbjct: 141 FKQLDETIALHQ 152 >gi|328946728|gb|EGG40866.1| hypothetical protein HMPREF9397_0251 [Streptococcus sanguinis SK1087] Length = 178 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 52/131 (39%), Gaps = 7/131 (5%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER---GIITSAYMAVK 324 + + G+ + I + +++ G ++ ++ V+ Sbjct: 38 FTRDIPEFEYLEFRGGTKFRNGDTLMARITPSLENGKTSKVNLLDEDEVGFGSTEFIVVR 97 Query: 325 P--HGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + D ++ +LM + ++ + + +G+ RQ ++ + VK +L PP+KEQ I Sbjct: 98 SKENISDENFVYYLMIAPNIREVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGK 157 Query: 381 VINVETARIDV 391 + +I+ Sbjct: 158 TLKALDDKIEN 168 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 65/179 (36%), Gaps = 14/179 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +WK V + + N T G I +E++E T + + + F Sbjct: 2 NNWKKVKLSDIIEFNPRETLSKGAIAKKIAMENLEPFTRDIPEFEY----LEFRGGTKFR 57 Query: 83 KGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G L ++ P L D G ST+F+V++ K+ + + + L I Sbjct: 58 NGDTLMARITPSLENGKTSKVNLLDEDEVGFGSTEFIVVRSKENISDENFVYYLMIAPNI 117 Query: 136 R---IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 R I+++ + + N + PPL EQ+ I + + A +I+ Sbjct: 118 REVAIKSMVGTSGRQRVQLDVVKNHEILCPPLKEQIRIGKTLKALDDKIENNKKINHHL 176 >gi|257465466|ref|ZP_05629837.1| restriction modification system DNA specificity subunit [Actinobacillus minor 202] gi|257451126|gb|EEV25169.1| restriction modification system DNA specificity subunit [Actinobacillus minor 202] Length = 191 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 49/140 (35%), Gaps = 3/140 (2%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYL 333 K + + + G+I+ + ++ + + + ++P I YL Sbjct: 51 KLDRVKENDWLRKGDILLATRGNNYQPIFVEFSRQNLPAVASPHFFVIRPKNAEILPEYL 110 Query: 334 AWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 W + K + + +SL+ + L + +P + +Q I ++ L Sbjct: 111 QWWLNLKQSQKYLIQNLEGSITKSLRLPALAELSIKIPSLAKQNVIVQMVKTLAQERKTL 170 Query: 393 VEKIEQSIVLLKERRSSFIA 412 + IE + L+ I+ Sbjct: 171 QKLIENNEKLMNALAQELIS 190 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 69/187 (36%), Gaps = 12/187 (6%) Query: 30 IKRFTKLNTG-----RTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 ++ + TG + + +++ + ++D G ++ K Sbjct: 4 LEDVANIQTGFLFRAKVPEDPNGNVVVVQMKDCSFFDGIAWDNCVRTKLDRVKENDWLRK 63 Query: 84 GQILYGKLGPYLRKAII----ADFDGICSTQFLVLQPKD--VLPELLQGWLLSIDVTQRI 137 G IL G + + + + S F V++PK+ +LPE LQ WL + + Sbjct: 64 GDILLATRGNNYQPIFVEFSRQNLPAVASPHFFVIRPKNAEILPEYLQWWLNLKQSQKYL 123 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 EG+ + + + IP LA+Q +I + + TL +L+ Sbjct: 124 IQNLEGSITKSLRLPALAELSIKIPSLAKQNVIVQMVKTLAQERKTLQKLIENNEKLMNA 183 Query: 198 KKQALVS 204 Q L+S Sbjct: 184 LAQELIS 190 >gi|73748046|ref|YP_307285.1| putative type I restriction enzyme, specificity protein [Dehalococcoides sp. CBDB1] gi|73659762|emb|CAI82369.1| putative type I restriction enzyme, specificity protein [Dehalococcoides sp. CBDB1] Length = 222 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 7/160 (4%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K N + S Q + L + + A+ P Sbjct: 67 KGIYINKTERNISQMGLQSCSATLLPQNSCLLTSRATIGECRINTIPMATNQGFAALVPK 126 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVE 385 ++Y + + + +++R+ VP +EQ I NVI Sbjct: 127 AGTNSYFLFYLTYLLKPTFVRLAAGTTYTEISKRELRRVKCRVPETEEEQAKIANVIKAV 186 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGES 425 D L ++S++L+ R+S + +TG++ L+ E+ Sbjct: 187 D---DALACTPDESLMLM---RTSLVQNLMTGKVYLKPEA 220 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 62/195 (31%), Gaps = 15/195 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSRQ---SD 74 W V + K+ G T ++G +I + D+ S G Y+ K + Sbjct: 25 WPVKTVGDIAKVIGGGTPDTGVPQYWNPAEIPWATPTDITSCKGIYINKTERNISQMGLQ 84 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + + ++ + L + + I + F L PK +L + + Sbjct: 85 SCSATLLPQNSCLLTS-RATIGECRINTIPMATNQGFAALVPKAGTNSYFLFYLTYL-LK 142 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIE 193 + G T + + + + +P EQ I I A + T + Sbjct: 143 PTFVRLAAGTTYTEISKRELRRVKCRVPETEEEQAKIANVIKAVDDALA--CTPDESLML 200 Query: 194 LLKEKKQALVSYIVT 208 + Q L++ V Sbjct: 201 MRTSLVQNLMTGKVY 215 >gi|282881753|ref|ZP_06290414.1| HsdS [Peptoniphilus lacrimalis 315-B] gi|281298403|gb|EFA90838.1| HsdS [Peptoniphilus lacrimalis 315-B] Length = 159 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 21/161 (13%), Positives = 53/161 (32%), Gaps = 9/161 (5%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ--IVDPGE 290 +V ++ K N Y + ++Y T G+ Sbjct: 1 MRYRLDEIVDVTMGQSPKSEYYNTEKNGYPFLQGNRTFGFKYPTFDTYTTVMTKSAKAGD 60 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 ++ + RG+ + ++ + ++L ++M+ Y + + Sbjct: 61 VIMSVRAPVGALNITPVDMCLGRGVCS-----LRMKNGNQSFLFYMMK-YYISHLLKKES 114 Query: 351 SGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARID 390 + S+ D+ L V +P ++EQ I + + +I+ Sbjct: 115 GTVFGSVNRNDIIGLEVDIPEDVEEQNKIARYLEMIDDKIE 155 >gi|303267753|ref|ZP_07353557.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS457] gi|302642714|gb|EFL73057.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS457] Length = 172 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 43/128 (33%), Gaps = 7/128 (5%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITS 318 + E +N+ + + V+ G+++ ++ A + + Sbjct: 43 SYDYFNSSEVKNLPIDYIPLDE-HKVEIGDVIISRMNTSELVGAAGYVWAINSDNIYLPD 101 Query: 319 AYMAVKPHGIDSTYLAWLM----RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 V + + W + ++ K + SG +++ + ++ V PP+ Sbjct: 102 RLWKVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRVPFPPLAL 161 Query: 375 QFDITNVI 382 Q + + + Sbjct: 162 QNEFADFV 169 >gi|321222503|gb|EFX47575.1| Type I restriction-modification system, specificity subunit S [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] Length = 95 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 11/51 (21%), Positives = 26/51 (50%) Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++PP++EQ +I + A D + +++ ++ + S +A A G+ Sbjct: 2 ILPPLQEQHEIVRRVEQLFAYADTIEKQVNNALTRVNSLTQSILAKAFRGE 52 >gi|269978322|gb|ACZ55895.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 355 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 42/360 (11%), Positives = 97/360 (26%), Gaps = 21/360 (5%) Query: 50 YIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS 108 +I D+ P+ + + + IL G +G + D + Sbjct: 2 FITPNDLHGTYRIIKTPRTLSDSGLKSIQNNTINNTSILVGCIGDVGMVRMCFDKCA-TN 60 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 Q + + + + + I + I + +P + Q Sbjct: 61 QQINSITDIKDFCNPYYLYYYLSNKKELFKNIAFSTVVPIIPKTIFQEIEVLLPNIETQQ 120 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 I + +D I + ELL + + L + D K + Sbjct: 121 KIARTL----SILDQKIENNHKINELLHKILELLYEQYFVRFDFLDENNKPYQTSGGKMK 176 Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 + ++ + ++ + K P ETYQ Sbjct: 177 FSKELNRLIPNDFEVKTLGELTQLKVGNKNANHSSNQGKYPFFTCSNNPLKCETYQFEGK 236 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 I+ + + + ++ P+ + L +L + Sbjct: 237 HIIISGNGNFYVTHYNGKFDAYQRTYVVN-------PNNPNHYVLIYLFVKSYTNYLKLQ 289 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + + D++ + +++P +K NV+ ++E QS L R Sbjct: 290 SRGSIIKFITKSDIENIKIVLPNLKTYTKWNNVL--------KMIENNNQSTQTLTALRD 341 >gi|239998600|ref|ZP_04718524.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae 35/02] gi|240013723|ref|ZP_04720636.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae DGI18] gi|240080305|ref|ZP_04724848.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae FA19] gi|240112517|ref|ZP_04727007.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae MS11] gi|240115257|ref|ZP_04729319.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae PID18] gi|240120793|ref|ZP_04733755.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae PID24-1] gi|240123098|ref|ZP_04736054.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae PID332] gi|240125349|ref|ZP_04738235.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae SK-92-679] gi|240127802|ref|ZP_04740463.1| Type I restriction-modification system specificity determinant [Neisseria gonorrhoeae SK-93-1035] Length = 138 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 14/126 (11%), Positives = 34/126 (26%), Gaps = 10/126 (7%) Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 +I+ I K G + + V ++ YL ++ Sbjct: 7 NDILIGNIRPYLKKIWQADCTGGTNGDV--LVIRVTDEKVNPKYLYQVLADDKFFAFNMK 64 Query: 349 MGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSI 400 G + + + +PP+ EQ I ++ + + + Sbjct: 65 HAKGAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQY 124 Query: 401 VLLKER 406 +E+ Sbjct: 125 EYYREQ 130 Score = 38.2 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 30/128 (23%), Positives = 48/128 (37%), Gaps = 2/128 (1%) Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEAIC 141 IL G + PYL+K AD G + LV++ + V P+ L L Sbjct: 7 NDILIGNIRPYLKKIWQADCTGGTNGDVLVIRVTDEKVNPKYLYQVLADDKFFAFNMKHA 66 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +GA M I +PIPPL EQ I + ++ I L +++ + Sbjct: 67 KGAKMPRGSKAAIMQYKIPIPPLPEQEKIVAILGKFDTLTHSVSEGLPHEIALRRKQYEY 126 Query: 202 LVSYIVTK 209 ++ Sbjct: 127 YREQLLAF 134 >gi|256852235|ref|ZP_05557621.1| restriction endonuclease S subunit [Lactobacillus jensenii 27-2-CHN] gi|260661733|ref|ZP_05862644.1| methylase [Lactobacillus jensenii 115-3-CHN] gi|282932024|ref|ZP_06337485.1| type-1 restriction enzyme MjaXIP specificity protein [Lactobacillus jensenii 208-1] gi|297205599|ref|ZP_06922995.1| HsdS protein [Lactobacillus jensenii JV-V16] gi|256615281|gb|EEU20472.1| restriction endonuclease S subunit [Lactobacillus jensenii 27-2-CHN] gi|260547480|gb|EEX23459.1| methylase [Lactobacillus jensenii 115-3-CHN] gi|281303851|gb|EFA95992.1| type-1 restriction enzyme MjaXIP specificity protein [Lactobacillus jensenii 208-1] gi|297150177|gb|EFH30474.1| HsdS protein [Lactobacillus jensenii JV-V16] Length = 179 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 62/166 (37%), Gaps = 13/166 (7%) Query: 29 PIKRFTKLNTGRTS---------ESGKDIIYIGLEDVESGTGKYLP---KDGNSRQSDTS 76 + K+ G T SGK I ++ +D+ S + Y+ +D S + S Sbjct: 4 KVGEIGKVIGGGTPSTKHEEYYTSSGKGIAWLTPKDLSSYSKMYIDHGSRDLTSEGYNNS 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + + K +L P IA + + F + P + L ++ Sbjct: 64 SAKLLPKDSVLISSRAPI-GYVAIAKNEIATNQGFKSIIPDKSKVYPEYLYYLMLENKLN 122 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 +E + G+T K + + IP L++Q I ++I +I+ Sbjct: 123 LEKVASGSTFKEVSGKVMKEFEVEIPSLSKQEKILNQLIPIQRKIE 168 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 55/169 (32%), Gaps = 5/169 (2%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 + +G V + K + LS SY + +R++ + + Sbjct: 5 VGEIGKVIGGGTPSTKHEEYYTSSGKGIAWLTPKDLS-SYSKMYIDHGSRDLTSEGYNNS 63 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + +++ ++ ++ +G + + + YL +LM Sbjct: 64 SAKLLPKDSVLISSRAPIGYVAIAKNEIATNQGFKS---IIPDKSKVYPEYLYYLMLENK 120 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L + + + + +K V +P + +Q I N + +I+ Sbjct: 121 L-NLEKVASGSTFKEVSGKVMKEFEVEIPSLSKQEKILNQLIPIQRKIE 168 >gi|291556520|emb|CBL33637.1| Restriction endonuclease S subunits [Eubacterium siraeum V10Sc8a] Length = 192 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 51/171 (29%), Gaps = 9/171 (5%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ---KLETRNMGLKPESYET 282 G P W + R+ + S + +++ E + Sbjct: 6 GTDPYEWGLTTLGECCKLNPRRPKDMTPDIDYSFVAMPSVSEDGRIDASIERPYSEVCKG 65 Query: 283 YQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYMAVKPHG--IDSTYLAWLMR 338 + +++F I ++N K + G ++ + ++P D +L + Sbjct: 66 FTYFAENDVLFAKITPCMENGKGGVAKGLKNGAGFGSTEFQVLRPIKGASDPYWLYIITM 125 Query: 339 SYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 G+G ++ + + + +PPI+ Q + Sbjct: 126 FPKFRSDAEKVMTGTGGQRRVPITYLSEYRIALPPIELQEQFAAFVRQSDK 176 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 51/163 (31%), Gaps = 12/163 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 P W + + KLN R + D ++ + V G+ + Sbjct: 9 PYEWGLTTLGECCKLNPRRPKDMTPDIDYSFVAMPSVSED-GRIDASIERPYSEVCKGFT 67 Query: 80 IFAKGQILYGKLGPYLRKA------IIADFDGICSTQFLVLQPKD--VLPELLQGWLLSI 131 FA+ +L+ K+ P + + + G ST+F VL+P P L + Sbjct: 68 YFAENDVLFAKITPCMENGKGGVAKGLKNGAGFGSTEFQVLRPIKGASDPYWLYIITMFP 127 Query: 132 DVTQRIEAICEGA-TMSHADWKGIGNIPMPIPPLAEQVLIREK 173 E + G + + +PP+ Q Sbjct: 128 KFRSDAEKVMTGTGGQRRVPITYLSEYRIALPPIELQEQFAAF 170 >gi|13507828|ref|NP_109777.1| hypothetical protein MPN089 [Mycoplasma pneumoniae M129] gi|12229983|sp|P75604|T1SA_MYCPN RecName: Full=Putative type-1 restriction enzyme specificity protein MPN_089; AltName: Full=S.MpnORFAP; AltName: Full=Type I restriction enzyme specificity protein MPN_089; Short=S protein gi|1673717|gb|AAB95713.1| hypothetical protein MPN_089 [Mycoplasma pneumoniae M129] Length = 335 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 46/330 (13%), Positives = 95/330 (28%), Gaps = 20/330 (6%) Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 F + + Y + S+ + K + E+ +L + Sbjct: 5 KTYDFDGEYVTWTTRWSYAGSIYYRNGKFSASSNCGI--LKVLNKEINPKFLAYALKKEA 62 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + + + + + IP+ PPL Q I + T L E + Sbjct: 63 KKFVNTTSAIPILRTQKVVEIPIDFPPLQIQEKIATILDTFTELSAELSAELSAELSAEL 122 Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 + + + +K+ E + + E+ +K E Sbjct: 123 SAELRERKKQYAFYRDYLLNLKNWKEEN------------KYYKLGEIAQKVLVGGEKPA 170 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 N + K + K E + Y E + + ++ + Sbjct: 171 DFSKEKNEVYKYPILSNNSKAEEFLVYSKTFRVEEKSITVSARGTIGAVFYRDFAYLPAV 230 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + D +L +R+ K A G L K + VP +K+Q Sbjct: 231 SLICFVP-KEEFDIRFLFHALRAIKFKKQGSATGQ-----LTVAQFKEYGIHVPSLKKQK 284 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKER 406 +I +++ + L E I I L K++ Sbjct: 285 EIAAILDPLYSFFTDLNEGIPAEIELRKKQ 314 >gi|241895013|ref|ZP_04782309.1| possible type I site-specific deoxyribonuclease specificity subunit [Weissella paramesenteroides ATCC 33313] gi|241871731|gb|EER75482.1| possible type I site-specific deoxyribonuclease specificity subunit [Weissella paramesenteroides ATCC 33313] Length = 188 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 59/170 (34%), Gaps = 8/170 (4%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 ++ S G QK + Y IV G +R + Sbjct: 16 NIIQYNEHTIENNQYPVFTSSRKGLFFQKDYYDGHQIASVDNTGYNIVPKGYFTYRHMS- 74 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQ 355 + + + GI+++ Y +D+ YL + + + G R Sbjct: 75 DDLIFKFNINDLADYGIVSTLYPVFTTTENLDAMYLMYQLNEGTEFKRFSLLQKQGGSRT 134 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + F +K L + +P IKEQ I + ++D L+ + I++L++ Sbjct: 135 YMYFSKLKELKLTIPNIKEQKSI----SELFKQLDSLITVNQDRILILQK 180 Score = 42.5 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 51/162 (31%), Gaps = 6/162 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + T E+ + ++ Y D + +I KG Sbjct: 8 WEKRKLGDNIIQYNEHTIENNQYPVFTSSRKGLFFQKDYYD-GHQIASVDNTGYNIVPKG 66 Query: 85 QILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSI--DVTQRIEA 139 Y + L + GI ST + V + L + + L+ + + Sbjct: 67 YFTYRHMSDDLIFKFNINDLADYGIVSTLYPVFTTTENLDAMYLMYQLNEGTEFKRFSLL 126 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 +G + ++ + + + + IP + EQ I E I Sbjct: 127 QKQGGSRTYMYFSKLKELKLTIPNIKEQKSISELFKQLDSLI 168 >gi|303243811|ref|ZP_07330151.1| hypothetical protein MetokDRAFT_0355 [Methanothermococcus okinawensis IH1] gi|302485747|gb|EFL48671.1| hypothetical protein MetokDRAFT_0355 [Methanothermococcus okinawensis IH1] Length = 91 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 14/78 (17%), Positives = 32/78 (41%), Gaps = 6/78 (7%) Query: 347 YAMGSGLRQSLKFEDVKRLPVLVP------PIKEQFDITNVINVETARIDVLVEKIEQSI 400 + + + ++ K L + +P +++Q +I + +I L E+ + Sbjct: 12 NILKNSVHSHFGIKEAKNLLIPIPYKDGKPDLQKQKEIAKYLENLHNKIKRLENLQEKQL 71 Query: 401 VLLKERRSSFIAAAVTGQ 418 L KE + S + A G+ Sbjct: 72 NLFKELKESILNKAFKGE 89 >gi|294793173|ref|ZP_06758319.1| HsdS, type I site-specific deoxyribonuclease [Veillonella sp. 6_1_27] gi|294456118|gb|EFG24482.1| HsdS, type I site-specific deoxyribonuclease [Veillonella sp. 6_1_27] Length = 223 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/178 (12%), Positives = 57/178 (32%), Gaps = 5/178 (2%) Query: 247 KNTKLIESNILSLSYGN-IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K E I ++ + + K + + G S + ++ + + Sbjct: 47 KPEYYSEKGIAWITPKDLSLNKSKFISHGEIDISELGFSKSSATKMPTGTVLFSSRAPIG 106 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 A + +V P+ T + + + L + + + +K + Sbjct: 107 YIAIAANEVTTNQGFKSVVPNENVGTVFIYYLLKFLLPTIEGMASGSTFKEISGAGMKSV 166 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 PV++P + + N I E +E L + + + ++G++D+ Sbjct: 167 PVVIPDNET----IDKFNAFCTPIFQQQEVLEAENSRLVDIIDALLPKLISGELDVSD 220 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 68/198 (34%), Gaps = 16/198 (8%) Query: 25 WKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYL---PKDGNSRQSD 74 WK + + G T K I +I +D+ K++ D + Sbjct: 26 WKDGVLSDLGTIVAGGTPSKTKPEYYSEKGIAWITPKDLSLNKSKFISHGEIDISELGFS 85 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S+ + G +L+ P AI A+ + F + P + + + + L + Sbjct: 86 KSSATKMPTGTVLFSSRAPIGYIAIAANEV-TTNQGFKSVVPNENV-GTVFIYYLLKFLL 143 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 IE + G+T G+ ++P+ IP +K A I Sbjct: 144 PTIEGMASGSTFKEISGAGMKSVPVVIPD----NETIDKFNAFCTPIFQQQEVLEAENSR 199 Query: 195 LKEKKQALVSYIVTKGLN 212 L + AL+ +++ L+ Sbjct: 200 LVDIIDALLPKLISGELD 217 >gi|57242466|ref|ZP_00370404.1| Type I restriction modification DNA specificity domain protein [Campylobacter upsaliensis RM3195] gi|57016751|gb|EAL53534.1| Type I restriction modification DNA specificity domain protein [Campylobacter upsaliensis RM3195] Length = 213 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 59/190 (31%), Gaps = 9/190 (4%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 P+ W ++ + K+ G T D +++ + ++ + + Sbjct: 25 PQGWDIIKLGEVCKILIGGTPARNNSAYFQGDNLWVSIAEMNGQVITDTKEKISDEAIKK 84 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP--ELLQGWLLSIDV 133 S V + KG L + K IA D + L P D ++ ++ Sbjct: 85 SNVKLIPKGTTLLS-FKLSIGKTAIAGKDLYTNEAIAGLIPNDNNKLLDMFLFYIFKWQT 143 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 S + +P+PPL Q I + I + I L + F Sbjct: 144 IDLDLKGNNAFGKSLNSSVLKQEVKIPLPPLEAQESIVQAIESVENEITKLKEQSKTFES 203 Query: 194 LLKEKKQALV 203 E ++ + Sbjct: 204 KKAEILKSFL 213 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 59/190 (31%), Gaps = 21/190 (11%) Query: 228 VPDHWEVKPFFALVTELNRK-----NTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 P W++ + L N+ + + L +S + ++ T + Sbjct: 24 PPQGWDIIKLGEVCKILIGGTPARNNSAYFQGDNLWVSIAEMNGQVITDTKEKISDEAIK 83 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS--TYLAWLMRSY 340 V I L ++A + A + P+ + + + + Sbjct: 84 KSNVKL--IPKGTTLLSFKLSIGKTAIAGKDLYTNEAIAGLIPNDNNKLLDMFLFYIFKW 141 Query: 341 DLCKVFYAMGSGLRQSLKFEDVK-RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + +SL +K + + +PP++ Q I +E +E Sbjct: 142 QTIDLDLKGNNAFGKSLNSSVLKQEVKIPLPPLEAQESIV-----------QAIESVENE 190 Query: 400 IVLLKERRSS 409 I LKE+ + Sbjct: 191 ITKLKEQSKT 200 >gi|308270739|emb|CBX27349.1| unknown protein [uncultured Desulfobacterium sp.] Length = 72 Score = 56.0 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 11/63 (17%), Positives = 26/63 (41%) Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + + + L + +PP+KEQ I + + + + I + +S+ + A Sbjct: 9 NFNKDQLSALTIPLPPMKEQKKIVEELVSLKKKSYEMETLQKSVIKDFESFQSALFSKAF 68 Query: 416 TGQ 418 G+ Sbjct: 69 RGE 71 >gi|227892229|ref|ZP_04010034.1| type I restriction modification system protein HsdIA [Lactobacillus salivarius ATCC 11741] gi|227865951|gb|EEJ73372.1| type I restriction modification system protein HsdIA [Lactobacillus salivarius ATCC 11741] Length = 188 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 57/170 (33%), Gaps = 3/170 (1%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 L+ + NT ++ N L + K E IV G+IV Sbjct: 1 MKLKELIKIESGVNTVRLKDNEYELYTLEDVNYDLGHGEDYKHEVSYRKNIVARGDIVTN 60 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSG 352 + +++ + I +D YL +L+ + K A MG Sbjct: 61 TVGNMTSIVHTKNSGKLLNQIFMK-LSINNKEILDPWYLCYLLNESEYIKYQEASIMGGS 119 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + L +++ L V +P I EQ I + ++ EK E L Sbjct: 120 VIKKLTKVNLENLEVNLPTIDEQRKIGEAYKETLRKYTLITEKAELEKNL 169 >gi|328946729|gb|EGG40867.1| restriction modification system S subunit [Streptococcus sanguinis SK1087] Length = 277 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/248 (12%), Positives = 71/248 (28%), Gaps = 23/248 (9%) Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 ++IT L + + G G P W+ + Sbjct: 40 KSIITFNFILPFSFCTLNHHLEQMAQAIFKSWFIDFDPFG----GEKPSDWKTANLTDIA 95 Query: 242 TE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + + E ++ L + Q + + L + + I+ G+++F + Sbjct: 96 EFLNGLAMQKYRPLDNEESLPVLKIKELRQGIFDSSSDLCSANIKRPYIIQDGDVIFSWS 155 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS---GL 353 L G + V D + + F A+ + Sbjct: 156 GSL-----LVDFWTGGIGGLNQHLFKVSSQEYDK--WFYYSWTKYYLDEFIAIAADKATT 208 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + +++ +L+P + I + A + L E R+S + Sbjct: 209 MGHITRKSLEKAEILIPNDHDYKSIG----LLLAPTYNQIISNRIENRKLMEVRNSLLPK 264 Query: 414 AVTGQIDL 421 ++G+I + Sbjct: 265 LLSGEISV 272 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 58/192 (30%), Gaps = 10/192 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G P WK + + G R ++ + + + ++++ G + Sbjct: 80 GEKPSDWKTANLTDIAEFLNGLAMQKYRPLDNEESLPVLKIKELRQG---IFDSSSDLCS 136 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 ++ I G +++ G L G + + ++ W Sbjct: 137 ANIKRPYIIQDGDVIFSWSGSLL-VDFWTGGIGGLNQHLFKVSSQEYDKWFYYSWTKYYL 195 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 A + TM H K + + IP + I + +I + E + + Sbjct: 196 DEFIAIAADKATTMGHITRKSLEKAEILIPNDHDYKSIGLLLAPTYNQIISNRIENRKLM 255 Query: 193 ELLKEKKQALVS 204 E+ L+S Sbjct: 256 EVRNSLLPKLLS 267 >gi|208434383|ref|YP_002266049.1| type I restriction enzyme S protein [Helicobacter pylori G27] gi|208432312|gb|ACI27183.1| type I restriction enzyme S protein [Helicobacter pylori G27] Length = 321 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 41/119 (34%), Gaps = 6/119 (5%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLA 334 + + G + D + + + Y ++ +L Sbjct: 8 NEINKFSLKKGYVAITKDSETKDDIGISTYIADNFDNVLLGYHCTLLKPNQKVLNGKFLN 67 Query: 335 WLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + S+ K F A GSG R +L + +K L + + I+ Q I ++V +I+ Sbjct: 68 AYLNSFYGRKYFSNCASGSGQRYTLTIDTIKDLNIPLINIETQQKIARTLSVLDQKIEN 126 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 56/185 (30%), Gaps = 5/185 (2%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 H + + KN KL + I + +++ + + + P Sbjct: 135 HTLAYKIYEYYFKYKPKNAKLEQIIIENPKSSIMVKNAQKTQDKYPFFTSGDNILSYPKA 194 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 I+ N + + + ++ + + S YL L+ S Sbjct: 195 IIDGRNCFLNTGGNAGIKFYVGKASYSTDTWCIGANEF-SDYLYLLLSSIKNHINQSFFQ 253 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + L+ +K+ P+ +P E +I L+ ++ L++ R Sbjct: 254 GTSLKHLQKNLLKKYPIYMPSAHEIKKFNQIIMPLL----TLISINTRTSKKLEQIRDFL 309 Query: 411 IAAAV 415 + + Sbjct: 310 LPLLL 314 >gi|145629352|ref|ZP_01785151.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae 22.1-21] gi|145638853|ref|ZP_01794461.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae PittII] gi|48243646|gb|AAT40787.1| putative type I restriction/modification specificity protein [Haemophilus influenzae] gi|144978855|gb|EDJ88578.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae 22.1-21] gi|145271825|gb|EDK11734.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae PittII] gi|309750834|gb|ADO80818.1| Type I restriction enzyme HindVIIP, S protein [Haemophilus influenzae R2866] Length = 437 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 53/196 (27%), Gaps = 15/196 (7%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 PK W+ + ++ G +S I I + V+ + D S Sbjct: 242 PKGWEKTTLSEICEMQNGYAFKSSDWMEQGIPVIKIGSVK--PMIVEVEGNGFVSEDYSK 299 Query: 78 VS---IFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPK-----DVLPELLQGWL 128 + + G IL G G I + + + PK + Sbjct: 300 LKPDFLLTSGDILVGLTGYVGEVGRIPTGKIAMLNQRVATFLPKEIDKNHCFYNYIYCLA 359 Query: 129 LSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + E +G+ ++ K + P+ +L ++ RI Sbjct: 360 RQSQFKEFAEINAKGSAQANISTKELLKFPIIKANDKLHILFENRVKELLERILWNSQNA 419 Query: 189 IRFIELLKEKKQALVS 204 + L++ Sbjct: 420 ETLAKTRDLLLPRLLN 435 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 44/448 (9%), Positives = 115/448 (25%), Gaps = 68/448 (15%) Query: 26 KVVPIKRFTKLNTGRTSES--GKDIIY---------IGLED-----------VESGTGKY 63 K+V +K TGR + ++ IY + + + + Sbjct: 3 KLVKLKEIVDFKTGRLDSNCAEENGIYPFFTCSPETLRINSYAFDCEAVLLAGNNANAVF 62 Query: 64 LPKDGNSRQSDTSTVSIFAKGQ----------ILYGKLGPYLRKAIIADFDGICSTQFLV 113 K + + + I + L + + + L Sbjct: 63 PVKYYSGKFNAYQRTYIITPKDKSKINVKWLYFQIKHVAFELGIRAVGSATKFLTKRILD 122 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 ++ Q ++ + + + + Sbjct: 123 DYEINLPDLDTQNYIARVLWKLENKIQLNTQINQTLEQIAQVLFKSWFVDFDPVRAKVQA 182 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVG----LVP 229 + +T E+ AL K E V P Sbjct: 183 LSEGMSLEQAELTAMQAISGKTPEELTALSQTQPDCYAELAETTKAFPCEMVEIDGVEAP 242 Query: 230 DHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKL-ETRNMGLKPESYET--- 282 WE + N K++ +E I + G++ + E G E Y Sbjct: 243 KGWEKTTLSEICEMQNGYAFKSSDWMEQGIPVIKIGSVKPMIVEVEGNGFVSEDYSKLKP 302 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYD 341 ++ G+I+ + + + ++ ++ P ID + + + Sbjct: 303 DFLLTSGDILVGLTGYVGEVGRIPTGKI---AMLNQRVATFLPKEIDKNHCFYNYIYCLA 359 Query: 342 LCKVFYAMG-----SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 F + ++ +++ + P++ + +++ + + E + Sbjct: 360 RQSQFKEFAEINAKGSAQANISTKELLKFPIIKAN--------DKLHILFE--NRVKELL 409 Query: 397 EQSI------VLLKERRSSFIAAAVTGQ 418 E+ + L + R + + G+ Sbjct: 410 ERILWNSQNAETLAKTRDLLLPRLLNGE 437 >gi|328471215|gb|EGF42117.1| hypothetical protein VP10329_03382 [Vibrio parahaemolyticus 10329] Length = 192 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 45/138 (32%), Gaps = 4/138 (2%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + V ++ FR N + + V+ + YL W + Sbjct: 55 DLKDHHRVKHNDLAFRSRGQTNTAALIDQELSDAVIAAPLLRIRVESDSVIPAYLCWFIN 114 Query: 339 SYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 V + +G + ++ L ++VP + Q I + + L+ + Sbjct: 115 QPTSQAVLQSKATGTAVRMIGKPALEDLEIVVPSLDVQKKIIEIYQLSINE-QKLMNALA 173 Query: 398 QSIVLLKERRSSFIAAAV 415 + +L + + + A+ Sbjct: 174 KKKEVLTD--AILMNLAM 189 >gi|307262528|ref|ZP_07544170.1| hypothetical protein appser12_20650 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306867763|gb|EFM99597.1| hypothetical protein appser12_20650 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 215 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/144 (22%), Positives = 54/144 (37%), Gaps = 5/144 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W++ + +T I +GL + + L Q+ + Sbjct: 70 EIPESWEIEKLGNIIFNLGQKTPNERFFYIDVGLINNKIHKLNSLENILEPDQAPSRARK 129 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDG----ICSTQFLVLQ-PKDVLPELLQGWLLSIDVT 134 I K ILY + PYL+ I + D I ST F+V+ + + L +LLS T Sbjct: 130 IVQKNSILYSTVRPYLQNICILEQDFQYEPIASTAFVVMNVFTNFYHKYLFYYLLSPVFT 189 Query: 135 QRIEAICEGATMSHADWKGIGNIP 158 + G + + N+P Sbjct: 190 DFVNQEMVGVAYPAINDDKLYNLP 213 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 51/146 (34%), Gaps = 4/146 (2%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE--SYETYQ 284 +P+ WE++ ++ L +K I N I KL + L+P+ + Sbjct: 70 EIPESWEIEKLGNIIFNLGQKTPNERFFYIDVGLINNKIHKLNSLENILEPDQAPSRARK 129 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLC 343 IV I++ + + I ++A++ + YL + + S Sbjct: 130 IVQKNSILYSTVRPYLQNICILEQDFQYEPIASTAFVVMNVFTNFYHKYLFYYLLSPVFT 189 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVL 368 G+ ++ + + LP+ Sbjct: 190 DFVNQEMVGVAYPAINDDKLYNLPIA 215 >gi|312870942|ref|ZP_07731047.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 3008A-a] gi|311093632|gb|EFQ51971.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 3008A-a] Length = 180 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 49/176 (27%), Gaps = 8/176 (4%) Query: 24 HWKVVPIKRFTKLNTGRTSESGK------DIIYIGLE-DVESGTGKYLPKDGNSRQSDTS 76 W+ V + K+ TG+T ++ +I ++ D+ + K + Sbjct: 3 EWEKVKVGDIGKVITGKTPKTSNSEYYGGNIPFLTPSDDMSVKYVRKTNKYITEIGRLSI 62 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQ 135 + I +G L K +I + + Q ++ D + + Sbjct: 63 KNATLPANAICVSCIGSDLGKVVITTQKTVTNQQINSIVVDTDKFDIDFVYYSMLELGKI 122 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + P L Q I + + +I+ Sbjct: 123 LNFHSKTSTAVPIVNKSSFSQYEIDCPKLNTQKKIGAILSSIDNKIEENNQINKNL 178 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 46/159 (28%), Gaps = 11/159 (6%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFRFIDLQNDKR 303 N++ NI L+ + + R I + I I K Sbjct: 25 NSEYYGGNIPFLTPSDDMSVKYVRKTNKYITEIGRLSIKNATLPANAICVSCIGSDLGKV 84 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 + + + + I S + V D ++ + M F++ S + Sbjct: 85 VITTQKTVTNQQINS--IVVDTDKFDIDFVYYSMLELGKILNFHSKTSTAVPIVNKSSFS 142 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + P + Q I +++ +I+ I Sbjct: 143 QYEIDCPKLNTQKKIGAILSSIDNKIEE-----NNQINK 176 >gi|229120552|ref|ZP_04249797.1| Type I restriction-modification system specificity subunit [Bacillus cereus 95/8201] gi|228662837|gb|EEL18432.1| Type I restriction-modification system specificity subunit [Bacillus cereus 95/8201] Length = 188 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 55/138 (39%), Gaps = 11/138 (7%) Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAW 335 +++ + G+++F F+ K + S + I + + ++ +DS+YL + Sbjct: 53 SSNHKDGYLSSAGDVIFSFVSS---KSGIVSELNQGKIISQNFAKLIIEHDDLDSSYLCY 109 Query: 336 LMR-SYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN---VINVETARID 390 ++ SY + K +M L +K L + +P I++Q I + Sbjct: 110 ILNESYSMRKQMAISMQGSNVPKLTPAILKELEIELPSIEKQRKIGKAYFFLRKRQTLAK 169 Query: 391 VLVEKIEQSIVLLKERRS 408 +E EQ LK R Sbjct: 170 KQIELEEQL--YLKALRQ 185 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 54/184 (29%), Gaps = 14/184 (7%) Query: 29 PIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++ + GR G + + L + +G+ +S S+ + Sbjct: 2 KLEDIVTVRVGRNLSRGNERNDLTLVAYSFEDLTNDLNGSFLDSQVSLHSGSSNHKDGYL 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQF---LVLQPKDVLPELLQGWLLSIDVTQRI 137 + G +++ + + I S F ++ L S + +++ Sbjct: 62 SSAGDVIFSFVSSKSGIVSELNQGKIISQNFAKLIIEHDDLDSSYLCYILNESYSMRKQM 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE---KIIAETVRIDTLITERIRFIEL 194 +G+ + + + + +P + +Q I + + I + Sbjct: 122 AISMQGSNVPKLTPAILKELEIELPSIEKQRKIGKAYFFLRKRQTLAKKQIELEEQLYLK 181 Query: 195 LKEK 198 + Sbjct: 182 ALRQ 185 >gi|145634364|ref|ZP_01790074.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae PittAA] gi|145268344|gb|EDK08338.1| putative type I restriction enzyme HindVIIP specificity protein [Haemophilus influenzae PittAA] Length = 430 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 58/185 (31%), Gaps = 9/185 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + F LVT+ + K E + ++ NI+ + I Sbjct: 5 EFIPASEFCDLVTDGTHDSPKKTEFGVKLVTSKNIVGGKLDLTSAYFISESDAQNINKRS 64 Query: 290 EIVFRFIDLQNDKRSLRSA-QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY- 347 ++ + L A E + +K + ++S + Sbjct: 65 QVHINDVLLSMIGTVGEVALIEKEPDFVIKNVGLLKNSDPKKAKWLYYLKSPIAQNLIKD 124 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKE--QFDITNVINVETARIDVLVEKIEQSIVLLKE 405 + +Q + +++ LP+L P +E Q I + + +D ++ Q L++ Sbjct: 125 RLRGTTQQYIPLGELRNLPILKPNSEEHLQNTI-----EQLSSLDKKIQLNTQINQTLEQ 179 Query: 406 RRSSF 410 + Sbjct: 180 IAQAL 184 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 58/438 (13%), Positives = 131/438 (29%), Gaps = 56/438 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQS--DTSTVS 79 + +P F L T T +S K + + +++ G S + + S Sbjct: 5 EFIPASEFCDLVTDGTHDSPKKTEFGVKLVTSKNIVGGKLDLTSAYFISESDAQNINKRS 64 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +L +G A+I +L+ D +L S I+ Sbjct: 65 QVHINDVLLSMIGTVGEVALIEKEPDFVIKNVGLLKNSDPKKAKWLYYLKSPIAQNLIKD 124 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G T + + N+P+ P E + I + +D I + + L++ Sbjct: 125 RLRGTTQQYIPLGELRNLPILKPNSEEHLQNT---IEQLSSLDKKIQLNTQINQTLEQIA 181 Query: 200 QALVS-------------YIVTKGLNPDVKMKDSGIEWVGLVPD------HWEVKPFFAL 240 QAL ++ GL+ + + G P+ + + L Sbjct: 182 QALFKSWFVDFDPVRAKVQALSDGLSLEQAELAAIQAISGKTPEELTALSQTQPDRYTEL 241 Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGEIVF------ 293 +++E + ++ G +++++ + + Y + G + Sbjct: 242 AETAKAFPCEMVEVDGGEVTKGWEVKRIDEVIQKIPVGKKYSSKTAFSEGLVPILDQGRS 301 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID------------STYLAWLMRSYD 341 I NDK ++++ + + ++ D + + + Sbjct: 302 GVIGYHNDKPGVKASIEDPIIVFANHTCYMRLISYDFSAIQNVFAFKGTECNLYWLYLAT 361 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 L K + G D ++VPP + ++I ++ Sbjct: 362 LGKQEFVEYKGHFP-----DFLIKEIIVPPEELTELFGKYAKENFSKIF----INDRENS 412 Query: 402 LLKERRSSFIAAAVTGQI 419 L + R + + G I Sbjct: 413 SLAKIRDLLLPKLLNGDI 430 >gi|327470623|gb|EGF16079.1| hypothetical protein HMPREF9386_0249 [Streptococcus sanguinis SK330] Length = 191 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 11/93 (11%), Positives = 37/93 (39%), Gaps = 4/93 (4%) Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 D+ Y ++M +++ + + + + ++ ++P I D + + Sbjct: 98 DNVYFWYVMLKKRQQEIYDSQTGSAQPHIYPKHIE----IMPTIDLSEDKVSRFTKQVTP 153 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + + I L+ R + + ++G+I + Sbjct: 154 LFESIGNNIKEIGELQTLRDTLLPKLLSGEISV 186 >gi|309800163|ref|ZP_07694349.1| type I restriction-modification system specificity subunit [Streptococcus infantis SK1302] gi|308116210|gb|EFO53700.1| type I restriction-modification system specificity subunit [Streptococcus infantis SK1302] Length = 136 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 62/143 (43%), Gaps = 10/143 (6%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + K + +I++ G ++ D K S+ + + I A+ + ++ Sbjct: 2 KKLQKKAIECSSAKIIEKGSLLLGMYDTAGLKSSINTKVMSCNQAI--AFAKLDDKITNT 59 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 Y+ +++++ L + G+ +++ +K + + +PP+ Q + + + A++ Sbjct: 60 IYVYYVIQN--LRSMLLNQQRGVRQKNFNLSMIKNIAIPLPPLSLQNEFADFV----AQV 113 Query: 390 DVLVEKIEQSIVLLKE-RRSSFI 411 D + +I L + +SS I Sbjct: 114 DKSQFACQMAIKLWRNSLKSSII 136 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 50/138 (36%), Gaps = 4/138 (2%) Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 K + + S+ I KG +L G K+ I C+ + D + + Sbjct: 2 KKLQKKAIECSSAKIIEKGSLLLGMYDTAGLKSSINTKVMSCNQAIAFAKLDDKITNTIY 61 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 + + ++ + G + + I NI +P+PPL+ Q + ++D Sbjct: 62 VYYVIQNLRSMLLNQQRGVRQKNFNLSMIKNIAIPLPPLSLQNEFADF----VAQVDKSQ 117 Query: 186 TERIRFIELLKEKKQALV 203 I+L + ++ + Sbjct: 118 FACQMAIKLWRNSLKSSI 135 >gi|296126598|ref|YP_003633850.1| restriction modification system DNA specificity domain protein [Brachyspira murdochii DSM 12563] gi|296018414|gb|ADG71651.1| restriction modification system DNA specificity domain protein [Brachyspira murdochii DSM 12563] Length = 1134 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/370 (11%), Positives = 106/370 (28%), Gaps = 50/370 (13%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + + + G++ K V++G + G + S + + I Sbjct: 800 LGAISSIVKGKSITKNK---------VKNGNIPVI-AGGKTSPYSHSEYNQ-NENCITVS 848 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 G ++ S ++ + + + I + G+ H Sbjct: 849 ASGS-AGYVWYHNYKIWASDCNVIRSLDEEKYITKYIYYSLKKLQDLIYDLKTGSNQPHV 907 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 K + I +P + +Q I + + I Sbjct: 908 YEKDLSKIKIPNLNIEKQKEIVSLMDEQENIILEQEKI---------------------- 945 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 I+ + + + + K I +L + + Sbjct: 946 ------------IKELNDKINSLDFVNYDKCKLSDKTKFQITIGKRVLQKNIKENGKYPI 993 Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 KP Y + D + F + D + + + + + V + Sbjct: 994 YSANVYKPFGYIDELLFDNFDFTFVLWGIDGD--WMTNYILPNNPFYPTDHCGVIKCIDN 1051 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 S + + ++++ Y LR S+ + +++L + +P + Q +I+N I +I Sbjct: 1052 SVNMIYFNYAFNIVGKEYGFNRNLRASI--DRIEKLQIPIPDLNIQNEISNTILDCKKQI 1109 Query: 390 DVLVEKIEQS 399 D KI+ + Sbjct: 1110 DQAQLKIDNA 1119 Score = 43.2 bits (100), Expect = 0.076, Method: Composition-based stats. Identities = 13/143 (9%), Positives = 47/143 (32%), Gaps = 4/143 (2%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 K++ N+ + + I + + + + + Sbjct: 814 KNKVKNGNIPVIAGGKTSPYSHSEYNQNENCITVSASGSAGYVWYHNYKIWASDCNVIRS 873 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 L + Y + +G + + +D+ ++ + I++Q +I ++++ Sbjct: 874 LDEEKYITKYIYYSLKKLQDLIYDLKTGSNQPHVYEKDLSKIKIPNLNIEKQKEIVSLMD 933 Query: 384 VETARI---DVLVEKIEQSIVLL 403 + I + +++++ I L Sbjct: 934 EQENIILEQEKIIKELNDKINSL 956 >gi|119715344|ref|YP_922309.1| restriction modification system DNA specificity subunit [Nocardioides sp. JS614] gi|119536005|gb|ABL80622.1| restriction modification system DNA specificity domain [Nocardioides sp. JS614] Length = 161 Score = 55.6 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 15/141 (10%), Positives = 45/141 (31%), Gaps = 15/141 (10%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + V + + S + + + + ++ L+R+ Sbjct: 23 FHNVSNRDGETVVVARSGAYAGFVSYWRGPIFLTDAFSVHPHDGVLMPRFVFHLLRARQA 82 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEK 395 + G+G+ ++ +DV+ V VPP+ Q + +++ A ++ + + Sbjct: 83 QLHAFKAGAGV-PHVRVKDVESYEVPVPPLDVQARVVEILDKFDALVNDVSVGLPAEIAA 141 Query: 396 IEQSIVLLKERRSSFIAAAVT 416 + + +T Sbjct: 142 RRKQYEYYR-------HKLLT 155 >gi|305431923|ref|ZP_07401090.1| type II restriction-modification enzyme [Campylobacter coli JV20] gi|304445007|gb|EFM37653.1| type II restriction-modification enzyme [Campylobacter coli JV20] Length = 737 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 17/164 (10%), Positives = 52/164 (31%), Gaps = 7/164 (4%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E Y + + + + IV +I+ K ++ + + Sbjct: 567 EHIDNKSGYIKLDNPKYVPIEFYESFALQDKGIVKQFDILICKDGALTGKIAMVRNEFIR 626 Query: 313 R--GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 + I ++ + YL +++ SY + + +G + + +++ + + Sbjct: 627 KSAMINEHIFLLRCDNIAKQKYLFYILHSYSGQQALKSKITGSAQGGINKTNLESILIPN 686 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + Q I E +++ I S+ + + + Sbjct: 687 ADFEIQKQIV----AECEKVEEQYNTIRMSVEEYQNLIKTILQK 726 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 61/193 (31%), Gaps = 18/193 (9%) Query: 26 KVVPIKRF------TKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---- 75 ++V +K F K +G + + +G E +++ +G + + Sbjct: 533 ELVRLKDFVLDIQTAKRPSGGVGKYENGALSLGGEHIDNKSGYIKLDNPKYVPIEFYESF 592 Query: 76 --STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP------KDVLPELLQGW 127 I + IL K G K + + I + + + L Sbjct: 593 ALQDKGIVKQFDILICKDGALTGKIAMVRNEFIRKSAMINEHIFLLRCDNIAKQKYLFYI 652 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S Q +++ G+ + + +I +P Q I + + +T+ Sbjct: 653 LHSYSGQQALKSKITGSAQGGINKTNLESILIPNADFEIQKQIVAECEKVEEQYNTIRMS 712 Query: 188 RIRFIELLKEKKQ 200 + L+K Q Sbjct: 713 VEEYQNLIKTILQ 725 >gi|295135947|ref|YP_003586623.1| hypothetical protein ZPR_4123 [Zunongwangia profunda SM-A87] gi|294983962|gb|ADF54427.1| hypothetical protein ZPR_4123 [Zunongwangia profunda SM-A87] Length = 46 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 17/33 (51%), Positives = 21/33 (63%) Query: 5 KAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLN 37 K YP YKDSGV W+G IPKHW++ + K Sbjct: 2 KTYPAYKDSGVDWLGKIPKHWEIRRLGSRFKER 34 >gi|312874506|ref|ZP_07734532.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2053A-b] gi|311089968|gb|EFQ48386.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2053A-b] Length = 190 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 12/129 (9%), Positives = 39/129 (30%), Gaps = 13/129 (10%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + + I ++ + + + + L + Sbjct: 67 VSCRGAASGNIIETYPNSFITNNSLVLEWNDYRYYEFYKQFLFANPLHTY---ATGSAQP 123 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI---VLLKERRSSFIA 412 + +++K +P P E I +++ + ++I L R + + Sbjct: 124 QITIDNIKNVPFPCPKYDE-------IRELCSQLKSISALHFENIVESNKLSMLRDTLLP 176 Query: 413 AAVTGQIDL 421 ++G++D+ Sbjct: 177 KLISGELDV 185 >gi|219855948|ref|YP_002473070.1| hypothetical protein CKR_2605 [Clostridium kluyveri NBRC 12016] gi|219569672|dbj|BAH07656.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 478 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 52/401 (12%), Positives = 116/401 (28%), Gaps = 43/401 (10%) Query: 46 KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD- 104 K+ I L ++E G G D N S I I++ +L ++ + + Sbjct: 64 KEYQLIDLANIEPGIGFLNDLDKNIVSEIGSDKIILDGADIVFSRLNSHIGYVFLMEDIP 123 Query: 105 -----GICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAICEGATMSH--ADWKGIG 155 I ST+F L+ + +LL+ +LL + ++ I G + SH + Sbjct: 124 NSKISVIGSTEFFPLKVDNTTIPSKLLKYYLLHREFRKKAIFIRTGKSQSHPRIQVEDFM 183 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL------VSYIVTK 209 PI P + + KI I E +++ + Sbjct: 184 RFKFPILPQKVSIELIRKINIFEDEIKKKKLEYESLQNIIESVFLKYDIKKPSLDENFHI 243 Query: 210 GLNPD------VKMKDSGIEWV----------------GLVPDHWEVKPFFALVTELNRK 247 + P + G E++ P + + +K Sbjct: 244 KIKPMLSNIANQRYMRIGAEYMSFWMLRKGCLFQSEDKNKYPIIPMKRLIRKYNATVIKK 303 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 ++ + + + E + + + + Sbjct: 304 GLMTDTRILVEFEHIQSLNGKIENLSNVVTEVGSDKIEFGNADFLTNKLRPYLGYTIINP 363 Query: 308 AQVMERGI--ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + G + K + + + S L + M + D+ + Sbjct: 364 KHLNIIGTTEFIPFSIINKLNTSVNYIRYVFLSSEYLKQSKLLMSGKEHPRINISDILNI 423 Query: 366 PVLVPPIKEQFDITNVI---NVETARIDVLVEKIEQSIVLL 403 + +P + Q +I I +++A+I ++ I + I + Sbjct: 424 RIPLPKLTIQHNIVKEILQRELKSAKILKEIKVIREKIDNI 464 >gi|253569683|ref|ZP_04847092.1| restriction modification system DNA specificity subunit [Bacteroides sp. 1_1_6] gi|251840064|gb|EES68146.1| restriction modification system DNA specificity subunit [Bacteroides sp. 1_1_6] Length = 194 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 33/146 (22%), Positives = 59/146 (40%), Gaps = 8/146 (5%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K S G I + ++ + ES Y+IV G+ V Q Sbjct: 31 KKLAYKNVLSASQELGMIERSNINIDIKFEQESISGYKIVRKGDYVVHLRSFQG-----G 85 Query: 307 SAQVMERGIITSAYMAVKPHGIDST-YLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVK 363 A GI + AY ++P+ + YL+ S K + G+R +S+ ++ Sbjct: 86 FAFSDTTGICSPAYTILRPNDLVVYGYLSHFFTSKPFIKSLKLVTYGIRDGRSINVDEWL 145 Query: 364 RLPVLVPPIKEQFDITNVINVETARI 389 +P+L+P +EQ I ++N A++ Sbjct: 146 DMPILLPSAQEQMRILTIVNAIDAKL 171 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 21/161 (13%), Positives = 51/161 (31%), Gaps = 3/161 (1%) Query: 33 FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLG 92 ++ + + + +++ + D Q S I KG + L Sbjct: 22 LFEVVNEKNKKLAYKNVLSASQELGMIERSNINIDIKFEQESISGYKIVRKGDYVV-HLR 80 Query: 93 PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM--SHAD 150 + +D GICS + +L+P D++ + + + + Sbjct: 81 SFQGGFAFSDTTGICSPAYTILRPNDLVVYGYLSHFFTSKPFIKSLKLVTYGIRDGRSIN 140 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 ++P+ +P EQ+ I + A ++ + Sbjct: 141 VDEWLDMPILLPSAQEQMRILTIVNAIDAKLHNEAKVQFCL 181 >gi|121608003|ref|YP_995810.1| restriction modification system DNA specificity subunit [Verminephrobacter eiseniae EF01-2] gi|121552643|gb|ABM56792.1| restriction modification system DNA specificity domain [Verminephrobacter eiseniae EF01-2] Length = 575 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 43/138 (31%), Gaps = 4/138 (2%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + + + E +QIV ++ + R+ + + + Sbjct: 408 WHLNLSSVKQVVIDQSELERFQIVRGDLLITEGGNRDKVGRTAIWRDELPVCLHQNHVFR 467 Query: 323 VK--PHGIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDI 378 V+ + + + S F A S+ ++ VPP+ EQ I Sbjct: 468 VRGTSPDWNPVWAELYLNSVTARAYFAAASKQTTNLASINMTQLRLCAFPVPPLVEQARI 527 Query: 379 TNVINVETARIDVLVEKI 396 + + + L +++ Sbjct: 528 VSRVEALRSLCADLRQRL 545 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 59/188 (31%), Gaps = 16/188 (8%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K + E +P WE AL+ K + + + ++G Sbjct: 70 KIAQHEKPFALPPGWEWVRLGALLPFRIGKTPASEDPQYWDQEGYAWVSISDMAHLGEVF 129 Query: 278 ESYET----------YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 ++ Y+ + G ++ F LR I++ + G Sbjct: 130 DTQRKLTARGAQVFGYEPLPVGTLIMSFKLTIGKISVLRVPAYHNEAIVS----LMPLCG 185 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + +L +++ + V G +L + + L + +PP EQ I + Sbjct: 186 LVTDFLKYMLPTVSKTGVSKEALMGT--TLNTQSLSNLLIALPPAVEQSRIVARVEELMR 243 Query: 388 RIDVLVEK 395 D L + Sbjct: 244 LCDTLEAR 251 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 57/197 (28%), Gaps = 14/197 (7%) Query: 22 PKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 P W+ + +G + + Y+ + +V+ Sbjct: 365 PPGWEWARFGDVAAITSGVILGRKAAISAPVLLPYLRVANVQRWHLNLSSVKQVVIDQSE 424 Query: 76 STVSIFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 +G +L + G R AI D +C Q V + + P+ W Sbjct: 425 LERFQIVRGDLLITEGGNRDKVGRTAIWRDELPVCLHQNHVFRVRGTSPDWNPVWAELYL 484 Query: 133 VTQRIEAIC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + A + ++ + + P+PPL EQ I ++ A L Sbjct: 485 NSVTARAYFAAASKQTTNLASINMTQLRLCAFPVPPLVEQARIVSRVEALRSLCADLRQR 544 Query: 188 RIRFIELLKEKKQALVS 204 + +AL+ Sbjct: 545 LSASQTVQTHLAEALLE 561 Score = 43.6 bits (101), Expect = 0.061, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 66/222 (29%), Gaps = 15/222 (6%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESG-------KDIIYIGLEDV-ESGTGKYLPKDGNSRQ 72 +P W+ V + G+T S + ++ + D+ G + +R Sbjct: 80 LPPGWEWVRLGALLPFRIGKTPASEDPQYWDQEGYAWVSISDMAHLGEVFDTQRKLTARG 139 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G ++ + K + + + L P L ++L Sbjct: 140 AQVFGYEPLPVGTLIMS-FKLTIGKISVLRVPAYHNEAIVSLMPLCGLVTDFLKYMLPTV 198 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + T + + + N+ + +PP EQ I ++ DTL Sbjct: 199 SKTGVSKEALMGT--TLNTQSLSNLLIALPPAVEQSRIVARVEELMRLCDTLEARGPLEA 256 Query: 193 ELLKEKKQALVSYI----VTKGLNPDVKMKDSGIEWVGLVPD 230 L+ + + L+ + + + + P+ Sbjct: 257 AQHARLVDTLLGTLTGSNTPQELSAHWQRVRTHFDLLFDRPE 298 >gi|323699620|ref|ZP_08111532.1| restriction modification system DNA specificity domain [Desulfovibrio sp. ND132] gi|323459552|gb|EGB15417.1| restriction modification system DNA specificity domain [Desulfovibrio desulfuricans ND132] Length = 257 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 45/121 (37%), Gaps = 4/121 (3%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 ++ G+I+F ++ + + P + YLAW + + Sbjct: 126 FLEEGDILFVNRGMRFFGALVDKPLEKAVAAPHFFIIKANPALVRPDYLAWFLNGKQAQR 185 Query: 345 VFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV-ETARIDVLVEKIEQSIVL 402 + +G + + ++ LPV VP ++ Q I V +I L E+I + L Sbjct: 186 YYGQCAAGTALPHITRKTLEALPVPVPSLERQALIAKVYQCGLQEKI--LTERIVEQREL 243 Query: 403 L 403 L Sbjct: 244 L 244 >gi|321310231|ref|YP_004192560.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802075|emb|CBY92721.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 202 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 19/172 (11%), Positives = 56/172 (32%), Gaps = 11/172 (6%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM--GLKPESYETYQIVDPGEIVFRF 295 + T K + ++ + NI T + P ++ ++ G+IV Sbjct: 20 CEIQTGFGVKTSFYRDNGFPIIKGENIHGGQITTDNLSYCNPNNHPNAPVIKYGDIVIV- 78 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-R 354 + ++ + P + + + G + Sbjct: 79 --SHGCPGKVGINLTDREFFFSNNVHKLIPDETVLIKKYLYHCLLNKQEEIKGLAKGSSQ 136 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + +++L + + ++ Q I ++ + L ++++Q + LL++R Sbjct: 137 PFVGKSVMRKLKIPIYCLETQTKIVETLD----KFQELKQELKQEL-LLRKR 183 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 48/159 (30%), Gaps = 6/159 (3%) Query: 30 IKRFTKLNTG----RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + ++ TG + I E++ G ++ + G Sbjct: 16 LGDVCEIQTGFGVKTSFYRDNGFPIIKGENIHGGQIT-TDNLSYCNPNNHPNAPVIKYGD 74 Query: 86 ILYGKLGPYLRK-AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I+ G + + D + S L P + + + ++ + I+ + +G+ Sbjct: 75 IVIVSHGCPGKVGINLTDREFFFSNNVHKLIPDETVLIKKYLYHCLLNKQEEIKGLAKGS 134 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + + +PI L Q I E + Sbjct: 135 SQPFVGKSVMRKLKIPIYCLETQTKIVETLDKFQELKQE 173 >gi|298375963|ref|ZP_06985919.1| N-6 DNA methylase [Bacteroides sp. 3_1_19] gi|298267000|gb|EFI08657.1| N-6 DNA methylase [Bacteroides sp. 3_1_19] Length = 837 Score = 55.6 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 50/125 (40%), Gaps = 5/125 (4%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMR 338 + V G+ + ID ++ + + + E I+T ++ I YL ++ Sbjct: 706 KRQTRVKGGQFIISKIDGKSAAFGIVDSSL-EGAIVTPDFLVYDIDTTQILPEYLELVLT 764 Query: 339 SYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + + F SG R+ L + + + +P I EQ ++ I L E++ Sbjct: 765 NDAILNQFSISSSGTTGRRRLSQKVFENTLIALPSIDEQRNLLAKILEIRETQKSLEEQM 824 Query: 397 EQSIV 401 ++SI Sbjct: 825 QKSIE 829 >gi|153955554|ref|YP_001396319.1| Type I specificity subunit-related protein [Clostridium kluyveri DSM 555] gi|146348412|gb|EDK34948.1| Type I specificity subunit-related protein [Clostridium kluyveri DSM 555] Length = 476 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 52/401 (12%), Positives = 116/401 (28%), Gaps = 43/401 (10%) Query: 46 KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD- 104 K+ I L ++E G G D N S I I++ +L ++ + + Sbjct: 62 KEYQLIDLANIEPGIGFLNDLDKNIVSEIGSDKIILDGADIVFSRLNSHIGYVFLMEDIP 121 Query: 105 -----GICSTQFLVLQPKDVL--PELLQGWLLSIDVTQRIEAICEGATMSH--ADWKGIG 155 I ST+F L+ + +LL+ +LL + ++ I G + SH + Sbjct: 122 NSKISVIGSTEFFPLKVDNTTIPSKLLKYYLLHREFRKKAIFIRTGKSQSHPRIQVEDFM 181 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL------VSYIVTK 209 PI P + + KI I E +++ + Sbjct: 182 RFKFPILPQKVSIELIRKINIFEDEIKKKKLEYESLQNIIESVFLKYDIKKPSLDENFHI 241 Query: 210 GLNPD------VKMKDSGIEWV----------------GLVPDHWEVKPFFALVTELNRK 247 + P + G E++ P + + +K Sbjct: 242 KIKPMLSNIANQRYMRIGAEYMSFWMLRKGCLFQSEDKNKYPIIPMKRLIRKYNATVIKK 301 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 ++ + + + E + + + + Sbjct: 302 GLMTDTRILVEFEHIQSLNGKIENLSNVVTEVGSDKIEFGNADFLTNKLRPYLGYTIINP 361 Query: 308 AQVMERGI--ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + G + K + + + S L + M + D+ + Sbjct: 362 KHLNIIGTTEFIPFSIINKLNTSVNYIRYVFLSSEYLKQSKLLMSGKEHPRINISDILNI 421 Query: 366 PVLVPPIKEQFDITNVI---NVETARIDVLVEKIEQSIVLL 403 + +P + Q +I I +++A+I ++ I + I + Sbjct: 422 RIPLPKLTIQHNIVKEILQRELKSAKILKEIKVIREKIDNI 462 >gi|241895462|ref|ZP_04782758.1| conserved hypothetical protein [Weissella paramesenteroides ATCC 33313] gi|241871436|gb|EER75187.1| conserved hypothetical protein [Weissella paramesenteroides ATCC 33313] Length = 145 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/151 (13%), Positives = 49/151 (32%), Gaps = 7/151 (4%) Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + I + N + + +SY + +I+F + S+ Sbjct: 1 MGNVNFIKVENLSNNQIYPVQKISQEEHDSYLKRSRLQANDILFSIAGTLGRIAIVGSSL 60 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQSLKFEDVKRLPVL 368 + A ++ + DS +L + + + + G + +L E V L + Sbjct: 61 LPAN--TNQALSIIRGYDFDSDFLITSLSGHVVAEYIRKNPTVGAQPNLSLEQVGNLIIS 118 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQS 399 P +EQ I + ++ L+ + Sbjct: 119 SPIEEEQEKIGSF----FKLLNHLITVNQDK 145 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 43/137 (31%), Gaps = 2/137 (1%) Query: 47 DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF--D 104 ++ +I +E++ + + K S IL+ G R AI+ Sbjct: 3 NVNFIKVENLSNNQIYPVQKISQEEHDSYLKRSRLQANDILFSIAGTLGRIAIVGSSLLP 62 Query: 105 GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPL 164 + +++ D + L L V + I + + +GN+ + P Sbjct: 63 ANTNQALSIIRGYDFDSDFLITSLSGHVVAEYIRKNPTVGAQPNLSLEQVGNLIISSPIE 122 Query: 165 AEQVLIREKIIAETVRI 181 EQ I I Sbjct: 123 EEQEKIGSFFKLLNHLI 139 >gi|240047296|ref|YP_002960684.1| hypothetical protein MCJ_001680 [Mycoplasma conjunctivae HRC/581] gi|239984868|emb|CAT04861.1| PUTATIVE Uncharacterized protein MJ1218 [Mycoplasma conjunctivae] Length = 136 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 37/106 (34%), Gaps = 6/106 (5%) Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 ++ + D ++ L+ SY + + + + F++ + Sbjct: 34 FLPTNTAFCSTMSALTSKNNFDIYFIYSLLSSYFPIESI--ISGTTIKHIYFKNYGQFEY 91 Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 VP IKEQ I ID L+ E + ++ +++ + Sbjct: 92 FVPSIKEQEKIA----KVFKNIDNLLNLYELKLQKIEMIKTTLLNK 133 >gi|256826768|ref|YP_003150727.1| hypothetical protein Ccur_03180 [Cryptobacterium curtum DSM 15641] gi|256582911|gb|ACU94045.1| hypothetical protein Ccur_03180 [Cryptobacterium curtum DSM 15641] Length = 159 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/143 (22%), Positives = 58/143 (40%), Gaps = 13/143 (9%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I E+ S Y++V G+IV+ + + GI++ AY+ Sbjct: 24 NGIYPASESDRDTNPGASINNYKVVRIGDIVYNSMRMWQGAVG----SSRYNGIVSPAYV 79 Query: 322 AVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPP-IKEQF 376 V+P DST +L++ + + G Q+LK+E + +P I+EQ Sbjct: 80 VVRPRMKLDSTCFGYLLKRPGMLYKYLCDSQGNSKDTQTLKYERFAEIDADIPSTIEEQR 139 Query: 377 DITNVINVETARIDVLVEKIEQS 399 I+N R+D L+ ++ Sbjct: 140 SISNY----FMRLDDLITLHQRK 158 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 53/155 (34%), Gaps = 8/155 (5%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 + + R++ ++I+ + + + + + + + + G I+Y Sbjct: 2 GELFEESDLRSAT--EEILSVSVANGIYPASE--SDRDTNPGASINNYKVVRIGDIVYNS 57 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW---LLSIDVTQRIEAICEGATMS 147 + + + ++GI S ++V++P+ L G+ + ++ Sbjct: 58 MRMWQGAVGSSRYNGIVSPAYVVVRPRMKLDSTCFGYLLKRPGMLYKYLCDSQGNSKDTQ 117 Query: 148 HADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRI 181 ++ I IP + EQ I + I Sbjct: 118 TLKYERFAEIDADIPSTIEEQRSISNYFMRLDDLI 152 >gi|291514834|emb|CBK64044.1| Restriction endonuclease S subunits [Alistipes shahii WAL 8301] Length = 188 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/144 (12%), Positives = 49/144 (34%), Gaps = 11/144 (7%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK---PHGIDS 330 S ++ +++ +N + + + + +++ ++ P I Sbjct: 45 TTTVSSKAARHLLTESDLLLAAKGGKNF--CAIAPTQLGPCVASPSFLIIRIDDPTRILP 102 Query: 331 TYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFD-IT-NVINVETA 387 YL + ++ A G SL D++ + +PP++ Q I ++ Sbjct: 103 EYLCGFLNLPSTRQLLTAQAQGSAIASLSKADLEEFEIPLPPLERQRACIALTRLHRREQ 162 Query: 388 RIDVLVEKIEQSI---VLLKERRS 408 + + + + I L K + Sbjct: 163 ALYKAIAERRRQITDYKLTKIYKD 186 Score = 39.8 bits (91), Expect = 0.87, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 56/152 (36%), Gaps = 7/152 (4%) Query: 28 VPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 V +K + TG S D Y+ + D + + S + + + Sbjct: 2 VKLKDIATIQTGVYLKSTPSPDTCYLQVNDFDEEGNIRPTVRPTTTVSSKAARHLLTESD 61 Query: 86 ILYGKLGPYLRKAIIADFDGIC--STQFLVLQ---PKDVLPELLQGWLLSIDVTQRIEAI 140 +L G AI G C S FL+++ P +LPE L G+L Q + A Sbjct: 62 LLLAAKGGKNFCAIAPTQLGPCVASPSFLIIRIDDPTRILPEYLCGFLNLPSTRQLLTAQ 121 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 +G+ ++ + +P+PPL Q Sbjct: 122 AQGSAIASLSKADLEEFEIPLPPLERQRACIA 153 >gi|323143704|ref|ZP_08078375.1| hypothetical protein HMPREF9444_01006 [Succinatimonas hippei YIT 12066] gi|322416537|gb|EFY07200.1| hypothetical protein HMPREF9444_01006 [Succinatimonas hippei YIT 12066] Length = 132 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 34/103 (33%), Gaps = 1/103 (0%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I + + + + I + S L K G+ Sbjct: 31 ITCKGTVGKIAINSIGKVHIARQLMAIKVNDNLISNQFMELFLQ-RQIKTIEQKARGIIA 89 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 +K +D+ + +PPI+EQ I IN + D +E + + Sbjct: 90 GIKRQDILNIKTPLPPIEEQHRIVAKINEIFSFCDKAMELLHK 132 >gi|295101279|emb|CBK98824.1| Restriction endonuclease S subunits [Faecalibacterium prausnitzii L2-6] Length = 187 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 52/156 (33%), Gaps = 8/156 (5%) Query: 242 TELNRKNTKLIESNILSLSYGN----IIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 N K + ++S + ++ N + + R + K G+IV Sbjct: 20 FGSNIKVSCFVDSGVPVINGSNLEGFSLSEKTFRYVTRKKADSLNKANAHRGDIVITHRG 79 Query: 298 LQNDKRSLRSAQVMERGIITSA-YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQ 355 + +R +I+ + + + YL + + + S + Sbjct: 80 TLGQIVFIPQDSKYDRYVISQSQFRVRCNDKVLPEYLVYYFHTPIGQHKLLSNASQVGVP 139 Query: 356 SL--KFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +L +++ +++P + Q + +I+ +I Sbjct: 140 ALARPSSTFQQIEIVLPELSIQKCVVEIISTIQKKI 175 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 56/182 (30%), Gaps = 16/182 (8%) Query: 26 KVVPIKRFT-KLNTGRTSES-------GKDIIYIGLEDVESGT-GKYLPKDGNSRQSDTS 76 + I ++ G + + I ++E + + + +++D+ Sbjct: 4 ETYRIADLIDEIAMGPFGSNIKVSCFVDSGVPVINGSNLEGFSLSEKTFRYVTRKKADSL 63 Query: 77 TVSIFAKGQILYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + +G I+ G + I D I +QF V VLPE L + + Sbjct: 64 NKANAHRGDIVITHRGTLGQIVFIPQDSKYDRYVISQSQFRVRCNDKVLPEYLVYYFHTP 123 Query: 132 DVTQRIEAICEGATMSHA--DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 ++ + + I + +P L+ Q + E I +I Sbjct: 124 IGQHKLLSNASQVGVPALARPSSTFQQIEIVLPELSIQKCVVEIISTIQKKIVNNQELND 183 Query: 190 RF 191 Sbjct: 184 NL 185 >gi|317490793|ref|ZP_07949234.1| type I site-specific deoxyribonuclease chain S [Eggerthella sp. 1_3_56FAA] gi|316910105|gb|EFV31773.1| type I site-specific deoxyribonuclease chain S [Eggerthella sp. 1_3_56FAA] Length = 186 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 49/177 (27%), Gaps = 14/177 (7%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL---SYGNIIQKLETRNMGL 275 E +P+ WE + T + R + N Sbjct: 10 CIDDEIPFDIPEGWEWARLEGITTYIQRGKSPKYSLEKKYPVVAQKCNQWSGFSLERAKF 69 Query: 276 KP----ESYETYQIVDPGEIVFRFIDLQN---DKRSLRSAQVMERGIITSAY--MAVKPH 326 SY +++ G++++ L + + S + P Sbjct: 70 VDPNSVASYAEERLLVDGDLLWNSTGLGTLGRMAVYDSNQNPYGWAVADSHVTVIRTVPD 129 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + Y + V SG ++ L E VKR + VPP+ EQ I Sbjct: 130 WLRYEYAFLYFAGPSVQSVIEDQASGSTKQKELAQETVKRYLIPVPPLAEQRRIAER 186 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 40/124 (32%), Gaps = 14/124 (11%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP+ W+ ++ T + G++ + K + + +G L + + + Sbjct: 18 DIPEGWEWARLEGITTYIQRGKSPKYSLEKKYPVVAQKC-NQWSGFSLERAKFVDPNSVA 76 Query: 77 TV---SIFAKGQILYGKLG-PYLRKAIIAD------FDGICSTQFLVLQPKDVLPELLQG 126 + + G +L+ G L + + D + + V++ Sbjct: 77 SYAEERLLVDGDLLWNSTGLGTLGRMAVYDSNQNPYGWAVADSHVTVIRTVPDWLRYEYA 136 Query: 127 WLLS 130 +L Sbjct: 137 FLYF 140 >gi|317483931|ref|ZP_07942868.1| hypothetical protein HMPREF0179_00217 [Bilophila wadsworthia 3_1_6] gi|316924805|gb|EFV45954.1| hypothetical protein HMPREF0179_00217 [Bilophila wadsworthia 3_1_6] Length = 524 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 49/121 (40%), Gaps = 3/121 (2%) Query: 285 IVDPGEIVF-RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + G+++ LQ+ + Q + + + + ++ ID +L +RS Sbjct: 389 RLREGDLLLTCKGSLQSLGKVGIVTQCGDNWLPSQTFYLIRTECIDPIWLFHYLRSPRAL 448 Query: 344 KVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + SG ++ D+ LP+ +P +E + ++ + ++ ++K+ + Sbjct: 449 NYLRSNISGTSIPQIRVADIAALPIPIPN-EEMLASVHAVHRQALKLLQKIDKLRDELDG 507 Query: 403 L 403 L Sbjct: 508 L 508 >gi|303260418|ref|ZP_07346387.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP-BS293] gi|303265064|ref|ZP_07350978.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS397] gi|302638453|gb|EFL68919.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP-BS293] gi|302645424|gb|EFL75657.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS397] Length = 193 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 14/133 (10%), Positives = 44/133 (33%), Gaps = 7/133 (5%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITS 318 + E +N+ + + V+ G+++ ++ A + + Sbjct: 41 SYDYFNSSEVKNLPIDYIPLDE-HKVEIGDVIISRMNTSELVGAAGYVWAINSDNIYLPD 99 Query: 319 AYMAVKPHGIDSTYLAWLM----RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 V + + W + ++ K + SG +++ + ++ V PP+ Sbjct: 100 RLWKVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRVPFPPLAL 159 Query: 375 QFDITNVINVETA 387 Q + + + + Sbjct: 160 QNEFADFVALVDK 172 >gi|299144871|ref|ZP_07037939.1| type I restriction-modification system specificity subunit [Bacteroides sp. 3_1_23] gi|298515362|gb|EFI39243.1| type I restriction-modification system specificity subunit [Bacteroides sp. 3_1_23] Length = 240 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/208 (14%), Positives = 64/208 (30%), Gaps = 20/208 (9%) Query: 10 YKDSGVQ--WIG----AIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVE 57 YK SG + W IP W+V+P+ + G + +E ++ + + D+ Sbjct: 34 YKSSGGEMVWNEKLKREIPIDWEVLPLFDAVSVQYGFPFATEQFTEEETNVPVVRIRDIL 93 Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 GT + +L G G D + + + L+ Sbjct: 94 EGT------TSAYSLEKADEKYHLNENDVLVGMDG-NFHMNFWHDNIAYLNQRCVRLRAH 146 Query: 118 DVLP-ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 +Q + E +G+T+ H K + + + P R+ + Sbjct: 147 SDSTISSIQILHSIKPYIKAKEQNAKGSTVGHLSDKDLKGLYLIKPLKTRAFNPRKTLDG 206 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVS 204 + + + + E L++ Sbjct: 207 LLALVIENKKQILSLTKQRDELLPLLIN 234 >gi|46143839|ref|ZP_00204580.1| COG0732: Restriction endonuclease S subunits [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|126207776|ref|YP_001053001.1| putative Type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae L20] gi|126096568|gb|ABN73396.1| putative Type I restriction enzyme EcoR124II specificity protein [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 364 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 44/411 (10%), Positives = 119/411 (28%), Gaps = 68/411 (16%) Query: 11 KDSGVQWIGAIPKHWKVVPIKRFTKLNT-----GRTSESGKDIIYIGLEDVESGTGKYLP 65 KD V+W + K +++ D L ++ Y Sbjct: 8 KDCEVEW----------KSLGEVAKYEQPTKYLVKSTNYNDDFNTPVLTAGKTFILGYTD 57 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + ++ + IF + DFD + + + + Sbjct: 58 EIDGIYPAKSNPIIIF----------DDFTTANKWVDFDFKVKSSAMKMITSSDENKFSL 107 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 ++ T +E + +++ +PIPPL Q I + + T Sbjct: 108 KYIYYWLNTLPMEDNTDHKRQWISNFAN---KKIPIPPLEIQEKIVKTLDIFTKL----- 159 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 + L ++ + ++T + + D E + Sbjct: 160 ---EAELSLRVKQYDYYRNELLTFDDDVEFITLDKISENLN--------------SMRKP 202 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K+ + I I+ +E E I + G + + Sbjct: 203 IKSGLREKGRIPYYGASGIVDYVEDYIF-----DDEILLISEDGANLIARNTP------I 251 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + + + A++ ++ ++ + + + DL + L +++ ++ Sbjct: 252 AFSVLGKCWVNNHAHVLKFKTDVERKFVEFYLNNLDLSPFI---SGAAQPKLNKQNLNKI 308 Query: 366 PVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 P+ Q I ++++ + + + + + I L ++ R + Sbjct: 309 PIPNITFATQQKIVDILDKFDRLPNSISDGLPKEIELRRKQYEYYRERLLN 359 >gi|229548134|ref|ZP_04436859.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200] gi|229306735|gb|EEN72731.1| conserved hypothetical protein [Enterococcus faecalis ATCC 29200] Length = 202 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/216 (10%), Positives = 61/216 (28%), Gaps = 20/216 (9%) Query: 197 EKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNI 256 E K+A + + K++ + E + + + + + + Sbjct: 3 ELKKAYLQLMFPTKEERVPKLRFADFEGEWELCKLIGILDIIKGTQKSKSELSTNQNNCT 62 Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 Y I N+ + + + V E+ Sbjct: 63 PYPVYNGGINPSGYTNIYNREN---------------AITISEGGNSAGFVNFVQEKFFS 107 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + D+ +L + + S ++ +++ + L + EQ Sbjct: 108 GGHNYTIVNNVTDTLFLFFYLCSIQ-EEIMRLRVGTGLPNIQKPTLMNLEIQKTTDNEQK 166 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I + ID+L+ + + LK + S++ Sbjct: 167 FIGLFL----KNIDILITLTQNKLNQLKSLKKSYLQ 198 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 47/180 (26%), Gaps = 9/180 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W++ + + G + L ++ Y +G S + + + Sbjct: 31 EWELCKLIGILDIIKGTQKSKSE------LSTNQNNCTPYPVYNGGINPSGYTNIY-NRE 83 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I + G + L +L + + I + G Sbjct: 84 NAITISEGGNSAGFVNFVQEKFFSGGHNYTIVNNVTDTLFLFFYLC--SIQEEIMRLRVG 141 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + N+ + EQ I + + I + + L K Q + Sbjct: 142 TGLPNIQKPTLMNLEIQKTTDNEQKFIGLFLKNIDILITLTQNKLNQLKSLKKSYLQNMF 201 >gi|253569701|ref|ZP_04847110.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251840082|gb|EES68164.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 156 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 12/64 (18%), Positives = 29/64 (45%), Gaps = 2/64 (3%) Query: 335 WLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++++ +L + + L + K + + +PP KEQ I I +D++ Sbjct: 93 YVLQAINLHRKVLRESKVGSAIPHLNKKLFKAIEIPIPPYKEQQRIIKAITKAFMSLDLI 152 Query: 393 VEKI 396 +E + Sbjct: 153 MESL 156 >gi|227546690|ref|ZP_03976739.1| possible type I restriction enzyme, S subunit [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227213007|gb|EEI80886.1| possible type I restriction enzyme, S subunit [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 159 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 39/118 (33%), Gaps = 7/118 (5%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 Y + + I Q + V + A D+ +LA L+ D Sbjct: 43 GYAKQYNHDGFYALIGRQGALCGNVNTAVGKAYFTEHAVAVKANFLHDTRFLAHLLGCMD 102 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 L + G + L +K + VP EQ I + +R+D L+ ++ Sbjct: 103 LGRY---SGQSAQPGLAVGVLKEVETTVPSKAEQQAIGSF----FSRLDSLITLHQRK 153 >gi|219851732|ref|YP_002466164.1| restriction modification system DNA specificity subunit [Methanosphaerula palustris E1-9c] gi|219545991|gb|ACL16441.1| restriction modification system DNA specificity subunit [Methanosphaerula palustris E1-9c] Length = 180 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 34/120 (28%), Gaps = 5/120 (4%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +G P+ W++ I + G + ++ +V D Sbjct: 4 LGVFPETWQIKKIGDLFNVQQGISMSPARRNGPNKHPFLRTLNVFWSGIDLKTLDYMDLS 63 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 G +L + G R AI C Q + + + ++ +++ Sbjct: 64 EKEIGKLNLLPGDLLVCEGGDIGRSAIWRGELESCGYQNHIHRLRVKNCDVYPEFVVFWM 123 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 44/154 (28%), Gaps = 9/154 (5%) Query: 224 WVGLVPDHWEVKPFFALVTELN-------RKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 +G+ P+ W++K L R+N + +L+ L+T + Sbjct: 3 NLGVFPETWQIKKIGDLFNVQQGISMSPARRNGPNKHPFLRTLNVFWSGIDLKTLDYMDL 62 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 E + PG+++ R + VK + ++ + Sbjct: 63 SEKEIGKLNLLPGDLLVCEGGDIGRSAIWRGELESCGYQNHIHRLRVKNCDVYPEFVVFW 122 Query: 337 MRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVL 368 M++ +L +K + Sbjct: 123 MQAAIKILGFYQDEGNKTTIPNLSQSRLKNFDIP 156 >gi|332288723|ref|YP_004419575.1| EcoKI restriction-modification system protein HsdS [Gallibacterium anatis UMN179] gi|330431619|gb|AEC16678.1| EcoKI restriction-modification system protein HsdS [Gallibacterium anatis UMN179] Length = 440 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 55/150 (36%), Gaps = 7/150 (4%) Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + G+++ R + + V +I+ + + ++ I+ Sbjct: 45 IKNGDLVNLDNLRYGNNEMYKKWMKEEVKKEDIILTSEAPLGETYYI---DNDQKYILGQ 101 Query: 319 AYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQ 375 V + YL + S + + SG Q +K ++ ++ V +PP++ Q Sbjct: 102 RVFGLRVNKEKVVPKYLEIWLSSLKGQQELFKRASGSTVQGIKQTELLKITVDIPPLEIQ 161 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKE 405 I + + + +I L + Q++ + + Sbjct: 162 EKIATIGDSLSKKI-KLNTQTNQTLEQIAQ 190 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 66/460 (14%), Positives = 128/460 (27%), Gaps = 82/460 (17%) Query: 23 KHWKVVPIKRFTKLN---TGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 W V + L G+T + K + + +++G L Sbjct: 2 SDWAKVELSELLTLVIDHRGKTPKKMGFDDFFSKGYPVLSAKHIKNGDLVNLDNLRYGNN 61 Query: 73 SDTST--VSIFAKGQILYGKLGPYLRKAIIADFD-GICSTQFL--VLQPKDVLPELLQGW 127 K I+ P I + I + + + V+P+ L+ W Sbjct: 62 EMYKKWMKEEVKKEDIILTSEAPLGETYYIDNDQKYILGQRVFGLRVNKEKVVPKYLEIW 121 Query: 128 LLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 L S+ Q + G+T+ + I + IPPL Q I + + +I Sbjct: 122 LSSLKGQQELFKRASGSTVQGIKQTELLKITVDIPPLEIQEKIATIGDSLSKKIKLNTQT 181 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE------------------------ 223 ++ + + + + IE Sbjct: 182 NQTLEQIAQAIFKHWFIDFAPVHAKANALARGETIEQAELAAMACLSGKTVDKITALKAQ 241 Query: 224 ----------------------WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 +GLVP WE +++ R K + Sbjct: 242 DPTAYQQLQQTAAAFPSEFVETEMGLVPKGWEWLKIENIIS---RLKNKQKINKNNISDI 298 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 GNI + +N+ + S I P + +F F D + + +I Sbjct: 299 GNIPVFEQGQNILMGYHSDNPAFIATPQDPIFIFGDHTCMTHISTKPFSIYQNVI----- 353 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 G D L + D K E + + + +P + Sbjct: 354 --PIKGKDIPTLWVYLAVKDKQKFQEYR------RHWMEFIIK-EICLPNRDLIEHFVEL 404 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + D + E + I L++ R + ++G+I+L Sbjct: 405 VTHLFEKKDAIYE--QNKI--LRKVRDELLPKLLSGEIEL 440 >gi|167767097|ref|ZP_02439150.1| hypothetical protein CLOSS21_01615 [Clostridium sp. SS2/1] gi|167711072|gb|EDS21651.1| hypothetical protein CLOSS21_01615 [Clostridium sp. SS2/1] gi|291559568|emb|CBL38368.1| Type I restriction-modification system methyltransferase subunit [butyrate-producing bacterium SSC/2] Length = 573 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/147 (12%), Positives = 49/147 (33%), Gaps = 3/147 (2%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + N I + + + E Y V +V + + + + Sbjct: 422 NIQNGIINDDLPFIKSIDKKLEKYC-VKNNSLVISKNGTPAKVAVVSVPEERKVLANGNL 480 Query: 320 YMAVKPHGI-DSTYLAWLMRSYDL-CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 Y+ + ++ + S + + M + ++ + +K++ + P +Q Sbjct: 481 YVIELDETKVNPYFVKAYLESENGGIALSRIMVGAVMPNIPVDGLKKIIIPCPEKDKQNK 540 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLK 404 I + I VL K+ ++I ++ Sbjct: 541 IAEKYLAKIDEIKVLKYKLSKAIAEME 567 >gi|55820897|ref|YP_139339.1| type I restriction-modification system specificty subunit, truncated [Streptococcus thermophilus LMG 18311] gi|55736882|gb|AAV60524.1| type I restriction-modification system specificty subunit, truncated [Streptococcus thermophilus LMG 18311] Length = 48 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 11/45 (24%), Positives = 22/45 (48%), Gaps = 4/45 (8%) Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 VP +EQ I + ++D + ++ + LLKE++ F+ Sbjct: 5 VPSYEEQQKIGSF----FKQLDDAIALHQRKLDLLKEQKKGFLQK 45 >gi|225378422|ref|ZP_03755643.1| hypothetical protein ROSEINA2194_04090 [Roseburia inulinivorans DSM 16841] gi|225209737|gb|EEG92091.1| hypothetical protein ROSEINA2194_04090 [Roseburia inulinivorans DSM 16841] Length = 244 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 55/196 (28%), Gaps = 6/196 (3%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 +W + NT N+ ++S + +M P+ Sbjct: 50 DWQVKPLGAICSFRNGINYDKNVEGNTVYKIINVRNISSSTLFLDESNFDMICLPQQQGD 109 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 V I+ + R L I + P+ L Sbjct: 110 KYRVSNDSIIIARSGIPGTTRIL--YNPSSNIIFCGFIICCTPYDNTLQNYLTLYLRQFE 167 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G + +++ E +K L V +P Q + + N +RI L+ + V Sbjct: 168 GSSATQTGGSILKNVSQETLKNLLVPIP----QQSLLSKFNDSVSRIYNLINGNIKENVQ 223 Query: 403 LKERRSSFIAAAVTGQ 418 L R + + GQ Sbjct: 224 LTTLRDWLLPMLMNGQ 239 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 56/191 (29%), Gaps = 7/191 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSR--QSD 74 IP W+V P+ G + + I + ++ S T + + Sbjct: 47 IPADWQVKPLGAICSFRNGINYDKNVEGNTVYKIINVRNISSSTLFLDESNFDMICLPQQ 106 Query: 75 TSTVSIFAKGQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + I+ + G P + + I F++ L Sbjct: 107 QGDKYRVSNDSIIIARSGIPGTTRILYNPSSNIIFCGFIICCTPYDNTLQNYLTLYLRQF 166 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 G+ + + + + N+ +PIP + + + I+ I E ++ Sbjct: 167 EGSSATQTGGSILKNVSQETLKNLLVPIPQQSLLSKFNDSVSRIYNLINGNIKENVQLTT 226 Query: 194 LLKEKKQALVS 204 L L++ Sbjct: 227 LRDWLLPMLMN 237 >gi|315651215|ref|ZP_07904245.1| conserved hypothetical protein [Eubacterium saburreum DSM 3986] gi|315486511|gb|EFU76863.1| conserved hypothetical protein [Eubacterium saburreum DSM 3986] Length = 199 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 60/193 (31%), Gaps = 14/193 (7%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 PD W + + SN + + I N GL + + Sbjct: 16 PDGWTRATLGEVSLMGAGGDKPKTVSNTQTENCPYPIYSNGISNDGLY--GFTNKCKIKD 73 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 I + + I+ + K + + YL +R+ + Sbjct: 74 ESITVSARGTIGF---VCLRHIPYTPIVRLITLIPKTDVLSAKYLYLWLRNMHIHG---- 126 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 +Q L D ++ +++P +E T+ I + + + + L R Sbjct: 127 -TGTTQQQLTVPDFRKTDIILPTKEEMTLFTDTITPLFE----AIWENQAQNLKLSNTRD 181 Query: 409 SFIAAAVTGQIDL 421 + + ++G++D+ Sbjct: 182 ALLPMLMSGKLDI 194 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 22/190 (11%), Positives = 44/190 (23%), Gaps = 21/190 (11%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W + + + G ++ Y + S G Y Sbjct: 15 LPDGWTRATLGEVSLMGAGGDKPKTVSNTQTENCPYPIYSNGISNDGLY----------G 64 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 + I G + + + L PK + +L ++ Sbjct: 65 FTNKCKIKDESITVSARGTIGFVCLRHIPYTPI-VRLITLIPKTDVLSAKYLYLWLRNMH 123 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 G T + +P E L + I I + ++ Sbjct: 124 IH----GTGTTQQQLTVPDFRKTDIILPTKEEMTLFTDTITPLFEAIWENQAQNLKLSNT 179 Query: 195 LKEKKQALVS 204 L+S Sbjct: 180 RDALLPMLMS 189 >gi|332141624|ref|YP_004427362.1| type I restriction-modification system methyltransferase subunit [Alteromonas macleodii str. 'Deep ecotype'] gi|332143450|ref|YP_004429188.1| type I restriction-modification system methyltransferase subunit [Alteromonas macleodii str. 'Deep ecotype'] gi|327551646|gb|AEA98364.1| type I restriction-modification system methyltransferase subunit [Alteromonas macleodii str. 'Deep ecotype'] gi|327553472|gb|AEB00191.1| type I restriction-modification system methyltransferase subunit [Alteromonas macleodii str. 'Deep ecotype'] Length = 713 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/293 (13%), Positives = 91/293 (31%), Gaps = 9/293 (3%) Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 ++ ++ + + + + V +R + + + I E Sbjct: 399 QSNILFFDRNGPTKGVWFYQHEVPVERRGMKNPCYTVTNALKEEEMAEIRTWYESPCESE 458 Query: 169 LI----REKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 E I ++ D + + E Q +S +++ N + + Sbjct: 459 YAWFVPSEDIRSKDFSFDFRNPRKEQQELKDPEHLQQALSSYLSRIENSNANFQSESHTI 518 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETY 283 + W + + + ++ + R L K ++ Sbjct: 519 RNIDKKSWNEFKIGDFLIRSKNSIELEDDVDYKQITVKLYGKGAVLRKTILGKDIKTKSQ 578 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-YMAVKPHGIDSTYLAWLMRSYDL 342 + G+++ ID +N ++ + + + I +LA+L+RS + Sbjct: 579 FLAQSGQLIMSRIDARNGAFAIVPYDLDGAVVTQDFPLFDINRDVILPEFLAFLLRSKEF 638 Query: 343 CK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + G+ R+ LK E + +P I EQ I N E ++ LV Sbjct: 639 TYACQHASKGTTNRKRLKEELFLSEVLFLPSISEQKVIVAY-NRELNKLANLV 690 >gi|260577393|ref|ZP_05845362.1| putative type I restriction enzyme, S subunit [Rhodobacter sp. SW2] gi|259020396|gb|EEW23723.1| putative type I restriction enzyme, S subunit [Rhodobacter sp. SW2] Length = 83 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 17/46 (36%), Positives = 25/46 (54%) Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 I I+ ET +D VE+ I L++E R IA TG++D+R Sbjct: 2 IVQYIHEETKDLDKAVEETTSEINLIREYRERLIADVATGRLDVRH 47 >gi|307288987|ref|ZP_07568951.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0109] gi|306500056|gb|EFM69409.1| type I restriction modification DNA specificity domain protein [Enterococcus faecalis TX0109] Length = 183 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 51/149 (34%), Gaps = 8/149 (5%) Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF----RFIDLQNDKRSLRSAQ 309 ++S+ + K +N+ ++V GE+ + + RSL + Sbjct: 39 YKVISIGSYGLDSKYVDQNIRAVSNEVTDSRVVRNGELTMVLNDKTANGTIIGRSLLIEE 98 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + I + DS + ++ V + G + + + V L + + Sbjct: 99 DNKYVINQRTEIISPKENFDSNFAYTILNGPFRESVKRIVQGGTQIYVNYPAVSNLVLKL 158 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQ 398 P ++EQ I ++D + ++ Sbjct: 159 PDVEEQKKIGLF----FKQLDDTIALQQR 183 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 58/169 (34%), Gaps = 10/169 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSES--GKDIIYIGLEDVESG-TGKYLPKDGNSRQSDTSTVS 79 + W+ + G E +D Y + G KY+ ++ + ++ + Sbjct: 10 EDWEERKLSEVANHRGGTAIEKYFKEDGKYKVISIGSYGLDSKYVDQNIRAVSNEVTDSR 69 Query: 80 IFAKGQI--LYGKLGPYLRKAII-----ADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + G++ + D + + + ++ PK+ +L+ Sbjct: 70 VVRNGELTMVLNDKTANGTIIGRSLLIEEDNKYVINQRTEIISPKENFDSNFAYTILNGP 129 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + ++ I +G T + ++ + N+ + +P + EQ I I Sbjct: 130 FRESVKRIVQGGTQIYVNYPAVSNLVLKLPDVEEQKKIGLFFKQLDDTI 178 >gi|15828554|ref|NP_325914.1| restriction-modification enzyme subunit S3A [Mycoplasma pulmonis UAB CTIP] gi|14089496|emb|CAC13256.1| RESTRICTION-MODIFICATION ENZYME SUBUNIT S3A [Mycoplasma pulmonis] Length = 359 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/353 (10%), Positives = 102/353 (28%), Gaps = 45/353 (12%) Query: 27 VVPIKRFTKLNTGRTSE----------SGKDIIYIGLEDVESGT--GKYLPKDGNSRQSD 74 + + K+ +G+ + I ++ +++ ++ GK++ + + Sbjct: 3 IYKLGEIAKIVSGKGPKIEKGLEKYEDKNGTINWLLVKNFKNNNLDGKFIKYNLDPIIHK 62 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 K +I Y AI D+D + Q L + I Sbjct: 63 LVK---LNKNEIAYSMYATPGLVAINQDYDNLYINQSFCKILPSKLVLHKYLFYYLISKR 119 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 ++ + G T ++ + + N+ + IP L Q I I + +I + Sbjct: 120 KQFLQLASGTTQNNLNISKVKNLTISIPSLETQSAILNIIEPLEKLFFNVKNLKIILEKF 179 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + + + +K S I + Sbjct: 180 VSKTYK-------HSKKRKVNMLKASKISFFNYRNQKLYCPT------------------ 214 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 S + + + + + + P F L + + L ++ E Sbjct: 215 -----SLVGKLSLSINKVENISFHNRPSRANLSPLNNSILFSKLVGENKILPISKEEEIV 269 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 T + + ++ +++L+ + + +S+ +++K + Sbjct: 270 FSTGFFNIQDKNNLNDNLISFLLSEDFVEQKNKYKQGTTMESINVKNLKMFDI 322 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 36/142 (25%), Gaps = 3/142 (2%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 N L + ++ EI + + Q + Sbjct: 35 NWLLVKNFKNNNLDGKFIKYNLDPIIHKLVKLNKNEIAYSMYATPGL---VAINQDYDNL 91 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 I ++ + P + + + + +L VK L + +P ++ Sbjct: 92 YINQSFCKILPSKLVLHKYLFYYLISKRKQFLQLASGTTQNNLNISKVKNLTISIPSLET 151 Query: 375 QFDITNVINVETARIDVLVEKI 396 Q I N+I + Sbjct: 152 QSAILNIIEPLEKLFFNVKNLK 173 >gi|227546694|ref|ZP_03976743.1| type I restriction enzyme HindVIIP specificity protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212840|gb|EEI80719.1| type I restriction enzyme HindVIIP specificity protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 200 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 43/135 (31%), Gaps = 12/135 (8%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y V +V + + + SA + K +G ++ +L Sbjct: 58 YHNEYKVKGPGVVTGRSGTIGNLQYVESAFWP----HNTTLWVTKFYGNHPKFIYYLYEK 113 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQ 398 DL + +L DV V P KEQ I+ V+ +D L+ ++ Sbjct: 114 IDLKRY---KAGSGVPTLNRNDVHDTMVFFPASRKEQELISAVL----TYLDDLITLHQR 166 Query: 399 SIVLLKERRSSFIAA 413 L + S + Sbjct: 167 KYDKLVIFKKSMLEK 181 Score = 38.6 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 51/180 (28%), Gaps = 17/180 (9%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + L G + K I + + +G G Y + Sbjct: 20 WEQRKLIMVAPLQRGFDLPAEKIIPGVYPVMMSNGIGAYHNE------------YKVKGP 67 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ G+ G + +T V + P+ + ID+ G+ Sbjct: 68 GVVTGRSGTIGNLQYVESAFWPHNTTLWVTKFYGNHPKFIYYLYEKIDLK----RYKAGS 123 Query: 145 TMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + + + P EQ LI + I + + + K + + Sbjct: 124 GVPTLNRNDVHDTMVFFPASRKEQELISAVLTYLDDLITLHQRKYDKLVIFKKSMLEKMF 183 >gi|238855604|ref|ZP_04645905.1| restriction modification system DNA specificity domain protein [Lactobacillus jensenii 269-3] gi|260665336|ref|ZP_05866184.1| restriction modification system DNA specificity subunit [Lactobacillus jensenii SJ-7A-US] gi|282933446|ref|ZP_06338823.1| type I restriction modification DNA specificity protein [Lactobacillus jensenii 208-1] gi|313473090|ref|ZP_07813574.1| ribosomal protein L10 [Lactobacillus jensenii 1153] gi|238831748|gb|EEQ24084.1| restriction modification system DNA specificity domain protein [Lactobacillus jensenii 269-3] gi|239528674|gb|EEQ67675.1| ribosomal protein L10 [Lactobacillus jensenii 1153] gi|260560840|gb|EEX26816.1| restriction modification system DNA specificity subunit [Lactobacillus jensenii SJ-7A-US] gi|281302429|gb|EFA94654.1| type I restriction modification DNA specificity protein [Lactobacillus jensenii 208-1] Length = 372 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 47/390 (12%), Positives = 109/390 (27%), Gaps = 37/390 (9%) Query: 29 PIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + + ++N+ + Y V + N ++ + A G Sbjct: 4 KVSQIAEINSNSIKPKLYSGALNYEDTSSVTDNCFIRPIRYDNITEAPSRARRKAAIGDT 63 Query: 87 LYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + + P D + I ST F V+ P + + +LL + + G Sbjct: 64 VISTVRPNNLHYGFIDKNNCDWIYSTGFAVVHPDKKIVDPFYLFLLLSLKSTTQKLQDIG 123 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 T + + + ++KI + + + L + + Sbjct: 124 ETSKSTYPAVKPDDIANLQFEIPSLEKQKKISFIFKNLYQKSKLNNQINDNLDDLMTTIF 183 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + + + + GL + + ++ L Sbjct: 184 NNKIINSKFEVSSLTNIANYKNGLA---------------MQKFRPTENSESLPVLKIRE 228 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + Q N + + IV+ G+I+F + L + + V Sbjct: 229 LNQGSTDNNSDRCSANIDPEVIVNTGDIIFSWSGTL-----LVKIWSGNKSGLNQHLFKV 283 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + ++ + + A G +K D+K VL+P + Sbjct: 284 TSSEYPNWFIYEWTKFHLHKFQSIAAGKATTMGHIKRNDLKSSKVLIPDK------VSF- 336 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +I + E+ + L+KE S + Sbjct: 337 -DKFNKIMSPI--YEKRLELIKEN-QSLMT 362 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 54/184 (29%), Gaps = 10/184 (5%) Query: 26 KVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +V + G R +E+ + + + + ++ G+ + + ++ Sbjct: 193 EVSSLTNIANYKNGLAMQKFRPTENSESLPVLKIRELNQGS---TDNNSDRCSANIDPEV 249 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I G I++ G L K + G + + + + W + A Sbjct: 250 IVNTGDIIFSWSGTLLVKIWSGNKSG-LNQHLFKVTSSEYPNWFIYEWTKFHLHKFQSIA 308 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + TM H + + + IP + + + LI E + + Sbjct: 309 AGKATTMGHIKRNDLKSSKVLIPDKVSFDKFNKIMSPIYEKRLELIKENQSLMTFKENLL 368 Query: 200 QALV 203 Sbjct: 369 TKYF 372 >gi|238854451|ref|ZP_04644791.1| type I restriction-modification system S protein [Lactobacillus jensenii 269-3] gi|282932596|ref|ZP_06338017.1| type-1 restriction enzyme MjaXIP specificity protein [Lactobacillus jensenii 208-1] gi|313472062|ref|ZP_07812554.1| HsdS specificity protein of type I restriction-modification system [Lactobacillus jensenii 1153] gi|238832944|gb|EEQ25241.1| type I restriction-modification system S protein [Lactobacillus jensenii 269-3] gi|239530093|gb|EEQ69094.1| HsdS specificity protein of type I restriction-modification system [Lactobacillus jensenii 1153] gi|281303292|gb|EFA95473.1| type-1 restriction enzyme MjaXIP specificity protein [Lactobacillus jensenii 208-1] Length = 184 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 59/190 (31%), Gaps = 27/190 (14%) Query: 25 WKVVPIKRFTKLNTGRTSESGKD------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 WK V + +G T ++G I +I ++ S + + S+ Sbjct: 14 WKKVKLGEIATTYSGGTPKAGNKKYYNGLIPFIRSGEIHSNKTELF---ISEAGLKNSSA 70 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQRI 137 + KG +LY G S F + D L + + Sbjct: 71 KMVTKGDLLYALYGAT-------------SQAFFNMTFDDDEKRDFIYIILEKANFDKEW 117 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + T ++ + K I N + P + + IDT I + + I + Sbjct: 118 IRLISTGTQNNLNAKKIRNFHIVFPT----YKALKGLNKLFCNIDTDIDIQYKVIVTTNQ 173 Query: 198 KKQALVSYIV 207 KQ L+ + Sbjct: 174 LKQFLLQNLF 183 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 16/171 (9%), Positives = 51/171 (29%), Gaps = 23/171 (13%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 N K I + G I + + ++V G++++ Sbjct: 34 GNKKYYNGLIPFIRSGEIHSNKTELFISEAGLKNSSAKMVTKGDLLYALYGAT------- 86 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + A+ + + +++ ++ + + +G + +L + ++ Sbjct: 87 ----------SQAFFNMTFDDDEKRDFIYIILEKANFDKEWIRLISTGTQNNLNAKKIRN 136 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ P +N ID ++ + IV + + + Sbjct: 137 FHIVFPTY----KALKGLNKLFCNIDTDIDIQYKVIVTTNQLKQFLLQNLF 183 >gi|325474569|gb|EGC77755.1| type I restriction-modification system [Treponema denticola F0402] Length = 138 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 34/106 (32%), Gaps = 5/106 (4%) Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 G+I+F + I +++ + + Y+ + S Sbjct: 37 AGDILFTSVGSLGR----SCIYDGRMNICFQRSVSILNTKVYNKYVKFFFDSNFYQNYVA 92 Query: 348 AMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +G + +++ + +PPI EQ I I +D + Sbjct: 93 EHATGTAQMGFYLQEMAESFIAIPPISEQKRIVAKIEEIFYVLDNI 138 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 55/140 (39%), Gaps = 5/140 (3%) Query: 44 SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADF 103 S ++I + +ED+E +YL K+ ++ + + G IL+ +G R I Sbjct: 3 SSRNINHNTVEDLE--NVRYLTKEMFDAENLRTNAT---AGDILFTSVGSLGRSCIYDGR 57 Query: 104 DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 IC + + + V + ++ + S + G + + + IPP Sbjct: 58 MNICFQRSVSILNTKVYNKYVKFFFDSNFYQNYVAEHATGTAQMGFYLQEMAESFIAIPP 117 Query: 164 LAEQVLIREKIIAETVRIDT 183 ++EQ I KI +D Sbjct: 118 ISEQKRIVAKIEEIFYVLDN 137 >gi|169825071|ref|YP_001692682.1| type I restriction-modification system specificity subunit [Finegoldia magna ATCC 29328] gi|167831876|dbj|BAG08792.1| type I restriction-modification system specificity subunit [Finegoldia magna ATCC 29328] Length = 180 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 47/119 (39%), Gaps = 6/119 (5%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 +I++ I N + + + I ++ M ++P + +L L++S + Sbjct: 59 FKKNDILYSEIRPANKRFAYIDFEDTSNYIASTKLMVLRPRVDVVLPGFLFALLKSERML 118 Query: 344 KVFYAMG---SGLRQSLK-FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + SG + ++ +PV +P Q I +++ +++ V+ + Sbjct: 119 EELQHLAVTRSGTFPQITFKSELSTMPVALPDFDSQKRIVSILEAIEGKMNQNVQINKN 177 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 58/177 (32%), Gaps = 10/177 (5%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK V I + + ++I + DV G K N + F K Sbjct: 3 EWKKVTIGDLCDTISDTYRGNADEVILVNTSDVLEGKVLNHEKVPN-KNLKGQFKKTFKK 61 Query: 84 GQILYGKLGPYLRKAIIADF----DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ILY ++ P ++ DF + I ST+ +VL+P+ + + L E Sbjct: 62 NDILYSEIRPANKRFAYIDFEDTSNYIASTKLMVLRPRVDVVLPGFLFALLKSERMLEEL 121 Query: 140 IC-----EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G + +P+ +P Q I + A +++ + Sbjct: 122 QHLAVTRSGTFPQITFKSELSTMPVALPDFDSQKRIVSILEAIEGKMNQNVQINKNL 178 >gi|291515458|emb|CBK64668.1| Type I restriction-modification system methyltransferase subunit [Alistipes shahii WAL 8301] Length = 837 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 50/125 (40%), Gaps = 5/125 (4%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMR 338 + V G+ + ID ++ + + + E I+T ++ I YL ++ Sbjct: 706 KRQTRVKGGQFIISKIDGKSAAFGIVDSSL-EGAIVTPDFLVYDIDTTQILPEYLELVLT 764 Query: 339 SYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + + F SG R+ L + + + +P I EQ ++ I L E++ Sbjct: 765 NDAILNQFSISSSGTTGRRRLSQKVFENTLIALPSIDEQRNLLAKILEIRETQKSLEEQM 824 Query: 397 EQSIV 401 +++I Sbjct: 825 QKNIE 829 >gi|269978328|gb|ACZ55898.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 355 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 41/365 (11%), Positives = 96/365 (26%), Gaps = 31/365 (8%) Query: 50 YIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS 108 +I D+ + + + + IL G +G + D + Sbjct: 2 FITPNDLHGTYRIIKTSRTLSDSGLKSIQNNTIDNTSILVGCIGDVGMVRMCFDKCA-TN 60 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 Q + + + + + I + I + +P + Q Sbjct: 61 QQINSITDIKDFCNPYYLYYYLSNKKELFKNIALSTVVPIIPKTIFQEIEVLLPNIETQQ 120 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI-----E 223 I + +I+ ++L+ + N + G E Sbjct: 121 KIARTLSILDQKIENNHKINELLHKILELLYEQYFVRFDFLDENNKPYQTNGGKMKFSKE 180 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 L+P+ +EVK L SN + + + +E Sbjct: 181 LNRLIPNDFEVKTLGELTQLKVGNKNANHSSNQGKYPFFTCSNN----PLRCETYQFEGK 236 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 I+ G + + V P+ + L +L Sbjct: 237 HIIISGN------------GNFYVTHYDGKFDAYQRTYVVNPNNPNHYVLIYLFVKSYTN 284 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + + + D++ + +++P +K + ++ ++E QS L Sbjct: 285 YLKLQSHGSIIKFITKSDIENIKIVLPNLKT--------YTKWNKVLKMIENNNQSTQTL 336 Query: 404 KERRS 408 R Sbjct: 337 TALRD 341 >gi|223940845|ref|ZP_03632675.1| conserved hypothetical protein [bacterium Ellin514] gi|223890495|gb|EEF57026.1| conserved hypothetical protein [bacterium Ellin514] Length = 169 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 49/145 (33%), Gaps = 16/145 (11%) Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-STYLAWLMRSYDL 342 + G+++ N + + + + AW + D Sbjct: 27 HCLFAGDVLVASRGNWNTASVIVPKTDDIVIAAPNLLVVRIRTATLRPDFFAWWLNQPDT 86 Query: 343 CKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 ++ A SG + ++ L V VP ++ Q I +I L + ++ + Sbjct: 87 QEMIRARRSGSTIPFISIPELSDLKVPVPNVETQEKIL--------KIHKLWIREQELLE 138 Query: 402 LLKERR----SSFIAAAVTG-QIDL 421 +K +R S +A +T +I + Sbjct: 139 EIKNKRRTFVQSILAD-MTAEKIKI 162 >gi|310765246|gb|ADP10196.1| restriction modification system DNA specificity subunit [Erwinia sp. Ejp617] Length = 363 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 52/197 (26%), Gaps = 8/197 (4%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +G +P+ W P + K +S N+++ + S T Sbjct: 163 ELGEIPEGWNAGPLGDIANFAKGKIEVAKLKTDTYISTENMLENKAGISHASSLPSVNTV 222 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 PG I+ I K L + +L L+ Sbjct: 223 PNFSPGHILISNIRPYFKKIWLARFSGGRSA---DVLAFENKKKVTVEFLYNLLSQDVFF 279 Query: 344 KVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G+ + + P +E + ++ + +E Sbjct: 280 DFMMLTSKGVKMPRGDKTSIMNWTCIQP--EE--KVLSIYSTSVVEFYSYIESHNLENKY 335 Query: 403 LKERRSSFIAAAVTGQI 419 L R + + ++G+I Sbjct: 336 LTNLRDTLLPKLLSGEI 352 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 58/189 (30%), Gaps = 5/189 (2%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +G IP+ W P+ G+ K YI E++ + Sbjct: 164 LGEIPEGWNAGPLGDIANFAKGKIEVAKLKTDTYISTENMLENKAGISHASSLPSVNTVP 223 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQ 135 F+ G IL + PY +K +A F G S L + K + E L L Sbjct: 224 N---FSPGHILISNIRPYFKKIWLARFSGGRSADVLAFENKKKVTVEFLYNLLSQDVFFD 280 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + +G M D I N P + ++ I++ E L Sbjct: 281 FMMLTSKGVKMPRGDKTSIMNWTCIQPEEKVLSIYSTSVVEFYSYIESHNLENKYLTNLR 340 Query: 196 KEKKQALVS 204 L+S Sbjct: 341 DTLLPKLLS 349 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 13/94 (13%), Positives = 30/94 (31%), Gaps = 6/94 (6%) Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQF 376 + K + YL + S +V + +++ + +P I Q Sbjct: 2 GLLRAKKDKVIPEYLLYTYLSPAFQEVIREKTIHGSTTDRISIKEIPSFKIQIPDIHTQI 61 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 V+ ID ++ +Q L++ + Sbjct: 62 RTVKVL----KNIDDKIKINQQINQTLEQMAQAL 91 >gi|126665696|ref|ZP_01736677.1| Restriction modification system DNA specificity domain [Marinobacter sp. ELB17] gi|126629630|gb|EBA00247.1| Restriction modification system DNA specificity domain [Marinobacter sp. ELB17] Length = 350 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 38/118 (32%), Gaps = 16/118 (13%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKY 63 +DS +G IP+ W I + G +S G I + D+++ Sbjct: 140 MQDSE---LGEIPEGWSYSSIYELADVIYGAAFKSKLFNNVGDGTPLIRIRDLKN----- 191 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 K G S + + +L G G + + I + + +P+ Sbjct: 192 -EKPGVSTPEEHPKGYLVQNADLLAGMDGEF-KPYIWGGGLAWMNQRVCCFKPRKGYS 247 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 63/205 (30%), Gaps = 21/205 (10%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 +G +P+ W + L + + L + G+ + R++ + T Sbjct: 144 ELGEIPEGWSYSSIYELADVIYG----AAFKSKLFNNVGDGTPLIRIRDLKNEKPGVSTP 199 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSA-QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + G +V L + + KP S L L Sbjct: 200 EEHPKGYLVQNADLLAGMDGEFKPYIWGGGLAWMNQRVCCFKPRKGYSVSLIKGFIEPQL 259 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI--TNVINVETARI-DVLVEKI--- 396 + + L D+ R I + + ++I LVE+I Sbjct: 260 RSLELTASATTVIHLGKGDINRFEF----------INAGSALFEAYSKITQSLVEQIVIN 309 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDL 421 + S L+ +R + + ++G++ + Sbjct: 310 KTSARTLEHQRDALLPKLLSGELSV 334 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 2/65 (3%) Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL-K 404 Y + SLK D+ + + +P + EQ I + +I + KI Q++ + + Sbjct: 1 MYINVGAVFDSLKCADIPKFEIYLPELNEQKRIAETLGGLDGKI-QINHKINQTLEQMAQ 59 Query: 405 ERRSS 409 S Sbjct: 60 ALFKS 64 >gi|325696150|gb|EGD38041.1| 50S ribosomal protein L10 [Streptococcus sanguinis SK160] Length = 215 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 63/204 (30%), Gaps = 19/204 (9%) Query: 226 GLVPDHWEVKPFFALVTE-----LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 G P W+ + + + E ++ L + Q + + L + Sbjct: 18 GEKPSDWKTANLTDIAEFLNGLAMQKYRPLDNEESLPVLKIKELRQGIFDSSSDLCSANI 77 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + I+ G+++F + L G + V D + + Sbjct: 78 KRPYIIQDGDVIFSWSGSL-----LVDFWTGGIGGLNQHLFKVSSQEYDK--WFYYSWTK 130 Query: 341 DLCKVFYAMGS---GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 F A+ + + + +++ +L+P + I + A + Sbjct: 131 YYLDEFIAIAADKATTMGHITRKSLEKAEILIPNDHDYKSIG----LLLAPTYNQIISNR 186 Query: 398 QSIVLLKERRSSFIAAAVTGQIDL 421 L E R+S + ++G+I + Sbjct: 187 IENRKLMEVRNSLLPKLLSGEISV 210 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 58/192 (30%), Gaps = 10/192 (5%) Query: 19 GAIPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G P WK + + G R ++ + + + ++++ G + Sbjct: 18 GEKPSDWKTANLTDIAEFLNGLAMQKYRPLDNEESLPVLKIKELRQG---IFDSSSDLCS 74 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 ++ I G +++ G L G + + ++ W Sbjct: 75 ANIKRPYIIQDGDVIFSWSGSLL-VDFWTGGIGGLNQHLFKVSSQEYDKWFYYSWTKYYL 133 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 A + TM H K + + IP + I + +I + E + + Sbjct: 134 DEFIAIAADKATTMGHITRKSLEKAEILIPNDHDYKSIGLLLAPTYNQIISNRIENRKLM 193 Query: 193 ELLKEKKQALVS 204 E+ L+S Sbjct: 194 EVRNSLLPKLLS 205 >gi|259419398|ref|ZP_05743314.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259344639|gb|EEW56526.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 205 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 17/106 (16%), Positives = 35/106 (33%), Gaps = 2/106 (1%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTYLAWL 336 + +V GE++F+ N + +I + + YLAW Sbjct: 60 DDLPERHVVRGGEVIFKSRGEPNVAAPVTKNLEEPIAVILPLVILRPRAGLTLPDYLAWA 119 Query: 337 MRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNV 381 + + F G + ++ L V +P ++ Q I + Sbjct: 120 INQPRSQRYFDTEAQGTSMRMISKAVLEELDVPLPDLETQARIVAI 165 >gi|224540798|ref|ZP_03681337.1| hypothetical protein BACCELL_05712 [Bacteroides cellulosilyticus DSM 14838] gi|224517585|gb|EEF86690.1| hypothetical protein BACCELL_05712 [Bacteroides cellulosilyticus DSM 14838] Length = 225 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 70/219 (31%), Gaps = 5/219 (2%) Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S+ V D K +S + E+K +L+T+ + ++ Sbjct: 6 SWFVDFEPFKDGKFVNSEFGMIPEGWKISELKSICSLITKGITPQYDESSNQLVIGQKCI 65 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 +K++ + V G+++ + R + + + S V Sbjct: 66 RGRKIDLSIARKHIPKQINEKWVQYGDVLINSTGIGTLGRPAQVWFQKKNVTVDSHVTIV 125 Query: 324 KPHGIDSTYL-AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + + + YA GS + L E + ++ P K D +I Sbjct: 126 RTNRQNDKMFIGQYFLGKQILLESYATGSTGQADLSKELLAMTKLVYPTDKVLNDFNKII 185 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +I L + L R++ + ++G++ + Sbjct: 186 TNMVLKIVEL----QTETEYLSSLRNTLLPQLMSGELKI 220 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 51/193 (26%), Gaps = 9/193 (4%) Query: 19 GAIPKHWKVVPIKRFTK-LNTGRTS--ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 G IP+ WK+ +K + G T + + + IG + + + Sbjct: 25 GMIPEGWKISELKSICSLITKGITPQYDESSNQLVIGQKCIRGRKIDLSIARKHIP--KQ 82 Query: 76 STVSIFAKGQILYGK--LGPYLRK--AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 G +L +G R + + +++ ++ G Sbjct: 83 INEKWVQYGDVLINSTGIGTLGRPAQVWFQKKNVTVDSHVTIVRTNRQNDKMFIGQYFLG 142 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + P + I ++I L TE Sbjct: 143 KQILLESYATGSTGQADLSKELLAMTKLVYPTDKVLNDFNKIITNMVLKIVELQTETEYL 202 Query: 192 IELLKEKKQALVS 204 L L+S Sbjct: 203 SSLRNTLLPQLMS 215 >gi|238910687|ref|ZP_04654524.1| restriction modification system DNA specificity subunit [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] Length = 192 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 58/175 (33%), Gaps = 9/175 (5%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET---RNMGLKPESYETYQIVDPGE 290 + + + S L NI++ + G +S ++ + Sbjct: 1 MGNILHDIKYGTSQKCDYNISGYPVLRIPNIVKGIIDLADIKYGALTDSELKDLTLNKND 60 Query: 291 IVFRFIDLQNDKRS---LRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVF 346 ++F + + L + + + + I++ Y+ +M+S + + Sbjct: 61 LLFIRSNGSTNIVGQSTLVQHDLKDHAYAGYIIRVRLHNEYINARYINMVMKSNLIREQI 120 Query: 347 YA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + +++ ++ L V +PP EQ I IN + L I+ + Sbjct: 121 EGPIRTTTGVKNINSNELMGLLVPLPPKNEQGIIIKKINEIDTTLSNLKVSIQSA 175 Score = 37.1 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 17/186 (9%), Positives = 49/186 (26%), Gaps = 12/186 (6%) Query: 35 KLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKL 91 + G + + + + + ++ G + K +L+ + Sbjct: 7 DIKYGTSQKCDYNISGYPVLRIPNIVKGIIDLADIKYGALTDSELKDLTLNKNDLLFIRS 66 Query: 92 GPYLRKAIIA-------DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA--ICE 142 + V + + ++ ++ + I Sbjct: 67 NGSTNIVGQSTLVQHDLKDHAYAGYIIRVRLHNEYINARYINMVMKSNLIREQIEGPIRT 126 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + + + + + +P+PP EQ +I +KI + L + AL Sbjct: 127 TTGVKNINSNELMGLLVPLPPKNEQGIIIKKINEIDTTLSNLKVSIQSAQQTQVHLADAL 186 Query: 203 VSYIVT 208 + Sbjct: 187 TDAAIN 192 >gi|13507940|ref|NP_109889.1| hypothetical protein MPN201 [Mycoplasma pneumoniae M129] gi|12229987|sp|Q50287|T1SF_MYCPN RecName: Full=Putative type-1 restriction enzyme specificity protein MPN_201; AltName: Full=S.MpnORFFP; AltName: Full=Type I restriction enzyme specificity protein MPN_201; Short=S protein gi|1215687|gb|AAC43680.1| putative orf; GT9_orf238 [Mycoplasma pneumoniae] gi|1674334|gb|AAB96278.1| hypothetical protein MPN_201 [Mycoplasma pneumoniae M129] Length = 238 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 55/176 (31%), Gaps = 10/176 (5%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ-------IVDPGEI 291 + N +I + G I K RN + Y + + Sbjct: 48 RKIYGANIPFETFQVKDICEIRRGRAITKAYIRNNPGENPVYSAATTNDGELGHIKDCDF 107 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 +I + + + + + + T L+ + K + + Sbjct: 108 DGEYITWTTNGYAGVVFYRNGKFNASQDCGVLKVKNKKICTKFLSLLLEIEATKFVHNLA 167 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 S R L + + + + PP++ Q I +++ + LVE I I L K++ Sbjct: 168 S--RPKLSQKVMAEIELSFPPLEIQEKIADILCAFEKLCNDLVEGIPAEIELRKKQ 221 >gi|227523735|ref|ZP_03953784.1| conserved hypothetical protein [Lactobacillus hilgardii ATCC 8290] gi|227089050|gb|EEI24362.1| conserved hypothetical protein [Lactobacillus hilgardii ATCC 8290] Length = 193 Score = 54.8 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 60/179 (33%), Gaps = 6/179 (3%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQKLETRN 272 M I + + WE + K + + + + YG + K ++ Sbjct: 1 MFYILINAINFLEVAWEQRKLGDWGYFYYGHSAPKWSVVGDGGTPCVRYGELYTKSNSKI 60 Query: 273 MGLKPESYETYQIVD--PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + + + + G V ++ A + + ++V + Sbjct: 61 DHIYSHTNISVKNLKLSKGTEVLIPRVGEDPLDFAHCAWLSIPNVAIGEMISVFNTKQNP 120 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + A+ S + + G +L + + +PV P +KEQ +IT +I + I Sbjct: 121 LFTAYSFNSMLKYEFAKRVEGGGVANLYYAYLTNIPVSFPSMKEQTEITQLIENLISLI 179 Score = 42.9 bits (99), Expect = 0.099, Method: Composition-based stats. Identities = 17/176 (9%), Positives = 53/176 (30%), Gaps = 7/176 (3%) Query: 25 WKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 W+ + + G ++ + ++ + + + + + Sbjct: 16 WEQRKLGDWGYFYYGHSAPKWSVVGDGGTPCVRYGELYTKSNSKIDHIYSHTNISVKNLK 75 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQF--LVLQPKDVLPELLQGWLLSIDVTQRI 137 + ++L ++G + I + ++ L + + + Sbjct: 76 LSKGTEVLIPRVGEDPLDFAHCAWLSIPNVAIGEMISVFNTKQNPLFTAYSFNSMLKYEF 135 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 EG +++ + + NIP+ P + EQ I + I I + ++ Sbjct: 136 AKRVEGGGVANLYYAYLTNIPVSFPSMKEQTEITQLIENLISLIAANQGKHLQIKN 191 >gi|300869810|ref|YP_003784681.1| putative type I restriction endonuclease S subunit HsdS [Brachyspira pilosicoli 95/1000] gi|300687509|gb|ADK30180.1| putative type I restriction endonuclease, S subunit, HsdS [Brachyspira pilosicoli 95/1000] Length = 411 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 61/196 (31%), Gaps = 10/196 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI---------IQKLETRNMGLKP 277 PD E P +++ + N+ + ++ Y ++ + + + Sbjct: 11 HCPDGVEYVPLWSVTIWDKKFNSVDKDKQKTTIKYKYYLADELKELVVENGDVKILTTNI 70 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWL 336 + T + + + + I + + I + +A + Sbjct: 71 SNLFTLEKLVSNSLSYGEIVCIPWGGNPIVQYYKGKFITSDNRIATSIDVNKLDNKFLYY 130 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + L + + V + + +PPI+ Q +I +++ T D+L ++ Sbjct: 131 VLINKLDLISSFYRGAGIKHPDMSKVLDIIIPLPPIEVQKEIVRILDTFTKYQDLLNREL 190 Query: 397 EQSIVLLKERRSSFIA 412 E + R + Sbjct: 191 ELRKKQYEYYRDKLLT 206 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 56/408 (13%), Positives = 113/408 (27%), Gaps = 32/408 (7%) Query: 22 PKHWKVVPIKRFT----KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 P + VP+ T K N+ + I Y E D ++ S Sbjct: 13 PDGVEYVPLWSVTIWDKKFNSVDKDKQKTTIKYKYYLADELKELVVENGDVKILTTNISN 72 Query: 78 VSIFAK--------GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + K G+I+ G I S + + + + Sbjct: 73 LFTLEKLVSNSLSYGEIVCIPWGGNP-IVQYYKGKFITSDNRIATSIDVNKLDNKFLYYV 131 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 I+ I + GA + H D + +I +P+PP+ Q I + T + + Sbjct: 132 LINKLDLISSFYRGAGIKHPDMSKVLDIIIPLPPIEVQKEIVRILDTFTK------YQDL 185 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 EL KKQ N D + K G + Sbjct: 186 LNRELELRKKQYEYYRDKLLTFNDDFEWKCLGELLQPKGYIRGPFGSALKKDFFVKDGVP 245 Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + + +++ + + V P +I+ + Sbjct: 246 VYEQQHAIY------NKRVFRYFVDCERADKLKRFTVKPYDIIISCSGTIGKISIIMPED 299 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + GII A + ++ + G +++ ++ + Sbjct: 300 RI--GIINQALLILRLDLSKVNVKYIKHYLECFPNLIVTSSGGAITNIEKREIIEKIKIP 357 Query: 370 PPI-KEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 P+ KEQ I +++ + + + I L K+ R + Sbjct: 358 IPLLKEQERIVKILDQFDTLCNDITRGLPAEIELRKKQYEYYRDKLLT 405 >gi|268596454|ref|ZP_06130621.1| predicted protein [Neisseria gonorrhoeae FA19] gi|268550242|gb|EEZ45261.1| predicted protein [Neisseria gonorrhoeae FA19] Length = 198 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 44/126 (34%), Gaps = 6/126 (4%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 59 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 116 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L +E + L Sbjct: 117 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAELAL 176 Query: 403 LK-ERR 407 K + R Sbjct: 177 RKRQYR 182 >gi|312875770|ref|ZP_07735764.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2053A-b] gi|311088705|gb|EFQ47155.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2053A-b] Length = 181 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 45/147 (30%), Gaps = 3/147 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 I+ G I + E + + G+I+ + + Sbjct: 29 SGIPFYRGKEIIEKHNGISISNKLFISSERYEEIKNKFGVPLEGDILLTSVGTLGIPWLV 88 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKR 364 + + + I +L + + + + +++L E +K+ Sbjct: 89 DKEKFYFKD--GNLTWLRNNELITPRFLYYWLITSQAQNQINSKCIGSTQKALTIEILKK 146 Query: 365 LPVLVPPIKEQFDITNVINVETARIDV 391 + P IK Q IT++I +ID Sbjct: 147 FYITFPDIKTQKKITSIIESIELKIDN 173 Score = 41.7 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 55/180 (30%), Gaps = 11/180 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTS----ESGKDIIYIGLEDVESGTGKYLPKD----GNSRQSD 74 + WK + + +++ + I + +++ + + R + Sbjct: 2 ETWKTMTLSDVCYISSSKRIFAKEYQSSGIPFYRGKEIIEKHNGISISNKLFISSERYEE 61 Query: 75 TSTVSIFA-KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL--PELLQGWLLSI 131 +G IL +G ++ L + L P L WL++ Sbjct: 62 IKNKFGVPLEGDILLTSVGTLGIPWLVDKEKFYFKDGNLTWLRNNELITPRFLYYWLITS 121 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +I + C G+T + + + P + Q I I + ++ID Sbjct: 122 QAQNQINSKCIGSTQKALTIEILKKFYITFPDIKTQKKITSIIESIELKIDNNRKINKNL 181 >gi|317179169|dbj|BAJ56957.1| Type I restriction-modification system specificity subunit [Helicobacter pylori F30] Length = 397 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 55/390 (14%), Positives = 117/390 (30%), Gaps = 18/390 (4%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 ++ + L T +S + YI +++ ++ G K+ N Q + F K + Sbjct: 3 KTLQDYATLIND-TIQSNEINHYITTDNMCQNLGGIDTFKNINIPQGKVRS---FQKDDV 58 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATM 146 L + Y RK A G CS+ LV + K + L L S T + +G+ M Sbjct: 59 LLSNIRLYFRKVYRAKQKGGCSSDVLVFRAKHIDSATLFAILSSQIFTDYACSGSQGSKM 118 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAE--TVRIDTLITERIRFIELLKEKKQALVS 204 + + + +P + +I+ L+ + + + + + Sbjct: 119 PRGNKTHMMDFKIPTINFTIAKIFNSIQNKIENNHKINELLHKILELLYEQYFVRFDFLD 178 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 KMK S E L+P+ +EVK K + +I Sbjct: 179 ENNKPYQTSGGKMKFS-KELNRLIPNDFEVKTLGDNPLCNTIKTGVTPFKQKVYYETKHI 237 Query: 265 IQKLETR---NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR---SAQVMERGIITS 318 + L + + F + L + + E + T Sbjct: 238 QETLSLNQGLKVSYNKRPNRANMQPTIHSVWFAKMKDTKKHLFLNQHMQSWIKESILSTG 297 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + + S + ++++ E + + +L+P ++ Sbjct: 298 FCGLQCQKHTFEYIASTIKYSPFETRKNNLATGATQKAINIEMLDYIFILIPN----KEL 353 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + T + + L R Sbjct: 354 LDNYSKITRPLYEKISNNIIETQTLTALRD 383 >gi|332204894|gb|EGJ18959.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA47901] Length = 191 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 51/172 (29%), Gaps = 6/172 (3%) Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 K+S ++ + + ++++ N L N + Sbjct: 5 HTKNSSLKSKSRFNEMFGDVILNEKEWKVSKWNEILTIRNGKNQKQVEDADGKFPIYGSG 64 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 Y IV ++ N +R + I+S YL + Sbjct: 65 GIMGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEPVLEKINSEYLFY 121 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + Y+ K+ A+ SL D+ + + +PP+ Q + + + Sbjct: 122 FCQLYNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFVAQVDK 170 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 29 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 76 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 77 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 133 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 134 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 164 >gi|34557508|ref|NP_907323.1| type I restriction enzyme S protein [Wolinella succinogenes DSM 1740] gi|34483225|emb|CAE10223.1| PROBABLE TYPE I RESTRICTION ENZYME S PROTEIN [Wolinella succinogenes] Length = 188 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 21/202 (10%), Positives = 51/202 (25%), Gaps = 22/202 (10%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ K+ EW ++ K + S L Sbjct: 5 PKLRFKEFSGEWEEKKISQIFEITRGNVLAVPMMSQEKKDDFQYPVYSSQTKNNGLTGYY 64 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 E T+ + ++ ++G Sbjct: 65 NEYLFEDCITWTTDGANAGDANLRRGKFYCTNVCGVLKSDKGYANQCIAE---------- 114 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDV 391 + + K +G+ L + + + +P I EQ I + ++ +ID Sbjct: 115 ----ILNTITKKYVSYVGN---PKLMNNTMGGIKITIPSSIDEQTKIASFLSAVDTKID- 166 Query: 392 LVEKIEQSIVLLKERRSSFIAA 413 + + + + K + + Sbjct: 167 ---LVTKQLDVSKNFKKGLLQQ 185 >gi|317009084|gb|ADU79664.1| type I restriction enzyme S protein [Helicobacter pylori India7] Length = 419 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 69/178 (38%), Gaps = 13/178 (7%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + I I++ + Sbjct: 18 NNYTKEDNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSI---NSIIYSSVR 74 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 + ++ + ++++A++ + +D YL + + + + G+ Sbjct: 75 PNQRHFGIIK-EIPKNFLVSTAFIVIDVIDLKKLDPNYLYYYITQDKITHYLQRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID---VLVEKIEQSIVLLKER 406 S+ D + V + P++ Q I ++V +I+ + E + + + LL E+ Sbjct: 134 SSYPSITPLDFLNIKVKLYPLETQQKIARTLSVLDQKIENNHKINELLHKILELLYEQ 191 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 49/386 (12%), Positives = 113/386 (29%), Gaps = 24/386 (6%) Query: 43 ESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 ++ K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 24 DNYKKVCYLDTDNITNNRINAFLKIDLTKEKLPSRAKRKCSINSIIYSSVRPNQRHFGII 83 Query: 102 DF---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKG 153 + + ST F+V+ K + P L ++ +T ++ I C ++ Sbjct: 84 KEIPKNFLVSTAFIVIDVIDLKKLDPNYLYYYITQDKITHYLQRIAECGTSSYPSITPLD 143 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 NI + + PL Q I + +I+ ++L+ + N Sbjct: 144 FLNIKVKLYPLETQQKIARTLSVLDQKIENNHKINELLHKILELLYEQYFVRFDFLDENN 203 Query: 214 DVKMKDSGI-----EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 G E L+P+ +EVK K + +I + L Sbjct: 204 KPYQTSGGKMKFSKELNRLIPNDFEVKTLGDNPLCNTIKTGVTPFKQKVYYETKHIQETL 263 Query: 269 ETR---NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS---AQVMERGIITSAYMA 322 + + F + L + + E + T Sbjct: 264 SLNQGLKVSYDKRPNRANMQPAIHSVWFAKMKDTKKHLFLNQRMQSWIKESILSTGFCGL 323 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + S + ++++ E + + +L+P ++ + Sbjct: 324 QCQKNTFEYIASTIKYSPFETRKNNLATGATQKAINIEMLDYIFILIPN----KELLDNY 379 Query: 383 NVETARIDVLVEKIEQSIVLLKERRS 408 + T + + L R Sbjct: 380 SKITKPLYEKISNNIIETQTLTTLRD 405 >gi|240115256|ref|ZP_04729318.1| Type I restriction enzyme EcoprrI specificity protein [Neisseria gonorrhoeae PID18] Length = 208 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 44/126 (34%), Gaps = 6/126 (4%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 68 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 125 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L +E + L Sbjct: 126 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAELAL 185 Query: 403 LK-ERR 407 K + R Sbjct: 186 RKRQYR 191 >gi|240013722|ref|ZP_04720635.1| Type I restriction enzyme EcoprrI specificity protein [Neisseria gonorrhoeae DGI18] gi|240080304|ref|ZP_04724847.1| Type I restriction enzyme EcoprrI specificity protein [Neisseria gonorrhoeae FA19] gi|240120792|ref|ZP_04733754.1| Type I restriction enzyme EcoprrI specificity protein [Neisseria gonorrhoeae PID24-1] gi|240123097|ref|ZP_04736053.1| Type I restriction enzyme EcoprrI specificity protein [Neisseria gonorrhoeae PID332] gi|240127801|ref|ZP_04740462.1| Type I restriction enzyme EcoprrI specificity protein [Neisseria gonorrhoeae SK-93-1035] Length = 207 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 44/126 (34%), Gaps = 6/126 (4%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 68 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 125 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L +E + L Sbjct: 126 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAELAL 185 Query: 403 LK-ERR 407 K + R Sbjct: 186 RKRQYR 191 >gi|189463337|ref|ZP_03012122.1| hypothetical protein BACCOP_04054 [Bacteroides coprocola DSM 17136] gi|189429956|gb|EDU98940.1| hypothetical protein BACCOP_04054 [Bacteroides coprocola DSM 17136] Length = 152 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 14/84 (16%), Positives = 31/84 (36%) Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 G S + ++ + + + + + + L + K + V +PP Sbjct: 69 DGYQGSTFKQLRINENMNEEYVLQVINLHRKILRESKVGSAIPHLNKKIFKAIEVPIPPY 128 Query: 373 KEQFDITNVINVETARIDVLVEKI 396 KEQ I I +D+++E + Sbjct: 129 KEQQKIIKAITKAFMSLDLIMESL 152 >gi|153808174|ref|ZP_01960842.1| hypothetical protein BACCAC_02460 [Bacteroides caccae ATCC 43185] gi|149129077|gb|EDM20293.1| hypothetical protein BACCAC_02460 [Bacteroides caccae ATCC 43185] Length = 147 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 30/69 (43%), Gaps = 2/69 (2%) Query: 330 STYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + ++++ +L + L + K + V +PP KEQ I INV Sbjct: 79 NMNTEYVLQVINLHRKILRENKVGSAIPHLNKKLFKEIEVPIPPYKEQMRIVEAINVTFK 138 Query: 388 RIDVLVEKI 396 +DV++E + Sbjct: 139 HLDVIMESL 147 >gi|299144868|ref|ZP_07037936.1| restriction modification system DNA specificity domain protein [Bacteroides sp. 3_1_23] gi|298515359|gb|EFI39240.1| restriction modification system DNA specificity domain protein [Bacteroides sp. 3_1_23] Length = 202 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 45/118 (38%), Gaps = 6/118 (5%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII----TSAYMAVKPHGIDSTYLA 334 + + + G++ D + + + + A + +D YL Sbjct: 60 NEISKFQLKKGQVALTKDSETRDDIGIPTYIADDFDDVILGYHCALITPNKDILDGRYLN 119 Query: 335 WLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L+ + K F A GSG R +L E + PV + P+++Q I + + +I+ Sbjct: 120 ALLHTDYAKKYFACNASGSGQRYALSVEALNSFPVPMIPLRDQKRIGEIFSALDKKIE 177 >gi|237653812|ref|YP_002890126.1| type I restriction-modification system, endonuclease S subunit [Thauera sp. MZ1T] gi|237625059|gb|ACR01749.1| type I restriction-modification system, endonuclease S subunit [Thauera sp. MZ1T] Length = 141 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 30/99 (30%), Positives = 46/99 (46%), Gaps = 4/99 (4%) Query: 24 HWKVVPIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 WKV + R Y+GLE ++S + K + S +T +F Sbjct: 9 GWKVWRFDQIATNVNERVDNPSESGMEHYVGLEHLDSDSLKI--RRWGSPDDVEATKLVF 66 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 KG I++G+ Y RK +A+FDGICS +VL+ K + Sbjct: 67 RKGDIIFGRRRAYQRKLGVAEFDGICSAHAMVLRAKPDV 105 >gi|317481422|ref|ZP_07940489.1| LOW QUALITY PROTEIN: type I restriction modification DNA specificity domain-containing protein [Bacteroides sp. 4_1_36] gi|316902407|gb|EFV24294.1| LOW QUALITY PROTEIN: type I restriction modification DNA specificity domain-containing protein [Bacteroides sp. 4_1_36] Length = 188 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 56/186 (30%), Gaps = 11/186 (5%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---SYE 281 VP T I + ++ K N E Sbjct: 1 FPKVPFKEIYVRAGEGGTPATSNPEYYDNGTIPFIKIDDLQNKYIKTNKDCITELGLQKS 60 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + IV I++ S + V I+ +L + M S Sbjct: 61 SAWIVPANSIIYS----NGATIGAISINLFPVCTKQGILGVVPKADINVEFLYYFMTSTA 116 Query: 342 LCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE---KIE 397 K + + G ++ +D+ +P VP +Q +I +++ + +++ V K++ Sbjct: 117 FTKAVERIVTEGTMRTAYLKDINHIPCPVPYPVKQDEIAKMLSTLSEKLENEVIFQMKLQ 176 Query: 398 QSIVLL 403 + L Sbjct: 177 KQKEFL 182 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 68/186 (36%), Gaps = 11/186 (5%) Query: 28 VPIKRF-TKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 VP K + G T + I +I ++D+++ K S+ Sbjct: 4 VPFKEIYVRAGEGGTPATSNPEYYDNGTIPFIKIDDLQNKYIKTNKDCITELGLQKSSAW 63 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIE 138 I I+Y G + I F L + PK + E L ++ S T+ +E Sbjct: 64 IVPANSIIYSN-GATIGAISINLFPVCTKQGILGVVPKADINVEFLYYFMTSTAFTKAVE 122 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI-DTLITERIRFIELLKE 197 I TM A K I +IP P+P +Q I + + + ++ + +I + + Sbjct: 123 RIVTEGTMRTAYLKDINHIPCPVPYPVKQDEIAKMLSTLSEKLENEVIFQMKLQKQKEFL 182 Query: 198 KKQALV 203 Q + Sbjct: 183 LSQMFI 188 >gi|111224792|ref|YP_715586.1| putative Type I restriction-modification system, M subunit [Frankia alni ACN14a] gi|111152324|emb|CAJ64058.1| Hypothetical protein; putative Type I restriction-modification system, M subunit [Frankia alni ACN14a] Length = 845 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 19/110 (17%), Positives = 38/110 (34%), Gaps = 3/110 (2%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + + +I+F + +AQ + T+ G+D YL ++ + Sbjct: 705 SRFTLRENDILFVRTGTVGPLARVDAAQQGW-LLGTNLMRLRAHDGVDPAYLLAVLSARA 763 Query: 342 LCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 A + S+ + L + PP+ EQ I V+ +I Sbjct: 764 AQSWIARRAQSATAIPSISTSTLGSLRLPRPPLSEQQRIGAVLTDLDNQI 813 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 64/190 (33%), Gaps = 22/190 (11%) Query: 12 DSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKD---- 67 D GV +G P W VP+K L +G + ++ + L D E G G P+D Sbjct: 632 DPGVTTVGDHPPGWSTVPLKELCDLQSGPSHQTAR-----RLRDTERGLGLVAPRDLVDR 686 Query: 68 ---------GNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQP 116 + Q+D + + IL+ + G A + + T + L+ Sbjct: 687 RVRTDTTRRIHPEQTDGMSRFTLRENDILFVRTGTVGPLARVDAAQQGWLLGTNLMRLRA 746 Query: 117 KDVLPELLQ--GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 D + + + +G++ +P PPL+EQ I + Sbjct: 747 HDGVDPAYLLAVLSARAAQSWIARRAQSATAIPSISTSTLGSLRLPRPPLSEQQRIGAVL 806 Query: 175 IAETVRIDTL 184 +I Sbjct: 807 TDLDNQIIAH 816 >gi|227544654|ref|ZP_03974703.1| restriction modification system DNA specificity domain protein [Lactobacillus reuteri CF48-3A] gi|300909429|ref|ZP_07126890.1| conserved hypothetical protein [Lactobacillus reuteri SD2112] gi|227185379|gb|EEI65450.1| restriction modification system DNA specificity domain protein [Lactobacillus reuteri CF48-3A] gi|300893294|gb|EFK86653.1| conserved hypothetical protein [Lactobacillus reuteri SD2112] Length = 176 Score = 54.4 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 45/125 (36%), Gaps = 3/125 (2%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + ++V + V + S++ A ++ K Sbjct: 39 NGYRHYPSISEAPSRARRLVSKEDTVISTVRPNMKHVGFISSKSDCIYSTGFAVVSPKKD 98 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMG---SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 ID YL + S + +V ++G + S+K D+ L + +PP+ EQ I N I Sbjct: 99 KIDPYYLYLFLSSNRVTEVLQSIGETSTSTYPSVKPSDIGNLVIDMPPLDEQHLIANRIR 158 Query: 384 VETAR 388 + + Sbjct: 159 LIDEK 163 >gi|330723390|gb|AEC45760.1| Type I site-specific DNA methyltransferase specificity subunit [Mycoplasma hyorhinis MCLD] Length = 460 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 44/153 (28%), Gaps = 8/153 (5%) Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + K + + Y + F + + + Sbjct: 15 YIKQNLGKYPVYSSQTENNGIIGYINTYDFDGEFITWTQDGNAGKIFYRNGRFNASNSGI 74 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDI 378 P + L +L + + G ++++ L+P EQ I Sbjct: 75 LTLNFPSKYN---LKFLFLALIFLDLTKLQIGGTVPHFTASMMRKVIFLIPKNKVEQEKI 131 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 +++ +D ++ E+ I LL++ + + Sbjct: 132 SSI----FFTLDKIISLYERKISLLEKIEKALL 160 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 45/331 (13%), Positives = 106/331 (32%), Gaps = 16/331 (4%) Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTV--SIFAKGQILYGKLGPYLRKAIIADFDGICS 108 + + ++ GKY + + + G+ + K + S Sbjct: 11 LTKQYIKQNLGKYPVYSSQTENNGIIGYINTYDFDGEFITWTQDGNAGKIFYRNGRFNAS 70 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 + L + L + + + + + I + + Sbjct: 71 NSGI-----LTLNFPSKYNLKFLFLALIFLDLTKLQIGGTVPHFTASMMRKVIFLIPKNK 125 Query: 169 LIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLV 228 + +EKI + +D +I+ R I LL++ ++AL+ + K ++ G Sbjct: 126 VEQEKISSIFFTLDKIISLYERKISLLEKIEKALLDNMFIKENEEKPSIRFLGFNSDWQS 185 Query: 229 PDHWEVKPFFALVTELNR-KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 + ++ + + T I L+ N + +S E + Sbjct: 186 WTLEDKGYLYSGLNSKTKVDFTNGNSKYITYLNVFNNFNIDLKEKSLVFIKSDEKQNSIV 245 Query: 288 PGEIVFRFIDLQNDKRSLRSA---QVMERGIITSAYMAVKPHGID---STYLAWLMRSYD 341 G+I+F + + SA +V E+ + S + + D + A+L R++ Sbjct: 246 KGDILFTMSSETYQEVGMSSAVTEEVNEKIYLNSFCFGYRLNKADFLFPNFSAFLFRNHS 305 Query: 342 LCK--VFYAMGSGLRQSLKFEDVKRLPVLVP 370 + + + G R +L + L + P Sbjct: 306 VRHKIILQSNGGTSRFNLSKKSFLNLKIKSP 336 >gi|185178826|ref|ZP_02964614.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 5 str. ATCC 27817] gi|188524185|ref|ZP_03004249.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 12 str. ATCC 33696] gi|184209461|gb|EDU06504.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 5 str. ATCC 27817] gi|195660154|gb|EDX53534.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 12 str. ATCC 33696] Length = 344 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 47/384 (12%), Positives = 103/384 (26%), Gaps = 53/384 (13%) Query: 28 VPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + +K G T S + I +G Y+ ++ Sbjct: 3 IKLKDIIYAKRGSTITSNEFKINPGSYPLISASAQNNGVFGYINS------------YMY 50 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ-RIEAI 140 G I G D S ++ + + + +I+++ Sbjct: 51 EGGHITISMNGNAGCVFYQKDKFSANSDVLVLSNIDNKISNNKFIFYWLKKHENTKIKSL 110 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 C+G T + N+ + +PP+ EQ I I + + + + +LL Sbjct: 111 CKGTTRLRLSNDDVLNLEINLPPIEEQNAIISIIEPLDILENKINKLKTVLKKLLINIYD 170 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + F K S Sbjct: 171 K----------------------------NCNSHVNLFENNKIYTNKYLNQNLYCDTSCI 202 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 I + N+ L+ + + I+F + +N E + ++ + Sbjct: 203 GELEINFSKMINISLEDKPSRADLSIKNNSIIFSKLLGENKVYC---FLNNENIVFSTGF 259 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT 379 +K + ++ L + S D + +G + D+ ++ P + +I Sbjct: 260 FNIKSNDENNDDLLSFLLSSDFKNQKSMLANGTTMIGINNSDLTKVRCKAPFLN--SNIY 317 Query: 380 NVINVETARIDVLVEKIEQSIVLL 403 + I+ + IV L Sbjct: 318 FTFFNKLNEIENKITLARNKIVNL 341 >gi|307246330|ref|ZP_07528408.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306852740|gb|EFM84967.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 148 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 31/99 (31%), Gaps = 3/99 (3%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 F I Q + + A + D+ + + + +L + + Sbjct: 52 FPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYFLIQLNLNQY---ATAT 108 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + L + + + +PP+ EQ I I I+ Sbjct: 109 AQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQ 147 >gi|323158213|gb|EFZ44305.1| type I restriction enzyme specificity domain protein [Escherichia coli E128010] gi|323939695|gb|EGB35899.1| type I restriction enzyme [Escherichia coli E482] Length = 80 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 10/61 (16%), Positives = 26/61 (42%) Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 A + +K + +L + +P EQ I +++ + + E + + I L ++ Sbjct: 1 MQAATGSTVKGIKGSRLHQLKIPIPSKVEQDRIVAILDKFDTLTNSITEGLPREIELRQK 60 Query: 406 R 406 + Sbjct: 61 Q 61 >gi|256028780|ref|ZP_05442614.1| restriction endonuclease S subunits [Fusobacterium sp. D11] gi|289766684|ref|ZP_06526062.1| restriction endonuclease S [Fusobacterium sp. D11] gi|289718239|gb|EFD82251.1| restriction endonuclease S [Fusobacterium sp. D11] Length = 193 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 60/186 (32%), Gaps = 7/186 (3%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 ++ K + ++ N+ K E ++ +K Y Sbjct: 14 ENGIEKRLDDIADITMGQSPLSQSYNLGKKGLPFYQGKTEFGDIYIKEPII--YCNSPIK 71 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 + I + ++ I +++ ID YL +L++ K+ Sbjct: 72 IVEKNDILMSVRAPVGDVNIATQKSCIGRGLASIRAKKIDYLYLFYLLKEQK-IKIEKMG 130 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +++ ++ L + + + +Q I + I+ L +IE+SI + +S Sbjct: 131 VGSTFKAINKNNISSLQIPIIEMSKQNRIKKYL----LLIEKLSFEIEKSIKEAENLYNS 186 Query: 410 FIAAAV 415 + Sbjct: 187 LMNKYF 192 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 52/179 (29%), Gaps = 4/179 (2%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS-RQSDTSTVSIFAKGQ 85 + + G++ S + G ++ S + I K Sbjct: 18 EKRLDDIADITMGQSPLSQSYNLGKKGLPFYQGKTEFGDIYIKEPIIYCNSPIKIVEKND 77 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL P IA ++ K + L + L + +IE + G+T Sbjct: 78 ILMSVRAPV-GDVNIATQKSCIGRGLASIRAKKID--YLYLFYLLKEQKIKIEKMGVGST 134 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I ++ +PI +++Q I++ ++ + L Sbjct: 135 FKAINKNNISSLQIPIIEMSKQNRIKKYLLLIEKLSFEIEKSIKEAENLYNSLMNKYFE 193 >gi|148642218|ref|YP_001272731.1| type I restriction-modification system methylase, subunit S [Methanobrevibacter smithii ATCC 35061] gi|148551235|gb|ABQ86363.1| type I restriction-modification system methylase, subunit S [Methanobrevibacter smithii ATCC 35061] Length = 199 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 45/128 (35%), Gaps = 7/128 (5%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + I S ++ ++ + ++ ++ +L+++ +L K Sbjct: 5 YISIVKDGSGVGNISFHEKNTSVVNTSQYILPKENLNIHFIFYLLQTINLNKY---KTGS 61 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + F+D V +P EQ I + +D +E ++ + + + + + Sbjct: 62 TIPHIYFKDYSIEKVKIPKYDEQKKIG----ILLKNLDAKIEILDNKLQMCQNFKKYLMQ 117 Query: 413 AAVTGQID 420 T ++ Sbjct: 118 QIFTQKLR 125 >gi|239621713|ref|ZP_04664744.1| type I restriction-modification system [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239515588|gb|EEQ55455.1| type I restriction-modification system [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 151 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 52/149 (34%), Gaps = 10/149 (6%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLRSAQVM 311 + ++ + Y +I+ T N L+ + I G++++ + Sbjct: 5 DPDLPQVEYEDIVSDEGTLNKDLRDKEGGKTGIKFYAGDVLYGKLRPYLMN----WLYPQ 60 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP- 370 G+ + ++ DS++L L+++ ++ + + + VP Sbjct: 61 FNGVAVGDFWVLRATECDSSFLYRLVQTDSFQRLANVSSGSKMPRADWNLISQSFFAVPA 120 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQS 399 EQ I + A +D L+ ++ Sbjct: 121 DYAEQRVIAKSL----AELDDLITLHQRK 145 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 30/139 (21%), Positives = 50/139 (35%), Gaps = 2/139 (1%) Query: 43 ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 S D+ + ED+ S G L KD ++ + + F G +LYGKL PYL + Sbjct: 3 SSDPDLPQVEYEDIVSDEGT-LNKDLRDKEGGKTGIK-FYAGDVLYGKLRPYLMNWLYPQ 60 Query: 103 FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIP 162 F+G+ F VL+ + L + + + + Sbjct: 61 FNGVAVGDFWVLRATECDSSFLYRLVQTDSFQRLANVSSGSKMPRADWNLISQSFFAVPA 120 Query: 163 PLAEQVLIREKIIAETVRI 181 AEQ +I + + I Sbjct: 121 DYAEQRVIAKSLAELDDLI 139 >gi|319939014|ref|ZP_08013378.1| type IC HsdS subunit [Streptococcus anginosus 1_2_62CV] gi|319812064|gb|EFW08330.1| type IC HsdS subunit [Streptococcus anginosus 1_2_62CV] Length = 153 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 17/145 (11%), Positives = 55/145 (37%), Gaps = 12/145 (8%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR-SAQVMERGIITSAYM 321 + + E + + + + Y ++ GE+ + + + K + Q ++ Y Sbjct: 13 GWLDQRERFSANIAGKEQKNYTLLRQGELSYNHGNSKLAKYGVVFELQSYSEALVPKVYH 72 Query: 322 AVKPHGIDSTYL-AWLMRSYDLCKVF-YAMGSGLRQ----SLKFEDVKRLPVLVPPI-KE 374 + + +S ++ + + + SG R ++ +++ + +L+P + E Sbjct: 73 SFRMINDNSATFIEYMFATKIPDRELGKLISSGARMDGLLNINYDEFMGIRILIPTLASE 132 Query: 375 QFDITNVINVETARIDVLVEKIEQS 399 Q I + + +D + ++ Sbjct: 133 QTAIGDF----FSTLDRSIALHQRE 153 >gi|198277088|ref|ZP_03209619.1| hypothetical protein BACPLE_03296 [Bacteroides plebeius DSM 17135] gi|198269586|gb|EDY93856.1| hypothetical protein BACPLE_03296 [Bacteroides plebeius DSM 17135] Length = 140 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 39/112 (34%), Gaps = 2/112 (1%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + IV+ G+ V+ ++ S V + G + S + + Sbjct: 28 KSSATIVEKGKFVYARDNIILVDGENSGEVFTVPQDGYMGSTFKQLWLSSAMWKPYILAF 87 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + ++ + L E LP+ +PP+ EQ I+ IN + + Sbjct: 88 ILFYKEELRNSKRGAAIPHLNKELFYNLPIGIPPLAEQQRISERINELSQLL 139 >gi|262039559|ref|ZP_06012858.1| putative type I restriction enzyme specificity protein [Leptotrichia goodfellowii F0264] gi|261746437|gb|EEY33977.1| putative type I restriction enzyme specificity protein [Leptotrichia goodfellowii F0264] Length = 106 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 12/90 (13%), Positives = 44/90 (48%), Gaps = 2/90 (2%) Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 A + + + Y+ ++++S + + + + ++L E++++ L+P +K Q Sbjct: 2 ALIRINTNVALPKYIIYVLQSNEFKNSQINKWLEASSMKNLTMENIRKFKFLLPSLKVQE 61 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKER 406 I ++++ ++ + + + I L +++ Sbjct: 62 YIVSILDKFDTLVNDIKNGLPKEIELRQKQ 91 >gi|312963115|ref|ZP_07777600.1| hypothetical protein PFWH6_5037 [Pseudomonas fluorescens WH6] gi|311282626|gb|EFQ61222.1| hypothetical protein PFWH6_5037 [Pseudomonas fluorescens WH6] Length = 203 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 44/121 (36%), Gaps = 5/121 (4%) Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++ PG+I N+K L + + + K + YL WL+ Sbjct: 76 PLLQPGDITVIARG-DNNKAVLYTGEQSVVATSQFFIVTAKRAEVLPAYLCWLINLPQSQ 134 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD-IT--NVINVETARIDVLVEKIEQSI 400 + GS ++ + + + + +PP+ Q I V + E I+ L EQ + Sbjct: 135 RSLERSGSAIQA-IGKASLMDMQIPLPPLATQQKLIALQTVWDEEDELIERLQTNREQML 193 Query: 401 V 401 Sbjct: 194 Q 194 Score = 43.2 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 57/186 (30%), Gaps = 11/186 (5%) Query: 29 PIKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + + +G T + D+ + ++D+ + Sbjct: 20 KLSELADVRSGYTFRGALEHDPSGDVRVLQIKDLRQNAAIEPDTLTAVTWDARIAPPLLQ 79 Query: 83 KGQILYGKLGPYLRKAIIADFDGIC-STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G I G + + + ++QF ++ K L + Sbjct: 80 PGDITVIARGDNNKAVLYTGEQSVVATSQFFIVTAKRAEVLPAYLCWLINLPQSQRSLER 139 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ + + ++ +P+PPLA Q +K+IA D R ++ Q Sbjct: 140 SGSAIQAIGKASLMDMQIPLPPLATQ----QKLIALQTVWDEEDELIERLQTNREQMLQG 195 Query: 202 LVSYIV 207 + +++ Sbjct: 196 IYQHLI 201 >gi|186701729|ref|ZP_02971420.1| reStriction-modification enzyme mpuuiii s subunit [Ureaplasma parvum serovar 6 str. ATCC 27818] gi|186700996|gb|EDU19278.1| reStriction-modification enzyme mpuuiii s subunit [Ureaplasma parvum serovar 6 str. ATCC 27818] Length = 361 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 41/372 (11%), Positives = 103/372 (27%), Gaps = 46/372 (12%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD 104 K++ + D+ + + SR + +L+ + Sbjct: 25 KKELPFYSPTDLIN--------NVASRYISIKNNNFINGPAVLFSSAATIGNVYFVDKKC 76 Query: 105 GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE-AICEGATMSHADWKGIGNIPMPIPP 163 + + + + + I+ +G+ S GN+ + +P Sbjct: 77 WFNQQIKAFITKDPNILSNKYLYYWFLKNREIIKVGANKGSIFSSITTDEFGNMKINLPS 136 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE 223 + EQ I I I+ + +I+ L+ + L S + + I Sbjct: 137 IEEQNEIISIIEPIEKVINNIKNVKIKIESLVNKYFDFLYSDLKDSNFKKYILGDLFTI- 195 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 I S N I + K Y Sbjct: 196 ---------------------------NRGQIINSKYIDNNIGPYPVISSNTKNNGIFGY 228 Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG----IDSTYLAWLMRS 339 + F I Q + I ++ +K ++ ++ ++++ Sbjct: 229 INSYMYDGEFITISADGAYAGTVFLQNGKFSITNVCFILIKNKYIDFKFNNKFVYYILKK 288 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF---DITNVINVETARIDVLVEKI 396 + R +++ +K + + +P ++ Q I + + + + + + + Sbjct: 289 EQEINRLKSQVGSSRPAVREYSLKEIKINLPNMEIQEEFSKIVEPLLNLSTKANKIEKIL 348 Query: 397 EQSIVLLKERRS 408 S LLK + Sbjct: 349 NDS--LLKITKK 358 >gi|282882639|ref|ZP_06291250.1| type I R-M system S protein [Peptoniphilus lacrimalis 315-B] gi|281297515|gb|EFA90000.1| type I R-M system S protein [Peptoniphilus lacrimalis 315-B] Length = 175 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 19/160 (11%), Positives = 57/160 (35%), Gaps = 6/160 (3%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 T +++ + + N I +++ N + V I++ + Sbjct: 14 LTNQSTYSPKEDWRFVNYLDTGNITMNRIDEIQYINTSTDKLPSRARRKVKLNSIIYSTV 73 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGS--- 351 + + E ++++ ++ + Y+ +++ ++ + A+ Sbjct: 74 RPNQLHYGIIK-EQPENFLVSTGFVVIDVDFEKAVPDYIYYVLTQQEITEHLQAIAEQSM 132 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 S+K D++ L +L+P K Q I +++ +I Sbjct: 133 STYPSIKPSDIENLELLLPDRKTQEKIVTILSSIDEKIKQ 172 >gi|296125964|ref|YP_003633216.1| hypothetical protein Bmur_0920 [Brachyspira murdochii DSM 12563] gi|296017780|gb|ADG71017.1| conserved hypothetical protein [Brachyspira murdochii DSM 12563] Length = 460 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/379 (9%), Positives = 105/379 (27%), Gaps = 23/379 (6%) Query: 49 IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS 108 Y+ +++E+ + + + S +I K+G + + + + Sbjct: 74 YYLRTKELENNDFENDVLYVSESAYNFLEKSKLRGFEIAINKVGSPGNVYQVPNLNIPMT 133 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT----------------MSHADWK 152 + + + + + + T + + Sbjct: 134 LGMNLFSIVPINNINCHYLYIYLSSYYGQLFLHQRVTGAVPPSIDKESVRKVPVPIFSDE 193 Query: 153 GIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY---IVTK 209 +I + Q ++ E +I + K+ ++V+Y ++ K Sbjct: 194 FQKSIEKLVLEAHNQRQKSNSLMKEANQILEKEIGFDKLEIKKKKVNYSIVNYSETLLAK 253 Query: 210 GLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE 269 ++ + + I + L + N+ + + NI Sbjct: 254 RIDAEYYQEKYKIIMDKIQSYKNGCIKIIDLNSINNKLVSIDKNKKYEYIELSNIDSMGF 313 Query: 270 TRNMGLKPESY---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 N+ L ++++ +++ ++ DK SL T + + Sbjct: 314 INNLELYYGYELPSRARRLLNNNDVIISSVEGSLDKSSLIYNNKNNLLCSTGFLVFNENE 373 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I+ L R + ++ + G + + + + Q I + Sbjct: 374 FINPETLFCFFRLTLIKELLKKITKGTILTAFDSNAICDIEIPNLDKNVQNIIAEKVQEA 433 Query: 386 TARIDVLVEKIEQSIVLLK 404 D +E++ ++ Sbjct: 434 YKARDKAKALLEEAKKKVE 452 >gi|118475553|ref|YP_892159.1| restriction modification system DNA specificity subunit [Campylobacter fetus subsp. fetus 82-40] gi|118414779|gb|ABK83199.1| restriction modification system DNA specificity domain [Campylobacter fetus subsp. fetus 82-40] Length = 195 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 66/177 (37%), Gaps = 6/177 (3%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 ++ + + L N I + + + E + +V G+I+ R Sbjct: 17 LNRKKASMSEISKFYYDVVSLKSFNENGIYEHIFADKFISNEQIKEDYLVKQGDILLR-- 74 Query: 297 DLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGL 353 L+ ++ + + I +S + + + +D+ +L + + S + K + + Sbjct: 75 -LREPNFAIYIDKEYKNLIYSSLVVRIKLYDNRLDANFLTYYLNSNIVKKALHCEVSGTT 133 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +K D+ + + + + +Q +I + + ++L I+Q KE + Sbjct: 134 IPMIKVSDINDIRIPIINLDKQKNIAKYLKLAYQGNELLRNLIDQKQKYSKEIFETL 190 >gi|315609159|ref|ZP_07884127.1| type I restriction-modification enzyme [Prevotella buccae ATCC 33574] gi|315249155|gb|EFU29176.1| type I restriction-modification enzyme [Prevotella buccae ATCC 33574] Length = 160 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 30/96 (31%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I L + + S V G + S + + + + + Sbjct: 65 NSIILVDGENSGEVFTVPHDGYMGSTFKQLWVSCSMHLPYVLYFIQFYKDLLRNSKKGAA 124 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 L E L + +PP +EQ I N I AR+ Sbjct: 125 IPHLNKEIFYSLIIGIPPFQEQKRIANAIEELYARL 160 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 50/160 (31%), Gaps = 13/160 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P W+VV + +L G + I + + + G S + Sbjct: 14 PSTWEVVRLSHICRLIDGE--KKEGQYICLDAKYLR----------GKSTGTYLDKGKFV 61 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 AKG + G + DG + F L + L + Sbjct: 62 AKGNSIILVDGENSGEVFTVPHDGYMGSTFKQLWVSCSMH-LPYVLYFIQFYKDLLRNSK 120 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 +GA + H + + ++ + IPP EQ I I R+ Sbjct: 121 KGAAIPHLNKEIFYSLIIGIPPFQEQKRIANAIEELYARL 160 >gi|281424441|ref|ZP_06255354.1| type I restriction-modification enzyme [Prevotella oris F0302] gi|281401440|gb|EFB32271.1| type I restriction-modification enzyme [Prevotella oris F0302] Length = 147 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 23/65 (35%), Gaps = 2/65 (3%) Query: 334 AWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 +++ +L + L + K + V +PP EQ I I +D Sbjct: 83 KYILNVINLHRKALRENKVGSAIPHLNKKLFKAISVPLPPYNEQIRIVEAIKSTFNLLDT 142 Query: 392 LVEKI 396 L E + Sbjct: 143 LKENL 147 >gi|51598168|ref|YP_072359.1| hypothetical protein YPTB3883 [Yersinia pseudotuberculosis IP 32953] gi|186897392|ref|YP_001874504.1| hypothetical protein YPTS_4101 [Yersinia pseudotuberculosis PB1/+] gi|51591450|emb|CAH23121.1| hypothetical [Yersinia pseudotuberculosis IP 32953] gi|186700418|gb|ACC91047.1| conserved hypothetical protein [Yersinia pseudotuberculosis PB1/+] Length = 192 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 47/122 (38%), Gaps = 8/122 (6%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 + + + I++ + + I YL WL+ + + F+ G+ + Sbjct: 77 NRNLAVVYRGEVPVVATSQFLIVS---LRRQEREIVPEYLCWLLNHPMIQQWFHRSGTNI 133 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + + + + VPP++ Q + + + D L+ K++++ L+ + Sbjct: 134 QL-ITKSALLDVAIPVPPLETQLQLIE-LQRVWQKEDELINKLQKNRHQLEL---GILQK 188 Query: 414 AV 415 + Sbjct: 189 LL 190 >gi|110639721|ref|YP_679931.1| type I site-specific deoxyribonuclease S subunit [Cytophaga hutchinsonii ATCC 33406] gi|110282402|gb|ABG60588.1| type I site-specific deoxyribonuclease S subunit [Cytophaga hutchinsonii ATCC 33406] Length = 303 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 44/318 (13%), Positives = 91/318 (28%), Gaps = 31/318 (9%) Query: 11 KDSGVQWI--GAIPKHWKVVPIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPK 66 K++ V + + W+ + ++ G+ ++ K+ + GL G G Sbjct: 10 KNTNVPNLRFPEFDEEWEEKTLGEICEMQAGKFVSASEIKEQHFDGLFPCYGGNGLRGYT 69 Query: 67 DGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 + S L G+ G A+ + +V+ P + + + Sbjct: 70 KSYNYDGKYS----------LIGRQGALCGNVNFANGKFHATEHAVVVTPLNGINTVWMF 119 Query: 127 WLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLI 185 +LL+ + G + + + IP + EQ I + RI T Sbjct: 120 YLLTNL---NLNQFATGMAQPGLSVQNLEKVESTIPKAIDEQEKIASFLTLIDGRISTQN 176 Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 L Q + S + + + + I+ + + E Sbjct: 177 KIIEELKLLKIVVSQKIFSRQLRLKDDKGKEFSNWEIKKLEE-------------ICEKK 223 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + Y + + E + IV G V R L Sbjct: 224 SSSISANKIENNFGEYLIYGASGILKKVDFYEEENDYVSIVKDGAGVGRLFYCNGRSSVL 283 Query: 306 RSAQVMERGIITSAYMAV 323 + +++ TSAY Sbjct: 284 GTMDIVKPKDTTSAYFYF 301 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 53/143 (37%), Gaps = 8/143 (5%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 Y + + I Q + + A + +GI++ ++ +L+ + + Sbjct: 67 GYTKSYNYDGKYSLIGRQGALCGNVNFANGKFHATEHAVVVTPLNGINTVWMFYLLTNLN 126 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVETARIDVLVEKIEQSI 400 L + M + L ++++++ +P I EQ I + + ID + + I Sbjct: 127 LNQFATGMA---QPGLSVQNLEKVESTIPKAIDEQEKIASFL----TLIDGRISTQNKII 179 Query: 401 VLLKERRSSFIAAAVTGQIDLRG 423 LK + + Q+ L+ Sbjct: 180 EELKLLKIVVSQKIFSRQLRLKD 202 >gi|260579028|ref|ZP_05846929.1| EcoA family type I restriction-modification system, S subunit [Corynebacterium jeikeium ATCC 43734] gi|258602842|gb|EEW16118.1| EcoA family type I restriction-modification system, S subunit [Corynebacterium jeikeium ATCC 43734] Length = 201 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 35/92 (38%), Gaps = 7/92 (7%) Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 D +L + S + + +G +++ V L + P EQ I Sbjct: 38 DMRWLTYHFSSEPGSRELRDLATGTSGSMKNIPKNKVLNLVIPTPSPLEQQ----AIADA 93 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 A D L+E +++ I+ + + + ++G Sbjct: 94 IADADGLIESLKRLILKKQAIKQGMMQQLLSG 125 >gi|154492480|ref|ZP_02032106.1| hypothetical protein PARMER_02114 [Parabacteroides merdae ATCC 43184] gi|254881865|ref|ZP_05254575.1| restriction modification system DNA specificity subunit [Bacteroides sp. 4_3_47FAA] gi|154087705|gb|EDN86750.1| hypothetical protein PARMER_02114 [Parabacteroides merdae ATCC 43184] gi|254834658|gb|EET14967.1| restriction modification system DNA specificity subunit [Bacteroides sp. 4_3_47FAA] Length = 171 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 4/153 (2%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 F+ + + I + G + + T G I+ Sbjct: 8 FSFMEQWKEYKLGDISNMKYGKLPPKQNNGSYPIWSGYRNVGFATTYNCRKGTIIVVARG 67 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL 357 + E +T+ +AV+ L + + Y L + Y + + Sbjct: 68 VGGTG---DVKISSEDCFLTNLSIAVELDNKICEPLYFYYK-YKLSNLRYLDTGSAQSQI 123 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 +D+KRL + +PP++EQ IT +++ +I+ Sbjct: 124 TIDDLKRLSLKLPPLEEQKRITEILSSIDYKIE 156 Score = 41.7 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 53/175 (30%), Gaps = 17/175 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + WK + + + G+ + Y P R +T Sbjct: 12 EQWKEYKLGDISNMKYGKLPPKQNNGSY--------------PIWSGYRNVGFATTYNCR 57 Query: 83 KGQILYGKLGPYL-RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 KG I+ G I+ D + + ++ + + E L + + + Sbjct: 58 KGTIIVVARGVGGTGDVKISSEDCFLTNLSIAVELDNKICEPLYFYYKYKL--SNLRYLD 115 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 G+ S + + + +PPL EQ I E + + +I+ + Sbjct: 116 TGSAQSQITIDDLKRLSLKLPPLEEQKRITEILSSIDYKIELNRRINDNLMPTYY 170 >gi|317163873|gb|ADV07414.1| hypothetical protein NGTW08_0442 [Neisseria gonorrhoeae TCDC-NG08107] Length = 212 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 42/127 (33%), Gaps = 5/127 (3%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 68 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 125 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L +E ++ Sbjct: 126 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEATLEA 185 Query: 403 LKERRSS 409 R Sbjct: 186 ELALRKR 192 >gi|237750239|ref|ZP_04580719.1| LOW QUALITY PROTEIN: restriction modification system DNA specificity subunit [Helicobacter bilis ATCC 43879] gi|229374133|gb|EEO24524.1| LOW QUALITY PROTEIN: restriction modification system DNA specificity subunit [Helicobacter bilis ATCC 43879] Length = 127 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 17/130 (13%), Positives = 41/130 (31%), Gaps = 10/130 (7%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + G +V + SL + + V P+ S L +++ ++ Sbjct: 7 LPKGSVVIAITGATLGQVSLLEIDS----CANQSVVGVIPNDDFSNEFLCLWIKFNIDEI 62 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE 405 G +Q + D+ ++ P + ++ +V + + I L+ Sbjct: 63 ILNQTGGAQQHINKNDIANYHIIKPDKE------SLASVNLKTYFEKISHNAKQIENLQA 116 Query: 406 RRSSFIAAAV 415 R + A Sbjct: 117 MRDILLKAIF 126 Score = 36.3 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 17/129 (13%), Positives = 38/129 (29%), Gaps = 3/129 (2%) Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S KG ++ G L + + + D + + + P D + ++ Sbjct: 1 KSNTKPLPKGSVVIAITGATLGQVSLLEIDSCANQSVVGVIPNDDFSNEFLCLWIKFNID 60 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + I G H + I N I ++ L + +I + + Sbjct: 61 EIILN-QTGGAQQHINKNDIANY--HIIKPDKESLASVNLKTYFEKISHNAKQIENLQAM 117 Query: 195 LKEKKQALV 203 +A+ Sbjct: 118 RDILLKAIF 126 >gi|108562862|ref|YP_627178.1| type I restriction enzyme S protein [Helicobacter pylori HPAG1] gi|107836635|gb|ABF84504.1| type I restriction enzyme S protein [Helicobacter pylori HPAG1] Length = 419 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 69/178 (38%), Gaps = 13/178 (7%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + I I++ + Sbjct: 18 NNYTKEDNYKKVYYLDTDNITNNKINAFLKIDLTKEKLPSRAKRKCSI---NSIIYSSVR 74 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 + ++ + ++++A++ + +D YL + + + + G+ Sbjct: 75 PNQRHFGIIK-EIPKNFLVSTAFIVIDIIDLKKLDPNYLYYYITQDKITHYLQRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID---VLVEKIEQSIVLLKER 406 S+ D + + + P++ Q I ++V +I+ + E + + + LL E+ Sbjct: 134 SSYPSITPLDFLNIKIKLYPLETQQKIARTLSVLDQKIENNHKINELLHKILELLYEQ 191 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 49/386 (12%), Positives = 113/386 (29%), Gaps = 24/386 (6%) Query: 43 ESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIA 101 ++ K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 24 DNYKKVYYLDTDNITNNKINAFLKIDLTKEKLPSRAKRKCSINSIIYSSVRPNQRHFGII 83 Query: 102 DF---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKG 153 + + ST F+V+ K + P L ++ +T ++ I C ++ Sbjct: 84 KEIPKNFLVSTAFIVIDIIDLKKLDPNYLYYYITQDKITHYLQRIAECGTSSYPSITPLD 143 Query: 154 IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNP 213 NI + + PL Q I + +I+ ++L+ + N Sbjct: 144 FLNIKIKLYPLETQQKIARTLSVLDQKIENNHKINELLHKILELLYEQYFVRFDFLDENN 203 Query: 214 DVKMKDSGI-----EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 G E L+P+ +EVK K + +I + L Sbjct: 204 KPYQTSGGKMKFSKELNRLIPNDFEVKTLGDNPLCNTIKTGVTPFKQKVYYETKHIQETL 263 Query: 269 ETR---NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS---AQVMERGIITSAYMA 322 + + F + L + + E + T Sbjct: 264 SLNQGLKVSYNKRPNRANMQPTIYSVWFAKMKDTKKHLFLNQHMQSWIKESILSTGFCGL 323 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + S + ++++ E + + +L+P ++ + Sbjct: 324 QCQKHTFEYIASTIKYSPFETRKNNLATGATQKAINIEMLDYIFILIPN----KELLDNY 379 Query: 383 NVETARIDVLVEKIEQSIVLLKERRS 408 + T + + L R Sbjct: 380 SKITKPLYEKISNNIIEAQTLTALRD 405 >gi|84489266|ref|YP_447498.1| hypothetical protein Msp_0455 [Methanosphaera stadtmanae DSM 3091] gi|84372585|gb|ABC56855.1| conserved hypothetical protein [Methanosphaera stadtmanae DSM 3091] Length = 180 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 57/158 (36%), Gaps = 6/158 (3%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 +T ++K N Q + + + + + +I GEI+ Sbjct: 28 NKITMGQSPSSKYYTKNQNDTILVQGNQDIANNYVIPRIYTSKITKIAKKGEILLTVRAP 87 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 D + + RG+ + +KP +L + + + +S+ Sbjct: 88 VGDIVITQYDVCIGRGVCS-----IKPSISTGFMFFYLAKLNSKNQWNKYIQGSTFESIN 142 Query: 359 FEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEK 395 +D+K + + +P KEQ I N + +I+++ +K Sbjct: 143 SKDIKSMKIKIPKSSKEQEKIANFLTCIDQKIELMEKK 180 Score = 43.2 bits (100), Expect = 0.088, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 51/156 (32%), Gaps = 3/156 (1%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + K+ G++ S + G R + I KG+IL Sbjct: 22 KKLSQINKITMGQSPSSKYYTKNQNDTILVQGNQDIANNYVIPRIYTSKITKIAKKGEIL 81 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 P I +D ++P + + +L ++ + +G+T Sbjct: 82 LTVRAPVGDIV-ITQYDVCIGRGVCSIKPS-ISTGFMFFYLAKLNSKNQWNKYIQGSTFE 139 Query: 148 HADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRID 182 + K I ++ + IP EQ I + +I+ Sbjct: 140 SINSKDIKSMKIKIPKSSKEQEKIANFLTCIDQKIE 175 >gi|291457407|ref|ZP_06596797.1| type I restriction-modification system specificity determinant [Bifidobacterium breve DSM 20213] gi|291381242|gb|EFE88760.1| type I restriction-modification system specificity determinant [Bifidobacterium breve DSM 20213] Length = 248 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 18/205 (8%), Positives = 56/205 (27%), Gaps = 21/205 (10%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E + + + ++ + +S +++Q R + + Sbjct: 43 ETIASRYCNDRNSRLRDICYQVADHVDYDNANQETYVSTESLMQNKGGRQLASSLPTTGK 102 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYD 341 G+ + I K + G + + + + + +R Sbjct: 103 ITRYKAGDTLISNIRPYFKKIWYAPFE----GTCSGDVIVFRANDPSNAPYLHACLRQDS 158 Query: 342 LCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET-ARIDVLVEK---I 396 G + + V + + + +D +++ Sbjct: 159 FFDYVMQGAKGTKMPRGDKKQMMEFKV-----------ASSCSTKDLILLDSAIKQRSDN 207 Query: 397 EQSIVLLKERRSSFIAAAVTGQIDL 421 + V L+ R + + ++G+ID+ Sbjct: 208 DSETVKLQALRDTLLPKLMSGEIDV 232 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 34/131 (25%), Gaps = 5/131 (3%) Query: 29 PIKRFTKLNTGRT-SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 ++ ++ Y+ E + G + G L Sbjct: 56 RLRDICYQVADHVDYDNANQETYVSTESLMQNKGGRQLASSLPTTGKITRYK---AGDTL 112 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICEGATM 146 + PY +K A F+G CS +V + D + +G M Sbjct: 113 ISNIRPYFKKIWYAPFEGTCSGDVIVFRANDPSNAPYLHACLRQDSFFDYVMQGAKGTKM 172 Query: 147 SHADWKGIGNI 157 D K + Sbjct: 173 PRGDKKQMMEF 183 >gi|260664496|ref|ZP_05865348.1| type-1 restriction enzyme MjaXIP specificity protein [Lactobacillus jensenii SJ-7A-US] gi|260561561|gb|EEX27533.1| type-1 restriction enzyme MjaXIP specificity protein [Lactobacillus jensenii SJ-7A-US] Length = 177 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 55/171 (32%), Gaps = 8/171 (4%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 KN NI L+ ++ K + K S + + + I L + Sbjct: 12 KNKTFYGGNIPFLTISDLNNKKIYK--TQKTLSKKGLENSSAKLVPAGSISLAMYASVGK 69 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + + + A+ + + +++ ++ + + +G + +L + ++ Sbjct: 70 IGILSKEMATSQAFFNMTFDDDEKRDFIYIILEKANFDKEWIRLISTGTQNNLNAKKIRN 129 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ P +N ID ++ + IV + + + Sbjct: 130 FHIVFPTY----KALKGLNKLFCNIDTDIDIQYKVIVTTNQLKQFLLQNLF 176 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 58/176 (32%), Gaps = 10/176 (5%) Query: 38 TGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKL 91 +G T G +I ++ + D+ + K + + + S+ + G I Sbjct: 5 SGGTPSVKNKTFYGGNIPFLTISDLNNKKIYKTQKTLSKKGLENSSAKLVPAGSISLAMY 64 Query: 92 GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADW 151 + I++ F + D + + L + + + T ++ + Sbjct: 65 ASVGKIGILSKEMATSQAFFNMTFDDDEKRDFIYIILEKANFDKEWIRLISTGTQNNLNA 124 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 K I N + P + + IDT I + + I + KQ L+ + Sbjct: 125 KKIRNFHIVFPT----YKALKGLNKLFCNIDTDIDIQYKVIVTTNQLKQFLLQNLF 176 >gi|307243972|ref|ZP_07526093.1| conserved domain protein [Peptostreptococcus stomatis DSM 17678] gi|306492622|gb|EFM64654.1| conserved domain protein [Peptostreptococcus stomatis DSM 17678] Length = 173 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 52/140 (37%), Gaps = 9/140 (6%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV-- 323 +K + Y+IV G+ + + +N ++ + E II+S+Y+ Sbjct: 34 KKFIPSIANIVGTDLSNYKIVRTGQFAYGPVTSRNGEKISIAYLDSEDCIISSSYIVFEV 93 Query: 324 -KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNV 381 +D YL + + G + + + ++ + + VP I EQ +I Sbjct: 94 TNKDELDPEYLMLWFSRPEFDRYARYKSHGSVREIFDWNELCMVKLPVPSIDEQKNIVKA 153 Query: 382 INVETARIDVLVEKIEQSIV 401 T RI ++Q I Sbjct: 154 YKTITDRI-----ALKQQIN 168 Score = 41.7 bits (96), Expect = 0.26, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 61/161 (37%), Gaps = 16/161 (9%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTG--KYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + ++ R + + + ++ + K++P N +D S I GQ Sbjct: 8 LGDYIEIVDNRNRD-------LSITNLLGVSIAKKFIPSIANIVGTDLSNYKIVRTGQFA 60 Query: 88 Y----GKLGPYLRKAIIADFDGICSTQFLVL---QPKDVLPELLQGWLLSIDVTQRIEAI 140 Y + G + A + D I S+ ++V ++ PE L W + + Sbjct: 61 YGPVTSRNGEKISIAYLDSEDCIISSSYIVFEVTNKDELDPEYLMLWFSRPEFDRYARYK 120 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 G+ DW + + +P+P + EQ I + T RI Sbjct: 121 SHGSVREIFDWNELCMVKLPVPSIDEQKNIVKAYKTITDRI 161 >gi|239948141|ref|ZP_04699894.1| type I restriction-modification enzyme, S subunit [Rickettsia endosymbiont of Ixodes scapularis] gi|239922417|gb|EER22441.1| type I restriction-modification enzyme, S subunit [Rickettsia endosymbiont of Ixodes scapularis] Length = 159 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 21/141 (14%), Positives = 45/141 (31%), Gaps = 4/141 (2%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + ++ G+I+F + + S + + + Sbjct: 11 FTDIKYVKIDKETFRQFKLNKGDILFNRTNSFELVGKTSIFEAESEYCFASYLIKIVVNQ 70 Query: 328 --IDSTYLAWLMRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 I S +L M + K YA S + ++ + + + +P + Q +I + Sbjct: 71 EKILSNFLNLYMNTDLFQKNLKNYAKQSNNQANINAQILLAQKIPLPSLLIQEEIIAELE 130 Query: 384 VETARIDVLVEKIEQSIVLLK 404 E I+ E I+ LK Sbjct: 131 HERNIIEANKETIKLFENKLK 151 >gi|156978012|ref|YP_001448918.1| type I restriction enzyme S subunit [Vibrio harveyi ATCC BAA-1116] gi|156529606|gb|ABU74691.1| hypothetical protein VIBHAR_06809 [Vibrio harveyi ATCC BAA-1116] Length = 90 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 10/50 (20%), Positives = 21/50 (42%), Gaps = 4/50 (8%) Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + EQ I +V +E +E + K+ + + + +TG+ L Sbjct: 37 LNEQQKIASVRTAADKE----IELLETKLAHFKQEKKALMQQLLTGKRRL 82 >gi|329123771|ref|ZP_08252329.1| type I restriction-modification system restriction endonuclease [Haemophilus aegyptius ATCC 11116] gi|327469258|gb|EGF14729.1| type I restriction-modification system restriction endonuclease [Haemophilus aegyptius ATCC 11116] Length = 219 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 55/177 (31%), Gaps = 6/177 (3%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 F V + + + S I+ + + T + + I Sbjct: 28 WDKRFNAVEKEKQPKVIKYHYYLASELKPLIVDGGNVKLLTTNESDIWTTEELVQNNISE 87 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGS 351 I + + + +A + D+ +L + + S + GS Sbjct: 88 GEIIAIPWGGNPIVQYYKGKFVTADNRIATSNNTKILDNKFLYYFLLSKLDVISSFYRGS 147 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI---EQSIVLLKE 405 G+ + V + + +PP+ Q +I +++ T L ++ ++ +E Sbjct: 148 GI-KHPSMYHVLEMLIPIPPLSVQTEIVKILDTLTELTSELTSELILRQKQYEYYRE 203 >gi|319778990|ref|YP_004129903.1| hypothetical protein TEQUI_0822 [Taylorella equigenitalis MCE9] gi|317109014|gb|ADU91760.1| hypothetical protein TEQUI_0822 [Taylorella equigenitalis MCE9] Length = 178 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 53/154 (34%), Gaps = 4/154 (2%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N + + N I++++ N+ + V IV+ + + Sbjct: 25 NWEFVNYLDTGNITANKIEQIKHINLKSDKLPSRARRKVRFNSIVYSTVRPNQLHYGIIK 84 Query: 308 AQVMERGIITSAYMAV-KPHGIDSTYLAWLMRSYDLCKVFYAMG---SGLRQSLKFEDVK 363 Q + T + + Y+ +L+ + + + S K D++ Sbjct: 85 EQPDNFLVSTGFVVIDVIKNRAIPDYIYYLLTQKEFINFLQTIAEHSTSTYPSFKASDIE 144 Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 LPVL+P + Q + NV+ +I + + + Sbjct: 145 NLPVLIPDMTTQEKVVNVLLTIDKKIQINIAINQ 178 >gi|13508354|ref|NP_110304.1| hypothetical protein MPN615 [Mycoplasma pneumoniae M129] gi|12229975|sp|P75180|T1SH_MYCPN RecName: Full=Putative type-1 restriction enzyme specificity protein MPN_615; AltName: Full=S.MpnORFHP; AltName: Full=Type I restriction enzyme specificity protein MPN_615; Short=S protein gi|1673894|gb|AAB95875.1| hypothetical protein MPN_615 [Mycoplasma pneumoniae M129] Length = 249 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 54/176 (30%), Gaps = 10/176 (5%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ-------IVDPGEI 291 + N +I + G I K RN + Y + + Sbjct: 56 RKIYGANIPFETFQVKDICEIRRGRAITKAYIRNNPGENPVYSAATTNDGELGRIKDCDF 115 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 +I + + + + + + T + + K + + Sbjct: 116 DGEYITWTTNGYAGVVFYRNGKFNASQDCGVLKVKNKKICTKFLSFLLKIEAPKFVHNLA 175 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 S R L + + + + PP++ Q I +++ + LVE I I + K++ Sbjct: 176 S--RPKLSQKVMAEIELSFPPLEIQEKIADILFAFEKLCNDLVEGIPAEIEMRKKQ 229 Score = 37.1 bits (84), Expect = 6.3, Method: Composition-based stats. Identities = 7/55 (12%), Positives = 19/55 (34%), Gaps = 4/55 (7%) Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + + PP++ Q I +++ T L ++ + R + Sbjct: 2 QGILAEIELDFPPLQIQEKIATILDTFTE----LSAELRERKKQYAFYRDYLLNQ 52 >gi|158315561|ref|YP_001508069.1| restriction modification system DNA specificity subunit [Frankia sp. EAN1pec] gi|158110966|gb|ABW13163.1| restriction modification system DNA specificity domain [Frankia sp. EAN1pec] Length = 374 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 14/95 (14%), Positives = 30/95 (31%), Gaps = 3/95 (3%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + + + + G D +L +L+R+ D + + Sbjct: 54 TLGRSGSSIGTVTYVPSDYWPLNTVLFVEDFQGNDPRFLYFLLRTIDFARF---NSGSAQ 110 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 SL + + + P EQ I V+ +I Sbjct: 111 PSLNRNYIAAVELRAPEYPEQRAIAAVLGALDDKI 145 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 55/407 (13%), Positives = 110/407 (27%), Gaps = 42/407 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ W+ + +L G + + G + I Sbjct: 2 PE-WRRSSLADLVRLRRGFDLPAPE-----------RRAGCFPVVGSAGVSGWHDRGPIA 49 Query: 82 AKGQILYGKLGPYLRKA-IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G I G+ G + + +T V + P L L +ID Sbjct: 50 GPG-ITLGRSGSSIGTVTYVPSDYWPLNTVLFVEDFQGNDPRFLYFLLRTIDF----ARF 104 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ + I + + P EQ I + A +I EL + + Sbjct: 105 NSGSAQPSLNRNYIAAVELRAPEYPEQRAIAAVLGALDDKIALNHRLASTARELAEARYA 164 Query: 201 ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 A T+G +E + R +L+ Sbjct: 165 A-----ATRGPGRRELRLGDLVETL--------------TRGITPRYTADDSALVVLNQK 205 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA- 319 + G P + + + +++ + R R + + Sbjct: 206 CVRAGRVDLAPARGTDPATVPAAKRLRADDVLVNSTGIGTLGRVARWVHATRATVDSHVT 265 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + + P +D A+ + + GS + L + L + VP + +I Sbjct: 266 VVRLAPDRLDPVCGAFALLAAQPRIASLGEGSTSQTELSRAALNDLVIAVPAAERCAEIG 325 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + A +D E L R + ++G+I +R + Sbjct: 326 AEL----AALDARGEAAHAESAALARLRDALSPKLMSGEIRVRDAER 368 >gi|238755000|ref|ZP_04616348.1| Restriction modification system DNA specificity domain [Yersinia ruckeri ATCC 29473] gi|238706704|gb|EEP99073.1| Restriction modification system DNA specificity domain [Yersinia ruckeri ATCC 29473] Length = 307 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 42/139 (30%), Gaps = 13/139 (9%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLM 337 T + G+++ R + I A V+ G+DS YL + Sbjct: 38 HEHTRYGLKKGDLIICEGG--EPGRCAIWEDEIPNMKIQKALHRVRTLSGLDSEYLYYWF 95 Query: 338 RSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + L + +K +P+ +PP+ Q ID + Sbjct: 96 LFSTRAGHIEPFFTGTTIKHLTGKALKEIPIRIPPLTYQQ----YGAKLLRGIDNKITL- 150 Query: 397 EQSIVLLKERRSSFIAAAV 415 + I E +A A+ Sbjct: 151 NRQINKTLE----LMAQAL 165 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 53/170 (31%), Gaps = 3/170 (1%) Query: 40 RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAI 99 + +G+ Y+G +V G + ++ T KG ++ + G R AI Sbjct: 4 KNKNTGEYHPYLGNSNVRWGEFELDDLAEMKFEAHEHTRYGLKKGDLIICEGGEPGRCAI 63 Query: 100 IADF--DGICSTQFLVLQPKDVLPELLQGWLLSIDVT-QRIEAICEGATMSHADWKGIGN 156 D + ++ L + IE G T+ H K + Sbjct: 64 WEDEIPNMKIQKALHRVRTLSGLDSEYLYYWFLFSTRAGHIEPFFTGTTIKHLTGKALKE 123 Query: 157 IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYI 206 IP+ IPPL Q + + +I + + ++ Sbjct: 124 IPIRIPPLTYQQYGAKLLRGIDNKITLNRQINKTLELMAQALFKSWFVDF 173 Score = 36.3 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 28/69 (40%), Gaps = 3/69 (4%) Query: 19 GAIPKHWKVVPIKRF-TKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 G +PK WKV + ++L G + + + + I + + + Y P + +++ Sbjct: 239 GWVPKGWKVKILGEITSELRRGISPKYIDEGGVQVINQKCIRNHEVSYEPARRHDQEAKR 298 Query: 76 STVSIFAKG 84 + G Sbjct: 299 TDGRALKLG 307 >gi|224371955|ref|YP_002606121.1| HsdS3 [Desulfobacterium autotrophicum HRM2] gi|223694674|gb|ACN17957.1| HsdS3 [Desulfobacterium autotrophicum HRM2] Length = 528 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 45/377 (11%), Positives = 107/377 (28%), Gaps = 37/377 (9%) Query: 29 PIKRFTKLN--TGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-T 77 P+ R + +G +++ + ++V+ SR + + Sbjct: 49 PLGRIADVTKLSGFEFTKYFTENDNFSREVPCVMSQNVQENNLDLTNTIFISRNTHFALK 108 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICS--TQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S + G+I+ G Y R A++ G+ + P + +L S Sbjct: 109 RSSLSHGEIVLSYTGQYRRAAVVPANKGLLHLGPNVCKITIHKDDPFFITSFLNSYYGQS 168 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK------IIAETVRIDTLITERI 189 ++ + + I +P+ Q I K + A + + Sbjct: 169 ILDREKTISAQPTVNMARIRTVPVITIEDFSQKYIGNKVRQAETLRAWERKCKNKAENLV 228 Query: 190 RFIELLKEKKQ--ALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL------- 240 Q + + I ++ L + +K + + + L+ + Sbjct: 229 TGELKWDNNIQNTSTFNRISSEELQIRLDLKFNSPQRIALLRHFRKHDVIREELSKLVSI 288 Query: 241 --VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY-----QIVDPGEIVF 293 + T+ + L G L + Y V G+I F Sbjct: 289 SAMIGWKGLTTEYYQKTGPWLLRGIEFNDGVIETDKLVCIAEHKYLEQPQIHVREGDIAF 348 Query: 294 RFIDLQNDKRSLRS-AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + M G + + I+ YL +++ + + +G Sbjct: 349 SKDGTIGKAVVIPALTNRMAVGSTVARLRILDNVEINPYYLQFILNHKSVQIQVKSFATG 408 Query: 353 -LRQSLKFEDVKRLPVL 368 + + E + +L + Sbjct: 409 VAQPHITQEWIAQLIIP 425 >gi|125973663|ref|YP_001037573.1| hypothetical protein Cthe_1148 [Clostridium thermocellum ATCC 27405] gi|125713888|gb|ABN52380.1| hypothetical protein Cthe_1148 [Clostridium thermocellum ATCC 27405] Length = 427 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 48/395 (12%), Positives = 120/395 (30%), Gaps = 35/395 (8%) Query: 27 VVPIKRFT-KLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++ + ++ +G + I D+ + Y+ + + I Sbjct: 36 LITLIDICKEITSGIRVKKEYYTDKNGYKIIAPGDIRNEVI-YINELKVVQPEVVREKDI 94 Query: 81 FAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G IL G + + + + ++ + + +D + L + Q + Sbjct: 95 INNGDILITASGKSGQVIYVNEVLEGCVVTSDIIKITLRDRDKGIRLYKFLKSSIGQMLL 154 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLI---------REKIIAETVRIDTLITERI 189 + ++ + + N+ +P Q EK+ I + + Sbjct: 155 NSIKIGILNKIFVEDVENLLIPEDFDTYQEDCSDDSTVYAEAEKLYRSAENIFYRVFDYK 214 Query: 190 RFIELLKEKKQALVSYIVTKGLNPDVK--MKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 + LK + Y+ + L+P+ + D + + LV Sbjct: 215 GEKKNLKHFY--VTEYLDSHRLDPEYYSNFYTELYRVIHKNFDDVKWEELGELVEIKKAD 272 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-----YQIVDPGEIVFRFIDLQNDK 302 ++ ++ + I + + Y IV GEIV Sbjct: 273 KPEISKNQKVKYFLLADIDPNFSIIKETHEDFYGNLSNRMRYIVRRGEIVTAKGGSATGT 332 Query: 303 RSLRSAQVMERG---IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LK 358 + +A + E+ + T A + P I+ YL +L + + G ++ Sbjct: 333 KGHATALITEKFDGLVTTDALYNLVPRRINPYYLLFLFKQPIILNQVNMFTKGTLYKLIQ 392 Query: 359 FEDVKRLPVLV--PPIKEQFDITNVINVETARIDV 391 D +++ + ++EQ I + + + + Sbjct: 393 RNDFEKIKIPRLESSLEEQ--IVDKMMNYLSVLQN 425 >gi|313904107|ref|ZP_07837487.1| restriction modification system DNA specificity domain [Eubacterium cellulosolvens 6] gi|313471256|gb|EFR66578.1| restriction modification system DNA specificity domain [Eubacterium cellulosolvens 6] Length = 309 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 54/157 (34%), Gaps = 8/157 (5%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKP--ESYETYQIVDPGEIVFRFIDLQNDKR 303 L E + YG + + ET + E+ + GE++ + Sbjct: 10 YSKGDLREKGTPIILYGRLYTRYETVISDVDTYVEAKDGSVYSKGGEVIVPGSGETAEDI 69 Query: 304 SLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 S+ S ++ + P ID +LA + + + + M G L D Sbjct: 70 SIASVVEKSGILLGGDLNIINPPANIDPAFLAISISNGNPHRDMAKMAQGKSVVHLHNAD 129 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + ++ + P +EQ I++ A D L+ + Sbjct: 130 LAKIDLPYPCYEEQRKISSY----FASFDNLITLHHR 162 >gi|152979300|ref|YP_001344929.1| restriction modification system DNA specificity subunit [Actinobacillus succinogenes 130Z] gi|150841023|gb|ABR74994.1| restriction modification system DNA specificity domain [Actinobacillus succinogenes 130Z] Length = 188 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 50/140 (35%), Gaps = 11/140 (7%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAW 335 E ++ + G+I+ + + + + + + V I YL W Sbjct: 52 EQVREHEWLREGDILIPSRGNNYQAVYIDGRITDRKAVASPHFFVIRVASPQILPKYLYW 111 Query: 336 LMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + K + + +S++ ++ LP+ +PP+ Q I ++ ETA + L+ Sbjct: 112 WLNLQASQKYLNQNIEGSITKSIRRPILQALPIKLPPLSNQAMIISI--AETAEQERLIA 169 Query: 395 KIEQSIVLLKERRSSFIAAA 414 L E + A Sbjct: 170 L------RLIENSKRLMNAL 183 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 57/157 (36%), Gaps = 12/157 (7%) Query: 29 PIKRFTKLNTG-----RTS-ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +K+ + TG + + +++ + ++D + G +R Sbjct: 2 KLKQVADIQTGYLFRTKVPEDPNGNVVVVQMKDCSAINGIDWEHCVKTRLEQVREHEWLR 61 Query: 83 KGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQR 136 +G IL G + I D + S F V++ +LP+ L WL + Sbjct: 62 EGDILIPSRGNNYQAVYIDGRITDRKAVASPHFFVIRVASPQILPKYLYWWLNLQASQKY 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + EG+ + +P+ +PPL+ Q +I Sbjct: 122 LNQNIEGSITKSIRRPILQALPIKLPPLSNQAMIISI 158 >gi|312870863|ref|ZP_07730968.1| conserved hypothetical protein [Lactobacillus iners LEAF 3008A-a] gi|311093553|gb|EFQ51892.1| conserved hypothetical protein [Lactobacillus iners LEAF 3008A-a] Length = 222 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 15/153 (9%), Positives = 50/153 (32%), Gaps = 7/153 (4%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + E+ + D +I+ + + + L + R + +A + Sbjct: 71 LTDFKPNAEAQSSLITFDKDDIIIGAMRVYFHRVVLAPCDGITRTTCFT--LAPYNNEYL 128 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 S L + + ++ + + +++P + ++ + Sbjct: 129 SFALLCCDQESSIDYAQSTSKGSTMPYAIWEGGLGDMEIIIPTPEIAKKFNEIVLPMLRQ 188 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I + + L+E R + + ++G++D+ Sbjct: 189 IQNSYFENNR----LREIRDALLPRLMSGEVDV 217 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 7/192 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 WK +K KL ++ ++G++ + Y+ ++ + T + D S++ Sbjct: 30 SDWKKGKLKDVLKLKR-QSIKTGENTTLPYLPIDVIPMRT--FALTDFKPNAEAQSSLIT 86 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F K I+ G + Y + ++A DGI T L P E L LL D I+ Sbjct: 87 FDKDDIIIGAMRVYFHRVVLAPCDGITRTTCFTLAP--YNNEYLSFALLCCDQESSIDYA 144 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + S + + + I +K + + I L+E + Sbjct: 145 QSTSKGSTMPYAIWEGGLGDMEIIIPTPEIAKKFNEIVLPMLRQIQNSYFENNRLREIRD 204 Query: 201 ALVSYIVTKGLN 212 AL+ +++ ++ Sbjct: 205 ALLPRLMSGEVD 216 >gi|322690730|ref|YP_004220300.1| truncated endonuclease [Bifidobacterium longum subsp. longum JCM 1217] gi|320455586|dbj|BAJ66208.1| truncated endonuclease [Bifidobacterium longum subsp. longum JCM 1217] Length = 116 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 35/92 (38%), Gaps = 7/92 (7%) Query: 333 LAWLMRSYDLCKVFYAMGS---GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + M + + F + +K ++ VL+PP + ++ + I Sbjct: 16 WFYYMWTKKHMRRFIMLAKDRATTMGHIKRSALQESKVLIPPADVMAE----LSAKMQPI 71 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + ++ L E R + + ++G+ID+ Sbjct: 72 VDEIIGLKVQSRKLGELRDALLPKLMSGEIDI 103 >gi|262067419|ref|ZP_06027031.1| putative type I restriction-modification enzyme, S subunit [Fusobacterium periodonticum ATCC 33693] gi|291378862|gb|EFE86380.1| putative type I restriction-modification enzyme, S subunit [Fusobacterium periodonticum ATCC 33693] Length = 269 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 47/153 (30%), Gaps = 9/153 (5%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA--- 319 II + + ++ G+I+ I+ + + II Sbjct: 40 GIIDFDKLGYADIFEFEKYKDWLLKKGDILISHINSEKHLGKSAIFLDNDVSIIHGMNLL 99 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + V + YL ++ + S + S D K + + +P + Q Sbjct: 100 CIRVIDDIVFPEYLQLFFKTNQYKRQIKKIMKKSVNQASFSVNDFKEILIRLPKLDIQEK 159 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 I I ++ ++E + + L E S Sbjct: 160 IIKKIMT----LEKILENNKLKLKFLSELNKSL 188 Score = 37.1 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 37/254 (14%), Positives = 76/254 (29%), Gaps = 16/254 (6%) Query: 26 KVVPIKRFTKLNT-----GRTSESGKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVS 79 K+ +K ++ + S + I +E + G + + + + Sbjct: 2 KIFKLKDISEFIRNGVTIKQNISSKEGIPITRIETISKGIIDFDKLGYADIFEFEKYKDW 61 Query: 80 IFAKGQILYGKLGP---YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 + KG IL + + AI D D +L + + + +L T + Sbjct: 62 LLKKGDILISHINSEKHLGKSAIFLDNDVSIIHGMNLLCIRVIDDIVFPEYLQLFFKTNQ 121 Query: 137 IEA-----ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + + + I + +P L Q I +KI+ ++ + Sbjct: 122 YKRQIKKIMKKSVNQASFSVNDFKEILIRLPKLDIQEKIIKKIMTLEKILENNKLKLKFL 181 Query: 192 IELLKEKKQALVSYIVTKGLNP--DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNT 249 EL K + I T N + S I G P + F + + Sbjct: 182 SELNKSLFATMFGDIKTNDKNWELFEIKEISNILTRGKTPKYTLSSNVFVINQACIYWDK 241 Query: 250 KLIESNILSLSYGN 263 E+ + N Sbjct: 242 IKYENIKFHVEDEN 255 >gi|262068315|ref|ZP_06027927.1| putative type I restriction-modification system, S subunit [Fusobacterium periodonticum ATCC 33693] gi|291377971|gb|EFE85489.1| putative type I restriction-modification system, S subunit [Fusobacterium periodonticum ATCC 33693] Length = 76 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 4/67 (5%) Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 GS + L E + + +PPI+ Q I +I+ L +IE+SI + + Sbjct: 12 NGSTNQIELSKEKFSKFKIPIPPIELQNKFAERIE----KIEKLKFEIEKSIEIAQNLYD 67 Query: 409 SFIAAAV 415 S I+ Sbjct: 68 SLISKYF 74 >gi|317131473|ref|YP_004090787.1| restriction modification system DNA specificity domain [Ethanoligenens harbinense YUAN-3] gi|315469452|gb|ADU26056.1| restriction modification system DNA specificity domain [Ethanoligenens harbinense YUAN-3] Length = 188 Score = 53.6 bits (127), Expect = 6e-05, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 51/137 (37%), Gaps = 5/137 (3%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 I E+R L + + + +I+ R + V I+ + V+ Sbjct: 49 ISYTESRLHDLSKRNVPYGKYLCDNDILINSTGTGTAGRVAQLYCVPCPTIVDGHMIIVR 108 Query: 325 P-HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV-KRLPVLVP-PIKEQFDITNV 381 + I YL + M+++ + GS + L E + + + P + EQ +I + Sbjct: 109 AINDIVPRYLGYAMKAHQAEILQLDEGSTGQTELNRERLLSEIEISYPVSLDEQLNIVGI 168 Query: 382 INVETARI--DVLVEKI 396 ++ A+I + + Sbjct: 169 LSALDAQISENTKINHH 185 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 17/180 (9%), Positives = 43/180 (23%), Gaps = 13/180 (7%) Query: 25 WKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 WK P++ T S + + + + + Y + Sbjct: 7 WKTEPLRNVVSYITKGVPPVYAPYESETTVRVLNQKCNRNFSISYTESRLHDLSKRNVPY 66 Query: 79 -SIFAKGQILYGKLGP-YLRKA---IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 IL G + I ++++ + + G+ + Sbjct: 67 GKYLCDNDILINSTGTGTAGRVAQLYCVPCPTIVDGHMIIVRAINDIVPRYLGYAMKAHQ 126 Query: 134 TQRIEAICEGATMSHADWKG--IGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + ++ + + + L EQ+ I + A +I Sbjct: 127 AEILQLDEGSTGQTELNRERLLSEIEISYPVSLDEQLNIVGILSALDAQISENTKINHHL 186 >gi|295090946|emb|CBK77053.1| Type I restriction modification DNA specificity domain. [Clostridium cf. saccharolyticum K10] Length = 165 Score = 53.6 bits (127), Expect = 7e-05, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 43/105 (40%), Gaps = 8/105 (7%) Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 +L++ K+++ + + A++ D+ YL +L+ S DL + Sbjct: 64 NLKSKKQNIAQVVDGQFWVNNHAHIVQGNELCDTRYLCYLLNSMDLSGYV---TGSAQPK 120 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 L ++ + +L+P I Q I + + + +I + Q I Sbjct: 121 LSQANLNAVTLLLPTITVQKKIVHYLYMFDKKI-----TVNQQIN 160 >gi|325926906|ref|ZP_08188187.1| hypothetical protein XPE_2186 [Xanthomonas perforans 91-118] gi|325926911|ref|ZP_08188192.1| hypothetical protein XPE_2191 [Xanthomonas perforans 91-118] gi|325542722|gb|EGD14183.1| hypothetical protein XPE_2186 [Xanthomonas perforans 91-118] gi|325542727|gb|EGD14188.1| hypothetical protein XPE_2191 [Xanthomonas perforans 91-118] Length = 90 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 13/52 (25%), Positives = 21/52 (40%) Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 VPP + Q +I + A D L K+ + + S +A A G+ Sbjct: 2 FPVPPTQIQDEIVRRVEQLFAYADQLEAKVAAAKQRIDALTQSLLAKAFRGE 53 >gi|312872181|ref|ZP_07732254.1| conserved hypothetical protein [Lactobacillus iners LEAF 2062A-h1] gi|311092265|gb|EFQ50636.1| conserved hypothetical protein [Lactobacillus iners LEAF 2062A-h1] Length = 195 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 15/153 (9%), Positives = 50/153 (32%), Gaps = 7/153 (4%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + E+ + D +I+ + + + L + R + +A + Sbjct: 44 LTDFKPNAEAQSSLITFDKDDIIIGAMRVYFHRVVLAPCDGITRTTCFT--LAPYNNEYL 101 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 S L + + ++ + + +++P + ++ + Sbjct: 102 SFALLCCDQESSIDYAQSTSKGSTMPYAIWEGGLGDMEIIIPTPEIAKKFNEIVLPMLRQ 161 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I + + L+E R + + ++G++D+ Sbjct: 162 IQNSYFENNR----LREIRDALLPRLMSGEVDV 190 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 7/192 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 WK +K KL ++ ++G++ + Y+ ++ + T + D S++ Sbjct: 3 SDWKKGKLKDILKLKR-QSIKTGENTTLPYLPIDVIPMRT--FALTDFKPNAEAQSSLIT 59 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F K I+ G + Y + ++A DGI T L P E L LL D I+ Sbjct: 60 FDKDDIIIGAMRVYFHRVVLAPCDGITRTTCFTLAP--YNNEYLSFALLCCDQESSIDYA 117 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + S + + + I +K + + I L+E + Sbjct: 118 QSTSKGSTMPYAIWEGGLGDMEIIIPTPEIAKKFNEIVLPMLRQIQNSYFENNRLREIRD 177 Query: 201 ALVSYIVTKGLN 212 AL+ +++ ++ Sbjct: 178 ALLPRLMSGEVD 189 >gi|167626408|ref|YP_001676908.1| hypothetical protein Fphi_0188 [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596409|gb|ABZ86407.1| hypothetical protein Fphi_0188 [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 323 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 65/153 (42%), Gaps = 9/153 (5%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND-KRSLRSAQVMERGIITSAYM 321 N+ +K + Y+++ G+ + + + D K + + E+ II+SAY Sbjct: 6 NLEKKFIPSVANIVGTDLTKYKVIKKGQFGCKLMSVGRDGKLPISLMKDYEKAIISSAYY 65 Query: 322 AVKPHGIDSTYLAWLM----RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + +LM RS + +++ G+ +R S+ + D + + +P I++Q + Sbjct: 66 VFEVKNENELLSDYLMMWLSRSENDRYLWFKSGADVRGSISWNDFCSIEINIPSIEKQRE 125 Query: 378 ITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 I T R ++ EQ L+E + Sbjct: 126 IVAEYYAITNR----IKLNEQLNQKLEETAQAI 154 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 42/263 (15%), Positives = 76/263 (28%), Gaps = 16/263 (6%) Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQIL-----YGKLGPYLRKAIIADFDGICSTQFLVLQP 116 K++P N +D + + KGQ G+ G + I S+ + V + Sbjct: 10 KFIPSVANIVGTDLTKYKVIKKGQFGCKLMSVGRDGKLPISLMKDYEKAIISSAYYVFEV 69 Query: 117 KDVL---PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 K+ + L WL + + + W +I + IP + +Q I Sbjct: 70 KNENELLSDYLMMWLSRSENDRYLWFKSGADVRGSISWNDFCSIEINIPSIEKQREIVA- 128 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 E I I + + L+E QA+ P S E Sbjct: 129 ---EYYAITNRIKLNEQLNQKLEETAQAIYKEWFVDFEFPHN---FSHSELDSESDIRPY 182 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 +V + + L ++ E ++ PG + Sbjct: 183 KSGGGEMVWCEEFEKEIPKGWEKIFLKDLMNVKHGFAYKGEFFSEKENENILLTPGNVEI 242 Query: 294 RFIDLQNDKRSLRSAQVMERGII 316 +NDK +V + I Sbjct: 243 G-GGFKNDKFKYYYGKVPKDYIF 264 Score = 44.4 bits (103), Expect = 0.037, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 24/79 (30%), Gaps = 7/79 (8%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 IPK W+ + +K + G + + I + +VE G G + Sbjct: 198 EIPKGWEKIFLKDLMNVKHGFAYKGEFFSEKENENILLTPGNVEIGGG-FKNDKFKYYYG 256 Query: 74 DTSTVSIFAKGQILYGKLG 92 IF I+ Sbjct: 257 KVPKDYIFKPNDIMVTMTD 275 >gi|319777294|ref|YP_004136945.1| hypothetical protein MfeM64YM_0570 [Mycoplasma fermentans M64] gi|318038369|gb|ADV34568.1| Hypothetical Protein MfeM64YM_0570 [Mycoplasma fermentans M64] Length = 344 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 45/120 (37%), Gaps = 5/120 (4%) Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDV 362 + + ++ + K YL + S + + SG ++S+ E + Sbjct: 7 CAIINNELNGSLFSTGFYGFKSIYNKIKYLKLFIESPYYQILKDSFCSGVTQKSINDEKL 66 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR----SSFIAAAVTGQ 418 + + +PPI EQ I N + + +EK Q L E + S + A+ G+ Sbjct: 67 LNILIAIPPINEQEKIINKLISLDKFMKKYLEKENQLFKLDSEIKDKLQKSILQYAIQGK 126 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 50/330 (15%), Positives = 101/330 (30%), Gaps = 48/330 (14%) Query: 107 CSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 ST F + + L+ ++ S ++ C G T + + + NI + IPP+ E Sbjct: 19 FSTGFYGFKSIYNKIKYLKLFIESPYYQILKDSFCSGVTQKSINDEKLLNILIAIPPINE 78 Query: 167 QVLIREKIIAETVRIDTLITERIRFIELLKEK----KQALVSYIVTKGLNPDVK------ 216 Q I K+I+ + + + + +L E +++++ Y + L Sbjct: 79 QEKIINKLISLDKFMKKYLEKENQLFKLDSEIKDKLQKSILQYAIQGKLVKQDPNDEPAS 138 Query: 217 --MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 ++ IE L+ + K + ++ N I +N Sbjct: 139 KLLEAIQIEKNKLIKEGKIKKDKHESFIFQGEDKNYYEKIGSKVINITNEIPFEIPKNWV 198 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLR----SAQVMERGIITSAYMAVKPHGIDS 330 + S +++I I+ L K + + + + +P I Sbjct: 199 IVKISNISFRIDKKNIIIKTKQILSTGKYPIITQGQKFIEGYTNNVNNIFKVKEPIIIFG 258 Query: 331 TY----------------------------LAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 + L + L K G L + Sbjct: 259 DHTKTTKFVDFNFVPGGDGTVFLKPLKINPLFFYYLVNYLSKKIRNRGYARHYIL----L 314 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVL 392 K+ + +P I EQ I + I I+ L Sbjct: 315 KKEIIPIPNINEQNQIVSKIKKVFYFINCL 344 >gi|298674149|ref|YP_003725899.1| restriction modification system DNA specificity domain-containing protein [Methanohalobium evestigatum Z-7303] gi|298287137|gb|ADI73103.1| restriction modification system DNA specificity domain protein [Methanohalobium evestigatum Z-7303] Length = 204 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 22/207 (10%), Positives = 60/207 (28%), Gaps = 10/207 (4%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M+ S I + + E + + N+ E + + ++ + Sbjct: 1 MRCSDITETIKLKNLLESNKLIRGIAKTREDNSNDTEKINVFMVNIKNLEDGIVDLKSTE 60 Query: 277 PESYET----YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT-SAYMAVKPHGIDST 331 + + G+++ S + +I+ + + I Sbjct: 61 ECNVKKSDFEKPKPKKGDVIIPIRGSDFKSAVAPSGIENKGYVISLNLVALRVNNKILPR 120 Query: 332 YLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L+ S + G +S+ +++K L + VP + +Q + I Sbjct: 121 VLSEYFNSPQGQISLERISKGTKIKSIPIKELKELDIPVPNLDDQNKFDKYLEA----IQ 176 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAVTG 417 ++ + ++ + S G Sbjct: 177 DYKLRLREEKEFTEKMKKSVAFKYFRG 203 >gi|291540209|emb|CBL13320.1| Restriction endonuclease S subunits [Roseburia intestinalis XB6B4] Length = 282 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 39/283 (13%), Positives = 72/283 (25%), Gaps = 18/283 (6%) Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + +++ R+ I +L Y Sbjct: 1 MISDIIFSFMCFSFICENHHEVEFLYLLSPLSRVIFYIINDYLEQQLQLLYDYWFTQYNF 60 Query: 208 TKG----LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR--------KNTKLIESN 255 + L+P W+VKP + + N NT N Sbjct: 61 PNEDGQPYKASNGLMVWNKMINHLIPADWKVKPLGTICSFRNGINYDKNVDGNTIYKIIN 120 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++S + + P+ V I+ + R L + I Sbjct: 121 VRNISSSTLFLDESNFDEICLPKQQGDKYYVSDDSIIIARSGIPGATRILCNPSS--NII 178 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + P L G + +++ E +K L V +PP Sbjct: 179 FCGFIICCTPSDNTLQNYLTLYLRQFEGSSATQTGGSILKNVSQETLKNLIVPIPP---- 234 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + N N I L+ + V L R + + GQ Sbjct: 235 QSLLNQFNDSILPIYNLINSNTKENVQLITLRDWLLPMLMNGQ 277 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 59/191 (30%), Gaps = 7/191 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSR--QSD 74 IP WKV P+ G + D I + ++ S T + + Sbjct: 85 IPADWKVKPLGTICSFRNGINYDKNVDGNTIYKIINVRNISSSTLFLDESNFDEICLPKQ 144 Query: 75 TSTVSIFAKGQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + I+ + G P + + I F++ L Sbjct: 145 QGDKYYVSDDSIIIARSGIPGATRILCNPSSNIIFCGFIICCTPSDNTLQNYLTLYLRQF 204 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 G+ + + + + N+ +PIPP + + I+ I++ E ++ I Sbjct: 205 EGSSATQTGGSILKNVSQETLKNLIVPIPPQSLLNQFNDSILPIYNLINSNTKENVQLIT 264 Query: 194 LLKEKKQALVS 204 L L++ Sbjct: 265 LRDWLLPMLMN 275 >gi|29349930|ref|NP_813433.1| putative type I restriction-modification enzyme [Bacteroides thetaiotaomicron VPI-5482] gi|29341841|gb|AAO79627.1| putative type I restriction enzyme S.BthVORF4518BP [Bacteroides thetaiotaomicron VPI-5482] Length = 175 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 12/64 (18%), Positives = 29/64 (45%), Gaps = 2/64 (3%) Query: 335 WLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 +++++ +L + + L + K + + +PP KEQ I I +D++ Sbjct: 112 YVLQAINLHRKVLRESKVGSAIPHLNKKLFKAIEIPIPPYKEQQRIIKAITKAFMSLDLI 171 Query: 393 VEKI 396 +E + Sbjct: 172 MESL 175 >gi|148993704|ref|ZP_01823151.1| type I restriction-modification system, S subunit, truncation [Streptococcus pneumoniae SP9-BS68] gi|147927784|gb|EDK78807.1| type I restriction-modification system, S subunit, truncation [Streptococcus pneumoniae SP9-BS68] Length = 148 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 36/108 (33%), Gaps = 6/108 (5%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y IV ++ N +R + I+S YL + + Sbjct: 26 YAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEPVLEKINSEYLFYFCQL 82 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 Y+ K+ A+ SL D+ + + +PP+ Q + + + Sbjct: 83 YNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFVVQVDK 127 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 18/110 (16%), Positives = 39/110 (35%), Gaps = 3/110 (2%) Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 P G+ + I K ++ G+ G + ++ + T F + + + Sbjct: 15 FPIYGSGGIMGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSE 74 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + E + + T+ + NI +P+PPLA Q + Sbjct: 75 YLFYFCQLY---NFEKLNKAVTIPSLTKSDLLNISIPLPPLALQNEFADF 121 >gi|321310215|ref|YP_004192544.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802059|emb|CBY92705.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 196 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 9/58 (15%), Positives = 24/58 (41%), Gaps = 1/58 (1%) Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +G G+ L +K++ V +P + Q +I + + I+ + ++ + Sbjct: 131 IGGGVIPHLDIGKLKKVKVPIPSLSVQREIASKLGK-FREIEREISLRDKQYEYYRNY 187 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 59/176 (33%), Gaps = 10/176 (5%) Query: 30 IKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS--TVSIFAK 83 + K+ GR+ S + + + +++ S + + + Sbjct: 15 LGEVCKIQRGRSFSSKEYRDEGDPILRVRNIQDNQLCTDGLVYFSPEECKKDLSKVVIKH 74 Query: 84 GQILYGKLGPYLRKAI-IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G I G + D + + P L + + +L + +E + Sbjct: 75 GDIGVTTTGERCMAFLSQVDGSFYMNADICRIDPSPEL--IDKEYLFYFLLDLDLEPLIG 132 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G + H D + + +PIP L+ Q I K+ + I+ I+ R + E + Sbjct: 133 GGVIPHLDIGKLKKVKVPIPSLSVQREIASKL-GKFREIEREISLRDKQYEYYRNY 187 >gi|294793236|ref|ZP_06758382.1| type I restriction enzyme EcoDI specificity protein [Veillonella sp. 6_1_27] gi|294456181|gb|EFG24545.1| type I restriction enzyme EcoDI specificity protein [Veillonella sp. 6_1_27] Length = 363 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 17/146 (11%), Positives = 41/146 (28%), Gaps = 4/146 (2%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + P + + E V + + + + + + + Sbjct: 25 VTDGAYPFFTCDPNTLKIDDWAYDTEAVLLAGNNASGNYTAKYYKGKFNAYQRTYIIESA 84 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + + + L + + L + + L + P I Q I ++I Sbjct: 85 NTSLLTVRFLAFAITEQLRLLKSMSSGSTTKFLTIKILNGLDIPCPEITIQRKIASIIGS 144 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSF 410 D L+ ++ I LL+E Sbjct: 145 ----YDDLIGNNQKQIKLLEEAAQRL 166 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 47/401 (11%), Positives = 118/401 (29%), Gaps = 46/401 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ +K + TG+ + V G + D N+ + D Sbjct: 4 QIEKLKNIALIKTGKLDSN---------AAVTDGAYPFFTCDPNTLKIDDWAY---DTEA 51 Query: 86 ILYGKLGPYLRKA--IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 +L +++ L + + + ++++ G Sbjct: 52 VLLAGNNASGNYTAKYYKGKFNAYQRTYIIESANTSLLTVRFLAFAITEQLRLLKSMSSG 111 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 +T K + + +P P + Q I I + I + I+LL+E Q L Sbjct: 112 STTKFLTIKILNGLDIPCPEITIQRKIASIIGSYDDLIGN----NQKQIKLLEEAAQRLY 167 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 ++ G E + V E ++ + + + + Sbjct: 168 KEWFV-------DLRFPGYENIKNVDGVPEGWKLESV------GSVIKTVPRTVQIKTKD 214 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEI---VFRFIDLQNDKRSLRSAQVMERGIITSAY 320 +++ + E Y ++ + + + + +G + Sbjct: 215 YLREGTIPIIDQSREFIAGYTNLEDAIVSSEAPVIVFGDHTRILKYIQFPFAKGADGTQL 274 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + + L + S DL YA F+ +K +++P +I + Sbjct: 275 IISNTELMPAPLLYLSLLSVDLSNYHYAR--------HFKYLKEEMIIIPS----QEIAD 322 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 N + V+ + ++ R + + G+I++ Sbjct: 323 TFNNIVEPLFKRVQVLRDINRNCEQARDRLLPKLMNGEIEV 363 >gi|332800247|ref|YP_004461746.1| hypothetical protein TepRe1_2325 [Tepidanaerobacter sp. Re1] gi|332697982|gb|AEE92439.1| hypothetical protein TepRe1_2325 [Tepidanaerobacter sp. Re1] Length = 424 Score = 53.3 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 46/391 (11%), Positives = 105/391 (26%), Gaps = 33/391 (8%) Query: 30 IKRFTKLNTGRT------SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 ++ + T I D+ + Y+ + + I Sbjct: 36 LRDICEEITSGIRVRKEYYTDKDGYKIIAPGDIRNEVI-YINELKIVQPEVVREKDIINN 94 Query: 84 GQILY---GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G IL GK G + I + I S + L +L S + +I Sbjct: 95 GDILVTASGKSGQIIYVNGILEGCVITSDIIKITLKDKREGIRLYKFLKSSIGQMLLNSI 154 Query: 141 CEG-------ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 G + + + + ++ + E Sbjct: 155 KIGILNKIFVEDIEKLSIPEDFDTYGDDNWYDISPYSSAEKLYKSAELIFS-RLLDYKGE 213 Query: 194 LLKEKKQALVSYIVTKGLNPDVK--MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKL 251 K ++ ++ + L+P+ + + + + + + Sbjct: 214 EEYLKCFYVMKHLDSHRLDPEYYSNFYTELYRLIHKNTGNVKWQRIAEVAEIKRANKPDI 273 Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYET-----YQIVDPGEIVFRFIDLQNDKRSLR 306 E+ + I + + Y IV GE+V + Sbjct: 274 SENQKVKYFLLADIDPNLSIIKETHEDFYGNLSNRMRYIVRDGELVTAKGGSATGTKGHV 333 Query: 307 SAQVMERG---IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDV 362 SA + E + T A + P I YL +L + + G ++ +D Sbjct: 334 SALITEEFDGMVTTDALYNLVPKNISPYYLLFLFKQPVILNQINMFTKGTLYKLIQRKDF 393 Query: 363 KRLPVLV--PPIKEQFDITNVINVETARIDV 391 +++ + ++EQ I + + ++ Sbjct: 394 EQIKIPRLESSLEEQ--IADKMLNYLTKLRN 422 >gi|301300026|ref|ZP_07206251.1| conserved hypothetical protein [Lactobacillus salivarius ACS-116-V-Col5a] gi|300852417|gb|EFK80076.1| conserved hypothetical protein [Lactobacillus salivarius ACS-116-V-Col5a] Length = 185 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 57/170 (33%), Gaps = 3/170 (1%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 LV + N+ I+ +L + + + I G++V Sbjct: 1 MKLNELVKIESGINSVRIKDQNYTLYTIEDVNYDLGHGEDYQHDKTNGKSITARGDVVIN 60 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SG 352 + ++A M I + + +DS YL +L+ + + A Sbjct: 61 TVSNLASVVHSKNAGKMLNQIF-LRLNILDENVLDSWYLCYLLNKSEYIRYQEAAIMDGS 119 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + L +++ L + +P I +Q I + + +EK E L Sbjct: 120 VIRKLTKANLEDLEINLPEIADQKKIGEAYKQIMKKYTLAMEKAELERDL 169 >gi|227888664|ref|ZP_04006469.1| possible type I site-specific deoxyribonuclease, specificity subunit [Lactobacillus johnsonii ATCC 33200] gi|227850779|gb|EEJ60865.1| possible type I site-specific deoxyribonuclease, specificity subunit [Lactobacillus johnsonii ATCC 33200] Length = 177 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 53/160 (33%), Gaps = 9/160 (5%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 I G+ I + T N + + +IVD I+ I + Sbjct: 25 WNSKDICFIKPDVIGSGIDSITTSNEYISNSASSKARIVDRNTILITCIGNIGRIGIISD 84 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 +V I + P+ + + + ++ S + + + V Sbjct: 85 KKVAFNQQINAII----PNYKINIRYLAYVLLFSQPRLNALANSAVVPIVNKTQLGNFKV 140 Query: 368 LV-PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + P ++ Q I ++++ +I +++K + I L E Sbjct: 141 KINPNLESQGKIVSILD----KIAKIIKKQTKEIEHLDEL 176 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/176 (22%), Positives = 62/176 (35%), Gaps = 9/176 (5%) Query: 27 VVPIKRFTKLNTGRTSESG-------KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 V +K +K+ TG T KDI +I + + SG + S +S Sbjct: 2 EVSLKEISKIVTGNTPSKKNKNYWNSKDICFIKPDVIGSGIDSITTSNEYISNSASSKAR 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 I + IL +G R II+D + Q + P + ++L R+ A Sbjct: 62 IVDRNTILITCIGNIGRIGIISDKKVAFNQQINAIIPNYKINIRYLAYVLLFS-QPRLNA 120 Query: 140 ICEGATMSHADWKGIGNIPMPI-PPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + A + + +GN + I P L Q I + I E EL Sbjct: 121 LANSAVVPIVNKTQLGNFKVKINPNLESQGKIVSILDKIAKIIKKQTKEIEHLDEL 176 >gi|30022541|ref|NP_834172.1| Type I restriction-modification system specificity subunit [Bacillus cereus ATCC 14579] gi|229129744|ref|ZP_04258711.1| Type I restriction-modification system specificity subunit [Bacillus cereus BDRD-Cer4] gi|29898099|gb|AAP11373.1| Type I restriction-modification system specificity subunit [Bacillus cereus ATCC 14579] gi|228653660|gb|EEL09531.1| Type I restriction-modification system specificity subunit [Bacillus cereus BDRD-Cer4] Length = 188 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 31/138 (22%), Positives = 55/138 (39%), Gaps = 11/138 (7%) Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAW 335 +Y+ + G++VF F+ K + S + I + + +K +DS+YL + Sbjct: 53 NSNYKDSYLSSAGDVVFSFVSS---KAGIVSDLNQGKIISQNFAKLIIKHEYLDSSYLCY 109 Query: 336 LMR-SYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN---VINVETARID 390 + SY + K +M L +K L + +P I++Q I + A Sbjct: 110 ALNESYSMKKQMAISMQGSTVPKLTPAILKALEIKLPSIEKQRTIGKAYFFLRKRQALAK 169 Query: 391 VLVEKIEQSIVLLKERRS 408 VE EQ LK + Sbjct: 170 KQVELEEQL--YLKALKQ 185 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 50/155 (32%), Gaps = 11/155 (7%) Query: 29 PIKRFTKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 ++ + GR G + Y L + G+ L S S+ + Sbjct: 2 KLEDIVTVRIGRNLSRGNEKNDLNLVAYSYEDLMNDLDGSFLELQASSYSGNSNYKDSYL 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQF---LVLQPKDVLPELLQGWLLSIDVTQRI 137 + G +++ + + I S F ++ L S + +++ Sbjct: 62 SSAGDVVFSFVSSKAGIVSDLNQGKIISQNFAKLIIKHEYLDSSYLCYALNESYSMKKQM 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 +G+T+ + + + +P + +Q I + Sbjct: 122 AISMQGSTVPKLTPAILKALEIKLPSIEKQRTIGK 156 >gi|126661169|ref|ZP_01732246.1| type I site-specific deoxyribonuclease [Cyanothece sp. CCY0110] gi|126617542|gb|EAZ88334.1| type I site-specific deoxyribonuclease [Cyanothece sp. CCY0110] Length = 201 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 57/181 (31%), Gaps = 11/181 (6%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 F I + N + + L E + ++ P I+ Sbjct: 31 RFSHRPRNAPHLYENGTYPFIQTGDVANGKGRNIQYSQYLNEEGLKVSKLFQPATILITI 90 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLR 354 L I++ ++ YL + +R+ + + + Sbjct: 91 AANIGSTAILTYPACFPDSIVS----IKPSKTMNIDYLEYYLRTQ--QQYLNDIAPQKAQ 144 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 +++ + ++ L V P EQ I N E +I+ + EQ I + +++ + + Sbjct: 145 KNINLKILEPLLVACPEKTEQDKIIN----EVLKIEQQINNFEQEIFAIPQQKEAILKKY 200 Query: 415 V 415 + Sbjct: 201 L 201 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 58/185 (31%), Gaps = 11/185 (5%) Query: 28 VPIKRFTKLNTGRTSESGKD---------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 + + + L GR S ++ +I DV +G G+ + + Sbjct: 19 IKLSQLASLKRGRFSHRPRNAPHLYENGTYPFIQTGDVANGKGRNIQYSQYLNEEGLKVS 78 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +F IL + + I + + ++P + + L Q + Sbjct: 79 KLFQPATILIT-IAANIGSTAILTYPACFPDSIVSIKPSKTMNIDYLEYYLRTQ-QQYLN 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I + + K + + + P EQ I +++ +I+ E + + Sbjct: 137 DIAPQKAQKNINLKILEPLLVACPEKTEQDKIINEVLKIEQQINNFEQEIFAIPQQKEAI 196 Query: 199 KQALV 203 + + Sbjct: 197 LKKYL 201 >gi|298484559|ref|ZP_07002687.1| type I restriction-modification enzyme [Bacteroides sp. D22] gi|298269287|gb|EFI10920.1| type I restriction-modification enzyme [Bacteroides sp. D22] Length = 156 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 15/87 (17%), Positives = 29/87 (33%) Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + G S + + + +T + + + L + K + Sbjct: 67 VFRTPIDGYQGSTFKLLSINYDMNTEYVLQVINLHRTILRENKVGSAIPHLNKKLFKAIE 126 Query: 367 VLVPPIKEQFDITNVINVETARIDVLV 393 V +PP KEQ I N +DV++ Sbjct: 127 VPIPPYKEQQRIVEAANKVFMSLDVIM 153 >gi|300780276|ref|ZP_07090132.1| restriction modification system DNA specificity subunit [Corynebacterium genitalium ATCC 33030] gi|300534386|gb|EFK55445.1| restriction modification system DNA specificity subunit [Corynebacterium genitalium ATCC 33030] Length = 328 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 48/328 (14%), Positives = 109/328 (33%), Gaps = 52/328 (15%) Query: 108 STQFLV--LQPKDVLPELLQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPL 164 ST+F+V +P + + + + D+ ++ G + D + + IP L Sbjct: 28 STEFIVLRGKPGVTITDFAYYFATTPDIHDLSVSLMTGTSGRQRVDIDALCATQVTIPDL 87 Query: 165 AEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEW 224 Q I + + +I I L + +V +G+ S Sbjct: 88 RTQHSIVSILGSLDDKIAANTRVINSSITLAES--------LVDRGI-------RSTRVR 132 Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G V A +T + ++ + L + ++ + + + + Sbjct: 133 LGDV----------ARITMGTSPKGEYLKEEVGGLPFYQGVRDFDDLTPQKRVFTENPVR 182 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 + G+I+F + + RG+ + L +L+RS+ Sbjct: 183 EAEAGDILFAVRAPVGEVNIASEPTAIGRGLAA------IRGLNNHVALFYLLRSHPKIW 236 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL-- 402 + + S+ D+ + I ++ ++ D+L + QS+VL Sbjct: 237 NTHQDNGTVFASINKTDLSNALIP------------EIEMDQSQYDLLAKLHNQSLVLTS 284 Query: 403 ----LKERRSSFIAAAVTGQIDLRGESQ 426 L + R + ++G+I +R Q Sbjct: 285 QNFILAKTRDELLPLLMSGKITVREAKQ 312 Score = 45.9 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 41/148 (27%), Gaps = 4/148 (2%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + ++ G + + +G G + R + V G IL Sbjct: 131 VRLGDVARITMGTSPKGEYLKEEVGGLPFYQGVRDFDDLTPQKRVFTENPVREAEAGDIL 190 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 + P I ++ I + + + + +LL G + Sbjct: 191 FAVRAPVGEVNIASEPTAI---GRGLAAIRGLNNHVALFYLLRSHPKIWNTHQDNGTVFA 247 Query: 148 HADWKGIGN-IPMPIPPLAEQVLIREKI 174 + + N + I Q + K+ Sbjct: 248 SINKTDLSNALIPEIEMDQSQYDLLAKL 275 >gi|237726585|ref|ZP_04557066.1| type I site-specific deoxyribonuclease [Bacteroides sp. D4] gi|229435111|gb|EEO45188.1| type I site-specific deoxyribonuclease [Bacteroides dorei 5_1_36/D4] Length = 143 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 20/130 (15%), Positives = 45/130 (34%), Gaps = 6/130 (4%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P W V IK +N ++ ++ ++ + ++ G + + + Sbjct: 9 QLPDGWCYVTIKEVFIINPKNKADDDVEVGFVPMANITDGYNNTFKYETKQWGKIKTGFT 68 Query: 80 IFAKGQILYGKLGPYLRK------AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 FA G I K+ P L + + G+ +T+ V +P + + + S Sbjct: 69 HFANGDIAVAKISPCLENRKSVVLKGLPNGIGVGTTELHVFRPLFLDVQYGLYFFKSDYF 128 Query: 134 TQRIEAICEG 143 + G Sbjct: 129 ISQCVGSFNG 138 >gi|313140399|ref|ZP_07802592.1| type I restriction-modification system [Bifidobacterium bifidum NCIMB 41171] gi|313132909|gb|EFR50526.1| type I restriction-modification system [Bifidobacterium bifidum NCIMB 41171] Length = 167 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 19/121 (15%), Positives = 38/121 (31%), Gaps = 12/121 (9%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 + + +V +V + ++ +G + ++ WL S Sbjct: 58 WHSKYMVKGPGVVTGRSGTIGSLHYI----EQNFWPHNTSLWVTSFNGNEPRFIYWLYAS 113 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQ 398 L + +L DV L V P + EQ I +R+D L+ ++ Sbjct: 114 IGLERF---GSGSGVPTLNRNDVHDLRVGFPCDVAEQRRIGTF----FSRLDSLITLHQR 166 Query: 399 S 399 Sbjct: 167 K 167 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 43/158 (27%), Gaps = 17/158 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + L G + + +G G + + + Sbjct: 20 WEQRKLGEVAPLQRGFDLPVNQMTPGPYPVVMSNGIGGW------------HSKYMVKGP 67 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ G+ G I +T V P + SI + E G+ Sbjct: 68 GVVTGRSGTIGSLHYIEQNFWPHNTSLWVTSFNGNEPRFIYWLYASIGL----ERFGSGS 123 Query: 145 TMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRI 181 + + + ++ + P +AEQ I I Sbjct: 124 GVPTLNRNDVHDLRVGFPCDVAEQRRIGTFFSRLDSLI 161 >gi|154685170|ref|YP_001420331.1| hypothetical protein RBAM_007150 [Bacillus amyloliquefaciens FZB42] gi|154351021|gb|ABS73100.1| conserved hypothetical protein [Bacillus amyloliquefaciens FZB42] Length = 408 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 28/137 (20%), Positives = 54/137 (39%), Gaps = 9/137 (6%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAV---KPHGID 329 ++ Y+I+ + + ++ DK+ + Q + II++AY I Sbjct: 42 NTIGTDFKNYKIIRKKQFACSTMQVRRDKKMPVALLQDYDEAIISAAYPVFEVVDTEMIL 101 Query: 330 STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL + + G+R S+++ED + + VP I EQ +I N + R Sbjct: 102 PEYLMMWFSRSEFDREACFYAIGGVRGSIEWEDFCNMQLPVPSIDEQKEIINKHKILLDR 161 Query: 389 IDVLVEKIEQSIVLLKE 405 ++ I L+E Sbjct: 162 ----IKVNNLFIQKLEE 174 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 62/418 (14%), Positives = 115/418 (27%), Gaps = 47/418 (11%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 I +L R + S T +++P N+ +D I K Q Sbjct: 6 KRIGDCIRLVDERNVNLNVTTLL-----GLSITKEFIPSVANTIGTDFKNYKIIRKKQFA 60 Query: 88 YGKL---GPYLRKAIIADFD--GICSTQFLVL---QPKDVLPELLQGWLLSIDVTQRIEA 139 + + I S + V + +LPE L W + + Sbjct: 61 CSTMQVRRDKKMPVALLQDYDEAIISAAYPVFEVVDTEMILPEYLMMWFSRSEFDREACF 120 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G +W+ N+ +P+P + EQ ++II + + I FI+ L+E Sbjct: 121 YAIGGVRGSIEWEDFCNMQLPVPSIDEQ----KEIINKHKILLDRIKVNNLFIQKLEETV 176 Query: 200 QALVSYIVTKGLNPDV---KMKDSGIEW------VGLVPDHWEVKPFFALVTELNRKNTK 250 Q + P+ K SG + +P WEVK F +V Sbjct: 177 QTIYKQWFIDFEFPNQLGNPYKSSGGKMKFNPILNTEIPKGWEVKSFTDVVKVGGGGTPD 236 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYE--------TYQIVDPGEIVFRFIDLQNDK 302 + + + + + P VF Sbjct: 237 TTIDTYWNGGIPFFTPGDVSESYYCLETEKSVSKLGLRNSSTKLYPKNTVFVTARGTVGA 296 Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 +L ++ D+ Y + + + + +L +D Sbjct: 297 IALAGTEMTMNQSC-------YALMGDNQYYIHQLTIATIRSLKKQASGAVFNALIVKDF 349 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT--GQ 418 V+ PP I N + + + LL ++ T G+ Sbjct: 350 AEQNVVHPPKD----IENSFQNIVRGLYNAIYLKVELNKLLSSTVKLLLSKLATTRGK 403 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 49/170 (28%), Gaps = 12/170 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQ 72 IPK W+V K+ G T ++ I + DV ES K + Sbjct: 213 EIPKGWEVKSFTDVVKVGGGGTPDTTIDTYWNGGIPFFTPGDVSESYYCLETEKSVSKLG 272 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S+ ++ K + G +A + + L L+I Sbjct: 273 LRNSSTKLYPKNTVFVTARGTV-GAIALAGTEMTMNQSCYALMG----DNQYYIHQLTIA 327 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + ++ GA + K + PP + + + I Sbjct: 328 TIRSLKKQASGAVFNALIVKDFAEQNVVHPPKDIENSFQNIVRGLYNAIY 377 >gi|321310236|ref|YP_004192565.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802080|emb|CBY92726.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 195 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 52/160 (32%), Gaps = 13/160 (8%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + N+ + ++ + E + I+ P ++ Sbjct: 28 FSSNKYMNSGSPIIRVRNVQKNQLTTNGLVYFSDTDYEDDLSKYILKPRDLAVTLTG--- 84 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGI--DSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 K + V + + S + P D YL + + +L + G+ L Sbjct: 85 -KAMVFLNTVDDSFYMGSDICRLDPDLEVLDREYLFHFLSNLNLDSIVK---YGMIPHLD 140 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 K+L + VPP+ Q +I + + +I L + +Q Sbjct: 141 VGKFKKLEIRVPPLSLQKEIASKLG----KIQELRLRKKQ 176 Score = 41.7 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 52/173 (30%), Gaps = 10/173 (5%) Query: 30 IKRFTKLNTGRTSESGK----DIIYIGLEDVESG---TGKYLPKDGNSRQSDTSTVSIFA 82 ++ KL G+ S K I + +V+ T + + D S I Sbjct: 16 LEEVCKLQRGKAFSSNKYMNSGSPIIRVRNVQKNQLTTNGLVYFSDTDYEDDLSKY-ILK 74 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 + G + D + L P + + + ++ +++I + Sbjct: 75 PRDLAVTLTGKAMVFLNTVDDSFYMGSDICRLDPDLEVLDREYLFHFLSNL--NLDSIVK 132 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + H D + + +PPL+ Q I K+ ++ Sbjct: 133 YGMIPHLDVGKFKKLEIRVPPLSLQKEIASKLGKIQELRLRKKQHGYYRKQIW 185 >gi|75765536|pdb|1YDX|A Chain A, Crystal Structure Of Type-I Restriction-Modification System S Subunit From M. Genitalium Length = 406 Score = 53.3 bits (126), Expect = 8e-05, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 48/141 (34%), Gaps = 4/141 (2%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K E N G+K I R + T + P Sbjct: 63 KYEYFNGGVKNSGRTDKFNTFKNTISVIVGGSCGYVRLADKNFFCGQSNCTLNLL--DPL 120 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVE 385 +D + + ++S A G+ ++ +++ D+K L + EQ I N ++V Sbjct: 121 ELDLKFAYYALKSQQERIEALAFGTTIQ-NIRISDLKELEIPFTSNKNEQHAIANTLSVF 179 Query: 386 TARIDVLVEKIEQSIVLLKER 406 R++ L IE + L E Sbjct: 180 DERLENLASLIEINRKLRDEY 200 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 46/396 (11%), Positives = 95/396 (23%), Gaps = 41/396 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +W I L G E + + GKY +G + S + K Sbjct: 35 NWTKRTIDSLFDLKKGEXLEKE----------LITPEGKYEYFNGGVKNSGRTDKFNTFK 84 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I G + + + + +L + +RIEA+ G Sbjct: 85 NTISVIVGGSCGYVRLADKNFFCGQSNCTLNLLDPLELDLKFAYYALKSQQERIEALAFG 144 Query: 144 ATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 T+ + + + +P EQ I + R++ L + +L E L Sbjct: 145 TTIQNIRISDLKELEIPFTSNKNEQHAIANTLSVFDERLENLASLIEINRKLRDEYAHKL 204 Query: 203 --VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + + +G + + K + K Sbjct: 205 FSLDEAFLSHWKLEALQSQXHEITLGEIFNFKSGKYLKSEERLEEGKFPYY--------- 255 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N G E + I + T + Sbjct: 256 ------GAGIDNTGFVAEPNTEKDTI--------SIISNGYSLGNIRYHEIPWFNGTGSI 301 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL-VPPIKEQFDIT 379 + + Y + S L + + V V + Q Sbjct: 302 ALEPXNNEIYVPFFYCALKYLQKDIKERXKSDDSPFLSLKLAGEIKVPYVKSFQLQRKAG 361 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + ++D ++ + L R + + Sbjct: 362 KIVFLLDQKLDQ----YKKELSSLTVIRDTLLKKLF 393 >gi|325913620|ref|ZP_08175983.1| hypothetical protein HMPREF0523_0356 [Lactobacillus iners UPII 60-B] gi|325477078|gb|EGC80227.1| hypothetical protein HMPREF0523_0356 [Lactobacillus iners UPII 60-B] Length = 207 Score = 53.3 bits (126), Expect = 9e-05, Method: Composition-based stats. Identities = 23/201 (11%), Positives = 55/201 (27%), Gaps = 15/201 (7%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNIL--------SLSYGNIIQKLETRNMGLKPESYE 281 DH K E +S ++ + + + E Sbjct: 8 DHRRTCRAEEYFDIAIGKTPPRKEHQWFTTNPSDVTWVSISDMGSCGTYISRSSEQLTQE 67 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSY 340 + + + L R A A + YL +R Sbjct: 68 AVDKFNIKVVPSNTVLLSFKLTIGRIAITHGEMTTNEAIAHFKTDKPFINEYLYCYLR-- 125 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 D S + ++ + +K +P ++P E + + + + + Sbjct: 126 DFNYQTMGSTSSIAIAVNSKIIKAMPFVIPADDE----ISRFHSVVGPMFEQILNNQLEN 181 Query: 401 VLLKERRSSFIAAAVTGQIDL 421 L + R + + ++G++D+ Sbjct: 182 DSLADLRDTLLPRLMSGELDV 202 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 49/189 (25%), Gaps = 14/189 (7%) Query: 27 VVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGTGKYLPKDGNSRQS--DT 75 + + + G+T + D+ ++ + D+ S Q D Sbjct: 12 TCRAEEYFDIAIGKTPPRKEHQWFTTNPSDVTWVSISDMGSCGTYISRSSEQLTQEAVDK 71 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + + +L R AI K + E L +L + Sbjct: 72 FNIKVVPSNTVLLSFKLTIGRIAITHGEMTTNEAIAHFKTDKPFINEYLYCYLRDFNYQT 131 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + K I +P IP E + +I E +L Sbjct: 132 MGS---TSSIAIAVNSKIIKAMPFVIPADDEISRFHSVVGPMFEQILNNQLENDSLADLR 188 Query: 196 KEKKQALVS 204 L+S Sbjct: 189 DTLLPRLMS 197 >gi|302024402|ref|ZP_07249613.1| type I restriction enzyme, specificity subunit [Streptococcus suis 05HAS68] Length = 198 Score = 53.3 bits (126), Expect = 9e-05, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 16/172 (9%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 W+ + + ++ K + S G I + ++ E+ Y+ V PG+ Sbjct: 20 WKQRKAMEIFKFVSDKGYADLPILSASQELGMIRRDEIGIDIKYDKEAVANYKRVLPGQF 79 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMG 350 V Q A G+ + AY + +S+ ++ S + K + Sbjct: 80 VIHLRSFQG-----GFAWSEIEGLTSPAYTILDFKEENSSKFWRNVLTSPNFIKKLETVT 134 Query: 351 SGLR--QSLKFEDVK--RLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 G+R +S+ + D + + EQ I + + +D L+ ++ Sbjct: 135 YGIRDGRSISYSDFSTLNFVIPT--LPEQEAIGSF----FSDLDQLITLHQR 180 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 19/160 (11%), Positives = 41/160 (25%), Gaps = 7/160 (4%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLE-DVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 WK K + + D+ + ++ + D + + Sbjct: 20 WKQRKAMEIFKFVSDK---GYADLPILSASQELGMIRRDEIGIDIKYDKEAVANYKRVLP 76 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG--WLLSIDVTQRIEAIC 141 GQ + L + ++ +G+ S + +L K+ + + Sbjct: 77 GQFVI-HLRSFQGGFAWSEIEGLTSPAYTILDFKEENSSKFWRNVLTSPNFIKKLETVTY 135 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + IP L EQ I I Sbjct: 136 GIRDGRSISYSDFSTLNFVIPTLPEQEAIGSFFSDLDQLI 175 >gi|258513151|ref|YP_003189407.1| hypothetical protein APA01_42410 [Acetobacter pasteurianus IFO 3283-01] gi|256635054|dbj|BAI01028.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01] gi|256638109|dbj|BAI04076.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-03] gi|256641163|dbj|BAI07123.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-07] gi|256644218|dbj|BAI10171.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-22] gi|256647273|dbj|BAI13219.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-26] gi|256650326|dbj|BAI16265.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-32] gi|256653317|dbj|BAI19249.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01-42C] gi|256656370|dbj|BAI22295.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-12] Length = 198 Score = 53.3 bits (126), Expect = 9e-05, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 50/137 (36%), Gaps = 16/137 (11%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDL 342 + PG+I+F + + + + + + ++ + YLAW + Sbjct: 66 WLRPGDILFPARGNVSLAVLVNESIGSLQAVAAPHFFLLRVMHPNVLPAYLAWWLNQEPA 125 Query: 343 CKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + A S L +++ ++ PV++PP+ Q I L +++ Sbjct: 126 QRHLEQNAQSSTLVRNIARPVLEATPVILPPLPRQEQIV-----------GLANAMQREE 174 Query: 401 VLLKERRSSFIAAAVTG 417 LL R + +TG Sbjct: 175 DLLHRLRQT-NQQIMTG 190 >gi|15828555|ref|NP_325915.1| restriction-modification enzyme subunit S3B [Mycoplasma pulmonis UAB CTIP] gi|14089497|emb|CAC13257.1| RESTRICTION-MODIFICATION ENZYME SUBUNIT S3B [Mycoplasma pulmonis] Length = 348 Score = 53.3 bits (126), Expect = 9e-05, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 59/160 (36%), Gaps = 8/160 (5%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 N + + + + V G+ +F D+ + + + + Sbjct: 32 NYMDVFKNYYLNDKNELRLYNATNKEIEKFGVSYGDAIFTASSETKDEIAFSTIYLSNKV 91 Query: 315 IITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP 370 I + + + + + Y A+L RS + K +G R ++ + ++ + +P Sbjct: 92 NIVNGFCKIYKYDKNLLMPKYAAYLFRSKEFRKQAIKFTTGYTRFNISIASLNKIEINIP 151 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +K Q I N+ ++VL+E + L + ++S Sbjct: 152 SLKTQSAILNI----FEPLEVLLENVRNVKNKLNKFQNSL 187 >gi|237740354|ref|ZP_04570835.1| type I restriction-modification enzyme [Fusobacterium sp. 2_1_31] gi|229422371|gb|EEO37418.1| type I restriction-modification enzyme [Fusobacterium sp. 2_1_31] Length = 188 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 47/164 (28%), Gaps = 10/164 (6%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVE--SGTGKYLPKDGNSRQS 73 WK V + L G+T + +I + D+ + Sbjct: 7 NEWKKVKLGDVFDLQMGKTPLRENKLYWDKGEYHWISISDMNFSEKYISSTKEKITELAV 66 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 S + I K ++ + K I + D + + PK S+ Sbjct: 67 KKSGIKIIPKNTVIMS-FKLSIGKVKIVNEDIYSNEAIMAFIPKTNNFIDENFLYYSLKG 125 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + E I + + I + +P LA Q I + + Sbjct: 126 VRWNEGINKAVKGLTLNKALISQKEIFLPNLAIQKEIASNLDSI 169 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 61/187 (32%), Gaps = 11/187 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE---S 279 EW + N+ E + +S+S N +K + E Sbjct: 8 EWKKVKLGDVFDLQMGKTPLRENKLYWDKGEYHWISISDMNFSEKYISSTKEKITELAVK 67 Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 +I+ ++ F + + I+ A++ + ID +L + ++ Sbjct: 68 KSGIKIIPKNTVIMSFKLSIGKVKIVNEDIYSNEAIM--AFIPKTNNFIDENFLYYSLKG 125 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + GL +L + + + +P + Q +I + ++ I + + Sbjct: 126 VRWNEGINKAVKGL--TLNKALISQKEIFLPNLAIQKEIASNLDS----IADFLNLRRKQ 179 Query: 400 IVLLKER 406 + L+E Sbjct: 180 LNYLEEL 186 >gi|12045297|ref|NP_073108.1| type I restriction modification DNA specificity domain-containing protein [Mycoplasma genitalium G37] gi|255660060|ref|ZP_05405469.1| type I restriction modification DNA specificity domain-containing protein [Mycoplasma genitalium G37] gi|2496433|sp|Q49434|T1SX_MYCGE RecName: Full=Putative type-1 restriction enzyme specificity protein MG438; AltName: Full=S.MgeORF438P; AltName: Full=Type I restriction enzyme specificity protein MG438; Short=S protein gi|3845029|gb|AAC72457.1| type I restriction modification DNA specificity domain protein [Mycoplasma genitalium G37] gi|166078723|gb|ABY79341.1| type I restriction modification DNA specificity domain protein [synthetic Mycoplasma genitalium JCVI-1.0] Length = 383 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 48/141 (34%), Gaps = 4/141 (2%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K E N G+K I R + T + P Sbjct: 40 KYEYFNGGVKNSGRTDKFNTFKNTISVIVGGSCGYVRLADKNFFCGQSNCTLNLL--DPL 97 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVE 385 +D + + ++S A G+ ++ +++ D+K L + EQ I N ++V Sbjct: 98 ELDLKFAYYALKSQQERIEALAFGTTIQ-NIRISDLKELEIPFTSNKNEQHAIANTLSVF 156 Query: 386 TARIDVLVEKIEQSIVLLKER 406 R++ L IE + L E Sbjct: 157 DERLENLASLIEINRKLRDEY 177 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 47/396 (11%), Positives = 96/396 (24%), Gaps = 41/396 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +W I L G E + + GKY +G + S + K Sbjct: 12 NWTKRTIDSLFDLKKGEMLEKE----------LITPEGKYEYFNGGVKNSGRTDKFNTFK 61 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 I G + + + + +L + +RIEA+ G Sbjct: 62 NTISVIVGGSCGYVRLADKNFFCGQSNCTLNLLDPLELDLKFAYYALKSQQERIEALAFG 121 Query: 144 ATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 T+ + + + +P EQ I + R++ L + +L E L Sbjct: 122 TTIQNIRISDLKELEIPFTSNKNEQHAIANTLSVFDERLENLASLIEINRKLRDEYAHKL 181 Query: 203 --VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + + +G + + K + K Sbjct: 182 FSLDEAFLSHWKLEALQSQMHEITLGEIFNFKSGKYLKSEERLEEGKFPYY--------- 232 Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N G E + I + T + Sbjct: 233 ------GAGIDNTGFVAEPNTEKDTI--------SIISNGYSLGNIRYHEIPWFNGTGSI 278 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL-VPPIKEQFDIT 379 + + Y + M S L + + V V + Q Sbjct: 279 ALEPMNNEIYVPFFYCALKYLQKDIKERMKSDDSPFLSLKLAGEIKVPYVKSFQLQRKAG 338 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++ + ++D ++ + L R + + Sbjct: 339 KIVFLLDQKLDQ----YKKELSSLTVIRDTLLKKLF 370 >gi|120401063|ref|YP_950892.1| hypothetical protein Mvan_0035 [Mycobacterium vanbaalenii PYR-1] gi|119953881|gb|ABM10886.1| hypothetical protein Mvan_0035 [Mycobacterium vanbaalenii PYR-1] Length = 400 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 48/421 (11%), Positives = 101/421 (23%), Gaps = 51/421 (12%) Query: 26 KVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 + V + + G + + G + + D Sbjct: 3 ESVRLGDLISVKHGYAFPGEGFTEDPTYPILVTPGNFAIEGGFKESKPKTFNGDYPPGFE 62 Query: 81 FAKGQILYGKLG------PYLRKAIIADFDGICSTQ---FLVLQPKDVLPELLQGWL-LS 130 A G ++ A+I Q + + + L + + Sbjct: 63 LAPGDLVVSMTDLSRDGATLGMPALIPAGPTYLHNQRIGLIEAIDRSKIDRLFLNYYLRT 122 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 I G+T+ H I + +P L EQ I + + +I Sbjct: 123 AAYRSHILGTASGSTVRHTSPSRIEDFVALLPGLLEQQAIGAILGSLDDKIGVNRRLANV 182 Query: 191 FIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTK 250 L E S +G + + L + Sbjct: 183 GRLLQSELW--------------HRAATGSRQVSLGSLVRPHLGGTPSRSDSNLWAGDVP 228 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 +S + G ++ +S + + +L A Sbjct: 229 WASVRDMSAADGGVLLATAETISSAVSQSVGRLAALPERSVALTARGTVGKVVTLGVASA 288 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 I + P L + S A GS + ++ ++ + V P Sbjct: 289 -----INQSAYGFIPPAGRGVALRCALESISDELKARAHGS-VFSTITMSTLESVRV--P 340 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR------SSFIAAAVTGQIDLRGE 424 I E + L ++ + L+E R + ++G+I ++ Sbjct: 341 AINE--------TDWDGVCESLELIEDRRLSALRETRVLARTRDELLPLLMSGRIRVKDA 392 Query: 425 S 425 Sbjct: 393 E 393 >gi|321310220|ref|YP_004192549.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802064|emb|CBY92710.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 204 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 51/150 (34%), Gaps = 12/150 (8%) Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNM--GLKPESYETYQIVDPGEIVFRFIDLQ 299 +S L +I + P+++ I+ G++V + Sbjct: 28 CGTVFGRRFYKDSGFPVLKTSDIWNGQIVTDDLSYCDPKNHPNANIIKRGDVVITNVG-- 85 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA-WLMRSYDLCKVFYAMGSGLRQSLK 358 K ++ I T + + + + YL +L+ + + G L+ Sbjct: 86 --KVAINLTDQEFFFISTIFKLVPRKDVLIAKYLYHFLLENPEEVDRLIREG-----RLR 138 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETAR 388 D+K L + VP + Q I N ++ ++ Sbjct: 139 KSDLKELAIPVPSSEIQARIVNSLDSNFSK 168 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 17/187 (9%), Positives = 53/187 (28%), Gaps = 15/187 (8%) Query: 29 PIKRFTKLNT-----GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + ++ GR + D+ +G + +I + Sbjct: 18 KLGEVCRIVLCGTVFGRRFYKDSGFPVLKTSDIWNGQI-VTDDLSYCDPKNHPNANIIKR 76 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G ++ +G + D + + L P+ + + ++ + ++ + Sbjct: 77 GDVVITNVGKV--AINLTDQEFFFISTIFKLVPRKDVLIAKYLYHFLLENPEEVDRLIRE 134 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR----IDTLITERIRFIELLKEKK 199 + + +P+P Q I + + + IT E + + Sbjct: 135 G---RLRKSDLKELAIPVPSSEIQARIVNSLDSNFSKTTRVHSEEITNDTSLQETVVLEH 191 Query: 200 QALVSYI 206 ++ + Sbjct: 192 KSFWQRL 198 >gi|325973249|ref|YP_004250313.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323651851|gb|ADX97933.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 190 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 57/197 (28%), Gaps = 16/197 (8%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 G +P+ W+ + L + + T + Sbjct: 8 GELPEGWKRVKIGEISKILKGTKPANHANLLGGGGKYPFFTSSFTTKRSYTFSYDSFSLL 67 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 V G F + Y+ ++ + + + L K+ Sbjct: 68 VSEGGSTFH-----------AKIYKGKFEASNHTYVIDLEEKENTYLVLEFLNNIHLPKL 116 Query: 346 FYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + + ++L + +K + +L+P I N I +EK+E + + Sbjct: 117 NWFTCATTFLKNLSPQKLKEIEILIPD----QKILEKFNNFWKNIHSKIEKLELKMQKYE 172 Query: 405 ERRSSFIAAAVTGQIDL 421 E + + + + +I + Sbjct: 173 EIKKKLLNSLFSQEIQV 189 Score = 45.6 bits (106), Expect = 0.018, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 71/193 (36%), Gaps = 13/193 (6%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 G +P+ WK V I +K+ G + +++ G G P +S + S Sbjct: 8 GELPEGWKRVKIGEISKILKGTKPANHANLL---------GGGGKYPFFTSSFTTKRSYT 58 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + +L + G I + +++ + L+ +L +I + + Sbjct: 59 FSYDSFSLLVSEGGSTFHAKIYKGKFEASNHTYVIDLEEKENTYLVLEFLNNIHLPKLNW 118 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 C + + + + I + IP I EK I + I + ++ +E Sbjct: 119 FTCATTFLKNLSPQKLKEIEILIPD----QKILEKFNNFWKNIHSKIEKLELKMQKYEEI 174 Query: 199 KQALVSYIVTKGL 211 K+ L++ + ++ + Sbjct: 175 KKKLLNSLFSQEI 187 >gi|207092148|ref|ZP_03239935.1| type I restriction-modification system specificity subunit [Helicobacter pylori HPKX_438_AG0C1] Length = 116 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 9/105 (8%), Positives = 29/105 (27%), Gaps = 9/105 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 P +W+ V + ++ G + ++ ++ + D+ + + Sbjct: 11 PLNWQRVRLGDIAEIKRGASPRPIENPKWFCANSNVGWVRISDISKNSRFLYKTAQKLSK 70 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 + + ++ + I I + PK Sbjct: 71 KGIEKSRLVKQNSLIMSMCTTIGKPIITKIDTCIHDGFVVFENPK 115 >gi|332075505|gb|EGI85973.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17545] Length = 244 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 36/253 (14%), Positives = 72/253 (28%), Gaps = 35/253 (13%) Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 Q + GAT+ H + + ++ + + + EQ I + I + L Sbjct: 6 QYLRDHSTGATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNLL 65 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + K E G V + + L +N K + Sbjct: 66 V----------------------KSRFNEMFGDVILNEKEWKVSKWNEILTIRNGKNQKQ 103 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + I Y IV ++ N +R Sbjct: 104 VEDADGKFPIYGSGGI-------MGYAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDT 156 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + I+S YL + + Y+ K+ A+ SL D+ + + +PP+ Sbjct: 157 AFG---LEPVLEKINSEYLFYFCQLYNFEKLNKAV---TIPSLTKSDLLNISIPLPPLAL 210 Query: 375 QFDITNVINVETA 387 Q + + + + Sbjct: 211 QNEFADFVALVDK 223 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 82 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 129 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 130 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 186 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 187 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 217 >gi|19881313|gb|AAM00903.1|AF486570_4 5' truncated HsdS [Campylobacter jejuni subsp. jejuni ATCC 33560] Length = 186 Score = 52.9 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 46/181 (25%), Gaps = 4/181 (2%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + N E +++ G+I + Q + + Sbjct: 1 MLGEICERQKGINITAGEMEKIAIQNGDIRIFAGGKTFIDTKMELLQEQNILKKTSIIVK 60 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 D + + + + +L + + + Sbjct: 61 SRGYVDFEYYAKPFTHKNELWSYSLNPDTKDINLKFIFYYLKNKVEYFQKIARANAVKIP 120 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFI 411 L D R + +PP+ Q I N+++ A L I I K+ R+ + Sbjct: 121 QLAVADTDRFQIPIPPLATQEKIVNILDQFHALTTDLQSGIPAEIEARKKQYEYYRNQLL 180 Query: 412 A 412 Sbjct: 181 T 181 >gi|329575568|gb|EGG57105.1| hypothetical protein HMPREF9520_01722 [Enterococcus faecalis TX1467] Length = 177 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 18/160 (11%), Positives = 48/160 (30%), Gaps = 13/160 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDII------YIGLEDVESGTGKYLPKDGNSRQSDTS 76 + W++ +R + + + YI D+ + + ++ N Sbjct: 18 EDWELCKFERIFEKVKSYSLSREVETNEFTGMKYIHYGDIHTKKADKVSENSNIPNIIKK 77 Query: 77 TVSIFAKGQILYGKLGPYLRKAI-------IADFDGICSTQFLVLQPKDVLPELLQGWLL 129 ++ G ++ + FD + + L+PK++ P L + Sbjct: 78 NFALLEIGDLILTDASEDYKGIATPAVIRENTSFDIVAGLHTIALRPKNIDPMFLYYLIK 137 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL 169 + + + G + + + IP +Q Sbjct: 138 APTFRKYGYKVGTGMKVFGISSSKVLDFTTYIPKKMKQNW 177 Score = 42.9 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 54/168 (32%), Gaps = 4/168 (2%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 P ++ + +W + K ++ N I + N Sbjct: 9 PRLRFRGFQEDWELCKFERIFEKVKSYSLSREVETNEFTGMKYIHYGDIHTKKADKVSEN 68 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRS---LRSAQVMERGIITSAYMAVKPHGID 329 + + + +++ G+++ + + + +A++P ID Sbjct: 69 SNIPNIIKKNFALLEIGDLILTDASEDYKGIATPAVIRENTSFDIVAGLHTIALRPKNID 128 Query: 330 STYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQF 376 +L +L+++ K Y +G+G+ + V +P +Q Sbjct: 129 PMFLYYLIKAPTFRKYGYKVGTGMKVFGISSSKVLDFTTYIPKKMKQN 176 >gi|300727863|ref|ZP_07061242.1| restriction modification system DNA specificity domain protein [Prevotella bryantii B14] gi|299774847|gb|EFI71460.1| restriction modification system DNA specificity domain protein [Prevotella bryantii B14] Length = 351 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 38/343 (11%), Positives = 85/343 (24%), Gaps = 50/343 (14%) Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVL 114 G+ P G + D I + + + + I + + ++ Sbjct: 34 KEGELYPYYGATGVVDYINDYITDEELLCIAEDCGNYKAGEDSSYIINGKAWVNNHAHLV 93 Query: 115 QPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + K+ +L + G T K + IP+ +P + Q Sbjct: 94 KAKEC---CEIKYLHQYLKITDLMPYVSGTTRLKLTQKKMKEIPVLLPSIELQNKFVSIA 150 Query: 175 IAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVP----D 230 L+S IE G P Sbjct: 151 EQADK-----------------SGFDGLISQF---------------IEMFGQSPLINDM 178 Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + + I + I+ + + G+ + Y+ Y + + Sbjct: 179 NECFSVIRNGANIKQGQIEGGIPITRIETISEEIVDRAKMGYAGIIDDKYKPYY-LQNND 237 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGII----TSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 I+ I+ I + K ++ + ++RS + Sbjct: 238 ILISHINSLKHIGKCALYSQTGNETIIHGMNLLCLRPKCEIMNPVFAIHMLRSNIIKNEI 297 Query: 347 YAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + S +D R+ ++P + EQ + Sbjct: 298 ANITKPAVNQASFSVKDFGRIKAILPNMDEQKKFVRIAEQTDK 340 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 38/118 (32%), Gaps = 3/118 (2%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 Y I D + S + + + + VK + +L + Sbjct: 49 DYINDYITDEELLCIAEDCGNYKAGEDSSYIINGKAWVNNHAHLVKAKEC--CEIKYLHQ 106 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR-IDVLVEK 395 + + + R L + +K +PVL+P I+ Q ++ D L+ + Sbjct: 107 YLKITDLMPYVSGTTRLKLTQKKMKEIPVLLPSIELQNKFVSIAEQADKSGFDGLISQ 164 >gi|270668225|ref|ZP_06222510.1| type I restriction/modification enzyme [Haemophilus influenzae HK1212] gi|270316717|gb|EFA28495.1| type I restriction/modification enzyme [Haemophilus influenzae HK1212] Length = 263 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 51/159 (32%), Gaps = 5/159 (3%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 N E + Q++ + G K S + + I + Sbjct: 93 NDPNTEKRKILQILEQQYQQVRCTSEGEKLGSESFCHQEEYRLLNEITISASGANAGFVN 152 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 E+ + + + + ++ ++S +F + + +D+KRLP+ Sbjct: 153 FW-TEKIFASDCTTVRADNYVGTKFIFTYLQSIQ-ENIFDLARGAAQPHVYPDDIKRLPI 210 Query: 368 LVPPIKEQFDITN---VINVETARIDVLVEKIEQSIVLL 403 P+ Q + I+ E R + +E+ I + Sbjct: 211 PKVPLDIQQKVVEECQKIDDEFNRTRMQIEEYRAKIAKI 249 >gi|126668196|ref|ZP_01739157.1| specificity determinant for hsdM and hsdR [Marinobacter sp. ELB17] gi|126627345|gb|EAZ97981.1| specificity determinant for hsdM and hsdR [Marinobacter sp. ELB17] Length = 132 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 10/65 (15%), Positives = 26/65 (40%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + L ++ + P Q +I ++ A D + +++ ++ + S +A Sbjct: 22 QPYLNTSLLEEFHIHAPSKGAQTEIIRRVDQLFAYADTIEKQVNNALARVNSLTQSILAK 81 Query: 414 AVTGQ 418 A G+ Sbjct: 82 AFRGE 86 >gi|313157425|gb|EFR56847.1| hypothetical protein HMPREF9720_1028 [Alistipes sp. HGB5] Length = 188 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 60/172 (34%), Gaps = 12/172 (6%) Query: 247 KNTKLIESNILSLSYGNIIQKLE-TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 KN ++ L ++ + + + T S ++ +++ +N Sbjct: 17 KNAPSPDTCYLQVNDFDEVGNIRPTVRPTTTVSSKAARHLLTESDLLLAAKGGKNF--CA 74 Query: 306 RSAQVMERGIITSAYMAVK---PHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 + + + + +++ ++ P I YL + ++ A G SL D Sbjct: 75 IAPTQLGPCVASPSFLIIRIDDPARILPEYLCGFLNLPSTRQLLTAQAQGSAITSLSKAD 134 Query: 362 VKRLPVLVPPIKEQFD-IT-NVINVETARIDVLVEKIEQSI---VLLKERRS 408 ++ V +PP++ Q I ++ + + + + I L K + Sbjct: 135 LEEFDVPLPPLERQRACIALTRLHRREQALYKAIAERRRQITDCKLTKIYKD 186 Score = 38.2 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 37/177 (20%), Positives = 64/177 (36%), Gaps = 9/177 (5%) Query: 28 VPIKRFTKLNTGRTSES--GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 V +K + TG ++ D Y+ + D + + S + + + Sbjct: 2 VKLKDIATIQTGVYLKNAPSPDTCYLQVNDFDEVGNIRPTVRPTTTVSSKAARHLLTESD 61 Query: 86 ILYGKLGPYLRKAIIADFDGIC--STQFLVLQ---PKDVLPELLQGWLLSIDVTQRIEAI 140 +L G AI G C S FL+++ P +LPE L G+L Q + A Sbjct: 62 LLLAAKGGKNFCAIAPTQLGPCVASPSFLIIRIDDPARILPEYLCGFLNLPSTRQLLTAQ 121 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREK--IIAETVRIDTLITERIRFIELL 195 +G+ ++ + +P+PPL Q + + I ER R I Sbjct: 122 AQGSAITSLSKADLEEFDVPLPPLERQRACIALTRLHRREQALYKAIAERRRQITDC 178 >gi|227511530|ref|ZP_03941579.1| possible type I site-specific deoxyribonuclease specificity subunit [Lactobacillus buchneri ATCC 11577] gi|227085175|gb|EEI20487.1| possible type I site-specific deoxyribonuclease specificity subunit [Lactobacillus buchneri ATCC 11577] Length = 177 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 54/143 (37%), Gaps = 12/143 (8%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 I+ + Y++V +I + + + + + E GI++ AY Sbjct: 25 NSGIVDANVLNRKDNSNSNKSNYKVVHANDIAYNSMRMWQGASGVSN----ELGIVSPAY 80 Query: 321 MAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFEDVKRLPVLVPPIKEQF 376 +KP D + +L + + + F GL +LK++ +K + V +P EQ Sbjct: 81 TVLKPRVGLDVRFWGYLFKLTKMLQEFQKNSQGLTSDTWNLKYKQIKSIEVTMPSKNEQN 140 Query: 377 DITNVINVETARIDVLVEKIEQS 399 I + ++D + + Sbjct: 141 AI----SQLLQKLDFSIAANLRQ 159 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 60/155 (38%), Gaps = 7/155 (4%) Query: 31 KRFTKLNTGRTSESGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYG 89 + + G+ + + + + SG + ++ S+ S + I Y Sbjct: 2 GEIFEERKE--NPKGQTLKMLSVT-INSGIVDANVLNRKDNSNSNKSNYKVVHANDIAYN 58 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI-DVTQRIEAICEGAT--M 146 + + + +++ GI S + VL+P+ L G+L + + Q + +G T Sbjct: 59 SMRMWQGASGVSNELGIVSPAYTVLKPRVGLDVRFWGYLFKLTKMLQEFQKNSQGLTSDT 118 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + +K I +I + +P EQ I + + I Sbjct: 119 WNLKYKQIKSIEVTMPSKNEQNAISQLLQKLDFSI 153 >gi|169825070|ref|YP_001692681.1| type I restriction-modification system specificity subunit [Finegoldia magna ATCC 29328] gi|167831875|dbj|BAG08791.1| type I restriction-modification system specificity subunit [Finegoldia magna ATCC 29328] Length = 254 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 68/189 (35%), Gaps = 20/189 (10%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 ++ + + K + + P++Y I+ PG++V + + R Sbjct: 70 TEQVYCMRGADIPEIKVGNKGKMPTRYILPKNYAKK-ILTPGDVVVEISGGSPTQSTGRV 128 Query: 308 AQV--------MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSL 357 A V + + T+ A+KP S +L + + VF Y G+ ++L Sbjct: 129 AAVSQSLLDRYDQEMVCTNFCRAMKPKNGYSMFLYFYWQYLYDLNVFFLYENGTTGIKNL 188 Query: 358 KFEDVKRL-PVLVPPIKEQFDITNVINVETARI--DVLVEKIEQSIVLLKERRSSFIAAA 414 + + +P + + ++ + +I + L L R S + Sbjct: 189 DLKGFLSTEKIRIPSFDDACEFEDICHKYFDKIFYNGLEN------EKLSSLRDSLLPQL 242 Query: 415 VTGQIDLRG 423 ++G++D+ Sbjct: 243 MSGELDVSD 251 >gi|320527412|ref|ZP_08028594.1| conserved domain protein [Solobacterium moorei F0204] gi|320132269|gb|EFW24817.1| conserved domain protein [Solobacterium moorei F0204] Length = 213 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 80/208 (38%), Gaps = 8/208 (3%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 ++ V K + E ++ + N K T E +LS + + + + Sbjct: 10 IDDLVLQKKYITNSLLESIQDNEKIMLKDVLFDYNVKTTVNNEYPVLSSTASGMYLQSDY 69 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GI 328 N ++ Y+IV G +R + + +++E+GI++ AY + I Sbjct: 70 FNKETSSDNTIGYKIVPRGYCTYRSMSDTG-LFTFNMQKLVEKGIVSPAYPVFSSNDDYI 128 Query: 329 DSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + +L S + K + G R +L F + L + P ++++ + ++ Sbjct: 129 NEFIILYLNNSSYIKKQILESKSGGTRFALPFSALCTLKI--PKLEKEKQLASI--KTVT 184 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +E E + L ++++ + Sbjct: 185 AFERKIENEEIILDKLHQQKNYLLNNVF 212 >gi|315225323|ref|ZP_07867137.1| type I restriction-modification system [Capnocytophaga ochracea F0287] gi|314944596|gb|EFS96631.1| type I restriction-modification system [Capnocytophaga ochracea F0287] Length = 172 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 48/169 (28%), Gaps = 7/169 (4%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +G + ++ I ++ + + K S Sbjct: 4 LGEYKKGPFGSSLTKSMFVPFSQSAIKIYEQKNAIKKDYSLGEYYISKEKFKDMS---AF 60 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTYLAWLMRSYDLC 343 V P +I+ + L + GII A M S + + Sbjct: 61 QVLPSDIIVSCAGTIGETYILPKEAPI--GIINQALMKVALFEYKISEFWRTFFEYILVK 118 Query: 344 KVFYAMGSGLRQSLK-FEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 +++ FE +K++ +PP+KEQ I I I+ Sbjct: 119 DSTMKGAGSAIKNIPPFEYLKKILTPLPPLKEQQRIVEKIEELIPHIEH 167 >gi|186683508|ref|YP_001866704.1| hypothetical protein Npun_R3328 [Nostoc punctiforme PCC 73102] gi|186465960|gb|ACC81761.1| hypothetical protein Npun_R3328 [Nostoc punctiforme PCC 73102] Length = 260 Score = 52.9 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 14/117 (11%), Positives = 41/117 (35%), Gaps = 1/117 (0%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 ++V+ +++ ++ + T + ++ YL + +R Sbjct: 91 RKVVNAYDLIISTCRPTRGAIAVIPEIYHNQICSTGFSVIRPKKEVNPFYLHFAIRLAST 150 Query: 343 CKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + F +G ++ DV + + +P + Q I + + + +EK + Sbjct: 151 LEQFRKFSTGSSYPAILDSDVNKTLIPLPDKETQDLIASHVLKGLNQRQEAIEKANK 207 >gi|288926001|ref|ZP_06419930.1| hypothetical protein HMPREF0649_01441 [Prevotella buccae D17] gi|288337221|gb|EFC75578.1| hypothetical protein HMPREF0649_01441 [Prevotella buccae D17] Length = 459 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 50/142 (35%), Gaps = 4/142 (2%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + K + Y V G I+ + S + + + I + Sbjct: 73 KYLSHKQSNELNYLKVKKGWILVTCSGTLGNVTYTNSDYEDKIVTHDLIRIVPNDNKIKA 132 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 L + S G+ + + +K + + V P Q +V+ E+AR+ Sbjct: 133 GVLYAFLSSKYGYYQINQSQFGGVVKHINDTQMKDIMIPVFPSDLQDK-VDVLIKESARL 191 Query: 390 -DVLVEKIEQSIVLLKERRSSF 410 + E + +S LLK+ ++S Sbjct: 192 REEATELLNESRKLLKQ-KASL 212 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 58/395 (14%), Positives = 114/395 (28%), Gaps = 47/395 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + V + S+ + Y+ D + K + +QS+ KG Sbjct: 36 EKVFLGNIFS--RVFVSKPEYGLTYLAASDTVLEDLQ-TGKYLSHKQSNELNYLKVKKGW 92 Query: 86 ILYGKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 IL G D +V + +L +L S +I Sbjct: 93 ILVTCSGTLGNVTYTNSDYEDKIVTHDLIRIVPNDNKIKAGVLYAFLSSKYGYYQINQSQ 152 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G + H + + +I +P+ P Q + +I E+ R+ TE + L ++K + Sbjct: 153 FGGVVKHINDTQMKDIMIPVFPSDLQ-DKVDVLIKESARLREEATELLNESRKLLKQKAS 211 Query: 202 LVSYIVTKGLNPDVKMKDSGIE-----------------WVGLVPDHWEVKPFFALVTEL 244 L V I + + T Sbjct: 212 LPDLTVEDYNYFGPNYHQREISCFTRSIKDLGTLSFHAFNYSERVRNNILGRLSNCKTIS 271 Query: 245 NRK---------------NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 N I+ ++ +I K+ K Y ++ G Sbjct: 272 FYDALDENKLQSPSGVTVNEVKEGHGIMLINQSDIFDKIVKGKYVAKKPKYTK-DLLKEG 330 Query: 290 EIVFRFIDLQN----DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW-LMRSYDLCK 344 EI+ I R + + ++ +I+SA+ +P T + M S + Sbjct: 331 EILIAKIGTLGESESFCRCVYVGEELKNQLISSAFYRFRPSEDIPTGYLYAWMSSDYGFR 390 Query: 345 VFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDI 378 + + G +Q + + PV + ++ I Sbjct: 391 LIRSSQYGTKQCYPNPAFLYKYPVPILDKEDMEKI 425 >gi|291529886|emb|CBK95471.1| Restriction endonuclease S subunits [Eubacterium siraeum 70/3] Length = 381 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 53/146 (36%), Gaps = 9/146 (6%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEI-VFRFIDLQNDKRSLRSAQVMERGIITSAYMAV- 323 ++ + Y++V G+ + DK + + + G++++ Y Sbjct: 49 KQFIPSIANTVGTDFTKYKVVRKGQFTYIPDTSRRGDKIGIALLEDYDEGLVSNVYTVFE 108 Query: 324 --KPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + YL + + G +R+ + ++++ ++ + VP I++Q I Sbjct: 109 VIDENQLMPEYLMLWFSRPEFDRYARFKSHGSVREVMDWDEMCKVELPVPSIEKQRSIVK 168 Query: 381 VINVETARIDVLVEKIEQSIVLLKER 406 T R + ++ L E Sbjct: 169 SYKAITDR----IALKKRINDNLAEY 190 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 54/390 (13%), Positives = 115/390 (29%), Gaps = 53/390 (13%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY- 88 + F + R +E ++ + S +++P N+ +D + + KGQ Y Sbjct: 23 LGEFIRQVDVRNTEGKEENLL-----GVSVQKQFIPSIANTVGTDFTKYKVVRKGQFTYI 77 Query: 89 ---GKLGPYLRKAIIADFD-GICSTQFLVLQPKDVL---PELLQGWLLSIDVTQRIEAIC 141 + G + A++ D+D G+ S + V + D PE L W + + Sbjct: 78 PDTSRRGDKIGIALLEDYDEGLVSNVYTVFEVIDENQLMPEYLMLWFSRPEFDRYARFKS 137 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ DW + + +P+P + +Q I + I I + R + L E Sbjct: 138 HGSVREVMDWDEMCKVELPVPSIEKQRSIVK----SYKAITDRIALKKRINDNLAEYLNC 193 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 + + E A+ + K + +S Sbjct: 194 IFIELAKSI---------------------QETTSLSAICGYVTDKLAFSDIETAVYIST 232 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 NI+ + + + E +++ I K E G Sbjct: 233 ENILPDKQGVSSFGSTSASERVVHFREEDVLVSNIRPYFKK---MWFATTEGGCNADVLC 289 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITN 380 S L ++ + G + + + Sbjct: 290 FRASDKKYSYLLKSILFQDGFFDYVMSGAKGTKMPRGDKNHIMQYQIPC----------- 338 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSF 410 + + + + L +EQ+ L ++ +S Sbjct: 339 FSDQQLQKFNALASSVEQNQALNRQEMASL 368 >gi|282881821|ref|ZP_06290475.1| type I restriction-modification system specificity subunit [Peptoniphilus lacrimalis 315-B] gi|281298334|gb|EFA90776.1| type I restriction-modification system specificity subunit [Peptoniphilus lacrimalis 315-B] Length = 228 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 60/158 (37%), Gaps = 15/158 (9%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--------RGIITSAYMAVKPH 326 + ++ G+IV + + R A + + + T+ A+KP Sbjct: 70 YILSKNLANKKLEAGDIVVEISGGSPTQSTGRCAAITQSLLDRYDSNMLCTNFCKAIKPR 129 Query: 327 GIDSTYLAWLMRSYDLCKVFYA--MGSGLRQSLKFE-DVKRLPVLVPPIKEQFDITNVIN 383 S ++ + + VF++ G+ ++L F ++ P+ +PPI + V + Sbjct: 130 TGYSLFIYYYWQYLYEKGVFFSYENGTTGIKNLDFSGFIETEPIFIPPIDK----VRVFD 185 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I V + + R + + ++G++D+ Sbjct: 186 DYCKSIFNQVFANGKQSEQIALLRETLLPKLMSGELDV 223 >gi|239998599|ref|ZP_04718523.1| Type I restriction enzyme EcoprrI specificity protein [Neisseria gonorrhoeae 35/02] Length = 149 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 17/126 (13%), Positives = 44/126 (34%), Gaps = 6/126 (4%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 9 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 66 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L +E + L Sbjct: 67 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEATLEAELAL 126 Query: 403 LK-ERR 407 K + R Sbjct: 127 RKRQYR 132 >gi|160884786|ref|ZP_02065789.1| hypothetical protein BACOVA_02776 [Bacteroides ovatus ATCC 8483] gi|156109821|gb|EDO11566.1| hypothetical protein BACOVA_02776 [Bacteroides ovatus ATCC 8483] Length = 241 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 15/143 (10%), Positives = 42/143 (29%), Gaps = 9/143 (6%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + + G+++F N A+ + + + +L Sbjct: 102 TVINTGINDKHWLKKGDLLFAAKGGSNYCILYEGAERSTIASSSFIIIRPITSDVLPEFL 161 Query: 334 AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + + + + G Q + + + + +P I+ Q + +D L Sbjct: 162 CCFLNTPSILGMLKSAAVGTGIQVIPQSVIGEIQLDIPSIEVQKLVVE--------MDQL 213 Query: 393 VEKIEQSIVLLKERRSSFIAAAV 415 + E + E + S + Sbjct: 214 RRESECIRSEINELKQSLQDQLL 236 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 73/169 (43%), Gaps = 8/169 (4%) Query: 26 KVVPIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 K V +K T + +G ++S ++ Y+ ++DV+ + + + + K Sbjct: 57 KKVTLKDITMMQSGIYMKTDSQGEVRYLQVKDVDPESRLDYTQVATVINTGINDKHWLKK 116 Query: 84 GQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEA 139 G +L+ G + + I S+ F++++P DVLPE L +L + + +++ Sbjct: 117 GDLLFAAKGGSNYCILYEGAERSTIASSSFIIIRPITSDVLPEFLCCFLNTPSILGMLKS 176 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK--IIAETVRIDTLIT 186 G + IG I + IP + Q L+ E + E+ I + I Sbjct: 177 AAVGTGIQVIPQSVIGEIQLDIPSIEVQKLVVEMDQLRRESECIRSEIN 225 >gi|294782727|ref|ZP_06748053.1| type I site-specific deoxyribonuclease chain S [Fusobacterium sp. 1_1_41FAA] gi|294481368|gb|EFG29143.1| type I site-specific deoxyribonuclease chain S [Fusobacterium sp. 1_1_41FAA] Length = 192 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 55/143 (38%), Gaps = 7/143 (4%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFR----FIDLQNDKRSLRSAQVMERGIITS-AYMA 322 + + + I++ +++ I L + + + + + + Sbjct: 45 FNEKKLTYYNGEFPNEYILNEDDLIIPLTEQVIGLFGNTAFIPKVKGISFLLNQRVGKII 104 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + ++ YL +L+ + + K SG ++++ DV + V + +KEQ I + Sbjct: 105 PIKNRANNYYLHYLLATDLVRKQLEHRASGTKQRNISPNDVYDVTVFICDVKEQKKIGEL 164 Query: 382 INVETARIDVLVEKIEQSIVLLK 404 + +I+ L KI ++ L Sbjct: 165 LYNMERKIN-LNNKINDNLDYLN 186 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 58/191 (30%), Gaps = 15/191 (7%) Query: 26 KVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVS 79 + + K+ G + + + + L ++ S ++ K + Sbjct: 2 NKIKLGEILKVKHGFAFKSQNYVNKSEFALVTLANISSTNNFQFNEKKLTYYNGEFPNEY 61 Query: 80 IFAKGQILYGK----LGPYLRKAIIADFDGI---CSTQF--LVLQPKDVLPELLQGWLLS 130 I + ++ +G + A I GI + + ++ L L + Sbjct: 62 ILNEDDLIIPLTEQVIGLFGNTAFIPKVKGISFLLNQRVGKIIPIKNRANNYYLHYLLAT 121 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIR 190 V +++E G + + ++ + I + EQ I E + +I+ Sbjct: 122 DLVRKQLEHRASGTKQRNISPNDVYDVTVFICDVKEQKKIGELLYNMERKINLNNKINDN 181 Query: 191 FIELLKEKKQA 201 L A Sbjct: 182 LDYLNYSDIVA 192 >gi|188532535|ref|YP_001906332.1| hypothetical protein ETA_03780 [Erwinia tasmaniensis Et1/99] gi|188027577|emb|CAO95424.1| Hypothetical protein ETA_03780 [Erwinia tasmaniensis Et1/99] Length = 196 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 37/109 (33%), Gaps = 2/109 (1%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYL 333 + P + G+I+ R ++ + A + K + + YL Sbjct: 54 VSPPVDPEKHYLQDGDILLRVRGPNFAAGVFTGSKTLPSVTSNQNAIIKCKENKVLPGYL 113 Query: 334 AWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 W + S F+ M G L + + + V +P + Q DI + Sbjct: 114 HWYINSSLGQNYFHRMSEGTNITKLSLKILSDMEVKLPSLDIQSDIVKI 162 >gi|154137|gb|AAA27146.1| hsdS specificity protein [Salmonella enterica subsp. enterica serovar Typhimurium] Length = 45 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 16/45 (35%), Positives = 24/45 (53%) Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 V VPP+ EQ I ++ A++D ++EQ +LK R S I Sbjct: 1 VPVPPLAEQKVIAEKLDTLLAQVDSTKARLEQIPQILKRFRQSVI 45 >gi|262191970|ref|ZP_06050136.1| type I restriction-modification system S subunit putative [Vibrio cholerae CT 5369-93] gi|262032145|gb|EEY50717.1| type I restriction-modification system S subunit putative [Vibrio cholerae CT 5369-93] Length = 469 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/371 (11%), Positives = 103/371 (27%), Gaps = 38/371 (10%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +KR ++ ++ ++ + + +S + + + K IL Sbjct: 60 RLKRI------WVNDPNHGYPFLTTTNIHISNLEKISYIASSIVAGKRNL-LVKKDWILI 112 Query: 89 GKLGPYLRKAII---ADFDGICSTQFLVLQPKDVLPELL-QGWLLSIDVTQRIEAICEGA 144 + G R A D V+ + + +L S Q+I A GA Sbjct: 113 TRSGTIGRLAFCRPDMDDFACTEDVMRVVADESKIDAGYLYAFLSSTFGVQQIIAGTYGA 172 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + H + + + +IP+P + I KI + + + + + + E Sbjct: 173 IIQHIEPEHVKDIPVPRFAKDLEANIGSKIKSSAQKRADANSLMVSAGKQINEHFSFPNK 232 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 ++ + S ++ H ++ + E L E I +L + Sbjct: 233 LALSHRIFTHSAASSSLVQNRMDATYHDQIAQLSDELIEKAGAENNLAELGIQALEGNRM 292 Query: 265 IQKLETRNMGL---------------------KPESYETYQIVDPGEIVFRFIDLQNDKR 303 Q + G+ K + +++ Sbjct: 293 KQIFTGEDYGVPFFTSGEIFRADVTPERFLLRKSLKGDEVWQTREEDLLIARSGQVGGII 352 Query: 304 SLRSAQV--MERGIITSAYM--AVKPHGIDSTYLAWLMRSYD--LCKVFYAMGSGLRQSL 357 + ++ + V +D+ YL + D ++ L Sbjct: 353 GTGVWADSRFDGACVSPHVLKLRVTNQSVDAGYLYAFLCCTDVGYRQLIRGAAGSSVPFL 412 Query: 358 KFEDVKRLPVL 368 D+ + + Sbjct: 413 SVSDILAIKLP 423 >gi|283956928|ref|ZP_06374401.1| hypothetical protein C1336_000320098 [Campylobacter jejuni subsp. jejuni 1336] gi|283791654|gb|EFC30450.1| hypothetical protein C1336_000320098 [Campylobacter jejuni subsp. jejuni 1336] Length = 48 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 12/47 (25%), Positives = 23/47 (48%) Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 +KEQ I + ++ + I L + + I L+E + S + A G+ Sbjct: 1 MKEQKQIVSHLDELSLNIKDLKQNYQAQIKNLQELKKSLLDRAFKGR 47 >gi|154487258|ref|ZP_02028665.1| hypothetical protein BIFADO_01102 [Bifidobacterium adolescentis L2-32] gi|154084092|gb|EDN83137.1| hypothetical protein BIFADO_01102 [Bifidobacterium adolescentis L2-32] Length = 125 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 13/94 (13%), Positives = 30/94 (31%), Gaps = 11/94 (11%) Query: 334 AWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + SG + E + +L + EQ I V++ D Sbjct: 19 YFFFALKQWESYLKSQTSGSGIPHVDKEVLGKLEITEFAESEQSKIAEVLSTV----DRA 74 Query: 393 VEKIEQSIVLLKERRSSFIAAAVT------GQID 420 + + ++ I + + + +T GQ+ Sbjct: 75 IAQTKELIAKQQRIKIGLMRDLLTLGIDEAGQLR 108 >gi|257466157|ref|ZP_05630468.1| Type I restriction/modification specificity protein [Fusobacterium gonidiaformans ATCC 25563] gi|315917314|ref|ZP_07913554.1| type I restriction enzyme S protein [Fusobacterium gonidiaformans ATCC 25563] gi|313691189|gb|EFS28024.1| type I restriction enzyme S protein [Fusobacterium gonidiaformans ATCC 25563] Length = 173 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 57/148 (38%), Gaps = 4/148 (2%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 N + I N I +++ N+ ++ + V I++ + + Sbjct: 22 DNWEYINYLDTGNITMNHINEIQHINLRVEKLPSRAKRKVRYNNIIYSTVRPSQKHFGII 81 Query: 307 SAQVMERGIITSAYMA-VKPHGIDSTYLAWLMRSYDLCKVFYAMG---SGLRQSLKFEDV 362 + + T + + P D+ ++ + + + +++ + S+K+ DV Sbjct: 82 KNILPNFLVSTGFVVLEIDPLKADADFIYYFLTQDKITSYLHSIAEQSTSAYPSIKYTDV 141 Query: 363 KRLPVLVPPIKEQFDITNVINVETARID 390 + + + +P ++ Q ++ + + +I+ Sbjct: 142 EDIEICLPNLQLQKKVSKFLRLLDKKIE 169 >gi|255690848|ref|ZP_05414523.1| type I restriction-modification system, S subunit [Bacteroides finegoldii DSM 17565] gi|260623572|gb|EEX46443.1| type I restriction-modification system, S subunit [Bacteroides finegoldii DSM 17565] Length = 204 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 37/112 (33%), Gaps = 2/112 (1%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + IV+ G+ V+ ++ S V + G + S + + Sbjct: 92 KSSATIVEKGKFVYAGDNIILVDGENSGEVFTVPQDGYMGSTFKQLWLSSAMWKPYILAF 151 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + + + L E LP+ +PP +EQ I IN + + Sbjct: 152 ILFYKEDLRNSKRGAAIPHLNKELFYNLPIGIPPYQEQQRIAKRINELSQLL 203 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 54/163 (33%), Gaps = 13/163 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 P +W V+ +K +L G + I + + + + + + G Sbjct: 55 EYPNNWSVLRLKDICQLIDGE--KRNGKGICLDAKYLRGKSSATIVEKG----------K 102 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G + G + DG + F L + + + + + Sbjct: 103 FVYAGDNIILVDGENSGEVFTVPQDGYMGSTFKQLWLSSAMWK-PYILAFILFYKEDLRN 161 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 GA + H + + N+P+ IPP EQ I ++I + + Sbjct: 162 SKRGAAIPHLNKELFYNLPIGIPPYQEQQRIAKRINELSQLLK 204 >gi|259910157|ref|YP_002650513.1| Type I restriction enzyme specificity protein, fragment [Erwinia pyrifoliae Ep1/96] gi|224965779|emb|CAX57311.1| Type I restriction enzyme specificity protein, fragment [Erwinia pyrifoliae Ep1/96] Length = 117 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 15/46 (32%), Positives = 18/46 (39%) Query: 1 MKHYKAYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK 46 M Y K S V WIG IP W+ K + TG + K Sbjct: 4 MAELPKYEFCKKSCVDWIGKIPTDWQAKRFKFLASITTGDKNTEDK 49 >gi|295692969|ref|YP_003601579.1| type i site-specific deoxyribonuclease [Lactobacillus crispatus ST1] gi|295031075|emb|CBL50554.1| Type I site-specific deoxyribonuclease [Lactobacillus crispatus ST1] Length = 238 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 52/157 (33%), Gaps = 17/157 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYI-GLEDVESGTGKYLPKDGNS 70 IP W+ V + L +GR + YI G ++ ++ + +S Sbjct: 73 DIPDSWEWVRLGDVINLISGRDIPKKFHLASKSKDSVPYITGASNITENGEIHISEWIDS 132 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + +KG I+ G + A + + Q + + L + + + L Sbjct: 133 PSV------VVSKGTIILSVKGTIGKIAELNVEKAHIARQIMGIDNAFGLSKEYEKFFLE 186 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 + + + + P+PPL+EQ Sbjct: 187 SYIQELKNKAKSM--IPGISRDDLLMAEFPLPPLSEQ 221 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 51/163 (31%), Gaps = 8/163 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNR----KNTKLIESNILSLSYGNIIQKLETRNMGL 275 + E +PD WE ++ ++ K L + S+ Y + Sbjct: 66 TDDEKPFDIPDSWEWVRLGDVINLISGRDIPKKFHLASKSKDSVPYITGASNITENGEIH 125 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E ++ +V + + ++ + V + I G+ Y + Sbjct: 126 ISEWIDSPSVVVSKGTII--LSVKGTIGKIAELNVEKAHIARQIMGIDNAFGLSKEYEKF 183 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 + SY + + + +D+ +PP+ EQ I Sbjct: 184 FLESY--IQELKNKAKSMIPGISRDDLLMAEFPLPPLSEQSRI 224 >gi|315255453|gb|EFU35421.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 85-1] Length = 262 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 53/188 (28%), Gaps = 2/188 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W V + + + G++ G+ + RQ TS Sbjct: 74 EIPAGWAVNTLSQIANITMGQSPAGESYNEDGIGTLFFQGSTDFGWLFPTPRQYTTSPTR 133 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG IL P IA+ D L K L +++ Sbjct: 134 MAKKGDILLSVRAPV-GDMNIANADCCIGRGLAALNSKSRSDGFLF-YVMKYFKQVFERR 191 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG T + ++ + P + + I T E I+L Sbjct: 192 NAEGTTFGSMTKDDLHSLQVVCPEPGLLKRYDDIVSEYNKMIFTRSLENQDLIKLRDWLL 251 Query: 200 QALVSYIV 207 L++ V Sbjct: 252 PILMNGQV 259 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/224 (11%), Positives = 62/224 (27%), Gaps = 14/224 (6%) Query: 204 SYIVTKGLNPDVKMKDSGIEW---VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + V +P + +G + +P W V + ++ N + Sbjct: 48 NDAVNDAQHPPHDLGPAGKQETQLKREIPAGWAVNTLSQIANITMGQSPAGESYNEDGIG 107 Query: 261 YGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + P Y T + G+I+ D I Sbjct: 108 TLFFQGSTDFGWLFPTPRQYTTSPTRMAKKGDILLSVRAPVGDM-----NIANADCCIGR 162 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 A+ +L ++M+ + S+ +D+ L V+ P Sbjct: 163 GLAALNSKSRSDGFLFYVMKYFKQVFERRNAEGTTFGSMTKDDLHSLQVVCPEPGLLKR- 221 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + + + ++ L + R + + GQ+ ++ Sbjct: 222 ---YDDIVSEYNKMIFTRSLENQDLIKLRDWLLPILMNGQVKIK 262 >gi|321310216|ref|YP_004192545.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802060|emb|CBY92706.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 195 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 60/178 (33%), Gaps = 17/178 (9%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 + + +KN + + + + YG+ + ++ E + G++ Sbjct: 27 LESGTPIIKKKNIRGGKVVVEDVFYGDETKHKVLDIHRVRYEDVVITNVSPGGKVAINLT 86 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 D++ I+ Y+ +LM S + + + +R Sbjct: 87 DMEFILGGEVFKLEPNPEILNRRYL-----------YYFLMNSPQQIEQALTLANVVRLH 135 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + +++ + VP +K Q +I ++ + L + +Q + R +++ Sbjct: 136 VSS--IEKFKIHVPDLKTQLEIVRYLDTFRELREELRMRKQQGVY----YRDKIMSSL 187 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 59/187 (31%), Gaps = 5/187 (2%) Query: 26 KVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTVSIF 81 K + K++ G + I +++ G G+ + + Sbjct: 6 KEYRLGEICKVHRGLSFTDYGLESGTPIIKKKNIRGGKVVVEDVFYGDETKHKVLDIHRV 65 Query: 82 AKGQILYGKLGPYLRKAI-IADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 ++ + P + AI + D + I + L+P + + ++ Q+IE Sbjct: 66 RYEDVVITNVSPGGKVAINLTDMEFILGGEVFKLEPNPEILNRRYLYYFLMNSPQQIEQA 125 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 A + I + +P L Q+ I + + L + + + + Sbjct: 126 LTLANVVRLHVSSIEKFKIHVPDLKTQLEIVRYLDTFRELREELRMRKQQGVYYRDKIMS 185 Query: 201 ALVSYIV 207 +L + Sbjct: 186 SLRECAL 192 >gi|301302527|ref|ZP_07208657.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 124-1] gi|300842052|gb|EFK69812.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 124-1] Length = 252 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 53/188 (28%), Gaps = 2/188 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W V + + + G++ G+ + RQ TS Sbjct: 64 EIPAGWAVNTLSQIANITMGQSPAGESYNEDGIGTLFFQGSTDFGWLFPTPRQYTTSPTR 123 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG IL P IA+ D L K L +++ Sbjct: 124 MAKKGDILLSVRAPV-GDMNIANADCCIGRGLAALNSKSRSDGFLF-YVMKYFKQVFERR 181 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG T + ++ + P + + I T E I+L Sbjct: 182 NAEGTTFGSMTKDDLHSLQVVCPEPGLLKRYDDIVSEYNKMIFTRSLENQDLIKLRDWLL 241 Query: 200 QALVSYIV 207 L++ V Sbjct: 242 PILMNGQV 249 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/224 (11%), Positives = 62/224 (27%), Gaps = 14/224 (6%) Query: 204 SYIVTKGLNPDVKMKDSGIEW---VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 + V +P + +G + +P W V + ++ N + Sbjct: 38 NDAVNDAQHPPHDLGPAGKQETQLKREIPAGWAVNTLSQIANITMGQSPAGESYNEDGIG 97 Query: 261 YGNIIQKLETRNMGLKPESYETYQ--IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + P Y T + G+I+ D I Sbjct: 98 TLFFQGSTDFGWLFPTPRQYTTSPTRMAKKGDILLSVRAPVGDM-----NIANADCCIGR 152 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 A+ +L ++M+ + S+ +D+ L V+ P Sbjct: 153 GLAALNSKSRSDGFLFYVMKYFKQVFERRNAEGTTFGSMTKDDLHSLQVVCPEPGLLKR- 211 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 + + + ++ L + R + + GQ+ ++ Sbjct: 212 ---YDDIVSEYNKMIFTRSLENQDLIKLRDWLLPILMNGQVKIK 252 >gi|21226532|ref|NP_632454.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] gi|20904802|gb|AAM30126.1| type I restriction-modification system specificity subunit [Methanosarcina mazei Go1] Length = 439 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 36/108 (33%), Gaps = 13/108 (12%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESG------KDIIYIGLEDVESGTGKYLPKDGNSR 71 +G IP W+V + F + G +S + + I + +++ G Sbjct: 233 LGEIPDGWEVKSLYDFAQYINGAAFKSEDFSSNHEGLPIIKIRELKYGITPQTE----FT 288 Query: 72 QSDTSTVSIFAKGQILYGKLG---PYLRKAIIADFDGICSTQFLVLQP 116 + + G+IL+ G + + +G + + P Sbjct: 289 KKEFDQKYRINNGEILFSWSGSPDTSIDIFLWTGGNGWLNQHTFRVIP 336 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 26/237 (10%), Positives = 73/237 (30%), Gaps = 10/237 (4%) Query: 189 IRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKN 248 +L E+ + + T L P M++S + + + + F + K+ Sbjct: 201 EELDQLQAEQPEHYIQLKNTAELFPS-TMQESELGEIPDGWEVKSLYDFAQYINGAAFKS 259 Query: 249 TKLIESNI-LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 ++ L + ++ T + ++ ++ GEI+F + + + Sbjct: 260 EDFSSNHEGLPIIKIRELKYGITPQTEFTKKEFDQKYRINNGEILFSWSGSPDTSIDIF- 318 Query: 308 AQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 G + V P + + ++ + + +D+K Sbjct: 319 LWTGGNGWLNQHTFRVIPQEAEEKEFIFFLLKFFKKSFIEIARNKQTTGLGHVTSKDLKN 378 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + P ++ + N I + L + R + ++G++ + Sbjct: 379 MFASFPT----KNVIKLFNDVGEPIVSKIFFNSTENNNLSKIRDFLLPKLLSGELSV 431 >gi|225629305|ref|ZP_03787338.1| Hypothetical protein, conserved [Brucella ceti str. Cudo] gi|256157494|ref|ZP_05455412.1| hypothetical protein BcetM4_01373 [Brucella ceti M490/95/1] gi|256253529|ref|ZP_05459065.1| hypothetical protein BcetB_04372 [Brucella ceti B1/94] gi|260167611|ref|ZP_05754422.1| type I restriction-modification enzyme, S subunit [Brucella sp. F5/99] gi|261220659|ref|ZP_05934940.1| predicted protein [Brucella ceti B1/94] gi|261757034|ref|ZP_06000743.1| type I restriction-modification enzyme [Brucella sp. F5/99] gi|265995991|ref|ZP_06108548.1| predicted protein [Brucella ceti M490/95/1] gi|225615801|gb|EEH12850.1| Hypothetical protein, conserved [Brucella ceti str. Cudo] gi|260919243|gb|EEX85896.1| predicted protein [Brucella ceti B1/94] gi|261737018|gb|EEY25014.1| type I restriction-modification enzyme [Brucella sp. F5/99] gi|262550288|gb|EEZ06449.1| predicted protein [Brucella ceti M490/95/1] Length = 210 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 40/141 (28%), Gaps = 3/141 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R +L + L G + + +G S V G+++ + Sbjct: 37 RPGERLPVIGVRDLQNGVVAPREALDTVGFSSPSKAMTYAVQAGDVLVTGRGTLLKFGLV 96 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVK 363 + P L ++ S G+ SL +D+ Sbjct: 97 GDETAGAVASANIIVVRPAPDAT-GGALFAILSSDVFRPKIEVLRRGATTLLSLSPKDLA 155 Query: 364 RLPVLVPPIKEQFDITNVINV 384 L + +P + EQ I ++ Sbjct: 156 NLEINLPSLNEQERIAALVKE 176 >gi|313113035|ref|ZP_07798673.1| type I restriction enzyme R protein [Faecalibacterium cf. prausnitzii KLE1255] gi|310624649|gb|EFQ07966.1| type I restriction enzyme R protein [Faecalibacterium cf. prausnitzii KLE1255] Length = 452 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 8/97 (8%) Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL----RQSLKFEDVKRLPVLVPPI 372 + Y +PH ID+TYL +S G R S+K +P+ P I Sbjct: 2 SPLYTVFRPHDIDTTYLEHFFKSEYWHSFMNFNGDSGARSDRFSIKDSVFFEMPIPTPDI 61 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 +EQ I + + + ++ + L + R + Sbjct: 62 EEQKKIGEFLTLLDTL----ITLHQRKLKKLVQIRKA 94 >gi|293369056|ref|ZP_06615654.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] gi|292635862|gb|EFF54356.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] Length = 202 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 59/184 (32%), Gaps = 14/184 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILS--LSYGNIIQKLETRNMGLKPESYETYQ 284 +P+ W L +N E + + N + + ++ E Sbjct: 19 QLPNGWCTTTLKDLCENINGLWKGKKEPFVHVGVIRNANFTKDFKLDYSNIEYIDVEQRT 78 Query: 285 IVDP----GEIVFRFIDLQNDKRSLRSAQVMERGIITSA------YMAVKPHGIDSTYLA 334 G+++ ++ R+ G + S + I S +L Sbjct: 79 FTKRHLMNGDLIVEKSGGSDNNPVGRTILYEGEGGVFSFSNFTMVLRIKYSNTILSKFLY 138 Query: 335 WLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + + + +L + +P+ +PP EQ I + I + A +D++ Sbjct: 139 YYILAIYQTGAMRLMQTQTTGLHNLILDKFLLMPIYLPPSSEQKRIIDKIEMIFATLDMI 198 Query: 393 VEKI 396 +E + Sbjct: 199 MESL 202 Score = 43.2 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 63/183 (34%), Gaps = 19/183 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG------KYLPKDGNSRQS 73 +P W +K + G GK ++ + + + Y + + Sbjct: 19 QLPNGWCTTTLKDLCENINGL--WKGKKEPFVHVGVIRNANFTKDFKLDYSNIEYIDVEQ 76 Query: 74 DTSTVSIFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVL-----QPKDVLPEL 123 T T G ++ K G P R + G+ S + +L + Sbjct: 77 RTFTKRHLMNGDLIVEKSGGSDNNPVGRTILYEGEGGVFSFSNFTMVLRIKYSNTILSKF 136 Query: 124 LQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L ++L+I T + + T + + +P+ +PP +EQ I +KI +D Sbjct: 137 LYYYILAIYQTGAMRLMQTQTTGLHNLILDKFLLMPIYLPPSSEQKRIIDKIEMIFATLD 196 Query: 183 TLI 185 ++ Sbjct: 197 MIM 199 >gi|57242467|ref|ZP_00370405.1| restriction and modification enzyme CjeI [Campylobacter upsaliensis RM3195] gi|57016752|gb|EAL53535.1| restriction and modification enzyme CjeI [Campylobacter upsaliensis RM3195] Length = 298 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 54/159 (33%), Gaps = 8/159 (5%) Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDP---GEIVFRFIDLQNDKRSLRSAQVMER 313 + S G + + Y++ + + I + + + + Sbjct: 128 PTNSQGKGKRPASFEDTNGTYNFYKSSLEIFKCTAYDFDTEAIIIGDGGTANIHYYKGKF 187 Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR-LPVLVPPI 372 AY+ + + S + + + +L + Q++ + +K + + +PP+ Sbjct: 188 SATDHAYIFERLNDEISLHYIYFVIRNNLNLLQAGFKGIGLQNIAKKFIKEQIKIPLPPL 247 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + Q I + E RI+ I SI +E + + Sbjct: 248 EIQKQIVS----ECERIEEQYSTIRMSIEKYQELIRAIL 282 >gi|282850453|ref|ZP_06259832.1| conserved domain protein [Veillonella parvula ATCC 17745] gi|282579946|gb|EFB85350.1| conserved domain protein [Veillonella parvula ATCC 17745] Length = 113 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 10/72 (13%), Positives = 27/72 (37%), Gaps = 5/72 (6%) Query: 329 DSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 ++ ++ L+ S G ++ + ++ L P ++EQ I +++ Sbjct: 31 NNRFIYHLLSSKVFDNYIARENAGGTQKFIALNQIRNFIFLAPTLEEQNKIIELLD---- 86 Query: 388 RIDVLVEKIEQS 399 I + +Q Sbjct: 87 YISQTITLHQQE 98 >gi|217031668|ref|ZP_03437173.1| hypothetical protein HPB128_21g226 [Helicobacter pylori B128] gi|216946868|gb|EEC25464.1| hypothetical protein HPB128_21g226 [Helicobacter pylori B128] Length = 328 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 59/160 (36%), Gaps = 10/160 (6%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 E N K ++++ ++ + N K++ L + I + Sbjct: 18 NNYTKEYNYKKVCYLDTDNITNNKINAFLKIDLTKEKLPSRAKRKCSI----NSIIYSSV 73 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKVFYAM---GS 351 N + ++ + ++++A++ + +D YL + + ++ + G+ Sbjct: 74 RPNQRHFGIIKEIPKNFLVSTAFIVIDVIDLEKLDPNYLYYYITQDEIIHYLQRIAECGT 133 Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 S+ D + + + P++ Q I ++V +I+ Sbjct: 134 SSYPSITPLDFLNIKIKLYPLETQQKIARTLSVLDQKIEN 173 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 44/344 (12%), Positives = 101/344 (29%), Gaps = 49/344 (14%) Query: 44 SGKDIIYIGLEDVESGTGK-YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 + K + Y+ +++ + +L D + + + I+Y + P R I Sbjct: 25 NYKKVCYLDTDNITNNKINAFLKIDLTKEKLPSRAKRKCSINSIIYSSVRPNQRHFGIIK 84 Query: 103 F---DGICSTQFLVLQP---KDVLPELLQGWLLSIDVTQRIEAI--CEGATMSHADWKGI 154 + + ST F+V+ + + P L ++ ++ ++ I C ++ Sbjct: 85 EIPKNFLVSTAFIVIDVIDLEKLDPNYLYYYITQDEIIHYLQRIAECGTSSYPSITPLDF 144 Query: 155 GNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPD 214 NI + + PL Q I + +I+ Sbjct: 145 LNIKIKLYPLETQQKIARTLSVLDQKIENNHKINELL----------------------- 181 Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 H + + KN KL + I + +++ + Sbjct: 182 ----------------HTLAYKIYEYYFKYKPKNAKLEQIIIENPKSNIMVKNAQKTQDK 225 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + P I+ N + + + ++ + + S YL Sbjct: 226 YPFFTSGDNILSYPKAIIDGRNCFLNTGGNAGIKFYVGKASYSTDTWCICANEF-SDYLY 284 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 L+ S + L+ +K+ P+ +P E I Sbjct: 285 LLLSSIKNHINQSFFQGTSLKHLQKNLLKKYPIYMPSAHEIKKI 328 >gi|291551223|emb|CBL27485.1| Type I restriction modification DNA specificity domain [Ruminococcus torques L2-14] Length = 173 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 54/140 (38%), Gaps = 5/140 (3%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV-- 323 +K + Y+IV G+ + + +N ++ + E II+S+Y Sbjct: 34 KKFIPSIANIVGTDLSNYKIVRTGQFAYGPVTSRNGEKISIAYLDEEDCIISSSYTVFEV 93 Query: 324 -KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNV 381 +D YL + + G + + + ++ + + VP I++Q I Sbjct: 94 ENKEELDPEYLMLWFSRPEFDRYARYKSHGSVREIFDWNELCMVELPVPDIEKQRKIVKA 153 Query: 382 INVETARIDVLVEKIEQSIV 401 T RID L +KI ++ Sbjct: 154 YKTITDRID-LKQKINDNLA 172 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 30/128 (23%), Positives = 52/128 (40%), Gaps = 7/128 (5%) Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPK 117 K++P N +D S I GQ Y + G + A + + D I S+ + V + + Sbjct: 35 KFIPSIANIVGTDLSNYKIVRTGQFAYGPVTSRNGEKISIAYLDEEDCIISSSYTVFEVE 94 Query: 118 DV---LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + PE L W + + G+ DW + + +P+P + +Q I + Sbjct: 95 NKEELDPEYLMLWFSRPEFDRYARYKSHGSVREIFDWNELCMVELPVPDIEKQRKIVKAY 154 Query: 175 IAETVRID 182 T RID Sbjct: 155 KTITDRID 162 >gi|218506125|ref|ZP_03504003.1| N-6 DNA methylase [Rhizobium etli Brasil 5] Length = 136 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 43/128 (33%), Gaps = 8/128 (6%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI---ITSAYMAVKPHGIDSTYLA 334 E +Y +++ + + A+ + GI + Y+ +L Sbjct: 9 EVVGSYTYFREDDVLVAKVTPCFENGKAGIARGLTNGIGFGSSEFYVVRSGEETLPAWLY 68 Query: 335 WLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID-- 390 + + + D G+G Q + ++ + VP Q I I E A ++ Sbjct: 69 YWLTTPDFKARATAKMTGTGGLQRVPRAVLEEETITVPERAIQEAIVAEIEAEQALVNGN 128 Query: 391 -VLVEKIE 397 L+ + E Sbjct: 129 RDLIARFE 136 >gi|218263890|ref|ZP_03477846.1| hypothetical protein PRABACTJOHN_03536 [Parabacteroides johnsonii DSM 18315] gi|218222440|gb|EEC95090.1| hypothetical protein PRABACTJOHN_03536 [Parabacteroides johnsonii DSM 18315] Length = 161 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 50/162 (30%), Gaps = 5/162 (3%) Query: 252 IESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 + + L GNI K+ ++ + V +I+ + + Sbjct: 3 CDDGTIVLRSGNIQDGKISFSDIVRVNAPIKESLFVKEDDILMCSRNGSASLVGKVAMIP 62 Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 +T ++ YL +S D + S + + + ++ V P Sbjct: 63 DINEPMTFGAFMTIIRSAEAKYLYLYFQSQDFRERVSEGKSSTMNQITQKMLDKVEVPFP 122 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + ++ ++ D +++Q I + + S I Sbjct: 123 DKDVR----ETLSAIASQADKSKFELKQCIEHIDKVIKSLIN 160 >gi|269978326|gb|ACZ55897.1| putative type I restriction-modification system specificity subunit S [Helicobacter pylori] Length = 343 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 45/312 (14%), Positives = 83/312 (26%), Gaps = 16/312 (5%) Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + K IL+ P IA+ + F + P + + L I Sbjct: 2 LLPKHAILFSSRAPI-GYVAIAEKRLCTNQGFKSIIPNKKI-YFEFLYYLLKYHKDNISN 59 Query: 140 ICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 I G T +G + IPP EQ I + +I+ ++L+ Sbjct: 60 IGGGTTFKEISGATLGLFEVKIPPTYYEQQKIARTLSILDQKIENNHKINELLHKILELL 119 Query: 199 KQALVSYIVTKGLNPDVKM----KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + N K + + + + + + Sbjct: 120 YEQYFVRFDFLDENNKPYQTNGGKMKFSKELNRLIPNDFEVKTLGELITWISGSQPPKSC 179 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS--LRSAQVME 312 +I I +N Y TY + + D+ DK ++ Sbjct: 180 HIYEHKESYI---RFIQNRDYSSNDYITYIPISKNNKICYQYDIMIDKYGEAGAVRFGLQ 236 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPP 371 + + Y+ + S + K + R SL + L + +PP Sbjct: 237 GAYNVALSKISVLNQSMQEYIRSYLNSKPIKKYLSNACMASTRSSLNENHIYSLMLPIPP 296 Query: 372 IK-EQF--DITN 380 I Q I Sbjct: 297 INLLQKYEKIAK 308 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 17/115 (14%), Positives = 40/115 (34%), Gaps = 4/115 (3%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I + A +R + ++ P+ + + Y + G + Sbjct: 8 ILFSSRAPIGYVAIAEKRLCTNQGFKSIIPNKKIYFEFLYYLLKYHKDNISNIGGGTTFK 67 Query: 356 SLKFEDVKRLPVLVPP-IKEQFDITNVINVETARID---VLVEKIEQSIVLLKER 406 + + V +PP EQ I +++ +I+ + E + + + LL E+ Sbjct: 68 EISGATLGLFEVKIPPTYYEQQKIARTLSILDQKIENNHKINELLHKILELLYEQ 122 Score = 37.9 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 56/191 (29%), Gaps = 4/191 (2%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVS 79 IP ++V + +G I + Y D + + Sbjct: 154 IPNDFEVKTLGELITWISGSQPPKSCHIYEHKESYIRFIQNRDYSSNDYITYIPISKNNK 213 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIE 138 I + I+ K G A+ G + + + E ++ +L S + + + Sbjct: 214 ICYQYDIMIDKYGEAG--AVRFGLQGAYNVALSKISVLNQSMQEYIRSYLNSKPIKKYLS 271 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 C +T S + I ++ +PIPP+ + I L Sbjct: 272 NACMASTRSSLNENHIYSLMLPIPPINLLQKYEKIAKNIITAIINNNQSTQTLTALRDFL 331 Query: 199 KQALVSYIVTK 209 L++ V Sbjct: 332 LPLLLTQQVKP 342 >gi|218133861|ref|ZP_03462665.1| hypothetical protein BACPEC_01750 [Bacteroides pectinophilus ATCC 43243] gi|217991236|gb|EEC57242.1| hypothetical protein BACPEC_01750 [Bacteroides pectinophilus ATCC 43243] Length = 179 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 54/139 (38%), Gaps = 5/139 (3%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV-- 323 +K + Y+IV G+ + + +N ++ + E II+S+Y Sbjct: 40 KKFIPSIANIVGTDLSNYKIVRTGQFAYGPVTSRNGEKISIAYLDEEDCIISSSYTVFEV 99 Query: 324 -KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNV 381 +D YL + + G + + + ++ + + VP I++Q I Sbjct: 100 ENKEELDPEYLMLWFSRPEFDRYARYKSHGSVREIFDWNELCMVELPVPDIEKQRKIVKA 159 Query: 382 INVETARIDVLVEKIEQSI 400 T RID L +KI ++ Sbjct: 160 YKTITDRID-LKQKINDNL 177 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 30/128 (23%), Positives = 52/128 (40%), Gaps = 7/128 (5%) Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPK 117 K++P N +D S I GQ Y + G + A + + D I S+ + V + + Sbjct: 41 KFIPSIANIVGTDLSNYKIVRTGQFAYGPVTSRNGEKISIAYLDEEDCIISSSYTVFEVE 100 Query: 118 DV---LPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 + PE L W + + G+ DW + + +P+P + +Q I + Sbjct: 101 NKEELDPEYLMLWFSRPEFDRYARYKSHGSVREIFDWNELCMVELPVPDIEKQRKIVKAY 160 Query: 175 IAETVRID 182 T RID Sbjct: 161 KTITDRID 168 >gi|254695863|ref|ZP_05157691.1| hypothetical protein Babob3T_14800 [Brucella abortus bv. 3 str. Tulya] gi|261216283|ref|ZP_05930564.1| predicted protein [Brucella abortus bv. 3 str. Tulya] gi|260917890|gb|EEX84751.1| predicted protein [Brucella abortus bv. 3 str. Tulya] Length = 210 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 40/141 (28%), Gaps = 3/141 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R +L + L G + + +G S V G+++ + Sbjct: 37 RPGERLPVIGVRDLQDGVVAPREALDTVGFSSPSKAMTYAVQAGDVLVTGRGTLLKFGLV 96 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVK 363 + P L ++ S G+ SL +D+ Sbjct: 97 GDETAGAVASANIIVVRPAPDAT-GGALFAILSSDVFRPKIEVLRRGATTLLSLSPKDLA 155 Query: 364 RLPVLVPPIKEQFDITNVINV 384 L + +P + EQ I ++ Sbjct: 156 NLEINLPSLNEQERIAALVKE 176 >gi|148558646|ref|YP_001257778.1| hypothetical protein BOV_A0787 [Brucella ovis ATCC 25840] gi|161620896|ref|YP_001594782.1| hypothetical protein BCAN_B0855 [Brucella canis ATCC 23365] gi|163844959|ref|YP_001622614.1| hypothetical protein BSUIS_B0832 [Brucella suis ATCC 23445] gi|254700050|ref|ZP_05161878.1| hypothetical protein Bsuib55_04207 [Brucella suis bv. 5 str. 513] gi|254703170|ref|ZP_05164998.1| hypothetical protein Bsuib36_04392 [Brucella suis bv. 3 str. 686] gi|254705684|ref|ZP_05167512.1| hypothetical protein BpinM_01413 [Brucella pinnipedialis M163/99/10] gi|254710915|ref|ZP_05172726.1| hypothetical protein BpinB_11775 [Brucella pinnipedialis B2/94] gi|254712612|ref|ZP_05174423.1| hypothetical protein BcetM6_04392 [Brucella ceti M644/93/1] gi|254715683|ref|ZP_05177494.1| hypothetical protein BcetM_04407 [Brucella ceti M13/05/1] gi|256015603|ref|YP_003105612.1| type I restriction-modification enzyme, S subunit [Brucella microti CCM 4915] gi|256029299|ref|ZP_05442913.1| hypothetical protein BpinM2_01343 [Brucella pinnipedialis M292/94/1] gi|256058987|ref|ZP_05449198.1| hypothetical protein Bneo5_01328 [Brucella neotomae 5K33] gi|260567902|ref|ZP_05838371.1| type I restriction-modification enzyme [Brucella suis bv. 4 str. 40] gi|261217432|ref|ZP_05931713.1| predicted protein [Brucella ceti M13/05/1] gi|261313104|ref|ZP_05952301.1| predicted protein [Brucella pinnipedialis M163/99/10] gi|261318498|ref|ZP_05957695.1| predicted protein [Brucella pinnipedialis B2/94] gi|261320306|ref|ZP_05959503.1| predicted protein [Brucella ceti M644/93/1] gi|261322931|ref|ZP_05962128.1| predicted protein [Brucella neotomae 5K33] gi|261750533|ref|ZP_05994242.1| predicted protein [Brucella suis bv. 5 str. 513] gi|261753792|ref|ZP_05997501.1| predicted protein [Brucella suis bv. 3 str. 686] gi|265986296|ref|ZP_06098853.1| predicted protein [Brucella pinnipedialis M292/94/1] gi|294853395|ref|ZP_06794067.1| hypothetical protein BAZG_02353 [Brucella sp. NVSL 07-0026] gi|148369931|gb|ABQ62803.1| conserved hypothetical protein [Brucella ovis ATCC 25840] gi|161337707|gb|ABX64011.1| Hypothetical protein, conserved [Brucella canis ATCC 23365] gi|163675682|gb|ABY39792.1| Hypothetical protein, conserved [Brucella suis ATCC 23445] gi|255998263|gb|ACU49950.1| type I restriction-modification enzyme, S subunit [Brucella microti CCM 4915] gi|260154567|gb|EEW89648.1| type I restriction-modification enzyme [Brucella suis bv. 4 str. 40] gi|260922521|gb|EEX89089.1| predicted protein [Brucella ceti M13/05/1] gi|261292996|gb|EEX96492.1| predicted protein [Brucella ceti M644/93/1] gi|261297721|gb|EEY01218.1| predicted protein [Brucella pinnipedialis B2/94] gi|261298911|gb|EEY02408.1| predicted protein [Brucella neotomae 5K33] gi|261302130|gb|EEY05627.1| predicted protein [Brucella pinnipedialis M163/99/10] gi|261740286|gb|EEY28212.1| predicted protein [Brucella suis bv. 5 str. 513] gi|261743545|gb|EEY31471.1| predicted protein [Brucella suis bv. 3 str. 686] gi|264658493|gb|EEZ28754.1| predicted protein [Brucella pinnipedialis M292/94/1] gi|294819050|gb|EFG36050.1| hypothetical protein BAZG_02353 [Brucella sp. NVSL 07-0026] Length = 210 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 40/141 (28%), Gaps = 3/141 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R +L + L G + + +G S V G+++ + Sbjct: 37 RPGERLPVIGVRDLQDGVVAPREALDTVGFSSPSKAMTYAVQAGDVLVTGRGTLLKFGLV 96 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVK 363 + P L ++ S G+ SL +D+ Sbjct: 97 GDETAGAVASANIIVVRPAPDAT-GGALFAILSSDVFRPKIEVLRRGATTLLSLSPKDLA 155 Query: 364 RLPVLVPPIKEQFDITNVINV 384 L + +P + EQ I ++ Sbjct: 156 NLEINLPSLNEQERIAALVKE 176 >gi|23500569|ref|NP_700009.1| hypothetical protein BRA0839 [Brucella suis 1330] gi|23464206|gb|AAN34014.1| conserved hypothetical protein [Brucella suis 1330] Length = 210 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 40/141 (28%), Gaps = 3/141 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R +L + L G + + +G S V G+++ + Sbjct: 37 RPGERLPVIGVRDLQDGVVAPREALDTVGFSSPSKAMTYAVQAGDVLVTGRGTLLKFGLV 96 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVK 363 + P L ++ S G+ SL +D+ Sbjct: 97 GDETAGAVASANIIVVRPAPDAT-GGALFAILSSDVFRPKIEVLRRGATTLLSLSPKDLA 155 Query: 364 RLPVLVPPIKEQFDITNVINV 384 L + +P + EQ I ++ Sbjct: 156 NLEINLPSLNEQERIAALVKE 176 >gi|42794862|gb|AAS45789.1| SLV.6 [Streptomyces lavendulae] Length = 814 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 50/150 (33%), Gaps = 5/150 (3%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I E + + + ++ G+I+ +R+ Q + Sbjct: 651 NGNITDTEPERVSQELADRHRHYLLQQGDILCVRSGKTVPPALVRADQSGWLMSTNVIRL 710 Query: 322 AVKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 V + WL R L + + S+ + + + V +PP+ +Q I Sbjct: 711 RVHEGREVDSNYLFRWLGRPESLAWIVDRSAATAAPSISTKTLGTMTVRLPPLPQQRQIA 770 Query: 380 NVINVETARID---VLVEKIEQSIVLLKER 406 +++ + L E I +S LL E+ Sbjct: 771 ELLDALEEQARAHHNLAEAISRSRSLLAEQ 800 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 56/171 (32%), Gaps = 14/171 (8%) Query: 30 IKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYL-PKDGNSRQSDTSTVSI 80 + L G + + + +++G P+ + +D + Sbjct: 615 LAELCDLKAGPSFTRVGKKDRTPNGPVPLVMPRHLKNGNITDTEPERVSQELADRHRHYL 674 Query: 81 FAKGQILYGKLGPYLRKAIIA--DFDGICSTQFL---VLQPKDVLPELLQGWLLSIDVTQ 135 +G IL + G + A++ + ST + V + ++V L WL + Sbjct: 675 LQQGDILCVRSGKTVPPALVRADQSGWLMSTNVIRLRVHEGREVDSNYLFRWLGRPESLA 734 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 I K +G + + +PPL +Q I E + A + Sbjct: 735 WIVDRSAATAAPSISTKTLGTMTVRLPPLPQQRQIAELLDALEEQARAHHN 785 >gi|289423490|ref|ZP_06425292.1| type I restriction-modification system DNA specificity subunit [Peptostreptococcus anaerobius 653-L] gi|289156124|gb|EFD04787.1| type I restriction-modification system DNA specificity subunit [Peptostreptococcus anaerobius 653-L] Length = 131 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 48/122 (39%), Gaps = 8/122 (6%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWLM 337 Y IV P + + + S+ + + I++S Y K +D +L Sbjct: 5 DKTMYYIVSPNSFAYNP--ARINVGSIGYQNLDKSVIVSSLYEVFKTTADVDDRFLWHWF 62 Query: 338 RSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +S K+ G+R ++ + + +P I+EQ I +++ +D L+ Sbjct: 63 KSAAFQKMIEKYQEGGVRLYFYYDKLCMCSIALPSIEEQHKIGKHLDM----LDNLITLH 118 Query: 397 EQ 398 ++ Sbjct: 119 QR 120 >gi|298254244|ref|ZP_06977830.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae str. Canada MDR_19A] Length = 172 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 37/108 (34%), Gaps = 6/108 (5%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y IV ++ N +R + I+S YL + + Sbjct: 50 YAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEPVLEKINSEYLFYFCQL 106 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 Y+ K+ A+ SL D+ + + +PP+ Q + + + + Sbjct: 107 YNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFVALVDK 151 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 10 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 57 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 58 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 114 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 115 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 145 >gi|298229453|ref|ZP_06963134.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae str. Canada MDR_19F] Length = 146 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 36/103 (34%), Gaps = 6/103 (5%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y IV ++ N +R + I+S YL + + Sbjct: 50 YAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEPVLEKINSEYLFYFCQL 106 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 Y+ K+ A+ SL D+ + + +PP+ Q + + + Sbjct: 107 YNFEKLNKAV---TIPSLTKSDLLNISIPLPPLALQNEFADFV 146 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 52/151 (34%), Gaps = 15/151 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 10 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 57 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 58 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 114 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 T+ + NI +P+PPLA Q + Sbjct: 115 AVTIPSLTKSDLLNISIPLPPLALQNEFADF 145 >gi|262065805|ref|ZP_06025417.1| type I restriction-modification system, S subunit [Fusobacterium periodonticum ATCC 33693] gi|291380502|gb|EFE88020.1| type I restriction-modification system, S subunit [Fusobacterium periodonticum ATCC 33693] Length = 176 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 52/170 (30%), Gaps = 5/170 (2%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ + +E+K ++T L + + K++ N+ E Sbjct: 6 DIKTNDKNWELFEIKEISNILTRGKTPKYTLSSNVFVINQACIYWDKIKYENIKFHVEDE 65 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW---LM 337 + + ++ + ++ + E+ I S M ++ L + M Sbjct: 66 NLLFLKNKDILINSTGTGTLGRMNIIQNIINEKFTIDSHVMLIRLKEEKILSLYFINIFM 125 Query: 338 RSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + GS + L E + + +PPI+ Q I Sbjct: 126 NEKYQKDLILKCVNGSTNQIELSKEKFSKFKIPIPPIELQNKFAERIEKI 175 >gi|312970036|ref|ZP_07784218.1| hsdS protein [Escherichia coli 1827-70] gi|310337534|gb|EFQ02645.1| hsdS protein [Escherichia coli 1827-70] Length = 384 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 45/356 (12%), Positives = 98/356 (27%), Gaps = 45/356 (12%) Query: 20 AIPKHWKVVPIKRFTK---LNTGRTSES---GKDIIYIGLEDVESGTGKYLPK-DGNSRQ 72 +P W + TK ++ G I I + ++++G + Sbjct: 7 KLPLGWNCKKLVDCTKEGNISYGIVQPGQHQEDGIGIIRVNNIQNGNIYIDDVLKVSHEI 66 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLS 130 + G++L +G AI + V++P D + L Sbjct: 67 ESKFAKTRLEGGEVLLTLVGSTGISAITTKALQGWNVARAVAVIKPCDEISAEWIHICLQ 126 Query: 131 IDVTQRI-EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 T+ ++ + K + IP+PIPP E+V + + RI+ I Sbjct: 127 SPFTKYFLDSRANTTVQKTLNLKDVKEIPLPIPPHEERVSLEKIYFNFENRINLNIKINK 186 Query: 190 RFIELLKEKKQALVSYI---VTKGLNPDVK------------------------------ 216 E+ + ++ V L+ Sbjct: 187 ILEEMSQNLFKSWFVDFDPVVDNALDAGNPIPEALQSRAELRQKVRNSADFKPLPAEIRS 246 Query: 217 MKDSGIE--WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG 274 + S E +G +P W++K + N + + Y +++ + R Sbjct: 247 LFPSEFEETELGWMPKGWQIKSLDHIANFQNGLALQKFRPKNMEDDYLPVLKIADLRAGQ 306 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + I D ++ + + + V + Sbjct: 307 ITNDERARTDISDSCKVYDGDMIFSWSGTLMIDIWTGGNAALNQHLYKVTSKNTHN 362 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 62/191 (32%), Gaps = 10/191 (5%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK----PESY 280 +G ++ + + E I + NI + LK ES Sbjct: 10 LGWNCKKLVDCTKEGNISYGIVQPGQHQEDGIGIIRVNNIQNGNIYIDDVLKVSHEIESK 69 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 ++ GE++ + A + + + I + ++ ++S Sbjct: 70 FAKTRLEGGEVLLTLVGSTGISAITTKALQGWN-VARAVAVIKPCDEISAEWIHICLQSP 128 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 + + +++L +DVK +P+ +PP +E+ + + + + Sbjct: 129 FTKYFLDSRANTTVQKTLNLKDVKEIPLPIPPHEERV----SLEKIYFNFENRINLNIKI 184 Query: 400 IVLLKERRSSF 410 +L+E + Sbjct: 185 NKILEEMSQNL 195 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 12/116 (10%), Positives = 35/116 (30%), Gaps = 12/116 (10%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNS 70 +G +PK W++ + G + + + + D+ +G + Sbjct: 257 LGWMPKGWQIKSLDHIANFQNGLALQKFRPKNMEDDYLPVLKIADLRAGQI----TNDER 312 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQG 126 ++D S G +++ G + + + + K+ + Sbjct: 313 ARTDISDSCKVYDGDMIFSWSGTLMIDI-WTGGNAALNQHLYKVTSKNTHNLFILC 367 >gi|17988797|ref|NP_541430.1| type I restriction-modification enzyme, S subunit [Brucella melitensis bv. 1 str. 16M] gi|189022584|ref|YP_001932325.1| type I restriction-modification enzyme, S subunit [Brucella abortus S19] gi|256043712|ref|ZP_05446635.1| hypothetical protein Bmelb1R_04427 [Brucella melitensis bv. 1 str. Rev.1] gi|265990134|ref|ZP_06102691.1| predicted protein [Brucella melitensis bv. 1 str. Rev.1] gi|17984615|gb|AAL53694.1| type i restriction-modification enzyme, s subunit [Brucella melitensis bv. 1 str. 16M] gi|189021158|gb|ACD73879.1| type I restriction-modification enzyme, S subunit [Brucella abortus S19] gi|263000803|gb|EEZ13493.1| predicted protein [Brucella melitensis bv. 1 str. Rev.1] gi|326410991|gb|ADZ68055.1| conserved hypothetical protein [Brucella melitensis M28] Length = 209 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 40/141 (28%), Gaps = 3/141 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R +L + L G + + +G S V G+++ + Sbjct: 36 RPGERLPVIGVRDLQDGVVAPREALDTVGFSSLSKAMTYAVQAGDVLVTGRGTLLKFGLV 95 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVK 363 + P L ++ S G+ SL +D+ Sbjct: 96 GDETAGAVASANIIVVRPAPDAT-GGALFAILSSDVFRPKIEVLRRGATTLLSLSPKDLA 154 Query: 364 RLPVLVPPIKEQFDITNVINV 384 L + +P + EQ I ++ Sbjct: 155 NLEINLPSLNEQERIAALVKE 175 >gi|62317329|ref|YP_223182.1| hypothetical protein BruAb2_0392 [Brucella abortus bv. 1 str. 9-941] gi|83269310|ref|YP_418601.1| type I restriction-modification enzyme, S subunit [Brucella melitensis biovar Abortus 2308] gi|225686601|ref|YP_002734573.1| hypothetical protein BMEA_B0818 [Brucella melitensis ATCC 23457] gi|254690829|ref|ZP_05154083.1| hypothetical protein Babob68_11863 [Brucella abortus bv. 6 str. 870] gi|254698610|ref|ZP_05160438.1| hypothetical protein Babob28_13164 [Brucella abortus bv. 2 str. 86/8/59] gi|254732057|ref|ZP_05190635.1| hypothetical protein Babob42_12964 [Brucella abortus bv. 4 str. 292] gi|256111245|ref|ZP_05452276.1| hypothetical protein Bmelb3E_01403 [Brucella melitensis bv. 3 str. Ether] gi|256256011|ref|ZP_05461547.1| hypothetical protein Babob9C_01308 [Brucella abortus bv. 9 str. C68] gi|256262260|ref|ZP_05464792.1| type I restriction-modification enzyme [Brucella melitensis bv. 2 str. 63/9] gi|260544566|ref|ZP_05820387.1| type I restriction-modification enzyme [Brucella abortus NCTC 8038] gi|260564899|ref|ZP_05835384.1| type I restriction-modification enzyme [Brucella melitensis bv. 1 str. 16M] gi|260756407|ref|ZP_05868755.1| predicted protein [Brucella abortus bv. 6 str. 870] gi|260759839|ref|ZP_05872187.1| predicted protein [Brucella abortus bv. 4 str. 292] gi|260763078|ref|ZP_05875410.1| predicted protein [Brucella abortus bv. 2 str. 86/8/59] gi|260882231|ref|ZP_05893845.1| predicted protein [Brucella abortus bv. 9 str. C68] gi|265992758|ref|ZP_06105315.1| predicted protein [Brucella melitensis bv. 3 str. Ether] gi|297249370|ref|ZP_06933071.1| type I restriction-modification enzyme, S subunit [Brucella abortus bv. 5 str. B3196] gi|62197522|gb|AAX75821.1| conserved hypothetical protein [Brucella abortus bv. 1 str. 9-941] gi|82939584|emb|CAJ12564.1| type I restriction-modification enzyme, S subunit [Brucella melitensis biovar Abortus 2308] gi|225642706|gb|ACO02619.1| Hypothetical protein, conserved [Brucella melitensis ATCC 23457] gi|260097837|gb|EEW81711.1| type I restriction-modification enzyme [Brucella abortus NCTC 8038] gi|260152542|gb|EEW87635.1| type I restriction-modification enzyme [Brucella melitensis bv. 1 str. 16M] gi|260670157|gb|EEX57097.1| predicted protein [Brucella abortus bv. 4 str. 292] gi|260673499|gb|EEX60320.1| predicted protein [Brucella abortus bv. 2 str. 86/8/59] gi|260676515|gb|EEX63336.1| predicted protein [Brucella abortus bv. 6 str. 870] gi|260871759|gb|EEX78828.1| predicted protein [Brucella abortus bv. 9 str. C68] gi|262763628|gb|EEZ09660.1| predicted protein [Brucella melitensis bv. 3 str. Ether] gi|263091976|gb|EEZ16282.1| type I restriction-modification enzyme [Brucella melitensis bv. 2 str. 63/9] gi|297173239|gb|EFH32603.1| type I restriction-modification enzyme, S subunit [Brucella abortus bv. 5 str. B3196] gi|326554282|gb|ADZ88921.1| conserved hypothetical protein [Brucella melitensis M5-90] Length = 210 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 40/141 (28%), Gaps = 3/141 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 R +L + L G + + +G S V G+++ + Sbjct: 37 RPGERLPVIGVRDLQDGVVAPREALDTVGFSSLSKAMTYAVQAGDVLVTGRGTLLKFGLV 96 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVK 363 + P L ++ S G+ SL +D+ Sbjct: 97 GDETAGAVASANIIVVRPAPDAT-GGALFAILSSDVFRPKIEVLRRGATTLLSLSPKDLA 155 Query: 364 RLPVLVPPIKEQFDITNVINV 384 L + +P + EQ I ++ Sbjct: 156 NLEINLPSLNEQERIAALVKE 176 >gi|298254240|ref|ZP_06977826.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae str. Canada MDR_19A] Length = 197 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 22/201 (10%), Positives = 63/201 (31%), Gaps = 9/201 (4%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN----MG 274 G + D+ + + E L L+ N+ + + + + Sbjct: 1 MFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKNVTKNGFSFDTKQFIT 60 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + ++ +IV + + I S + ++P + Sbjct: 61 KTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSGMVILRPKTPNLNQ-K 119 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +++ + + L +K++ + +PP+ Q + + + +ID Sbjct: 120 FIIHVLRNNNYSRVISGSAQPQLPITKLKKILLPLPPLALQNEFADFVV----QIDKSQL 175 Query: 395 KIEQSIVLLKERRSSFIAAAV 415 I++S+ L+ + S + Sbjct: 176 AIQKSLEELETLKKSLMQEYF 196 >gi|159897809|ref|YP_001544056.1| type I restriction-modification system, S subunit [Herpetosiphon aurantiacus ATCC 23779] gi|159890848|gb|ABX03928.1| type I restriction-modification system, S subunit [Herpetosiphon aurantiacus ATCC 23779] Length = 58 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 27/58 (46%), Gaps = 2/58 (3%) Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL-LKERRSSFIAAAV 415 + +K+ + +PP+ EQ I + D L +++ QS L + ++ I A+ Sbjct: 1 MKHIKKFILTLPPLAEQQRIVAKVEQLLGLCDQLEQQLAQSQDLGSRSL-AALIQHAL 57 >gi|327404936|ref|YP_004345774.1| restriction modification system DNA specificity domain-containing protein [Fluviicola taffensis DSM 16823] gi|327320444|gb|AEA44936.1| restriction modification system DNA specificity domain protein [Fluviicola taffensis DSM 16823] Length = 457 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 59/414 (14%), Positives = 131/414 (31%), Gaps = 44/414 (10%) Query: 29 PIKRFTK-LNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 P+ TK + TG + I YI + + + + K + + + Sbjct: 43 PLGEVTKKVFTGGIFKRIFISNPEYGIPYISAQHMMNLNPLDVSKIISKKYTPRQEDMTL 102 Query: 82 AKGQILYGKLGPYLRKAIIADF--DGICSTQFL--VLQPKDVLPELLQGWLLSIDVTQRI 137 QIL G +I + I S + + +L L +L + I Sbjct: 103 RHNQILLSCAGTVGNVRLIGNELDGIIGSQDIIRIIADNSKMLYGYLFAYLSTPTAYNYI 162 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 ++ G+ + + I +P+PI +QV I E I + + I + Sbjct: 163 QSYIYGSVVPRIEPNTISKLPVPIISREKQVKIHELIKEASHLRTEANNTFSKLINEINT 222 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNIL 257 + + K ++ + + + + + ++++ + K I Sbjct: 223 LLEIEIERKNIKYSFRKIRDIKMFEKRLDASYNCGPGRRIYDVISKQDHITLKDISEIFH 282 Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQI-------------------VDPGEIVFRFIDL 298 + +G K L S V G + Sbjct: 283 PMLFGKKQLKGSENGNFLFKSSSMMKMKPETDFVLSLRKVDLYSKLQVKEGWSLISRTGT 342 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSL 357 + +R + + I + VKP+ S + ++S+ K+ G +++ + Sbjct: 343 VGNV--VRINKTLADIYIDDHMIRVKPNENYSGLIFIYLKSFYGQKLIEFQKYGSVQEVI 400 Query: 358 KFEDVKRLPVLVPPIKEQFDITNV---INVETARIDVLV-------EKIEQSIV 401 + ++R+P+ ++E I + +++ID E IE+ I Sbjct: 401 NSDYIERIPIPKFLLEE-KLIMRFNKEVKEASSKIDKAALNEFNSNELIEKEIE 453 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 20/175 (11%), Positives = 54/175 (30%), Gaps = 8/175 (4%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 + N + I + N+ ++ + K + + +I+ Sbjct: 57 FKRIFISNPEYGIPYISAQHMMNLNPLDVSKIISKKYTPRQEDMTLRHNQILLSCAGTVG 116 Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKF 359 + R + + G + + YL + + + G + ++ Sbjct: 117 NVRLIGNELDGIIGSQDIIRIIADNSKMLYGYLFAYLSTPTAYNYIQSYIYGSVVPRIEP 176 Query: 360 EDVKRLPVLVPPIKEQFDI------TNVINVETAR-IDVLVEKIEQSIVLLKERR 407 + +LPV + ++Q I + + E L+ +I + + ER+ Sbjct: 177 NTISKLPVPIISREKQVKIHELIKEASHLRTEANNTFSKLINEINTLLEIEIERK 231 >gi|326386413|ref|ZP_08208036.1| hypothetical protein Y88_2307 [Novosphingobium nitrogenifigens DSM 19370] gi|326209074|gb|EGD59868.1| hypothetical protein Y88_2307 [Novosphingobium nitrogenifigens DSM 19370] Length = 196 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 46/139 (33%), Gaps = 15/139 (10%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMRSYD 341 + PG++VFR N + + + + YLAW + D Sbjct: 57 RYALQPGDVVFRSRGQPNFGYVVSGEMAEPIVALLPLIILRPSLDLVTPDYLAWAINQPD 116 Query: 342 LCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + A G + + ++ + + VP + Q I I L K + Sbjct: 117 AQRQIDAEAQGQSLRMIPKGSLEGITIPVPDLSTQRAIVE--------IARLANKEAALL 168 Query: 401 VLLKERRSSFIAAAVTGQI 419 L ERR+ TG++ Sbjct: 169 HQLAERRTQ-----FTGRV 182 >gi|227511527|ref|ZP_03941576.1| possible type Ic restriction-modification system, HsdS subunit [Lactobacillus buchneri ATCC 11577] gi|227085261|gb|EEI20573.1| possible type Ic restriction-modification system, HsdS subunit [Lactobacillus buchneri ATCC 11577] Length = 129 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 34/112 (30%), Gaps = 8/112 (7%) Query: 25 WKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTST 77 W+ I K+ G T +S DI + +V + G K + S+ Sbjct: 18 WEQRKISELAKIQGGGTPDSTNSKFWNGDINWFTPTEVSNQGYLFESNKKISKSGLKHSS 77 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + G +L + I + F + P + +P + Sbjct: 78 AKLMPVGTVLMTS-RAGVGNMGILSLPAATNQGFQSMIPNEDIPSYFLFSMH 128 >gi|212691984|ref|ZP_03300112.1| hypothetical protein BACDOR_01479 [Bacteroides dorei DSM 17855] gi|212665376|gb|EEB25948.1| hypothetical protein BACDOR_01479 [Bacteroides dorei DSM 17855] Length = 143 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 52/135 (38%), Gaps = 11/135 (8%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + ++ I++ + ++GI+ + ID YL + MR Sbjct: 13 KKSSAWLIPANSIIYSNGATIGAISINKYPICTKQGILG----IIPNSNIDVEYLYYFMR 68 Query: 339 SYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 S K + + G ++ +D+ + +P + +Q DI++ ++ L E IE Sbjct: 69 SSYFQKEVERVVTEGTMKTAYLKDINHIKCPIPDLDKQKDISHALSSL-----SLKEDIE 123 Query: 398 QS-IVLLKERRSSFI 411 + + + ++ + Sbjct: 124 KQLLQKYQIQKQYLL 138 >gi|223984083|ref|ZP_03634236.1| hypothetical protein HOLDEFILI_01528 [Holdemania filiformis DSM 12042] gi|223963939|gb|EEF68298.1| hypothetical protein HOLDEFILI_01528 [Holdemania filiformis DSM 12042] Length = 148 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 48/143 (33%), Gaps = 11/143 (7%) Query: 281 ETYQIVDPGEIVFRFIDLQNDK-RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y ++ GE + S++ G +++ Y+ DS ++ S Sbjct: 2 SGYYLLKNGEFAYNKSYSVGYDFGSIKRLDCYPMGALSTLYICFALKKHDSDFIKAYFDS 61 Query: 340 YDLCKVFYAMGS-GLRQ----SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLV 393 + Y + + G R ++ E+ +P EQ I + I ++ + Sbjct: 62 LKWYRDIYMISAEGARNHGLLNVPTEEFFDTKHYLPENTDEQRKIADFIIT----LEHRI 117 Query: 394 EKIEQSIVLLKERRSSFIAAAVT 416 E + + LK+ + I Sbjct: 118 EAQQSLVDNLKKYKRGVIQHIFR 140 >gi|325680238|ref|ZP_08159800.1| type I restriction modification DNA specificity domain protein [Ruminococcus albus 8] gi|324108055|gb|EGC02309.1| type I restriction modification DNA specificity domain protein [Ruminococcus albus 8] Length = 528 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 52/192 (27%), Gaps = 8/192 (4%) Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 L P M I W ++ I + +II+ Sbjct: 27 FFLAPCFFMLVCAISW--EQRKVKDIADNTYGGGTPQTSIDSYWNGEIPWIQSQDIIENQ 84 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 K S E + I + + A + + ++++ I Sbjct: 85 LFNVEPRKHISEEAISKSATKLVPKNSIAIVTRVGVGKLAFMPFSYCTSQDFLSLSGIQI 144 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETA 387 D Y + + L K + + + E++ + VP EQ I Sbjct: 145 DEKYATYSIYQM-LQKEKQNVQGTSIKGITIEEMLSKKIPVPCNSDEQGAIGAF----FH 199 Query: 388 RIDVLVEKIEQS 399 +D L+ ++ Sbjct: 200 NLDTLITLHQRE 211 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 48/168 (28%), Gaps = 13/168 (7%) Query: 24 HWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLP--KDGNSRQSD 74 W+ +K G + +I +I +D+ + K + Sbjct: 41 SWEQRKVKDIADNTYGGGTPQTSIDSYWNGEIPWIQSQDIIENQLFNVEPRKHISEEAIS 100 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 S + K I + K F S FL + E + + + Sbjct: 101 KSATKLVPKNSIAIVT-RVGVGKLAFMPFSYCTSQDFL-SLSGIQIDEKYATYSIYQMLQ 158 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPL-AEQVLIREKIIAETVRI 181 + + +G ++ + + + +P+P EQ I I Sbjct: 159 K-EKQNVQGTSIKGITIEEMLSKKIPVPCNSDEQGAIGAFFHNLDTLI 205 >gi|240146116|ref|ZP_04744717.1| type I restriction-modification system, S subunit [Roseburia intestinalis L1-82] gi|257201769|gb|EEV00054.1| type I restriction-modification system, S subunit [Roseburia intestinalis L1-82] Length = 178 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 57/171 (33%), Gaps = 12/171 (7%) Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-------KPESYETYQIVDPGE 290 K+ K IE + + + K ++ L + Y + + G+ Sbjct: 6 CCAKEIRRGKSPKYIEKSNVLVFAQKCNTKNNGIDISLAQYLDEDTLKRYPADEYMQNGD 65 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIIT-----SAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 +V R + ++ + I +L M+++ Sbjct: 66 VVINSTGTGTLGRVGLYMAYDDNKKLSIVPDSHVTVIRGGSCIHPFFLYAFMKAHQSNLE 125 Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 GS ++ LK ++ + + +P + EQ I+ I+ ++ V+ ++ Sbjct: 126 KMGEGSTNQKELKPLTLRAMLIALPSLSEQKRISIAISTAFEQLSVIESQL 176 >gi|313158335|gb|EFR57737.1| conserved hypothetical protein [Alistipes sp. HGB5] Length = 140 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 37/112 (33%), Gaps = 2/112 (1%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + IV+ G+ V+ ++ S V + G + S + + Sbjct: 28 KSSAIIVEKGKFVYTGDNIILVDGENSGEVFTVPQDGYMGSTFKQLWLSSAMWKPYILAF 87 Query: 338 RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + + + L E LP+ +PP +EQ I IN + + Sbjct: 88 ILFYKEDLRNSKRGAAIPHLNKELFYNLPIGIPPYQEQQRIAKRINKLSQLL 139 >gi|327390914|gb|EGE89254.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA04375] Length = 156 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 51/158 (32%), Gaps = 3/158 (1%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 +K + + + + NII + + + ++V + Sbjct: 1 MRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSV 60 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 +F + ++ ++ +I S V ++ TYL + + S + + Sbjct: 61 LFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYLLSDNFINRVNNKST 118 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 G ++ + L + +P + EQ I I + Sbjct: 119 GTSYPAINDYNFNLLLIALPHLSEQQRIIEAIESALEK 156 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 56/145 (38%), Gaps = 7/145 (4%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDG---NSRQSDTSTVSIFAKGQILYGKLGPYLRKA 98 ++ K YI ++ K+ + Q+ + + ++ +L+ + PYL+ Sbjct: 13 NKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRARKLVSQNSVLFSTVRPYLKNI 72 Query: 99 IIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + I ST F+VL L +LLS + R+ G + + Sbjct: 73 AVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFINRVNNKSTGTSYPAINDYNFN 131 Query: 156 NIPMPIPPLAEQVLIREKIIAETVR 180 + + +P L+EQ I E I + + Sbjct: 132 LLLIALPHLSEQQRIIEAIESALEK 156 >gi|299148891|ref|ZP_07041953.1| type I restriction-modification enzyme [Bacteroides sp. 3_1_23] gi|298513652|gb|EFI37539.1| type I restriction-modification enzyme [Bacteroides sp. 3_1_23] Length = 185 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 15/87 (17%), Positives = 29/87 (33%) Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 + G S + + + +T + + + L + K + Sbjct: 96 VFRTPIDGYQGSTFKLLSINYDMNTEYVLQVINLHRTILRENKVGSAIPHLNKKLFKAIE 155 Query: 367 VLVPPIKEQFDITNVINVETARIDVLV 393 V +PP KEQ I N +DV++ Sbjct: 156 VPIPPYKEQQRIVEAANKVFMSLDVIM 182 >gi|292630956|gb|AAF77188.2|AF264911_4 restriction and modification enzyme CjeI [Campylobacter jejuni] Length = 1273 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 52/400 (13%), Positives = 126/400 (31%), Gaps = 30/400 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR--QSDTSTVSIFAK 83 ++V + LN R S + I ++ SG K LP N + + + Sbjct: 892 ELVRLGEVCDLNKIRNQASATE---IEKMNLNSGNVKLLPSSKNYEWWTDEKTAGQFINE 948 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G+++ + Y + L ++ K + LL I + + +G Sbjct: 949 GEVITLGVARYANIKKHKGKFVSANNHILSVKDKSKIIFDFLYILLEICGQKLYK---QG 1005 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA-- 201 D + +P+PPL Q I + + +TL + L+K Q Sbjct: 1006 QQYPQFDTNIFYSFKIPLPPLEIQKQIVAECEKVEEQYNTLSLSIKEYQNLIKAMLQKCG 1065 Query: 202 LVSYIVTKGLNP------DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESN 255 ++ LN ++ + E++ + + +L+ L Sbjct: 1066 IIEDNQEYELNSILDKINNLCKINLDSEFLSSFNKTIKEYALSNPIFKLSIGKRVLNNEL 1125 Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + + + + E + Y + V ID + + Sbjct: 1126 LENGQIPVYSANVLEVFGFVNKEILQDY----DNDSVLWGIDGDWMVGFIPKNKKFYPTD 1181 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 ++ Y+++++ + F + + +K L V + ++ Q Sbjct: 1182 HCGVLRVDDTKI-NAKYISFILNEAGKKQGFSR-----KLRASIDRIKALRVKLISLEFQ 1235 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 I ++ T +I+ + + + + L++ + + + Sbjct: 1236 DQIADI----TDKIEKKINEYKIELDRLEKEKEKILQKYL 1271 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 58/195 (29%), Gaps = 11/195 (5%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKL-IESNILSLSYGNIIQKLETRNMGLKPE 278 S E +E+ + +N E ++L+ GN+ ++N + Sbjct: 879 SRDELNPFKNSKYELVRLGEVCDLNKIRNQASATEIEKMNLNSGNVKLLPSSKNYEWWTD 938 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 Q ++ GE++ L + + + + ++VK +++ Sbjct: 939 EKTAGQFINEGEVI----TLGVARYANIKKHKGKFVSANNHILSVKDKSKIIFDFLYILL 994 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 K++ + +PP++ Q I + + L Sbjct: 995 EICGQKLYKQGQQ--YPQFDTNIFYSFKIPLPPLEIQKQIVAECEKVEEQYNTL----SL 1048 Query: 399 SIVLLKERRSSFIAA 413 SI + + + Sbjct: 1049 SIKEYQNLIKAMLQK 1063 >gi|186685348|ref|YP_001868544.1| DNA methylase-type I restriction-modification system [Nostoc punctiforme PCC 73102] gi|186467800|gb|ACC83601.1| DNA methylase-type I restriction-modification system [Nostoc punctiforme PCC 73102] Length = 255 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 51/162 (31%), Gaps = 14/162 (8%) Query: 251 LIESNILSLSYGNIIQKLETRNMGLKP----ESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 E + G++ +K + + + G+I+F + + Sbjct: 68 YTEEGTPYIRVGDVKNGQINFESAVKIPITMANVDKSVGLQIGDIIFTRKGSFGNSAVVT 127 Query: 307 SAQVMERGIITSAYMAVK-----PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFE 360 +V GII+S M V+ + Y++ + S G+ S+ Sbjct: 128 ELEV--NGIISSEIMLVRLTSVSRQEVLPEYVSLFLNSKFGYLQVEHRVHGVAYYSISQP 185 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 D+ L + + P +Q I I + L K I Sbjct: 186 DLANLLIPILPKYQQQKIAEKIKSSFSL--KLKSKQLLEIAK 225 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 66/162 (40%), Gaps = 11/162 (6%) Query: 30 IKRFTK-LNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-FAKGQ 85 + + + G + + YI + DV++G + S+ G Sbjct: 52 LGSLIEPIQNGFDYREYTEEGTPYIRVGDVKNGQINFESAVKIPITMANVDKSVGLQIGD 111 Query: 86 ILYGKLGPYLRKAIIA--DFDGICSTQFLVLQP-----KDVLPELLQGWLLSIDVTQRIE 138 I++ + G + A++ + +GI S++ ++++ ++VLPE + +L S ++E Sbjct: 112 IIFTRKGSFGNSAVVTELEVNGIISSEIMLVRLTSVSRQEVLPEYVSLFLNSKFGYLQVE 171 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 G + N+ +PI P +Q I EKI + Sbjct: 172 HRVHGVAYYSISQPDLANLLIPILPKYQQQKIAEKIKSSFSL 213 >gi|240125348|ref|ZP_04738234.1| Type I restriction enzyme EcoprrI specificity protein [Neisseria gonorrhoeae SK-92-679] Length = 203 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 14/129 (10%), Positives = 42/129 (32%), Gaps = 5/129 (3%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 68 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 125 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ T L ++ Sbjct: 126 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDKFTELEATLEAELALRKRQ 185 Query: 403 LKERRSSFI 411 + R + Sbjct: 186 YRYYRDLLL 194 >gi|298674139|ref|YP_003725889.1| DNA methylase-type I restriction-modification system [Methanohalobium evestigatum Z-7303] gi|298287127|gb|ADI73093.1| DNA methylase-type I restriction-modification system [Methanohalobium evestigatum Z-7303] Length = 482 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 52/375 (13%), Positives = 113/375 (30%), Gaps = 48/375 (12%) Query: 51 IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS-- 108 I D+E + N + + G+I+ K+G + +I + S Sbjct: 79 IRTVDIEKDDFENDIIYINKHAYEFLEKTKVYGGEIIINKIGNAGKAYLIPPIEKKQSLG 138 Query: 109 -TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQ 167 QF++ + + L +L ++ GA D + ++ +PI Q Sbjct: 139 MNQFMIRTNEKINNYYLYSYLAGKYGQNQLMQRVTGAVPLSIDKESTRSVLVPIFSHNFQ 198 Query: 168 VLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKM---------- 217 + I+ I ++ KE K+ L+ + P K+ Sbjct: 199 KNVA-------KAINLYIEYSKYSKKVFKECKKNLLEELGLDKWKPKHKLTFVKNFSDTI 251 Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNR--------------KNTKLIESNILSLSYGN 263 K I+ P + E+ + ++ I + + Sbjct: 252 KSERIDAEYYQPKYEEIVNAIKNYKGGWDILGNVVTLEKGLEVGRNEYLDEGIPFVRVSD 311 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA- 322 I + Y + P + F + + ++ I++ + Sbjct: 312 ISPFEIKEEKYISESLYSDIKHCQPQKDEILFTKDATPGIAHYLTEQPKKMIVSEGVLRL 371 Query: 323 --------VKPHGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQSLKFEDVKRLPVLVPP-- 371 K I++ YL ++ S L + +G + + + VK + + + P Sbjct: 372 KNITIGSKNKHKEINNEYLTLVLNSIILKEQINRDVGGSVIIHWRPKQVKNVLIPILPEE 431 Query: 372 --IKEQFDITNVINV 384 +K Q I +N Sbjct: 432 KRLKIQQKIIKSLNS 446 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 51/163 (31%), Gaps = 5/163 (3%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 K+ I ++ + + + + V GEI+ I + Sbjct: 70 KSEPDYAHMIRTVDIEKDDFENDIIYINKHAYEFLEKTKVYGGEIIINKIGNAGKAYLIP 129 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRL 365 + + + + I++ YL + +G S+ E + + Sbjct: 130 PIEKKQSLGMNQFMIRTNEK-INNYYLYSYLAGKYGQNQLMQRVTGAVPLSIDKESTRSV 188 Query: 366 PVLVPPIKEQFDITNVIN--VETARIDVLVEKIEQSIVLLKER 406 V + Q ++ IN +E ++ V K + LL+E Sbjct: 189 LVPIFSHNFQKNVAKAINLYIEYSKYSKKVFKECKK-NLLEEL 230 >gi|229606286|ref|YP_002876934.1| hypothetical protein VCD_001189 [Vibrio cholerae MJ-1236] gi|229607598|ref|YP_002878246.1| hypothetical protein VCD_002510 [Vibrio cholerae MJ-1236] gi|229607705|ref|YP_002878353.1| hypothetical protein VCD_002617 [Vibrio cholerae MJ-1236] gi|229608127|ref|YP_002878775.1| hypothetical protein VCD_003045 [Vibrio cholerae MJ-1236] gi|229368941|gb|ACQ59364.1| hypothetical protein VCD_001189 [Vibrio cholerae MJ-1236] gi|229370253|gb|ACQ60676.1| hypothetical protein VCD_002510 [Vibrio cholerae MJ-1236] gi|229370360|gb|ACQ60783.1| hypothetical protein VCD_002617 [Vibrio cholerae MJ-1236] gi|229370782|gb|ACQ61205.1| hypothetical protein VCD_003045 [Vibrio cholerae MJ-1236] Length = 424 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 58/150 (38%), Gaps = 8/150 (5%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV- 323 ++ Y++V+ + + + +N ++ + E+ I++++Y Sbjct: 33 TKQFIPSIANTVGTDMSNYKVVEHHQFAYGPVTSRNGEKISVALLGEEKCIVSTSYTVFE 92 Query: 324 --KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITN 380 +D YL R + + M G + L ++++ + + VP I++Q +I Sbjct: 93 IVDTELLDPEYLMMWFRRSEFDRYARYMSHGTVRELFGWQEMCDVELPVPSIEKQREIVR 152 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSF 410 E ++ + EQ L+E + Sbjct: 153 ----EYNVVNDRIALNEQLTKKLEETAQAI 178 >gi|322649128|gb|EFY45569.1| type I restriction enzyme EcoEI specificity protein [Salmonella enterica subsp. enterica serovar Montevideo str. OH_2009072675] Length = 165 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 43/128 (33%), Gaps = 13/128 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W ++ + KL G + + K + I ++++ +G+G Y G + Sbjct: 2 VPKGWMLLQVSDICKLQNGNSFKPHEWDTKGLPIIRIQNL-NGSGNYNYFSGVPQD---- 56 Query: 77 TVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + GQ+L+ G I G+ + + + + E L Sbjct: 57 -KWLVEPGQLLFSWAGTKGVSFGPFIWNGPKGVLNQHIYKVFANENVHEHWLYLALLHIT 115 Query: 134 TQRIEAIC 141 + Sbjct: 116 QKIEAQAH 123 >gi|308190010|ref|YP_003922941.1| type I restriction modification DNA specificity domain protein [Mycoplasma fermentans JER] gi|307624752|gb|ADN69057.1| type I restriction modification DNA specificity domain protein [Mycoplasma fermentans JER] Length = 201 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 50/134 (37%), Gaps = 6/134 (4%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + ++ G+I+ R ++ + +L + I T+ + G+ Sbjct: 49 DNFYSNDHIDSQFFTKEGDIIVR--NMYPYEVALIKKEDQGILISTNFIVIRNLEGLLPK 106 Query: 332 YLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN---VETA 387 YLA+L+ + + + + + + + + L V V + EQ I + ++ Sbjct: 107 YLAYLLSIDVIKDLLVFKSAGSVSKHINNKILGSLNVKVISLNEQQRIIDYVDNSYKVNN 166 Query: 388 RIDVLVEKIEQSIV 401 ++ ++ I Sbjct: 167 LYQEAIDLEKKRIE 180 >gi|313892864|ref|ZP_07826442.1| type I restriction modification DNA specificity domain protein [Veillonella sp. oral taxon 158 str. F0412] gi|313442591|gb|EFR61005.1| type I restriction modification DNA specificity domain protein [Veillonella sp. oral taxon 158 str. F0412] Length = 324 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 39/323 (12%), Positives = 86/323 (26%), Gaps = 23/323 (7%) Query: 62 KYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLP 121 + ++ G I++ K+G L+ A C V+ K Sbjct: 16 DNANNYIDEFDLSILKGNLIPAGTIVFAKIGEALKLNKRAITSCECLIDNNVIGIKPDDN 75 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + + E T+ I I + IPP+ Q + Sbjct: 76 IINLLYFYYYLLKIDMLHYSESTTLPSVRKSTIEKIKVKIPPIDVQNKRVTILN------ 129 Query: 182 DTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALV 241 ++ Y V N D+ +K +E G + Sbjct: 130 ----------------ICHKIIKYQVELIHNLDLLVKSRFVEIFGAFNINCNNYNTIKFK 173 Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + + N + L E + + L+ Sbjct: 174 DLIEQNNINEEDMVWLLNLDMIKPNTGEIIEKVYINRQNIPTSSISFNNGTVLYSKLRPY 233 Query: 302 KRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + A G + +G+ + YLA+ +R + +G + + Sbjct: 234 LNKVVIADENGYGTSELIPLNSYKNGLTAEYLAYYLRQDSFVEYIKDKVTGAKMPRVAMD 293 Query: 361 DVKRLPVLVPPIKEQFDITNVIN 383 ++ + ++ P Q ++ +N Sbjct: 294 ILRNIDIIKPNYISQEQFSSFVN 316 >gi|189440819|ref|YP_001955900.1| restriction endonuclease S subunit [Bifidobacterium longum DJO10A] gi|189429254|gb|ACD99402.1| Restriction endonuclease S subunit [Bifidobacterium longum DJO10A] Length = 85 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 41/87 (47%), Gaps = 7/87 (8%) Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 MA++P G+D+ +L + L ++ + + + ++ PV +P + EQ I Sbjct: 1 MMALEPRGVDADFLWLFINQTGLYRIAD---TSTIPQINNKHIEPYPVDIPNMAEQQAIG 57 Query: 380 NVINVETARIDVLVEKIEQSIVLLKER 406 +R+D L+ ++ + +++R Sbjct: 58 TF----FSRLDDLITLHQRKRLSIRQR 80 >gi|260910282|ref|ZP_05916958.1| type I restriction-modification system [Prevotella sp. oral taxon 472 str. F0295] gi|260635606|gb|EEX53620.1| type I restriction-modification system [Prevotella sp. oral taxon 472 str. F0295] Length = 224 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 32/119 (26%), Gaps = 7/119 (5%) Query: 20 AIPKHWKVVPIKRFTKLNT-------GRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 IP+ W+ ++ + + + + + ++ Sbjct: 85 EIPQGWEWCRLRDIIEGTNAGKSPNCEKRPKKEYEWGVLTTTAIQENVFLPTENKVLPPN 144 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 ++ G IL + GP R ++ D C L + + + I Sbjct: 145 YIVNSEHSVQYGDILITRAGPVNRTGVVCLVDKECGNLILSDKTVRIDYLRNYCNPIFI 203 >gi|284054869|ref|ZP_06385079.1| restriction endonuclease S subunits-like protein [Arthrospira platensis str. Paraca] Length = 197 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 67/186 (36%), Gaps = 24/186 (12%) Query: 8 PQYKDSGVQWIGAIPKHWKVVPIKRFTKL---------NTGRTSESGKDIIYIGLEDVES 58 P YK + V G IP+ W+ I+ K N G D + +G+ + + Sbjct: 15 PGYKQTEV---GVIPEDWEFCFIRDLIKQEIIEKPLDGNHGNIHPKSNDFVSVGIPFIMA 71 Query: 59 GTGKYLPKDGN------SRQSDTSTVSIFAKGQILYGKLGPYLRKAIIAD---FDGICST 109 D N Q+D +G IL G A++++ + + Sbjct: 72 NNVFNGVVDTNNCHFIKKEQADNLKKGFSFEGDILLTHKGTVGNVAVVSNILTEYIMLTP 131 Query: 110 Q---FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAE 166 Q + V + ++ + S +I+++ G T ++ ++P +PPL E Sbjct: 132 QVTYYRVKDFNKLNNIFIKFYFQSSQFQDKIQSLSGGGTRAYIGINNQQSLPFLLPPLPE 191 Query: 167 QVLIRE 172 Q I Sbjct: 192 QKAIAS 197 Score = 38.6 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 50/152 (32%), Gaps = 7/152 (4%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQIVDPGEI 291 P ++ K+ + I + N+ + N + + G+I Sbjct: 46 PLDGNHGNIHPKSNDFVSVGIPFIMANNVFNGVVDTNNCHFIKKEQADNLKKGFSFEGDI 105 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYDLCKVFYA- 348 + + + + + Y + +++ ++ + +S + Sbjct: 106 LLTHKGTVGNVAVVSNILTEYIMLTPQVTYYRVKDFNKLNNIFIKFYFQSSQFQDKIQSL 165 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 G G R + + + LP L+PP+ EQ I + Sbjct: 166 SGGGTRAYIGINNQQSLPFLLPPLPEQKAIAS 197 >gi|307067136|ref|YP_003876102.1| restriction endonuclease S subunit [Streptococcus pneumoniae AP200] gi|306408673|gb|ADM84100.1| Restriction endonuclease S subunit [Streptococcus pneumoniae AP200] Length = 168 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 48 SEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 107 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 108 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 166 >gi|87124162|ref|ZP_01080012.1| type I site-specific deoxyribonuclease (specificity subunit) [Synechococcus sp. RS9917] gi|86168731|gb|EAQ69988.1| type I site-specific deoxyribonuclease (specificity subunit) [Synechococcus sp. RS9917] Length = 82 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 37/83 (44%), Gaps = 5/83 (6%) Query: 15 VQWIGAIPKHWKVVPIKRFTKLNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGNS 70 ++W+G +P+HW+ + ++ T+LN + K + ++ +E + L ++ Sbjct: 1 MEWLGEVPEHWEALRLRFATQLNPSKQEAKELGDQKMVSFLPMEAIGEHGSIRLEQE-KE 59 Query: 71 RQSDTSTVSIFAKGQILYGKLGP 93 S + F G + K+ P Sbjct: 60 VGECLSGYTYFRDGDVCVAKITP 82 >gi|329963225|ref|ZP_08300962.1| hypothetical protein HMPREF9446_02555 [Bacteroides fluxus YIT 12057] gi|328528921|gb|EGF55861.1| hypothetical protein HMPREF9446_02555 [Bacteroides fluxus YIT 12057] Length = 136 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 39/109 (35%), Gaps = 7/109 (6%) Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 S + +I + D YL + + ++ M + Sbjct: 26 DGSGVGTVSYAQGKFSVIGTLNYLTVIGNNDLRYLYFALSVFNFQPYKTGMA---IPHIY 82 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARI----DVLVEKIEQSIVLL 403 F+D + + P + EQ + NV++ +++ ++L +Q + LL Sbjct: 83 FKDYGKAKIYCPSLAEQKRVANVLDKLESKLFVEQELLASFNQQKLYLL 131 >gi|237721639|ref|ZP_04552120.1| type I restriction-modification system [Bacteroides sp. 2_2_4] gi|229449435|gb|EEO55226.1| type I restriction-modification system [Bacteroides sp. 2_2_4] Length = 202 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 58/184 (31%), Gaps = 14/184 (7%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILS--LSYGNIIQKLETRNMGLKPESYETYQ 284 +P+ W L +N E + + N + + ++ E Sbjct: 19 QLPNGWCTTTLKDLCENINGLWKGKKEPFVHVGVIRNANFTKDFKLDYSNIEYIDVEQRT 78 Query: 285 IVDP----GEIVFRFIDLQNDKRSLRSAQVMERGIITSA------YMAVKPHGIDSTYLA 334 G+++ ++ R+ G + S + I S +L Sbjct: 79 FTKRHLMNGDLIVEKSGGSDNNPVGRTILYEGEGGVFSFSNFTMVLRIKYSNTILSKFLY 138 Query: 335 WLMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + + + +L + +P+ +PP EQ I + I + +D++ Sbjct: 139 YYILAIYQTGAMRLMQTQTTGLHNLILDKFLLMPIYLPPSSEQKRIIDKIEMIFTTLDMI 198 Query: 393 VEKI 396 +E + Sbjct: 199 MESL 202 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 63/183 (34%), Gaps = 19/183 (10%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG------KYLPKDGNSRQS 73 +P W +K + G GK ++ + + + Y + + Sbjct: 19 QLPNGWCTTTLKDLCENINGL--WKGKKEPFVHVGVIRNANFTKDFKLDYSNIEYIDVEQ 76 Query: 74 DTSTVSIFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVL-----QPKDVLPEL 123 T T G ++ K G P R + G+ S + +L + Sbjct: 77 RTFTKRHLMNGDLIVEKSGGSDNNPVGRTILYEGEGGVFSFSNFTMVLRIKYSNTILSKF 136 Query: 124 LQGWLLSIDVTQRIEAICEGAT-MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 L ++L+I T + + T + + +P+ +PP +EQ I +KI +D Sbjct: 137 LYYYILAIYQTGAMRLMQTQTTGLHNLILDKFLLMPIYLPPSSEQKRIIDKIEMIFTTLD 196 Query: 183 TLI 185 ++ Sbjct: 197 MIM 199 >gi|149026372|ref|ZP_01836527.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP23-BS72] gi|147929334|gb|EDK80333.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP23-BS72] Length = 297 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/314 (9%), Positives = 85/314 (27%), Gaps = 26/314 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYL 333 + ++P + L Sbjct: 282 MVILRPKTPNHNLL 295 >gi|328675903|gb|AEB28578.1| conserved hypothetical protein [Francisella cf. novicida 3523] Length = 189 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 55/136 (40%), Gaps = 7/136 (5%) Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDS 330 ++ G+I+F L+ ++ + ++ S A + I Sbjct: 54 DTFIASKDLSFSCTQEGDIIF---GLRKPNGAVYIDKNHTNLLVQSYMAIIRCNTDIILP 110 Query: 331 TYLAWLMRSYDLCKVFYA--MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YLA+ + + D+ G Q LK + +K + + +P I++Q + +V+ + Sbjct: 111 EYLAFRLNTSDIQNQLQKDIQGGTAIQLLKIQSLKEVVIDIPNIEKQKQLISVLKTGYSE 170 Query: 389 IDVLVEKIEQSIVLLK 404 I VL + I+ LLK Sbjct: 171 IQVLEQIIQHKQQLLK 186 >gi|321310223|ref|YP_004192552.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319802067|emb|CBY92713.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 194 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 40/126 (31%), Gaps = 2/126 (1%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 + E V G++V + R + E + +A+ I Sbjct: 54 DERNHKVEDSHRVRYGDVVI--TNSYIAGRVGINLTDTEFILEGNAFKLEPNLEILDKKY 111 Query: 334 AWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + ++ + G + +++ + VP ++ Q +I ++ + L Sbjct: 112 LYYFLMNSPQQIEQLISYGNVSIISKSSMEKFKIRVPDLETQKNIVRQLDAFWELREELR 171 Query: 394 EKIEQS 399 + +Q Sbjct: 172 MRKQQK 177 >gi|261496174|ref|ZP_05992580.1| putative type I specificity subunit HsdS [Mannheimia haemolytica serotype A2 str. OVINE] gi|261308126|gb|EEY09423.1| putative type I specificity subunit HsdS [Mannheimia haemolytica serotype A2 str. OVINE] Length = 244 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 52/170 (30%), Gaps = 16/170 (9%) Query: 247 KNTKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + I + ++ + + + L P+ + I+ + Sbjct: 79 GSEAYQTEGIPFVRVSDLSKFGISQTDKYLHPKDFGNVVRPKKDSILLTKDGT----VGI 134 Query: 306 RSAQVMERGIITSAYMAVKPHGID---STYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 + +ITS + D YLA + S + G + Q K + Sbjct: 135 AYRVPQDLNVITSGAIVHLELKTDEVLPDYLALALNSPAVQLQAERDAGGSIIQHWKPSE 194 Query: 362 VKRLPVLVPPIKEQFDITNVINVETA-------RIDVLVEKIEQSIVLLK 404 + + + V P Q I++ + A ++ +EQ I ++ Sbjct: 195 ILDVVIPVLPKNIQQTISDKVQQSFALRVESEVLLEKAKILVEQEIENMR 244 >gi|261366730|ref|ZP_05979613.1| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] gi|282571557|gb|EFB77092.1| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] Length = 174 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 50/128 (39%), Gaps = 10/128 (7%) Query: 279 SYETYQIVDPGEI-VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID---STYLA 334 + Y++V G+ + DK + + G++++ Y + + YL Sbjct: 47 DFTKYKVVKRGQFTYIPDTSRRGDKIGIALLTDYDEGLVSNIYTVFEVKDENELLPEYLM 106 Query: 335 WLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + G + + ++++ ++ + VP I++Q I N T RI+ Sbjct: 107 LWFSRPEFDRYARFKSHGSVREIMDWDEMCKVELPVPSIEKQRSIVKAYNTITDRIE--- 163 Query: 394 EKIEQSIV 401 +++ I Sbjct: 164 --LKRKIN 169 >gi|183597753|ref|ZP_02959246.1| hypothetical protein PROSTU_01054 [Providencia stuartii ATCC 25827] gi|188023033|gb|EDU61073.1| hypothetical protein PROSTU_01054 [Providencia stuartii ATCC 25827] Length = 204 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 44/126 (34%), Gaps = 7/126 (5%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI---DSTYLAWLMRSYD 341 + G+I+F ++ + + + +K ++ +L W + Sbjct: 69 WLKKGDILFSAKGAKHIASYVDGDLENTTCAPSLFLLHLKSKWQGLVNTQFLTWQLNQPP 128 Query: 342 LCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVIN---VETARIDVLVEKIE 397 + F G S++ + P+ +P I+ Q I + E A + L+ + Sbjct: 129 AQQYFKRSAEGSFHISIRKPVLAATPIALPSIETQNTIAKLYAASIKENALLHKLINNRQ 188 Query: 398 QSIVLL 403 Q + + Sbjct: 189 QQLNAI 194 >gi|238923525|ref|YP_002937041.1| type I restriction-modification system specificity subunit [Eubacterium rectale ATCC 33656] gi|238875200|gb|ACR74907.1| type I restriction-modification system specificity subunit [Eubacterium rectale ATCC 33656] Length = 164 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 13/126 (10%), Positives = 42/126 (33%), Gaps = 1/126 (0%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I + + + + + + I+ +++ L + + + Sbjct: 22 HIVEDDMKYISKEFCASLRKSILHENDLIIVRTGLPGT-CCVVPKEYDGCNCADVVLVKP 80 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 ++ YLA + + +V +++ + + + + +P I+ Q I V+ Sbjct: 81 NVDIVNPHYLAAYINMWGKKQVENNKVGAIQKHFNVKSAEEMLIDLPDIEYQNKIAKVLR 140 Query: 384 VETARI 389 +I Sbjct: 141 DINDKI 146 Score = 39.8 bits (91), Expect = 0.96, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 51/154 (33%), Gaps = 3/154 (1%) Query: 44 SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIAD 102 + K + ++ +++ S++ S SI + ++ + G ++ Sbjct: 6 TDKGVKFLRSLNIKPFHIVEDDMKYISKEFCASLRKSILHENDLIIVRTGLPGTCCVVPK 65 Query: 103 FDGICSTQ--FLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMP 160 C+ LV D++ +++ +++E GA H + K + + Sbjct: 66 EYDGCNCADVVLVKPNVDIVNPHYLAAYINMWGKKQVENNKVGAIQKHFNVKSAEEMLID 125 Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 +P + Q I + + +I Sbjct: 126 LPDIEYQNKIAKVLRDINDKILNNEKINDYLAYQ 159 >gi|300727391|ref|ZP_07060804.1| type I restriction modification system, subunit S [Prevotella bryantii B14] gi|299775331|gb|EFI71928.1| type I restriction modification system, subunit S [Prevotella bryantii B14] Length = 185 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 41/128 (32%), Gaps = 5/128 (3%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 Y I D + S + + + + VK + +L + Sbjct: 57 DYINDYITDEELLCIAEDCGNYKAGEDSSYIINGKAWVNNHAHLVKAKEC--CEIKYLHQ 114 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA---RIDVLVEK 395 + + + R L + +K +PVL+P I+ Q ++ I +E Sbjct: 115 YLKITDLMPYVSGTTRLKLTQKKMKEIPVLLPSIELQNKFVSIAEQADKSGFEIRKSIEA 174 Query: 396 IEQSIVLL 403 I+ I L Sbjct: 175 IDNVIKSL 182 >gi|191639030|ref|YP_001988196.1| HsdS [Lactobacillus casei BL23] gi|190713332|emb|CAQ67338.1| HsdS [Lactobacillus casei BL23] Length = 190 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 40/179 (22%), Positives = 59/179 (32%), Gaps = 4/179 (2%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ K + + I + ED+ S G+ S F Sbjct: 15 WEKRKFKDL--VVRVNKTSDDSTIPSVEFEDIISKQGRLNKDVRLKINSKQGIY--FEPQ 70 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +L+GKL PYL+ + F G F VL+ + L+ Q + I G Sbjct: 71 DVLFGKLRPYLQNWLFPSFYGRAVGDFWVLRANSSVLSEYLFVLIQSPRFQIVANISSGT 130 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 M +DW + N PIP +EQ I + I + L K Q L Sbjct: 131 KMPRSDWNTVSNTSFPIPVQSEQRKIWQLFNVLDNLIAATQSRLSSLELLKKSLLQDLF 189 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 48/151 (31%), Gaps = 8/151 (5%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLR 306 N +S I S+ + +II K N ++ + I +P +++F + Sbjct: 28 NKTSDDSTIPSVEFEDIISKQGRLNKDVRLKINSKQGIYFEPQDVLFGKLRPYLQNWLFP 87 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 S + ++ + S YL L++S V + V Sbjct: 88 SFYGR---AVGDFWVLRANSSVLSEYLFVLIQSPRFQIVANISSGTKMPRSDWNTVSNTS 144 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIE 397 +P EQ I +D L+ + Sbjct: 145 FPIPVQSEQRKI----WQLFNVLDNLIAATQ 171 >gi|188577906|ref|YP_001914835.1| HsdS polypeptide, part of CfrA family [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522358|gb|ACD60303.1| HsdS polypeptide, part of CfrA family [Xanthomonas oryzae pv. oryzae PXO99A] Length = 151 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 19/113 (16%), Positives = 35/113 (30%), Gaps = 9/113 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTG--RTS---ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 +P W I F + G +T + Y+ + +V+ G + D Sbjct: 16 ELPAGWSSYKIGEFCTVQGGIQKTPLRRPVSQHFPYLRVANVQRGRIDLRQLERYELSLD 75 Query: 75 TSTVSIFAKGQILY----GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + G +L G R AI C Q +++ + + E Sbjct: 76 ELEKWRLSAGDLLIVEGNGSESEIGRCAIWQGEVEDCVYQNHLMRVRPQISEQ 128 >gi|124008032|ref|ZP_01692731.1| conserved hypothetical protein [Microscilla marina ATCC 23134] gi|123986446|gb|EAY26252.1| conserved hypothetical protein [Microscilla marina ATCC 23134] Length = 206 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 10/105 (9%), Positives = 35/105 (33%), Gaps = 5/105 (4%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA-----VKPHGIDSTYLAW 335 +++ +++F +N L + E + ++ + + Y+ W Sbjct: 63 NEDRLLHRSDLLFVAKGDRNTTIPLSNLSTDEYAVPSNHFFILRYKRDWKSRLHLEYVVW 122 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + +++ + ++ + V +P ++ Q I Sbjct: 123 YLNEAAQGYFAQQGTGATVKNISMKVLENIEVPLPALQVQQKIAQ 167 >gi|301633171|gb|ADK86725.1| type I restriction modification DNA specificity domain protein [Mycoplasma pneumoniae FH] Length = 315 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/352 (11%), Positives = 90/352 (25%), Gaps = 43/352 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS--IFAK 83 K IK +++ G+ E + + G+Y G++ Sbjct: 4 KTYKIKDICEISRGKAITK---------EYIRANPGEYPVYSGSTLNDGEIGRIDECEFD 54 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 G+ + + Y + S VL+ K E+ +L + + + Sbjct: 55 GEYVTWTIDGYAGIVFYRNERFNASQHCGVLKVKS--NEICPKFLAYALGMEAPKHVNNA 112 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + K + I + P Q I + T L + ++ Sbjct: 113 CVIPNLTLKKMREIELDFPSKKIQEKIATILDTFTELSAELRERKKQYAFYRDYL----- 167 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFA-LVTELNRKNTKLIESNILSLSYG 262 LN + K G ++ E+ + S + Sbjct: 168 -------LNQENIRKIYGANIPFETFQVKDICEIRRGRAITKAYIRNNPGENPVYSAATT 220 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N + ++ E N + + + + Sbjct: 221 NDGELGRIKDCDFDGEYI---------------TWTTNGYAGVVFYRNGKFNASQDCGVL 265 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + T + + K + + S R L + + + + PP++ Sbjct: 266 KFKNKKICTKFLSFLLKIEAPKFVHNLAS--RPKLSQKVMAEIELSFPPLEI 315 Score = 46.7 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 37/121 (30%), Gaps = 7/121 (5%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + VK + I +LA+ + V A Sbjct: 57 YVTWTIDGYAGIVFYRNERFNASQHCGVLKVKSNEICPKFLAYALGMEAPKHVNNAC--- 113 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +L + ++ + + P K Q I +++ T L ++ + R + Sbjct: 114 VIPNLTLKKMREIELDFPSKKIQEKIATILDTFTE----LSAELRERKKQYAFYRDYLLN 169 Query: 413 A 413 Sbjct: 170 Q 170 >gi|312862776|ref|ZP_07723016.1| conserved hypothetical protein [Streptococcus vestibularis F0396] gi|322516814|ref|ZP_08069716.1| type I restriction-modification system specificty subunit [Streptococcus vestibularis ATCC 49124] gi|311101636|gb|EFQ59839.1| conserved hypothetical protein [Streptococcus vestibularis F0396] gi|322124651|gb|EFX96115.1| type I restriction-modification system specificty subunit [Streptococcus vestibularis ATCC 49124] Length = 206 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 63/186 (33%), Gaps = 8/186 (4%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K+V + G+ + + I L D+ Y S + + Sbjct: 19 KLVRLGDVVDQFKGKAVPAKAEPGEFAVINLSDMTPNGIAYDDLKTFSEERRKLLRFLLE 78 Query: 83 KGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIE 138 G +L G + A+ D + + S+ VL+PK+ L + L ++ ++ Sbjct: 79 DGDVLIASKGTVQKVAVFEDQGKREVVASSNITVLRPKEKLRGFYIKFFLETEIGRAYLD 138 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQ-VLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + + +I +P P+ +Q I + +I + + Sbjct: 139 YADKGKAVLNLSTADLLDIKIPEIPIVKQDYQIAAYLRGRADYHRKMIRAEQEWENIQHN 198 Query: 198 KKQALV 203 +AL Sbjct: 199 VTEALF 204 Score = 37.9 bits (86), Expect = 3.7, Method: Composition-based stats. Identities = 10/102 (9%), Positives = 30/102 (29%), Gaps = 2/102 (1%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ E ++ + + Y+ + + + Sbjct: 74 RFLLEDGDVLIASKGTVQKVAVFEDQGKREVVASSNITVLRPKEKLRGFYIKFFLETEIG 133 Query: 343 CKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF-DITNVI 382 G +L D+ + + PI +Q I + Sbjct: 134 RAYLDYADKGKAVLNLSTADLLDIKIPEIPIVKQDYQIAAYL 175 >gi|300727389|ref|ZP_07060802.1| putative type I restriction enzyme EcoKI specificity protein [Prevotella bryantii B14] gi|299775329|gb|EFI71926.1| putative type I restriction enzyme EcoKI specificity protein [Prevotella bryantii B14] Length = 248 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 54/156 (34%), Gaps = 8/156 (5%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYE--TYQIVDPGEIVFRFIDLQNDKRSL 305 + +S + L NII + + ++ T Q++ G+IV + Sbjct: 30 KEETSDSISVILRSNNIINGQINFDDVVYVDNKRVTTEQVLSKGDIVMCGSNGSKKLVGK 89 Query: 306 RSAQVMERGIITS----AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFE 360 + TS I YL+ ++ +V +GSG ++K E Sbjct: 90 AAMINTIPSYRTSFGAFCLGIRCKESILPEYLSVYFQTPKYREVIEFLGSGSNILNIKPE 149 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETAR-IDVLVEK 395 + L + +P +++Q + D L+ + Sbjct: 150 HIYNLEIPIPSLEDQKHFVTIAEQADKSGFDGLISQ 185 >gi|321309734|ref|YP_004192063.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] gi|319801578|emb|CBY92224.1| type I restriction-modification system, S subunit [Mycoplasma haemofelis str. Langford 1] Length = 206 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 74/209 (35%), Gaps = 24/209 (11%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 + K +G + N LI+ ++ + N Sbjct: 14 NHCPKGIPWRAIGDFSIVFYEGRLLERHIVPNGDTPCLIQRDLSVMKGNRFSSCNHMVNE 73 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 GL + + G I+F I D+ + ++ +++ H + +L Sbjct: 74 GLVA----NKRYFEKGSILFSRIGDSLDQVGKAFIYEGDEFVLAGNDISILKHNQNPEFL 129 Query: 334 AWLMRSYDLCKVFYAMGSGLRQS--LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 ++ S+++ +++ ++ ED+K + V +PP++ Q ++ A + Sbjct: 130 IRILNSHEVRHQVIQNTY-KQKTFLIEHEDLKMIMVPLPPVEIQ-------DLVMAEL-- 179 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQID 420 E E+ I + +R+S + G+I Sbjct: 180 --EVKEREIEE-QRQRNSLM-----GKIK 200 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 10/180 (5%) Query: 22 PKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVESGTG-KYLPKDGNSRQSDT 75 PK I F+ + R D + D+ G ++ + + Sbjct: 17 PKGIPWRAIGDFSIVFYEGRLLERHIVPNGDTPCLIQRDLSVMKGNRFSSCNHMVNEGLV 76 Query: 76 STVSIFAKGQILYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + F KG IL+ ++G + I + + + + + + PE L L S Sbjct: 77 ANKRYFEKGSILFSRIGDSLDQVGKAFIYEGDEFVLAGNDISILKHNQNPEFLIRILNSH 136 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +V ++ + + + I +P+PP+ Q L+ ++ + I+ Sbjct: 137 EVRHQVIQNTYKQKTFLIEHEDLKMIMVPLPPVEIQDLVMAELEVKEREIEEQRQRNSLM 196 >gi|257466154|ref|ZP_05630465.1| hypothetical protein FgonA2_01770 [Fusobacterium gonidiaformans ATCC 25563] gi|315917311|ref|ZP_07913551.1| conserved hypothetical protein [Fusobacterium gonidiaformans ATCC 25563] gi|313691186|gb|EFS28021.1| conserved hypothetical protein [Fusobacterium gonidiaformans ATCC 25563] Length = 159 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 49/156 (31%), Gaps = 5/156 (3%) Query: 29 PIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + G+ S + YI E++ G I+ K +L Sbjct: 4 RLSDICHYVKGKVDVSELDNSTYISTENMLPDKGGVTEAASLPTTL---QTQIYEKDDVL 60 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQRIEAICEGATM 146 + PY +K AD +G CS LV + + + P L L A +G M Sbjct: 61 VSNIRPYFKKIWFADQNGGCSNDVLVFRANEGVEPGFLYYVLADDKFFDFSMATSKGTKM 120 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 D K + + + Q + + +I Sbjct: 121 PRGDKKALMEYEVLDFNIDTQKKVASLLGDIDEKIR 156 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 42/158 (26%), Gaps = 4/158 (2%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + + K N +S N++ + QI + +++ Sbjct: 1 MKYRLSDICHYVKGKVDVSELDNSTYISTENMLPDKGGVTEAASLPTTLQTQIYEKDDVL 60 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 I K G + G++ +L +++ A G Sbjct: 61 VSNIRPYFKKIWFA---DQNGGCSNDVLVFRANEGVEPGFLYYVLADDKFFDFSMATSKG 117 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + + VL I Q + +++ +I Sbjct: 118 TKMPRGDKKALMEYEVLDFNIDTQKKVASLLGDIDEKI 155 >gi|126665392|ref|ZP_01736374.1| putative type I restriction-modification system specificity protein [Marinobacter sp. ELB17] gi|126630020|gb|EBA00636.1| putative type I restriction-modification system specificity protein [Marinobacter sp. ELB17] Length = 389 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 40/342 (11%), Positives = 85/342 (24%), Gaps = 49/342 (14%) Query: 86 ILYGKLGPY---LRKAIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAIC 141 +L + G + V++ K L L +L ++ + Sbjct: 69 VLIAEDGSASLENYSIQYVSGKFWANNHVHVIRGKSGLNTRFLYHYLCIVNFI----SFL 124 Query: 142 EGATMSHADWKGIGNIPMPIPP-------LAEQVLIREKIIAETVRIDTLITERIRFIEL 194 G + + IP+ IP L Q I + T L E + Sbjct: 125 TGGGRAKLTKGKMVEIPISIPCPENPKRSLEIQAEIVRILDTFTELTAELTAELTARKKQ 184 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIES 254 + L+S + +EW V ++ + Sbjct: 185 YNYYRDQLLS------------FGEGEVEW-----------KELDDVFDIFAGGDAPKGA 221 Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 + + L + ++ + S + Sbjct: 222 LSNIETEEFNVPILSNGIGDRSLYGWTNKAKIEKPSLTISARGTIGWT----SFRDKPFF 277 Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 I + ++ Y + M++ + Y + L VK VP Sbjct: 278 PIVRLLVLSPKIDLNLKYAYYFMKT---IEDAYNVPQNGIPQLTKPMVKDKKFPVPSPGV 334 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 Q I ++ + E + + I L ++ R ++ Sbjct: 335 QARIVATLDKFDTLTSSITEGLPREIALRQQQYEYYRDFLLS 376 >gi|284795029|gb|ADB93815.1| restriction modification system DNA specificity domain [Yersinia enterocolitica] Length = 143 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 20/145 (13%), Positives = 45/145 (31%), Gaps = 10/145 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSE-------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 W+ + + ++ TG T S I ++ D+ K + Sbjct: 2 GWEEKNVDQLGEIITGSTPSTQNSNNYSNDGIPWVTPTDISRNVTFNTAKKLSQTGCKV- 60 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 I K IL + + I+ G + Q + P + + + SI +++ Sbjct: 61 -ARIVPKDTILVTCIASIGKNTIL-GTQGSFNQQINGVVPNEKENDPYFLFSASILWSEK 118 Query: 137 IEAICEGATMSHADWKGIGNIPMPI 161 ++ TM + + + Sbjct: 119 LKRSAASGTMQIVNKTEFSELKTRV 143 >gi|166363246|ref|YP_001655519.1| putative restriction modification system protein [Microcystis aeruginosa NIES-843] gi|166085619|dbj|BAG00327.1| putative restriction modification system protein [Microcystis aeruginosa NIES-843] Length = 135 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 43/140 (30%), Gaps = 6/140 (4%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + G+I+F I ++ E I A S + Sbjct: 2 QRSRPETGDIIFSNIGTLG--STVLVDNEFEFSIKNVALFKPFDKNYSSFIFLYFSDPAT 59 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 L K+ ++ + ++ L +L P +V+ + + + Sbjct: 60 LRKMEIQSSGTSQKFFSLKFLRGLHILTPNKTLLRLFNDVVEPALKQ----RSLLHKYNQ 115 Query: 402 LLKERRSSFIAAAVTGQIDL 421 LK+ R + + G+I++ Sbjct: 116 KLKQARDILLPKLMNGEIEV 135 >gi|325680256|ref|ZP_08159818.1| type I restriction modification DNA specificity domain protein [Ruminococcus albus 8] gi|324108073|gb|EGC02327.1| type I restriction modification DNA specificity domain protein [Ruminococcus albus 8] Length = 175 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 31/80 (38%), Gaps = 5/80 (6%) Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDIT 379 + V I+ +LA + + + K G + D++ + +L P +EQ I Sbjct: 74 IIVPDDYINPVFLALTISNGNQQKELSKRAQGKSVVHIHNSDLENVVLLYPKYEEQEKIG 133 Query: 380 NVINVETARIDVLVEKIEQS 399 +++D L+ + Sbjct: 134 EY----FSKLDSLITLHQHK 149 >gi|189501457|ref|YP_001960927.1| hypothetical protein Cphamn1_2552 [Chlorobium phaeobacteroides BS1] gi|189496898|gb|ACE05446.1| conserved hypothetical protein [Chlorobium phaeobacteroides BS1] Length = 196 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 26/127 (20%), Positives = 55/127 (43%), Gaps = 9/127 (7%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID---STYLAWLM 337 + + V G++VFR L L + + R ++ + + ++ + D S YL+W + Sbjct: 57 KEHHFVRKGDLVFRSRGLVTTSALL--LEDVGRAVVAAPLLRIRVNDPDKVLSEYLSWYL 114 Query: 338 RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV--INVETAR-IDVLV 393 + + G ++ + E + L V +P ++ Q I + ++ + + L Sbjct: 115 NQREAQVFLDSRAKGTFQKMIGKEAIDDLEVYLPSLERQKHIVELAGLSAREKQMLHELA 174 Query: 394 EKIEQSI 400 EK EQ I Sbjct: 175 EKREQYI 181 Score = 42.9 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 50/154 (32%), Gaps = 11/154 (7%) Query: 30 IKRFTKLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +K + G + G D+ I ++D+ S K Sbjct: 5 LKELATVQVGYSFRSRLEVSEGGDVAVIQMKDLRDDNVVDCSDLAKIDMSGMKEHHFVRK 64 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQ-----FLVLQPKDVLPELLQGWLLSIDVTQRIE 138 G +++ G A++ + G V P VL E L +L + ++ Sbjct: 65 GDLVFRSRGLVTTSALLLEDVGRAVVAAPLLRIRVNDPDKVLSEYLSWYLNQREAQVFLD 124 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 + +G + I ++ + +P L Q I E Sbjct: 125 SRAKGTFQKMIGKEAIDDLEVYLPSLERQKHIVE 158 >gi|321310229|ref|YP_004192558.1| type I restriction-modification system, S subunit (fragment) [Mycoplasma haemofelis str. Langford 1] gi|319802073|emb|CBY92719.1| type I restriction-modification system, S subunit (fragment) [Mycoplasma haemofelis str. Langford 1] Length = 153 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 46/150 (30%), Gaps = 7/150 (4%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K V G++V + ++ + I++S + P+ Sbjct: 3 KENLFYCDDANHKISDAHRVQYGDVVITNSAPSPRRVAINLTNL--EFILSSHVFKLDPN 60 Query: 327 G-IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I + +V + ++ V+ +LVP ++ Q I ++ Sbjct: 61 PEILDRKYLYYFLENSPQQVERMITFKNVSAINVSSVESFKILVPDLETQRSIAAKLDKL 120 Query: 386 TARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + L + Q + R+ ++ + Sbjct: 121 RELREELKMRKRQGVY----YRNKIMSNLL 146 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 19/136 (13%), Positives = 46/136 (33%), Gaps = 2/136 (1%) Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLL 129 S G ++ P R+ I + + I S+ L P + + + Sbjct: 13 NHKISDAHRVQYGDVVITNSAPSPRRVAINLTNLEFILSSHVFKLDPNPEILDRKYLYYF 72 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 + Q++E + +S + + + + +P L Q I K+ + L + Sbjct: 73 LENSPQQVERMITFKNVSAINVSSVESFKILVPDLETQRSIAAKLDKLRELREELKMRKR 132 Query: 190 RFIELLKEKKQALVSY 205 + + + L+ + Sbjct: 133 QGVYYRNKIMSNLLEH 148 >gi|257452047|ref|ZP_05617346.1| hypothetical protein F3_03205 [Fusobacterium sp. 3_1_5R] gi|317058595|ref|ZP_07923080.1| conserved hypothetical protein [Fusobacterium sp. 3_1_5R] gi|313684271|gb|EFS21106.1| conserved hypothetical protein [Fusobacterium sp. 3_1_5R] Length = 160 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 49/156 (31%), Gaps = 5/156 (3%) Query: 29 PIKRFTKLNTGRTSESG-KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + G+ S + YI E++ G I+ K +L Sbjct: 4 RLSDICHYVKGKVDVSELDNSTYISTENMLPDKGGVTEAASLPTTL---QTQIYEKDDVL 60 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL-PELLQGWLLSIDVTQRIEAICEGATM 146 + PY +K AD +G CS LV + + + P L L A +G M Sbjct: 61 VSNIRPYFKKIWFADQNGGCSNDVLVFRANEGVEPGFLYYVLADDKFFDFSMATSKGTKM 120 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 D K + + + Q + + +I Sbjct: 121 PRGDKKALMEYEVLDFNIDTQKKVASLLGDIDEKIR 156 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 42/158 (26%), Gaps = 4/158 (2%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + + K N +S N++ + QI + +++ Sbjct: 1 MKYRLSDICHYVKGKVDVSELDNSTYISTENMLPDKGGVTEAASLPTTLQTQIYEKDDVL 60 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 I K G + G++ +L +++ A G Sbjct: 61 VSNIRPYFKKIWFA---DQNGGCSNDVLVFRANEGVEPGFLYYVLADDKFFDFSMATSKG 117 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + + VL I Q + +++ +I Sbjct: 118 TKMPRGDKKALMEYEVLDFNIDTQKKVASLLGDIDEKI 155 >gi|189467608|ref|ZP_03016393.1| hypothetical protein BACINT_03998 [Bacteroides intestinalis DSM 17393] gi|189435872|gb|EDV04857.1| hypothetical protein BACINT_03998 [Bacteroides intestinalis DSM 17393] Length = 127 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 42/127 (33%), Gaps = 17/127 (13%) Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM-RSYDLCKV--FYAMGSGLRQSLKF 359 + E ++ Y+ + S+ + + R++D GS RQ + Sbjct: 5 AFINFLDKNEIAYGSTEYIVISAKSNYSSSFFYFLARNHDFVDYAVKNMNGSSGRQRVSG 64 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR-----SSFIAAA 414 + + + + V P + + T + + L+ R + + Sbjct: 65 DTISKYRIPVIPRE-------KLESFTNHAE--IALKTIKNNSLQNMRLSMTRDALLPKL 115 Query: 415 VTGQIDL 421 ++G++ + Sbjct: 116 MSGELKV 122 >gi|225550830|ref|ZP_03771779.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 2 str. ATCC 27814] gi|225379984|gb|EEH02346.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 2 str. ATCC 27814] Length = 346 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 43/377 (11%), Positives = 108/377 (28%), Gaps = 41/377 (10%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + ++ T ++ +I GL + N+ ++ I Sbjct: 6 KLSSVFEIITTGKQKNTFNINLEGLYPL------ISASTANNGIMGYVDNYLYDGQNITI 59 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGATMS 147 ++G + F++ + + ++ +LL ++ ++I +I G T Sbjct: 60 SRVGNAGTTFYHEGKISLTDNCFILSRINKKIAKVKYVFYLLKLNEDKKIRSISHGTTRK 119 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + N+ + +P + Q I I + + + + +LL Sbjct: 120 IINKTDLDNLIIYLPSIEIQNAIISIIEPLDILENKINKLKTVLKKLLINIYDK------ 173 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + F K S I Sbjct: 174 ----------------------NCNSHVNLFENNKIYTNKYLNQNLYCDTSCIGELEINF 211 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + N+ L+ + + I+F + +N E + ++ + +K + Sbjct: 212 SKMINISLEDKPSRADLSIKNNSIIFSKLLGENKVYC---FLNNENIVFSTGFFNIKSND 268 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ++ L + S D + +G + D+ ++ P + +I + Sbjct: 269 ENNDDLLSFLLSSDFKNQKSMLANGTTMIGINNSDLTKVRCKAPFLN--SNIYFTFFNKL 326 Query: 387 ARIDVLVEKIEQSIVLL 403 I+ + IV L Sbjct: 327 NEIENKITLARNKIVNL 343 >gi|225376199|ref|ZP_03753420.1| hypothetical protein ROSEINA2194_01837 [Roseburia inulinivorans DSM 16841] gi|225211845|gb|EEG94199.1| hypothetical protein ROSEINA2194_01837 [Roseburia inulinivorans DSM 16841] Length = 172 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 12/127 (9%), Positives = 45/127 (35%), Gaps = 5/127 (3%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 I + + + + + I+ +++ + + G + + V+ Sbjct: 44 IVEDDLKYISREFNESLRKSILHENDLIIVRTGIPG---TCCVVSKDYEGCNCADVVLVR 100 Query: 325 PHGI--DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P+ + YLA + + +V +++ + + + + +P ++ Q + ++ Sbjct: 101 PNLQVVNPHYLAAYINVWGKKQVENNKVGAIQKHFNVKSAEEMLIDLPDLESQNKVAKIL 160 Query: 383 NVETARI 389 +I Sbjct: 161 CDLNDKI 167 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 58/165 (35%), Gaps = 8/165 (4%) Query: 28 VPIKRFTKLNTGRTSE-----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIF 81 V + +L G + + ++ +++ + SR+ + S SI Sbjct: 6 VRLSDIAELTVGFVGNMAKQYKDEGVKFLRSLNIKPFSIVEDDLKYISREFNESLRKSIL 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQ--FLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + ++ + G +++ C+ LV V+ +++ +++E Sbjct: 66 HENDLIIVRTGIPGTCCVVSKDYEGCNCADVVLVRPNLQVVNPHYLAAYINVWGKKQVEN 125 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 GA H + K + + +P L Q + + + +I + Sbjct: 126 NKVGAIQKHFNVKSAEEMLIDLPDLESQNKVAKILCDLNDKIISN 170 >gi|300905940|ref|ZP_07123668.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 84-1] gi|300402221|gb|EFJ85759.1| type I restriction modification DNA specificity domain protein [Escherichia coli MS 84-1] Length = 198 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 53/188 (28%), Gaps = 2/188 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP W V + + + G++ G+ + RQ TS Sbjct: 10 EIPAGWAVNTLSQIANITMGQSPAGESYNEDGIGTLFFQGSTDFGWLFPTPRQYTTSPTR 69 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG IL P IA+ D L K L +++ Sbjct: 70 MAKKGDILLSVRAPV-GDMNIANADCCIGRGLAALNSKSRSDGFLF-YVMKYFKQVFERR 127 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 EG T + ++ + P + + I T E I+L Sbjct: 128 NAEGTTFGSMTKDDLHSLQVVCPEPGLLKRYDDIVSEYNKMIFTRSLENQDLIKLRDWLL 187 Query: 200 QALVSYIV 207 L++ V Sbjct: 188 PILMNGQV 195 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/198 (11%), Positives = 54/198 (27%), Gaps = 11/198 (5%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ-- 284 +P W V + ++ N + + + P Y T Sbjct: 10 EIPAGWAVNTLSQIANITMGQSPAGESYNEDGIGTLFFQGSTDFGWLFPTPRQYTTSPTR 69 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 + G+I+ D I A+ +L ++M+ + Sbjct: 70 MAKKGDILLSVRAPVGDM-----NIANADCCIGRGLAALNSKSRSDGFLFYVMKYFKQVF 124 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 S+ +D+ L V+ P + + + ++ L Sbjct: 125 ERRNAEGTTFGSMTKDDLHSLQVVCPEPGLLKR----YDDIVSEYNKMIFTRSLENQDLI 180 Query: 405 ERRSSFIAAAVTGQIDLR 422 + R + + GQ+ ++ Sbjct: 181 KLRDWLLPILMNGQVKIK 198 >gi|332202747|gb|EGJ16816.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA41317] Length = 297 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 31/314 (9%), Positives = 86/314 (27%), Gaps = 26/314 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEILSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKLQLDELNL---------- 168 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 L + G + + D+ + + E L L+ N Sbjct: 169 -------LVKSRFNEMFGENKIFEIIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKN 221 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + + + + + + ++ +IV + + I S Sbjct: 222 VTKNGFSFDTKQFITKTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSG 281 Query: 320 YMAVKPHGIDSTYL 333 + ++P + L Sbjct: 282 MVILRPKTPNHNLL 295 >gi|229165874|ref|ZP_04293640.1| Type I restriction enzyme, specificity subunit [Bacillus cereus AH621] gi|228617579|gb|EEK74638.1| Type I restriction enzyme, specificity subunit [Bacillus cereus AH621] Length = 192 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 58/176 (32%), Gaps = 12/176 (6%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYE---TYQIVDPGEIVFRFIDLQNDKRSL 305 + I + ++ ++ E+ ++ G++V + + Sbjct: 23 KQFGTQVINYYDQPSFEDDYNHEDVFVEDEAKSLSQNNPSLNEGDVVIS--NSLQLATMV 80 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD---LCKVFYAMGSGLRQSLKFEDV 362 V + + + +D Y +L +Y K G+G + + Sbjct: 81 GKNNVGKVLSLNFTKIEFDSEQLDKRYFLFLFNAYKDVRRQKERELQGNGPVLRIPLRAL 140 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L V V P++EQ I + L K+ + L+++ SS I + G+ Sbjct: 141 GELIVPVAPLEEQKKIGAIYAETL----KLQSKLNKYADLMEKFTSSIIEENLKGK 192 >gi|283956926|ref|ZP_06374399.1| hypothetical protein C1336_000320096 [Campylobacter jejuni subsp. jejuni 1336] gi|283791652|gb|EFC30448.1| hypothetical protein C1336_000320096 [Campylobacter jejuni subsp. jejuni 1336] Length = 108 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 10/81 (12%), Positives = 29/81 (35%), Gaps = 1/81 (1%) Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLP 366 ++ I+ V + I+ + D+ + G + + + D + Sbjct: 5 IWNNDKAILNQHISKVVFYKIEINKKYFYFCILDVLEEMSEKTHGSVMRHITKGDFDNIE 64 Query: 367 VLVPPIKEQFDITNVINVETA 387 + +P +K+Q I +++ Sbjct: 65 IPLPSLKKQERIVGILDELIQ 85 >gi|254993314|ref|ZP_05275504.1| specificity determinant HsdS [Listeria monocytogenes FSL J2-064] Length = 116 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 22/116 (18%), Positives = 41/116 (35%), Gaps = 1/116 (0%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 Q+ N + E+ Y ++ G FR ND +++RGII+ Y Sbjct: 2 QEDYFANRQVTTENNIGYFVLPRGYFTFRSRS-DNDVFVFNRNDIIDRGIISYFYPVFTL 60 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 DS + + + ++ + L + K + + P EQ I + Sbjct: 61 KSADSDFFLRRINNGIQRQLSIQAEGTGQHVLSLKKFKNIVAMFPSEGEQKKIGSF 116 >gi|160887308|ref|ZP_02068311.1| hypothetical protein BACOVA_05326 [Bacteroides ovatus ATCC 8483] gi|156107719|gb|EDO09464.1| hypothetical protein BACOVA_05326 [Bacteroides ovatus ATCC 8483] Length = 174 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 72/175 (41%), Gaps = 12/175 (6%) Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 E + ++ + +LS + I + E + + E+ Y+I+ ++V +L Sbjct: 1 MERSERSQTNNQHEVLSSTVKGIFSQREYFSKDIASENNVGYKIIRLHDVVLSPQNLWM- 59 Query: 302 KRSLRSAQVMERGIITSAYMAV-KPHGIDSTYLAWLMRS----YDLCKVFYAMGSGLRQS 356 ++ E GI++ +Y G D+ ++A ++++ Y V S +R++ Sbjct: 60 -GNINYNDRFEIGIVSPSYKVFSIADGYDNQFVAAMLKTHRALYSYMMVSEQGASIVRRN 118 Query: 357 LKFEDVKRLPVLVPPIKEQFDIT---NVINVETARIDVLVEKIEQSIVLLKERRS 408 L E +L +P + +Q +I +++ + + +++ L R Sbjct: 119 LNMEAFSQLVFKIPSLDKQREIGYAISLLKSQLKTANKIIKAYTSQKQYL--LRQ 171 >gi|310831505|ref|YP_003970148.1| putative type I restriction modification enzyme, M and S domains [Cafeteria roenbergensis virus BV-PW1] gi|309386689|gb|ADO67549.1| putative type I restriction modification enzyme, M and S domains [Cafeteria roenbergensis virus BV-PW1] Length = 977 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 42/387 (10%), Positives = 105/387 (27%), Gaps = 51/387 (13%) Query: 21 IP-KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP +K+V + + +S + + E G + ++ D + Sbjct: 600 IPGDGYKMVKLGDIVEFL----PKSKRKASFGK----EIGKYNFYTSSDKVKKCDEADY- 650 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + ++ G G CS ++L+ + + + + + + Sbjct: 651 --NEECLIIGTGGNS--CIHYNKNKFSCSGDTILLKYNKNI---EYNYFVFNCIWDYLLS 703 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G+T+ H + N +PIP + I + I+ +++ Sbjct: 704 QMNGSTIKHVTKNLLENFTIPIPTS----------DKKIKYWVDRINKPYNKIQECRDRL 753 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 + L + + +E L + + F KN Sbjct: 754 KELEDKVQEDIQTMLEENDTEEVELGVLCDINNKQIKRFNTSYGTKLKN----------- 802 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 K Y + I+ + + Sbjct: 803 -------KYRFYTGSANDIYYCNDFNIKDYVIILNKTNGSGK---CNIFLDKKISCAKQT 852 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y+ + T + + K+ ++L + + + +P + Sbjct: 853 YICQSKNKEIETIYLYYFLRKNKLKLEEGYIGACHKNLDINFLNKFKITLPKD---RKLI 909 Query: 380 NVINVETARIDVLVEKIEQSIVLLKER 406 + +N + ID L E++ + L ++ Sbjct: 910 DSLNPLFSEIDNLNEELPKQETLYQQY 936 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 13/179 (7%), Positives = 49/179 (27%), Gaps = 5/179 (2%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 ++ + + K+++ + +K K Y + V Sbjct: 586 EYTFNHKKYNKKKLIPGDGYKMVKLGDIVEFLPKSKRKASFGKEIGKYNFYTSSDKVKKC 645 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 + + S + + + + + + + + + M Sbjct: 646 DEADYNEECLIIGTGGNSCIHYNKNKFSCSGDTILLKYNKNIEYNYFVFNCIWDYLLSQM 705 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDI---TNVINVETARIDVLVEKIEQSIVLLKE 405 + + ++ + +P I + IN +I +++++ ++E Sbjct: 706 NGSTIKHVTKNLLENFTIPIPTSD--KKIKYWVDRINKPYNKIQECRDRLKELEDKVQE 762 >gi|150006167|ref|YP_001300911.1| type I restriction endonuclease S subunit [Bacteroides vulgatus ATCC 8482] gi|149934591|gb|ABR41289.1| type I restriction endonuclease S subunit [Bacteroides vulgatus ATCC 8482] Length = 226 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 47/129 (36%), Gaps = 7/129 (5%) Query: 285 IVDPGEIVFRFIDLQNDKRSL-RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 ++D +I+F+ + + R + + S A Y+ L+ + + Sbjct: 12 VIDNNDILFQCVRPYQKNNYIHRILNTSNQQWVASTGYAQIRTTELPNYIYHLLNTDEFN 71 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLV-PPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + +G ++ ED+ + + P KEQ I+ +++ +D + + I Sbjct: 72 RKVMVRCTGSSYPAINSEDLATIHLYYTPDKKEQLKISRLLD----LLDKRIATQNKIIE 127 Query: 402 LLKERRSSF 410 L+ Sbjct: 128 KLQSLIKGI 136 Score = 36.7 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 53/142 (37%), Gaps = 8/142 (5%) Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIA------DFDGICSTQFLVLQPKDVLPELL 124 ++ + + IL+ + PY + I + + ST + ++ + LP + Sbjct: 3 EEAPSRAQRVIDNNDILFQCVRPYQKNNYIHRILNTSNQQWVASTGYAQIRTTE-LPNYI 61 Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPI-PPLAEQVLIREKIIAETVRIDT 183 L + + +++ C G++ + + + I + P EQ+ I + RI T Sbjct: 62 YHLLNTDEFNRKVMVRCTGSSYPAINSEDLATIHLYYTPDKKEQLKISRLLDLLDKRIAT 121 Query: 184 LITERIRFIELLKEKKQALVSY 205 + L+K Q + Sbjct: 122 QNKIIEKLQSLIKGIAQHCIKE 143 >gi|113460701|ref|YP_718767.1| type I restriction enzyme, specificity subunit [Haemophilus somnus 129PT] gi|112822744|gb|ABI24833.1| possible type I restriction enzyme, specificity subunit [Haemophilus somnus 129PT] Length = 183 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 60/180 (33%), Gaps = 14/180 (7%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 WE + + ++ K + S +G I + ++ +S +TY+ Sbjct: 13 FPEFTHAWEQRKAKEIFISVSEKGFPHLPVLSASQEFGMIRRDDIGIDIKYDQKSTQTYK 72 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH---GIDSTYLAWLMRSYD 341 V PG+ V Q A GI + AY + S + + S Sbjct: 73 RVSPGQFVIHLRSFQG-----GFAWSDIEGITSPAYTIIDFKKKENHSSNFWKLIFTSSS 127 Query: 342 LCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 K + G+R +S+ F D L + I+EQ I +D + ++ Sbjct: 128 FIKKLETVTYGIRDGRSISFSDFSDLRLFYSQIQEQQKIGTF----FTALDRYITIHQRK 183 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 48/168 (28%), Gaps = 5/168 (2%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLE-DVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ K + + + + + + D Q T T + Sbjct: 20 WEQRKAKEIFISVSEKGFP---HLPVLSASQEFGMIRRDDIGIDIKYDQKSTQTYKRVSP 76 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 GQ + L + +D +GI S + ++ K W L + I+ + Sbjct: 77 GQFVI-HLRSFQGGFAWSDIEGITSPAYTIIDFKKKENHSSNFWKLIFTSSSFIKKLETV 135 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + Q+ ++KI +D IT R Sbjct: 136 TYGIRDGRSISFSDFSDLRLFYSQIQEQQKIGTFFTALDRYITIHQRK 183 >gi|295092358|emb|CBK78465.1| Type I restriction modification DNA specificity domain. [Clostridium cf. saccharolyticum K10] Length = 71 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 9/46 (19%), Positives = 15/46 (32%) Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + G G+ + V +PP+ EQ I +I Sbjct: 23 DQIKSKGQGVIPGIDRNSVMNFLFPLPPLPEQRRIVKKQQELFDKI 68 >gi|229088749|ref|ZP_04220306.1| Type I restriction-modification system specificity subunit [Bacillus cereus Rock3-44] gi|228694574|gb|EEL47993.1| Type I restriction-modification system specificity subunit [Bacillus cereus Rock3-44] Length = 188 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/135 (18%), Positives = 56/135 (41%), Gaps = 9/135 (6%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLM 337 +++ + G++VF F+ K + S + I + + ++ +DS+YL + + Sbjct: 55 NHKESYLSSAGDVVFSFVSS---KAGIVSDLNQGKIINQNFAKLIIEHDYLDSSYLCYAL 111 Query: 338 R-SYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN--VINVETARIDVLV 393 SY + K +M L +K L + +P I++Q I + + Sbjct: 112 NESYSMKKQMAISMQGSTVPKLTPAILKELEIKLPNIEKQRTIGKAYFFLRKRQALAKKQ 171 Query: 394 EKIEQSIVLLKERRS 408 ++E+ + LK + Sbjct: 172 AELEEQL-YLKILKQ 185 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 18/155 (11%), Positives = 53/155 (34%), Gaps = 11/155 (7%) Query: 29 PIKRFTKLNTGRTSESGKDIIYI-----GLEDVESG-TGKYLPKDGNSRQSDTSTV--SI 80 ++ + GR G + + ED+ + G +L +S + + + Sbjct: 2 KLEDIVTVRVGRNLSRGNEKNDLTLVAYSYEDLRNDLDGSFLDSQASSYSGNLNHKESYL 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQF---LVLQPKDVLPELLQGWLLSIDVTQRI 137 + G +++ + + I + F ++ L S + +++ Sbjct: 62 SSAGDVVFSFVSSKAGIVSDLNQGKIINQNFAKLIIEHDYLDSSYLCYALNESYSMKKQM 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 +G+T+ + + + +P + +Q I + Sbjct: 122 AISMQGSTVPKLTPAILKELEIKLPNIEKQRTIGK 156 >gi|253729834|ref|ZP_04863999.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus USA300_TCH959] gi|253726428|gb|EES95157.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus USA300_TCH959] Length = 42 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 9/41 (21%), Positives = 24/41 (58%), Gaps = 4/41 (9%) Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 ++EQ I + + +++D ++ EQ + LL++R+ + + Sbjct: 1 NLEEQQKIGSFL----SKLDRQIDLEEQKLELLQQRKKALL 37 >gi|148642217|ref|YP_001272730.1| type I restriction-modification enzyme, subunit S [Methanobrevibacter smithii ATCC 35061] gi|148551234|gb|ABQ86362.1| predicted type I restriction-modification enzyme, subunit S [Methanobrevibacter smithii ATCC 35061] Length = 102 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 12/64 (18%), Positives = 25/64 (39%), Gaps = 4/64 (6%) Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G L + + + P EQ I+N++ + ID + IE+ I ++ + Sbjct: 6 GITATPILNKKSFENMKFEFPSFDEQKQISNML----SNIDNKIFAIEELINKTQKFKKG 61 Query: 410 FIAA 413 + Sbjct: 62 LLQQ 65 >gi|327383092|gb|AEA54568.1| Restriction modification system DNA specificity domain protein [Lactobacillus casei LC2W] gi|327386276|gb|AEA57750.1| Restriction modification system DNA specificity domain protein [Lactobacillus casei BD-II] Length = 195 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 40/179 (22%), Positives = 59/179 (32%), Gaps = 4/179 (2%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ K + + I + ED+ S G+ S F Sbjct: 20 WEKRKFKDL--VVRVNKTSDDSTIPSVEFEDIISKQGRLNKDVRLKINSKQGIY--FEPQ 75 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +L+GKL PYL+ + F G F VL+ + L+ Q + I G Sbjct: 76 DVLFGKLRPYLQNWLFPSFYGRAVGDFWVLRANSSVLSEYLFVLIQSPRFQIVANISSGT 135 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 M +DW + N PIP +EQ I + I + L K Q L Sbjct: 136 KMPRSDWNTVSNTSFPIPVQSEQRKIWQLFNVLDNLIAATQSRLSSLELLKKSLLQDLF 194 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 48/151 (31%), Gaps = 8/151 (5%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI-VDPGEIVFRFIDLQNDKRSLR 306 N +S I S+ + +II K N ++ + I +P +++F + Sbjct: 33 NKTSDDSTIPSVEFEDIISKQGRLNKDVRLKINSKQGIYFEPQDVLFGKLRPYLQNWLFP 92 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 S + ++ + S YL L++S V + V Sbjct: 93 SFYGR---AVGDFWVLRANSSVLSEYLFVLIQSPRFQIVANISSGTKMPRSDWNTVSNTS 149 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIE 397 +P EQ I +D L+ + Sbjct: 150 FPIPVQSEQRKI----WQLFNVLDNLIAATQ 176 >gi|261867039|ref|YP_003254961.1| restriction modification system DNA specificity subunit [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412371|gb|ACX81742.1| restriction modification system DNA specificity subunit [Aggregatibacter actinomycetemcomitans D11S-1] Length = 317 Score = 50.6 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/333 (12%), Positives = 83/333 (24%), Gaps = 29/333 (8%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + + G + G+Y + N + + A G I G Sbjct: 9 LGDLVEFQRGYDLPKDAFV-----------KGEYPVQSSNGILGYHNEYKVKAPG-ITIG 56 Query: 90 KLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHA 149 + G +I +T V K + + L ++ G + Sbjct: 57 RSGTVGIPHLITKNFFPHNTALYVKDFKG--NNVQYIYYLLKNLKLNEYKTGSGVPTMNR 114 Query: 150 DWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTK 209 + I +Q + +D I + L+E + L Y + Sbjct: 115 NHLHPLKIRAFTNLKTQQSIAAV-----LSALDKKIALNKQINARLEEMAKTLYDYWFVQ 169 Query: 210 GLNPD---VKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 PD K SG E V E+ + + + ++ + +++ N Sbjct: 170 FDFPDANGKPYKSSGGEMVFDETLKREIPKGWEV--KSLGDWAEIKKGTLITEKTANTNG 227 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 ++ + GL Y I I + + + Sbjct: 228 DIKVISAGLDFSYYHDVANRPKNTI---TISASGANAGFVNFWREPIFVCDCTTITNSVI 284 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 G L +L D + + Sbjct: 285 GSTLYILNFLRIVQDFIYQQAR--GSAQPHVSK 315 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 38/132 (28%), Gaps = 12/132 (9%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y V I + +A G + Y+ +L+++ Sbjct: 42 YHNEYKVKAPGITIGRSGT----VGIPHLITKNFFPHNTALYVKDFKGNNVQYIYYLLKN 97 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPV-LVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 L + ++ + L + +K Q I V++ +D + +Q Sbjct: 98 LKLNEY---KTGSGVPTMNRNHLHPLKIRAFTNLKTQQSIAAVLSA----LDKKIALNKQ 150 Query: 399 SIVLLKERRSSF 410 L+E + Sbjct: 151 INARLEEMAKTL 162 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 38/153 (24%), Gaps = 28/153 (18%) Query: 10 YKDSGVQWI------GAIPKHWKVVPIKRFTKLNTG-----RTSESGKDIIYIGLEDVES 58 YK SG + + IPK W+V + + ++ G +T+ + DI I Sbjct: 180 YKSSGGEMVFDETLKREIPKGWEVKSLGDWAEIKKGTLITEKTANTNGDIKVISAG---- 235 Query: 59 GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKD 118 + + + K I G Sbjct: 236 --LDFSYYHDVANR---------PKNTITISASGANAGFVNFWREPIFVCD--CTTITNS 282 Query: 119 VLPELLQGWLLSIDVTQRIEAICEGATMSHADW 151 V+ L V I G+ H Sbjct: 283 VIGSTLYILNFLRIVQDFIYQQARGSAQPHVSK 315 >gi|257425923|ref|ZP_05602347.1| type I restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus 55/2053] gi|257428590|ref|ZP_05604988.1| Sau1hsdS1 [Staphylococcus aureus subsp. aureus 65-1322] gi|257431225|ref|ZP_05607602.1| methyltransferase type [Staphylococcus aureus subsp. aureus 68-397] gi|257433906|ref|ZP_05610264.1| TypeIrestrictionenzyme,specificitysubunit [Staphylococcus aureus subsp. aureus E1410] gi|257436822|ref|ZP_05612866.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus M876] gi|282914605|ref|ZP_06322391.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus M899] gi|282924951|ref|ZP_06332617.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus C101] gi|293503682|ref|ZP_06667529.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus 58-424] gi|293510699|ref|ZP_06669404.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus M809] gi|293537240|ref|ZP_06671920.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus M1015] gi|257271617|gb|EEV03763.1| type I restriction modification DNA specificity protein [Staphylococcus aureus subsp. aureus 55/2053] gi|257275431|gb|EEV06918.1| Sau1hsdS1 [Staphylococcus aureus subsp. aureus 65-1322] gi|257278173|gb|EEV08821.1| methyltransferase type [Staphylococcus aureus subsp. aureus 68-397] gi|257281999|gb|EEV12136.1| TypeIrestrictionenzyme,specificitysubunit [Staphylococcus aureus subsp. aureus E1410] gi|257284173|gb|EEV14296.1| type I restriction-modification enzyme, S subunit [Staphylococcus aureus subsp. aureus M876] gi|282313317|gb|EFB43713.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus C101] gi|282321786|gb|EFB52111.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus M899] gi|290920085|gb|EFD97153.1| type I restriction-modification system, S subunit, EcoA family [Staphylococcus aureus subsp. aureus M1015] gi|291095348|gb|EFE25613.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus 58-424] gi|291466590|gb|EFF09111.1| type I restriction enzyme, S subunit [Staphylococcus aureus subsp. aureus M809] Length = 282 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 36/293 (12%), Positives = 78/293 (26%), Gaps = 30/293 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + K+N+G+ + +E G G + Sbjct: 20 EWEEKKLGDLIKVNSGKDYK-----------HLEKGDIPVYGTGGYMTSVSEP---LSEI 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + ++ T F K+ + + E Sbjct: 66 DAVGIGRKGTINKPYLLEAPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I I +P EQ I E I +I+ + + K Q + Sbjct: 122 TGVPSLSKQTINKINRFVPSNKEQQKIGEFFIKLDRQIELEEQKLELLQQQKKGYMQKIF 181 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 S + + G +WE K + +++ T + Sbjct: 182 SQELRFK------------DENGNDYPNWEEKKIEDIASQVYGGGTPNTKIKEFWNGDIP 229 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 IQ + + L + + E+ + N + V + ++ Sbjct: 230 WIQSSDVKVNDLILRQCNKFISKNSIELSSAKLIPANSIAIVTRVGVGKLCLV 282 Score = 43.2 bits (100), Expect = 0.079, Method: Composition-based stats. Identities = 21/130 (16%), Positives = 41/130 (31%), Gaps = 15/130 (11%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRS-----LRSAQVMERGIITSAYMAVKPHGID 329 +K S + Y+ ++ G+I S + + + +G I Y+ P Sbjct: 30 IKVNSGKDYKHLEKGDIPVYGTGGYMTSVSEPLSEIDAVGIGRKGTINKPYLLEAPFWTV 89 Query: 330 STYLA----------WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 T +++ + S SL + + ++ VP KEQ I Sbjct: 90 DTLFYCTPKKETDILFILSLFRKINWKVYDESTGVPSLSKQTINKINRFVPSNKEQQKIG 149 Query: 380 NVINVETARI 389 +I Sbjct: 150 EFFIKLDRQI 159 >gi|309812887|ref|ZP_07706619.1| conserved hypothetical protein [Dermacoccus sp. Ellin185] gi|308433165|gb|EFP57065.1| conserved hypothetical protein [Dermacoccus sp. Ellin185] Length = 81 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 16/67 (23%), Positives = 29/67 (43%) Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +L ++ L V ++ Q ++ + +SI LL E + S I AAV Sbjct: 4 NLPSGVIRGLRVPQLSLRGQGEVVERLAARQQADRNFEAVTLRSIELLTEYKQSLITAAV 63 Query: 416 TGQIDLR 422 +G+ D+ Sbjct: 64 SGEFDVT 70 >gi|238754322|ref|ZP_04615679.1| HsdS-like DNA methylase [Yersinia ruckeri ATCC 29473] gi|238707569|gb|EEP99929.1| HsdS-like DNA methylase [Yersinia ruckeri ATCC 29473] Length = 108 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 14/101 (13%), Positives = 29/101 (28%), Gaps = 4/101 (3%) Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + LM S + GS + L + + V++PP I Sbjct: 1 MSDNACPIFTFGQLMLSLEALIERLGEGSTGQTELSRKILSEQFVVLPPFD----IAEKA 56 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 Q L + R + + ++G + + Sbjct: 57 ERSFKSFSEKQVSNRQQNSELIKLRDTLLPKLISGDLRISD 97 >gi|302338879|ref|YP_003804085.1| hypothetical protein Spirs_2376 [Spirochaeta smaragdinae DSM 11293] gi|301636064|gb|ADK81491.1| hypothetical protein Spirs_2376 [Spirochaeta smaragdinae DSM 11293] Length = 429 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 62/209 (29%), Gaps = 5/209 (2%) Query: 176 AETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV-KMKDSGIEWVGLVPDHWEV 234 + I + + ++ + + + K E + V Sbjct: 178 FKDHTILFRRLGHNAELLMERKVPIEEMQNASVWIPDRFFIRQKQYLNEKIHTVQLGSLC 237 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNI--IQKLETRNMGLKPESYETYQIVDPGEIV 292 + F E LSL + + + M ++ S ++ G+++ Sbjct: 238 RDIFRGAPGRFFSKEGKEEVRYLSLKHVGAGLLDVNDLSTMRIESVSRIKRYLLRQGDVI 297 Query: 293 FRFIDLQNDKRSL-RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 L + + G + + +D +L +RS + M + Sbjct: 298 VSCRGEFFRPLLLTGKSTIPVTGGDNYVIIRPDLNLVDPGFLFRYLRSRAGQAFLFGMST 357 Query: 352 GLRQS-LKFEDVKRLPVLVPPIKEQFDIT 379 G R L + +PV +PP++ Q + Sbjct: 358 GKRIRVLNVRAMAEIPVPLPPMEMQQRVA 386 >gi|13508104|ref|NP_110053.1| hypothetical protein MPN365 [Mycoplasma pneumoniae M129] gi|12229977|sp|P75416|T1SC_MYCPN RecName: Full=Putative type-1 restriction enzyme specificity protein MPN_365; AltName: Full=S.MpnORFCP; AltName: Full=Type I restriction enzyme specificity protein MPN_365; Short=S protein gi|1674161|gb|AAB96119.1| hypothetical protein MPN_365 [Mycoplasma pneumoniae M129] Length = 268 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 54/176 (30%), Gaps = 10/176 (5%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ-------IVDPGEI 291 + N +I + G I K RN + Y + + Sbjct: 72 RKIYGANIPFETFQVKDICEIRRGRAITKAYIRNNPGENPVYSAATTNDGELGRIKDCDF 131 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 +I + + + + + + T + + K + + Sbjct: 132 DGEYITWTTNGYAGVVFYRNGKFNASQDCGVLKVKNKKICTKFLSFLLKIEAPKFVHNLA 191 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 S R L + + + + PP++ Q I +++ + LVE I + + K++ Sbjct: 192 S--RPKLSQKVMAEIELSFPPLEIQEKIADILFAFEKLCNDLVEGIPAEVEMRKKQ 245 Score = 41.7 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 8/61 (13%), Positives = 22/61 (36%), Gaps = 4/61 (6%) Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +L + ++ + + P K Q I +++ T L ++ + R + Sbjct: 12 VIPNLTLKKMREIELDFPSKKIQEKIATILDTFTE----LSAELRERKKQYAFYRDYLLN 67 Query: 413 A 413 Sbjct: 68 Q 68 >gi|301162155|emb|CBW21700.1| putative type I restriction-modification system specificity system, partial [Bacteroides fragilis 638R] Length = 175 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 72/170 (42%), Gaps = 11/170 (6%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 ++ + +LS + I + E + + E+ Y+I+ ++V +L ++ Sbjct: 7 RSRTNNQHEVLSSTVKGIFSQREYFSKDIASENNVGYKIIRLHDVVLSPQNLWM--GNIN 64 Query: 307 SAQVMERGIITSAYMAV-KPHGIDSTYLAWLMRS----YDLCKVFYAMGSGLRQSLKFED 361 E GI++ +Y G D+ ++A ++++ Y V S +R++L E Sbjct: 65 YNDRFEIGIVSPSYKVFSIADGYDNQFVAAMLKTHRALYSYMMVSEQGASIVRRNLNMEA 124 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 +L +P + +Q +I I++ +++ + + I ++ + Sbjct: 125 FSQLVFKIPSLDKQREIGCAISLLKSQL----KTANKIIRAYTSQKQYLL 170 >gi|319896580|ref|YP_004134773.1| haeiv restriction/modification system [Haemophilus influenzae F3031] gi|317432082|emb|CBY80432.1| HaeIV restriction/modification system [Haemophilus influenzae F3031] Length = 1062 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 35/113 (30%), Gaps = 3/113 (2%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I + + I S V+ T + +F Sbjct: 940 NTITISASGANAGFVNFWTEKIFASDCTTVRADNYVGTKFIFTYLQSIQENIFDLARGAA 999 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITN---VINVETARIDVLVEKIEQSIVLL 403 + + +D+KRLP+ P+ Q + I+ E R + +E+ I + Sbjct: 1000 QPHVYPDDIKRLPIPKVPLDIQQKVVEECQKIDDEFNRTRMQIEEYRAKIAKI 1052 >gi|291534513|emb|CBL07625.1| Type I restriction modification DNA specificity domain [Roseburia intestinalis M50/1] Length = 199 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 42/176 (23%), Positives = 72/176 (40%), Gaps = 5/176 (2%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG 89 + + S + +GLE + D + + T +F KG +L+G Sbjct: 6 LGEVSHERKETCKGSKEGYPIVGLEHLIPEEITLTTWDEGAENTFT---KMFRKGDVLFG 62 Query: 90 KLGPYLRKAIIADFDGICSTQFLVL--QPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 + YL+KA +A FDGICS V+ P +LPELL + + D+ G+ Sbjct: 63 RRRAYLKKAAVAPFDGICSGDITVIEADPDKILPELLPFIIQNDDLFDFAVGKSAGSLSP 122 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 W+ + N + +P + +Q + E + A + EL+K K A + Sbjct: 123 RVKWEHLKNYELELPDMNKQKELAELLWAIDDTKKSYQKLIAATDELVKSKFAARM 178 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 57/155 (36%), Gaps = 7/155 (4%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 + ++I + T + ++ G+++F K ++ + G Sbjct: 24 YPIVGLEHLIPEEITLTTWDEGAENTFTKMFRKGDVLFGRRRAYLKKAAVAPFDGICSGD 83 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 IT + P I L +++++ DL L +K+E +K + +P + + Sbjct: 84 IT--VIEADPDKILPELLPFIIQNDDLFDFAVGKSAGSLSPRVKWEHLKNYELELPDMNK 141 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 Q ++ ++ ID + ++ I E S Sbjct: 142 QKELAELLWA----IDDTKKSYQKLIAATDELVKS 172 >gi|149005621|ref|ZP_01829360.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP18-BS74] gi|147762561|gb|EDK69521.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP18-BS74] Length = 179 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 61 SEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 120 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 121 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 179 Score = 40.2 bits (92), Expect = 0.74, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 70/177 (39%), Gaps = 17/177 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +W V+ IK +NTG + + K + I +++ L D S+ Sbjct: 2 NWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELLQGWL 128 ++ K L + L D+DG+ + F+ + +++ + L L Sbjct: 62 EQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNL 121 Query: 129 LSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 S ++++AI + G + + + + +P+ P EQ LI +K+ +++ Sbjct: 122 SSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQ 178 >gi|260061351|ref|YP_003194431.1| type I restriction-modification system, M subunit [Robiginitalea biformata HTCC2501] gi|88785483|gb|EAR16652.1| type I restriction-modification system, M subunit [Robiginitalea biformata HTCC2501] Length = 894 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 30/321 (9%), Positives = 80/321 (24%), Gaps = 5/321 (1%) Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I+ K + + + S+ + + + Sbjct: 327 IVISKTKERPGSVLFFPGENYAQPLNNGHYQLMLEDITKDFLRRSVSIQTSDHSRTDEPP 386 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + E+ E E I E+ L+ + Sbjct: 387 FNQLEIDKDSIKSQDYSLDHERYRFEEIEGIELQEIVE--VEKGYQSGLIYVSSIFNRND 444 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + + G + + ++ + ++ + Sbjct: 445 SFLEKFFRKMGFSQLGEPFDKEGVAKNDYGKIVNQNRSKIQEVLRQVKFISTKDLKNDPY 504 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + E +I+ ++ I + +++ + P Sbjct: 505 NYSLEIDSVQFRERSHNARIIVDEVVLVTLIGSSLKPTFVSNSEPPFYLHHQLIALKPNP 564 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF-DITNVIN 383 + +D + + S + + + G L+ +D+ L +P +KEQ ++ Sbjct: 565 NLVDLDWFINHLHSDSIKRQLALLKKGSGISYLRRQDLLSLKFALPSLKEQKSEMVQA-T 623 Query: 384 VETARIDVLVEKIEQSIVLLK 404 +ID L I Q L+ Sbjct: 624 KLYRQIDSLESDIIQQNAYLR 644 >gi|167974338|ref|ZP_02556615.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 11 str. ATCC 33695] gi|188998054|gb|EDU67151.1| restriction-modification enzyme subunit s3a [Ureaplasma urealyticum serovar 11 str. ATCC 33695] Length = 346 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 43/377 (11%), Positives = 108/377 (28%), Gaps = 41/377 (10%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + ++ T ++ +I GL + N+ ++ I Sbjct: 6 KLSSVFEIITTGKQKNTFNINLEGLYPL------ISASTANNGIMGYVDNYLYDGQNITI 59 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ-GWLLSIDVTQRIEAICEGATMS 147 ++G + F++ + + ++ +LL ++ ++I +I G T Sbjct: 60 SRVGNAGTTFYHEGKISLTDNCFILSKINKKIAKVKYVFYLLKLNEDKKIRSISHGTTRK 119 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIV 207 + + N+ + +P + Q I I + + + + +LL Sbjct: 120 IINKTDLDNLIIYLPSIEIQNAIISIIEPLDILENKINKLKTVLKKLLINIYDK------ 173 Query: 208 TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQK 267 + F K S I Sbjct: 174 ----------------------NCNSHVNLFENNKIYTNKYLNQNLYCDTSCIGELEINF 211 Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + N+ L+ + + I+F + +N E + ++ + +K + Sbjct: 212 SKMINISLEDKPSRADLSIKNNSIIFSKLLGENKVYC---FLNNENIVFSTGFFNIKSND 268 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 ++ L + S D + +G + D+ ++ P + +I + Sbjct: 269 ENNDDLLSFLLSSDFKNQKSMLANGTTMIGINNSDLTKVRCKAPFLN--SNIYFTFFNKL 326 Query: 387 ARIDVLVEKIEQSIVLL 403 I+ + IV L Sbjct: 327 NEIENKITLARNKIVNL 343 >gi|154499005|ref|ZP_02037383.1| hypothetical protein BACCAP_02997 [Bacteroides capillosus ATCC 29799] gi|150271845|gb|EDM99071.1| hypothetical protein BACCAP_02997 [Bacteroides capillosus ATCC 29799] Length = 174 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 18/122 (14%), Positives = 44/122 (36%), Gaps = 5/122 (4%) Query: 274 GLKPESYETYQIVDPGEI-VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--- 329 + Y++V G+ + DK + + G++++ Y + + Sbjct: 42 NTIGTDFTKYKVVKRGQFTYIPDTSRRGDKIGIALLMDYDEGLVSNIYTVFEVKDENELL 101 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL + + G + + ++++ ++ + VP I +Q I T R Sbjct: 102 PEYLMLWFSRPEFDRYARFKSHGSVREIMDWDEMCKVELPVPSIDKQRSIVKAYQTITER 161 Query: 389 ID 390 I+ Sbjct: 162 IE 163 >gi|148988250|ref|ZP_01819713.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP6-BS73] gi|147926714|gb|EDK77787.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP6-BS73] Length = 179 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 61 SEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 120 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 121 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 179 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 70/177 (39%), Gaps = 17/177 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +W V+ IK +NTG + + K + I +++ L D S+ Sbjct: 2 NWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELLQGWL 128 ++ K L + L D+DG+ + F+ + +++ + L L Sbjct: 62 EQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNL 121 Query: 129 LSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 S ++++AI + G + + + + +P+ P EQ LI +K+ +++ Sbjct: 122 SSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQ 178 >gi|227892125|ref|ZP_04009930.1| type I restriction modification system protein HsdIA [Lactobacillus salivarius ATCC 11741] gi|227866057|gb|EEJ73478.1| type I restriction modification system protein HsdIA [Lactobacillus salivarius ATCC 11741] Length = 185 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 57/170 (33%), Gaps = 3/170 (1%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 LV + N+ +++ +L + + + I G+IV Sbjct: 1 MKLNELVKIESGINSVRVKNQNYTLYAIEDVNYDLGHGEDYQHDKASGKSITARGDIVIN 60 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SG 352 + R+A M I + + +D YL +L+ + + A Sbjct: 61 TVSNLASVVHSRNAGKMLNQIF-LRLNILDENTLDPWYLCYLLNKSEYIRYQEAAIMDGS 119 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + L +++ L + +P I +Q + + + +EK E L Sbjct: 120 VIRKLTKANLEDLEINLPEIADQKKMGEAYKEIMKKYTLAMEKAELERDL 169 >gi|162448117|ref|YP_001621249.1| site-specific DNA-methyltransferase [Acholeplasma laidlawii PG-8A] gi|161986224|gb|ABX81873.1| site-specific DNA-methyltransferase [Acholeplasma laidlawii PG-8A] Length = 559 Score = 50.6 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 58/144 (40%), Gaps = 5/144 (3%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA-- 322 I + +N+ K + + + I+ K ++ + ++ I+T + Sbjct: 407 IDESNLQNIDNKDGKLDKFALEYEDVIITSKSS--KVKIAVIDFEPKDKIIVTGGMIIAR 464 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 V ++ T+L + S + ++ G+ ++ ++ + + +P + Q++I+ Sbjct: 465 VDKSKLNPTFLKVFLESDQGQLLLKSIQKGISIITINATELSNIIIPLPQLDVQYNISKK 524 Query: 382 INVETARIDVLVEKIEQSIVLLKE 405 N + + + L +I + LK Sbjct: 525 YNRKLSSLMALKAEILKIEDELKN 548 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 60/195 (30%), Gaps = 15/195 (7%) Query: 28 VPIKRFTKLNTGRTS----------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 V + + TG + + D++ G + Sbjct: 364 VKLSEVAHVFTGSQYTVRNFQEALTDENTGYKLLTSSDIQDGLIDESNLQNIDNKDGKLD 423 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQG-WLLSID 132 ++ ++ A+I D + + K L +L S Sbjct: 424 KFALEYEDVIITSKSSKVKIAVIDFEPKDKIIVTGGMIIARVDKSKLNPTFLKVFLESDQ 483 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 +++I +G ++ + + NI +P+P L Q I +K + + L E ++ Sbjct: 484 GQLLLKSIQKGISIITINATELSNIIIPLPQLDVQYNISKKYNRKLSSLMALKAEILKIE 543 Query: 193 ELLKEKKQALVSYIV 207 + LK + ++ Sbjct: 544 DELKNFYYEEIEEVL 558 >gi|227365084|ref|ZP_03849110.1| possible restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] gi|227069878|gb|EEI08275.1| possible restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] Length = 195 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 57/191 (29%), Gaps = 10/191 (5%) Query: 226 GLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI 285 G + + + + E ++ L + Q + + + I Sbjct: 14 GFEKSNLTQIANYKNGLAMQKYRPNSNEESLPVLKIKELNQGNTDDSSDRCSANLDNSVI 73 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 V+ G+I+F + L ++ + V + + ++ + + L Sbjct: 74 VNTGDIIFSWSGTL-----LVKNWTGDKAGLNQHLFKVTSNKYPAWFIYEWTKYHLLRFQ 128 Query: 346 FYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 A G +K D+K V +P ++ + A I + + L Sbjct: 129 AIAAGKATTMGHIKRSDLKSSLVYIPS----QLFLAKMDSQLAPIYSQRLNLIKENQQLS 184 Query: 405 ERRSSFIAAAV 415 + + + + Sbjct: 185 KLKQTLLKKYF 195 Score = 40.2 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 58/189 (30%), Gaps = 10/189 (5%) Query: 21 IPKHWKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 I ++ + + G R + + + + + ++++ G + ++ Sbjct: 11 INDGFEKSNLTQIANYKNGLAMQKYRPNSNEESLPVLKIKELNQGN---TDDSSDRCSAN 67 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 I G I++ G L K D G + + + W + Sbjct: 68 LDNSVIVNTGDIIFSWSGTLLVKNWTGDKAG-LNQHLFKVTSNKYPAWFIYEWTKYHLLR 126 Query: 135 QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + A + TM H + + + IP + ++ + LI E + +L Sbjct: 127 FQAIAAGKATTMGHIKRSDLKSSLVYIPSQLFLAKMDSQLAPIYSQRLNLIKENQQLSKL 186 Query: 195 LKEKKQALV 203 + + Sbjct: 187 KQTLLKKYF 195 >gi|163798239|ref|ZP_02192171.1| hypothetical protein BAL199_08178 [alpha proteobacterium BAL199] gi|159176487|gb|EDP61070.1| hypothetical protein BAL199_08178 [alpha proteobacterium BAL199] Length = 155 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 47/138 (34%), Gaps = 12/138 (8%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER-GIITSAYMAVKPHGIDSTYLAWL 336 E V G++VFR +N +L ++ + K + YLAW+ Sbjct: 6 EDLADRYFVRAGDVVFRSRGERNTASALDERLREAALAVLPLMVLRPKRDVVTPEYLAWI 65 Query: 337 MRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + F G + + L + VP I+ Q I V + + Sbjct: 66 INQPPAQRHFDVAARGTNIRMIPRSSLDDLELDVPDIETQEKIVAV---------NALAE 116 Query: 396 IEQSIVLL-KERRSSFIA 412 E+ + L E R ++ Sbjct: 117 RERELSQLAAETRKKMMS 134 >gi|15900421|ref|NP_345025.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae TIGR4] gi|14971980|gb|AAK74665.1| putative type I restriction-modification system, S subunit [Streptococcus pneumoniae TIGR4] Length = 179 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 61 SEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 120 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 121 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 179 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 70/177 (39%), Gaps = 17/177 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +W V+ IK +NTG + + K + I +++ L D S+ Sbjct: 2 NWVVIKIKDIFSINTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELLQGWL 128 ++ K L + L D+DG+ + F+ + +++ + L L Sbjct: 62 EQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNL 121 Query: 129 LSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 S ++++AI + G + + + + +P+ P EQ LI +K+ +++ Sbjct: 122 SSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQ 178 >gi|306815516|ref|ZP_07449665.1| putative type I restriction-modification enzyme S subunit [Escherichia coli NC101] gi|305851178|gb|EFM51633.1| putative type I restriction-modification enzyme S subunit [Escherichia coli NC101] Length = 72 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 11/53 (20%), Positives = 25/53 (47%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 ED+ + P+ VPP+ Q I +++ + + E + + I L +++ Sbjct: 1 MPRGSKEDIMKYPIPVPPLTWQARIVEILDKFDTLTNSITEGLPREIELRQKQ 53 >gi|20092382|ref|NP_618457.1| type I restriction modification DNA protein [Methanosarcina acetivorans C2A] gi|19917634|gb|AAM06937.1| type I restriction modification DNA protein [Methanosarcina acetivorans C2A] Length = 135 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 16/101 (15%), Positives = 39/101 (38%), Gaps = 10/101 (9%) Query: 316 ITSAYMAVKPHGIDSTYLAW-LMRSYDLCK-----VFYAMGSGLRQSLKFEDVKRLPVLV 369 + ++ + +T L + + Y K + G R+++ ++++L + + Sbjct: 5 VNQHVSIIRTNIRTNTKLYYKFLYCYLCLKRTKEALLSFDADGTRKAITKGNLEKLVLPL 64 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 P EQ I + I + +ID Q L++ + Sbjct: 65 PSYTEQTQIGDFIGLVNDKID----LNNQMNSTLEQIAQTL 101 >gi|209387|gb|AAA72570.1| hsdS specificity protein [synthetic construct] Length = 45 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 13/45 (28%), Positives = 23/45 (51%) Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + +P + EQ I ++ A++D ++EQ +LK R S I Sbjct: 1 IPIPSLAEQKIIAEKLDTLLAQVDSTKARLEQIPQILKRFRQSVI 45 >gi|253569552|ref|ZP_04846962.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251841571|gb|EES69652.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 181 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 59/176 (33%), Gaps = 11/176 (6%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRK-NTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 +++ + + + ++ +N+ K +ES+ + + I R +K + E Sbjct: 3 QFIEMYYNTHNKQTLESVCPIMNKGITPKYVESSSVLVINQACIHWDGQRLGNIKYHNEE 62 Query: 282 ---TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG-IITSAYMAVKPHGI---DSTYLA 334 +I++ G+++ R + I ++ L Sbjct: 63 IPVRKRILESGDVLLNATGNGTLGRCCVFICPSDNNTYINDGHVIALSTDRAVILPEVLN 122 Query: 335 WLMRSYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + D Y GS + + F D+K++ V VP + EQ V+ Sbjct: 123 TYLSLNDTQAEIYRQYVTGSTNQVDIVFSDIKKMKVPVPSMDEQILFVEVLTQADK 178 >gi|229526953|ref|ZP_04416350.1| hypothetical protein VCG_000021 [Vibrio cholerae 12129(1)] gi|229335565|gb|EEO01045.1| hypothetical protein VCG_000021 [Vibrio cholerae 12129(1)] Length = 195 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 45/122 (36%), Gaps = 8/122 (6%) Query: 285 IVDPGEIVFRFIDLQNDKRSL--RSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSY 340 + G+I+ N + A ++ + + V D ++ WL+ Sbjct: 59 YLTTGDILVAARGSHNYAVQVDQLLASTGKQAVAAPHFFVVSLKKKDILPEFMVWLLNQA 118 Query: 341 DLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDI---TNVINVETARIDVLVEKI 396 + F G +S++ ++ PV+VPP +Q I N + E I LV Sbjct: 119 PAQRYFEQNAEGTLTKSIRRSVLEDAPVVVPPFAKQRAIIAMANTLGEEQRLIQRLVNNG 178 Query: 397 EQ 398 E+ Sbjct: 179 ER 180 >gi|322628321|gb|EFY25109.1| type I restriction enzyme EcoEI specificity protein [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-4] Length = 118 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 43/123 (34%), Gaps = 13/123 (10%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +PK W ++ + KL G + + K + I ++++ +G+G Y G + Sbjct: 2 VPKGWMLLQVSDICKLQNGNSFKPHEWDTKGLPIIRIQNL-NGSGNYNYFSGVPQD---- 56 Query: 77 TVSIFAKGQILYGKLGPYL---RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + GQ+L+ G I G+ + + + + E L Sbjct: 57 -KWLVEPGQLLFSWAGTKGVSFGPFIWNGPKGVLNQHIYKVFANENVHEHWLYLALLHIT 115 Query: 134 TQR 136 + Sbjct: 116 QKN 118 >gi|15902490|ref|NP_358040.1| type I restriction-modification system S subunit [Streptococcus pneumoniae R6] gi|116516954|ref|YP_815959.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae D39] gi|15458014|gb|AAK99250.1| Type I restriction enzyme EcoKI specificity protein (S protein) [Streptococcus pneumoniae R6] gi|116077530|gb|ABJ55250.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae D39] Length = 199 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 81 SEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 140 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 141 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 199 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 73/182 (40%), Gaps = 17/182 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 17 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 76 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 77 QFISSEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 136 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 137 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 196 Query: 182 DT 183 + Sbjct: 197 NQ 198 >gi|55821024|ref|YP_139466.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus LMG 18311] gi|55822944|ref|YP_141385.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus CNRZ1066] gi|116627786|ref|YP_820405.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus LMD-9] gi|55737009|gb|AAV60651.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus LMG 18311] gi|55738929|gb|AAV62570.1| type I restriction-modification system specificty subunit [Streptococcus thermophilus CNRZ1066] gi|116101063|gb|ABJ66209.1| Restriction endonuclease S subunit [Streptococcus thermophilus LMD-9] gi|312278345|gb|ADQ63002.1| Restriction endonuclease S subunit [Streptococcus thermophilus ND03] Length = 206 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 65/186 (34%), Gaps = 8/186 (4%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K+V + G+ + + I L D+ S Y S + + Sbjct: 19 KLVRLGDVVDQFKGKAVPAKAEPGEFAVINLSDMTSNGIAYDNLKTFSEERRKLLRFLLE 78 Query: 83 KGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIE 138 G +L G + A+ D + + S+ VL+PK+ L + L ++ ++ Sbjct: 79 DGDVLIASKGTVQKVAVFEDQGKREVVASSNITVLRPKEKLRGFYIKFFLETEIGRAYLD 138 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQ-VLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + + +I +P P+ +Q I + ++ + + + Sbjct: 139 YADKGKAVLNLSTADLLDIKIPEIPIVKQDYQIAAYLRGRADYHRKMVRAEQEWENIQQN 198 Query: 198 KKQALV 203 +AL Sbjct: 199 VTEALF 204 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 12/121 (9%), Positives = 34/121 (28%), Gaps = 4/121 (3%) Query: 266 QKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + N+ E +++ G+++ E ++ + Sbjct: 55 NGIAYDNLKTFSEERRKLLRFLLEDGDVLIASKGTVQKVAVFEDQGKREVVASSNITVLR 114 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF-DITNV 381 + Y+ + + + G +L D+ + + PI +Q I Sbjct: 115 PKEKLRGFYIKFFLETEIGRAYLDYADKGKAVLNLSTADLLDIKIPEIPIVKQDYQIAAY 174 Query: 382 I 382 + Sbjct: 175 L 175 >gi|295837451|ref|ZP_06824384.1| phosphoribosylformylglycinamidine synthase [Streptomyces sp. SPB74] gi|197699691|gb|EDY46624.1| phosphoribosylformylglycinamidine synthase [Streptomyces sp. SPB74] Length = 385 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 48/158 (30%), Gaps = 11/158 (6%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA----YMA 322 + + + +V G+++F + ++ ++ + + ++ Sbjct: 220 RGSESKPVPEDYTVPPAHLVREGDLLFSRANTEDLIGAVALVEEFTGALALPDKLWRFVW 279 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDIT 379 Y+ L R + + SG +++ V + +PP + + Sbjct: 280 HDGQDGHPLYVRHLFRQKEFRRRIRERASGTSGSMKNISQPKVLGIRCGIPPEGLRAEFC 339 Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + ID + L E +S A +G Sbjct: 340 ARV----RSIDASRRAHRGHLAALDELFTSLRHRAFSG 373 Score = 46.7 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 35/89 (39%), Gaps = 12/89 (13%) Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + G D YL + S D+ YA F+ +K PV+ PP+ EQ I Sbjct: 86 ILAAREGFDPRYLYQFLASLDIPDAGYAR--------HFKFLKNFPVVKPPLAEQQRIAA 137 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSS 409 +++ D L K ++ LL S Sbjct: 138 LLDHV----DALRAKRREATTLLDSLAQS 162 >gi|310831373|ref|YP_003970016.1| putative type I restriction modification enzyme, M and S domains [Cafeteria roenbergensis virus BV-PW1] gi|309386557|gb|ADO67417.1| putative type I restriction modification enzyme, M and S domains [Cafeteria roenbergensis virus BV-PW1] Length = 817 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 56/381 (14%), Positives = 114/381 (29%), Gaps = 59/381 (15%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + + + K + ++ + GKY + + + + Sbjct: 491 EWMKLGDICKFLSKSKKQASYG----------NNEGKYNFYTSSYKIKKCDEYDY--EDE 538 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 L +G I D CS+ ++L+ + + L +E EGA Sbjct: 539 CLI--IGTGGNVNIKLDSKFCCSSDNIILKSQY----NKYIYYLLSYNLNLLEKGFEGAG 592 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + H I N+ +PIP Q I ++I I + Sbjct: 593 IKHISKDYIRNLKIPIPSSETQEEIIQQIEILNKEIKNNEDKIKNN------------QN 640 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 I + +K IEW + + + SYGN Sbjct: 641 ISKMYMEMMMKKHQDNIEWN------------------KLGDLCEFLSKSKKQASYGNNE 682 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 K K + + Y D ++ N K + + I+ S Y Sbjct: 683 GKYNFYTSSYKIKKCDEYDYEDE-CLIIGTGGNVNIKLDSKFCCSADNIILKSQYNKYIY 741 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + Y+L + + + + ++ L + +P +K Q +I + ++ E Sbjct: 742 YLLS----------YNLNLLEKGFEGAGIKHISKDYIRNLKIPIPLLKIQNNIVDFLDKE 791 Query: 386 TARIDVLVEKIEQSIVLLKER 406 I+ L + + ++KE Sbjct: 792 NELINKLKLQNDTYKNMIKEI 812 >gi|210630409|ref|ZP_03296444.1| hypothetical protein COLSTE_00328 [Collinsella stercoris DSM 13279] gi|210160491|gb|EEA91462.1| hypothetical protein COLSTE_00328 [Collinsella stercoris DSM 13279] Length = 105 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 12/74 (16%), Positives = 32/74 (43%), Gaps = 3/74 (4%) Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + ++ A + G +S +L +L+++ + + V +PV +P ++ Q Sbjct: 23 LNTSLYATEFKGSNSRFLYYLLKTLPWESY---ATASAVPGINRNHVNAIPVCLPDLECQ 79 Query: 376 FDITNVINVETARI 389 I +++ +I Sbjct: 80 IGIASMLGALDDKI 93 >gi|255284472|ref|ZP_05349027.1| putative type I restriction-modification system specificity determinant [Bryantella formatexigens DSM 14469] gi|255264982|gb|EET58187.1| putative type I restriction-modification system specificity determinant [Bryantella formatexigens DSM 14469] Length = 361 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 46/352 (13%), Positives = 96/352 (27%), Gaps = 26/352 (7%) Query: 39 GRTSESGKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLR- 96 G E+G I + + + G Y + KG I+ K G + Sbjct: 17 GTDDETGDGIPVLRTTNFTNEGVINYSDIVTRTITKKNIDEKFLRKGDIIIEKSGGSDKF 76 Query: 97 -KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIG 155 + FDG +T + + + W G T S + Sbjct: 77 PVGRVIYFDGEDNTYLFNNFTGLLRVKNQEVWYPRYVFYSLFANYQRGGTKSFEN--KTT 134 Query: 156 NIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDV 215 + +I + + +++ I L+E + + ++ Sbjct: 135 GLHNLKTDDYVSKYEVAEIDKKEQILICERLDKLYGIIKLREHELQFLDNLI-------- 186 Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 K IE G + + + SN +++ + G Sbjct: 187 --KARFIEMFGDS-------RINSKGFRTKKGSELFKISNGKAVANDKRFEDGIPAYGGN 237 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 Y + + IV + Q+ L V IT M + DS L + Sbjct: 238 GISWYTDEVLYEQDTIVIGRVGFQSGNVHLVKGPV----WITDNAMYISDFYDDSLCLVF 293 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 L +G + + + ++ ++P + Q + + + Sbjct: 294 LCEMMKQIDFTRLQDAGDLKKVTQKPFMKMDYILPSKQLQDEYVDFVKQVDK 345 >gi|291559579|emb|CBL38379.1| Type I restriction modification DNA specificity domain [butyrate-producing bacterium SSC/2] Length = 224 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 59/204 (28%), Gaps = 7/204 (3%) Query: 216 KMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL 275 K SG E + + + + N + LS ++ Sbjct: 22 PYKSSGGEMTFCKELNQNIPQNWGYTSVGNITVCFDSDRIPLSNHQRQEMKGTIPYYGAT 81 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ-VMERGIITSAYMAVKPHGIDSTYLA 334 Y I ++ D Q + I + ++P S L Sbjct: 82 GIMDYVNCAIFSGDFVLLAEDGSVMDDNGNPILQRISGDVWINNHTHVLQPVNGYSCRLL 141 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +L+ + ++ + ++ +L P + N I ID + Sbjct: 142 YLLLKDIPVSMIK--TGSIQMKINQANLNSYNILNIPDGIRSRFINQIE----PIDTKII 195 Query: 395 KIEQSIVLLKERRSSFIAAAVTGQ 418 +I++ LK+ R+ + + GQ Sbjct: 196 QIQKENDNLKQIRNWLLPMLMNGQ 219 >gi|118480579|ref|YP_879301.1| hypothetical protein pL2_p3 [Lactococcus lactis subsp. lactis] gi|118136319|gb|ABK62798.1| hypothetical protein [Lactococcus lactis subsp. lactis] Length = 159 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/124 (22%), Positives = 53/124 (42%), Gaps = 12/124 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ + + S + + + ED+ +G G+ SR Sbjct: 15 KVPELRFPGFTDDWEQRKLSDI--VVRLTKSSNNNQLPKVEFEDIIAGEGRL--NKDISR 70 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + D ++F ILYGKL PYL+ + +DF GI F V + K+ P+ + + + Sbjct: 71 KFDDRKGTLFEPDNILYGKLRPYLKNWLFSDFKGIALGDFWVFKSKNSEPKFVYSLIQAD 130 Query: 132 DVTQ 135 + + Sbjct: 131 NYQR 134 >gi|326408002|gb|ADZ65069.1| conserved hypothetical protein [Lactococcus lactis subsp. lactis CV56] Length = 159 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/124 (22%), Positives = 53/124 (42%), Gaps = 12/124 (9%) Query: 20 AIPK--------HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR 71 +P+ W+ + + S + + + ED+ +G G+ SR Sbjct: 15 KVPELRFPGFTDDWEQRKLSDI--VVRLTKSSNNNQLPKVEFEDIIAGEGRL--NKDISR 70 Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + D ++F ILYGKL PYL+ + +DF GI F V + K+ P+ + + + Sbjct: 71 KFDDRKGTLFEPDNILYGKLRPYLKNWLFSDFKGIALGDFWVFKSKNSEPKFVYSLIQAD 130 Query: 132 DVTQ 135 + + Sbjct: 131 NYQR 134 >gi|167767084|ref|ZP_02439137.1| hypothetical protein CLOSS21_01602 [Clostridium sp. SS2/1] gi|167711059|gb|EDS21638.1| hypothetical protein CLOSS21_01602 [Clostridium sp. SS2/1] Length = 249 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 55/191 (28%), Gaps = 7/191 (3%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKD----IIYIGLEDVESGTGKYLPKDGNSR--QSD 74 IP W+V P+ G + I + ++ S T + + Sbjct: 52 IPAGWQVKPMGTICSFRNGINYNKNVEGNTTYKIINVRNISSSTLFLDESNFDEICLPRQ 111 Query: 75 TSTVSIFAKGQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + I+ + G P + + I F++ L Sbjct: 112 QGDKYCVSDESIIIARSGIPGATRILCNPSSNIIFCGFIICCTPYNNTLQNYLTLYLKQF 171 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIE 193 G+ + + + + N+ +PIPP + + + I I E ++ Sbjct: 172 EGSSATQTGGSILKNVSQETLKNLLVPIPPQSLLNQFNDSVSHIYNLIIGNIKENVQLTT 231 Query: 194 LLKEKKQALVS 204 L L++ Sbjct: 232 LRDWLLPMLMN 242 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 50/186 (26%), Gaps = 6/186 (3%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + NT N+ ++S + + P V I+ Sbjct: 65 CSFRNGINYNKNVEGNTTYKIINVRNISSSTLFLDESNFDEICLPRQQGDKYCVSDESII 124 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + R L I + P+ L G Sbjct: 125 IARSGIPGATRILC--NPSSNIIFCGFIICCTPYNNTLQNYLTLYLKQFEGSSATQTGGS 182 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +++ E +K L V +PP + N N + I L+ + V L R + Sbjct: 183 ILKNVSQETLKNLLVPIPP----QSLLNQFNDSVSHIYNLIIGNIKENVQLTTLRDWLLP 238 Query: 413 AAVTGQ 418 + GQ Sbjct: 239 MLMNGQ 244 >gi|295112013|emb|CBL28763.1| Type I restriction modification DNA specificity domain. [Synergistetes bacterium SGP1] Length = 66 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 6/60 (10%), Positives = 27/60 (45%), Gaps = 4/60 (6%) Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + ++ ++++ + + +PPI+ Q + ++D +++++ + S + Sbjct: 7 GQANINAQELQSIGIYIPPIELQKEFVAF----KEQLDKSKIAVQKALDEAQLLFDSLMQ 62 >gi|58038321|ref|YP_190290.1| hypothetical protein GOX2570 [Gluconobacter oxydans 621H] gi|58000735|gb|AAW59634.1| hypothetical protein GOX2570 [Gluconobacter oxydans 621H] Length = 198 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 50/137 (36%), Gaps = 16/137 (11%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDL 342 + PG+I+F + + + + + + ++ D YLAW + Sbjct: 66 WLRPGDILFPARGNVSLAVLINESVGSLQAVAAPHFFLLRVSRSDVLPAYLAWWLNQEPA 125 Query: 343 CKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + A S L +++ ++ PVL+PP+ Q I L +++ Sbjct: 126 QRHLEQNAQSSTLVRNIARPVLEATPVLLPPLPRQEQIV-----------GLASAMQREE 174 Query: 401 VLLKERRSSFIAAAVTG 417 LL R + +TG Sbjct: 175 DLLHRLRQT-NHQIMTG 190 >gi|303260806|ref|ZP_07346759.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP-BS293] gi|303265413|ref|ZP_07351317.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS397] gi|302638055|gb|EFL68537.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP-BS293] gi|302645054|gb|EFL75297.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS397] Length = 180 Score = 50.2 bits (118), Expect = 7e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 60 SEQVYLKHNQLITPVSTSIEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 119 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 120 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 178 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 71/178 (39%), Gaps = 16/178 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 +W V+ IK +NTG + + K + I +++ L D S+ Sbjct: 2 NWVVIKIKDIFSINTGLSYKKGDLSINKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISSE 61 Query: 79 SIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELLQGWLL 129 ++ K L + + D+DG+ + F+ + +++ + L L Sbjct: 62 QVYLKHNQLITPVSTSIEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNLS 121 Query: 130 SIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 S ++++AI + G + + + + +P+ P EQ LI +K+ +++ L Sbjct: 122 SPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQLW 179 >gi|148993499|ref|ZP_01822990.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP9-BS68] gi|148996903|ref|ZP_01824621.1| phosphoglycerate kinase [Streptococcus pneumoniae SP11-BS70] gi|149003723|ref|ZP_01828568.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS69] gi|149025496|ref|ZP_01836432.1| phosphoglycerate kinase [Streptococcus pneumoniae SP23-BS72] gi|168485037|ref|ZP_02709975.1| type I restriction enzyme EcoKI specificity protein [Streptococcus pneumoniae CDC1873-00] gi|168490390|ref|ZP_02714589.1| type I restriction enzyme EcoKI specificity protein [Streptococcus pneumoniae SP195] gi|147757478|gb|EDK64517.1| phosphoglycerate kinase [Streptococcus pneumoniae SP11-BS70] gi|147758285|gb|EDK65286.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP14-BS69] gi|147927868|gb|EDK78889.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP9-BS68] gi|147929446|gb|EDK80442.1| phosphoglycerate kinase [Streptococcus pneumoniae SP23-BS72] gi|172041856|gb|EDT49902.1| type I restriction enzyme EcoKI specificity protein [Streptococcus pneumoniae CDC1873-00] gi|183571292|gb|EDT91820.1| type I restriction enzyme EcoKI specificity protein [Streptococcus pneumoniae SP195] gi|332074784|gb|EGI85257.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17570] gi|332077787|gb|EGI88248.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA41301] Length = 181 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 61 SEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 120 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 121 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 179 Score = 39.8 bits (91), Expect = 0.95, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 71/179 (39%), Gaps = 17/179 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +W V+ IK +NTG + + K + I +++ L D S+ Sbjct: 2 NWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELLQGWL 128 ++ K L + L D+DG+ + F+ + +++ + L L Sbjct: 62 EQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNL 121 Query: 129 LSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 S ++++AI + G + + + + +P+ P EQ LI +K+ +++ L Sbjct: 122 SSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQLW 180 >gi|303253837|ref|ZP_07339965.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS455] gi|303263135|ref|ZP_07349061.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP14-BS292] gi|303267728|ref|ZP_07353543.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS457] gi|303270103|ref|ZP_07355809.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS458] gi|302599201|gb|EFL66219.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS455] gi|302635722|gb|EFL66231.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae SP14-BS292] gi|302640365|gb|EFL70806.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS458] gi|302642738|gb|EFL73070.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS457] Length = 182 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 62 SEQVYLKHNQLITPVSTSIEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 121 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 122 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 180 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 73/181 (40%), Gaps = 16/181 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDT 75 IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 1 IPMNWVVIKIKDIFSINTGLSYKKGDLSINKGVRIIRGGNIKPLEFSLLDNDYYIDTQFI 60 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELLQG 126 S+ ++ K L + + D+DG+ + F+ + +++ + L Sbjct: 61 SSEQVYLKHNQLITPVSTSIEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLF 120 Query: 127 WLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 L S ++++AI + G + + + + +P+ P EQ LI +K+ +++ L Sbjct: 121 NLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 180 Query: 185 I 185 Sbjct: 181 W 181 >gi|291087311|ref|ZP_06346079.2| putative Type I restriction modification DNA specificity protein [Clostridium sp. M62/1] gi|291075336|gb|EFE12700.1| putative Type I restriction modification DNA specificity protein [Clostridium sp. M62/1] Length = 245 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 50/135 (37%), Gaps = 10/135 (7%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII-TSAYMAVKPHGIDSTYLAWLMRSY 340 T I G+IV + G+ + + VK + I +LA+ + ++ Sbjct: 119 TNTIEHDGDIVMVAR--VGANAGKVNFFSGRCGVTDNTLVIRVKENTIHPKFLAYFLENF 176 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 DL K+ + + + +K++ V K Q IT + +D ++ + + Sbjct: 177 DLHKLIF---GSGQPLVTGGQLKKIQSPVIAYKAQLLITRSLES----LDRIIGTQDTYM 229 Query: 401 VLLKERRSSFIAAAV 415 L + +S + Sbjct: 230 EKLIQLKSGLMQRLF 244 >gi|321310217|ref|YP_004192546.1| type I restriction-modification system, S subunit (fragment) [Mycoplasma haemofelis str. Langford 1] gi|319802061|emb|CBY92707.1| type I restriction-modification system, S subunit (fragment) [Mycoplasma haemofelis str. Langford 1] Length = 132 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 13/93 (13%), Positives = 30/93 (32%), Gaps = 3/93 (3%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMGSG 352 + + + ++ + P+ YL + S GS Sbjct: 1 MTAVGACCGKVGINLTDQEFFFSNNVLKFSPNEKLLTKRYLYHFLLSQQEEIEGMRKGSS 60 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + +KRL + VP ++ Q I+ ++ Sbjct: 61 -QPFVGQSALKRLKIPVPSLETQMKISETLDKF 92 >gi|42779915|ref|NP_977162.1| type I restriction-modification system, M subunit, putative [Bacillus cereus ATCC 10987] gi|42735833|gb|AAS39770.1| type I restriction-modification system, M subunit, putative [Bacillus cereus ATCC 10987] Length = 613 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 53/154 (34%), Gaps = 7/154 (4%) Query: 26 KVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQS-DTSTVSIF 81 K V + +L G +S I I D++ G + S + Sbjct: 422 KTVELGEIAELTNGINIKSSDGQHSIQIIKASDIQGGKISVDELESVSVADLSVIQKAKV 481 Query: 82 AKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIE 138 G I+ G ++ A++ G S ++++PK+ + L ++E Sbjct: 482 QAGDIVLLSRGTSIKFAVVPKGIGNAYASMNLMIIRPKEGVDPYFIQTFLESPFGIWQME 541 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 I G T+ + +I +P Q+ + + Sbjct: 542 QIQTGTTIQLIKLGDMKSIRVPSLTQEVQIQVGK 575 >gi|298256070|ref|ZP_06979656.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae str. Canada MDR_19A] gi|298502304|ref|YP_003724244.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae TCH8431/19A] gi|298237899|gb|ADI69030.1| possible type I restriction-modification system, S subunit [Streptococcus pneumoniae TCH8431/19A] Length = 181 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 61 SEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 120 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 121 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 179 Score = 39.8 bits (91), Expect = 0.97, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 71/179 (39%), Gaps = 17/179 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +W V+ IK +NTG + + K + I +++ L D S+ Sbjct: 2 NWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELLQGWL 128 ++ K L + L D+DG+ + F+ + +++ + L L Sbjct: 62 EQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNL 121 Query: 129 LSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 S ++++AI + G + + + + +P+ P EQ LI +K+ +++ L Sbjct: 122 SSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQLW 180 >gi|237649418|ref|ZP_04523670.1| type I restriction enzyme [Streptococcus pneumoniae CCRI 1974] gi|237821511|ref|ZP_04597356.1| type I restriction enzyme [Streptococcus pneumoniae CCRI 1974M2] Length = 183 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 63 SEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 122 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 123 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 181 Score = 43.6 bits (101), Expect = 0.068, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 73/182 (40%), Gaps = 17/182 (9%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 1 IPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDTQF 60 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELLQ 125 S+ ++ K L + L D+DG+ + F+ + +++ + L Sbjct: 61 ISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLL 120 Query: 126 GWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L S ++++AI + G + + + + +P+ P EQ LI +K+ +++ Sbjct: 121 FNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQ 180 Query: 184 LI 185 L Sbjct: 181 LW 182 >gi|183603915|ref|ZP_02723115.2| type I restriction enzyme EcoKI specificity protein [Streptococcus pneumoniae MLV-016] gi|225856226|ref|YP_002737737.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae P1031] gi|183577150|gb|EDT97678.1| type I restriction enzyme EcoKI specificity protein [Streptococcus pneumoniae MLV-016] gi|225725045|gb|ACO20897.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae P1031] Length = 201 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 81 SEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 140 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 141 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 199 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 17 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 76 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 77 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 136 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 137 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 196 Query: 182 DTLI 185 + L Sbjct: 197 NQLW 200 >gi|331266254|ref|YP_004325884.1| type I restriction-modification system, S subunit, putative [Streptococcus oralis Uo5] gi|326682926|emb|CBZ00543.1| type I restriction-modification system, S subunit, putative [Streptococcus oralis Uo5] Length = 180 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 53/158 (33%), Gaps = 13/158 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + + +G +S + + I + DVE G + + Sbjct: 2 KKVKLGEVCDILSGYAFKSSQFNDKKIGLPLIRIRDVERGFSDTYFEGAYPEE------Y 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ + + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKIMNKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 T+ H I +I +P + EQ +I +K+ Sbjct: 115 KTPFVTVKHLSVAKIKDISFFLPDIQEQKIISKKLDTI 152 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 14/140 (10%), Positives = 37/140 (26%), Gaps = 10/140 (7%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + +Y ++ G+++ + + ++ +K Sbjct: 42 FSDTYFEGAYPEEYLIKNGDLLITMDGS-----FILKKWEGDLALLNQRVCKIKIMNKSV 96 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 K + L +K + +P I+EQ I+ ++ I Sbjct: 97 DEGYISWLIPKFLKEIEDKTPFVTVKHLSVAKIKDISFFLPDIQEQKIISKKLDT----I 152 Query: 390 DVLVEKIEQSIVLLKERRSS 409 + ++ E S Sbjct: 153 RQIYNFRKKQSEKYNELVKS 172 >gi|301162156|emb|CBW21701.1| putative type IC restriction-modification system specificity subunit, partial [Bacteroides fragilis 638R] Length = 201 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 12/127 (9%), Positives = 41/127 (32%), Gaps = 2/127 (1%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII--TSAYMA 322 + ++ + G+++F + + + + ++ + Sbjct: 52 TYREIISHVESYTNKSDGMTFSKKGDLLFPSSTTVDAVSLITPSAINIDNVVLGGDMFGI 111 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 ++ YL++ + L ++D+++ +L+P + EQ N + Sbjct: 112 HINSDYNAQYLSYYFNHIAKKQFAKYAKGSTIIHLHYKDIEKNKLLLPCLIEQNKTANNL 171 Query: 383 NVETARI 389 +I Sbjct: 172 ISLDEKI 178 >gi|183981974|ref|YP_001850265.1| type I restriction/modification system specificity determinant HsdS (S protein) [Mycobacterium marinum M] gi|183175300|gb|ACC40410.1| type I restriction/modification system specificity determinant HsdS (S protein) [Mycobacterium marinum M] Length = 361 Score = 49.8 bits (117), Expect = 9e-04, Method: Composition-based stats. Identities = 46/345 (13%), Positives = 99/345 (28%), Gaps = 48/345 (13%) Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I+ G++G Y D D + V + D + L A G+ Sbjct: 46 IVVGRVGSYCGSVRYCDSDVWVTDNAYVCRANDPAETRYWYYALQTCRLNEHRA---GSG 102 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + + + + + E+ I E + A +I L+ ++ + Sbjct: 103 QPLLNQRTLREVSVHVAQAPERRRIAEVLGALDDKIANNERVIEAAEALMVAMVGSVDAR 162 Query: 206 IVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII 265 + GL K V H+ + F A Sbjct: 163 VALSGL-ARRSTKLVNPADFDDVVAHFSLPAFDAGAHARPVAG----------------- 204 Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + ++F ++ + + + + + ++ + P Sbjct: 205 -----------ASVKSGKFHLSEPCVLFAKLNPRVPRIWNVVRLPPQMALASCEFVVLSP 253 Query: 326 HGIDSTYLAWLMRSYDL---CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 G+D++ L +R ++ A SG Q + +D+ L V + Sbjct: 254 LGVDTSVLWSALRQPEVSTSLAQLVAGTSGSHQRIGPKDLLDLQVPD---------VRRL 304 Query: 383 -NVETARIDVLVEKIEQ---SIVLLKERRSSFIAAAVTGQIDLRG 423 ++A I L L R + + VTG++ + G Sbjct: 305 GAAQSATITDLGALCHARRGQCAQLAALRDALLPGLVTGEVAVSG 349 >gi|169834523|ref|YP_001694007.1| type I restriction modification DNA specificity domain-containing protein [Streptococcus pneumoniae Hungary19A-6] gi|168997025|gb|ACA37637.1| Type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae Hungary19A-6] Length = 201 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 81 SEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 140 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 141 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 199 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 17 GNIPMNWGVIKIKDIFSINTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 76 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 77 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 136 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 137 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 196 Query: 182 DTLI 185 + L Sbjct: 197 NQLW 200 >gi|223933197|ref|ZP_03625188.1| conserved hypothetical protein [Streptococcus suis 89/1591] gi|223898127|gb|EEF64497.1| conserved hypothetical protein [Streptococcus suis 89/1591] Length = 141 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 47/107 (43%), Gaps = 7/107 (6%) Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV-P 370 E I S Y P+ + + S Y + +GL +++ + + + + + Sbjct: 41 EEAEIPSHYAVFLPNDMVLPKYLYHAISCQAGHFIYTVQTGL--NIQMDTLNEMKLKIHT 98 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 +++Q +I ++V I+ + K E +I LLK+ + + ++ G Sbjct: 99 DLEKQAEIVKYLDV----IEKMEAKEEATIDLLKQAKQTNLSKMFVG 141 >gi|323143062|ref|ZP_08077766.1| type I restriction modification DNA specificity domain protein [Succinatimonas hippei YIT 12066] gi|322417163|gb|EFY07793.1| type I restriction modification DNA specificity domain protein [Succinatimonas hippei YIT 12066] Length = 575 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 50/123 (40%), Gaps = 9/123 (7%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAW 335 ES +V G+I+ + + + +K D +L W Sbjct: 444 ESSTLKNLVHKGDIILAIKGSVGKVGIITEEHPNWLAGQSFVILRIKEECADWTPDFLFW 503 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKE-QFDITNVINVETARIDVLV 393 ++S + + + +G Q LK +DVK L + +PP KE Q I N + +++ ++ Sbjct: 504 QLKSKKINQFLKNVATGALIQLLKMDDVKNLKL-LPPAKELQEKIVN---AQKKKLE-II 558 Query: 394 EKI 396 KI Sbjct: 559 AKI 561 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 60/166 (36%), Gaps = 10/166 (6%) Query: 28 VPIKRFTKLNTGRTSESGKD---IIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSIFAK 83 V + + + S+ + IG D+ SG + K+ + ++ ++ K Sbjct: 395 VKLADIANIYRAQASKKEETGSSYFEIGAADINASGIVEQPTKEILIGKESSTLKNLVHK 454 Query: 84 GQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPK----DVLPELLQGWLLSIDVTQRI 137 G I+ G + II + + F++L+ K D P+ L L S + Q + Sbjct: 455 GDIILAIKGSVGKVGIITEEHPNWLAGQSFVILRIKEECADWTPDFLFWQLKSKKINQFL 514 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + GA + + N+ + P Q I + I Sbjct: 515 KNVATGALIQLLKMDDVKNLKLLPPAKELQEKIVNAQKKKLEIIAK 560 >gi|298254226|ref|ZP_06977812.1| type I restriction-modification system subunit S [Streptococcus pneumoniae str. Canada MDR_19A] Length = 180 Score = 49.8 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 61/188 (32%), Gaps = 17/188 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + ++ +G +S + + I + DVE G + Sbjct: 2 KKVKLGEVCEILSGYAFKSSQFNDNKIGLPLIRIRDVERGFSD------TYFEGTYPEEY 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ D + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKITDKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H I +I +P EQ LI +K+ I + R E E Sbjct: 115 KTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLN----TISQIYDFRKIQSEKFNELV 170 Query: 200 QALVSYIV 207 ++ + + Sbjct: 171 KSRFNEMF 178 Score = 44.4 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 38/137 (27%), Gaps = 6/137 (4%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + +Y ++ G+++ + + ++ +K Sbjct: 42 FSDTYFEGTYPEEYLIKNGDLLITMDGS-----FILKKWEGDLALLNQRVCKIKITDKSV 96 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 K + L +K + ++P EQ I +N + Sbjct: 97 DEGYISWLIPKFLKEIEDKTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLNTISQIY 156 Query: 390 DVLVEKIEQSIVLLKER 406 D + E+ L+K R Sbjct: 157 DFRKIQSEKFNELVKSR 173 >gi|149012616|ref|ZP_01833613.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] gi|147763421|gb|EDK70358.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] Length = 239 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 23/199 (11%), Positives = 62/199 (31%), Gaps = 18/199 (9%) Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 L+ + + + G +P +W V + + + K + +I + II+ Sbjct: 40 LDISIVSQGDDNSYYGNIPMNWVVIKIKDIFSINTGLSYKKGDLSINN-KGVRIIRGGNI 98 Query: 271 RNMGLKPESYETYQ----------IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + + + Y + +++ G++ + Sbjct: 99 KPLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGF 158 Query: 321 MA----VKPHGIDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + I S +L + + S K + ++ + L + + P + Sbjct: 159 IFQLTPFESSEIISKFLLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFE 218 Query: 374 EQFDITNVINVETARIDVL 392 EQ IT + +++ L Sbjct: 219 EQELITQKVEKLFEKVNQL 237 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 55 GNIPMNWVVIKIKDIFSINTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 114 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 115 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 174 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 175 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 234 Query: 182 DTLI 185 + L Sbjct: 235 NQLW 238 >gi|255693567|ref|ZP_05417242.1| putative type I restriction modification DNA specificity domain protein [Bacteroides finegoldii DSM 17565] gi|260620633|gb|EEX43504.1| putative type I restriction modification DNA specificity domain protein [Bacteroides finegoldii DSM 17565] Length = 193 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/181 (11%), Positives = 51/181 (28%), Gaps = 10/181 (5%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQK-LETRNMGLKPESYETYQIVDPGEIVFRF 295 + + + K+ + L + N K +R + + +++F Sbjct: 16 IAMMQSGIYMKSDPAGDIKYLQVKDINPRSKPDYSRITTVVDRGIGDQYRLRKNDLLFAA 75 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-R 354 N + + + I +L + + + G Sbjct: 76 KGASNYCFLYDGVVEKMVASSSFIIIRIISKDILPEFLCCFLNTPSVLNKLKKSSVGTGI 135 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 Q + + L + +P ++ Q I ++D L + E + E + S Sbjct: 136 QVIPQSVLSDLQIGIPSMQTQQLIV--------QMDQLRREGESIYSEINELKRSLQEQL 187 Query: 415 V 415 + Sbjct: 188 L 188 Score = 41.7 bits (96), Expect = 0.26, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 58/169 (34%), Gaps = 8/169 (4%) Query: 26 KVVPIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 + I + +G S+ DI Y+ ++D+ + + K Sbjct: 9 DIKRISDIAMMQSGIYMKSDPAGDIKYLQVKDINPRSKPDYSRITTVVDRGIGDQYRLRK 68 Query: 84 GQILYGKLGPYLRKAIIA----DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 +L+ G + S + + KD+LPE L +L + V +++ Sbjct: 69 NDLLFAAKGASNYCFLYDGVVEKMVASSSFIIIRIISKDILPEFLCCFLNTPSVLNKLKK 128 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK--IIAETVRIDTLIT 186 G + + ++ + IP + Q LI + + E I + I Sbjct: 129 SSVGTGIQVIPQSVLSDLQIGIPSMQTQQLIVQMDQLRREGESIYSEIN 177 >gi|298229901|ref|ZP_06963582.1| type I site-specific deoxyribonuclease chain S [Streptococcus pneumoniae str. Canada MDR_19F] Length = 191 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 71 SEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 130 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + ++ + L + + P +EQ IT + +++ L Sbjct: 131 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 189 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 74/184 (40%), Gaps = 17/184 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 7 GNIPMNWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 66 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPEL 123 S+ ++ K L + L D+DG+ + F+ + +++ + Sbjct: 67 QFISSEQVYLKHNQLITPVATSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKF 126 Query: 124 LQGWLLSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 L L S ++++AI + G + + + + +P+ P EQ LI +K+ ++ Sbjct: 127 LLFNLSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQELITQKVEKLFEKV 186 Query: 182 DTLI 185 + L Sbjct: 187 NQLW 190 >gi|260171382|ref|ZP_05757794.1| hypothetical protein BacD2_05910 [Bacteroides sp. D2] gi|315919695|ref|ZP_07915935.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313693570|gb|EFS30405.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 139 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 14/91 (15%), Positives = 31/91 (34%), Gaps = 3/91 (3%) Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 S + +I + + YL + + ++ M + Sbjct: 29 DGSGVGTVSYAQGKFSVIGTLNYLTVIGNNNLRYLYFALSVFNFQPYKTGMA---IPHIY 85 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARI 389 F+D + + PPI EQ + NV++ ++ Sbjct: 86 FKDYGKAKIYFPPITEQKRVANVLDKLENKL 116 >gi|15646014|ref|NP_208195.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori 26695] gi|2314577|gb|AAD08447.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori 26695] Length = 96 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 17/86 (19%), Positives = 29/86 (33%), Gaps = 7/86 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDV----ESGTGKYLPKD---GNSRQSD 74 P +W+ V + ++ G T + + G + E G KY+ K Sbjct: 11 PLNWQKVRLGDIAEIIGGGTPSTQITSFWSGSINWFTPTEIGITKYVYKSQRTITPLGLK 70 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAII 100 S+ + G IL AI+ Sbjct: 71 KSSTKLLPIGTILLTSRASIGDCAIL 96 >gi|293401668|ref|ZP_06645810.1| type I restriction-modification system specificity determinant [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291304926|gb|EFE46173.1| type I restriction-modification system specificity determinant [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 167 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 51/172 (29%), Gaps = 8/172 (4%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + + K + +S N++ E S Q + ++ Sbjct: 1 MKCKLSDICSFHKEKIDVAKLTVNSYVSTENMLPNKEGITKASSLPSVSLTQSFEKDNVL 60 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 I K G + ID +L +++ + A G Sbjct: 61 LSNIRPYFKKIWKAKFSG---GCSNDVLVFKAKEDIDKDFLYYVLSDDNFFAYAMATSKG 117 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + V + I +Q I++V++V ID ++E +Q L Sbjct: 118 TKMPRGDKASIMQYDVPIYDIDKQKKISSVLSV----IDDMIELNKQINNNL 165 Score = 44.4 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 58/159 (36%), Gaps = 9/159 (5%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI---FAKG 84 + + + D+ + + S K+G ++ S +VS+ F K Sbjct: 3 CKLSDICSFHKEKI-----DVAKLTVNSYVSTENMLPNKEGITKASSLPSVSLTQSFEKD 57 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICEG 143 +L + PY +K A F G CS LV + K+ + + ++ + A +G Sbjct: 58 NVLLSNIRPYFKKIWKAKFSGGCSNDVLVFKAKEDIDKDFLYYVLSDDNFFAYAMATSKG 117 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 M D I +PI + +Q I + I+ Sbjct: 118 TKMPRGDKASIMQYDVPIYDIDKQKKISSVLSVIDDMIE 156 >gi|148656809|ref|YP_001277014.1| hypothetical protein RoseRS_2690 [Roseiflexus sp. RS-1] gi|148568919|gb|ABQ91064.1| hypothetical protein RoseRS_2690 [Roseiflexus sp. RS-1] Length = 649 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 58/168 (34%), Gaps = 10/168 (5%) Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK--PESYE 281 +G +P W V + + +L I + I + + P Sbjct: 40 ELGPLPKEWRVVRLGEVAIVGPPRIPRLSRDAIPFIPMALIPEGGHEVSQYELRAPSDVR 99 Query: 282 TYQIVDPGEIVFRFIDLQ--NDKRSLRSAQVMERGIITSAYMAVKPHGIDS-TYLAWLMR 338 + +V G+++ I N K+ + G T+ ++ + +L + + Sbjct: 100 SGVVVLEGDLLLAKITPCLENGKQGIVKRIPNGWGYATTEVFPIRTNEQLKIEFLNYYLL 159 Query: 339 SYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIK---EQFDITNV 381 + + + G+ RQ L V LP+ +PP++ + I N Sbjct: 160 QRSVREALASKMEGTTGRQRLPKAVVIALPIPLPPLERGGIRRQIVNR 207 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 45/118 (38%), Gaps = 8/118 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +G +PK W+VV + + R +D I +I + + G + + + S Sbjct: 41 LGPLPKEWRVVRLGEVAIVGPPRIPRLSRDAIPFIPMALIPEGGHEVSQYELRAPSDVRS 100 Query: 77 TVSIFAKGQILYGKLGPYLRKAI------IADFDGICSTQFLVLQPKDVLPELLQGWL 128 V + +G +L K+ P L I + G +T+ ++ + L + Sbjct: 101 GV-VVLEGDLLLAKITPCLENGKQGIVKRIPNGWGYATTEVFPIRTNEQLKIEFLNYY 157 >gi|328545368|ref|YP_004305477.1| hypothetical protein SL003B_3752 [polymorphum gilvum SL003B-26A1] gi|326415110|gb|ADZ72173.1| hypothetical protein SL003B_3752 [Polymorphum gilvum SL003B-26A1] Length = 196 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/138 (18%), Positives = 43/138 (31%), Gaps = 13/138 (9%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL-RSAQVMERGIITSAYMAVKPHGID 329 + V GE+VFR N ++ S I+ + + Sbjct: 45 DFQRYDLDKLSDRYFVRGGEVVFRSRGEPNAAVAIPASLPEPVVVIVPLVIVRPDRDRVL 104 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 Y+AW + D + A G + ++ L + VP + Q I Sbjct: 105 PEYVAWAINQPDAQRRLGAEAQGTSLRMIPMAVLENLEIAVPDLPTQKRIVE-------- 156 Query: 389 IDVLVEKIEQSIVLLKER 406 +D L Q LL++ Sbjct: 157 LDAL---ARQEGQLLRQL 171 >gi|149002424|ref|ZP_01827358.1| restriction modification system DNA specificity domain [Streptococcus pneumoniae SP14-BS69] gi|147759361|gb|EDK66353.1| restriction modification system DNA specificity domain [Streptococcus pneumoniae SP14-BS69] Length = 181 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 61/188 (32%), Gaps = 17/188 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + ++ +G +S + + I + DVE G + Sbjct: 2 KKVKLGEVCEILSGYAFKSSQFNDNKIGLPLIRIRDVERGFSD------TYFEGTYPEEY 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ D + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKITDKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 T+ H I +I +P EQ LI +K+ I + R E E Sbjct: 115 KTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLN----TISQIYDFRKIQSEKFNELV 170 Query: 200 QALVSYIV 207 ++ + + Sbjct: 171 KSRFNEMF 178 Score = 44.4 bits (103), Expect = 0.039, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 38/137 (27%), Gaps = 6/137 (4%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + +Y ++ G+++ + + ++ +K Sbjct: 42 FSDTYFEGTYPEEYLIKNGDLLITMDGS-----FILKKWEGDLALLNQRVCKIKITDKSV 96 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 K + L +K + ++P EQ I +N + Sbjct: 97 DEGYISWLIPKFLKEIEDKTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLNTISQIY 156 Query: 390 DVLVEKIEQSIVLLKER 406 D + E+ L+K R Sbjct: 157 DFRKIQSEKFNELVKSR 173 >gi|306833560|ref|ZP_07466687.1| type I restriction-modification system specificty subunit [Streptococcus bovis ATCC 700338] gi|304424330|gb|EFM27469.1| type I restriction-modification system specificty subunit [Streptococcus bovis ATCC 700338] Length = 198 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 58/151 (38%), Gaps = 6/151 (3%) Query: 28 VPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 +P+K T+ G+ G +I I L D++ Y+ + + + +G Sbjct: 16 IPLKEITEHFKGKAVSKLGDGGNISVINLSDMDDTGIDYVHLKKIDCDEKSVSRYLLQEG 75 Query: 85 QILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAIC 141 +L G + A+ D I S VL+P + L+ D+ ++ Sbjct: 76 DVLIASKGTVKKIAVFAEQDEPVIASANITVLRPTSDISGGYIRLFLASDLGQALLDETN 135 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 G + + + + I +I +P P+ Q + + Sbjct: 136 TGKNVMNLNTQKIISIEIPKIPVIRQAYLIQ 166 >gi|254779182|ref|YP_003057287.1| Type I R-M system specificity subunit [Helicobacter pylori B38] gi|254001093|emb|CAX29046.1| Type I R-M system specificity subunit [Helicobacter pylori B38] Length = 205 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 61/189 (32%), Gaps = 11/189 (5%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P E + ++ + ++ +T +G E YQ Sbjct: 13 PKGVEFRKLGEVLEYDQPNKYCVTSKEFDKSYPTPVLTAGKTFILGYTNEKDNIYQASKN 72 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG-IDSTYLAWLMRSYDLCKVFY 347 ++ + + + + ++ + + I+ ++ + M++ F Sbjct: 73 APVII----FDDFITATQWVDFPFKVKSSAMKILFSKNPTINIRFIFFYMQTIHANYSFN 128 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE-- 405 G RQ + +L V +PP++ Q +I +++ + L+ I I K+ Sbjct: 129 IGGEHARQWISR--YSQLEVPIPPLEIQQEIVKILDQFSLLTTDLLAGIPAEIKARKKQY 186 Query: 406 --RRSSFIA 412 R + Sbjct: 187 EYYREKLLT 195 >gi|258513093|ref|YP_003189349.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256634996|dbj|BAI00970.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01] gi|256638051|dbj|BAI04018.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-03] gi|256641105|dbj|BAI07065.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-07] gi|256644160|dbj|BAI10113.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-22] gi|256647215|dbj|BAI13161.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-26] gi|256650268|dbj|BAI16207.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-32] gi|256653259|dbj|BAI19191.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-01-42C] gi|256656312|dbj|BAI22237.1| type I DNA specificity S subunit [Acetobacter pasteurianus IFO 3283-12] Length = 114 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 14/97 (14%), Positives = 34/97 (35%), Gaps = 6/97 (6%) Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 S ++ S + +Q+L ++ L+P I +T+ + Sbjct: 18 SYIFLHMLHSK--VNLANKATGSAQQNLSKNLIETFETLIPN----DKILYEFENKTSLL 71 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + K +L + R + ++G+I +R + Sbjct: 72 FDKIIKNFDEPHILAQLRDLLLPKLMSGEISIRDAEK 108 >gi|319775884|ref|YP_004138372.1| HaeIV restriction/modification system [Haemophilus influenzae F3047] gi|317450475|emb|CBY86692.1| HaeIV restriction/modification system [Haemophilus influenzae F3047] Length = 1062 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/130 (15%), Positives = 39/130 (30%), Gaps = 8/130 (6%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I + + I S V+ T + +F Sbjct: 940 NTITISASGANAGFVNFWTEKIFASDCTTVRADNYVGTKFIFTYLQSIQENIFDLARGAA 999 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + +D+KRLP+ P+ Q + E +ID + I + + + Sbjct: 1000 QPHVYPDDIKRLPIPKVPLDIQQKVVE----ECQKIDDEFNRTRMQIEEYRAKFAKIFNE 1055 Query: 414 AVTGQIDLRG 423 +I +RG Sbjct: 1056 L---EI-VRG 1061 >gi|241895012|ref|ZP_04782308.1| restriction modification system DNA specificity domain protein [Weissella paramesenteroides ATCC 33313] gi|241871730|gb|EER75481.1| restriction modification system DNA specificity domain protein [Weissella paramesenteroides ATCC 33313] Length = 158 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 42/151 (27%), Gaps = 9/151 (5%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 + + N + ++ + + V G++V + ++ Sbjct: 1 MPFVQVVDVTNKLTLVDDTKQKISKLAQSKSVFVPKGKVVITLQGSIGRVAITQYDSYVD 60 Query: 313 RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPI 372 R + + + Y G +++ E + V +P Sbjct: 61 RTL----LIFENYVKPTNEYFWAYTLQQKFEIEKRRAPGGTIKTITKEALSIFEVHLPEY 116 Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLL 403 KEQ I +D L+ ++ I L Sbjct: 117 KEQVKIG----TLFQYLDTLITVNQR-ISKL 142 Score = 37.1 bits (84), Expect = 6.2, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 36/133 (27%), Gaps = 1/133 (0%) Query: 49 IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICS 108 ++ + DV + + S KG+++ G R I +D Sbjct: 2 PFVQVVDVTNKLTLVDDTKQKISKLAQSKSVFVPKGKVVITLQGSIGR-VAITQYDSYVD 60 Query: 109 TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQV 168 L+ + + + G T+ + + + +P EQV Sbjct: 61 RTLLIFENYVKPTNEYFWAYTLQQKFEIEKRRAPGGTIKTITKEALSIFEVHLPEYKEQV 120 Query: 169 LIREKIIAETVRI 181 I I Sbjct: 121 KIGTLFQYLDTLI 133 >gi|282850455|ref|ZP_06259834.1| type I restriction modification DNA specificity domain protein [Veillonella parvula ATCC 17745] gi|282579948|gb|EFB85352.1| type I restriction modification DNA specificity domain protein [Veillonella parvula ATCC 17745] Length = 179 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 47/150 (31%), Gaps = 11/150 (7%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVME 312 N + + I + + V G++ F D+ + + ME Sbjct: 34 NFTDVFHNRQIYSSTLKGKVCVNKKELENYKVKEGDLFFTRTSETIDEIGFPAVVMEPME 93 Query: 313 RGIITSAYMAVKPHGIDS---TYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVL 368 R + + + + D + +++ + + S R +K + Sbjct: 94 RVVFSGFVLRGRAEKYDPLANIFKSYIFFTDNFRSEMKKKSSMTTRALTSGTALKEMCFS 153 Query: 369 VP-PIKEQFDITNVINVETARIDVLVEKIE 397 P ++EQ I ++ +D ++ + Sbjct: 154 YPKDLEEQTKIGEILLS----LDKIITLHQ 179 Score = 37.9 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 20/173 (11%), Positives = 42/173 (24%), Gaps = 15/173 (8%) Query: 24 HWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTV 78 W+ + F G E G + DV Y K Sbjct: 3 SWEQRKLGDFYTFKNGLNKEKVYFGYGDSIVNFTDVFHNRQIYSSTLKGKVCVNKKELEN 62 Query: 79 SIFAKGQILYGKLGPYLRKAII------ADFDGICSTQFLVLQPKDVL---PELLQGWLL 129 +G + + + + + + S L + + Sbjct: 63 YKVKEGDLFFTRTSETIDEIGFPAVVMEPMERVVFSGFVLRGRAEKYDPLANIFKSYIFF 122 Query: 130 SIDVTQRIEAICEGATMSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRI 181 + + ++ T + + + P L EQ I E +++ I Sbjct: 123 TDNFRSEMKKKSSMTTRALTSGTALKEMCFSYPKDLEEQTKIGEILLSLDKII 175 >gi|311033108|ref|ZP_07711198.1| type I restriction enzyme, specificity subunit [Bacillus sp. m3-13] Length = 192 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 54/170 (31%), Gaps = 16/170 (9%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI---VDPGEIVFRFIDLQNDKRSL 305 + I + + ++ ES YQ ++ G++V + Sbjct: 23 KQFGTQVINYYDQPSFEADYNHEGVEVEGESNSIYQHNLSLNEGDVVIS--SSLQLATMV 80 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD---LCKVFYAMGSGLRQSLKFEDV 362 V + + + +D Y +L +Y K GSG + + Sbjct: 81 GKNNVGKVLSLNFTKIEFDCEQLDKRYFLYLFNAYKDVKRQKERELQGSGPVLRIPLRAL 140 Query: 363 KRLPVLVPPIKEQFDITNV------INVETARIDVLVEKIEQSI--VLLK 404 + V PI+EQ I ++ + + + L+E SI LK Sbjct: 141 GEIIFPVAPIEEQKKIGDIYVETLKLQNKLNKYADLIEVFTSSIIEENLK 190 >gi|261837979|gb|ACX97745.1| specificity subunit S of type I restriction-modification system [Helicobacter pylori 51] Length = 204 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 19/188 (10%), Positives = 57/188 (30%), Gaps = 13/188 (6%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P E + ++ + ++ +T +G E YQ Sbjct: 13 PKGVEFRKLGEVLEYDQPNKYCVTSKEFDKSYPTPVLTAGKTFILGYTNEKDNIYQASKS 72 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 ++ + + + + ++ + + + + + + Sbjct: 73 SPVII----FDDFTTATQWVDFPFKVKSSAMKILLPKNPTINIRFIFFYMQTIPYNI--- 125 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE--- 405 G RQ + ++ + +PP++ Q +I +++ + L+ I I K+ Sbjct: 126 SGEHTRQWISR--YSKITIPIPPLEIQQEIVKILDQFSILTTDLLAGIPAEIEARKKQYE 183 Query: 406 -RRSSFIA 412 R ++ Sbjct: 184 YYREKLLS 191 >gi|259500491|ref|ZP_05743393.1| type I restriction-modification system specificity protein [Lactobacillus iners DSM 13335] gi|259168106|gb|EEW52601.1| type I restriction-modification system specificity protein [Lactobacillus iners DSM 13335] Length = 215 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 7/192 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKD--IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 WK +K KL ++ ++G++ + Y+ ++ + T + D S++ Sbjct: 23 SDWKKGKLKDILKLKR-QSIKTGENTTLPYLPIDVIPMRT--FALTDFKPNAEAQSSLIT 79 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 F K I+ G + Y + ++A DGI T L P E L LL D I+ Sbjct: 80 FDKDDIIIGAMRVYFHRVVLAPCDGITRTTCFTLAP--YNNEYLSFALLCCDQESSIDYA 137 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + S + + + I +K + + I L+E + Sbjct: 138 QSTSKGSTMPYAIWEGGLGDMEIIIPTPEIAKKFNEIVLPMLRQIQNSYFENNRLREIRN 197 Query: 201 ALVSYIVTKGLN 212 AL+ +++ ++ Sbjct: 198 ALLPRLMSDEVD 209 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 14/153 (9%), Positives = 50/153 (32%), Gaps = 7/153 (4%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 + E+ + D +I+ + + + L + R + +A + Sbjct: 64 LTDFKPNAEAQSSLITFDKDDIIIGAMRVYFHRVVLAPCDGITRTTCFT--LAPYNNEYL 121 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 S L + + ++ + + +++P + ++ + Sbjct: 122 SFALLCCDQESSIDYAQSTSKGSTMPYAIWEGGLGDMEIIIPTPEIAKKFNEIVLPMLRQ 181 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I + + L+E R++ + ++ ++D+ Sbjct: 182 IQNSYFENNR----LREIRNALLPRLMSDEVDV 210 >gi|168575545|ref|ZP_02721481.1| type I restriction enzyme EcoBI specificity protein (S protein)(S.EcoBI) [Streptococcus pneumoniae MLV-016] gi|298229448|ref|ZP_06963129.1| type I restriction-modification system subunit S [Streptococcus pneumoniae str. Canada MDR_19F] gi|307067539|ref|YP_003876505.1| restriction endonuclease S subunit [Streptococcus pneumoniae AP200] gi|183578543|gb|EDT99071.1| type I restriction enzyme EcoBI specificity protein (S protein)(S.EcoBI) [Streptococcus pneumoniae MLV-016] gi|306409076|gb|ADM84503.1| Restriction endonuclease S subunit [Streptococcus pneumoniae AP200] Length = 180 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 52/158 (32%), Gaps = 13/158 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 K V + ++ +G +S + + I + DVE G + Sbjct: 2 KKVKLGEVCEILSGYAFKSSQFNDNKIGLPLIRIRDVERGFSD------TYFEGTYPEEY 55 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + G +L G ++ K + + + ++ D + L + IE Sbjct: 56 LIKNGDLLITMDGSFILK-KWEGDLALLNQRVCKIKITDKSVDEGYISWLIPKFLKEIED 114 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 T+ H I +I +P EQ LI +K+ Sbjct: 115 KTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLNTI 152 Score = 44.8 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 38/137 (27%), Gaps = 6/137 (4%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + +Y ++ G+++ + + ++ +K Sbjct: 42 FSDTYFEGTYPEEYLIKNGDLLITMDGS-----FILKKWEGDLALLNQRVCKIKITDKSV 96 Query: 331 TYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 K + L +K + ++P EQ I +N + Sbjct: 97 DEGYISWLIPKFLKEIEDKTPFVTVKHLSVAKIKDISFVLPNKLEQKLIAKKLNTISQIY 156 Query: 390 DVLVEKIEQSIVLLKER 406 D + E+ L+K R Sbjct: 157 DFRKIQSEKFNELVKSR 173 >gi|328947972|ref|YP_004365309.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328448296|gb|AEB14012.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 337 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 46/385 (11%), Positives = 97/385 (25%), Gaps = 59/385 (15%) Query: 42 SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ----------ILYGKL 91 + + + ++ YL S+ I++ Sbjct: 2 KSYNEILSDVTKTAIKIPQSDYLDAGKYRIFDQGKEYSVGFSNDEQGVVTDYPYIIF--- 58 Query: 92 GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADW 151 G + R D V K + + ++ + + IE+ Sbjct: 59 GDHTRVVKYVDEPCYIGAD-GVKLLKVINKDFDPRYVYYNILAKPIESQGYARHFKFLK- 116 Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGL 211 I +EQ I ++ ID + E A+ S V Sbjct: 117 ----EIQFTEKSFSEQQKIAAELDKIQSAIDNKKQQLSLLDE-------AVKSEFVEMFG 165 Query: 212 NPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR 271 NP K+ + V V + K + IL Sbjct: 166 NPIYNSKNFPTKKVIDVVTMQRGYDLPVQDRDSKGKIPVFGSNGIL-------------- 211 Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + + + + + + + + + HG + Sbjct: 212 ---------GNHNLAKMDKGIITGRSGTIGEVYMCETPFWP---LNTTLFSNDTHGNNIC 259 Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 YL +L+ +DL + +L + ++ P+ Q + +ID Sbjct: 260 YLKFLLEFFDLKRF---KSGVGVPTLNRNEFHDEQIIDVPLDLQNQFAAFV----QKIDK 312 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVT 416 ++Q I L+E S + + Sbjct: 313 SKFVVKQQITDLQELLDSKMQEYFS 337 >gi|293363454|ref|ZP_06610211.1| conserved hypothetical protein [Mycoplasma alligatoris A21JP2] gi|292552974|gb|EFF41727.1| conserved hypothetical protein [Mycoplasma alligatoris A21JP2] Length = 102 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 43/90 (47%), Gaps = 5/90 (5%) Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 I+ YL + ++++ K+F + + L+ E++K + +P ++ Q I +++ Sbjct: 9 FINKKYLYYYLKNFQ-DKLFSLANDAIPKHLELEELKNFTINLPSLQIQNKIVEILDDFE 67 Query: 387 ARIDVLVEKIEQSIVLLKE----RRSSFIA 412 I+ + E + I L ++ R+ ++ Sbjct: 68 KYINDISEGLPLEIELRQKQYEYYRNKLLS 97 >gi|298384312|ref|ZP_06993872.1| type I restriction-modification system specificity determinant [Bacteroides sp. 1_1_14] gi|298262591|gb|EFI05455.1| type I restriction-modification system specificity determinant [Bacteroides sp. 1_1_14] Length = 183 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 57/183 (31%), Gaps = 14/183 (7%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILS--LSYGNIIQKLETRNMGLKPESYETYQI 285 +PD W L +N E + + N + + ++ E Sbjct: 1 MPDGWCAVALKDLCENINGLWKGKKEPFVNVGVIRNANFTKDFKLDYSNIEYIDVEQRTF 60 Query: 286 ----VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA------YMAVKPHGIDSTYLAW 335 ++ G+++ ++ R+ + + S + S YL + Sbjct: 61 AKRHLENGDLIVEKSGGSDNNPVGRTILYEGKSGVFSFSNFTMVLRIRYNDTVLSKYLYY 120 Query: 336 LMRSYDLC--KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 + + + +L + + +PP+ EQ I + I +++++ Sbjct: 121 CILAKYQTGAMRLMQTQTTGLHNLILNKFLLMSICLPPLYEQRRIIDQIETFFTTLNLIM 180 Query: 394 EKI 396 E + Sbjct: 181 ESL 183 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 52/178 (29%), Gaps = 19/178 (10%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTG------KYLPKDGNSRQSDT 75 P W V +K + G GK ++ + + + Y + + T Sbjct: 2 PDGWCAVALKDLCENINGL--WKGKKEPFVNVGVIRNANFTKDFKLDYSNIEYIDVEQRT 59 Query: 76 STVSIFAKGQILYGKLG-----PYLRKAIIADFDGICSTQFLVLQPKDVLP----ELLQG 126 G ++ K G P R + G+ S + + Sbjct: 60 FAKRHLENGDLIVEKSGGSDNNPVGRTILYEGKSGVFSFSNFTMVLRIRYNDTVLSKYLY 119 Query: 127 WLLSIDVTQRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + T + + + + +PPL EQ I ++I ++ Sbjct: 120 YCILAKYQTGAMRLMQTQTTGLHNLILNKFLLMSICLPPLYEQRRIIDQIETFFTTLN 177 >gi|303242502|ref|ZP_07328982.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302589970|gb|EFL59738.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 216 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 53/154 (34%), Gaps = 10/154 (6%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVK 324 + + E + + G+++ R L S+ + E +I S A + + Sbjct: 64 NINELDCFESNEELDEKYLTQQGDVIVR---LSYPNTSIAINENNEGLLIPSLFAIIRLS 120 Query: 325 PHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + YL+ + S + + F ++ Q + +K + V P I++Q I Sbjct: 121 DVILLPDYLSIYLNSDLMKEFFGRSVIGSAIQIINNSLLKEIVVKFPKIEKQKKIIEFNK 180 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 E + I + + I +TG Sbjct: 181 FMLRE----KELMTSLIDEKTKYNKAIIGKLITG 210 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 67/200 (33%), Gaps = 17/200 (8%) Query: 26 KVVPIKRFTKLNTGRTSESG---------KDIIYIGLEDVE-SGTGKYLPKDGNSRQSDT 75 + + K+NTG + KD + L+ E G D + Sbjct: 18 ETKKLGDIAKINTGLVVKRKQAALRENVFKDYKMLTLKSFEQDGWLNINELDCFESNEEL 77 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICST---QFLVLQPKDVLPELLQGWLLSID 132 + +G ++ P AI + +G+ + L +LP+ L +L S Sbjct: 78 DEKYLTQQGDVIVRLSYPNTSIAINENNEGLLIPSLFAIIRLSDVILLPDYLSIYLNSDL 137 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + G+ + + + I + P + +Q I E + I Sbjct: 138 MKEFFGRSVIGSAIQIINNSLLKEIVVKFPKIEKQKKIIEF----NKFMLREKELMTSLI 193 Query: 193 ELLKEKKQALVSYIVTKGLN 212 + + +A++ ++T G N Sbjct: 194 DEKTKYNKAIIGKLITGGSN 213 >gi|229195089|ref|ZP_04321864.1| Type I restriction-modification system, M subunit [Bacillus cereus m1293] gi|228588318|gb|EEK46361.1| Type I restriction-modification system, M subunit [Bacillus cereus m1293] Length = 616 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 53/154 (34%), Gaps = 7/154 (4%) Query: 26 KVVPIKRFTKLNTGRTSESGKD---IIYIGLEDVESGTGKYLPKDGNSRQS-DTSTVSIF 81 K V + +L G +S I I D++ G + S + Sbjct: 425 KTVELGEIAELTNGINIKSSDGQHAIQIIKASDIQGGKISVAELESVSVADLSVIQKAKV 484 Query: 82 AKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIE 138 G I+ G ++ A++ G S ++++PK+ + L ++E Sbjct: 485 QAGDIVLLSRGTSIKFAVVPKGIGNAYASMNLMIIRPKEGVDPYFIQTFLESPFGIWQME 544 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 I G T+ + +I +P Q+ + + Sbjct: 545 QIQTGTTIQLIKLGDMKSIRVPSLTQEVQIQVGK 578 >gi|22537161|ref|NP_688012.1| hypothetical protein SAG1001 [Streptococcus agalactiae 2603V/R] gi|25011090|ref|NP_735485.1| hypothetical protein gbs1036 [Streptococcus agalactiae NEM316] gi|76787066|ref|YP_329717.1| hypothetical protein SAK_1096 [Streptococcus agalactiae A909] gi|77407222|ref|ZP_00784189.1| conserved hypothetical protein [Streptococcus agalactiae H36B] gi|77411992|ref|ZP_00788321.1| conserved hypothetical protein [Streptococcus agalactiae CJB111] gi|77414758|ref|ZP_00790884.1| conserved hypothetical protein [Streptococcus agalactiae 515] gi|22534024|gb|AAM99884.1|AE014237_18 conserved hypothetical protein [Streptococcus agalactiae 2603V/R] gi|23095489|emb|CAD46695.1| Unknown [Streptococcus agalactiae NEM316] gi|76562123|gb|ABA44707.1| conserved hypothetical protein [Streptococcus agalactiae A909] gi|77159188|gb|EAO70373.1| conserved hypothetical protein [Streptococcus agalactiae 515] gi|77161948|gb|EAO72930.1| conserved hypothetical protein [Streptococcus agalactiae CJB111] gi|77174170|gb|EAO77072.1| conserved hypothetical protein [Streptococcus agalactiae H36B] Length = 196 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 61/182 (33%), Gaps = 10/182 (5%) Query: 30 IKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + G+ S DI I L D+ Y + + + +G + Sbjct: 16 LSELVDCFKGKAVPSKAEAGDIRIINLSDMSPLGIDYHNLRTFQDEQRSLLKYLLQEGDV 75 Query: 87 LYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQG-WLLSIDVTQRIEAICEG 143 L G + AI D+ + S +L+P + + S + Q +E +G Sbjct: 76 LIASKGTVKKVAIFEEQDYPVVASANITILRPTQHIRGYYLKLFFDSEEGQQALENANKG 135 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + K + NI +P PL Q + +I + +I E E+ Q + Sbjct: 136 KAVMNISTKELLNIAIPSIPLFRQ----DYLIQRYKQGLNDYKRKIARAEQEWERIQNDI 191 Query: 204 SY 205 Sbjct: 192 RQ 193 >gi|329919956|ref|ZP_08276848.1| hypothetical protein HMPREF9210_0147 [Lactobacillus iners SPIN 1401G] gi|328936805|gb|EGG33243.1| hypothetical protein HMPREF9210_0147 [Lactobacillus iners SPIN 1401G] Length = 219 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 55/192 (28%), Gaps = 9/192 (4%) Query: 20 AIPKHWKVVPIKRFTKLN-TGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 IP +W V + + + D KY + + S Sbjct: 20 KIPANWVVSKLGDIASIKTNSFSPVKNPDAQLEHYSIPAYDEQKYPVFESA--EGVKSNK 77 Query: 79 SIFAKGQILYGKLGPYLRKAI---IADFDGICSTQFLVLQ-PKDVLPELLQGWLLSIDVT 134 I +K ++ KL P ++A + ST+F++ + + + + S + Sbjct: 78 YILSKNSVMISKLNPDTKRAWRPMCLSDLAVSSTEFIIFEAFNPAYKDFVFSIIDSAAFS 137 Query: 135 QRIEAICEGAT--MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + G+T + +P + I I E + Sbjct: 138 DWMCTHTTGSTNSRQRTTPSATLEFQIALPDEKTITDFCAIVTPMYDTISANICENQKLA 197 Query: 193 ELLKEKKQALVS 204 +L L+S Sbjct: 198 QLRDSILPKLMS 209 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 25/199 (12%), Positives = 66/199 (33%), Gaps = 8/199 (4%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIV 286 +P +W V + + + + + Y + + E ++ + + Sbjct: 20 KIPANWVVSKLGDIASIKTNSFSPVKNPDAQLEHYSIPAYDEQKYPVFESAEGVKSNKYI 79 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK-PHGIDSTYLAWLMRSYDLCKV 345 V + KR+ R + + + ++ ++ + + ++ ++ S Sbjct: 80 LSKNSVMISKLNPDTKRAWRPMCLSDLAVSSTEFIIFEAFNPAYKDFVFSIIDSAAFSDW 139 Query: 346 FYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 +G RQ + +P E IT+ + T D + I Sbjct: 140 MCTHTTGSTNSRQRTTPSATLEFQIALP--DE-KTITDFCAIVTPMYDTISANIC-ENQK 195 Query: 403 LKERRSSFIAAAVTGQIDL 421 L + R S + ++G++D+ Sbjct: 196 LAQLRDSILPKLMSGELDV 214 >gi|253681482|ref|ZP_04862279.1| conserved hypothetical protein [Clostridium botulinum D str. 1873] gi|253561194|gb|EES90646.1| conserved hypothetical protein [Clostridium botulinum D str. 1873] Length = 193 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 50/136 (36%), Gaps = 11/136 (8%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD---L 342 ++ G++V + + + + I + + +D+ Y ++ Y Sbjct: 63 LNEGDVVIN--NSLQLATMVGKNNIGKVLSINFTKVEINNKQLDNRYFLFMFNVYKDVKR 120 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN-VINVETARIDVLVEKIEQSIV 401 K G+G + + + + V P++EQ I I L K+ + Sbjct: 121 QKERELQGTGPVLRIPLRSLGEITIPVVPLEEQKKIGKIYIETM-----KLQSKLNKYSD 175 Query: 402 LLKERRSSFIAAAVTG 417 L+++ +S I A+ G Sbjct: 176 LIEQFTNSIIEEALKG 191 >gi|52079175|ref|YP_077966.1| Type I restriction modification system protein HsdIA [Bacillus licheniformis ATCC 14580] gi|52784542|ref|YP_090371.1| hypothetical protein BLi00743 [Bacillus licheniformis ATCC 14580] gi|52002386|gb|AAU22328.1| Type I Restriction Modification system protein HsdIA [Bacillus licheniformis ATCC 14580] gi|52347044|gb|AAU39678.1| putative protein [Bacillus licheniformis ATCC 14580] Length = 189 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 52/136 (38%), Gaps = 11/136 (8%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLM 337 +++ + + G++VF F+ K + S + I + + H D +YL + + Sbjct: 55 NHKESYLSNAGDVVFSFVSS---KAGIVSNLNRGKIINQNFAKLMIEHDELDRSYLCYAL 111 Query: 338 R-SYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 SY + + +M L +K L + +P I++Q I R + K Sbjct: 112 NESYAMKRQMAISMQGSAVPKLTPAILKELEIKLPSIEKQRIIGKAYFCLRKR--QALAK 169 Query: 396 IEQSIVL---LKERRS 408 + + L+ + Sbjct: 170 KQAELEEKLFLEVLKQ 185 >gi|240112516|ref|ZP_04727006.1| Type I restriction enzyme EcoprrI specificity protein [Neisseria gonorrhoeae MS11] Length = 200 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 11/124 (8%), Positives = 42/124 (33%), Gaps = 6/124 (4%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLC 343 V +I + + + + + + I Y+ + +++ + Sbjct: 68 VPDKDIHREPSIIVKSRGIIEFEYYDKPFSHKNEMWSYHSVNKHIYIKYVYYFLKTQE-- 125 Query: 344 KVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 F +GS ++ + D + +P ++ Q I +++ ++ + ++ Sbjct: 126 NYFRNIGSKMQMPQIATPDTDNYKIPIPSLETQQKIVKILDK-FTELEAELALRKRQYRY 184 Query: 403 LKER 406 ++ Sbjct: 185 YRDL 188 >gi|239502429|ref|ZP_04661739.1| putative restriction-modification protein [Acinetobacter baumannii AB900] Length = 778 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 88/239 (36%), Gaps = 15/239 (6%) Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + + +ID + + F +L K + + +NP++ + I + Sbjct: 507 LDSFRRKIDENDLKNLDFADLNKSDFDKYYNELGFLKVNPELIRSNDYIYNYAHYSNSHI 566 Query: 234 VKPFF----ALVTELNRKNTKLIESNILSLSY---GNIIQKLETRNMGLKPESYETYQIV 286 F + L+ K ++NI +S +I + E + Y+ V Sbjct: 567 KSKFPTIKLKELLSLSGKVKVGEDTNIPIMSITMEHGLIDQHEKFKKRVASSDISGYKKV 626 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKV 345 E+V D+ L + + ++ AY + ++ YL ++RS L K+ Sbjct: 627 FKNELVM---GFPIDEGVLGFQKYYDAAAVSPAYKIFRLKREVNVEYLDLILRSNSLRKI 683 Query: 346 FYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + + G R+S+ E + + PP + + I + I+ +++ ++ I Sbjct: 684 YKSKMQGSVERRRSIPDEMFLNIEIPNPPEEVKDQIVKQ-HKLIKEIENSLKENQKKIA 741 >gi|109947458|ref|YP_664686.1| hypothetical protein Hac_0910 [Helicobacter acinonychis str. Sheeba] gi|109714679|emb|CAJ99687.1| conserved hypothetical protein fragment 3 [Helicobacter acinonychis str. Sheeba] Length = 162 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 42/134 (31%), Gaps = 6/134 (4%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 E + + + I+ ++ + + + S+ K + Sbjct: 4 EQQKADINYKDISKKDIIHCESVIIKSRGNIGFEYYDQPFSHKNEIWSYSS----KTNQT 59 Query: 329 DSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 +L + + + K+ + L D + +PP++ Q +I +++ + Sbjct: 60 LVKFLYYYLSNNQHYFQKLVQSSSVKNPPQLSVSDTDEHEMPIPPLEIQQEIVKILDQFS 119 Query: 387 ARIDVLVEKIEQSI 400 A L I I Sbjct: 120 ALTTDLQSGILAEI 133 >gi|302348051|ref|YP_003815689.1| Site specific DNA-methyltransferase [Acidilobus saccharovorans 345-15] gi|302328463|gb|ADL18658.1| Site specific DNA-methyltransferase [Acidilobus saccharovorans 345-15] Length = 471 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 53/148 (35%), Gaps = 11/148 (7%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVK 324 + + + + + + V+ +IVF + + + R + G+ + V Sbjct: 321 RRDEKFIEPGSDMDKRRGHVEVDDIVFVRVGVGSAGRCAVIVDESDLGVADDWIYIIKVD 380 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQF------- 376 I YLA +++ + ++ G+ ++ +++++ V VP + Q Sbjct: 381 KRRILPHYLAMFLQTELGQRQLESLKRGVGTVTIPISELRKVKVPVPSMDFQEWVRSEYL 440 Query: 377 DITNVINVETA-RIDVLVEKIEQSIVLL 403 + + + + I+ I L Sbjct: 441 RMVKFLREGNKREAEKVFNVIKGKIEEL 468 >gi|302345832|ref|YP_003814185.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] gi|302148949|gb|ADK95211.1| type I restriction modification DNA specificity domain protein [Prevotella melaninogenica ATCC 25845] Length = 187 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 50/176 (28%), Gaps = 9/176 (5%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 EW N K + +I + Sbjct: 3 EWKEYKISDVCKIRHGFAFKGAYFTNEKQPYICVTP-GNFDIKGGFKLSKPKYYHGPIPN 61 Query: 283 YQIVDPGEIVFRFIDLQNDK-----RSLRSAQVMERGIITSAYMAVKPHGID--STYLAW 335 I++ +++ DL D ++ + V+ + +L W Sbjct: 62 DYILNKDDLIVTMTDLSKDGDTLGYSAIIPQIRDITFLHNQRIGLVESIASNISKHFLYW 121 Query: 336 LMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 +MR+ + K SG + + + P + Q +I +++N A+I+ Sbjct: 122 VMRTPEYQKYIVNCCSGSTVKHTSPKLIGTYVFKAPDPETQEEIASLLNNLDAKIE 177 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 52/176 (29%), Gaps = 16/176 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESG-TGKYLPKDGNSRQSDTST 77 + WK I K+ G + + + YI + G + Sbjct: 2 EEWKEYKISDVCKIRHGFAFKGAYFTNEKQPYICVTPGNFDIKGGFKLSKPKYYHGPIPN 61 Query: 78 VSIFAKGQILY-----GKLGPYLRKAI----IADFDGICSTQFLVLQPKDVLPELLQGWL 128 I K ++ K G L + I D + + + +++ + Sbjct: 62 DYILNKDDLIVTMTDLSKDGDTLGYSAIIPQIRDITFLHNQRIGLVESIASNISKHFLYW 121 Query: 129 LS--IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + I C G+T+ H K IG P Q I + +I+ Sbjct: 122 VMRTPEYQKYIVNCCSGSTVKHTSPKLIGTYVFKAPDPETQEEIASLLNNLDAKIE 177 >gi|332655468|ref|ZP_08421205.1| type I restriction-modification system specificity subunit [Ruminococcaceae bacterium D16] gi|332515603|gb|EGJ45216.1| type I restriction-modification system specificity subunit [Ruminococcaceae bacterium D16] Length = 234 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 14/140 (10%), Positives = 36/140 (25%), Gaps = 13/140 (9%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E ++ G++V + + + + Y Sbjct: 96 ISEGNHEKYVLSEGDVVVARTGATVGYAKMVGRNIPDSVFASFLVRIRPIDDEYRYYFGL 155 Query: 336 LMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP---PIKEQF-DITNVINVETARID 390 + S + G + + + +P + E I++ + Sbjct: 156 AITSSEFLDFVQTNAGGSAQPQANPPLLGEFELSIPNKQSLPEFNTKISSFLG------- 208 Query: 391 VLVEKIEQSIVLLKERRSSF 410 ++E E I L E + + Sbjct: 209 -VIESNETEISKLHEVKDTM 227 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 58/184 (31%), Gaps = 7/184 (3%) Query: 29 PIKRFTKLNTGRTSESGKDI---IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 +K ++ + G T + + ++ + D+ + + ++G Sbjct: 51 KLKDYSVMQYGYTETATTEPVGPKFLRITDIAQNYIDWNGVPYCPISEGNHEKYVLSEGD 110 Query: 86 ILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 ++ + G + A + + S + D + S + ++ Sbjct: 111 VVVARTGATVGYAKMVGRNIPDSVFASFLVRIRPIDDEYRYYFGLAITSSEFLDFVQTNA 170 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ A+ +G + IP KI + I++ TE + E+ + Sbjct: 171 GGSAQPQANPPLLGEFELSIPNKQSLPEFNTKISSFLGVIESNETEISKLHEVKDTMVKM 230 Query: 202 LVSY 205 L S Sbjct: 231 LSSR 234 >gi|281424442|ref|ZP_06255355.1| putative type I restriction-modification system, S subunit [Prevotella oris F0302] gi|281401441|gb|EFB32272.1| putative type I restriction-modification system, S subunit [Prevotella oris F0302] Length = 186 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 40/135 (29%), Gaps = 6/135 (4%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + +SY T + + D +N S + + S + Sbjct: 52 DYIVSSTNYDDSYLTPVLTAGKSFIIGNTDEKNGIYSKLPCIIFDDFTTASKLVNFPFKV 111 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLR------QSLKFEDVKRLPVLVPPIKEQFDITNV 381 S + K A S + + + +L + +PP +EQ I Sbjct: 112 KSSAMKILQVNQNISIKYVAAFMSITQLIGDTHKRYWISEYSKLSISIPPKEEQERIVVA 171 Query: 382 INVETARIDVLVEKI 396 I+ +D + E + Sbjct: 172 IDNLFNTLDAVKENL 186 >gi|332202397|gb|EGJ16466.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA41317] Length = 181 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 14/119 (11%), Positives = 36/119 (30%), Gaps = 7/119 (5%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 61 SEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 120 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + S K + S+ + L + + P +EQ IT + +++ L Sbjct: 121 LSSPLFYKQLKAITKLSGQALYSIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQL 179 Score = 41.7 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 70/179 (39%), Gaps = 17/179 (9%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +W V+ IK +NTG + + K + I +++ L D S+ Sbjct: 2 NWVVIKIKDIFSINTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLV----LQPKDVLPELLQGWL 128 ++ K L + L D+DG+ + F+ + +++ + L L Sbjct: 62 EQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFNL 121 Query: 129 LSIDVTQRIEAICE--GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLI 185 S ++++AI + G + + + +P+ P EQ LI +K+ +++ L Sbjct: 122 SSPLFYKQLKAITKLSGQALYSIPKTTLSELLIPLAPFEEQELITQKVEKLFEKVNQLW 180 >gi|257880782|ref|ZP_05660435.1| type IC HsdS subunit [Enterococcus faecium 1,230,933] gi|257815010|gb|EEV43768.1| type IC HsdS subunit [Enterococcus faecium 1,230,933] Length = 182 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 49/123 (39%), Gaps = 7/123 (5%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS-AQVMERGIITSAY 320 + + E + + + + Y ++ GE+ + + + K + E ++ Y Sbjct: 53 NGWLDQRERFSGNIAGKEQKNYTLLRKGELSYNKGNSKLAKYGVVFMLDNFEEALVPRVY 112 Query: 321 MAVKP-HGIDSTYLAWLMRSYDLCKVFYA-MGSGLRQ----SLKFEDVKRLPVLVPPIKE 374 + K + S Y+ +L + K + SG R ++ ++D + + +P IKE Sbjct: 113 HSFKTTNEASSKYIEYLFETKKPNKELRKLITSGARMDGLLNINYDDFMGIKITIPKIKE 172 Query: 375 QFD 377 Q Sbjct: 173 QKK 175 >gi|304436274|ref|ZP_07396257.1| type I restriction modification DNA specificity family protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304370731|gb|EFM24373.1| type I restriction modification DNA specificity family protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 175 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 52/169 (30%), Gaps = 9/169 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-----YQ 284 + W + ++ + + I K + E + Sbjct: 2 NSWNCIRLGDVCCVNTEAYSEKERWDYVHYLDTGNITKNCIDEIQYIDLMKEKLPSRTRR 61 Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-YMAVKPHGIDSTYLAWLMRSYDLC 343 V I++ + + Q + T + V +D+ +L + + Sbjct: 62 KVKYNSILYSTVRPNQCHYGIVKEQSSNFLVSTGFSVIDVIDERVDADFLYCYLTMNTVT 121 Query: 344 KVFYAMG---SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + +A+ + ++K D++ L + +P I Q I + + +I Sbjct: 122 EKMHAIAEQSTSAYPAIKSSDIEDLELKLPDILTQKRIASFLMSLEHKI 170 Score = 43.2 bits (100), Expect = 0.084, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 58/172 (33%), Gaps = 10/172 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESG--KDIIYIGLEDVESGTGKYL-PKDGNSRQSDTSTVS 79 W + + +NT SE + Y+ ++ + D + + T Sbjct: 2 NSWNCIRLGDVCCVNTEAYSEKERWDYVHYLDTGNITKNCIDEIQYIDLMKEKLPSRTRR 61 Query: 80 IFAKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQP--KDVLPELLQGWLLSIDVT 134 ILY + P I + ST F V+ + V + L +L VT Sbjct: 62 KVKYNSILYSTVRPNQCHYGIVKEQSSNFLVSTGFSVIDVIDERVDADFLYCYLTMNTVT 121 Query: 135 QRIEAI--CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +++ AI + I ++ + +P + Q I +++ +I Sbjct: 122 EKMHAIAEQSTSAYPAIKSSDIEDLELKLPDILTQKRIASFLMSLEHKITNN 173 >gi|295426572|ref|ZP_06819221.1| type I restriction enzyme specificity protein [Lactobacillus amylolyticus DSM 11664] gi|295063751|gb|EFG54710.1| type I restriction enzyme specificity protein [Lactobacillus amylolyticus DSM 11664] Length = 56 Score = 49.0 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 10/53 (18%), Positives = 24/53 (45%), Gaps = 4/53 (7%) Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + + +L + EQ I + ++D + ++ + LLKE++ F+ Sbjct: 4 KVISKLNFFITDYSEQEKIASF----FKQLDDTIALHQRKLDLLKEQKKGFLQ 52 >gi|332800244|ref|YP_004461743.1| restriction modification system DNA specificity domain-containing protein [Tepidanaerobacter sp. Re1] gi|332697979|gb|AEE92436.1| restriction modification system DNA specificity domain protein [Tepidanaerobacter sp. Re1] Length = 471 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 51/392 (13%), Positives = 110/392 (28%), Gaps = 31/392 (7%) Query: 44 SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-TVSIFAKGQILYGKLGPYLRKAIIAD 102 +I + ++V + + S S IL G Y R Sbjct: 74 KSGTVIALTSQNVMENQINFDNIIKIPFEIHNSLERSKIYPNDILLSYTGQYRRACTAPQ 133 Query: 103 FDGIC--STQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMP 160 + + K + + +L ++ + + I +I +P Sbjct: 134 NIELHLGPNICRLRSTKLIDVHYVSTFLNCRYGQSSLDREKTMSAQPTVNMGRIRDILLP 193 Query: 161 IPPLAEQVLIREKIIAETVRIDT-LITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKD 219 IPP Q I +K+ + ++ L E + + V M Sbjct: 194 IPPPEIQRYIGDKVRKAEELREEAKRLKKEAEEILNTELNLSYFNERVKYAPKMYNWMCG 253 Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNR----------------KNTKLIESNILSLSYGN 263 IE + F T L + +I + + Sbjct: 254 ELIEARIDSQYYINETNFINAEMNKKGLKLKKISEVASVGKGFSYTSLDKKSIPYIRISD 313 Query: 264 IIQKLETRN----MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 + L + + K S + ++ +++F K SL + ++S Sbjct: 314 LDDLLINFDSVEMVDKKTYSEKKSSQLEQYDLIFAITGATIGKVSLFYNNKCSKATLSSD 373 Query: 320 -YMAVKPHGIDSTYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 D+ Y+ ++S + + + L E + + + V K + + Sbjct: 374 TAFVRLKDKNDAAYVLLYLKSIIGQISILKGITGATNRHLSLEHIGDIFIPVIDNKLKRE 433 Query: 378 I----TNVINVETARIDVLVEKIEQSIVLLKE 405 I I+ L+++ +Q + L E Sbjct: 434 INIIVIKAIDNMFLS-KQLIKEAKQDVEDLIE 464 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 17/147 (11%), Positives = 51/147 (34%), Gaps = 7/147 (4%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N I + + + + P +I+ + +R+ + Q +E + + Sbjct: 87 MENQINFDNIIKIPFEIHNSLERSKIYPNDILLSYTG--QYRRACTAPQNIELHLGPNIC 144 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ID Y++ + + + ++ ++ + + +PP + Q I Sbjct: 145 RLRSTKLIDVHYVSTFLNCRYGQSSLDREKTMSAQPTVNMGRIRDILLPIPPPEIQRYIG 204 Query: 380 NVINVETARIDVLVEKIEQSIVLLKER 406 + + + + L E+ ++ +E Sbjct: 205 DKV----RKAEELREEAKRLKKEAEEI 227 >gi|301048306|ref|ZP_07195338.1| conserved domain protein [Escherichia coli MS 185-1] gi|300299840|gb|EFJ56225.1| conserved domain protein [Escherichia coli MS 185-1] Length = 178 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 17/159 (10%), Positives = 46/159 (28%), Gaps = 8/159 (5%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFR----FIDLQNDKRSLRSAQVMERGIIT 317 + + ++ S I+ +IV I+ + + + + Sbjct: 20 HGVTNWKDVVHIPNDMISDFENYILSENDIVISLDRPIINTGLKYAIISKSDLPCLLLQR 79 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 A + + +++L ++SY S + + ++ + P EQ Sbjct: 80 VAKFKNYANTVSNSFLTIWLQSYFFINSIDPGRSNGVPHISTKQLEMTLFPLLPQSEQDR 139 Query: 378 ITNVINVETARIDVL----VEKIEQSIVLLKERRSSFIA 412 I + + + L + + L + I Sbjct: 140 IISKTDELIQTCNKLKYIIKTAKQTQLHLADALTDAAIN 178 >gi|20090949|ref|NP_617024.1| StySKI methylase [Methanosarcina acetivorans C2A] gi|19916032|gb|AAM05504.1| StySKI methylase [Methanosarcina acetivorans C2A] Length = 104 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 15/104 (14%), Positives = 31/104 (29%), Gaps = 13/104 (12%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS 73 +P+ W+ + G ES GK + I + +++ G + Sbjct: 4 KLPEGWEWNKLSELANFFYGGAFESSYFNEDGKGVKIIRIRNLKQGFTE------TYYAG 57 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 + + IL G G + + + + L K Sbjct: 58 EYDESYLVQNSDILIGMDGEF-NIVKWTGEPALLNQRVCKLIVK 100 >gi|288917625|ref|ZP_06411989.1| N-6 DNA methylase [Frankia sp. EUN1f] gi|288351018|gb|EFC85231.1| N-6 DNA methylase [Frankia sp. EUN1f] Length = 761 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 53/155 (34%), Gaps = 6/155 (3%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 N I ++ T + PG+IV + +L + + I TS Sbjct: 609 NRISPEMIDHVEPDLAEKLTRYRLRPGDIVCVRTGQLGRQ-ALVTEEQRGWLIGTSCLRL 667 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNV 381 +D +YL + + + A +G L ++RLP+L+P +Q I Sbjct: 668 RPNESVDPSYLLYYLALPQTHEWLLAHSTGSAVRLVTAATIRRLPLLLPDRGQQERIGVT 727 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVT 416 ++ +D L ++ R + + Sbjct: 728 VSA----LDDLAALHDRIRRAGTGLRDALLPLVFR 758 Score = 43.2 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 54/173 (31%), Gaps = 13/173 (7%) Query: 24 HWKVVPIKRFTKLNTGRT------SESGKDIIYIGLEDVESGTGKYLPKDGNSRQ-SDTS 76 WK +P+ + G + + ++ D ++ Sbjct: 568 SWKRLPLGDVCDVLAGFSGAIRTDHNGPSGTAVVKPRNLVENRISPEMIDHVEPDLAEKL 627 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKD-VLPELLQGWLLSIDV 133 T G I+ + G R+A++ + + T L L+P + V P L +L Sbjct: 628 TRYRLRPGDIVCVRTGQLGRQALVTEEQRGWLIGTSCLRLRPNESVDPSYLLYYLALPQT 687 Query: 134 TQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVL---IREKIIAETVRIDT 183 + + A G+ + I +P+ +P +Q + D Sbjct: 688 HEWLLAHSTGSAVRLVTAATIRRLPLLLPDRGQQERIGVTVSALDDLAALHDR 740 >gi|319745000|gb|EFV97328.1| type I restriction-modification system specificty subunit [Streptococcus agalactiae ATCC 13813] Length = 210 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 61/182 (33%), Gaps = 10/182 (5%) Query: 30 IKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + G+ S DI I L D+ Y + + + +G + Sbjct: 30 LSELVDCFKGKAVPSKAEAGDIRIINLSDMSPLGIDYHNLKTFQDEQRSLLKYLLQEGDV 89 Query: 87 LYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQG-WLLSIDVTQRIEAICEG 143 L G + AI D+ + S +L+P + + S + Q +E +G Sbjct: 90 LIASKGTVKKVAIFEEQDYPVVASANITILRPTQHIRGYYLKLFFDSEEGQQALENANKG 149 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + K + NI +P PL Q + +I + +I E E+ Q + Sbjct: 150 KAVMNISTKELLNIAIPSIPLFRQ----DYLIQRYKQGLNDYERKIARAEQEWERIQNDI 205 Query: 204 SY 205 Sbjct: 206 RQ 207 >gi|148377835|ref|YP_001256711.1| restriction modification system specificitysubunit HsdS [Mycoplasma agalactiae PG2] gi|148291881|emb|CAL59272.1| restriction modification system specificitysubunit HsdS [Mycoplasma agalactiae PG2] Length = 183 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 53/164 (32%), Gaps = 5/164 (3%) Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 + + S +I ++ M + + + I +I + D + R Sbjct: 22 WKLHELVSYRSSTMVINDVKKYGMFDVYDPNKAVGKTNKRPIEVSYISIVKDGDAGRIRL 81 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + II S A+ + + + + + + F D + Sbjct: 82 LPKNIIILSTMGALIAREPYKIDFIYHLLT-SYNDLSKERNGSIIPHIYFRDYGHNIYNI 140 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 P EQ I + + +D L+ ++ + LK +++ + Sbjct: 141 PEGNEQSKI----SSLFSILDSLITLHQRKLNSLKNIKNTLLEK 180 >gi|171779514|ref|ZP_02920478.1| hypothetical protein STRINF_01359 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] gi|171282131|gb|EDT47562.1| hypothetical protein STRINF_01359 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] Length = 198 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 60/170 (35%), Gaps = 6/170 (3%) Query: 28 VPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 V + T+ G+ ++ + L D+ Y + D+ + + G Sbjct: 16 VSLSDVTEHFKGKAVSKLGDTGNVSVVNLSDMTETDIDYDHLKKIDAEQDSVSRYLLEDG 75 Query: 85 QILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAIC 141 +L G + A+ D D I S VL+P + ++ +L S + +E Sbjct: 76 DVLIASKGTVKKVAVFHDQDRAIIASANITVLRPTADISGTYIKLFLESELGQELLETTN 135 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G + + + K I +I +P +Q + ++ IT + Sbjct: 136 TGKNVMNLNTKKIVSIKIPKLQPLKQAFLIQRYEQGLKDYKRKITRANQE 185 >gi|307608919|emb|CBW98319.1| hypothetical protein LPW_01751 [Legionella pneumophila 130b] Length = 225 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 59/178 (33%), Gaps = 10/178 (5%) Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI---VDPGEIVFRFIDL 298 K +L S + + +I + + + Q + G+I+F Sbjct: 38 YSFRGKIPELKNSGVYCVQMKDINETYNVNWSTVIETILPSRQSQVSLQFGDILFAARGQ 97 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRSYDLCKVFYAMG-SGLRQ 355 +N + + I + ++ + D Y+AW + + F + Sbjct: 98 RNYAALINAELKERLAIAAPQFFVIRLNVPDVLPEYIAWFLNQTIAQRYFLSNAEGSTTP 157 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 S++ + ++ P+++P +K+Q I I + + I + + + Sbjct: 158 SIRKQVLEATPIILPTLKQQKTI----MELATTISKEKQLAHKIIANGELLMQTLLNE 211 >gi|237649413|ref|ZP_04523665.1| type I restriction enzyme [Streptococcus pneumoniae CCRI 1974] gi|237821512|ref|ZP_04597357.1| type I restriction enzyme [Streptococcus pneumoniae CCRI 1974M2] Length = 226 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/142 (19%), Positives = 48/142 (33%), Gaps = 5/142 (3%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 IP W+ V IK E I D + Y + + Q+ + Sbjct: 83 DIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRAR 142 Query: 79 SIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 + ++ +L+ + PYL+ + I ST F+VL L +LLS + Sbjct: 143 KLVSQNSVLFSTVRPYLKNIAVVRELKEYLIASTAFIVLDTLLNETYLKY-YLLSDNFIN 201 Query: 136 RIEAICEGATMSHADWKGIGNI 157 R+ G + + + Sbjct: 202 RVNNKSTGTSYPAINDYNFNLL 223 Score = 36.3 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 49/152 (32%), Gaps = 6/152 (3%) Query: 221 GIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 I+ +PD WE ++ + + I + S + +N+ Sbjct: 77 EIDVPYDIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQ 136 Query: 281 ---ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 ++V ++F + ++ ++ +I S V ++ TYL + + Sbjct: 137 APSRARKLVSQNSVLFSTVRPYLKNIAVVR--ELKEYLIASTAFIVLDTLLNETYLKYYL 194 Query: 338 RSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVL 368 S + +G ++ + L + Sbjct: 195 LSDNFINRVNNKSTGTSYPAINDYNFNLLLIA 226 >gi|294793174|ref|ZP_06758320.1| type I restriction-modification system specificity determinant [Veillonella sp. 6_1_27] gi|294456119|gb|EFG24483.1| type I restriction-modification system specificity determinant [Veillonella sp. 6_1_27] Length = 167 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 52/157 (33%), Gaps = 5/157 (3%) Query: 28 VPIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + + G+ + S + YI E++ + + + + Sbjct: 3 CKLSDICEYRKGKVNTSNLTLKTYISTENMLPDKAGVVEANSLPSTTLVQEYK---EHDT 59 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL-LSIDVTQRIEAICEGAT 145 L + PY +K A DG CS LV Q + + ++ + D A +G Sbjct: 60 LVSNIRPYFKKVWQAKHDGGCSNDVLVFQGNLNVDKDFLYYILANDDFFAYSMATSKGTK 119 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 M D K I + + + Q I + +I+ Sbjct: 120 MPRGDKKSIMQYELQLFDIKIQKKIVSILKLLDKKIE 156 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 16/159 (10%), Positives = 47/159 (29%), Gaps = 4/159 (2%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + K + +S N++ + ++ S + +V + Sbjct: 1 MKCKLSDICEYRKGKVNTSNLTLKTYISTENMLPD---KAGVVEANSLPSTTLVQEYKEH 57 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + G + +D +L +++ + D A G Sbjct: 58 DTLVSNIRPYFKKVWQAKHDGGCSNDVLVFQGNLNVDKDFLYYILANDDFFAYSMATSKG 117 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + + + + + IK Q I +++ + +I+ Sbjct: 118 TKMPRGDKKSIMQYELQLFDIKIQKKIVSILKLLDKKIE 156 >gi|332074781|gb|EGI85254.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae GA17570] Length = 69 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 24/60 (40%), Gaps = 7/60 (11%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV-------LVEKIEQSIVLLKER 406 ++L + V + + +PP+ EQ I I ++D L + ++ LK Sbjct: 1 MKNLNSDKVASILIPLPPLAEQQRIIEAIESALEKVDEYAESYNRLEQLDKKFPDKLKNL 60 >gi|329123733|ref|ZP_08252293.1| type I restriction/modification enzyme [Haemophilus aegyptius ATCC 11116] gi|327469932|gb|EGF15397.1| type I restriction/modification enzyme [Haemophilus aegyptius ATCC 11116] Length = 169 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 20/130 (15%), Positives = 39/130 (30%), Gaps = 8/130 (6%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 I + + I S V+ T + +F Sbjct: 47 NTITISASGANAGFVNFWTEKIFASDCTTVRADNYVGTKFIFTYLQSIQENIFDLARGAA 106 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + + +D+KRLP+ P+ Q + E +ID + I + + + Sbjct: 107 QPHVYPDDIKRLPIPKVPLDIQQKVVE----ECQKIDDEFNRTRMQIEEYRAKFAKIFNE 162 Query: 414 AVTGQIDLRG 423 +I +RG Sbjct: 163 L---EI-VRG 168 >gi|217033076|ref|ZP_03438542.1| hypothetical protein HPB128_179g2 [Helicobacter pylori B128] gi|216945197|gb|EEC23884.1| hypothetical protein HPB128_179g2 [Helicobacter pylori B128] Length = 169 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 55/155 (35%), Gaps = 13/155 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESGKDII-----YIGLEDVESGTGKYLPKDGNSRQSDTS 76 P +W+ V + +G ++ +D I YI +V + N + Sbjct: 7 PSNWQRVRLGDIGITISGLAGKTKQDFINGNAKYITFLNVLNNVIIDTSILENVKIYPNE 66 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIA-------DFDGICSTQFLVLQPKDVLPELLQGWLL 129 + F K + + ++ + D + S F + L +L+ Sbjct: 67 KQNSFKKYDLFFNTSSETPKEVGMCAVLLDDIDQVFLNSFCFGFRIFDKAVDSLFLSYLI 126 Query: 130 SIDV-TQRIEAICEGATMSHADWKGIGNIPMPIPP 163 + ++ + E + +G+T + G N+ + +PP Sbjct: 127 NSEIGRKAFENLAQGSTRYNLSKSGFNNVCLILPP 161 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 44/122 (36%), Gaps = 5/122 (4%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA--QVME 312 I L+ N + + +K E ++ F + + + ++ Sbjct: 40 YITFLNVLNNVIIDTSILENVKIYPNEKQNSFKKYDLFFNTSSETPKEVGMCAVLLDDID 99 Query: 313 RGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLV 369 + + S + +DS +L++L+ S K F + G R +L + +++ Sbjct: 100 QVFLNSFCFGFRIFDKAVDSLFLSYLINSEIGRKAFENLAQGSTRYNLSKSGFNNVCLIL 159 Query: 370 PP 371 PP Sbjct: 160 PP 161 >gi|222153122|ref|YP_002562299.1| hypothetical protein SUB0973 [Streptococcus uberis 0140J] gi|222113935|emb|CAR42171.1| conserved hypothetical protein [Streptococcus uberis 0140J] Length = 198 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 63/189 (33%), Gaps = 10/189 (5%) Query: 26 KVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + + + G+ S + I L D+ Y Sbjct: 14 EKIRLGDVVDCFKGKAISSKVEDGEFGLINLSDMTKEGINYEGIRTFHLDRRQLLRYFLE 73 Query: 83 KGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEA 139 G +L G + I + S+ VL+P + L + L ++ ++A Sbjct: 74 DGDVLIASKGTVKKVCIFHKQKREFVASSNITVLRPIEKLRGYYIKFFLDSEIGQSFLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+ + PL +Q + +I + +R + +++ E Sbjct: 134 ADHGKDVINLSTKELLDIPVSLIPLVKQ----DYLINQYLRGLSDYHRKLKRAEQEWLFI 189 Query: 200 QALVSYIVT 208 Q+ + + Sbjct: 190 QSEIEKSLH 198 >gi|13508377|ref|NP_110327.1| K family restriction enzyme specificity determining subunit [Mycoplasma pneumoniae M129] gi|2496434|sp|P75159|T1SX_MYCPN RecName: Full=Putative type I restriction enzyme specificity protein MPN_638; Short=S protein gi|1673868|gb|AAB95852.1| specificity determining subunit for restriction enzyme belonging to the K family of S proteins [Mycoplasma pneumoniae M129] Length = 375 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/141 (20%), Positives = 54/141 (38%), Gaps = 4/141 (2%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K E N G+K I R + + G + + P Sbjct: 40 KYEYFNGGIKASGRTNEFNTFKNTISIIIGGSCGYVR--LADKDYFCGQSSCTLTVLDPL 97 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVE 385 ID + + ++S + K+ ++++ D+K LP+ + I++Q I + ++V Sbjct: 98 EIDLKFAYYALKSQE-EKITSLASGTTIKNIRLSDLKDLPIPLVKSIQDQRTIAHALSVF 156 Query: 386 TARIDVLVEKIEQSIVLLKER 406 RI+ L E IE + L E Sbjct: 157 DLRIEHLNELIEVNRKLRDEY 177 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 47/397 (11%), Positives = 111/397 (27%), Gaps = 40/397 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +W + +L G E + + GKY +G + S + Sbjct: 11 SNWTKKTLGSLFELKKGEMLEKE----------LLAPDGKYEYFNGGIKASGRTNEFNTF 60 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I G + + + + +L + ++I ++ Sbjct: 61 KNTISIIIGGSCGYVRLADKDYFCGQSSCTLTVLDPLEIDLKFAYYALKSQEEKITSLAS 120 Query: 143 GATMSHADWKGIGNIPMPI-PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G T+ + + ++P+P+ + +Q I + +RI+ L +L E Sbjct: 121 GTTIKNIRLSDLKDLPIPLVKSIQDQRTIAHALSVFDLRIEHLNELIEVNRKLRDEYAHK 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L + L+PD + + + + + K ++++ Sbjct: 181 LFT------LDPDFLTHW----NLHELHEQMGEISLGEVFHLKSGK---YLKADERFEDG 227 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + E G+ + + + G A Sbjct: 228 KFPYYGAGIESTSFVNEPNT------KGDTLSMIANGYSIGNIRYHTIPWFNGTGGIAME 281 Query: 322 AVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDI 378 A+KP+ + ++ DL + F S + + + V Q Sbjct: 282 ALKPNKTYVPFFYCALKYMQKDLKERFKRDES---PFISLKLAGEIKVPFVKSFALQRKA 338 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +I + ++ E+ + I R + + Sbjct: 339 GKIIYLLDKTLEECKEEAKSLI----SIRDNLLGKLF 371 >gi|325973640|ref|YP_004250704.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323652242|gb|ADX98324.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 86 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 13/59 (22%), Positives = 26/59 (44%), Gaps = 4/59 (6%) Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + L + +L+P ++ Q I N ++ D L+E E+ I L+ R++ Sbjct: 2 TAQLGLYLNKFLSIKLLIPTLQMQEKIGNTLSA----YDELIENNEKQIKALQRIRTTI 56 >gi|24373035|ref|NP_717077.1| type I restriction-modification system, S subunit, putative [Shewanella oneidensis MR-1] gi|24347206|gb|AAN54522.1|AE015591_1 type I restriction-modification system, S subunit, putative [Shewanella oneidensis MR-1] Length = 446 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 98/359 (27%), Gaps = 32/359 (8%) Query: 41 TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII 100 E + ++G D+ LP + Q + K +L + G R + Sbjct: 43 VKEERYGVPFMGSVDIIQANLDRLPL-ISKEQVSRKPLFKVFKDWVLITRSGTIGRMTLA 101 Query: 101 ADFD--GICSTQFL--VLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGN 156 CS + V P+ V P L +L S + + GA + H + + N Sbjct: 102 RQEMDGHACSEHVMRVVPNPEKVSPGYLYCYLRSKFGVPLVVSSTYGAIIQHIEPHHVIN 161 Query: 157 IPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVK 216 +P+PI + E I + +L+ E ++ + Sbjct: 162 LPVPIVDKQLEEKAHELINKCGDNRTESNALLKKAGQLINEHFSFPNKLALSHRIFTHSA 221 Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 S ++ H V + E L E + + G + + G+ Sbjct: 222 ASSSLVQKRMDATYHDRVAQMSDDLVEQAGAEKTLAELGVNTGESGRMKLVFTESDHGVP 281 Query: 277 PESYET---------------------YQIVDPGEIVFRFIDLQNDKRSLRSAQV--MER 313 + V +I+ E Sbjct: 282 FTTSGEIFRARYEPQRFLAKSKLGDVADWGVRQEDILLARSGQVGGIIGTGVWADSRFEN 341 Query: 314 GIITSAYMAV--KPHGIDSTYLAWLMRSYD--LCKVFYAMGSGLRQSLKFEDVKRLPVL 368 ++ + + + + YL + D ++ + L +DV +L + Sbjct: 342 AAVSVDVIRIKAQESEVLPGYLYAYLMCTDVGYRQLIRSAAGSSIPHLSSDDVLKLKLP 400 >gi|241895461|ref|ZP_04782757.1| conserved hypothetical protein [Weissella paramesenteroides ATCC 33313] gi|241871435|gb|EER75186.1| conserved hypothetical protein [Weissella paramesenteroides ATCC 33313] Length = 171 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 60/165 (36%), Gaps = 2/165 (1%) Query: 41 TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAII 100 T+ + + I E++ SG G+ K+ D+ F K +LYGKL PYL Sbjct: 3 TTSRKETLPRIEYENIISGEGRL--KNDVFEIGDSRKGIYFQKNDVLYGKLRPYLNNWFF 60 Query: 101 ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMP 160 A F GI F VL+ + L+ + + + G M +DW + Sbjct: 61 ATFQGIAIGDFWVLRAAPCISPKFIFSLIQSPRYKVVANMTTGTKMPRSDWNNVSATEFR 120 Query: 161 IPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 I ++ + ++ + T+ + +L + Sbjct: 121 IARNEDEQMKIGQLFLSLDNLITVNQRTTILFAPDQNSTLSLRFH 165 >gi|171779407|ref|ZP_02920371.1| hypothetical protein STRINF_01252 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] gi|171282024|gb|EDT47455.1| hypothetical protein STRINF_01252 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] Length = 133 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 10/70 (14%), Positives = 23/70 (32%), Gaps = 4/70 (5%) Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + L + + + S SL + + +P KEQ I + ++ Sbjct: 47 NIDLQFTLAIFKKINWKKYDESTGVPSLSKSVINNVFAFLPSFKEQKKIGSF----FQQL 102 Query: 390 DVLVEKIEQS 399 D + ++ Sbjct: 103 DDTITLHQRK 112 >gi|301633654|gb|ADK87208.1| type I restriction modification DNA specificity domain protein [Mycoplasma pneumoniae FH] Length = 375 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/141 (20%), Positives = 54/141 (38%), Gaps = 4/141 (2%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 K E N G+K I R + + G + + P Sbjct: 40 KYEYFNGGIKASGRTNEFNTFKNTISIIIGGSCGYVR--LADKDYFCGQSSCTLTVLDPL 97 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVE 385 ID + + ++S + K+ ++++ D+K LP+ + I++Q I + ++V Sbjct: 98 EIDLKFAYYALKSQE-EKITSLASGTTIKNIRLSDLKDLPIPLVKSIQDQRTIAHALSVF 156 Query: 386 TARIDVLVEKIEQSIVLLKER 406 RI+ L E IE + L E Sbjct: 157 DLRIEHLNELIEVNRKLRDEY 177 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 48/397 (12%), Positives = 112/397 (28%), Gaps = 40/397 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 +W + +L G E + + GKY +G + S + Sbjct: 11 SNWTKKTLGSLFELKKGEMLEKE----------LLAPDGKYEYFNGGIKASGRTNEFNTF 60 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K I G + + + + +L + ++I ++ Sbjct: 61 KNTISIIIGGSCGYVRLADKDYFCGQSSCTLTVLDPLEIDLKFAYYALKSQEEKITSLAS 120 Query: 143 GATMSHADWKGIGNIPMPI-PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G T+ + + ++P+P+ + +Q I + +RI+ L +L E Sbjct: 121 GTTIKNIRLSDLKDLPIPLVKSIQDQRTIAHALSVFDLRIEHLNELIEVNRKLRDEYAHK 180 Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L + L+PD + + + + + K ++++ Sbjct: 181 LFT------LDPDFLTHW----NLHELHEQMGEISLGEVFHLKSGK---YLKADERFEDG 227 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 + E G+ + + + G A Sbjct: 228 KFPYYGAGIESTSFVNEPNT------KGDTLSMIANGYSIGNIRYHTIPWFNGTGGIAME 281 Query: 322 AVKPHGIDSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDI 378 A+KP+ + ++ DL + F S + + + V Q Sbjct: 282 ALKPNETYVPFFYCALKYMQKDLKERFKRDES---PFISLKLAGEIKVPFVKSFALQRKA 338 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +I +D +E+ ++ L R + + Sbjct: 339 GKIIY----LLDKTLEEYKEEAKSLISIRDNLLGKLF 371 >gi|225378421|ref|ZP_03755642.1| hypothetical protein ROSEINA2194_04089 [Roseburia inulinivorans DSM 16841] gi|225209736|gb|EEG92090.1| hypothetical protein ROSEINA2194_04089 [Roseburia inulinivorans DSM 16841] Length = 105 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 10/85 (11%), Positives = 35/85 (41%), Gaps = 3/85 (3%) Query: 308 AQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKR 364 + + + ++P+ + +L + + + V + ++++ E ++ Sbjct: 3 IEEDRKFVFQRHIAILRPNLEKVIPEFLYYTLLNPQFYTVADYLAIGAAQRTISLESLRN 62 Query: 365 LPVLVPPIKEQFDITNVINVETARI 389 + + +P + +Q I +VI +I Sbjct: 63 IEIELPSLSQQKRIVDVIAPIDKKI 87 >gi|284055016|ref|ZP_06385226.1| type I restriction modification system, subunit S [Arthrospira platensis str. Paraca] Length = 193 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 33/103 (32%), Gaps = 1/103 (0%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 Y I+D I+ D+ + R +G + +L Sbjct: 91 DYVNEYIIDDDIILLAEDGGYFDEHTTRPIAYRMKGKCWVNNHVHILKAKPGYHQDFLFY 150 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITN 380 V + SG R L ++ ++ + +P +EQ I + Sbjct: 151 CLVHKNVLPFLASGTRAKLNKSEMNKIEINLPKNSEEQKAIAS 193 Score = 41.7 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 10/49 (20%), Positives = 24/49 (48%), Gaps = 4/49 (8%) Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRG 423 Q I +V++ D L+ +++ I + +++ + +TG+ L G Sbjct: 1 QKAIASVLSDV----DELISSLDKLIAKKRHIKTATMQQLLTGKTRLPG 45 >gi|169796762|ref|YP_001714555.1| putative restriction-modification protein [Acinetobacter baumannii AYE] gi|169149689|emb|CAM87580.1| conserved hypothetical protein; putative restriction-modification protein [Acinetobacter baumannii AYE] Length = 760 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/240 (16%), Positives = 89/240 (37%), Gaps = 15/240 (6%) Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 + + +ID + + F +L K + + +NP++ + I + Sbjct: 512 LDSFRRKIDENDLKNLDFADLNKSDFDKYYNELGFLKVNPELIRSNDYIYNYAHYSNSHI 571 Query: 234 VKPFF----ALVTELNRKNTKLIESNILSLSY---GNIIQKLETRNMGLKPESYETYQIV 286 F + L+ K ++NI +S +I + E + Y+ V Sbjct: 572 KSKFPTIKLKELLSLSGKVKVGEDTNIPIMSITMEHGLIDQHEKFKKRVASSDISGYKKV 631 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKV 345 E+V D+ L + + ++ AY + ++ YL ++RS L K+ Sbjct: 632 FKNELVM---GFPIDEGVLGFQKYYDAAAVSPAYKIFRLKREVNVEYLDLILRSNSLRKI 688 Query: 346 FYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + G R+S+ E + + PP + + I + I+ +++ ++ + L Sbjct: 689 YKSKMQGSVERRRSIPDEMFLNIEIPNPPEEVKDQIVKQ-HKLIKEIENSLKENQKKLRL 747 >gi|254672388|emb|CBA05665.1| type I restriction-modification system specificity determinant [Neisseria meningitidis alpha275] Length = 60 Score = 48.3 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 9/44 (20%), Positives = 22/44 (50%) Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 +++K+L + +P + EQ I +++ + E + + I L Sbjct: 1 MKELKKLKIPIPSLPEQEKIVAILDKFDTLTHSVSEGLPREIAL 44 >gi|307244003|ref|ZP_07526124.1| conserved domain protein [Peptostreptococcus stomatis DSM 17678] gi|306492653|gb|EFM64685.1| conserved domain protein [Peptostreptococcus stomatis DSM 17678] Length = 194 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 15/122 (12%), Positives = 38/122 (31%), Gaps = 7/122 (5%) Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR--SYDLCKVFYAMGS 351 + + S A +S+ + D+ + Sbjct: 72 SNVSGPSITVSGSGVNAGYVSFHLHDIWAADCSYNNSSCYIHCLYVMMKDIQAQITELQK 131 Query: 352 GL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 G + + +++ L + P I N + ++I ++E + I LK+ ++ Sbjct: 132 GTAQPHVYPKELNPLEITYPNSD----ILNKLEQSLSKIFAVIEDNDNEIAKLKKMQTVL 187 Query: 411 IA 412 +A Sbjct: 188 LA 189 >gi|299142939|ref|ZP_07036065.1| type I restriction-modification system, S subunit [Prevotella oris C735] gi|298575555|gb|EFI47435.1| type I restriction-modification system, S subunit [Prevotella oris C735] Length = 191 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 57/147 (38%), Gaps = 7/147 (4%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 K N G P Y V I N ++ + + Sbjct: 47 HGKYYVMNGGTDPSGYYDNYNVGAHTISISEGG--NSCGYVQFNKCPFWCGGHCYSIQNI 104 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 I++ YL +++ + + +GSGL +++ +D+ + +P K+Q I++++ Sbjct: 105 ADNINNLYLYHYLKTEEKAIMKLRIGSGL-PNIQKKDLATFKIKLPTSKQQKAISDIL-- 161 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFI 411 + ++ E EQ ++ +++ + + Sbjct: 162 --SLLEQKAEIEEQILIAMQDEKQYLL 186 Score = 38.6 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 52/182 (28%), Gaps = 14/182 (7%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ + + G + S GKY +G + S Sbjct: 23 DIITLSEICDIVKGEQINGE----------LLSEHGKYYVMNGGTDPSGYYDNYNVGAHT 72 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 I + G C +Q L + + I + G+ Sbjct: 73 ISISEGGNSCGYVQFNKCPFWCGGHCYSIQNIADNINNLYLYHYLKTEEKAIMKLRIGSG 132 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + + K + + +P +Q I + + ++ + + ++++KQ L+ Sbjct: 133 LPNIQKKDLATFKIKLPTSKQQKAISDIL----SLLEQKAEIEEQILIAMQDEKQYLLRQ 188 Query: 206 IV 207 + Sbjct: 189 MF 190 >gi|228477499|ref|ZP_04062135.1| restriction endonuclease S subunit [Streptococcus salivarius SK126] gi|228250934|gb|EEK10122.1| restriction endonuclease S subunit [Streptococcus salivarius SK126] Length = 206 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 8/186 (4%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K+V + G+ + + I L D+ Y S + + Sbjct: 19 KLVRLGDVVDQFKGKAVPAKAEPGEFAVINLSDMTPNGIAYDDLKTFSEERRKLLRFLLE 78 Query: 83 KGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIE 138 G +L G + A+ D + + S+ VL+PK+ L + L ++ ++ Sbjct: 79 DGDVLIASKGTVQKVAVFEDQGKREVVASSNITVLRPKEKLRGFYIKFFLETEIGRAYLD 138 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQ-VLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + + +I +P P+ +Q I + ++ + + K Sbjct: 139 YADKGKAVLNLSTADLLDIKIPEIPIVKQDYQIAAYLRGRADFHRKMVRAEQEWENIQKN 198 Query: 198 KKQALV 203 +AL Sbjct: 199 VTEALF 204 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 10/102 (9%), Positives = 30/102 (29%), Gaps = 2/102 (1%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ E ++ + + Y+ + + + Sbjct: 74 RFLLEDGDVLIASKGTVQKVAVFEDQGKREVVASSNITVLRPKEKLRGFYIKFFLETEIG 133 Query: 343 CKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF-DITNVI 382 G +L D+ + + PI +Q I + Sbjct: 134 RAYLDYADKGKAVLNLSTADLLDIKIPEIPIVKQDYQIAAYL 175 >gi|327460990|gb|EGF07323.1| hypothetical protein HMPREF9394_0856 [Streptococcus sanguinis SK1057] Length = 204 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 26/210 (12%), Positives = 59/210 (28%), Gaps = 13/210 (6%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM 273 S +G + + F + K I+ + ++ ++ Sbjct: 1 MKIFYSSNSIQLGDIFELKSGYAFKSKDWVDEGKPVIKIKDIDGLTIDITNLNYVKNKSQ 60 Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYL 333 K ++E V EIV K + +G + S + Sbjct: 61 LSKASNFE----VFGKEIVMALTGATTGKIGVIPKNF--KGYVNQRVGLFYAKTELSYAV 114 Query: 334 AWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 W + + + + + +L V + V I ++ + + Sbjct: 115 LWSILQQQNIITDLIKLSSGSAQANLSPFSVNSYDLNVTFKDL---I--KLDKVLSPLYE 169 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 L I L E R + + ++G+I + Sbjct: 170 LFCFNLSEIQRLSELRDTLLPKLLSGEISV 199 Score = 44.8 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 50/146 (34%), Gaps = 9/146 (6%) Query: 28 VPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA- 82 + + +L +G +S + I ++D++ T + +S S S F Sbjct: 10 IQLGDIFELKSGYAFKSKDWVDEGKPVIKIKDIDGLTIDITNLNYVKNKSQLSKASNFEV 69 Query: 83 -KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQG-WLLSIDVTQRIE 138 +I+ G K + +F G + + + K L + L ++ + Sbjct: 70 FGKEIVMALTGATTGKIGVIPKNFKGYVNQRVGLFYAKTELSYAVLWSILQQQNIITDLI 129 Query: 139 AICEGATMSHADWKGIGNIPMPIPPL 164 + G+ ++ + + + + Sbjct: 130 KLSSGSAQANLSPFSVNSYDLNVTFK 155 >gi|261366728|ref|ZP_05979611.1| type I restriction-modification system specificity subunit [Subdoligranulum variabile DSM 15176] gi|282571554|gb|EFB77089.1| type I restriction-modification system specificity subunit [Subdoligranulum variabile DSM 15176] Length = 201 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 14/140 (10%), Positives = 36/140 (25%), Gaps = 13/140 (9%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 E ++ G++V + + + + Y Sbjct: 63 ISEENHEKYVLSEGDVVVARTGATVGYAKMVGRNIPDSVFASFLVRIRPIDDEYRYYFGL 122 Query: 336 LMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP---PIKEQF-DITNVINVETARID 390 + S + G + + + +P + E I++ + Sbjct: 123 AITSAEFLNFVQTNAGGSAQPQANPPLLGEFELSIPNKQSLPEFNTKISSFLG------- 175 Query: 391 VLVEKIEQSIVLLKERRSSF 410 ++E E I L E + + Sbjct: 176 -VIESNETEISKLHEVKDTM 194 Score = 44.8 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 59/184 (32%), Gaps = 7/184 (3%) Query: 29 PIKRFTKLNTGRTSESGKDI---IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 +K ++ + G T + + ++ + D+ + + + ++G Sbjct: 18 KLKDYSVMQYGYTETATTEPVGPKFLRITDIAQNYIDWNGVPYCPISEENHEKYVLSEGD 77 Query: 86 ILYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 ++ + G + A + + S + D + S + ++ Sbjct: 78 VVVARTGATVGYAKMVGRNIPDSVFASFLVRIRPIDDEYRYYFGLAITSAEFLNFVQTNA 137 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 G+ A+ +G + IP KI + I++ TE + E+ + Sbjct: 138 GGSAQPQANPPLLGEFELSIPNKQSLPEFNTKISSFLGVIESNETEISKLHEVKDTMVKM 197 Query: 202 LVSY 205 L S Sbjct: 198 LSSR 201 >gi|13508029|ref|NP_109978.1| hypothetical protein MPN290 [Mycoplasma pneumoniae M129] gi|12229980|sp|P75487|T1SY_MYCPN RecName: Full=Putative type-1 restriction enzyme specificity protein MPN_290; AltName: Full=S.MpnORFEAP; AltName: Full=Type I restriction enzyme specificity protein MPN_290; Short=S protein gi|1674242|gb|AAB96193.1| hypothetical protein MPN_290 [Mycoplasma pneumoniae M129] Length = 145 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 23/127 (18%), Positives = 40/127 (31%), Gaps = 9/127 (7%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y V+ I + + S V D +L +R+ Sbjct: 3 YSKTFRVEEKSITVSARGT----IGVVFYRDFAYLPAVSLICFVPKEEFDIRFLFHALRA 58 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 K A G L K + VP +K+Q +I +++ + L E + Sbjct: 59 IKFKKQGSATGQ-----LTVAQFKEYGIHVPSLKKQKEIAAILDPLYSFFTDLNEGLPAE 113 Query: 400 IVLLKER 406 I L K++ Sbjct: 114 IELRKKQ 120 >gi|283796719|ref|ZP_06345872.1| oxidoreductase, FAD/FMN-binding [Clostridium sp. M62/1] gi|291075603|gb|EFE12967.1| oxidoreductase, FAD/FMN-binding [Clostridium sp. M62/1] Length = 173 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 15/139 (10%), Positives = 39/139 (28%), Gaps = 5/139 (3%) Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 + +G + + +D ++ I ++ + Sbjct: 27 YMFITPTELHGGYKISSSEKTLTEAGLESIKTNSIDGISVLVGCIGWDMGNVAMCFEKCA 86 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVP 370 I S + +L + + + + +++ S R L + + + Sbjct: 87 TNQQINS--ITQISEDYSPYFLYYWLSTK--KEYLFSISSVTRTPILSKGVFEEIEIPSI 142 Query: 371 PIKEQFDITNVINVETARI 389 EQ I V+ V +I Sbjct: 143 SRSEQDKIAKVLLVLDKKI 161 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 49/161 (30%), Gaps = 7/161 (4%) Query: 29 PIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 IK + TG+T ++ G D ++I ++ G + + S + Sbjct: 2 KIKDIGNVVTGKTPQTAHAEFYGGDYMFITPTELHGGYKISSSEKTLTEAGLESIKTNSI 61 Query: 83 KG-QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G +L G +G + + + Q + + + + +I Sbjct: 62 DGISVLVGCIGWDMGNVAMCFEKCATNQQINSITQISEDYSPYFLYYWLSTKKEYLFSIS 121 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 I +P +EQ I + ++ +I Sbjct: 122 SVTRTPILSKGVFEEIEIPSISRSEQDKIAKVLLVLDKKIK 162 >gi|207859655|ref|YP_002246306.1| type I restriction-modification system methyltransferase [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|206711458|emb|CAR35843.1| putative Type I restriction-modification system methyltransferase [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] Length = 192 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 16/121 (13%), Positives = 42/121 (34%), Gaps = 5/121 (4%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMR 338 + + +I+ + + + Q A + V I+ YL + Sbjct: 56 EKLKINLQTNDILLPLRGERIPAMMIVNQQSTLVTTTNQIAVIRVNSLLINPEYLYYFFN 115 Query: 339 SYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT---NVINVETARIDVLVE 394 S + + A+ G +L + + L + +P Q ++ + + + ++ L+E Sbjct: 116 SPEGDEKISALQGGGLVVNLSLKKLLTLEIPIPLRPVQDEVIGLRKIWSEQKKTLEDLIE 175 Query: 395 K 395 Sbjct: 176 N 176 >gi|253563523|ref|ZP_04840980.1| restriction modification system DNA specificity subunit [Bacteroides sp. 3_2_5] gi|251947299|gb|EES87581.1| restriction modification system DNA specificity subunit [Bacteroides sp. 3_2_5] Length = 151 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 32/89 (35%), Gaps = 2/89 (2%) Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 +A G D YL + +++ L K+F GS + SL + V Sbjct: 60 VGKVHYYEQATWAHNTALFVKDFKGNDPKYLYYFLKNLHLDKMFDK-GSSVVPSLDRKVV 118 Query: 363 KRLPVLV-PPIKEQFDITNVINVETARID 390 L V I Q I +++ +I+ Sbjct: 119 HSLNVPCHKDIDCQKRIAAILSKIDRKIE 147 >gi|314934937|ref|ZP_07842296.1| probable specificity determinant HsdS [Staphylococcus caprae C87] gi|313652867|gb|EFS16630.1| probable specificity determinant HsdS [Staphylococcus caprae C87] Length = 242 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 15/168 (8%), Positives = 45/168 (26%), Gaps = 3/168 (1%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK---LIESNILSLSYGNIIQKLETRNMGLKPESYE 281 + W+ + +V N + + ++ ++ + + N G + Sbjct: 12 FPEFDEEWKKRKLGEVVNYKNGGSFESLVKNHGVYKLITLKSVNTEGKLCNSGKYIDDKC 71 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + + ++ ++ A+ P + + + + Sbjct: 72 VETLCNDTLVMILSEQAPGLVGMTAIIPNNNEYVLNQRVAALVPKQFIDSQFLSKLINRN 131 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 +++ V+ L P EQ I N + +I Sbjct: 132 QKYFSVRSAGTKVKNISKGHVENFNFLSPNYTEQQKIGNFFSKLDRQI 179 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 25/210 (11%), Positives = 56/210 (26%), Gaps = 6/210 (2%) Query: 23 KHWKVVPIKRFTKLNTGRTSES-GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 + WK + G + ES K+ L ++S + + D ++ Sbjct: 17 EEWKKRKLGEVVNYKNGGSFESLVKNHGVYKLITLKSVNTEGKLCNSGKYIDDKCVETLC 76 Query: 82 AKGQILYGKLGPYL----RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 ++ I + + + + + L PK + L + Sbjct: 77 NDTLVMILSEQAPGLVGMTAIIPNNNEYVLNQRVAALVPKQFIDS-QFLSKLINRNQKYF 135 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 G + + + N P EQ I +I+ + + + Sbjct: 136 SVRSAGTKVKNISKGHVENFNFLSPNYTEQQKIGNFFSKLDRQIELEEEKLELLEQQKRG 195 Query: 198 KKQALVSYIVTKGLNPDVKMKDSGIEWVGL 227 Q + S + D I+ + Sbjct: 196 YIQKIFSQDLRFKDENGNSYPDWSIKKIED 225 >gi|254466444|ref|ZP_05079855.1| restriction endonuclease S subunit [Rhodobacterales bacterium Y4I] gi|206687352|gb|EDZ47834.1| restriction endonuclease S subunit [Rhodobacterales bacterium Y4I] Length = 201 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 61/198 (30%), Gaps = 9/198 (4%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDP 288 P+ WE + + ++ T Sbjct: 8 PEGWERLSASEAFEVNPKTPRNDEGIIRYVPMAALSETGMVIGRGPIEEREKSTSVRFRN 67 Query: 289 GEIVFRFIDL---QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK- 344 G+ + I ++ + E ++ ++ ++ + S Y+ R +D + Sbjct: 68 GDTLLARITPCLENGKTGYVQMLEDGEIACGSTEFIVLRQRRVSSYYVYLTARQHDFREN 127 Query: 345 -VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + +GS RQ ++ R V VPP + + + + + ++Q L Sbjct: 128 AIRSMIGSSGRQRVQPSCFDRYSVAVPP----AMLAKLFDEAVGDMFDQIGNLDQQNQKL 183 Query: 404 KERRSSFIAAAVTGQIDL 421 + R + + G+I + Sbjct: 184 SQARDLLLPRLMNGEIAV 201 Score = 43.6 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 66/192 (34%), Gaps = 10/192 (5%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +P+ W+ + ++N +T + + II S TG + + + +++V Sbjct: 7 VPEGWERLSASEAFEVNP-KTPRNDEGIIRYVPMAALSETGMVIGRGPIEEREKSTSVR- 64 Query: 81 FAKGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV 133 F G L ++ P L + ST+F+VL+ + V + D Sbjct: 65 FRNGDTLLARITPCLENGKTGYVQMLEDGEIACGSTEFIVLRQRRVSSYYVYLTARQHDF 124 Query: 134 TQR-IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + I ++ + + +PP L E + +I L + + Sbjct: 125 RENAIRSMIGSSGRQRVQPSCFDRYSVAVPPAMLAKLFDEAVGDMFDQIGNLDQQNQKLS 184 Query: 193 ELLKEKKQALVS 204 + L++ Sbjct: 185 QARDLLLPRLMN 196 >gi|240047223|ref|YP_002960611.1| hypothetical protein MCJ_000950 [Mycoplasma conjunctivae HRC/581] gi|239984795|emb|CAT04772.1| HYPOTHETICAL PROTEIN MCJ_000950 [Mycoplasma conjunctivae] Length = 75 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 14/62 (22%), Positives = 25/62 (40%), Gaps = 4/62 (6%) Query: 352 GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFI 411 + ++ F+D K VP I EQ +I ID L+ E + ++ + S + Sbjct: 15 AVVPNIYFKDYKHFEYFVPSINEQEEI----EKVFKNIDNLLNLYELKLQKIEMIKKSLL 70 Query: 412 AA 413 Sbjct: 71 DK 72 >gi|238923526|ref|YP_002937042.1| type I restriction-modification system, S subunit [Eubacterium rectale ATCC 33656] gi|238875201|gb|ACR74908.1| type I restriction-modification system, S subunit [Eubacterium rectale ATCC 33656] Length = 171 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 57/170 (33%), Gaps = 11/170 (6%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 +++L+ +I + + + + E++ DL + + S ++ + Sbjct: 1 MINLACIDINRNYRDGQLKYYANDVSADKQLTGNELLIACTDLTRNADIVGSPILVPKIA 60 Query: 316 ITSAYMAVKPH------GIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVL 368 + D YL +R+ SG L + + + Sbjct: 61 QQMTFSMDMAKLEVDNCIFDKYYLYMTLRTKYYHNFIKKYASGTNVLHLNLDGLNWYTMW 120 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 VPP+ Q ++I+ ++ ++ + Q L + R + + GQ Sbjct: 121 VPPLPLQSQFGHIIHKLQVHMNDILHENRQ----LYDLRDWLLPMLMNGQ 166 >gi|167949252|ref|ZP_02536326.1| Restriction modification system DNA specificity domain [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 77 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 8/45 (17%), Positives = 16/45 (35%), Gaps = 4/45 (8%) Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 K+Q I + ++ D L+ Q + LK + + Sbjct: 5 SQKKQRKIADCLSSM----DALITAHSQKLDALKAHKKGLMQQLF 45 >gi|313904108|ref|ZP_07837488.1| restriction modification system DNA specificity domain [Eubacterium cellulosolvens 6] gi|313471257|gb|EFR66579.1| restriction modification system DNA specificity domain [Eubacterium cellulosolvens 6] Length = 169 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 14/118 (11%), Positives = 35/118 (29%), Gaps = 7/118 (5%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 Y + F I Q + + A ++ +L +++ + Sbjct: 55 GYCNTYNHDGDFALIGRQGALCGNMNFSCGKAYFTEHAVAVKANSSSNTRFLYYMLDKMN 114 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 L + + L + +L + P +EQ + +D L+ ++ Sbjct: 115 LGQYSD---QSAQPGLAVGKLIKLENMFPSKEEQDKVGGF----FEELDNLITLHQRQ 165 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 45/160 (28%), Gaps = 17/160 (10%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W+ + T + +G+ +I D+E G+Y GN + +T + Sbjct: 15 DWEQRKLSDVTDEFQSGK---------FIAAADIEEA-GEYPVYGGNGLRGYCNTYN--H 62 Query: 83 KGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 G L G+ G + + + ++ ++L + Sbjct: 63 DGDFALIGRQGALCGNMNFSCGKAYFTEHAVAVKANSSSNTRFLYYMLD---KMNLGQYS 119 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + + P EQ + I Sbjct: 120 DQSAQPGLAVGKLIKLENMFPSKEEQDKVGGFFEELDNLI 159 >gi|326626207|gb|EGE32552.1| putative Type I restriction-modification system methyltransferase [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 191 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 17/121 (14%), Positives = 42/121 (34%), Gaps = 5/121 (4%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMR 338 + + +I+ + + + Q A + V I+ YL + Sbjct: 55 EKLKINLQTNDILLPLRGERIPAMMIVNQQSTLVTTTNQIAVIRVNSLLINPEYLYYFFN 114 Query: 339 SYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT---NVINVETARIDVLVE 394 S + + A+ G +L + + L + +P Q ++ + N + ++ L+E Sbjct: 115 SPEGDEKISALQGGGLVVNLSLKKLLTLEIPIPLRPVQDEVIGLRKIWNEQKKTLEDLIE 174 Query: 395 K 395 Sbjct: 175 N 175 >gi|195873657|ref|ZP_02698466.2| putative type I restriction-modification system specificity subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195632642|gb|EDX51096.1| putative type I restriction-modification system specificity subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 114 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 33/81 (40%), Gaps = 4/81 (4%) Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFD 377 A + V I+ YL + S + + A+ G +L + + L + +P Q + Sbjct: 18 AVIRVNSLLINPEYLYYFFNSPEGDEKISALQGGGLVVNLSLKKLLTLEIPIPSRPVQDE 77 Query: 378 IT---NVINVETARIDVLVEK 395 + + N + ++ L+E Sbjct: 78 VIGLRKIWNEQKKTLEDLIEN 98 >gi|307126720|ref|YP_003878751.1| type I restriction enzyme [Streptococcus pneumoniae 670-6B] gi|306483782|gb|ADM90651.1| type I restriction enzyme [Streptococcus pneumoniae 670-6B] Length = 202 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 46/130 (35%), Gaps = 17/130 (13%) Query: 7 YPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESG 59 YP YK IP+ W+ + G+T + +I ++ + D+ SG Sbjct: 25 YPIYK---------IPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISG 75 Query: 60 TGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDV 119 + + + + I KG +L + K I D + + + P Sbjct: 76 YVTNTRESISKLALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYAN 134 Query: 120 LPELLQGWLL 129 +++ +L+ Sbjct: 135 KENIIRDYLM 144 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 50/177 (28%), Gaps = 12/177 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGL 275 + +P+ W F +LV K + I +S ++ N + Sbjct: 27 IYKIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 86 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G ++ F L II+ + I YL Sbjct: 87 LALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMI 145 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I +++ ++ L Sbjct: 146 FLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIIFKVDLLFQKVSQL 200 >gi|227872202|ref|ZP_03990567.1| conserved hypothetical protein [Oribacterium sinus F0268] gi|227841953|gb|EEJ52218.1| conserved hypothetical protein [Oribacterium sinus F0268] Length = 135 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 13/84 (15%), Positives = 21/84 (25%), Gaps = 5/84 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQS--DTS 76 P W I + G K + ++V +G + Sbjct: 27 PNGWDKYKIGELCDVRDGTHDSPQYYSKGYPLVTSKNVSAGKIDLSDCSLICEDDYQKIN 86 Query: 77 TVSIFAKGQILYGKLGPYLRKAII 100 S G IL +G I+ Sbjct: 87 QRSKVDYGDILMPMIGTVGNPVIV 110 >gi|148927926|ref|ZP_01811333.1| hypothetical protein TM7_0589 [candidate division TM7 genomosp. GTL1] gi|147886729|gb|EDK72292.1| hypothetical protein TM7_0589 [candidate division TM7 genomosp. GTL1] Length = 100 Score = 47.9 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 28/64 (43%) Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 SL +K L + P+ +Q +I I + + I +++ + K R S +A A Sbjct: 35 ASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKELIVAHHRSKALRQSILAKA 94 Query: 415 VTGQ 418 G+ Sbjct: 95 FKGE 98 >gi|227892234|ref|ZP_04010039.1| possible type I restriction-modification system specificity determinant protein [Lactobacillus salivarius ATCC 11741] gi|227865956|gb|EEJ73377.1| possible type I restriction-modification system specificity determinant protein [Lactobacillus salivarius ATCC 11741] Length = 152 Score = 47.9 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 44/110 (40%), Gaps = 7/110 (6%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSY 340 + Y ++ R L N + ++ T + + + + YL + ++ Sbjct: 39 DNYLYDGESVLIPRKGSLNNIYYVVGKFWTVD----TIFWTIINKNIVLPKYLFYFLKRI 94 Query: 341 DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 D K+ SL + + + + VP I++Q DI + I+V +I+ Sbjct: 95 DFEKL---NVGSAVPSLTQKILNEIQIDVPSIEKQKDIIDKISVFERKIN 141 >gi|330814746|ref|YP_004362921.1| hypothetical protein bgla_4p3410 [Burkholderia gladioli BSR3] gi|327374738|gb|AEA66089.1| hypothetical protein bgla_4p3410 [Burkholderia gladioli BSR3] Length = 196 Score = 47.9 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 5/118 (4%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 + PG+IV + + E Y+ + YLAW + Sbjct: 63 LSPGDIVLPSRGDRYRAWRFDGTRTGEAVFPMGLYVIRSHAEVHPGYLAWYINQRSAQAQ 122 Query: 346 F-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL---VEKIEQS 399 + ++L + +L + VP + Q +I + + RI + + +IEQ Sbjct: 123 IALLLTGSNIKALTKAALLKLEIEVPSLDRQHEIAD-LEDTMQRIIAIRNRISEIEQQ 179 >gi|290967798|ref|ZP_06559351.1| hypothetical protein HMPREF0889_1471 [Megasphaera genomosp. type_1 str. 28L] gi|290782157|gb|EFD94732.1| hypothetical protein HMPREF0889_1471 [Megasphaera genomosp. type_1 str. 28L] Length = 284 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/293 (12%), Positives = 86/293 (29%), Gaps = 33/293 (11%) Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + E L + + + + G++ +W+ I +I + +PPLA Q Sbjct: 4 NCKNILNREWLYIFFNRPEFDRFVITNSWGSSTEFYNWENICDISIDLPPLAIQQKYVNV 63 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 A +GL D+ IE + Sbjct: 64 YNAMVAN-----------------------QRAYERGLEDLKLTCDAYIEDLRRRIPCEA 100 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 + P+ N N + ++ + Y++V P +I F Sbjct: 101 IGPYIERHDVRNGPNGTKNVMGVS------TTKEFREPTSKVNRNDLANYKVVKPRQISF 154 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSA--YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG- 350 E ++TS + + + YL + + Sbjct: 155 VQTTHNEKVFCNALNTTDEDIVVTSVNEVFSTNENKLLPEYLVMFFNRTEFDRYARYHSW 214 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 R++ ++D+ ++ + + ++ Q I + I + + EK++ I + Sbjct: 215 GSARETFTWDDLVKVQIPIADMEVQRSIVD-IYTVYKKRKAINEKLKAQIKAI 266 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 31/92 (33%), Gaps = 8/92 (8%) Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNV 381 + ++ +L + + G +E++ + + +PP+ Q NV Sbjct: 4 NCKNILNREWLYIFFNRPEFDRFVITNSWGSSTEFYNWENICDISIDLPPLAIQQKYVNV 63 Query: 382 INVETAR-------IDVLVEKIEQSIVLLKER 406 N A ++ L + I L+ R Sbjct: 64 YNAMVANQRAYERGLEDLKLTCDAYIEDLRRR 95 >gi|237649417|ref|ZP_04523669.1| type I restriction enzyme specificity protein [Streptococcus pneumoniae CCRI 1974] gi|237821510|ref|ZP_04597355.1| type I restriction enzyme specificity protein [Streptococcus pneumoniae CCRI 1974M2] gi|303253836|ref|ZP_07339964.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS455] gi|303270102|ref|ZP_07355808.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS458] gi|302599200|gb|EFL66218.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS455] gi|302640364|gb|EFL70805.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae BS458] Length = 184 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 9 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 68 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 69 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 126 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 51/177 (28%), Gaps = 12/177 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGL 275 + +P+ W F +LV K + I +S ++ N + Sbjct: 9 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 68 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G ++ F L II+ + I YL Sbjct: 69 LALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMI 127 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I + +++ ++ L Sbjct: 128 FLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLLFQKVSQL 182 >gi|239906097|ref|YP_002952836.1| hypothetical protein DMR_14590 [Desulfovibrio magneticus RS-1] gi|239795961|dbj|BAH74950.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 188 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 54/141 (38%), Gaps = 7/141 (4%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITS 318 G + + L P + ++ +++F F L Q ER + Sbjct: 42 SGIVSGGSGDIWVELDPSGKQKKYLIRNNDVLFSFRGTGETLGQAGLYIGQNEERVVCGQ 101 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFD 377 + ++P ID +L + MR + A G R ++ D++ + V + +E Sbjct: 102 SLCIIRPKAIDGLWLYYFMRRRAARESLLAKSCGNRLMTINLNDLRDVLVEMSSDEE--- 158 Query: 378 ITNVINVETARIDVLVEKIEQ 398 + I+ + RI + +I++ Sbjct: 159 -VDKIHAKHKRISSIYTEIQE 178 Score = 38.6 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 16/145 (11%), Positives = 38/145 (26%), Gaps = 19/145 (13%) Query: 27 VVPIKRFTKLNTG---RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD--------- 74 + + RT GKD +I +V + + + D Sbjct: 2 ETKLGEVADVIRCQLPRTRTGGKD-GWILCREVTQADFEPISGIVSGGSGDIWVELDPSG 60 Query: 75 TSTVSIFAKGQILYGKLGPY------LRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + +L+ G + +C +++PK + L ++ Sbjct: 61 KQKKYLIRNNDVLFSFRGTGETLGQAGLYIGQNEERVVCGQSLCIIRPKAIDGLWLYYFM 120 Query: 129 LSIDVTQRIEAICEGATMSHADWKG 153 + + A G + + Sbjct: 121 RRRAARESLLAKSCGNRLMTINLND 145 >gi|260664497|ref|ZP_05865349.1| restriction endonuclease S subunit [Lactobacillus jensenii SJ-7A-US] gi|260561562|gb|EEX27534.1| restriction endonuclease S subunit [Lactobacillus jensenii SJ-7A-US] Length = 256 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 65/185 (35%), Gaps = 11/185 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI-IQKLETRNMGLKPESYETYQIVDP 288 + W+ + ++ +KN + + I + + + Y +V Sbjct: 36 EPWKKVKLGEISEKITQKNNNSCSQFPVLTNSAEYGIVYQKDFFDKNIAINTDNYYVVHT 95 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 + V+ + ++ G+++ Y+ + + +L K Y Sbjct: 96 EDFVYNPRISKQAPYGPIRVNHLKTGVMSPLYYIFKIKDDFNIGFFEFLFIGNKWHKFMY 155 Query: 348 AMGSGL----RQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G R ++K + +LP+ +P I+EQ I +I+ L+ ++ + L Sbjct: 156 QNGDSGARSDRYAIKDKVFNKLPIYIPQKIEEQKLI----FEINHKINSLLYLQQRKLEL 211 Query: 403 LKERR 407 K+ + Sbjct: 212 EKQLK 216 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 59/189 (31%), Gaps = 6/189 (3%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK V + ++ T + + S + E G + +T + Sbjct: 38 WKKVKLGEISEKITQKNNNSCSQFPVLTNSA-EYGIVYQKDFFDKNIAINTDNYYVVHTE 96 Query: 85 QILYGKL----GPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS-IDVTQRIEA 139 +Y PY + G+ S + + + KD +L + + Sbjct: 97 DFVYNPRISKQAPYGPIRVNHLKTGVMSPLYYIFKIKDDFNIGFFEFLFIGNKWHKFMYQ 156 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + S + +++ ++ I +I++L+ + R +EL K+ K Sbjct: 157 NGDSGARSDRYAIKDKVFNKLPIYIPQKIEEQKLIFEINHKINSLLYLQQRKLELEKQLK 216 Query: 200 QALVSYIVT 208 L + Sbjct: 217 FFLFQNAIP 225 >gi|168492609|ref|ZP_02716752.1| type I restriction enzyme [Streptococcus pneumoniae CDC0288-04] gi|221231342|ref|YP_002510494.1| type I restriction-modification system S protein [Streptococcus pneumoniae ATCC 700669] gi|298229902|ref|ZP_06963583.1| putative type I restriction-modification system S protein [Streptococcus pneumoniae str. Canada MDR_19F] gi|183573242|gb|EDT93770.1| type I restriction enzyme [Streptococcus pneumoniae CDC0288-04] gi|220673802|emb|CAR68304.1| putative type I restriction-modification system S protein [Streptococcus pneumoniae ATCC 700669] Length = 202 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 27 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 86 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 87 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 144 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 51/177 (28%), Gaps = 12/177 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGL 275 + +P+ W F +LV K + I +S ++ N + Sbjct: 27 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 86 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G ++ F L II+ + I YL Sbjct: 87 LALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMI 145 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I + +++ ++ L Sbjct: 146 FLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLLFQKVSQL 200 >gi|205355249|ref|YP_002229050.1| type I restriction-modification system methyltransferase [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205275030|emb|CAR40118.1| putative Type I restriction-modification system methyltransferase [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] Length = 192 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 17/121 (14%), Positives = 42/121 (34%), Gaps = 5/121 (4%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYMAVKPHGIDSTYLAWLMR 338 + + +I+ + + + Q A + V I+ YL + Sbjct: 56 EKLKINLQTNDILLPLRGERIPAMMIVNQQSTLVTTTNQIAVIRVNSLLINPEYLYYFFN 115 Query: 339 SYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT---NVINVETARIDVLVE 394 S + + A+ G +L + + L + +P Q ++ + N + ++ L+E Sbjct: 116 SPEGDEKISALQGGGLVVNLSLKKLLTLEIPIPLRPVQDEVIGLRKIWNEQKKTLEDLIE 175 Query: 395 K 395 Sbjct: 176 N 176 >gi|183603404|ref|ZP_02964380.1| type I restriction enzyme [Streptococcus pneumoniae SP195] gi|183571288|gb|EDT91816.1| type I restriction enzyme [Streptococcus pneumoniae SP195] gi|332204530|gb|EGJ18595.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA47901] Length = 240 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 65 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 124 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 125 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 182 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 51/177 (28%), Gaps = 12/177 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGL 275 + +P+ W F +LV K + I +S ++ N + Sbjct: 65 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 124 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G ++ F L II+ + I YL Sbjct: 125 LALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMI 183 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I + +++ ++ L Sbjct: 184 FLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLLFQKVSQL 238 >gi|67922393|ref|ZP_00515904.1| hypothetical protein CwatDRAFT_3981 [Crocosphaera watsonii WH 8501] gi|67855737|gb|EAM50985.1| hypothetical protein CwatDRAFT_3981 [Crocosphaera watsonii WH 8501] Length = 219 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 57/180 (31%), Gaps = 13/180 (7%) Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 IE P H++ +++ +S+S I+ L S E Sbjct: 17 IESGVWNPYHYKENKSNDTLSDFANIKKIKNNKQDISISEFAPIEYKNIPKGELLTFSLE 76 Query: 282 -------TYQIVDPGEIVFRFIDLQNDKRSLRSAQVME------RGIITSAYMAVKPHGI 328 Y +V ++F + + I S ++ + P Sbjct: 77 DNSLEEGRYSLVGEQVLLFGTMRAYLGNVLVTPKANWIGKRSPLFYPINSEFVQIIPKDK 136 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + ++S G R + ++++++P+ VP ++E+ I N + + Sbjct: 137 LLYFWWGYLKSSLFLNQIPTGSGGTRPRVSVDNLEKIPISVPILREREKINNSLIEIAEQ 196 >gi|253991441|ref|YP_003042797.1| type I restriction enzyme, modification subunit [Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949] gi|253782891|emb|CAQ86056.1| type I restriction enzyme, modification subunit [Photorhabdus asymbiotica] Length = 721 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 12/87 (13%), Positives = 30/87 (34%), Gaps = 8/87 (9%) Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 E + A+ P I+ +L + + K+ + L +++L Sbjct: 598 YSEDEFWAADDVHFAITPEYINDRFLFHFLLTQK-NKISGQVRRASIPRLSKSVLEKLEF 656 Query: 368 LVP-------PIKEQFDITNVINVETA 387 +P + Q +I +++ T+ Sbjct: 657 PIPCPDNPEKSLAIQSEIVRILDKFTS 683 >gi|327390915|gb|EGE89255.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA04375] Length = 201 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 26 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 85 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 86 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 143 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 51/177 (28%), Gaps = 12/177 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGL 275 + +P+ W F +LV K + I +S ++ N + Sbjct: 26 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 85 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G ++ F L II+ + I YL Sbjct: 86 LALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMI 144 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I + +++ ++ L Sbjct: 145 FLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLLFQKVSQL 199 >gi|329919683|ref|ZP_08276661.1| hypothetical protein HMPREF9210_0205 [Lactobacillus iners SPIN 1401G] gi|328937335|gb|EGG33759.1| hypothetical protein HMPREF9210_0205 [Lactobacillus iners SPIN 1401G] Length = 160 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 52/156 (33%), Gaps = 5/156 (3%) Query: 29 PIKRFTKLNTGR-TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + + + + YI +++ G Q F K +L Sbjct: 4 KLSDICEYAKEKIKISALDENTYISTKNMLPNKGGIKQATSLPVQE---NTQAFMKNDVL 60 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGATM 146 + PY +K A F+G CS LV + K + ++L+ D A +G M Sbjct: 61 VSNIRPYFKKIWFATFNGGCSNDVLVFRAKKGINSRFLHYVLANDSFFNYSMATSKGTKM 120 Query: 147 SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 D K I +P Q I + +I+ Sbjct: 121 PRGDKKAIMAYEVPKLSYRYQGKIAGILEIIDDKIE 156 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 18/159 (11%), Positives = 41/159 (25%), Gaps = 4/159 (2%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + K +S N++ E Q +++ Sbjct: 1 MKYKLSDICEYAKEKIKISALDENTYISTKNMLPNKGGIKQATSLPVQENTQAFMKNDVL 60 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 I K + G + GI+S +L +++ + A G Sbjct: 61 VSNIRPYFKKIWFATFNG---GCSNDVLVFRAKKGINSRFLHYVLANDSFFNYSMATSKG 117 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + + V + Q I ++ + +I+ Sbjct: 118 TKMPRGDKKAIMAYEVPKLSYRYQGKIAGILEIIDDKIE 156 >gi|291529888|emb|CBK95473.1| Type I restriction modification DNA specificity domain [Eubacterium siraeum 70/3] Length = 240 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/224 (13%), Positives = 65/224 (29%), Gaps = 21/224 (9%) Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS- 258 QA + + G ++ + K+ L++ I Sbjct: 11 QAWFTSWFVDYEPFPHSYDEDGKPLPPDDWENGILDSCIDFYNGYAFKSDDLLDEPIPES 70 Query: 259 ---LSYGNIIQKLETRNMGLKPESYETYQ------IVDPGEIVFRFIDLQNDKRSLRSA- 308 GNI + G K + I+ G+I+ D++ + L Sbjct: 71 FDVFKMGNIKKGGGLNYEGTKSWIEREFCKGLERFILIRGDILMAMTDMKENVALLGHTA 130 Query: 309 --QVMERGIITSAYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDV 362 + ++ I+ ++P+G + L + + G++ +L ED+ Sbjct: 131 LMDIDDKYIVNQRVGLLRPNGFMGISPYQVYLLTNNATFLRELRRHAHIGVQVNLSKEDI 190 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 V+ P I + + + I+ L E Sbjct: 191 VNSRVVYAP----KKINQAFATKVKPLFDCISNNNAEILKLTEI 230 Score = 37.1 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 21/184 (11%), Positives = 48/184 (26%), Gaps = 21/184 (11%) Query: 22 PKHWKVVPIKRFTKLNTGRTSESG--------KDIIYIGLEDVESGTGKYLPKDGNSRQS 73 P W+ + G +S + + +++ G G + + Sbjct: 37 PDDWENGILDSCIDFYNGYAFKSDDLLDEPIPESFDVFKMGNIKKGGGLNYEGTKSWIER 96 Query: 74 DTST---VSIFAKGQILYGKLGPYLRKAII-------ADFDGICSTQFLVLQPKDVL--- 120 + I +G IL A++ D I + + +L+P + Sbjct: 97 EFCKGLERFILIRGDILMAMTDMKENVALLGHTALMDIDDKYIVNQRVGLLRPNGFMGIS 156 Query: 121 PELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 P + + + + + + I N + P K+ Sbjct: 157 PYQVYLLTNNATFLRELRRHAHIGVQVNLSKEDIVNSRVVYAPKKINQAFATKVKPLFDC 216 Query: 181 IDTL 184 I Sbjct: 217 ISNN 220 >gi|212716215|ref|ZP_03324343.1| hypothetical protein BIFCAT_01131 [Bifidobacterium catenulatum DSM 16992] gi|212660860|gb|EEB21435.1| hypothetical protein BIFCAT_01131 [Bifidobacterium catenulatum DSM 16992] Length = 168 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 49/150 (32%), Gaps = 6/150 (4%) Query: 45 GKDIIYIGLEDVESGTGKYLPKDGNS--RQSDTSTVSIFAKGQILYGKLGPYLRKAIIA- 101 + Y+ + D++ T ++ D +S + + +G IL+ + G + K + Sbjct: 19 DGEKKYLRITDIDDRTREFRTDDLSSPDINNPIDDKYLLKEGDILFARTGASVGKTYLYR 78 Query: 102 ---DFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIP 158 + + L+ Q + + + + + ++ Sbjct: 79 ASDGKTYYAGFLIRAHVSDEADAGFIFQSTLTERYKQFVLLTSQRSGQPGINAQEYADLL 138 Query: 159 MPIPPLAEQVLIREKIIAETVRIDTLITER 188 +P+P L+EQ I + I + Sbjct: 139 LPLPSLSEQRRIGKFFSRLDSLITLHQRKY 168 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 16/125 (12%), Positives = 39/125 (31%), Gaps = 5/125 (4%) Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + ++ G+I+F K L A + A D+ ++ Sbjct: 47 INNPIDDKYLLKEGDILFARTGASVGKTYLYRASDGKTYYAGFLIRAHVSDEADAGFIFQ 106 Query: 336 LMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 + + + + ++ L + +P + EQ I +R+D L+ Sbjct: 107 STLTERYKQFVLLTSQRSGQPGINAQEYADLLLPLPSLSEQRRIGKF----FSRLDSLIT 162 Query: 395 KIEQS 399 ++ Sbjct: 163 LHQRK 167 >gi|84489294|ref|YP_447526.1| hypothetical protein Msp_0483 [Methanosphaera stadtmanae DSM 3091] gi|84372613|gb|ABC56883.1| hypothetical protein Msp_0483 [Methanosphaera stadtmanae DSM 3091] Length = 162 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 16/114 (14%), Positives = 41/114 (35%), Gaps = 6/114 (5%) Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + S+ + +++ +K + Y + + S + + ++ L Sbjct: 52 GEDGSIIPTLASGKCWVSNHAHVLKNKKNINLYFLYNILSKIHFEKYN--TGTIQPKLNK 109 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + K + + + KEQ I + I ++KI++ I LK + + Sbjct: 110 KTAKNIKIKITSKKEQEKIVDF----MLSIGTKIKKIQKQIKFLKTFKKGLLQK 159 >gi|332655469|ref|ZP_08421206.1| conserved hypothetical protein [Ruminococcaceae bacterium D16] gi|332515604|gb|EGJ45217.1| conserved hypothetical protein [Ruminococcaceae bacterium D16] Length = 174 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/121 (17%), Positives = 45/121 (37%), Gaps = 5/121 (4%) Query: 274 GLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQVMERGIITSAYMAV---KPHGID 329 + Y+++ G + + D+R + + I++ AY ++ Sbjct: 42 NVIGTDLSRYKLISKGLFACNPMHVGRDERLPIALYEKDNAAIVSPAYFMFEIIDRDVLN 101 Query: 330 STYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 YL R + + + M +R + ++D+ R+ + VP Q +I T R Sbjct: 102 EEYLMMWFRRPEFDRECWFMTDGSVRGGITWDDLCRIKLPVPSYARQCEIVESYRAITNR 161 Query: 389 I 389 I Sbjct: 162 I 162 >gi|298256071|ref|ZP_06979657.1| type I restriction enzyme specificity protein [Streptococcus pneumoniae str. Canada MDR_19A] Length = 191 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 16 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 75 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 76 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 133 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 51/177 (28%), Gaps = 12/177 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGL 275 + +P+ W F +LV K + I +S ++ N + Sbjct: 16 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 75 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G ++ F L II+ + I YL Sbjct: 76 LALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMI 134 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I + +++ ++ L Sbjct: 135 FLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLLFQKVSQL 189 >gi|282601268|ref|ZP_05981251.2| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] gi|282569611|gb|EFB75146.1| conserved hypothetical protein [Subdoligranulum variabile DSM 15176] Length = 196 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 42/139 (30%), Gaps = 21/139 (15%) Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 + +T + PG+I+ R + S + + + + YL WL Sbjct: 53 SDPLKTEYLTQPGDIIVRLTTPYT-AALIDSTTTGLVVSSNFMIIRTESNTLLPDYLFWL 111 Query: 337 MRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + + + Y + S +K + VP I +Q I Sbjct: 112 LNTPAVKRRIYTSTTSNVLSAVKASFFTQFQFHVPSIAQQERIG---------------- 155 Query: 396 IEQSIVLLKERRSSFIAAA 414 I L R ++ + Sbjct: 156 ---QIHKLARRETALLHQL 171 >gi|90962729|ref|YP_536644.1| hypothetical protein LSL_1758b [Lactobacillus salivarius UCC118] gi|90821923|gb|ABE00561.1| Hypothetical protein LSL_1758b [Lactobacillus salivarius UCC118] Length = 185 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 56/170 (32%), Gaps = 3/170 (1%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 LV + N+ ++ +L + + + I G+IV Sbjct: 1 MKLNELVKIESGINSVRVKDQNHTLYTIEDVNYDLGHGEDYQHDKASGKSITARGDIVIN 60 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SG 352 + R+A M I + + +D YL +L+ + + A Sbjct: 61 TVSNLASVVHSRNAGKMLNQIF-LRLNILDENTLDPWYLCYLLNKSEYIRYQEAAIMDGS 119 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + L +++ L + +P + +Q + + + +EK E L Sbjct: 120 VIRKLTKANLEDLEINLPEVVDQKKMGKAYKEIMKKYTLAMEKAELERDL 169 >gi|255322119|ref|ZP_05363266.1| conserved hypothetical protein [Campylobacter showae RM3277] gi|255300817|gb|EET80087.1| conserved hypothetical protein [Campylobacter showae RM3277] Length = 195 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 17/169 (10%), Positives = 45/169 (26%), Gaps = 4/169 (2%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 E + + + L + + + E V G+I+ + Sbjct: 16 LSRKKAEAHSPSEHSYKIVSLKSFAEDTYYDDAFADEFISSEQINEDYKVSRGDILL-RL 74 Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQ 355 N + + V D ++A + S + K + Sbjct: 75 REPNFAVYIDKDYSDLVYTSLMVRIRVSSDKFDPHFVAHYLNSSAVKKALAPDVSGTTIA 134 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVIN--VETARIDVLVEKIEQSIVL 402 + + + + ++ Q I +N + + I ++ +Q Sbjct: 135 MISVASINNIKIPTLNLQTQNKIVKYLNLVRQESEILQILMAAKQKYNK 183 >gi|168333674|ref|ZP_02691929.1| type I restriction-modification system, M subunit, putative [Epulopiscium sp. 'N.t. morphotype B'] Length = 604 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 46/144 (31%), Gaps = 3/144 (2%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 I + + V G ++ K + I + Sbjct: 449 GEINMATLTTYEVDNRARLDMYRVQEGNLIISNRGTL--KICIVPKHKGNLLISQNFIGL 506 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFY-AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 G + Y+ ++S + Q + D+K +P + P + Q +I + Sbjct: 507 RLHKGYNPEYIKQFLQSPLGEYLINTKRAGSASQIINIRDLKEIPFIEPLTQNQTEIIDS 566 Query: 382 INVETARIDVLVEKIEQSIVLLKE 405 N + +I +EK+E ++ ++ Sbjct: 567 YNTKQQQIVTKIEKLELELLTMRN 590 >gi|167644295|ref|YP_001681958.1| hypothetical protein Caul_0323 [Caulobacter sp. K31] gi|167346725|gb|ABZ69460.1| hypothetical protein Caul_0323 [Caulobacter sp. K31] Length = 493 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 56/424 (13%), Positives = 127/424 (29%), Gaps = 48/424 (11%) Query: 29 PIKRFTKL-----NTGRTSESGKDIIYIGLE---DVESGTGKYLPKDGNSRQSDTSTVSI 80 + ++ G T ++ D+ K+L D +TS++ Sbjct: 64 RLGDVARVWQPSRLKGITVSRDFGTPFLAATQAFDLRPIPRKFLSLDRT----ETSSIRF 119 Query: 81 FAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPE-LLQGWLLSIDVTQRI 137 G IL G R + + S L ++PK + + G+L S Q + Sbjct: 120 AEPGTILVTCSGTVGRATLATTALAKTLISHDLLRVEPKADQSQGWVYGYLRSEKARQMM 179 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKE 197 + G + H + + ++PMP P A Q + + + + E Sbjct: 180 SSAQYGHIIKHLEPGHLQSLPMPRPRKALQEKFDAHFREILTARNRAVELFQQAEAMFGE 239 Query: 198 KK--------------------QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 + + + NP ++ + + G F Sbjct: 240 QVGVPAELDVGEQGFSVPASSLMSGRRRLEGIYHNPTIRKLQTHFKERGFATASLLSSGF 299 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 A + ++ ++ S I + + V G ++ Sbjct: 300 DAWLPGRFKRIRAEEGLQLVGSSDLFEINPDLPKRIADIDFGDRNSGRVLRGWLLLARSG 359 Query: 298 -LQNDKRSLRSAQVMERG-IITSAYMAVKPHGIDSTYLAWLMRSYDL----CKVFYAMGS 351 +L A G I++ + + P+ + ++ + + ++ Sbjct: 360 QTYGVNGTLAIANAFHEGKIVSDHVIRIAPNDDCNARPGYIYTALSHPQLGRPMVKSLAY 419 Query: 352 G-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINV---ETARIDVLVEKIEQSIVLLKERR 407 G + D+ LP++ KE+ I + AR D++ + + + ER Sbjct: 420 GSSIPEIDVSDIHNLPIVRLGKKEEDAIAELAEEGADLFARADIIETTMAREVD---ERI 476 Query: 408 SSFI 411 ++ + Sbjct: 477 AALL 480 >gi|254994440|ref|ZP_05276630.1| specificity determinant HsdS [Listeria monocytogenes FSL J2-064] Length = 165 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 17/151 (11%), Positives = 47/151 (31%), Gaps = 9/151 (5%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 N L ++ G+I F + K A + GI++ + + Sbjct: 15 YFVEPNKVLSNNIDTRTYVMRKGDIAFEGHSNTDFKFGRFVANDIGPGIVSELFPVYRHK 74 Query: 327 -GIDSTYLAWLMRSYDLCKVFYAMGSGLRQS----LKFEDVKRLPVLVPPIKEQFDITNV 381 D+ Y ++ + Y+ + L + + + +EQ I ++ Sbjct: 75 TNYDNNYWKNAIQLEHIMAPIYSKSITSSGNSSNKLDSKHFLNQKIYIADFEEQEKIGSI 134 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++D + + + + +++ Sbjct: 135 ----FKQLDNTIILYQNKLNKFDILKKAYLQ 161 >gi|329939285|ref|ZP_08288621.1| type I restriction modification system protein [Streptomyces griseoaurantiacus M045] gi|329301514|gb|EGG45408.1| type I restriction modification system protein [Streptomyces griseoaurantiacus M045] Length = 793 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 63/201 (31%), Gaps = 17/201 (8%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSES--------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P W+ VP+ + G + D+ + + + G + Sbjct: 587 LPHDWRRVPLGELVDIMAGPSYTRLPAEVRSVAGDLRVVMPKHLREGRIDDRDMEKVGVD 646 Query: 73 SDTS-TVSIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVL------PEL 123 + G IL + G + A++ + ST L L+ + P Sbjct: 647 VARALARFRLRPGDILCVRSGAQMPPALVEKAQDGWLFSTNLLRLRALETDGVPLVLPGY 706 Query: 124 LQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 L +L + ++ G + + +P+P+PPLA Q I + A +I Sbjct: 707 LLAYLSLPETVHWLKEYARGTAVPSLSAATLALLPVPLPPLAHQRRISAVLDAVNAQITA 766 Query: 184 LITERIRFIELLKEKKQALVS 204 + L++ Sbjct: 767 HRELIQAATQHRSTLAAHLLT 787 >gi|238898673|ref|YP_002924354.1| putative restriction endonuclease, N6_Mtase domain protein [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|229466432|gb|ACQ68206.1| putative restriction endonuclease, N6_Mtase domain protein [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 872 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 43/124 (34%), Gaps = 1/124 (0%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y V G++V I ++ + + + + G + L ++RS Sbjct: 726 YNRLYRVSEGDVVISNIAASYGSIAVVPEDLGGCVVSSEYTILRAKPGFEPKMLWAILRS 785 Query: 340 YDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + + +G R +K++ +K L + P + + + A V +Q Sbjct: 786 PVVLSEILLVATGANRTRVKWDAMKSLSIPYPKETTEKEFVESLLKLEALEKETVSSKKQ 845 Query: 399 SIVL 402 I L Sbjct: 846 IIDL 849 >gi|149012615|ref|ZP_01833612.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] gi|147763420|gb|EDK70357.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] Length = 202 Score = 47.5 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 27 IYEIPEAWRYIKFASLVNFRIGKTPPRSEAIFWGTEIPWVSISDMPISGYVTNTRESISK 86 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 87 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 144 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 51/177 (28%), Gaps = 12/177 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGL 275 + +P+ W F +LV K + I +S ++ N + Sbjct: 27 IYEIPEAWRYIKFASLVNFRIGKTPPRSEAIFWGTEIPWVSISDMPISGYVTNTRESISK 86 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G ++ F L II+ + I YL Sbjct: 87 LALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMI 145 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I + +++ ++ L Sbjct: 146 FLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLLFQKVSQL 200 >gi|320546834|ref|ZP_08041139.1| type I restriction-modification system specificty subunit [Streptococcus equinus ATCC 9812] gi|320448498|gb|EFW89236.1| type I restriction-modification system specificty subunit [Streptococcus equinus ATCC 9812] Length = 198 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 57/149 (38%), Gaps = 6/149 (4%) Query: 30 IKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 +K T+ G+ +I + L D+ Y D+ + +G I Sbjct: 18 LKEVTEHFKGKAVSKLSSEGNISVVNLSDMLEIGINYDGLKKIEADEDSVQRYLLQEGDI 77 Query: 87 LYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEAICEG 143 L G + AI D+ I S VL+P + ++ +L S + +E G Sbjct: 78 LIASKGTVKKTAIFHEQDYPVIASANITVLRPIADIAGGYIKLFLDSKLGQELLEETNTG 137 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 + + + + I +I +P P+ +Q + + Sbjct: 138 KNVMNLNTQKIVSIEIPKLPVLKQAYLLQ 166 >gi|283956924|ref|ZP_06374397.1| hypothetical protein C1336_000320094 [Campylobacter jejuni subsp. jejuni 1336] gi|283791650|gb|EFC30446.1| hypothetical protein C1336_000320094 [Campylobacter jejuni subsp. jejuni 1336] Length = 117 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/111 (14%), Positives = 38/111 (34%), Gaps = 7/111 (6%) Query: 25 WKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+V + ++++G T K I ++ ++D++ + S Sbjct: 4 WEVKKLGDIAEISSGETPSRNKKEYWENGIIPWVKIKDIKENFISTTKEFITENGLKNSL 63 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 +F KG + Y L + + +Q + K+++ E Sbjct: 64 AKLFKKGTLFYSILAICVLIIFVTFIMSKYYSQQAIESYKEIMMENDICQN 114 >gi|237721954|ref|ZP_04552435.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|229448823|gb|EEO54614.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Bacteroides sp. 2_2_4] Length = 191 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 67/169 (39%), Gaps = 8/169 (4%) Query: 26 KVVPIKRFTKLNTGR--TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 K V +K + +G ++S ++ Y+ ++DV S + + + Sbjct: 7 KKVTLKDIAIMQSGIYMKTDSQGEVRYLQVKDVNSENKLDYTQIATVINTGINDKHWLKN 66 Query: 84 GQILYGKLGPYLRKAII--ADFDGICSTQFLVLQP--KDVLPELLQGWLLSIDVTQRIEA 139 G +L+ G + I S+ F++++P ++LPE L +L + + ++ Sbjct: 67 GDLLFAAKGGSNYCIQYEGTERSTIASSSFIIIRPVISNILPEFLCCFLNTSSILGMLKN 126 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK--IIAETVRIDTLIT 186 G + +G I + IP + Q L+ E + E I + I Sbjct: 127 AAVGTGIQVIPQSVMGEIQLDIPSIEVQRLVVEMDRLRKEGECIRSEID 175 Score = 45.9 bits (107), Expect = 0.014, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 56/176 (31%), Gaps = 9/176 (5%) Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETR-NMGLKPESYETYQIVDPGEIVFRFIDL 298 + + + K E L + N KL+ + + G+++F Sbjct: 17 MQSGIYMKTDSQGEVRYLQVKDVNSENKLDYTQIATVINTGINDKHWLKNGDLLFAAKGG 76 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSL 357 N + + + I +L + + + + G Q + Sbjct: 77 SNYCIQYEGTERSTIASSSFIIIRPVISNILPEFLCCFLNTSSILGMLKNAAVGTGIQVI 136 Query: 358 KFEDVKRLPVLVPPIKEQFDIT--NVINVE----TARIDVLVEKIEQSIVLLKERR 407 + + + +P I+ Q + + + E + ID+L + ++ + L+ + Sbjct: 137 PQSVMGEIQLDIPSIEVQRLVVEMDRLRKEGECIRSEIDILKQSLQDQL-LMDSLK 191 >gi|309807507|ref|ZP_07701465.1| conserved hypothetical protein [Lactobacillus iners LactinV 01V1-a] gi|308169248|gb|EFO71308.1| conserved hypothetical protein [Lactobacillus iners LactinV 01V1-a] Length = 198 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 30/92 (32%), Gaps = 4/92 (4%) Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 S Y + + + + D+K++ VLVP + + Sbjct: 106 SPYYEFTNQILHRIDYSSINRGSTQPLITQGDMKKVVVLVPDEDT----LAIFEKFAGSL 161 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 E V L R + + ++G++D+ Sbjct: 162 MAKWEANNNENVKLASLRDTLLPKLMSGELDV 193 >gi|154252794|ref|YP_001413618.1| hypothetical protein Plav_2349 [Parvibaculum lavamentivorans DS-1] gi|154156744|gb|ABS63961.1| conserved hypothetical protein [Parvibaculum lavamentivorans DS-1] Length = 201 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 46/130 (35%), Gaps = 6/130 (4%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM-AVKPHGIDSTYLAWLM 337 +V G++VFR +N +L + + + K + YLAW + Sbjct: 53 DVAERYMVSAGDVVFRSRGDRNTAAALDGCFIEPALALQPLLILRPKRDAVLPEYLAWAI 112 Query: 338 RSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL---V 393 + F G + + L + VP ++ Q I ++ R + L + Sbjct: 113 NQPSAQRHFDEGARGTNIRMVPKSCLDDLDIDVPDLEAQRRIVA-VDALAERENQLALVL 171 Query: 394 EKIEQSIVLL 403 + ++ + L Sbjct: 172 AEKKRQLSRL 181 Score = 38.6 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 48/156 (30%), Gaps = 11/156 (7%) Query: 29 PIKRFTKLNTGRT------SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + + G T + + I L DV + + + D + + + Sbjct: 2 RLTEVCSIFPGYTARARLEPAGDRGMAAIQLRDVSADGLAHPDELIRVDLGDVAERYMVS 61 Query: 83 KGQILYGKLGPYLRKA---IIADFDGICSTQFLVLQPKDV--LPELLQGWLLSIDVTQRI 137 G +++ G A + L+L+PK LPE L + + Sbjct: 62 AGDVVFRSRGDRNTAAALDGCFIEPALALQPLLILRPKRDAVLPEYLAWAINQPSAQRHF 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + G + + ++ + +P L Q I Sbjct: 122 DEGARGTNIRMVPKSCLDDLDIDVPDLEAQRRIVAV 157 >gi|13358009|ref|NP_078283.1| type I restriction enzyme S protein (fragment) [Ureaplasma parvum serovar 3 str. ATCC 700970] gi|11357072|pir||D82889 type I restriction enzyme S protein, truncated homolog UU446 [imported] - Ureaplasma urealyticum gi|6899438|gb|AAF30858.1|AE002141_4 type I restriction enzyme S protein (fragment) [Ureaplasma parvum serovar 3 str. ATCC 700970] Length = 149 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 34/112 (30%), Gaps = 5/112 (4%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP +W V + + + +G + +S K I I + D +S + Sbjct: 32 IPNNWIWVKLNNISNVISGYSFKSSKYTSSGIRIIRISDFDSKEVDNNEPIFYEYNEKFN 91 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + I I+ G + K II V + + + Sbjct: 92 SYKI-ENNDIILVMTGGTVGKNIIIKKANDYYLNQRVARIRTFNVNYNYIYY 142 >gi|322387159|ref|ZP_08060769.1| type I restriction enzyme EcoKI specificity protein [Streptococcus infantis ATCC 700779] gi|321141688|gb|EFX37183.1| type I restriction enzyme EcoKI specificity protein [Streptococcus infantis ATCC 700779] Length = 210 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 62/183 (33%), Gaps = 17/183 (9%) Query: 19 GAIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 G IP +W V+ IK +NTG + + K + I +++ L D Sbjct: 26 GNIPMNWVVIKIKDIFSINTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDT 85 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVLQPKDVLPELLQGW 127 S+ ++ K L + L D+DG+ + F+ E+ + Sbjct: 86 QFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEITSKF 145 Query: 128 LLSI------DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 LL + G + + + + +P+ P Q LI +K+ ++ Sbjct: 146 LLFNLSSPLFYKQLKSITKLSGQALYNIPKTTLSELLIPLAPFEVQELITQKVEKLFEKV 205 Query: 182 DTL 184 Sbjct: 206 SQF 208 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 23/204 (11%), Positives = 60/204 (29%), Gaps = 18/204 (8%) Query: 202 LVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 L+ +T G + + G +P +W V + + + K + +I + Sbjct: 2 LIGKKITGGQIDYLLFFCDYGSYYGNIPMNWVVIKIKDIFSINTGLSYKKGDLSINN-KG 60 Query: 262 GNIIQKLETRNMGLKPESYETYQ----------IVDPGEIVFRFIDLQNDKRSLRSAQVM 311 II+ + + + Y + +++ Sbjct: 61 VRIIRGGNIKPLEFSLLDNDYYIDTQFISSEQVYLKHNQLITPVSTSLEHIGKFARIDKD 120 Query: 312 ERGIITSAYMA----VKPHGIDSTYLAWLMRSYDLCKVFY---AMGSGLRQSLKFEDVKR 364 G++ ++ + I S +L + + S K + ++ + Sbjct: 121 YDGVVAGGFIFQLTPFESSEITSKFLLFNLSSPLFYKQLKSITKLSGQALYNIPKTTLSE 180 Query: 365 LPVLVPPIKEQFDITNVINVETAR 388 L + + P + Q IT + + Sbjct: 181 LLIPLAPFEVQELITQKVEKLFEK 204 >gi|319744116|gb|EFV96489.1| type I restriction-modification system [Streptococcus agalactiae ATCC 13813] Length = 145 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 28/95 (29%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 L K SL + + T + + +L + + Sbjct: 47 KSVLIPRKGSLGNLFFANKPFWTVDTLFYTEIDENILMPEFLFYKLKMFNLASMNVGSAV 106 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 SL + L + +P + Q I N++ RI Sbjct: 107 PSLTTAILNALELDIPSFEVQSQIVNILKAFDERI 141 Score = 40.5 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 20/153 (13%), Positives = 45/153 (29%), Gaps = 17/153 (11%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 + K+ G+ + D +P G+ +++ K +L Sbjct: 6 KLGEVAKIRYGKDHKKLDD--------------GNIPVYGSGGIMRYVDTALYDKKSVLI 51 Query: 89 GKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSH 148 + G T F +++L + L + + ++ G+ + Sbjct: 52 PRKGSLGNLFFANKPFWTVDTLFYTEIDENILMPEFLFYKLKMF---NLASMNVGSAVPS 108 Query: 149 ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + IP Q I + A RI Sbjct: 109 LTTAILNALELDIPSFEVQSQIVNILKAFDERI 141 >gi|332877052|ref|ZP_08444803.1| hypothetical protein HMPREF9074_00529 [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684942|gb|EGJ57788.1| hypothetical protein HMPREF9074_00529 [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 93 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 39/87 (44%), Gaps = 7/87 (8%) Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 ID +L + M+S K + + G ++ D+ + +P + Q +I+N+++V Sbjct: 7 NIDVEFLYYFMQSSYFQKEVERIVTEGTMKTAYLRDINHIKCPIPDLDRQKEISNLLSVL 66 Query: 386 TARIDVLVEKIEQS-IVLLKERRSSFI 411 L E +E+ + + ++ + Sbjct: 67 -----SLKEDVEKQLLQKYQIQKQYLL 88 >gi|269797183|ref|YP_003311083.1| N-6 DNA methylase [Veillonella parvula DSM 2008] gi|269093812|gb|ACZ23803.1| N-6 DNA methylase [Veillonella parvula DSM 2008] Length = 577 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 43/132 (32%), Gaps = 8/132 (6%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW---L 336 + ++D + + + P + L + Sbjct: 440 KRKFTLLDNDTWLLGRTSPFRSNMLYVEGNDKLIANGNQFSITILPKYKNQYLLPYLALY 499 Query: 337 MRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 S + G L +SL +D+K L + I+ Q D+ N I I+ ++ Sbjct: 500 FNSKAGREQIERFAVGQLIKSLSLKDLKTLQIPRVSIERQRDVVNRI----RMIETEIKT 555 Query: 396 IEQSIVLLKERR 407 +++ + L +++ Sbjct: 556 VKEQLKTLNQQK 567 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 49/172 (28%), Gaps = 17/172 (9%) Query: 28 VPIKRFTKLNTGRTSESGK----------DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 VP+ +N G S + + Y+ +D + Y + Sbjct: 383 VPLGEICNINRGLVISSKELDDFVTDEDTGVRYLYTKDADGDAVDYTQSPFIDVEKLKRK 442 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFD-------GICSTQFLVLQPKDVLPELLQGWLLS 130 ++ L G+ P+ + + + S L L L + S Sbjct: 443 FTLLDNDTWLLGRTSPFRSNMLYVEGNDKLIANGNQFSITILPKYKNQYLLPYLALYFNS 502 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 ++IE G + K + + +P + Q + +I I Sbjct: 503 KAGREQIERFAVGQLIKSLSLKDLKTLQIPRVSIERQRDVVNRIRMIETEIK 554 >gi|88860310|ref|ZP_01134948.1| type I restriction-modification system, M subunit, putative [Pseudoalteromonas tunicata D2] gi|88817508|gb|EAR27325.1| type I restriction-modification system, M subunit, putative [Pseudoalteromonas tunicata D2] Length = 204 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 12/122 (9%), Positives = 37/122 (30%), Gaps = 5/122 (4%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 I + + +K ++ G++V + + T+ ++ Sbjct: 48 GYISTESLQRIEVKEGKKIDKFLLKSGDVVLLARGQSMKCCIVTEEVAKHNLVATANFIV 107 Query: 323 VKPHGIDSTYL-AWLMRSYDLCKVFY----AMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 ++ S K + + + +S+ +K++ + P ++ Q Sbjct: 108 IRIKSGLKAEFIVSYFNSPLGKKALNHSSVSSSTNVIKSISLSGLKKINIKFPTVEVQNQ 167 Query: 378 IT 379 I Sbjct: 168 IA 169 >gi|227529437|ref|ZP_03959486.1| possible restriction modification system DNA specificity subunit [Lactobacillus vaginalis ATCC 49540] gi|227350647|gb|EEJ40938.1| possible restriction modification system DNA specificity subunit [Lactobacillus vaginalis ATCC 49540] Length = 171 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 48/151 (31%), Gaps = 10/151 (6%) Query: 246 RKNTKLIESNILSLS------YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 RK I ++ Y + R++ + + +++ ++ Sbjct: 21 RKKQYYANKGIAWITPKDLSGYSKMYISHGARDISQEGLDNSSAKLLPKDTVLVSSRAPI 80 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + +G + + + YL +LM + ++ + + Sbjct: 81 GYVALAANKITTNQGFKS---IVPNTDIVLPKYLYYLMLTKK-DELENVSSGSTFKEVSG 136 Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARID 390 +K V +P + +Q +I I T +I+ Sbjct: 137 RVMKGFEVDIPSLDKQANIIQKIEPITRKIE 167 >gi|168308225|ref|ZP_02690900.1| reStriction-modification enzyme mpuuiii s subunit [Ureaplasma parvum serovar 1 str. ATCC 27813] gi|171902622|gb|EDT48911.1| reStriction-modification enzyme mpuuiii s subunit [Ureaplasma parvum serovar 1 str. ATCC 27813] Length = 202 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 16/159 (10%), Positives = 52/159 (32%), Gaps = 7/159 (4%) Query: 254 SNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 I S N I + K Y + F I + + Sbjct: 40 QIINSKYIDNNIGSYPVISSNTKNNEIFGYINSYMYDGEFITISADGAYAGTVFLENGKF 99 Query: 314 GIITSAYMAVKPHG----IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 I ++ +K ++ ++ ++++ + R +++ +K + + + Sbjct: 100 SITNVCFILIKNKDIDFKFNNKFVYYILKKEQEINRLKSQVGSSRPAVREYSLKEIKINL 159 Query: 370 PPIKEQF---DITNVINVETARIDVLVEKIEQSIVLLKE 405 P ++ Q I + + + + + + + S++ + + Sbjct: 160 PNMEIQEEFSKIVEPLLNLSTKANKIEKILNDSLLKITK 198 >gi|322380438|ref|ZP_08054640.1| type I restriction modification protein [Helicobacter suis HS5] gi|321147149|gb|EFX41847.1| type I restriction modification protein [Helicobacter suis HS5] Length = 317 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 20/198 (10%), Positives = 54/198 (27%), Gaps = 9/198 (4%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ-KLETRN 272 + K + E + H +K + + + + + N+ L Sbjct: 112 YYQEKYTHNENLIKSHPHARLKDLVRIKKSIEPGSDAYKSVGVPFVRVSNLSPFDLSAST 171 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + L P+ E++F + + + I Y Sbjct: 172 IFLDPKRDLESLYPKQNEVLFSKDGSIGIAYCVPQDLKVVLSSAILRLEIKDCNIISPHY 231 Query: 333 LAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDI-------TNVINV 384 L+ ++ S + + LK + L + + + Q +I ++ Sbjct: 232 LSLVLNSQVVKLQVERESIGSVIAHLKLSKISNLLIPLLDQQIQQNIEIKLKKSADLRTQ 291 Query: 385 ETARIDVLVEKIEQSIVL 402 + ++E+ + Sbjct: 292 SFKLLKRAKTEVERQLTH 309 >gi|293115501|ref|ZP_05791808.2| putative type I restriction-modification system, modification subunit [Butyrivibrio crossotus DSM 2876] gi|292809619|gb|EFF68824.1| putative type I restriction-modification system, modification subunit [Butyrivibrio crossotus DSM 2876] Length = 587 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 16/153 (10%), Positives = 47/153 (30%), Gaps = 4/153 (2%) Query: 256 ILSLSYGNIIQKLETRNMGLKPE--SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER 313 L NI + + ++ E + + +V + + Sbjct: 429 YQYLMLANIQDGIISEDLPYLKELDKKQEKYCIKNNSLVISKNGAPVKVAVAYVEKGKQI 488 Query: 314 GIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPP 371 + Y+ D Y+ + S + + ++ + +K++ + P Sbjct: 489 LANGNLYIIELDETKADPYYVKAYLESENGAIALSRVTVGATLPNIPVDGLKKVLIPNPD 548 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + Q + + I VL +++++ L+ Sbjct: 549 MDTQKKVAEKYLTKVDEIKVLKYRLQKATSDLR 581 >gi|255527618|ref|ZP_05394480.1| restriction modification system DNA specificity domain protein [Clostridium carboxidivorans P7] gi|296187661|ref|ZP_06856055.1| type I restriction modification DNA specificity domain protein [Clostridium carboxidivorans P7] gi|255508690|gb|EET85068.1| restriction modification system DNA specificity domain protein [Clostridium carboxidivorans P7] gi|296047618|gb|EFG87058.1| type I restriction modification DNA specificity domain protein [Clostridium carboxidivorans P7] Length = 191 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 62/162 (38%), Gaps = 16/162 (9%) Query: 249 TKLIESNILSLSYGNIIQKLETRN----MGLKPESYETYQ---IVDPGEIVF--RFIDLQ 299 IE + GNI+ N + ++ G+I+ R Sbjct: 33 MDYIEEGTPVIRIGNILSDGILENNMENYVFVYDDVNKDFPLTTIELGDILMAVRGDGSA 92 Query: 300 NDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAM-GSGLRQS 356 + L + + + I+ + K + +++ YL W + S + A +++ Sbjct: 93 AKRIGLVTTEKLIGANISPNLLRIKAKENVVNNVYLFWYLISDVGQRRLDAYVNKTAKKN 152 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + +D+K++ VP I+ Q + +N ++D L K+++ Sbjct: 153 IAAKDIKKVVTPVPLIELQNQFADFVN----QVDKLKFKMQR 190 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 51/179 (28%), Gaps = 17/179 (9%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKDGNSRQSDTST 77 K W+ V + + + G D I + ++ S + D Sbjct: 10 KGWEEVELSNVCSVIHRYPTFYGMDYIEEGTPVIRIGNILSDGILENNMENYVFVYDDVN 69 Query: 78 V----SIFAKGQILYGKLGP-----YLRKAIIADFDGI-CSTQFLVLQPKDV--LPELLQ 125 + G IL G + G S L ++ K+ L Sbjct: 70 KDFPLTTIELGDILMAVRGDGSAAKRIGLVTTEKLIGANISPNLLRIKAKENVVNNVYLF 129 Query: 126 GWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 +L+S +R++A + K I + P+P + Q + + + Sbjct: 130 WYLISDVGQRRLDAYVNKTAKKNIAAKDIKKVVTPVPLIELQNQFADFVNQVDKLKFKM 188 >gi|291528112|emb|CBK93698.1| Type I restriction modification DNA specificity domain [Eubacterium rectale M104/1] Length = 191 Score = 47.1 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 47/137 (34%), Gaps = 6/137 (4%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITS 318 + N E ++ E + + G+++ D+ ++ V + + + Sbjct: 39 FNNYFLPDELFDLMDTNEKEQEIYSIKAGDVLITRTSETIDELAMSCVAVKDYPKATYSG 98 Query: 319 AYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKE 374 ++P Y+A+ RS K LR S + L V +P +E Sbjct: 99 FTKRLRPKKEGIAYPKYMAFYFRSELFRKAVTNNAFMTLRASFNEDIFTFLDVYLPIYEE 158 Query: 375 QFDITNVINVETARIDV 391 Q I +++ +I Sbjct: 159 QVRIGDMLYAVECKIQK 175 >gi|126661684|ref|ZP_01732688.1| hypothetical Type I restriction enzyme EcoEIspecificity protein (S protein) [Cyanothece sp. CCY0110] gi|126617032|gb|EAZ87897.1| hypothetical Type I restriction enzyme EcoEIspecificity protein (S protein) [Cyanothece sp. CCY0110] Length = 273 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 54/149 (36%), Gaps = 10/149 (6%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR--SLRSAQVMERGIITSAYMAVK 324 K+ ++ + G+ + + S+ ++ E + Y+ VK Sbjct: 118 KILPTKFTEDTKNNIENYFIQEGDFFVSRGNTIDLVALASVVEEEISEDILFPDLYIKVK 177 Query: 325 PHG--IDSTYLAWLMRSYDLCKVFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDIT 379 ID YLA L S+ F + G + ++ + +P IK+Q I Sbjct: 178 LDETVIDKKYLALLFNSFFGRLYFKYVSKGKNQTMVKISSRELYNFYLPIPDIKKQKKIV 237 Query: 380 NVINVET---ARIDVLVEKIEQSIVLLKE 405 I + ++I+ +EK I L+ E Sbjct: 238 EGITDKIDEQSKINKKIEKNIAKINLIIE 266 >gi|160945577|ref|ZP_02092803.1| hypothetical protein FAEPRAM212_03106 [Faecalibacterium prausnitzii M21/2] gi|158443308|gb|EDP20313.1| hypothetical protein FAEPRAM212_03106 [Faecalibacterium prausnitzii M21/2] Length = 156 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 10/74 (13%), Positives = 26/74 (35%), Gaps = 3/74 (4%) Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + H D ++ + +++ + K + + V V P + Q Sbjct: 74 LNTTLYVENFHENDEKFVYYFLKTLEWKKF---ASASAVPGINRNTVHIEIVRFPDFETQ 130 Query: 376 FDITNVINVETARI 389 I +V++ +I Sbjct: 131 QKIASVLSTIDKKI 144 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 18/157 (11%), Positives = 44/157 (28%), Gaps = 16/157 (10%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 WK+ + F L G K ++G G + T ++ Sbjct: 4 WKIDELGEFVTLKRGYDLPQQKR---------KNGEIPIFSSSGVT---GTHNEAMVEAP 51 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 ++ G+ G +T V + + + +L +++ + + Sbjct: 52 GVITGRYGTIGEVFFAETSFWPLNTTLYVENFHENDEKFVYYFLKTLE----WKKFASAS 107 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + + + P Q I + +I Sbjct: 108 AVPGINRNTVHIEIVRFPDFETQQKIASVLSTIDKKI 144 >gi|300215353|gb|ADJ79766.1| Putative uncharacterized protein [Lactobacillus salivarius CECT 5713] Length = 185 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 56/170 (32%), Gaps = 3/170 (1%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 LV + N+ ++ +L + + + I G+IV Sbjct: 1 MKLNELVKIESGINSVRVKDQNYTLYTIEDVNYDLGHGEDYQHDKASGKSITARGDIVIN 60 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SG 352 + R+A M I + + +D YL +L+ + + A Sbjct: 61 TVSNLASVVHSRNAGKMLNQIF-LRLNILDENTLDPWYLCYLLNKSEYIRYQEAAIMDGS 119 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + L +++ L + +P + +Q + + + +EK E L Sbjct: 120 VIRKLTKANLEDLEINLPGVVDQKKMGEAYKEIMKKYTLAMEKAELEKDL 169 Score = 37.1 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 59/176 (33%), Gaps = 10/176 (5%) Query: 29 PIKRFTKLNTGRTSESGKDIIY--IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + K+ +G S KD Y +EDV G + + S SI A+G I Sbjct: 2 KLNELVKIESGINSVRVKDQNYTLYTIEDVNYDLG----HGEDYQHDKASGKSITARGDI 57 Query: 87 LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL----PELLQGWLLSIDVTQRIEAICE 142 + + + + + FL L D L S + + AI + Sbjct: 58 VINTVSNLASVVHSRNAGKMLNQIFLRLNILDENTLDPWYLCYLLNKSEYIRYQEAAIMD 117 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G+ + + ++ + +P + +Q + E + + + +L + Sbjct: 118 GSVIRKLTKANLEDLEINLPGVVDQKKMGEAYKEIMKKYTLAMEKAELEKDLYLQM 173 >gi|3299821|gb|AAC25970.1| restriction-modification enzyme specificity subunit S2A [Mycoplasma pulmonis] Length = 274 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 13/79 (16%), Positives = 24/79 (30%), Gaps = 3/79 (3%) Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI- 389 + + R SL D V +P ++ Q I +I +I Sbjct: 3 EKYLFYFLKNKQEHIQSITYGSTRDSLTKTDFSDFVVSIPSLETQSAIIKIIEPLEKQIN 62 Query: 390 --DVLVEKIEQSIVLLKER 406 D L+ ++S+ Sbjct: 63 AFDELILSEQKSLQHYLNY 81 Score = 40.5 bits (93), Expect = 0.57, Method: Composition-based stats. Identities = 32/259 (12%), Positives = 74/259 (28%), Gaps = 24/259 (9%) Query: 125 QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + + + I++I G+T + + IP L Q I + I +I+ Sbjct: 5 YLFYFLKNKQEHIQSITYGSTRDSLTKTDFSDFVVSIPSLETQSAIIKIIEPLEKQINAF 64 Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 + Q + + + LN + S + ++++ L Sbjct: 65 DELILSE--------QKSLQHYLNYFLNKLASINPS-------IFKNYKLGQILNLEKGK 109 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 ++ N K + NI + + + + + + I+ I Sbjct: 110 SKYNAKYVSQNIGIYNLYSSKTRDQGIFGKINSYDFNGEYIL---------ITTHGAYAG 160 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + ++ ++ I T + LK ++ Sbjct: 161 TVKYVNEKFSTTSNCFILKVNENIVKTKFLSYLLLLQEKTFNDMAIGSAYGYLKNYNIND 220 Query: 365 LPVLVPPIKEQFDITNVIN 383 V +P +K Q I +I Sbjct: 221 FEVNLPNLKIQSAILGIIE 239 Score = 39.8 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 58/178 (32%), Gaps = 7/178 (3%) Query: 29 PIKRFTKLNTGRTSESGKDII-YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 + + L G++ + K + IG+ ++ S + G D + IL Sbjct: 98 KLGQILNLEKGKSKYNAKYVSQNIGIYNLYSSKTRDQGIFGKINSYDFNGEY------IL 151 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 G Y + ++ +L+ + + + L + + + G+ Sbjct: 152 ITTHGAYAGTVKYVNEKFSTTSNCFILKVNENIVKTKFLSYLLLLQEKTFNDMAIGSAYG 211 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 + I + + +P L Q I I +I+ L ++ + + L+ Sbjct: 212 YLKNYNINDFEVNLPNLKIQSAILGIIEPLHKKINLLKQKKKLLEKRSIYCQNHLIKE 269 >gi|302336934|ref|YP_003802140.1| transcriptional regulator [Spirochaeta smaragdinae DSM 11293] gi|301634119|gb|ADK79546.1| putative transcriptional regulator [Spirochaeta smaragdinae DSM 11293] Length = 543 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 46/134 (34%), Gaps = 2/134 (1%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 +K + D + ++ K+ +++ + + V + Sbjct: 219 YKNVEFKVVKDIIKSITPVKDSEDFSNSENEIYITKQGNLPSKINHKNFSNLLKIDVNHN 278 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 I+ YL RS ++ ++ D+ ++ + VPP+ EQ DI +N + Sbjct: 279 LINPKYLEIYFRSSLGQISLKSIQLGSSIPYIRRTDLLKIKIPVPPLIEQSDIVE-VNEK 337 Query: 386 TARIDVLVEKIEQS 399 + + +E Sbjct: 338 LNELKERIASLENE 351 >gi|145631984|ref|ZP_01787736.1| putative Type I restriction enzyme EcoR124II specificity protein [Haemophilus influenzae R3021] gi|144982368|gb|EDJ89948.1| putative Type I restriction enzyme EcoR124II specificity protein [Haemophilus influenzae R3021] Length = 259 Score = 46.7 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 22/190 (11%), Positives = 58/190 (30%), Gaps = 12/190 (6%) Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 KP + + + ++ +T +G E Y I+F Sbjct: 19 WKPLGEVTAYEQPTKYLVSSTVYSDEFSTPVLTAGKTFILGYTDEEEGIYFASKSPVIIF 78 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL 353 N + + + Y+ + + + ++ Sbjct: 79 DDFTTANK---WVDFDFKAKSSAMKMITSKDENITLLKYIYYWLNTLPNNQLDSDHK--- 132 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSS 409 RQ + + + +PP+ Q +I +++ TA L ++ ++L ++ R Sbjct: 133 RQWIS--NYANKLIPIPPLSVQTEIVKILDALTALTSELTSELTSELILRQKQYEYYREK 190 Query: 410 FIAAAVTGQI 419 ++ G++ Sbjct: 191 LLSEEELGKV 200 >gi|183508611|ref|ZP_02689741.2| restriction-modification enzyme subunit s3b [Ureaplasma parvum serovar 14 str. ATCC 33697] gi|182676069|gb|EDT87974.1| restriction-modification enzyme subunit s3b [Ureaplasma parvum serovar 14 str. ATCC 33697] Length = 209 Score = 46.7 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 55/158 (34%), Gaps = 10/158 (6%) Query: 255 NILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERG 314 N + + +I + E + IV G+++ ++ + S + + Sbjct: 48 NYMDIYKNFVINDDIKLRLYNASEKHIKSYIVSYGDLLLTASSETKEEIAFSSVYLSNKQ 107 Query: 315 IITSAY---MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVP 370 I + + + Y A+ RS K + +G R +L +D + + + + Sbjct: 108 AIFNGFSKIYKYDQKILLPIYAAFYFRSEFFRKEVIKLATGYTRFNLSIKDAENIEISIN 167 Query: 371 PIKEQFDITNV------INVETARIDVLVEKIEQSIVL 402 + Q + + ++ + +I+ ++ I Sbjct: 168 NFEFQKKFSKIVEPLLNLSTKANKIEKILNDSLLKITK 205 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 49/188 (26%), Gaps = 13/188 (6%) Query: 29 PIKRFTKLNTGRT----SESGKDIIYIGLEDVESG--TGKYLPKDGNSRQSDTSTVSIFA 82 ++ K G + + I +I D+ + + I + Sbjct: 21 KLRDIGKFKGGISTLDKNNYDSGINFINYMDIYKNFVINDDIKLRLYNASEKHIKSYIVS 80 Query: 83 KGQILYGKLGPYLR-----KAIIADFDGICSTQ--FLVLQPKDVLPELLQGWLLSIDVTQ 135 G +L +++ I + K +LP + S + Sbjct: 81 YGDLLLTASSETKEEIAFSSVYLSNKQAIFNGFSKIYKYDQKILLPIYAAFYFRSEFFRK 140 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELL 195 + + G T + K NI + I Q + + + L Sbjct: 141 EVIKLATGYTRFNLSIKDAENIEISINNFEFQKKFSKIVEPLLNLSTKANKIEKILNDSL 200 Query: 196 KEKKQALV 203 + + L+ Sbjct: 201 LKITKKLI 208 >gi|325996128|gb|ADZ51533.1| Type I restriction-modification system specificity subunit S [Helicobacter pylori 2018] gi|325997724|gb|ADZ49932.1| Type I restriction-modification system,specificity subunit S [Helicobacter pylori 2017] Length = 146 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 44/130 (33%), Gaps = 1/130 (0%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 +T I +S N G Y D I + + Sbjct: 7 STNKKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFNNDGENITIASRGEYAGFINYFN 66 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + G+ Y + + + +L + +++ ++ + + G +L D++ L + Sbjct: 67 EKFFAGGLCYP-YKVKDTNELLTKFLYFYLKTNEIQIMENLVFRGSIPALNKADIETLTI 125 Query: 368 LVPPIKEQFD 377 +PP++ Q + Sbjct: 126 PIPPLEIQQE 135 >gi|301799574|emb|CBW32126.1| putative type I restriction-modification system S protein [Streptococcus pneumoniae OXC141] Length = 202 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IPK W+ + G+T + +I ++ + D+ SG + + Sbjct: 27 IYEIPKAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 86 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 87 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 144 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 50/177 (28%), Gaps = 12/177 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGL 275 + +P W F +LV K + I +S ++ N + Sbjct: 27 IYEIPKAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNTRESISK 86 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G ++ F L II+ + I YL Sbjct: 87 LALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMI 145 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I + +++ ++ L Sbjct: 146 FLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIISKVDLLFQKVSQL 200 >gi|168308222|ref|ZP_02690897.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma parvum serovar 1 str. ATCC 27813] gi|171902585|gb|EDT48874.1| restriction-modification enzyme MpuUVI S subunit [Ureaplasma parvum serovar 1 str. ATCC 27813] Length = 246 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 12/141 (8%), Positives = 38/141 (26%), Gaps = 3/141 (2%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 N + K Y + I + + + Sbjct: 95 ISNNPGYYPLISASSKNNGIFGYFNDYMYDGKNITISMNGNAGCIFYQIGKFSANSDVLV 154 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF---D 377 ++ + + + + ++ R L +++ VL+P I+ Q Sbjct: 155 LSNPNKNLTNIDYIYYLLKTKEKEIQNLAIGTTRFRLGNSVIEKFKVLLPNIEIQEKFSK 214 Query: 378 ITNVINVETARIDVLVEKIEQ 398 I + + + + + + + + Sbjct: 215 IVEPLINLSTKANKIEKNLNE 235 >gi|242243196|ref|ZP_04797641.1| conserved hypothetical protein [Staphylococcus epidermidis W23144] gi|242233350|gb|EES35662.1| conserved hypothetical protein [Staphylococcus epidermidis W23144] Length = 193 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 20/145 (13%), Positives = 53/145 (36%), Gaps = 17/145 (11%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + ++V G+IV + ++ + ++ + ID+ Y Sbjct: 55 VTLSTTHQAKMVHTGDIVINMM--TSECVIVSQQHHESILPYNYTHIEIDTTHIDANYFV 112 Query: 335 WLMR-SYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + M S G + L +K+L + +PP+++Q I ++D Sbjct: 113 YWMNASAQAKSQLNQFKQGGSLVKKLTLNQLKQLKMTLPPLEQQQRIG--------KLD- 163 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVT 416 + + + L+ +R+ + ++ Sbjct: 164 ---ERRRHLKYLQAKRTYLMDQFLS 185 >gi|324993829|gb|EGC25748.1| hypothetical protein HMPREF9390_0215 [Streptococcus sanguinis SK405] Length = 211 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 51/146 (34%), Gaps = 9/146 (6%) Query: 28 VPIKRFTKLNTGRTSES----GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA- 82 + + +L +G +S + I ++D++ T + +S S S F Sbjct: 17 IKLGDIFELKSGYAFKSKDWVDEGKPVIKIKDIDGITIDITNLNYVKNKSQLSKASNFEV 76 Query: 83 -KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQG-WLLSIDVTQRIE 138 +I+ G K + +F+G + + + K L + L ++ + Sbjct: 77 FGKEIVMALTGATTGKIGVIPKNFNGYVNQRVGLFYAKTELSYAVLWSILQQQNIITDLI 136 Query: 139 AICEGATMSHADWKGIGNIPMPIPPL 164 + G+ ++ + + + + Sbjct: 137 KLSSGSAQANLSPFSVNSYDLNVTFK 162 Score = 44.0 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 25/209 (11%), Positives = 59/209 (28%), Gaps = 17/209 (8%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 S +G + + F + K I+ + ++ ++ K Sbjct: 11 FYSSNSIKLGDIFELKSGYAFKSKDWVDEGKPVIKIKDIDGITIDITNLNYVKNKSQLSK 70 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 ++E V EIV K + G + S + W Sbjct: 71 ASNFE----VFGKEIVMALTGATTGKIGVIPKNF--NGYVNQRVGLFYAKTELSYAVLWS 124 Query: 337 M--RSYDLCKVFYAMGSGLRQSLKFEDVKR--LPVLVPPIKEQFDITNVINVETARIDVL 392 + + + + + +L V L V + E ++ + + L Sbjct: 125 ILQQQNIITDLIKLSSGSAQANLSPFSVNSYDLNVTFKDLIE-------LDKVLSPLYEL 177 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I L + R + + ++G++ + Sbjct: 178 FCFNLSEIQRLSKLRDTLLPKLLSGELSV 206 >gi|113460699|ref|YP_718765.1| hypothetical protein HS_0554 [Haemophilus somnus 129PT] gi|112822742|gb|ABI24831.1| conserved hypothetical protein [Haemophilus somnus 129PT] Length = 133 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 48/139 (34%), Gaps = 14/139 (10%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + ++ +S +TY+ V PG+ V Q A GI + AY + Sbjct: 4 RDDIGIDIKYDQKSTQTYKRVSPGQFVIHLRSFQG-----GFAWSDIEGITSPAYTIIDF 58 Query: 326 H---GIDSTYLAWLMRSYDLCKVFYAMGSGLR--QSLKFEDVKRLPVLVPPIKEQFDITN 380 S + + S K + G+R +S+ F D L + I+EQ I Sbjct: 59 KKKENHSSNFWKLIFTSSSFIKKLETVTYGIRDGRSISFSDFSDLRLFYSQIQEQQKIGT 118 Query: 381 VINVETARIDVLVEKIEQS 399 +D + ++ Sbjct: 119 F----FTALDRYITIHQRK 133 >gi|283769411|ref|ZP_06342309.1| type I restriction modification DNA specificity domain protein [Bulleidia extructa W1219] gi|283103936|gb|EFC05321.1| type I restriction modification DNA specificity domain protein [Bulleidia extructa W1219] Length = 151 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 52/147 (35%) Query: 244 LNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKR 303 ++ K + S ++ + YG +K+ + + + + + L K Sbjct: 1 MDMKYKRYALSELVMIKYGKNQKKVHSEDGNIPIYGTGGLMGYATTALYDKPSVLIGRKG 60 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 ++ + +E T + D +L L + SL+ E + Sbjct: 61 TIGKVKYVEHPFWTVDTLFYTIINTDIVRPKYLYYIMSLIDLNNYNEGTTIPSLRIETLN 120 Query: 364 RLPVLVPPIKEQFDITNVINVETARID 390 RL +P I+EQ + + +N +I+ Sbjct: 121 RLEFDIPSIEEQEIVLSCLNPIDEKIE 147 >gi|167010575|ref|ZP_02275506.1| type I restriction enzyme EcoEI specificity protein [Francisella tularensis subsp. holarctica FSC200] gi|254369155|ref|ZP_04985167.1| predicted protein [Francisella tularensis subsp. holarctica FSC022] gi|290953274|ref|ZP_06557895.1| putative type I RM modification enzyme [Francisella tularensis subsp. holarctica URFT1] gi|295313480|ref|ZP_06804076.1| putative type I RM modification enzyme [Francisella tularensis subsp. holarctica URFT1] gi|157122105|gb|EDO66245.1| predicted protein [Francisella tularensis subsp. holarctica FSC022] Length = 133 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 17/118 (14%), Positives = 29/118 (24%), Gaps = 3/118 (2%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 + +P W+ + + G + KD IGL + D N + Sbjct: 18 LYKLPAWWEWKKLGELAEYVNGMAFKP-KDWSNIGLPIIRIQNLN-GSDDFNYFSGEAKE 75 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 G IL L + I + + + L Q Sbjct: 76 KYYVKSGDILISWS-ASLDVYKWQGGNAILNQHIFNTIINYDVVDYDFFITLLNIHYQ 132 >gi|57865913|ref|YP_190016.1| hypothetical protein SERP2473 [Staphylococcus epidermidis RP62A] gi|57636571|gb|AAW53359.1| hypothetical protein SERP2473 [Staphylococcus epidermidis RP62A] Length = 228 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 66/207 (31%), Gaps = 19/207 (9%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 K K S I + + + + + + + Q + Sbjct: 30 KIHKKKVSQISQLFTFHNGSLINRLETVEASQGITLPIYDQRMMEF--DDGVFQPSTHQP 87 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + + ++V G+IV + ++ + ++ + ID+ Y Sbjct: 88 KNVTLSTTHQAKMVHTGDIVINMM--TSECVIVSQQHHESILPYNYTHIEIDTTHIDANY 145 Query: 333 LAWLMR-SYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + M S G L + L +K+L + +P +++Q I ++ Sbjct: 146 FVYWMNASAQAKSQLNQFKQGGSLVKKLTLNQLKQLKMTLPSLEQQQRIG--------KL 197 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVT 416 D + + + L+ +R+ + ++ Sbjct: 198 D----ERRRHLKYLQAKRTYLMDQFLS 220 >gi|228472562|ref|ZP_04057322.1| putative type I restriction-modification system, S subunit [Capnocytophaga gingivalis ATCC 33624] gi|228275975|gb|EEK14731.1| putative type I restriction-modification system, S subunit [Capnocytophaga gingivalis ATCC 33624] Length = 132 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 14/102 (13%), Positives = 35/102 (34%), Gaps = 6/102 (5%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 FI + D + + + + P+ S YL +L+ + + Sbjct: 37 FIIVFGDHTRVVKYIDFDFIVGADGVKVILPNNNLSKYLYYLILNASYKIENRGYSRHFQ 96 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +++ +PP+ EQ+ I I + ++ + + Sbjct: 97 ------FLQKEFFPLPPLAEQYRIVQKIETYFSFLNTIESNL 132 >gi|257433905|ref|ZP_05610263.1| TypeIrestriction-modificationsystemspecificitysubunit [Staphylococcus aureus subsp. aureus E1410] gi|257281998|gb|EEV12135.1| TypeIrestriction-modificationsystemspecificitysubunit [Staphylococcus aureus subsp. aureus E1410] Length = 72 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 10/75 (13%), Positives = 29/75 (38%), Gaps = 5/75 (6%) Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSI 400 + K+ + + + +++ + +P ++EQ I + +ID + + I Sbjct: 1 MKKISANLQGTSIKGITKKELLDSIIKIPHNLEEQQKIGD----LFYKIDKYISFNKCKI 56 Query: 401 VLLKERRSSFIAAAV 415 +LK + + Sbjct: 57 EILKSLKQGLLQKIF 71 >gi|207108370|ref|ZP_03242532.1| type I restriction enzyme S protein (hsdS) [Helicobacter pylori HPKX_438_CA4C1] Length = 161 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 48/165 (29%), Gaps = 12/165 (7%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 IL L K G SY I + + +++ + Sbjct: 7 ILWLKRPKTQDKYPFFTSGDNILSYPKAIIDGRNCFLNTGGNAGIKFYVGKASYSTDTWC 66 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I + S YL L+ S + L+ +K+ P+ +P E Sbjct: 67 ICA--------NEFSDYLYLLLSSIKTHINQSFFQGTSLKHLQKNLLKKYPIYMPSAHEI 118 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 +I L+ ++ L++ R + +T Q+ Sbjct: 119 KKFNQIIMPLL----TLISINTRTSKKLEQIRDFLLPLLLTQQVK 159 >gi|32266297|ref|NP_860329.1| hypothetical protein HH0798 [Helicobacter hepaticus ATCC 51449] gi|32262347|gb|AAP77395.1| conserved hypothetical protein [Helicobacter hepaticus ATCC 51449] Length = 1056 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 40/359 (11%), Positives = 87/359 (24%), Gaps = 20/359 (5%) Query: 55 DVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY-GKLGPYLRKAIIADFDGICSTQFLV 113 D+ G Y+ + S K +Y + I + Q Sbjct: 690 DIIIGNPPYIDYRSIDENTKIS----LQKNSFVYTNSKRGSIFVYFIEKAAKLIHKQGYC 745 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + + + +S + I + ++ Sbjct: 746 IFINPINYICQDSGAGIREFIDNNLCLISMIDVSSFKVFNSASTYTCINCFTHKSQELKE 805 Query: 174 IIAETVRIDTLITERI-RFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHW 232 I + + K + +++ +T + + S + Sbjct: 806 INFGRANCEEELNNIALEKFPQSKIENLSILLDSITTKIFKANYPQLSSFCDI-----FC 860 Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + N+K + S ++ + + + S E +I + EI+ Sbjct: 861 ALSIAGFRNDVKNKKTKDNVPFLESSDIQKYDYKQGKFLHNAVSYYSTEKIKIFEDSEII 920 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 F + + + + I + L+ + K F + G Sbjct: 921 FMARMTNFIRCCIAPKAYFGGKVNILHNFKLDRKFILGVLNSKLINYFYAKKYFASHMQG 980 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE--------KIEQSIVLL 403 V LP+ Q I N I +I K+E I L Sbjct: 981 GAFGFDTLSVGSLPIPKITKANQ-RIVNEIVALVDKILESKAKDSTASTKKLESQIDFL 1038 >gi|325973479|ref|YP_004250543.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323652081|gb|ADX98163.1| putative type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 154 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 15/154 (9%), Positives = 43/154 (27%), Gaps = 4/154 (2%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 + + + K + + + + + Sbjct: 4 GNGKYPFFTCSFETKKSYTYSYDFPALLVSSGGSKFHAKVFFGKFQASTDTFIVKLGTTD 63 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 L +L Y + + + L + +K + +L+P I N Sbjct: 64 FIYLMLEFLNIIYLPQINWVTCATTFLKHLSPQKLKEIEILIPD----QKILEKFNNFWK 119 Query: 388 RIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I ++K+E + +E + + + + +I + Sbjct: 120 NIHSKIKKLELKMQKYEEIKKKLLDSLFSQEIQV 153 >gi|14520377|ref|NP_125852.1| site specific DNA-methyltransferase [Pyrococcus abyssi GE5] gi|5457592|emb|CAB49083.1| Site specific DNA-methyltransferase [Pyrococcus abyssi GE5] Length = 464 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 73/192 (38%), Gaps = 5/192 (2%) Query: 185 ITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTEL 244 I + K LV I+ +K +G + + + + + Sbjct: 224 IHHLTISKVKVMGKSVKLVDSILYPEFYLQDHLKLENSVQLGELVETRSGQTEYGEKRKF 283 Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 ++ I + +++ + + + + + E + GEIVF + + R+ Sbjct: 284 SKSGIPFISAKVVTPLGIDFTK--DKKFIQPNSEMDKKSAHAHVGEIVFVRVGVGTIGRT 341 Query: 305 LRSAQVMERGIIT--SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFED 361 E GI+ S + VK ++ YLA+ +++ + K G+ ++ + Sbjct: 342 AVITSKEEEGIVDDWSYILTVKSDKVNPYYLAFYLQAPTIKKQILRYARGVGTITIPQRE 401 Query: 362 VKRLPVLVPPIK 373 +K++PVL+PP Sbjct: 402 LKKIPVLIPPKD 413 >gi|332877053|ref|ZP_08444804.1| hypothetical protein HMPREF9074_00530 [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684943|gb|EGJ57789.1| hypothetical protein HMPREF9074_00530 [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 124 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 38/111 (34%), Gaps = 4/111 (3%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I S +I + D YL + + +++ M Sbjct: 11 IIKDGSGVGTVSYAQGRFSVIGTLNYLTSKGNHDLRYLYFALSAFNFQLYKTGMA---IP 67 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + F+D + + P + EQ + NV+ +++ +K+ S L K+ Sbjct: 68 HIYFKDYGKAKIYCPVLAEQKRVANVLGKLESKLF-AEKKLRASFNLQKQY 117 >gi|257139492|ref|ZP_05587754.1| type I restriction-modification system specificity determinant [Burkholderia thailandensis E264] Length = 304 Score = 46.3 bits (108), Expect = 0.010, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 44/132 (33%), Gaps = 12/132 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ WK++ + N + G+ Y+ + + + P S Sbjct: 97 ELPEGWKLLKASELIEFNPTESLRKGEVAPYLDMASLPTQGSWPDPYVMRPFGSGMR--- 153 Query: 80 IFAKGQILYGKLGPYLRK-------AIIADFDGICSTQFLVLQPKDVLP-ELLQGWLLSI 131 F G L ++ P L + D G ST+++V++PK +P + Sbjct: 154 -FRNGDTLLARITPCLENGKTAFIQCLPDDVVGWGSTEYIVMRPKGPVPAAFAYLLARND 212 Query: 132 DVTQRIEAICEG 143 + G Sbjct: 213 AFREHAIRSMTG 224 Score = 44.0 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 13/61 (21%), Positives = 25/61 (40%), Gaps = 4/61 (6%) Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 G+ RQ + V+R +PP+ EQ I ++ +D +E + L+ + Sbjct: 3 GTSGRQRVPSSAVERYSTRLPPLAEQRAIAKILGS----LDDKIELNRERSETLEAMGRA 58 Query: 410 F 410 Sbjct: 59 L 59 >gi|307637133|gb|ADN79583.1| typeI restriction-modification system subunit S [Helicobacter pylori 908] gi|325995724|gb|ADZ51129.1| Type I restriction-modification system specificity subunit S [Helicobacter pylori 2018] gi|325997320|gb|ADZ49528.1| Type I restriction enzyme specificity subunit [Helicobacter pylori 2017] Length = 298 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 14/86 (16%), Positives = 31/86 (36%), Gaps = 4/86 (4%) Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVIN 383 P+ + + Y + + + + V +PP EQ I ++ Sbjct: 5 PNKKIYFEFLYYLLKYHKDNISNMGVGTTFKGISKPALGLFQVKIPPTYYEQQKIARTLS 64 Query: 384 VETARID---VLVEKIEQSIVLLKER 406 V +I+ + E + + + LL E+ Sbjct: 65 VLDQKIENNHKINELLHKILELLYEQ 90 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 43/293 (14%), Positives = 81/293 (27%), Gaps = 24/293 (8%) Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP-LAEQVLIREKIIAETVR 180 + L I + G T +G + IPP EQ I + + Sbjct: 10 YFEFLYYLLKYHKDNISNMGVGTTFKGISKPALGLFQVKIPPTYYEQQKIARTLSVLDQK 69 Query: 181 IDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI-----EWVGLVPDHWEVK 235 I+ ++L+ + N G E L+P+ W V+ Sbjct: 70 IENNHKINELLHKILELLYEQYFVRFDFSDENNKPYQTSGGKMKFSKELNRLIPNGWSVR 129 Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 + + T S SY + I G+ + Sbjct: 130 FLNHKIVSTYQPKTISKTLLNDSYSYSVYGGGGIIGRFTEYNHEQSEFIISCRGQCGISY 189 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + L + + + TYL ++ Y L ++ Sbjct: 190 LTLPKSWITG-----------NAMVIRPTKSYTSKTYLYHTIKKYKLTNYI---TGSVQP 235 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + +++ +P+L+P I N N ++ + L+ QS L R Sbjct: 236 QITRQNLSTMPILIPK----RKILNKWNNISSLLWNLIHSNMQSTQTLTVLRD 284 >gi|217032197|ref|ZP_03437696.1| hypothetical protein HPB128_186g63 [Helicobacter pylori B128] gi|216946187|gb|EEC24796.1| hypothetical protein HPB128_186g63 [Helicobacter pylori B128] Length = 169 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 48/149 (32%), Gaps = 13/149 (8%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 +T +G E YQ ++ + + + + ++ + + + Sbjct: 17 GKTFILGYTNEKDNIYQASKSSPVII----FDDFTTATQWVDFPFKVKSSAMKILLPKNP 72 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + + + G RQ + ++ + +PP++ Q +I +++ A Sbjct: 73 TINIRFIFFYMQTIPYNI---SGEHTRQWISR--YSQITIPIPPLEIQQEIVKILDQFLA 127 Query: 388 RIDVLVEKIEQSIVLLKE----RRSSFIA 412 L+ I I K+ R + Sbjct: 128 LTTDLLAGIPAEIEARKKQYEYYREKLLT 156 >gi|32266934|ref|NP_860966.1| hypothetical protein HH1435 [Helicobacter hepaticus ATCC 51449] gi|32262986|gb|AAP78032.1| conserved hypothetical protein [Helicobacter hepaticus ATCC 51449] Length = 216 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 16/80 (20%), Positives = 33/80 (41%), Gaps = 4/80 (5%) Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 + +Y L G E + L ++ PP+K Q I NV+ I+ + + Sbjct: 139 LIAYILRDEGERAGFSRTLRASIERIAALKIIFPPLKSQQQIVNVVE----NIESHIAHL 194 Query: 397 EQSIVLLKERRSSFIAAAVT 416 + + L+ ++ + A+T Sbjct: 195 DSFLPTLQSQKQKILKEALT 214 >gi|285959355|gb|ADC39977.1| type I restriction-modification system small specificity subunit [Staphylococcus aureus] Length = 157 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 54/145 (37%), Gaps = 17/145 (11%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + + ++V G+IV + ++ + ++ + ID+ Y Sbjct: 19 VTLSTTHQAKMVHTGDIVINMM--TSECVIVSQQHHESILPYNYTHIEIDTAHIDANYFV 76 Query: 335 WLMR-SYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + M S G L + L +K+L + +PP+++Q I ++D Sbjct: 77 YWMNASSQAKSQLNQFKQGGSLVKKLTLNQLKQLKMTLPPLEQQQRIG--------KLD- 127 Query: 392 LVEKIEQSIVLLKERRSSFIAAAVT 416 + + + L+ +R+ + ++ Sbjct: 128 ---ERRRHLKYLQAKRTYLMDQFLS 149 >gi|171920270|ref|ZP_02931629.1| restriction modification enzyme subunit s2a [Ureaplasma parvum serovar 1 str. ATCC 27813] gi|171902674|gb|EDT48963.1| restriction modification enzyme subunit s2a [Ureaplasma parvum serovar 1 str. ATCC 27813] Length = 166 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 11/87 (12%), Positives = 31/87 (35%), Gaps = 3/87 (3%) Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA---MGSGLRQSLKFE 360 + + + +T+ + + + +L + + + R S+ Sbjct: 74 AGTTFWQEKNFSLTNHALVFIMNKLIKYNYKYLFLTLKKHESKIKELIISGSTRPSVSLS 133 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETA 387 +K + + +P I+EQ I ++I Sbjct: 134 LLKSINIKLPSIEEQNAIIDIIEQVIT 160 >gi|229817837|ref|ZP_04448119.1| hypothetical protein BIFANG_03121 [Bifidobacterium angulatum DSM 20098] gi|229784737|gb|EEP20851.1| hypothetical protein BIFANG_03121 [Bifidobacterium angulatum DSM 20098] Length = 145 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 41/113 (36%), Gaps = 2/113 (1%) Query: 296 IDLQNDKRSLRSAQVMERGI-ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 + + A E+ + I+ A I +LA+L + D + G + Sbjct: 1 MSENIEDVCTPLAWEGEQPVAISGHSCAYATKSIIPRHLAYLATAQDFQISKRKVAKGTK 60 Query: 355 Q-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + D+ R+ + VP Q + ++++ L + I I +++ Sbjct: 61 VIEVAPVDLSRVEIPVPCPATQRKVVDILDRFDTLTKSLTDGIPTEIEARRQQ 113 >gi|189462164|ref|ZP_03010949.1| hypothetical protein BACCOP_02846 [Bacteroides coprocola DSM 17136] gi|189431137|gb|EDV00122.1| hypothetical protein BACCOP_02846 [Bacteroides coprocola DSM 17136] Length = 262 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 19/167 (11%), Positives = 52/167 (31%), Gaps = 12/167 (7%) Query: 247 KNTKLIESNILSLSYGNIIQ-KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + E+ + + ++ + L + + E + I+ + Sbjct: 97 GSDAYQETGVPFIRVSDLSKFGLTDTAIHIDKEEFNNVIRPQKNTILLSKDGS----VGI 152 Query: 306 RSAQVMERGIITSAYMAV---KPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 +ITS + + YL ++ S + G + Q K + Sbjct: 153 AYKVEEPLDVITSGAILHLSLISTDVLPDYLTLVLNSPIVRLQAERDAGGSIIQHWKPSE 212 Query: 362 VKRLPVLVPPIKEQFDITNVINVET---ARIDVLVEKIEQSIVLLKE 405 ++ + + + P+ Q I+ I + L+ ++ + + E Sbjct: 213 IENVIIPILPMPIQQKISGKIQESFRLRKESEELLNNAKRKVEMTIE 259 >gi|212691985|ref|ZP_03300113.1| hypothetical protein BACDOR_01480 [Bacteroides dorei DSM 17855] gi|212665377|gb|EEB25949.1| hypothetical protein BACDOR_01480 [Bacteroides dorei DSM 17855] Length = 173 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 32/94 (34%), Gaps = 3/94 (3%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I S E +I + D YL + + +++ M Sbjct: 60 IIKDGSSVGTTSYVQGEFSVIGTLNYLTSKGNHDLRYLYFALSAFNFQPYKTGMA---IP 116 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + F+D + + P + EQ + NV+ +++ Sbjct: 117 HIYFKDYGKAKIYCPLLAEQKRVANVLGKLESKL 150 >gi|291534514|emb|CBL07626.1| Type I restriction modification DNA specificity domain [Roseburia intestinalis M50/1] Length = 194 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 60/193 (31%), Gaps = 10/193 (5%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTKLIESNILS--LSYGNIIQKLETRNMGLKPESYET 282 +G + F + +S + GN+ + + Sbjct: 3 LGETCKFFSGTGFPNKYQGNVHGTYPFYKVGDISRNVQEGNVRLRAADNYIEPDIVKAIK 62 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 I+ P +VF I R R A + +I + M ++P L + ++ Sbjct: 63 GTIIPPNTVVFAKIG--EALRLNRRAVTTQNCLIDNNAMGIQP-ITSVICLEYFLQFMIG 119 Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + S++ ++ + ++VP + Q ++ + D ++ Sbjct: 120 LDMNEYSTATALPSVRKSSLEMVKIIVPDVANQQQFSD----LAIQSDKSKLLLQNKYEK 175 Query: 403 LKERRSSFIAAAV 415 + + R + + Sbjct: 176 INQDR-RLLTCLM 187 >gi|288804030|ref|ZP_06409442.1| putative type I restriction-modification system, S subunit [Prevotella melaninogenica D18] gi|288333495|gb|EFC71958.1| putative type I restriction-modification system, S subunit [Prevotella melaninogenica D18] Length = 149 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 16/131 (12%), Positives = 36/131 (27%), Gaps = 6/131 (4%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 + Y T + + D + + + S + S Sbjct: 19 KSTAYSDDYSTPVLTAGKSFIIGHTDETEGIYNKLPCIIFDDFTKDSRLVDFPFKVKSSA 78 Query: 332 YLAWLMRSYDLCKVFYAMGSGLR------QSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + S R + + +L + +PP KEQ I +I+ Sbjct: 79 MKILQVNKGIDIEYVSQFMSITRLVGDTHKRYWISEYSKLEIPIPPQKEQKRIIRMIHQL 138 Query: 386 TARIDVLVEKI 396 ++ + E + Sbjct: 139 FKNLETIEENL 149 >gi|257438272|ref|ZP_05614027.1| putative toxin-antitoxin system, toxin component [Faecalibacterium prausnitzii A2-165] gi|257199349|gb|EEU97633.1| putative toxin-antitoxin system, toxin component [Faecalibacterium prausnitzii A2-165] Length = 154 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 37/107 (34%), Gaps = 7/107 (6%) Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 + R + + I T+ Y ID + + +YD+ S Sbjct: 53 GRKGAYRGVHYSDCPFSVIDTAFYAEPLTDRIDLKWAYYKFLTYDING---MDSGSAIPS 109 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 + + V VPP+++Q I V++ ID + ++ L Sbjct: 110 TDRYQIYSIEVEVPPLEKQRKIVAVLDC----IDRKININQKVNDNL 152 >gi|60681331|ref|YP_211475.1| putative type I restriction endonuclease specificity subunit, partial [Bacteroides fragilis NCTC 9343] gi|60492765|emb|CAH07539.1| putative type I restriction endonuclease specificity subunit, partial [Bacteroides fragilis NCTC 9343] Length = 213 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 15/139 (10%), Positives = 42/139 (30%), Gaps = 10/139 (7%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQV--MERGIITSAYMAVKPHGIDSTYLAW--- 335 + Y + G++ F D+ + + +I + T L + Sbjct: 76 KQYTLCQAGDVAFADASEDTDEIGKAVEFIRTHKASVICGLHTIHGRDIKCKTLLGFKRV 135 Query: 336 LMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 S+ + G S+ ++ + +P I Q I V + + Sbjct: 136 AFNSHYFHDQIKRLAQGTKVFSITSSNLSSCYIYIPDIVMQKSIV----VLFEAYEEQLI 191 Query: 395 KIEQSIVLLKERRSSFIAA 413 ++ + ++++ + Sbjct: 192 TNKRLLEQYEKQKRYLLQQ 210 >gi|148544100|ref|YP_001271470.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri DSM 20016] gi|325682360|ref|ZP_08161877.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM4-1A] gi|148531134|gb|ABQ83133.1| restriction modification system DNA specificity domain [Lactobacillus reuteri DSM 20016] gi|324978199|gb|EGC15149.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM4-1A] Length = 195 Score = 45.9 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 14/109 (12%), Positives = 36/109 (33%), Gaps = 5/109 (4%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I R + + A+K S + + ++ + Sbjct: 64 ILFSVRAPVGRVNWANQDLAVGRGLAALKIKSGYSKEYLYYLFKKIGGQLDSLATGTVFT 123 Query: 356 SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLL 403 S+ ++++ + + +P + +Q I + + +ID +E Q L Sbjct: 124 SINKKELEAIELKIPVNLSDQEKIADYL----QKIDQEIELNNQINDNL 168 Score = 40.9 bits (94), Expect = 0.44, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 54/158 (34%), Gaps = 3/158 (1%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K + +K + G++ +S G + TS G+ Sbjct: 4 KKIQLKDVADIVMGQSPKSVFYNTNGNGTPFLQGVRTFGENYPQIDTWTTSYNRKAKSGE 63 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL+ P R A+ D L+ K + + L + +++++ G Sbjct: 64 ILFSVRAPVGR-VNWANQDLAVGRGLAALKIKSGYSK-EYLYYLFKKIGGQLDSLATGTV 121 Query: 146 MSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRID 182 + + K + I + IP L++Q I + + I+ Sbjct: 122 FTSINKKELEAIELKIPVNLSDQEKIADYLQKIDQEIE 159 >gi|329575567|gb|EGG57104.1| conserved domain protein [Enterococcus faecalis TX1467] Length = 169 Score = 45.9 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 10/80 (12%), Positives = 28/80 (35%), Gaps = 5/80 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSE---SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 W++ + ++ G + + ++ +E++ + K + + Sbjct: 53 DWQLCKLGDVVEIFDGTHQTPRYTDSGVKFVSVENIATLETK--KYITHEAYEKEYSKKR 110 Query: 81 FAKGQILYGKLGPYLRKAII 100 KG IL ++G +I Sbjct: 111 AKKGDILMTRIGDIGTMKVI 130 >gi|91201731|emb|CAJ74791.1| hypothetical protein kuste4028 [Candidatus Kuenenia stuttgartiensis] Length = 274 Score = 45.9 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 73/207 (35%), Gaps = 11/207 (5%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 W +V I +++ + + +G++ G G +L ++ + + + Sbjct: 3 EWNMVRIGDVLKEVSREKRLDPNTKYRLLGVKW--YGKGVFLREEKYGNEIKATKLYEVK 60 Query: 83 KGQILYGKLGPYL--RKAIIADFDGI-CSTQF--LVLQPKDVLPELLQGWLLSIDVTQRI 137 + +Y +L + I +FDG S +F +LPE L +L + I Sbjct: 61 QRDFIYNRLFAWKSSFAVIPDEFDGCLVSNEFPLFTCVESKLLPEFLLSGMLLPENITAI 120 Query: 138 EAICEGA---TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + G + K N +P + Q I +K+ + E I L Sbjct: 121 NNLSGGMSSVSRKRFKEKDFLNFKIPQYGILTQSRICQKLKTISELSADQDLESAHQISL 180 Query: 195 LKEKKQALVSYIVTKGLNPDVKMKDSG 221 +K+ ++ ++ + L + + Sbjct: 181 IKQLRRRILQEAIEGKLTAKWRKQHPD 207 Score = 44.4 bits (103), Expect = 0.035, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 58/194 (29%), Gaps = 8/194 (4%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL-KPESYETYQIVDPGE 290 W + ++ E++R+ + L + + R V + Sbjct: 4 WNMVRIGDVLKEVSREKRLDPNTKYRLLGVKWYGKGVFLREEKYGNEIKATKLYEVKQRD 63 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 ++ + ++ + ++++ + + +L M + Sbjct: 64 FIYNRLFAWKSSFAVIP-DEFDGCLVSNEFPLFTCVESKLLPEFLLSGMLLPENITAINN 122 Query: 349 MGSGL----RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 + G+ R+ K +D + I Q I + + + I L+K Sbjct: 123 LSGGMSSVSRKRFKEKDFLNFKIPQYGILTQSRICQKLKTISELSADQDLESAHQISLIK 182 Query: 405 ERRSSFIAAAVTGQ 418 + R + A+ G+ Sbjct: 183 QLRRRILQEAIEGK 196 >gi|284051868|ref|ZP_06382078.1| restriction modification system DNA specificity domain protein [Arthrospira platensis str. Paraca] Length = 46 Score = 45.9 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 7/47 (14%), Positives = 18/47 (38%), Gaps = 4/47 (8%) Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 Q I +V++ + +E+ + + + +TG+ L Sbjct: 1 QKAIASVLSDMDKE----IAALEKRRAKTQAIKQGMMQELLTGRTRL 43 >gi|322372974|ref|ZP_08047510.1| type I restriction-modification system specificty subunit [Streptococcus sp. C150] gi|321278016|gb|EFX55085.1| type I restriction-modification system specificty subunit [Streptococcus sp. C150] Length = 206 Score = 45.9 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 63/186 (33%), Gaps = 8/186 (4%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K++ + G+ + + I L D+ Y S + + Sbjct: 19 KLIRLGDVVDQFKGKAVPAKAEPGEFAVINLSDMTPSGISYKDLKTFSEERRKLLRFLLE 78 Query: 83 KGQILYGKLGPYLRKAIIAD---FDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIE 138 G +L G + A+ D + + S+ VL+PK+ L + L ++ ++ Sbjct: 79 DGDVLIASKGTVQKVAVFEDQGKREVVASSNITVLRPKEKLRGFYIKFFLETEIGCTYLD 138 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQ-VLIREKIIAETVRIDTLITERIRFIELLKE 197 +G + + + +I +P P+ +Q I + ++ + + Sbjct: 139 YADKGKAVLNLSTADLLDIKIPEIPIVKQDYQIAAYLRGRADFHRKMVRAEQEWENIQHN 198 Query: 198 KKQALV 203 +AL Sbjct: 199 VTEALF 204 Score = 36.7 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 12/124 (9%), Positives = 36/124 (29%), Gaps = 4/124 (3%) Query: 263 NIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 + +++ E +++ G+++ E ++ Sbjct: 52 MTPSGISYKDLKTFSEERRKLLRFLLEDGDVLIASKGTVQKVAVFEDQGKREVVASSNIT 111 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQF-DI 378 + + Y+ + + + C G +L D+ + + PI +Q I Sbjct: 112 VLRPKEKLRGFYIKFFLETEIGCTYLDYADKGKAVLNLSTADLLDIKIPEIPIVKQDYQI 171 Query: 379 TNVI 382 + Sbjct: 172 AAYL 175 >gi|111656905|ref|ZP_01407731.1| hypothetical protein SpneT_02001845 [Streptococcus pneumoniae TIGR4] Length = 216 Score = 45.9 bits (107), Expect = 0.013, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 43/119 (36%), Gaps = 8/119 (6%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK------DIIYIGLEDV-ESGTGKYLPKDGNS 70 I IP+ W+ + G+T + +I ++ + D+ SG + + Sbjct: 41 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNARESISK 100 Query: 71 RQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLL 129 + + I KG +L + K I D + + + P +++ +L+ Sbjct: 101 LALKSKKIDISPKGTLLMS-FKLSIGKVAILDIPATHNEAIISIFPYANKENIIRDYLM 158 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 50/177 (28%), Gaps = 12/177 (6%) Query: 225 VGLVPDHWEVKPFFALVTELNRKNTK-----LIESNILSLSYGNIIQKLETRN----MGL 275 + +P+ W F +LV K + I +S ++ N + Sbjct: 41 IYEIPEAWRYIKFASLVNFRIGKTPPRSEATFWGTEIPWVSISDMPISGYVTNARESISK 100 Query: 276 KPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAW 335 + I G ++ F L II+ + I YL Sbjct: 101 LALKSKKIDISPKGTLLMSFKLSIGKVAILDIPATHNEAIIS-IFPYANKENIIRDYLMI 159 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + G ++L + L + + +E I +++ ++ L Sbjct: 160 FLPLISTLGDSKDAIKG--KTLNSTSISELLIPISNHEEMKRIIFKVDLLFQKVSQL 214 >gi|301633515|gb|ADK87069.1| type I restriction modification DNA specificity domain protein [Mycoplasma pneumoniae FH] Length = 145 Score = 45.6 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/127 (18%), Positives = 41/127 (32%), Gaps = 9/127 (7%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y V+ I + + S V D +L +R+ Sbjct: 3 YSKTFRVEEKSITVSARGT----IGVVFYRDFAYLPAVSLICFVPKEEFDIRFLFHALRA 58 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 K A G L K + VP +K+Q +IT +++ + L E + Sbjct: 59 IKFKKQGSATGQ-----LTVAQFKEYGIHVPSLKKQKEITAILDPLYSFFTDLNEGLPAE 113 Query: 400 IVLLKER 406 I L K++ Sbjct: 114 IELRKKQ 120 >gi|262068314|ref|ZP_06027926.1| putative type I restriction-modification system S subunit [Fusobacterium periodonticum ATCC 33693] gi|291377970|gb|EFE85488.1| putative type I restriction-modification system S subunit [Fusobacterium periodonticum ATCC 33693] Length = 235 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 17/149 (11%), Positives = 46/149 (30%), Gaps = 5/149 (3%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G+ I K + + Y ++ N S V + Sbjct: 23 GSKIGKYNFYTSSKEQNKFLDYYEYSNEALIIGTGGNANLHHSYGKFSVSTDCFV---LE 79 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + + + +L+++ + + + + E ++ + + + P+++Q I V Sbjct: 80 SKDKNFLIEFIYRYLLKNIYILE--NGFRGAGLKHISKEYLENIKIPIIPLEKQKIIIKV 137 Query: 382 INVETARIDVLVEKIEQSIVLLKERRSSF 410 + ID + L K ++ Sbjct: 138 LKNIDIFIDENKQIKNNLNFLSKSLFTTM 166 >gi|295087104|emb|CBK68627.1| Site-specific recombinase XerD [Bacteroides xylanisolvens XB1A] Length = 470 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 46/124 (37%), Gaps = 6/124 (4%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV-FYAMGSGLR 354 + + + + + I A + +L + S+D F A+ ++ Sbjct: 1 MIGTIGNKYFVTEKNVNFAIKNMALLKTSKSMYIMYFLWLYLSSWDYKHYEFNAISGSIQ 60 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + L + ++ +PV I N + + I + +++ + L ++R+ + Sbjct: 61 KFLSLDAMRNIPVPF-NYD----IAVAFNKQVSNICRCITNLKEENIQLIKQRNELLPLL 115 Query: 415 VTGQ 418 + GQ Sbjct: 116 MNGQ 119 >gi|319777299|ref|YP_004136950.1| hypothetical protein MfeM64YM_0575 [Mycoplasma fermentans M64] gi|318038374|gb|ADV34573.1| Conserved Hypothetical Protein [Mycoplasma fermentans M64] Length = 250 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 51/159 (32%), Gaps = 4/159 (2%) Query: 231 HWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGE 290 + + +KN + N S + + + +KP+ +++ G+ Sbjct: 94 NEIPFEIPKKWAWVRQKNILKLTKNEASKNGNYPYLEAKVLRKIIKPKIINNGVLINKGD 153 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 IV + + + + + G + S + +K + ++ + Sbjct: 154 IVILVDGENSGETFV----LDQTGYMGSTFKLLKINNKIDQEYVLMLLKFYKELFKKNKK 209 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 L + L + +P IKEQ +I + I Sbjct: 210 GAAIPHLNIDIFNNLLLAIPNIKEQKEIILKLKKIDNFI 248 Score = 43.6 bits (101), Expect = 0.066, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 59/162 (36%), Gaps = 12/162 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IPK W V K KL S++G + Y+ + + + +G Sbjct: 99 EIPKKWAWVRQKNILKLTKNEASKNG-NYPYLEAKVLRKIIKPKIINNGV---------- 147 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + KG I+ G + + D G + F +L+ + + +L + + Sbjct: 148 LINKGDIVILVDGENSGETFVLDQTGYMGSTFKLLKINNKID-QEYVLMLLKFYKELFKK 206 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 +GA + H + N+ + IP + EQ I K+ I Sbjct: 207 NKKGAAIPHLNIDIFNNLLLAIPNIKEQKEIILKLKKIDNFI 248 >gi|311900118|dbj|BAJ32526.1| hypothetical protein KSE_67680 [Kitasatospora setae KM-6054] Length = 465 Score = 45.6 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 48/138 (34%), Gaps = 5/138 (3%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITSAYMAVKP-HGIDSTYLAWLMR 338 T + PG+IV + E + + + V+P G+ YL ++ Sbjct: 68 TRHRLAPGDIVMTGKSGSPHLVGRSALWSGEVEGCCLNGSLIRVRPGRGVHPGYLHRVLY 127 Query: 339 SYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 LC F G+ + L V+ V VPP+ Q I+ + A I+ Sbjct: 128 YDALCGAFAGQLKGNSRLKHLDTGTVRAWRVPVPPLPVQQRISAAVEGMLADINAGEALQ 187 Query: 397 EQSIVLLKERRSSFIAAA 414 + L+ S + A Sbjct: 188 AATRSDLRMLWDSVLDAV 205 Score = 44.4 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 57/402 (14%), Positives = 126/402 (31%), Gaps = 40/402 (9%) Query: 20 AIPKHWKVVPIKRFTKLN--TGRTSESGKDIIY---IGLEDVESGTGKYLPKDGNSRQSD 74 +P W + I + ++ +G G I + +V Sbjct: 6 DLPPGWSHLRIDQIAQVQAGSGSVRPPGPGIALHAQLTSANVSWAGLDLRMLAETWLTRH 65 Query: 75 TSTVSIFAKGQILY-GKLGP---YLRKAII---ADFDGICSTQFLVLQPKDVLPELLQGW 127 +T A G I+ GK G R A+ + + + V + V P L Sbjct: 66 QATRHRLAPGDIVMTGKSGSPHLVGRSALWSGEVEGCCLNGSLIRVRPGRGVHPGYLHRV 125 Query: 128 LLSIDVTQRIEAICEGATM-SHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT 186 L + +G + H D + +P+PPL Q I + I+ Sbjct: 126 LYYDALCGAFAGQLKGNSRLKHLDTGTVRAWRVPVPPLPVQQRISAAVEGMLADINAGEA 185 Query: 187 ERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNR 246 + L+ +++ + L+ H + ++V + Sbjct: 186 LQAATRSDLRMLWDSVLDAVADGTLDNRPP----------ESASHHRIHEVASVVGGVQA 235 Query: 247 KNTKLIESNILSLSYGN------IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 T L N + +++ + + + + ++ ++V + Sbjct: 236 PRTVEDGVRHTYLRVANIAPETVDLDQVKHLTIPRERVCFLQHHLLQKDDLVVVRQNGSP 295 Query: 301 DKRSLRSAQVM--ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-- 356 D+ + +I + ++P+GIDS YL + + + + + S Sbjct: 296 DRLGQAALWHGQLPDILIQNHLARIRPYGIDSRYLELVWNAPSTLRPLRPLATSTTGSRT 355 Query: 357 LKFEDVKRLPVLVPPIKEQFDITN-------VINVETARIDV 391 L+ +D++ + V VP Q ++ ++ A +D Sbjct: 356 LRLDDIRAVRVRVPSAAAQAELVRAADRWKGHVDAVGALLDN 397 >gi|301598232|ref|ZP_07243240.1| putative restriction-modification protein [Acinetobacter baumannii AB059] Length = 162 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 57/145 (39%), Gaps = 8/145 (5%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY- 320 +I + E + Y+ V E+V D+ L + + ++ AY Sbjct: 9 HGLIDQHEKFKKRVASSDISGYKKVFKNELVM---GFPIDEGVLGFQKYYDAAAVSPAYK 65 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFD 377 + ++ YL ++RS L K++ + G R+S+ E + + PP + + Sbjct: 66 IFRLKREVNVEYLDLILRSNSLRKIYKSKMQGSVERRRSIPDEMFLNIEIPNPPEEVKDQ 125 Query: 378 ITNVINVETARIDVLVEKIEQSIVL 402 I + I+ +++ ++ + L Sbjct: 126 IVKQ-HKLIKEIENSLKENQKKLRL 149 >gi|195978029|ref|YP_002123273.1| type I restriction- system specificity subunit [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195974734|gb|ACG62260.1| type I restriction- system specificity subunit [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 198 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 67/192 (34%), Gaps = 14/192 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + +P+ + G+ + I L D+ Y I Sbjct: 14 EKIPLGQVVDCFKGKAVSRKAEAGEFGLINLSDMGQLGIDYRQVRAFHMDRRQLLRYILE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEA 139 G +L G + + + + + S+ VL+P+ VL + L + ++A Sbjct: 74 DGDVLIASKGTVQKVCVFHKQEKEMVASSNITVLRPQRVLRGYYIKFFLESAIGQALLKA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + + + R + +++ Sbjct: 134 ADHGKDVINLSTKALLDIPVPVIPLVKQ-------DYLINQYLRGLHDYQRKVNRAEQEW 186 Query: 200 QALVSYIVTKGL 211 Q + + KGL Sbjct: 187 Q-FIQNEIQKGL 197 >gi|303243809|ref|ZP_07330149.1| restriction modification system DNA specificity domain protein [Methanothermococcus okinawensis IH1] gi|302485745|gb|EFL48669.1| restriction modification system DNA specificity domain protein [Methanothermococcus okinawensis IH1] Length = 160 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 48/149 (32%), Gaps = 14/149 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +P+ WK+V + + + ++ P G + D Sbjct: 2 ELPEGWKLVKLGDIADILDKFRKPLNRYERETRKGNI--------PYCGANGIIDYINDY 53 Query: 80 IFAKGQILYGKLGPYLRKA----IIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 IF +L + G + +K + + VLQ K + + + Sbjct: 54 IFDGEYLLVAEDGGFFKKFERSSYLFKGKFWANNHVHVLQIKKEFSLNKYVYYV--LYFE 111 Query: 136 RIEAICEGATMSHADWKGIGNIPMPIPPL 164 +E C GAT + K + I +PIP Sbjct: 112 NLEKYCSGATRLKLNQKKLKEILIPIPYK 140 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 17/117 (14%), Positives = 37/117 (31%), Gaps = 11/117 (9%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA--VKPHGIDSTYLAW 335 Y I D ++ K S + + +K + Y+ + Sbjct: 47 IDYINDYIFDGEYLLVAEDGGFFKKFERSSYLFKGKFWANNHVHVLQIKKEFSLNKYVYY 106 Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV------PPIKEQFDITNVINVET 386 ++ +L K R L + +K + + + P +++Q +I N I Sbjct: 107 VLYFENLEKYC---SGATRLKLNQKKLKEILIPIPYKDGKPDLQKQKEIVNKIETLF 160 >gi|229553888|ref|ZP_04442613.1| conserved hypothetical protein [Lactobacillus rhamnosus LMS2-1] gi|229312747|gb|EEN78720.1| conserved hypothetical protein [Lactobacillus rhamnosus LMS2-1] Length = 132 Score = 45.6 bits (106), Expect = 0.016, Method: Composition-based stats. Identities = 9/64 (14%), Positives = 18/64 (28%), Gaps = 1/64 (1%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + + GR + + + + G Y + KG Sbjct: 68 WEKRKLGELAEFINGRAYKQDELLTSGKYPVLRVGNF-YTNDKWYYSDLELPEKYYAKKG 126 Query: 85 QILY 88 +LY Sbjct: 127 DLLY 130 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 9/32 (28%), Positives = 15/32 (46%), Gaps = 4/32 (12%) Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 VLVP + EQ I ++D L+ ++ Sbjct: 5 VLVPNLDEQQKIGTF----FKQLDHLITLHQR 32 >gi|148978191|ref|ZP_01814721.1| putative specificity protein s [Vibrionales bacterium SWAT-3] gi|145962613|gb|EDK27889.1| putative specificity protein s [Vibrionales bacterium SWAT-3] Length = 257 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 21/180 (11%), Positives = 55/180 (30%), Gaps = 7/180 (3%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 K E + + L+ + + ++++PG+ + + ++ Sbjct: 78 WTKKKHPDEVHYVDLANTKNGVIESVTSYEFEDAPSRARRVLNPGDTIVGTVRP-GNRSF 136 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD--LCKVFYAMGSGLRQSLKFEDV 362 Q + ++ + + P + L +L + D + + G ++K V Sbjct: 137 AYIGQTEQPLTGSTGFAVLTPKEEFWSSLVYLATTNDDSIDEYARLADGGAYPAIKPAVV 196 Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 +P I T + + L R + + ++G I+L Sbjct: 197 AETECAIPTGD----IAKKFWEITGPMLKKANQNRLENEELAALRDTLLPKLLSGDIELP 252 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 39/108 (36%), Gaps = 8/108 (7%) Query: 21 IPKHWKVVPIKRFTKLNTGRTSESGK---DIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 IP+ W I KL G++ K ++ Y+ L + ++G + + + Sbjct: 58 IPEGWTKGVISDIAKL-NGKSWTKKKHPDEVHYVDLANTKNGVIE-SVTSYEFEDAPSRA 115 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA---DFDGICSTQFLVLQPKDVLPE 122 + G + G + P R + ST F VL PK+ Sbjct: 116 RRVLNPGDTIVGTVRPGNRSFAYIGQTEQPLTGSTGFAVLTPKEEFWS 163 >gi|289647367|ref|ZP_06478710.1| predicted type I restriction-modification enzyme S subunit [Pseudomonas syringae pv. aesculi str. 2250] Length = 70 Score = 45.6 bits (106), Expect = 0.017, Method: Composition-based stats. Identities = 8/50 (16%), Positives = 17/50 (34%), Gaps = 7/50 (14%) Query: 364 RLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKER 406 P+ VP + EQ I ++ + + E ++ +E Sbjct: 9 NFPIPVPSLTEQARIVATLDKFDTLTNSISEGLPRETELRQKQYEYYREL 58 >gi|302528796|ref|ZP_07281138.1| predicted protein [Streptomyces sp. AA4] gi|302437691|gb|EFL09507.1| predicted protein [Streptomyces sp. AA4] Length = 133 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 40/114 (35%), Gaps = 14/114 (12%) Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + + IDS+YLA RS DL S + + D++ L V VP EQ Sbjct: 22 YFRVLDKEMIDSSYLASWFRSSDLQAQASQLMFKSDMAPYINLRDIRTLVVPVPGKIEQC 81 Query: 377 DITNV----INVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 V ++V A L R + ++G+ +R + Sbjct: 82 KQVEVQRGLLDVVHA--------AHSENKRLGRTRDELLPLLMSGKARVREAEK 127 >gi|321310222|ref|YP_004192551.1| type I restriction-modification system, S subunit (fragment) [Mycoplasma haemofelis str. Langford 1] gi|319802066|emb|CBY92712.1| type I restriction-modification system, S subunit (fragment) [Mycoplasma haemofelis str. Langford 1] Length = 120 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 35/83 (42%), Gaps = 3/83 (3%) Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + I++ Y V + ++ + K+ + G ++ D++ L Sbjct: 3 INLIDRDFFFISTIYKFVPHTWVLTSRYLYHFLLSHPQKIKGLIKDG---RIRKLDLEEL 59 Query: 366 PVLVPPIKEQFDITNVINVETAR 388 + VPP++ Q I NV++ ++ Sbjct: 60 IIPVPPLEIQERIANVLDKNRSQ 82 >gi|227364524|ref|ZP_03848587.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] gi|227070451|gb|EEI08811.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] Length = 162 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 10/96 (10%), Positives = 31/96 (32%), Gaps = 1/96 (1%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 I R + + A+K S + + ++ + Sbjct: 64 ILFSVRAPVGRVNWANQDLAVGRGLAALKIKSGYSKEYLYYLFKKIGGQLDSLATGTVFT 123 Query: 356 SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARID 390 S+ ++++ + + +P + +Q I + + I+ Sbjct: 124 SINKKELEAIELKIPVNLSDQEKIADYLQKIDQEIE 159 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 54/158 (34%), Gaps = 3/158 (1%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K + +K + G++ +S G + TS G+ Sbjct: 4 KKIQLKDVADIVMGQSPKSVFYNTNGNGTPFLQGVRTFGENYPQIDTWTTSYNRKAKSGE 63 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL+ P R A+ D L+ K + + L + +++++ G Sbjct: 64 ILFSVRAPVGR-VNWANQDLAVGRGLAALKIKSGYSK-EYLYYLFKKIGGQLDSLATGTV 121 Query: 146 MSHADWKGIGNIPMPIP-PLAEQVLIREKIIAETVRID 182 + + K + I + IP L++Q I + + I+ Sbjct: 122 FTSINKKELEAIELKIPVNLSDQEKIADYLQKIDQEIE 159 >gi|225870406|ref|YP_002746353.1| hypothetical protein SEQ_1032 [Streptococcus equi subsp. equi 4047] gi|225699810|emb|CAW93635.1| conserved hypothetical protein [Streptococcus equi subsp. equi 4047] Length = 198 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 67/192 (34%), Gaps = 14/192 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + +P+ + G+ + I L D+ Y I Sbjct: 14 EKIPLGQVVDCFKGKAVSRKAEAGEFGLINLSDMGQLGIDYHQVRAFHMDRRQLLRYILE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEA 139 G +L G + + + + + S+ VL+P+ VL + L + ++A Sbjct: 74 DGDVLIASKGTVQKVCVFHKQEREMVASSNITVLRPQRVLRGYYIKFFLESAIGQALLKA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + + + R + +++ Sbjct: 134 ADHGKDVINLSTKALLDIPVPVIPLVKQ-------DYLINQYLRGLHDYQRKVNRAEQEW 186 Query: 200 QALVSYIVTKGL 211 Q + + KGL Sbjct: 187 Q-FIQNEIQKGL 197 >gi|319758539|gb|ADV70481.1| type I restriction-modification system, S subunit [Streptococcus suis JS14] Length = 237 Score = 45.2 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 53/172 (30%), Gaps = 10/172 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQ 72 +P W V N G+T G DI ++ + D+ +G + + Sbjct: 65 KLPSSWCYVKFGGLVLFNIGKTPPRSEPNYWGDDIPWVSISDMSNNGHIFKTKEYLSDFA 124 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK-DVLPELLQGWLLSI 131 + V I + G +L + A+ + + + + P D + + + Sbjct: 125 INQKKVKIASAGTLLMSFKLTIGKVAL--EVPASHNEAIISIFPYGDKENIIRDYLMRFL 182 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 + + + I + +PI E I K+ ++ Sbjct: 183 PLISTTGNSKDAIKGKTLNSTSISGLLIPISNYREMKDIVTKVDLLFEKVAQ 234 >gi|319400013|gb|EFV88255.1| hypothetical protein GSEF_1922 [Staphylococcus epidermidis FRI909] Length = 193 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 53/147 (36%), Gaps = 17/147 (11%) Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + S ++V G+IV + ++ + ++ + ID+ Y Sbjct: 53 KHVTLSSTHQAKMVHTGDIVINMM--TSECVIVSQQHHESILPYNYTHIEIDTTHIDANY 110 Query: 333 LAWLMR-SYDLCKVFY--AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 + M S G L + L +K+L + +P +++Q I ++ Sbjct: 111 FVYWMNASAQAKSQLNQFKQGGSLVKKLTLNQLKQLKMTLPSLEQQQRIG--------KL 162 Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVT 416 D + + + L+ +R+ + ++ Sbjct: 163 D----ERRRHLKYLQAKRTYLMDQFLS 185 >gi|322379133|ref|ZP_08053530.1| methylase [Helicobacter suis HS1] gi|321148429|gb|EFX42932.1| methylase [Helicobacter suis HS1] Length = 332 Score = 45.2 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 20/198 (10%), Positives = 54/198 (27%), Gaps = 9/198 (4%) Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ-KLETRN 272 + K + E + H +K + + + + + N+ L Sbjct: 127 YYQEKYTHNENLIKSHPHARLKDLVRIKKSIEPGSDAYKSVGVPFVRVSNLSPFDLSAST 186 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + L P+ E++F + + + I Y Sbjct: 187 IFLDPKRDLESLYPKQNEVLFSKDGSIGIAYCVPQDLKVVLSSAILRLEIKDCNIISPHY 246 Query: 333 LAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDI-------TNVINV 384 L+ ++ S + + LK + L + + + Q +I ++ Sbjct: 247 LSLVLTSQVVKLQVERESIGSVIAHLKLSKISNLLIPLLDQQIQQNIEIKLKKSADLRTQ 306 Query: 385 ETARIDVLVEKIEQSIVL 402 + ++E+ + Sbjct: 307 SFKLLKRAKTEVERQLTH 324 >gi|146291273|ref|YP_001181697.1| hypothetical protein Sputcn32_0162 [Shewanella putrefaciens CN-32] gi|145562963|gb|ABP73898.1| conserved hypothetical protein [Shewanella putrefaciens CN-32] Length = 204 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 52/135 (38%), Gaps = 11/135 (8%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA-----VKPHGIDSTY 332 ++ + ++ G+++F S+ QV+ER + + + I + Sbjct: 63 KTKKQPDWLENGDVLFVAKGA--KHYSVLVEQVLERTVCSPHFFMLRLKPEFKDVIVPDF 120 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETAR--- 388 L W + + F A G S++ + ++ +P+ V ++Q + + Sbjct: 121 LCWQLNQQPAQRYFKATAEGSMYLSIRRQVLENVPIKVLNFEKQKQLAAMHRCAVREQKV 180 Query: 389 IDVLVEKIEQSIVLL 403 + L+E +Q I + Sbjct: 181 LQKLIENRQQQIEAI 195 >gi|330994839|ref|ZP_08318761.1| Type-1 restriction enzyme MjaXIP specificity protein [Gluconacetobacter sp. SXCC-1] gi|329758100|gb|EGG74622.1| Type-1 restriction enzyme MjaXIP specificity protein [Gluconacetobacter sp. SXCC-1] Length = 340 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 20/115 (17%), Positives = 38/115 (33%), Gaps = 3/115 (2%) Query: 301 DKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 R + ++ G+D ++A + + R L Sbjct: 27 PARDVAFFHEGPLWAGNHVHVLRPRAGVDGRFVAHALNTVAYDAYVE---GATRPKLTRA 83 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + LPV PP Q I ++ ARI + ++ L +E+ + + AV Sbjct: 84 RMNSLPVPCPPPACQRRIARELDGALARIARQLHELSVQAALAREQADAALWHAV 138 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 59/187 (31%), Gaps = 10/187 (5%) Query: 30 IKRFTKLNTGRTSES------GKDIIYIGLEDVESG---TGKYLPKDGNSRQSDTSTVSI 80 + + G T + G D+ ++ D+ +G ++ + + ++ Sbjct: 149 LGSVFDIVGGGTPPTARADCWGGDVPWLTPADLPAGAPVRLRHGARGLSIAGLAACRATL 208 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 G ++ P R + + S L P+ L L + Sbjct: 209 VPPGALVVSTRAPVGR-VGMTEVAVSVSQGCKALVPRGGDVALDYAAFLLRAHAPVLRQR 267 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 G+ + D + ++ +P+PPL Q + A R+ + L E + Sbjct: 268 AGGSVFAEVDTATLASLELPLPPLPVQRAVARTAWATMARLAAQDAAHAAMVAALHEYRP 327 Query: 201 ALVSYIV 207 AL V Sbjct: 328 ALRHARV 334 >gi|218437967|ref|YP_002376296.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7424] gi|218170695|gb|ACK69428.1| restriction modification system DNA specificity domain protein [Cyanothece sp. PCC 7424] Length = 228 Score = 45.2 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 62/170 (36%), Gaps = 5/170 (2%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 +N + ++ + ++ G + K K + + +QI + G+I+ Sbjct: 53 SIDKEDYNINGQPSEYAHITVRNIVQGELNLKDLIYLNEDKGITLKNFQI-EKGDILIAI 111 Query: 296 IDLQNDKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG- 352 + + + ++ + V I+ L + + S + F ++ +G Sbjct: 112 SSNVGVSCLVETVPSNLQLTLSHYIVKIKVDTSRINPKLLVYYLNSSKIKNYFRSVETGK 171 Query: 353 LRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIV 401 ++L + LP+ +P ++Q +I I I + I++ + Sbjct: 172 TLKNLSKNYIYNLPISLPKNTQKQLEIVKRIQPIETDILKIKASIKEPLE 221 >gi|311278009|ref|YP_003940240.1| hypothetical protein Entcl_0681 [Enterobacter cloacae SCF1] gi|308747204|gb|ADO46956.1| hypothetical protein Entcl_0681 [Enterobacter cloacae SCF1] Length = 190 Score = 45.2 bits (105), Expect = 0.021, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 53/161 (32%), Gaps = 15/161 (9%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + L + ++++ I G++V I + + + Sbjct: 41 DWQSGLVSAEKEQWVKTHQDVVITQKGDVVISLI--HGKAVRVSAENAGRILGNNYVKVD 98 Query: 323 VKPHGIDSTYLAWLMRSY---DLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 V ID+ + W ++ GS + Q + ++K V +PP+ +Q + Sbjct: 99 VDTSRIDAAWFLWHFNESPEGRRQRIQTTQGSTVVQRIAVNELKNFTVSLPPLAQQKAMG 158 Query: 380 N-VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + R +Q I L E ++ ++G I Sbjct: 159 GLYLAAREKRF------YQQQIAALSE--QQILS-LLSGMI 190 >gi|325973637|ref|YP_004250701.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323652239|gb|ADX98321.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 192 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 51/175 (29%), Gaps = 14/175 (8%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESN--------ILSLSYGNIIQKLETRNMGLKPESY 280 P WE L T K T +++ I + + K Y Sbjct: 4 PKKWEWVTLDKLGTFHRGKQTHYPKNDRTLFEGGTIPFIETQDCKSSRLFIKDVRKF--Y 61 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMR 338 + + + N+ SA + + ++ D ++ + Sbjct: 62 NQKGLQQGRLFPKNTVCISNNGNVADSAILDSQSCLSCDVHGFNSFSGISDPFFIKYCFD 121 Query: 339 SYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 + + A + R SL E +K + P + Q I ++++ I+ Sbjct: 122 FSKVKNTCISLAKSATTRLSLTTERLKIVEFPYPIYEIQQKIGSILSSRDLLIEN 176 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 50/179 (27%), Gaps = 11/179 (6%) Query: 22 PKHWKVVPIKRFTKLNTGR---------TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 PK W+ V + + + G+ T G I +I +D +S Q Sbjct: 4 PKKWEWVTLDKLGTFHRGKQTHYPKNDRTLFEGGTIPFIETQDCKSSRLFIKDVRKFYNQ 63 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIA-DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +F K + G AI+ C P ++ Sbjct: 64 KGLQQGRLFPKNTVCISNNGNVADSAILDSQSCLSCDVHGFNSFSGISDPFFIKYCFDFS 123 Query: 132 DVTQRIEAICEG-ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERI 189 V ++ + T + + + P P Q I + + + I+ + Sbjct: 124 KVKNTCISLAKSATTRLSLTTERLKIVEFPYPIYEIQQKIGSILSSRDLLIENNEMQNR 182 >gi|284048511|ref|YP_003398850.1| restriction modification system DNA specificity domain protein [Acidaminococcus fermentans DSM 20731] gi|283952732|gb|ADB47535.1| restriction modification system DNA specificity domain protein [Acidaminococcus fermentans DSM 20731] Length = 168 Score = 45.2 bits (105), Expect = 0.022, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 37/106 (34%), Gaps = 7/106 (6%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + I Q + + + A D+ +L +++ + +L + G Sbjct: 64 YALIGRQGALCGNMTFSMGKAYFTEHAVAVKANEINDTKFLYYILCNMNLGQY---SGQS 120 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 + L + L VP +EQ I++ + D L+ ++ Sbjct: 121 AQPGLAVNKLIALKAFVPGKQEQLKISSYLGA----FDNLITLHQR 162 >gi|237740353|ref|ZP_04570834.1| restriction endonuclease S [Fusobacterium sp. 2_1_31] gi|229422370|gb|EEO37417.1| restriction endonuclease S [Fusobacterium sp. 2_1_31] Length = 182 Score = 45.2 bits (105), Expect = 0.023, Method: Composition-based stats. Identities = 21/151 (13%), Positives = 51/151 (33%), Gaps = 1/151 (0%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 G + +T + + Y + I + ++ Sbjct: 28 YNKDKKGLPFYQGKTEFSDIYIKEPTVYCNSPIKVVEENDILMSVRAPVGDVNIATQKSC 87 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 I ++KP ID YL +L++ +GS +++ ++ L + + +Q Sbjct: 88 IGRGLASIKPKKIDYLYLFYLLKEQKSKIEKIGVGS-TFKAINKNNISTLKISIVEKDKQ 146 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKER 406 I N ++ ++ I ++ +K+R Sbjct: 147 NKIRNYLSSIEKLKFTIMTIILKAYKTMKKR 177 Score = 41.3 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 56/173 (32%), Gaps = 4/173 (2%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNS-RQSDTSTVSIFAKGQ 85 + + G++ S G ++ S + + + Sbjct: 8 EKQLNDVADIIMGQSPLSQSYNKDKKGLPFYQGKTEFSDIYIKEPTVYCNSPIKVVEEND 67 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 IL P IA ++PK + L + L + +IE I G+T Sbjct: 68 ILMSVRAPV-GDVNIATQKSCIGRGLASIKPKKID--YLYLFYLLKEQKSKIEKIGVGST 124 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + I + + I +Q IR + + T++T ++ + +K++ Sbjct: 125 FKAINKNNISTLKISIVEKDKQNKIRNYLSSIEKLKFTIMTIILKAYKTMKKR 177 >gi|332289024|ref|YP_004419876.1| hypothetical protein UMN179_00951 [Gallibacterium anatis UMN179] gi|330431920|gb|AEC16979.1| hypothetical protein UMN179_00951 [Gallibacterium anatis UMN179] Length = 194 Score = 45.2 bits (105), Expect = 0.023, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 44/127 (34%), Gaps = 10/127 (7%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-HGIDSTYLAWLMR- 338 + ++ +++F I +++ Q + Y + P +D ++L +L+ Sbjct: 55 DNPTLLQEDDVIFSLISGS----AVQVCQARAGYAFSHNYARLYPSKELDKSFLVYLLNN 110 Query: 339 -SYDLCKVFYAMGSGLRQSLKFEDVKRLPV-LVPPIKEQFDI--TNVINVETARIDVLVE 394 + ++ ++ +K L + +P + Q I + + + V Sbjct: 111 DTDIKRQLVASLQGSSVMKYSINQLKNLQLSPLPTLSVQQAIGQVDRLQRRITMLKKRVA 170 Query: 395 KIEQSIV 401 E + Sbjct: 171 DNEAQLT 177 >gi|120553351|ref|YP_957702.1| restriction modification system DNA specificity subunit [Marinobacter aquaeolei VT8] gi|120323200|gb|ABM17515.1| restriction modification system DNA specificity domain [Marinobacter aquaeolei VT8] Length = 588 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 52/132 (39%), Gaps = 12/132 (9%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID--STYLAWLMRS 339 Q + PG+I+ + + + I A++ ++P + + YL + S Sbjct: 450 QNQRIYPGDILLAIKGSVG-RVAFVDDTCGDNWIAGQAFIIIRPTSANISTPYLYRYLAS 508 Query: 340 YDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNVINVETARI---DVLVEK 395 + + + +G +L K DV +P+ +P + I + +I +++K Sbjct: 509 ELIQQYVQEVATGGVMALLKAADVSGIPLPLPEPE----ILKSVEETHQQILAEYEVIKK 564 Query: 396 IEQSIVLLKERR 407 ++ L E + Sbjct: 565 HRDTVRRL-ELK 575 >gi|294660606|ref|NP_853465.2| type I restriction-modification system specificity subunit domain-containing protein [Mycoplasma gallisepticum str. R(low)] gi|284812269|gb|AAP57033.2| type I restriction-modification system specificity (S) subunit domain protein [Mycoplasma gallisepticum str. R(low)] gi|284930963|gb|ADC30902.1| type I restriction-modification system specificity (S) subunit domain protein [Mycoplasma gallisepticum str. R(high)] gi|284931719|gb|ADC31657.1| type I restriction-modification system specificity (S) subunit domain protein [Mycoplasma gallisepticum str. F] Length = 205 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 14/153 (9%), Positives = 46/153 (30%), Gaps = 14/153 (9%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 N + ++ + V + + + + + + + T Sbjct: 51 NKTTEEKTNKNRYPVYSSQTLNNGLLGYYHEYLYEDTITWTTDGANAGTVNFRSGKFYCT 110 Query: 332 YLAWLMRSYDLC------KVFYAMGSG-----LRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + ++ S + + + L + + +++P +E+ + Sbjct: 111 NVCGVLLSKKVKADKMIAEALNNVAKSYVSYVGNPKLMNNVMAGVEIMIPTNEEERE--- 167 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I+ A +D L+ + + LK + S + Sbjct: 168 KISNIFATLDHLITLNQLKLEKLKNIKQSLLEK 200 >gi|225868641|ref|YP_002744589.1| hypothetical protein SZO_10620 [Streptococcus equi subsp. zooepidemicus] gi|225701917|emb|CAW99428.1| conserved hypothetical protein [Streptococcus equi subsp. zooepidemicus] Length = 198 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 67/192 (34%), Gaps = 14/192 (7%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + +P+ + G+ + I L D+ Y I Sbjct: 14 EKIPLGQVVDCFKGKAVSRKAEAGEFGLINLSDMGQLGIDYRQVRVFHMDRRQLLRYILE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEA 139 G +L G + + + + + S+ VL+P+ VL + L + ++A Sbjct: 74 DGDVLIASKGTVQKVCVFHKQEREMVASSNITVLRPQRVLRGYYIKFFLESAIGQALLKA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + + + R + +++ Sbjct: 134 ADHGKDVINLSTKALLDIPVPVIPLVKQ-------DYLINQYLRGLHDYQRKVSRAEQEW 186 Query: 200 QALVSYIVTKGL 211 Q + + KGL Sbjct: 187 Q-FIQNEIQKGL 197 >gi|301348563|ref|ZP_07229304.1| putative restriction-modification protein [Acinetobacter baumannii AB056] Length = 206 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 57/145 (39%), Gaps = 8/145 (5%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY- 320 +I + E + Y+ V E+V D+ L + + ++ AY Sbjct: 53 HGLIDQHEKFKKRVASSDISGYKKVFKNELVM---GFPIDEGVLGFQKYYDAAAVSPAYK 109 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFD 377 + ++ YL ++RS L K++ + G R+S+ E + + PP + + Sbjct: 110 IFRLKREVNVEYLDLILRSNSLRKIYKSKMQGSVERRRSIPDEMFLNIEIPNPPEEVKDQ 169 Query: 378 ITNVINVETARIDVLVEKIEQSIVL 402 I + I+ +++ ++ + L Sbjct: 170 IVKQ-HKLIKEIENSLKENQKKLRL 193 >gi|288905367|ref|YP_003430589.1| hypothetical protein GALLO_1166 [Streptococcus gallolyticus UCN34] gi|306831447|ref|ZP_07464605.1| type I restriction-modification system specificty subunit [Streptococcus gallolyticus subsp. gallolyticus TX20005] gi|325978356|ref|YP_004288072.1| hypothetical protein SGGBAA2069_c11560 [Streptococcus gallolyticus subsp. gallolyticus ATCC BAA-2069] gi|288732093|emb|CBI13658.1| conserved hypothetical protein [Streptococcus gallolyticus UCN34] gi|304426232|gb|EFM29346.1| type I restriction-modification system specificty subunit [Streptococcus gallolyticus subsp. gallolyticus TX20005] gi|325178284|emb|CBZ48328.1| hypothetical protein SGGBAA2069_c11560 [Streptococcus gallolyticus subsp. gallolyticus ATCC BAA-2069] Length = 198 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 55/151 (36%), Gaps = 6/151 (3%) Query: 28 VPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 +P+K T+ G+ +I I L D++ Y + + + +G Sbjct: 16 IPLKEITEHFKGKAVSKLGDSGNISVINLSDMDDTGIDYAHLKKIDCDEKSVSHYLLQEG 75 Query: 85 QILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAIC 141 +L G + A+ D I S VL+P + L+ D+ ++ Sbjct: 76 DVLIASKGTVKKIAVFAEQDEPVIASANITVLRPTSDILGGYIRLFLASDLGQALLDETN 135 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 G + + + + I +I +P P Q + + Sbjct: 136 TGKNVMNLNTQKIISIEIPKIPSIRQAYLIQ 166 >gi|257440122|ref|ZP_05615877.1| putative type I restriction-modification system subunit S [Faecalibacterium prausnitzii A2-165] gi|257197474|gb|EEU95758.1| putative type I restriction-modification system subunit S [Faecalibacterium prausnitzii A2-165] Length = 187 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 60/187 (32%), Gaps = 7/187 (3%) Query: 217 MKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK 276 M+ + E VP + + + + N + + + L Sbjct: 1 MRFNLWEDCNRVPLTELLSFIVDNRGKTVPTAPSGHKLIATNCVTNNTLFPVYDKIRYLS 60 Query: 277 PESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAW 335 E+Y+T+ P F++ R ++ I + I YL Sbjct: 61 EETYQTWFRAHPIPGDILFVNKGTPGRVCLVPDPVDFCIAQDMIALRADESKIYPKYLFT 120 Query: 336 LMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 ++RS ++ + Y G + K + + +L + +P Q I ++ V L Sbjct: 121 VLRSREIQQQIYNTNVGDVIPHFKKQFLDQLLIPIPERSIQESIGDLYYVL-----SLKA 175 Query: 395 KIEQSIV 401 + + I Sbjct: 176 ERNKKIN 182 >gi|227892235|ref|ZP_04010040.1| possible restriction modification system DNA specificity protein [Lactobacillus salivarius ATCC 11741] gi|227865957|gb|EEJ73378.1| possible restriction modification system DNA specificity protein [Lactobacillus salivarius ATCC 11741] Length = 223 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 49/160 (30%), Gaps = 10/160 (6%) Query: 257 LSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGII 316 + + Q N L + + ++ G+I+F + L + Sbjct: 72 PVVKIRELNQGHTDSNSDLCRKDIDESVQINTGDIIFSWSGTL-----LLDLWAGNEAGL 126 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQ 375 V + S ++ + Y A +K ++K ++P E Sbjct: 127 NQHLFKVTSNDYPSWFIYEWTKYYLQEFQLIAKSKATTMGHIKRSNLKESFAIIPDDDE- 185 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +N I KI + ++L++ + +A + Sbjct: 186 ---LTKLNNLLGPIFDNKIKIRKQNLILRQIKKQLLAKLL 222 >gi|301513225|ref|ZP_07238462.1| putative restriction-modification protein [Acinetobacter baumannii AB058] Length = 209 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 57/145 (39%), Gaps = 8/145 (5%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY- 320 +I + E + Y+ V E+V D+ L + + ++ AY Sbjct: 56 HGLIDQHEKFKKRVASSDISGYKKVFKNELVM---GFPIDEGVLGFQKYYDAAAVSPAYK 112 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFD 377 + ++ YL ++RS L K++ + G R+S+ E + + PP + + Sbjct: 113 IFRLKREVNVEYLDLILRSNSLRKIYKSKMQGSVERRRSIPDEMFLNIEIPNPPEEVKDQ 172 Query: 378 ITNVINVETARIDVLVEKIEQSIVL 402 I + I+ +++ ++ + L Sbjct: 173 IVKQ-HKLIKEIENSLKENQKKLRL 196 >gi|210610696|ref|ZP_03288577.1| hypothetical protein CLONEX_00767 [Clostridium nexile DSM 1787] gi|210152329|gb|EEA83335.1| hypothetical protein CLONEX_00767 [Clostridium nexile DSM 1787] Length = 191 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 46/137 (33%), Gaps = 6/137 (4%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME--RGIITS 318 + N E ++ E + + G+++ D+ ++ V + + + Sbjct: 39 FNNYFLPEELPDLMDTNEKEQQTYSIKAGDVLITRTSETIDELAMSCVAVKDYPKATYSG 98 Query: 319 AYMAVKPHGI---DSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKE 374 ++P Y+A+ RS K LR S + L V +P E Sbjct: 99 FTKRLRPKREGIAYPKYMAFYFRSALFRKAVTYNAFMTLRASFNEDIFTFLDVYLPDYDE 158 Query: 375 QFDITNVINVETARIDV 391 Q I +++ +I Sbjct: 159 QVRIGDMLYNIECKIRK 175 Score = 36.7 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 49/178 (27%), Gaps = 13/178 (7%) Query: 30 IKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPK-DGNSRQSDTSTVSIFAKGQ 85 + +++G +S+ G ++ V + D G Sbjct: 9 LSDLYDMSSGLSSKKEQAGHGAPFVSFGTVFNNYFLPEELPDLMDTNEKEQQTYSIKAGD 68 Query: 86 ILYGKLGPYLR-----KAIIADFDGICSTQFL-VLQPKDV---LPELLQGWLLSIDVTQR 136 +L + + + D+ + F L+PK P+ + + S + Sbjct: 69 VLITRTSETIDELAMSCVAVKDYPKATYSGFTKRLRPKREGIAYPKYMAFYFRSALFRKA 128 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 + + + + + +P EQV I + + +I Sbjct: 129 VTYNAFMTLRASFNEDIFTFLDVYLPDYDEQVRIGDMLYNIECKIRKNKEINDYLSYQ 186 >gi|50914300|ref|YP_060272.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10394] gi|50903374|gb|AAT87089.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10394] Length = 198 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 69/186 (37%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ + D+ I L D+ + +Y + Sbjct: 14 EKVTLGTVVDCFKGKAVSNKVVPGDVGLINLSDMGTLGIQYHQVRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G +L G + + + D + S+ VL+P+ +L ++ +L S ++A Sbjct: 74 DGDVLIASKGTLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIGQALLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + +I +R T ++ E E Sbjct: 134 ADHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLNRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 47/128 (36%), Gaps = 9/128 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ K + Q + ++ + + Y+ + + S Sbjct: 69 RYLLEDGDVLIASKG-TLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIG 127 Query: 343 CKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV----INVETARIDVLVEK-- 395 + A G +L +++ +P+ V P+ +Q + N + +++ ++ Sbjct: 128 QALLDAADHGKDVINLSTKELLDIPIPVIPLVKQDYLINHYLRGLTDYHRKLNRAEQEWE 187 Query: 396 -IEQSIVL 402 I+ I Sbjct: 188 YIQNEIQK 195 >gi|260428543|ref|ZP_05782522.1| N-6 DNA Methylase family protein [Citreicella sp. SE45] gi|260423035|gb|EEX16286.1| N-6 DNA Methylase family protein [Citreicella sp. SE45] Length = 575 Score = 44.8 bits (104), Expect = 0.027, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 42/126 (33%), Gaps = 18/126 (14%) Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERG----IITSAYMAVKPHGIDSTY---LAWL 336 Q + PG+++ + E + M ++P L Sbjct: 435 QRLIPGDVLIAVKGTVGSVALVPEGIPEENAETIWTAGQSMMILRPTRRGGIAALALYEY 494 Query: 337 MRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKE--------Q--FDITNVINVE 385 + + + ++ G QS+ +D+K LP+ +P ++ Q +I I Sbjct: 495 LSDSTVQEHIQSLAGGAVIQSIGMKDLKALPIPLPDLETLTEMHEGFQRRQEILFRIEEL 554 Query: 386 TARIDV 391 +++ Sbjct: 555 RKQLED 560 >gi|21910426|ref|NP_664694.1| hypothetical protein SpyM3_0890 [Streptococcus pyogenes MGAS315] gi|28896002|ref|NP_802352.1| hypothetical protein SPs1090 [Streptococcus pyogenes SSI-1] gi|56808388|ref|ZP_00366141.1| COG0732: Restriction endonuclease S subunits [Streptococcus pyogenes M49 591] gi|94990588|ref|YP_598688.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10270] gi|209559519|ref|YP_002285991.1| hypothetical protein Spy49_0996c [Streptococcus pyogenes NZ131] gi|21904624|gb|AAM79497.1| conserved hypothetical protein [Streptococcus pyogenes MGAS315] gi|28811252|dbj|BAC64185.1| hypothetical protein [Streptococcus pyogenes SSI-1] gi|94544096|gb|ABF34144.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10270] gi|209540720|gb|ACI61296.1| hypothetical protein Spy49_0996c [Streptococcus pyogenes NZ131] Length = 198 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 69/186 (37%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ S D+ I L D+ + +Y + Sbjct: 14 EKVTLGTVVDCFKGKAVSSKVVPGDVGLINLSDMGTLGIQYHQLRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G +L G + + + D + S+ VL+P+ +L ++ +L S ++A Sbjct: 74 DGDVLIASKGTLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIGQALLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + +I +R T ++ E E Sbjct: 134 ADHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLNRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 Score = 38.6 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 47/128 (36%), Gaps = 9/128 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ K + Q + ++ + + Y+ + + S Sbjct: 69 RYLLEDGDVLIASKG-TLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIG 127 Query: 343 CKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV----INVETARIDVLVEK-- 395 + A G +L +++ +P+ V P+ +Q + N + +++ ++ Sbjct: 128 QALLDAADHGKDVINLSTKELLDIPIPVIPLVKQDYLINHYLRGLTDYHRKLNRAEQEWE 187 Query: 396 -IEQSIVL 402 I+ I Sbjct: 188 YIQNEIQK 195 >gi|154244736|ref|YP_001415694.1| N-6 DNA methylase [Xanthobacter autotrophicus Py2] gi|154158821|gb|ABS66037.1| N-6 DNA methylase [Xanthobacter autotrophicus Py2] Length = 710 Score = 44.8 bits (104), Expect = 0.028, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 49/154 (31%), Gaps = 9/154 (5%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPE-SYETYQIVDP 288 + + + R E + + + + R E +++ + Sbjct: 523 SNSPRAKLGDIAPLVRRHVQIDPEKTYTEIGVRSFYKGIFHRRTIPGAEFTWQKLFRIAT 582 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP---HGIDSTYLAWLMRSYDLCKV 345 G++VF +L ++++ A + G + + M +L + R+ + Sbjct: 583 GDLVFS--NLMAWEQAIALASTADDGCVGNHRMLTCEADRTRCLPMFLWYYFRTPEGFAQ 640 Query: 346 FYAMGSGLRQS---LKFEDVKRLPVLVPPIKEQF 376 A G L E + + V VP + Q Sbjct: 641 VVAASPGSIARNKTLSAELLPNITVPVPSLDAQE 674 >gi|148993700|ref|ZP_01823147.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP9-BS68] gi|147927780|gb|EDK78803.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP9-BS68] Length = 119 Score = 44.8 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 14/123 (11%), Positives = 42/123 (34%), Gaps = 5/123 (4%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + I S + ++P + +++ + Sbjct: 1 MTTRGTVGNVAYYDELIKYKHLRINSGMVILRPKTPNLNQ-KFIIHVLRNNNYSRVISGS 59 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + L +K++ + +PP+ Q + + + A++D I++S+ L+ + S + Sbjct: 60 AQPQLPITKLKKILLPLPPLALQNEFADFV----AQVDKSQLAIQKSLEELETLKKSLMQ 115 Query: 413 AAV 415 Sbjct: 116 EYF 118 >gi|198245448|ref|YP_002218400.1| putative type I restriction-modification system specificity subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197939964|gb|ACH77297.1| putative type I restriction-modification system specificity subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] Length = 113 Score = 44.8 bits (104), Expect = 0.029, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 33/81 (40%), Gaps = 4/81 (4%) Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFD 377 A + V I+ YL + S + + A+ G +L + + L + +P Q + Sbjct: 17 AVIRVNSLLINPEYLYYFFNSPEGDEKISALQGGGLVVNLSLKKLLTLEIPIPLRPVQDE 76 Query: 378 IT---NVINVETARIDVLVEK 395 + + N + ++ L+E Sbjct: 77 VIGLRKIWNEQKKTLEDLIEN 97 >gi|325913415|ref|ZP_08175782.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 60-B] gi|325477341|gb|EGC80486.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 60-B] Length = 149 Score = 44.8 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 25/157 (15%), Positives = 56/157 (35%), Gaps = 15/157 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K+ + K+ G+ + V+S G +P G +T +++ K Sbjct: 4 KLCTLGELVKIKYGKNQKK-----------VQSEDGT-IPIYGTGGLMGYATDALYDKPS 51 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 +L G+ G + + T F + ++ +L+S+ +++ EG T Sbjct: 52 VLIGRKGTINKVHYVDHPFWTVDTLFYTEVNEKLVIPKYLYYLMSLL---DLDSYNEGTT 108 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + IP L Q + + +I Sbjct: 109 IPSLRTETLNRLKFDIPGLDYQGKVLSVLEPIDKKIK 145 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 22/110 (20%), Positives = 37/110 (33%), Gaps = 6/110 (5%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y T + D ++ N + + T Y V + YL +LM Sbjct: 41 YATDALYDKPSVLIGRKGTINKVHYV---DHPFWTVDTLFYTEVNEKLVIPKYLYYLMSL 97 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 DL SL+ E + RL +P + Q + +V+ +I Sbjct: 98 LDLDSYNE---GTTIPSLRTETLNRLKFDIPGLDYQGKVLSVLEPIDKKI 144 >gi|94994511|ref|YP_602609.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10750] gi|306827270|ref|ZP_07460557.1| type I restriction-modification system specificty subunit [Streptococcus pyogenes ATCC 10782] gi|94548019|gb|ABF38065.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS10750] gi|304430417|gb|EFM33439.1| type I restriction-modification system specificty subunit [Streptococcus pyogenes ATCC 10782] Length = 198 Score = 44.8 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 68/186 (36%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ S D+ I L D+ + +Y + Sbjct: 14 EKVTLGTVVDCFKGKAVSSKVVPGDVGLINLSDMGTLGIQYHQLRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEA 139 G +L G + + + D + S+ VL+P+ +L + L + + ++A Sbjct: 74 DGDVLIASKGTLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDLPIGQALLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + +I +R T ++ E E Sbjct: 134 ADHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLNRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 >gi|126649565|ref|ZP_01721806.1| hypothetical protein BB14905_06493 [Bacillus sp. B14905] gi|126593890|gb|EAZ87813.1| hypothetical protein BB14905_06493 [Bacillus sp. B14905] Length = 228 Score = 44.8 bits (104), Expect = 0.030, Method: Composition-based stats. Identities = 25/220 (11%), Positives = 61/220 (27%), Gaps = 11/220 (5%) Query: 30 IKRFTKLNTGRTSESGK-----DIIYIGLEDVESGTGKYLPK---DGNSRQSDTSTVSIF 81 ++ K+ G+ S K I YI E + + + +S + + S+ Sbjct: 11 LEEVAKIKMGKMFTSEKCFTREGIPYITEEALNKLSLEDDTSCLPKVDSTLKEQYSFSLV 70 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 IL K D S + + + P + + + + Sbjct: 71 PTQSILLNKTNLKDTSIYQCKTDVCISHEIIAIIPNESILSSDYLFHFIKWHQHNNKKCD 130 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 + M I + + + +Q+L + + L K + Sbjct: 131 DYRLMIELPSIVIQHQVVQVLNAVQQLLANK--EYLVTAVKNLPKHFDDTSRQAKHHSNS 188 Query: 202 LVSYIVT-KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFAL 240 L L + +++ P++ + ++ Sbjct: 189 LYQGFEQLHYLYIAMLNHIFNGDYLHDFPEYHACRKLYSH 228 >gi|288800631|ref|ZP_06406089.1| DNA modification methylase [Prevotella sp. oral taxon 299 str. F0039] gi|288332844|gb|EFC71324.1| DNA modification methylase [Prevotella sp. oral taxon 299 str. F0039] Length = 1170 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 51/182 (28%), Gaps = 13/182 (7%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET 282 E +G + + + L + + + N Sbjct: 908 EQIGRYNVNQNHLQWTIYTDSNYKAPNSLDNMPHIKQHLDKFQNIITSDNKPYGLHRSRK 967 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +I+ + K + + + ++ + + ++ +L L+ S + Sbjct: 968 EFYFKNEKIIATRKSIDRPKFAYCNFE----CFVSQTFNMIHTTRVNMKFLTGLLNSKLI 1023 Query: 343 CKVFYAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET-------ARIDVLV 393 G G L E + +P+ VP + Q I +++ RID + Sbjct: 1024 EFWLKNKGKMQGANFQLDKEPLMHIPIAVPTQEIQQLIAKLVDCIIFIKSTHNERIDKFI 1083 Query: 394 EK 395 Sbjct: 1084 SN 1085 >gi|227891952|ref|ZP_04009757.1| restriction-modification protein [Lactobacillus salivarius ATCC 11741] gi|227866286|gb|EEJ73707.1| restriction-modification protein [Lactobacillus salivarius ATCC 11741] Length = 767 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 42/140 (30%), Gaps = 9/140 (6%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM---ERGIITSAYMAVKPH 326 + + E+ Y +V I F + + T Sbjct: 630 ENFISVTSENNINYNVVREKYISFNPSRANVGSFGINMSNTPVAVSNAYPTFRLKQGMES 689 Query: 327 GIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 Y+ + S + + +RQ+L + +L + EQ I N + Sbjct: 690 RYLMEYIYLQLTHNSRVIEDIAERSYGTIRQALNATEFLKLQIKDISFDEQQKIVNTVEK 749 Query: 385 ETARIDVLVEKIEQSIVLLK 404 + ++ V +I++ + L Sbjct: 750 KHSQ----VLQIQKELNNLN 765 >gi|94988698|ref|YP_596799.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS9429] gi|94992521|ref|YP_600620.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS2096] gi|94542206|gb|ABF32255.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS9429] gi|94546029|gb|ABF36076.1| Type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS2096] Length = 198 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 70/186 (37%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ S D+ I L D+ + +Y + Sbjct: 14 EKVTLGTVVDCFKGKAVSSKVVPGDVGLINLSDMGTLGIQYHQLRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G +L G + + + D + S+ VL+P+ +L ++ +L S ++A Sbjct: 74 DGDVLIASKGTLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIGQALLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + G + + K + +IP+P+ PL +Q + +I +R T ++ E E Sbjct: 134 VDHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLNRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 48/128 (37%), Gaps = 9/128 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ K + Q + ++ + + Y+ + + S Sbjct: 69 RYLLEDGDVLIASKG-TLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIG 127 Query: 343 CKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV----INVETARIDVLVEK-- 395 + A+ G +L +++ +P+ V P+ +Q + N + +++ ++ Sbjct: 128 QALLDAVDHGKDVINLSTKELLDIPIPVIPLVKQDYLINHYLRGLTDYHRKLNRAEQEWE 187 Query: 396 -IEQSIVL 402 I+ I Sbjct: 188 YIQNEIQK 195 >gi|283796720|ref|ZP_06345873.1| conserved hypothetical protein [Clostridium sp. M62/1] gi|291075604|gb|EFE12968.1| conserved hypothetical protein [Clostridium sp. M62/1] Length = 179 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 13/124 (10%), Positives = 33/124 (26%), Gaps = 6/124 (4%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 I + + S+++ Y + ++ + Sbjct: 57 TISSSGANAGFVNLWGVPVWSSDSSFI--DFKMTPYVYFWHALLKRHQNNIYKIQTGSAQ 114 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + + LPV + + + T + L+ K + L+ R + Sbjct: 115 PHIYPSHIASLPVC--DLDF-GKVADYTERVT-PLFTLISKNYKESNQLRALRDWLLPML 170 Query: 415 VTGQ 418 + GQ Sbjct: 171 MNGQ 174 >gi|317131474|ref|YP_004090788.1| restriction modification system DNA specificity domain [Ethanoligenens harbinense YUAN-3] gi|315469453|gb|ADU26057.1| restriction modification system DNA specificity domain [Ethanoligenens harbinense YUAN-3] Length = 214 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 13/129 (10%), Positives = 39/129 (30%), Gaps = 4/129 (3%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + L + A + + A+ + + + + + + + Sbjct: 85 VNTVFLTARGTVGKLALAGRPMAMNQSCYALVGTEGLGQHYVYHLAQHVVESLKHKATGA 144 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + ++ D + V E + A I ++ + L E R S + Sbjct: 145 VFDAIVTRDFESEIVPDITTAE----ARSFEEKVAPIYEIILNNSNENIRLAELRDSLLP 200 Query: 413 AAVTGQIDL 421 ++G++ + Sbjct: 201 RLMSGELSV 209 >gi|86145619|ref|ZP_01063949.1| Type I restriction-modification system M subunit [Vibrio sp. MED222] gi|85836590|gb|EAQ54716.1| Type I restriction-modification system M subunit [Vibrio sp. MED222] Length = 812 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 9/79 (11%), Positives = 26/79 (32%), Gaps = 2/79 (2%) Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 P + Y S + + G +L ++ + +PP++ Q +I Sbjct: 540 NPDEALAEYFELFFSSELGRLILKKLPIGTYLPALSVATLREVQFPLPPLELQKEIVET- 598 Query: 383 NVETARIDVLVEKIEQSIV 401 + ++ + + + Sbjct: 599 QNKLNQLKKFISEYVSELT 617 >gi|329766471|ref|ZP_08258015.1| hypothetical protein Nlim_1825 [Candidatus Nitrosoarchaeum limnia SFB1] gi|329137070|gb|EGG41362.1| hypothetical protein Nlim_1825 [Candidatus Nitrosoarchaeum limnia SFB1] Length = 733 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 20/137 (14%), Positives = 50/137 (36%), Gaps = 2/137 (1%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG 327 +ET + K IV +I+F + +Q + Sbjct: 591 IETARVPKKDFDKGKIPIVKENDILFSIRGKIGKVGLVTKSQEGATINQNLVILRPHIPS 650 Query: 328 IDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 D+++L + ++S + + G + +++ +D++ L + P + I N + E Sbjct: 651 KDASFLLYYLKSEIVRYQLEHIQYGSVIFAVRIKDLENLLLPKPDGVKIQKI-NELKKEI 709 Query: 387 ARIDVLVEKIEQSIVLL 403 + L+ + E + + Sbjct: 710 EKYRKLLLEAENKLNEI 726 Score = 37.9 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 51/143 (35%), Gaps = 13/143 (9%) Query: 28 VPIKRFTK-LNTGRTSESG----KDIIYIGLEDVESGTGKYLPKDGN----SRQSDTSTV 78 + +K + + +G+ K+I I + D+ES + D + Sbjct: 547 IKLKETVQAIISGKDYPPMNLEFKEIPIIKIGDIESNGLIKTEIIETARVPKKDFDKGKI 606 Query: 79 SIFAKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDV--LPELLQGWLLSIDVT 134 I + IL+ G + ++ + ++L+P L +L S V Sbjct: 607 PIVKENDILFSIRGKIGKVGLVTKSQEGATINQNLVILRPHIPSKDASFLLYYLKSEIVR 666 Query: 135 QRIEAICEGATMSHADWKGIGNI 157 ++E I G+ + K + N+ Sbjct: 667 YQLEHIQYGSVIFAVRIKDLENL 689 >gi|325682979|ref|ZP_08162495.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM4-1A] gi|324977329|gb|EGC14280.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM4-1A] Length = 347 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 51/380 (13%), Positives = 105/380 (27%), Gaps = 46/380 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + T ++ KD +++ GK RQ Sbjct: 2 EYKKFTALFTDVTKTGTKIPKDEYLTTGKNIIIDQGKDSIAGYTDRQKGIFEEVPV---- 57 Query: 86 ILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I++ G + R D VL+ K+ + Sbjct: 58 IVF---GDHTRIVKYIDKPFFLGADGVKVLKSKEKESNYKYLYYALKAAHIPNTGYNRHF 114 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I M P L EQ I + + + T I + L + + + Sbjct: 115 K-------WLKQINMNYPDLNEQKNIVDILDSLTRII-------KVRQKELAFFDKLIKA 160 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 V +P + K+ + +G + T + + N GN Sbjct: 161 RFVEMFGDPIINNKNIKKKKLGDI-----CLLKAGDFTPSKKISPVKTSINKYPCFGGNG 215 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 I+ S Q G + F +N + ++ + +E Sbjct: 216 IRGYVDNYTHQGNYSLIGRQGALCGNVKFATGKFRNTEHAILVSPNIE------------ 263 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 I+S +L L+ L K+ + L + + + V V + Q + N + Sbjct: 264 ---INSRWLFELLN---LEKLNRFRSGAAQPGLAVKTLNEIIVPVADLNSQNEYANFVQQ 317 Query: 385 ET-ARIDVLVEKIEQSIVLL 403 ++ + +V + + + Sbjct: 318 VDKSKFENIVYLNKTLLNKI 337 >gi|86141515|ref|ZP_01060061.1| putative DNA restriction-modification system, DNA methylase [Leeuwenhoekiella blandensis MED217] gi|85832074|gb|EAQ50529.1| putative DNA restriction-modification system, DNA methylase [Leeuwenhoekiella blandensis MED217] Length = 816 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 57/167 (34%), Gaps = 5/167 (2%) Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 K LI I + + + + +K ++ I + + ++ K +L Sbjct: 407 GKEKTLIGKFIRTSNLKDNDVSYQLDLNEIKERELPSHSIKIENDCILISTRWKSLKPTL 466 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWL---MRSYDLCKVFYAMGS-GLRQSLKFED 361 + I +L +RS ++ K A + G SL D Sbjct: 467 FEYKGEPIYIGIDLLAIRVYSENFEVNPHYLISELRSPNVLKQVSAFQNPGAITSLNRAD 526 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + +P I+EQ I + + ++++ ++ K+ +S Sbjct: 527 FFAIKIALPSIEEQKAKVQGILELSEKF-KILQQERNALAHGKQVKS 572 >gi|298292627|ref|YP_003694566.1| hypothetical protein Snov_2653 [Starkeya novella DSM 506] gi|296929138|gb|ADH89947.1| conserved hypothetical protein [Starkeya novella DSM 506] Length = 201 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 46/136 (33%), Gaps = 10/136 (7%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSL-RSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 E V PG+++FR +N +L + ++ + I YLAW Sbjct: 52 EGLADRYFVRPGDVLFRSRGERNTASALDGRLREPALAVLPLMVLRPNREVITPEYLAWA 111 Query: 337 MRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 + + + F G + + L + VP IK Q I +D L E+ Sbjct: 112 INQPPVQRHFDLAARGTNIRMIPRSSLDDLELDVPDIKTQEAIVA--------LDALAER 163 Query: 396 IEQSIVLLKERRSSFI 411 + E R + Sbjct: 164 ERELSQFAAETRRQMM 179 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 18/155 (11%), Positives = 45/155 (29%), Gaps = 11/155 (7%) Query: 29 PIKRFTKLNTGRT------SESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + + TG T + + ++ I L D+ + + + Sbjct: 2 RLADVCAIQTGYTARGRLEPAAAEGVLAIQLRDISPNGLVDPERLARVQLEGLADRYFVR 61 Query: 83 KGQILYGKLGPYLRKAIIADF-----DGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 G +L+ G + + + L + + PE L + V + Sbjct: 62 PGDVLFRSRGERNTASALDGRLREPALAVLPLMVLRPNREVITPEYLAWAINQPPVQRHF 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 + G + + ++ + +P + Q I Sbjct: 122 DLAARGTNIRMIPRSSLDDLELDVPDIKTQEAIVA 156 >gi|229541310|ref|ZP_04430370.1| restriction modification system DNA specificity subunit [Bacillus coagulans 36D1] gi|229325730|gb|EEN91405.1| restriction modification system DNA specificity subunit [Bacillus coagulans 36D1] Length = 197 Score = 44.4 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 59/154 (38%), Gaps = 10/154 (6%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVKP 325 ++ + + E + G+++ R L ++ + +I S + + V Sbjct: 46 NDSFEEFVSNDELEDHYFTKEGDVLMR---LSQPYTAVCIDKEYSGLLIPSYFAIIKVDQ 102 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + Y+AW + ++++ K +G R S +K +P++ + +Q + + Sbjct: 103 SKVMPRYIAWYLNTWNVKKELERSQAGSRIPSTNQHVLKTIPIIAASLSKQKALIE-LYQ 161 Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 + L +K+ + LL ++G+ Sbjct: 162 LHQKEKRLYKKLIEEKELL---FQGIAQQILSGK 192 Score = 39.8 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 60/188 (31%), Gaps = 13/188 (6%) Query: 30 IKRFTKLNTGRTSESGK---------DIIYIGLEDV-ESGTGKYLPKDGNSRQSDTSTVS 79 + + TG K + L+++ E G + + + Sbjct: 3 LGEIADIKTGLVLSRKKAEIEYTAKATYKLLSLKNISEDGFLENDSFEEFVSNDELEDHY 62 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQR 136 +G +L PY I ++ G+ + + V+P + +L + +V + Sbjct: 63 FTKEGDVLMRLSQPYTAVCIDKEYSGLLIPSYFAIIKVDQSKVMPRYIAWYLNTWNVKKE 122 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 +E G+ + + + IP+ L++Q + E L + Sbjct: 123 LERSQAGSRIPSTNQHVLKTIPIIAASLSKQKALIELYQLHQKEKRLYKKLIEEKELLFQ 182 Query: 197 EKKQALVS 204 Q ++S Sbjct: 183 GIAQQILS 190 >gi|237752773|ref|ZP_04583253.1| type I restriction-modification system [Helicobacter winghamensis ATCC BAA-430] gi|229376262|gb|EEO26353.1| type I restriction-modification system [Helicobacter winghamensis ATCC BAA-430] Length = 198 Score = 44.4 bits (103), Expect = 0.036, Method: Composition-based stats. Identities = 20/192 (10%), Positives = 50/192 (26%), Gaps = 11/192 (5%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK---------DIIYIGLEDVESGT-GKYLPKDGNSRQ 72 W+ P+ ++ GRT + DI +I ++D+E + Sbjct: 8 NEWEEKPLSEIAEIGIGRTPPRKERHWFSTDSRDIKWISIKDMEEKIFIVNTSEFLTMEA 67 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 + + IL L + I + + + + + Sbjct: 68 IRKFRIPLIPPNTILLS-FKMTLGRVSITTENMLSNEAIAHFNLYSEYRLFTEYLYCFLK 126 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 + + ++ + I +I + IP V +I + Sbjct: 127 TFKYETLGSTSSIVTAINSTLIKSINIRIPDRKIIVEFSMIAKGFFDKIYNNTKQIQNLQ 186 Query: 193 ELLKEKKQALVS 204 + + + Sbjct: 187 AMRDMMLGKIFN 198 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 22/201 (10%), Positives = 58/201 (28%), Gaps = 17/201 (8%) Query: 223 EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNM----GLKPE 278 EW R +I +S ++ +K+ N ++ Sbjct: 9 EWEEKPLSEIAEIGIGRTPPRKERHWFSTDSRDIKWISIKDMEEKIFIVNTSEFLTMEAI 68 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 ++ P I+ F + I + + + + + YL ++ Sbjct: 69 RKFRIPLIPPNTILLSFKMTLGRVSITTENMLSNEAI--AHFNLYSEYRLFTEYLYCFLK 126 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 ++ + S + ++ +K + + +P I ++ + + Sbjct: 127 TFKYETL--GSTSSIVTAINSTLIKSINIRIPD----RKIIVEFSMIAKGFFDKIYNNTK 180 Query: 399 SIVLLKERRSSFIAAAVTGQI 419 I L+ R + G+I Sbjct: 181 QIQNLQAMRDMML-----GKI 196 >gi|78064666|ref|YP_367435.1| hypothetical protein Bcep18194_A3189 [Burkholderia sp. 383] gi|77965411|gb|ABB06791.1| hypothetical protein Bcep18194_A3189 [Burkholderia sp. 383] Length = 307 Score = 44.4 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 51/185 (27%), Gaps = 18/185 (9%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESY---ETYQIVDPGEIVFRFIDLQNDKRS 304 + L E I + N+ ++ S + +++ + Sbjct: 65 SCYLEEGGIPLVRSSNLSNNGIDYESAVRVPSEWISSERARIKDNDVLISIKGARAFFDM 124 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYL--AWLMRSYDLCKVFYAMGSGLRQSLKFEDV 362 ++ I+ + + WL+ S VF + + + + Sbjct: 125 CVASDKTSDAIVNGSIFRFQCKERYDPNFVVLWLLSSPIQSMVFRERTNLGISYISQDIL 184 Query: 363 KRLPVLVPPIKEQFDI-------TNVINVETARIDV---LVEKIEQSIVLLKERRSSF-I 411 K +P +Q I + + + ++ +++I + R + Sbjct: 185 KSIPFPEIEKNKQQLILRGYNAAIEMRDEMISSLNEVVRAKSLAKKTIDKI--YRDRLGM 242 Query: 412 AAAVT 416 VT Sbjct: 243 EEPVT 247 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 49/161 (30%), Gaps = 5/161 (3%) Query: 47 DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPY----LRKAIIAD 102 I + ++ + Y + +S + +L G + A Sbjct: 72 GIPLVRSSNLSNNGIDYESAVRVPSEWISSERARIKDNDVLISIKGARAFFDMCVASDKT 131 Query: 103 FDGICSTQFLVLQPK-DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPI 161 D I + Q K P + WLLS + + +S+ + +IP P Sbjct: 132 SDAIVNGSIFRFQCKERYDPNFVVLWLLSSPIQSMVFRERTNLGISYISQDILKSIPFPE 191 Query: 162 PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 +Q LI A D +I+ + K+ + Sbjct: 192 IEKNKQQLILRGYNAAIEMRDEMISSLNEVVRAKSLAKKTI 232 >gi|291461174|ref|ZP_06027362.2| hypothetical protein FUSPEROL_02035 [Fusobacterium periodonticum ATCC 33693] gi|291378476|gb|EFE85994.1| hypothetical protein FUSPEROL_02035 [Fusobacterium periodonticum ATCC 33693] Length = 190 Score = 44.4 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 53/162 (32%), Gaps = 8/162 (4%) Query: 253 ESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVME 312 E I + + E+Y ++ G+I+ D + + Sbjct: 17 EPAIFYVDISRKYDCFVEEITKINSEAYNRADKINKGQILVNLEDFDYEDIGRCIFYEND 76 Query: 313 -RGIITSAYMA-----VKPHGIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKR 364 I ++ Y+ + + D+ + + + + L D + Sbjct: 77 IPAAINGNVAILTLKEKFEDAVNLKYITFYLNYKDIVRQYVYDKVVGEKVKRLSRLDFEH 136 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +P+ +P I+ Q I + + + E +E++I L+ + Sbjct: 137 IPITIPLIERQDKIIDNFIKVRKKFENDFELLEKTIDLVNKY 178 >gi|154496689|ref|ZP_02035385.1| hypothetical protein BACCAP_00981 [Bacteroides capillosus ATCC 29799] gi|150273941|gb|EDN01041.1| hypothetical protein BACCAP_00981 [Bacteroides capillosus ATCC 29799] Length = 197 Score = 44.4 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 50/137 (36%), Gaps = 3/137 (2%) Query: 272 NMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDST 331 ES + G++V R + + + + + I Sbjct: 52 EDFYACESLDNALFTSKGDVVVRLLSPMYPVYVENNYENILVPSQFAVLRVKDREVIMPE 111 Query: 332 YLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 YL + + + + SG ++++K + + L + +PP++ Q +I+ + R + Sbjct: 112 YLRLWLAQKSIQERVLDLESGTAQKAVKIKTILNLDIFIPPLEVQKK-AVMIDTLSRRRE 170 Query: 391 VL-VEKIEQSIVLLKER 406 L E IE+ L + Sbjct: 171 CLYRELIEEERTLTENL 187 >gi|146321308|ref|YP_001201019.1| type I restriction enzyme [Streptococcus suis 98HAH33] gi|145692114|gb|ABP92619.1| type I restriction enzyme [Streptococcus suis 98HAH33] Length = 230 Score = 44.4 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 18/119 (15%), Positives = 41/119 (34%), Gaps = 9/119 (7%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDV-ESGTGKYLPKDGNSRQ 72 +P W V N G+T G DI ++ + D+ +G + + Sbjct: 70 KLPSSWCYVKFGGLVLFNIGKTPPRSEPNYWGDDIPWVSISDMSNNGHIFKTKEYLSDFA 129 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + V I + G +L + A+ + + + + P +++ +L+ Sbjct: 130 INQKKVKIASAGTLLMSFKLTIGKVAL--EVPASHNEAIISIFPYGDKENIIRDYLMRF 186 >gi|294789185|ref|ZP_06754424.1| conserved hypothetical protein [Simonsiella muelleri ATCC 29453] gi|294482926|gb|EFG30614.1| conserved hypothetical protein [Simonsiella muelleri ATCC 29453] Length = 195 Score = 44.4 bits (103), Expect = 0.038, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 41/114 (35%), Gaps = 7/114 (6%) Query: 300 NDKRSLRSAQVMERGIITSAYMAVKP--HGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQ 355 K L S + ++ + ++ ++ + I YL W K +Y+ Sbjct: 77 EPKAYLFSGSLKDKVVASNPFIIIHSLSEIILPKYLVWYFNHAITAKSYYSAVLRGTSFP 136 Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINV---ETARIDVLVEKIEQSIVLLKER 406 K P+ +PPI Q I + E +++ L+ ++ L E+ Sbjct: 137 IFTLAMAKEFPIKIPPITIQKQIIDRHTQALTEQKKLEQLIALRQEYNAALAEQ 190 >gi|139473677|ref|YP_001128393.1| hypothetical protein SpyM50834 [Streptococcus pyogenes str. Manfredo] gi|134271924|emb|CAM30162.1| conserved hypothetical protein [Streptococcus pyogenes str. Manfredo] Length = 198 Score = 44.4 bits (103), Expect = 0.039, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 69/186 (37%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ S D+ + L D+ + +Y + Sbjct: 14 EKVTLGTVVDCFKGKAVSSKVVPGDVGLVNLSDMGTLGIQYHQLRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G +L G + + + D + S+ VL+P+ +L ++ +L S ++A Sbjct: 74 DGDVLIASKGTLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIGQALLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + +I +R T ++ E E Sbjct: 134 ADHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLSRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 Score = 37.1 bits (84), Expect = 5.5, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 46/128 (35%), Gaps = 9/128 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ K + Q + ++ + + Y+ + + S Sbjct: 69 RYLLEDGDVLIASKG-TLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIG 127 Query: 343 CKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV----INVETARIDVLVEK-- 395 + A G +L +++ +P+ V P+ +Q + N + ++ ++ Sbjct: 128 QALLDAADHGKDVINLSTKELLDIPIPVIPLVKQDYLINHYLRGLTDYHRKLSRAEQEWE 187 Query: 396 -IEQSIVL 402 I+ I Sbjct: 188 YIQNEIQK 195 >gi|71903602|ref|YP_280405.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS6180] gi|71802697|gb|AAX72050.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS6180] Length = 198 Score = 44.4 bits (103), Expect = 0.039, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 69/186 (37%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ S D+ + L D+ + +Y + Sbjct: 14 EKVTLGTVVDCFKGKAVSSKVVPGDVGLVNLSDMGTLGIQYHQLRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G +L G + + + D + S+ VL+P+ +L ++ +L S ++A Sbjct: 74 DGDVLIASKGTLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIGQALLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + +I +R T ++ E E Sbjct: 134 ADHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLNRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 47/128 (36%), Gaps = 9/128 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ K + Q + ++ + + Y+ + + S Sbjct: 69 RYLLEDGDVLIASKG-TLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIG 127 Query: 343 CKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV----INVETARIDVLVEK-- 395 + A G +L +++ +P+ V P+ +Q + N + +++ ++ Sbjct: 128 QALLDAADHGKDVINLSTKELLDIPIPVIPLVKQDYLINHYLRGLTDYHRKLNRAEQEWE 187 Query: 396 -IEQSIVL 402 I+ I Sbjct: 188 YIQNEIQK 195 >gi|312898355|ref|ZP_07757745.1| conserved domain protein [Megasphaera micronuciformis F0359] gi|310620274|gb|EFQ03844.1| conserved domain protein [Megasphaera micronuciformis F0359] Length = 142 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 25/126 (19%), Positives = 50/126 (39%), Gaps = 4/126 (3%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP-- 325 G ES + Y+++ G+I F + ++ K + GI++ + ++P Sbjct: 3 YSESGNGASTESLDNYKVLRVGDIAFEGHENKDFKFGRFVMNDVGNGIMSPRFTVLRPLI 62 Query: 326 HGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITNVIN 383 + + ++ + K Y+ G + L ED V VP ++EQ I ++ Sbjct: 63 DMELNFWKEYINYEPIMQKKLVYSTKKGTMMNELVVEDFLNQYVAVPSVQEQQKIGYLLK 122 Query: 384 VETARI 389 T I Sbjct: 123 CMTDDI 128 >gi|290509518|ref|ZP_06548889.1| N-6 DNA methylase [Klebsiella sp. 1_1_55] gi|289778912|gb|EFD86909.1| N-6 DNA methylase [Klebsiella sp. 1_1_55] Length = 1304 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 61/177 (34%), Gaps = 16/177 (9%) Query: 26 KVVPIKRFTKLNTGRTSESGKD----------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 K+ + +++ GR +S + + YI ++++ GK S Sbjct: 464 KIASLVSISEVFPGRVHKSTELFDSPLNKTDAVGYIRIKNL--FQGKITRPSSWISASSL 521 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDG--ICSTQFLVLQPK--DVLPELLQGWLLSI 131 S +G IL+ + G + A++ + S F VL+ + P L +L S Sbjct: 522 SADERLREGDILFSRSGTIGKAAMVDGASAGSVASHGFYVLRVNSGKIEPGYLLAYLHSP 581 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + G + H + + +P+P+ P Q + I Sbjct: 582 VCQTWLLSRSRGTAIQHIHREALKMLPIPVLPHELQNHAAAQFHDFGTSAQAFILHM 638 Score = 44.0 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 30/93 (32%), Gaps = 1/93 (1%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 + G+I+F + A + V I+ YL + S Sbjct: 526 RLREGDILFSRSGTIGKAAMVDGASAGSVASHGFYVLRVNSGKIEPGYLLAYLHSPVCQT 585 Query: 345 VFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQF 376 + G Q + E +K LP+ V P + Q Sbjct: 586 WLLSRSRGTAIQHIHREALKMLPIPVLPHELQN 618 >gi|210630410|ref|ZP_03296445.1| hypothetical protein COLSTE_00329 [Collinsella stercoris DSM 13279] gi|210160492|gb|EEA91463.1| hypothetical protein COLSTE_00329 [Collinsella stercoris DSM 13279] Length = 71 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 12/64 (18%) Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL----LKERRSSFIAAAVTG 417 +K P+ +P EQ + E + + + ++S+ L L R + + ++G Sbjct: 1 MKSTPLSLP--NEQ------LRAEFSAFSHPILEQQKSLELENRRLCLLRDALLPKLMSG 52 Query: 418 QIDL 421 +ID+ Sbjct: 53 EIDV 56 >gi|258424859|ref|ZP_05687732.1| predicted protein [Staphylococcus aureus A9635] gi|257844951|gb|EEV68992.1| predicted protein [Staphylococcus aureus A9635] Length = 378 Score = 44.0 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 17/143 (11%), Positives = 42/143 (29%), Gaps = 12/143 (8%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 I + + + + +I+ + K + + E I + + Sbjct: 11 FINYDNVAYVNERIHNKYKKTQLQKFDILMSVRGVSIGKIGIFMGEYSEANISANLIIIR 70 Query: 324 KPHGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + Y+A + S ++G G + ++ + + + PP Sbjct: 71 LKNPSYAPYVAMSLISSVGQSQISRSIGGGSKPTITSGFIDEIEIPTPP----------- 119 Query: 383 NVETARIDVLVEKIEQSIVLLKE 405 I+ L + L KE Sbjct: 120 EEVLKNINQLFFEAFNQRGLAKE 142 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 51/177 (28%), Gaps = 7/177 (3%) Query: 30 IKRFTKLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGNSRQS-DTSTVSIFAKGQIL 87 +K T + +D + YI + +V + TG+ + + I IL Sbjct: 200 LKNLVTEVTESVDKLHEDKVGYIEISNVNNRTGRINGIKFDYINKLPKNGKIILKDEDIL 259 Query: 88 YGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMS 147 K+ PY I + L K + + G Sbjct: 260 ISKVRPYRGSIAIYKEY----SAELCTASKSAFVVIRAEEFMYPYYLTAFLRYRLGLDQI 315 Query: 148 HADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + G + + +I + + I+ + + I ++ Q+++ Sbjct: 316 VMNQSGTTYPTVKPEEIMNVKVILLE-DMKMKEINEIYRKNIDSKYHEEKNIQSIIE 371 >gi|161507541|ref|YP_001577495.1| Type I restriction-modification system specificity subunit [Lactobacillus helveticus DPC 4571] gi|160348530|gb|ABX27204.1| Type I restriction-modification system specificity subunit [Lactobacillus helveticus DPC 4571] Length = 262 Score = 44.0 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 11/63 (17%), Positives = 26/63 (41%), Gaps = 4/63 (6%) Query: 336 LMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 +M + + + G R L + + +L +L+P EQ I + + +D + Sbjct: 1 MMNAIKNFNIEPFLVGGGRAKLNADVMMKLNILLPTFVEQEKIGS----LFSLLDKTIAL 56 Query: 396 IEQ 398 ++ Sbjct: 57 HQR 59 >gi|319939012|ref|ZP_08013376.1| hypothetical protein HMPREF9459_00364 [Streptococcus anginosus 1_2_62CV] gi|319812062|gb|EFW08328.1| hypothetical protein HMPREF9459_00364 [Streptococcus anginosus 1_2_62CV] Length = 168 Score = 44.0 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 49/147 (33%), Gaps = 15/147 (10%) Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + + +G + E V +I + ++ A+V ++ + Sbjct: 26 FFKVSDMNIIGNEFEMQSANNYVSKEQIERKNWKPITSVPAIMFAKVGAAIMLNRKRLIR 85 Query: 324 KPHGIDSTYLAWLMRS---YDLCKVF-------YAMGSGLRQSLKFEDVKRLPVLVP-PI 372 P ID+ +A++ + K+ G S D++ + V +P + Sbjct: 86 HPFLIDNNTMAYIFDKTWDINFGKIIFDTIYLPKYSQVGALPSYNGSDIENINVFMPNSL 145 Query: 373 KEQFDITNVINVETARIDVLVEKIEQS 399 EQ I + + +D + ++ Sbjct: 146 PEQKAIGDF----FSTLDRSIALHQRE 168 >gi|284097666|ref|ZP_06385692.1| conserved hypothetical protein [Candidatus Poribacteria sp. WGA-A3] gi|283830823|gb|EFC34907.1| conserved hypothetical protein [Candidatus Poribacteria sp. WGA-A3] Length = 55 Score = 44.0 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 19/45 (42%), Gaps = 4/45 (8%) Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 I V++ ID + +EQ + + + +TG++ L Sbjct: 1 AIAAVLSD----IDAEITTLEQRRDKTRAIKQGMMQQLLTGRVRL 41 >gi|297590648|ref|ZP_06949286.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus MN8] gi|297575534|gb|EFH94250.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus MN8] Length = 160 Score = 44.0 bits (102), Expect = 0.045, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 42/131 (32%), Gaps = 15/131 (11%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRS-----LRSAQVMERGIITSAYMAVKPHGID 329 +K S + Y+ ++ G+I S + + + +G I Y+ P Sbjct: 30 IKVNSGKDYKHLEKGDIPVYGTGGYMTSVSEPLSEIDAVGIGRKGTINKPYLLEAPFWTV 89 Query: 330 STYLA----------WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 T +++ + S SL + + ++ VP KEQ I Sbjct: 90 DTLFYCTPKKETDILFILSLFRKINWKVYDESTGVPSLSKQTINKINRFVPSNKEQQKIG 149 Query: 380 NVINVETARID 390 +++ Sbjct: 150 EFFIKLDRQLN 160 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 43/159 (27%), Gaps = 18/159 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + K+N+G+ + +E G G + Sbjct: 20 EWEEKKLGDLIKVNSGKDYK-----------HLEKGDIPVYGTGGYMTSVSEP---LSEI 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + ++ T F K+ + + E Sbjct: 66 DAVGIGRKGTINKPYLLEAPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + I I +P EQ I E I +++ Sbjct: 122 TGVPSLSKQTINKINRFVPSNKEQQKIGEFFIKLDRQLN 160 >gi|164687372|ref|ZP_02211400.1| hypothetical protein CLOBAR_01013 [Clostridium bartlettii DSM 16795] gi|164603796|gb|EDQ97261.1| hypothetical protein CLOBAR_01013 [Clostridium bartlettii DSM 16795] Length = 165 Score = 44.0 bits (102), Expect = 0.046, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 51/159 (32%), Gaps = 13/159 (8%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K W + +L G+ +S + + +V S G Y GN + S Sbjct: 10 KGWSTELLGEICELKAGKNIKSNE------IHNVNSK-GLYPCYGGNGLRGYVENYS--H 60 Query: 83 KGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +G I + G+ G A + +V +PK + + + L + + Sbjct: 61 EGNINIIGRQGALCGNVKYARGKFYATEHAVVTKPKININDYWLHFALKEL---DLNRLA 117 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVR 180 GA + + +P P+ Q + + Sbjct: 118 TGAAQPGLTVGKLNEVEIPKVPIELQNQFADFVNKVEKL 156 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 12/95 (12%), Positives = 28/95 (29%), Gaps = 3/95 (3%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 I Q + A + I+ +L + ++ DL ++ Sbjct: 64 INIIGRQGALCGNVKYARGKFYATEHAVVTKPKININDYWLHFALKELDLNRL---ATGA 120 Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 + L + + + PI+ Q + +N Sbjct: 121 AQPGLTVGKLNEVEIPKVPIELQNQFADFVNKVEK 155 >gi|209554402|ref|YP_002284451.1| reStriction-modification enzyme mpuuiii s subunit [Ureaplasma urealyticum serovar 10 str. ATCC 33699] gi|209541903|gb|ACI60132.1| reStriction-modification enzyme mpuuiii s subunit [Ureaplasma urealyticum serovar 10 str. ATCC 33699] Length = 129 Score = 44.0 bits (102), Expect = 0.048, Method: Composition-based stats. Identities = 13/128 (10%), Positives = 44/128 (34%), Gaps = 9/128 (7%) Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG----IDSTYLAWLMRSYDLC 343 + F I Q + I ++ +K ++ ++ ++++ Sbjct: 1 MYDGEFITISADGAYAGTVFLQNGKFSITNVCFILMKNKDIDFKFNNKFVYYILKKEQEI 60 Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF---DITNVINVETARIDVLVEKIEQSI 400 + R +++ +K + + +P ++ Q I + + + + + + + Sbjct: 61 NRLKSQVGSSRPAVREYSLKEIKINLPNMEIQEEFSKIVEPLLNLSTKANRIEKILND-- 118 Query: 401 VLLKERRS 408 LLK + Sbjct: 119 CLLKNVKK 126 >gi|15675214|ref|NP_269388.1| hypothetical protein SPy_1254 [Streptococcus pyogenes M1 GAS] gi|71910777|ref|YP_282327.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS5005] gi|13622382|gb|AAK34109.1| hypothetical protein SPy_1254 [Streptococcus pyogenes M1 GAS] gi|71853559|gb|AAZ51582.1| type I restriction-modification system specificity subunit [Streptococcus pyogenes MGAS5005] Length = 198 Score = 44.0 bits (102), Expect = 0.049, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 68/186 (36%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ S D+ I L D+ + +Y + Sbjct: 14 EKVTLGTVVDCFKGKAVSSKVVPGDVGLINLSDMGTLGIQYHQLRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G +L G + + + D + S+ VL+P+ +L ++ +L S ++ Sbjct: 74 DGDVLIASKGTLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIGQALLDV 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + +I +R T ++ E E Sbjct: 134 ADHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLNRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 46/128 (35%), Gaps = 9/128 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ K + Q + ++ + + Y+ + + S Sbjct: 69 RYLLEDGDVLIASKG-TLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIG 127 Query: 343 CKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV----INVETARIDVLVEK-- 395 + G +L +++ +P+ V P+ +Q + N + +++ ++ Sbjct: 128 QALLDVADHGKDVINLSTKELLDIPIPVIPLVKQDYLINHYLRGLTDYHRKLNRAEQEWE 187 Query: 396 -IEQSIVL 402 I+ I Sbjct: 188 YIQNEIQK 195 >gi|269863276|ref|XP_002651162.1| hypothetical protein EBI_27262 [Enterocytozoon bieneusi H348] gi|220065024|gb|EED42893.1| hypothetical protein EBI_27262 [Enterocytozoon bieneusi H348] Length = 190 Score = 44.0 bits (102), Expect = 0.050, Method: Composition-based stats. Identities = 16/128 (12%), Positives = 40/128 (31%), Gaps = 4/128 (3%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + + PG++V K + + ++ +LAW + Sbjct: 50 DEKYLSHCLRPGDVVLPSRG-DYYKAWFFEGAEEPVFPMGQLNVITPEANLNGRFLAWYL 108 Query: 338 RSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 K+ + ++L + L + VP + Q I ++ T ++ + Sbjct: 109 NQPATQVKISVMLTGTGIKALTKSALLSLEIEVPAMDRQKQIAE-MDETTEKM-AAIRHR 166 Query: 397 EQSIVLLK 404 + L+ Sbjct: 167 LSELDRLE 174 >gi|167949253|ref|ZP_02536327.1| Type I restriction-modification system specificity subunit [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 109 Score = 44.0 bits (102), Expect = 0.051, Method: Composition-based stats. Identities = 17/102 (16%), Positives = 35/102 (34%), Gaps = 11/102 (10%) Query: 6 AYPQYKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP 65 +P+++++G W+V P + K T + E+ K+++ I + +Y Sbjct: 19 RFPEFREAG---------EWEVKPFEEGFKRLTNKNIENNKNVLTISAQLGLVSQLEYFN 69 Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGIC 107 K D S + +G Y K + Sbjct: 70 KK--VAAKDLSGYYLLHRGDFAYNKSYSNGYPMGAIKPAKVV 109 >gi|148544101|ref|YP_001271471.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri DSM 20016] gi|325682359|ref|ZP_08161876.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM4-1A] gi|148531135|gb|ABQ83134.1| restriction modification system DNA specificity domain [Lactobacillus reuteri DSM 20016] gi|324978198|gb|EGC15148.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM4-1A] Length = 211 Score = 44.0 bits (102), Expect = 0.052, Method: Composition-based stats. Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 4/125 (3%) Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + I + + + +V P+ S + + ++ + Sbjct: 91 LPTNTILFSSRAPIGYISIAKNNLATNQGFKSVIPNKEYSFQFIYELLKHETAAIKNEAN 150 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + + + +K+ + +P ++ T+ N T I + K+E+ L + Sbjct: 151 GSTFKEISGKKLKQHIINIPNSED----TSKFNEITKPIFKQLRKLEEENEKLLAIKKEL 206 Query: 411 IAAAV 415 + Sbjct: 207 LEKYF 211 >gi|158313868|ref|YP_001506376.1| N-6 DNA methylase [Frankia sp. EAN1pec] gi|158109273|gb|ABW11470.1| N-6 DNA methylase [Frankia sp. EAN1pec] Length = 775 Score = 44.0 bits (102), Expect = 0.052, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 66/207 (31%), Gaps = 14/207 (6%) Query: 24 HWKVVPIKRFTKLNTGRTSE------SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ +P+ + G + I + ++ + P+ + D + Sbjct: 570 GWRRLPLGDVCDVLAGFSGAVRTERGLPSGIPVVKPRNLVDN--RISPEGVDYVAPDVAA 627 Query: 78 ---VSIFAKGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQP-KDVLPELLQGWLLSI 131 G I+ + G R+A++ + + T L L+P + V P L +L Sbjct: 628 RMERYRLRAGDIVCVRTGQLGRQALVTEEQSGWLIGTSCLRLRPDESVDPRYLVHFLALP 687 Query: 132 DVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 +++ + G+ + + +P+ +P +Q I + + R Sbjct: 688 QISEWLLGHSTGSAIRVLTAATMRGLPLVLPDRHQQGRIGSAAGSLDDLVAVHDQIRQVS 747 Query: 192 IELLKEKKQALVSYIVTKGLNPDVKMK 218 L + G P+ K Sbjct: 748 SALRDALLPLFLQDPTPPGPVPEEGSK 774 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 46/132 (34%), Gaps = 6/132 (4%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 + G+IV + +L + + I TS +D YL + Sbjct: 630 ERYRLRAGDIVCVRTGQLGRQ-ALVTEEQSGWLIGTSCLRLRPDESVDPRYLVHFLALPQ 688 Query: 342 LCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + + +G + L ++ LP+++P +Q I + +D LV +Q Sbjct: 689 ISEWLLGHSTGSAIRVLTAATMRGLPLVLPDRHQQGRIGS----AAGSLDDLVAVHDQIR 744 Query: 401 VLLKERRSSFIA 412 + R + + Sbjct: 745 QVSSALRDALLP 756 >gi|265763429|ref|ZP_06091997.1| type I restriction endonuclease S subunit [Bacteroides sp. 2_1_16] gi|263256037|gb|EEZ27383.1| type I restriction endonuclease S subunit [Bacteroides sp. 2_1_16] Length = 219 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 65/186 (34%), Gaps = 11/186 (5%) Query: 30 IKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + L G +SGK I+ I E + +D + Sbjct: 36 LSNIATLKNGYAFQSGKYNALGKWKILTITNVSGERYINDEDYNCIINLPNDIQDHQVLK 95 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +G IL G R ++ D D + + + L+ K+V E L L S + A Sbjct: 96 EGDILISLTGNVGRVSLCKDGDYLLNQRVGLLQLAKNVNQEFLYQILSSQRFENSMIACG 155 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +GA + + + +P +L+ KI+ D I R + LL +KQ Sbjct: 156 QGAAQMNIGKGDVESYVLPYSSNVNNILLVAKILHSY---DEYIINEQRKLTLLTMQKQY 212 Query: 202 LVSYIV 207 ++ + Sbjct: 213 FLAQMF 218 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 59/181 (32%), Gaps = 18/181 (9%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK---PESYETYQIVDPGEIV 292 + K L + IL+++ + + + + P + +Q++ G+I+ Sbjct: 41 TLKNGYAFQSGKYNALGKWKILTITNVSGERYINDEDYNCIINLPNDIQDHQVLKEGDIL 100 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + ++ +L ++ S A G G Sbjct: 101 ISLTGNVGRVSLCKDGDYLLNQRVG---LLQLAKNVNQEFLYQILSSQRFENSMIACGQG 157 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI----DVLVEKIEQSIVLLKERR 407 + ++ DV+ + N I + A+I D + ++ + LL ++ Sbjct: 158 AAQMNIGKGDVESYVLPYSSN------VNNI-LLVAKILHSYDEYIINEQRKLTLLTMQK 210 Query: 408 S 408 Sbjct: 211 Q 211 >gi|197119367|ref|YP_002139794.1| type I restriction/modification system DNA methyltransferase [Geobacter bemidjiensis Bem] gi|197088727|gb|ACH39998.1| type I restriction/modification system DNA methyltransferase, putative [Geobacter bemidjiensis Bem] Length = 707 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 21/99 (21%), Positives = 39/99 (39%), Gaps = 10/99 (10%) Query: 285 IVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA-------VKPHGIDSTYLAWLM 337 +P +IVF + K +L E G ++ + LA ++ Sbjct: 565 RYEPADIVFARMRPNLRKVALMVF--PEGGYVSPECAVLSVRKGKDDQPLVKPEVLAAIL 622 Query: 338 RSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQ 375 RS + + SG+ R L +D++++ + VPP Q Sbjct: 623 RSDLVFGQIMHLISGIGRPRLNSKDLRKVLIPVPPSAIQ 661 >gi|332983075|ref|YP_004464516.1| restriction modification system DNA specificity domain-containing protein [Mahella australiensis 50-1 BON] gi|332700753|gb|AEE97694.1| restriction modification system DNA specificity domain protein [Mahella australiensis 50-1 BON] Length = 203 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 50/158 (31%), Gaps = 7/158 (4%) Query: 228 VPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVD 287 PD + F ++ + ++ + ++ ++ +G E+ Y+ Sbjct: 12 CPDGVKYVSFAEVIDYEQPTKYIVSSTDYDNNYKIPVLTAGQSFILGYTDETDGLYRASK 71 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 ++ + E + +SA + P + +L + + Sbjct: 72 EKPVIIFDDFTTS-----LHWVDFEFKVKSSAIKILTPKNTNIAVFRYLYYAMSNTRYQP 126 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 RQ + + + VPP+ Q +I +++ Sbjct: 127 DFSKHERQWISR--YSKFTIPVPPLPVQQEIVRILDNF 162 >gi|238923781|ref|YP_002937297.1| anti-codon nuclease masking agent (PrrB) [Eubacterium rectale ATCC 33656] gi|238875456|gb|ACR75163.1| anti-codon nuclease masking agent (PrrB) [Eubacterium rectale ATCC 33656] Length = 177 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 13/135 (9%), Positives = 39/135 (28%), Gaps = 10/135 (7%) Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD 341 D I+ + + + +A + Y+ +++++ Sbjct: 43 NRASYDKTNILIARVGAN---AGYVHLASGSYDVSDNTLIADIKPENNLKYIFYILQNIA 99 Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI- 400 L + G + + +K++ + +P Q + +++ L I I Sbjct: 100 LNRFAK---GGGQPLITAGKIKQIEIKIPDQITQDKVVKILDEFEMICTDLNAGIPAEIN 156 Query: 401 ---VLLKERRSSFIA 412 + R + Sbjct: 157 VRNKQYEFYRDKLMT 171 >gi|315648619|ref|ZP_07901716.1| Type I restriction-modification system specificity subunit [Paenibacillus vortex V453] gi|315275998|gb|EFU39346.1| Type I restriction-modification system specificity subunit [Paenibacillus vortex V453] Length = 185 Score = 43.6 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 53/158 (33%), Gaps = 10/158 (6%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 N ++ + + G++VF F+ K + S + I Sbjct: 29 YSYEDLVNDLEGSFLDFQANLYHEHTDGYLSSTGDVVFSFVSS---KAGIVSDLNQGKII 85 Query: 316 ITSAY-MAVKPHGIDSTYLAWLMR-SYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPI 372 + + +D YL + + SY + K +M + L +K + +P I Sbjct: 86 NQNFAKLIFDHRTLDPCYLCYALNESYSVKKQMAISMQGSIVPKLIPAILKEFEIKLPTI 145 Query: 373 KEQFDITN---VINVETARIDVLVEKIEQS-IVLLKER 406 ++Q I + A + E E+ + +L + Sbjct: 146 EKQRTIGKAYFTLKKHHALVKKQAELEERLYLEILNQL 183 Score = 36.7 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 48/153 (31%), Gaps = 10/153 (6%) Query: 29 PIKRFTKLNTGRTSESGKDIIYI-----GLEDVESG-TGKYLPKDGNSRQSDTSTVSIFA 82 ++ + G+ G + + ED+ + G +L N T + + Sbjct: 2 KLEDVVTVRIGKNLSRGNEKNDLTLVAYSYEDLVNDLEGSFLDFQANLYHEHTDGY-LSS 60 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQF---LVLQPKDVLPELLQGWLLSIDVTQRIEA 139 G +++ + + I + F + L S V +++ Sbjct: 61 TGDVVFSFVSSKAGIVSDLNQGKIINQNFAKLIFDHRTLDPCYLCYALNESYSVKKQMAI 120 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 +G+ + + + +P + +Q I + Sbjct: 121 SMQGSIVPKLIPAILKEFEIKLPTIEKQRTIGK 153 >gi|149010475|ref|ZP_01831846.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] gi|147764956|gb|EDK71885.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP19-BS75] gi|327389174|gb|EGE87519.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA04375] Length = 174 Score = 43.6 bits (101), Expect = 0.058, Method: Composition-based stats. Identities = 14/147 (9%), Positives = 39/147 (26%), Gaps = 2/147 (1%) Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 N+K S G + K + G ++ + Sbjct: 7 NNNKKFAVKTGQQCFKFSSGKFLDKHDRVFEGYPAYGGNGIAWKSRKYLIDNPTIIIGRV 66 Query: 303 RSLRSAQVMERG--IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + G I+ + +K L +L+ + + + + Sbjct: 67 GAYCGNVRTTHGKVWISDNAIYIKEFKNSDFNLVFLLELMKVIDFSKFADFSGQPKITQK 126 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETA 387 ++ ++PP+ Q + + + + Sbjct: 127 PLENQKYILPPLALQNEFADFVALVDK 153 >gi|308179941|ref|YP_003924069.1| type I restriction-modification system specificity subunit [Lactobacillus plantarum subsp. plantarum ST-III] gi|308045432|gb|ADN97975.1| type I restriction-modification system specificity subunit [Lactobacillus plantarum subsp. plantarum ST-III] Length = 164 Score = 43.6 bits (101), Expect = 0.059, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 59/170 (34%), Gaps = 17/170 (10%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 T E +IL ++ + + + + G+++ D Sbjct: 9 NYTNNPEDHILVQGNADMKNGYVLPRVWTTQITKKA----EAGDLILSVRAPVGDIGKTD 64 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 V+ RG+ + +K + L ++ D+ K +S+ D+K Sbjct: 65 YDVVLGRGVAS-----IKGNEFIYQTLKYM---NDIGKWTRFSTGSTFESINSADIKDAR 116 Query: 367 VLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL-LKERRSSFIAAAV 415 + P + EQ I N++ D ++ EQ L +K ++S + + Sbjct: 117 IGYPKLNEQNLIGNILEKM----DSIIAANEQVPKLVIKIVKNSLVNLLL 162 >gi|237738545|ref|ZP_04569026.1| type I restriction-modification system specificity determinant [Fusobacterium sp. 2_1_31] gi|229424212|gb|EEO39259.1| type I restriction-modification system specificity determinant [Fusobacterium sp. 2_1_31] Length = 195 Score = 43.6 bits (101), Expect = 0.060, Method: Composition-based stats. Identities = 20/172 (11%), Positives = 53/172 (30%), Gaps = 12/172 (6%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ----- 299 N +N I L+ G I + + + + + G+I+ Sbjct: 28 NSQNIASIIRTTNFLNNGKIDIENKELIKREIDKKKIEQKQLKRGDIIIEKSGGSPNQPV 87 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF 359 + I+S Y+ + R+ K + + Sbjct: 88 GRVVFFDLNSNEIFLCNNFTSILRVKEDINSKYVFYFFRNSYKNKKVLKFQNKTTGIINL 147 Query: 360 ED---VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + + + +P +K Q ++++ ++ ++EK + ++ L+E Sbjct: 148 KLQNYLNESHIFLPELKIQNKRVDILD----NLENIIEKNQNYLIHLRELTK 195 Score = 38.2 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 53/186 (28%), Gaps = 19/186 (10%) Query: 28 VPIKRFTKLNTGRTSESGKDII-----YIGLED-VESGTGKYLPKDGNSRQSDTST--VS 79 + ++ TG + I + + +G K+ R+ D Sbjct: 8 RKLTDICEIITGEWGTEISENSQNIASIIRTTNFLNNGKIDIENKELIKREIDKKKIEQK 67 Query: 80 IFAKGQILYGKLG-----PYLRKAIIA---DFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 +G I+ K G P R + +C+ +L+ K+ + + Sbjct: 68 QLKRGDIIIEKSGGSPNQPVGRVVFFDLNSNEIFLCNNFTSILRVKEDINSKYVFYFFRN 127 Query: 132 DVTQRIEAICEGATMSHADWKGIGNI---PMPIPPLAEQVLIREKIIAETVRIDTLITER 188 + + T + K + + +P L Q + + I+ Sbjct: 128 SYKNKKVLKFQNKTTGIINLKLQNYLNESHIFLPELKIQNKRVDILDNLENIIEKNQNYL 187 Query: 189 IRFIEL 194 I EL Sbjct: 188 IHLREL 193 >gi|312875033|ref|ZP_07735051.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2053A-b] gi|311089428|gb|EFQ47854.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LEAF 2053A-b] Length = 146 Score = 43.6 bits (101), Expect = 0.062, Method: Composition-based stats. Identities = 11/145 (7%), Positives = 37/145 (25%), Gaps = 7/145 (4%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + + N ++ + + + + + V E ++ Sbjct: 7 YVEFKNGKKRPTLKGTIPVYGGNGILDYTNTANMQSGVVIGRVGVYCGSVFLVREECWVS 66 Query: 318 SAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 + + + + L + + V +P + Q Sbjct: 67 DNAIKAMCKENIDLGYLY--YLLSSLHLNERRIGTSQPLLTQNILNNIEVEIPELAIQKK 124 Query: 378 ITNVINVETARIDVLVEKIEQSIVL 402 I++++ + +I K+ I Sbjct: 125 ISSILELLDEKI-----KLNNEINK 144 >gi|257457413|ref|ZP_05622584.1| DNA methylase-type I restriction-modification system [Treponema vincentii ATCC 35580] gi|257445335|gb|EEV20407.1| DNA methylase-type I restriction-modification system [Treponema vincentii ATCC 35580] Length = 271 Score = 43.6 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 27/245 (11%), Positives = 69/245 (28%), Gaps = 12/245 (4%) Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIE 223 + I R+ I K ++ V+ L+ + + + Sbjct: 25 VYYNEQASYYINLAIFRLYEEIGLFDNLKSANYTVKNLKDTFAVSGRLDSEYYQEK--YD 82 Query: 224 WVGLVPDHWEVKPFFALVTELNR---KNTKLIESNILSLSYGNIIQKLETRNMGLKPESY 280 + LVT + + ++ + T E+ Sbjct: 83 RLFEKLSDNNCDKLSNLVTIKKSIEPGSESYQTKGTPFIRVQDLTKFGLTDTNIYLSENE 142 Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV---KPHGIDSTYLAWLM 337 I + + D + IITS+ + K + YLA ++ Sbjct: 143 FKDCIRPKKDTILLSKDGT---VGIAYKMNKSEDIITSSAILHLDVKDKRVLPDYLALVL 199 Query: 338 RSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 S + G + K +++ + + + ++Q I+ ++ + + Sbjct: 200 NSVAVKMQAEKDAGGSIINHWKKSEIENVIIPIIAKEKQEQISKLLIESETLRNESKSIL 259 Query: 397 EQSIV 401 E+++ Sbjct: 260 EKAVK 264 >gi|253569684|ref|ZP_04847093.1| type IC HsdS subunit [Bacteroides sp. 1_1_6] gi|251840065|gb|EES68147.1| type IC HsdS subunit [Bacteroides sp. 1_1_6] Length = 217 Score = 43.6 bits (101), Expect = 0.063, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 49/165 (29%), Gaps = 13/165 (7%) Query: 30 IKRFTKLNTGRTSESGKDI----IYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 F K +G + +S +D YI +V G + + S+ G Sbjct: 30 FSDFGKSYSGLSGKSAEDFGEGCPYITYMNVYQNQIINATNVGLVKINGAEQQSVVHYGD 89 Query: 86 ILYGKLGPYLRKAIIAD---------FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 IL+ + I + ++ + P L ++ + + Sbjct: 90 ILFTLSSETAEEVGIGAVYLGDTYPLYLNSFCFGIHIIDDNKIFPPFLAFYVSTKSFRKV 149 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + + +G+T + P + Q I + + ++ Sbjct: 150 VFPLAQGSTRFNLQKNDFMKKGFSFPTVERQRKIYSALKTYSDKL 194 >gi|260171383|ref|ZP_05757795.1| putative type I restriction enzyme specificity protein [Bacteroides sp. D2] gi|315919696|ref|ZP_07915936.1| restriction modification system DNA specificity subunit [Bacteroides sp. D2] gi|313693571|gb|EFS30406.1| restriction modification system DNA specificity subunit [Bacteroides sp. D2] Length = 185 Score = 43.6 bits (101), Expect = 0.064, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 47/153 (30%), Gaps = 13/153 (8%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID 329 K +Y Y + + ++D + S T Y K + + Sbjct: 34 IEISQQKNPTYPVYSSQTSNDGIMGYLDDYMFEGEYISWTTDGANAGTVFYRNGKFNCTN 93 Query: 330 STYLAWLMRSYDLCKVFYAMGSGLRQSLKFE---------DVKRLPVLVPPIKEQFDITN 380 L L + +D V + ++ + + + + +P + EQ I Sbjct: 94 VCGLLKLRKEFDTHFVSLVLAEATKKYVSINLANPKLMNNTMGNIQIRLPKLDEQKRI-- 151 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 ++ ++ L+ + +++ ++ Sbjct: 152 --SIIFRKLQKLLTTHNSLLAEYTKQKQYLLSQ 182 Score = 37.1 bits (84), Expect = 6.3, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 61/186 (32%), Gaps = 14/186 (7%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVSIF 81 + W+ IK ++ GR I +I + ++ T Y + N +F Sbjct: 12 ETWEQFKIKDIAQIGRGRV------ISFIEISQQKNPTYPVYSSQTSNDGIMGYLDDYMF 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 I + G + C+ +L+ + +L+ + + Sbjct: 66 EGEYISWTTDGANAGTVFYRNGKFNCTNVCGLLKLRKEFDTHFVSLVLAEATKKYVSIN- 124 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +GNI + +P L EQ I ++ L+T + ++KQ Sbjct: 125 --LANPKLMNNTMGNIQIRLPKLDEQKR----ISIIFRKLQKLLTTHNSLLAEYTKQKQY 178 Query: 202 LVSYIV 207 L+S + Sbjct: 179 LLSQMF 184 >gi|291530638|emb|CBK96223.1| Restriction endonuclease S subunits [Eubacterium siraeum 70/3] Length = 177 Score = 43.6 bits (101), Expect = 0.067, Method: Composition-based stats. Identities = 9/119 (7%), Positives = 31/119 (26%), Gaps = 4/119 (3%) Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 I + S + + Y + + Y + Sbjct: 60 SISEGGNSCGFVSYNLQKFWSGGHCYTLKIMAEQCRSKYLFFYLKYKEKDIMQLRVGSGL 119 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +++ + ++ V +P K+Q + + E + + ++ ++ Sbjct: 120 PNIQKKSLENFNVKLPNYKKQ----CFVERVFEVVTAKKEIENALLERFQSQKKFLLSK 174 >gi|221231668|ref|YP_002510820.1| type I RM modification enzyme [Streptococcus pneumoniae ATCC 700669] gi|220674128|emb|CAR68647.1| putative type I RM modification enzyme [Streptococcus pneumoniae ATCC 700669] gi|332201356|gb|EGJ15426.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA47368] Length = 193 Score = 43.6 bits (101), Expect = 0.067, Method: Composition-based stats. Identities = 14/147 (9%), Positives = 39/147 (26%), Gaps = 2/147 (1%) Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 N+K S G + K + G ++ + Sbjct: 26 NNNKKFAVKTGQQCFKFSSGKFLDKHDRVFEGYPAYGGNGIAWKSRKYLIDNPTIIIGRV 85 Query: 303 RSLRSAQVMERG--IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + G I+ + +K L +L+ + + + + Sbjct: 86 GAYCGNVRTTHGKVWISDNAIYIKEFKNSDFNLVFLLELMKVIDFSKFADFSGQPKITQK 145 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETA 387 ++ ++PP+ Q + + + + Sbjct: 146 PLENQKYILPPLALQNEFADFVALVDK 172 >gi|227365149|ref|ZP_03849163.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] gi|227069813|gb|EEI08222.1| restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] Length = 200 Score = 43.6 bits (101), Expect = 0.067, Method: Composition-based stats. Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 4/125 (3%) Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG 350 + I + + + +V P+ S + + ++ + Sbjct: 80 LPTNTILFSSRAPIGYISIAKNNLATNQGFKSVIPNKEYSFQFIYELLKHETAAIKNEAN 139 Query: 351 SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 + + + +K+ + +P ++ T+ N T I + K+E+ L + Sbjct: 140 GSTFKEISGKKLKQHIINIPNSED----TSKFNEITKPIFKQLRKLEEENEKLLAIKKEL 195 Query: 411 IAAAV 415 + Sbjct: 196 LEKYF 200 >gi|207092852|ref|ZP_03240639.1| type I R-M system specificity subunit [Helicobacter pylori HPKX_438_AG0C1] Length = 44 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 10/37 (27%), Positives = 20/37 (54%) Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQ 398 ++++ + +PP+ EQ I N+++ I L K Q Sbjct: 5 MQQIQIPIPPLDEQIAIANILSALDHEIISLKNKKRQ 41 >gi|316984504|gb|EFV63472.1| type I restriction enzyme specificity protein HsdS [Neisseria meningitidis H44/76] Length = 61 Score = 43.2 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 9/42 (21%), Positives = 19/42 (45%) Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 +K + + +PP+ EQ I +++ + E + I L Sbjct: 1 MIKDISIPIPPLPEQEKIVAILDKFDTLTHSISEGLPYEIAL 42 >gi|293372406|ref|ZP_06618790.1| conserved domain protein [Bacteroides ovatus SD CMC 3f] gi|292632589|gb|EFF51183.1| conserved domain protein [Bacteroides ovatus SD CMC 3f] Length = 232 Score = 43.2 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 27/224 (12%), Positives = 66/224 (29%), Gaps = 18/224 (8%) Query: 207 VTKGLNPDVKMKDSGIEWVG------LVPDHWEVKPFFALVTELNRKNTKLIESNILSLS 260 K SG E V ++P W + L + L N L Sbjct: 13 FDFPNEKGKPYKSSGGEMVWNEKLKRMIPKEWTNANIYQLASISKETVNPLARPNELFKH 72 Query: 261 YG-NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 Y K T + V I+ ++ + + + I ++ Sbjct: 73 YSLPEYDKTGTYAEEYGIDIQSAKFTVTNNCILVSKLNPWTSRVICGNRES--NQICSTE 130 Query: 320 YMAVKPHGIDSTYLAWLM-RSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQ 375 ++ P + + +++ +S + +G + + E + + + Sbjct: 131 FVVWNPASMKTKGFLFMLAKSAKFIEYCTQGATGTSHSHRRINPELMMKFDFSY-NSEIA 189 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQI 419 + +I ++ + + +L ++R + + GQI Sbjct: 190 IKFSRLIENIIGKLHDNIA----QLKVLTKQRDELLPLLMNGQI 229 >gi|325125905|gb|ADY85235.1| Type I restriction-modification system specificity subunit [Lactobacillus delbrueckii subsp. bulgaricus 2038] Length = 187 Score = 43.2 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 40/100 (40%), Gaps = 6/100 (6%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCK 344 + G++ R I+ + A + A + +D YL + + + D+ K Sbjct: 57 FNEGDLALRLINP--QAAVVSPATAGSILSLNFAKIVPNRTKVDEWYLCYYLNEAEDIQK 114 Query: 345 VFYAMGSG---LRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 G + L + ++ L +++P +++Q ++ + Sbjct: 115 QIELSAQGQVSTIKRLGAKFLRELKIVLPDLEKQKELGQI 154 >gi|229105724|ref|ZP_04236353.1| Type I restriction enzyme, methylase subunit [Bacillus cereus Rock3-28] gi|228677613|gb|EEL31861.1| Type I restriction enzyme, methylase subunit [Bacillus cereus Rock3-28] Length = 202 Score = 43.2 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 15/88 (17%), Positives = 34/88 (38%), Gaps = 2/88 (2%) Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR-SYDLCKVFYAMG-SGL 353 + + K + + + + V + +D ++ W + + K + Sbjct: 76 MHTLSQKVAFLPEKYGGLLLTNNFVKIVFTNSVDLYFMEWYLNEHPTIRKQIELFSEGSV 135 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 SLK ++K + VL+PP + Q I + Sbjct: 136 ISSLKLSNLKDIEVLLPPYERQKQIGKI 163 >gi|291516260|emb|CBK69876.1| hypothetical protein BIL_01930 [Bifidobacterium longum subsp. longum F8] Length = 148 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 22/54 (40%), Gaps = 4/54 (7%) Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 P +E + + I V+ EQ L+ R + + ++G+ID+ Sbjct: 2 PNPSNEEIKNFCTFAD----PIYRHVQINEQQTAKLELLRDTLLPKLMSGEIDV 51 >gi|325911615|ref|ZP_08174023.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 143-D] gi|325476601|gb|EGC79759.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners UPII 143-D] Length = 180 Score = 43.2 bits (100), Expect = 0.081, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 52/173 (30%), Gaps = 18/173 (10%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMG-----LKPESYETYQIVDPGE 290 + + + + + I + N+ + + G + + + Sbjct: 14 TLCSDIIDCSHSTPVWRDRGIRVIRNFNLNEGSLDFSKGAFVDEKTYLERTKRAVPEAED 73 Query: 291 IVFRFIDLQNDKRSLRSAQVMERGIITS--AYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 IV + + + + V S+YL + + S + F Sbjct: 74 IVISREAPMGTVAIIPHNL---KCCLGQRLVLLKVNSDICSSSYLLFALMSGFVQNQFNK 130 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 +GS +L ++K + + +K I ++ +I ++ + I Sbjct: 131 IGS-TVSNLTIPELKETKIPL--VKNHKAIGKLLESIANKI-----QVNKQIN 175 >gi|284052298|ref|ZP_06382508.1| restriction modification system DNA specificity subunit [Arthrospira platensis str. Paraca] Length = 166 Score = 43.2 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 11/90 (12%), Positives = 27/90 (30%), Gaps = 2/90 (2%) Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERG-IITSAYMAVKPHGIDSTYLAWLMRSYDLCK 344 +PG+I+ I K + G ++ + + + +L +L+ S + Sbjct: 67 YEPGDILLGNIRPYLKKVWKATNSGGCSGDVLAVRILGQCKKNVSADFLYYLLSSDEFFL 126 Query: 345 VFYAMGSGL-RQSLKFEDVKRLPVLVPPIK 373 G + + +P Sbjct: 127 YNMQHAKGAKMPRGNKAAILNYQIPIPCPD 156 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 53/144 (36%), Gaps = 9/144 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKD-IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 + + + + R S D ++G++++ + G + ++ G Sbjct: 14 EWKLLGDVAQYSPTRVDSSKLDATSFVGVDNLVADKGGRVDASYFPNTDRLTSY---EPG 70 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQ-----FLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 IL G + PYL+K A G CS L K+V + L L S + Sbjct: 71 DILLGNIRPYLKKVWKATNSGGCSGDVLAVRILGQCKKNVSADFLYYLLSSDEFFLYNMQ 130 Query: 140 ICEGATMSHADWKGIGNIPMPIPP 163 +GA M + I N +PIP Sbjct: 131 HAKGAKMPRGNKAAILNYQIPIPC 154 >gi|19746183|ref|NP_607319.1| hypothetical protein spyM18_1203 [Streptococcus pyogenes MGAS8232] gi|19748364|gb|AAL97818.1| hypothetical protein spyM18_1203 [Streptococcus pyogenes MGAS8232] Length = 198 Score = 43.2 bits (100), Expect = 0.083, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 68/186 (36%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ S D + L D+ + +Y + Sbjct: 14 EKVTLGTVVDYFKGKAVSSKVVPGDAGLVNLSDMGTLGIQYHQLRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G +L G + + + D + S+ VL+P+ +L ++ +L S ++A Sbjct: 74 DGDVLIASKGTLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIGQALLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + +I +R T ++ E E Sbjct: 134 ADHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLSRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 Score = 37.1 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 46/128 (35%), Gaps = 9/128 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ K + Q + ++ + + Y+ + + S Sbjct: 69 RYLLEDGDVLIASKG-TLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIG 127 Query: 343 CKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV----INVETARIDVLVEK-- 395 + A G +L +++ +P+ V P+ +Q + N + ++ ++ Sbjct: 128 QALLDAADHGKDVINLSTKELLDIPIPVIPLVKQDYLINHYLRGLTDYHRKLSRAEQEWE 187 Query: 396 -IEQSIVL 402 I+ I Sbjct: 188 YIQNEIQK 195 >gi|332076336|gb|EGI86801.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17545] Length = 131 Score = 43.2 bits (100), Expect = 0.086, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 47/141 (33%), Gaps = 15/141 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K WKV + G+ + VE GK+ P G+ + I Sbjct: 6 KEWKVSKWNEILTIRNGKNQKQ-----------VEDADGKF-PIYGSGGIMGYAKDWIVK 53 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 K ++ G+ G + ++ + T F + + + + + E + + Sbjct: 54 KNSVIIGRKGNINKPILVRENFWNVDTAFGLEPVLEKINSEYLFYFCQLY---NFEKLNK 110 Query: 143 GATMSHADWKGIGNIPMPIPP 163 T+ + NI +P+P Sbjct: 111 AVTIPSLTKSDLLNISIPLPH 131 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 30/91 (32%), Gaps = 6/91 (6%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y IV ++ N +R + I+S YL + + Sbjct: 46 YAKDWIVKKNSVIIGRKGNINKPILVRENFWNVDTAFG---LEPVLEKINSEYLFYFCQL 102 Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 Y+ K+ A+ SL D+ + + +P Sbjct: 103 YNFEKLNKAV---TIPSLTKSDLLNISIPLP 130 >gi|207110599|ref|ZP_03244761.1| restriction modification system DNA specificity subunit [Helicobacter pylori HPKX_438_CA4C1] Length = 94 Score = 43.2 bits (100), Expect = 0.087, Method: Composition-based stats. Identities = 5/70 (7%), Positives = 20/70 (28%), Gaps = 9/70 (12%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 P +W+ V + ++ G + ++ ++ + D+ + + Sbjct: 24 PLNWQRVRLGDIAEIKRGASPRPIENPKWFCANSNVGWVRISDISKNSRFLYKTAQKLSK 83 Query: 73 SDTSTVSIFA 82 + Sbjct: 84 KGIEKSRLVK 93 >gi|257883800|ref|ZP_05663453.1| predicted protein [Enterococcus faecium 1,231,501] gi|257819638|gb|EEV46786.1| predicted protein [Enterococcus faecium 1,231,501] Length = 192 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 50/162 (30%), Gaps = 12/162 (7%) Query: 252 IESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVM 311 I S + E +++ +D ++V + + + Sbjct: 30 INYYDQSSFDEDDKHHGEMSRDEKINYLFDSEVSLDKRDVVIS--NSLQRATMVSEKNIG 87 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYD---LCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + + + +D Y +L Y K G+G Q L + +++L + Sbjct: 88 KVLSLNFTKVEFHSEKLDKRYFLYLFNQYKDIQRQKERELQGTGPVQRLTKQSLEQLVIP 147 Query: 369 VPPIKEQFDITNVINVETARIDVL-VEKIEQSIVLLKERRSS 409 V EQ I + I+ L ++ L E+ + Sbjct: 148 VVSSSEQQRIGEI------YIETLKIQSKLSQYARLTEQFAG 183 >gi|289168440|ref|YP_003446709.1| restriction endonuclease S subunit [Streptococcus mitis B6] gi|288908007|emb|CBJ22847.1| restriction endonuclease S subunit [Streptococcus mitis B6] Length = 191 Score = 42.9 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 12/86 (13%), Positives = 32/86 (37%), Gaps = 12/86 (13%) Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 Y+ + + FY ++ + + +PP++ Q I +++ T + Sbjct: 103 KYIYYCFCN------FYKKEGSYKRHWSNAKI--TLIPIPPLEIQEKIVQILDKFTDYVT 154 Query: 391 VLVEKIEQSIVLLKER----RSSFIA 412 L ++ + L K++ R + Sbjct: 155 ELTSELTSELTLRKKQYSYFRDYLLN 180 >gi|328465098|gb|EGF36369.1| type I restriction-modification system, S subunit [Lactobacillus helveticus MTCC 5463] Length = 108 Score = 42.9 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 14/52 (26%), Positives = 22/52 (42%), Gaps = 2/52 (3%) Query: 368 LVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE-RRSSFIAAAVTGQ 418 +PP+ EQ I I A + VE Q L+ +S + A+ G+ Sbjct: 1 PLPPLSEQSRIAAKIAQLFALL-RKVETSTQQYAKLQTLLKSKVLDLAIRGK 51 >gi|268572389|ref|XP_002648950.1| Hypothetical protein CBG21263 [Caenorhabditis briggsae] Length = 514 Score = 42.9 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 67/228 (29%), Gaps = 15/228 (6%) Query: 189 IRFIELLKEKKQALVSYIVTK--GLNPDVKMKDSGIEWVGLVPDHWEVKP-FFALVTELN 245 R + L + Q + G + + IE G L Sbjct: 281 CRLLVLWTKNDQKDDAESFKWILGNTKECPKCQAPIEKNGGCNHMTCNNKSCRHEFCWLC 340 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 N + + ++ G+ ++ N+ + E ++T + + + + + Sbjct: 341 MGNWIGHQQCNVFVATGDSNREKTLANL-QRFEFFKTRYLGHQQSLKLENDLRTDIRHKM 399 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 R + K + LM SY + L D++ Sbjct: 400 RQLKEFFDLTTFQVIYLEKALNALTECRRTLMYSYIFAYYLEPNLNSKIFQLNQRDLESA 459 Query: 366 PVLVPPIKEQFDITNVINVETAR--IDVLVEKIEQSIVLLKERRSSFI 411 EQ + ++ + ++ L +++ + +++RR S + Sbjct: 460 T-------EQL--SEILERKLEEDDLESLKQRVTEKYQYVEQRRQSLL 498 >gi|298483405|ref|ZP_07001582.1| type I restriction-modification system, S subunit [Bacteroides sp. D22] gi|298270353|gb|EFI11937.1| type I restriction-modification system, S subunit [Bacteroides sp. D22] Length = 114 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 11/86 (12%), Positives = 23/86 (26%), Gaps = 1/86 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +PK W + + GR + +D ++ GL + N + Sbjct: 29 QLPKGWTTIKVGDVAIYTNGRAFKP-EDWMHEGLPIIRIQNLNDNSASYNRTPKTYESKY 87 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDG 105 + G +L+ Sbjct: 88 LIHNGDLLFAWAASLGTYIWNGGKAW 113 >gi|150006172|ref|YP_001300916.1| type I restriction endonuclease S subunit [Bacteroides vulgatus ATCC 8482] gi|149934596|gb|ABR41294.1| type I restriction endonuclease S subunit [Bacteroides vulgatus ATCC 8482] Length = 108 Score = 42.9 bits (99), Expect = 0.10, Method: Composition-based stats. Identities = 11/77 (14%), Positives = 25/77 (32%) Query: 314 GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 I +A + D +L + + + +++L + +P I Sbjct: 10 WITGNAMVINTDKYQDKVCKRYLYHYLSAYNFNSIISGSGQPQIVRTPLEKLKITLPTIS 69 Query: 374 EQFDITNVINVETARID 390 EQ + + +ID Sbjct: 70 EQKQKAIIFDKIQDKID 86 >gi|229548227|ref|ZP_04436952.1| possible type I restriction enzyme, S subunit [Enterococcus faecalis ATCC 29200] gi|229306644|gb|EEN72640.1| possible type I restriction enzyme, S subunit [Enterococcus faecalis ATCC 29200] Length = 164 Score = 42.9 bits (99), Expect = 0.11, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 39/148 (26%), Gaps = 2/148 (1%) Query: 240 LVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQ 299 E + + S+ + +Y + L E + V+ +I+ Sbjct: 16 HKHEWSSSGVRFFRSSDIMSAYNGTTNQKAFIPNELYEELIKKSGKVNLDDILVTGGGSV 75 Query: 300 NDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLK 358 L S + ID +L S K ++ G Sbjct: 76 G-VPYLVSDEKPLYFKDADLLWIKNSGVIDGQFLYTFFISTFFRKYIKSISHIGTISHYT 134 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVET 386 K P+ +P KEQ I + Sbjct: 135 IVQAKETPIKLPSFKEQGSIGSFFKYLD 162 >gi|301299371|ref|ZP_07205652.1| type I restriction modification DNA specificity domain protein [Lactobacillus salivarius ACS-116-V-Col5a] gi|300853025|gb|EFK80628.1| type I restriction modification DNA specificity domain protein [Lactobacillus salivarius ACS-116-V-Col5a] Length = 163 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 45/140 (32%), Gaps = 11/140 (7%) Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + GN G + + V +++ + + G + S Sbjct: 17 MKGGNTNYLETNYLNGGTAQKVDALADVSKDDVLILWDGS-----KAGTIYHGFEGALGS 71 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 A P S + + + K++ + + + ++ V +P I EQ +I Sbjct: 72 TLKAYVPKY--SGDFLYQILKKNQDKIYQSYRTPNIPHVIKNFTEKFNVSIPTIIEQQEI 129 Query: 379 TNVINVETARIDVLVEKIEQ 398 + ++D L+ ++ Sbjct: 130 GDF----FKQLDSLITLHQR 145 >gi|291563844|emb|CBL42660.1| Type I restriction modification DNA specificity domain [butyrate-producing bacterium SS3/4] Length = 360 Score = 42.5 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 29/352 (8%), Positives = 78/352 (22%), Gaps = 46/352 (13%) Query: 24 HWKVVPIKRFTKLNTGRTSE----SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 +W+ VP++ G+ D+ + +V +G ++ + Sbjct: 6 NWESVPLRDLFSFERGKEKNMALLKEGDLPLVSARNVNNGVKGFVGNPTKTLSGGNV--- 62 Query: 80 IFAKGQILYGKLGPYL-RKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 I G A +D T L P++ + ++ + Q Sbjct: 63 ------ITLNNDGDGGAGLAYYQAYDFALDTHVTALIPQNDISPEALLYMTASISKQHDI 116 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + +P + E + ++ + R + + Sbjct: 117 FGHG----RSISLPRAKRLQNMLPVNDDGAPDYELMTDYVKKLRKSMLMRYKAHAI---- 168 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 A + + P ++ + A + ++ + + Sbjct: 169 --ANIKKLGEYLPVPSIQ-------------EMRWEPFLIADIFDILPGKRLVAADSTP- 212 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 GN N ++ + I + Sbjct: 213 ---GNRPFIGALDNNNGVARFVNDSNASLDKNVLGVNYNGNGMVIGFYH---PYECIFSD 266 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + L + + G E + +++P Sbjct: 267 DVKRFHLKHHEDNAFVLLFMKVVILQQKSKFGY--LYKFNAERMANTRIMLP 316 >gi|322411876|gb|EFY02784.1| hypothetical protein SDD27957_05695 [Streptococcus dysgalactiae subsp. dysgalactiae ATCC 27957] Length = 198 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 70/186 (37%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ S D+ I L D+ + +Y + Sbjct: 14 EKVALGDAVDCFKGKAVSSKAEPGDVGLINLSDMGTLGIQYHQLRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G +L G + + + D + S+ VL+P+ +L ++ +L S ++A Sbjct: 74 DGDVLIASKGTLKKVCVFHQQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIGQALLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + +I +R T +++ E E Sbjct: 134 ADHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLKRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 Score = 36.7 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 46/128 (35%), Gaps = 9/128 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ K + Q + ++ + + Y+ + + S Sbjct: 69 RYLLEDGDVLIASKG-TLKKVCVFHQQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIG 127 Query: 343 CKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV----INVETARIDVLVEK-- 395 + A G +L +++ +P+ V P+ +Q + N + ++ ++ Sbjct: 128 QALLDAADHGKDVINLSTKELLDIPIPVIPLVKQDYLINHYLRGLTDYHRKLKRAEQEWE 187 Query: 396 -IEQSIVL 402 I+ I Sbjct: 188 YIQNEIQK 195 >gi|309810067|ref|ZP_07703913.1| type I restriction-modification enzyme, S subunit, EcoA family [Lactobacillus iners SPIN 2503V10-D] gi|308169566|gb|EFO71613.1| type I restriction-modification enzyme, S subunit, EcoA family [Lactobacillus iners SPIN 2503V10-D] Length = 148 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 52/135 (38%), Gaps = 3/135 (2%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS-AYM 321 K + K + ++ +++ + + ++ + + + + Sbjct: 9 MQFSKDGLVYISDKQAAKLKNASIESDDVLLNITGDSVARACIMDSKYLPARVNQHVSII 68 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 P+ I S YL + ++ + A R++L E++ L V +P I++Q +IT + Sbjct: 69 RCDPNKIKSQYLLYYLQYLKKHLLKMASVGSTRKALTKEEISGLLVELPSIEKQKEITLL 128 Query: 382 INVETAR--IDVLVE 394 + + I+ + Sbjct: 129 LESVRHKMQINRQIN 143 >gi|218281997|ref|ZP_03488309.1| hypothetical protein EUBIFOR_00878 [Eubacterium biforme DSM 3989] gi|218216984|gb|EEC90522.1| hypothetical protein EUBIFOR_00878 [Eubacterium biforme DSM 3989] Length = 367 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 15/135 (11%), Positives = 43/135 (31%), Gaps = 10/135 (7%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS-TYLAWL 336 + ++P ++ + ++ E G++ S + +S Y+ Sbjct: 61 IGNKRIFWIEPNCLILNIVFAWEQ--AVAKTSEKEVGMVASHRFPMYKVLNNSLDYIVDF 118 Query: 337 MRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLV 393 ++ ++ G ++L + + +P + EQ + I+ + Sbjct: 119 FKTEKGKQLLQMASPGGAGRNKTLNQDFFLNSKIYLPSLNEQLK----TSELIELIEDRI 174 Query: 394 EKIEQSIVLLKERRS 408 E + I K + Sbjct: 175 ETQIKIIEDYKVLKK 189 >gi|294793951|ref|ZP_06759088.1| conserved hypothetical protein [Veillonella sp. 3_1_44] gi|294455521|gb|EFG23893.1| conserved hypothetical protein [Veillonella sp. 3_1_44] Length = 150 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 43/138 (31%), Gaps = 9/138 (6%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP- 325 + + ++ G+I F K + GII+ + +P Sbjct: 2 YFQDPDKVQSNNLDTRTYVMKKGDIAFEGHPNNEFKFGRFVLNDIGTGIISELFPIYRPI 61 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS----LKFEDVKRLPVLVPPIKEQFDITNV 381 D + + ++ + A + L +LVP I+EQ I + Sbjct: 62 TEYDLDFWKYAIQLERVMAPILAKSITSSGNSSNKLDHNHFLNKELLVPNIEEQKKIGTL 121 Query: 382 INVETARIDVLVEKIEQS 399 +++ + + +Q Sbjct: 122 LSLLSKN----ITLHQQE 135 >gi|86130652|ref|ZP_01049252.1| DNA adenine methylase [Dokdonia donghaensis MED134] gi|85819327|gb|EAQ40486.1| DNA adenine methylase [Dokdonia donghaensis MED134] Length = 833 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 68/221 (30%), Gaps = 16/221 (7%) Query: 203 VSYIVTKGLNPDVKM-KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSY 261 +S I L+P++ KD E +G + K + + K + + S Sbjct: 371 ISDIKGTQLHPNLYFTKDFKGELLGTILKSLNPKRIYKK-ENITGKYFQFGSDQKVESSL 429 Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 I K+E+ + ++ + L + I + Sbjct: 430 IVDISKIESVKIPKSAVEISQTCLIIINRGSDLKVALFEYAGVPIYVSQLSNFFIPN--- 486 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGS-GLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 I Y+++ + S + + S + D++++ + +P KEQ + N Sbjct: 487 PEYNEDISLEYISYTLLSDTVQEQLKLYNSMSSVFIMNKNDIQKIRIEIPSFKEQLEKLN 546 Query: 381 VINVET-------ARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + D L+ K + L+ S + Sbjct: 547 FLRDTHYNFQLNKREFDKLIAKTKD--EALRNY-QSLNHSL 584 >gi|227364527|ref|ZP_03848589.1| possible restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] gi|227070436|gb|EEI08797.1| possible restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] Length = 173 Score = 42.5 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 51/163 (31%), Gaps = 11/163 (6%) Query: 30 IKRFTKLNTGRTSESGKDI-------IYIGLEDVESGTGKYLPKDGN-SRQSDTSTVSIF 81 + ++ G+ G + Y+ + D + + + + Sbjct: 6 LGDIAEIKGGKRMPKGTRLQQEKNQHPYLRITDYDGKSFDRNSIRYVPDEVFEKISNYTV 65 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICS---TQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +G I +G I + ++ + V + + +L S+ +++ Sbjct: 66 TEGDIFLSIVGTIGIATTIDKEYDNANLTENAVKIIPDESVNSKYILYFLQSMLGQRQMN 125 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 + G+T K I I + +P L Q + + +I Sbjct: 126 ELSVGSTQKKLPIKNIKKIKILLPNLEIQNKVVSNLQILDKKI 168 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 54/142 (38%), Gaps = 4/142 (2%) Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYET--YQIVDPGEIVFRFIDLQNDKRSLRSA 308 + L ++ + + E +E V G+I + ++ Sbjct: 28 KNQHPYLRITDYDGKSFDRNSIRYVPDEVFEKISNYTVTEGDIFLSIVGTIGIATTI-DK 86 Query: 309 QVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKRLPV 367 + + +A + ++S Y+ + ++S + + ++ L +++K++ + Sbjct: 87 EYDNANLTENAVKIIPDESVNSKYILYFLQSMLGQRQMNELSVGSTQKKLPIKNIKKIKI 146 Query: 368 LVPPIKEQFDITNVINVETARI 389 L+P ++ Q + + + + +I Sbjct: 147 LLPNLEIQNKVVSNLQILDKKI 168 >gi|313620400|gb|EFR91802.1| type I restriction-modification system, S subunit [Listeria innocua FSL S4-378] Length = 168 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 36/139 (25%), Gaps = 2/139 (1%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 W+ + GR + + + + G + +G Sbjct: 20 WEQRKLGEDVNFLNGRAYSQKELLDKGKYKVLRVGNFN-TNDRWYYSDLELEENKYANRG 78 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 +LY I I L+ ++ + I +RI+ G Sbjct: 79 DLLY-LWATNFGPEIWNQEKVIYHYHIWKLKIMNINVSKQYLYTWLITDKERIKQSTNGT 137 Query: 145 TMSHADWKGIGNIPMPIPP 163 TM H I IPP Sbjct: 138 TMVHVTKSHIEQREFQIPP 156 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 14/138 (10%), Positives = 39/138 (28%), Gaps = 8/138 (5%) Query: 245 NRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS 304 + L + L GN L+ E + + G++++ + + Sbjct: 37 YSQKELLDKGKYKVLRVGNFNTNDRWYYSDLELEENK---YANRGDLLYLWATNFGPEIW 93 Query: 305 LRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR 364 + + + S + D ++ + + +++ Sbjct: 94 ----NQEKVIYHYHIWKLKIMNINVSKQYLYTWLITDKERIKQSTNGTTMVHVTKSHIEQ 149 Query: 365 LPVLVPP-IKEQFDITNV 381 +PP + EQ I + Sbjct: 150 REFQIPPNLTEQQKIGDF 167 >gi|283956447|ref|ZP_06373927.1| hypothetical protein C1336_000250221 [Campylobacter jejuni subsp. jejuni 1336] gi|283792167|gb|EFC30956.1| hypothetical protein C1336_000250221 [Campylobacter jejuni subsp. jejuni 1336] Length = 184 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 14/129 (10%), Positives = 44/129 (34%), Gaps = 10/129 (7%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 + V ID + + ++ Y+++++ + F Sbjct: 64 YDNDSVLWGIDGDWMVGFIPKNKKFYPTDHCGVLRVDDTKI-NAKYISFVLNEAGKKQGF 122 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + + +K L V +P ++ Q I ++ T +I+ + + + + L++ Sbjct: 123 SR-----KLRASIDRIKALRVKLPSLEFQDQIADI----TDKIEKKINEYKIELDRLEKE 173 Query: 407 RSSFIAAAV 415 + + + Sbjct: 174 KEKILQKYL 182 >gi|317481421|ref|ZP_07940488.1| type I restriction modification DNA specificity domain-containing protein [Bacteroides sp. 4_1_36] gi|316902406|gb|EFV24293.1| type I restriction modification DNA specificity domain-containing protein [Bacteroides sp. 4_1_36] Length = 218 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 69/187 (36%), Gaps = 13/187 (6%) Query: 30 IKRFTKLNTGRTSESGKDI-----IYIGLEDVESGTGKYLPKD---GNSRQSDTSTVSIF 81 + L G +S K + + +V SG +D + +D + Sbjct: 35 LSNIATLKNGYAFQSSKYNALGKWKILTITNV-SGERYINDEDCNCIINLPNDIQDHQVL 93 Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 +G IL G R ++ + D + + + L+ K+V E L L S + A Sbjct: 94 KEGDILISLTGNVGRVSLCKNGDYLLNQRVGLLQLAKNVNQEFLYQILSSQKFENSMIAC 153 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 +GA + + + +P +L KI+ D I +R + LL +KQ Sbjct: 154 GQGAAQMNIGKGDVESYVLPYSSNGNNILWVAKILHSY---DECIINEMRRLTLLTMQKQ 210 Query: 201 ALVSYIV 207 L++ + Sbjct: 211 YLLTQMF 217 >gi|301633693|gb|ADK87247.1| type I restriction modification DNA specificity domain protein [Mycoplasma pneumoniae FH] Length = 187 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 23/122 (18%), Positives = 39/122 (31%), Gaps = 5/122 (4%) Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 GE V D S+ + V I +LA+ +R V Y Sbjct: 54 KGEYVTWTTDGA-QAGSVFYRNGQFNATNVCGILKVNNDEIYPKFLAYALRLKAPKFVNY 112 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI-VLLKER 406 A L + + + P K Q I +++ T L ++ + L+ER Sbjct: 113 ACP---IPKLMQGTLAEIELDFPSKKIQEKIATILDTFTELSAELSAELSAELSAELRER 169 Query: 407 RS 408 + Sbjct: 170 KK 171 >gi|292558143|gb|ADE31144.1| hypothetical protein SSGZ1_0687 [Streptococcus suis GZ1] Length = 131 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 13/122 (10%), Positives = 37/122 (30%), Gaps = 9/122 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKD---------IIYIGLEDVESGTGKYLPKDGNSRQSDT 75 W++ + + + G++ ++ I D+++ + Sbjct: 6 WQIKSLSELGRFSRGKSKHRPRNDKKLFTNGTYPLIQTGDIKNSNLYVTKNSDYYNEFGL 65 Query: 76 STVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQ 135 S ++ +G + AI++ ++ LVL L + + + Sbjct: 66 SQSKLWKQGTLCITIAANIAETAILSHPMCFPASVLLVLIAHKNESSELFVYYVFEFNKK 125 Query: 136 RI 137 R Sbjct: 126 RN 127 >gi|34557965|ref|NP_907780.1| DNA methylase-type I restriction-modification system [Wolinella succinogenes DSM 1740] gi|34483683|emb|CAE10680.1| DNA METHYLASE-TYPE I RESTRICTION-MODIFICATION SYSTEM [Wolinella succinogenes] Length = 1073 Score = 42.5 bits (98), Expect = 0.14, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 42/134 (31%), Gaps = 3/134 (2%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFID 297 + + N + + S ++ R+ L ES +T Y + P E V Sbjct: 661 NHLFDYYSFNARYGQPIYDENSTLKVLNSQYVRDYFLDYESAKTGYGEIVPKEAVLINAT 720 Query: 298 LQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA--MGSGLRQ 355 + + + + + + I+ YL ++SY GS + Sbjct: 721 GIGTLGRVNINYLNDSFSVDNHVNVIIAKNINPYYLTIFLKSYYGQSQINRYYSGSSGQI 780 Query: 356 SLKFEDVKRLPVLV 369 + +D V + Sbjct: 781 EIYAKDFNNFLVPI 794 >gi|283954614|ref|ZP_06372132.1| LOW QUALITY PROTEIN: hypothetical protein C414_000240125 [Campylobacter jejuni subsp. jejuni 414] gi|283793806|gb|EFC32557.1| LOW QUALITY PROTEIN: hypothetical protein C414_000240125 [Campylobacter jejuni subsp. jejuni 414] Length = 1035 Score = 42.5 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 50/162 (30%), Gaps = 7/162 (4%) Query: 220 SGIEWVGLVPDHWEVKPFFALVTELNRKNTKL-IESNILSLSYGNIIQKLETRNMGLKPE 278 S E +E+ + +N E ++L+ GN+ ++N + Sbjct: 879 SKDELNPFKNSKFELVRLGEVCDLNKIRNQASATEIEKMNLNSGNVKLLPSSKNYEWWTD 938 Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 Q ++ GE++ L + + + + ++VK +++ Sbjct: 939 EKTAGQFINEGEVI----TLGVARYANIKKHKGKFVSANNHILSVKDKSKIIFDFLYILL 994 Query: 339 SYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 K++ + +PP++ Q I Sbjct: 995 EICGQKLYKQGQQ--YPQFDTNIFYSFKIPLPPLEIQKQIVA 1034 >gi|303327177|ref|ZP_07357619.1| putative dna methylase-type I restriction-modification system [Desulfovibrio sp. 3_1_syn3] gi|302863165|gb|EFL86097.1| putative dna methylase-type I restriction-modification system [Desulfovibrio sp. 3_1_syn3] Length = 241 Score = 42.5 bits (98), Expect = 0.15, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 68/195 (34%), Gaps = 11/195 (5%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 + SG +G + VK ++ ++ + N LS+ I + Sbjct: 45 RSSGCFEIGDFLPNTFVKGIQHEYLDVITDDSVPV-VNTLSIQNMKINMEDCRYIQSDDF 103 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 E+ + + +++ + +++ + + S ++P GI L +L+ Sbjct: 104 ENLSDERKIKINDVLLTVDGGTSIGKAVL-FEETISSTVDSHVCILRPQGIKPLTLVYLL 162 Query: 338 RSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 S F SG + ++ ED++R ++ I+ I+ Sbjct: 163 TSKVGQMQFKIYESGASGQTTVTEEDIRRFIFPSAALE-------SIDEVVRDIEAKRAG 215 Query: 396 IEQSIVLLKERRSSF 410 I + I LK + +S Sbjct: 216 ISKEIEQLKRKENSL 230 >gi|229824145|ref|ZP_04450214.1| hypothetical protein GCWU000282_01449 [Catonella morbi ATCC 51271] gi|229786499|gb|EEP22613.1| hypothetical protein GCWU000282_01449 [Catonella morbi ATCC 51271] Length = 140 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 46/102 (45%), Gaps = 6/102 (5%) Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 E G + + Y V+P Y W + ++ + +G+ +L+FE++K L + + Sbjct: 42 EDGEVDARYAVVQPTIDCVPYYLWNVIQMEMPEFCAQWQTGI--NLQFENLKFLSIPLHS 99 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +EQ I + + + D ++ ++ + L K + + + Sbjct: 100 FEEQKKIADKL----TKYDAWIQAEQKQLDLWKGVKKNMLDK 137 >gi|150006173|ref|YP_001300917.1| type I restriction endonuclease S subunit [Bacteroides vulgatus ATCC 8482] gi|149934597|gb|ABR41295.1| type I restriction endonuclease S subunit [Bacteroides vulgatus ATCC 8482] Length = 212 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 59/181 (32%), Gaps = 18/181 (9%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLK---PESYETYQIVDPGEIV 292 + K L + IL+++ + + + + P + +Q++ G+I+ Sbjct: 34 TLKNDYAFQSGKYNALGKWKILTITNVSGERYINDEDYNCIINLPNDIQDHQVLKEGDIL 93 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG 352 + + + + ++ +L ++ S A G G Sbjct: 94 ISLTGNVGRVSLCKDGDYLLNQRVG---LLQLAKNVNQEFLYQILSSQRFENSMIACGQG 150 Query: 353 L-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI----DVLVEKIEQSIVLLKERR 407 + ++ DV+ + N I + A+I D + ++ + LL ++ Sbjct: 151 AAQMNIGKGDVESYVLPYSSN------VNNI-LLVAKILHSYDEYIINEQRKLTLLTMQK 203 Query: 408 S 408 Sbjct: 204 Q 204 Score = 40.9 bits (94), Expect = 0.39, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 64/186 (34%), Gaps = 11/186 (5%) Query: 30 IKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + L +SGK I+ I E + +D + Sbjct: 29 LSNIATLKNDYAFQSGKYNALGKWKILTITNVSGERYINDEDYNCIINLPNDIQDHQVLK 88 Query: 83 KGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLPELLQGWLLSIDVTQRIEAIC 141 +G IL G R ++ D D + + + L+ K+V E L L S + A Sbjct: 89 EGDILISLTGNVGRVSLCKDGDYLLNQRVGLLQLAKNVNQEFLYQILSSQRFENSMIACG 148 Query: 142 EGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 +GA + + + +P +L+ KI+ D I R + LL +KQ Sbjct: 149 QGAAQMNIGKGDVESYVLPYSSNVNNILLVAKILHSY---DEYIINEQRKLTLLTMQKQY 205 Query: 202 LVSYIV 207 ++ + Sbjct: 206 FLAQMF 211 >gi|288560184|ref|YP_003423670.1| type I restriction-modification enzyme S subunit HsdS [Methanobrevibacter ruminantium M1] gi|288542894|gb|ADC46778.1| type I restriction-modification enzyme S subunit HsdS [Methanobrevibacter ruminantium M1] Length = 190 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 66/178 (37%), Gaps = 19/178 (10%) Query: 241 VTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQN 300 V + +E IL +Y KL+ + E +E + + ++ Sbjct: 19 VKRYQKGKGTTVERPILKKTYSENSSKLDLEYEEVSEEIHERFYSQENDIVIL------- 71 Query: 301 DKRSLRSAQVMERGIITSAY--MAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSL 357 + +++ E GII Y + G D ++ L++S + + + + + Sbjct: 72 -LAGSKVSKIEEAGIIIPMYYAVVRVKEGYDVDFIYHLLKSDIFPRELHKIEEGTTLKII 130 Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI--EQSIVLLKERRSSFIAA 413 K +K + + VP ++ Q + + N+ RI + +E E+ I S I Sbjct: 131 KTTHLKSIYLPVPDLETQINYGKLFNLMDKRIKLNMELAELEKQIE------KSIINE 182 >gi|291545713|emb|CBL18821.1| Type I restriction modification DNA specificity domain [Ruminococcus sp. SR1/5] Length = 166 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 43/153 (28%), Gaps = 11/153 (7%) Query: 30 IKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 +K K+ TG T G I +I +++ SG + + + K Sbjct: 2 LKDTCKVITGNTPSRAIAEYYGDYIEWIKTDNIVSGILNPTQATESLSEKGMNVGRTVEK 61 Query: 84 GQILYGKLG---PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 IL + + + I D + Q + P+ L L + Sbjct: 62 DSILMACIAGSIASIGRVCITDRIVAFNQQINAVVPEQYNILFLYVLLQMSKDYLVEDIN 121 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + IPP+ Q + Sbjct: 122 MALKGI--LSKSKLEEKEFIIPPMDLQEQFSDF 152 Score = 40.5 bits (93), Expect = 0.54, Method: Composition-based stats. Identities = 16/147 (10%), Positives = 46/147 (31%), Gaps = 7/147 (4%) Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQI---VDPGEIVFRFIDLQNDKRSLRS 307 I + NI+ + + S + + V+ I+ I S+ Sbjct: 21 YYGDYIEWIKTDNIVSGILNPTQATESLSEKGMNVGRTVEKDSILMACIAGSI--ASIGR 78 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + +R + + + + +++ + + L+ L ++ Sbjct: 79 VCITDRIVAFNQQINAVVPEQYNILFLYVLLQMSKDYLVEDINMALKGILSKSKLEEKEF 138 Query: 368 LVPPIKEQFDITNVINVETAR--IDVL 392 ++PP+ Q ++ + I+ L Sbjct: 139 IIPPMDLQEQFSDFVKQVNKSKFINQL 165 >gi|163801595|ref|ZP_02195493.1| type I restriction-modification system methyltransferase subunit [Vibrio sp. AND4] gi|159174512|gb|EDP59314.1| type I restriction-modification system methyltransferase subunit [Vibrio sp. AND4] Length = 639 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 14/165 (8%), Positives = 46/165 (27%), Gaps = 15/165 (9%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + I ++ ++ + +N+ + V G + + N Sbjct: 458 SRNFELEYIHHVALKDVCKLRSGKNLNKDDVESKGEFPVYGGNGIIGYYLDANRPGDSVI 517 Query: 308 AQVMERGIITSAYMAVKPHGIDST-----------YLAWLMRSYDLCKVFYAMGSGLRQS 356 + + + + YL +L + ++ Sbjct: 518 IGKVGAHCGNIHFSSKPYWLTTNAISLELLDTTRVYLPYLAHVLKSLDLNNLATGTAQKF 577 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIV 401 + + + V +P +++Q + ++ I+ KI+ + Sbjct: 578 VSINQLYEVEVSLPSLEKQKE----LSDWFTSIEESKSKIQSLLE 618 >gi|237822173|ref|ZP_04598018.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae CCRI 1974M2] Length = 137 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 47/133 (35%), Gaps = 9/133 (6%) Query: 34 TKLNTGRTSESGKD--------IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 ++ G + KD I +I + D E G ++S + KG Sbjct: 2 VEIVRGGSPRPIKDYLTSEVDGINWIKIGDTEKGEKYINNVKEKIKKSGLNKTRFVKKGT 61 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID-VTQRIEAICEGA 144 L + R I+ I + ++ L + ++LS + V + ++ GA Sbjct: 62 FLLTNSMSFGRPYILNVDGAIHDGWLAISNYENSLNKDYLFYILSSNVVYSQFLSLISGA 121 Query: 145 TMSHADWKGIGNI 157 + + + + +I Sbjct: 122 VVKNLNSDKVASI 134 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 13/104 (12%), Positives = 36/104 (34%), Gaps = 4/104 (3%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 + + +K + V G + L + G + ++ Sbjct: 37 KYINNVKEKIKKSGLNKTRFVKKGTFLLTNSMSFGRPYILNVDGAIHDGWLA---ISNYE 93 Query: 326 HGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVL 368 + ++ YL +++ S + F ++ SG ++L + V + + Sbjct: 94 NSLNKDYLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIP 137 >gi|312902304|ref|ZP_07761511.1| hypothetical protein HMPREF9512_00080 [Enterococcus faecalis TX0635] gi|310634275|gb|EFQ17558.1| hypothetical protein HMPREF9512_00080 [Enterococcus faecalis TX0635] Length = 146 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 48/149 (32%), Gaps = 7/149 (4%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI-ITSAYMAVKPH 326 +K E + G I + +E + + + + P Sbjct: 3 DFDNFECVKLEDVAEFGRAKAGYIYPAGTSTIQISATTGQIDFLEYPREVPTKEVVIIPQ 62 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 L+ ++ K +G+ +++ +++ P+ + + Q +++ T Sbjct: 63 NGIEPKYFNLILQRNVEKFIAKYATGI--NIQEKEIGNFPIELFNRETQKAFVRMMDHIT 120 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + E + + KE + +F+ + Sbjct: 121 DE----IATAENELTIYKEMKKAFLGDLM 145 >gi|309808159|ref|ZP_07702070.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LactinV 01V1-a] gi|308168595|gb|EFO70702.1| type I restriction modification DNA specificity domain protein [Lactobacillus iners LactinV 01V1-a] Length = 178 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 46/173 (26%), Gaps = 13/173 (7%) Query: 25 WKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS-T 77 W + T K I + + + Y + + Sbjct: 4 WLEKTLGEVTSFMKKGIPPKYTVEESEKTIRVLNQKCNRNFEISYSESRLHDCEKKIVPA 63 Query: 78 VSIFAKGQILYGK--LGPYLRKAIIADFDG--ICSTQFLVLQPKDVLPELLQGWLLSIDV 133 + G +L +G R A + + G ++L+P + L + G+ + Sbjct: 64 DKMLRAGDVLINSTGIGTAGRVAQVVEVKGPTTIDGHMILLRPSEELNPIYYGYAVKAFQ 123 Query: 134 TQRIEAICEGATMSHADWKGIGN--IPMPIPPLAEQVLIREKIIAETVRIDTL 184 +Q + + + + I Q I + +I T Sbjct: 124 SQIEGLAEGSTGQTEINRMRLQDEVIIKYPKDKLVQENIGRFLSNIDDKIKTN 176 >gi|237752124|ref|ZP_04582604.1| type II restriction-modification enzyme [Helicobacter winghamensis ATCC BAA-430] gi|229376366|gb|EEO26457.1| type II restriction-modification enzyme [Helicobacter winghamensis ATCC BAA-430] Length = 894 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 58/180 (32%), Gaps = 11/180 (6%) Query: 213 PDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN 272 + + S IE + L + + K + +N S ++ Sbjct: 713 KIISLWKSDIEQIALAECGEFIGGLWTGKKPPFIKAKVIRNTN---FSLKGTLKLDSEYP 769 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQND----KRSLRSAQVMERGIITSAY--MAVKPH 326 +S + ++ G+I+ + + + + Q E ++ + V Sbjct: 770 ELEVEKSQFEKRKLEYGDIIIEKSGGSSTQAVGRVVIFTFQTNEPYSFSNFTTRLRVTRD 829 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 I+ +L ++ + +AM G ++L KRL + P IK Q I Sbjct: 830 DINPFFLHLVLHYIYQQGITFAMQGGMSGIRNLDMNLYKRLKIPKPDIKIQTQIVEECEK 889 >gi|15902835|ref|NP_358385.1| Type I restriction-modification enzyme 1, S subunit [Streptococcus pneumoniae R6] gi|116515342|ref|YP_816268.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae D39] gi|15458388|gb|AAK99595.1| Type I restriction-modification enzyme 1, S subunit [Streptococcus pneumoniae R6] gi|116075918|gb|ABJ53638.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae D39] Length = 203 Score = 42.1 bits (97), Expect = 0.18, Method: Composition-based stats. Identities = 22/211 (10%), Positives = 59/211 (27%), Gaps = 12/211 (5%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + L + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKLQLDELNLL-------VK 171 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEV 234 S +P K ++ Sbjct: 172 SRFNEMFGDPLNNNKKFAVKTGQQCFKFSIC 202 Score = 36.7 bits (83), Expect = 7.4, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 40/143 (27%), Gaps = 5/143 (3%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 N LK P +I+ + + + I Sbjct: 35 DDLRNNNNLKFTESLNMTEALPDDILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKE 94 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 I S YL + S + L + L + + I+EQ +I ++N Sbjct: 95 KIISDYLGVFLESKS-QYLREHSTGATIPHLNKNILLDLQLELLGIEEQENIICILNT-- 151 Query: 387 ARIDVLVEKIEQSIVLLKERRSS 409 I L+ K + + L S Sbjct: 152 --IKRLITKRKLQLDELNLLVKS 172 >gi|307255982|ref|ZP_07537778.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306861072|gb|EFM93070.1| Type I restriction enzyme EcoAI specificity protein [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] Length = 198 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 15/110 (13%), Positives = 33/110 (30%), Gaps = 10/110 (9%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 IP+ W V ++ L GR +I ++ + L + Sbjct: 70 EIPESWVWVRLEDIFHLQAGR---------FISASEIYGEYKESLYPCYGGNGLRGFVKT 120 Query: 80 IFAKGQI-LYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 +G+ + G+ G A+ + +V++ L + Sbjct: 121 YNREGKFPIIGRQGALCGNINFAEGKFYSTEHAVVVETFSNTDTLWANYF 170 >gi|46487318|gb|AAS99047.1| Tgh098 [Campylobacter jejuni] Length = 131 Score = 42.1 bits (97), Expect = 0.19, Method: Composition-based stats. Identities = 14/119 (11%), Positives = 33/119 (27%), Gaps = 8/119 (6%) Query: 27 VVPIKRFTKLNTGRTSESGK------DIIYIGLEDVESGT-GKYLPKDGNSRQSDTSTVS 79 +V +K G T DI ++ + D + + S Sbjct: 14 LVKLKICGDFFMGGTPSRKNINYWNGDIKWLTISDYSNRQVIMDTKEKITREGFKNSNAK 73 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 + KG ++ + + + I D + + + P + + + Q Sbjct: 74 MIQKGAVVVS-IYATIGRVGILGEDMTTNQAIVAIIPNEEFINKYLMYAIDYFKFQLYN 131 >gi|210630772|ref|ZP_03296596.1| hypothetical protein COLSTE_00481 [Collinsella stercoris DSM 13279] gi|210160368|gb|EEA91339.1| hypothetical protein COLSTE_00481 [Collinsella stercoris DSM 13279] Length = 69 Score = 42.1 bits (97), Expect = 0.20, Method: Composition-based stats. Identities = 11/73 (15%), Positives = 22/73 (30%), Gaps = 6/73 (8%) Query: 327 GIDSTYLAWLMRSYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 D T + ++ D K F + + + P EQ I + Sbjct: 1 MNDDTDVYFVYSMTDRIKKFAEQKASGSTFLEISGKGLAAGEFAFPSKDEQTAIGS---- 56 Query: 385 ETARIDVLVEKIE 397 ++D L+ + Sbjct: 57 MFKQLDHLITLHQ 69 >gi|327490263|gb|EGF22051.1| type I restriction enzyme EcoDI specificity protein [Streptococcus sanguinis SK1058] Length = 184 Score = 42.1 bits (97), Expect = 0.20, Method: Composition-based stats. Identities = 13/183 (7%), Positives = 43/183 (23%), Gaps = 7/183 (3%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL 298 ++ E K + + S L Sbjct: 4 SIFKEEFSKKEVTNKLGDFFPVITGKKDANIAKGGEYPFFSCSQNISYTDNYSFDARAIL 63 Query: 299 QNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK 358 + + P+ + + Y L + + + + Sbjct: 64 LAGNGDFNVKIFNGKFEAYQRTYVLIPNNDEHFGYLYYAIKYFLNDITSGHRGSVIKFIT 123 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 ++ + + KE + + + ++ + + I L R + + ++ + Sbjct: 124 KGQIEHFNIFMTSNKE------KLFLFNSFVEN-IANNNKEIDKLSNIRDTLLPKLLSDE 176 Query: 419 IDL 421 I + Sbjct: 177 ISV 179 >gi|251782484|ref|YP_002996786.1| hypothetical protein SDEG_1073 [Streptococcus dysgalactiae subsp. equisimilis GGS_124] gi|242391113|dbj|BAH81572.1| hypothetical protein SDEG_1073 [Streptococcus dysgalactiae subsp. equisimilis GGS_124] gi|323127370|gb|ADX24667.1| hypothetical protein SDE12394_05975 [Streptococcus dysgalactiae subsp. equisimilis ATCC 12394] Length = 198 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 70/186 (37%), Gaps = 10/186 (5%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + V + G+ S D+ I L D+ + +Y + Sbjct: 14 EKVALGEAVDCFKGKAVSSKAEPGDVGLINLSDMGTLGIQYHQLRTFQMDRRQLLRYLLE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 G +L G + + + D + S+ VL+P+ +L ++ +L S ++A Sbjct: 74 DGDVLIASKGTLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIGQALLDA 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 G + + K + +IP+P+ PL +Q + +I +R T +++ E E Sbjct: 134 ADHGKDVINLSTKELLDIPIPVIPLVKQ----DYLINHYLRGLTDYHRKLKRAEQEWEYI 189 Query: 200 QALVSY 205 Q + Sbjct: 190 QNEIQK 195 Score = 36.7 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 46/128 (35%), Gaps = 9/128 (7%) Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 +++ G+++ K + Q + ++ + + Y+ + + S Sbjct: 69 RYLLEDGDVLIASKG-TLKKVCVFHKQNRDVVASSNITVLRPQKLLRGYYIKFFLDSPIG 127 Query: 343 CKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV----INVETARIDVLVEK-- 395 + A G +L +++ +P+ V P+ +Q + N + ++ ++ Sbjct: 128 QALLDAADHGKDVINLSTKELLDIPIPVIPLVKQDYLINHYLRGLTDYHRKLKRAEQEWE 187 Query: 396 -IEQSIVL 402 I+ I Sbjct: 188 YIQNEIQK 195 >gi|225568966|ref|ZP_03777991.1| hypothetical protein CLOHYLEM_05045 [Clostridium hylemonae DSM 15053] gi|225162465|gb|EEG75084.1| hypothetical protein CLOHYLEM_05045 [Clostridium hylemonae DSM 15053] Length = 621 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 17/139 (12%), Positives = 37/139 (26%), Gaps = 9/139 (6%) Query: 280 YETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS 339 Y+ V +I+ + + + V D L + S Sbjct: 479 YKDKFRVSEDDILLTSKGSVIKAAVVGANPPPAFISGNITLLRVDERKYDPYILLEYLYS 538 Query: 340 YDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNVI----NVETARIDVLVE 394 + SG + +K++ V + I + + L E Sbjct: 539 GQGQLALERIQSGTTIRILSNASIKKMKVPEYDKELMKVIGKQLKQNRERYFSEQKRLTE 598 Query: 395 KIEQS----IVLLKERRSS 409 ++ + +LKE + Sbjct: 599 SYQKERQKLLEILKEEKDG 617 >gi|14324679|dbj|BAB59606.1| type I restriction enzyme S protein [Thermoplasma volcanium GSS1] Length = 84 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 30/77 (38%), Gaps = 7/77 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGK 62 KDSG++WIG+I W +V I F+KL T T + I ++ ++ Sbjct: 3 MKDSGIEWIGSINSKWPIVKIIYFSKLKTCGTPDKRVLEYWEDGKINWMSSGEINKDLIY 62 Query: 63 YLPKDGNSRQSDTSTVS 79 + S + Sbjct: 63 EVEGKITELGYKNSNAT 79 >gi|238854085|ref|ZP_04644434.1| restriction endonuclease S subunit [Lactobacillus gasseri 202-4] gi|238833292|gb|EEQ25580.1| restriction endonuclease S subunit [Lactobacillus gasseri 202-4] Length = 307 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 46/132 (34%), Gaps = 6/132 (4%) Query: 25 WKVVPIKRFTKLNTGRTSES-----GKDIIYIGLEDVES-GTGKYLPKDGNSRQSDTSTV 78 WK I++ L +G+T G +I Y+ ++D+ S Y+ T+ Sbjct: 176 WKKSTIEKCCTLKSGKTLPRNIENEGGNIPYVKVKDMNSLENTTYITTSTRFVSDKTANK 235 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 SIF G +++ K G + ++ + +L + Sbjct: 236 SIFPVGTVIFPKRGGAIGTNKKRLTKVPICADLNIMGVIPDNTRISSYYLFEYFNMVDLN 295 Query: 139 AICEGATMSHAD 150 + G+++ + Sbjct: 296 TLNNGSSVPQIN 307 >gi|225164187|ref|ZP_03726463.1| conserved hypothetical protein [Opitutaceae bacterium TAV2] gi|224801196|gb|EEG19516.1| conserved hypothetical protein [Opitutaceae bacterium TAV2] Length = 490 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 51/420 (12%), Positives = 117/420 (27%), Gaps = 45/420 (10%) Query: 27 VVPIKRFTKLNT-GRTS----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIF 81 ++ ++ GRT S K ++ V K + + + Sbjct: 63 WKKFEQLARVTMPGRTKGILVSSEKGTPFLAATQV-FDIRPVPRKWLAVDRINNARSLFI 121 Query: 82 AKGQILYGKLGPYLRKAIIAD--FDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++G IL + G R + D + +CS L ++ ++ PE + Q Sbjct: 122 SEGTILVTRSGNVGRSTLTTDTIKEILCSDDLLRVEARE--PEQWGWLYAYLRSPQARAM 179 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + ++ P+ + + + +D+ +E + + Sbjct: 180 MTGAQYGHIIKHLECEHLNALPVPVVRKGIAADFQKRTQAILDSRNRAHRLTLEAEERFE 239 Query: 200 QAL----VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK-----NTK 250 Q L V G + + + P + V + + + Sbjct: 240 QTLGPLKVKDWGEAGFDIRASLLFGDRRRLEATPHNPGVATIRRHLAKNGKGLFTVARAG 299 Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQI--------------VDPGEIVFRFI 296 +E + E+ + V G ++ Sbjct: 300 FDVWLPSRFKRIPAEDGIELVDSSAVFETNPDHNKRIADGDFGDAFNGRVKAGWLLMARS 359 Query: 297 DLQNDKRSLRSAQVM--ERGIITSAYMAVKPHGIDSTYLAWLMRSYDL----CKVFYAMG 350 + + E ++ + + P+ +L + + ++ Sbjct: 360 GQTYGINGNVAFATVAHENRAVSDDLLRIAPNKESKMRAGYLFVALSHPLLGRPLVKSLA 419 Query: 351 SGL-RQSLKFEDVKRLPVL-VPPIKEQFDITNVINV---ETARIDVLVEKIEQSIVLLKE 405 G + D+ L ++ +P +E I ++ E AR DVL K+ LL E Sbjct: 420 YGSSIPHIDAADLLLLEIVRLPSREE-NAIADLAEESAAERARADVLERKLADDASLLIE 478 >gi|321310221|ref|YP_004192550.1| type I restriction-modification system, S subunit (fragment) [Mycoplasma haemofelis str. Langford 1] gi|319802065|emb|CBY92711.1| type I restriction-modification system, S subunit (fragment) [Mycoplasma haemofelis str. Langford 1] Length = 130 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 10/117 (8%), Positives = 30/117 (25%), Gaps = 7/117 (5%) Query: 29 PIKRFTKLNTGRTS----ESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKG 84 K+ +G ++ + ++++ G + + +I +G Sbjct: 14 KFGDVCKIRSGTRFYPQFQTNSGFPIVRVKNIRDGQI--TTEGLSYCDPKNHNSAIIRQG 71 Query: 85 QILYGKLGPYLRK-AIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 I+ + G + + + L P +L + Sbjct: 72 DIVMARAGRTGVVGINLTGREFFFNENVFKLVPNRRFVTSRYLYLFLSRHQDIKTKL 128 >gi|228475389|ref|ZP_04060108.1| conserved hypothetical protein [Staphylococcus hominis SK119] gi|228270572|gb|EEK12004.1| conserved hypothetical protein [Staphylococcus hominis SK119] Length = 191 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 51/147 (34%), Gaps = 17/147 (11%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + +K + +V +IV + + + + + V Sbjct: 45 DDTYQPRVIKLKDTSRATVVHKDDIVISMM--TGECTLVSTRHDGSILPYNYTKIEVTSD 102 Query: 327 GIDSTYLAWLMR-SYDLCKVF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 ++ +L + + + ++ + Y G + L + +K L + +P I+ Q I Sbjct: 103 LLEPAFLVYWFQLAPEVHSQYKQYMQGGSTIKKLTHQQLKSLYITLPSIERQRLIGQ--- 159 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSF 410 + E+ + +LK+R+S Sbjct: 160 ---------IGIKEKQLNVLKQRQSRL 177 >gi|172039826|ref|YP_001799540.1| type I restriction-modification system, specificity subunit [Corynebacterium urealyticum DSM 7109] gi|171851130|emb|CAQ04106.1| type I restriction-modification system, specificity subunit [Corynebacterium urealyticum DSM 7109] Length = 320 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 9/64 (14%), Positives = 23/64 (35%), Gaps = 3/64 (4%) Query: 329 DSTYLAWLMR--SYDLCKVFYAMGSGLRQSLKFEDVKRLPVL-VPPIKEQFDITNVINVE 385 D ++ + ++ + S + +FE L +P + Q I +++ Sbjct: 117 DPKFVYYWLQLMHKSGRAWKHQNQSTGIANFQFEQFLDNEFLWLPSLTTQQAIASILGSL 176 Query: 386 TARI 389 +I Sbjct: 177 DDKI 180 Score = 40.2 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 37/106 (34%), Gaps = 10/106 (9%) Query: 29 PIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQS---DTSTVS 79 P R + G T + G +I + D+ + G +L + ++ + + Sbjct: 208 PFGRVCDVFGGSTPSTKVGEYWGGNINWATPTDLTALRGPWLSETERKITEAGLESMSST 267 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + G IL + +A + F+V++ + L + Sbjct: 268 LHPPGSILMTS-RATIGHVAVAATPVTTNQGFIVIRASEKLTPWIF 312 >gi|308184635|ref|YP_003928768.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori SJM180] gi|308060555|gb|ADO02451.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori SJM180] Length = 203 Score = 41.7 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 54/140 (38%), Gaps = 10/140 (7%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDK--RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWL 336 Y I D ++ +K + + + + A++ + + +L + Sbjct: 55 DYIDSYIFDGDFVLVGEDGSVINKDNTPVVNWASGKIWVNNHAHVLQTKNELKLKFLYFY 114 Query: 337 MRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKI 396 +++ D+ +G + E++K++ + +PP++ Q +I +++ + L+ I Sbjct: 115 LQTIDV----SYCVAGTPPKINQENLKKIIIPIPPLEIQQEIVKILDQFSILTTDLLAGI 170 Query: 397 EQSIVLLKE----RRSSFIA 412 I K+ R + Sbjct: 171 PAEIKARKKQYEYYREKLLT 190 Score = 36.3 bits (82), Expect = 9.7, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 43/158 (27%), Gaps = 13/158 (8%) Query: 22 PKHWKVVPIKRFTKLNTGRTSE--SGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS 79 PK + ++ R K I +G Y+ Sbjct: 13 PKGVGFRKLGEVCEILDNRRIPIAKNKRKPGIYPYYGANGIQDYIDSYIFDGDFV----- 67 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 + + + K A + VLQ K+ L + + + Sbjct: 68 LVGEDGSVINKDNT--PVVNWASGKIWVNNHAHVLQTKNELKLKFLYF----YLQTIDVS 121 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 C T + + + I +PIPPL Q I + + Sbjct: 122 YCVAGTPPKINQENLKKIIIPIPPLEIQQEIVKILDQF 159 >gi|297205947|ref|ZP_06923342.1| type I site-specific deoxyribonuclease specificity subunit HsdS [Lactobacillus jensenii JV-V16] gi|297149073|gb|EFH29371.1| type I site-specific deoxyribonuclease specificity subunit HsdS [Lactobacillus jensenii JV-V16] Length = 373 Score = 41.7 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 13/155 (8%), Positives = 50/155 (32%), Gaps = 14/155 (9%) Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + + + + + + + Y ++ GE+ + + + K + + Sbjct: 27 GWMTQEDRFSGDISGKQKKNYTLLHKGELSYNHGNSKVAKYGAVFSLQNYSEALIPHVYH 86 Query: 323 VKP--HGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ-----SLKFEDVKRLPVLVPPIKEQ 375 ++ + D+ K S + ++ + D ++ + + Sbjct: 87 SFKIIKETTPVFIENFFKKKDVNKQLRKYISSSARMDGLLNISYSDFMKVHLFIS----- 141 Query: 376 FDITN--VINVETARIDVLVEKIEQSIVLLKERRS 408 I+ I+ ++ L+ ++ + L K+ + Sbjct: 142 QKISETKQIDKIFEILNSLLSLQQRKLELEKQLKK 176 Score = 40.2 bits (92), Expect = 0.74, Method: Composition-based stats. Identities = 37/393 (9%), Positives = 119/393 (30%), Gaps = 32/393 (8%) Query: 32 RFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYG-- 89 ++ G +++ ++ + + + G+ ++ KG++ Y Sbjct: 3 EISERVNG--NDNRFNLPVLTISAKTGWMTQEDRFSGDISGKQKKNYTLLHKGELSYNHG 60 Query: 90 --KLGPYLRKAIIADFDGICSTQFLVLQ--PKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 K+ Y + ++ K+ P ++ + DV +++ + Sbjct: 61 NSKVAKYGAVFSLQNYSEALIPHVYHSFKIIKETTPVFIENFFKKKDVNKQLRKYISSSA 120 Query: 146 MSH-ADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + +++++ ++I +++L++ + R +EL K+ K+ + Sbjct: 121 RMDGLLNISYSDFMKVHLFISQKISETKQIDKIFEILNSLLSLQQRKLELEKQLKKFCLQ 180 Query: 205 YIV-TKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 I+ P+++ D W + ++++ TK S Sbjct: 181 NILSDNKKCPNLRFHDFSTNWKKVKVGDIFTVTRGKVLSKDKISKTKDHIMKYPVYSSQT 240 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 + L E T+ R + ++ + + G + A Sbjct: 241 LNNGLLGYYHDYLFEDAITWTTDGANAGTVRLRAGKFYGTNVNGVLLSKNGYVNDA---- 296 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV-PPIKEQFDITNVI 382 ++ ++ L ++ + + P ++EQ +I Sbjct: 297 NAEALNQIAWKYV-------------SKVGNPKLMNNVMQNIMFSIAPSVEEQV----II 339 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + ++ + +I + + + + Sbjct: 340 SKLFILHSKSLKIYQANINVYTQLKQFLLQNLF 372 >gi|296328508|ref|ZP_06871027.1| type I restriction enzyme StySJI specificity protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] gi|296154317|gb|EFG95116.1| type I restriction enzyme StySJI specificity protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] Length = 222 Score = 41.7 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 69/186 (37%), Gaps = 11/186 (5%) Query: 36 LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYL 95 + GR ++ +I +++V + + + K+ + + F + IL+ K+ P + Sbjct: 14 IIYGRAAKEFTKGDFISMKNVSENSFEIIEKNFEKFKDLQKGYTQFIENDILFAKIIPCM 73 Query: 96 RK------AIIADFDGICSTQFLV-LQPKDVLPELLQGWLLSIDVTQRIEAICE---GAT 145 + + + G ST+F + K + +LL +L + G Sbjct: 74 KNRKTTIITNLKEKIGYSSTEFHILRSTKIINNKLLYNFLKQKRFREDARCNMTGSVGFR 133 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY 205 ++ P+P PPL EQ I + + + + + E + +++++ Sbjct: 134 RVPTEFMKNYPFPLPPPPLEEQQEIVRILDEVLEN-ENKVKKLLELEEKMDILEKSILHK 192 Query: 206 IVTKGL 211 L Sbjct: 193 AFKGEL 198 >gi|227872204|ref|ZP_03990568.1| hypothetical protein HMPREF6123_0507 [Oribacterium sinus F0268] gi|227841947|gb|EEJ52213.1| hypothetical protein HMPREF6123_0507 [Oribacterium sinus F0268] Length = 69 Score = 41.7 bits (96), Expect = 0.23, Method: Composition-based stats. Identities = 11/50 (22%), Positives = 26/50 (52%), Gaps = 1/50 (2%) Query: 328 IDSTYLAWLMRSYDLCK-VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +T++ L+ S V + G ++ + D+++L + +PPI+ Q Sbjct: 1 MLNTFVKALLESDYFENAVISKIRGGTQKFISLGDIRKLEICLPPIEVQE 50 >gi|260654990|ref|ZP_05860478.1| DNA methylase-type I restriction-modification system [Jonquetella anthropi E3_33 E1] gi|260630305|gb|EEX48499.1| DNA methylase-type I restriction-modification system [Jonquetella anthropi E3_33 E1] Length = 383 Score = 41.7 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 16/145 (11%), Positives = 44/145 (30%), Gaps = 2/145 (1%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN-MGLKPESYETYQIVDPGEIVFR 294 + + + +E I + ++ + + + L +S I+F Sbjct: 191 HLVRIQKSIEPGSAAYMEKGIPFVRVQDLSSQGISEPCIYLDQKSCAEAPRPQKDTILFS 250 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-L 353 ++ ++ +K + YL ++ S + G + Sbjct: 251 KDGTVGIAYKVQENDPEFVTSSAILHLNMKTDEMLPDYLTLMLNSPIVQLQAERDAGGSV 310 Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDI 378 K ++ + V V ++Q +I Sbjct: 311 INHWKLSEIADVLVPVLSYEQQKEI 335 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 28/71 (39%), Gaps = 4/71 (5%) Query: 336 LMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINVET---ARIDV 391 L +S + + SG SL D+ +P+ + + Q I++ + + Sbjct: 5 LFQSLFMQDLLKRGCSGTILTSLNRNDLFNIPIPILDGEIQNKISSYVQESMRYRQQAKE 64 Query: 392 LVEKIEQSIVL 402 L+ +S+ L Sbjct: 65 LLHLATESVEL 75 >gi|89076109|ref|ZP_01162468.1| Restriction endonuclease S subunits [Photobacterium sp. SKA34] gi|89048185|gb|EAR53768.1| Restriction endonuclease S subunits [Photobacterium sp. SKA34] Length = 223 Score = 41.7 bits (96), Expect = 0.24, Method: Composition-based stats. Identities = 11/81 (13%), Positives = 22/81 (27%), Gaps = 9/81 (11%) Query: 21 IPKHWKVVPIKRFTKLNTGRTS--------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +PK W+ + F + G T S + + +V+ + R Sbjct: 106 LPKGWEYSRLGEFVSIIRGITFPASAKHHEPSEGLVACLRTTNVQ-HQIDWDDLLYVDRS 164 Query: 73 SDTSTVSIFAKGQILYGKLGP 93 + G I+ Sbjct: 165 YLKREEQKLSIGDIVMSMANS 185 >gi|329116817|ref|ZP_08245534.1| hypothetical protein SPB_0634 [Streptococcus parauberis NCFD 2020] gi|326907222|gb|EGE54136.1| hypothetical protein SPB_0634 [Streptococcus parauberis NCFD 2020] Length = 198 Score = 41.7 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 64/184 (34%), Gaps = 7/184 (3%) Query: 26 KVVPIKRFTKLNTGRTSESG---KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 + + + G+ S + I L D++ Y + I Sbjct: 14 EKMTLAETADCFKGKAISSKIEEGEFGLINLSDMQKDGINYEHLRTFQMERRQLLRYILE 73 Query: 83 KGQILYGKLGPYLRKAIIA--DFDGICSTQFLVLQPKDVLP-ELLQGWLLSIDVTQRIEA 139 +G +L G + + + D + S+ VL+PK ++ +L S ++ Sbjct: 74 EGDVLIASKGTVKKVCVFHKQENDIVASSNITVLRPKKAFRGYYIKFFLDSPIGQALLDE 133 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIR-EKIIAETVRIDTLITERIRFIELLKEK 198 G + + K + +I +P+ PL +Q + + L + + L E Sbjct: 134 ADHGKDVINLSTKDLLDISIPVIPLVKQDYLINNYLRGLNDYHRKLNRAQQEWQHLQNEI 193 Query: 199 KQAL 202 ++AL Sbjct: 194 EKAL 197 >gi|296395126|ref|YP_003660010.1| restriction endonuclease S subunit-like protein [Segniliparus rotundus DSM 44985] gi|296182273|gb|ADG99179.1| Restriction endonuclease S subunits-like protein [Segniliparus rotundus DSM 44985] Length = 449 Score = 41.7 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 46/399 (11%), Positives = 103/399 (25%), Gaps = 36/399 (9%) Query: 33 FTKLNTGRTSESGKDIIY--IGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 F T + + + L V ++ G ++Y + Sbjct: 31 FADFATEVHPDPHRPAPSEHVKLAGVRWYGRGLFVREERLGSEIKGRCYPLQPGMLVYNR 90 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQPKDVLPE------LLQGWLLSIDVTQRIEAICEGA 144 L + + + C P+ L E +Q S G Sbjct: 91 LFAWKSAFAVVTPE-FCGVHVSNEFPQFQLDELTVDAGFIQVLCASEPFAAMAAGKSTGT 149 Query: 145 T---MSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQA 201 T + + ++ +P+PPL EQ ++ + R D L R + Sbjct: 150 TAVSRNRLRQIDLMSLTIPLPPLNEQRVMLRAYQIKIDRADALSRRATRIRSAAWMAFEE 209 Query: 202 LV---------SYIVTKGLNPDVKMKD-------SGIEWVGLVPDHWEVKPFFALVTELN 245 ++ S V+ + D + W + + V Sbjct: 210 VLGATSSPVAVSRAVSISRFASMSRWDDARVDSGPALRWPVVSLGDYADIRLGCQVPRRG 269 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 + + + + L E T + +++F + Q + Sbjct: 270 THGPGVSRPYLRAANVQRGRFDLSDVKNMRVTERIATALTIRHDDLLFVEGNSQEEVGRA 329 Query: 306 RSAQVMERGIITSAYMAVKPH--GIDSTYLAWLMRSYDLCKVFYAMGSGLR---QSLKFE 360 I ++ + + + +D + + F + + Sbjct: 330 AVWNRQGEYIFQNSLIRARTNRSMLDPWFTCAWFNCEAGRRYFQTSATTTTGTLWHIGAG 389 Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 PV +PPI Q + + +D + +Q+ Sbjct: 390 KTANAPVPLPPISIQRKLAK---DLWSALDDAADNEQQA 425 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 50/164 (30%), Gaps = 14/164 (8%) Query: 25 WKVVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 W VV + + + G T G Y+ +V+ G +T Sbjct: 248 WPVVSLGDYADIRLGCQVPRRGTHGPGVSRPYLRAANVQRGRFDLSDVKNMRVTERIATA 307 Query: 79 SIFAKGQILY--GKLGPYLRKAIIADFDG--ICSTQFLVLQPK--DVLPELLQGWLLSID 132 +L+ G + +A + + G I + + + P W Sbjct: 308 LTIRHDDLLFVEGNSQEEVGRAAVWNRQGEYIFQNSLIRARTNRSMLDPWFTCAWFNCEA 367 Query: 133 VTQRIEAICEGATM--SHADWKGIGNIPMPIPPLAEQVLIREKI 174 + + T H N P+P+PP++ Q + + + Sbjct: 368 GRRYFQTSATTTTGTLWHIGAGKTANAPVPLPPISIQRKLAKDL 411 >gi|260589500|ref|ZP_05855413.1| N-6 DNA Methylase family protein [Blautia hansenii DSM 20583] gi|331082930|ref|ZP_08332050.1| hypothetical protein HMPREF0992_00974 [Lachnospiraceae bacterium 6_1_63FAA] gi|260540068|gb|EEX20637.1| N-6 DNA Methylase family protein [Blautia hansenii DSM 20583] gi|330399925|gb|EGG79583.1| hypothetical protein HMPREF0992_00974 [Lachnospiraceae bacterium 6_1_63FAA] Length = 588 Score = 41.7 bits (96), Expect = 0.25, Method: Composition-based stats. Identities = 16/148 (10%), Positives = 37/148 (25%), Gaps = 6/148 (4%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 IQ + + + +I+ + + V Sbjct: 438 IQYEGADKVRSTNSVCKGKYRIQKDDILITSKGTALKLAIVEDYSPEAYISGNLTLIRVN 497 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSL-KFEDVKRLPVLVPPIKEQFDITNVIN 383 P L + S + SG + +++L + +++ +I + Sbjct: 498 PEKYHPYVLFEYLNSRQGQISLERIQSGTTIRILSNASLQKLKIPEYHLEKMREIGKELK 557 Query: 384 VETARIDVLVEKIEQSIV-----LLKER 406 +E+ LLKE Sbjct: 558 ENQTVFYREKYMLEKQYENKRKHLLKEL 585 >gi|13541296|ref|NP_110984.1| restriction endonuclease S subunit fragment [Thermoplasma volcanium GSS1] Length = 82 Score = 41.7 bits (96), Expect = 0.26, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 30/77 (38%), Gaps = 7/77 (9%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGK 62 KDSG++WIG+I W +V I F+KL T T + I ++ ++ Sbjct: 1 MKDSGIEWIGSINSKWPIVKIIYFSKLKTCGTPDKRVLEYWEDGKINWMSSGEINKDLIY 60 Query: 63 YLPKDGNSRQSDTSTVS 79 + S + Sbjct: 61 EVEGKITELGYKNSNAT 77 >gi|256854680|ref|ZP_05560044.1| LOW QUALITY PROTEIN: restriction endonuclease [Enterococcus faecalis T8] gi|256710240|gb|EEU25284.1| LOW QUALITY PROTEIN: restriction endonuclease [Enterococcus faecalis T8] Length = 163 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 52/161 (32%), Gaps = 4/161 (2%) Query: 229 PDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP--ESYETYQIV 286 + ++ + LIE + YG + K ET + + + + Sbjct: 1 WEQCKLGRMASFSKGNGYSKADLIEEGHPLILYGRLYTKYETIIESVDTFAKLQDKSILS 60 Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY-MAVKPHGIDSTYLAWLMRSYDLCKV 345 GE++ + S S + ++ + ++ T+LA + + K Sbjct: 61 KGGEVIVPSSGESAEDISRASVVDVAGVVLGGDLNIIKTNSELNPTFLALTISNGSQQKE 120 Query: 346 FYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 G L D+K + +L P I+EQ I Sbjct: 121 MSKRAQGKSIVHLHNSDLKEINLLYPKIEEQIYIGLFFKKL 161 >gi|238809964|dbj|BAH69754.1| hypothetical protein [Mycoplasma fermentans PG18] Length = 271 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 51/172 (29%), Gaps = 10/172 (5%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKD-------IIYIGLEDVESGTGKYLPKDGNSRQ 72 IP +W K L G++ E+ I + + D++ K S Q Sbjct: 100 EIPINWAWTRFKNIANLVLGKSPETNNINYWKNGVINWFTIADMKDKQIIEDSKKKISLQ 159 Query: 73 SDTS--TVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLS 130 + + KG +L + K I + D + + + + L+ Sbjct: 160 AKKEIFNNQMSKKGTLLLS-FKLTIGKTSIINQDSVHNEAIVSINFYKDNNITKMFLLIF 218 Query: 131 IDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRID 182 + + + + + + +PIPP+ Q I Sbjct: 219 LGLLINNCEKINAIKGKTLNKEKLQKMLIPIPPIKNQNNILLITNKIIDLFK 270 Score = 38.6 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 74/224 (33%), Gaps = 9/224 (4%) Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGI- 222 L +Q E I + I+ ++ K+K ++ + K + K I Sbjct: 35 LVKQDPNDEPASKLLEAIQIEKNKLIKEGKIKKDKHESFIFQGEDKNYYEKIGSKVINIT 94 Query: 223 -EWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 E +P +W F + + K+ + N N + ++ + +S + Sbjct: 95 NEIPFEIPINWAWTRFKNIANLVLGKSPETNNINYWKNGVINWFTIADMKDKQIIEDSKK 154 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA-------YMAVKPHGIDSTYLA 334 + EI + + + + II + T + Sbjct: 155 KISLQAKKEIFNNQMSKKGTLLLSFKLTIGKTSIINQDSVHNEAIVSINFYKDNNITKMF 214 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDI 378 L+ L + + ++L E ++++ + +PPIK Q +I Sbjct: 215 LLIFLGLLINNCEKINAIKGKTLNKEKLQKMLIPIPPIKNQNNI 258 >gi|229548241|ref|ZP_04436966.1| possible type I restriction-modification system specificity subunit [Enterococcus faecalis ATCC 29200] gi|229306630|gb|EEN72626.1| possible type I restriction-modification system specificity subunit [Enterococcus faecalis ATCC 29200] Length = 153 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 14/147 (9%), Positives = 38/147 (25%), Gaps = 11/147 (7%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYET-YQIVDPGEIVFRFIDLQNDKRSL 305 K +I G + E+Y+ Y G+++ Sbjct: 17 KEQTSESGDIPFYKIGTFGATADAFISRELFETYKKKYPYPKIGDLLISASGSIGRVV-- 74 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRL 365 + + D ++ + ++ + + L +++ Sbjct: 75 --EYKGNDEYFQDSNIVWLK--HDDRINNLFLKQFYSIVKWHGLEGSTIKRLYNKNILET 130 Query: 366 PVLVPPIKEQFDITNVINVETARIDVL 392 + +P EQ I ++D + Sbjct: 131 TIHLPVFDEQEKIG----TLFKQLDDI 153 >gi|221231663|ref|YP_002510815.1| type I RM modification enzyme [Streptococcus pneumoniae ATCC 700669] gi|220674123|emb|CAR68642.1| putative type I RM modification enzyme [Streptococcus pneumoniae ATCC 700669] Length = 180 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 55/176 (31%), Gaps = 5/176 (2%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEILSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 AT+ H + + ++ + + + EQ I + I + L+K + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKLQLDELNLLVKSRY 174 Score = 36.7 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 40/143 (27%), Gaps = 5/143 (3%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 N LK P +I+ + + + I Sbjct: 35 DDLRNNNNLKFTESLNMTEALPDDILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKE 94 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 I S YL + S + L + L + + I+EQ +I ++N Sbjct: 95 KIISDYLGVFLESKS-QYLREHSTGATIPHLNKNILLDLQLELLGIEEQENIICILNT-- 151 Query: 387 ARIDVLVEKIEQSIVLLKERRSS 409 I L+ K + + L S Sbjct: 152 --IKRLITKRKLQLDELNLLVKS 172 >gi|301019050|ref|ZP_07183262.1| N-6 DNA Methylase [Escherichia coli MS 196-1] gi|299882408|gb|EFI90619.1| N-6 DNA Methylase [Escherichia coli MS 196-1] Length = 402 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 63/228 (27%), Gaps = 5/228 (2%) Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G + I + + ++ + I+ +I ++ I E Sbjct: 171 RKFIGLRRYLLNEHSITKVIELPRNIFKRTEAKTHILIFNKKIMPHHKIQLHCITKDGEL 230 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 +++ D + E G + ++ + Sbjct: 231 SPSVLIRKEDAVERMDYSYHYNKNE--GKGFSTIGMLKNISIFRGRFNSKEITEHVFHTT 288 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 G+ N + + + I PG+I+ + K+ L I+ Sbjct: 289 KFSGDEKYIKFHCNSVEELKPSKLDVIAKPGDILIARVGRNFHKKIL--FVESGYSYISD 346 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRL 365 ++ G D L + S D + SG Q + + +K++ Sbjct: 347 CIFLIRASGGDKKKLFDFLCSQDGQEELSRASSGVAAQHITMDALKKI 394 >gi|194676307|ref|XP_608246.4| PREDICTED: hypothetical protein [Bos taurus] Length = 1291 Score = 41.3 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 9/40 (22%), Positives = 16/40 (40%), Gaps = 3/40 (7%) Query: 376 FDITNVINVETARIDVLVEKIEQSIVL---LKERRSSFIA 412 I ++ E +++ L+ E I L ER+ I Sbjct: 842 QKILAELDKEVKKVNDLINNSENEISRRTILIERKQGLIN 881 >gi|168490978|ref|ZP_02715121.1| type I restriction-modification enzyme 1, S subunit [Streptococcus pneumoniae CDC0288-04] gi|183574774|gb|EDT95302.1| type I restriction-modification enzyme 1, S subunit [Streptococcus pneumoniae CDC0288-04] Length = 180 Score = 41.3 bits (95), Expect = 0.29, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 55/176 (31%), Gaps = 5/176 (2%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 AT+ H + + ++ + + + EQ I + I + L+K + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKGLITKRKLQLDELNLLVKSRY 174 >gi|182683806|ref|YP_001835553.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae CGSP14] gi|182629140|gb|ACB90088.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae CGSP14] Length = 180 Score = 41.3 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 55/176 (31%), Gaps = 5/176 (2%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 AT+ H + + ++ + + + EQ I + I + L+K + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNLLVKSRY 174 >gi|291543315|emb|CBL16424.1| Type I restriction modification DNA specificity domain [Ruminococcus sp. 18P13] Length = 208 Score = 41.3 bits (95), Expect = 0.30, Method: Composition-based stats. Identities = 22/197 (11%), Positives = 60/197 (30%), Gaps = 8/197 (4%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + + L K + WE A++ + + + SL+ N + Sbjct: 7 LPQHLRTYAVPKYKHFHLANPLTHTWEQCELGAIIQAVQELTSDFENYPLYSLTIENGVT 66 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 R + ET E F + ++ ++ ++ Y Sbjct: 67 PKTERYERSFLITKETDLFKIVPEQCFVSNPMNLRFGAIGFNDSGKKVSVSGYYDVFSID 126 Query: 327 GIDSTYLAW-LMRSYDLCKVFYAMGSGL---RQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 + + +++ + K F + G ++ + F + + P + E+ + Sbjct: 127 RGECSNFWCVYLKTANSLKRFDDVAIGSLIEKRRVHFSQLTEMSFPAPNMNEKKK----L 182 Query: 383 NVETARIDVLVEKIEQS 399 R++ L+ ++ Sbjct: 183 GEFFERLERLITLHQRK 199 >gi|329963203|ref|ZP_08300940.1| hypothetical protein HMPREF9446_02533 [Bacteroides fluxus YIT 12057] gi|328528899|gb|EGF55839.1| hypothetical protein HMPREF9446_02533 [Bacteroides fluxus YIT 12057] Length = 1176 Score = 41.3 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 54/141 (38%), Gaps = 5/141 (3%) Query: 247 KNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLR 306 KN ++S ++ + + + T + + GE + L+ Sbjct: 942 KNPHSMDSYPHLKTHLDQFKDVITSDNKPYGLHRARVESFFVGEKIVA---LRKCAGKPI 998 Query: 307 SAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG--SGLRQSLKFEDVKR 364 A +++ + +K + ++ YL L+ S + G G L E +++ Sbjct: 999 FAYANGENYMSATFYIIKTNRVNMKYLTGLLNSKLIEFWLKNRGKMQGANYQLDKEPLQQ 1058 Query: 365 LPVLVPPIKEQFDITNVINVE 385 +P+ VP I+ Q I N+++ Sbjct: 1059 IPIAVPSIEVQTIIANLVDTI 1079 >gi|300949931|ref|ZP_07163890.1| N-6 DNA Methylase [Escherichia coli MS 116-1] gi|300450699|gb|EFK14319.1| N-6 DNA Methylase [Escherichia coli MS 116-1] Length = 372 Score = 41.3 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 63/228 (27%), Gaps = 5/228 (2%) Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G + I + + ++ + I+ +I ++ I E Sbjct: 141 RKFIGLRRYLLNEHSITKVIELPRNIFKRTEAKTHILIFNKKIMPHHKIQLHCITKDGEL 200 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 +++ D + E G + ++ + Sbjct: 201 SPSVLIRKEDAVERMDYSYHYNKNE--GKGFSTIGMLKNISIFRGRFNSKEITEHVFHTT 258 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 G+ N + + + I PG+I+ + K+ L I+ Sbjct: 259 KFSGDEKYIKFHCNSVEELKPSKLDVIAKPGDILIARVGRNFHKKIL--FVESGYSYISD 316 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRL 365 ++ G D L + S D + SG Q + + +K++ Sbjct: 317 CIFLIRASGGDKKKLFDFLCSQDGQEELSRASSGVAAQHITMDALKKI 364 >gi|46580120|ref|YP_010928.1| hypothetical protein DVU1710 [Desulfovibrio vulgaris str. Hildenborough] gi|46449536|gb|AAS96187.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311233886|gb|ADP86740.1| hypothetical protein Deval_1585 [Desulfovibrio vulgaris RCH1] Length = 192 Score = 41.3 bits (95), Expect = 0.31, Method: Composition-based stats. Identities = 21/124 (16%), Positives = 49/124 (39%), Gaps = 9/124 (7%) Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM---AVKPHGIDSTYLAWLMRSY 340 + P +I+F +N ++ V +I+ ++ G+ ++AW M Sbjct: 58 NWLQPQDILFLVRGSRN--IAVLLDSVPFPAVISPHFLLLRVAPGAGVLPAFVAWQMNQL 115 Query: 341 DLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID---VLVEKI 396 + F A +++S++ + LP+++PP Q + + + L+ Sbjct: 116 PAQRYFEASAEGSVQRSIRKAVLADLPLVIPPKSTQHAVVRLAAAARQEAETYRKLIANR 175 Query: 397 EQSI 400 EQ + Sbjct: 176 EQEL 179 >gi|146321307|ref|YP_001201018.1| type I restriction-modification system, S subunit [Streptococcus suis 98HAH33] gi|145692113|gb|ABP92618.1| type I restriction-modification system, S subunit, putative [Streptococcus suis 98HAH33] Length = 103 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 12/102 (11%), Positives = 26/102 (25%), Gaps = 7/102 (6%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWLMRSYDL 342 ++V + ++ S YL + S Sbjct: 2 KRNQLVTPVSSSLEHIGKFARIDKNYSDTVAGGFVFQLTPFISSDTLSNYLLLCLSSPLF 61 Query: 343 CKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 K + ++ + L + + P +EQ I+N Sbjct: 62 YKQLQSVTKLSGQALYNIPKTKLNDLRIALAPEQEQERISNK 103 >gi|90410155|ref|ZP_01218172.1| hypothetical protein P3TCK_05291 [Photobacterium profundum 3TCK] gi|90329508|gb|EAS45765.1| hypothetical protein P3TCK_05291 [Photobacterium profundum 3TCK] Length = 66 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 22/55 (40%), Gaps = 6/55 (10%) Query: 363 KRLPVLVPPIKEQFDITNVINVETARIDVL--VEKIEQSIVLLKE----RRSSFI 411 +L + +PP+ EQ I ++ +D E+ E+ L E + + Sbjct: 1 MKLNINIPPLAEQKRIAEELDDLQRMVDNAPSTEEKEKFSEALNEKCALYFNGLL 55 >gi|312437727|gb|ADQ76798.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus TCH60] Length = 157 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 39/128 (30%), Gaps = 15/128 (11%) Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRS-----LRSAQVMERGIITSAYMAVKPHGID 329 +K S + Y+ ++ G+I S + + + +G I Y+ P Sbjct: 30 IKVNSGKDYKHLEKGDIPVYGTGGYMTSVSEPLSEIDAVGIGRKGTINKPYLLEAPFWTV 89 Query: 330 STYLA----------WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 T +++ + S SL + + ++ VP KEQ I Sbjct: 90 DTLFYCTPKKETDILFILSLFRKINWKVYDESTGVPSLSKQTINKINRFVPSNKEQQKIG 149 Query: 380 NVINVETA 387 Sbjct: 150 EFFIKLDR 157 Score = 40.9 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 21/155 (13%), Positives = 40/155 (25%), Gaps = 18/155 (11%) Query: 24 HWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAK 83 W+ + K+N+G+ + +E G G + Sbjct: 20 EWEEKKLGDLIKVNSGKDYK-----------HLEKGDIPVYGTGGYMTSVSEP---LSEI 65 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + G+ G + ++ T F K+ + + E Sbjct: 66 DAVGIGRKGTINKPYLLEAPFWTVDTLFYCTPKKETDILFILSLFR----KINWKVYDES 121 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAET 178 + + I I +P EQ I E I Sbjct: 122 TGVPSLSKQTINKINRFVPSNKEQQKIGEFFIKLD 156 >gi|254372671|ref|ZP_04988160.1| hypothetical protein FTCG_00236 [Francisella tularensis subsp. novicida GA99-3549] gi|151570398|gb|EDN36052.1| hypothetical protein FTCG_00236 [Francisella novicida GA99-3549] Length = 190 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 49/134 (36%), Gaps = 3/134 (2%) Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 ++ G+++F N + S A + I Y Sbjct: 55 DTFIANKDLSFSCTQQGDVIFGLRKP-NQAVYIDSNNTNLLVQSYMAIIRCNSDIILPEY 113 Query: 333 LAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 LA+ + + D+ + G Q LK + +K + + +P +K+Q + + I Sbjct: 114 LAFKLNTQDIYNQLHKNIQGGSAIQLLKIQSLKDIVIQIPSLKQQAKRIETLKIGYQEIA 173 Query: 391 VLVEKIEQSIVLLK 404 +L + IE+ LLK Sbjct: 174 ILRKLIEEKQKLLK 187 >gi|294782726|ref|ZP_06748052.1| type I restriction enzyme EcoDI specificity protein [Fusobacterium sp. 1_1_41FAA] gi|294481367|gb|EFG29142.1| type I restriction enzyme EcoDI specificity protein [Fusobacterium sp. 1_1_41FAA] Length = 196 Score = 41.3 bits (95), Expect = 0.32, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 34/92 (36%), Gaps = 7/92 (7%) Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 D L +L Y + + + + ED++ +P+ +P E I N++N Sbjct: 107 QKDYYALLYLASLYRIESFKSKSTGSIVKFITKEDIENIPLFIP---ENKSIINILNKMI 163 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 L E +L + R + + GQ Sbjct: 164 I----LKENNFSENEILIKLRDFLLPLLMNGQ 191 >gi|291534098|emb|CBL07211.1| Type I restriction modification DNA specificity domain [Megamonas hypermegale ART12/1] Length = 128 Score = 41.3 bits (95), Expect = 0.33, Method: Composition-based stats. Identities = 15/118 (12%), Positives = 36/118 (30%), Gaps = 8/118 (6%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYM---AVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 F + + I +M I+S Y+ L+ D+ + Sbjct: 1 MCFSNGSIKHLGKLCYIDKDTNYIAGGFMGILRSNSSNINSKYIYLLLSLKDMQNNIRIL 60 Query: 350 GSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL---VEKIEQSIVLL 403 +G ++L + + + VP + Q I + + + ++ I + Sbjct: 61 ANGGNIKNLSL-MIGSIKIPVPSVSIQESIVRECENVENEYNNIRMKESEYQEKIEKI 117 >gi|325973138|ref|YP_004250202.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323651740|gb|ADX97822.1| putative type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 82 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 29/69 (42%), Gaps = 4/69 (5%) Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++L + +K + +L+P I N I +EK+E + +E + + Sbjct: 17 AIKNLSPQKLKEIEILIPD----QKILEKFNNFWKNIHSKIEKLELKMQKYEEIKKKLLD 72 Query: 413 AAVTGQIDL 421 + + +I + Sbjct: 73 SLFSQEIQV 81 >gi|168484774|ref|ZP_02709719.1| type I restriction-modification enzyme 1, S subunit [Streptococcus pneumoniae CDC1873-00] gi|172042068|gb|EDT50114.1| type I restriction-modification enzyme 1, S subunit [Streptococcus pneumoniae CDC1873-00] gi|332201351|gb|EGJ15421.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA47368] Length = 180 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 20/176 (11%), Positives = 55/176 (31%), Gaps = 5/176 (2%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 AT+ H + + ++ + + + EQ I + I + L+K + Sbjct: 119 ATIPHLNKNILLDLQLELLDIEEQENIICILNTIKRLITKRKLQLDELNLLVKSRY 174 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 40/143 (27%), Gaps = 5/143 (3%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 N LK P +I+ + + + I Sbjct: 35 DDLRNNNNLKFTESLNMTEALPDDILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKE 94 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 I S YL + S + L + L + + I+EQ +I ++N Sbjct: 95 KIISDYLGVFLESKS-QYLREHSTGATIPHLNKNILLDLQLELLDIEEQENIICILNT-- 151 Query: 387 ARIDVLVEKIEQSIVLLKERRSS 409 I L+ K + + L S Sbjct: 152 --IKRLITKRKLQLDELNLLVKS 172 >gi|256962775|ref|ZP_05566946.1| predicted protein [Enterococcus faecalis HIP11704] gi|256953271|gb|EEU69903.1| predicted protein [Enterococcus faecalis HIP11704] gi|295113789|emb|CBL32426.1| Type I restriction modification DNA specificity domain. [Enterococcus sp. 7L76] Length = 146 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 48/149 (32%), Gaps = 7/149 (4%) Query: 268 LETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI-ITSAYMAVKPH 326 +K E + G I + +E + + + + P Sbjct: 3 DFDNFECVKLEDVAEFGRAKAGYIYPAGTSTIQISATKGQIDFLEYPREVPTKEVVIIPQ 62 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 L+ ++ K +G+ +++ +++ P+ + + Q +++ T Sbjct: 63 NGIEPKYFNLILQRNVDKFIAKYATGI--NIQEKEIGNFPIELFNRETQKAFVRMMDHIT 120 Query: 387 ARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + E + + KE + +F+ + Sbjct: 121 DE----IATAENELTIYKEMKKAFLGDLM 145 >gi|118497299|ref|YP_898349.1| type I restriction-modification system, subunit S [Francisella tularensis subsp. novicida U112] gi|194323603|ref|ZP_03057380.1| hypothetical protein FTE_1764 [Francisella tularensis subsp. novicida FTE] gi|118423205|gb|ABK89595.1| type I restriction-modification system, subunit S [Francisella novicida U112] gi|194322458|gb|EDX19939.1| hypothetical protein FTE_1764 [Francisella tularensis subsp. novicida FTE] Length = 190 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 47/134 (35%), Gaps = 3/134 (2%) Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 ++ +IVF N + S A + I Y Sbjct: 55 DTFIANKDLSFSCTQEDDIVFGLRKP-NQAVYIDSNNTDLLVQSYMAIIRCNSDIILPEY 113 Query: 333 LAWLMRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 LA+ + + D+ + G Q LK + +K + + +P +++Q + I Sbjct: 114 LAFKLNTQDIYNQLHKNIQGGSAIQLLKIQSLKDIVIQIPSLEQQAKRIETLKTGYQEIA 173 Query: 391 VLVEKIEQSIVLLK 404 +L + IE+ +LK Sbjct: 174 ILRKLIEEKQKMLK 187 >gi|21232332|ref|NP_638249.1| hypothetical protein XCC2901 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767535|ref|YP_242297.1| hypothetical protein XC_1208 [Xanthomonas campestris pv. campestris str. 8004] gi|188990648|ref|YP_001902658.1| hypothetical protein xccb100_1252 [Xanthomonas campestris pv. campestris str. B100] gi|21114103|gb|AAM42173.1| hypothetical protein XCC2901 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572867|gb|AAY48277.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] gi|167732408|emb|CAP50602.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris] Length = 198 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 47/136 (34%), Gaps = 9/136 (6%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK---PHGIDSTYLA 334 E + + +IVF +++ R + + ++ P + +LA Sbjct: 62 EGRKHPDWLLDQDIVFIARGANTFAALVQAP--PPRTLCSPHIYVIRVKAPQQLLPAFLA 119 Query: 335 WLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITN---VINVETARID 390 W + + G Q S++ + P+ +PP+ Q + E A + Sbjct: 120 WQLNQAPAQRYLRQSAEGSNQLSIRRTVLDMTPIRLPPLSLQQAVIALEQAAQAERAALH 179 Query: 391 VLVEKIEQSIVLLKER 406 L+ + +L ER Sbjct: 180 ALINNRTAELAILAER 195 >gi|325924113|ref|ZP_08185678.1| hypothetical protein XGA_4738 [Xanthomonas gardneri ATCC 19865] gi|325545415|gb|EGD16704.1| hypothetical protein XGA_4738 [Xanthomonas gardneri ATCC 19865] Length = 195 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 47/136 (34%), Gaps = 9/136 (6%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK---PHGIDSTYLA 334 E + + +IVF +++ R + + ++ P + +LA Sbjct: 59 EGRKHPDWLLDQDIVFIARGANTFAALVQAP--PPRTLCSPHIYVIRVKAPQQLLPAFLA 116 Query: 335 WLMRSYDLCKVFYAMGSGLRQ-SLKFEDVKRLPVLVPPIKEQFDITN---VINVETARID 390 W + + G Q S++ + P+ +PP+ Q + E A + Sbjct: 117 WQLNQAPAQRYLRQSAEGSNQLSIRRTVLDMTPIRLPPLSLQQAVIALEQAAQAERAALH 176 Query: 391 VLVEKIEQSIVLLKER 406 L+ + +L ER Sbjct: 177 ALINNRTAELAILAER 192 >gi|325973244|ref|YP_004250308.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] gi|323651846|gb|ADX97928.1| type I restriction-modification system specificity subunit [Mycoplasma suis str. Illinois] Length = 160 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 14/114 (12%), Positives = 33/114 (28%), Gaps = 10/114 (8%) Query: 25 WKVVPIKRFTKLNTG----------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSD 74 W+ V + + K TG K I ++ + S++ Sbjct: 4 WEWVTLDKLGKFETGSPWKEKYSILNFPNEHKGIPFVDGGTISQSKFHISGDKFYSQKYL 63 Query: 75 TSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWL 128 + IF + + + +G Y ++ I+ + S + + Sbjct: 64 PPNIKIFPEDTVCFVCVGSYPGESRISKTNVCVSNNIYAFNSFKNISDPKFFKY 117 >gi|227365082|ref|ZP_03849109.1| possible restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] gi|227069880|gb|EEI08276.1| possible restriction modification system DNA specificity subunit [Lactobacillus reuteri MM2-3] Length = 317 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 50/358 (13%), Positives = 97/358 (27%), Gaps = 45/358 (12%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + T ++ KD +++ GK RQ Sbjct: 3 EYKKFTALFTDVTKTGTKIPKDEYLTTGKNIIIDQGKDSIAGYTDRQKGIFEEVPV---- 58 Query: 86 ILYGKLGPYLRKAIIADFDGICS-TQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 I++ G + R D VL+ K+ + Sbjct: 59 IVF---GDHTRIVKYIDKPFFLGADGVKVLKSKEKESNYKYLYYALKAAHIPNTGYNRHF 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + I M P L EQ I + + + T I + L + + + Sbjct: 116 K-------WLKQINMNYPDLNEQKNIVDILDSLTRII-------KVRQKELAFFDKLIKA 161 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 V +P + K+ + +G + T + + N GN Sbjct: 162 RFVEMFGDPIINNKNIKKKKLGDI-----CLLKAGDFTPSKKISPVKTSINKYPCFGGNG 216 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 I+ S Q G + F +N + ++ + +E Sbjct: 217 IRGYVDNYTHQGNYSLIGRQGALCGNVKFATGKFRNTEHAILVSPNIE------------ 264 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 I+S +L L+ L K+ + L + + + V V + Q + N + Sbjct: 265 ---INSRWLFELLN---LEKLNRFRSGAAQPGLAVKTLNEIIVPVADLNSQNEYANFV 316 >gi|111656837|ref|ZP_01407684.1| hypothetical protein SpneT_02001901 [Streptococcus pneumoniae TIGR4] gi|303269991|ref|ZP_07355723.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS458] gi|302640482|gb|EFL70897.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae BS458] Length = 170 Score = 40.9 bits (94), Expect = 0.38, Method: Composition-based stats. Identities = 19/171 (11%), Positives = 52/171 (30%), Gaps = 5/171 (2%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIEL 194 AT+ H + + ++ + + + EQ I + I + L Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELNLL 169 >gi|153951918|ref|YP_001398802.1| anti-codon nuclease masking agent [Campylobacter jejuni subsp. doylei 269.97] gi|152939364|gb|ABS44105.1| anti-codon nuclease masking agent [Campylobacter jejuni subsp. doylei 269.97] Length = 165 Score = 40.9 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 37/151 (24%), Gaps = 10/151 (6%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES---------GKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 P + + G + ++ + DV + Sbjct: 13 PNGVEFKSLGEVANFRRGSFPQPYTKTEWYGGEDSAPFVQVADVGDNMKLTETTKQTISK 72 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSID 132 S K ++ G R AI + T + K + ++L + Sbjct: 73 IAQSKSVFVPKNTVIVTLQGSIGRVAITQYDSYVDRTLAIFQSYKIPINIKFFAYVLFMK 132 Query: 133 VTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 + G + + +PIPP Sbjct: 133 F-DEEKKKARGGIIKTITVEEFKQFQIPIPP 162 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 10/110 (9%), Positives = 33/110 (30%), Gaps = 4/110 (3%) Query: 262 GNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYM 321 G+ ++ ET + + V ++ + ++R + + Sbjct: 57 GDNMKLTETTKQTISKIAQSKSVFVPKNTVIVTLQGSIGRVAITQYDSYVDRTLA----I 112 Query: 322 AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 + + + G+ +++ E+ K+ + +PP Sbjct: 113 FQSYKIPINIKFFAYVLFMKFDEEKKKARGGIIKTITVEEFKQFQIPIPP 162 >gi|148993699|ref|ZP_01823146.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP9-BS68] gi|147927779|gb|EDK78802.1| type I restriction-modification system, S subunit, putative [Streptococcus pneumoniae SP9-BS68] Length = 214 Score = 40.9 bits (94), Expect = 0.40, Method: Composition-based stats. Identities = 25/236 (10%), Positives = 64/236 (27%), Gaps = 27/236 (11%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLREHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 AT+ H + + ++ + + + EQ I + I + L+ Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKLQLDELNLLV-------- 170 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 K E G F ++ KN + + + Sbjct: 171 --------------KSRFNEMFGENIIFERNYNLFDIIDGDRGKNDPKSDEVYIMV 212 >gi|312866001|ref|ZP_07726222.1| conserved hypothetical protein [Streptococcus downei F0415] gi|311098405|gb|EFQ56628.1| conserved hypothetical protein [Streptococcus downei F0415] Length = 197 Score = 40.9 bits (94), Expect = 0.41, Method: Composition-based stats. Identities = 14/140 (10%), Positives = 43/140 (30%), Gaps = 2/140 (1%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 + + +++ G+++ K ++ +Q ++ + Sbjct: 52 DYDHLKTFAEDLDKVQKYLLETGDVLVASKGTVK-KVAVFESQDFPVVASSNITVLRPTE 110 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + YL + S + G ++ + +PV P+ +Q + Sbjct: 111 ELSGFYLKLFLESDLGQALLDRTDKGKAVLNISTAQLLEIPVPHIPLVKQNYLVQYAYKG 170 Query: 386 TARIDVLVEKIEQSIVLLKE 405 A + + +Q +K+ Sbjct: 171 QADYQRKLARAQQEWEHIKQ 190 >gi|317488605|ref|ZP_07947148.1| type I site-specific deoxyribonuclease chain S [Eggerthella sp. 1_3_56FAA] gi|316912257|gb|EFV33823.1| type I site-specific deoxyribonuclease chain S [Eggerthella sp. 1_3_56FAA] Length = 246 Score = 40.9 bits (94), Expect = 0.43, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 40/124 (32%), Gaps = 14/124 (11%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESG--KDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 IP+ W+ ++ T + G++ + K + + +G L + + + Sbjct: 106 DIPEGWEWARLEGITTYIQRGKSPKYSLEKKYPVVAQKC-NQWSGFSLERAKFVDPNSVA 164 Query: 77 TV---SIFAKGQILYGKLG-PYLRKAIIAD------FDGICSTQFLVLQPKDVLPELLQG 126 + + G +L+ G L + + D + + V++ Sbjct: 165 SYAEERLLVDGDLLWNSTGLGTLGRMAVYDSNQNPYGWAVADSHVTVIRTVPDWLRYEYA 224 Query: 127 WLLS 130 +L Sbjct: 225 FLYF 228 >gi|307637538|gb|ADN79988.1| typeI restriction-modification system subunit S [Helicobacter pylori 908] Length = 254 Score = 40.9 bits (94), Expect = 0.43, Method: Composition-based stats. Identities = 17/127 (13%), Positives = 42/127 (33%), Gaps = 1/127 (0%) Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 +T I +S N G Y D I + + Sbjct: 7 STNKKTLKISEVSEVKNKGMYPVINSGRDLYGYYHDFNNDGENITIASRGEYAGFINYFN 66 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 + G+ Y + + + +L + +++ ++ + + G +L D++ L + Sbjct: 67 EKFFAGGLCYP-YKVKDTNELLTKFLYFYLKTNEIQIMENLVFRGSIPALNKADIETLTI 125 Query: 368 LVPPIKE 374 +PP++ Sbjct: 126 PIPPLEI 132 >gi|76665049|emb|CAJ17967.1| restriction modification enzyme S subunit [Candidatus Phytoplasma solani] Length = 86 Score = 40.9 bits (94), Expect = 0.44, Method: Composition-based stats. Identities = 4/66 (6%), Positives = 25/66 (37%), Gaps = 5/66 (7%) Query: 344 KVFYAMGSGLRQSLKFEDVKRLPVLV-PPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++ ++ + + + P ++ Q I + + + ++ ++ + + L Sbjct: 17 QLQQLKTGTSVPGIQKPTLLNFKITLTPHLEHQNQIADFL----SLLEQQIKLENELLTL 72 Query: 403 LKERRS 408 + ++ Sbjct: 73 YQTQKK 78 >gi|109947646|ref|YP_664874.1| type I restriction-modification enzyme, S subunit [Helicobacter acinonychis str. Sheeba] gi|109714867|emb|CAJ99875.1| type I restriction-modification enzyme, S subunit [Helicobacter acinonychis str. Sheeba] Length = 257 Score = 40.5 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 10/77 (12%), Positives = 29/77 (37%), Gaps = 9/77 (11%) Query: 333 LAWLMRSYDLCKVFYAM---GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA-- 387 + + + + + G+ S+ D + + + P++ Q I ++V Sbjct: 1 MYYYITQDKIVHYLQRIAECGTSSYPSITPLDFLNVKIKLYPLETQQKIARTLSVLDQKV 60 Query: 388 ----RIDVLVEKIEQSI 400 +I+ L++ + I Sbjct: 61 ENNHKINELIQTLAYKI 77 Score = 37.1 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 19/181 (10%), Positives = 55/181 (30%), Gaps = 5/181 (2%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 + + KN KL + + + +++ + + + P ++ Sbjct: 75 YKIYEYYFKHKSKNAKLEQIILENPKSSIMVKNAQKTQDKYPFFTSGDNILSYPKALIDG 134 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR 354 N + + ++ + + S YL L+ S Sbjct: 135 RNCFLNTGGNAGIKFYGGKASYSTDTWCICANEF-SDYLYLLLSSIKNHINQSFFQGTSL 193 Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 + L+ + +K+ P+ +P E ++ L+ ++ L++ R + Sbjct: 194 KHLQKKLLKKYPIYMPSKHEIKKFNEIVMPLL----TLISINTRTSKKLEQIRDFLLPLL 249 Query: 415 V 415 + Sbjct: 250 L 250 >gi|313892186|ref|ZP_07825779.1| N-6 DNA Methylase [Dialister microaerophilus UPII 345-E] gi|313119324|gb|EFR42523.1| N-6 DNA Methylase [Dialister microaerophilus UPII 345-E] Length = 594 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 30/352 (8%), Positives = 92/352 (26%), Gaps = 10/352 (2%) Query: 57 ESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQP 116 KY+ + +F ++ L + I I + Sbjct: 236 NIDFLKYIDSKVPGILKRNMSDWLFNI--LMIHMLKDTGKAVGIMTNGSIWNQMSDCKNA 293 Query: 117 KDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIA 176 + + ++ + + + + + ++ Sbjct: 294 RKYFLSNGLIEAIIALPANLFKSTSIPTVLIVFSHGNKKIKMIDATSICVENMRQKIFS- 352 Query: 177 ETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH--WEV 234 T I+T+ + E + ++P + + G ++ Sbjct: 353 -TENIETIYKAYLEETENSIFVNVEDILKDEELNIHPKRYLTHITLPENGKELKTVLTDL 411 Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGL--KPESYETYQIVDPGEIV 292 + + K + + NI + + K + I+ ++ Sbjct: 412 YRGSNISAKELDKLKTDKPTLYRYVMLQNINNGMIDEELPYLSKIDEKHEKFIISNRSLI 471 Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI-DSTYLAWLMRSYDLCKVFYAMGS 351 + ++ + ++ + Y+ L S + ++ S Sbjct: 472 ISKTGPVFKSAVVDVPSNLKILASGNMFILKIDETKANPYYIQALFESSYGKALVSSISS 531 Query: 352 GLRQ-SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 G + + ++ L + +P +++Q +I N I + K+E + Sbjct: 532 GSVISTFSKKALENLVIPLPALEKQNEIANKYQALQDSIKIYKMKLEDAYDK 583 >gi|268609819|ref|ZP_06143546.1| hypothetical protein RflaF_10017 [Ruminococcus flavefaciens FD-1] Length = 184 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 61/193 (31%), Gaps = 20/193 (10%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + + R ++ I + +++P ++ D + + Sbjct: 4 IKRVGELISYVDERNTDGA-----IRDFYGININKEFMPTVASTEGIDARKYKVVRDNRF 58 Query: 87 LYGKLGP----YLRKAIIADFDGICSTQFLVLQPKDV---LPELLQGWLLSIDVTQRIEA 139 ++ + +R + I S + + K+ LPE LS ++ + Sbjct: 59 VFSGMQTGRDKCIRIGLYKGSPIIISPAYTTFEIKNTEIVLPEYFFMQFLSNEMDRYGWF 118 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 I + + S+ D I +P ++ Q + I I + +K+ Sbjct: 119 ISDSSIRSNLDIDRFEEISFELPDISVQRKYVD--------IYKAIRRVQKLNVKIKDLC 170 Query: 200 QALVSYIVTKGLN 212 LV V +G + Sbjct: 171 PILVRSAVREGRD 183 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 44/137 (32%), Gaps = 4/137 (2%) Query: 249 TKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSA 308 + + I NI ++ + Y++V VF + DK Sbjct: 16 ERNTDGAIRDFYGININKEFMPTVASTEGIDARKYKVVRDNRFVFSGMQTGRDKCIRIGL 75 Query: 309 QVMERGIITSAYM---AVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQSLKFEDVKR 364 II+ AY + Y S ++ + + + S +R +L + + Sbjct: 76 YKGSPIIISPAYTTFEIKNTEIVLPEYFFMQFLSNEMDRYGWFISDSSIRSNLDIDRFEE 135 Query: 365 LPVLVPPIKEQFDITNV 381 + +P I Q ++ Sbjct: 136 ISFELPDISVQRKYVDI 152 >gi|293115503|ref|ZP_05791811.2| phosphoribosylformylglycinamidine synthase [Butyrivibrio crossotus DSM 2876] gi|292809622|gb|EFF68827.1| phosphoribosylformylglycinamidine synthase [Butyrivibrio crossotus DSM 2876] Length = 59 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 10/58 (17%), Positives = 17/58 (29%), Gaps = 3/58 (5%) Query: 22 PKHWKVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 P+ W + + G K I I +++ G Y S + S Sbjct: 2 PEGWAWCRLNSIVDVRDGTHDTPTYVDKGIPLITSKNLVEGGIDYSNVKYISEKDAIS 59 >gi|325989941|ref|YP_004249640.1| hypothetical protein Msui05930 [Mycoplasma suis KI3806] gi|323575026|emb|CBZ40686.1| hypothetical protein, putative HdsS fragment [Mycoplasma suis] Length = 82 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 30/69 (43%), Gaps = 4/69 (5%) Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++L + +K + +L+P I N I +EK+E + +E + + + Sbjct: 17 AIKNLSPQKLKEIEILIPD----QKILEKFNSFWKNIHSKIEKLELKMQKYEEIKKNLLD 72 Query: 413 AAVTGQIDL 421 + + +I + Sbjct: 73 SLFSQEIQV 81 >gi|260889171|ref|ZP_05900434.1| putative type I restriction modification DNA specificity domain protein [Leptotrichia hofstadii F0254] gi|260861231|gb|EEX75731.1| putative type I restriction modification DNA specificity domain protein [Leptotrichia hofstadii F0254] Length = 195 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 62/185 (33%), Gaps = 12/185 (6%) Query: 29 PIKRFTKLN---TGRTSESGKDIIYIGLEDVESGTGKYL-----PKDGNSRQSDTSTVSI 80 + + +T++S + + V +G + P+ + ++ + Sbjct: 2 KLGDNVDIIAPLNVKTADSETGYLLLNPTLVNNGKIESFENAEVPERYKNGKNKINEKYF 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSIDVTQRI- 137 K +L+ G + + +T + +L+ D + WLL ++ Sbjct: 62 VRKNDVLFQAKGSKIEVVYVDKGYENVLPATLYFILRANDRINPKYLQWLLKTELLLLYF 121 Query: 138 -EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLK 196 + + + + I + + +P EQ + E I + + I + ++ Sbjct: 122 EKKYKTMSAVRAVNKTDIVELDIDLPDREEQDRMVEIITSFENEEENTIEYLKIKKKYIE 181 Query: 197 EKKQA 201 EK A Sbjct: 182 EKILA 186 Score = 39.8 bits (91), Expect = 0.97, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 41/119 (34%), Gaps = 3/119 (2%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 V +++F+ + + + T ++ I+ YL WL+ Sbjct: 54 NKINEKYFVRKNDVLFQAKGSKIEVVYVDKGYE-NVLPATLYFILRANDRINPKYLQWLL 112 Query: 338 RSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 ++ L F +++ D+ L + +P +EQ + +I + +E Sbjct: 113 KTELLLLYFEKKYKTMSAVRAVNKTDIVELDIDLPDREEQDRMVEIITSFENEEENTIE 171 >gi|329963224|ref|ZP_08300961.1| hypothetical protein HMPREF9446_02554 [Bacteroides fluxus YIT 12057] gi|328528920|gb|EGF55860.1| hypothetical protein HMPREF9446_02554 [Bacteroides fluxus YIT 12057] Length = 129 Score = 40.5 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 44/124 (35%), Gaps = 7/124 (5%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP--KDGNSRQSDTSTVSIFAKGQ 85 +P+ K + R SG D+ + + + + G +D DTS + KG Sbjct: 2 IPLSELLKQCSDRN-RSGSDLQVLSVSN-KYGFIAQSNQFEDREVASDDTSNYKVVKKGM 59 Query: 86 ILYGKLGPYLRKAII--ADFDGICSTQFLVLQPKDV-LPELLQGWLLSIDVTQRIEAICE 142 Y + + D +GI S ++ K LP L+ + S + E Sbjct: 60 FAYNPARINVGSIALYEMDGNGIVSPMYVCFTTKSELLPSYLKYYFASQTFKHEMYKRLE 119 Query: 143 GATM 146 G+ Sbjct: 120 GSVR 123 >gi|297487354|ref|XP_002696242.1| PREDICTED: coiled-coil domain containing 40 [Bos taurus] gi|296476042|gb|DAA18157.1| coiled-coil domain containing 40 [Bos taurus] Length = 1125 Score = 40.5 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 9/40 (22%), Positives = 16/40 (40%), Gaps = 3/40 (7%) Query: 376 FDITNVINVETARIDVLVEKIEQSIVL---LKERRSSFIA 412 I ++ E +++ L+ E I L ER+ I Sbjct: 676 QKILAELDKEVKKVNDLINNSENEISRRTILIERKQGLIN 715 >gi|329948020|ref|ZP_08294921.1| hypothetical protein HMPREF9056_02839 [Actinomyces sp. oral taxon 170 str. F0386] gi|328523159|gb|EGF50260.1| hypothetical protein HMPREF9056_02839 [Actinomyces sp. oral taxon 170 str. F0386] Length = 60 Score = 40.5 bits (93), Expect = 0.49, Method: Composition-based stats. Identities = 9/55 (16%), Positives = 22/55 (40%), Gaps = 4/55 (7%) Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE----RRSSFIA 412 + V PPI +Q +I ++++ ++ L + I ++ R + Sbjct: 1 MSDTLVPAPPIDQQREIVHLLDKFDLLVNDLTSGLPAEIEARRKQYEYYRDRLLT 55 >gi|119488029|ref|ZP_01621473.1| hypothetical protein L8106_11547 [Lyngbya sp. PCC 8106] gi|119455318|gb|EAW36457.1| hypothetical protein L8106_11547 [Lyngbya sp. PCC 8106] Length = 511 Score = 40.5 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 28/262 (10%), Positives = 76/262 (29%), Gaps = 18/262 (6%) Query: 152 KGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLIT--ERIRFIELLKEKKQALV-SYIVT 208 ++ + Q I +T I + F E Q+ + Sbjct: 135 FKYRDLLPFVLNDDGQHFIVLDQDQKTQLILQKNGSLKLNSFPFFSVEILQSFQADHPTN 194 Query: 209 KGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL 268 L V K + + + ++T+L + ++ Sbjct: 195 MSLKNWVYFKTKQHPDIKNFLSDYGFQKISDWALLNRTRSTQLELLSTRDRLLVEAFHQV 254 Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 R+ + + P ++ LQ + + S ++ + + A + Sbjct: 255 YRRDRRQQSKGARKCPDPSPEQLQEMLSKLQEHQVIISSEALVFKDLKQVAKQLRQYEVW 314 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + + R ++ E ++ + EQ +I + ++ + ++ Sbjct: 315 SNRECLEVYNQDKNQYEI-------RPDIQQEYEPQIEI------EQQEIVDFLHQKLSK 361 Query: 389 I--DVLVEKIEQSIVLLKERRS 408 I + ++++ I LK+ R Sbjct: 362 ILSQAIQKEVQHKIHKLKKSRK 383 >gi|257078401|ref|ZP_05572762.1| HsdS protein [Enterococcus faecalis JH1] gi|256986431|gb|EEU73733.1| HsdS protein [Enterococcus faecalis JH1] Length = 249 Score = 40.5 bits (93), Expect = 0.52, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 59/184 (32%), Gaps = 15/184 (8%) Query: 23 KHWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI- 80 ++W++ ++ + G+ +E++ +G+ +YL + + T ++ Sbjct: 77 ENWELCKLENIIEKQIKGKAK----------VENLCNGSVEYLDANRLNGGKPIYTKALP 126 Query: 81 -FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEA 139 ++ I+ G K F G+ + Q K+ + +D I Sbjct: 127 DVSERDIIILWDGSKAGKVYY-GFKGVLGSTLKAYQLKECANS-QFIYQQLLDNQNNIYN 184 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 + H P+ + EQ + + + RI I L K Sbjct: 185 NYRTPNIPHVVKNFSSIFPIWMTSFEEQSQMADILSNLDNRIILQQNLTDTMISLKKSYL 244 Query: 200 QALV 203 Q + Sbjct: 245 QNMF 248 Score = 40.2 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 32/269 (11%), Positives = 88/269 (32%), Gaps = 24/269 (8%) Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + + IP E L+ + +ID + R ++ LKE K+A + Sbjct: 1 MKVFGISSSKVLDFTTYIPKNDETKLVSSFL----EKIDYALDLHQRKLDQLKELKKAYL 56 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGN 263 + K +++ + E + ++ + + K+ S+ Y + Sbjct: 57 QLMFPKKDETVPQVRFANFEENWEL------CKLENIIEKQIKGKAKVENLCNGSVEYLD 110 Query: 264 IIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV 323 R G KP + V +I+ + + K +G++ S A Sbjct: 111 A-----NRLNGGKPIYTKALPDVSERDIIILWDGSKAGKVY-----YGFKGVLGSTLKAY 160 Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN 383 + ++ + + ++ + + P+ + +EQ + +++ Sbjct: 161 QLKECANSQFIYQQLLDNQNNIYNNYRTPNIPHVVKNFSSIFPIWMTSFEEQSQMADIL- 219 Query: 384 VETARIDVLVEKIEQSIVLLKERRSSFIA 412 + +D + + + + S++ Sbjct: 220 ---SNLDNRIILQQNLTDTMISLKKSYLQ 245 >gi|195331494|ref|XP_002032436.1| GM23517 [Drosophila sechellia] gi|194121379|gb|EDW43422.1| GM23517 [Drosophila sechellia] Length = 422 Score = 40.5 bits (93), Expect = 0.53, Method: Composition-based stats. Identities = 9/35 (25%), Positives = 14/35 (40%) Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 EQ I + +D L +Q + LKE + Sbjct: 145 EQQRIAPNVEALDKELDELKRSEQQLLSELKELKK 179 >gi|315196000|gb|EFU26361.1| hypothetical protein CGSSa01_10249 [Staphylococcus aureus subsp. aureus CGS01] Length = 55 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 12/39 (30%), Positives = 19/39 (48%), Gaps = 2/39 (5%) Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I I+ +ID L+ K + I LLK+R+ + Sbjct: 16 QSKI--KIDNFFNKIDTLILKQGKKIELLKQRKQGLLQK 52 >gi|207108581|ref|ZP_03242743.1| hypothetical protein HpylH_03404 [Helicobacter pylori HPKX_438_CA4C1] Length = 29 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 6/25 (24%), Positives = 16/25 (64%) Query: 362 VKRLPVLVPPIKEQFDITNVINVET 386 ++++ + +PP+ EQ I N+++ Sbjct: 5 MQQIQIPIPPLDEQIAIANILSALD 29 >gi|126661170|ref|ZP_01732247.1| type I restriction-modification enzyme, S subunit, putative [Cyanothece sp. CCY0110] gi|126617543|gb|EAZ88335.1| type I restriction-modification enzyme, S subunit, putative [Cyanothece sp. CCY0110] Length = 191 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 11/144 (7%), Positives = 42/144 (29%), Gaps = 4/144 (2%) Query: 236 PFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF 295 +++ + ++ ++ Y I+ K + + + +I+ Sbjct: 52 KITEIISGQSPQSKFYNKNQQGLPFYQGKIEFGNMYLKEPKTWTTQITKESIKDDILMSV 111 Query: 296 IDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQ 355 + ++ I A++ + ++ + Sbjct: 112 RAPVGSL----NINRFDKICIGRGLAAIRSKAENVFIKYIYYFLLFNPELIVGTEGLIFS 167 Query: 356 SLKFEDVKRLPVLVPPIKEQFDIT 379 S+ + + ++ + +PP + Q I Sbjct: 168 SISRDQISKISIPLPPKEVQEQII 191 >gi|325911596|ref|ZP_08174004.1| hypothetical protein HMPREF0522_0060 [Lactobacillus iners UPII 143-D] gi|325476582|gb|EGC79740.1| hypothetical protein HMPREF0522_0060 [Lactobacillus iners UPII 143-D] Length = 267 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 58/198 (29%), Gaps = 17/198 (8%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNI-LSLSYGNIIQKLETRNMGLKPESYETYQI 285 +PD W + +V+ + E Y IQ + N K T + Sbjct: 76 EIPDSWRTEKLLNIVSWESNSQPPKSEFIYSPKDGYVRFIQNRDYENDSYKTYIPLTNNL 135 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 ID D +R + + P+ Y+ + S + Sbjct: 136 STVNRFDI-LIDKYGDAGVVRYGIEGAFNVALGKINVLYPNCQ--EYVRSFLESDGIYSY 192 Query: 346 FYAMG-SGLRQSLKFEDVKRLPVLVPP----IKEQFDITNVINVETARIDVLVEKIEQSI 400 + + R SL ++ L +++P ++ Q DI +I + Sbjct: 193 LHNSCMASTRASLNESNLDMLNIVIPDENSLLRYQEDI--------HQIRETILLNNSEN 244 Query: 401 VLLKERRSSFIAAAVTGQ 418 L R + + GQ Sbjct: 245 QNLISLRDWLLPMLMNGQ 262 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 61/203 (30%), Gaps = 10/203 (4%) Query: 10 YKDSG--VQWIG----AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGK 62 YK SG + W IP W+ + + + I V Sbjct: 60 YKSSGGKMVWNEQLKREIPDSWRTEKLLNIVSWESNSQPPKSEFIYSPKDGYVRFIQNRD 119 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLP 121 Y + T+ +S + IL K G + +G + + Sbjct: 120 YENDSYKTYIPLTNNLSTVNRFDILIDKYGDAG--VVRYGIEGAFNVALGKINVLYPNCQ 177 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 E ++ +L S + + C +T + + + + + IP + +E I I Sbjct: 178 EYVRSFLESDGIYSYLHNSCMASTRASLNESNLDMLNIVIPDENSLLRYQEDIHQIRETI 237 Query: 182 DTLITERIRFIELLKEKKQALVS 204 +E I L L++ Sbjct: 238 LLNNSENQNLISLRDWLLPMLMN 260 >gi|282849446|ref|ZP_06258831.1| hypothetical protein HMPREF1035_0399 [Veillonella parvula ATCC 17745] gi|282581150|gb|EFB86548.1| hypothetical protein HMPREF1035_0399 [Veillonella parvula ATCC 17745] Length = 583 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 40/98 (40%), Gaps = 4/98 (4%) Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 ++ + + + Y+A S + + SG + +S+ +D+K L + Sbjct: 479 IINGNLFSITIAPKYRNLYLLDYIAAFFNSTLGREQIERLASGSVIKSISIKDLKSLAIP 538 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 I++Q + +N I+ +E +++ + +E Sbjct: 539 NAAIEQQR---SFLNQTDKIIETRMELLKKLDEVNQEL 573 >gi|170717886|ref|YP_001784940.1| restriction modification system DNA specificity subunit [Haemophilus somnus 2336] gi|168826015|gb|ACA31386.1| restriction modification system DNA specificity domain [Haemophilus somnus 2336] Length = 98 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 14/101 (13%), Positives = 37/101 (36%), Gaps = 8/101 (7%) Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMG-SGLRQ---SLK 358 ++ ++ G+++ Y + ++ +L + K G +G R ++K Sbjct: 2 GPIKRNKLGRTGVMSPLYYIFRVTNVEQNFLEIFFETSIWHKFMKENGDNGARADRVAIK 61 Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 +P+ +P +EQ I +D + ++ Sbjct: 62 DSLFVEMPISIPQPQEQQKIGTF----FTALDRYITIHQRK 98 >gi|145631983|ref|ZP_01787735.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae R3021] gi|144982367|gb|EDJ89947.1| putative type I restriction-modification system specificity protein [Haemophilus influenzae R3021] Length = 61 Score = 40.5 bits (93), Expect = 0.55, Method: Composition-based stats. Identities = 7/53 (13%), Positives = 21/53 (39%), Gaps = 7/53 (13%) Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQSIVLLKER 406 ++ L + VP EQ I ++++ + +++ ++ +E Sbjct: 1 MIEDLRIPVPSFSEQQSIASILDKFETLTHSITEGLPLAIQQSQKRYEYYREL 53 >gi|183508854|ref|ZP_02958299.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 14 str. ATCC 33697] gi|182675590|gb|EDT87495.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 14 str. ATCC 33697] Length = 431 Score = 40.5 bits (93), Expect = 0.56, Method: Composition-based stats. Identities = 39/380 (10%), Positives = 99/380 (26%), Gaps = 24/380 (6%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 + I+ ++ G+ + + Y + N F Sbjct: 31 EFKKIEYVCEIKRGQVYSKEF------INSNKGNYPVYSSQSLNDGVLGNINKYDFDGEY 84 Query: 86 ILYGKLGPYLRKAIIADFDGICST--QFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEG 143 + + G Y + L + L ++L + + Sbjct: 85 VTWTTDGAYAGTVFYRKGKFSITNVCGILKVFDNSNLNTKYLSFILRKITKKHVNQASGN 144 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALV 203 + + I PI + V I +K+ T I+T + I + E + + Sbjct: 145 PKLMSNVMQEIIIPIPPISIQNKIVEILDKLETYTKDINTGLPLEIEQRKKQYEYYRNKL 204 Query: 204 SYIVTKGLNPDVKMKDSGIEWVGLVPD-----HWEVKPFFALVTELNRKNTKLIESNILS 258 + ++ I + + + + + K + +V + E Sbjct: 205 LDFDNIARERERELSRDYIWTLKNIYEKLVQNNVKYKKLWEIVNFDKKFKGVPKEKQNEI 264 Query: 259 LSYGNIIQKLETRNMGLKPES--------YETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 LS+ +I R + Y+ Y + + + ++ Sbjct: 265 LSFKHISANELKRYEKCNFGNVKLLSTGLYDGYIKYNENDNNINYGEIIALPSGGSPIIK 324 Query: 311 MERGII---TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPV 367 G + + K + + + + + ++ L + Sbjct: 325 YYNGYFIDSLNIIFSQKNKKECNLKFIYYFLIANKMLIEENYRGASVKHPNMIEIIELLI 384 Query: 368 LVPPIKEQFDITNVINVETA 387 +P I Q I +++ A Sbjct: 385 PIPHISIQNKIVEILDKLEA 404 >gi|328947968|ref|YP_004365305.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] gi|328448292|gb|AEB14008.1| restriction modification system DNA specificity domain protein [Treponema succinifaciens DSM 2489] Length = 192 Score = 40.2 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 15/179 (8%), Positives = 48/179 (26%), Gaps = 11/179 (6%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFR 294 F N N + +++ ++ ++ + V + Sbjct: 1 MNIFKSEFVEMFGNPIYNSKNFPTKKVIDVVTMQRGYDLPVQDRDSKGKIPVFGSNGILG 60 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKP-------HGIDSTYLAWLMRSYDLCKVFY 347 +L + + + + G + P + + +L + + Sbjct: 61 NHNLAKMDKGIITGRSGTIGEVYMCETPFWPLNTTLFSNDTHGNNICYLKFLLEFFDLKR 120 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 +L + ++ P+ Q + +ID ++Q + +K+ Sbjct: 121 FKSGVGVPTLNRNEFHDEQIIDVPLDLQNQFAAFV----QKIDKSKFVLQQQLQFIKKY 175 >gi|332077302|gb|EGI87764.1| type I restriction modification DNA specificity domain protein [Streptococcus pneumoniae GA17545] Length = 174 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 51/169 (30%), Gaps = 5/169 (2%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ + + + ++ + + + + Sbjct: 2 KKVKLGEVLSLKKGKKATVLAEQTTLSQRYIQIDDLRNNNNLKFTESLNMTEAL---PDD 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G + ST ++ + + +++ + + +Q + G Sbjct: 59 ILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKEKIISDYLGVFLESKSQYLRDHSTG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFI 192 AT+ H + + ++ + + + EQ I + I + Sbjct: 119 ATIPHLNKNILLDLQLELLGIEEQENIICILNTIKRLITKRKFQLDELN 167 Score = 36.7 bits (83), Expect = 7.3, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 40/142 (28%), Gaps = 5/142 (3%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH 326 N LK P +I+ + + + I Sbjct: 35 DDLRNNNNLKFTESLNMTEALPDDILIAWDGANAGTVGYGLSGAVGSTITVLKKNERYKE 94 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVET 386 I S YL + S + L + L + + I+EQ +I ++N Sbjct: 95 KIISDYLGVFLESKS-QYLRDHSTGATIPHLNKNILLDLQLELLGIEEQENIICILNT-- 151 Query: 387 ARIDVLVEKIEQSIVLLKERRS 408 I L+ K + + L R Sbjct: 152 --IKRLITKRKFQLDELNLTRQ 171 >gi|224373583|ref|YP_002607955.1| putative outer membrane autotransporter barrel domain protein [Nautilia profundicola AmH] gi|223588577|gb|ACM92313.1| putative outer membrane autotransporter barrel domain protein [Nautilia profundicola AmH] Length = 1070 Score = 40.2 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 27/338 (7%), Positives = 80/338 (23%), Gaps = 19/338 (5%) Query: 48 IIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYL---RKAIIADFD 104 I++ D+ + + + S+ I KG I+ + I + Sbjct: 171 ILWNNYGDIRNEGSMEIDNTAIGLNINYSSGKIINKGSIVVSNDTTGIILKENYGIIENT 230 Query: 105 GICSTQFLVLQPKDVLPELLQGWLLSIDVTQ---RIEAICEGATMSHADWKGIGNIPMPI 161 G T + K+ + + + I + G Sbjct: 231 GNIYTLGYTIYIKENNNGTILNAVNIPSLNISNTNRNTIFVNNNEGNITNHGTIVSLNEY 290 Query: 162 PPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSG 221 ++ + + T ID+ ++ V + + Sbjct: 291 GIKVDEDNMGNVLNDTTGTIDSNLSSIFIGSNNDNNITNNGTLISRNDSGIKVVGVNSNL 350 Query: 222 IEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYE 281 I+ G + + + + +I ++ N G+ + Sbjct: 351 IQNSGDINASVGINVGTNDNNGVITNSGNIISDANAGININITNNSGIVTNNGILTSDHN 410 Query: 282 TYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM---- 337 ++ Q I + + +K + D+ + ++ Sbjct: 411 NSIQIEENSGTVTNNGNITAYIYGIHIQNNNGNITNNGNITIKNYTADNNAIIYVYDNNG 470 Query: 338 ---------RSYDLCKVFYAMGSGLRQSLKFEDVKRLP 366 +Y+ + + ++ + + Sbjct: 471 TITNNGSMQSTYNSIHIQNNNEGTILNNINSTIIANIK 508 >gi|332522823|ref|ZP_08399075.1| hypothetical protein STRPO_0341 [Streptococcus porcinus str. Jelinkova 176] gi|332314087|gb|EGJ27072.1| hypothetical protein STRPO_0341 [Streptococcus porcinus str. Jelinkova 176] Length = 200 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 54/172 (31%), Gaps = 6/172 (3%) Query: 26 KVVPIKRFTKLNTGRTSES---GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFA 82 K + I + G+ S + I L D+ Y + Sbjct: 15 KKITIGDVVECFKGKAVSSKVEDGEFALINLSDMSLAGINYQNLRTFHLERRQLLRYFLE 74 Query: 83 KGQILYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEA 139 G +L G + + + S+ VL+P D L + L D+ Q ++ Sbjct: 75 DGDVLIASKGTVKKVCVFQKQKREIVASSNITVLRPLDKLRGYYIKFFLDSDIGQQLLDR 134 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 G + + K + IP+P PL +Q + + + I + Sbjct: 135 ADHGKDVINLSTKELLEIPVPAMPLVKQDYLISQYLRGLSEYQRKIQRAEQE 186 >gi|212711666|ref|ZP_03319794.1| hypothetical protein PROVALCAL_02741 [Providencia alcalifaciens DSM 30120] gi|212685768|gb|EEB45296.1| hypothetical protein PROVALCAL_02741 [Providencia alcalifaciens DSM 30120] Length = 500 Score = 40.2 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 11/34 (32%), Positives = 18/34 (52%) Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +++ I+ VE IEQ I L+++R S I Sbjct: 462 EILSEHFDNIEQKVEDIEQQIAELEKQRQSLINQ 495 >gi|169825864|ref|YP_001696022.1| hypothetical protein Bsph_0263 [Lysinibacillus sphaericus C3-41] gi|168990352|gb|ACA37892.1| hypothetical protein Bsph_0263 [Lysinibacillus sphaericus C3-41] Length = 228 Score = 40.2 bits (92), Expect = 0.64, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 44/147 (29%), Gaps = 10/147 (6%) Query: 30 IKRFTKLNTGRTSESGK-----DIIYIGLEDVE----SGTGKYLPKDGNSRQSDTSTVSI 80 ++ K+ G+ S K I YI E + LPK ++ + S S+ Sbjct: 11 LEEIAKIKIGKVVTSKKRFAMDGIPYITEEVLRKLSLEDNTSLLPKVDSTLKEPFS-FSL 69 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 IL K+ D S + + PK+ + + E Sbjct: 70 VPAQSILLNKMNLKEAYIYQCKTDVCISHDIMAIIPKESILIGDYLFHFMKWYQNNKERC 129 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQ 167 + M I + + + EQ Sbjct: 130 NVYSLMIDLPSIAIQHNVVQLINAVEQ 156 >gi|317494151|ref|ZP_07952567.1| hypothetical protein HMPREF0864_03336 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316917924|gb|EFV39267.1| hypothetical protein HMPREF0864_03336 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 195 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 46/151 (30%), Gaps = 12/151 (7%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 +S + + V G++V L + K ++ S + R + Sbjct: 40 FMSDCWQPGNTMNGDKEQVVRPTQAVLSVRTGDVVIS---LIHRKAAIVSPEHAGRLLSN 96 Query: 318 SAYMAV-KPHGIDSTYLAWLMRSYDLCKVFYAM---GSGLRQSLKFEDVKRLPVLVPPIK 373 + + + W + A+ GS L +V++ +PP+ Sbjct: 97 NYVRVEVDSRKVVPAWFVWHFNESRESRRQQALATQGSTFVLRLSLTEVRQFTATLPPLN 156 Query: 374 EQFDIT----NVINVETARIDVLVEKIEQSI 400 +Q I I + + L EQ I Sbjct: 157 KQKAIGGLYLATIEKRHYQ-ERLAALNEQQI 186 >gi|317481748|ref|ZP_07940779.1| type I restriction enzyme [Bifidobacterium sp. 12_1_47BFAA] gi|316916805|gb|EFV38196.1| type I restriction enzyme [Bifidobacterium sp. 12_1_47BFAA] Length = 153 Score = 40.2 bits (92), Expect = 0.65, Method: Composition-based stats. Identities = 17/136 (12%), Positives = 42/136 (30%), Gaps = 13/136 (9%) Query: 25 WKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ ++ G T I+++ +DV+ + + + + +T Sbjct: 20 WEQRKLENLASFGGGHTPSMADASNYVDGKILWVTSQDVKQHYIENTTTMISEKGA--AT 77 Query: 78 VSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQP-KDVLPELLQGWLLSIDV 133 ++++ I+ LR + V+Q L + ++ + Sbjct: 78 LTLYPSDSIVIVARSGILRHTIPVAKLRKPATVNQDIKVIQTVDSCDSSWLLQYFIASNK 137 Query: 134 TQRIEAICEGATMSHA 149 T E G T+ Sbjct: 138 TLLREYGKTGTTVESI 153 >gi|325663164|ref|ZP_08151614.1| hypothetical protein HMPREF0490_02355 [Lachnospiraceae bacterium 4_1_37FAA] gi|325470618|gb|EGC73848.1| hypothetical protein HMPREF0490_02355 [Lachnospiraceae bacterium 4_1_37FAA] Length = 646 Score = 40.2 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 4/42 (9%) Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 KEQ +I ++ L +K+EQ L ER+ I A Sbjct: 535 KEQQEIAAY----KREVEALKQKLEQKQERLDERKERIINEA 572 >gi|261344547|ref|ZP_05972191.1| proline permease [Providencia rustigianii DSM 4541] gi|282567461|gb|EFB72996.1| proline permease [Providencia rustigianii DSM 4541] Length = 500 Score = 40.2 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 11/34 (32%), Positives = 18/34 (52%) Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +++ I+ VE IEQ I L+++R S I Sbjct: 462 EILSEHFDNIEQKVEDIEQQISELEKQRQSLINQ 495 >gi|157804105|ref|YP_001492654.1| NAD-dependent DNA ligase LigA [Rickettsia canadensis str. McKiel] gi|157785368|gb|ABV73869.1| NAD-dependent DNA ligase LigA [Rickettsia canadensis str. McKiel] Length = 869 Score = 40.2 bits (92), Expect = 0.66, Method: Composition-based stats. Identities = 38/340 (11%), Positives = 98/340 (28%), Gaps = 16/340 (4%) Query: 46 KDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFD- 104 K+ +G ++ + ++ + + + + +L K R+ I+ F Sbjct: 489 KNYTVVGKKNSINSNILFIERYYDLLELGG-KLITVIDDSLLNAKNQASFREWILDRFHI 547 Query: 105 -GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 + S F + +L + + A ++ GN Sbjct: 548 KAVISLPFNAFVNASTTIKTSIIYLEKKEYKSISKNKIFMAICNNVGHDDSGNDTPERNN 607 Query: 164 LAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSY----------IVTKGLNP 213 L + D +I + + L + + Y Sbjct: 608 LNIVYSKWLDFNKDFSLPDIIIENQNKSELLTCSLQIFSIDYSKMSSKRFDAFFYSPELQ 667 Query: 214 DVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKL-ETRN 272 ++ K + ++ + + V +N N + + N + +++ Sbjct: 668 NIYKKINSLDKNKFIIKTSKEFTLQKSVNAKYVQNNFNTIFNYIEVGSCNKKGDIVSSQS 727 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 L V +I+ + + + + + T ++ +S Sbjct: 728 NNLGNLPTRARITVKAFDIITPKLIGCLYSTCIINNDINNSLVSTGFFVFTNLSERNSYL 787 Query: 333 LAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRL-PVLVP 370 L +RS + K FY + S + L E +++ + +P Sbjct: 788 LWSSLRSELVQKQFYYLSSTAVQPELSKEFLEKYVKIPIP 827 >gi|171920737|ref|ZP_02931947.1| conserved domain protein [Ureaplasma urealyticum serovar 13 str. ATCC 33698] gi|171903483|gb|EDT49772.1| conserved domain protein [Ureaplasma urealyticum serovar 13 str. ATCC 33698] Length = 85 Score = 40.2 bits (92), Expect = 0.67, Method: Composition-based stats. Identities = 8/66 (12%), Positives = 25/66 (37%), Gaps = 7/66 (10%) Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQS 399 Y + L ++ + + +PP+ Q I V++ A + + +++ ++ Sbjct: 12 YVNQASGNPKLMSNVMQEIVIPIPPLAIQNKIVEVLDKLEAYTENINVGLPLEIKQRKKQ 71 Query: 400 IVLLKE 405 + Sbjct: 72 YEYYRN 77 >gi|237738644|ref|ZP_04569125.1| predicted protein [Fusobacterium sp. 2_1_31] gi|229424127|gb|EEO39174.1| predicted protein [Fusobacterium sp. 2_1_31] Length = 225 Score = 40.2 bits (92), Expect = 0.68, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 61/184 (33%), Gaps = 12/184 (6%) Query: 235 KPFFALVTELNRKNTKLIESNILSLSYGNIIQKLE----TRNMGLKPESYETYQIVDPGE 290 + + ++E+ ++ YG+I +K + + E+Y ++ G+ Sbjct: 30 FNIKYMSKKDIFTKRDIVENGEPAIFYGDISRKYDCFVDEEITKINSEAYNRADKINKGQ 89 Query: 291 IVFRFIDLQNDKRS-LRSAQVMERGIITSAYMA-----VKPHGIDSTYLAWLMRSYDLCK 344 I+ D + + I ++ Y+ + + D+ + Sbjct: 90 ILVNLEDFDYEDIGRCIFYENDIPAAINGNVAILTLKEKFEDAVNLKYITFYLNYKDIVR 149 Query: 345 VF--YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 + + L + +P+ +P I+ Q I + + E +E++I L Sbjct: 150 QYVYDKAVGEKVKRLSRLYFEHIPITIPLIERQDKIIDNFIKVRKKFKNDFELLEKAIDL 209 Query: 403 LKER 406 + Sbjct: 210 ANKY 213 >gi|331086755|ref|ZP_08335832.1| hypothetical protein HMPREF0987_02135 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330409921|gb|EGG89356.1| hypothetical protein HMPREF0987_02135 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 790 Score = 40.2 bits (92), Expect = 0.69, Method: Composition-based stats. Identities = 13/42 (30%), Positives = 19/42 (45%), Gaps = 4/42 (9%) Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 KEQ +I ++ L +K+EQ L ER+ I A Sbjct: 533 KEQQEIAAY----KREVEALKQKLEQKQERLDERKERIINEA 570 >gi|159026445|emb|CAO88957.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 85 Score = 40.2 bits (92), Expect = 0.70, Method: Composition-based stats. Identities = 12/55 (21%), Positives = 29/55 (52%), Gaps = 2/55 (3%) Query: 333 LAWLMRSYDLCKVFYAMGSG--LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 +A + Y +VF+ + + + + E + +L + +PP+++Q +I+ IN Sbjct: 1 MAHIFNLYQHQQVFFKICTNWNNQSGVNVEVLGQLKIPLPPLEKQIEISEHINAI 55 >gi|321262603|ref|XP_003196020.1| ATPase; Ino80p [Cryptococcus gattii WM276] gi|317462495|gb|ADV24233.1| ATPase, putative; Ino80p [Cryptococcus gattii WM276] Length = 1813 Score = 40.2 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 35/84 (41%), Gaps = 13/84 (15%) Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-------IKEQFDITNVINVETARIDVL 392 K + G+ +LK E +KRL +++ P Q ++ + I ID+L Sbjct: 1113 EWFSKDIESSSGGVTGNLKPEQLKRLHMILKPFMLRRVKKHVQKELGDKIE-----IDLL 1167 Query: 393 VEKIEQSIVLLKERRSSF-IAAAV 415 V+ ++ + K R I+ + Sbjct: 1168 VDLSQRQREIYKALRQRVSISDLL 1191 >gi|213647547|ref|ZP_03377600.1| EcoKI restriction-modification system protein HsdS [Salmonella enterica subsp. enterica serovar Typhi str. J185] Length = 94 Score = 40.2 bits (92), Expect = 0.72, Method: Composition-based stats. Identities = 10/47 (21%), Positives = 20/47 (42%), Gaps = 4/47 (8%) Query: 16 QWIGAIPKHWKVVPIKRFTKL-NTGRTSESGK--DIIYIGLEDVESG 59 +W G +P+ W + G T++S D+ ++ D+ G Sbjct: 47 EW-GKLPEGWVTTHLSEICSKPQYGYTTKSSSMGDVKFLRTTDITKG 92 >gi|167751706|ref|ZP_02423833.1| hypothetical protein EUBSIR_02712 [Eubacterium siraeum DSM 15702] gi|167655514|gb|EDR99643.1| hypothetical protein EUBSIR_02712 [Eubacterium siraeum DSM 15702] Length = 149 Score = 40.2 bits (92), Expect = 0.74, Method: Composition-based stats. Identities = 15/110 (13%), Positives = 43/110 (39%), Gaps = 5/110 (4%) Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVL 368 V+E ++++ + ++ + + Y+A + + G ++++ D+ + +L Sbjct: 39 VIENSVLSTGFCGLQCNLLSFEYIATFIEHSYFETTKDTLAHGATQEAVNNNDLCNIMLL 98 Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQ 418 P + N+ + +T I + L + R + + GQ Sbjct: 99 NPS----ERVLNLYHEKTKEIYAQISNNICENQKLSQLRDWLLPMMMNGQ 144 >gi|87161681|ref|YP_492771.1| hypothetical protein SAUSA300_0052 [Staphylococcus aureus subsp. aureus USA300_FPR3757] gi|161508320|ref|YP_001573979.1| hypothetical protein USA300HOU_0056 [Staphylococcus aureus subsp. aureus USA300_TCH1516] gi|294850610|ref|ZP_06791335.1| hypothetical protein SKAG_02704 [Staphylococcus aureus A9754] gi|87127655|gb|ABD22169.1| hypothetical protein SAUSA300_0052 [Staphylococcus aureus subsp. aureus USA300_FPR3757] gi|160367129|gb|ABX28100.1| hypothetical protein USA300HOU_0056 [Staphylococcus aureus subsp. aureus USA300_TCH1516] gi|294822525|gb|EFG38969.1| hypothetical protein SKAG_02704 [Staphylococcus aureus A9754] Length = 60 Score = 40.2 bits (92), Expect = 0.75, Method: Composition-based stats. Identities = 12/39 (30%), Positives = 19/39 (48%), Gaps = 2/39 (5%) Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 Q I I+ +ID L+ K + I LLK+R+ + Sbjct: 21 QSKI--KIDNFFNKIDTLILKQGKKIELLKQRKQGLLQK 57 >gi|289423499|ref|ZP_06425301.1| putative type I restriction system specificity protein [Peptostreptococcus anaerobius 653-L] gi|289156133|gb|EFD04796.1| putative type I restriction system specificity protein [Peptostreptococcus anaerobius 653-L] Length = 155 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 14/102 (13%), Positives = 26/102 (25%), Gaps = 12/102 (11%) Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD---LCKVFYAMGSGLRQ----SLK 358 + + +S Y+ S + + + C G Sbjct: 53 SPVIIFDDFTTSSHYVDFPFKVKSSAMKLLTLNNPNDNIHCAYNVLQNIGFVPVSHGRHW 112 Query: 359 FEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQS 399 + L+P EQ I A ID L+ ++ Sbjct: 113 ISTFAKFKALLPKSADEQEKIGQY----FANIDNLITLHQRK 150 >gi|284119729|ref|ZP_06386787.1| hypothetical protein POR_1391 [Candidatus Poribacteria sp. WGA-A3] gi|283829433|gb|EFC33811.1| hypothetical protein POR_1391 [Candidatus Poribacteria sp. WGA-A3] Length = 55 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 13/23 (56%), Positives = 16/23 (69%) Query: 400 IVLLKERRSSFIAAAVTGQIDLR 422 I LL E R+ IAA VTG++D R Sbjct: 19 IELLHEYRTRLIAAVVTGKLDTR 41 >gi|332749085|gb|EGJ79508.1| hypothetical protein SFK671_5129 [Shigella flexneri K-671] Length = 63 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 17/41 (41%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L + + V +PP EQ I + IN A + L+ Sbjct: 1 MPKLNSDSFYNIIVAIPPYNEQQAIFDKINSIEAVCNGLIS 41 >gi|312277796|gb|ADQ62453.1| hypothetical protein STND_0386 [Streptococcus thermophilus ND03] Length = 42 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 7/31 (22%), Positives = 16/31 (51%) Query: 385 ETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 ++D + ++ + LLKE++ F+ V Sbjct: 2 FFEQLDNTITLHQRKLDLLKEQKKGFLQKMV 32 >gi|309800155|ref|ZP_07694341.1| type I restriction-modification enzyme 1, S subunit [Streptococcus infantis SK1302] gi|308116202|gb|EFO53692.1| type I restriction-modification enzyme 1, S subunit [Streptococcus infantis SK1302] Length = 164 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 52/160 (32%), Gaps = 5/160 (3%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQ 85 K V + L G+ ++ ++ L D N + +D+ ++ Sbjct: 2 KKVKLGEVISLKKGKKADIHTLQTSQSKRYIQ---IDDLRNDDNLKFTDSLNITEVLPED 58 Query: 86 ILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEAICEG 143 IL G I ST ++ Q + ++L + L +Q + G Sbjct: 59 ILIAWDGANAGTIGYGLSGAIGSTITVLKQNEYYKDKILSDYLALFLESKSQYLRDRATG 118 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDT 183 AT+ H + + N+ + + Q I + I Sbjct: 119 ATIPHLNKNILLNLQLELLHPEFQDNIVNTLNIIKRVIAK 158 >gi|149199878|ref|ZP_01876907.1| hypothetical protein LNTAR_25430 [Lentisphaera araneosa HTCC2155] gi|149137049|gb|EDM25473.1| hypothetical protein LNTAR_25430 [Lentisphaera araneosa HTCC2155] Length = 194 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 11/92 (11%), Positives = 33/92 (35%), Gaps = 5/92 (5%) Query: 293 FRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHG---IDSTYLAWLMRSYD--LCKVFY 347 FI + ++ + ++ +++ ++ ++ + I YL W + + Sbjct: 67 IAFISRGHHNYAVCAKEIKLPTVLSQHFIHIRVNDTSKILPEYLTWFLNVSYSAKKHLLK 126 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT 379 A ++ ++ + + P I Q I Sbjct: 127 ASQGSALPTITRAMMEAMLIETPSIAMQEKIV 158 >gi|327390255|gb|EGE88596.1| type I restriction-modification system, S subunit [Streptococcus pneumoniae GA04375] Length = 163 Score = 39.8 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 15/79 (18%), Positives = 28/79 (35%), Gaps = 1/79 (1%) Query: 20 AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP-KDGNSRQSDTSTV 78 IP W+ V IK E I D + Y + + Q+ + Sbjct: 83 DIPDTWEWVRIKSIYWNFGQNKPEKSFRYIDTSSIDRKKNIINYKNLQYLSPEQAPSRAR 142 Query: 79 SIFAKGQILYGKLGPYLRK 97 + ++ +L+ + PYL+ Sbjct: 143 KLVSQNSVLFSTVRPYLKI 161 >gi|126656152|ref|ZP_01727536.1| hypothetical protein CY0110_03679 [Cyanothece sp. CCY0110] gi|126622432|gb|EAZ93138.1| hypothetical protein CY0110_03679 [Cyanothece sp. CCY0110] Length = 301 Score = 39.8 bits (91), Expect = 0.84, Method: Composition-based stats. Identities = 11/78 (14%), Positives = 23/78 (29%), Gaps = 17/78 (21%) Query: 343 CKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK----------------EQFDITNVINVET 386 + + +L + ++ +PP EQ +I Sbjct: 85 REKQRLEAQKTQLNLSLQKLQSYQ-PLPPTAPNLPPTIKALPLNSYLEQEEIVEKEKTAI 143 Query: 387 ARIDVLVEKIEQSIVLLK 404 I+ +E E+ I L+ Sbjct: 144 TSIESQIEVKEKEIKYLQ 161 >gi|307255314|ref|ZP_07537126.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306861701|gb|EFM93683.1| Type I restriction-modification system S subunit [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] Length = 144 Score = 39.8 bits (91), Expect = 0.88, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 42/141 (29%), Gaps = 10/141 (7%) Query: 27 VVPIKRFTKLNTG------RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDT---ST 77 V ++ + + + + I YI +D G + D S Sbjct: 2 WVRLEDVCQEISDIDHKMPQEYKGKNGIPYISPKDFYDKNGIDFANAKKVSEEDYFLLSK 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDG-ICSTQFLVLQPKDVLPELLQGWLLSIDVTQR 136 K I++ + G II + + S ++ + + + + +L S Sbjct: 62 KFAPQKNDIIFPRYGTIGVVRIIEENIKLLVSYSCACIRVEYINMQYVVAYLNSELAKLE 121 Query: 137 IEAICEGATMSHADWKGIGNI 157 I+ T + K I Sbjct: 122 IKKYTNKTTQPNVGLKSIKKF 142 >gi|241760457|ref|ZP_04758550.1| periplasmic protein [Neisseria flavescens SK114] gi|241318961|gb|EER55463.1| periplasmic protein [Neisseria flavescens SK114] Length = 251 Score = 39.8 bits (91), Expect = 0.89, Method: Composition-based stats. Identities = 12/53 (22%), Positives = 28/53 (52%), Gaps = 1/53 (1%) Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 + E + + P + EQ I + + ++ AR++ VE++ Q + L+++R Sbjct: 37 PDIPREPLHEKNIPYPRLDEQTQI-DHLGIQIARLERTVEELNQRLHTLEQQR 88 >gi|295107663|emb|CBL05206.1| Restriction endonuclease S subunits [Gordonibacter pamelaeae 7-10-1-b] Length = 77 Score = 39.8 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 7/32 (21%), Positives = 14/32 (43%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 + + + VPPI+ Q +I V++ Sbjct: 1 MPRGDKKAIMDFYIPVPPIEVQEEIVRVLDSF 32 >gi|185178705|ref|ZP_02964523.1| conserved domain protein [Ureaplasma urealyticum serovar 5 str. ATCC 27817] gi|188024399|ref|ZP_02997061.1| conserved domain protein [Ureaplasma urealyticum serovar 7 str. ATCC 27819] gi|189009914|ref|ZP_02557182.2| conserved domain protein [Ureaplasma urealyticum serovar 11 str. ATCC 33695] gi|195867464|ref|ZP_03079468.1| conserved domain protein [Ureaplasma urealyticum serovar 9 str. ATCC 33175] gi|195869030|ref|ZP_03080021.1| conserved domain protein [Ureaplasma urealyticum serovar 12 str. ATCC 33696] gi|198273548|ref|ZP_03206084.1| conserved domain protein [Ureaplasma urealyticum serovar 4 str. ATCC 27816] gi|209554226|ref|YP_002284520.1| type I restriction modification DNA specificity family protein [Ureaplasma urealyticum serovar 10 str. ATCC 33699] gi|225551480|ref|ZP_03772426.1| conserved domain protein [Ureaplasma urealyticum serovar 8 str. ATCC 27618] gi|184209298|gb|EDU06341.1| conserved domain protein [Ureaplasma urealyticum serovar 5 str. ATCC 27817] gi|188018670|gb|EDU56710.1| conserved domain protein [Ureaplasma urealyticum serovar 7 str. ATCC 27819] gi|188997680|gb|EDU66777.1| conserved domain protein [Ureaplasma urealyticum serovar 11 str. ATCC 33695] gi|195659816|gb|EDX53196.1| conserved domain protein [Ureaplasma urealyticum serovar 12 str. ATCC 33696] gi|195660940|gb|EDX54193.1| conserved domain protein [Ureaplasma urealyticum serovar 9 str. ATCC 33175] gi|198250068|gb|EDY74848.1| conserved domain protein [Ureaplasma urealyticum serovar 4 str. ATCC 27816] gi|209541727|gb|ACI59956.1| type I restriction modification DNA specificity family protein [Ureaplasma urealyticum serovar 10 str. ATCC 33699] gi|225379295|gb|EEH01660.1| conserved domain protein [Ureaplasma urealyticum serovar 8 str. ATCC 27618] Length = 85 Score = 39.8 bits (91), Expect = 0.92, Method: Composition-based stats. Identities = 9/66 (13%), Positives = 25/66 (37%), Gaps = 7/66 (10%) Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQS 399 Y + L ++ + V +PP+ Q I V++ A + + +++ ++ Sbjct: 12 YVNQASGNPKLMSNVMQEIVVPIPPLAIQNKIVEVLDKLEAYTENINVGLPLEIKQRKKQ 71 Query: 400 IVLLKE 405 + Sbjct: 72 YEYYRN 77 >gi|162661179|gb|EDQ48693.1| predicted protein [Physcomitrella patens subsp. patens] Length = 866 Score = 39.8 bits (91), Expect = 0.95, Method: Composition-based stats. Identities = 10/44 (22%), Positives = 19/44 (43%), Gaps = 7/44 (15%) Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 ++ L + +Q + L+E+R I A RGE++ Sbjct: 731 EKLRQEMEQLRSRHQQELEKLEEQRDRLIEKA-------RGEAK 767 >gi|270651511|ref|ZP_06222246.1| putative type I restriction-modification system, S subunit [Haemophilus influenzae HK1212] gi|270317139|gb|EFA28758.1| putative type I restriction-modification system, S subunit [Haemophilus influenzae HK1212] Length = 58 Score = 39.8 bits (91), Expect = 0.98, Method: Composition-based stats. Identities = 12/56 (21%), Positives = 23/56 (41%), Gaps = 7/56 (12%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK-------DIIYIGLEDVESGTGKYLPKDGNSR 71 +WKV+ + + G T S K +I +I +D+ +Y+ K + Sbjct: 2 SNWKVMKLSEVATIVGGGTPSSSKSEYFENGNIPWITPKDLSGYNKRYISKGERNI 57 >gi|315650992|ref|ZP_07904029.1| conserved hypothetical protein [Eubacterium saburreum DSM 3986] gi|315486748|gb|EFU77093.1| conserved hypothetical protein [Eubacterium saburreum DSM 3986] Length = 170 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 49/137 (35%), Gaps = 6/137 (4%) Query: 50 YIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDG---I 106 Y+ + DV GK PK + + + + KG +L K+ P I + D + Sbjct: 2 YVEIGDVNVSDGKISPKLIDEKDLPANAKILPQKGDLLVSKVRPNRGAISIIEEDYSNLV 61 Query: 107 CSTQFLVLQPKDVLPELL---QGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPP 163 S F VL+ K + + L + + + G + + I ++P+PI Sbjct: 62 VSGAFAVLREKKESDYRVETLKTLLRTPIYSDWLLKFNVGTSYPVITDEDILSLPIPIIK 121 Query: 164 LAEQVLIREKIIAETVR 180 + I I Sbjct: 122 SNVEDEIASYIKQSMEY 138 >gi|228994626|ref|ZP_04154450.1| Type I restriction-modification system specificity subunit [Bacillus pseudomycoides DSM 12442] gi|228765111|gb|EEM13841.1| Type I restriction-modification system specificity subunit [Bacillus pseudomycoides DSM 12442] Length = 170 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 52/151 (34%), Gaps = 24/151 (15%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 I + + R++ + E + + D +V + K Q + + Sbjct: 19 ISQEKDRSIYVNKEKIKQEVLTDTESLVLHTL---TQKVVWFPPQFEGLLLTNNFMKISF 75 Query: 325 PHGIDSTYLAWLMR-SYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 +D ++ WL + K + SLK +VK + +++P +++Q I Sbjct: 76 FEKVDVHFMEWLFNEHPSIQKQIALFTEGSIISSLKLSNVKEIELVLPNVEKQTVIG--- 132 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 I LK+R+++ + Sbjct: 133 ----------------KIAQLKKRKTALLKE 147 >gi|296110703|ref|YP_003621084.1| Type I restriction-modification system specificity subunit [Leuconostoc kimchii IMSNU 11154] gi|295832234|gb|ADG40115.1| Type I restriction-modification system specificity subunit [Leuconostoc kimchii IMSNU 11154] Length = 199 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 52/133 (39%), Gaps = 2/133 (1%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKP 325 ++E + ++ +T ++ + +++ I + K S++ + + Sbjct: 48 SEIEDSAVEKTIKTEDTVEVAEENDMIISLISATSAKVSVQHQGYLISQNYVKLVPIDEN 107 Query: 326 HGIDSTYLAWLMRSYDLCKVF-YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 ++ + L S+ + K + + +K L + + PI++Q I Sbjct: 108 IIDENYVIYLLNESHLVKKQLARQLQGSNFVKVTIAILKNLEIPMIPIEKQRQIGKWYMK 167 Query: 385 ETARIDVLVEKIE 397 T R++ L +++E Sbjct: 168 -TNRLNTLRQRVE 179 >gi|157159784|ref|YP_001457102.1| DNA methylase family protein [Escherichia coli HS] gi|157065464|gb|ABV04719.1| putative DNA Methylase family [Escherichia coli HS] Length = 402 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 62/228 (27%), Gaps = 5/228 (2%) Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 G + I + + ++ + I+ +I ++ I E Sbjct: 171 RKFIGLRRYLLNEHSITKVIELPRNIFKRTEAKTHILIFNKKIMPHHKIQLHCITKDGEL 230 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 ++ D + E G + ++ + Sbjct: 231 SPPVLIRKEDAVERMDYSYHYNKNE--GKGFSTIGMLKNISIFRGRFNSKEITEHVFHTT 288 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 G+ N + + + I PG+I+ + K+ L I+ Sbjct: 289 KFSGDEKYIKFHCNSVEELKPSKLDVIAKPGDILIARVGRNFHKKIL--FVESGYSYISD 346 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRL 365 ++ G D L + S D + SG Q + + +K++ Sbjct: 347 CIFLIRASGGDKKKLFDFLCSQDGQEELSRASSGVAAQHITMDALKKI 394 >gi|309355870|emb|CAP38120.2| hypothetical protein CBG_21263 [Caenorhabditis briggsae AF16] Length = 541 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 26/230 (11%), Positives = 64/230 (27%), Gaps = 16/230 (6%) Query: 189 IRFIELLKEKKQALVSYIVTK--GLNPDVKMKDSGIEWVGLVPDHWEVKP-FFALVTELN 245 R + L + Q + G + + IE G L Sbjct: 305 CRLLVLWTKNDQKDDAESFKWILGNTKECPKCQAPIEKNGGCNHMTCNNKSCRHEFCWLC 364 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDL--QNDKR 303 N + + ++ G+ ++ N+ Y + ++ + + Sbjct: 365 MGNWIGHQQCNVFVATGDSNREKTLANLQRFEFFKTRYLGHQQSLKLENDVNTLRTDIRH 424 Query: 304 SLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVK 363 +R + K + LM SY + L D++ Sbjct: 425 KMRQLKEFFDLTTFQVIYLEKALNALTECRRTLMYSYIFAYYLEPNLNSKIFQLNQRDLE 484 Query: 364 RLPVLVPPIKEQFDITNVINVETAR--IDVLVEKIEQSIVLLKERRSSFI 411 EQ + ++ + ++ L +++ + +++RR S + Sbjct: 485 SAT-------EQL--SEILERKLEEDDLESLKQRVTEKYQYVEQRRQSLL 525 >gi|303243810|ref|ZP_07330150.1| restriction modification system DNA specificity domain protein [Methanothermococcus okinawensis IH1] gi|302485746|gb|EFL48670.1| restriction modification system DNA specificity domain protein [Methanothermococcus okinawensis IH1] Length = 106 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 14/103 (13%), Positives = 27/103 (26%), Gaps = 10/103 (9%) Query: 28 VPIKRFTK-LNTGRTSESGKDIIY-------IGLEDV--ESGTGKYLPKDGNSRQSDTST 77 V + + + G T + + + D+ + + S+ Sbjct: 2 VRLGDIAEKIKAGGTPLRKNKEYWENGTINLVKISDITKSNKYLLDTEEKITENGLKNSS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVL 120 + +G IL G I I +L KD Sbjct: 62 AWLVNEGSILLSMYGTVGEVVINKIPVAITQNIAGILLKKDNN 104 >gi|257438271|ref|ZP_05614026.1| putative toxin-antitoxin system, toxin component [Faecalibacterium prausnitzii A2-165] gi|257199348|gb|EEU97632.1| putative toxin-antitoxin system, toxin component [Faecalibacterium prausnitzii A2-165] Length = 108 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 11/91 (12%), Positives = 29/91 (31%), Gaps = 8/91 (8%) Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 L + + ++ + + ++ D ++ + V I Sbjct: 19 LVYFYTLKAVDRLKHKASGAVFDAITTRDFDSEQIMKLSDDDAKAFLCVAEPMFQEI--- 75 Query: 393 VEKIEQSIVLLK--ERRSSFIAAAVTGQIDL 421 + SI L+ R + ++G+ID+ Sbjct: 76 ---LNNSIENLRLSTLRDFLLPKLMSGEIDV 103 >gi|257421716|ref|ZP_05598706.1| predicted protein [Enterococcus faecalis X98] gi|257163540|gb|EEU93500.1| predicted protein [Enterococcus faecalis X98] Length = 146 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 11/95 (11%), Positives = 37/95 (38%), Gaps = 6/95 (6%) Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITN 380 + + P L+ ++ K +G+ +++ +++ P+ + + Q Sbjct: 57 VVIIPQNGIEPKYFNLILQRNVDKFIAKYATGI--NIQEKEIGNFPIELFNRETQKAFVR 114 Query: 381 VINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 +++ T + E + + KE + +F+ + Sbjct: 115 MMDHITDE----IATAENELTIYKEMKRAFLGDLM 145 >gi|37680386|ref|NP_934995.1| type I restriction-modification system methyltransferase subunit [Vibrio vulnificus YJ016] gi|37199133|dbj|BAC94966.1| type I restriction-modification system methyltransferase subunit [Vibrio vulnificus YJ016] Length = 638 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 9/70 (12%), Positives = 27/70 (38%), Gaps = 4/70 (5%) Query: 332 YLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDV 391 YL +L ++ ++ + + + V +P +++Q + ++ I+ Sbjct: 552 YLPYLAHVLKSLELNNLATGTAQKFISINKLYEVEVSLPSLEKQRE----MSEWFTSIEE 607 Query: 392 LVEKIEQSIV 401 KI+ + Sbjct: 608 SKSKIQSLLA 617 Score = 36.3 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 44/148 (29%), Gaps = 14/148 (9%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 V +K KL +G + ++SG +G + I Sbjct: 467 QVKLKDICKLRSGDKLNKSE--------VMDSGEFPVYGGNGVIGFNVEPNR---HGDSI 515 Query: 87 LYGKLGPYLRKAII-ADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGAT 145 + GK+G + + S + ++ +L + + + + G Sbjct: 516 VIGKVGAHCGNIHFSTQPYWLTSNAMSLELLDTT--KVYLPYLAHVLKSLELNNLATGTA 573 Query: 146 MSHADWKGIGNIPMPIPPLAEQVLIREK 173 + + + +P L +Q + E Sbjct: 574 QKFISINKLYEVEVSLPSLEKQREMSEW 601 >gi|319777295|ref|YP_004136946.1| type i restriction-modification system, s subunit [Mycoplasma fermentans M64] gi|318038370|gb|ADV34569.1| Type I restriction-modification system, S subunit [Mycoplasma fermentans M64] Length = 170 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 20/157 (12%), Positives = 50/157 (31%), Gaps = 4/157 (2%) Query: 237 FFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFI 296 ++ K K I S + N L ++ L +++ +IV Sbjct: 11 ISYDKNDILTKMDKNYIRIIRSGNIQNSRLILFDDDIFLPVFYKNNIKMLHYNDIVIMAS 70 Query: 297 DLQNDKRSLRSA--QVMERGIITSAYMAVKPHGIDST-YLAWLMRSYDLCKVFYAMGSGL 353 + + + ++ I + ++P+ + YL + S +G Sbjct: 71 TGSKNLIGKPAFVEEQLDNVYIGAFLRIIRPNINNIFDYLKLIFMSEYYRSEIRKNVNGT 130 Query: 354 -RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARI 389 ++ + + + +P IK + I+ I + Sbjct: 131 NINNVNSNILLNMLIPIPSIKNERKISKKIYQVLNIL 167 >gi|237721652|ref|ZP_04552133.1| type I restriction enzyme EcoEI specificity protein [Bacteroides sp. 2_2_4] gi|229449448|gb|EEO55239.1| type I restriction enzyme EcoEI specificity protein [Bacteroides sp. 2_2_4] Length = 159 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 11/137 (8%), Positives = 35/137 (25%), Gaps = 2/137 (1%) Query: 242 TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQND 301 + +K L G ++ ++ + Sbjct: 11 MKEWKKYKIGDVFAYLKSGKGIHANEISSKGEYPVYGGNGVRGYTTRNNFEGNCAIIGRQ 70 Query: 302 KRSLRSAQVME-RGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFE 360 + + + + +T + + +ST + + + G + L Sbjct: 71 GAFCGNVRYFKRKAYMTEHAIIAVANENNSTRFLSYLL-GIIMNLGRFSGQSAQPGLSVT 129 Query: 361 DVKRLPVLVPPIKEQFD 377 ++ + + VP + Q Sbjct: 130 ELAKQSITVPSLSVQKR 146 >gi|319638157|ref|ZP_07992920.1| periplasmic protein [Neisseria mucosa C102] gi|317400430|gb|EFV81088.1| periplasmic protein [Neisseria mucosa C102] Length = 251 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 12/54 (22%), Positives = 29/54 (53%), Gaps = 1/54 (1%) Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 + E + + P + EQ I + + ++ AR++ VE++ Q + L+++R+ Sbjct: 37 PDIPREPLPEKNIPYPRLDEQTQI-DHLGIQIARLERTVEELNQRLHTLEQQRT 89 >gi|53729156|ref|ZP_00348330.1| COG0732: Restriction endonuclease S subunits [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 114 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 7/107 (6%), Positives = 33/107 (30%), Gaps = 6/107 (5%) Query: 311 MERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 + ++ + S ++ + + + + + ++ + +++P Sbjct: 6 PYSFVTNNSLVIEHSKSFLS--YFYIYEALRIQTLVELTTGSAQPQMTIANMNPVQIILP 63 Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 I N+ + + + + L++ R + + G Sbjct: 64 T----DKIHNLYTSQVKYLYEKIYRNNLENEQLEKIRDELLPKLLNG 106 >gi|301633184|gb|ADK86738.1| conserved hypothetical protein [Mycoplasma pneumoniae FH] Length = 65 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 10/45 (22%), Positives = 22/45 (48%) Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + + + PP++ Q I +++ + LVE I I + K++ Sbjct: 1 MAEIELSFPPLEIQEKIADILFAFEKLCNDLVEGIPAEIEMRKKQ 45 >gi|332076343|gb|EGI86806.1| type I restriction enzyme EcoKI specificity [Streptococcus pneumoniae GA41301] Length = 163 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 10/103 (9%), Positives = 29/103 (28%), Gaps = 7/103 (6%) Query: 281 ETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA----VKPHGIDSTYLAWL 336 + +++ G++ ++ + I S +L + Sbjct: 61 SEQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQLTPFESSEIISKFLLFN 120 Query: 337 MRSYDLCKVFY---AMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + S K + ++ + L + + P +EQ Sbjct: 121 LSSPLFYKQLKAITKLSGQALYNIPKTTLSELLIPLAPFEEQE 163 Score = 37.5 bits (85), Expect = 4.3, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 36/102 (35%), Gaps = 11/102 (10%) Query: 24 HWKVVPIKRFTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +W V+ IK +NTG + + K + I +++ L D S+ Sbjct: 2 NWVVIKIKDIFSMNTGLSYKKGDLSINNKGVRIIRGGNIKPLEFSLLDNDYYIDTQFISS 61 Query: 78 VSIFAKGQILYGKLGPYLRKAIIA-----DFDGICSTQFLVL 114 ++ K L + L D+DG+ + F+ Sbjct: 62 EQVYLKHNQLITPVSTSLEHIGKFARIDKDYDGVVAGGFIFQ 103 >gi|58266666|ref|XP_570489.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] gi|134110324|ref|XP_775989.1| hypothetical protein CNBD0390 [Cryptococcus neoformans var. neoformans B-3501A] gi|74685408|sp|Q5KHM0|INO80_CRYNE RecName: Full=Putative DNA helicase INO80 gi|50258657|gb|EAL21342.1| hypothetical protein CNBD0390 [Cryptococcus neoformans var. neoformans B-3501A] gi|57226722|gb|AAW43182.1| conserved hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] Length = 1765 Score = 39.4 bits (90), Expect = 1.2, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 34/84 (40%), Gaps = 13/84 (15%) Query: 340 YDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-------IKEQFDITNVINVETARIDVL 392 K + G+ +LK E +KRL +++ P Q ++ + I ID+L Sbjct: 1065 EWFSKDIESSSGGVTGNLKPEQLKRLHMILKPFMLRRVKKHVQKELGDKIE-----IDLL 1119 Query: 393 VEKIEQSIVLLKERRSSF-IAAAV 415 V+ ++ + K R I + Sbjct: 1120 VDLSQRQREIYKALRQRVSITDLL 1143 >gi|332362405|gb|EGJ40205.1| hypothetical protein HMPREF9393_0203 [Streptococcus sanguinis SK1056] Length = 56 Score = 39.4 bits (90), Expect = 1.3, Method: Composition-based stats. Identities = 10/31 (32%), Positives = 18/31 (58%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGR 40 K+SG+ WIG IP+ W+V + + + + Sbjct: 4 MKESGIDWIGQIPEEWEVAKVNHIFEEHKQK 34 >gi|288802386|ref|ZP_06407826.1| hypothetical protein HMPREF0660_00831 [Prevotella melaninogenica D18] gi|288335353|gb|EFC73788.1| hypothetical protein HMPREF0660_00831 [Prevotella melaninogenica D18] Length = 459 Score = 39.4 bits (90), Expect = 1.3, Method: Composition-based stats. Identities = 22/201 (10%), Positives = 64/201 (31%), Gaps = 28/201 (13%) Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEI 291 W R + + I + +++ + + Y V G++ Sbjct: 45 WGENGLVEEAYHGPRAKRNYLPTGIPFIGSSEMLEVKPNPTKFVDKSFLDNYG-VRRGQV 103 Query: 292 VFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGS 351 + + + +E ++ + + + Y+ + + + + Sbjct: 104 LLSCSGTIGRTSFV--NRTLEGYCVSQHALKITANYA--GYVYAYLSTEVGKSIVKSFTY 159 Query: 352 GLR-QSLKFEDVKRLPVLVPPIKEQFD------ITNVINVETARIDVLVEKIEQSIVLLK 404 G ++ E +K LP+ P +E I + + + L+++ +Q Sbjct: 160 GAVIDEIEPEHLKNLPIPNAP-EEIKRSIHNAVIASY--DLRDQSNDLIDEAQQ------ 210 Query: 405 ERRSSFIAAAVT--GQIDLRG 423 + A++ G++DL+ Sbjct: 211 -----LLYEALSLPGKMDLKP 226 >gi|313893238|ref|ZP_07826814.1| type I restriction modification DNA specificity domain protein [Veillonella sp. oral taxon 158 str. F0412] gi|313442217|gb|EFR60633.1| type I restriction modification DNA specificity domain protein [Veillonella sp. oral taxon 158 str. F0412] Length = 185 Score = 39.4 bits (90), Expect = 1.3, Method: Composition-based stats. Identities = 12/117 (10%), Positives = 42/117 (35%), Gaps = 3/117 (2%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + L + + +I G +++ + L+ + I+ K + Sbjct: 1 EFITLDGLNNSSAKIFPKGTLLYTIFATIGEVAILKMDAATNQAIVGIQLKENKKVYLKY 60 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETA 387 Y ++ ++ ++ + + ++ VK + + + +++Q +I +N Sbjct: 61 IYYYLKSQTNNIKQLGRGVA---QNNINLSVVKNMIIPIVSLEKQSNIIATLNKLEK 114 Score = 37.1 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 52/143 (36%), Gaps = 2/143 (1%) Query: 66 KDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQ 125 + + S+ IF KG +LY + + + I D + + +Q K+ L+ Sbjct: 1 EFITLDGLNNSSAKIFPKGTLLYT-IFATIGEVAILKMDAATNQAIVGIQLKENKKVYLK 59 Query: 126 GWLLSIDVT-QRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTL 184 + I+ + G ++ + + N+ +PI L +Q I + Sbjct: 60 YIYYYLKSQTNNIKQLGRGVAQNNINLSVVKNMIIPIVSLEKQSNIIATLNKLEKIKGNR 119 Query: 185 ITERIRFIELLKEKKQALVSYIV 207 IT +L+K + L V Sbjct: 120 ITILNCLDDLIKSRFVELFGDPV 142 >gi|289164609|ref|YP_003454747.1| coiled-coil protein [Legionella longbeachae NSW150] gi|288857782|emb|CBJ11626.1| putative coiled-coil protein [Legionella longbeachae NSW150] Length = 2937 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 29/82 (35%), Gaps = 24/82 (29%) Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS----------IVLLKERRS- 408 ++ + + I EQ I+ I ++I + E I + I +ERR Sbjct: 2709 TNLDEFFIAL--INEQSRISKNIEDIRSKIHNIEELIHKQENEIFETGSRIKAAQERRQQ 2766 Query: 409 ---------SFIAAAV--TGQI 419 S ++ V G+I Sbjct: 2767 PDCGYIESASLMSQVVYHQGKI 2788 >gi|332768709|gb|EGJ98888.1| hypothetical protein SF293071_0004 [Shigella flexneri 2930-71] gi|333009035|gb|EGK28491.1| hypothetical protein SFK218_0154 [Shigella flexneri K-218] gi|333022346|gb|EGK41584.1| hypothetical protein SFK304_0028 [Shigella flexneri K-304] Length = 74 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 17/41 (41%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L + + V +PP EQ I + IN A + L+ Sbjct: 12 MPKLNSDSFYNIIVAIPPYNEQQAIFDKINSIEAVCNGLIS 52 >gi|50086401|ref|YP_047911.1| putative restriction-modification system [Acinetobacter sp. ADP1] gi|49532377|emb|CAG70089.1| conserved hypothetical protein; putative restriction-modification system [Acinetobacter sp. ADP1] Length = 197 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 18/146 (12%), Positives = 51/146 (34%), Gaps = 12/146 (8%) Query: 269 ETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 + + + L+ Q + ++ + + Q +++ +++ ++ V + Sbjct: 46 DDQLVDLEWSYDSKPQYLKHNSLIVVARG--EPRAYVFKGQQVDQVAVSNQFIVVNLNID 103 Query: 329 D--STYLAWLMR-SYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITNVINV 384 + +LAW S + F G L +K +++P + +Q +I + Sbjct: 104 NIKPEFLAWYFNHSQAMRSYFEMNSRGSLLMMLSISTLKEAEIVIPSMFQQEEILRLAEE 163 Query: 385 ETARIDVLVEKIEQSIVLLK-ERRSS 409 I + + L+ E + Sbjct: 164 AHNE-----ALIFKQLTALRAEYNQA 184 >gi|123468897|ref|XP_001317664.1| hypothetical protein [Trichomonas vaginalis G3] gi|121900403|gb|EAY05441.1| hypothetical protein TVAG_197420 [Trichomonas vaginalis G3] Length = 1033 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 9/59 (15%), Positives = 23/59 (38%), Gaps = 4/59 (6%) Query: 360 EDVKRLPVLVPPIKE----QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 E + + +KE Q + E D ++++++ + LK ++ + I Sbjct: 860 EIIDNYEKAIESLKENSENQRQTIEKLTNEIKTFDAKIKELQKQLSKLKRKKKTLIEEV 918 >gi|297625331|ref|YP_003687094.1| methylase_S, type I restriction enzyme [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921096|emb|CBL55643.1| Methylase_S, type I restriction enzyme [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 92 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 5/27 (18%), Positives = 14/27 (51%) Query: 361 DVKRLPVLVPPIKEQFDITNVINVETA 387 + + + +PP+ Q +I +++ T Sbjct: 1 MILKFQIPLPPLVVQHEIVKILDTFTN 27 >gi|313890119|ref|ZP_07823754.1| conserved hypothetical protein [Streptococcus pseudoporcinus SPIN 20026] gi|313121480|gb|EFR44584.1| conserved hypothetical protein [Streptococcus pseudoporcinus SPIN 20026] Length = 198 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 51/168 (30%), Gaps = 6/168 (3%) Query: 30 IKRFTKLNTGR---TSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 + + G+ + + I L D+ Y + G + Sbjct: 18 LGEVVECFKGKAVSSKVGDGEFALINLSDMTLAGINYQNLRTFHLERRQLLRYFLEDGDV 77 Query: 87 LYGKLGPYLRKAIIADFD--GICSTQFLVLQPKDVLPELLQGWLLSIDV-TQRIEAICEG 143 L G + + + S+ VL+P D L + L D+ ++ G Sbjct: 78 LIASKGTVKKVCVFQKQKREVVASSNITVLRPLDKLRGYYIKFFLDSDIGQGLLDRADHG 137 Query: 144 ATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRF 191 + + K + IP+P PL +Q + + + I + Sbjct: 138 KDVINLSTKELLEIPVPAMPLVKQDYLINQYLRGLSEYQRKIKRAEQE 185 >gi|330937287|gb|EGH41298.1| Type I restriction enzyme (modification subunit) [Pseudomonas syringae pv. pisi str. 1704B] Length = 223 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 21/173 (12%), Positives = 53/173 (30%), Gaps = 8/173 (4%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + + K + E + +I+ ++ + ++V Sbjct: 42 HFEIIRPRQHHMGLKGVPVEEVQAQDIPSFGLIRHATMLSVHDLDGPNSFDYFLKAKDVV 101 Query: 293 FRFIDLQNDKRSLRSAQVMERGII----TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 + A + G + A + + + L +RS Sbjct: 102 ICIKGAIGRVGCISKAPLPGPGGWVSGQSVAVLRSRGTDYAAHALMMYLRSPKGQAALRR 161 Query: 349 MGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT-NVINVETARIDVLVEKIEQS 399 + G +++ + +K + + Q D+ V+ ET ID +E+++Q Sbjct: 162 LVVGTSTPTIQAKALKGFQIPILT-AVQSDMALEVLEAETD-IDYQIEQLQQK 212 >gi|330997645|ref|ZP_08321490.1| hypothetical protein HMPREF9442_02590 [Paraprevotella xylaniphila YIT 11841] gi|329570173|gb|EGG51913.1| hypothetical protein HMPREF9442_02590 [Paraprevotella xylaniphila YIT 11841] Length = 1053 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 28/77 (36%), Gaps = 4/77 (5%) Query: 333 LAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINV--ETAR 388 L +L+ + + G + E ++ LP+ VP + Q I + Sbjct: 973 LYYLLGILNSSMADQLLTDQRGGDYHIYPEHIRNLPIPVPQREIQNAIGEIAKQILLIRE 1032 Query: 389 IDVLVEKIEQSIVLLKE 405 + ++E+ + L E Sbjct: 1033 TNTDYSELEEQLNNLVE 1049 >gi|309810128|ref|ZP_07703974.1| conserved hypothetical protein [Lactobacillus iners SPIN 2503V10-D] gi|308169627|gb|EFO71674.1| conserved hypothetical protein [Lactobacillus iners SPIN 2503V10-D] Length = 230 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 58/198 (29%), Gaps = 17/198 (8%) Query: 227 LVPDHWEVKPFFALVTELNRKNTKLIESNI-LSLSYGNIIQKLETRNMGLKPESYETYQI 285 +PD W + +V+ + E Y IQ + N K T + Sbjct: 39 EIPDSWRTEKLLNIVSWESNSQPPKSEFIYSPKDGYVRFIQNRDYENDSYKTYIPLTNNL 98 Query: 286 VDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKV 345 ID D +R + + P+ Y+ + S + Sbjct: 99 STVNRFDI-LIDKYGDAGVVRYGIEGAFNVALGKINVLYPNCQ--EYVRSFLESDGIYSY 155 Query: 346 FYAMG-SGLRQSLKFEDVKRLPVLVPP----IKEQFDITNVINVETARIDVLVEKIEQSI 400 + + R SL ++ L +++P ++ Q DI +I + Sbjct: 156 LHNSCMASTRASLNESNLDMLNIVIPDENSLLRYQEDI--------HQIRETILLNNSEN 207 Query: 401 VLLKERRSSFIAAAVTGQ 418 L R + + GQ Sbjct: 208 QNLISLRDWLLPMLMNGQ 225 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 61/203 (30%), Gaps = 10/203 (4%) Query: 10 YKDSG--VQWIG----AIPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVE-SGTGK 62 YK SG + W IP W+ + + + I V Sbjct: 23 YKSSGGKMVWNEQLKREIPDSWRTEKLLNIVSWESNSQPPKSEFIYSPKDGYVRFIQNRD 82 Query: 63 YLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQF-LVLQPKDVLP 121 Y + T+ +S + IL K G + +G + + Sbjct: 83 YENDSYKTYIPLTNNLSTVNRFDILIDKYGDAG--VVRYGIEGAFNVALGKINVLYPNCQ 140 Query: 122 ELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRI 181 E ++ +L S + + C +T + + + + + IP + +E I I Sbjct: 141 EYVRSFLESDGIYSYLHNSCMASTRASLNESNLDMLNIVIPDENSLLRYQEDIHQIRETI 200 Query: 182 DTLITERIRFIELLKEKKQALVS 204 +E I L L++ Sbjct: 201 LLNNSENQNLISLRDWLLPMLMN 223 >gi|148978192|ref|ZP_01814722.1| putative specificity protein s [Vibrionales bacterium SWAT-3] gi|145962614|gb|EDK27890.1| putative specificity protein s [Vibrionales bacterium SWAT-3] Length = 139 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 44/138 (31%), Gaps = 7/138 (5%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYD--LCK 344 +PG+ + + ++ Q + ++ + + P + L +L + D + + Sbjct: 2 NPGDTIVGTVRP-GNRSFAYIGQTEQPLTGSTGFAVLTPKEEFWSSLVYLATTNDDSIDE 60 Query: 345 VFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLK 404 G ++K V +P I T + + L Sbjct: 61 YARLADGGAYPAIKPAVVAETECAIPTGD----IAKKFWEITGPMLKKANQNRLENEELA 116 Query: 405 ERRSSFIAAAVTGQIDLR 422 R + + ++G I+L Sbjct: 117 ALRDTLLPKLLSGDIELP 134 >gi|300704795|ref|YP_003746398.1| hypothetical protein RCFBP_20620 [Ralstonia solanacearum CFBP2957] gi|299072459|emb|CBJ43807.1| conserved protein of unknown function [Ralstonia solanacearum CFBP2957] Length = 34 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 6/32 (18%), Positives = 15/32 (46%) Query: 390 DVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 D + ++E + ++ + +TG+I L Sbjct: 2 DTEIAELEAKLAKARDVEQGMMQQLLTGKIRL 33 >gi|320536512|ref|ZP_08036542.1| conserved domain protein [Treponema phagedenis F0421] gi|320146638|gb|EFW38224.1| conserved domain protein [Treponema phagedenis F0421] Length = 549 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 40/359 (11%), Positives = 90/359 (25%), Gaps = 49/359 (13%) Query: 24 HWKVVPIKRFT-KLNTGRTSESGK----DIIYIGLEDVESGTGKYLPKDGNSRQSDTSTV 78 WK K+ G+ +I YI + +G ++ GN R+ Sbjct: 204 EWKAFKFNEIFRKIKRGKRLTKANQITGNIPYISSTALNNGIDNFIKNSGNVRKG----- 258 Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 K + G ++ I S LQ + Sbjct: 259 ----KNALTVANSGSV-GSCFYHCYEYIASDHVTSLQASNAD------------------ 295 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + I L E+ +I E ++ + +I + E Sbjct: 296 ------------KYIYLFMSTIIKRLEEKYSFNREINDERIKAEKIILPIDKNGNPHWEY 343 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + + ++ + I V + + E ++ I+S Sbjct: 344 MSKFMQKLEVEKISNFLPYIYIYIYKVACSIEKTVYNITSSKWQEFWIEDICTIKSGQRL 403 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 + + + + + + + + + + + I + Sbjct: 404 VKAQQQMGTIPFIGASDSDNGITAFISNINSSVDKNVLGVNYNGSVVHNFYHPYKCIFSD 463 Query: 319 AYMAVKPHGID-STYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQF 376 + +L L + G E +KR +++P I EQ Sbjct: 464 DVKRLHFKCTPAKNEATYLFLKQALLQQKGKYTYG--YKFTGERMKRQKIILP-ITEQQ 519 >gi|296110698|ref|YP_003621079.1| type I restriction enzyme specificity protein [Leuconostoc kimchii IMSNU 11154] gi|295832229|gb|ADG40110.1| type I restriction enzyme specificity protein [Leuconostoc kimchii IMSNU 11154] Length = 198 Score = 39.0 bits (89), Expect = 1.5, Method: Composition-based stats. Identities = 18/123 (14%), Positives = 44/123 (35%), Gaps = 7/123 (5%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK--PHGI 328 + + T + + G+++F + + ++ I++ +A P I Sbjct: 59 YGDSKLYDKWMTGKELYQGQVLFTTEAPMGNVAQVP---DDKKYILSQRVIAFNTLPDKI 115 Query: 329 DSTYLAWLMRSYD-LCKVFYAMGSGLRQSLKFEDVKRLPVLVPP-IKEQFDITNVINVET 386 +LA L+ + K+ G + + + + +L V + + EQ I Sbjct: 116 TDDFLAILLSTPLTFTKLHSLASGGTAKGVSQKSLSQLRVSISTYLNEQTKIGAFFKTLD 175 Query: 387 ARI 389 +I Sbjct: 176 QQI 178 >gi|294783677|ref|ZP_06749001.1| conserved hypothetical protein [Fusobacterium sp. 1_1_41FAA] gi|294480555|gb|EFG28332.1| conserved hypothetical protein [Fusobacterium sp. 1_1_41FAA] Length = 746 Score = 39.0 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 58/192 (30%), Gaps = 27/192 (14%) Query: 243 ELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK 302 E R K + + + + LK + + E + ++ K Sbjct: 552 EEYRATEKTSNIYLSISDINDGLIDFKNIETYLKNIPENQEKFLVKNEYILLSKYGKSPK 611 Query: 303 RSLRSAQVMERGIITSAYMAV--KPHGIDSTYLAWLMRSYDLCKVFYAMGSGL------- 353 ++ E+ I++ + + ID YLA L S K+ S Sbjct: 612 LAIVKNLGEEKVIVSGNLIIIEVDKKEIDPYYLAALFSSKKGIKILKEAYSNKDKAKAKE 671 Query: 354 --------------RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID----VLVEK 395 +L + +K L + +P + +I I+ L E Sbjct: 672 KDKEKDKEKDKDKENATLSIKKLKDLRIPIPSREICIEIALKYERILNEINKNKLKLKEL 731 Query: 396 IEQSIVLLKERR 407 I+ +LK+ + Sbjct: 732 IDSKEEILKKLK 743 >gi|257125801|ref|YP_003163915.1| restriction modification system DNA specificity domain protein [Leptotrichia buccalis C-1013-b] gi|257049740|gb|ACV38924.1| restriction modification system DNA specificity domain protein [Leptotrichia buccalis C-1013-b] Length = 195 Score = 39.0 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 25/193 (12%), Positives = 61/193 (31%), Gaps = 16/193 (8%) Query: 29 PIKRFTKLN---TGRTSESGKDIIYIGLEDVESGTGKYLPK-----DGNSRQSDTSTVSI 80 + + +T++S + V +G + + ++ + Sbjct: 2 KLGDNVDIIAPLNVKTADSETGYFLLNPTMVNNGKIETFDYAEVPDRYKNGKNKIADKYF 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSIDVT--QR 136 K +L+ G + + ST + +L+P + + WLL ++ Sbjct: 62 IKKDDVLFQAKGSKIDVVYVDKDYERVLPSTLYFILRPNEKINPKYLQWLLKTELVLLYF 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI----IAETVRIDTLITERIRFI 192 + T+ + I + + +P Q + + I E + L +R Sbjct: 122 EKKYKTMGTVRAVNKGDIVELRVKMPEREVQDEMAKIITSFEDEEYSTMKYLKIKRKYIE 181 Query: 193 ELLKEKKQALVSY 205 E + E Q ++ Sbjct: 182 ERVIENNQVIIDE 194 Score = 37.5 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 55/132 (41%), Gaps = 5/132 (3%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPH-GIDSTYLAWL 336 + +++F+ + D + + ER + ++ Y ++P+ I+ YL WL Sbjct: 54 NKIADKYFIKKDDVLFQAKGSKIDVVYV--DKDYERVLPSTLYFILRPNEKINPKYLQWL 111 Query: 337 MRSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 +++ + F G +++ D+ L V +P + Q ++ +I ++ Sbjct: 112 LKTELVLLYFEKKYKTMGTVRAVNKGDIVELRVKMPEREVQDEMAKIITSFEDEEYSTMK 171 Query: 395 KIEQSIVLLKER 406 ++ ++ER Sbjct: 172 YLKIKRKYIEER 183 >gi|333012194|gb|EGK31576.1| hypothetical protein SFK227_5288 [Shigella flexneri K-227] Length = 74 Score = 39.0 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 17/41 (41%) Query: 354 RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVE 394 L + + V +PP EQ I + IN A + L+ Sbjct: 12 MPKLNSDSFYNIIVAIPPYNEQQAIFDKINSIEAVCNGLIS 52 >gi|227508547|ref|ZP_03938596.1| possible type I site-specific deoxyribonuclease specificity subunit [Lactobacillus brevis subsp. gravesensis ATCC 27305] gi|227191879|gb|EEI71946.1| possible type I site-specific deoxyribonuclease specificity subunit [Lactobacillus brevis subsp. gravesensis ATCC 27305] Length = 196 Score = 39.0 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 20/172 (11%), Positives = 49/172 (28%), Gaps = 15/172 (8%) Query: 239 ALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQI----VDPGEIVFR 294 ++ L N+L L GN+ + + Q+ ++ G+ V Sbjct: 11 DRGHNYPHESNFLESGNVLFLDTGNVKKNGFNFETQKYISDQKDKQLKNGKLNVGDFVLT 70 Query: 295 FIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF------YA 348 + + + I + S +L+ L L Sbjct: 71 SRGTLGNVAYYDKSISQKFPEIRINSAMLILRKESSQHLSNLFLESSLRGKIIDNFMRND 130 Query: 349 MGSGLRQSLKFEDVKRLPVLVPPI-KEQFDITNVINVETARIDVLVEKIEQS 399 + + +D ++ + +P + EQ + + I +L+ + Sbjct: 131 HVGSAQPHITKKDFSKVKLNIPQLWMEQDKVGKI----FQNIFILIAANLRQ 178 >gi|24215294|ref|NP_712775.1| flagellar protein FlbB [Leptospira interrogans serovar Lai str. 56601] gi|45657266|ref|YP_001352.1| flagellar protein B [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|24196391|gb|AAN49793.1| flagellar protein FlbB [Leptospira interrogans serovar Lai str. 56601] gi|45600504|gb|AAS69989.1| flagellar protein B [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 215 Score = 39.0 bits (89), Expect = 1.6, Method: Composition-based stats. Identities = 10/42 (23%), Positives = 16/42 (38%), Gaps = 3/42 (7%) Query: 375 QFDITNVINVETARIDVLVE---KIEQSIVLLKERRSSFIAA 413 Q ++ R L+ K+E + L+E R IA Sbjct: 69 QERFAEELDELEKRKSELIAEKGKLEAEMEKLEEMRKGLIAK 110 >gi|317481751|ref|ZP_07940782.1| type I restriction enzyme [Bifidobacterium sp. 12_1_47BFAA] gi|316916808|gb|EFV38199.1| type I restriction enzyme [Bifidobacterium sp. 12_1_47BFAA] Length = 165 Score = 39.0 bits (89), Expect = 1.7, Method: Composition-based stats. Identities = 13/120 (10%), Positives = 33/120 (27%), Gaps = 12/120 (10%) Query: 25 WKVVPIKRFTKLNTGRTSES-------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 W+ ++ G T I+++ +DV+ + + + + +T Sbjct: 47 WEQRKLENLASFGGGHTPSMADASNYVDGKILWVTSQDVKQHYIENTTTMISEKGA--AT 104 Query: 78 VSIFAKGQILYGKLGPYLR---KAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVT 134 ++++ I+ LR + V+Q D Sbjct: 105 LTLYPSDSIVIVARSGILRHTIPVAKLRKPATVNQDIKVIQTVDSCDSSWLLQYFIASNK 164 >gi|298480706|ref|ZP_06998902.1| type IIS restriction endonuclease [Bacteroides sp. D22] gi|298273140|gb|EFI14705.1| type IIS restriction endonuclease [Bacteroides sp. D22] Length = 1053 Score = 39.0 bits (89), Expect = 1.7, Method: Composition-based stats. Identities = 12/65 (18%), Positives = 25/65 (38%), Gaps = 6/65 (9%) Query: 332 YLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDIT----NVINVE 385 L +L+ + + A G + E ++ LP+ VP + Q I +++ Sbjct: 972 NLYYLLGILNSSMANQLLADQRGGDYHIYPEHIRNLPIPVPQREVQNAIGVIAKEILHRR 1031 Query: 386 TARID 390 +D Sbjct: 1032 EENLD 1036 >gi|153868189|ref|ZP_01998243.1| hypothetical protein BGS_0658 [Beggiatoa sp. SS] gi|152144491|gb|EDN71757.1| hypothetical protein BGS_0658 [Beggiatoa sp. SS] Length = 75 Score = 39.0 bits (89), Expect = 1.7, Method: Composition-based stats. Identities = 9/51 (17%), Positives = 20/51 (39%) Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIE 397 A ++ +L ++ L ++ PP K Q ++ D + +E Sbjct: 12 QANTGAVQTNLTIPVIESLQIICPPPKIQNKFVQKVHQSYTLKDESKDLLE 62 >gi|270601342|ref|ZP_06221556.1| restriction modification enzyme Cj1051c [Haemophilus influenzae HK1212] gi|270318268|gb|EFA29451.1| restriction modification enzyme Cj1051c [Haemophilus influenzae HK1212] Length = 53 Score = 39.0 bits (89), Expect = 1.7, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 25/55 (45%), Gaps = 4/55 (7%) Query: 359 FEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 + L + +P + EQ I N IN I+ + ++E+ + ++ + + + Sbjct: 1 ISFYEDLEISLPDLNEQQSIVNQIN----EIETQISELEKVLENSRQEKKAVLDK 51 >gi|325990090|ref|YP_004249789.1| hypothetical protein Msui07450 [Mycoplasma suis KI3806] gi|323575175|emb|CBZ40838.1| hypothetical protein Msui07450 [Mycoplasma suis] Length = 112 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 38/96 (39%), Gaps = 6/96 (6%) Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKV--FYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 ++ V I L +L++ + FY SG + LK + + L +++P Sbjct: 9 SNNCFVVFDKRIKKFSLLYLLQEAIKINLENFYKEDSGGIKHLKSKKLSELKIIIPD--- 65 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 N I + +E ++++I L+ ++ Sbjct: 66 -NKTLEKFNEICENIQLKIENLQKNIERLEIMKNDL 100 >gi|295090546|emb|CBK76653.1| Type I restriction modification DNA specificity domain. [Clostridium cf. saccharolyticum K10] Length = 196 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 51/155 (32%), Gaps = 11/155 (7%) Query: 29 PIKRFTKLNTGRTSESG-------KDIIYIGLEDVE-SGTGKYLPKDGNSRQSDTSTVSI 80 ++ + + +G I L ++ GT D + I Sbjct: 2 KLQDYASVRSGLVLSRKQSQNSSVYKYPLINLRCIQQDGTIDLNEVDIYEAKEPLKKEYI 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGIC---STQFLVLQPKDVLPELLQGWLLSIDVTQRI 137 KG I+ PY I + G+ + + ++ +LPE L L + + +++ Sbjct: 62 SQKGDIIVRLTAPYTAVLIDSTTSGMVISSNFVVIRVENDCLLPEYLFWLLNTQKIKRQM 121 Query: 138 EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIRE 172 + K + + + + + +Q I + Sbjct: 122 YENATSNMLGAVKAKFLTDFELQVLSVEDQFKIGQ 156 >gi|291530637|emb|CBK96222.1| Type I restriction modification DNA specificity domain [Eubacterium siraeum 70/3] Length = 224 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 57/152 (37%), Gaps = 7/152 (4%) Query: 30 IKRFTKLNTGRTSESGKDIIYIGLEDVESGTG---KYLPKDGNSR---QSDTSTVSIFAK 83 I++ + G S + + V G KY+ + N+ ++ + +K Sbjct: 42 IEQVADIYGGYAFNSKAYVNKGKYKIVTIGNVTGDKYISGNYNTIDRLPNNIQKPQVLSK 101 Query: 84 GQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPE-LLQGWLLSIDVTQRIEAICE 142 G IL G R +I+ + + + + L +D L + + +L + + + + Sbjct: 102 GDILVSLTGNVGRISIVDGDEYLLNQRVAKLGIEDDLTKEYIYQYLSNSSFEKDMINAGQ 161 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKI 174 GA + + I + + P + +K+ Sbjct: 162 GAAQKNIKNQDILSYCIRFPTDQTALENIDKL 193 >gi|256852234|ref|ZP_05557620.1| restriction modification system [Lactobacillus jensenii 27-2-CHN] gi|260661734|ref|ZP_05862645.1| type I restriction modification system [Lactobacillus jensenii 115-3-CHN] gi|282932023|ref|ZP_06337484.1| hypothetical protein HMPREF0886_3167 [Lactobacillus jensenii 208-1] gi|297205600|ref|ZP_06922996.1| hypothetical protein HMPREF0526_10628 [Lactobacillus jensenii JV-V16] gi|256615280|gb|EEU20471.1| restriction modification system [Lactobacillus jensenii 27-2-CHN] gi|260547481|gb|EEX23460.1| type I restriction modification system [Lactobacillus jensenii 115-3-CHN] gi|281303850|gb|EFA95991.1| hypothetical protein HMPREF0886_3167 [Lactobacillus jensenii 208-1] gi|297150178|gb|EFH30475.1| hypothetical protein HMPREF0526_10628 [Lactobacillus jensenii JV-V16] Length = 201 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 59/205 (28%), Gaps = 21/205 (10%) Query: 228 VPDHWEVKPFFALV---TELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQ 284 +P W + K +L S++ + N + G K + + Sbjct: 1 MPSDWNYVSLKDYAEVTPGYSYKGKELSPSHLAMATIKNFDRNGGFNARGFKEINPQKEI 60 Query: 285 IVDPG----EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK--------PHGIDSTY 332 V +++ DL + + +A+ + + + I Sbjct: 61 KVQKYANLYDVLVAHTDLTQNAEIIGNAEPILTCGNYDKIIFSMDLVKVTAKENKISKFL 120 Query: 333 LAWLMRSYDLCKV-FYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVINVETARID 390 LA +M+ + + + L + +K P + +I N +I+ Sbjct: 121 LALIMQGDIMKRHCLTYVNGTTVLHLNKKALKDFEFPFPENPQVISNIANFAEENYKKIN 180 Query: 391 VLVEKIEQSIVLLKERRSSFIAAAV 415 + LL + +S + Sbjct: 181 S----NLRENDLLIKIKSELLNKYF 201 >gi|240047295|ref|YP_002960683.1| putative Type I restriction-modification enzyme s subun [Mycoplasma conjunctivae HRC/581] gi|239984867|emb|CAT04860.1| PUTATIVE Type I restriction-modification enzyme s subun [Mycoplasma conjunctivae] Length = 220 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 36/99 (36%), Gaps = 10/99 (10%) Query: 315 IITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKE 374 + ++A + + +L+ + + K V+ V VP ++E Sbjct: 90 VNSTALKILTSKKRYDPFFCYLLLNKEPKKQQ------GHMRHYISLVQHNKVCVPMLEE 143 Query: 375 QFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +N+I I+ ++ I+ I L+ ++ + Sbjct: 144 ----SNLIKNLFFYINKIIFSIQAKITKLESIKNILLNK 178 >gi|227530270|ref|ZP_03960319.1| possible restriction endonuclease S subunit [Lactobacillus vaginalis ATCC 49540] gi|227349824|gb|EEJ40115.1| possible restriction endonuclease S subunit [Lactobacillus vaginalis ATCC 49540] Length = 155 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 14/135 (10%), Positives = 38/135 (28%), Gaps = 5/135 (3%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 ++ + PE+ + + + N L+ + ++V Sbjct: 25 VEDGKYPFFTTSPETLRINNFAFDQDAILLGGNNANGVFQLKRYTGKFNAYQRTYVISVV 84 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-PIKEQFDITNVIN 383 I + + L ++ + + ++ L + VP E + Sbjct: 85 KENIINNDYLYYALMPKLVELQNKSLGTATKFITKRILENLLIKVPNNYNEMERRATYLR 144 Query: 384 VETARIDVLVEKIEQ 398 ID ++ +Q Sbjct: 145 T----IDNKIQLNKQ 155 >gi|331017717|gb|EGH97773.1| Type I restriction enzyme (modification subunit) [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 571 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 53/173 (30%), Gaps = 8/173 (4%) Query: 233 EVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIV 292 + + K + E + +I+ ++ + ++V Sbjct: 390 HFEIIRPRQHHMGLKGVPVEEVQAQDIPSFGLIRHATMLSVHDLDGPNSFDYFLKAKDVV 449 Query: 293 FRFIDLQNDKRSLRSAQVMERGII----TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 + A + G + A + + + L MRS Sbjct: 450 ICIKGAIGRVGCISKAPLPGPGGWVSGQSVAVLRSRGTDYAAHALMMYMRSPKGQAALRR 509 Query: 349 MGSGL-RQSLKFEDVKRLPVLVPPIKEQFDIT-NVINVETARIDVLVEKIEQS 399 + G +++ + +K + + Q D+ V+ ET ID +E+++Q Sbjct: 510 LVVGTSAPTIQAKALKGFQIPILT-AVQSDMALEVLEAETD-IDYQIEQLQQK 560 >gi|198275220|ref|ZP_03207751.1| hypothetical protein BACPLE_01379 [Bacteroides plebeius DSM 17135] gi|198271803|gb|EDY96073.1| hypothetical protein BACPLE_01379 [Bacteroides plebeius DSM 17135] Length = 1180 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 28/281 (9%), Positives = 72/281 (25%), Gaps = 9/281 (3%) Query: 112 LVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIR 171 ++ + I + + + + + I Sbjct: 805 ILNNSSRDNNQAEYYTPTIRRNYFYNTIISFSNNTNILNLIENHDKYIRLKDNEIIQGIV 864 Query: 172 EKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDH 231 R I E +K V + + + + + + Sbjct: 865 PNPDVVNSRNIKYIPEYEIISNNIKIGDGVFVVNHNYFSSLKECEKQYIKV--LYEPTNC 922 Query: 232 WEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETR----NMGLKPESYETYQIVD 287 + + ++ + + + + N + + Y + + Sbjct: 923 HKYFLDNDITKDIIYITKTNYKGDAPYILQHLWKYRFIMEQRRENKNGRLDYYHLHWPRE 982 Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMER-GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 L K + + + A ++ I+ YL L+ S + Sbjct: 983 ESFFKQSEKILVPRKCAFPIFAYTNKETYVMMAINIIQTKRINLKYLTGLLNSKLIEFWL 1042 Query: 347 YAMG--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVE 385 G G L E ++++P+ VP I+ Q I N+++ Sbjct: 1043 KNKGKMQGANYQLDKEPLQQIPIAVPSIEIQTIIANLVDTI 1083 >gi|86142928|ref|ZP_01061350.1| type IV site-specific deoxyribonuclease Eco57I related protein [Leeuwenhoekiella blandensis MED217] gi|85830373|gb|EAQ48832.1| type IV site-specific deoxyribonuclease Eco57I related protein [Leeuwenhoekiella blandensis MED217] Length = 1026 Score = 38.6 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 59/191 (30%), Gaps = 14/191 (7%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPG 289 + P L + K E + + + K + + Sbjct: 799 QQYYGNPKNRLWIIYTDSSFKDEEKILPFPNIKGHLDKFLDVFTSVNKPYGLHRSRDEKY 858 Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 + L+ R + +M +K I+ YL ++ S + Sbjct: 859 FKGEKIFSLRKCSVRPRFTYTDFDAYVNRTFMVIKTDRINQKYLTGILNSNLIAFWLKYK 918 Query: 350 G--SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVIN------VETARIDVLVEKIE---- 397 G G + ++ LP++ P + Q I ++++ +++ L++K + Sbjct: 919 GKMQGNNYQIDKTPLENLPLINPNKEVQEKIADLVSSIISNTQKSSEYQELLDKAKTDNN 978 Query: 398 --QSIVLLKER 406 + I L KE Sbjct: 979 FDREIQLTKEL 989 >gi|270156972|ref|ZP_06185629.1| translocase-like protein [Legionella longbeachae D-4968] gi|269988997|gb|EEZ95251.1| translocase-like protein [Legionella longbeachae D-4968] Length = 1622 Score = 38.6 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 29/82 (35%), Gaps = 24/82 (29%) Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS----------IVLLKERRS- 408 ++ + + I EQ I+ I ++I + E I + I +ERR Sbjct: 1394 TNLDEFFIAL--INEQSRISKNIEDIRSKIHNIEELIHKQENEIFETGSRIKAAQERRQQ 1451 Query: 409 ---------SFIAAAV--TGQI 419 S ++ V G+I Sbjct: 1452 PDCGYIESASLMSQVVYHQGKI 1473 >gi|313664978|ref|YP_004046849.1| type I restriction modification DNA specificity domain protein [Mycoplasma leachii PG50] gi|312949980|gb|ADR24576.1| type I restriction modification DNA specificity domain protein [Mycoplasma leachii PG50] Length = 171 Score = 38.6 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 38/146 (26%), Gaps = 9/146 (6%) Query: 250 KLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQ 309 I L + K + G++ Y PG + Sbjct: 28 CSINCGSLDANAMEHNGKYDFFTSGVEIYKINKYAFEGPGISIAGNGANMGYLH-----L 82 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + ++ ++ +L + + L K G L E + + Sbjct: 83 TDGKYNAYQRTYILQNIEVNRMFLYCTLLNNFLSKCEKLTKFGGVPYLVLEQIYNHMIFR 142 Query: 370 PPIKEQFDITNVINVETARIDVLVEK 395 P EQ I + + +D L+ Sbjct: 143 PTYNEQTKI----SSLFSNLDSLITL 164 >gi|229548136|ref|ZP_04436861.1| type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecalis ATCC 29200] gi|229306737|gb|EEN72733.1| type I site-specific deoxyribonuclease specificity subunit [Enterococcus faecalis ATCC 29200] Length = 205 Score = 38.6 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 58/183 (31%), Gaps = 15/183 (8%) Query: 24 HWKVVPIKRFTKL-NTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI-- 80 +W++ ++ G+ + +E++ +G+ +YL + + T ++ Sbjct: 34 NWELCKLENVIDKQIKGK----------VKVENLCNGSVEYLDANRLNGGKPIYTKALPD 83 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 ++ I+ G K F G+ + Q K+ + +D I Sbjct: 84 VSERDIIILWDGSKAGKVYY-GFKGVLGSTLKAYQLKECANS-QFIYQQLLDNQNNIYNN 141 Query: 141 CEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQ 200 + H P+ + EQ + + + RI I L K Q Sbjct: 142 YRTPNIPHVVKNFSSIFPIWMTSFEEQSQMADILSNLDNRIILQQNLTDTMISLKKSYLQ 201 Query: 201 ALV 203 + Sbjct: 202 NMF 204 Score = 37.1 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 11/123 (8%), Positives = 41/123 (33%), Gaps = 4/123 (3%) Query: 290 EIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAM 349 ++ R I + D +G++ S A + ++ + + ++ Sbjct: 83 DVSERDIIILWDGSKAGKVYYGFKGVLGSTLKAYQLKECANSQFIYQQLLDNQNNIYNNY 142 Query: 350 GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSS 409 + + P+ + +EQ + +++ + +D + + + + S Sbjct: 143 RTPNIPHVVKNFSSIFPIWMTSFEEQSQMADIL----SNLDNRIILQQNLTDTMISLKKS 198 Query: 410 FIA 412 ++ Sbjct: 199 YLQ 201 >gi|326314827|ref|YP_004232499.1| hypothetical protein Acav_0004 [Acidovorax avenae subsp. avenae ATCC 19860] gi|323371663|gb|ADX43932.1| hypothetical protein Acav_0004 [Acidovorax avenae subsp. avenae ATCC 19860] Length = 195 Score = 38.6 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 16/99 (16%), Positives = 33/99 (33%), Gaps = 2/99 (2%) Query: 284 QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLC 343 + PG++V K L + + + +D+ YL W + Sbjct: 61 HCLQPGDVVIPSRG-DYYKAWLFNGASEPVLPVGQLNVIRPAVDLDAGYLVWHLNLPVTQ 119 Query: 344 -KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 K+ + ++L + L V P + +Q I + Sbjct: 120 AKLSLLLTGTTIKALTKTALLSLEVDTPELPQQQRIAEI 158 >gi|83317742|ref|XP_731294.1| phosphatidylinositol 3-kinase vps34 [Plasmodium yoelii yoelii str. 17XNL] gi|23491282|gb|EAA22859.1| phosphatidylinositol 3-kinase vps34-like [Plasmodium yoelii yoelii] Length = 1686 Score = 38.6 bits (88), Expect = 2.0, Method: Composition-based stats. Identities = 19/297 (6%), Positives = 67/297 (22%), Gaps = 8/297 (2%) Query: 27 VVPIKRFTKLNTGRTSESGKDIIY----IGLEDV-ESGTGKYLPKDGNSRQSDTSTVSIF 81 + K K G + +Y + +V + K L N + Sbjct: 446 WIKNKNLLKWKNGYSYIKKYSYLYDLHNGRISNVGNRNSFKLLKGVINIYKKFKIFKEKK 505 Query: 82 AKGQILYGKLGPYLRKAIIADFD---GICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 KG +++ + K + + + Sbjct: 506 IKGYLIFNCVSFNKPKICYNEKKKKLITFHENIFNDIENKEIGSPNFYHNTGSNYDWXFN 565 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 + S+ + + ++ + + + + Sbjct: 566 SFDYVGDTSNINNLSFNKFVKSVAKGQKKNKHGNLFLHNFIFDNKKDNIINEKNKNYSHN 625 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + ++ G ++ K + ++ + V + + S ++ Sbjct: 626 RFKIIESKKYCGKIKNILYKRNSVDVLRNVNTNHRHTEKKINDIFDHISKYTNRISKNIN 685 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 +S N + + + P + ++ + ++ + Sbjct: 686 ISNINRYDDYPFNFFSKEKCEKKKISVTTPPIDEIKTLNYVLSIPLTKINDDGKKCL 742 >gi|110004973|emb|CAK99304.1| hypothetical transmembrane protein [Spiroplasma citri] Length = 213 Score = 38.6 bits (88), Expect = 2.1, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 46/158 (29%), Gaps = 4/158 (2%) Query: 218 KDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKP 277 K ++ + E R N + + L+ Sbjct: 50 KYITLQEISKKISDGEHSHIKRNNKSGVRYLYGRNIKQGTIKGNINFDSISDYSYISLED 109 Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI---ITSAYMAVKPHGIDSTYLA 334 + + +++ + + + + + GI I + I YL Sbjct: 110 YTNFKRTHLIDNDVLISILGIIGNSAIYKKEYLGIIGIPRHIGRITLLNTFAPISPEYLV 169 Query: 335 WLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPP 371 R+ Y++ +G ++Q L +++K + +P Sbjct: 170 AYFRTKLAKHQLYSLTTGNIQQLLSLKNLKNYEIPIPN 207 >gi|91203366|emb|CAJ71019.1| unknown protein [Candidatus Kuenenia stuttgartiensis] Length = 139 Score = 38.6 bits (88), Expect = 2.2, Method: Composition-based stats. Identities = 15/104 (14%), Positives = 38/104 (36%), Gaps = 9/104 (8%) Query: 23 KHWKVVPIKRFTKLNTG--RTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSI 80 +W + I + +I ED+++G D + + Sbjct: 7 SNWNEIAIGFIADEINEEVLSPAKSGCERFIRPEDLDAGQLFIKNFRS---PEDIGSGKL 63 Query: 81 FAKGQILYGKLG----PYLRKAIIADFDGICSTQFLVLQPKDVL 120 +G I++ + + R++ + FD +CS + V++ + + Sbjct: 64 CYEGDIIFARRNVSIFQFKRRSSVLTFDAVCSDELTVIRENEKI 107 >gi|332358994|gb|EGJ36815.1| hypothetical protein HMPREF9380_1662 [Streptococcus sanguinis SK49] Length = 393 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 44/386 (11%), Positives = 98/386 (25%), Gaps = 34/386 (8%) Query: 26 KVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY-LPKDGNSRQSDTSTVSIFAKG 84 K V + G + V TG + + + Sbjct: 7 KRVTLSELFTNKRGNS--------RYTKAYVNRNTGDFEVYTGSTKTSFGFIDTYEYETP 58 Query: 85 QILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGA 144 + Y G Y + +L K L + Sbjct: 59 HLTYTTDGEYAGTLDVLQGKYNVGGHRAILISKVDNLSLSYCKYV---FQSIFYNSVRRG 115 Query: 145 TMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVS 204 + W I +I + IP + +K + +I R + + +++ Sbjct: 116 DVPSLAWSQIKDIRVSIPVNEDGEFDLKKQEEIVRK-FEIIEARKAELSEKIQTIKSVEV 174 Query: 205 YIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNI 264 I++ + +K + + + + + + V E + S+ SYG + Sbjct: 175 DIISGDNDKTTSIKVAELFDLTISTNSSKFTK--TFVKENSGDIPVYGASSDNLPSYGYV 232 Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 + K E Y + + L + + Sbjct: 233 KDNAVIVDKDGKREFPVRYF---ENCLTYNIDGLAGYIFYHEGRFSLSEKVRPLVIKEEY 289 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS----LKFEDVKRLPVLVP-------PIK 373 ++ YL ++ ++ L +K L V++P ++ Sbjct: 290 ASKVNPLYLKQVLE-PIFRSHVKGRKGENGKNEYTKLNTSMIKNLEVVLPLTSSGEIDLE 348 Query: 374 EQFDITNVINVETARIDVLVEKIEQS 399 +Q I + I + IE+ Sbjct: 349 KQNQIV----KNSQTILEMKNNIEKQ 370 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 47/151 (31%), Gaps = 15/151 (9%) Query: 261 YGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY 320 Y N G S+ + + +L Q A Sbjct: 28 YVNRNTGDFEVYTGSTKTSFGFIDTYEYETPHLTYTTDGEYAGTLDVLQGKYNVGGHRAI 87 Query: 321 MAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP-------PIK 373 + K + +Y ++ +S ++ G SL + +K + V +P +K Sbjct: 88 LISKVDNLSLSYCKYVFQSIFY----NSVRRGDVPSLAWSQIKDIRVSIPVNEDGEFDLK 143 Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLK 404 +Q +I + I+ ++ + I +K Sbjct: 144 KQEEIVR----KFEIIEARKAELSEKIQTIK 170 >gi|212691982|ref|ZP_03300110.1| hypothetical protein BACDOR_01477 [Bacteroides dorei DSM 17855] gi|212665374|gb|EEB25946.1| hypothetical protein BACDOR_01477 [Bacteroides dorei DSM 17855] Length = 163 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 14/119 (11%), Positives = 33/119 (27%), Gaps = 6/119 (5%) Query: 297 DLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS 356 L A+ + I A++ T + Y K S Sbjct: 50 ILTVRAPVGIVAENKMKVCIGRGVCALRNKSAMPTMYIYYALDYFSYKWKQIEQGSTFTS 109 Query: 357 LKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 + +DVK + + +++DVL++ +++ ++ Sbjct: 110 INGDDVKNFTIPLVDD------VEYSCALLSKVDVLIKCSIDLHSNYIKQKQYLLSQLF 162 >gi|154487139|ref|ZP_02028546.1| hypothetical protein BIFADO_00979 [Bifidobacterium adolescentis L2-32] gi|154085002|gb|EDN84047.1| hypothetical protein BIFADO_00979 [Bifidobacterium adolescentis L2-32] Length = 72 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 9/44 (20%), Positives = 19/44 (43%), Gaps = 4/44 (9%) Query: 356 SLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQS 399 S+ E +K + + + EQ I + R+D L+ ++ Sbjct: 13 SIDIEGMKTIFIPWTNLAEQRRIGAFFD----RLDSLITLHQRK 52 >gi|227505161|ref|ZP_03935210.1| conserved hypothetical protein [Corynebacterium striatum ATCC 6940] gi|227198243|gb|EEI78291.1| conserved hypothetical protein [Corynebacterium striatum ATCC 6940] Length = 92 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 10/83 (12%), Positives = 26/83 (31%) Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 E A A+ + + ++ G + +L V+ + P Sbjct: 6 EPAATNQACAAICIEDAVDADFLFYVLRNSYEQLRSLGRGGNQDNLNLSLVRDFRIPWPA 65 Query: 372 IKEQFDITNVINVETARIDVLVE 394 ++ + +N T + +L + Sbjct: 66 VEIRQRFVAQMNEATRILTLLEK 88 >gi|210630729|ref|ZP_03296553.1| hypothetical protein COLSTE_00438 [Collinsella stercoris DSM 13279] gi|210160325|gb|EEA91296.1| hypothetical protein COLSTE_00438 [Collinsella stercoris DSM 13279] Length = 66 Score = 38.2 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 22/55 (40%), Gaps = 4/55 (7%) Query: 361 DVKRLPVLVPPIKEQFDITNVINVET----ARIDVLVEKIEQSIVLLKERRSSFI 411 D+ R+ + P I Q + +V++ + D L +IE + R + Sbjct: 2 DLARVEIPAPSIATQRKVVDVLDRFDTPTASLTDCLPAEIEARNQQYEYYRDRLL 56 >gi|257125564|ref|YP_003163678.1| restriction modification system DNA specificity domain protein [Leptotrichia buccalis C-1013-b] gi|257049503|gb|ACV38687.1| restriction modification system DNA specificity domain protein [Leptotrichia buccalis C-1013-b] Length = 195 Score = 38.2 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 53/171 (30%), Gaps = 12/171 (7%) Query: 29 PIKRFTKLN---TGRTSESGKDIIYIGLEDVESGTGKYL-----PKDGNSRQSDTSTVSI 80 + + +T++ + + V +G + P+ + ++ + Sbjct: 2 KLGDNVDIIAPLNVKTADIKTGYLLLNPTMVNNGKIENFDNAEVPERYKNGKNKIADKYF 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSIDVTQRI- 137 K +L+ G + + +T + +L+ + + WLL ++ Sbjct: 62 VKKNDVLFQAKGSKIEVVYVDQDYENVLPATLYFILRANEKINPKYLQWLLKTELLLLYF 121 Query: 138 -EAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 + + + + I + + +P Q + E I + I Sbjct: 122 EKKYKTMSAVRAVNKSDIVELDIDLPEREVQDKMVEIITSFENEEKNTIDY 172 >gi|118343675|ref|NP_001071658.1| transcription factor protein [Ciona intestinalis] gi|70568924|dbj|BAE06318.1| transcription factor protein [Ciona intestinalis] Length = 273 Score = 38.2 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 7/46 (15%) Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTG 417 + EQ + E+ ++ L K+++ I L+E R + + G Sbjct: 168 LSEQ------LQEESEHLENLNAKLKREIEKLQEERQKLMH-LLNG 206 >gi|166030474|ref|ZP_02233303.1| hypothetical protein DORFOR_00135 [Dorea formicigenerans ATCC 27755] gi|166029726|gb|EDR48483.1| hypothetical protein DORFOR_00135 [Dorea formicigenerans ATCC 27755] Length = 792 Score = 38.2 bits (87), Expect = 2.5, Method: Composition-based stats. Identities = 10/42 (23%), Positives = 18/42 (42%), Gaps = 4/42 (9%) Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 KEQ +I ++ L ++ Q L+E+R + A Sbjct: 534 KEQEEIAAY----RRELEALKQETAQKKEKLEEQRDRILREA 571 >gi|229015567|ref|ZP_04172562.1| Type I restriction enzyme, specificity subunit [Bacillus cereus AH1273] gi|229027299|ref|ZP_04183562.1| Type I restriction enzyme, specificity subunit [Bacillus cereus AH1272] gi|228733990|gb|EEL84721.1| Type I restriction enzyme, specificity subunit [Bacillus cereus AH1272] gi|228745714|gb|EEL95721.1| Type I restriction enzyme, specificity subunit [Bacillus cereus AH1273] Length = 204 Score = 38.2 bits (87), Expect = 2.6, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 52/151 (34%), Gaps = 24/151 (15%) Query: 265 IQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVK 324 I + + R++ + E + + D +V + K Q + + Sbjct: 48 ISQEKDRSIYVNKEKIKQEVLTDTESLVLHTL---TQKVVWFPPQYQGLLLTNNFMKISF 104 Query: 325 PHGIDSTYLAWLMR-SYDLCKVFYAMG-SGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 +D ++ WL + K + SLK +VK + ++P +++Q Sbjct: 105 FEKVDVHFMEWLFNEHPSIQKQIALFTEGSIISSLKLSNVKEIEFVLPNVEKQ------- 157 Query: 383 NVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 KI I LK+R+++ + Sbjct: 158 ------------KILGKIAQLKKRKTALLKE 176 >gi|317481755|ref|ZP_07940785.1| type I restriction system specificity protein [Bifidobacterium sp. 12_1_47BFAA] gi|316916803|gb|EFV38195.1| type I restriction system specificity protein [Bifidobacterium sp. 12_1_47BFAA] Length = 68 Score = 38.2 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 23/52 (44%), Gaps = 5/52 (9%) Query: 356 SLKFEDVKRLPVLVP-PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 ++ +D V +P EQ I +R+D L+ ++ + +++R Sbjct: 16 NIAPDDFFDTMVSLPESQAEQQTIGAF----FSRLDSLITLHQRKRLSIRQR 63 >gi|167750494|ref|ZP_02422621.1| hypothetical protein EUBSIR_01470 [Eubacterium siraeum DSM 15702] gi|167656420|gb|EDS00550.1| hypothetical protein EUBSIR_01470 [Eubacterium siraeum DSM 15702] Length = 667 Score = 38.2 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 29/315 (9%), Positives = 66/315 (20%), Gaps = 9/315 (2%) Query: 29 PIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQILY 88 +K +G T + ++ +G+ +S T + S + G I Sbjct: 200 TLKELYTYKSGSTPSTD-NVKLLGISSKKSNTVSDTNAFVVTDVYKPSNYKLTKAG-IRI 257 Query: 89 GK-LGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL-----LQGWLLSIDVTQRIEAICE 142 + G Y T + + + G IE Sbjct: 258 RREDGTYDNGWSEFHNTASTWTGYDYMYISYDMNSEVKITLRHGTKYYYKFYAVIEGKEY 317 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 + G + + + T Sbjct: 318 WSPEQSFTTTGSHSYGSWYTKTSATCTSGGTEERKCSCGATESRSTSALGHNYGSTYFEA 377 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 L K+ + + ++ + + Sbjct: 378 DHPHKYAHLCQRCGYKEFTGGNLAIYEKCDICYNENLPSKPCLNISSNGFKESDNVSFTW 437 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + K N+ ++ S + Y+ V V K R+ + Y + Sbjct: 438 DPTDKTTHYNLTVEVLSGDEYKTVCRQTYVNSGFQATFGKGQYRAVLDSYNSNMFHQYTS 497 Query: 323 VKPHG-IDSTYLAWL 336 +S+ + Sbjct: 498 DWRDWVHNSSDYVYF 512 >gi|218283653|ref|ZP_03489615.1| hypothetical protein EUBIFOR_02209 [Eubacterium biforme DSM 3989] gi|218215713|gb|EEC89251.1| hypothetical protein EUBIFOR_02209 [Eubacterium biforme DSM 3989] Length = 1127 Score = 38.2 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 5/44 (11%), Positives = 18/44 (40%), Gaps = 4/44 (9%) Query: 371 PIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 ++ Q I + + + A + E+ ++ + E++ + Sbjct: 121 SLEAQKAIVSELEKQVANL----EENQKKLGQANEQKKALQTQL 160 >gi|283954613|ref|ZP_06372131.1| hypothetical protein C414_000240018 [Campylobacter jejuni subsp. jejuni 414] gi|283793805|gb|EFC32556.1| hypothetical protein C414_000240018 [Campylobacter jejuni subsp. jejuni 414] Length = 226 Score = 38.2 bits (87), Expect = 2.8, Method: Composition-based stats. Identities = 13/129 (10%), Positives = 44/129 (34%), Gaps = 10/129 (7%) Query: 287 DPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVF 346 + V ID + + ++ Y+++++ + F Sbjct: 106 YDSDSVLWGIDGDWIVGFMPKNRKFYPTDHCGVLRVNDAKL-NAKYISFILNEAGKKQRF 164 Query: 347 YAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKER 406 + + ++ L V +P + Q I ++I+ +I+ + + + + L++ Sbjct: 165 SR-----KLRASIDRIRALRVKLPSLDFQDQIVDIID----KIERKINEDKIELSRLEKE 215 Query: 407 RSSFIAAAV 415 + + + Sbjct: 216 KEKILHKYL 224 >gi|29350033|ref|NP_813536.1| DNA modification methylase BstVI [Bacteroides thetaiotaomicron VPI-5482] gi|29341945|gb|AAO79730.1| DNA modification methylase BstVI [Bacteroides thetaiotaomicron VPI-5482] Length = 418 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 13/94 (13%), Positives = 30/94 (31%), Gaps = 4/94 (4%) Query: 316 ITSAYMAVKPHGIDSTYLAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIK 373 + + L +L+ + + G + E ++ LP+ VP + Sbjct: 321 FSKHNREAMESLSEKVDLYYLLGILNSSMADQLLTDQRGGDYHIYPEHIRNLPIPVPQRE 380 Query: 374 EQFDITNVINV--ETARIDVLVEKIEQSIVLLKE 405 Q I + + ++E+ + L E Sbjct: 381 IQNAIGEIAKQILLIRETNTDYSELEEQLNNLVE 414 >gi|291461253|ref|ZP_06027914.2| type I restriction modification enzyme protein S [Fusobacterium periodonticum ATCC 33693] gi|291377990|gb|EFE85508.1| type I restriction modification enzyme protein S [Fusobacterium periodonticum ATCC 33693] Length = 77 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 29/71 (40%), Gaps = 5/71 (7%) Query: 337 MRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR---IDV 391 M S + K+ Y + ++ ++++ +++PPI+ Q I I Sbjct: 1 MNSEFMKKLLYNKAKNIVGMANINAKELEDFSIILPPIELQNKFAERIEKIEKLKFIISA 60 Query: 392 LVEKIEQSIVL 402 ++ K +SI Sbjct: 61 IILKPYKSIKK 71 >gi|261867040|ref|YP_003254962.1| restriction endonuclease S [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412372|gb|ACX81743.1| restriction endonuclease S [Aggregatibacter actinomycetemcomitans D11S-1] Length = 64 Score = 37.9 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 11/71 (15%), Positives = 28/71 (39%), Gaps = 12/71 (16%) Query: 358 KFEDVKRLPVLVPPIKEQFDITNVINVETARI----DVLVEKIEQSIVLLKERRSSFIAA 413 +D++ L ++VPP + + + ++ + E+ L + R + Sbjct: 2 YPKDIEGLKIIVPP--------DFLLKRFSEFVENWNLKIVNSEKQNHQLTQLRDFLLPM 53 Query: 414 AVTGQIDLRGE 424 + GQ+ + E Sbjct: 54 LMNGQVAVAEE 64 >gi|55741948|ref|NP_001006729.1| BCL2-associated athanogene 2 [Xenopus (Silurana) tropicalis] gi|49523043|gb|AAH75476.1| BCL2-associated athanogene 2 [Xenopus (Silurana) tropicalis] Length = 213 Score = 37.9 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 13/45 (28%), Positives = 20/45 (44%), Gaps = 8/45 (17%) Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRS----SFIA 412 I++Q I + ID E+SI LL+++R S I Sbjct: 169 IEDQKRIKRRLETLIRNIDN----SEKSITLLEQQRQKSAFSLIH 209 >gi|260889858|ref|ZP_05901121.1| putative type I restriction modification DNA specificity domain protein [Leptotrichia hofstadii F0254] gi|260860464|gb|EEX74964.1| putative type I restriction modification DNA specificity domain protein [Leptotrichia hofstadii F0254] Length = 195 Score = 37.9 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 47/131 (35%), Gaps = 3/131 (2%) Query: 278 ESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLM 337 + +++F+ + D + + T ++ I+ YL WL+ Sbjct: 54 NKIADKYFIKKDDVLFQAKGSKIDVVYV-DKDYEKVLPSTLYFILRPNEKINPKYLQWLL 112 Query: 338 RSYDLCKVFYAM--GSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEK 395 ++ + F G +++ D+ L V +P K Q ++ +I ++ Sbjct: 113 KTELVLLYFEKKYKTMGTVRAVNKGDIVDLNVKIPERKIQDEMAKIITSFEEEEYSTMKY 172 Query: 396 IEQSIVLLKER 406 + ++ER Sbjct: 173 LNIKRKYIEER 183 Score = 37.9 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 62/193 (32%), Gaps = 16/193 (8%) Query: 29 PIKRFTKLN---TGRTSESGKDIIYIGLEDVESGTGKYLPK-----DGNSRQSDTSTVSI 80 + + +T++S + V +G + + ++ + Sbjct: 2 KLGDNVDIIAPLNVKTADSETGYFLLNPTMVNNGKIETFDYAEVPDRYKNGKNKIADKYF 61 Query: 81 FAKGQILYGKLGPYLRKAIIADFDGIC--STQFLVLQPKDVLPELLQGWLLSIDVT--QR 136 K +L+ G + + ST + +L+P + + WLL ++ Sbjct: 62 IKKDDVLFQAKGSKIDVVYVDKDYEKVLPSTLYFILRPNEKINPKYLQWLLKTELVLLYF 121 Query: 137 IEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKI----IAETVRIDTLITERIRFI 192 + T+ + I ++ + IP Q + + I E + L +R Sbjct: 122 EKKYKTMGTVRAVNKGDIVDLNVKIPERKIQDEMAKIITSFEEEEYSTMKYLNIKRKYIE 181 Query: 193 ELLKEKKQALVSY 205 E + E Q ++ Sbjct: 182 ERVIENNQVIIDE 194 >gi|296110697|ref|YP_003621078.1| hypothetical protein LKI_02830 [Leuconostoc kimchii IMSNU 11154] gi|295832228|gb|ADG40109.1| hypothetical protein LKI_02830 [Leuconostoc kimchii IMSNU 11154] Length = 63 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 9/40 (22%), Positives = 15/40 (37%), Gaps = 4/40 (10%) Query: 361 DVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI 400 + + V VP EQ I +D L+ E+ + Sbjct: 5 QISSIKVKVPDKDEQTKIGAF----FKILDQLITVNEREL 40 >gi|52082598|ref|YP_081389.1| hypothetical protein BL02383 [Bacillus licheniformis ATCC 14580] gi|52787995|ref|YP_093824.1| hypothetical protein BLi04319 [Bacillus licheniformis ATCC 14580] gi|52005809|gb|AAU25751.1| hypothetical protein BL02383 [Bacillus licheniformis ATCC 14580] gi|52350497|gb|AAU43131.1| hypothetical protein BLi04319 [Bacillus licheniformis ATCC 14580] Length = 198 Score = 37.9 bits (86), Expect = 3.2, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 58/160 (36%), Gaps = 16/160 (10%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAY--MAVK 324 E + + + G+++ R L S+ + ++ S + + V Sbjct: 45 NDEPFEVFHSNDLLNNQHFTEAGDVLIR---LNYPHTSVYIDETKSGLLVPSYFAIIKVD 101 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVLVPPIKEQFDITN--V 381 S Y+AW + + + K +G R S + +P+ PI +Q + Sbjct: 102 QSKFISEYVAWYLNTDSVKKELERSQAGTRIPSTNKSALNSIPIEDIPIFKQQAVIKLWR 161 Query: 382 INVETARI-DVLVEKIEQSIVLLKERRSSFIAAAVTGQID 420 ++ + + + L+E+ E+ ++ V G+I Sbjct: 162 LHQQEKTLYNRLIEEKEK-------WFNAITKQIVQGEIR 194 >gi|154503470|ref|ZP_02040530.1| hypothetical protein RUMGNA_01294 [Ruminococcus gnavus ATCC 29149] gi|153795570|gb|EDN77990.1| hypothetical protein RUMGNA_01294 [Ruminococcus gnavus ATCC 29149] Length = 791 Score = 37.9 bits (86), Expect = 3.3, Method: Composition-based stats. Identities = 11/42 (26%), Positives = 19/42 (45%), Gaps = 4/42 (9%) Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAA 414 KEQ +I I+ L + +Q ++E+R +A A Sbjct: 533 KEQEEIAAY----KKEIEALKSQAQQKQERIEEQRERILAEA 570 >gi|163802344|ref|ZP_02196238.1| cryptic beta-D-galactosidase, alpha subunit [Vibrio sp. AND4] gi|159173873|gb|EDP58687.1| cryptic beta-D-galactosidase, alpha subunit [Vibrio sp. AND4] Length = 1032 Score = 37.9 bits (86), Expect = 3.4, Method: Composition-based stats. Identities = 32/306 (10%), Positives = 73/306 (23%), Gaps = 15/306 (4%) Query: 84 GQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G + + A + D I + D + + Sbjct: 392 GLFVMAETDVETHGFANVGDLSRITNDAAWESVFVDRAERHVHAQKNHPSIIMWSLGNES 451 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + + + +++ + + F E EK + + Sbjct: 452 GYGCNIRAMYDATKAIDNTRLVHYEEDRDAEVVDVISTMYSRAQLMNHFGEHPHEKPRII 511 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 Y G P + + + V + ++ E YG Sbjct: 512 CEYAHAMGNGPGGLTEYQNVFYAHDHIQGHYVWEWCDHGVLA--RDENGQEFYKYGGDYG 569 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + GL + + V + +Q + + V + T+ + Sbjct: 570 DYPNNYNFCMDGLIYPDQTPGPGLKEYKQVIAPVKIQAVEGKTNTFTVENKLWFTN--LN 627 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 D +RS A S + + + +P + E+ N Sbjct: 628 DFTITADVRAEGETLRSVQFKVEELAANSA----------REIIINLPELDEREAFINFT 677 Query: 383 NVETAR 388 + +R Sbjct: 678 VRKDSR 683 >gi|269960474|ref|ZP_06174846.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269834551|gb|EEZ88638.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 1018 Score = 37.9 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 32/306 (10%), Positives = 73/306 (23%), Gaps = 15/306 (4%) Query: 84 GQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G + + A + D I + D + + Sbjct: 378 GLFVMAETDVETHGFANVGDLSRITNDAAWESVFVDRAERHVHAQKNHPSIIMWSLGNES 437 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + + + +++ + + F E EK + + Sbjct: 438 GYGCNIRAMYDATKAIDDTRLVHYEEDRDAEVVDVISTMYSRAQLMNHFGEHPHEKPRII 497 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 Y G P + + + V + ++ E YG Sbjct: 498 CEYAHAMGNGPGGLTEYQNVFYAHDHIQGHYVWEWCDHGVLA--RDENGQEFYKYGGDYG 555 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + GL + + V + +Q + + V + T+ + Sbjct: 556 DYPNNYNFCMDGLIYPDQTPGPGLKEYKQVIAPVKIQAVEGKTNTFTVENKLWFTN--LD 613 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 D +RS A S + + + +P + E+ N Sbjct: 614 DYTITADIRAEGETLRSVQFKVEELAANSA----------REITINLPELDEREAFINFT 663 Query: 383 NVETAR 388 + +R Sbjct: 664 VRKDSR 669 >gi|167769853|ref|ZP_02441906.1| hypothetical protein ANACOL_01187 [Anaerotruncus colihominis DSM 17241] gi|167668214|gb|EDS12344.1| hypothetical protein ANACOL_01187 [Anaerotruncus colihominis DSM 17241] Length = 238 Score = 37.9 bits (86), Expect = 3.5, Method: Composition-based stats. Identities = 13/137 (9%), Positives = 39/137 (28%), Gaps = 13/137 (9%) Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 +S + + +++ + + + + + + P + Sbjct: 93 YISEDDYDSIIEARKLQKNDVLLTMDGGTSIGKPVLFNLDGSYTVDSHIPILRNPKISEK 152 Query: 331 TYLAWLMRSYDLCKVFYAMGSGL--RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 ++ +L+ S F SG + S+ ED++R + I+ Sbjct: 153 AWV-YLLASPIGQLQFNRAESGASGQTSVTEEDLRRFRFP-------TKLLAQIDALAKE 204 Query: 389 ID---VLVEKIEQSIVL 402 +D + + Sbjct: 205 LDLERKKINLERCELDK 221 >gi|261380922|ref|ZP_05985495.1| type I restriction/modification specificity protein [Neisseria subflava NJ9703] gi|284796175|gb|EFC51522.1| type I restriction/modification specificity protein [Neisseria subflava NJ9703] Length = 130 Score = 37.9 bits (86), Expect = 3.6, Method: Composition-based stats. Identities = 12/95 (12%), Positives = 23/95 (24%), Gaps = 3/95 (3%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLP---KDGNSRQSDTSTVS 79 + WK + ++ R + + + GT P + Sbjct: 16 EEWKNKTLGDLGRVEMCRRIFKEQTQPSGEIPFFKIGTFGQEPDAFISSELFEEYRQKYP 75 Query: 80 IFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVL 114 +G IL G R + +V Sbjct: 76 YPKQGDILISAAGTIGRTVKFTGENAYFQDSNIVW 110 >gi|300956330|ref|ZP_07168628.1| hypothetical protein HMPREF9547_02158 [Escherichia coli MS 175-1] gi|300316842|gb|EFJ66626.1| hypothetical protein HMPREF9547_02158 [Escherichia coli MS 175-1] Length = 112 Score = 37.9 bits (86), Expect = 3.7, Method: Composition-based stats. Identities = 9/88 (10%), Positives = 26/88 (29%), Gaps = 3/88 (3%) Query: 18 IGAIPKHWKVVPIKRFTKLNTGRTSESGK-DIIYIGLEDVESGTGKYLPKDGNSRQSDTS 76 +G +P W+ + +++ + + + + ++ S G + + Sbjct: 18 LGMLPTGWQKLSLEKCLNIEARKAYIQDNQEYDLVTVK--RSRGGVIRREHLKGKDISVK 75 Query: 77 TVSIFAKGQILYGKLGPYLRKAIIADFD 104 + +G L K A Sbjct: 76 SQFYIKEGDFLISKRQIVHGANQWAGPC 103 >gi|308272857|emb|CBX29461.1| hypothetical protein N47_J04420 [uncultured Desulfobacterium sp.] Length = 283 Score = 37.5 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 13/50 (26%), Positives = 24/50 (48%), Gaps = 8/50 (16%) Query: 371 PIKEQFDITNVI---NVETARIDVLVEKIEQSI-----VLLKERRSSFIA 412 PI EQ +I + + ID+++E I++ I LK+ ++ I Sbjct: 219 PISEQKEIIQKFRPDSPKDKLIDIIIETIKKVIPDMTDERLKKIKTRLIN 268 >gi|57505847|ref|ZP_00371772.1| type IIS restriction enzyme [Campylobacter upsaliensis RM3195] gi|57015877|gb|EAL52666.1| type IIS restriction enzyme [Campylobacter upsaliensis RM3195] Length = 1096 Score = 37.5 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 23/224 (10%), Positives = 60/224 (26%), Gaps = 17/224 (7%) Query: 186 TERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELN 245 ++ L E L + +G + E + Sbjct: 659 FKKCVSEYLAWEVSNVLKNQNSMQGGLEGNALSPRLRELEKDFKQSGGEWRDIEIHKLFT 718 Query: 246 RKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSL 305 +N + G+ + N G+ ++ +I + + ++ Sbjct: 719 PQNGDFDIQKLHLNDKGHQVVSAGLENNGVIGKTDIKARIFPKNTL------TCDMFGNV 772 Query: 306 RSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRS-YDLCKVFYAMGSGLRQSLKFEDVKR 364 + + + M + P + + S + K+ + + + Sbjct: 773 FYRDFEYKMVTHARVMCLHPLFELNKKTGLYIASTMNYFKLLFCFADMA----TWSKISN 828 Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI--VLLKER 406 L + +P + EQ + + I L + Q + L+E Sbjct: 829 LKLSLPVLNEQIAF----DYMESYIKALEAERLQELEAERLQEL 868 >gi|308274055|emb|CBX30654.1| hypothetical protein N47_E41660 [uncultured Desulfobacterium sp.] Length = 80 Score = 37.5 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 8/53 (15%), Positives = 25/53 (47%), Gaps = 5/53 (9%) Query: 371 PIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLR 422 P++ E N ++V + + + I L++ R + + ++G++ ++ Sbjct: 30 PLEMEIRQFNNSVSVYFEK----MFLNKSQIRTLEKIRDTLLPKLMSGEVRVK 78 >gi|302062589|ref|ZP_07254130.1| type I restriction-modification system, S subunit [Pseudomonas syringae pv. tomato K40] Length = 108 Score = 37.5 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 5/42 (11%), Positives = 12/42 (28%), Gaps = 4/42 (9%) Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 EQ + ++ D + + + LK + Sbjct: 3 EQQKVAEFLSSV----DDFIAAQARKVTALKIYKKGLTQRLF 40 >gi|241957263|ref|XP_002421351.1| palmitoyltransferase, putative; protein fatty acyltransferase, putative [Candida dubliniensis CD36] gi|223644695|emb|CAX40685.1| palmitoyltransferase, putative [Candida dubliniensis CD36] Length = 443 Score = 37.5 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 31/335 (9%), Positives = 88/335 (26%), Gaps = 19/335 (5%) Query: 82 AKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGW--LLSIDVTQRIEA 139 + + L + + + C+ + + + Sbjct: 93 REDETLITEEPISGDRCEWIRYCKKCNNYKPPRSHHCKICKQCVLQMDHHCPWTMNCVGN 152 Query: 140 ICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKK 199 M W G + I + + E + I I + Sbjct: 153 NNLPHFMRFLGWVIWGTGYLMIQLIKLIINYYENSNMPHYLFNKTELVAIIVITPINLFV 212 Query: 200 QALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSL 259 A + + + L K W + + N + + Sbjct: 213 FATILVLFIRCLINICKGMTQIEIWEWERLELQWSSKRLWRLIRFNYRKLHNDKPFPKLS 272 Query: 260 SYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSA 319 ++ N I + + + + + N++ + +I Sbjct: 273 TWTNTINNGDYGDDVDVDDVDDVDVEL--------TNLSSNNEEPIVPQNFTIDDLIFPY 324 Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKR--LPVLVPPIKE-QF 376 + + + I++ ++ +G + + + ++ L + PP Q Sbjct: 325 DLGIWKNLINAMNYPYMWLIPFAK----PKSNGYQPEISQDYLQDDQLNLPWPPDGIRQQ 380 Query: 377 DI-TNVINVETARIDVLVEKIEQSIVLLKERRSSF 410 +I NV++ + ++ + ++ +SI +E R Sbjct: 381 EIEINVLHQQHSQGNE-EDEELRSIRNYQELRRRL 414 >gi|237797998|ref|ZP_04586459.1| HNH endonuclease:S-type Pyocin [Pseudomonas syringae pv. oryzae str. 1_6] gi|331020849|gb|EGI00906.1| HNH endonuclease:S-type Pyocin [Pseudomonas syringae pv. oryzae str. 1_6] Length = 508 Score = 37.5 bits (85), Expect = 4.2, Method: Composition-based stats. Identities = 13/59 (22%), Positives = 25/59 (42%), Gaps = 11/59 (18%) Query: 371 PIKEQFDITNVIN-----VETARIDVLVEKIEQSIVLLKERRSSFI-----AAA-VTGQ 418 P Q +I + ++ +I L+ + + I L E ++S + A +TGQ Sbjct: 13 PNSIQEEIASKLDHNVDYSNEQQIRDLILQEKARINYLIESKNSLLEERCAQALGLTGQ 71 >gi|167620607|ref|ZP_02389238.1| hypothetical protein BthaB_30154 [Burkholderia thailandensis Bt4] Length = 199 Score = 37.5 bits (85), Expect = 4.4, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 34/104 (32%), Gaps = 1/104 (0%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + +V G+++FR + N + + ++ YL W + Sbjct: 62 ELKDRHLVQEGDLLFRSRGVTNSAALVGGGLGRAVLAAPMLLIRSNTEIVEPAYLQWFIN 121 Query: 339 SYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNV 381 +G + L + L V +PP++ Q I V Sbjct: 122 HPATQAALAGQAAGTAVKMLGKGVLDGLEVTLPPLERQHLIVEV 165 Score = 36.3 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 50/151 (33%), Gaps = 10/151 (6%) Query: 33 FTKLNTGRTSES------GKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 ++ G + S D++ I ++DV+ + R + + +G + Sbjct: 15 IAEVRMGYSFRSRLETDADGDVVVIQMKDVDDANLLHPEGLARIRMPELKDRHLVQEGDL 74 Query: 87 LYGKLGPYLRKAIIADFDG----ICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 L+ G A++ G + + V P LQ ++ + Sbjct: 75 LFRSRGVTNSAALVGGGLGRAVLAAPMLLIRSNTEIVEPAYLQWFINHPATQAALAGQAA 134 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 G + + + + +PPL Q LI E Sbjct: 135 GTAVKMLGKGVLDGLEVTLPPLERQHLIVEV 165 >gi|257457736|ref|ZP_05622898.1| type I restriction enzyme EcoEI specificity protein [Treponema vincentii ATCC 35580] gi|257444870|gb|EEV19951.1| type I restriction enzyme EcoEI specificity protein [Treponema vincentii ATCC 35580] Length = 182 Score = 37.5 bits (85), Expect = 4.6, Method: Composition-based stats. Identities = 13/85 (15%), Positives = 26/85 (30%), Gaps = 8/85 (9%) Query: 20 AIPKHWKVVPIKRFT-KLNTGRTS------ESGKDIIYIGLEDVESGTGKYLPKDGNSRQ 72 +P WK V + + + +G++ G ++ + G K Sbjct: 86 ELPIGWKWVRLGEISHNIESGKSILCKEAVPCGDEVGIVKTGVCSFGYFKEDESKTCLSD 145 Query: 73 SDTSTVSIFAKGQILYGKLGPYLRK 97 D + G + + YLR Sbjct: 146 KDWHDEYVIHVGDF-FNRTRKYLRI 169 >gi|293401667|ref|ZP_06645809.1| type I restriction-modification system, S subunit [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291304925|gb|EFE46172.1| type I restriction-modification system, S subunit [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 58 Score = 37.5 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 27/57 (47%), Gaps = 4/57 (7%) Query: 365 LPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 +P+L+P ++ + A +D ++ I L++ R + ++G++D+ Sbjct: 1 MPILIPSDEK----LDEFEGIVAPMDAVIRNNYDEICRLEQIRDLLLPKLMSGELDV 53 >gi|328474285|gb|EGF45090.1| cryptic beta-D-galactosidase subunit alpha [Vibrio parahaemolyticus 10329] Length = 1032 Score = 37.5 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 34/336 (10%), Positives = 83/336 (24%), Gaps = 20/336 (5%) Query: 84 GQILYGKLG-PYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICE 142 G + + A + D I + D + + Sbjct: 392 GLFVMAETDVETHGFANVGDLSRITNDPTWEAVFVDRAVRHVHAQKNHPSIIMWSLGNES 451 Query: 143 GATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQAL 202 G + + + +++ + + F E EK + + Sbjct: 452 GYGCNIRAMYTATKAIDDTRLVHYEEDRDAEVVDVISTMYSRAQLMNYFGEHPHEKPRII 511 Query: 203 VSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYG 262 Y G P + + + V + ++ E YG Sbjct: 512 CEYAHAMGNGPGGLTEYQNVFYAHDHIQGHYVWEWCDHGILA--RDEHGQEFYKYGGDYG 569 Query: 263 NIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA 322 + GL + + V + ++ + V + T+ + Sbjct: 570 DYPNNYNFCMDGLIYPDQTPGPGLKEYKQVIAPVKIRAVEGCHGHFIVENKLWFTN--LD 627 Query: 323 VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVI 382 D +RS S + + + +P + E+ N Sbjct: 628 DYTITADVRAEGETLRSVQFKVEALVANSA----------REVSIDLPELDEREAFVNFT 677 Query: 383 NVETARIDVLVEKIEQSIVLLK-ERR--SSFIAAAV 415 + +R L + I + + + + ++ + A V Sbjct: 678 VRKDSR--TLYSEANHEIAVYQFQLKENTATLPALV 711 >gi|195573333|ref|XP_002104648.1| GD18327 [Drosophila simulans] gi|194200575|gb|EDX14151.1| GD18327 [Drosophila simulans] Length = 422 Score = 37.5 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 8/35 (22%), Positives = 14/35 (40%) Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 EQ + + +D L +Q + LKE + Sbjct: 145 EQQRVAPNVEALDKELDELKRSEQQLLSELKELKK 179 >gi|13508028|ref|NP_109977.1| this protein specifications means only that it IS a type I restriction with enzyme ecokI specificity protein [Mycoplasma pneumoniae M129] gi|12229981|sp|P75488|T1SZ_MYCPN RecName: Full=Putative type-1 restriction enzyme specificity protein MPN_289; AltName: Full=S.MpnORFEBP; AltName: Full=Type I restriction enzyme specificity protein MPN_289; Short=S protein gi|1674243|gb|AAB96194.1| HsdS1B [Mycoplasma pneumoniae M129] Length = 187 Score = 37.5 bits (85), Expect = 4.7, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 38/122 (31%), Gaps = 5/122 (4%) Query: 288 PGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY 347 GE V D S+ + V I +LA+ +R V Y Sbjct: 54 KGEYVTWTTDGA-QAGSVFYRNGQFNATNVCGILKVNNDEIYPKFLAYALRLKAPKFVNY 112 Query: 348 AMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSI-VLLKER 406 A L + + + K Q I +++ T L ++ + L+ER Sbjct: 113 ACP---IPKLMQGTLAEIELDFTSKKIQEKIATILDTFTELSAELSAELSAELSAELRER 169 Query: 407 RS 408 + Sbjct: 170 KK 171 >gi|183596982|ref|ZP_02958475.1| hypothetical protein PROSTU_00211 [Providencia stuartii ATCC 25827] gi|188023635|gb|EDU61675.1| hypothetical protein PROSTU_00211 [Providencia stuartii ATCC 25827] Length = 500 Score = 37.5 bits (85), Expect = 4.8, Method: Composition-based stats. Identities = 11/34 (32%), Positives = 18/34 (52%) Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +++ I+ VE IEQ I L+++R S I Sbjct: 462 EILSEHYDNIEQKVEDIEQQIAELEKKRQSLINQ 495 >gi|21355155|ref|NP_651209.1| Autophagy-specific gene 6 [Drosophila melanogaster] gi|13123993|sp|Q9VCE1|BECN1_DROME RecName: Full=Beclin-1-like protein; AltName: Full=Autophagy protein 6-like; Short=APG6-like gi|7301093|gb|AAF56227.1| Autophagy-specific gene 6 [Drosophila melanogaster] gi|16769506|gb|AAL28972.1| LD35669p [Drosophila melanogaster] Length = 422 Score = 37.5 bits (85), Expect = 4.8, Method: Composition-based stats. Identities = 8/35 (22%), Positives = 14/35 (40%) Query: 374 EQFDITNVINVETARIDVLVEKIEQSIVLLKERRS 408 EQ + + +D L +Q + LKE + Sbjct: 145 EQQRVAPNVEALDKELDELKRSEQQLLSELKELKK 179 >gi|13358010|ref|NP_078284.1| type I restriction enzyme S protein (fragment) [Ureaplasma parvum serovar 3 str. ATCC 700970] gi|168281624|ref|ZP_02689291.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 14 str. ATCC 33697] gi|170762359|ref|YP_001752532.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 3 str. ATCC 27815] gi|11357073|pir||E82889 type I restriction enzyme S protein, truncated homolog UU447 [imported] - Ureaplasma urealyticum gi|6899439|gb|AAF30859.1|AE002141_5 type I restriction enzyme S protein (fragment) [Ureaplasma parvum serovar 3 str. ATCC 700970] gi|168827936|gb|ACA33198.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 3 str. ATCC 27815] gi|182676136|gb|EDT88041.1| type I restriction enzyme S protein [Ureaplasma parvum serovar 14 str. ATCC 33697] Length = 124 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 6/55 (10%) Query: 369 VPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKE-----RRSSFIAAAVTGQ 418 +PP+ EQ I + IN+ I ++IEQ + L+ + S + A+ G+ Sbjct: 1 MPPLDEQQRIVDKINLLEFFIKQY-DEIEQKLSKLENEFPEKLKKSVLQYAMQGK 54 >gi|134045654|ref|YP_001097140.1| hypothetical protein MmarC5_0613 [Methanococcus maripaludis C5] gi|132663279|gb|ABO34925.1| hypothetical protein MmarC5_0613 [Methanococcus maripaludis C5] Length = 191 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 50/145 (34%), Gaps = 14/145 (9%) Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMA---- 322 E + E + +I+ R L+ + + E +I S + Sbjct: 44 NKEFLDEFYTLEEISNEYLTSENDIIVR---LREPVFACSIEKENEGLLIPSYFAKLSIN 100 Query: 323 -VKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQS-LKFEDVKRLPVLVPPIKEQFDITN 380 S Y+A + S + K F G S +K + ++ L + I++Q I Sbjct: 101 EEYSKEFLSKYVAHYINSKNAQKEFKKDTEGSVISMIKLKAIENLEIPEVLIEKQEKIIK 160 Query: 381 VINVETARIDVLVEKIEQSIVLLKE 405 + A K+ + + +LKE Sbjct: 161 I-----AEFKQKELKLLKELTILKE 180 >gi|325478472|gb|EGC81586.1| hypothetical protein HMPREF9290_0870 [Anaerococcus prevotii ACS-065-V-Col13] Length = 701 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 49/155 (31%), Gaps = 10/155 (6%) Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQV 310 LS NI + + + G+++ N Sbjct: 548 YDCDGFSYLSNSNIAKGFVSGPYSSFAGDVSSLFYASEGDLIISKTFPYN----TAIVDD 603 Query: 311 MERGIITSAYM--AVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVL 368 + ++ + + D Y+ ++S ++ + +L + +K L V Sbjct: 604 GNKYLVNDNLFVLRIDKNKADPYYILAFLKSKKTKELIKSKLKNS-NNLSMKVLKNLEVA 662 Query: 369 VPPIKEQFDITNVINVETARIDVL---VEKIEQSI 400 + ++++ DI N I A+ + + E+ + Sbjct: 663 LYSMEKREDIKNNIMDNLAKTKKAYKNISEFEKEL 697 >gi|284005606|ref|YP_003391426.1| hypothetical protein Slin_6670 [Spirosoma linguale DSM 74] gi|283820790|gb|ADB42627.1| hypothetical protein Slin_6670 [Spirosoma linguale DSM 74] Length = 109 Score = 37.1 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 8/59 (13%), Positives = 19/59 (32%), Gaps = 8/59 (13%) Query: 20 AIPKHWKVVPIKRFTK-LNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTST 77 +P W+ V + + + G T + + +G ++ +D S Sbjct: 30 DLPTEWQWVKLDDVCEKILGGGTPSTKNTDYW-------NGNIDWITSADIYGINDMSK 81 >gi|257900282|ref|ZP_05679935.1| type I restriction-modification system specificity subunit [Enterococcus faecium Com15] gi|257838194|gb|EEV63268.1| type I restriction-modification system specificity subunit [Enterococcus faecium Com15] Length = 75 Score = 37.1 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 8/51 (15%), Positives = 15/51 (29%), Gaps = 4/51 (7%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIG----LEDVESGTGKYLPKDGNSR 71 W+ + + G T + + G VE G Y+ + Sbjct: 24 WEQRKLGEVADIIGGGTPSTNVSEYWNGDIDWYSPVEIGNQIYIDESQKKI 74 >gi|225436735|ref|XP_002266031.1| PREDICTED: hypothetical protein [Vitis vinifera] gi|296086608|emb|CBI32243.3| unnamed protein product [Vitis vinifera] Length = 741 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 28/58 (48%), Gaps = 7/58 (12%) Query: 360 EDVKRLPVLVPPIKEQFDITNVINVETAR-IDVLVEKIEQSIVLLKERRSSFIAAAVT 416 + ++R+ + +PP++E+ I I+ + ID + +I +S +L VT Sbjct: 646 DRMRRIKIPLPPVEERKKIVEDIDKDRRYAIDASIVRIMKSRKILSH------QQLVT 697 >gi|255525764|ref|ZP_05392695.1| type I restriction enzyme, methylase subunit [Clostridium carboxidivorans P7] gi|296188049|ref|ZP_06856441.1| hypothetical protein CLCAR_3564 [Clostridium carboxidivorans P7] gi|255510587|gb|EET86896.1| type I restriction enzyme, methylase subunit [Clostridium carboxidivorans P7] gi|296047175|gb|EFG86617.1| hypothetical protein CLCAR_3564 [Clostridium carboxidivorans P7] Length = 191 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 49/136 (36%), Gaps = 14/136 (10%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + + G++VF I ++ + + + + IDS +L +++ Sbjct: 57 TNDKVNTLCHGDVVFSLITGI---ATIVRKEHEGYLYTQNYVKLLPSNNIDSKFLVYIIN 113 Query: 339 SYDLCKVFYAMG-SGLRQ-SLKFEDVKRLPVL-VPPIKEQFDITN------VINVETARI 389 K + +G G + + +K L + +P I +Q I + R Sbjct: 114 ENKTIKKQFVLGLQGSQVLKYTLKQLKELEIPKIPSIDKQKIIGQVYFNQLRLQALRNRA 173 Query: 390 DVLVEKIEQSIVLLKE 405 L KI + L+E Sbjct: 174 AELETKIR--LSKLEE 187 >gi|320013188|gb|ADW08036.1| hypothetical protein Sfla_6746 [Streptomyces flavogriseus ATCC 33331] Length = 396 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 17/101 (16%), Positives = 38/101 (37%), Gaps = 1/101 (0%) Query: 303 RSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFED 361 + + + + S + + ++ + + A SG + +L Sbjct: 82 AASVTGEHEDWNTARSVAVVRCAAPALAEWVRVWLTAPAARAWCVAHASGSAQATLGLAK 141 Query: 362 VKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVL 402 ++RLPV +PP+ Q + + ARI+ E ++ L Sbjct: 142 LRRLPVPMPPLGVQDRVLRTVKTIEARIEANERIAETAVAL 182 >gi|67476356|ref|XP_653781.1| tyrosine kinase [Entamoeba histolytica HM-1:IMSS] gi|56470765|gb|EAL48394.1| tyrosine kinase, putative [Entamoeba histolytica HM-1:IMSS] Length = 1348 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 23/275 (8%), Positives = 57/275 (20%), Gaps = 11/275 (4%) Query: 72 QSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSI 131 + I+ +L K D+ C + + + + L W I Sbjct: 542 NKCSKGYYKKINNDIILCELTINNCKIYQEDYCIECENGYYLDNINECILAPLHCWKYDI 601 Query: 132 DVTQRIEAICEGATMSHAD----WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITE 187 D + ++ I N + + + + + I Sbjct: 602 DNNKCTVCDLNTNNINGMCLEYEQCEINNTLGITNEYFKINNYCKYFDRSSNKCNDCIEN 661 Query: 188 RIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRK 247 Q + + + SG + Sbjct: 662 YHVGANGDC--IQNNIEELKMDLYEHCLSYTSSGCSRCIDSYYFNYQTKQCEACHNSCKH 719 Query: 248 NTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRS 307 + + + + N E + + G + I L + ++ Sbjct: 720 CSGPSSTECTTCDSSHFFLNGTCEN----NEELKNKCNIIIG-VENSKICLSCKEGFVKQ 774 Query: 308 AQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDL 342 + + H + +L S Sbjct: 775 GFECNECPKNCSSCYDQEHCFECKESFYLKNSLCY 809 >gi|325973135|ref|YP_004250199.1| hypothetical protein MSU_0275 [Mycoplasma suis str. Illinois] gi|323651737|gb|ADX97819.1| hypothetical protein MSU_0275 [Mycoplasma suis str. Illinois] Length = 87 Score = 37.1 bits (84), Expect = 5.6, Method: Composition-based stats. Identities = 11/89 (12%), Positives = 29/89 (32%), Gaps = 4/89 (4%) Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 + + L + + LK ++ VL+P K ++ +I+ L Sbjct: 1 MLFFSLQKKLIFWNSEITGTTLKHLKKGILQEHLVLLPDQKTLEKFNSICEDIQLKIEKL 60 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + I + R + + ++ + Sbjct: 61 T----KKIENSERIRDKLLDKLFSQKVKI 85 >gi|302503817|ref|XP_003013868.1| hypothetical protein ARB_07980 [Arthroderma benhamiae CBS 112371] gi|291177434|gb|EFE33228.1| hypothetical protein ARB_07980 [Arthroderma benhamiae CBS 112371] Length = 765 Score = 37.1 bits (84), Expect = 5.8, Method: Composition-based stats. Identities = 8/36 (22%), Positives = 16/36 (44%), Gaps = 3/36 (8%) Query: 382 INVETARI---DVLVEKIEQSIVLLKERRSSFIAAA 414 I + I + L++K ++ I LKE + + Sbjct: 653 IEKKDKEIQRQEKLIQKKDKEIQRLKESKDALSQEL 688 >gi|227528989|ref|ZP_03959038.1| possible type I restriction system specificity protein [Lactobacillus vaginalis ATCC 49540] gi|227351094|gb|EEJ41385.1| possible type I restriction system specificity protein [Lactobacillus vaginalis ATCC 49540] Length = 159 Score = 37.1 bits (84), Expect = 5.9, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 51/161 (31%), Gaps = 9/161 (5%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRF----IDLQNDKRSLRSAQVM 311 + S+ YG + K + G K + + + + I + S Sbjct: 1 MYSIKYGKTLPKSQLLRTGTKVFGAKGFMGYTERKPLVNKATVTITSRGSGAGFVSYIDE 60 Query: 312 ERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPP 371 E +T+ + + + + F + + L ++K+L + +P Sbjct: 61 EEAFLTNNLLYLIDKTGLGLSFTYELIKSSKPSQF--VTGSAQPQLTINNLKQLKLKIPT 118 Query: 372 IKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 I T RI+ + + L++ R +F+ Sbjct: 119 D---RFIIEQFKNNTKRIEDCMSNNKIENNSLQKLRLAFLN 156 >gi|254503197|ref|ZP_05115348.1| Transglycosylase SLT domain protein [Labrenzia alexandrii DFL-11] gi|222439268|gb|EEE45947.1| Transglycosylase SLT domain protein [Labrenzia alexandrii DFL-11] Length = 464 Score = 37.1 bits (84), Expect = 6.0, Method: Composition-based stats. Identities = 8/45 (17%), Positives = 16/45 (35%), Gaps = 2/45 (4%) Query: 379 TNVINVETARIDVLVEKIEQSIV--LLKERRSSFIAAAVTGQIDL 421 + I+ ++ I L+ RR + V G+ D+ Sbjct: 63 VEFLRAFDTHINKGIKNETDKIAVVLIPTRRDRLLPDLVEGKGDV 107 >gi|329575633|gb|EGG57166.1| hypothetical protein HMPREF9520_01720 [Enterococcus faecalis TX1467] Length = 68 Score = 37.1 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 7/58 (12%), Positives = 24/58 (41%), Gaps = 4/58 (6%) Query: 355 QSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 + + ++ ++ + + +EQ I + +D + + + LK + S++ Sbjct: 11 KKINLGEINQVEMKITIFEEQDKIGD----LFTNLDDAIILNQNKLNQLKSLKKSYLQ 64 >gi|268592333|ref|ZP_06126554.1| proline permease [Providencia rettgeri DSM 1131] gi|291312118|gb|EFE52571.1| proline permease [Providencia rettgeri DSM 1131] Length = 500 Score = 37.1 bits (84), Expect = 6.1, Method: Composition-based stats. Identities = 10/34 (29%), Positives = 18/34 (52%) Query: 380 NVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +++ I+ VE I+Q I L+++R S I Sbjct: 462 EILSEHFDNIEQKVEDIDQQIAELEKKRQSLINQ 495 >gi|325473094|gb|EGC76292.1| DNA methylase BstVI [Treponema denticola F0402] Length = 418 Score = 36.7 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 38/368 (10%), Positives = 103/368 (27%), Gaps = 41/368 (11%) Query: 56 VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQ 115 +E GT K L + + K +Y ++ PY + + + Sbjct: 32 LEFGTYKTLYQKWDLYIPFIEKSLQLLKNDGIYSQIVPYP----------VTNQNYAKRL 81 Query: 116 PKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLI-REKI 174 + ++ + + + + E+ + N ++Q I Sbjct: 82 RQIIINKYDLFEITDLKGIKVFESATVTNCILFIKKVLPRNHVRISNFNSKQNNIKVVFT 141 Query: 175 IAETVRID----------TLITERIRFIELLKEKKQALVSY-IVTKGLNPDVKMKDSGIE 223 + T + I R + +S +V V+ K + Sbjct: 142 KSYTELVQDEKTVVWNLTQDIRNTNRHANMYTLGDFCYISKGMVLNSDEKIVQDKFVKAD 201 Query: 224 WVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETY 283 + + D F ++ ++ + + + + + Sbjct: 202 LISELKDDIHNMKFIESKDIERYHVKRIRFLEYNTMRVPKRLSRPTFKELYFTKKLLFNR 261 Query: 284 ----QIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGID-------STY 332 Q+V + D + +E IT++ H + + Sbjct: 262 LGDLQVVFDKNGEYTTSDAMFVAILWKDLHGVENKSITTSIKKFSHHTRNEMEKLSEKIH 321 Query: 333 LAWLM--RSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 L +++ + + G + E ++ +P+ + P ++Q I ++++ Sbjct: 322 LLYILAIMNSHYANILLTNIRGGDYHIYPEHIRNIPIPLAPKEQQKPIIDLVDQI----- 376 Query: 391 VLVEKIEQ 398 L+ K + Sbjct: 377 -LIAKQKN 383 >gi|309972508|gb|ADO95709.1| Hypothetical protein R2846_0247 [Haemophilus influenzae R2846] Length = 482 Score = 36.7 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 32/376 (8%), Positives = 84/376 (22%), Gaps = 44/376 (11%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS----- 79 W +V + + + YI D+ T ++ K+ Sbjct: 113 WNIVQNDFLKSFISEKFDFIIGNPPYITYSDLNKTTRSFIKKNFTVCSEGKPDYYYAFIE 172 Query: 80 -------------IFAKGQILYGKLGPYLRKAIIADFDGIC-------------STQFLV 113 I + LR ++ I S+ ++ Sbjct: 173 RSIKALADDGKLAYLIPNNIFKNRFADRLRVFMLPHLSVIVDYTTEKLFENKLTSSAIII 232 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 ++ + + + + N + Sbjct: 233 CNGTQHNNDIRYVDKVKKREIKMHKNNLTHKWIFRKKEVAKENKKRKFGDDFNAMCPIAT 292 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 ++ E + + + ++ ++ V+ ++ E++ + + Sbjct: 293 LLNEVFILKKYEECDEYILVNGYKIEKTILREAVSP-----RSLQYGRKEFLIFPYSYRD 347 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 ++ + N K E S ++ ++ Sbjct: 348 NNILRYEELGFIQEFPGAYSYLKFFIEKLNKRDKDENAKWFEFGRSQALSRLNQEKLLLS 407 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY---AMG 350 I + + QV GI + + +++S D Y Sbjct: 408 TLITEEVKIHHISRNQVPYSGIC-----IYQKGDLSLEIAEKILKSDDFLNYVYDIGINA 462 Query: 351 SGLRQSLKFEDVKRLP 366 SG + DV Sbjct: 463 SGTTMRITARDVMNFE 478 >gi|219855646|ref|YP_002472768.1| hypothetical protein CKR_2303 [Clostridium kluyveri NBRC 12016] gi|219569370|dbj|BAH07354.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 191 Score = 36.7 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 51/134 (38%), Gaps = 10/134 (7%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + + G++VF I ++ + + + H IDS +L +L+ Sbjct: 57 TNDKVNTLCHGDVVFSLITGT---AAMVRKEHEGYLYTQNYVKLLPGHNIDSKFLVYLIN 113 Query: 339 SYDLCKVFYAMG-SGLRQ-SLKFEDVKRLPVL-VPPIKEQFDITN-VINVE-TARIDVLV 393 K + +G G + + +K L + +P I +Q I N + Sbjct: 114 ENKTIKKQFVLGLQGSQVLKYTLKQLKELEIPRIPSIDKQKIIGQVYFNQLRLKTLRNRA 173 Query: 394 EKIEQSIVL--LKE 405 ++E I+L L+E Sbjct: 174 AELEAKIILSRLEE 187 >gi|167761887|ref|ZP_02434014.1| hypothetical protein BACSTE_00230 [Bacteroides stercoris ATCC 43183] gi|167700257|gb|EDS16836.1| hypothetical protein BACSTE_00230 [Bacteroides stercoris ATCC 43183] Length = 257 Score = 36.7 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 41/148 (27%), Gaps = 21/148 (14%) Query: 10 YKDSGVQ--WIGA----IPKHWKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKY 63 YK SG + W IPK W IK + TG+ +DV Sbjct: 23 YKSSGGEMVWNEKLRRDIPKGWSCSDIKSALNIFTGK-------------KDVSKAIPGP 69 Query: 64 LPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPEL 123 + + +S ++ +L G Y + + + K Sbjct: 70 YKFFSCAPEPISSNEYMYDGEVVLVSGNGSYTGRVGYFNGKFDLYQRTYACVLKSESDTW 129 Query: 124 L--QGWLLSIDVTQRIEAICEGATMSHA 149 + + L G+++ + Sbjct: 130 MPFYYYTLRYMFQPVFSGGKHGSSIPYI 157 Score = 36.3 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 48/194 (24%), Gaps = 8/194 (4%) Query: 230 DHWEVKPFFALVTELNRKNTKLIESNILSLSYGNII-QKLETRNMGLKPESYETYQIVDP 288 + + + + + NI + + PE + + + Sbjct: 29 EMVWNEKLRRDIPKGWSCSDIKSALNIFTGKKDVSKAIPGPYKFFSCAPEPISSNEYMYD 88 Query: 289 GEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYA 348 GE+V + R + + +K + Y VF Sbjct: 89 GEVVLVSGNGSYTGR-VGYFNGKFDLYQRTYACVLKSESDTWMPFYYYTLRYMFQPVFSG 147 Query: 349 MGSGL-RQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 G + D+ + +++ + K EQ I E Sbjct: 148 GKHGSSIPYIVLGDLADFKFAK-ELTIINRFVSIVAPMFNEQFKRLVKTEQLIKQRNE-- 204 Query: 408 SSFIAAAVTGQIDL 421 + + GQ+ + Sbjct: 205 --LLPLLMNGQVSV 216 >gi|146183761|ref|XP_001026999.2| hypothetical protein TTHERM_00689960 [Tetrahymena thermophila] gi|146143487|gb|EAS06757.2| hypothetical protein TTHERM_00689960 [Tetrahymena thermophila SB210] Length = 1913 Score = 36.7 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 26/261 (9%), Positives = 60/261 (22%), Gaps = 6/261 (2%) Query: 58 SGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPK 117 S G Y+ + + + + S + + +Y + L + + Q Sbjct: 1284 SKNGNYIQVEISFQNNLESIYTRLLEQDDIYFCVSDILGSQCVNSENLDIYQQNFRSNLL 1343 Query: 118 DVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAE 177 + L + + + + + Sbjct: 1344 FIQKNLPADISHQYVTVIISDQLMQNEKNKLIYQIRWQSWEQKPSDQQLRCPFD------ 1397 Query: 178 TVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPF 237 + I + + V P + +SGI+W+ + P + Sbjct: 1398 CSQRGQCINGICNCSQDYVGRACEFNLRQVNVKQKPWIAQMESGIQWISVDPQDDIILLD 1457 Query: 238 FALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFID 297 F T +I S + QK+ + + Q + + Sbjct: 1458 FKQGLLQVIAFTDFTSFSISSQYLLSEPQKIILDKKYIMKFDLDGEQSSNSNNFLSGSNS 1517 Query: 298 LQNDKRSLRSAQVMERGIITS 318 K L +G Sbjct: 1518 TLTQKPLLLCFYSYRQGTNFQ 1538 >gi|53729077|ref|ZP_00348312.1| COG0732: Restriction endonuclease S subunits [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 216 Score = 36.7 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 59/209 (28%), Gaps = 16/209 (7%) Query: 215 VKMKDSGIEWVGLVPDHWEVKPFFALVTELNR--KNTKLIESNILSLSYGNIIQKLETRN 272 K SG E V +V + N K + + I ++ Sbjct: 21 NPYKSSGGEMVYNPELKRDVPKGWECDFVENYLDKVPNTDKIPSKEIQVKGQIPVIDQSQ 80 Query: 273 MGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTY 332 + + +++P + F D R ++ + + + + Sbjct: 81 DYICGFTDNENALLEPIDAHIIFGD---HTRVVKLVNFPYARGADGTQIIISNNKKLPNF 137 Query: 333 LAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL 392 L + M G + ++ +K VL+P I + L Sbjct: 138 LFYQM-----IAKIDLSNYGYARH--YKFLKESKVLIPT----EYIAQKYHQTVKPYFDL 186 Query: 393 VEKIEQSIVLLKERRSSFIAAAVTGQIDL 421 + + L + R + + GQ+++ Sbjct: 187 WKTNLKETQKLTQLRDFLLPMLMNGQVEV 215 >gi|229846420|ref|ZP_04466528.1| putative site-specific DNA-methyltransferase restriction-modification protein [Haemophilus influenzae 7P49H1] gi|229810513|gb|EEP46231.1| putative site-specific DNA-methyltransferase restriction-modification protein [Haemophilus influenzae 7P49H1] Length = 475 Score = 36.7 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 32/376 (8%), Positives = 84/376 (22%), Gaps = 44/376 (11%) Query: 25 WKVVPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVS----- 79 W +V + + + YI D+ T ++ K+ Sbjct: 106 WNIVQNDFLKSFISEKFDFIIGNPPYITYSDLNKTTRSFIKKNFTVCSEGKPDYYYAFIE 165 Query: 80 -------------IFAKGQILYGKLGPYLRKAIIADFDGIC-------------STQFLV 113 I + LR ++ I S+ ++ Sbjct: 166 RSIKALADDGKLAYLIPNNIFKNRFADRLRVFMLPHLSVIVDYTTEKLFENKLTSSAIII 225 Query: 114 LQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHADWKGIGNIPMPIPPLAEQVLIREK 173 ++ + + + + N + Sbjct: 226 CNGTQHNNDIRYVDKVKKREIKMHKNNLTHKWIFRKKEVAKENKKRKFGDDFNAMCPIAT 285 Query: 174 IIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWE 233 ++ E + + + ++ ++ V+ ++ E++ + + Sbjct: 286 LLNEVFILKKYEECDEYILVNGYKIEKTILREAVSP-----RSLQYGRKEFLIFPYSYRD 340 Query: 234 VKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVF 293 ++ + N K E S ++ ++ Sbjct: 341 NNILRYEELGFIQEFPGAYSYLKFFIEKLNKRDKDENAKWFEFGRSQALSRLNQEKLLLS 400 Query: 294 RFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFY---AMG 350 I + + QV GI + + +++S D Y Sbjct: 401 TLITEEVKIHHISRNQVPYSGIC-----IYQKGDLSLEIAEKILKSDDFLNYVYDIGINA 455 Query: 351 SGLRQSLKFEDVKRLP 366 SG + DV Sbjct: 456 SGTTMRITARDVMNFE 471 >gi|309799890|ref|ZP_07694095.1| putative HsdS [Streptococcus infantis SK1302] gi|308116480|gb|EFO53951.1| putative HsdS [Streptococcus infantis SK1302] Length = 74 Score = 36.7 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 33/69 (47%), Gaps = 4/69 (5%) Query: 353 LRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIA 412 ++ + E++ ++ V P+K F+I + V + L+E+ +Q L + R + Sbjct: 8 IQIKINQENMNKIVVPEIPLKLLFEINQKLEVIDKQQLNLIEENKQ----LTQLRDWLLP 63 Query: 413 AAVTGQIDL 421 + GQ+ + Sbjct: 64 MLMHGQVKV 72 >gi|238854853|ref|ZP_04645183.1| hypothetical protein LACJE0001_0820 [Lactobacillus jensenii 269-3] gi|260664140|ref|ZP_05864993.1| predicted protein [Lactobacillus jensenii SJ-7A-US] gi|282933931|ref|ZP_06339279.1| hypothetical protein HMPREF0886_3264 [Lactobacillus jensenii 208-1] gi|238832643|gb|EEQ24950.1| hypothetical protein LACJE0001_0820 [Lactobacillus jensenii 269-3] gi|260562026|gb|EEX27995.1| predicted protein [Lactobacillus jensenii SJ-7A-US] gi|281302020|gb|EFA94274.1| hypothetical protein HMPREF0886_3264 [Lactobacillus jensenii 208-1] Length = 429 Score = 36.7 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 4/90 (4%) Query: 320 YMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLP-VLVPPIKEQFDI 378 Y+ + + L ++RS + K ++ F+ L +++P + EQ I Sbjct: 4 YIKLIILCMFLQCLWHIVRSLIIFKKIKHYS---INNINFKKFDGLLYIVIPCLLEQRII 60 Query: 379 TNVINVETARIDVLVEKIEQSIVLLKERRS 408 +N I+ I+ K + IV + + Sbjct: 61 SNTIDNFVKAIEESKIKAQLLIVTTNKEKK 90 >gi|153955215|ref|YP_001395980.1| Type I restriction enzyme, methylase subunit [Clostridium kluyveri DSM 555] gi|146348073|gb|EDK34609.1| Type I restriction enzyme, methylase subunit [Clostridium kluyveri DSM 555] Length = 191 Score = 36.7 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 51/134 (38%), Gaps = 10/134 (7%) Query: 279 SYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLAWLMR 338 + + + G++VF I ++ + + + H IDS +L +L+ Sbjct: 57 TNDKVNTLCHGDVVFSLITGT---AAMVRKEYEGYLYTQNYVKLLPGHNIDSKFLVYLIN 113 Query: 339 SYDLCKVFYAMG-SGLRQ-SLKFEDVKRLPVL-VPPIKEQFDITN-VINVE-TARIDVLV 393 K + +G G + + +K L + +P I +Q I N + Sbjct: 114 ENKTIKKQFVLGLQGSQVLKYTLKQLKELEIPRIPSIDKQKIIGQVYFNQLRLKTLRNRA 173 Query: 394 EKIEQSIVL--LKE 405 ++E I+L L+E Sbjct: 174 AELEAKIILSRLEE 187 >gi|63054529|ref|NP_593287.2| AAA family ATPase Cdc48 [Schizosaccharomyces pombe 972h-] gi|27151477|sp|Q9P3A7|CDC48_SCHPO RecName: Full=Cell division cycle protein 48 gi|159883922|emb|CAB99275.2| AAA family ATPase Cdc48 [Schizosaccharomyces pombe] Length = 815 Score = 36.7 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 17/100 (17%), Positives = 30/100 (30%), Gaps = 7/100 (7%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLED-VESGTGKYLPKDGNSRQSDTSTVSIFAKGQI 86 V + +N + + I + L D VE TG + KG + Sbjct: 114 VRLGDIVTINPCPDIKYAERISVLPLADTVEGLTGSLFDVYL--KPYFVEAYRPIRKGDL 171 Query: 87 LY--GKLGPYLRKAII--ADFDGICSTQFLVLQPKDVLPE 122 G + K + D GI S ++ + + Sbjct: 172 FVVRGSMRQVEFKVVDVAPDEFGIVSQDTIIHWEGEPINR 211 >gi|290978459|ref|XP_002671953.1| serine/threonine protein kinase [Naegleria gruberi] gi|284085526|gb|EFC39209.1| serine/threonine protein kinase [Naegleria gruberi] Length = 2148 Score = 36.7 bits (83), Expect = 7.1, Method: Composition-based stats. Identities = 47/386 (12%), Positives = 108/386 (27%), Gaps = 36/386 (9%) Query: 32 RFTKLNTGRTSESGKDIIYIGLED-VESGTGKYLPKDGNSRQSDTSTVSIFAKGQILYGK 90 K+ T + +++ ++ E VES T L K ++ + K IL Sbjct: 1203 DVFKILTDHVFTTYEELNFLTQEHTVESLTTWILNKMKYMKEHVLVNFT--QKSDIL--- 1257 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHAD 150 + + IC + L + + + + + + G T Sbjct: 1258 --NLPDIDDLEKWHLICLLTQSCPAVFHMNNSLSKHYFMIMILISSDIVLNNGIT--PLS 1313 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 I + + ++ T EL L+SY + Sbjct: 1314 PLLISVFAWSWCNFEFYPQMNMFLE---AGLEMEKTRFRNEHELASCTLMKLMSYYFSPT 1370 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 +NP S ++ V + F+++ S+ ++ Sbjct: 1371 INPLEVWNISEEAFLLSVKSKNKHYGGFSILWYPFHHLFYSHGDVKTSIGLSLNSIEIFK 1430 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFID--LQNDKRSLRSAQVMERGIITSAYMAVKPHGI 328 RN + I++ ++ +D L ++ + + +++ Sbjct: 1431 RNGNVMLMDASK-MILELKYVLSGDVDHYLPTYVSQVKEYPFFRQMHYSFKGISLYFQND 1489 Query: 329 DSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETAR 388 + +SYD + M S + + + + + Sbjct: 1490 IEKSFKCMEKSYDFREDSVGMCSSWIDLIFLGLISAI---------------FLRSAQEK 1534 Query: 389 IDVLVEKIEQSIVLLKERRSSFIAAA 414 I+ E ++ SI L+ I+ A Sbjct: 1535 IEKAQEILKYSIERLE-----MISQA 1555 >gi|222523315|ref|YP_002567785.1| MerR family transcriptional regulator [Chloroflexus sp. Y-400-fl] gi|222447194|gb|ACM51460.1| transcriptional regulator, MerR family [Chloroflexus sp. Y-400-fl] Length = 133 Score = 36.7 bits (83), Expect = 7.2, Method: Composition-based stats. Identities = 10/46 (21%), Positives = 19/46 (41%), Gaps = 10/46 (21%) Query: 376 FDITNVIN----------VETARIDVLVEKIEQSIVLLKERRSSFI 411 +I ++ A +D + I+Q I L+E R++ I Sbjct: 62 REIAAILALSDRGEPPCGEMLASLDRQIAAIDQRIADLQELRTALI 107 >gi|315650994|ref|ZP_07904031.1| DNA methylase-type I restriction-modification system [Eubacterium saburreum DSM 3986] gi|315486750|gb|EFU77095.1| DNA methylase-type I restriction-modification system [Eubacterium saburreum DSM 3986] Length = 153 Score = 36.7 bits (83), Expect = 7.5, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 49/144 (34%), Gaps = 10/144 (6%) Query: 270 TRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAV---KPH 326 T E+ I + +F D + + IITS+ + K Sbjct: 2 TDTSIYLSENEFKDCIRPKKDTIFLSKDGT---VGIAYKMNKDENIITSSAILHLDVKDE 58 Query: 327 GIDSTYLAWLMRSYDLCKVFYAMGSG-LRQSLKFEDVKRLPVLVPPIKEQFDITNVI--- 382 + YL ++ S + G + K +++ + + + ++Q I+ ++ Sbjct: 59 RVLPDYLTLVLNSVAVKMQAEKDAGGSIINHWKKSEIENVIIPIIVKEKQEQISKLLIEN 118 Query: 383 NVETARIDVLVEKIEQSIVLLKER 406 ++EK +S+ + E Sbjct: 119 ENLRNESKSILEKAVKSVEMAIEY 142 >gi|304372999|ref|YP_003856208.1| Type I site-specific DNA methyltransferase specificity subunit [Mycoplasma hyorhinis HUB-1] gi|304309190|gb|ADM21670.1| Type I site-specific DNA methyltransferase specificity subunit [Mycoplasma hyorhinis HUB-1] Length = 232 Score = 36.7 bits (83), Expect = 7.6, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 53/156 (33%), Gaps = 15/156 (9%) Query: 230 DHWEVKPFFALVTELNR-KNTKLIESNILSLSYGNIIQKLETRNMGLKPESY------ET 282 W+ + + ++ + Y + N+ LK +S E Sbjct: 23 SDWQSWTLEDKGYLYSGLNSKTKVDFTNGNSKYITYLNVFNNFNIDLKEKSLVFIKSDEK 82 Query: 283 YQIVDPGEIVFRFIDLQNDKRSLRSA---QVMERGIITSAYMAVKPHGID---STYLAWL 336 + G+I+F + + SA +V E+ + S + + D + A+L Sbjct: 83 QNSIVKGDILFTMSSETYQEVGMSSAVTEEVNEKIYLNSFCFGYRLNKADFLFPNFSAFL 142 Query: 337 MRSYDLCK--VFYAMGSGLRQSLKFEDVKRLPVLVP 370 R++ + + + G R +L + L + P Sbjct: 143 FRNHSVRHKIILQSNGGTSRFNLSKKSFLNLKIKSP 178 >gi|218441912|ref|YP_002380241.1| hypothetical protein PCC7424_5020 [Cyanothece sp. PCC 7424] gi|218174640|gb|ACK73373.1| hypothetical protein PCC7424_5020 [Cyanothece sp. PCC 7424] Length = 327 Score = 36.7 bits (83), Expect = 7.6, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 13/78 (16%) Query: 342 LCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV---INVETARIDVLVEKIEQ 398 + + M + +QS+ DV VPP + Q I N + + +ID ++++ Sbjct: 33 IREQQAQMTTYSQQSVDQPDV------VPP-QIQDKIANYQQKLQEKQEKIDRYEQQLKG 85 Query: 399 SIVLLKERRSS---FIAA 413 L+E ++ + A Sbjct: 86 LKAELEEYKTQNALLMQA 103 >gi|303248937|ref|ZP_07335184.1| phage shock protein A, PspA [Desulfovibrio fructosovorans JJ] gi|302489660|gb|EFL49596.1| phage shock protein A, PspA [Desulfovibrio fructosovorans JJ] Length = 226 Score = 36.7 bits (83), Expect = 7.7, Method: Composition-based stats. Identities = 12/53 (22%), Positives = 24/53 (45%), Gaps = 6/53 (11%) Query: 373 KEQFDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV--TGQIDLRG 423 +EQ I +++ I L E+ + KER+ + + + TG+ +R Sbjct: 106 EEQAAIAGMVDAYREDIGRL----EEKLASAKERQRTLLHRHLRATGKKRVRE 154 >gi|320166954|gb|EFW43853.1| predicted protein [Capsaspora owczarzaki ATCC 30864] Length = 327 Score = 36.7 bits (83), Expect = 7.8, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 45/115 (39%), Gaps = 22/115 (19%) Query: 324 KPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ-------- 375 P +D + ++ + + S + E + +PV V +KEQ Sbjct: 123 FPKELDHKTILRQLKVARMRAELEGISSRGIE----ETFQSMPVPV--LKEQVIESVARS 176 Query: 376 ----FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAVTGQIDLRGESQ 426 + + + E+A +D+ + + +++I +E + + QID GE Q Sbjct: 177 VIQLRESIDFMQSESASLDLQIARCQEAIAQHQEMNQALQNKVL--QID--GEDQ 227 >gi|317501643|ref|ZP_07959834.1| hypothetical protein HMPREF1026_01778 [Lachnospiraceae bacterium 8_1_57FAA] gi|316896894|gb|EFV18974.1| hypothetical protein HMPREF1026_01778 [Lachnospiraceae bacterium 8_1_57FAA] Length = 1255 Score = 36.7 bits (83), Expect = 7.8, Method: Composition-based stats. Identities = 38/373 (10%), Positives = 93/373 (24%), Gaps = 32/373 (8%) Query: 33 FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR--QSDTSTVSIFAKGQILYGK 90 L+ +T + + E V G + + D + +++ +++G Sbjct: 903 VYSLSNEKTFDITIGDRVLSEEIVSGGRVPVYSANVYEEFGRIDKENMKDYSRPSVIWGI 962 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHAD 150 G ++ I A + VL+ K + + + Sbjct: 963 DGDWMVNIIPAGVPFYPTDHCGVLRIKTE--------KILPEYMMYALQAEGEYERFSRN 1014 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 + + A + I++ II E +D I + IE + + I Sbjct: 1015 NRASAQRIRSLVVQAPETKIQKNIIDELKALDDKINGQNAEIEKYENSIRTKFDQIFHL- 1073 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 E++ + R + N Sbjct: 1074 -----------EEFISDGVFSKYEGYSVEDLCIDGRGRVINQQY------IENHKGPYPV 1116 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + + + K + + + + Sbjct: 1117 YSSQTTNDGIFGSIDTFDFDGEYITWTTDGAKAGTVFYRNGKFNCTNVCGTLKAKNDKVN 1176 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + + K +G+ L + +K++ V VP + Q + + + Sbjct: 1177 MRYLAYLLNRIAYKFVSRVGN---NKLMNDAMKKIVVPVPKRQLQDEFADFVQSVEKSKF 1233 Query: 391 VLVEKIEQ-SIVL 402 + K E+ I Sbjct: 1234 ECIGKKEKFEIEK 1246 >gi|225550405|ref|ZP_03771354.1| type I restriction modification DNA specificity family protein [Ureaplasma urealyticum serovar 2 str. ATCC 27814] gi|225379559|gb|EEH01921.1| type I restriction modification DNA specificity family protein [Ureaplasma urealyticum serovar 2 str. ATCC 27814] Length = 87 Score = 36.7 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 11/67 (16%), Positives = 24/67 (35%), Gaps = 7/67 (10%) Query: 346 FYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARIDVL-------VEKIEQ 398 F + S + L +L + +PPI Q I V++ + +E+ ++ Sbjct: 3 FINLKSAEHKRLWITKYSQLSIPIPPISIQNKIVEVLDKLETYTKDINTGLPLEIEQHKK 62 Query: 399 SIVLLKE 405 + Sbjct: 63 QYEYYRN 69 >gi|160887309|ref|ZP_02068312.1| hypothetical protein BACOVA_05327 [Bacteroides ovatus ATCC 8483] gi|156107720|gb|EDO09465.1| hypothetical protein BACOVA_05327 [Bacteroides ovatus ATCC 8483] Length = 204 Score = 36.7 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 10/158 (6%), Positives = 50/158 (31%), Gaps = 8/158 (5%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGI 315 I ++ + ++ + +V ++ L + + Sbjct: 52 INDITKQGKYVRYTENHLSQSGLENSSAWVVPKYSLIMSMYAS----VGLVTINEIPTTT 107 Query: 316 ITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQ 375 + + YL + + + + + +G + ++ + ++ + + + Sbjct: 108 SQAMFAMQLKDKGLLDYLYYYLSYFKYRYIHKYLETGTQSNINADIIRGIMIPIYGHSRN 167 Query: 376 FDITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAA 413 +I + + +ID + + L ++++ ++ Sbjct: 168 MEIASTLQGIDVKIDNELSV----LKLFNKQKNYLLSQ 201 >gi|332076175|gb|EGI86641.1| type I restriction-modification system S subunit [Streptococcus pneumoniae GA41301] Length = 156 Score = 36.7 bits (83), Expect = 7.9, Method: Composition-based stats. Identities = 14/156 (8%), Positives = 41/156 (26%), Gaps = 5/156 (3%) Query: 219 DSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLETRN----MG 274 G + D+ + + E L L+ N+ + + + + Sbjct: 1 MFGENKIFESIDNLFDIIDGDRGKNYPKSDELFSEEYCLFLNTKNVTKNGFSFDTKQFIT 60 Query: 275 LKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDSTYLA 334 + ++ +IV + + I S + ++P + Sbjct: 61 KTKDKLLRKGKLERYDIVLTTRGTVGNVAYYDELIKYKHLRINSGMVILRPKTPNLNQ-K 119 Query: 335 WLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVP 370 +++ + + L +K+ P Sbjct: 120 FIIHVLRNNNYSRVISGSAQPQLPITKLKKYFSPSP 155 >gi|331088477|ref|ZP_08337391.1| hypothetical protein HMPREF1025_00974 [Lachnospiraceae bacterium 3_1_46FAA] gi|330407817|gb|EGG87308.1| hypothetical protein HMPREF1025_00974 [Lachnospiraceae bacterium 3_1_46FAA] Length = 1239 Score = 36.7 bits (83), Expect = 8.1, Method: Composition-based stats. Identities = 38/373 (10%), Positives = 93/373 (24%), Gaps = 32/373 (8%) Query: 33 FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR--QSDTSTVSIFAKGQILYGK 90 L+ +T + + E V G + + D + +++ +++G Sbjct: 887 VYSLSDEKTFDITIGDRVLSEEIVSGGRVPVYSANVYEEFGRIDKENMKDYSRPSVIWGI 946 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHAD 150 G ++ I A + VL+ K + + + Sbjct: 947 DGDWMVNIIPAGVPFYPTDHCGVLRIKTE--------KILPEYMMYALQAEGEYERFSRN 998 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 + + A + I++ II E +D I + IE + + I Sbjct: 999 NRASAQRIRSLVVQAPETKIQKNIIDELKALDDKINGQNAEIEKYENSIRTKFDQIFHL- 1057 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 E++ + R + N Sbjct: 1058 -----------EEFISDGVFSKYEGYSVEDLCIDGRGRVINQQY------IENHKGPYPV 1100 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + + + K + + + + Sbjct: 1101 YSSQTTNDGIFGSIDTFDFDGEYITWTTDGAKAGTVFYRNGKFNCTNVCGTLKAKNDKVN 1160 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + + K +G+ L + +K++ V VP + Q + + + Sbjct: 1161 MRYLAYLLNRIAYKFVSRVGN---NKLMNDAMKKIVVPVPTRQLQDEFADFVQSVEKSKF 1217 Query: 391 VLVEKIEQ-SIVL 402 + K E+ I Sbjct: 1218 ECIGKKEKFEIEK 1230 >gi|327403687|ref|YP_004344525.1| N-6 DNA methylase [Fluviicola taffensis DSM 16823] gi|327319195|gb|AEA43687.1| N-6 DNA methylase [Fluviicola taffensis DSM 16823] Length = 866 Score = 36.7 bits (83), Expect = 8.1, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 43/121 (35%), Gaps = 2/121 (1%) Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 LS +I +L +G+ + +E IV F+ Q K +L + Sbjct: 473 LSQNDIYLQLNENILGIDADEFEKPIEAYRDSIVVSFVGSQL-KPTLVPENELIVFNHNL 531 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLC-KVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFD 377 A + + + +LA ++ + ++ L + L V +P + Q D Sbjct: 532 AILQLNTALVLPEFLALELKEDYIETQLLEKRTGTTIPFLTIKGFCSLFVQIPSLAIQKD 591 Query: 378 I 378 I Sbjct: 592 I 592 >gi|229057164|ref|ZP_04196554.1| Peptidase, family M23/M37 [Bacillus cereus AH603] gi|228720170|gb|EEL71751.1| Peptidase, family M23/M37 [Bacillus cereus AH603] Length = 424 Score = 36.7 bits (83), Expect = 8.2, Method: Composition-based stats. Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 4/46 (8%) Query: 375 QFDITN---VINVETARIDVLVEKIEQSIVLLKER-RSSFIAAAVT 416 Q I I ID E I+Q + ++E+ R+S I +T Sbjct: 94 QQAIAEKKKHIEQLQTNIDTRQEVIKQRLQSMQEKPRTSIITEVLT 139 >gi|153815281|ref|ZP_01967949.1| hypothetical protein RUMTOR_01515 [Ruminococcus torques ATCC 27756] gi|145847343|gb|EDK24261.1| hypothetical protein RUMTOR_01515 [Ruminococcus torques ATCC 27756] Length = 1255 Score = 36.3 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 38/373 (10%), Positives = 93/373 (24%), Gaps = 32/373 (8%) Query: 33 FTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSR--QSDTSTVSIFAKGQILYGK 90 L+ +T + + E V G + + D + +++ +++G Sbjct: 903 VYSLSDEKTFDITIGDRVLSEEIVSGGRVPVYSANVYEEFGRIDKENMKDYSRPSVIWGI 962 Query: 91 LGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIEAICEGATMSHAD 150 G ++ I A + VL+ K + + + Sbjct: 963 DGDWMVNIIPAGVPFYPTDHCGVLRIKTE--------KILPEYMMYALQAEGEYERFSRN 1014 Query: 151 WKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEKKQALVSYIVTKG 210 + + A + I++ II E +D I + IE + + I Sbjct: 1015 NRASAQRIRSLVVQAPETKIQKNIIDELKALDDKINGQNAEIEKYENSIRTKFDQIFHL- 1073 Query: 211 LNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQKLET 270 E++ + R + N Sbjct: 1074 -----------EEFISDGVFSKYEGYSVEDLCIDGRGRVINQQY------IENHKGPYPV 1116 Query: 271 RNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITSAYMAVKPHGIDS 330 + + + + K + + + + Sbjct: 1117 YSSQTTNDGIFGSIDTFDFDGEYITWTTDGAKAGTVFYRNGKFNCTNVCGTLKAKNDKVN 1176 Query: 331 TYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNVINVETARID 390 + + K +G+ L + +K++ V VP + Q + + + Sbjct: 1177 MRYLAYLLNRIAYKFVSRVGN---NKLMNDAMKKIVVPVPTRQLQDEFADFVQSVEKSKF 1233 Query: 391 VLVEKIEQ-SIVL 402 + K E+ I Sbjct: 1234 ECIGKKEKFEIEK 1246 >gi|126334897|ref|XP_001375777.1| PREDICTED: similar to poly(A)-specific ribonuclease [Monodelphis domestica] Length = 638 Score = 36.3 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 20/150 (13%), Positives = 44/150 (29%), Gaps = 11/150 (7%) Query: 251 LIESNILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRS-LRSAQ 309 + L +G K ++ + +S+ Y P F S Sbjct: 59 KHSMDFLLFQFGLCTFKYDSTDSKYIMKSFNFYVFPKP----FSRTSPDVKFVCQSSSID 114 Query: 310 VMERGIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKFEDVKRLPVLV 369 + + + +R K + G+G + + PV++ Sbjct: 115 FLANQGFDFNKVFRNGIPYLNQEEERQLREQYDEKRSQSNGAGALSYISPNA-SKCPVII 173 Query: 370 PPIKEQFDITNVINVETARIDVLVEKIEQS 399 P ++Q I+ +I+ L++ E Sbjct: 174 P--EDQKK---FIDKVVEKIEDLIQNEENK 198 >gi|240948005|ref|ZP_04752423.1| N-6 DNA methylase [Actinobacillus minor NM305] gi|240297675|gb|EER48149.1| N-6 DNA methylase [Actinobacillus minor NM305] Length = 581 Score = 36.3 bits (82), Expect = 9.2, Method: Composition-based stats. Identities = 17/129 (13%), Positives = 37/129 (28%), Gaps = 9/129 (6%) Query: 256 ILSLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMER-- 313 I Y + K + L + ++ +I+ + + + Sbjct: 424 IPEFGYIDTASKENYIDNELMN--ILSPHLLQENDIIITVRGSSGKVGIVSKELLNKYQG 481 Query: 314 ----GIITSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLR-QSLKFEDVKRLPVL 368 G P +++ L +RS + SG L +D+K V Sbjct: 482 KVIIGQANLILRVKDPQNVNAIALLMQLRSELSQSRLQVLSSGAVLSGLSVKDLKEFLVA 541 Query: 369 VPPIKEQFD 377 +++Q Sbjct: 542 NFSLEKQKK 550 >gi|209554184|ref|YP_002284452.1| restriction modification enzyme subunit s2a [Ureaplasma urealyticum serovar 10 str. ATCC 33699] gi|209541685|gb|ACI59914.1| restriction modification enzyme subunit s2a [Ureaplasma urealyticum serovar 10 str. ATCC 33699] Length = 213 Score = 36.3 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 13/122 (10%), Positives = 36/122 (29%), Gaps = 4/122 (3%) Query: 266 QKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDK-RSLRSAQVMERGIITSAYMAVK 324 + + K Y ++ + I + + + +T+ Sbjct: 28 NQGIYPVISSKTTENGIYGFINRYDYEKNKITMSLIGENAGTFFWQEKNFSLTNNACVFI 87 Query: 325 PHGIDSTYLAWLMRSYDLCKVFYA---MGSGLRQSLKFEDVKRLPVLVPPIKEQFDITNV 381 + + +L + + + R + +K + V +P I+ Q I ++ Sbjct: 88 SNKNINYNYKYLFITLKKHEYKIKEFIVIGSARPMISSNHLKLVDVNLPSIEIQDAIISI 147 Query: 382 IN 383 I Sbjct: 148 IE 149 >gi|189463963|ref|ZP_03012748.1| hypothetical protein BACINT_00298 [Bacteroides intestinalis DSM 17393] gi|189438536|gb|EDV07521.1| hypothetical protein BACINT_00298 [Bacteroides intestinalis DSM 17393] Length = 566 Score = 36.3 bits (82), Expect = 9.4, Method: Composition-based stats. Identities = 36/339 (10%), Positives = 94/339 (27%), Gaps = 17/339 (5%) Query: 79 SIFAKGQILYGKLGPYLRKAIIADFDGICSTQFLVLQPKDVLPELLQGWLLSIDVTQRIE 138 +L LG A D D+ + + + + Sbjct: 137 KELKTKNLLLSSLGNIYFDAGYYDKSMEIYETMYECCTTDLEKSIALNNISTYYCLIDEK 196 Query: 139 AICEGATMSHADWKGIGNIPMPIPPLAEQVLIREKIIAETVRIDTLITERIRFIELLKEK 198 ++ + + + K L E++ EL +++ Sbjct: 197 DSTLMFQREALNYAIASG--DSLQIAMSKHNLSLKFDNFNELDSALYYEKMALKELPQKE 254 Query: 199 KQALVSYIVTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILS 258 + + L K KDS + ++ + ++ + + L + + + Sbjct: 255 NHGNCYFNLGDLLLKTGKNKDSALYYLTKALEDVPIESKASCLKSLYNLEKENGDYKTAN 314 Query: 259 LSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIITS 318 ++ +S E Q++ R + Q + + I Sbjct: 315 TYLEEHS--AIIDSLFYMEQSTEIQQLIYEYNTKMRVREEQLKGKRTIHRTIAGFVSICF 372 Query: 319 AYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLKF--EDVKRLPVLVPPIKEQF 376 + + I+ R + + + L ++ +++ + +Q Sbjct: 373 LIILAYQNYIN--------RKKRIQLQYKQSLEQTQNKLSSLETTIENNQLMI-TLLKQE 423 Query: 377 DITNVINVETARIDVLVEKIEQSIVLLKERRSSFIAAAV 415 N++ E + +E+ EQ+I LKE + + Sbjct: 424 Q--NILKQEHENKEQQIEEREQAIARLKEEKQQLLHWLF 460 >gi|296004694|ref|XP_966179.2| cell division cycle protein 48 homologue, putative [Plasmodium falciparum 3D7] gi|225631753|emb|CAG25009.2| cell division cycle protein 48 homologue, putative [Plasmodium falciparum 3D7] Length = 828 Score = 36.3 bits (82), Expect = 9.5, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 38/137 (27%), Gaps = 7/137 (5%) Query: 28 VPIKRFTKLNTGRTSESGKDIIYIGLEDVESGTGKYLPKDGNSRQSDTSTVSIFAKGQIL 87 V + + + GK I + ++D G K + + + KG + Sbjct: 97 VCLGDVVYVKSCPEIPYGKKIQVLPIDDTIEGLAKDTLFEIFLKPYFNESYRPVKKGDLF 156 Query: 88 YGKLGPYLRKAII----ADFDGICSTQ---FLVLQPKDVLPELLQGWLLSIDVTQRIEAI 140 + G + + D I S + P E + D+ + + Sbjct: 157 LVRGGFMSVEFKVVEVDPDDFCIVSPDTVIYYEGDPIKRDDEEKLDEIGYDDIGGCKKQL 216 Query: 141 CEGATMSHADWKGIGNI 157 + M + G Sbjct: 217 AQIREMIELPLRHPGLF 233 >gi|290993432|ref|XP_002679337.1| serine/threonine protein kinase [Naegleria gruberi] gi|284092953|gb|EFC46593.1| serine/threonine protein kinase [Naegleria gruberi] Length = 1839 Score = 36.3 bits (82), Expect = 9.5, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 72/221 (32%), Gaps = 22/221 (9%) Query: 207 VTKGLNPDVKMKDSGIEWVGLVPDHWEVKPFFALVTELNRKNTKLIESNILSLSYGNIIQ 266 + L D ++ + E V + +H E K + + L + Q Sbjct: 78 FSSILFADTPVRGTESEAVAKLTNHVENMKLELDTWRSLYKELQTKHTARLENIFREQKQ 137 Query: 267 KLETRNMGLKPESYETYQIVDPGEIVFR----------FIDLQNDKRSLRSAQVMERGII 316 + E + + + + V V + + + ++ E+ I Sbjct: 138 REEEIRQKYEEDKSLSQKGVSKASFVIMTDMKNLLYLFKHQNIENVYEMVNTEINEKDIA 197 Query: 317 TSAYMAVKPHGIDSTYLAWLMRSYDLCKVFYAMGSGLRQSLK--FEDVKRLPVLV----- 369 + ++ + T L + +S K L QSL+ E +++ + Sbjct: 198 SDQFICIYKEF--KTALFNIFKSISTEKHINKEHHTLVQSLRVEKEKLEKELFTLKQSVD 255 Query: 370 --PPIK-EQFDITNVINVETARIDVLVEKIEQSIVLLKERR 407 P + EQ I + ++I+ + +K + +KE + Sbjct: 256 IGPSHEHEQEKIHELHEEYASKIEEMEQKNYKLESQVKELK 296 >gi|58616448|ref|YP_195577.1| Type I restriction enzyme (modification subunit) [Azoarcus sp. EbN1] gi|56315910|emb|CAI10553.1| Type I restriction enzyme (modification subunit) [Aromatoleum aromaticum EbN1] Length = 594 Score = 36.3 bits (82), Expect = 9.5, Method: Composition-based stats. Identities = 15/118 (12%), Positives = 41/118 (34%), Gaps = 6/118 (5%) Query: 258 SLSYGNIIQKLETRNMGLKPESYETYQIVDPGEIVFRFIDLQNDKRSLRSAQVMERGIIT 317 + ++ + RN + + P +I+ K + + E + Sbjct: 430 EVGVPDLPEYGYIRNPQKRVRLEALKNALRPLDILISVKGSVG-KVGIVPPDLDETWVAG 488 Query: 318 SAYMAVKP----HGIDSTYLAWLMRSYDLCKVFYAMGSGL-RQSLKFEDVKRLPVLVP 370 + + ++ H +L +RS + SG ++ ++++L V++P Sbjct: 489 QSCLILRRRDGAHLHRPHFLLLYLRSAIGKASLERIASGTAVPLIQLRELRKLLVIIP 546 >gi|315618355|gb|EFU98943.1| hypothetical protein EC3431_1530 [Escherichia coli 3431] Length = 56 Score = 36.3 bits (82), Expect = 9.7, Method: Composition-based stats. Identities = 12/34 (35%), Positives = 15/34 (44%), Gaps = 3/34 (8%) Query: 10 YKDSGVQWIGAIPKHWKVVPIKRFTKLNTGRTSE 43 YK + V G IP+ W VP K NT + Sbjct: 22 YKLTEV---GVIPEDWDCVPFGNLFKTNTKKKKS 52 >gi|237750238|ref|ZP_04580718.1| predicted protein [Helicobacter bilis ATCC 43879] gi|229374132|gb|EEO24523.1| predicted protein [Helicobacter bilis ATCC 43879] Length = 146 Score = 36.3 bits (82), Expect = 9.7, Method: Composition-based stats. Identities = 9/89 (10%), Positives = 27/89 (30%), Gaps = 9/89 (10%) Query: 23 KHWKVVPIKRFTKLNTGRTSESGK----DIIYIGLEDVE-----SGTGKYLPKDGNSRQS 73 + W+ V + ++ + + + I + +++ + L D N Sbjct: 29 EQWQEVRLGEVAEITSSKRIFYSEYVEYGIPFYRSKEIIEFSKGNNPQNELFIDENKYND 88 Query: 74 DTSTVSIFAKGQILYGKLGPYLRKAIIAD 102 + + IL +G ++ Sbjct: 89 IANKFGVPQANDILLTSVGTLGIPYLVPK 117 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.314 0.124 0.337 Lambda K H 0.267 0.0377 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,742,565,311 Number of Sequences: 14124377 Number of extensions: 147019822 Number of successful extensions: 671933 Number of sequences better than 10.0: 6513 Number of HSP's better than 10.0 without gapping: 3725 Number of HSP's successfully gapped in prelim test: 2788 Number of HSP's that attempted gapping in prelim test: 650279 Number of HSP's gapped (non-prelim): 13934 length of query: 426 length of database: 4,842,793,630 effective HSP length: 142 effective length of query: 284 effective length of database: 2,837,132,096 effective search space: 805745515264 effective search space used: 805745515264 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 82 (36.3 bits)